BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 018958
         (348 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|255563110|ref|XP_002522559.1| cysteine protease, putative [Ricinus communis]
 gi|223538250|gb|EEF39859.1| cysteine protease, putative [Ricinus communis]
          Length = 344

 Score =  382 bits (980), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 185/338 (54%), Positives = 242/338 (71%), Gaps = 9/338 (2%)

Query: 18  MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
           + ++  LLV+  +    SRS HE S+   H+ WM Q+GR YK  +EKE R KIFKEN+E+
Sbjct: 9   LVLMAMLLVTLWASQSWSRSLHEASMELRHKTWMTQYGRVYKGNVEKEKRFKIFKENVEF 68

Query: 78  IEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRST-TSSTFKYQNLSMTDV 136
           IE  N  GN+ YKLG N F+DLTN+EFRA + GY M   SH+S+  + +F+Y+N+  T V
Sbjct: 69  IESFNNNGNKPYKLGINAFTDLTNEEFRASHNGYTMSMSSHQSSYRTKSFRYENV--TAV 126

Query: 137 PTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG- 195
           P SLDWR KGAVT IK+Q +CGCCWAF+AVAA+EGITK+ +G LI LSEQ+L+DC T+G 
Sbjct: 127 PPSLDWRTKGAVTHIKDQGQCGCCWAFSAVAAMEGITKLSTGTLISLSEQELVDCDTSGM 186

Query: 196 NNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKP-AAAKISNYEEVPSGDE 254
           + GC GG  + AF +II+N G+ TE  YPY+ V G+C+  +    AAKI+ YE VP+ DE
Sbjct: 187 DQGCEGGLMDDAFEFIIENNGLTTEANYPYEGVDGSCNTRKAANHAAKITGYENVPAYDE 246

Query: 255 QALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLI 314
           +AL KAV+ QPVS+AI A  + FQ Y  GIF G CGT+LDH VT+VG+GT++DG  YWL+
Sbjct: 247 EALRKAVANQPVSVAIDAGESAFQHYSSGIFTGDCGTELDHGVTVVGYGTSDDGTKYWLV 306

Query: 315 KNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPLA 348
           KNSWG +WG+ GY+++ RD    EGLCGI    SYP A
Sbjct: 307 KNSWGTSWGEDGYIRMERDIDAKEGLCGIAMEPSYPTA 344


>gi|224076970|ref|XP_002305073.1| predicted protein [Populus trichocarpa]
 gi|222848037|gb|EEE85584.1| predicted protein [Populus trichocarpa]
          Length = 340

 Score =  379 bits (973), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 181/342 (52%), Positives = 241/342 (70%), Gaps = 11/342 (3%)

Query: 12  KINTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIF 71
           K NT      + +L + A+++       ++ +++ HE+WMAQHGR Y D  EKE R  IF
Sbjct: 5   KCNTRIFLPFLLILAAWATKIACRPLDEQEYMLKRHEEWMAQHGRVYGDMKEKEKRYLIF 64

Query: 72  KENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNL 131
           KEN+E IE  N   +R YKLG N+F+DLTN+EFRA+Y GYK  S       SS+F+Y+NL
Sbjct: 65  KENIERIEAFNNGSDRGYKLGVNKFADLTNEEFRAMYHGYKRQSSK---LMSSSFRYENL 121

Query: 132 SMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDC 191
           S  D+PTS+DWR+ GAVTP+K+Q  CGCCWAF+ VAA+EGI K+++GNLI LSEQQL+DC
Sbjct: 122 S--DIPTSMDWRNDGAVTPVKDQGTCGCCWAFSTVAAIEGIIKLQTGNLISLSEQQLVDC 179

Query: 192 STNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAA-AKISNYEEVP 250
            T GN GC GG  + AF YII+N G+ +ED YPYQ V GTCS+ +  +  A+I+ YE+VP
Sbjct: 180 -TAGNKGCQGGLMDTAFQYIIRNGGLTSEDNYPYQGVDGTCSSEKAASTEAQITGYEDVP 238

Query: 251 SGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGAN 310
             +E ALL+AV+ QPVS+ +     +FQ YK G+FNG CGTQ +HAVT +G+GT  DG +
Sbjct: 239 QNNENALLQAVAKQPVSVGVDGGGNDFQFYKSGVFNGDCGTQQNHAVTAIGYGTDIDGTD 298

Query: 311 YWLIKNSWGNTWGDAGYMKIVR----DEGLCGIGTRSSYPLA 348
           YWL+KNSWG +WG+ GYM++ R     EGLCG+   +SYP A
Sbjct: 299 YWLVKNSWGTSWGENGYMRMRRGIGSSEGLCGVAMDASYPTA 340


>gi|255564908|ref|XP_002523447.1| cysteine protease, putative [Ricinus communis]
 gi|223537275|gb|EEF38906.1| cysteine protease, putative [Ricinus communis]
          Length = 342

 Score =  372 bits (954), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 184/337 (54%), Positives = 238/337 (70%), Gaps = 10/337 (2%)

Query: 18  MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
           MF+ + ++   ASQ  S RS H+ ++ E HE WMA++GR YKD  EKE R +IF+ N+E+
Sbjct: 10  MFVALLVVGLWASQAWS-RSLHDAAMNERHEMWMAKYGRVYKDNSEKERRFEIFRNNVEF 68

Query: 78  IEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVP 137
           IE  NK GNR YKL  N+F+DLTN+EF+    GYK  S     T  S+F+Y N+  T VP
Sbjct: 69  IESFNKLGNRPYKLDINEFADLTNEEFKVSKNGYKRSSGVGL-TEKSSFRYANV--TAVP 125

Query: 138 TSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-N 196
           TS+DWR  GAVTPIK+Q +CGCCWAF+AVAA+EGITK+ +G LI LSEQ+L+DC T+G +
Sbjct: 126 TSMDWRQNGAVTPIKDQGQCGCCWAFSAVAAMEGITKLSTGKLISLSEQELVDCDTSGED 185

Query: 197 NGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKP-AAAKISNYEEVPSGDEQ 255
            GC GG  + AF +I QN G+ TE  YPYQ   GTC+  +    AAKI+ YE+VP+  E 
Sbjct: 186 QGCEGGLMDDAFEFIKQNGGLTTEANYPYQGTDGTCNTNKAGNDAAKITGYEDVPANSED 245

Query: 256 ALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIK 315
           ALLKAV+ QPVS+AI A  + FQ Y  G+F G CGT+LDH VT VG+GT++DG  YWL+K
Sbjct: 246 ALLKAVASQPVSVAIDASGSAFQFYSGGVFTGDCGTELDHGVTAVGYGTSDDGTKYWLVK 305

Query: 316 NSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPLA 348
           NSWG +WG+ GY+++ RD    EGLCGI  + SYP A
Sbjct: 306 NSWGTSWGEDGYIRMERDIEAKEGLCGIAMQPSYPTA 342


>gi|255564910|ref|XP_002523448.1| cysteine protease, putative [Ricinus communis]
 gi|223537276|gb|EEF38907.1| cysteine protease, putative [Ricinus communis]
          Length = 341

 Score =  369 bits (948), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 186/337 (55%), Positives = 238/337 (70%), Gaps = 11/337 (3%)

Query: 18  MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
           MF+ + ++    SQ  S RS H+ ++ E HE WM ++GR YKD  EKE R +IF+ N+E+
Sbjct: 10  MFVALLVVGLWVSQAWS-RSLHDAAMNERHEMWMVKYGRVYKDNSEKERRFEIFRNNVEF 68

Query: 78  IEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVP 137
           IE  NK GNR YKL  N+F+DLTN+EF+A   GYK  S    S  SS F+Y N+  T VP
Sbjct: 69  IESFNKPGNRPYKLDINEFADLTNEEFKASRNGYKRSSNVGLSEKSS-FRYGNV--TAVP 125

Query: 138 TSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-N 196
           TS+DWR KGAVTPIK+Q +CGCCWAF+AVAA+EGITK+ +G LI LSEQ+L+DC T+G +
Sbjct: 126 TSMDWRQKGAVTPIKDQGQCGCCWAFSAVAAMEGITKLSTGKLISLSEQELVDCDTSGED 185

Query: 197 NGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKP-AAAKISNYEEVPSGDEQ 255
            GC GG  + AF +I QN G+ TE  YPYQ   GTC+  +    AAKI+ YE+VP+  E 
Sbjct: 186 QGCEGGLMDDAFEFIKQNGGLTTEANYPYQGTDGTCNTNKAGNDAAKITGYEDVPANSED 245

Query: 256 ALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIK 315
           ALLKAV+ QPVS+AI A  + FQ Y  G+F G CGT+LDH VT VG+GT+ DG  YWL+K
Sbjct: 246 ALLKAVASQPVSVAIDASGSAFQFYSGGVFTGDCGTELDHGVTAVGYGTS-DGTKYWLVK 304

Query: 316 NSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPLA 348
           NSWG +WG+ GY+++ RD    EGLCGI  +SSYP A
Sbjct: 305 NSWGTSWGEDGYIRMERDIEAKEGLCGIAMQSSYPTA 341


>gi|124484401|dbj|BAF46311.1| cysteine proteinase precursor [Ipomoea nil]
          Length = 339

 Score =  369 bits (948), Expect = e-99,   Method: Compositional matrix adjust.
 Identities = 176/341 (51%), Positives = 241/341 (70%), Gaps = 11/341 (3%)

Query: 14  NTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKE 73
           N+  + I + L+ + ++ + +SR+  +  +   HE+WMAQ+GR YK+E+EK  R  IFKE
Sbjct: 4   NSLKLLIALALVFATSAYLATSRTLLDSLMAVRHEQWMAQYGRVYKNEVEKTKRYNIFKE 63

Query: 74  NLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSM 133
           N+EYIE  NK G + YKLG N F+DLTN EF A   GY +P   H  ++++ F+Y+N+S 
Sbjct: 64  NVEYIESFNKAGTKPYKLGINAFADLTNKEFIASRNGYILP---HECSSNTPFRYENVSA 120

Query: 134 TDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCST 193
             VPT++DWR KGAVTP+K+Q +CGCCWAF+AVAA+EGITK+ +GNLI LSEQ+L+DC  
Sbjct: 121 --VPTTVDWRKKGAVTPVKDQGQCGCCWAFSAVAAMEGITKLSTGNLISLSEQELVDCDV 178

Query: 194 NG-NNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAK-ISNYEEVPS 251
            G + GC GG  + AF +II N+G+ TE  YPYQ   G+C  ++   +A  IS YE+VP+
Sbjct: 179 KGIDQGCEGGLMDDAFTFIINNKGLTTESNYPYQGTDGSCKKSKSSNSAAKISGYEDVPA 238

Query: 252 GDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANY 311
             E AL KAV+ QPVS+AI A  ++FQ Y  G+F G CGT+LDH VT VG+G  EDG+ Y
Sbjct: 239 NSESALEKAVANQPVSVAIDAGGSDFQFYSSGVFTGECGTELDHGVTAVGYGIAEDGSKY 298

Query: 312 WLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPLA 348
           WL+KNSWG +WG+ GY+++ +D    EGLCGI  +SSYP A
Sbjct: 299 WLVKNSWGTSWGEKGYIRMQKDIEAKEGLCGIAMQSSYPSA 339


>gi|225443827|ref|XP_002274223.1| PREDICTED: vignain-like [Vitis vinifera]
          Length = 340

 Score =  369 bits (946), Expect = 2e-99,   Method: Compositional matrix adjust.
 Identities = 184/338 (54%), Positives = 240/338 (71%), Gaps = 12/338 (3%)

Query: 19  FIIITLLVS--CASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLE 76
            I ITLL+    ASQ +S R+ HE S+ E HE WM  +GR+YKD  EKE R KIFKEN+E
Sbjct: 7   IICITLLIMGVWASQALS-RTLHEVSMSERHEDWMGLYGRTYKDIAEKERRFKIFKENVE 65

Query: 77  YIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDV 136
           YIE  N  GNR YKL  N+F+D TN+EF+A   GY M S   RS+  ++F+Y+N++   V
Sbjct: 66  YIESVNSAGNRRYKLSINEFADQTNEEFKASRNGYNMSSRP-RSSEITSFRYENVAA--V 122

Query: 137 PTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG- 195
           P+S+DWR KGAVTPIK+Q +CGCCWAF+AVAA+EG+T++++G LI LSEQ+L+DC T+G 
Sbjct: 123 PSSMDWRKKGAVTPIKDQGQCGCCWAFSAVAAMEGVTQLKTGELISLSEQELVDCDTSGE 182

Query: 196 NNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAK-ISNYEEVPSGDE 254
           + GC GG  + AF +II N G+ TE  YPY+ V  TC+  +  ++A  I NYE+VP+  E
Sbjct: 183 DQGCGGGLMDSAFEFIIGNGGLTTEANYPYKGVDATCNKKKAASSAAKIKNYEDVPANSE 242

Query: 255 QALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLI 314
            ALLKAV+  PVS+AI A  ++FQ Y  G+F G CGT+LDH VT VG+G T+DG  YWL+
Sbjct: 243 AALLKAVAQHPVSVAIDAGGSDFQFYSSGVFTGQCGTELDHGVTAVGYGKTDDGTKYWLV 302

Query: 315 KNSWGNTWGDAGYMKIVR----DEGLCGIGTRSSYPLA 348
           KNSWG  WG+ GY+ + R    DEGLCGI   +SYP A
Sbjct: 303 KNSWGTGWGEDGYIWMERDIGADEGLCGIAMEASYPTA 340


>gi|24285904|gb|AAL14199.1| cysteine proteinase precursor [Ipomoea batatas]
 gi|56961686|gb|AAK15148.2| cysteine proteinase-like protein [Ipomoea batatas]
          Length = 341

 Score =  368 bits (945), Expect = 2e-99,   Method: Compositional matrix adjust.
 Identities = 174/323 (53%), Positives = 235/323 (72%), Gaps = 11/323 (3%)

Query: 32  VVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKL 91
           + +SR+  +  +V  HE+WMAQ+GR Y++E+EK  R  IFKEN+EYIE  NK G + YKL
Sbjct: 24  LATSRTLSDSLMVVRHEQWMAQYGRVYENEVEKTKRFNIFKENVEYIESFNKAGTKPYKL 83

Query: 92  GTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPI 151
           G N F+DLTN EF+A   GYK+P   H  ++++ F+Y+N+S   VPT++DWR KGAVTP+
Sbjct: 84  GINAFADLTNQEFKASRNGYKLP---HDCSSNTPFRYENVS--SVPTTVDWRTKGAVTPV 138

Query: 152 KNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-NNGCLGGSREKAFAY 210
           K+Q +CGCCWAF+AVAA+EGITK+ +GNLI LSEQ+L+DC   G + GC GG  + AF++
Sbjct: 139 KDQGQCGCCWAFSAVAAMEGITKLSTGNLISLSEQELVDCDVKGIDQGCEGGLMDDAFSF 198

Query: 211 IIQNQGIATEDEYPYQAVPGTCSAAQKPAAAK-ISNYEEVPSGDEQALLKAVSMQPVSIA 269
           II N+G+ TE  YPYQ   G+C  ++   +A  IS YE+VP+  E AL KAV+ QPVS+A
Sbjct: 199 IINNKGLTTESNYPYQGTDGSCKKSKSSNSAAKISGYEDVPANSESALEKAVANQPVSVA 258

Query: 270 IAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMK 329
           I A  ++FQ Y  G+F G CGT+LDH VT VG+G  EDG+ YWL+KNSWG +WG+ GY++
Sbjct: 259 IDAGGSDFQFYSSGVFTGECGTELDHGVTAVGYGIAEDGSKYWLVKNSWGTSWGEKGYIR 318

Query: 330 IVRD----EGLCGIGTRSSYPLA 348
           + +D    EGLCGI  +SSYP A
Sbjct: 319 MQKDIEAKEGLCGIAMQSSYPSA 341


>gi|409190991|gb|AFV30165.1| cysteine proteinase [Lotus japonicus]
          Length = 342

 Score =  368 bits (945), Expect = 2e-99,   Method: Compositional matrix adjust.
 Identities = 180/338 (53%), Positives = 241/338 (71%), Gaps = 15/338 (4%)

Query: 19  FIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYI 78
           F ++  L   A QV SSR+  + S+ E HE+WMA++GR YKD  EKE R  IFKEN+ YI
Sbjct: 12  FALVLCLGLWAFQV-SSRTLQDASMQERHEQWMARYGRVYKDLQEKEKRFSIFKENVNYI 70

Query: 79  EKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYK--MPSPSHRSTTSSTFKYQNLSMTDV 136
           E +N  G++ YKLG NQF+DLTN+EF A    +K  M S   R+TT   FKY+N++    
Sbjct: 71  EASNNAGDKPYKLGVNQFADLTNEEFIATRNKFKGHMSSSITRTTT---FKYENVT---A 124

Query: 137 PTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG- 195
           P+++DWR +GAVTP+KNQ  CGCCWAF+AVAA EGI K+ +GNL+ LSEQ+L+DC T+G 
Sbjct: 125 PSTVDWRQEGAVTPVKNQGTCGCCWAFSAVAATEGIHKLSTGNLVSLSEQELVDCDTSGA 184

Query: 196 NNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDE 254
           + GC GG  + AF +IIQN G+ TE +YPYQ V GTC+  ++    A I+ YE+VPS +E
Sbjct: 185 DQGCQGGLMDDAFKFIIQNGGLNTEAQYPYQGVDGTCNTNEEATHVATITGYEDVPSNNE 244

Query: 255 QALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLI 314
           QAL +AV+ QP+SIAI A  ++FQ+Y+ G+F G CGTQLDH V +VG+G ++DG  YWL+
Sbjct: 245 QALQQAVANQPISIAIDASGSDFQNYQSGVFTGSCGTQLDHGVAVVGYGVSDDGTKYWLV 304

Query: 315 KNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPLA 348
           KNSWG  WG+ GY+++ RD    EGLCG+  + SYP A
Sbjct: 305 KNSWGADWGEEGYIRMQRDVDAPEGLCGLAMQPSYPTA 342


>gi|13491750|gb|AAK27968.1|AF242372_1 cysteine protease [Ipomoea batatas]
          Length = 339

 Score =  368 bits (944), Expect = 2e-99,   Method: Compositional matrix adjust.
 Identities = 175/323 (54%), Positives = 233/323 (72%), Gaps = 11/323 (3%)

Query: 32  VVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKL 91
           + +SR+  +  +V  HE+WMAQ+GR YK E EK  R  IFKEN+EYIE  NK G + YKL
Sbjct: 22  LATSRTLSDSLMVVRHEQWMAQYGRVYKTEAEKTKRFNIFKENVEYIESFNKAGTKPYKL 81

Query: 92  GTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPI 151
           G N F+DLTN EF+A   GYK+P   H  ++++ F+Y+N+S   VPT++DWR KGAVTP+
Sbjct: 82  GINAFADLTNQEFKASRNGYKLP---HDCSSNTPFRYENVS--SVPTTVDWRTKGAVTPV 136

Query: 152 KNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-NNGCLGGSREKAFAY 210
           K+Q +CGCCWAF+AVAA+EGITK+ +GNLI LSEQ+L+DC   G + GC GG  + AF++
Sbjct: 137 KDQGQCGCCWAFSAVAAMEGITKLSTGNLISLSEQELVDCDVKGTDQGCEGGLMDDAFSF 196

Query: 211 IIQNQGIATEDEYPYQAVPGTCSAAQKPAAAK-ISNYEEVPSGDEQALLKAVSMQPVSIA 269
           II N+G+ TE  YPYQ   G+C  ++   +A  IS YE+VP+  E AL KAV+ QPVS+A
Sbjct: 197 IINNKGLTTESNYPYQGTDGSCKKSKSSNSAAKISGYEDVPANSESALEKAVANQPVSVA 256

Query: 270 IAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMK 329
           I A  ++FQ Y  G+F G CGT+LDH VT VG+G  EDG+ YWL+KNSWG +WG+ GY++
Sbjct: 257 IDAGGSDFQFYSSGVFTGECGTELDHGVTAVGYGIAEDGSKYWLVKNSWGTSWGEKGYIR 316

Query: 330 IVRD----EGLCGIGTRSSYPLA 348
           + +D    EGLCGI  +SSYP A
Sbjct: 317 MQKDIEAKEGLCGIAMQSSYPSA 339


>gi|224076968|ref|XP_002305072.1| predicted protein [Populus trichocarpa]
 gi|222848036|gb|EEE85583.1| predicted protein [Populus trichocarpa]
          Length = 305

 Score =  367 bits (942), Expect = 5e-99,   Method: Compositional matrix adjust.
 Identities = 172/309 (55%), Positives = 226/309 (73%), Gaps = 11/309 (3%)

Query: 43  VVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTND 102
           +++ HE+WMAQHGR Y D  EKE R  IFKEN+E IE  N   +R YKLG N+F+DLTN+
Sbjct: 1   MLKRHEEWMAQHGRVYGDMKEKEKRYLIFKENIERIEAFNNGSDRGYKLGVNKFADLTNE 60

Query: 103 EFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWA 162
           EFRA+Y GYK  S       SS+F+Y+NLS  D+PTS+DWR+ GAVTP+K+Q  CGCCWA
Sbjct: 61  EFRAMYHGYKRQSSK---LMSSSFRYENLS--DIPTSMDWRNDGAVTPVKDQGTCGCCWA 115

Query: 163 FAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDE 222
           F+ VAA+EGI K+++GNLI LSEQQL+DC T GN GC GG  + AF YII+N G+ +ED 
Sbjct: 116 FSTVAAIEGIIKLQTGNLISLSEQQLVDC-TAGNKGCQGGLMDTAFQYIIRNGGLTSEDN 174

Query: 223 YPYQAVPGTCSAAQKPAA-AKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYK 281
           YPYQ V GTCS+ +  +  A+I+ YE+VP  +E ALL+AV+ QPVS+A+     +F+ YK
Sbjct: 175 YPYQGVDGTCSSEKAASTEAQITGYEDVPQNNENALLQAVAKQPVSVAVDGGGNDFRFYK 234

Query: 282 EGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR----DEGLC 337
            G+F G CGT L+H VT +G+GT  DG +YWL+KNSWG +WG++GY ++ R     EGLC
Sbjct: 235 SGVFEGDCGTNLNHGVTAIGYGTDSDGTDYWLVKNSWGTSWGESGYTRMQRGIGASEGLC 294

Query: 338 GIGTRSSYP 346
           G+   +SYP
Sbjct: 295 GVAMDASYP 303


>gi|535454|gb|AAA50755.1| cysteine proteinase [Alnus glutinosa]
          Length = 340

 Score =  367 bits (941), Expect = 6e-99,   Method: Compositional matrix adjust.
 Identities = 178/337 (52%), Positives = 242/337 (71%), Gaps = 13/337 (3%)

Query: 19  FIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYI 78
            +++  L + ASQ+ ++RS  + S+ E HE+WMA +GR YKD  EK+ R KIF+EN+  I
Sbjct: 10  LVVMVTLGALASQLAAARSLQDASMRERHEEWMASYGRVYKDINEKQKRYKIFEENVALI 69

Query: 79  EKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHR-STTSSTFKYQNLSMTDVP 137
           E +NK+ N+ YKL  NQF+DLTN+EF+A    +K     H  ST S++FKY N+S   VP
Sbjct: 70  ESSNKDANKPYKLSVNQFADLTNEEFKASRNRFK----GHICSTKSTSFKYGNVSA--VP 123

Query: 138 TSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-N 196
           +++DWR KGAVTP+K+Q +CGCCWAF+AVAA EGITK+ +G LI LSEQ+L+DC T+G +
Sbjct: 124 SAMDWRMKGAVTPVKDQGQCGCCWAFSAVAATEGITKLTTGELISLSEQELVDCDTSGVD 183

Query: 197 NGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSA-AQKPAAAKISNYEEVPSGDEQ 255
            GC GG  + AF +I  N G+A+E  YPY+ V GTC+   Q   AA+I+ +E+VP+  E+
Sbjct: 184 QGCEGGLMDNAFTFIQHNHGLASEANYPYKGVDGTCNTNKQAIHAAEINGFEDVPANSEE 243

Query: 256 ALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIK 315
           ALL AV+ QPVS+AI A  + FQ Y +G+F G CGTQLDH VT VG+GT++DG  YWL+K
Sbjct: 244 ALLNAVAHQPVSVAIDAGGSGFQFYSKGVFIGACGTQLDHGVTAVGYGTSDDGTKYWLVK 303

Query: 316 NSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPLA 348
           NSWG  WG+ GY+++ RD    EGLCGI  ++SYP A
Sbjct: 304 NSWGTQWGEEGYIRMQRDVDAKEGLCGIAMKASYPTA 340


>gi|144905104|dbj|BAF56427.1| cysteine proteinase [Lotus japonicus]
          Length = 342

 Score =  366 bits (940), Expect = 7e-99,   Method: Compositional matrix adjust.
 Identities = 179/338 (52%), Positives = 241/338 (71%), Gaps = 15/338 (4%)

Query: 19  FIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYI 78
           F ++  L   A QV SSR+  + S+ E HE+WMA++G+ YKD  EKE R  IF+EN++YI
Sbjct: 12  FALVLCLGLWAFQV-SSRTLQDASMHERHEQWMARYGKVYKDLQEKEKRFNIFQENVKYI 70

Query: 79  EKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYK--MPSPSHRSTTSSTFKYQNLSMTDV 136
           E +N  GN+ YKLG NQF+DLTN EF A    +K  M S   R+TT   FKY+N++    
Sbjct: 71  EASNNAGNKPYKLGVNQFTDLTNKEFIATRNKFKGHMSSSITRTTT---FKYENVT---A 124

Query: 137 PTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG- 195
           P+++DWR +GAVTP+KNQ  CGCCWAF+AVAA EGI K+ +GNL+ LSEQ+L+DC T+G 
Sbjct: 125 PSTVDWRQEGAVTPVKNQGTCGCCWAFSAVAATEGIHKLSTGNLVSLSEQELVDCDTSGA 184

Query: 196 NNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDE 254
           + GC GG  + AF +IIQN G+ TE +YPYQ V GTC+  ++    A I+ YE+VPS +E
Sbjct: 185 DQGCQGGLMDDAFKFIIQNGGLNTEAQYPYQGVDGTCNTNEEVTHVATITGYEDVPSNNE 244

Query: 255 QALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLI 314
           QAL +AV+ QP+S+AI A  ++FQ+Y+ G+F G CGTQLDH V +VG+G ++DG  YWL+
Sbjct: 245 QALQQAVANQPISVAIDASGSDFQNYQSGVFTGSCGTQLDHGVAVVGYGVSDDGTKYWLV 304

Query: 315 KNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPLA 348
           KNSWG  WG+ GY+++ RD    EGLCGI  + SYP A
Sbjct: 305 KNSWGEDWGEEGYIRMQRDVEAPEGLCGIAMQPSYPTA 342


>gi|224114698|ref|XP_002316833.1| predicted protein [Populus trichocarpa]
 gi|222859898|gb|EEE97445.1| predicted protein [Populus trichocarpa]
          Length = 305

 Score =  366 bits (940), Expect = 8e-99,   Method: Compositional matrix adjust.
 Identities = 174/310 (56%), Positives = 228/310 (73%), Gaps = 10/310 (3%)

Query: 44  VEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDE 103
           +E HE WMAQ+GR+YK  +EKE RL IFK N+E+IE  NK G + YKL  N+F+DLTN+E
Sbjct: 1   MERHETWMAQYGRAYKGHVEKERRLNIFKNNVEFIESFNKVGKKPYKLSVNEFADLTNEE 60

Query: 104 FRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAF 163
           F+A   GYKM S    S+++  F+Y+N+S   VP+++DWR KGAVTPIK+Q +CGCCWAF
Sbjct: 61  FQASRNGYKM-SAHLSSSSTKPFRYENVSA--VPSTMDWRKKGAVTPIKDQGQCGCCWAF 117

Query: 164 AAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-NNGCLGGSREKAFAYIIQNQGIATEDE 222
           +AVAA EGIT++ +G LI LSEQ+L+DC T+G + GC GG  + AF +IIQN+G+ TE  
Sbjct: 118 SAVAATEGITQLSTGKLISLSEQELVDCDTSGEDQGCNGGLMDDAFDFIIQNKGLTTEAN 177

Query: 223 YPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKE 282
           YPYQ   G C++ +  AAAKI+ YE+VP+  E ALLKAV+ QPVS+AI A  + FQ Y  
Sbjct: 178 YPYQGADGACNSGK--AAAKITGYEDVPANSEAALLKAVANQPVSVAIDAGGSAFQFYSS 235

Query: 283 GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCG 338
           G+F G CGT LDH VT VG+G ++DG  YWL+KNSWG +WG+ GY+++ RD    EGLCG
Sbjct: 236 GVFTGDCGTDLDHGVTAVGYGMSDDGTKYWLVKNSWGTSWGENGYIRMERDIDAQEGLCG 295

Query: 339 IGTRSSYPLA 348
           I   +SYP A
Sbjct: 296 IAMEASYPTA 305


>gi|297819568|ref|XP_002877667.1| hypothetical protein ARALYDRAFT_348033 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297323505|gb|EFH53926.1| hypothetical protein ARALYDRAFT_348033 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 341

 Score =  365 bits (936), Expect = 2e-98,   Method: Compositional matrix adjust.
 Identities = 181/344 (52%), Positives = 241/344 (70%), Gaps = 14/344 (4%)

Query: 15  TTPMFIIITLLVSCASQVVSSRS-THEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKE 73
           T+ +F ++ +++S  +   +SR    E S +E HE+WM++  R Y D+ EK  R +IFK+
Sbjct: 2   TSIIFFLLAIILSSRTSGATSRGGLFEASAIEKHEQWMSRFHRVYSDDSEKTSRFEIFKK 61

Query: 74  NLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHR-STTSS----TFKY 128
           NL+++E  N   N+TY L  N+FSDLT++EF+A YTG  +P    R STT S    +F+Y
Sbjct: 62  NLKFVESFNMNTNKTYTLDVNEFSDLTDEEFKARYTGLVVPEGMTRMSTTDSHETVSFRY 121

Query: 129 QNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQL 188
           +N+  T    S+DWR++GAVT +K+Q++CGCCWAF+AVAAVEG+TKI  G L+ LSEQQL
Sbjct: 122 ENVGET--GESMDWREEGAVTSVKHQQQCGCCWAFSAVAAVEGMTKIAKGELVSLSEQQL 179

Query: 189 LDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEE 248
           LDCST  N+GC GG   KAF YI++NQGI  ED YPYQ    TC +    AAA IS YE 
Sbjct: 180 LDCSTE-NDGCDGGIMWKAFDYIVENQGITAEDNYPYQGAQQTCES-NHVAAATISGYET 237

Query: 249 VPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDG 308
           VP  DE+ALLKAVS QPVS+AI     EF  Y  GIFNG CGT L+HAVTIVG+G +E+G
Sbjct: 238 VPQNDEEALLKAVSQQPVSVAIEGSGYEFIHYSGGIFNGECGTHLNHAVTIVGYGVSEEG 297

Query: 309 ANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPLA 348
             YWL+KNSWG +WG+ GYM+I+RD    +G+CG+ + + YP+A
Sbjct: 298 IKYWLLKNSWGESWGEDGYMRIMRDVDAPQGMCGLASLAYYPVA 341


>gi|318136892|gb|ADV41672.1| cysteine protease [Nicotiana tabacum]
          Length = 349

 Score =  364 bits (934), Expect = 4e-98,   Method: Compositional matrix adjust.
 Identities = 174/338 (51%), Positives = 236/338 (69%), Gaps = 7/338 (2%)

Query: 18  MFIIITLLVSCASQVVSSRS-THEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLE 76
           + +    L    SQV SSR   +E S+   H++W+A H + YKD  EKEMR KIFKEN+E
Sbjct: 12  LALFFIFLGVWRSQVASSRPINYEASMRARHDQWIAHHDKVYKDLNEKEMRFKIFKENVE 71

Query: 77  YIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDV 136
            IE  N   ++ YKLG N+FSDLTN++FR L+TGYK   P   S++     ++  ++TD+
Sbjct: 72  RIEAFNAGEDKGYKLGVNKFSDLTNEKFRVLHTGYKRSHPKVMSSSKPKTHFRYANVTDI 131

Query: 137 PTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG- 195
           P ++DWR KGAVTPIK+QKECGCCWAF+AVAA EG+ ++++G LI LSEQ+L+DC   G 
Sbjct: 132 PPTMDWRKKGAVTPIKDQKECGCCWAFSAVAATEGLHQLKTGKLIPLSEQELVDCDVEGE 191

Query: 196 NNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKP-AAAKISNYEEVPSGDE 254
           + GC GG  + AF +I++N+G+ TE  YPY+   G C+  +   +AAKI+ YE+VP+  E
Sbjct: 192 DEGCSGGLLDTAFDFILKNKGLTTEANYPYKGEDGVCNKKKSALSAAKIAGYEDVPANSE 251

Query: 255 QALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLI 314
           +ALL+AV+ QPVS+AI   S +FQ Y  G+F+G C T L+HAVT VG+G T DG  YW+I
Sbjct: 252 KALLQAVANQPVSVAIDGSSFDFQFYSSGVFSGSCSTWLNHAVTAVGYGATTDGTKYWII 311

Query: 315 KNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPLA 348
           KNSWG+ WGD+GYM+I RD    EGLCG+   +SYP A
Sbjct: 312 KNSWGSKWGDSGYMRIKRDVHEKEGLCGLAMDASYPTA 349


>gi|37780051|gb|AAP32198.1| cysteine protease 12 [Trifolium repens]
          Length = 343

 Score =  364 bits (934), Expect = 4e-98,   Method: Compositional matrix adjust.
 Identities = 178/325 (54%), Positives = 233/325 (71%), Gaps = 15/325 (4%)

Query: 33  VSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRT-YKL 91
           V+SR+  + S+ E HE+WM  +G+ YK+  E+E RL+IF ENL+YIE +N  GN+  YKL
Sbjct: 25  VTSRTLQDDSIFERHEQWMTHYGKVYKNPQEREKRLRIFTENLKYIEASNNAGNKKPYKL 84

Query: 92  GTNQFSDLTNDEFRALYTGYK--MPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVT 149
           G NQF+DLTN+EF A    +K  M S   R+TT   FKY+N   T VP+++DWR KGAVT
Sbjct: 85  GINQFADLTNEEFIASRNKFKGHMCSSIIRTTT---FKYEN---TSVPSTVDWRKKGAVT 138

Query: 150 PIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-NNGCLGGSREKAF 208
           P+KNQ +CGCCWAF+A+AA EGI KI +G L+ LSEQ+L+DC TNG + GC GG  + AF
Sbjct: 139 PVKNQGQCGCCWAFSAIAATEGIHKISTGKLVSLSEQELVDCDTNGVDQGCEGGLMDDAF 198

Query: 209 AYIIQNQGIATEDEYPYQAVPGTCSAAQ-KPAAAKISNYEEVPSGDEQALLKAVSMQPVS 267
            +IIQN GI+TE  YPYQ V GTC A +   +AA I+ YE+VP+ +E AL KAV+ QP+S
Sbjct: 199 KFIIQNNGISTEAGYPYQGVDGTCKANEASTSAATITGYEDVPANNENALQKAVANQPIS 258

Query: 268 IAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGY 327
           +AI A  ++FQ YK G+F G CGT+LDH VT VG+G + DG  YWL+KNSWG  WG+ GY
Sbjct: 259 VAIDASGSDFQFYKSGVFTGSCGTELDHGVTAVGYGISNDGTKYWLVKNSWGTDWGEEGY 318

Query: 328 MKIVRD----EGLCGIGTRSSYPLA 348
           +++ R     EGLCGI  ++SYP A
Sbjct: 319 IRMQRSIDAAEGLCGIAMQASYPTA 343


>gi|18408828|ref|NP_566920.1| putative cysteine proteinase [Arabidopsis thaliana]
 gi|12324451|gb|AAG52191.1|AC012329_18 putative cysteine proteinase; 15366-14136 [Arabidopsis thaliana]
 gi|6723404|emb|CAB66413.1| cysteine protease-like protein [Arabidopsis thaliana]
 gi|332645009|gb|AEE78530.1| putative cysteine proteinase [Arabidopsis thaliana]
          Length = 341

 Score =  363 bits (933), Expect = 4e-98,   Method: Compositional matrix adjust.
 Identities = 185/344 (53%), Positives = 239/344 (69%), Gaps = 14/344 (4%)

Query: 15  TTPMFIIITLLVSCASQVVSSRS-THEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKE 73
           T+ +F ++ +L+S  +  V+SR    E S VE HE+WM++  R Y D+ EK  R +IF  
Sbjct: 2   TSIVFFLLAILLSSRTSGVTSRGGLFEASAVEKHEQWMSRFNRVYSDDSEKTSRFEIFTN 61

Query: 74  NLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHR-STTSS----TFKY 128
           NL+++E  N   N+TY L  N+FSDLT++EF+A YTG  +P    R STT S    +F+Y
Sbjct: 62  NLKFVESINMNTNKTYTLDVNEFSDLTDEEFKARYTGLVVPEGMTRISTTDSHETVSFRY 121

Query: 129 QNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQL 188
           +N+  T    S+DW  +GAVT +K+Q++CGCCWAF+AVAAVEG+TKI +G L+ LSEQQL
Sbjct: 122 ENVGETG--ESMDWIQEGAVTSVKHQQQCGCCWAFSAVAAVEGMTKIANGELVSLSEQQL 179

Query: 189 LDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEE 248
           LDCST  NNGC GG   KAF YI +NQGI TED YPYQ    TC +    AAA IS YE 
Sbjct: 180 LDCSTE-NNGCGGGIMWKAFDYIKENQGITTEDNYPYQGAQQTCES-NHLAAATISGYET 237

Query: 249 VPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDG 308
           VP  DE+ALLKAVS QPVS+AI     EF  Y  GIFNG CGTQL HAVTIVG+G +E+G
Sbjct: 238 VPQNDEEALLKAVSQQPVSVAIEGSGYEFIHYSGGIFNGECGTQLTHAVTIVGYGVSEEG 297

Query: 309 ANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPLA 348
             YWL+KNSWG +WG+ GYM+I+RD    +G+CG+ + + YP+A
Sbjct: 298 IKYWLLKNSWGESWGENGYMRIMRDVDSPQGMCGLASLAYYPVA 341


>gi|37780045|gb|AAP32195.1| cysteine protease 5 [Trifolium repens]
          Length = 343

 Score =  363 bits (933), Expect = 5e-98,   Method: Compositional matrix adjust.
 Identities = 178/325 (54%), Positives = 233/325 (71%), Gaps = 15/325 (4%)

Query: 33  VSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGN-RTYKL 91
           V+SR+  + S+ E HE+WM  +G+ YK+  E+E RL+IF ENL+YIE +N  GN + YKL
Sbjct: 25  VTSRTLQDDSIFERHEQWMTHYGKVYKNPQEREKRLRIFTENLKYIEASNNAGNNKPYKL 84

Query: 92  GTNQFSDLTNDEFRALYTGYK--MPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVT 149
           G NQF+DLTN+EF A    +K  M S   R+TT   FKY+N   T VP+++DWR KGAVT
Sbjct: 85  GINQFADLTNEEFIASRNKFKGHMCSSIIRTTT---FKYEN---TSVPSTVDWRKKGAVT 138

Query: 150 PIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-NNGCLGGSREKAF 208
           P+KNQ +CGCCWAF+A+AA EGI KI +G L+ LSEQ+L+DC TNG + GC GG  + AF
Sbjct: 139 PVKNQGQCGCCWAFSAIAATEGIHKISTGKLVSLSEQELVDCDTNGVDQGCEGGLMDDAF 198

Query: 209 AYIIQNQGIATEDEYPYQAVPGTCSAAQ-KPAAAKISNYEEVPSGDEQALLKAVSMQPVS 267
            +IIQN GI+TE  YPYQ V GTC A +   +AA I+ YE+VP+ +E AL KAV+ QP+S
Sbjct: 199 KFIIQNNGISTEAGYPYQGVDGTCKANEASTSAATITGYEDVPANNENALQKAVANQPIS 258

Query: 268 IAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGY 327
           +AI A  ++FQ YK G+F G CGT+LDH VT VG+G + DG  YWL+KNSWG  WG+ GY
Sbjct: 259 VAIDASGSDFQFYKSGVFTGSCGTELDHGVTAVGYGISNDGTKYWLVKNSWGTDWGEEGY 318

Query: 328 MKIVRD----EGLCGIGTRSSYPLA 348
           +++ R     EGLCGI  ++SYP A
Sbjct: 319 IRMQRSIDAAEGLCGIAMQASYPTA 343


>gi|144905108|dbj|BAF56428.1| cysteine proteinase [Lotus japonicus]
          Length = 342

 Score =  363 bits (933), Expect = 5e-98,   Method: Compositional matrix adjust.
 Identities = 178/346 (51%), Positives = 240/346 (69%), Gaps = 18/346 (5%)

Query: 11  FKINTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKI 70
             I++  + ++   L   A+    +R+  + S+ E HE+WM Q+G+ Y D  EKE+R  I
Sbjct: 7   LNISSLALLLVFGFLAFEAN----ARTLEDVSLKERHEQWMTQYGKVYTDSYEKELRSNI 62

Query: 71  FKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRAL--YTGYKMPSPSHRSTTSSTFKY 128
           FKEN++ IE  N  GN+ YKLG NQF+DLTN+EF+A   + G+   +    ST + TFKY
Sbjct: 63  FKENVQRIEAFNNAGNKPYKLGINQFADLTNEEFKARNRFKGHMCSN----STRTPTFKY 118

Query: 129 QNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQL 188
           +++S   VP SLDWR KGAVTPIK+Q +CGCCWAF+AVAA EGITK+ +G LI LSEQ+L
Sbjct: 119 EDVS--SVPASLDWRQKGAVTPIKDQGQCGCCWAFSAVAATEGITKLSTGKLISLSEQEL 176

Query: 189 LDCSTNG-NNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSA-AQKPAAAKISNY 246
           +DC T G + GC GG  + AF +I+QN+G+ TE +YPYQ V  TC+A A+   AA I  +
Sbjct: 177 VDCDTKGVDQGCEGGLMDDAFKFIMQNKGLNTEAKYPYQGVDATCNANAEAKDAASIKGF 236

Query: 247 EEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTE 306
           E+VP+  E ALLKAV+ QP+S+AI A  +EFQ Y  G+F G CGT+LDH VT VG+G ++
Sbjct: 237 EDVPANSESALLKAVANQPISVAIDASGSEFQFYSSGLFTGSCGTELDHGVTAVGYGVSD 296

Query: 307 DGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPLA 348
           DG  YWL+KNSWG  WG+ GY+++ RD    EGLCGI  ++SYP A
Sbjct: 297 DGTKYWLVKNSWGEQWGEEGYIRMQRDVAAEEGLCGIAMQASYPTA 342


>gi|357474579|ref|XP_003607574.1| Cysteine protease [Medicago truncatula]
 gi|355508629|gb|AES89771.1| Cysteine protease [Medicago truncatula]
          Length = 345

 Score =  363 bits (932), Expect = 7e-98,   Method: Compositional matrix adjust.
 Identities = 178/324 (54%), Positives = 232/324 (71%), Gaps = 12/324 (3%)

Query: 33  VSSRSTHEQSVV-EIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGN-RTYK 90
           V+SR+  + S++ E HE+WM  +G+ YKD  E+E RLKIFKEN+ YIE +N  GN + YK
Sbjct: 26  VTSRTLQDDSIIYEKHEQWMVHYGKVYKDLQERENRLKIFKENVNYIEASNNAGNNKLYK 85

Query: 91  LGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTP 150
           LG NQF+D+TN+EF A    +K    S   T +STFKY+N S   VP+++DWR KGAVTP
Sbjct: 86  LGINQFADITNEEFIASRNKFKGHMCS-SITKTSTFKYENAS---VPSTVDWRKKGAVTP 141

Query: 151 IKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-NNGCLGGSREKAFA 209
           +KNQ +CGCCWAF+AVAA EGI K+ +G L+ LSEQ+L+DC T G + GC GG  + AF 
Sbjct: 142 VKNQGQCGCCWAFSAVAATEGIHKLSTGKLVSLSEQELVDCDTKGVDQGCEGGLMDDAFK 201

Query: 210 YIIQNQGIATEDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVSI 268
           +IIQN G+ TE +YPYQ V GTCSA +    AA I+ YE+VP+ +E AL KAV+ QP+S+
Sbjct: 202 FIIQNHGLHTEAQYPYQGVDGTCSANETSTPAATIAGYEDVPANNENALQKAVANQPISV 261

Query: 269 AIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYM 328
           AI A  ++FQ YK G+F G CGTQLDH VT VG+G + DG  YWL+KNSWGN WG+ GY+
Sbjct: 262 AIDASGSDFQFYKSGVFTGSCGTQLDHGVTAVGYGISNDGTKYWLVKNSWGNDWGEEGYI 321

Query: 329 KIVRD----EGLCGIGTRSSYPLA 348
           ++ R     +GLCGI   +SYP A
Sbjct: 322 RMQRSVDAAQGLCGIAMMASYPTA 345


>gi|144905116|dbj|BAF56430.1| cysteine proteinase [Lotus japonicus]
          Length = 341

 Score =  362 bits (930), Expect = 1e-97,   Method: Compositional matrix adjust.
 Identities = 180/344 (52%), Positives = 243/344 (70%), Gaps = 17/344 (4%)

Query: 13  INTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFK 72
           +N T + ++  L+    S   ++R+  + S+ E HE+WMAQ+G+ YKD  EKE+R KIFK
Sbjct: 7   LNITSLTLL--LVFGFLSFEANARTLEDASMHERHEQWMAQYGKVYKDSYEKELRSKIFK 64

Query: 73  ENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRAL--YTGYKMPSPSHRSTTSSTFKYQN 130
           EN++ IE  N  GN++YKLG NQF+DLTN+EF+A   + G+   +    ST + TFKY++
Sbjct: 65  ENVQRIEAFNNAGNKSYKLGINQFADLTNEEFKARNRFKGHMCSN----STRTPTFKYEH 120

Query: 131 LSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLD 190
           +  T VP SLDWR KGAVTPIK+Q +CGCCWAF+AVAA EGITK+ +G LI LSEQ+L+D
Sbjct: 121 V--TSVPASLDWRQKGAVTPIKDQGQCGCCWAFSAVAATEGITKLSTGKLISLSEQELVD 178

Query: 191 CSTNG-NNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSA-AQKPAAAKISNYEE 248
           C T G + GC GG  + AF +I+QN+G+ TE +YPYQ V  TC+A A+   AA I  +E+
Sbjct: 179 CDTKGVDQGCEGGLMDDAFKFIMQNKGLNTEAKYPYQGVDATCNANAEAKDAASIKGFED 238

Query: 249 VPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDG 308
           VP+  E ALLKAV+ QP+S+AI A  +EFQ Y  G+F G CGT+LDH VT VG+G ++ G
Sbjct: 239 VPANSESALLKAVANQPISVAIDASGSEFQFYSSGVFTGSCGTELDHGVTAVGYG-SDGG 297

Query: 309 ANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPLA 348
             YWL+KNSWG  WG+ GY+++ RD    EGLCG   ++SYP A
Sbjct: 298 TKYWLVKNSWGEQWGEQGYIRMQRDVAAEEGLCGFAMQASYPTA 341


>gi|50355621|dbj|BAD29959.1| cysteine protease [Daucus carota]
          Length = 361

 Score =  362 bits (929), Expect = 1e-97,   Method: Compositional matrix adjust.
 Identities = 175/337 (51%), Positives = 238/337 (70%), Gaps = 10/337 (2%)

Query: 18  MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
           M   + LL + A Q  +SR+  E S+ E HE+WM Q+GR YKDE EK +R +IF +N+++
Sbjct: 29  MIAALILLGAWACQA-TSRTLPEASMFERHEQWMIQYGRVYKDEAEKSVRFQIFMDNVKF 87

Query: 78  IEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVP 137
           IE+ NK+G ++YKL  N+F+D TN+EF+A   GYKM + S R + ++ F+Y+N+  T VP
Sbjct: 88  IEEFNKDGRQSYKLAVNEFADQTNEEFQASRNGYKM-AVSSRPSQTTLFRYENV--TAVP 144

Query: 138 TSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-N 196
           +S+DWR KGAVTP+K+Q +CG CWAF+ +AA EGITK+++G LI LSEQ+L+DC   G +
Sbjct: 145 SSMDWRKKGAVTPVKDQGQCGSCWAFSTIAATEGITKLKTGKLISLSEQELVDCDKTGED 204

Query: 197 NGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQ 255
            GC GG  E  F +I++N+GIA E  YPY A  GTC++ ++ + AAKIS YE+VP+  E 
Sbjct: 205 QGCEGGYMEDGFEFIVKNKGIALEASYPYTAADGTCNSKEEASRAAKISGYEKVPANSET 264

Query: 256 ALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIK 315
           ALLKAV+ QPVS++I A    FQ Y  G+F G CGT LDH VT VG+G T DG  YWL+K
Sbjct: 265 ALLKAVANQPVSVSIDASGVAFQFYSSGVFTGECGTDLDHGVTAVGYGKTSDGTKYWLVK 324

Query: 316 NSWGNTWGDAGYMKIVR----DEGLCGIGTRSSYPLA 348
           NSWG +WGD+GY+ + R      GLCGI   +SYP A
Sbjct: 325 NSWGASWGDSGYIMMQRGVAAKGGLCGIAMDASYPTA 361


>gi|357474573|ref|XP_003607571.1| Cysteine proteinase EP-B [Medicago truncatula]
 gi|34329348|gb|AAQ63885.1| putative cysteine proteinase [Medicago truncatula]
 gi|355508626|gb|AES89768.1| Cysteine proteinase EP-B [Medicago truncatula]
          Length = 345

 Score =  362 bits (929), Expect = 2e-97,   Method: Compositional matrix adjust.
 Identities = 179/324 (55%), Positives = 230/324 (70%), Gaps = 12/324 (3%)

Query: 33  VSSRSTHEQSVV-EIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGN-RTYK 90
           V+SR+  + S++ E HE+WM  +G+ YKD  E+E RLKIFKEN+ YIE +N  GN + YK
Sbjct: 26  VTSRTLQDDSIIYEKHEQWMVHYGKVYKDLQERENRLKIFKENVNYIEASNNAGNNKLYK 85

Query: 91  LGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTP 150
           LG NQF+DLTN+EF A    +K    S   T +STFKY+N S   VP+++DWR KGAVTP
Sbjct: 86  LGINQFADLTNEEFIASRNKFKGHMCS-SITKTSTFKYENAS---VPSTVDWRKKGAVTP 141

Query: 151 IKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-NNGCLGGSREKAFA 209
           +KNQ +CGCCWAF+AVAA EGI K+ +G L+ LSEQ+L+DC T G + GC GG  + AF 
Sbjct: 142 VKNQGQCGCCWAFSAVAATEGIHKLSTGKLVSLSEQELVDCDTKGVDQGCEGGLMDDAFK 201

Query: 210 YIIQNQGIATEDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVSI 268
           +IIQN G+ TE +YPYQ V GTCSA +    A  I+ YE+VP+ +EQAL KAV+ QP+S+
Sbjct: 202 FIIQNHGLNTEAQYPYQGVDGTCSANKASIHAVTITGYEDVPANNEQALQKAVANQPISV 261

Query: 269 AIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYM 328
           AI A  ++FQ YK G+F G CGT+LDH VT VG+G   DG  YWL+KNSWG  WG+ GY+
Sbjct: 262 AIDASGSDFQFYKSGVFTGSCGTELDHGVTAVGYGVGNDGTKYWLVKNSWGTDWGEEGYI 321

Query: 329 KIVRD----EGLCGIGTRSSYPLA 348
           K+ R     EGLCGI   +SYP A
Sbjct: 322 KMQRGVDAAEGLCGIAMEASYPTA 345


>gi|37780047|gb|AAP32196.1| cysteine protease 8 [Trifolium repens]
          Length = 343

 Score =  360 bits (925), Expect = 4e-97,   Method: Compositional matrix adjust.
 Identities = 172/324 (53%), Positives = 231/324 (71%), Gaps = 13/324 (4%)

Query: 33  VSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLG 92
           V+SR+  + S+ E H +WM+Q+G+ YKD  E+E R KIF EN+ Y+E +N +  ++YKLG
Sbjct: 25  VTSRTLQDDSMYERHGQWMSQYGKIYKDHQERETRFKIFTENVNYVEASNADDTKSYKLG 84

Query: 93  TNQFSDLTNDEFRALYTGYK--MPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTP 150
            NQF+DLTN+EF A    +K  M S   R+TT   FKY+N+S   +P+++DWR KGAVTP
Sbjct: 85  INQFADLTNEEFVASRNKFKGHMCSSITRTTT---FKYENVSA--IPSTVDWRKKGAVTP 139

Query: 151 IKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-NNGCLGGSREKAFA 209
           +KNQ +CGCCWAF+AVAA EGI K+ +G LI LSEQ+L+DC T G + GC GG  + AF 
Sbjct: 140 VKNQGQCGCCWAFSAVAATEGIHKLSTGKLISLSEQELVDCDTKGVDQGCEGGLMDDAFK 199

Query: 210 YIIQNQGIATEDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVSI 268
           +IIQN G++TE +YPY+ V GTC+A +    A  I+ YE+VP+  EQAL KAV+ QP+S+
Sbjct: 200 FIIQNHGLSTEAQYPYEGVDGTCNANKASVQAVTITGYEDVPANSEQALQKAVANQPISV 259

Query: 269 AIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYM 328
           AI A  ++FQ YK G+F G CGT+LDH VT VG+G + DG  YWL+KNSWG  WG+ GY+
Sbjct: 260 AIDASGSDFQFYKSGVFTGSCGTELDHGVTAVGYGVSNDGTKYWLVKNSWGTDWGEEGYI 319

Query: 329 KIVRD----EGLCGIGTRSSYPLA 348
            + R     EGLCGI  ++SYP A
Sbjct: 320 MMQRGVEAAEGLCGIAMQASYPTA 343


>gi|357474527|ref|XP_003607548.1| Cysteine protease [Medicago truncatula]
 gi|358347211|ref|XP_003637653.1| Cysteine protease [Medicago truncatula]
 gi|355503588|gb|AES84791.1| Cysteine protease [Medicago truncatula]
 gi|355508603|gb|AES89745.1| Cysteine protease [Medicago truncatula]
          Length = 345

 Score =  360 bits (924), Expect = 5e-97,   Method: Compositional matrix adjust.
 Identities = 179/327 (54%), Positives = 228/327 (69%), Gaps = 11/327 (3%)

Query: 29  ASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGN-R 87
           A QV S     + ++ E HE+WM  +G+ YKD  E+E RLKIFKEN+ YIE +N  GN +
Sbjct: 23  AIQVTSRTLQDDSNIYEKHEQWMVHYGKVYKDLQERENRLKIFKENVNYIEASNNAGNNK 82

Query: 88  TYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGA 147
            YKLG NQF+DLTN+EF A    +K    S   T +STFKY+N S   VP+++DWR KGA
Sbjct: 83  LYKLGINQFADLTNEEFIASRNKFKGHMCS-SITKTSTFKYENAS---VPSTVDWRKKGA 138

Query: 148 VTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-NNGCLGGSREK 206
           VTP+KNQ +CGCCWAF+AVAA EGI K+ +G L+ LSEQ+L+DC T G + GC GG  + 
Sbjct: 139 VTPVKNQGQCGCCWAFSAVAATEGIHKLSTGKLVSLSEQELVDCDTKGVDQGCEGGLMDD 198

Query: 207 AFAYIIQNQGIATEDEYPYQAVPGTCSAAQKP-AAAKISNYEEVPSGDEQALLKAVSMQP 265
           AF +IIQN G+ TE +YPYQ V GTCSA +    A  I+ YE+VP+ +EQAL KAV+ QP
Sbjct: 199 AFKFIIQNHGLNTEAQYPYQGVDGTCSANKASIHAVTITGYEDVPANNEQALQKAVANQP 258

Query: 266 VSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDA 325
           +S+AI A  ++FQ YK G+F G CGT+LDH VT VG+G   DG  YWL+KNSWG  WG+ 
Sbjct: 259 ISVAIDASGSDFQFYKSGVFTGSCGTELDHGVTAVGYGVGNDGTKYWLVKNSWGTDWGEE 318

Query: 326 GYMKIVRD----EGLCGIGTRSSYPLA 348
           GY+K+ R     EGLCGI   +SYP A
Sbjct: 319 GYIKMQRGVDAAEGLCGIAMEASYPTA 345


>gi|225446589|ref|XP_002280263.1| PREDICTED: vignain [Vitis vinifera]
          Length = 339

 Score =  360 bits (923), Expect = 7e-97,   Method: Compositional matrix adjust.
 Identities = 178/334 (53%), Positives = 240/334 (71%), Gaps = 14/334 (4%)

Query: 21  IITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEK 80
           ++ +L + ASQ  +SRS HE S+ E HE WMA++GR YKD  EKE R KIFK+N+  IE 
Sbjct: 14  LLFILAAWASQA-TSRSLHEASMYERHEDWMARYGRMYKDANEKEKRFKIFKDNVARIES 72

Query: 81  ANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSL 140
            NK  ++TYKL  N+F+DLTN+EFR+L   +K    +H  + ++TFKY+N+  T VP+++
Sbjct: 73  FNKAMDKTYKLSINEFADLTNEEFRSLRNRFK----AHICSEATTFKYENV--TAVPSTI 126

Query: 141 DWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-NNGC 199
           DWR KGAVTPIK+Q++CGCCWAF+AVAA EGIT+I +G LI LSEQ+L+DC T G N GC
Sbjct: 127 DWRKKGAVTPIKDQQQCGCCWAFSAVAATEGITQITTGKLISLSEQELVDCDTGGENQGC 186

Query: 200 LGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKP-AAAKISNYEEVPSGDEQALL 258
            GG  + AF + I+  G+A+E  YPY+   GTC++ ++   AAKI  YE+VP+ +E+AL 
Sbjct: 187 SGGLMDDAFRF-IKIHGLASEATYPYEGDDGTCNSKKEAHPAAKIKGYEDVPANNEKALQ 245

Query: 259 KAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSW 318
           KAV+ QPV++AI A   EFQ Y  G+F G CGT+LDH V  VG+G  +DG  YWL+KNSW
Sbjct: 246 KAVAHQPVAVAIDAGGFEFQFYTSGVFTGQCGTELDHGVAAVGYGIGDDGMMYWLVKNSW 305

Query: 319 GNTWGDAGYMKIVRD----EGLCGIGTRSSYPLA 348
           G  WG+ GY+++ RD    EGLCGI  ++SYP A
Sbjct: 306 GTGWGEEGYIRMQRDVTAKEGLCGIAMQASYPTA 339


>gi|224076972|ref|XP_002305074.1| predicted protein [Populus trichocarpa]
 gi|224106329|ref|XP_002333698.1| predicted protein [Populus trichocarpa]
 gi|222837984|gb|EEE76349.1| predicted protein [Populus trichocarpa]
 gi|222848038|gb|EEE85585.1| predicted protein [Populus trichocarpa]
          Length = 307

 Score =  359 bits (922), Expect = 9e-97,   Method: Compositional matrix adjust.
 Identities = 170/312 (54%), Positives = 224/312 (71%), Gaps = 11/312 (3%)

Query: 43  VVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTND 102
           +++ HE+WMAQHGR Y D  EKE R  IFKEN+E IE  N   +R YKLG N+F+DLTN+
Sbjct: 1   MLKRHEEWMAQHGRVYGDMKEKEKRYLIFKENIERIEAFNNGSDRGYKLGVNKFADLTNE 60

Query: 103 EFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWA 162
           EFRA++ GYK  S       SS+F+++NLS   +PTS+DWR  GAVTP+K+Q  CGCCWA
Sbjct: 61  EFRAMHHGYKRQSSK---LMSSSFRHENLSA--IPTSMDWRKAGAVTPVKDQGTCGCCWA 115

Query: 163 FAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-NNGCLGGSREKAFAYIIQNQGIATED 221
           F+AVAA+EGI K+++G LI LSEQQL+DC   G + GC GG  + AF +I++N G+ +E 
Sbjct: 116 FSAVAAIEGIIKLKTGKLISLSEQQLVDCDVKGVDQGCGGGLMDNAFQFILRNGGLTSEA 175

Query: 222 EYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSY 280
            YPYQ V GTC + +  +  AKI+ YE+VP  +E ALL+AV+ QPVS+A+     +FQ Y
Sbjct: 176 TYPYQGVDGTCKSKKTASIEAKITGYEDVPVNNENALLQAVAKQPVSVAVEGGGYDFQFY 235

Query: 281 KEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGL 336
           K G+F G CGT LDHAVT +G+GT  DG NYWL+KNSWG +WG++GYM++ R     EGL
Sbjct: 236 KSGVFKGDCGTYLDHAVTAIGYGTNSDGTNYWLVKNSWGTSWGESGYMRMQRGIGAREGL 295

Query: 337 CGIGTRSSYPLA 348
           CG+   +SYP A
Sbjct: 296 CGVAMDASYPTA 307


>gi|50355613|dbj|BAD29955.1| cysteine protease [Daucus carota]
          Length = 365

 Score =  359 bits (922), Expect = 1e-96,   Method: Compositional matrix adjust.
 Identities = 175/346 (50%), Positives = 243/346 (70%), Gaps = 15/346 (4%)

Query: 10  SFKINTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLK 69
           + K   TP+ ++ T+ V   + + ++RS +E S+ E H++WMA++GR YK   EK  R  
Sbjct: 4   TIKHQCTPLALLFTIGV--LASLAAARSLNEASMTETHDQWMARYGRVYKTANEKNRRST 61

Query: 70  IFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHR-STTSSTFKY 128
           IF+ENL+YI+  NK  N+ YKLG N+F+DLTN+EF      +K    SH  +T ++ F+Y
Sbjct: 62  IFQENLKYIQTFNKANNKPYKLGVNEFADLTNEEFTTSRNKFK----SHVCATVTNVFRY 117

Query: 129 QNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQL 188
           +N+  T VP ++DWR KGAVTPIKNQ +CGCCWAF+AVAA+EGIT++++G LI LSEQ+L
Sbjct: 118 ENV--TAVPATMDWRKKGAVTPIKNQGQCGCCWAFSAVAAMEGITQLKTGKLISLSEQEL 175

Query: 189 LDCSTNG-NNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKP-AAAKISNY 246
           +DC TNG + GC GG  + AF +I QN G++TE  YPY    GTC+A ++   AA I+ +
Sbjct: 176 VDCDTNGEDQGCEGGLMDYAFDFIQQNHGLSTETNYPYSGTDGTCNANKEANHAATITGH 235

Query: 247 EEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTE 306
           E+VP+  E ALLKAV+ QP+S+AI A  ++FQ Y  G+F G CGT+LDH VT VG+GT  
Sbjct: 236 EDVPANSESALLKAVANQPISVAIDASGSDFQFYSSGVFTGECGTELDHGVTAVGYGTAA 295

Query: 307 DGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPLA 348
           DG  YWL+KNSWG +WG+ GY+++ R     EGLCGI  ++SYP A
Sbjct: 296 DGTKYWLVKNSWGTSWGEEGYIQMQRGVAAAEGLCGIAMQASYPTA 341


>gi|312281697|dbj|BAJ33714.1| unnamed protein product [Thellungiella halophila]
          Length = 347

 Score =  359 bits (921), Expect = 1e-96,   Method: Compositional matrix adjust.
 Identities = 174/339 (51%), Positives = 230/339 (67%), Gaps = 11/339 (3%)

Query: 18  MFIIITLLVSCASQVVSSRSTHEQSVV--EIHEKWMAQHGRSYKDELEKEMRLKIFKENL 75
           +F+I++L+ S    +  SR   +  ++  + H++WMA+HGR Y D  EK  R  +FK N+
Sbjct: 8   IFLIVSLISSFCLSITLSRPLDDNELIMQKRHDEWMAKHGRVYADMKEKNNRYVVFKRNV 67

Query: 76  EYIEKANK-EGNRTYKLGTNQFSDLTNDEFRALYTGYKMPS--PSHRSTTSSTFKYQNLS 132
           E IE+ N     RT+KL  NQF+DLTNDEFR++YTGYK  S   S   T +S+F+YQN+S
Sbjct: 68  ERIERLNNVPAGRTFKLAVNQFADLTNDEFRSMYTGYKGGSVLSSQSGTKTSSFRYQNVS 127

Query: 133 MTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS 192
              +P S+DWR KGAVTPIKNQ  CGCCWAF+AVAA+EG TKI+ G LI LSEQQL+DC 
Sbjct: 128 SGALPVSVDWRKKGAVTPIKNQGTCGCCWAFSAVAAIEGATKIKKGKLISLSEQQLVDCD 187

Query: 193 TNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQ-KPAAAKISNYEEVPS 251
           TN + GC GG  + AF +I+   G+ TE  YPY+    TC     KP A  I+ YE+VP 
Sbjct: 188 TN-DFGCSGGLMDTAFEHIMATGGLTTESNYPYKGKDATCKIKNTKPTATSITGYEDVPV 246

Query: 252 GDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANY 311
            DE+AL+KAV+ QPVSI I     +FQ Y  G+F G C T LDHAVT VG+G + +G+ Y
Sbjct: 247 NDEKALMKAVAHQPVSIGIEGGGFDFQFYGSGVFTGECTTYLDHAVTAVGYGQSSNGSKY 306

Query: 312 WLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYP 346
           W+IKNSWG  WG++GYM+I +D    +GLCG+  ++SYP
Sbjct: 307 WIIKNSWGTKWGESGYMRIKKDVKDKKGLCGLAMKASYP 345


>gi|47524507|gb|AAT34987.1| putative cysteine protease [Gossypium hirsutum]
          Length = 344

 Score =  357 bits (917), Expect = 3e-96,   Method: Compositional matrix adjust.
 Identities = 177/343 (51%), Positives = 245/343 (71%), Gaps = 16/343 (4%)

Query: 18  MFIIITLLVS-CASQVVS--SRSTHEQSVVEIHEKWMAQHGRSYKDELE--KEMRLKIFK 72
           +F+ + L++S C S  ++  SR   ++  +  HE+WM+QHGR Y DE E  K  R  +FK
Sbjct: 6   IFLFVALVLSFCFSIQLAGLSRPLLDEDSMR-HEEWMSQHGRVYADEQEDHKNKRFNVFK 64

Query: 73  ENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSP-SHRSTTSSTFKYQNL 131
           EN+E IE+ N    +T+KL  NQF+DLTN+EFRA Y G+K P   S + T  + F+Y+N+
Sbjct: 65  ENVERIEEFND--GKTFKLAINQFADLTNEEFRASYNGFKGPMVLSSQITKPTPFRYENV 122

Query: 132 SMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDC 191
           S + +P S+DWR KGAVTP+KNQ +CGCCWAF+AVAA+EGIT+I +G LI LSEQ+L+DC
Sbjct: 123 S-SALPVSVDWRKKGAVTPVKNQGQCGCCWAFSAVAAIEGITQISTGKLISLSEQELVDC 181

Query: 192 STNG-NNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQ-KPAAAKISNYEEV 249
            T G ++GC GG  + AF +II N G+ TE  YPY+   GTC+  +  P A  I+ YE+V
Sbjct: 182 DTKGIDHGCEGGLMDTAFEFIINNGGLTTESNYPYKGEDGTCNFNKTNPIAVSITGYEDV 241

Query: 250 PSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGA 309
           P+ DEQAL+KAV+ QPVS+AI A  ++FQ Y  G+F G CGT+LDHAVT VG+G +EDG+
Sbjct: 242 PANDEQALMKAVAHQPVSVAIEAGGSDFQFYSSGVFTGECGTELDHAVTAVGYGESEDGS 301

Query: 310 NYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPLA 348
            YW++KNSWG  WG++GY+++ +D    +GLCGI  ++SYP A
Sbjct: 302 KYWIVKNSWGTKWGESGYIEMQKDIKVKQGLCGIAMQASYPTA 344


>gi|357477459|ref|XP_003609015.1| Cysteine proteinase [Medicago truncatula]
 gi|355510070|gb|AES91212.1| Cysteine proteinase [Medicago truncatula]
          Length = 345

 Score =  357 bits (916), Expect = 5e-96,   Method: Compositional matrix adjust.
 Identities = 179/337 (53%), Positives = 235/337 (69%), Gaps = 11/337 (3%)

Query: 19  FIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYI 78
              I  L  CA QV +SRS    S+ E HE+WM+Q+ + YKD  E+E R KIF  N+ YI
Sbjct: 13  LTFIFCLGLCAIQV-TSRSLQVDSMYERHEQWMSQYSKVYKDPQEREERHKIFTANVNYI 71

Query: 79  EKANKEGN-RTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVP 137
           E  N + N + YKLG NQF+DLTN+EF A    +K    S  + T+ TFKY+N+S   +P
Sbjct: 72  EVFNNDANNKLYKLGINQFADLTNEEFIASRNKFKGHMCSSIAKTT-TFKYENVSA--IP 128

Query: 138 TSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-N 196
           +++DWR KGAVTP+KNQ +CGCCWAF+AVAA EGITK+ +G L+ LSEQ+L+DC T G +
Sbjct: 129 STVDWRKKGAVTPVKNQGQCGCCWAFSAVAATEGITKLSTGKLVSLSEQELVDCDTKGVD 188

Query: 197 NGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQ 255
            GC GG  + AF +IIQN G++TE  YPYQ V GTC+A +    AA I+ YE+VP+ +EQ
Sbjct: 189 QGCEGGLMDDAFKFIIQNHGLSTEAAYPYQGVDGTCNANKASIHAATITGYEDVPANNEQ 248

Query: 256 ALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIK 315
           AL KAV+ QP+S+AI A  ++FQ YK G+F+G CGT+LDH VT VG+G   DG  YWL+K
Sbjct: 249 ALQKAVANQPISVAIDASGSDFQFYKSGVFSGSCGTELDHGVTAVGYGVGNDGTKYWLVK 308

Query: 316 NSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPLA 348
           NSWG  WG+ GY+++ R     EGLCGI  ++SYP A
Sbjct: 309 NSWGTDWGEEGYIRMQRGVDAAEGLCGIAMQASYPTA 345


>gi|10336513|dbj|BAB13759.1| cysteine proteinase [Astragalus sinicus]
          Length = 343

 Score =  357 bits (915), Expect = 5e-96,   Method: Compositional matrix adjust.
 Identities = 177/345 (51%), Positives = 242/345 (70%), Gaps = 15/345 (4%)

Query: 14  NTTPMFIIITLLVSCA--SQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIF 71
           N    +I + LL+     +  V+SR+  + S+ E H++WM Q+ + Y D  E E R +IF
Sbjct: 4   NKQLYYISLALLMCLGLWAVQVTSRTLQDASMYERHQQWMGQYAKIYNDHQEWEKRFQIF 63

Query: 72  KENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYK--MPSPSHRSTTSSTFKYQ 129
           KEN+ YIE +NKEG R YKLG NQF DLTN+EF A    +K  M S   R+   +T+KY+
Sbjct: 64  KENVNYIETSNKEGGRFYKLGVNQFVDLTNEEFIAPRNRFKGHMCSSIIRT---NTYKYE 120

Query: 130 NLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLL 189
           N+  T VP+++DWR KGAVTP+K+Q +CGCCWAF+AVAA EGI ++ +G LI LSEQ+L+
Sbjct: 121 NV--TTVPSNVDWRQKGAVTPVKDQGQCGCCWAFSAVAATEGIHQLSTGKLISLSEQELV 178

Query: 190 DCSTNG-NNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPA-AAKISNYE 247
           DC T G + GC GG  + AF +IIQN G+ TE +YPYQ V GTC+A +    AA I++YE
Sbjct: 179 DCDTKGVDQGCEGGLMDDAFKFIIQNHGLDTEAKYPYQGVDGTCNANEASINAATITSYE 238

Query: 248 EVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTED 307
           +VP+ +EQAL KAV+ QP+S+AI A  ++FQ Y  G+F G CGT+LDH VT VG+G ++D
Sbjct: 239 DVPTNNEQALQKAVANQPISVAIDASGSDFQFYTSGVFTGSCGTELDHGVTAVGYGVSDD 298

Query: 308 GANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPLA 348
           G  YWL+KNSWG +WG+ GY+++ R     EGLCGI  ++SYP+A
Sbjct: 299 GTKYWLVKNSWGTSWGEEGYIRMQRGVDAVEGLCGIAMQASYPIA 343


>gi|84181681|gb|AAW78661.2| senescence-specific cysteine protease [Nicotiana tabacum]
          Length = 349

 Score =  356 bits (913), Expect = 9e-96,   Method: Compositional matrix adjust.
 Identities = 170/341 (49%), Positives = 238/341 (69%), Gaps = 11/341 (3%)

Query: 19  FIIITLLVSC----ASQVVSSRS-THEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKE 73
           ++ + L   C    +SQV  SR   +E ++   H++W+  H + YKD  EKE+R +IFKE
Sbjct: 9   YLCLALFFICLGLWSSQVALSRPINYEATMRARHDQWIVHHEKVYKDLNEKEVRFQIFKE 68

Query: 74  NLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSM 133
           N+E IE  N   ++ YKLG N+FSDLTN+EFR L+TGYK   P   +++     ++  ++
Sbjct: 69  NVERIEAFNAGEDKGYKLGFNKFSDLTNEEFRVLHTGYKRSHPKVMTSSKGKTHFRYTNV 128

Query: 134 TDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCST 193
           TD+P ++DWR KGAVTPIK+QKECGCCWAF+AVAA+EG+ ++++G LI LSEQ+L+DC  
Sbjct: 129 TDIPPTMDWRKKGAVTPIKDQKECGCCWAFSAVAAMEGLHQLKTGELIPLSEQELVDCDV 188

Query: 194 NG-NNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKP-AAAKISNYEEVPS 251
            G + GC GG  + AF +I++N+G+ TE  YPY+   G C+  +   +AAKI+ YE+VP+
Sbjct: 189 EGEDEGCSGGLLDTAFDFILKNKGLTTEVNYPYKGEDGVCNKKKSALSAAKITGYEDVPA 248

Query: 252 GDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANY 311
             E+ALL+AV+ QPVS+AI   S +FQ Y  G+F+G C T L+HAVT VG+G T DG  Y
Sbjct: 249 NSEKALLQAVANQPVSVAIDGSSFDFQFYSSGVFSGSCSTWLNHAVTAVGYGATTDGTKY 308

Query: 312 WLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPLA 348
           W+IKNSWG+ WGD+GYM+I RD    EGLCG+   +SYP A
Sbjct: 309 WIIKNSWGSKWGDSGYMRIKRDVHEKEGLCGLAMDASYPTA 349


>gi|357483847|ref|XP_003612210.1| Cysteine proteinase [Medicago truncatula]
 gi|355513545|gb|AES95168.1| Cysteine proteinase [Medicago truncatula]
          Length = 344

 Score =  355 bits (912), Expect = 2e-95,   Method: Compositional matrix adjust.
 Identities = 174/325 (53%), Positives = 230/325 (70%), Gaps = 14/325 (4%)

Query: 33  VSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANK-EGNRTYKL 91
           V+SR+  + S+ E HE+WM  +G+ YKD  E+E R KIF EN++YIE  N  + N +YKL
Sbjct: 25  VTSRTLQDGSMHERHERWMNHYGKVYKDHQEREKRFKIFTENMKYIEAFNNGDNNESYKL 84

Query: 92  GTNQFSDLTNDEFRALYTGYK--MPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVT 149
           G NQF+DLTN+EF A    +K  M S   R+TT   FKY+N+S   +P+++DWR KGAVT
Sbjct: 85  GINQFADLTNEEFVASRNKFKGHMCSSIIRTTT---FKYENVSA--IPSTVDWRKKGAVT 139

Query: 150 PIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-NNGCLGGSREKAF 208
           P+KNQ +CGCCWAF+AVAA EGI K+ +G L+ LSEQ+L+DC T G + GC GG  + AF
Sbjct: 140 PVKNQGQCGCCWAFSAVAATEGIHKLSTGKLVSLSEQELVDCDTKGVDQGCEGGLMDDAF 199

Query: 209 AYIIQNQGIATEDEYPYQAVPGTCSAAQKP-AAAKISNYEEVPSGDEQALLKAVSMQPVS 267
            +IIQN G+ TE +YPYQ V GTC+A +    A  I+ YE+VP+ +EQAL KAV+ QP+S
Sbjct: 200 KFIIQNHGLNTEAQYPYQGVDGTCNANKASIQATTITGYEDVPANNEQALQKAVANQPIS 259

Query: 268 IAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGY 327
           +AI A  ++FQ YK G+F G CGT+LDH VT VG+G + DG  YWL+KNSWG  WG+ GY
Sbjct: 260 VAIDASGSDFQFYKSGVFTGSCGTELDHGVTAVGYGVSNDGTKYWLVKNSWGTDWGEEGY 319

Query: 328 MKIVRD----EGLCGIGTRSSYPLA 348
           + + R     EGLCGI  ++SYP A
Sbjct: 320 IMMQRGVEAAEGLCGIAMQASYPTA 344


>gi|225446583|ref|XP_002280204.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1 [Vitis vinifera]
          Length = 341

 Score =  355 bits (911), Expect = 2e-95,   Method: Compositional matrix adjust.
 Identities = 176/336 (52%), Positives = 241/336 (71%), Gaps = 16/336 (4%)

Query: 21  IITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEK 80
           ++ +L + ASQ  ++R+ HE S+ E HE WM Q+GR YKD  EK  R KIFK+N+  IE 
Sbjct: 14  LLFVLAAWASQA-TARNLHEASMYERHEDWMVQYGREYKDADEKSKRYKIFKDNVARIES 72

Query: 81  ANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHR-STTSSTFKYQNLSMTDVPTS 139
            NK  +++YKL  N+F+DLTN+EFRA    +K    +H  ST +++FKY+N+  T VP++
Sbjct: 73  FNKAMDKSYKLSINEFADLTNEEFRASRNRFK----AHICSTEATSFKYENV--TAVPST 126

Query: 140 LDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-NNG 198
           +DWR KGAVTPIK+Q +CG CWAF+AVAA+EGIT++ +G LI LSEQ+L+DC T+G + G
Sbjct: 127 VDWRKKGAVTPIKDQGQCGSCWAFSAVAAMEGITQLSTGKLISLSEQELVDCDTSGEDQG 186

Query: 199 CLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCS--AAQKPAAAKISNYEEVPSGDEQA 256
           C GG  + AF +I QN G+ TE  YPY    GTC+   A  P AAKI+ YE+VP+ +E+A
Sbjct: 187 CSGGLMDDAFKFIEQNHGLTTEANYPYAGTDGTCNRKKAAHP-AAKINGYEDVPANNEKA 245

Query: 257 LLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKN 316
           L KAV+ QP+++AI A  +EFQ Y  G+F G CGT+LDH V+ VG+GT++DG  YWL+KN
Sbjct: 246 LQKAVAHQPIAVAIDAGGSEFQFYSSGVFTGQCGTELDHGVSAVGYGTSDDGMKYWLVKN 305

Query: 317 SWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPLA 348
           SWG  WG+ GY+++ RD    EGLCGI  ++SYP A
Sbjct: 306 SWGTGWGEEGYIRMQRDVTAKEGLCGIAMQASYPTA 341


>gi|224135841|ref|XP_002327317.1| predicted protein [Populus trichocarpa]
 gi|222835687|gb|EEE74122.1| predicted protein [Populus trichocarpa]
          Length = 342

 Score =  355 bits (911), Expect = 2e-95,   Method: Compositional matrix adjust.
 Identities = 170/336 (50%), Positives = 227/336 (67%), Gaps = 9/336 (2%)

Query: 19  FIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYI 78
           F    L++   +  V+SR   E S+   HE+WM   G+ Y D  EKE R +IFK+N+EYI
Sbjct: 10  FFAFILILGMWAYEVASRELQEPSMSARHEQWMETFGKVYADAAEKERRFEIFKDNVEYI 69

Query: 79  EKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPT 138
           E  N  GN+ YKL  N+F+DLTN+E +    GY+ P  + R    ++FKY+N+  T VP 
Sbjct: 70  ESFNTAGNKPYKLSVNKFADLTNEELKVARNGYRRPLQT-RPMKVTSFKYENV--TAVPA 126

Query: 139 SLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-NN 197
           ++DWR KGAVTPIK+Q +CG CWAF+ VAA EGI ++ +G L+ LSEQ+L+DC T G + 
Sbjct: 127 TMDWRKKGAVTPIKDQGQCGSCWAFSTVAATEGINQLTTGKLVSLSEQELVDCDTQGEDQ 186

Query: 198 GCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAA-AKISNYEEVPSGDEQA 256
           GC GG  E  F +II+N GI TE  YPYQA  GTC++ ++ +  AKI+ YE VP+  E A
Sbjct: 187 GCEGGLMEDGFEFIIKNHGITTEANYPYQAADGTCNSKKEASRIAKITGYESVPANSEAA 246

Query: 257 LLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKN 316
           LLKAV+ QP+S++I A  ++FQ Y  G+F G CGT+LDH VT VG+G T DG  YWL+KN
Sbjct: 247 LLKAVASQPISVSIDAGGSDFQFYSSGVFTGQCGTELDHGVTAVGYGETSDGTKYWLVKN 306

Query: 317 SWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPLA 348
           SWG +WG+ GY+++ RD    EGLCGI   SSYP A
Sbjct: 307 SWGTSWGEEGYIRMQRDTEAEEGLCGIAMDSSYPTA 342


>gi|118627554|emb|CAL64936.1| putative cysteine protease 8 [Trifolium pratense]
          Length = 344

 Score =  355 bits (911), Expect = 2e-95,   Method: Compositional matrix adjust.
 Identities = 172/325 (52%), Positives = 231/325 (71%), Gaps = 14/325 (4%)

Query: 33  VSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANK-EGNRTYKL 91
           V+SR+  + S+ E H +WM+Q+G+ YKD  E+E R KIFKEN+ YIE  N  +  ++YKL
Sbjct: 25  VTSRTLQDDSMYERHGQWMSQYGKIYKDHQERETRFKIFKENVNYIETFNNADDTKSYKL 84

Query: 92  GTNQFSDLTNDEFRALYTGYK--MPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVT 149
           G NQF+DLTN+EF A    +K  M S   R+T+   FKY+N+S   +P+++DWR KGAVT
Sbjct: 85  GINQFADLTNEEFIASRNKFKGHMCSSIMRTTS---FKYENVS--GIPSTVDWRKKGAVT 139

Query: 150 PIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-NNGCLGGSREKAF 208
           P+KNQ +CGCCWAF+AVAA EGI K+ +G LI LSEQ+L+DC T G + GC GG  + AF
Sbjct: 140 PVKNQGQCGCCWAFSAVAATEGIHKLSTGKLISLSEQELVDCDTKGVDQGCEGGLMDDAF 199

Query: 209 AYIIQNQGIATEDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVS 267
            +IIQN G++TE +YPY+ V GTC+A +    A  I+ YE+VP+  EQAL KAV+ QP+S
Sbjct: 200 KFIIQNHGLSTEAQYPYEGVDGTCNANKASVQAVTITGYEDVPANSEQALQKAVANQPIS 259

Query: 268 IAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGY 327
           +AI A  ++FQ YK G+F G CGT+LDH VT VG+G + DG  YWL+KNSWG  WG+ GY
Sbjct: 260 VAIDASGSDFQFYKSGVFTGACGTELDHGVTAVGYGVSNDGTKYWLVKNSWGTDWGEEGY 319

Query: 328 MKIVRD----EGLCGIGTRSSYPLA 348
           + + R     EG+CGI  ++SYP A
Sbjct: 320 IMMQRGIEAAEGICGIAMQASYPTA 344


>gi|225446581|ref|XP_002280246.1| PREDICTED: vignain [Vitis vinifera]
          Length = 341

 Score =  355 bits (910), Expect = 2e-95,   Method: Compositional matrix adjust.
 Identities = 176/336 (52%), Positives = 239/336 (71%), Gaps = 16/336 (4%)

Query: 21  IITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEK 80
           ++ +L + ASQ  ++RS HE S+ E HE WM Q+GR YKD  EK  R KIFK+N+  IE 
Sbjct: 14  LLFVLAAWASQA-TARSLHEASMYERHEDWMVQYGREYKDADEKSKRYKIFKDNVARIES 72

Query: 81  ANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHR-STTSSTFKYQNLSMTDVPTS 139
            NK  +++YKL  N+F+DLTN+EFRA    +K    +H  ST +++FKY+N+  T VP++
Sbjct: 73  FNKAMDKSYKLSINEFADLTNEEFRASRNRFK----AHICSTEATSFKYENV--TAVPST 126

Query: 140 LDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-NNG 198
           +DWR KGAVTPIK+Q +CG CWAF+AVAA+EGIT++ +G LI LSEQ+L+DC T+G + G
Sbjct: 127 VDWRKKGAVTPIKDQGQCGSCWAFSAVAAMEGITQLSTGKLISLSEQELVDCDTSGEDQG 186

Query: 199 CLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCS--AAQKPAAAKISNYEEVPSGDEQA 256
           C GG  + AF +I QN G+ TE  YPY    GTC+   A  P AAKI+ YE+VP+ +E+A
Sbjct: 187 CSGGLMDDAFKFIEQNHGLTTEANYPYAGTDGTCNRKKAAHP-AAKINGYEDVPANNEKA 245

Query: 257 LLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKN 316
           L KAV+ QP+++AI A  +EFQ Y  G+F G CGT+LDH V  VG+GT++DG  YWL+KN
Sbjct: 246 LQKAVAHQPIAVAIDASGSEFQFYSSGVFTGQCGTELDHGVAAVGYGTSDDGMKYWLVKN 305

Query: 317 SWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPLA 348
           SW   WG+ GY+++ RD    EGLCGI  ++SYP A
Sbjct: 306 SWSTGWGEEGYIRMQRDVTAKEGLCGIAMQASYPTA 341


>gi|297826875|ref|XP_002881320.1| hypothetical protein ARALYDRAFT_321132 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297327159|gb|EFH57579.1| hypothetical protein ARALYDRAFT_321132 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 341

 Score =  355 bits (910), Expect = 2e-95,   Method: Compositional matrix adjust.
 Identities = 171/335 (51%), Positives = 232/335 (69%), Gaps = 7/335 (2%)

Query: 18  MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
           +F I+    S +     + + HE S +E HE+WMA+  R Y+DELEK+MR  +FK+NL++
Sbjct: 10  IFTILFTTFSISQATSRTVTFHEPSSLEKHEQWMARFSRVYRDELEKQMRRDVFKKNLKF 69

Query: 78  IEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVP 137
           IE  NK+GN++YKLG N+F+D TN+EF A++TG K  S      T S+  +    M  V 
Sbjct: 70  IENFNKKGNKSYKLGVNEFADWTNEEFLAIHTGLKGLSSKVVDETISSRSWNISDMVGV- 128

Query: 138 TSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNN 197
            S DWR +GAVTP+K Q +CGCCWAF+AVAAVEG+TKI  GNL+ LSEQQLLDC    + 
Sbjct: 129 -SKDWRAEGAVTPVKYQGQCGCCWAFSAVAAVEGVTKIAGGNLVSLSEQQLLDCDREYDR 187

Query: 198 GCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQAL 257
           GC GG    AF YIIQN+GIA+E++Y YQ   G C ++ +P AA+IS ++ VPS +EQAL
Sbjct: 188 GCDGGIMSDAFNYIIQNRGIASENDYSYQGSDGRCRSSARP-AARISGFQTVPSNNEQAL 246

Query: 258 LKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNS 317
           L+AVS QPVS+++ A    F  Y  G+++G CGT  +HAVT VG+GT++DG  YWL KNS
Sbjct: 247 LEAVSRQPVSVSMDANGDGFMHYSGGVYDGPCGTSSNHAVTFVGYGTSQDGTKYWLAKNS 306

Query: 318 WGNTWGDAGYMKIVRD----EGLCGIGTRSSYPLA 348
           WG TWG+ GY++I RD    +G+CG+   + YP+A
Sbjct: 307 WGETWGEKGYIRIRRDVAWPQGMCGVAQYAFYPVA 341


>gi|356542631|ref|XP_003539770.1| PREDICTED: vignain-like [Glycine max]
          Length = 343

 Score =  354 bits (909), Expect = 3e-95,   Method: Compositional matrix adjust.
 Identities = 175/338 (51%), Positives = 239/338 (70%), Gaps = 15/338 (4%)

Query: 20  IIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIE 79
           + + LL    +   ++R+  + S+ E HE+WMAQHG+ YKD  EKE+R KIF++N++ IE
Sbjct: 12  LALLLLFGFWAFSANTRTLEDASMHERHEQWMAQHGKVYKDHHEKELRYKIFQQNVKGIE 71

Query: 80  KANKEGNRTYKLGTNQFSDLTNDEFRAL--YTGYKMPSPSHRSTTSSTFKYQNLSMTDVP 137
             N  GN+++KLG NQF+DLT +EF+A+    GY     S     +STFKY+++  T VP
Sbjct: 72  GFNNAGNKSHKLGVNQFADLTEEEFKAINKLKGYMWSKISR----TSTFKYEHV--TKVP 125

Query: 138 TSLDWRDKGAVTPIKNQK-ECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGN 196
            +LDWR KGAVTPIK+Q  +CG CWAFAAVAA EGITK+ +G LI LSEQ+L+DC TNG+
Sbjct: 126 ATLDWRQKGAVTPIKSQGLKCGSCWAFAAVAATEGITKLTTGELISLSEQELIDCDTNGD 185

Query: 197 NG-CLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSA-AQKPAAAKISNYEEVPSGDE 254
           NG C  G  ++AF +I+QN+G+ATE  YPYQAV GTC+A  +    A I  YE+VP+ +E
Sbjct: 186 NGGCKWGIIQEAFKFIVQNKGLATEASYPYQAVDGTCNAKVESKHVASIKGYEDVPANNE 245

Query: 255 QALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLI 314
            ALL AV+ QPVS+ + +   +F+ Y  G+ +G CGT  DHAVT+VG+G ++DG  YWLI
Sbjct: 246 TALLNAVANQPVSVLVDSSDYDFRFYSSGVLSGSCGTTFDHAVTVVGYGVSDDGTKYWLI 305

Query: 315 KNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPLA 348
           KNSWG  WG+ GY++I RD    EG+CGI  ++SYP+A
Sbjct: 306 KNSWGVYWGEQGYIRIKRDVAAKEGMCGIAMQASYPIA 343


>gi|147839728|emb|CAN70559.1| hypothetical protein VITISV_032465 [Vitis vinifera]
          Length = 341

 Score =  353 bits (905), Expect = 9e-95,   Method: Compositional matrix adjust.
 Identities = 175/336 (52%), Positives = 238/336 (70%), Gaps = 16/336 (4%)

Query: 21  IITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEK 80
           ++ +L + ASQ  ++R  HE S+ E HE WM Q+GR YKD  EK  R KIFK+N+  IE 
Sbjct: 14  LLFVLAAWASQA-TARXLHEASMYERHEDWMVQYGREYKDADEKSKRYKIFKDNVARIES 72

Query: 81  ANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHR-STTSSTFKYQNLSMTDVPTS 139
            NK  +++YKL  N+F+DLTN+EFRA    +K    +H  ST +++FKY+N+  T VP++
Sbjct: 73  FNKAMDKSYKLSINEFADLTNEEFRASRNRFK----AHICSTEATSFKYENV--TAVPST 126

Query: 140 LDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-NNG 198
           +DWR KGAVTPIK+Q +CG CWAF+AVAA+EGIT++ +G LI LSEQ+L+DC T+G + G
Sbjct: 127 VDWRKKGAVTPIKDQGQCGSCWAFSAVAAMEGITQLSTGKLISLSEQELVDCDTSGEDQG 186

Query: 199 CLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCS--AAQKPAAAKISNYEEVPSGDEQA 256
           C GG  + AF +I QN G+ TE  YPY    GTC+   A  P AAKI+ YE+VP+ +E+A
Sbjct: 187 CSGGLMDDAFKFIEQNHGLTTEANYPYAGTDGTCNRKKAAHP-AAKINGYEDVPANNEKA 245

Query: 257 LLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKN 316
           L KAV+ QP+++AI A  +EFQ Y  G+F G CGT+LDH V  VG+GT++DG  YWL+KN
Sbjct: 246 LQKAVAHQPIAVAIDASGSEFQFYSSGVFTGQCGTELDHGVAAVGYGTSDDGMKYWLVKN 305

Query: 317 SWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPLA 348
           SW   WG+ GY+++ RD    EGLCGI  ++SYP A
Sbjct: 306 SWSTGWGEEGYIRMQRDVTVKEGLCGIAMQASYPTA 341


>gi|357471211|ref|XP_003605890.1| Cysteine proteinase [Medicago truncatula]
 gi|355506945|gb|AES88087.1| Cysteine proteinase [Medicago truncatula]
          Length = 343

 Score =  353 bits (905), Expect = 9e-95,   Method: Compositional matrix adjust.
 Identities = 176/326 (53%), Positives = 227/326 (69%), Gaps = 17/326 (5%)

Query: 33  VSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANK-EGNRTYKL 91
           V+SR T +  + E H +WM+Q+G+ YKD  E+E R KIF EN+ YIE  NK + N+ Y L
Sbjct: 25  VTSR-TLQDDMYERHRQWMSQYGKVYKDSQEREKRFKIFTENVNYIEAFNKGDNNKLYTL 83

Query: 92  GTNQFSDLTNDEF---RALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAV 148
           G NQF+DLTNDEF   R  + G+   S     T +STFKY+N S   +P+S+DWR KGAV
Sbjct: 84  GVNQFADLTNDEFTSSRNKFKGHMCSSI----TRTSTFKYENASA--IPSSVDWRKKGAV 137

Query: 149 TPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-NNGCLGGSREKA 207
           TP+KNQ +CGCCWAF+AVAA EGI K+ +G LI LSEQ+L+DC T G + GC GG  + A
Sbjct: 138 TPVKNQGQCGCCWAFSAVAATEGIHKLSTGKLISLSEQELVDCDTKGVDQGCEGGLMDDA 197

Query: 208 FAYIIQNQGIATEDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPV 266
           F +IIQN G+ TE  YPYQ V GTC+A +    A  I+ YE+VP+ +EQAL KAV+ QP+
Sbjct: 198 FKFIIQNHGLNTEANYPYQGVDGTCNANKGSINAVTITGYEDVPTNNEQALQKAVANQPI 257

Query: 267 SIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAG 326
           S+AI A  ++FQ YK G+F G CGT+LDH VT VG+G + DG  YWL+KNSWG  WG+ G
Sbjct: 258 SVAIDASGSDFQFYKSGVFTGSCGTELDHGVTAVGYGVSNDGTKYWLVKNSWGTEWGEEG 317

Query: 327 YMKIVRD----EGLCGIGTRSSYPLA 348
           Y+ + R     EGLCGI  ++SYP A
Sbjct: 318 YIMMQRGVDAAEGLCGIAMQASYPTA 343


>gi|356542633|ref|XP_003539771.1| PREDICTED: LOW QUALITY PROTEIN: vignain-like [Glycine max]
          Length = 341

 Score =  352 bits (904), Expect = 1e-94,   Method: Compositional matrix adjust.
 Identities = 178/340 (52%), Positives = 232/340 (68%), Gaps = 14/340 (4%)

Query: 15  TTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKEN 74
           T  +F+I      CA +  ++R+  +  + E HE+WMA HG+ YK   EKE + +IF EN
Sbjct: 10  TLALFLIFAF---CAFEA-NARTLEDAPMRERHEQWMATHGKVYKHSYEKEQKYQIFMEN 65

Query: 75  LEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMT 134
           ++ IE  N  G + YKLG N F+DLTN+EF+A+   +K    S R+ T+ TF+Y+N+  T
Sbjct: 66  VQRIEAFNNAGXKPYKLGINHFADLTNEEFKAI-NRFKGHVCSKRTRTT-TFRYENV--T 121

Query: 135 DVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN 194
            VP SLDWR KGAVTPIK+Q +CGCCWAF+AVAA EGITK+R+G LI LSEQ+L+DC T 
Sbjct: 122 AVPASLDWRQKGAVTPIKDQGQCGCCWAFSAVAATEGITKLRTGKLISLSEQELVDCDTK 181

Query: 195 G-NNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSA-AQKPAAAKISNYEEVPSG 252
           G + GC GG  + AF +I+QN+G+ATE  YPY+   GTC+A A    A  I  YE+VP+ 
Sbjct: 182 GVDQGCEGGLMDDAFKFILQNKGLATEAIYPYEGFDGTCNAKADGNHAGSIKGYEDVPAN 241

Query: 253 DEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYW 312
            E ALLKAV+ QPVS+AI A   +FQ Y  G+F G CGT LDH VT VG+G  +DG  YW
Sbjct: 242 SESALLKAVANQPVSVAIEASGFKFQFYSGGVFTGSCGTNLDHGVTSVGYGVGDDGTKYW 301

Query: 313 LIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPLA 348
           L+KNSWG  WG+ GY+++ RD    EGLCGI   +SYP A
Sbjct: 302 LVKNSWGVKWGEKGYIRMQRDVAAKEGLCGIAMLASYPSA 341


>gi|147788834|emb|CAN64655.1| hypothetical protein VITISV_005140 [Vitis vinifera]
          Length = 341

 Score =  352 bits (902), Expect = 2e-94,   Method: Compositional matrix adjust.
 Identities = 176/340 (51%), Positives = 238/340 (70%), Gaps = 17/340 (5%)

Query: 19  FIIITLLVSCASQV--VSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLE 76
           +I + LL   A+      +R+ HE S+ E HE WMAQ+GR YKD  EK  R KIFK+N+ 
Sbjct: 9   YICLALLFVLAAWASHAKARNLHEASMYERHEDWMAQYGRVYKDAGEKSKRYKIFKDNVA 68

Query: 77  YIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHR-STTSSTFKYQNLSMTD 135
            IE  NK  N++YKL  N+F+DLTN+EFRA    +K    +H  ST +++FKY+++    
Sbjct: 69  RIESFNKAMNKSYKLSINEFADLTNEEFRASRNRFK----AHICSTEATSFKYEHVXA-- 122

Query: 136 VPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG 195
           VP+++DWR KGAVTPIK+Q +CG CWAF+AVAA+EGIT++ +G LI LSEQ+L+DC T+G
Sbjct: 123 VPSTVDWRKKGAVTPIKDQGQCGSCWAFSAVAAMEGITQLSTGKLISLSEQELVDCDTSG 182

Query: 196 -NNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCS--AAQKPAAAKISNYEEVPSG 252
            + GC GG  + AF +I QN G+ TE  YPY    GTC+   A  PAA KI+ YE+VP+ 
Sbjct: 183 EDQGCSGGLMDDAFKFIEQNHGLTTEANYPYAGTDGTCNRKKAAHPAA-KINGYEDVPAN 241

Query: 253 DEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYW 312
           +E+AL KAV+ QP+++AI A   EFQ Y  G+F G CGT+LDH V+ VG+GT++DG  YW
Sbjct: 242 NEKALQKAVAHQPIAVAIDAGGFEFQFYSSGVFTGQCGTELDHGVSAVGYGTSDDGMKYW 301

Query: 313 LIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPLA 348
           L+KNSWG  WG+ GY+++ RD    EGLCGI  ++SYP A
Sbjct: 302 LVKNSWGTGWGEEGYIRMQRDVTEKEGLCGIAMQASYPTA 341


>gi|356577811|ref|XP_003557016.1| PREDICTED: vignain-like [Glycine max]
          Length = 343

 Score =  351 bits (901), Expect = 2e-94,   Method: Compositional matrix adjust.
 Identities = 172/324 (53%), Positives = 227/324 (70%), Gaps = 13/324 (4%)

Query: 33  VSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLG 92
           V+ RS  + S+ E HE+WM ++G+ YKD  E+E R +IFKEN+ YIE  N   N+ YKL 
Sbjct: 25  VTCRSLQDASMYERHEQWMTRYGKVYKDPQEREKRFRIFKENVNYIEAFNNAANKRYKLA 84

Query: 93  TNQFSDLTNDEFRALYTGYK--MPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTP 150
            NQF+DLTN+EF A    +K  M S   R+TT   FKY+N+  T VP+++DWR KGAVTP
Sbjct: 85  INQFADLTNEEFIAPRNRFKGHMCSSIIRTTT---FKYENV--TAVPSTVDWRQKGAVTP 139

Query: 151 IKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-NNGCLGGSREKAFA 209
           IK+Q +CGCCWAF+AVAA EGI  + SG LI LSEQ+L+DC T G + GC GG  + AF 
Sbjct: 140 IKDQGQCGCCWAFSAVAATEGIHALTSGKLISLSEQELVDCDTKGVDQGCEGGLMDDAFK 199

Query: 210 YIIQNQGIATEDEYPYQAVPGTCSAAQKP-AAAKISNYEEVPSGDEQALLKAVSMQPVSI 268
           ++IQN G+ TE  YPY+ V G C+  +    AA I+ YE+VP+ +E+AL KAV+ QPVS+
Sbjct: 200 FVIQNHGLNTEANYPYKGVDGKCNVNEAANDAATITGYEDVPANNEKALQKAVANQPVSV 259

Query: 269 AIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYM 328
           AI A  ++FQ YK G+F G CGT+LDH VT VG+G + DG  YWL+KNSWG  WG+ GY+
Sbjct: 260 AIDASGSDFQFYKSGVFTGSCGTELDHGVTAVGYGVSNDGTEYWLVKNSWGTEWGEEGYI 319

Query: 329 KIVR----DEGLCGIGTRSSYPLA 348
           ++ R    +EGLCGI  ++SYP A
Sbjct: 320 RMQRGVNSEEGLCGIAMQASYPTA 343


>gi|18403438|ref|NP_565780.1| cysteine proteinase-like protein [Arabidopsis thaliana]
 gi|2342728|gb|AAB67626.1| cysteine proteinase [Arabidopsis thaliana]
 gi|330253821|gb|AEC08915.1| cysteine proteinase-like protein [Arabidopsis thaliana]
          Length = 345

 Score =  351 bits (901), Expect = 2e-94,   Method: Compositional matrix adjust.
 Identities = 168/343 (48%), Positives = 238/343 (69%), Gaps = 13/343 (3%)

Query: 18  MFIIITLLVSCASQVVSSRST------HEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIF 71
           + +++T+L+   +    S++T       EQS+V+ HE+WMA+  R Y+DELEK MR  +F
Sbjct: 4   IMVLVTVLIILFTGFRISQATSRTVIFREQSMVDKHEQWMARFSREYRDELEKNMRRDVF 63

Query: 72  KENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYK-MPSPSHRSTTSSTFKYQN 130
           K+NL++IE  NK+GN++YKLG N+F+D TN+EF A++TG K +   S     + T   Q 
Sbjct: 64  KKNLKFIENFNKKGNKSYKLGVNEFADWTNEEFLAIHTGLKGLTEVSPSKVVAKTISSQT 123

Query: 131 LSMTD-VPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLL 189
            +++D V  S DWR +GAVTP+K Q +CGCCWAF+AVAAVEG+ KI  GNL+ LSEQQLL
Sbjct: 124 WNVSDMVVESKDWRAEGAVTPVKYQGQCGCCWAFSAVAAVEGVAKIAGGNLVSLSEQQLL 183

Query: 190 DCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEV 249
           DC    + GC GG    AF Y++QN+GIA+E++Y YQ   G C +  +P AA+IS ++ V
Sbjct: 184 DCDREYDRGCDGGIMSDAFNYVVQNRGIASENDYSYQGSDGGCRSNARP-AARISGFQTV 242

Query: 250 PSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGA 309
           PS +E+ALL+AVS QPVS+++ A    F  Y  G+++G CGT  +HAVT VG+GT++DG 
Sbjct: 243 PSNNERALLEAVSRQPVSVSMDATGDGFMHYSGGVYDGPCGTSSNHAVTFVGYGTSQDGT 302

Query: 310 NYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPLA 348
            YWL KNSWG TWG+ GY++I RD    +G+CG+   + YP+A
Sbjct: 303 KYWLAKNSWGETWGEKGYIRIRRDVAWPQGMCGVAQYAFYPVA 345


>gi|356577813|ref|XP_003557017.1| PREDICTED: uncharacterized protein LOC100801364 [Glycine max]
          Length = 890

 Score =  351 bits (901), Expect = 3e-94,   Method: Compositional matrix adjust.
 Identities = 171/324 (52%), Positives = 226/324 (69%), Gaps = 13/324 (4%)

Query: 33  VSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLG 92
           V+ RS  + S+ E HE+WM ++G+ YKD  E+E R +IFKEN+ YIE  N   N+ YKL 
Sbjct: 572 VTCRSLQDASMYERHEQWMTRYGKVYKDPQEREKRFRIFKENVNYIEAFNNAANKRYKLA 631

Query: 93  TNQFSDLTNDEFRALYTGYK--MPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTP 150
            NQF+DLTN+EF A    +K  M S   R+TT   FKY+N+  T VP+++DWR KGAVTP
Sbjct: 632 INQFADLTNEEFIAPRNRFKGHMCSSIIRTTT---FKYENV--TAVPSTVDWRQKGAVTP 686

Query: 151 IKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-NNGCLGGSREKAFA 209
           IK+Q +CGCCWAF+AVAA EGI  + SG LI LSEQ+L+DC T G + GC GG  + AF 
Sbjct: 687 IKDQGQCGCCWAFSAVAATEGIHALTSGKLISLSEQELVDCDTKGVDQGCEGGLMDDAFK 746

Query: 210 YIIQNQGIATEDEYPYQAVPGTCSAAQKP-AAAKISNYEEVPSGDEQALLKAVSMQPVSI 268
           ++IQN G+ TE  YPY+ V G C+A +       I+ YE+VP+ +E+AL KAV+ QPVS+
Sbjct: 747 FVIQNHGLNTEANYPYKGVDGKCNANEAANDVVTITGYEDVPANNEKALQKAVANQPVSV 806

Query: 269 AIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYM 328
           AI A  ++FQ YK G+F G CGT+LDH VT VG+G + DG  YWL+KNSWG  WG+ GY+
Sbjct: 807 AIDASGSDFQFYKSGVFTGSCGTELDHGVTAVGYGVSNDGTEYWLVKNSWGTEWGEEGYI 866

Query: 329 KIVR----DEGLCGIGTRSSYPLA 348
           ++ R    +EGLCGI  ++SYP A
Sbjct: 867 RMQRGVDSEEGLCGIAMQASYPTA 890


>gi|356545063|ref|XP_003540965.1| PREDICTED: thiol protease SEN102-like [Glycine max]
          Length = 361

 Score =  351 bits (900), Expect = 3e-94,   Method: Compositional matrix adjust.
 Identities = 175/356 (49%), Positives = 239/356 (67%), Gaps = 13/356 (3%)

Query: 1   MVLIFERSGSFKINTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKD 60
            +L F  +   K +   + + + L ++  +  V+ RS  + S+ E HE+WM ++G+ YKD
Sbjct: 11  FLLFFASTMVAKNHFCHISLAMLLCMAFLAFQVTCRSLQDASMYERHEQWMTRYGKVYKD 70

Query: 61  ELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYK--MPSPSH 118
             E+E R +IFKEN+ YIE  N   N+ YKL  NQF+DLTN+EF A    +K  M S   
Sbjct: 71  PQEREKRFRIFKENVNYIEAFNNAANKRYKLAINQFADLTNEEFIAPRNRFKGHMCSSII 130

Query: 119 RSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSG 178
           R+TT   FKY+N+  T VP+++DWR KGAVTPIK+Q +CGCCWAF+AVAA EGI  + SG
Sbjct: 131 RTTT---FKYENV--TAVPSTVDWRQKGAVTPIKDQGQCGCCWAFSAVAATEGIHALTSG 185

Query: 179 NLIQLSEQQLLDCSTNG-NNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQK 237
            LI LSEQ+L+DC T G + GC GG  + AF ++IQN G+ TE  YPY+ V G C+A + 
Sbjct: 186 KLISLSEQELVDCDTKGVDQGCEGGLMDDAFKFVIQNHGLNTEANYPYKGVDGKCNANEA 245

Query: 238 P-AAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHA 296
                 I+ YE+VP+ +E+AL KAV+ QPVS+AI A  ++FQ YK G+F G CGT+LDH 
Sbjct: 246 ANDVVTITGYEDVPANNEKALQKAVANQPVSVAIDASGSDFQFYKSGVFTGSCGTELDHG 305

Query: 297 VTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR----DEGLCGIGTRSSYPLA 348
           VT VG+G + DG  YWL+KNSWG  WG+ GY+++ R    +EGLCGI  ++SYP A
Sbjct: 306 VTAVGYGVSNDGTEYWLVKNSWGTEWGEEGYIRMQRGVDSEEGLCGIAMQASYPTA 361


>gi|357452075|ref|XP_003596314.1| Cysteine proteinase [Medicago truncatula]
 gi|355485362|gb|AES66565.1| Cysteine proteinase [Medicago truncatula]
          Length = 341

 Score =  350 bits (899), Expect = 5e-94,   Method: Compositional matrix adjust.
 Identities = 179/340 (52%), Positives = 233/340 (68%), Gaps = 21/340 (6%)

Query: 18  MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
           +F+ + LL    S   +SR+     + E+HE+WM QHG+ YK   EK+ R  IFKEN+ Y
Sbjct: 14  LFLCLGLL----SFQATSRTLQNDPMYEMHEQWMVQHGKVYKAAHEKQKRFGIFKENVNY 69

Query: 78  IEKANKEGNRTYKLGTNQFSDLTNDEF---RALYTGYKMPSPSHRSTTSSTFKYQNLSMT 134
           IE  N  GN++YKLG N F+DLTN EF   R  + GY         +  +TFKY+N+S  
Sbjct: 70  IEAFNNVGNKSYKLGLNHFADLTNHEFIAARNKFNGYL------HGSIITTFKYKNVS-- 121

Query: 135 DVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN 194
           DVP+++DWR +GAVTP+KNQ +CGCCWAF+AVA+ EGI K+ +GNL+ LSEQ+L+DC TN
Sbjct: 122 DVPSAVDWRQEGAVTPVKNQGQCGCCWAFSAVASTEGIHKLTTGNLVSLSEQELVDCDTN 181

Query: 195 G-NNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQ-KPAAAKISNYEEVPSG 252
           G + GC GG  + AF +IIQN G++TE EYPYQ V GTC+  +   +AA IS YE VP  
Sbjct: 182 GEDQGCEGGLMDDAFEFIIQNNGLSTEAEYPYQGVDGTCNKTEVGSSAATISGYENVPVN 241

Query: 253 DEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYW 312
           DEQAL KAV+ QPVS+AI A  ++FQ YK G+F G CGT+LDH V +VG+G  ED   YW
Sbjct: 242 DEQALQKAVANQPVSVAIDASGSDFQFYKSGVFTGSCGTELDHGVAVVGYGVGEDETEYW 301

Query: 313 LIKNSWGNTWGDAGYMKIVR----DEGLCGIGTRSSYPLA 348
           L+KNSWG  WG+ GY+++ R     EGLCGI  + SYP A
Sbjct: 302 LVKNSWGTQWGEEGYIRMQRGVDASEGLCGIAMQPSYPTA 341


>gi|242068363|ref|XP_002449458.1| hypothetical protein SORBIDRAFT_05g013840 [Sorghum bicolor]
 gi|241935301|gb|EES08446.1| hypothetical protein SORBIDRAFT_05g013840 [Sorghum bicolor]
          Length = 350

 Score =  350 bits (897), Expect = 7e-94,   Method: Compositional matrix adjust.
 Identities = 170/342 (49%), Positives = 238/342 (69%), Gaps = 7/342 (2%)

Query: 10  SFKINTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLK 69
           +F      +  ++T++V       S+    E+++   H++WMA+HGR+YKDE EK  R +
Sbjct: 12  TFTAAALMILAVMTMVVEARDLSTSTGGYGEEAMKVRHQQWMAEHGRTYKDEAEKARRFQ 71

Query: 70  IFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQ 129
           +FK N ++++++N  G ++Y+L  N+F+D+TNDEF A+YTG K P P+     +  FKY+
Sbjct: 72  VFKANADFVDRSNAAGGKSYELAINEFADMTNDEFVAMYTGLK-PVPAGPKKMAG-FKYE 129

Query: 130 NLSMTDVPT-SLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQL 188
           NL+++DV   ++DWR KGAVT IKNQ +CGCCWAFAAVAAVE I +I +GNL+ LSEQQ+
Sbjct: 130 NLTLSDVDQQAVDWRQKGAVTGIKNQGQCGCCWAFAAVAAVESIHQITTGNLVSLSEQQV 189

Query: 189 LDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEE 248
           LDC T+GNNGC GG  + AF YII N G+ATED YPY A  GTC ++ +P A  IS+Y++
Sbjct: 190 LDCDTDGNNGCNGGYIDNAFQYIISNGGLATEDAYPYAAAQGTCQSSVQP-AVTISSYQD 248

Query: 249 VPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNG-VCGT-QLDHAVTIVGFGTTE 306
           VPSGDE AL  AV+ QPV++AI A++  FQ Y  G+     CGT  L+HAVT VG+ T E
Sbjct: 249 VPSGDEAALAAAVANQPVAVAIDAHNN-FQFYSSGVLTADTCGTPSLNHAVTAVGYSTAE 307

Query: 307 DGANYWLIKNSWGNTWGDAGYMKIVRDEGLCGIGTRSSYPLA 348
           DG  YWL+KN WG  WG+ GY+++ R    CG+  ++SYP+A
Sbjct: 308 DGTPYWLLKNQWGQNWGEGGYLRVERGTNACGVAQQASYPVA 349


>gi|255568299|ref|XP_002525124.1| cysteine protease, putative [Ricinus communis]
 gi|223535583|gb|EEF37251.1| cysteine protease, putative [Ricinus communis]
          Length = 342

 Score =  350 bits (897), Expect = 7e-94,   Method: Compositional matrix adjust.
 Identities = 171/338 (50%), Positives = 229/338 (67%), Gaps = 12/338 (3%)

Query: 19  FIIITLLVSCA--SQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLE 76
           F++I L    A  +   S+R  HE ++VE HEKWMA+HG+ YKD+ EK  R +IFK N+E
Sbjct: 9   FLLIALFFVLAMWADQASTRELHESTMVERHEKWMAKHGKVYKDDEEKLRRFQIFKNNVE 68

Query: 77  YIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDV 136
           +IE +N  GN +Y LG N+F+DLTN+EFRA + GYK P  + R  T   FKY+N+  T +
Sbjct: 69  FIESSNAAGNNSYMLGINRFADLTNEEFRASWNGYKRPLDASRIVT--PFKYENV--TAL 124

Query: 137 PTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG- 195
           P S+DWR KGAVT IK+Q+ECG CWAF+AVAA EG+ K+R+G L+ LSEQ+L+DC   G 
Sbjct: 125 PYSMDWRRKGAVTSIKDQRECGSCWAFSAVAATEGVHKLRTGKLVSLSEQELVDCDVKGE 184

Query: 196 NNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDE 254
           + GC GG  E AF +I +N GI TE  Y Y+   G C   ++ +  AKI+ Y+ VP   E
Sbjct: 185 DKGCQGGLMEDAFKFIKRNGGITTEANYAYRGRDGKCDTKKEASHVAKITGYQVVPENSE 244

Query: 255 QALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLI 314
            ALLKAV+ QPVS++I A S  FQ Y+ GI+ G CG+ L+H V  VG+GT+  G+ YW++
Sbjct: 245 AALLKAVAHQPVSVSIDAGSMSFQFYQSGIYAGSCGSDLNHGVAAVGYGTSSSGSKYWIV 304

Query: 315 KNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPLA 348
           KNSWG  WG+ GY+++ RD    +GLCGI    SYP A
Sbjct: 305 KNSWGPEWGERGYVRMKRDITSRKGLCGIAMDCSYPTA 342


>gi|5823018|gb|AAD53011.1|AF089848_1 senescence-specific cysteine protease [Brassica napus]
          Length = 346

 Score =  350 bits (897), Expect = 8e-94,   Method: Compositional matrix adjust.
 Identities = 171/338 (50%), Positives = 226/338 (66%), Gaps = 10/338 (2%)

Query: 18  MFIIITLLVS-CASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLE 76
           +F+I++L+ S C S  +S     E  + + H++WMA+HGR+Y D  EK  R  +FK N+E
Sbjct: 8   IFLIVSLVSSFCFSTTLSRLLDDELIMQKKHDEWMAEHGRTYADMNEKNNRYVVFKRNVE 67

Query: 77  YIEKANK-EGNRTYKLGTNQFSDLTNDEFRALYTGYK--MPSPSHRSTTSSTFKYQNLSM 133
            IE+ N     RT+KL  NQF+DLTNDEFR +YTGYK      S   T S++F+YQN+  
Sbjct: 68  RIERLNNVPAGRTFKLAVNQFADLTNDEFRFMYTGYKGDFVLFSQSQTKSTSFRYQNVFF 127

Query: 134 TDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCST 193
             +P ++DWR KGAVTPIKNQ  CGCCWAF+AVAA+EG T+I+ G LI LSEQQL+DC T
Sbjct: 128 GALPIAVDWRKKGAVTPIKNQGSCGCCWAFSAVAAIEGATQIKKGKLISLSEQQLVDCDT 187

Query: 194 NGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCS-AAQKPAAAKISNYEEVPSG 252
           N + GC GG  + AF +I+   G+ TE  YPY+     C   + KP+AA I+ YE+VP  
Sbjct: 188 N-DFGCSGGLMDTAFEHIMATGGLTTESNYPYKGEDANCKIKSTKPSAASITGYEDVPVN 246

Query: 253 DEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYW 312
           DE AL+KAV+ QPVS+ I     +FQ Y  G+F G C T LDHAVT VG+  +  G+ YW
Sbjct: 247 DENALMKAVAHQPVSVGIEGGGFDFQFYSSGVFTGECTTYLDHAVTAVGYSQSSAGSKYW 306

Query: 313 LIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYP 346
           +IKNSWG  WG+ GYM+I +D    EGLCG+  ++SYP
Sbjct: 307 IIKNSWGTKWGEGGYMRIKKDIKDKEGLCGLAMKASYP 344


>gi|356515086|ref|XP_003526232.1| PREDICTED: vignain-like [Glycine max]
          Length = 343

 Score =  349 bits (896), Expect = 9e-94,   Method: Compositional matrix adjust.
 Identities = 175/339 (51%), Positives = 236/339 (69%), Gaps = 17/339 (5%)

Query: 18  MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
           M + +T L   A QV + R+  + S+ E HE+WM ++G+ YKD  E+E R ++FKEN+ Y
Sbjct: 14  MLLCMTFL---AFQV-TCRTLQDASMYERHEQWMTRYGKVYKDPQEREKRFRVFKENVNY 69

Query: 78  IEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYK--MPSPSHRSTTSSTFKYQNLSMTD 135
           IE  N   N++YKLG NQF+DLTN EF A   G+K  M S   R+TT   FK++N++ T 
Sbjct: 70  IEAFNNAANKSYKLGINQFADLTNKEFIAPRNGFKGHMCSSIIRTTT---FKFENVTAT- 125

Query: 136 VPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG 195
            P+++DWR KGAVTPIK+Q +CGCCWAF+AVAA EGI  + +G LI LSEQ+L+DC T G
Sbjct: 126 -PSTVDWRQKGAVTPIKDQGQCGCCWAFSAVAATEGIHALSAGKLISLSEQELVDCDTKG 184

Query: 196 -NNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAK-ISNYEEVPSGD 253
            + GC GG  + AF +IIQN G+ TE  YPY+ V G C+A +    A  I+ YE+VP+ +
Sbjct: 185 VDQGCEGGLMDDAFKFIIQNHGLNTEANYPYKGVDGKCNANEAAKNAATITGYEDVPANN 244

Query: 254 EQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWL 313
           E AL KAV+ QPVS+AI A  ++FQ YK G+F G CGT+LDH VT VG+G ++DG  YWL
Sbjct: 245 EMALQKAVANQPVSVAIDASGSDFQFYKSGVFTGSCGTELDHGVTAVGYGVSDDGTEYWL 304

Query: 314 IKNSWGNTWGDAGYMKIVR----DEGLCGIGTRSSYPLA 348
           +KNSWG  WG+ GY+++ R    +EGLCGI  ++SYP A
Sbjct: 305 VKNSWGTEWGEEGYIRMQRGVDSEEGLCGIAMQASYPTA 343


>gi|357167190|ref|XP_003581045.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like
           [Brachypodium distachyon]
          Length = 415

 Score =  349 bits (896), Expect = 9e-94,   Method: Compositional matrix adjust.
 Identities = 166/337 (49%), Positives = 230/337 (68%), Gaps = 10/337 (2%)

Query: 19  FIIITLLVSCASQVVSSRS-THEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
           F+I  L  +CA   +++R  T + S+V  HE+WMA++GR Y D  EK  RL++FK N+ +
Sbjct: 82  FLIAILACTCAVSALAARDLTDDLSMVARHEQWMAKYGRVYNDVAEKAQRLEVFKANVAF 141

Query: 78  IEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVP 137
           IE  N  GN  + L  NQF+D+T DEFRA +TGYK P P+++  T+  FKY N+S+  +P
Sbjct: 142 IELVNA-GNDKFSLEANQFADMTVDEFRAAHTGYK-PVPANKGRTTQ-FKYANVSLDALP 198

Query: 138 TSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-N 196
            S+DWR KGAVTPIK+Q +CGCCWAF+ VA+VEGI K+ +G LI LSEQ+L+DC  +G +
Sbjct: 199 ASMDWRAKGAVTPIKDQGQCGCCWAFSTVASVEGIVKLSTGKLISLSEQELVDCDVDGMD 258

Query: 197 NGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKP-AAAKISNYEEVPSGDEQ 255
            GC GG  + AF +II N G+ TE  YPY     +C++ ++    A I  YE+VPS DE 
Sbjct: 259 QGCEGGLMDNAFEFIIDNGGLTTEGNYPYTGTDDSCNSNKESNDVASIKGYEDVPSNDET 318

Query: 256 ALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIK 315
           +LLKAV+ QPVSIA+      F+ YK G+ +G CGT+LDH +  VG+G T DG  +WL+K
Sbjct: 319 SLLKAVAAQPVSIAVDGGDNLFRFYKGGVLSGACGTELDHGIAAVGYGITSDGTKFWLMK 378

Query: 316 NSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPLA 348
           NSWG +WG+ G++++ RD    EGLCG+  + SYP A
Sbjct: 379 NSWGTSWGEKGFIRMERDIADEEGLCGLAMQPSYPTA 415


>gi|225446585|ref|XP_002280215.1| PREDICTED: vignain [Vitis vinifera]
          Length = 341

 Score =  349 bits (896), Expect = 9e-94,   Method: Compositional matrix adjust.
 Identities = 175/336 (52%), Positives = 239/336 (71%), Gaps = 16/336 (4%)

Query: 21  IITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEK 80
           ++  L + ASQ  ++R+  E S+ E HE WMAQ+GR YKD  EK  R KIFK+N+  IE 
Sbjct: 14  LLFFLAAWASQA-TARNLLEASMYERHEDWMAQYGRVYKDADEKSKRYKIFKDNVARIES 72

Query: 81  ANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHR-STTSSTFKYQNLSMTDVPTS 139
            NK  +++YKL  N+F+DLTN+EFRA    +K    +H  ST +++FKY++++   VP++
Sbjct: 73  FNKAMDKSYKLSINEFADLTNEEFRASRNRFK----AHICSTEATSFKYEHVAA--VPST 126

Query: 140 LDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-NNG 198
           +DWR KGAVTPIK+Q +CG CWAF+AVAA+EGIT++ +G LI LSEQ+L+DC T+G + G
Sbjct: 127 VDWRKKGAVTPIKDQGQCGSCWAFSAVAAMEGITQLSTGKLISLSEQELVDCDTSGEDQG 186

Query: 199 CLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCS--AAQKPAAAKISNYEEVPSGDEQA 256
           C GG  + AF +I QN G+ATE  YPY    GTC+   A  PAA KI+ YE+VP+ +E+A
Sbjct: 187 CNGGLMDDAFKFIEQNHGLATEANYPYAGTDGTCNRKKAAHPAA-KINGYEDVPANNEKA 245

Query: 257 LLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKN 316
           L KAV+ QP+++AI A   EFQ Y  G+F G CGT+LDH V  VG+GT++DG  YWL+KN
Sbjct: 246 LQKAVAHQPIAVAIDAGGFEFQFYSSGVFTGQCGTELDHGVAAVGYGTSDDGMKYWLVKN 305

Query: 317 SWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPLA 348
           SWG  WG+ GY+++ RD    EGLCGI  ++SYP A
Sbjct: 306 SWGTGWGEVGYIRMQRDVTAKEGLCGIAMQASYPTA 341


>gi|255568297|ref|XP_002525123.1| cysteine protease, putative [Ricinus communis]
 gi|223535582|gb|EEF37250.1| cysteine protease, putative [Ricinus communis]
          Length = 349

 Score =  349 bits (895), Expect = 1e-93,   Method: Compositional matrix adjust.
 Identities = 170/343 (49%), Positives = 227/343 (66%), Gaps = 14/343 (4%)

Query: 12  KINTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIF 71
           KI    +F ++ +   CA Q  +SR  HE  +   HEKWMA+HG+ YKD+ EK  R +IF
Sbjct: 8   KILPIALFFVLAM---CADQA-ASRELHELEMTGRHEKWMAKHGKVYKDDKEKLRRFQIF 63

Query: 72  KENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNL 131
           K N+ +IE  N  GN++Y LG N+F+DLTN+EFRA + GYK P  + R  T   FKY+N+
Sbjct: 64  KSNVVFIESFNTAGNKSYMLGINKFADLTNEEFRAFWNGYKRPLGASRKIT--PFKYENV 121

Query: 132 SMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDC 191
             T +P+S+DWR KGAVTPIK+Q  CG CWAF+AVAA EGI K+R+G L+ LSEQ+L+DC
Sbjct: 122 --TALPSSIDWRSKGAVTPIKDQGVCGSCWAFSAVAATEGIHKLRTGKLVSLSEQELVDC 179

Query: 192 STNG-NNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPA-AAKISNYEEV 249
              G + GC GG    AF +I ++ G+ +E  YPYQ   G C   ++ + A KI+ Y+ V
Sbjct: 180 DVKGQDKGCQGGLMVDAFKFIKRHGGMTSEANYPYQGRDGKCDTKKEASRAVKITGYQAV 239

Query: 250 PSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGA 309
           P   E ALLKAV+ QPVS+AI A S  FQ Y+ GIF G+CG  ++H V  VG+G +  G+
Sbjct: 240 PKNSEAALLKAVANQPVSVAIDAGSLSFQFYRSGIFTGICGKDINHGVAAVGYGRSNSGS 299

Query: 310 NYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPLA 348
            YW++KNSWG  WG+ GY+++ RD    EGLCGI    SYP A
Sbjct: 300 KYWIVKNSWGTEWGEKGYIRMKRDVRSKEGLCGIAMECSYPTA 342


>gi|356539398|ref|XP_003538185.1| PREDICTED: vignain-like [Glycine max]
          Length = 343

 Score =  349 bits (895), Expect = 1e-93,   Method: Compositional matrix adjust.
 Identities = 171/323 (52%), Positives = 224/323 (69%), Gaps = 14/323 (4%)

Query: 34  SSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGT 93
           ++R+  +  + E HE+WMA HG+ Y    EKE + + FKEN++ IE  N  GN+ YKLG 
Sbjct: 27  NARTLEDAPMRERHEQWMAIHGKVYTHSYEKEQKYQTFKENVQRIEAFNHAGNKPYKLGI 86

Query: 94  NQFSDLTNDEFRAL--YTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPI 151
           N F+DLTN+EF+A+  + G+       + T + TF+Y+N  MT VP +LDWR +GAVTPI
Sbjct: 87  NHFADLTNEEFKAINRFKGH----VCSKITRTPTFRYEN--MTAVPATLDWRQEGAVTPI 140

Query: 152 KNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-NNGCLGGSREKAFAY 210
           K+Q +CGCCWAF+AVAA EGITK+ +G LI LSEQ+L+DC T G + GC GG  + AF +
Sbjct: 141 KDQGQCGCCWAFSAVAATEGITKLSTGKLISLSEQELVDCDTKGVDQGCEGGLMDDAFKF 200

Query: 211 IIQNQGIATEDEYPYQAVPGTCSA-AQKPAAAKISNYEEVPSGDEQALLKAVSMQPVSIA 269
           I+QN+G+A E  YPY+ V GTC+A A+   A  I  YE+VP+  E ALLKAV+ QPVS+A
Sbjct: 201 ILQNKGLAAEAIYPYEGVDGTCNAKAEGNHATSIKGYEDVPANSESALLKAVANQPVSVA 260

Query: 270 IAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMK 329
           I A   EFQ Y  G+F G CGT LDH VT VG+G ++DG  YWL+KNSWG  WGD GY++
Sbjct: 261 IEASGFEFQFYSGGVFTGSCGTNLDHGVTAVGYGVSDDGTKYWLVKNSWGVKWGDKGYIR 320

Query: 330 IVRD----EGLCGIGTRSSYPLA 348
           + RD    EGLCGI   +SYP A
Sbjct: 321 MQRDVAAKEGLCGIAMLASYPNA 343


>gi|359485281|ref|XP_002280230.2| PREDICTED: LOW QUALITY PROTEIN: KDEL-tailed cysteine endopeptidase
           CEP1 [Vitis vinifera]
          Length = 341

 Score =  348 bits (894), Expect = 2e-93,   Method: Compositional matrix adjust.
 Identities = 175/336 (52%), Positives = 237/336 (70%), Gaps = 16/336 (4%)

Query: 21  IITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEK 80
           ++ +L + ASQ  ++R+ HE S+ E HE WMAQ+GR YKD  EK  R KIFK+N+  IE 
Sbjct: 14  LLFVLAAWASQA-TARNLHEASMYERHEDWMAQYGRVYKDADEKSKRYKIFKDNVARIES 72

Query: 81  ANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHR-STTSSTFKYQNLSMTDVPTS 139
            NK  +++YKL  N+F+DLTN+EF      +K    +H  ST +++FKY+N+  T VP++
Sbjct: 73  FNKAMDKSYKLSINEFADLTNEEFGTSRNRFK----AHICSTEATSFKYENV--TAVPST 126

Query: 140 LDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-NNG 198
           +DWR KGAVTPIK+Q +CG CWAF+AVAA+EGIT++ +G LI LSEQ+L+DC T+G + G
Sbjct: 127 IDWRKKGAVTPIKDQGQCGSCWAFSAVAAMEGITQLSTGKLISLSEQELVDCDTSGEDQG 186

Query: 199 CLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCS--AAQKPAAAKISNYEEVPSGDEQA 256
           C GG  + AF +I QN G+ TE  YPY    GTC+   A  P AAKI+ YE+VP+ +E+A
Sbjct: 187 CNGGLMDDAFKFIKQNHGLTTEANYPYAGTDGTCNRKKAAHP-AAKINGYEDVPANNEKA 245

Query: 257 LLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKN 316
           L KAV  QP+++AI A   EFQ Y  G+F G CGT+LDH V  VG+GT++DG  YWL+KN
Sbjct: 246 LQKAVVHQPIAVAIDAGGFEFQFYSSGVFTGQCGTELDHGVAAVGYGTSDDGMKYWLVKN 305

Query: 317 SWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPLA 348
           SWG  WG+ GY+++ RD    EGLCGI  ++SYP A
Sbjct: 306 SWGTGWGEEGYIRMQRDVTAKEGLCGIAMQASYPTA 341


>gi|319826926|gb|ADV74756.1| cysteine protease [Lactuca sativa]
          Length = 363

 Score =  348 bits (893), Expect = 2e-93,   Method: Compositional matrix adjust.
 Identities = 165/321 (51%), Positives = 222/321 (69%), Gaps = 8/321 (2%)

Query: 33  VSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLG 92
            +SR+ ++ +++  HE+WMA HGR Y DE EK++R +IFK N+ YI+  N   +++Y L 
Sbjct: 41  ATSRTLNDPTMIARHEQWMAHHGRIYTDENEKQLRFQIFKNNVAYIDAHNARSDQSYTLE 100

Query: 93  TNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIK 152
            N+F+DLTNDEFRA   GYK    S     S  F+Y N+S   VP  +DWR +GAVTP+K
Sbjct: 101 VNKFADLTNDEFRASRNGYKKQPDSDSHVVSGLFRYANVSA--VPDEVDWRKEGAVTPVK 158

Query: 153 NQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-NNGCLGGSREKAFAYI 211
           +Q +CGCCWAF+AVAA+EGI K+ +G L+ LSEQ+L+DC  +G + GC GG  E AF +I
Sbjct: 159 DQGDCGCCWAFSAVAAMEGINKLENGKLVSLSEQELVDCDIDGIDQGCEGGLMENAFQFI 218

Query: 212 IQNQGIATEDEYPYQAVPGTCSAAQKP-AAAKISNYEEVPSGDEQALLKAVSMQPVSIAI 270
            + +G+A E  YPY    G C+  +    AAKIS +E+VP+ +E+ALL+AV+ QPVSIAI
Sbjct: 219 EKRKGLAAESVYPYTGEDGICNTKKAAIPAAKISGHEKVPANNEKALLQAVANQPVSIAI 278

Query: 271 AAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKI 330
            A   EFQ Y  G+F G CGT+LDHA+T VG+G T DG  YWL+KNSWG +WG+ GY++I
Sbjct: 279 DASGYEFQFYSGGVFTGSCGTELDHAITAVGYGATMDGTKYWLMKNSWGASWGENGYIRI 338

Query: 331 VRD----EGLCGIGTRSSYPL 347
            RD    EGLCGI    SYP+
Sbjct: 339 KRDSLAKEGLCGIAMDPSYPV 359


>gi|18401420|ref|NP_565649.1| cysteine proteinase-like protein [Arabidopsis thaliana]
 gi|4314384|gb|AAD15594.1| cysteine proteinase [Arabidopsis thaliana]
 gi|17381154|gb|AAL36389.1| putative cysteine proteinase [Arabidopsis thaliana]
 gi|20465849|gb|AAM20029.1| putative cysteine proteinase [Arabidopsis thaliana]
 gi|330252901|gb|AEC07995.1| cysteine proteinase-like protein [Arabidopsis thaliana]
          Length = 348

 Score =  348 bits (893), Expect = 2e-93,   Method: Compositional matrix adjust.
 Identities = 179/348 (51%), Positives = 231/348 (66%), Gaps = 17/348 (4%)

Query: 16  TPMFIIITLLVSCASQVVSSR-STHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKEN 74
           + +  I+T+ +S  + + +SR S  E S +E HE+WMA+  R Y DE EK  R  IFK+N
Sbjct: 3   STIIFILTIFLSYRTSLATSRGSLFEASAIEKHEQWMARFNRVYSDETEKRNRFNIFKKN 62

Query: 75  LEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSST------FKY 128
           LE+++  N     TYK+  N+FSDLT++EFRA +TG  +P    R +T S+      F+Y
Sbjct: 63  LEFVQNFNMNNKITYKVDINEFSDLTDEEFRATHTGLVVPEAITRISTLSSGKNTVPFRY 122

Query: 129 QNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQL 188
            N+S  D   S+DWR +GAVTP+K Q  CG CWAF+AVAAVEGITKI  G L+ LSEQQL
Sbjct: 123 GNVS--DNGESMDWRQEGAVTPVKYQGRCGGCWAFSAVAAVEGITKITKGELVSLSEQQL 180

Query: 189 LDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPA----AAKIS 244
           LDC  + N GC GG   KAF YII+NQGI TED YPYQ    TCS++   +    AA IS
Sbjct: 181 LDCDRDYNQGCRGGIMSKAFEYIIKNQGITTEDNYPYQESQQTCSSSTTLSSSFRAATIS 240

Query: 245 NYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGT 304
            YE VP  +E+ALL+AVS QPVS+ I      F+ Y  G+FNG CGT L HAVTIVG+G 
Sbjct: 241 GYETVPMNNEEALLQAVSQQPVSVGIEGTGAAFRHYSGGVFNGECGTDLHHAVTIVGYGM 300

Query: 305 TEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPLA 348
           +E+G  YW++KNSWG TWG+ GYM+I RD    +G+CG+   + YPLA
Sbjct: 301 SEEGTKYWVVKNSWGETWGENGYMRIKRDVDAPQGMCGLAILAFYPLA 348


>gi|356543038|ref|XP_003539970.1| PREDICTED: vignain-like [Glycine max]
          Length = 343

 Score =  347 bits (890), Expect = 4e-93,   Method: Compositional matrix adjust.
 Identities = 175/324 (54%), Positives = 226/324 (69%), Gaps = 13/324 (4%)

Query: 33  VSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLG 92
           V+SR+  + S+ E HE+WMA++ + YKD  E+E R KIFKEN+ YIE  N   N+ YKLG
Sbjct: 25  VTSRTLQDASMYERHEEWMARYAKVYKDPEEREKRFKIFKENVNYIEAFNNAANKPYKLG 84

Query: 93  TNQFSDLTNDEFRALYTGYK--MPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTP 150
            NQF+DLTN+EF A    +K  M S   R+TT   FKY+N+  T +P+++DWR KGAVTP
Sbjct: 85  INQFADLTNEEFIAPRNRFKGHMCSSITRTTT---FKYENV--TALPSTVDWRQKGAVTP 139

Query: 151 IKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-NNGCLGGSREKAFA 209
           IK+Q +CGCCWAF+AVAA EGI  + SG LI LSEQ+++DC T G + GC GG  + AF 
Sbjct: 140 IKDQGQCGCCWAFSAVAATEGIHALNSGKLISLSEQEVVDCDTKGEDQGCAGGFMDGAFK 199

Query: 210 YIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAK-ISNYEEVPSGDEQALLKAVSMQPVSI 268
           +IIQN G+ TE  YPY+AV G C+A +    A  I+ YE+VP  +E+AL KAV+ QPVS+
Sbjct: 200 FIIQNHGLNTEANYPYKAVDGKCNANEAANHAATITGYEDVPVNNEKALQKAVANQPVSV 259

Query: 269 AIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYM 328
           AI A  ++FQ YK G+F G CGTQLDH VT VG+G + DG  YWL+KNSWG  WG+ GY+
Sbjct: 260 AIDASGSDFQFYKTGVFTGSCGTQLDHGVTAVGYGVSADGTQYWLVKNSWGTEWGEEGYI 319

Query: 329 KIVR----DEGLCGIGTRSSYPLA 348
            + R     EGLCGI   +SYP A
Sbjct: 320 MMQRGVKAQEGLCGIAMMASYPTA 343


>gi|102140014|gb|ABF70145.1| cysteine protease, putative [Musa acuminata]
          Length = 373

 Score =  347 bits (890), Expect = 4e-93,   Method: Compositional matrix adjust.
 Identities = 172/316 (54%), Positives = 222/316 (70%), Gaps = 11/316 (3%)

Query: 40  EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDL 99
           + S+ E H +WMA+HGR+YKD  EKE RL IFK N+EYIE  N  G R Y+L  NQF+DL
Sbjct: 28  DASMAERHVEWMARHGRTYKDAAEKEQRLGIFKSNVEYIESFNA-GKRKYQLAANQFADL 86

Query: 100 TNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGC 159
           T++EF+A++TG+K PS +      + F++ +LS   VP S+DWR KGAVTP+K+Q  CG 
Sbjct: 87  THEEFKAMHTGFK-PSGTGAKKAGNGFRHGSLS--SVPDSVDWRSKGAVTPVKDQGLCGS 143

Query: 160 CWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-NNGCLGGSREKAFAYIIQNQGIA 218
           CWAF  VAAVEGITKI +G LI LSEQQL+DC  +G + GC GG  + AF +I+ N GI 
Sbjct: 144 CWAFTVVAAVEGITKIVTGKLISLSEQQLVDCDVHGKDQGCQGGDMDAAFEFIVNNGGIT 203

Query: 219 TEDEYPYQAVPGTCSAAQKP-AAAKISNYEEVPSGDEQALLKAVSMQPVSIAI-AAYSTE 276
           +E  YPY+ V   C+A       A I ++E+VP+ DE+AL KAV+ QPVS+ I A  S +
Sbjct: 204 SEANYPYEEVQRLCNAHNASFVVATIESHEDVPTNDEKALRKAVANQPVSVGIDAGSSLD 263

Query: 277 FQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD--- 333
           FQ Y  G+F+G CGT LDHAVT+VG+GTT DG  YWL KNSWG TWG+ GY+++ RD   
Sbjct: 264 FQLYSGGVFSGECGTDLDHAVTVVGYGTTSDGTKYWLAKNSWGETWGENGYIRMERDVAA 323

Query: 334 -EGLCGIGTRSSYPLA 348
            EGLCGI  ++SYP A
Sbjct: 324 KEGLCGIAMQASYPTA 339


>gi|400180377|gb|AFP73327.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score =  347 bits (890), Expect = 5e-93,   Method: Compositional matrix adjust.
 Identities = 173/345 (50%), Positives = 235/345 (68%), Gaps = 13/345 (3%)

Query: 10  SFKINTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLK 69
           + K++   + I +  ++S  +     RS  E SV E HE WM++HGR YKDE+EK  R  
Sbjct: 2   AMKVDLMNILITLFFVISMFNTQTRGRSQPELSVSERHELWMSRHGRVYKDEVEKGERFM 61

Query: 70  IFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMP----SPSHRSTTSST 125
           IFKEN+++IE  NK GN +YKLG N+F+D+T+ EF A +TG  +P    SPS  S+T   
Sbjct: 62  IFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTE-- 119

Query: 126 FKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSE 185
           FK  +LS  D+P++LDWR+ GAVT +K+Q  CGCCWAF+AV ++EG  KI +GNL++ SE
Sbjct: 120 FKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSE 179

Query: 186 QQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISN 245
           Q+LLDC+TN N GC GG    AF +II+N GI+ E +Y YQ    TC + +K AA +IS+
Sbjct: 180 QELLDCTTN-NYGCNGGFMTNAFDFIIENGGISRESDYEYQGEQYTCRSQEKTAAVQISS 238

Query: 246 YEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTT 305
           Y+ VP G E +LL+AV+ QPVSI IAA S + Q Y  G ++G C  +++HAVT +G+GT 
Sbjct: 239 YQVVPEG-ETSLLQAVTKQPVSIGIAA-SQDLQFYAGGTYDGSCADRINHAVTAIGYGTD 296

Query: 306 EDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYP 346
           E G  YWL+KNSWG +WG+ G+MKI+RD     GLC I   SSYP
Sbjct: 297 EKGQKYWLLKNSWGTSWGENGFMKIIRDSGNPSGLCDIAKMSSYP 341


>gi|297826061|ref|XP_002880913.1| hypothetical protein ARALYDRAFT_481640 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297326752|gb|EFH57172.1| hypothetical protein ARALYDRAFT_481640 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 347

 Score =  347 bits (890), Expect = 5e-93,   Method: Compositional matrix adjust.
 Identities = 180/348 (51%), Positives = 232/348 (66%), Gaps = 16/348 (4%)

Query: 15  TTPMFIIITLLVSCASQVVSSRS-THEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKE 73
           ++ +  I+T+ +S  + + +SR    E S +E HE+WMA+  R Y DE EK  R  IFK+
Sbjct: 2   SSTIIFILTIFLSYRTSLATSRGGLFEASPIEKHEQWMARFNRVYSDESEKRNRFNIFKK 61

Query: 74  NLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSST-----FKY 128
           NLE+++  N   N TYKL  N+FSDLT++EFRA +TG  +P      +T S+     F+Y
Sbjct: 62  NLEFVQSFNMNKNITYKLDVNEFSDLTDEEFRATHTGLVVPEEITGISTLSSDKTVPFRY 121

Query: 129 QNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQL 188
            N+S  D   S+DWR +GAVTP+K Q  CG CWAF+AVAAVEGITKI  G L+ LSEQQL
Sbjct: 122 GNVS--DTGESMDWRQEGAVTPVKYQGRCGGCWAFSAVAAVEGITKITKGELVSLSEQQL 179

Query: 189 LDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPA----AAKIS 244
           LDC T+ N GC GG   KAF YII+NQGI TED YPYQ    TCS++   +    AA IS
Sbjct: 180 LDCDTDYNQGCHGGIMSKAFEYIIKNQGITTEDNYPYQESQQTCSSSTTLSSSFRAATIS 239

Query: 245 NYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGT 304
            YE VP  +E+ALL+AVS QPVS+ I      F+ Y  GIFNG CGT L HAVTIVG+G 
Sbjct: 240 GYETVPMNNEEALLQAVSQQPVSVGIEGTGAGFRHYSGGIFNGECGTDLHHAVTIVGYGM 299

Query: 305 TEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPLA 348
           +E+G  YW++KNSWG TWG+ G+M+I RD    +G+CG+   + YPLA
Sbjct: 300 SEEGTKYWVVKNSWGETWGEDGFMRIKRDVDAPQGMCGLAMLAFYPLA 347


>gi|400180419|gb|AFP73348.1| cysteine protease [Solanum lycopersicoides]
          Length = 343

 Score =  346 bits (887), Expect = 9e-93,   Method: Compositional matrix adjust.
 Identities = 171/342 (50%), Positives = 236/342 (69%), Gaps = 8/342 (2%)

Query: 10  SFKINTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLK 69
           + KI+   + I +  ++S  +   ++RS  + SV E HE WM++HGR YKDE+EK  R  
Sbjct: 2   AMKIDLMSILITLFFVISMFNSQTTARSQPKLSVSERHELWMSRHGRVYKDEVEKGERFM 61

Query: 70  IFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSST-FKY 128
           IFKEN+++IE  NK GN +YKLG N+F+D+T++EF   +TG  +PS    S  SST FK 
Sbjct: 62  IFKENMKFIESVNKAGNLSYKLGINEFADITSEEFLTKFTGINIPSYLSPSPMSSTEFKI 121

Query: 129 QNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQL 188
            +LS  D+P++LDWR+ GAVT +KNQ +CGCCWAF+AV ++EG  KI +GNL++ SEQ+L
Sbjct: 122 NDLSDDDMPSNLDWRESGAVTQVKNQGQCGCCWAFSAVGSLEGAYKIATGNLMEFSEQEL 181

Query: 189 LDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEE 248
           LDC+TN N GC GG    AF +I +N GI++E +Y YQ    TC + +K AA +IS+Y+ 
Sbjct: 182 LDCTTN-NYGCNGGFMTNAFDFIKENGGISSESDYEYQGQQYTCRSQEKTAAVQISSYQV 240

Query: 249 VPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDG 308
           VP G E +LL+AV+ QPVSI IAA S + Q Y  G ++G C  +++HAVT +G+GT E G
Sbjct: 241 VPEG-ETSLLQAVTKQPVSIGIAA-SQDLQFYAGGTYDGSCADRINHAVTAIGYGTDEKG 298

Query: 309 ANYWLIKNSWGNTWGDAGYMKIVRDE----GLCGIGTRSSYP 346
             YWL+KNSWG +WG+ G+MKI+RD     G C I   SSYP
Sbjct: 299 QKYWLLKNSWGTSWGENGFMKIIRDSGNPGGHCDIAKMSSYP 340


>gi|224099295|ref|XP_002334495.1| predicted protein [Populus trichocarpa]
 gi|222872550|gb|EEF09681.1| predicted protein [Populus trichocarpa]
          Length = 342

 Score =  346 bits (887), Expect = 1e-92,   Method: Compositional matrix adjust.
 Identities = 168/336 (50%), Positives = 220/336 (65%), Gaps = 9/336 (2%)

Query: 19  FIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYI 78
           F    L++   +  V+SR   E  +   HE+WMA +G+ Y D  EKE R KIFK N+EYI
Sbjct: 10  FFAFILILGMWAFEVASRELQESYMSARHEQWMATYGKVYVDAAEKERRFKIFKNNVEYI 69

Query: 79  EKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPT 138
           E  N  GN+ YKL  N+F+D TN++F+    GY+ P  + R    ++FKY+N+  T VP 
Sbjct: 70  ESFNTAGNKPYKLSVNKFADQTNEKFKGARNGYRRPFQT-RPMKVTSFKYENV--TAVPA 126

Query: 139 SLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-NN 197
           ++DWR KGAVTPIK+Q +CG CWAF+ VAA EGI ++ +G L+ LSEQ+L+DC   G + 
Sbjct: 127 TMDWRKKGAVTPIKDQGQCGSCWAFSTVAATEGINQLTTGKLVSLSEQELVDCDNQGEDQ 186

Query: 198 GCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTC-SAAQKPAAAKISNYEEVPSGDEQA 256
           GC GG  E  F +II+N GI TE  YPYQA  GTC S  Q    AKI+ YE VP+  E  
Sbjct: 187 GCEGGLMEDGFEFIIKNHGITTEANYPYQAADGTCNSKKQASHIAKITGYESVPANSEAE 246

Query: 257 LLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKN 316
           LLK V+ QP+S++I A  ++FQ Y  G+F G CGT+LDH VT VG+G T DG  YWL+KN
Sbjct: 247 LLKVVANQPISVSIDAGGSDFQFYSSGVFTGKCGTELDHGVTAVGYGETSDGTKYWLVKN 306

Query: 317 SWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPLA 348
           SW  +WG+ GY+++ RD    EGLCGI   SSYP A
Sbjct: 307 SWXTSWGEEGYIRMQRDIDAEEGLCGIAMDSSYPTA 342


>gi|356543076|ref|XP_003539989.1| PREDICTED: vignain-like [Glycine max]
          Length = 343

 Score =  346 bits (887), Expect = 1e-92,   Method: Compositional matrix adjust.
 Identities = 174/324 (53%), Positives = 226/324 (69%), Gaps = 13/324 (4%)

Query: 33  VSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLG 92
           V+SR+  + S+ E HE+WMA++ + YKD  E+E R KIFKEN+ YIE  N   ++ YKLG
Sbjct: 25  VTSRTLQDASMYERHEEWMARYAKVYKDPEEREKRFKIFKENVNYIEAFNNAADKPYKLG 84

Query: 93  TNQFSDLTNDEFRALYTGYK--MPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTP 150
            NQF+DLTN+EF A    +K  M S   R+TT   FKY+N+  T +P+++DWR KGAVTP
Sbjct: 85  INQFADLTNEEFIAPRNKFKGHMCSSITRTTT---FKYENV--TALPSTVDWRQKGAVTP 139

Query: 151 IKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-NNGCLGGSREKAFA 209
           IK+Q +CGCCWAF+AVAA EGI  + SG LI LSEQ+++DC T G + GC GG  + AF 
Sbjct: 140 IKDQGQCGCCWAFSAVAATEGIHALNSGKLISLSEQEVVDCDTKGEDQGCAGGFMDGAFK 199

Query: 210 YIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAK-ISNYEEVPSGDEQALLKAVSMQPVSI 268
           +IIQN G+ TE  YPY+AV G C+A +    A  I+ YE+VP  +E+AL KAV+ QPVS+
Sbjct: 200 FIIQNHGLNTEANYPYKAVDGKCNANEAANHAATITGYEDVPVNNEKALQKAVANQPVSV 259

Query: 269 AIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYM 328
           AI A  ++FQ YK G+F G CGTQLDH VT VG+G + DG  YWL+KNSWG  WG+ GY+
Sbjct: 260 AIDASGSDFQFYKTGVFTGSCGTQLDHGVTAVGYGVSADGTQYWLVKNSWGTEWGEEGYI 319

Query: 329 KIVR----DEGLCGIGTRSSYPLA 348
            + R     EGLCGI   +SYP A
Sbjct: 320 MMQRGVKAQEGLCGIAMMASYPTA 343


>gi|400180443|gb|AFP73358.1| cysteine protease, partial [Solanum habrochaites]
          Length = 345

 Score =  346 bits (887), Expect = 1e-92,   Method: Compositional matrix adjust.
 Identities = 172/345 (49%), Positives = 235/345 (68%), Gaps = 13/345 (3%)

Query: 10  SFKINTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLK 69
           + K++   + I +  ++S  +    +RS  + SV E HE WM++HGR YKDE+EK  R  
Sbjct: 2   AMKVDLMSILITLFFVISMFNSQTRARSQPKLSVSERHELWMSRHGRVYKDEVEKGERFM 61

Query: 70  IFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMP----SPSHRSTTSST 125
           IFKEN+++IE  NK GN +YKLG N+F+D+T++EF A +TG  +P    SPS   +T   
Sbjct: 62  IFKENMKFIESVNKAGNLSYKLGMNEFADITSEEFLAKFTGLNIPNSYLSPSPMPSTE-- 119

Query: 126 FKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSE 185
           FK  +LS  D+P++LDWR+ GAVT +KNQ +CGCCWAF+AV ++EG  KI +GNL++ SE
Sbjct: 120 FKINDLSDDDMPSNLDWRESGAVTQVKNQGQCGCCWAFSAVGSLEGAYKIATGNLMEFSE 179

Query: 186 QQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISN 245
           Q+LLDC+TN N GC GG    AF +II+N GI+ E +Y Y     TC +  K AA +ISN
Sbjct: 180 QELLDCTTN-NYGCNGGFMTNAFDFIIENGGISRESDYEYLGQQYTCRSQGKTAAVQISN 238

Query: 246 YEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTT 305
           Y+ VP G E +LL+AV+ QPVSI IAA S + Q Y  G ++G C  +++HAVT +G+GT 
Sbjct: 239 YQVVPEG-ETSLLQAVTKQPVSIGIAA-SHDLQFYAGGTYDGSCANRINHAVTAIGYGTD 296

Query: 306 EDGANYWLIKNSWGNTWGDAGYMKIVRDE----GLCGIGTRSSYP 346
           E G  YWL+KNSWG +WG+ G+MKI+RD     GLC I   SSYP
Sbjct: 297 EKGQKYWLLKNSWGTSWGENGFMKIIRDSGNPAGLCDIAKMSSYP 341


>gi|40806498|gb|AAR92154.1| putative cysteine protease 1 [Iris x hollandica]
          Length = 340

 Score =  345 bits (886), Expect = 1e-92,   Method: Compositional matrix adjust.
 Identities = 177/345 (51%), Positives = 232/345 (67%), Gaps = 17/345 (4%)

Query: 13  INTTPMFIIITLLVSC--ASQVVSSRSTHE-QSVVEIHEKWMAQHGRSYKDELEKEMRLK 69
             T  +   + LL+    ASQ  + RS  E +S++E HE+WMAQHGR YK+  EK  R +
Sbjct: 4   FKTVKLLPALALLIVAIWASQGEAGRSLGENKSMLERHEQWMAQHGRVYKNAAEKAHRFE 63

Query: 70  IFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQ 129
           IF+ N+E IE  N E N  +KLG NQF+DLTN+EF+   T      PS  ++T S FKY+
Sbjct: 64  IFRANVERIESFNAE-NHKFKLGVNQFADLTNEEFKTRNT----LKPSKMASTKS-FKYE 117

Query: 130 NLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLL 189
           N+  T VP ++DWR KGAVTPIK+Q +CG CWAF+AVAA EGITK+ +G LI LSEQ+++
Sbjct: 118 NV--TAVPATMDWRTKGAVTPIKDQGQCGSCWAFSAVAATEGITKLSTGKLISLSEQEVV 175

Query: 190 DCS-TNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPA-AAKISNYE 247
           DC  T+ + GC GG  + AF YII+N+GI TE  YPY+A  GTC+  +  + AA I+ YE
Sbjct: 176 DCDVTSDDQGCNGGEMDDAFEYIIKNKGITTEANYPYKAADGTCNTKKAASHAASITGYE 235

Query: 248 EVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTED 307
           +V    E ALLKA + QP+++AI A    FQ Y  G+F G CGT LDH VT+VG+G T D
Sbjct: 236 DVTVNSEAALLKAAANQPIAVAIDAGDFAFQMYSSGVFTGDCGTDLDHGVTLVGYGATSD 295

Query: 308 GANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPLA 348
           G  YWL+KNSWG +WG+ GY+++ RD    EGLCGI   +SYP A
Sbjct: 296 GTKYWLVKNSWGTSWGEDGYIRMERDVDAKEGLCGIAMDASYPTA 340


>gi|110737404|dbj|BAF00646.1| putative cysteine proteinase [Arabidopsis thaliana]
          Length = 345

 Score =  345 bits (886), Expect = 2e-92,   Method: Compositional matrix adjust.
 Identities = 166/343 (48%), Positives = 236/343 (68%), Gaps = 13/343 (3%)

Query: 18  MFIIITLLVSCASQVVSSRST------HEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIF 71
           + +++T+L+   +    S++T       EQS+V+ HE+WMA+  R Y+DELEK MR  +F
Sbjct: 4   IMVLVTVLIILFTGFRISQATSRTVIFREQSMVDKHEQWMARFSREYRDELEKNMRRDVF 63

Query: 72  KENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYK-MPSPSHRSTTSSTFKYQN 130
           K+NL++IE  NK+GN++YKLG N+F+D TN+EF A++TG K +   S     + T   Q 
Sbjct: 64  KKNLKFIENFNKKGNKSYKLGVNEFADWTNEEFLAIHTGLKGLTEVSPSKVVAKTISSQT 123

Query: 131 LSMTD-VPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLL 189
            +++D V  S DWR +GAVTP+K Q +CGCCWAF+AVAAVEG+ KI  GNL+ LSEQQLL
Sbjct: 124 WNVSDMVVESKDWRAEGAVTPVKYQGQCGCCWAFSAVAAVEGVAKIAGGNLVSLSEQQLL 183

Query: 190 DCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEV 249
           DC    +  C GG    AF Y++QN+GIA+E++Y YQ   G C +  +P AA+IS ++ V
Sbjct: 184 DCDREYDRDCDGGIMSDAFNYVVQNRGIASENDYSYQGSDGGCRSNARP-AARISGFQTV 242

Query: 250 PSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGA 309
           PS +E+ALL+AVS QPVS+++ A    F  Y  G+++G CGT  +HAVT VG+GT++DG 
Sbjct: 243 PSNNERALLEAVSRQPVSVSMDATGDGFMHYSGGVYDGPCGTSSNHAVTFVGYGTSQDGT 302

Query: 310 NYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPLA 348
            YWL KNSWG TW + GY++I RD    +G+CG+   + YP+A
Sbjct: 303 KYWLAKNSWGETWEEKGYIRIRRDVAWPQGMCGVAQYAFYPVA 345


>gi|356515048|ref|XP_003526213.1| PREDICTED: vignain-like [Glycine max]
          Length = 350

 Score =  345 bits (885), Expect = 2e-92,   Method: Compositional matrix adjust.
 Identities = 171/337 (50%), Positives = 234/337 (69%), Gaps = 14/337 (4%)

Query: 18  MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
           +  ++ LL  C SQV+S R+ HE S+ E HE+WM ++G+ YKD  EK+ RL IFK+N+E+
Sbjct: 10  ILALVLLLSICTSQVMS-RNLHEASMSERHEQWMKKYGKVYKDAAEKQKRLLIFKDNVEF 68

Query: 78  IEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVP 137
           IE  N  GN+ YKL  N  +D TN+EF A + GYK    SH  T    FKY N+  TD+P
Sbjct: 69  IESFNAAGNKPYKLSINHLADQTNEEFVASHNGYKYKG-SHSQT---PFKYGNV--TDIP 122

Query: 138 TSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNN 197
           T++DWR  GAVT +K+Q +CG CWAF+ VAA EGI +I +G L+ LSEQ+L+DC +  ++
Sbjct: 123 TAVDWRQNGAVTAVKDQGQCGSCWAFSTVAATEGIYQISTGMLMSLSEQELVDCDSV-DH 181

Query: 198 GCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQA 256
           GC GG  E  F +II+N GI++E  YPY AV GTC A+++ + AA+I  YE VP+  E+A
Sbjct: 182 GCDGGLMEDGFEFIIKNGGISSEANYPYTAVDGTCDASKEASPAAQIKGYETVPANSEEA 241

Query: 257 LLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGAN-YWLIK 315
           L +AV+ QPVS++I A  + FQ Y  G+F G CGTQLDH VT+VG+GTT+DG + YW++K
Sbjct: 242 LQQAVANQPVSVSIDAGGSGFQFYSSGVFTGQCGTQLDHGVTVVGYGTTDDGTHEYWIVK 301

Query: 316 NSWGNTWGDAGYMKIVR----DEGLCGIGTRSSYPLA 348
           NSWG  WG+ GY+++ R     EGLCGI   +SYP+ 
Sbjct: 302 NSWGTQWGEEGYIRMQRGIDAQEGLCGIAMDASYPMG 338


>gi|400180441|gb|AFP73357.1| cysteine protease [Solanum habrochaites]
          Length = 344

 Score =  345 bits (884), Expect = 3e-92,   Method: Compositional matrix adjust.
 Identities = 170/345 (49%), Positives = 237/345 (68%), Gaps = 13/345 (3%)

Query: 10  SFKINTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLK 69
           + K++   + I +  ++S  +    +RS  + SV E HE WM++HGR YKDE+EK  R  
Sbjct: 2   AMKVDLMSILITLFFVISMFNSQTRARSQPKLSVSERHELWMSRHGRVYKDEVEKGERFM 61

Query: 70  IFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMP----SPSHRSTTSST 125
           IFKEN+++IE  NK GN +YKLG N+F+D+T++EF A +TG  +P    SPS  S+T   
Sbjct: 62  IFKENMKFIESVNKAGNLSYKLGMNEFADITSEEFLAKFTGLNIPNSYLSPSPMSSTE-- 119

Query: 126 FKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSE 185
           FK  ++S  D+P++LDWR+ GAVT +KNQ +CGCCWAF+AV ++EG  KI +GNL++ SE
Sbjct: 120 FKINDISDDDMPSNLDWRESGAVTQVKNQGQCGCCWAFSAVGSLEGAYKIATGNLMEFSE 179

Query: 186 QQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISN 245
           Q+LLDC+TN N GC GG    AF +I +N GI+ E +Y Y     TC + +K AA +IS+
Sbjct: 180 QELLDCTTN-NYGCNGGFMTNAFDFIRENGGISRESDYEYLGQQYTCRSQEKTAAVQISS 238

Query: 246 YEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTT 305
           Y+ VP G E +LL+AV+ QPVSI IAA S + Q Y  G ++G C  +++HAVT +G+GT 
Sbjct: 239 YQVVPEG-ETSLLQAVTKQPVSIGIAA-SQDLQFYAGGTYDGSCANRINHAVTAIGYGTD 296

Query: 306 EDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYP 346
           E+G  YWL+KNSWG +WG+ G+MKI+RD     GLC I   SSYP
Sbjct: 297 ENGQKYWLLKNSWGTSWGEKGFMKIIRDYGNPSGLCDIAKLSSYP 341


>gi|297794671|ref|XP_002865220.1| senescence-associated gene 12 [Arabidopsis lyrata subsp. lyrata]
 gi|297311055|gb|EFH41479.1| senescence-associated gene 12 [Arabidopsis lyrata subsp. lyrata]
          Length = 346

 Score =  345 bits (884), Expect = 3e-92,   Method: Compositional matrix adjust.
 Identities = 166/338 (49%), Positives = 226/338 (66%), Gaps = 10/338 (2%)

Query: 18  MFIIITLLVSCASQVVSSRSTHEQSVVEI-HEKWMAQHGRSYKDELEKEMRLKIFKENLE 76
           +F+ + +  S    +  SR    + +++  H +WM +HGR Y D  EK  R  +FK N+E
Sbjct: 8   IFLFVAIFSSFYFSISLSRPLDNELIMQKRHIEWMTKHGRVYADVKEKSNRYVVFKSNVE 67

Query: 77  YIEKANK-EGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSP--SHRSTTSSTFKYQNLSM 133
            IE  N     RT+KL  NQF+DLTNDEFR++YTG+K  S   S   T +++F+YQN+S 
Sbjct: 68  RIEHLNNIPAGRTFKLAVNQFADLTNDEFRSMYTGFKGVSSLSSQSQTKTTSFRYQNVSS 127

Query: 134 TDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCST 193
             +P S+DWR KGAVTPIKNQ  CGCCWAF+AVAA+EG T+I+ G LI LSEQQL+DC T
Sbjct: 128 GALPISVDWRTKGAVTPIKNQGSCGCCWAFSAVAAIEGATQIKKGKLISLSEQQLVDCDT 187

Query: 194 NGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQ-KPAAAKISNYEEVPSG 252
           N + GC GG  + AF +I+   G+ TE  YPY+    TC++ +  P A  I+ YE+VP  
Sbjct: 188 N-DFGCEGGLMDTAFEHIMATGGLTTESNYPYKGEDATCNSKKTNPKATSITGYEDVPVN 246

Query: 253 DEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYW 312
           DEQAL+KAV+ QPVS+ I     +FQ Y  G+F G C T LDHAVT +G+G + +G+ YW
Sbjct: 247 DEQALMKAVAHQPVSVGIEGGGFDFQFYSSGVFTGECTTYLDHAVTAIGYGQSTNGSKYW 306

Query: 313 LIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYP 346
           +IKNSWG  WG++GYM+I +D    +GLCG+  ++SYP
Sbjct: 307 IIKNSWGTKWGESGYMRIQKDIKDKQGLCGLAMKASYP 344


>gi|224081320|ref|XP_002306369.1| predicted protein [Populus trichocarpa]
 gi|222855818|gb|EEE93365.1| predicted protein [Populus trichocarpa]
          Length = 340

 Score =  345 bits (884), Expect = 3e-92,   Method: Compositional matrix adjust.
 Identities = 174/349 (49%), Positives = 236/349 (67%), Gaps = 20/349 (5%)

Query: 11  FKINTTPMFIIITLLVSCAS--QVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRL 68
            ++     FI + LL    +     ++R+  + S+ E HE+WMAQ+GR YKD+ EKE R 
Sbjct: 1   MRLTKQSQFICLALLFVLGAWPSKSAARTLQDVSMYERHEQWMAQYGRVYKDDAEKETRY 60

Query: 69  KIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYK--MPSPSHRSTTSSTF 126
            IFKEN+  I+  N +  ++YKLG NQF+DL+N+EF+A    +K  M SP      +  F
Sbjct: 61  NIFKENVARIDAFNSQTGKSYKLGVNQFADLSNEEFKASRNRFKGHMCSPQ-----AGPF 115

Query: 127 KYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQ 186
           +Y+N+S   VP ++DWR KGAVTP+K+Q +CGCCWAF+AVAA+EGI ++ +G LI LSEQ
Sbjct: 116 RYENVSA--VPATMDWRKKGAVTPVKDQGQCGCCWAFSAVAAMEGINQLTTGKLISLSEQ 173

Query: 187 QLLDCSTNG-NNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPA--AAKI 243
           +++DC T G + GC GG  + AF +I QN+G+ TE  YPY    GTC+  QK A  AAKI
Sbjct: 174 EVVDCDTKGEDQGCNGGLMDDAFKFIEQNKGLTTEANYPYTGTDGTCN-TQKEATHAAKI 232

Query: 244 SNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFG 303
           + +E+VP+  E AL+KAV+ QPVS+AI A   EFQ Y  GIF G CGTQLDH VT VG+G
Sbjct: 233 TGFEDVPANSEAALMKAVAKQPVSVAIDAGGFEFQFYSSGIFTGSCGTQLDHGVTAVGYG 292

Query: 304 TTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPLA 348
            + DG  YWL+KNSWG  WG+ GY+++ +D    EGLCGI  ++SYP A
Sbjct: 293 IS-DGTKYWLVKNSWGAQWGEEGYIRMQKDISAKEGLCGIAMQASYPSA 340


>gi|356517348|ref|XP_003527349.1| PREDICTED: vignain-like [Glycine max]
          Length = 343

 Score =  345 bits (884), Expect = 3e-92,   Method: Compositional matrix adjust.
 Identities = 169/324 (52%), Positives = 224/324 (69%), Gaps = 13/324 (4%)

Query: 33  VSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLG 92
           V+ R+  + S+ E HE+WM ++ + YKD  E+E R KIFKEN+ YIE  N   N+ Y LG
Sbjct: 25  VTCRTLQDASMYERHEEWMGRYAKVYKDPQERERRFKIFKENVNYIEAFNNAANKPYTLG 84

Query: 93  TNQFSDLTNDEFRALYTGYK--MPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTP 150
            NQF+DLTN+EF A    +K  M S   R+TT   FKY+N+  T +P+++DWR KGAVTP
Sbjct: 85  INQFADLTNEEFIAPRNRFKGHMCSSITRTTT---FKYENV--TAIPSTVDWRQKGAVTP 139

Query: 151 IKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-NNGCLGGSREKAFA 209
           IK+Q +CGCCWAF+AVAA EGI  + +G LI LSEQ+++DC T G + GC GG  + AF 
Sbjct: 140 IKDQGQCGCCWAFSAVAATEGIHALSAGKLISLSEQEVVDCDTKGEDQGCAGGFMDGAFK 199

Query: 210 YIIQNQGIATEDEYPYQAVPGTCSA-AQKPAAAKISNYEEVPSGDEQALLKAVSMQPVSI 268
           +IIQN G+  E  YPY+AV G C+A A     A I+ YE+VP  +E+AL KAV+ QPVS+
Sbjct: 200 FIIQNHGLNNEPNYPYKAVDGKCNAKAAANHVATITGYEDVPVNNEKALQKAVANQPVSV 259

Query: 269 AIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYM 328
           AI A  ++FQ Y+ G+F G CGT+LDH VT VG+G + DG  YWL+KNSWG  WG+ GY+
Sbjct: 260 AIDASGSDFQFYQSGVFTGSCGTELDHGVTAVGYGVSADGTEYWLVKNSWGTEWGEEGYI 319

Query: 329 KIVR----DEGLCGIGTRSSYPLA 348
           ++ R    +EGLCGI   +SYP A
Sbjct: 320 RMQRGVKAEEGLCGIAMMASYPTA 343


>gi|356543116|ref|XP_003540009.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like [Glycine
           max]
          Length = 337

 Score =  344 bits (883), Expect = 3e-92,   Method: Compositional matrix adjust.
 Identities = 171/337 (50%), Positives = 231/337 (68%), Gaps = 15/337 (4%)

Query: 18  MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
           +  ++ LL  C SQV+S R  HE S+ E HE+WM ++G+ YKD  EK+ RL IFK+N+E+
Sbjct: 10  ILALVLLLSICTSQVMS-RYLHEASMSERHEQWMKKYGKVYKDAAEKQKRLLIFKDNVEF 68

Query: 78  IEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSST-FKYQNLSMTDV 136
           IE  N  GN+ YKLG N  +D TN+EF A + GYK     H+++ S T FKY+N+  T V
Sbjct: 69  IESFNAAGNKPYKLGINHLADQTNEEFVASHNGYK-----HKASHSQTPFKYENV--TGV 121

Query: 137 PTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGN 196
           P ++DWR+ GAVT +K+Q +CG CWAF+ VAA EGI +I +  L+ LSEQ+L+DC +  +
Sbjct: 122 PNAVDWRENGAVTAVKDQGQCGSCWAFSTVAATEGIYQITTSMLMSLSEQELVDCDSV-D 180

Query: 197 NGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQ 255
           +GC GG  E  F +II+N GI++E  YPY AV GTC A ++ + AA+I  YE VP+  E 
Sbjct: 181 HGCDGGYMEGGFEFIIKNGGISSEANYPYTAVDGTCDANKEASPAAQIKGYETVPANSED 240

Query: 256 ALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIK 315
           AL KAV+ QPVS+ I A  + FQ Y  G+F G CGTQLDH VT VG+G+T+DG  YW++K
Sbjct: 241 ALQKAVANQPVSVTIDAGGSAFQFYSSGVFTGQCGTQLDHGVTAVGYGSTDDGTQYWIVK 300

Query: 316 NSWGNTWGDAGYMKIVR----DEGLCGIGTRSSYPLA 348
           NSWG  WG+ GY+++ R     EGLCGI   +SYP A
Sbjct: 301 NSWGTQWGEEGYIRMQRGTDAQEGLCGIAMDASYPTA 337


>gi|356577763|ref|XP_003556992.1| PREDICTED: vignain-like [Glycine max]
          Length = 343

 Score =  344 bits (883), Expect = 3e-92,   Method: Compositional matrix adjust.
 Identities = 169/324 (52%), Positives = 224/324 (69%), Gaps = 13/324 (4%)

Query: 33  VSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLG 92
           V+ R+  + S+ E HE+WM ++ + YKD  E+E R KIFKEN+ YIE  N   N+ Y LG
Sbjct: 25  VTCRTLQDASMYERHEEWMGRYAKVYKDPQERERRFKIFKENVNYIEAFNNAANKPYTLG 84

Query: 93  TNQFSDLTNDEFRALYTGYK--MPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTP 150
            NQF+DLTN+EF A    +K  M S   R+TT   FKY+N+  T +P+++DWR KGAVTP
Sbjct: 85  INQFADLTNEEFIAPRNRFKGHMCSSITRTTT---FKYENV--TAIPSTVDWRQKGAVTP 139

Query: 151 IKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-NNGCLGGSREKAFA 209
           IK+Q +CGCCWAF+AVAA EGI  + +G LI LSEQ+++DC T G + GC GG  + AF 
Sbjct: 140 IKDQGQCGCCWAFSAVAATEGIHALSAGKLISLSEQEVVDCDTKGEDQGCAGGFMDGAFK 199

Query: 210 YIIQNQGIATEDEYPYQAVPGTCSA-AQKPAAAKISNYEEVPSGDEQALLKAVSMQPVSI 268
           +IIQN G+  E  YPY+AV G C+A A     A I+ YE+VP  +E+AL KAV+ QPVS+
Sbjct: 200 FIIQNHGLNNEPNYPYKAVDGKCNAKAAANHVATITGYEDVPVNNEKALQKAVANQPVSV 259

Query: 269 AIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYM 328
           AI A  ++FQ Y+ G+F G CGT+LDH VT VG+G + DG  YWL+KNSWG  WG+ GY+
Sbjct: 260 AIDASGSDFQFYQSGVFTGSCGTELDHGVTAVGYGVSADGTEYWLVKNSWGTEWGEEGYI 319

Query: 329 KIVR----DEGLCGIGTRSSYPLA 348
           ++ R    +EGLCGI   +SYP A
Sbjct: 320 RMQRGVKAEEGLCGIAMMASYPTA 343


>gi|224121800|ref|XP_002330656.1| predicted protein [Populus trichocarpa]
 gi|222872260|gb|EEF09391.1| predicted protein [Populus trichocarpa]
          Length = 342

 Score =  344 bits (883), Expect = 3e-92,   Method: Compositional matrix adjust.
 Identities = 168/336 (50%), Positives = 220/336 (65%), Gaps = 9/336 (2%)

Query: 19  FIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYI 78
           F    L++   +  V+SR   E  +   HE+WMA +G+ Y D  EKE R KIFK N+EYI
Sbjct: 10  FFAFILILGMWAFEVASRELQESYMSARHEQWMATYGKVYVDAAEKERRFKIFKNNVEYI 69

Query: 79  EKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPT 138
           E  N  GN+ YKL  N+F+D TN++F+    GY+ P  + R    ++FKY+N+  T VP 
Sbjct: 70  ESFNTAGNKPYKLSVNKFADQTNEKFKGARNGYRRPFQT-RPMKVTSFKYENV--TAVPA 126

Query: 139 SLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-NN 197
           ++DWR KGAVT IK+Q +CG CWAF+ VAA EGI ++ +G L+ LSEQ+L+DC   G + 
Sbjct: 127 TMDWRKKGAVTLIKDQGQCGSCWAFSTVAATEGINQLTTGKLVSLSEQELVDCDIQGEDQ 186

Query: 198 GCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTC-SAAQKPAAAKISNYEEVPSGDEQA 256
           GC GG  E  F +II+N GI TE  YPYQA  GTC S  Q    AKI+ YE VP+  E  
Sbjct: 187 GCEGGLMEDGFEFIIKNHGITTEANYPYQAADGTCNSKKQASHIAKITGYESVPANSEAE 246

Query: 257 LLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKN 316
           LLK V+ QP+S++I A  ++FQ Y  G+F G CGT+LDH VT VG+G T DG  YWL+KN
Sbjct: 247 LLKVVANQPISVSIDAGGSDFQFYSSGVFTGKCGTELDHGVTAVGYGETSDGTKYWLVKN 306

Query: 317 SWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPLA 348
           SWG +WG+ GY+++ RD    EGLCGI   SSYP A
Sbjct: 307 SWGTSWGEEGYIRMQRDIDTEEGLCGIAMDSSYPTA 342


>gi|242072572|ref|XP_002446222.1| hypothetical protein SORBIDRAFT_06g005410 [Sorghum bicolor]
 gi|241937405|gb|EES10550.1| hypothetical protein SORBIDRAFT_06g005410 [Sorghum bicolor]
          Length = 340

 Score =  344 bits (882), Expect = 4e-92,   Method: Compositional matrix adjust.
 Identities = 163/336 (48%), Positives = 221/336 (65%), Gaps = 12/336 (3%)

Query: 18  MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
           +  I+ L + C + + +     + ++V  HE+WMAQ+ R YKD  EK  R ++FK N+++
Sbjct: 8   ILAILGLALFCGAALAARDLNDDSAMVARHEQWMAQYNRVYKDATEKAQRFEVFKANVKF 67

Query: 78  IEKANKEGNRTYKLGTNQFSDLTNDEFRALYT--GYKMPSPSHRSTTSSTFKYQNLSMTD 135
           IE  N  GNR + LG NQF+DLTNDEFRA  T  G+K PSP    T    F+Y+N+S+  
Sbjct: 68  IESFNAGGNRKFWLGVNQFADLTNDEFRATKTNKGFK-PSPVKVPTG---FRYENVSVDA 123

Query: 136 VPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG 195
           +P S+DWR KGAVTPIK+Q +CGCCWAF+AVAA EGI KI +  LI LSEQ+L+DC  +G
Sbjct: 124 LPASIDWRTKGAVTPIKDQGQCGCCWAFSAVAATEGIVKISTDKLISLSEQELVDCDVHG 183

Query: 196 -NNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDE 254
            + GC GG  + AF +II+N G+ TE  YPY A  G C +    +AA I  +E+VP+ DE
Sbjct: 184 EDQGCEGGLMDDAFKFIIKNGGLTTESSYPYTATDGKCKSGTN-SAANIKGFEDVPANDE 242

Query: 255 QALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLI 314
            AL+KAV+ QPVS+A+      FQ Y  G+  G CGT LDH +  +G+G T DG  YWL+
Sbjct: 243 AALMKAVANQPVSVAVDGGDMTFQLYSGGVMTGSCGTDLDHGIAAIGYGQTSDGTKYWLL 302

Query: 315 KNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYP 346
           KNSWG TWG+ GY+++ +D     G+CG+    SYP
Sbjct: 303 KNSWGTTWGENGYLRMEKDISDKRGMCGLAMEPSYP 338


>gi|400180422|gb|AFP73349.1| cysteine protease [Solanum chmielewskii]
          Length = 344

 Score =  344 bits (882), Expect = 4e-92,   Method: Compositional matrix adjust.
 Identities = 171/345 (49%), Positives = 236/345 (68%), Gaps = 13/345 (3%)

Query: 10  SFKINTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLK 69
           + K++   + I +  ++S  +     RS  + SV E HE WM++HGR YKDE+EK  R  
Sbjct: 2   AMKVDLMNILITLFFVISIFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFM 61

Query: 70  IFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMP----SPSHRSTTSST 125
           IFKEN+++IE  NK GN +YKLG N+F+D+T+ EF A +TG  +P    SPS  S+T   
Sbjct: 62  IFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTE-- 119

Query: 126 FKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSE 185
           FK  +LS  D+P++LDWR+ GAVT +K+Q +CGCCWAF+AV ++EG  KI +GNL++ SE
Sbjct: 120 FKTNDLSDDDMPSNLDWRESGAVTQVKHQGQCGCCWAFSAVGSLEGAYKIATGNLMEFSE 179

Query: 186 QQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISN 245
           Q+LLDC+TN N GC GG    AF +II+N GI+ E +Y Y     TC + +K AA +IS+
Sbjct: 180 QELLDCTTN-NYGCNGGFMTNAFDFIIENGGISRESDYEYLGQQYTCRSQEKTAAVQISS 238

Query: 246 YEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTT 305
           Y+ VP G E +LL+AV+ QPVSI IAA S + Q Y  G ++G C  +++HAVT +G+GT 
Sbjct: 239 YQVVPEG-ETSLLQAVTKQPVSIGIAA-SQDLQFYSGGTYDGSCADRINHAVTAIGYGTD 296

Query: 306 EDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYP 346
           E+G  YWL+KNSWG +WG+ G+MKI+RD     GLC I   SSYP
Sbjct: 297 EEGQKYWLLKNSWGTSWGENGFMKIIRDSGDPSGLCDIAKMSSYP 341


>gi|356543124|ref|XP_003540013.1| PREDICTED: vignain-like [Glycine max]
 gi|356543126|ref|XP_003540014.1| PREDICTED: vignain-like [Glycine max]
          Length = 337

 Score =  343 bits (881), Expect = 5e-92,   Method: Compositional matrix adjust.
 Identities = 171/337 (50%), Positives = 230/337 (68%), Gaps = 15/337 (4%)

Query: 18  MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
           +  ++ LL  C SQV+S R+ HE S+ E HE+WM ++G+ YKD  EK+ RL IFK+N+E+
Sbjct: 10  ILALVLLLSICTSQVMS-RNLHEASMSERHEQWMKKYGKVYKDAAEKQKRLLIFKDNVEF 68

Query: 78  IEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSST-FKYQNLSMTDV 136
           IE  N  GNR YKL  N  +D TN+EF A + GYK     H+ + S T FKY+N+  T V
Sbjct: 69  IESFNAAGNRPYKLSINHLADQTNEEFVASHNGYK-----HKGSHSQTPFKYENV--TGV 121

Query: 137 PTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGN 196
           P ++DWR+ GAVT +K+Q +CG CWAF+ VAA EGI +I +  L+ LSEQ+L+DC +  +
Sbjct: 122 PNAVDWRENGAVTAVKDQGQCGSCWAFSTVAATEGIYQITTSMLMSLSEQELVDCDSV-D 180

Query: 197 NGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQ 255
           +GC GG  E  F +II+N GI++E  YPY AV GTC A ++ + AA+I  YE VP+  E 
Sbjct: 181 HGCDGGYMEGGFEFIIKNGGISSEANYPYTAVDGTCDANKEASPAAQIKGYETVPANSED 240

Query: 256 ALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIK 315
           AL KAV+ QPVS+ I A  + FQ Y  G+F G CGTQLDH VT VG+G+T+DG  YW++K
Sbjct: 241 ALQKAVANQPVSVTIDAGGSAFQFYSSGVFTGQCGTQLDHGVTAVGYGSTDDGTQYWIVK 300

Query: 316 NSWGNTWGDAGYMKIVR----DEGLCGIGTRSSYPLA 348
           NSWG  WG+ GY+++ R     EGLCGI   +SYP A
Sbjct: 301 NSWGTQWGEEGYIRMQRGTDAQEGLCGIAMDASYPTA 337


>gi|400180357|gb|AFP73317.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score =  343 bits (881), Expect = 6e-92,   Method: Compositional matrix adjust.
 Identities = 171/345 (49%), Positives = 235/345 (68%), Gaps = 13/345 (3%)

Query: 10  SFKINTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLK 69
           + K++   + I +  ++S  +    +RS  + SV E HE WM++HGR YKDE+EK  R  
Sbjct: 2   AMKVDLMSILITLFFVISMFNSQTRARSQPKLSVSERHELWMSRHGRVYKDEVEKGERFM 61

Query: 70  IFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMP----SPSHRSTTSST 125
           IFKEN+++IE  NK GN +YKLG N+F+D+T+ EF A +TG  +P    SPS  S+T   
Sbjct: 62  IFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTE-- 119

Query: 126 FKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSE 185
           FK  +LS  D+P++LDWR+ GAVT +K+Q  CGCCWAF+AV ++EG  KI +GNL++ SE
Sbjct: 120 FKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSE 179

Query: 186 QQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISN 245
           Q+LLDC+TN N GC GG    AF +II+N GI+ E +Y Y     TC + +K AA +IS+
Sbjct: 180 QELLDCTTN-NYGCNGGFMTNAFDFIIENGGISRESDYEYLGQQYTCRSQEKTAAVQISS 238

Query: 246 YEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTT 305
           Y+ VP G E +LL+AV+ QPVSI IAA S + Q Y  G ++G C  +++HAVT +G+GT 
Sbjct: 239 YKVVPEG-ETSLLQAVTKQPVSIGIAA-SQDLQFYAGGTYDGSCADRINHAVTAIGYGTD 296

Query: 306 EDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYP 346
           E G  YWL+KNSWG +WG+ G+MKI+RD     GLC I   SSYP
Sbjct: 297 EKGQKYWLLKNSWGTSWGENGFMKIIRDYGNPSGLCDIAKMSSYP 341


>gi|400180426|gb|AFP73351.1| cysteine protease [Solanum corneliomuelleri]
          Length = 344

 Score =  343 bits (881), Expect = 6e-92,   Method: Compositional matrix adjust.
 Identities = 171/345 (49%), Positives = 235/345 (68%), Gaps = 13/345 (3%)

Query: 10  SFKINTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLK 69
           + K++   + I +  ++S  +     RS  + SV E HE WM++HGR YKDE+EK  R  
Sbjct: 2   AMKVDLMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFM 61

Query: 70  IFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMP----SPSHRSTTSST 125
           IFKEN+++IE  NK GN +YKLG N+F+D+T+ EF A +TG  +P    SPS  S+T   
Sbjct: 62  IFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTE-- 119

Query: 126 FKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSE 185
           FK  +LS  D+P++LDWR+ GAVT +K+Q  CGCCWAF+AV ++EG  KI +GNL++ SE
Sbjct: 120 FKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSE 179

Query: 186 QQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISN 245
           Q+LLDC+TN N GC GG    AF +II+N GI+ E +Y Y     TC + +K AA +IS+
Sbjct: 180 QELLDCTTN-NYGCNGGFMTNAFDFIIENGGISRESDYEYLGQQYTCRSQEKTAAVQISS 238

Query: 246 YEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTT 305
           Y+ VP G E +LL+AV+ QPVSI IAA S + Q Y  G ++G C  +++HAVT +G+GT 
Sbjct: 239 YQVVPEG-ETSLLQAVTKQPVSIGIAA-SQDLQFYAGGTYDGSCADRINHAVTAIGYGTD 296

Query: 306 EDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYP 346
           E+G  YWL+KNSWG +WG+ G+MKI+RD     GLC I   SSYP
Sbjct: 297 ENGQKYWLLKNSWGTSWGENGFMKIIRDSGNPSGLCDIAKMSSYP 341


>gi|357446975|ref|XP_003593763.1| Cysteine proteinase [Medicago truncatula]
 gi|355482811|gb|AES64014.1| Cysteine proteinase [Medicago truncatula]
          Length = 350

 Score =  343 bits (880), Expect = 6e-92,   Method: Compositional matrix adjust.
 Identities = 174/330 (52%), Positives = 229/330 (69%), Gaps = 9/330 (2%)

Query: 22  ITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKA 81
           I LL +CA   +S R+  E SVVE H++WM ++ R+Y +  E E R KIFKENLEYIE  
Sbjct: 9   IILLWACAYPTMS-RTLTESSVVEAHQQWMMKYERTYTNSSEMEKRKKIFKENLEYIENF 67

Query: 82  NKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLD 141
           N  GN++YKLG N++SDLT++EF A +TG+K+      S   S     NL+  DVPT+ D
Sbjct: 68  NNVGNKSYKLGLNRYSDLTSEEFIASHTGFKVSDQLSDSKMRSVAIPFNLN-DDVPTNFD 126

Query: 142 WRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLG 201
           WR+KG VT +KNQ++CGCCWAF AVAAVEGI KI++GNLI LSEQQL+DC    ++GC G
Sbjct: 127 WREKGVVTDVKNQRQCGCCWAFTAVAAVEGIVKIKNGNLISLSEQQLVDCDRQ-SSGCGG 185

Query: 202 GSREKAFAYIIQNQGIATEDEYPYQAVP-GTCSAAQKPAAAKISNYEEVPSGDEQALLKA 260
           G    AF  II+++GI  ED+YPY+A    TC   Q P AA+I+ Y +VP+ DEQ LL+A
Sbjct: 186 GDFVLAFDSIIKSRGIVKEDDYPYKANDVQTCQLGQIPGAAQINGYFKVPANDEQQLLRA 245

Query: 261 VSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGN 320
           V  QPVS+AI+  S +F  Y  G++ G CG +L+HAVTI+G+G +E G  YWLIKNSWG 
Sbjct: 246 VLQQPVSVAIST-SYDFHHYMGGVYEGSCGPKLNHAVTIIGYGVSEAGKKYWLIKNSWGE 304

Query: 321 TWGDAGYMKIVRDE----GLCGIGTRSSYP 346
           TWG+ GYMK++R+     G C I   ++YP
Sbjct: 305 TWGEKGYMKVLRESSATGGQCSIAVHAAYP 334


>gi|400180435|gb|AFP73355.1| cysteine protease [Solanum pennellii]
          Length = 344

 Score =  343 bits (880), Expect = 7e-92,   Method: Compositional matrix adjust.
 Identities = 171/345 (49%), Positives = 235/345 (68%), Gaps = 13/345 (3%)

Query: 10  SFKINTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLK 69
           + K++   + I +  ++S  +    +RS  + SV E HE WM++HGR YKDE+EK  R  
Sbjct: 2   AMKVDLMSILITLFFVISMFNTQTRARSQPKLSVSERHELWMSRHGRVYKDEVEKGERFM 61

Query: 70  IFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMP----SPSHRSTTSST 125
           IFKEN+++IE  NK GN +YKLG N+F+D+T+ EF A +TG  +P    SPS  S+T   
Sbjct: 62  IFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYVSPSPMSSTE-- 119

Query: 126 FKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSE 185
           FK  +LS  D+P++LDWR+ GAVT +KNQ +CGCCWAF+AV ++EG  KI +GNL++ SE
Sbjct: 120 FKINDLSDDDMPSNLDWRESGAVTQVKNQGQCGCCWAFSAVGSLEGAYKIATGNLMEFSE 179

Query: 186 QQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISN 245
           Q+LLDC+TN N GC GG    AF +I +N GI+ E +Y Y     TC + +K AA +IS+
Sbjct: 180 QELLDCTTN-NYGCNGGFMTNAFDFIKENGGISRESDYEYLGQQYTCRSQEKTAAVQISS 238

Query: 246 YEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTT 305
           Y+ VP G E +LL+AV+ QPVSI IAA S + Q Y  G ++G C  +++HAVT +G+GT 
Sbjct: 239 YQVVPEG-ETSLLQAVTKQPVSIGIAA-SQDLQFYAGGTYDGSCANRINHAVTAIGYGTD 296

Query: 306 EDGANYWLIKNSWGNTWGDAGYMKIVRDE----GLCGIGTRSSYP 346
           E G  YWL+KNSWG +WG+ G+MKI+RD     GLC I   SSYP
Sbjct: 297 EKGQKYWLLKNSWGTSWGEDGFMKIIRDSGNPAGLCDIAKVSSYP 341


>gi|400180355|gb|AFP73316.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score =  343 bits (880), Expect = 8e-92,   Method: Compositional matrix adjust.
 Identities = 171/345 (49%), Positives = 234/345 (67%), Gaps = 13/345 (3%)

Query: 10  SFKINTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLK 69
           + K++   + I +  ++S  +     RS  + SV E HE WM++HGR YKDE+EK  R  
Sbjct: 2   AMKVDLMSILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFM 61

Query: 70  IFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMP----SPSHRSTTSST 125
           IFKEN+++IE  NK GN +YKLG N+F+D+T+ EF A +TG  +P    SPS  S+T   
Sbjct: 62  IFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTE-- 119

Query: 126 FKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSE 185
           FK  +LS  D+P++LDWR+ GAVT +K+Q  CGCCWAF+AV ++EG  KI +GNL++ SE
Sbjct: 120 FKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSE 179

Query: 186 QQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISN 245
           Q+LLDC+TN N GC GG    AF +II+N GI+ E +Y Y     TC + +K AA +IS+
Sbjct: 180 QELLDCTTN-NYGCNGGFMTNAFDFIIENGGISRESDYEYLGEQYTCRSQEKTAAVQISS 238

Query: 246 YEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTT 305
           Y+ VP G E +LL+AV+ QPVSI IAA S + Q Y  G ++G C  +++HAVT +G+GT 
Sbjct: 239 YQVVPEG-ETSLLQAVTKQPVSIGIAA-SQDLQFYAGGTYDGSCADRINHAVTAIGYGTD 296

Query: 306 EDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYP 346
           E G  YWL+KNSWG +WG+ G+MKI+RD     GLC I   SSYP
Sbjct: 297 EKGQKYWLLKNSWGTSWGENGFMKIIRDSGNPSGLCDIAKMSSYP 341


>gi|388497270|gb|AFK36701.1| unknown [Lotus japonicus]
          Length = 343

 Score =  343 bits (880), Expect = 8e-92,   Method: Compositional matrix adjust.
 Identities = 183/340 (53%), Positives = 242/340 (71%), Gaps = 14/340 (4%)

Query: 17  PMFIIITLLVSCASQVVSSRSTHEQS--VVEIHEKWMAQHGRSYKDELEKEMRLKIFKEN 74
           P+  + T+L +CA   +S     E S  V + H++WM Q+GRSY ++ E E R KIF EN
Sbjct: 6   PIIALCTMLWACAYTAMSRTLYDETSSVVAKTHQQWMLQYGRSYTNDAEMEKRFKIFMEN 65

Query: 75  LEYIEKANKE-GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSM 133
           LEYIEK N   GN++YKL  NQFSDLTN+EF A +TG  M  PS  S++S      +L +
Sbjct: 66  LEYIEKFNNAPGNKSYKLDLNQFSDLTNEEFIASHTGL-MIDPSKPSSSSKRASPASLDL 124

Query: 134 TDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCST 193
           +D PTSLDWR++GAVT +KNQ  CG CWAF+AVAAVEGI KI++GNLI LSEQQL+DC++
Sbjct: 125 SDTPTSLDWREQGAVTDVKNQGNCGSCWAFSAVAAVEGIVKIKNGNLISLSEQQLVDCAS 184

Query: 194 N-GNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQK-PAAAKISNYEEVPS 251
           N  N GC GG  + AF+YI +N GIA+E++Y Y+   GTC   +    AA+IS YE+VP+
Sbjct: 185 NEQNQGCGGGFMDNAFSYITEN-GIASENDYQYRGGAGTCQNNEMITPAARISGYEDVPA 243

Query: 252 GDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTT-EDGAN 310
           G++Q LL AVS QPVS+AIA     F  YKEGI++G CG+ L+H VT+VG+GT+ EDG  
Sbjct: 244 GEDQLLL-AVSQQPVSVAIAV-GQSFHLYKEGIYSGPCGSSLNHGVTLVGYGTSEEDGTK 301

Query: 311 YWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYP 346
           YWLIKNSWG +WG+ GYM+++R+    EG CGI  ++S+P
Sbjct: 302 YWLIKNSWGESWGENGYMRLLRESGQSEGHCGIAVKASHP 341


>gi|400180428|gb|AFP73352.1| cysteine protease [Solanum corneliomuelleri]
          Length = 344

 Score =  343 bits (880), Expect = 8e-92,   Method: Compositional matrix adjust.
 Identities = 171/345 (49%), Positives = 235/345 (68%), Gaps = 13/345 (3%)

Query: 10  SFKINTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLK 69
           + K++   + I +  ++S  +    +RS  + SV E HE WM++HGR YKDE+EK  R  
Sbjct: 2   AMKVDLMNILITLFFVISMFNTQTRARSQPKLSVSERHELWMSRHGRVYKDEVEKGERFM 61

Query: 70  IFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMP----SPSHRSTTSST 125
           IFKEN+++IE  NK GN +YKLG N+F+D+T+ EF A +TG  +P    SPS  S+T   
Sbjct: 62  IFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTE-- 119

Query: 126 FKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSE 185
           FK  +LS  D+P++LDWR+ GAVT +K+Q  CGCCWAF+AV ++EG  KI +GNL++ SE
Sbjct: 120 FKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSE 179

Query: 186 QQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISN 245
           Q+LLDC+TN N GC GG    AF +II+N GI+ E +Y Y     TC + +K AA +IS+
Sbjct: 180 QELLDCTTN-NYGCNGGFMTNAFDFIIENGGISRESDYEYLGEQYTCRSQEKTAAVQISS 238

Query: 246 YEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTT 305
           Y+ VP G E +LL+AV+ QPVSI IAA S + Q Y  G ++G C  +++HAVT +G+GT 
Sbjct: 239 YQVVPEG-ETSLLQAVTKQPVSIGIAA-SQDLQFYAGGTYDGSCADRINHAVTAIGYGTD 296

Query: 306 EDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYP 346
           E G  YWL+KNSWG +WG+ G+MKI+RD     GLC I   SSYP
Sbjct: 297 EKGQKYWLLKNSWGTSWGENGFMKIIRDSGNPSGLCDIAKMSSYP 341


>gi|400180345|gb|AFP73311.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score =  343 bits (880), Expect = 8e-92,   Method: Compositional matrix adjust.
 Identities = 171/345 (49%), Positives = 235/345 (68%), Gaps = 13/345 (3%)

Query: 10  SFKINTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLK 69
           + K++   + I +  ++S  +     RS  + SV E HE WM++HGR YKDE+EK  R  
Sbjct: 2   AMKVDLMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFM 61

Query: 70  IFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMP----SPSHRSTTSST 125
           IFKEN+++IE  NK GN +YKLG N+F+D+T+ EF A +TG  +P    SPS  S+T   
Sbjct: 62  IFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTE-- 119

Query: 126 FKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSE 185
           FK  +LS  D+P++LDWR+ GAVT +K+Q  CGCCWAF+AV ++EG  KI +GNL++ SE
Sbjct: 120 FKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSE 179

Query: 186 QQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISN 245
           Q+LLDC+TN N GC GG    AF +II+N GI+ E +Y Y     TC + +K AA +IS+
Sbjct: 180 QELLDCTTN-NYGCDGGFMTNAFDFIIENGGISRESDYEYLGQQYTCRSQEKTAAVQISS 238

Query: 246 YEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTT 305
           Y+ VP G E +LL+AV+ QPVSI IAA S + Q Y  G ++G C  +++HAVT +G+GT 
Sbjct: 239 YQVVPEG-ETSLLQAVTKQPVSIGIAA-SQDLQFYAGGTYDGSCADRINHAVTAIGYGTD 296

Query: 306 EDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYP 346
           E+G  YWL+KNSWG +WG+ G+MKI+RD     GLC I   SSYP
Sbjct: 297 ENGQKYWLLKNSWGTSWGENGFMKIIRDYGNPAGLCDIAKMSSYP 341


>gi|400180453|gb|AFP73363.1| cysteine protease [Solanum chilense]
          Length = 344

 Score =  343 bits (879), Expect = 9e-92,   Method: Compositional matrix adjust.
 Identities = 171/345 (49%), Positives = 235/345 (68%), Gaps = 13/345 (3%)

Query: 10  SFKINTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLK 69
           + K++   + I +  ++S  +    +RS  + SV E HE WM++HGR YKDE+EK  R  
Sbjct: 2   AMKVDLMNILITLFFVISMFNTQTRARSQPKLSVSERHELWMSRHGRVYKDEVEKGERFM 61

Query: 70  IFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMP----SPSHRSTTSST 125
           IFKEN+++IE  NK GN +YKLG N+F+D+T+ EF A +TG  +P    SPS  S+T   
Sbjct: 62  IFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPVSSTE-- 119

Query: 126 FKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSE 185
           FK  +LS  D+P++LDWR+ GAVT +K+Q  CGCCWAF+AV ++EG  KI +GNL++ SE
Sbjct: 120 FKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSE 179

Query: 186 QQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISN 245
           Q+LLDC+TN N GC GG    AF +II+N GI+ E +Y Y     TC + +K AA +IS+
Sbjct: 180 QELLDCTTN-NYGCNGGFMTNAFDFIIENGGISRESDYEYLGEQYTCRSQEKTAAVQISS 238

Query: 246 YEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTT 305
           Y+ VP G E +LL+AV+ QPVSI IAA S + Q Y  G ++G C  +++HAVT +G+GT 
Sbjct: 239 YQVVPEG-ETSLLQAVTKQPVSIGIAA-SQDLQFYAGGTYDGSCADRINHAVTAIGYGTD 296

Query: 306 EDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYP 346
           E G  YWL+KNSWG +WG+ G+MKI+RD     GLC I   SSYP
Sbjct: 297 EKGQKYWLLKNSWGTSWGENGFMKIIRDSGNPSGLCDIAKMSSYP 341


>gi|400180347|gb|AFP73312.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score =  342 bits (878), Expect = 1e-91,   Method: Compositional matrix adjust.
 Identities = 171/345 (49%), Positives = 234/345 (67%), Gaps = 13/345 (3%)

Query: 10  SFKINTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLK 69
           + K++   + I +  ++S  +     RS  + SV E HE WM++HGR YKDE+EK  R  
Sbjct: 2   AMKVDLMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFM 61

Query: 70  IFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMP----SPSHRSTTSST 125
           IFKEN+++IE  NK GN +YKLG N+F+D+T+ EF A +TG  +P    SPS  S+T   
Sbjct: 62  IFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTE-- 119

Query: 126 FKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSE 185
           FK  +LS  D+P++LDWR+ GAVT +K+Q  CGCCWAF+AV ++EG  KI +GNL++ SE
Sbjct: 120 FKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSE 179

Query: 186 QQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISN 245
           Q+LLDC+TN N GC GG    AF +II+N GI+ E +Y Y     TC + +K AA +IS+
Sbjct: 180 QELLDCTTN-NYGCDGGFMTNAFDFIIENGGISRESDYEYLGQQYTCRSQEKTAAVQISS 238

Query: 246 YEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTT 305
           Y+ VP G E +LL+AV+ QPVSI IAA S + Q Y  G ++G C  +++HAVT +G+GT 
Sbjct: 239 YQVVPEG-ETSLLQAVTKQPVSIGIAA-SQDLQFYAGGTYDGSCADRINHAVTAIGYGTD 296

Query: 306 EDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYP 346
           E G  YWL+KNSWG +WG+ G+MKI+RD     GLC I   SSYP
Sbjct: 297 EKGQKYWLLKNSWGTSWGENGFMKIIRDSGNPSGLCDIAKMSSYP 341


>gi|400180365|gb|AFP73321.1| cysteine protease [Solanum peruvianum]
 gi|400180395|gb|AFP73336.1| cysteine protease [Solanum peruvianum]
 gi|400180405|gb|AFP73341.1| cysteine protease [Solanum peruvianum]
 gi|400180409|gb|AFP73343.1| cysteine protease [Solanum peruvianum]
 gi|400180411|gb|AFP73344.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score =  342 bits (878), Expect = 1e-91,   Method: Compositional matrix adjust.
 Identities = 171/345 (49%), Positives = 234/345 (67%), Gaps = 13/345 (3%)

Query: 10  SFKINTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLK 69
           + K++   + I +  ++S  +     RS  + SV E HE WM++HGR YKDE+EK  R  
Sbjct: 2   AMKVDLMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFM 61

Query: 70  IFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMP----SPSHRSTTSST 125
           IFKEN+++IE  NK GN +YKLG N+F+D+T+ EF A +TG  +P    SPS  S+T   
Sbjct: 62  IFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTE-- 119

Query: 126 FKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSE 185
           FK  +LS  D+P++LDWR+ GAVT +K+Q  CGCCWAF+AV ++EG  KI +GNL++ SE
Sbjct: 120 FKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSE 179

Query: 186 QQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISN 245
           Q+LLDC+TN N GC GG    AF +II+N GI+ E +Y Y     TC + +K AA +IS+
Sbjct: 180 QELLDCTTN-NYGCNGGFMTNAFDFIIENGGISRESDYEYLGQQYTCRSQEKTAAVQISS 238

Query: 246 YEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTT 305
           Y+ VP G E +LL+AV+ QPVSI IAA S + Q Y  G ++G C  +++HAVT +G+GT 
Sbjct: 239 YKVVPEG-ETSLLQAVTKQPVSIGIAA-SQDLQFYAGGTYDGSCADRINHAVTAIGYGTD 296

Query: 306 EDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYP 346
           E G  YWL+KNSWG +WG+ G+MKI+RD     GLC I   SSYP
Sbjct: 297 EKGQKYWLLKNSWGTSWGENGFMKIIRDSGNPSGLCDIAKMSSYP 341


>gi|400180451|gb|AFP73362.1| cysteine protease [Solanum chilense]
          Length = 344

 Score =  342 bits (878), Expect = 1e-91,   Method: Compositional matrix adjust.
 Identities = 171/345 (49%), Positives = 234/345 (67%), Gaps = 13/345 (3%)

Query: 10  SFKINTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLK 69
           + K++   + I +  ++S  +     RS  E SV E HE WM++HGR YKDE+EK  R  
Sbjct: 2   AMKVDLMNILITLFFVISMFNTQTRGRSQPELSVSERHELWMSRHGRVYKDEVEKGERFM 61

Query: 70  IFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMP----SPSHRSTTSST 125
           IFKEN+++IE  NK GN +YKLG N+F+D+T+ EF A +TG  +P    SPS  S+T   
Sbjct: 62  IFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTE-- 119

Query: 126 FKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSE 185
           FK  +LS  D+P++LDWR+ GAVT +K+Q  CGCCWAF+AV ++EG  KI +GNL++ SE
Sbjct: 120 FKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSE 179

Query: 186 QQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISN 245
           Q+LLDC+TN N GC GG    AF +I +N GI++E +Y Y     TC + +K AA +IS+
Sbjct: 180 QELLDCTTN-NYGCDGGFMTNAFDFIKENGGISSESDYEYLGQQYTCRSQEKTAAVQISS 238

Query: 246 YEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTT 305
           Y+ VP G E +LL+AV+ QPVSI IAA S + Q Y  G ++G C  +++HAVT +G+GT 
Sbjct: 239 YQVVPEG-ETSLLQAVTKQPVSIGIAA-SQDLQFYAGGTYDGSCADRINHAVTAIGYGTD 296

Query: 306 EDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYP 346
           E G  YWL+KNSWG +WG+ G+MKI+RD     GLC I   SSYP
Sbjct: 297 EKGQKYWLLKNSWGTSWGENGFMKIIRDSGDPSGLCDIAKMSSYP 341


>gi|400180445|gb|AFP73359.1| cysteine protease, partial [Solanum chilense]
          Length = 345

 Score =  342 bits (877), Expect = 1e-91,   Method: Compositional matrix adjust.
 Identities = 171/345 (49%), Positives = 234/345 (67%), Gaps = 13/345 (3%)

Query: 10  SFKINTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLK 69
           + K++   + I +  ++S  +     RS  E SV E HE WM++HGR YKDE+EK  R  
Sbjct: 2   AMKVDLMNILITLFFVISMFNTQTRGRSQPELSVSERHELWMSRHGRVYKDEVEKGERFM 61

Query: 70  IFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMP----SPSHRSTTSST 125
           IFKEN+++IE  NK GN +YKLG N+F+D+T+ EF A +TG  +P    SPS  S+T   
Sbjct: 62  IFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTE-- 119

Query: 126 FKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSE 185
           FK  +LS  D+P++LDWR+ GAVT +K+Q  CGCCWAF+AV ++EG  KI +GNL++ SE
Sbjct: 120 FKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSE 179

Query: 186 QQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISN 245
           Q+LLDC+TN N GC GG    AF +I +N GI++E +Y Y     TC + +K AA +IS+
Sbjct: 180 QELLDCTTN-NYGCDGGFMTNAFDFIKENGGISSESDYEYLGQQYTCRSQEKTAAVQISS 238

Query: 246 YEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTT 305
           Y+ VP G E +LL+AV+ QPVSI IAA S + Q Y  G ++G C  +++HAVT +G+GT 
Sbjct: 239 YQVVPEG-ETSLLQAVTKQPVSIGIAA-SQDLQFYAGGTYDGSCADRINHAVTAIGYGTD 296

Query: 306 EDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYP 346
           E G  YWL+KNSWG +WG+ G+MKI+RD     GLC I   SSYP
Sbjct: 297 EKGQKYWLLKNSWGTSWGENGFMKIIRDSGDPSGLCDIAKMSSYP 341


>gi|356517350|ref|XP_003527350.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like [Glycine
           max]
 gi|356577765|ref|XP_003556993.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like [Glycine
           max]
          Length = 343

 Score =  342 bits (877), Expect = 1e-91,   Method: Compositional matrix adjust.
 Identities = 170/325 (52%), Positives = 227/325 (69%), Gaps = 15/325 (4%)

Query: 33  VSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLG 92
           V+SR+  + S+ E HE+WMA++G+ YKD  EKE R ++FKEN+ YIE  N   N+ YKLG
Sbjct: 25  VASRTLQDASMYERHEQWMARYGKVYKDPEEKEKRFRVFKENVNYIEAFNNAANKPYKLG 84

Query: 93  TNQFSDLTNDEF---RALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVT 149
            NQF+DLT++EF   R  + G+   S    +T ++TFKY+N+++  +P S+DWR KGAVT
Sbjct: 85  INQFADLTSEEFIVPRNRFNGHTRSS----NTRTTTFKYENVTV--LPDSIDWRQKGAVT 138

Query: 150 PIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-NNGCLGGSREKAF 208
           PIKNQ  CGCCWAF+A+AA EGI KI +G L+ LSEQ+++DC T G ++GC GG  + AF
Sbjct: 139 PIKNQGSCGCCWAFSAIAATEGIHKISTGKLVSLSEQEVVDCDTKGTDHGCEGGYMDGAF 198

Query: 209 AYIIQNQGIATEDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVS 267
            +IIQN GI TE  YPY+ V G C+  ++   AA I+ YE+VP  +E+AL KAV+ QPVS
Sbjct: 199 KFIIQNHGINTEASYPYKGVDGKCNIKEEAVHAATITGYEDVPINNEKALQKAVANQPVS 258

Query: 268 IAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGY 327
           +AI A   +FQ YK GIF G CGT+LDH VT VG+G   +G  YWL+KNSWG  WG+ GY
Sbjct: 259 VAIDASGADFQFYKSGIFTGSCGTELDHGVTAVGYGENNEGTKYWLVKNSWGTEWGEEGY 318

Query: 328 MKIVRD----EGLCGIGTRSSYPLA 348
           + + R     EG+CGI   +SYP A
Sbjct: 319 IMMQRGVKAVEGICGIAMMASYPTA 343


>gi|400180389|gb|AFP73333.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score =  342 bits (877), Expect = 1e-91,   Method: Compositional matrix adjust.
 Identities = 171/345 (49%), Positives = 234/345 (67%), Gaps = 13/345 (3%)

Query: 10  SFKINTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLK 69
           + K++   + I +  ++S  +     RS  + SV E HE WM++HGR YKDE+EK  R  
Sbjct: 2   AMKVDLMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFM 61

Query: 70  IFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMP----SPSHRSTTSST 125
           IFKEN+++IE  NK GN +YKLG N+F+D+T+ EF A +TG  +P    SPS  S+T   
Sbjct: 62  IFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTE-- 119

Query: 126 FKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSE 185
           FK  +LS  D+P++LDWR+ GAVT +K+Q  CGCCWAF+AV ++EG  KI +GNL++ SE
Sbjct: 120 FKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSE 179

Query: 186 QQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISN 245
           Q+LLDC+TN N GC GG    AF +II+N GI+ E +Y Y     TC + +K AA +IS+
Sbjct: 180 QELLDCTTN-NYGCNGGFMTNAFDFIIENGGISRESDYEYLGEQYTCRSQEKTAAVQISS 238

Query: 246 YEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTT 305
           Y+ VP G E +LL+AV+ QPVSI IAA S + Q Y  G ++G C  +++HAVT +G+GT 
Sbjct: 239 YKVVPEG-ETSLLQAVTKQPVSIGIAA-SQDLQFYAGGTYDGSCADRINHAVTAIGYGTD 296

Query: 306 EDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYP 346
           E G  YWL+KNSWG +WG+ G+MKI+RD     GLC I   SSYP
Sbjct: 297 EKGQKYWLLKNSWGTSWGENGFMKIIRDSGNPSGLCDIAKMSSYP 341


>gi|400180367|gb|AFP73322.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score =  342 bits (877), Expect = 2e-91,   Method: Compositional matrix adjust.
 Identities = 171/345 (49%), Positives = 234/345 (67%), Gaps = 13/345 (3%)

Query: 10  SFKINTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLK 69
           + K++   + I +  ++S  +     RS  + SV E HE WM++HGR YKDE+EK  R  
Sbjct: 2   AMKVDLMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKVERFM 61

Query: 70  IFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMP----SPSHRSTTSST 125
           IFKEN+++IE  NK GN +YKLG N+F+D+T+ EF A +TG  +P    SPS  S+T   
Sbjct: 62  IFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTE-- 119

Query: 126 FKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSE 185
           FK  +LS  D+P++LDWR+ GAVT +K+Q  CGCCWAF+AV ++EG  KI +GNL++ SE
Sbjct: 120 FKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSE 179

Query: 186 QQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISN 245
           Q+LLDC+TN N GC GG    AF +II+N GI+ E +Y Y     TC + +K AA +IS+
Sbjct: 180 QELLDCTTN-NYGCNGGFMTNAFDFIIENGGISRESDYEYLGEQYTCRSQEKTAAVQISS 238

Query: 246 YEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTT 305
           Y+ VP G E +LL+AV+ QPVSI IAA S + Q Y  G ++G C  +++HAVT +G+GT 
Sbjct: 239 YKVVPEG-ETSLLQAVTKQPVSIGIAA-SQDLQFYAGGTYDGSCADRINHAVTAIGYGTD 296

Query: 306 EDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYP 346
           E G  YWL+KNSWG +WG+ G+MKI+RD     GLC I   SSYP
Sbjct: 297 EKGQKYWLLKNSWGTSWGENGFMKIIRDYGNPAGLCDIAKMSSYP 341


>gi|400180449|gb|AFP73361.1| cysteine protease [Solanum chilense]
          Length = 344

 Score =  342 bits (876), Expect = 2e-91,   Method: Compositional matrix adjust.
 Identities = 170/345 (49%), Positives = 235/345 (68%), Gaps = 13/345 (3%)

Query: 10  SFKINTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLK 69
           + K++   + I +  ++S  +     RS  + SV E HE WM++HGR YKDE+EK  R  
Sbjct: 2   AMKVDLMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFM 61

Query: 70  IFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMP----SPSHRSTTSST 125
           IFK+N+++IE  NK GN +YKLG N+F+D+T+ EF A +TG  +P    SPS  S+T   
Sbjct: 62  IFKKNMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTE-- 119

Query: 126 FKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSE 185
           FK  +LS  D+P++LDWR+ GAVT +K+Q +CGCCWAF+AV ++EG  KI +G L++ SE
Sbjct: 120 FKINDLSDDDMPSNLDWRESGAVTQVKHQGQCGCCWAFSAVGSLEGAYKIATGKLMEFSE 179

Query: 186 QQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISN 245
           Q+LLDC+TN N GC GG    AF +II+N GI+ E +Y Y     TC + +K AA +IS+
Sbjct: 180 QELLDCTTN-NYGCNGGFMTNAFDFIIENGGISRESDYEYLGEQYTCRSQEKTAAVQISS 238

Query: 246 YEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTT 305
           Y+ VP G E +LL+AV+ QPVSI IAA S + Q Y EG ++G C  +++HAVT +G+GT 
Sbjct: 239 YQVVPEG-ETSLLQAVTKQPVSIGIAA-SQDLQFYAEGTYDGSCADRINHAVTAIGYGTD 296

Query: 306 EDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYP 346
           E G  YWL+KNSWG +WG+ G+MKI+RD     GLC I   SSYP
Sbjct: 297 EKGQKYWLLKNSWGTSWGENGFMKIIRDSGNPSGLCDIAKMSSYP 341


>gi|310656789|gb|ADP02218.1| Peptidase_C1 domain-containing protein [Triticum aestivum]
          Length = 341

 Score =  342 bits (876), Expect = 2e-91,   Method: Compositional matrix adjust.
 Identities = 160/335 (47%), Positives = 221/335 (65%), Gaps = 9/335 (2%)

Query: 18  MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
           +  I+  +  C+S V+S+R   + ++VE HE+WMA+  R YKD  EK  R ++FK N+ +
Sbjct: 8   LLAIVGCICLCSSAVLSARELGDTAMVERHEQWMAKFNRVYKDGTEKAQRFEVFKANVAF 67

Query: 78  IEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVP 137
           IE  N E NR + LG NQF+DLTNDEFRA  T   +     R+ T   FKY N+S+  +P
Sbjct: 68  IESFNAE-NRKFWLGVNQFTDLTNDEFRATKTNKGLKMSGGRAPTG--FKYSNVSIDALP 124

Query: 138 TSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-N 196
           T++DWR KG VTPIK+Q +CGCCWAF+AV A EGI K+ +G LI LSEQ+L+DC  +G +
Sbjct: 125 TAVDWRTKGVVTPIKDQGQCGCCWAFSAVVATEGIVKLSTGKLISLSEQELVDCDVHGVD 184

Query: 197 NGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTC-SAAQKPAAAKISNYEEVPSGDEQ 255
            GC GG  + AF +II+N G+ TE  YPY A  G C ++    + A I  YE+VP+ DE 
Sbjct: 185 QGCEGGEMDDAFKFIIKNGGLTTEANYPYTAQDGQCKTSIASNSVATIKGYEDVPANDES 244

Query: 256 ALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIK 315
           +L+KAV+ QPVS+A+      FQ Y  G+  G CGT LDH +  +G+G T DG  YWL+K
Sbjct: 245 SLMKAVANQPVSVAVDGGDVIFQHYSGGVMTGSCGTDLDHGIAAIGYGMTSDGTKYWLLK 304

Query: 316 NSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYP 346
           NSWG TWG++GY+++ +D     G+CG+  + SYP
Sbjct: 305 NSWGTTWGESGYLRMEKDISDKSGMCGLAMQPSYP 339


>gi|400180375|gb|AFP73326.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score =  342 bits (876), Expect = 2e-91,   Method: Compositional matrix adjust.
 Identities = 170/345 (49%), Positives = 235/345 (68%), Gaps = 13/345 (3%)

Query: 10  SFKINTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLK 69
           + K++   + I +  ++S  +    +RS  + SV E HE WM++HGR YKDE+EK  R  
Sbjct: 2   AMKVDLMSILITLFFVISMFNSQTRARSQPKLSVSERHELWMSRHGRVYKDEVEKGERFM 61

Query: 70  IFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMP----SPSHRSTTSST 125
           IFKEN+++IE  NK GN +YKLG N+F+D+T+ EF A +TG  +P    SPS  S+T   
Sbjct: 62  IFKENIKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTE-- 119

Query: 126 FKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSE 185
           FK  +LS  D+P++LDWR+ GAVT +K+Q  CGCCWAF+AV ++EG  KI +GNL++ SE
Sbjct: 120 FKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSE 179

Query: 186 QQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISN 245
           Q+LLDC+TN N GC GG    AF +I +N GI++E +Y Y     TC + +K AA +IS+
Sbjct: 180 QELLDCTTN-NYGCDGGFMTNAFDFIKENGGISSESDYEYLGEQYTCRSQEKTAAVQISS 238

Query: 246 YEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTT 305
           Y+ VP G E +LL+AV+ QPVSI IAA S + Q Y  G ++G C  +++HAVT +G+GT 
Sbjct: 239 YQVVPEG-ETSLLQAVTKQPVSIGIAA-SQDLQFYAGGTYDGSCADRINHAVTAIGYGTD 296

Query: 306 EDGANYWLIKNSWGNTWGDAGYMKIVRDE----GLCGIGTRSSYP 346
           E G  YWL+KNSWG +WG+ G+MKI+RD     GLC I   SSYP
Sbjct: 297 EKGQKYWLLKNSWGTSWGENGFMKIIRDSGNPAGLCDIAKMSSYP 341


>gi|18422605|ref|NP_568651.1| senescence-associated protein 12 [Arabidopsis thaliana]
 gi|13877737|gb|AAK43946.1|AF370131_1 putative senescence-specific cysteine protease SAG12 [Arabidopsis
           thaliana]
 gi|9758936|dbj|BAB09317.1| senescence-specific cysteine protease [Arabidopsis thaliana]
 gi|14532898|gb|AAK64131.1| putative senescence-specific cysteine protease SAG12 [Arabidopsis
           thaliana]
 gi|332007929|gb|AED95312.1| senescence-associated protein 12 [Arabidopsis thaliana]
          Length = 346

 Score =  342 bits (876), Expect = 2e-91,   Method: Compositional matrix adjust.
 Identities = 168/338 (49%), Positives = 224/338 (66%), Gaps = 10/338 (2%)

Query: 18  MFIIITLLVS-CASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLE 76
           +F+ + +  S C S  +S    +E  + + H +WM +HGR Y D  E+  R  +FK N+E
Sbjct: 8   IFLFVAIFSSFCFSITLSRPLDNELIMQKRHIEWMTKHGRVYADVKEENNRYVVFKNNVE 67

Query: 77  YIEKANK-EGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSP--SHRSTTSSTFKYQNLSM 133
            IE  N     RT+KL  NQF+DLTNDEFR++YTG+K  S   S   T  S F+YQN+S 
Sbjct: 68  RIEHLNSIPAGRTFKLAVNQFADLTNDEFRSMYTGFKGVSALSSQSQTKMSPFRYQNVSS 127

Query: 134 TDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCST 193
             +P S+DWR KGAVTPIKNQ  CGCCWAF+AVAA+EG T+I+ G LI LSEQQL+DC T
Sbjct: 128 GALPVSVDWRKKGAVTPIKNQGSCGCCWAFSAVAAIEGATQIKKGKLISLSEQQLVDCDT 187

Query: 194 NGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQ-KPAAAKISNYEEVPSG 252
           N + GC GG  + AF +I    G+ TE  YPY+    TC++ +  P A  I+ YE+VP  
Sbjct: 188 N-DFGCEGGLMDTAFEHIKATGGLTTESNYPYKGEDATCNSKKTNPKATSITGYEDVPVN 246

Query: 253 DEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYW 312
           DEQAL+KAV+ QPVS+ I     +FQ Y  G+F G C T LDHAVT +G+G + +G+ YW
Sbjct: 247 DEQALMKAVAHQPVSVGIEGGGFDFQFYSSGVFTGECTTYLDHAVTAIGYGESTNGSKYW 306

Query: 313 LIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYP 346
           +IKNSWG  WG++GYM+I +D    +GLCG+  ++SYP
Sbjct: 307 IIKNSWGTKWGESGYMRIQKDVKDKQGLCGLAMKASYP 344


>gi|357160300|ref|XP_003578721.1| PREDICTED: oryzain beta chain-like [Brachypodium distachyon]
          Length = 349

 Score =  342 bits (876), Expect = 2e-91,   Method: Compositional matrix adjust.
 Identities = 161/344 (46%), Positives = 229/344 (66%), Gaps = 14/344 (4%)

Query: 17  PMFIIITLLVSC---ASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKE 73
           P   ++ +++ C    S V+S+R   + ++VE HE+WMAQHGR YKD  EK  R + F+ 
Sbjct: 4   PKVFLLAVVLGCICLCSTVLSARELGDAAMVERHEQWMAQHGRVYKDGAEKARRFEAFRN 63

Query: 74  NLEYIEKANKEGNR-TYKLGTNQFSDLTNDEFRALYT--GY--KMPSPSHRSTTSSTFKY 128
           N+ +IE  N  GNR  + LG NQF+DLTNDEFRA  T  G+  +  +  ++++ + TF+Y
Sbjct: 64  NVVFIESFNAAGNRRKFWLGVNQFTDLTNDEFRATKTNKGFIKRNAAAVNKASPTGTFRY 123

Query: 129 QNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQL 188
            N+S   +P ++DWR KGAVTPIKNQ +CGCCWAF+AVAA EGI ++ +G L+ LSEQ+L
Sbjct: 124 SNVSADALPAAVDWRAKGAVTPIKNQGQCGCCWAFSAVAATEGIVQLSTGKLVPLSEQEL 183

Query: 189 LDCSTNG-NNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQK-PAAAKISNY 246
           +DC  NG ++GC GG  + AF +II+N G+ +E  YPY A  G C A     + A I  Y
Sbjct: 184 VDCDANGADHGCEGGEMDDAFEFIIKNGGLTSETNYPYTAQDGQCKAKNTINSVATIKGY 243

Query: 247 EEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTE 306
           E+VP+ DE +L+KAV+ QPVS+A+      FQ Y  G+ +G CGT LDH +  VG+G  +
Sbjct: 244 EDVPANDEASLMKAVAAQPVSVAVDGGDMVFQHYAGGVLSGSCGTSLDHGIVAVGYGAAD 303

Query: 307 DGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYP 346
           DG  +WL+KNSWG TWG+ GY+++ +D     G+CG+  + SYP
Sbjct: 304 DGTKFWLMKNSWGTTWGEDGYIRMEKDVADAGGMCGLAMQPSYP 347


>gi|400180381|gb|AFP73329.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score =  341 bits (875), Expect = 3e-91,   Method: Compositional matrix adjust.
 Identities = 171/345 (49%), Positives = 234/345 (67%), Gaps = 13/345 (3%)

Query: 10  SFKINTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLK 69
           + K++   + I +  ++S  +     RS  + SV E HE WM++HGR YKDE+EK  R  
Sbjct: 2   AMKVDLMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFM 61

Query: 70  IFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMP----SPSHRSTTSST 125
           IFKEN+++IE  NK GN +YKLG N+F+D+T+ EF A +TG  +P    SPS  S+T   
Sbjct: 62  IFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTE-- 119

Query: 126 FKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSE 185
           FK  +LS  D+P++LDWR+ GAVT +K+Q  CGCCWAF+AV ++EG  KI +GNL++ SE
Sbjct: 120 FKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSE 179

Query: 186 QQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISN 245
           Q+LLDC+TN N GC GG    AF +II+N GI+ E +Y Y     TC + +K AA +IS+
Sbjct: 180 QELLDCTTN-NYGCNGGFMTNAFDFIIENGGISRESDYEYLGQQYTCRSQEKTAAVQISS 238

Query: 246 YEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTT 305
           Y+ VP G E +LL+AV+ QPVSI IAA S + Q Y  G ++G C  +++HAVT +G+GT 
Sbjct: 239 YKVVPEG-ETSLLQAVTKQPVSIGIAA-SQDLQFYAGGTYDGSCADRINHAVTAIGYGTD 296

Query: 306 EDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYP 346
           E G  YWL+KNSWG +WG+ G+MKI+RD     GLC I   SSYP
Sbjct: 297 EKGQKYWLLKNSWGTSWGENGFMKIIRDSGDPSGLCDITKMSSYP 341


>gi|242072398|ref|XP_002446135.1| hypothetical protein SORBIDRAFT_06g002170 [Sorghum bicolor]
 gi|241937318|gb|EES10463.1| hypothetical protein SORBIDRAFT_06g002170 [Sorghum bicolor]
          Length = 338

 Score =  341 bits (875), Expect = 3e-91,   Method: Compositional matrix adjust.
 Identities = 159/339 (46%), Positives = 227/339 (66%), Gaps = 9/339 (2%)

Query: 13  INTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFK 72
           +++    + I    S  S V+++R   + ++VE HE WM ++GR YKD  EK  R ++FK
Sbjct: 2   VSSKAFLLAILGCASLCSSVLAARELSDAAMVERHENWMVEYGRVYKDAAEKARRFEVFK 61

Query: 73  ENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLS 132
           +N+ ++E  N   N  + LG NQF+DLT +EF+A   G+K  S     TT   FKY+NLS
Sbjct: 62  DNVAFVESFNTNKNNKFWLGINQFADLTIEEFKA-NKGFKPISAEKVPTTG--FKYENLS 118

Query: 133 MTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS 192
           ++ +PT++DWR KGAVTPIKNQ +CGCCWAF+AVAA+EGI K+ +GNLI LSEQ+L+DC 
Sbjct: 119 VSALPTAVDWRTKGAVTPIKNQGQCGCCWAFSAVAAMEGIVKLSTGNLISLSEQELVDCD 178

Query: 193 TNG-NNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPS 251
           T+  + GC GG  + AF ++I+N G+AT   YPY+AV G C    K +AA I  +E+VP 
Sbjct: 179 THSMDEGCEGGWMDSAFEFVIKNGGLATVSSYPYKAVDGKCKGGSK-SAATIKGHEDVPV 237

Query: 252 GDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANY 311
            DE AL+KAV+ QPVS+A+ A    F  Y  G+  G CGT+LDH +  +G+G   DG  Y
Sbjct: 238 NDEAALMKAVANQPVSVAVDASDRTFMLYSGGVMTGSCGTELDHGIAAIGYGVESDGTKY 297

Query: 312 WLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYP 346
           W++KNSWG TWG+ G++++ +D    +G+CG+  + SYP
Sbjct: 298 WILKNSWGTTWGEKGFLRMEKDISDKQGMCGLAMKPSYP 336


>gi|413944253|gb|AFW76902.1| hypothetical protein ZEAMMB73_056195 [Zea mays]
          Length = 340

 Score =  341 bits (874), Expect = 3e-91,   Method: Compositional matrix adjust.
 Identities = 160/336 (47%), Positives = 220/336 (65%), Gaps = 12/336 (3%)

Query: 18  MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
           +  +++    C + + +     + ++V  HE+WMAQ+ R YKD  EK  R ++FK N+++
Sbjct: 8   ILAVLSFAFFCGAALAARDLNEDSAMVARHEQWMAQYSRVYKDAAEKARRFEVFKANVKF 67

Query: 78  IEKANKEGNRTYKLGTNQFSDLTNDEFRALYT--GYKMPSPSHRSTTSSTFKYQNLSMTD 135
           IE  N  GNR + LG NQF+DLTNDEFR   T  G+K PS    ST    F+Y+N+S+  
Sbjct: 68  IESFNTGGNRKFWLGINQFADLTNDEFRTTKTNKGFK-PSLDKVSTG---FRYENVSVDA 123

Query: 136 VPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG 195
           +P ++DWR  GAVTPIK+Q +CGCCWAF+AVAA EGI KI +G LI LSEQ+L+DC  +G
Sbjct: 124 IPATIDWRTNGAVTPIKDQGQCGCCWAFSAVAATEGIVKISTGKLISLSEQELVDCDVHG 183

Query: 196 -NNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDE 254
            + GC GG  + AF +II+N G+ TE  YPY A  G C +    +AA I  YE+VP+ DE
Sbjct: 184 EDQGCEGGLMDDAFKFIIKNGGLTTESNYPYTAADGKCKSGSN-SAANIKGYEDVPTNDE 242

Query: 255 QALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLI 314
            AL+KAV+ QPVS+A+      FQ Y  G+  G CGT LDH +  +G+G T DG  YWL+
Sbjct: 243 AALMKAVANQPVSVAVDGGDMTFQFYSGGVMTGSCGTDLDHGIAAIGYGKTSDGTKYWLM 302

Query: 315 KNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYP 346
           KNSWG TWG+ GY+++ +D    +G+CG+    SYP
Sbjct: 303 KNSWGTTWGENGYLRMEKDISDKKGMCGLAMEPSYP 338


>gi|400180359|gb|AFP73318.1| cysteine protease [Solanum peruvianum]
 gi|400180477|gb|AFP73375.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score =  341 bits (874), Expect = 4e-91,   Method: Compositional matrix adjust.
 Identities = 170/345 (49%), Positives = 234/345 (67%), Gaps = 13/345 (3%)

Query: 10  SFKINTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLK 69
           + K++   + I +  ++S  +    +RS  + SV E HE WM++HGR YKDE+EK  R  
Sbjct: 2   AMKVDLMSILITLFFVISMFNSQTRARSQPKLSVSERHELWMSRHGRVYKDEVEKGERFM 61

Query: 70  IFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMP----SPSHRSTTSST 125
           IFKEN+++IE  NK GN +YKLG N+F+D+T+ EF A +TG  +P    SPS  S+T   
Sbjct: 62  IFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTE-- 119

Query: 126 FKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSE 185
           FK  +LS  D+P++LDWR+ GAVT +K+Q  CGCCWAF+AV ++EG  KI +GNL++ SE
Sbjct: 120 FKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSE 179

Query: 186 QQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISN 245
           Q+LLDC+TN N GC GG    AF +I +N GI+ E +Y Y     TC + +K AA +IS+
Sbjct: 180 QELLDCTTN-NYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQYTCRSQEKTAAVQISS 238

Query: 246 YEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTT 305
           Y+ VP G E +LL+AV+ QPVSI IAA S + Q Y  G ++G C  +++HAVT +G+GT 
Sbjct: 239 YQVVPEG-ETSLLQAVTKQPVSIGIAA-SQDLQFYAGGTYDGSCADRINHAVTAIGYGTD 296

Query: 306 EDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYP 346
           E G  YWL+KNSWG +WG+ G+MKI+RD     GLC I   SSYP
Sbjct: 297 EKGQKYWLLKNSWGTSWGENGFMKIIRDYGNPAGLCDIAKMSSYP 341


>gi|413953667|gb|AFW86316.1| hypothetical protein ZEAMMB73_635707 [Zea mays]
          Length = 340

 Score =  341 bits (874), Expect = 4e-91,   Method: Compositional matrix adjust.
 Identities = 156/334 (46%), Positives = 218/334 (65%), Gaps = 8/334 (2%)

Query: 18  MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
           +  I+     C + + +   + + ++V  HE+WMAQ+ R YKD  EK  R ++FK N+++
Sbjct: 8   ILAILGFAFFCGAALAARDLSDDSAMVARHEQWMAQYSRVYKDASEKARRFEVFKANVKF 67

Query: 78  IEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVP 137
           IE  N  GN  + LG NQF+DLTNDEFR++ T     S + +  T   F+Y+N+S+  +P
Sbjct: 68  IESFNAGGNNKFWLGVNQFADLTNDEFRSIKTNKGFKSSNMKIPTG--FRYENVSVDALP 125

Query: 138 TSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-N 196
           T++DWR KGAVTPIK+Q +CGCCWAF+AVAA EGI KI +G L+ L+EQ+L+DC  +G +
Sbjct: 126 TTIDWRTKGAVTPIKDQGQCGCCWAFSAVAATEGIVKISTGKLVSLAEQELVDCDVHGED 185

Query: 197 NGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQA 256
            GC GG  + AF +II N G+ TE  YPY A  G C +    +AA I  YE+VP+ DE A
Sbjct: 186 QGCEGGLMDDAFKFIINNGGLTTESSYPYTAADGKCKSGSN-SAATIKGYEDVPANDEAA 244

Query: 257 LLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKN 316
           L+KAV+ QPVS+A+      FQ Y  G+  G CGT LDH +  +G+G T DG  YWL+KN
Sbjct: 245 LMKAVANQPVSVAVDGGDMTFQFYSSGVMTGSCGTDLDHGIAAIGYGKTSDGTKYWLMKN 304

Query: 317 SWGNTWGDAGYMKIVRD----EGLCGIGTRSSYP 346
           SWG TWG+ GY+++ +D     G+CG+    SYP
Sbjct: 305 SWGTTWGENGYLRMEKDISDKRGMCGLAMEPSYP 338


>gi|400180463|gb|AFP73368.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score =  340 bits (873), Expect = 4e-91,   Method: Compositional matrix adjust.
 Identities = 170/345 (49%), Positives = 234/345 (67%), Gaps = 13/345 (3%)

Query: 10  SFKINTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLK 69
           + K++   + I +  ++S  +    +RS  + SV E HE WM++HGR YKDE+EK  R  
Sbjct: 2   AMKVDLMNILITLFFVISMFNTQTRARSQPKLSVSERHELWMSRHGRVYKDEVEKGERFM 61

Query: 70  IFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMP----SPSHRSTTSST 125
           IFKEN+++IE  NK GN +YKLG N+F+D+T+ EF A +TG  +P    SPS  S+T   
Sbjct: 62  IFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTE-- 119

Query: 126 FKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSE 185
           FK  +LS  D+P++LDWR+ GAVT +K+Q  CGCCWAF+AV ++EG  KI +GNL++ SE
Sbjct: 120 FKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSE 179

Query: 186 QQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISN 245
           Q+LLDC+TN N GC GG    AF +I +N GI+ E +Y Y     TC + +K AA +IS+
Sbjct: 180 QELLDCTTN-NYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQYTCRSQEKTAAVQISS 238

Query: 246 YEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTT 305
           Y+ VP G E +LL+AV+ QPVSI IAA S + Q Y  G ++G C  +++HAVT +G+GT 
Sbjct: 239 YQVVPEG-ETSLLQAVTKQPVSIGIAA-SQDLQFYAGGTYDGSCADRINHAVTAIGYGTD 296

Query: 306 EDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYP 346
           E G  YWL+KNSWG +WG+ G+MKI+RD     GLC I   SSYP
Sbjct: 297 EKGQKYWLLKNSWGTSWGENGFMKIIRDSGDPSGLCDIAKMSSYP 341


>gi|356517358|ref|XP_003527354.1| PREDICTED: LOW QUALITY PROTEIN: vignain-like [Glycine max]
 gi|356577767|ref|XP_003556994.1| PREDICTED: LOW QUALITY PROTEIN: vignain-like [Glycine max]
          Length = 343

 Score =  340 bits (873), Expect = 5e-91,   Method: Compositional matrix adjust.
 Identities = 168/324 (51%), Positives = 223/324 (68%), Gaps = 13/324 (4%)

Query: 33  VSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLG 92
           V+ R+  + S+ E HE+WM ++ + YKD  E+E R KIFKEN+ YIE  N   N+ Y LG
Sbjct: 25  VTCRTLQDASMYERHEEWMGRYAKVYKDPQERERRFKIFKENVNYIEAFNNAANKPYTLG 84

Query: 93  TNQFSDLTNDEFRALYTGYK--MPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTP 150
            NQF+DLTN+EF A    +K  M S   R+TT   FKY+N+  T +P+++DWR KGAVTP
Sbjct: 85  INQFADLTNEEFIAPRNRFKGHMCSSITRTTT---FKYENV--TAIPSTVDWRQKGAVTP 139

Query: 151 IKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-NNGCLGGSREKAFA 209
           IK+Q +CGCCWAF+AVAA EGI  + +G LI LSEQ+++DC T G + GC GG  + AF 
Sbjct: 140 IKDQGQCGCCWAFSAVAATEGIHALSAGKLISLSEQEVVDCDTKGEDQGCAGGFMDGAFK 199

Query: 210 YIIQNQGIATEDEYPYQAVPGTCSA-AQKPAAAKISNYEEVPSGDEQALLKAVSMQPVSI 268
           +IIQN G+  E  YPY+AV G C+A A     A I+ YE+VP  +E+AL KAV+ QPVS+
Sbjct: 200 FIIQNHGLNNEPNYPYKAVDGKCNAKAAANHVATITGYEDVPVNNEKALQKAVANQPVSV 259

Query: 269 AIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYM 328
           AI A  ++FQ Y+ G+F G CGT+LDH VT VG+G + DG  YWL+KNSWG  WG+ GY+
Sbjct: 260 AIDASGSDFQFYQSGVFTGSCGTELDHGVTAVGYGVSADGTEYWLVKNSWGTEWGEEGYI 319

Query: 329 KIVR----DEGLCGIGTRSSYPLA 348
           ++ R    +EGL GI   +SYP A
Sbjct: 320 RMQRGVKAEEGLXGIAMMASYPTA 343


>gi|20334373|gb|AAM19207.1|AF493232_1 cysteine protease [Solanum pimpinellifolium]
 gi|400180424|gb|AFP73350.1| cysteine protease [Solanum pimpinellifolium]
 gi|400180433|gb|AFP73354.1| cysteine protease [Solanum lycopersicum]
          Length = 344

 Score =  340 bits (872), Expect = 5e-91,   Method: Compositional matrix adjust.
 Identities = 171/345 (49%), Positives = 234/345 (67%), Gaps = 13/345 (3%)

Query: 10  SFKINTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLK 69
           + K++   + I +  ++S  +     RS  + SV E HE WM++HGR YKDE+EK  R  
Sbjct: 2   AMKVDLMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFM 61

Query: 70  IFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMP----SPSHRSTTSST 125
           IFKEN+++IE  NK GN +YKLG N+F+D+T+ EF A +TG  +P    SPS  S+T   
Sbjct: 62  IFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTE-- 119

Query: 126 FKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSE 185
           FK  +LS   +P++LDWR+ GAVT +K+Q  CGCCWAF+AV ++EG  KI +GNL++ SE
Sbjct: 120 FKINDLSDDYMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSE 179

Query: 186 QQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISN 245
           Q+LLDC+TN N GC GG    AF +II+N GI+ E +Y Y     TC + +K AA +IS+
Sbjct: 180 QELLDCTTN-NYGCNGGLMTNAFDFIIENGGISRESDYEYLGEQYTCRSREKTAAVQISS 238

Query: 246 YEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTT 305
           Y+ VP G E +LL+AV+ QPVSI IAA S + Q Y  G ++G C  Q++HAVT +G+GT 
Sbjct: 239 YKVVPEG-ETSLLQAVTKQPVSIGIAA-SQDLQFYAGGTYDGNCADQINHAVTAIGYGTD 296

Query: 306 EDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYP 346
           E+G  YWL+KNSWG +WG+ G+MKI+RD     GLC I   SSYP
Sbjct: 297 EEGQKYWLLKNSWGTSWGENGFMKIIRDSGDPSGLCDIAKMSSYP 341


>gi|242072394|ref|XP_002446133.1| hypothetical protein SORBIDRAFT_06g002160 [Sorghum bicolor]
 gi|241937316|gb|EES10461.1| hypothetical protein SORBIDRAFT_06g002160 [Sorghum bicolor]
          Length = 338

 Score =  340 bits (872), Expect = 5e-91,   Method: Compositional matrix adjust.
 Identities = 161/338 (47%), Positives = 226/338 (66%), Gaps = 10/338 (2%)

Query: 15  TTPMFIIITL-LVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKE 73
           T+  F++  L   S  S V+++R   + ++VE HE WM ++GR YKD  EK  R + FK 
Sbjct: 3   TSKAFLLAILGCASLCSSVLAARELSDAAMVERHENWMVEYGRVYKDAAEKARRFEAFKH 62

Query: 74  NLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSM 133
           N+ ++E  N      + LG NQF+DLT +EF+A   G+K  S     TT   FKY+NLS+
Sbjct: 63  NVAFVESFNTNKKNKFWLGVNQFADLTTEEFKA-NKGFKPISAEMVPTTG--FKYENLSV 119

Query: 134 TDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCST 193
           + +PT++DWR KGAVTPIKNQ +CGCCWAF+AVAA+EGI K+ +GNLI LSEQ+L+DC T
Sbjct: 120 SALPTAVDWRTKGAVTPIKNQGQCGCCWAFSAVAAMEGIVKLSTGNLISLSEQELVDCDT 179

Query: 194 NG-NNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSG 252
           +  + GC GG  + AF ++I+N G+ATE  YPY+AV G C    K +AA I  +E+VP  
Sbjct: 180 HSMDEGCEGGWMDSAFEFVIKNGGLATESSYPYKAVDGKCKGGSK-SAATIKGHEDVPVN 238

Query: 253 DEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYW 312
           DE AL+KAV+ QPVS+A+ A    F  Y  G+  G CGT+LDH +  +G+G   DG  YW
Sbjct: 239 DEAALMKAVANQPVSVAVDASDRTFMLYSGGVMTGSCGTELDHGIAAIGYGVESDGTKYW 298

Query: 313 LIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYP 346
           ++KNSWG TWG+ G++++ +D    +G+CG+  + SYP
Sbjct: 299 ILKNSWGTTWGEKGFLRMEKDISDKQGMCGLAMKPSYP 336


>gi|297851334|ref|XP_002893548.1| peptidase C1A papain family protein [Arabidopsis lyrata subsp.
           lyrata]
 gi|297339390|gb|EFH69807.1| peptidase C1A papain family protein [Arabidopsis lyrata subsp.
           lyrata]
          Length = 346

 Score =  340 bits (871), Expect = 7e-91,   Method: Compositional matrix adjust.
 Identities = 171/342 (50%), Positives = 230/342 (67%), Gaps = 13/342 (3%)

Query: 18  MFIIITLLVSC--ASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENL 75
           MF+ +T+L      SQ  S  + HE  V E H++WM +  R Y DELEK+MR  +FK+NL
Sbjct: 7   MFVSLTILSMSLKVSQATSRVTFHEPIVAEHHQQWMTRFSRVYSDELEKQMRFDVFKKNL 66

Query: 76  EYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYK----MPSPSHRSTTSSTFKYQNL 131
           ++IEK NK+G+RTYKLG N+F+D T +EF A +TG K    +PS         ++ + N+
Sbjct: 67  KFIEKFNKKGDRTYKLGVNEFADWTKEEFIATHTGLKGFNGIPSSEFVDEMIPSWNW-NV 125

Query: 132 SMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDC 191
           S    P   DWR +GAVTP+K Q +CGCCWAF++VAAVEG+TKI  GNL+ LSEQQLLDC
Sbjct: 126 SDVAGPEIKDWRYEGAVTPVKYQGQCGCCWAFSSVAAVEGLTKIVGGNLVSLSEQQLLDC 185

Query: 192 STNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPS 251
               +NGC GG    AF+YII+N+GIA+E  YPYQ   GTC    KP+A  I  ++ VPS
Sbjct: 186 DRERDNGCNGGIMSDAFSYIIKNRGIASEASYPYQETEGTCRYNAKPSAW-IRGFQTVPS 244

Query: 252 GDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFN-GVCGTQLDHAVTIVGFGTTEDGAN 310
            +E+ALL+AVS QPVS++I A    F  Y  G+++   CGT ++HAVT VG+GT+ +G  
Sbjct: 245 NNERALLEAVSRQPVSVSIDADGPGFMHYSGGVYDEPYCGTDVNHAVTFVGYGTSPEGIK 304

Query: 311 YWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPLA 348
           YWL KNSWG TWG+ GY++I RD    +G+CG+   + YP+A
Sbjct: 305 YWLAKNSWGETWGENGYIRIRRDVAWPQGMCGVAQYAFYPVA 346


>gi|144905112|dbj|BAF56429.1| cysteine proteinase [Lotus japonicus]
          Length = 341

 Score =  340 bits (871), Expect = 7e-91,   Method: Compositional matrix adjust.
 Identities = 173/337 (51%), Positives = 222/337 (65%), Gaps = 13/337 (3%)

Query: 19  FIIITLLVSCASQVVSSRSTHEQ--SVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLE 76
            + + LL++     V SR  HE   S++E HE+WMA++ + YKD  EKE R  IFK+N+E
Sbjct: 11  ILALFLLLAVGISRVISRELHETETSLIERHEQWMAKYDKVYKDAAEKEKRFLIFKDNVE 70

Query: 77  YIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDV 136
           +IE  N  GN+ YKLG N  +DLT +EF+A   G K        TTS  FKY+N+  T +
Sbjct: 71  FIESFNAAGNKPYKLGVNHLADLTIEEFKASRNGLKRSYDYEVGTTS--FKYENV--TAI 126

Query: 137 PTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG- 195
           P S+DWR KGAVTPIK+Q +CG CWAF+ VAA EGI KI +G L+ LSEQ+L+DC   G 
Sbjct: 127 PASVDWRKKGAVTPIKDQGQCGSCWAFSTVAATEGIHKISTGKLVSLSEQELVDCDRKGT 186

Query: 196 NNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQ 255
           + GC GG  E  F +II+N GI TE  YPY+AV G+C  A  P AA+I  YE+VP   E+
Sbjct: 187 DQGCEGGYMEDGFEFIIKNGGITTEANYPYKAVDGSCKNATAP-AAQIKGYEKVPVNSEK 245

Query: 256 ALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIK 315
           ALLKAV+ QPVS++I A    F  Y  GIF G CGT+LDH VT VG+G   +G +YW++K
Sbjct: 246 ALLKAVANQPVSVSIDAADGSFMFYSSGIFTGECGTELDHGVTAVGYGRA-NGTDYWIVK 304

Query: 316 NSWGNTWGDAGYMKIVR----DEGLCGIGTRSSYPLA 348
           NSWG  WG+ GY+++ R     EGLCGI   SSYP A
Sbjct: 305 NSWGTVWGEQGYIRMQRGIAAKEGLCGIAMDSSYPTA 341


>gi|400180467|gb|AFP73370.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score =  340 bits (871), Expect = 7e-91,   Method: Compositional matrix adjust.
 Identities = 170/345 (49%), Positives = 234/345 (67%), Gaps = 13/345 (3%)

Query: 10  SFKINTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLK 69
           + K++   + I +  ++S  +    +RS  + SV E HE WM++HGR YKDE+EK  R  
Sbjct: 2   AMKVDLMNILITLFFVISMFNTQTRARSQPKLSVSERHELWMSRHGRVYKDEVEKGERFM 61

Query: 70  IFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMP----SPSHRSTTSST 125
           IFKEN+++IE  NK GN +YKLG N+F+D+T+ EF A +TG  +P    SPS  S+T   
Sbjct: 62  IFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTE-- 119

Query: 126 FKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSE 185
           FK  +LS  D+P++LDWR+ GAVT +K+Q  CGCCWAF+AV ++EG  KI +GNL++ SE
Sbjct: 120 FKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSE 179

Query: 186 QQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISN 245
           Q+LLDC+TN N GC GG    AF +I +N GI+ E +Y Y     TC + +K AA +IS+
Sbjct: 180 QELLDCTTN-NYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQYTCRSQEKTAAVQISS 238

Query: 246 YEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTT 305
           Y+ VP G E +LL+AV+ QPVSI IAA S + Q Y  G ++G C  +++HAVT +G+GT 
Sbjct: 239 YQVVPEG-ETSLLQAVTKQPVSIGIAA-SQDLQFYAGGTYDGSCADRINHAVTAIGYGTD 296

Query: 306 EDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYP 346
           E G  YWL+KNSWG +WG+ G+MKI+RD     GLC I   SSYP
Sbjct: 297 EKGQKYWLLKNSWGTSWGENGFMKIIRDYGNPAGLCDIAKMSSYP 341


>gi|400180393|gb|AFP73335.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score =  340 bits (871), Expect = 7e-91,   Method: Compositional matrix adjust.
 Identities = 170/345 (49%), Positives = 233/345 (67%), Gaps = 13/345 (3%)

Query: 10  SFKINTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLK 69
           + K++   + I +  ++S  +     RS  + SV E HE WM++HGR YKDE+EK  R  
Sbjct: 2   AMKVDLMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKVERFM 61

Query: 70  IFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMP----SPSHRSTTSST 125
           IFKEN+++IE  NK GN +YKLG N+F+D+T+ EF A +TG  +P    SPS  S+T   
Sbjct: 62  IFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTE-- 119

Query: 126 FKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSE 185
            K  +LS  D+P++LDWR+ GAVT +K+Q  CGCCWAF+AV ++EG  KI +GNL++ SE
Sbjct: 120 LKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSE 179

Query: 186 QQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISN 245
           Q+LLDC+TN N GC GG    AF +II+N GI+ E +Y Y     TC + +K AA +IS+
Sbjct: 180 QELLDCTTN-NYGCNGGFMTNAFDFIIENGGISRESDYEYLGEQYTCRSQEKTAAVQISS 238

Query: 246 YEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTT 305
           Y+ VP G E +LL+AV+ QPVSI IAA S + Q Y  G ++G C  +++HAVT +G+GT 
Sbjct: 239 YKVVPEG-ETSLLQAVTKQPVSIGIAA-SQDLQFYAGGTYDGSCADRINHAVTAIGYGTD 296

Query: 306 EDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYP 346
           E G  YWL+KNSWG +WG+ G+MKI+RD     GLC I   SSYP
Sbjct: 297 EKGQKYWLLKNSWGTSWGENGFMKIIRDSGNPSGLCDIAKMSSYP 341


>gi|400180407|gb|AFP73342.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score =  340 bits (871), Expect = 8e-91,   Method: Compositional matrix adjust.
 Identities = 170/345 (49%), Positives = 233/345 (67%), Gaps = 13/345 (3%)

Query: 10  SFKINTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLK 69
           + K++   + I +  ++S  +     RS  + SV E HE WM++HGR YKDE+EK  R  
Sbjct: 2   AMKVDLMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKVERFM 61

Query: 70  IFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMP----SPSHRSTTSST 125
           IFKEN+++IE  NK GN +YKLG N+F+D+T+ EF A +TG  +P    SPS  S+T   
Sbjct: 62  IFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTE-- 119

Query: 126 FKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSE 185
           FK  +LS  D+P++LDWR+ GAVT +K+Q  CGCCWAF+AV ++EG  KI +GNL++ SE
Sbjct: 120 FKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSE 179

Query: 186 QQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISN 245
           Q+LLDC+TN N GC GG    AF +I +N GI+ E +Y Y     TC + +K AA +IS+
Sbjct: 180 QELLDCTTN-NYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQYTCRSQEKTAAVQISS 238

Query: 246 YEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTT 305
           Y+ VP G E +LL+AV+ QPVSI IAA S + Q Y  G ++G C  +++HAVT +G+GT 
Sbjct: 239 YQVVPEG-ETSLLQAVTKQPVSIGIAA-SQDLQFYAGGTYDGSCADRINHAVTAIGYGTD 296

Query: 306 EDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYP 346
           E G  YWL+KNSWG +WG+ G+MKI+RD     GLC I   SSYP
Sbjct: 297 EKGQKYWLLKNSWGTSWGENGFMKIIRDSGNPSGLCDIAKMSSYP 341


>gi|195628596|gb|ACG36128.1| vignain precursor [Zea mays]
          Length = 362

 Score =  340 bits (871), Expect = 8e-91,   Method: Compositional matrix adjust.
 Identities = 163/349 (46%), Positives = 230/349 (65%), Gaps = 19/349 (5%)

Query: 15  TTPMFIIITLLVS---CASQVVS--------SRST--HEQSVVEIHEKWMAQHGRSYKDE 61
           TT M ++  + ++   C + V +         R+T   E  ++  ++KWMAQ+ R YKD+
Sbjct: 14  TTLMLLLCVIAIADCICQAAVAARVEPSTTVGRTTGGDEAMMMARYKKWMAQYRRKYKDD 73

Query: 62  LEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPS--PSHR 119
            EK  R ++FK N E+I+++N  G + Y LGTNQF+DLT+ EF A+YTG + P+  PS  
Sbjct: 74  AEKAHRFQVFKANAEFIDRSNAGGKKKYVLGTNQFADLTSKEFAAMYTGLRKPAAVPSGA 133

Query: 120 STTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGN 179
               + FKYQN +  D    +DWR +GAVTP+KNQ +CGCCWAF+AV A+EG+  I +GN
Sbjct: 134 KQIPAGFKYQNFTRLDDDVQVDWRQQGAVTPVKNQGQCGCCWAFSAVGAMEGLIMITTGN 193

Query: 180 LIQLSEQQLLDCS-TNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKP 238
           L+ LSEQQ+LDC  ++GN GC GG  + AF Y++ N G+ TED YPY AV GTC   Q  
Sbjct: 194 LVSLSEQQILDCDESDGNQGCNGGYMDNAFQYVVNNGGVTTEDAYPYSAVQGTCQNVQP- 252

Query: 239 AAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNG-VCGTQLDHAV 297
            AA IS ++++PSGDE AL  AV+ QPVS+ +   S+ FQ Y+ GI++G  CGT ++HAV
Sbjct: 253 -AATISGFQDLPSGDENALANAVANQPVSVGVDGGSSPFQFYQGGIYDGDGCGTDMNHAV 311

Query: 298 TIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRDEGLCGIGTRSSYP 346
           T +G+G  + G  YW++KNSWG  WG+ G+M++    G CGI T +SYP
Sbjct: 312 TAIGYGADDQGTQYWILKNSWGTGWGENGFMQLQMGVGACGISTMASYP 360


>gi|400180455|gb|AFP73364.1| cysteine protease [Solanum peruvianum]
 gi|400180459|gb|AFP73366.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score =  340 bits (871), Expect = 8e-91,   Method: Compositional matrix adjust.
 Identities = 170/345 (49%), Positives = 233/345 (67%), Gaps = 13/345 (3%)

Query: 10  SFKINTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLK 69
           + K++   + I +  ++S  +     RS  + SV E HE WM++HGR YKDE+EK  R  
Sbjct: 2   AMKVDLMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFM 61

Query: 70  IFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMP----SPSHRSTTSST 125
           IFKEN+++IE  NK GN +YKLG N+F+D+T+ EF A +TG  +P    SPS  S+T   
Sbjct: 62  IFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTE-- 119

Query: 126 FKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSE 185
           FK  +LS  D+P++LDWR+ GAVT +K+Q  CGCCWAF+AV ++EG  KI +GNL++ SE
Sbjct: 120 FKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSE 179

Query: 186 QQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISN 245
           Q+LLDC+TN N GC GG    AF +I +N GI+ E +Y Y     TC + +K AA +IS+
Sbjct: 180 QELLDCTTN-NYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQYTCRSQEKTAAVQISS 238

Query: 246 YEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTT 305
           Y+ VP G E +LL+AV+ QPVSI IAA S + Q Y  G ++G C  +++HAVT +G+GT 
Sbjct: 239 YQVVPEG-ETSLLQAVTKQPVSIGIAA-SQDLQFYAGGTYDGSCADRINHAVTAIGYGTD 296

Query: 306 EDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYP 346
           E G  YWL+KNSWG +WG+ G+MKI+RD     GLC I   SSYP
Sbjct: 297 EKGQKYWLLKNSWGTSWGENGFMKIIRDSGDPSGLCDIAKMSSYP 341


>gi|400180403|gb|AFP73340.1| cysteine protease [Solanum peruvianum]
 gi|400180413|gb|AFP73345.1| cysteine protease [Solanum peruvianum]
 gi|400180415|gb|AFP73346.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score =  340 bits (871), Expect = 8e-91,   Method: Compositional matrix adjust.
 Identities = 170/345 (49%), Positives = 233/345 (67%), Gaps = 13/345 (3%)

Query: 10  SFKINTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLK 69
           + K++   + I +  ++S  +     RS  + SV E HE WM++HGR YKDE+EK  R  
Sbjct: 2   AMKVDLMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFM 61

Query: 70  IFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMP----SPSHRSTTSST 125
           IFKEN+++IE  NK GN +YKLG N+F+D+T+ EF A +TG  +P    SPS  S+T   
Sbjct: 62  IFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTE-- 119

Query: 126 FKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSE 185
           FK  +LS  D+P++LDWR+ GAVT +K+Q  CGCCWAF+AV ++EG  KI +GNL++ SE
Sbjct: 120 FKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSE 179

Query: 186 QQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISN 245
           Q+LLDC+TN N GC GG    AF +I +N GI+ E +Y Y     TC + +K AA +IS+
Sbjct: 180 QELLDCTTN-NYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQYTCRSQEKTAAVQISS 238

Query: 246 YEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTT 305
           Y+ VP G E +LL+AV+ QPVSI IAA S + Q Y  G ++G C  +++HAVT +G+GT 
Sbjct: 239 YQVVPEG-ETSLLQAVTKQPVSIGIAA-SQDLQFYAGGTYDGSCADRINHAVTAIGYGTD 296

Query: 306 EDGANYWLIKNSWGNTWGDAGYMKIVRDE----GLCGIGTRSSYP 346
           E G  YWL+KNSWG +WG+ G+MKI+RD     GLC I   SSYP
Sbjct: 297 EKGQKYWLLKNSWGTSWGENGFMKIIRDSGNPAGLCDIAKMSSYP 341


>gi|400180353|gb|AFP73315.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score =  339 bits (870), Expect = 9e-91,   Method: Compositional matrix adjust.
 Identities = 170/345 (49%), Positives = 233/345 (67%), Gaps = 13/345 (3%)

Query: 10  SFKINTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLK 69
           + K++   + I +  ++S  +     RS  + SV E HE WM++HGR YKDE+EK  R  
Sbjct: 2   AMKVDLMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFM 61

Query: 70  IFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMP----SPSHRSTTSST 125
           IFKEN+++IE  NK GN +YKLG N+F+D+T+ EF A +TG  +P    SPS  S+T   
Sbjct: 62  IFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTE-- 119

Query: 126 FKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSE 185
           F   +LS  D+P++LDWR+ GAVT +K+Q  CGCCWAF+AV ++EG  KI +GNL++ SE
Sbjct: 120 FIINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSE 179

Query: 186 QQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISN 245
           Q+LLDC+TN N GC GG    AF +II+N GI+ E +Y Y     TC + +K AA +IS+
Sbjct: 180 QELLDCTTN-NYGCNGGFMTNAFDFIIENGGISRESDYEYLGEQYTCRSQEKTAAVQISS 238

Query: 246 YEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTT 305
           Y+ VP G E +LL+AV+ QPVSI IAA S + Q Y  G ++G C  +++HAVT +G+GT 
Sbjct: 239 YQVVPEG-ETSLLQAVTKQPVSIGIAA-SQDLQFYAGGTYDGSCADRINHAVTAIGYGTD 296

Query: 306 EDGANYWLIKNSWGNTWGDAGYMKIVRDE----GLCGIGTRSSYP 346
           E G  YWL+KNSWG +WG+ G+MKI+RD     GLC I   SSYP
Sbjct: 297 EKGQKYWLLKNSWGTSWGENGFMKIIRDSGNPAGLCDIAKMSSYP 341


>gi|400180351|gb|AFP73314.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score =  339 bits (870), Expect = 9e-91,   Method: Compositional matrix adjust.
 Identities = 170/345 (49%), Positives = 233/345 (67%), Gaps = 13/345 (3%)

Query: 10  SFKINTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLK 69
           + K++   + I +  ++S  +     RS  + SV E HE WM++HGR YKDE+EK  R  
Sbjct: 2   AMKVDLMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFM 61

Query: 70  IFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMP----SPSHRSTTSST 125
           IFKEN+++IE  NK GN +YKLG N+F+D+T+ EF A +TG  +P    SPS  S+T   
Sbjct: 62  IFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTE-- 119

Query: 126 FKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSE 185
           FK  +LS  D+P++LDWR+ GAVT +K+Q  CGCCWAF+AV ++EG  KI +GNL++ SE
Sbjct: 120 FKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSE 179

Query: 186 QQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISN 245
           Q+LLDC+TN N GC GG    AF +I +N GI+ E +Y Y     TC + +K AA +IS+
Sbjct: 180 QELLDCTTN-NYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQYTCRSQEKTAAVQISS 238

Query: 246 YEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTT 305
           Y+ VP G E +LL+AV+ QPVSI IAA S + Q Y  G ++G C  +++HAVT +G+GT 
Sbjct: 239 YQVVPEG-ETSLLQAVTKQPVSIGIAA-SQDLQFYAGGTYDGSCADRINHAVTAIGYGTD 296

Query: 306 EDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYP 346
           E G  YWL+KNSWG +WG+ G+MKI+RD     GLC I   SSYP
Sbjct: 297 EKGQKYWLLKNSWGTSWGENGFMKIIRDYGNPAGLCDIAKMSSYP 341


>gi|400180369|gb|AFP73323.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score =  339 bits (870), Expect = 1e-90,   Method: Compositional matrix adjust.
 Identities = 170/345 (49%), Positives = 233/345 (67%), Gaps = 13/345 (3%)

Query: 10  SFKINTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLK 69
           + K++   + I +  ++S  +     RS  + SV E HE WM++HGR YKDE+EK  R  
Sbjct: 2   AMKVDLMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFM 61

Query: 70  IFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMP----SPSHRSTTSST 125
           IFKEN+++IE  NK GN +YKLG N+F+D+T+ EF A +TG  +P    SPS  S+T   
Sbjct: 62  IFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTE-- 119

Query: 126 FKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSE 185
           FK  +LS  D+P++LDWR+ GAVT +K+Q  CGCCWAF+AV ++EG  KI +GNL++ SE
Sbjct: 120 FKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSE 179

Query: 186 QQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISN 245
           Q+LLDC+TN N GC GG    AF +I +N GI+ E +Y Y     TC + +K AA +IS+
Sbjct: 180 QELLDCTTN-NYGCNGGFMTNAFDFIKENGGISRESDYEYLGQQYTCRSQEKTAAVQISS 238

Query: 246 YEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTT 305
           Y+ VP G E +LL+AV+ QPVSI IAA S + Q Y  G ++G C  +++HAVT +G+GT 
Sbjct: 239 YKVVPEG-ETSLLQAVTKQPVSIGIAA-SQDLQFYAGGTYDGSCADRINHAVTAIGYGTD 296

Query: 306 EDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYP 346
           E G  YWL+KNSWG +WG+ G+MKI+RD     GLC I   SSYP
Sbjct: 297 EKGQKYWLLKNSWGTSWGENGFMKIIRDSGNPSGLCDIAKMSSYP 341


>gi|356517426|ref|XP_003527388.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like [Glycine
           max]
          Length = 343

 Score =  339 bits (870), Expect = 1e-90,   Method: Compositional matrix adjust.
 Identities = 167/324 (51%), Positives = 225/324 (69%), Gaps = 13/324 (4%)

Query: 33  VSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLG 92
           V+ R+  + S+ E H +WMA++ + YKD  E+E R +IFKEN+ YIE  N   N++YKL 
Sbjct: 25  VTCRTLQDASMYERHAQWMARYAKVYKDPQEREKRFRIFKENVNYIETFNSADNKSYKLD 84

Query: 93  TNQFSDLTNDEFRALYTGYK--MPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTP 150
            NQF+DLTN+EF A    +K  M S   R+TT   FKY+N+++  +P+++DWR KGAVTP
Sbjct: 85  INQFADLTNEEFIAPRNRFKGHMCSSITRTTT---FKYENVTV--IPSTVDWRQKGAVTP 139

Query: 151 IKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNN-GCLGGSREKAFA 209
           IK+Q +CGCCWAF+AVAA EGI  + +G LI LSEQ+++DC T G + GC GG  + AF 
Sbjct: 140 IKDQGQCGCCWAFSAVAATEGIHALNAGKLISLSEQEVVDCDTKGQDQGCAGGFMDGAFK 199

Query: 210 YIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAK-ISNYEEVPSGDEQALLKAVSMQPVSI 268
           +IIQN G+ TE  YPY+A  G C+A      A  I+ YE+VP  +E+AL KAV+ QPVS+
Sbjct: 200 FIIQNHGLNTEPNYPYKAADGKCNAKAAANHAATITGYEDVPVNNEKALQKAVANQPVSV 259

Query: 269 AIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYM 328
           AI A  ++FQ YK G+F G CGT+LDH VT VG+G + DG  YWL+KNSWG  WG+ GY+
Sbjct: 260 AIDASGSDFQFYKSGVFTGSCGTELDHGVTAVGYGVSADGTEYWLVKNSWGTEWGEEGYI 319

Query: 329 KIVR----DEGLCGIGTRSSYPLA 348
           ++ R    +EGLCGI   +SYP A
Sbjct: 320 RMQRGVKAEEGLCGIAMMASYPTA 343


>gi|400180383|gb|AFP73330.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score =  339 bits (870), Expect = 1e-90,   Method: Compositional matrix adjust.
 Identities = 170/345 (49%), Positives = 233/345 (67%), Gaps = 13/345 (3%)

Query: 10  SFKINTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLK 69
           + K++   + I +  ++S  +     RS  + SV E HE WM++HGR YKDE+EK  R  
Sbjct: 2   AMKVDLMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFM 61

Query: 70  IFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMP----SPSHRSTTSST 125
           IFKEN+++IE  NK GN +YKLG N+F+D+T+ EF A +TG  +P    SPS  S+T   
Sbjct: 62  IFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTE-- 119

Query: 126 FKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSE 185
           F   +LS  D+P++LDWR+ GAVT +K+Q  CGCCWAF+AV ++EG  KI +GNL++ SE
Sbjct: 120 FIINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSE 179

Query: 186 QQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISN 245
           Q+LLDC+TN N GC GG    AF +II+N GI+ E +Y Y     TC + +K AA +IS+
Sbjct: 180 QELLDCTTN-NYGCNGGFMTNAFDFIIENGGISRESDYEYLGQQYTCRSQEKTAAVQISS 238

Query: 246 YEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTT 305
           Y+ VP G E +LL+AV+ QPVSI IAA S + Q Y  G ++G C  +++HAVT +G+GT 
Sbjct: 239 YKVVPEG-ETSLLQAVTKQPVSIGIAA-SQDLQFYAGGTYDGSCADRINHAVTAIGYGTD 296

Query: 306 EDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYP 346
           E G  YWL+KNSWG +WG+ G+MKI+RD     GLC I   SSYP
Sbjct: 297 EKGQKYWLLKNSWGTSWGENGFMKIIRDSGNPSGLCDIAKMSSYP 341


>gi|413953668|gb|AFW86317.1| hypothetical protein ZEAMMB73_339067 [Zea mays]
          Length = 433

 Score =  339 bits (870), Expect = 1e-90,   Method: Compositional matrix adjust.
 Identities = 157/331 (47%), Positives = 216/331 (65%), Gaps = 8/331 (2%)

Query: 21  IITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEK 80
           II     C + + +   + +  +V  HE+WMAQ+ R YKD  EK  R ++FK N+++IE 
Sbjct: 104 IIGFAFFCGAAMAARDLSDDSVMVARHEQWMAQYSRVYKDASEKARRFEVFKANVQFIES 163

Query: 81  ANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSL 140
            N  GN  + LG NQF+DLTNDEFR+  T   + S + +  T   F+Y+N+S   +PT++
Sbjct: 164 FNAGGNNKFWLGVNQFADLTNDEFRSTKTNKGLKSSNMKIPTG--FRYENVSADALPTTI 221

Query: 141 DWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-NNGC 199
           DWR KGAVTPIK+Q +CGCCWAF+AVAA EGI KI +G L+ L+EQ+L+DC  +G + GC
Sbjct: 222 DWRTKGAVTPIKDQGQCGCCWAFSAVAATEGIVKISTGKLVSLAEQELVDCDVHGEDQGC 281

Query: 200 LGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLK 259
            GG  + AF +II+N G+ TE  YPY A  G C +    +AA I  YE+VP+ DE AL+K
Sbjct: 282 EGGLMDDAFKFIIKNGGLTTESSYPYTAADGKCKSGSN-SAATIKGYEDVPANDEAALMK 340

Query: 260 AVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWG 319
           AV+ QPVS+A+      FQ Y  G+  G CGT LDH +  +G+G T DG  YWL+KNSWG
Sbjct: 341 AVANQPVSVAVDGGDMTFQFYSGGVMTGSCGTDLDHGIAAIGYGKTSDGTKYWLMKNSWG 400

Query: 320 NTWGDAGYMKIVRD----EGLCGIGTRSSYP 346
            TWG+ GY+++ +D     G+CG+    SYP
Sbjct: 401 TTWGENGYLRMEKDISDKRGMCGLAMEPSYP 431


>gi|255580659|ref|XP_002531152.1| cysteine protease, putative [Ricinus communis]
 gi|223529265|gb|EEF31237.1| cysteine protease, putative [Ricinus communis]
          Length = 340

 Score =  339 bits (869), Expect = 1e-90,   Method: Compositional matrix adjust.
 Identities = 171/335 (51%), Positives = 232/335 (69%), Gaps = 15/335 (4%)

Query: 21  IITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEK 80
           +I  L + ASQ ++ R+  + S+ E HE+WM +  R Y D  EKE+R KIFKEN++ IE 
Sbjct: 14  LIFFLGALASQAIA-RTLQDASIHEKHEEWMTRFKRVYSDAKEKEIRYKIFKENVQRIES 72

Query: 81  ANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHR-STTSSTFKYQNLSMTDVPTS 139
            NK   ++YKLG NQF+DLTN+EF+     +K     H  S+ +  F+Y+N+  T VP+S
Sbjct: 73  FNKASEKSYKLGINQFADLTNEEFKTSRNRFK----GHMCSSQAGPFRYENI--TAVPSS 126

Query: 140 LDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-NNG 198
           +DWR +GAVT IK+Q +CG CWAF+AVAAVEGIT++ +  LI LSEQ+L+DC T G + G
Sbjct: 127 MDWRKEGAVTAIKDQGQCGSCWAFSAVAAVEGITQLATSKLISLSEQELVDCDTKGEDQG 186

Query: 199 CLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKP-AAAKISNYEEVPSGDEQAL 257
           C GG  + AF +I QNQG+ TE  YPY+   GTC+  Q+   AAKI+ +E+VP+ +E AL
Sbjct: 187 CQGGLMDDAFKFIEQNQGLTTEANYPYEGSDGTCNTKQEANHAAKINGFEDVPANNEGAL 246

Query: 258 LKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNS 317
           +KAV+ QPVS+AI A   EFQ Y  GIF G CGT+LDH V  VG+G + +G NYWL+KNS
Sbjct: 247 MKAVAKQPVSVAIDAGGFEFQFYSSGIFTGDCGTELDHGVAAVGYGES-NGMNYWLVKNS 305

Query: 318 WGNTWGDAGYMKIVRD----EGLCGIGTRSSYPLA 348
           WG  WG+ GY+++ +D    EGLCGI  ++SYP A
Sbjct: 306 WGTQWGEEGYIRMQKDIDAKEGLCGIAMQASYPTA 340


>gi|400180457|gb|AFP73365.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score =  339 bits (869), Expect = 1e-90,   Method: Compositional matrix adjust.
 Identities = 170/345 (49%), Positives = 233/345 (67%), Gaps = 13/345 (3%)

Query: 10  SFKINTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLK 69
           + K++   + I +  ++S  +     RS  + SV E HE WM++HGR YKDE+EK  R  
Sbjct: 2   AMKVDLMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFM 61

Query: 70  IFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMP----SPSHRSTTSST 125
           IFKEN+++IE  NK GN +YKLG N+F+D+T+ EF A +TG  +P    SPS  S+T   
Sbjct: 62  IFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTE-- 119

Query: 126 FKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSE 185
           F   +LS  D+P++LDWR+ GAVT +K+Q  CGCCWAF+AV ++EG  KI +GNL++ SE
Sbjct: 120 FIINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSE 179

Query: 186 QQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISN 245
           Q+LLDC+TN N GC GG    AF +II+N GI+ E +Y Y     TC + +K AA +IS+
Sbjct: 180 QELLDCTTN-NYGCNGGFMTNAFDFIIENGGISRESDYEYLGEQYTCRSQEKTAAVQISS 238

Query: 246 YEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTT 305
           Y+ VP G E +LL+AV+ QPVSI IAA S + Q Y  G ++G C  +++HAVT +G+GT 
Sbjct: 239 YKVVPEG-ETSLLQAVTKQPVSIGIAA-SQDLQFYAGGTYDGSCADRINHAVTAIGYGTD 296

Query: 306 EDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYP 346
           E G  YWL+KNSWG +WG+ G+MKI+RD     GLC I   SSYP
Sbjct: 297 EKGQKYWLLKNSWGTSWGENGFMKIIRDSGNPSGLCDIAKMSSYP 341


>gi|400180373|gb|AFP73325.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score =  339 bits (869), Expect = 1e-90,   Method: Compositional matrix adjust.
 Identities = 169/345 (48%), Positives = 234/345 (67%), Gaps = 13/345 (3%)

Query: 10  SFKINTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLK 69
           + K++   + I +  ++S  +     RS  + SV E HE WM++HG  YKDE+EK  R  
Sbjct: 2   AMKVDLMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGHVYKDEVEKGERFM 61

Query: 70  IFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMP----SPSHRSTTSST 125
           IFKEN+++IE  NK GN +YKLG N+F+D+T+ EF A +TG  +P    SPS  S+T   
Sbjct: 62  IFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTE-- 119

Query: 126 FKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSE 185
           FK  +LS  D+P++LDWR+ GAVT +K+Q +CGCCWAF+AV ++EG  KI +GNL++ SE
Sbjct: 120 FKINDLSDDDMPSNLDWRESGAVTQVKHQGQCGCCWAFSAVGSLEGAYKIATGNLMEFSE 179

Query: 186 QQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISN 245
           Q+LLDC+TN N GC GG    AF +I +N GI++E +Y Y     TC + +K AA +IS+
Sbjct: 180 QELLDCTTN-NYGCDGGFMTNAFDFIKENGGISSESDYEYLGEQYTCRSQEKTAAVQISS 238

Query: 246 YEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTT 305
           Y+ VP G E +LL+AV+ QPVSI IAA S + Q Y  G ++G C  +++HAVT +G+GT 
Sbjct: 239 YQVVPEG-ETSLLQAVTKQPVSIGIAA-SQDLQFYAGGTYDGSCADRINHAVTAIGYGTD 296

Query: 306 EDGANYWLIKNSWGNTWGDAGYMKIVRDE----GLCGIGTRSSYP 346
           E G  YWL+KNSWG +WG+ G+MKI+RD     GLC I   SSYP
Sbjct: 297 EKGQKYWLLKNSWGTSWGENGFMKIIRDSGNPAGLCDIAKMSSYP 341


>gi|400180385|gb|AFP73331.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score =  339 bits (869), Expect = 1e-90,   Method: Compositional matrix adjust.
 Identities = 170/345 (49%), Positives = 233/345 (67%), Gaps = 13/345 (3%)

Query: 10  SFKINTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLK 69
           + K++   + I +  ++S  +     RS  + SV E HE WM++HGR YKDE+EK  R  
Sbjct: 2   AMKVDLMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFM 61

Query: 70  IFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMP----SPSHRSTTSST 125
           IFKEN+++IE  NK GN +YKLG N+F+D+T+ EF A +TG  +P    SPS  S+T   
Sbjct: 62  IFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTE-- 119

Query: 126 FKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSE 185
           FK  +LS  D+P++LDWR+ GAVT +K+Q  CGCCWAF+AV ++EG  KI +GNL++ SE
Sbjct: 120 FKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSE 179

Query: 186 QQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISN 245
           Q+LLDC+TN N GC GG    AF +I +N GI+ E +Y Y     TC + +K AA +IS+
Sbjct: 180 QELLDCTTN-NYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQYTCRSQEKTAAVQISS 238

Query: 246 YEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTT 305
           Y+ VP G E +LL+AV+ QPVSI IAA S + Q Y  G ++G C  +++HAVT +G+GT 
Sbjct: 239 YQVVPEG-ETSLLQAVTKQPVSIGIAA-SQDLQFYAGGTYDGSCADRINHAVTAIGYGTD 296

Query: 306 EDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYP 346
           E G  YWL+KNSWG +WG+ G+MKI+RD     GLC I   SSYP
Sbjct: 297 EKGQKYWLLKNSWGTSWGENGFMKIIRDSGDPSGLCDITKMSSYP 341


>gi|242072392|ref|XP_002446132.1| hypothetical protein SORBIDRAFT_06g002150 [Sorghum bicolor]
 gi|241937315|gb|EES10460.1| hypothetical protein SORBIDRAFT_06g002150 [Sorghum bicolor]
          Length = 337

 Score =  339 bits (869), Expect = 1e-90,   Method: Compositional matrix adjust.
 Identities = 158/338 (46%), Positives = 226/338 (66%), Gaps = 11/338 (3%)

Query: 15  TTPMFIIITL-LVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKE 73
           T+  F++  L   S  S V+++R   + ++VE HE WM ++GR YKD  EK  R + FK 
Sbjct: 3   TSKAFLLAILGCASLCSSVLAARELSDAAMVERHENWMVEYGRVYKDAAEKARRFEAFKH 62

Query: 74  NLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSM 133
           N+ ++E  N      + LG NQF+DLT +EF+A   G+K   P+     ++ FKY+NLS+
Sbjct: 63  NVAFVESFNTNKKNKFWLGVNQFADLTTEEFKA-NKGFK---PTAEKVPTTGFKYENLSV 118

Query: 134 TDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCST 193
           + +PT++DWR KGAVTPIKNQ +CGCCWAF+AVAA+EGI K+ +GNLI LSEQ+L+DC T
Sbjct: 119 SALPTAVDWRTKGAVTPIKNQGQCGCCWAFSAVAAMEGIVKLSTGNLISLSEQELVDCDT 178

Query: 194 NG-NNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSG 252
           +  + GC GG  + AF ++I+N G+ATE  YPY+AV G C    K +AA I  +E+VP  
Sbjct: 179 HSMDEGCEGGWMDSAFEFVIKNGGLATESNYPYKAVDGKCKGGSK-SAATIKGHEDVPVN 237

Query: 253 DEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYW 312
           +E AL+KAV+ QPVS+A+ A    F  Y  G+  G CGT+LDH +  +G+G   DG  YW
Sbjct: 238 NEAALMKAVANQPVSVAVDASDRTFMLYSGGVMTGSCGTELDHGIAAIGYGMESDGTKYW 297

Query: 313 LIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYP 346
           ++KNSWG TWG+ G++++ +D     G+CG+  + SYP
Sbjct: 298 ILKNSWGTTWGEKGFLRMEKDITDKRGMCGLAMKPSYP 335


>gi|1046373|gb|AAC49135.1| SAG12 protein [Arabidopsis thaliana]
          Length = 346

 Score =  338 bits (868), Expect = 2e-90,   Method: Compositional matrix adjust.
 Identities = 167/338 (49%), Positives = 224/338 (66%), Gaps = 10/338 (2%)

Query: 18  MFIIITLLVS-CASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLE 76
           +F+ + +  S C S  +S    +E  + + H +WM +HGR Y D  E+  R  +FK N+E
Sbjct: 8   IFLFVAIFSSFCFSITLSRPLDNELIMQKRHIEWMTKHGRVYADVKEENNRYVVFKNNVE 67

Query: 77  YIEKANK-EGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSP--SHRSTTSSTFKYQNLSM 133
            IE  N     RT+KL  NQF+DLTNDEF ++YTG+K  S   S   T  S F+YQN+S 
Sbjct: 68  RIEHLNSIPAGRTFKLAVNQFADLTNDEFCSMYTGFKGVSALSSQSQTKMSPFRYQNVSS 127

Query: 134 TDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCST 193
             +P S+DWR KGAVTPIKNQ  CGCCWAF+AVAA+EG T+I+ G LI LSEQQL+DC T
Sbjct: 128 GALPVSVDWRKKGAVTPIKNQGSCGCCWAFSAVAAIEGATQIKKGKLISLSEQQLVDCDT 187

Query: 194 NGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQ-KPAAAKISNYEEVPSG 252
           N + GC GG  + AF +I    G+ TE +YPY+    TC++ +  P A  I+ YE+VP  
Sbjct: 188 N-DFGCEGGLMDTAFEHIKATGGLTTESDYPYKGEDATCNSKKTNPKATSITGYEDVPVN 246

Query: 253 DEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYW 312
           DEQAL+KAV+ QPVS+ I     +FQ Y  G+F G C T LDHAVT +G+G + +G+ YW
Sbjct: 247 DEQALMKAVAHQPVSVGIEGGGFDFQFYSSGVFTGECTTYLDHAVTAIGYGESTNGSKYW 306

Query: 313 LIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYP 346
           +IKNSWG  WG++GYM+I +D    +GLCG+  ++SYP
Sbjct: 307 IIKNSWGTKWGESGYMRIQKDVKDKQGLCGLAMKASYP 344


>gi|125551397|gb|EAY97106.1| hypothetical protein OsI_19029 [Oryza sativa Indica Group]
          Length = 350

 Score =  338 bits (868), Expect = 2e-90,   Method: Compositional matrix adjust.
 Identities = 164/348 (47%), Positives = 228/348 (65%), Gaps = 16/348 (4%)

Query: 15  TTPMFIIITLLVSC------ASQVVSSRSTH-EQSVVEIHEKWMAQHGRSYKDELEKEMR 67
           + P+ + I   + C       + V ++R    + ++   HE+WMAQHGR YKD  EK  R
Sbjct: 5   SKPLLLAILCCIVCLYSSSGGAIVAAARELGGDAAMAARHERWMAQHGRVYKDAAEKARR 64

Query: 68  LKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYK-MPSPSHRSTTSSTF 126
           L++FK N+ +IE  N  G   Y LG NQF+DLT++EF+A  T  K   +P++    S+ F
Sbjct: 65  LEVFKANVAFIESFNAGGKNRYWLGVNQFADLTSEEFKATMTNSKGFSTPNNGVRVSTGF 124

Query: 127 KYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQ 186
           KY+N+S   +P S+DWR KGAVT IK+Q +CGCCWAF+AVAA+EGI K+ +G LI LSEQ
Sbjct: 125 KYENVSADALPASVDWRTKGAVTRIKDQGQCGCCWAFSAVAAMEGIVKLSTGKLISLSEQ 184

Query: 187 QLLDCSTNGNN-GCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTC-SAAQKPAAAKIS 244
           +L+DC  +GN+ GC GG  + AF +I+ N G+  E  YPY A  G C + A    AA I 
Sbjct: 185 ELVDCDVDGNDQGCEGGEIDGAFQFILSNGGLTAEANYPYTAEDGRCKTTAAADVAASIR 244

Query: 245 NYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGT 304
            YE+VP+ DE +L+KAV+ QPVS+A+ A  ++FQ Y  G+  G CGT LDH VT++G+G 
Sbjct: 245 GYEDVPANDEPSLMKAVAGQPVSVAVDA--SKFQFYGGGVMAGECGTSLDHGVTVIGYGA 302

Query: 305 TEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPLA 348
             DG  YWL+KNSWG TWG+AGY+++ +D     G+CG+  + SYP A
Sbjct: 303 ASDGTKYWLVKNSWGTTWGEAGYLRMEKDIDDKRGMCGLAMQPSYPTA 350


>gi|400180379|gb|AFP73328.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score =  338 bits (868), Expect = 2e-90,   Method: Compositional matrix adjust.
 Identities = 169/345 (48%), Positives = 233/345 (67%), Gaps = 13/345 (3%)

Query: 10  SFKINTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLK 69
           + K++   + I +  +++  +     RS  + SV E HE WM++HGR YKDE+EK  R  
Sbjct: 2   AMKVDLMNILITLFFVITMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFM 61

Query: 70  IFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMP----SPSHRSTTSST 125
           IFKEN+++IE  NK GN +YKLG N+F+D+T+ EF A +TG  +P    SPS  S+T   
Sbjct: 62  IFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTE-- 119

Query: 126 FKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSE 185
           FK  +LS  D+P++LDWR+ GAVT +K+Q  CGCCWAF+AV ++EG  KI +GNL++ SE
Sbjct: 120 FKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSE 179

Query: 186 QQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISN 245
           Q+LLDC+TN N GC GG    AF +I +N GI+ E +Y Y     TC + +K AA +IS+
Sbjct: 180 QELLDCTTN-NYGCDGGFMTNAFDFIKENGGISRESDYEYLGEQYTCRSQEKTAAVQISS 238

Query: 246 YEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTT 305
           Y+ VP G E +LL+AV+ QPVSI IAA S + Q Y  G ++G C  +++HAVT +G+GT 
Sbjct: 239 YQVVPEG-ETSLLQAVTKQPVSIGIAA-SQDLQFYAGGTYDGSCADRINHAVTAIGYGTD 296

Query: 306 EDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYP 346
           E G  YWL+KNSWG +WG+ G+MKI+RD     GLC I   SSYP
Sbjct: 297 EKGQKYWLLKNSWGTSWGENGFMKIIRDSGNPSGLCDIAKMSSYP 341


>gi|255580657|ref|XP_002531151.1| cysteine protease, putative [Ricinus communis]
 gi|223529264|gb|EEF31236.1| cysteine protease, putative [Ricinus communis]
          Length = 340

 Score =  338 bits (868), Expect = 2e-90,   Method: Compositional matrix adjust.
 Identities = 172/335 (51%), Positives = 232/335 (69%), Gaps = 15/335 (4%)

Query: 21  IITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEK 80
           +I LL +  SQ ++ R+  + S+ E HE+WM++ GR Y D  EKE+R KIFKEN++ IE 
Sbjct: 14  LIFLLGALVSQAMA-RTLQDASMHEKHEEWMSRFGRVYNDGNEKEIRYKIFKENVQRIES 72

Query: 81  ANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHR-STTSSTFKYQNLSMTDVPTS 139
            NK   ++YKLG NQF+DLTN+EF+     +K     H  S+ +  F+Y+NL  T  P+S
Sbjct: 73  FNKASGKSYKLGINQFADLTNEEFKTSRNRFK----GHMCSSQAGPFRYENL--TAAPSS 126

Query: 140 LDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-NNG 198
           +DWR KGAVT IK+Q +CG CWAF+AVAAVEGIT++ +  LI LSEQ+L+DC T G + G
Sbjct: 127 MDWRKKGAVTAIKDQGQCGSCWAFSAVAAVEGITQLATSKLISLSEQELVDCDTKGEDQG 186

Query: 199 CLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKP-AAAKISNYEEVPSGDEQAL 257
           C GG  + AF +I QNQG+ TE  YPY+   GTC+  Q+   AAKI+ +E+VP+ +E AL
Sbjct: 187 CQGGLMDDAFKFIEQNQGLTTEANYPYEGSDGTCNTKQEANHAAKINGFEDVPANNEGAL 246

Query: 258 LKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNS 317
           +KAV+ QPVS+AI A    FQ Y  GIF G CGT+LDH V  VG+G + +G NYWL+KNS
Sbjct: 247 MKAVAKQPVSVAIDAGGFGFQFYSSGIFTGDCGTELDHGVAAVGYGES-NGMNYWLVKNS 305

Query: 318 WGNTWGDAGYMKIVRD----EGLCGIGTRSSYPLA 348
           WG  WG+ GY+++ +D    EGLCGI  ++SYP A
Sbjct: 306 WGTQWGEEGYIRMQKDIDAKEGLCGIAMQASYPTA 340


>gi|255557851|ref|XP_002519955.1| cysteine protease, putative [Ricinus communis]
 gi|223541001|gb|EEF42559.1| cysteine protease, putative [Ricinus communis]
          Length = 321

 Score =  338 bits (867), Expect = 2e-90,   Method: Compositional matrix adjust.
 Identities = 175/336 (52%), Positives = 225/336 (66%), Gaps = 35/336 (10%)

Query: 20  IIITLLV---SCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLE 76
           + I LLV   + ASQ ++ +  +E ++VE HE+WMA+HGR+Y+D  EKE R +IFK NLE
Sbjct: 9   LAIALLVVFSTWASQAMARQLINEDALVEKHEQWMARHGRTYQDSEEKERRFQIFKSNLE 68

Query: 77  YIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDV 136
           YI+  NK  N+TY+LG N F+DL+++E+ A YT  KMP                    +V
Sbjct: 69  YIDNFNKASNQTYQLGLNNFADLSHEEYVATYTARKMP-------------------VEV 109

Query: 137 PTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGN 196
           P S+DWRD GAVTPIKNQ +CGCCWAF+A AAVEGI      N + LS QQLLDC ++ N
Sbjct: 110 PESIDWRDHGAVTPIKNQYQCGCCWAFSAAAAVEGIV----ANGVSLSAQQLLDCVSD-N 164

Query: 197 NGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQA 256
            GC GG    AF YIIQNQGIA E +YPYQ +   CS+  + AAA+IS +E+V   DE+A
Sbjct: 165 QGCKGGWMNNAFNYIIQNQGIALETDYPYQQMQQMCSS--RMAAAQISGFEDVTPKDEEA 222

Query: 257 LLKAVSMQPVSIAIAAYST-EFQSYKEGIFNGV-CGTQLDHAVTIVGFGTTEDGANYWLI 314
           L++AV+ QPVS+ I A S   F+ YKEG+F    CG    HAVT+VG+GT+EDG  YWL 
Sbjct: 223 LMRAVAKQPVSVTIDATSNPNFKLYKEGVFTAAGCGNGHSHAVTLVGYGTSEDGTKYWLA 282

Query: 315 KNSWGNTWGDAGYMKIVRDEGL----CGIGTRSSYP 346
           KNSWG TWG++GYM++ RD GL    CGI   +SYP
Sbjct: 283 KNSWGETWGESGYMRLQRDIGLEGGPCGIALYASYP 318


>gi|400180417|gb|AFP73347.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score =  338 bits (867), Expect = 3e-90,   Method: Compositional matrix adjust.
 Identities = 168/343 (48%), Positives = 233/343 (67%), Gaps = 9/343 (2%)

Query: 10  SFKINTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLK 69
           + K++   + I +  ++S  +     RS  + SV E HE WM++HGR YKDE+EK  R  
Sbjct: 2   AMKVDLMNILITVFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFM 61

Query: 70  IFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPS--PSHRSTTSSTFK 127
           IFKEN+++IE  NK GN +YKLG N+F+D+T+ EF A +TG  +P+   S    +S+ FK
Sbjct: 62  IFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPLSSTEFK 121

Query: 128 YQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQ 187
             +LS  D+P++LDWR+ GAVT +K+Q  CGCCWAF+AV ++EG  KI +GNL++ SEQ+
Sbjct: 122 INDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSEQE 181

Query: 188 LLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYE 247
           LLDC+TN N GC GG    AF +II+N GI+ E +Y Y     TC + +K AA +IS+Y+
Sbjct: 182 LLDCTTN-NYGCNGGFMTNAFDFIIENGGISRESDYEYLGQQYTCRSQEKTAAVQISSYK 240

Query: 248 EVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTED 307
            VP G E +LL+AV+ QPVSI IAA S + Q Y  G ++G C  +++HAVT +G+GT E 
Sbjct: 241 VVPEG-ETSLLQAVTKQPVSIGIAA-SQDLQFYAGGTYDGSCADRINHAVTAIGYGTDEK 298

Query: 308 GANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYP 346
           G  YWL+KNSWG +WG+ G+MKI+RD     GLC I   SSYP
Sbjct: 299 GQKYWLLKNSWGTSWGENGFMKIIRDSGNPSGLCDIAKMSSYP 341


>gi|356554921|ref|XP_003545789.1| PREDICTED: LOW QUALITY PROTEIN: thiol protease SEN102-like [Glycine
           max]
          Length = 439

 Score =  338 bits (866), Expect = 3e-90,   Method: Compositional matrix adjust.
 Identities = 172/355 (48%), Positives = 233/355 (65%), Gaps = 15/355 (4%)

Query: 4   IFERSGSFKINTTPMFIIITLLVSCASQV--VSSRSTHEQSVVEIHEKWMAQHGRSYKDE 61
           IF+R  +         I + +L+  A     V+  +  + S+ E HE+WM +HG+ YKD 
Sbjct: 90  IFKRDSTMVAKNHFYHISLAMLLCTAFLAFQVTCCTLQDASMYERHEQWMTRHGKVYKDP 149

Query: 62  LEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYK--MPSPSHR 119
            E+E R +IF EN+ Y+E  N   N+ YKLG NQF DLTN EF A    +K  M S   R
Sbjct: 150 REREKRFRIFNENVNYVEAFNNAANKPYKLGINQFXDLTNQEFIAPRNRFKGHMCSSIIR 209

Query: 120 STTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGN 179
           +TT   FKY+N+  T VP+++DWR  GAVTP+K+Q +CGCCWAF+AVAA EGI  +  G 
Sbjct: 210 TTT---FKYENV--TTVPSTVDWRQNGAVTPVKDQGQCGCCWAFSAVAATEGIHALSGGK 264

Query: 180 LIQLSEQQLLDCSTNG-NNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKP 238
           LI LSEQ+L+DC T G + GC GG  + A+ +IIQN G+ TE  YPY+ V G C+A +  
Sbjct: 265 LISLSEQELVDCDTKGVDQGCEGGLMDDAYKFIIQNHGLNTEANYPYKGVDGKCNANEAA 324

Query: 239 AAAK-ISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAV 297
             A  I+ YE+VP+ +E+AL KAV+ QPVS+AI A S++FQ YK G F G CGT+LDH V
Sbjct: 325 NHAATITGYEDVPANNEKALQKAVANQPVSVAIDASSSDFQFYKSGAFTGSCGTELDHGV 384

Query: 298 TIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR----DEGLCGIGTRSSYPLA 348
           T VG+G ++ G  YWL+KNSWG  WG+ GY+++ R    +EG+CGI  ++SYP A
Sbjct: 385 TAVGYGVSDHGTKYWLVKNSWGTEWGEEGYIRMQRGVDSEEGVCGIAMQASYPTA 439


>gi|414587996|tpg|DAA38567.1| TPA: hypothetical protein ZEAMMB73_390779 [Zea mays]
          Length = 343

 Score =  337 bits (865), Expect = 4e-90,   Method: Compositional matrix adjust.
 Identities = 158/344 (45%), Positives = 231/344 (67%), Gaps = 14/344 (4%)

Query: 13  INTTPMFIIITLLVSCA----SQVVSSRS-THEQSVVEIHEKWMAQHGRSYKDELEKEMR 67
           +++    +++ +L  CA    S V+++R  + + ++ E HE+WMA +GR YKD  EK  R
Sbjct: 2   VSSRAFLLLLAILTGCACSFPSPVLAARELSDDAAMAERHERWMAVYGRVYKDAAEKARR 61

Query: 68  LKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFK 127
            ++FK+NL ++E  N +    + LG NQF+DLT +EF+A   G+K  S     TT   FK
Sbjct: 62  FEVFKDNLAFVESFNADKKNKFWLGVNQFADLTTEEFKA-NKGFKPISAEEVPTTG--FK 118

Query: 128 YQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQ 187
           Y+NLS++ +PT++DWR KGAVTPIKNQ +CGCCWAF+AVAA+EGI K+ + NL+ LSEQ+
Sbjct: 119 YENLSVSALPTAVDWRTKGAVTPIKNQGQCGCCWAFSAVAAMEGIVKLSTDNLVSLSEQE 178

Query: 188 LLDCSTNG-NNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNY 246
           L+DC T+  + GC GG  + AF ++I+N G+ATE  YPY+AV G C    K +AA I  +
Sbjct: 179 LVDCDTHSMDEGCEGGWMDSAFEFVIKNGGLATESSYPYKAVDGKCKGGSK-SAATIKGH 237

Query: 247 EEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTE 306
           E+VP  +E AL+KAV+ QPVS+A+ A    F  Y  G+  G CGTQLDH +  +G+G   
Sbjct: 238 EDVPPNNEAALMKAVASQPVSVAVDASDRTFMLYSGGVMTGSCGTQLDHGIAAIGYGVES 297

Query: 307 DGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYP 346
           DG  YW++KNSWG TWG+  ++++ +D    +G+CG+  + SYP
Sbjct: 298 DGTKYWILKNSWGTTWGEKRFLRMEKDISDKQGMCGLAMKPSYP 341


>gi|400180363|gb|AFP73320.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score =  337 bits (865), Expect = 4e-90,   Method: Compositional matrix adjust.
 Identities = 169/345 (48%), Positives = 232/345 (67%), Gaps = 13/345 (3%)

Query: 10  SFKINTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLK 69
           + K++   + I +  ++S  +     RS  + SV E HE WM++HGR YKDE+EK  R  
Sbjct: 2   AMKVDLMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFM 61

Query: 70  IFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMP----SPSHRSTTSST 125
           IFKEN+++IE  NK GN +YKLG N+F+D+T+ EF A +TG  +P    SPS  S+T   
Sbjct: 62  IFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTE-- 119

Query: 126 FKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSE 185
            K  +LS  D+P++LDWR+ GAVT +K+Q  CGCCWAF+AV ++EG  KI +GNL++ SE
Sbjct: 120 LKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSE 179

Query: 186 QQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISN 245
           Q+LLDC+TN N GC GG    AF +I +N GI+ E +Y Y     TC + +K AA +IS+
Sbjct: 180 QELLDCTTN-NYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQYTCRSQEKTAAVQISS 238

Query: 246 YEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTT 305
           Y+ VP G E +LL+AV+ QPVSI IAA S + Q Y  G ++G C  +++HAVT +G+GT 
Sbjct: 239 YQVVPEG-ETSLLQAVTKQPVSIGIAA-SQDLQFYAGGTYDGSCADRINHAVTAIGYGTD 296

Query: 306 EDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYP 346
           E G  YWL+KNSWG +WG+ G+MKI+RD     GLC I   SSYP
Sbjct: 297 EKGQKYWLLKNSWGTSWGENGFMKIIRDYGNPAGLCDIAKMSSYP 341


>gi|414588010|tpg|DAA38581.1| TPA: hypothetical protein ZEAMMB73_156486 [Zea mays]
          Length = 347

 Score =  337 bits (864), Expect = 5e-90,   Method: Compositional matrix adjust.
 Identities = 164/347 (47%), Positives = 223/347 (64%), Gaps = 18/347 (5%)

Query: 17  PMFIIITLL----VSCASQVVSSR---STHEQSVVEIHEKWMAQHGRSYKDELEKEMRLK 69
           P  +++ +L      C++ V+++R      E ++V  HE+WM QHGR YKDE +K  R  
Sbjct: 4   PKALLLAILGCGVCLCSAAVLAARELGGDDELAMVARHEQWMVQHGRVYKDETDKAHRFL 63

Query: 70  IFKENLEYIEKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTF 126
           +FK N+++IE  N     GNR + LG NQF+DLTNDEFRA  T         +  T   F
Sbjct: 64  VFKANVKFIESFNAAAAAGNRKFWLGVNQFADLTNDEFRATKTNKGFNPNVVKVPTG--F 121

Query: 127 KYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQ 186
           +YQNLS+  +P ++DWR KGAVTPIK+Q +CGCCWAF+AVAA EGI KI +G L  LSEQ
Sbjct: 122 RYQNLSIDALPQTVDWRTKGAVTPIKDQGQCGCCWAFSAVAATEGIVKISTGKLTSLSEQ 181

Query: 187 QLLDCSTNG-NNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISN 245
           +L+DC  +G + GC GG  + AF +II+N G+ TE  YPY A  G C +     AA I  
Sbjct: 182 ELVDCDVHGEDQGCNGGEMDDAFKFIIKNGGLTTESNYPYTAQDGQCKSGSN-GAATIKG 240

Query: 246 YEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTT 305
           YE+VP+ DE AL+KAV+ QPVS+A+      FQ Y  G+  G CGT LDH +  +G+G T
Sbjct: 241 YEDVPANDEAALMKAVASQPVSVAVDGGDMTFQFYSGGVMTGSCGTDLDHGIAAIGYGKT 300

Query: 306 EDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPLA 348
            DG  YWL+KNSWG TWG+ G++++ +D    +G+CG+  + SYP A
Sbjct: 301 SDGTKYWLMKNSWGTTWGENGFLRMEKDIADKKGMCGLAMQPSYPTA 347


>gi|400180349|gb|AFP73313.1| cysteine protease [Solanum peruvianum]
 gi|400180469|gb|AFP73371.1| cysteine protease [Solanum peruvianum]
 gi|400180471|gb|AFP73372.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score =  337 bits (864), Expect = 5e-90,   Method: Compositional matrix adjust.
 Identities = 169/345 (48%), Positives = 232/345 (67%), Gaps = 13/345 (3%)

Query: 10  SFKINTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLK 69
           + K++   + I +  ++S  +     RS  + SV E HE WM++HGR YKDE+EK  R  
Sbjct: 2   AMKVDLMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKVERFM 61

Query: 70  IFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMP----SPSHRSTTSST 125
           IFKEN+++IE  NK GN +YKLG N+F+D+T+ EF A +TG  +P    SPS  S+T   
Sbjct: 62  IFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTE-- 119

Query: 126 FKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSE 185
            K  +LS  D+P++LDWR+ GAVT +K+Q  CGCCWAF+AV ++EG  KI +GNL++ SE
Sbjct: 120 LKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSE 179

Query: 186 QQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISN 245
           Q+LLDC+TN N GC GG    AF +I +N GI+ E +Y Y     TC + +K AA +IS+
Sbjct: 180 QELLDCTTN-NYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQYTCRSQEKTAAVQISS 238

Query: 246 YEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTT 305
           Y+ VP G E +LL+AV+ QPVSI IAA S + Q Y  G ++G C  +++HAVT +G+GT 
Sbjct: 239 YQVVPEG-ETSLLQAVTKQPVSIGIAA-SQDLQFYAGGTYDGSCADRINHAVTAIGYGTD 296

Query: 306 EDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYP 346
           E G  YWL+KNSWG +WG+ G+MKI+RD     GLC I   SSYP
Sbjct: 297 EKGQKYWLLKNSWGTSWGENGFMKIIRDYGNPAGLCDIAKMSSYP 341


>gi|400180447|gb|AFP73360.1| cysteine protease [Solanum chilense]
          Length = 345

 Score =  337 bits (864), Expect = 5e-90,   Method: Compositional matrix adjust.
 Identities = 169/345 (48%), Positives = 233/345 (67%), Gaps = 12/345 (3%)

Query: 10  SFKINTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLK 69
           + K++   + I +  ++S  +     RS  + SV E HE WM++HGR YKDE+EK  R  
Sbjct: 2   AMKVDLMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFM 61

Query: 70  IFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMP----SPSHRSTTSST 125
           IFKEN+++IE  NK GN +YKLG N+F+D+T+ EF A +TG  +P    SPS  S+T   
Sbjct: 62  IFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEFK 121

Query: 126 FKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSE 185
            K  +LS  D+P++LDWR+ GAVT +K+Q +CGCCWAF+AV ++EG  KI +G L++ SE
Sbjct: 122 -KINDLSDDDMPSNLDWRESGAVTQVKHQGQCGCCWAFSAVGSLEGAYKIATGKLMEFSE 180

Query: 186 QQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISN 245
           Q+LLDC+TN N GC GG    AF +II+N GI+ E +Y Y     TC + +K AA +IS+
Sbjct: 181 QELLDCTTN-NYGCNGGFMTNAFDFIIENGGISRESDYEYLGEQYTCRSQEKTAAVQISS 239

Query: 246 YEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTT 305
           Y+ VP G E +LL+AV+ QPVSI IAA S + Q Y  G ++G C  +++HAVT +G+GT 
Sbjct: 240 YQVVPEG-ETSLLQAVTKQPVSIGIAA-SQDLQFYAGGTYDGSCADRINHAVTAIGYGTD 297

Query: 306 EDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYP 346
           E G  YWL+KNSWG +WG+ G+MKI+RD     GLC I   SSYP
Sbjct: 298 EKGQKYWLLKNSWGTSWGENGFMKIIRDSGNPSGLCDIAKMSSYP 342


>gi|20334377|gb|AAM19209.1|AF493234_1 cysteine protease [Solanum lycopersicum]
 gi|400180431|gb|AFP73353.1| cysteine protease [Solanum lycopersicum]
          Length = 345

 Score =  337 bits (864), Expect = 5e-90,   Method: Compositional matrix adjust.
 Identities = 170/345 (49%), Positives = 233/345 (67%), Gaps = 12/345 (3%)

Query: 10  SFKINTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLK 69
           + K++   + I +  ++S  +     RS  + SV E HE WM++HGR YKDE+EK  R  
Sbjct: 2   AMKVDLMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFM 61

Query: 70  IFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMP----SPSHRSTTSST 125
           IFKEN+++IE  NK GN +YKLG N+F+D+T+ EF A +TG  +P    SPS  S+T   
Sbjct: 62  IFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEFK 121

Query: 126 FKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSE 185
            K  +LS   +P++LDWR+ GAVT +K+Q  CGCCWAF+AV ++EG  KI +GNL++ SE
Sbjct: 122 -KINDLSDDYMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSE 180

Query: 186 QQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISN 245
           Q+LLDC+TN N GC GG    AF +II+N GI+ E +Y Y     TC + +K AA +IS+
Sbjct: 181 QELLDCTTN-NYGCNGGFMTNAFDFIIENGGISRESDYEYLGQQYTCRSQEKTAAVQISS 239

Query: 246 YEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTT 305
           Y+ VP G E +LL+AV+ QPVSI IAA S + Q Y  G ++G C  +++HAVT +G+GT 
Sbjct: 240 YQVVPEG-ETSLLQAVTKQPVSIGIAA-SQDLQFYAGGTYDGNCADRINHAVTAIGYGTD 297

Query: 306 EDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYP 346
           E+G  YWL+KNSWG +WG+ GYMKI+RD     GLC I   SSYP
Sbjct: 298 EEGQKYWLLKNSWGTSWGENGYMKIIRDSGDPSGLCDIAKMSSYP 342


>gi|400180371|gb|AFP73324.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score =  337 bits (863), Expect = 6e-90,   Method: Compositional matrix adjust.
 Identities = 169/345 (48%), Positives = 232/345 (67%), Gaps = 13/345 (3%)

Query: 10  SFKINTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLK 69
           + K++   + I +  ++S  +     RS  + SV E HE WM++HGR YKDE+EK  R  
Sbjct: 2   AMKVDLMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFM 61

Query: 70  IFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMP----SPSHRSTTSST 125
           IFKEN+++IE  NK GN +YKLG N+F+D+T+ EF A +TG  +P    SPS  S+T   
Sbjct: 62  IFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTE-- 119

Query: 126 FKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSE 185
           F   +LS  D+P++LDWR+ GAVT +K+Q  CGCCWAF+AV ++EG  KI +GNL++ SE
Sbjct: 120 FIINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSE 179

Query: 186 QQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISN 245
           Q+LLDC+TN N GC GG    AF +I +N GI+ E +Y Y     TC + +K AA +IS+
Sbjct: 180 QELLDCTTN-NYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQYTCRSQEKTAAVQISS 238

Query: 246 YEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTT 305
           Y+ VP G E +LL+AV+ QPVSI IAA S + Q Y  G ++G C  +++HAVT +G+GT 
Sbjct: 239 YQVVPEG-ETSLLQAVTKQPVSIGIAA-SQDLQFYAGGTYDGSCADRINHAVTAIGYGTD 296

Query: 306 EDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYP 346
           E G  YWL+KNSWG +WG+ G+MKI+RD     GLC I   SSYP
Sbjct: 297 EKGQKYWLLKNSWGTSWGENGFMKIIRDSGNPSGLCDIAKMSSYP 341


>gi|400180399|gb|AFP73338.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score =  337 bits (863), Expect = 6e-90,   Method: Compositional matrix adjust.
 Identities = 169/345 (48%), Positives = 232/345 (67%), Gaps = 13/345 (3%)

Query: 10  SFKINTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLK 69
           + K++   + I +  ++S  +     RS  + SV E HE WM++HGR YKDE+EK  R  
Sbjct: 2   AMKVDLMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFM 61

Query: 70  IFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMP----SPSHRSTTSST 125
           IFKEN+++IE  NK GN +YKLG N+F+D+T+ EF A +TG  +P    SPS  S+T   
Sbjct: 62  IFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTE-- 119

Query: 126 FKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSE 185
           F   +LS  D+P++LDWR+ GAVT +K+Q  CGCCWAF+AV ++EG  KI +GNL++ SE
Sbjct: 120 FIINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSE 179

Query: 186 QQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISN 245
           Q+LLDC+TN N GC GG    AF +I +N GI+ E +Y Y     TC + +K AA +IS+
Sbjct: 180 QELLDCTTN-NYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQYTCRSQEKTAAVQISS 238

Query: 246 YEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTT 305
           Y+ VP G E +LL+AV+ QPVSI IAA S + Q Y  G ++G C  +++HAVT +G+GT 
Sbjct: 239 YQVVPEG-ETSLLQAVTKQPVSIGIAA-SQDLQFYAGGTYDGSCADRINHAVTAIGYGTD 296

Query: 306 EDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYP 346
           E G  YWL+KNSWG +WG+ G+MKI+RD     GLC I   SSYP
Sbjct: 297 EKGQKYWLLKNSWGTSWGENGFMKIIRDSGDPSGLCDIAKMSSYP 341


>gi|400180391|gb|AFP73334.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score =  337 bits (863), Expect = 6e-90,   Method: Compositional matrix adjust.
 Identities = 169/345 (48%), Positives = 233/345 (67%), Gaps = 13/345 (3%)

Query: 10  SFKINTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLK 69
           + K++   + I +  ++S  +    +RS  + SV E HE WM++HGR YKDE+EK  R  
Sbjct: 2   AMKVDLMSILITLFFVISMFNSQTRARSQPKLSVSERHELWMSRHGRVYKDEVEKGERFM 61

Query: 70  IFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMP----SPSHRSTTSST 125
           IFKEN+++IE  NK GN +YKLG N+F+D+T+ EF A +TG  +P    SPS  S+T   
Sbjct: 62  IFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTE-- 119

Query: 126 FKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSE 185
           FK  +LS  D+P++LDWR+ GAVT +K+Q  CGCCWAF+AV ++EG  KI +GNL++ SE
Sbjct: 120 FKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSE 179

Query: 186 QQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISN 245
           Q+LLDC+TN N GC GG    AF +I +N GI+ E +Y Y     TC + +K AA +IS+
Sbjct: 180 QELLDCTTN-NYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQYTCRSQEKTAAVQISS 238

Query: 246 YEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTT 305
           Y+ VP G E +LL+AV+ QPVSI IAA S + Q    G ++G C  +++HAVT +G+GT 
Sbjct: 239 YQVVPEG-ETSLLQAVTKQPVSIGIAA-SQDLQFCAGGTYDGSCADRINHAVTAIGYGTD 296

Query: 306 EDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYP 346
           E G  YWL+KNSWG +WG+ G+MKI+RD     GLC I   SSYP
Sbjct: 297 EKGQKYWLLKNSWGTSWGENGFMKIIRDYGNPAGLCDIAKMSSYP 341


>gi|400180465|gb|AFP73369.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score =  337 bits (863), Expect = 7e-90,   Method: Compositional matrix adjust.
 Identities = 169/345 (48%), Positives = 232/345 (67%), Gaps = 13/345 (3%)

Query: 10  SFKINTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLK 69
           + K++   + I +  ++S  +     RS  + SV E HE WM++HGR YKDE+EK  R  
Sbjct: 2   AMKVDLMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFM 61

Query: 70  IFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMP----SPSHRSTTSST 125
           IFKEN+++IE  NK GN +YKLG N+F+D+T+ EF A +TG  +P    SPS  S+T   
Sbjct: 62  IFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTE-- 119

Query: 126 FKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSE 185
           FK  +LS  D+P++LDWR+ GAVT +K+Q  CGCCWAF+AV ++E   KI +GNL++ SE
Sbjct: 120 FKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEVAYKIATGNLMEFSE 179

Query: 186 QQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISN 245
           Q+LLDC+TN N GC GG    AF +I +N GI+ E +Y Y     TC + +K AA +IS+
Sbjct: 180 QELLDCTTN-NYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQYTCRSQEKTAAVQISS 238

Query: 246 YEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTT 305
           Y+ VP G E +LL+AV+ QPVSI IAA S + Q Y  G ++G C  +++HAVT +G+GT 
Sbjct: 239 YQVVPEG-ETSLLQAVTKQPVSIGIAA-SQDLQFYAGGTYDGSCADRINHAVTAIGYGTD 296

Query: 306 EDGANYWLIKNSWGNTWGDAGYMKIVRDE----GLCGIGTRSSYP 346
           E G  YWL+KNSWG +WG+ G+MKI+RD     GLC I   SSYP
Sbjct: 297 EKGQKYWLLKNSWGTSWGENGFMKIIRDSGNPAGLCDIAKMSSYP 341


>gi|400180461|gb|AFP73367.1| cysteine protease [Solanum peruvianum]
 gi|400180473|gb|AFP73373.1| cysteine protease [Solanum peruvianum]
 gi|400180475|gb|AFP73374.1| cysteine protease [Solanum peruvianum]
 gi|400180479|gb|AFP73376.1| cysteine protease [Solanum peruvianum]
 gi|400180481|gb|AFP73377.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score =  337 bits (863), Expect = 7e-90,   Method: Compositional matrix adjust.
 Identities = 169/345 (48%), Positives = 232/345 (67%), Gaps = 13/345 (3%)

Query: 10  SFKINTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLK 69
           + K++   + I +  ++S  +     RS  + SV E HE WM++HGR YKDE+EK  R  
Sbjct: 2   AMKVDLMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFM 61

Query: 70  IFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMP----SPSHRSTTSST 125
           IFKEN+++IE  NK GN +YKLG N+F+D+T+ EF A +TG  +P    SPS  S+T   
Sbjct: 62  IFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTE-- 119

Query: 126 FKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSE 185
           F   +LS  D+P++LDWR+ GAVT +K+Q  CGCCWAF+AV ++EG  KI +GNL++ SE
Sbjct: 120 FIINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSE 179

Query: 186 QQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISN 245
           Q+LLDC+TN N GC GG    AF +I +N GI+ E +Y Y     TC + +K AA +IS+
Sbjct: 180 QELLDCTTN-NYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQYTCRSQEKTAAVQISS 238

Query: 246 YEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTT 305
           Y+ VP G E +LL+AV+ QPVSI IAA S + Q Y  G ++G C  +++HAVT +G+GT 
Sbjct: 239 YQVVPEG-ETSLLQAVTKQPVSIGIAA-SQDLQFYAGGTYDGSCADRINHAVTAIGYGTD 296

Query: 306 EDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYP 346
           E G  YWL+KNSWG +WG+ G+MKI+RD     GLC I   SSYP
Sbjct: 297 EKGQKYWLLKNSWGTSWGENGFMKIIRDYGNPAGLCDIAKMSSYP 341


>gi|30690594|ref|NP_564321.2| cysteine proteinase-like protein [Arabidopsis thaliana]
 gi|28393492|gb|AAO42167.1| putative cysteine proteinase [Arabidopsis thaliana]
 gi|332192920|gb|AEE31041.1| cysteine proteinase-like protein [Arabidopsis thaliana]
          Length = 355

 Score =  336 bits (862), Expect = 9e-90,   Method: Compositional matrix adjust.
 Identities = 169/347 (48%), Positives = 233/347 (67%), Gaps = 15/347 (4%)

Query: 15  TTPMFIIITLLVSC----ASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKI 70
           T+ +F++++L +       SQ  S  + HE  V E H++WM +  R Y DELEK+MR  +
Sbjct: 11  TSILFMLVSLTILSMNLKVSQATSRVTFHEPIVAEHHQQWMTRFSRVYSDELEKQMRFDV 70

Query: 71  FKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYK----MPSPSHRSTTSSTF 126
           FK+NL++IEK NK+G+RTYKLG N+F+D T +EF A +TG K    +PS         ++
Sbjct: 71  FKKNLKFIEKFNKKGDRTYKLGVNEFADWTREEFIATHTGLKGVNGIPSSEFVDEMIPSW 130

Query: 127 KYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQ 186
            + N+S      + DWR +GAVTP+K Q +CGCCWAF++VAAVEG+TKI   NL+ LSEQ
Sbjct: 131 NW-NVSDVAGRETKDWRYEGAVTPVKYQGQCGCCWAFSSVAAVEGLTKIVGNNLVSLSEQ 189

Query: 187 QLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNY 246
           QLLDC    +NGC GG    AF+YII+N+GIA+E  YPYQA  GTC    KP+A  I  +
Sbjct: 190 QLLDCDRERDNGCNGGIMSDAFSYIIKNRGIASEASYPYQAAEGTCRYNGKPSAW-IRGF 248

Query: 247 EEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFN-GVCGTQLDHAVTIVGFGTT 305
           + VPS +E+ALL+AVS QPVS++I A    F  Y  G+++   CGT ++HAVT VG+GT+
Sbjct: 249 QTVPSNNERALLEAVSKQPVSVSIDADGPGFMHYSGGVYDEPYCGTNVNHAVTFVGYGTS 308

Query: 306 EDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPLA 348
            +G  YWL KNSWG TWG+ GY++I RD    +G+CG+   + YP+A
Sbjct: 309 PEGIKYWLAKNSWGETWGENGYIRIRRDVAWPQGMCGVAQYAFYPVA 355


>gi|77554625|gb|ABA97421.1| Vignain precursor, putative [Oryza sativa Japonica Group]
 gi|222630746|gb|EEE62878.1| hypothetical protein OsJ_17681 [Oryza sativa Japonica Group]
          Length = 350

 Score =  336 bits (861), Expect = 1e-89,   Method: Compositional matrix adjust.
 Identities = 162/346 (46%), Positives = 226/346 (65%), Gaps = 16/346 (4%)

Query: 15  TTPMFIIITLLVSC------ASQVVSSRSTH-EQSVVEIHEKWMAQHGRSYKDELEKEMR 67
           + P+ + I   + C       + V ++R    + ++   HE+WMAQHGR YKD  EK  R
Sbjct: 5   SKPLLLAILCCIVCLYSSSGGAIVAAARELGGDAAMAARHERWMAQHGRVYKDAAEKARR 64

Query: 68  LKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYK-MPSPSHRSTTSSTF 126
           L++FK N+ +IE  N  G   Y LG NQF+DLT++EF+A  T  K   +P++    S+ F
Sbjct: 65  LEVFKANVAFIESFNAGGKNRYWLGVNQFADLTSEEFKATMTNSKGFSTPNNGVRVSTGF 124

Query: 127 KYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQ 186
           KY+N+S   +P S+DWR KGAVT IK+Q +CGCCWAF+AVAA+EG  K+ +G LI LSEQ
Sbjct: 125 KYENVSADALPASVDWRTKGAVTRIKDQGQCGCCWAFSAVAAMEGFVKLSTGKLISLSEQ 184

Query: 187 QLLDCSTNGNN-GCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTC-SAAQKPAAAKIS 244
           +L+DC  +GN+ GC GG  + AF +I+ N G+  E  YPY A  G C + A    AA I 
Sbjct: 185 ELVDCDVDGNDQGCEGGEIDGAFQFILSNGGLTAEANYPYTAEDGRCKTTAAADVAASIR 244

Query: 245 NYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGT 304
            YE+VP+ DE +L+KAV+ QPVS+A+ A  ++FQ Y  G+  G CGT LDH VT++G+G 
Sbjct: 245 GYEDVPANDEPSLMKAVAGQPVSVAVDA--SKFQFYGGGVMAGECGTSLDHGVTVIGYGA 302

Query: 305 TEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYP 346
             DG  YWL+KNSWG TWG+AGY+++ +D     G+CG+  + SYP
Sbjct: 303 ASDGTKYWLVKNSWGTTWGEAGYLRMEKDIDDKRGMCGLAMQPSYP 348


>gi|400180361|gb|AFP73319.1| cysteine protease [Solanum peruvianum]
 gi|400180397|gb|AFP73337.1| cysteine protease [Solanum peruvianum]
 gi|400180401|gb|AFP73339.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score =  336 bits (861), Expect = 1e-89,   Method: Compositional matrix adjust.
 Identities = 169/345 (48%), Positives = 232/345 (67%), Gaps = 13/345 (3%)

Query: 10  SFKINTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLK 69
           + K++   + I +  ++S  +     RS  + SV E HE WM++HGR YKDE+EK  R  
Sbjct: 2   AMKVDLMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFM 61

Query: 70  IFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMP----SPSHRSTTSST 125
           IFKEN+++IE  NK GN +YKLG N+F+D+T+ EF A +TG  +P    SPS  S+T   
Sbjct: 62  IFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTE-- 119

Query: 126 FKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSE 185
           F   +LS  D+P++LDWR+ GAVT +K+Q  CGCCWAF+AV ++EG  KI +GNL++ SE
Sbjct: 120 FIINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSE 179

Query: 186 QQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISN 245
           Q+LLDC+TN N GC GG    AF +I +N GI+ E +Y Y     TC + +K AA +IS+
Sbjct: 180 QELLDCTTN-NYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQYTCRSQEKTAAVQISS 238

Query: 246 YEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTT 305
           Y+ VP G E +LL+AV+ QPVSI IAA S + Q Y  G ++G C  +++HAVT +G+GT 
Sbjct: 239 YQVVPEG-ETSLLQAVTKQPVSIGIAA-SQDLQFYAGGTYDGSCADRINHAVTAIGYGTD 296

Query: 306 EDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYP 346
           E G  YWL+KNSWG +WG+ G+MKI+RD     GLC I   SSYP
Sbjct: 297 EKGQKYWLLKNSWGTSWGENGFMKIIRDSGDPSGLCDITKMSSYP 341


>gi|125547256|gb|EAY93078.1| hypothetical protein OsI_14879 [Oryza sativa Indica Group]
          Length = 339

 Score =  336 bits (861), Expect = 1e-89,   Method: Compositional matrix adjust.
 Identities = 159/336 (47%), Positives = 220/336 (65%), Gaps = 9/336 (2%)

Query: 18  MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
           +F I++ L  C++ + +   +   ++V  HE+WM Q+GR YKD  EK  R +IFK N+ +
Sbjct: 8   LFAILSCLCLCSAVLAAREQSDHAAMVARHERWMEQYGRVYKDATEKARRFEIFKANVAF 67

Query: 78  IEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVP 137
           IE  N  GN  + LG NQF+DLTN EFRA  T       + R  T  TF+Y+N+S+  +P
Sbjct: 68  IESFNA-GNHKFWLGVNQFADLTNYEFRATKTNKGFIPSTVRVPT--TFRYENVSIDTLP 124

Query: 138 TSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-N 196
            ++DWR KGAVTPIK+Q +CGCCWAF+AVAA+EGI K+ +G LI LSEQ+L+DC  +G +
Sbjct: 125 ATVDWRTKGAVTPIKDQGQCGCCWAFSAVAAMEGIVKLSTGKLISLSEQELVDCDVHGED 184

Query: 197 NGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQA 256
            GC GG  + AF +II+N G+ TE +YPY A  G C+     +AA I  YEEVP+ +E A
Sbjct: 185 QGCEGGLMDDAFKFIIKNGGLTTESKYPYTAADGKCNGGSN-SAATIKGYEEVPANNEAA 243

Query: 257 LLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKN 316
           L+KAV+ QPVS+A+      FQ Y  G+  G CGT LDH +  +G+G   DG  YWL+KN
Sbjct: 244 LMKAVANQPVSVAVDGGDMTFQFYSGGVMTGSCGTDLDHGIVAIGYGKDGDGTQYWLLKN 303

Query: 317 SWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPLA 348
           SWG TWG+ G++++ +D     G+CG+    SYP A
Sbjct: 304 SWGTTWGENGFLRMEKDISDKRGMCGLAMEPSYPTA 339


>gi|255563134|ref|XP_002522571.1| cysteine protease, putative [Ricinus communis]
 gi|223538262|gb|EEF39871.1| cysteine protease, putative [Ricinus communis]
          Length = 343

 Score =  336 bits (861), Expect = 1e-89,   Method: Compositional matrix adjust.
 Identities = 172/338 (50%), Positives = 228/338 (67%), Gaps = 18/338 (5%)

Query: 20  IIITLLV---SCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLE 76
           ++ITLL+   +  SQ +     + +++ E HE+WMA+HGR+Y D  EKE R +IFK NL+
Sbjct: 10  LVITLLMILGTWVSQAMPRPLLNAEAIAEKHEQWMARHGRTYHDNAEKERRFQIFKNNLD 69

Query: 77  YIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPS--PSHRSTTSSTFKYQNLSMT 134
           YIE  NK  N+TYKLG N+FSDL+ +EF   Y GY+MP+  P+  +T   TF     +  
Sbjct: 70  YIENFNKAFNKTYKLGLNKFSDLSEEEFVTTYNGYEMPTTLPTANTTVKPTFFSNYYNQD 129

Query: 135 DVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN 194
           +VP S+DWR+ G VT +KNQ ECGCCWAF+AVAAVEGI    +GN   LS QQLLDC   
Sbjct: 130 EVPESIDWRENGVVTSVKNQGECGCCWAFSAVAAVEGI----AGNGASLSAQQLLDC-VG 184

Query: 195 GNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDE 254
            N+GC GG+  KAF YI+QNQGI ++ +YPY+     C +     AA+I+ YE V    E
Sbjct: 185 DNSGCGGGTMIKAFEYIVQNQGIVSDTDYPYEQTQEMCRSGSN-VAARITGYESVIQ-SE 242

Query: 255 QALLKAVSMQPVSIAIAAYS-TEFQSYKEGIFNGV-CGTQLDHAVTIVGFGTTEDGANYW 312
           +AL +AV+ QP+S+AI A S   F+SY  G+F+   CGT L HAVT+VG+GTTEDG  YW
Sbjct: 243 EALKRAVAKQPISVAIDASSGPNFKSYISGVFSAEDCGTHLTHAVTLVGYGTTEDGTKYW 302

Query: 313 LIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYP 346
           L+KNSWG  WG++GYM++ RD    EG CGI  ++SYP
Sbjct: 303 LVKNSWGEEWGESGYMRLQRDVGAMEGPCGIAMQASYP 340


>gi|356515036|ref|XP_003526207.1| PREDICTED: thiol protease SEN102-like [Glycine max]
          Length = 336

 Score =  335 bits (860), Expect = 1e-89,   Method: Compositional matrix adjust.
 Identities = 164/321 (51%), Positives = 219/321 (68%), Gaps = 13/321 (4%)

Query: 33  VSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLG 92
           V  R  HE S+ E HE+WM ++G+ YKD  EK+ R +IFK+N+E+IE  N +GN+ YKLG
Sbjct: 24  VMCRKLHETSMRERHEQWMTEYGKVYKDAAEKDKRFQIFKDNVEFIESFNADGNKPYKLG 83

Query: 93  TNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIK 152
            N  +DLT +EF+A   G+K P   H  +T +TFKY+N+  T +P ++DWR KGAVTPIK
Sbjct: 84  VNHLADLTVEEFKASRNGFKRP---HEFST-TTFKYENV--TAIPAAIDWRTKGAVTPIK 137

Query: 153 NQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-NNGCLGGSREKAFAYI 211
           +Q +CG CWAF+ +AA EGI +I +G L+ LSEQ+L+DC T G + GC GG  E  F +I
Sbjct: 138 DQGQCGSCWAFSTIAATEGIHQITTGKLVSLSEQELVDCDTKGVDQGCEGGYMEDGFEFI 197

Query: 212 IQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIA 271
           I+N GI +E  YPY+AV G C+ A  P  A+I  YE+VP   E AL KAV+ QPVS++I 
Sbjct: 198 IKNGGITSETNYPYKAVDGKCNKATSP-VAQIKGYEKVPPNSETALQKAVANQPVSVSID 256

Query: 272 AYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIV 331
           A    F  Y  GI+NG CGT+LDH VT VG+GT  +G +YW++KNSWG  WG+ GY+++ 
Sbjct: 257 ADGAGFMFYSSGIYNGECGTELDHGVTAVGYGTA-NGTDYWIVKNSWGTQWGEKGYVRMQ 315

Query: 332 R----DEGLCGIGTRSSYPLA 348
           R      GLCGI   SSYP +
Sbjct: 316 RGIAAKHGLCGIALDSSYPTS 336


>gi|388512155|gb|AFK44139.1| unknown [Medicago truncatula]
          Length = 340

 Score =  335 bits (859), Expect = 2e-89,   Method: Compositional matrix adjust.
 Identities = 164/322 (50%), Positives = 220/322 (68%), Gaps = 12/322 (3%)

Query: 33  VSSRSTHEQ-SVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKL 91
           V SR  +E  S+ E HE+WM+++G+ YKD +EKE R  IFK+N+E+IE  N   N+ YKL
Sbjct: 25  VMSRKLYESPSLQERHEQWMSEYGKLYKDAIEKEKRFMIFKDNVEFIESFNAADNKPYKL 84

Query: 92  GTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPI 151
             N  +DLT DEF+A   GYK      R   +++FKY+N+  T +P ++DWR KGAVTPI
Sbjct: 85  SVNHLADLTLDEFKASRNGYK---KIDREFATTSFKYENV--TAIPEAVDWRVKGAVTPI 139

Query: 152 KNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-NNGCLGGSREKAFAY 210
           K+Q +CG CWAF+ VAA+EGI +I +G LI LSEQ+L+DC T G + GC GG  E  F +
Sbjct: 140 KDQGQCGSCWAFSTVAAIEGINQITTGKLISLSEQELVDCDTKGEDQGCEGGLMEDGFEF 199

Query: 211 IIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAI 270
           II+N GI +E  YPY+A  G+CSAA     AKI+ YE+VP   E +LLKAV+ QP+S++I
Sbjct: 200 IIKNGGITSETNYPYKAADGSCSAATTAPVAKITGYEKVPVNSEISLLKAVANQPISVSI 259

Query: 271 AAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKI 330
            A  + F  Y  GI+ G CGT+LDH VT VG+G+  +G +YW++KNSWG  WG+ GY+++
Sbjct: 260 DASDSSFMFYSSGIYTGECGTELDHGVTAVGYGSA-NGTDYWIVKNSWGTVWGEKGYIRM 318

Query: 331 VR----DEGLCGIGTRSSYPLA 348
            R     EGLCGI   SSYP A
Sbjct: 319 QRGIADKEGLCGIAMDSSYPTA 340


>gi|350535639|ref|NP_001233949.1| phytophthora-inhibited protease 1 [Solanum lycopersicum]
 gi|108937128|gb|ABG23376.1| phytophthora-inhibited protease 1 [Solanum lycopersicum]
          Length = 345

 Score =  335 bits (859), Expect = 2e-89,   Method: Compositional matrix adjust.
 Identities = 168/348 (48%), Positives = 232/348 (66%), Gaps = 12/348 (3%)

Query: 8   SGSFKINTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMR 67
           S  F  N T + ++ ++L S    +V+SR+  E S++E HE WM  HGR YKD++EKE R
Sbjct: 3   SNFFLKNITVVLLLFSIL-SLYPFIVTSRNLKELSMLERHENWMVHHGRVYKDDIEKEHR 61

Query: 68  LKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFK 127
            K FKEN+E+IE  NK G + YKL  N+++DLT +EF   + G      S + +T++T  
Sbjct: 62  FKTFKENVEFIESFNKNGTQRYKLAVNKYADLTTEEFTTSFMGLDTSLLSQQESTATTTS 121

Query: 128 YQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQ 187
           ++  S+T+VP S+DWR +G+VT +K+Q  CGCCWAF+A AA+EG  +I +  LI LSEQQ
Sbjct: 122 FKYDSVTEVPNSMDWRKRGSVTGVKDQGVCGCCWAFSAAAAIEGAYQIANNELISLSEQQ 181

Query: 188 LLDCSTNGNNGCLGGSREKAFAYIIQNQ--GIATEDEYPYQAVPGTCSAAQKPAAAKISN 245
           LLDCST  N GC GG    A+ +++QN   GI TE  YPY+     C   Q PAA  I+ 
Sbjct: 182 LLDCSTQ-NKGCEGGLMTVAYDFLLQNNGGGITTETNYPYEEAQNVCKTEQ-PAAVTING 239

Query: 246 YEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTT 305
           YE VPS DE +LLKAV  QP+S+ IAA + EF  Y  GI++G C ++L+HAVT++G+GT+
Sbjct: 240 YEVVPS-DESSLLKAVVNQPISVGIAA-NDEFHMYGSGIYDGSCNSRLNHAVTVIGYGTS 297

Query: 306 -EDGANYWLIKNSWGNTWGDAGYMKIVRDEGL----CGIGTRSSYPLA 348
            EDG  YW++KNSWG+ WG+ GYM+I RD G+    CGI   +S+P A
Sbjct: 298 EEDGTKYWIVKNSWGSDWGEEGYMRIARDVGVDGGHCGIAKVASFPTA 345


>gi|38346003|emb|CAD40112.2| OSJNBa0035O13.5 [Oryza sativa Japonica Group]
 gi|125589427|gb|EAZ29777.1| hypothetical protein OsJ_13835 [Oryza sativa Japonica Group]
          Length = 339

 Score =  335 bits (858), Expect = 2e-89,   Method: Compositional matrix adjust.
 Identities = 158/336 (47%), Positives = 220/336 (65%), Gaps = 9/336 (2%)

Query: 18  MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
           +F I++ L  C++ + +   +   ++V  HE+WM Q+GR YKD  EK  R +IFK N+ +
Sbjct: 8   LFAILSCLCLCSAVLAAREQSDHAAMVARHERWMEQYGRVYKDATEKARRFEIFKANVAF 67

Query: 78  IEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVP 137
           IE  N  GN  + LG NQF+DLTN EFRA  T       + R  T  TF+Y+N+S+  +P
Sbjct: 68  IESFNA-GNHKFWLGVNQFADLTNYEFRATKTNKGFIPSTVRVPT--TFRYENVSIDTLP 124

Query: 138 TSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-N 196
            ++DWR KGAVTPIK+Q +CGCCWAF+AVAA+EGI K+ +G LI LSEQ+L+DC  +G +
Sbjct: 125 ATVDWRTKGAVTPIKDQGQCGCCWAFSAVAAMEGIVKLSTGKLISLSEQELVDCDVHGED 184

Query: 197 NGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQA 256
            GC GG  + AF +II+N G+ TE +YPY A  G C+     +AA I  YE+VP+ +E A
Sbjct: 185 QGCEGGLMDDAFKFIIKNGGLTTESKYPYTAADGKCNGGSN-SAATIKGYEDVPANNEAA 243

Query: 257 LLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKN 316
           L+KAV+ QPVS+A+      FQ Y  G+  G CGT LDH +  +G+G   DG  YWL+KN
Sbjct: 244 LMKAVANQPVSVAVDGGDMTFQFYSGGVMTGSCGTDLDHGIVAIGYGKDGDGTQYWLLKN 303

Query: 317 SWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPLA 348
           SWG TWG+ G++++ +D     G+CG+    SYP A
Sbjct: 304 SWGTTWGENGFLRMEKDISDKRGMCGLAMEPSYPTA 339


>gi|357458911|ref|XP_003599736.1| Cysteine proteinase [Medicago truncatula]
 gi|357474719|ref|XP_003607644.1| Cysteine proteinase [Medicago truncatula]
 gi|355488784|gb|AES69987.1| Cysteine proteinase [Medicago truncatula]
 gi|355508699|gb|AES89841.1| Cysteine proteinase [Medicago truncatula]
          Length = 340

 Score =  334 bits (857), Expect = 3e-89,   Method: Compositional matrix adjust.
 Identities = 162/322 (50%), Positives = 219/322 (68%), Gaps = 12/322 (3%)

Query: 33  VSSRSTHEQ-SVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKL 91
           V SR  +E  S+ E HE+WM +HG+ Y+D +EKE R  IFK+N+E+IE  N   N+ YKL
Sbjct: 25  VMSRKLYESLSLQERHEQWMTEHGKVYEDAIEKEKRFMIFKDNVEFIESFNAADNQPYKL 84

Query: 92  GTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPI 151
             N  +DLT DEF+A   GYK      R  T+++FKY+N+  T +P ++DWR KGAVTPI
Sbjct: 85  SVNHLADLTLDEFKASRNGYK---KIDREFTTTSFKYENV--TAIPAAVDWRVKGAVTPI 139

Query: 152 KNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-NNGCLGGSREKAFAY 210
           K+Q +CG CWAF+ VAA EGI +I +G L+ LSEQ+L+DC T G + GC GG  E  F +
Sbjct: 140 KDQGQCGSCWAFSTVAATEGINQITTGKLVSLSEQELVDCDTKGEDQGCEGGLMEDGFEF 199

Query: 211 IIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAI 270
           II+N GI +E  YPY+A  G+C+ A     AKI+ YE+VP   E++LLKAV+ QP+S++I
Sbjct: 200 IIKNGGITSETNYPYKAADGSCNTATTTPVAKITGYEKVPVNSEKSLLKAVANQPISVSI 259

Query: 271 AAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKI 330
            A  + F  Y  GI+ G CGT+LDH VT VG+G+  +G +YW++KNSWG  WG+ GY+++
Sbjct: 260 DASDSSFMFYSSGIYTGECGTELDHGVTAVGYGSA-NGTDYWIVKNSWGTVWGEKGYIRM 318

Query: 331 VR----DEGLCGIGTRSSYPLA 348
            R     EGLCGI   SSYP A
Sbjct: 319 QRGIAAKEGLCGIAMDSSYPTA 340


>gi|226508570|ref|NP_001141984.1| uncharacterized protein LOC100274134 precursor [Zea mays]
 gi|194706676|gb|ACF87422.1| unknown [Zea mays]
 gi|413920745|gb|AFW60677.1| vignain [Zea mays]
          Length = 363

 Score =  334 bits (857), Expect = 3e-89,   Method: Compositional matrix adjust.
 Identities = 158/330 (47%), Positives = 222/330 (67%), Gaps = 7/330 (2%)

Query: 22  ITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKA 81
           +   V  ++ V  +    E  ++  ++KWMAQ+ R YKD+ EK  R ++FK N E+I+++
Sbjct: 34  VAARVEPSTTVGRTTGGDEAMMMARYKKWMAQYRRKYKDDAEKAHRFQVFKANAEFIDRS 93

Query: 82  NKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPS--PS-HRSTTSSTFKYQNLSMTDVPT 138
           N  G + Y LGTNQF+DLT+ EF A+YTG + P+  PS  +   ++  KYQN +  D   
Sbjct: 94  NAGGKKKYVLGTNQFADLTSKEFAAMYTGLRKPAAVPSGAKQIPAAGSKYQNFTRLDDDV 153

Query: 139 SLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS-TNGNN 197
            +DWR +GAVTP+KNQ +CGCCWAF+AV A+EG+  I +GNL+ LSEQQ+LDC  ++GN 
Sbjct: 154 QVDWRQQGAVTPVKNQGQCGCCWAFSAVGAMEGLIMITTGNLVSLSEQQILDCDESDGNQ 213

Query: 198 GCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQAL 257
           GC GG  + AF Y+I N G+ TED YPY AV GTC   Q   AA IS ++++PSGDE AL
Sbjct: 214 GCNGGYMDNAFQYVINNGGVTTEDAYPYSAVQGTCQNVQP--AATISGFQDLPSGDENAL 271

Query: 258 LKAVSMQPVSIAIAAYSTEFQSYKEGIFNG-VCGTQLDHAVTIVGFGTTEDGANYWLIKN 316
             AV+ QPVS+ +   S+ FQ Y+ GI++G  CGT ++HAVT +G+G  + G  YW++KN
Sbjct: 272 ANAVANQPVSVGVDGGSSPFQFYQGGIYDGDGCGTDMNHAVTAIGYGADDQGTQYWILKN 331

Query: 317 SWGNTWGDAGYMKIVRDEGLCGIGTRSSYP 346
           SWG  WG+ G+M++    G CGI T +SYP
Sbjct: 332 SWGTGWGENGFMQLQMGVGACGISTMASYP 361


>gi|413917937|gb|AFW57869.1| hypothetical protein ZEAMMB73_830006 [Zea mays]
          Length = 443

 Score =  333 bits (855), Expect = 5e-89,   Method: Compositional matrix adjust.
 Identities = 157/326 (48%), Positives = 222/326 (68%), Gaps = 12/326 (3%)

Query: 19  FIIITLLV-SCA---SQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKEN 74
           F++++++  +CA   S      +  +Q++V  HE+WMA++ R Y D  EK  R ++FK N
Sbjct: 9   FVLLSVVAWACALSGSLAARDLADQDQAMVARHEEWMAKYDRVYSDAAEKARRFEVFKAN 68

Query: 75  LEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYK----MPSPSHRSTTSST-FKYQ 129
           +  IE  N  GN  + L  N+F+DLT+DEFRA +TGY+      S   RS T++T FKY 
Sbjct: 69  MALIESVNA-GNHKFWLEANRFADLTDDEFRATWTGYRPKTAAASSKGRSRTATTGFKYA 127

Query: 130 NLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLL 189
           N+S+ DVP S+DWR KGAVTPIKNQ ECGCCWAF+AVA++EG+ K+ +G L+ LSEQ+L+
Sbjct: 128 NVSLDDVPASVDWRTKGAVTPIKNQGECGCCWAFSAVASMEGVVKLSTGKLVSLSEQELV 187

Query: 190 DCSTNG-NNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPA-AAKISNYE 247
           DC  NG + GC GG  + AF +I+ N G+ TE  YPY A  GTC++ +    AA I  YE
Sbjct: 188 DCDVNGMDQGCEGGEMDDAFDFIVGNGGLTTESRYPYTASDGTCNSNEASGDAASIKGYE 247

Query: 248 EVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTED 307
           +VP+ DE +L KAV+ QPVS+A+    + F+ YK G+ +G CGT+LDH +  VG+G   D
Sbjct: 248 DVPANDEASLRKAVANQPVSVAVDGGDSHFRFYKGGVLSGACGTELDHGIAAVGYGVASD 307

Query: 308 GANYWLIKNSWGNTWGDAGYMKIVRD 333
           G  YW++KNSWG +WG+AGY+++ RD
Sbjct: 308 GTKYWVMKNSWGTSWGEAGYIRMERD 333


>gi|400180387|gb|AFP73332.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score =  333 bits (855), Expect = 5e-89,   Method: Compositional matrix adjust.
 Identities = 168/345 (48%), Positives = 231/345 (66%), Gaps = 13/345 (3%)

Query: 10  SFKINTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLK 69
           + K++   + I +  ++S  +     RS  + SV E HE WM++HGR YKDE+EK  R  
Sbjct: 2   AMKVDLMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKVERFM 61

Query: 70  IFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMP----SPSHRSTTSST 125
           IFKEN+++IE  NK GN +YKLG N+F+D+T+ EF A +TG  +P    SPS  S+T   
Sbjct: 62  IFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTE-- 119

Query: 126 FKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSE 185
            K  +LS  D+P++LDW + GAVT +K+Q  CGCCWAF+AV ++EG  KI +GNL++ SE
Sbjct: 120 LKINDLSDDDMPSNLDWIESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSE 179

Query: 186 QQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISN 245
           Q+LLDC+TN N GC GG    AF +I +N GI+ E +Y Y     TC + +K AA +IS+
Sbjct: 180 QELLDCTTN-NYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQYTCRSQEKTAAVQISS 238

Query: 246 YEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTT 305
           Y+ VP G E +LL+AV+ QPVSI IAA S + Q Y  G ++G C  +++HAVT +G+GT 
Sbjct: 239 YQVVPEG-ETSLLQAVTKQPVSIGIAA-SQDLQFYAGGTYDGSCADRINHAVTAIGYGTD 296

Query: 306 EDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYP 346
           E G  YWL+KNSWG +WG+ G+MKI+RD     GLC I   SSYP
Sbjct: 297 EKGQKYWLLKNSWGTSWGENGFMKIIRDYGNPAGLCDIAKMSSYP 341


>gi|302143416|emb|CBI21977.3| unnamed protein product [Vitis vinifera]
          Length = 297

 Score =  333 bits (855), Expect = 5e-89,   Method: Compositional matrix adjust.
 Identities = 164/304 (53%), Positives = 220/304 (72%), Gaps = 13/304 (4%)

Query: 51  MAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTG 110
           MA++GR YKD  EKE R KIFK+N+  IE  NK  ++TYKL  N+F+DLTN+EFR+L   
Sbjct: 1   MARYGRMYKDANEKEKRFKIFKDNVARIESFNKAMDKTYKLSINEFADLTNEEFRSLRNR 60

Query: 111 YKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVE 170
           +K    +H  + ++TFKY+N+  T VP+++DWR KGAVTPIK+Q++CGCCWAF+AVAA E
Sbjct: 61  FK----AHICSEATTFKYENV--TAVPSTIDWRKKGAVTPIKDQQQCGCCWAFSAVAATE 114

Query: 171 GITKIRSGNLIQLSEQQLLDCSTNG-NNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVP 229
           GIT+I +G LI LSEQ+L+DC T G N GC GG  + AF + I+  G+A+E  YPY+   
Sbjct: 115 GITQITTGKLISLSEQELVDCDTGGENQGCSGGLMDDAFRF-IKIHGLASEATYPYEGDD 173

Query: 230 GTCSAAQKP-AAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGV 288
           GTC++ ++   AAKI  YE+VP+ +E+AL KAV+ QPV++AI A   EFQ Y  G+F G 
Sbjct: 174 GTCNSKKEAHPAAKIKGYEDVPANNEKALQKAVAHQPVAVAIDAGGFEFQFYTSGVFTGQ 233

Query: 289 CGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSS 344
           CGT+LDH V  VG+G  +DG  YWL+KNSWG  WG+ GY+++ RD    EGLCGI  ++S
Sbjct: 234 CGTELDHGVAAVGYGIGDDGMMYWLVKNSWGTGWGEEGYIRMQRDVTAKEGLCGIAMQAS 293

Query: 345 YPLA 348
           YP A
Sbjct: 294 YPTA 297


>gi|20334375|gb|AAM19208.1|AF493233_1 cysteine protease [Solanum pennellii]
          Length = 337

 Score =  333 bits (855), Expect = 6e-89,   Method: Compositional matrix adjust.
 Identities = 167/341 (48%), Positives = 231/341 (67%), Gaps = 12/341 (3%)

Query: 10  SFKINTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLK 69
           + KI+   + I +  ++S  +     RS  + SV E HE WM++HGR YKDE+EK  R  
Sbjct: 2   AMKIDLMSILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFM 61

Query: 70  IFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQ 129
           IFKEN+++IE  NK GN +YKLG N+F+D+T+ EF A +TG  +P+ S+ S +       
Sbjct: 62  IFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPN-SYLSPS----PIN 116

Query: 130 NLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLL 189
           +LS  D+P++LDWR+ GAVT +KNQ +CGCCWAF+AV ++EG  KI +GNL++ SEQ+LL
Sbjct: 117 DLSDDDMPSNLDWRESGAVTQVKNQGQCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELL 176

Query: 190 DCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEV 249
           DC+TN N GC GG    AF +I +N GI+ E +Y Y     TC + +K AA +IS+Y+ V
Sbjct: 177 DCTTN-NYGCNGGFMTNAFDFIKENGGISRESDYEYLGQQYTCRSQEKTAAVQISSYQVV 235

Query: 250 PSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGA 309
           P G E +LL+AV+ QPVSI IAA S + Q Y  G ++G C  +++HAVT +G+GT E G 
Sbjct: 236 PEG-ETSLLQAVTKQPVSIGIAA-SQDLQFYAGGTYDGSCANRINHAVTAIGYGTDEKGQ 293

Query: 310 NYWLIKNSWGNTWGDAGYMKIVRDE----GLCGIGTRSSYP 346
            YWL+KNSWG +WG+ G+MKI+RD     GLC I   SSYP
Sbjct: 294 KYWLLKNSWGTSWGEDGFMKIIRDSGNPAGLCDIAKVSSYP 334


>gi|224093956|ref|XP_002310053.1| predicted protein [Populus trichocarpa]
 gi|224147016|ref|XP_002336386.1| predicted protein [Populus trichocarpa]
 gi|222834869|gb|EEE73318.1| predicted protein [Populus trichocarpa]
 gi|222852956|gb|EEE90503.1| predicted protein [Populus trichocarpa]
          Length = 340

 Score =  333 bits (855), Expect = 6e-89,   Method: Compositional matrix adjust.
 Identities = 165/340 (48%), Positives = 232/340 (68%), Gaps = 18/340 (5%)

Query: 19  FIIITLLVSCAS--QVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLE 76
           F+ + LL    +     ++R+  +  + E HE+WM Q+GR YKD+ E+  R  IFKEN+ 
Sbjct: 9   FVCLALLFILGAWPSKSTARTLLDAPMYERHEQWMTQYGRVYKDDNERATRYSIFKENVA 68

Query: 77  YIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYK--MPSPSHRSTTSSTFKYQNLSMT 134
            I+  N +  ++YKLG NQF+DLTN+EF+A    +K  M SP      +  F+Y+N+S  
Sbjct: 69  RIDAFNSQTGKSYKLGVNQFADLTNEEFKASRNRFKGHMCSPQ-----AGPFRYENVSA- 122

Query: 135 DVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN 194
            VP+++DWR +GAVTP+K+Q +CGCCWAF+AVAA+EGI K+ +G LI LSEQ+++DC T 
Sbjct: 123 -VPSTVDWRKEGAVTPVKDQGQCGCCWAFSAVAAMEGINKLTTGKLISLSEQEVVDCDTK 181

Query: 195 G-NNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSG 252
           G + GC GG  + AF +I QN+G+ TE  YPY+   GTC+  +    AAKI+ +E+VP+ 
Sbjct: 182 GEDQGCNGGLMDDAFKFIEQNKGLTTEANYPYKGTDGTCNTNKAAIHAAKITGFEDVPAN 241

Query: 253 DEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYW 312
            E AL+KAV+ QPVS+AI A  ++FQ Y  GIF G C TQLDH VT VG+G + DG+ YW
Sbjct: 242 SEAALMKAVAKQPVSVAIDAGGSDFQFYSSGIFTGSCDTQLDHGVTAVGYGVS-DGSKYW 300

Query: 313 LIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPLA 348
           L+KNSWG  WG+ GY+++ +D    EGLCGI  ++SYP A
Sbjct: 301 LVKNSWGAQWGEEGYIRMQKDISAKEGLCGIAMQASYPTA 340


>gi|297740489|emb|CBI30671.3| unnamed protein product [Vitis vinifera]
          Length = 320

 Score =  333 bits (854), Expect = 7e-89,   Method: Compositional matrix adjust.
 Identities = 173/338 (51%), Positives = 226/338 (66%), Gaps = 32/338 (9%)

Query: 19  FIIITLLVS--CASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLE 76
            I ITLL+    ASQ +S R+ HE S+ E HE WM  +GR+YKD  EKE R KIFKEN+E
Sbjct: 7   IICITLLIMGVWASQALS-RTLHEVSMSERHEDWMGLYGRTYKDIAEKERRFKIFKENVE 65

Query: 77  YIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDV 136
           YIE  NK                    F+A   GY M S   RS+  ++F+Y+N++   V
Sbjct: 66  YIESVNK--------------------FKASRNGYNMSSRP-RSSEITSFRYENVAA--V 102

Query: 137 PTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG- 195
           P+S+DWR KGAVTPIK+Q +CGCCWAF+AVAA+EG+T++++G LI LSEQ+L+DC T+G 
Sbjct: 103 PSSMDWRKKGAVTPIKDQGQCGCCWAFSAVAAMEGVTQLKTGELISLSEQELVDCDTSGE 162

Query: 196 NNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAK-ISNYEEVPSGDE 254
           + GC GG  + AF +II N G+ TE  YPY+ V  TC+  +  ++A  I NYE+VP+  E
Sbjct: 163 DQGCGGGLMDSAFEFIIGNGGLTTEANYPYKGVDATCNKKKAASSAAKIKNYEDVPANSE 222

Query: 255 QALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLI 314
            ALLKAV+  PVS+AI A  ++FQ Y  G+F G CGT+LDH VT VG+G T+DG  YWL+
Sbjct: 223 AALLKAVAQHPVSVAIDAGGSDFQFYSSGVFTGQCGTELDHGVTAVGYGKTDDGTKYWLV 282

Query: 315 KNSWGNTWGDAGYMKIVR----DEGLCGIGTRSSYPLA 348
           KNSWG  WG+ GY+ + R    DEGLCGI   +SYP A
Sbjct: 283 KNSWGTGWGEDGYIWMERDIGADEGLCGIAMEASYPTA 320


>gi|400180437|gb|AFP73356.1| cysteine protease [Solanum pennellii]
          Length = 337

 Score =  333 bits (853), Expect = 9e-89,   Method: Compositional matrix adjust.
 Identities = 166/341 (48%), Positives = 231/341 (67%), Gaps = 12/341 (3%)

Query: 10  SFKINTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLK 69
           + K++   + I +  ++S  +     RS  + SV E HE WM++HGR YKDE+EK  R  
Sbjct: 2   AMKVDLMSILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFM 61

Query: 70  IFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQ 129
           IFKEN+++IE  NK GN +YKLG N+F+D+T+ EF A +TG  +P+ S+ S +       
Sbjct: 62  IFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPN-SYLSPS----PIN 116

Query: 130 NLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLL 189
           +LS  D+P++LDWR+ GAVT +KNQ +CGCCWAF+AV ++EG  KI +GNL++ SEQ+LL
Sbjct: 117 DLSDDDMPSNLDWRESGAVTQVKNQGQCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELL 176

Query: 190 DCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEV 249
           DC+TN N GC GG    AF +I +N GI+ E +Y Y     TC + +K AA +IS+Y+ V
Sbjct: 177 DCTTN-NYGCNGGFMTNAFDFIKENGGISRESDYEYLGQQYTCRSQEKTAAVQISSYQVV 235

Query: 250 PSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGA 309
           P G E +LL+AV+ QPVSI IAA S + Q Y  G ++G C  +++HAVT +G+GT E G 
Sbjct: 236 PEG-ETSLLQAVTKQPVSIGIAA-SQDLQFYAGGTYDGSCANRINHAVTAIGYGTDEKGQ 293

Query: 310 NYWLIKNSWGNTWGDAGYMKIVRDE----GLCGIGTRSSYP 346
            YWL+KNSWG +WG+ G+MKI+RD     GLC I   SSYP
Sbjct: 294 KYWLLKNSWGTSWGEDGFMKIIRDSGNPAGLCDIAKVSSYP 334


>gi|356543122|ref|XP_003540012.1| PREDICTED: LOW QUALITY PROTEIN: vignain-like [Glycine max]
          Length = 342

 Score =  332 bits (852), Expect = 1e-88,   Method: Compositional matrix adjust.
 Identities = 167/330 (50%), Positives = 222/330 (67%), Gaps = 16/330 (4%)

Query: 28  CASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNR 87
           C SQV  SR  H+ S+ E HE+WM ++G+ YKD  E E R  IF+ N+E+IE  N  GN+
Sbjct: 20  CTSQV-KSRKLHDASMYERHEQWMEKYGKVYKDSAEXEKRFLIFENNVEFIESFNAAGNK 78

Query: 88  TYKLGTNQFSDLTNDEFRALYTGYKMPSPSH----RSTTSSTFKYQNLSMTDVPTSLDWR 143
            YKL  N  +D TN+EF A + GYK    SH    R TT + FKY+N+  TD+P ++DWR
Sbjct: 79  PYKLSINHLADQTNEEFMASHKGYK---GSHWQGLRITTQTPFKYENV--TDIPWAVDWR 133

Query: 144 DKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGS 203
            KG  T IK+Q +CG CWAF+AVAA EGI +I +GNL+ LSEQ+L+DC +  ++GC GG 
Sbjct: 134 QKGDATSIKDQGQCGICWAFSAVAATEGIYQITTGNLVSLSEQELVDCDSV-DHGCDGGL 192

Query: 204 REKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVS 262
            E  F +II+N GI++E  YPY AV GTC   ++ +  A+I  YE VP   E+ L KAV+
Sbjct: 193 MEHGFEFIIKNGGISSEANYPYTAVNGTCDTNKEASPGAQIKGYETVPVNCEEELQKAVA 252

Query: 263 MQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTW 322
            QPVS++I A  + FQ Y  G+F G CGTQLDH VT VG+G+T+DG  YW++KNSWG  W
Sbjct: 253 NQPVSVSIDAGGSAFQFYSSGVFTGQCGTQLDHGVTAVGYGSTDDGIQYWIVKNSWGTQW 312

Query: 323 GDAGYMKIVR----DEGLCGIGTRSSYPLA 348
           G+ GY++++R     EGLCGI   +SYP A
Sbjct: 313 GEEGYIRMLRGIDAQEGLCGIAMDASYPTA 342


>gi|116309130|emb|CAH66233.1| H0825G02.10 [Oryza sativa Indica Group]
          Length = 339

 Score =  332 bits (852), Expect = 1e-88,   Method: Compositional matrix adjust.
 Identities = 157/336 (46%), Positives = 219/336 (65%), Gaps = 9/336 (2%)

Query: 18  MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
           +F I++ L  C++ + +   +   ++V  HE+WM Q+GR YKD  EK  R +IFK N+ +
Sbjct: 8   LFAILSCLCLCSAVLAAREQSDHAAMVARHERWMEQYGRVYKDATEKARRFEIFKANVAF 67

Query: 78  IEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVP 137
           IE  N  GN  + L  NQF+DLTN EFRA  T       + R  T  TF+Y+N+S+  +P
Sbjct: 68  IESFNA-GNHKFWLSVNQFADLTNYEFRATKTNKGFIPSTVRVPT--TFRYENVSIDTLP 124

Query: 138 TSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-N 196
            ++DWR KGAVTPIK+Q +CGCCWAF+AVAA+EGI K+ +G LI LSEQ+L+DC  +G +
Sbjct: 125 ATVDWRTKGAVTPIKDQGQCGCCWAFSAVAAMEGIVKLSTGKLISLSEQELVDCDVHGED 184

Query: 197 NGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQA 256
            GC GG  + AF +II+N G+ TE +YPY A  G C+     +AA I  YE+VP+ +E A
Sbjct: 185 QGCEGGLMDDAFKFIIKNGGLTTESKYPYTAADGKCNGGSN-SAATIKGYEDVPANNEAA 243

Query: 257 LLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKN 316
           L+KAV+ QPVS+A+      FQ Y  G+  G CGT LDH +  +G+G   DG  YWL+KN
Sbjct: 244 LMKAVANQPVSVAVDGGDMTFQFYSGGVMTGSCGTDLDHGIVAIGYGKDGDGTQYWLLKN 303

Query: 317 SWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPLA 348
           SWG TWG+ G++++ +D     G+CG+    SYP A
Sbjct: 304 SWGTTWGENGFLRMEKDISDKRGMCGLAMEPSYPTA 339


>gi|9502421|gb|AAF88120.1|AC021043_13 Putative cysteine proteinase [Arabidopsis thaliana]
          Length = 331

 Score =  332 bits (852), Expect = 1e-88,   Method: Compositional matrix adjust.
 Identities = 166/328 (50%), Positives = 223/328 (67%), Gaps = 11/328 (3%)

Query: 30  SQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTY 89
           SQ  S  + HE  V E H++WM +  R Y DELEK+MR  +FK+NL++IEK NK+G+RTY
Sbjct: 6   SQATSRVTFHEPIVAEHHQQWMTRFSRVYSDELEKQMRFDVFKKNLKFIEKFNKKGDRTY 65

Query: 90  KLGTNQFSDLTNDEFRALYTGYK----MPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDK 145
           KLG N+F+D T +EF A +TG K    +PS         ++ + N+S      + DWR +
Sbjct: 66  KLGVNEFADWTREEFIATHTGLKGVNGIPSSEFVDEMIPSWNW-NVSDVAGRETKDWRYE 124

Query: 146 GAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSRE 205
           GAVTP+K Q +CGCCWAF++VAAVEG+TKI   NL+ LSEQQLLDC    +NGC GG   
Sbjct: 125 GAVTPVKYQGQCGCCWAFSSVAAVEGLTKIVGNNLVSLSEQQLLDCDRERDNGCNGGIMS 184

Query: 206 KAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVSMQP 265
            AF+YII+N+GIA+E  YPYQA  GTC    KP+A  I  ++ VPS +E+ALL+AVS QP
Sbjct: 185 DAFSYIIKNRGIASEASYPYQAAEGTCRYNGKPSAW-IRGFQTVPSNNERALLEAVSKQP 243

Query: 266 VSIAIAAYSTEFQSYKEGIFN-GVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGD 324
           VS++I A    F  Y  G+++   CGT ++HAVT VG+GT+ +G  YWL KNSWG TWG+
Sbjct: 244 VSVSIDADGPGFMHYSGGVYDEPYCGTNVNHAVTFVGYGTSPEGIKYWLAKNSWGETWGE 303

Query: 325 AGYMKIVRD----EGLCGIGTRSSYPLA 348
            GY++I RD    +G+CG+   + YP+A
Sbjct: 304 NGYIRIRRDVAWPQGMCGVAQYAFYPVA 331


>gi|38345008|emb|CAD40026.2| OSJNBa0052O21.11 [Oryza sativa Japonica Group]
 gi|125589414|gb|EAZ29764.1| hypothetical protein OsJ_13822 [Oryza sativa Japonica Group]
          Length = 339

 Score =  332 bits (851), Expect = 1e-88,   Method: Compositional matrix adjust.
 Identities = 153/336 (45%), Positives = 219/336 (65%), Gaps = 9/336 (2%)

Query: 18  MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
           +F I+  L  C++ + +   + + ++   HE+WMAQ+GR Y+D+ EK  R ++FK N+ +
Sbjct: 8   LFAILGCLCLCSAVLAARELSDDAAMAARHERWMAQYGRVYRDDAEKARRFEVFKANVAF 67

Query: 78  IEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVP 137
           IE  N  GN  + LG NQF+DLTNDEFR + T       + R  T   F+Y+N+++  +P
Sbjct: 68  IESFNA-GNHNFWLGVNQFADLTNDEFRWMKTNKGFIPSTTRVPTG--FRYENVNIDALP 124

Query: 138 TSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-N 196
            ++DWR KGAVTPIK+Q +CGCCWAF+AVAA+EGI K+ +G LI LSEQ+L+DC  +G +
Sbjct: 125 ATVDWRTKGAVTPIKDQGQCGCCWAFSAVAAMEGIVKLSTGKLISLSEQELVDCDVHGED 184

Query: 197 NGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQA 256
            GC GG  + AF +II+N G+ TE  YPY A    C +    + A I  YE+VP+ +E A
Sbjct: 185 QGCEGGLMDDAFKFIIKNGGLTTESNYPYAAADDKCKSVSN-SVASIKGYEDVPANNEAA 243

Query: 257 LLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKN 316
           L+KAV+ QPVS+A+      FQ YK G+  G CGT LDH +  +G+G   DG  YWL+KN
Sbjct: 244 LMKAVANQPVSVAVDGGDMTFQFYKGGVMTGSCGTDLDHGIVAIGYGKASDGTKYWLLKN 303

Query: 317 SWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPLA 348
           SWG TWG+ G++++ +D     G+CG+    SYP A
Sbjct: 304 SWGTTWGENGFLRMEKDISDKRGMCGLAMEPSYPTA 339


>gi|356515050|ref|XP_003526214.1| PREDICTED: vignain-like [Glycine max]
          Length = 344

 Score =  332 bits (851), Expect = 2e-88,   Method: Compositional matrix adjust.
 Identities = 168/339 (49%), Positives = 230/339 (67%), Gaps = 12/339 (3%)

Query: 18  MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
           M  +   L    SQV+  R  H+ ++ E HE WMA++G+ YKD  EKE R +IFK+N+E+
Sbjct: 10  MLALFLFLAVGISQVMP-RKLHQTALRERHENWMAEYGKIYKDAAEKEKRFQIFKDNVEF 68

Query: 78  IEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTS-STFKYQNLSMTDV 136
           IE  N  GN+ YKLG N  +DLT +EF+    G K       +T   + FKY+N+  TD+
Sbjct: 69  IESFNAAGNKPYKLGVNHLADLTLEEFKDSRNGLKRTYEFSTTTFKLNGFKYENV--TDI 126

Query: 137 PTSLDWRDKGAVTPIKNQ-KECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG 195
           P ++DWR KGAVTPIK+Q  +CG CWAF+ VAA EGI +I +G L+ LSEQ+L+DC +  
Sbjct: 127 PEAIDWRVKGAVTPIKDQGDQCGSCWAFSTVAATEGIYQISTGMLMSLSEQELVDCDSV- 185

Query: 196 NNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDE 254
           ++GC GG  E  F +II+N GI++E  YPY AV GTC A+++ + AA+I  YE VP+  E
Sbjct: 186 DHGCDGGLMEDGFEFIIKNGGISSEANYPYTAVDGTCDASKEASPAAQIKGYETVPANSE 245

Query: 255 QALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGAN-YWL 313
           +AL +AV+ QPVS++I A  + FQ Y  G+F G CGTQLDH VT+VG+GTT+DG + YW+
Sbjct: 246 EALQQAVANQPVSVSIDAGGSGFQFYSSGVFTGQCGTQLDHGVTVVGYGTTDDGTHEYWI 305

Query: 314 IKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPLA 348
           +KNSWG  WG+ GY+++ R     EGLCGI   +SYP A
Sbjct: 306 VKNSWGTQWGEEGYIRMQRGIDALEGLCGIAMDASYPTA 344


>gi|357474725|ref|XP_003607647.1| Cysteine proteinase [Medicago truncatula]
 gi|355508702|gb|AES89844.1| Cysteine proteinase [Medicago truncatula]
          Length = 340

 Score =  332 bits (851), Expect = 2e-88,   Method: Compositional matrix adjust.
 Identities = 162/322 (50%), Positives = 219/322 (68%), Gaps = 12/322 (3%)

Query: 33  VSSRSTHEQ-SVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKL 91
           V SR  +E  S+ E HE+WM+++G+ YKD +EKE R  IFK+N+E+IE  N   N+ YKL
Sbjct: 25  VMSRKLYESPSLQERHEQWMSEYGKLYKDAIEKEKRFMIFKDNVEFIESFNAADNKPYKL 84

Query: 92  GTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPI 151
             N  +DLT DEF+A   GYK      R   +++FKY+N+  T +P ++DWR KGAVTPI
Sbjct: 85  SVNHLADLTLDEFKASRNGYK---KIDREFATTSFKYENV--TAIPEAVDWRVKGAVTPI 139

Query: 152 KNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-NNGCLGGSREKAFAY 210
           K+Q +CG CWAF+ VAA+EGI +I +G LI LSEQ+L+DC T G + GC GG  E  F +
Sbjct: 140 KDQGQCGSCWAFSTVAAIEGINQITTGKLISLSEQELVDCDTKGEDQGCEGGLMEDGFEF 199

Query: 211 IIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAI 270
           II+N GI +E  YPY+A  G+C+ A     AKI+ YE+VP   E +LLKAV+ QP+S++I
Sbjct: 200 IIKNGGITSETNYPYKAADGSCNTATTAPVAKITGYEKVPVNSEISLLKAVANQPISVSI 259

Query: 271 AAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKI 330
            A  + F  Y  GI+ G CGT+LDH VT VG+G+  +G +YW++KNSWG  WG+ GY+++
Sbjct: 260 DASDSSFMFYSSGIYTGECGTELDHGVTAVGYGSA-NGTDYWIVKNSWGTVWGEKGYIRM 318

Query: 331 VR----DEGLCGIGTRSSYPLA 348
            R     EGLCGI   SSYP A
Sbjct: 319 QRGIADKEGLCGIAMDSSYPTA 340


>gi|224162986|ref|XP_002338508.1| predicted protein [Populus trichocarpa]
 gi|222872535|gb|EEF09666.1| predicted protein [Populus trichocarpa]
          Length = 306

 Score =  332 bits (851), Expect = 2e-88,   Method: Compositional matrix adjust.
 Identities = 161/314 (51%), Positives = 221/314 (70%), Gaps = 16/314 (5%)

Query: 43  VVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTND 102
           + E HE+WM Q+GR YKD+ E+  R  IFKEN+  I+  N +  ++YKLG NQF+DLTN+
Sbjct: 1   MYERHEQWMTQYGRVYKDDNERATRYSIFKENVARIDAFNSQTGKSYKLGVNQFADLTNE 60

Query: 103 EFRALYTGYK--MPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCC 160
           EF+A    +K  M SP      +  F+Y+N+S   VP+++DWR +GAVTP+K+Q +CGCC
Sbjct: 61  EFKASRNRFKGHMCSPQ-----AGPFRYENVSA--VPSTVDWRKEGAVTPVKDQGQCGCC 113

Query: 161 WAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-NNGCLGGSREKAFAYIIQNQGIAT 219
           WAF+AVAA+EGI K+ +G LI LSEQ+++DC T G + GC GG  + AF +I QN+G+ T
Sbjct: 114 WAFSAVAAMEGINKLTTGKLISLSEQEVVDCDTKGEDQGCNGGLMDDAFKFIEQNKGLTT 173

Query: 220 EDEYPYQAVPGTCSAAQKP-AAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQ 278
           E  YPY+   GTC+  +    AAKI+ +E+VP+  E AL+KAV+ QPVS+AI A  ++FQ
Sbjct: 174 EANYPYKGTDGTCNTKKSAIHAAKITGFEDVPANSEAALMKAVAKQPVSVAIDAGGSDFQ 233

Query: 279 SYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----E 334
            Y  GIF G C TQLDH VT VG+G + DG+ YWL+KNSWG  WG+ GY+++ +D    E
Sbjct: 234 FYSSGIFTGSCDTQLDHGVTAVGYGVS-DGSKYWLVKNSWGAQWGEEGYIRMQKDISAKE 292

Query: 335 GLCGIGTRSSYPLA 348
           GLCGI  ++SYP A
Sbjct: 293 GLCGIAMQASYPTA 306


>gi|357452869|ref|XP_003596711.1| Cysteine proteinase [Medicago truncatula]
 gi|355485759|gb|AES66962.1| Cysteine proteinase [Medicago truncatula]
          Length = 344

 Score =  332 bits (850), Expect = 2e-88,   Method: Compositional matrix adjust.
 Identities = 171/345 (49%), Positives = 221/345 (64%), Gaps = 21/345 (6%)

Query: 14  NTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKE 73
           N   +F I+TL  S    V+SSR      ++E HE+WM +HG+ YKD  EKE R +IFKE
Sbjct: 11  NILTLFFILTLWTSL---VISSR------LLEKHEQWMEEHGKFYKDAAEKEQRFQIFKE 61

Query: 74  NLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALY-TGYKMPSPS---HRSTTSSTFKYQ 129
           NLE+IE  N  G+  + L  NQF D TNDEF+A Y  G K P            S F+Y+
Sbjct: 62  NLEFIESFNAAGDNGFNLSINQFGDQTNDEFKANYLNGKKKPLIGVGIAAIEEESVFRYE 121

Query: 130 NLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLL 189
           N+  T+VP ++DWR++GAVTPIK+Q  CG CWAFA VAA+EGI +I +G L+ LSEQ+L+
Sbjct: 122 NV--TEVPATMDWRERGAVTPIKHQHLCGSCWAFATVAAIEGIHQITTGRLVSLSEQELV 179

Query: 190 DC-STNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKP-AAAKISNYE 247
           DC  TN  +GC GG  E A  +I++  GI +E  YPY  V G C+  +     AKI  YE
Sbjct: 180 DCVKTNTTDGCNGGYVEDACDFIVKKGGITSETNYPYTRVDGKCNVRKGTYNVAKIKGYE 239

Query: 248 EVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTED 307
            VP+ +E+ALLKAV+ QP+++ IAA    FQ Y  GI  G CG  LDH VTIVG+GT++D
Sbjct: 240 HVPANNEKALLKAVANQPIAVYIAATKRAFQFYSSGILKGKCGIDLDHTVTIVGYGTSDD 299

Query: 308 GANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPLA 348
           G  YWL+KNSWG  WG+ GY+KI RD    EG CGI    +YP+ 
Sbjct: 300 GVKYWLVKNSWGTKWGEKGYIKIKRDVHAKEGSCGIAMVPTYPIV 344


>gi|116309178|emb|CAH66275.1| OSIGBa0147O06.5 [Oryza sativa Indica Group]
          Length = 339

 Score =  332 bits (850), Expect = 2e-88,   Method: Compositional matrix adjust.
 Identities = 153/336 (45%), Positives = 218/336 (64%), Gaps = 9/336 (2%)

Query: 18  MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
           +F I+  L  C++ + +   + + ++   HE+WMAQ+GR YKD+ EK  R ++FK N+ +
Sbjct: 8   LFAILGCLCLCSAVLAARELSDDAAMAARHERWMAQYGRMYKDDAEKARRFEVFKANVAF 67

Query: 78  IEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVP 137
           IE  N  GN  + LG NQF+DLTNDEFR+  T       + R  T   F+Y+N+++  +P
Sbjct: 68  IESFNA-GNHKFWLGVNQFADLTNDEFRSTKTNKGFIPSTTRVPTG--FRYENVNIDALP 124

Query: 138 TSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-N 196
            ++DWR KG VTPIK+Q +CGCCWAF+AVAA+EGI K+ +G LI LSEQ+L+DC  +G +
Sbjct: 125 ATMDWRTKGVVTPIKDQGQCGCCWAFSAVAAMEGIVKLSTGKLISLSEQELVDCDVHGED 184

Query: 197 NGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQA 256
            GC GG  + AF +II+N G+ TE  YPY A    C +    + A I  YE+VP+ +E A
Sbjct: 185 QGCEGGLMDDAFKFIIKNGGLTTESNYPYAAADDKCKSVSN-SVASIKGYEDVPANNEAA 243

Query: 257 LLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKN 316
           L+KAV+ QPVS+A+      FQ YK G+  G CGT LDH +  +G+G   DG  YWL+KN
Sbjct: 244 LMKAVANQPVSVAVDGGDMTFQFYKGGVMTGSCGTDLDHGIVAIGYGKASDGTKYWLLKN 303

Query: 317 SWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPLA 348
           SWG TWG+ G++++ +D     G+CG+    SYP A
Sbjct: 304 SWGTTWGENGFLRMEKDISDKRGMCGLAMEPSYPTA 339


>gi|18396939|ref|NP_564320.1| Papain family cysteine protease [Arabidopsis thaliana]
 gi|9502427|gb|AAF88126.1|AC021043_19 Putative cysteine proteinase [Arabidopsis thaliana]
 gi|67633400|gb|AAY78625.1| peptidase C1A papain family protein [Arabidopsis thaliana]
 gi|332192919|gb|AEE31040.1| Papain family cysteine protease [Arabidopsis thaliana]
          Length = 346

 Score =  332 bits (850), Expect = 2e-88,   Method: Compositional matrix adjust.
 Identities = 161/314 (51%), Positives = 222/314 (70%), Gaps = 8/314 (2%)

Query: 42  SVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTN 101
           S+V+ H++WM Q  R Y DE EK++RL++  ENL++IE  N  GN++YKLG N+F+D T 
Sbjct: 34  SIVDYHQQWMIQFSRVYDDEFEKQLRLQVLTENLKFIESFNNMGNQSYKLGVNEFTDWTK 93

Query: 102 DEFRALYTGYK-MPSPSHRSTTSSTFKYQNLSMTDV-PTSLDWRDKGAVTPIKNQKECGC 159
           +EF A YTG + +   S     + T    N +++DV  T+ DWR++GAVTP+K+Q ECG 
Sbjct: 94  EEFLATYTGLRGVNVTSPFEVVNETKPAWNWTVSDVLGTNKDWRNEGAVTPVKSQGECGG 153

Query: 160 CWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIAT 219
           CWAF+A+AAVEG+TKI  GNLI LSEQQLLDC+   NNGC GG+   AF YII+++GI++
Sbjct: 154 CWAFSAIAAVEGLTKIARGNLISLSEQQLLDCTREQNNGCKGGTFVNAFNYIIKHRGISS 213

Query: 220 EDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQS 279
           E+EYPYQ   G C +  +PA   I  +E VPS +E+ALL+AVS QPV++AI A    F  
Sbjct: 214 ENEYPYQVKEGPCRSNARPAIL-IRGFENVPSNNERALLEAVSRQPVAVAIDASEAGFVH 272

Query: 280 YKEGIFNGV-CGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----E 334
           Y  G++N   CGT ++HAVT+VG+GT+ +G  YWL KNSWG TWG+ GY++I RD    +
Sbjct: 273 YSGGVYNARNCGTSVNHAVTLVGYGTSPEGMKYWLAKNSWGKTWGENGYIRIRRDVEWPQ 332

Query: 335 GLCGIGTRSSYPLA 348
           G+CG+   +SYP+A
Sbjct: 333 GMCGVAQYASYPVA 346


>gi|125547236|gb|EAY93058.1| hypothetical protein OsI_14861 [Oryza sativa Indica Group]
          Length = 339

 Score =  331 bits (848), Expect = 4e-88,   Method: Compositional matrix adjust.
 Identities = 153/336 (45%), Positives = 218/336 (64%), Gaps = 9/336 (2%)

Query: 18  MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
           +F I+  L  C++ + +   + + ++   HE+WMAQ+GR Y+D+ EK  R ++FK N+ +
Sbjct: 8   LFAILGCLCLCSAVLAARELSDDAAMAARHERWMAQYGRVYRDDAEKARRFEVFKANVAF 67

Query: 78  IEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVP 137
           IE  N  GN  + LG NQF+DLTNDEFR   T       + R  T   F+Y+N+++  +P
Sbjct: 68  IESFNA-GNHNFWLGVNQFADLTNDEFRWTKTNKGFIPSTTRVPTG--FRYENVNIDALP 124

Query: 138 TSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-N 196
            ++DWR KGAVTPIK+Q +CGCCWAF+AVAA+EGI K+ +G LI LSEQ+L+DC  +G +
Sbjct: 125 ATVDWRTKGAVTPIKDQGQCGCCWAFSAVAAMEGIVKLSTGKLISLSEQELVDCDVHGED 184

Query: 197 NGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQA 256
            GC GG  + AF +II+N G+ TE  YPY A    C +    + A I  YE+VP+ +E A
Sbjct: 185 QGCEGGLMDDAFKFIIKNGGLTTESNYPYAAADDKCKSVSN-SVASIKGYEDVPANNEAA 243

Query: 257 LLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKN 316
           L+KAV+ QPVS+A+      FQ YK G+  G CGT LDH +  +G+G   DG  YWL+KN
Sbjct: 244 LMKAVANQPVSVAVDGGDMTFQFYKGGVMTGSCGTDLDHGIVAIGYGKASDGTKYWLLKN 303

Query: 317 SWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPLA 348
           SWG TWG+ G++++ +D     G+CG+    SYP A
Sbjct: 304 SWGTTWGENGFLRMEKDISDKRGMCGLAMEPSYPTA 339


>gi|21666724|gb|AAM73806.1|AF448505_1 cysteine proteinase [Brassica napus]
 gi|21666726|gb|AAM73807.1|AF448506_1 cysteine proteinase [Brassica napus]
          Length = 343

 Score =  330 bits (847), Expect = 5e-88,   Method: Compositional matrix adjust.
 Identities = 159/335 (47%), Positives = 224/335 (66%), Gaps = 7/335 (2%)

Query: 18  MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
           +F+I++L+ S +     SR   E ++ + H  WM +HGR Y D  EK  R  +FK N+E 
Sbjct: 8   IFLIVSLVSSFSLSTTLSRPLDEVTMQKRHAAWMTEHGRVYADANEKNNRYVVFKRNVES 67

Query: 78  IEKANK-EGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDV 136
           IE+ N+ +   T+KL  NQF+DLTN+EFR++YTGYK  S     T  ++F+YQ++S   +
Sbjct: 68  IERLNEVQYGLTFKLAVNQFADLTNEEFRSMYTGYKGNSVLSSRTKPTSFRYQHVSSDAL 127

Query: 137 PTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGN 196
           P S+DWR KGAVTPIK+Q  CG CWAF+AVAA+EG+ +I+ G LI LSEQ+L+DC TN +
Sbjct: 128 PISVDWRKKGAVTPIKDQGSCGSCWAFSAVAAIEGVAQIKKGKLISLSEQELVDCDTN-D 186

Query: 197 NGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQ-KPAAAKISNYEEVPSGDEQ 255
           +GC+GG    AF Y +   G+ +E  YPY++  GTC+  + K  A  I  +E+VP+ DE+
Sbjct: 187 DGCMGGYMNSAFNYTMTTGGLTSESNYPYKSTDGTCNINKTKQIATSIKGFEDVPANDEK 246

Query: 256 ALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIK 315
           AL+KAV+  PVSI IA   T FQ Y  G+F+G C T LDH V +VG+G + +G+ YW++K
Sbjct: 247 ALMKAVAHHPVSIGIAGGGTGFQFYSSGVFSGECSTHLDHGVAVVGYGKSSNGSKYWILK 306

Query: 316 NSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYP 346
           NSWG  WG+ GYM+I +D     G CG+   +SYP
Sbjct: 307 NSWGPKWGERGYMRIKKDTKAKHGQCGLAMNASYP 341


>gi|357160599|ref|XP_003578815.1| PREDICTED: vignain-like [Brachypodium distachyon]
          Length = 339

 Score =  328 bits (842), Expect = 2e-87,   Method: Compositional matrix adjust.
 Identities = 155/341 (45%), Positives = 219/341 (64%), Gaps = 9/341 (2%)

Query: 13  INTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFK 72
           I    +  I+  L  C+S + +     + S+   HE WMAQ+GR YKD  EK  + ++FK
Sbjct: 3   IPKASILAILGCLCFCSSVLAARELNDDLSMAARHETWMAQYGRVYKDAAEKAQKFEVFK 62

Query: 73  ENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLS 132
            N  +I+  N E N  + LG NQF+DLTN+EF+A  T     S  +++  S+ FKY+NL 
Sbjct: 63  ANARFIDSFNAE-NHKFWLGINQFADLTNEEFKATKTNKGFIS--NKARVSTGFKYENLK 119

Query: 133 MTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS 192
           +  +PTS+DWR KGAVTP+K+Q +CGCCWAF+AVAA EGI K+ +G L+ LSEQ+L+DC 
Sbjct: 120 IEALPTSIDWRTKGAVTPVKDQGQCGCCWAFSAVAATEGIVKLSTGKLVSLSEQELVDCD 179

Query: 193 TNG-NNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPS 251
            +G + GC GG  + AF +II N G+  E  YPY A  G C +  K +A  I +YE+VP+
Sbjct: 180 VHGEDQGCEGGLMDDAFKFIITNGGLTQESSYPYDAEDGKCKSGSK-SAGTIKSYEDVPA 238

Query: 252 GDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANY 311
            +E AL+KAV+ QPVS+A+      FQ Y  G+  G CGT LDH +  +G+G T DG  +
Sbjct: 239 NNEGALMKAVANQPVSVAVDGGDMTFQFYSGGVMTGSCGTDLDHGIAAIGYGVTSDGTKF 298

Query: 312 WLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPLA 348
           WL+KNSWG TWG+ G++++ +D    +G+CG+    SYP A
Sbjct: 299 WLMKNSWGTTWGENGFLRMEKDIADKKGMCGLAMEPSYPTA 339


>gi|356521444|ref|XP_003529366.1| PREDICTED: thiol protease SEN102-like [Glycine max]
          Length = 340

 Score =  328 bits (841), Expect = 2e-87,   Method: Compositional matrix adjust.
 Identities = 166/335 (49%), Positives = 221/335 (65%), Gaps = 14/335 (4%)

Query: 19  FIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYI 78
           F ++ L   C +   SSR+  E S+   HE+WMA H R Y D  EK+ R +IFKENLE+I
Sbjct: 13  FFMLFLTCICRA---SSRTLSESSIATQHEEWMAMHDRVYADSAEKDRRQQIFKENLEFI 69

Query: 79  EKANKEGNRTYKLGTNQFSDLTNDEFRALYTG--YKMPSPSHRSTTSSTFKYQNLSMTDV 136
           EK N EG + Y L  N F+DLTN+EF A +TG  YK P+       + +  +  +S+ D+
Sbjct: 70  EKHNNEGKKRYNLSLNSFADLTNEEFVASHTGALYKPPTQLGSFKINHSLGFHKMSVGDI 129

Query: 137 PTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGN 196
             SLDWR +GAV  IKNQ  CG CWAF+AVAAVEGI +I++G L+ LSEQ L+DC++  N
Sbjct: 130 EASLDWRKRGAVNDIKNQGRCGSCWAFSAVAAVEGINQIKNGQLVSLSEQNLVDCAS--N 187

Query: 197 NGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQA 256
           +GC G   EKAF Y I++ G+A E+EYPY    GTCS    P A +I  Y+ V   +E+ 
Sbjct: 188 DGCHGQYVEKAFDY-IRDYGLANEEEYPYVETVGTCSGNSNP-AIQIRGYQSVTPQNEEQ 245

Query: 257 LLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKN 316
           LL AV+ QPVS+ + A    FQ Y  G+F+G CGT+L+HAVTIVG+G   +G  YWLI+N
Sbjct: 246 LLTAVASQPVSVLLEAKGQGFQFYSGGVFSGECGTELNHAVTIVGYGEEAEG-KYWLIRN 304

Query: 317 SWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPL 347
           SWG +WG+ GYMK++RD    +GLCGI  ++SYP 
Sbjct: 305 SWGKSWGEGGYMKLMRDTGNPQGLCGINMQASYPF 339


>gi|356517308|ref|XP_003527330.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like [Glycine
           max]
          Length = 342

 Score =  328 bits (841), Expect = 3e-87,   Method: Compositional matrix adjust.
 Identities = 158/336 (47%), Positives = 222/336 (66%), Gaps = 8/336 (2%)

Query: 19  FIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYI 78
           ++I+ L+++  +  V SR   E    E HEKWMAQ+GR YKD  EKE R ++FK N+ +I
Sbjct: 9   YLILFLVLAVWTSHVMSRRLSEACTSERHEKWMAQYGRVYKDAAEKEKRFQVFKNNVHFI 68

Query: 79  EKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPT 138
           E  N  G++ + L  NQF+DL ++EF+AL    +  +    ++T ++F+Y+  S+T +P 
Sbjct: 69  ESFNAAGDKPFNLSINQFADLNDEEFKALLINVQKKASWVETSTETSFRYE--SVTKIPA 126

Query: 139 SLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNG 198
           ++DWR +GAVTPIK+Q  CG CWAF+AVAA EGI +I +G L+ LSEQ+L+DC    + G
Sbjct: 127 TIDWRKRGAVTPIKDQGRCGSCWAFSAVAATEGIHQITTGKLVPLSEQELVDCVKGESEG 186

Query: 199 CLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKP-AAAKISNYEEVPSGDEQAL 257
           C+GG  + AF +I +  GIA+E  YPY+ V  TC   ++    A+I  YE+VPS +E+AL
Sbjct: 187 CIGGYVDDAFEFIAKKGGIASETHYPYKGVNKTCKVKKETHGVAEIKGYEKVPSNNEKAL 246

Query: 258 LKAVSMQPVSIAIAAYSTEFQSYKEGIFNGV-CGTQLDHAVTIVGFGTTEDGANYWLIKN 316
           LKAV+ QPVS+ I A +  F+ Y  GIFN   CGT  +HAV +VG+G   DG+ YWL+KN
Sbjct: 247 LKAVANQPVSVYIDAGTHAFKYYSSGIFNARNCGTDPNHAVAVVGYGKALDGSKYWLVKN 306

Query: 317 SWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPLA 348
           SWG  WG+ GY++I RD    EGLCGI     YP A
Sbjct: 307 SWGTEWGERGYIRIKRDIRAKEGLCGIAKYPYYPTA 342


>gi|356515056|ref|XP_003526217.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like [Glycine
           max]
          Length = 342

 Score =  328 bits (841), Expect = 3e-87,   Method: Compositional matrix adjust.
 Identities = 159/336 (47%), Positives = 222/336 (66%), Gaps = 8/336 (2%)

Query: 19  FIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYI 78
           ++I+ L++S  +  V SR   E    E HEKWMAQ+GR YKD  EKE R ++FK N+ +I
Sbjct: 9   YLILFLVLSVWTSHVMSRRLSEACTSERHEKWMAQYGRVYKDAAEKEKRFQVFKNNVHFI 68

Query: 79  EKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPT 138
           E  N  G++ + L  NQF+DL ++EF+AL    +  +    ++T ++F+Y+  S+T +P 
Sbjct: 69  ESFNAAGDKPFNLSINQFADLNDEEFKALLINVQKKASWVETSTQTSFRYE--SVTKIPA 126

Query: 139 SLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNG 198
           ++DWR +GAVTPIK+Q  CG CWAF+AVAA EGI +I +G L+ LSEQ+L+DC    + G
Sbjct: 127 TIDWRKRGAVTPIKDQGRCGSCWAFSAVAATEGIHQITTGKLVPLSEQELVDCVKGESEG 186

Query: 199 CLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKP-AAAKISNYEEVPSGDEQAL 257
           C+GG  + AF +I +  GIA+E  YPY+ V  TC   ++    A+I  YE+VPS +E+AL
Sbjct: 187 CIGGYVDDAFEFIAKKGGIASETHYPYKGVNKTCKVKKETHGVAEIKGYEKVPSNNEKAL 246

Query: 258 LKAVSMQPVSIAIAAYSTEFQSYKEGIFN-GVCGTQLDHAVTIVGFGTTEDGANYWLIKN 316
           LKAV+ QPVS+ I A +  F+ Y  GIFN   CGT  +HAV +VG+G   DG+ YWL+KN
Sbjct: 247 LKAVANQPVSVYIDAGTHAFKYYSSGIFNVRNCGTDPNHAVAVVGYGKALDGSKYWLVKN 306

Query: 317 SWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPLA 348
           SWG  WG+ GY++I RD    EGLCGI     YP A
Sbjct: 307 SWGTEWGERGYIRIKRDIRAKEGLCGIAKYPYYPTA 342


>gi|297851332|ref|XP_002893547.1| peptidase C1A papain family protein [Arabidopsis lyrata subsp.
           lyrata]
 gi|297339389|gb|EFH69806.1| peptidase C1A papain family protein [Arabidopsis lyrata subsp.
           lyrata]
          Length = 345

 Score =  328 bits (840), Expect = 3e-87,   Method: Compositional matrix adjust.
 Identities = 161/326 (49%), Positives = 223/326 (68%), Gaps = 8/326 (2%)

Query: 30  SQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTY 89
           S+  S  + HE ++   H+KWM    R Y DE EK+MRL++F ENL++IE  N  G+++Y
Sbjct: 21  SEATSRVALHEPTIFYYHQKWMINFSRVYDDEFEKQMRLEVFTENLKFIENFNNMGSQSY 80

Query: 90  KLGTNQFSDLTNDEFRALYTGYK-MPSPSHRSTTSSTFKYQNLSMTDV-PTSLDWRDKGA 147
           KLG N+F+D T +EF A +TG   +   S     + T    N +++DV  T+ DWR++GA
Sbjct: 81  KLGVNKFTDWTKEEFLATHTGLSGINVTSPFEVVNETTPAWNWTVSDVLGTTKDWRNEGA 140

Query: 148 VTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKA 207
           VTP+K Q ECG CWAF+A+AAVEG+TKI  GNLI LSEQQLLDC+   NNGC GG+  +A
Sbjct: 141 VTPVKYQGECGGCWAFSAIAAVEGLTKIARGNLISLSEQQLLDCAREQNNGCKGGTMIEA 200

Query: 208 FAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVSMQPVS 267
           F YI++N G+++E+ YPYQ   G C +   PA   I  +E VPS +E+ALL+AVS QPV+
Sbjct: 201 FNYIVKNGGVSSENAYPYQVKEGPCRSNDIPAIV-IRGFENVPSNNERALLEAVSRQPVA 259

Query: 268 IAIAAYSTEFQSYKEGIFNGV-CGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAG 326
           + I A  T F  Y  G++N   CGT ++HAVT+VG+GT+++G  YWL KNSWG TWG+ G
Sbjct: 260 VDIDASETGFIHYSGGVYNARDCGTSVNHAVTLVGYGTSQEGIKYWLAKNSWGKTWGENG 319

Query: 327 YMKIVRD----EGLCGIGTRSSYPLA 348
           Y++I RD    +G+CG+   +SYP+A
Sbjct: 320 YIRIRRDVEWPQGMCGVAQYASYPVA 345


>gi|449448298|ref|XP_004141903.1| PREDICTED: germination-specific cysteine protease 1-like [Cucumis
           sativus]
 gi|449531757|ref|XP_004172852.1| PREDICTED: germination-specific cysteine protease 1-like [Cucumis
           sativus]
          Length = 365

 Score =  327 bits (839), Expect = 4e-87,   Method: Compositional matrix adjust.
 Identities = 160/341 (46%), Positives = 227/341 (66%), Gaps = 13/341 (3%)

Query: 15  TTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKEN 74
           TT + ++    +S ++  +S RS  E  V EI++ W+A+HG++Y    E+E R +IFKEN
Sbjct: 5   TTSLALLSFFFLSISASALSRRSDGE--VREIYDLWLAKHGKAYNGIDEREKRFQIFKEN 62

Query: 75  LEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTF--KYQNLS 132
           L++I+  N E NRTYK+G N F+DLTN+E+RALY G + P P+ R   + T   +Y   +
Sbjct: 63  LKFIDDHNSE-NRTYKVGLNMFADLTNEEYRALYLGTRSP-PARRVMKAKTASRRYAVNN 120

Query: 133 MTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS 192
           +  +P S+DWR +GAV P+KNQ  CG CWAF+ +AAVEGI +I +G LI LSEQ+L+ C 
Sbjct: 121 LDRLPESMDWRTRGAVAPVKNQGSCGSCWAFSTIAAVEGINQIVTGELISLSEQELVSCD 180

Query: 193 TNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPS 251
              N+GC GG  + AF +II N G+ TE++YPY+A  G C   +K A    I  YE+VP+
Sbjct: 181 KKYNSGCNGGLMDYAFQFIIDNGGLDTEEDYPYEAFDGQCDPTRKNAKVVSIDAYEDVPA 240

Query: 252 GDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANY 311
            DE++L KAV+ QPVS+AI A     Q Y+ G+F G CG+ LDH V  VG+G  E+G +Y
Sbjct: 241 NDEESLKKAVAHQPVSVAIEASGLALQLYQSGVFTGKCGSALDHGVVAVGYG-KENGVDY 299

Query: 312 WLIKNSWGNTWGDAGYMKIVRD-----EGLCGIGTRSSYPL 347
           WL++NSWG +WG+ GY K+ R+     EG CGI  ++SYP+
Sbjct: 300 WLVRNSWGTSWGEDGYFKLERNVKHITEGKCGIAMQASYPV 340


>gi|212275830|ref|NP_001130503.1| cysteine protease 1 [Zea mays]
 gi|194689328|gb|ACF78748.1| unknown [Zea mays]
 gi|219886279|gb|ACL53514.1| unknown [Zea mays]
 gi|238010470|gb|ACR36270.1| unknown [Zea mays]
 gi|413920875|gb|AFW60807.1| cysteine protease 1 [Zea mays]
          Length = 354

 Score =  327 bits (839), Expect = 4e-87,   Method: Compositional matrix adjust.
 Identities = 166/342 (48%), Positives = 232/342 (67%), Gaps = 17/342 (4%)

Query: 18  MFIIITLLVSCASQVVSSRSTH---EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKEN 74
           + I+    +   ++ +SS ST    E+++   H++WMA+HGR+Y+DE EK  R ++FK N
Sbjct: 19  LTILAVTTMMAEARDLSSTSTGGYGEEAMKVRHQQWMAEHGRTYRDEAEKAHRFQVFKAN 78

Query: 75  LEYIEKANKEGN--RTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLS 132
            ++++ +N  G+  ++Y+L  N+F+D+TNDEF A+YTG + P P+     +  FKY N++
Sbjct: 79  ADFVDASNAAGDDKKSYRLELNEFADMTNDEFMAMYTGLR-PVPAGAKKMAG-FKYGNVT 136

Query: 133 MTDVPT---SLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLL 189
           ++D      ++DWR KGAVT IKNQ +CGCCWAFAAVAAVEGI +I +GNL+ LSEQQ+L
Sbjct: 137 LSDADDDQQTVDWRQKGAVTGIKNQGQCGCCWAFAAVAAVEGIHQITTGNLVSLSEQQVL 196

Query: 190 DCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEV 249
           DC T+GNNGC GG  + AF YI+ N G+ TED YPY A    C + Q  AA  IS Y++V
Sbjct: 197 DCDTDGNNGCNGGYIDNAFQYIVGNGGLGTEDAYPYTAAQAMCQSVQPVAA--ISGYQDV 254

Query: 250 PSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGV-CGT--QLDHAVTIVGFGTTE 306
           PSGDE AL  AV+ QPVS+AI A++  FQ Y  G+     C T   L+HAVT VG+GT E
Sbjct: 255 PSGDEAALAAAVANQPVSVAIDAHN--FQLYGGGVMTAASCSTPPNLNHAVTAVGYGTAE 312

Query: 307 DGANYWLIKNSWGNTWGDAGYMKIVRDEGLCGIGTRSSYPLA 348
           DG  YWL+KN WG  WG+ GY+++ R    CG+  ++SYP+A
Sbjct: 313 DGTPYWLLKNQWGQNWGEGGYLRLERGANACGVAQQASYPVA 354


>gi|5823020|gb|AAD53012.1|AF089849_1 senescence-specific cysteine protease [Brassica napus]
          Length = 344

 Score =  327 bits (837), Expect = 6e-87,   Method: Compositional matrix adjust.
 Identities = 160/336 (47%), Positives = 224/336 (66%), Gaps = 8/336 (2%)

Query: 18  MFIIITLLVSCASQVVSSRST-HEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLE 76
           +F+I++L+ S +  +  SR    E ++ + H +WM +HGR Y D  EK  R  +FK N+E
Sbjct: 8   IFLIVSLVSSFSLSITLSRPLLDEVAMQKRHAEWMTEHGRVYADANEKNNRYAVFKRNVE 67

Query: 77  YIEKANK-EGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTD 135
            IE+ N  +   T+KL  NQF+DLTN+EFR++YTG+K  S     T  ++F+YQN+S   
Sbjct: 68  RIERLNDVQSGLTFKLAVNQFADLTNEEFRSMYTGFKGNSVLSSRTKPTSFRYQNVSSDA 127

Query: 136 VPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG 195
           +P S+DWR KGAVTPIK+Q  CG CWAF+AVAA+EG+ +I+ G LI LSEQ+L+DC TN 
Sbjct: 128 LPVSVDWRKKGAVTPIKDQGLCGSCWAFSAVAAIEGVAQIKKGKLISLSEQELVDCDTN- 186

Query: 196 NNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQ-KPAAAKISNYEEVPSGDE 254
           + GC+GG  + AF Y I   G+ +E  YPY++  GTC+  + K  A  I  +E+VP+ DE
Sbjct: 187 DGGCMGGLMDTAFNYTITIGGLTSESNYPYKSTNGTCNFNKTKQIATSIKGFEDVPANDE 246

Query: 255 QALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLI 314
           +AL+KAV+  PVSI IA     FQ Y  G+F+G C T LDH VT VG+G +++G  YW++
Sbjct: 247 KALMKAVAHHPVSIGIAGGDIGFQFYSSGVFSGECTTHLDHGVTAVGYGRSKNGLKYWIL 306

Query: 315 KNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYP 346
           KNSWG  WG+ GYM+I +D     G CG+   +SYP
Sbjct: 307 KNSWGPKWGERGYMRIKKDIKPKHGQCGLAMNASYP 342


>gi|310656790|gb|ADP02219.1| Peptidase_C1 domain-containing protein [Triticum aestivum]
          Length = 419

 Score =  327 bits (837), Expect = 6e-87,   Method: Compositional matrix adjust.
 Identities = 153/318 (48%), Positives = 209/318 (65%), Gaps = 5/318 (1%)

Query: 18  MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
           +  II  +  C+S V+S+R   + ++VE HE+WMA+  R YKD  EK  R K FK N+ +
Sbjct: 8   LLAIIGSICLCSSTVLSARELGDAAMVEKHEQWMAKFNRVYKDSTEKAQRFKAFKANVAF 67

Query: 78  IEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVP 137
           IE  N  GN  + LG NQF+DLTNDEFRA  T   +     R+ T   FKY N+S   +P
Sbjct: 68  IESFNT-GNHKFWLGVNQFTDLTNDEFRATKTNKGLKRNGARAPTR--FKYNNVSTDALP 124

Query: 138 TSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-N 196
            ++DWR KG VTPIK+Q +CGCCWAF+AVAA EGI K+ +G L+ LSEQ+L+DC  +G +
Sbjct: 125 AAVDWRTKGVVTPIKDQGQCGCCWAFSAVAATEGIVKLSTGKLVSLSEQELVDCDVHGVD 184

Query: 197 NGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTC-SAAQKPAAAKISNYEEVPSGDEQ 255
            GC GG  + AF +II+N G+ TE  YPY A  G C ++    + A I  YE+VP+ DE 
Sbjct: 185 QGCEGGEMDNAFKFIIKNGGLTTEANYPYTAQDGQCKTSTTSNSVATIKGYEDVPANDES 244

Query: 256 ALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIK 315
           +L+KAV+ QPVS+A+      FQ Y  G+  G CGT LDH +  +G+G T DG  +WL+K
Sbjct: 245 SLMKAVANQPVSVAVDGGDVIFQHYSGGVMTGSCGTDLDHGIVAIGYGMTSDGTKFWLLK 304

Query: 316 NSWGNTWGDAGYMKIVRD 333
           NSWG TWG++GY+++ +D
Sbjct: 305 NSWGTTWGESGYLRMEKD 322


>gi|357167196|ref|XP_003581047.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like
           [Brachypodium distachyon]
          Length = 338

 Score =  327 bits (837), Expect = 7e-87,   Method: Compositional matrix adjust.
 Identities = 153/338 (45%), Positives = 219/338 (64%), Gaps = 10/338 (2%)

Query: 19  FIIITLLVSCASQVVSSRSTHEQS--VVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLE 76
           F+   ++ + A   + +R   +    +   HE+WMA++GR Y D  EK  RL++FK N+ 
Sbjct: 3   FLFALVVCTFALGALGARDLADDDWLIAARHEQWMARYGRVYSDVAEKARRLEVFKANVG 62

Query: 77  YIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDV 136
           +IE  N  GN  + L  NQF+D+T DEFRA++ GYKM     ++  +  F+Y N+S+ D+
Sbjct: 63  FIESVNA-GNHKFWLEANQFADITKDEFRAMHKGYKMQVIGSKARATG-FRYANVSIDDL 120

Query: 137 PTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN-G 195
           P S+DWR  GAVTP+K+Q +CGCCWAF+ VA++EGI K+ +G LI LSEQ+L+DC     
Sbjct: 121 PASVDWRANGAVTPVKDQGQCGCCWAFSTVASMEGIVKVSTGKLISLSEQELVDCDVGMQ 180

Query: 196 NNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKP-AAAKISNYEEVPSGDE 254
           N GC GG  + AF +I+ N G+ TE +YPY    GTC++ ++   AA I  YE+VP+ DE
Sbjct: 181 NKGCGGGLMDNAFEFIVNNGGLDTEADYPYTGADGTCNSNKESNIAASIKGYEDVPANDE 240

Query: 255 QALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLI 314
            +L KAV+ QPVSIA+      F+ YK G+  G CGT+LDH V  VG+G   DG  YWL+
Sbjct: 241 ASLQKAVAAQPVSIAVDGGDDLFRFYKGGVLTGACGTELDHGVAAVGYGVAGDGTKYWLV 300

Query: 315 KNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPLA 348
           KNSWG +WG+ G++++ RD     G+CG+  + SYP A
Sbjct: 301 KNSWGTSWGEDGFIRLERDVADEAGMCGLAMKPSYPTA 338


>gi|1208549|gb|AAC49455.1| Pseudotzain [Pseudotsuga menziesii]
          Length = 454

 Score =  325 bits (834), Expect = 1e-86,   Method: Compositional matrix adjust.
 Identities = 153/314 (48%), Positives = 220/314 (70%), Gaps = 8/314 (2%)

Query: 40  EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDL 99
           + +++E++E W+AQH ++Y    EK+ +  +FK+N  YI + N +GN +YKLG NQF+DL
Sbjct: 37  DDAIMELYELWLAQHKKAYNGLDEKQKKFSVFKDNFLYIHQHNNQGNPSYKLGLNQFADL 96

Query: 100 TNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGC 159
           +++EF+A Y G K+ +   R + S + +YQ     D+P S+DWR+KGAVT +KNQ  CG 
Sbjct: 97  SHEEFKAAYLGTKLDA-KKRLSRSPSPRYQYSVGEDLPESIDWREKGAVTAVKNQGSCGS 155

Query: 160 CWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIAT 219
           CWAF+ VAAVEGI +I +GNL  LSEQ+L+DC T+ N GC GG  + AF +II N G+ +
Sbjct: 156 CWAFSTVAAVEGINQIVTGNLTSLSEQELVDCDTSYNQGCNGGLMDYAFQFIISNGGLDS 215

Query: 220 EDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQ 278
           ED+YPY+A  G+C A +K A    I +YE+VP  DE++L KA + QP+S+AI A    FQ
Sbjct: 216 EDDYPYKANNGSCDAYRKNAHVVTIDDYEDVPENDEKSLKKAAANQPISVAIEASGRAFQ 275

Query: 279 SYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----- 333
            Y+ G+F   CGTQLDH VT+VG+G +E G +YWL+KNSWGN+WG+ G++K+ R+     
Sbjct: 276 FYESGVFTSNCGTQLDHGVTLVGYG-SESGIDYWLVKNSWGNSWGEKGFIKLQRNLEGAS 334

Query: 334 EGLCGIGTRSSYPL 347
            G+CGI   +SYP+
Sbjct: 335 TGMCGIAMEASYPV 348


>gi|356545118|ref|XP_003540992.1| PREDICTED: thiol protease SEN102-like [Glycine max]
          Length = 337

 Score =  325 bits (833), Expect = 2e-86,   Method: Compositional matrix adjust.
 Identities = 165/335 (49%), Positives = 218/335 (65%), Gaps = 14/335 (4%)

Query: 20  IIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIE 79
           I + LL++     + SR  HE S+ E HE+WMA++G+ YKD  EKE R  IFK N+E+IE
Sbjct: 11  IALFLLLALGIPQMMSRKLHETSMRERHEQWMAEYGKVYKDAAEKEKRFLIFKHNVEFIE 70

Query: 80  KANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTS 139
             N   N+ YKLG N  +DLT +EF+A   G K P       +++ FKY+N+  T +P +
Sbjct: 71  SFNAAANKPYKLGVNHLADLTVEEFKASRNGLKRP----YELSTTPFKYENV--TAIPAA 124

Query: 140 LDWRDKGAVTPIKNQKEC-GCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-NN 197
           +DWR KGAVT IK+Q +C G CWAF+ VAA EGI +I +G L+ LSEQ+L+DC T G + 
Sbjct: 125 IDWRTKGAVTSIKDQGQCAGSCWAFSTVAATEGIHQITTGKLVSLSEQELVDCDTKGVDQ 184

Query: 198 GCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQAL 257
           GC GG  E  F +II+N GI +E  YPY+AV G C+ A  P  A+I  YE+VP   E+ L
Sbjct: 185 GCEGGYMEDGFEFIIKNGGITSEANYPYKAVDGKCNKATSP-VAQIKGYEKVPPNSEKTL 243

Query: 258 LKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNS 317
            KAV+ QPVS++I A    F  Y  GI+NG CGT+LDH VT VG+G   +G +YWL+KNS
Sbjct: 244 QKAVANQPVSVSIDANGEGFMFYSSGIYNGECGTELDHGVTAVGYGIA-NGTDYWLVKNS 302

Query: 318 WGNTWGDAGYMKIVR----DEGLCGIGTRSSYPLA 348
           WG  WG+ GY+++ R      GLCGI   SSYP A
Sbjct: 303 WGTQWGEKGYVRMQRGVAAKHGLCGIALDSSYPTA 337


>gi|226502454|ref|NP_001140922.1| hypothetical protein [Zea mays]
 gi|223948637|gb|ACN28402.1| unknown [Zea mays]
 gi|413920877|gb|AFW60809.1| hypothetical protein ZEAMMB73_830238 [Zea mays]
          Length = 354

 Score =  325 bits (833), Expect = 2e-86,   Method: Compositional matrix adjust.
 Identities = 166/342 (48%), Positives = 231/342 (67%), Gaps = 17/342 (4%)

Query: 18  MFIIITLLVSCASQVVSSRSTH---EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKEN 74
           + I+    +   ++ +SS ST    E+++   H++WMA+HGR+Y+DE EK  R ++FK N
Sbjct: 19  LTILAVKTMMAEARDLSSTSTGGYGEEAMKVRHQQWMAEHGRTYRDEAEKAHRFQVFKAN 78

Query: 75  LEYIEKANKEGN--RTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLS 132
            ++++ +N  G+  ++Y++  N+F+D+TNDEF A+YTG + P P+     +  FKY N++
Sbjct: 79  ADFVDASNAAGDDKKSYRMELNEFADMTNDEFMAMYTGLR-PVPAGAKKMAG-FKYGNVT 136

Query: 133 MTDVPT---SLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLL 189
           ++D      ++DWR KGAVT IKNQ +CGCCWAFAAVAAVEGI +I +GNL+ LSEQQ+L
Sbjct: 137 LSDADDNQQTVDWRQKGAVTGIKNQGQCGCCWAFAAVAAVEGIHQITTGNLVSLSEQQVL 196

Query: 190 DCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEV 249
           DC T GNNGC GG  + AF YI  N G+ATED YPY A    C + Q  AA  IS Y++V
Sbjct: 197 DCDTEGNNGCNGGYIDNAFQYIAGNGGLATEDAYPYTAAQAMCQSVQPVAA--ISGYQDV 254

Query: 250 PSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGV-CGT--QLDHAVTIVGFGTTE 306
           PSGDE AL  AV+ QPVS+AI A++  FQ Y  G+     C T   L+HAVT VG+GT E
Sbjct: 255 PSGDEAALAAAVANQPVSVAIDAHN--FQLYGGGVMTAASCSTPPNLNHAVTAVGYGTAE 312

Query: 307 DGANYWLIKNSWGNTWGDAGYMKIVRDEGLCGIGTRSSYPLA 348
           DG  YWL+KN WG  WG+ GY+++ R    CG+  ++SYP+A
Sbjct: 313 DGTPYWLLKNQWGQNWGEGGYLRLERGANACGVAQQASYPVA 354


>gi|356515052|ref|XP_003526215.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like [Glycine
           max]
          Length = 339

 Score =  325 bits (832), Expect = 2e-86,   Method: Compositional matrix adjust.
 Identities = 156/333 (46%), Positives = 220/333 (66%), Gaps = 9/333 (2%)

Query: 19  FIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYI 78
           ++I+ L+++  +  V SR   E    E HEKWMAQ+G+ Y D  EKE R +IFK N+++I
Sbjct: 9   YLILFLILTVWTFHVMSRRLSEVCTSERHEKWMAQYGKLYTDAAEKEKRFQIFKNNVQFI 68

Query: 79  EKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPT 138
           E  N  G++ + L  NQF+DL N+EF+A     +       + T ++F+Y+  S+T +P 
Sbjct: 69  ESFNAAGDKPFNLSINQFADLHNEEFKASLINVQKKESGVETATETSFRYE--SITKIPV 126

Query: 139 SLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNG 198
           ++DWR +GAVTPIK+Q  CG CWAF+ VAA+EGI +I +G L+ LSEQ+L+DC    + G
Sbjct: 127 TMDWRKRGAVTPIKDQGNCGSCWAFSTVAAIEGIHQITTGKLVSLSEQELVDCVKGKSEG 186

Query: 199 CLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKP-AAAKISNYEEVPSGDEQAL 257
           C  G +E+AF ++ +N G+A+E  YPY+A   TC   ++    A+I  YE VPS  E+AL
Sbjct: 187 CNFGYKEEAFEFVAKNGGLASEISYPYKANNKTCMVKKETQGVAQIKGYENVPSNSEKAL 246

Query: 258 LKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNS 317
           LKAV+ QPVS+ I A + +F  Y  GIF G CGT  +HAVT++G+G    GA YWL+KNS
Sbjct: 247 LKAVANQPVSVYIDAGALQF--YSSGIFTGKCGTAPNHAVTVIGYGKARGGAKYWLVKNS 304

Query: 318 WGNTWGDAGYMKIVRD----EGLCGIGTRSSYP 346
           WG  WG+ GY+K+ RD    EGLCGI T +SYP
Sbjct: 305 WGTKWGEKGYIKMKRDIRAKEGLCGIATNASYP 337


>gi|116786779|gb|ABK24233.1| unknown [Picea sitchensis]
          Length = 463

 Score =  325 bits (832), Expect = 3e-86,   Method: Compositional matrix adjust.
 Identities = 151/314 (48%), Positives = 219/314 (69%), Gaps = 8/314 (2%)

Query: 40  EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDL 99
           + +++E++E W+AQH ++Y    EK+ R  +FK+N  YI + N +GN +YKLG NQF+DL
Sbjct: 37  DDAIMELYELWLAQHKKAYNGLGEKQNRFSVFKDNFLYIHQHNNQGNPSYKLGLNQFADL 96

Query: 100 TNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGC 159
           +++EF+A Y G K+ +   R + S + +YQ     D+P S+DWR+KGAVT +K+Q  CG 
Sbjct: 97  SHEEFKATYLGAKLDT-KKRLSNSPSPRYQYSDGEDLPESIDWREKGAVTAVKDQGSCGS 155

Query: 160 CWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIAT 219
           CWAF+ VAAVEGI +I +GNL  LSEQ+L+DC T+ N GC GG  + AF +II N G+ +
Sbjct: 156 CWAFSTVAAVEGINQIVTGNLTSLSEQELVDCDTSYNQGCNGGLMDYAFQFIINNGGLDS 215

Query: 220 EDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQ 278
           ED+YPY+A  G+C A +K A    I +YE+VP  DE++L KA + QP+S+AI A    FQ
Sbjct: 216 EDDYPYKANDGSCDAYRKNAHVVTIDDYEDVPENDEKSLKKAAANQPISVAIEASGRAFQ 275

Query: 279 SYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----- 333
            Y+ G+F   CGTQLDH VT+VG+G +E G +YW++KNSWG +WG+ G++++ R+     
Sbjct: 276 FYESGVFTSTCGTQLDHGVTLVGYG-SESGTDYWIVKNSWGKSWGEKGFIRLQRNIEGVS 334

Query: 334 EGLCGIGTRSSYPL 347
            G+CGI   +SYPL
Sbjct: 335 TGMCGIAMEASYPL 348


>gi|357160572|ref|XP_003578808.1| PREDICTED: vignain-like [Brachypodium distachyon]
          Length = 339

 Score =  324 bits (831), Expect = 3e-86,   Method: Compositional matrix adjust.
 Identities = 153/341 (44%), Positives = 216/341 (63%), Gaps = 9/341 (2%)

Query: 13  INTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFK 72
           I    +  I+  L  C S + +     + S+V  HE WM Q+GR YKD  EK  + ++FK
Sbjct: 3   IPKASLLAILGCLCLCGSVLAARELNDDLSMVARHENWMLQYGRVYKDAAEKAQKFEVFK 62

Query: 73  ENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLS 132
            N E+I   N  GN  + LG NQF+D+TN+EF+A  T     S   R  T   F Y+N+S
Sbjct: 63  ANAEFINSFNA-GNHKFWLGINQFADITNEEFKATKTNKGFISNKVRVPTG--FMYENMS 119

Query: 133 MTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS 192
              +P ++DWR KGAVTPIK+Q +CGCCWAF+AVAA+EGI K+ +G L+ LSEQ+L+DC 
Sbjct: 120 FDALPATIDWRTKGAVTPIKDQGQCGCCWAFSAVAAMEGIVKLSTGKLVSLSEQELVDCD 179

Query: 193 TNG-NNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPS 251
            +G + GC GG  + AF +II+N G+  E  YPY A  G C +    +AA I +YE+VP+
Sbjct: 180 VHGEDQGCEGGLMDDAFKFIIKNGGLTQESNYPYDAADGKCKSGSS-SAATIKSYEDVPA 238

Query: 252 GDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANY 311
            +E AL+KAV+ QPVS+A+      FQ Y  G+  G CGT LDH +  +G+GTT DG  +
Sbjct: 239 NNEGALMKAVANQPVSVAVDGGDMTFQFYSGGVMTGSCGTDLDHGIAAIGYGTTSDGTKF 298

Query: 312 WLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPLA 348
           W++KNSWG +WG+ G++++ +D    +G+CG+    SYP A
Sbjct: 299 WIMKNSWGTSWGENGFLRMEKDIADKKGMCGLAMEPSYPTA 339


>gi|302143412|emb|CBI21973.3| unnamed protein product [Vitis vinifera]
          Length = 320

 Score =  324 bits (831), Expect = 3e-86,   Method: Compositional matrix adjust.
 Identities = 166/338 (49%), Positives = 228/338 (67%), Gaps = 37/338 (10%)

Query: 19  FIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYI 78
             ++ +L + ASQ  ++R+ HE S+ E HE WM Q+GR YKD  EK  R KIFK+N+  I
Sbjct: 12  LALLFVLAAWASQA-TARNLHEASMYERHEDWMVQYGREYKDADEKSKRYKIFKDNVARI 70

Query: 79  EKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHR-STTSSTFKYQNLSMTDVP 137
           E  NK  +++YKL  N+F+DLTN+EFRA    +K    +H  ST +++FKY+N+  T VP
Sbjct: 71  ESFNKAMDKSYKLSINEFADLTNEEFRASRNRFK----AHICSTEATSFKYENV--TAVP 124

Query: 138 TSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-N 196
           +++DWR KGAVTPIK+Q +CG CWAF+AVAA+EGIT++ +G LI LSEQ+L+DC T+G +
Sbjct: 125 STVDWRKKGAVTPIKDQGQCGSCWAFSAVAAMEGITQLSTGKLISLSEQELVDCDTSGED 184

Query: 197 NGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCS--AAQKPAAAKISNYEEVPSGDE 254
            GC                       YPY    GTC+   A  P AAKI+ YE+VP+ +E
Sbjct: 185 QGCT---------------------NYPYAGTDGTCNRKKAAHP-AAKINGYEDVPANNE 222

Query: 255 QALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLI 314
           +AL KAV+ QP+++AI A  +EFQ Y  G+F G CGT+LDH V+ VG+GT++DG  YWL+
Sbjct: 223 KALQKAVAHQPIAVAIDAGGSEFQFYSSGVFTGQCGTELDHGVSAVGYGTSDDGMKYWLV 282

Query: 315 KNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPLA 348
           KNSWG  WG+ GY+++ RD    EGLCGI  ++SYP A
Sbjct: 283 KNSWGTGWGEEGYIRMQRDVTAKEGLCGIAMQASYPTA 320


>gi|356533293|ref|XP_003535200.1| PREDICTED: LOW QUALITY PROTEIN: cysteine proteinase RD21a-like
           [Glycine max]
          Length = 466

 Score =  324 bits (830), Expect = 5e-86,   Method: Compositional matrix adjust.
 Identities = 153/316 (48%), Positives = 216/316 (68%), Gaps = 9/316 (2%)

Query: 40  EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDL 99
           E     ++E W+ +HG++Y    EKE R KIFK+NL +IE+ N  G+++YKLG N+F+DL
Sbjct: 41  ESHTRHVYEAWLVKHGKAYNALGEKERRFKIFKDNLRFIEEHNGAGDKSYKLGLNKFADL 100

Query: 100 TNDEFRALYTGYKMPSPSHRSTTSS--TFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKEC 157
           TN+E+RA++ G +   P +++   +  T +Y   +  ++P  +DWR+KGAVTPIK+Q +C
Sbjct: 101 TNEEYRAMFLGTRTRGPKNKAAVVAKKTDRYAYRAGEELPAMVDWREKGAVTPIKDQGQC 160

Query: 158 GCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGI 217
           G CWAF+ V AVEGI +I +GNL  LSEQ+L+DC    N GC GG  + AF +I+QN GI
Sbjct: 161 GSCWAFSTVGAVEGINQIVTGNLTSLSEQELVDCDRGYNMGCNGGLMDYAFEFIVQNGGI 220

Query: 218 ATEDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTE 276
            TE++YPY A   TC   +K A    I  YE+VP+ DE++L+KAV+ QPVS+AI A   E
Sbjct: 221 DTEEDYPYHAKDNTCDPNRKNARVVTIDGYEDVPTNDEKSLMKAVANQPVSVAIEAGGME 280

Query: 277 FQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR---- 332
           FQ Y+ G+F G CGT LDH V  VG+G TE+G +YWL++NSWG+ WG+ GY+K+ R    
Sbjct: 281 FQLYQSGVFTGRCGTNLDHGVVAVGYG-TENGTDYWLVRNSWGSAWGENGYIKLERNVQN 339

Query: 333 -DEGLCGIGTRSSYPL 347
            + G CGI   +SYP+
Sbjct: 340 TETGKCGIAIEASYPI 355


>gi|225458701|ref|XP_002284973.1| PREDICTED: cysteine proteinase RD21a-like [Vitis vinifera]
          Length = 467

 Score =  324 bits (830), Expect = 5e-86,   Method: Compositional matrix adjust.
 Identities = 160/344 (46%), Positives = 227/344 (65%), Gaps = 17/344 (4%)

Query: 18  MFIIITLLVSCASQVVSSRSTH--------EQSVVEIHEKWMAQHGRSYKDELEKEMRLK 69
           +F+++ L  +    ++    TH        ++ V+ ++E W+A+HG+SY    EKE R +
Sbjct: 14  LFLLLGLASALDMSIIGYDETHGDKSSWRTDEDVMAVYEAWLAKHGKSYNALGEKERRFQ 73

Query: 70  IFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQ 129
           IFK+NL +I++ N E NRTYK+G N+F+DLTN+E+R++Y G +  +   RS+   + +Y 
Sbjct: 74  IFKDNLRFIDEHNAE-NRTYKVGLNRFADLTNEEYRSMYLGTRTAA-KRRSSNKISDRYA 131

Query: 130 NLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLL 189
                 +P S+DWR KGAV  +K+Q  CG CWAF+ +AAVEGI KI +G LI LSEQ+L+
Sbjct: 132 FRVGDSLPESVDWRKKGAVVEVKDQGSCGSCWAFSTIAAVEGINKIVTGGLISLSEQELV 191

Query: 190 DCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPA-AAKISNYEE 248
           DC T+ N GC GG  + AF +II N GI +E++YPY+A  G C   +K A    I  YE+
Sbjct: 192 DCDTSYNEGCNGGLMDYAFEFIINNGGIDSEEDYPYKASDGRCDQYRKNAKVVTIDGYED 251

Query: 249 VPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDG 308
           VP  DE++L KAV+ QPVS+AI A   EFQ Y+ GIF G CGT LDH VT VG+G TE+G
Sbjct: 252 VPENDEKSLEKAVANQPVSVAIEAGGREFQLYQSGIFTGRCGTALDHGVTAVGYG-TENG 310

Query: 309 ANYWLIKNSWGNTWGDAGYMKIVRD-----EGLCGIGTRSSYPL 347
            +YW++KNSWG +WG+ GY+++ RD      G CGI   +SYP+
Sbjct: 311 VDYWIVKNSWGASWGEEGYIRMERDLATSATGKCGIAMEASYPI 354


>gi|302143411|emb|CBI21972.3| unnamed protein product [Vitis vinifera]
          Length = 320

 Score =  323 bits (829), Expect = 6e-86,   Method: Compositional matrix adjust.
 Identities = 166/338 (49%), Positives = 226/338 (66%), Gaps = 37/338 (10%)

Query: 19  FIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYI 78
             ++ +L + ASQ  ++RS HE S+ E HE WM Q+GR YKD  EK  R KIFK+N+  I
Sbjct: 12  LALLFVLAAWASQA-TARSLHEASMYERHEDWMVQYGREYKDADEKSKRYKIFKDNVARI 70

Query: 79  EKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHR-STTSSTFKYQNLSMTDVP 137
           E  NK  +++YKL  N+F+DLTN+EFRA    +K    +H  ST +++FKY+N+  T VP
Sbjct: 71  ESFNKAMDKSYKLSINEFADLTNEEFRASRNRFK----AHICSTEATSFKYENV--TAVP 124

Query: 138 TSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-N 196
           +++DWR KGAVTPIK+Q +CG CWAF+AVAA+EGIT++ +G LI LSEQ+L+DC T+G +
Sbjct: 125 STVDWRKKGAVTPIKDQGQCGSCWAFSAVAAMEGITQLSTGKLISLSEQELVDCDTSGED 184

Query: 197 NGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCS--AAQKPAAAKISNYEEVPSGDE 254
            GC                       YPY    GTC+   A  P AAKI+ YE+VP+ +E
Sbjct: 185 QGCT---------------------NYPYAGTDGTCNRKKAAHP-AAKINGYEDVPANNE 222

Query: 255 QALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLI 314
           +AL KAV+ QP+++AI A  +EFQ Y  G+F G CGT+LDH V  VG+GT++DG  YWL+
Sbjct: 223 KALQKAVAHQPIAVAIDASGSEFQFYSSGVFTGQCGTELDHGVAAVGYGTSDDGMKYWLV 282

Query: 315 KNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPLA 348
           KNSW   WG+ GY+++ RD    EGLCGI  ++SYP A
Sbjct: 283 KNSWSTGWGEEGYIRMQRDVTAKEGLCGIAMQASYPTA 320


>gi|357130141|ref|XP_003566711.1| PREDICTED: xylem cysteine proteinase 1-like [Brachypodium
           distachyon]
          Length = 457

 Score =  323 bits (829), Expect = 6e-86,   Method: Compositional matrix adjust.
 Identities = 155/311 (49%), Positives = 217/311 (69%), Gaps = 10/311 (3%)

Query: 43  VVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTND 102
           ++E+ EKW+A+H ++Y    EK  R ++FK+NL++I+K N+E   +Y LG N+F+DLT++
Sbjct: 146 IIELFEKWLAKHQKAYASFEEKLHRFEVFKDNLKHIDKVNREVT-SYWLGLNEFADLTHE 204

Query: 103 EFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWA 162
           EF+A Y G   P+P+  S  S  FKY+++S  D+P S+DWR KGAVT +KNQ +CG CWA
Sbjct: 205 EFKATYLGLAPPAPARESRGS--FKYEDVSADDLPKSVDWRTKGAVTEVKNQGQCGSCWA 262

Query: 163 FAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDE 222
           F+ VAAVEGI  I +GNL  LSEQ+L+DCS +GNNGC GG  + AF+YI  + G+ TE+ 
Sbjct: 263 FSTVAAVEGINAIVTGNLTALSEQELIDCSVDGNNGCNGGLMDYAFSYIASSGGLHTEEA 322

Query: 223 YPYQAVPGTCSAAQK--PAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSY 280
           YPY    G+C   +K    A  IS YE+VP+ +EQAL+KA++ QPVS+AI A    FQ Y
Sbjct: 323 YPYLMEEGSCGDGKKSESEAVTISGYEDVPAHNEQALIKALAHQPVSVAIEASGRHFQFY 382

Query: 281 KEGIFNGVCGTQLDHAVTIVGFGTTE-DGANYWLIKNSWGNTWGDAGYMKIVR----DEG 335
             G+F+G CGTQLDH V  VG+G+ +  G +Y +++NSWG  WG+ GY+++ R     EG
Sbjct: 383 SGGVFDGPCGTQLDHGVAAVGYGSDKGKGHDYIIVRNSWGAKWGEKGYIRMKRGTGKGEG 442

Query: 336 LCGIGTRSSYP 346
           LCGI   +SYP
Sbjct: 443 LCGINKMASYP 453


>gi|171702829|dbj|BAG16370.1| cysteine protease [Brassica oleracea var. italica]
          Length = 332

 Score =  323 bits (829), Expect = 6e-86,   Method: Compositional matrix adjust.
 Identities = 156/328 (47%), Positives = 220/328 (67%), Gaps = 7/328 (2%)

Query: 18  MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
           +F+I++L+ S +     SR   E ++ + H  WM +HGR Y D  EK  R  +FK N+E 
Sbjct: 2   IFLIVSLVSSFSLSTTLSRPLDEVTMQKRHAAWMTEHGRVYADANEKNNRYVVFKRNVES 61

Query: 78  IEKANK-EGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDV 136
           IE+ N+ +   T+KL  NQF+DLTN+EFR++YTGYK  S     T  ++F+YQ++S   +
Sbjct: 62  IERLNEVQYGLTFKLAVNQFADLTNEEFRSMYTGYKGNSVLSSRTKPTSFRYQHVSSDAL 121

Query: 137 PTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGN 196
           P S+DWR KGAVTPIK+Q  CG CWAF+AVAA+EG+ +I+ G LI LSEQ+L+DC TN +
Sbjct: 122 PISVDWRKKGAVTPIKDQGSCGSCWAFSAVAAIEGVAQIKKGKLISLSEQELVDCDTN-D 180

Query: 197 NGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQ-KPAAAKISNYEEVPSGDEQ 255
           +GC+GG    AF Y +   G+ +E  YPY++  GTC+  + K  A  I  +E+VP+ DE+
Sbjct: 181 DGCMGGYMNSAFNYTMTTGGLTSESNYPYKSTDGTCNINKTKQIATSIKGFEDVPANDEK 240

Query: 256 ALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIK 315
           AL+KAV+  PVSI IA   T FQ Y  G+F+G C T LDH V +VG+G + +G+ YW++K
Sbjct: 241 ALMKAVAHHPVSIGIAGGGTGFQFYSSGVFSGECSTHLDHGVAVVGYGKSSNGSKYWILK 300

Query: 316 NSWGNTWGDAGYMKIVRD----EGLCGI 339
           NSWG  WG+ GYM+I +D     G CG+
Sbjct: 301 NSWGPKWGERGYMRIKKDTKAKHGQCGL 328


>gi|357113934|ref|XP_003558756.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like
           [Brachypodium distachyon]
          Length = 346

 Score =  323 bits (828), Expect = 7e-86,   Method: Compositional matrix adjust.
 Identities = 152/337 (45%), Positives = 216/337 (64%), Gaps = 10/337 (2%)

Query: 18  MFIIITLLVSCASQVVSSR--STHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENL 75
           +  I+  L  C++ V+++R     + ++   HE+WMAQ GR YKD  EK  RL++FK N+
Sbjct: 10  LVAIVGCLCLCSTAVLAARELGDADNAMAARHEQWMAQFGRVYKDPAEKAHRLEVFKANV 69

Query: 76  EYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTD 135
            +IE  N E N  + LG NQF+DLTNDEFRA  T   +     R   +  FKY ++S+  
Sbjct: 70  AFIESFNAE-NHEFWLGANQFADLTNDEFRASKTNKGIKQGGVRDAPTG-FKYSDVSIDA 127

Query: 136 VPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG 195
           +P S+DWR KGAVTPIKNQ +CG CWAF+AVAA EG+ K+ +G L+ LSEQ+L+DC  +G
Sbjct: 128 LPASVDWRTKGAVTPIKNQGQCGSCWAFSAVAATEGVVKLSTGKLVSLSEQELVDCDVHG 187

Query: 196 -NNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQK-PAAAKISNYEEVPSGD 253
            + GC+GG  + AF +II+N G+ TE  YPY      C + +    AA I  YE+VP+ D
Sbjct: 188 VDQGCMGGWMDDAFKFIIKNGGLTTEANYPYTGEDDKCKSNETVNVAATIKGYEDVPAND 247

Query: 254 EQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWL 313
           E AL+KAV+ QPVS+ +      FQ Y  G+  G CG ++DH +  +G+G T +G  YWL
Sbjct: 248 ESALMKAVAHQPVSVVVDGGDMTFQLYAGGVMTGSCGVEMDHGIAAIGYGATSNGTKYWL 307

Query: 314 IKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYP 346
           +KNSWG TWG+ G++++ +D     G+CG+  + SYP
Sbjct: 308 MKNSWGTTWGEKGFLRMAKDIPDKRGMCGLAMKPSYP 344


>gi|147790682|emb|CAN61026.1| hypothetical protein VITISV_001146 [Vitis vinifera]
          Length = 469

 Score =  323 bits (827), Expect = 1e-85,   Method: Compositional matrix adjust.
 Identities = 163/354 (46%), Positives = 231/354 (65%), Gaps = 23/354 (6%)

Query: 14  NTTPMFIIITLLVSCAS------QVVSSRSTH--------EQSVVEIHEKWMAQHGRSYK 59
           +++ M + + LL+  AS       ++    TH        ++ V+ ++E W+A+HG+SY 
Sbjct: 6   SSSSMAVFLFLLLGLASASAXDMSIIGYDETHGDKSSWRTDEDVMAVYEAWLAKHGKSYN 65

Query: 60  DELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHR 119
              EKE R +IFK+NL +I++ N E NRTYK+G N+F+DLTN+E+R++Y G +  +   R
Sbjct: 66  ALGEKERRFQIFKDNLRFIDEHNAE-NRTYKVGLNRFADLTNEEYRSMYLGTRTAA-KRR 123

Query: 120 STTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGN 179
           S+   + +Y       +P S+DWR KGAV  +K+Q  CG CWAF+ +AAVEGI KI +G 
Sbjct: 124 SSNKISDRYAFRVGDSLPESVDWRKKGAVVEVKDQGSCGSCWAFSTIAAVEGINKIVTGG 183

Query: 180 LIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPA 239
           LI LSEQ+L+DC T+ N GC GG  + AF +II N GI +E++YPY+A  G C   +K A
Sbjct: 184 LISLSEQELVDCDTSYNEGCNGGLMDYAFEFIINNGGIDSEEDYPYKASDGRCDQYRKNA 243

Query: 240 -AAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVT 298
               I  YE+VP  DE++L KAV+ QPVS+AI A   EFQ Y+ GIF G CGT LDH VT
Sbjct: 244 XVVTIDGYEDVPENDEKSLEKAVANQPVSVAIEAGGREFQLYQSGIFTGRCGTALDHGVT 303

Query: 299 IVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD-----EGLCGIGTRSSYPL 347
            VG+G TE+G +YW++KNSWG +WG+ GY+++ RD      G CGI   +SYP+
Sbjct: 304 AVGYG-TENGVDYWIVKNSWGASWGEEGYIRMERDLATSATGKCGIAMEASYPI 356


>gi|356515040|ref|XP_003526209.1| PREDICTED: thiol protease SEN102-like [Glycine max]
          Length = 342

 Score =  323 bits (827), Expect = 1e-85,   Method: Compositional matrix adjust.
 Identities = 166/339 (48%), Positives = 223/339 (65%), Gaps = 14/339 (4%)

Query: 18  MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
           M  +   L    SQV+  R  H+ ++ E HE WMA++G+ YKD  EKE R +IFK+N+E+
Sbjct: 10  MLALFLFLAVGISQVMP-RKLHQTALRERHENWMAEYGKMYKDAAEKEKRFQIFKDNVEF 68

Query: 78  IEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTS-STFKYQNLSMTDV 136
           IE  N  GN+ YKLG N  +DLT +EF+    G K       +T   + FKY+N+  TD+
Sbjct: 69  IESFNAAGNKPYKLGVNHLADLTLEEFKDSRNGLKRTYEFSTTTFKLNGFKYENV--TDI 126

Query: 137 PTSLDWRDKGAVTPIKNQ-KECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG 195
           P ++DWR KGAVTPIK+Q  +CG CWAF+ +AA EGI +I +GNL+ LSEQ+L+DC +  
Sbjct: 127 PEAIDWRVKGAVTPIKDQGDQCGSCWAFSTIAATEGIHQISTGNLVSLSEQELVDCDSV- 185

Query: 196 NNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSA--AQKPAAAKISNYEEVPSGD 253
           ++GC GG  E  F +II+N GI +E  YPY+ V GTC+   A  P  A+I  YE VPS  
Sbjct: 186 DDGCEGGFMEDGFEFIIKNGGITSETNYPYKGVDGTCNTTIAASP-VAQIKGYEIVPSYS 244

Query: 254 EQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWL 313
           E+AL KAV+ QPVS++I A +  F  Y  GI+NG CGT LDH VT VG+G TE+G +YW+
Sbjct: 245 EEALQKAVANQPVSVSIHATNATFMFYSSGIYNGECGTDLDHGVTAVGYG-TENGTDYWI 303

Query: 314 IKNSWGNTWGDAGYMKIVR----DEGLCGIGTRSSYPLA 348
           +KNSWG  WG+ GY+++ R      G+CGI   SSYP A
Sbjct: 304 VKNSWGTQWGEKGYIRMHRGIAAKHGICGIALDSSYPTA 342


>gi|357160569|ref|XP_003578807.1| PREDICTED: vignain-like [Brachypodium distachyon]
          Length = 339

 Score =  322 bits (825), Expect = 2e-85,   Method: Compositional matrix adjust.
 Identities = 153/341 (44%), Positives = 214/341 (62%), Gaps = 9/341 (2%)

Query: 13  INTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFK 72
           I    +  I+  L  C+S + +     + S+V  HE WM Q+GR YKD  EK  + ++FK
Sbjct: 3   IPKASLLAILGCLCFCSSVLAARELNDDLSMVARHESWMLQYGRVYKDAAEKASKFEVFK 62

Query: 73  ENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLS 132
            N  +I+  N  GN  + LG NQF+D+TN EF+A  T     S   R+ T   F Y+N+S
Sbjct: 63  ANAGFIDSFNA-GNHKFWLGINQFADITNKEFKATKTNKGFISNKVRAPTG--FSYENVS 119

Query: 133 MTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS 192
              +P S+DWR KGAVTP+K+Q +CGCCWAF+AVAA EGI K+ +G L+ LSEQ+L+DC 
Sbjct: 120 FDALPASIDWRTKGAVTPVKDQGQCGCCWAFSAVAATEGIVKLSTGKLVSLSEQELVDCD 179

Query: 193 TNG-NNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPS 251
            +G + GC GG  + AF +II N G+  E  YPY A  G C +  K +A  I +YE+VP+
Sbjct: 180 VHGEDQGCEGGLMDDAFKFIISNGGLTQESSYPYDAEDGKCKSGSK-SAGTIKSYEDVPA 238

Query: 252 GDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANY 311
            +E AL+KAV+ QPVS+A+      FQ Y  G+  G CGT LDH +  +G+G T DG  Y
Sbjct: 239 NNEGALMKAVANQPVSVAVDGGDMTFQFYSGGVMTGSCGTDLDHGIAAIGYGVTSDGTKY 298

Query: 312 WLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPLA 348
           WL+KNSWG +WG+ G++++ +D    +G+CG+    SYP A
Sbjct: 299 WLMKNSWGTSWGENGFLRMEKDIADKKGMCGLAMEPSYPTA 339


>gi|60100207|gb|AAX13273.1| putative cysteine protease [Oryza sativa Japonica Group]
          Length = 349

 Score =  322 bits (825), Expect = 2e-85,   Method: Compositional matrix adjust.
 Identities = 156/316 (49%), Positives = 215/316 (68%), Gaps = 10/316 (3%)

Query: 42  SVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNR-TYKLGTNQFSDLT 100
           ++ + HE+WMA+HGR+Y D+ EK  RL++F++N+ +IE  N   ++  + L  NQF+DLT
Sbjct: 35  AMAQRHERWMAKHGRAYADDAEKARRLEVFRDNVAFIESVNAAASQHKFWLEENQFADLT 94

Query: 101 NDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCC 160
           N EFRA  TG + PS S  +   ++F+Y N+S  D+P S+DWR KGAV P+K+Q +CGCC
Sbjct: 95  NAEFRATRTGLR-PSSSRGNRAPTSFRYANVSTGDLPASVDWRGKGAVNPVKDQGDCGCC 153

Query: 161 WAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-NNGCLGGSREKAFAYIIQNQGIAT 219
           WAF+AVAA+EG  K+ +G L+ LSEQQL+ C   G + GC GG  + AF +II+N G+A 
Sbjct: 154 WAFSAVAAMEGAVKLATGKLVSLSEQQLVSCDVKGEDQGCEGGLMDDAFDFIIKNGGLAA 213

Query: 220 EDEYPYQAVPGTC-SAAQKPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQ 278
           E +YPY A    C +A    AAA I  YE+VP+ DE ALLKAV+ QPVS+AI      FQ
Sbjct: 214 ESDYPYTASDDKCATAGAGAAAATIKGYEDVPANDEAALLKAVANQPVSVAIDGGDRHFQ 273

Query: 279 SYKEGIFNGV--CGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR---- 332
            YK G+ +G   C T+LDHA+T VG+G   DG  YWL+KNSWG +WG+ GY+++ R    
Sbjct: 274 FYKGGVLSGAAGCATELDHAITAVGYGVASDGTKYWLMKNSWGTSWGEDGYVRMERGVAD 333

Query: 333 DEGLCGIGTRSSYPLA 348
            EG+CG+   +SYP A
Sbjct: 334 KEGVCGLAMMASYPTA 349


>gi|357458909|ref|XP_003599735.1| Cysteine proteinase [Medicago truncatula]
 gi|357474677|ref|XP_003607623.1| Cysteine proteinase [Medicago truncatula]
 gi|355488783|gb|AES69986.1| Cysteine proteinase [Medicago truncatula]
 gi|355508678|gb|AES89820.1| Cysteine proteinase [Medicago truncatula]
          Length = 342

 Score =  322 bits (825), Expect = 2e-85,   Method: Compositional matrix adjust.
 Identities = 164/338 (48%), Positives = 215/338 (63%), Gaps = 11/338 (3%)

Query: 17  PMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLE 76
           PMF+I T  +     V+SSR   E  +   HEKWM Q G+SYKD  EKE R +IFK N+E
Sbjct: 10  PMFLIFTTWM--LPYVMSSR-VLEPYLSNKHEKWMTQFGKSYKDAAEKEKRFQIFKNNVE 66

Query: 77  YIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTG-YKMPSPSHRSTTSSTFKYQNLSMTD 135
           +IE  N  GN+ + L  N F+DLTN+EF+A   G  K+         +++F+Y N+  T 
Sbjct: 67  FIELFNAVGNKPFNLSINHFADLTNEEFKASLNGNKKLHDKFDILNETTSFRYHNV--TS 124

Query: 136 VPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG 195
           VP S+DWR +GAVTPIKNQ  CG CWAF+ VA++EGI +I +G L+ LSEQ+L+DC    
Sbjct: 125 VPASMDWRKRGAVTPIKNQGSCGSCWAFSTVASIEGIHQITTGELVSLSEQELIDCVRGN 184

Query: 196 NNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDE 254
           ++GC GG  E AF +I +  G+A+E  YPY+     C   ++    A+I  YE+VPS  E
Sbjct: 185 SSGCSGGYLEDAFKFIAKKGGMASETNYPYKETDEKCKFKKESKHVAEIKGYEKVPSNSE 244

Query: 255 QALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLI 314
             LLKAV+ QPVS+ + A    FQ Y  GIF G CGT  DH VTIVG+G + D   YWL+
Sbjct: 245 NDLLKAVANQPVSVYVDAGDYVFQFYSGGIFTGKCGTDTDHVVTIVGYGVSLDYTEYWLV 304

Query: 315 KNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPLA 348
           KNSWG  WG+ GYMK+ R+    +GLCGI T  SYP+A
Sbjct: 305 KNSWGTGWGEKGYMKLKRNVDSKKGLCGIATNPSYPVA 342


>gi|146216004|gb|ABQ10204.1| cysteine protease Cp6 [Actinidia deliciosa]
          Length = 461

 Score =  322 bits (824), Expect = 2e-85,   Method: Compositional matrix adjust.
 Identities = 158/326 (48%), Positives = 225/326 (69%), Gaps = 8/326 (2%)

Query: 26  VSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEG 85
           +S   ++ SSR+  E  V+ ++E W+ +HG+SY    EKE R +IFK+NL +I++ N E 
Sbjct: 27  MSIIGELSSSRTDDE--VMAMYESWLVKHGKSYNAIGEKEKRFQIFKDNLRFIDEHNAE- 83

Query: 86  NRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDK 145
           +RTYK+G N+F+DLTNDE+R++Y G +  S    ST   + +Y  ++   +P S+DWR+K
Sbjct: 84  SRTYKVGLNRFADLTNDEYRSMYLGARTGSRRRLSTQKRSDRYVPVAGESLPDSVDWREK 143

Query: 146 GAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSRE 205
           GAV  +K+Q  CG CWAF+ +AAVEGI +I +G+LI LSEQ+L+DC T+ N GC GG  +
Sbjct: 144 GAVVGVKDQGSCGSCWAFSTIAAVEGINQIVTGDLISLSEQELVDCDTSYNEGCNGGLMD 203

Query: 206 KAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQ 264
            AF +II+N GI TE++YPY A  G C   +K A    I +YE+VP  +EQAL KAV+ Q
Sbjct: 204 YAFEFIIKNGGIDTEEDYPYNARDGRCDQYRKNAKVVTIDDYEDVPVNNEQALQKAVANQ 263

Query: 265 PVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGD 324
           PVS+AI A    FQ Y+ G+F G CGT LDH VT VG+G TE+  +YW++KNSWG++WG+
Sbjct: 264 PVSVAIEASGMAFQFYESGVFTGNCGTALDHGVTAVGYG-TENSVDYWIVKNSWGSSWGE 322

Query: 325 AGYMKIVRDEGL---CGIGTRSSYPL 347
           +GY+++ R+ G    CGI    SYP+
Sbjct: 323 SGYIRMERNTGATGKCGIAVEPSYPI 348


>gi|356517310|ref|XP_003527331.1| PREDICTED: vignain-like [Glycine max]
          Length = 342

 Score =  322 bits (824), Expect = 2e-85,   Method: Compositional matrix adjust.
 Identities = 156/336 (46%), Positives = 221/336 (65%), Gaps = 8/336 (2%)

Query: 19  FIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYI 78
           ++I+ L+++  +  V SR   E    E HEKWMAQ+GR YKD  EKE R ++FK N+ +I
Sbjct: 9   YLILFLVLAVWTSHVMSRRLSEACTSERHEKWMAQYGRVYKDAAEKEKRFQVFKNNVHFI 68

Query: 79  EKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPT 138
           E  N  G++ + L  NQF+DL ++EF+AL    +  +    ++T ++F+Y+  S+T +P 
Sbjct: 69  ESFNAAGDKPFNLSINQFADLNDEEFKALLINVQKKASWVETSTETSFRYE--SVTKIPA 126

Query: 139 SLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNG 198
           ++D R +GAVTPIK+Q  CG CWAF+AVAA EGI +I +G L+ LSEQ+L+DC    + G
Sbjct: 127 TIDRRKRGAVTPIKDQGRCGSCWAFSAVAATEGIHQITTGKLVPLSEQELVDCVKGESEG 186

Query: 199 CLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKP-AAAKISNYEEVPSGDEQAL 257
           C+GG  + AF +I +  GIA+E  YPY+ V  TC   ++    A+I  YE+VPS +E+AL
Sbjct: 187 CIGGYVDDAFEFIAKKGGIASETHYPYKGVNKTCKVKKETHGVAEIKGYEKVPSNNEKAL 246

Query: 258 LKAVSMQPVSIAIAAYSTEFQSYKEGIFNGV-CGTQLDHAVTIVGFGTTEDGANYWLIKN 316
           LKAV+ QPVS+ I A +  F+ Y  GIFN   CGT  +HAV +VG+G   D + YWL+KN
Sbjct: 247 LKAVANQPVSVYIDAGTHAFKYYSSGIFNARNCGTDPNHAVAVVGYGKALDDSKYWLVKN 306

Query: 317 SWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPLA 348
           SWG  WG+ GY++I RD    EGLCGI     YP+A
Sbjct: 307 SWGTEWGERGYIRIKRDIRAKEGLCGIAKYPYYPIA 342


>gi|21070926|gb|AAM34401.1|AF377947_7 putative cysteine proteinase [Oryza sativa Japonica Group]
 gi|31712050|gb|AAP68356.1| putative cysteine protease [Oryza sativa Japonica Group]
 gi|40538988|gb|AAR87245.1| putative cysteine protease [Oryza sativa Japonica Group]
 gi|108711126|gb|ABF98921.1| Papain family cysteine protease containing protein, expressed
           [Oryza sativa Japonica Group]
 gi|125545747|gb|EAY91886.1| hypothetical protein OsI_13535 [Oryza sativa Indica Group]
          Length = 350

 Score =  322 bits (824), Expect = 2e-85,   Method: Compositional matrix adjust.
 Identities = 159/310 (51%), Positives = 210/310 (67%), Gaps = 10/310 (3%)

Query: 47  HEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKAN----KEGNRTYKLGTNQFSDLTND 102
           HEKWMA+HG++YKDE EK  RL++F+ N + I+  N    K+G   ++L TN+F+DLT+D
Sbjct: 42  HEKWMAKHGKTYKDEEEKARRLEVFRANAKLIDSFNAAAEKDGGGGHRLATNRFADLTDD 101

Query: 103 EFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWA 162
           EFRA  TGY+ P P+  +     F Y+N S+   P S+DWR  GAVT +K+Q  CGCCWA
Sbjct: 102 EFRAARTGYQRP-PAAVAGAGGGFLYENFSLAAAPQSMDWRAMGAVTGVKDQGSCGCCWA 160

Query: 163 FAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-NNGCLGGSREKAFAYIIQNQGIATED 221
           F+AVAAVEG+ KIR+G L+ LSEQ+L+DC   G + GC GG  + AF YI +  G+A E 
Sbjct: 161 FSAVAAVEGLAKIRTGQLVSLSEQELVDCDVRGEDQGCEGGLMDTAFQYIARRGGLAAES 220

Query: 222 EYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYK 281
            YPY+ V G C AA   AAA I  +++VPS DE AL+ AV+ QPVS+AI      F+ Y 
Sbjct: 221 SYPYRGVDGACRAAAGRAAASIRGFQDVPSNDEGALMAAVARQPVSVAINGAGYVFRFYD 280

Query: 282 EGIFNGV-CGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD---EGLC 337
            G+  G  CGT+L+HAVT VG+GT  DG  YWL+KNSWG +WG+ GY++I R    EG C
Sbjct: 281 RGVLGGAGCGTELNHAVTAVGYGTASDGTGYWLMKNSWGASWGEGGYVRIRRGVGREGAC 340

Query: 338 GIGTRSSYPL 347
           GI   +SYP+
Sbjct: 341 GIAQMASYPV 350


>gi|356515038|ref|XP_003526208.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like [Glycine
           max]
          Length = 339

 Score =  322 bits (824), Expect = 2e-85,   Method: Compositional matrix adjust.
 Identities = 154/333 (46%), Positives = 219/333 (65%), Gaps = 9/333 (2%)

Query: 19  FIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYI 78
           ++I+ L+++  +  V SR   E    E HEKWMAQ+G+ Y D  EKE R +IFK N+++I
Sbjct: 9   YLILFLILTVWTFHVMSRRLSEVCTSERHEKWMAQYGKLYTDAAEKEKRFQIFKNNVQFI 68

Query: 79  EKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPT 138
           E  N  G++ + L  NQF+DL N+EF+A     +       + T ++F+Y+  S+T +P 
Sbjct: 69  ESFNAAGDKPFNLSINQFADLHNEEFKASLINVQKKESGVETATETSFRYE--SITKIPV 126

Query: 139 SLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNG 198
           ++DWR +GAVTPIK+Q  CG CWAF+ VAA+EGI +I +G L+ LSEQ+L+DC    + G
Sbjct: 127 TMDWRKRGAVTPIKDQGNCGSCWAFSIVAAIEGIHQITTGKLVSLSEQELVDCVKGKSEG 186

Query: 199 CLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKP-AAAKISNYEEVPSGDEQAL 257
           C  G +E+AF ++ +N G+A+E  YPY+A   TC   ++    A+I  YE VPS  E+AL
Sbjct: 187 CNFGYKEEAFEFVAKNGGLASEISYPYKANNKTCMVKKETQGVAQIKGYENVPSNSEKAL 246

Query: 258 LKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNS 317
           LKAV+ QPVS+ I A + +F  Y  GIF G CGT  +HA T++G+G    GA YWL+KNS
Sbjct: 247 LKAVANQPVSVYIDAGALQF--YSSGIFTGKCGTAPNHAATVIGYGKARGGAKYWLVKNS 304

Query: 318 WGNTWGDAGYMKIVRD----EGLCGIGTRSSYP 346
           WG  WG+ GY+++ RD    EGLCGI T +SYP
Sbjct: 305 WGTKWGEKGYIRMKRDIRAKEGLCGIATNASYP 337


>gi|357160591|ref|XP_003578813.1| PREDICTED: vignain-like [Brachypodium distachyon]
          Length = 339

 Score =  321 bits (823), Expect = 3e-85,   Method: Compositional matrix adjust.
 Identities = 151/341 (44%), Positives = 218/341 (63%), Gaps = 9/341 (2%)

Query: 13  INTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFK 72
           I    +  I+  L   AS + +     + S+V  HE WM+Q+GRSYKD  EK+ + ++FK
Sbjct: 3   IPNASLLAILGCLCFFASGLAARELNDDLSMVARHESWMSQYGRSYKDAAEKDRKFEVFK 62

Query: 73  ENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLS 132
            N  +I+  N + N  + LG NQF+D+TN+EF+   T     S   R++T   F Y+N+S
Sbjct: 63  ANAAFIDSFNAK-NHKFWLGINQFADITNEEFKVTKTNKGFISNKVRASTG--FSYENVS 119

Query: 133 MTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS 192
           +  +P ++DWR KGAVTP+K+Q +CGCCWAF+AVAA EGI K+ +G L+ LSEQ+L+DC 
Sbjct: 120 IDALPATIDWRTKGAVTPVKDQGQCGCCWAFSAVAATEGIVKLSTGKLVSLSEQELVDCD 179

Query: 193 TNG-NNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPS 251
            +G + GC GG  + AF +II N G+  E  YPY A  G C +  K +A  I +YE+VP+
Sbjct: 180 VHGEDQGCEGGLMDDAFKFIITNGGLTQESSYPYDAEDGKCKSGSK-SAGTIKSYEDVPA 238

Query: 252 GDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANY 311
            +E AL+KAV+ QPVS+A+      FQ Y  G+  G CGT LDH +  +G+G T DG  Y
Sbjct: 239 NNEGALMKAVANQPVSVAVDGGDMTFQFYSGGVMTGSCGTDLDHGIAAIGYGVTSDGTKY 298

Query: 312 WLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPLA 348
           WL+KNSWG +WG+ G++++ +D    +G+CG+    SYP A
Sbjct: 299 WLMKNSWGTSWGENGFLRMEKDIADKKGMCGLAMEPSYPTA 339


>gi|38346007|emb|CAD40110.2| OSJNBa0035O13.9 [Oryza sativa Japonica Group]
 gi|125589429|gb|EAZ29779.1| hypothetical protein OsJ_13837 [Oryza sativa Japonica Group]
          Length = 314

 Score =  321 bits (822), Expect = 4e-85,   Method: Compositional matrix adjust.
 Identities = 156/315 (49%), Positives = 214/315 (67%), Gaps = 10/315 (3%)

Query: 43  VVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNR-TYKLGTNQFSDLTN 101
           + + HE+WMA+HGR+Y D+ EK  RL++F++N+ +IE  N   ++  + L  NQF+DLTN
Sbjct: 1   MAQRHERWMAKHGRAYADDAEKARRLEVFRDNVAFIESVNAAASQHKFWLEENQFADLTN 60

Query: 102 DEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCW 161
            EFRA  TG + PS S  +   ++F+Y N+S  D+P S+DWR KGAV P+K+Q +CGCCW
Sbjct: 61  AEFRATRTGLR-PSSSRGNRAPTSFRYANVSTGDLPASVDWRGKGAVNPVKDQGDCGCCW 119

Query: 162 AFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-NNGCLGGSREKAFAYIIQNQGIATE 220
           AF+AVAA+EG  K+ +G L+ LSEQQL+ C   G + GC GG  + AF +II+N G+A E
Sbjct: 120 AFSAVAAMEGAVKLATGKLVSLSEQQLVSCDVKGEDQGCEGGLMDDAFDFIIKNGGLAAE 179

Query: 221 DEYPYQAVPGTC-SAAQKPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQS 279
            +YPY A    C +A    AAA I  YE+VP+ DE ALLKAV+ QPVS+AI      FQ 
Sbjct: 180 SDYPYTASDDKCATAGAGAAAATIKGYEDVPANDEAALLKAVANQPVSVAIDGGDRHFQF 239

Query: 280 YKEGIFNGV--CGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR----D 333
           YK G+ +G   C T+LDHA+T VG+G   DG  YWL+KNSWG +WG+ GY+++ R     
Sbjct: 240 YKGGVLSGAAGCATELDHAITAVGYGVASDGTKYWLMKNSWGTSWGEDGYVRMERGVADK 299

Query: 334 EGLCGIGTRSSYPLA 348
           EG+CG+   +SYP A
Sbjct: 300 EGVCGLAMMASYPTA 314


>gi|297819566|ref|XP_002877666.1| hypothetical protein ARALYDRAFT_906213 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297323504|gb|EFH53925.1| hypothetical protein ARALYDRAFT_906213 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 304

 Score =  320 bits (820), Expect = 6e-85,   Method: Compositional matrix adjust.
 Identities = 164/315 (52%), Positives = 213/315 (67%), Gaps = 27/315 (8%)

Query: 40  EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDL 99
           E S +E HE+WM++  R Y D+ EK  R +IFK+NL+++E  N   N TYKL  N+FSDL
Sbjct: 11  EASAIEKHEQWMSRFNRVYSDDSEKTSRFEIFKKNLKFVESFNMNTNNTYKLDVNKFSDL 70

Query: 100 TNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGC 159
           T++EF+A Y G      +  S  + +F+Y+N+S T    S+DWR +GAVTP+K+Q +CGC
Sbjct: 71  TDEEFQARYMGLVPEGMTGDSQKTVSFRYENVSETG--ESMDWRLEGAVTPVKDQGQCGC 128

Query: 160 CWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNN-GCLGGSREKAFAYIIQNQGIA 218
           CWAFAAVAAVEG+TKI +G L+ LSEQQL+DCST  NN GC GG    A+ YI +NQGI 
Sbjct: 129 CWAFAAVAAVEGVTKIANGELVSLSEQQLVDCSTANNNMGCDGGLALTAYDYIKENQGIT 188

Query: 219 TEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQ 278
           +E+ YPYQAV  TC +   PAAA IS YE VP  DE+ALLKAVS                
Sbjct: 189 SEENYPYQAVQQTCKSTD-PAAATISGYEAVPKDDEEALLKAVS---------------- 231

Query: 279 SYKEGIF-NGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD---- 333
             + GIF +  CGT   HAVTIVG+GT+E+G  YWL+KNSWG +WG+ GYM+I RD    
Sbjct: 232 --QHGIFEDEYCGTDSHHAVTIVGYGTSEEGIKYWLLKNSWGESWGENGYMRIKRDVDEP 289

Query: 334 EGLCGIGTRSSYPLA 348
           +G+CG+  R+ YP+A
Sbjct: 290 QGMCGLAHRAYYPVA 304


>gi|218202087|gb|EEC84514.1| hypothetical protein OsI_31214 [Oryza sativa Indica Group]
          Length = 348

 Score =  320 bits (820), Expect = 6e-85,   Method: Compositional matrix adjust.
 Identities = 149/327 (45%), Positives = 212/327 (64%), Gaps = 9/327 (2%)

Query: 18  MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
           +F I+  L  C++ + +   + + ++   HE+WMAQ+GR YKD+ EK  R ++FK N  +
Sbjct: 8   LFAILGCLCLCSAVLAARELSDDAAMAARHERWMAQYGRMYKDDAEKARRFEVFKANAAF 67

Query: 78  IEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVP 137
           IE  N  GN  + LG NQF+DLTNDEFR   T       + R  T   F+Y+N+++  +P
Sbjct: 68  IESFNA-GNHKFWLGVNQFADLTNDEFRLTKTNKGFIPSTTRVPTG--FRYENVNIDALP 124

Query: 138 TSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-N 196
            ++DWR KG VTPIK+Q +CGCCWAF+AVAA+EGI K+ +G LI LSEQ+L+DC  +G +
Sbjct: 125 ATMDWRTKGVVTPIKDQGQCGCCWAFSAVAAMEGIVKLSTGKLISLSEQELVDCDVHGED 184

Query: 197 NGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQA 256
            GC GG  + AF +II+N G+ TE  YPY A    C +    + A I  YE+VP+ +E A
Sbjct: 185 QGCEGGLMDDAFKFIIKNGGLTTESNYPYAAADDKCKSVSN-SVASIKGYEDVPANNEAA 243

Query: 257 LLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKN 316
           L+KAV+ QPVS+A+      FQ YK G+  G CGT LDH +  +G+G   DG  YWL+KN
Sbjct: 244 LMKAVANQPVSVAVDGDDMTFQFYKGGVMIGSCGTDLDHGIVAIGYGKASDGTKYWLLKN 303

Query: 317 SWGNTWGDAGYMKIVRD----EGLCGI 339
           SWG TWG+ G++++ +D     G+CG+
Sbjct: 304 SWGMTWGENGFLRMEKDISDKRGMCGL 330


>gi|116787404|gb|ABK24495.1| unknown [Picea sitchensis]
 gi|224286306|gb|ACN40861.1| unknown [Picea sitchensis]
          Length = 452

 Score =  320 bits (820), Expect = 6e-85,   Method: Compositional matrix adjust.
 Identities = 153/323 (47%), Positives = 222/323 (68%), Gaps = 10/323 (3%)

Query: 32  VVSSRSTHEQ-SVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYK 90
           ++SS+   E  +++E++E W+A+H R+Y    EK+ R  +FK+N  YI + N +GNR+YK
Sbjct: 26  IISSKDLREDDAIMELYELWLAEHKRAYNGLDEKQKRFSVFKDNFLYIHEHN-QGNRSYK 84

Query: 91  LGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTP 150
           LG NQF+DL+++EF+A Y G K+ +    S   S  +YQ     D+P S+DWR+KGAVT 
Sbjct: 85  LGLNQFADLSHEEFKATYLGAKLDTKKRLSRPPSR-RYQYSDGEDLPESIDWREKGAVTS 143

Query: 151 IKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAY 210
           +K+Q  CG CWAF+ VAAVEGI +I +G+LI LSEQ+L+DC T+ N GC GG  + AF +
Sbjct: 144 VKDQGSCGSCWAFSTVAAVEGINQIVTGDLISLSEQELVDCDTSYNQGCNGGLMDYAFEF 203

Query: 211 IIQNQGIATEDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVSIA 269
           II N G+ +E++YPY A  G+C + +K A    I +YE+VP  DE++L KA + QP+S+A
Sbjct: 204 IINNGGLDSEEDYPYTAYDGSCDSYRKNAHVVTIDDYEDVPENDEKSLKKAAANQPISVA 263

Query: 270 IAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMK 329
           I A   EFQ Y  G+F   CGTQLDH VT+VG+G +E G +YW +KNSWG +WG+ G+++
Sbjct: 264 IEASGREFQFYDSGVFTSTCGTQLDHGVTLVGYG-SESGTDYWTVKNSWGKSWGEEGFIR 322

Query: 330 IVRD-----EGLCGIGTRSSYPL 347
           + R+      G+CGI   +SYP+
Sbjct: 323 LQRNIEVASTGMCGIAMEASYPV 345


>gi|125547258|gb|EAY93080.1| hypothetical protein OsI_14881 [Oryza sativa Indica Group]
          Length = 314

 Score =  320 bits (820), Expect = 6e-85,   Method: Compositional matrix adjust.
 Identities = 156/315 (49%), Positives = 214/315 (67%), Gaps = 10/315 (3%)

Query: 43  VVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNR-TYKLGTNQFSDLTN 101
           + + HE+WMA+HGR+Y D+ EK  RL++F++N+ +IE  N   ++  + L  NQF+DLTN
Sbjct: 1   MAQRHERWMAKHGRAYADDAEKVRRLEVFRDNVAFIESVNAAASQHKFWLEENQFADLTN 60

Query: 102 DEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCW 161
            EFRA  TG + PS S  +   ++F+Y N+S  D+P S+DWR KGAV P+K+Q +CGCCW
Sbjct: 61  AEFRATRTGLR-PSSSRGNRAPTSFRYANVSTGDLPASVDWRGKGAVNPVKDQGDCGCCW 119

Query: 162 AFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-NNGCLGGSREKAFAYIIQNQGIATE 220
           AF+AVAA+EG  K+ +G L+ LSEQQL+ C   G + GC GG  + AF +II+N G+A E
Sbjct: 120 AFSAVAAMEGAVKLATGKLVSLSEQQLVSCDVKGEDQGCEGGLMDDAFDFIIKNGGLAAE 179

Query: 221 DEYPYQAVPGTC-SAAQKPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQS 279
            +YPY A    C +A    AAA I  YE+VP+ DE ALLKAV+ QPVS+AI      FQ 
Sbjct: 180 SDYPYTASDDKCATAGAGAAAATIKGYEDVPANDEAALLKAVANQPVSVAIDGGDRHFQF 239

Query: 280 YKEGIFNGV--CGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR----D 333
           YK G+ +G   C T+LDHA+T VG+G   DG  YWL+KNSWG +WG+ GY+++ R     
Sbjct: 240 YKGGVLSGAAGCATELDHAITAVGYGVASDGTKYWLMKNSWGTSWGEDGYVRMERGVADK 299

Query: 334 EGLCGIGTRSSYPLA 348
           EG+CG+   +SYP A
Sbjct: 300 EGVCGLAMMASYPTA 314


>gi|302143415|emb|CBI21976.3| unnamed protein product [Vitis vinifera]
          Length = 322

 Score =  320 bits (819), Expect = 8e-85,   Method: Compositional matrix adjust.
 Identities = 166/336 (49%), Positives = 226/336 (67%), Gaps = 35/336 (10%)

Query: 21  IITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEK 80
           ++ +L + ASQ  ++R+ HE S+ E HE WMAQ+GR YKD  EK  R KIFK+N+  IE 
Sbjct: 14  LLFVLAAWASQA-TARNLHEASMYERHEDWMAQYGRVYKDADEKSKRYKIFKDNVARIES 72

Query: 81  ANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHR-STTSSTFKYQNLSMTDVPTS 139
            NK  +++YKL  N+F+DLTN+EF      +K    +H  ST +++FKY+N+  T VP++
Sbjct: 73  FNKAMDKSYKLSINEFADLTNEEFGTSRNRFK----AHICSTEATSFKYENV--TAVPST 126

Query: 140 LDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-NNG 198
           +DWR KGAVTPIK+Q +CG CWAF+AVAA+EGIT++ +G LI LSEQ+L+DC T+G + G
Sbjct: 127 IDWRKKGAVTPIKDQGQCGSCWAFSAVAAMEGITQLSTGKLISLSEQELVDCDTSGEDQG 186

Query: 199 CLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCS--AAQKPAAAKISNYEEVPSGDEQA 256
           C G +                   YPY    GTC+   A  P AAKI+ YE+VP+ +E+A
Sbjct: 187 CNGAN-------------------YPYAGTDGTCNRKKAAHP-AAKINGYEDVPANNEKA 226

Query: 257 LLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKN 316
           L KAV  QP+++AI A   EFQ Y  G+F G CGT+LDH V  VG+GT++DG  YWL+KN
Sbjct: 227 LQKAVVHQPIAVAIDAGGFEFQFYSSGVFTGQCGTELDHGVAAVGYGTSDDGMKYWLVKN 286

Query: 317 SWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPLA 348
           SWG  WG+ GY+++ RD    EGLCGI  ++SYP A
Sbjct: 287 SWGTGWGEEGYIRMQRDVTAKEGLCGIAMQASYPTA 322


>gi|171702841|dbj|BAG16376.1| cysteine protease [Brassica rapa var. perviridis]
          Length = 333

 Score =  319 bits (818), Expect = 1e-84,   Method: Compositional matrix adjust.
 Identities = 157/329 (47%), Positives = 220/329 (66%), Gaps = 8/329 (2%)

Query: 18  MFIIITLLVSCASQVVSSRST-HEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLE 76
           +F+I++L+ S +  +  SR    E ++ + H +WM +HGR Y D  EK  R  +FK N+E
Sbjct: 2   IFLIVSLVSSFSLSITLSRPLLDEVAMQKRHAEWMTEHGRVYADANEKNNRYAVFKRNVE 61

Query: 77  YIEKANK-EGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTD 135
            IE+ N  +   T+KL  NQF+DLTN+EFR++YTG+K  S     T  ++F+YQN+S   
Sbjct: 62  RIERLNDVQSGLTFKLAVNQFADLTNEEFRSMYTGFKGNSVLSSRTKPTSFRYQNVSSDA 121

Query: 136 VPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG 195
           +P S+DWR KGAVTPIK+Q  CG CWAF+AVAA+EG+ +I+ G LI LSEQ+L+DC TN 
Sbjct: 122 LPVSVDWRKKGAVTPIKDQGLCGSCWAFSAVAAIEGVAQIKKGKLISLSEQELVDCDTN- 180

Query: 196 NNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQ-KPAAAKISNYEEVPSGDE 254
           + GC+GG  + AF Y I   G+ +E  YPY++  GTC+  + K  A  I  +E+VP+ DE
Sbjct: 181 DGGCMGGLMDTAFNYTITIGGLTSESNYPYKSTNGTCNFNKTKQIATSIKGFEDVPANDE 240

Query: 255 QALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLI 314
           +AL+KAV+  PVSI IA     FQ Y  G+F+G C T LDH VT VG+G +++G  YW++
Sbjct: 241 KALMKAVAHHPVSIGIAGGDIGFQFYSSGVFSGECTTHLDHGVTAVGYGRSKNGLKYWIL 300

Query: 315 KNSWGNTWGDAGYMKIVRD----EGLCGI 339
           KNSWG  WG+ GYM+I +D     G CG+
Sbjct: 301 KNSWGPKWGERGYMRIKKDIKPKHGQCGL 329


>gi|414588007|tpg|DAA38578.1| TPA: hypothetical protein ZEAMMB73_159244 [Zea mays]
          Length = 307

 Score =  319 bits (818), Expect = 1e-84,   Method: Compositional matrix adjust.
 Identities = 147/309 (47%), Positives = 207/309 (66%), Gaps = 9/309 (2%)

Query: 43  VVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTND 102
           + E HE+WMA++ R YKD  EK  R ++FK+N  ++E  N +    + LG NQF+DLT +
Sbjct: 1   MAERHERWMAEYDRVYKDAAEKARRFEVFKDNFAFVESFNADKKNKFWLGVNQFADLTTE 60

Query: 103 EFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWA 162
           EF+A   G+K  S     TT   FKY+NLS++ +PT++DWR KGAVTPIKNQ +CGCCWA
Sbjct: 61  EFKA-NKGFKPISAEEVPTTG--FKYENLSVSALPTAVDWRTKGAVTPIKNQGQCGCCWA 117

Query: 163 FAAVAAVEGITKIRSGNLIQLSEQQLLDCST-NGNNGCLGGSREKAFAYIIQNQGIATED 221
           F+A+AA+EGI K+ +GNL+ LSEQ+ +DC T N + GC GG  + AF ++I+N G+ATE 
Sbjct: 118 FSAIAAMEGIVKLSTGNLVSLSEQEPVDCDTHNMDEGCEGGWMDNAFEFVIKNGGLATES 177

Query: 222 EYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYK 281
            YPY+ V G C    K +AA I  +E+VP  +E AL+K V+ QPVS+A+ A    F  Y 
Sbjct: 178 SYPYKVVDGKCKGGSK-SAATIKGHEDVPPNNEAALMKVVASQPVSVAVDASDRTFMLYS 236

Query: 282 EGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLC 337
            G+  G CGTQLDH +  +G+G   D   YW++KNSWG TWG+ G++++ +D     G+C
Sbjct: 237 GGVMTGSCGTQLDHGIAAIGYGVESDDTKYWILKNSWGTTWGEKGFLRMEKDISDKRGMC 296

Query: 338 GIGTRSSYP 346
            +  + SYP
Sbjct: 297 DLAMKPSYP 305


>gi|194352750|emb|CAQ00103.1| papain-like cysteine proteinase [Hordeum vulgare subsp. vulgare]
 gi|326514262|dbj|BAJ92281.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326519402|dbj|BAJ96700.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326524351|dbj|BAK00559.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326531998|dbj|BAK01375.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 356

 Score =  319 bits (817), Expect = 1e-84,   Method: Compositional matrix adjust.
 Identities = 153/313 (48%), Positives = 218/313 (69%), Gaps = 10/313 (3%)

Query: 41  QSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLT 100
           + +VE+ EKW+A+H ++Y    EK  R ++FK+NL++I+K N+E   +Y LG N+F+DLT
Sbjct: 43  ERLVELFEKWLAKHQKAYASFEEKLHRFEVFKDNLKHIDKINRE-VTSYWLGLNEFADLT 101

Query: 101 NDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCC 160
           +DEF+A Y G  + +   R  +S +F+Y+++S +D+P S+DWR KGAVT +KNQ +CG C
Sbjct: 102 HDEFKAAYLG--LDAAPARRGSSRSFRYEDVSASDLPKSVDWRKKGAVTEVKNQGQCGSC 159

Query: 161 WAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATE 220
           WAF+ VAAVEGI  I +GNL  LSEQ+L+DCS +GN+GC GG  + AF+YI  + G+ TE
Sbjct: 160 WAFSTVAAVEGINAIVTGNLTALSEQELIDCSVDGNSGCNGGLMDYAFSYIASSGGLHTE 219

Query: 221 DEYPYQAVPGTCSAAQKP--AAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQ 278
           + YPY    G+C   +K    A  IS YE+VP+ DEQAL+KA++ QPVS+AI A    FQ
Sbjct: 220 EAYPYLMEEGSCGDGKKAESEAVTISGYEDVPANDEQALIKALAHQPVSVAIEASGRHFQ 279

Query: 279 SYKEGIFNGVCGTQLDHAVTIVGFGTTE-DGANYWLIKNSWGNTWGDAGYMKIVR----D 333
            Y  G+F+G CG QLDH V  VG+G+ +  G +Y +++NSWG  WG+ GY+++ R     
Sbjct: 280 FYSGGVFDGPCGAQLDHGVAAVGYGSDKGKGHDYIIVRNSWGAQWGEKGYIRMKRGTSNG 339

Query: 334 EGLCGIGTRSSYP 346
           EGLCGI   +SYP
Sbjct: 340 EGLCGINKMASYP 352


>gi|242086591|ref|XP_002439128.1| hypothetical protein SORBIDRAFT_09g000960 [Sorghum bicolor]
 gi|241944413|gb|EES17558.1| hypothetical protein SORBIDRAFT_09g000960 [Sorghum bicolor]
          Length = 371

 Score =  319 bits (817), Expect = 1e-84,   Method: Compositional matrix adjust.
 Identities = 154/315 (48%), Positives = 216/315 (68%), Gaps = 10/315 (3%)

Query: 38  THEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFS 97
            H   ++++ E+W+A++ ++Y    EK  R ++FK+NL +I++ANK+   TY LG N F+
Sbjct: 57  VHHDRLIKLFEEWVAKYRKAYASFEEKLHRFEVFKDNLHHIDEANKKVT-TYWLGLNAFA 115

Query: 98  DLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKEC 157
           DLT+DEF+A Y G + P    + TT S F+Y  ++  DVP S+DWR KGAVT +KNQ +C
Sbjct: 116 DLTHDEFKATYLGLRQPET--KKTTDSRFRYGGVADDDVPASVDWRKKGAVTDVKNQGQC 173

Query: 158 GCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGI 217
           G CWAF+ VAAVEGI +I +GNL  LSEQ+L+DCST+GNNGC GG  + AF+YI  + G+
Sbjct: 174 GSCWAFSTVAAVEGINQIVTGNLTSLSEQELVDCSTDGNNGCNGGVMDNAFSYIASSGGL 233

Query: 218 ATEDEYPYQAVPGTCS--AAQKPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYST 275
            TE+ YPY    G C   A        IS YE+VP+ DEQAL+KA++ QP+S+AI A   
Sbjct: 234 RTEEAYPYLMEEGDCDDKARDGEQVVTISGYEDVPANDEQALVKALAHQPLSVAIEASGR 293

Query: 276 EFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD-- 333
            FQ Y  G+FNG CG++LDH V  VG+G+++ G +Y ++KNSWG+ WG+ GY+++ R   
Sbjct: 294 HFQFYSGGVFNGPCGSELDHGVAAVGYGSSK-GQDYIIVKNSWGSHWGEKGYIRMKRGTG 352

Query: 334 --EGLCGIGTRSSYP 346
             EGLCGI   +SYP
Sbjct: 353 KPEGLCGINKMASYP 367


>gi|356543114|ref|XP_003540008.1| PREDICTED: LOW QUALITY PROTEIN: KDEL-tailed cysteine endopeptidase
           CEP1-like [Glycine max]
          Length = 343

 Score =  318 bits (816), Expect = 2e-84,   Method: Compositional matrix adjust.
 Identities = 163/331 (49%), Positives = 222/331 (67%), Gaps = 17/331 (5%)

Query: 28  CASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNR 87
           C SQV  SR  H+ S+ E HE+WM ++G+ YKD  E + R  IF+ N+E+IE  N  GN+
Sbjct: 20  CTSQV-KSRKLHDASMYERHEQWMEKYGKVYKDSAEMQKRFLIFENNVEFIESFNAAGNK 78

Query: 88  TYKLGTNQFSDLTNDEFRALYTGYKMPSPSH----RSTTSSTFKYQNLSMTDVPTSLDWR 143
            YKL  N  +D TN+EF A + GYK    SH    R TT + FKY+N+  TD+P ++DWR
Sbjct: 79  PYKLSINHLADQTNEEFMASHKGYK---GSHWQGLRITTQTPFKYENV--TDIPWAVDWR 133

Query: 144 DKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGS 203
            KG VT IK+Q +CG CWAF+AVAA EGI +I +GNL+ LSE++L+DC +  ++GC GG 
Sbjct: 134 QKGDVTSIKDQAQCGNCWAFSAVAATEGIYQITTGNLVSLSEKELVDCDSV-DHGCDGGL 192

Query: 204 REKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVS 262
            E  F +II+N GI++E  YPY AV GTC   ++ +  A+I+ YE VP   E+ L KAV+
Sbjct: 193 MEHGFEFIIKNGGISSEANYPYTAVNGTCDTNKEASPVAQITGYETVPVNCEEELQKAVA 252

Query: 263 MQ-PVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNT 321
            Q  +S++I A  + FQ Y  G+F G CGTQLDH VT VG+G+T+ G  YW++KNSWG  
Sbjct: 253 NQLTMSVSIDAGGSAFQFYPSGVFTGQCGTQLDHGVTAVGYGSTDYGTQYWIVKNSWGTQ 312

Query: 322 WGDAGYMKIVR----DEGLCGIGTRSSYPLA 348
           WG+ GY++++R     EGLCGI   +SYP A
Sbjct: 313 WGEEGYIRMLRGIDAQEGLCGIAMDASYPTA 343


>gi|356515046|ref|XP_003526212.1| PREDICTED: thiol protease SEN102-like [Glycine max]
          Length = 342

 Score =  318 bits (815), Expect = 3e-84,   Method: Compositional matrix adjust.
 Identities = 165/339 (48%), Positives = 222/339 (65%), Gaps = 14/339 (4%)

Query: 18  MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
           M  +   L    SQV+  R  H+ ++ E HE WMA++G+ YKD  EKE R +IFK+N+E+
Sbjct: 10  MLALFLFLAVGISQVMP-RKLHQTALRERHENWMAEYGKMYKDAAEKEKRFQIFKDNVEF 68

Query: 78  IEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTS-STFKYQNLSMTDV 136
           IE  N  GN+ YKLG N  +DLT +EF+    G K       +T   + FKY+N+  TD+
Sbjct: 69  IESFNAAGNKPYKLGVNHLADLTLEEFKDSRNGLKRTYEFSTTTFKLNGFKYENV--TDI 126

Query: 137 PTSLDWRDKGAVTPIKNQ-KECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG 195
           P ++DWR KGAVTPIK+Q  +CG  WAF+ +AA EGI +I +GNL+ LSEQ+L+DC +  
Sbjct: 127 PEAIDWRVKGAVTPIKDQGDQCGRFWAFSTIAATEGIHQISTGNLVSLSEQELVDCDSV- 185

Query: 196 NNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSA--AQKPAAAKISNYEEVPSGD 253
           ++GC GG  E  F +II+N GI +E  YPY+ V GTC+   A  P  A+I  YE VPS  
Sbjct: 186 DDGCEGGFMEDGFEFIIKNGGITSETNYPYKGVDGTCNTTIAASP-VAQIKGYEIVPSYS 244

Query: 254 EQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWL 313
           E+AL KAV+ QPVS++I A +  F  Y  GI+NG CGT LDH VT VG+G TE+G +YW+
Sbjct: 245 EEALKKAVANQPVSVSIHATNATFMFYSSGIYNGECGTDLDHGVTAVGYG-TENGTDYWI 303

Query: 314 IKNSWGNTWGDAGYMKIVR----DEGLCGIGTRSSYPLA 348
           +KNSWG  WG+ GY+++ R      G+CGI   SSYP A
Sbjct: 304 VKNSWGTQWGEKGYIRMHRGIAAKHGICGIALDSSYPTA 342


>gi|297598407|ref|NP_001045533.2| Os01g0971400 [Oryza sativa Japonica Group]
 gi|15289977|dbj|BAB63672.1| putative cysteine protease CP1 [Oryza sativa Japonica Group]
 gi|125529282|gb|EAY77396.1| hypothetical protein OsI_05384 [Oryza sativa Indica Group]
 gi|125573472|gb|EAZ14987.1| hypothetical protein OsJ_04922 [Oryza sativa Japonica Group]
 gi|215740756|dbj|BAG97412.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|215741010|dbj|BAG97505.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|215765325|dbj|BAG87022.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|215767338|dbj|BAG99566.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|255674119|dbj|BAF07447.2| Os01g0971400 [Oryza sativa Japonica Group]
          Length = 365

 Score =  317 bits (813), Expect = 4e-84,   Method: Compositional matrix adjust.
 Identities = 154/322 (47%), Positives = 218/322 (67%), Gaps = 15/322 (4%)

Query: 37  STHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQF 96
           ++HE+ ++E+ EK+MA++ ++Y    EK  R ++FK+NL +I++ NK+    Y LG N+F
Sbjct: 43  ASHER-LMELFEKFMAKYRKAYSSLEEKLRRFEVFKDNLNHIDEENKK-ITGYWLGLNEF 100

Query: 97  SDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKE 156
           +DLT+DEF+A Y G  + +P+ R++    F+Y+ +    +P  +DWR KGAVT +KNQ +
Sbjct: 101 ADLTHDEFKAAYLGLTL-TPARRNSNDQLFRYEEVEAASLPKEVDWRKKGAVTEVKNQGQ 159

Query: 157 CGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQG 216
           CG CWAF+ VAAVEGI  I +GNL +LSEQ+L+DC T+GNNGC GG  + AF+YI  N G
Sbjct: 160 CGSCWAFSTVAAVEGINAIVTGNLTRLSEQELIDCDTDGNNGCSGGLMDYAFSYIAANGG 219

Query: 217 IATEDEYPYQAVPGTC--------SAAQKPAAAKISNYEEVPSGDEQALLKAVSMQPVSI 268
           + TE+ YPY    GTC           +  AA  IS YE+VP  +EQALLKA++ QPVS+
Sbjct: 220 LHTEESYPYLMEEGTCRRGSTEGDDDGEAAAAVTISGYEDVPRNNEQALLKALAHQPVSV 279

Query: 269 AIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYM 328
           AI A    FQ Y  G+F+G CGT+LDH VT VG+GT   G +Y ++KNSWG+ WG+ GY+
Sbjct: 280 AIEASGRNFQFYSGGVFDGPCGTRLDHGVTAVGYGTASKGHDYIIVKNSWGSHWGEKGYI 339

Query: 329 KIVR----DEGLCGIGTRSSYP 346
           ++ R     +GLCGI   +SYP
Sbjct: 340 RMRRGTGKHDGLCGINKMASYP 361


>gi|255563136|ref|XP_002522572.1| cysteine protease, putative [Ricinus communis]
 gi|223538263|gb|EEF39872.1| cysteine protease, putative [Ricinus communis]
          Length = 340

 Score =  317 bits (812), Expect = 5e-84,   Method: Compositional matrix adjust.
 Identities = 174/339 (51%), Positives = 222/339 (65%), Gaps = 15/339 (4%)

Query: 16  TPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENL 75
           T + I++ +LV+  SQ +      E +V E HE+WMA+HGR+Y+D+ EKE R  IFK+NL
Sbjct: 7   TKLAIVLMILVTWVSQAMPRPLIDEDAVAEKHEQWMARHGRTYQDDEEKERRFHIFKKNL 66

Query: 76  EYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPS--PSHRSTTSSTFKYQNLSM 133
           ++IE  N   NRTYKLG N F+DLT++EF A YTGYKMP   P+   TT +T     L  
Sbjct: 67  KHIENFNNAFNRTYKLGLNHFADLTDEEFLATYTGYKMPKVLPTANITTKTTQSSDVLYE 126

Query: 134 TDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCST 193
            +VP S+DWR +G VTP+KNQ  CGCCWAF+A AAVEGI     GN + LS QQLLDC  
Sbjct: 127 ANVPESIDWRTRGVVTPVKNQGRCGCCWAFSAAAAVEGII----GNGVSLSAQQLLDCVP 182

Query: 194 NGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGD 253
           + +NGC GG  + AF YIIQNQG+A+   YPYQ +   C  +    AA+IS Y +V   D
Sbjct: 183 D-SNGCNGGFMDNAFRYIIQNQGLASATYYPYQLMREMCRPSNN--AARISGYVDVTPAD 239

Query: 254 EQALLKAVSMQPVSIAIAAYS-TEFQSYKEGIFNGV-CGTQLDHAVTIVGFGTTEDGANY 311
           E+ L  AV+ QPVS A+ A S   F+ Y  GIF    CG+ L HA+TIVG+GT+ +G  Y
Sbjct: 240 EETLKSAVARQPVSAAVDATSELNFKYYGGGIFPPQDCGSTLTHAITIVGYGTSAEGTKY 299

Query: 312 WLIKNSWGNTWGDAGYMKIVRDE----GLCGIGTRSSYP 346
           WLIKNSWG  WG+ GYM++ RD     G CGI  R+SYP
Sbjct: 300 WLIKNSWGEGWGEGGYMRLQRDVGSYGGACGIALRASYP 338


>gi|356543118|ref|XP_003540010.1| PREDICTED: LOW QUALITY PROTEIN: vignain-like [Glycine max]
          Length = 339

 Score =  317 bits (811), Expect = 6e-84,   Method: Compositional matrix adjust.
 Identities = 163/339 (48%), Positives = 223/339 (65%), Gaps = 17/339 (5%)

Query: 18  MFIIITLLVSCASQVVSSRSTHEQS--VVEIHEKWMAQHGRSYKDELEKEMRLKIFKENL 75
           +  ++ LL  C SQV+S R+ HE S  + E HE+W  ++G+ YKD  EK+ RL IFK+N+
Sbjct: 10  ILALVLLLPICISQVMS-RNLHEASXCMSERHEQWTKKYGKVYKDAAEKQKRLLIFKDNV 68

Query: 76  EYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSST-FKYQNLSMT 134
           E+IE  N  GN+ YKL  N  +D TN+EF A + GYK     H+ + S T FKY+N+  T
Sbjct: 69  EFIESFNAAGNKPYKLSINHLTDQTNEEFVASHNGYK-----HKGSHSQTPFKYENI--T 121

Query: 135 DVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN 194
            VP ++DWR+ GAV  +K+Q +CG CWAF+ VA  EGI +I +  L+ LSEQ+L+DC + 
Sbjct: 122 GVPNAVDWRENGAVXAMKDQGQCGNCWAFSTVATTEGIYQITTSMLMSLSEQELVDCDSV 181

Query: 195 GNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGD 253
            ++GC GG  E  F +I +N GI++E  YPY AV GT  A ++ + AA+I  YE VP+  
Sbjct: 182 -DHGCDGGYMEGGFEFIXKNGGISSEANYPYTAVDGTYDANKEASPAAQIKGYETVPANS 240

Query: 254 EQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWL 313
           E AL KAV+ QPVS+ I    + FQ    G+F G CGTQLDH VT VG+G+T+DG  YW+
Sbjct: 241 EDALQKAVANQPVSVTIDVGGSAFQFNSSGVFTGQCGTQLDHGVTAVGYGSTDDGTQYWI 300

Query: 314 IKNSWGNTWGDAGYMKIVR----DEGLCGIGTRSSYPLA 348
           +KNSWG  WG+ GY+++ R     EGLCGI   +SYP A
Sbjct: 301 VKNSWGTQWGEEGYIRMQRGTDAQEGLCGIAMDASYPTA 339


>gi|363814535|ref|NP_001242660.1| uncharacterized protein LOC100807362 precursor [Glycine max]
 gi|255636658|gb|ACU18666.1| unknown [Glycine max]
          Length = 367

 Score =  317 bits (811), Expect = 8e-84,   Method: Compositional matrix adjust.
 Identities = 161/344 (46%), Positives = 221/344 (64%), Gaps = 18/344 (5%)

Query: 18  MFIIITLLVSCASQVVSSRSTH--------EQSVVEIHEKWMAQHGRSYKDELEKEMRLK 69
           +F ++ +  +    ++S   +H        ++ V+ I+E+W+ +HG+ Y    EKE R +
Sbjct: 15  LFTVLAVSSALDMSIISYDRSHADKSGWKSDEEVMSIYEEWLVKHGKVYNAVEEKEKRFQ 74

Query: 70  IFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQ 129
           IFK+NL +IE+ N   NRTYK+G N+FSDL+N+E+R+ Y G K+  PS R     + +Y 
Sbjct: 75  IFKDNLNFIEEHNAV-NRTYKVGLNRFSDLSNEEYRSKYLGTKI-DPS-RMMARPSRRYS 131

Query: 130 NLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLL 189
                ++P S+DWR +GAV  +KNQ EC  CWAF+A+AAVEGI KI +GNL  LSEQ+LL
Sbjct: 132 PRVADNLPESVDWRKEGAVVRVKNQSECEGCWAFSAIAAVEGINKIVTGNLTALSEQELL 191

Query: 190 DCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPA-AAKISNYEE 248
           DC    N GC GG  + AF +II N GI TE++YP+Q   G C   +  A A  I  YE 
Sbjct: 192 DCDRTVNAGCSGGLVDYAFEFIINNGGIDTEEDYPFQGADGICDQYKINARAVTIDGYER 251

Query: 249 VPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDG 308
           VP+ DE AL KAV+ QPVS+AI AY  EFQ Y+ GIF G CGT +DH VT VG+G TE+G
Sbjct: 252 VPAYDELALKKAVANQPVSVAIEAYGKEFQLYESGIFTGTCGTSIDHGVTAVGYG-TENG 310

Query: 309 ANYWLIKNSWGNTWGDAGYMKIVRD-----EGLCGIGTRSSYPL 347
            +YW++KNSWG  WG+AGY+ + R+      G CGI   + YP+
Sbjct: 311 IDYWIVKNSWGENWGEAGYVGMERNIAEDTAGKCGIAILTLYPI 354


>gi|356545116|ref|XP_003540991.1| PREDICTED: vignain-like [Glycine max]
          Length = 342

 Score =  316 bits (809), Expect = 1e-83,   Method: Compositional matrix adjust.
 Identities = 158/346 (45%), Positives = 220/346 (63%), Gaps = 11/346 (3%)

Query: 8   SGSFKINTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMR 67
           S S K N   +F+++T+  S   QV+S R +   S V+ HEKWMAQ+G+ YKD  EKE R
Sbjct: 3   SFSQKKNILVVFLVLTVWTS---QVMSRRLSEAYSSVK-HEKWMAQYGKVYKDAAEKEKR 58

Query: 68  LKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFK 127
            +IFK N+ +IE  +  G++ + L  NQF+DL   +F+AL    +    + R+ T++   
Sbjct: 59  FQIFKNNVHFIESFHAAGDKPFNLSINQFADL--HKFKALLINGQKKEHNVRTATATEAS 116

Query: 128 YQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQ 187
           ++  S+T +P+SLDWR +GAVTPIK+Q  C  CWAF+ VA +EG+ +I  G L+ LSEQ+
Sbjct: 117 FKYDSVTRIPSSLDWRKRGAVTPIKDQGTCRSCWAFSTVATIEGLHQITKGELVSLSEQE 176

Query: 188 LLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKP-AAAKISNY 246
           L+DC    + GC GG  E AF +I +  G+A+E  YPY+ V  TC   ++     +I  Y
Sbjct: 177 LVDCVKGDSEGCYGGYVEDAFEFIAKKGGVASETHYPYKGVNKTCKVKKETHGVVQIKGY 236

Query: 247 EEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTE 306
           E+VPS  E+ALLKAV+ QPVS  + A    FQ Y  GIF G CGT +DH+VT+VG+G   
Sbjct: 237 EQVPSNSEKALLKAVAHQPVSAYVEAGGYAFQFYSSGIFTGKCGTDIDHSVTVVGYGKAR 296

Query: 307 DGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPLA 348
            G  YWL+KNSWG  WG+ GY+++ RD    EGLCGI T + YP A
Sbjct: 297 GGNKYWLVKNSWGTEWGEKGYIRMKRDIRAKEGLCGIATGALYPTA 342


>gi|115468686|ref|NP_001057942.1| Os06g0582600 [Oryza sativa Japonica Group]
 gi|55296512|dbj|BAD68726.1| putative cysteine proteinase [Oryza sativa Japonica Group]
 gi|113595982|dbj|BAF19856.1| Os06g0582600 [Oryza sativa Japonica Group]
 gi|215695236|dbj|BAG90427.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 357

 Score =  316 bits (809), Expect = 1e-83,   Method: Compositional matrix adjust.
 Identities = 170/357 (47%), Positives = 224/357 (62%), Gaps = 20/357 (5%)

Query: 8   SGSFKINTTPMFIII----TLLVSCASQVVSSRSTHEQSVV-EIHEKWMAQHGRSYKDEL 62
           S SF +    + II+    T LV  A +  ++    + S + E +EKW A HGR+YKD L
Sbjct: 5   SSSFSLAAILLIIIMYCCPTGLVEAARKGPAAAGGGDDSAMRERYEKWAADHGRTYKDSL 64

Query: 63  EKEMRLKIFKENLEYIEKANKEGNR-TYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRST 121
           EK  R ++F+ N  +I+  N  G + + +L TN+F+DLTN+EF A Y G    +P     
Sbjct: 65  EKARRFEVFRTNALFIDSFNAAGGKKSPRLTTNKFADLTNEEF-AEYYGRPFSTPV---I 120

Query: 122 TSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLI 181
             S F Y N+  +DVP +++WRD+GAVT +KNQK+C  CWAF+AVAAVEGI +IRS NL+
Sbjct: 121 GGSGFMYGNVRTSDVPANINWRDRGAVTQVKNQKDCASCWAFSAVAAVEGIHQIRSHNLV 180

Query: 182 QLSEQQLLDCSTNGNN-GCLGGSREKAFAYIIQNQGIATEDEYPYQ-AVPGTCSAAQKPA 239
            LS QQLLDCST  NN GC  G  ++AF YI  N GIA E +YPY+    GTC A+ KP 
Sbjct: 181 ALSTQQLLDCSTGRNNHGCNRGDMDEAFRYITSNGGIAAESDYPYEDRALGTCRASGKPV 240

Query: 240 AAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIF----NGVCGTQLDH 295
           AA I  ++ VP  +E ALL AV+ QPVS+A+       Q +  G+F    N  C T L+H
Sbjct: 241 AASIRGFQYVPPNNETALLLAVAHQPVSVALDGVGKVSQFFSSGVFGAMQNETCTTDLNH 300

Query: 296 AVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPLA 348
           A+T VG+GT E G  YWL+KNSWG  WG+ GYMKI RD     GLCG+  + SYP+A
Sbjct: 301 AMTAVGYGTDEHGTKYWLMKNSWGTDWGEGGYMKIARDVASNTGLCGLAMQPSYPVA 357


>gi|242072390|ref|XP_002446131.1| hypothetical protein SORBIDRAFT_06g002140 [Sorghum bicolor]
 gi|241937314|gb|EES10459.1| hypothetical protein SORBIDRAFT_06g002140 [Sorghum bicolor]
          Length = 328

 Score =  315 bits (808), Expect = 2e-83,   Method: Compositional matrix adjust.
 Identities = 151/338 (44%), Positives = 220/338 (65%), Gaps = 20/338 (5%)

Query: 15  TTPMFIIITL-LVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKE 73
           T+  F++  L   S  S V+++R   + ++VE HE WM ++GR YKD  EK  R ++FK+
Sbjct: 3   TSKAFLLAILGCASLCSSVLAARELSDAAMVERHENWMVEYGRVYKDAAEKARRFQVFKD 62

Query: 74  NLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSM 133
           N+ ++E  N   N  + LG NQF+DLT +EF+A   G+K   P+     ++ FKY+NLS+
Sbjct: 63  NVAFVESFNTNKNNKFWLGVNQFADLTTEEFKA-NKGFK---PTAEKVPTTGFKYENLSV 118

Query: 134 TDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCST 193
           + +PT++DWR KGAVTPIKNQ +C         AA+EGI K+ +GNLI LSEQ+L+DC T
Sbjct: 119 SALPTAVDWRTKGAVTPIKNQGQC---------AAMEGIVKLSTGNLISLSEQELVDCDT 169

Query: 194 NG-NNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSG 252
           +  + GC GG  + AF ++I+N G+ATE  YPY+AV G C    K +AA I  +E+VP  
Sbjct: 170 HSMDEGCEGGWMDSAFEFVIKNGGLATESNYPYKAVDGKCKGGSK-SAATIKGHEDVPVN 228

Query: 253 DEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYW 312
           +E AL+KAV+ QPVS+A+ A    F  Y  G+  G CGT+LDH +  +G+G   DG  YW
Sbjct: 229 NEAALMKAVANQPVSVAVDASDRTFMLYSGGVMTGSCGTELDHGIAAIGYGMESDGTKYW 288

Query: 313 LIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYP 346
           ++KNSWG TWG+ G++++ +D     G+CG+  + SYP
Sbjct: 289 ILKNSWGTTWGEKGFLRMEKDITDKRGMCGLAMKPSYP 326


>gi|356564154|ref|XP_003550321.1| PREDICTED: cysteine proteinase RD21a-like [Glycine max]
          Length = 476

 Score =  315 bits (807), Expect = 2e-83,   Method: Compositional matrix adjust.
 Identities = 159/364 (43%), Positives = 226/364 (62%), Gaps = 27/364 (7%)

Query: 9   GSFKINTTP----------MFIIITLLVSCASQVVSSRSTH---------EQSVVEIHEK 49
           GS  I T+P          +F +  +  +    ++S  S H         E+ ++ ++E+
Sbjct: 2   GSSSITTSPATMTMAAIVLLFTVFAVSSALDMSIISYDSAHADKAATLRTEEELMSMYEQ 61

Query: 50  WMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYT 109
           W+ +HG+ Y    EKE R +IFK+NL +I+  N   +RTYKLG N+F+DLTN+E+RA Y 
Sbjct: 62  WLVKHGKVYNALGEKEKRFQIFKDNLRFIDDHNSAEDRTYKLGLNRFADLTNEEYRAKYL 121

Query: 110 GYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAV 169
           G K+  P+ R   + + +Y       +P S+DWR +GAV P+K+Q  CG CWAF+A+ AV
Sbjct: 122 GTKI-DPNRRLGKTPSNRYAPRVGDKLPDSVDWRKEGAVPPVKDQGGCGSCWAFSAIGAV 180

Query: 170 EGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVP 229
           EGI KI +G LI LSEQ+L+DC T  N GC GG  + AF +II N GI ++++YPY+ V 
Sbjct: 181 EGINKIVTGELISLSEQELVDCDTGYNQGCNGGLMDYAFEFIINNGGIDSDEDYPYRGVD 240

Query: 230 GTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGV 288
           G C   +K A    I +YE+VP+ DE AL KAV+ QPVS+AI     EFQ Y  G+F G 
Sbjct: 241 GRCDTYRKNAKVVSIDDYEDVPAYDELALKKAVANQPVSVAIEGGGREFQLYVSGVFTGR 300

Query: 289 CGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD-----EGLCGIGTRS 343
           CGT LDH V  VG+GT + G +YW+++NSWG++WG+ GY+++ R+      G CGI    
Sbjct: 301 CGTALDHGVVAVGYGTAK-GHDYWIVRNSWGSSWGEDGYIRLERNLANSRSGKCGIAIEP 359

Query: 344 SYPL 347
           SYPL
Sbjct: 360 SYPL 363


>gi|242038089|ref|XP_002466439.1| hypothetical protein SORBIDRAFT_01g007820 [Sorghum bicolor]
 gi|241920293|gb|EER93437.1| hypothetical protein SORBIDRAFT_01g007820 [Sorghum bicolor]
          Length = 353

 Score =  315 bits (807), Expect = 2e-83,   Method: Compositional matrix adjust.
 Identities = 154/311 (49%), Positives = 211/311 (67%), Gaps = 6/311 (1%)

Query: 43  VVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTND 102
           +V  HEKWMA+HGR+Y DE EK  RL+IF+ N E+I+  N  G  +++L TN+F+DLT++
Sbjct: 43  MVSRHEKWMAEHGRTYTDEAEKARRLEIFRANAEFIDSFNDAGKHSHRLATNRFADLTDE 102

Query: 103 EFRALYTGYKMPSPSHRSTTSST-FKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCW 161
           EFRA  TG++       +  S   F+Y+N S+ D   S+DWR  GAVT +K+Q ECGCCW
Sbjct: 103 EFRAARTGFRPRPAPAAAAGSGGRFRYENFSLADAAQSVDWRAMGAVTGVKDQGECGCCW 162

Query: 162 AFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-NNGCLGGSREKAFAYIIQNQGIATE 220
           AF+AVAAVEG+ KIR+G L+ LSEQ+L+DC  NG + GC GG  + AF +I +  G+A+E
Sbjct: 163 AFSAVAAVEGLNKIRTGRLVSLSEQELVDCDVNGEDQGCEGGLMDDAFQFIERRGGLASE 222

Query: 221 DEYPYQAVPGTCSAAQKPAAAK-ISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQS 279
             YPYQ   G+C ++   A A  I  +E+VP  +E AL  AV+ QPVS+AI      F+ 
Sbjct: 223 SGYPYQGDDGSCRSSAAAARAASIRGHEDVPRNNEAALAAAVANQPVSVAINGEDYAFRF 282

Query: 280 YKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKI---VRDEGL 336
           Y  G+  G CGT L+HA+T VG+GT  DG+ YWL+KNSWG +WG+ GY++I   VR EG+
Sbjct: 283 YDSGVLGGECGTDLNHAITAVGYGTAADGSKYWLMKNSWGTSWGEGGYVRIRRGVRGEGV 342

Query: 337 CGIGTRSSYPL 347
           CG+    SYP+
Sbjct: 343 CGLAKLPSYPV 353


>gi|226504984|ref|NP_001151293.1| cysteine protease 1 precursor [Zea mays]
 gi|195645596|gb|ACG42266.1| cysteine protease 1 precursor [Zea mays]
          Length = 340

 Score =  315 bits (806), Expect = 3e-83,   Method: Compositional matrix adjust.
 Identities = 156/338 (46%), Positives = 218/338 (64%), Gaps = 12/338 (3%)

Query: 19  FIIITLLVS----CASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKEN 74
           F++  L+V     C +    + +    ++   HEKWMA+HGR+YKDE EK  RL++F+ N
Sbjct: 6   FLLAVLVVGSAVLCTAAAPRALAAAAAAMASRHEKWMAEHGRAYKDEAEKARRLEVFRAN 65

Query: 75  LEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYK-MPSPSHRSTTSSTFKYQNLSM 133
            E I+  N  G  +++L TN+F+DLT  EFRA  TG +  P+PS     +  F+Y+N S+
Sbjct: 66  AELIDSFNAAGTHSHRLATNRFADLTVQEFRAARTGLRPRPAPS---AGAGRFRYENFSL 122

Query: 134 TDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCST 193
            D   S+DWR  GAVT +K+Q   GCCWAF+AVAAVEG+ KIR+G L+ LSEQ+L+DC  
Sbjct: 123 ADAAQSVDWRAMGAVTGVKDQGASGCCWAFSAVAAVEGLNKIRTGRLVSLSEQELVDCDV 182

Query: 194 NG-NNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSG 252
           +G + GC GG  + AF ++ +  G+A+E  YPYQ   G C ++   AAA I  +E+VP  
Sbjct: 183 SGVDQGCDGGLMDNAFQFVARRGGLASESGYPYQCRDGPCRSSAAAAAASIRGHEDVPRN 242

Query: 253 DEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYW 312
           +E AL  AV+ QPVS+AI      F+ Y  G+  G CGT L+HA+T VG+GT  DG  YW
Sbjct: 243 NEAALAAAVAHQPVSVAINGEDMAFRFYDSGVLGGACGTDLNHAITAVGYGTAADGTRYW 302

Query: 313 LIKNSWGNTWGDAGYMKI---VRDEGLCGIGTRSSYPL 347
           L+KNSWG +WG+ GY++I   VR EG+CG+    SYP+
Sbjct: 303 LMKNSWGASWGEGGYVRIRRGVRGEGVCGLAKLPSYPV 340


>gi|449500145|ref|XP_004161017.1| PREDICTED: xylem cysteine proteinase 1-like [Cucumis sativus]
          Length = 349

 Score =  315 bits (806), Expect = 3e-83,   Method: Compositional matrix adjust.
 Identities = 158/335 (47%), Positives = 224/335 (66%), Gaps = 14/335 (4%)

Query: 20  IIITLLVSCASQVVSSRSTHEQSV---VEIHEKWMAQHGRSYKDELEKEMRLKIFKENLE 76
           + IT  ++    +V     H  S+   +E+ E WM++H ++Y+   EK  R +IF +NL+
Sbjct: 17  LFITYAIAHDFSIVGYSPEHLASMDKTIELFESWMSKHSKTYRSIEEKLHRFEIFLDNLK 76

Query: 77  YIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDV 136
           +I++ NK+ + +Y LG N+F+DL+++EF++ Y G ++  P  RS  S  F Y ++   D+
Sbjct: 77  HIDETNKKVS-SYWLGLNEFADLSHEEFKSKYLGLRVEFPRKRS--SRGFSYGDVE--DL 131

Query: 137 PTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGN 196
           P S+DWR KGAVTP+KNQ  CG CWAF+ VAAVEGI +I +GNL  LSEQ+L+DC  + N
Sbjct: 132 PESVDWRTKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVTGNLTSLSEQELIDCDRSFN 191

Query: 197 NGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTC-SAAQKPAAAKISNYEEVPSGDEQ 255
           NGC GG  + AF YI+ N G+  E++YPY    G C    ++     IS YE+VP+ DEQ
Sbjct: 192 NGCYGGLMDYAFQYIMSNSGLRKEEDYPYLMEEGRCIREKEQFEVVTISGYEDVPANDEQ 251

Query: 256 ALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIK 315
           +LLKA+S QPVS+AI A S  FQ YK GIF G CGTQ+DH VT VG+G++E G +Y ++K
Sbjct: 252 SLLKALSHQPVSVAIEASSRNFQFYKGGIFTGRCGTQMDHGVTAVGYGSSE-GTDYIIVK 310

Query: 316 NSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYP 346
           NSWG  WG+ GY+++ R+    EGLCGI   +SYP
Sbjct: 311 NSWGPKWGENGYIRMKRNTGKPEGLCGINQMASYP 345


>gi|413933049|gb|AFW67600.1| cysteine protease 1 [Zea mays]
          Length = 341

 Score =  314 bits (805), Expect = 3e-83,   Method: Compositional matrix adjust.
 Identities = 156/338 (46%), Positives = 219/338 (64%), Gaps = 11/338 (3%)

Query: 19  FIIITLLVS----CASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKEN 74
           F++  L+V     C +    + +    ++   HEKWMA+HGR+YKDE EK  RL++F+ N
Sbjct: 6   FLLAVLVVGSAVLCTAAAPRALAAAAAAMASRHEKWMAEHGRAYKDEAEKARRLEVFRAN 65

Query: 75  LEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMT 134
            E I+  N  G  +++L TN+F+DLT +EFRA  TG + P P+  S  +  F+Y+N S+ 
Sbjct: 66  AELIDSFNAAGTHSHRLATNRFADLTVEEFRAARTGLR-PRPAP-SAGAGRFRYENFSLA 123

Query: 135 DVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN 194
           D   S+DWR  GAVT +K+Q  CGCCWAF+AVAAVEG+ KIR+G L+ LSEQ+L+DC  +
Sbjct: 124 DAAQSVDWRAMGAVTGVKDQGACGCCWAFSAVAAVEGLNKIRTGRLVSLSEQELVDCDVS 183

Query: 195 G-NNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAK-ISNYEEVPSG 252
           G + GC GG  + AF ++ +  G+A+E  YPYQ   G C ++   A A  I  +E+VP  
Sbjct: 184 GVDQGCDGGLMDNAFQFVARRGGLASESGYPYQGRDGPCRSSAAAARAASIRGHEDVPRN 243

Query: 253 DEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYW 312
           +E AL  AV+ QPVS+AI      F+ Y  G+  G CGT L+HA+T VG+GT  DG  YW
Sbjct: 244 NEAALAAAVANQPVSVAINGEDMAFRFYDSGVLGGACGTDLNHAITAVGYGTANDGTRYW 303

Query: 313 LIKNSWGNTWGDAGYMKI---VRDEGLCGIGTRSSYPL 347
           L+KNSWG +WG+ GY++I   VR EG+CG+    SYP+
Sbjct: 304 LMKNSWGASWGEGGYVRIRRGVRGEGVCGLAKLPSYPV 341


>gi|4731374|gb|AAD28477.1|AF133839_1 papain-like cysteine protease [Sandersonia aurantiaca]
          Length = 357

 Score =  314 bits (805), Expect = 3e-83,   Method: Compositional matrix adjust.
 Identities = 155/343 (45%), Positives = 218/343 (63%), Gaps = 18/343 (5%)

Query: 17  PMFIIITLLV-SCASQVVSSRSTH-EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKEN 74
           P+ +++ L   S  S  +  +    E S+  ++E+W + H  S +D  +K+ R  +FKEN
Sbjct: 6   PVLLVLALAFGSTLSIPIKEKDLESEDSLWSLYERWRSHHAVS-RDLDQKQKRFNVFKEN 64

Query: 75  LEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYK------MPSPSHRSTTSSTFKY 128
           +++I + NK  + T+KL  N+F D+TN EFRA Y G K      M    H S + + F Y
Sbjct: 65  VKFIHEFNKNKDVTFKLALNKFGDMTNQEFRAKYAGSKVHHHRTMKGSRHGSGSGAKFMY 124

Query: 129 QNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQL 188
           +N      P S+DWR++GAV  +KNQ +CG CWAF+A+AAVEGI +I +  L+ LSEQ+L
Sbjct: 125 ENAV---APPSIDWRERGAVAAVKNQGQCGSCWAFSAIAAVEGINQIVTKELVPLSEQEL 181

Query: 189 LDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEE 248
           +DC T+ N GC GG  + AF +I  N GI TED YPYQA   TC   +   A  I  YE+
Sbjct: 182 IDCDTDQNQGCSGGLMDYAFEFIKNNGGITTEDVYPYQAEDATCK--KNSPAVVIDGYED 239

Query: 249 VPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDG 308
           VP+ DE AL+KAV+ QPV++AI A    FQ Y EG+F G CGT+LDH V +VG+GTT+DG
Sbjct: 240 VPTNDEDALMKAVANQPVAVAIEASGYVFQFYSEGVFTGRCGTELDHGVAVVGYGTTQDG 299

Query: 309 ANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPL 347
             YW ++NSWG  WG++GY+++ R      GLCGI  ++SYP+
Sbjct: 300 TKYWTVRNSWGADWGESGYVRMQRGIKATHGLCGIAMQASYPI 342


>gi|356553978|ref|XP_003545327.1| PREDICTED: cysteine proteinase RD21a-like [Glycine max]
          Length = 496

 Score =  314 bits (804), Expect = 4e-83,   Method: Compositional matrix adjust.
 Identities = 154/324 (47%), Positives = 214/324 (66%), Gaps = 10/324 (3%)

Query: 30  SQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTY 89
           +   +SRS  E  ++ ++E+W+ +HG+ Y    EKE R +IFK+NL +I+  N + +RTY
Sbjct: 64  AHAATSRSDEE--LMSMYEQWLVKHGKVYNALGEKEKRFQIFKDNLRFIDDHNSQEDRTY 121

Query: 90  KLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVT 149
           KLG N+F+DLTN+E+RA Y G K+  P+ R   + + +Y       +P S+DWR +GAV 
Sbjct: 122 KLGLNRFADLTNEEYRAKYLGTKI-DPNRRLGKTPSNRYAPRVGDKLPESVDWRKEGAVP 180

Query: 150 PIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFA 209
           P+K+Q  CG CWAF+A+ AVEGI KI +G LI LSEQ+L+DC T  N GC GG  + AF 
Sbjct: 181 PVKDQGGCGSCWAFSAIGAVEGINKIVTGELISLSEQELVDCDTGYNEGCNGGLMDYAFE 240

Query: 210 YIIQNQGIATEDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVSI 268
           +II N GI +E++YPY+ V G C   +K A    I +YE+VP+ DE AL KAV+ QPVS+
Sbjct: 241 FIINNGGIDSEEDYPYRGVDGRCDTYRKNAKVVSIDDYEDVPAYDELALKKAVANQPVSV 300

Query: 269 AIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYM 328
           AI     EFQ Y  G+F G CGT LDH V  VG+GT  +G +YW+++NSWG +WG+ GY+
Sbjct: 301 AIEGGGREFQLYVSGVFTGRCGTALDHGVVAVGYGTA-NGHDYWIVRNSWGPSWGEDGYI 359

Query: 329 KIVRD-----EGLCGIGTRSSYPL 347
           ++ R+      G CGI    SYPL
Sbjct: 360 RLERNLANSRSGKCGIAIEPSYPL 383


>gi|224056176|ref|XP_002298740.1| predicted protein [Populus trichocarpa]
 gi|222845998|gb|EEE83545.1| predicted protein [Populus trichocarpa]
          Length = 455

 Score =  313 bits (803), Expect = 5e-83,   Method: Compositional matrix adjust.
 Identities = 150/313 (47%), Positives = 207/313 (66%), Gaps = 7/313 (2%)

Query: 40  EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDL 99
           E     I+E W+ +HGR+Y    EKE R +IFK+NL++I++ N  GN +YKLG N+F+DL
Sbjct: 18  EAETRRIYEMWLVKHGRAYNALGEKERRFEIFKDNLKFIDEHNSVGNPSYKLGLNKFADL 77

Query: 100 TNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGC 159
           +NDE+R++Y G +M           + +Y      D+P ++DWR+KGAV P+K+Q +CG 
Sbjct: 78  SNDEYRSVYLGTRMDGKGRLLGGPKSERYLFKEGDDLPETVDWREKGAVAPVKDQGQCGS 137

Query: 160 CWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIAT 219
           CWAF+ V AVEGI +I +GNL  LSEQ+L+DC    N GC GG  + AF +II+N GI T
Sbjct: 138 CWAFSTVGAVEGINQIVTGNLTSLSEQELVDCDKTYNLGCNGGLMDYAFDFIIENGGIDT 197

Query: 220 EDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQ 278
           E++YPY+A+   C   +K A    I  YE+VP  DE++L KAV+ QPVS+AI A    FQ
Sbjct: 198 EEDYPYKAIDSMCDPNRKNARVVTIDGYEDVPQNDEKSLKKAVANQPVSVAIEAGGRGFQ 257

Query: 279 SYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----- 333
            Y+ G+F G CGTQLDH V  VG+G TE G +YW+++NSWG  WG+ GY+++ RD     
Sbjct: 258 LYQSGVFTGSCGTQLDHGVVTVGYG-TEHGVDYWIVRNSWGPAWGENGYIRMERDVASTE 316

Query: 334 EGLCGIGTRSSYP 346
            G CGI   +SYP
Sbjct: 317 TGKCGIAMEASYP 329


>gi|357474523|ref|XP_003607546.1| Cysteine proteinase [Medicago truncatula]
 gi|358347207|ref|XP_003637651.1| Cysteine proteinase [Medicago truncatula]
 gi|355503586|gb|AES84789.1| Cysteine proteinase [Medicago truncatula]
 gi|355508601|gb|AES89743.1| Cysteine proteinase [Medicago truncatula]
          Length = 345

 Score =  313 bits (803), Expect = 5e-83,   Method: Compositional matrix adjust.
 Identities = 168/351 (47%), Positives = 230/351 (65%), Gaps = 27/351 (7%)

Query: 13  INTTPMFIIITLLVSCASQVVSSRS----THEQ--SVVEIHEK----WMAQHGRSYKDEL 62
           I TT +FI++ L  +C   V++S S    TH+Q  S VE  +K    W+ +HGR YK   
Sbjct: 5   ILTTTIFILLMLCNTC---VIASESECPPTHKQKSSDVEAMKKRFDGWVKRHGRKYKHND 61

Query: 63  EKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTT 122
           E+E+R  I++ N++YI+  N + N +Y L  N+F+DLTN+EF++ Y G      SH    
Sbjct: 62  EREVRFGIYQANVQYIQCKNAQKN-SYNLTDNKFADLTNEEFQSTYMGLSTRLRSH---- 116

Query: 123 SSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQ 182
           ++ F+Y      D+P S DWR +GAVT I +Q +CG CWAFAAVAAVEGI KI+SG LI 
Sbjct: 117 NTGFRYDEHG--DLPESKDWRKEGAVTEIMDQGQCGGCWAFAAVAAVEGINKIKSGKLIS 174

Query: 183 LSEQQLLDCST-NGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPA-A 240
           LSEQ+L+DC   +GN GC GG  E A+ +II+N G+ TE +YPY+ V GTC   +    A
Sbjct: 175 LSEQELIDCDVKSGNQGCQGGLMETAYTFIIENGGLTTEQDYPYEGVDGTCKMEKAAHYA 234

Query: 241 AKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIV 300
           A IS YEEVP+ +E  L  A + QPVS+AI A    FQ Y EG+F+G+CG QL+H VT+V
Sbjct: 235 ASISGYEEVPADNEAKLKAAAAHQPVSVAIDAGGYSFQFYSEGVFSGICGKQLNHGVTVV 294

Query: 301 GFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPL 347
           G+G  E    YW++KNSWG  WG++GY+++ RD    EG+CGI  ++SYPL
Sbjct: 295 GYG-KETINKYWIVKNSWGADWGESGYIRMKRDTLSKEGMCGIAMQASYPL 344


>gi|449454309|ref|XP_004144898.1| PREDICTED: xylem cysteine proteinase 1-like [Cucumis sativus]
 gi|449471311|ref|XP_004153272.1| PREDICTED: xylem cysteine proteinase 1-like [Cucumis sativus]
          Length = 349

 Score =  313 bits (803), Expect = 6e-83,   Method: Compositional matrix adjust.
 Identities = 158/335 (47%), Positives = 223/335 (66%), Gaps = 14/335 (4%)

Query: 20  IIITLLVSCASQVVSSRSTHEQSV---VEIHEKWMAQHGRSYKDELEKEMRLKIFKENLE 76
           + IT   +    +V     H  S+   +E+ E WM++H ++Y+   EK  R +IF +NL+
Sbjct: 17  LFITYATAHDFSIVGYSPEHLASMDKTIELFESWMSKHSKAYRSIEEKLHRFEIFLDNLK 76

Query: 77  YIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDV 136
           +I++ NK+ + +Y LG N+F+DL+++EF++ Y G ++  P  RS  S  F Y ++   D+
Sbjct: 77  HIDETNKKVS-SYWLGLNEFADLSHEEFKSKYLGLRVEFPRKRS--SRGFSYGDVE--DL 131

Query: 137 PTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGN 196
           P S+DWR KGAVTP+KNQ  CG CWAF+ VAAVEGI +I +GNL  LSEQ+L+DC  + N
Sbjct: 132 PESVDWRTKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVTGNLTSLSEQELIDCDRSFN 191

Query: 197 NGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTC-SAAQKPAAAKISNYEEVPSGDEQ 255
           NGC GG  + AF YI+ N G+  E++YPY    G C    ++     IS YE+VP+ DEQ
Sbjct: 192 NGCYGGLMDYAFQYIMSNSGLRKEEDYPYLMEEGRCIREKEQFEVVTISGYEDVPANDEQ 251

Query: 256 ALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIK 315
           +LLKA+S QPVS+AI A S  FQ YK GIF G CGTQ+DH VT VG+G++E G +Y ++K
Sbjct: 252 SLLKALSHQPVSVAIEASSRNFQFYKGGIFTGRCGTQMDHGVTAVGYGSSE-GTDYIIVK 310

Query: 316 NSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYP 346
           NSWG  WG+ GY+++ R+    EGLCGI   +SYP
Sbjct: 311 NSWGPKWGENGYIRMKRNTGKPEGLCGINQMASYP 345


>gi|242070333|ref|XP_002450443.1| hypothetical protein SORBIDRAFT_05g005530 [Sorghum bicolor]
 gi|241936286|gb|EES09431.1| hypothetical protein SORBIDRAFT_05g005530 [Sorghum bicolor]
          Length = 351

 Score =  313 bits (803), Expect = 7e-83,   Method: Compositional matrix adjust.
 Identities = 166/355 (46%), Positives = 231/355 (65%), Gaps = 22/355 (6%)

Query: 10  SFKINTTPM------FIIITLLVSCASQVVSSRSTH------EQSVVEIHEKWMAQHGRS 57
           S+  N  P+       + +  + +C    V++R         E+++   HEKWM +HGR+
Sbjct: 3   SYIANNKPLITAAVALLTVLAIANCIGCAVAARDLSSSTGYGEEAMTARHEKWMVEHGRT 62

Query: 58  YKDELEKEMRLKIFKENLEYIEKANKE-GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSP 116
           YKDE EK  R ++FK N  +++ +N   G + Y L  N+F+D+T+DEF A YTG+K P P
Sbjct: 63  YKDEAEKARRFQVFKANAAFVDTSNAAAGGKKYHLAINRFADMTHDEFMARYTGFK-PLP 121

Query: 117 SHRSTTSSTFKYQNLSMT-DVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKI 175
           +        FKY N++++ +   ++DWR KGAVT +KNQ++CGCCWAF+AVAA+EG+ +I
Sbjct: 122 ATGKKMPG-FKYANVTLSSEDQQAVDWRKKGAVTDVKNQQKCGCCWAFSAVAAIEGMHQI 180

Query: 176 RSGNLIQLSEQQLLDCST-NGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSA 234
            +G L+ LSEQQL+DCST   NNGC GG+ E AF Y+I N GIATE  YPY A+ G C  
Sbjct: 181 NTGELVSLSEQQLVDCSTNGNNNGCGGGTMEDAFQYVIGNNGIATEAAYPYTAMQGMCQN 240

Query: 235 AQKPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNG-VCGTQL 293
            Q PA A + +Y++VP  DE AL  AV+ QPVS+A+ A    FQ YK G+     CGT L
Sbjct: 241 VQ-PAVA-VRSYQQVPRDDEDALAAAVAGQPVSVAVDA--NNFQFYKGGVMTADSCGTNL 296

Query: 294 DHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRDEGLCGIGTRSSYPLA 348
           +HAVT VG+GT EDG  YWL+KN WG+TWG+ GY+++ R  G CG+   +SYP+A
Sbjct: 297 NHAVTAVGYGTAEDGTPYWLLKNQWGSTWGEEGYLRLQRGVGACGVAKDASYPVA 351


>gi|350538043|ref|NP_001234324.1| cysteine protease TDI-65 precursor [Solanum lycopersicum]
 gi|5726641|gb|AAD48496.1|AF172856_1 cysteine protease TDI-65 [Solanum lycopersicum]
 gi|2828252|emb|CAA05894.1| CYP1 [Solanum lycopersicum]
          Length = 466

 Score =  313 bits (802), Expect = 8e-83,   Method: Compositional matrix adjust.
 Identities = 153/351 (43%), Positives = 223/351 (63%), Gaps = 11/351 (3%)

Query: 7   RSGSFKINTTPMFIIITLLVSCASQVVSSRSTH-----EQSVVEIHEKWMAQHGRSYKDE 61
            S +  I+   M I  TL  +    ++S   TH     +  V  ++E W+ +HG+SY   
Sbjct: 4   HSSTLTISILLMLIFSTLSSASDMSIISYDETHIHRRTDDEVSALYESWLIEHGKSYNAL 63

Query: 62  LEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRST 121
            EK+ R +IFK+NL YI++ N   N++YKLG  +F+DLTN+E+R++Y G K      + +
Sbjct: 64  GEKDKRFQIFKDNLRYIDEQNSVPNQSYKLGLTKFADLTNEEYRSIYLGTKSSGDRKKLS 123

Query: 122 TSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLI 181
            + + +Y       +P S+DWR+KG +  +K+Q  CG CWAF+AVAA+E I  I +GNLI
Sbjct: 124 KNKSDRYLPKVGDSLPESIDWREKGVLVGVKDQGSCGSCWAFSAVAAMESINAIVTGNLI 183

Query: 182 QLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPA-A 240
            LSEQ+L+DC  + N GC GG  + AF ++I+N GI TE++YPY+   G C   +K A  
Sbjct: 184 SLSEQELVDCDRSYNEGCDGGLMDYAFEFVIKNGGIDTEEDYPYKERNGVCDQYRKNAKV 243

Query: 241 AKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIV 300
            KI +YE+VP  +E+AL KAV+ QPVSIA+ A   +FQ YK GIF G CGT +DH V I 
Sbjct: 244 VKIDSYEDVPVNNEKALQKAVAHQPVSIALEAGGRDFQHYKSGIFTGKCGTAVDHGVVIA 303

Query: 301 GFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPL 347
           G+G TE+G +YW+++NSWG  WG+ GY+++ R+     GLCG+    SYP+
Sbjct: 304 GYG-TENGMDYWIVRNSWGANWGENGYLRVQRNVASSSGLCGLAIEPSYPV 353


>gi|115461667|ref|NP_001054433.1| Os05g0108600 [Oryza sativa Japonica Group]
 gi|14719319|gb|AAK73137.1|AC079022_10 putative cysteine proteinase [Oryza sativa]
 gi|33151125|gb|AAP97431.1| cysteine protease CP1 [Oryza sativa]
 gi|52353572|gb|AAU44138.1| cysteine proteinase CP1 [Oryza sativa Japonica Group]
 gi|113577984|dbj|BAF16347.1| Os05g0108600 [Oryza sativa Japonica Group]
 gi|125550541|gb|EAY96250.1| hypothetical protein OsI_18148 [Oryza sativa Indica Group]
          Length = 358

 Score =  313 bits (801), Expect = 1e-82,   Method: Compositional matrix adjust.
 Identities = 154/317 (48%), Positives = 216/317 (68%), Gaps = 11/317 (3%)

Query: 37  STHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQF 96
           ++H++ ++E+ EKW+A++ ++Y    EK  R ++FK+NL +I+  NK+   +Y LG N+F
Sbjct: 42  ASHDR-LIELFEKWVAKYRKAYASFEEKVRRFEVFKDNLNHIDDINKK-VTSYWLGLNEF 99

Query: 97  SDLTNDEFRALYTGYKMPSPSHRST---TSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKN 153
           +DLT+DEF+A Y G   P P+  ++   +S  F+Y  +S  +VP  +DWR K AVT +KN
Sbjct: 100 ADLTHDEFKATYLGL-TPPPTRSNSKHYSSEEFRYGKMSNGEVPKEMDWRKKNAVTEVKN 158

Query: 154 QKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQ 213
           Q +CG CWAF+ VAAVEGI  I +GNL  LSEQ+L+DCST+GNNGC GG  + AF+YI  
Sbjct: 159 QGQCGSCWAFSTVAAVEGINAIVTGNLTSLSEQELIDCSTDGNNGCNGGLMDYAFSYIAS 218

Query: 214 NQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAY 273
             G+ TE+ YPY    G C   +  A   IS YE+VP+ DEQAL+KA++ QPVS+AI A 
Sbjct: 219 TGGLRTEEAYPYAMEEGDCDEGKGAAVVTISGYEDVPANDEQALVKALAHQPVSVAIEAS 278

Query: 274 STEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR- 332
              FQ Y  G+F+G CG QLDH VT VG+GT++ G +Y ++KNSWG  WG+ GY+++ R 
Sbjct: 279 GRHFQFYSGGVFDGPCGEQLDHGVTAVGYGTSK-GQDYIIVKNSWGPHWGEKGYIRMKRG 337

Query: 333 ---DEGLCGIGTRSSYP 346
               EGLCGI   +SYP
Sbjct: 338 TGKGEGLCGINKMASYP 354


>gi|224133760|ref|XP_002321654.1| predicted protein [Populus trichocarpa]
 gi|222868650|gb|EEF05781.1| predicted protein [Populus trichocarpa]
          Length = 362

 Score =  313 bits (801), Expect = 1e-82,   Method: Compositional matrix adjust.
 Identities = 155/341 (45%), Positives = 216/341 (63%), Gaps = 14/341 (4%)

Query: 19  FIIITLLVSCASQVVSSRSTHE------QSVVEIHEKWMAQHGRSYKDELEKEMRLKIFK 72
           F+ + L ++    +  S   HE      +S+ +++E+W + H  S   + EK  R  +FK
Sbjct: 6   FLFVALSLALVLGITESLDFHEKDLESEESLWDLYERWRSHHTVSTSLD-EKHKRFNVFK 64

Query: 73  ENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPS-HRSTTSSTFKYQNL 131
           EN+ ++ K NK G + YKL  N+F+D+TN EFR++Y G K+      R TT     +   
Sbjct: 65  ENVMHVHKTNKMG-KPYKLKLNKFADMTNHEFRSVYAGSKVKHHRMFRGTTRGNGSFMYG 123

Query: 132 SMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDC 191
            +  VPTS+DWR KGAVT +K+Q +CG CWAF+ + AVEGI  I++  L+ LSEQ+L+DC
Sbjct: 124 KVEKVPTSVDWRKKGAVTAVKDQGQCGSCWAFSTIVAVEGINYIKTNELVSLSEQELVDC 183

Query: 192 STNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQK-PAAAKISNYEEVP 250
            T  N GC GG  E AF +I + +GI TE  YPY+A  G C AA++   A  I  YE+VP
Sbjct: 184 DTTENQGCNGGLMEYAFEFIKKKRGITTESTYPYKAEDGHCDAAKENNPAVSIDGYEKVP 243

Query: 251 SGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGAN 310
             DE ALLKA + QPVS+AI A  ++FQ Y EG+F G CGT+LDH V +VG+GTT DG  
Sbjct: 244 ENDEDALLKAAANQPVSVAIDAGGSDFQFYSEGVFIGECGTELDHGVAVVGYGTTLDGTK 303

Query: 311 YWLIKNSWGNTWGDAGYMKIVR----DEGLCGIGTRSSYPL 347
           YW+++NSWG  WG+ GY+++ R     EGLCGI   +SYP+
Sbjct: 304 YWIVRNSWGPEWGEKGYIRMQRGISDKEGLCGIAMEASYPI 344


>gi|30141027|dbj|BAC75927.1| cysteine protease-5 [Helianthus annuus]
          Length = 365

 Score =  312 bits (800), Expect = 1e-82,   Method: Compositional matrix adjust.
 Identities = 147/326 (45%), Positives = 214/326 (65%), Gaps = 14/326 (4%)

Query: 33  VSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLG 92
           V+S +  ++ V   +E W+A+HG++Y    EKE R +IF +NL++I++ N  GNR+YK+G
Sbjct: 22  VTSNTRTDEEVRNTYELWLARHGKTYNALGEKESRFRIFADNLKFIDEHNLSGNRSYKVG 81

Query: 93  TNQFSDLTNDEFRALYTG-----YKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGA 147
            NQF+DLTN+E+R++Y G     Y+  +   R   S  +  Q   M   P  +DWR++GA
Sbjct: 82  LNQFADLTNEEYRSMYLGTKVDPYRRIAKMQRGEISRRYAVQENEM--FPAKVDWRERGA 139

Query: 148 VTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKA 207
           V+P+KNQ  CG CWAF+ VA+VEGI KI +G+LI LSEQ+L+DC    N+GC GGS + A
Sbjct: 140 VSPVKNQGGCGSCWAFSTVASVEGINKIVTGDLISLSEQELVDCDNKYNSGCNGGSMDYA 199

Query: 208 FAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAA-AKISNYEEVPSGDEQALLKAVSMQPV 266
           F +I+ N GI +E +YPY+ V   C   +  A    I  YE+VP  +E+AL+KAV+ QPV
Sbjct: 200 FQFIVSNGGIDSESDYPYKGVGAVCDPVRNKAKIVSIDGYEDVPPMNEKALMKAVAHQPV 259

Query: 267 SIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAG 326
           S+ I A    FQ Y  G+  G CGT LDH V +VG+G +E+G +YW+++NSWG  WG+ G
Sbjct: 260 SVGIEASGRAFQLYTSGVLTGSCGTNLDHGVVVVGYG-SENGKDYWIVRNSWGPEWGEDG 318

Query: 327 YMKIVRDE-----GLCGIGTRSSYPL 347
           Y+++ R+      G+CGI   +SYP+
Sbjct: 319 YIRMERNMVDTPVGMCGITLMASYPI 344


>gi|242092702|ref|XP_002436841.1| hypothetical protein SORBIDRAFT_10g009840 [Sorghum bicolor]
 gi|241915064|gb|EER88208.1| hypothetical protein SORBIDRAFT_10g009840 [Sorghum bicolor]
          Length = 328

 Score =  312 bits (799), Expect = 2e-82,   Method: Compositional matrix adjust.
 Identities = 152/336 (45%), Positives = 210/336 (62%), Gaps = 24/336 (7%)

Query: 18  MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
           +  I+ L   C + + +     + ++V  HE+WM Q+ R YKD  EK  R ++FK N+++
Sbjct: 8   ILAILGLAFFCGAALAARDLNDDSAMVARHEQWMVQYSRVYKDTTEKARRFEVFKANVKF 67

Query: 78  IEKANKEGNRTYKLGTNQFSDLTNDEFRALYT--GYKMPSPSHRSTTSSTFKYQNLSMTD 135
           IE  N  GNR + LG NQF+DLTNDEFRA  T  G+K PSP   ST    F+Y+N+S+  
Sbjct: 68  IESFNAGGNRKFWLGVNQFADLTNDEFRATKTNKGFK-PSPVKVSTG---FRYENVSVDA 123

Query: 136 VPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG 195
           +P ++DWR KGAVTPIK+Q +C            EGI KI +G LI LSEQ+L+DC  +G
Sbjct: 124 LPATIDWRTKGAVTPIKDQGQC------------EGIVKISTGKLISLSEQELVDCDVHG 171

Query: 196 -NNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDE 254
            + GC GG  + AF +II+N G+ TE  YPY A  G C +    +AA +  +E+VP+ DE
Sbjct: 172 EDQGCEGGLMDDAFKFIIKNGGLTTESSYPYTAADGKCKSGSN-SAATVKGFEDVPANDE 230

Query: 255 QALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLI 314
            AL+KAV+ QPVS+A+      FQ Y  G+  G CGT LDH +  +G+G T DG  YWL+
Sbjct: 231 AALMKAVANQPVSVAVDGGDMTFQFYSGGVMTGSCGTDLDHGIAAIGYGQTSDGTKYWLL 290

Query: 315 KNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYP 346
           KNSWG TWG+ GY+++ +D     G+CG+    SYP
Sbjct: 291 KNSWGTTWGENGYLRMEKDISDKRGMCGLAMEPSYP 326


>gi|1173630|gb|AAB37233.1| cysteine proteinase [Phalaenopsis sp. SM9108]
          Length = 359

 Score =  312 bits (799), Expect = 2e-82,   Method: Compositional matrix adjust.
 Identities = 151/341 (44%), Positives = 215/341 (63%), Gaps = 12/341 (3%)

Query: 18  MFIIITLLVSCASQVVSSRSTH---EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKEN 74
           + ++ + L S A+  +         E S+  ++E+W + H  S +D  EK+ R  +FKEN
Sbjct: 6   LILVASFLASVAATAIDIADKDLETEDSLWNLYERWRSHHTVS-RDLDEKQKRFNVFKEN 64

Query: 75  LEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSP-----SHRSTTSSTFKYQ 129
             YI   NK  +  YKL  N+F+DLTN EFR+ Y G ++        S R   +++F YQ
Sbjct: 65  PRYIHDFNKRKDIPYKLRLNKFADLTNHEFRSTYAGSRINHHRSLRGSRRGGATNSFMYQ 124

Query: 130 NLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLL 189
           +L    +P S+DWR KGAVT +K+Q +CG CWAF+ VAAVEGI +I++  L+ LSEQ+L+
Sbjct: 125 SLDSRSLPASIDWRQKGAVTAVKDQGQCGSCWAFSTVAAVEGINQIKTKKLLSLSEQELI 184

Query: 190 DCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEV 249
           DC T+ NNGC GG  + AF +I +N GI++E EYPY A    C+  +K     I  +E+V
Sbjct: 185 DCDTDENNGCNGGLMDYAFDFIKKNGGISSEAEYPYAAEDSYCATEKKSHVVSIDGHEDV 244

Query: 250 PSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGA 309
           P+ DE +LLKAV+ QPVSIAI A   +FQ Y EG+F G  GT+LDH V IVG+G T+ G 
Sbjct: 245 PANDEDSLLKAVANQPVSIAIEASGYDFQFYSEGVFTGRSGTELDHGVAIVGYGKTQQGT 304

Query: 310 NYWLIKNSWGNTWGDAGYMKI---VRDEGLCGIGTRSSYPL 347
            YW+++NSWG  WG+ GY++I      + LCG+   +SYP+
Sbjct: 305 KYWIVRNSWGAEWGEKGYIRISAASDSKRLCGLAMEASYPI 345


>gi|22093636|dbj|BAC06931.1| putative cysteine proteinase [Oryza sativa Japonica Group]
 gi|50510021|dbj|BAD30633.1| putative cysteine proteinase [Oryza sativa Japonica Group]
          Length = 352

 Score =  312 bits (799), Expect = 2e-82,   Method: Compositional matrix adjust.
 Identities = 156/353 (44%), Positives = 222/353 (62%), Gaps = 21/353 (5%)

Query: 8   SGSFKINTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMR 67
           S   ++    + +++   +S  ++V  + ++   ++   H+KWMA+HGR+YKD  EK  R
Sbjct: 5   SSKLQVMAASLLLVVAGGLSTMAKV--TMASRAGTMEARHDKWMAEHGRTYKDAAEKARR 62

Query: 68  LKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFK 127
            ++FK N++ I+++N  GN+ Y+L TN+F+DLT+ EF A+YTGY   +  + +  ++T  
Sbjct: 63  FRVFKANVDLIDRSNAAGNKRYRLATNRFTDLTDAEFAAMYTGYNPANTMYAAANATT-- 120

Query: 128 YQNLSMTD--VPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSE 185
              LS  D   P  +DWR +GAVT +KNQ+ CGCCWAF+ VAAVEGI +I +G L+ LSE
Sbjct: 121 --RLSSEDDQQPAEVDWRQQGAVTGVKNQRSCGCCWAFSTVAAVEGIHQITTGELVSLSE 178

Query: 186 QQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTC----SAAQKPAAA 241
           QQLLDC+ NG  GC GGS + AF Y+  + G+ TE  Y YQ   G C    S++    AA
Sbjct: 179 QQLLDCADNG--GCTGGSLDNAFQYMANSGGVTTEAAYAYQGAQGACQFDASSSASGVAA 236

Query: 242 KISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNG-VCGTQLDHAVTIV 300
            IS Y+ V   DE +L  AV+ QPVS+AI      F+ Y  G+F    CGT+LDHAV +V
Sbjct: 237 TISGYQRVNPNDEGSLAAAVASQPVSVAIEGSGAMFRHYGSGVFTADSCGTKLDHAVAVV 296

Query: 301 GFGTTEDGA---NYWLIKNSWGNTWGDAGYMKIVRD---EGLCGIGTRSSYPL 347
           G+G   DG+    YW+IKNSWG TWGD GYMK+ +D   +G CG+    SYP+
Sbjct: 297 GYGAEADGSGGGGYWIIKNSWGTTWGDGGYMKLEKDVGSQGACGVAMAPSYPV 349


>gi|18418684|ref|NP_567983.1| Xylem cysteine proteinase 1 [Arabidopsis thaliana]
 gi|71153408|sp|O65493.1|XCP1_ARATH RecName: Full=Xylem cysteine proteinase 1; Short=AtXCP1; Flags:
           Precursor
 gi|6708181|gb|AAF25831.1|AF191027_1 papain-type cysteine endopeptidase XCP1 [Arabidopsis thaliana]
 gi|3080415|emb|CAA18734.1| cysteine proteinase-like protein [Arabidopsis thaliana]
 gi|7270487|emb|CAB80252.1| cysteine proteinase-like protein [Arabidopsis thaliana]
 gi|26449881|dbj|BAC42063.1| putative cysteine proteinase [Arabidopsis thaliana]
 gi|28827736|gb|AAO50712.1| unknown protein [Arabidopsis thaliana]
 gi|332661101|gb|AEE86501.1| Xylem cysteine proteinase 1 [Arabidopsis thaliana]
          Length = 355

 Score =  312 bits (799), Expect = 2e-82,   Method: Compositional matrix adjust.
 Identities = 148/314 (47%), Positives = 215/314 (68%), Gaps = 9/314 (2%)

Query: 38  THEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFS 97
           T+   ++E+ E WM++H ++YK   EK  R ++F+ENL +I++ N E N +Y LG N+F+
Sbjct: 42  TNTDKLLELFESWMSEHSKAYKSVEEKVHRFEVFRENLMHIDQRNNEIN-SYWLGLNEFA 100

Query: 98  DLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKEC 157
           DLT++EF+  Y G   P  S +   S+ F+Y+++  TD+P S+DWR KGAV P+K+Q +C
Sbjct: 101 DLTHEEFKGRYLGLAKPQFSRKRQPSANFRYRDI--TDLPKSVDWRKKGAVAPVKDQGQC 158

Query: 158 GCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGI 217
           G CWAF+ VAAVEGI +I +GNL  LSEQ+L+DC T  N+GC GG  + AF YII   G+
Sbjct: 159 GSCWAFSTVAAVEGINQITTGNLSSLSEQELIDCDTTFNSGCNGGLMDYAFQYIISTGGL 218

Query: 218 ATEDEYPYQAVPGTCSAAQKPAA-AKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTE 276
             ED+YPY    G C   ++      IS YE+VP  D+++L+KA++ QPVS+AI A   +
Sbjct: 219 HKEDDYPYLMEEGICQEQKEDVERVTISGYEDVPENDDESLVKALAHQPVSVAIEASGRD 278

Query: 277 FQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD--- 333
           FQ YK G+FNG CGT LDH V  VG+G+++ G++Y ++KNSWG  WG+ G++++ R+   
Sbjct: 279 FQFYKGGVFNGKCGTDLDHGVAAVGYGSSK-GSDYVIVKNSWGPRWGEKGFIRMKRNTGK 337

Query: 334 -EGLCGIGTRSSYP 346
            EGLCGI   +SYP
Sbjct: 338 PEGLCGINKMASYP 351


>gi|2511689|emb|CAB17074.1| cysteine proteinase precursor [Phaseolus vulgaris]
          Length = 364

 Score =  312 bits (799), Expect = 2e-82,   Method: Compositional matrix adjust.
 Identities = 154/338 (45%), Positives = 216/338 (63%), Gaps = 10/338 (2%)

Query: 17  PMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLE 76
           P  ++++   S A+  +S  +  E  V++++E+W+ +H + Y    EKE R ++FK+NL 
Sbjct: 7   PTLLLLSFTFSHAT-AMSIINYSENEVMDMYEEWLVKHRKVYNGLDEKEKRFQVFKDNLG 65

Query: 77  YIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSST-FKYQNLSMTD 135
           +I+  N + N TY LG N+F+D+TN+E+RA+Y G +  +      T +T  +Y   S   
Sbjct: 66  FIQDHNAQ-NNTYTLGLNKFADITNEEYRAMYLGTRTDAKRRVMKTQNTGHRYAYNSGDQ 124

Query: 136 VPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG 195
           +P  +DWR KGAV PIK+Q  CG CWAF+ VAAVEGI  I +G  + LSEQ+L+DC    
Sbjct: 125 LPVHVDWRLKGAVGPIKDQGNCGSCWAFSTVAAVEGINNIVTGEFVSLSEQELVDCDREY 184

Query: 196 NNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDE 254
           + GC GG  + AF +IIQN GI TE++YPYQ + GTC   +K     +I  YE+VPS +E
Sbjct: 185 DEGCNGGLMDYAFQFIIQNGGIDTEEDYPYQGIDGTCDQTKKKTKVVQIDGYEDVPSNNE 244

Query: 255 QALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLI 314
            AL KAVS QPVS+AI A     Q Y+ G+F G CGT LDH V +VG+G TE+G +YWL+
Sbjct: 245 NALKKAVSHQPVSVAIEASGRALQLYQSGVFTGKCGTALDHGVVVVGYG-TENGVDYWLV 303

Query: 315 KNSWGNTWGDAGYMKIVRD-----EGLCGIGTRSSYPL 347
           +NSWG  WG+ GY K+ R+     EG CGI    SYP+
Sbjct: 304 RNSWGTGWGEDGYFKMERNVRSTSEGKCGIAMDCSYPV 341


>gi|218198967|gb|EEC81394.1| hypothetical protein OsI_24614 [Oryza sativa Indica Group]
          Length = 342

 Score =  311 bits (798), Expect = 2e-82,   Method: Compositional matrix adjust.
 Identities = 157/343 (45%), Positives = 218/343 (63%), Gaps = 22/343 (6%)

Query: 18  MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
           + ++   L + A   ++SR+   ++    H+KWMA+HGR+YKD  EK  R ++FK N++ 
Sbjct: 6   LLVVAGGLSTMAKVTMASRAGTMEAR---HDKWMAEHGRTYKDAAEKARRFRVFKANVDL 62

Query: 78  IEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTD-- 135
           I+++N  GN+ Y+L TN+F+DLT+ EF A+YTGY   +  + +  ++T     LS  D  
Sbjct: 63  IDRSNAAGNKRYRLATNRFTDLTDAEFAAMYTGYNPANTMYAAANATT----RLSSEDDQ 118

Query: 136 VPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG 195
            P  +DWR +GAVT +KNQ+ CGCCWAF+ VAAVEGI +I +G L+ LSEQQLLDC+ NG
Sbjct: 119 QPAEVDWRQQGAVTGVKNQRSCGCCWAFSTVAAVEGIHQITTGELVSLSEQQLLDCADNG 178

Query: 196 NNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTC----SAAQKPAAAKISNYEEVPS 251
             GC GGS + AF Y+  + G+ TE  Y YQ   G C    S++    AA IS Y+ V  
Sbjct: 179 --GCTGGSLDNAFQYMANSGGVTTEAAYAYQGAQGACQFDASSSASGVAATISGYQRVNP 236

Query: 252 GDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNG-VCGTQLDHAVTIVGFGTTEDGA- 309
            DE +L  AV+ QPVS+AI      F+ Y  G+F    CGT+LDHAV +VG+G   DG+ 
Sbjct: 237 NDEGSLAAAVASQPVSVAIEGSGAMFRHYGSGVFTADSCGTKLDHAVAVVGYGAEADGSG 296

Query: 310 --NYWLIKNSWGNTWGDAGYMKIVRD---EGLCGIGTRSSYPL 347
              YW+IKNSWG TWGD GYMK+ +D   +G CG+    SYP+
Sbjct: 297 GGGYWIIKNSWGTTWGDGGYMKLEKDVGSQGACGVAMAPSYPV 339


>gi|124484387|dbj|BAF46304.1| cysteine proteinase precursor [Ipomoea nil]
          Length = 474

 Score =  311 bits (798), Expect = 2e-82,   Method: Compositional matrix adjust.
 Identities = 151/313 (48%), Positives = 211/313 (67%), Gaps = 8/313 (2%)

Query: 40  EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDL 99
           E  V E+ E W+ +HG+SY    EK+ R KIF++NL+YI++ N   NR+YKLG N+F+D+
Sbjct: 43  EDEVKEMFESWLVKHGKSYNAVDEKDKRFKIFRDNLKYIDEKNSLENRSYKLGLNRFADI 102

Query: 100 TNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGC 159
           TN+E+R  Y G K  + S     S + +Y  ++   +P S+DWR+KGAVT +K+Q  CG 
Sbjct: 103 TNEEYRTGYLGAKRDA-SRNMVKSKSDRYAPVAGDSLPDSIDWREKGAVTGVKDQGSCGS 161

Query: 160 CWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIAT 219
           CWAF+ +AAVEG+ ++ +GNLI LSEQ+L+DC    N GC GG    AF +II+N GI +
Sbjct: 162 CWAFSTIAAVEGVNQLATGNLISLSEQELVDCDRKINQGCNGGDMGYAFQFIIKNGGIDS 221

Query: 220 EDEYPYQAVPGTCSAAQKPAA--AKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEF 277
           E++YPY    G C + ++  A  A I  YEEVP  +E++L KAV+ QPVS+AI A   +F
Sbjct: 222 EEDYPYTGKDGKCDSYRQNNAKVASIDGYEEVPVNNEKSLQKAVANQPVSVAIEAGGYDF 281

Query: 278 QSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD---- 333
           Q Y  GIF G CGT LDH V  VG+G TE+G +YW++KNSWG+ WG+ GY+++ R+    
Sbjct: 282 QLYSSGIFTGSCGTDLDHGVAAVGYG-TENGVDYWIVKNSWGDYWGEKGYVRMQRNVKAK 340

Query: 334 EGLCGIGTRSSYP 346
            GLCGI   +SYP
Sbjct: 341 TGLCGIAMEASYP 353


>gi|302816222|ref|XP_002989790.1| hypothetical protein SELMODRAFT_184826 [Selaginella moellendorffii]
 gi|300142356|gb|EFJ09057.1| hypothetical protein SELMODRAFT_184826 [Selaginella moellendorffii]
          Length = 358

 Score =  311 bits (798), Expect = 2e-82,   Method: Compositional matrix adjust.
 Identities = 150/317 (47%), Positives = 213/317 (67%), Gaps = 14/317 (4%)

Query: 40  EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDL 99
           ++S   ++EKWM  HGR Y    EKE R +IF++N EYIE+ N++ N+TY LG N F+D+
Sbjct: 27  DRSFRALYEKWMVDHGRVYNGIGEKERRFQIFRDNAEYIEEHNRQVNQTYWLGLNNFADM 86

Query: 100 TNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGC 159
           T+DEF+ALY G K+P  +   T  S F+Y++   T++P   DWR KGAV  +KNQ  CG 
Sbjct: 87  THDEFKALYFGTKVPLSN---TIKSGFRYKD--ATNLPLDTDWRSKGAVATVKNQGACGS 141

Query: 160 CWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIAT 219
           CWAF+ VAAVEG+ +I +G L+ LSEQ+L+DC    N GC GG  + AF +IIQN G+ +
Sbjct: 142 CWAFSTVAAVEGVNQIVTGELVSLSEQELVDCDKQKNQGCNGGLMDSAFEFIIQNGGLDS 201

Query: 220 EDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQ 278
           E +YPY+AV G+C  +++ +    I  +E+VP+  E  LLKAV+ QPVS+AI A    FQ
Sbjct: 202 EADYPYKAVSGSCDESRRNSHVVTIDGFEDVPAESEADLLKAVANQPVSVAIEASGRNFQ 261

Query: 279 SYKEGIFNGVCGTQLDHAVTIVGFGT--TEDGA--NYWLIKNSWGNTWGDAGYMKIVRD- 333
            Y  G++ G CG +LDH V  VG+GT  T DG   +YW+++NSWG+ WG++GY+++ R+ 
Sbjct: 262 LYSGGVYTGHCGYELDHGVVAVGYGTSKTPDGVATDYWIVRNSWGDAWGESGYIRLQRNV 321

Query: 334 ---EGLCGIGTRSSYPL 347
               G CGI   +SYP+
Sbjct: 322 ASPRGKCGIAMMASYPV 338


>gi|297802418|ref|XP_002869093.1| hypothetical protein ARALYDRAFT_491113 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297314929|gb|EFH45352.1| hypothetical protein ARALYDRAFT_491113 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 355

 Score =  311 bits (798), Expect = 2e-82,   Method: Compositional matrix adjust.
 Identities = 148/314 (47%), Positives = 214/314 (68%), Gaps = 9/314 (2%)

Query: 38  THEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFS 97
           T  + ++E+ E WM++H + YK   EK  R ++F+ENL +I++ N E N +Y LG N+F+
Sbjct: 42  TSTEKLLELFESWMSEHSKVYKSVEEKVHRFEVFRENLMHIDQRNNEIN-SYWLGLNEFA 100

Query: 98  DLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKEC 157
           DLT++EF+  Y G   P  S +   S+ F+Y+++  TD+P S+DWR KGAV P+K+Q +C
Sbjct: 101 DLTHEEFKGRYLGLAKPQFSRKRQPSANFRYRDI--TDLPKSVDWRKKGAVAPVKDQGQC 158

Query: 158 GCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGI 217
           G CWAF+ VAAVEGI +I +GNL  LSEQ+L+DC T  N+GC GG  + AF YII   G+
Sbjct: 159 GSCWAFSTVAAVEGINQITTGNLSSLSEQELIDCDTTFNSGCNGGLMDYAFQYIISTGGL 218

Query: 218 ATEDEYPYQAVPGTCSAAQKPAA-AKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTE 276
             ED+YPY    G C   ++      IS YE+VP  D+++L+KA++ QPVS+AI A   +
Sbjct: 219 HKEDDYPYLMEEGICQEQKEDVERVTISGYEDVPENDDESLVKALAHQPVSVAIEASGRD 278

Query: 277 FQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD--- 333
           FQ YK G+FNG CGT LDH V  VG+G+++ G++Y ++KNSWG  WG+ G++++ R+   
Sbjct: 279 FQFYKGGVFNGQCGTDLDHGVAAVGYGSSK-GSDYVIVKNSWGPRWGEKGFIRMKRNTGK 337

Query: 334 -EGLCGIGTRSSYP 346
            EGLCGI   +SYP
Sbjct: 338 PEGLCGINKMASYP 351


>gi|302816909|ref|XP_002990132.1| hypothetical protein SELMODRAFT_428615 [Selaginella moellendorffii]
 gi|300142145|gb|EFJ08849.1| hypothetical protein SELMODRAFT_428615 [Selaginella moellendorffii]
          Length = 358

 Score =  311 bits (797), Expect = 3e-82,   Method: Compositional matrix adjust.
 Identities = 149/311 (47%), Positives = 210/311 (67%), Gaps = 14/311 (4%)

Query: 46  IHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFR 105
           ++EKWM  HGR Y    EKE R +IF++N EYIE+ N++ N+TY LG N F+D+T+DEF+
Sbjct: 33  LYEKWMVDHGRVYNGIGEKERRFQIFRDNAEYIEEHNRQVNQTYWLGLNNFADMTHDEFK 92

Query: 106 ALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAA 165
           ALY G K+P  +   T  S F+Y++   T++P   DWR KGAV  +KNQ  CG CWAF+ 
Sbjct: 93  ALYFGTKVPLSN---TIKSGFRYED--ATNLPLDTDWRSKGAVATVKNQGACGSCWAFST 147

Query: 166 VAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPY 225
           VAAVEG+ +I +G L+ LSEQ+L+DC    N GC GG  + AF +IIQN G+ +E +YPY
Sbjct: 148 VAAVEGVNQIVTGELVSLSEQELVDCDKQKNQGCNGGLMDSAFEFIIQNGGLDSEADYPY 207

Query: 226 QAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGI 284
           +AV G+C  +++ +    I  +E+VP+  E  LLKAV+ QPVS+AI A    FQ Y  G+
Sbjct: 208 KAVSGSCDESRRNSHVVTIDGFEDVPAESEADLLKAVANQPVSVAIEASGRNFQLYSGGV 267

Query: 285 FNGVCGTQLDHAVTIVGFGT--TEDGA--NYWLIKNSWGNTWGDAGYMKIVRD----EGL 336
           + G CG +LDH V  VG+GT  T DG   +YW+++NSWG+ WG++GY+++ R+     G 
Sbjct: 268 YTGHCGYELDHGVVAVGYGTSKTPDGVATDYWIVRNSWGDAWGESGYIRLQRNVASSRGK 327

Query: 337 CGIGTRSSYPL 347
           CGI   +SYP+
Sbjct: 328 CGIAMMASYPV 338


>gi|255568345|ref|XP_002525147.1| cysteine protease, putative [Ricinus communis]
 gi|223535606|gb|EEF37274.1| cysteine protease, putative [Ricinus communis]
          Length = 347

 Score =  311 bits (797), Expect = 3e-82,   Method: Compositional matrix adjust.
 Identities = 150/307 (48%), Positives = 210/307 (68%), Gaps = 13/307 (4%)

Query: 47  HEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRA 106
           ++KW+ Q+GR Y  + E  +R  I+  N+++IE  N + N ++KL  N+F+DLTNDEF +
Sbjct: 46  YDKWLEQYGRKYDTKDEYLLRFGIYHSNIQFIEYINSQ-NLSFKLTDNKFADLTNDEFNS 104

Query: 107 LYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAV 166
           +Y GY++     RS       + + + TD+P ++DWR+ GAVTPIK+Q +CG CWAF+AV
Sbjct: 105 IYLGYQI-----RSYKRRNLSHMHENSTDLPDAVDWRENGAVTPIKDQGQCGSCWAFSAV 159

Query: 167 AAVEGITKIRSGNLIQLSEQQLLDCSTNGNN-GCLGGSREKAFAYIIQNQGIATEDEYPY 225
           AAVEGI KI++GNL+ LSEQ+L+DC  NG+N GC GG  EKAF +I    G+ TE++YPY
Sbjct: 160 AAVEGINKIKTGNLVSLSEQELVDCDVNGDNKGCNGGFMEKAFTFIKSIGGLTTENDYPY 219

Query: 226 QAVPGTCSAAQKP-AAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGI 284
           +   G+C  A+    A  I  YE VP+ +E +L  AVS QPVS+AI A   EFQ Y EG+
Sbjct: 220 KGTDGSCEKAKTDNHAVIIGGYETVPANNENSLKVAVSKQPVSVAIDASGYEFQLYSEGV 279

Query: 285 FNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIG 340
           F+G CG QL+H VTIVG+G   +G  YWL+KNSWG  WG++GY+++ RD    +G+CGI 
Sbjct: 280 FSGYCGIQLNHGVTIVGYGDN-NGQKYWLVKNSWGKGWGESGYIRMKRDSSDTKGMCGIA 338

Query: 341 TRSSYPL 347
              SYP+
Sbjct: 339 MEPSYPI 345


>gi|356515044|ref|XP_003526211.1| PREDICTED: LOW QUALITY PROTEIN: thiol protease SEN102-like [Glycine
           max]
          Length = 337

 Score =  311 bits (797), Expect = 3e-82,   Method: Compositional matrix adjust.
 Identities = 161/333 (48%), Positives = 216/333 (64%), Gaps = 14/333 (4%)

Query: 20  IIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIE 79
           + + LL+S     V SR  HE S+ E HE W+A++G+ YK   EKE   +IFKEN+E+IE
Sbjct: 11  LALFLLLSIEISQVMSRKLHETSLREEHENWIARYGQVYKVAAEKET-FQIFKENVEFIE 69

Query: 80  KANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTS 139
             N   N+ YKLG N F+DLT +EF+    G K    +H  + +  FKY+N+  TD+P +
Sbjct: 70  SFNAAANKPYKLGVNLFADLTLEEFKDFRFGLK---KTHEFSITP-FKYENV--TDIPEA 123

Query: 140 LDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-NNG 198
           LDWR+KGAVTPIK+Q +CG CWAF+ VAA EGI +I +GNL+ L EQ+L+ C T G + G
Sbjct: 124 LDWREKGAVTPIKDQGQCGSCWAFSTVAATEGIHQITTGNLVSLXEQELVSCDTKGVDQG 183

Query: 199 CLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAA-QKPAAAKISNYEEVPSGDEQAL 257
           C GG  E  F +II+N GI T+  YPY+ V GTC+        A+I  YE VPS  E+AL
Sbjct: 184 CEGGYMEDGFEFIIKNGGITTKANYPYKGVNGTCNTTIAASTVAQIKGYETVPSYSEEAL 243

Query: 258 LKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNS 317
            KAV+ QPVS++I A +  F  Y  GI+ G CGT LDH VT VG+GTT +  +YW++KNS
Sbjct: 244 QKAVANQPVSVSIDANNGHFMFYAGGIYTGECGTDLDHGVTAVGYGTTNE-TDYWIVKNS 302

Query: 318 WGNTWGDAGYMKIVR----DEGLCGIGTRSSYP 346
           WG  W + G++++ R      GLCG+   SSYP
Sbjct: 303 WGTGWDEKGFIRMQRGITVKHGLCGVALDSSYP 335


>gi|1256830|gb|AAB68374.1| cysteine endopeptidase 1 [Phaseolus vulgaris]
 gi|2959418|emb|CAA12118.1| cysteine protease [Phaseolus vulgaris]
          Length = 364

 Score =  311 bits (796), Expect = 4e-82,   Method: Compositional matrix adjust.
 Identities = 154/338 (45%), Positives = 215/338 (63%), Gaps = 10/338 (2%)

Query: 17  PMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLE 76
           P  ++++   S A+  +S  +  E  V++++E+W+ +H + Y    EKE R ++FK+NL 
Sbjct: 7   PTLLLLSFTFSHAT-AMSIINYSENEVMDMYEEWLVKHRKVYNGLDEKEKRFQVFKDNLG 65

Query: 77  YIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSST-FKYQNLSMTD 135
           +I+  N + N TY LG N+F+D+TN E+RA+Y G +  +      T +T  +Y   S   
Sbjct: 66  FIQDHNAQ-NNTYTLGLNKFADITNKEYRAMYLGTRTDAKRRVMKTQNTGHRYAYNSGDQ 124

Query: 136 VPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG 195
           +P  +DWR KGAV PIK+Q  CG CWAF+ VAAVEGI  I +G  + LSEQ+L+DC    
Sbjct: 125 LPVHVDWRLKGAVGPIKDQGNCGSCWAFSTVAAVEGINNIVTGEFVSLSEQELVDCDREY 184

Query: 196 NNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDE 254
           + GC GG  + AF +IIQN GI TE++YPYQ + GTC   +K     +I  YE+VPS +E
Sbjct: 185 DEGCNGGLMDYAFQFIIQNGGIDTEEDYPYQGIDGTCDETKKKTKVVQIDGYEDVPSNNE 244

Query: 255 QALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLI 314
            AL KAVS QPVS+AI A     Q Y+ G+F G CGT LDH V +VG+G TE+G +YWL+
Sbjct: 245 NALKKAVSHQPVSVAIEASGRALQLYQSGVFTGKCGTALDHGVVVVGYG-TENGVDYWLV 303

Query: 315 KNSWGNTWGDAGYMKIVRD-----EGLCGIGTRSSYPL 347
           +NSWG  WG+ GY K+ R+     EG CGI    SYP+
Sbjct: 304 RNSWGTGWGEDGYFKMERNVRSTSEGKCGIAMDCSYPV 341


>gi|194352752|emb|CAQ00104.1| papain-like cysteine proteinase [Hordeum vulgare subsp. vulgare]
          Length = 351

 Score =  311 bits (796), Expect = 4e-82,   Method: Compositional matrix adjust.
 Identities = 154/317 (48%), Positives = 220/317 (69%), Gaps = 11/317 (3%)

Query: 37  STHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQF 96
           S+H++ +VE+ EKW+A+H ++Y    EK  R ++FK+NL+ I++ N+E   +Y LG N+F
Sbjct: 35  SSHDR-LVELFEKWLAKHQKAYASFEEKLHRFEVFKDNLKLIDEINRE-VTSYWLGLNEF 92

Query: 97  SDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKE 156
           +DLT+DEF+  Y G  +  P  R ++S +F+Y+N++  D+P ++DWR KGAVT +KNQ +
Sbjct: 93  ADLTHDEFKTTYLG--LSPPPARRSSSRSFRYENVAAHDLPKAVDWRKKGAVTDVKNQGQ 150

Query: 157 CGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQG 216
           CG CWAF+ VAAVEGI  I +GNL  LSEQ+L+DCS +GN+GC GG  + AF+YI  + G
Sbjct: 151 CGSCWAFSTVAAVEGINAIVTGNLTALSEQELIDCSVDGNSGCNGGMMDYAFSYIASSGG 210

Query: 217 IATEDEYPYQAVPGTCSAAQK--PAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYS 274
           + TE+ YPY    G+C   +K    A  IS YE+VP+ DEQAL+KA++ QPVS+AI A  
Sbjct: 211 LHTEEAYPYLMEEGSCGDGKKSESEAVSISGYEDVPTKDEQALIKALAHQPVSVAIEASG 270

Query: 275 TEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTE-DGANYWLIKNSWGNTWGDAGYMKIVR- 332
             FQ Y  G+F+G CG QLDH V  VG+G+ +  G +Y ++KNSWG  WG+ GY+++ R 
Sbjct: 271 RHFQFYSGGVFDGPCGAQLDHGVAAVGYGSDKGKGHDYIIVKNSWGGKWGEKGYIRMKRG 330

Query: 333 ---DEGLCGIGTRSSYP 346
               EGLCGI   +SYP
Sbjct: 331 TGKSEGLCGINKMASYP 347


>gi|50355611|dbj|BAD29954.1| cysteine protease [Daucus carota]
          Length = 474

 Score =  310 bits (795), Expect = 4e-82,   Method: Compositional matrix adjust.
 Identities = 155/355 (43%), Positives = 226/355 (63%), Gaps = 14/355 (3%)

Query: 1   MVLIFERSGSFKINTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKD 60
           +VL F  +    +++     IIT   +        R TH+Q ++ ++E W+ +H ++Y  
Sbjct: 16  LVLFFSLASFLMLSSASDMSIITYDETHGLNSPPLR-THDQ-LLSLYESWLVKHHKNYNA 73

Query: 61  ELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRS 120
             EKE R  IFK+N+ ++++ N   N++YKLG N+F+DLTNDE+R+LY   KM     ++
Sbjct: 74  LGEKETRFGIFKDNVGFVDRHNSMRNQSYKLGLNKFADLTNDEYRSLYLSGKMMKRERKN 133

Query: 121 TTSSTFKYQNLSMTD---VPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRS 177
                F+       D   +P S+DWRD+GAV P+K+Q +CG CWAF+ V AVEGI KI +
Sbjct: 134 EDG--FRSDRFVFEDGDHLPESVDWRDRGAVAPVKDQGQCGSCWAFSTVGAVEGINKIVT 191

Query: 178 GNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQK 237
           G LI LSEQ+L+DC    N GC GG  + AF +I++N GI TED+YPY+ V G C   +K
Sbjct: 192 GELISLSEQELVDCDNGYNQGCNGGLMDYAFEFIVKNGGIDTEDDYPYKGVDGLCDQNRK 251

Query: 238 PA-AAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHA 296
            A    I+ YE+VP  DE++L KAV+ QPVS+AI A    FQ Y+ G+F G CGT+LDH 
Sbjct: 252 NAKVVTINGYEDVPHNDEKSLKKAVAHQPVSVAIEAGGRAFQLYESGVFTGQCGTELDHG 311

Query: 297 VTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD-----EGLCGIGTRSSYP 346
           V  VG+G +E+G +YW+++NSWG  WG++GY+++ R+      G CGI  ++SYP
Sbjct: 312 VVAVGYG-SENGKDYWIVRNSWGPDWGESGYIRLERNVASTSTGKCGIAMQASYP 365


>gi|374713649|gb|AEZ65082.1| cysteine protease [Carica papaya]
          Length = 471

 Score =  310 bits (795), Expect = 5e-82,   Method: Compositional matrix adjust.
 Identities = 153/353 (43%), Positives = 225/353 (63%), Gaps = 18/353 (5%)

Query: 11  FKINTTPMFIIITLLVSCASQVV----------SSRSTHEQSVVEIHEKWMAQHGRSYKD 60
           F++     F+ +   +S AS  +           S    E  +++++E W+ +HG++Y  
Sbjct: 6   FRLCIAISFLFMVFSLSLASMSIIDYDLPADPLQSTERTEAHMMKMYEHWLVKHGKNYNA 65

Query: 61  ELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSH-R 119
             EKE R +IFK+NL ++++ N    RTYKLG  +F+DLTN+E+RA+Y G KM      R
Sbjct: 66  IGEKERRFEIFKDNLRFVDEQNSVPGRTYKLGLTKFADLTNEEYRAMYLGAKMEKKEKLR 125

Query: 120 STTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGN 179
           +  S  + ++  +  D+P+ +DWR+KGAVT +K+Q +CG CWAF+ V +VEGI +I +G+
Sbjct: 126 TERSQRYLHKAGNDDDLPSHVDWREKGAVTEVKDQGQCGSCWAFSTVGSVEGINQIVTGD 185

Query: 180 LIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPA 239
           LI LSEQ+L+DC    N GC GG  + AF +II+N GI +E +YPY+A    C + +K A
Sbjct: 186 LISLSEQELVDCDKAYNQGCNGGLMDYAFEFIIKNGGIDSEADYPYRASDNMCDSNRKNA 245

Query: 240 -AAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVT 298
               I  YE+VP  DE++L KAV+ QPVS+AI A   EFQ Y+ G+F G CGT LDH V 
Sbjct: 246 HVVTIDGYEDVPENDEESLKKAVANQPVSVAIEAGGREFQLYQSGVFTGRCGTNLDHGVV 305

Query: 299 IVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR-----DEGLCGIGTRSSYP 346
            VG+G TE+G +YW+++NSWG  WG++GY+++ R     D G CGI   +SYP
Sbjct: 306 AVGYG-TENGIDYWIVRNSWGPKWGESGYIRMERNVASTDTGKCGIAMEASYP 357


>gi|242092700|ref|XP_002436840.1| hypothetical protein SORBIDRAFT_10g009830 [Sorghum bicolor]
 gi|241915063|gb|EER88207.1| hypothetical protein SORBIDRAFT_10g009830 [Sorghum bicolor]
          Length = 328

 Score =  310 bits (795), Expect = 5e-82,   Method: Compositional matrix adjust.
 Identities = 151/337 (44%), Positives = 210/337 (62%), Gaps = 24/337 (7%)

Query: 18  MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
           +  I+ L   C + + +     + ++V  HE+WM Q+ R YKD  EK  R ++FK N+++
Sbjct: 8   ILAILGLAFFCGAALAARDLNDDSAMVARHEQWMVQYSRVYKDTTEKARRFEVFKANVKF 67

Query: 78  IEKANKEGNRTYKLGTNQFSDLTNDEFRALYT--GYKMPSPSHRSTTSSTFKYQNLSMTD 135
           IE  N  GNR + LG NQF+DLTNDEFRA  T  G+K PSP    T    F+Y+N+S+  
Sbjct: 68  IESFNAGGNRKFWLGVNQFADLTNDEFRATKTNKGFK-PSPVKVPTG---FRYENVSVDA 123

Query: 136 VPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG 195
           +P ++DWR KGAVTPIK+Q +C            EGI KI +G LI LSEQ+L+DC  +G
Sbjct: 124 LPATIDWRTKGAVTPIKDQGQC------------EGIVKISTGKLISLSEQELVDCDVHG 171

Query: 196 -NNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDE 254
            + GC GG  + AF +II+N G+ TE  YPY A  G C +    +AA +  +E+VP+ DE
Sbjct: 172 EDQGCEGGLMDDAFQFIIKNGGLTTESSYPYTAADGKCKSGSN-SAATVKGFEDVPANDE 230

Query: 255 QALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLI 314
            AL+KAV+ QPVS+A+      FQ Y  G+  G CGT LDH +  +G+G T DG  YWL+
Sbjct: 231 AALMKAVANQPVSVAVDGGDMTFQFYSGGVMTGSCGTDLDHGIAAIGYGQTSDGTKYWLL 290

Query: 315 KNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPL 347
           KNSWG TWG+ GY+++ +D     G+CG+    SYP+
Sbjct: 291 KNSWGTTWGENGYLRMEKDISDKRGMCGLAMEPSYPI 327


>gi|297852302|ref|XP_002894032.1| F2G19.31/F2G19.31 [Arabidopsis lyrata subsp. lyrata]
 gi|297339874|gb|EFH70291.1| F2G19.31/F2G19.31 [Arabidopsis lyrata subsp. lyrata]
          Length = 455

 Score =  310 bits (795), Expect = 5e-82,   Method: Compositional matrix adjust.
 Identities = 156/350 (44%), Positives = 226/350 (64%), Gaps = 25/350 (7%)

Query: 18  MFIIITLLVSCASQVVSSRSTHEQS-------------VVEIHEKWMAQHGRSYKDE--L 62
           M I+   +V+ AS V  S  ++++              V+ I+E W+ +HG++      +
Sbjct: 1   MVILFLAMVAVASAVDMSIISYDEKHGVSTTGGRSDAEVMSIYEAWLVKHGKAQNQNSLV 60

Query: 63  EKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTT 122
           EK+ R +IFK+NL +I+  NK+ N +Y+LG  +F+DLTNDE+R+ Y G KM     R T+
Sbjct: 61  EKDRRFEIFKDNLRFIDDHNKK-NLSYRLGLTRFADLTNDEYRSKYLGAKMEKKGERRTS 119

Query: 123 SSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQ 182
               +Y+     ++P S+DWR KGAV  +K+Q  CG CWAF+ + AVEGI +I +G+LI 
Sbjct: 120 Q---RYEARVGDELPESIDWRKKGAVAEVKDQGSCGSCWAFSTIGAVEGINQIVTGDLIT 176

Query: 183 LSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPA-AA 241
           LSEQ+L+DC T+ N GC GG  + AF +II+N GI T+ +YPY+ V GTC   +K A   
Sbjct: 177 LSEQELVDCDTSYNEGCNGGLMDYAFEFIIKNGGIDTDKDYPYKGVDGTCDQIRKNAKVV 236

Query: 242 KISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVG 301
            I +YE+VP+  E++L KAV+ QPVS+AI A    FQ Y  GIF+G CGTQLDH V  VG
Sbjct: 237 TIDSYEDVPTYSEESLKKAVAHQPVSVAIEAGGRAFQLYDSGIFDGTCGTQLDHGVVAVG 296

Query: 302 FGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPL 347
           +G TE+G +YW+++NSWG +WG++GY+K+ R+     G CGI    SYP+
Sbjct: 297 YG-TENGKDYWIVRNSWGKSWGESGYLKMARNIASSSGKCGIAIEPSYPI 345


>gi|358248896|ref|NP_001239703.1| uncharacterized protein LOC100799247 precursor [Glycine max]
 gi|255636729|gb|ACU18700.1| unknown [Glycine max]
          Length = 341

 Score =  310 bits (795), Expect = 5e-82,   Method: Compositional matrix adjust.
 Identities = 148/311 (47%), Positives = 204/311 (65%), Gaps = 9/311 (2%)

Query: 45  EIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEF 104
           E HEKWMAQ+G+ YKD  EKE R ++FK N+++IE  N  G++ + L  NQF+DL ++EF
Sbjct: 33  ERHEKWMAQYGKVYKDAAEKEKRFQVFKNNVQFIESFNAAGDKPFNLSINQFADLHDEEF 92

Query: 105 RALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQK-ECGCCWAF 163
           +AL    +  +    + T ++F+Y+N+  T +P+++DWR +GAVTPIK+Q   CG CWAF
Sbjct: 93  KALLNNVQKKASRVETATETSFRYENV--TKIPSTMDWRKRGAVTPIKDQGYTCGSCWAF 150

Query: 164 AAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEY 223
           A VA VE + +I +G L+ LSEQ+L+DC    + GC GG  E AF +I    GI +E  Y
Sbjct: 151 ATVATVESLHQITTGELVSLSEQELVDCVRGDSEGCRGGYVENAFEFIANKGGITSEAYY 210

Query: 224 PYQAVPGTCSAAQKP-AAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKE 282
           PY+    +C   ++    A+I  YE VPS  E+ALLKAV+ QPVS+ I A +  F+ Y  
Sbjct: 211 PYKGKDRSCKVKKETHGVARIIGYESVPSNSEKALLKAVANQPVSVYIDAGAIAFKFYSS 270

Query: 283 GIFNGV-CGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLC 337
           GIF    CGT LDHAV +VG+G   DG  YWL+KNSW   WG+ GYM+I RD    +GLC
Sbjct: 271 GIFEARNCGTHLDHAVAVVGYGKLRDGTKYWLVKNSWSTAWGEKGYMRIKRDIRAKKGLC 330

Query: 338 GIGTRSSYPLA 348
           GI + +SYP+A
Sbjct: 331 GIASNASYPIA 341


>gi|18141285|gb|AAL60580.1|AF454958_1 senescence-associated cysteine protease [Brassica oleracea]
          Length = 485

 Score =  310 bits (794), Expect = 7e-82,   Method: Compositional matrix adjust.
 Identities = 152/352 (43%), Positives = 227/352 (64%), Gaps = 25/352 (7%)

Query: 13  INTTPMFIIITLLVSCAS------------QVVSSRSTHEQSVVEIHEKWMAQHGRSYKD 60
           +N+  + + +T++V  ++              VSSRS  E  V  ++E+W+ +HG++   
Sbjct: 4   LNSATVILFLTMIVVSSAMDMSIISYDKNHHTVSSRSDAE--VSRLYEEWLVKHGKAQNS 61

Query: 61  ELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRS 120
             EK+ R +IFK+NL +I++ N + N +Y+LG  +F+DLTNDE+R++Y G ++     R 
Sbjct: 62  LTEKDRRFEIFKDNLRFIDEHNGK-NLSYRLGLTKFADLTNDEYRSMYLGSRL----KRK 116

Query: 121 TTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNL 180
            T S+ +Y+      +P S+DWR +GAV  +K+Q  CG CWAF+ + AVEGI KI +G+L
Sbjct: 117 ATKSSLRYEVRVGDAIPESVDWRKEGAVAEVKDQGSCGSCWAFSTIGAVEGINKIVTGDL 176

Query: 181 IQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPA- 239
           I LSEQ+L+DC T+ N GC GG  + AF +II N GI TE++YPY+ V G C   +K A 
Sbjct: 177 ITLSEQELVDCDTSYNEGCNGGLMDYAFEFIINNGGIDTEEDYPYKGVDGRCDQTRKNAK 236

Query: 240 AAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTI 299
              I  YE+VP+  E++L KA+S QP+S+AI      FQ Y  GIF+G+CGT LDH V  
Sbjct: 237 VVTIDLYEDVPANSEESLKKALSHQPISVAIEGGGRAFQLYDSGIFDGICGTDLDHGVVA 296

Query: 300 VGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPL 347
           VG+G TE+G +YW++KNSWG +WG++GY+++ R+     G CGI    SYP+
Sbjct: 297 VGYG-TENGKDYWIVKNSWGTSWGESGYIRMERNIASSAGKCGIAVEPSYPI 347


>gi|34223513|gb|AAQ62999.1| oil palm polygalacturonase allergen PEST472 [Elaeis guineensis]
          Length = 525

 Score =  310 bits (793), Expect = 8e-82,   Method: Compositional matrix adjust.
 Identities = 151/315 (47%), Positives = 212/315 (67%), Gaps = 10/315 (3%)

Query: 40  EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKAN---KEGNRTYKLGTNQF 96
           E+ +  ++E W+A+HGR+     EKE R +IFK+N+ +I+  N     G+R+++LG N+F
Sbjct: 43  EEEMRLLYEGWLAKHGRADNALGEKERRFEIFKDNVRFIDAHNAAADSGHRSFRLGLNRF 102

Query: 97  SDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKE 156
           +D+TN+E+R +Y G + P+   R     + +Y+  +  ++P S+DWRDKGAVT +K+Q  
Sbjct: 103 ADMTNEEYRTVYLGTR-PASHRRRARLGSDRYRYNAGEELPESVDWRDKGAVTTVKDQGS 161

Query: 157 CGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQG 216
           CG CWAF+ +AAVEGI KI +G+LI LSEQ+L+DC    N GC GG  + AF +II N G
Sbjct: 162 CGSCWAFSTIAAVEGINKIVTGDLISLSEQELVDCDNGQNQGCNGGLMDYAFEFIINNGG 221

Query: 217 IATEDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYST 275
           I TE++YPY+A  G C   +K A    I  YE+VP  DE+AL KAV+ QPVS+AI A   
Sbjct: 222 IDTEEDYPYKARDGKCDQYRKNAKVVSIDGYEDVPVNDEKALQKAVANQPVSVAIEAGGR 281

Query: 276 EFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD-- 333
           EFQ Y  GIF G CGT LDH V  VG+G TE+G +YW+++NSWG  WG++GY+++ R+  
Sbjct: 282 EFQLYHSGIFTGRCGTDLDHGVVAVGYG-TENGKDYWIVRNSWGGDWGESGYIRMERNVN 340

Query: 334 --EGLCGIGTRSSYP 346
              G CGI   SSYP
Sbjct: 341 ASTGKCGIAMESSYP 355


>gi|414585111|tpg|DAA35682.1| TPA: cysteine proteinase Mir3 [Zea mays]
          Length = 468

 Score =  310 bits (793), Expect = 8e-82,   Method: Compositional matrix adjust.
 Identities = 152/324 (46%), Positives = 211/324 (65%), Gaps = 12/324 (3%)

Query: 32  VVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKAN---KEGNRT 88
           +VS     ++    ++ +WMA HGR+Y    E+E R ++F++NL YI+  N     G  +
Sbjct: 31  IVSYGERSDEEARRMYAEWMAAHGRTYNAVGEEERRYQVFRDNLRYIDAHNAAADAGVHS 90

Query: 89  YKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAV 148
           ++LG N+F+DLTNDE+RA Y G +      R   +   +Y      D+P S+DWR KGAV
Sbjct: 91  FRLGLNRFADLTNDEYRATYLGARTRPQRERKLGA---RYHAADNEDLPESVDWRAKGAV 147

Query: 149 TPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAF 208
             +K+Q  CG CWAF+ +AAVEGI +I +G+LI LSEQ+L+DC T+ N GC GG  + AF
Sbjct: 148 AEVKDQGSCGSCWAFSTIAAVEGINQIVTGDLISLSEQELVDCDTSYNQGCNGGLMDYAF 207

Query: 209 AYIIQNQGIATEDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVS 267
            +II N GI TE +YPY+   G C   +K A    I +YE+VP+ DE++L KAV+ QPVS
Sbjct: 208 EFIINNGGIDTEKDYPYKGTDGRCDVNRKNAKVVTIDSYEDVPANDEKSLQKAVANQPVS 267

Query: 268 IAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGY 327
           +AI A  T FQ Y  GIF G CGT LDH VT VG+G TE+G +YW++KNSWG++WG++GY
Sbjct: 268 VAIEAAGTAFQLYSSGIFTGSCGTALDHGVTAVGYG-TENGKDYWIVKNSWGSSWGESGY 326

Query: 328 MKIVRD----EGLCGIGTRSSYPL 347
           +++ R+     G CGI    SYPL
Sbjct: 327 VRMERNIKASSGKCGIAVEPSYPL 350


>gi|171702843|dbj|BAG16377.1| cysteine protease [Brassica rapa var. perviridis]
          Length = 431

 Score =  310 bits (793), Expect = 8e-82,   Method: Compositional matrix adjust.
 Identities = 148/322 (45%), Positives = 217/322 (67%), Gaps = 13/322 (4%)

Query: 31  QVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYK 90
             VSSRS  E  V  ++E+W+ +HG++     EK+ R +IFK+NL +I++ N + N +Y+
Sbjct: 28  HTVSSRSDVE--VSRLYEEWVVKHGKAQNSLTEKDRRFEIFKDNLRFIDEHNGK-NLSYR 84

Query: 91  LGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTP 150
           LG  +F+DLTNDE+R++Y G ++     R  T ++ +Y+      +P S+DWR +GAV  
Sbjct: 85  LGLTKFADLTNDEYRSMYLGSRL----KRKATKTSLRYEARVGDAIPESVDWRKEGAVAE 140

Query: 151 IKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAY 210
           +K+Q  CG CWAF+ + AVEGI KI +G+LI LSEQ+L+DC T+ N GC GG  + AF +
Sbjct: 141 VKDQGSCGSCWAFSTIGAVEGINKIVTGDLISLSEQELVDCDTSYNEGCNGGLMDYAFEF 200

Query: 211 IIQNQGIATEDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVSIA 269
           II+N GI TE++YPY+ V G C   +K A    I +YE+VP+  E++L KA+S QP+S+A
Sbjct: 201 IIKNGGIDTEEDYPYKGVDGRCDQTRKNAKVVTIDSYEDVPANSEESLKKALSHQPISVA 260

Query: 270 IAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMK 329
           I      FQ Y  GIF+G+CGT LDH V  VG+G TE+G +YW++KNSWG +WG++GY++
Sbjct: 261 IEGGGRAFQLYDSGIFDGICGTDLDHGVVAVGYG-TENGKDYWIVKNSWGTSWGESGYIR 319

Query: 330 IVRD----EGLCGIGTRSSYPL 347
           + R+     G CGI    SYP+
Sbjct: 320 MERNIASSAGKCGIAVEPSYPI 341


>gi|146215996|gb|ABQ10200.1| cysteine protease Cp2 [Actinidia deliciosa]
          Length = 376

 Score =  310 bits (793), Expect = 8e-82,   Method: Compositional matrix adjust.
 Identities = 147/320 (45%), Positives = 219/320 (68%), Gaps = 9/320 (2%)

Query: 35  SRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTN 94
           S S  ++ V+ I+ +W+A+HG++Y    E+E R +IFK+NL+++++ N E NR+YK+G N
Sbjct: 35  SSSRTDEEVMGIYAEWLAKHGKAYNGIGERERRFEIFKDNLKFVDEHNSE-NRSYKVGLN 93

Query: 95  QFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTD-VPTSLDWRDKGAVTPIKN 153
           +F+DLTN+E+R+++ G K  S      + S  +   +  +D +P S+DWR+ GAV PIK+
Sbjct: 94  RFADLTNEEYRSMFLGTKTDSKRRFMKSKSASRRYAVQDSDMLPESVDWRESGAVAPIKD 153

Query: 154 QKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQ 213
           Q  CG CWAF+ VAAVEG+ +I +G +IQLSEQ+L+DC    + GC GG  + AF +II 
Sbjct: 154 QGSCGSCWAFSTVAAVEGVNQIATGEMIQLSEQELVDCDRTYDAGCNGGLMDYAFEFIIN 213

Query: 214 NQGIATEDEYPYQAVPGTCSAAQK-PAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAA 272
           N GI TE++YPY+ V GTC   +K      I++YE+VP  DE AL KAV+ QPVS+AI A
Sbjct: 214 NGGIDTEEDYPYRGVDGTCDPERKNTKVVSINDYEDVPPYDEMALKKAVAHQPVSVAIEA 273

Query: 273 YSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR 332
               FQ Y  G+F G CG  LDH V +VG+G T++GA++W+++NSWG +WG+ GY+++ R
Sbjct: 274 SGRAFQLYLSGVFTGECGRALDHGVVVVGYG-TDNGADHWIVRNSWGTSWGENGYIRMER 332

Query: 333 D-----EGLCGIGTRSSYPL 347
           +      G CGI  ++SYP+
Sbjct: 333 NVVDNFGGKCGIAMQASYPI 352


>gi|356559055|ref|XP_003547817.1| PREDICTED: cysteine proteinase RD21a [Glycine max]
          Length = 366

 Score =  310 bits (793), Expect = 9e-82,   Method: Compositional matrix adjust.
 Identities = 152/341 (44%), Positives = 213/341 (62%), Gaps = 9/341 (2%)

Query: 13  INTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFK 72
           + +T +F+  TL  SCA    +  +  +  V+ ++E+W+ +H + Y    EK+ R ++FK
Sbjct: 8   VTSTLLFLSFTL--SCAIDTSTITNYTDNEVMTMYEEWLVKHQKVYNGLREKDKRFQVFK 65

Query: 73  ENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSST-FKYQNL 131
           +NL +I++ N   N TYKLG NQF+D+TN+E+R +Y G K  +      T ST  +Y   
Sbjct: 66  DNLGFIQEHNNNQNNTYKLGLNQFADMTNEEYRVMYFGTKSDAKRRLMKTKSTGHRYAYS 125

Query: 132 SMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDC 191
           +   +P  +DWR KGAV PIK+Q  CG CWAF+ VA VE I KI +G  + LSEQ+L+DC
Sbjct: 126 AGDRLPVHVDWRVKGAVAPIKDQGSCGSCWAFSTVATVEAINKIVTGKFVSLSEQELVDC 185

Query: 192 STNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPA-AAKISNYEEVP 250
               N GC GG  + AF +IIQN GI T+ +YPY+   G C   +K A    I  +E+VP
Sbjct: 186 DRAYNEGCNGGLMDYAFEFIIQNGGIDTDKDYPYRGFDGICDPTKKNAKVVNIDGFEDVP 245

Query: 251 SGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGAN 310
             DE AL KAV+ QPVSIAI A   + Q Y+ G+F G CGT LDH V +VG+G +E+G +
Sbjct: 246 PYDENALKKAVAHQPVSIAIEASGRDLQLYQSGVFTGKCGTSLDHGVVVVGYG-SENGVD 304

Query: 311 YWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPL 347
           YWL++NSWG  WG+ GY K+ R+     G CGI   +SYP+
Sbjct: 305 YWLVRNSWGTGWGEDGYFKMQRNVRTPTGKCGITMEASYPV 345


>gi|5777889|emb|CAB53515.1| cysteine protease [Solanum tuberosum]
          Length = 466

 Score =  310 bits (793), Expect = 9e-82,   Method: Compositional matrix adjust.
 Identities = 152/351 (43%), Positives = 221/351 (62%), Gaps = 11/351 (3%)

Query: 7   RSGSFKINTTPMFIIITLLVSCASQVVSSRSTH-----EQSVVEIHEKWMAQHGRSYKDE 61
            S +  I+   M I  TL  +    ++S   TH     +  V  ++E W+ +HG+SY   
Sbjct: 4   HSSTLTISLLLMLIFSTLSSASDMSIISYDETHIHHRSDDEVSALYESWLIEHGKSYNAL 63

Query: 62  LEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRST 121
            EK+ R +IFK+NL+YI++ N   N++YKLG  +F+DLTN+E+R++Y G K      + +
Sbjct: 64  GEKDKRFQIFKDNLKYIDEQNSVPNQSYKLGLTKFADLTNEEYRSIYLGTKSSGDRRKLS 123

Query: 122 TSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLI 181
            + + +Y       +P S+DWRDKG +  +K+Q  CG CWAF+AVAA+E I  I +GNLI
Sbjct: 124 KNKSDRYLPKVGDSLPESVDWRDKGVLVGVKDQGSCGSCWAFSAVAAMESINAIVTGNLI 183

Query: 182 QLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPA-A 240
            LSEQ+L+DC  + N GC GG  + AF ++I N GI TE++YPY+     C   +K A  
Sbjct: 184 SLSEQELVDCDKSYNEGCDGGLMDYAFEFVINNGGIDTEEDYPYKERNDVCDQYRKNAKV 243

Query: 241 AKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIV 300
            KI +YE+VP  +E+AL KAV+ QPVSIAI A   + Q YK GIF G CGT +DH V   
Sbjct: 244 VKIDSYEDVPVNNEKALQKAVAHQPVSIAIEAGGRDLQHYKSGIFTGKCGTAVDHGVVAA 303

Query: 301 GFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPL 347
           G+G +E+G +YW+++NSWG  WG+ GY+++ R+     GLCG+ T  SYP+
Sbjct: 304 GYG-SENGMDYWIVRNSWGAKWGEKGYLRVQRNVASSSGLCGLATEPSYPV 353


>gi|225428879|ref|XP_002285299.1| PREDICTED: cysteine proteinase RD21a-like [Vitis vinifera]
          Length = 469

 Score =  310 bits (793), Expect = 9e-82,   Method: Compositional matrix adjust.
 Identities = 148/314 (47%), Positives = 211/314 (67%), Gaps = 8/314 (2%)

Query: 40  EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDL 99
           +  V+ ++E W+ +HG+SY    E+E R +IFK+NL +IE+ N   NRTYK+G N+F+DL
Sbjct: 47  DAEVMAVYEAWLVKHGKSYNALGERERRFEIFKDNLRFIEEHNAV-NRTYKVGLNRFADL 105

Query: 100 TNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGC 159
           TN+E+R+ Y G +  +      +  + +Y   +  D+P S+DWR+KGAV P+K+Q  CG 
Sbjct: 106 TNEEYRSRYLGRRDETRRGLRASRVSDRYSFRAGEDLPESVDWREKGAVVPVKDQGNCGS 165

Query: 160 CWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIAT 219
           CWAF+ +AAVEGI +I +G+LI LSEQ+L+DC  + N GC GG  + AF +II N GI +
Sbjct: 166 CWAFSTIAAVEGINQIATGDLISLSEQELVDCDKSYNQGCNGGLMDYAFEFIINNGGIDS 225

Query: 220 EDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQ 278
           E++YPY+A   TC   +K A    I  YE+VP  DE++L KAV+ QPVS+AI A    FQ
Sbjct: 226 EEDYPYRAADTTCDPNRKNARVVSIDGYEDVPQNDERSLKKAVANQPVSVAIEAGGRAFQ 285

Query: 279 SYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR-----D 333
            Y+ G+F G CGTQLDH V  VG+G TE+  +YW+++NSWG  WG++GY+K+ R     +
Sbjct: 286 LYQSGVFTGQCGTQLDHGVVAVGYG-TENSVDYWIVRNSWGPNWGESGYIKLERNLAGTE 344

Query: 334 EGLCGIGTRSSYPL 347
            G CGI    SYP+
Sbjct: 345 TGKCGIAIEPSYPI 358


>gi|116781957|gb|ABK22314.1| unknown [Picea sitchensis]
          Length = 369

 Score =  309 bits (792), Expect = 1e-81,   Method: Compositional matrix adjust.
 Identities = 154/315 (48%), Positives = 207/315 (65%), Gaps = 12/315 (3%)

Query: 40  EQSVVEIHEKWMAQHGRSYK-DELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSD 98
           E+S+  +++ W  QH  S   D  E   R +IFKEN++YI+  NK+ +  YKLG N+F+D
Sbjct: 39  EKSLRSLYDNWALQHRSSRSLDSEEHAERFEIFKENVKYIDSVNKK-DSPYKLGLNKFAD 97

Query: 99  LTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECG 158
           L+N+EF+A+Y G KM     R   S +F YQN     +P S+DWR KGAV  +KNQ  CG
Sbjct: 98  LSNEEFKAIYMGTKMDLRGDREVQSGSFMYQN--SEPLPASIDWRQKGAVAAVKNQGHCG 155

Query: 159 CCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIA 218
            CWAF+ VA+VEGI  I +GNL+ LSEQQL+DCST  N+GC GG  + AF YII N GI 
Sbjct: 156 SCWAFSTVASVEGINYITTGNLVSLSEQQLVDCSTE-NSGCNGGLMDTAFQYIINNGGIV 214

Query: 219 TEDEYPYQAVPGTCSAAQ---KPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYST 275
           TED YPY A    CS+ +   +     I  +E+VP+ +EQAL +AV+ QPVS+AI A   
Sbjct: 215 TEDNYPYTAEATECSSTKINSQTTRVVIDGFEDVPANNEQALKEAVAHQPVSVAIEASGQ 274

Query: 276 EFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD-- 333
           +FQ Y  G+F G CGT LDH V  VG+GT+ +G NYW+++NSWG  WG+ GY+++ +   
Sbjct: 275 DFQFYSTGVFTGKCGTALDHGVVAVGYGTSPEGINYWIVRNSWGPKWGEEGYIRMQQGIE 334

Query: 334 --EGLCGIGTRSSYP 346
             EG CGI  ++SYP
Sbjct: 335 AAEGKCGIAMQASYP 349


>gi|62526575|gb|AAX84673.1| cysteine protease CP1 [Manihot esculenta]
          Length = 467

 Score =  309 bits (792), Expect = 1e-81,   Method: Compositional matrix adjust.
 Identities = 156/343 (45%), Positives = 220/343 (64%), Gaps = 16/343 (4%)

Query: 18  MFIIITLLVSCASQVVSSRSTH--------EQSVVEIHEKWMAQHGRSYKDELEKEMRLK 69
           +F+  TL  +    ++S   TH        +  V+ I+E+W+ + G+ Y    E+E R +
Sbjct: 15  LFLSFTLSSASDMSIISYDQTHATKSSWRTDDEVMAIYEEWLVKQGKVYNALGEREKRFQ 74

Query: 70  IFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQ 129
           +FK+NL +I++ N E NRTYKLG N F+DLTN+E+R+ Y G +     +R   +S  +Y 
Sbjct: 75  VFKDNLRFIDEHNSE-NRTYKLGLNGFADLTNEEYRSTYLGARGGMKRNRLRKTSD-RYA 132

Query: 130 NLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLL 189
                 +P S+DWR +GAV  +K+Q  CG CWAF+ +AAVEGI KI +G+LI LSEQ+L+
Sbjct: 133 PRVGESLPDSVDWRKEGAVAEVKDQGSCGSCWAFSTIAAVEGINKIVTGDLISLSEQELV 192

Query: 190 DCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPA-AAKISNYEE 248
           DC T+ N GC GG  + AF +II N GI TE++YPY A  G C   +K A    I +YE+
Sbjct: 193 DCDTSYNEGCNGGLMDYAFEFIINNGGIDTEEDYPYLARDGRCDTYRKNAKVVTIDDYED 252

Query: 249 VPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDG 308
           VP   E AL KAV+ QPVS+AI A   +FQ Y  GIF+G CGTQLDH V  VG+G TE+G
Sbjct: 253 VPVNSETALQKAVANQPVSVAIEAGGRDFQFYASGIFSGRCGTQLDHGVAAVGYG-TENG 311

Query: 309 ANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPL 347
            +YW+++NSWG +WG+ GY+++ R      G+CGI   +SYP+
Sbjct: 312 KDYWIVRNSWGKSWGENGYLRMARSINSPTGICGIAMEASYPI 354


>gi|171702831|dbj|BAG16371.1| cysteine protease [Brassica oleracea var. italica]
          Length = 441

 Score =  309 bits (792), Expect = 1e-81,   Method: Compositional matrix adjust.
 Identities = 149/322 (46%), Positives = 215/322 (66%), Gaps = 13/322 (4%)

Query: 31  QVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYK 90
             VSSRS  E  V  ++E+W+ +HG++     EK+ R +IFK+NL +I++ N + N +Y+
Sbjct: 28  HTVSSRSDAE--VSRLYEEWLVKHGKAQNSLTEKDRRFEIFKDNLRFIDEHNGK-NLSYR 84

Query: 91  LGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTP 150
           LG  +F+DLTNDE+R++Y G ++     R  T S+ +Y+      +P S+DWR +GAV  
Sbjct: 85  LGLTKFADLTNDEYRSMYLGSRL----KRKATKSSLRYEVRVGDAIPESVDWRKEGAVAE 140

Query: 151 IKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAY 210
           +K+Q  CG CWAF+ + AVEGI KI +G+LI LSEQ+L+DC T+ N GC GG  + AF +
Sbjct: 141 VKDQGSCGSCWAFSTIGAVEGINKIVTGDLITLSEQELVDCDTSYNEGCNGGLMDYAFEF 200

Query: 211 IIQNQGIATEDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVSIA 269
           II N GI TE++YPY+ V G C   +K A    I  YE+VP+  E++L KA+S QP+S+A
Sbjct: 201 IINNGGIDTEEDYPYKGVDGRCDQTRKNAKVVTIDLYEDVPANSEESLKKALSHQPISVA 260

Query: 270 IAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMK 329
           I      FQ Y  GIF+G+CGT LDH V  VG+G TE+G +YW++KNSWG +WG++GY++
Sbjct: 261 IEGGGRAFQLYDSGIFDGICGTDLDHGVVAVGYG-TENGKDYWIVKNSWGTSWGESGYIR 319

Query: 330 IVRD----EGLCGIGTRSSYPL 347
           + R+     G CGI    SYP+
Sbjct: 320 MERNIASSAGKCGIAVEPSYPI 341


>gi|224103643|ref|XP_002313136.1| predicted protein [Populus trichocarpa]
 gi|222849544|gb|EEE87091.1| predicted protein [Populus trichocarpa]
          Length = 477

 Score =  309 bits (792), Expect = 1e-81,   Method: Compositional matrix adjust.
 Identities = 152/350 (43%), Positives = 219/350 (62%), Gaps = 16/350 (4%)

Query: 10  SFKINTTPMFIIITLLVSCASQVVSSRSTH-------EQSVVEIHEKWMAQHGRSYKDEL 62
           SF    T  F+ + L +     ++     H       E   + ++E W+ ++G++Y    
Sbjct: 7   SFAFLATFYFLSVCLAID--MSIIDYNLKHGQVPERTEAETLRLYEMWLVKYGKAYNALG 64

Query: 63  EKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTT 122
           EKE R +IFK+NL+++++ N  GN +YKLG N+F+DL+N+E+RA Y G +M         
Sbjct: 65  EKERRFEIFKDNLKFVDQHNSVGNPSYKLGLNKFADLSNEEYRAAYLGTRMDGKRRLLGG 124

Query: 123 SSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQ 182
             + +Y      D+P S+DWR+KGAV P+K+Q +CG CWAF+ V AVEGI +I +GNL  
Sbjct: 125 PKSARYLFKDGDDLPESVDWREKGAVAPVKDQGQCGSCWAFSTVGAVEGINQIVTGNLTS 184

Query: 183 LSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPA-AA 241
           LSEQ+L+DC    N GC GG  + AF +I++N GI TE++YPY+AV   C   +K A   
Sbjct: 185 LSEQELVDCDKVYNQGCNGGLMDYAFEFIMKNGGIDTEEDYPYKAVDSMCDPNRKNARVV 244

Query: 242 KISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVG 301
            I  YE+VP  DE++L KAV+ QPVS+AI A    FQ Y+ G+F G CGTQLDH V  VG
Sbjct: 245 TIDGYEDVPQNDEKSLRKAVANQPVSVAIEAGGRAFQLYQSGVFTGSCGTQLDHGVVAVG 304

Query: 302 FGTTEDGANYWLIKNSWGNTWGDAGYMKIVR-----DEGLCGIGTRSSYP 346
           +G TE+G +YW+++NSWG  WG+ GY+++ R     + G CGI   +SYP
Sbjct: 305 YG-TENGVDYWVVRNSWGPAWGENGYIRMERNVASTETGKCGIAMEASYP 353


>gi|13897890|gb|AAK48495.1|AF259983_1 putative cysteine protease [Ipomoea batatas]
          Length = 462

 Score =  309 bits (791), Expect = 1e-81,   Method: Compositional matrix adjust.
 Identities = 152/316 (48%), Positives = 209/316 (66%), Gaps = 7/316 (2%)

Query: 37  STHEQSVVEIHEKWMAQHGRSYKD-ELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQ 95
           S  ++ V+ ++E W+ +HG+SY     EK+ R +IFK+NL YI++ N  G+R+YKLG N+
Sbjct: 39  SRSDEEVMALYESWLVEHGKSYNGLGGEKDKRFEIFKDNLRYIDEQNSRGDRSYKLGLNR 98

Query: 96  FSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQK 155
           F+DLTN+E+R+ Y G K  +    + T S  +Y   +   +P S+DWR+KGAV  +K+Q 
Sbjct: 99  FADLTNEEYRSTYLGAKTDARRRIAKTKSDRRYAPKAGGSLPDSIDWREKGAVAEVKDQG 158

Query: 156 ECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQ 215
            CG CWAF+ +AAVEGI +I +G LI LSEQ+L+DC T+ N GC GG  + AF +II+N 
Sbjct: 159 SCGSCWAFSTIAAVEGINQIVTGELISLSEQELVDCDTSYNEGCNGGLMDYAFEFIIKNG 218

Query: 216 GIATEDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYS 274
           GI TE +YPY    G C   +K A    I  YE+V   DE AL +AV+ QPVS+AI A  
Sbjct: 219 GIDTEADYPYTGRYGRCDQTRKNAKVVSIDGYEDVTPYDEAALKEAVAGQPVSVAIEAGG 278

Query: 275 TEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD- 333
            +FQ Y  GIF G CGT LDH VT VG+G TE+G +YW++KNSW  +WG+ GY+++ R+ 
Sbjct: 279 RDFQLYSSGIFTGSCGTDLDHGVTAVGYG-TENGVDYWIVKNSWAASWGEKGYLRMQRNV 337

Query: 334 ---EGLCGIGTRSSYP 346
               GLCGI    SYP
Sbjct: 338 KDKNGLCGIAIEPSYP 353


>gi|148907299|gb|ABR16787.1| unknown [Picea sitchensis]
          Length = 372

 Score =  309 bits (791), Expect = 1e-81,   Method: Compositional matrix adjust.
 Identities = 156/320 (48%), Positives = 212/320 (66%), Gaps = 18/320 (5%)

Query: 40  EQSVVEIHEKWMAQHGRSYK--DELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFS 97
           ++S+  +++KW  QH RS +  D  E   R +IFKEN+++I+  NK+ +  YKLG N+F+
Sbjct: 38  DESLRGLYDKWALQH-RSTRSLDSDEHARRFEIFKENVKHIDSVNKK-DGPYKLGLNKFA 95

Query: 98  DLTNDEFRALYTGYKMPSPS----HRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKN 153
           DL+N+EF+A++   KM         R   S +F YQN     +P S+DWR KGAVTP+KN
Sbjct: 96  DLSNEEFKAMHMTTKMEKHKSLRGDRGVESGSFMYQNSKR--LPASIDWRKKGAVTPVKN 153

Query: 154 QKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQ 213
           Q +CG CWAF+ +A+VEGI  I++G L+ LSEQQL+DCS   N GC GG  + AF YII 
Sbjct: 154 QGQCGSCWAFSTIASVEGINYIKTGKLVSLSEQQLVDCSKE-NAGCNGGLMDNAFQYIID 212

Query: 214 NQGIATEDEYPYQAVPGTCSAAQ---KPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAI 270
           N GI TEDEYPY A  G CS  +   K  A  I  +E+VP+ +E AL KAV+ QPVSIAI
Sbjct: 213 NGGIVTEDEYPYTAEAGECSTTKIESKSIATIIDGFEDVPANNEGALKKAVAHQPVSIAI 272

Query: 271 AAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKI 330
            A   +FQ Y  G+F G CGT+LDH V +VG+G + +G NYW+++NSWG  WG+ GY+++
Sbjct: 273 EASGHDFQFYSTGVFTGKCGTELDHGVVVVGYGKSPEGINYWIVRNSWGPEWGEQGYIRM 332

Query: 331 VRD----EGLCGIGTRSSYP 346
            R     EG CGI  ++SYP
Sbjct: 333 QRGIEATEGKCGISMQASYP 352


>gi|226495425|ref|NP_001148706.1| cysteine protease 1 precursor [Zea mays]
 gi|195621544|gb|ACG32602.1| cysteine protease 1 precursor [Zea mays]
          Length = 463

 Score =  309 bits (791), Expect = 2e-81,   Method: Compositional matrix adjust.
 Identities = 152/324 (46%), Positives = 210/324 (64%), Gaps = 12/324 (3%)

Query: 32  VVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKAN---KEGNRT 88
           +VS      +    ++ +WMA HGR+Y    E+E R ++F++NL YI+  N     G  +
Sbjct: 26  IVSYGERSXEEARRMYAEWMAAHGRTYNAVGEEERRYQVFRDNLRYIDAHNAAADAGVHS 85

Query: 89  YKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAV 148
           ++LG N+F+DLTNDE+RA Y G +      R   +   +Y      D+P S+DWR KGAV
Sbjct: 86  FRLGLNRFADLTNDEYRATYLGARTRPQRERKLGA---RYHAADNEDLPESVDWRAKGAV 142

Query: 149 TPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAF 208
             +K+Q  CG CWAF+ +AAVEGI +I +G+LI LSEQ+L+DC T+ N GC GG  + AF
Sbjct: 143 AEVKDQGSCGSCWAFSTIAAVEGINQIVTGDLISLSEQELVDCDTSYNQGCNGGLMDYAF 202

Query: 209 AYIIQNQGIATEDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVS 267
            +II N GI TE +YPY+   G C   +K A    I +YE+VP+ DE++L KAV+ QPVS
Sbjct: 203 EFIINNGGIDTEKDYPYKGTDGRCDVNRKNAKVVTIDSYEDVPANDEKSLQKAVANQPVS 262

Query: 268 IAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGY 327
           +AI A  T FQ Y  GIF G CGT LDH VT VG+G TE+G +YW++KNSWG++WG++GY
Sbjct: 263 VAIEAAGTAFQLYSSGIFTGSCGTALDHGVTAVGYG-TENGKDYWIVKNSWGSSWGESGY 321

Query: 328 MKIVRD----EGLCGIGTRSSYPL 347
           +++ R+     G CGI    SYPL
Sbjct: 322 VRMERNIKASSGKCGIAVEPSYPL 345


>gi|326502440|dbj|BAJ95283.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 349

 Score =  308 bits (790), Expect = 2e-81,   Method: Compositional matrix adjust.
 Identities = 152/317 (47%), Positives = 218/317 (68%), Gaps = 9/317 (2%)

Query: 38  THEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFS 97
           T + ++V  HEKWMA+HGR+Y +E EK  RL++F+ N + I+  N   + T++L TN+F+
Sbjct: 35  TVDSAMVSRHEKWMAEHGRTYANEEEKARRLEVFRANAKLIDSFNSAEDSTHRLATNRFA 94

Query: 98  DLTNDEFRALYTGYKMP--SPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQK 155
           DLT++EFRA  TG + P  + +   + +  F+Y+N S+ D   S+DWR  GAVT +K+Q 
Sbjct: 95  DLTDEEFRAARTGLRRPPAAAAGAGSGAGGFRYENFSLADAAGSMDWRAMGAVTGVKDQG 154

Query: 156 ECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNN-GCLGGSREKAFAYIIQN 214
            CGCCWAF+AVAAVEG+TKIR+G L+ LSEQQL+DC   G++ GC GG  + AF Y+I  
Sbjct: 155 SCGCCWAFSAVAAVEGLTKIRTGRLVSLSEQQLVDCDVYGDDEGCAGGLMDNAFEYMINR 214

Query: 215 QGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYS 274
            G+ TE  YPY+   G+C   +  +AA I  YE+VP+ +E AL+ AV+ QPVS+AI    
Sbjct: 215 GGLTTESSYPYRGTDGSCR--RSASAASIRGYEDVPANNEAALMAAVAHQPVSVAINGGD 272

Query: 275 TEFQSYKEGIFNGV-CGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKI--- 330
           + F+ Y  G+  G  CGT+L+HA+T VG+GT  DG  YW++KNSWG +WG+ GY++I   
Sbjct: 273 SVFRFYDSGVLGGSGCGTELNHAITAVGYGTASDGTKYWIMKNSWGGSWGEGGYVRIRRG 332

Query: 331 VRDEGLCGIGTRSSYPL 347
           VR EG+CG+   +SYP+
Sbjct: 333 VRGEGVCGLAQLASYPV 349


>gi|356563584|ref|XP_003550041.1| PREDICTED: cysteine proteinase RD21a-like [Glycine max]
          Length = 366

 Score =  308 bits (790), Expect = 2e-81,   Method: Compositional matrix adjust.
 Identities = 153/339 (45%), Positives = 212/339 (62%), Gaps = 9/339 (2%)

Query: 15  TTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKEN 74
           +T +F+  TL  SCA    +  +  +  V+ ++E+W+ +H + Y    EK+ R ++FK+N
Sbjct: 10  STLLFLSFTL--SCAIDTSTITNYTDNEVMTMYEEWLVKHQKVYNGLGEKDKRFQVFKDN 67

Query: 75  LEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSST-FKYQNLSM 133
           L +I++ N   N TYKLG N+F+D+TN+E+R +Y G K  +      T ST  +Y   + 
Sbjct: 68  LGFIQEHNNNQNNTYKLGLNKFADMTNEEYRVMYFGTKSDAKRRLMKTKSTGHRYAYSAG 127

Query: 134 TDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCST 193
             +P  +DWR KGAV PIK+Q  CG CWAF+ VA VE I KI +G  + LSEQ+L+DC  
Sbjct: 128 DQLPVHVDWRVKGAVAPIKDQGSCGSCWAFSTVATVEAINKIVTGKFVSLSEQELVDCDR 187

Query: 194 NGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSG 252
             N GC GG  + AF +IIQN GI T+ +YPY+   G C   +K A A  I  YE+VP  
Sbjct: 188 AYNQGCNGGLMDYAFEFIIQNGGIDTDKDYPYRGFDGICDPTKKNAKAVNIDGYEDVPPY 247

Query: 253 DEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYW 312
           DE AL KAV+ QPVSIAI A     Q Y+ G+F G CGT LDH V +VG+G +E+G +YW
Sbjct: 248 DENALKKAVARQPVSIAIEASGRALQLYQSGVFTGECGTSLDHGVVVVGYG-SENGVDYW 306

Query: 313 LIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPL 347
           L++NSWG  WG+ GY K+ R+     G CGI   +SYP+
Sbjct: 307 LVRNSWGTGWGEDGYFKMQRNVRTPTGKCGITMEASYPV 345


>gi|37780043|gb|AAP32194.1| cysteine protease 1 [Trifolium repens]
          Length = 292

 Score =  308 bits (790), Expect = 2e-81,   Method: Compositional matrix adjust.
 Identities = 153/295 (51%), Positives = 206/295 (69%), Gaps = 14/295 (4%)

Query: 63  EKEMRLKIFKENLEYIEKANKE-GNRTYKLGTNQFSDLTNDEFRALYTGYK--MPSPSHR 119
           E+E RL+IF +N+ YIE +N    N+ YKL  N+F+DLTN+EF A    +K  M S   R
Sbjct: 3   EREKRLRIFNKNVNYIEASNSAVNNKLYKLSINKFADLTNEEFIASRNKFKGHMCSSIIR 62

Query: 120 STTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGN 179
           +TT   FKY+N S   +P+++DWR KGAVTP+KNQ +CG CWAF+AVAA EGI ++ +G 
Sbjct: 63  TTT---FKYENASA--IPSTVDWRKKGAVTPVKNQGQCGSCWAFSAVAATEGIHQLSTGK 117

Query: 180 LIQLSEQQLLDCSTNG-NNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKP 238
           L+ LSEQ+L+DC T G + GC GG  + AF +IIQN G++TE +YPY+ V GTC+A +  
Sbjct: 118 LVSLSEQELIDCDTKGVDQGCEGGLMDDAFKFIIQNHGLSTEVQYPYEGVDGTCNANKAS 177

Query: 239 A-AAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAV 297
             A  I+ YE+VP+ +E AL KAV+ QP+S+AI A  ++FQ Y  G+F G CGT+LDH V
Sbjct: 178 IHAVTITGYEDVPANNELALQKAVANQPISVAIDASGSDFQFYNSGVFTGSCGTELDHGV 237

Query: 298 TIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPLA 348
           T VG+G   DG  YWL+KNSWG  WG+ GY+++ R     EGLCGI  ++SYP A
Sbjct: 238 TAVGYGVGNDGTKYWLVKNSWGADWGEEGYIRMQRGIAAAEGLCGIAMQASYPTA 292


>gi|297816028|ref|XP_002875897.1| hypothetical protein ARALYDRAFT_347926 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297321735|gb|EFH52156.1| hypothetical protein ARALYDRAFT_347926 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 361

 Score =  308 bits (790), Expect = 2e-81,   Method: Compositional matrix adjust.
 Identities = 154/342 (45%), Positives = 217/342 (63%), Gaps = 19/342 (5%)

Query: 18  MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHG--RSYKDELEKEMRLKIFKENL 75
           +F ++ L  +C           E+ + +++++W + H   RS     E+E R  +F+ N+
Sbjct: 9   LFSLVILETACGFDYEDKEIESEEGLSKLYDRWRSHHSVPRSLH---EREKRFNVFRHNV 65

Query: 76  EYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHR-----STTSSTFKYQN 130
            ++  +NK+ NR+YKL  N+F+DLT  EF+  YTG K+    HR        S  F Y +
Sbjct: 66  MHVHNSNKK-NRSYKLKLNKFADLTIHEFKNAYTGSKIKH--HRMLQGPKRGSKQFMYDH 122

Query: 131 LSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLD 190
            +++ +P+S+DWR KGAVT IKNQ +CG CWAF+ VAAVEGI KI++  L+ LSEQ+L+D
Sbjct: 123 ENVSKLPSSVDWRKKGAVTEIKNQGKCGSCWAFSTVAAVEGINKIKTNKLVSLSEQELVD 182

Query: 191 CSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAA-AKISNYEEV 249
           C TN N GC GG  E AF +I +N GI TED YPY+ + G C A++       I  +E V
Sbjct: 183 CDTNQNEGCNGGLMEIAFEFIKKNGGITTEDSYPYEGIDGKCDASKDNGVLVTIDGHENV 242

Query: 250 PSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGA 309
           P  DE ALLKAV+ QPVS+AI A S++FQ Y EG+F G CGT+L+H V  VG+G ++ G 
Sbjct: 243 PENDENALLKAVANQPVSVAIDAGSSDFQFYSEGVFTGDCGTELNHGVATVGYG-SQGGK 301

Query: 310 NYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPL 347
            YW+++NSWG  WG+ GY+KI R     EG CGI   +SYP+
Sbjct: 302 KYWIVRNSWGTEWGEGGYIKIERGIDEPEGRCGIAMEASYPI 343


>gi|226529105|ref|NP_001150196.1| cysteine protease 1 precursor [Zea mays]
 gi|194701798|gb|ACF84983.1| unknown [Zea mays]
 gi|194704800|gb|ACF86484.1| unknown [Zea mays]
 gi|195637480|gb|ACG38208.1| cysteine protease 1 precursor [Zea mays]
 gi|413919895|gb|AFW59827.1| cysteine protease 1 [Zea mays]
          Length = 470

 Score =  308 bits (789), Expect = 2e-81,   Method: Compositional matrix adjust.
 Identities = 147/318 (46%), Positives = 217/318 (68%), Gaps = 13/318 (4%)

Query: 40  EQSVVEIHEKWMAQHGRSY----KDELEKEMRLKIFKENLEYIEKAN-KEGNRTYKLGTN 94
           E  V  +++ W+A+HGR+Y    + E E++ R  +F +NL +++  N + G R ++LG N
Sbjct: 50  EPEVRAMYDLWLAEHGRAYNALGEGEGERDRRFLVFWDNLRFVDAHNERAGARGFRLGMN 109

Query: 95  QFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQ 154
           QF+DLTNDEFRA Y G  +P+    +     +++   +  ++P S+DWR+KGAV P+KNQ
Sbjct: 110 QFADLTNDEFRAAYLGAMVPAARRGAVVGERYRHDG-AAEELPESVDWREKGAVAPVKNQ 168

Query: 155 KECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GNNGCLGGSREKAFAYIIQ 213
            +CG CWAF+AV++VE + +I +G ++ LSEQ+L++CST+ GN+GC GG  + AF +II+
Sbjct: 169 GQCGSCWAFSAVSSVESVNQIVTGEMVTLSEQELVECSTDGGNSGCNGGLMDAAFDFIIK 228

Query: 214 NQGIATEDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVSIAIAA 272
           N GI TED+YPY+AV G C   +K A    I  +E+VP  DE++L KAV+ QPVS+AI A
Sbjct: 229 NGGIDTEDDYPYRAVDGKCDMNRKNARVVSIDGFEDVPENDEKSLQKAVAHQPVSVAIEA 288

Query: 273 YSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR 332
              EFQ YK G+F+G C T LDH V  VG+G  E+G +YW+++NSWG  WG+AGY+++ R
Sbjct: 289 GGREFQLYKSGVFSGSCTTNLDHGVVAVGYG-AENGKDYWIVRNSWGPKWGEAGYIRMER 347

Query: 333 D----EGLCGIGTRSSYP 346
           +     G CGI   +SYP
Sbjct: 348 NVNASTGKCGIAMMASYP 365


>gi|58531896|gb|AAW78660.1| cysteine protease [Nicotiana tabacum]
          Length = 361

 Score =  308 bits (789), Expect = 3e-81,   Method: Compositional matrix adjust.
 Identities = 151/313 (48%), Positives = 207/313 (66%), Gaps = 16/313 (5%)

Query: 45  EIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEF 104
           E++E+W + H  S   + EK+ R  +FK N+ Y+   NK+ ++ YKL  N+F+D+TN EF
Sbjct: 36  ELYERWRSHHTVSRSLD-EKDKRFNVFKANVHYVHNFNKK-DKPYKLKLNKFADMTNHEF 93

Query: 105 RALYTGYKMPSPSHRS-----TTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGC 159
           R  Y G K+    HRS       + TF Y N+   DVP S+DWR KGAVTP+K+Q +CG 
Sbjct: 94  RHHYAGSKIKH--HRSFLGASRANGTFMYANVE--DVPPSVDWRKKGAVTPVKDQGKCGS 149

Query: 160 CWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIAT 219
           CWAF+ V AVEGI +I++  L+ LSEQ+L+DC T+ N GC GG  + AF +I +  GI T
Sbjct: 150 CWAFSTVVAVEGINQIKTNELVSLSEQELVDCDTSQNQGCNGGLMDMAFEFIKKKGGINT 209

Query: 220 EDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQ 278
           E+ YPY A  G C   ++ +    I  YE+VP  DE +LLKAV+ QPVS+AI A  ++FQ
Sbjct: 210 EENYPYMAEGGECDIQKRNSPVVSIDGYEDVPPNDEDSLLKAVANQPVSVAIQASGSDFQ 269

Query: 279 SYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR----DE 334
            Y EG+F G CGT+LDH V IVG+GTT DG  YW+++NSWG  WG+ GY+++ R    +E
Sbjct: 270 FYSEGVFTGDCGTELDHGVAIVGYGTTLDGTKYWIVRNSWGPEWGEKGYIRMQREIDAEE 329

Query: 335 GLCGIGTRSSYPL 347
           GLCGI  + SYP+
Sbjct: 330 GLCGIAMQPSYPI 342


>gi|334185815|ref|NP_680113.3| putative cysteine proteinase [Arabidopsis thaliana]
 gi|75313879|sp|Q9STL4.1|CEP2_ARATH RecName: Full=KDEL-tailed cysteine endopeptidase CEP2; Flags:
           Precursor
 gi|4678354|emb|CAB41164.1| cysteine endopeptidase-like protein [Arabidopsis thaliana]
 gi|332644882|gb|AEE78403.1| putative cysteine proteinase [Arabidopsis thaliana]
          Length = 361

 Score =  308 bits (789), Expect = 3e-81,   Method: Compositional matrix adjust.
 Identities = 153/342 (44%), Positives = 217/342 (63%), Gaps = 19/342 (5%)

Query: 18  MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHG--RSYKDELEKEMRLKIFKENL 75
           +F ++ L  +C           E+ +  ++++W + H   RS     E+E R  +F+ N+
Sbjct: 9   LFSLVILQTACGFDYDDKEIESEEGLSTLYDRWRSHHSVPRSLN---EREKRFNVFRHNV 65

Query: 76  EYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTG-----YKMPSPSHRSTTSSTFKYQN 130
            ++   NK+ NR+YKL  N+F+DLT +EF+  YTG     ++M     R +    + ++N
Sbjct: 66  MHVHNTNKK-NRSYKLKLNKFADLTINEFKNAYTGSNIKHHRMLQGPKRGSKQFMYDHEN 124

Query: 131 LSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLD 190
           LS   +P+S+DWR KGAVT IKNQ +CG CWAF+ VAAVEGI KI++  L+ LSEQ+L+D
Sbjct: 125 LSK--LPSSVDWRKKGAVTEIKNQGKCGSCWAFSTVAAVEGINKIKTNKLVSLSEQELVD 182

Query: 191 CSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAA-AKISNYEEV 249
           C T  N GC GG  E AF +I +N GI TED YPY+ + G C A++       I  +E+V
Sbjct: 183 CDTKQNEGCNGGLMEIAFEFIKKNGGITTEDSYPYEGIDGKCDASKDNGVLVTIDGHEDV 242

Query: 250 PSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGA 309
           P  DE ALLKAV+ QPVS+AI A S++FQ Y EG+F G CGT+L+H V  VG+G +E G 
Sbjct: 243 PENDENALLKAVANQPVSVAIDAGSSDFQFYSEGVFTGSCGTELNHGVAAVGYG-SERGK 301

Query: 310 NYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPL 347
            YW+++NSWG  WG+ GY+KI R+    EG CGI   +SYP+
Sbjct: 302 KYWIVRNSWGAEWGEGGYIKIEREIDEPEGRCGIAMEASYPI 343


>gi|2414570|emb|CAB16317.1| cysteine proteinase precursor [Nicotiana tabacum]
          Length = 374

 Score =  308 bits (788), Expect = 3e-81,   Method: Compositional matrix adjust.
 Identities = 154/318 (48%), Positives = 206/318 (64%), Gaps = 13/318 (4%)

Query: 38  THEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFS 97
           + E  V   +E W+A+HGR+Y    EKE R +IFK+NL +IE  N  GNRTYK+G NQF+
Sbjct: 41  SDEDQVKNRYEMWLAEHGRAYNALGEKEKRFEIFKDNLRFIEGHNNSGNRTYKVGLNQFA 100

Query: 98  DLTNDEFRALYTGYKMPSPSHRSTTSS---TFKYQNLSMTDVPTSLDWRDKGAVTPIKNQ 154
           DLTN+E+R +Y G K  S + R    S   + +Y +     +P S+DWR +GAV PIKNQ
Sbjct: 101 DLTNEEYRTMYLGTK--SDARRRFVKSKNPSQRYASRPNELMPHSVDWRKRGAVAPIKNQ 158

Query: 155 KECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQN 214
             CG CWAF+ VAAVEGI +I +G +I LSEQ+L+DC    N+GC GG  + AF +II N
Sbjct: 159 GSCGSCWAFSTVAAVEGINQIVTGEMITLSEQELVDCDRVQNSGCNGGLMDYAFEFIISN 218

Query: 215 QGIATEDEYPYQAVPGTCSAAQKP-AAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAY 273
            G+ TE  YPY+ V G C   +K      I  YE+VP  +E+AL KAV+ QPV +AI A 
Sbjct: 219 GGMDTEKHYPYRGVEGRCDPVRKNYKVVSIDGYEDVPR-NERALQKAVAHQPVCVAIEAS 277

Query: 274 STEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD 333
              FQ Y  G+F G CG ++DH V +VG+G +EDG +YW+++NSWG  WG+ GY+K+ R+
Sbjct: 278 GRAFQLYSSGVFTGECGEEVDHGVVVVGYG-SEDGVDYWIVRNSWGTKWGENGYVKMERN 336

Query: 334 E-----GLCGIGTRSSYP 346
                 G CGI T +SYP
Sbjct: 337 VKKSHLGKCGIMTEASYP 354


>gi|62320725|dbj|BAD95392.1| cysteine proteinase RD21A [Arabidopsis thaliana]
          Length = 433

 Score =  308 bits (788), Expect = 3e-81,   Method: Compositional matrix adjust.
 Identities = 150/323 (46%), Positives = 216/323 (66%), Gaps = 12/323 (3%)

Query: 32  VVSSRSTHEQSVVEIHEKWMAQHGR--SYKDELEKEMRLKIFKENLEYIEKANKEGNRTY 89
           V ++    E  V+ I+E W+ +HG+  S    +EK+ R +IFK+NL ++++ N E N +Y
Sbjct: 35  VSTTGGRSEAEVMSIYEAWLVKHGKAQSQNSLVEKDRRFEIFKDNLRFVDEHN-EKNLSY 93

Query: 90  KLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVT 149
           +LG  +F+DLTNDE+R+ Y G KM     R T+    +Y+     ++P S+DWR KGAV 
Sbjct: 94  RLGLTRFADLTNDEYRSKYLGAKMEKKGERRTS---LRYEARVGDELPESIDWRKKGAVA 150

Query: 150 PIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFA 209
            +K+Q  CG CWAF+ + AVEGI +I +G+LI LSEQ+L+DC T+ N GC GG  + AF 
Sbjct: 151 EVKDQGGCGSCWAFSTIGAVEGINQIVTGDLITLSEQELVDCDTSYNEGCNGGLMDYAFE 210

Query: 210 YIIQNQGIATEDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVSI 268
           +II+N GI T+ +YPY+ V GTC   +K A    I +YE+VP+  E++L KAV+ QP+SI
Sbjct: 211 FIIKNGGIDTDKDYPYKGVDGTCDQIRKNAKVVTIDSYEDVPTYSEESLKKAVAHQPISI 270

Query: 269 AIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYM 328
           AI A    FQ Y  GIF+G CGTQLDH V  VG+G TE+G +YW+++NSWG +WG++GY+
Sbjct: 271 AIEAGGRAFQLYDSGIFDGSCGTQLDHGVVAVGYG-TENGKDYWIVRNSWGKSWGESGYL 329

Query: 329 KIVRD----EGLCGIGTRSSYPL 347
           ++ R+     G CGI    SYP+
Sbjct: 330 RMARNIASSSGKCGIAIEPSYPI 352


>gi|13432122|sp|P80884.2|ANAN_ANACO RecName: Full=Ananain; Flags: Precursor
 gi|2623956|emb|CAA05487.1| Ananain precursor [Ananas comosus]
          Length = 345

 Score =  308 bits (788), Expect = 3e-81,   Method: Compositional matrix adjust.
 Identities = 148/333 (44%), Positives = 211/333 (63%), Gaps = 10/333 (3%)

Query: 18  MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
           +F+ + L V  AS   +S       +++  E+WMA++GR YKD  EK +R +IFK N+ +
Sbjct: 8   VFLFLFLCVMWASPSAASCDEPSDPMMKQFEEWMAEYGRVYKDNDEKMLRFQIFKNNVNH 67

Query: 78  IEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVP 137
           IE  N     +Y LG NQF+D+TN+EF A YTG  +P    R    S   + ++ ++ VP
Sbjct: 68  IETFNNRNGNSYTLGINQFTDMTNNEFVAQYTGLSLPLNIKREPVVS---FDDVDISSVP 124

Query: 138 TSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNN 197
            S+DWRD GAVT +KNQ  CG CWAFA++A VE I KI+ GNL+ LSEQQ+LDC+   + 
Sbjct: 125 QSIDWRDSGAVTSVKNQGRCGSCWAFASIATVESIYKIKRGNLVSLSEQQVLDCAV--SY 182

Query: 198 GCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQAL 257
           GC GG   KA+++II N+G+A+   YPY+A  GTC     P +A I+ Y  V   +E+ +
Sbjct: 183 GCKGGWINKAYSFIISNKGVASAAIYPYKAAKGTCKTNGVPNSAYITRYTYVQRNNERNM 242

Query: 258 LKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNS 317
           + AVS QP++ A+ A S  FQ YK G+F G CGT+L+HA+ I+G+G    G  +W+++NS
Sbjct: 243 MYAVSNQPIAAALDA-SGNFQHYKRGVFTGPCGTRLNHAIVIIGYGQDSSGKKFWIVRNS 301

Query: 318 WGNTWGDAGYMKIVRDE----GLCGIGTRSSYP 346
           WG  WG+ GY+++ RD     GLCGI     YP
Sbjct: 302 WGAGWGEGGYIRLARDVSSSFGLCGIAMDPLYP 334


>gi|242081867|ref|XP_002445702.1| hypothetical protein SORBIDRAFT_07g024430 [Sorghum bicolor]
 gi|241942052|gb|EES15197.1| hypothetical protein SORBIDRAFT_07g024430 [Sorghum bicolor]
          Length = 372

 Score =  308 bits (788), Expect = 3e-81,   Method: Compositional matrix adjust.
 Identities = 149/316 (47%), Positives = 203/316 (64%), Gaps = 13/316 (4%)

Query: 40  EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDL 99
           E+++  ++E+W  +H  + +D  +K  R  +FKEN+  I   N+  +  YKL  N+F D+
Sbjct: 40  EEALWALYERWRGRHAVA-RDLGDKARRFNVFKENVRLIHDFNQR-DEPYKLRLNRFGDM 97

Query: 100 TNDEFRALYTGYKMPSP----SHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQK 155
           T DEFR  Y G ++         R  ++S+F Y      D+PTS+DWR KGAVT +K+Q 
Sbjct: 98  TADEFRRHYAGSRVAHHRMFRGDRQGSASSFMYAG--ARDLPTSVDWRQKGAVTDVKDQG 155

Query: 156 ECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQ 215
           +CG CWAF+ +AAVEGI  I++ NL  LSEQQL+DC T GN GC GG  + AF YI ++ 
Sbjct: 156 QCGSCWAFSTIAAVEGINAIKTKNLTSLSEQQLVDCDTKGNAGCDGGLMDYAFQYIAKHG 215

Query: 216 GIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYST 275
           G+A ED YPY+A   +C  +  PA   I  YE+VP+ DE AL KAV+ QPVS+AI A  +
Sbjct: 216 GVAAEDAYPYKARQASCKKSPAPAVT-IDGYEDVPANDESALKKAVAHQPVSVAIEASGS 274

Query: 276 EFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD-- 333
            FQ Y EG+F G CGT+LDH VT VG+G   DG  YW++KNSWG  WG+ GY+++ RD  
Sbjct: 275 HFQFYSEGVFAGRCGTELDHGVTAVGYGVAADGTKYWVVKNSWGPEWGEKGYIRMARDVA 334

Query: 334 --EGLCGIGTRSSYPL 347
             EG CGI   +SYP+
Sbjct: 335 AKEGHCGIAMEASYPV 350


>gi|18401614|ref|NP_564497.1| cysteine proteinase RD21a [Arabidopsis thaliana]
 gi|1172873|sp|P43297.1|RD21A_ARATH RecName: Full=Cysteine proteinase RD21a; Short=RD21; Flags:
           Precursor
 gi|12321010|gb|AAG50628.1|AC083835_13 cysteine protease, putative [Arabidopsis thaliana]
 gi|435619|dbj|BAA02374.1| thiol protease [Arabidopsis thaliana]
 gi|18175926|gb|AAL59952.1| putative cysteine proteinase RD21A [Arabidopsis thaliana]
 gi|22136972|gb|AAM91715.1| putative cysteine proteinase RD21A [Arabidopsis thaliana]
 gi|332194014|gb|AEE32135.1| cysteine proteinase RD21a [Arabidopsis thaliana]
          Length = 462

 Score =  307 bits (787), Expect = 4e-81,   Method: Compositional matrix adjust.
 Identities = 150/323 (46%), Positives = 216/323 (66%), Gaps = 12/323 (3%)

Query: 32  VVSSRSTHEQSVVEIHEKWMAQHGR--SYKDELEKEMRLKIFKENLEYIEKANKEGNRTY 89
           V ++    E  V+ I+E W+ +HG+  S    +EK+ R +IFK+NL ++++ N E N +Y
Sbjct: 35  VSTTGGRSEAEVMSIYEAWLVKHGKAQSQNSLVEKDRRFEIFKDNLRFVDEHN-EKNLSY 93

Query: 90  KLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVT 149
           +LG  +F+DLTNDE+R+ Y G KM     R T+    +Y+     ++P S+DWR KGAV 
Sbjct: 94  RLGLTRFADLTNDEYRSKYLGAKMEKKGERRTS---LRYEARVGDELPESIDWRKKGAVA 150

Query: 150 PIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFA 209
            +K+Q  CG CWAF+ + AVEGI +I +G+LI LSEQ+L+DC T+ N GC GG  + AF 
Sbjct: 151 EVKDQGGCGSCWAFSTIGAVEGINQIVTGDLITLSEQELVDCDTSYNEGCNGGLMDYAFE 210

Query: 210 YIIQNQGIATEDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVSI 268
           +II+N GI T+ +YPY+ V GTC   +K A    I +YE+VP+  E++L KAV+ QP+SI
Sbjct: 211 FIIKNGGIDTDKDYPYKGVDGTCDQIRKNAKVVTIDSYEDVPTYSEESLKKAVAHQPISI 270

Query: 269 AIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYM 328
           AI A    FQ Y  GIF+G CGTQLDH V  VG+G TE+G +YW+++NSWG +WG++GY+
Sbjct: 271 AIEAGGRAFQLYDSGIFDGSCGTQLDHGVVAVGYG-TENGKDYWIVRNSWGKSWGESGYL 329

Query: 329 KIVRD----EGLCGIGTRSSYPL 347
           ++ R+     G CGI    SYP+
Sbjct: 330 RMARNIASSSGKCGIAIEPSYPI 352


>gi|358348957|ref|XP_003638507.1| Cysteine proteinase [Medicago truncatula]
 gi|355504442|gb|AES85645.1| Cysteine proteinase [Medicago truncatula]
          Length = 362

 Score =  307 bits (787), Expect = 4e-81,   Method: Compositional matrix adjust.
 Identities = 157/349 (44%), Positives = 218/349 (62%), Gaps = 22/349 (6%)

Query: 15  TTPMFIIITLLVSCASQVVSSRSTHE------QSVVEIHEKWMAQHGRSYKDELEKEMRL 68
           TT   ++I L ++    V  S   H+      +S+ +++E+W + H  S ++  EK+ R 
Sbjct: 2   TTKKLLLIVLSIALVLVVSESFDFHDKDVSSDESLWDLYERWRSHHTVS-RNLNEKQKRF 60

Query: 69  KIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHR-----STTS 123
            +FK N+ ++   NK  ++ YKL  N+F+D+TN EF+  Y G K+    HR        S
Sbjct: 61  NVFKSNVMHVHNTNKM-DKPYKLKLNKFADMTNHEFKTTYAGSKVNH--HRMFRGTPRVS 117

Query: 124 STFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQL 183
            TF Y+N   T  P S+DWR KGAVT +K+Q +CG CWAF+ V AVEGI +I++  L+ L
Sbjct: 118 GTFMYENF--TKAPASVDWRKKGAVTDVKDQGQCGSCWAFSTVVAVEGINQIKTNRLVPL 175

Query: 184 SEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPA-AAK 242
           SEQ+L+DC    N GC GG  E AF YI Q  GI TE  YPY A  G+C A ++   A  
Sbjct: 176 SEQELIDCDNQENQGCNGGLMEYAFEYIKQKGGITTESYYPYTANDGSCDATKENVPAVS 235

Query: 243 ISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGF 302
           I  +E VP+ DE ALLKAV+ QPVS+AI A  ++FQ Y EG+F G CG +L+H V IVG+
Sbjct: 236 IDGHETVPANDEDALLKAVANQPVSVAIDAGGSDFQFYSEGVFTGDCGKELNHGVAIVGY 295

Query: 303 GTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPL 347
           GTT DG NYW+++NSWG  WG+ GY+++ R+    EGLCGI   +SYP+
Sbjct: 296 GTTVDGTNYWIVRNSWGAEWGEQGYIRMKRNVSNKEGLCGIAMEASYPV 344


>gi|14517542|gb|AAK62661.1| F2G19.31/F2G19.31 [Arabidopsis thaliana]
 gi|19548039|gb|AAL87383.1| F2G19.31/F2G19.31 [Arabidopsis thaliana]
          Length = 462

 Score =  307 bits (787), Expect = 4e-81,   Method: Compositional matrix adjust.
 Identities = 150/323 (46%), Positives = 216/323 (66%), Gaps = 12/323 (3%)

Query: 32  VVSSRSTHEQSVVEIHEKWMAQHGR--SYKDELEKEMRLKIFKENLEYIEKANKEGNRTY 89
           V ++    E  V+ I+E W+ +HG+  S    +EK+ R +IFK+NL ++++ N E N +Y
Sbjct: 35  VSTTGGRSEAEVMSIYEAWLVKHGKAQSQNSLVEKDRRFEIFKDNLRFVDEHN-EKNLSY 93

Query: 90  KLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVT 149
           +LG  +F+DLTNDE+R+ Y G KM     R T+    +Y+     ++P S+DWR KGAV 
Sbjct: 94  RLGLTRFADLTNDEYRSKYLGAKMEKKGERRTS---LRYEARVGDELPESIDWRKKGAVA 150

Query: 150 PIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFA 209
            +K+Q  CG CWAF+ + AVEGI +I +G+LI LSEQ+L+DC T+ N GC GG  + AF 
Sbjct: 151 EVKDQGGCGSCWAFSTIGAVEGINQIVTGDLITLSEQELVDCDTSYNEGCNGGLMDYAFE 210

Query: 210 YIIQNQGIATEDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVSI 268
           +II+N GI T+ +YPY+ V GTC   +K A    I +YE+VP+  E++L KAV+ QP+SI
Sbjct: 211 FIIKNGGIDTDKDYPYKGVDGTCDQIRKNAKVVTIDSYEDVPTYSEESLKKAVAHQPISI 270

Query: 269 AIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYM 328
           AI A    FQ Y  GIF+G CGTQLDH V  VG+G TE+G +YW+++NSWG +WG++GY+
Sbjct: 271 AIEAGGRAFQLYDSGIFDGSCGTQLDHGVVAVGYG-TENGKDYWIVRNSWGKSWGESGYL 329

Query: 329 KIVRD----EGLCGIGTRSSYPL 347
           ++ R+     G CGI    SYP+
Sbjct: 330 RMARNIASSSGKCGIAIEPSYPI 352


>gi|46401612|dbj|BAD16614.1| cysteine proteinase [Dianthus caryophyllus]
          Length = 459

 Score =  307 bits (787), Expect = 4e-81,   Method: Compositional matrix adjust.
 Identities = 149/339 (43%), Positives = 228/339 (67%), Gaps = 11/339 (3%)

Query: 18  MFIIITLLVSCA---SQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKEN 74
           +F II ++ S A   S +  + +  +  +  ++E W+ +HG++Y    EK++R  IFK+N
Sbjct: 11  LFSIIFIVSSSALDLSIIDRAFNRPDDEIASLYETWLVKHGKNYNGLGEKQLRFNIFKDN 70

Query: 75  LEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPS-HRSTTSSTFKYQNLSM 133
           L ++++ N E N ++KLG N+F+DLTN+E+R++Y G +  S +  RS  S + +Y   + 
Sbjct: 71  LRFVDERNSE-NLSFKLGLNRFADLTNEEYRSVYLGTRPRSVAVARSGRSKSDRYAFRAG 129

Query: 134 TDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCST 193
             +P S+DWR KGAV  IK+Q  CG CWAF+A+AAVEG+ +I +G+LI LSEQ+L++C T
Sbjct: 130 DTLPESVDWRKKGAVAGIKDQGSCGSCWAFSAIAAVEGVNQIVTGDLISLSEQELVECDT 189

Query: 194 NGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSG 252
           + N+GC GG  + AF +II+N+GI ++++YPY    G C   +K A    I +YE+ P  
Sbjct: 190 SYNDGCDGGLMDYAFEFIIKNEGIDSDEDYPYTGRDGRCDTNRKNAKVVTIDDYEDSPVY 249

Query: 253 DEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYW 312
           DE++L KAV+ QPVS+AI     +FQ Y  G+F G CGT LDH V +VG+G TEDG +YW
Sbjct: 250 DEKSLQKAVANQPVSVAIEGGGRDFQLYDSGVFTGKCGTALDHGVAVVGYG-TEDGLDYW 308

Query: 313 LIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPL 347
           +++NSWG+TWG+ GY+++ R+     G+CGI    SYP+
Sbjct: 309 IVRNSWGDTWGEGGYIRMQRNTKLPSGICGIAIEPSYPI 347


>gi|224136808|ref|XP_002326950.1| predicted protein [Populus trichocarpa]
 gi|222835265|gb|EEE73700.1| predicted protein [Populus trichocarpa]
          Length = 456

 Score =  307 bits (787), Expect = 4e-81,   Method: Compositional matrix adjust.
 Identities = 154/343 (44%), Positives = 216/343 (62%), Gaps = 17/343 (4%)

Query: 18  MFIIITLLVSCASQVVSSRSTH--------EQSVVEIHEKWMAQHGRSYKDELEKEMRLK 69
           +F++  L  +    ++S   TH        +  V+ ++E+W+ +HG++Y    EKE R +
Sbjct: 5   LFLVFALSSAFDMSIISYHQTHATKSSWRTDDEVMAMYEEWLVKHGKNYNALGEKEKRFE 64

Query: 70  IFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQ 129
           IFK+NL +I++ N E NRTY +G N+F+DLTN+EFR++Y G +         TS   +Y 
Sbjct: 65  IFKDNLMFIDQHNSE-NRTYTVGLNRFADLTNEEFRSMYLGTRTGHKKRLPKTSD--RYA 121

Query: 130 NLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLL 189
                 +P S+DWR +GAV  +K+Q  CG CWAF+ +AAVEGI KI +G+LI LSEQ+L+
Sbjct: 122 PRVGDSLPDSVDWRKEGAVAEVKDQGGCGSCWAFSTIAAVEGINKIVTGDLIALSEQELV 181

Query: 190 DCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPA-AAKISNYEE 248
           DC T+ N GC GG  + AF +II N GI TED+YPY    G C   +K A    I +YE+
Sbjct: 182 DCDTSYNEGCNGGLMDYAFEFIINNGGIDTEDDYPYLGRDGRCDTYRKNAKVVSIDSYED 241

Query: 249 VPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDG 308
           VP  DE AL KAV+ QPVS+AI      FQ Y  G+F G CGT LDH V  VG+G TE G
Sbjct: 242 VPENDETALKKAVANQPVSVAIEGGGRNFQLYNSGVFTGECGTSLDHGVAAVGYG-TEKG 300

Query: 309 ANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPL 347
            +YW+++NSWG +WG++GY+++ R+     G CGI    SYP+
Sbjct: 301 KDYWIVRNSWGKSWGESGYIRMERNIASPTGKCGIAIEPSYPI 343


>gi|226507950|ref|NP_001151278.1| LOC100284911 precursor [Zea mays]
 gi|195645488|gb|ACG42212.1| vignain precursor [Zea mays]
          Length = 376

 Score =  307 bits (786), Expect = 5e-81,   Method: Compositional matrix adjust.
 Identities = 150/320 (46%), Positives = 205/320 (64%), Gaps = 19/320 (5%)

Query: 40  EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDL 99
           E+++  ++E+W  +H  + +D  +K  R  +FK N+  I + N+  +  YKL  N+F D+
Sbjct: 42  EEALWALYERWRGRHALA-RDLGDKARRFNVFKANVRLIHEFNRR-DEPYKLRLNRFGDM 99

Query: 100 TNDEFRALYTGYKMPSPSHR--------STTSSTFKYQNLSMTDVPTSLDWRDKGAVTPI 151
           T DEFR  Y G ++    HR        S+ S++F Y +    DVP S+DWR KGAVT +
Sbjct: 100 TADEFRRHYAGSRVAH--HRMFRGDRQGSSASASFMYAD--ARDVPASVDWRQKGAVTDV 155

Query: 152 KNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYI 211
           K+Q +CG CWAF+ +AAVEGI  I++ NL  LSEQQL+DC T  N GC GG  + AF YI
Sbjct: 156 KDQGQCGSCWAFSTIAAVEGINAIKTKNLTSLSEQQLVDCDTKANAGCNGGLMDYAFQYI 215

Query: 212 IQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIA 271
            ++ G+A ED YPY+A   +C  +  P    I  YE+VP+ DE AL KAV+ QPVS+AI 
Sbjct: 216 AKHGGVAAEDAYPYRARQASCKKSPAPVVT-IDGYEDVPANDESALKKAVAHQPVSVAIE 274

Query: 272 AYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIV 331
           A  + FQ Y EG+F+G CGT+LDH VT VG+G T DG  YWL+KNSWG  WG+ GY+++ 
Sbjct: 275 ASGSHFQFYSEGVFSGRCGTELDHGVTAVGYGVTADGTKYWLVKNSWGPEWGEKGYIRMA 334

Query: 332 RD----EGLCGIGTRSSYPL 347
           RD    EG CGI   +SYP+
Sbjct: 335 RDVAAKEGHCGIAMEASYPV 354


>gi|148927382|gb|ABR19827.1| cysteine proteinase [Elaeis guineensis]
          Length = 470

 Score =  307 bits (786), Expect = 5e-81,   Method: Compositional matrix adjust.
 Identities = 153/319 (47%), Positives = 210/319 (65%), Gaps = 12/319 (3%)

Query: 36  RSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKAN---KEGNRTYKLG 92
           RS  E  +  ++E W+A+HGR+Y    EKE R +IFK+N+ +I+  N     G+R+++LG
Sbjct: 41  RSEEEMRI--LYEGWLAKHGRAYNALGEKERRFEIFKDNVLFIDAHNAAADAGHRSFRLG 98

Query: 93  TNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIK 152
            N+F+D+TN+E+RA+Y G + P+   R     + +Y+  +  D+P S+DWR KGAV  +K
Sbjct: 99  LNRFADMTNEEYRAVYLGTR-PAGHRRRARVGSDRYRYNAGEDLPESVDWRAKGAVAAVK 157

Query: 153 NQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYII 212
           +Q  CG CWAF+ VAAVEGI KI +G+LI LSEQ+L+DC    N GC GG  +  F +II
Sbjct: 158 DQGSCGSCWAFSTVAAVEGINKIVTGDLISLSEQELVDCDNGYNQGCNGGLMDYGFEFII 217

Query: 213 QNQGIATEDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVSIAIA 271
            N GI TE++YPY A  G C   +K A    I  YE+VP  DE+AL KAV+ QPVS+AI 
Sbjct: 218 NNGGIDTEEDYPYTARDGKCDQYRKNAKVVSIDGYEDVPVNDEKALQKAVANQPVSVAIE 277

Query: 272 AYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIV 331
           A   EFQ Y  GIF G CGT LDH V  VG+G TE+G +YW+++NSWG  WG++GY+++ 
Sbjct: 278 AGGREFQLYHSGIFTGRCGTDLDHGVVAVGYG-TENGKDYWIVRNSWGGDWGESGYIRME 336

Query: 332 RD----EGLCGIGTRSSYP 346
           R+     G CGI    SYP
Sbjct: 337 RNVNTSTGKCGIAIEPSYP 355


>gi|2224810|emb|CAB09698.1| cysteine proteinase [Hordeum vulgare subsp. vulgare]
          Length = 349

 Score =  307 bits (786), Expect = 5e-81,   Method: Compositional matrix adjust.
 Identities = 151/317 (47%), Positives = 217/317 (68%), Gaps = 9/317 (2%)

Query: 38  THEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFS 97
           T + ++V  HEKWMA+HGR+Y +E EK  RL++F+ N + I+  N   + T++L TN+F+
Sbjct: 35  TVDAAMVSRHEKWMAEHGRTYANEEEKARRLEVFRANAKLIDSFNSAEDSTHRLATNRFA 94

Query: 98  DLTNDEFRALYTGYKMP--SPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQK 155
           DLT++EFRA  TG + P  + +   + +  F+Y+N S+ D   S+DWR  GAVT +K+Q 
Sbjct: 95  DLTDEEFRAARTGLRRPPAAAAGAGSGAGGFRYENFSLADAAGSMDWRAMGAVTGVKDQG 154

Query: 156 ECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNN-GCLGGSREKAFAYIIQN 214
            CGCCWAF+AVAAVEG+TKIR+G L+ LSEQQL+DC   G++ GC GG  + AF Y+I  
Sbjct: 155 SCGCCWAFSAVAAVEGLTKIRTGRLVSLSEQQLVDCDVYGDDEGCAGGLMDNAFEYMINR 214

Query: 215 QGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYS 274
            G+ TE  YPY+   G+C   +  +AA I  YE+VP+ +E AL+ AV+ QPVS+AI    
Sbjct: 215 GGLTTESSYPYRGTDGSCR--RSASAASIRGYEDVPANNEAALMAAVAHQPVSVAINGGD 272

Query: 275 TEFQSYKEGIFNGV-CGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKI--- 330
           + F+ Y  G+  G  CGT+L+HA+T  G+GT  DG  YW++KNSWG +WG+ GY++I   
Sbjct: 273 SVFRFYDSGVLGGSGCGTELNHAITAAGYGTASDGTKYWIMKNSWGGSWGEGGYVRIRRG 332

Query: 331 VRDEGLCGIGTRSSYPL 347
           VR EG+CG+   +SYP+
Sbjct: 333 VRGEGVCGLAQLASYPV 349


>gi|2463586|dbj|BAA22545.1| FB22 precursor [Ananas comosus]
          Length = 340

 Score =  307 bits (786), Expect = 5e-81,   Method: Compositional matrix adjust.
 Identities = 148/332 (44%), Positives = 211/332 (63%), Gaps = 8/332 (2%)

Query: 18  MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
           +F+ + L V  AS   +SR      +++  E+WMA++GR YKD  EK  R +IFK N+ +
Sbjct: 8   VFLFLFLCVMWASPSAASRDEPSDPMMKRFEEWMAEYGRVYKDNDEKMRRFQIFKNNVNH 67

Query: 78  IEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVP 137
           IE  N     +Y LG N+F+D+TN+EF   YTG  +P    R    S   + +++++ V 
Sbjct: 68  IETFNNRNGNSYTLGINKFTDMTNNEFVTQYTGVSLPLNFKREPVVS---FDDVNISAVG 124

Query: 138 TSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNN 197
            S+DWRD GAVT +K+Q  CG CWAF+A+A VEGI KI +G L+ LSEQ++LDC+ +  N
Sbjct: 125 QSIDWRDYGAVTEVKDQNPCGSCWAFSAIATVEGIYKIVTGYLVSLSEQEVLDCAVS--N 182

Query: 198 GCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQAL 257
           GC GG  + A+ +II N G+A+E +YPYQA  G C+A   P +A I+ Y  V S DE ++
Sbjct: 183 GCDGGFVDNAYDFIISNNGVASEADYPYQAYEGDCTANSWPNSAYITGYSYVRSNDESSM 242

Query: 258 LKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNS 317
             AV  QP++ AI A    FQ Y  G+F+G CGT L+HA+TI+G+G    G  YW++KNS
Sbjct: 243 KYAVWNQPIAAAIDASGDNFQYYNGGVFSGPCGTSLNHAITIIGYGQDSSGTQYWIVKNS 302

Query: 318 WGNTWGDAGYMKIVR---DEGLCGIGTRSSYP 346
           WG++WG+ GY+++ R     GLCGI     YP
Sbjct: 303 WGSSWGERGYVRMARGVSSSGLCGIAMDPLYP 334


>gi|356543112|ref|XP_003540007.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like [Glycine
           max]
          Length = 345

 Score =  307 bits (786), Expect = 6e-81,   Method: Compositional matrix adjust.
 Identities = 149/324 (45%), Positives = 205/324 (63%), Gaps = 10/324 (3%)

Query: 33  VSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLG 92
           + SR   E    E HE WMAQ+G+ YKD  EK+ R +IFK N+ +IE  N  G++ + L 
Sbjct: 24  IMSRRLFEACTSERHENWMAQYGKVYKDAAEKKKRFQIFKNNVHFIESFNTAGDKPFNLS 83

Query: 93  TNQFSDLTNDEFRALYTGYKMPSPSHRST---TSSTFKYQNLSMTDVPTSLDWRDKGAVT 149
            NQF+DL ++EF+AL T       S   T   T ++FKY  +  T +  ++DWR +GAVT
Sbjct: 84  INQFADLHDEEFKALLTNGNKKVRSVVGTATETETSFKYNRV--TKLLATMDWRKRGAVT 141

Query: 150 PIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFA 209
           PIK+Q+ CG CWAF+AVAA+EGI +I +  L+ LSEQ+L+DC    + GC GG  E AF 
Sbjct: 142 PIKDQRRCGSCWAFSAVAAIEGIHQITTSKLVSLSEQELVDCVKGESEGCNGGYMEDAFE 201

Query: 210 YIIQNQGIATEDEYPYQAVPGTCSAAQKP-AAAKISNYEEVPSGDEQALLKAVSMQPVSI 268
           ++ +  GIA+E  YPY+    +C   ++    ++I  YE+VPS  E+AL KAV+ QPVS+
Sbjct: 202 FVAKKGGIASESYYPYKGKDKSCKVKKETHGVSQIKGYEKVPSNSEKALQKAVAHQPVSV 261

Query: 269 AIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYM 328
            + A    FQ Y  GIF G CGT  DHA+T+VG+G +  G  YWL+KNSWG  WG+ GY+
Sbjct: 262 YVEAGGNAFQFYSSGIFTGKCGTNTDHAITVVGYGKSRGGTKYWLVKNSWGAGWGEKGYI 321

Query: 329 KIVRD----EGLCGIGTRSSYPLA 348
           ++ RD    EGLCGI   + YP A
Sbjct: 322 RMKRDIRAKEGLCGIAMNAFYPTA 345


>gi|255538210|ref|XP_002510170.1| cysteine protease, putative [Ricinus communis]
 gi|223550871|gb|EEF52357.1| cysteine protease, putative [Ricinus communis]
          Length = 469

 Score =  307 bits (786), Expect = 6e-81,   Method: Compositional matrix adjust.
 Identities = 154/332 (46%), Positives = 222/332 (66%), Gaps = 19/332 (5%)

Query: 32  VVSSRSTH--------EQSVVEIHEKWMAQHGRSYKDEL---EKEMRLKIFKENLEYIEK 80
           +VS   TH        +  V+ I+E+W+ ++G+++ +     EKE R ++FK+NL +I++
Sbjct: 28  IVSYDQTHLTKSSWRTDDEVMAIYEEWLVKNGKAHSNNNALGEKERRFQVFKDNLRFIDE 87

Query: 81  ANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSL 140
            N E NR+YK+G N+F+DLTN+E+R++Y G +  +  +R + SS  +Y       +P S+
Sbjct: 88  HNSE-NRSYKVGLNRFADLTNEEYRSMYLGARSGAKRNRLSRSSN-RYLPRVGDSLPDSV 145

Query: 141 DWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCL 200
           DWR +GAV  +K+Q  CG CWAF+ +AAVEGI KI +G+LI LSEQ+L+DC  + N GC 
Sbjct: 146 DWRKEGAVAEVKDQGSCGSCWAFSTIAAVEGINKIVTGDLISLSEQELVDCDRSYNEGCN 205

Query: 201 GGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLK 259
           GG  + AF +II N GI +E++YPY A  GTC   +K A    I NYE+VP  DE+AL K
Sbjct: 206 GGLMDYAFQFIINNGGIDSEEDYPYLARDGTCDTYRKNAKVVTIDNYEDVPVNDEKALQK 265

Query: 260 AVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWG 319
           AV+ QPVS+AI A   EFQ Y+ GIF G CGT LDH V  VG+G TE+G +YW+++NSWG
Sbjct: 266 AVANQPVSVAIEAGGREFQFYQSGIFTGRCGTALDHGVAAVGYG-TENGKDYWIVRNSWG 324

Query: 320 NTWGDAGYMKIVRD----EGLCGIGTRSSYPL 347
            +WG++GY+++ R+     G CGI    SYP+
Sbjct: 325 KSWGESGYIRMERNIATATGKCGIAIEPSYPI 356


>gi|224083868|ref|XP_002307151.1| predicted protein [Populus trichocarpa]
 gi|222856600|gb|EEE94147.1| predicted protein [Populus trichocarpa]
          Length = 298

 Score =  307 bits (786), Expect = 6e-81,   Method: Compositional matrix adjust.
 Identities = 155/314 (49%), Positives = 213/314 (67%), Gaps = 24/314 (7%)

Query: 43  VVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTND 102
           + E HE+WMAQ+GR YKD+ EKE R  IFKEN+  I+  N +  ++Y LG NQF+DL+N+
Sbjct: 1   MYERHEQWMAQYGRVYKDDAEKETRYNIFKENVARIDAFNSQTGKSYNLGVNQFADLSNE 60

Query: 103 EFRALYTGYK--MPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCC 160
           EF+A    +K  M SP      +  F+Y+N+S   VP ++DWR KGAVTP+K+Q +C   
Sbjct: 61  EFKASRNRFKGHMCSPQ-----AGPFRYENVSA--VPATMDWRKKGAVTPVKDQGQC--- 110

Query: 161 WAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-NNGCLGGSREKAFAYIIQNQGIAT 219
                VAA+EGI ++ +G LI LSEQ+++DC T G + GC GG  + AF +I QN+G+ T
Sbjct: 111 -----VAAMEGINQLTTGKLISLSEQEVVDCDTKGEDQGCNGGLMDDAFKFIEQNKGLTT 165

Query: 220 EDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQ 278
           E  YPY    GTC+  ++ + AAKI+ +++VP+  E AL+KAV+ QPVS+AI A   EFQ
Sbjct: 166 EANYPYTGTDGTCNTQKEVSHAAKITGFQDVPANSEAALMKAVAKQPVSVAIDAGGFEFQ 225

Query: 279 SYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----E 334
            Y  GIF G CGT+LDH VT VG+G + DG  YWL+KNSWG  WG+ GY+++ +D    E
Sbjct: 226 FYSSGIFTGSCGTELDHGVTAVGYGGS-DGTKYWLVKNSWGAQWGEEGYIRMQKDISAKE 284

Query: 335 GLCGIGTRSSYPLA 348
           GLCGI  ++SYP A
Sbjct: 285 GLCGIAMQASYPTA 298


>gi|255567869|ref|XP_002524912.1| cysteine protease, putative [Ricinus communis]
 gi|223535747|gb|EEF37409.1| cysteine protease, putative [Ricinus communis]
          Length = 366

 Score =  307 bits (786), Expect = 6e-81,   Method: Compositional matrix adjust.
 Identities = 155/349 (44%), Positives = 216/349 (61%), Gaps = 18/349 (5%)

Query: 15  TTPMFIIITLLVSCASQVVSSRSTH--------EQSVVEIHEKWMAQHGRSYKDELEKEM 66
           +T +F+  TL  +    ++S    H        +  V+ ++  W+A+H ++Y    E+E 
Sbjct: 8   STLLFLFFTLSSAWDMSILSHNHGHHHQSSWRSDNEVISMYNWWLAKHSKTYNKLGEREK 67

Query: 67  RLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSS-- 124
           R +IFK NL +I++ N   NRTYK+G  +F+DLTN+E+RA + G K   P  R   S   
Sbjct: 68  RFEIFKNNLRFIDEHNNSKNRTYKVGLTRFADLTNEEYRAKFLGTK-SDPKRRLMKSKNP 126

Query: 125 TFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLS 184
           + +Y   +   +P S+DWR  GAV+ IK+Q  CG CWAF+ +AAVEG+ KI +G LI LS
Sbjct: 127 SQRYAFKAGDVLPESIDWRQSGAVSAIKDQGSCGSCWAFSTIAAVEGVNKIVTGELISLS 186

Query: 185 EQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQ-KPAAAKI 243
           EQ+L+DC  + N GC GG  + AF +II N GI T+ +YPYQAV G C   + K  A  I
Sbjct: 187 EQELVDCDRSYNAGCNGGLMDNAFQFIINNGGIDTDKDYPYQAVDGKCDTTKVKNKAVTI 246

Query: 244 SNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFG 303
             +E+V + DE AL KAV+ QPVS+AI A     Q Y+ G+F G CG+ LDH V IVG+G
Sbjct: 247 DGFEDVMAFDEMALQKAVAHQPVSVAIEASGMALQFYQSGVFTGECGSALDHGVVIVGYG 306

Query: 304 TTEDGANYWLIKNSWGNTWGDAGYMKIVRD-----EGLCGIGTRSSYPL 347
            TEDG +YWL++NSWG  WG+ GY+K+ R+      G CGI   SSYP+
Sbjct: 307 -TEDGIDYWLVRNSWGRDWGENGYIKMQRNVVDTFTGKCGIAMESSYPI 354


>gi|414870137|tpg|DAA48694.1| TPA: vignain [Zea mays]
          Length = 484

 Score =  306 bits (785), Expect = 7e-81,   Method: Compositional matrix adjust.
 Identities = 149/321 (46%), Positives = 204/321 (63%), Gaps = 20/321 (6%)

Query: 40  EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDL 99
           E+++  ++E+W  +H  + +D  +K  R  +FK N+  I + N+  +  YKL  N+F D+
Sbjct: 149 EEALWALYERWRGRHALA-RDLGDKARRFNVFKANVRLIHEFNRR-DEPYKLRLNRFGDM 206

Query: 100 TNDEFRALYTGYKMPSPSHR---------STTSSTFKYQNLSMTDVPTSLDWRDKGAVTP 150
           T DEFR  Y G ++    HR         S ++S+F Y +    DVP S+DWR KGAVT 
Sbjct: 207 TADEFRRHYAGSRVAH--HRMFRGDRQGSSASASSFMYAD--ARDVPASVDWRQKGAVTD 262

Query: 151 IKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAY 210
           +K+Q +CG CWAF+ +AAVEGI  I++ NL  LSEQQL+DC T  N GC GG  + AF Y
Sbjct: 263 VKDQGQCGSCWAFSTIAAVEGINAIKTKNLTSLSEQQLVDCDTKANAGCNGGLMDYAFQY 322

Query: 211 IIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAI 270
           I ++ G+A ED YPY+A   +C  +  P    I  YE+VP+ DE AL KAV+ QPVS+AI
Sbjct: 323 IAKHGGVAAEDAYPYRARQASCKKSPAPVVT-IDGYEDVPANDESALKKAVAHQPVSVAI 381

Query: 271 AAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKI 330
            A  + FQ Y EG+F+G CGT+LDH V  VG+G T DG  YWL+KNSWG  WG+ GY+++
Sbjct: 382 EASGSHFQFYSEGVFSGRCGTELDHGVAAVGYGVTADGTKYWLVKNSWGPEWGEKGYIRM 441

Query: 331 VRD----EGLCGIGTRSSYPL 347
            RD    EG CGI   +SYP+
Sbjct: 442 ARDVAAKEGHCGIAMEASYPV 462


>gi|2511693|emb|CAB17076.1| cysteine proteinase precursor [Phaseolus vulgaris]
          Length = 455

 Score =  306 bits (785), Expect = 8e-81,   Method: Compositional matrix adjust.
 Identities = 152/344 (44%), Positives = 220/344 (63%), Gaps = 17/344 (4%)

Query: 18  MFIIITLLVSCASQVVSSRSTH--------EQSVVEIHEKWMAQHGRSYKDELEKEMRLK 69
           +F +  L  +    ++S  + H        ++ V  ++E+W+ +HG+ Y    EK+ R +
Sbjct: 3   LFALFALSSALDMSIISYDNAHQDKATWRTDEEVNSLYEEWLVKHGKLYNALGEKDKRFQ 62

Query: 70  IFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQ 129
           IFK+NL +I++ N E NRTYKLG N+F+DLTN+E+RA Y G K+  P+ R   + + +Y 
Sbjct: 63  IFKDNLRFIDQQNAE-NRTYKLGLNRFADLTNEEYRARYLGTKI-DPNRRLGRTPSNRYA 120

Query: 130 NLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLL 189
                 +P S+DWR +GAV P+K+Q  CG CWAF+A+ AVEGI KI +G+LI LSEQ+L+
Sbjct: 121 PRVGETLPDSVDWRKEGAVVPVKDQASCGSCWAFSAIGAVEGINKIVTGDLISLSEQELV 180

Query: 190 DCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPA-AAKISNYEE 248
           DC T  N GC GG  + AF +II+N GI +E++YPY+ V G C   +K A    I  YE+
Sbjct: 181 DCDTGYNMGCNGGLMDYAFEFIIKNGGIDSEEDYPYKGVDGRCDEYRKNAKVVSIDGYED 240

Query: 249 VPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDG 308
           V + DE AL KAV+ QPVS+A+     EFQ Y  G+F G CGT LDH V  VG+G T++G
Sbjct: 241 VNTYDELALKKAVANQPVSVAVEGGGREFQLYSSGVFTGRCGTALDHGVVAVGYG-TDNG 299

Query: 309 ANYWLIKNSWGNTWGDAGYMKIVRD-----EGLCGIGTRSSYPL 347
            ++W+++NSWG  WG+ GY+++ R+      G CGI    SYP+
Sbjct: 300 HDFWIVRNSWGADWGEEGYIRLERNLGNSRSGKCGIAIEPSYPI 343


>gi|3377950|emb|CAA08861.1| cysteine proteinase precursor, AN11 [Ananas comosus]
          Length = 357

 Score =  306 bits (784), Expect = 8e-81,   Method: Compositional matrix adjust.
 Identities = 151/335 (45%), Positives = 214/335 (63%), Gaps = 12/335 (3%)

Query: 18  MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
           +F+ + L V  AS   +SR      +++  E+WMA++GR YKD  EK  R +IFK N+ +
Sbjct: 8   VFLFLFLCVMWASPSAASRDEPSDPMMKRFEEWMAEYGRVYKDNDEKMRRFQIFKNNVNH 67

Query: 78  IEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVP 137
           IE  N     +Y LG NQF+D+TN+EF A YTG  +P    R    S   + ++ ++ VP
Sbjct: 68  IETFNSRNGNSYTLGINQFTDMTNNEFVAQYTGVSLPLNIEREPVVS---FDDVDISAVP 124

Query: 138 TSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNN 197
            S+DWR+ GAVT +KN   CG CWAFAA+A VE I KI+ G LI LSEQQ+LDC+   + 
Sbjct: 125 QSIDWRNYGAVTSVKNHIPCGSCWAFAAIATVESIYKIKRGYLISLSEQQVLDCAV--SY 182

Query: 198 GCLGGSREKAFAYIIQNQGIATEDEYPYQAV--PGTCSAAQKPAAAKISNYEEVPSGDEQ 255
           GC GG   KA+ +II N+G+A+   YPY+A    GTC     P +A I+ Y  V S +E+
Sbjct: 183 GCDGGWVNKAYDFIISNKGVASAAIYPYKASQGQGTCRINGVPNSAYITGYTRVQSNNER 242

Query: 256 ALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIK 315
           +++ AVS QP++ +I A S +FQ YK G+F+G CGT L+HA+TI+G+G    G  +W+++
Sbjct: 243 SMMYAVSNQPIAASIEA-SGDFQHYKRGVFSGPCGTSLNHAITIIGYGQDSSGKKFWIVR 301

Query: 316 NSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYP 346
           NSWG +WG+ GY+++ RD     GLCGI  R  YP
Sbjct: 302 NSWGASWGERGYIRMARDVSSSSGLCGIAIRPLYP 336


>gi|242077600|ref|XP_002448736.1| hypothetical protein SORBIDRAFT_06g032320 [Sorghum bicolor]
 gi|241939919|gb|EES13064.1| hypothetical protein SORBIDRAFT_06g032320 [Sorghum bicolor]
          Length = 467

 Score =  306 bits (784), Expect = 8e-81,   Method: Compositional matrix adjust.
 Identities = 148/315 (46%), Positives = 216/315 (68%), Gaps = 11/315 (3%)

Query: 40  EQSVVEIHEKWMAQHGRSYKDEL-EKEMRLKIFKENLEYIEKAN-KEGNRTYKLGTNQFS 97
           E  V  ++E W+ +HGR   + L E + R ++F +NL +++  N + G   ++LG NQF+
Sbjct: 49  EAEVRAMYELWLVEHGRRVSNVLGEHDSRFRVFWDNLRFVDAHNERAGEHGFRLGMNQFA 108

Query: 98  DLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKEC 157
           DLTNDEFRA Y G ++P+   RS  +    Y++    ++P S+DWR+KGAV P+KNQ +C
Sbjct: 109 DLTNDEFRAAYLGARIPAA--RSGNAVGEMYRHDGAEELPESVDWREKGAVAPVKNQGQC 166

Query: 158 GCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GNNGCLGGSREKAFAYIIQNQG 216
           G CWAF+AV++VE I +I +G ++ LSEQ+L++CST+ GN+GC GG  + AF +II+N G
Sbjct: 167 GSCWAFSAVSSVESINQIVTGEMVTLSEQELVECSTDGGNSGCNGGLMDAAFNFIIKNGG 226

Query: 217 IATEDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYST 275
           I TED+YPY+AV G C   ++ A    I  +E+VP  DE++L KAV+ QPVS+AI A   
Sbjct: 227 IDTEDDYPYKAVDGKCDINRRNAKVVSIDAFEDVPENDEKSLQKAVAHQPVSVAIEAGGR 286

Query: 276 EFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD-- 333
           +FQ YK G+F+G C T LDH V  VG+G TE+G +YW+++NSWG  WG+AGY+++ R+  
Sbjct: 287 QFQLYKSGVFSGSCTTNLDHGVVAVGYG-TENGKDYWIVRNSWGPKWGEAGYIRMERNIN 345

Query: 334 --EGLCGIGTRSSYP 346
              G CGI   +SYP
Sbjct: 346 ATTGKCGIAMMASYP 360


>gi|162459393|ref|NP_001105993.1| cysteine protease component of protease-inhibitor complex precursor
           [Zea mays]
 gi|6682829|dbj|BAA88898.1| cysteine protease component of protease-inhibitor complex [Zea
           mays]
          Length = 465

 Score =  306 bits (784), Expect = 9e-81,   Method: Compositional matrix adjust.
 Identities = 151/324 (46%), Positives = 211/324 (65%), Gaps = 12/324 (3%)

Query: 32  VVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKAN---KEGNRT 88
           +VS     ++    ++ +WMA HGR+Y    E+E R ++F++NL YI+  N     G  +
Sbjct: 29  IVSYGERSDEEARRMYAEWMAAHGRTYNAVGEEERRYQVFRDNLRYIDAHNAAADAGVHS 88

Query: 89  YKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAV 148
           ++LG N+F+DLTNDE+RA Y G +      R   +   +Y      D+P S+DWR KGAV
Sbjct: 89  FRLGLNRFADLTNDEYRATYLGARTRPQRERKLGA---RYHAADNEDLPESVDWRAKGAV 145

Query: 149 TPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAF 208
             +K+Q   G CWAF+ +AAVEGI +I +G+LI LSEQ+L+DC T+ N GC GG  + AF
Sbjct: 146 AEVKDQGSYGSCWAFSTIAAVEGINQIVTGDLISLSEQELVDCDTSYNQGCNGGLMDYAF 205

Query: 209 AYIIQNQGIATEDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVS 267
            +II N GI TE +YPY+   G C   +K A    I +YE+VP+ DE++L KAV+ QPVS
Sbjct: 206 EFIINNGGIDTEKDYPYKGTDGRCDVNRKNAKVVTIDSYEDVPANDEKSLQKAVANQPVS 265

Query: 268 IAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGY 327
           +AI A  T+FQ Y  GIF G CGT LDH VT VG+G TE+G +YW++KNSWG++WG++GY
Sbjct: 266 VAIEAAGTQFQLYSSGIFTGSCGTALDHGVTAVGYG-TENGKDYWIVKNSWGSSWGESGY 324

Query: 328 MKIVRD----EGLCGIGTRSSYPL 347
           +++ R+     G CGI    SYPL
Sbjct: 325 VRMERNIKASSGKCGIAVEPSYPL 348


>gi|2463584|dbj|BAA22544.1| FBSB precursor [Ananas comosus]
          Length = 356

 Score =  306 bits (784), Expect = 9e-81,   Method: Compositional matrix adjust.
 Identities = 152/334 (45%), Positives = 209/334 (62%), Gaps = 11/334 (3%)

Query: 18  MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
           +F+ + L V  AS   +S       +++  E+WM ++GR YKD  EK  R +IFK N+ +
Sbjct: 8   VFLFLFLCVMWASPSAASADEPSDPMMKRFEEWMVEYGRVYKDNDEKMRRFQIFKNNVNH 67

Query: 78  IEKANKEGNRTYKLGTNQFSDLTNDEFRALYTG-YKMPSPSHRSTTSSTFKYQNLSMTDV 136
           IE  N     +Y LG NQF+D+TN+EF A YTG    P    R    S   + ++ ++ V
Sbjct: 68  IETFNSRNENSYTLGINQFTDMTNNEFIAQYTGGISRPLNIEREPVVS---FDDVDISAV 124

Query: 137 PTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGN 196
           P S+DWRD GAVT +KNQ  CG CWAFAA+A VE I KI+ G L  LSEQQ+LDC+    
Sbjct: 125 PQSIDWRDYGAVTSVKNQNPCGACWAFAAIATVESIYKIKKGILEPLSEQQVLDCAKG-- 182

Query: 197 NGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQA 256
            GC GG   +AF +II N+G+A+   YPY+A  GTC     P +A I+ Y  VP  +E +
Sbjct: 183 YGCKGGWEFRAFEFIISNKGVASGAIYPYKAAKGTCKTNGVPNSAYITGYARVPRNNESS 242

Query: 257 LLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKN 316
           ++ AVS QP+++A+ A +  FQ YK G+FNG CGT L+HAVT +G+G   +G  YW++KN
Sbjct: 243 MMYAVSKQPITVAVDA-NANFQYYKSGVFNGPCGTSLNHAVTAIGYGQDSNGKKYWIVKN 301

Query: 317 SWGNTWGDAGYMKIVRD----EGLCGIGTRSSYP 346
           SWG  WG+AGY+++ RD     G+CGI   S YP
Sbjct: 302 SWGARWGEAGYIRMARDVSSSSGICGIAIDSLYP 335


>gi|30141019|dbj|BAC75923.1| cysteine protease-1 [Helianthus annuus]
          Length = 461

 Score =  306 bits (784), Expect = 9e-81,   Method: Compositional matrix adjust.
 Identities = 151/350 (43%), Positives = 219/350 (62%), Gaps = 19/350 (5%)

Query: 15  TTPMFIIITLLVSCASQVVSSRSTH------------EQSVVEIHEKWMAQHGRSYKDEL 62
           T   F +I+++ +    +++  +TH            +  V  ++E W+ +HG++Y    
Sbjct: 8   TLSFFALISIISAMDMSIINYDATHMSSSSSSAPLRTDDEVNALYESWLVKHGKTYNALG 67

Query: 63  EKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTT 122
           EK+ R +IFK+NL +I++ N  G+ TYKLG N+F+DLTN+E+R  YTG K      + + 
Sbjct: 68  EKDRRFQIFKDNLRFIDEHN-SGDHTYKLGLNKFADLTNEEYRMTYTGIKTIDDKKKLSK 126

Query: 123 SSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQ 182
             + +Y   S   +P  +DWR++GAVT +K+Q  CG CWAF+   +VEG+ KI +G+LI 
Sbjct: 127 MKSDRYAYRSGDSLPEYVDWREQGAVTDVKDQGSCGSCWAFSTTGSVEGVNKIVTGDLIS 186

Query: 183 LSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPA-AA 241
           +SEQ+L++C T+ N GC GG  + AF +II+N GI TE++YPY    G C   +K A   
Sbjct: 187 VSEQELVNCDTSYNQGCNGGLMDYAFEFIIKNGGIDTEEDYPYTGKDGKCDKNKKNAKVV 246

Query: 242 KISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVG 301
            I +YE+VP  DE +L KAVS QPV++AI A   +FQ Y  GIF G CGT LDH V   G
Sbjct: 247 TIDSYEDVPVNDESSLKKAVSNQPVAVAIEAGGRDFQFYTSGIFTGSCGTALDHGVLAAG 306

Query: 302 FGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPL 347
           +G TEDG +YWL+KNSWG  WG+ GY+K+ R+     G CGI   +SYP+
Sbjct: 307 YG-TEDGKDYWLVKNSWGAEWGEGGYLKMERNIADKSGKCGIAMEASYPI 355


>gi|238006338|gb|ACR34204.1| unknown [Zea mays]
          Length = 465

 Score =  306 bits (784), Expect = 1e-80,   Method: Compositional matrix adjust.
 Identities = 143/308 (46%), Positives = 208/308 (67%), Gaps = 8/308 (2%)

Query: 46  IHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKAN-KEGNRTYKLGTNQFSDLTNDEF 104
           ++E W+A+HGR+Y    E++ R ++F +NL +++  N +     ++LG NQF+DLTNDEF
Sbjct: 51  LYELWLAEHGRAYNALGERDRRFRVFWDNLRFVDAHNERAAEHGFRLGMNQFADLTNDEF 110

Query: 105 RALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFA 164
           RA Y G ++P+   R T             ++P S+DWR+KGAV P+KNQ +CG CWAF+
Sbjct: 111 RAAYLGARIPASRRRGTAVGERYRHGGGAEELPESVDWREKGAVAPVKNQGQCGSCWAFS 170

Query: 165 AVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GNNGCLGGSREKAFAYIIQNQGIATEDEY 223
           AV++VE + +I +G ++ LSEQ+L++CST+ GN+GC GG  + AF +II+N GI TE +Y
Sbjct: 171 AVSSVESVNQIVTGEMVTLSEQELVECSTDGGNSGCNGGLMDAAFDFIIKNGGIDTEGDY 230

Query: 224 PYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKE 282
           PY+AV G C   ++ A    I  +E+VP  DE++L KAV+ QPVS+AI A   EFQ YK 
Sbjct: 231 PYKAVDGKCDINRENAKVVSIDGFEDVPENDEKSLQKAVAHQPVSVAIEAGGREFQLYKA 290

Query: 283 GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCG 338
           G+F G C T LDH V  VG+G TE+G +YW+++NSWG  WG+ GY+++ R+     G CG
Sbjct: 291 GVFTGTCTTNLDHGVVAVGYG-TENGKDYWIVRNSWGAKWGEDGYIRMERNVNATTGKCG 349

Query: 339 IGTRSSYP 346
           I   +SYP
Sbjct: 350 IAMMASYP 357


>gi|146215978|gb|ABQ10191.1| actinidin Act1c [Actinidia eriantha]
          Length = 368

 Score =  306 bits (784), Expect = 1e-80,   Method: Compositional matrix adjust.
 Identities = 158/340 (46%), Positives = 217/340 (63%), Gaps = 14/340 (4%)

Query: 13  INTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFK 72
           ++ + +F    L++S A   + ++ T+++ V  ++E W+ +HG+SY    E+E R +IFK
Sbjct: 8   VSMSLLFFSTLLILSLA---LDAKRTNDE-VKAMYESWLIKHGKSYNSLGERERRFEIFK 63

Query: 73  ENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLS 132
           E L +I++ N + +R+YK+G NQF+DLTN+EFR+ Y G+   S    + T  + +Y+   
Sbjct: 64  ETLRFIDEHNADTSRSYKVGLNQFADLTNEEFRSTYLGFTRGS----NKTKVSNRYEPRV 119

Query: 133 MTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS 192
              +P  +DWR +GAV  IKNQ +CG CWAF+A+AAVEGI KI +GNLI LSEQ+L+DC 
Sbjct: 120 GQVLPDYVDWRSEGAVVDIKNQGQCGSCWAFSAIAAVEGINKIVTGNLISLSEQELVDCG 179

Query: 193 -TNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSA-AQKPAAAKISNYEEVP 250
            T    GC GG     F +II N GI TE+ YPY A  G C    Q      I NYE VP
Sbjct: 180 RTQSTKGCDGGYMTDGFEFIINNGGINTEENYPYTAQEGQCDLNLQNEKYVTIDNYENVP 239

Query: 251 SGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGAN 310
             +E AL  AV+ QPVS+A+ +    FQ Y  GIF G CGT  DHAVTIVG+G TE G +
Sbjct: 240 YYNEWALQTAVAYQPVSVALESAGDAFQHYSSGIFTGPCGTATDHAVTIVGYG-TEGGID 298

Query: 311 YWLIKNSWGNTWGDAGYMKIVRD---EGLCGIGTRSSYPL 347
           YW++KNSW  TWG+ GYM+I+R+    G CGI T  SYP+
Sbjct: 299 YWIVKNSWDTTWGEEGYMRILRNVGGAGTCGIATMPSYPV 338


>gi|226501480|ref|NP_001150266.1| cysteine protease 1 precursor [Zea mays]
 gi|195637948|gb|ACG38442.1| cysteine protease 1 precursor [Zea mays]
          Length = 462

 Score =  306 bits (784), Expect = 1e-80,   Method: Compositional matrix adjust.
 Identities = 143/308 (46%), Positives = 209/308 (67%), Gaps = 8/308 (2%)

Query: 46  IHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKAN-KEGNRTYKLGTNQFSDLTNDEF 104
           ++E W+A+HGR+Y    E++ R ++F +NL +++  N +     ++LG NQF+DLTNDEF
Sbjct: 48  LYELWLAEHGRAYNALGERDRRFRVFWDNLRFVDAHNERAAEHGFRLGMNQFADLTNDEF 107

Query: 105 RALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFA 164
           RA Y G ++P+   R T             ++P S+DWR+KGAV P+KNQ +CG CWAF+
Sbjct: 108 RAAYLGARIPAARRRGTAVGERYRHGGGAEELPESVDWREKGAVAPVKNQGQCGSCWAFS 167

Query: 165 AVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GNNGCLGGSREKAFAYIIQNQGIATEDEY 223
           AV++VE + +I +G ++ LSEQ+L++CST+ GN+GC GG  + AF +II+N GI TE +Y
Sbjct: 168 AVSSVESVNQIVTGEMVTLSEQELVECSTDGGNSGCNGGLMDAAFDFIIKNGGIDTEGDY 227

Query: 224 PYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKE 282
           PY+AV G C   ++ A    I  +E+VP  DE++L KAV+ QPVS+AI A   EFQ YK 
Sbjct: 228 PYKAVDGKCDINRENAKVVSIDGFEDVPENDEKSLQKAVAHQPVSVAIEAGGREFQLYKA 287

Query: 283 GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCG 338
           G+F+G C T LDH V  VG+G TE+G +YW+++NSWG  WG+ GY+++ R+     G CG
Sbjct: 288 GVFSGTCTTNLDHGVVAVGYG-TENGKDYWIVRNSWGAKWGEDGYIRMERNVNATTGKCG 346

Query: 339 IGTRSSYP 346
           I   +SYP
Sbjct: 347 IAMMASYP 354


>gi|225438807|ref|XP_002283263.1| PREDICTED: germination-specific cysteine protease 1-like isoform 1
           [Vitis vinifera]
          Length = 374

 Score =  306 bits (783), Expect = 1e-80,   Method: Compositional matrix adjust.
 Identities = 146/315 (46%), Positives = 214/315 (67%), Gaps = 9/315 (2%)

Query: 40  EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDL 99
           E+ V+ +++ WMA+HG++Y    EKE R +IFK+NL++I++ N + NRTYK+G N+F+DL
Sbjct: 39  EEEVMGMYQWWMAKHGKAYNGLGEKEKRFEIFKDNLKFIDEHNAQ-NRTYKVGLNRFADL 97

Query: 100 TNDEFRALYTGYKM-PSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECG 158
           TN+E+RA+Y G +  P        +++ +Y  +    +P S+DWR+ GAV P+K+Q+ CG
Sbjct: 98  TNEEYRAIYLGTRSDPKRRFAKLKNASPRYAVMPGEVLPESVDWRETGAVNPVKDQRSCG 157

Query: 159 CCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIA 218
            CWAF+ VAAVEGI +I +G LI LSEQ+L+DC T  + GC GG  + AF +II+N G+ 
Sbjct: 158 SCWAFSTVAAVEGINQIVTGELISLSEQELVDCDTEYDMGCNGGLMDYAFDFIIKNGGLD 217

Query: 219 TEDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEF 277
           TE +YPY    G C+ + K +    I  YE+VP  DE+AL KAV+ QPVS+A+ A     
Sbjct: 218 TEKDYPYTGFDGECNLSGKSSKVVSIDGYEDVPPFDEKALQKAVAHQPVSVAVEAGGRAL 277

Query: 278 QSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD---- 333
           Q Y  GIF G CGT LDH +  VG+G TE+G +YW+++NSWG++WG+ GY+++ R+    
Sbjct: 278 QLYVSGIFTGECGTALDHGIVAVGYG-TENGTDYWIVRNSWGSSWGENGYIRMERNMADA 336

Query: 334 -EGLCGIGTRSSYPL 347
             G CGI   +SYP+
Sbjct: 337 FSGKCGIAMEASYPI 351


>gi|414584879|tpg|DAA35450.1| TPA: cysteine protease 1 [Zea mays]
          Length = 522

 Score =  306 bits (783), Expect = 1e-80,   Method: Compositional matrix adjust.
 Identities = 143/308 (46%), Positives = 208/308 (67%), Gaps = 8/308 (2%)

Query: 46  IHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKAN-KEGNRTYKLGTNQFSDLTNDEF 104
           ++E W+A+HGR+Y    E++ R ++F +NL +++  N +     ++LG NQF+DLTNDEF
Sbjct: 108 LYELWLAEHGRAYNALGERDRRFRVFWDNLRFVDAHNERAAEHGFRLGMNQFADLTNDEF 167

Query: 105 RALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFA 164
           RA Y G ++P+   R T             ++P S+DWR+KGAV P+KNQ +CG CWAF+
Sbjct: 168 RAAYLGARIPASRRRGTAVGERYRHGGGAEELPESVDWREKGAVAPVKNQGQCGSCWAFS 227

Query: 165 AVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GNNGCLGGSREKAFAYIIQNQGIATEDEY 223
           AV++VE + +I +G ++ LSEQ+L++CST+ GN+GC GG  + AF +II+N GI TE +Y
Sbjct: 228 AVSSVESVNQIVTGEMVTLSEQELVECSTDGGNSGCNGGLMDAAFDFIIKNGGIDTEGDY 287

Query: 224 PYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKE 282
           PY+AV G C   ++ A    I  +E+VP  DE++L KAV+ QPVS+AI A   EFQ YK 
Sbjct: 288 PYKAVDGKCDINRENAKVVSIDGFEDVPENDEKSLQKAVAHQPVSVAIEAGGREFQLYKA 347

Query: 283 GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCG 338
           G+F G C T LDH V  VG+G TE+G +YW+++NSWG  WG+ GY+++ R+     G CG
Sbjct: 348 GVFTGTCTTNLDHGVVAVGYG-TENGKDYWIVRNSWGAKWGEDGYIRMERNVNATTGKCG 406

Query: 339 IGTRSSYP 346
           I   +SYP
Sbjct: 407 IAMMASYP 414


>gi|125533982|gb|EAY80530.1| hypothetical protein OsI_35710 [Oryza sativa Indica Group]
          Length = 378

 Score =  305 bits (782), Expect = 2e-80,   Method: Compositional matrix adjust.
 Identities = 146/330 (44%), Positives = 207/330 (62%), Gaps = 22/330 (6%)

Query: 38  THEQSVVEIHEKWMAQH--------GRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTY 89
           + E+S+  ++E+W +++        G    D+ E   R  +F EN  YI +AN+ G R +
Sbjct: 33  SSEESLRALYERWRSRYTVSRPAASGGVGNDDGEARRRFNVFVENARYIHEANRRGGRPF 92

Query: 90  KLGTNQFSDLTNDEFRALYTGYKMPSPSHRS------TTSSTFKYQNLSMTDVPTSLDWR 143
           +L  N+F+D+T DEFR  Y G +  +  HRS          +F+Y      ++P ++DWR
Sbjct: 93  RLALNKFADMTTDEFRRTYAGSR--ARHHRSLRGGRGGEGGSFRYGGDDEDNLPPAVDWR 150

Query: 144 DKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGS 203
           ++GAVT IK+Q +CG CWAF+AVAAVEG+ KI++G L+ LSEQ+L+DC T  N GC GG 
Sbjct: 151 ERGAVTGIKDQGQCGSCWAFSAVAAVEGVNKIKTGRLVTLSEQELVDCDTGDNQGCDGGL 210

Query: 204 REKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAA-AKISNYEEVPSGDEQALLKAVS 262
            + AF +I +N GI TE  YPY+A  G C+ A+  +    I  YE+VP+ DE AL KAV+
Sbjct: 211 MDYAFQFIKRNGGITTESNYPYRAEQGRCNKAKASSHDVTIDGYEDVPANDESALQKAVA 270

Query: 263 MQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTW 322
            QPV++A+ A   +FQ Y EG+F G CGT LDH V  VG+G T DG  YW++KNSWG  W
Sbjct: 271 NQPVAVAVEASGQDFQFYSEGVFTGECGTDLDHGVAAVGYGITRDGTKYWIVKNSWGEDW 330

Query: 323 GDAGYMKIVR-----DEGLCGIGTRSSYPL 347
           G+ GY+++ R       GLCGI   +SYP+
Sbjct: 331 GERGYIRMQRGVSSDSNGLCGIAMEASYPV 360


>gi|75277440|sp|O23791.1|BROM1_ANACO RecName: Full=Fruit bromelain; AltName: Allergen=Ana c 2; Flags:
           Precursor
 gi|2342496|dbj|BAA21849.1| bromelain [Ananas comosus]
          Length = 351

 Score =  305 bits (782), Expect = 2e-80,   Method: Compositional matrix adjust.
 Identities = 146/333 (43%), Positives = 212/333 (63%), Gaps = 10/333 (3%)

Query: 18  MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
           +F+ + L    AS   +SR      +++  E+WMA++GR YKD+ EK  R +IFK N+++
Sbjct: 8   VFLFLFLCAMWASPSAASRDEPNDPMMKRFEEWMAEYGRVYKDDDEKMRRFQIFKNNVKH 67

Query: 78  IEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVP 137
           IE  N     +Y LG NQF+D+T  EF A YTG  +P    R    S   + +++++ VP
Sbjct: 68  IETFNSRNENSYTLGINQFTDMTKSEFVAQYTGVSLPLNIEREPVVS---FDDVNISAVP 124

Query: 138 TSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNN 197
            S+DWRD GAV  +KNQ  CG CW+FAA+A VEGI KI++G L+ LSEQ++LDC+   + 
Sbjct: 125 QSIDWRDYGAVNEVKNQNPCGSCWSFAAIATVEGIYKIKTGYLVSLSEQEVLDCAV--SY 182

Query: 198 GCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQAL 257
           GC GG   KA+ +II N G+ TE+ YPY A  GTC+A   P +A I+ Y  V   DE+++
Sbjct: 183 GCKGGWVNKAYDFIISNNGVTTEENYPYLAYQGTCNANSFPNSAYITGYSYVRRNDERSM 242

Query: 258 LKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNS 317
           + AVS QP++  I A S  FQ Y  G+F+G CGT L+HA+TI+G+G    G  YW+++NS
Sbjct: 243 MYAVSNQPIAALIDA-SENFQYYNGGVFSGPCGTSLNHAITIIGYGQDSSGTKYWIVRNS 301

Query: 318 WGNTWGDAGYMKIVR----DEGLCGIGTRSSYP 346
           WG++WG+ GY+++ R      G+CGI     +P
Sbjct: 302 WGSSWGEGGYVRMARGVSSSSGVCGIAMAPLFP 334


>gi|363807062|ref|NP_001242584.1| uncharacterized protein LOC100804015 precursor [Glycine max]
 gi|255640677|gb|ACU20623.1| unknown [Glycine max]
          Length = 366

 Score =  305 bits (782), Expect = 2e-80,   Method: Compositional matrix adjust.
 Identities = 150/338 (44%), Positives = 212/338 (62%), Gaps = 9/338 (2%)

Query: 16  TPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENL 75
           T +F+  TL  +  +  + + + +E  V+ ++E+W+ +H + Y +  +K+ R ++FK+NL
Sbjct: 9   TLLFLSFTLSYAIKTSTIINYTDNE--VMAMYEEWLVRHQKGYNELGKKDKRFQVFKDNL 66

Query: 76  EYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTD 135
            +I++ N   N TYKLG N+F+D+TN+E+RA+Y G K  +      T ST      S  D
Sbjct: 67  GFIQEHNNNLNNTYKLGLNKFADMTNEEYRAMYLGTKSNAKRRLMKTKSTGHRYAFSARD 126

Query: 136 -VPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN 194
            +P  +DWR KGAV PIK+Q  CG CWAF+ VA VE I KI +G  + LSEQ+L+DC   
Sbjct: 127 RLPVHVDWRMKGAVAPIKDQGSCGSCWAFSTVATVEAINKIVTGKFVSLSEQELVDCDRA 186

Query: 195 GNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGD 253
            N GC GG  + AF +IIQN GI T+ +YPY+   G C   +K A    I  YE+VP  D
Sbjct: 187 YNEGCNGGLMDYAFEFIIQNGGIDTDKDYPYRGFDGICDPTKKNAKVVNIDGYEDVPPYD 246

Query: 254 EQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWL 313
           E AL KAV+ QPVS+AI A     Q Y+ G+F G CGT LDH V +VG+G +E+G +YWL
Sbjct: 247 ENALKKAVAHQPVSVAIEASGRALQLYQSGVFTGKCGTSLDHGVVVVGYG-SENGVDYWL 305

Query: 314 IKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPL 347
           ++NSWG  WG+ GY K+ R+     G CGI   +SYP+
Sbjct: 306 VRNSWGTGWGEDGYFKMQRNVRTSTGKCGITMEASYPV 343


>gi|255555337|ref|XP_002518705.1| cysteine protease, putative [Ricinus communis]
 gi|223542086|gb|EEF43630.1| cysteine protease, putative [Ricinus communis]
          Length = 471

 Score =  305 bits (782), Expect = 2e-80,   Method: Compositional matrix adjust.
 Identities = 148/313 (47%), Positives = 210/313 (67%), Gaps = 9/313 (2%)

Query: 40  EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDL 99
           +  V  ++E W+ +HG++Y    EKE R +IFK+NL +I++ N   +R+YK+G N+F+DL
Sbjct: 44  DSQVRRMYEMWLVEHGKAYNALGEKEKRFEIFKDNLRFIDEHNSV-DRSYKVGLNRFADL 102

Query: 100 TNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGC 159
           TN+E++A++ G KM   +    T S  +Y      D+P ++DWR+KGAV P+K+Q +CG 
Sbjct: 103 TNEEYKAMFLGTKMERKNRFLGTRSQ-RYLFKDGDDLPENVDWREKGAVVPVKDQGQCGS 161

Query: 160 CWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIAT 219
           CWAF+ V AVEGI +I +G LI LSEQ+L+DC  + N GC GG  + AF +II N GI T
Sbjct: 162 CWAFSTVGAVEGINQIVTGELISLSEQELVDCDKSYNQGCNGGLMDYAFEFIINNGGIDT 221

Query: 220 EDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQ 278
           E++YPY+A    C   +K A    I  YE+VP  DE +L KAV+ QPVS+AI A    FQ
Sbjct: 222 EEDYPYKASDNICDPNRKNAKVVTIDGYEDVPENDENSLKKAVAHQPVSVAIEAGGRAFQ 281

Query: 279 SYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----- 333
            YK G+F G CGT+LDH V  VG+G TE+G NYW+++NSWG+ WG++GY+++ R+     
Sbjct: 282 LYKSGVFTGRCGTELDHGVVAVGYG-TENGVNYWIVRNSWGSAWGESGYIRMERNVANTK 340

Query: 334 EGLCGIGTRSSYP 346
            G CGI  + SYP
Sbjct: 341 TGKCGIAIQPSYP 353


>gi|224131910|ref|XP_002328138.1| predicted protein [Populus trichocarpa]
 gi|222837653|gb|EEE76018.1| predicted protein [Populus trichocarpa]
          Length = 349

 Score =  305 bits (782), Expect = 2e-80,   Method: Compositional matrix adjust.
 Identities = 148/309 (47%), Positives = 210/309 (67%), Gaps = 11/309 (3%)

Query: 43  VVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTND 102
           +VE+ E W++ HG++Y    EK  R ++FKENL++I++ NKE   +Y LG N+F+DL+++
Sbjct: 43  LVELFESWISGHGKAYNSLEEKLHRFEVFKENLKHIDQRNKEVT-SYWLGLNEFADLSHE 101

Query: 103 EFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWA 162
           EF++ + G     P  R  +S  F Y+++   D+P S+DWR KGAVTP+KNQ  CG CWA
Sbjct: 102 EFKSKFLGLYPEFP--RKKSSEDFSYRDV--VDLPKSIDWRKKGAVTPVKNQGSCGSCWA 157

Query: 163 FAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDE 222
           F+ VAAVEGI +I +GNL  LSEQQL+DC T+ NNGC GG  + AF +I+ N G+  E++
Sbjct: 158 FSTVAAVEGINQIVAGNLTSLSEQQLIDCDTSFNNGCNGGLMDYAFEFIVNNGGLHKEED 217

Query: 223 YPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYK 281
           YPY    GTC   ++      IS Y +VP  DEQ+LLKA++ QP+S+AI A   +FQ Y 
Sbjct: 218 YPYLMEEGTCDEKREEMEVVTISGYHDVPRNDEQSLLKALAHQPLSVAIDASGRDFQFYS 277

Query: 282 EGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLC 337
            G+F+G CGT LDH V  VG+G++  G +Y ++KNSWG  WG+ GY+++ R+    EGLC
Sbjct: 278 GGVFSGPCGTDLDHGVAAVGYGSSS-GIDYIIVKNSWGPKWGERGYLRMKRNTGKPEGLC 336

Query: 338 GIGTRSSYP 346
           GI   +SYP
Sbjct: 337 GINKMASYP 345


>gi|28192373|gb|AAK07730.1| CPR1-like cysteine proteinase [Nicotiana tabacum]
          Length = 374

 Score =  305 bits (781), Expect = 2e-80,   Method: Compositional matrix adjust.
 Identities = 153/318 (48%), Positives = 206/318 (64%), Gaps = 13/318 (4%)

Query: 38  THEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFS 97
           + E  V   +E W+A+HGR+Y    EKE R +IFK+NL +IE+ N  GNRTYK+G NQF+
Sbjct: 41  SDEDQVKNRYEMWLAEHGRAYNALGEKEKRFEIFKDNLRFIEEHNNSGNRTYKVGLNQFA 100

Query: 98  DLTNDEFRALYTGYKMPSPSHRSTTSS---TFKYQNLSMTDVPTSLDWRDKGAVTPIKNQ 154
           DLTN+E+R +Y G K  S + R    S   + +Y +     +P S+DWR +GAV PIKNQ
Sbjct: 101 DLTNEEYRTMYLGTK--SDARRRFVKSKNPSQRYASRPNELMPHSVDWRKRGAVAPIKNQ 158

Query: 155 KECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQN 214
             CG CWAF+ VAAV GI +I +G +I LSEQ+L+DC    N+GC GG  + AF +II N
Sbjct: 159 GSCGSCWAFSTVAAVGGINQIVTGEMITLSEQELVDCDRVQNSGCNGGLMDYAFEFIISN 218

Query: 215 QGIATEDEYPYQAVPGTCSAAQKP-AAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAY 273
            G+ TE  YPY+ V G C   +K      I  YE+VP  +E+AL KAV+ QPV +AI A 
Sbjct: 219 GGMDTEKHYPYRGVEGRCDPVRKNYKVVSIDGYEDVPR-NERALQKAVAHQPVCVAIEAS 277

Query: 274 STEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD 333
              FQ Y  G+F G CG ++DH V +VG+G +EDG +YW+++NSWG  WG+ GY+K+ R+
Sbjct: 278 GRAFQLYSSGVFTGECGEEVDHGVVVVGYG-SEDGVDYWIVRNSWGTKWGENGYVKMERN 336

Query: 334 E-----GLCGIGTRSSYP 346
                 G CGI T +SYP
Sbjct: 337 VKKSHLGKCGIMTEASYP 354


>gi|357124027|ref|XP_003563708.1| PREDICTED: germination-specific cysteine protease 1-like
           [Brachypodium distachyon]
          Length = 334

 Score =  305 bits (781), Expect = 2e-80,   Method: Compositional matrix adjust.
 Identities = 148/322 (45%), Positives = 206/322 (63%), Gaps = 13/322 (4%)

Query: 40  EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKE----GNRTYKLGTNQ 95
           ++++ E +EKWMA+ GR+YKD  EK  R ++FK N  +I+  N      G    KL TN+
Sbjct: 13  DKAMRERYEKWMAEQGRTYKDSTEKARRFEVFKSNAHFIDSHNAATGPGGKSRPKLTTNK 72

Query: 96  FSDLTNDEFRALY-TGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQ 154
           F+DLT DEFR +Y TG+++        T + FK+  +S++DVP S+DWR +GAVT +K+Q
Sbjct: 73  FADLTEDEFRNIYVTGHRVNYRPTSLVTDTVFKFGAVSLSDVPPSIDWRARGAVTSVKDQ 132

Query: 155 KECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQN 214
             C CCWAF++ AAVEGI +I +GN + LS QQL+DCS   N  C  G  +KA+ YI ++
Sbjct: 133 HLCACCWAFSSAAAVEGIHQITTGNQVSLSVQQLVDCSNAANEKCKAGEIDKAYEYIARS 192

Query: 215 QGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYS 274
            G+  + +YPY+   GTC    K A A+IS ++ VP+ +E ALL AV+ QPVS+A+   S
Sbjct: 193 GGLVADQDYPYEGHSGTCRVYGKQAVARISGFQYVPARNETALLLAVAHQPVSVALDGLS 252

Query: 275 TEFQSYKEGIFNGV---CGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIV 331
              Q    GIF      C T L+HA+TIVG+GT E G  YWL+KNSWG+ WGD GY+K  
Sbjct: 253 RALQHIGTGIFGSAGEPCTTNLNHAMTIVGYGTDEHGTRYWLMKNSWGSDWGDKGYVKFA 312

Query: 332 RD-----EGLCGIGTRSSYPLA 348
           RD      G+CG+   +SYP+A
Sbjct: 313 RDVASEINGVCGLALEASYPVA 334


>gi|109390302|gb|ABG33750.1| cysteine protease [Hevea brasiliensis]
          Length = 457

 Score =  305 bits (781), Expect = 2e-80,   Method: Compositional matrix adjust.
 Identities = 150/343 (43%), Positives = 220/343 (64%), Gaps = 16/343 (4%)

Query: 18  MFIIITLLVSCASQVVSSRSTH--------EQSVVEIHEKWMAQHGRSYKDELEKEMRLK 69
           +F   TL  +    ++S   +H        +  V+ I+E W+ +HG++Y    EKE R +
Sbjct: 5   LFFASTLSSASDLSIISYDQSHGTKSSWRTDDEVMAIYEDWLVKHGKAYNSLGEKERRFE 64

Query: 70  IFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQ 129
           +FK+NL +I++ N E NRTY++G N+F+DLTN+E+R++Y G  +           + +Y 
Sbjct: 65  VFKDNLRFIDEHNSE-NRTYRVGLNRFADLTNEEYRSMYLG-ALSGIRRNKLRKISDRYT 122

Query: 130 NLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLL 189
                 +P S+DWR +GAV  +K+Q  CG CWAF+AVAAVEGI KI +G+LI LSEQ+L+
Sbjct: 123 PRVGDSLPDSVDWRKEGAVVGVKDQGSCGSCWAFSAVAAVEGINKIVTGDLISLSEQELV 182

Query: 190 DCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPA-AAKISNYEE 248
           DC  + N GC GG  +  F +II N GI +E++YPY A  G C   +K A    I +YE+
Sbjct: 183 DCDNSYNEGCNGGLMDYGFEFIINNGGIDSEEDYPYLARDGRCDTYRKNARVVSIDSYED 242

Query: 249 VPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDG 308
           VP  +E AL KAV+ QPVS+AI A   +FQ Y  G+F+G CGT LDH V  VG+G TE+G
Sbjct: 243 VPVNNEAALQKAVANQPVSVAIEAGGRDFQLYSSGVFSGRCGTALDHGVVAVGYG-TENG 301

Query: 309 ANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPL 347
            +YW+++NSWG +WG++GY+++ R+     G+CGI   +SYP+
Sbjct: 302 QDYWIVRNSWGKSWGESGYLRMARNIRKPTGICGIAMEASYPI 344


>gi|297791625|ref|XP_002863697.1| hypothetical protein ARALYDRAFT_917391 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297309532|gb|EFH39956.1| hypothetical protein ARALYDRAFT_917391 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 463

 Score =  305 bits (781), Expect = 2e-80,   Method: Compositional matrix adjust.
 Identities = 161/357 (45%), Positives = 230/357 (64%), Gaps = 22/357 (6%)

Query: 1   MVLIFERSG-SFKINTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYK 59
           M+L+    G S+ I+ +    II+   +     VSSRS  E  V  I+E WM +HG+   
Sbjct: 9   MILLLAMIGVSYAIDMS----IISYDENHHISTVSSRSDAE--VERIYEAWMVEHGKKKM 62

Query: 60  DE----LEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPS 115
           ++     EK+ R +IFK+NL YI++ N + N +YKLG  +F+DLTNDE+R++Y G K   
Sbjct: 63  NQNGLGAEKDQRFEIFKDNLRYIDEHNTK-NLSYKLGLTRFADLTNDEYRSMYLGAK--- 118

Query: 116 PSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKI 175
           P  R   +S  +Y+      +P S+DWR +GAV  +K+Q  CG CWAF+ + AVEGI KI
Sbjct: 119 PVKRVLKTSD-RYEARVGDALPDSVDWRKEGAVADVKDQGSCGSCWAFSTIGAVEGINKI 177

Query: 176 RSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAA 235
            +G+LI LSEQ+L+DC T+ N GC GG  + AF +II+N GI TE +YPY+A  G C   
Sbjct: 178 VTGDLISLSEQELVDCDTSYNQGCNGGLMDYAFEFIIKNGGIDTEADYPYKAADGRCDQN 237

Query: 236 QKPA-AAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLD 294
           +K A    I +YE+VP   E +L KA++ QP+S+AI A    FQ Y  G+F+G+CGT+LD
Sbjct: 238 RKNAKVVTIDSYEDVPENSEASLKKALAHQPISVAIEAGGRAFQLYSSGVFDGICGTELD 297

Query: 295 HAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPL 347
           H V  VG+G TE+G +YW+++NSWGN WG++GY+K+ R+     G CGI   +SYP+
Sbjct: 298 HGVVAVGYG-TENGKDYWIVRNSWGNRWGESGYIKMARNIAEPTGKCGIAMEASYPI 353


>gi|50355617|dbj|BAD29957.1| cysteine protease [Daucus carota]
          Length = 437

 Score =  305 bits (781), Expect = 2e-80,   Method: Compositional matrix adjust.
 Identities = 144/313 (46%), Positives = 210/313 (67%), Gaps = 6/313 (1%)

Query: 40  EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDL 99
           +  V+ ++  W+ +HG+SY    EKE R +IFK+NL YI+  N + +R+Y+LG N+F+DL
Sbjct: 42  DDEVMTMYNSWLVKHGKSYNALGEKETRFQIFKDNLRYIDNHNADPDRSYELGLNRFADL 101

Query: 100 TNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGC 159
           TN+E+RA Y G K      + +   + +Y  +   ++P S+DWR+KGAV  +K+Q  CG 
Sbjct: 102 TNEEYRAKYLGTKSRESRPKLSKGPSDRYAPVEGEELPDSIDWREKGAVAAVKDQGSCGS 161

Query: 160 CWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIAT 219
           CWAF+A+ AVEGI +I +G LI LSEQ+L+DC  + N GC GG  + AF +II+N GI +
Sbjct: 162 CWAFSAIGAVEGINQITTGELITLSEQELVDCDRSYNEGCEGGLMDYAFNFIIKNGGIDS 221

Query: 220 EDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQ 278
           + +YPY    GTC+  ++ A    I +YE+VP  DE+AL KA + QP+S+AI A   +FQ
Sbjct: 222 DLDYPYTGRDGTCNQNKENAKVVTIDSYEDVPVYDEKALQKAAANQPISVAIEAGGMDFQ 281

Query: 279 SYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----E 334
            Y  GIF G CGT +DH V +VG+G +E+G +YW+++NSWG  WG+AGY+K+ R+     
Sbjct: 282 LYVSGIFTGKCGTAVDHGVVVVGYG-SEEGMDYWIVRNSWGAAWGEAGYLKMQRNVGKSS 340

Query: 335 GLCGIGTRSSYPL 347
           GLCGI    SYP+
Sbjct: 341 GLCGITIEPSYPV 353


>gi|162463464|ref|NP_001104879.1| cysteine proteinase Mir3 precursor [Zea mays]
 gi|2425066|gb|AAB88263.1| cysteine proteinase Mir3 [Zea mays]
          Length = 480

 Score =  305 bits (781), Expect = 2e-80,   Method: Compositional matrix adjust.
 Identities = 150/324 (46%), Positives = 210/324 (64%), Gaps = 12/324 (3%)

Query: 32  VVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKAN---KEGNRT 88
           +VS     ++    ++ +WMA HGR+Y     +E R ++F++NL YI+  N     G  +
Sbjct: 29  IVSYGERTDEEARRMYAEWMAAHGRTYNAVGAEERRYQVFRDNLRYIDAHNAAADAGVHS 88

Query: 89  YKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAV 148
           ++LG N+F+DLTNDE+ A Y G +      R   +   +Y      D+P S+DWR KGAV
Sbjct: 89  FRLGLNRFADLTNDEYPATYLGARTRPQRDRKLGA---RYHAADNEDLPESVDWRAKGAV 145

Query: 149 TPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAF 208
             +K+Q  CG CWAF+ +AAVEGI +I +G+LI LSEQ+L+DC T+ N GC GG  + AF
Sbjct: 146 AEVKDQGSCGTCWAFSTIAAVEGINQIVTGDLISLSEQELVDCDTSYNQGCNGGLMDYAF 205

Query: 209 AYIIQNQGIATEDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVS 267
            +II N GI TE +YPY+   G C   +K A    I +YE+VP+ DE++L KAV+ QPVS
Sbjct: 206 EFIINNGGIDTEKDYPYKGTDGRCDVNRKNAKVVTIDSYEDVPANDEKSLQKAVANQPVS 265

Query: 268 IAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGY 327
           +AI A  T FQ Y  GIF G CGT+LDH VT VG+G TE+G +YW++KNSWG++WG++GY
Sbjct: 266 VAIEAAGTAFQLYSSGIFTGSCGTRLDHGVTAVGYG-TENGKDYWIVKNSWGSSWGESGY 324

Query: 328 MKIVRD----EGLCGIGTRSSYPL 347
           +++ R+     G CGI    SYPL
Sbjct: 325 VRMERNIKASSGKCGIAVEPSYPL 348


>gi|225428328|ref|XP_002279940.1| PREDICTED: cysteine proteinase-like [Vitis vinifera]
          Length = 707

 Score =  305 bits (781), Expect = 2e-80,   Method: Compositional matrix adjust.
 Identities = 147/304 (48%), Positives = 210/304 (69%), Gaps = 10/304 (3%)

Query: 48  EKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRAL 107
           E W+++HG+ YK   EK  R ++F+ENL +I++ NKE + +Y LG N+F+DL+++EF++ 
Sbjct: 405 ESWVSKHGKVYKSMEEKLHRFEVFRENLNHIDERNKEVS-SYWLGLNEFADLSHEEFKSK 463

Query: 108 YTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVA 167
           Y G +   P  R   S  F+Y++++  D+P S+DWR KGAVT +KNQ  CG CWAF+ VA
Sbjct: 464 YLGLRAEFPRSRDY-SGEFRYRDVA--DLPESVDWRKKGAVTHVKNQGACGSCWAFSTVA 520

Query: 168 AVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQA 227
           AVEGI +I +GNL  LSEQ+L+DC T  N+GC GG  + AFA+I  N G+  ED+YPY  
Sbjct: 521 AVEGINQIVTGNLTTLSEQELIDCDTTFNSGCNGGLMDYAFAFIASNGGLHKEDDYPYLM 580

Query: 228 VPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFN 286
             GTC   ++      IS YE+VP  DE++LLKA++ QP+S+AI A   +FQ Y  G+FN
Sbjct: 581 EEGTCEEQKEDVDIVTISGYEDVPEKDEESLLKALAHQPLSVAIEASGRDFQFYSGGVFN 640

Query: 287 GVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTR 342
           G CGT+LDH V  VG+G+++ G +Y ++KNSWG  WG+ GY+++ R+    EGLCGI   
Sbjct: 641 GPCGTELDHGVAAVGYGSSK-GLDYIIVKNSWGPKWGEKGYIRMKRNTGKTEGLCGINKM 699

Query: 343 SSYP 346
           +SYP
Sbjct: 700 ASYP 703


>gi|297602242|ref|NP_001052232.2| Os04g0203500 [Oryza sativa Japonica Group]
 gi|255675217|dbj|BAF14146.2| Os04g0203500 [Oryza sativa Japonica Group]
          Length = 336

 Score =  305 bits (780), Expect = 2e-80,   Method: Compositional matrix adjust.
 Identities = 147/335 (43%), Positives = 210/335 (62%), Gaps = 10/335 (2%)

Query: 18  MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
           +F I+  L  C++ + +   + + ++   HE+WMAQ+GR YKD+ EK  R ++FK N+ +
Sbjct: 8   LFAILGCLCLCSAVLAARELSDDAAMAARHERWMAQYGRMYKDDAEKARRFEVFKANVAF 67

Query: 78  IEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVP 137
           IE  N  GN  + LG NQF+DLTNDEFR+  T       + R  T   F+ +N+++  +P
Sbjct: 68  IESFN-AGNHKFWLGVNQFADLTNDEFRSTKTNKGFIPSTTRVPTG--FRNENVNIDALP 124

Query: 138 TSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNN 197
            ++DWR KG VTPIK+Q +CGCCWAF+AVAA+EGI K+ +G LI  S  + L   T  + 
Sbjct: 125 ATMDWRTKGVVTPIKDQGQCGCCWAFSAVAAMEGIVKLSTGKLISHSLNKSL--LTVMSM 182

Query: 198 GCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQAL 257
           GC GG  + AF +II+N G+ TE  YPY AV     +    + A I  YE+VP+ +E AL
Sbjct: 183 GCEGGLMDDAFKFIIKNGGLTTESNYPYAAVDDKFKSVSN-SVASIKGYEDVPANNEAAL 241

Query: 258 LKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNS 317
           +KAV+ QPVS+A+      FQ YK G+  G CGT LDH +  +G+G   DG  YWL+KNS
Sbjct: 242 MKAVANQPVSVAVDGGDMTFQFYKGGVMTGSCGTDLDHGIVAIGYGKASDGTKYWLLKNS 301

Query: 318 WGNTWGDAGYMKIVRD----EGLCGIGTRSSYPLA 348
           WG TWG+ G++++ +D     G+CG+    SYP A
Sbjct: 302 WGMTWGENGFLRMEKDISDKRGMCGLAMEPSYPTA 336


>gi|544129|sp|P25803.2|CYSEP_PHAVU RecName: Full=Vignain; AltName: Full=Bean endopeptidase; AltName:
           Full=Cysteine proteinase EP-C1; Flags: Precursor
 gi|20994|emb|CAA44816.1| endopeptidase [Phaseolus vulgaris]
          Length = 362

 Score =  305 bits (780), Expect = 3e-80,   Method: Compositional matrix adjust.
 Identities = 155/344 (45%), Positives = 211/344 (61%), Gaps = 14/344 (4%)

Query: 16  TPMFIIITLLVSCASQVVSSRSTH------EQSVVEIHEKWMAQHGRSYKDELEKEMRLK 69
           T   + + L  S    V +S   H      E+S+ +++E+W + H  S +   EK  R  
Sbjct: 3   TKKLLWVVLSFSLVLGVANSFDFHDKDLASEESLWDLYERWRSHHTVS-RSLGEKHKRFN 61

Query: 70  IFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPS-HRSTTSSTFKY 128
           +FK NL ++   NK  ++ YKL  N+F+D+TN EFR+ Y G K+  P   R T      +
Sbjct: 62  VFKANLMHVHNTNKM-DKPYKLKLNKFADMTNHEFRSTYAGSKVNHPRMFRGTPHENGAF 120

Query: 129 QNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQL 188
               +  VP S+DWR KGAVT +K+Q +CG CWAF+ V AVEGI +I++  L+ LSEQ+L
Sbjct: 121 MYEKVVSVPPSVDWRKKGAVTDVKDQGQCGSCWAFSTVVAVEGINQIKTNKLVALSEQEL 180

Query: 189 LDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQ-KPAAAKISNYE 247
           +DC    N GC GG  E AF +I Q  GI TE  YPY+A  GTC A++    A  I  +E
Sbjct: 181 VDCDKEENQGCNGGLMESAFEFIKQKGGITTESNYPYKAQEGTCDASKVNDLAVSIDGHE 240

Query: 248 EVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTED 307
            VP+ DE ALLKAV+ QPVS+AI A  ++FQ Y EG+F G C T L+H V IVG+GTT D
Sbjct: 241 NVPANDEDALLKAVANQPVSVAIDAGGSDFQFYSEGVFTGDCSTDLNHGVAIVGYGTTVD 300

Query: 308 GANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPL 347
           G NYW+++NSWG  WG+ GY+++ R+    EGLCGI    SYP+
Sbjct: 301 GTNYWIVRNSWGPEWGEHGYIRMQRNISKKEGLCGIAMLPSYPI 344


>gi|146216000|gb|ABQ10202.1| cysteine protease Cp4 [Actinidia deliciosa]
          Length = 463

 Score =  305 bits (780), Expect = 3e-80,   Method: Compositional matrix adjust.
 Identities = 149/339 (43%), Positives = 220/339 (64%), Gaps = 13/339 (3%)

Query: 18  MFIIITLLVSCASQVVSSRSTH-----EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFK 72
           +F+      +    ++S   TH     +   + I+EKW+  HG++Y    EKE R +IFK
Sbjct: 13  LFLCFAFSSALDMSIISYDQTHPPQRTDAEAMAIYEKWLTTHGKAYNAIGEKERRFEIFK 72

Query: 73  ENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLS 132
           +NL ++++ N     +Y++G N+F+DLTN+E+R+++ G  M     RS ++ + +Y   +
Sbjct: 73  DNLRFVDEHNAVAG-SYRVGLNRFADLTNEEYRSMFLGGNMEM-KERSASTKSDRYAFRA 130

Query: 133 MTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS 192
              +P S+DWR+KGAV+P+K+Q +CG CWAF+ ++AVEGI +I +G LI LSEQ+L+DC 
Sbjct: 131 GDKLPGSVDWREKGAVSPVKDQGQCGSCWAFSTISAVEGINQIVTGELISLSEQELVDCD 190

Query: 193 TNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPS 251
            + N GC GG  +  F +II N GI TE++YPY+AV GTC   +K A    I+ YE+VP 
Sbjct: 191 KSYNMGCNGGLMDYGFQFIINNGGIDTEEDYPYRAVDGTCDQFRKNARVVSINGYEDVPE 250

Query: 252 GDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANY 311
            DE +L KAV+ QPVS+AI A    FQ Y+ G+F G CGT LDH V  VG+G TE+G +Y
Sbjct: 251 DDENSLKKAVANQPVSVAIEAGGRAFQLYESGVFTGHCGTNLDHGVVAVGYG-TENGVDY 309

Query: 312 WLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYP 346
           W ++NSWG  WG+ GY+K+ R+     G CGI + +SYP
Sbjct: 310 WTVRNSWGPKWGENGYIKLERNINATSGKCGIASMASYP 348


>gi|115484973|ref|NP_001067630.1| Os11g0255300 [Oryza sativa Japonica Group]
 gi|530335|emb|CAA56844.1| cysteine protease [Oryza sativa Japonica Group]
 gi|5761322|dbj|BAA83472.1| cysteine endopeptidase [Oryza sativa Japonica Group]
 gi|62732672|gb|AAX94791.1| cysteine proteinase (EC 3.4.22.-) - rice [Oryza sativa Japonica
           Group]
 gi|62732673|gb|AAX94792.1| cysteine proteinase (EC 3.4.22.-) - rice [Oryza sativa Japonica
           Group]
 gi|62732674|gb|AAX94793.1| cysteine proteinase (EC 3.4.22.-) - rice [Oryza sativa Japonica
           Group]
 gi|77549615|gb|ABA92412.1| Thiol protease SEN102 precursor, putative, expressed [Oryza sativa
           Japonica Group]
 gi|77549616|gb|ABA92413.1| Thiol protease SEN102 precursor, putative, expressed [Oryza sativa
           Japonica Group]
 gi|77549617|gb|ABA92414.1| Thiol protease SEN102 precursor, putative, expressed [Oryza sativa
           Japonica Group]
 gi|113644852|dbj|BAF27993.1| Os11g0255300 [Oryza sativa Japonica Group]
 gi|125576789|gb|EAZ18011.1| hypothetical protein OsJ_33558 [Oryza sativa Japonica Group]
 gi|215701098|dbj|BAG92522.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 378

 Score =  304 bits (779), Expect = 3e-80,   Method: Compositional matrix adjust.
 Identities = 145/330 (43%), Positives = 207/330 (62%), Gaps = 22/330 (6%)

Query: 38  THEQSVVEIHEKWMAQH--------GRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTY 89
           + E+S+  ++E+W +++        G    D+ E   R  +F EN  YI +AN+ G R +
Sbjct: 33  SSEESLRALYERWRSRYTVSRPAASGGVGNDDGEARRRFNVFVENARYIHEANRRGGRPF 92

Query: 90  KLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSS------TFKYQNLSMTDVPTSLDWR 143
           +L  N+F+D+T DEFR  Y G +  +  HRS +        +F+Y      ++P ++DWR
Sbjct: 93  RLALNKFADMTTDEFRRTYAGSR--ARHHRSLSGGRGGEGGSFRYGGDDEDNLPPAVDWR 150

Query: 144 DKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGS 203
           ++GAVT IK+Q +CG CWAF+ VAAVEG+ KI++G L+ LSEQ+L+DC T  N GC GG 
Sbjct: 151 ERGAVTGIKDQGQCGSCWAFSTVAAVEGVNKIKTGRLVTLSEQELVDCDTGDNQGCDGGL 210

Query: 204 REKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAA-AKISNYEEVPSGDEQALLKAVS 262
            + AF +I +N GI TE  YPY+A  G C+ A+  +    I  YE+VP+ DE AL KAV+
Sbjct: 211 MDYAFQFIKRNGGITTESNYPYRAEQGRCNKAKASSHDVTIDGYEDVPANDESALQKAVA 270

Query: 263 MQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTW 322
            QPV++A+ A   +FQ Y EG+F G CGT LDH V  VG+G T DG  YW++KNSWG  W
Sbjct: 271 NQPVAVAVEASGQDFQFYSEGVFTGECGTDLDHGVAAVGYGITRDGTKYWIVKNSWGEDW 330

Query: 323 GDAGYMKIVR-----DEGLCGIGTRSSYPL 347
           G+ GY+++ R       GLCGI   +SYP+
Sbjct: 331 GERGYIRMQRGVSSDSNGLCGIAMEASYPV 360


>gi|312282059|dbj|BAJ33895.1| unnamed protein product [Thellungiella halophila]
          Length = 379

 Score =  304 bits (779), Expect = 3e-80,   Method: Compositional matrix adjust.
 Identities = 146/308 (47%), Positives = 203/308 (65%), Gaps = 9/308 (2%)

Query: 46  IHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFR 105
           I E W+ +HG+ Y    EKE RL IFK+NL +I   N E N  Y+LG N+F+DL+  E++
Sbjct: 63  IFESWIVKHGKVYDSVAEKERRLTIFKDNLRFITNRNSE-NLGYRLGLNRFADLSLHEYK 121

Query: 106 ALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAA 165
            +  G     P +    SS+ +Y+  +   +P S+DWR++GAVT +K+Q  C  CWAF+ 
Sbjct: 122 EICHGADPKPPRNHVFMSSSDRYKTSAGDVLPKSVDWRNEGAVTEVKDQGHCRSCWAFST 181

Query: 166 VAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPY 225
           V AVEG+ KI +G L+ LSEQ L++C+   NNGC GG  E A+ +I+ N G+ T+++YPY
Sbjct: 182 VGAVEGLNKIVTGELVTLSEQDLINCNKE-NNGCGGGKVETAYEFIVSNGGLGTDNDYPY 240

Query: 226 QAVPGTCSAAQKP--AAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEG 283
           +AV G C    K       I  YE +P+ DE AL+KAV+ QPV+  I + S EFQ Y+ G
Sbjct: 241 KAVNGACDGRLKENIKNVMIDGYENLPANDELALMKAVAHQPVTAVIDSSSREFQLYESG 300

Query: 284 IFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGI 339
           +F+G CGT L+H V +VG+G TE+G NYW+++NSWGNTWG+AGYMK+ R+     GLCGI
Sbjct: 301 VFDGRCGTNLNHGVVVVGYG-TENGRNYWIVRNSWGNTWGEAGYMKMARNIANPRGLCGI 359

Query: 340 GTRSSYPL 347
             R SYPL
Sbjct: 360 AMRVSYPL 367


>gi|226496089|ref|NP_001149658.1| cysteine protease 1 precursor [Zea mays]
 gi|195629242|gb|ACG36262.1| cysteine protease 1 precursor [Zea mays]
          Length = 469

 Score =  304 bits (779), Expect = 4e-80,   Method: Compositional matrix adjust.
 Identities = 149/324 (45%), Positives = 209/324 (64%), Gaps = 12/324 (3%)

Query: 32  VVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKAN---KEGNRT 88
           +VS     E+    ++ +WMA HGR+Y    E+E R ++F++NL Y++  N     G  +
Sbjct: 31  IVSYGERSEEEARRMYAEWMAAHGRTYNAVGEEERRFEVFRDNLRYVDAHNAAADAGVHS 90

Query: 89  YKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAV 148
           ++LG N+F+DLTNDE+RA Y G +      R       +Y      D+P S+DWR KGAV
Sbjct: 91  FRLGLNRFADLTNDEYRATYLGVRSRPQRERRLGD---RYLAGDNEDLPESVDWRAKGAV 147

Query: 149 TPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAF 208
             IK+Q  CG CWAF+ +AAVEGI +I +G++I LSEQ+L+DC T+ N GC GG  + AF
Sbjct: 148 AEIKDQGSCGSCWAFSTIAAVEGINQIVTGDMISLSEQELVDCDTSYNQGCNGGLMDYAF 207

Query: 209 AYIIQNQGIATEDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVS 267
            +II N GI TE++YPY+   G C   +K A    I +YE+VP+  E++L KAV+ QP+S
Sbjct: 208 EFIINNGGIDTEEDYPYKGTDGRCDVNRKNAKVVTIDSYEDVPANSEKSLQKAVANQPIS 267

Query: 268 IAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGY 327
           +AI A    FQ Y  GIF G CGT LDH VT VG+G TE+G +YW++KNSWG++WG++GY
Sbjct: 268 VAIEAGGRAFQLYNSGIFTGTCGTALDHGVTAVGYG-TENGKDYWIVKNSWGSSWGESGY 326

Query: 328 MKIVRD----EGLCGIGTRSSYPL 347
           +++ R+     G CGI    SYPL
Sbjct: 327 VRMERNIKASSGKCGIAVEPSYPL 350


>gi|157093728|gb|ABV22590.1| KDEL-tailed cysteine endopeptidase [Solanum lycopersicum]
          Length = 360

 Score =  304 bits (779), Expect = 4e-80,   Method: Compositional matrix adjust.
 Identities = 150/318 (47%), Positives = 205/318 (64%), Gaps = 16/318 (5%)

Query: 40  EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDL 99
           E+   E++E+W + H  S   + EK  R  +FK N+ Y+   NK+ ++ YKL  N+F+D+
Sbjct: 31  EEKFWELYERWRSHHTVSRSLD-EKHKRFNVFKANVHYVHNFNKK-DKPYKLKLNKFADM 88

Query: 100 TNDEFRALYTGYKMPSPSHR-----STTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQ 154
           TN EFR  Y G K+    HR     S  + TF Y N    +VP S+DWR KGAVTP+K+Q
Sbjct: 89  TNHEFRQHYAGSKIKH--HRTLLGASRANGTFMYANED--NVPPSIDWRKKGAVTPVKDQ 144

Query: 155 KECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQN 214
            +CG CWAF+ V AVEGI +I++  L+ LSEQ+L+DC T  N GC GG  + AF +I + 
Sbjct: 145 GQCGSCWAFSTVVAVEGINQIKTKKLVSLSEQELVDCDTTENQGCNGGLMDPAFDFIKKR 204

Query: 215 QGIATEDEYPYQAVPGTCSAAQK-PAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAY 273
            GI TE+ YPY+A    C   ++      I  +E+VP  DE ALLKAV+ QP+S+AI A 
Sbjct: 205 GGITTEERYPYKAEDDKCDIQKRNTPVVSIDGHEDVPPNDEDALLKAVANQPISVAIDAS 264

Query: 274 STEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR- 332
            ++FQ Y EG+F G CGT+LDH V IVG+GTT DG  YW++KNSWG  WG+ GY+++ R 
Sbjct: 265 GSQFQFYSEGVFTGECGTELDHGVAIVGYGTTVDGTKYWIVKNSWGAGWGEKGYIRMQRK 324

Query: 333 ---DEGLCGIGTRSSYPL 347
              +EGLCGI  + SYP+
Sbjct: 325 VDAEEGLCGIAMQPSYPI 342


>gi|182375363|gb|ACB87490.1| mucunain [Mucuna pruriens]
          Length = 422

 Score =  304 bits (778), Expect = 4e-80,   Method: Compositional matrix adjust.
 Identities = 146/311 (46%), Positives = 211/311 (67%), Gaps = 9/311 (2%)

Query: 44  VEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDE 103
           + ++E+W+ +HG++Y    EK+ R  IFK+NL +I+  N + NRTYKLG N+F+DLTN+E
Sbjct: 1   MSLYEQWLVKHGKAYNALGEKDKRFDIFKDNLRFIDDHNAD-NRTYKLGLNRFADLTNEE 59

Query: 104 FRALYTGYKM-PSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWA 162
           +RA Y G ++ P+     T + + +Y      ++P S+DWR++ AV P+K+Q  CG CWA
Sbjct: 60  YRARYLGTRIDPNRRFVKTKTQSNRYAPRVGDNLPESVDWRNESAVLPVKDQGNCGSCWA 119

Query: 163 FAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDE 222
           F+ + AVEGI KI +G+LI LSEQ+L+DC T+ N GC GG  + A+ +II N GI +E++
Sbjct: 120 FSTIGAVEGINKIVTGDLISLSEQELVDCDTSYNQGCNGGLMDYAYEFIINNGGIDSEED 179

Query: 223 YPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYK 281
           YPY+AV GTC   +K A    I +YE+VP+ DE AL KAV+ QPVS+AI     EFQ Y 
Sbjct: 180 YPYRAVDGTCDQYRKNAKVVTIDSYEDVPANDELALKKAVANQPVSVAIEGGGREFQLYV 239

Query: 282 EGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD-----EGL 336
            G+F G CGT LDH V  VG+G+ + G +YW+++NSWG +WG+ GY+++ R+      G 
Sbjct: 240 SGVFTGRCGTALDHGVVAVGYGSVK-GHDYWIVRNSWGASWGEEGYVRLERNLAKSRSGK 298

Query: 337 CGIGTRSSYPL 347
           CGI    SYP+
Sbjct: 299 CGIAIEPSYPI 309


>gi|413919736|gb|AFW59668.1| cysteine protease 1 [Zea mays]
          Length = 469

 Score =  303 bits (777), Expect = 6e-80,   Method: Compositional matrix adjust.
 Identities = 148/324 (45%), Positives = 209/324 (64%), Gaps = 12/324 (3%)

Query: 32  VVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKAN---KEGNRT 88
           +VS     E+    ++ +WMA HGR+Y    E+E R ++F++NL Y++  N     G  +
Sbjct: 31  IVSYGERSEEEARRMYAEWMAAHGRTYNAVGEEERRFEVFRDNLRYVDAHNAAADAGVHS 90

Query: 89  YKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAV 148
           ++LG N+F+DLTNDE+RA Y G +      R       +Y      D+P S+DWR KGAV
Sbjct: 91  FRLGLNRFADLTNDEYRATYLGVRSRPQRERRLGD---RYLAGDNEDLPESVDWRAKGAV 147

Query: 149 TPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAF 208
             +K+Q  CG CWAF+ +AAVEGI +I +G++I LSEQ+L+DC T+ N GC GG  + AF
Sbjct: 148 AEVKDQGSCGSCWAFSTIAAVEGINQIVTGDMISLSEQELVDCDTSYNQGCNGGLMDYAF 207

Query: 209 AYIIQNQGIATEDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVS 267
            +II N GI TE++YPY+   G C   +K A    I +YE+VP+  E++L KAV+ QP+S
Sbjct: 208 EFIINNGGIDTEEDYPYKGTDGRCDVNRKNAKVVTIDSYEDVPANSEKSLQKAVANQPIS 267

Query: 268 IAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGY 327
           +AI A    FQ Y  GIF G CGT LDH VT VG+G TE+G +YW++KNSWG++WG++GY
Sbjct: 268 VAIEAGGRAFQLYNSGIFTGTCGTALDHGVTAVGYG-TENGKDYWIVKNSWGSSWGESGY 326

Query: 328 MKIVRD----EGLCGIGTRSSYPL 347
           +++ R+     G CGI    SYPL
Sbjct: 327 VRMERNIKASSGKCGIAVEPSYPL 350


>gi|148927394|gb|ABR19828.1| cysteine proteinase [Elaeis guineensis]
          Length = 469

 Score =  303 bits (777), Expect = 7e-80,   Method: Compositional matrix adjust.
 Identities = 148/317 (46%), Positives = 212/317 (66%), Gaps = 10/317 (3%)

Query: 40  EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKAN---KEGNRTYKLGTNQF 96
           +  V  +++ W AQH RSY    E E RL+IF++NL +I++ N     G  +++LG  +F
Sbjct: 40  DDEVHRLYQAWKAQHARSYNALDEDEQRLEIFRDNLRFIDQHNAAANAGKYSFRLGLTRF 99

Query: 97  SDLTNDEFRALYTGYKMP-SPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQK 155
           +DLTN+E+R+ Y G +   S   R++T  + +Y+  S  D+P S+DWRDKGAV  +K+Q 
Sbjct: 100 ADLTNEEYRSTYLGVRTAGSRRRRNSTVGSNRYRFRSSDDLPDSIDWRDKGAVVDVKDQG 159

Query: 156 ECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQ 215
            CG CWAF+ +AAVEGI  I +G+LI LSEQ+L+DC T  N GC GG  + AF +II N 
Sbjct: 160 SCGSCWAFSTIAAVEGINHIVTGDLISLSEQELVDCDTYYNQGCNGGLMDYAFEFIISNG 219

Query: 216 GIATEDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYS 274
           GI T+++YPY    G+C   +K A    I +YE+VP  DE++L KAV+ QPVS+AI A  
Sbjct: 220 GIDTDEDYPYTGRDGSCDQYRKNAHVVTIDSYEDVPINDEKSLQKAVANQPVSVAIEAGG 279

Query: 275 TEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD- 333
             FQ Y+ GIF G CGT+LDH VT +G+G +E+G  YW++KNSWG+ WG++GY+++ R+ 
Sbjct: 280 RAFQLYESGIFTGYCGTELDHGVTAIGYG-SENGKYYWIVKNSWGSDWGESGYIRMERNI 338

Query: 334 ---EGLCGIGTRSSYPL 347
               G CGI   +SYP+
Sbjct: 339 NSATGKCGIAMEASYPI 355


>gi|334904467|gb|AEH26024.1| cysteine peptidase [Ananas comosus]
          Length = 352

 Score =  303 bits (776), Expect = 8e-80,   Method: Compositional matrix adjust.
 Identities = 149/334 (44%), Positives = 210/334 (62%), Gaps = 11/334 (3%)

Query: 18  MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
           +F+ + L V  AS   +SR      +++  E+WMA++GR YKD  EK  R +IFK N+ +
Sbjct: 8   VFLFLFLCVMWASPSAASRDEPSDPMMKRFEEWMAEYGRVYKDNDEKMRRFQIFKNNVNH 67

Query: 78  IEKANKEGNRTYKLGTNQFSDLTNDEFRALYTG-YKMPSPSHRSTTSSTFKYQNLSMTDV 136
           IE  N     +Y LG NQF+D+T  EF A YTG    P    R    S   + +++++ V
Sbjct: 68  IETFNSHNGNSYTLGINQFTDMTKSEFVAQYTGGISRPLNIEREPVVS---FDDVNISAV 124

Query: 137 PTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGN 196
           P S+DWRD GAV  +KNQ  CG CWAFAA+A VEGI KI++G L+ LSEQ++LDC+   +
Sbjct: 125 PQSIDWRDYGAVNEVKNQNPCGSCWAFAAIATVEGIYKIKTGYLVSLSEQEVLDCAV--S 182

Query: 197 NGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQA 256
            GC GG   KA+ +II N G+ TE+ YPYQA  GTC+A   P +A I+ Y  V   DE++
Sbjct: 183 YGCKGGWVNKAYDFIISNNGVTTEENYPYQAYQGTCNANSFPNSAYITGYSYVRRNDERS 242

Query: 257 LLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKN 316
           ++ AVS QP++  I A S  FQ Y  G+F+G CGT L+HA+TI+G+G    G  YW+++N
Sbjct: 243 MMYAVSNQPIAALIDA-SENFQYYNGGVFSGPCGTSLNHAITIIGYGQDSSGTKYWIVRN 301

Query: 317 SWGNTWGDAGYMKIVR----DEGLCGIGTRSSYP 346
           SWG++WG+ GY+++ R      G CGI     +P
Sbjct: 302 SWGSSWGEGGYVRMARGVSSSSGACGIAMSPLFP 335


>gi|118486542|gb|ABK95110.1| unknown [Populus trichocarpa]
          Length = 465

 Score =  303 bits (776), Expect = 8e-80,   Method: Compositional matrix adjust.
 Identities = 152/329 (46%), Positives = 210/329 (63%), Gaps = 17/329 (5%)

Query: 32  VVSSRSTH--------EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANK 83
           ++S   TH        +  V+ ++E+W+ +HG++Y    EKE R +IFK+NL +I++ N 
Sbjct: 28  IISYHQTHATKSSWRTDDEVMAMYEEWLVKHGKNYNALGEKEKRFEIFKDNLMFIDQHNS 87

Query: 84  EGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWR 143
           E NRTY +G N+F+DLTN+EFR++Y G +         TS   +Y       +P S+DWR
Sbjct: 88  E-NRTYTVGLNRFADLTNEEFRSMYLGTRTGHKKRLPKTSD--RYAPRVGDSLPDSVDWR 144

Query: 144 DKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGS 203
            +GAV  +K+Q  CG CWAF+ +AAVEGI KI +G+LI LSEQ+L+DC T+ N GC GG 
Sbjct: 145 KEGAVAEVKDQGGCGSCWAFSTIAAVEGINKIVTGDLIALSEQELVDCDTSYNEGCNGGL 204

Query: 204 REKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVS 262
            + AF +II N GI TED+YPY    G C   +K A    I +YE+VP  DE AL KAV+
Sbjct: 205 MDYAFEFIINNGGIDTEDDYPYLGRDGRCDTYRKNAKVVSIDSYEDVPENDETALKKAVA 264

Query: 263 MQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTW 322
            QPVS+AI      FQ Y  G+F G CGT LDH V  VG+G TE G +YW+++NSWG +W
Sbjct: 265 NQPVSVAIEGGGRNFQLYNSGVFTGECGTSLDHGVAAVGYG-TEKGKDYWIVRNSWGKSW 323

Query: 323 GDAGYMKIVRD----EGLCGIGTRSSYPL 347
           G++GY+++ R+     G CGI    SYP+
Sbjct: 324 GESGYIRMERNIASPTGKCGIAIEPSYPI 352


>gi|357437715|ref|XP_003589133.1| Cysteine proteinase [Medicago truncatula]
 gi|87240770|gb|ABD32628.1| Granulin; Peptidase C1A, papain [Medicago truncatula]
 gi|355478181|gb|AES59384.1| Cysteine proteinase [Medicago truncatula]
          Length = 474

 Score =  303 bits (776), Expect = 9e-80,   Method: Compositional matrix adjust.
 Identities = 157/356 (44%), Positives = 230/356 (64%), Gaps = 17/356 (4%)

Query: 1   MVLIFERSGSFKINTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKD 60
           +VLI     SF ++      II+   +   +  S R+  E  V+ ++E+W+ +HG+SY  
Sbjct: 14  IVLIIS---SFTVSLALDMSIISYDKTHPDKSTSKRTNKE--VLTMYEEWLVKHGKSYNG 68

Query: 61  ELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRS 120
             EK+ R +IFK+NL++I++ N   N TY+LG  +F+DLTN+E+R+ + G K+  P+ R 
Sbjct: 69  LGEKDKRFEIFKDNLKFIDEHNGL-NSTYRLGLTRFADLTNEEYRSKFLGTKI-DPNRRM 126

Query: 121 TT---SSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRS 177
                S + +Y       +P S+DWR +GAV  +K+Q  CG CWAF+A+AAVEGI KI +
Sbjct: 127 KKLGGSKSNRYAPRVGDKLPESVDWRKEGAVVGVKDQASCGSCWAFSAIAAVEGINKIVT 186

Query: 178 GNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQK 237
           G+LI LSEQ+L+DC T+ N GC GG  + AF +II N GI +ED+YPY+AV G C   +K
Sbjct: 187 GDLISLSEQELVDCDTSYNEGCNGGLMDYAFEFIISNGGIDSEDDYPYKAVDGRCDQNRK 246

Query: 238 PA-AAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHA 296
            A    I +YE+VP+ DE AL KAV+ QP+++A+     EFQ Y+ G+F G CGT LDH 
Sbjct: 247 NAKVVTIDDYEDVPAYDELALQKAVANQPIAVAVEGGGREFQLYEYGVFTGRCGTALDHG 306

Query: 297 VTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD-----EGLCGIGTRSSYPL 347
           V  VG+G TE+G +YW+++NSWG +WG+ GY+++ R+      G CGI    SYP+
Sbjct: 307 VAAVGYG-TENGKDYWIVRNSWGGSWGEQGYIRLERNLASSRAGKCGIAIEPSYPI 361


>gi|357437719|ref|XP_003589135.1| Cysteine proteinase [Medicago truncatula]
 gi|355478183|gb|AES59386.1| Cysteine proteinase [Medicago truncatula]
          Length = 457

 Score =  303 bits (775), Expect = 9e-80,   Method: Compositional matrix adjust.
 Identities = 157/356 (44%), Positives = 230/356 (64%), Gaps = 17/356 (4%)

Query: 1   MVLIFERSGSFKINTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKD 60
           +VLI     SF ++      II+   +   +  S R+  E  V+ ++E+W+ +HG+SY  
Sbjct: 14  IVLIIS---SFTVSLALDMSIISYDKTHPDKSTSKRTNKE--VLTMYEEWLVKHGKSYNG 68

Query: 61  ELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRS 120
             EK+ R +IFK+NL++I++ N   N TY+LG  +F+DLTN+E+R+ + G K+  P+ R 
Sbjct: 69  LGEKDKRFEIFKDNLKFIDEHNGL-NSTYRLGLTRFADLTNEEYRSKFLGTKI-DPNRRM 126

Query: 121 TT---SSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRS 177
                S + +Y       +P S+DWR +GAV  +K+Q  CG CWAF+A+AAVEGI KI +
Sbjct: 127 KKLGGSKSNRYAPRVGDKLPESVDWRKEGAVVGVKDQASCGSCWAFSAIAAVEGINKIVT 186

Query: 178 GNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQK 237
           G+LI LSEQ+L+DC T+ N GC GG  + AF +II N GI +ED+YPY+AV G C   +K
Sbjct: 187 GDLISLSEQELVDCDTSYNEGCNGGLMDYAFEFIISNGGIDSEDDYPYKAVDGRCDQNRK 246

Query: 238 PA-AAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHA 296
            A    I +YE+VP+ DE AL KAV+ QP+++A+     EFQ Y+ G+F G CGT LDH 
Sbjct: 247 NAKVVTIDDYEDVPAYDELALQKAVANQPIAVAVEGGGREFQLYEYGVFTGRCGTALDHG 306

Query: 297 VTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD-----EGLCGIGTRSSYPL 347
           V  VG+G TE+G +YW+++NSWG +WG+ GY+++ R+      G CGI    SYP+
Sbjct: 307 VAAVGYG-TENGKDYWIVRNSWGGSWGEQGYIRLERNLASSRAGKCGIAIEPSYPI 361


>gi|146215992|gb|ABQ10198.1| actinidin Act4b [Actinidia eriantha]
          Length = 379

 Score =  303 bits (775), Expect = 9e-80,   Method: Compositional matrix adjust.
 Identities = 148/309 (47%), Positives = 205/309 (66%), Gaps = 9/309 (2%)

Query: 43  VVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTND 102
           V+ + E W+ ++G+SY    EKE R +IFK+NL ++++ N + NR+YK+G NQFSDLT +
Sbjct: 44  VMAMFESWLVEYGKSYNALGEKERRFEIFKDNLRFVDEHNADVNRSYKVGLNQFSDLTLE 103

Query: 103 EFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWA 162
           E+ ++Y G K         T+ + +Y+      +P S+DWR KGAV  +KNQ  CG CW 
Sbjct: 104 EYSSIYLGTKF----DMRMTNVSDRYEPRVGDQLPNSIDWRKKGAVLGVKNQGNCGSCWT 159

Query: 163 FAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-NNGCLGGSREKAFAYIIQNQGIATED 221
           FA +AAVE I +I +GNLI LSEQQ++DC     NNGC GGSR  A+ +II N GI TE 
Sbjct: 160 FAPIAAVEAINQIVTGNLISLSEQQIVDCQRKSPNNGCKGGSRAGAYQFIIDNGGINTEA 219

Query: 222 EYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYK 281
            YPY+A  G C   +      I  YE VP  +E+AL KAVS Q VS+ IA+ S+EF++YK
Sbjct: 220 NYPYKAQDGECDEQKNQKYVTIDRYENVPRKNEKALQKAVSNQLVSVGIASNSSEFKAYK 279

Query: 282 EGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR---DEGLCG 338
            GIF G CG ++DHAVTIVG+G TE G +YW+++NSWG+ WG+ GY+++ R   + G C 
Sbjct: 280 SGIFTGPCGAKIDHAVTIVGYG-TEGGMDYWIVRNSWGSNWGENGYVRMQRNVGNAGTCF 338

Query: 339 IGTRSSYPL 347
           I T  +YP+
Sbjct: 339 IATSPNYPV 347


>gi|168017893|ref|XP_001761481.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162687165|gb|EDQ73549.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 471

 Score =  303 bits (775), Expect = 1e-79,   Method: Compositional matrix adjust.
 Identities = 143/315 (45%), Positives = 205/315 (65%), Gaps = 11/315 (3%)

Query: 40  EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDL 99
           +  ++++  +W+ +H R Y    EK+ R +IFK+NL YI   NK+  ++Y LG N+FSDL
Sbjct: 45  DDGMLDVFHQWLERHSRVYHSLSEKQRRFQIFKDNLHYIHNHNKQ-EKSYWLGLNKFSDL 103

Query: 100 TNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGC 159
           T+DEFRALY G +    +H       F Y+++   ++   +DWR KGAV+ +K+Q  CG 
Sbjct: 104 THDEFRALYLGIRPAGRAHGLRNGDRFIYEDVVAEEM---VDWRKKGAVSDVKDQGSCGS 160

Query: 160 CWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIAT 219
           CWAF+A+ +VEG+  I +G LI LSEQ+L+DC    N GC GG  + AF +II+N GI T
Sbjct: 161 CWAFSAIGSVEGVNAIVTGELISLSEQELVDCDRGQNQGCNGGLMDYAFDFIIKNGGIDT 220

Query: 220 EDEYPYQAVPGTCSAAQKPAA--AKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEF 277
           E++YPY+A  G C  A+K  +    I +Y++VP+  E +LLKAVS  PVS+AI A   +F
Sbjct: 221 EEDYPYKATDGQCDEARKETSKVVVIDDYQDVPTKSESSLLKAVSKNPVSVAIEAGGRDF 280

Query: 278 QSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR----- 332
           Q Y+ G+F G CGT LDH V  VG+GT +DG NYW++KNSWG +WG+ GY+++ R     
Sbjct: 281 QHYQGGVFTGPCGTDLDHGVLAVGYGTDDDGVNYWIVKNSWGPSWGEKGYIRMERMGSNS 340

Query: 333 DEGLCGIGTRSSYPL 347
             G CGI    S+P+
Sbjct: 341 TSGKCGINIEPSFPI 355


>gi|109119897|dbj|BAE96008.1| cysteine proteinase [Triticum aestivum]
          Length = 377

 Score =  303 bits (775), Expect = 1e-79,   Method: Compositional matrix adjust.
 Identities = 143/321 (44%), Positives = 203/321 (63%), Gaps = 15/321 (4%)

Query: 40  EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDL 99
           E+++ +++E+W   H R  +   EK  R   FK N+ +I   NK G+R Y+L  N+F D+
Sbjct: 39  EEALWDLYERWQTAH-RVPRHHAEKHRRFGTFKSNVHFIHSHNKRGDRPYRLRLNRFGDM 97

Query: 100 TNDEFRALYTGYKMPSPSHRSTTSST-----FKYQNLSMTDVPTSLDWRDKGAVTPIKNQ 154
           +  EFRA + G ++ S   R   ++      F Y  ++++D+P S+DWR KGAVT +KNQ
Sbjct: 98  SQAEFRATFAGSRV-SDRRRDGPATPPSVPGFMYAAVNVSDLPRSVDWRQKGAVTGVKNQ 156

Query: 155 KECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQN 214
            +CG CWAF+ V +VEGI  IR+G L+ LSEQ+L+DC T  N+GC GG  + AF YI +N
Sbjct: 157 GKCGSCWAFSTVVSVEGINAIRTGKLVSLSEQELIDCDTADNDGCEGGLMDNAFEYIKKN 216

Query: 215 QGIATEDEYPYQAVPGTCSAAQ----KPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAI 270
            G+ TE  YPY+A  GTC AA+     P    I  +++VP+  E+AL KAV+ QPVS+ I
Sbjct: 217 GGLTTEAAYPYRAANGTCKAAKVAKSSPMVVHIDGHQDVPANSEEALAKAVANQPVSVGI 276

Query: 271 AAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKI 330
            A    F  Y EG+F G CGT+LDH V +VG+G  EDG  YW +KNSWG +WG+ GY+++
Sbjct: 277 DASGKAFMFYSEGVFTGECGTELDHGVAVVGYGVAEDGKAYWTVKNSWGPSWGEKGYIRV 336

Query: 331 VRDE----GLCGIGTRSSYPL 347
            +D     GLCGI   +SY +
Sbjct: 337 EKDSGAEGGLCGIAMEASYAV 357


>gi|50355623|dbj|BAD29960.1| cysteine protease [Daucus carota]
          Length = 460

 Score =  302 bits (773), Expect = 2e-79,   Method: Compositional matrix adjust.
 Identities = 143/326 (43%), Positives = 213/326 (65%), Gaps = 12/326 (3%)

Query: 32  VVSSRSTH-----EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGN 86
           +++   TH     +  ++  +E W+ +HG+SY    EKE R +IFK+N  YI++ N   +
Sbjct: 24  IITYDQTHAVGSTDDVIMAAYESWLVKHGKSYNALGEKEQRFQIFKDNFLYIDEQNAAKD 83

Query: 87  RTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKG 146
           R++KLG N+F+DLTN+E+R+ YTG +    S +  +  + +Y +L+   +P S+DWR+ G
Sbjct: 84  RSFKLGLNRFADLTNEEYRSKYTGIRTKD-SRKKVSGKSQRYASLAGESLPESVDWREHG 142

Query: 147 AVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREK 206
           AV  +K+Q +CG CWAF+ ++AVEGI +I +G LI LSEQ+L+DC  + N GC GG  + 
Sbjct: 143 AVASVKDQGQCGSCWAFSTISAVEGINQIATGKLITLSEQELVDCDRSYNEGCNGGLMDD 202

Query: 207 AFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQP 265
           AF +II N GI ++ +YPY    G C   +K A    I +YE+VP  DE+AL KA + QP
Sbjct: 203 AFQFIINNGGIDSDADYPYTGRDGQCDQYRKNAKVVTIDSYEDVPEYDEKALQKAAANQP 262

Query: 266 VSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDA 325
           +S+AI A   +FQ Y  GIF G CGT LDH V +VG+G TE+G +YW+++NSWG  WG+ 
Sbjct: 263 ISVAIEASGRDFQFYDSGIFTGKCGTDLDHGVVVVGYG-TENGKDYWIVRNSWGADWGEK 321

Query: 326 GYMKIVR----DEGLCGIGTRSSYPL 347
           GY+++ R      G+CGI +  SYP+
Sbjct: 322 GYLRMERGISSKAGICGITSEPSYPV 347


>gi|18422289|ref|NP_568620.1| Granulin repeat cysteine protease family protein [Arabidopsis
           thaliana]
 gi|9757832|dbj|BAB08269.1| cysteine protease component of protease-inhibitor complex
           [Arabidopsis thaliana]
 gi|17065064|gb|AAL32686.1| cysteine protease component of protease-inhibitor complex
           [Arabidopsis thaliana]
 gi|21387153|gb|AAM47980.1| cysteine protease component of protease-inhibitor complex
           [Arabidopsis thaliana]
 gi|332007522|gb|AED94905.1| Granulin repeat cysteine protease family protein [Arabidopsis
           thaliana]
          Length = 463

 Score =  302 bits (773), Expect = 2e-79,   Method: Compositional matrix adjust.
 Identities = 153/356 (42%), Positives = 227/356 (63%), Gaps = 27/356 (7%)

Query: 13  INTTPMFIIITLL-VSCA-----------SQVVSSRSTHEQSVVEIHEKWMAQHGRSYKD 60
           +  +PM +++ ++ VS A             + +  S  +  V  I+E WM +HG+   +
Sbjct: 4   LKLSPMILLLAMIGVSYAMDMSIISYDENHHITTETSRSDSEVERIYEAWMVEHGKKKMN 63

Query: 61  E----LEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSP 116
           +     EK+ R +IFK+NL +I++ N + N +YKLG  +F+DLTN+E+R++Y G K   P
Sbjct: 64  QNGLGAEKDQRFEIFKDNLRFIDEHNTK-NLSYKLGLTRFADLTNEEYRSMYLGAK---P 119

Query: 117 SHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIR 176
           + R   +S  +YQ      +P S+DWR +GAV  +K+Q  CG CWAF+ + AVEGI KI 
Sbjct: 120 TKRVLKTSD-RYQARVGDALPDSVDWRKEGAVADVKDQGSCGSCWAFSTIGAVEGINKIV 178

Query: 177 SGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQ 236
           +G+LI LSEQ+L+DC T+ N GC GG  + AF +II+N GI TE +YPY+A  G C   +
Sbjct: 179 TGDLISLSEQELVDCDTSYNQGCNGGLMDYAFEFIIKNGGIDTEADYPYKAADGRCDQNR 238

Query: 237 KPA-AAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDH 295
           K A    I +YE+VP   E +L KA++ QP+S+AI A    FQ Y  G+F+G+CGT+LDH
Sbjct: 239 KNAKVVTIDSYEDVPENSEASLKKALAHQPISVAIEAGGRAFQLYSSGVFDGLCGTELDH 298

Query: 296 AVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPL 347
            V  VG+G TE+G +YW+++NSWGN WG++GY+K+ R+     G CGI   +SYP+
Sbjct: 299 GVVAVGYG-TENGKDYWIVRNSWGNRWGESGYIKMARNIEAPTGKCGIAMEASYPI 353


>gi|118124|sp|P25250.1|CYSP2_HORVU RecName: Full=Cysteine proteinase EP-B 2; Flags: Precursor
 gi|1146118|gb|AAA85036.1| cysteine proteinase EPB2 precursor [Hordeum vulgare subsp. vulgare]
          Length = 373

 Score =  302 bits (773), Expect = 2e-79,   Method: Compositional matrix adjust.
 Identities = 140/317 (44%), Positives = 202/317 (63%), Gaps = 11/317 (3%)

Query: 40  EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDL 99
           E+++ +++E+W + H R  +   EK  R   FK N  +I   NK G+  Y+L  N+F D+
Sbjct: 39  EEALWDLYERWQSAH-RVRRHHAEKHRRFGTFKSNAHFIHSHNKRGDHPYRLHLNRFGDM 97

Query: 100 TNDEFRALYTG-YKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECG 158
              EFRA + G  +  +PS +  +   F Y  L+++D+P S+DWR KGAVT +K+Q +CG
Sbjct: 98  DQAEFRATFVGDLRRDTPS-KPPSVPGFMYAALNVSDLPPSVDWRQKGAVTGVKDQGKCG 156

Query: 159 CCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIA 218
            CWAF+ V +VEGI  IR+G+L+ LSEQ+L+DC T  N+GC GG  + AF YI  N G+ 
Sbjct: 157 SCWAFSTVVSVEGINAIRTGSLVSLSEQELIDCDTADNDGCQGGLMDNAFEYIKNNGGLI 216

Query: 219 TEDEYPYQAVPGTCSAAQ----KPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYS 274
           TE  YPY+A  GTC+ A+     P    I  +++VP+  E+ L +AV+ QPVS+A+ A  
Sbjct: 217 TEAAYPYRAARGTCNVARAAQNSPVVVHIDGHQDVPANSEEDLARAVANQPVSVAVEASG 276

Query: 275 TEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRDE 334
             F  Y EG+F G CGT+LDH V +VG+G  EDG  YW +KNSWG +WG+ GY+++ +D 
Sbjct: 277 KAFMFYSEGVFTGECGTELDHGVAVVGYGVAEDGKAYWTVKNSWGPSWGEQGYIRVEKDS 336

Query: 335 ----GLCGIGTRSSYPL 347
               GLCGI   +SYP+
Sbjct: 337 GASGGLCGIAMEASYPV 353


>gi|357465603|ref|XP_003603086.1| Cysteine proteinase [Medicago truncatula]
 gi|355492134|gb|AES73337.1| Cysteine proteinase [Medicago truncatula]
          Length = 474

 Score =  302 bits (773), Expect = 2e-79,   Method: Compositional matrix adjust.
 Identities = 158/359 (44%), Positives = 228/359 (63%), Gaps = 28/359 (7%)

Query: 14  NTTPMFIIITLLV------SCASQVVSSRSTH--------EQSVVEIHEKWMAQHGR--S 57
           N +PM +I+ +        +    ++S   TH        ++ V  I+E+W  +HG+  +
Sbjct: 6   NRSPMLVILIVFTLFTATFALDMSIISYDKTHSDKSSRRSDKEVKNIYEEWRVKHGKLNN 65

Query: 58  YKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPS 117
             D  EK+ R +IFK+NL++I++ N E NRTYK+G N+F+DL+N+E+R+ Y G K+    
Sbjct: 66  NIDGSEKDKRFEIFKDNLKFIDEHNAE-NRTYKVGLNRFADLSNEEYRSRYLGTKIDPIG 124

Query: 118 H---RSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITK 174
               R+ T S  +Y       +P S+DWR +GAV  +K+Q  CG CWAF+ +AAVEGI K
Sbjct: 125 MMMARTKTRSN-RYAPSVGDKLPKSVDWRSQGAVVQVKDQGSCGSCWAFSTIAAVEGINK 183

Query: 175 IRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSA 234
           I +G L+ LSEQ+L+DC    N GC GG  E AF +II N GI ++++YPY+ V G C  
Sbjct: 184 IVTGELVSLSEQELVDCDRTVNAGCDGGLMEYAFEFIINNGGIDSDEDYPYRGVDGKCDQ 243

Query: 235 AQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQL 293
            +K A    I +YE+VP+ DE AL KAV+ QP+S+AI A   EFQ Y  GIF G CGT L
Sbjct: 244 YKKNARVVSIDDYEQVPAYDELALKKAVANQPISVAIEAGGREFQLYVSGIFTGKCGTAL 303

Query: 294 DHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD-----EGLCGIGTRSSYPL 347
           DH VT VG+G TE+G +YW+++NSWG +WG++GY+++ R+      G CGI  +SSYP+
Sbjct: 304 DHGVTAVGYG-TENGVDYWIVRNSWGKSWGESGYVRMERNLAASVAGKCGIVMQSSYPI 361


>gi|2342494|dbj|BAA21848.1| bromelain [Ananas comosus]
 gi|2463582|dbj|BAA22543.1| FB31 precursor [Ananas comosus]
          Length = 352

 Score =  301 bits (772), Expect = 2e-79,   Method: Compositional matrix adjust.
 Identities = 148/333 (44%), Positives = 211/333 (63%), Gaps = 9/333 (2%)

Query: 18  MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
           +F+ + L V  AS   +SR      +++  E+WMA++GR YKD  EK  R +IFK N+ +
Sbjct: 8   VFLFLFLCVMWASPSAASRDEPSDPMMKRFEEWMAEYGRVYKDNDEKMRRFQIFKNNVNH 67

Query: 78  IEKANKEGNRTYKLGTNQFSDLTNDEFRALYTG-YKMPSPSHRSTTSSTFKYQNLSMTDV 136
           IE  N     +Y LG N+F+D+TN+EF A YTG    P    +    S   + +++++ V
Sbjct: 68  IETFNNRNGNSYTLGINKFTDMTNNEFVAQYTGGISRPLNIEKEPVVS---FDDVNISAV 124

Query: 137 PTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGN 196
             S+DWRD GAVT +K+Q  CG CWAF+A+A VEGI KI +G L+ LSEQ++LDC+ +  
Sbjct: 125 GQSIDWRDYGAVTEVKDQNPCGSCWAFSAIATVEGIYKIVTGYLVSLSEQEVLDCAVS-- 182

Query: 197 NGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQA 256
           NGC GG  + A+ +II N G+A+E +YPYQA  G C+A   P +A I+ Y  V S DE +
Sbjct: 183 NGCDGGFVDNAYDFIISNNGVASEADYPYQAYQGDCAANSWPNSAYITGYSYVRSNDESS 242

Query: 257 LLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKN 316
           +  AV  QP++ AI A    FQ Y  G+F+G CGT L+HA+TI+G+G    G  YW++KN
Sbjct: 243 MKYAVWNQPIAAAIDASGDNFQYYNGGVFSGPCGTSLNHAITIIGYGQDSSGTQYWIVKN 302

Query: 317 SWGNTWGDAGYMKIVR---DEGLCGIGTRSSYP 346
           SWG++WG+ GY+++ R     GLCGI     YP
Sbjct: 303 SWGSSWGERGYIRMARGVSSSGLCGIAMDPLYP 335


>gi|226533314|ref|NP_001150119.1| xylem cysteine proteinase 2 [Zea mays]
 gi|195636886|gb|ACG37911.1| xylem cysteine proteinase 2 precursor [Zea mays]
 gi|223946183|gb|ACN27175.1| unknown [Zea mays]
 gi|413951209|gb|AFW83858.1| Xylem cysteine proteinase 2 [Zea mays]
          Length = 385

 Score =  301 bits (772), Expect = 2e-79,   Method: Compositional matrix adjust.
 Identities = 153/335 (45%), Positives = 210/335 (62%), Gaps = 27/335 (8%)

Query: 37  STHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQF 96
           S+HE S+ E+ E+W+++H R+Y    EK  R ++FK+NL +I++ N++ + +Y LG N+F
Sbjct: 50  SSHE-SLAELFERWLSRHRRAYASLEEKLRRFQVFKDNLHHIDETNRKVS-SYWLGLNEF 107

Query: 97  SDLTNDEFRALYTGYK------MPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTP 150
           +DLT+DEF+A Y G +                     Y+ +    +P S+DWR KGAVT 
Sbjct: 108 ADLTHDEFKATYLGLRSSVGDGGSGIDDDDEPEEEEGYEGVDGASLPKSVDWRSKGAVTG 167

Query: 151 IKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAY 210
           +KNQ +CG CWAF+ VAAVEGI +I +GNL  LSEQ+L+DC T+GNNGC GG  + AF+Y
Sbjct: 168 VKNQGQCGSCWAFSTVAAVEGINQIVTGNLTALSEQELIDCDTDGNNGCNGGLMDYAFSY 227

Query: 211 IIQNQGIATEDEYPYQAVPGTC---------------SAAQKPAAAKISNYEEVPSGDEQ 255
           I  N G+ TE+ YPY    GTC                A    A   IS YE+VP  +EQ
Sbjct: 228 IAHNGGLHTEEAYPYLMEEGTCQRSSSSEKKWPGSSEDANDDAAVVTISGYEDVPRNNEQ 287

Query: 256 ALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIK 315
           ALLKA++ QPVS+AI A    FQ Y  G+F+G CGTQLDH V  VG+GT   G +Y ++K
Sbjct: 288 ALLKALAQQPVSVAIEASGRNFQFYSGGVFDGPCGTQLDHGVAAVGYGTAAKGHDYIIVK 347

Query: 316 NSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYP 346
           NSWG +WG+ GY+++ R     +GLCGI   +SYP
Sbjct: 348 NSWGPSWGEKGYIRMRRGTGKRQGLCGINKMASYP 382


>gi|297792329|ref|XP_002864049.1| hypothetical protein ARALYDRAFT_495086 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297309884|gb|EFH40308.1| hypothetical protein ARALYDRAFT_495086 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 361

 Score =  301 bits (772), Expect = 2e-79,   Method: Compositional matrix adjust.
 Identities = 154/347 (44%), Positives = 216/347 (62%), Gaps = 26/347 (7%)

Query: 19  FIIITLLVSCASQVVSSRSTHEQ------SVVEIHEKWMAQH--GRSYKDELEKEMRLKI 70
           FI++ L +    +   S   HE+      S+ E++E+W + H   RS +   EK  R  +
Sbjct: 4   FIVLALCMLMVLETTKSLDFHEKDVESEDSLWELYERWKSHHTIARSLE---EKAKRFNV 60

Query: 71  FKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHR-----STTSST 125
           FK N+++I + NK+ N +YKL  N+F D+T++EFR  Y G  +    HR       T+ +
Sbjct: 61  FKHNVKHIHETNKKEN-SYKLKLNKFGDMTSEEFRRTYAGSNIKH--HRMFQGERQTTKS 117

Query: 126 FKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSE 185
           F Y N+    +PTS+DWR  GAVTP+KNQ +CG CWAF+ V AVEGI +IR+  L  LSE
Sbjct: 118 FMYANVDT--LPTSVDWRKNGAVTPVKNQGQCGSCWAFSTVVAVEGINQIRTKKLTSLSE 175

Query: 186 QQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPA-AAKIS 244
           Q+L+DC TN N GC GG  + AF +I +  G+ +E  YPY+A   TC   ++ A    I 
Sbjct: 176 QELVDCDTNKNQGCNGGLMDLAFEFIKEKGGLTSELVYPYKASDETCDTNKENAPVVSID 235

Query: 245 NYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGT 304
            +E+VP   E  L+KAV+ QPVS+AI A  ++FQ Y EG+F G CGT+L+H V +VG+GT
Sbjct: 236 GHEDVPKNSEVDLMKAVAHQPVSVAIDAGGSDFQFYSEGVFTGRCGTELNHGVAVVGYGT 295

Query: 305 TEDGANYWLIKNSWGNTWGDAGYMKIVR----DEGLCGIGTRSSYPL 347
           T DG  YW++KNSWG  WG+ GY+++ R     EGLCGI   +SYPL
Sbjct: 296 TIDGTKYWIVKNSWGEEWGEKGYIRMQRGIRHKEGLCGIAMEASYPL 342


>gi|356515080|ref|XP_003526229.1| PREDICTED: vignain-like [Glycine max]
          Length = 284

 Score =  301 bits (772), Expect = 2e-79,   Method: Compositional matrix adjust.
 Identities = 150/286 (52%), Positives = 196/286 (68%), Gaps = 15/286 (5%)

Query: 72  KENLEYIEKANKEGNRTYKLGTNQFSDLTNDEF---RALYTGYKMPSPSHRSTTSSTFKY 128
           KEN+ YIE  N   N+ YKLG NQF+DLT++EF   R  + G+   S    +T ++TFKY
Sbjct: 5   KENVNYIEAFNNAANKPYKLGINQFADLTSEEFIVPRNRFNGHMRFS----NTRTTTFKY 60

Query: 129 QNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQL 188
           +N+++  +P S+DWR KGAVTPIKNQ  CGCCWAF+A+AA EGI KI +G L+ LSEQ++
Sbjct: 61  ENVTV--LPDSIDWRQKGAVTPIKNQGSCGCCWAFSAIAATEGIHKISTGKLVSLSEQEV 118

Query: 189 LDCSTNG-NNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPA-AAKISNY 246
           +DC T G ++GC GG  + AF +IIQN GI TE  YPY+ V G C+  ++   A  I+ Y
Sbjct: 119 VDCDTKGTDHGCEGGYMDGAFKFIIQNHGINTEASYPYKGVDGKCNIKEEAVHATTITGY 178

Query: 247 EEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTE 306
           E+VP  +E+AL KAV+ QPVS+AI A   +FQ YK GIF G CGT+LDH VT VG+G   
Sbjct: 179 EDVPINNEKALQKAVANQPVSVAIDARGADFQFYKSGIFTGSCGTELDHGVTAVGYGENN 238

Query: 307 DGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPLA 348
           +G  YWL+KNSWG  WG+ GY  + R     EG+CGI   +SYP A
Sbjct: 239 EGTKYWLVKNSWGTEWGEEGYTMMQRGVKAVEGICGIAMLASYPTA 284


>gi|118120|sp|P25249.1|CYSP1_HORVU RecName: Full=Cysteine proteinase EP-B 1; Flags: Precursor
 gi|1146116|gb|AAA85035.1| cysteine proteinase EPB1 precursor [Hordeum vulgare subsp. vulgare]
          Length = 371

 Score =  301 bits (772), Expect = 3e-79,   Method: Compositional matrix adjust.
 Identities = 138/316 (43%), Positives = 198/316 (62%), Gaps = 9/316 (2%)

Query: 40  EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDL 99
           E+++ +++E+W + H R  +   EK  R   FK N  +I   NK G+  Y+L  N+F D+
Sbjct: 39  EEALWDLYERWQSAH-RVRRHHAEKHRRFGTFKSNAHFIHSHNKRGDHPYRLHLNRFGDM 97

Query: 100 TNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGC 159
              EFRA + G        +  +   F Y  L+++D+P S+DWR KGAVT +K+Q +CG 
Sbjct: 98  DQAEFRATFVGDLRRDTPAKPPSVPGFMYAALNVSDLPPSVDWRQKGAVTGVKDQGKCGS 157

Query: 160 CWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIAT 219
           CWAF+ V +VEGI  IR+G+L+ LSEQ+L+DC T  N+GC GG  + AF YI  N G+ T
Sbjct: 158 CWAFSTVVSVEGINAIRTGSLVSLSEQELIDCDTADNDGCQGGLMDNAFEYIKNNGGLIT 217

Query: 220 EDEYPYQAVPGTCSAAQ----KPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYST 275
           E  YPY+A  GTC+ A+     P    I  +++VP+  E+ L +AV+ QPVS+A+ A   
Sbjct: 218 EAAYPYRAARGTCNVARAAQNSPVVVHIDGHQDVPANSEEDLARAVANQPVSVAVEASGK 277

Query: 276 EFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRDE- 334
            F  Y EG+F G CGT+LDH V +VG+G  EDG  YW +KNSWG +WG+ GY+++ +D  
Sbjct: 278 AFMFYSEGVFTGDCGTELDHGVAVVGYGVAEDGKAYWTVKNSWGPSWGEQGYIRVEKDSG 337

Query: 335 ---GLCGIGTRSSYPL 347
              GLCGI   +SYP+
Sbjct: 338 ASGGLCGIAMEASYPV 353


>gi|225446523|ref|XP_002275891.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP2 [Vitis vinifera]
          Length = 358

 Score =  301 bits (772), Expect = 3e-79,   Method: Compositional matrix adjust.
 Identities = 149/307 (48%), Positives = 202/307 (65%), Gaps = 12/307 (3%)

Query: 47  HEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRA 106
           +E+W+ QHGR YK+  E +    I++ N+ +I   N + N ++ L  NQF+D+TN+E++A
Sbjct: 45  YERWLVQHGRRYKNRDEWQRHFGIYQSNVRFINYINAQ-NFSFTLTDNQFADMTNEEYKA 103

Query: 107 LYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAV 166
           LY G      S ++   S+FK +   +  +P S+DWR  GAVTP++NQ ECG CWAF+ V
Sbjct: 104 LYMGLGTSETSRKN--QSSFKRERSKV--LPISVDWRKMGAVTPVRNQGECGSCWAFSTV 159

Query: 167 AAVEGITKIRSGNLIQLSEQQLLDCSTN-GNNGCLGGSREKAFAYIIQNQGIATEDEYPY 225
           AAVEGI KIR+G L+ LSEQ+LLDC  + GN GC GG    AF +I QN GI T   YPY
Sbjct: 160 AAVEGINKIRTGKLVSLSEQELLDCDIDSGNEGCNGGYMVNAFKFIKQNGGITTARNYPY 219

Query: 226 QAVPGTCSAAQKP-AAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGI 284
               G C+  +      KIS YE VP  +E+ L  AV+ QPVS+AI A   EFQ Y +GI
Sbjct: 220 IGEQGICNKDKAANHVVKISGYETVPPNNEKILQAAVAKQPVSVAIDAGGYEFQLYSKGI 279

Query: 285 FNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR----DEGLCGIG 340
           FNG CG QL+HAVT++G+G  ++G  YWL+KNSWG  WG+AGY +++R    DEG+CGI 
Sbjct: 280 FNGFCGKQLNHAVTVIGYG-EDNGKKYWLVKNSWGTGWGEAGYARMIRDSRDDEGICGIA 338

Query: 341 TRSSYPL 347
             +SYP+
Sbjct: 339 MEASYPI 345


>gi|326493368|dbj|BAJ85145.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 436

 Score =  301 bits (772), Expect = 3e-79,   Method: Compositional matrix adjust.
 Identities = 147/333 (44%), Positives = 213/333 (63%), Gaps = 12/333 (3%)

Query: 22  ITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKA 81
           ++L  +    +VS     E+ V  ++ +WMA+HG +Y    E+E R + F++NL YI++ 
Sbjct: 18  VSLAAAADMSIVSYGERSEEEVRRMYAEWMAEHGSTYNAIGEEERRFEAFRDNLRYIDQH 77

Query: 82  N---KEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPT 138
           N     G  +++LG N+F+DLTN+E+R+ Y G +      R  ++   +YQ     ++P 
Sbjct: 78  NAAADAGVHSFRLGLNRFADLTNEEYRSTYLGARTKPDRERKLSA---RYQAADNDELPE 134

Query: 139 SLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNG 198
           S+DWR KGAV  +K+Q  CG CWAF+A+AAVEGI +I +G++I LSEQ+L+DC T+ N G
Sbjct: 135 SVDWRKKGAVGAVKDQGGCGSCWAFSAIAAVEGINQIVTGDMIPLSEQELVDCDTSYNQG 194

Query: 199 CLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQAL 257
           C GG  + AF +II N GI +E++YPY+     C A +K A    I  YE+VP   E++L
Sbjct: 195 CNGGLMDYAFEFIINNGGIDSEEDYPYKERDNRCDANKKNAKVVTIDGYEDVPVNSEKSL 254

Query: 258 LKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNS 317
            KAV+ QP+S+AI A    FQ YK GIF G CGT LDH V  VG+G TE+G +YWL++NS
Sbjct: 255 QKAVANQPISVAIEAGGRAFQLYKSGIFTGTCGTALDHGVAAVGYG-TENGKDYWLVRNS 313

Query: 318 WGNTWGDAGYMKIVRD----EGLCGIGTRSSYP 346
           WG+ WG+ GY+++ R+     G CGI    SYP
Sbjct: 314 WGSVWGEDGYIRMERNIKASSGKCGIAVEPSYP 346


>gi|255540425|ref|XP_002511277.1| cysteine protease, putative [Ricinus communis]
 gi|46395620|sp|O65039.1|CYSEP_RICCO RecName: Full=Vignain; AltName: Full=Cysteine endopeptidase; Flags:
           Precursor
 gi|2944446|gb|AAC62396.1| cysteine endopeptidase precursor [Ricinus communis]
 gi|223550392|gb|EEF51879.1| cysteine protease, putative [Ricinus communis]
          Length = 360

 Score =  301 bits (771), Expect = 3e-79,   Method: Compositional matrix adjust.
 Identities = 150/312 (48%), Positives = 203/312 (65%), Gaps = 16/312 (5%)

Query: 46  IHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFR 105
           ++E+W + H  S +   EK+ R  +FK N  ++  ANK  ++ YKL  N+F+D+TN EFR
Sbjct: 37  LYERWRSHHTVS-RSLHEKQKRFNVFKHNAMHVHNANKM-DKPYKLKLNKFADMTNHEFR 94

Query: 106 ALYTGYKMPSPSHR-----STTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCC 160
             Y+G K+    HR        + TF Y+ +    VP S+DWR KGAVT +K+Q +CG C
Sbjct: 95  NTYSGSKVKH--HRMFRGGPRGNGTFMYEKVDT--VPASVDWRKKGAVTSVKDQGQCGSC 150

Query: 161 WAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATE 220
           WAF+ + AVEGI +I++  L+ LSEQ+L+DC T+ N GC GG  + AF +I Q  GI TE
Sbjct: 151 WAFSTIVAVEGINQIKTNKLVSLSEQELVDCDTDQNQGCNGGLMDYAFEFIKQRGGITTE 210

Query: 221 DEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQS 279
             YPY+A  GTC  +++ A A  I  +E VP  DE ALLKAV+ QPVS+AI A  ++FQ 
Sbjct: 211 ANYPYEAYDGTCDVSKENAPAVSIDGHENVPENDENALLKAVANQPVSVAIDAGGSDFQF 270

Query: 280 YKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR----DEG 335
           Y EG+F G CGT+LDH V IVG+GTT DG  YW +KNSWG  WG+ GY+++ R     EG
Sbjct: 271 YSEGVFTGSCGTELDHGVAIVGYGTTIDGTKYWTVKNSWGPEWGEKGYIRMERGISDKEG 330

Query: 336 LCGIGTRSSYPL 347
           LCGI   +SYP+
Sbjct: 331 LCGIAMEASYPI 342


>gi|18141283|gb|AAL60579.1|AF454957_1 senescence-associated cysteine protease [Brassica oleracea]
          Length = 460

 Score =  301 bits (771), Expect = 3e-79,   Method: Compositional matrix adjust.
 Identities = 150/357 (42%), Positives = 224/357 (62%), Gaps = 24/357 (6%)

Query: 9   GSFKINTTPMFIIITLLVSCASQVVSSRSTH---------EQSVVEIHEKWMAQHGRSYK 59
           GS K+    + ++I +  +    ++S    H         +  V  I+E WM +HG+  +
Sbjct: 2   GSVKVTILLLAMMIGVSYAADMSIISYDEKHHITAENERSDAEVARIYEAWMEKHGKKAQ 61

Query: 60  DE----LEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPS 115
                  EK+ R +IFK+NL +I++ N + N +YKLG  +F+DLTN+E+R++Y G K   
Sbjct: 62  SNGLVGEEKDQRFEIFKDNLRFIDEHNNK-NLSYKLGLTRFADLTNEEYRSIYLGAK--- 117

Query: 116 PSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKI 175
            S +    ++ +YQ      +P S+DWR +GAV  +K+Q  CG CWAF+ + AVEGI KI
Sbjct: 118 -SKKRVLKTSDRYQPRVGDAIPDSVDWRKEGAVAAVKDQGSCGSCWAFSTIGAVEGINKI 176

Query: 176 RSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAA 235
            +G+LI LSEQ+L+DC T+ N GC GG  + AF +II+N GI TE++YPY+A  G C   
Sbjct: 177 VTGDLISLSEQELVDCDTSYNQGCNGGLMDYAFEFIIKNGGIDTEEDYPYKAADGRCDQT 236

Query: 236 QKPA-AAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLD 294
           +K A    I  YE+VP  +E AL K ++ QP+S+AI A    FQ Y  G+F+G+CGT+LD
Sbjct: 237 RKNAKVVTIDAYEDVPENNEAALKKTLANQPISVAIEAGGRAFQLYSSGVFDGICGTELD 296

Query: 295 HAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPL 347
           H V  VG+G TE+G +YW+++NSWG +WG++GY+K+ R+     G CGI   +SYP+
Sbjct: 297 HGVVAVGYG-TENGKDYWIVRNSWGGSWGESGYIKMARNIAEPTGKCGIAMEASYPI 352


>gi|217073894|gb|ACJ85307.1| unknown [Medicago truncatula]
 gi|388507498|gb|AFK41815.1| unknown [Medicago truncatula]
          Length = 362

 Score =  301 bits (771), Expect = 3e-79,   Method: Compositional matrix adjust.
 Identities = 154/349 (44%), Positives = 216/349 (61%), Gaps = 22/349 (6%)

Query: 15  TTPMFIIITLLVSCASQVVSSRSTHE------QSVVEIHEKWMAQHGRSYKDELEKEMRL 68
           TT   ++I L ++    V  S   H+      +S+ +++E+W + H  S ++  EK+ R 
Sbjct: 2   TTKKLLLIVLSIALVLVVSESFDFHDKDVSSDESLWDLYERWRSHHTVS-RNLNEKQKRF 60

Query: 69  KIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHR-----STTS 123
            +FK N+ ++   NK  ++ YKL  N+F+D+TN EF+  Y G K+    HR        S
Sbjct: 61  NVFKSNVMHVHNTNKM-DKPYKLKLNKFADMTNHEFKTTYAGSKVNH--HRMFRGTPRVS 117

Query: 124 STFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQL 183
            TF Y+N   T  P S+DWR KGAVT +K+Q +CG CWAF+ V AVEGI +I++  L+ L
Sbjct: 118 GTFMYENF--TKAPASVDWRKKGAVTDVKDQGQCGSCWAFSTVVAVEGINQIKTNRLVPL 175

Query: 184 SEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPA-AAK 242
           SEQ+L+DC    N GC GG  E AF YI Q  G+ TE  YPY A  G+C A ++      
Sbjct: 176 SEQELIDCDNQENQGCNGGLMEYAFEYIKQKGGVTTESYYPYTANDGSCDATKENVPTVS 235

Query: 243 ISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGF 302
           I  +E VP+ DE ALLKAV+ QPVS+AI A  ++FQ Y EG+F G CG +L+H V IVG+
Sbjct: 236 IDGHETVPANDEDALLKAVANQPVSVAIDAGGSDFQFYSEGVFTGDCGKELNHGVAIVGY 295

Query: 303 GTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPL 347
           GTT DG NYW+++NSWG  WG+ G +++ R+    EGLCGI   +SYP+
Sbjct: 296 GTTVDGTNYWIVRNSWGAEWGEQGCIRMKRNVSNKEGLCGIAMEASYPV 344


>gi|40806500|gb|AAR92155.1| putative cysteine protease 2 [Iris x hollandica]
          Length = 359

 Score =  301 bits (771), Expect = 3e-79,   Method: Compositional matrix adjust.
 Identities = 145/315 (46%), Positives = 204/315 (64%), Gaps = 11/315 (3%)

Query: 40  EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDL 99
           E+S+  ++E+W + H  S +D  EK  R  +FKEN ++I + NK+ +  YKLG N+F+D+
Sbjct: 33  EESLWGLYERWRSHHTVS-RDLSEKNKRFNVFKENAKFIHEFNKK-DAPYKLGLNKFADM 90

Query: 100 TNDEFRALYTGYKMPSP-SHRSTTSST--FKYQNLSMTDVPTSLDWRDKGAVTPIKNQKE 156
           TN EFR+ Y G K+    + R T  +T  F Y+N+    +P S+DWR +GAV P+K+Q +
Sbjct: 91  TNQEFRSTYAGSKIHHHRTQRGTPRATGSFMYENVH--SIPASVDWRTQGAVAPVKDQGQ 148

Query: 157 CGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQG 216
           CG CWAF+ +A+VEGI KI++  L+ LS QQL+DC T+ N GC GG  + AF +I  N G
Sbjct: 149 CGSCWAFSTIASVEGINKIKTNQLVPLSGQQLVDCDTDQNEGCNGGLMDYAFEFIKSNGG 208

Query: 217 IATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTE 276
           I +E  YPY A  G+C++        I  YE+VP+ +E AL+KAV+ Q VS+AI A    
Sbjct: 209 ITSESAYPYTAEQGSCASESSAPVVTIDGYEDVPANNEAALMKAVANQVVSVAIEASGMA 268

Query: 277 FQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD--- 333
           FQ Y EG+F G CG +LDH V +VG+G T DG  YW+++NSWG  WG+ GY+++ R    
Sbjct: 269 FQFYSEGVFTGSCGNELDHGVAVVGYGATRDGTKYWIVRNSWGAEWGEKGYIRMQRGIRA 328

Query: 334 -EGLCGIGTRSSYPL 347
             GLCGI    SYPL
Sbjct: 329 RHGLCGIAMEPSYPL 343


>gi|302764466|ref|XP_002965654.1| hypothetical protein SELMODRAFT_230713 [Selaginella moellendorffii]
 gi|300166468|gb|EFJ33074.1| hypothetical protein SELMODRAFT_230713 [Selaginella moellendorffii]
          Length = 345

 Score =  301 bits (771), Expect = 3e-79,   Method: Compositional matrix adjust.
 Identities = 143/307 (46%), Positives = 201/307 (65%), Gaps = 12/307 (3%)

Query: 45  EIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEF 104
           ++++KW+ +HG++Y    E + R +IFKEN+ YI   N   N ++ LG N+F+DLTN EF
Sbjct: 36  QVYQKWIQEHGKAYNSAHEYKKRFQIFKENVNYINSHNARRNNSHSLGLNKFADLTNSEF 95

Query: 105 RALYTG-YKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAF 163
           R LY G  + P+P H     +        + D  TS+DWR KG VT IK+Q +CG CWAF
Sbjct: 96  RGLYVGRLQRPAPFHEVGDIAL-------VADTATSVDWRKKGGVTEIKDQGDCGSCWAF 148

Query: 164 AAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEY 223
           +AVAAVEG+T + +G L+ LSEQ+L+DC T  N GC GG  + AF Y+I+N GI ++  Y
Sbjct: 149 SAVAAVEGLTFLSTGTLVSLSEQELVDCDTTVNQGCDGGIMDYAFQYMIRNGGITSQSNY 208

Query: 224 PYQAVPGTCSAAQ-KPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKE 282
           PY+A+ G C   + K  AA I+ ++ +P   E+ LL+AV+ QPVS+AI A   +FQ Y  
Sbjct: 209 PYRALRGACDKDKVKYHAATINGFQAIPPQSEELLLRAVANQPVSVAIEAGGQDFQLYSS 268

Query: 283 GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD---EGLCGI 339
           G+F G CG+ LDH V IVG+GT   G  YWL+KNSWG+ WG++GY+++ R     G+CGI
Sbjct: 269 GVFTGECGSNLDHGVAIVGYGTDAGGRQYWLVKNSWGSGWGESGYVRMERQGPGAGVCGI 328

Query: 340 GTRSSYP 346
              +SYP
Sbjct: 329 NLDASYP 335


>gi|355344587|gb|AER60490.1| cysteine proteases [Gossypium hirsutum]
          Length = 371

 Score =  301 bits (771), Expect = 3e-79,   Method: Compositional matrix adjust.
 Identities = 148/333 (44%), Positives = 212/333 (63%), Gaps = 9/333 (2%)

Query: 22  ITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKA 81
           I+ L    +   SS    +  V+ +++ W+ QHG++Y    E+E R +IFK+NL +I++ 
Sbjct: 20  ISTLTLNQNHPSSSSWRSDDEVMGLYKSWVIQHGKAYNGIGEEEKRFEIFKDNLRFIDEH 79

Query: 82  NKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSS--TFKYQNLSMTDVPTS 139
           N   N TYKLG N+F+DLTN E+RA + G +   P  R   S   + +Y + +  ++P S
Sbjct: 80  NSNNNTTYKLGLNKFADLTNQEYRAKFLGTRT-DPRRRLMKSKIPSSRYAHRAGDNLPDS 138

Query: 140 LDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGC 199
           +DWRD GAV+P+K+Q  CG CWAF+ +A VEGI KI SG L+ LSEQ+L+DC  + + GC
Sbjct: 139 VDWRDHGAVSPVKDQGSCGSCWAFSTIATVEGINKIVSGELVSLSEQELVDCDRSYDAGC 198

Query: 200 LGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALL 258
            GG  + AF +I+ N GI TE +YPY      C   +K A    I  YE+VP+ +E AL 
Sbjct: 199 NGGLMDYAFQFIMDNGGIDTEKDYPYLGFNNQCDPTKKNAKVVSIDGYEDVPN-NENALK 257

Query: 259 KAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSW 318
           KAV+ QPVSIAI A    FQ Y+ G+FNG CG  LDH V  VG+GT ++G +YW+++NSW
Sbjct: 258 KAVAHQPVSIAIEAGGRAFQLYESGVFNGECGLALDHGVVAVGYGTDDNGQDYWIVRNSW 317

Query: 319 GNTWGDAGYMKIVR----DEGLCGIGTRSSYPL 347
           G+ WG+ GY+++ R    + G CGI   +SYP+
Sbjct: 318 GSNWGENGYIRMERNINANTGKCGIAMEASYPV 350


>gi|1345573|emb|CAA40073.1| endopeptidase (EP-C1) [Phaseolus vulgaris]
          Length = 361

 Score =  301 bits (771), Expect = 3e-79,   Method: Compositional matrix adjust.
 Identities = 154/344 (44%), Positives = 210/344 (61%), Gaps = 14/344 (4%)

Query: 16  TPMFIIITLLVSCASQVVSSRSTH------EQSVVEIHEKWMAQHGRSYKDELEKEMRLK 69
           T   + + L  S    V +S   H      E+S+ +++E+W + H  S +   EK  R  
Sbjct: 2   TKKLLWVVLSFSLVLGVANSFDFHDKDLASEESLWDLYERWRSHHTVS-RSLGEKHKRFN 60

Query: 70  IFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPS-HRSTTSSTFKY 128
           +FK NL ++   NK  ++ YKL  N+F+D+TN EFR+ Y G K+      R T      +
Sbjct: 61  VFKANLMHVHNTNKM-DKPYKLKLNKFADMTNHEFRSTYAGSKVNHHRMFRGTPHENGAF 119

Query: 129 QNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQL 188
               +  VP S+DWR KGAVT +K+Q +CG CWAF+ V AVEGI +I++  L+ LSEQ+L
Sbjct: 120 MYEKVVSVPPSVDWRKKGAVTDVKDQGQCGSCWAFSTVVAVEGINQIKTNKLVALSEQEL 179

Query: 189 LDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQ-KPAAAKISNYE 247
           +DC    N GC GG  E AF +I Q  GI TE  YPY+A  GTC A++    A  I  +E
Sbjct: 180 VDCDKEENQGCNGGLMESAFEFIKQKGGITTESNYPYKAQEGTCDASKVNDLAVSIDGHE 239

Query: 248 EVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTED 307
            VP+ DE ALLKAV+ QPVS+AI A  ++FQ Y EG+F G C T L+H V IVG+GTT D
Sbjct: 240 NVPANDEDALLKAVANQPVSVAIDAGGSDFQFYSEGVFTGDCSTDLNHGVAIVGYGTTVD 299

Query: 308 GANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPL 347
           G NYW+++NSWG  WG+ GY+++ R+    EGLCGI    SYP+
Sbjct: 300 GTNYWIVRNSWGPEWGEHGYIRMQRNISKKEGLCGIAMLPSYPI 343


>gi|388517427|gb|AFK46775.1| unknown [Medicago truncatula]
          Length = 362

 Score =  301 bits (771), Expect = 3e-79,   Method: Compositional matrix adjust.
 Identities = 154/349 (44%), Positives = 216/349 (61%), Gaps = 22/349 (6%)

Query: 15  TTPMFIIITLLVSCASQVVSSRSTHE------QSVVEIHEKWMAQHGRSYKDELEKEMRL 68
           TT   ++I L ++    V  S   H+      +S+ +++E+W + H  S ++  EK+ R 
Sbjct: 2   TTKKLLLIVLSIALVLVVSESFDFHDKDVSSDESLWDLYERWRSHHTVS-RNLNEKQKRF 60

Query: 69  KIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHR-----STTS 123
            +FK N+ ++   NK  ++ YKL  N+F+D+TN EF+  Y G K+    HR        S
Sbjct: 61  NVFKSNVMHVHNTNKM-DKPYKLKLNKFADMTNHEFKTTYAGTKVNH--HRMFRGTPRVS 117

Query: 124 STFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQL 183
            TF Y+N   T  P S+DWR KGAVT +K+Q +CG CWAF+ V AVEGI +I++  L+ L
Sbjct: 118 GTFMYENF--TKAPASVDWRKKGAVTDVKDQGQCGSCWAFSTVVAVEGINQIKTNRLVPL 175

Query: 184 SEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPA-AAK 242
           SEQ+L+DC    N GC GG  E AF YI Q  G+ TE  YPY A  G+C A ++      
Sbjct: 176 SEQELIDCDNQENQGCNGGLMEYAFEYIKQKGGVTTESYYPYTANDGSCDATKENVPTVS 235

Query: 243 ISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGF 302
           I  +E VP+ DE ALLKAV+ QPVS+AI A  ++FQ Y EG+F G CG +L+H V IVG+
Sbjct: 236 IDGHETVPANDEDALLKAVANQPVSVAIDAGGSDFQFYSEGVFTGDCGKELNHGVAIVGY 295

Query: 303 GTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPL 347
           GTT DG NYW+++NSWG  WG+ G +++ R+    EGLCGI   +SYP+
Sbjct: 296 GTTVDGTNYWIVRNSWGAEWGEQGCIRMKRNVSNKEGLCGIAMEASYPV 344


>gi|42567068|ref|NP_567686.2| putative cysteine proteinase [Arabidopsis thaliana]
 gi|332659371|gb|AEE84771.1| putative cysteine proteinase [Arabidopsis thaliana]
          Length = 356

 Score =  301 bits (770), Expect = 3e-79,   Method: Compositional matrix adjust.
 Identities = 156/346 (45%), Positives = 228/346 (65%), Gaps = 20/346 (5%)

Query: 16  TPMFIIITLLVSCASQVVSSRST---HEQSVVE---IHEKWMAQHGRSYKDEL-EKEMRL 68
           T +F++I  ++S  S  +   +T   H +S  E   I + WM++HG++Y + L EKE R 
Sbjct: 10  TILFLLIVFVLSAPSSAMDLPATSGGHNRSNEEVEFIFQMWMSKHGKTYTNALGEKERRF 69

Query: 69  KIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKY 128
           + FK+NL +I++ N + N +Y+LG  +F+DLT  E+R L+ G   P P  R+  +S  +Y
Sbjct: 70  QNFKDNLRFIDQHNAK-NLSYQLGLTRFADLTVQEYRDLFPG--SPKPKQRNLKTSR-RY 125

Query: 129 QNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQL 188
             L+   +P S+DWR +GAV+ IK+Q  C  CWAF+ VAAVEG+ KI +G LI LSEQ+L
Sbjct: 126 VPLAGDQLPESVDWRQEGAVSEIKDQGTCNSCWAFSTVAAVEGLNKIVTGELISLSEQEL 185

Query: 189 LDCSTNGNNGCLG-GSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPA--AAKISN 245
           +DC+   NNGC G G  + AF ++I N G+ +E +YPYQ   G+C+  Q  +     I +
Sbjct: 186 VDCNL-VNNGCYGSGLMDTAFQFLINNNGLDSEKDYPYQGTQGSCNRKQSTSNKVITIDS 244

Query: 246 YEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTT 305
           YE+VP+ DE +L KAV+ QPVS+ +   S EF  Y+  I+NG CGT LDHA+ IVG+G +
Sbjct: 245 YEDVPANDEISLQKAVAHQPVSVGVDKKSQEFMLYRSCIYNGPCGTNLDHALVIVGYG-S 303

Query: 306 EDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPL 347
           E+G +YW+++NSWG TWGDAGY+KI R+    +GLCGI   +SYP+
Sbjct: 304 ENGQDYWIVRNSWGTTWGDAGYIKIARNFEDPKGLCGIAMLASYPI 349


>gi|302143380|emb|CBI21941.3| unnamed protein product [Vitis vinifera]
          Length = 354

 Score =  301 bits (770), Expect = 4e-79,   Method: Compositional matrix adjust.
 Identities = 149/307 (48%), Positives = 202/307 (65%), Gaps = 12/307 (3%)

Query: 47  HEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRA 106
           +E+W+ QHGR YK+  E +    I++ N+ +I   N + N ++ L  NQF+D+TN+E++A
Sbjct: 41  YERWLVQHGRRYKNRDEWQRHFGIYQSNVRFINYINAQ-NFSFTLTDNQFADMTNEEYKA 99

Query: 107 LYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAV 166
           LY G      S ++   S+FK +   +  +P S+DWR  GAVTP++NQ ECG CWAF+ V
Sbjct: 100 LYMGLGTSETSRKN--QSSFKRERSKV--LPISVDWRKMGAVTPVRNQGECGSCWAFSTV 155

Query: 167 AAVEGITKIRSGNLIQLSEQQLLDCSTN-GNNGCLGGSREKAFAYIIQNQGIATEDEYPY 225
           AAVEGI KIR+G L+ LSEQ+LLDC  + GN GC GG    AF +I QN GI T   YPY
Sbjct: 156 AAVEGINKIRTGKLVSLSEQELLDCDIDSGNEGCNGGYMVNAFKFIKQNGGITTARNYPY 215

Query: 226 QAVPGTCSAAQKP-AAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGI 284
               G C+  +      KIS YE VP  +E+ L  AV+ QPVS+AI A   EFQ Y +GI
Sbjct: 216 IGEQGICNKDKAANHVVKISGYETVPPNNEKILQAAVAKQPVSVAIDAGGYEFQLYSKGI 275

Query: 285 FNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR----DEGLCGIG 340
           FNG CG QL+HAVT++G+G  ++G  YWL+KNSWG  WG+AGY +++R    DEG+CGI 
Sbjct: 276 FNGFCGKQLNHAVTVIGYG-EDNGKKYWLVKNSWGTGWGEAGYARMIRDSRDDEGICGIA 334

Query: 341 TRSSYPL 347
             +SYP+
Sbjct: 335 MEASYPI 341


>gi|302796898|ref|XP_002980210.1| hypothetical protein SELMODRAFT_153766 [Selaginella moellendorffii]
 gi|300151826|gb|EFJ18470.1| hypothetical protein SELMODRAFT_153766 [Selaginella moellendorffii]
          Length = 479

 Score =  301 bits (770), Expect = 4e-79,   Method: Compositional matrix adjust.
 Identities = 151/322 (46%), Positives = 204/322 (63%), Gaps = 16/322 (4%)

Query: 38  THEQSVVEIHEKWMAQHGRSYKDEL--------EKEMRLKIFKENLEYIEKANKEGNRTY 89
           + E+ +  + + WM QHG+SY D          EK  R  IFK+NL +I   N E N+ Y
Sbjct: 48  SSEERLQALFDSWMLQHGKSYADNALSGDSQAGEKATRYGIFKDNLRFIHGEN-EKNQGY 106

Query: 90  KLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVT 149
            LG N F+DLTN+EFRA   G +      R T+   F+Y ++ + D+P S+DWR+KGAV 
Sbjct: 107 FLGLNAFADLTNEEFRAQRHGGRFDRSRER-TSHEEFRYGSVQLKDLPDSIDWREKGAVV 165

Query: 150 PIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFA 209
            +K+Q  CG CWAF+AVAA+EG+ K+ +G L+ LSEQ+L+DC    + GC GG  + AF 
Sbjct: 166 GVKDQGSCGSCWAFSAVAAIEGVNKLATGELVSLSEQELVDCDKGEDEGCNGGLMDYAFG 225

Query: 210 YIIQNQGIATEDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVSI 268
           ++I+N G+ TE +YPY+     C  ++  A    I  YE+VP  DE ALLKAV+ QPVS+
Sbjct: 226 FVIKNGGLDTEADYPYKGYGTRCDRSKMNAKVVTIDGYEDVPVNDETALLKAVAHQPVSV 285

Query: 269 AIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYM 328
           AI A  +  Q Y+ GIF G CGT LDH VT VG+G  EDG  YW+IKNSWG+ WG+ GY+
Sbjct: 286 AIDAGGSSMQFYRSGIFTGRCGTDLDHGVTNVGYG-KEDGKAYWIIKNSWGSNWGEKGYV 344

Query: 329 KIVRD----EGLCGIGTRSSYP 346
           K+ R+     GLCGI   +SYP
Sbjct: 345 KMARNTGLAAGLCGINMEASYP 366


>gi|3377948|emb|CAA08860.1| cysteine proteinase precursor, AN8 [Ananas comosus]
          Length = 356

 Score =  301 bits (770), Expect = 4e-79,   Method: Compositional matrix adjust.
 Identities = 150/334 (44%), Positives = 207/334 (61%), Gaps = 11/334 (3%)

Query: 18  MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
           +F+ + L V  AS   +S       +++  E+WM ++GR YKD  EK  R +IFK N+ +
Sbjct: 8   VFLFLFLCVMWASPSAASADEPSDPMMKRFEEWMVEYGRVYKDNDEKMRRFQIFKNNVNH 67

Query: 78  IEKANKEGNRTYKLGTNQFSDLTNDEFRALYTG-YKMPSPSHRSTTSSTFKYQNLSMTDV 136
           IE  N     +Y LG NQF+D+TN+EF A YTG    P    R    S   + ++ ++ V
Sbjct: 68  IETFNSRNKDSYTLGINQFTDMTNNEFVAQYTGGISRPLNIEREPVVS---FDDVDISAV 124

Query: 137 PTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGN 196
           P S+DWRD GAVT +KNQ  CG CWAFAA+A VE I KI+ G L  LSEQQ+LDC+    
Sbjct: 125 PQSIDWRDYGAVTSVKNQNPCGACWAFAAIATVESIYKIKKGILEPLSEQQVLDCAK--G 182

Query: 197 NGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQA 256
            GC GG   +AF +II N+G+A+   YPY+A  GTC     P +A I+ Y  VP  +E +
Sbjct: 183 YGCKGGWEFRAFEFIISNKGVASVAIYPYKAAKGTCKTNGVPNSAYITGYARVPRNNESS 242

Query: 257 LLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKN 316
           ++ AVS QP+++A+ A +   Q Y  G+FNG CGT L+HAVT +G+G   +G  YW++KN
Sbjct: 243 MMYAVSKQPITVAVDANANS-QYYNSGVFNGPCGTSLNHAVTAIGYGQDSNGKKYWIVKN 301

Query: 317 SWGNTWGDAGYMKIVRD----EGLCGIGTRSSYP 346
           SWG  WG+AGY+++ RD     G+CGI   S YP
Sbjct: 302 SWGARWGEAGYIRMARDVSSSSGICGIAIDSLYP 335


>gi|255539310|ref|XP_002510720.1| cysteine protease, putative [Ricinus communis]
 gi|223551421|gb|EEF52907.1| cysteine protease, putative [Ricinus communis]
          Length = 349

 Score =  300 bits (769), Expect = 5e-79,   Method: Compositional matrix adjust.
 Identities = 149/314 (47%), Positives = 211/314 (67%), Gaps = 11/314 (3%)

Query: 38  THEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFS 97
           T    ++++ E W+++ GR Y+   EK  R +IFK+NL +I+  NK+  R Y LG N+F+
Sbjct: 38  TSNDKLIDLFESWISRFGRVYESAEEKLERFEIFKDNLFHIDDTNKK-VRNYWLGLNEFA 96

Query: 98  DLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKEC 157
           DL+++EF+  Y G K P  S R+     F Y++++   +P S+DWR KGAVTP+KNQ  C
Sbjct: 97  DLSHEEFKNKYLGLK-PDLSKRAQCPEEFTYKDVA---IPKSVDWRKKGAVTPVKNQGSC 152

Query: 158 GCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGI 217
           G CWAF+ VAAVEGI +I +GNL  LSEQ+L+DC T  NNGC GG  + AFAYI+ N G+
Sbjct: 153 GSCWAFSTVAAVEGINQIVTGNLTSLSEQELIDCDTTYNNGCNGGLMDYAFAYIVANGGL 212

Query: 218 ATEDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTE 276
             E++YPY    GTC   ++ + A  IS Y +VP   E++LLKA++ QP+SIAI A   +
Sbjct: 213 HKEEDYPYIMEEGTCDMRKEESDAVTISGYHDVPQNSEESLLKALANQPLSIAIEASGRD 272

Query: 277 FQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD--- 333
           FQ Y  G+F+G CGT+LDH V  VG+GT++ G +Y ++KNSWG  WG+ GY+++ R    
Sbjct: 273 FQFYSGGVFDGHCGTELDHGVAAVGYGTSK-GLDYIIVKNSWGPKWGEKGYIRMKRKTSK 331

Query: 334 -EGLCGIGTRSSYP 346
            EG+CGI   +SYP
Sbjct: 332 PEGICGIYKMASYP 345


>gi|359483514|ref|XP_003632971.1| PREDICTED: LOW QUALITY PROTEIN: oryzain beta chain-like [Vitis
           vinifera]
          Length = 340

 Score =  300 bits (769), Expect = 5e-79,   Method: Compositional matrix adjust.
 Identities = 148/340 (43%), Positives = 222/340 (65%), Gaps = 11/340 (3%)

Query: 18  MFIIITLLVSCASQVVS---SRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKEN 74
           +F+ +TL +       S   SR  HE S+ E HE+WMA++ R+YKD+ E+E R  +FK+N
Sbjct: 3   LFVCMTLHIYYLEHRASEATSRPLHEASMYERHEQWMARYSRNYKDDAEEERRFXMFKDN 62

Query: 75  LEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMT 134
           +++I+  +  GN   KLG N  +D+T++EFRA    +K+P      + +++F++QN+  T
Sbjct: 63  VDFIQTFDTAGNMPNKLGVNALADMTHEEFRASGNTFKIPPNLGLRSETTSFRHQNV--T 120

Query: 135 DVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN 194
            +P+++DWR K  VT IKNQ +CG CWAF+AVAA+EGI K+++   I LSEQ+L+DC   
Sbjct: 121 RIPSTMDWRKKRTVTHIKNQLQCGGCWAFSAVAAMEGIAKLQTSKSISLSEQELVDCDIF 180

Query: 195 GNN-GCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSG 252
           G+N GC GG  + AF +IIQN+G+ +E  Y Y+ V G C+  ++ + AA+I++YE +P  
Sbjct: 181 GSNIGCEGGCMDDAFKFIIQNRGLNSEARYLYKGVEGHCNKKKESSRAARINDYENMPEF 240

Query: 253 DEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYW 312
            E+ALLK V+ QP+S+AI A  + FQ Y+ GI     G  LD+ VT  G+G + DG  +W
Sbjct: 241 SEKALLKVVAHQPISVAIDAGGSAFQFYEIGIITXESGNDLDYGVTTDGYGRSADGKKHW 300

Query: 313 LIKNSWGNTWGDAGYMKIVR----DEGLCGIGTRSSYPLA 348
           L+KNSWG  WG+ GY ++ R      GLCG   ++SYP A
Sbjct: 301 LVKNSWGTDWGENGYTRMERGVKATTGLCGFTMQASYPTA 340


>gi|537437|gb|AAC35211.1| cysteine proteinase [Hemerocallis hybrid cultivar]
          Length = 359

 Score =  300 bits (769), Expect = 6e-79,   Method: Compositional matrix adjust.
 Identities = 140/312 (44%), Positives = 202/312 (64%), Gaps = 6/312 (1%)

Query: 40  EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDL 99
           E+S+  ++EKW A H  S +D  + + R  +FKEN+++I + N++ + TYKL  N+F D+
Sbjct: 34  EESLWSLYEKWRAHHAVS-RDLDDTDKRFNVFKENVKFIHEFNQKKDATYKLALNKFGDM 92

Query: 100 TNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGC 159
           TN EFR+ Y G K+             ++      D+PTS+DWR+KGAVT +K+Q +CG 
Sbjct: 93  TNQEFRSTYAGSKIDHHMTLRGVKDAGEFSYEKFHDLPTSVDWREKGAVTGVKDQGQCGS 152

Query: 160 CWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIAT 219
           CWAF+ V AVEGI +I++  L+ LSEQQL+DC T  N+GC GG  + AF +I  N G+++
Sbjct: 153 CWAFSTVVAVEGINQIKTNELVSLSEQQLVDCDTK-NSGCNGGLMDYAFDFIKNNGGLSS 211

Query: 220 EDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQS 279
           ED YPY A   +C +    A   I  Y++VP  +E AL+KAV+ QPVS+AI A    FQ 
Sbjct: 212 EDSYPYLAEQKSCGSEANSAVVTIDGYQDVPRNNEAALMKAVANQPVSVAIEASGYAFQF 271

Query: 280 YKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR----DEG 335
           Y +G+F+G CGT+LDH V  VG+G  +DG  YW++KNSWG  WG++GY+++ R      G
Sbjct: 272 YSQGVFSGHCGTELDHGVAAVGYGVDDDGKKYWIVKNSWGEGWGESGYIRMERGIKDKRG 331

Query: 336 LCGIGTRSSYPL 347
            CGI   +SYP+
Sbjct: 332 KCGIAMEASYPI 343


>gi|255547982|ref|XP_002515048.1| cysteine protease, putative [Ricinus communis]
 gi|223546099|gb|EEF47602.1| cysteine protease, putative [Ricinus communis]
          Length = 359

 Score =  300 bits (769), Expect = 6e-79,   Method: Compositional matrix adjust.
 Identities = 149/315 (47%), Positives = 215/315 (68%), Gaps = 12/315 (3%)

Query: 40  EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDL 99
           E+S+  ++E+W + H  S +   EK  R  +FKENL++I K N++ +R YKL  N+F+D+
Sbjct: 33  EESLWNLYERWRSHHTVS-RSLTEKNQRFNVFKENLKHIHKVNQK-DRPYKLRLNKFADM 90

Query: 100 TNDEFRALYTGYKMPSPS--HRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKEC 157
           TN EF   Y G K+      H S   + F ++N S  ++P+S+DWR +GAVT +K+Q +C
Sbjct: 91  TNHEFLQHYGGSKVSHYRMFHGSRRQTGFAHENTS--NLPSSIDWRKQGAVTGVKDQGKC 148

Query: 158 GCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGI 217
           G CWAF++VAAVEGI KI++G LI LSEQ+L+DC++  N+GC GG  E+AF++I +  G+
Sbjct: 149 GSCWAFSSVAAVEGINKIKTGELISLSEQELVDCNSV-NHGCDGGLMEQAFSFIEKTGGL 207

Query: 218 ATEDEYPYQAVPGTC-SAAQKPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTE 276
            TE+ YPY+A  G C SA        I  YE VP  DE AL++AV+ QPVSIAI A   +
Sbjct: 208 TTENNYPYRAKDGYCDSAKMNTPMVTIDGYEMVPENDEHALMQAVANQPVSIAIDAGGQD 267

Query: 277 FQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR---- 332
           FQ Y EG++ G CGT+L+H V +VG+G T+DG  YW++KNSWG+ WG+ G++++ R    
Sbjct: 268 FQFYSEGVYTGDCGTELNHGVALVGYGATQDGTKYWIVKNSWGSEWGENGFIRMQRENDV 327

Query: 333 DEGLCGIGTRSSYPL 347
           +EGLCGI   +SYP+
Sbjct: 328 EEGLCGITLEASYPI 342


>gi|168063167|ref|XP_001783545.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162664932|gb|EDQ51634.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 461

 Score =  300 bits (768), Expect = 6e-79,   Method: Compositional matrix adjust.
 Identities = 149/317 (47%), Positives = 205/317 (64%), Gaps = 15/317 (4%)

Query: 39  HEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSD 98
           HE  ++E    W  +HG++Y D  +   R  ++K+NL YI  +  E NRTY LG  +F+D
Sbjct: 46  HENLLLEQFAAWAHKHGKAYHDAEQCLHRFAVWKDNLAYIRHS--ETNRTYSLGLTKFAD 103

Query: 99  LTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECG 158
           LTN+EFR +YTG ++   S R+   + F+Y +   ++ P S+DWR  GAVT +K+Q  CG
Sbjct: 104 LTNEEFRRMYTGTRIDR-SRRAKRRTGFRYAD---SEAPESVDWRKNGAVTSVKDQGSCG 159

Query: 159 CCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIA 218
            CWAF+AV +VEGI  IR+G  + LSEQ+L+DC    N GC GG  + AF +IIQN GI 
Sbjct: 160 SCWAFSAVGSVEGINAIRNGEAVSLSEQELVDCDLEYNQGCNGGLMDYAFDFIIQNGGID 219

Query: 219 TEDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEF 277
           TE +YPY+   G C  ++K A    I  YE+VP  DE+AL KAV+ QPVS+AI A   +F
Sbjct: 220 TEKDYPYKGFDGRCDNSKKNAHVVTIDGYEDVPENDEEALKKAVAGQPVSVAIEAGGRDF 279

Query: 278 QSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD---- 333
           Q Y +G+F+G CGT LDH V  VG+G TEDG +YW++KNSWG  WG++GY+++ R+    
Sbjct: 280 QLYAQGVFSGECGTDLDHGVLAVGYG-TEDGVDYWIVKNSWGEYWGESGYLRMKRNMKDS 338

Query: 334 ---EGLCGIGTRSSYPL 347
               GLCGI    SY +
Sbjct: 339 NDGPGLCGINIEPSYAV 355


>gi|146215990|gb|ABQ10197.1| actinidin Act4a [Actinidia eriantha]
          Length = 385

 Score =  300 bits (768), Expect = 6e-79,   Method: Compositional matrix adjust.
 Identities = 148/310 (47%), Positives = 205/310 (66%), Gaps = 10/310 (3%)

Query: 43  VVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTND 102
           V+ + E W+ ++G+SY    EKE R +IFK+NL ++++ N + NR+YK+G NQFSDLT+ 
Sbjct: 44  VIAMFESWLVEYGKSYNALGEKERRFEIFKDNLRFVDEHNADVNRSYKVGLNQFSDLTDA 103

Query: 103 EFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWA 162
           E+ ++Y G K     +   T+ + +Y+      +P S+DWR KGAV  +KNQ  CG CW 
Sbjct: 104 EYSSIYLGTKF----NIRMTNVSDRYEPRVGDQLPDSVDWRKKGAVLGVKNQGNCGSCWT 159

Query: 163 FAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GNNGCLGGSREKAFAYIIQNQGIATED 221
           FA++AAVEGI KI +GNLI LSEQ+++DC     NNGC GG+   A+ +II N GI TE 
Sbjct: 160 FASIAAVEGINKIVTGNLISLSEQEIVDCQRKYPNNGCNGGTLSGAYQFIINNGGINTEA 219

Query: 222 EYPYQAVPGTCSAAQKPAA-AKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSY 280
            YPY    G C   +K      I  YE VPS +E+AL KAV+ QPVS+ IA+ ST F+SY
Sbjct: 220 NYPYTGRDGVCDQNKKNKKYVTIDRYENVPSNNEKALQKAVAFQPVSVVIASNSTAFKSY 279

Query: 281 KEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD---EGLC 337
           K GIFNG CG ++DH VTIVG+G TE G +YW+++NSWG  WG++GY+++ R+    G C
Sbjct: 280 KSGIFNGPCGPRIDHGVTIVGYG-TEGGKDYWIVRNSWGPNWGESGYVRMQRNVGGSGKC 338

Query: 338 GIGTRSSYPL 347
            I     YP+
Sbjct: 339 FIARAPVYPV 348


>gi|1223922|gb|AAA92063.1| cysteinyl endopeptidase [Vigna radiata]
          Length = 362

 Score =  300 bits (768), Expect = 6e-79,   Method: Compositional matrix adjust.
 Identities = 149/316 (47%), Positives = 203/316 (64%), Gaps = 12/316 (3%)

Query: 40  EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDL 99
           E+S+ +++E+W + H  S +   EK  R  +FKEN+ ++   NK  ++ YKL  N+F+D+
Sbjct: 33  EESLWDLYERWRSHHTVS-RSLTEKHKRFNVFKENVMHVHNTNKM-DKPYKLKLNKFADM 90

Query: 100 TNDEFRALYTGYKMPSPSHRSTT---SSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKE 156
           TN EFR+ Y G K+        T   + TF Y+ +    VP S+DWR KGAVT +K+Q +
Sbjct: 91  TNHEFRSTYAGSKVNHHKMFRGTQHGNGTFMYEKVG--SVPASVDWRKKGAVTDVKDQGQ 148

Query: 157 CGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQG 216
           CG CWAF+ V AVEGI +I++  L+ LSEQ+L+DC    N GC GG  E AF +I Q  G
Sbjct: 149 CGSCWAFSTVVAVEGINQIKTDKLVSLSEQELVDCDKEENQGCNGGLMESAFEFIKQKGG 208

Query: 217 IATEDEYPYQAVPGTCSAAQ-KPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYST 275
           I TE  YPY A  GTC A++    A  I  +E VP  DE ALLKAV+ QPVS+AI A  +
Sbjct: 209 ITTESNYPYTAQEGTCDASKVNDLAVSIDGHENVPVNDENALLKAVANQPVSVAIDAGGS 268

Query: 276 EFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD-- 333
           +FQ Y EG+  G C T L+H V IVG+GTT DG NYW+++NSWG  WG+ GY+++ R+  
Sbjct: 269 DFQFYSEGVLTGDCNTDLNHGVAIVGYGTTVDGTNYWIVRNSWGPEWGEQGYIRMQRNIS 328

Query: 334 --EGLCGIGTRSSYPL 347
             EGLCGI   +SYP+
Sbjct: 329 KKEGLCGIAMMASYPI 344


>gi|3451077|emb|CAA20473.1| cysteine proteinase-like protein [Arabidopsis thaliana]
 gi|7269200|emb|CAB79307.1| cysteine proteinase-like protein [Arabidopsis thaliana]
          Length = 355

 Score =  300 bits (768), Expect = 7e-79,   Method: Compositional matrix adjust.
 Identities = 156/345 (45%), Positives = 227/345 (65%), Gaps = 19/345 (5%)

Query: 16  TPMFIIITLLVSCASQVVSSRST---HEQSVVE---IHEKWMAQHGRSYKDEL-EKEMRL 68
           T +F++I  ++S  S  +   +T   H +S  E   I + WM++HG++Y + L EKE R 
Sbjct: 10  TILFLLIVFVLSAPSSAMDLPATSGGHNRSNEEVEFIFQMWMSKHGKTYTNALGEKERRF 69

Query: 69  KIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKY 128
           + FK+NL +I++ N + N +Y+LG  +F+DLT  E+R L+ G   P P  R+  +S  +Y
Sbjct: 70  QNFKDNLRFIDQHNAK-NLSYQLGLTRFADLTVQEYRDLFPG--SPKPKQRNLKTSR-RY 125

Query: 129 QNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQL 188
             L+   +P S+DWR +GAV+ IK+Q  C  CWAF+ VAAVEG+ KI +G LI LSEQ+L
Sbjct: 126 VPLAGDQLPESVDWRQEGAVSEIKDQGTCNSCWAFSTVAAVEGLNKIVTGELISLSEQEL 185

Query: 189 LDCSTNGNNGCLG-GSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQ-KPAAAKISNY 246
           +DC+   NNGC G G  + AF ++I N G+ +E +YPYQ   G+C+  Q       I +Y
Sbjct: 186 VDCNL-VNNGCYGSGLMDTAFQFLINNNGLDSEKDYPYQGTQGSCNRKQVHLLVITIDSY 244

Query: 247 EEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTE 306
           E+VP+ DE +L KAV+ QPVS+ +   S EF  Y+  I+NG CGT LDHA+ IVG+G +E
Sbjct: 245 EDVPANDEISLQKAVAHQPVSVGVDKKSQEFMLYRSCIYNGPCGTNLDHALVIVGYG-SE 303

Query: 307 DGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPL 347
           +G +YW+++NSWG TWGDAGY+KI R+    +GLCGI   +SYP+
Sbjct: 304 NGQDYWIVRNSWGTTWGDAGYIKIARNFEDPKGLCGIAMLASYPI 348


>gi|449525012|ref|XP_004169515.1| PREDICTED: cysteine proteinase RD21a-like [Cucumis sativus]
          Length = 459

 Score =  300 bits (768), Expect = 8e-79,   Method: Compositional matrix adjust.
 Identities = 152/335 (45%), Positives = 219/335 (65%), Gaps = 12/335 (3%)

Query: 18  MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKD-ELEKEMRLKIFKENLE 76
            F+ I L  +  S ++  R+  E  V+ ++++W A+HG+ + +   E E R  IFK+NL+
Sbjct: 14  FFLFIALSAASPSSIIPQRTDDE--VMALYDQWRAKHGKLHNNLGAEPENRFHIFKDNLK 71

Query: 77  YIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDV 136
           +I++ N + N  Y+LG N F+DLTN+E+R+ Y G K  S S R+ TS+  +Y      D+
Sbjct: 72  FIDEINAQ-NLPYRLGLNVFADLTNEEYRSRYLGGKFASGSRRNRTSN--RYLPRLGDDL 128

Query: 137 PTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGN 196
           P S+DWR KGAV P+K+Q  CG CWAF+ VA+VE I +I +G+LI LSEQ+L+DC  + N
Sbjct: 129 PDSIDWRAKGAVAPVKDQGSCGSCWAFSTVASVEAINQIVTGDLIALSEQELVDCDRSYN 188

Query: 197 NGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQ 255
            GC GG  + AF +II+N G+ TE++YPY     +C   +K A    I +YE+VP  +E+
Sbjct: 189 EGCNGGLMDYAFEFIIENGGLDTEEDYPYYGFDSSCIQYKKNAKVVAIDSYEDVPVNNEK 248

Query: 256 ALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIK 315
           AL KAVS Q VS+AI      FQ Y+ GIF G CGT LDH V +VG+G +E G +YW+++
Sbjct: 249 ALQKAVSKQVVSVAIEGGGRSFQLYQSGIFTGRCGTDLDHGVNVVGYG-SEGGVDYWIVR 307

Query: 316 NSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYP 346
           NSWG +WG++GY+K+ R+     GLCGI    SYP
Sbjct: 308 NSWGGSWGESGYVKMQRNIASPTGLCGIAMEPSYP 342


>gi|449460678|ref|XP_004148072.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like [Cucumis
           sativus]
          Length = 317

 Score =  300 bits (767), Expect = 9e-79,   Method: Compositional matrix adjust.
 Identities = 155/316 (49%), Positives = 208/316 (65%), Gaps = 16/316 (5%)

Query: 37  STHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQF 96
           S+    + + ++KWM ++GR YK   E E R  I++ N++YI+  N   N ++ L  N F
Sbjct: 9   SSCSSDIQDRYQKWMDKYGRQYKSREEWERRFTIYQANVQYIDNFNSM-NHSHTLAENNF 67

Query: 97  SDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKE 156
           +DLTN+EF+A Y GYK  S        + F+Y N  M ++PT++DWR +GAVTPIKNQ +
Sbjct: 68  ADLTNEEFKATYLGYKTVS-----IPDTCFRYGN--MVNLPTNVDWRQEGAVTPIKNQGQ 120

Query: 157 CGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS-TNGNNGCLGGSREKAFAYIIQNQ 215
           CG CWAF+AVAAVEGI KI++G LI LSEQ+L+DC  T+GN GC GG   KAF + I+  
Sbjct: 121 CGSCWAFSAVAAVEGINKIKAGKLISLSEQELVDCDVTSGNQGCNGGYMYKAFEF-IKRT 179

Query: 216 GIATEDEYPYQAVPGTCS-AAQKPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYS 274
           G+ TE EYPYQ     C+   +K     IS YE+VP  DE++L  AV+ QPVS+AI A  
Sbjct: 180 GLTTEIEYPYQGAESACNEQKEKYQFVSISGYEKVPVNDEKSLKAAVANQPVSVAIDAEG 239

Query: 275 TEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD- 333
             FQ Y  GIF+G CG QL+H V IVG+G T + A YWL+KNSWG  WG++GY+++ RD 
Sbjct: 240 NNFQFYSGGIFSGNCGNQLNHGVAIVGYGETSNQA-YWLVKNSWGTDWGESGYIRMKRDS 298

Query: 334 ---EGLCGIGTRSSYP 346
              +G CGI   +SYP
Sbjct: 299 TDRQGTCGIAMMASYP 314


>gi|449524070|ref|XP_004169046.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like, partial
           [Cucumis sativus]
          Length = 314

 Score =  299 bits (766), Expect = 1e-78,   Method: Compositional matrix adjust.
 Identities = 155/316 (49%), Positives = 208/316 (65%), Gaps = 16/316 (5%)

Query: 37  STHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQF 96
           S+    + + ++KWM ++GR YK   E E R  I++ N++YI+  N   N ++ L  N F
Sbjct: 9   SSCSSDIQDRYQKWMDKYGRQYKSREEWERRFTIYQANVQYIDNFNSM-NHSHTLAENNF 67

Query: 97  SDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKE 156
           +DLTN+EF+A Y GYK  S        + F+Y N  M ++PT++DWR +GAVTPIKNQ +
Sbjct: 68  ADLTNEEFKATYLGYKTVS-----IPDTCFRYGN--MVNLPTNVDWRQEGAVTPIKNQGQ 120

Query: 157 CGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS-TNGNNGCLGGSREKAFAYIIQNQ 215
           CG CWAF+AVAAVEGI KI++G LI LSEQ+L+DC  T+GN GC GG   KAF + I+  
Sbjct: 121 CGSCWAFSAVAAVEGINKIKAGKLISLSEQELVDCDVTSGNQGCNGGYMYKAFEF-IKRT 179

Query: 216 GIATEDEYPYQAVPGTCS-AAQKPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYS 274
           G+ TE EYPYQ     C+   +K     IS YE+VP  DE++L  AV+ QPVS+AI A  
Sbjct: 180 GLTTEIEYPYQGAESACNEQKEKYQFVSISGYEKVPVNDEKSLKAAVANQPVSVAIDAEG 239

Query: 275 TEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD- 333
             FQ Y  GIF+G CG QL+H V IVG+G T + A YWL+KNSWG  WG++GY+++ RD 
Sbjct: 240 NNFQFYSGGIFSGNCGNQLNHGVAIVGYGETSNQA-YWLVKNSWGTDWGESGYIRMKRDS 298

Query: 334 ---EGLCGIGTRSSYP 346
              +G CGI   +SYP
Sbjct: 299 TDKQGTCGIAMMASYP 314


>gi|146215982|gb|ABQ10193.1| actinidin Act2b [Actinidia eriantha]
          Length = 378

 Score =  299 bits (766), Expect = 1e-78,   Method: Compositional matrix adjust.
 Identities = 155/340 (45%), Positives = 208/340 (61%), Gaps = 11/340 (3%)

Query: 13  INTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFK 72
           I+ + +F    L++S A  + +S       V+ ++E W+ + G+SY    EKEMR +IFK
Sbjct: 8   ISMSLLFFSTLLILSLALDIENSVQRTNDQVMAMYESWLVEQGKSYNSLDEKEMRFEIFK 67

Query: 73  ENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLS 132
           ENL  I+  N + NR+Y LG N+F+DLT++E+R+ Y G KM   +  S      +Y    
Sbjct: 68  ENLRIIDDHNADANRSYSLGLNRFADLTDEEYRSTYLGLKMGPKTDVSN-----EYMPKV 122

Query: 133 MTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS 192
              +P  +DWR  GAV  +KNQ  C  CWAF+AV AVEGI KI +GNLI LSEQ+L+DC 
Sbjct: 123 GEALPDYVDWRTVGAVVGVKNQGLCSSCWAFSAVTAVEGINKIVTGNLISLSEQELVDCG 182

Query: 193 -TNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAA-AKISNYEEVP 250
            T    GC  G    AF +II N GI TED YPY A  G C+ + K      I NY+ VP
Sbjct: 183 RTQRTKGCNRGLMTDAFQFIINNGGINTEDNYPYTAKDGQCNLSLKNQKYVTIDNYKNVP 242

Query: 251 SGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGAN 310
           S +E AL KAV+ QPVS+ + +   +F+ Y  GIF G CGT +DH VTIVG+G TE G +
Sbjct: 243 SNNEMALKKAVAYQPVSVGVESEGGKFKLYTSGIFTGFCGTAVDHGVTIVGYG-TERGMD 301

Query: 311 YWLIKNSWGNTWGDAGYMKIVRD---EGLCGIGTRSSYPL 347
           YW++KNSWG  WG+ GY++I R+    G CGI    SYP+
Sbjct: 302 YWIVKNSWGTNWGENGYIRIQRNIGGAGKCGIARMPSYPV 341


>gi|351721126|ref|NP_001237199.1| cysteine proteinase precursor [Glycine max]
 gi|31559530|dbj|BAC77523.1| cysteine proteinase [Glycine max]
 gi|31559532|dbj|BAC77524.1| cysteine proteinase [Glycine max]
          Length = 362

 Score =  299 bits (766), Expect = 1e-78,   Method: Compositional matrix adjust.
 Identities = 156/347 (44%), Positives = 214/347 (61%), Gaps = 26/347 (7%)

Query: 19  FIIITLLVSCASQVVSSRSTH------EQSVVEIHEKWMAQH--GRSYKDELEKEMRLKI 70
           F+ + L +S    V +S   H      E+S+ +++E+W + H   RS  D   K  R  +
Sbjct: 6   FLWVVLSLSLVLGVANSFDFHDKDLESEESLWDLYERWRSHHTVSRSLGD---KHKRFNV 62

Query: 71  FKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHR-----STTSST 125
           FK N+ ++   NK  ++ YKL  N+F+D+TN EFR+ Y G K+    HR        + T
Sbjct: 63  FKANMMHVHNTNKM-DKPYKLKLNKFADMTNHEFRSTYAGSKVNH--HRMFRDMPRGNGT 119

Query: 126 FKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSE 185
           F Y+ +    VP S+DWR KGAVT +K+Q  CG CWAF+ V AVEGI +I++  L+ LSE
Sbjct: 120 FMYEKVG--SVPASVDWRKKGAVTDVKDQGHCGSCWAFSTVVAVEGINQIKTNKLVSLSE 177

Query: 186 QQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQ-KPAAAKIS 244
           Q+L+DC T  N GC GG  E AF +I Q  GI TE  YPY A  GTC A++    A  I 
Sbjct: 178 QELVDCDTEENAGCNGGLMESAFQFIKQKGGITTESYYPYTAQDGTCDASKANDLAVSID 237

Query: 245 NYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGT 304
            +E VP  DE ALLKAV+ QPVS+AI A  ++FQ Y EG+F G C T+L+H V IVG+G 
Sbjct: 238 GHENVPGNDENALLKAVANQPVSVAIDAGGSDFQFYSEGVFTGDCSTELNHGVAIVGYGA 297

Query: 305 TEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPL 347
           T DG +YW+++NSWG  WG+ GY+++ R+    EGLCGI   +SYP+
Sbjct: 298 TVDGTSYWIVRNSWGPEWGELGYIRMQRNISKKEGLCGIAMLASYPI 344


>gi|242074728|ref|XP_002447300.1| hypothetical protein SORBIDRAFT_06g032360 [Sorghum bicolor]
 gi|241938483|gb|EES11628.1| hypothetical protein SORBIDRAFT_06g032360 [Sorghum bicolor]
          Length = 471

 Score =  299 bits (766), Expect = 1e-78,   Method: Compositional matrix adjust.
 Identities = 146/317 (46%), Positives = 208/317 (65%), Gaps = 11/317 (3%)

Query: 40  EQSVVEIHEKWMAQHGRSYKDEL-EKEMRLKIFKENLEYIEKAN-KEGNRTYKLGTNQFS 97
           E  V  ++E+WMA+HG++  + L E + R + F +NL +++  N + G R Y+LG N+F+
Sbjct: 45  EAQVRAMYEQWMARHGKAASNALGEHDRRFRAFWDNLRFVDAHNARAGARGYRLGINRFA 104

Query: 98  DLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKEC 157
           DLTN EFRA Y      + +  +T ++  +Y++  +  +P  +DWR KGAV P+KNQ +C
Sbjct: 105 DLTNAEFRAAY--LSAGARNGTATAATGERYRHDGVEALPEFVDWRQKGAVAPVKNQGQC 162

Query: 158 GCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-NNGCLGGSREKAFAYIIQNQG 216
           G CWAF+AV AVEGI +I +G L+ LSEQ+L+DCS NG N GC GG  + AFA+I+ N G
Sbjct: 163 GSCWAFSAVGAVEGINQIVTGELVTLSEQELVDCSKNGQNGGCDGGMMDDAFAFIVGNGG 222

Query: 217 IATEDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYST 275
           I T+ +YPY A  G C  A++      I  +E VP  DE++L KAV+ QPV++AI A   
Sbjct: 223 IDTDKDYPYTARDGKCDVAKRSRHVVSIDGFEGVPRNDEKSLQKAVAHQPVAVAIEAGGR 282

Query: 276 EFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGA-NYWLIKNSWGNTWGDAGYMKIVRD- 333
           EFQ Y+ G+F G CGT LDH V  VG+GT  DG  +YWL++NSWG  WG+ GY+++ R+ 
Sbjct: 283 EFQLYQSGVFTGRCGTSLDHGVVAVGYGTEADGGRDYWLVRNSWGADWGEGGYIRMERNV 342

Query: 334 ---EGLCGIGTRSSYPL 347
               G CGI   +SYP+
Sbjct: 343 GARAGKCGIAMEASYPV 359


>gi|147772785|emb|CAN62838.1| hypothetical protein VITISV_003391 [Vitis vinifera]
          Length = 298

 Score =  299 bits (766), Expect = 1e-78,   Method: Compositional matrix adjust.
 Identities = 157/334 (47%), Positives = 210/334 (62%), Gaps = 55/334 (16%)

Query: 21  IITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEK 80
           ++ +L + ASQ  +SRS HE S+ E HE WMA++GR YKD  EKE R KIFK+N+     
Sbjct: 14  LLFILAAWASQA-TSRSLHEASMYERHEDWMARYGRMYKDANEKEKRFKIFKDNV----- 67

Query: 81  ANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSL 140
                                                     ++TFKY+N+  T VP+++
Sbjct: 68  ----------------------------------------AQATTFKYENV--TAVPSTI 85

Query: 141 DWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-NNGC 199
           DWR KGAVTPIK+Q++CG CWAF+AVAA EGIT+I +G LI LSEQ+L+DC T G N GC
Sbjct: 86  DWRKKGAVTPIKDQQQCGSCWAFSAVAATEGITQITTGKLISLSEQELVDCDTGGENQGC 145

Query: 200 LGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKP-AAAKISNYEEVPSGDEQALL 258
            GG  + AF +I  + G+A+E  YPY+   GTC++ ++   AAKI  YE+VP+ +E+AL 
Sbjct: 146 SGGLXDDAFRFIXIH-GLASEATYPYEGDDGTCNSKKEAHPAAKIKGYEDVPANNEKALQ 204

Query: 259 KAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSW 318
           KAV+ QPV++AI A   EFQ Y  G+F G CGT+LDH V  VG+G  +DG  YWL+KNSW
Sbjct: 205 KAVAHQPVAVAIDAGGFEFQFYTSGVFTGQCGTELDHGVAAVGYGIGDDGMXYWLVKNSW 264

Query: 319 GNTWGDAGYMKIVRD----EGLCGIGTRSSYPLA 348
           G  WG+ GY+++ RD    EGLCGI  ++SYP A
Sbjct: 265 GTGWGEEGYIRMQRDVTAKEGLCGIAMQASYPTA 298


>gi|50355615|dbj|BAD29956.1| cysteine protease [Daucus carota]
          Length = 423

 Score =  299 bits (766), Expect = 1e-78,   Method: Compositional matrix adjust.
 Identities = 144/302 (47%), Positives = 207/302 (68%), Gaps = 10/302 (3%)

Query: 51  MAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTG 110
           + +H ++Y     KE R +IFK+NL +I++ NK  N+++KLG N+F+DL+N+E+++++ G
Sbjct: 11  LVKHHKNYNALGAKEKRFEIFKDNLRFIDEHNKGVNQSFKLGLNKFADLSNEEYKSMFLG 70

Query: 111 YKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVE 170
            +M     +   S  FKY      ++P S+DWR+KGAV P+K+Q +CG CWAF+ VAAVE
Sbjct: 71  GRMVR-DRKGFESDRFKYG--VGDELPQSVDWREKGAVAPVKDQGQCGSCWAFSTVAAVE 127

Query: 171 GITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPG 230
           GI +I +G+LI LSEQ+L+DC    N GC GG  + AF +I++N GI TED+YPY+ V G
Sbjct: 128 GINQIATGDLISLSEQELVDCDKGFNQGCNGGFMDYAFEFIVKNGGIDTEDDYPYKGVDG 187

Query: 231 TCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVC 289
            C   +K A    I+ +E+VP  DE++L KAV+ QPVS+AI A    FQ Y+ GIFNG+C
Sbjct: 188 QCDQNRKNAKVVTINGFEDVPQNDEKSLKKAVAHQPVSVAIEAGGRAFQLYESGIFNGLC 247

Query: 290 GTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR-----DEGLCGIGTRSS 344
           GT LDH V  VG+G TEDG +YW+++NSWG  WG+ GY+++ R     + G CGI  + S
Sbjct: 248 GTDLDHGVVAVGYG-TEDGKDYWIVRNSWGPNWGENGYIRLERNVASTNTGKCGIAMQPS 306

Query: 345 YP 346
           YP
Sbjct: 307 YP 308


>gi|297809383|ref|XP_002872575.1| hypothetical protein ARALYDRAFT_911472 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297318412|gb|EFH48834.1| hypothetical protein ARALYDRAFT_911472 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 371

 Score =  299 bits (766), Expect = 1e-78,   Method: Compositional matrix adjust.
 Identities = 152/355 (42%), Positives = 220/355 (61%), Gaps = 26/355 (7%)

Query: 16  TPMFIIITLLVSCAS----QVVSSRSTHEQSVVE-------------IHEKWMAQHGRSY 58
           T + ++  ++ SCA+     VVSS + H  +                I + WM +HG+ Y
Sbjct: 8   TLILLVAMVITSCATAMDMSVVSSNNNHHLTTSPGRLHSGFDAEASLIFDSWMVKHGKVY 67

Query: 59  KDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSH 118
               EKE RL IF++NL +I   N E N +Y+LG  QF+DL+  E+  +  G     P +
Sbjct: 68  GSVAEKERRLTIFEDNLRFISNRNAE-NLSYRLGLTQFADLSLHEYGEVCHGADPRPPRN 126

Query: 119 RSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSG 178
               +S+ +Y+  +   +P S+DWR++GAVT +K+Q  C  CWAF+ V AVEG+ KI +G
Sbjct: 127 HVFMTSSDRYKTSAGDVLPKSVDWRNEGAVTEVKDQGHCRSCWAFSTVGAVEGLNKIVTG 186

Query: 179 NLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKP 238
            L+ LSEQ L++C+   NNGC GG  E A+ +I++N G+ T+++YPY+AV G C    K 
Sbjct: 187 ELVTLSEQDLINCNKE-NNGCGGGKVETAYEFIMKNGGLGTDNDYPYKAVNGVCDGRLKE 245

Query: 239 --AAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHA 296
                 I  +E +P+ DE AL+KAV+ QPV+  I + S EFQ Y+ G+F+G CGT L+H 
Sbjct: 246 NNKNVMIDGFENLPANDEFALMKAVAHQPVTAVIDSSSREFQLYESGVFDGSCGTNLNHG 305

Query: 297 VTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPL 347
           V +VG+G TE+G +YWL+KNS GNTWG+AGYMK+ R+     GLCGI  R+SYPL
Sbjct: 306 VVVVGYG-TENGRDYWLVKNSRGNTWGEAGYMKMARNIANPRGLCGIAMRASYPL 359


>gi|2463588|dbj|BAA22546.1| FB1035 precursor [Ananas comosus]
          Length = 324

 Score =  299 bits (765), Expect = 1e-78,   Method: Compositional matrix adjust.
 Identities = 142/303 (46%), Positives = 198/303 (65%), Gaps = 10/303 (3%)

Query: 48  EKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRAL 107
           E+WMA++GR YKD  EK  R +IFK N+++IE  N     +Y LG NQF+D+T  EF A 
Sbjct: 11  EEWMAEYGRIYKDNDEKMRRFQIFKNNVKHIETFNSRNGNSYTLGINQFTDMTKSEFVAQ 70

Query: 108 YTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVA 167
           YTG  +P    R    S   + +++++ VP S+DWRD GAV  +KNQ  CG CWAFAA+A
Sbjct: 71  YTGVSLPLNIEREPVVS---FDDVNISAVPQSIDWRDYGAVNEVKNQNPCGSCWAFAAIA 127

Query: 168 AVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQA 227
            VEGI KI++G L+ LSEQ++LDC+   + GC GG   KA+ +II N G+ TE+ YPYQA
Sbjct: 128 TVEGIYKIKTGYLVSLSEQEVLDCAV--SYGCKGGWVNKAYDFIISNNGVTTEENYPYQA 185

Query: 228 VPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNG 287
             GTC+A   P +A I+ Y  V   DE++++ AVS QP++  I A S  FQ Y  G+F+G
Sbjct: 186 YQGTCNANSFPNSAYITGYSYVRRNDERSMMYAVSNQPIAALIDA-SENFQYYNGGVFSG 244

Query: 288 VCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR----DEGLCGIGTRS 343
            CGT L+HA+TI+G+G    G  YW+++NSWG++WG+ GY+++ R      G CGI    
Sbjct: 245 PCGTSLNHAITIIGYGQDSSGTKYWIVRNSWGSSWGEGGYVRMARGVSSSSGACGIAMSP 304

Query: 344 SYP 346
            +P
Sbjct: 305 LFP 307


>gi|3980198|emb|CAA46863.1| thiolprotease [Pisum sativum]
          Length = 464

 Score =  299 bits (765), Expect = 1e-78,   Method: Compositional matrix adjust.
 Identities = 148/349 (42%), Positives = 223/349 (63%), Gaps = 19/349 (5%)

Query: 16  TPMFIIITLLVSCASQ--VVSSRSTH--------EQSVVEIHEKWMAQHGRSYKDELEKE 65
           T +FI +T  +S A    ++S   TH           V+ ++E+W+ +HG++Y    EKE
Sbjct: 6   TILFITLTFTLSLALDMCIISYDKTHPDKSTPRTNDQVLTMYEEWLVKHGKNYNALGEKE 65

Query: 66  MRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKM-PSPSHRSTTSS 124
            R +IFK+NL +I++ N + N +++LG N+F+DLTN+E+R  + G ++ P+  +R   S 
Sbjct: 66  KRFEIFKDNLGFIDEHNSK-NLSFRLGLNRFADLTNEEYRTRFLGTRINPNRRNRKVNSQ 124

Query: 125 TFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLS 184
           T +Y       +P S+DWR +GAV  +K+Q  CG CWAF+A+AAVEG+ K+ +G+LI LS
Sbjct: 125 TNRYATRVGDKLPESVDWRKEGAVVGVKDQGSCGSCWAFSAIAAVEGVNKLATGDLISLS 184

Query: 185 EQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPA-AAKI 243
           EQ+L+DC T+ N GC GG  + AF +II    +  E++YPY+A+ G C   +K A    I
Sbjct: 185 EQELVDCDTSYNEGCNGGLMDYAFEFIINMVALTPEEDYPYRAIDGRCDQNRKNAKVVSI 244

Query: 244 SNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFG 303
             YE+VP+ DE AL KAV+ Q +++A+     EFQ Y  G+F G CGT LDH V  VG+G
Sbjct: 245 DQYEDVPAYDEGALKKAVANQVIAVAVEGGGREFQLYDSGVFTGRCGTALDHGVAAVGYG 304

Query: 304 TTEDGANYWLIKNSWGNTWGDAGYMKIVRD-----EGLCGIGTRSSYPL 347
            TE+G +YW+++NSWG +WG+AGY+++ R+      G CGI    SYP+
Sbjct: 305 -TENGKDYWIVRNSWGGSWGEAGYIRLERNLATSKSGKCGIAIEPSYPI 352


>gi|302759380|ref|XP_002963113.1| hypothetical protein SELMODRAFT_270344 [Selaginella moellendorffii]
 gi|300169974|gb|EFJ36576.1| hypothetical protein SELMODRAFT_270344 [Selaginella moellendorffii]
          Length = 479

 Score =  299 bits (765), Expect = 2e-78,   Method: Compositional matrix adjust.
 Identities = 150/322 (46%), Positives = 204/322 (63%), Gaps = 16/322 (4%)

Query: 38  THEQSVVEIHEKWMAQHGRSYKDEL--------EKEMRLKIFKENLEYIEKANKEGNRTY 89
           + E+ +  + + WM QHG+SY +          EK  R  IFK+NL +I   N E N+ Y
Sbjct: 48  SSEERLQALFDSWMLQHGKSYAENALSGDSQAGEKATRYGIFKDNLRFIHGEN-EKNQGY 106

Query: 90  KLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVT 149
            LG N F+DLTN+EFRA   G +      R T+   F+Y ++ + D+P S+DWR+KGAV 
Sbjct: 107 FLGLNAFADLTNEEFRAQRHGGRFDRSRER-TSYEEFRYGSVQLKDLPDSIDWREKGAVV 165

Query: 150 PIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFA 209
            +K+Q  CG CWAF+AVAA+EG+ K+ +G L+ LSEQ+L+DC    + GC GG  + AF 
Sbjct: 166 GVKDQGSCGSCWAFSAVAAIEGVNKLATGELVSLSEQELVDCDKGEDEGCNGGLMDYAFG 225

Query: 210 YIIQNQGIATEDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVSI 268
           ++I+N G+ TE +YPY+     C  ++  A    I  YE+VP  DE ALLKAV+ QPVS+
Sbjct: 226 FVIKNGGLDTEADYPYKGYGTRCDRSKMNAKVVTIDGYEDVPVNDETALLKAVAHQPVSV 285

Query: 269 AIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYM 328
           AI A  +  Q Y+ GIF G CGT LDH VT VG+G  EDG  YW+IKNSWG+ WG+ GY+
Sbjct: 286 AIDAGGSSMQFYRSGIFTGRCGTDLDHGVTNVGYG-KEDGKAYWIIKNSWGSNWGEKGYI 344

Query: 329 KIVRD----EGLCGIGTRSSYP 346
           K+ R+     GLCGI   +SYP
Sbjct: 345 KMARNTGLAAGLCGINMEASYP 366


>gi|18423124|ref|NP_568722.1| putative cysteine proteinase [Arabidopsis thaliana]
 gi|75309064|sp|Q9FGR9.1|CEP1_ARATH RecName: Full=KDEL-tailed cysteine endopeptidase CEP1; AltName:
           Full=Cysteine proteinase CP56; Short=AtCP56; Flags:
           Precursor
 gi|9759028|dbj|BAB09397.1| cysteine endopeptidase [Arabidopsis thaliana]
 gi|20258850|gb|AAM13907.1| putative cysteine proteinase [Arabidopsis thaliana]
 gi|308097832|gb|ADO14465.1| papain [Arabidopsis thaliana]
 gi|332008536|gb|AED95919.1| putative cysteine proteinase [Arabidopsis thaliana]
          Length = 361

 Score =  299 bits (765), Expect = 2e-78,   Method: Compositional matrix adjust.
 Identities = 151/345 (43%), Positives = 216/345 (62%), Gaps = 22/345 (6%)

Query: 19  FIIITLLVSCASQVVSSRSTH------EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFK 72
           FI++ L +    +       H      E S+ E++E+W + H  +   E EK  R  +FK
Sbjct: 4   FIVLALCMLMVLETTKGLDFHNKDVESENSLWELYERWRSHHTVARSLE-EKAKRFNVFK 62

Query: 73  ENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTG-----YKMPSPSHRSTTSSTFK 127
            N+++I + NK+ +++YKL  N+F D+T++EFR  Y G     ++M     ++T S  F 
Sbjct: 63  HNVKHIHETNKK-DKSYKLKLNKFGDMTSEEFRRTYAGSNIKHHRMFQGEKKATKS--FM 119

Query: 128 YQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQ 187
           Y N++   +PTS+DWR  GAVTP+KNQ +CG CWAF+ V AVEGI +IR+  L  LSEQ+
Sbjct: 120 YANVNT--LPTSVDWRKNGAVTPVKNQGQCGSCWAFSTVVAVEGINQIRTKKLTSLSEQE 177

Query: 188 LLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPA-AAKISNY 246
           L+DC TN N GC GG  + AF +I +  G+ +E  YPY+A   TC   ++ A    I  +
Sbjct: 178 LVDCDTNQNQGCNGGLMDLAFEFIKEKGGLTSELVYPYKASDETCDTNKENAPVVSIDGH 237

Query: 247 EEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTE 306
           E+VP   E  L+KAV+ QPVS+AI A  ++FQ Y EG+F G CGT+L+H V +VG+GTT 
Sbjct: 238 EDVPKNSEDDLMKAVANQPVSVAIDAGGSDFQFYSEGVFTGRCGTELNHGVAVVGYGTTI 297

Query: 307 DGANYWLIKNSWGNTWGDAGYMKIVR----DEGLCGIGTRSSYPL 347
           DG  YW++KNSWG  WG+ GY+++ R     EGLCGI   +SYPL
Sbjct: 298 DGTKYWIVKNSWGEEWGEKGYIRMQRGIRHKEGLCGIAMEASYPL 342


>gi|222629922|gb|EEE62054.1| hypothetical protein OsJ_16838 [Oryza sativa Japonica Group]
          Length = 336

 Score =  298 bits (764), Expect = 2e-78,   Method: Compositional matrix adjust.
 Identities = 148/298 (49%), Positives = 200/298 (67%), Gaps = 10/298 (3%)

Query: 56  RSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPS 115
           ++Y    EK  R ++FK+NL +I+  NK+   +Y LG N+F+DLT+DEF+A Y G   P 
Sbjct: 38  KAYASFEEKVRRFEVFKDNLNHIDDINKK-VTSYWLGLNEFADLTHDEFKATYLGL-TPP 95

Query: 116 PSHRST---TSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGI 172
           P+  ++   +S  F+Y  +S  +VP  +DWR K AVT +KNQ +CG CWAF+ VAAVEGI
Sbjct: 96  PTRSNSKHYSSEEFRYGKMSNGEVPKEMDWRKKNAVTEVKNQGQCGSCWAFSTVAAVEGI 155

Query: 173 TKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTC 232
             I +GNL  LSEQ+L+DCST+GNNGC GG  + AF+YI    G+ TE+ YPY    G C
Sbjct: 156 NAIVTGNLTSLSEQELIDCSTDGNNGCNGGLMDYAFSYIASTGGLRTEEAYPYAMEEGDC 215

Query: 233 SAAQKPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQ 292
              +  A   IS YE+VP+ DEQAL+KA++ QPVS+AI A    FQ Y  G+F+G CG Q
Sbjct: 216 DEGKGAAVVTISGYEDVPANDEQALVKALAHQPVSVAIEASGRHFQFYSGGVFDGPCGEQ 275

Query: 293 LDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR----DEGLCGIGTRSSYP 346
           LDH VT VG+GT++ G +Y ++KNSWG  WG+ GY+++ R     EGLCGI   +SYP
Sbjct: 276 LDHGVTAVGYGTSK-GQDYIIVKNSWGPHWGEKGYIRMKRGTGKGEGLCGINKMASYP 332


>gi|356563155|ref|XP_003549830.1| PREDICTED: vignain-like [Glycine max]
          Length = 361

 Score =  298 bits (764), Expect = 2e-78,   Method: Compositional matrix adjust.
 Identities = 150/318 (47%), Positives = 207/318 (65%), Gaps = 17/318 (5%)

Query: 40  EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDL 99
           E+ + +++E+W + H  S +   EK  R  +FK N+ ++  +NK  ++ YKL  N+F+D+
Sbjct: 33  EEGLWDLYERWRSHHTVS-RSLDEKHNRFNVFKGNVMHVHSSNKM-DKPYKLKLNRFADM 90

Query: 100 TNDEFRALYTGYKMPSPSHR-----STTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQ 154
           TN EFR++Y G K+    HR        + TF YQN+    VP+S+DWR KGAVT +K+Q
Sbjct: 91  TNHEFRSIYAGSKVNH--HRMFRGTPRGNGTFMYQNVDR--VPSSVDWRKKGAVTDVKDQ 146

Query: 155 KECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQN 214
            +CG CWAF+ + AVEGI +I++  L+ LSEQ+L+DC T  N GC GG  E AF +I Q 
Sbjct: 147 GQCGSCWAFSTIVAVEGINQIKTHKLVPLSEQELVDCDTTQNQGCNGGLMESAFEFIKQ- 205

Query: 215 QGIATEDEYPYQAVPGTCSAAQ-KPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAY 273
            GI T   YPY+A  GTC A++    A  I  +E VP  +E ALLKAV+ QPVS+AI A 
Sbjct: 206 YGITTASNYPYEAKDGTCDASKVNEPAVSIDGHENVPVNNEAALLKAVAHQPVSVAIEAG 265

Query: 274 STEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD 333
             +FQ Y EG+F G CGT LDH V IVG+GTT+DG  YW +KNSWG+ WG+ GY+++ R 
Sbjct: 266 GIDFQFYSEGVFTGNCGTALDHGVAIVGYGTTQDGTKYWTVKNSWGSEWGEKGYIRMKRS 325

Query: 334 ----EGLCGIGTRSSYPL 347
               +GLCGI   +SYP+
Sbjct: 326 ISVKKGLCGIAMEASYPI 343


>gi|595986|gb|AAA79915.1| cysteine proteinase, partial [Dianthus caryophyllus]
          Length = 427

 Score =  298 bits (764), Expect = 2e-78,   Method: Compositional matrix adjust.
 Identities = 144/309 (46%), Positives = 208/309 (67%), Gaps = 12/309 (3%)

Query: 48  EKWMAQHGRSYKDELEKEMRLKIFKENLEYIEK----ANKEGNRTYKLGTNQFSDLTNDE 103
           + W+ +H ++Y    EKE R  IF++NLE+I++     N  G   ++LG N+F+DLTNDE
Sbjct: 6   QSWLVKHRKNYNALGEKEKRFAIFRDNLEFIDQHNNNNNGGGGGEFELGLNKFADLTNDE 65

Query: 104 FRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAF 163
           FR +Y G K P    ++ +  + +Y      ++P S+DWR KGAV+ +K+Q +CG CWAF
Sbjct: 66  FRRIYFGVKRP---EKAESVKSDRYAVKEGDELPESVDWRKKGAVSHVKDQGQCGSCWAF 122

Query: 164 AAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEY 223
           +A+ AVEGI KI +G+LI LSEQ+L+DC T+ N+GC GG  + AF +II N GI T+ +Y
Sbjct: 123 SAIGAVEGINKIVTGDLITLSEQELVDCDTSYNSGCDGGLMDYAFRFIINNGGIDTDKDY 182

Query: 224 PYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKE 282
           PY+A  G+C + +K A    I   E+VP+ +E+AL KAV+ QPV +AI A   +FQ YK 
Sbjct: 183 PYKATDGSCDSNRKNAKVVTIDGLEDVPANNEKALQKAVAHQPVRLAIEAGGRDFQLYKS 242

Query: 283 GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCG 338
           G+F G CGT LDH V  VG+GTT+DG +YW+++NSWG+ WG+ GY+++ R+     G CG
Sbjct: 243 GVFTGSCGTSLDHGVVAVGYGTTDDGKDYWIVRNSWGDDWGEDGYIRMERNTESKSGKCG 302

Query: 339 IGTRSSYPL 347
           I    SYP+
Sbjct: 303 IAIEPSYPV 311


>gi|359359066|gb|AEV40973.1| putative oryzain beta chain precursor [Oryza punctata]
          Length = 461

 Score =  298 bits (764), Expect = 2e-78,   Method: Compositional matrix adjust.
 Identities = 142/308 (46%), Positives = 214/308 (69%), Gaps = 13/308 (4%)

Query: 47  HEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRT--YKLGTNQFSDLTNDEF 104
           ++ W+A++GRSY    E+E R ++F +NL++++  N   +    ++LG N+F+DLTNDEF
Sbjct: 49  YDLWLAENGRSYNALGERERRFRVFWDNLKFVDAHNARADEHGGFRLGMNRFADLTNDEF 108

Query: 105 RALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFA 164
           R+ + G K+   S     ++  +Y++  + ++P S+DWR+KGAV P+KNQ +CG CWAF+
Sbjct: 109 RSTFLGAKVVERSR----AAGERYRHDGVEELPESVDWREKGAVAPVKNQGQCGSCWAFS 164

Query: 165 AVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-NNGCLGGSREKAFAYIIQNQGIATEDEY 223
           AV+ VE I ++ +G +I LSEQ+L++CSTNG N+GC GG  + AF +II+N GI TED+Y
Sbjct: 165 AVSTVESINQLVTGEMITLSEQELVECSTNGQNSGCNGGLMDDAFDFIIKNGGIDTEDDY 224

Query: 224 PYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKE 282
           PY+AV G C   ++ A    I  +E+VP  DE++L KAV+ QPVS+AI A   EFQ Y  
Sbjct: 225 PYKAVDGKCDINRENAKVVSIDGFEDVPQNDEKSLQKAVAHQPVSVAIEAGGREFQLYHS 284

Query: 283 GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCG 338
           G+F+G CGT LDH V  VG+G T++G +YW+++NSWG  WG++GY+++ R+     G CG
Sbjct: 285 GVFSGRCGTSLDHGVVAVGYG-TDNGKDYWIVRNSWGPKWGESGYVRMERNINATTGKCG 343

Query: 339 IGTRSSYP 346
           I   +SYP
Sbjct: 344 IAMMASYP 351


>gi|194352754|emb|CAQ00105.1| papain-like cysteine proteinase [Hordeum vulgare subsp. vulgare]
 gi|326513690|dbj|BAJ87864.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326514532|dbj|BAJ96253.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 463

 Score =  298 bits (764), Expect = 2e-78,   Method: Compositional matrix adjust.
 Identities = 147/333 (44%), Positives = 213/333 (63%), Gaps = 12/333 (3%)

Query: 22  ITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKA 81
           ++L  +    +VS     E+ V  ++ +WMA+HG +Y    E+E R + F++NL YI++ 
Sbjct: 18  VSLAAAADMSIVSYGERSEEEVRRMYAEWMAEHGSTYNAIGEEERRFEAFRDNLRYIDQH 77

Query: 82  N---KEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPT 138
           N     G  +++LG N+F+DLTN+E+R+ Y G +      R  ++   +YQ     ++P 
Sbjct: 78  NAAADAGVHSFRLGLNRFADLTNEEYRSTYLGARTKPDRERKLSA---RYQAADNDELPE 134

Query: 139 SLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNG 198
           S+DWR KGAV  +K+Q  CG CWAF+A+AAVEGI +I +G++I LSEQ+L+DC T+ N G
Sbjct: 135 SVDWRKKGAVGAVKDQGGCGSCWAFSAIAAVEGINQIVTGDMIPLSEQELVDCDTSYNQG 194

Query: 199 CLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQAL 257
           C GG  + AF +II N GI +E++YPY+     C A +K A    I  YE+VP   E++L
Sbjct: 195 CNGGLMDYAFEFIINNGGIDSEEDYPYKERDNRCDANKKNAKVVTIDGYEDVPVNSEKSL 254

Query: 258 LKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNS 317
            KAV+ QP+S+AI A    FQ YK GIF G CGT LDH V  VG+G TE+G +YWL++NS
Sbjct: 255 QKAVANQPISVAIEAGGRAFQLYKSGIFTGTCGTALDHGVAAVGYG-TENGKDYWLVRNS 313

Query: 318 WGNTWGDAGYMKIVRD----EGLCGIGTRSSYP 346
           WG+ WG+ GY+++ R+     G CGI    SYP
Sbjct: 314 WGSVWGEDGYIRMERNIKASSGKCGIAVEPSYP 346


>gi|226503129|ref|NP_001149806.1| LOC100283433 precursor [Zea mays]
 gi|195634783|gb|ACG36860.1| xylem cysteine proteinase 2 precursor [Zea mays]
 gi|219884977|gb|ACL52863.1| unknown [Zea mays]
          Length = 377

 Score =  298 bits (763), Expect = 2e-78,   Method: Compositional matrix adjust.
 Identities = 148/317 (46%), Positives = 209/317 (65%), Gaps = 14/317 (4%)

Query: 38  THEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFS 97
           T    +V + E+W+A++ ++Y    EK  R ++FK+NL +I++AN++   +Y LG N F+
Sbjct: 63  TQHDRLVRLFEEWVAKYRKAYGSFEEKLRRFEVFKDNLHHIDEANRKEVTSYWLGLNAFA 122

Query: 98  DLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDV--PTSLDWRDKGAVTPIKNQK 155
           DLT+DEF+A Y G  +P    + T+   F+Y  +       P S+DWR KGAVT +KNQ 
Sbjct: 123 DLTHDEFKATYLGL-LP----KRTSGGRFRYGGVGDGGDEVPASVDWRKKGAVTEVKNQG 177

Query: 156 ECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQ 215
           +CG CWAF+ VAAVEGI +I +GNL  LSEQQL+DCST+GNNGC GG  + AF++I    
Sbjct: 178 QCGSCWAFSTVAAVEGINQIVTGNLTSLSEQQLVDCSTDGNNGCSGGVMDNAFSFIATGA 237

Query: 216 GIATEDEYPYQAVPGTCS--AAQKPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAY 273
           G+ +E+ YPY    G C   A        IS YE+VP+ DEQAL+KA++ QPVS+AI A 
Sbjct: 238 GLRSEEAYPYLMEEGDCDDRARDGEVLVTISGYEDVPANDEQALVKALAHQPVSVAIEAS 297

Query: 274 STEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD 333
              FQ Y  G+F+G CG++LDH V  VG+G+++ G +Y ++KNSWG  WG+ GY+++ R 
Sbjct: 298 GRHFQFYSGGVFDGPCGSELDHGVAAVGYGSSK-GQDYIIVKNSWGTHWGEKGYIRMKRG 356

Query: 334 ----EGLCGIGTRSSYP 346
               EGLCGI   +SYP
Sbjct: 357 TGKPEGLCGINKMASYP 373


>gi|357467173|ref|XP_003603871.1| Cysteine proteinase [Medicago truncatula]
 gi|355492919|gb|AES74122.1| Cysteine proteinase [Medicago truncatula]
 gi|388499154|gb|AFK37643.1| unknown [Medicago truncatula]
          Length = 350

 Score =  298 bits (763), Expect = 2e-78,   Method: Compositional matrix adjust.
 Identities = 139/309 (44%), Positives = 214/309 (69%), Gaps = 10/309 (3%)

Query: 43  VVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTND 102
           ++E+ E WM++HG+ Y+   EK +R ++FK+NL++I+  NK  +  Y LG N+F+DL++ 
Sbjct: 43  LIELFESWMSRHGKIYETIEEKLLRFEVFKDNLKHIDDRNKVVS-NYWLGLNEFADLSHQ 101

Query: 103 EFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWA 162
           EF+  Y G K+     R ++   F Y+++   D+P S+DWR KGAVTP+KNQ +CG CWA
Sbjct: 102 EFKNKYLGLKVDLSQRRESSEEEFTYRDV---DLPKSVDWRKKGAVTPVKNQGQCGSCWA 158

Query: 163 FAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDE 222
           F+ VAAVEGI +I +GNL  LSEQ+L+DC T  NNGC GG  + AF++I++N G+  E++
Sbjct: 159 FSTVAAVEGINQIVTGNLTSLSEQELIDCDTTYNNGCNGGLMDYAFSFIVKNGGLHKEED 218

Query: 223 YPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYK 281
           YPY     TC   ++ +    I+ Y +VP  +EQ+LLKA++ QP+S+AI A   +FQ Y 
Sbjct: 219 YPYIMEESTCEMKKEVSEVVTINGYHDVPQNNEQSLLKALANQPLSVAIEASGRDFQFYS 278

Query: 282 EGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLC 337
            G+F+G CG++LDH V+ VG+GT++ G +Y ++KNSWG  WG+ G++++ R+    EG+C
Sbjct: 279 GGVFDGHCGSELDHGVSAVGYGTSK-GLDYIIVKNSWGAKWGEKGFIRMKRNIGKSEGIC 337

Query: 338 GIGTRSSYP 346
           G+   +SYP
Sbjct: 338 GLYKMASYP 346


>gi|445927|prf||1910332A Cys endopeptidase
          Length = 362

 Score =  298 bits (763), Expect = 2e-78,   Method: Compositional matrix adjust.
 Identities = 149/318 (46%), Positives = 204/318 (64%), Gaps = 16/318 (5%)

Query: 40  EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDL 99
           E+S+ +++E+W + H  S +   EK  R  +FK N+ ++   NK  ++ YKL  N+F+D+
Sbjct: 33  EESLWDLYERWRSHHTVS-RSLGEKHKRFNVFKANVMHVHNTNKM-DKPYKLKLNKFADM 90

Query: 100 TNDEFRALYTG-----YKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQ 154
           TN EFR+ Y G     +KM   S     S TF Y+ +    VP S+DWR KGAVT +K+Q
Sbjct: 91  TNHEFRSTYAGSKVNHHKMFRGSQHG--SGTFMYEKVG--SVPASVDWRKKGAVTDVKDQ 146

Query: 155 KECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQN 214
            +CG CWAF+ + AVEGI +I++  L+ LSEQ+L+DC    N GC GG  E AF +I Q 
Sbjct: 147 GQCGSCWAFSTIVAVEGINQIKTNKLVSLSEQELVDCDKEENQGCNGGLMESAFEFIKQK 206

Query: 215 QGIATEDEYPYQAVPGTCSAAQ-KPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAY 273
            GI TE  YPY+A  GTC  ++    A  I  +E VP  DE ALLKAV+ QPVS+AI A 
Sbjct: 207 GGITTESNYPYKAQEGTCDESKVNDLAVSIDGHENVPVNDENALLKAVANQPVSVAIDAG 266

Query: 274 STEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD 333
            ++FQ Y EG+F G C T L+H V IVG+GTT DG NYW+++NSWG  WG+ GY+++ R+
Sbjct: 267 GSDFQFYSEGVFTGDCNTDLNHGVAIVGYGTTVDGTNYWIVRNSWGPEWGEQGYIRMQRN 326

Query: 334 ----EGLCGIGTRSSYPL 347
               EGLCGI   +SYP+
Sbjct: 327 ISKKEGLCGIAMMASYPI 344


>gi|18413507|ref|NP_567377.1| putative cysteine proteinase [Arabidopsis thaliana]
 gi|30315953|sp|Q9SUS9.1|CPR4_ARATH RecName: Full=Probable cysteine proteinase At4g11320; Flags:
           Precursor
 gi|5596478|emb|CAB51416.1| drought-inducible cysteine proteinase RD21A precursor-like protein
           [Arabidopsis thaliana]
 gi|7267831|emb|CAB81233.1| drought-inducible cysteine proteinase RD21A precursor-like protein
           [Arabidopsis thaliana]
 gi|14334764|gb|AAK59560.1| putative cysteine proteinase [Arabidopsis thaliana]
 gi|15293257|gb|AAK93739.1| putative cysteine proteinase [Arabidopsis thaliana]
 gi|332657596|gb|AEE82996.1| putative cysteine proteinase [Arabidopsis thaliana]
          Length = 371

 Score =  298 bits (763), Expect = 3e-78,   Method: Compositional matrix adjust.
 Identities = 150/353 (42%), Positives = 220/353 (62%), Gaps = 26/353 (7%)

Query: 18  MFIIITLLVSCAS----QVVSSRSTHE--------QSVVE-----IHEKWMAQHGRSYKD 60
           +F++  ++ SCA+     VVSS   H         Q + +     + E WM +HG+ Y  
Sbjct: 10  IFLLALVIASCATAMDMSVVSSNDNHHVTAGPGRRQGIFDAEATLMFESWMVKHGKVYDS 69

Query: 61  ELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRS 120
             EKE RL IF++NL +I   N E N +Y+LG N+F+DL+  E+  +  G     P +  
Sbjct: 70  VAEKERRLTIFEDNLRFITNRNAE-NLSYRLGLNRFADLSLHEYGEICHGADPRPPRNHV 128

Query: 121 TTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNL 180
             +S+ +Y+      +P S+DWR++GAVT +K+Q  C  CWAF+ V AVEG+ KI +G L
Sbjct: 129 FMTSSNRYKTSDGDVLPKSVDWRNEGAVTEVKDQGLCRSCWAFSTVGAVEGLNKIVTGEL 188

Query: 181 IQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKP-- 238
           + LSEQ L++C+   NNGC GG  E A+ +I+ N G+ T+++YPY+A+ G C    K   
Sbjct: 189 VTLSEQDLINCNKE-NNGCGGGKVETAYEFIMNNGGLGTDNDYPYKALNGVCEGRLKEDN 247

Query: 239 AAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVT 298
               I  YE +P+ DE AL+KAV+ QPV+  + + S EFQ Y+ G+F+G CGT L+H V 
Sbjct: 248 KNVMIDGYENLPANDEAALMKAVAHQPVTAVVDSSSREFQLYESGVFDGTCGTNLNHGVV 307

Query: 299 IVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPL 347
           +VG+G TE+G +YW++KNS G+TWG+AGYMK+ R+     GLCGI  R+SYPL
Sbjct: 308 VVGYG-TENGRDYWIVKNSRGDTWGEAGYMKMARNIANPRGLCGIAMRASYPL 359


>gi|413942348|gb|AFW74997.1| Xylem cysteine proteinase 2 [Zea mays]
          Length = 391

 Score =  298 bits (763), Expect = 3e-78,   Method: Compositional matrix adjust.
 Identities = 148/317 (46%), Positives = 209/317 (65%), Gaps = 14/317 (4%)

Query: 38  THEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFS 97
           T    +V + E+W+A++ ++Y    EK  R ++FK+NL +I++AN++   +Y LG N F+
Sbjct: 77  TQHDRLVRLFEEWVAKYRKAYGSFEEKLRRFEVFKDNLHHIDEANRKEVTSYWLGLNAFA 136

Query: 98  DLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDV--PTSLDWRDKGAVTPIKNQK 155
           DLT+DEF+A Y G  +P    + T+   F+Y  +       P S+DWR KGAVT +KNQ 
Sbjct: 137 DLTHDEFKATYLGL-LP----KRTSGGRFRYGGVGDGGDEVPASVDWRKKGAVTEVKNQG 191

Query: 156 ECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQ 215
           +CG CWAF+ VAAVEGI +I +GNL  LSEQQL+DCST+GNNGC GG  + AF++I    
Sbjct: 192 QCGSCWAFSTVAAVEGINQIVTGNLTSLSEQQLVDCSTDGNNGCSGGVMDNAFSFIATGA 251

Query: 216 GIATEDEYPYQAVPGTCS--AAQKPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAY 273
           G+ +E+ YPY    G C   A        IS YE+VP+ DEQAL+KA++ QPVS+AI A 
Sbjct: 252 GLRSEEAYPYLMEEGDCDDRARDGEVLVTISGYEDVPANDEQALVKALAHQPVSVAIEAS 311

Query: 274 STEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD 333
              FQ Y  G+F+G CG++LDH V  VG+G+++ G +Y ++KNSWG  WG+ GY+++ R 
Sbjct: 312 GRHFQFYSGGVFDGPCGSELDHGVAAVGYGSSK-GQDYIIVKNSWGTHWGEKGYIRMKRG 370

Query: 334 ----EGLCGIGTRSSYP 346
               EGLCGI   +SYP
Sbjct: 371 TGKPEGLCGINKMASYP 387


>gi|359491865|ref|XP_002273243.2| PREDICTED: xylem cysteine proteinase 1-like [Vitis vinifera]
          Length = 351

 Score =  298 bits (762), Expect = 3e-78,   Method: Compositional matrix adjust.
 Identities = 146/314 (46%), Positives = 212/314 (67%), Gaps = 10/314 (3%)

Query: 38  THEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFS 97
           T    + ++ E WM++HG+SY+   EK  R ++F++NL++I++ NK+ + +Y LG N+F+
Sbjct: 39  TSMDKLTDLFESWMSKHGKSYRSFEEKLHRFEVFQDNLKHIDETNKKVS-SYWLGLNEFA 97

Query: 98  DLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKEC 157
           DL+++EF+  Y G K+  P  R +    F Y++++  D+P S+DWR KGAV  +KNQ  C
Sbjct: 98  DLSHEEFKRKYLGLKIELPKRRDSPEE-FSYKDVA--DLPKSVDWRKKGAVAHVKNQGAC 154

Query: 158 GCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGI 217
           G CWAF+ VAAVEGI +I +GNL  LSEQ+L+DC    NNGC GG  + AFA+II N G+
Sbjct: 155 GSCWAFSTVAAVEGINQIVTGNLTALSEQELIDCDKPFNNGCNGGLMDYAFAFIISNGGL 214

Query: 218 ATEDEYPYQAVPGTCSAAQKP-AAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTE 276
             E++YPY    GTC   ++      IS Y +VP  +EQ+ LKA++ QP+S+AI A S  
Sbjct: 215 RKEEDYPYVMEEGTCGEKKEELEVVTISGYHDVPEDNEQSFLKALANQPLSVAIEASSRG 274

Query: 277 FQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD--- 333
           FQ Y  GIFNG CGT+LDH V  VG+GT++ G +Y  +KNSWG+ WG+ GY+++ R+   
Sbjct: 275 FQFYSGGIFNGHCGTELDHGVAAVGYGTSK-GVDYITVKNSWGSKWGEKGYIRMKRNVGK 333

Query: 334 -EGLCGIGTRSSYP 346
            EG+CGI   +SYP
Sbjct: 334 PEGICGIYKMASYP 347


>gi|297809385|ref|XP_002872576.1| hypothetical protein ARALYDRAFT_489965 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297318413|gb|EFH48835.1| hypothetical protein ARALYDRAFT_489965 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 371

 Score =  298 bits (762), Expect = 3e-78,   Method: Compositional matrix adjust.
 Identities = 151/357 (42%), Positives = 223/357 (62%), Gaps = 28/357 (7%)

Query: 16  TPMFIIITLLV--SCAS----QVVSSRSTHE--------QSVVE-----IHEKWMAQHGR 56
           + M +++  +V  SCA+     +VSS   H         Q V +     + E WM +HG+
Sbjct: 6   SAMLVLLLAMVISSCATAMDMSIVSSNDNHHVTNGPGRRQGVFDAEATLMFESWMVKHGK 65

Query: 57  SYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSP 116
            Y+   EKE RL IF++NL +I   N E N +Y+LG N+F+DL+  E+  +  G     P
Sbjct: 66  VYESVAEKERRLTIFEDNLRFITNRNAE-NLSYRLGLNRFADLSLHEYAQICHGADPRPP 124

Query: 117 SHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIR 176
            +    +S+ +Y+      +P S+DWR++GAVT +K+Q +C  CWAF+ V AVEG+ KI 
Sbjct: 125 RNHVFMTSSNRYKTSDGDVLPKSVDWRNEGAVTEVKDQGQCRSCWAFSTVGAVEGLNKIV 184

Query: 177 SGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQ 236
           +G L+ LSEQ L++C+   NNGC GG  E A+ +I+ N G+ T+++YPY+A+ G C+   
Sbjct: 185 TGELVTLSEQDLINCNKE-NNGCGGGKVETAYEFIMNNGGLGTDNDYPYKALNGVCNDRL 243

Query: 237 KP--AAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLD 294
           K       I  YE +P+ DE AL+KAV+ QPV+  + + S EFQ Y  G+F+G CGT L+
Sbjct: 244 KENNKNVMIDGYENLPANDESALMKAVAHQPVTAVVDSSSREFQLYASGVFDGTCGTNLN 303

Query: 295 HAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPL 347
           H V +VG+G TE+G +YW+++NS GNTWG+AGYMK+ R+     GLCGI  R+SYPL
Sbjct: 304 HGVVVVGYG-TENGRDYWIVRNSRGNTWGEAGYMKMARNIANPRGLCGIAMRASYPL 359


>gi|242092704|ref|XP_002436842.1| hypothetical protein SORBIDRAFT_10g009850 [Sorghum bicolor]
 gi|241915065|gb|EER88209.1| hypothetical protein SORBIDRAFT_10g009850 [Sorghum bicolor]
          Length = 296

 Score =  298 bits (762), Expect = 3e-78,   Method: Compositional matrix adjust.
 Identities = 145/311 (46%), Positives = 197/311 (63%), Gaps = 24/311 (7%)

Query: 43  VVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTND 102
           +V  HE+WM Q+ R YKD  EK  R ++FK N+++IE  N  GNR + LG NQF+DLTND
Sbjct: 1   MVARHEQWMVQYSRVYKDATEKAQRFEVFKSNVKFIESFNAGGNRKFWLGVNQFADLTND 60

Query: 103 EFRALYT--GYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCC 160
           EFRA  T  G+K PSP    T    F+Y+N+S+  +P ++DWR KGAVTPIK+Q +C   
Sbjct: 61  EFRATKTNKGFK-PSPVKVPTG---FRYENISVDALPATIDWRTKGAVTPIKDQGQC--- 113

Query: 161 WAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-NNGCLGGSREKAFAYIIQNQGIAT 219
                    EGI KI +G LI LSEQ+L+DC  +G + GC GG  + AF +II+  G+ T
Sbjct: 114 ---------EGIVKISTGKLISLSEQELVDCDVHGEDQGCEGGLMDDAFKFIIKKGGLTT 164

Query: 220 EDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQS 279
           E  YPY A  G C +    + A +  +E+VP+ DE +L+KAV+ QPVS+A+      FQ 
Sbjct: 165 ESSYPYTAADGKCKSGSN-SVATVKGFEDVPANDEASLMKAVANQPVSVAVDGGDMTFQF 223

Query: 280 YKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EG 335
           Y  G+  G CGT LDH +  +G+G T DG  YWL+KNSWG TWG+ GY+++ +D     G
Sbjct: 224 YSGGVMTGSCGTDLDHGIAAIGYGQTSDGTKYWLLKNSWGTTWGENGYLRMEKDISDKRG 283

Query: 336 LCGIGTRSSYP 346
           +CG+    SYP
Sbjct: 284 MCGLAMEPSYP 294


>gi|146215980|gb|ABQ10192.1| actinidin Act2a [Actinidia deliciosa]
          Length = 378

 Score =  298 bits (762), Expect = 3e-78,   Method: Compositional matrix adjust.
 Identities = 154/340 (45%), Positives = 210/340 (61%), Gaps = 11/340 (3%)

Query: 13  INTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFK 72
           I+ + +F    L++S A  + +S       V+ ++E W+ +HG+SY    EKEMR +IFK
Sbjct: 8   ISKSLLFFSTLLILSSAIDIENSVQRTNDQVMAMYESWLVEHGKSYNSLDEKEMRFEIFK 67

Query: 73  ENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLS 132
           ENL  I+  N + NR+Y LG N+F+DLT++E+R+ Y G K         T  + +Y    
Sbjct: 68  ENLRIIDDHNADANRSYSLGLNRFADLTDEEYRSTYLGLK-----RGPKTDVSNQYMPKV 122

Query: 133 MTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS 192
              +P  +DWR  GAV  +KNQ  C  CWAF+AVAAVEGI KI +GNLI LSEQ+L+DC 
Sbjct: 123 GDALPDYVDWRTVGAVVGVKNQGLCSSCWAFSAVAAVEGINKIVTGNLISLSEQELVDCG 182

Query: 193 -TNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAA-AKISNYEEVP 250
            T    GC  G    AF +II N GI TE+ YPY A  G C+ + K      I +Y+ VP
Sbjct: 183 RTQITKGCNRGLMTDAFKFIINNGGINTENNYPYTAKDGQCNLSLKNQKYVTIDSYKNVP 242

Query: 251 SGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGAN 310
           S +E AL KAV+ QPVS+ + +   +F+ Y  GIF G CGT +DH VTIVG+G TE G +
Sbjct: 243 SNNEMALKKAVAYQPVSVGVESEGGKFKLYTSGIFTGSCGTAVDHGVTIVGYG-TERGMD 301

Query: 311 YWLIKNSWGNTWGDAGYMKIVRD---EGLCGIGTRSSYPL 347
           YW++KNSWG  WG++GY++I R+    G CGI    SYP+
Sbjct: 302 YWIVKNSWGTNWGESGYIRIQRNIGGAGKCGIAKMPSYPV 341


>gi|18402225|ref|NP_566633.1| Granulin repeat cysteine protease family protein [Arabidopsis
           thaliana]
 gi|11994461|dbj|BAB02463.1| cysteine proteinase [Arabidopsis thaliana]
 gi|17065298|gb|AAL32803.1| cysteine proteinase [Arabidopsis thaliana]
 gi|20260004|gb|AAM13349.1| cysteine proteinase [Arabidopsis thaliana]
 gi|332642713|gb|AEE76234.1| Granulin repeat cysteine protease family protein [Arabidopsis
           thaliana]
          Length = 452

 Score =  298 bits (762), Expect = 3e-78,   Method: Compositional matrix adjust.
 Identities = 150/345 (43%), Positives = 219/345 (63%), Gaps = 12/345 (3%)

Query: 10  SFKINTTPMFIIITLLVSCASQVVSSRST--HEQSVVEIHEKWMAQHGRSYKDELEKEMR 67
           S K  T  + I   LL+S +   V++  T  +E     ++E+W+ ++ ++Y    EKE R
Sbjct: 4   SIKSITLALLIFSVLLISLSLGSVTATETTRNEAEARRMYERWLVENRKNYNGLGEKERR 63

Query: 68  LKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFK 127
            +IFK+NL+++E+ +   NRTY++G  +F+DLTNDEFRA+Y   KM             K
Sbjct: 64  FEIFKDNLKFVEEHSSIPNRTYEVGLTRFADLTNDEFRAIYLRSKM---ERTRVPVKGEK 120

Query: 128 YQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQ 187
           Y       +P ++DWR KGAV P+K+Q  CG CWAF+A+ AVEGI +I++G LI LSEQ+
Sbjct: 121 YLYKVGDSLPDAIDWRAKGAVNPVKDQGSCGSCWAFSAIGAVEGINQIKTGELISLSEQE 180

Query: 188 LLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVP-GTCSAAQKPA-AAKISN 245
           L+DC T+ N+GC GG  + AF +II+N GI TE++YPY A     C++ +K      I  
Sbjct: 181 LVDCDTSYNDGCGGGLMDYAFKFIIENGGIDTEEDYPYIATDVNVCNSDKKNTRVVTIDG 240

Query: 246 YEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTT 305
           YE+VP  DE++L KA++ QP+S+AI A    FQ Y  G+F G CGT LDH V  VG+G +
Sbjct: 241 YEDVPQNDEKSLKKALANQPISVAIEAGGRAFQLYTSGVFTGTCGTSLDHGVVAVGYG-S 299

Query: 306 EDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYP 346
           E G +YW+++NSWG+ WG++GY K+ R+     G CG+   +SYP
Sbjct: 300 EGGQDYWIVRNSWGSNWGESGYFKLERNIKESSGKCGVAMMASYP 344


>gi|449438381|ref|XP_004136967.1| PREDICTED: cysteine proteinase RD21a-like [Cucumis sativus]
          Length = 479

 Score =  298 bits (762), Expect = 3e-78,   Method: Compositional matrix adjust.
 Identities = 144/313 (46%), Positives = 207/313 (66%), Gaps = 9/313 (2%)

Query: 40  EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDL 99
           ++ V  ++E W+  HG++Y    EKE R +IFK+NL +I++ N+E +RTYK+G  +F+DL
Sbjct: 55  DEEVAALYESWLVHHGKAYNAIGEKERRFEIFKDNLRFIDEHNRE-SRTYKVGLTRFADL 113

Query: 100 TNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGC 159
           TN+E+RA + G +  S   R + + + +Y      D+P  +DWR KGAV  +K+Q +CG 
Sbjct: 114 TNEEYRARFLGGRF-SRKPRLSAAKSGRYAAALGDDLPDDVDWRKKGAVATVKDQGQCGS 172

Query: 160 CWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIAT 219
           CWAF++VAAVEGI +I +G LI LSEQ+L+DC  + N GC GG  + AF +II N GI T
Sbjct: 173 CWAFSSVAAVEGINQIVTGELIPLSEQELVDCDKSFNMGCNGGLMDYAFQFIIGNGGIDT 232

Query: 220 EDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQ 278
           E++YPY+     C   +K A    I  YE+VP  DE +L KAV+ QPVS+AI A    FQ
Sbjct: 233 EEDYPYKGRDAACDPNRKNAKVVTIDGYEDVPENDESSLKKAVANQPVSVAIEAGGRAFQ 292

Query: 279 SYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----- 333
            Y+ G+F G CGT LDH V  VG+G T++G +YW+++NSWG  WG++GY+++ R+     
Sbjct: 293 LYQSGVFTGRCGTDLDHGVVAVGYG-TDNGTDYWIVRNSWGKDWGESGYIRLERNVANIT 351

Query: 334 EGLCGIGTRSSYP 346
            G CGI  + SYP
Sbjct: 352 TGKCGIAVQPSYP 364


>gi|118158|sp|P12412.1|CYSEP_VIGMU RecName: Full=Vignain; AltName: Full=Bean endopeptidase; AltName:
           Full=Cysteine proteinase; AltName:
           Full=Sulfhydryl-endopeptidase; Short=SH-EP; Contains:
           RecName: Full=Vignain-1; Contains: RecName:
           Full=Vignain-2; Flags: Precursor
 gi|22062|emb|CAA33753.1| sulfhydryl-pre-endopeptidase (AA -20 to 342) [Vigna mungo]
 gi|22066|emb|CAA36181.1| sulfhydryl-endopeptidase [Vigna mungo]
          Length = 362

 Score =  297 bits (761), Expect = 4e-78,   Method: Compositional matrix adjust.
 Identities = 149/318 (46%), Positives = 203/318 (63%), Gaps = 16/318 (5%)

Query: 40  EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDL 99
           E+S+ +++E+W + H  S +   EK  R  +FK N+ ++   NK  ++ YKL  N+F+D+
Sbjct: 33  EESLWDLYERWRSHHTVS-RSLGEKHKRFNVFKANVMHVHNTNKM-DKPYKLKLNKFADM 90

Query: 100 TNDEFRALYTG-----YKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQ 154
           TN EFR+ Y G     +KM   S     S TF Y+ +    VP S+DWR KGAVT +K+Q
Sbjct: 91  TNHEFRSTYAGSKVNHHKMFRGSQHG--SGTFMYEKVG--SVPASVDWRKKGAVTDVKDQ 146

Query: 155 KECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQN 214
            +CG CWAF+ + AVEGI +I++  L+ LSEQ+L+DC    N GC GG  E AF +I Q 
Sbjct: 147 GQCGSCWAFSTIVAVEGINQIKTNKLVSLSEQELVDCDKEENQGCNGGLMESAFEFIKQK 206

Query: 215 QGIATEDEYPYQAVPGTCSAAQ-KPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAY 273
            GI TE  YPY A  GTC  ++    A  I  +E VP  DE ALLKAV+ QPVS+AI A 
Sbjct: 207 GGITTESNYPYTAQEGTCDESKVNDLAVSIDGHENVPVNDENALLKAVANQPVSVAIDAG 266

Query: 274 STEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD 333
            ++FQ Y EG+F G C T L+H V IVG+GTT DG NYW+++NSWG  WG+ GY+++ R+
Sbjct: 267 GSDFQFYSEGVFTGDCNTDLNHGVAIVGYGTTVDGTNYWIVRNSWGPEWGEQGYIRMQRN 326

Query: 334 ----EGLCGIGTRSSYPL 347
               EGLCGI   +SYP+
Sbjct: 327 ISKKEGLCGIAMMASYPI 344


>gi|358343350|ref|XP_003635767.1| Cysteine proteinase [Medicago truncatula]
 gi|355501702|gb|AES82905.1| Cysteine proteinase [Medicago truncatula]
          Length = 338

 Score =  297 bits (761), Expect = 4e-78,   Method: Compositional matrix adjust.
 Identities = 154/345 (44%), Positives = 222/345 (64%), Gaps = 17/345 (4%)

Query: 11  FKINTTPMFIIITL--LVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRL 68
            K   T   +I+ L  + S   ++ +  ST+   + + +E W+ ++GR Y+D  E E+R 
Sbjct: 1   MKTTITLSIVILNLWIIASACPEIHTKNSTNPAVMKKRYETWLKRYGRHYRDREEWEVRF 60

Query: 69  KIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKY 128
            I++ N++YIE  N + N +YKL  N+F+D+TN+EF++ Y GY +P    R    + F+Y
Sbjct: 61  DIYQSNVQYIEFYNSQ-NYSYKLIDNRFADITNEEFKSTYLGY-LP----RFRVQTEFRY 114

Query: 129 QNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQL 188
                 ++P S+DWR KGAVT +K+Q  CG CWAF+AVAAVEGI KI++ NL+ LSEQQL
Sbjct: 115 H--KHGELPKSIDWRKKGAVTHVKDQGRCGSCWAFSAVAAVEGINKIKTENLVSLSEQQL 172

Query: 189 LDCS-TNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQ-KPAAAKISNY 246
           +DC   +GN GC GG    AF YI ++ GIAT  EYPY+   G C+ ++ K  A  IS Y
Sbjct: 173 IDCDIKSGNEGCEGGDMYIAFNYIKKHGGIATAKEYPYKGRDGNCNKSKAKNNAVTISGY 232

Query: 247 EEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTE 306
           E VP+ +E+ L  AV+ QPVSIA  A    FQ Y +GIF+G CG  L+H +TIVG+G  E
Sbjct: 233 ESVPARNEKMLKAAVAHQPVSIATDAGGYAFQFYSKGIFSGSCGKNLNHGMTIVGYG-EE 291

Query: 307 DGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPL 347
           +G  YW++KNSW N WG++GY+++ RD    +G CGI   ++YP+
Sbjct: 292 NGDKYWIVKNSWANDWGESGYVRMKRDTKDKDGTCGIAMDATYPV 336


>gi|357162587|ref|XP_003579458.1| PREDICTED: oryzain beta chain-like [Brachypodium distachyon]
          Length = 470

 Score =  297 bits (761), Expect = 4e-78,   Method: Compositional matrix adjust.
 Identities = 145/317 (45%), Positives = 206/317 (64%), Gaps = 11/317 (3%)

Query: 40  EQSVVEIHEKWMAQHGRSYKDEL-EKEMRLKIFKENLEYIEKANKE---GNRTYKLGTNQ 95
           E     I+  W A+HG    + L E+E R + F +NL +++  N     G   ++LG N+
Sbjct: 45  EAEARAIYGLWRAEHGSGNSNSLGEEERRFRAFWDNLRFVDAHNARAAAGEEGFRLGMNR 104

Query: 96  FSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQK 155
           F+DLTNDEFRA Y G K       +      +Y++  + ++P ++DWR+KGAV P+KNQ 
Sbjct: 105 FADLTNDEFRAAYLGVKGAGQRRSARAGVGERYRHDGVEELPEAVDWREKGAVAPVKNQG 164

Query: 156 ECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-NNGCLGGSREKAFAYIIQN 214
           +CG CWAF+AV+AVE I ++ +G L+ LSEQ+L++C  NG +NGC GG  + AF +II N
Sbjct: 165 QCGSCWAFSAVSAVESINQLVTGELVTLSEQELVECDINGQSNGCNGGLMDDAFDFIINN 224

Query: 215 QGIATEDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAY 273
            GI TED+YPY+A+ G C   ++ A    I  +E+VP  DE++L KAV+ QPVS+AI A 
Sbjct: 225 GGIDTEDDYPYKALDGKCDINRRNAKVVSIDGFEDVPENDEKSLQKAVAHQPVSVAIEAG 284

Query: 274 STEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD 333
             EFQ Y  G+F G CGT+LDH V  VG+G TE+G +YW+++NSWG  WG+AGY+++ R+
Sbjct: 285 GREFQLYHSGVFTGRCGTELDHGVVAVGYG-TENGKDYWIVRNSWGPKWGEAGYLRMERN 343

Query: 334 ----EGLCGIGTRSSYP 346
                G CGI   SSYP
Sbjct: 344 INATTGKCGIAMMSSYP 360


>gi|351726339|ref|NP_001237379.1| cysteine proteinase precursor [Glycine max]
 gi|31559526|dbj|BAC77521.1| cysteine proteinase [Glycine max]
 gi|31559528|dbj|BAC77522.1| cysteine proteinase [Glycine max]
          Length = 362

 Score =  297 bits (761), Expect = 5e-78,   Method: Compositional matrix adjust.
 Identities = 151/320 (47%), Positives = 203/320 (63%), Gaps = 20/320 (6%)

Query: 40  EQSVVEIHEKWMAQH--GRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFS 97
           E+S  +++E+W + H   RS  D   K  R  +FK N+ ++   NK  ++ YKL  N+F+
Sbjct: 33  EESFWDLYERWRSHHTVSRSLGD---KHKRFNVFKANVMHVHNTNKM-DKPYKLKLNKFA 88

Query: 98  DLTNDEFRALYTGYKMPSPSHR-----STTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIK 152
           D+TN EFR+ Y G K+    HR        + TF Y+ +    VP S+DWR  GAVT +K
Sbjct: 89  DMTNHEFRSTYAGSKVNH--HRMFQGTPRGNGTFMYEKVG--SVPPSVDWRKNGAVTGVK 144

Query: 153 NQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYII 212
           +Q +CG CWAF+ V AVEGI +I++  L+ LSEQ+L+DC T  N GC GG  E AF +I 
Sbjct: 145 DQGQCGSCWAFSTVVAVEGINQIKTNKLVSLSEQELVDCDTKKNAGCNGGLMESAFEFIK 204

Query: 213 QNQGIATEDEYPYQAVPGTCSAAQ-KPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIA 271
           Q  GI TE  YPY A  GTC A++    A  I  +E VP+ DE ALLKAV+ QPVS+AI 
Sbjct: 205 QKGGITTESNYPYTAQDGTCDASKANDLAVSIDGHENVPANDENALLKAVANQPVSVAID 264

Query: 272 AYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIV 331
           A  ++FQ Y EG+F G C T+L+H V IVG+GTT DG NYW ++NSWG  WG+ GY+++ 
Sbjct: 265 AGGSDFQFYSEGVFTGDCSTELNHGVAIVGYGTTVDGTNYWTVRNSWGPEWGEQGYIRMQ 324

Query: 332 RD----EGLCGIGTRSSYPL 347
           R     EGLCGI   +SYP+
Sbjct: 325 RSISKKEGLCGIAMMASYPI 344


>gi|57282619|emb|CAE54307.1| cysteine proteinase [Gossypium hirsutum]
          Length = 372

 Score =  297 bits (761), Expect = 5e-78,   Method: Compositional matrix adjust.
 Identities = 145/315 (46%), Positives = 206/315 (65%), Gaps = 9/315 (2%)

Query: 40  EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDL 99
           +  V+ +++ W+ QHG++Y    E+E R +IFK+NL +I++ N   N TYKLG N+F+DL
Sbjct: 39  DDEVMGLYKSWVIQHGKAYNGIGEEEKRFEIFKDNLRFIDEHNSNNNTTYKLGLNKFADL 98

Query: 100 TNDEFRALYTGYKMPSPSHRSTTSS--TFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKEC 157
           TN E+RA + G +   P  R   S   + +Y + +  ++P S++WRD GAV+ +K+Q  C
Sbjct: 99  TNQEYRAKFLGTRT-DPRRRLMKSKIPSSRYAHRAGDNLPDSVNWRDHGAVSRVKDQGSC 157

Query: 158 GCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGI 217
           G CWAF+A+AAVEGI KI SG LI LSEQ+L+DC  + + GC GG  + AF +II N GI
Sbjct: 158 GSCWAFSAIAAVEGINKIVSGELISLSEQELVDCDRSYDAGCNGGLMDYAFQFIIDNGGI 217

Query: 218 ATEDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTE 276
            TE +YPY      C   +K A    I  YE+VP+ +E AL KAV+ QPVSIAI A    
Sbjct: 218 DTEKDYPYLGFNNQCDPTKKNAKVVSIDGYEDVPN-NENALKKAVAHQPVSIAIEAGGRA 276

Query: 277 FQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR---- 332
           FQ Y+ G+FNG CG  LDH V  VG+G+ ++G +YW+++NSWG  WG+ GY+++ R    
Sbjct: 277 FQLYESGVFNGECGLALDHGVVAVGYGSDDNGQDYWIVRNSWGGNWGENGYIRMERNINA 336

Query: 333 DEGLCGIGTRSSYPL 347
           + G CGI   +SYP+
Sbjct: 337 NTGKCGIAMEASYPV 351


>gi|18413505|ref|NP_567376.1| putative cysteine proteinase [Arabidopsis thaliana]
 gi|30315954|sp|Q9SUT0.1|CPR3_ARATH RecName: Full=Probable cysteine proteinase At4g11310; Flags:
           Precursor
 gi|5596477|emb|CAB51415.1| drought-inducible cysteine proteinase RD21A precursor-like protein
           [Arabidopsis thaliana]
 gi|7267830|emb|CAB81232.1| drought-inducible cysteine proteinase RD21A precursor-like protein
           [Arabidopsis thaliana]
 gi|332657595|gb|AEE82995.1| putative cysteine proteinase [Arabidopsis thaliana]
          Length = 364

 Score =  297 bits (761), Expect = 5e-78,   Method: Compositional matrix adjust.
 Identities = 152/350 (43%), Positives = 221/350 (63%), Gaps = 21/350 (6%)

Query: 16  TPMFIIITLLV--SCASQVVSSRSTHE-----QSVVE-----IHEKWMAQHGRSYKDELE 63
           + M I++  +V  SCA+ +  S  +++      SV +     I E WM +HG+ Y    E
Sbjct: 6   SAMLILLVAMVIASCATAIDMSVVSYDDNNRLHSVFDAEASLIFESWMVKHGKVYGSVAE 65

Query: 64  KEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTS 123
           KE RL IF++NL +I   N E N +Y+LG   F+DL+  E++ +  G     P +    +
Sbjct: 66  KERRLTIFEDNLRFINNRNAE-NLSYRLGLTGFADLSLHEYKEVCHGADPRPPRNHVFMT 124

Query: 124 STFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQL 183
           S+ +Y+  +   +P S+DWR++GAVT +K+Q  C  CWAF+ V AVEG+ KI +G L+ L
Sbjct: 125 SSDRYKTSADDVLPKSVDWRNEGAVTEVKDQGHCRSCWAFSTVGAVEGLNKIVTGELVTL 184

Query: 184 SEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKP--AAA 241
           SEQ L++C+   NNGC GG  E A+ +I++N G+ T+++YPY+AV G C    K      
Sbjct: 185 SEQDLINCNKE-NNGCGGGKLETAYEFIMKNGGLGTDNDYPYKAVNGVCDGRLKENNKNV 243

Query: 242 KISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVG 301
            I  YE +P+ DE AL+KAV+ QPV+  I + S EFQ Y+ G+F+G CGT L+H V +VG
Sbjct: 244 MIDGYENLPANDESALMKAVAHQPVTAVIDSSSREFQLYESGVFDGSCGTNLNHGVVVVG 303

Query: 302 FGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPL 347
           +G TE+G +YWL+KNS G TWG+AGYMK+ R+     GLCGI  R+SYPL
Sbjct: 304 YG-TENGRDYWLVKNSRGITWGEAGYMKMARNIANPRGLCGIAMRASYPL 352


>gi|37780041|gb|AAP32193.1| cysteine protease 14 [Trifolium repens]
          Length = 351

 Score =  297 bits (760), Expect = 6e-78,   Method: Compositional matrix adjust.
 Identities = 142/310 (45%), Positives = 214/310 (69%), Gaps = 11/310 (3%)

Query: 43  VVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTND 102
           ++E+ E WM++HG+ Y+   EK +R ++FK+NL++I++ NK  +  Y LG N+F+DL++ 
Sbjct: 43  LIELFESWMSRHGKIYETIEEKLLRFEVFKDNLKHIDERNKIVS-NYWLGLNEFADLSHQ 101

Query: 103 EFRALYTGYKMPSPSHR-STTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCW 161
           EF+  Y G K+     R S+    F Y+++   D+P S+DWR KGAVTP+KNQ +CG CW
Sbjct: 102 EFKNKYLGLKVNLSQRRESSNEEEFTYRDV---DLPKSVDWRKKGAVTPVKNQGQCGSCW 158

Query: 162 AFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATED 221
           AF+ VAAVEGI +I +GNL  LSEQ+L+DC T  NNGC GG  + AF++I+QN G+  ED
Sbjct: 159 AFSTVAAVEGINQIVTGNLTSLSEQELIDCDTTYNNGCNGGLMDYAFSFIVQNGGLHKED 218

Query: 222 EYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSY 280
           +YPY     TC   ++      I+ Y +VP  +EQ+LLKA++ QP+S+AI A S +FQ Y
Sbjct: 219 DYPYIMEESTCEMKKEETQVVTINGYHDVPQNNEQSLLKALANQPLSVAIEASSRDFQFY 278

Query: 281 KEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGL 336
             G+F+G CG+ LDH V+ VG+GT+++  +Y ++KNSWG  WG+ G++++ R+    EG+
Sbjct: 279 SGGVFDGHCGSDLDHGVSAVGYGTSKN-LDYIIVKNSWGAKWGEKGFIRMKRNIGKPEGI 337

Query: 337 CGIGTRSSYP 346
           CG+   +SYP
Sbjct: 338 CGLYKMASYP 347


>gi|2144501|pir||TAGB actinidain (EC 3.4.22.14) precursor - kiwi fruit
 gi|166317|gb|AAA32629.1| actinidin [Actinidia deliciosa]
          Length = 380

 Score =  296 bits (759), Expect = 7e-78,   Method: Compositional matrix adjust.
 Identities = 154/340 (45%), Positives = 212/340 (62%), Gaps = 10/340 (2%)

Query: 13  INTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFK 72
           ++ + +F    L++S A    +        V  ++E W+ ++G+SY    E E R +IFK
Sbjct: 8   VSMSLLFFSTLLILSLAFNAKNLTQRTNDEVKAMYESWLIKYGKSYNSLGEWERRFEIFK 67

Query: 73  ENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLS 132
           E L +I++ N + NR+YK+G NQF+DLT++EFR+ Y G+   S S+++  S+  +Y+   
Sbjct: 68  ETLRFIDEHNADTNRSYKVGLNQFADLTDEEFRSTYLGFT--SGSNKTKVSN--RYEPRV 123

Query: 133 MTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS 192
              +P+ +DWR  GAV  IK+Q ECG CWAF+A+A VEGI KI +G LI LSEQ+L+DC 
Sbjct: 124 GQVLPSYVDWRSAGAVVDIKSQGECGGCWAFSAIATVEGINKIVTGVLISLSEQELIDCG 183

Query: 193 -TNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAA-QKPAAAKISNYEEVP 250
            T    GC GG     F +II N GI TE+ YPY A  G C+   Q      I  YE VP
Sbjct: 184 RTQNTRGCNGGYITDGFQFIINNGGINTEENYPYTAQDGECNVELQNEKYVTIDTYENVP 243

Query: 251 SGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGAN 310
             +E AL  AV+ QPVS+A+ A    F+ Y  GIF G CGT +DHAVTIVG+G TE G +
Sbjct: 244 YNNEWALQTAVTYQPVSVALDAAGDAFKQYSSGIFTGPCGTAIDHAVTIVGYG-TEGGID 302

Query: 311 YWLIKNSWGNTWGDAGYMKIVRD---EGLCGIGTRSSYPL 347
           YW++KNSW  TWG+ GYM+I+R+    G CGI T  SYP+
Sbjct: 303 YWIVKNSWDTTWGEEGYMRILRNVGGAGTCGIATMPSYPV 342


>gi|312451836|gb|ADQ85985.1| actinidin [Actinidia chinensis]
          Length = 380

 Score =  296 bits (759), Expect = 7e-78,   Method: Compositional matrix adjust.
 Identities = 154/340 (45%), Positives = 212/340 (62%), Gaps = 10/340 (2%)

Query: 13  INTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFK 72
           ++ + +F    L++S A    +        V  ++E W+ ++G+SY    E E R +IFK
Sbjct: 8   VSMSLLFFSTLLILSLAFNTKNLTQRTNDEVKAMYESWLIKYGKSYNSLGEWERRFEIFK 67

Query: 73  ENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLS 132
           E L +I++ N + NR+YK+G NQF+DLT++EFR+ Y G+   S S+++  S+  +Y+   
Sbjct: 68  ETLRFIDEHNADTNRSYKVGLNQFADLTDEEFRSTYLGFT--SGSNKTKVSN--RYEPRV 123

Query: 133 MTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS 192
              +P+ +DWR  GAV  IK+Q ECG CWAF+A+A VEGI KI +G LI LSEQ+L+DC 
Sbjct: 124 GQVLPSYVDWRSAGAVVDIKSQGECGGCWAFSAIATVEGINKIVTGVLISLSEQELIDCG 183

Query: 193 -TNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSA-AQKPAAAKISNYEEVP 250
            T    GC GG     F +II N GI TE+ YPY A  G C+   Q      I  YE VP
Sbjct: 184 RTQNTRGCNGGYITDGFQFIINNGGINTEENYPYTAQDGECNVDLQNEKYVTIDTYENVP 243

Query: 251 SGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGAN 310
             +E AL  AV+ QPVS+A+ A    F+ Y  GIF G CGT +DHAVTIVG+G TE G +
Sbjct: 244 YNNEWALQTAVTYQPVSVALDAAGDAFKQYSSGIFTGPCGTAIDHAVTIVGYG-TEGGID 302

Query: 311 YWLIKNSWGNTWGDAGYMKIVRD---EGLCGIGTRSSYPL 347
           YW++KNSW  TWG+ GYM+I+R+    G CGI T  SYP+
Sbjct: 303 YWIVKNSWDTTWGEEGYMRILRNVGGAGTCGIATMPSYPV 342


>gi|20260334|gb|AAM13065.1| drought-inducible cysteine proteinase RD21A precursor-like protein
           [Arabidopsis thaliana]
 gi|23197782|gb|AAN15418.1| drought-inducible cysteine proteinase RD21A precursor-like protein
           [Arabidopsis thaliana]
          Length = 357

 Score =  296 bits (759), Expect = 8e-78,   Method: Compositional matrix adjust.
 Identities = 149/346 (43%), Positives = 219/346 (63%), Gaps = 19/346 (5%)

Query: 18  MFIIITLLVSCASQVVSSRSTHE-----QSVVE-----IHEKWMAQHGRSYKDELEKEMR 67
           + ++  ++ SCA+ +  S  +++      SV +     I E WM +HG+ Y    EKE R
Sbjct: 3   ILLVAMVIASCATAIDMSVVSYDDNNRLHSVFDAEASLIFESWMVKHGKVYGSVAEKERR 62

Query: 68  LKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFK 127
           L IF++NL +I   N E N +Y+LG   F+DL+  E++ +  G     P +    +S+ +
Sbjct: 63  LTIFEDNLRFINNRNAE-NLSYRLGLTGFADLSLHEYKEVCHGADPRPPRNHVFMTSSDR 121

Query: 128 YQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQ 187
           Y+  +   +P S+DWR++GAVT +K+Q  C  CWAF+ V AVEG+ KI +G L+ LSEQ 
Sbjct: 122 YKTSADDVLPKSVDWRNEGAVTEVKDQGHCRSCWAFSTVGAVEGLNKIVTGELVTLSEQD 181

Query: 188 LLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKP--AAAKISN 245
           L++C+   NNGC GG  E A+ +I++N G+ T+++YPY+AV G C    K       I  
Sbjct: 182 LINCNKE-NNGCGGGKLETAYEFIMKNGGLGTDNDYPYKAVNGVCDGRLKENNKNVMIDG 240

Query: 246 YEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTT 305
           YE +P+ DE AL+KAV+ QPV+  I + S EFQ Y+ G+F+G CGT L+H V +VG+G T
Sbjct: 241 YENLPANDESALMKAVAHQPVTAVIDSSSREFQLYESGVFDGSCGTNLNHGVVVVGYG-T 299

Query: 306 EDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPL 347
           E+G +YWL+KNS G TWG+AGYMK+ R+     GLCGI  R+SYPL
Sbjct: 300 ENGRDYWLVKNSRGITWGEAGYMKMARNIANPRGLCGIAMRASYPL 345


>gi|172052260|gb|ACB70409.1| cysteine protease [Nicotiana tabacum]
          Length = 361

 Score =  296 bits (759), Expect = 8e-78,   Method: Compositional matrix adjust.
 Identities = 147/313 (46%), Positives = 204/313 (65%), Gaps = 16/313 (5%)

Query: 45  EIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEF 104
           E++E+W + H  S +   EK+ R  +FK N+ Y+   NK+ ++ YKL  N+F+D+TN EF
Sbjct: 36  ELYERWRSHHTVS-RSLDEKDKRFNVFKANVHYVHNFNKK-DKPYKLKLNKFADMTNHEF 93

Query: 105 RALYTGYKMPSPSHR-----STTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGC 159
           R  Y G K+    HR     S  + TF Y +     VP ++DWR KGAVTP+K+Q +CG 
Sbjct: 94  RHHYAGSKIKH--HRTFLGASRANGTFMYAHED--SVPPTVDWRKKGAVTPVKDQGKCGS 149

Query: 160 CWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIAT 219
           CWAF+ V AVEGI +I++  L+ LSEQ+L+DC T+ N GC GG  + AF +I +  GI T
Sbjct: 150 CWAFSTVVAVEGINQIKTNELVSLSEQELVDCDTSQNQGCNGGLMDMAFEFIKKKGGINT 209

Query: 220 EDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQ 278
           E+ YPY A  G C   ++ +    I  +E+VP  DE +LLKAV+ QPVS+AI A  ++FQ
Sbjct: 210 EENYPYMAEGGECDIQKRNSPVVSIDGHEDVPPNDEGSLLKAVANQPVSVAIQASGSDFQ 269

Query: 279 SYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR----DE 334
            Y EG+F G CGT+LDH V IVG+GTT D   YW++KNSWG  WG+ GY+++ R    +E
Sbjct: 270 FYSEGVFTGDCGTELDHGVAIVGYGTTLDRTKYWIVKNSWGPEWGEKGYIRMQREIDAEE 329

Query: 335 GLCGIGTRSSYPL 347
           GLCGI  + SYP+
Sbjct: 330 GLCGIAMQPSYPI 342


>gi|30141021|dbj|BAC75924.1| cysteine protease-2 [Helianthus annuus]
          Length = 362

 Score =  296 bits (758), Expect = 9e-78,   Method: Compositional matrix adjust.
 Identities = 145/317 (45%), Positives = 206/317 (64%), Gaps = 15/317 (4%)

Query: 40  EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDL 99
           E ++ +++E+W  +   ++ ++L    R  +FK N+ ++ + NK  ++ YKL  N+F+D+
Sbjct: 33  EDNLWDMYERWRHKVATNHGEKLR---RFNVFKSNVLHVHETNKM-DKPYKLKLNKFADM 88

Query: 100 TNDEFRALYTGYKMP----SPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQK 155
           TN EFR++Y G K+     S     + S TF Y N+    VPTS+DWR KGAV P+K+Q 
Sbjct: 89  TNHEFRSVYAGSKIHHHDRSLQGDRSGSKTFMYANVE--SVPTSVDWRKKGAVAPVKDQG 146

Query: 156 ECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQ 215
           +CG CWAF+ VAAVEGI KI++  L+ LSEQ+L+DC T  N GC GG  + AF +I +  
Sbjct: 147 QCGSCWAFSTVAAVEGINKIKTNELVSLSEQELVDCDTLENQGCNGGLMDLAFDFIKKTG 206

Query: 216 GIATEDEYPYQAVPGTC-SAAQKPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYS 274
           G+  ED YPY A  G C S         I  +E+VP  DEQ+L+KAV+ QPV++AI A S
Sbjct: 207 GLTREDAYPYAAEDGKCDSNKMNSPVVSIDGHEDVPKNDEQSLMKAVANQPVAVAIDAGS 266

Query: 275 TEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR-- 332
           ++FQ Y EG+F G CGTQLDH V  VG+GTT DG  YW+++NSWG+ WG+ GY+++ R  
Sbjct: 267 SDFQFYSEGVFTGKCGTQLDHGVAAVGYGTTLDGTKYWIVRNSWGSEWGEKGYIRMERGI 326

Query: 333 --DEGLCGIGTRSSYPL 347
               GLCGI   +SYP+
Sbjct: 327 SDKRGLCGIAMEASYPI 343


>gi|225456820|ref|XP_002278323.1| PREDICTED: vignain [Vitis vinifera]
          Length = 360

 Score =  296 bits (758), Expect = 9e-78,   Method: Compositional matrix adjust.
 Identities = 149/318 (46%), Positives = 201/318 (63%), Gaps = 16/318 (5%)

Query: 40  EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDL 99
           E+S+  ++E+W + H  S +   EK  R  +FKEN+ ++ + NK+ +  YKL  N+F+D+
Sbjct: 31  EESLWNLYERWRSHHTVS-RSLDEKHKRFNVFKENVNFVHEFNKK-DEPYKLKLNKFADM 88

Query: 100 TNDEFRALYTGYKMPSPSHRSTTSS-----TFKYQNLSMTDVPTSLDWRDKGAVTPIKNQ 154
           TN EFR+ Y G K+    HR    S     +F Y+ +    VP S+DWR KGAVTPIK+Q
Sbjct: 89  TNHEFRSTYAGSKVNH--HRMFRGSQHAAGSFMYEKVK--SVPPSVDWRKKGAVTPIKDQ 144

Query: 155 KECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQN 214
            +CG CWAF+ V AVEGI  I++  L+ LSEQ+L+DC T+ N GC GG    AF +I + 
Sbjct: 145 GQCGSCWAFSTVVAVEGINHIKTNKLVSLSEQELVDCDTSENQGCNGGLMGYAFEFIKEK 204

Query: 215 QGIATEDEYPYQAVPGTCSAAQ-KPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAY 273
            GI TE  YPY A  GTC  ++       I  +E VP  +E ALLKA + QP+S+AI A 
Sbjct: 205 GGITTEQSYPYTAEDGTCDVSKVNSPVVSIDGHETVPPNNEDALLKAAANQPISVAIDAG 264

Query: 274 STEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR- 332
            + FQ Y EG+F G CGT LDH V IVG+GTT DG  YW++KNSWG  WG+ GY+++ R 
Sbjct: 265 GSAFQFYSEGVFAGRCGTDLDHGVAIVGYGTTLDGTKYWIVKNSWGTDWGENGYIRMKRG 324

Query: 333 ---DEGLCGIGTRSSYPL 347
               EGLCGI   +SYP+
Sbjct: 325 ISAKEGLCGIAVEASYPI 342


>gi|357166359|ref|XP_003580684.1| PREDICTED: oryzain alpha chain-like [Brachypodium distachyon]
          Length = 456

 Score =  296 bits (758), Expect = 1e-77,   Method: Compositional matrix adjust.
 Identities = 146/324 (45%), Positives = 210/324 (64%), Gaps = 12/324 (3%)

Query: 32  VVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKAN---KEGNRT 88
           +VS     E+ V  ++ +WMA++GR+Y    E+E R ++F++NL Y+++ N     G  +
Sbjct: 27  IVSYGERSEEEVRRMYVEWMAENGRTYNAIGEEERRFEVFRDNLRYVDQHNAAADAGLHS 86

Query: 89  YKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAV 148
           ++LG N+F+DLTN+E+R  Y G +      R  +    +YQ     ++P S+DWR+KGAV
Sbjct: 87  FRLGLNRFADLTNEEYRDTYLGVRTKPVRERRLSG---RYQAADNEELPESVDWREKGAV 143

Query: 149 TPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAF 208
             +K+Q  CG CWAF+A+AAVEGI +I +G++I LSEQ+L+DC T+ N GC GG  + AF
Sbjct: 144 AKVKDQGGCGSCWAFSAIAAVEGINQIVTGDMIALSEQELVDCDTSYNQGCNGGLMDYAF 203

Query: 209 AYIIQNQGIATEDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVS 267
            +II N GI +E++YPY+     C A +K A    I  YE+VP   E +L KAV+ QP+S
Sbjct: 204 EFIINNGGIDSEEDYPYKERDNRCDANKKNAKVVTIDGYEDVPVNSELSLKKAVANQPIS 263

Query: 268 IAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGY 327
           +AI A    FQ YK GIF G CGT LDH VT VG+G +E+G +YW++KNSWG  WG+ GY
Sbjct: 264 VAIEAGGRAFQLYKSGIFTGRCGTALDHGVTAVGYG-SENGKDYWIVKNSWGTVWGEDGY 322

Query: 328 MKIVRD----EGLCGIGTRSSYPL 347
           +++ R+     G CGI    SYPL
Sbjct: 323 VRLERNIKATSGKCGIAIEPSYPL 346


>gi|242055753|ref|XP_002457022.1| hypothetical protein SORBIDRAFT_03g047290 [Sorghum bicolor]
 gi|241928997|gb|EES02142.1| hypothetical protein SORBIDRAFT_03g047290 [Sorghum bicolor]
          Length = 378

 Score =  296 bits (757), Expect = 1e-77,   Method: Compositional matrix adjust.
 Identities = 153/339 (45%), Positives = 214/339 (63%), Gaps = 31/339 (9%)

Query: 37  STHEQSVVEIHEKWMAQHGRSYKDELEKEMR-LKIFKENLEYIEKANKEGNRTYKLGTNQ 95
           S+HE S+ E+ E+W+++H +     LE+++R  ++FK+NL +I++ N++ + +Y LG N+
Sbjct: 39  SSHE-SLAELFERWLSRHRKGAYASLEEKLRRFEVFKDNLHHIDETNRKVS-SYWLGLNE 96

Query: 96  FSDLTNDEFRALYTGYKMPSPS------HRSTTSST-------------FKYQNLSMTDV 136
           F+DLT+DEF+A Y G             H                    F+Y+ +    +
Sbjct: 97  FADLTHDEFKATYLGLSPSGGGGDVVHMHHDDDDEEPEEEGSSSSSSFRFRYEGVDAARL 156

Query: 137 PTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGN 196
           P S+DWR KGAVT +KNQ +CG CWAF+ VAAVEGI +I +GNL  LSEQ+L+DC T+GN
Sbjct: 157 PKSVDWRSKGAVTGVKNQGQCGSCWAFSTVAAVEGINQIVTGNLTALSEQELVDCDTDGN 216

Query: 197 NGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQA 256
           NGC GG  + AF+YI  N G+ TE+ YPY    GTCS     A   IS YE+VP  +EQA
Sbjct: 217 NGCNGGLMDYAFSYIAHNGGLHTEEAYPYLMEEGTCSRGSSAAVVTISGYEDVPRNNEQA 276

Query: 257 LLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTT--EDG---ANY 311
           LLKA++ QPVS+AI A     Q Y  G+F+G CGTQLDH V  VG+GT   ++G   A+Y
Sbjct: 277 LLKALAHQPVSVAIEASGRNLQFYSGGVFDGPCGTQLDHGVAAVGYGTAGKDNGHVVADY 336

Query: 312 WLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYP 346
            ++KNSWG +WG+ GY+++ R     +GLCGI    SYP
Sbjct: 337 IIVKNSWGPSWGEKGYIRMRRGTGKRQGLCGINKMPSYP 375


>gi|146215984|gb|ABQ10194.1| actinidin Act2c [Actinidia arguta]
          Length = 378

 Score =  296 bits (757), Expect = 1e-77,   Method: Compositional matrix adjust.
 Identities = 151/340 (44%), Positives = 210/340 (61%), Gaps = 11/340 (3%)

Query: 13  INTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFK 72
           I+ + +F    L++S A  +V+S       V +++E W+ + G+SY    EKEMR +IFK
Sbjct: 8   ISMSLLFFSTLLILSSALDIVNSAQRTNDQVRDMYESWLVEQGKSYNSLDEKEMRFEIFK 67

Query: 73  ENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLS 132
           +NL  I+  N + NR++ LG N+F+DLT++E+R+ Y G+K   P  + +     K  ++ 
Sbjct: 68  DNLRIIDDHNADANRSFSLGLNRFADLTDEEYRSTYLGFK-SGPKAKVSNRYVPKVGDV- 125

Query: 133 MTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS 192
              +P  +DWR  GAV  +KNQ  C  CWAF+AVAAVEGI KI +GNL+ LSEQ+L+DC 
Sbjct: 126 ---LPNYVDWRTVGAVVGVKNQGLCSSCWAFSAVAAVEGINKIMTGNLLSLSEQELVDCG 182

Query: 193 -TNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSA-AQKPAAAKISNYEEVP 250
            T    GC  G    AF +II N GI TED YPY A  G C+   Q      I +YE VP
Sbjct: 183 RTQSTRGCNRGYMTDAFQFIINNGGINTEDNYPYTAQDGQCNRYLQNQKYVTIDDYENVP 242

Query: 251 SGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGAN 310
           S +E AL  AV+ QPVS+ + +   +F+ Y  GIF   CGT +DH VTIVG+G TE G +
Sbjct: 243 SNNEWALQNAVAHQPVSVGLESEGGKFKLYTSGIFTQYCGTAIDHGVTIVGYG-TERGLD 301

Query: 311 YWLIKNSWGNTWGDAGYMKIVRD---EGLCGIGTRSSYPL 347
           YW++KNSWG  WG+ GY++I R+    G CGI   +SYP+
Sbjct: 302 YWIVKNSWGTNWGENGYIRIQRNIGGAGKCGIARMASYPV 341


>gi|193806686|sp|A5HII1.1|ACTN_ACTDE RecName: Full=Actinidain; Short=Actinidin; AltName: Full=Allergen
           Act d 1; AltName: Allergen=Act d 1; Flags: Precursor
 gi|146215974|gb|ABQ10189.1| actinidin Act1a [Actinidia deliciosa]
          Length = 380

 Score =  296 bits (757), Expect = 1e-77,   Method: Compositional matrix adjust.
 Identities = 154/340 (45%), Positives = 212/340 (62%), Gaps = 10/340 (2%)

Query: 13  INTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFK 72
           ++ + +F    L++S A    +        V  ++E W+ ++G+SY    E E R +IFK
Sbjct: 8   VSMSLLFFSTLLILSLAFNAKNLTQRTNDEVKAMYESWLIKYGKSYNSLGEWERRFEIFK 67

Query: 73  ENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLS 132
           E L +I++ N + NR+YK+G NQF+DLT++EFR+ Y G+   S S+++  S+  +Y+   
Sbjct: 68  ETLRFIDEHNADTNRSYKVGLNQFADLTDEEFRSTYLGFT--SGSNKTKVSN--RYEPRV 123

Query: 133 MTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS 192
              +P+ +DWR  GAV  IK+Q ECG CWAF+A+A VEGI KI +G LI LSEQ+L+DC 
Sbjct: 124 GQVLPSYVDWRSAGAVVDIKSQGECGGCWAFSAIATVEGINKIVTGVLISLSEQELIDCG 183

Query: 193 -TNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSA-AQKPAAAKISNYEEVP 250
            T    GC GG     F +II N GI TE+ YPY A  G C+   Q      I  YE VP
Sbjct: 184 RTQNTRGCNGGYITDGFQFIINNGGINTEENYPYTAQDGECNLDLQNEKYVTIDTYENVP 243

Query: 251 SGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGAN 310
             +E AL  AV+ QPVS+A+ A    F+ Y  GIF G CGT +DHAVTIVG+G TE G +
Sbjct: 244 YNNEWALQTAVTYQPVSVALDAAGDAFKHYSSGIFTGPCGTAIDHAVTIVGYG-TEGGID 302

Query: 311 YWLIKNSWGNTWGDAGYMKIVRD---EGLCGIGTRSSYPL 347
           YW++KNSW  TWG+ GYM+I+R+    G CGI T  SYP+
Sbjct: 303 YWIVKNSWDTTWGEEGYMRILRNVGGAGTCGIATMPSYPV 342


>gi|15984|emb|CAA34486.1| unnamed protein product [Actinidia deliciosa]
          Length = 380

 Score =  296 bits (757), Expect = 1e-77,   Method: Compositional matrix adjust.
 Identities = 154/340 (45%), Positives = 212/340 (62%), Gaps = 10/340 (2%)

Query: 13  INTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFK 72
           ++ + +F    L++S A    +        V  ++E W+ ++G+SY    E E R +IFK
Sbjct: 8   VSMSLLFFSTLLILSLAFNAKNLTQRTNDEVKAMYESWLIKYGKSYNSLGEWERRFEIFK 67

Query: 73  ENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLS 132
           E L +I++ N + NR+YK+G NQF+DLT++EFR+ Y G+   S S+++  S+  +Y+   
Sbjct: 68  ETLRFIDEHNADTNRSYKVGLNQFADLTDEEFRSTYLGFT--SGSNKTKVSN--RYEPRF 123

Query: 133 MTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS 192
              +P+ +DWR  GAV  IK+Q ECG CWAF+A+A VEGI KI +G LI LSEQ+L+DC 
Sbjct: 124 GQVLPSYVDWRSAGAVVDIKSQGECGGCWAFSAIATVEGINKIVTGVLISLSEQELIDCG 183

Query: 193 -TNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSA-AQKPAAAKISNYEEVP 250
            T    GC GG     F +II N GI TE+ YPY A  G C+   Q      I  YE VP
Sbjct: 184 RTQNTRGCNGGYITDGFQFIINNGGINTEENYPYTAQDGECNLDLQNEKYVTIDTYENVP 243

Query: 251 SGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGAN 310
             +E AL  AV+ QPVS+A+ A    F+ Y  GIF G CGT +DHAVTIVG+G TE G +
Sbjct: 244 YNNEWALQTAVTYQPVSVALDAAGDAFKHYSSGIFTGPCGTAIDHAVTIVGYG-TEGGID 302

Query: 311 YWLIKNSWGNTWGDAGYMKIVRD---EGLCGIGTRSSYPL 347
           YW++KNSW  TWG+ GYM+I+R+    G CGI T  SYP+
Sbjct: 303 YWIVKNSWDTTWGEEGYMRILRNVGGAGTCGIATMPSYPV 342


>gi|45738078|gb|AAS75836.1| fastuosain precursor [Bromelia fastuosa]
          Length = 324

 Score =  295 bits (756), Expect = 1e-77,   Method: Compositional matrix adjust.
 Identities = 140/308 (45%), Positives = 198/308 (64%), Gaps = 10/308 (3%)

Query: 43  VVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTND 102
           ++E  E+WMA++GR Y D  EK  R +IFK N+ +IE  N     +Y LG NQF+D+TN+
Sbjct: 6   MMERFEEWMAEYGRVYNDNAEKMRRFQIFKNNVNHIETFNNRSGNSYTLGVNQFTDMTNN 65

Query: 103 EFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWA 162
           EF A YTG  +P    R    S   + ++ ++ VP S+DWRD GAVT +KNQ  CG CWA
Sbjct: 66  EFLARYTGASLPLNIERDPVVS---FDDVDISAVPQSIDWRDYGAVTSVKNQGSCGSCWA 122

Query: 163 FAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDE 222
           F+A+A VEGI KI++GNLI LSEQ++LDC+   + GC GG   KA+ +II N G+ +   
Sbjct: 123 FSAIATVEGIYKIKAGNLISLSEQEVLDCAL--SYGCDGGWVNKAYDFIISNNGVTSFAN 180

Query: 223 YPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKE 282
            PY+   G C+    P  A I+ Y  V S +E++++ AV+ QP++  I A   +FQ YK 
Sbjct: 181 LPYKGYKGPCNHNDLPNKAYITGYTYVQSNNERSMMIAVANQPIAALIDA-GGDFQYYKS 239

Query: 283 GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCG 338
           G+F G CGT L+HA+T++G+G T  G  YW++KNSWG +WG+ GY+++ RD     GLCG
Sbjct: 240 GVFTGSCGTSLNHAITVIGYGQTSSGTKYWIVKNSWGTSWGERGYIRMARDVSSPYGLCG 299

Query: 339 IGTRSSYP 346
           I     +P
Sbjct: 300 IAMAPLFP 307


>gi|359359213|gb|AEV41117.1| putative oryzain beta chain precursor [Oryza officinalis]
          Length = 465

 Score =  295 bits (756), Expect = 2e-77,   Method: Compositional matrix adjust.
 Identities = 142/307 (46%), Positives = 212/307 (69%), Gaps = 12/307 (3%)

Query: 47  HEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKAN-KEGNRTYKLGTNQFSDLTNDEFR 105
           ++ W+A++GRSY    E E R ++F +NL + +  N +  +  ++LG N+F+DLTN+EFR
Sbjct: 54  YDLWLAENGRSYNALGEHERRFRVFWDNLRFADAHNARADDHGFRLGMNRFADLTNEEFR 113

Query: 106 ALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAA 165
           A + G K+   S     ++  +Y++  + ++P S+DWR+KGAV P+KNQ +CG CWAF+A
Sbjct: 114 ATFLGAKVVERSR----AAGERYRHDGVEELPESVDWREKGAVAPVKNQGQCGSCWAFSA 169

Query: 166 VAAVEGITKIRSGNLIQLSEQQLLDCSTNG-NNGCLGGSREKAFAYIIQNQGIATEDEYP 224
           V+ VE I ++ +G +I LSEQ+L++CSTNG N+GC GG  + AF +II+N GI TED+YP
Sbjct: 170 VSTVESINQLVTGEMITLSEQELVECSTNGQNSGCNGGLMDDAFDFIIKNGGIDTEDDYP 229

Query: 225 YQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEG 283
           Y+AV G C   ++ A    I  +E+VP  DE++L KAV+ QPVS+AI A   EFQ Y  G
Sbjct: 230 YKAVDGKCDINRENAKVVSIDGFEDVPQNDEKSLQKAVAHQPVSVAIEAGGREFQLYHSG 289

Query: 284 IFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGI 339
           +F+G CGT LDH V  VG+G T++G +YW+++NSWG  WG++GY+++ R+     G CGI
Sbjct: 290 VFSGRCGTSLDHGVVAVGYG-TDNGKDYWIVRNSWGPKWGESGYVRMERNINVTTGKCGI 348

Query: 340 GTRSSYP 346
              +SYP
Sbjct: 349 AMMASYP 355


>gi|222425026|dbj|BAH20463.1| cysteine protease [Spinacia oleracea]
          Length = 473

 Score =  295 bits (756), Expect = 2e-77,   Method: Compositional matrix adjust.
 Identities = 149/325 (45%), Positives = 215/325 (66%), Gaps = 14/325 (4%)

Query: 34  SSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGT 93
           SSRS  E  V+ I+E W+ QH ++Y    EKE R  IFK+NLE+I++ N + ++T+K+G 
Sbjct: 42  SSRSDDE--VMRIYESWLVQHRKNYNALGEKEKRFAIFKDNLEFIDQHNSDDSQTFKVGL 99

Query: 94  NQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFK-----YQNLSMTDVPTSLDWRDKGAV 148
           N+F+DLTN+EFR++Y G K  S S    +S+  K     Y      ++P ++DWR  GAV
Sbjct: 100 NKFADLTNEEFRSVYLGRKKSSSSSPLLSSAKSKVKSDRYLFKEGDELPEAVDWRKNGAV 159

Query: 149 TPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAF 208
             +K+Q +CG CWAF+ +AAVEGI +I +G L+ LSEQ+L+DC T+ N+GC GG  + A+
Sbjct: 160 AKVKDQGQCGSCWAFSTIAAVEGINQIVTGELLSLSEQELVDCDTSYNSGCDGGLMDYAY 219

Query: 209 AYIIQNQGIATEDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVS 267
            +II N GI T+ +YPY A  G C   +K A    I ++E+VP  DE+AL KAV+ QPVS
Sbjct: 220 EFIINNGGIDTDADYPYTAKDGKCDQYRKNAKVVTIDDFEDVPENDEKALQKAVAHQPVS 279

Query: 268 IAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGY 327
           +AI A  + FQ Y+ G+F G CG  LDH V  VG+G ++DG +YW+++NSWG  WG++GY
Sbjct: 280 VAIEAGGSTFQFYQSGVFTGKCGADLDHGVVAVGYG-SDDGKDYWIVRNSWGADWGESGY 338

Query: 328 MKIVRD-----EGLCGIGTRSSYPL 347
           +++ R+      G CGI    SYP+
Sbjct: 339 IRMERNLETVKTGKCGIAIEPSYPI 363


>gi|297830592|ref|XP_002883178.1| hypothetical protein ARALYDRAFT_479457 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297329018|gb|EFH59437.1| hypothetical protein ARALYDRAFT_479457 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 452

 Score =  295 bits (756), Expect = 2e-77,   Method: Compositional matrix adjust.
 Identities = 148/340 (43%), Positives = 216/340 (63%), Gaps = 12/340 (3%)

Query: 15  TTPMFIIITLLVSCASQVVSSRST--HEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFK 72
           T  + I   LL+S +   V++  T  +E     ++E+W+ ++ ++Y    EKE R +IF 
Sbjct: 9   TLALLIFSMLLISLSLGSVTAADTTRNEAEARRMYEQWLVENRKNYNGLGEKETRFEIFT 68

Query: 73  ENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLS 132
           +NL+YIE+ N   N+T+++G  +F+DLTNDEFRA+Y   KM             +Y    
Sbjct: 69  DNLKYIEEHNSVPNQTFEVGLTRFADLTNDEFRAIYLRSKM---ERTRVPVKGERYLYKV 125

Query: 133 MTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS 192
              +P  +DWR KGAV P+K+Q  CG CWAF+A+ AVEGI +I++G LI LSEQ+L+DC 
Sbjct: 126 GDTLPDQIDWRAKGAVNPVKDQGNCGSCWAFSAIGAVEGINQIKTGELISLSEQELVDCD 185

Query: 193 TNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAV-PGTCSAAQKPA-AAKISNYEEVP 250
           T+ N GC GG  + AF +II+N GI TE++YPY A     C++ +K +    I  YE+VP
Sbjct: 186 TSYNGGCGGGLMDYAFKFIIENGGIDTEEDYPYTATDDNICNSDKKNSRVVTIDGYEDVP 245

Query: 251 SGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGAN 310
             DE++L KA++ QP+S+AI A    FQ YK G+F G CGT LDH V  VG+G +E G +
Sbjct: 246 QNDEKSLKKALANQPISVAIEAGGRAFQLYKSGVFTGTCGTSLDHGVVAVGYG-SEGGQD 304

Query: 311 YWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYP 346
           YW+++NSWG+ WG++GY K+ R+     G CG+   +SYP
Sbjct: 305 YWIVRNSWGSNWGESGYFKLERNIKESSGKCGVAMMASYP 344


>gi|111073715|dbj|BAF02546.1| triticain alpha [Triticum aestivum]
 gi|388890585|gb|AFK80346.1| cysteine endopeptidase EP alpha [Secale cereale x Triticum durum]
          Length = 461

 Score =  295 bits (756), Expect = 2e-77,   Method: Compositional matrix adjust.
 Identities = 145/323 (44%), Positives = 209/323 (64%), Gaps = 12/323 (3%)

Query: 32  VVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKAN---KEGNRT 88
           +VS     E+ V  ++ +WM++H R+Y    E+E R ++F++NL YI++ N     G  +
Sbjct: 26  IVSYGERSEEEVRRMYAEWMSEHRRTYNAIGEEERRFEVFRDNLRYIDQHNAAADAGLHS 85

Query: 89  YKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAV 148
           ++LG N+F+DLTN+E+R+ Y G +      R  ++   +YQ     ++P ++DWR KGAV
Sbjct: 86  FRLGLNRFADLTNEEYRSTYLGARTKPDRERKLSA---RYQADDNEELPETVDWRKKGAV 142

Query: 149 TPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAF 208
             IK+Q  CG CWAF+A+AAVEGI +I +G++I LSEQ+L+DC T+ N GC GG  + AF
Sbjct: 143 AAIKDQGGCGSCWAFSAIAAVEGINQIVTGDMIPLSEQELVDCDTSYNEGCNGGLMDYAF 202

Query: 209 AYIIQNQGIATEDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVS 267
            +II N GI +E++YPY+     C A +K A    I  YE+VP   E++L KAV+ QP+S
Sbjct: 203 EFIINNGGIDSEEDYPYKERDNRCDANKKNAKVVTIDGYEDVPVNSEKSLQKAVANQPIS 262

Query: 268 IAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGY 327
           +AI A    FQ YK GIF G CGT LDH V  VG+G TE+G +YWL++NSWG  WG+ GY
Sbjct: 263 VAIEAGGRAFQLYKSGIFTGTCGTALDHGVAAVGYG-TENGKDYWLVRNSWGTVWGEDGY 321

Query: 328 MKIVRD----EGLCGIGTRSSYP 346
           +++ R+     G CGI    SYP
Sbjct: 322 IRMERNIKASSGKCGIAVEPSYP 344


>gi|18396952|ref|NP_564322.1| cysteine proteinase-like protein [Arabidopsis thaliana]
 gi|332192922|gb|AEE31043.1| cysteine proteinase-like protein [Arabidopsis thaliana]
          Length = 334

 Score =  295 bits (756), Expect = 2e-77,   Method: Compositional matrix adjust.
 Identities = 154/346 (44%), Positives = 219/346 (63%), Gaps = 25/346 (7%)

Query: 13  INTTPMFIIITLLVS--CASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKI 70
           ++   +F+ +T+L      SQ     + +EQS+V+ H++WM Q  R YKDE EKEMRLK+
Sbjct: 2   VSVRSVFVALTILSMDLRISQARPHVTLNEQSIVDYHQQWMTQFSRVYKDESEKEMRLKV 61

Query: 71  FKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQN 130
           FK+NL++IE  N  GN++Y LG N+F+D   +EF A +TG ++   S     + T   +N
Sbjct: 62  FKKNLKFIENFNNMGNQSYTLGVNEFTDWKTEEFLATHTGLRVNVTSLSELFNKTKPSRN 121

Query: 131 LSMTDVPT---SLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQ 187
            +M+D+     S DWRD+GAVTP+K Q  C              +TKI   NL+ LSEQQ
Sbjct: 122 WNMSDIDMEDESKDWRDEGAVTPVKYQGAC-------------RLTKISGKNLLTLSEQQ 168

Query: 188 LLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSA-AQKPAAAKISNY 246
           L+DC    N GC GG  E+AF YII+N G++ E EYPYQ    +C A A++    +I  +
Sbjct: 169 LIDCDIEKNGGCNGGEFEEAFKYIIKNGGVSLETEYPYQVKKESCRANARRAPHTQIRGF 228

Query: 247 EEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGV-CGTQLDHAVTIVGFGTT 305
           + VPS +E+ALL+AV  QPVS+ I A +  F  YK G++ G+ CGT ++HAVTIVG+GT 
Sbjct: 229 QMVPSHNERALLEAVRRQPVSVLIDARADSFGHYKGGVYAGLDCGTDVNHAVTIVGYGTM 288

Query: 306 EDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPL 347
             G NYW++KNSWG +WG+ GYM+I RD    +G+CGI   ++YP+
Sbjct: 289 -SGLNYWVLKNSWGESWGENGYMRIRRDVEWPQGMCGIAQVAAYPV 333


>gi|357143305|ref|XP_003572875.1| PREDICTED: xylem cysteine proteinase 1-like [Brachypodium
           distachyon]
          Length = 473

 Score =  295 bits (756), Expect = 2e-77,   Method: Compositional matrix adjust.
 Identities = 148/311 (47%), Positives = 208/311 (66%), Gaps = 14/311 (4%)

Query: 43  VVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTND 102
           +V++   W  +H + Y    EK  R ++FK+NL++I + N+  N +Y LG NQF+D+ ++
Sbjct: 44  LVDLFSSWSVKHSKIYVSPEEKVKRYEVFKQNLKHIVETNRR-NGSYWLGLNQFADVAHE 102

Query: 103 EFRALYTGYK--MPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCC 160
           EF++ Y G K  M  P+   T    F+Y+N    ++P S+DWR KGAVTP+KNQ ECG C
Sbjct: 103 EFKSTYLGLKTGMDGPARAPTA---FRYEN--SVNLPWSVDWRKKGAVTPVKNQGECGSC 157

Query: 161 WAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATE 220
           WAF+ VAAVEGI +I +G L  LSEQ+L+DC T  ++GC GG  + AFAYI+ N GI T+
Sbjct: 158 WAFSTVAAVEGINQIATGKLESLSEQELMDCDTTFDHGCGGGFMDFAFAYIMGNLGIHTD 217

Query: 221 DEYPYQAVPGTCSAAQ-KPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQS 279
           D+YPY    G C   Q +     IS YE+VP   E +LLKA++ QP+S+ IAA S +FQ 
Sbjct: 218 DDYPYLMEEGYCKEKQPQSKVVTISGYEDVPENSEVSLLKALAHQPISVGIAAGSKDFQF 277

Query: 280 YKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR----DEG 335
           YK G+F G CGT+LDHA+T VG+G++ DG +Y ++KNSWG +WG+ GY +I R     EG
Sbjct: 278 YKRGVFEGSCGTELDHALTAVGYGSS-DGQDYIIMKNSWGKSWGEQGYFRIKRGTGKPEG 336

Query: 336 LCGIGTRSSYP 346
           +C I + +SYP
Sbjct: 337 VCSIYSMASYP 347


>gi|297816030|ref|XP_002875898.1| hypothetical protein ARALYDRAFT_485194 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297321736|gb|EFH52157.1| hypothetical protein ARALYDRAFT_485194 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 363

 Score =  295 bits (755), Expect = 2e-77,   Method: Compositional matrix adjust.
 Identities = 148/318 (46%), Positives = 208/318 (65%), Gaps = 17/318 (5%)

Query: 40  EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDL 99
           E++V +++E+W   H  + +   E   R  +F+ N+ ++ + NK+ N+ YKL  N+F+D+
Sbjct: 30  EENVWKLYERWRDHHSVT-RASHEALKRFNVFRHNVLHVHRTNKK-NKPYKLKVNRFADI 87

Query: 100 TNDEFRALYTG-----YKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQ 154
           T+ EFR+ Y G     ++M     R   S  F Y+N+  T VP+S+DWR+KGAVT +KNQ
Sbjct: 88  THHEFRSSYAGSNVKHHRMLRGPKRG--SGGFMYENV--TRVPSSVDWREKGAVTEVKNQ 143

Query: 155 KECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQN 214
           ++CG CWAF+ VAAVEGI KIR+  L+ LSEQ+L+DC T  N GC GG  E AF +I  N
Sbjct: 144 QDCGSCWAFSTVAAVEGINKIRTNKLVSLSEQELVDCDTEENQGCAGGLMEPAFEFIKNN 203

Query: 215 QGIATEDEYPYQA--VPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAA 272
            GI TE+ YPY +  V    + +       I  +E VP  DE+ALLKAV+ QPVS+AI A
Sbjct: 204 GGIKTEETYPYDSNDVQFCRAKSIDGETVTIDGHEHVPENDEEALLKAVAHQPVSVAIDA 263

Query: 273 YSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR 332
            S++FQ Y EG+F G CGTQL+H V IVG+G T++G  YW+++NSWG  WG+ GY++I R
Sbjct: 264 GSSDFQLYSEGVFIGECGTQLNHGVVIVGYGETKNGTKYWIVRNSWGPEWGEGGYVRIER 323

Query: 333 ----DEGLCGIGTRSSYP 346
               +EG CGI   +SYP
Sbjct: 324 GISENEGRCGIAMEASYP 341


>gi|38345188|emb|CAE03344.2| OSJNBb0005B05.11 [Oryza sativa Japonica Group]
 gi|125589403|gb|EAZ29753.1| hypothetical protein OsJ_13812 [Oryza sativa Japonica Group]
          Length = 323

 Score =  295 bits (754), Expect = 2e-77,   Method: Compositional matrix adjust.
 Identities = 142/336 (42%), Positives = 205/336 (61%), Gaps = 25/336 (7%)

Query: 18  MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
           +F I+  L  C++ + +   + + ++   HE+WMAQ+GR YKD+ EK  R ++FK N+ +
Sbjct: 8   LFAILGCLCLCSAVLAARELSDDAAMAARHERWMAQYGRMYKDDAEKARRFEVFKANVAF 67

Query: 78  IEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVP 137
           IE  N  GN  + LG NQF+DLTNDEFR+  T       + R  T   F+ +N+++  +P
Sbjct: 68  IESFNA-GNHKFWLGVNQFADLTNDEFRSTKTNKGFIPSTTRVPTG--FRNENVNIDALP 124

Query: 138 TSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-N 196
            ++DWR KG VTPIK+Q +CGCCWAF+AVAA+E                +L+DC  +G +
Sbjct: 125 ATMDWRTKGVVTPIKDQGQCGCCWAFSAVAAME----------------ELVDCDVHGED 168

Query: 197 NGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQA 256
            GC GG  + AF +II+N G+ TE  YPY AV     +    + A I  YE+VP+ +E A
Sbjct: 169 QGCEGGLMDDAFKFIIKNGGLTTESNYPYAAVDDKFKSVSN-SVASIKGYEDVPANNEAA 227

Query: 257 LLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKN 316
           L+KAV+ QPVS+A+      FQ YK G+  G CGT LDH +  +G+G   DG  YWL+KN
Sbjct: 228 LMKAVANQPVSVAVDGGDMTFQFYKGGVMTGSCGTDLDHGIVAIGYGKASDGTKYWLLKN 287

Query: 317 SWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPLA 348
           SWG TWG+ G++++ +D     G+CG+    SYP A
Sbjct: 288 SWGMTWGENGFLRMEKDISDKRGMCGLAMEPSYPTA 323


>gi|262360187|gb|ACY38051.2| cysteine proteinase C1A [Dactylis glomerata]
          Length = 365

 Score =  295 bits (754), Expect = 3e-77,   Method: Compositional matrix adjust.
 Identities = 149/321 (46%), Positives = 203/321 (63%), Gaps = 20/321 (6%)

Query: 40  EQSVVEIHEKWMAQHG---RSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQF 96
           E+S+  ++E W + H    R    E E   R  +FKEN+ YI +ANK+ +R ++L  N+F
Sbjct: 33  EESLRGLYETWRSHHTVSRRGLGAEAEAR-RFNVFKENVRYIHEANKK-DRPFRLALNKF 90

Query: 97  SDLTNDEFRALYTGYKMPSPSHRS------TTSSTFKYQNLSMTDVPTSLDWRDKGAVTP 150
           +D+T DEFR  Y G ++    HRS          +F Y +    ++P ++DWR KGAVTP
Sbjct: 91  ADMTTDEFRRTYAGSRVRH--HRSLSGGRRQGGGSFMYADAE--NLPAAVDWRQKGAVTP 146

Query: 151 IKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAY 210
           IK+Q +CG CWAF+ + AVEGI KIR+G L+ LSEQ+L+DC+   N+GC GG  + AF +
Sbjct: 147 IKDQGQCGSCWAFSTIVAVEGINKIRTGRLVSLSEQELMDCNIGENDGCNGGLMDVAFQF 206

Query: 211 IIQNQGIATEDEYPYQAVPGTCSAAQKPAA-AKISNYEEVPSGDEQALLKAVSMQPVSIA 269
           I QN GI TE  YPYQ    +C  +++ +    I  YE+VP+ DE AL KAV+ QPVS+A
Sbjct: 207 IQQNGGITTEASYPYQGEQNSCDQSKENSHDVSIDGYEDVPANDESALQKAVANQPVSVA 266

Query: 270 IAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMK 329
           I A   +FQ Y EG+F    GT LDH V  VG+GTT DG  YW++KNSWG  WG+ GY++
Sbjct: 267 IDASGNDFQFYSEGVFTTDGGTDLDHGVAAVGYGTTRDGTKYWIVKNSWGEDWGEKGYIR 326

Query: 330 IVRD----EGLCGIGTRSSYP 346
           + R     EGLCGI   +SYP
Sbjct: 327 MQRGVKQAEGLCGIAMEASYP 347


>gi|312451845|gb|ADQ85986.1| actinidin [Actinidia chinensis]
          Length = 380

 Score =  295 bits (754), Expect = 3e-77,   Method: Compositional matrix adjust.
 Identities = 153/340 (45%), Positives = 211/340 (62%), Gaps = 10/340 (2%)

Query: 13  INTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFK 72
           ++ + +F    L++S A    +        V  ++E W+ ++G+SY    E E R +IFK
Sbjct: 8   VSMSLLFFSTLLILSLAFNAKNLTQRTNDEVKAMYESWLIKYGKSYNSLGEWERRFEIFK 67

Query: 73  ENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLS 132
           E L +I++ N + NR+YK+G NQF+DLT++EFR+ Y G+   S S+++  S+  +Y+   
Sbjct: 68  ETLRFIDEHNADTNRSYKVGLNQFADLTDEEFRSTYLGFT--SGSNKTKVSN--RYEPRV 123

Query: 133 MTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS 192
              +P+ +DWR  GAV  IK+Q ECG CWAF+A+A VEGI KI +G LI LSEQ+L+DC 
Sbjct: 124 GQVLPSYVDWRSAGAVVDIKSQGECGGCWAFSAIATVEGINKIVTGVLISLSEQELIDCG 183

Query: 193 -TNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSA-AQKPAAAKISNYEEVP 250
            T    GC G      F +II N GI TE+ YPY A  G C+   Q      I  YE VP
Sbjct: 184 RTQNTRGCNGSYITDGFPFIINNGGINTEENYPYTAQDGECNVDLQNEKYVTIDTYENVP 243

Query: 251 SGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGAN 310
             +E AL  AV+ QPVS+A+ A    F+ Y  GIF G CGT +DHAVTIVG+G TE G +
Sbjct: 244 YNNEWALQTAVTYQPVSVALDAAGDAFKQYSSGIFTGPCGTAIDHAVTIVGYG-TEGGID 302

Query: 311 YWLIKNSWGNTWGDAGYMKIVRD---EGLCGIGTRSSYPL 347
           YW++KNSW  TWG+ GYM+I+R+    G CGI T  SYP+
Sbjct: 303 YWIVKNSWDTTWGEEGYMRILRNVGGAGTCGIATMPSYPV 342


>gi|18408616|ref|NP_566901.1| putative cysteine proteinase [Arabidopsis thaliana]
 gi|75313880|sp|Q9STL5.1|CEP3_ARATH RecName: Full=KDEL-tailed cysteine endopeptidase CEP3; Flags:
           Precursor
 gi|4678353|emb|CAB41163.1| cysteine endopeptidase precursor-like protein [Arabidopsis
           thaliana]
 gi|26453052|dbj|BAC43602.1| putative cysteine endopeptidase precursor [Arabidopsis thaliana]
 gi|332644885|gb|AEE78406.1| putative cysteine proteinase [Arabidopsis thaliana]
          Length = 364

 Score =  295 bits (754), Expect = 3e-77,   Method: Compositional matrix adjust.
 Identities = 148/318 (46%), Positives = 205/318 (64%), Gaps = 17/318 (5%)

Query: 40  EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDL 99
           E++V +++E+W   H  S +   E   R  +F+ N+ ++ + NK+ N+ YKL  N+F+D+
Sbjct: 31  EENVWKLYERWRGHHSVS-RASHEAIKRFNVFRHNVLHVHRTNKK-NKPYKLKINRFADI 88

Query: 100 TNDEFRALYTG-----YKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQ 154
           T+ EFR+ Y G     ++M     R   S  F Y+N+  T VP+S+DWR+KGAVT +KNQ
Sbjct: 89  THHEFRSSYAGSNVKHHRMLRGPKRG--SGGFMYENV--TRVPSSVDWREKGAVTEVKNQ 144

Query: 155 KECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQN 214
           ++CG CWAF+ VAAVEGI KIR+  L+ LSEQ+L+DC T  N GC GG  E AF +I  N
Sbjct: 145 QDCGSCWAFSTVAAVEGINKIRTNKLVSLSEQELVDCDTEENQGCAGGLMEPAFEFIKNN 204

Query: 215 QGIATEDEYPYQAVPGTCSAAQKPAA--AKISNYEEVPSGDEQALLKAVSMQPVSIAIAA 272
            GI TE+ YPY +       A         I  +E VP  DE+ LLKAV+ QPVS+AI A
Sbjct: 205 GGIKTEETYPYDSSDVQFCRANSIGGETVTIDGHEHVPENDEEELLKAVAHQPVSVAIDA 264

Query: 273 YSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR 332
            S++FQ Y EG+F G CGTQL+H V IVG+G T++G  YW+++NSWG  WG+ GY++I R
Sbjct: 265 GSSDFQLYSEGVFIGECGTQLNHGVVIVGYGETKNGTKYWIVRNSWGPEWGEGGYVRIER 324

Query: 333 ----DEGLCGIGTRSSYP 346
               +EG CGI   +SYP
Sbjct: 325 GISENEGRCGIAMEASYP 342


>gi|89274062|dbj|BAE80740.1| cysteine proteinase [Platycodon grandiflorus]
          Length = 462

 Score =  294 bits (753), Expect = 4e-77,   Method: Compositional matrix adjust.
 Identities = 149/325 (45%), Positives = 203/325 (62%), Gaps = 10/325 (3%)

Query: 29  ASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRT 88
           A+    S    +  V+ ++E W+ +HG+SY    EKE R +IFK+NL +I++ N E N +
Sbjct: 32  ATHASKSSWRTDDEVMAMYESWLVKHGKSYNALGEKEKRFQIFKDNLRFIDEHNAEENLS 91

Query: 89  YKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAV 148
           YK+G N+F+DLTN+E+R+ Y G K   P      S   +Y       +P S+DWR KGAV
Sbjct: 92  YKVGLNRFADLTNEEYRSTYLGAK-SKPKLSKVKSD--RYAPRVGDSLPESVDWRAKGAV 148

Query: 149 TPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAF 208
            PIK+Q  CG CWAF+ V AVEGI +I +G LI LSEQ+L+DC  + N GC GG  +  F
Sbjct: 149 APIKDQGSCGSCWAFSTVNAVEGINQIVTGELITLSEQELVDCDKSYNEGCDGGLMDYGF 208

Query: 209 AYIIQNQGIATEDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVS 267
            +II N GI T+ +YPY      C   +K A    I +YE+VP  +E+AL KAV+ QPVS
Sbjct: 209 EFIINNGGIDTDKDYPYLGRDARCDQYRKNAKVVTIDSYEDVPVNNEEALKKAVASQPVS 268

Query: 268 IAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGY 327
           + I      FQ Y  GIF G CGT LDH V +VG+G TE G +YW+++NSWG++WG+AGY
Sbjct: 269 VGIEGGGRAFQFYDSGIFTGKCGTALDHGVNVVGYG-TEKGKDYWIVRNSWGSSWGEAGY 327

Query: 328 MKIVRD-----EGLCGIGTRSSYPL 347
           +++ R+      G CGI    SYPL
Sbjct: 328 IRMERNLAGTSVGKCGIAMEPSYPL 352


>gi|356508490|ref|XP_003522989.1| PREDICTED: xylem cysteine proteinase 2-like [Glycine max]
          Length = 349

 Score =  294 bits (753), Expect = 4e-77,   Method: Compositional matrix adjust.
 Identities = 142/309 (45%), Positives = 210/309 (67%), Gaps = 11/309 (3%)

Query: 43  VVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTND 102
           ++E+ E WM++HG+ Y+   EK +R +IFK+NL++I++ NK  +  Y LG N+F+DL++ 
Sbjct: 43  LIELFESWMSKHGKIYQSIEEKLLRFEIFKDNLKHIDERNKVVS-NYWLGLNEFADLSHQ 101

Query: 103 EFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWA 162
           EF+  Y G K+   S R  +   F Y+++   ++P S+DWR KGAV P+KNQ  CG CWA
Sbjct: 102 EFKNKYLGLKVDY-SRRRESPEEFTYKDV---ELPKSVDWRKKGAVAPVKNQGSCGSCWA 157

Query: 163 FAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDE 222
           F+ VAAVEGI +I +GNL  LSEQ+L+DC    NNGC GG  + AF++I++N G+  E++
Sbjct: 158 FSTVAAVEGINQIVTGNLTSLSEQELIDCDRTYNNGCNGGLMDYAFSFIVENGGLHKEED 217

Query: 223 YPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYK 281
           YPY    GTC   ++      IS Y +VP  +EQ+LLKA++ QP+S+AI A   +FQ Y 
Sbjct: 218 YPYIMEEGTCEMTKEETEVVTISGYHDVPQNNEQSLLKALANQPLSVAIEASGRDFQFYS 277

Query: 282 EGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLC 337
            G+F+G CG+ LDH V  VG+GT + G +Y ++KNSWG+ WG+ GY+++ R+    EG+C
Sbjct: 278 GGVFDGHCGSDLDHGVAAVGYGTAK-GVDYIIVKNSWGSKWGEKGYIRMRRNIGKPEGIC 336

Query: 338 GIGTRSSYP 346
           GI   +SYP
Sbjct: 337 GIYKMASYP 345


>gi|37780039|gb|AAP32192.1| cysteine protease 14 [Trifolium repens]
          Length = 351

 Score =  294 bits (753), Expect = 4e-77,   Method: Compositional matrix adjust.
 Identities = 142/310 (45%), Positives = 212/310 (68%), Gaps = 11/310 (3%)

Query: 43  VVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTND 102
           ++E+ E WM++HG+ Y+   EK +R ++FK+NL++I+  NK  +  Y LG N+F+DL++ 
Sbjct: 43  LIELFESWMSRHGKIYETIEEKLLRFEVFKDNLKHIDDRNKIVS-NYWLGLNEFADLSHQ 101

Query: 103 EFRALYTGYKMPSPSHR-STTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCW 161
           EF+  Y G K+     R S+    F Y+++   D+P S+DWR KGAVTP+KNQ +CG CW
Sbjct: 102 EFKNKYLGLKVDLSQRRESSNEEEFTYRDV---DLPKSVDWRKKGAVTPVKNQGQCGSCW 158

Query: 162 AFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATED 221
           AF+ VAAVEGI +I +GNL  LSEQ+L+DC T  NNGC GG  + AF++I QN G+  E+
Sbjct: 159 AFSTVAAVEGINQIVTGNLTSLSEQELIDCDTTYNNGCNGGLMDYAFSFIGQNGGLHKEE 218

Query: 222 EYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSY 280
           +YPY     TC   ++      I+ Y +VP  +EQ+LLKA++ QP+S+AI A S +FQ Y
Sbjct: 219 DYPYIMEESTCEMKKEETQVVTINGYHDVPQNNEQSLLKALANQPLSVAIEASSRDFQFY 278

Query: 281 KEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGL 336
             G+F+G CG+ LDH V+ VG+GT+++  +Y ++KNSWG  WG+ G++++ RD    EG+
Sbjct: 279 SGGVFDGHCGSDLDHGVSAVGYGTSKN-LDYIIVKNSWGAKWGEKGFIRMKRDIGKPEGI 337

Query: 337 CGIGTRSSYP 346
           CG+   +SYP
Sbjct: 338 CGLYKMASYP 347


>gi|190358935|sp|P00785.4|ACTN_ACTCH RecName: Full=Actinidain; Short=Actinidin; AltName: Allergen=Act c
           1; Flags: Precursor
 gi|12744965|gb|AAK06862.1|AF343446_1 actinidin protease [Actinidia chinensis]
          Length = 380

 Score =  294 bits (752), Expect = 5e-77,   Method: Compositional matrix adjust.
 Identities = 153/340 (45%), Positives = 211/340 (62%), Gaps = 10/340 (2%)

Query: 13  INTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFK 72
           ++ + +F    L++S A    +        V  ++E W+ ++G+SY    E E R +IFK
Sbjct: 8   VSMSLLFFSTLLILSLAFNAKNLTQRTNDEVKAMYESWLIKYGKSYNSLGEWERRFEIFK 67

Query: 73  ENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLS 132
           E L +I++ N + NR+YK+G NQF+DLT++EFR+ Y   +  S S+++  S+  +Y+   
Sbjct: 68  ETLRFIDEHNADTNRSYKVGLNQFADLTDEEFRSTYL--RFTSGSNKTKVSN--RYEPRV 123

Query: 133 MTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS 192
              +P+ +DWR  GAV  IK+Q ECG CWAF+A+A VEGI KI +G LI LSEQ+L+DC 
Sbjct: 124 GQVLPSYVDWRSAGAVVDIKSQGECGGCWAFSAIATVEGINKIVTGVLISLSEQELIDCG 183

Query: 193 -TNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSA-AQKPAAAKISNYEEVP 250
            T    GC GG     F +II N GI TE+ YPY A  G C+   Q      I  YE VP
Sbjct: 184 RTQNTRGCNGGYITDGFQFIINNGGINTEENYPYTAQDGECNVDLQNEKYVTIDTYENVP 243

Query: 251 SGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGAN 310
             +E AL  AV+ QPVS+A+ A    F+ Y  GIF G CGT +DHAVTIVG+G TE G +
Sbjct: 244 YNNEWALQTAVTYQPVSVALDAAGDAFKQYSSGIFTGPCGTAVDHAVTIVGYG-TEGGID 302

Query: 311 YWLIKNSWGNTWGDAGYMKIVRD---EGLCGIGTRSSYPL 347
           YW++KNSW  TWG+ GYM+I+R+    G CGI T  SYP+
Sbjct: 303 YWIVKNSWDTTWGEEGYMRILRNVGGAGTCGIATMPSYPV 342


>gi|296081395|emb|CBI16828.3| unnamed protein product [Vitis vinifera]
          Length = 359

 Score =  294 bits (752), Expect = 5e-77,   Method: Compositional matrix adjust.
 Identities = 145/314 (46%), Positives = 204/314 (64%), Gaps = 8/314 (2%)

Query: 40  EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDL 99
           E+S+ +++E+W + H  S +D  EK  R  +FKEN +++ K N+  ++ YKL  N+F+D+
Sbjct: 33  EESLWDLYERWRSYHTVS-RDLEEKNKRFNVFKENTKHVHKVNQM-DKPYKLKLNKFADM 90

Query: 100 TNDEFRALYTGYKMPSPSH-RSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECG 158
           TN EFR+ Y G K+      R     T  + +   T +P S+DWR KGAVT IK+Q +CG
Sbjct: 91  TNHEFRSSYGGSKVKHYRMLRGDRRGTGGFMHEKTTYLPPSVDWRKKGAVTGIKDQGKCG 150

Query: 159 CCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIA 218
            CWAF+ V  VEGI +I++  L+ LSEQQL+DC  + ++GC GG  E AF +I +N GI 
Sbjct: 151 SCWAFSTVVGVEGINQIKTKELLSLSEQQLIDCDRSDDHGCNGGLMESAFEFIKKNGGIT 210

Query: 219 TEDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEF 277
           TE+ YPY+A    C   +  A    I  +E VP  DE+AL+KAV+ QPVS+AI A  ++ 
Sbjct: 211 TENNYPYKAKDERCDMLKMNAPVVTIDGHESVPVNDERALMKAVAHQPVSVAIDAGGSDL 270

Query: 278 QSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD---- 333
           Q Y EG+F+G CGT+LDH V IVG+GTT DG  YW++KNSWG  WG+ GY+++ R     
Sbjct: 271 QFYSEGVFDGECGTELDHGVAIVGYGTTLDGTKYWIVKNSWGAEWGEKGYIRMARGIQAA 330

Query: 334 EGLCGIGTRSSYPL 347
           EG CGI   +SYP+
Sbjct: 331 EGQCGIAMEASYPV 344


>gi|359473128|ref|XP_002285397.2| PREDICTED: vignain-like [Vitis vinifera]
          Length = 357

 Score =  293 bits (751), Expect = 6e-77,   Method: Compositional matrix adjust.
 Identities = 145/314 (46%), Positives = 204/314 (64%), Gaps = 8/314 (2%)

Query: 40  EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDL 99
           E+S+ +++E+W + H  S +D  EK  R  +FKEN +++ K N+  ++ YKL  N+F+D+
Sbjct: 31  EESLWDLYERWRSYHTVS-RDLEEKNKRFNVFKENTKHVHKVNQM-DKPYKLKLNKFADM 88

Query: 100 TNDEFRALYTGYKMPSPSH-RSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECG 158
           TN EFR+ Y G K+      R     T  + +   T +P S+DWR KGAVT IK+Q +CG
Sbjct: 89  TNHEFRSSYGGSKVKHYRMLRGDRRGTGGFMHEKTTYLPPSVDWRKKGAVTGIKDQGKCG 148

Query: 159 CCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIA 218
            CWAF+ V  VEGI +I++  L+ LSEQQL+DC  + ++GC GG  E AF +I +N GI 
Sbjct: 149 SCWAFSTVVGVEGINQIKTKELLSLSEQQLIDCDRSDDHGCNGGLMESAFEFIKKNGGIT 208

Query: 219 TEDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEF 277
           TE+ YPY+A    C   +  A    I  +E VP  DE+AL+KAV+ QPVS+AI A  ++ 
Sbjct: 209 TENNYPYKAKDERCDMLKMNAPVVTIDGHESVPVNDERALMKAVAHQPVSVAIDAGGSDL 268

Query: 278 QSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD---- 333
           Q Y EG+F+G CGT+LDH V IVG+GTT DG  YW++KNSWG  WG+ GY+++ R     
Sbjct: 269 QFYSEGVFDGECGTELDHGVAIVGYGTTLDGTKYWIVKNSWGAEWGEKGYIRMARGIQAA 328

Query: 334 EGLCGIGTRSSYPL 347
           EG CGI   +SYP+
Sbjct: 329 EGQCGIAMEASYPV 342


>gi|255538788|ref|XP_002510459.1| cysteine protease, putative [Ricinus communis]
 gi|223551160|gb|EEF52646.1| cysteine protease, putative [Ricinus communis]
          Length = 422

 Score =  293 bits (751), Expect = 6e-77,   Method: Compositional matrix adjust.
 Identities = 147/340 (43%), Positives = 213/340 (62%), Gaps = 11/340 (3%)

Query: 13  INTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFK 72
           +N      +ITLL      +  S  +    + ++ E W  +HG++Y  + +K  R KIF+
Sbjct: 1   MNFLSALFLITLLFF---NLSISSFSSSSDISKLFESWTKEHGKTYTSKEDKLYRFKIFE 57

Query: 73  ENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLS 132
           EN E+++K N +GN +Y L  N F+DLT+ EF+A   G    S S +  +   F   +  
Sbjct: 58  ENYEFVKKHNSQGNSSYTLSLNAFADLTHHEFKASRLGLSAFSTSGK-LSRRNFPLHDF- 115

Query: 133 MTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS 192
           + DVP S+DWR KGAV+ +K+Q  CG CW+F+A  A+EGI KI +G+L+ LSEQ+L+DC 
Sbjct: 116 VGDVPISIDWRKKGAVSQVKDQGNCGACWSFSATGAIEGINKIVTGSLVSLSEQELVDCD 175

Query: 193 TNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQ-KPAAAKISNYEEVPS 251
            + NNGC GG  + A+ ++I+N GI TE++YPYQA   TC+  + K     I  Y +VP 
Sbjct: 176 RSYNNGCEGGLMDYAYQFVIENNGIDTEEDYPYQAREKTCNKEKLKRHVVTIDGYTDVPQ 235

Query: 252 GDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANY 311
            +E+ LLKAV+ QPVS+ I      FQ Y +GIF G C T LDHAV IVG+G +E+G +Y
Sbjct: 236 NNEKELLKAVAAQPVSVGICGSERAFQLYSKGIFTGPCSTSLDHAVLIVGYG-SENGVDY 294

Query: 312 WLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPL 347
           W++KNSWG  WG  GYM ++R+    +GLCGI   +S+P+
Sbjct: 295 WIVKNSWGTHWGINGYMYMLRNSGNSQGLCGINMLASFPV 334


>gi|32396020|gb|AAP41847.1| senescence-associated cysteine protease [Anthurium andraeanum]
          Length = 460

 Score =  293 bits (751), Expect = 6e-77,   Method: Compositional matrix adjust.
 Identities = 151/325 (46%), Positives = 208/325 (64%), Gaps = 12/325 (3%)

Query: 32  VVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANK-EGNRTYK 90
           +V+ R+  E+ V  ++E W+  +G++Y    EKE R +IF +NL YI+  N+ E N +Y 
Sbjct: 25  IVAERT--EEEVRLLYEGWLVGNGKAYNLLGEKERRFEIFWDNLRYIDDHNRAENNHSYT 82

Query: 91  LGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMT--DVPTSLDWRDKGAV 148
           LG  +F+DLTN+E+R+ Y G K      R    +  + ++LS    D+P  +DWR+KGAV
Sbjct: 83  LGLTRFADLTNEEYRSTYLGVKPGQVRPRRANRAPGRGRDLSANGDDLPQKVDWREKGAV 142

Query: 149 TPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAF 208
            PIK+Q  CG CWAF+ VAAVEGI +I +G+LI LSEQ+L+DC T  N GC GG  + AF
Sbjct: 143 APIKDQGGCGSCWAFSTVAAVEGINQIVTGDLIVLSEQELVDCDTAYNEGCNGGLMDYAF 202

Query: 209 AYIIQNQGIATEDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVS 267
            +II N GI TE++YPY+   G C   +K A    I +YE+V   DE AL  AV+ QPVS
Sbjct: 203 QFIISNGGIDTEEDYPYKERDGLCDPNRKNAKVVSIDSYEDVLENDEHALKTAVAHQPVS 262

Query: 268 IAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGY 327
           +AI      FQ YK GIF+G CG  LDH V  VG+G TE G +YW+++NSWG +WG+AGY
Sbjct: 263 VAIEGGGRSFQLYKSGIFDGRCGIDLDHGVVAVGYG-TESGKDYWIVRNSWGKSWGEAGY 321

Query: 328 MKIVRD-----EGLCGIGTRSSYPL 347
           +++ R+      G CGI    SYP+
Sbjct: 322 IRMERNLPSSSSGKCGIAIEPSYPI 346


>gi|535473|emb|CAA53377.1| cysteine protease [Vicia sativa]
          Length = 368

 Score =  293 bits (751), Expect = 6e-77,   Method: Compositional matrix adjust.
 Identities = 148/337 (43%), Positives = 208/337 (61%), Gaps = 11/337 (3%)

Query: 18  MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
            F +IT  ++   Q+ + RS  E  V+ ++E+W+ +H + Y    EK+ R +IFK+NL +
Sbjct: 12  FFSLITFSLALDIQLPTGRSNDE--VMTMYEEWLVKHQKVYNGLREKDQRFQIFKDNLNF 69

Query: 78  IEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSST-FKYQNLSMTDV 136
           I++ N + N TY +G N+F+D+TN+E+R +Y G +            T  +Y   S   +
Sbjct: 70  IDEHNAQ-NYTYIVGLNKFADMTNEEYRDMYLGTRSDIKRRIMKNKITGHRYAYNSGDRL 128

Query: 137 PTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGN 196
           P  +DWR KGA+T IK+Q  CG CWAF+ +A VE I KI +G L+ LSEQ+L+DC    N
Sbjct: 129 PVHVDWRLKGAITHIKDQGSCGSCWAFSTIATVEAINKIVTGKLVSLSEQELVDCDRAFN 188

Query: 197 NGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAA-AKISNYEEVPSGDEQ 255
            GC GG  + AF +II N GI T+  YPY+   G C   +K A    I  YE+VPS +E 
Sbjct: 189 EGCNGGLMDYAFEFIIGNGGIDTDQHYPYKGFEGRCDPTRKKAKIVSIDGYEDVPSNNEN 248

Query: 256 ALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIK 315
           AL KAV+ QPVS+AI A     Q Y+ G+F G CGT LDHAV IVG+G +E+G +YWL++
Sbjct: 249 ALKKAVAHQPVSVAIEASGRALQLYQSGVFTGKCGTSLDHAVVIVGYG-SENGLDYWLVR 307

Query: 316 NSWGNTWGDAGYMKIVRD-----EGLCGIGTRSSYPL 347
           NSWG  WG+ GY K+ R+      G CGI   +SYP+
Sbjct: 308 NSWGTNWGEDGYFKMERNVKGTHTGKCGIAVEASYPV 344


>gi|168057475|ref|XP_001780740.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162667829|gb|EDQ54449.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 463

 Score =  293 bits (751), Expect = 6e-77,   Method: Compositional matrix adjust.
 Identities = 139/314 (44%), Positives = 204/314 (64%), Gaps = 13/314 (4%)

Query: 40  EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDL 99
           + +++++  +W+  H R Y+   EK  R +IFKEN  YI   NK+  ++Y LG N+FSDL
Sbjct: 42  DDAILDVFHQWLETHSRVYRSLSEKHHRFQIFKENFLYIHAHNKQ-QKSYWLGLNKFSDL 100

Query: 100 TNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGC 159
           T+ EFRA Y G K   P +R    + F Y+++   +    +DWR KGAVT +K+Q  CG 
Sbjct: 101 THQEFRAQYLGTK---PVNRQRKEANFMYEDV---EAEPKVDWRLKGAVTDVKDQGACGS 154

Query: 160 CWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIAT 219
           CWAF+AV +VEG+  I++G L+ LSEQ+L+DC    N GC GG  + AF +II+N GI T
Sbjct: 155 CWAFSAVGSVEGVNAIKTGELVSLSEQELVDCDRKQNQGCNGGLMDYAFEFIIKNGGIDT 214

Query: 220 EDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQ 278
           E +YPY+A  G C   ++ +    I +Y++VP+  E AL+KA++  PVS+AI A   +FQ
Sbjct: 215 EKDYPYKARDGRCDEGRRNSKVVVIDDYQDVPTQSESALMKALTKNPVSVAIEAGGRDFQ 274

Query: 279 SYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR-----D 333
            Y+ G+F G CG++LDH V  VG+GT +DG NYW++KNSWG  WG+ GY+++ R      
Sbjct: 275 HYQGGVFTGPCGSELDHGVLAVGYGTDDDGVNYWIVKNSWGPGWGEKGYIRMERFGSDST 334

Query: 334 EGLCGIGTRSSYPL 347
           +G CGI   +S+P+
Sbjct: 335 DGKCGINIEASFPI 348


>gi|50355619|dbj|BAD29958.1| cysteine protease [Daucus carota]
          Length = 496

 Score =  293 bits (751), Expect = 7e-77,   Method: Compositional matrix adjust.
 Identities = 142/313 (45%), Positives = 200/313 (63%), Gaps = 7/313 (2%)

Query: 40  EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDL 99
           +     + E W+  HG+SY    E+E R +IFK NL YI++ N   +R +KLG N+F+DL
Sbjct: 38  DDEATTLFESWLVTHGKSYNALGEEEKRFQIFKNNLRYIDEQNLVEDRGFKLGLNKFADL 97

Query: 100 TNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGC 159
           TN+E+R+ YTG K      +  ++ + +Y  LS   +P S+DWR+ GAV  +K+Q  CG 
Sbjct: 98  TNEEYRSKYTGIK-SKDLRKKVSAKSGRYATLSGESLPESVDWRESGAVATVKDQGSCGS 156

Query: 160 CWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIAT 219
           CWAF+ ++AVEGI +I +G LI LSEQ+L+DC  + N GC GG  + AF +II N GI T
Sbjct: 157 CWAFSTISAVEGINQIATGKLITLSEQELVDCDRSYNEGCNGGLMDYAFEFIINNGGIDT 216

Query: 220 EDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQ 278
           + +YPY    G C   +K A    I +YE+VP+ DE AL KA + QP+S+AI A   +FQ
Sbjct: 217 DVDYPYTGRDGKCDQYRKNAKVVTIDSYEDVPAYDELALKKAAANQPISVAIEASGRDFQ 276

Query: 279 SYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR----DE 334
            Y  GIF G CG  LDH V +VG+G TE+G +YW+++NSWG  WG+ GY+++ R      
Sbjct: 277 FYDSGIFTGKCGIALDHGVVVVGYG-TENGKDYWIVRNSWGADWGENGYLRMERGISSKT 335

Query: 335 GLCGIGTRSSYPL 347
           G+CGI    SYP+
Sbjct: 336 GICGIAIEPSYPV 348


>gi|146215976|gb|ABQ10190.1| actinidin Act1b [Actinidia arguta]
          Length = 380

 Score =  293 bits (751), Expect = 7e-77,   Method: Compositional matrix adjust.
 Identities = 152/335 (45%), Positives = 209/335 (62%), Gaps = 10/335 (2%)

Query: 18  MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
           +F    L++S A    +        +  ++E W+ ++G+SY    E E R +IFKE L +
Sbjct: 13  LFFSTLLVLSLAFNAKNLTKRTNDELKAMYESWLTKYGKSYNSLGEWERRFEIFKETLRF 72

Query: 78  IEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVP 137
           I++ N + NR+Y++G NQF+D TN+EF++ Y G+   S S++   S+  +Y+      +P
Sbjct: 73  IDEHNADTNRSYRVGLNQFADQTNEEFQSTYLGFT--SGSNKMKVSN--RYEPRVGQVLP 128

Query: 138 TSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS-TNGN 196
             +DWR  GAV  IK+Q +CG CWAF+A+A VEGI KI +G+LI LSEQ+L+DC  T   
Sbjct: 129 DYVDWRSAGAVVDIKSQGQCGSCWAFSAIATVEGINKIVTGDLISLSEQELVDCGRTQNT 188

Query: 197 NGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSA-AQKPAAAKISNYEEVPSGDEQ 255
            GC GGS    F +II N GI TE  YPY A  G C+   Q    A I  YE VP  +E 
Sbjct: 189 RGCDGGSITDGFQFIINNGGINTEANYPYTAEDGQCNLDLQNEKYASIDTYENVPYNNEW 248

Query: 256 ALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIK 315
           AL  AV+ QPVS+A+ A    FQ Y  GIF G CGT +DHAVTIVG+G TE G +YW++K
Sbjct: 249 ALQTAVAYQPVSVALEAAGDAFQHYSSGIFTGPCGTAVDHAVTIVGYG-TEGGIDYWIVK 307

Query: 316 NSWGNTWGDAGYMKIVRD---EGLCGIGTRSSYPL 347
           NSW  TWG+ GY++I+R+    G CGI T+ SYP+
Sbjct: 308 NSWDTTWGEEGYIRILRNVGGAGTCGIATKPSYPV 342


>gi|115441717|ref|NP_001045138.1| Os01g0907600 [Oryza sativa Japonica Group]
 gi|5761329|dbj|BAA83473.1| cysteine endopeptidase [Oryza sativa]
 gi|20804884|dbj|BAB92565.1| cysteine endopeptidase [Oryza sativa Japonica Group]
 gi|56785107|dbj|BAD82745.1| putative cysteine proteinase [Oryza sativa Japonica Group]
 gi|113534669|dbj|BAF07052.1| Os01g0907600 [Oryza sativa Japonica Group]
 gi|119395242|gb|ABL74582.1| cysteine endopeptidase [Oryza sativa Japonica Group]
 gi|125528777|gb|EAY76891.1| hypothetical protein OsI_04850 [Oryza sativa Indica Group]
 gi|125573036|gb|EAZ14551.1| hypothetical protein OsJ_04473 [Oryza sativa Japonica Group]
          Length = 371

 Score =  293 bits (751), Expect = 7e-77,   Method: Compositional matrix adjust.
 Identities = 144/330 (43%), Positives = 199/330 (60%), Gaps = 13/330 (3%)

Query: 28  CASQVVSSRSTH-EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGN 86
           CA+     R    ++++ +++E+W   H    +   EK  R   FK+N+ YI + NK G 
Sbjct: 26  CAAIPFDERDLESDEALWDLYERWQEHH-HVPRHHGEKHRRFGAFKDNVRYIHEHNKRGG 84

Query: 87  RTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSST---FKYQNLSMTDVPTSLDWR 143
           R Y+L  N+F D+  +EFRA + G            +     F Y+ +   D+P ++DWR
Sbjct: 85  RGYRLRLNRFGDMGREEFRATFAGSHANDLRRDGLAAPPLPGFMYEGVR--DLPRAVDWR 142

Query: 144 DKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGS 203
            KGAVT +K+Q +CG CWAF+ V +VEGI  IR+G L+ LSEQ+L+DC T  N+GC GG 
Sbjct: 143 RKGAVTGVKDQGKCGSCWAFSTVVSVEGINAIRTGRLVSLSEQELIDCDTADNSGCQGGL 202

Query: 204 REKAFAYIIQNQGIATEDEYPYQAVPGTCSA--AQKPAAAKISNYEEVPSGDEQALLKAV 261
            E AF YI  + GI TE  YPY+A  GTC A  A++     I  ++ VP+  E AL KAV
Sbjct: 203 MENAFEYIKHSGGITTESAYPYRAANGTCDAVRARRAPLVVIDGHQNVPANSEAALAKAV 262

Query: 262 SMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNT 321
           + QPVS+AI A    FQ Y +G+F G CGT LDH V +VG+G T DG  YW++KNSWG  
Sbjct: 263 ANQPVSVAIDAGDQSFQFYSDGVFAGDCGTDLDHGVAVVGYGETNDGTEYWIVKNSWGTA 322

Query: 322 WGDAGYMKIVRDE----GLCGIGTRSSYPL 347
           WG+ GY+++ RD     GLCGI   +SYP+
Sbjct: 323 WGEGGYIRMQRDSGYDGGLCGIAMEASYPV 352


>gi|194352756|emb|CAQ00106.1| papain-like cysteine proteinase [Hordeum vulgare subsp. vulgare]
          Length = 476

 Score =  293 bits (750), Expect = 8e-77,   Method: Compositional matrix adjust.
 Identities = 142/320 (44%), Positives = 206/320 (64%), Gaps = 14/320 (4%)

Query: 40  EQSVVEIHEKWMAQHGRSYKDEL----EKEMRLKIFKENLEYIEKANKE---GNRTYKLG 92
           E     +++ W+A+HG           ++E R   F +NL +++  N     G   ++L 
Sbjct: 45  EAEARAVYDLWLAEHGGGSSPNANSIADRERRFSAFWDNLRFVDAHNARAAAGEEGFRLA 104

Query: 93  TNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIK 152
            N+F+DLTNDEFRA Y G K  +  +R+      +Y++    ++P ++DWR+KGAV P+K
Sbjct: 105 MNRFADLTNDEFRAAYLGVKGAAERNRAGRVVGDRYRHDGAEELPEAVDWREKGAVAPVK 164

Query: 153 NQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-NNGCLGGSREKAFAYI 211
           NQ +CG CWAF+AV+ VE I +I +G ++ LSEQ+L++C  NG ++GC GG  + AF +I
Sbjct: 165 NQGQCGSCWAFSAVSTVESINQIVTGEMVTLSEQELVECDINGQSSGCNGGLMDDAFEFI 224

Query: 212 IQNQGIATEDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVSIAI 270
           I+N GI TED+YPY+AV G C   +K A    I  +E+VP  DE++L KAV+  PVS+AI
Sbjct: 225 IKNGGIDTEDDYPYKAVDGRCDVLRKNAKVVSIDGFEDVPENDEKSLQKAVAHHPVSVAI 284

Query: 271 AAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKI 330
            A   EFQ Y  G+F+G CGTQLDH V  VG+G TE+G +YW+++NSWG  WG+AGY+++
Sbjct: 285 EAGGREFQLYHSGVFSGRCGTQLDHGVVAVGYG-TENGKDYWIVRNSWGPNWGEAGYLRM 343

Query: 331 VRD----EGLCGIGTRSSYP 346
            R+     G CGI   SSYP
Sbjct: 344 ERNINVTSGKCGIAMMSSYP 363


>gi|326507362|dbj|BAK03074.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 473

 Score =  293 bits (750), Expect = 9e-77,   Method: Compositional matrix adjust.
 Identities = 142/320 (44%), Positives = 206/320 (64%), Gaps = 14/320 (4%)

Query: 40  EQSVVEIHEKWMAQHGRSYKDEL----EKEMRLKIFKENLEYIEKANKE---GNRTYKLG 92
           E     +++ W+A+HG           ++E R   F +NL +++  N     G   ++L 
Sbjct: 45  EAEARAVYDLWLAEHGGGSSPNANSIADRERRFSAFWDNLRFVDAHNARAAAGEEGFRLA 104

Query: 93  TNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIK 152
            N+F+DLTNDEFRA Y G K  +  +R+      +Y++    ++P ++DWR+KGAV P+K
Sbjct: 105 MNRFADLTNDEFRAAYLGVKGAAERNRAGRVVGERYRHDGAEELPEAVDWREKGAVAPVK 164

Query: 153 NQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-NNGCLGGSREKAFAYI 211
           NQ +CG CWAF+AV+ VE I +I +G ++ LSEQ+L++C  NG ++GC GG  + AF +I
Sbjct: 165 NQGQCGSCWAFSAVSTVESINQIVTGEMVTLSEQELVECDINGQSSGCNGGLMDDAFEFI 224

Query: 212 IQNQGIATEDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVSIAI 270
           I+N GI TED+YPY+AV G C   +K A    I  +E+VP  DE++L KAV+  PVS+AI
Sbjct: 225 IKNGGIDTEDDYPYKAVDGRCDVLRKNAKVVSIDGFEDVPENDEKSLQKAVAHHPVSVAI 284

Query: 271 AAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKI 330
            A   EFQ Y  G+F+G CGTQLDH V  VG+G TE+G +YW+++NSWG  WG+AGY+++
Sbjct: 285 EAGGREFQLYHSGVFSGRCGTQLDHGVVAVGYG-TENGKDYWIVRNSWGPNWGEAGYLRM 343

Query: 331 VRD----EGLCGIGTRSSYP 346
            R+     G CGI   SSYP
Sbjct: 344 ERNINVTSGKCGIAMMSSYP 363


>gi|204307508|gb|ACI00280.1| triticain beta 2 [Hordeum vulgare]
          Length = 473

 Score =  293 bits (750), Expect = 9e-77,   Method: Compositional matrix adjust.
 Identities = 142/320 (44%), Positives = 206/320 (64%), Gaps = 14/320 (4%)

Query: 40  EQSVVEIHEKWMAQHGRSYKDEL----EKEMRLKIFKENLEYIEKANKE---GNRTYKLG 92
           E     +++ W+A+HG           ++E R   F +NL +++  N     G   ++L 
Sbjct: 45  EAEARAVYDLWLAEHGGGSSPNANSIADRERRFSAFWDNLRFVDAHNARAAAGEEGFRLA 104

Query: 93  TNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIK 152
            N+F+DLTNDEFRA Y G K  +  +R+      +Y++    ++P ++DWR+KGAV P+K
Sbjct: 105 MNRFADLTNDEFRAAYLGVKGAAERNRAGRVVGERYRHDGAEELPEAVDWREKGAVAPVK 164

Query: 153 NQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-NNGCLGGSREKAFAYI 211
           NQ +CG CWAF+AV+ VE I +I +G ++ LSEQ+L++C  NG ++GC GG  + AF +I
Sbjct: 165 NQGQCGSCWAFSAVSTVESINQIVTGEMVTLSEQELVECDINGQSSGCNGGLMDDAFEFI 224

Query: 212 IQNQGIATEDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVSIAI 270
           I+N GI TED+YPY+AV G C   +K A    I  +E+VP  DE++L KAV+  PVS+AI
Sbjct: 225 IKNGGIDTEDDYPYKAVDGRCDVLRKNAKVVSIDGFEDVPENDEKSLQKAVAHHPVSVAI 284

Query: 271 AAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKI 330
            A   EFQ Y  G+F+G CGTQLDH V  VG+G TE+G +YW+++NSWG  WG+AGY+++
Sbjct: 285 EAGGREFQLYHSGVFSGRCGTQLDHGVVAVGYG-TENGKDYWIVRNSWGPNWGEAGYLRM 343

Query: 331 VRD----EGLCGIGTRSSYP 346
            R+     G CGI   SSYP
Sbjct: 344 ERNINVTSGKCGIAMMSSYP 363


>gi|242071345|ref|XP_002450949.1| hypothetical protein SORBIDRAFT_05g021550 [Sorghum bicolor]
 gi|241936792|gb|EES09937.1| hypothetical protein SORBIDRAFT_05g021550 [Sorghum bicolor]
          Length = 371

 Score =  293 bits (749), Expect = 1e-76,   Method: Compositional matrix adjust.
 Identities = 145/321 (45%), Positives = 204/321 (63%), Gaps = 16/321 (4%)

Query: 40  EQSVVEIHEKW----MAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQ 95
           E+S+  ++E+W    M       +++ +K     +FKEN+ YI +ANK+G R+++L  N+
Sbjct: 35  EESLRALYEQWRSHYMVSRPAGLQEQDDKARWFNVFKENVRYIHEANKKG-RSFRLALNK 93

Query: 96  FSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMT-----DVPTSLDWRDKGAVTP 150
           F+D+T DEFR  Y      +  HR+ +S   ++ + S       ++P ++DWR +GAVT 
Sbjct: 94  FADMTTDEFRRAYAAGSR-TRHHRALSSGIRRHGDGSFMYAQAGNLPLAVDWRQRGAVTG 152

Query: 151 IKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAY 210
           IK+Q +CG CWAF+ +AAVEGI KIR+G L+ LSEQ+L+DC    N GC GG  + AF Y
Sbjct: 153 IKDQGQCGSCWAFSTIAAVEGINKIRTGKLVSLSEQELVDCDDVDNQGCNGGLMDYAFQY 212

Query: 211 IIQNQGIATEDEYPYQAVPGTCS-AAQKPAAAKISNYEEVPSGDEQALLKAVSMQPVSIA 269
           I +N GI TE  YPY A   +C+ A ++     I  YE+VP+ +E AL KAV+ QPVSIA
Sbjct: 213 IKRNGGITTESNYPYLAEQRSCNKAKERSHDVTIDGYEDVPANNEDALQKAVANQPVSIA 272

Query: 270 IAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMK 329
           I A   +FQ Y EG+F G CGT+LDH V  VG+G T DG  YW++KNSWG  WG+ GY++
Sbjct: 273 IEASGQDFQFYSEGVFTGSCGTELDHGVAAVGYGITRDGTKYWIVKNSWGEDWGERGYIR 332

Query: 330 IVR----DEGLCGIGTRSSYP 346
           + R     +GLCGI    SYP
Sbjct: 333 MQRGISDSQGLCGIAMEPSYP 353


>gi|374713651|gb|AEZ65083.1| cysteine protease [Carica papaya]
          Length = 467

 Score =  293 bits (749), Expect = 1e-76,   Method: Compositional matrix adjust.
 Identities = 150/348 (43%), Positives = 216/348 (62%), Gaps = 21/348 (6%)

Query: 18  MFIIITLLVSCASQ--VVSSRSTH--------EQSVVEIHEKWMAQHGRSYKDELEKEMR 67
           +F+++    S A    +VS    H        +  V+ ++E W+ +HG++Y    EKE R
Sbjct: 10  LFLLMIFTASSAVDMSIVSYDQRHADKSSWRTDDEVMAMYEAWLVKHGKAYNALGEKEKR 69

Query: 68  LKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSH--RSTTSST 125
             IFK+NL +I++ N + N TY+LG N+F+DLTN+E+R++Y G K P  +   R  +  +
Sbjct: 70  FGIFKDNLRFIDEHNSQ-NLTYRLGLNRFADLTNEEYRSMYLGVK-PGATRVTRKVSRKS 127

Query: 126 FKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSE 185
            ++       +P  +DWR +GAV  +K+Q  CG CWAF+ +AAVEGI +I +G+LI LSE
Sbjct: 128 DRFAARVGDALPDFIDWRKEGAVVGVKDQGSCGSCWAFSTIAAVEGINQIVTGDLISLSE 187

Query: 186 QQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPA-AAKIS 244
           Q+L+DC T+ N GC GG  + AF +II N GI +E++YPY+A    C   +K A    I 
Sbjct: 188 QELVDCDTSYNEGCNGGLMDYAFEFIINNGGIDSEEDYPYRAADQKCDQYRKNANVVSID 247

Query: 245 NYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGT 304
            YE+VP  DE AL KAV+ QPVS+AI A    FQ Y+ G+F G CGT LDH V  VG+G 
Sbjct: 248 GYEDVPENDEAALKKAVAKQPVSVAIEAGGRAFQLYQSGVFTGKCGTSLDHGVAAVGYG- 306

Query: 305 TEDGANYWLIKNSWGNTWGDAGYMKIVRD-----EGLCGIGTRSSYPL 347
           TE+G +YW++ NSWG  WG+ GY+++ R+      G CGI    SYP+
Sbjct: 307 TENGQDYWIVGNSWGKNWGEDGYIRMERNLAGSSSGKCGIAIGPSYPI 354


>gi|374530932|gb|AEP83812.2| cysteine endopeptidase EP8 [Secale cereale x Triticum durum]
          Length = 364

 Score =  293 bits (749), Expect = 1e-76,   Method: Compositional matrix adjust.
 Identities = 146/317 (46%), Positives = 203/317 (64%), Gaps = 14/317 (4%)

Query: 40  EQSVVEIHEKWMAQHGRSYKD--ELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFS 97
           E+++  ++E+W + +  S +      +E R  +FKEN  YI + NK+ +R ++L  N+F+
Sbjct: 33  EENLRGLYERWRSHYTVSRRGLGADAEERRFNVFKENARYIHEGNKK-DRPFRLALNKFA 91

Query: 98  DLTNDEFRALYTGYKMP---SPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQ 154
           D+T DEFR  Y G ++    S S       +F+Y +    ++P ++DWR KGAVT IK+Q
Sbjct: 92  DMTTDEFRRTYAGSRVRHHLSLSGGRRGDGSFRYGDAD--NLPPAVDWRQKGAVTAIKDQ 149

Query: 155 KECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQN 214
            +CG CWAF+ + AVEGI KIR+G L+ LSEQ+L+DC    N GC GG  + AF +I +N
Sbjct: 150 GQCGSCWAFSTIVAVEGINKIRTGKLVSLSEQELMDCDNVNNQGCDGGLMDYAFQFIHKN 209

Query: 215 QGIATEDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAY 273
            GI TE  YPYQ   G+C  A++ A A  I  YE+VP+ DE AL KAV+ QPVS+AI A 
Sbjct: 210 -GITTESNYPYQGEQGSCDLAKEKAHAVTIDGYEDVPANDESALQKAVAGQPVSVAIDAS 268

Query: 274 STEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD 333
             +FQ Y EG+F G C T LDH V  VG+GTT DG  YW++KNSWG  WG+ GY+++ R 
Sbjct: 269 GNDFQFYSEGVFTGECSTDLDHGVAAVGYGTTRDGTKYWIVKNSWGEDWGEKGYIRMQRG 328

Query: 334 ----EGLCGIGTRSSYP 346
               EG CGI  ++SYP
Sbjct: 329 VSQAEGQCGIAMQASYP 345


>gi|414879123|tpg|DAA56254.1| TPA: hypothetical protein ZEAMMB73_708930 [Zea mays]
          Length = 368

 Score =  293 bits (749), Expect = 1e-76,   Method: Compositional matrix adjust.
 Identities = 141/316 (44%), Positives = 197/316 (62%), Gaps = 12/316 (3%)

Query: 40  EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDL 99
           ++++ +++E+W   H R ++   EK  R   FKEN  +I   NK G+R Y+L  N+F D+
Sbjct: 35  DEALWDLYERWQTHH-RVHRHHGEKGRRFGTFKENARFIHAHNKRGDRPYRLRLNRFGDM 93

Query: 100 TNDEFRALYTGYKMPSPSHRSTTSST---FKYQNLSMTDVPTSLDWRDKGAVTPIKNQKE 156
             +EFR+ +   ++       T +     F Y +   TD+P S+DWR KGAVT +KNQ  
Sbjct: 94  GREEFRSGFADSRINDLRREPTAAPAVPGFMYDD--ATDLPRSVDWRQKGAVTAVKNQGR 151

Query: 157 CGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQG 216
           CG CWAF+ V AVEGI  IR+G+L+ LSEQ+L+DC T+  NGC GG  E AF +I  + G
Sbjct: 152 CGSCWAFSTVVAVEGINAIRTGSLVSLSEQELIDCDTD-ENGCQGGLMENAFEFIKSHGG 210

Query: 217 IATEDEYPYQAVPGTCSAAQ--KPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYS 274
           I TE  YPY A  GTC  A+  +     I  ++ VP+G E AL KAV+ QPVS+AI A  
Sbjct: 211 ITTESAYPYHASNGTCDGARARRGRVVAIDGHQAVPAGSEDALAKAVAHQPVSVAIDAGG 270

Query: 275 TEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR-- 332
              Q Y EG+F G CGT LDH V  VG+G ++DG  YW++KNSWG +WG+ GY+++ R  
Sbjct: 271 QALQFYSEGVFTGDCGTDLDHGVAAVGYGVSDDGTPYWIVKNSWGPSWGEGGYIRMQRGT 330

Query: 333 -DEGLCGIGTRSSYPL 347
            + GLCGI   +S+P+
Sbjct: 331 GNGGLCGIAMEASFPI 346


>gi|224065647|ref|XP_002301901.1| predicted protein [Populus trichocarpa]
 gi|222843627|gb|EEE81174.1| predicted protein [Populus trichocarpa]
          Length = 336

 Score =  293 bits (749), Expect = 1e-76,   Method: Compositional matrix adjust.
 Identities = 146/314 (46%), Positives = 209/314 (66%), Gaps = 10/314 (3%)

Query: 38  THEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFS 97
           T    ++++ E W+++HG+ Y+   EK +R +IFK+NL +I++ NK+    Y LG N+FS
Sbjct: 24  TSGDKIIDLFESWISKHGKIYESIEEKWLRFEIFKDNLFHIDETNKK-VVNYWLGLNEFS 82

Query: 98  DLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKEC 157
           DL+++EF+  Y G K+   S R   S  F Y+++    +P S+DWR KGAVT +KNQ  C
Sbjct: 83  DLSHEEFKNKYLGLKV-DMSERRECSQEFNYKDV--MSIPKSVDWRKKGAVTDVKNQGSC 139

Query: 158 GCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGI 217
           G CWAF+ VAAVEGI +I +GNL  LSEQ+L+DC T  N GC GG  + AF+YII N G+
Sbjct: 140 GSCWAFSTVAAVEGINQIVTGNLTSLSEQELVDCDTTNNYGCNGGLMDYAFSYIISNGGL 199

Query: 218 ATEDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTE 276
             E +YPY    GTC   ++ +    IS Y +VP   E++LLKA++ QP+S+AI A   +
Sbjct: 200 HKEVDYPYIMEEGTCEMRKEESEVVTISGYHDVPQNSEESLLKALANQPLSVAIEASGRD 259

Query: 277 FQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD--- 333
           FQ Y  G+F+G CGTQLDH V  VG+G+T +G +Y ++KNSWG+ WG+ GY+++ R+   
Sbjct: 260 FQFYSGGVFDGHCGTQLDHGVAAVGYGST-NGLDYIIVKNSWGSKWGEKGYIRMKRNTGK 318

Query: 334 -EGLCGIGTRSSYP 346
             GLCGI   +SYP
Sbjct: 319 PAGLCGINKMASYP 332


>gi|302763831|ref|XP_002965337.1| hypothetical protein SELMODRAFT_230602 [Selaginella moellendorffii]
 gi|300167570|gb|EFJ34175.1| hypothetical protein SELMODRAFT_230602 [Selaginella moellendorffii]
          Length = 343

 Score =  293 bits (749), Expect = 1e-76,   Method: Compositional matrix adjust.
 Identities = 151/337 (44%), Positives = 212/337 (62%), Gaps = 17/337 (5%)

Query: 21  IITLLVSCASQVVSSR--STHEQSVVEIH---EKWMAQHGRSYKDELEKEMRLKIFKENL 75
           +I L+V  A+    +R  +  +   +EI    E W A+HG+SY  +LEK  RL IF + L
Sbjct: 10  LILLVVVGATPFAIARPAALEDGRALEIKNMFEDWAAKHGKSYSSDLEKARRLMIFSDTL 69

Query: 76  EYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTG-YKMPSPSHRSTTSSTFKYQNLSMT 134
            YIEK N + N T+ LG N+FSDLTN EFRA++ G +K P    R         +++ ++
Sbjct: 70  AYIEKHNAQPNTTFTLGLNKFSDLTNAEFRAMHVGKFKRPRYQDRLPAED----EDVDVS 125

Query: 135 DVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN 194
            +PTSLDWR KGAVTPIK+Q +CG CWAF+A+A++E    + +  L+ LSEQQL+DC T 
Sbjct: 126 SLPTSLDWRQKGAVTPIKDQGDCGSCWAFSAIASIESAHFLATKELVSLSEQQLMDCDTV 185

Query: 195 GNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSA---AQKPAAAKISNYEEVPS 251
            + GC GG  E AF ++++N G+ TE  YPY    G+C+A   A     A+I+ ++ V  
Sbjct: 186 -DAGCDGGLMETAFKFVVKNGGVTTEASYPYTGSVGSCNANKVAIINKVAEITGFKVVTE 244

Query: 252 GDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANY 311
               AL+KAVS  PV+++I      FQ+YK GI +G CG  LDH V ++G+G TE G  Y
Sbjct: 245 DSADALMKAVSKTPVTVSICGSDENFQNYKSGILSGQCGDSLDHGVLLIGYG-TEGGMPY 303

Query: 312 WLIKNSWGNTWGDAGYMKIVRD--EGLCGIGTRSSYP 346
           W+IKNSWG +WG+ G+MKI R   +G+CG+   SSYP
Sbjct: 304 WIIKNSWGTSWGEDGFMKIERKDGDGICGMNGDSSYP 340


>gi|156142226|gb|ABU51882.1| ervatamin-C precursor [Tabernaemontana divaricata]
          Length = 365

 Score =  292 bits (748), Expect = 1e-76,   Method: Compositional matrix adjust.
 Identities = 151/316 (47%), Positives = 208/316 (65%), Gaps = 12/316 (3%)

Query: 34  SSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGT 93
           SS    ++ V EI+E W+A+H + Y   +E E R +IFK+NL++I++ N E N TYK+G 
Sbjct: 32  SSAWRTDEEVKEIYELWLAKHDKVYSGLVEYEKRFEIFKDNLKFIDEHNSE-NHTYKMGL 90

Query: 94  NQFSDLTNDEFRALYTGYKMPSPSH-RSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIK 152
             ++DLTN+EF+A+Y G +  +    + T + + +Y   +  ++P  +DWR KGAVTP+K
Sbjct: 91  TPYTDLTNEEFQAIYLGTRSDTIHRLKRTINISERYAYEAGDNLPEQIDWRKKGAVTPVK 150

Query: 153 NQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYII 212
           NQ +CG CWAF+ V+ VE I +IR+GNLI LSEQQL+DC+   N+GC GG+   A+ YII
Sbjct: 151 NQGKCGSCWAFSTVSTVESINQIRTGNLISLSEQQLVDCNKK-NHGCKGGAFVYAYQYII 209

Query: 213 QNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAA 272
            N GI TE  YPY+AV G C AA+K    +I  Y+ VP  +E AL KAV+ QP  +AI A
Sbjct: 210 DNGGIDTEANYPYKAVQGPCRAAKK--VVRIDGYKGVPHCNENALKKAVASQPSVVAIDA 267

Query: 273 YSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGY--MKI 330
            S +FQ YK GIF+G CGT+L+H V IVG+       +YW+++NSWG  WG+ GY  MK 
Sbjct: 268 SSKQFQHYKSGIFSGPCGTKLNHGVVIVGYWK-----DYWIVRNSWGRYWGEQGYIRMKR 322

Query: 331 VRDEGLCGIGTRSSYP 346
           V   GLCGI     YP
Sbjct: 323 VGGCGLCGIARLPYYP 338


>gi|160858205|dbj|BAF93840.1| triticain beta 2 [Triticum aestivum]
          Length = 469

 Score =  292 bits (748), Expect = 1e-76,   Method: Compositional matrix adjust.
 Identities = 145/320 (45%), Positives = 209/320 (65%), Gaps = 16/320 (5%)

Query: 40  EQSVVEIHEKWMAQHGR-SYKDE---LEKEMRLKIFKENLEYIEKANKE---GNRTYKLG 92
           E     +++ W+A+HG  SY +     E+E R + F +NL +++  N     G   ++L 
Sbjct: 43  EAEARAVYDLWLAEHGGGSYPNANSIPERERRFRAFWDNLRFVDAHNARAAAGEEGFRLA 102

Query: 93  TNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIK 152
            N+F+DLTNDEFRA Y G K      R       +Y++    ++P ++DWR+KGAV P+K
Sbjct: 103 MNRFADLTNDEFRAAYLGVK--GQRARPGRVVGERYRHDGAEELPEAVDWREKGAVAPVK 160

Query: 153 NQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-NNGCLGGSREKAFAYI 211
           NQ +CG CWAF+A++ VE I +I +G ++ LSEQ+L++C TNG ++GC GG  + AF +I
Sbjct: 161 NQGQCGSCWAFSAISTVESINQIVTGEMVTLSEQELVECDTNGQSSGCNGGLMDDAFEFI 220

Query: 212 IQNQGIATEDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVSIAI 270
           I+N GI TED+YPY+A+ G C   +K A    I  +E+VP  DE++L KAV+ QPVS+AI
Sbjct: 221 IKNGGIDTEDDYPYKAIDGRCDVLRKNAKVVSIDGFEDVPENDEKSLQKAVAHQPVSVAI 280

Query: 271 AAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKI 330
            A   EFQ Y  G+F+G CGTQLDH V  VG+G TE+G +YW+++NSWG  WG+AGY+++
Sbjct: 281 EAGGREFQLYHSGVFSGRCGTQLDHGVVAVGYG-TENGKDYWIVRNSWGPNWGEAGYLRM 339

Query: 331 VRD----EGLCGIGTRSSYP 346
            R+     G CGI   SSYP
Sbjct: 340 ERNINVTSGKCGIAMMSSYP 359


>gi|449522968|ref|XP_004168497.1| PREDICTED: xylem cysteine proteinase 1-like [Cucumis sativus]
          Length = 348

 Score =  292 bits (748), Expect = 1e-76,   Method: Compositional matrix adjust.
 Identities = 143/314 (45%), Positives = 211/314 (67%), Gaps = 10/314 (3%)

Query: 38  THEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFS 97
           T    ++E+ E+W++ HG+ Y+   EK  R ++FK+NL++I++ NK+   +Y LG N+F+
Sbjct: 36  TSMDRLIELFEEWISNHGKIYETIEEKWHRFEVFKDNLKHIDETNKKVT-SYWLGVNEFA 94

Query: 98  DLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKEC 157
           DLT+ EF+ +Y G K+ S   R +    F Y+++   D+P S+DWR KGAVT +KNQ  C
Sbjct: 95  DLTHQEFKNMYLGLKVESSRTRQSPEE-FTYKDV--VDLPKSVDWRKKGAVTRVKNQGSC 151

Query: 158 GCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGI 217
           G CWAF+ VAAVEGI KI  GNL  LSEQ+L+DC    NNGC GG  + AF++I+ + G+
Sbjct: 152 GSCWAFSTVAAVEGINKIVGGNLTSLSEQELIDCDRPYNNGCHGGLMDYAFSFIVSSGGL 211

Query: 218 ATEDEYPYQAVPGTCSAAQ-KPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTE 276
             E++YPY  V  TC   + +     IS Y++VP  +E +L+KA++ QP+S+AI A   +
Sbjct: 212 HKEEDYPYLEVESTCDNKKGELEVVTISGYKDVPENNEASLIKALAHQPLSVAIEASGRD 271

Query: 277 FQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD--- 333
           FQ Y  G+F+G CGTQLDH VT VG+G+++ G +Y ++KNSWG  WG+ GY+++ R+   
Sbjct: 272 FQFYSGGVFDGPCGTQLDHGVTAVGYGSSK-GVDYIIVKNSWGPKWGEKGYIRMKRNTGK 330

Query: 334 -EGLCGIGTRSSYP 346
             GLCGI   +SYP
Sbjct: 331 PAGLCGINKMASYP 344


>gi|30141025|dbj|BAC75926.1| cysteine protease-4 [Helianthus annuus]
          Length = 352

 Score =  292 bits (747), Expect = 2e-76,   Method: Compositional matrix adjust.
 Identities = 144/349 (41%), Positives = 217/349 (62%), Gaps = 18/349 (5%)

Query: 11  FKINTTPMFIIITLLVSCASQV--------VSSRSTHEQSVVEIHEKWMAQHGRSYKDEL 62
           F    T +F++   +++C++               T    V+ + E W+A+H + Y+   
Sbjct: 5   FSSKKTSLFLVFVSVLACSALANEFSILGYAPEDLTSIHKVIHLFESWLAKHSKIYESLD 64

Query: 63  EKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTT 122
           EK  R +IF +NL++I+  NK+ +  Y LG N+F+DLT++EF+  + G K   P  +  +
Sbjct: 65  EKLHRFEIFMDNLKHIDDTNKKVS-NYWLGLNEFADLTHEEFKNKFLGLKGELPERKDES 123

Query: 123 SSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQ 182
              F Y++    D+P S+DWR KGAV P+KNQ +CG CWAF+ VAAVEGI +I +GNL  
Sbjct: 124 IEEFSYRDF--VDLPKSVDWRKKGAVAPVKNQGQCGSCWAFSTVAAVEGINQIVTGNLTM 181

Query: 183 LSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPA-AA 241
           LSEQ+L+DC T  NNGC GG  + AFAY++++ G+  E+EYPY    GTC   +  +   
Sbjct: 182 LSEQELIDCDTTFNNGCNGGLMDYAFAYVMRS-GLHKEEEYPYIMSEGTCDEKKDVSETV 240

Query: 242 KISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVG 301
            IS Y +VP  +E + LKA++ QP+S+AI A   +FQ Y  G+F+G CGT+LDH V  VG
Sbjct: 241 TISGYHDVPRNNEDSFLKALANQPISVAIEASGRDFQFYSGGVFDGHCGTELDHGVAAVG 300

Query: 302 FGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYP 346
           +GTT+ G +Y +++NSWG  WG+ GY+++ R      G+CG+   +SYP
Sbjct: 301 YGTTK-GLDYVIVRNSWGPKWGEKGYIRMKRKTGKPHGMCGLYMMASYP 348


>gi|1169186|sp|P43156.1|CYSP_HEMSP RecName: Full=Thiol protease SEN102; Flags: Precursor
 gi|396568|emb|CAA52425.1| thiol-protease [Hemerocallis hybrid cultivar]
          Length = 360

 Score =  292 bits (747), Expect = 2e-76,   Method: Compositional matrix adjust.
 Identities = 150/349 (42%), Positives = 219/349 (62%), Gaps = 25/349 (7%)

Query: 17  PMFIIITLL----VSCASQVVSSRS--THEQSVVEIHEKWMAQHGRSYKDELEKEMRLKI 70
           P FI + L+    +S A  +  +      E S+  ++EKW   H  + +D  EK  R  +
Sbjct: 4   PKFIALALVALSFLSIAQSIPFTEKDLASEDSLWNLYEKWRTHHTVA-RDLDEKNRRFNV 62

Query: 71  FKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRS-----TTSST 125
           FKEN+++I + N++ +  YKL  N+F D+TN EFR+ Y G K+    HRS       + +
Sbjct: 63  FKENVKFIHEFNQKKDAPYKLALNKFGDMTNQEFRSKYAGSKIQH--HRSQRGIQKNTGS 120

Query: 126 FKYQNLSMTDVPT-SLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLS 184
           F Y+N+    +P  S+DWR KGAVT +K+Q +CG CWAF+ +A+VEGI +I++G L+ LS
Sbjct: 121 FMYENVG--SLPAASIDWRAKGAVTGVKDQGQCGSCWAFSTIASVEGINQIKTGELVSLS 178

Query: 185 EQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSA--AQKPAAAK 242
           EQ+L+DC T+ N GC GG  + AF +I Q  GI TED YPY    GTC++     P  + 
Sbjct: 179 EQELVDCDTSYNEGCNGGLMDYAFEFI-QKNGITTEDSYPYAEQDGTCASNLLNSPVVS- 236

Query: 243 ISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGF 302
           I  +++VP+ +E AL++AV+ QP+S++I A    FQ Y EG+F G CGT+LDH V IVG+
Sbjct: 237 IDGHQDVPANNENALMQAVANQPISVSIEASGYGFQFYSEGVFTGRCGTELDHGVAIVGY 296

Query: 303 GTTEDGANYWLIKNSWGNTWGDAGYMKIVR----DEGLCGIGTRSSYPL 347
           G T DG  YW++KNSWG  WG++GY+++ R      G CGI   +SYP+
Sbjct: 297 GATRDGTKYWIVKNSWGEEWGESGYIRMQRGISDKRGKCGIAMEASYPI 345


>gi|1174171|gb|AAB41816.1| NTH1 [Pisum sativum]
          Length = 367

 Score =  292 bits (747), Expect = 2e-76,   Method: Compositional matrix adjust.
 Identities = 148/335 (44%), Positives = 213/335 (63%), Gaps = 14/335 (4%)

Query: 18  MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
           +F +ITL  S +  + S RS  E  V+ ++EKW+ +H + Y    EK  R +IFK+NL +
Sbjct: 10  LFGLITL--SLSLDMSSGRSNKE--VMTMYEKWLVKHQKVYYGLGEKNQRFQIFKDNLIF 65

Query: 78  IEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVP 137
           I++ N   N +Y++G N+FSD+TN E+R  Y      +      TS  + Y+      +P
Sbjct: 66  IDEHNAP-NHSYRVGLNEFSDITNKEYRDTYLSRWSNNNIKNKITSVRYAYKAGHNNKLP 124

Query: 138 TSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNN 197
            S+DWR  GA+TPIKNQ  CG CWAF+AVAAVE I KI +G+L+ LSEQ+L+DC    N 
Sbjct: 125 VSVDWR--GALTPIKNQGSCGACWAFSAVAAVEAINKIVTGSLVSLSEQELVDCDRTKNK 182

Query: 198 GCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQK-PAAAKISNYEEVPSGDEQA 256
           GC GG++  A+ +I++N G+ ++ +YPY     TC+ A+K      I+ Y+ V    E A
Sbjct: 183 GCNGGNQVNAYRFIVENGGLDSQIDYPYLGRQSTCNQAKKNTKVVSINGYKNVQRNSESA 242

Query: 257 LLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKN 316
           L++AV+ QPVS+ I AY  +FQ Y+ G+F G CGT LDHAV +VG+G +E+G +YWL+KN
Sbjct: 243 LMEAVANQPVSVGIEAYGKDFQLYQSGVFTGSCGTSLDHAVVVVGYG-SENGKDYWLVKN 301

Query: 317 SWGNTWGDAGYMKIVR-----DEGLCGIGTRSSYP 346
           SWG  WG+ GY+KI R     + G CGI   ++YP
Sbjct: 302 SWGTNWGERGYLKIERNLKNTNTGKCGIAMDATYP 336


>gi|356517188|ref|XP_003527271.1| PREDICTED: xylem cysteine proteinase 2-like [Glycine max]
          Length = 350

 Score =  292 bits (747), Expect = 2e-76,   Method: Compositional matrix adjust.
 Identities = 142/309 (45%), Positives = 209/309 (67%), Gaps = 11/309 (3%)

Query: 43  VVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTND 102
           ++E+ E WM++HG+ Y++  EK +R +IFK+NL++I++ NK  +  Y LG N+F+DL++ 
Sbjct: 44  LIELFESWMSRHGKIYENIEEKLLRFEIFKDNLKHIDERNKVVS-NYWLGLNEFADLSHR 102

Query: 103 EFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWA 162
           EF   Y G K+   S R  +   F Y+++   ++P S+DWR KGAV P+KNQ  CG CWA
Sbjct: 103 EFNNKYLGLKVDY-SRRRESPEEFTYKDV---ELPKSVDWRKKGAVAPVKNQGSCGSCWA 158

Query: 163 FAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDE 222
           F+ VAAVEGI +I +GNL  LSEQ+L+DC    NNGC GG  + AF++I++N G+  E++
Sbjct: 159 FSTVAAVEGINQIVTGNLTSLSEQELIDCDRTYNNGCNGGLMDYAFSFIVENGGLHKEED 218

Query: 223 YPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYK 281
           YPY    GTC   ++      IS Y +VP  +EQ+LLKA++ QP+S+AI A   +FQ Y 
Sbjct: 219 YPYIMEEGTCEMTKEETQVVTISGYHDVPQNNEQSLLKALANQPLSVAIEASGRDFQFYS 278

Query: 282 EGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLC 337
            G+F+G CG+ LDH V  VG+GT + G +Y  +KNSWG+ WG+ GY+++ R+    EG+C
Sbjct: 279 GGVFDGHCGSDLDHGVAAVGYGTAK-GVDYITVKNSWGSKWGEKGYIRMRRNIGKPEGIC 337

Query: 338 GIGTRSSYP 346
           GI   +SYP
Sbjct: 338 GIYKMASYP 346


>gi|449455625|ref|XP_004145553.1| PREDICTED: xylem cysteine proteinase 1-like [Cucumis sativus]
          Length = 351

 Score =  292 bits (747), Expect = 2e-76,   Method: Compositional matrix adjust.
 Identities = 143/314 (45%), Positives = 211/314 (67%), Gaps = 10/314 (3%)

Query: 38  THEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFS 97
           T    ++E+ E+W++ HG+ Y+   EK  R ++FK+NL++I++ NK+   +Y LG N+F+
Sbjct: 39  TSMDRLIELFEEWISNHGKIYETIEEKWHRFEVFKDNLKHIDETNKKVT-SYWLGVNEFA 97

Query: 98  DLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKEC 157
           DLT+ EF+ +Y G K+ S   R +    F Y+++   D+P S+DWR KGAVT +KNQ  C
Sbjct: 98  DLTHQEFKNMYLGLKVESSRTRQSPEE-FTYKDV--VDLPKSVDWRKKGAVTRVKNQGSC 154

Query: 158 GCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGI 217
           G CWAF+ VAAVEGI KI  GNL  LSEQ+L+DC    NNGC GG  + AF++I+ + G+
Sbjct: 155 GSCWAFSTVAAVEGINKIVGGNLTSLSEQELIDCDRPYNNGCHGGLMDYAFSFIVSSGGL 214

Query: 218 ATEDEYPYQAVPGTCSAAQ-KPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTE 276
             E++YPY  V  TC   + +     IS Y++VP  +E +L+KA++ QP+S+AI A   +
Sbjct: 215 HKEEDYPYLEVESTCDNKKGELEVVTISGYKDVPENNEASLIKALAHQPLSVAIEASGRD 274

Query: 277 FQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD--- 333
           FQ Y  G+F+G CGTQLDH VT VG+G+++ G +Y ++KNSWG  WG+ GY+++ R+   
Sbjct: 275 FQFYSGGVFDGPCGTQLDHGVTAVGYGSSK-GVDYIIVKNSWGPKWGEKGYIRMKRNTGK 333

Query: 334 -EGLCGIGTRSSYP 346
             GLCGI   +SYP
Sbjct: 334 PAGLCGINKMASYP 347


>gi|357507505|ref|XP_003624041.1| Cysteine proteinase [Medicago truncatula]
 gi|355499056|gb|AES80259.1| Cysteine proteinase [Medicago truncatula]
          Length = 342

 Score =  291 bits (746), Expect = 2e-76,   Method: Compositional matrix adjust.
 Identities = 154/316 (48%), Positives = 205/316 (64%), Gaps = 25/316 (7%)

Query: 42  SVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDL-- 99
           S+ E  E W  ++G  YKD  E++   +IFK N+ YI+  N  GN+ YKL  N+F D   
Sbjct: 37  SLSERFEYWKTKYGVVYKDVAEQKKHFQIFKHNVAYIDYFNAAGNKPYKLAINRFVDKPI 96

Query: 100 --TNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKEC 157
             ++D F            +  +T ++TFKY+N+  TD+P ++DWR +GAVTPIKNQ +C
Sbjct: 97  EDSDDGFER----------TTTTTPTTTFKYENV--TDIPATVDWRKRGAVTPIKNQGKC 144

Query: 158 GCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGN-NGCLGGSREKAFAYIIQNQG 216
           G CWAF+AVAA+EGI KI SGNL+ LSEQQL+DC  +G   GC  G+   AF +I++N G
Sbjct: 145 GSCWAFSAVAAIEGIQKITSGNLVSLSEQQLVDCDRSGRTKGCDNGNMINAFKFILENGG 204

Query: 217 IATEDEYPYQ-AVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYST 275
           IATE  YPY+  V GTC         +I +YEEVPS  E +LLKAV+ QPVS+ I     
Sbjct: 205 IATEANYPYKRVVKGTCKKVSH--KVQIKSYEEVPSNSEDSLLKAVANQPVSVGIDMRGM 262

Query: 276 EFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD-- 333
            F+ Y  GIF G CGT+ +HA+TIVG+GT++DG  YWL+KNSW   WG+ GY++I RD  
Sbjct: 263 -FKFYSSGIFTGECGTKPNHALTIVGYGTSKDGIKYWLVKNSWSKRWGEKGYIRIKRDID 321

Query: 334 --EGLCGIGTRSSYPL 347
             EGLCGI  + SYP+
Sbjct: 322 AKEGLCGIAMKPSYPI 337


>gi|111073717|dbj|BAF02547.1| triticain beta [Triticum aestivum]
          Length = 472

 Score =  291 bits (746), Expect = 2e-76,   Method: Compositional matrix adjust.
 Identities = 144/320 (45%), Positives = 208/320 (65%), Gaps = 16/320 (5%)

Query: 40  EQSVVEIHEKWMAQHGRSYKDEL----EKEMRLKIFKENLEYIEKANKE---GNRTYKLG 92
           E     +++ W+A++G           E+E R + F +NL +++  N     G   Y+LG
Sbjct: 46  EAEARAVYDLWLAENGGGSSPNANSIPERERRFRAFWDNLNFVDAHNARAAAGEEGYRLG 105

Query: 93  TNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIK 152
            N+F+DLTNDEFRA Y G K  +   R       +Y++    ++P ++DWR+KGAV P+K
Sbjct: 106 MNRFADLTNDEFRAAYLGVK--AQRARPGRMVGERYRHDGAEELPEAVDWREKGAVAPVK 163

Query: 153 NQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-NNGCLGGSREKAFAYI 211
           NQ +CG CWAF+AV+ VE I +I +G ++ LSEQ+L++C TNG ++GC GG  + AF +I
Sbjct: 164 NQGQCGSCWAFSAVSTVESINQIVTGEMVTLSEQELVECDTNGQSSGCNGGLMDDAFEFI 223

Query: 212 IQNQGIATEDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVSIAI 270
           I+N GI TED+YPY+A+ G C   +K A    I  +E+VP  DE++L KAV+ QPVS+AI
Sbjct: 224 IKNGGIDTEDDYPYKAIDGRCDVLRKNAKVVSIDGFEDVPENDEKSLQKAVAHQPVSVAI 283

Query: 271 AAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKI 330
            A   EFQ Y  G+F+G CGTQLDH V  VG+G TE+G +YW+++NSWG  WG++GY+++
Sbjct: 284 EAGGREFQLYHSGVFSGRCGTQLDHGVVAVGYG-TENGKDYWIVRNSWGPNWGESGYLRM 342

Query: 331 VRD----EGLCGIGTRSSYP 346
            R+     G CGI   SSYP
Sbjct: 343 ERNINVTSGKCGIAMMSSYP 362


>gi|356508487|ref|XP_003522988.1| PREDICTED: xylem cysteine proteinase 2-like [Glycine max]
          Length = 349

 Score =  291 bits (746), Expect = 2e-76,   Method: Compositional matrix adjust.
 Identities = 143/309 (46%), Positives = 207/309 (66%), Gaps = 11/309 (3%)

Query: 43  VVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTND 102
           ++E+ E WM++HG+ Y+   EK  R  IFK+NL++I++ NK  +  Y LG N+F+DL++ 
Sbjct: 43  LIELFESWMSRHGKIYQSIEEKLHRFDIFKDNLKHIDERNKVVS-NYWLGLNEFADLSHQ 101

Query: 103 EFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWA 162
           EF+  Y G K+   S R  +   F Y++    ++P S+DWR KGAVT +KNQ  CG CWA
Sbjct: 102 EFKNKYLGLKVDY-SRRRESPEEFTYKDF---ELPKSVDWRKKGAVTQVKNQGSCGSCWA 157

Query: 163 FAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDE 222
           F+ VAAVEGI +I +GNL  LSEQ+L+DC    NNGC GG  + AF++I++N G+  E++
Sbjct: 158 FSTVAAVEGINQIVTGNLTSLSEQELIDCDRTYNNGCNGGLMDYAFSFIVENGGLHKEED 217

Query: 223 YPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYK 281
           YPY    GTC   ++      IS Y +VP  +EQ+LLKA+  QP+S+AI A   +FQ Y 
Sbjct: 218 YPYIMEEGTCEMTKEETEVVTISGYHDVPQNNEQSLLKALVNQPLSVAIEASGRDFQFYS 277

Query: 282 EGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLC 337
            G+F+G CG+ LDH V  VG+GT++ G NY ++KNSWG+ WG+ GY+++ R+    EG+C
Sbjct: 278 GGVFDGHCGSDLDHGVAAVGYGTSK-GVNYIIVKNSWGSKWGEKGYIRMRRNIGKPEGIC 336

Query: 338 GIGTRSSYP 346
           GI   +SYP
Sbjct: 337 GIYKMASYP 345


>gi|195637152|gb|ACG38044.1| vignain precursor [Zea mays]
          Length = 377

 Score =  291 bits (746), Expect = 2e-76,   Method: Compositional matrix adjust.
 Identities = 142/292 (48%), Positives = 187/292 (64%), Gaps = 18/292 (6%)

Query: 68  LKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHR-------- 119
             +FK N+  I + N+  +  YKL  N+F D+T DEFR  Y G ++    HR        
Sbjct: 70  FNVFKANVRLIHEFNRR-DEPYKLRLNRFGDMTADEFRRHYAGSRVAH--HRMFRGDRQG 126

Query: 120 STTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGN 179
           S+ S++F Y +    DVP S+DWR KGAVT +K+Q +CG CWAF+ +AAVEGI  I++ N
Sbjct: 127 SSASASFMYAD--ARDVPASVDWRQKGAVTDVKDQGQCGSCWAFSTIAAVEGINAIKTKN 184

Query: 180 LIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPA 239
           L  LSEQQL+DC T  N GC GG  + AF YI ++ G+A ED YPY+A   +C  +  P 
Sbjct: 185 LTSLSEQQLVDCDTKANAGCNGGLMDYAFQYIAKHGGVAAEDAYPYRARQASCKKSPAPV 244

Query: 240 AAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTI 299
              I  YE+VP+ DE AL KAV+ QPVS+AI A  + FQ Y EG+F+G CGT+LDH V  
Sbjct: 245 VT-IDGYEDVPANDESALKKAVAHQPVSVAIEASGSHFQFYSEGVFSGRCGTELDHGVAA 303

Query: 300 VGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPL 347
           VG+G T DG  YWL+KNSWG  WG+ GY+++ RD    EG CGI   +SYP+
Sbjct: 304 VGYGVTADGTKYWLVKNSWGPEWGEKGYIRMARDVAAKEGHCGIAMEASYPV 355


>gi|37780049|gb|AAP32197.1| cysteine protease 10 [Trifolium repens]
          Length = 272

 Score =  291 bits (746), Expect = 3e-76,   Method: Compositional matrix adjust.
 Identities = 142/271 (52%), Positives = 191/271 (70%), Gaps = 13/271 (4%)

Query: 86  NRTYKLGTNQFSDLTNDEFRALYTGYK--MPSPSHRSTTSSTFKYQNLSMTDVPTSLDWR 143
           N+ YKLG N+F+DLTN+EF+A    +K  M S   R+TT   FKY+N S   +P+++DWR
Sbjct: 7   NKLYKLGINKFADLTNEEFKASRNKFKGHMCSSIIRTTT---FKYENASA--IPSTVDWR 61

Query: 144 DKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-NNGCLGG 202
            KGAVTP+KNQ +CG CWAF+AVAA EGI ++ +G L+ LSEQ+L+DC T G + GC GG
Sbjct: 62  KKGAVTPVKNQGQCGSCWAFSAVAATEGIHQLSTGKLVSLSEQELIDCDTKGVDQGCEGG 121

Query: 203 SREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAV 261
             + AF +IIQN G++TE +YPY+ V GTC+  +    A  I+ YE+VP+ +E AL KAV
Sbjct: 122 LMDDAFKFIIQNHGLSTEVQYPYEGVDGTCNTNEASIHAVTITGYEDVPANNELALQKAV 181

Query: 262 SMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNT 321
           + QP+S+AI A  ++FQ Y  G+F G CGT+LDH VT VG+G   DG  YWL+KNSWG  
Sbjct: 182 ANQPISVAIDASGSDFQFYNSGVFTGSCGTELDHGVTAVGYGVGNDGTKYWLVKNSWGAD 241

Query: 322 WGDAGYMKIVRD----EGLCGIGTRSSYPLA 348
           WG+ GY+++ R     EGLCGI  ++SYP A
Sbjct: 242 WGEEGYIRMQRGIDAAEGLCGIAMQASYPTA 272


>gi|224096714|ref|XP_002310708.1| predicted protein [Populus trichocarpa]
 gi|222853611|gb|EEE91158.1| predicted protein [Populus trichocarpa]
          Length = 356

 Score =  291 bits (745), Expect = 3e-76,   Method: Compositional matrix adjust.
 Identities = 152/325 (46%), Positives = 207/325 (63%), Gaps = 18/325 (5%)

Query: 38  THEQS-------VVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYK 90
           +H+QS       V+ I++ W+ +HG++Y    EK  R +IFK NL +I++ N + NRTYK
Sbjct: 12  SHDQSSWRSDDEVMSIYKWWLQKHGKAYNRLGEKAKRFEIFKNNLRFIDEHNSQ-NRTYK 70

Query: 91  LGTNQFSDLTNDEFRALYTGYKMPSPSHR--STTSSTFKYQNLSMTDVPTSLDWRDKGAV 148
           +G  +F+DLTN E+RA++ G +   P  R   + + + +Y   +   +P S+DWR KGAV
Sbjct: 71  VGLTKFADLTNQEYRAMFLGTR-SDPKRRLMKSKNPSERYAYKAGDKLPESVDWRGKGAV 129

Query: 149 TPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAF 208
            PIK+Q  CG CWAF+ VAAVEGI +I +G LI LSEQ+L+DC    N GC GG  + AF
Sbjct: 130 NPIKDQGSCGSCWAFSTVAAVEGINQIVTGELISLSEQELVDCDRFYNAGCNGGLMDYAF 189

Query: 209 AYIIQNQGIATEDEYPYQAVPGTCSAAQ-KPAAAKISNYEEVPSGDEQALLKAVSMQPVS 267
            +II N G+ TE +YPY     TC   + K  A  I  +E+V   DE+AL KAV+ QPVS
Sbjct: 190 QFIINNGGLDTEKDYPYLGNDDTCDRDKMKTKAVSIDGFEDVLPFDEKALQKAVAHQPVS 249

Query: 268 IAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGY 327
           +AI A     Q Y+ G+F G CGT LDH V +VG+G TE G +YWL++NSWG  WG+ GY
Sbjct: 250 VAIEASGMALQFYQSGVFTGECGTALDHGVVVVGYG-TEKGLDYWLVRNSWGTEWGEHGY 308

Query: 328 MKI---VRD--EGLCGIGTRSSYPL 347
           +K+   VRD   G CGI   SSYP+
Sbjct: 309 IKMQRNVRDTYTGRCGIAMESSYPV 333


>gi|22759715|dbj|BAC10906.1| cysteine proteinase [Zinnia elegans]
          Length = 352

 Score =  291 bits (744), Expect = 4e-76,   Method: Compositional matrix adjust.
 Identities = 144/344 (41%), Positives = 216/344 (62%), Gaps = 17/344 (4%)

Query: 15  TTPMFIIITLLVSCASQ-------VVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMR 67
           T+ +F+ +++L   A               T    V+ + E W+ +H + Y+   EK  R
Sbjct: 10  TSLLFLFVSILACSALAHEFSILGYAPEDLTSIHKVIHLFESWLVKHSKFYESLDEKLHR 69

Query: 68  LKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFK 127
            +IF +NL++I++ NK+ +  Y LG N+F+DLT++EF+  + G+K      +  +S  F 
Sbjct: 70  FEIFMDNLKHIDETNKKVS-NYWLGLNEFADLTHEEFKHKFLGFKGELAERKDESSKEFG 128

Query: 128 YQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQ 187
           Y++    D+P S+DWR KGAV P+KNQ +CG CWAF+ VAAVEGI +I +GNL  LSEQ+
Sbjct: 129 YRDF--VDLPKSVDWRKKGAVAPVKNQGQCGSCWAFSTVAAVEGINQIVTGNLTMLSEQE 186

Query: 188 LLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPA-AAKISNY 246
           L+DC T  NNGC GG  + AFAY++++ G+  E+EYPY    GTC   +  +    IS Y
Sbjct: 187 LIDCDTTFNNGCNGGLMDYAFAYVMRS-GLHKEEEYPYIMSEGTCDEKKDVSEKVTISGY 245

Query: 247 EEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTE 306
            +VP  DE + LKA++ QP+S+AI A   +FQ Y  G+F+G CGT+LDH V  VG+GTT+
Sbjct: 246 HDVPRNDEASFLKALANQPISVAIEASGRDFQFYSGGVFDGHCGTELDHGVAAVGYGTTK 305

Query: 307 DGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYP 346
            G +Y +++NSWG  WG+ GY+++ R      G+CG+   +SYP
Sbjct: 306 -GLDYVIVRNSWGPKWGEKGYIRMKRGSGKPHGMCGLYMMASYP 348


>gi|18394919|ref|NP_564126.1| Xylem cysteine proteinase 2 [Arabidopsis thaliana]
 gi|71153409|sp|Q9LM66.2|XCP2_ARATH RecName: Full=Xylem cysteine proteinase 2; Short=AtXCP2; Flags:
           Precursor
 gi|4836904|gb|AAD30607.1|AC007369_17 Putative cysteine proteinase [Arabidopsis thaliana]
 gi|6708183|gb|AAF25832.1|AF191028_1 papain-type cysteine endopeptidase XCP2 [Arabidopsis thaliana]
 gi|28466959|gb|AAO44088.1| At1g20850 [Arabidopsis thaliana]
 gi|110743795|dbj|BAE99733.1| putative cysteine proteinase [Arabidopsis thaliana]
 gi|332191910|gb|AEE30031.1| Xylem cysteine proteinase 2 [Arabidopsis thaliana]
          Length = 356

 Score =  291 bits (744), Expect = 4e-76,   Method: Compositional matrix adjust.
 Identities = 140/315 (44%), Positives = 215/315 (68%), Gaps = 11/315 (3%)

Query: 38  THEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFS 97
           +H++ ++E+ E W++   ++Y+   EK +R ++FK+NL++I++ NK+G ++Y LG N+F+
Sbjct: 43  SHDK-LIELFENWISNFEKAYETVEEKFLRFEVFKDNLKHIDETNKKG-KSYWLGLNEFA 100

Query: 98  DLTNDEFRALYTGYKMPSPSHRSTTS-STFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKE 156
           DL+++EF+ +Y G K          S + F Y+++    VP S+DWR KGAV  +KNQ  
Sbjct: 101 DLSHEEFKKMYLGLKTDIVRRDEERSYAEFAYRDVEA--VPKSVDWRKKGAVAEVKNQGS 158

Query: 157 CGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQG 216
           CG CWAF+ VAAVEGI KI +GNL  LSEQ+L+DC T  NNGC GG  + AF YI++N G
Sbjct: 159 CGSCWAFSTVAAVEGINKIVTGNLTTLSEQELIDCDTTYNNGCNGGLMDYAFEYIVKNGG 218

Query: 217 IATEDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYST 275
           +  E++YPY    GTC   +  +    I+ +++VP+ DE++LLKA++ QP+S+AI A   
Sbjct: 219 LRKEEDYPYSMEEGTCEMQKDESETVTINGHQDVPTNDEKSLLKALAHQPLSVAIDASGR 278

Query: 276 EFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD-- 333
           EFQ Y  G+F+G CG  LDH V  VG+G+++ G++Y ++KNSWG  WG+ GY+++ R+  
Sbjct: 279 EFQFYSGGVFDGRCGVDLDHGVAAVGYGSSK-GSDYIIVKNSWGPKWGEKGYIRLKRNTG 337

Query: 334 --EGLCGIGTRSSYP 346
             EGLCGI   +S+P
Sbjct: 338 KPEGLCGINKMASFP 352


>gi|388519351|gb|AFK47737.1| unknown [Medicago truncatula]
          Length = 359

 Score =  291 bits (744), Expect = 4e-76,   Method: Compositional matrix adjust.
 Identities = 149/346 (43%), Positives = 214/346 (61%), Gaps = 15/346 (4%)

Query: 10  SFKINTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLK 69
           S  I +   F +ITL ++  +   S RS  E  V+ ++E+W+ +H + Y    EK+ R +
Sbjct: 3   SITITSLLFFSLITLSLAMDT---SMRSNEE--VMTMYEEWLVKHHKVYNGLGEKDQRFE 57

Query: 70  IFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSH--RSTTSSTFK 127
           IFK+NL +I++ N + N TYK+G N+F+D TN+E+R +Y G K  +  +  +   ++  +
Sbjct: 58  IFKDNLGFIDEHNAQ-NYTYKVGLNKFADTTNEEYRNMYLGTKNDAKRNVMKIKITTGHR 116

Query: 128 YQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQ 187
           Y   S   +P  +DWR KGAV  IK+Q  CG CWAF+ +A VE I KI +G L+ LSEQ+
Sbjct: 117 YAFNSGDRLPVHVDWRSKGAVAHIKDQGSCGSCWAFSTIATVEAINKIVTGKLVSLSEQE 176

Query: 188 LLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPA-AAKISNY 246
           L+DC    N GC GG  + AF +I++N GI TE +YPY+   G C   +K A    I  Y
Sbjct: 177 LVDCDRAFNEGCNGGLMDYAFEFIVENGGIDTEQDYPYKGFEGRCDPTRKNAKVVSIDGY 236

Query: 247 EEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTE 306
           E+VP+ +E AL KAV  QPVS+AI A     Q Y+ G+F G CGT LDH V +VG+G  E
Sbjct: 237 EDVPAYNENALKKAVFHQPVSVAIEAGGRALQLYQSGVFTGRCGTNLDHGVVVVGYG-FE 295

Query: 307 DGANYWLIKNSWGNTWGDAGYMKIVR-----DEGLCGIGTRSSYPL 347
           +G +YWL++NSWG  WG+ GY K+ R     + G CGI  ++SYP+
Sbjct: 296 NGVDYWLVRNSWGTNWGEDGYFKLERNVKKINTGKCGIAMQASYPV 341


>gi|220983358|dbj|BAH11164.1| cysteine protease [Hordeum vulgare]
          Length = 462

 Score =  291 bits (744), Expect = 4e-76,   Method: Compositional matrix adjust.
 Identities = 143/315 (45%), Positives = 204/315 (64%), Gaps = 12/315 (3%)

Query: 40  EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKAN---KEGNRTYKLGTNQF 96
           E+ V  ++ +WMA+H  +Y    E+E R + F+ NL YI++ N     G  +++LG N+F
Sbjct: 35  EEEVRRMYAEWMAEHHSTYNPIGEEERRFEAFRNNLRYIDQHNAAADAGVHSFRLGLNRF 94

Query: 97  SDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKE 156
           +DLTN+E+R+ Y G +      R  ++   +YQ     ++P S+DWR KGAV  +K+Q  
Sbjct: 95  ADLTNEEYRSTYLGARTKPDRERKLSA---RYQAADNDELPESVDWRKKGAVGAVKDQGG 151

Query: 157 CGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQG 216
           CG CWAF+A+AAVEGI +I +G++I LSEQ+L+DC T+ N GC GG  + AF +II N G
Sbjct: 152 CGSCWAFSAIAAVEGINQIVTGDMIPLSEQELVDCDTSYNQGCNGGLMDYAFEFIINNGG 211

Query: 217 IATEDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYST 275
           I +E++YPY+     C A +K A    I  YE+VP   E++L KAV+ QP+S+AI A   
Sbjct: 212 IDSEEDYPYKERDNRCDANKKNAKVVTIDGYEDVPVNSEKSLQKAVANQPISVAIEAGGR 271

Query: 276 EFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD-- 333
            FQ YK GIF G CGT LDH V  VG+G TE+G +YWL++NSWG+ WG+ GY+++ R+  
Sbjct: 272 AFQLYKSGIFTGTCGTALDHGVAAVGYG-TENGKDYWLVRNSWGSVWGENGYIRMERNIK 330

Query: 334 --EGLCGIGTRSSYP 346
              G CGI    SYP
Sbjct: 331 ASSGKCGIAVEPSYP 345


>gi|302790836|ref|XP_002977185.1| hypothetical protein SELMODRAFT_106228 [Selaginella moellendorffii]
 gi|300155161|gb|EFJ21794.1| hypothetical protein SELMODRAFT_106228 [Selaginella moellendorffii]
          Length = 299

 Score =  291 bits (744), Expect = 5e-76,   Method: Compositional matrix adjust.
 Identities = 142/303 (46%), Positives = 199/303 (65%), Gaps = 8/303 (2%)

Query: 46  IHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFR 105
           + E W A+HG+SY  + EK  RL IF + L YIEK N + N T+ LG N+FSDLTN EFR
Sbjct: 1   MFEDWAAKHGKSYSSDSEKARRLMIFSDTLAYIEKHNAQPNTTFTLGLNKFSDLTNAEFR 60

Query: 106 ALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAA 165
           A Y G K  SP ++    +  K  ++ ++ +PTSLDWR +GAVTPIK+Q +CG CWAF+A
Sbjct: 61  ANYVG-KFKSPRYQDRRPA--KDVDVDVSSLPTSLDWRQEGAVTPIKDQGQCGSCWAFSA 117

Query: 166 VAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPY 225
           +A++E    + +  L+ LSEQQL+DC T  + GC GG  E AF ++++N G+ TE+ YPY
Sbjct: 118 IASIESAHFLATKELVSLSEQQLIDCDTV-DQGCQGGFPEDAFKFVVENGGVTTEEAYPY 176

Query: 226 QAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIF 285
               G+C+ A K    +I+ Y++V      AL+KAVS  PV++ I      FQ+Y+ GI 
Sbjct: 177 TGFAGSCN-ANKNKVVEITGYKDVTKDSADALMKAVSKTPVTVGICGSDQNFQNYRSGIL 235

Query: 286 NGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD--EGLCGIGTRS 343
           +G C    DHAV ++G+G TE G  YW+IKNSWG +WG+ G+MKI +   EG+CG+  +S
Sbjct: 236 SGQCSNSRDHAVLVIGYG-TEGGMPYWIIKNSWGTSWGENGFMKIKKKDGEGMCGMNGQS 294

Query: 344 SYP 346
           SYP
Sbjct: 295 SYP 297


>gi|2224808|emb|CAB09697.1| cysteine endopeptidase EP-A [Hordeum vulgare subsp. vulgare]
 gi|326502180|dbj|BAK06781.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 365

 Score =  291 bits (744), Expect = 5e-76,   Method: Compositional matrix adjust.
 Identities = 144/315 (45%), Positives = 197/315 (62%), Gaps = 10/315 (3%)

Query: 40  EQSVVEIHEKWMAQHGRSYKD--ELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFS 97
           E+S+  ++E+W + +  S +      +E R  +FKEN  Y+ + NK  +R ++L  N+F+
Sbjct: 34  EESLRGLYERWRSHYTVSRRGLGADAEERRFNVFKENARYVHEGNKR-DRPFRLALNKFA 92

Query: 98  DLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTD-VPTSLDWRDKGAVTPIKNQKE 156
           D+T DEFR  Y G ++      S           +  D +P ++DWR KGAVT IK+Q +
Sbjct: 93  DMTTDEFRRTYAGSRVRHHLSLSGGRRGDGGFRYADADNLPPAVDWRQKGAVTAIKDQGQ 152

Query: 157 CGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQG 216
           CG CWAF+ + AVEGI KIR+G L+ LSEQ+L+DC    N GC GG  + AF +I +N G
Sbjct: 153 CGSCWAFSTIVAVEGINKIRTGKLVSLSEQELMDCDNVNNQGCEGGLMDYAFQFIQKN-G 211

Query: 217 IATEDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYST 275
           I TE  YPYQ   G+C  A++ A A  I  YE+VP+ DE AL KAV+ QPVS+AI A   
Sbjct: 212 ITTESNYPYQGEQGSCDQAKENAQAVTIDGYEDVPANDESALQKAVAGQPVSVAIDASGQ 271

Query: 276 EFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD-- 333
           +FQ Y EG+F G C T LDH V  VG+G T DG  YW++KNSWG  WG+ GY+++ R   
Sbjct: 272 DFQFYSEGVFTGECSTDLDHGVAAVGYGATRDGTKYWIVKNSWGEDWGEKGYIRMQRGVS 331

Query: 334 --EGLCGIGTRSSYP 346
             EGLCGI  ++SYP
Sbjct: 332 QTEGLCGIAMQASYP 346


>gi|356517184|ref|XP_003527269.1| PREDICTED: xylem cysteine proteinase 2-like [Glycine max]
          Length = 350

 Score =  290 bits (743), Expect = 5e-76,   Method: Compositional matrix adjust.
 Identities = 142/309 (45%), Positives = 209/309 (67%), Gaps = 11/309 (3%)

Query: 43  VVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTND 102
           ++E+ E W+++HG+ Y+   EK  R +IFK+NL++I++ NK  +  Y LG N+F+DL++ 
Sbjct: 44  LIELFESWISRHGKIYQSIEEKLHRFEIFKDNLKHIDERNKVVS-NYWLGLNEFADLSHQ 102

Query: 103 EFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWA 162
           EF+  Y G K+   S R  +   F Y+++   ++P S+DWR KGAVT +KNQ  CG CWA
Sbjct: 103 EFKNKYLGLKVDY-SRRRESPEEFTYKDV---ELPKSVDWRKKGAVTQVKNQGSCGSCWA 158

Query: 163 FAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDE 222
           F+ VAAVEGI +I +GNL  LSEQ+L+DC    NNGC GG  + AF++I++N G+  E++
Sbjct: 159 FSTVAAVEGINQIVTGNLTSLSEQELIDCDRTYNNGCNGGLMDYAFSFIVENDGLHKEED 218

Query: 223 YPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYK 281
           YPY    GTC  A++      IS Y +VP  +EQ+LLKA++ QP+S+AI A   +FQ Y 
Sbjct: 219 YPYIMEEGTCEMAKEETEVVTISGYHDVPQNNEQSLLKALANQPLSVAIEASGRDFQFYS 278

Query: 282 EGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLC 337
            G+F+G CG+ LDH V  VG+GT + G +Y  +KNSWG+ WG+ GY+++ R+    EG+C
Sbjct: 279 GGVFDGHCGSDLDHGVAAVGYGTAK-GVDYITVKNSWGSKWGEKGYIRMRRNIGKPEGIC 337

Query: 338 GIGTRSSYP 346
           GI   +SYP
Sbjct: 338 GIYKMASYP 346


>gi|146215986|gb|ABQ10195.1| actinidin Act2d [Actinidia eriantha]
          Length = 381

 Score =  290 bits (742), Expect = 6e-76,   Method: Compositional matrix adjust.
 Identities = 148/340 (43%), Positives = 207/340 (60%), Gaps = 11/340 (3%)

Query: 13  INTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFK 72
           I+ + +F    L++S A  + +S       V+ ++E W+ + G+SY    EKEMR +IFK
Sbjct: 10  ISMSLLFFSTLLILSSALDIKNSVQRTNDQVMAMYESWLVEQGKSYNSLDEKEMRFEIFK 69

Query: 73  ENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLS 132
           ENL  I+  N + NR+Y LG N+F+DLT++E+R+ Y G+K    +  S      +Y    
Sbjct: 70  ENLRIIDDHNADANRSYSLGLNRFADLTDEEYRSTYLGFKSGPKAKVSN-----RYVPKV 124

Query: 133 MTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS 192
              +P  +DWR  GAV  +K+Q  C  CWAF+AVAAVEGI KI +GNLI LSEQ+L+DC 
Sbjct: 125 GVVLPNYVDWRTVGAVVGVKDQGLCSSCWAFSAVAAVEGINKIVTGNLISLSEQELVDCG 184

Query: 193 -TNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAA-AKISNYEEVP 250
            T    GC  G    AF +II N GI TED YPY A  G C   +K      I NYE++P
Sbjct: 185 RTQRTRGCNRGYMNDAFQFIIDNGGINTEDNYPYTAQDGQCDWYRKNQRYVTIDNYEQLP 244

Query: 251 SGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGAN 310
           + +E  L  AV+ QP+++ + +   +F+ Y  GI+ G CGT +DH VTIVG+G TE G +
Sbjct: 245 ANNEWVLQNAVAYQPITVGLESEGGKFKLYTSGIYTGYCGTAIDHGVTIVGYG-TERGLD 303

Query: 311 YWLIKNSWGNTWGDAGYMKIVRD---EGLCGIGTRSSYPL 347
           YW++KNSWG  WG+ GY++I R+    G CGI    SYP+
Sbjct: 304 YWIVKNSWGTNWGENGYIRIQRNIGGAGKCGIAMVPSYPV 343


>gi|302790828|ref|XP_002977181.1| hypothetical protein SELMODRAFT_106402 [Selaginella moellendorffii]
 gi|300155157|gb|EFJ21790.1| hypothetical protein SELMODRAFT_106402 [Selaginella moellendorffii]
          Length = 337

 Score =  290 bits (742), Expect = 7e-76,   Method: Compositional matrix adjust.
 Identities = 149/335 (44%), Positives = 211/335 (62%), Gaps = 15/335 (4%)

Query: 21  IITLLVSCASQVVSSR--STHEQSVVEIH---EKWMAQHGRSYKDELEKEMRLKIFKENL 75
           +I L+V  A+    +R  +  +   +EI    E W A+HG+SY  + EK  RL IF + L
Sbjct: 6   LILLVVVGATPFAIARPAALEDGRALEIKNMFEDWAAKHGKSYSSDWEKARRLMIFSDTL 65

Query: 76  EYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTG-YKMPSPSHRSTTSSTFKYQNLSMT 134
            YIEK N + N T+ LG N+FSDLTN EFRA++ G +K P    R         +++ ++
Sbjct: 66  AYIEKHNAQPNTTFTLGLNKFSDLTNAEFRAMHVGKFKRPRYQDRLPAED----EDVDVS 121

Query: 135 DVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN 194
            +PTSLDWR KGAVTPIK+Q +CG CWAF+A+A++E    + +  L+ LSEQQL+DC T 
Sbjct: 122 SLPTSLDWRQKGAVTPIKDQGDCGSCWAFSAIASIESAHFLATKELVSLSEQQLMDCDTV 181

Query: 195 GNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQ-KPAAAKISNYEEVPSGD 253
            + GC GG  E AF ++++N G+ TE  YPY    G+C+A + K   A+I+ ++ V    
Sbjct: 182 -DAGCDGGLMETAFKFVVKNGGVTTEAAYPYTGSVGSCNANKAKNKVAEITGFKVVTEDS 240

Query: 254 EQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWL 313
             AL+KAVS  PV+++I      FQ+YK GI +G C   LDH V ++G+G TE G  YW+
Sbjct: 241 ADALMKAVSKTPVTVSICGSDENFQNYKSGILSGKCDDSLDHGVLLIGYG-TEGGMPYWI 299

Query: 314 IKNSWGNTWGDAGYMKIVRD--EGLCGIGTRSSYP 346
           IKNSWG +WG+ G+MKI R   +G+CG+   SSYP
Sbjct: 300 IKNSWGTSWGEDGFMKIERKDGDGMCGMNGDSSYP 334


>gi|224133764|ref|XP_002321655.1| predicted protein [Populus trichocarpa]
 gi|222868651|gb|EEF05782.1| predicted protein [Populus trichocarpa]
          Length = 360

 Score =  290 bits (742), Expect = 8e-76,   Method: Compositional matrix adjust.
 Identities = 145/317 (45%), Positives = 205/317 (64%), Gaps = 14/317 (4%)

Query: 40  EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDL 99
           E+S+ +++EKW + H  S   + EK  R  +F+ N+ ++   NK  ++ YKL  N+F+D+
Sbjct: 31  EESLWDLYEKWRSHHTVSTSLD-EKRKRFNVFRANVLHVHNTNKM-DKPYKLKLNKFADM 88

Query: 100 TNDEFRALYTGYKMPSPSH---RSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKE 156
           TN EFR  Y   K+   +        + +F Y N+    VP S+DWR KGAVTP+K+Q +
Sbjct: 89  TNHEFRTAYASSKVKHHTMFRGAPLGNGSFMYGNID--KVPASIDWRKKGAVTPVKDQGK 146

Query: 157 CGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQG 216
           CG CWAF+ + AVEGI  I++  LI LSEQ+L+DC+T  N+GC GG  + AF +I + +G
Sbjct: 147 CGSCWAFSTIVAVEGINFIKTNKLISLSEQELVDCNTGENHGCNGGLMDYAFEFITKQKG 206

Query: 217 IATEDEYPYQAVPGTCSA--AQKPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYS 274
           I TE  YPY+A  G C A  A +PA + I  +E+V   +E ALLKAV+ QPVS+AI A  
Sbjct: 207 ITTEANYPYRAQDGHCDANKANQPAVS-IDGHEDVLHNNENALLKAVANQPVSVAIDAGG 265

Query: 275 TEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD- 333
           ++FQ Y EG+F G CG +LDH V IVG+GTT DG  YW+++NSWG  WG+ GY+++ R  
Sbjct: 266 SDFQFYSEGVFTGECGKELDHGVAIVGYGTTVDGTKYWIVRNSWGPEWGERGYIRMQRGI 325

Query: 334 ---EGLCGIGTRSSYPL 347
               GLCGI   +SYP+
Sbjct: 326 SDRRGLCGIAMEASYPI 342


>gi|414591545|tpg|DAA42116.1| TPA: hypothetical protein ZEAMMB73_388689 [Zea mays]
          Length = 384

 Score =  290 bits (741), Expect = 8e-76,   Method: Compositional matrix adjust.
 Identities = 139/323 (43%), Positives = 200/323 (61%), Gaps = 18/323 (5%)

Query: 40  EQSVVEIHEKWMAQHGR----SYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQ 95
           E+S+  ++E+W + + R       D+ ++  R  +FKEN  Y+ +AN++  R ++L  N+
Sbjct: 34  EESLRALYERWRSHYHRVSPRDGDDKQQQARRFNVFKENARYVHEANRKDGRPFRLALNK 93

Query: 96  FSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNL-------SMTDVPTSLDWRDKGAV 148
           F+D+T DEFR  Y G +  +  HR+       + +          T++P ++DWR +GAV
Sbjct: 94  FADMTTDEFRRTYAGSR--TRHHRAQLGEARSFAHAQHGRGGSGTTNLPPAVDWRLRGAV 151

Query: 149 TPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAF 208
           T +K+Q +CG CWAF+A+AAVEG+ KI +G L+ LSEQ+L+DC    N GC GG  + AF
Sbjct: 152 TGVKDQGQCGSCWAFSAIAAVEGVNKIMTGKLVSLSEQELVDCDDVDNQGCDGGLMDYAF 211

Query: 209 AYIIQNQGIATEDEYPYQAVPGTCS-AAQKPAAAKISNYEEVPSGDEQALLKAVSMQPVS 267
            YI +N G+ TE  YPY A   +C+ A ++     I  YE+VP+ +E AL KAV+ QPV+
Sbjct: 212 QYIQRNGGVTTESNYPYLAEQRSCNKAKERSHDVTIDGYEDVPANNEDALQKAVASQPVA 271

Query: 268 IAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGY 327
           +AI A   +FQ Y EG+F G CGT LDH V  VG+GTT DG  YW +KNSWG  WG+ GY
Sbjct: 272 VAIEASGQDFQFYSEGVFTGSCGTDLDHGVAAVGYGTTGDGTKYWTVKNSWGEDWGERGY 331

Query: 328 MKIVR----DEGLCGIGTRSSYP 346
           +++ R      GLCGI    SYP
Sbjct: 332 IRMQRGVPDSRGLCGIAMEPSYP 354


>gi|351629615|gb|AEQ54771.1| KDDL-tailed cysteine proteinase CP4 [Coffea canephora]
          Length = 359

 Score =  290 bits (741), Expect = 8e-76,   Method: Compositional matrix adjust.
 Identities = 148/340 (43%), Positives = 217/340 (63%), Gaps = 20/340 (5%)

Query: 20  IIITLLVSCASQVVSSRS-THEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYI 78
           ++  +LV+  S  ++ R    E+S+ +++E+W + H  S +D  EK  R  +FK N+ +I
Sbjct: 12  VLAVILVAAMSMEITERDLASEESLWDLYERWRSHHTVS-RDLSEKRKRFNVFKANVHHI 70

Query: 79  EKANKEGNRTYKLGTNQFSDLTNDEFRALYTG----YKMPSPSHRSTTSSTFKYQNLSMT 134
            K N++ ++ YKL  N F+D+TN EFR  Y+     Y+M   S  +T     K ++L   
Sbjct: 71  HKVNQK-DKPYKLKLNSFADMTNHEFREFYSSKVKHYRMLHGSRANTGFMHGKTESL--- 126

Query: 135 DVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN 194
             P S+DWR +GAVT +KNQ +CG CWAF+ V  VEGI KI++G L+ LSEQ+L+DC T+
Sbjct: 127 --PASVDWRKQGAVTGVKNQGKCGSCWAFSTVVGVEGINKIKTGQLVSLSEQELVDCETD 184

Query: 195 GNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGD 253
            N GC GG  E A+ +I ++ GI TE  YPY+A  G+C +++  A A  I  +E VP+ D
Sbjct: 185 -NEGCNGGLMENAYEFIKKSGGITTERLYPYKARDGSCDSSKMNAPAVTIDGHEMVPAND 243

Query: 254 EQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNG-VCGTQLDHAVTIVGFGTTEDGANYW 312
           E AL+KAV+ QPVS+AI A  ++ Q Y EG++ G  CG +LDH V +VG+GT  DG  YW
Sbjct: 244 ENALMKAVANQPVSVAIDASGSDMQFYSEGVYAGDSCGNELDHGVAVVGYGTALDGTKYW 303

Query: 313 LIKNSWGNTWGDAGYMKIVR-----DEGLCGIGTRSSYPL 347
           ++KNSWG  WG+ GY+++ R     + G+CGI   +SYPL
Sbjct: 304 IVKNSWGTGWGEQGYIRMQRGVDAAEGGVCGIAMEASYPL 343


>gi|121308860|dbj|BAF43527.1| cysteine proteinase [Zinnia elegans]
          Length = 352

 Score =  290 bits (741), Expect = 8e-76,   Method: Compositional matrix adjust.
 Identities = 139/311 (44%), Positives = 205/311 (65%), Gaps = 10/311 (3%)

Query: 41  QSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLT 100
             V+ + E W+ +H + Y+   EK  R +IF +NL++I++ NK+ +  Y LG N+F+DLT
Sbjct: 43  HKVIHLFESWLVKHSKFYESLDEKLHRFEIFMDNLKHIDETNKKVS-NYWLGLNEFADLT 101

Query: 101 NDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCC 160
           ++EF+  + G+K      +  +S  F Y++    D+P S+DWR KGAV P+KNQ +CG C
Sbjct: 102 HEEFKHKFLGFKGELAERKDESSKEFGYRDF--VDLPKSVDWRKKGAVAPVKNQGQCGNC 159

Query: 161 WAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATE 220
           WAF+ VAAVEGI +I +GNL  LSEQ+L+DC T  NNGC GG  + AFAY++++ G+  E
Sbjct: 160 WAFSTVAAVEGINQIVTGNLTMLSEQELIDCDTTFNNGCNGGLMDYAFAYVMRS-GLHKE 218

Query: 221 DEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQS 279
           +EYPY    GTC   +  +    IS Y +VP  DE + LKA++ QP+S+AI A   +FQ 
Sbjct: 219 EEYPYIMSEGTCDEKKDVSEKVTISGYHDVPRNDEASFLKALANQPISVAIEASGRDFQF 278

Query: 280 YKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EG 335
           Y  G+F+G CGT+LDH V  VG+GTT+ G +Y +++NSWG  WG+ GY+++ R      G
Sbjct: 279 YSGGVFDGHCGTELDHGVAAVGYGTTK-GLDYVIVRNSWGPKWGEKGYIRMKRGSGKPHG 337

Query: 336 LCGIGTRSSYP 346
           +CG+   +SYP
Sbjct: 338 MCGLYMMASYP 348


>gi|46395939|sp|Q94B08.2|GCP1_ARATH RecName: Full=Germination-specific cysteine protease 1; Flags:
           Precursor
 gi|4006883|emb|CAB16767.1| cysteine proteinase [Arabidopsis thaliana]
 gi|7270637|emb|CAB80354.1| cysteine proteinase [Arabidopsis thaliana]
          Length = 376

 Score =  290 bits (741), Expect = 1e-75,   Method: Compositional matrix adjust.
 Identities = 147/322 (45%), Positives = 209/322 (64%), Gaps = 16/322 (4%)

Query: 40  EQSVVEIHEKWMAQHGRSYKDEL----EKEMRLKIFKENLEYIEKANKEG-NRTYKLGTN 94
           ++ V  I+ +W A+HG++  +      +++ R  IFK+NL +I+  N++  N TYKLG  
Sbjct: 42  DEEVRSIYLQWSAEHGKTNNNNNGIINDQDKRFNIFKDNLRFIDLHNEDNKNATYKLGLT 101

Query: 95  QFSDLTNDEFRALYTGYKMPSPSHRSTTSSTF--KYQN-LSMTDVPTSLDWRDKGAVTPI 151
           +F+DLTNDE+R LY G +   P+ R   +     KY   ++  +VP ++DWR KGAV PI
Sbjct: 102 KFTDLTNDEYRKLYLGART-EPARRIAKAKNVNQKYSAAVNGKEVPETVDWRQKGAVNPI 160

Query: 152 KNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYI 211
           K+Q  CG CWAF+  AAVEGI KI +G LI LSEQ+L+DC  + N GC GG  + AF +I
Sbjct: 161 KDQGTCGSCWAFSTTAAVEGINKIVTGELISLSEQELVDCDKSYNQGCNGGLMDYAFQFI 220

Query: 212 IQNQGIATEDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVSIAI 270
           ++N G+ TE +YPY+   G C++  K +    I  YE+VP+ DE AL KA+S QPVS+AI
Sbjct: 221 MKNGGLNTEKDYPYRGFGGKCNSFLKNSRVVSIDGYEDVPTKDETALKKAISYQPVSVAI 280

Query: 271 AAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKI 330
            A    FQ Y+ GIF G CGT LDHAV  VG+G +E+G +YW+++NSWG  WG+ GY+++
Sbjct: 281 EAGGRIFQHYQSGIFTGSCGTNLDHAVVAVGYG-SENGVDYWIVRNSWGPRWGEEGYIRM 339

Query: 331 VRD-----EGLCGIGTRSSYPL 347
            R+      G CGI   +SYP+
Sbjct: 340 ERNLAASKSGKCGIAVEASYPV 361


>gi|297745594|emb|CBI40759.3| unnamed protein product [Vitis vinifera]
          Length = 300

 Score =  289 bits (740), Expect = 1e-75,   Method: Compositional matrix adjust.
 Identities = 143/301 (47%), Positives = 206/301 (68%), Gaps = 10/301 (3%)

Query: 51  MAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTG 110
           M++HG+SY+   EK  R ++F++NL++I++ NK+ + +Y LG N+F+DL+++EF+  Y G
Sbjct: 1   MSKHGKSYRSFEEKLHRFEVFQDNLKHIDETNKKVS-SYWLGLNEFADLSHEEFKRKYLG 59

Query: 111 YKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVE 170
            K+  P  R +    F Y++++  D+P S+DWR KGAV  +KNQ  CG CWAF+ VAAVE
Sbjct: 60  LKIELPKRRDSPEE-FSYKDVA--DLPKSVDWRKKGAVAHVKNQGACGSCWAFSTVAAVE 116

Query: 171 GITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPG 230
           GI +I +GNL  LSEQ+L+DC    NNGC GG  + AFA+II N G+  E++YPY    G
Sbjct: 117 GINQIVTGNLTALSEQELIDCDKPFNNGCNGGLMDYAFAFIISNGGLRKEEDYPYVMEEG 176

Query: 231 TCSAAQKP-AAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVC 289
           TC   ++      IS Y +VP  +EQ+ LKA++ QP+S+AI A S  FQ Y  GIFNG C
Sbjct: 177 TCGEKKEELEVVTISGYHDVPEDNEQSFLKALANQPLSVAIEASSRGFQFYSGGIFNGHC 236

Query: 290 GTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSY 345
           GT+LDH V  VG+GT++ G +Y  +KNSWG+ WG+ GY+++ R+    EG+CGI   +SY
Sbjct: 237 GTELDHGVAAVGYGTSK-GVDYITVKNSWGSKWGEKGYIRMKRNVGKPEGICGIYKMASY 295

Query: 346 P 346
           P
Sbjct: 296 P 296


>gi|255646088|gb|ACU23531.1| unknown [Glycine max]
          Length = 362

 Score =  289 bits (740), Expect = 1e-75,   Method: Compositional matrix adjust.
 Identities = 150/322 (46%), Positives = 201/322 (62%), Gaps = 24/322 (7%)

Query: 40  EQSVVEIHEKWMAQHGRSYKDEL----EKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQ 95
           E+S  +++E+W     RSY+       +K  R  +FK N+ ++   NK  ++ YKL  N+
Sbjct: 33  EESFWDLYERW-----RSYRTVSRSLGDKHKRFNVFKANVMHVHNTNKM-DKPYKLKLNK 86

Query: 96  FSDLTNDEFRALYTGYKMPSPSHR-----STTSSTFKYQNLSMTDVPTSLDWRDKGAVTP 150
           F+D+TN EFR+ Y G K+    HR        + TF Y+ +    VP S DWR  GAVT 
Sbjct: 87  FADMTNHEFRSTYAGSKVNH--HRMFQGTPRGNGTFMYEKVG--SVPPSADWRKNGAVTG 142

Query: 151 IKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAY 210
           +K+Q +CG CWAF+ V AVEGI +I++  L+ LSEQ+L+DC T  N GC GG  E AF +
Sbjct: 143 VKDQGQCGSCWAFSTVVAVEGINQIKTNKLVSLSEQELVDCDTKKNAGCNGGLMESAFEF 202

Query: 211 IIQNQGIATEDEYPYQAVPGTCSAAQ-KPAAAKISNYEEVPSGDEQALLKAVSMQPVSIA 269
           I Q  GI TE  YPY A  GTC A++    A  I  +E VP+ DE ALLKAV+ QPVS+A
Sbjct: 203 IKQKGGITTESNYPYTAQDGTCDASKANDLAVSIDGHENVPANDENALLKAVANQPVSVA 262

Query: 270 IAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMK 329
           I A   +FQ Y EG+F G C T+L+H V IVG+GTT DG NYW ++NSWG  WG+ GY++
Sbjct: 263 IDAGGFDFQFYFEGVFTGDCSTELNHGVAIVGYGTTVDGTNYWTVRNSWGPEWGEQGYIR 322

Query: 330 ----IVRDEGLCGIGTRSSYPL 347
               I + EGLCGI   +SYP+
Sbjct: 323 MQRSIFKKEGLCGIAMMASYPI 344


>gi|317106666|dbj|BAJ53169.1| JHL18I08.3 [Jatropha curcas]
          Length = 441

 Score =  289 bits (740), Expect = 1e-75,   Method: Compositional matrix adjust.
 Identities = 138/309 (44%), Positives = 197/309 (63%), Gaps = 7/309 (2%)

Query: 43  VVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTND 102
           +  + E W  QHG++Y  + EK  RLK+F++N +++ + N +GN +Y L  N F+DLT+ 
Sbjct: 26  IAHLFETWCQQHGKTYASQEEKLFRLKVFQDNYDFVTEHNSQGNSSYTLSLNAFADLTHH 85

Query: 103 EFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWA 162
           EF+A   G    + +  +   S  +  +  + DVP S+DWR  GAVT +K+Q  CG CW+
Sbjct: 86  EFKASRLGLSSAASASLNVDRSNRQIPDF-VADVPASVDWRKNGAVTQVKDQGNCGACWS 144

Query: 163 FAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDE 222
           F+A  A+EGI KI +G+L+ LSEQ+L+DC  + NNGC GG  + AF ++I N GI TE++
Sbjct: 145 FSATGAIEGINKIVTGSLVSLSEQELVDCDKSYNNGCEGGIMDYAFQFVIDNHGIDTEED 204

Query: 223 YPYQAVPGTCSAAQ-KPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYK 281
           YPYQ    +C+  + K     I  Y +VP  +E+ LLKAV+ QPVS+ I      FQ Y 
Sbjct: 205 YPYQGRDRSCNKEKLKRHVVTIDGYVDVPQNNEKELLKAVANQPVSVGICGSERAFQLYS 264

Query: 282 EGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLC 337
           +GIF G C T LDHAV IVG+G +E+G +YW++KNSWG+ WG  GYM + R+     GLC
Sbjct: 265 KGIFTGPCSTSLDHAVLIVGYG-SENGVDYWIVKNSWGSYWGMDGYMHMQRNSGSSRGLC 323

Query: 338 GIGTRSSYP 346
           GI   +SYP
Sbjct: 324 GINMLASYP 332


>gi|357115272|ref|XP_003559414.1| PREDICTED: thiol protease SEN102-like [Brachypodium distachyon]
          Length = 360

 Score =  289 bits (739), Expect = 1e-75,   Method: Compositional matrix adjust.
 Identities = 147/319 (46%), Positives = 205/319 (64%), Gaps = 19/319 (5%)

Query: 47  HEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGN-------RTYKLGTNQFSDL 99
           HE WMA+HGR+Y D  EK  RL+IF+ N E I+  N + +        +++L TN+F+DL
Sbjct: 43  HESWMAEHGRTYADAEEKARRLEIFRANAERIDSFNSKADAAAGESVDSHRLATNRFADL 102

Query: 100 TNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSM-TDVPTSLDWRDKGAVTPIKNQKECG 158
           T++EFRA  TG + P+          F+Y+N S+  D   S+DWR  GAVT +K+Q  CG
Sbjct: 103 TDEEFRAARTGLRRPAAVA-GAVGGGFRYENFSLQADAAGSMDWRAMGAVTGVKDQGSCG 161

Query: 159 CCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNN-GCLGGSREKAFAYIIQNQGI 217
           CCWAF+AVAA+EG+TKIR+G L+ LSEQQL+DC   G++ GC GG  + AF YI +  G+
Sbjct: 162 CCWAFSAVAAMEGLTKIRTGRLVSLSEQQLVDCDVYGDDQGCEGGLMDNAFQYISRQGGL 221

Query: 218 ATEDEYPYQAVP-GTCSAAQKPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTE 276
           A+E  YPY     G+C + +   AA I  +E+VP+ +E AL+ AV+ QPVS+AI      
Sbjct: 222 ASESAYPYSGEDGGSCRSGRAQPAASIRGHEDVPANNEGALMAAVAHQPVSVAINGGDYV 281

Query: 277 FQSYKE----GIFNGVC-GTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKI- 330
           F+ Y         NG C  T+LDHA+T VG+G   DG  YWL+KNSWG+ WG++GY++I 
Sbjct: 282 FRFYDRGVLGAGGNGGCESTELDHAITAVGYGMAGDGTGYWLMKNSWGSGWGESGYVRIR 341

Query: 331 --VRDEGLCGIGTRSSYPL 347
              R EG+CG+   +SYP+
Sbjct: 342 RGSRGEGVCGLAKLASYPV 360


>gi|413951606|gb|AFW84255.1| hypothetical protein ZEAMMB73_933931 [Zea mays]
          Length = 379

 Score =  289 bits (739), Expect = 2e-75,   Method: Compositional matrix adjust.
 Identities = 144/355 (40%), Positives = 208/355 (58%), Gaps = 23/355 (6%)

Query: 12  KINTTPMFIIITLLVSCASQVVSSRSTHEQSVV------EIHEKWMAQHGRSYKDELEKE 65
           +++ T + + +  + S A ++  +    E+ +       +++E+W   H R ++   EK 
Sbjct: 3   QVSKTLLLVALVFVSSAAVELCRAIDFDERDLASDEALWDLYERWQTHH-RVHRHHGEKG 61

Query: 66  MRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKM------PSPSHR 119
            R   FKEN+ +I   NK G+R Y+L  N+F D+  +EFR+ +   ++       SP+ R
Sbjct: 62  RRFGTFKENVRFIHAHNKRGDRPYRLRLNRFGDMGREEFRSTFADSRINDLRRQDSPAAR 121

Query: 120 STTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGN 179
           +     F Y   S  D P S+DWR +GAVT +K+Q  CG CWAF+ V AVEGI  IR+G+
Sbjct: 122 AGAVPGFMYD--SAADPPRSVDWRQEGAVTGVKDQGHCGSCWAFSTVVAVEGINAIRTGS 179

Query: 180 LIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQ--- 236
           L  LSEQ+L+DC T+  NGC GG  E AF +I    GI TE  YPY+A  GTC   +   
Sbjct: 180 LASLSEQELIDCDTD-ENGCQGGLMENAFEFIKSFGGITTEAAYPYRASNGTCDGDRARR 238

Query: 237 -KPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDH 295
                  I  ++ VP+G E AL KAV+ QPVS+A+ A    FQ Y EG+F G CGT LDH
Sbjct: 239 GGGVVVVIDGHQMVPAGSEDALAKAVAHQPVSVAVDAGGQAFQFYSEGVFTGDCGTDLDH 298

Query: 296 AVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR---DEGLCGIGTRSSYPL 347
            V  VG+G  +DG  YW++KNSWG +WG+ GY+++ R   + GLCGI   +S+P+
Sbjct: 299 GVAAVGYGVGDDGTPYWIVKNSWGTSWGEGGYIRMQRGAGNGGLCGIAMEASFPI 353


>gi|186516984|ref|NP_195406.2| cysteine proteinase1 [Arabidopsis thaliana]
 gi|15290508|gb|AAK92229.1| cysteine proteinase [Arabidopsis thaliana]
 gi|332661313|gb|AEE86713.1| cysteine proteinase1 [Arabidopsis thaliana]
          Length = 376

 Score =  288 bits (738), Expect = 2e-75,   Method: Compositional matrix adjust.
 Identities = 147/322 (45%), Positives = 208/322 (64%), Gaps = 16/322 (4%)

Query: 40  EQSVVEIHEKWMAQHGRSYKDEL----EKEMRLKIFKENLEYIEKANKEG-NRTYKLGTN 94
           ++ V  I+ +W A+HG++  +      +++ R  IFK+NL +I+  N+   N TYKLG  
Sbjct: 42  DEEVRSIYLQWSAEHGKTNNNNNGIINDQDKRFNIFKDNLRFIDLHNENNKNATYKLGLT 101

Query: 95  QFSDLTNDEFRALYTGYKMPSPSHRSTTSSTF--KYQN-LSMTDVPTSLDWRDKGAVTPI 151
           +F+DLTNDE+R LY G +   P+ R   +     KY   ++  +VP ++DWR KGAV PI
Sbjct: 102 KFTDLTNDEYRKLYLGART-EPARRIAKAKNVNQKYSAAVNGKEVPETVDWRQKGAVNPI 160

Query: 152 KNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYI 211
           K+Q  CG CWAF+  AAVEGI KI +G LI LSEQ+L+DC  + N GC GG  + AF +I
Sbjct: 161 KDQGTCGSCWAFSTTAAVEGINKIVTGELISLSEQELVDCDKSYNQGCNGGLMDYAFQFI 220

Query: 212 IQNQGIATEDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVSIAI 270
           ++N G+ TE +YPY+   G C++  K +    I  YE+VP+ DE AL KA+S QPVS+AI
Sbjct: 221 MKNGGLNTEKDYPYRGFGGKCNSFLKNSRVVSIDGYEDVPTKDETALKKAISYQPVSVAI 280

Query: 271 AAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKI 330
            A    FQ Y+ GIF G CGT LDHAV  VG+G +E+G +YW+++NSWG  WG+ GY+++
Sbjct: 281 EAGGRIFQHYQSGIFTGSCGTNLDHAVVAVGYG-SENGVDYWIVRNSWGPRWGEEGYIRM 339

Query: 331 VRD-----EGLCGIGTRSSYPL 347
            R+      G CGI   +SYP+
Sbjct: 340 ERNLAASKSGKCGIAVEASYPV 361


>gi|413951605|gb|AFW84254.1| hypothetical protein ZEAMMB73_933931 [Zea mays]
          Length = 423

 Score =  288 bits (738), Expect = 2e-75,   Method: Compositional matrix adjust.
 Identities = 144/355 (40%), Positives = 208/355 (58%), Gaps = 23/355 (6%)

Query: 12  KINTTPMFIIITLLVSCASQVVSSRSTHEQSVV------EIHEKWMAQHGRSYKDELEKE 65
           +++ T + + +  + S A ++  +    E+ +       +++E+W   H R ++   EK 
Sbjct: 47  QVSKTLLLVALVFVSSAAVELCRAIDFDERDLASDEALWDLYERWQTHH-RVHRHHGEKG 105

Query: 66  MRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKM------PSPSHR 119
            R   FKEN+ +I   NK G+R Y+L  N+F D+  +EFR+ +   ++       SP+ R
Sbjct: 106 RRFGTFKENVRFIHAHNKRGDRPYRLRLNRFGDMGREEFRSTFADSRINDLRRQDSPAAR 165

Query: 120 STTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGN 179
           +     F Y   S  D P S+DWR +GAVT +K+Q  CG CWAF+ V AVEGI  IR+G+
Sbjct: 166 AGAVPGFMYD--SAADPPRSVDWRQEGAVTGVKDQGHCGSCWAFSTVVAVEGINAIRTGS 223

Query: 180 LIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQ--- 236
           L  LSEQ+L+DC T+  NGC GG  E AF +I    GI TE  YPY+A  GTC   +   
Sbjct: 224 LASLSEQELIDCDTD-ENGCQGGLMENAFEFIKSFGGITTEAAYPYRASNGTCDGDRARR 282

Query: 237 -KPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDH 295
                  I  ++ VP+G E AL KAV+ QPVS+A+ A    FQ Y EG+F G CGT LDH
Sbjct: 283 GGGVVVVIDGHQMVPAGSEDALAKAVAHQPVSVAVDAGGQAFQFYSEGVFTGDCGTDLDH 342

Query: 296 AVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR---DEGLCGIGTRSSYPL 347
            V  VG+G  +DG  YW++KNSWG +WG+ GY+++ R   + GLCGI   +S+P+
Sbjct: 343 GVAAVGYGVGDDGTPYWIVKNSWGTSWGEGGYIRMQRGAGNGGLCGIAMEASFPI 397


>gi|297799636|ref|XP_002867702.1| hypothetical protein ARALYDRAFT_329301 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297313538|gb|EFH43961.1| hypothetical protein ARALYDRAFT_329301 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 357

 Score =  288 bits (738), Expect = 2e-75,   Method: Compositional matrix adjust.
 Identities = 149/332 (44%), Positives = 218/332 (65%), Gaps = 17/332 (5%)

Query: 27  SCASQVVSSRSTHEQSVVE---IHEKWMAQHGRSYKDEL-EKEMRLKIFKENLEYIEKAN 82
           S A  + ++   H +S  E   I + WM++HG++Y + L EKE R + FK+NL +I++ N
Sbjct: 25  SSAIDLPATSGGHNRSNEEVGFIFQMWMSKHGKTYTNALGEKERRFQNFKDNLRFIDQHN 84

Query: 83  KEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDW 142
            + N +Y+LG  +F+DLT  E+R L+ G   P P  R+   S  +Y  L    +P S+DW
Sbjct: 85  AK-NLSYQLGLTRFADLTVQEYRDLFPG--SPKPKQRNLRISR-RYVPLDGDQLPESVDW 140

Query: 143 RDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLG- 201
           R++GAV+ IK+Q  C  CWAF+ VAAVEGI KI +G L+ LSEQ+L+DC+   NNGC G 
Sbjct: 141 RNEGAVSAIKDQGTCNSCWAFSTVAAVEGINKIVTGELVSLSEQELVDCNLV-NNGCYGS 199

Query: 202 GSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAA--KISNYEEVPSGDEQALLK 259
           G+ + AF ++I N G+ ++ +YPYQ   G C+  +  +     I +YE+VP+ DE +L K
Sbjct: 200 GTMDAAFQFLINNGGLDSDTDYPYQGSQGYCNRKESTSNKIITIDSYEDVPANDEISLQK 259

Query: 260 AVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWG 319
           AV+ QPVS+ +   S EF  Y+ GI+NG CGT LDHA+ IVG+G +E+G +YW+++NSWG
Sbjct: 260 AVAHQPVSVGVDKKSQEFMLYRSGIYNGPCGTDLDHALVIVGYG-SENGQDYWIVRNSWG 318

Query: 320 NTWGDAGYMKIVRD----EGLCGIGTRSSYPL 347
            TWGDAGY K+ R+     G+CGI   +SYP+
Sbjct: 319 TTWGDAGYAKMARNFEYPSGVCGIAMLASYPV 350


>gi|413938554|gb|AFW73105.1| hypothetical protein ZEAMMB73_931917 [Zea mays]
          Length = 361

 Score =  288 bits (738), Expect = 2e-75,   Method: Compositional matrix adjust.
 Identities = 148/353 (41%), Positives = 218/353 (61%), Gaps = 24/353 (6%)

Query: 17  PMFIIITLLVSCASQVVSSRSTHEQSVV-----------EIHEKWMAQHGRSYKDELEKE 65
           P   +   ++  A    S+    + SVV            +   W  +HG+ Y    EK 
Sbjct: 3   PKLAVAVFVLFLAFAACSANHHRDPSVVGYSQEDLALPSSLFRSWSVKHGKLYASPTEKL 62

Query: 66  MRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSP---SHRSTT 122
            R +IFK+NL +I + N++ N +Y LG NQF+D+ ++EF+A Y G K   P   + ++ T
Sbjct: 63  ERYEIFKQNLMHIAETNRK-NGSYWLGLNQFADVAHEEFKASYLGLKRALPRAGAPQTRT 121

Query: 123 SSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQ 182
            + F+Y   +   +P S+DWR KGAVTP+KNQ +CG CWAF++VAAVEGI +I +G L+ 
Sbjct: 122 PTAFRYAAAAAGSLPWSVDWRYKGAVTPVKNQGKCGSCWAFSSVAAVEGINQIVTGKLVS 181

Query: 183 LSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAA- 241
           LSEQ+L+DC T  ++GC GG+ + AFAY++ +QGI  ED+YPY    G C   Q      
Sbjct: 182 LSEQELVDCDTTLDHGCEGGTMDLAFAYMMGSQGIHAEDDYPYLMEEGYCKEKQPCVLGI 241

Query: 242 ---KISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVT 298
               ++ +E+VP   E +LLKA++ QPVS+ IAA S +FQ Y+ G+F+G C  +LDHA+T
Sbjct: 242 TEQDLTGFEDVPENSEISLLKALAHQPVSVGIAAGSRDFQFYRGGVFDGACSVELDHALT 301

Query: 299 IVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIV----RDEGLCGIGTRSSYPL 347
            VG+G++  G NY  +KNSWG  WG+ GY++I     + EG+CGI T +SYP+
Sbjct: 302 AVGYGSSY-GQNYITMKNSWGKNWGEQGYVRIKMGTGKPEGVCGIYTMASYPV 353


>gi|413953666|gb|AFW86315.1| hypothetical protein ZEAMMB73_539008 [Zea mays]
          Length = 314

 Score =  288 bits (737), Expect = 2e-75,   Method: Compositional matrix adjust.
 Identities = 139/334 (41%), Positives = 199/334 (59%), Gaps = 34/334 (10%)

Query: 18  MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
           +  I+     C + + +   + + ++V  HE+WMAQ+ R YKD  EK  R K        
Sbjct: 8   ILAILGFAFFCGAALAARDLSDDSAMVARHEQWMAQYSRVYKDASEKARRFK-------- 59

Query: 78  IEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVP 137
                             F+DLTN EFR++ T     S + +  T   F+Y+N+S   +P
Sbjct: 60  ------------------FADLTNHEFRSVKTNKGFKSSNMKILTG--FRYENVSADALP 99

Query: 138 TSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-N 196
           T++DWR KG VTPIK+Q +CGCC AF+AVAA EGI KI +G L+ L++Q+L+DC  +G +
Sbjct: 100 TTIDWRTKGVVTPIKDQGQCGCCSAFSAVAATEGIVKISTGKLVSLADQELVDCDVHGED 159

Query: 197 NGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQA 256
            GC GG  + AF +II+N G+ TE  YPY A  G C++    +AA I  YE+VP+ DE A
Sbjct: 160 QGCEGGLMDDAFKFIIKNGGLTTESSYPYTAADGKCNSGSN-SAATIKGYEDVPANDEAA 218

Query: 257 LLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKN 316
           L+KA++ QPVS+A+      F+ Y  G+  G CGT LDH +  +G+G T DG  YWL+KN
Sbjct: 219 LMKAMANQPVSVAVDGGDMTFRFYSGGVMTGSCGTDLDHGIAAIGYGKTSDGTKYWLMKN 278

Query: 317 SWGNTWGDAGYMKIVRD----EGLCGIGTRSSYP 346
           SWG TWG+ GY+++ +D     G+CG+    SYP
Sbjct: 279 SWGTTWGENGYLRMEKDISDKRGMCGLAMEPSYP 312


>gi|115448287|ref|NP_001047923.1| Os02g0715000 [Oryza sativa Japonica Group]
 gi|42408029|dbj|BAD09165.1| putative cysteine proteinase [Oryza sativa Japonica Group]
 gi|113537454|dbj|BAF09837.1| Os02g0715000 [Oryza sativa Japonica Group]
 gi|215737450|dbj|BAG96580.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|215765786|dbj|BAG87483.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|222623551|gb|EEE57683.1| hypothetical protein OsJ_08138 [Oryza sativa Japonica Group]
          Length = 366

 Score =  288 bits (737), Expect = 2e-75,   Method: Compositional matrix adjust.
 Identities = 156/351 (44%), Positives = 216/351 (61%), Gaps = 31/351 (8%)

Query: 18  MFIIITLLVSCASQVVSSRSTHEQSVVEIHEK--------------WMAQHGRSYKDELE 63
           M  ++   V+C++    + S H+ SVV   ++              W  +H + Y    E
Sbjct: 16  MLFLLLGFVACSA----TASHHDPSVVGYSQEDLALPNKLVGLFTSWSVKHSKIYASPKE 71

Query: 64  KEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTT- 122
           K  R +IFK NL +I + N+  N +Y LG N F+D+ ++EF+A Y G K P  + R    
Sbjct: 72  KVKRYEIFKRNLRHIVETNRR-NGSYWLGLNHFADIAHEEFKASYLGLK-PGLARRDAQP 129

Query: 123 --SSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNL 180
             S+TF+Y N    ++P ++DWR KGAVTP+KNQ ECG CWAF+ VAAVEGI +I +G L
Sbjct: 130 HGSTTFRYAN--AVNLPWAVDWRKKGAVTPVKNQGECGSCWAFSTVAAVEGINQIVTGKL 187

Query: 181 IQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAA 240
           + LSEQ+L+DC    N+GC GG  + AFAYI+ NQGI TE++YPY    G C   Q  + 
Sbjct: 188 VSLSEQELMDCDNTFNHGCRGGLMDFAFAYIMGNQGIYTEEDYPYLMEEGYCREKQPHSK 247

Query: 241 A-KISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTI 299
              I+ YE+VP+  E +LLKA++ QPVS+ IAA S +FQ YK GIF+G CG Q DHA+T 
Sbjct: 248 VITITGYEDVPANSETSLLKALAHQPVSVGIAAGSRDFQFYKGGIFDGECGIQPDHALTA 307

Query: 300 VGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR----DEGLCGIGTRSSYP 346
           VG+G+   G +Y ++KNSWG  WG+ GY +I R     EG+C I   +SYP
Sbjct: 308 VGYGSYY-GQDYIIMKNSWGKNWGEQGYFRIRRGTGKPEGVCDIYKIASYP 357


>gi|255646767|gb|ACU23856.1| unknown [Glycine max]
          Length = 350

 Score =  288 bits (737), Expect = 3e-75,   Method: Compositional matrix adjust.
 Identities = 140/309 (45%), Positives = 208/309 (67%), Gaps = 11/309 (3%)

Query: 43  VVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTND 102
           ++E+ E WM++HG+ Y++  EK +R +IFK+NL++I++ NK  +  Y LG ++F+DL++ 
Sbjct: 44  LIELFESWMSRHGKIYENIEEKLLRFEIFKDNLKHIDERNKVVS-NYWLGLSEFADLSHR 102

Query: 103 EFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWA 162
           EF   Y G K+   S R  +   F Y+++   ++P S+DWR KGAV P+KNQ  CG CWA
Sbjct: 103 EFNNKYLGLKVDY-SRRRESPEEFTYKDV---ELPKSVDWRKKGAVAPVKNQGSCGSCWA 158

Query: 163 FAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDE 222
           F+ VAAVEGI +I +GNL  LSEQ+L+DC    NNGC GG  + AF++I++N G+  E++
Sbjct: 159 FSTVAAVEGINQIVTGNLTSLSEQELIDCDRTYNNGCNGGLMDYAFSFIVENGGLHKEED 218

Query: 223 YPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYK 281
           YPY    G C   ++      IS Y +VP  +EQ+LLKA++ QP+S+AI A   +FQ Y 
Sbjct: 219 YPYIMEEGACEMTKEETQVVTISGYHDVPQNNEQSLLKALANQPLSVAIEASGRDFQFYS 278

Query: 282 EGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLC 337
            G+F+G CG+ LDH V  VG+GT + G +Y  +KNSWG+ WG+ GY+++ R+    EG+C
Sbjct: 279 GGVFDGHCGSDLDHGVAAVGYGTAK-GVDYITVKNSWGSKWGEKGYIRMRRNIGKPEGIC 337

Query: 338 GIGTRSSYP 346
           GI   +SYP
Sbjct: 338 GIYKMASYP 346


>gi|297802228|ref|XP_002868998.1| cysteine proteinase [Arabidopsis lyrata subsp. lyrata]
 gi|297314834|gb|EFH45257.1| cysteine proteinase [Arabidopsis lyrata subsp. lyrata]
          Length = 375

 Score =  288 bits (737), Expect = 3e-75,   Method: Compositional matrix adjust.
 Identities = 146/322 (45%), Positives = 206/322 (63%), Gaps = 16/322 (4%)

Query: 40  EQSVVEIHEKWMAQHGRSYKDEL----EKEMRLKIFKENLEYIEKAN-KEGNRTYKLGTN 94
           ++ V  I+ +W A HG++  +      +++ R  IFK+NL +I+  N K  N TYKLG  
Sbjct: 42  DEEVRSIYLQWSADHGKTNNNNNGIINDQDKRFNIFKDNLRFIDLHNEKNKNATYKLGLT 101

Query: 95  QFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTD---VPTSLDWRDKGAVTPI 151
           +F+DLTN+E+R+LY G +   P  R   +     +  +  D   VP ++DWR KGAV PI
Sbjct: 102 KFTDLTNEEYRSLYLGART-EPVRRIAKAKNVNQKYSAAVDGKEVPETVDWRLKGAVNPI 160

Query: 152 KNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYI 211
           K+Q  CG CWAF+  AAVEGI KI +G LI LSEQ+L+DC  + N GC GG  + AF +I
Sbjct: 161 KDQGTCGSCWAFSTAAAVEGINKIVTGELISLSEQELVDCDNSYNQGCNGGLMDYAFQFI 220

Query: 212 IQNQGIATEDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVSIAI 270
           ++N G+ TE +YPY+   G C++  K A    I  YE+VP+ DE AL +A+S+QPVS+AI
Sbjct: 221 MKNGGLKTEKDYPYRGFGGKCNSFLKNAKVVSIDGYEDVPTKDETALKRAISLQPVSVAI 280

Query: 271 AAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKI 330
            A    FQ Y+ GIF G CGT LDHAV  VG+G +E+G +YW+++NSWG  WG+ GY+++
Sbjct: 281 EAGGRIFQHYQTGIFTGNCGTNLDHAVVAVGYG-SENGVDYWIVRNSWGPRWGEEGYIRM 339

Query: 331 VRD-----EGLCGIGTRSSYPL 347
            R+      G CGI   +SYP+
Sbjct: 340 ERNLASSKSGKCGIAVEASYPV 361


>gi|224085750|ref|XP_002307688.1| predicted protein [Populus trichocarpa]
 gi|222857137|gb|EEE94684.1| predicted protein [Populus trichocarpa]
          Length = 436

 Score =  288 bits (737), Expect = 3e-75,   Method: Compositional matrix adjust.
 Identities = 137/312 (43%), Positives = 200/312 (64%), Gaps = 18/312 (5%)

Query: 45  EIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEF 104
           ++ E W  +HG+SY  + E+  RLK+F++N +++ K N +GN +Y L  N F+DLT+ EF
Sbjct: 27  QLFETWCKEHGKSYTSQEERSHRLKVFEDNYDFVTKHNSKGNSSYSLALNAFADLTHHEF 86

Query: 105 RALYTGYKMPSPSHRSTTSSTFKYQNLSMT----DVPTSLDWRDKGAVTPIKNQKECGCC 160
           +    G         S       ++NL +T    D+P S+DWR+KG VT +K+Q  CG C
Sbjct: 87  KTSRLGL--------SAAPLNLAHRNLEITGVVGDIPASIDWRNKGVVTNVKDQGSCGAC 138

Query: 161 WAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATE 220
           W+F+A  A+EGI KI +G+L+ LSEQ+L++C  + N+GC GG  + AF ++I N GI TE
Sbjct: 139 WSFSATGAIEGINKIVTGSLVSLSEQELIECDKSYNDGCGGGLMDYAFQFVINNHGIDTE 198

Query: 221 DEYPYQAVPGTCSAAQ-KPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQS 279
           ++YPY+A  GTC+  + K     I  Y +VP  +E+ LL+AV+ QPVS+ I      FQ 
Sbjct: 199 EDYPYRARDGTCNKDRMKRRVVTIDKYVDVPENNEKQLLQAVAAQPVSVGICGSERAFQM 258

Query: 280 YKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EG 335
           Y +GIF G C T LDHAV IVG+G +E+G +YW++KNSWG  WG  GYM + R+    +G
Sbjct: 259 YSKGIFTGPCSTSLDHAVLIVGYG-SENGVDYWIVKNSWGTGWGMRGYMHMQRNSGNSQG 317

Query: 336 LCGIGTRSSYPL 347
           +CGI   +SYP+
Sbjct: 318 VCGINMLASYPV 329


>gi|357156854|ref|XP_003577598.1| PREDICTED: thiol protease SEN102-like [Brachypodium distachyon]
          Length = 368

 Score =  288 bits (736), Expect = 4e-75,   Method: Compositional matrix adjust.
 Identities = 142/326 (43%), Positives = 206/326 (63%), Gaps = 21/326 (6%)

Query: 40  EQSVVEIHEKWMAQH-------GRSYKDEL---EKEMRLKIFKENLEYIEKANKEGNRTY 89
           E+S+  ++E+W +++       G   + +L   +   R  +FKEN++YI +ANK+ +R +
Sbjct: 31  EESLRGLYERWRSRYTVSPSTPGSGLRGKLADHDPARRFNVFKENVKYIHEANKK-DRPF 89

Query: 90  KLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTD---VPTSLDWRDKG 146
           +L  N+F+D+T DE R  Y G ++    HR+ +       N + +D   +P ++DWR+KG
Sbjct: 90  RLALNKFADMTTDELRHSYAGSRVRH--HRALSGGRRAQGNFTYSDAENLPPAVDWREKG 147

Query: 147 AVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREK 206
           AVT IK+Q +CG CWAF+ +AAVE I KIR+G L+ LSEQ+L+DC    + GC GG  + 
Sbjct: 148 AVTGIKDQGQCGSCWAFSTIAAVESINKIRTGKLVSLSEQELMDCDNVNDQGCDGGLMDY 207

Query: 207 AFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAA-AKISNYEEVPSGDEQALLKAVSMQP 265
           AF +I +N G+ +E  YPYQ    TC  A++      I  YE+VP+ DE AL KAV+ QP
Sbjct: 208 AFQFIQKNGGVTSEANYPYQGQQNTCDQAKENTHDVAIDGYEDVPANDESALQKAVAYQP 267

Query: 266 VSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDA 325
           VS+AI A   +FQ Y EG+F G C T LDH V  VG+GT  DG  YW++KNSWG  WG+ 
Sbjct: 268 VSVAIEASGQDFQFYSEGVFTGQCTTDLDHGVAAVGYGTARDGTKYWIVKNSWGLDWGEK 327

Query: 326 GYMKIVRD----EGLCGIGTRSSYPL 347
           GY+++ R     EGLCGI  ++SYP+
Sbjct: 328 GYIRMQRGVSQAEGLCGIAMQASYPI 353


>gi|357507617|ref|XP_003624097.1| Cysteine protease [Medicago truncatula]
 gi|355499112|gb|AES80315.1| Cysteine protease [Medicago truncatula]
          Length = 340

 Score =  288 bits (736), Expect = 4e-75,   Method: Compositional matrix adjust.
 Identities = 152/338 (44%), Positives = 216/338 (63%), Gaps = 19/338 (5%)

Query: 16  TPMFIIITLLVSCASQVVSSRSTHEQSVV--EIHEKWMAQHGRSYKDELEKEMRLKIFKE 73
           T + I+I + V   S   +    ++QS+   E ++ W  ++   YKD+ E+E  ++IFK 
Sbjct: 9   TLINILIVIWVMFPS---NQNQENDQSLTLSERYKHWKIKYRVIYKDDAEEEKHIQIFKH 65

Query: 74  NLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSM 133
           N+ YI+  N  GN++YKL  N+F+DL  +     +   K+       TTSS FKY+N+  
Sbjct: 66  NVAYIDSFNAAGNKSYKLTINRFADLPTEPSDDGFKKRKL-----EPTTSSLFKYKNI-- 118

Query: 134 TDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLD-CS 192
           TD+P ++DWR +GAVTP+KNQ+ECG CWAF+AV A+EGI +I SGNL+ LSEQ+L+D   
Sbjct: 119 TDIPAAVDWRKRGAVTPVKNQRECGSCWAFSAVGALEGIQQITSGNLVSLSEQELVDRVR 178

Query: 193 TNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSG 252
           +N  NGC GG    AF ++++N GIATE  YPY+ V G  ++ +     +I +YE+VP  
Sbjct: 179 SNWTNGCNGGYLIDAFEFVLENGGIATEASYPYRGVKGN-NSKKVSRQVQIKSYEQVPRN 237

Query: 253 DEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYW 312
            E +LLK V+ QPVS+ I   S   + Y  GIF G CGT+ +HAV IVG+GT+ DG  YW
Sbjct: 238 SEDSLLKVVANQPVSVGIDI-SGMIRFYSSGIFTGECGTKPNHAVIIVGYGTSNDGTKYW 296

Query: 313 LIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYP 346
           L+KNSWG  WG+  Y+++ RD    EGLCGI   +SYP
Sbjct: 297 LVKNSWGIRWGEKRYIRMKRDIDAKEGLCGIPMDASYP 334


>gi|222629675|gb|EEE61807.1| hypothetical protein OsJ_16426 [Oryza sativa Japonica Group]
          Length = 459

 Score =  287 bits (735), Expect = 4e-75,   Method: Compositional matrix adjust.
 Identities = 146/324 (45%), Positives = 202/324 (62%), Gaps = 12/324 (3%)

Query: 32  VVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKAN---KEGNRT 88
           +VS     E+    ++ +W A+HG+SY    E+E R   F++NL YI++ N     G  +
Sbjct: 26  IVSYGERSEEEARRLYAEWKAEHGKSYNAVGEEERRYAAFRDNLRYIDEHNAAADAGVHS 85

Query: 89  YKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAV 148
           ++LG N+F+DLTN+E+R  Y G +      R  +       N ++   P S+DWR KGAV
Sbjct: 86  FRLGLNRFADLTNEEYRDTYLGLRNKPRRERKVSDRYLAADNEAL---PESVDWRTKGAV 142

Query: 149 TPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAF 208
             IK+Q  CG CWAF+A+AAVEGI +I +G+LI LSEQ+L+DC T+ N GC GG  + AF
Sbjct: 143 AEIKDQGGCGSCWAFSAIAAVEGINQIVTGDLISLSEQELVDCDTSYNEGCNGGLMDYAF 202

Query: 209 AYIIQNQGIATEDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVS 267
            +II N GI TED+YPY+     C   +K A    I +YE+V    E +L KAV+ QPVS
Sbjct: 203 DFIINNGGIDTEDDYPYKGKDERCDVNRKNAKVVTIDSYEDVTPNSETSLQKAVANQPVS 262

Query: 268 IAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGY 327
           +AI A    FQ Y  GIF G CGT LDH V  VG+G TE+G +YW+++NSWG +WG++GY
Sbjct: 263 VAIEAGGRAFQLYSSGIFTGKCGTALDHGVAAVGYG-TENGKDYWIVRNSWGKSWGESGY 321

Query: 328 MKIVRD----EGLCGIGTRSSYPL 347
           +++ R+     G CGI    SYPL
Sbjct: 322 VRMERNIKASSGKCGIAVEPSYPL 345


>gi|109939734|sp|P25776.2|ORYA_ORYSJ RecName: Full=Oryzain alpha chain; Flags: Precursor
 gi|78192122|gb|ABB30151.1| oryzain alpha [Oryza sativa Japonica Group]
          Length = 458

 Score =  287 bits (735), Expect = 5e-75,   Method: Compositional matrix adjust.
 Identities = 146/324 (45%), Positives = 202/324 (62%), Gaps = 12/324 (3%)

Query: 32  VVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKAN---KEGNRT 88
           +VS     E+    ++ +W A+HG+SY    E+E R   F++NL YI++ N     G  +
Sbjct: 25  IVSYGERSEEEARRLYAEWKAEHGKSYNAVGEEERRYAAFRDNLRYIDEHNAAADAGVHS 84

Query: 89  YKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAV 148
           ++LG N+F+DLTN+E+R  Y G +      R  +       N ++   P S+DWR KGAV
Sbjct: 85  FRLGLNRFADLTNEEYRDTYLGLRNKPRRERKVSDRYLAADNEAL---PESVDWRTKGAV 141

Query: 149 TPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAF 208
             IK+Q  CG CWAF+A+AAVEGI +I +G+LI LSEQ+L+DC T+ N GC GG  + AF
Sbjct: 142 AEIKDQGGCGSCWAFSAIAAVEGINQIVTGDLISLSEQELVDCDTSYNEGCNGGLMDYAF 201

Query: 209 AYIIQNQGIATEDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVS 267
            +II N GI TED+YPY+     C   +K A    I +YE+V    E +L KAV+ QPVS
Sbjct: 202 DFIINNGGIDTEDDYPYKGKDERCDVNRKNAKVVTIDSYEDVTPNSETSLQKAVANQPVS 261

Query: 268 IAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGY 327
           +AI A    FQ Y  GIF G CGT LDH V  VG+G TE+G +YW+++NSWG +WG++GY
Sbjct: 262 VAIEAGGRAFQLYSSGIFTGKCGTALDHGVAAVGYG-TENGKDYWIVRNSWGKSWGESGY 320

Query: 328 MKIVRD----EGLCGIGTRSSYPL 347
           +++ R+     G CGI    SYPL
Sbjct: 321 VRMERNIKASSGKCGIAVEPSYPL 344


>gi|125540888|gb|EAY87283.1| hypothetical protein OsI_08685 [Oryza sativa Indica Group]
          Length = 357

 Score =  287 bits (735), Expect = 5e-75,   Method: Compositional matrix adjust.
 Identities = 156/351 (44%), Positives = 215/351 (61%), Gaps = 31/351 (8%)

Query: 18  MFIIITLLVSCASQVVSSRSTHEQSVVEIHEK--------------WMAQHGRSYKDELE 63
           M  ++   V+C++    + S H+ SVV   ++              W  +H + Y    E
Sbjct: 7   MLFLLLGFVACSA----TASHHDPSVVGYSQEDLALPNKLVGLFTSWSVKHSKIYASPKE 62

Query: 64  KEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTT- 122
           K  R +IFK NL +I + N+  N +Y LG N F+D+ ++EF+A Y G K P  + R    
Sbjct: 63  KVKRYEIFKRNLRHIVETNRR-NGSYWLGLNHFADIAHEEFKASYLGLK-PGLARRDAQP 120

Query: 123 --SSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNL 180
             S+TF+Y N    ++P ++DWR KGAVTP+KNQ ECG CWAF+ VAAVEGI +I +G L
Sbjct: 121 HGSTTFRYAN--AVNLPWAVDWRKKGAVTPVKNQGECGSCWAFSTVAAVEGINQIVTGKL 178

Query: 181 IQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAA 240
           + LSEQ+L+DC    N+GC GG  + AFAYI+ NQGI TE++YPY    G C   Q  + 
Sbjct: 179 VSLSEQELMDCDNTFNHGCRGGLMDFAFAYIMGNQGIYTEEDYPYLMEEGYCREKQPHSK 238

Query: 241 A-KISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTI 299
              I+ YE+VP   E +LLKA++ QPVS+ IAA S +FQ YK GIF+G CG Q DHA+T 
Sbjct: 239 VITITGYEDVPENSETSLLKALAHQPVSVGIAAGSRDFQFYKGGIFDGECGIQPDHALTA 298

Query: 300 VGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR----DEGLCGIGTRSSYP 346
           VG+G+   G +Y ++KNSWG  WG+ GY +I R     EG+C I   +SYP
Sbjct: 299 VGYGSYY-GQDYIIMKNSWGKNWGEQGYFRIRRGTGKPEGVCDIYKIASYP 348


>gi|297603535|ref|NP_001054211.2| Os04g0670200 [Oryza sativa Japonica Group]
 gi|109939735|sp|P25777.2|ORYB_ORYSJ RecName: Full=Oryzain beta chain; Flags: Precursor
 gi|32488398|emb|CAE02823.1| OSJNBa0043A12.28 [Oryza sativa Japonica Group]
 gi|90399163|emb|CAJ86092.1| H0818H01.14 [Oryza sativa Indica Group]
 gi|125550169|gb|EAY95991.1| hypothetical protein OsI_17862 [Oryza sativa Indica Group]
 gi|215766596|dbj|BAG98700.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|255675868|dbj|BAF16125.2| Os04g0670200 [Oryza sativa Japonica Group]
          Length = 466

 Score =  287 bits (735), Expect = 5e-75,   Method: Compositional matrix adjust.
 Identities = 141/310 (45%), Positives = 212/310 (68%), Gaps = 15/310 (4%)

Query: 47  HEKWMAQHGRSYKDEL--EKEMRLKIFKENLEYIEKANKEGNRT--YKLGTNQFSDLTND 102
           ++ W+A++G    + L  E E R  +F +NL++++  N   +    ++LG N+F+DLTN+
Sbjct: 52  YDLWLAENGGGSPNALGGEHERRFLVFWDNLKFVDAHNARADERGGFRLGMNRFADLTNE 111

Query: 103 EFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWA 162
           EFRA + G K+   + RS  +   +Y++  + ++P S+DWR+KGAV P+KNQ +CG CWA
Sbjct: 112 EFRATFLGAKV---AERSRAAGE-RYRHDGVEELPESVDWREKGAVAPVKNQGQCGSCWA 167

Query: 163 FAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-NNGCLGGSREKAFAYIIQNQGIATED 221
           F+AV+ VE I ++ +G +I LSEQ+L++CSTNG N+GC GG  + AF +II+N GI TED
Sbjct: 168 FSAVSTVESINQLVTGEMITLSEQELVECSTNGQNSGCNGGLMDDAFDFIIKNGGIDTED 227

Query: 222 EYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSY 280
           +YPY+AV G C   ++ A    I  +E+VP  DE++L KAV+ QPVS+AI A   EFQ Y
Sbjct: 228 DYPYKAVDGKCDINRENAKVVSIDGFEDVPQNDEKSLQKAVAHQPVSVAIEAGGREFQLY 287

Query: 281 KEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGL 336
             G+F+G CGT LDH V  VG+G T++G +YW+++NSWG  WG++GY+++ R+     G 
Sbjct: 288 HSGVFSGRCGTSLDHGVVAVGYG-TDNGKDYWIVRNSWGPKWGESGYVRMERNINVTTGK 346

Query: 337 CGIGTRSSYP 346
           CGI   +SYP
Sbjct: 347 CGIAMMASYP 356


>gi|356515062|ref|XP_003526220.1| PREDICTED: LOW QUALITY PROTEIN: vignain-like [Glycine max]
          Length = 337

 Score =  287 bits (734), Expect = 5e-75,   Method: Compositional matrix adjust.
 Identities = 142/308 (46%), Positives = 195/308 (63%), Gaps = 8/308 (2%)

Query: 47  HEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRA 106
           HEKWMAQHG+ YKD  EKE  L+IF+ N+E+IE  +  G++++ L TNQF+DL ++EF+A
Sbjct: 32  HEKWMAQHGKVYKDAAEKERCLQIFENNMEFIESFDVCGDKSFNLSTNQFADLHDEEFKA 91

Query: 107 LYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFA-A 165
           L T       S  +TT + F+Y N+  T +P S+DWR +G VTPIK+Q +C  CWAF+  
Sbjct: 92  LLTNGHKKEHSLWTTTETLFRYDNV--TKIPASMDWRKRGVVTPIKDQGKCLSCWAFSLC 149

Query: 166 VAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPY 225
           VA +EG+ +I +  L+ LSEQ+L+D     + GC G   E AF +I +   I +E  YPY
Sbjct: 150 VATIEGLHQIITSELVPLSEQELVDFVKGESEGCYGDYVEDAFKFITKKGRIESETHYPY 209

Query: 226 QAVPGTCSAAQKP-AAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGI 284
           + V  TC   ++    A+I  Y++VPS  E ALLKAV+ Q VS+++ A  + FQ Y  GI
Sbjct: 210 KGVNNTCKVKKETHGVAQIKGYKKVPSKSENALLKAVANQLVSVSVEARDSAFQFYSSGI 269

Query: 285 FNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIG 340
           F G CGT  DH V +  +G + DG  YWL KNSWG  WG+ GY++I  D    EGLCGI 
Sbjct: 270 FTGKCGTDTDHRVALASYGESGDGTKYWLAKNSWGTEWGEKGYIRIKXDIPAKEGLCGIA 329

Query: 341 TRSSYPLA 348
               YP+A
Sbjct: 330 KYPYYPIA 337


>gi|110737959|dbj|BAF00916.1| cysteine proteinase [Arabidopsis thaliana]
          Length = 376

 Score =  287 bits (734), Expect = 6e-75,   Method: Compositional matrix adjust.
 Identities = 146/322 (45%), Positives = 207/322 (64%), Gaps = 16/322 (4%)

Query: 40  EQSVVEIHEKWMAQHGRSYKDEL----EKEMRLKIFKENLEYIEKANKEG-NRTYKLGTN 94
           ++ V  I+ +W A+HG++  +      +++ R  IFK+NL +I+  N+   N TYKLG  
Sbjct: 42  DEEVRSIYLQWSAEHGKTNNNNNGIINDQDKRFNIFKDNLRFIDLHNENNKNATYKLGLT 101

Query: 95  QFSDLTNDEFRALYTGYKMPSPSHRSTTSSTF--KYQN-LSMTDVPTSLDWRDKGAVTPI 151
           +F+DLTNDE+R LY G +   P+ R   +     KY   ++  +VP ++DWR KGAV PI
Sbjct: 102 KFTDLTNDEYRKLYLGART-EPARRIAKAKNVNQKYSAAVNGKEVPETVDWRQKGAVNPI 160

Query: 152 KNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYI 211
           K+Q  CG CWAF+  AAVEGI KI +G LI LSEQ+L+DC  + N GC GG  + AF +I
Sbjct: 161 KDQGTCGSCWAFSTTAAVEGINKIVTGELISLSEQELVDCDKSYNQGCNGGLMDYAFQFI 220

Query: 212 IQNQGIATEDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVSIAI 270
           ++N G+ TE +YPY+   G C++  K +    I  YE+VP+ DE AL KA+S QPV +AI
Sbjct: 221 MKNGGLNTEKDYPYRGFGGKCNSFLKNSRVVSIDGYEDVPTKDETALKKAISYQPVRVAI 280

Query: 271 AAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKI 330
            A    FQ Y+ GIF G CGT LDHAV  VG+G +E+G +YW+++NSWG  WG+ GY+++
Sbjct: 281 EAGGRIFQHYQSGIFTGSCGTNLDHAVVAVGYG-SENGVDYWIVRNSWGPRWGEEGYIRM 339

Query: 331 VRD-----EGLCGIGTRSSYPL 347
            R+      G CGI   +SYP+
Sbjct: 340 ERNLAASKSGKCGIAVEASYPV 361


>gi|226506492|ref|NP_001140873.1| uncharacterized protein LOC100272949 precursor [Zea mays]
 gi|194701540|gb|ACF84854.1| unknown [Zea mays]
          Length = 379

 Score =  287 bits (734), Expect = 6e-75,   Method: Compositional matrix adjust.
 Identities = 144/355 (40%), Positives = 207/355 (58%), Gaps = 23/355 (6%)

Query: 12  KINTTPMFIIITLLVSCASQVVSSRSTHEQSVV------EIHEKWMAQHGRSYKDELEKE 65
           +++ T + + +  + S A ++  +    E+ +       +++E+W   H R ++   EK 
Sbjct: 3   QVSKTLLLVALVFVSSAAVELCRAIDFDERDLASDEALWDLYERWQTHH-RVHRHHGEKG 61

Query: 66  MRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKM------PSPSHR 119
            R   FKEN+ +I   NK G+R Y+L  N+F D+  +EFR+ +   ++       SP+ R
Sbjct: 62  RRFGTFKENVRFIHAHNKRGDRPYRLRLNRFGDMGREEFRSTFADSRINDLRRQDSPAAR 121

Query: 120 STTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGN 179
           +     F Y   S  D P S+DWR +GAVT +K Q  CG CWAF+ V AVEGI  IR+G+
Sbjct: 122 AGAVPGFMYD--SAADPPRSVDWRQEGAVTGVKVQGHCGSCWAFSTVVAVEGINAIRTGS 179

Query: 180 LIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQ--- 236
           L  LSEQ+L+DC T+  NGC GG  E AF +I    GI TE  YPY+A  GTC   +   
Sbjct: 180 LASLSEQELIDCDTD-ENGCQGGLMENAFEFIKSFGGITTEAAYPYRASNGTCDGDRARR 238

Query: 237 -KPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDH 295
                  I  ++ VP+G E AL KAV+ QPVS+A+ A    FQ Y EG+F G CGT LDH
Sbjct: 239 GGGVVVVIDGHQMVPAGSEDALAKAVAHQPVSVAVDAGGQAFQFYSEGVFTGDCGTDLDH 298

Query: 296 AVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR---DEGLCGIGTRSSYPL 347
            V  VG+G  +DG  YW++KNSWG +WG+ GY+++ R   + GLCGI   +S+P+
Sbjct: 299 GVAAVGYGVGDDGTPYWIVKNSWGTSWGEGGYIRMQRGAGNGGLCGIAMEASFPI 353


>gi|146215988|gb|ABQ10196.1| actinidin Act3a [Actinidia eriantha]
          Length = 380

 Score =  287 bits (734), Expect = 6e-75,   Method: Compositional matrix adjust.
 Identities = 150/341 (43%), Positives = 217/341 (63%), Gaps = 12/341 (3%)

Query: 13  INTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFK 72
           I+ + +F    L+ S A     S       V+ ++E W+ ++G+SY    E+EMR++IFK
Sbjct: 8   ISMSLLFFSTFLIFSFAIDAKISPLRTNDEVMALYESWLVKYGKSYNSLGEREMRIEIFK 67

Query: 73  ENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLS 132
           ENL +I++ N + NR+Y +G NQF+DLT++E+R+ Y G+K    S +S  S+ +  Q   
Sbjct: 68  ENLRFIDEHNADPNRSYTVGLNQFADLTDEEYRSTYLGFK---SSLKSKVSNRYMPQVGE 124

Query: 133 MTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS 192
           +  +P  +DWR  GAV  +KNQ  C  CWAFA +A VE I +I +G+LI LSEQ+L+DC+
Sbjct: 125 V--LPDYVDWRTTGAVVDVKNQGLCSSCWAFATIATVESINQIITGDLISLSEQELVDCN 182

Query: 193 -TNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAA-AKISNYEEVP 250
            T  N GC GG  + A+ +II N GI TE+ YPY      C   +K      I +YE+VP
Sbjct: 183 RTPINEGCKGGFMDDAYEFIINNGGINTEENYPYIGQDDQCDEPKKNQNYVTIDSYEQVP 242

Query: 251 SGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFN-GVCGTQLDHAVTIVGFGTTEDGA 309
             DE A+ +AV+ QPVS+AI AY   F+ Y+ GIF  G CGT L+HAVTI+G+G TE+G 
Sbjct: 243 PNDELAMKRAVAYQPVSVAIDAYCLGFRFYQSGIFTGGSCGTTLNHAVTIIGYG-TENGI 301

Query: 310 NYWLIKNSWGNTWGDAGYMKIVRD---EGLCGIGTRSSYPL 347
           +YW++KNS+G  WG++GY K+ R+   EG CGI +   YP+
Sbjct: 302 DYWIVKNSYGTQWGESGYGKVQRNVGGEGRCGIASYPFYPV 342


>gi|242066206|ref|XP_002454392.1| hypothetical protein SORBIDRAFT_04g029960 [Sorghum bicolor]
 gi|241934223|gb|EES07368.1| hypothetical protein SORBIDRAFT_04g029960 [Sorghum bicolor]
          Length = 356

 Score =  287 bits (734), Expect = 6e-75,   Method: Compositional matrix adjust.
 Identities = 147/312 (47%), Positives = 210/312 (67%), Gaps = 11/312 (3%)

Query: 43  VVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTND 102
           +V + + W  +H + Y    EK  R  IFK+NL +I + N++ N +Y LG NQF+D+T++
Sbjct: 41  LVNLFKSWSVKHRKIYVSPKEKLKRYGIFKQNLMHIAETNRK-NGSYWLGLNQFADITHE 99

Query: 103 EFRALYTGYK--MPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCC 160
           EF+A + G K  +     ++ T +TF+Y   +  ++P S+DWR KGAVTP+KNQ +CG C
Sbjct: 100 EFKANHLGLKQGLSRMGAQTRTPTTFRYA--AAANLPWSVDWRYKGAVTPVKNQGKCGSC 157

Query: 161 WAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATE 220
           WAF++VAAVEGI +I +G L+ LSEQ+L+DC T  ++GC GG  + AFAYI+ +QGI  E
Sbjct: 158 WAFSSVAAVEGINQIVTGKLVSLSEQELMDCDTMLDHGCEGGLMDFAFAYIMGSQGIHAE 217

Query: 221 DEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQS 279
           D+YPY    G C   Q  A    I+ YE+VP   E +LLKA++ QPVS+ IAA S +FQ 
Sbjct: 218 DDYPYLMEEGYCKEKQPYANVVTITGYEDVPENSEISLLKALAHQPVSVGIAAGSRDFQF 277

Query: 280 YKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIV----RDEG 335
           YK G+F+G C  +LDHA+T VG+G++  G NY  +KNSWG  WG+ GY++I     + EG
Sbjct: 278 YKGGVFDGSCSDELDHALTAVGYGSSY-GQNYITMKNSWGKNWGEQGYVRIKMGTGKPEG 336

Query: 336 LCGIGTRSSYPL 347
           +CGI T +SYP+
Sbjct: 337 VCGIYTMASYPV 348


>gi|357446979|ref|XP_003593765.1| Cysteine proteinase [Medicago truncatula]
 gi|355482813|gb|AES64016.1| Cysteine proteinase [Medicago truncatula]
          Length = 364

 Score =  287 bits (734), Expect = 6e-75,   Method: Compositional matrix adjust.
 Identities = 155/292 (53%), Positives = 197/292 (67%), Gaps = 11/292 (3%)

Query: 61  ELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRS 120
           ELEK  R +IFK NLEYIE  N  GN++YKLG NQ+SDLT+DEF A +TG K+      S
Sbjct: 78  ELEK--RKRIFKNNLEYIENFNNAGNKSYKLGLNQYSDLTSDEFLASHTGLKVSKQLSSS 135

Query: 121 TTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNL 180
              S     NL+  DVPT+ DWR +GAVT +K+Q  CGCCWAF+ VAAVEG  KI +G L
Sbjct: 136 KMRSAAVPFNLN-DDVPTNFDWRQQGAVTDVKDQGSCGCCWAFSVVAAVEGAVKINTGEL 194

Query: 181 IQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSA-AQKPA 239
           I LSEQQL+DC    N+GC GG+ + AF YIIQ +GI +E +YPYQ    TC    Q   
Sbjct: 195 ISLSEQQLVDCDER-NSGCHGGNMDSAFKYIIQ-KGIVSEADYPYQEGSQTCQLNDQMKF 252

Query: 240 AAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTI 299
            A+I+N+ +VP+ DEQ LL+AV+ QPVS+ I     EFQ Y   +++G CG  ++HAVT 
Sbjct: 253 EAQITNFIDVPANDEQQLLQAVAQQPVSVGIEV-GDEFQHYMGDVYSGTCGQSMNHAVTA 311

Query: 300 VGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRDE----GLCGIGTRSSYPL 347
           VG+G +EDG  YWLIKNSWG  WG+ GYMK++R+     G CGI   +SYP+
Sbjct: 312 VGYGVSEDGTKYWLIKNSWGKGWGEEGYMKLLRESGEPGGQCGIAAHASYPI 363


>gi|224102377|ref|XP_002312656.1| predicted protein [Populus trichocarpa]
 gi|222852476|gb|EEE90023.1| predicted protein [Populus trichocarpa]
          Length = 358

 Score =  286 bits (733), Expect = 8e-75,   Method: Compositional matrix adjust.
 Identities = 146/317 (46%), Positives = 202/317 (63%), Gaps = 16/317 (5%)

Query: 40  EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDL 99
           E+ + +++E+W + H  S +   EK+ R  +FKENL++I K N + +R YKL  N F+D+
Sbjct: 33  EERLRDLYERWRSHHTVS-RSLAEKQERFNVFKENLKHIHKVNHK-DRPYKLKLNSFADM 90

Query: 100 TNDEFRALYTGYKMPS----PSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQK 155
           TN EF   Y G K+         R  T S  +      + +P+S+DWR  GAVT IK+Q 
Sbjct: 91  TNHEFLQHYGGSKVSHYRVLRGQRQGTGSMHE----DTSKLPSSVDWRKNGAVTGIKDQG 146

Query: 156 ECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQ 215
           +CG CWAF+ VAAVEGI KI++G LI LSEQ+L+DC ++ N+GC GG  E AF +I Q  
Sbjct: 147 KCGSCWAFSTVAAVEGINKIKTGELISLSEQELVDCDSD-NHGCNGGLMEDAFNFIKQIG 205

Query: 216 GIATEDEYPYQAVPGTC-SAAQKPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYS 274
           G+ +E+ YPY+A    C S         I  YE VP  DE AL+KAV+ QPV+IA+ A  
Sbjct: 206 GLTSENTYPYRAKEEPCDSNKMNSPVVNIDGYEMVPENDENALMKAVANQPVAIAMDAGG 265

Query: 275 TEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR-- 332
            + Q Y E IF G CGT+L+H V +VG+GTT+DG  YW++KNSWG  WG+ GY+++ R  
Sbjct: 266 KDLQFYSEAIFTGDCGTELNHGVALVGYGTTQDGTKYWIVKNSWGTDWGEKGYIRMQRGI 325

Query: 333 --DEGLCGIGTRSSYPL 347
             +EGLCGI   +SYP+
Sbjct: 326 DAEEGLCGITMEASYPV 342


>gi|115477767|ref|NP_001062479.1| Os08g0556900 [Oryza sativa Japonica Group]
 gi|42407937|dbj|BAD09076.1| putative cysteine proteinase [Oryza sativa Japonica Group]
 gi|113624448|dbj|BAF24393.1| Os08g0556900 [Oryza sativa Japonica Group]
 gi|125562525|gb|EAZ07973.1| hypothetical protein OsI_30231 [Oryza sativa Indica Group]
 gi|215701458|dbj|BAG92882.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 385

 Score =  286 bits (733), Expect = 8e-75,   Method: Compositional matrix adjust.
 Identities = 147/320 (45%), Positives = 201/320 (62%), Gaps = 19/320 (5%)

Query: 40  EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDL 99
           E+++ E++E+W  QH R  +D  EK  R  +FK+N+  I + N+  +  YKL  N+F D+
Sbjct: 41  EEALWELYERWRGQH-RVARDLGEKARRFNVFKDNVRLIHEFNRR-DEPYKLRLNRFGDM 98

Query: 100 TNDEFRALYTGYKMPSPSH------RSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKN 153
           T DEFR  Y   ++   SH      R    S F Y      D+P ++DWR+KGAV  +K+
Sbjct: 99  TADEFRRAYASSRV---SHHRMFRGRGERRSGFMYAG--ARDLPAAVDWREKGAVGAVKD 153

Query: 154 QKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GNNGCLGGSREKAFAYII 212
           Q +CG CWAF+ +AAVEGI  IR+ NL  LSEQQL+DC T  GN GC GG  + AF YI 
Sbjct: 154 QGQCGSCWAFSTIAAVEGINAIRTSNLTALSEQQLVDCDTKTGNAGCDGGLMDNAFQYIA 213

Query: 213 QNQGIATEDEYPYQAVPGTCSAAQKPAA-AKISNYEEVPSGDEQALLKAVSMQPVSIAIA 271
           ++ G+A    YPY+A   +C ++   +    I  YE+VP+  E AL KAV+ QPVS+AI 
Sbjct: 214 KHGGVAASSAYPYRARQSSCKSSAASSPAVTIDGYEDVPANSESALKKAVANQPVSVAIE 273

Query: 272 AYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIV 331
           A  + FQ Y EG+F G CGT+LDH V  VG+GTT DG  YW+++NSWG  WG+ GY+++ 
Sbjct: 274 AGGSHFQFYSEGVFAGKCGTELDHGVAAVGYGTTVDGTKYWIVRNSWGADWGEKGYIRMK 333

Query: 332 RD----EGLCGIGTRSSYPL 347
           RD    EGLCGI   +SYP+
Sbjct: 334 RDVSAKEGLCGIAMEASYPI 353


>gi|302763837|ref|XP_002965340.1| hypothetical protein SELMODRAFT_143126 [Selaginella moellendorffii]
 gi|302790566|ref|XP_002977050.1| hypothetical protein SELMODRAFT_232903 [Selaginella moellendorffii]
 gi|300155026|gb|EFJ21659.1| hypothetical protein SELMODRAFT_232903 [Selaginella moellendorffii]
 gi|300167573|gb|EFJ34178.1| hypothetical protein SELMODRAFT_143126 [Selaginella moellendorffii]
          Length = 300

 Score =  286 bits (732), Expect = 1e-74,   Method: Compositional matrix adjust.
 Identities = 140/303 (46%), Positives = 198/303 (65%), Gaps = 8/303 (2%)

Query: 46  IHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFR 105
           + E W A+HG+SY  + EK  RL IF + L YIEK N   N T+ LG N+FSDLTN EFR
Sbjct: 1   MFEGWAAKHGKSYSSDWEKARRLMIFSDTLAYIEKHNALPNTTFTLGLNKFSDLTNAEFR 60

Query: 106 ALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAA 165
           A Y G K   P ++    +  K  ++ ++ +PTSLDWR +GAVTPIK+Q +CG CWAF+A
Sbjct: 61  ANYVG-KFKPPRYQDRRPA--KDVDVDVSSLPTSLDWRQEGAVTPIKDQGQCGSCWAFSA 117

Query: 166 VAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPY 225
           +A++E    + +  L+ LSEQQL+DC T  + GC GG  E AF ++++N G+ TE+ YPY
Sbjct: 118 IASIESAHFLATKELVSLSEQQLIDCDTV-DQGCQGGFPEDAFKFVVENGGVTTEEAYPY 176

Query: 226 QAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIF 285
               G+C+ A K    +I+ Y++V      AL+KAVS  PV++ I      FQ+Y+ GI 
Sbjct: 177 TGFAGSCN-ANKNKVVEITGYKDVTKDSADALMKAVSKTPVTVGICGSDQNFQNYRSGIL 235

Query: 286 NGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD--EGLCGIGTRS 343
           +G C    DHAV ++G+G TE G  YW+IKNSWG +WG+ G+M+I ++  EG+CG+  +S
Sbjct: 236 SGHCSNSRDHAVLVIGYG-TEGGMPYWIIKNSWGTSWGEDGFMRIKKEDGEGMCGMNGQS 294

Query: 344 SYP 346
           SYP
Sbjct: 295 SYP 297


>gi|30141023|dbj|BAC75925.1| cysteine protease-3 [Helianthus annuus]
          Length = 348

 Score =  286 bits (732), Expect = 1e-74,   Method: Compositional matrix adjust.
 Identities = 140/317 (44%), Positives = 203/317 (64%), Gaps = 17/317 (5%)

Query: 40  EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDL 99
           ++S+ +++E+W +QH  S   + EK+ R  +FK N+ +I + N+ G + YKL  N+F+D+
Sbjct: 33  DKSLWDLYERWGSQHMVSRAPD-EKKKRFNVFKYNVNHINRVNQLG-KPYKLKLNEFADM 90

Query: 100 TNDEFRALYTG----YKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQK 155
           TN EF+A +      ++M     R T      + +   TD P S+DWR  GAV PIKNQ 
Sbjct: 91  TNHEFKAGFDSKILHFRMLKGKRRQTP-----FTHAKTTDPPPSIDWRTNGAVNPIKNQG 145

Query: 156 ECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQ 215
            CG CWAF+ +  VEGI KI++  L+ LSEQ+L+DC T+   GC GG  E  + +I +  
Sbjct: 146 RCGSCWAFSTIVGVEGINKIKTNQLVSLSEQELVDCETDCE-GCNGGLMENGYEFIKETG 204

Query: 216 GIATEDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYS 274
           G+ TE  YPY A  G C  +++ +   KI  +E VP+ DE A+L+AV+ QPVSIAI A  
Sbjct: 205 GVTTEQIYPYFARNGRCDISKRNSPVVKIDGFENVPANDESAMLRAVANQPVSIAIDAGG 264

Query: 275 TEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR-- 332
             FQ Y +G+FNG CGT+L+H V IVG+GTT+DG NYW+++NSWG  WG+ GY+++ R  
Sbjct: 265 LNFQFYSQGVFNGACGTELNHGVAIVGYGTTQDGTNYWIVRNSWGTGWGEQGYVRMQRGV 324

Query: 333 --DEGLCGIGTRSSYPL 347
              EGLCG+   +SYP+
Sbjct: 325 NVPEGLCGLAMDASYPI 341


>gi|218195711|gb|EEC78138.1| hypothetical protein OsI_17694 [Oryza sativa Indica Group]
          Length = 458

 Score =  286 bits (732), Expect = 1e-74,   Method: Compositional matrix adjust.
 Identities = 145/324 (44%), Positives = 202/324 (62%), Gaps = 12/324 (3%)

Query: 32  VVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKAN---KEGNRT 88
           +VS     E+    ++ +W A+HG++Y    E+E R   F++NL YI++ N     G  +
Sbjct: 25  IVSYGERSEEEARRLYAEWKAEHGKNYNAVGEEERRYAAFRDNLRYIDEHNAAADAGVHS 84

Query: 89  YKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAV 148
           ++LG N+F+DLTN+E+R  Y G +      R  +       N ++   P S+DWR KGAV
Sbjct: 85  FRLGLNRFADLTNEEYRDTYLGLRNKPRRERKVSDRYLAADNEAL---PESVDWRTKGAV 141

Query: 149 TPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAF 208
             IK+Q  CG CWAF+A+AAVEGI +I +G+LI LSEQ+L+DC T+ N GC GG  + AF
Sbjct: 142 AEIKDQGGCGSCWAFSAIAAVEGINQIVTGDLISLSEQELVDCDTSYNEGCNGGLMDYAF 201

Query: 209 AYIIQNQGIATEDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVS 267
            +II N GI TED+YPY+     C   +K A    I +YE+V    E +L KAV+ QPVS
Sbjct: 202 DFIINNGGIDTEDDYPYKGKDERCDVNRKNAKVVTIDSYEDVTPNSETSLQKAVANQPVS 261

Query: 268 IAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGY 327
           +AI A    FQ Y  GIF G CGT LDH V  VG+G TE+G +YW+++NSWG +WG++GY
Sbjct: 262 VAIEAGGRAFQLYSSGIFTGKCGTALDHGVAAVGYG-TENGKDYWIVRNSWGKSWGESGY 320

Query: 328 MKIVRD----EGLCGIGTRSSYPL 347
           +++ R+     G CGI    SYPL
Sbjct: 321 VRMERNIKASSGKCGIAVEPSYPL 344


>gi|194352762|emb|CAQ00109.1| papain-like cysteine proteinase [Hordeum vulgare subsp. vulgare]
 gi|326517250|dbj|BAJ99991.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 367

 Score =  286 bits (732), Expect = 1e-74,   Method: Compositional matrix adjust.
 Identities = 149/315 (47%), Positives = 199/315 (63%), Gaps = 9/315 (2%)

Query: 40  EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDL 99
           E S+  ++E+W  QH  + +D  EK  R  +F+EN+  I + N+ G+  YKL  N+F D+
Sbjct: 40  EDSLWALYERWREQHTVA-RDLGEKARRFNVFRENVRLIHEFNR-GDAPYKLRLNRFGDM 97

Query: 100 TNDEFRALYTGYKMPSPSHRSTTSSTFKYQN---LSMTDVPTSLDWRDKGAVTPIKNQKE 156
           T DEFR  Y   ++      S       + +    S+ DVP S+DWR KGAVT +K+Q +
Sbjct: 98  TADEFRRAYASSRVSHHRMFSLKEGGGGFMHGSAASVRDVPPSVDWRQKGAVTAVKDQGQ 157

Query: 157 CGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQG 216
           CG CWAF+ +AAVEGI  IRS NL  LSEQQL+DC T  N GC GG  + AF YI ++ G
Sbjct: 158 CGSCWAFSTIAAVEGINAIRSKNLTSLSEQQLVDCDTKSNAGCNGGLMDYAFQYIAKHGG 217

Query: 217 IATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTE 276
           +A ED YPY+A   +    +  A   I  YE+VP+ DE AL KAV+ QPV++AI A  + 
Sbjct: 218 VAAEDAYPYKARQASSCNKKPSAVVTIDGYEDVPANDETALKKAVAAQPVAVAIEASGSH 277

Query: 277 FQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD--- 333
           FQ Y EG+F G CGT+LDH V  VG+GTT DG  YW++KNSWG  WG+ GY+++ RD   
Sbjct: 278 FQFYSEGVFAGKCGTELDHGVAAVGYGTTVDGTKYWIVKNSWGPEWGEKGYIRMKRDVKD 337

Query: 334 -EGLCGIGTRSSYPL 347
            EGLCGI   +SYP+
Sbjct: 338 KEGLCGIAMEASYPV 352


>gi|2224812|emb|CAB09699.1| cysteine endopeptidase EP-A [Hordeum vulgare subsp. vulgare]
          Length = 365

 Score =  286 bits (732), Expect = 1e-74,   Method: Compositional matrix adjust.
 Identities = 143/317 (45%), Positives = 197/317 (62%), Gaps = 14/317 (4%)

Query: 40  EQSVVEIHEKWMAQHGRSYKD--ELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFS 97
           E+S+  ++E+W + +  S +      +E R  +FK+N  Y+ + NK  +  ++L  N+F+
Sbjct: 34  EESLRGLYERWRSHYTVSRRGLGADAEERRFNVFKQNARYVHEGNKR-DMPFRLALNKFA 92

Query: 98  DLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTD---VPTSLDWRDKGAVTPIKNQ 154
           D+T DEFR  Y G ++    H S +            D   +P ++DWR KGAVT IK+Q
Sbjct: 93  DMTTDEFRRTYAGSRVRH--HLSLSGGRRGDGGFRYGDADNLPPAVDWRQKGAVTAIKDQ 150

Query: 155 KECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQN 214
            +CG CWAF+ + AVEGI KIR+G L+ LSEQ+L+DC    N GC GG  + AF +I +N
Sbjct: 151 GQCGSCWAFSTIVAVEGINKIRTGKLVSLSEQELMDCDNVNNQGCDGGLMDYAFQFIQKN 210

Query: 215 QGIATEDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAY 273
            GI TE  YPYQ   G+C  A++ A A  I  YE+VP+ DE AL KAV+ QPVS+AI A 
Sbjct: 211 -GITTESNYPYQGEQGSCDQAKENAQAVTIDGYEDVPANDESALQKAVAGQPVSVAIDAS 269

Query: 274 STEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD 333
             +FQ Y EG+F G C T LDH V  VG+G T DG  YW++KNSWG  WG+ GY+++ R 
Sbjct: 270 GQDFQFYSEGVFTGECSTDLDHGVAAVGYGATRDGTKYWIVKNSWGEDWGEKGYIRMQRG 329

Query: 334 ----EGLCGIGTRSSYP 346
               EGLCGI  ++SYP
Sbjct: 330 VSQTEGLCGIAMQASYP 346


>gi|218183|dbj|BAA14403.1| oryzain beta precursor [Oryza sativa Japonica Group]
          Length = 471

 Score =  286 bits (731), Expect = 1e-74,   Method: Compositional matrix adjust.
 Identities = 141/310 (45%), Positives = 211/310 (68%), Gaps = 15/310 (4%)

Query: 47  HEKWMAQHGRSYKDEL--EKEMRLKIFKENLEYIEKANKEGNRT--YKLGTNQFSDLTND 102
           ++ W+A++G    + L  E E R  +F +NL++++  N   +    ++LG N+F+DLTN+
Sbjct: 51  YDLWLAENGGGSPNALGGEHERRFLVFWDNLKFVDAHNARADEGGGFRLGMNRFADLTNE 110

Query: 103 EFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWA 162
           EFRA + G K+   + RS  +   +Y++  + ++P S+DWR+KGAV P+KNQ +CG CWA
Sbjct: 111 EFRATFLGAKV---AERSRAAGE-RYRHDGVEELPESVDWREKGAVAPVKNQGQCGSCWA 166

Query: 163 FAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-NNGCLGGSREKAFAYIIQNQGIATED 221
           F+AV+ VE I ++ +G +I LSEQ+L++CSTNG N+GC GG    AF +II+N GI TED
Sbjct: 167 FSAVSTVESINQLVTGEMITLSEQELVECSTNGQNSGCNGGLMADAFDFIIKNGGIDTED 226

Query: 222 EYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSY 280
           +YPY+AV G C   ++ A    I  +E+VP  DE++L KAV+ QPVS+AI A   EFQ Y
Sbjct: 227 DYPYKAVDGKCDINRENAKVVSIDGFEDVPQNDEKSLQKAVAHQPVSVAIEAGGREFQLY 286

Query: 281 KEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGL 336
             G+F+G CGT LDH V  VG+G T++G +YW+++NSWG  WG++GY+++ R+     G 
Sbjct: 287 HSGVFSGRCGTSLDHGVVAVGYG-TDNGKDYWIVRNSWGPKWGESGYVRMERNINVTTGK 345

Query: 337 CGIGTRSSYP 346
           CGI   +SYP
Sbjct: 346 CGIAMMASYP 355


>gi|242032709|ref|XP_002463749.1| hypothetical protein SORBIDRAFT_01g005350 [Sorghum bicolor]
 gi|241917603|gb|EER90747.1| hypothetical protein SORBIDRAFT_01g005350 [Sorghum bicolor]
          Length = 381

 Score =  285 bits (730), Expect = 2e-74,   Method: Compositional matrix adjust.
 Identities = 145/324 (44%), Positives = 201/324 (62%), Gaps = 22/324 (6%)

Query: 40  EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNR-TYKLGTNQFSD 98
           ++++ +++E+W   H R ++   EK  R   FKEN+ +I   NK G+R +Y+L  N+F D
Sbjct: 39  DEALWDLYERWQTHH-RVHRHHGEKGRRFGTFKENVRFIHAHNKRGDRPSYRLRLNRFGD 97

Query: 99  LTNDEFRALYTG--------YKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTP 150
           +  +EFR+ +          Y+  SP+  +T    F Y +   TDVP S+DWR  GAVT 
Sbjct: 98  MGPEEFRSTFADSRINDLRRYRESSPA--ATAVPGFMYDD--ATDVPRSVDWRQHGAVTA 153

Query: 151 IKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAY 210
           +KNQ  CG CWAF+ V AVEGI  IR+G+L+ LSEQ+L+DC T   NGC GG  E AF +
Sbjct: 154 VKNQGRCGSCWAFSTVVAVEGINAIRTGSLVSLSEQELVDCDT-AENGCQGGLMENAFDF 212

Query: 211 IIQNQGIATEDEYPYQAVPGTCS---AAQKPAAAKISNYEEVPSGDEQALLKAVSMQPVS 267
           I    GI TE  YPY+A  GTC    A +      I  ++ VP+G E AL KAV+ QPVS
Sbjct: 213 IKSYGGITTESAYPYRASNGTCDGMRARRGRVHVSIDGHQMVPTGSEDALAKAVARQPVS 272

Query: 268 IAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTE-DGANYWLIKNSWGNTWGDAG 326
           +AI A    FQ Y EG+F G CGT LDH V +VG+G ++ DG  YW++KNSWG +WG+ G
Sbjct: 273 VAIDAGGQAFQFYSEGVFTGDCGTDLDHGVAVVGYGVSDVDGTPYWIVKNSWGPSWGEGG 332

Query: 327 YMKIVR---DEGLCGIGTRSSYPL 347
           Y+++ R   + GLCGI   +S+P+
Sbjct: 333 YIRMQRGAGNGGLCGIAMEASFPI 356


>gi|168006315|ref|XP_001755855.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162693174|gb|EDQ79528.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 454

 Score =  285 bits (730), Expect = 2e-74,   Method: Compositional matrix adjust.
 Identities = 144/317 (45%), Positives = 203/317 (64%), Gaps = 14/317 (4%)

Query: 39  HEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSD 98
           +E+ + E    W  +HG+ Y    E   R  ++K+NLEYI++ + E NR+Y LG  +F+D
Sbjct: 38  NERLLSEQFGAWAHKHGKVYSSLEEHAHRYMVWKDNLEYIQR-HSEKNRSYWLGLTKFAD 96

Query: 99  LTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECG 158
           +TNDEFR  YTG ++   S RS   + F+Y +   ++ P S+DWR KGAVT +K+Q  CG
Sbjct: 97  ITNDEFRRQYTGTRIDR-SKRSKRKTGFRYAD---SEAPESVDWRKKGAVTTVKDQGSCG 152

Query: 159 CCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIA 218
            CWAF+A+ +VEGI  IR+G  + LSEQ+L+DC    N GC GG  + AF +I++N GI 
Sbjct: 153 SCWAFSAIGSVEGINAIRTGEAVSLSEQELVDCDLEYNQGCNGGLMDYAFDFILENGGID 212

Query: 219 TEDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEF 277
           TE++YPY+ + G C   +K A    I  YE+VP  DE+AL KAV+ QPVS+AI A   +F
Sbjct: 213 TENDYPYKGLDGRCDNNKKNAHVVTIDGYEDVPENDEEALKKAVAGQPVSVAIEAGGRDF 272

Query: 278 QSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD---- 333
           Q Y  G+F G CGT LDH V  VG+G +E   +YW++KNSWG  WG++GY+++ R+    
Sbjct: 273 QLYSGGVFTGECGTDLDHGVLAVGYG-SEGSLDYWIVKNSWGEYWGESGYLRMQRNIKDS 331

Query: 334 ---EGLCGIGTRSSYPL 347
               GLCGI    SY +
Sbjct: 332 NHQFGLCGINIEPSYAV 348


>gi|224083362|ref|XP_002306996.1| predicted protein [Populus trichocarpa]
 gi|222856445|gb|EEE93992.1| predicted protein [Populus trichocarpa]
          Length = 336

 Score =  285 bits (730), Expect = 2e-74,   Method: Compositional matrix adjust.
 Identities = 142/314 (45%), Positives = 208/314 (66%), Gaps = 10/314 (3%)

Query: 38  THEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFS 97
           T    ++++ E W+++H + Y+   EK  R +IFK+NL +I++ NK+    Y LG N+F+
Sbjct: 24  TSRDRIIDLFESWISKHQKIYESIEEKWHRFEIFKDNLFHIDETNKK-VVNYWLGLNEFA 82

Query: 98  DLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKEC 157
           DL+++EF+  Y G  +   S+R   S  F Y+++S   +P S+DWR KGAVT +KNQ  C
Sbjct: 83  DLSHEEFKNKYLGLNV-DLSNRRECSEEFTYKDVS--SIPKSVDWRKKGAVTDVKNQGSC 139

Query: 158 GCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGI 217
           G CWAF+ VAAVEGI +I +GNL  LSEQ+L+DC T  NNGC GG  + AFAYII N G+
Sbjct: 140 GSCWAFSTVAAVEGINQIVTGNLTSLSEQELVDCDTTYNNGCNGGLMDYAFAYIISNGGL 199

Query: 218 ATEDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTE 276
             E++YPY    GTC   +  +    IS Y +VP   E++LLKA++ QP+S+AI A   +
Sbjct: 200 HKEEDYPYIMEEGTCEMRKAESEVVTISGYHDVPQNSEESLLKALANQPLSVAIDASGRD 259

Query: 277 FQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD--- 333
           FQ Y  G+F+G CGT+LDH V  VG+G+ + G ++ ++KNSWG+ WG+ G++++ R+   
Sbjct: 260 FQFYSGGVFDGHCGTELDHGVAAVGYGSAK-GLDFIVVKNSWGSKWGEKGFIRMKRNTGK 318

Query: 334 -EGLCGIGTRSSYP 346
             GLCGI   +SYP
Sbjct: 319 PAGLCGINKMASYP 332


>gi|2351107|dbj|BAA21929.1| bromelain [Ananas comosus]
          Length = 312

 Score =  285 bits (730), Expect = 2e-74,   Method: Compositional matrix adjust.
 Identities = 138/299 (46%), Positives = 194/299 (64%), Gaps = 7/299 (2%)

Query: 51  MAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTG 110
           MA++GR YKD  EK  R +IFK N+ +IE  N     +Y LG N+F+D+TN+EF A YTG
Sbjct: 1   MAEYGRVYKDNDEKMRRFQIFKNNVNHIETFNNRNGNSYTLGINKFTDMTNNEFVAQYTG 60

Query: 111 YKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVE 170
             +  P +         + +++++ V  S+DWRD GAVT +K+Q  CG CWAF+A+A VE
Sbjct: 61  -GISRPLNIEK-EPVVSFDDVNISAVGQSIDWRDYGAVTEVKDQNPCGSCWAFSAIATVE 118

Query: 171 GITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPG 230
           GI KI +G L+ LSEQ++LDC+ +  NGC GG  + A+ +II N G+A+E +YPYQA  G
Sbjct: 119 GIYKIVTGYLVSLSEQEVLDCAVS--NGCDGGFVDNAYDFIISNNGVASEADYPYQAYQG 176

Query: 231 TCSAAQKPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCG 290
            C+A   P +A I+ Y  V S DE ++  AV  QP++ AI A    FQ Y  G+F+G CG
Sbjct: 177 DCAANSWPNSAYITGYSYVRSNDESSMKYAVWNQPIAAAIDASGDNFQYYNGGVFSGPCG 236

Query: 291 TQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR---DEGLCGIGTRSSYP 346
           T L+HA+TI+G+G    G  YW++KNSWG++WG+ GY+++ R     GLCGI     YP
Sbjct: 237 TSLNHAITIIGYGQDSSGTQYWIVKNSWGSSWGERGYIRMARGVSSSGLCGIAMDPLYP 295


>gi|18141281|gb|AAL60578.1|AF454956_1 senescence-associated cysteine protease [Brassica oleracea]
          Length = 445

 Score =  285 bits (730), Expect = 2e-74,   Method: Compositional matrix adjust.
 Identities = 142/321 (44%), Positives = 211/321 (65%), Gaps = 12/321 (3%)

Query: 33  VSSRSTHEQ-SVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKL 91
           V++++ H     V++ E+W+ ++ ++Y    EK+ R +IF +NL+++++ N   N++Y+L
Sbjct: 22  VTAKADHRNPEEVKMFERWLVENHKNYNGLGEKDKRFEIFMDNLKFVQEHNSVPNQSYEL 81

Query: 92  GTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPI 151
           G  +F+DLTN+EFRA+Y   KM     R +  S     N+    +P  +DWR KGAV P+
Sbjct: 82  GLTRFADLTNEEFRAIYLRSKMERT--RDSVKSERYLHNVG-DKLPDEVDWRAKGAVVPV 138

Query: 152 KNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYI 211
           K+Q  CG CWAF+A+ AVEGI +I++G L+ LSEQ+L+DC T+ NNGC GG  + AF +I
Sbjct: 139 KDQGSCGSCWAFSAIGAVEGINQIKTGELVSLSEQELVDCDTSYNNGCGGGLMDYAFQFI 198

Query: 212 IQNQGIATEDEYPYQAVPGT-CSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVSIA 269
           I N GI TE++YPY A     C+  +K      I  YE+VP  +E +L KA++ QP+S+A
Sbjct: 199 ISNGGIDTEEDYPYTATDDNICNTDKKNTRVVTIDGYEDVPE-NENSLKKALANQPISVA 257

Query: 270 IAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMK 329
           I A    FQ YK G+F G CGT LDH V  VG+GT+E G +YW+I+NSWG+ WG++GY+K
Sbjct: 258 IEAGGRGFQLYKSGVFTGTCGTALDHGVVAVGYGTSE-GQDYWIIRNSWGSNWGESGYIK 316

Query: 330 IVRD----EGLCGIGTRSSYP 346
           + R+     G CG+   +SYP
Sbjct: 317 LQRNIKDSSGKCGVAMMASYP 337


>gi|302790570|ref|XP_002977052.1| hypothetical protein SELMODRAFT_268054 [Selaginella moellendorffii]
 gi|300155028|gb|EFJ21661.1| hypothetical protein SELMODRAFT_268054 [Selaginella moellendorffii]
          Length = 300

 Score =  285 bits (730), Expect = 2e-74,   Method: Compositional matrix adjust.
 Identities = 140/303 (46%), Positives = 197/303 (65%), Gaps = 8/303 (2%)

Query: 46  IHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFR 105
           + E W A+HG+SY  + EK  RL IF + L YIEK N   N T+ LG N+FSDLTN EFR
Sbjct: 1   MFEGWAAKHGKSYSSDWEKARRLMIFSDTLAYIEKHNALPNTTFTLGLNKFSDLTNAEFR 60

Query: 106 ALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAA 165
           A Y G K   P ++    +  K  ++ ++ +PTSLDWR +GAVTPIK+Q +CG CWAF+A
Sbjct: 61  ANYVG-KFKPPRYQDRRPA--KDVDVDVSSLPTSLDWRQEGAVTPIKDQGQCGSCWAFSA 117

Query: 166 VAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPY 225
           +A++E    + +  L+ LSEQQL+DC T  + GC GG  E AF ++++N G+ TE+ YPY
Sbjct: 118 IASIESAHFLATKELVSLSEQQLIDCDTV-DQGCQGGFPEDAFKFVVENGGVTTEEAYPY 176

Query: 226 QAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIF 285
               G+C+ A K    +I+ Y++V      AL+KAVS  PV++ I      FQ+Y+ GI 
Sbjct: 177 TGFAGSCN-ANKNKVVEITGYKDVTKDSADALMKAVSKTPVTVGICGSDQNFQNYRSGIL 235

Query: 286 NGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD--EGLCGIGTRS 343
           +G C    DHAV ++G+G TE G  YW+IKNSWG +WG+ G+M+I +   EG+CG+  +S
Sbjct: 236 SGHCSNSRDHAVLVIGYG-TEGGMPYWIIKNSWGTSWGEDGFMRIKKKDGEGMCGMNGQS 294

Query: 344 SYP 346
           SYP
Sbjct: 295 SYP 297


>gi|4100157|gb|AAD10337.1| cysteine proteinase precursor [Hordeum vulgare]
          Length = 365

 Score =  285 bits (729), Expect = 2e-74,   Method: Compositional matrix adjust.
 Identities = 143/317 (45%), Positives = 196/317 (61%), Gaps = 14/317 (4%)

Query: 40  EQSVVEIHEKWMAQHGRSYKD--ELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFS 97
           E+S+  ++E+W + +  S +       E R  +FK+N  Y+ + NK  +  ++L  N+F+
Sbjct: 34  EESLRGLYERWRSHYTVSRRGLGADAGERRFNVFKQNARYVHEGNKR-DMPFRLALNKFA 92

Query: 98  DLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTD---VPTSLDWRDKGAVTPIKNQ 154
           D+T DEFR  Y G ++    H S +            D   +P ++DWR KGAVT IK+Q
Sbjct: 93  DMTTDEFRRTYAGSRVRH--HLSLSGGRRGDGGFRYGDADNLPPAVDWRQKGAVTAIKDQ 150

Query: 155 KECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQN 214
            +CG CWAF+ + AVEGI KIR+G L+ LSEQ+L+DC    N GC GG  + AF +I +N
Sbjct: 151 GQCGSCWAFSTIVAVEGINKIRTGKLVSLSEQELMDCDNVNNQGCDGGLMDYAFQFIQKN 210

Query: 215 QGIATEDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAY 273
            GI TE  YPYQ   G+C  A++ A A  I  YE+VP+ DE AL KAV+ QPVS+AI A 
Sbjct: 211 -GITTESNYPYQGEQGSCDQAKENAQAVTIDGYEDVPANDESALQKAVAGQPVSVAIDAS 269

Query: 274 STEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD 333
             +FQ Y EG+F G C T LDH V  VG+G T DG  YW++KNSWG  WG+ GY+++ R 
Sbjct: 270 GQDFQFYSEGVFTGECSTDLDHGVAAVGYGATRDGTKYWIVKNSWGEDWGEKGYIRMQRG 329

Query: 334 ----EGLCGIGTRSSYP 346
               EGLCGI  ++SYP
Sbjct: 330 VSQTEGLCGIAMQASYP 346


>gi|600111|emb|CAA84378.1| cysteine proteinase [Vicia sativa]
          Length = 359

 Score =  285 bits (728), Expect = 3e-74,   Method: Compositional matrix adjust.
 Identities = 149/317 (47%), Positives = 203/317 (64%), Gaps = 16/317 (5%)

Query: 40  EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDL 99
           E+S+  ++E+W + H  + ++  EK  R  +FK N+ ++   NK  ++ YKL  N+F D+
Sbjct: 33  EKSLWNLYERWRSHHTVT-RNLDEKHNRFNVFKANVMHVHNTNKL-DKPYKLKLNKFGDM 90

Query: 100 TNDEFRALYTGYKMPSPSHR-----STTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQ 154
           TN EFR +Y   K+    HR     S  + TF Y+N    DVP+S+DWR+KGAVT +K+Q
Sbjct: 91  TNYEFRRIYADSKISH--HRMFRGMSHENGTFMYEN--AVDVPSSIDWRNKGAVTGVKDQ 146

Query: 155 KECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQN 214
            +CG CWAF+ +AAVEGI +I++  L+ LSEQQL+DC T  N GC GG  E AF +I QN
Sbjct: 147 GQCGSCWAFSTIAAVEGINQIKTQKLVSLSEQQLVDCDTEENEGCNGGLMEYAFEFIKQN 206

Query: 215 QGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYS 274
            GI TE  YPY A  GTC   ++  A  I  +E VP  +E ALLKA + QPVS+AI A  
Sbjct: 207 -GITTESNYPYAAKDGTCDVEKEDKAVSIDGHENVPINNEAALLKAAAKQPVSVAIDAGG 265

Query: 275 TEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR-- 332
             FQ Y EG+F G C T L+H V IVG+G T+D   YW++KNSWG+ WG+ GY+++ R  
Sbjct: 266 YNFQFYSEGVFTGHCDTDLNHGVAIVGYGVTQDRTKYWIMKNSWGSEWGEQGYIRMQRGI 325

Query: 333 --DEGLCGIGTRSSYPL 347
              EGLCGI   +SYP+
Sbjct: 326 SSREGLCGIAMEASYPI 342


>gi|38345906|emb|CAE04498.2| OSJNBb0059K02.8 [Oryza sativa Japonica Group]
          Length = 458

 Score =  285 bits (728), Expect = 3e-74,   Method: Compositional matrix adjust.
 Identities = 145/324 (44%), Positives = 202/324 (62%), Gaps = 12/324 (3%)

Query: 32  VVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKAN---KEGNRT 88
           +VS     E+    ++ +W A+HG+SY    E+E R   F++NL YI++ N     G  +
Sbjct: 25  IVSYGERSEEEARRLYAEWKAEHGKSYNAVGEEERRYAAFRDNLRYIDEHNAAADAGVHS 84

Query: 89  YKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAV 148
           ++LG N+F+DLTN+E+R  Y G +      R  +       N ++   P S+DWR KGAV
Sbjct: 85  FRLGLNRFADLTNEEYRDTYLGLRNKPRRERKVSDRYLAADNEAL---PESVDWRTKGAV 141

Query: 149 TPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAF 208
             IK+Q+  G CWAF+A+AAVEGI +I +G+LI LSEQ+L+DC T+ N GC GG  + AF
Sbjct: 142 AEIKDQEVAGSCWAFSAIAAVEGINQIVTGDLISLSEQELVDCDTSYNEGCNGGLMDYAF 201

Query: 209 AYIIQNQGIATEDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVS 267
            +II N GI TED+YPY+     C   +K A    I +YE+V    E +L KAV+ QPVS
Sbjct: 202 DFIINNGGIDTEDDYPYKGKDERCDVNRKNAKVVTIDSYEDVTPNSETSLQKAVANQPVS 261

Query: 268 IAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGY 327
           +AI A    FQ Y  GIF G CGT LDH V  VG+G TE+G +YW+++NSWG +WG++GY
Sbjct: 262 VAIEAGGRAFQLYSSGIFTGKCGTALDHGVAAVGYG-TENGKDYWIVRNSWGKSWGESGY 320

Query: 328 MKIVRD----EGLCGIGTRSSYPL 347
           +++ R+     G CGI    SYPL
Sbjct: 321 VRMERNIKASSGKCGIAVEPSYPL 344


>gi|113120269|gb|ABI30274.1| VS-B, partial [Vasconcellea stipulata]
          Length = 341

 Score =  285 bits (728), Expect = 3e-74,   Method: Compositional matrix adjust.
 Identities = 144/296 (48%), Positives = 190/296 (64%), Gaps = 10/296 (3%)

Query: 41  QSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLT 100
           +S + + E WM +H + YK   EK  R + FK+NL YI++ NK+ N +Y LG N+F+DLT
Sbjct: 42  ESSIRLFESWMLKHDKVYKTIDEKIYRFETFKDNLMYIDETNKK-NNSYWLGLNEFADLT 100

Query: 101 NDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCC 160
           +DEF+  Y G  +P  S     S   ++ N  + D P S+DWR KGAVTP+KNQ  CG C
Sbjct: 101 HDEFKEKYVG-SIPEDSMIIEQSDDVEFPNKHVVDYPESIDWRQKGAVTPVKNQNPCGSC 159

Query: 161 WAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATE 220
           WAF+ VA VEGI KI +GNLI LSEQ+LLDC    ++GC GG +  +  Y++ N G+ TE
Sbjct: 160 WAFSTVATVEGINKIVTGNLISLSEQELLDCDRR-SHGCKGGYQTTSLKYVVDN-GVHTE 217

Query: 221 DEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQS 279
            EYPY+   G C A  K      I+ Y+ VPS DE +L+K +S+QPVS+ + +    FQ 
Sbjct: 218 KEYPYEKKQGNCRAKNKKGLKVYINGYKRVPSNDEISLIKTISIQPVSVLVESKGRPFQF 277

Query: 280 YKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRDEG 335
           YK G+F G CGT+LDHAVT VG+     G +Y LIKNSWG  WGD GY+KI R  G
Sbjct: 278 YKGGVFGGPCGTKLDHAVTAVGY-----GKDYILIKNSWGPKWGDKGYIKIKRASG 328


>gi|118127|sp|P25251.1|CYSP4_BRANA RecName: Full=Cysteine proteinase COT44; Flags: Precursor
          Length = 328

 Score =  285 bits (728), Expect = 3e-74,   Method: Compositional matrix adjust.
 Identities = 144/317 (45%), Positives = 204/317 (64%), Gaps = 15/317 (4%)

Query: 44  VEIHEKWMAQHGRSYKDEL----EKEMRLKIFKENLEYIEKANKEG-NRTYKLGTNQFSD 98
           + I+ +W  +HG+S  +      +++ R  IFK+NL +I+  N+   N TYKLG   F++
Sbjct: 1   MSIYLRWSLEHGKSNSNSNGIINQQDERFNIFKDNLRFIDLHNENNKNATYKLGLTIFAN 60

Query: 99  LTNDEFRALYTGYKMPSPSHRSTTSST--FKYQN-LSMTDVPTSLDWRDKGAVTPIKNQK 155
           LTNDE+R+LY G +   P  R T +     KY   +++ +VP ++DWR KGAV  IK+Q 
Sbjct: 61  LTNDEYRSLYLGART-EPVRRITKAKNVNMKYSAAVNVDEVPVTVDWRQKGAVNAIKDQG 119

Query: 156 ECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQ 215
            CG CWAF+  AAVEGI KI +G L+ LSEQ+L+DC  + N GC GG  + AF +I++N 
Sbjct: 120 TCGSCWAFSTAAAVEGINKIVTGELVSLSEQELVDCDKSYNQGCNGGLMDYAFQFIMKNG 179

Query: 216 GIATEDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYS 274
           G+ TE +YPY    G C++  K +    I  YE+VPS DE AL +AVS QPVS+AI A  
Sbjct: 180 GLNTEKDYPYHGTNGKCNSLLKNSRVVTIDGYEDVPSKDETALKRAVSYQPVSVAIDAGG 239

Query: 275 TEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD- 333
             FQ Y+ GIF G CGT +DHAV  VG+G +E+G +YW+++NSWG  WG+ GY+++ R+ 
Sbjct: 240 RAFQHYQSGIFTGKCGTNMDHAVVAVGYG-SENGVDYWIVRNSWGTRWGEDGYIRMERNV 298

Query: 334 ---EGLCGIGTRSSYPL 347
               G CGI   +SYP+
Sbjct: 299 ASKSGKCGIAIEASYPV 315


>gi|341850671|gb|AEK97329.1| chromoplast senescence-associated protein 12 [Brassica rapa var.
           parachinensis]
          Length = 260

 Score =  285 bits (728), Expect = 3e-74,   Method: Compositional matrix adjust.
 Identities = 139/259 (53%), Positives = 181/259 (69%), Gaps = 8/259 (3%)

Query: 95  QFSDLTNDEFRALYTGYKMPS--PSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIK 152
           QF+++TNDEFR++YTGYK  S   S   T S++F+YQN+S   +P ++DWR KGAVTPIK
Sbjct: 1   QFAEITNDEFRSMYTGYKGDSVLSSQSQTKSTSFRYQNVSSGALPIAVDWRKKGAVTPIK 60

Query: 153 NQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYII 212
           NQ  CGCCWAF+AVAA+EG T+I+ G LI LSEQQL+DC TN + GC GG  + AF +I+
Sbjct: 61  NQGSCGCCWAFSAVAAIEGATQIKKGKLISLSEQQLVDCDTN-DFGCSGGLIDTAFEHIM 119

Query: 213 QNQGIATEDEYPYQAVPGTCS-AAQKPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIA 271
              G+ TE  YPY+    TC   +  P+AA I+ YE+VP  DE AL+KAV+ QPVS+ I 
Sbjct: 120 ATGGLTTESNYPYKGEDATCKIKSTXPSAASITGYEDVPVNDENALMKAVAHQPVSVGIE 179

Query: 272 AYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIV 331
               +FQ Y  G+F G C T LDHAVT VG+  +  G+ YW+IKNSWG  WG+ GYM+I 
Sbjct: 180 GGGFDFQFYSSGVFTGECTTYLDHAVTAVGYSQSSAGSKYWIIKNSWGTKWGEGGYMRIK 239

Query: 332 RD----EGLCGIGTRSSYP 346
           +D    EGLCG+  ++SYP
Sbjct: 240 KDIKDKEGLCGLAMKASYP 258


>gi|218181|dbj|BAA14402.1| oryzain alpha precursor [Oryza sativa Japonica Group]
          Length = 458

 Score =  284 bits (727), Expect = 4e-74,   Method: Compositional matrix adjust.
 Identities = 145/324 (44%), Positives = 200/324 (61%), Gaps = 12/324 (3%)

Query: 32  VVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKAN---KEGNRT 88
           +VS     E+    ++ +W A+HG+SY    E+E R   F++NL YI++ N     G  +
Sbjct: 25  IVSYGERSEEEARRLYAEWKAEHGKSYNAVGEEERRYAAFRDNLRYIDEHNAAADAGVHS 84

Query: 89  YKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAV 148
           ++LG N+F+DLTN+E+R  Y G +      R  +       N ++   P S+DWR KGAV
Sbjct: 85  FRLGLNRFADLTNEEYRDTYLGLRNKPRRERKVSDRYLAADNEAL---PESVDWRTKGAV 141

Query: 149 TPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAF 208
             IK+Q  CG CWAF+A+AAVE I +I +G+LI LSEQ+L+DC T+ N GC GG  + AF
Sbjct: 142 AEIKDQGGCGSCWAFSAIAAVEDINQIVTGDLISLSEQELVDCDTSYNEGCNGGLMDYAF 201

Query: 209 AYIIQNQGIATEDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVS 267
            +II N GI TED+YPY+     C   +K A    I +YE+V    E +L KAV  QPVS
Sbjct: 202 DFIINNGGIDTEDDYPYKGKDERCDVNRKNAKVVTIDSYEDVTPNSETSLQKAVRNQPVS 261

Query: 268 IAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGY 327
           +AI A    FQ Y  GIF G CGT LDH V  VG+G TE+G +YW+++NSWG +WG++GY
Sbjct: 262 VAIEAGGRAFQLYSSGIFTGKCGTALDHGVAAVGYG-TENGKDYWIVRNSWGKSWGESGY 320

Query: 328 MKIVRD----EGLCGIGTRSSYPL 347
           +++ R+     G CGI    SYPL
Sbjct: 321 VRMERNIKASSGKCGIAVEPSYPL 344


>gi|168058022|ref|XP_001781010.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162667567|gb|EDQ54194.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 457

 Score =  284 bits (726), Expect = 5e-74,   Method: Compositional matrix adjust.
 Identities = 145/309 (46%), Positives = 198/309 (64%), Gaps = 17/309 (5%)

Query: 50  WMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYT 109
           W  +HG+ Y    E+  R  ++K+NLEYI++ + E N +Y LG  +F+DLTN+EFR  YT
Sbjct: 48  WAHKHGKVYSAAEERAHRFLVWKDNLEYIQR-HSEKNLSYWLGLTKFADLTNEEFRRQYT 106

Query: 110 GYKMPSPSH----RSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAA 165
           G ++         R+ T S F+Y N   ++ P S+DWR+KGAVT +K+Q  CG CWAF+A
Sbjct: 107 GTRIDRSRRLKKGRNATGS-FRYAN---SEAPKSIDWREKGAVTSVKDQGSCGSCWAFSA 162

Query: 166 VAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPY 225
           V +VEGI  IR+G+ I LS Q+L+DC    N GC GG  + AF ++IQN GI TE +YPY
Sbjct: 163 VGSVEGINAIRTGDAISLSVQELVDCDKKYNQGCNGGLMDYAFDFVIQNGGIDTEKDYPY 222

Query: 226 QAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGI 284
           Q   G C   +  A    I +YE+VP  DE+AL KAV+ QPVS+AI A   +FQ Y  G+
Sbjct: 223 QGYDGRCDVNKMNARVVTIDSYEDVPENDEEALKKAVAGQPVSVAIEAGGRDFQLYSGGV 282

Query: 285 FNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD------EGLCG 338
           F G CGT LDH V  VG+G +E G +YW++KNSWG  WG++GY+++ R+       GLCG
Sbjct: 283 FTGRCGTDLDHGVLAVGYG-SEKGLDYWIVKNSWGEYWGESGYLRMQRNLKDDNGYGLCG 341

Query: 339 IGTRSSYPL 347
           I    SY +
Sbjct: 342 INIEPSYAV 350


>gi|3688528|emb|CAA06243.1| pre-pro-TPE4A protein [Pisum sativum]
          Length = 360

 Score =  284 bits (726), Expect = 5e-74,   Method: Compositional matrix adjust.
 Identities = 148/318 (46%), Positives = 206/318 (64%), Gaps = 17/318 (5%)

Query: 40  EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDL 99
           E+S+ +++E+W + H  + +   EK  R  +FK N+ ++   NK  ++ YKL  N+F+D+
Sbjct: 33  EKSLWDLYERWRSHHTVT-RSLDEKHNRFNVFKANVMHVHNTNKL-DKPYKLKLNKFADM 90

Query: 100 TNDEFRALYTGYKMPSPSHR-----STTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQ 154
           TN EFR +Y   K+    HR     S  + TF Y+N+   +VP+S+DWR KGAVT +K+Q
Sbjct: 91  TNYEFRRIYADSKVSH--HRMFRGMSNENGTFMYENVK--NVPSSIDWRKKGAVTDVKDQ 146

Query: 155 KECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQN 214
            +CG CWAF+ + AVEGI +I++  L+ LSEQ+L+DC T GN GC GG  E AF +I QN
Sbjct: 147 GQCGSCWAFSTIVAVEGINQIKTQKLVSLSEQELVDCDTGGNEGCNGGLMEYAFEFIKQN 206

Query: 215 QGIATEDEYPYQAVPGTCSAAQKPAA-AKISNYEEVPSGDEQALLKAVSMQPVSIAIAAY 273
            GI TE  YPY A  GTC   ++  A   I  YE VP  +E ALLKA + QPVS+AI A 
Sbjct: 207 -GITTESNYPYAAKDGTCDLKKEDKAEVSIDGYENVPINNEAALLKAAAKQPVSVAIDAG 265

Query: 274 STEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR- 332
              FQ Y EG+F+G CGT L+H V +VG+G T+D   YW++KNSWG+ WG+ GY+++ R 
Sbjct: 266 GYNFQFYSEGVFSGHCGTDLNHGVAVVGYGVTQDRTKYWIVKNSWGSEWGEQGYIRMQRG 325

Query: 333 ---DEGLCGIGTRSSYPL 347
               EGLCGI   +SYP+
Sbjct: 326 ISHKEGLCGIAMEASYPI 343


>gi|357129125|ref|XP_003566217.1| PREDICTED: thiol protease SEN102-like [Brachypodium distachyon]
          Length = 380

 Score =  283 bits (725), Expect = 7e-74,   Method: Compositional matrix adjust.
 Identities = 150/322 (46%), Positives = 207/322 (64%), Gaps = 16/322 (4%)

Query: 40  EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDL 99
           E+S+  ++E+W A+H  S +D  EK  R  +F+EN   + + N   +  YKL  N+F+DL
Sbjct: 42  EESLWALYERWRARHTVS-RDLAEKSRRFNVFRENARLVHEFNLRRDAPYKLRLNRFADL 100

Query: 100 TNDEFRALY-----TGYKMPSPSHRSTTSSTFKYQNLSMTD---VPTSLDWRDKGAVTPI 151
           T+DEFR  Y     + ++M  P   +  +     +  S T    +PTS+DWR+KGAVT +
Sbjct: 101 TSDEFRRSYASSRVSHHRMFKP-RAANNNDDDDDKGSSFTHGGALPTSVDWREKGAVTGV 159

Query: 152 KNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYI 211
           K+Q +CG CWAF+ +AAVEGI  IR+ NL  LSEQQL+DC T  N GC GG  + AF+YI
Sbjct: 160 KDQGQCGSCWAFSTIAAVEGINAIRTNNLTSLSEQQLVDCDTKTNAGCDGGLMDDAFSYI 219

Query: 212 IQNQGIATEDEYPYQAVPGTCSAAQKPAAAKIS--NYEEVPSGDEQALLKAVSMQPVSIA 269
            ++ G+A E  YPY+A   +   ++K AAA +S   YE+VP  DE AL KAV+ QPV++A
Sbjct: 220 AKHGGVAAEKSYPYRARQSSSCNSKKAAAAVVSIDGYEDVPRNDETALKKAVAAQPVAVA 279

Query: 270 IAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMK 329
           I A  + FQ Y EG+F G CGT+LDH V  VG+G T DG  YW++KNSWG  WG+ GY++
Sbjct: 280 IEAGGSHFQFYSEGVFAGKCGTELDHGVAAVGYGVTVDGTKYWIVKNSWGEEWGEKGYIR 339

Query: 330 IVRD----EGLCGIGTRSSYPL 347
           + RD    EGLCGI   +SYP+
Sbjct: 340 MKRDVADKEGLCGIAMEASYPV 361


>gi|255032|gb|AAB23155.1| COT44=cysteine proteinase homolog [Brassica napus, seedling, rapid
           cycling base population CrGC5, Peptide, 328 aa]
          Length = 328

 Score =  283 bits (725), Expect = 7e-74,   Method: Compositional matrix adjust.
 Identities = 144/317 (45%), Positives = 203/317 (64%), Gaps = 15/317 (4%)

Query: 44  VEIHEKWMAQHGRSYKDEL----EKEMRLKIFKENLEYIEKANKEG-NRTYKLGTNQFSD 98
           + I+ +W  +HG+S  +      +++ R  IFK+NL +I+  N+   N TYKLG   F++
Sbjct: 1   MSIYLRWSLEHGKSNSNSNGIINQQDERFNIFKDNLRFIDLHNENNKNATYKLGLTIFAN 60

Query: 99  LTNDEFRALYTGYKMPSPSHRSTTSST--FKYQN-LSMTDVPTSLDWRDKGAVTPIKNQK 155
           LTNDE+R+LY G +   P  R T +     KY   ++  +VP ++DWR KGAV  IK+Q 
Sbjct: 61  LTNDEYRSLYLGART-EPVRRITKAKNVNMKYSAAVNDVEVPVTVDWRQKGAVNAIKDQG 119

Query: 156 ECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQ 215
            CG CWAF+  AAVEGI KI +G L+ LSEQ+L+DC  + N GC GG  + AF +I++N 
Sbjct: 120 TCGSCWAFSTAAAVEGINKIVTGELVSLSEQELVDCDKSYNQGCNGGLMDYAFQFIMKNG 179

Query: 216 GIATEDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYS 274
           G+ TE +YPY    G C++  K +    I  YE+VPS DE AL +AVS QPVS+AI A  
Sbjct: 180 GLNTEKDYPYHGTNGKCNSLLKNSRVVTIDGYEDVPSKDETALKRAVSYQPVSVAIDAGG 239

Query: 275 TEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD- 333
             FQ Y+ GIF G CGT +DHAV  VG+G +E+G +YW+++NSWG  WG+ GY+++ R+ 
Sbjct: 240 RAFQHYQSGIFTGKCGTNMDHAVVAVGYG-SENGVDYWIVRNSWGTRWGEDGYIRMERNV 298

Query: 334 ---EGLCGIGTRSSYPL 347
               G CGI   +SYP+
Sbjct: 299 ASKSGKCGIAIEASYPV 315


>gi|297845064|ref|XP_002890413.1| hypothetical protein ARALYDRAFT_472321 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297336255|gb|EFH66672.1| hypothetical protein ARALYDRAFT_472321 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 357

 Score =  283 bits (724), Expect = 7e-74,   Method: Compositional matrix adjust.
 Identities = 144/355 (40%), Positives = 224/355 (63%), Gaps = 25/355 (7%)

Query: 13  INTTPMFIIITLLVSCASQVVSSRSTHEQSVV--------------EIHEKWMAQHGRSY 58
           +++    +   L +S A+  +S  ++H+ S+V              E+ E W++   ++Y
Sbjct: 3   LSSPSRILCFPLALSAATLSLSVAASHDYSIVGYSPEDLESHDKLIELFENWISNFEKAY 62

Query: 59  KDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSH 118
           +   EK +R ++FK+NL++I++ NK+  ++Y LG N+F+DL+++EF+ +Y G K      
Sbjct: 63  ETVEEKLLRFEVFKDNLKHIDETNKK-VKSYWLGLNEFADLSHEEFKKMYLGLKTDIVRR 121

Query: 119 RSTTS-STFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRS 177
               S + F Y+++    VP S+DWR KGAV  +KNQ  CG CWAF+ VAAVEGI KI +
Sbjct: 122 DEERSYAEFAYRDVEA--VPKSVDWRKKGAVAEVKNQGSCGSCWAFSTVAAVEGINKIVT 179

Query: 178 GNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQK 237
           GNL  LSEQ+L+DC T  NNGC GG  + AF YI++N G+  E++YPY    GTC   + 
Sbjct: 180 GNLTTLSEQELIDCDTTYNNGCNGGLMDYAFEYIVKNGGLRKEEDYPYSMEEGTCEMQKD 239

Query: 238 PA-AAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKE-GIFNGVCGTQLDH 295
            +    I  +++VP+ DE++LLKA++ QP+S+AI A   EFQ Y    +F+G CG  LDH
Sbjct: 240 ESETVTIDGHQDVPTNDEKSLLKALAHQPLSVAIDASGREFQFYSGVSVFDGRCGVDLDH 299

Query: 296 AVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYP 346
            V  VG+G+++ G++Y ++KNSWG  WG+ GY+++ R+    EGLCGI   +S+P
Sbjct: 300 GVAAVGYGSSK-GSDYIIVKNSWGPKWGEKGYIRLKRNTGKPEGLCGINKMASFP 353


>gi|359359118|gb|AEV41024.1| putative oryzain beta chain precursor [Oryza minuta]
          Length = 493

 Score =  283 bits (724), Expect = 9e-74,   Method: Compositional matrix adjust.
 Identities = 143/340 (42%), Positives = 213/340 (62%), Gaps = 45/340 (13%)

Query: 47  HEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRT--YKLGTNQFSDLTNDEF 104
           ++ W+A++GRSY    E+E R ++F +NL++++  N   +    ++LG N+F+DLTNDEF
Sbjct: 49  YDLWLAENGRSYNALGERERRFRVFWDNLKFVDAHNARADEHGGFRLGMNRFADLTNDEF 108

Query: 105 RALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKEC------- 157
           RA + G K    S     ++  +Y++  + ++P S+DWR+KGAV P+KNQ +C       
Sbjct: 109 RATFLGAKFVERSR----AAGERYRHDGVEELPESVDWREKGAVAPVKNQGQCVDRIIVW 164

Query: 158 -------------------------GCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS 192
                                    G CWAF+AV+ VE I ++ +G +I LSEQ+L++CS
Sbjct: 165 NSMVRIYVVDAGCMLENPLMGLTVQGSCWAFSAVSTVESINQLVTGEMITLSEQELVECS 224

Query: 193 TNG-NNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPA-AAKISNYEEVP 250
           TNG N+GC GG  + AF +II+N GI TED+YPY+AV G C   ++ A    I  +E+VP
Sbjct: 225 TNGQNSGCNGGLMDDAFDFIIKNGGIDTEDDYPYKAVDGKCDINRENAKVVSIDGFEDVP 284

Query: 251 SGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGAN 310
             DE++L KAV+ QPVS+AI A   EFQ Y  G+F+G CGT LDH V  VG+G T++G +
Sbjct: 285 QNDEKSLQKAVAHQPVSVAIEAGGREFQLYHSGVFSGRCGTSLDHGVVAVGYG-TDNGKD 343

Query: 311 YWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYP 346
           YW+++NSWG  WG++GY+++ R+     G CGI   +SYP
Sbjct: 344 YWIVRNSWGPKWGESGYVRMERNINATTGKCGIAMMASYP 383


>gi|449447027|ref|XP_004141271.1| PREDICTED: cysteine proteinase RD21a-like [Cucumis sativus]
          Length = 458

 Score =  283 bits (723), Expect = 1e-73,   Method: Compositional matrix adjust.
 Identities = 149/337 (44%), Positives = 216/337 (64%), Gaps = 17/337 (5%)

Query: 18  MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKD-ELEKEMRLKIFKENLE 76
            F+ I L  +  S ++  R+  E  V+ ++++W A+HG+ + +   E E R  IFK+NL+
Sbjct: 14  FFLFIALSAASPSSIIPQRTDDE--VMALYDQWRAKHGKLHNNLGAEPENRFHIFKDNLK 71

Query: 77  YIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDV 136
           +I++ N + N  Y+LG N F+DLTN+E+R+ Y G K  S S R+ TS+  +Y      D+
Sbjct: 72  FIDEINAQ-NLPYRLGLNVFADLTNEEYRSRYLGGKFASGSRRNRTSN--RYLPRLGDDL 128

Query: 137 PTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGN 196
           P S+DWR KGAV P+K+Q  CG CWAF+ VA+VE I +I +G+LI LSEQ+L+DC  + N
Sbjct: 129 PDSIDWRAKGAVAPVKDQGSCGSCWAFSTVASVEAINQIVTGDLIALSEQELVDCDRSYN 188

Query: 197 NGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQA 256
            GC GG  + AF +II+N G+ TE++YPY     +C   +K A   I  YE+VP  +E+A
Sbjct: 189 EGCNGGLMDYAFEFIIENGGLDTEEDYPYYGFDSSCIQYKKNA---IDGYEDVPVNNEKA 245

Query: 257 LLKA---VSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWL 313
           L KA     +  VS+AI      FQ Y+ GIF G CGT LDH V +VG+G +E G +YW+
Sbjct: 246 LQKAVSKQVVSVVSVAIEGGGRSFQLYQSGIFTGRCGTDLDHGVNVVGYG-SEGGVDYWI 304

Query: 314 IKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYP 346
           ++NSWG +WG++GY+K+ R+     GLCGI    SYP
Sbjct: 305 VRNSWGGSWGESGYVKMQRNIASPTGLCGIAMEPSYP 341


>gi|414589857|tpg|DAA40428.1| TPA: Vignain [Zea mays]
          Length = 377

 Score =  282 bits (722), Expect = 1e-73,   Method: Compositional matrix adjust.
 Identities = 146/344 (42%), Positives = 208/344 (60%), Gaps = 30/344 (8%)

Query: 28  CASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNR 87
           C + +V+        ++E  E+WM +HGR Y D  EK+ RL++++ N+E +E  N  GN 
Sbjct: 39  CGAALVA----RADPMLERFEQWMGRHGRLYADAGEKQRRLEVYRRNVELVETFNSMGN- 93

Query: 88  TYKLGTNQFSDLTNDEFRALYTGYKMPSP---SHRSTTSSTFKYQNLSM------TDVPT 138
            Y+L  N+F+DLTN+EFRA   G+  P     +  ST  ST       +      +D+P 
Sbjct: 94  GYRLADNKFADLTNEEFRAKMLGFGRPRSGGGAGHSTAPSTVACIGSGLMGRQGYSDLPK 153

Query: 139 SLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNG 198
           S+DWR+KGAV P+K+Q +CG CWAF+AVAA+EGI +I++G L+ LSEQ+L+DC T    G
Sbjct: 154 SVDWREKGAVAPVKSQGDCGSCWAFSAVAAIEGINQIKNGKLVSLSEQELVDCDTKA-IG 212

Query: 199 CLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQ-KPAAAKISNYEEVPSGDEQAL 257
           C GG    AF ++++N+G+ TE  YPYQ + G C   + K +A  IS Y  V    E  L
Sbjct: 213 CAGGYMSWAFEFVMKNRGLTTERNYPYQGLNGACQTPKLKESAVSISGYMNVTPSSEPDL 272

Query: 258 LKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTED---------- 307
           L+A + QPVS+A+ A S  +Q Y  G+F G C  +L+H VT+VG+G T+           
Sbjct: 273 LRAAAAQPVSVAVDAGSFVWQLYGGGVFTGPCTAELNHGVTVVGYGETQGDTDGDGSGVP 332

Query: 308 GANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPL 347
           G  YW++KNSWG  WGDAGY+ + R+     GLCGI    SYP+
Sbjct: 333 GKKYWIVKNSWGPEWGDAGYILMQREASVASGLCGIAMLPSYPV 376


>gi|14600257|gb|AAK71314.1|AF388175_1 papain-like cysteine peptidase XBCP3 [Arabidopsis thaliana]
          Length = 437

 Score =  282 bits (722), Expect = 2e-73,   Method: Compositional matrix adjust.
 Identities = 139/311 (44%), Positives = 202/311 (64%), Gaps = 11/311 (3%)

Query: 43  VVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTND 102
           + E+ + W  +HG++Y  E E++ R++IFK+N +++ + N   N TY L  N F+DLT+ 
Sbjct: 28  ISELFDDWCQKHGKTYGSEEERQQRIQIFKDNHDFVTQHNLITNATYSLSLNAFADLTHH 87

Query: 103 EFRALYTGYKMPSPSHRSTTSSTFKYQNLSMT-DVPTSLDWRDKGAVTPIKNQKECGCCW 161
           EF+A   G  + +PS    +    K Q+L  +  VP S+DWR KGAVT +K+Q  CG CW
Sbjct: 88  EFKASRLGLSVSAPSVIMAS----KGQSLGGSVKVPDSVDWRKKGAVTNVKDQGSCGACW 143

Query: 162 AFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATED 221
           +F+A  A+EGI +I +G+LI LSEQ+L+DC  + N GC GG  + AF ++I+N GI TE 
Sbjct: 144 SFSATGAMEGINQIVTGDLISLSEQELIDCDKSYNAGCNGGLMDYAFEFVIKNHGIDTEK 203

Query: 222 EYPYQAVPGTCSAAQ-KPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSY 280
           +YPYQ   GTC   + K     I +Y  V S DE+AL++AV+ QPVS+ I      FQ Y
Sbjct: 204 DYPYQERDGTCKKDKLKQKVVTIDSYAGVKSNDEKALMEAVAAQPVSVGICGSERAFQLY 263

Query: 281 KEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGL 336
             GIF+G C T LDHAV IVG+G +++G +YW++KNSWG +WG  G+M + R+    +G+
Sbjct: 264 SRGIFSGPCSTSLDHAVLIVGYG-SQNGVDYWIVKNSWGKSWGMDGFMHMQRNTENSDGV 322

Query: 337 CGIGTRSSYPL 347
           CGI   +SYP+
Sbjct: 323 CGINMLASYPI 333


>gi|18391078|ref|NP_563855.1| xylem bark cysteine peptidase 3 [Arabidopsis thaliana]
 gi|110741821|dbj|BAE98853.1| papain-like cysteine peptidase XBCP3 [Arabidopsis thaliana]
 gi|111074448|gb|ABH04597.1| At1g09850 [Arabidopsis thaliana]
 gi|332190386|gb|AEE28507.1| xylem bark cysteine peptidase 3 [Arabidopsis thaliana]
          Length = 437

 Score =  282 bits (722), Expect = 2e-73,   Method: Compositional matrix adjust.
 Identities = 139/311 (44%), Positives = 202/311 (64%), Gaps = 11/311 (3%)

Query: 43  VVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTND 102
           + E+ + W  +HG++Y  E E++ R++IFK+N +++ + N   N TY L  N F+DLT+ 
Sbjct: 28  ISELFDDWCQKHGKTYGSEEERQQRIQIFKDNHDFVTQHNLITNATYSLSLNAFADLTHH 87

Query: 103 EFRALYTGYKMPSPSHRSTTSSTFKYQNLSMT-DVPTSLDWRDKGAVTPIKNQKECGCCW 161
           EF+A   G  + +PS    +    K Q+L  +  VP S+DWR KGAVT +K+Q  CG CW
Sbjct: 88  EFKASRLGLSVSAPSVIMAS----KGQSLGGSVKVPDSVDWRKKGAVTNVKDQGSCGACW 143

Query: 162 AFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATED 221
           +F+A  A+EGI +I +G+LI LSEQ+L+DC  + N GC GG  + AF ++I+N GI TE 
Sbjct: 144 SFSATGAMEGINQIVTGDLISLSEQELIDCDKSYNAGCNGGLMDYAFEFVIKNHGIDTEK 203

Query: 222 EYPYQAVPGTCSAAQ-KPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSY 280
           +YPYQ   GTC   + K     I +Y  V S DE+AL++AV+ QPVS+ I      FQ Y
Sbjct: 204 DYPYQERDGTCKKDKLKQKVVTIDSYAGVKSNDEKALMEAVAAQPVSVGICGSERAFQLY 263

Query: 281 KEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGL 336
             GIF+G C T LDHAV IVG+G +++G +YW++KNSWG +WG  G+M + R+    +G+
Sbjct: 264 SSGIFSGPCSTSLDHAVLIVGYG-SQNGVDYWIVKNSWGKSWGMDGFMHMQRNTENSDGV 322

Query: 337 CGIGTRSSYPL 347
           CGI   +SYP+
Sbjct: 323 CGINMLASYPI 333


>gi|226507844|ref|NP_001148894.1| LOC100282514 precursor [Zea mays]
 gi|194703250|gb|ACF85709.1| unknown [Zea mays]
 gi|195622994|gb|ACG33327.1| vignain precursor [Zea mays]
          Length = 356

 Score =  282 bits (722), Expect = 2e-73,   Method: Compositional matrix adjust.
 Identities = 146/344 (42%), Positives = 208/344 (60%), Gaps = 30/344 (8%)

Query: 28  CASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNR 87
           C + +V+        ++E  E+WM +HGR Y D  EK+ RL++++ N+E +E  N  GN 
Sbjct: 18  CGAALVA----RADPMLERFEQWMGRHGRLYADAGEKQRRLEVYRRNVELVETFNSMGN- 72

Query: 88  TYKLGTNQFSDLTNDEFRALYTGYKMPSP---SHRSTTSSTFKYQNLSM------TDVPT 138
            Y+L  N+F+DLTN+EFRA   G+  P     +  ST  ST       +      +D+P 
Sbjct: 73  GYRLADNKFADLTNEEFRAKMLGFGRPRSGGGAGHSTAPSTVACIGSGLMGRQGYSDLPK 132

Query: 139 SLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNG 198
           S+DWR+KGAV P+K+Q +CG CWAF+AVAA+EGI +I++G L+ LSEQ+L+DC T    G
Sbjct: 133 SVDWREKGAVAPVKSQGDCGSCWAFSAVAAIEGINQIKNGKLVSLSEQELVDCDTKA-IG 191

Query: 199 CLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQ-KPAAAKISNYEEVPSGDEQAL 257
           C GG    AF ++++N+G+ TE  YPYQ + G C   + K +A  IS Y  V    E  L
Sbjct: 192 CAGGYMSWAFEFVMKNRGLTTERNYPYQGLNGACQTPKLKESAVSISGYMNVTPSSEPDL 251

Query: 258 LKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTED---------- 307
           L+A + QPVS+A+ A S  +Q Y  G+F G C  +L+H VT+VG+G T+           
Sbjct: 252 LRAAAAQPVSVAVDAGSFVWQLYGGGVFTGPCTAELNHGVTVVGYGETQGDTDGDGSGVP 311

Query: 308 GANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPL 347
           G  YW++KNSWG  WGDAGY+ + R+     GLCGI    SYP+
Sbjct: 312 GKKYWIVKNSWGPEWGDAGYILMQREASVASGLCGIAMLPSYPV 355


>gi|359359168|gb|AEV41073.1| putative cysteine protease [Oryza minuta]
          Length = 499

 Score =  282 bits (721), Expect = 2e-73,   Method: Compositional matrix adjust.
 Identities = 146/322 (45%), Positives = 208/322 (64%), Gaps = 18/322 (5%)

Query: 40  EQSVVEIHEKWMAQH---GRSYKDEL-EKEMRLKIFKENLEYIEKANKEGNRT--YKLGT 93
           E     +++ W+A+H   G S+   + E E R ++F +NL++++  N   +    ++LG 
Sbjct: 58  EAEARAVYDLWVARHRHGGGSHNGLVGEYERRFRVFWDNLKFVDAHNARADEHGGFRLGM 117

Query: 94  NQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVT-PIK 152
           N+F+DLTNDEFRA Y G   P+   R    +   Y++  +  +P S+DWRDKGAV  P+K
Sbjct: 118 NRFADLTNDEFRAAYLG-TTPAGRGRHVGEA---YRHDGVEALPDSVDWRDKGAVVAPVK 173

Query: 153 NQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-NNGCLGGSREKAFAYI 211
           NQ +CG CWAF+AVAAVEGI KI +G L+ LSEQ+L++C+ NG N+GC GG  + AFA+I
Sbjct: 174 NQGQCGSCWAFSAVAAVEGINKIVTGELVSLSEQELVECARNGANSGCNGGMMDDAFAFI 233

Query: 212 IQNQGIATEDEYPYQAVPGTCSAAQKP-AAAKISNYEEVPSGDEQALLKAVSMQPVSIAI 270
            +N G+ TE++YPY A+ G C+ A+K      I  +E+VP  DE +L KAV+ QPVS+AI
Sbjct: 234 ARNGGLDTEEDYPYTAMDGKCNLAKKSRKVVSIDGFEDVPENDELSLQKAVAHQPVSVAI 293

Query: 271 AAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGT-TEDGANYWLIKNSWGNTWGDAGYMK 329
            A   EFQ Y  G+F G CGT LDH V  VG+GT    G +YW ++NSWG  WG+ GY++
Sbjct: 294 DAGGREFQLYDSGVFTGRCGTSLDHGVVAVGYGTDAATGTDYWTVRNSWGPDWGENGYIR 353

Query: 330 IVRD----EGLCGIGTRSSYPL 347
           + R+     G CGI   +SYP+
Sbjct: 354 MERNVTARTGKCGIAMMASYPI 375


>gi|297744465|emb|CBI37727.3| unnamed protein product [Vitis vinifera]
          Length = 331

 Score =  282 bits (721), Expect = 2e-73,   Method: Compositional matrix adjust.
 Identities = 140/304 (46%), Positives = 198/304 (65%), Gaps = 31/304 (10%)

Query: 48  EKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRAL 107
           E W+++HG+ YK   EK  R ++F+ENL +I++ NKE + +Y LG N+F+DL+++EF++ 
Sbjct: 50  ESWVSKHGKVYKSMEEKLHRFEVFRENLNHIDERNKEVS-SYWLGLNEFADLSHEEFKSK 108

Query: 108 YTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVA 167
                                    + D+P S+DWR KGAVT +KNQ  CG CWAF+ VA
Sbjct: 109 ------------------------DVADLPESVDWRKKGAVTHVKNQGACGSCWAFSTVA 144

Query: 168 AVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQA 227
           AVEGI +I +GNL  LSEQ+L+DC T  N+GC GG  + AFA+I  N G+  ED+YPY  
Sbjct: 145 AVEGINQIVTGNLTTLSEQELIDCDTTFNSGCNGGLMDYAFAFIASNGGLHKEDDYPYLM 204

Query: 228 VPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFN 286
             GTC   ++      IS YE+VP  DE++LLKA++ QP+S+AI A   +FQ Y  G+FN
Sbjct: 205 EEGTCEEQKEDVDIVTISGYEDVPEKDEESLLKALAHQPLSVAIEASGRDFQFYSGGVFN 264

Query: 287 GVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTR 342
           G CGT+LDH V  VG+G+++ G +Y ++KNSWG  WG+ GY+++ R+    EGLCGI   
Sbjct: 265 GPCGTELDHGVAAVGYGSSK-GLDYIIVKNSWGPKWGEKGYIRMKRNTGKTEGLCGINKM 323

Query: 343 SSYP 346
           +SYP
Sbjct: 324 ASYP 327


>gi|357126406|ref|XP_003564878.1| PREDICTED: cysteine proteinase EP-B 1-like [Brachypodium
           distachyon]
          Length = 377

 Score =  282 bits (721), Expect = 2e-73,   Method: Compositional matrix adjust.
 Identities = 144/328 (43%), Positives = 205/328 (62%), Gaps = 27/328 (8%)

Query: 40  EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRT--------YKL 91
           E+++ E++ +W + H    +   EK  R   FK N+ +I   N   N T        Y+L
Sbjct: 35  EEALWELYTRWQSAHRLPPQHHAEKHRRFGTFKSNVLFIHAHNTRLNDTSTNNNGPSYRL 94

Query: 92  GTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSST----FKYQNLSMTDVPTSLDWRDKGA 147
             N+F D+   EFR+ + G     P HR T  +     F Y   ++ D+P ++DWR KGA
Sbjct: 95  RLNRFGDMDQAEFRSTFAG-----PLHRHTRPAQSIPGFIYD--TVKDIPQAVDWRQKGA 147

Query: 148 VTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-NNGCLGGSREK 206
           VT +K+Q +CG CWAF+AVA+VEG+  IR+G+L+ LSEQ+L+DC T G +NGC GG  E 
Sbjct: 148 VTGVKDQGKCGSCWAFSAVASVEGLNAIRTGSLVSLSEQELIDCDTGGDDNGCQGGLMES 207

Query: 207 AFAYIIQNQ-GIATEDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQ 264
           AF +I  +  G+ATE  YPY A  GTC+A +  + + +I  ++ VP+G+E+AL KAV+ Q
Sbjct: 208 AFEFIAHSAGGLATEAAYPYHASNGTCNANRGSSVSVRIDGHQSVPAGNEEALAKAVAHQ 267

Query: 265 PVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTT-EDGANYWLIKNSWGNTWG 323
           PVS+AI A    FQ Y EG+F G CG++LDH V +VG+G   EDG  YW++KNSWG  WG
Sbjct: 268 PVSVAIDAGGQAFQFYSEGVFTGDCGSELDHGVAVVGYGVAEEDGKEYWIVKNSWGPGWG 327

Query: 324 DAGYMKIVRDE----GLCGIGTRSSYPL 347
           + GY+++ RD     GLCGI   +SYP+
Sbjct: 328 EHGYVRMQRDSGVDGGLCGIAMEASYPV 355


>gi|302763109|ref|XP_002964976.1| hypothetical protein SELMODRAFT_83176 [Selaginella moellendorffii]
 gi|302763113|ref|XP_002964978.1| hypothetical protein SELMODRAFT_83554 [Selaginella moellendorffii]
 gi|300167209|gb|EFJ33814.1| hypothetical protein SELMODRAFT_83176 [Selaginella moellendorffii]
 gi|300167211|gb|EFJ33816.1| hypothetical protein SELMODRAFT_83554 [Selaginella moellendorffii]
          Length = 300

 Score =  281 bits (720), Expect = 2e-73,   Method: Compositional matrix adjust.
 Identities = 137/303 (45%), Positives = 196/303 (64%), Gaps = 8/303 (2%)

Query: 46  IHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFR 105
           + E W A+H +SY  + EK  RL +F + L YIEK N + N T+ LG N+FSDLTN EFR
Sbjct: 1   MFEDWAAKHDKSYSSDWEKARRLMVFSDTLAYIEKHNAQPNTTFTLGLNKFSDLTNAEFR 60

Query: 106 ALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAA 165
           A Y G K   P ++    +  K  ++ ++ +PTSLDWR +GAVTPIK+Q +CG CWAF+A
Sbjct: 61  ANYVG-KFKPPRYQDRRPA--KDVDVDVSSLPTSLDWRQEGAVTPIKDQGQCGSCWAFSA 117

Query: 166 VAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPY 225
           +A++E    + +  L+ LSEQQL+DC T  + GC GG  + AF ++++N G+ TE+ YPY
Sbjct: 118 IASIESAHFLATKELVSLSEQQLIDCDTV-DQGCQGGFPDDAFKFVVENGGVTTEEAYPY 176

Query: 226 QAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIF 285
               G+C+   K    +I+ Y++V      AL+KAVS  PV++ I      FQ+Y+ GI 
Sbjct: 177 TGFAGSCN-TNKNKVVEITGYKDVTKDSADALMKAVSKTPVTVGICGSDQNFQNYRSGIL 235

Query: 286 NGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD--EGLCGIGTRS 343
           +G C    DHAV ++G+G TE G  YW+IKNSWG +WG+ G+MKI +   EG+CG+  +S
Sbjct: 236 SGQCCNSRDHAVLVIGYG-TEGGMPYWIIKNSWGTSWGEDGFMKIKKKDGEGMCGMNGQS 294

Query: 344 SYP 346
           SYP
Sbjct: 295 SYP 297


>gi|359359120|gb|AEV41026.1| putative cysteine protease [Oryza minuta]
          Length = 464

 Score =  281 bits (720), Expect = 2e-73,   Method: Compositional matrix adjust.
 Identities = 146/322 (45%), Positives = 208/322 (64%), Gaps = 18/322 (5%)

Query: 40  EQSVVEIHEKWMAQH---GRSYKDEL-EKEMRLKIFKENLEYIEKANKEGNRT--YKLGT 93
           E     +++ W+A+H   G S+   + E E R ++F +NL++++  N   +    ++LG 
Sbjct: 59  EAEARAVYDLWVARHRHGGGSHNGFVGEYERRFRVFWDNLKFVDAHNAHADEHGGFRLGM 118

Query: 94  NQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAV-TPIK 152
           N+F+DLTNDEFRA Y G    +P+ R        Y++  +  +P S+DWRDKGAV +P+K
Sbjct: 119 NRFADLTNDEFRAAYLG---TTPAGRGRHVGEM-YRHDGVEALPDSVDWRDKGAVVSPVK 174

Query: 153 NQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GNNGCLGGSREKAFAYI 211
           NQ +CG CWAF+AVAAVEGI KI +G L+ LSEQ+L++C+ N GN+GC GG  + AFA+I
Sbjct: 175 NQGQCGSCWAFSAVAAVEGINKIVTGELVSLSEQELVECARNRGNSGCNGGIMDDAFAFI 234

Query: 212 IQNQGIATEDEYPYQAVPGTCSAAQKP-AAAKISNYEEVPSGDEQALLKAVSMQPVSIAI 270
            +N G+ TE++YPY A+ G C  A+K      I  +E+VP  DE +L KAV+ QPVS+AI
Sbjct: 235 TRNGGLDTEEDYPYTAMDGKCDLAKKSRKVVSIDGFEDVPENDELSLQKAVAHQPVSVAI 294

Query: 271 AAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGT-TEDGANYWLIKNSWGNTWGDAGYMK 329
            A   EFQ Y  G+F G CGT LDH V  VG+GT    G +YW ++NSWG  WG+ GY++
Sbjct: 295 DAGGREFQLYDSGVFTGRCGTSLDHGVVAVGYGTDAATGTDYWTVRNSWGPDWGENGYIR 354

Query: 330 IVRD----EGLCGIGTRSSYPL 347
           + R+     G CGI   +SYP+
Sbjct: 355 MERNVTARTGKCGIAMMASYPI 376


>gi|359359215|gb|AEV41119.1| putative cysteine protease [Oryza officinalis]
          Length = 499

 Score =  281 bits (720), Expect = 2e-73,   Method: Compositional matrix adjust.
 Identities = 146/322 (45%), Positives = 208/322 (64%), Gaps = 18/322 (5%)

Query: 40  EQSVVEIHEKWMAQH---GRSYKDEL-EKEMRLKIFKENLEYIEKANKEGNRT--YKLGT 93
           E     +++ W+A+H   G S+   + E E R ++F +NL++++  N   +    ++LG 
Sbjct: 58  EAEARAVYDLWVARHRHGGDSHNGLVGEYERRFRVFWDNLKFVDAHNARADEHGGFRLGM 117

Query: 94  NQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVT-PIK 152
           N+F+DLTNDEFRA Y G   P+   R    +   Y++  +  +P S+DWRDKGAV  P+K
Sbjct: 118 NRFADLTNDEFRAAYLG-TTPAGRGRHVGEA---YRHDGVEVLPDSVDWRDKGAVVAPVK 173

Query: 153 NQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-NNGCLGGSREKAFAYI 211
           NQ +CG CWAF+AVAAVEGI KI +G L+ LSEQ+L++C+ NG N+GC GG  + AFA+I
Sbjct: 174 NQGQCGSCWAFSAVAAVEGINKIVTGELVSLSEQELVECARNGANSGCNGGMMDDAFAFI 233

Query: 212 IQNQGIATEDEYPYQAVPGTCSAAQKP-AAAKISNYEEVPSGDEQALLKAVSMQPVSIAI 270
            +N G+ TE++YPY A+ G C+ A+K      I  +E+VP  DE +L KAV+ QPVS+AI
Sbjct: 234 ARNGGLDTEEDYPYTAMDGKCNLAKKSRKVVSIDGFEDVPENDELSLQKAVAHQPVSVAI 293

Query: 271 AAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGT-TEDGANYWLIKNSWGNTWGDAGYMK 329
            A   EFQ Y  G+F G CGT LDH V  VG+GT    G +YW ++NSWG  WG+ GY++
Sbjct: 294 DAGGREFQLYDSGVFTGRCGTSLDHGVVAVGYGTDAATGTDYWTVRNSWGPDWGENGYIR 353

Query: 330 IVRD----EGLCGIGTRSSYPL 347
           + R+     G CGI   +SYP+
Sbjct: 354 MERNVTARTGKCGIAMMASYPI 375


>gi|333069454|gb|AEF13978.1| chymopapain [Carica papaya]
          Length = 352

 Score =  281 bits (719), Expect = 3e-73,   Method: Compositional matrix adjust.
 Identities = 143/318 (44%), Positives = 201/318 (63%), Gaps = 16/318 (5%)

Query: 38  THEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFS 97
           T  + ++++ + WM +H + Y+   EK  R +IF++NL YI++ NK+ N +Y LG N F+
Sbjct: 39  TSIERLIQLFDSWMLKHNKIYESIDEKIYRFEIFRDNLMYIDETNKK-NNSYWLGLNGFA 97

Query: 98  DLTNDEFRALYTGY---KMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQ 154
           DL+NDEF+  Y G          H      T+K+    +T+ P S+DWR KGAVTP+KNQ
Sbjct: 98  DLSNDEFKKKYVGSVAEDFTGLEHFDNEDFTYKH----VTNYPQSIDWRAKGAVTPVKNQ 153

Query: 155 KECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQN 214
             CG CWAF+ +A VEG+ KI +GNL++LSEQ+L+DC  N ++GC GG +  +  Y+  N
Sbjct: 154 GSCGSCWAFSTIATVEGVNKIVTGNLLELSEQELVDCDKN-SHGCKGGYQTTSLQYVADN 212

Query: 215 QGIATEDEYPYQAVPGTCSAAQKPAA-AKISNYEEVPSGDEQALLKAVSMQPVSIAIAAY 273
            G+ T   YPYQA    C A  KP    KI+ Y+ VPS  E + L A++ QP+S+ + A 
Sbjct: 213 -GVHTSKVYPYQAKAMQCRATDKPGPKVKITGYKRVPSNCETSFLGALANQPLSVLVEAG 271

Query: 274 STEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR- 332
              FQ YK G+F+G CGT+LDHAVT VG+GT+ DG NY +IKNSWG  WG+ GYM++ R 
Sbjct: 272 GKPFQLYKSGVFDGPCGTKLDHAVTAVGYGTS-DGKNYIIIKNSWGPNWGEKGYMRLKRQ 330

Query: 333 ---DEGLCGIGTRSSYPL 347
               +G CG+   S YP 
Sbjct: 331 SGNSQGTCGVYKSSYYPF 348


>gi|302781881|ref|XP_002972714.1| hypothetical protein SELMODRAFT_98707 [Selaginella moellendorffii]
 gi|300159315|gb|EFJ25935.1| hypothetical protein SELMODRAFT_98707 [Selaginella moellendorffii]
          Length = 446

 Score =  281 bits (718), Expect = 4e-73,   Method: Compositional matrix adjust.
 Identities = 137/293 (46%), Positives = 189/293 (64%), Gaps = 14/293 (4%)

Query: 65  EMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYK---MPSPSHRST 121
           + R + FKEN  YIE+ N+ G  +Y+LG NQFSDLT++EFR  + G +   + SP  +  
Sbjct: 32  DRRFETFKENFRYIEEHNRAGKHSYRLGLNQFSDLTSEEFRQRFLGLRPDLIDSPVLKMP 91

Query: 122 TSSTFK--YQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGN 179
             S  +  +QN+   D+P S+DWR  GAVT  K+Q  CG CWAFA   A+EGI +I +G 
Sbjct: 92  RDSDIEEGFQNV---DLPASVDWRKHGAVTAPKDQGSCGGCWAFATTGAIEGINQIVTGQ 148

Query: 180 LIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQ-KP 238
           L+ LSEQ+L+DC    + GC GG  E A+ +I++N G+ TE +YPY A    C+  +   
Sbjct: 149 LMSLSEQELIDCDKKADKGCDGGLMENAYQFIVENGGLDTETDYPYHASESHCNMKKLNS 208

Query: 239 AAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVT 298
               I  YE +P GDEQALL+AV+ QPVS+AI   S +FQ Y  G+F G CG +++H V 
Sbjct: 209 RVVAIDGYEAIPDGDEQALLRAVAKQPVSVAIEGASKDFQHYASGVFTGHCGEEINHGVL 268

Query: 299 IVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRDE----GLCGIGTRSSYPL 347
           IVG+G TEDG +YW++KNSW  TWGD G++K+ R+     GLC I T +SYP+
Sbjct: 269 IVGYG-TEDGLDYWIVKNSWAATWGDGGFVKMQRNTGKRGGLCSINTLASYPV 320


>gi|2507252|sp|P14080.2|PAPA2_CARPA RecName: Full=Chymopapain; AltName: Full=Papaya proteinase II;
           Short=PPII; Flags: Precursor
 gi|1332461|emb|CAA66378.1| chymopapain [Carica papaya]
          Length = 352

 Score =  280 bits (717), Expect = 5e-73,   Method: Compositional matrix adjust.
 Identities = 143/318 (44%), Positives = 201/318 (63%), Gaps = 16/318 (5%)

Query: 38  THEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFS 97
           T  + ++++ + WM +H + Y+   EK  R +IF++NL YI++ NK+ N +Y LG N F+
Sbjct: 39  TSIERLIQLFDSWMLKHNKIYESIDEKIYRFEIFRDNLMYIDETNKK-NNSYWLGLNGFA 97

Query: 98  DLTNDEFRALYTGY---KMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQ 154
           DL+NDEF+  Y G+         H      T+K+    +T+ P S+DWR KGAVTP+KNQ
Sbjct: 98  DLSNDEFKKKYVGFVAEDFTGLEHFDNEDFTYKH----VTNYPQSIDWRAKGAVTPVKNQ 153

Query: 155 KECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQN 214
             CG CWAF+ +A VEGI KI +GNL++LSEQ+L+DC  + + GC GG +  +  Y + N
Sbjct: 154 GACGSCWAFSTIATVEGINKIVTGNLLELSEQELVDCDKH-SYGCKGGYQTTSLQY-VAN 211

Query: 215 QGIATEDEYPYQAVPGTCSAAQKPAA-AKISNYEEVPSGDEQALLKAVSMQPVSIAIAAY 273
            G+ T   YPYQA    C A  KP    KI+ Y+ VPS  E + L A++ QP+S+ + A 
Sbjct: 212 NGVHTSKVYPYQAKQYKCRATDKPGPKVKITGYKRVPSNCETSFLGALANQPLSVLVEAG 271

Query: 274 STEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR- 332
              FQ YK G+F+G CGT+LDHAVT VG+GT+ DG NY +IKNSWG  WG+ GYM++ R 
Sbjct: 272 GKPFQLYKSGVFDGPCGTKLDHAVTAVGYGTS-DGKNYIIIKNSWGPNWGEKGYMRLKRQ 330

Query: 333 ---DEGLCGIGTRSSYPL 347
               +G CG+   S YP 
Sbjct: 331 SGNSQGTCGVYKSSYYPF 348


>gi|242094002|ref|XP_002437491.1| hypothetical protein SORBIDRAFT_10g028010 [Sorghum bicolor]
 gi|241915714|gb|EER88858.1| hypothetical protein SORBIDRAFT_10g028010 [Sorghum bicolor]
          Length = 397

 Score =  280 bits (717), Expect = 6e-73,   Method: Compositional matrix adjust.
 Identities = 146/334 (43%), Positives = 207/334 (61%), Gaps = 28/334 (8%)

Query: 40  EQSVVEIHEKWMAQHGRSYKD----ELEKEMRLKIFKENLEYIEKANKE---GNRTYKLG 92
           ++ V  ++E W ++HGR   +      E  +RL++F++NL YI+  N E   G  T++LG
Sbjct: 47  DEEVRRMYEAWKSKHGRPRGNCDMAGDEDRLRLEVFRDNLRYIDAHNAEADAGLHTFRLG 106

Query: 93  TNQFSDLTNDEFRALYTGYKMP---SPSHRSTTSSTFKYQNLSMT----------DVPTS 139
              F+DLT +E+R    G++      PS R+  S        S            D+P +
Sbjct: 107 LTPFADLTLEEYRGRALGFRARHRGGPSARAAASRVGSGGTRSHHRRPRPRPRCGDLPDA 166

Query: 140 LDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGC 199
           +DWR  GAVT +KNQ++CG CWAF+AVAA+EGI  I +GNL+ LSEQ+++DC T  ++GC
Sbjct: 167 IDWRQLGAVTDVKNQEQCGGCWAFSAVAAIEGINAIVTGNLVSLSEQEIIDCDTQ-DSGC 225

Query: 200 LGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSA--AQKPAAAKISNYEEVPSGDEQAL 257
            GG  E AF ++I N GI +E +YP+ A  GTC A  A     A I  + EV S +E AL
Sbjct: 226 NGGQMENAFQFVIDNGGIDSEADYPFIATDGTCDANKANDEKVAAIDGFVEVASNNETAL 285

Query: 258 LKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNS 317
            +AV++QPVS+AI A    FQ Y  GIFNG CGT LDH VT+VG+G +E+G  YW++KNS
Sbjct: 286 QEAVAIQPVSVAIDAGGRAFQHYSSGIFNGPCGTNLDHGVTVVGYG-SENGKAYWIVKNS 344

Query: 318 WGNTWGDAGYMKIVRD----EGLCGIGTRSSYPL 347
           W ++WG+AGY++I R+     G CGI   +SYP+
Sbjct: 345 WSDSWGEAGYIRIRRNVFLPVGKCGIAMDASYPV 378


>gi|449469929|ref|XP_004152671.1| PREDICTED: cysteine proteinase RD21a-like [Cucumis sativus]
 gi|449529596|ref|XP_004171784.1| PREDICTED: cysteine proteinase RD21a-like [Cucumis sativus]
          Length = 431

 Score =  280 bits (716), Expect = 7e-73,   Method: Compositional matrix adjust.
 Identities = 140/309 (45%), Positives = 197/309 (63%), Gaps = 10/309 (3%)

Query: 42  SVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTN 101
           +V E+ E W  +HG+SY    EK  RL +F +N E++   N   N +Y L  N ++DLT+
Sbjct: 24  NVSELFEIWCTEHGKSYSSAEEKLYRLGVFADNYEFVTHHNNLDNSSYTLSLNSYADLTH 83

Query: 102 DEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCW 161
            EF+    G+   SP+ R+      +  +L   DVP SLDWR KGAVT +K+Q  CG CW
Sbjct: 84  HEFKVSRLGF---SPALRNFRPVLPQEPSLP-RDVPDSLDWRKKGAVTAVKDQGSCGACW 139

Query: 162 AFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATED 221
           +F+A  A+EGI +I +G+LI LSEQ+L+DC  + N+GC GG  + A+ ++I N GI TE+
Sbjct: 140 SFSATGAMEGINQIMTGSLISLSEQELIDCDRSYNSGCGGGLMDYAYQFVISNHGIDTEN 199

Query: 222 EYPYQAVPGTCSAAQ-KPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSY 280
           +YPYQA  G+C   + +     I  Y ++PS DE  LL+AV+ QPVS+ I      FQ Y
Sbjct: 200 DYPYQARDGSCRKDKLQRNVVTIDGYADIPSNDEGKLLQAVAAQPVSVGICGSERAFQLY 259

Query: 281 KEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGL 336
            +GIF+G C T LDHAV IVG+G +E+G +YW++KNSWG +WG  GYM + R+    EG+
Sbjct: 260 SKGIFSGPCSTSLDHAVLIVGYG-SENGVDYWIVKNSWGKSWGMDGYMHMQRNSGNSEGV 318

Query: 337 CGIGTRSSY 345
           CGI   +SY
Sbjct: 319 CGINKLASY 327


>gi|30685308|ref|NP_566634.2| putative cysteine proteinase [Arabidopsis thaliana]
 gi|30315949|sp|Q9LT77.1|CPR1_ARATH RecName: Full=Probable cysteine proteinase At3g19400; Flags:
           Precursor
 gi|11994462|dbj|BAB02464.1| cysteine proteinase [Arabidopsis thaliana]
 gi|332642715|gb|AEE76236.1| putative cysteine proteinase [Arabidopsis thaliana]
          Length = 362

 Score =  280 bits (715), Expect = 9e-73,   Method: Compositional matrix adjust.
 Identities = 142/316 (44%), Positives = 202/316 (63%), Gaps = 12/316 (3%)

Query: 39  HEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSD 98
           +E  V  ++E+W+ ++ ++Y    EKE R KIFK+NL+++++ N   +RT+++G  +F+D
Sbjct: 36  NETEVRLMYEQWLVENRKNYNGLGEKERRFKIFKDNLKFVDEHNSVPDRTFEVGLTRFAD 95

Query: 99  LTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECG 158
           LTN+EFRA+Y   KM   +  S  +  + Y+   +  +P  +DWR  GAV  +K+Q  CG
Sbjct: 96  LTNEEFRAIYLRKKMER-TKDSVKTERYLYKEGDV--LPDEVDWRANGAVVSVKDQGNCG 152

Query: 159 CCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-NNGCLGGSREKAFAYIIQNQGI 217
            CWAF+AV AVEGI +I +G LI LSEQ+L+DC     N GC GG    AF +I++N GI
Sbjct: 153 SCWAFSAVGAVEGINQITTGELISLSEQELVDCDRGFVNAGCDGGIMNYAFEFIMKNGGI 212

Query: 218 ATEDEYPYQAVP-GTCSAAQ--KPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYS 274
            T+ +YPY A   G C+A +        I  YE+VP  DE++L KAV+ QPVS+AI A S
Sbjct: 213 ETDQDYPYNANDLGLCNADKNNNTRVVTIDGYEDVPRDDEKSLKKAVAHQPVSVAIEASS 272

Query: 275 TEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD- 333
             FQ YK G+  G CG  LDH V +VG+G+T  G +YW+I+NSWG  WGD+GY+K+ R+ 
Sbjct: 273 QAFQLYKSGVMTGTCGISLDHGVVVVGYGSTS-GEDYWIIRNSWGLNWGDSGYVKLQRNI 331

Query: 334 ---EGLCGIGTRSSYP 346
               G CGI    SYP
Sbjct: 332 DDPFGKCGIAMMPSYP 347


>gi|4469153|emb|CAB38314.1| chymopapain isoform II [Carica papaya]
          Length = 352

 Score =  280 bits (715), Expect = 1e-72,   Method: Compositional matrix adjust.
 Identities = 143/318 (44%), Positives = 200/318 (62%), Gaps = 16/318 (5%)

Query: 38  THEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFS 97
           T  + ++++ + WM +H + Y+   EK  R +IF++NL YI++ NK+ N +Y LG N F+
Sbjct: 39  TSIERLIQLFDSWMLKHNKIYESIDEKIYRFEIFRDNLMYIDETNKK-NNSYWLGLNGFA 97

Query: 98  DLTNDEFRALYTGY---KMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQ 154
           DL+NDEF+  Y G+         H      T+K+    +T+ P S+DWR KGAVTP+KNQ
Sbjct: 98  DLSNDEFKKKYVGFVAEDFTGLEHFDNEDFTYKH----VTNYPQSIDWRAKGAVTPVKNQ 153

Query: 155 KECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQN 214
             CG CWAF+ +A VEGI KI +GNL++LSEQ+L+DC  + + GC GG +  +  Y + N
Sbjct: 154 GACGSCWAFSTIATVEGINKIVTGNLLELSEQELVDCDKH-SYGCKGGYQTTSLQY-VAN 211

Query: 215 QGIATEDEYPYQAVPGTCSAAQKPAA-AKISNYEEVPSGDEQALLKAVSMQPVSIAIAAY 273
            G+ T   YPYQA    C A  KP    KI+ Y+ VPS  E + L A++ QP+S  + A 
Sbjct: 212 NGVHTSKVYPYQAKQYKCRATDKPGPKVKITGYKRVPSNCETSFLGALANQPLSFLVEAG 271

Query: 274 STEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR- 332
              FQ YK G+F+G CGT+LDHAVT VG+GT+ DG NY +IKNSWG  WG+ GYM++ R 
Sbjct: 272 GKPFQLYKSGVFDGPCGTKLDHAVTAVGYGTS-DGKNYIIIKNSWGPNWGEKGYMRLKRQ 330

Query: 333 ---DEGLCGIGTRSSYPL 347
               +G CG+   S YP 
Sbjct: 331 SGNSQGTCGVYKSSYYPF 348


>gi|356517368|ref|XP_003527359.1| PREDICTED: LOW QUALITY PROTEIN: vignain-like [Glycine max]
          Length = 332

 Score =  280 bits (715), Expect = 1e-72,   Method: Compositional matrix adjust.
 Identities = 155/339 (45%), Positives = 215/339 (63%), Gaps = 27/339 (7%)

Query: 19  FIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYI 78
           F ++  +   A QV + R+  + S+ E HE+ M ++G+ YKD  ++      FKEN+ YI
Sbjct: 12  FAMLLCMAFLAFQV-TCRTLQDASMXERHEQRMTRYGKVYKDPPKRX-----FKENVNYI 65

Query: 79  EKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPT 138
           E  N   N+ YK G NQF+       R  + G+ M S   R TT   FK++N++ T  P+
Sbjct: 66  EACNNAANKPYKRGINQFAP------RNRFKGH-MCSSIIRITT---FKFENVTAT--PS 113

Query: 139 SLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-NN 197
           ++D R KGAVTPIK+Q +CGCCWAF+AVAA EGI  + +G LI LSEQ+L+DC T G + 
Sbjct: 114 TVDCRQKGAVTPIKDQGQCGCCWAFSAVAATEGIHALSAGKLISLSEQELVDCDTKGVDX 173

Query: 198 GCLGGSREKAFAYIIQNQGIATEDEYP-YQAVPGTCSAAQKPAAAK--ISNYEEVPSGDE 254
           GC GG  + AF +IIQN G+    + P Y  V G C+A +    A   I+ YE+VP+ +E
Sbjct: 174 GCEGGLMDDAFKFIIQNHGLKHXSQLPLYMGVDGKCNANEAAKNAATIITGYEDVPANNE 233

Query: 255 QALL-KAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWL 313
           +A L KAV+  PVS AI A  ++FQ YK G+F G CGT+LDH VT VG+G ++DG  YWL
Sbjct: 234 KAHLQKAVANNPVSEAIDASGSDFQFYKSGVFTGSCGTELDHGVTAVGYGVSDDGTEYWL 293

Query: 314 IKNSWGNTWGDAGYMKIVR----DEGLCGIGTRSSYPLA 348
           +KNSWG  WG+ GY+++ R    +E LCGI  ++SYP A
Sbjct: 294 VKNSWGTEWGEEGYIRMQRGVDSEEALCGIAVQASYPSA 332


>gi|26452046|dbj|BAC43113.1| putative cysteine proteinase RD21A precursor [Arabidopsis thaliana]
          Length = 362

 Score =  279 bits (714), Expect = 1e-72,   Method: Compositional matrix adjust.
 Identities = 142/316 (44%), Positives = 202/316 (63%), Gaps = 12/316 (3%)

Query: 39  HEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSD 98
           +E  V  ++E+W+ ++ ++Y    EKE R KIFK+NL+++++ N   +RT+++G  +F+D
Sbjct: 36  NETEVRLMYEQWLVENRKNYNGLGEKERRFKIFKDNLKFVDEHNSVPDRTFEVGLTRFAD 95

Query: 99  LTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECG 158
           LTN+EFRA+Y   KM   +  S  +  + Y+   +  +P  +DWR  GAV  +K+Q  CG
Sbjct: 96  LTNEEFRAIYLRKKMER-NKDSVKTERYLYKEGDV--LPDEVDWRANGAVVSVKDQGNCG 152

Query: 159 CCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-NNGCLGGSREKAFAYIIQNQGI 217
            CWAF+AV AVEGI +I +G LI LSEQ+L+DC     N GC GG    AF +I++N GI
Sbjct: 153 SCWAFSAVGAVEGINQITTGELISLSEQELVDCDRGFVNAGCDGGIMNYAFEFIMKNGGI 212

Query: 218 ATEDEYPYQAVP-GTCSAAQ--KPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYS 274
            T+ +YPY A   G C+A +        I  YE+VP  DE++L KAV+ QPVS+AI A S
Sbjct: 213 ETDQDYPYNANDLGLCNADKNNNTRVVTIDGYEDVPRDDEKSLKKAVAHQPVSVAIEASS 272

Query: 275 TEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD- 333
             FQ YK G+  G CG  LDH V +VG+G+T  G +YW+I+NSWG  WGD+GY+K+ R+ 
Sbjct: 273 QAFQLYKSGVMTGTCGISLDHGVVVVGYGSTS-GEDYWIIRNSWGLNWGDSGYVKLQRNI 331

Query: 334 ---EGLCGIGTRSSYP 346
               G CGI    SYP
Sbjct: 332 DDPFGKCGIAMMPSYP 347


>gi|90399361|emb|CAJ86180.1| H0212B02.7 [Oryza sativa Indica Group]
          Length = 470

 Score =  279 bits (713), Expect = 1e-72,   Method: Compositional matrix adjust.
 Identities = 146/336 (43%), Positives = 202/336 (60%), Gaps = 24/336 (7%)

Query: 32  VVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKAN---KEGNRT 88
           +VS     E+    ++ +W A+HG++Y    E+E R   F++NL YI++ N     G  +
Sbjct: 25  IVSYGERSEEEARRLYAEWKAEHGKNYNAVGEEERRYAAFRDNLRYIDEHNAAADAGVHS 84

Query: 89  YKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAV 148
           ++LG N+F+DLTN+E+R  Y G +      R  +       N ++   P S+DWR KGAV
Sbjct: 85  FRLGLNRFADLTNEEYRDTYLGLRNKPRRERKVSDRYLAADNEAL---PESVDWRTKGAV 141

Query: 149 TPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAF 208
             IK+Q  CG CWAF+A+AAVEGI +I +G+LI LSEQ+L+DC T+ N GC GG  + AF
Sbjct: 142 AEIKDQGGCGSCWAFSAIAAVEGINQIVTGDLISLSEQELVDCDTSYNEGCNGGLMDYAF 201

Query: 209 AYIIQNQGIATEDEYPYQAVPGTCSAA------------QKPA-AAKISNYEEVPSGDEQ 255
            +II N GI TED+YPY+     C               QK A    I +YE+V    E 
Sbjct: 202 DFIINNGGIDTEDDYPYKGKDERCDVNRVSFVFFAPLVFQKNAKVVTIDSYEDVTPNSET 261

Query: 256 ALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIK 315
           +L KAV+ QPVS+AI A    FQ Y  GIF G CGT LDH V  VG+G TE+G +YW+++
Sbjct: 262 SLQKAVANQPVSVAIEAGGRAFQLYSSGIFTGKCGTALDHGVAAVGYG-TENGKDYWIVR 320

Query: 316 NSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPL 347
           NSWG +WG++GY+++ R+     G CGI    SYPL
Sbjct: 321 NSWGKSWGESGYVRMERNIKASSGKCGIAVEPSYPL 356


>gi|242055323|ref|XP_002456807.1| hypothetical protein SORBIDRAFT_03g043220 [Sorghum bicolor]
 gi|241928782|gb|EES01927.1| hypothetical protein SORBIDRAFT_03g043220 [Sorghum bicolor]
          Length = 369

 Score =  278 bits (711), Expect = 3e-72,   Method: Compositional matrix adjust.
 Identities = 136/317 (42%), Positives = 198/317 (62%), Gaps = 13/317 (4%)

Query: 40  EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDL 99
           ++++ +++E+W   H        EK  R   FKEN+ +I   NK G+R Y+L  N+F D+
Sbjct: 35  DEALWDLYERWQTHHHVHRHHG-EKGRRFGTFKENVRFIHAHNKRGDRPYRLSLNRFGDM 93

Query: 100 TNDEFRALYTGYKM----PSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQK 155
             +EFR+ +   ++     + S  +     F Y  +  TD+P S+DWR +GAVT +K+Q 
Sbjct: 94  GREEFRSTFADSRINDLRRAESPAAPAVPGFMYDGV--TDLPPSVDWRKEGAVTAVKDQG 151

Query: 156 ECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQ 215
            CG CWAF+ V +VEGI  IR+G+L+ LSEQ+L+DC T+  NGC GG  E AF +I    
Sbjct: 152 HCGSCWAFSTVVSVEGINAIRTGSLVSLSEQELIDCDTD-ENGCQGGLMENAFEFIKSYG 210

Query: 216 GIATEDEYPYQAVPGTCSA--AQKPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAY 273
           G+ TE  YPY+A  GTC +  +++     I  ++ VP+G E AL KAV+ QPVS+AI A 
Sbjct: 211 GVTTESAYPYRASNGTCDSVRSRRGQIVSIDGHQMVPTGSEDALAKAVANQPVSVAIDAG 270

Query: 274 STEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR- 332
              FQ Y EG+F G CGT LDH V  VG+G ++DG  YW++KNSWG +WG+ GY+++ R 
Sbjct: 271 GQAFQFYSEGVFTGDCGTDLDHGVAAVGYGVSDDGTAYWIVKNSWGPSWGEGGYIRMQRG 330

Query: 333 --DEGLCGIGTRSSYPL 347
             + GLCGI   +S+P+
Sbjct: 331 AGNGGLCGIAMEASFPI 347


>gi|357130490|ref|XP_003566881.1| PREDICTED: actinidain-like [Brachypodium distachyon]
          Length = 350

 Score =  278 bits (710), Expect = 3e-72,   Method: Compositional matrix adjust.
 Identities = 140/330 (42%), Positives = 197/330 (59%), Gaps = 7/330 (2%)

Query: 21  IITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEK 80
           ++ L+ +     V+++     +V   HE+WMA+ GR Y D  EK  R  +F  N  Y++ 
Sbjct: 14  LLVLVATAVFHAVAAQGEAGLTVAARHEQWMAKFGRVYTDANEKARRQAVFGANARYVDA 73

Query: 81  ANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSL 140
            N+ GNRTY LG N+FSDLT++EF   + GY+   P   + +        L+  ++P S 
Sbjct: 74  VNRAGNRTYTLGLNEFSDLTDNEFAKTHLGYREFRPETANISKGVDPGYGLA-GNIPKSF 132

Query: 141 DWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCL 200
           DWR KGAVT +K+Q  CGCCWAFAAVAA EG+ KI  G LI +SEQQ+LDC+T GNN C 
Sbjct: 133 DWRTKGAVTEVKSQGGCGCCWAFAAVAATEGLVKIAKGTLISMSEQQVLDCTT-GNNTCK 191

Query: 201 GGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKP-AAAKISNYEEVP-SGDEQALL 258
           GG    A +Y+  + G+ TE++Y Y A  G C     P  A  + + E +P  G+E  L 
Sbjct: 192 GGYMNDALSYVFASGGLQTEEDYEYNAEKGACRRDVTPNPATSVGHAEYMPLDGNEFLLQ 251

Query: 259 KAVSMQPVSIAIAAYSTEFQSYKEGIFNG--VCGTQLDHAVTIVGFGTTEDGAN-YWLIK 315
           K V+ QPV +A+ AY T+F++Y  G+F G   CG  LDH  T+VG+G  + G   YWL+K
Sbjct: 252 KLVARQPVVVAVEAYGTDFKNYGGGVFTGSPSCGQNLDHFFTVVGYGFADGGKQMYWLVK 311

Query: 316 NSWGNTWGDAGYMKIVRDEGLCGIGTRSSY 345
           N WG +WG++GYM+I R       G  ++Y
Sbjct: 312 NQWGTSWGESGYMRIARGSSARNCGMTNNY 341


>gi|2160175|gb|AAB60738.1| Strong similarity to Dianthus cysteine proteinase (gb|U17135)
           [Arabidopsis thaliana]
          Length = 416

 Score =  278 bits (710), Expect = 4e-72,   Method: Compositional matrix adjust.
 Identities = 139/318 (43%), Positives = 204/318 (64%), Gaps = 18/318 (5%)

Query: 43  VVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTND 102
           + E+ + W  +HG++Y  E E++ R++IFK+N +++ + N   N TY L  N F+DLT+ 
Sbjct: 26  ISELFDDWCQKHGKTYGSEEERQQRIQIFKDNHDFVTQHNLITNATYSLSLNAFADLTHH 85

Query: 103 EFRALYTGYKMPSPSHRSTTSSTFKYQNLS-MTDVPTSLDWRDKGAVTPIKNQKECGCCW 161
           EF+A   G  + +PS    +    K Q+L     VP S+DWR KGAVT +K+Q  CG CW
Sbjct: 86  EFKASRLGLSVSAPSVIMAS----KGQSLGGSVKVPDSVDWRKKGAVTNVKDQGSCGACW 141

Query: 162 AFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATED 221
           +F+A  A+EGI +I +G+LI LSEQ+L+DC  + N GC GG  + AF ++I+N GI TE 
Sbjct: 142 SFSATGAMEGINQIVTGDLISLSEQELIDCDKSYNAGCNGGLMDYAFEFVIKNHGIDTEK 201

Query: 222 EYPYQAVPGTCSAAQ-KPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAA-------Y 273
           +YPYQ   GTC   + K     I +Y  V S DE+AL++AV+ QPVS+ I         Y
Sbjct: 202 DYPYQERDGTCKKDKLKQKVVTIDSYAGVKSNDEKALMEAVAAQPVSVGICGSERAFQLY 261

Query: 274 STEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD 333
           S++F    +GIF+G C T LDHAV IVG+G +++G +YW++KNSWG +WG  G+M + R+
Sbjct: 262 SSKFYLLMQGIFSGPCSTSLDHAVLIVGYG-SQNGVDYWIVKNSWGKSWGMDGFMHMQRN 320

Query: 334 ----EGLCGIGTRSSYPL 347
               +G+CGI   +SYP+
Sbjct: 321 TENSDGVCGINMLASYPI 338


>gi|302812789|ref|XP_002988081.1| hypothetical protein SELMODRAFT_183539 [Selaginella moellendorffii]
 gi|300144187|gb|EFJ10873.1| hypothetical protein SELMODRAFT_183539 [Selaginella moellendorffii]
          Length = 425

 Score =  278 bits (710), Expect = 4e-72,   Method: Compositional matrix adjust.
 Identities = 139/309 (44%), Positives = 193/309 (62%), Gaps = 15/309 (4%)

Query: 50  WMAQHGRSYKDELE-KEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALY 108
           W A+ G+         + R + FKEN  YIE+ N+ G  +Y+LG NQFSDLT++EFR  +
Sbjct: 16  WCAKFGKECASSNSLGDHRFETFKENFRYIEEHNRAGKHSYRLGLNQFSDLTSEEFRQRF 75

Query: 109 TGYK---MPSPSHRSTTSSTFK--YQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAF 163
            G +   + SP  +    S  +  +QN+   D+P S+DWR  GAVT  K+Q  CG CWAF
Sbjct: 76  LGLRPDLIDSPVLKMPRDSDIEEGFQNV---DLPASVDWRQHGAVTAPKDQGSCGGCWAF 132

Query: 164 AAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEY 223
           A   A+EGI +I +G L+ LSEQ+L+DC    + GC GG  E A+ +I++N G+ TE +Y
Sbjct: 133 ATTGAIEGINQIVTGQLVSLSEQELIDCDKKADKGCDGGLMENAYQFIVENGGLDTETDY 192

Query: 224 PYQAVPGTCSAAQ-KPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKE 282
           PY A    C+  +       I  Y+ +P GDEQALL AV+ QPVS+AI   S +FQ Y  
Sbjct: 193 PYHASESHCNMKKLNSRVVAIDGYKAIPEGDEQALLLAVAKQPVSVAIEGASKDFQHYAS 252

Query: 283 GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRDE----GLCG 338
           G+F G CG +++H V IVG+G TEDG +YW++KNSW  TWGD G++K+ R+     GLC 
Sbjct: 253 GVFTGHCGEEINHGVLIVGYG-TEDGLDYWIVKNSWAATWGDGGFVKMQRNTGKRGGLCS 311

Query: 339 IGTRSSYPL 347
           I T +SYP+
Sbjct: 312 INTLASYPV 320


>gi|57118009|gb|AAW34136.1| cysteine protease gp3a [Zingiber officinale]
          Length = 475

 Score =  277 bits (709), Expect = 4e-72,   Method: Compositional matrix adjust.
 Identities = 153/344 (44%), Positives = 214/344 (62%), Gaps = 16/344 (4%)

Query: 13  INTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFK 72
           ++  P   I+TL    A    + RS  E  +  I+++W  +H  +  D+   + RL++FK
Sbjct: 21  VSVVPPLDILTLSKQ-AWAAPAGRSDEEVRI--IYQEWRVKHRPAENDQYVGDYRLEVFK 77

Query: 73  ENLEYIEKANKEGNR---TYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQ 129
           ENL ++++ N   +R    Y+LG N+F+DLTN+E+RA +   +  S   RST+       
Sbjct: 78  ENLRFVDEHNAAADRGEHAYRLGMNRFADLTNEEYRARFL--RDLSRLGRSTSGEISNQY 135

Query: 130 NLSMTDV-PTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQL 188
            L   DV P S+DWR+KGAV  +KNQ  CG CWAFAA+AAVEGI +I +G+LI LSEQQL
Sbjct: 136 RLREGDVLPDSIDWREKGAVVAVKNQGRCGSCWAFAAIAAVEGINQIVTGDLISLSEQQL 195

Query: 189 LDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPA-AAKISNYE 247
           +DCST  N GC GG   +AF YII N G+ +E+ YPY    GTC+  ++ A    I +Y 
Sbjct: 196 VDCSTR-NYGCEGGWPYRAFQYIINNGGVNSEEHYPYTGTNGTCNTTKENAHVVSIDSYR 254

Query: 248 EVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTED 307
            VPS DE++L KA + QP+S+ I A    FQ Y  GIF G C T L+H VT+VG+G TE+
Sbjct: 255 NVPSNDEKSLQKAAANQPISVGIDASGRNFQLYHSGIFTGSCNTSLNHGVTVVGYG-TEN 313

Query: 308 GANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPL 347
           G +YW++KNSWG  WG++GY+ + R+     G CGI    SYP+
Sbjct: 314 GNDYWIVKNSWGENWGNSGYILMERNIAESSGKCGIAISPSYPI 357


>gi|1514953|dbj|BAA11170.1| cysteine proteinase [Oryza sativa (japonica cultivar-group)]
          Length = 368

 Score =  277 bits (708), Expect = 6e-72,   Method: Compositional matrix adjust.
 Identities = 140/329 (42%), Positives = 194/329 (58%), Gaps = 14/329 (4%)

Query: 28  CASQVVSSRSTH-EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGN 86
           CA+     R    ++++ +++E+W   H    +   EK  R   FK+N+ YI + NK   
Sbjct: 26  CAAIPFDERDLESDEALWDLYERWQEHH-HVPRHHGEKHRRFGAFKDNVRYIHEHNKRAP 84

Query: 87  RTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSST---FKYQNLSMTDVPTSLDWR 143
               L  N+F D+  +EFRA + G            +     F Y+ +   D+P ++DWR
Sbjct: 85  GYAPL--NRFGDMGREEFRATFAGSHANDLRRDGLAAPPLPGFMYEGVR--DLPRAVDWR 140

Query: 144 DKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGS 203
            KGAVT +K+Q +CG CWAF+ V +VEGI  IR+G L+ LSEQ+L+DC T  N+GC GG 
Sbjct: 141 RKGAVTGVKDQGKCGSCWAFSTVVSVEGINAIRTGRLVSLSEQELIDCDTADNSGCQGGL 200

Query: 204 REKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQ-KPAAAKISNYEEVPSGDEQALLKAVS 262
            E AF YI  + GI TE  YPY+A  GTC A + +     I  ++ VP+  E AL KAV+
Sbjct: 201 MENAFEYIKHSGGITTESAYPYRAANGTCDAVRARGGLVVIDGHQNVPANSEAALAKAVA 260

Query: 263 MQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTW 322
            QPVS+AI A    FQ Y +G+F G CGT LDH V +VG+G T DG  YW++KNSWG  W
Sbjct: 261 NQPVSVAIDAGDQSFQFYSDGVFAGDCGTDLDHGVAVVGYGETNDGTEYWIVKNSWGTAW 320

Query: 323 GDAGYMKIVRDE----GLCGIGTRSSYPL 347
           G+ GY+++ RD     GLCGI   +SYP+
Sbjct: 321 GEGGYIRMQRDSGYDGGLCGIAMEASYPV 349


>gi|359359166|gb|AEV41071.1| putative oryzain beta chain precursor [Oryza minuta]
          Length = 464

 Score =  276 bits (707), Expect = 8e-72,   Method: Compositional matrix adjust.
 Identities = 140/307 (45%), Positives = 209/307 (68%), Gaps = 12/307 (3%)

Query: 47  HEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKAN-KEGNRTYKLGTNQFSDLTNDEFR 105
           ++ W+A++GRSY    E E R ++F +NL + +  N +  +  ++LG N+F+DLTN+EFR
Sbjct: 53  YDLWLAENGRSYNALGEHERRFRVFWDNLRFADAHNARADDHGFRLGMNRFADLTNEEFR 112

Query: 106 ALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAA 165
           A + G K+   S     ++  +Y++  + ++P S+DWR+KGAV P+KNQ +CG CWAF+A
Sbjct: 113 ATFLGAKVVERSR----AAGERYRHDGVEELPESVDWREKGAVAPVKNQGQCGSCWAFSA 168

Query: 166 VAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGS-REKAFAYIIQNQGIATEDEYP 224
           V+ VE I ++ +G +I LSEQ+L++CSTNG NG   G   + AF +II+N GI TED+YP
Sbjct: 169 VSTVESINQLVTGEMITLSEQELVECSTNGQNGGCNGGLMDDAFDFIIKNGGIDTEDDYP 228

Query: 225 YQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEG 283
           Y+AV G C   ++ A    I  +E+VP  DE++L KAV+ QPVS+AI A   EFQ Y  G
Sbjct: 229 YKAVDGKCDINRENAKVVSIDGFEDVPQNDEKSLQKAVAHQPVSVAIEAGGREFQLYHSG 288

Query: 284 IFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGI 339
           +F+G CGT LDH V  VG+G T++G +YW+++NSWG  WG++GY+++ R+     G CGI
Sbjct: 289 VFSGRCGTSLDHGVVAVGYG-TDNGKDYWIVRNSWGPKWGESGYVRMERNINVTTGKCGI 347

Query: 340 GTRSSYP 346
              +SYP
Sbjct: 348 AMMASYP 354


>gi|4426617|gb|AAD20453.1| cysteine endopeptidase precursor [Oryza sativa]
          Length = 368

 Score =  276 bits (707), Expect = 8e-72,   Method: Compositional matrix adjust.
 Identities = 140/329 (42%), Positives = 194/329 (58%), Gaps = 14/329 (4%)

Query: 28  CASQVVSSRSTH-EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGN 86
           CA+     R    ++++ +++E+W   H    +   EK  R   FK+N+ YI + NK   
Sbjct: 26  CAAIPFDERDLESDEALWDLYERWQEHH-HVPRHHGEKHRRFGAFKDNVRYIHEHNKRAP 84

Query: 87  RTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSST---FKYQNLSMTDVPTSLDWR 143
               L  N+F D+  +EFRA + G            +     F Y+ +   D+P ++DWR
Sbjct: 85  GYPPL--NRFGDMGREEFRATFAGSHANDLRRDGLAAPPLPGFMYEGVR--DLPRAVDWR 140

Query: 144 DKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGS 203
            KGAVT +K+Q +CG CWAF+ V +VEGI  IR+G L+ LSEQ+L+DC T  N+GC GG 
Sbjct: 141 RKGAVTGVKDQGKCGSCWAFSTVVSVEGINAIRTGRLVSLSEQELIDCDTADNSGCQGGL 200

Query: 204 REKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQ-KPAAAKISNYEEVPSGDEQALLKAVS 262
            E AF YI  + GI TE  YPY+A  GTC A + +     I  ++ VP+  E AL KAV+
Sbjct: 201 MENAFEYIKHSGGITTESAYPYRAANGTCDAVRARGGLVVIDGHQNVPANSEAALAKAVA 260

Query: 263 MQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTW 322
            QPVS+AI A    FQ Y +G+F G CGT LDH V +VG+G T DG  YW++KNSWG  W
Sbjct: 261 NQPVSVAIDAGDQSFQFYSDGVFAGDCGTDLDHGVAVVGYGETNDGTEYWIVKNSWGTAW 320

Query: 323 GDAGYMKIVRDE----GLCGIGTRSSYPL 347
           G+ GY+++ RD     GLCGI   +SYP+
Sbjct: 321 GEGGYIRMQRDSGYDGGLCGIAMEASYPV 349


>gi|255635584|gb|ACU18142.1| unknown [Glycine max]
          Length = 345

 Score =  276 bits (707), Expect = 9e-72,   Method: Compositional matrix adjust.
 Identities = 136/309 (44%), Positives = 204/309 (66%), Gaps = 10/309 (3%)

Query: 43  VVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTND 102
           ++E+ E WM++HG+ Y+   EK +R +IFK+NL++I++ NK  +  Y LG N+F+DL++ 
Sbjct: 43  LIELFESWMSKHGKIYQSIEEKLLRFEIFKDNLKHIDERNKVVS-NYWLGLNEFADLSHQ 101

Query: 103 EFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWA 162
           EF+  Y G K+   S R  +   F Y+++   ++P S+DWR KGAV P+KNQ  CG CWA
Sbjct: 102 EFKNKYLGLKVDY-SRRRESPEEFTYKDV---ELPKSVDWRKKGAVAPVKNQGSCGSCWA 157

Query: 163 FAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDE 222
           F+ VAAVEGI +I +GNL  LSEQ+L+DC    +NGC GG  + AF++I++N G+  E++
Sbjct: 158 FSTVAAVEGINQIVTGNLTSLSEQELIDCDRTYSNGCNGGLMDYAFSFIVENGGLHKEED 217

Query: 223 YPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYK 281
           YPY    GTC   ++      IS Y +VP  +EQ+LLKA++ Q +S+AI A   +FQ Y 
Sbjct: 218 YPYIMEEGTCEMTKEETEVVTISGYHDVPQNNEQSLLKALANQSLSVAIEASGRDFQFYS 277

Query: 282 EGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKI---VRDEGLCG 338
            G+F+G CG+ LDH V  VG+GT + G +Y ++KNSWG+ WG+ GY+++   +   G   
Sbjct: 278 GGVFDGHCGSDLDHGVAAVGYGTAK-GVDYIIVKNSWGSKWGEKGYIRMRGTLETRGNLR 336

Query: 339 IGTRSSYPL 347
               +SYPL
Sbjct: 337 YLQMASYPL 345


>gi|57118011|gb|AAW34137.1| cysteine protease gp3b [Zingiber officinale]
          Length = 466

 Score =  276 bits (706), Expect = 1e-71,   Method: Compositional matrix adjust.
 Identities = 146/317 (46%), Positives = 205/317 (64%), Gaps = 13/317 (4%)

Query: 40  EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNR---TYKLGTNQF 96
           ++ V  I+++W A+H  +  D+   + RL++FKENL ++++ N   +R    Y+LG N+F
Sbjct: 36  DEEVRIIYQEWRAKHRPAENDQYVGDYRLEVFKENLRFVDEHNAAADRGEHAYRLGMNRF 95

Query: 97  SDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDV-PTSLDWRDKGAVTPIKNQK 155
           +DLTN+E+RA +   +  S   RST+        L   DV P S+DWR+KGAV  +K+Q 
Sbjct: 96  ADLTNEEYRARFL--RDLSRLGRSTSGEISNQYRLREGDVLPDSIDWREKGAVVAVKSQG 153

Query: 156 ECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQ 215
            CG CWAFAA+A VEGI +I +G+LI LSEQQL+DCST  N+GC GG   +AF YII N 
Sbjct: 154 RCGSCWAFAAIATVEGINQIVTGDLISLSEQQLVDCSTR-NHGCEGGWPYRAFQYIINNG 212

Query: 216 GIATEDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYS 274
           G+ +E+ YPY    GTC+  +  A    I +Y  VPS DE++L KAV+ QP+S+ I A  
Sbjct: 213 GVNSEEHYPYTGTNGTCNTTKGNAHVVSIDSYRNVPSNDEKSLQKAVANQPISVGINASG 272

Query: 275 TEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD- 333
             FQ Y  GIF G C T L+H VT+VG+GT  +G +YW++KNSWG +WGD+GY+ + R+ 
Sbjct: 273 RNFQLYHSGIFTGSCNTSLNHGVTVVGYGTV-NGNDYWIVKNSWGESWGDSGYILMERNI 331

Query: 334 ---EGLCGIGTRSSYPL 347
               G CGI    SYP+
Sbjct: 332 AESSGKCGIAISPSYPI 348


>gi|4469155|emb|CAB38315.1| chymopapain isoform III [Carica papaya]
          Length = 361

 Score =  276 bits (705), Expect = 1e-71,   Method: Compositional matrix adjust.
 Identities = 142/318 (44%), Positives = 199/318 (62%), Gaps = 16/318 (5%)

Query: 38  THEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFS 97
           T  + ++++ + WM +H + Y+   EK  R +IF++NL YI++ NK+ N +Y LG N F+
Sbjct: 39  TSIERLIQLFDSWMLKHNKIYESIDEKIYRFEIFRDNLMYIDETNKK-NNSYWLGLNGFA 97

Query: 98  DLTNDEFRALYTGY---KMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQ 154
           DL+NDEF+  Y G+         H      T+K+    +T+ P S+DWR KGAVTP+KNQ
Sbjct: 98  DLSNDEFKKKYVGFVAEDFTGLEHFDNEDFTYKH----VTNYPQSIDWRAKGAVTPVKNQ 153

Query: 155 KECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQN 214
             CG CWAF+ +A VEGI KI +GNL++LSEQ+L+DC  + + GC GG +  +  Y + N
Sbjct: 154 GACGSCWAFSTIATVEGINKIVTGNLLELSEQELVDCDKH-SYGCKGGYQTTSLQY-VAN 211

Query: 215 QGIATEDEYPYQAVPGTCSAAQKPAA-AKISNYEEVPSGDEQALLKAVSMQPVSIAIAAY 273
            G+ T   YP QA    C A  KP    KI+ Y+ VPS  E + L A++ QP+S  + A 
Sbjct: 212 NGVHTSKVYPCQAKQYKCRATDKPGPKVKITGYKRVPSNCETSFLGALANQPLSFLVEAG 271

Query: 274 STEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR- 332
              FQ YK G+F+G CGT+LDHAVT VG+GT+ DG NY +IKNSWG  WG+ GYM++ R 
Sbjct: 272 GKPFQLYKSGVFDGPCGTKLDHAVTAVGYGTS-DGKNYIIIKNSWGPNWGEKGYMRLKRQ 330

Query: 333 ---DEGLCGIGTRSSYPL 347
               +G CG+   S YP 
Sbjct: 331 SGNSQGTCGVYKSSYYPF 348


>gi|224081756|ref|XP_002306486.1| predicted protein [Populus trichocarpa]
 gi|222855935|gb|EEE93482.1| predicted protein [Populus trichocarpa]
          Length = 352

 Score =  276 bits (705), Expect = 1e-71,   Method: Compositional matrix adjust.
 Identities = 139/311 (44%), Positives = 197/311 (63%), Gaps = 9/311 (2%)

Query: 44  VEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDE 103
           + +++ W+A+HG++Y    E+  R +IFK NL +I++ N + N TYK+G  +F+DLTN+E
Sbjct: 1   MSMYKWWLAKHGKAYNGLGEEAERFEIFKNNLRFIDEHNSQ-NHTYKVGLTKFADLTNEE 59

Query: 104 FRALYTGYKMPSPSH-RSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWA 162
           +RA++ G +  +      + S + +Y   +   +P S+DWR KGAV PIK+Q  CG CWA
Sbjct: 60  YRAMFLGTRSDAKRRLMKSKSPSERYAFKAGDKLPESVDWRAKGAVNPIKDQGSCGSCWA 119

Query: 163 FAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDE 222
           F+ VAAVEGI +I +G LI LSEQ+L+DC    N GC GG  + AF +II N G+ TE +
Sbjct: 120 FSTVAAVEGINQIVTGELISLSEQELVDCDRTYNAGCNGGLMDYAFQFIINNGGLDTEKD 179

Query: 223 YPYQA-VPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYK 281
           YPY            K  A  I  +E+V   DE+AL KAV+ QPVS+AI A     Q Y+
Sbjct: 180 YPYVGDDDKCDKDKMKTKAVSIDGFEDVLPYDEKALQKAVAHQPVSVAIEASGMALQFYQ 239

Query: 282 EGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD-----EGL 336
            G+F G CGT LDH V +VG+  +E+G +YWL++NSWG  WG+ GY+K+ R+      G 
Sbjct: 240 SGVFTGECGTALDHGVVVVGY-ASENGLDYWLVRNSWGTEWGEHGYIKMQRNVGDTYTGR 298

Query: 337 CGIGTRSSYPL 347
           CGI   SSYP+
Sbjct: 299 CGIAMESSYPV 309


>gi|944916|gb|AAA74430.1| cysteine proteinase [Mesembryanthemum crystallinum]
          Length = 367

 Score =  275 bits (704), Expect = 2e-71,   Method: Compositional matrix adjust.
 Identities = 147/318 (46%), Positives = 207/318 (65%), Gaps = 22/318 (6%)

Query: 40  EQSVVEIHEKWMAQH--GRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFS 97
           ++++ +++E+W + +   RS+    EK+ R  +FKEN++YI + NK  ++ YKL  NQF 
Sbjct: 37  DETLWDLYERWRSVYTSARSFG---EKQNRFHVFKENVKYINEVNKM-DKPYKLRLNQFG 92

Query: 98  DLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKEC 157
           DLT  EF   Y   K+   +     S  F Y+N+   +VP S+DWR KGAVTP+KNQ  C
Sbjct: 93  DLTPSEFARTYANSKIIEGTRNE--SGGFMYENV---EVPRSIDWRVKGAVTPVKNQGRC 147

Query: 158 GCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGI 217
           G CWAF+A AAVEGI +I +G LI LSEQQL+DC T  N+GC GG+  +AF YI Q  GI
Sbjct: 148 GGCWAFSAAAAVEGINQITTGQLISLSEQQLIDCDTQ-NSGCRGGTMGRAFEYIKQRGGI 206

Query: 218 ATEDEYPYQAVPGTC--SAAQKPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAY-- 273
            +E  YPY+A  G C  +  Q+P  + I  Y  +    E A+LK ++ QPVS+A+ A   
Sbjct: 207 TSEANYPYKAQAGMCKNNLIQRPTVS-IDGYYNIRR-SEDAVLKILAHQPVSVAVDATTW 264

Query: 274 -STEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR 332
            S ++  Y +G+F G CGT+L+H VT VG+GTT DG +YW+IKNSWG TWG+ GYM+++R
Sbjct: 265 SSLDWMFYFQGVFTGPCGTKLNHGVTAVGYGTTNDGYDYWIIKNSWGETWGERGYMRMLR 324

Query: 333 D---EGLCGIGTRSSYPL 347
                GLCGI  ++S+P+
Sbjct: 325 GVSPYGLCGIAMQASFPI 342


>gi|356509992|ref|XP_003523725.1| PREDICTED: oryzain alpha chain-like [Glycine max]
          Length = 439

 Score =  275 bits (704), Expect = 2e-71,   Method: Compositional matrix adjust.
 Identities = 138/313 (44%), Positives = 202/313 (64%), Gaps = 16/313 (5%)

Query: 45  EIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNR-----TYKLGTNQFSDL 99
           E+ EKW  +H ++Y  E EK  RLK+F++N  ++ + N+  N      +Y L  N F+DL
Sbjct: 31  ELFEKWCKEHSKTYSSEEEKLYRLKVFEDNYAFVAQHNQNANNNNNNSSYTLSLNAFADL 90

Query: 100 TNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGC 159
           T+ EF+    G  +     +   +     Q+  +  +P+ +DWR  GAVTP+K+Q  CG 
Sbjct: 91  THHEFKTTRLGLPLTLLRFKRPQNQ----QSRDLLHIPSQIDWRQSGAVTPVKDQASCGA 146

Query: 160 CWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIAT 219
           CWAF+A  A+EGI KI +G+L+ LSEQ+L+DC T+ N+GC GG  + A+ ++I N+GI T
Sbjct: 147 CWAFSATGAIEGINKIVTGSLVSLSEQELIDCDTSYNSGCGGGLMDFAYQFVIDNKGIDT 206

Query: 220 EDEYPYQAVPGTCSAAQ-KPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQ 278
           ED+YPYQA   +CS  + K  A  I +Y +VP  +E+ +LKAV+ QPVS+ I     EFQ
Sbjct: 207 EDDYPYQARQRSCSKDKLKRRAVTIEDYVDVPPSEEE-ILKAVASQPVSVGICGSEREFQ 265

Query: 279 SYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----E 334
            Y +GIF G C T LDHAV IVG+G +E+G +YW++KNSWG  WG  GY+ ++R+    +
Sbjct: 266 LYSKGIFTGPCSTFLDHAVLIVGYG-SENGVDYWIVKNSWGKYWGMNGYIHMIRNSGNSK 324

Query: 335 GLCGIGTRSSYPL 347
           G+CGI T +SYP+
Sbjct: 325 GICGINTLASYPV 337


>gi|90265242|emb|CAH67695.1| H0624F09.3 [Oryza sativa Indica Group]
          Length = 494

 Score =  275 bits (703), Expect = 2e-71,   Method: Compositional matrix adjust.
 Identities = 139/295 (47%), Positives = 194/295 (65%), Gaps = 14/295 (4%)

Query: 63  EKEMRLKIFKENLEYIEKANKEGNRT--YKLGTNQFSDLTNDEFRALYTGYKMPSPSHRS 120
           E E R ++F +NL++++  N   +    ++LG N+F+DLTN EFRA Y G   P+   R 
Sbjct: 84  EHERRFRVFWDNLKFVDAHNARADERGGFRLGMNRFADLTNGEFRATYLG-TTPAGRGRR 142

Query: 121 TTSSTFKYQNLSMTDVPTSLDWRDKGAVT-PIKNQKECGCCWAFAAVAAVEGITKIRSGN 179
              +   Y++  +  +P S+DWRDKGAV  P+KNQ +CG CWAF+AVAAVEGI KI +G 
Sbjct: 143 VGEA---YRHDGVEALPDSVDWRDKGAVVAPVKNQGQCGSCWAFSAVAAVEGINKIVTGE 199

Query: 180 LIQLSEQQLLDCSTNG-NNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKP 238
           L+ LSEQ+L++C+ NG N+GC GG  + AFA+I +N G+ TE++YPY A+ G C+ A++ 
Sbjct: 200 LVSLSEQELVECARNGQNSGCNGGIMDDAFAFIARNGGLDTEEDYPYTAMDGKCNLAKRS 259

Query: 239 -AAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAV 297
                I  +E+VP  DE +L KAV+ QPVS+AI A   EFQ Y  G+F G CGT LDH V
Sbjct: 260 RKVVSIDGFEDVPENDELSLQKAVAHQPVSVAIDAGGREFQLYDSGVFTGRCGTNLDHGV 319

Query: 298 TIVGFGT-TEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPL 347
             VG+GT    GA YW ++NSWG  WG+ GY+++ R+     G CGI   +SYP+
Sbjct: 320 VAVGYGTDAATGAAYWTVRNSWGPDWGENGYIRMERNVTARTGKCGIAMMASYPI 374


>gi|641905|gb|AAC49406.1| cysteine proteinase [Zinnia violacea]
          Length = 342

 Score =  275 bits (703), Expect = 2e-71,   Method: Compositional matrix adjust.
 Identities = 131/296 (44%), Positives = 197/296 (66%), Gaps = 6/296 (2%)

Query: 41  QSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLT 100
             V+ + E  + +H + Y+   EK  R +IF +NL++I++ NK+ +  Y LG N+F+DLT
Sbjct: 43  HKVIHLFESSLVKHSKIYESFDEKLHRFEIFMDNLKHIDETNKKVS-NYWLGLNEFADLT 101

Query: 101 NDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCC 160
           ++EF+  + G+K      +  +   F+Y++    D+P S+DWR KGAV+P+KNQ +CG C
Sbjct: 102 HEEFKNKFLGFKGELAERKDESIEQFRYRDF--VDLPKSVDWRKKGAVSPVKNQGQCGSC 159

Query: 161 WAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATE 220
           WAF+ VAAVEGI +I +GNL  LSEQ+L+DC T  NNGC GG  + AFAY+ +N G+  E
Sbjct: 160 WAFSTVAAVEGINQIVTGNLTVLSEQELIDCDTTFNNGCNGGLMDYAFAYVTRN-GLHKE 218

Query: 221 DEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQS 279
           +EYPY    GTC   +  +    IS Y +VP  +E + LKA++ QP+S+AI A   +FQ 
Sbjct: 219 EEYPYIMSEGTCDEKRDASEKVTISGYHDVPRNNEDSFLKALANQPISVAIEASGRDFQF 278

Query: 280 YKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRDEG 335
           Y  G+F+G CGT+LDH V  VG+GT++ G +Y +++NSWG  WG+ GY+++ R+ G
Sbjct: 279 YSGGVFDGHCGTELDHGVAAVGYGTSK-GLDYVIVRNSWGPKWGEKGYIRMKRNTG 333


>gi|386648112|gb|AFJ15103.1| mexicain-like cystein protease, partial [Jacaratia mexicana]
          Length = 348

 Score =  275 bits (703), Expect = 2e-71,   Method: Compositional matrix adjust.
 Identities = 145/315 (46%), Positives = 198/315 (62%), Gaps = 17/315 (5%)

Query: 38  THEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFS 97
           T  + ++ + E WM +H R Y +  EK  R +IFK+NL YI++ NK+ N +Y LG N+F 
Sbjct: 39  TSTERLIRLFESWMLKHDRVYNNIEEKIHRFEIFKDNLMYIDETNKK-NNSYWLGLNEFV 97

Query: 98  DLTNDEFRALYTG-YKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKE 156
           DLT+DEF+  Y G       +   +    F Y+++   D P S+DWRDKGAVTP+K    
Sbjct: 98  DLTHDEFKEKYVGSIGEDFVTIEQSNDEEFPYKHV--VDYPESIDWRDKGAVTPVK-PNP 154

Query: 157 CGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQG 216
           CG CWAF+ VA VEGI KI +G LI LSEQ+LLDC    ++GC GG +  +  Y++ N G
Sbjct: 155 CGSCWAFSTVATVEGINKIVTGKLISLSEQELLDCDRR-SHGCKGGYQTTSLQYVVDN-G 212

Query: 217 IATEDEYPYQAVPGTCSAAQKPAA-AKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYST 275
           + TE EYPY+   G C A +K     +I+ Y+ VP+ DE +L++A++ QPVS+ + +   
Sbjct: 213 VHTEKEYPYEKKQGKCRAKEKKGTKVQITGYKRVPANDEISLIQAIANQPVSVLLESKGR 272

Query: 276 EFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR--- 332
            FQ YK GIFNG CGT+LDHAVT +G+G T     Y LIKNSWG  WG+ GY+KI R   
Sbjct: 273 AFQLYKGGIFNGPCGTKLDHAVTAIGYGKT-----YILIKNSWGPNWGEKGYLKIKRASG 327

Query: 333 -DEGLCGIGTRSSYP 346
             EG CG+   S +P
Sbjct: 328 KSEGTCGVYKSSYFP 342


>gi|326494040|dbj|BAJ85482.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 355

 Score =  275 bits (703), Expect = 2e-71,   Method: Compositional matrix adjust.
 Identities = 139/312 (44%), Positives = 188/312 (60%), Gaps = 14/312 (4%)

Query: 47  HEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRA 106
           HE+WMA++GR Y D  EK  R ++F  N  +I+  N+ GNRTY LG N FSDLTN+EF  
Sbjct: 41  HERWMAKYGRVYADAAEKLRRQEVFAANARHIDAVNRAGNRTYTLGLNHFSDLTNEEFAQ 100

Query: 107 LYTGYK-MPSPS----HRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCW 161
            + GY+  P P       S+ ++     +  +   P S+DWR +GAVTP+K+Q  CG CW
Sbjct: 101 THLGYRHQPGPGGLRPEDSSPAAAVNVTDAQLQSTPDSVDWRARGAVTPVKHQGHCGSCW 160

Query: 162 AFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATED 221
           AFAAVAA EG+ +I +GNLI +SEQQ+LDC T G + C  G    A  YI  + G+ TE 
Sbjct: 161 AFAAVAATEGLVQIATGNLISMSEQQVLDC-TGGTSSCKSGYVNAALTYITASGGLQTEA 219

Query: 222 EYPYQAVPGTC---SAAQKPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQ 278
            Y Y A  G C    A+   AAA   +   + +GDE AL   V+ QPV++A+ A   +F 
Sbjct: 220 AYAYSAEQGACRSGGASPNSAAAVGVHRSAMLNGDEGALQVLVAGQPVAVAVEA-EPDFH 278

Query: 279 SYKEGIFNG--VCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRDEGL 336
            YK G++ G   CG +L HAVT+VG+G   DG  YW++KN WG  WG+ GYM++ R  G 
Sbjct: 279 HYKSGVYVGSPSCGQKLHHAVTVVGYGADGDGQGYWVVKNQWGAGWGEVGYMRLTRGNGG 338

Query: 337 --CGIGTRSSYP 346
             CG+ T + YP
Sbjct: 339 NNCGMATHAYYP 350


>gi|115461226|ref|NP_001054213.1| Os04g0670500 [Oryza sativa Japonica Group]
 gi|62510688|sp|Q7XR52.2|CYSP1_ORYSJ RecName: Full=Cysteine protease 1; AltName: Full=OsCP1; Flags:
           Precursor
 gi|38345300|emb|CAE02828.2| OSJNBa0043A12.33 [Oryza sativa Japonica Group]
 gi|113565784|dbj|BAF16127.1| Os04g0670500 [Oryza sativa Japonica Group]
 gi|215741575|dbj|BAG98070.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 490

 Score =  275 bits (703), Expect = 3e-71,   Method: Compositional matrix adjust.
 Identities = 139/295 (47%), Positives = 194/295 (65%), Gaps = 14/295 (4%)

Query: 63  EKEMRLKIFKENLEYIEKANKEGNRT--YKLGTNQFSDLTNDEFRALYTGYKMPSPSHRS 120
           E E R ++F +NL++++  N   +    ++LG N+F+DLTN EFRA Y G   P+   R 
Sbjct: 84  EHERRFRVFWDNLKFVDAHNARADERGGFRLGMNRFADLTNGEFRATYLG-TTPAGRGRR 142

Query: 121 TTSSTFKYQNLSMTDVPTSLDWRDKGAVT-PIKNQKECGCCWAFAAVAAVEGITKIRSGN 179
              +   Y++  +  +P S+DWRDKGAV  P+KNQ +CG CWAF+AVAAVEGI KI +G 
Sbjct: 143 VGEA---YRHDGVEALPDSVDWRDKGAVVAPVKNQGQCGSCWAFSAVAAVEGINKIVTGE 199

Query: 180 LIQLSEQQLLDCSTNG-NNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKP 238
           L+ LSEQ+L++C+ NG N+GC GG  + AFA+I +N G+ TE++YPY A+ G C+ A++ 
Sbjct: 200 LVSLSEQELVECARNGQNSGCNGGIMDDAFAFIARNGGLDTEEDYPYTAMDGKCNLAKRS 259

Query: 239 -AAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAV 297
                I  +E+VP  DE +L KAV+ QPVS+AI A   EFQ Y  G+F G CGT LDH V
Sbjct: 260 RKVVSIDGFEDVPENDELSLQKAVAHQPVSVAIDAGGREFQLYDSGVFTGRCGTNLDHGV 319

Query: 298 TIVGFGT-TEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPL 347
             VG+GT    GA YW ++NSWG  WG+ GY+++ R+     G CGI   +SYP+
Sbjct: 320 VAVGYGTDAATGAAYWTVRNSWGPDWGENGYIRMERNVTARTGKCGIAMMASYPI 374


>gi|57118007|gb|AAW34135.1| cysteine protease gp2b [Zingiber officinale]
          Length = 379

 Score =  275 bits (702), Expect = 3e-71,   Method: Compositional matrix adjust.
 Identities = 144/332 (43%), Positives = 210/332 (63%), Gaps = 14/332 (4%)

Query: 24  LLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANK 83
           L +S     V  RS  E  V  ++ +W A++  + K     E RL++FKENL++++K N 
Sbjct: 30  LTLSKQGGAVPVRSDEE--VRMLYLEWRAKNHPAEKYLDLNEYRLEVFKENLQFVDKHNA 87

Query: 84  EGNR---TYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSS-TFKYQNLSMTDVPTS 139
             +R   T++LG N+F+DLTN+E+R  +   +  S   RS +   + +Y+     D+P S
Sbjct: 88  AADRGEHTFRLGMNRFADLTNEEYRTRF--LRDFSRLRRSASGKISSRYRLREGDDLPDS 145

Query: 140 LDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGC 199
           +DWR+KGAV P+KNQ  CG CWAF+ VAAVEGI +I +G+LI LSEQQL+DC+T  N+GC
Sbjct: 146 IDWREKGAVVPVKNQGGCGSCWAFSTVAAVEGINQIVTGDLISLSEQQLVDCTT-ANHGC 204

Query: 200 LGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLK 259
            GG    AF +I+ N GI +E+ YPY+   G C++        I +YE VPS +EQ+L K
Sbjct: 205 RGGWMNPAFQFIVNNGGINSEETYPYRGQNGICNSTVNAPVVSIDSYENVPSHNEQSLQK 264

Query: 260 AVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWG 319
           AV+ QPVS+ + A   +FQ Y+ GIF G C    +HA+T+VG+GT  D  +Y  +KNSWG
Sbjct: 265 AVANQPVSVTMDAAGRDFQLYRSGIFTGSCNISANHALTVVGYGTEND-KDYRTVKNSWG 323

Query: 320 NTWGDAGYMKIVRD----EGLCGIGTRSSYPL 347
             WG++GY+++ R+     G CGI   +SYP+
Sbjct: 324 KNWGESGYIRVERNIGNPNGKCGITRFASYPV 355


>gi|302142276|emb|CBI19479.3| unnamed protein product [Vitis vinifera]
          Length = 388

 Score =  274 bits (700), Expect = 5e-71,   Method: Compositional matrix adjust.
 Identities = 142/308 (46%), Positives = 193/308 (62%), Gaps = 41/308 (13%)

Query: 46  IHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFR 105
           ++E W+A+HG+SY    EKE R +IFK+NL +I++ N E NRTYK+ +++++    D   
Sbjct: 3   VYEAWLAKHGKSYNALGEKERRFQIFKDNLRFIDEHNAE-NRTYKI-SDRYAFRVGDS-- 58

Query: 106 ALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAA 165
                                         +P S+DWR KGAV  +K+Q  CG CWAF+ 
Sbjct: 59  ------------------------------LPESVDWRKKGAVVEVKDQGSCGSCWAFST 88

Query: 166 VAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPY 225
           +AAVEGI KI +G LI LSEQ+L+DC T+ N GC GG  + AF +II N GI +E++YPY
Sbjct: 89  IAAVEGINKIVTGGLISLSEQELVDCDTSYNEGCNGGLMDYAFEFIINNGGIDSEEDYPY 148

Query: 226 QAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGI 284
           +A  G C   +K A    I  YE+VP  DE++L KAV+ QPVS+AI A   EFQ Y+ GI
Sbjct: 149 KASDGRCDQYRKNAKVVTIDGYEDVPENDEKSLEKAVANQPVSVAIEAGGREFQLYQSGI 208

Query: 285 FNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD-----EGLCGI 339
           F G CGT LDH VT VG+G TE+G +YW++KNSWG +WG+ GY+++ RD      G CGI
Sbjct: 209 FTGRCGTALDHGVTAVGYG-TENGVDYWIVKNSWGASWGEEGYIRMERDLATSATGKCGI 267

Query: 340 GTRSSYPL 347
              +SYP+
Sbjct: 268 AMEASYPI 275


>gi|242094000|ref|XP_002437490.1| hypothetical protein SORBIDRAFT_10g028000 [Sorghum bicolor]
 gi|241915713|gb|EER88857.1| hypothetical protein SORBIDRAFT_10g028000 [Sorghum bicolor]
          Length = 372

 Score =  274 bits (700), Expect = 6e-71,   Method: Compositional matrix adjust.
 Identities = 135/316 (42%), Positives = 198/316 (62%), Gaps = 12/316 (3%)

Query: 40  EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKE---GNRTYKLGTNQF 96
           +  V  ++E W ++HG  +  +    +RL++F++NL YI+  N E   G  T++LG   F
Sbjct: 45  DDEVRRMYEAWKSEHGHGHGSD--DRLRLEVFRDNLRYIDAHNAEADAGLHTFRLGLTPF 102

Query: 97  SDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKE 156
           +DLT +E+R    G++          S +         D+P ++DWR+ GAVT +KNQ++
Sbjct: 103 ADLTLEEYRGRALGFRARRGGASRVGSGSSYRPRPRGGDLPDAIDWRELGAVTGVKNQEQ 162

Query: 157 CGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQG 216
           CG CWAF+AVAA+EGI +I +GNL+ LSEQ+++DC T  + GC GG  + AF ++I N G
Sbjct: 163 CGGCWAFSAVAAIEGINEIVTGNLVSLSEQEIIDCDTQ-DGGCNGGEMQNAFQFVINNGG 221

Query: 217 IATEDEYPYQAVPGTCSAAQ-KPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYST 275
           I TE +YPY      C A +       I  +  V + +E AL +AV+ QPVS+AI A   
Sbjct: 222 IDTEADYPYLGTDAACDANRVNERVVTIDGFVSVATENETALQEAVANQPVSVAIDASGR 281

Query: 276 EFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD-- 333
           +FQ Y  GIFNG CGTQLDH VT VG+G +E+G +YW++KNSW ++WG+AGY++I R+  
Sbjct: 282 KFQHYTSGIFNGPCGTQLDHGVTAVGYG-SENGKDYWIVKNSWSSSWGEAGYIRIRRNVA 340

Query: 334 --EGLCGIGTRSSYPL 347
              G CGI   +SYP+
Sbjct: 341 AATGKCGIAMDASYPV 356


>gi|384253406|gb|EIE26881.1| hypothetical protein COCSUDRAFT_21961 [Coccomyxa subellipsoidea
           C-169]
          Length = 481

 Score =  273 bits (698), Expect = 8e-71,   Method: Compositional matrix adjust.
 Identities = 136/306 (44%), Positives = 196/306 (64%), Gaps = 13/306 (4%)

Query: 50  WMAQHGRSYKDELEK-EMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALY 108
           W+    ++YKD +E+ E +  ++ +NLE++   N E + T+KLG   F+DLT+DE+R   
Sbjct: 51  WVEHLQKAYKDNVEEYERKFSVWLDNLEFVHSHN-EKDSTFKLGLTNFADLTHDEYRQHA 109

Query: 109 TGYKMPSPSHRSTTSSTFKYQNLSMTD--VPTSLDWRDKGAVTPIKNQKECGCCWAFAAV 166
            GY+   P  + T   T K       D   P S+DWR KGAVT +KNQ++CG CWAF+  
Sbjct: 110 LGYR---PELKGTGLGTGKSTGFQYADYEAPPSIDWRKKGAVTDVKNQQQCGSCWAFSTT 166

Query: 167 AAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQ 226
            +VEG   I SG L+ LSEQ+L+DC    ++GC GG  + AF++II+N GI TE +Y Y+
Sbjct: 167 GSVEGANAIYSGELVSLSEQELVDCDVTQDHGCHGGLMDFAFSFIIRNGGIDTEKDYKYK 226

Query: 227 AVPGTCS-AAQKPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIF 285
           A  G C+ A +K     I +YE+VP  DE AL KA + QP+S+AI A   EFQ Y  G+F
Sbjct: 227 AQDGVCNIAKEKRHVVTIDSYEDVPPNDESALKKAAANQPISVAIEADQREFQLYAGGVF 286

Query: 286 NGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR----DEGLCGIGT 341
           +  CGT LDH V +VG+G +++G +YW++KNSWG+ WGD+GY+++ R      G CGI  
Sbjct: 287 DAPCGTALDHGVLVVGYG-SDNGTDYWIVKNSWGDFWGDSGYIRLARGISNSAGQCGIAM 345

Query: 342 RSSYPL 347
           ++SYP+
Sbjct: 346 QASYPI 351


>gi|297843784|ref|XP_002889773.1| hypothetical protein ARALYDRAFT_471096 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297335615|gb|EFH66032.1| hypothetical protein ARALYDRAFT_471096 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 439

 Score =  273 bits (698), Expect = 8e-71,   Method: Compositional matrix adjust.
 Identities = 139/311 (44%), Positives = 198/311 (63%), Gaps = 13/311 (4%)

Query: 45  EIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEF 104
           E+ + W  +HG++Y  E E++ R++IFK+N +++ + N   N TY L  N F+DLT+ EF
Sbjct: 30  ELFDDWCQRHGKTYGSEEERQQRIQIFKDNHDFVTQHNLITNATYSLSLNAFADLTHHEF 89

Query: 105 RALYTGYKMPSPSHRSTTSSTFKYQNLS-MTDVPTSLDWRDKGAVTPIKNQKECGCCWAF 163
           +A   G  + + S    +    K Q+L     VP S+DWR KGAVT +K+Q  CG CW+F
Sbjct: 90  KASRLGLSVSASSLIMAS----KGQSLGGNAKVPDSVDWRKKGAVTNVKDQGSCGACWSF 145

Query: 164 AAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEY 223
           +A  A+EGI +I +G+LI LSEQ+L+DC  + N GC GG  + AF ++I+N GI TE +Y
Sbjct: 146 SATGAMEGINQIVTGDLISLSEQELIDCDKSYNAGCNGGLMDYAFEFVIKNHGIDTEKDY 205

Query: 224 PYQAVPGTCSAAQ-KPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKE 282
           PYQ   GTC   + K     I +Y  V S DE+AL +AV+ QPVS+ I      FQ Y  
Sbjct: 206 PYQERDGTCKKDKLKQKVVTIDSYAGVKSNDEKALREAVAAQPVSVGICGSERAFQLYSR 265

Query: 283 --GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGL 336
             GIF+G C T LDHAV IVG+G +++G +YW++KNSWG +WG  G+M + R+    EG+
Sbjct: 266 VSGIFSGPCSTSLDHAVLIVGYG-SQNGVDYWIVKNSWGKSWGMDGFMHMQRNTGNSEGI 324

Query: 337 CGIGTRSSYPL 347
           CGI   +SYP+
Sbjct: 325 CGINMLASYPI 335


>gi|357154164|ref|XP_003576692.1| PREDICTED: vignain-like [Brachypodium distachyon]
          Length = 427

 Score =  273 bits (698), Expect = 8e-71,   Method: Compositional matrix adjust.
 Identities = 136/309 (44%), Positives = 188/309 (60%), Gaps = 11/309 (3%)

Query: 48  EKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRAL 107
           E+WM +HGR+Y +  EK+ R +++KENL  IE+ N  G   Y L  N+F+DLTN+EFRA 
Sbjct: 120 EQWMGKHGRAYANGGEKQRRFEVYKENLALIEEFNS-GGHGYTLTDNKFADLTNEEFRAK 178

Query: 108 YTGYKMPSPSHRSTTSSTFKYQNL----SMTDVPTSLDWRDKGAVTPIKNQKECGCCWAF 163
             G     P  R           L    + TD+P  +DWR KGAV  +KNQ  CG CWAF
Sbjct: 179 MLGGLGADPDRRRRARHASNALELPGNDNSTDLPKDVDWRKKGAVVEVKNQGSCGSCWAF 238

Query: 164 AAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEY 223
           +AVAA+EG+ +I++G L+ LSEQ+L+DC      GC GG    AF +++ N G+ TE  Y
Sbjct: 239 SAVAAMEGLNQIKNGKLVSLSEQELVDCDAEAV-GCAGGFMSWAFEFVMANHGLTTEASY 297

Query: 224 PYQAVPGTCSAAQ-KPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKE 282
           PY+ + G C  A+   ++  I+ Y  V    E  LLK  ++QPVS+A+ A    FQ Y  
Sbjct: 298 PYKGINGACQTAKLNESSVSITGYVNVTVNSEAELLKVAAVQPVSVAVDAGGFLFQLYAG 357

Query: 283 GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRDE----GLCG 338
           G+F+G C  Q++H VT+VG+G T+    YW++KNSWG  WG+AGYM + RD     GLCG
Sbjct: 358 GVFSGPCTAQINHGVTVVGYGETDKAEKYWIVKNSWGPEWGEAGYMLMQRDAGVPTGLCG 417

Query: 339 IGTRSSYPL 347
           I   +SYP+
Sbjct: 418 IAMLASYPV 426


>gi|386648114|gb|AFJ15104.1| mexicain-like cystein protease, partial [Jacaratia mexicana]
          Length = 323

 Score =  273 bits (698), Expect = 9e-71,   Method: Compositional matrix adjust.
 Identities = 143/312 (45%), Positives = 199/312 (63%), Gaps = 16/312 (5%)

Query: 41  QSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLT 100
           + +V + E W  ++ + YK+  EK  R +IFK+NL YI++ NK+ N +Y LG N+F+DLT
Sbjct: 16  ERLVRLFESWTLENDKIYKNIDEKIYRFEIFKDNLMYIDETNKK-NSSYWLGLNEFADLT 74

Query: 101 NDEFRALYTG-YKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGC 159
           +DEF+A Y G     S     +    F Y+++   D P S+DWR KGAVTP+KNQ  CG 
Sbjct: 75  HDEFKAKYVGSLGEDSTIIEQSDDEEFPYKHV--VDYPESIDWRQKGAVTPVKNQNPCGS 132

Query: 160 CWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIAT 219
           CWAF+ VA VEGI KI +G LI LSEQ+LLDC    ++GC GG +  +  Y+  N G+ T
Sbjct: 133 CWAFSTVATVEGINKIVTGKLISLSEQELLDCDRR-SHGCKGGYQTTSLQYVADN-GVHT 190

Query: 220 EDEYPYQAVPGTCSAAQKPAA-AKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQ 278
           E EYPY+   G C A  K  +  KI+ Y+ VP+ +E +L++A++ QPVS+ + +    FQ
Sbjct: 191 EKEYPYEKKQGKCRAKDKKGSKVKITGYKRVPANNEVSLIQAIANQPVSVVVESKGRAFQ 250

Query: 279 SYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR----DE 334
            YK GIF G CGT++DHAVT VG+     G NY LIKNSWG  WG+ GY++I R     +
Sbjct: 251 FYKGGIFEGPCGTKVDHAVTAVGY-----GKNYILIKNSWGPKWGEKGYIRIKRASGKSK 305

Query: 335 GLCGIGTRSSYP 346
           G CG+ + S +P
Sbjct: 306 GTCGVYSSSYFP 317


>gi|357160095|ref|XP_003578656.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP2-like
           [Brachypodium distachyon]
          Length = 377

 Score =  273 bits (697), Expect = 1e-70,   Method: Compositional matrix adjust.
 Identities = 147/334 (44%), Positives = 196/334 (58%), Gaps = 31/334 (9%)

Query: 41  QSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKE--GNRTYKLGTNQFSD 98
           Q++    ++W A+HGR+Y    E+  RL+++  N+ YIE AN +     TY+LG   ++D
Sbjct: 47  QTMAPRFQRWKAEHGRAYATRDEELRRLRVYARNVRYIEAANGDPAAGLTYQLGETAYTD 106

Query: 99  LTNDEFRALYTGYKMPSP---SHRSTTSSTFK---------------YQNLSMTDVPTSL 140
           LT DEF A+YT    PSP   +H    +                   Y N+S    P S+
Sbjct: 107 LTADEFTAMYTS---PSPVLSAHDDEAAGAMMITTRAGAVDAGGQQVYFNVSTAGAPASV 163

Query: 141 DWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCL 200
           DWR KGAVT +KNQ  CG CWAF+ VA VEGI +IR+GNLI LSEQ+L+DC T  + GC 
Sbjct: 164 DWRAKGAVTEVKNQGRCGSCWAFSTVAVVEGIHQIRTGNLISLSEQELVDCDTL-DYGCD 222

Query: 201 GGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLK 259
           GG    A  +I  N GIATE +YPY    G C A + P  AA IS +  V +  E +L  
Sbjct: 223 GGVSYHALEWIASNGGIATEADYPYTGKDGACVANKLPLHAAAISGFARVATRSEPSLAN 282

Query: 260 AVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIV-GFGTTEDGANYWLIKNSW 318
           AV+ QPV+++I A    FQ Y +G++NG CGT+L+H VT+V       DG  YW++KNSW
Sbjct: 283 AVAAQPVAVSIEAGGANFQHYVKGVYNGPCGTRLNHGVTVVGYGEEEGDGEKYWIVKNSW 342

Query: 319 GNTWGDAGYMKIVRD-----EGLCGIGTRSSYPL 347
           G  WGD GY ++ +D     EGLCGI  R S+PL
Sbjct: 343 GKKWGDGGYFRMKKDVAGKPEGLCGIAIRPSFPL 376


>gi|226499806|ref|NP_001151335.1| cysteine protease 1 [Zea mays]
 gi|195645896|gb|ACG42416.1| cysteine protease 1 precursor [Zea mays]
          Length = 258

 Score =  273 bits (697), Expect = 1e-70,   Method: Compositional matrix adjust.
 Identities = 140/261 (53%), Positives = 183/261 (70%), Gaps = 12/261 (4%)

Query: 94  NQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPT---SLDWRDKGAVTP 150
           N+F+D+TNDEF A+YTG + P P+     +  FKY N++++D      ++DWR KGAVT 
Sbjct: 4   NEFADMTNDEFMAMYTGLR-PVPAGAKKMAG-FKYGNVTLSDADDDQQTVDWRQKGAVTG 61

Query: 151 IKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAY 210
           IK+Q++CGCCWAFAAVAAVEGI +I +GNL+ LSEQQ+LDC T+GNNGC GG  + AF Y
Sbjct: 62  IKDQRQCGCCWAFAAVAAVEGIHQITTGNLVSLSEQQVLDCDTDGNNGCNGGYIDNAFQY 121

Query: 211 IIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAI 270
           I+ N G+ATED YPY A    C + Q  AA  IS Y++VPSGDE AL  AV+ QPVS+AI
Sbjct: 122 IVGNGGLATEDAYPYTAAQAMCQSVQPVAA--ISGYQDVPSGDEAALAAAVANQPVSVAI 179

Query: 271 AAYSTEFQSYKEGIFNGV-CGT--QLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGY 327
            A++  FQ Y  G+     C T   L+HAVT VG+GT EDG  YWL+KN WG  WG+ GY
Sbjct: 180 DAHN--FQLYGGGVMTAASCSTPPNLNHAVTAVGYGTAEDGTPYWLLKNQWGQNWGEGGY 237

Query: 328 MKIVRDEGLCGIGTRSSYPLA 348
           +++ R    CG+  ++SYP+A
Sbjct: 238 LRLERGANACGVAQQASYPVA 258


>gi|24653514|ref|NP_523735.2| cysteine proteinase-1, isoform C [Drosophila melanogaster]
 gi|118572624|sp|Q95029.2|CATL_DROME RecName: Full=Cathepsin L; AltName: Full=Cysteine proteinase 1;
           Contains: RecName: Full=Cathepsin L heavy chain;
           Contains: RecName: Full=Cathepsin L light chain; Flags:
           Precursor
 gi|21627209|gb|AAM68565.1| cysteine proteinase-1, isoform C [Drosophila melanogaster]
          Length = 371

 Score =  273 bits (697), Expect = 1e-70,   Method: Compositional matrix adjust.
 Identities = 141/316 (44%), Positives = 195/316 (61%), Gaps = 14/316 (4%)

Query: 46  IHEKWMA---QHGRSYKDELEKEMRLKIFKENLEYIEKANK---EGNRTYKLGTNQFSDL 99
           + E+W     +H ++Y+DE E+  RLKIF EN   I K N+   EG  ++KL  N+++DL
Sbjct: 55  VMEEWHTFKLEHRKNYQDETEERFRLKIFNENKHKIAKHNQRFAEGKVSFKLAVNKYADL 114

Query: 100 TNDEFRALYTGYKMPSPSHRSTTSSTFK---YQNLSMTDVPTSLDWRDKGAVTPIKNQKE 156
            + EFR L  G+             +FK   + + +   +P S+DWR KGAVT +K+Q  
Sbjct: 115 LHHEFRQLMNGFNYTLHKQLRAADESFKGVTFISPAHVTLPKSVDWRTKGAVTAVKDQGH 174

Query: 157 CGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GNNGCLGGSREKAFAYIIQNQ 215
           CG CWAF++  A+EG    +SG L+ LSEQ L+DCST  GNNGC GG  + AF YI  N 
Sbjct: 175 CGSCWAFSSTGALEGQHFRKSGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKDNG 234

Query: 216 GIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAV-SMQPVSIAIAAYS 274
           GI TE  YPY+A+  +C   +    A    + ++P GDE+ + +AV ++ PVS+AI A  
Sbjct: 235 GIDTEKSYPYEAIDDSCHFNKGTVGATDRGFTDIPQGDEKKMAEAVATVGPVSVAIDASH 294

Query: 275 TEFQSYKEGIFN-GVCGTQ-LDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR 332
             FQ Y EG++N   C  Q LDH V +VGFGT E G +YWL+KNSWG TWGD G++K++R
Sbjct: 295 ESFQFYSEGVYNEPQCDAQNLDHGVLVVGFGTDESGEDYWLVKNSWGTTWGDKGFIKMLR 354

Query: 333 D-EGLCGIGTRSSYPL 347
           + E  CGI + SSYPL
Sbjct: 355 NKENQCGIASASSYPL 370


>gi|357166364|ref|XP_003580686.1| PREDICTED: oryzain alpha chain-like [Brachypodium distachyon]
          Length = 360

 Score =  273 bits (697), Expect = 1e-70,   Method: Compositional matrix adjust.
 Identities = 142/332 (42%), Positives = 213/332 (64%), Gaps = 18/332 (5%)

Query: 25  LVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKAN-- 82
           + S + Q+ S     E+    ++ +W AQHG    +E  +E R + F++NL YI++ N  
Sbjct: 26  IASSSGQIRS-----EEETRRMYAEWTAQHGSPITNE--EEGRYEAFRDNLRYIDEHNAA 78

Query: 83  -KEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLD 141
              G  +++LG N+F+ LTN+E+RA Y G ++ S +       + +Y+      +P S+D
Sbjct: 79  ADAGIHSFRLGLNRFAGLTNEEYRAAYLGLRLRSGAVGDLRKPSARYEAADGEALPESVD 138

Query: 142 WRDKGAVTPIKNQ-KECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCL 200
           WR+KGAV  +K+Q + CG  WAF+A+AAVE I +I +G LI LSEQ+L+DC T+ N GC 
Sbjct: 139 WREKGAVGKVKDQGRSCGSAWAFSAIAAVESINQIVTGELISLSEQELMDCDTSYNAGCD 198

Query: 201 GGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQK-PAAAKISNYEEVPSGDEQALLK 259
           GG  + AF +II N GI T+++YPY+A   +C A ++   A  I +YE++   +E++L K
Sbjct: 199 GGLMDDAFEFIISNGGIDTDEDYPYKARNDSCDANKRNRKAVTIDDYEDLRM-NEKSLQK 257

Query: 260 AVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWG 319
           AVS QPVS+AI A   +FQ YK GIF G CGT LDHA TIVG+G +E+G +YW++K S+G
Sbjct: 258 AVSNQPVSVAIEAGGRDFQLYKSGIFTGTCGTDLDHATTIVGYG-SENGTDYWIVKESYG 316

Query: 320 NTWGDAGYMKIVRD----EGLCGIGTRSSYPL 347
            +WG++GY ++ R+     G CGI    SYP+
Sbjct: 317 TSWGESGYARMERNIKETSGKCGIAMLPSYPV 348


>gi|242049716|ref|XP_002462602.1| hypothetical protein SORBIDRAFT_02g028840 [Sorghum bicolor]
 gi|241925979|gb|EER99123.1| hypothetical protein SORBIDRAFT_02g028840 [Sorghum bicolor]
          Length = 384

 Score =  273 bits (697), Expect = 1e-70,   Method: Compositional matrix adjust.
 Identities = 147/361 (40%), Positives = 204/361 (56%), Gaps = 61/361 (16%)

Query: 43  VVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTND 102
           ++E  E+WM +HGR Y D  EK+ RL++++ N+  +E  N   N  Y+L  N+F+DLTN+
Sbjct: 28  MLERFEQWMGRHGRLYADAGEKQRRLEVYRRNVALVETFNSMSNGGYRLADNKFADLTNE 87

Query: 103 EFRALYTGYKMPSPSHRSTTSSTF-------------KYQNLSMTDVPTSLDWRDKGAVT 149
           EFRA   G+  P P  R+T  +T              +Y +    ++P S+DWR+KGAV 
Sbjct: 88  EFRAKMLGFGRPPPHGRATGHTTTPGTVACIGSGLGRRYSD----ELPKSVDWREKGAVA 143

Query: 150 PIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFA 209
           P+KNQ ECG CWAF+AVAA+EGI +I++G L+ LSEQ+L+DC T    GC GG    AF 
Sbjct: 144 PVKNQGECGSCWAFSAVAAIEGINQIKNGKLVSLSEQELVDCDTKA-IGCAGGYMSWAFE 202

Query: 210 YIIQNQGIATEDEYPYQ-----------AVPGTCS--------------AAQKP----AA 240
           +++ N G+ TE  YPYQ           A+P  C+              A Q P    +A
Sbjct: 203 FVMNNSGLTTERNYPYQGTYAHGNRKTHALPFDCTKGSSTCDSRAGMNGACQTPKLKESA 262

Query: 241 AKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIV 300
             IS Y  V +  E  LL+A + QPVS+A+ A S  +Q Y  G+F G C   L+H VT+V
Sbjct: 263 VSISGYVNVTASSEPDLLRAAAAQPVSVAVDAGSFVWQLYGGGVFTGPCTADLNHGVTVV 322

Query: 301 GFGTTE----------DGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYP 346
           G+G T+           G  YW++KNSWG  WGDAGY+ + R+     GLCGI    SYP
Sbjct: 323 GYGETQRDTDGDGTGVPGQKYWIVKNSWGPEWGDAGYILMQREASVASGLCGIALLPSYP 382

Query: 347 L 347
           +
Sbjct: 383 V 383


>gi|255522980|gb|ACU12382.1| RE21773p [Drosophila melanogaster]
          Length = 375

 Score =  273 bits (697), Expect = 1e-70,   Method: Compositional matrix adjust.
 Identities = 141/316 (44%), Positives = 195/316 (61%), Gaps = 14/316 (4%)

Query: 46  IHEKWMA---QHGRSYKDELEKEMRLKIFKENLEYIEKANK---EGNRTYKLGTNQFSDL 99
           + E+W     +H ++Y+DE E+  RLKIF EN   I K N+   EG  ++KL  N+++DL
Sbjct: 59  VMEEWHTFKLEHRKNYQDETEERFRLKIFNENKHKIAKHNQRFAEGKVSFKLAVNKYADL 118

Query: 100 TNDEFRALYTGYKMPSPSHRSTTSSTFK---YQNLSMTDVPTSLDWRDKGAVTPIKNQKE 156
            + EFR L  G+             +FK   + + +   +P S+DWR KGAVT +K+Q  
Sbjct: 119 LHHEFRQLMNGFNYTLHKQLRAADESFKGVTFISPAHVTLPKSVDWRTKGAVTAVKDQGH 178

Query: 157 CGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GNNGCLGGSREKAFAYIIQNQ 215
           CG CWAF++  A+EG    +SG L+ LSEQ L+DCST  GNNGC GG  + AF YI  N 
Sbjct: 179 CGSCWAFSSTGALEGQHFRKSGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKDNG 238

Query: 216 GIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAV-SMQPVSIAIAAYS 274
           GI TE  YPY+A+  +C   +    A    + ++P GDE+ + +AV ++ PVS+AI A  
Sbjct: 239 GIDTEKSYPYEAIDDSCHFNKGTVGATDRGFTDIPQGDEKKMAEAVATVGPVSVAIDASH 298

Query: 275 TEFQSYKEGIFN-GVCGTQ-LDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR 332
             FQ Y EG++N   C  Q LDH V +VGFGT E G +YWL+KNSWG TWGD G++K++R
Sbjct: 299 ESFQFYSEGVYNEPQCDAQNLDHGVLVVGFGTDESGEDYWLVKNSWGTTWGDKGFIKMLR 358

Query: 333 D-EGLCGIGTRSSYPL 347
           + E  CGI + SSYPL
Sbjct: 359 NKENQCGIASASSYPL 374


>gi|255078398|ref|XP_002502779.1| cysteine endopeptidase [Micromonas sp. RCC299]
 gi|226518045|gb|ACO64037.1| cysteine endopeptidase [Micromonas sp. RCC299]
          Length = 414

 Score =  273 bits (697), Expect = 1e-70,   Method: Compositional matrix adjust.
 Identities = 138/318 (43%), Positives = 201/318 (63%), Gaps = 17/318 (5%)

Query: 42  SVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKE---GNRTYKLGTNQFSD 98
           S+ ++  +W  +HG++Y  E EKE+RLKIF +N E+++K N E   G  T+ +G N  +D
Sbjct: 63  SLSDLFHEWTQKHGKTYDSEEEKELRLKIFADNHEFVQKHNAEYENGEHTHFVGLNHLAD 122

Query: 99  LTNDEFRALYTGYKMPSPSHRS-TTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKEC 157
           LT DEF+ +  GY     + R+   +ST++Y +++    P  +DW   GAVTP+KNQK+C
Sbjct: 123 LTKDEFKKML-GYNAALRASRAPVDASTWEYADVT---PPEEIDWVASGAVTPVKNQKQC 178

Query: 158 GCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGI 217
           G CWAF+   AVEG+  I++G LI LSE++L+ CSTNGN GC GG  +  F +I+ N+GI
Sbjct: 179 GSCWAFSTTGAVEGVNAIKTGKLISLSEEELISCSTNGNMGCNGGLMDNGFEWIVNNRGI 238

Query: 218 ATEDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTE 276
            TED + Y A    C   ++   A  I  +++VPS DE +L+KAVS QPVS+AI A    
Sbjct: 239 DTEDGWEYVAKEEKCGFFRRHHRAVAIDGFKDVPSNDEDSLMKAVSQQPVSVAIEADHQS 298

Query: 277 FQSYKEGIFNGV-CGTQLDHAVTIVGFGT---TEDGANYWLIKNSWGNTWGDAGYMKIVR 332
           FQ Y  G+++   CGT+LDH V +VG+G    +    ++W IKNSWG  WG+ GY++I +
Sbjct: 299 FQLYAGGVYSAKDCGTELDHGVLLVGYGVDPKSTKHKHFWKIKNSWGPAWGEDGYIRIAK 358

Query: 333 D----EGLCGIGTRSSYP 346
                EG CG+  + SYP
Sbjct: 359 GGSGVEGQCGVAMQPSYP 376


>gi|24653516|ref|NP_725347.1| cysteine proteinase-1, isoform A [Drosophila melanogaster]
 gi|24653518|ref|NP_725348.1| cysteine proteinase-1, isoform B [Drosophila melanogaster]
 gi|1658527|gb|AAB18345.1| cysteine proteinase 1 [Drosophila melanogaster]
 gi|2305221|gb|AAB65749.1| cysteine proteinase-1 [Drosophila melanogaster]
 gi|7303249|gb|AAF58311.1| cysteine proteinase-1, isoform A [Drosophila melanogaster]
 gi|21627210|gb|AAM68566.1| cysteine proteinase-1, isoform B [Drosophila melanogaster]
 gi|54650754|gb|AAV36956.1| LP06554p [Drosophila melanogaster]
 gi|220951982|gb|ACL88534.1| Cp1-PA [synthetic construct]
          Length = 341

 Score =  272 bits (696), Expect = 1e-70,   Method: Compositional matrix adjust.
 Identities = 141/316 (44%), Positives = 195/316 (61%), Gaps = 14/316 (4%)

Query: 46  IHEKWMA---QHGRSYKDELEKEMRLKIFKENLEYIEKANK---EGNRTYKLGTNQFSDL 99
           + E+W     +H ++Y+DE E+  RLKIF EN   I K N+   EG  ++KL  N+++DL
Sbjct: 25  VMEEWHTFKLEHRKNYQDETEERFRLKIFNENKHKIAKHNQRFAEGKVSFKLAVNKYADL 84

Query: 100 TNDEFRALYTGYKMPSPSHRSTTSSTFK---YQNLSMTDVPTSLDWRDKGAVTPIKNQKE 156
            + EFR L  G+             +FK   + + +   +P S+DWR KGAVT +K+Q  
Sbjct: 85  LHHEFRQLMNGFNYTLHKQLRAADESFKGVTFISPAHVTLPKSVDWRTKGAVTAVKDQGH 144

Query: 157 CGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GNNGCLGGSREKAFAYIIQNQ 215
           CG CWAF++  A+EG    +SG L+ LSEQ L+DCST  GNNGC GG  + AF YI  N 
Sbjct: 145 CGSCWAFSSTGALEGQHFRKSGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKDNG 204

Query: 216 GIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAV-SMQPVSIAIAAYS 274
           GI TE  YPY+A+  +C   +    A    + ++P GDE+ + +AV ++ PVS+AI A  
Sbjct: 205 GIDTEKSYPYEAIDDSCHFNKGTVGATDRGFTDIPQGDEKKMAEAVATVGPVSVAIDASH 264

Query: 275 TEFQSYKEGIFN-GVCGTQ-LDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR 332
             FQ Y EG++N   C  Q LDH V +VGFGT E G +YWL+KNSWG TWGD G++K++R
Sbjct: 265 ESFQFYSEGVYNEPQCDAQNLDHGVLVVGFGTDESGEDYWLVKNSWGTTWGDKGFIKMLR 324

Query: 333 D-EGLCGIGTRSSYPL 347
           + E  CGI + SSYPL
Sbjct: 325 NKENQCGIASASSYPL 340


>gi|162459488|ref|NP_001105571.1| maize insect resistance1 precursor [Zea mays]
 gi|5731354|gb|AAB70820.2| cysteine protease Mir1 [Zea mays]
          Length = 398

 Score =  272 bits (696), Expect = 2e-70,   Method: Compositional matrix adjust.
 Identities = 137/331 (41%), Positives = 205/331 (61%), Gaps = 28/331 (8%)

Query: 40  EQSVVEIHEKWMAQHGR--------------SYKDELEKEMRLKIFKENLEYIEKANKE- 84
           ++ V  ++E W ++HGR                ++E ++ +RL++F++NL YI+  N E 
Sbjct: 47  DEEVRRMYEAWKSKHGRGGSSNDDCDMAPGDDEQEEEDRRLRLEVFRDNLRYIDAHNAEA 106

Query: 85  --GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDW 142
             G  T++LG   F+DLT +E+R    G++       +   S +  +     D+P ++DW
Sbjct: 107 DAGLHTFRLGLTPFADLTLEEYRGRVLGFRARGRRSGARYGSGYSVRG---GDLPDAIDW 163

Query: 143 RDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGG 202
           R  GAVT +K+Q++CG CWAF+AVAA+EG+  I +GNL+ LSEQ+++DC    ++GC GG
Sbjct: 164 RQLGAVTEVKDQQQCGGCWAFSAVAAIEGVNAIATGNLVSLSEQEIIDCDAQ-DSGCDGG 222

Query: 203 SREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKP--AAAKISNYEEVPSGDEQALLKA 260
             E AF ++I N GI TE +YP+    GTC A+++     A I    EV S +E AL +A
Sbjct: 223 QMENAFRFVIGNGGIDTEADYPFIGTDGTCDASKEKNEKVATIDGLVEVASNNETALQEA 282

Query: 261 VSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGN 320
           V++QPVS+AI A    FQ Y  GIFNG CGT LDH VT VG+G +E G +YW++KNSW  
Sbjct: 283 VAIQPVSVAIDASGRAFQHYSSGIFNGPCGTSLDHGVTAVGYG-SESGKDYWIVKNSWSA 341

Query: 321 TWGDAGYMKIVRD----EGLCGIGTRSSYPL 347
           +WG+AGY+++ R+     G CGI   +SYP+
Sbjct: 342 SWGEAGYIRMRRNVPRPTGKCGIAMDASYPV 372


>gi|195484843|ref|XP_002090843.1| GE12574 [Drosophila yakuba]
 gi|194176944|gb|EDW90555.1| GE12574 [Drosophila yakuba]
          Length = 341

 Score =  271 bits (694), Expect = 2e-70,   Method: Compositional matrix adjust.
 Identities = 140/316 (44%), Positives = 196/316 (62%), Gaps = 14/316 (4%)

Query: 46  IHEKWMA---QHGRSYKDELEKEMRLKIFKENLEYIEKANK---EGNRTYKLGTNQFSDL 99
           + E+W     +H ++Y+D+ E+  RLKIF EN   I K N+   EG  ++KL  N+++DL
Sbjct: 25  VMEEWHTFKLEHRKNYQDDTEERFRLKIFNENKHKIAKHNQRFAEGKVSFKLAVNKYADL 84

Query: 100 TNDEFRALYTGYKMPSPSHRSTTSSTFK---YQNLSMTDVPTSLDWRDKGAVTPIKNQKE 156
            + EFR L  G+          T  +FK   + + +   +P S+DWR KGAVT +K+Q  
Sbjct: 85  LHHEFRQLMNGFNYTLHKQLRATDDSFKGVTFISPAHVTLPKSVDWRSKGAVTAVKDQGH 144

Query: 157 CGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GNNGCLGGSREKAFAYIIQNQ 215
           CG CWAF++  A+EG    +SG L+ LSEQ L+DCST  GNNGC GG  + AF YI  N 
Sbjct: 145 CGSCWAFSSTGALEGQHFRKSGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKDNG 204

Query: 216 GIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAV-SMQPVSIAIAAYS 274
           GI TE  YPY+A+  +C   +    A    + ++P GDE+ + +AV ++ PVS+AI A  
Sbjct: 205 GIDTEKSYPYEAIDDSCHFNKGTIGATDRGFTDIPQGDEKKMAEAVATVGPVSVAIDASH 264

Query: 275 TEFQSYKEGIFN-GVCGTQ-LDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR 332
             FQ Y EG++N   C  Q LDH V +VGFGT E G +YWL+KNSWG TWGD G++K++R
Sbjct: 265 ESFQFYSEGVYNEPQCDAQNLDHGVLVVGFGTDESGDDYWLVKNSWGTTWGDKGFIKMLR 324

Query: 333 D-EGLCGIGTRSSYPL 347
           + +  CGI + SSYPL
Sbjct: 325 NKDNQCGIASASSYPL 340


>gi|57118005|gb|AAW34134.1| cysteine protease gp2a [Zingiber officinale]
          Length = 381

 Score =  271 bits (694), Expect = 3e-70,   Method: Compositional matrix adjust.
 Identities = 141/333 (42%), Positives = 208/333 (62%), Gaps = 14/333 (4%)

Query: 23  TLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKAN 82
            L +S     V  RS  E  V  ++ +W  ++  + K     E RL++FKENL+++++ N
Sbjct: 31  VLTLSKQGGAVPVRSDEE--VRMLYLEWRVKNHPAEKYLDLNEYRLEVFKENLQFVDEHN 88

Query: 83  KEGNR---TYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSS-TFKYQNLSMTDVPT 138
              +R   T+ LG N+F+DLTN+E+R  +   +  S   RS +   + +Y+     D+P 
Sbjct: 89  AAADRGEHTFLLGMNRFADLTNEEYRTRF--LRDFSRLRRSASGKISSRYRLREGDDLPD 146

Query: 139 SLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNG 198
           S+DWR+ GAV P+KNQ  CG CWAF+ VAAVEGI +I +G+LI LSEQQL+DC+T  N+G
Sbjct: 147 SIDWRENGAVVPVKNQGGCGSCWAFSTVAAVEGINQIVTGDLISLSEQQLVDCTT-ANHG 205

Query: 199 CLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALL 258
           C GG    AF +I+ N GI +E+ YPY+   G C++        I +YE VPS +EQ+L 
Sbjct: 206 CRGGWMNPAFQFIVNNGGINSEETYPYRGQNGICNSTVNAPVVSIDSYENVPSHNEQSLQ 265

Query: 259 KAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSW 318
           KAV+ QPVS+ + A   +FQ Y+ GIF G C    +HA+T+VG+GT  D  ++W++KNSW
Sbjct: 266 KAVANQPVSVTMDAAGRDFQLYRSGIFTGSCNISANHALTVVGYGTEND-KDFWIVKNSW 324

Query: 319 GNTWGDAGYMKIVRD----EGLCGIGTRSSYPL 347
           G  WG++GY++  R+     G CGI   +SYP+
Sbjct: 325 GKNWGESGYIRAERNIENPNGKCGITRFASYPV 357


>gi|413943290|gb|AFW75939.1| maize insect resistance1 [Zea mays]
          Length = 435

 Score =  271 bits (693), Expect = 4e-70,   Method: Compositional matrix adjust.
 Identities = 139/332 (41%), Positives = 204/332 (61%), Gaps = 26/332 (7%)

Query: 40  EQSVVEIHEKWMAQHGR-------------SYKDELEKEMRLKIFKENLEYIEKANKE-- 84
           ++ V  ++E W ++HGR               + E ++ +RL++F++NL YI+K N E  
Sbjct: 77  DEEVRRMYEAWKSKHGRGGSSNDDCDMAPGDDEQEEDRRLRLEVFRDNLRYIDKHNAEAD 136

Query: 85  -GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTD--VPTSLD 141
            G  T++LG   F+DLT DE+R    G++  +    +       Y+        +P ++D
Sbjct: 137 AGLHTFRLGLTPFADLTLDEYRGRVLGFRARARRSGARYGHGHGYRARPRGGDLLPDAID 196

Query: 142 WRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLG 201
           WR  GAVT +K+Q++CG CWAF+AVAA+EGI  I +GNL+ LSEQ+++DC    ++GC G
Sbjct: 197 WRQLGAVTEVKDQQQCGGCWAFSAVAAIEGINAIATGNLVSLSEQEIIDCDAQ-DSGCDG 255

Query: 202 GSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQK--PAAAKISNYEEVPSGDEQALLK 259
           G  E AF ++I N GI TE +YP+    GTC A+++     A I    EV S +E AL +
Sbjct: 256 GQMENAFRFVIGNGGIDTEADYPFIGTDGTCDASKENNEKVATIDGLVEVASNNETALQE 315

Query: 260 AVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWG 319
           AV++QPVS+AI A    FQ Y  GIFNG CGT LDH VT VG+G +E G +YW++KNSW 
Sbjct: 316 AVAIQPVSVAIDASGRAFQHYSSGIFNGPCGTSLDHGVTAVGYG-SESGKDYWIVKNSWS 374

Query: 320 NTWGDAGYMKIVRD----EGLCGIGTRSSYPL 347
            +WG+AGY+++ R+     G CGI   +SYP+
Sbjct: 375 ASWGEAGYIRMRRNVPRPTGKCGIAMDASYPV 406


>gi|223946391|gb|ACN27279.1| unknown [Zea mays]
          Length = 279

 Score =  271 bits (692), Expect = 5e-70,   Method: Compositional matrix adjust.
 Identities = 131/262 (50%), Positives = 170/262 (64%), Gaps = 18/262 (6%)

Query: 99  LTNDEFRALYTGYKMPSPSHR---------STTSSTFKYQNLSMTDVPTSLDWRDKGAVT 149
           +T DEFR  Y G ++    HR         S ++S+F Y +    DVP S+DWR KGAVT
Sbjct: 1   MTADEFRRHYAGSRVAH--HRMFRGDRQGSSASASSFMYAD--ARDVPASVDWRQKGAVT 56

Query: 150 PIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFA 209
            +K+Q +CG CWAF+ +AAVEGI  I++ NL  LSEQQL+DC T  N GC GG  + AF 
Sbjct: 57  DVKDQGQCGSCWAFSTIAAVEGINAIKTKNLTSLSEQQLVDCDTKANAGCNGGLMDYAFQ 116

Query: 210 YIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVSMQPVSIA 269
           YI ++ G+A ED YPY+A   +C  +  P    I  YE+VP+ DE AL KAV+ QPVS+A
Sbjct: 117 YIAKHGGVAAEDAYPYRARQASCKKSPAPVVT-IDGYEDVPANDESALKKAVAHQPVSVA 175

Query: 270 IAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMK 329
           I A  + FQ Y EG+F+G CGT+LDH V  VG+G T DG  YWL+KNSWG  WG+ GY++
Sbjct: 176 IEASGSHFQFYSEGVFSGRCGTELDHGVAAVGYGVTADGTKYWLVKNSWGPEWGEKGYIR 235

Query: 330 IVRD----EGLCGIGTRSSYPL 347
           + RD    EG CGI   +SYP+
Sbjct: 236 MARDVAAKEGHCGIAMEASYPV 257


>gi|296090463|emb|CBI40282.3| unnamed protein product [Vitis vinifera]
          Length = 386

 Score =  270 bits (691), Expect = 6e-70,   Method: Compositional matrix adjust.
 Identities = 140/308 (45%), Positives = 193/308 (62%), Gaps = 41/308 (13%)

Query: 46  IHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFR 105
           ++E W+ +HG+SY    E+E R +IFK+NL +IE+ N   NRTYK+G +++S      FR
Sbjct: 3   VYEAWLVKHGKSYNALGERERRFEIFKDNLRFIEEHNAV-NRTYKVG-DRYS------FR 54

Query: 106 ALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAA 165
           A                            D+P S+DWR+KGAV P+K+Q  CG CWAF+ 
Sbjct: 55  A--------------------------GEDLPESVDWREKGAVVPVKDQGNCGSCWAFST 88

Query: 166 VAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPY 225
           +AAVEGI +I +G+LI LSEQ+L+DC  + N GC GG  + AF +II N GI +E++YPY
Sbjct: 89  IAAVEGINQIATGDLISLSEQELVDCDKSYNQGCNGGLMDYAFEFIINNGGIDSEEDYPY 148

Query: 226 QAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGI 284
           +A   TC   +K A    I  YE+VP  DE++L KAV+ QPVS+AI A    FQ Y+ G+
Sbjct: 149 RAADTTCDPNRKNARVVSIDGYEDVPQNDERSLKKAVANQPVSVAIEAGGRAFQLYQSGV 208

Query: 285 FNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR-----DEGLCGI 339
           F G CGTQLDH V  VG+G TE+  +YW+++NSWG  WG++GY+K+ R     + G CGI
Sbjct: 209 FTGQCGTQLDHGVVAVGYG-TENSVDYWIVRNSWGPNWGESGYIKLERNLAGTETGKCGI 267

Query: 340 GTRSSYPL 347
               SYP+
Sbjct: 268 AIEPSYPI 275


>gi|195583187|ref|XP_002081405.1| GD10995 [Drosophila simulans]
 gi|194193414|gb|EDX06990.1| GD10995 [Drosophila simulans]
          Length = 341

 Score =  270 bits (691), Expect = 6e-70,   Method: Compositional matrix adjust.
 Identities = 140/316 (44%), Positives = 195/316 (61%), Gaps = 14/316 (4%)

Query: 46  IHEKWMA---QHGRSYKDELEKEMRLKIFKENLEYIEKANK---EGNRTYKLGTNQFSDL 99
           + E+W     +H ++Y+D+ E+  RLKIF EN   I K N+   EG  ++KL  N+++DL
Sbjct: 25  VMEEWHTFKLEHRKNYQDDTEERFRLKIFNENKHKIAKHNQRFAEGKVSFKLAVNKYADL 84

Query: 100 TNDEFRALYTGYKMPSPSHRSTTSSTFK---YQNLSMTDVPTSLDWRDKGAVTPIKNQKE 156
            + EFR L  G+             +FK   + + +   +P S+DWR KGAVT +K+Q  
Sbjct: 85  LHHEFRQLMNGFNYTLHKQLRAADESFKGVTFISPAHVTLPKSVDWRTKGAVTAVKDQGH 144

Query: 157 CGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GNNGCLGGSREKAFAYIIQNQ 215
           CG CWAF++  A+EG    +SG L+ LSEQ L+DCST  GNNGC GG  + AF YI  N 
Sbjct: 145 CGSCWAFSSTGALEGQHFRKSGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKDNG 204

Query: 216 GIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAV-SMQPVSIAIAAYS 274
           GI TE  YPY+A+  +C   +    A    + ++P GDE+ + +AV ++ PVS+AI A  
Sbjct: 205 GIDTEKSYPYEAIDDSCHFNKGTIGATDRGFTDIPQGDEKKMAEAVATVGPVSVAIDASH 264

Query: 275 TEFQSYKEGIFN-GVCGTQ-LDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR 332
             FQ Y EG++N   C  Q LDH V +VGFGT E G +YWL+KNSWG TWGD G++K++R
Sbjct: 265 ESFQFYSEGVYNEPQCDAQNLDHGVLVVGFGTDESGDDYWLVKNSWGTTWGDKGFIKMLR 324

Query: 333 D-EGLCGIGTRSSYPL 347
           + E  CGI + SSYPL
Sbjct: 325 NKENQCGIASASSYPL 340


>gi|558563|emb|CAA57538.1| cysteine proteinase [Cicer arietinum]
          Length = 325

 Score =  270 bits (689), Expect = 9e-70,   Method: Compositional matrix adjust.
 Identities = 135/308 (43%), Positives = 186/308 (60%), Gaps = 9/308 (2%)

Query: 46  IHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFR 105
           ++EKW+ +H + Y    EK+ R +IFK+NL +I++ N + N +YK+G N+F+D+ N+E+R
Sbjct: 3   MYEKWLVKHQKMYNGLGEKDTRFQIFKDNLRFIDEHNAQ-NYSYKVGLNKFADINNEEYR 61

Query: 106 ALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAA 165
            +Y G K  +      T  T      +   V   +DWR KGAVT IK+Q  CG CWAF+ 
Sbjct: 62  DMYLGTKSDAKRRVMKTKITGHRITYNSVIVTVKVDWRLKGAVTHIKDQGSCGSCWAFST 121

Query: 166 VAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPY 225
           +A VE I KI +G  + LSEQ+L+DC    N GC GG  + AF +II+N GI T+ +YPY
Sbjct: 122 IATVEAINKIVTGKFVSLSEQELVDCDRAFNEGCNGGLMDYAFEFIIRNGGIDTDQDYPY 181

Query: 226 QAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGI 284
                 C   +K A    I  YE+VPS    AL KAV+ QPVS+AIA      Q Y+ G+
Sbjct: 182 NGFERKCDPTKKNAKVVSIDGYEDVPSY-MNALKKAVAHQPVSVAIAGLGRALQLYQSGV 240

Query: 285 FNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRDE-----GLCGI 339
           F G CGT LDH V +VG+G +E+G +YWL++NSWG  WG+ GY KI           CGI
Sbjct: 241 FTGKCGTDLDHGVVVVGYG-SENGVDYWLVRNSWGTNWGEDGYFKIASRNVKSLYRKCGI 299

Query: 340 GTRSSYPL 347
              +SYP+
Sbjct: 300 AMEASYPV 307


>gi|225458143|ref|XP_002280937.1| PREDICTED: cysteine proteinase RD21a [Vitis vinifera]
 gi|302142569|emb|CBI19772.3| unnamed protein product [Vitis vinifera]
          Length = 436

 Score =  270 bits (689), Expect = 1e-69,   Method: Compositional matrix adjust.
 Identities = 135/320 (42%), Positives = 197/320 (61%), Gaps = 13/320 (4%)

Query: 33  VSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLG 92
           V S  +   S  ++ E W  Q+G++Y  E EK  RLK+F+EN  ++ + N   N +Y L 
Sbjct: 15  VHSSVSEASSTADLFEAWCEQYGKTYSSEEEKASRLKVFEENHAFVTQHNSMANASYTLA 74

Query: 93  TNQFSDLTNDEFRALYTGYKMPSPSH-RSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPI 151
            N F+DLT+ EF+A   G+   SP   +S  S     Q L    VP ++DWR  GAVT +
Sbjct: 75  LNAFADLTHHEFKASRLGF---SPGRAQSIRSVGTPVQEL---HVPPAVDWRKSGAVTGV 128

Query: 152 KNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYI 211
           K+Q  CG CW+F+   A+EGI KI +G+L+ LSEQ+L+DC  + N+GC GG  + A+ ++
Sbjct: 129 KDQGNCGGCWSFSTTGAIEGINKIVTGSLVSLSEQELVDCDRSYNSGCEGGLMDYAYQFV 188

Query: 212 IQNQGIATEDEYPYQAVPGTCSAAQ-KPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAI 270
           I+NQGI +E +YPY  +   C+  + K     I  Y ++P  DE+ LL+ V+ QPVS+ I
Sbjct: 189 IKNQGIDSEADYPYVGMDKPCNKEKLKKHIVTIDGYTDIPPNDEKQLLQVVAKQPVSVGI 248

Query: 271 AAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKI 330
                 FQ Y +G++ G C + LDHAV IVG+G TEDG ++W++KNSWG  WG  GY+ +
Sbjct: 249 CGSEKTFQLYSKGVYTGPCSSTLDHAVLIVGYG-TEDGVDFWIVKNSWGEHWGMRGYIHM 307

Query: 331 VRD----EGLCGIGTRSSYP 346
           +R+    EG+CGI   +SYP
Sbjct: 308 LRNNGTAEGICGINMLASYP 327


>gi|218202389|gb|EEC84816.1| hypothetical protein OsI_31898 [Oryza sativa Indica Group]
          Length = 350

 Score =  270 bits (689), Expect = 1e-69,   Method: Compositional matrix adjust.
 Identities = 138/329 (41%), Positives = 195/329 (59%), Gaps = 21/329 (6%)

Query: 38  THEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFS 97
           T    +++  E+WM +HGR+Y D  EK+ R ++++ N+E +E  N   N  YKL  N+F+
Sbjct: 23  TRADLMLDRFEQWMIRHGRAYTDSGEKQRRFEVYRRNVELVETFNSMSN-GYKLADNKFA 81

Query: 98  DLTNDEFRALYTGYK--MPSPSHRSTTSSTFKYQNLSMTDV-PTSLDWRDKGAVTPIKNQ 154
           DLTN+EFRA   G++  +  P   +T S+       S  D+ P S+DWR KGAV  +KNQ
Sbjct: 82  DLTNEEFRAKMLGFRPHVTIPQISNTCSADIAMPGESSDDILPKSVDWRKKGAVVEVKNQ 141

Query: 155 KECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQN 214
            +CG CWAF+AVAA+EGI +I++G L+ LSEQ+L+DC      GC GG    AF +++ N
Sbjct: 142 GDCGSCWAFSAVAAIEGINQIKNGELVSLSEQELVDCDDEA-VGCGGGYMSWAFEFVVGN 200

Query: 215 QGIATEDEYPYQAVPGTCSAAQ-KPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAY 273
            G+ TE  YPY A  G C AA+   +A  I+ Y  V    E  L +A + QPVS+A+   
Sbjct: 201 HGLTTEASYPYHAANGACQAAKLNQSAVAIAGYRNVTPSSEPDLARAAAAQPVSVAVDGG 260

Query: 274 STEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGAN----------YWLIKNSWGNTWG 323
           S  FQ Y  G++ G C   ++H VT+VG+G +E   +          YW++KNSWG  WG
Sbjct: 261 SFMFQLYGSGVYTGPCTADVNHGVTVVGYGESEPKTDGGGAAKGGEKYWIVKNSWGAEWG 320

Query: 324 DAGYMKIVRD-----EGLCGIGTRSSYPL 347
           DAGY+ + RD      GLCGI    SYP+
Sbjct: 321 DAGYILMQRDVAGLASGLCGIALLPSYPV 349


>gi|195334204|ref|XP_002033774.1| GM21500 [Drosophila sechellia]
 gi|194125744|gb|EDW47787.1| GM21500 [Drosophila sechellia]
          Length = 341

 Score =  270 bits (689), Expect = 1e-69,   Method: Compositional matrix adjust.
 Identities = 139/316 (43%), Positives = 195/316 (61%), Gaps = 14/316 (4%)

Query: 46  IHEKWMA---QHGRSYKDELEKEMRLKIFKENLEYIEKANK---EGNRTYKLGTNQFSDL 99
           + E+W     +H ++Y+D+ E+  RLKIF EN   I K N+   EG  ++KL  N+++DL
Sbjct: 25  VMEEWHTFKLEHRKNYQDDTEERFRLKIFNENKHKIAKHNQRFAEGKVSFKLAVNKYADL 84

Query: 100 TNDEFRALYTGYKMPSPSHRSTTSSTFK---YQNLSMTDVPTSLDWRDKGAVTPIKNQKE 156
            + EFR L  G+             +FK   + + +   +P S+DWR KGAVT +K+Q  
Sbjct: 85  LHHEFRQLMNGFNYTLHKQLRAADESFKGVTFISPAHVTLPKSVDWRTKGAVTAVKDQGH 144

Query: 157 CGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GNNGCLGGSREKAFAYIIQNQ 215
           CG CWAF++  A+EG    +SG L+ LSEQ L+DCST  GNNGC GG  + AF YI  N 
Sbjct: 145 CGSCWAFSSTGALEGQHFRKSGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKDNG 204

Query: 216 GIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAV-SMQPVSIAIAAYS 274
           GI TE  YPY+A+  +C   +    A    + ++P GDE+ + +AV ++ PV++AI A  
Sbjct: 205 GIDTEKSYPYEAIDDSCHFNKGTIGATDRGFTDIPQGDEKKMAEAVATVGPVAVAIDASH 264

Query: 275 TEFQSYKEGIFN-GVCGTQ-LDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR 332
             FQ Y EG++N   C  Q LDH V +VGFGT E G +YWL+KNSWG TWGD G++K++R
Sbjct: 265 ESFQFYSEGVYNEPQCDAQNLDHGVLVVGFGTDESGEDYWLVKNSWGTTWGDKGFIKMLR 324

Query: 333 D-EGLCGIGTRSSYPL 347
           + E  CGI + SSYPL
Sbjct: 325 NKENQCGIASASSYPL 340


>gi|326503122|dbj|BAJ99186.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326512552|dbj|BAJ99631.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 389

 Score =  269 bits (688), Expect = 1e-69,   Method: Compositional matrix adjust.
 Identities = 148/357 (41%), Positives = 203/357 (56%), Gaps = 37/357 (10%)

Query: 24  LLVSCASQVVSSRSTHEQSVVEIHEK--------WMAQHGRSYKDELEKEMRLKIFKENL 75
           +L  C+S+ +++ S H    ++ H          WM    RSY    EK  R K+++ N+
Sbjct: 29  MLAGCSSESLTTSSEHSDIGIDKHHDLMMARFHVWMTVQNRSYPTSSEKAHRFKVYRSNM 88

Query: 76  EYIEKANKEGNR---TYKLGTNQFSDLTNDEFRALYTGYKMPSPSHR----------STT 122
            YIE  N E      TY+LG   F+DLT++EF +LYTG K+P   HR          +T 
Sbjct: 89  RYIEALNAEATTSGFTYELGEGPFTDLTDEEFISLYTG-KIPDDDHREDGVHDEQIITTH 147

Query: 123 SSTFK-------YQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKI 175
           + +         Y N S    P  +DWR +GAVTP+K+Q +CG CWAF  VA +EGI KI
Sbjct: 148 AGSVNGAEGVTVYANFS-AGAPIRMDWRKRGAVTPVKDQGKCGSCWAFPTVATIEGIHKI 206

Query: 176 RSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAA 235
           + G L+ LSEQQL+DC    + GC GG    AF +IIQN GI T   Y Y+A  G C   
Sbjct: 207 KRGRLVSLSEQQLVDCDFL-DGGCNGGWPRNAFQWIIQNGGITTTSSYTYKAAEGQCKGN 265

Query: 236 QKPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGT-QLD 294
           +KP AAKI+ Y +V S  E +++  V+ QP++ +I  +  +FQ YK GI+NG C T +L+
Sbjct: 266 RKP-AAKITGYRKVKSNSEVSMVNIVANQPIAASIVVHGGQFQHYKGGIYNGPCATSKLN 324

Query: 295 HAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRDE----GLCGIGTRSSYPL 347
           H +TIVG+G    GA YW++KNSWG  WG+ GYM + R      G CGI  R  +PL
Sbjct: 325 HVITIVGYGQQAYGAKYWIVKNSWGAAWGNKGYMLMKRGTKNPLGQCGIAVRPIFPL 381


>gi|255635645|gb|ACU18172.1| unknown [Glycine max]
          Length = 355

 Score =  269 bits (688), Expect = 1e-69,   Method: Compositional matrix adjust.
 Identities = 135/339 (39%), Positives = 203/339 (59%), Gaps = 12/339 (3%)

Query: 18  MFIIITLLVSCASQVVSSRSTH--------EQSVVEIHEKWMAQHGRSYKDELEKEMRLK 69
           +F++  +  +    ++S  + H        +  V+ + E+W+ +H + Y    EKE R +
Sbjct: 8   LFMVFAVSSALDMSIISHDNAHADRATRRTDDEVMSMFEEWLVKHDKVYNALGEKEKRFQ 67

Query: 70  IFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQ 129
           IFK NL +I++ N   NRTYKLG N F+DLTN E+RA+Y       P     T    +Y 
Sbjct: 68  IFKNNLRFIDERNSL-NRTYKLGLNVFADLTNAEYRAMYLRTWDDGPRLDLDTPPRNRYV 126

Query: 130 NLSMTDVPTSLDWRDKGAVTPIKNQ-KECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQL 188
                 +P S+DWR +GAVTP+KNQ   C  CWAF AV AVE + KI++G+LI LSEQ++
Sbjct: 127 PRVGDTIPKSVDWRKEGAVTPVKNQGATCNSCWAFTAVGAVESLVKIKTGDLISLSEQEV 186

Query: 189 LDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEE 248
           +DC+T+ + GC GG  +  + YI +N GI+ E +YPY+   G C + +K A   I  +  
Sbjct: 187 VDCTTSSSRGCGGGDIQHGYIYIRKN-GISLEKDYPYRGDEGKCDSNKKNAIVTIDGHGW 245

Query: 249 VPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDG 308
           VP+  E+AL + ++ QPV++ I A   EFQ Y  G+F G CGT+L+HA+ +VG+G  +DG
Sbjct: 246 VPTQLEEALKQGIANQPVAVPIPADDYEFQYYTSGVFKGKCGTELNHALLLVGYGAEKDG 305

Query: 309 ANYWLIKNSWGNTWGDAGYMKIVRDEGLCGIGTRSSYPL 347
            +YW+ KNS+ + WG+ GY++I R    C  G    YP+
Sbjct: 306 -DYWIAKNSYSDKWGENGYIRIQRKLSTCKFGNGGYYPI 343


>gi|194883222|ref|XP_001975702.1| GG20414 [Drosophila erecta]
 gi|190658889|gb|EDV56102.1| GG20414 [Drosophila erecta]
          Length = 341

 Score =  269 bits (688), Expect = 1e-69,   Method: Compositional matrix adjust.
 Identities = 138/316 (43%), Positives = 197/316 (62%), Gaps = 14/316 (4%)

Query: 46  IHEKWMA---QHGRSYKDELEKEMRLKIFKENLEYIEKANK---EGNRTYKLGTNQFSDL 99
           + E+W     +H ++Y+D+ E+  RLKIF EN   I K N+   EG  ++KL  N+++DL
Sbjct: 25  VMEEWHTFKLEHRKNYQDDTEERFRLKIFNENKHKIAKHNQRYAEGKVSFKLAVNKYADL 84

Query: 100 TNDEFRALYTGYKMPSPSHRSTTSSTFK---YQNLSMTDVPTSLDWRDKGAVTPIKNQKE 156
            + EFR L  G+         +T  +FK   + + +   +P S+DWR KGAVT +K+Q  
Sbjct: 85  LHHEFRQLMNGFNYTLHKQLRSTDDSFKGVTFISPAHVTLPKSVDWRTKGAVTAVKDQGH 144

Query: 157 CGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GNNGCLGGSREKAFAYIIQNQ 215
           CG CWAF++  A+EG    +SG L+ LSEQ L+DCST  GNNGC GG  + AF YI  N 
Sbjct: 145 CGSCWAFSSTGALEGQHFRKSGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKDNG 204

Query: 216 GIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAV-SMQPVSIAIAAYS 274
           GI TE  YPY+A+  +C   +    A    + ++P GDE+ + +AV ++ PV++AI A  
Sbjct: 205 GIDTEKSYPYEAIDDSCHFNKGAIGATDRGFTDIPQGDEKKMAEAVATVGPVAVAIDASH 264

Query: 275 TEFQSYKEGIFN-GVCGTQ-LDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR 332
             FQ Y EG++N   C  Q LDH V +VG+GT E G +YWL+KNSWG TWGD G++K++R
Sbjct: 265 ESFQFYSEGVYNEPQCDAQNLDHGVLVVGYGTDESGDDYWLVKNSWGTTWGDKGFIKMLR 324

Query: 333 D-EGLCGIGTRSSYPL 347
           + +  CGI + SSYPL
Sbjct: 325 NKDNQCGIASASSYPL 340


>gi|66735056|gb|AAY53767.1| cysteine protease [Saprolegnia parasitica]
          Length = 523

 Score =  269 bits (687), Expect = 1e-69,   Method: Compositional matrix adjust.
 Identities = 131/304 (43%), Positives = 189/304 (62%), Gaps = 8/304 (2%)

Query: 50  WMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYT 109
           WM +      + LE   R ++F  N + IE  NK+ + ++ +G N++S LT DEF+ L T
Sbjct: 31  WMKKFAVKL-NPLEWVHRFEVFILNDQRIEAHNKDASSSFTMGHNEYSHLTFDEFKKLRT 89

Query: 110 GYKMPSPSH-RSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAA 168
           G ++ SPS+ +S          ++MTDVP  +DW ++G VTP+KNQ  CG CWAF+   A
Sbjct: 90  GLRV-SPSYIQSRAKYALMAPAVNMTDVPNEMDWVEQGGVTPVKNQGMCGSCWAFSTTGA 148

Query: 169 VEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAV 228
           +EG   + S  L+ +SEQ+L+DC  NG+ GC GG  + AF ++  ++G+  E++YPY A 
Sbjct: 149 IEGAAFVSSKQLVSVSEQELVDCDHNGDMGCNGGLMDNAFKWVKTHKGLCKEEDYPYHAK 208

Query: 229 PGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGV 288
            GTC+  +     K++ + +VP+ DEQAL  AV+ QPVS+AI A   EFQ YK G+F+  
Sbjct: 209 EGTCALKKCKPVTKVTAFHDVPANDEQALKAAVAKQPVSVAIEADQPEFQFYKSGVFDKS 268

Query: 289 CGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR----DEGLCGIGTRSS 344
           CGT+LDH V +VG+G  E G  YW +KNSWG  WGD GY+K+ R    + G CG+    S
Sbjct: 269 CGTKLDHGVLVVGYG-EEGGKKYWKVKNSWGADWGDKGYIKLAREFGPETGQCGVAMVPS 327

Query: 345 YPLA 348
           YP A
Sbjct: 328 YPTA 331


>gi|357130486|ref|XP_003566879.1| PREDICTED: fruit bromelain-like [Brachypodium distachyon]
          Length = 354

 Score =  269 bits (687), Expect = 1e-69,   Method: Compositional matrix adjust.
 Identities = 141/320 (44%), Positives = 190/320 (59%), Gaps = 17/320 (5%)

Query: 42  SVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTN 101
           +V   HE+WMA+ GR+YKD  EK  R ++F  N  +++  N+ GNRTY LG N FSDLT+
Sbjct: 33  TVASRHERWMARFGRAYKDADEKARRQEVFGANARHVDAVNRSGNRTYTLGLNHFSDLTD 92

Query: 102 DEFRALYTGYK--MPSPSH--RSTTSSTFKYQNLS--MTDVPTSLDWRDKGAVTPIKNQK 155
            EF   + GY+   P P    R       K   L+    DVP S+DWR +GAVT IKNQ+
Sbjct: 93  HEFLQQHLGYRHHQPGPGGLLRPEDQDMSKATALADYGQDVPDSVDWRAQGAVTEIKNQR 152

Query: 156 ECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQ 215
            CG CWAFAAVAA EG+ KI +GNLI +SEQQ+LDC T G N C GG    A  Y+  + 
Sbjct: 153 SCGSCWAFAAVAATEGLVKIATGNLISMSEQQVLDC-TGGGNTCDGGDINAALRYVAASG 211

Query: 216 GIATEDEYPYQAVPGTC---SAAQKPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAA 272
           G+  E  Y Y A  G C   S A   A+   + +  +  GDE AL    + QPV++A+ A
Sbjct: 212 GLQPEAAYAYAAQKGACRGASPANSAASVGGARFARL-GGDEGALRGLAAGQPVAVALEA 270

Query: 273 YSTEFQSYKEGIFNG--VCGTQLDHAVTIVGFGTTED-GANYWLIKNSWGNTWGDAGYMK 329
              +F+ YK G++ G   CG +L+H VT+VG+G  +D G  YW++KN WG  WG+ GYM+
Sbjct: 271 SEPDFRHYKSGVYAGSASCGRRLNHGVTVVGYGAEDDSGDEYWVVKNQWGTLWGEKGYMR 330

Query: 330 IVRDE---GLCGIGTRSSYP 346
           + R +     CGI + + YP
Sbjct: 331 VARGDVAGANCGIASYAYYP 350


>gi|115479933|ref|NP_001063560.1| Os09g0497500 [Oryza sativa Japonica Group]
 gi|113631793|dbj|BAF25474.1| Os09g0497500 [Oryza sativa Japonica Group]
 gi|215704298|dbj|BAG93138.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 349

 Score =  269 bits (687), Expect = 1e-69,   Method: Compositional matrix adjust.
 Identities = 137/324 (42%), Positives = 194/324 (59%), Gaps = 21/324 (6%)

Query: 43  VVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTND 102
           +++  E+WM +HGR+Y D  EK+ R ++++ N+E +E  N   N  YKL  N+F+DLTN+
Sbjct: 27  MLDRFEQWMIRHGRAYTDAGEKQRRFEVYRRNVELVETFNSMSN-GYKLADNKFADLTNE 85

Query: 103 EFRALYTGYK--MPSPSHRSTTSSTFKYQNLSMTDV-PTSLDWRDKGAVTPIKNQKECGC 159
           EFRA   G++  +  P   +T S+       S  D+ P S+DWR KGAV  +KNQ +CG 
Sbjct: 86  EFRAKMLGFRPHVTIPQISNTCSADIAMPGESSDDILPKSVDWRKKGAVVEVKNQGDCGS 145

Query: 160 CWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIAT 219
           CWAF+AVAA+EGI +I++G L+ LSEQ+L+DC      GC GG    AF +++ N G+ T
Sbjct: 146 CWAFSAVAAIEGINQIKNGELVSLSEQELVDCDDEA-VGCGGGYMSWAFEFVVGNHGLTT 204

Query: 220 EDEYPYQAVPGTCSAAQ-KPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQ 278
           E  YPY A  G C AA+   +A  I+ Y  V    E  L +A + QPVS+A+   S  FQ
Sbjct: 205 EASYPYHAANGACQAAKLNQSAVAIAGYRNVTPSSEPDLARAAAAQPVSVAVDGGSFMFQ 264

Query: 279 SYKEGIFNGVCGTQLDHAVTIVGFGTTEDGAN----------YWLIKNSWGNTWGDAGYM 328
            Y  G++ G C   ++H VT+VG+G +E   +          YW++KNSWG  WGDAGY+
Sbjct: 265 LYGSGVYTGPCTADVNHGVTVVGYGESEPKTDGGGAAKGGEKYWIVKNSWGAEWGDAGYI 324

Query: 329 KIVRD-----EGLCGIGTRSSYPL 347
            + RD      GLCGI    SYP+
Sbjct: 325 LMQRDVAGLASGLCGIALLPSYPV 348


>gi|167521499|ref|XP_001745088.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163776702|gb|EDQ90321.1| predicted protein [Monosiga brevicollis MX1]
          Length = 294

 Score =  269 bits (687), Expect = 2e-69,   Method: Compositional matrix adjust.
 Identities = 144/304 (47%), Positives = 195/304 (64%), Gaps = 21/304 (6%)

Query: 52  AQHGRSYKDELEKEMRLKIFKENLEYIEKANKE---GNRTYKLGTNQFSDLTNDEFRALY 108
           + + +SY+ E  +  RL  F+ NLE+I K N E   G  +Y +G N+F+DLT DEF ALY
Sbjct: 3   SDYSKSYESEAVEAKRLAAFEANLEFINKHNAEHAQGLHSYTVGVNEFADLTIDEFMALY 62

Query: 109 TGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAA 168
               +PS  +R+   +T  Y   +  D   S+DWR KGAVTPIKNQ +CG CW+F+   +
Sbjct: 63  ----VPSKFNRTMPYNTV-YLPATSED---SVDWRTKGAVTPIKNQGQCGSCWSFSTTGS 114

Query: 169 VEGITKIRSGNLIQLSEQQLLDCSTN-GNNGCLGGSREKAFAYIIQNQGIATEDEYPYQA 227
            EG   I +GNL+ LSEQQL+DCS + GN GC GG  + AF YII N+G+ TE++YPY A
Sbjct: 115 TEGAHAIATGNLVSLSEQQLVDCSGSFGNQGCNGGLMDDAFKYIISNKGLDTEEDYPYTA 174

Query: 228 VPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFN 286
             GTC+  ++   AA IS+Y +VP  +E  L  AV+  PVS+AI A  + FQ YK G+F+
Sbjct: 175 QDGTCNKEKEAKHAATISSYSDVPKNNEDQLAAAVAKGPVSVAIEADQSGFQLYKSGVFD 234

Query: 287 GVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD---EGLCGIGTRS 343
           G CGT LDH V +VG+  T+D   YW++KNSWG TWG  GY+ + R     G+CGI  + 
Sbjct: 235 GNCGTNLDHGVLVVGY--TDD---YWIVKNSWGTTWGVEGYINMKRGVSASGICGIAMQP 289

Query: 344 SYPL 347
           SYP+
Sbjct: 290 SYPI 293


>gi|356543010|ref|XP_003539956.1| PREDICTED: LOW QUALITY PROTEIN: vignain-like [Glycine max]
          Length = 306

 Score =  269 bits (687), Expect = 2e-69,   Method: Compositional matrix adjust.
 Identities = 144/309 (46%), Positives = 196/309 (63%), Gaps = 19/309 (6%)

Query: 48  EKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRAL 107
           E+W+ Q+ R YKD+ E E+R  I++ NLEYIE  N +   +Y L  N+F+DLTN+EF + 
Sbjct: 6   ERWLKQNDRXYKDKEEWEVRFGIYQANLEYIECKNSQ-EXSYNLTDNKFADLTNEEFVSP 64

Query: 108 YTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVA 167
           Y G+       R    + F Y      D+P S DWR +GAV+ IK+Q  CG CWAF+AVA
Sbjct: 65  YLGFGT-----RFLPHTGFMYH--EHEDLPESKDWRKEGAVSDIKDQGNCGSCWAFSAVA 117

Query: 168 AVEGITKIRSGNLIQLSEQQLLDCST-NGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQ 226
           AVEGI KI+SG L+ LSEQ+  DC   +GN GC GG  + AFA+I +N G+ T  +YPY+
Sbjct: 118 AVEGINKIKSGKLVSLSEQEFRDCDVEDGNQGCEGGLMDTAFAFIKKNGGLTTSKDYPYE 177

Query: 227 AVPGTCSAAQK-PAAAKISNYEEVPSGDEQALLKAVSM--QPVSIAIAAYSTEFQSYKEG 283
            V GTC+  +    AA IS + +VP+ DE  L    +   Q  S+AI A    FQ Y +G
Sbjct: 178 GVDGTCNKEKALHHAANISGHVKVPANDEAMLKAKAAAANQXESVAIDAGGHAFQLYLKG 237

Query: 284 IFNGVCGTQLDHAVTIVGFGT-TEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCG 338
           +F+G+CG QL+H VTIVG+G  T D   YW++KNSWG  WG++GY+++ RD     G CG
Sbjct: 238 VFSGICGKQLNHGVTIVGYGKGTSD--KYWIVKNSWGADWGESGYIRMKRDAFDKAGTCG 295

Query: 339 IGTRSSYPL 347
           I  ++SYPL
Sbjct: 296 IAMQASYPL 304


>gi|312381833|gb|EFR27483.1| hypothetical protein AND_05794 [Anopheles darlingi]
          Length = 344

 Score =  269 bits (687), Expect = 2e-69,   Method: Compositional matrix adjust.
 Identities = 141/321 (43%), Positives = 196/321 (61%), Gaps = 19/321 (5%)

Query: 46  IHEKWMA---QHGRSYKDELEKEMRLKIFKENLEYIEKANKE---GNRTYKLGTNQFSDL 99
           + E+W A   QH + Y  E E+ +R+KI+ +N   I K N+    G   ++L  N+++DL
Sbjct: 23  VKEEWNAFKLQHRKKYDSESEERIRMKIYVQNKHKIAKHNQRYDLGQEKFRLRVNKYADL 82

Query: 100 TNDEF--------RALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPI 151
            ++EF        R+   G K+       T      +   +  DVPT++DWR+KGAVTP+
Sbjct: 83  LHEEFVHTLNGFNRSAAAGSKLLGREQLMTIEEPITWIEPANVDVPTTIDWREKGAVTPV 142

Query: 152 KNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GNNGCLGGSREKAFAY 210
           K+Q  CG CW+F+A  A+EG    ++G L+ LSEQ L+DCST  GNNGC GG  + AF Y
Sbjct: 143 KDQGHCGSCWSFSATGALEGQHFRKTGKLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFQY 202

Query: 211 IIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAV-SMQPVSIA 269
           +  N+GI TE  YPY+A+   C    K   A    + ++P GDE+AL KA+ ++ PVS+A
Sbjct: 203 VKDNKGIDTEKAYPYEAIDDECHYNPKAIGATDKGFVDIPQGDEKALKKALATVGPVSVA 262

Query: 270 IAAYSTEFQSYKEGI-FNGVCGT-QLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGY 327
           I A    FQ Y EG+ +   C + QLDH V  VG+GTTEDG +YWL+KNSWG TWGD GY
Sbjct: 263 IDASHESFQFYSEGVYYEPQCDSEQLDHGVLAVGYGTTEDGEDYWLVKNSWGTTWGDQGY 322

Query: 328 MKIVRD-EGLCGIGTRSSYPL 347
           +K+ R+ E  CGI T +SYPL
Sbjct: 323 VKMARNRENHCGIATTASYPL 343


>gi|326490904|dbj|BAJ90119.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 457

 Score =  268 bits (686), Expect = 2e-69,   Method: Compositional matrix adjust.
 Identities = 135/310 (43%), Positives = 185/310 (59%), Gaps = 12/310 (3%)

Query: 48  EKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANK------EGNRTYKLGTNQFSDLTN 101
           E W A+HG++Y    E+  RL  F EN  ++   N        G  +Y L  N F+DLT+
Sbjct: 40  EAWCAEHGKAYATPGERAARLAAFAENAAFVAAHNDAVASSGPGGPSYTLALNAFADLTH 99

Query: 102 DEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCW 161
           DEFRA   G     P      S +       +  VP +LDWR  GAVT +K+Q  CG CW
Sbjct: 100 DEFRAARLGRLAVGPGPLGAPSPSDGGFEGRVGAVPDALDWRQSGAVTKVKDQGSCGACW 159

Query: 162 AFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATED 221
           +F+A  A+EGI KI +G+L+ LSEQ+L+DC  + N GC GG    A+ ++I+N GI TED
Sbjct: 160 SFSATGAMEGINKITTGSLLSLSEQELIDCDRSYNTGCGGGLMTYAYKFVIKNGGIDTED 219

Query: 222 EYPYQAVPGTCSAAQ-KPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSY 280
           +YP++   GTC+  + K     I  Y+EVPS  E  LL+AV+ QP+S+ I   +  FQ Y
Sbjct: 220 DYPFREADGTCNKNKLKKHVVTIDGYKEVPSSKEDLLLQAVAQQPISVGICGSARAFQLY 279

Query: 281 KEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGL 336
            +GIF+G C T LDHAV IVG+G +E G +YW++KNSWG  WG  GYM + R+     G+
Sbjct: 280 SQGIFDGPCPTSLDHAVLIVGYG-SEGGKDYWIVKNSWGERWGMKGYMHMHRNTGSSSGI 338

Query: 337 CGIGTRSSYP 346
           CGI   +S+P
Sbjct: 339 CGINMMASFP 348


>gi|297845822|ref|XP_002890792.1| hypothetical protein ARALYDRAFT_473117 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297336634|gb|EFH67051.1| hypothetical protein ARALYDRAFT_473117 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 322

 Score =  268 bits (684), Expect = 3e-69,   Method: Compositional matrix adjust.
 Identities = 141/327 (43%), Positives = 205/327 (62%), Gaps = 35/327 (10%)

Query: 30  SQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTY 89
           SQ     + +EQS+V+ H++WM Q  R Y+DE EKEMRL++FK+NL++IE  N  GN++Y
Sbjct: 21  SQARPHVTLNEQSIVDYHQQWMTQFSRVYQDESEKEMRLQVFKKNLKFIENFNNMGNQSY 80

Query: 90  KLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPT---SLDWRDKG 146
            +G N+F+D T +EF A +TG ++   +     + T   +N +++D+     S DWRD+G
Sbjct: 81  TVGVNEFTDWTIEEFLATHTGLRVNVTTLSELFNETMPSRNWNISDIDIDDESKDWRDEG 140

Query: 147 AVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREK 206
           AV P+K Q  C             G+TKI   NL+ LSEQQL+DC T  N GC GG  E+
Sbjct: 141 AVIPVKVQGAC-------------GLTKISGKNLLTLSEQQLIDCDTEKNTGCDGGGIEE 187

Query: 207 AFAYIIQNQGIATEDEYPYQAVPGTCSA-AQKPAAAKISNYEEVPSGDEQALLKAVSMQP 265
           AF YII+N G++ E EYPYQ   G+C A A+     +I  +E VPS +E+ALL+AV  QP
Sbjct: 188 AFKYIIKNGGVSLETEYPYQVKKGSCRANARSATQTQIRGFEMVPSHNERALLEAVRRQP 247

Query: 266 VSIAIAAYSTEFQSYKEGIFNGV-CGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGD 324
           VS+ I A +  F++YK G++ G+ CGT ++HAVT VG+GT        +I+     +WG+
Sbjct: 248 VSVLIDARADSFKTYKGGVYAGLDCGTDVNHAVTFVGYGT--------MIQ-----SWGE 294

Query: 325 AGYMKIVRD----EGLCGIGTRSSYPL 347
            GYM+I RD    +G+CGI   ++YP+
Sbjct: 295 NGYMRIRRDVEWPQGMCGIAQVAAYPI 321


>gi|255546708|ref|XP_002514413.1| cysteine protease, putative [Ricinus communis]
 gi|223546510|gb|EEF48009.1| cysteine protease, putative [Ricinus communis]
          Length = 324

 Score =  268 bits (684), Expect = 3e-69,   Method: Compositional matrix adjust.
 Identities = 143/347 (41%), Positives = 210/347 (60%), Gaps = 44/347 (12%)

Query: 13  INTTPMFIIITLLVSCAS-----QVVSSRSTHEQS---VVEIHEKWMAQHGRSYKDELEK 64
           +++  +F I T LV C+       +V     H  S   + E+ E WM++HG++Y+   EK
Sbjct: 5   VSSIFLFTIFTSLVICSVVAHDFSIVGYSPEHLTSMHKLTELFESWMSKHGKTYESIEEK 64

Query: 65  EMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSS 124
             RL++FK+NL +I++ N++   TY L  N+F+DL+++EF+                 S 
Sbjct: 65  LHRLEVFKDNLMHIDRRNRDVT-TYWLALNEFADLSHEEFK-----------------SK 106

Query: 125 TFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLS 184
             + + L            +KGAV P+KNQ  CG CWAF+ VAAVEGI +I +GNL  LS
Sbjct: 107 LAQIRRL------------EKGAVAPVKNQGSCGSCWAFSTVAAVEGINQIVTGNLTSLS 154

Query: 185 EQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPA-AAKI 243
           EQ+L+DC T+ N+GC GG  + AF YI+ N G+  E++YPY    GTC   ++      I
Sbjct: 155 EQELIDCDTSFNSGCNGGLMDYAFDYIVNNGGLHKEEDYPYLMEEGTCDEKREEMEVVTI 214

Query: 244 SNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFG 303
           S Y +VP  +E++LLKA++ QP+SIAI A   +FQ Y  G+FNG CGT LDH V  VG+G
Sbjct: 215 SGYHDVPENNEESLLKALAHQPLSIAIEASGRDFQFYGRGVFNGPCGTDLDHGVAAVGYG 274

Query: 304 TTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYP 346
           +++ G +Y ++KNSWG  WG+ GY+++ R+    EGLCGI   +SYP
Sbjct: 275 SSK-GLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGLCGINKMASYP 320


>gi|326497561|dbj|BAK05870.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 340

 Score =  268 bits (684), Expect = 4e-69,   Method: Compositional matrix adjust.
 Identities = 145/343 (42%), Positives = 202/343 (58%), Gaps = 21/343 (6%)

Query: 14  NTTPMFIIITLLVSCASQVVSSRS-----THEQSVVEIHEKWMAQHGRSYKDELEKEMRL 68
            T P+  I+ LL         + S       +  +++   +W A H RSY    E+  R 
Sbjct: 7   GTRPVIPILVLLTGGLFAAFPAASGGRVDAGDMLMMDRFRQWQATHNRSYLSAEERLRRF 66

Query: 69  KIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKY 128
           ++++ N+EYI+  N+ G  TY+LG NQF+DLT +EF A Y G       H  +  +T   
Sbjct: 67  EVYRTNVEYIDATNRRGGLTYELGENQFADLTGEEFLARYAG------GHTGSAITTAAE 120

Query: 129 QNLSM-TDVPTSLDWRDKGAVTPIKNQ-KECGCCWAFAAVAAVEGITKIRSGNLIQLSEQ 186
            + S+  D P S+DWR KGAVTP+KNQ  +C  CWAF+AVA +E +  I++G L+ LSEQ
Sbjct: 121 ADGSLEADPPASVDWRAKGAVTPVKNQGSQCYSCWAFSAVATMESLYFIKTGKLVALSEQ 180

Query: 187 QLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNY 246
           QL+DC    + GC  G   +AF +I++N GI T  +YPY+AV G CSAA KPA   I+ +
Sbjct: 181 QLVDCDKY-DGGCNKGYYHRAFQWIMENGGITTAAQYPYKAVRGACSAA-KPAV-TITGH 237

Query: 247 EEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTE 306
             V   +E AL  AV+ QP+ +AI       Q YK G+F+  CG Q+ HAV  VG+G   
Sbjct: 238 LAVAK-NELALQSAVARQPIGVAIEV-PISMQFYKSGVFSAACGIQMSHAVVTVGYGADA 295

Query: 307 DGANYWLIKNSWGNTWGDAGYMKIVRD---EGLCGIGTRSSYP 346
            G  YWL+KNSWG TWG+AGY+++ RD    GLCGI   ++YP
Sbjct: 296 SGLKYWLVKNSWGQTWGEAGYIRMRRDVGGGGLCGIALDTAYP 338


>gi|53791858|dbj|BAD53944.1| putative cysteine protease [Oryza sativa Japonica Group]
          Length = 335

 Score =  267 bits (683), Expect = 5e-69,   Method: Compositional matrix adjust.
 Identities = 143/343 (41%), Positives = 197/343 (57%), Gaps = 22/343 (6%)

Query: 15  TTPMFIIITLLV--SCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFK 72
           T+ + ++ TL+   + A+    +  + +   +++ E+WMA+ G++YK   EKE R  IF+
Sbjct: 2   TSIVLLVCTLMALQAMAASAYYNNGSDDGVTMQMFEEWMAKFGKTYKCHGEKEHRFGIFR 61

Query: 73  ENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLS 132
           +N+ +I     +      +G NQF+DLTNDEF A YTG K P P            + + 
Sbjct: 62  DNVHFIRGYKPQVTYDSAVGINQFADLTNDEFVATYTGAKPPHPKEAP--------RPVD 113

Query: 133 MTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS 192
               P  +DWR +GAVT +K+Q  CG CWAFAAVAA+EG+TKIR+G L  LSEQ+L+DC 
Sbjct: 114 PIWTPCCIDWRFRGAVTGVKDQGACGSCWAFAAVAAIEGLTKIRTGQLTPLSEQELVDCD 173

Query: 193 TNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQK--PAAAKISNYEEVP 250
           TN +NGC GG  ++AF  +    GI  E +Y Y+   G C         AA I  Y  VP
Sbjct: 174 TN-SNGCGGGHTDRAFELVASKGGITAESDYRYEGFQGKCRVDDMLFNHAASIGGYRAVP 232

Query: 251 SGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGAN 310
             DE+ L  AV+ QPV++ I A    FQ YK G+F G CG   +HAVT+VG+   +DGA+
Sbjct: 233 PNDERQLATAVARQPVTVYIDASGPAFQFYKSGVFPGPCGASSNHAVTLVGY--CQDGAS 290

Query: 311 ---YWLIKNSWGNTWGDAGYM----KIVRDEGLCGIGTRSSYP 346
              YWL KNSWG TWG  GY+     IV+  G CG+     YP
Sbjct: 291 GKKYWLAKNSWGKTWGQQGYILLEKDIVQPHGTCGLAVSPFYP 333


>gi|154183745|gb|ABS70713.1| cathepsin L-like cysteine proteinase [Dermacentor variabilis]
          Length = 333

 Score =  267 bits (683), Expect = 5e-69,   Method: Compositional matrix adjust.
 Identities = 143/337 (42%), Positives = 209/337 (62%), Gaps = 13/337 (3%)

Query: 19  FIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYI 78
            + I+LL  CA  VV++ ++  + +    E + A H +SY+  +E+ +R KIF EN   +
Sbjct: 1   MLRISLL--CAFVVVTTAASSHEILRTQWEAFKATHKKSYQSNMEELLRFKIFSENSLLV 58

Query: 79  EKANKEGNR---TYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTD 135
            + N++  R   +YKLG NQF DL   EF  ++ GY+    + R +T       N++ + 
Sbjct: 59  ARHNEKYARGLVSYKLGMNQFGDLLPHEFARMFNGYRGARTAGRGST--FLPPANVNYSS 116

Query: 136 VPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS-TN 194
           +P S+DWR+KGAVTP+KNQ +CG CWAF+   ++EG   +++G L+ LSEQ L+DCS T 
Sbjct: 117 LPQSMDWREKGAVTPVKNQGQCGSCWAFSTTGSLEGQHFLKTGVLVSLSEQNLVDCSETF 176

Query: 195 GNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDE 254
           GN+GC GG  + AF YI  N GI TE  YPY+A  G C   ++   A  + + ++  G E
Sbjct: 177 GNHGCEGGLMDNAFQYIKANGGIDTEKSYPYEAEDGECRFKKQNVGATDTGFVDIEQGSE 236

Query: 255 QALLKAV-SMQPVSIAIAAYSTEFQSYKEGIFNGV-CGT-QLDHAVTIVGFGTTEDGANY 311
             L KAV ++ PVS+AI A  + FQ Y EG+++   C + QLDH V +VG+G  EDG  Y
Sbjct: 237 DDLKKAVATVGPVSVAIDASHSSFQLYSEGVYDETECSSEQLDHGVLVVGYG-VEDGKKY 295

Query: 312 WLIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYPL 347
           WL+KNSW  +WGD GY+K+ RD +  CGI + +SYPL
Sbjct: 296 WLVKNSWAESWGDNGYIKMSRDKDNQCGIASAASYPL 332


>gi|72008176|ref|XP_780713.1| PREDICTED: cathepsin L-like [Strongylocentrotus purpuratus]
          Length = 335

 Score =  267 bits (682), Expect = 7e-69,   Method: Compositional matrix adjust.
 Identities = 144/337 (42%), Positives = 209/337 (62%), Gaps = 13/337 (3%)

Query: 19  FIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYI 78
           ++ + L+ +C   VVSS S       E   +W  +HG+ Y  + E+  R  I+++NL+ +
Sbjct: 3   YLSVLLVAAC---VVSSLSMSFTDFDEDWNQWKNEHGKRYLSDEEEASRKLIWEKNLDIV 59

Query: 79  EKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTD 135
            K N +   G+ TY LG NQF+DL N+EF A+ TG+++   S ++   STF   N ++ +
Sbjct: 60  IKHNLKYDLGHFTYALGMNQFADLKNEEFVAMMTGFRVNGTS-KAAKGSTFLPSN-NIGE 117

Query: 136 VPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS-TN 194
           +P ++DWR KG VTP+K+Q +CG CWAF+   ++EG     +G L+ LSEQ L+DCS   
Sbjct: 118 LPKTVDWRTKGYVTPVKDQGQCGSCWAFSTTGSLEGQHFKATGKLVSLSEQNLVDCSGKE 177

Query: 195 GNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDE 254
           GN GC GG  ++AF YII+  GI TE+ YPY+AV G C   +    A ++ Y +V S  E
Sbjct: 178 GNEGCDGGLMDQAFQYIIKAGGIDTEESYPYKAVDGECHFKKANIGATVTGYTDVTSDSE 237

Query: 255 QALLKAVS-MQPVSIAIAAYSTEFQSYKEGIFN--GVCGTQLDHAVTIVGFGTTEDGANY 311
            AL KAV+ + P+S+AI A    FQ YK G++N      T LDH V  VG+GTT DG +Y
Sbjct: 238 TALQKAVAHIGPISVAIDASHMSFQLYKSGVYNEPDCSSTLLDHGVLAVGYGTTSDGTDY 297

Query: 312 WLIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYPL 347
           W++KNSW  TWG  GY+ + R+ +  CGI T++SYPL
Sbjct: 298 WIVKNSWAETWGMNGYLWMSRNKDNQCGIATQASYPL 334


>gi|356542171|ref|XP_003539543.1| PREDICTED: LOW QUALITY PROTEIN: KDEL-tailed cysteine endopeptidase
           CEP2-like [Glycine max]
          Length = 342

 Score =  267 bits (682), Expect = 7e-69,   Method: Compositional matrix adjust.
 Identities = 144/346 (41%), Positives = 206/346 (59%), Gaps = 24/346 (6%)

Query: 15  TTPMFIIITLLVSCASQVVSS--------RSTHEQSVVEIHEKWMAQHGRSYKDELEKEM 66
           T  +  II LLV C   + +S         S+  + +   +E W+ ++G+ Y+++ E E 
Sbjct: 4   TITLVAIINLLVLCNLWITASACPAKHNDNSSDSEVMRMRYESWLKKYGQKYRNKDEWEF 63

Query: 67  RLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTF 126
           R +I++ N+++IE  N + N +YKL  N+F DLTN+EFR +Y  Y+      RS   + F
Sbjct: 64  RFEIYRANVQFIEVYNSQ-NYSYKLMDNKFVDLTNEEFRRMYLVYQP-----RSHLQTRF 117

Query: 127 KYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQ 186
            YQ     D+P  +DWR +GAVT IK+Q  CG CW+F+AVA VE I KI++G L+ LSEQ
Sbjct: 118 MYQ--KHGDLPKRIDWRTRGAVTXIKDQGHCGSCWSFSAVATVEDINKIKTGKLVSLSEQ 175

Query: 187 QLLDCST-NGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQ-KPAAAKIS 244
           QL+DC   NGN GC GG  E  F +I +  G+ T+  YPYQ   G  + A+ +  A  I 
Sbjct: 176 QLIDCDNRNGNEGCNGGHME-TFTFITKRGGLTTDKNYPYQGSDGDXNKAKVRNHAVAIC 234

Query: 245 NYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGT 304
            YE +P+ +E  L  AV+ QP S+A  A    FQ Y +G F+G CG  L+H +TIVG+G 
Sbjct: 235 GYENLPAHNENMLKAAVAHQPASVATDAGGYAFQLYSKGTFSGSCGKDLNHRMTIVGYG- 293

Query: 305 TEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYP 346
            E+G  YWL+KNSW N  G +GY+++ RD    +G CG    +SYP
Sbjct: 294 EENGEKYWLVKNSWANDXGVSGYIRMKRDPKDKDGTCGTAMEASYP 339


>gi|357130488|ref|XP_003566880.1| PREDICTED: fruit bromelain-like [Brachypodium distachyon]
          Length = 356

 Score =  267 bits (682), Expect = 7e-69,   Method: Compositional matrix adjust.
 Identities = 139/315 (44%), Positives = 189/315 (60%), Gaps = 16/315 (5%)

Query: 47  HEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRA 106
           HE+WMA+ GR Y D  EK  R ++F  N  Y++  N+ GNRTY LG N+FSDLT+DEF  
Sbjct: 39  HEEWMAKFGRVYTDAQEKARRQEVFGANARYVDAVNRAGNRTYTLGLNKFSDLTDDEFVQ 98

Query: 107 LYTGYK-MPSPSHRSTTSSTFKYQNLS--MTDVPTSLDWRDKGAVTPIKNQKECGCCWAF 163
            + GY+       R    +  K   L     D+P S+DWR +GAVT +KNQ  CGCCWAF
Sbjct: 99  THLGYRGHQQGGLRPEEENVSKVAALGYGQADMPESVDWRAQGAVTGVKNQGSCGCCWAF 158

Query: 164 AAVAAVEGITKIRSGNLIQLSEQQLLDCSTN----GN-NGCLGGSREKAFAYIIQNQGIA 218
           AAVAA EG+ KI +GNLI +SEQQ+LDC+      GN N C GG  + A  Y+  ++G+ 
Sbjct: 159 AAVAATEGLVKIATGNLISMSEQQVLDCTGQSPGMGNTNTCDGGHIDDALRYVAASRGLQ 218

Query: 219 TEDEYPYQAVPGTCSAAQKP-AAAKISNYEEVP-SGDEQALLKAVSMQPVSIAIAAYSTE 276
            E  Y Y  + G C +   P +AA     + V   GDE  L   V+ QP+++++ A S +
Sbjct: 219 PEAAYAYTGLQGACQSGFTPNSAASFGEPQTVTLQGDEGRLQGLVAGQPIAVSVEA-SDD 277

Query: 277 FQSYKEGIFNG---VCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD 333
           F+ Y  G+F      CG +L+HAVT+VG+G+ + G  YWL+KN WG +WG+ GYM+I R 
Sbjct: 278 FRHYMSGVFTAGTSSCGQRLNHAVTVVGYGSADGGQEYWLVKNQWGTSWGEGGYMRIARG 337

Query: 334 EGL--CGIGTRSSYP 346
            G   CGI   + YP
Sbjct: 338 NGAPNCGISAYAYYP 352


>gi|356517384|ref|XP_003527367.1| PREDICTED: LOW QUALITY PROTEIN: vignain-like [Glycine max]
          Length = 332

 Score =  266 bits (681), Expect = 7e-69,   Method: Compositional matrix adjust.
 Identities = 151/340 (44%), Positives = 212/340 (62%), Gaps = 28/340 (8%)

Query: 20  IIITLLVSCASQV--VSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
           I   +L+S A     V+ R+  + S+ E H + M ++ +  KD  +      +FKEN+ Y
Sbjct: 10  IAFAMLLSMAFLAFQVTCRTLQDASMYESHGQRMTRYSKVDKDPPDX-----VFKENVNY 64

Query: 78  IEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVP 137
           IE  N   ++ YK   NQF+       +  + G+ M S   R TT   FK++N++ T  P
Sbjct: 65  IEACNNAADKPYKRDINQFAP------KKRFKGH-MCSSIIRITT---FKFENVTAT--P 112

Query: 138 TSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLS-EQQLLDCSTNG- 195
           +++D R K AVTPIK+Q +CGC WA +AVAA EGI  + +G LI LS EQ+L+DC T G 
Sbjct: 113 STVDCRQKVAVTPIKDQGQCGCFWALSAVAATEGIHALXAGKLILLSSEQELVDCDTKGV 172

Query: 196 NNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSA--AQKPAAAKISNYEEVPSGD 253
           +  C GG  + AF +IIQN G+ TE  YPY+ V G C+A  A K AA  I+ YE+VP+ +
Sbjct: 173 DQDCQGGLMDDAFKFIIQNHGLNTEANYPYKGVDGKCNAYEADKNAATIITGYEDVPANN 232

Query: 254 EQA-LLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYW 312
           E+A L KAV+  PVS+AI A  ++FQ YK G+F G CGT+LDH VT VG+G ++DG  YW
Sbjct: 233 EKAHLQKAVANNPVSVAIDASGSDFQFYKSGVFTGSCGTELDHGVTAVGYGVSDDGTEYW 292

Query: 313 LIKNSWGNTWGDAGYMKIVR----DEGLCGIGTRSSYPLA 348
           L+KNS G  WG+ GY+++ R    +E LCGI  ++SYP A
Sbjct: 293 LVKNSRGTEWGEEGYIRMQRGVDSEEALCGIAVQASYPSA 332


>gi|194701748|gb|ACF84958.1| unknown [Zea mays]
 gi|414589103|tpg|DAA39674.1| TPA: thiol protease SEN102 [Zea mays]
          Length = 374

 Score =  266 bits (681), Expect = 8e-69,   Method: Compositional matrix adjust.
 Identities = 146/347 (42%), Positives = 199/347 (57%), Gaps = 33/347 (9%)

Query: 29  ASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNR- 87
           A  +  S ST + S++E  ++W A + +SY    E+  R ++   N+ YIE  N E    
Sbjct: 32  AGDMERSMSTDDSSMIERFQRWKAAYNKSYATVAEERRRFRVCARNMAYIEATNAEAEAA 91

Query: 88  --TYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFK------------------ 127
             TY+LG   ++DLTN EF A+YT    P+P+      S                     
Sbjct: 92  GLTYELGETAYTDLTNQEFMAMYTA---PAPAQLPADESVITTRAGPVDAVGGAPGQLPV 148

Query: 128 YQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQ 187
           Y NLS T  P S+DWR  GAVTP+KNQ  CG CWAF+ VA VEGI +IR+G L+ LSEQ+
Sbjct: 149 YVNLS-TSAPASVDWRASGAVTPVKNQGRCGSCWAFSTVAVVEGIYQIRTGKLVSLSEQE 207

Query: 188 LLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQ-KPAAAKISNY 246
           L+DC T  ++GC GG   +A  +I  N GI TE +YPY      C+ A+    A  I+  
Sbjct: 208 LVDCDTL-DDGCDGGISYRALRWIASNGGITTETDYPYTGTTDACNRAKLSHNAVSIAGL 266

Query: 247 EEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTE 306
             V +  E +L  AV+ QPV+++I A    FQ YK+G++NG CGT L+H VT+VG+G   
Sbjct: 267 RRVATRSEASLANAVAGQPVAVSIEAGGDNFQHYKKGVYNGPCGTNLNHGVTVVGYGQEA 326

Query: 307 DGAN-YWLIKNSWGNTWGDAGYMKIVRD-----EGLCGIGTRSSYPL 347
            G + YW++KNSWG  WGD GY+++ +D     EGLCGI  R SYPL
Sbjct: 327 AGGDRYWIVKNSWGQGWGDDGYIRMKKDVAGKPEGLCGIAIRPSYPL 373


>gi|390337642|ref|XP_780653.3| PREDICTED: cathepsin L-like [Strongylocentrotus purpuratus]
          Length = 333

 Score =  266 bits (681), Expect = 9e-69,   Method: Compositional matrix adjust.
 Identities = 144/336 (42%), Positives = 208/336 (61%), Gaps = 13/336 (3%)

Query: 19  FIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYI 78
           ++ + L+ +C   VVSS S       E   +W  +HG+ Y  + E+  R  I+++NL+ +
Sbjct: 3   YLSVLLVAAC---VVSSLSMSFTDFDEDWNEWKNEHGKRYLSDEEEASRRLIWQKNLDIV 59

Query: 79  EKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTD 135
            K N +   G+ TY LG NQF+DL N+EF A+ TG+++   S ++   STF   N ++ +
Sbjct: 60  IKHNLKYDLGHFTYDLGINQFTDLQNEEFVAMMTGFRVSGTS-KAAKGSTFLPPN-NVGE 117

Query: 136 VPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG 195
           +P ++DWR KG VTP+K+Q +CG CWAF+   +VEG     +G L+ LSEQ L+DCS   
Sbjct: 118 LPKTVDWRTKGYVTPVKDQGQCGSCWAFSTTGSVEGQHFKATGKLVSLSEQNLVDCSGR- 176

Query: 196 NNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQ 255
           + GC GG  ++AF YII   GI TE  YPY+AV G C   +    A ++ Y +V SG E+
Sbjct: 177 DAGCDGGFMDRAFQYIIDAGGIDTEASYPYKAVDGKCHFKKANVGATVTGYTDVTSGSEK 236

Query: 256 ALLKAVS-MQPVSIAIAAYSTEFQSYKEGIFN--GVCGTQLDHAVTIVGFGTTEDGANYW 312
           AL KAV+ + P+S+AI A    FQ YK G++N  G   T LDH V  VG+GT+ DG +YW
Sbjct: 237 ALQKAVAHVGPISVAIDASHMSFQHYKSGVYNEPGCDSTVLDHGVLAVGYGTSSDGTDYW 296

Query: 313 LIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYPL 347
           ++KNSW  TWG  GY+ + R+ +  CGI T +SYPL
Sbjct: 297 IVKNSWAETWGMNGYVWMSRNKDNQCGIATNASYPL 332


>gi|218202077|gb|EEC84504.1| hypothetical protein OsI_31195 [Oryza sativa Indica Group]
          Length = 362

 Score =  266 bits (680), Expect = 1e-68,   Method: Compositional matrix adjust.
 Identities = 144/361 (39%), Positives = 210/361 (58%), Gaps = 31/361 (8%)

Query: 11  FKINTTPMFIIITLLVSC----ASQVVSSRSTH-------EQSVVEIHEKWMAQHGRSYK 59
           F    +P  + + LL SC    A+ ++ +R+T        +  +++    W   H RSY 
Sbjct: 4   FMACASPPVLTLALLASCGALLATSMLPARATAGSCLDVGDMVMMDRFRAWQGAHNRSYP 63

Query: 60  DELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKM---PSP 116
              E   R  +++ N E+I+  N  G+ TY+L  N+F+DLT +EF A YTGY     P  
Sbjct: 64  SAEEALQRFDVYRRNAEFIDAVNLRGDLTYRLAENEFADLTEEEFLATYTGYYAGDGPVD 123

Query: 117 SHRSTT-----SSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKE-CGCCWAFAAVAAVE 170
               TT      ++F Y+     DVP S+DWR +GAV P K+Q   C  CWAF   A +E
Sbjct: 124 DSVITTGAGDVDASFSYR----VDVPASVDWRAQGAVVPPKSQTSTCSSCWAFVTAATIE 179

Query: 171 GITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPG 230
            +  I++G L+ LSEQQL+DC +  + GC  GS  +A+ ++++N G+ TE +YPY A  G
Sbjct: 180 SLNMIKTGKLVSLSEQQLVDCDSY-DGGCNLGSYGRAYKWVVENGGLTTEADYPYTARRG 238

Query: 231 TCSAAQKP-AAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVC 289
            C+ A+    AAKI+ + +VP  +E AL  AV+ QPV++AI    +  Q YK G++ G C
Sbjct: 239 PCNRAKSAHHAAKITGFGKVPPRNEAALQAAVARQPVAVAIEV-GSGMQFYKGGVYTGPC 297

Query: 290 GTQLDHAVTIVGFGT-TEDGANYWLIKNSWGNTWGDAGYMKIVRD---EGLCGIGTRSSY 345
           GT+L HAVT+VG+GT    GA YW IKNSWG +WG+ GY++I+RD    GLCG+    +Y
Sbjct: 298 GTRLAHAVTVVGYGTDASSGAKYWTIKNSWGQSWGERGYIRILRDVGGPGLCGVTLDIAY 357

Query: 346 P 346
           P
Sbjct: 358 P 358


>gi|356517398|ref|XP_003527374.1| PREDICTED: LOW QUALITY PROTEIN: vignain-like [Glycine max]
          Length = 333

 Score =  266 bits (680), Expect = 1e-68,   Method: Compositional matrix adjust.
 Identities = 152/341 (44%), Positives = 212/341 (62%), Gaps = 30/341 (8%)

Query: 19  FIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYI 78
           F ++  +   A QV + R+  + S+ E HE+ M ++ + YKD  E       F  N+ YI
Sbjct: 12  FAMLLCMAFLAFQV-TCRTLQDASMYERHEQRMTRYSKVYKDPPES------FXGNVNYI 64

Query: 79  EKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPT 138
           E  N   ++ YK G NQF        R  + G+ M S   R TT   FK++N++ T  P+
Sbjct: 65  EACNNAADKPYKXGINQFPP------RNRFKGH-MCSSIIRITT---FKFENVTAT--PS 112

Query: 139 SLDWRDKGAVTP--IKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLS-EQQLLDCSTNG 195
           ++D R KGAVTP  +K+Q +CGC WA +AVAA EGI  + +G LI LS E +L+DC T G
Sbjct: 113 TVDCRQKGAVTPYTVKDQGQCGCFWALSAVAATEGIHALXAGKLILLSXEPELVDCDTKG 172

Query: 196 -NNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSA--AQKPAAAKISNYEEVPSG 252
            + GC GG  + AF +IIQN G+ TE  YPY+ V G C+A  A K AA  I+ Y++VP+ 
Sbjct: 173 VDQGCEGGLTDDAFKFIIQNHGLNTEANYPYKGVDGKCNANEADKNAATIITGYDDVPAN 232

Query: 253 DEQALL-KAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANY 311
           +E+A L KAV+  PVS+AI A  ++FQ YK G+F G CGT+LDH VT VG+G ++DG  Y
Sbjct: 233 NEKAHLQKAVANNPVSVAIDASGSDFQFYKSGVFTGSCGTELDHGVTAVGYGVSDDGTEY 292

Query: 312 WLIKNSWGNTWGDAGYMKIVR----DEGLCGIGTRSSYPLA 348
           WL+KNS G  WG+ GY+++ R    +E LCGI  ++SYP A
Sbjct: 293 WLVKNSRGPEWGEEGYIRMQRGVDSEEALCGIAVQASYPSA 333


>gi|297818854|ref|XP_002877310.1| hypothetical protein ARALYDRAFT_484828 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297323148|gb|EFH53569.1| hypothetical protein ARALYDRAFT_484828 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 376

 Score =  266 bits (679), Expect = 1e-68,   Method: Compositional matrix adjust.
 Identities = 144/331 (43%), Positives = 201/331 (60%), Gaps = 23/331 (6%)

Query: 32  VVSSRSTH--EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTY 89
           VV++  +H  E  V  I+E+W+ +HG++Y    EKE R KIFK+NL++IE+ N + NR+Y
Sbjct: 24  VVTATESHRNEAEVRTIYERWLVEHGKNYNGLGEKERRFKIFKDNLKHIEEHNSDPNRSY 83

Query: 90  KLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVT 149
             G NQFSDLT DEF+A Y G K+     +S +    +YQ      +P  +DWR++GAV 
Sbjct: 84  DRGLNQFSDLTVDEFQASYLGGKI---EKKSLSDVAERYQYKEGDILPDEVDWRERGAVV 140

Query: 150 P-IKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNN-GCLGGSREKA 207
           P +K Q +CG CWAFAA  AVEGI +I +G L+ LSEQ+L+DC    +N GC GG    A
Sbjct: 141 PRVKRQGDCGSCWAFAATGAVEGINQITTGELLSLSEQELIDCDRGKDNFGCAGGGAVWA 200

Query: 208 FAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAK------ISNYEEVPSGDEQALLKAV 261
           F +I +N GI T+++Y Y    G  +AA K    K      I+ +E VP  DE +L KAV
Sbjct: 201 FEFIKENGGIVTDEDYGYT---GDDTAACKAIEMKTTRVVTINGHEVVPVNDEMSLKKAV 257

Query: 262 SMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQL-DHAVTIVGFGTTEDGANYWLIKNSWGN 320
           S QP+S+ I+A       YK G++ G C     DH V IVG+GT+ D  +YWLI+NSWG 
Sbjct: 258 SYQPISVMISA--ANMSDYKSGVYKGPCSNLWGDHNVLIVGYGTSSDEGDYWLIRNSWGP 315

Query: 321 TWGDAGYMKIVRD----EGLCGIGTRSSYPL 347
            WG+ GY+++ R+     G C +     YP+
Sbjct: 316 GWGEGGYLRLQRNFNEPTGKCAVAVAPVYPI 346


>gi|194757786|ref|XP_001961143.1| GF13722 [Drosophila ananassae]
 gi|190622441|gb|EDV37965.1| GF13722 [Drosophila ananassae]
          Length = 417

 Score =  266 bits (679), Expect = 2e-68,   Method: Compositional matrix adjust.
 Identities = 137/306 (44%), Positives = 188/306 (61%), Gaps = 11/306 (3%)

Query: 53  QHGRSYKDELEKEMRLKIFKENLEYIEKANK---EGNRTYKLGTNQFSDLTNDEFRALYT 109
           +H ++Y DE E+  RLKIF EN   I K N+    G  +YKL  N+++D+ + EFR L  
Sbjct: 111 EHRKNYLDETEERFRLKIFNENKHKIAKHNQLWASGKVSYKLAVNKYADMLHHEFRQLMN 170

Query: 110 GYKMPSPSHRSTTSSTFK---YQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAV 166
           G+             +FK   + +     +P S+DWRDKGAVT +K+Q  CG CWAF++ 
Sbjct: 171 GFNYTLHKELRAADESFKGVTFISPEHVTLPKSVDWRDKGAVTGVKDQGHCGSCWAFSST 230

Query: 167 AAVEGITKIRSGNLIQLSEQQLLDCSTN-GNNGCLGGSREKAFAYIIQNQGIATEDEYPY 225
            A+EG    +SG L+ LSEQ L+DCST  GNNGC GG  + AF YI  N GI TE  YPY
Sbjct: 231 GALEGQHYRKSGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKDNGGIDTEKSYPY 290

Query: 226 QAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAV-SMQPVSIAIAAYSTEFQSYKEGI 284
           +A+  +C   +    A    + ++P G+E+ L +AV ++ PVS+AI A    FQ Y EG+
Sbjct: 291 EALDDSCHFNKGTIGATDRGFVDIPQGNEKKLAEAVATIGPVSVAIDASHESFQFYSEGV 350

Query: 285 F-NGVCGTQ-LDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD-EGLCGIGT 341
           +    C  Q LDH V +VGFGT E G +YWL+KNSWG TWGD G++K++R+ +  CGI +
Sbjct: 351 YVEPACDAQNLDHGVLVVGFGTDESGQDYWLVKNSWGTTWGDKGFIKMLRNKDNQCGIAS 410

Query: 342 RSSYPL 347
            SSYPL
Sbjct: 411 ASSYPL 416


>gi|125525815|gb|EAY73929.1| hypothetical protein OsI_01813 [Oryza sativa Indica Group]
          Length = 336

 Score =  266 bits (679), Expect = 2e-68,   Method: Compositional matrix adjust.
 Identities = 140/340 (41%), Positives = 195/340 (57%), Gaps = 22/340 (6%)

Query: 18  MFIIITLLV--SCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENL 75
           + ++ TL+   + A+    +  + +   +++ E+WMA+ G++YK   EKE R  IF++N+
Sbjct: 6   LLVVCTLMALQAMAASAYYNNGSDDGVTMQMFEEWMAKFGKTYKCHGEKEHRFGIFRDNV 65

Query: 76  EYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTD 135
            +I     +      +G NQF+DLTNDEF A YTG K P P            + +    
Sbjct: 66  HFIRGYKPQVTYDSAVGINQFADLTNDEFVATYTGAKPPHPKEAP--------RPVDPIW 117

Query: 136 VPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG 195
            P  +DWR +GAVT +K+Q  CG CWAFAAVAA+EG+TKIR+G L  LSEQ+L+DC TN 
Sbjct: 118 TPCCIDWRFRGAVTGVKDQGACGSCWAFAAVAAIEGLTKIRTGQLTPLSEQELVDCDTN- 176

Query: 196 NNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQK--PAAAKISNYEEVPSGD 253
           +NGC GG  ++AF  +    GI  E +Y Y+   G C         AA I  Y  VP  D
Sbjct: 177 SNGCGGGHTDRAFELVASKGGITAESDYRYEGFQGKCRVDDMLFNHAASIGGYRAVPPND 236

Query: 254 EQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGAN--- 310
           E+ L  AV+ QPV++ I A    FQ YK G+F G CG   +HAVT+VG+   +DGA+   
Sbjct: 237 ERQLATAVARQPVTVYIDASGPAFQFYKSGVFPGPCGASSNHAVTLVGY--CQDGASGKK 294

Query: 311 YWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYP 346
           YW+ KNSWG TWG  GY+ + +D     G CG+     YP
Sbjct: 295 YWVAKNSWGKTWGQQGYILLEKDVLQPHGTCGLAVSPFYP 334


>gi|115478933|ref|NP_001063060.1| Os09g0381400 [Oryza sativa Japonica Group]
 gi|113631293|dbj|BAF24974.1| Os09g0381400 [Oryza sativa Japonica Group]
 gi|215678649|dbj|BAG92304.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|218202075|gb|EEC84502.1| hypothetical protein OsI_31193 [Oryza sativa Indica Group]
          Length = 362

 Score =  266 bits (679), Expect = 2e-68,   Method: Compositional matrix adjust.
 Identities = 144/361 (39%), Positives = 210/361 (58%), Gaps = 31/361 (8%)

Query: 11  FKINTTPMFIIITLLVSC----ASQVVSSRSTH-------EQSVVEIHEKWMAQHGRSYK 59
           F    +P  + + LL SC    A+ ++ +R+T        +  +++    W   H RSY 
Sbjct: 4   FMACASPPVLTLALLASCGALLATSMLPARATAGSCLDVGDMVMMDRFRAWQGAHNRSYP 63

Query: 60  DELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKM---PSP 116
              E   R  +++ N E+I+  N  G+ TY+L  N+F+DLT +EF A YTGY     P  
Sbjct: 64  SAEEALQRFDVYRRNAEFIDAVNLRGDLTYQLAENEFADLTEEEFLATYTGYYAGDGPVD 123

Query: 117 SHRSTT-----SSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKE-CGCCWAFAAVAAVE 170
               TT      ++F Y+     DVP S+DWR +GAV P K+Q   C  CWAF   A +E
Sbjct: 124 DSVITTGAGDVDASFSYR----VDVPASVDWRAQGAVVPPKSQTSTCSSCWAFVTAATIE 179

Query: 171 GITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPG 230
            +  I++G L+ LSEQQL+DC +  + GC  GS  +A+ ++++N G+ TE +YPY A  G
Sbjct: 180 SLNMIKTGKLVSLSEQQLVDCDSY-DGGCNLGSYGRAYKWVVENGGLTTEADYPYTARRG 238

Query: 231 TCSAAQKP-AAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVC 289
            C+ A+    AAKI+ + +VP  +E AL  AV+ QPV++AI    +  Q YK G++ G C
Sbjct: 239 PCNRAKSAHHAAKITGFGKVPPRNEAALQAAVARQPVAVAIEV-GSGMQFYKGGVYTGPC 297

Query: 290 GTQLDHAVTIVGFGT-TEDGANYWLIKNSWGNTWGDAGYMKIVRD---EGLCGIGTRSSY 345
           GT+L HAVT+VG+GT    GA YW IKNSWG +WG+ GY++I+RD    GLCG+    +Y
Sbjct: 298 GTRLAHAVTVVGYGTDASSGAKYWTIKNSWGQSWGERGYIRILRDVGGPGLCGVTLDIAY 357

Query: 346 P 346
           P
Sbjct: 358 P 358


>gi|301116794|ref|XP_002906125.1| cysteine protease family C01A, putative [Phytophthora infestans
           T30-4]
 gi|262107474|gb|EEY65526.1| cysteine protease family C01A, putative [Phytophthora infestans
           T30-4]
          Length = 535

 Score =  266 bits (679), Expect = 2e-68,   Method: Compositional matrix adjust.
 Identities = 141/305 (46%), Positives = 187/305 (61%), Gaps = 11/305 (3%)

Query: 50  WMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRT-YKLGTNQFSDLTNDEFRALY 108
           WM  H  S+ D LE   RL+ +  N  YI + N E   T  KL  N+FS ++ +EF+   
Sbjct: 32  WMKTHSVSFSDALEFAKRLENYIANDMYIMEHNLENAWTGVKLDHNEFSSMSFEEFKFKM 91

Query: 109 TGYKMPSPSHRSTTSSTFKYQNL-SMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVA 167
           TGY MP        +S  +  NL S   VP S+DW+DKG VTP+KNQ  CG CWAF+   
Sbjct: 92  TGYVMPEGYLEQRLAS--RVDNLWSDVQVPDSVDWQDKGGVTPVKNQGMCGSCWAFSTTG 149

Query: 168 AVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQA 227
           AVEG   + SG L+ LSEQ+L+DC  NG+ GC GG  + AFA+I  N GI +ED+Y Y+A
Sbjct: 150 AVEGAAFVSSGKLVSLSEQELVDCDHNGDMGCNGGLMDHAFAWIEDNGGICSEDDYEYKA 209

Query: 228 VPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNG 287
               C   +K    KIS +++V   DE AL  AV+ QPVS+AI A    FQ YK G+FN 
Sbjct: 210 KAQVCRDCEK--VVKISGFQDVNPQDEHALKVAVAQQPVSVAIEADQKAFQFYKSGVFNL 267

Query: 288 VCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRDE----GLCGIGTRS 343
            CGT+LDH V  VG+G +E+G  +W +KNSWG++WG+ GY+++ R+E    G CGI +  
Sbjct: 268 TCGTRLDHGVLAVGYG-SENGQKFWKVKNSWGSSWGEKGYIRLAREENGPAGQCGIASVP 326

Query: 344 SYPLA 348
           SYP A
Sbjct: 327 SYPFA 331


>gi|66270077|gb|AAY43368.1| cysteine protease [Phytophthora infestans]
          Length = 510

 Score =  265 bits (678), Expect = 2e-68,   Method: Compositional matrix adjust.
 Identities = 141/305 (46%), Positives = 187/305 (61%), Gaps = 11/305 (3%)

Query: 50  WMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRT-YKLGTNQFSDLTNDEFRALY 108
           WM  H  S+ D LE   RL+ +  N  YI + N E   T  KL  N+FS ++ +EF+   
Sbjct: 32  WMKTHSVSFSDALEFAKRLENYIANDMYIMEHNLENAWTGVKLDHNEFSSMSFEEFKFKM 91

Query: 109 TGYKMPSPSHRSTTSSTFKYQNL-SMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVA 167
           TGY MP        +S  +  NL S   VP S+DW+DKG VTP+KNQ  CG CWAF+   
Sbjct: 92  TGYVMPEGYLEQRLAS--RVDNLWSDVQVPDSVDWQDKGGVTPVKNQGMCGSCWAFSTTG 149

Query: 168 AVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQA 227
           AVEG   + SG L+ LSEQ+L+DC  NG+ GC GG  + AFA+I  N GI +ED+Y Y+A
Sbjct: 150 AVEGAAFVSSGKLVSLSEQELVDCDHNGDMGCNGGLMDHAFAWIEDNGGICSEDDYEYKA 209

Query: 228 VPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNG 287
               C   +K    KIS +++V   DE AL  AV+ QPVS+AI A    FQ YK G+FN 
Sbjct: 210 KAQVCRDCEK--VVKISGFQDVNPQDEHALKVAVAQQPVSVAIEADQKAFQFYKSGVFNL 267

Query: 288 VCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRDE----GLCGIGTRS 343
            CGT+LDH V  VG+G +E+G  +W +KNSWG++WG+ GY+++ R+E    G CGI +  
Sbjct: 268 TCGTRLDHGVLAVGYG-SENGQKFWKVKNSWGSSWGEKGYIRLAREENGPAGQCGIASVP 326

Query: 344 SYPLA 348
           SYP A
Sbjct: 327 SYPFA 331


>gi|194352758|emb|CAQ00107.1| papain-like cysteine proteinase [Hordeum vulgare subsp. vulgare]
          Length = 457

 Score =  265 bits (678), Expect = 2e-68,   Method: Compositional matrix adjust.
 Identities = 134/309 (43%), Positives = 184/309 (59%), Gaps = 12/309 (3%)

Query: 48  EKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANK------EGNRTYKLGTNQFSDLTN 101
           E W A+HG++Y    E+  RL  F EN  ++   N        G  +Y L  N F+DLT+
Sbjct: 40  EAWCAEHGKAYATPGERAARLAAFAENAAFVAAHNDAVASSGPGGPSYTLALNAFADLTH 99

Query: 102 DEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCW 161
           DEFRA   G     P      S +       +  VP +LDWR  GAVT +K+Q  CG CW
Sbjct: 100 DEFRAARLGRLAVGPGPLGAPSPSDGGFEGRVGAVPDALDWRQSGAVTKVKDQGSCGACW 159

Query: 162 AFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATED 221
           +F+A  A+EGI KI +G+L+ LSEQ+L+DC  + N GC GG    A+ ++I+N GI TED
Sbjct: 160 SFSATGAMEGINKITTGSLLSLSEQELIDCDRSYNTGCGGGLMTYAYKFVIKNGGIDTED 219

Query: 222 EYPYQAVPGTCSAAQ-KPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSY 280
           +YP++   GTC+  + K     I  Y+EVPS  E  LL+AV+ QP+S+ I   +  FQ Y
Sbjct: 220 DYPFREADGTCNKNKLKKHVVTIDGYKEVPSSKEDLLLQAVAQQPISVGICGSARAFQLY 279

Query: 281 KEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGL 336
            +GIF+G C T LDHAV IVG+G +E G +YW++KNSWG  WG  GYM + R+     G+
Sbjct: 280 SQGIFDGPCPTSLDHAVLIVGYG-SEGGKDYWIVKNSWGERWGMKGYMHMHRNTGSSSGI 338

Query: 337 CGIGTRSSY 345
           CGI   +S+
Sbjct: 339 CGINMMASF 347


>gi|194352760|emb|CAQ00108.1| papain-like cysteine proteinase [Hordeum vulgare subsp. vulgare]
 gi|326510977|dbj|BAJ91836.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326523875|dbj|BAJ96948.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326528631|dbj|BAJ97337.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 368

 Score =  265 bits (678), Expect = 2e-68,   Method: Compositional matrix adjust.
 Identities = 137/327 (41%), Positives = 195/327 (59%), Gaps = 29/327 (8%)

Query: 48  EKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNR--TYKLGTNQFSDLTNDEFR 105
            +W A+H R+Y    E+  RL+++  N+ YIE  N +     TY+LG   ++DLT+DEF 
Sbjct: 43  RRWKAEHSRTYATPEEERHRLRVYARNMRYIEATNGDAGAGLTYELGETAYTDLTSDEFT 102

Query: 106 ALYTGYKMPS-------PSHRSTTSSTFK-----------YQNLSMTDVPTSLDWRDKGA 147
           A+YT    P        P    TT +              Y N S    P S+DWR++GA
Sbjct: 103 AMYTSRAPPLSDDDDDLPMTMITTRAGPVAAAGGGGWLQVYVNES-AGAPASVDWRERGA 161

Query: 148 VTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKA 207
           VT +KNQ +CG CWAF+ VA +EGI +I++G L  LSEQ+L+DC    ++GC GG   +A
Sbjct: 162 VTAVKNQGQCGSCWAFSTVAVIEGIHQIKTGKLASLSEQELVDCD-KLDHGCNGGVSYRA 220

Query: 208 FAYIIQNQGIATEDEYPYQAVPGTCSAAQ-KPAAAKISNYEEVPSGDEQALLKAVSMQPV 266
             +I  N GI ++D+YPY A   TC   +    AA IS ++ V +  E +L  AV+MQPV
Sbjct: 221 LQWITSNGGITSQDDYPYTAKDDTCDTKKLSHHAASISGFQRVATRSELSLTNAVAMQPV 280

Query: 267 SIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTE-DGANYWLIKNSWGNTWGDA 325
           +++I A    FQ Y+ G++NG CGT+L+H VT+VG+G  E  G +YW++KNSWG  WGD 
Sbjct: 281 AVSIEAGGANFQHYRNGVYNGPCGTRLNHGVTVVGYGEDEVTGESYWIVKNSWGEKWGDN 340

Query: 326 GYMK-----IVRDEGLCGIGTRSSYPL 347
           GY++     I + EG+CGI  R S+PL
Sbjct: 341 GYLRMKKGIIDKPEGICGIAIRPSFPL 367


>gi|15290195|dbj|BAB63884.1| putative cysteine protease [Oryza sativa Japonica Group]
 gi|125525813|gb|EAY73927.1| hypothetical protein OsI_01811 [Oryza sativa Indica Group]
          Length = 342

 Score =  265 bits (678), Expect = 2e-68,   Method: Compositional matrix adjust.
 Identities = 139/340 (40%), Positives = 195/340 (57%), Gaps = 22/340 (6%)

Query: 18  MFIIITLLV--SCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENL 75
           + ++ TL+   +  +    +  + +   +++ E+WMA+ G++YK   EKE R  IF++N+
Sbjct: 12  LLVVCTLMALQAMGADAYYNNGSDDGVTMQMFEEWMAKFGKTYKCHGEKEHRFGIFRDNV 71

Query: 76  EYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTD 135
            +I     +      +G NQF+DLTNDEF A YTG K P P            + +    
Sbjct: 72  HFIRGYKPQVTYDSAVGINQFADLTNDEFVATYTGAKPPHPKEAP--------RPVDPIW 123

Query: 136 VPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG 195
            P  +DWR +GAVT +K+Q  CG CWAFAAVAA+EG+TKIR+G L  LSEQ+L+DC TN 
Sbjct: 124 TPCCIDWRFRGAVTGVKDQGACGSCWAFAAVAAIEGLTKIRTGQLTPLSEQELVDCDTN- 182

Query: 196 NNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQK--PAAAKISNYEEVPSGD 253
           +NGC GG  ++AF  +    GI  E +Y Y+   G C         AA+I  Y  VP  D
Sbjct: 183 SNGCGGGHTDRAFELVASKGGITAESDYRYEGFQGKCRVDDMLFNHAARIGGYRAVPPND 242

Query: 254 EQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGAN--- 310
           E+ L  AV+ QPV++ I A    FQ YK G+F G CG   +HAVT+VG+   +DGA+   
Sbjct: 243 ERQLATAVARQPVTVYIDASGPAFQFYKSGVFPGPCGASSNHAVTLVGY--CQDGASGKK 300

Query: 311 YWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYP 346
           YW+ KNSWG TWG  GY+ + +D     G CG+     YP
Sbjct: 301 YWVAKNSWGKTWGQQGYILLEKDVLQPHGTCGLAVSPFYP 340


>gi|159479072|ref|XP_001697622.1| cysteine endopeptidase [Chlamydomonas reinhardtii]
 gi|158274232|gb|EDP00016.1| cysteine endopeptidase [Chlamydomonas reinhardtii]
          Length = 469

 Score =  265 bits (676), Expect = 3e-68,   Method: Compositional matrix adjust.
 Identities = 133/309 (43%), Positives = 193/309 (62%), Gaps = 10/309 (3%)

Query: 48  EKWMAQHGRSY-KDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRA 106
           ++W   H RSY  D  E E R K++ ENLEY+   N     ++ L  N  +DL+  E+++
Sbjct: 14  KEWAQTHSRSYVNDVAEFENRFKVWLENLEYVLAYNAR-TTSHWLTLNHLADLSTPEYKS 72

Query: 107 LYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAV 166
              G+   +   R+   + F+Y+++    +P ++DWR K AV  +KNQ +CG CWAFA  
Sbjct: 73  KLLGFDNQARVARNKLKTGFRYEDVDAEALPPAIDWRKKNAVAEVKNQGQCGSCWAFATT 132

Query: 167 AAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQ 226
            +VEGI  I +G+L+ LSEQ+L+DC T  + GC GG  + A+A+II+N+GI TE++YPY 
Sbjct: 133 GSVEGINAIVTGSLVSLSEQELVDCDTEQDKGCSGGLMDYAYAWIIKNKGINTEEDYPYT 192

Query: 227 AVPGTCSAAQ-KPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIF 285
           A+ G C  A+ K     I +YE+VP  DE AL KA + QPV++AI A +  FQ Y  G++
Sbjct: 193 AMDGQCDVAKMKRRVVTIDSYEDVPENDEVALKKAAAHQPVAVAIEADAKSFQLYGGGVY 252

Query: 286 NG-VCGTQLDHAVTIVGFG--TTEDGANYWLIKNSWGNTWGDAGYMKI----VRDEGLCG 338
           +   CGT L+H V +VG+G   T  G+NYW++KNSWG  WGDAGY+++       EGLCG
Sbjct: 253 DDPTCGTSLNHGVLVVGYGKDVTGSGSNYWIVKNSWGAEWGDAGYIRLKMGSTDAEGLCG 312

Query: 339 IGTRSSYPL 347
           I    SYP+
Sbjct: 313 IAMAPSYPV 321


>gi|16304178|gb|AAL16954.1|AF426414_1 cathepsin L-like cysteine protease precursor [Delia radicum]
          Length = 337

 Score =  265 bits (676), Expect = 3e-68,   Method: Compositional matrix adjust.
 Identities = 139/313 (44%), Positives = 192/313 (61%), Gaps = 12/313 (3%)

Query: 46  IHEKWMA---QHGRSYKDELEKEMRLKIFKENLEYIEKANK---EGNRTYKLGTNQFSDL 99
           I E+W     +H +++  E+E+  R+KIF EN   I K N+   +G  ++KLG N++SD+
Sbjct: 23  IKEEWQTFKMEHRKNFLSEVEERFRMKIFNENRHKIAKHNQLYAQGKVSFKLGLNKYSDM 82

Query: 100 TNDEFRALYTGYKMP-SPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECG 158
              EF+    GY        R+   S   Y   +   +P S+DWR  GAVT +K+Q  CG
Sbjct: 83  LYHEFKETMNGYNHTMRKVLRAQGFSGIIYIPPANVQIPKSVDWRQHGAVTAVKDQGHCG 142

Query: 159 CCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GNNGCLGGSREKAFAYIIQNQGI 217
            CWAF++ AA+EG    ++G L+ LSEQ L+DCST  GNNGC GG  + AF YI  N GI
Sbjct: 143 SCWAFSSTAALEGQHFRKAGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKDNGGI 202

Query: 218 ATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAV-SMQPVSIAIAAYSTE 276
            TE  YPY+ +  +C   +    A  + + ++P GDE+AL+KAV +M PVS+AI A    
Sbjct: 203 DTEKSYPYEGIDDSCHFTKSGVGATDTGFVDIPQGDEEALMKAVATMGPVSVAIDASHES 262

Query: 277 FQSYKEGIFNGV-CGTQ-LDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD- 333
           FQ Y EG++N   C  Q LDH V +VG+GT + G +YWL+KNSWG TWGD GY+K+ R+ 
Sbjct: 263 FQLYSEGVYNEPECDAQNLDHGVLVVGYGTDKTGLDYWLVKNSWGTTWGDQGYIKMARNQ 322

Query: 334 EGLCGIGTRSSYP 346
           +  CGI T SSYP
Sbjct: 323 DNQCGIATASSYP 335


>gi|359359068|gb|AEV40975.1| putative cysteine protease [Oryza punctata]
          Length = 464

 Score =  265 bits (676), Expect = 3e-68,   Method: Compositional matrix adjust.
 Identities = 144/322 (44%), Positives = 207/322 (64%), Gaps = 18/322 (5%)

Query: 40  EQSVVEIHEKWMAQH---GRSYKDEL-EKEMRLKIFKENLEYIE--KANKEGNRTYKLGT 93
           E     +++ W+A+H   G S+   + E E R ++F +NL++++   A+ +G+  ++LG 
Sbjct: 59  EAEARAVYDLWVARHRHGGGSHNGFVGEYERRFRVFWDNLKFVDAHNAHADGHGGFRLGM 118

Query: 94  NQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAV-TPIK 152
           N+F+DLTNDEFRA Y G    +P+ R        Y++  +  +P S+DWRDKGAV +P+K
Sbjct: 119 NRFADLTNDEFRAAYLG---TTPAGRGRHVGEM-YRHDGVEALPDSVDWRDKGAVVSPVK 174

Query: 153 NQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGS-REKAFAYI 211
           NQ +CG CWAF+AVAAVEGI KI +G L+ LSEQ+L++C+ NG N    G   + AFA+I
Sbjct: 175 NQGQCGSCWAFSAVAAVEGINKIVTGELVSLSEQELVECARNGGNSGCNGGIMDDAFAFI 234

Query: 212 IQNQGIATEDEYPYQAVPGTCSAAQKP-AAAKISNYEEVPSGDEQALLKAVSMQPVSIAI 270
            +N G+ TE++YPY A+ G C  A+K      I  +E+VP  DE +L KAV+ QPVS+AI
Sbjct: 235 TRNGGLDTEEDYPYTAMDGKCDLAKKSRKVVSIDGFEDVPENDELSLQKAVAHQPVSVAI 294

Query: 271 AAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGT-TEDGANYWLIKNSWGNTWGDAGYMK 329
            A   EFQ Y  G+F G CGT LDH V  VG+GT    G +YW ++NSWG  WG+ GY++
Sbjct: 295 DAGGREFQLYDSGVFTGRCGTSLDHGVVAVGYGTDAATGTDYWTVRNSWGPDWGENGYIR 354

Query: 330 IVRD----EGLCGIGTRSSYPL 347
           + R+     G CGI   +SYP+
Sbjct: 355 MERNVTARTGKCGIAMMASYPI 376


>gi|125570286|gb|EAZ11801.1| hypothetical protein OsJ_01675 [Oryza sativa Japonica Group]
          Length = 319

 Score =  265 bits (676), Expect = 4e-68,   Method: Compositional matrix adjust.
 Identities = 140/327 (42%), Positives = 188/327 (57%), Gaps = 20/327 (6%)

Query: 29  ASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRT 88
           A+    +  + +   +++ E+WMA+ G++YK   EKE R  IF++N+ +I     +    
Sbjct: 2   AASAYYNNGSDDGVTMQMFEEWMAKFGKTYKCHGEKEHRFGIFRDNVHFIRGYKPQVTYD 61

Query: 89  YKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAV 148
             +G NQF+DLTNDEF A YTG K P P            + +     P  +DWR +GAV
Sbjct: 62  SAVGINQFADLTNDEFVATYTGAKPPHPKEAP--------RPVDPIWTPCCIDWRFRGAV 113

Query: 149 TPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAF 208
           T +K+Q  CG CWAFAAVAA+EG+TKIR+G L  LSEQ+L+DC TN +NGC GG  ++AF
Sbjct: 114 TGVKDQGACGSCWAFAAVAAIEGLTKIRTGQLTPLSEQELVDCDTN-SNGCGGGHTDRAF 172

Query: 209 AYIIQNQGIATEDEYPYQAVPGTCSAAQK--PAAAKISNYEEVPSGDEQALLKAVSMQPV 266
             +    GI  E +Y Y+   G C         AA I  Y  VP  DE+ L  AV+ QPV
Sbjct: 173 ELVASKGGITAESDYRYEGFQGKCRVDDMLFNHAASIGGYRAVPPNDERQLATAVARQPV 232

Query: 267 SIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGAN---YWLIKNSWGNTWG 323
           ++ I A    FQ YK G+F G CG   +HAVT+VG+   +DGA+   YWL KNSWG TWG
Sbjct: 233 TVYIDASGPAFQFYKSGVFPGPCGASSNHAVTLVGY--CQDGASGKKYWLAKNSWGKTWG 290

Query: 324 DAGYM----KIVRDEGLCGIGTRSSYP 346
             GY+     IV+  G CG+     YP
Sbjct: 291 QQGYILLEKDIVQPHGTCGLAVSPFYP 317


>gi|49387634|dbj|BAD25828.1| putative cysteine proteinase [Oryza sativa Japonica Group]
 gi|49388888|dbj|BAD26098.1| putative cysteine proteinase [Oryza sativa Japonica Group]
          Length = 358

 Score =  264 bits (675), Expect = 4e-68,   Method: Compositional matrix adjust.
 Identities = 143/356 (40%), Positives = 209/356 (58%), Gaps = 31/356 (8%)

Query: 16  TPMFIIITLLVSC----ASQVVSSRSTH-------EQSVVEIHEKWMAQHGRSYKDELEK 64
           +P  + + LL SC    A+ ++ +R+T        +  +++    W   H RSY    E 
Sbjct: 5   SPPVLTLALLASCGALLATSMLPARATAGSCLDVGDMVMMDRFRAWQGAHNRSYPSAEEA 64

Query: 65  EMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKM---PSPSHRST 121
             R  +++ N E+I+  N  G+ TY+L  N+F+DLT +EF A YTGY     P      T
Sbjct: 65  LQRFDVYRRNAEFIDAVNLRGDLTYQLAENEFADLTEEEFLATYTGYYAGDGPVDDSVIT 124

Query: 122 T-----SSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKE-CGCCWAFAAVAAVEGITKI 175
           T      ++F Y+     DVP S+DWR +GAV P K+Q   C  CWAF   A +E +  I
Sbjct: 125 TGAGDVDASFSYR----VDVPASVDWRAQGAVVPPKSQTSTCSSCWAFVTAATIESLNMI 180

Query: 176 RSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAA 235
           ++G L+ LSEQQL+DC +  + GC  GS  +A+ ++++N G+ TE +YPY A  G C+ A
Sbjct: 181 KTGKLVSLSEQQLVDCDSY-DGGCNLGSYGRAYKWVVENGGLTTEADYPYTARRGPCNRA 239

Query: 236 QKP-AAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLD 294
           +    AAKI+ + +VP  +E AL  AV+ QPV++AI    +  Q YK G++ G CGT+L 
Sbjct: 240 KSAHHAAKITGFGKVPPRNEAALQAAVARQPVAVAIEV-GSGMQFYKGGVYTGPCGTRLA 298

Query: 295 HAVTIVGFGT-TEDGANYWLIKNSWGNTWGDAGYMKIVRD---EGLCGIGTRSSYP 346
           HAVT+VG+GT    GA YW IKNSWG +WG+ GY++I+RD    GLCG+    +YP
Sbjct: 299 HAVTVVGYGTDASSGAKYWTIKNSWGQSWGERGYIRILRDVGGPGLCGVTLDIAYP 354


>gi|195429415|ref|XP_002062758.1| GK19626 [Drosophila willistoni]
 gi|194158843|gb|EDW73744.1| GK19626 [Drosophila willistoni]
          Length = 341

 Score =  264 bits (675), Expect = 4e-68,   Method: Compositional matrix adjust.
 Identities = 135/340 (39%), Positives = 203/340 (59%), Gaps = 13/340 (3%)

Query: 19  FIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYI 78
           F +ITLL++  +  ++   ++ + V E    +  +H ++Y D  E+  R+KIF EN  +I
Sbjct: 3   FALITLLIALVA--MTQAVSYSELVREEWNTFKLEHRKNYADSTEETFRMKIFNENKHHI 60

Query: 79  EKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFK---YQNLS 132
            K N+    G  +YKL  N+++D+ + EFR    G+         +T  +F    + +  
Sbjct: 61  AKHNQRYATGEVSYKLALNKYADMLHHEFRETMNGFNYTLHKQLRSTDESFTGVTFISPE 120

Query: 133 MTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS 192
              +PT++DWR KGAVT +K+Q  CG CWAF++  A+EG    +SG L+ LSEQ L+DCS
Sbjct: 121 HVKLPTAVDWRTKGAVTEVKDQGHCGSCWAFSSTGAIEGQHFRKSGTLVSLSEQNLVDCS 180

Query: 193 TN-GNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPS 251
           T  GNNGC GG  + AF Y+  N GI TE  Y Y+ +  +C   +    A    + ++P 
Sbjct: 181 TKYGNNGCNGGLMDNAFRYVKDNGGIDTEKSYAYEGIDDSCHFDKNSIGATDRGFADIPQ 240

Query: 252 GDEQALLKAV-SMQPVSIAIAAYSTEFQSYKEGIFN--GVCGTQLDHAVTIVGFGTTEDG 308
           G+E+ L +AV ++ PVS+AI A    FQ Y EG+++        LDH V +VG+GT +DG
Sbjct: 241 GNEKKLAQAVATIGPVSVAIDASQQSFQFYSEGVYDEPNCSAENLDHGVLVVGYGTEKDG 300

Query: 309 ANYWLIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYPL 347
           ++YWL+KNSWG TWGD G++K+ R+ E  CGI + SSYPL
Sbjct: 301 SDYWLVKNSWGTTWGDKGFIKMSRNKENQCGIASASSYPL 340


>gi|194352764|emb|CAQ00110.1| papain-like cysteine proteinase [Hordeum vulgare subsp. vulgare]
          Length = 406

 Score =  264 bits (675), Expect = 4e-68,   Method: Compositional matrix adjust.
 Identities = 147/347 (42%), Positives = 200/347 (57%), Gaps = 41/347 (11%)

Query: 39  HEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEG---NRTYKLGTNQ 95
           H+  +++    WM  H RSY    EK  R ++++ N+ +IE  N E      TY+LG   
Sbjct: 55  HQDLMMDRFHVWMTVHNRSYSTAGEKARRFEVYRSNMRFIEAVNAEAATSGLTYELGEGP 114

Query: 96  FSDLTNDEFRALYTGYKMP-------------------SPSHRSTTSSTFKYQNLSMTDV 136
           F+DLTN+EF  LYTG  +                    S     T      Y N S +  
Sbjct: 115 FTDLTNEEFMELYTGQILEDDQSEDGDDDEQIITTHAGSIDGLGTHKGATVYANFSAS-A 173

Query: 137 PTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGN 196
           PTS+DWR +G VTP+KNQK+CG CWAF  VA +EGI KI+ G L+ LSEQQL+DC    +
Sbjct: 174 PTSIDWRKRGVVTPVKNQKQCGSCWAFPTVATIEGIHKIKRGTLVSLSEQQLIDCDYL-D 232

Query: 197 NGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQA 256
           NGC GG   +AF +I +N GI +   Y Y+AV G C   +KP AAKI  + +V S  E +
Sbjct: 233 NGCKGGLVTRAFQWIKKNGGITSTSSYKYKAVRGRCLRNRKP-AAKIVGFRKVKSNSEVS 291

Query: 257 LLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCG-TQLDHAVTIVGFGTTED-------- 307
           L+ AV+ QPV+++I+++S+ F  YK GI+NG C  T+L+HAVT+VG+G  +         
Sbjct: 292 LMNAVANQPVAVSISSHSSHFHHYKGGIYNGPCSTTKLNHAVTVVGYGQQQQNGADSVHA 351

Query: 308 ---GANYWLIKNSWGNTWGDAGYMKIVR----DEGLCGIGTRSSYPL 347
              GA YW++KNSWG TWGD GY+ + R      G CGI TR  +PL
Sbjct: 352 SAPGAKYWIVKNSWGTTWGDKGYILMKRGTKHSSGQCGIATRPVFPL 398


>gi|302845628|ref|XP_002954352.1| hypothetical protein VOLCADRAFT_76255 [Volvox carteri f.
           nagariensis]
 gi|300260282|gb|EFJ44502.1| hypothetical protein VOLCADRAFT_76255 [Volvox carteri f.
           nagariensis]
          Length = 489

 Score =  264 bits (675), Expect = 4e-68,   Method: Compositional matrix adjust.
 Identities = 143/348 (41%), Positives = 206/348 (59%), Gaps = 22/348 (6%)

Query: 19  FIIITLLVSCASQVVSS-----RSTHEQSVVEIH-------EKWMAQHGRSYKDEL-EKE 65
           F+I  LLV+ +  V ++     R  HE+ +++         ++WM Q+ ++Y +++ E E
Sbjct: 5   FLIAALLVAASGGVGAAPELQLREQHEKLLLDAKANPMAAFQQWMMQYTKAYANDIKELE 64

Query: 66  MRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFR-ALYTGYKMPSPSHRSTTSS 124
            R  ++ ENL YI   N     ++ L  N F+DLT DEFR  L   +K    S+R   SS
Sbjct: 65  TRFSVWLENLNYILAYNAR-TTSHWLHLNAFADLTTDEFRNRLGYDFKARQASNR-LQSS 122

Query: 125 TFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLS 184
            F Y N+    +PT +DWR KGAVT +KNQ +CG CWAFA   +VEGI  I +G L  LS
Sbjct: 123 PFIYDNVDANQLPTEIDWRKKGAVTEVKNQGQCGSCWAFATTGSVEGINAIVTGELASLS 182

Query: 185 EQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQK-PAAAKI 243
           EQ+L+DC T+ + GC GG  + A+ +II+N G+ TED+YPY A  G C AA+K      I
Sbjct: 183 EQELVDCDTDEDRGCSGGLMDYAYQWIIKNGGLDTEDDYPYTAEDGVCVAAKKNRRVVTI 242

Query: 244 SNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNG-VCGTQLDHAVTIVGF 302
             Y ++P  DE AL KA + QP+++AI A +  FQ Y  G+++   CGT L+H V +VG+
Sbjct: 243 DGYVDIPENDEVALKKAAAHQPIAVAIEADAKSFQLYGGGVYDDPTCGTSLNHGVLVVGY 302

Query: 303 GTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYP 346
           G      NYW++KNSWG  WGD GY+++       +G+CGI    S+P
Sbjct: 303 GKDPHFGNYWIVKNSWGPEWGDNGYIRLRMGAEDVQGMCGIAMAPSFP 350


>gi|326520387|dbj|BAK07452.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 349

 Score =  264 bits (675), Expect = 4e-68,   Method: Compositional matrix adjust.
 Identities = 144/346 (41%), Positives = 201/346 (58%), Gaps = 18/346 (5%)

Query: 14  NTTPMFIIITLLVSCASQVVSSRS-----THEQSVVEIHEKWMAQHGRSYKDELEKEMRL 68
            T P+  I+ LL         + S       +  +++   +W A H RSY    E+  R 
Sbjct: 7   GTRPVIPILVLLTGGLFAAFPAASGGRVDAGDMLMMDRFRQWQATHNRSYLSAEERLRRF 66

Query: 69  KIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALY----TGYKMPSPSHRSTTSS 124
           ++++ N+EYI+  N+ G  TY+LG NQF+DLT +EF A Y    TG  + + +      S
Sbjct: 67  EVYRTNVEYIDATNRRGGLTYELGENQFADLTGEEFLARYAGGHTGSAITTAAEADGLWS 126

Query: 125 TFKYQNLSMTDVPTSLDWRDKGAVTPIKNQ-KECGCCWAFAAVAAVEGITKIRSGNLIQL 183
           +         D P S+DWR KGAVTP+KNQ  +C  CWAF+AVA +E +  I++G L+ L
Sbjct: 127 SGGSDGSLEADPPASVDWRAKGAVTPVKNQGSQCYSCWAFSAVATMESLYFIKTGKLVAL 186

Query: 184 SEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKI 243
           SEQQL+DC    + GC  G   +AF +I++N GI T  +YPY+AV G CSAA KPA   I
Sbjct: 187 SEQQLVDCDKY-DGGCNKGYYHRAFQWIMENGGITTAAQYPYKAVRGACSAA-KPAV-TI 243

Query: 244 SNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFG 303
           + +  V   +E AL  AV+ QP+ +AI       Q YK G+F+  CG Q+ HAV  VG+G
Sbjct: 244 TGHLAVAK-NELALQSAVARQPIGVAIEV-PISMQFYKSGVFSAACGIQMSHAVVTVGYG 301

Query: 304 TTEDGANYWLIKNSWGNTWGDAGYMKIVRD---EGLCGIGTRSSYP 346
               G  YWL+KNSWG TWG+AGY+++ RD    GLCGI   ++YP
Sbjct: 302 ADASGLKYWLVKNSWGQTWGEAGYIRMRRDVGGGGLCGIALDTAYP 347


>gi|307110445|gb|EFN58681.1| hypothetical protein CHLNCDRAFT_56822 [Chlorella variabilis]
          Length = 466

 Score =  264 bits (675), Expect = 4e-68,   Method: Compositional matrix adjust.
 Identities = 138/347 (39%), Positives = 201/347 (57%), Gaps = 21/347 (6%)

Query: 18  MFIIITLLVSCASQVVSSRSTHE----------QSVVEIHEKWMAQHGRSYKDELEKEMR 67
           M +   LLV+C+   V++    E          +S  E  + W+    R+Y    E E R
Sbjct: 1   MRLSCVLLVACSCLAVAAGFPFENHRLFIQQAVESPREAFDFWVQTLKRAYASAEEYERR 60

Query: 68  LKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFK 127
             ++ +NL ++ + N  G+ ++ L    ++DL+ DE+R+   GY       R   ++ F 
Sbjct: 61  FDVWLDNLRFVHEYNA-GHTSHWLSMGVYADLSQDEYRSKALGYNADLHEERPLRAAPFL 119

Query: 128 YQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQ 187
           Y+    T  P  +DW  KGAVTP+KNQ  CG CWAF+   AVEG + I +G L  LSEQ 
Sbjct: 120 YEG---TVPPKEVDWVAKGAVTPVKNQLLCGSCWAFSTTGAVEGASAIATGKLASLSEQM 176

Query: 188 LLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQ-KPAAAKISNY 246
           L+DC    +NGC GG  + AF +I++N GI TED+YPY A  G C   + +     I +Y
Sbjct: 177 LVDCDRERDNGCHGGLMDFAFEFIMKNGGIDTEDDYPYTAEEGMCQDNKMRRHVVTIDDY 236

Query: 247 EEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTE 306
           ++VP  DE AL+KAV+ QPVS+AI A    FQ Y  G+F+  CGT LDH V +VG+GT  
Sbjct: 237 QDVPPNDEHALMKAVANQPVSVAIEADQRAFQLYGGGVFDAECGTALDHGVLVVGYGTAS 296

Query: 307 DGAN---YWLIKNSWGNTWGDAGYMKIVR---DEGLCGIGTRSSYPL 347
           +G +   YWL+KNSWG  WGD GY++++R   +EG CG+  ++S+P+
Sbjct: 297 NGTHHLPYWLVKNSWGAEWGDKGYIRLLRNLGEEGQCGVAMQASFPI 343


>gi|390368662|ref|XP_780781.2| PREDICTED: cathepsin L-like [Strongylocentrotus purpuratus]
          Length = 333

 Score =  264 bits (674), Expect = 5e-68,   Method: Compositional matrix adjust.
 Identities = 142/336 (42%), Positives = 209/336 (62%), Gaps = 13/336 (3%)

Query: 19  FIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYI 78
           ++ + L+  C   VVSS S       E   +W  +HG+ Y  + E+  R  I+++NL+ +
Sbjct: 3   YLSVLLVAVC---VVSSLSMSFTDFDEDWNQWKNEHGKRYLSDEEEASRKLIWEKNLDIV 59

Query: 79  EKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTD 135
            K N +   G+ TY LG NQF+DL N+EF A+ TG+++   S ++   STF   N ++  
Sbjct: 60  IKHNLKYDLGHFTYALGMNQFADLQNEEFVAMMTGFRVNGTS-KAAKGSTFLPSN-NVDK 117

Query: 136 VPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG 195
           +P ++DWR KG VTP+K+Q +CG CWAF+A  ++EG    ++G L+ LSEQ L+DCS   
Sbjct: 118 LPKTVDWRTKGYVTPVKDQGQCGSCWAFSATGSLEGQQFKKTGKLVSLSEQNLVDCSYR- 176

Query: 196 NNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQ 255
           N GC GG  ++AF YII   GI TE  Y Y+AV G C   +    A ++ Y +V SG E+
Sbjct: 177 NYGCHGGFMDRAFQYIIDAGGIDTEATYSYRAVDGNCHFKKANVGATVTGYTDVTSGSEK 236

Query: 256 ALLKAVS-MQPVSIAIAAYSTEFQSYKEGIFN--GVCGTQLDHAVTIVGFGTTEDGANYW 312
           AL KAV+ + P+S+AI A    F+ YK G++N  G   T+L HAV +VG+GTT DG +YW
Sbjct: 237 ALQKAVAHIGPISVAIDASHKFFKFYKSGVYNEPGCSTTRLGHAVLVVGYGTTSDGTDYW 296

Query: 313 LIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYPL 347
           ++KNSW  TWG  GY+ + R+ +  CGI + +SYP+
Sbjct: 297 IVKNSWAKTWGMNGYLWMSRNKDNQCGIASEASYPM 332


>gi|194320502|gb|ACF48469.1| cathepsin L [Triatoma brasiliensis]
          Length = 330

 Score =  264 bits (674), Expect = 5e-68,   Method: Compositional matrix adjust.
 Identities = 135/311 (43%), Positives = 198/311 (63%), Gaps = 17/311 (5%)

Query: 48  EKWM---AQHGRSYKDELEKEMRLKIFKENLEYIEKAN---KEGNRTYKLGTNQFSDLTN 101
           E+W    A HG++YK++ E+  R+KIF +N + IE  N   ++G  +YK+  N F DL  
Sbjct: 25  EEWHVFKAMHGKTYKNQFEEMFRMKIFMDNKKKIEAHNAKYEQGEVSYKMMMNHFGDLMV 84

Query: 102 DEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCW 161
            EF+AL  G+KM SP  +      F     S +++P ++DWR KGAVTP+K+Q +CG CW
Sbjct: 85  HEFKALMNGFKM-SPDTKRNGELYFP----SNSNLPKTVDWRQKGAVTPVKDQGQCGSCW 139

Query: 162 AFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GNNGCLGGSREKAFAYIIQNQGIATE 220
           +F+A  ++EG   +++G L+ LSEQ L+DCST+ GNNGC GG  ++AF Y+  N+GI TE
Sbjct: 140 SFSATGSLEGQVFLKTGKLVSLSEQNLVDCSTSYGNNGCEGGLMDQAFQYVSDNKGIDTE 199

Query: 221 DEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAV-SMQPVSIAIAAYSTEFQS 279
             YPY+A   TC   +         + ++P+GDE+AL  A+ ++ P+S+AI A    FQ 
Sbjct: 200 ASYPYEARENTCRFKKNKVGGTDKGHVDIPAGDEKALQNALATVGPISVAIDANHGSFQF 259

Query: 280 YKEGIFN--GVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRDE-GL 336
           Y +G++N        LDH V  VG+G TE+G +YWL+KNSWG +WG+ GY+KI R+    
Sbjct: 260 YSKGVYNEPNCSSYDLDHGVLAVGYG-TENGQDYWLVKNSWGPSWGENGYIKIARNHSNH 318

Query: 337 CGIGTRSSYPL 347
           CGI + +SYPL
Sbjct: 319 CGIASMASYPL 329


>gi|390337645|ref|XP_001199228.2| PREDICTED: cathepsin L-like [Strongylocentrotus purpuratus]
          Length = 333

 Score =  264 bits (674), Expect = 5e-68,   Method: Compositional matrix adjust.
 Identities = 143/337 (42%), Positives = 209/337 (62%), Gaps = 13/337 (3%)

Query: 19  FIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYI 78
           ++ + L+  C   VVSS S       E  ++W  +HG+ Y  + E+  R  I+++NL+ +
Sbjct: 3   YLSVLLVAVC---VVSSLSMSFTDFDEDWKEWKNEHGKRYLSDEEEASRRLIWQKNLDIV 59

Query: 79  EKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTD 135
            + N +   G+ TY LG NQF+DL N EF A+ TG+++   S ++   STF   N ++  
Sbjct: 60  IRHNLKYDLGHFTYDLGMNQFADLQNKEFVAMMTGFRVNGTS-KAAKGSTFLPPN-NVGK 117

Query: 136 VPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG 195
           +P ++DWR KG VTP+K+Q +CG CWAF+A  ++EG    ++G L+ LSEQ L+DCS + 
Sbjct: 118 LPKTVDWRTKGYVTPVKDQGQCGSCWAFSATGSLEGQHFKKTGKLVSLSEQNLVDCS-DK 176

Query: 196 NNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQ 255
           N GC GG  ++AF YII   GI TE+ YPY A+ G C        A ++ Y +V SG E+
Sbjct: 177 NYGCNGGLMDRAFQYIIDAGGIDTEESYPYIAMDGNCHFKTANVGATVTGYTDVTSGSEK 236

Query: 256 ALLKAVS-MQPVSIAIAAYSTEFQSYKEGIFN--GVCGTQLDHAVTIVGFGTTEDGANYW 312
           AL KAV+ + P+S+AI A    FQ Y+ G++N  G   T LDH V  VG+GTT DG +YW
Sbjct: 237 ALQKAVAHIGPISVAIDASHFSFQLYQSGVYNEPGCSSTLLDHGVLAVGYGTTIDGTDYW 296

Query: 313 LIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYPLA 348
           ++KNSW  TWG  GY+ + R+ +  CGI T++SYPL 
Sbjct: 297 IVKNSWAETWGMNGYIWMSRNKDNQCGIATQASYPLV 333


>gi|226499884|ref|NP_001148278.1| thiol protease SEN102 precursor [Zea mays]
 gi|195617112|gb|ACG30386.1| thiol protease SEN102 precursor [Zea mays]
          Length = 374

 Score =  264 bits (674), Expect = 6e-68,   Method: Compositional matrix adjust.
 Identities = 143/344 (41%), Positives = 199/344 (57%), Gaps = 27/344 (7%)

Query: 29  ASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNR- 87
           A   + S S  + S++E  ++W A + +SY    E+  R +++  N+ YIE  N E    
Sbjct: 32  AGDTMGSMSNDDSSMIERFQRWKAAYNKSYATVAEERRRFRVYARNMAYIEATNAEAEAA 91

Query: 88  --TYKLGTNQFSDLTNDEFRALYTGYKMPS-PSHRSTTSSTFK--------------YQN 130
             TY+LG   ++DLTN EF A+YT   +   P+  S  ++                 Y N
Sbjct: 92  GLTYELGETAYTDLTNQEFMAMYTAPALAQLPADESVITTRAGPVDAVGGAPGQLPVYVN 151

Query: 131 LSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLD 190
           LS +  P S+DWR  GAVTP+KNQ  CG CWAF+ VA VEGI +IR+G L+ LSEQ+L+D
Sbjct: 152 LSAS-APASVDWRASGAVTPVKNQGRCGSCWAFSTVAVVEGIYQIRTGKLVSLSEQELVD 210

Query: 191 CSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQ-KPAAAKISNYEEV 249
           C T  ++GC GG   +A  +I  N GI TE +YPY      C+ A+    A  I+    V
Sbjct: 211 CDTL-DDGCDGGISYRALRWIASNGGITTEADYPYTGTTDACNRAKLSHNAVSIAGLRRV 269

Query: 250 PSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGT-TEDG 308
            +  E +L  AV+ QPV+++I A    FQ YK+G++NG CGT L+H VT+VG+G     G
Sbjct: 270 ATRSEASLANAVAGQPVAVSIEAGGDNFQHYKKGVYNGPCGTNLNHGVTVVGYGQEAAAG 329

Query: 309 ANYWLIKNSWGNTWGDAGYMKIVRD-----EGLCGIGTRSSYPL 347
             YW++KNSWG  WGD GY+++ +D     EGLCGI  R SYPL
Sbjct: 330 DRYWIVKNSWGQGWGDDGYIRMKKDVAGKPEGLCGIAIRPSYPL 373


>gi|242048430|ref|XP_002461961.1| hypothetical protein SORBIDRAFT_02g011230 [Sorghum bicolor]
 gi|241925338|gb|EER98482.1| hypothetical protein SORBIDRAFT_02g011230 [Sorghum bicolor]
          Length = 380

 Score =  263 bits (673), Expect = 8e-68,   Method: Compositional matrix adjust.
 Identities = 145/346 (41%), Positives = 195/346 (56%), Gaps = 35/346 (10%)

Query: 33  VSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNR---TY 89
           + S +     ++E  ++W A + +SY    E   R  ++  N+ YIE  N E      TY
Sbjct: 38  MGSSTDDNSPMIERFQRWKAAYNKSYATVAEDRRRFLVYARNMAYIEATNAEAEAAGLTY 97

Query: 90  KLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFK---------------------Y 128
           +LG   ++DLTN EF A+YT    PSP+                               Y
Sbjct: 98  ELGETAYTDLTNQEFMAMYTA--APSPAQLPADEDEDDAAEAVITTRAGPVDAVGQLPVY 155

Query: 129 QNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQL 188
            NLS T  P S+DWR  GAVTP+KNQ  CG CWAF+ VA VEGI +IR+G L+ LSEQ+L
Sbjct: 156 VNLS-TAAPASVDWRASGAVTPVKNQGRCGSCWAFSTVAVVEGIYQIRTGKLVSLSEQEL 214

Query: 189 LDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQ-KPAAAKISNYE 247
           +DC T  + GC GG   +A  +I  N G+ TE++YPY      C+ A+    AA I+   
Sbjct: 215 VDCDTL-DAGCDGGISYRALRWITSNGGLTTEEDYPYTGTTDACNRAKLAHNAASIAGLR 273

Query: 248 EVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGT-TE 306
            V +  E +L  AV+ QPV+++I A    FQ YK G++NG CGT L+H VT+VG+G   E
Sbjct: 274 RVATRSEASLANAVAGQPVAVSIEAGGDNFQHYKRGVYNGPCGTSLNHGVTVVGYGQEEE 333

Query: 307 DGANYWLIKNSWGNTWGDAGYMKIVRD-----EGLCGIGTRSSYPL 347
           DG  YW+IKNSWG +WGD GY+K+ +D     EGLCGI  R S+PL
Sbjct: 334 DGDKYWIIKNSWGASWGDGGYIKMRKDVAGKPEGLCGIAIRPSFPL 379


>gi|297830594|ref|XP_002883179.1| hypothetical protein ARALYDRAFT_318695 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297329019|gb|EFH59438.1| hypothetical protein ARALYDRAFT_318695 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 308

 Score =  263 bits (672), Expect = 9e-68,   Method: Compositional matrix adjust.
 Identities = 135/309 (43%), Positives = 192/309 (62%), Gaps = 24/309 (7%)

Query: 46  IHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFR 105
           ++E+W+ ++ ++Y    EKE R KIFKENL++I++ N   N+T+++G  +F+DLTNDE +
Sbjct: 1   MYERWLVENRKNYNGLGEKERRCKIFKENLKFIDEHNSLPNQTFEVGLTRFADLTNDEPK 60

Query: 106 ALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAA 165
                            +  + Y+   +  +P  +DWR KGAV P+K+Q  CG CWAF+A
Sbjct: 61  DF-------------MKADRYLYKEGDI--LPDEIDWRAKGAVVPVKDQGNCGSCWAFSA 105

Query: 166 VAAVEGITKIRSGNLIQLSEQQLLDCSTNG-NNGCLGGSREKAFAYIIQNQGIATEDEYP 224
           V AVEGI +I++G LI LS+Q+L+DC     N GC GG    AF +II N GI ++ +YP
Sbjct: 106 VGAVEGINQIKTGELISLSDQELIDCDRGFVNAGCEGGVMNYAFEFIINNGGIESDQDYP 165

Query: 225 YQAVP-GTCSAAQK--PAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYK 281
           Y A   G C+A +K      KI  YE V   DE++L KAV+ QPV +AI A S  F+ YK
Sbjct: 166 YTATDLGVCNADKKNNTRVVKIDGYEYVAQNDEKSLKKAVAHQPVGVAIEASSQAFKLYK 225

Query: 282 EGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLC 337
            G+F G CG  LDH V +VG+GT+  G +YW+I+NSWG  WG+ GY+K+ R+     G C
Sbjct: 226 SGVFTGTCGIYLDHGVVVVGYGTSS-GEDYWIIRNSWGLNWGENGYVKLQRNIDDSFGKC 284

Query: 338 GIGTRSSYP 346
           G+    SYP
Sbjct: 285 GVAMMPSYP 293


>gi|195056367|ref|XP_001995082.1| GH22826 [Drosophila grimshawi]
 gi|193899288|gb|EDV98154.1| GH22826 [Drosophila grimshawi]
          Length = 340

 Score =  263 bits (672), Expect = 1e-67,   Method: Compositional matrix adjust.
 Identities = 141/343 (41%), Positives = 199/343 (58%), Gaps = 21/343 (6%)

Query: 19  FIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMA---QHGRSYKDELEKEMRLKIFKENL 75
           +I   L +   +Q VS           I E+W     +H + Y+DE E+  RLKIF EN 
Sbjct: 4   YIFALLALVAVAQAVSFADV-------IKEEWQTFKLEHRKQYQDETEERFRLKIFNENK 56

Query: 76  EYIEKANK---EGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFK---YQ 129
             I K N+    G  ++K+G N+++D+ + EF     G+          + +TF    + 
Sbjct: 57  HKIAKHNQLYAAGEVSFKMGLNKYADMLHHEFHETMNGFNYTLHKQLRASDATFTGVTFI 116

Query: 130 NLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLL 189
           +     +P S+DWR+KGAVT +K+Q  CG CWAF++  A+EG    ++G LI LSEQ L+
Sbjct: 117 SPEHVKLPQSVDWRNKGAVTGVKDQGHCGSCWAFSSTGALEGQHFRKTGTLISLSEQNLV 176

Query: 190 DCSTN-GNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEE 248
           DCST  GNNGC GG  + AF YI  N GI TE  YPY+ +  +C   +    A    + +
Sbjct: 177 DCSTKYGNNGCNGGLMDNAFRYIKDNGGIDTEKSYPYEGIDDSCHFNKGTIGATDRGFTD 236

Query: 249 VPSGDEQALLKAV-SMQPVSIAIAAYSTEFQSYKEGIFN-GVCGTQ-LDHAVTIVGFGTT 305
           +P GDE+ L +AV ++ PVS+AI A    FQ Y  G+++   C  Q LDH V +VG+GT 
Sbjct: 237 IPQGDEKKLAQAVATIGPVSVAIDASHESFQFYSTGVYDEPQCDPQNLDHGVLVVGYGTD 296

Query: 306 EDGANYWLIKNSWGNTWGDAGYMKIVR-DEGLCGIGTRSSYPL 347
           E+G +YWL+KNSWG TWGD G++K+ R D+  CGI T SSYPL
Sbjct: 297 ENGKDYWLVKNSWGTTWGDKGFIKMARNDDNQCGIATASSYPL 339


>gi|195124431|ref|XP_002006696.1| GI21205 [Drosophila mojavensis]
 gi|193911764|gb|EDW10631.1| GI21205 [Drosophila mojavensis]
          Length = 339

 Score =  263 bits (671), Expect = 1e-67,   Method: Compositional matrix adjust.
 Identities = 140/318 (44%), Positives = 195/318 (61%), Gaps = 18/318 (5%)

Query: 46  IHEKWMA---QHGRSYKDELEKEMRLKIFKENLEYIEKANKE---GNRTYKLGTNQFSDL 99
           I E+W     +H ++Y+DE E+  RLKIF EN   I K N+    G  T+K+  N+++D+
Sbjct: 23  IKEEWHTFKLEHRKTYQDETEERFRLKIFNENKHKIAKHNQRYATGEVTFKMAVNKYADM 82

Query: 100 TNDEFRAL-----YTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQ 154
            + EFR       YT +K    S  S T  TF   + +   +P S+DWR+KGAVT +K+Q
Sbjct: 83  LHHEFRETMNGFNYTLHKELRASDPSFTGITF--ISPAHVKLPKSVDWREKGAVTAVKDQ 140

Query: 155 KECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GNNGCLGGSREKAFAYIIQ 213
             CG CWAF++  A+EG    ++G L+ LSEQ L+DCS   GNNGC GG  + AF YI  
Sbjct: 141 GHCGSCWAFSSTGALEGQHFRKTGTLVSLSEQNLVDCSAKYGNNGCNGGLMDNAFRYIKD 200

Query: 214 NQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAV-SMQPVSIAIAA 272
           N GI TE  YPY+ +  +C   +    A    + ++P G+E+ + +AV ++ PVS+AI A
Sbjct: 201 NGGIDTEKSYPYEGIDDSCHFNKDSVGATDRGFADIPQGNEKKMAEAVATIGPVSVAIDA 260

Query: 273 YSTEFQSYKEGIFNGV-CGTQ-LDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKI 330
               FQ Y EGI+N   C +Q LDH V +VG+GT E G +YWL+KNSWG TWGD G++K+
Sbjct: 261 SHESFQFYSEGIYNEPECNSQNLDHGVLVVGYGTDESGKDYWLVKNSWGTTWGDKGFIKM 320

Query: 331 VRDE-GLCGIGTRSSYPL 347
            R+E   CGI + SSYPL
Sbjct: 321 ARNEDNQCGIASASSYPL 338


>gi|357133074|ref|XP_003568153.1| PREDICTED: cysteine proteinase RD21a-like [Brachypodium distachyon]
          Length = 565

 Score =  263 bits (671), Expect = 1e-67,   Method: Compositional matrix adjust.
 Identities = 133/314 (42%), Positives = 191/314 (60%), Gaps = 15/314 (4%)

Query: 46  IHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNR--------TYKLGTNQFS 97
           + E W A+HG++Y    E+  RL  F +N  ++   N  G          +Y L  N F+
Sbjct: 41  LFEAWCAEHGKAYASPGERAARLAAFADNAAFVAAHNAGGGGAGGSNAAPSYTLALNAFA 100

Query: 98  DLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKEC 157
           DLT+ EFRA   G ++     R+  S      ++ +  VP +LDWR  GAVT +K+Q  C
Sbjct: 101 DLTHAEFRAARLG-RLAVGGARAPPSEGGFAGSVGVGAVPEALDWRQSGAVTKVKDQGSC 159

Query: 158 GCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGI 217
           G CW+F+A  A+EGI KI++G+LI LSEQ+L+DC  + N GC GG  + A+ ++I+N GI
Sbjct: 160 GACWSFSATGAIEGINKIKTGSLISLSEQELIDCDRSYNAGCGGGLMDYAYRFVIKNGGI 219

Query: 218 ATEDEYPYQAVPGTCSAAQ-KPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTE 276
            TED+YPY+   GTC+  + K     I  Y +VP+  E +LL+AV+ QP+S+ I   +  
Sbjct: 220 DTEDDYPYREADGTCNKNKLKRHVVTIDGYSDVPANKEDSLLQAVAQQPISVGICGSARA 279

Query: 277 FQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD--- 333
           FQ Y +GIF+G C T LDHAV IVG+G +E G +YW++KNSWG  WG  GYM + R+   
Sbjct: 280 FQLYSQGIFDGPCPTSLDHAVLIVGYG-SEGGKDYWIVKNSWGERWGMKGYMHMHRNTGS 338

Query: 334 -EGLCGIGTRSSYP 346
             G+CGI   +S+P
Sbjct: 339 SSGICGINMMASFP 352


>gi|242088413|ref|XP_002440039.1| hypothetical protein SORBIDRAFT_09g024940 [Sorghum bicolor]
 gi|241945324|gb|EES18469.1| hypothetical protein SORBIDRAFT_09g024940 [Sorghum bicolor]
          Length = 463

 Score =  263 bits (671), Expect = 1e-67,   Method: Compositional matrix adjust.
 Identities = 130/315 (41%), Positives = 197/315 (62%), Gaps = 16/315 (5%)

Query: 46  IHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNR--------TYKLGTNQFS 97
           + + W A+HG++Y    E+  RL +F +N  ++   N   N         +Y L  N F+
Sbjct: 40  LFDAWCAEHGKAYATPEERAARLAVFADNAAFVAAHNARVNAAGGGGAPPSYTLALNAFA 99

Query: 98  DLTNDEFRALYTG-YKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKE 156
           DLT++EFRA   G     + + RS  +  ++  +  +  VP +LDWR+ GAVT +K+Q  
Sbjct: 100 DLTHEEFRAARLGRIAAGAAALRSPAAPVYRGLDGGLGAVPDALDWRENGAVTKVKDQGS 159

Query: 157 CGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQG 216
           CG CW+F+A  A+EGI KI++G+L+ LSEQ+L+DC  + N+GC GG  + A+ ++++N G
Sbjct: 160 CGACWSFSATGAMEGINKIKTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYKFVVKNGG 219

Query: 217 IATEDEYPYQAVPGTCSAAQ-KPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYST 275
           I TE++YPY+   GTC+  + K     I  Y +VPS  E  LL+AV+ QPVS+ I   + 
Sbjct: 220 IDTEEDYPYREADGTCNKNKLKKRIVTIDGYSDVPSNKEDLLLQAVAQQPVSVGICGSAR 279

Query: 276 EFQSY-KEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD- 333
            FQ Y ++GIF+G C T LDHAV IVG+G +E G +YW++KNSWG +WG  GYM + R+ 
Sbjct: 280 AFQLYSQQGIFDGPCPTSLDHAVLIVGYG-SEGGKDYWIVKNSWGESWGMKGYMHMHRNT 338

Query: 334 ---EGLCGIGTRSSY 345
              +G+CGI   +S+
Sbjct: 339 GDSKGVCGINMMASF 353


>gi|320169652|gb|EFW46551.1| cathepsin L2 [Capsaspora owczarzaki ATCC 30864]
          Length = 325

 Score =  262 bits (670), Expect = 1e-67,   Method: Compositional matrix adjust.
 Identities = 136/305 (44%), Positives = 186/305 (60%), Gaps = 9/305 (2%)

Query: 49  KWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALY 108
           +W A H R Y    E+ +R +I+  NLE I + N  G  +Y LG N+F DL + EF A Y
Sbjct: 23  EWKALHNRQYASAQEEALRQEIYLSNLELINEHNAAGRHSYTLGMNEFGDLAHHEFAAKY 82

Query: 109 TGYKMPS-PSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVA 167
            G +     + +S  SST+  +   M  +P S+DWR  G VTP+KNQ +CG CW+F+   
Sbjct: 83  LGVRFNGVNATKSFASSTYLPR---MVSLPDSVDWRTAGIVTPVKNQGQCGSCWSFSTTG 139

Query: 168 AVEGITKIRSGNLIQLSEQQLLDCSTN-GNNGCLGGSREKAFAYIIQNQGIATEDEYPYQ 226
           +VEG    ++G L+ LSEQ L+DCS+  GN GC GG  + AF YII+N GI TE  YPY 
Sbjct: 140 SVEGQHARKTGTLVSLSEQNLVDCSSQEGNEGCNGGLMDDAFEYIIKNGGIDTEASYPYT 199

Query: 227 AVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAV-SMQPVSIAIAAYSTEFQSYKEGIF 285
           A  GTC        A +++Y+++ +G E  L  AV ++ PVS+AI A    FQ Y  G++
Sbjct: 200 ATTGTCKFNAANIGATVASYQDIITGSESDLQNAVATVGPVSVAIDASHINFQFYFTGVY 259

Query: 286 N--GVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD-EGLCGIGTR 342
           N      TQLDH V  VG+GT+ +G +YWL+KNSWG TWG AGY+ + R+ +  CGI T 
Sbjct: 260 NEKKCSTTQLDHGVLAVGYGTSTEGKDYWLVKNSWGATWGKAGYIWMSRNADNQCGIATS 319

Query: 343 SSYPL 347
           +SYPL
Sbjct: 320 ASYPL 324


>gi|449450419|ref|XP_004142960.1| PREDICTED: vignain-like [Cucumis sativus]
          Length = 345

 Score =  262 bits (670), Expect = 2e-67,   Method: Compositional matrix adjust.
 Identities = 143/319 (44%), Positives = 206/319 (64%), Gaps = 20/319 (6%)

Query: 40  EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDL 99
           E+S+ +++E+W   H  S ++  EK  R  +FKEN+ ++   N+  ++ YKL  N+F+D+
Sbjct: 34  EESLWQLYERWGKHHTIS-RNLKEKHKRFSVFKENVNHVFTVNQM-DKPYKLKLNKFADM 91

Query: 100 TNDEFRALYTGYKMPSPSH------RSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKN 153
           +N EF   Y      + SH      R   +  F Y+    TD+P+S+DWR++GAV  +K 
Sbjct: 92  SNYEFVNFYA---RSNISHYRKLHERRRGAGGFMYE--QDTDLPSSVDWRERGAVNAVKE 146

Query: 154 QKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQ 213
           Q  CG CWAF++VAAVEGI KI++  L+ LSEQ+LLDC+   N GC GG  E AF +I +
Sbjct: 147 QGRCGSCWAFSSVAAVEGINKIKTNQLLSLSEQELLDCNYR-NKGCNGGFMEIAFDFIKR 205

Query: 214 NQGIATEDEYPYQAVPGTCSAAQ-KPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAA 272
           N GIATE+ YPY    G C +++      KI  YE VP  +E AL++AV+ QPVS+AI A
Sbjct: 206 NGGIATENSYPYHGSRGLCRSSRISSPIVKIDGYESVPE-NEDALMQAVANQPVSVAIDA 264

Query: 273 YSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR 332
              +FQ Y +G+F+G CGT+L+H V  +G+GTTEDG +YWL++NSWG  WG+ GY+++ R
Sbjct: 265 AGRDFQFYSQGVFDGYCGTELNHGVVAIGYGTTEDGTDYWLVRNSWGVGWGEDGYVRMKR 324

Query: 333 D----EGLCGIGTRSSYPL 347
                EGLCGI   +SYP+
Sbjct: 325 GVEQAEGLCGIAMEASYPI 343


>gi|194719810|emb|CAR31335.1| pro-asclepain f [Gomphocarpus fruticosus subsp. fruticosus]
          Length = 340

 Score =  262 bits (670), Expect = 2e-67,   Method: Compositional matrix adjust.
 Identities = 145/344 (42%), Positives = 209/344 (60%), Gaps = 22/344 (6%)

Query: 18  MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
           + I+  LL   A   +S+    +  V+ ++E+W+ +H + Y    EK  R +IFK+NL Y
Sbjct: 5   VLILSFLLFVSAITCISTNWRSDDEVIALYEEWLVKHQKLYSSLGEKIKRFEIFKDNLRY 64

Query: 78  IEKAN---KEGNRTYKLGTNQFSDLTNDEFRALYTGYKM-------PSPSHRSTTSSTFK 127
           I++ N   K  +  + LG NQF+DLT DEF ++Y G  +        +P+H        K
Sbjct: 65  IDQQNHYNKVNHMNFTLGLNQFADLTLDEFSSIYLGTSVDYEQIISSNPNHDDVEEDILK 124

Query: 128 YQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQ 187
                + ++P S+DWR+KG V PI+NQ +CG CW F+AVA++E +  I+ G++I LSEQ+
Sbjct: 125 E---DVVELPDSVDWREKGVVFPIRNQGKCGSCWTFSAVASIETLNGIKKGHMIALSEQE 181

Query: 188 LLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYE 247
           LLDC T  + GC GG    AFAY+ +N GI +E++YPY    G C   QK    KIS Y+
Sbjct: 182 LLDCETI-SQGCKGGHYNNAFAYVAKN-GITSEEKYPYIFRQGQC--YQKEKVVKISGYK 237

Query: 248 EVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTED 307
            VP  +   L  AV+ Q VS+A+   S +FQ Y  GIF+G CG  LDHAV IVG+G ++ 
Sbjct: 238 RVPRNNGGQLQSAVAQQVVSVAVKCESKDFQFYDRGIFSGACGPILDHAVNIVGYG-SKG 296

Query: 308 GANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPL 347
           GANYW+++NSWG  WG+ GYM+I ++    EG CGI  + SYP+
Sbjct: 297 GANYWIMRNSWGTNWGENGYMRIQKNSKHYEGHCGIAMQPSYPV 340


>gi|125525812|gb|EAY73926.1| hypothetical protein OsI_01810 [Oryza sativa Indica Group]
          Length = 319

 Score =  262 bits (669), Expect = 2e-67,   Method: Compositional matrix adjust.
 Identities = 138/327 (42%), Positives = 188/327 (57%), Gaps = 20/327 (6%)

Query: 29  ASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRT 88
           A+    +  + +   +++ E+WMA+ G++YK   EKE R  IF++N+ +I     +    
Sbjct: 2   AASAYYNNGSDDGVTMQMFEEWMAKFGKTYKCHGEKEHRFGIFRDNVHFIRGYKPQVTYD 61

Query: 89  YKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAV 148
             +G NQF+DLTNDEF A YTG K P P            + +     P  +DWR +GAV
Sbjct: 62  SAVGINQFADLTNDEFVATYTGAKPPHPKEAP--------RPVDPIWTPCCIDWRFRGAV 113

Query: 149 TPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAF 208
           T +K+Q  CG CWAFAAVAA+EG+TKIR+G L  LSEQ+L+DC TN +NGC GG  ++AF
Sbjct: 114 TGVKDQGACGSCWAFAAVAAIEGLTKIRTGQLTPLSEQELVDCDTN-SNGCGGGHTDRAF 172

Query: 209 AYIIQNQGIATEDEYPYQAVPGTCSAAQK--PAAAKISNYEEVPSGDEQALLKAVSMQPV 266
             +    GI  E +Y Y+   G C         AA I  Y  VP  DE+ L  AV+ QPV
Sbjct: 173 ELVASKGGITAESDYRYEGFQGKCRVDDMLFNHAASIGGYRAVPPNDERQLATAVARQPV 232

Query: 267 SIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGAN---YWLIKNSWGNTWG 323
           ++ I A    FQ YK G+F G CG   +HAVT+VG+   +DGA+   YW+ KNSWG TWG
Sbjct: 233 TVYIDASGPAFQFYKSGVFPGPCGASSNHAVTLVGY--CQDGASGKKYWVAKNSWGKTWG 290

Query: 324 DAGYMKIVRD----EGLCGIGTRSSYP 346
             GY+ + +D     G CG+     YP
Sbjct: 291 QQGYILLEKDVLQPHGTCGLAVSPFYP 317


>gi|391343119|ref|XP_003745860.1| PREDICTED: cathepsin L-like [Metaseiulus occidentalis]
          Length = 385

 Score =  262 bits (669), Expect = 2e-67,   Method: Compositional matrix adjust.
 Identities = 136/306 (44%), Positives = 191/306 (62%), Gaps = 11/306 (3%)

Query: 48  EKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRAL 107
           E + A+H + Y+   E+ MR  IF+EN ++IE  N +    + LG N F DLTN E+R  
Sbjct: 82  ENFKAEHNKKYESFPEELMRRLIFEENHQFIEDHNSKKEFDFYLGMNHFGDLTNKEYRER 141

Query: 108 YTGYKMP--SPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAA 165
           Y GY+ P  +PS  S   S  +     + DVP  +DWRD+G VTP+KNQ +CG CWAF+A
Sbjct: 142 YLGYRRPENTPSKASYIFSRAE----KIEDVPDQIDWRDQGFVTPVKNQGQCGSCWAFSA 197

Query: 166 VAAVEGITKIRSGNLIQLSEQQLLDCST-NGNNGCLGGSREKAFAYIIQNQGIATEDEYP 224
           V ++EG     +G L+ LSEQ L+DCST  GN+GC GG  ++AF Y+  N GI TED YP
Sbjct: 198 VGSLEGQHFKSTGKLVSLSEQNLVDCSTPEGNSGCNGGWMDQAFEYVKDNHGIDTEDSYP 257

Query: 225 YQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVSMQ-PVSIAIAAYSTEFQSYKEG 283
           Y    G+C    K   A +  + +V  GDE+AL +AV +  PVS+AI A S  FQ Y+ G
Sbjct: 258 YVGTDGSCHFKNKSIGATLKGFMDVKEGDEEALRQAVGVAGPVSVAIDASSMLFQFYRGG 317

Query: 284 IFN-GVCGT-QLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRDEG-LCGIG 340
           ++N   C T +LDH V +VG+G    G ++W++KNSWG  WG  GY+++ R++G  CGI 
Sbjct: 318 VYNVPWCSTSELDHGVLVVGYGKQFQGKDFWMVKNSWGVGWGIYGYIEMSRNKGNQCGIA 377

Query: 341 TRSSYP 346
           +++S P
Sbjct: 378 SKASIP 383


>gi|170041165|ref|XP_001848344.1| cathepsin l [Culex quinquefasciatus]
 gi|167864709|gb|EDS28092.1| cathepsin l [Culex quinquefasciatus]
          Length = 340

 Score =  262 bits (669), Expect = 2e-67,   Method: Compositional matrix adjust.
 Identities = 137/317 (43%), Positives = 195/317 (61%), Gaps = 15/317 (4%)

Query: 46  IHEKWMA---QHGRSYKDELEKEMRLKIFKENLEYIEKANK---EGNRTYKLGTNQFSDL 99
           + E+W A   QH + Y  E E+ +RLKI+ +N   I K N+   +G   ++L  N+++DL
Sbjct: 23  VKEEWNAYKLQHRKKYDSETEERLRLKIYVQNKHKIAKHNQRFEQGQEKFRLRVNKYTDL 82

Query: 100 TNDEFRALYTGYKMPS---PSHRSTT-SSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQK 155
            ++EF     G+   +   P  +         Y   +  +VP ++DWR+KGAVTP+K+Q 
Sbjct: 83  LHEEFVQTLNGFNRTNAKKPMLKGVKIDEPVTYIEPANVEVPKTVDWREKGAVTPVKDQG 142

Query: 156 ECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GNNGCLGGSREKAFAYIIQN 214
            CG CW+F+A  A+EG    ++G L+ LSEQ L+DCST  GNNGC GG  + AF YI  N
Sbjct: 143 HCGSCWSFSATGALEGQHFRKTGKLVSLSEQNLVDCSTKYGNNGCNGGMMDFAFQYIKDN 202

Query: 215 QGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVSMQ-PVSIAIAAY 273
            GI TE  YPY+A+  TC    K   A    + ++P GDE+AL+KA++   PVS+AI A 
Sbjct: 203 GGIDTEKAYPYEAIDDTCHYNPKAVGATDKGFVDIPQGDEKALMKAIATAGPVSVAIDAS 262

Query: 274 STEFQSYKEGI-FNGVCGTQ-LDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIV 331
              FQ Y EG+ +   C ++ LDH V  VG+GT+E+G +YWL+KNSWG TWGD GY+K+ 
Sbjct: 263 HESFQFYSEGVYYEPQCDSENLDHGVLAVGYGTSEEGEDYWLVKNSWGTTWGDQGYVKMA 322

Query: 332 RD-EGLCGIGTRSSYPL 347
           R+ +  CGI T +SYPL
Sbjct: 323 RNRDNHCGIATAASYPL 339


>gi|357114837|ref|XP_003559200.1| PREDICTED: fruit bromelain-like [Brachypodium distachyon]
          Length = 371

 Score =  261 bits (667), Expect = 3e-67,   Method: Compositional matrix adjust.
 Identities = 145/358 (40%), Positives = 203/358 (56%), Gaps = 43/358 (12%)

Query: 18  MFIIITLLVSCASQVVSSRSTHEQSVVEIHE--------KWMAQHGRSYKDELEKEMRLK 69
           +F+ +T L   A  +++  + H   VVE+ +        +W A H R+Y D  E+  R +
Sbjct: 27  LFVFLTALPPAA--IMTPAAGH---VVELDDMLMLDRFVRWQAAHNRTYGDAEERLRRFQ 81

Query: 70  IFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQ 129
           +++ N+EYIE  N+ G  TY+LG NQF+DLT++EF ++Y        S            
Sbjct: 82  VYRANIEYIEATNRRGGLTYELGENQFADLTSEEFLSMYA-------SSYDAGDRADDEA 134

Query: 130 NLSMTDV---------------PTSLDWRDKGAVTPIKNQ-KECGCCWAFAAVAAVEGIT 173
            L  TDV               P S DWR KGAVTP KNQ   C  CWAF  VA +EG+T
Sbjct: 135 ALITTDVAGDGAWSDGDLEALPPPSWDWRAKGAVTPPKNQGPTCSSCWAFVTVATIEGLT 194

Query: 174 KIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCS 233
            I++G LI LSEQQL+DC    + GC  GS  + F ++++N G+ TE EYPY A  G C+
Sbjct: 195 FIKTGKLISLSEQQLVDCDMY-DGGCNTGSYSRGFRWVLENGGLTTEAEYPYTAARGPCN 253

Query: 234 AAQKP-AAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQ 292
            A+    AAKI+    +P  +E  + KAV+ QPV +AI    +  Q YK G+++G CGT 
Sbjct: 254 RAKSAHHAAKITGQGRIPPQNELVMQKAVAGQPVGVAIEV-GSGMQFYKTGVYSGPCGTN 312

Query: 293 LDHAVTIVGFGT-TEDGANYWLIKNSWGNTWGDAGYMKIVRD---EGLCGIGTRSSYP 346
           L HAVT+VG+G     GA YW++KNSWG  WG+ G++++ RD    GLCGI    +YP
Sbjct: 313 LAHAVTVVGYGVDPASGAKYWIVKNSWGQAWGERGFIRMRRDVGGPGLCGIALDVAYP 370


>gi|242072388|ref|XP_002446130.1| hypothetical protein SORBIDRAFT_06g002130 [Sorghum bicolor]
 gi|241937313|gb|EES10458.1| hypothetical protein SORBIDRAFT_06g002130 [Sorghum bicolor]
          Length = 276

 Score =  261 bits (667), Expect = 3e-67,   Method: Compositional matrix adjust.
 Identities = 127/280 (45%), Positives = 179/280 (63%), Gaps = 29/280 (10%)

Query: 72  KENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNL 131
           ++N+ ++E  N   N  + LG NQF+DLT +EF+A   G+K  S     TT   FKY+NL
Sbjct: 19  RDNVAFVESFNANKNNKFWLGVNQFADLTTEEFKA-NKGFKPTSAEKVPTTG--FKYENL 75

Query: 132 SMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDC 191
           S++ +PT++DWR KGAVTPIKNQ +CGCCWAF+AVAA+EGI K+ +GNLI LS+Q+L+DC
Sbjct: 76  SVSALPTAVDWRTKGAVTPIKNQGQCGCCWAFSAVAAMEGIVKLSTGNLISLSKQELVDC 135

Query: 192 STNG-NNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVP 250
            T+  + GC                    E + PY+AV G C    K +AA I  +E+VP
Sbjct: 136 DTHSMDEGC--------------------EVQLPYKAVDGKCKGGSK-SAATIKGHEDVP 174

Query: 251 SGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGAN 310
             +E AL+KAV+ QPVS+A+ A    F  Y  G+  G CGT+LDH +  +G+G   DG  
Sbjct: 175 VNNEAALMKAVANQPVSVAVDASDRTFMLYSGGVMTGSCGTELDHGIAAIGYGMESDGTK 234

Query: 311 YWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYP 346
           YW++KNSWG TWG+ G++++ +D     G+CG+  + SYP
Sbjct: 235 YWILKNSWGTTWGEKGFLRMEKDITDKRGMCGLAMKPSYP 274


>gi|147769019|emb|CAN62459.1| hypothetical protein VITISV_015168 [Vitis vinifera]
          Length = 246

 Score =  261 bits (666), Expect = 4e-67,   Method: Compositional matrix adjust.
 Identities = 134/271 (49%), Positives = 183/271 (67%), Gaps = 34/271 (12%)

Query: 86  NRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHR-STTSSTFKYQNLSMTDVPTSLDWRD 144
           +++YKL  N+F+DLTN+EF      +K    +H  ST +++FKY+N+  T VP++ DWR 
Sbjct: 2   DKSYKLSINEFADLTNEEFGTSRNRFK----AHICSTEATSFKYENV--TAVPSTXDWRK 55

Query: 145 KGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-NNGCLGGS 203
           KGAVTPIK+Q +CG CWAF+AVAA+EGIT++ +G LI LSEQ+L+DC T+G + GC G +
Sbjct: 56  KGAVTPIKDQGQCGSCWAFSAVAAMEGITQLSTGKLISLSEQELVDCDTSGEDQGCXGAN 115

Query: 204 REKAFAYIIQNQGIATEDEYPYQAVPGTCS--AAQKPAAAKISNYEEVPSGDEQALLKAV 261
                              YPY    GTC+   A  PAA KI+ YE+VP+ +E+AL KAV
Sbjct: 116 -------------------YPYAGTDGTCNRKKAAHPAA-KINGYEDVPANNEKALQKAV 155

Query: 262 SMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNT 321
           + QP+++AI A   EFQ Y  G+F G CGT+LDH V  VG+GT++DG  YWL+KNSWG  
Sbjct: 156 AHQPIAVAIDAGGXEFQFYSSGVFTGQCGTELDHGVXAVGYGTSDDGMKYWLVKNSWGTG 215

Query: 322 WGDAGYMKIVRD----EGLCGIGTRSSYPLA 348
           WG+ GY+++ RD    EGLCGI  ++SYP A
Sbjct: 216 WGEEGYIRMQRDVTAKEGLCGIAMQASYPTA 246


>gi|226505708|ref|NP_001141813.1| uncharacterized protein LOC100273952 precursor [Zea mays]
 gi|194706024|gb|ACF87096.1| unknown [Zea mays]
 gi|413945958|gb|AFW78607.1| hypothetical protein ZEAMMB73_489507 [Zea mays]
          Length = 460

 Score =  261 bits (666), Expect = 4e-67,   Method: Compositional matrix adjust.
 Identities = 132/317 (41%), Positives = 192/317 (60%), Gaps = 20/317 (6%)

Query: 48  EKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNR-------------TYKLGTN 94
           + W A+HG++Y    E+  RL +F +N  ++   N                  +Y L  N
Sbjct: 37  DAWCAEHGKAYATPEERAARLAVFADNAAFVAAHNARAGANAAGGGGGGAAPPSYTLALN 96

Query: 95  QFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQ 154
            F+DLT++EFRA   G   P  + RS  +  + +       VP +LDWR  GAVT +K+Q
Sbjct: 97  AFADLTHEEFRAARLGRIAPGAALRSRAAPVY-WGLGGGAAVPDALDWRKSGAVTKVKDQ 155

Query: 155 KECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQN 214
             CG CW+F+A  A+EGI KI++G+L+ LSEQ+L+DC  + N+GC GG  + A+ ++I+N
Sbjct: 156 GSCGACWSFSATGAMEGINKIKTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYKFVIKN 215

Query: 215 QGIATEDEYPYQAVPGTCSAAQ-KPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAY 273
            GI TE++YPY+   GTC+  + K     I  Y +VPS  E  LL+AV+ QPVS+ I   
Sbjct: 216 GGIDTEEDYPYREADGTCNKNKLKKRVVTIDGYTDVPSNKEDLLLQAVAQQPVSVGICGS 275

Query: 274 STEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD 333
           +  FQ Y +GIF+G C T LDHAV IVG+G +E G +YW++KNSWG +WG  GYM + R+
Sbjct: 276 ARAFQLYYQGIFDGPCPTSLDHAVLIVGYG-SEGGKDYWIVKNSWGESWGMKGYMHMHRN 334

Query: 334 ----EGLCGIGTRSSYP 346
               +G+CGI   +S+P
Sbjct: 335 TGDSKGVCGINMMASFP 351


>gi|157132324|ref|XP_001655999.1| cathepsin l [Aedes aegypti]
 gi|108881694|gb|EAT45919.1| AAEL002833-PA [Aedes aegypti]
          Length = 339

 Score =  261 bits (666), Expect = 4e-67,   Method: Compositional matrix adjust.
 Identities = 146/350 (41%), Positives = 210/350 (60%), Gaps = 32/350 (9%)

Query: 18  MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMA---QHGRSYKDELEKEMRLKIFKEN 74
           M I+I L+   A+   ++ S +E     + E+W A   QH ++Y  E E+ +RLKI+ +N
Sbjct: 1   MKILILLMAFVAA--ANAVSLYEL----VKEEWNAFKLQHRKNYDSETEERIRLKIYVQN 54

Query: 75  LEYIEKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFK---- 127
              I K N+    G   Y+L  N+++DL ++EF     G+      +R+ +  + K    
Sbjct: 55  KHKIAKHNQRFDLGQEKYRLRVNKYADLLHEEFVQTVNGF------NRTDSKKSLKGVRI 108

Query: 128 -----YQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQ 182
                +   +  +VPT++DWR KGAVTP+K+Q  CG CW+F+A  A+EG    ++G L+ 
Sbjct: 109 EEPVTFIEPANVEVPTTVDWRKKGAVTPVKDQGHCGSCWSFSATGALEGQHFRKTGKLVS 168

Query: 183 LSEQQLLDCSTN-GNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAA 241
           LSEQ L+DCS   GNNGC GG  + AF YI  N GI TE  YPY+A+  TC    K   A
Sbjct: 169 LSEQNLVDCSGKYGNNGCNGGMMDYAFQYIKDNGGIDTEKSYPYEAIDDTCHFNPKAVGA 228

Query: 242 KISNYEEVPSGDEQALLKAV-SMQPVSIAIAAYSTEFQSYKEGI-FNGVCGTQ-LDHAVT 298
               Y ++P GDE+AL KA+ ++ PVSIAI A    FQ Y EG+ +   C ++ LDH V 
Sbjct: 229 TDKGYVDIPQGDEEALKKALATVGPVSIAIDASHESFQFYSEGVYYEPQCDSENLDHGVL 288

Query: 299 IVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYPL 347
            VG+GT+E+G +YWL+KNSWG TWGD GY+K+ R+ +  CG+ T +SYPL
Sbjct: 289 AVGYGTSEEGEDYWLVKNSWGTTWGDQGYVKMARNRDNHCGVATCASYPL 338


>gi|22653679|sp|Q26636.1|CATL_SARPE RecName: Full=Cathepsin L; Contains: RecName: Full=Cathepsin L
           heavy chain; Contains: RecName: Full=Cathepsin L light
           chain; Flags: Precursor
 gi|505140|dbj|BAA03970.1| cathepsin L precursor [Sarcophaga peregrina]
          Length = 339

 Score =  261 bits (666), Expect = 4e-67,   Method: Compositional matrix adjust.
 Identities = 138/314 (43%), Positives = 191/314 (60%), Gaps = 13/314 (4%)

Query: 46  IHEKWMA---QHGRSYKDELEKEMRLKIFKENLEYIEKANK---EGNRTYKLGTNQFSDL 99
           I E+W     QH ++Y +E+E+  R+KIF EN   I K N+   +G  +YKLG N+++D+
Sbjct: 24  IKEEWHTYKLQHRKNYANEVEERFRMKIFNENRHKIAKHNQLFAQGKVSYKLGLNKYADM 83

Query: 100 TNDEFRALYTGYK--MPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKEC 157
            + EF+    GY   +       T      Y   +   VP S+DWR+ GAVT +K+Q  C
Sbjct: 84  LHHEFKETMNGYNHTLRQLMRERTGLVGATYIPPAHVTVPKSVDWREHGAVTGVKDQGHC 143

Query: 158 GCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GNNGCLGGSREKAFAYIIQNQG 216
           G CWAF++  A+EG    ++G L+ LSEQ L+DCST  GNNGC GG  + AF YI  N G
Sbjct: 144 GSCWAFSSTGALEGQHFRKAGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKDNGG 203

Query: 217 IATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAV-SMQPVSIAIAAYST 275
           I TE  YPY+ +  +C   +    A  + + ++P GDE+ + KAV +M PVS+AI A   
Sbjct: 204 IDTEKSYPYEGIDDSCHFNKATIGATDTGFVDIPEGDEEKMKKAVATMGPVSVAIDASHE 263

Query: 276 EFQSYKEGIFNGV-CGTQ-LDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD 333
            FQ Y EG++N   C  Q LDH V +VG+GT E G +YWL+KNSWG TWG+ GY+K+ R+
Sbjct: 264 SFQLYSEGVYNEPECDEQNLDHGVLVVGYGTDESGMDYWLVKNSWGTTWGEQGYIKMARN 323

Query: 334 E-GLCGIGTRSSYP 346
           +   CGI T SSYP
Sbjct: 324 QNNQCGIATASSYP 337


>gi|443698586|gb|ELT98517.1| hypothetical protein CAPTEDRAFT_128252 [Capitella teleta]
          Length = 324

 Score =  261 bits (666), Expect = 4e-67,   Method: Compositional matrix adjust.
 Identities = 141/332 (42%), Positives = 207/332 (62%), Gaps = 17/332 (5%)

Query: 24  LLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANK 83
           +L  C +  ++S    ++++ E+   +   H ++Y  E E +MR  I++ +L  I + N 
Sbjct: 1   MLACCIAATLASPLVFDEALDEMWTLFKTTHSKTYATEAE-DMRRFIWERHLNMINQHNI 59

Query: 84  E---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSL 140
           E   G  T+ LG N++ DLT  E+ A+ +GYKM   +  S  SS  + +NL    VP ++
Sbjct: 60  EADLGKHTFSLGMNEYGDLTQHEYAAM-SGYKM---AKSSVGSSFLEPENLQ---VPKTV 112

Query: 141 DWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GNNGC 199
           DWR+KG VTP+KNQ +CG CWAF++  ++EG    ++G L  +SEQ L+DCS + GN GC
Sbjct: 113 DWREKGYVTPVKNQGQCGSCWAFSSTGSLEGQVFRKTGRLPSISEQNLVDCSRDEGNMGC 172

Query: 200 LGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLK 259
            GG  + AF YI +N GI +E  YPY+AV G C   +  +    S + ++P GDE AL  
Sbjct: 173 SGGLMDNAFTYIKKNMGIDSEKSYPYEAVDGECRYKKSDSVTTDSGFVDIPHGDETALRT 232

Query: 260 AV-SMQPVSIAIAAYSTEFQSYKEGIFN--GVCGTQLDHAVTIVGFGTTEDGANYWLIKN 316
           AV S+ PVS+AI A  T FQ YK G++       TQLDH V +VG+G  E+G +YWL+KN
Sbjct: 233 AVASVGPVSVAIDASHTSFQFYKTGVYTEANCSSTQLDHGVLVVGYG-VENGQDYWLVKN 291

Query: 317 SWGNTWGDAGYMKIVRDEG-LCGIGTRSSYPL 347
           SWG +WG+AGY+K+ R+ G  CGI +++SYPL
Sbjct: 292 SWGASWGEAGYIKLARNHGNQCGIASQASYPL 323


>gi|125526835|gb|EAY74949.1| hypothetical protein OsI_02845 [Oryza sativa Indica Group]
          Length = 360

 Score =  261 bits (666), Expect = 5e-67,   Method: Compositional matrix adjust.
 Identities = 143/321 (44%), Positives = 186/321 (57%), Gaps = 16/321 (4%)

Query: 41  QSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEG-NRTYKLGTNQFSDL 99
            S+   HE+WMA+ GR+Y D  EK  R+++F  N E ++ AN+ G +RTY LG NQFSDL
Sbjct: 37  HSMAARHERWMARFGRAYADAAEKARRMEVFAANAERVDAANRAGGDRTYTLGLNQFSDL 96

Query: 100 TNDEFRALYTGYKM--PSPSHRS--TTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQK 155
           T+DEF   + GY    P PSHR      +         TDVP S+DWR +GAVT +KNQ+
Sbjct: 97  TDDEFARTHLGYSWAPPPPSHRHGHRAENGTAAAAADDTDVPDSVDWRARGAVTEVKNQR 156

Query: 156 ECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQ 215
            CG CWAFAAVAA EG+ ++ +GNL+ LSEQQ+LDC T G N C GG    A  YI  + 
Sbjct: 157 SCGSCWAFAAVAATEGLVQLATGNLVSLSEQQVLDC-TGGANTCSGGDVSAALRYIAASG 215

Query: 216 GIATEDEYPYQAVPGTCS----AAQKPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIA 271
           G+ TE  Y Y    G C     AA   AAA          GDE AL    + QPV + + 
Sbjct: 216 GLQTEAAYAYGGQQGACRAGGFAAPNSAAAVGGARWARLYGDEGALQALAAGQPVVVVVE 275

Query: 272 AYSTEFQSYKEGIFNG--VCGTQLDHAVTIV-GFGTTEDGANYWLIKNSWGNTWGDAGYM 328
           A   +F+ Y+ G++ G   CG +L+HAVT+V      + G  YWL+KN WG  WG+ GYM
Sbjct: 276 ASEPDFRHYRSGVYAGSAACGRRLNHAVTVVGYGAAADGGGEYWLVKNQWGTWWGEGGYM 335

Query: 329 KIVRD---EGLCGIGTRSSYP 346
           ++ R     G CGI T + YP
Sbjct: 336 RVARGGAAGGNCGIATYAFYP 356


>gi|115438530|ref|NP_001043562.1| Os01g0613500 [Oryza sativa Japonica Group]
 gi|11034572|dbj|BAB17096.1| cysteine proteinase-like [Oryza sativa Japonica Group]
 gi|113533093|dbj|BAF05476.1| Os01g0613500 [Oryza sativa Japonica Group]
 gi|215697766|dbj|BAG91959.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 360

 Score =  261 bits (666), Expect = 5e-67,   Method: Compositional matrix adjust.
 Identities = 143/321 (44%), Positives = 186/321 (57%), Gaps = 16/321 (4%)

Query: 41  QSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEG-NRTYKLGTNQFSDL 99
            S+   HE+WMA+ GR+Y D  EK  R+++F  N E ++ AN+ G +RTY LG NQFSDL
Sbjct: 37  HSMAARHERWMARFGRAYADAAEKARRMEVFAANAERVDAANRAGGDRTYTLGLNQFSDL 96

Query: 100 TNDEFRALYTGYKM--PSPSHRS--TTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQK 155
           T+DEF   + GY    P PSHR      +         TDVP S+DWR +GAVT +KNQ+
Sbjct: 97  TDDEFAQTHLGYSWAPPPPSHRHGHRAENGTAAAAADDTDVPDSVDWRARGAVTEVKNQR 156

Query: 156 ECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQ 215
            CG CWAFAAVAA EG+ ++ +GNL+ LSEQQ+LDC T G N C GG    A  YI  + 
Sbjct: 157 SCGSCWAFAAVAATEGLVQLATGNLVSLSEQQVLDC-TGGANTCSGGDVSAALRYIAASG 215

Query: 216 GIATEDEYPYQAVPGTCS----AAQKPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIA 271
           G+ TE  Y Y    G C     AA   AAA          GDE AL    + QPV + + 
Sbjct: 216 GLQTEAAYAYGGQQGACRAGGFAAPNSAAAVGGARWARLYGDEGALQALAAGQPVVVVVE 275

Query: 272 AYSTEFQSYKEGIFNG--VCGTQLDHAVTIV-GFGTTEDGANYWLIKNSWGNTWGDAGYM 328
           A   +F+ Y+ G++ G   CG +L+HAVT+V      + G  YWL+KN WG  WG+ GYM
Sbjct: 276 ASEPDFRHYRSGVYAGSAACGRRLNHAVTVVGYGAAADGGGEYWLVKNQWGTWWGEGGYM 335

Query: 329 KIVRD---EGLCGIGTRSSYP 346
           ++ R     G CGI T + YP
Sbjct: 336 RVARGGAAGGNCGIATYAFYP 356


>gi|21593501|gb|AAM65468.1| cysteine proteinase [Arabidopsis thaliana]
          Length = 376

 Score =  261 bits (666), Expect = 5e-67,   Method: Compositional matrix adjust.
 Identities = 142/351 (40%), Positives = 205/351 (58%), Gaps = 21/351 (5%)

Query: 10  SFKINTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLK 69
           SF+        ++ + +S      +    +E  V+ ++E+W+ ++G++Y    EKE R K
Sbjct: 4   SFRTLALLTLSVLLISISLGVVTATESQRNEGGVLTMYEQWLVENGKNYNGLGEKERRFK 63

Query: 70  IFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQ 129
           IFK+NL+ IE+ N + NR+Y+ G N+FSDLT DEF+A Y G KM     +S +    +YQ
Sbjct: 64  IFKDNLKRIEEHNSDPNRSYERGLNKFSDLTADEFQASYLGGKM---EKKSLSDVAERYQ 120

Query: 130 NLSMTDVPTSLDWRDKGAVTP-IKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQL 188
                 +P  +DWR++GAV P +K Q ECG CWAFAA  AVEGI +I +G L+ LSEQ+L
Sbjct: 121 YKEGDVLPDEVDWRERGAVVPRVKRQGECGSCWAFAATGAVEGINQITTGELVSLSEQEL 180

Query: 189 LDCST-NGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAK----- 242
           +DC   N N GC GG    AF +I +N GI +++ Y Y    G  +AA K    K     
Sbjct: 181 IDCDRGNDNFGCAGGGAVWAFEFIKENGGIVSDEVYGYT---GEDTAACKAIEMKTTRVV 237

Query: 243 -ISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQL-DHAVTIV 300
            I+ +E VP  DE +L KAV+ QP+S+ I+A       YK G++ G C     DH V IV
Sbjct: 238 TINGHEVVPVNDEMSLKKAVAYQPISVMISA--ANMSDYKSGVYKGACSNLWGDHNVLIV 295

Query: 301 GFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPL 347
           G+GT+ D  +YWLI+NSWG  WG+ GY+++ R+     G C +     YP+
Sbjct: 296 GYGTSSDEGDYWLIRNSWGPEWGEGGYLRLQRNFHEPTGKCAVAVAPVYPI 346


>gi|18407678|ref|NP_566867.1| putative cysteine proteinase [Arabidopsis thaliana]
 gi|30315950|sp|Q9LXW3.1|CPR2_ARATH RecName: Full=Probable cysteine proteinase At3g43960; Flags:
           Precursor
 gi|7594557|emb|CAB88124.1| cysteine proteinase-like protein [Arabidopsis thaliana]
 gi|26452289|dbj|BAC43231.1| putative cysteine proteinase [Arabidopsis thaliana]
 gi|332644328|gb|AEE77849.1| putative cysteine proteinase [Arabidopsis thaliana]
          Length = 376

 Score =  260 bits (665), Expect = 5e-67,   Method: Compositional matrix adjust.
 Identities = 142/351 (40%), Positives = 205/351 (58%), Gaps = 21/351 (5%)

Query: 10  SFKINTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLK 69
           SF+        ++ + +S      +    +E  V+ ++E+W+ ++G++Y    EKE R K
Sbjct: 4   SFRTLALLTLSVLLISISLGVVTATESQRNEGEVLTMYEQWLVENGKNYNGLGEKERRFK 63

Query: 70  IFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQ 129
           IFK+NL+ IE+ N + NR+Y+ G N+FSDLT DEF+A Y G KM     +S +    +YQ
Sbjct: 64  IFKDNLKRIEEHNSDPNRSYERGLNKFSDLTADEFQASYLGGKM---EKKSLSDVAERYQ 120

Query: 130 NLSMTDVPTSLDWRDKGAVTP-IKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQL 188
                 +P  +DWR++GAV P +K Q ECG CWAFAA  AVEGI +I +G L+ LSEQ+L
Sbjct: 121 YKEGDVLPDEVDWRERGAVVPRVKRQGECGSCWAFAATGAVEGINQITTGELVSLSEQEL 180

Query: 189 LDCST-NGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAK----- 242
           +DC   N N GC GG    AF +I +N GI +++ Y Y    G  +AA K    K     
Sbjct: 181 IDCDRGNDNFGCAGGGAVWAFEFIKENGGIVSDEVYGYT---GEDTAACKAIEMKTTRVV 237

Query: 243 -ISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQL-DHAVTIV 300
            I+ +E VP  DE +L KAV+ QP+S+ I+A       YK G++ G C     DH V IV
Sbjct: 238 TINGHEVVPVNDEMSLKKAVAYQPISVMISA--ANMSDYKSGVYKGACSNLWGDHNVLIV 295

Query: 301 GFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPL 347
           G+GT+ D  +YWLI+NSWG  WG+ GY+++ R+     G C +     YP+
Sbjct: 296 GYGTSSDEGDYWLIRNSWGPEWGEGGYLRLQRNFHEPTGKCAVAVAPVYPI 346


>gi|125604306|gb|EAZ43631.1| hypothetical protein OsJ_28254 [Oryza sativa Japonica Group]
          Length = 369

 Score =  260 bits (665), Expect = 6e-67,   Method: Compositional matrix adjust.
 Identities = 138/314 (43%), Positives = 188/314 (59%), Gaps = 23/314 (7%)

Query: 40  EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDL 99
           E+++ E++E+W  QH R  +D  EK  R  +FK+N+  I + N+  +  YKL  N+F D+
Sbjct: 41  EEALWELYERWRGQH-RVARDLGEKARRFNVFKDNVRLIHEFNRR-DEPYKLRLNRFGDM 98

Query: 100 TNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGC 159
           T DE    Y   ++    HR       K Q L              GAV  +K+Q +CG 
Sbjct: 99  TADESAGAYASSRVSH--HRMFRGRGEKAQRL-------------HGAVGAVKDQGQCGS 143

Query: 160 CWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GNNGCLGGSREKAFAYIIQNQGIA 218
           CWAF+ +AAVEGI  IR+ NL  LSEQQL+DC T  GN GC GG  + AF YI ++ G+A
Sbjct: 144 CWAFSTIAAVEGINAIRTSNLTALSEQQLVDCDTKTGNAGCDGGLMDNAFQYIAKHGGVA 203

Query: 219 TEDEYPYQAVPGTCSAAQKPAA-AKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEF 277
               YPY+A   +C ++   +    I  YE+VP+  E AL KAV+ QPVS+AI A  + F
Sbjct: 204 ASSAYPYRARQSSCKSSAASSPAVTIDGYEDVPANSESALKKAVANQPVSVAIEAGGSHF 263

Query: 278 QSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD---- 333
           Q Y EG+F G CGT+LDH V  VG+GTT DG  YW+++NSWG  WG+ GY+++ RD    
Sbjct: 264 QFYSEGVFAGKCGTELDHGVAAVGYGTTVDGTKYWIVRNSWGADWGEKGYIRMKRDVSAK 323

Query: 334 EGLCGIGTRSSYPL 347
           EGLCGI   +SYP+
Sbjct: 324 EGLCGIAMEASYPI 337


>gi|440793751|gb|ELR14926.1| Cysteine proteinase 5, putative [Acanthamoeba castellanii str.
           Neff]
          Length = 326

 Score =  260 bits (665), Expect = 7e-67,   Method: Compositional matrix adjust.
 Identities = 136/331 (41%), Positives = 198/331 (59%), Gaps = 14/331 (4%)

Query: 23  TLLVSCASQVVSSR-STHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKA 81
           TLL  C +  V+S  +     +  +   WM +H +SY +E E   R  +++EN  YIE  
Sbjct: 5   TLLALCVALFVASTFAVSHDPLTGVFADWMQEHQKSYANE-EFVYRWNVWRENYLYIEAH 63

Query: 82  NKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLD 141
           N + N+++ L  N+F DLTN EF  L+ G  + +   +  +             +P   D
Sbjct: 64  NHQ-NKSFHLAMNKFGDLTNAEFNKLFKGLSITADQAKQESDIA------PAPGLPADFD 116

Query: 142 WRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GNNGCL 200
           WR KGAVT +KNQ +CG CW+F+   + EG   ++ G L  LSEQ L+DCST+ GN+GC 
Sbjct: 117 WRQKGAVTHVKNQGQCGSCWSFSTTGSTEGANFLKHGRLTSLSEQNLVDCSTSYGNHGCN 176

Query: 201 GGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKA 260
           GG  + AF YII+N+GI TE+ YPY A  GTC   ++ +  ++ +Y  VPSG+E ALL A
Sbjct: 177 GGLMDYAFEYIIRNKGIDTEESYPYHASQGTCRYNKQHSGGELVSYTNVPSGNEGALLNA 236

Query: 261 VSMQPVSIAIAAYSTEFQSYKEGIFN--GVCGTQLDHAVTIVGFGTTEDGANYWLIKNSW 318
           V+ QP S+AI A  + FQ YK G+++      ++LDH V  VG+G   DG +YWL+KNSW
Sbjct: 237 VATQPTSVAIDASHSSFQFYKGGVYDEPACSSSRLDHGVLAVGWG-VRDGKDYWLVKNSW 295

Query: 319 GNTWGDAGYMKIVRDE-GLCGIGTRSSYPLA 348
           G  WG +GY+++ R++   CGI T +S+P A
Sbjct: 296 GADWGLSGYIEMSRNKHNQCGIATAASHPHA 326


>gi|238816977|gb|ACR56863.1| cathepsin L-like cysteine proteinase [Delia coarctata]
          Length = 338

 Score =  260 bits (664), Expect = 7e-67,   Method: Compositional matrix adjust.
 Identities = 130/314 (41%), Positives = 192/314 (61%), Gaps = 13/314 (4%)

Query: 46  IHEKWMA---QHGRSYKDELEKEMRLKIFKENLEYIEKANK---EGNRTYKLGTNQFSDL 99
           I E+W     +H ++Y  E+E+  R+KIF EN   I K N+   +G  ++KLG N+++D+
Sbjct: 23  IKEEWQTFKMEHRKNYLSEVEERFRMKIFNENRHKIAKHNQLYAQGKVSFKLGLNKYADM 82

Query: 100 TNDEFRALYTGYK--MPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKEC 157
            + EF+    GY   M          +   Y + +   VP ++DWR  GAVT +K+Q  C
Sbjct: 83  LHHEFKETMNGYNHTMRKELRAQEGFNGITYISPANVQVPKAVDWRQHGAVTSVKDQGHC 142

Query: 158 GCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GNNGCLGGSREKAFAYIIQNQG 216
           G CW+F++  ++EG    ++G L+ LSEQ L+DCST  GNNGC GG  + AF YI  N G
Sbjct: 143 GSCWSFSSTGSLEGQHFRKAGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKDNGG 202

Query: 217 IATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAV-SMQPVSIAIAAYST 275
           + TE  YPY+ +  +C   +    A  + + ++P GDE+A++KAV +M PV++AI A + 
Sbjct: 203 VDTEKSYPYEGIDDSCHFNKATVGATDTGFVDIPQGDEEAMMKAVATMGPVAVAIDASNE 262

Query: 276 EFQSYKEGIFN--GVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD 333
            FQ Y EG++N        LDH V +VG+GT +DG +YWL+KNSWG TWGD GY+K+ R+
Sbjct: 263 SFQLYSEGVYNDPNCSSDNLDHGVLVVGYGTDKDGQDYWLVKNSWGTTWGDQGYIKMARN 322

Query: 334 -EGLCGIGTRSSYP 346
            +  CGI T SS+P
Sbjct: 323 QDNQCGIATASSFP 336


>gi|156399477|ref|XP_001638528.1| predicted protein [Nematostella vectensis]
 gi|156225649|gb|EDO46465.1| predicted protein [Nematostella vectensis]
          Length = 325

 Score =  260 bits (664), Expect = 8e-67,   Method: Compositional matrix adjust.
 Identities = 144/334 (43%), Positives = 196/334 (58%), Gaps = 22/334 (6%)

Query: 19  FIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYI 78
            ++  L+  C S++   R  H          W   HG++Y  E E+++R  I+ +NLE +
Sbjct: 8   LLVAVLIAQCFSELSQDRQWH---------AWKDFHGKTYTGE-EEDLRRAIWNDNLEIV 57

Query: 79  EKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPT 138
           +K N E N +YKL  N F+DLT  EF+  + GY+  S    ST  STF    LS   +P 
Sbjct: 58  KKHNAE-NHSYKLDMNHFADLTVTEFKQRFMGYRAAS---NSTGGSTF--LPLSNVQLPA 111

Query: 139 SLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GNN 197
            +DWRDKG VT +KNQ +CG CWAF++  ++EG    ++G L+ LSEQ L+DCS   GNN
Sbjct: 112 EVDWRDKGFVTAVKNQGQCGSCWAFSSTGSLEGQHFRKTGKLVSLSEQNLVDCSKKYGNN 171

Query: 198 GCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQAL 257
           GC GG  + AF YI  N GI TE  YPY A  G C        A ++ Y +V  G E  L
Sbjct: 172 GCEGGLMDYAFKYIKNNDGIDTEQSYPYTARDGQCHFKPGSVGATVTGYTDVQRGSEGDL 231

Query: 258 LKAV-SMQPVSIAIAAYSTEFQSYKEGIFN--GVCGTQLDHAVTIVGFGTTEDGANYWLI 314
             AV ++ P+S+AI A  + FQ YK G+++      TQLDH V  VG+G  EDG +YWL+
Sbjct: 232 QSAVATVGPISVAIDAGHSSFQLYKTGVYSEPDCSSTQLDHGVLAVGYG-AEDGKDYWLV 290

Query: 315 KNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYPL 347
           KNSWG  WG  GY+K+ R+ +  CGI T++SYPL
Sbjct: 291 KNSWGEGWGMNGYIKMSRNKDNQCGIATQASYPL 324


>gi|149617838|ref|XP_001521715.1| PREDICTED: cathepsin L1-like [Ornithorhynchus anatinus]
          Length = 338

 Score =  260 bits (664), Expect = 8e-67,   Method: Compositional matrix adjust.
 Identities = 152/343 (44%), Positives = 209/343 (60%), Gaps = 19/343 (5%)

Query: 18  MFIIITLLVSCASQVVSSRSTHEQSVVEIHEK-WMAQHGRSYKDELEKEMRLKIFKENLE 76
           M +++ L+  C    VS+      S ++ H K W   H +SY  E E+  R  +++ENL+
Sbjct: 1   MNLLVCLVSLCWGLAVSAPLG--DSELDRHWKLWKNWHQKSYH-EAEEGWRRTVWEENLK 57

Query: 77  YIEKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSM 133
            I+  N E   G  TY+LG NQF DLTN+EF+ + TG +  S  +R   S+   +   + 
Sbjct: 58  AIQLHNLEQSLGLHTYRLGMNQFGDLTNEEFQEILTGERHFSKGNRINGSA---FLEANF 114

Query: 134 TDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS- 192
             VPTS+DWRD G VTP+KNQ  CG CWAF+   A+EG    +SG LI LSEQ L+DCS 
Sbjct: 115 VQVPTSVDWRDHGYVTPVKNQGHCGSCWAFSTTGALEGQLFRKSGRLISLSEQNLVDCSW 174

Query: 193 TNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVP-GTCSAAQKPAAAKISNYEEVPS 251
             GN GC GG  + AF YI+QNQGI +ED YPY A     C+   + A A ++ + ++P 
Sbjct: 175 QQGNQGCHGGIVDLAFQYILQNQGIDSEDCYPYTAKDTAQCTFKPECATAPVTGFVDIPP 234

Query: 252 GDEQALLKAV-SMQPVSIAIAAYSTEFQSYKEGIF-NGVCGTQ-LDHAVTIVGFG---TT 305
             E+AL+KAV ++ PVS+ I A ST F+ Y+ GIF +  C ++ LDHAV +VG+G     
Sbjct: 235 HSEEALMKAVATVGPVSVGIDASSTSFRFYQSGIFYDPKCSSESLDHAVLVVGYGYERED 294

Query: 306 EDGANYWLIKNSWGNTWGDAGYMKIVRDEG-LCGIGTRSSYPL 347
           E G  YW++KNSWG  WGD GY+ + +D G  CGI T +SYPL
Sbjct: 295 EAGKKYWIVKNSWGKHWGDRGYVYMSKDRGNHCGIATVASYPL 337


>gi|91992508|gb|ABE72970.1| cathepsin L [Aedes aegypti]
          Length = 339

 Score =  260 bits (664), Expect = 9e-67,   Method: Compositional matrix adjust.
 Identities = 139/323 (43%), Positives = 197/323 (60%), Gaps = 26/323 (8%)

Query: 46  IHEKWMA---QHGRSYKDELEKEMRLKIFKENLEYIEKANKE---GNRTYKLGTNQFSDL 99
           + E+W A   QH ++Y  E E+ +RLKI+ +N   I K N+    G   Y+L  N+++DL
Sbjct: 23  VKEEWNAFKLQHRKNYDSETEERIRLKIYVQNKHKIAKHNQRFDLGQEKYRLRVNKYADL 82

Query: 100 TNDEFRALYTGYKMPSPSHRSTTSSTFK---------YQNLSMTDVPTSLDWRDKGAVTP 150
            ++EF     G+      +R+ +  + K         +   +  +VPT++DWR KGAVTP
Sbjct: 83  LHEEFVQTVNGF------NRTDSKKSLKGVRIEEPVTFIEPANVEVPTTVDWRKKGAVTP 136

Query: 151 IKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GNNGCLGGSREKAFA 209
           +K+Q  CG CW+F+A  A+EG    ++G L+ LSEQ L+DCS   GNNGC GG  + AF 
Sbjct: 137 VKDQGHCGSCWSFSATGALEGQHFRKTGKLVSLSEQNLVDCSGKYGNNGCNGGMMDYAFQ 196

Query: 210 YIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAV-SMQPVSI 268
           YI  N GI TE  YPY+A+  TC    K   A    Y ++P GDE+AL KA+ ++ PVSI
Sbjct: 197 YIKDNGGIDTEKSYPYEAIDDTCHFNPKAVGATDKGYVDIPQGDEEALKKALATVGPVSI 256

Query: 269 AIAAYSTEFQSYKEGI-FNGVCGTQ-LDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAG 326
           AI A    FQ Y EG+ +   C ++ LDH V  VG+GT+E+G +YWL+KNSWG TWGD G
Sbjct: 257 AIDASHESFQFYSEGVYYEPQCDSENLDHGVLAVGYGTSEEGEDYWLVKNSWGTTWGDQG 316

Query: 327 YMKIVRD-EGLCGIGTRSSYPLA 348
           Y+K+ R+ +  CG+ T +SYPL 
Sbjct: 317 YVKMARNHDNHCGVATCASYPLV 339


>gi|158300877|ref|XP_001689282.1| AGAP011828-PA [Anopheles gambiae str. PEST]
 gi|157013372|gb|EDO63348.1| AGAP011828-PA [Anopheles gambiae str. PEST]
          Length = 344

 Score =  260 bits (664), Expect = 9e-67,   Method: Compositional matrix adjust.
 Identities = 143/348 (41%), Positives = 202/348 (58%), Gaps = 27/348 (7%)

Query: 19  FIIITLLVSCASQVVSSRSTHEQSVVE-IHEKWMA---QHGRSYKDELEKEMRLKIFKEN 74
           F+I+ L    A+  +S        + E + E+W A   QH + Y  E E+ +R+KI+ +N
Sbjct: 4   FLILILGFVAAANAIS--------IFELVKEEWTAFKLQHRKKYDSETEERIRMKIYVQN 55

Query: 75  LEYIEKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNL 131
              I K N+    G   ++L  N+++DL ++EF     G+               K    
Sbjct: 56  KHKIAKHNQRYDLGQEKFRLRVNKYADLLHEEFVHTLNGFNRSVSGKGQLLRGELKPIEE 115

Query: 132 SMT-------DVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLS 184
            +T       DVPT++DWR KGAVT +K+Q  CG CW+F+A  A+EG    ++G L+ LS
Sbjct: 116 PVTWIEPANVDVPTAMDWRTKGAVTQVKDQGHCGSCWSFSATGALEGQHFRKTGKLVSLS 175

Query: 185 EQQLLDCSTN-GNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKI 243
           EQ L+DCS   GNNGC GG  + AF YI  N+GI TE  YPY+A+   C    K   A  
Sbjct: 176 EQNLVDCSQKYGNNGCNGGMMDFAFQYIKDNKGIDTEKSYPYEAIDDECHYNPKAVGATD 235

Query: 244 SNYEEVPSGDEQALLKAV-SMQPVSIAIAAYSTEFQSYKEGI-FNGVCGT-QLDHAVTIV 300
             + ++P G+E+AL+KA+ ++ PVS+AI A    FQ Y EG+ +   C + QLDH V  V
Sbjct: 236 KGFVDIPQGNEKALMKALATVGPVSVAIDASHESFQFYSEGVYYEPQCDSEQLDHGVLAV 295

Query: 301 GFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYPL 347
           G+GTTEDG +YWL+KNSWG TWGD GY+K+ R+ +  CGI T +SYPL
Sbjct: 296 GYGTTEDGEDYWLVKNSWGTTWGDQGYVKMARNRDNHCGIATTASYPL 343


>gi|18390634|ref|NP_563764.1| cysteine proteinase-like protein [Arabidopsis thaliana]
 gi|8844131|gb|AAF80223.1|AC025290_12 Contains similarity to a cysteine endopeptidase 1 from Phaseolus
           vulgaris gb|U52970 and is a member of the papain
           cysteine protease family PF|00112 [Arabidopsis thaliana]
 gi|332189848|gb|AEE27969.1| cysteine proteinase-like protein [Arabidopsis thaliana]
          Length = 343

 Score =  259 bits (663), Expect = 1e-66,   Method: Compositional matrix adjust.
 Identities = 141/343 (41%), Positives = 199/343 (58%), Gaps = 21/343 (6%)

Query: 15  TTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKEN 74
           T  + I   L+ S    V SS     +++ +  EKW+  H + Y    E  +R  I++ N
Sbjct: 11  TLAVLICFVLIASKLCSVDSSVYDPHKTLKQRFEKWLKTHSKLYGGRDEWMLRFGIYQSN 70

Query: 75  LEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMT 134
           ++ I+  N   +  +KL  N+F+D+TN EF+A + G         +T+S     +   + 
Sbjct: 71  VQLIDYINSL-HLPFKLTDNRFADMTNSEFKAHFLGL--------NTSSLRLHKKQRPVC 121

Query: 135 D----VPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLD 190
           D    VP ++DWR +GAVTPI+NQ +CG CWAF+AVAA+EGI KI++GNL+ LSEQQL+D
Sbjct: 122 DPAGNVPDAVDWRTQGAVTPIRNQGKCGGCWAFSAVAAIEGINKIKTGNLVSLSEQQLID 181

Query: 191 CSTNG-NNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQ-KPAAAKISNYEE 248
           C     N GC GG  E AF +I  N G+ATE +YPY  + GTC   + K     I  Y++
Sbjct: 182 CDVGTYNKGCSGGLMETAFEFIKTNGGLATETDYPYTGIEGTCDQEKSKNKVVTIQGYQK 241

Query: 249 VPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDG 308
           V   +E +L  A + QPVS+ I A    FQ Y  G+F   CGT L+H VT+VG+G   D 
Sbjct: 242 VAQ-NEASLQIAAAQQPVSVGIDAGGFIFQLYSSGVFTNYCGTNLNHGVTVVGYGVEGD- 299

Query: 309 ANYWLIKNSWGNTWGDAGYMKIVR----DEGLCGIGTRSSYPL 347
             YW++KNSWG  WG+ GY+++ R    D G CGI   +SYPL
Sbjct: 300 QKYWIVKNSWGTGWGEEGYIRMERGVSEDTGKCGIAMMASYPL 342


>gi|388890776|gb|AFK80364.1| cysteine proteinase 3, partial [Acanthamoeba castellanii]
          Length = 329

 Score =  259 bits (663), Expect = 1e-66,   Method: Compositional matrix adjust.
 Identities = 140/335 (41%), Positives = 195/335 (58%), Gaps = 10/335 (2%)

Query: 18  MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
           M  I  L++  A  V S+ +T    +  +  +WM  + +SY +E E   R  +++EN + 
Sbjct: 1   MRAITILVLLAAICVASTLATTHDPLTGVFAEWMRDNSKSYSNE-EFVFRWNVWRENQQL 59

Query: 78  IEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVP 137
           IE+ N+  N+T  L  N+F DLTN EF  L+ G       H +  ++    + +    + 
Sbjct: 60  IEEHNRS-NKTSFLAMNKFGDLTNAEFNKLFKGLAFDYSFHANKAAAE---KAVPAPGLS 115

Query: 138 TSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS-TNGN 196
              DWR KGAVT +KNQ +CG CW+F+   + EG   +++G L  LSEQ L+DCS + GN
Sbjct: 116 ADFDWRQKGAVTHVKNQGQCGSCWSFSTTGSTEGANFLKTGRLTSLSEQNLIDCSGSYGN 175

Query: 197 NGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQA 256
           NGC GG  + AF YII N+GI TE  YPYQ    TC      +   +++Y +V SGDE A
Sbjct: 176 NGCNGGLMDYAFEYIINNKGIDTEASYPYQTAQYTCQYNPANSGGSLTSYTDVSSGDENA 235

Query: 257 LLKAVSMQPVSIAIAAYSTEFQSYKEGIF--NGVCGTQLDHAVTIVGFGTTEDGANYWLI 314
           LL AV+ +P S+AI A    FQ Y  G++  +    TQLDH V  VG+G TEDG +YWL+
Sbjct: 236 LLNAVATEPTSVAIDASHNSFQFYSGGVYYESACSSTQLDHGVLAVGWG-TEDGQDYWLV 294

Query: 315 KNSWGNTWGDAGYMKIVRDE-GLCGIGTRSSYPLA 348
           KNSWG  WG AGY+K+ R+    CGI T +SYP A
Sbjct: 295 KNSWGADWGLAGYIKMARNRSNNCGIATSASYPTA 329


>gi|402770501|gb|AFQ98385.1| cathepsin L [Rhipicephalus microplus]
          Length = 332

 Score =  259 bits (663), Expect = 1e-66,   Method: Compositional matrix adjust.
 Identities = 141/336 (41%), Positives = 205/336 (61%), Gaps = 14/336 (4%)

Query: 21  IITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEK 80
           ++ L V CA   V+  ++ ++ +    E +   H ++Y+  +E+ +R KIF EN   I K
Sbjct: 1   MLRLSVLCAIAAVTVAASSQEILRTQWEAFKTTHKKTYQSHMEELLRFKIFTENSLIIAK 60

Query: 81  ANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTF-KYQNLSMTDV 136
            N +   G  +YKLG NQF DL   EF  ++ G+       R T  STF    N++ + +
Sbjct: 61  HNAKYAKGLVSYKLGMNQFGDLLAHEFARIFNGHH----GTRKTGGSTFLPPANVNDSSL 116

Query: 137 PTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN-G 195
           P ++DWR KGAVTP+K+Q +CG CWAF+A  ++EG   +++G L+ LSEQ L+DCS + G
Sbjct: 117 PKAVDWRKKGAVTPVKDQGQCGSCWAFSATGSLEGQHFLKNGELVSLSEQNLVDCSQSFG 176

Query: 196 NNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQ 255
           NNGC GG  E AF YI  N GI TE  YPY+AV G C   ++   A  + Y E+ +G E 
Sbjct: 177 NNGCEGGLMEDAFKYIKANDGIDTEKSYPYEAVDGECRFKKEDVGATDTGYVEIKAGSED 236

Query: 256 ALLKAV-SMQPVSIAIAAYSTEFQSYKEGIFNGV-CGTQ-LDHAVTIVGFGTTEDGANYW 312
            L KAV ++ P+S+AI A  + FQ Y EG+++   C ++ LDH V +VG+G  + G  YW
Sbjct: 237 DLKKAVATVGPISVAIDASHSSFQLYSEGVYDEPECSSEDLDHGVLVVGYG-VKGGKKYW 295

Query: 313 LIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYPL 347
           L+KNSW  +WGD GY+ + RD    CGI +++SYPL
Sbjct: 296 LVKNSWAESWGDQGYILMSRDNNNQCGIASQASYPL 331


>gi|449530091|ref|XP_004172030.1| PREDICTED: vignain-like [Cucumis sativus]
          Length = 351

 Score =  259 bits (662), Expect = 1e-66,   Method: Compositional matrix adjust.
 Identities = 138/351 (39%), Positives = 210/351 (59%), Gaps = 21/351 (5%)

Query: 12  KINTTPMFIIITLLVSCASQVVSSRSTH-EQSVVEIHEKWMAQHGRSYKDELEKEMRLKI 70
           K    P+ +I  L   C S  +  +    E+S+++++++W + H R  ++  E   R K+
Sbjct: 5   KFLIVPLVLIAFLCNICESFELERKDFESEKSLMQLYKRWSSHH-RISRNANEMHNRFKV 63

Query: 71  FKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALY----TGYKMPSPSHRSTTSST- 125
           FK N +++ K N  G ++ KL  NQF+D+++DEFR +Y    T YK         T    
Sbjct: 64  FKNNAKHVFKVNLMG-KSLKLKLNQFADMSDDEFRNMYSSNITYYKDLHAKKIEATGGRI 122

Query: 126 --FKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQL 183
             F Y++ +  ++P+S+DWR KGAV  IKNQ  CG CWAFAAVAAVE I +I++  L+ L
Sbjct: 123 GGFMYEHAN--NIPSSIDWRKKGAVNAIKNQGRCGSCWAFAAVAAVESIHQIKTNELVSL 180

Query: 184 SEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTC-SAAQKPAAAK 242
           SE+++LDC    + GC GG    AF +++ N G+  ED YPY    G C     +    +
Sbjct: 181 SEEEVLDCDYR-DGGCRGGFYNSAFEFMMDNDGVTIEDNYPYYEGNGYCRRRGGRNKRVR 239

Query: 243 ISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIF--NGVCGTQLDHAVTIV 300
           I  YE VP  +E AL+KAV+ QPV++AIA+  ++F+ Y  G+F  N  CG  +DH V +V
Sbjct: 240 IDGYENVPRNNEYALMKAVAHQPVAVAIASGGSDFKFYGGGMFTENDFCGFNIDHTVVVV 299

Query: 301 GFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPL 347
           G+GT EDG +YW+I+N +G+ WG  GYMK+ R     +G+CG+  + +YP+
Sbjct: 300 GYGTDEDG-DYWIIRNQYGHRWGMNGYMKMQRGAHSPQGVCGMAMQPAYPV 349


>gi|195153545|ref|XP_002017686.1| GL17172 [Drosophila persimilis]
 gi|194113482|gb|EDW35525.1| GL17172 [Drosophila persimilis]
          Length = 341

 Score =  259 bits (662), Expect = 1e-66,   Method: Compositional matrix adjust.
 Identities = 141/346 (40%), Positives = 202/346 (58%), Gaps = 22/346 (6%)

Query: 16  TPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMA---QHGRSYKDELEKEMRLKIFK 72
           T + + +  LV+ A Q VS           I E+W     +H ++Y+DE E+  RLKIF 
Sbjct: 3   TALILPLLALVAVA-QAVSYAEV-------IQEEWHTFKLEHRKNYQDETEERFRLKIFN 54

Query: 73  ENLEYIEKANK---EGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFK-- 127
           EN   I K N+    G  ++K+  N+++D+ + EF +   G+             +FK  
Sbjct: 55  ENKHKIAKHNQLWATGAVSFKMAVNKYADMLHHEFYSTMNGFNYTLHKQLRNADESFKGV 114

Query: 128 -YQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQ 186
            + +     +P  +DWR KGAVT +K+Q  CG CWAF++  A+EG    +SG L+ LSEQ
Sbjct: 115 TFISPEHVTLPKQVDWRTKGAVTDVKDQGHCGSCWAFSSTGALEGQHYRKSGVLVSLSEQ 174

Query: 187 QLLDCSTN-GNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISN 245
            L+DCST  GNNGC GG  + AF YI  N GI TE  YPY+A+  +C   +    A    
Sbjct: 175 NLVDCSTKYGNNGCNGGLMDNAFRYIKDNGGIDTEKSYPYEAIDDSCHFNKGSIGATDRG 234

Query: 246 YEEVPSGDEQALLKAV-SMQPVSIAIAAYSTEFQSYKEGIFN-GVCGTQ-LDHAVTIVGF 302
           + ++P G+E+ + +AV ++ PV++AI A    FQ Y EG++N   C  Q LDH V +VGF
Sbjct: 235 FVDIPQGNEKKMAEAVATIGPVAVAIDASHESFQFYSEGVYNEPACDAQNLDHGVLVVGF 294

Query: 303 GTTEDGANYWLIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYPL 347
           GT E G +YWL+KNSWG TWGD G++K++R+ E  CGI + SSYPL
Sbjct: 295 GTDESGEDYWLVKNSWGTTWGDKGFIKMLRNKENQCGIASASSYPL 340


>gi|238007404|gb|ACR34737.1| unknown [Zea mays]
 gi|413943289|gb|AFW75938.1| cysteine proteinase Mir2 [Zea mays]
          Length = 484

 Score =  259 bits (662), Expect = 1e-66,   Method: Compositional matrix adjust.
 Identities = 138/327 (42%), Positives = 199/327 (60%), Gaps = 21/327 (6%)

Query: 40  EQSVVEIHEKWMAQH----------GRSYKDELEKEMRLKIFKENLEYIEKANKE---GN 86
           ++ V  ++E+W ++H          G     E +   RL++F+ NL YI+  N E   G 
Sbjct: 46  DEEVRRLYEEWRSEHDAGPRRGATGGSLGPGEDDDARRLEVFRYNLRYIDAHNAEADAGL 105

Query: 87  RTYKLGTNQFSDLTNDEFRA-LYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDK 145
             ++LG  +F+DLT +E+RA L  G +  + +      S  +Y  L+   +P ++DWR++
Sbjct: 106 HGFRLGLTRFADLTLEEYRARLLLGSRGRNGTAVGVVGSR-RYLPLAGEQLPDAVDWRER 164

Query: 146 GAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSRE 205
           GAV  +K+Q +CG CWAF+AVAAVEGI KI +G+LI LSEQ+L+DC    + GC GG  +
Sbjct: 165 GAVAEVKDQGQCGACWAFSAVAAVEGINKIVTGSLISLSEQELIDCDKFQDQGCDGGLMD 224

Query: 206 KAFAYIIQNQGIATEDEYPYQAVPGTCSAAQK-PAAAKISNYEEVPSGDEQALLKAVSMQ 264
            AF ++I+N GI TE +YP+    GTC    K      I ++E VP   E+AL KAV+ Q
Sbjct: 225 NAFVFMIKNGGIDTEADYPFTGHDGTCDLKLKNTRVVSIDSFERVPINYERALQKAVAHQ 284

Query: 265 PVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGD 324
           PVS +I A    FQ Y  GIF+G CGT LDH VT+VG+G +E G +YW++KNSWG  WG+
Sbjct: 285 PVSASIEASRRAFQLYSSGIFDGRCGTYLDHGVTVVGYG-SEGGKDYWIVKNSWGTQWGE 343

Query: 325 AGYMKIVRD----EGLCGIGTRSSYPL 347
           AGY+++ R+     G CGI     YP+
Sbjct: 344 AGYVRMARNVRVRAGKCGIAMEPLYPV 370


>gi|402770505|gb|AFQ98387.1| cathepsin L [Rhipicephalus microplus]
          Length = 332

 Score =  259 bits (662), Expect = 1e-66,   Method: Compositional matrix adjust.
 Identities = 141/336 (41%), Positives = 206/336 (61%), Gaps = 14/336 (4%)

Query: 21  IITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEK 80
           ++ L V CA   V+  ++ ++ +    E +   H ++Y+  +E+ +R KIF EN   I K
Sbjct: 1   MLRLSVLCAIVAVTVAASSQEILRTQWEAFKTTHKKTYQSHMEELLRFKIFTENSLIIAK 60

Query: 81  ANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTF-KYQNLSMTDV 136
            N +   G  +YKLG NQF DL   EF  ++ G++      R T  STF    N++ + +
Sbjct: 61  HNAKYAKGLVSYKLGMNQFGDLLAHEFARIFNGHR----GTRKTGGSTFLPPANVNDSSL 116

Query: 137 PTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN-G 195
           P ++DWR KGAVTP+K+Q +CG CWAF+A  ++EG   +++G L+ LSEQ L+DCS + G
Sbjct: 117 PKAVDWRKKGAVTPVKDQGQCGSCWAFSATGSLEGQHFLKNGELVSLSEQNLVDCSQSFG 176

Query: 196 NNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQ 255
           NNGC GG  E AF YI  N GI TE  YPY+AV G C   ++   A  + Y E+ +G E 
Sbjct: 177 NNGCEGGLMEDAFKYIKANDGIDTEKSYPYEAVDGECRFKKEDVGATDTGYVEIKAGSEV 236

Query: 256 ALLKAV-SMQPVSIAIAAYSTEFQSYKEGIFNGV-CGTQ-LDHAVTIVGFGTTEDGANYW 312
            L KAV ++ P+S+AI A  + FQ Y EG+++   C ++ LDH V +VG+G  + G  YW
Sbjct: 237 DLKKAVATVGPISVAIDASHSSFQLYSEGVYDEPECSSEDLDHGVLVVGYG-VKGGKKYW 295

Query: 313 LIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYPL 347
           L+KNSW  +WGD GY+ + RD    CGI +++SYPL
Sbjct: 296 LVKNSWAESWGDQGYILMSRDNNNQCGIASQASYPL 331


>gi|125811033|ref|XP_001361727.1| GA25021 [Drosophila pseudoobscura pseudoobscura]
 gi|54636904|gb|EAL26307.1| GA25021 [Drosophila pseudoobscura pseudoobscura]
          Length = 341

 Score =  259 bits (662), Expect = 1e-66,   Method: Compositional matrix adjust.
 Identities = 141/346 (40%), Positives = 202/346 (58%), Gaps = 22/346 (6%)

Query: 16  TPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMA---QHGRSYKDELEKEMRLKIFK 72
           T + + +  LV+ A Q VS           I E+W     +H ++Y+DE E+  RLKIF 
Sbjct: 3   TALILPLLALVAVA-QAVSYAEV-------IQEEWHTFKLEHRKNYQDETEERFRLKIFN 54

Query: 73  ENLEYIEKANK---EGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFK-- 127
           EN   I K N+    G  ++K+  N+++D+ + EF +   G+             +FK  
Sbjct: 55  ENKHKIAKHNQLWATGAVSFKMAVNKYADMLHHEFYSTMNGFNYTLHKQLRNADESFKGV 114

Query: 128 -YQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQ 186
            + +     +P  +DWR KGAVT +K+Q  CG CWAF++  A+EG    +SG L+ LSEQ
Sbjct: 115 TFISPEHVTLPKQVDWRTKGAVTDVKDQGHCGSCWAFSSTGALEGQHYRKSGVLVSLSEQ 174

Query: 187 QLLDCSTN-GNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISN 245
            L+DCST  GNNGC GG  + AF YI  N GI TE  YPY+A+  +C   +    A    
Sbjct: 175 NLVDCSTKYGNNGCNGGLMDNAFRYIKDNGGIDTEKSYPYEAIDDSCHFNKGTIGATDRG 234

Query: 246 YEEVPSGDEQALLKAV-SMQPVSIAIAAYSTEFQSYKEGIFN-GVCGTQ-LDHAVTIVGF 302
           + ++P G+E+ + +AV ++ PV++AI A    FQ Y EG++N   C  Q LDH V +VGF
Sbjct: 235 FVDIPQGNEKKMAEAVATIGPVAVAIDASHESFQFYSEGVYNEPACDAQNLDHGVLVVGF 294

Query: 303 GTTEDGANYWLIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYPL 347
           GT E G +YWL+KNSWG TWGD G++K++R+ E  CGI + SSYPL
Sbjct: 295 GTDESGQDYWLVKNSWGTTWGDKGFIKMLRNKENQCGIASASSYPL 340


>gi|402770503|gb|AFQ98386.1| cathepsin L [Rhipicephalus microplus]
          Length = 332

 Score =  259 bits (662), Expect = 1e-66,   Method: Compositional matrix adjust.
 Identities = 141/336 (41%), Positives = 205/336 (61%), Gaps = 14/336 (4%)

Query: 21  IITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEK 80
           ++ L V CA   V+  ++ ++ +    E +   H ++Y+  +E+ +R KIF EN   I K
Sbjct: 1   MLRLSVLCAIVAVTVAASSQEILRTQWEAFKTTHKKTYQSHMEELLRFKIFTENSLIIAK 60

Query: 81  ANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTF-KYQNLSMTDV 136
            N +   G  +YKLG NQF DL   EF  ++ G+       R T  STF    N++ + +
Sbjct: 61  HNAKYAKGLVSYKLGMNQFGDLLAHEFARIFNGHH----GTRKTGGSTFLPPANVNDSSL 116

Query: 137 PTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN-G 195
           P  +DWR KGAVTP+K+Q +CG CWAF+A  ++EG   +++G L+ LSEQ L+DCS + G
Sbjct: 117 PKVVDWRKKGAVTPVKDQGQCGSCWAFSATGSLEGRHFLKNGELVSLSEQNLVDCSQSFG 176

Query: 196 NNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQ 255
           NNGC GG  E AF YI +N GI TE  YPY+AV G C   ++   A  + Y E+ +G E 
Sbjct: 177 NNGCEGGLMEDAFKYIKENDGIDTEKSYPYEAVDGECRFKKEDVGATDTGYVEIKAGSED 236

Query: 256 ALLKAV-SMQPVSIAIAAYSTEFQSYKEGIFNGV-CGTQ-LDHAVTIVGFGTTEDGANYW 312
            L KAV ++ P+S+AI A  + FQ Y EG+++   C ++ LDH V +VG+G  + G  YW
Sbjct: 237 DLKKAVATVGPISVAIDASHSSFQLYSEGVYDEPECSSEDLDHGVLVVGYG-VKGGKKYW 295

Query: 313 LIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYPL 347
           L+KNSW  +WGD GY+ + RD    CGI +++SYPL
Sbjct: 296 LVKNSWAESWGDQGYILMSRDNNNQCGIASQASYPL 331


>gi|129614|sp|P00784.1|PAPA1_CARPA RecName: Full=Papain; AltName: Full=Papaya proteinase I; Short=PPI;
           AltName: Allergen=Car p 1; Flags: Precursor
 gi|167391|gb|AAB02650.1| papain precursor [Carica papaya]
 gi|387885|gb|AAA72774.1| papain [synthetic construct]
 gi|225437|prf||1303270A papain
          Length = 345

 Score =  259 bits (662), Expect = 1e-66,   Method: Compositional matrix adjust.
 Identities = 144/337 (42%), Positives = 206/337 (61%), Gaps = 17/337 (5%)

Query: 18  MFIIITLLVSCASQVVSSRS--THEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENL 75
           +F+ + L     S V  S++  T  + ++++ E WM +H + YK+  EK  R +IFK+NL
Sbjct: 17  LFVYMGLSFGDFSIVGYSQNDLTSTERLIQLFESWMLKHNKIYKNIDEKIYRFEIFKDNL 76

Query: 76  EYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTD 135
           +YI++ NK+ N +Y LG N F+D++NDEF+  YTG    + ++ +T  S  +  N    +
Sbjct: 77  KYIDETNKK-NNSYWLGLNVFADMSNDEFKEKYTG--SIAGNYTTTELSYEEVLNDGDVN 133

Query: 136 VPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG 195
           +P  +DWR KGAVTP+KNQ  CG CWAF+AV  +EGI KIR+GNL + SEQ+LLDC    
Sbjct: 134 IPEYVDWRQKGAVTPVKNQGSCGSCWAFSAVVTIEGIIKIRTGNLNEYSEQELLDCDRR- 192

Query: 196 NNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQK-PAAAKISNYEEVPSGDE 254
           + GC GG    A   + Q  GI   + YPY+ V   C + +K P AAK     +V   +E
Sbjct: 193 SYGCNGGYPWSALQLVAQ-YGIHYRNTYPYEGVQRYCRSREKGPYAAKTDGVRQVQPYNE 251

Query: 255 QALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLI 314
            ALL +++ QPVS+ + A   +FQ Y+ GIF G CG ++DHAV  VG+     G NY LI
Sbjct: 252 GALLYSIANQPVSVVLEAAGKDFQLYRGGIFVGPCGNKVDHAVAAVGY-----GPNYILI 306

Query: 315 KNSWGNTWGDAGYMKIVR----DEGLCGIGTRSSYPL 347
           KNSWG  WG+ GY++I R      G+CG+ T S YP+
Sbjct: 307 KNSWGTGWGENGYIRIKRGTGNSYGVCGLYTSSFYPV 343


>gi|156398078|ref|XP_001638016.1| predicted protein [Nematostella vectensis]
 gi|156225133|gb|EDO45953.1| predicted protein [Nematostella vectensis]
          Length = 326

 Score =  259 bits (661), Expect = 2e-66,   Method: Compositional matrix adjust.
 Identities = 139/335 (41%), Positives = 205/335 (61%), Gaps = 17/335 (5%)

Query: 18  MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
           +F+++ +LV+ +S+  S R   +   V     W + HG+SY D  E+  R+ I+++NLE 
Sbjct: 3   VFLVLCVLVA-SSRGWSVRFGQDSEWV----AWKSYHGKSYSDVHEERTRMAIWQQNLEK 57

Query: 78  IEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVP 137
           I++ N E + +YK+  N   DLT DEFR  Y G +     H ST      Y   S   +P
Sbjct: 58  IKRHNAE-DHSYKMAMNHLGDLTEDEFRYFYLGVR---AHHNSTKRGWATYMPPSNVKIP 113

Query: 138 TSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS-TNGN 196
           +S+DW  KG VT +KNQ +CG CWAF+   +VEG    ++G+L+ LSEQ L+DCS + GN
Sbjct: 114 SSVDWSQKGYVTGVKNQGQCGSCWAFSTTGSVEGQHFRKTGSLVSLSEQNLIDCSGSYGN 173

Query: 197 NGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQA 256
           NGC GG  + AF YI  N GI TE  YPY    G+C  +     A+++ Y+++P G EQA
Sbjct: 174 NGCQGGLMDNAFRYIESNGGIDTESSYPYLGQQGSCHFSSSHVGARVTGYQDIPQGSEQA 233

Query: 257 LLKAV-SMQPVSIAIAAYSTEFQSYKEGIF-NGVC-GTQLDHAVTIVGFGTTEDGANYWL 313
           L  AV ++ PVS+A+ A  +++Q Y  G++ N  C  TQLDH V ++G+G   +G +YWL
Sbjct: 234 LQSAVATVGPVSVAVDA--SQWQFYSSGVYDNPYCSSTQLDHGVLVIGYGNY-NGQDYWL 290

Query: 314 IKNSWGNTWGDAGYMKIVRDE-GLCGIGTRSSYPL 347
           +KNSWG +WG  GY+ + R++   CGI + +SYPL
Sbjct: 291 VKNSWGYSWGVEGYIMMSRNKNNQCGIASSASYPL 325


>gi|55740406|gb|AAV63979.1| cathepsin L1 precursor [Artemia parthenogenetica]
          Length = 338

 Score =  259 bits (661), Expect = 2e-66,   Method: Compositional matrix adjust.
 Identities = 137/335 (40%), Positives = 207/335 (61%), Gaps = 11/335 (3%)

Query: 21  IITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEK 80
           +I LL +   Q+ ++ S       E H  + A H + Y  +LE+++R+KI+ EN   + K
Sbjct: 6   LIFLLAAVLVQLSAALSLTNLLADEWH-LFKATHKKEYPSQLEEKLRMKIYLENKHKVAK 64

Query: 81  AN---KEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVP 137
            N   ++G ++Y++  N+F DL + EFR++  GY+     + S   STF +   +  +VP
Sbjct: 65  HNILYEKGEKSYQVAMNKFGDLLHHEFRSIMNGYQHKK-QNSSRAESTFTFMEPANVEVP 123

Query: 138 TSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GN 196
            S+DWR+KGA+TP+K+Q +CG CWAF++  A+EG T  ++G L+ LSEQ L+DCS   GN
Sbjct: 124 ESVDWREKGAITPVKDQGQCGSCWAFSSTGALEGQTFRKTGKLVSLSEQNLIDCSGKYGN 183

Query: 197 NGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQA 256
            GC GG  ++AF YI  N+GI TE+ YPY+A  G C    +   A    + ++PSG+E  
Sbjct: 184 EGCNGGLMDQAFQYIKDNKGIDTENTYPYEAEDGVCRYNPRNRGAVDRGFVDIPSGEEDK 243

Query: 257 LLKAV-SMQPVSIAIAAYSTEFQSYKEG-IFNGVCGT-QLDHAVTIVGFGTTEDGANYWL 313
           L  AV ++ PVS+AI A    FQ Y +G  +   C +  LDH V +VG+G +++G +YWL
Sbjct: 244 LKAAVATVGPVSVAIDASHESFQFYSKGXYYEPSCDSDDLDHGVLVVGYG-SDNGEDYWL 302

Query: 314 IKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYPL 347
           +KNSW   WGD GY+KI R+ +  CG+ T +SYPL
Sbjct: 303 VKNSWSEHWGDEGYIKIARNRKNHCGVATAASYPL 337


>gi|7381610|gb|AAF61565.1|AF227957_1 cathepsin L-like proteinase precursor [Rhipicephalus microplus]
          Length = 332

 Score =  258 bits (660), Expect = 2e-66,   Method: Compositional matrix adjust.
 Identities = 142/336 (42%), Positives = 204/336 (60%), Gaps = 14/336 (4%)

Query: 21  IITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEK 80
           ++ L V CA   V+  ++ ++ +    E +   H +SY+  +E+ +R KIF EN   I K
Sbjct: 1   MLRLSVLCAIVAVTVAASSQEILRTQWEAFKTTHKKSYQSHMEELLRFKIFTENSLIIAK 60

Query: 81  ANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTF-KYQNLSMTDV 136
            N +   G  +YKLG NQF DL   EF  ++ G+       R T  STF    N++ + +
Sbjct: 61  HNAKYAKGLVSYKLGMNQFGDLLAHEFARIFNGHH----GTRKTGGSTFLPPANVNDSSL 116

Query: 137 PTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN-G 195
           P  +DWR KGAVTP+K+Q +CG CWAF+A  ++EG   +++G L+ LSEQ L+DCS + G
Sbjct: 117 PKVVDWRKKGAVTPVKDQGQCGSCWAFSATGSLEGQHFLKNGELVSLSEQNLVDCSQSFG 176

Query: 196 NNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQ 255
           NNGC GG  E AF YI  N GI TE  YPY+AV G C   ++   A  + Y E+ +G E 
Sbjct: 177 NNGCEGGLMEDAFKYIKANDGIDTEKSYPYEAVDGECRFKKEDVGATDTGYVEIKAGSEV 236

Query: 256 ALLKAV-SMQPVSIAIAAYSTEFQSYKEGIFNGV-CGTQ-LDHAVTIVGFGTTEDGANYW 312
            L KAV ++ P+S+AI A  + FQ Y EG+++   C ++ LDH V +VG+G  + G  YW
Sbjct: 237 DLKKAVATVGPISVAIDASHSSFQLYSEGVYDEPECSSEDLDHGVLVVGYG-VKGGKKYW 295

Query: 313 LIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYPL 347
           L+KNSW  +WGD GY+ + RD    CGI +++SYPL
Sbjct: 296 LVKNSWAESWGDQGYILMSRDNNNQCGIASQASYPL 331


>gi|413944252|gb|AFW76901.1| hypothetical protein ZEAMMB73_101481 [Zea mays]
          Length = 232

 Score =  258 bits (659), Expect = 3e-66,   Method: Compositional matrix adjust.
 Identities = 117/229 (51%), Positives = 157/229 (68%), Gaps = 6/229 (2%)

Query: 123 SSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQ 182
           S+ F+Y+N+S+  +P ++DWR  GAVTPIK+Q +CGCCWAF+AVAA EGI KI +G LI 
Sbjct: 3   STGFRYENVSVDAIPATIDWRTNGAVTPIKDQGQCGCCWAFSAVAATEGIVKISTGKLIS 62

Query: 183 LSEQQLLDCSTNG-NNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAA 241
           LSEQ+L+DC   G + GC GG  + AF +II+N G+ TE  YPY A  G C +    +AA
Sbjct: 63  LSEQELVDCDVYGEDQGCEGGLMDDAFKFIIKNGGLTTESNYPYTAADGKCKSGSN-SAA 121

Query: 242 KISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVG 301
            I  YE+VP+ DE AL+KAV+ QPVS+A+      FQ Y  G+  G CGT LDH +  +G
Sbjct: 122 NIKGYEDVPTNDEAALMKAVANQPVSVAVDGGDMTFQFYSGGVMTGSCGTDLDHGIAAIG 181

Query: 302 FGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYP 346
           +G T DG  YWL+KNSWG TWG+ GY+++ +D    +G+CG+    SYP
Sbjct: 182 YGKTSDGTKYWLMKNSWGTTWGENGYLRMEKDISDKKGMCGLAIEPSYP 230


>gi|222641485|gb|EEE69617.1| hypothetical protein OsJ_29194 [Oryza sativa Japonica Group]
          Length = 360

 Score =  258 bits (659), Expect = 3e-66,   Method: Compositional matrix adjust.
 Identities = 139/347 (40%), Positives = 203/347 (58%), Gaps = 28/347 (8%)

Query: 11  FKINTTPMFIIITLLVSC----ASQVVSSRSTH-------EQSVVEIHEKWMAQHGRSYK 59
           F    +P  + + LL SC    A+ ++ +R+T        +  +++    W   H RSY 
Sbjct: 4   FMACASPPVLTLALLASCGALLATSMLPARATAGSCLDVGDMVMMDRFRAWQGAHNRSYP 63

Query: 60  DELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKM---PSP 116
              E   R  +++ N E+I+  N  G+ TY+L  N+F+DLT +EF A YTGY     P  
Sbjct: 64  SAEEALQRFDVYRRNAEFIDAVNLRGDLTYQLAENEFADLTEEEFLATYTGYYAGDGPVD 123

Query: 117 SHRSTT-----SSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKE-CGCCWAFAAVAAVE 170
               TT      ++F Y+     DVP S+DWR +GAV P K+Q   C  CWAF   A +E
Sbjct: 124 DSVITTGAGDVDASFSYR----VDVPASVDWRAQGAVVPPKSQTSTCSSCWAFVTAATIE 179

Query: 171 GITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPG 230
            +  I++G L+ LSEQQL+DC +  + GC  GS  +A+ ++++N G+ TE +YPY A  G
Sbjct: 180 SLNMIKTGKLVSLSEQQLVDCDSY-DGGCNLGSYGRAYKWVVENGGLTTEADYPYTARRG 238

Query: 231 TCSAAQKP-AAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVC 289
            C+ A+    AAKI+ + +VP  +E AL  AV+ QPV++AI    +  Q YK G++ G C
Sbjct: 239 PCNRAKSAHHAAKITGFGKVPPRNEAALQAAVARQPVAVAIEV-GSGMQFYKGGVYTGPC 297

Query: 290 GTQLDHAVTIVGFGT-TEDGANYWLIKNSWGNTWGDAGYMKIVRDEG 335
           GT+L HAVT+VG+GT    GA YW IKNSWG +WG+ GY++I+RD G
Sbjct: 298 GTRLAHAVTVVGYGTDASSGAKYWTIKNSWGQSWGERGYIRILRDVG 344


>gi|242040563|ref|XP_002467676.1| hypothetical protein SORBIDRAFT_01g032090 [Sorghum bicolor]
 gi|241921530|gb|EER94674.1| hypothetical protein SORBIDRAFT_01g032090 [Sorghum bicolor]
          Length = 358

 Score =  258 bits (659), Expect = 3e-66,   Method: Compositional matrix adjust.
 Identities = 129/304 (42%), Positives = 183/304 (60%), Gaps = 10/304 (3%)

Query: 49  KWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALY 108
           +W A + RSY    E++ R ++++ N+E+IE  N+ GN TY LG NQF+DLT +EF  LY
Sbjct: 59  RWQATYNRSYPTAEERQRRFQVYRRNMEHIEATNRAGNLTYTLGENQFADLTEEEFLDLY 118

Query: 109 TGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQ-KECGCCWAFAAVA 167
           T   MP     +       +   S+ D PTS+DWR +GAVTPIKNQ   C  CWAF   A
Sbjct: 119 TMKGMPPVRRDAGKKQQANFS--SVVDAPTSVDWRSRGAVTPIKNQGPSCSSCWAFVTAA 176

Query: 168 AVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQA 227
            +E IT+IR+G L+ LSEQ+L+DC    + GC  G     + ++IQN G+ TE  YPYQA
Sbjct: 177 TIESITQIRTGKLVSLSEQELIDCDPY-DGGCNLGYFVNGYKWVIQNGGLTTEANYPYQA 235

Query: 228 VPGTCSAAQK-PAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFN 286
               C+ ++    AA+ISNY ++P G+ Q           +      S +F  Y  G+++
Sbjct: 236 RRYQCNRSKAGQRAARISNYRQLPQGEAQLQQAVAQQPVAAAIEMGGSLQF--YSGGVWS 293

Query: 287 GVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKI---VRDEGLCGIGTRS 343
           G CGT+++HA+T+VG+G    G  YWL+KNSWG TWG+ GY+++   VR  GLCGI    
Sbjct: 294 GQCGTRMNHAITVVGYGADSSGVKYWLVKNSWGQTWGERGYLRMRKDVRQGGLCGIALDL 353

Query: 344 SYPL 347
           +YP+
Sbjct: 354 AYPI 357


>gi|402770507|gb|AFQ98388.1| cathepsin L [Rhipicephalus microplus]
          Length = 332

 Score =  258 bits (659), Expect = 3e-66,   Method: Compositional matrix adjust.
 Identities = 139/335 (41%), Positives = 204/335 (60%), Gaps = 12/335 (3%)

Query: 21  IITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEK 80
           ++ L V CA   V+  ++ ++ +    E +   H ++Y+  +E+ +R KIF EN   I K
Sbjct: 1   MLRLSVLCAIVAVTVAASSQEILRTQWEAFKTTHKKTYQSHMEELLRFKIFTENSLIIAK 60

Query: 81  ANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVP 137
            N +   G  +YKLG NQF DL   EF  ++ GY     S +S  S+     N++ + +P
Sbjct: 61  HNAKYAKGLVSYKLGMNQFGDLLAHEFARIFNGYH---GSRKSGGSTFLPPANVNDSSLP 117

Query: 138 TSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GN 196
            ++DWR KGAVTP+K+Q +CG CWAF+   ++EG   +++G L+ LSEQ L+DCS + GN
Sbjct: 118 KAVDWRKKGAVTPVKDQGQCGSCWAFSTTGSLEGQHFLKNGELVSLSEQNLVDCSQSFGN 177

Query: 197 NGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQA 256
           NGC GG  E AF YI  N GI TE  YPY+AV G C   ++   A  + Y E+ +G E  
Sbjct: 178 NGCEGGLMEDAFKYIKANDGIDTEKSYPYEAVDGECRFKKEDVGATDTGYVEIKAGCEDD 237

Query: 257 LLKAV-SMQPVSIAIAAYSTEFQSYKEGIFNGV-CGTQ-LDHAVTIVGFGTTEDGANYWL 313
           L KAV ++ P+S+AI A  + FQ Y EG+++   C ++ LDH V +VG+G  + G  YWL
Sbjct: 238 LKKAVATVGPISVAIDASHSSFQLYSEGVYDEPECSSEDLDHGVLVVGYG-VKGGKKYWL 296

Query: 314 IKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYPL 347
           +KNSW  +WGD GY+ + RD    CGI +++SYPL
Sbjct: 297 VKNSWAESWGDQGYILMSRDNNNQCGIASQASYPL 331


>gi|242093994|ref|XP_002437487.1| hypothetical protein SORBIDRAFT_10g027980 [Sorghum bicolor]
 gi|241915710|gb|EER88854.1| hypothetical protein SORBIDRAFT_10g027980 [Sorghum bicolor]
          Length = 341

 Score =  257 bits (657), Expect = 5e-66,   Method: Compositional matrix adjust.
 Identities = 137/317 (43%), Positives = 198/317 (62%), Gaps = 32/317 (10%)

Query: 40  EQSVVEIHEKWMAQHGRSYKD-ELEKEMRLKIFKENLEYIEKANKE---GNRTYKLGTNQ 95
           ++ V ++++ W ++HGR      +   +RLK+F++NL YI+  N E   G  T++LG   
Sbjct: 44  DEEVRQLYKTWKSEHGRPRDGISVADGLRLKVFRDNLRYIDAHNAEADAGLHTFRLGLTP 103

Query: 96  FSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQK 155
           F+DLT +EFRA   G+ + S   R  +    +Y   +  D+P ++DWR +GAVT +KNQ 
Sbjct: 104 FTDLTLEEFRAHALGF-LNSTLPRVASD---RYLPRAGDDLPDAVDWRQQGAVTGVKNQL 159

Query: 156 ECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQ 215
           +CG CWAF+AVAA+EGI KI + NLI LSEQ+L+DC T  + GC GG  +KAF ++I N 
Sbjct: 160 DCGGCWAFSAVAAMEGINKIVTNNLISLSEQELIDCDTE-DYGCQGGEMQKAFQFVIDNG 218

Query: 216 GIATEDEYPYQAVPGTCSA-AQKPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYS 274
           GI TE +YP+    GTC A  +K     I +YE VP+ DE+AL KAV+ QP         
Sbjct: 219 GIDTEADYPFIGTNGTCDAIREKRKVVSIDSYENVPTNDEEALQKAVANQP--------- 269

Query: 275 TEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD- 333
                   GIFNG CG  LDH VT VG+G +++G ++W++KNSWG  WG++GY+++ R+ 
Sbjct: 270 --------GIFNGPCGFILDHGVTAVGYG-SDNGEDFWIVKNSWGAEWGESGYIRMKRNV 320

Query: 334 ---EGLCGIGTRSSYPL 347
               G CGI   +SYP+
Sbjct: 321 LLPMGKCGIAMYASYPV 337


>gi|297843430|ref|XP_002889596.1| hypothetical protein ARALYDRAFT_887827 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297335438|gb|EFH65855.1| hypothetical protein ARALYDRAFT_887827 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 343

 Score =  257 bits (657), Expect = 5e-66,   Method: Compositional matrix adjust.
 Identities = 140/343 (40%), Positives = 198/343 (57%), Gaps = 21/343 (6%)

Query: 15  TTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKEN 74
           T  + I   L+ S    V SS     +++ +  EKW+  H + Y    E  +R  I++ N
Sbjct: 11  TLVVLICFVLIASKLCSVNSSVYDPHKTLKQRFEKWLKTHSKLYGGRDEWMLRFGIYQSN 70

Query: 75  LEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMT 134
           ++ I+  N   +  +KL  N+F+D+TN EF+A + G         +T+S     +   + 
Sbjct: 71  VQLIDYINSL-HLPFKLTDNRFADMTNSEFKAHFLGL--------NTSSLRLHKKQRPVC 121

Query: 135 D----VPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLD 190
           D    VP ++DWR +GAVTPI+NQ +CG CWAF+AVAA+EGI KI++GNL+ LSEQQL+D
Sbjct: 122 DPAGNVPDAVDWRTQGAVTPIRNQGKCGGCWAFSAVAAIEGINKIKTGNLVSLSEQQLID 181

Query: 191 CSTNG-NNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQ-KPAAAKISNYEE 248
           C     N GC GG  E AF +I  N G+ TE +YPY  + GTC   + K     I  Y++
Sbjct: 182 CDVGTYNKGCSGGLMETAFEFIKSNGGLTTETDYPYTGIEGTCDQEKAKNKVVTIQGYQK 241

Query: 249 VPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDG 308
           V   +E +L  A + QPVS+ I A    FQ Y  G+F   CGT L+H VT+VG+G   D 
Sbjct: 242 VAQ-NEASLQIAAAQQPVSVGIDAGGFIFQLYSSGVFTSYCGTNLNHGVTVVGYGVEGD- 299

Query: 309 ANYWLIKNSWGNTWGDAGYMKIVR----DEGLCGIGTRSSYPL 347
             YW++KNSWG  WG+ GY+++ R    D G CGI   +SYPL
Sbjct: 300 QKYWIVKNSWGTGWGEEGYIRMERGISEDTGKCGIAMLASYPL 342


>gi|320169658|gb|EFW46557.1| cathepsin L2 [Capsaspora owczarzaki ATCC 30864]
          Length = 324

 Score =  257 bits (657), Expect = 5e-66,   Method: Compositional matrix adjust.
 Identities = 135/306 (44%), Positives = 190/306 (62%), Gaps = 11/306 (3%)

Query: 48  EKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRAL 107
           + W A HG SY    E+  R  I++ NL++IEK N EG+ +YKL  N+F+DLT  EF A 
Sbjct: 23  DSWKATHGVSYATVGEETARRGIYRANLDFIEKHNSEGH-SYKLAVNKFADLTYPEFAAK 81

Query: 108 YTGYKMPSP-SHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAV 166
           Y G +  +  + +S  +ST+  +   M  +P S+DWR  G VTPIK+Q +CG CW+F+  
Sbjct: 82  YLGLRFDATNATKSFAASTYLPR---MVSLPDSVDWRTAGIVTPIKDQGQCGSCWSFSTT 138

Query: 167 AAVEGITKIRSGNLIQLSEQQLLDCST-NGNNGCLGGSREKAFAYIIQNQGIATEDEYPY 225
            +VEG    ++G L+ LSEQ L+DCS+  GN GC GG  ++AF YII N GI TE  YPY
Sbjct: 139 GSVEGQHARKTGQLVSLSEQNLVDCSSAQGNAGCNGGLMDQAFQYIISNNGIDTESSYPY 198

Query: 226 QAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAV-SMQPVSIAIAAYSTEFQSYKEGI 284
            A  GTC        A +++Y+++ SG E  L  AV ++ P+S+AI A    FQ Y  G+
Sbjct: 199 TAQDGTCQFNSANVGATVASYQDIASGSESDLQNAVATVGPISVAIDASQPSFQFYSSGV 258

Query: 285 FN--GVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD-EGLCGIGT 341
           +N      +QLDH V  VG+GT+   ++YWL+KNSWG +WG +GY+ + R+    CGI T
Sbjct: 259 YNEPACSSSQLDHGVLAVGYGTS-GSSDYWLVKNSWGTSWGQSGYIWMTRNSNNQCGIAT 317

Query: 342 RSSYPL 347
            +SYPL
Sbjct: 318 AASYPL 323


>gi|449500383|ref|XP_004161083.1| PREDICTED: vignain-like [Cucumis sativus]
          Length = 345

 Score =  257 bits (657), Expect = 5e-66,   Method: Compositional matrix adjust.
 Identities = 142/319 (44%), Positives = 205/319 (64%), Gaps = 20/319 (6%)

Query: 40  EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDL 99
           E+S+ +++E+W   H  S ++  EK  R  +FKEN+ ++   N+  ++ YKL  N+F+D+
Sbjct: 34  EESLWQLYERWGKHHTIS-RNLKEKHKRFSVFKENVNHVFTVNQM-DKPYKLKLNKFADM 91

Query: 100 TNDEFRALYTGYKMPSPSH------RSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKN 153
           +N EF   Y      + SH      R   +  F Y+    TD+P+S+D R++GAV  +K 
Sbjct: 92  SNYEFVNFYA---RSNISHYRKLHERRRGAGGFMYE--QDTDLPSSVDGRERGAVNAVKE 146

Query: 154 QKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQ 213
           Q  CG CWAF++VAAVEGI KI++  L+ LSEQ+LLDC+   N GC GG  E AF +I +
Sbjct: 147 QGRCGSCWAFSSVAAVEGINKIKTNQLLSLSEQELLDCNYR-NKGCNGGFMEIAFDFIKR 205

Query: 214 NQGIATEDEYPYQAVPGTCSAAQ-KPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAA 272
           N GIATE+ YPY    G C +++      KI  YE VP  +E AL++AV+ QPVS+AI A
Sbjct: 206 NGGIATENSYPYHGSRGLCRSSRISSPIVKIDGYESVPE-NEDALMQAVANQPVSVAIDA 264

Query: 273 YSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR 332
              +FQ Y +G+F+G CGT+L+H V  +G+GTTEDG +YWL++NSWG  WG+ GY+++ R
Sbjct: 265 AGRDFQFYSQGVFDGYCGTELNHGVVAIGYGTTEDGTDYWLVRNSWGVGWGEDGYVRMKR 324

Query: 333 D----EGLCGIGTRSSYPL 347
                EGLCGI   +SYP+
Sbjct: 325 GVEQAEGLCGIAMEASYPI 343


>gi|343978787|gb|AEM76722.1| cathepsin L-like proteinase [Triatoma brasiliensis]
          Length = 330

 Score =  257 bits (657), Expect = 5e-66,   Method: Compositional matrix adjust.
 Identities = 139/333 (41%), Positives = 199/333 (59%), Gaps = 14/333 (4%)

Query: 23  TLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKAN 82
            LLV+ A   VS  +       E  E +   HG++YK++ E+  R KIF  N + IE  N
Sbjct: 3   VLLVAVAVIAVSCANRFYNINPEEWETFKVVHGKNYKNQFEEMFRRKIFMNNKKRIEAHN 62

Query: 83  ---KEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTS 139
              ++G  +YK+  N F DL + E +AL  G+KM +P+ +      F     S   +P S
Sbjct: 63  AKYEQGEVSYKMKMNHFGDLMSHEIKALMNGFKM-TPNTKREGKIYFP----SNDKLPKS 117

Query: 140 LDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GNNG 198
           +DWR KGAVTP+K+Q +CG CW+F+A  ++EG   ++ G L+ LSEQ L+DCS   GNNG
Sbjct: 118 VDWRQKGAVTPVKDQGQCGSCWSFSATGSLEGQIFLKKGKLVSLSEQNLMDCSKEYGNNG 177

Query: 199 CLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALL 258
           C GG  +KAF Y+  N+GI TE  YPY+A    C   +         Y ++P GDE+AL 
Sbjct: 178 CEGGLMDKAFQYVSDNKGIDTESSYPYEARDYACRFKKDKVGGTDKGYVDIPEGDEKALQ 237

Query: 259 KAV-SMQPVSIAIAAYSTEFQSYKEGIFN-GVCGT-QLDHAVTIVGFGTTEDGANYWLIK 315
            A+ ++ P+S+AI A    F  Y EG++N   C +  LDH V  VG+G TE+G +YWL+K
Sbjct: 238 NALATVGPISVAIDASHESFHFYSEGVYNEPYCSSYDLDHGVLAVGYG-TENGQDYWLVK 296

Query: 316 NSWGNTWGDAGYMKIVRDE-GLCGIGTRSSYPL 347
           NSWG +WG++GY+KI R+    CGI + +SYP+
Sbjct: 297 NSWGPSWGESGYIKIARNHSNHCGIASMASYPI 329


>gi|402770515|gb|AFQ98392.1| cathepsin L [Rhipicephalus microplus]
          Length = 332

 Score =  257 bits (656), Expect = 7e-66,   Method: Compositional matrix adjust.
 Identities = 138/335 (41%), Positives = 204/335 (60%), Gaps = 12/335 (3%)

Query: 21  IITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEK 80
           ++ L V CA   V+  ++ ++ +    E +   H ++Y+  +E+ +R KIF EN   I K
Sbjct: 1   MLRLSVLCAIAAVTVAASSQEILRTQWEAFKTTHKKTYQSHMEELLRFKIFTENSLIIAK 60

Query: 81  ANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVP 137
            N +   G  +YKLG NQF DL   EF  ++ G+     + ++  SS     N++ + +P
Sbjct: 61  HNAKYAKGLVSYKLGMNQFGDLLAHEFARIFNGHH---GTRKTGGSSFLPPANVNDSSLP 117

Query: 138 TSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GN 196
             +DWR KGAVTP+K+Q +CG CWAF+A  ++EG   +++G L+ LSEQ L+DCS + GN
Sbjct: 118 KVVDWRKKGAVTPVKDQGQCGSCWAFSATGSLEGQHFLKNGELVSLSEQNLVDCSQSFGN 177

Query: 197 NGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQA 256
           NGC GG  E AF YI  N GI TE  YPY+AV G C   ++   A  + Y E+ +G E  
Sbjct: 178 NGCEGGLMEDAFKYIKANDGIDTEKSYPYEAVDGECRFKKEDVGATDTGYVEIKAGSEVD 237

Query: 257 LLKAV-SMQPVSIAIAAYSTEFQSYKEGIFNGV-CGTQ-LDHAVTIVGFGTTEDGANYWL 313
           L KAV ++ P+S+AI A  + FQ Y EG+++   C ++ LDH V +VG+G  + G  YWL
Sbjct: 238 LKKAVATVGPISVAIDASHSSFQLYSEGVYDEPECSSEDLDHGVLVVGYG-VKGGKKYWL 296

Query: 314 IKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYPL 347
           +KNSW  +WGD GY+ + RD    CGI +++SYPL
Sbjct: 297 VKNSWAESWGDQGYILMSRDNNNQCGIASQASYPL 331


>gi|195381187|ref|XP_002049336.1| GJ20806 [Drosophila virilis]
 gi|194144133|gb|EDW60529.1| GJ20806 [Drosophila virilis]
          Length = 339

 Score =  257 bits (656), Expect = 7e-66,   Method: Compositional matrix adjust.
 Identities = 142/345 (41%), Positives = 197/345 (57%), Gaps = 22/345 (6%)

Query: 18  MFIIITLLVSCA-SQVVSSRSTHEQSVVEIHEKWMA---QHGRSYKDELEKEMRLKIFKE 73
           M I+  LL   A +Q VS           I E+W     +H ++Y DE E+  RLKIF E
Sbjct: 1   MRILFALLALVAVAQAVSYADV-------IKEEWQTFKLEHRKNYVDETEERFRLKIFNE 53

Query: 74  NLEYIEKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTF---K 127
           N   I K N+    G  ++K+  N+++D+ + EF     G+          +  +F    
Sbjct: 54  NKHKIAKHNQRYASGEVSFKMAVNKYADMLHHEFHTTMNGFNYTLHKQLRASDPSFVGVT 113

Query: 128 YQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQ 187
           + +     +P S+DWR KGAVT +K+Q  CG CWAF++  A+EG    ++G LI LSEQ 
Sbjct: 114 FISPEHVKIPKSVDWRSKGAVTEVKDQGHCGSCWAFSSTGALEGQHFRKAGTLISLSEQN 173

Query: 188 LLDCSTN-GNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNY 246
           L+DCST  GNNGC GG  + AF YI  N GI TE  YPY+ +  +C   +    A     
Sbjct: 174 LVDCSTKYGNNGCNGGLMDNAFRYIKDNGGIDTEKSYPYEGIDDSCHFNKATIGATDRGS 233

Query: 247 EEVPSGDEQALLKAV-SMQPVSIAIAAYSTEFQSYKEGIFN-GVCGTQ-LDHAVTIVGFG 303
            ++P GDE+ + +AV ++ PVS+AI A    FQ Y EGI+N   C  Q LDH V +VG+G
Sbjct: 234 VDIPQGDEKKMAEAVATIGPVSVAIDASHESFQFYSEGIYNEPQCDPQNLDHGVLVVGYG 293

Query: 304 TTEDGANYWLIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYPL 347
           T E G +YWL+KNSWG TWGD G++K+ R+ +  CGI + SSYPL
Sbjct: 294 TDESGQDYWLVKNSWGTTWGDKGFIKMARNADNQCGIASASSYPL 338


>gi|146217394|gb|ABQ10739.1| cathepsin L [Penaeus monodon]
          Length = 341

 Score =  257 bits (656), Expect = 7e-66,   Method: Compositional matrix adjust.
 Identities = 136/317 (42%), Positives = 195/317 (61%), Gaps = 13/317 (4%)

Query: 43  VVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANK---EGNRTYKLGTNQFSDL 99
           V+E  E +  +H + Y  E+E+  R+KIF EN   I   NK   +G+ TYKL  N++ D+
Sbjct: 25  VLEEWEAFKLEHSKKYDSEVEESFRMKIFTENKHKIANHNKGFAQGHHTYKLSMNKYGDM 84

Query: 100 TNDEFRALYTGYKMPS----PSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQK 155
            + EF +   G++        ++R+ T +TF   +  +  +P ++DWR KGAVTPIK+Q 
Sbjct: 85  LHHEFVSTMNGFRGNHTGGYKNNRAYTGATFIEPDDDVQ-LPKNVDWRTKGAVTPIKDQG 143

Query: 156 ECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GNNGCLGGSREKAFAYIIQN 214
           +CG CWAF+A  A+EG T  ++G L+ LSEQ L+DCS   GNNGC GG  + AF Y+ +N
Sbjct: 144 QCGSCWAFSATGALEGQTFRKTGQLVSLSEQNLVDCSRKFGNNGCNGGLMDNAFEYVKEN 203

Query: 215 QGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAV-SMQPVSIAIAAY 273
            GI TE+ YPY A    C    + A A+   + +V  G E AL KAV ++ PVS+AI A 
Sbjct: 204 GGIDTEESYPYDAEDEKCHYNPRAAGAEDKGFVDVREGSEHALKKAVATVGPVSVAIDAS 263

Query: 274 STEFQSYKEGIF-NGVCGTQ-LDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIV 331
              FQ Y  G++    C  + LDH V +VG+G  +DG +YWL+KNSWG TWGD GY+K+ 
Sbjct: 264 HESFQFYSHGVYIEPECSPEMLDHGVLVVGYGIDDDGTDYWLVKNSWGTTWGDQGYVKMA 323

Query: 332 RD-EGLCGIGTRSSYPL 347
           R+ +  CGI + +S+PL
Sbjct: 324 RNRDNQCGIASSASFPL 340


>gi|402770517|gb|AFQ98393.1| cathepsin L [Rhipicephalus microplus]
          Length = 332

 Score =  257 bits (656), Expect = 7e-66,   Method: Compositional matrix adjust.
 Identities = 138/335 (41%), Positives = 204/335 (60%), Gaps = 12/335 (3%)

Query: 21  IITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEK 80
           ++ L V CA   V+  ++ ++ +    E +   H ++Y+  +E+ +R KIF EN   I K
Sbjct: 1   MLRLSVLCAIAAVTVAASSQEILRTQWEAFKTTHKKTYQSHMEELLRFKIFTENSLIIAK 60

Query: 81  ANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVP 137
            N +   G  +YKLG NQF DL   EF  ++ G+     + ++  SS     N++ + +P
Sbjct: 61  HNAKYAKGLVSYKLGMNQFGDLLAHEFARIFNGHH---GTRKTGGSSFLPPANVNDSSLP 117

Query: 138 TSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GN 196
             +DWR KGAVTP+K+Q +CG CWAF+A  ++EG   +++G L+ LSEQ L+DCS + GN
Sbjct: 118 KVVDWRKKGAVTPVKDQGQCGSCWAFSATGSLEGQHFLKNGELVSLSEQNLVDCSQSFGN 177

Query: 197 NGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQA 256
           NGC GG  E AF YI  N GI TE  YPY+AV G C   ++   A  + Y E+ +G E  
Sbjct: 178 NGCEGGLMEDAFKYIKANDGIDTEKSYPYKAVDGECRFKKEDVGATDTGYVEIKAGSEVD 237

Query: 257 LLKAV-SMQPVSIAIAAYSTEFQSYKEGIFNGV-CGTQ-LDHAVTIVGFGTTEDGANYWL 313
           L KAV ++ P+S+AI A  + FQ Y EG+++   C ++ LDH V +VG+G  + G  YWL
Sbjct: 238 LKKAVATVGPISVAIDASHSSFQLYSEGVYDEPECSSEDLDHGVLVVGYG-VKGGKKYWL 296

Query: 314 IKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYPL 347
           +KNSW  +WGD GY+ + RD    CGI +++SYPL
Sbjct: 297 VKNSWAESWGDQGYILMSRDNNNQCGIASQASYPL 331


>gi|402770509|gb|AFQ98389.1| cathepsin L [Rhipicephalus microplus]
          Length = 332

 Score =  256 bits (655), Expect = 8e-66,   Method: Compositional matrix adjust.
 Identities = 139/336 (41%), Positives = 205/336 (61%), Gaps = 14/336 (4%)

Query: 21  IITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEK 80
           ++ L V CA   V+  ++ ++ +    E +   H ++Y+  +E+ +R KIF E+   I +
Sbjct: 1   MLRLSVLCAIAAVTVAASSQEILRTQWEAFKTTHKKTYQSHMEELLRFKIFTESSLIIAR 60

Query: 81  ANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTF-KYQNLSMTDV 136
            N +   G  +YKLG NQF DL   EF  ++ G+       R T  STF    N++ + +
Sbjct: 61  HNAKYAKGLVSYKLGMNQFGDLLAHEFARIFNGHH----GTRKTGGSTFLPPANVNDSSL 116

Query: 137 PTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN-G 195
           P ++DWR KGAVTP+K+Q +CG CWAF+A  ++EG   +++G L+ LSEQ L+DCS + G
Sbjct: 117 PKAVDWRKKGAVTPVKDQGQCGSCWAFSATGSLEGQHFLKNGELVSLSEQNLVDCSQSFG 176

Query: 196 NNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQ 255
           NNGC GG  E AF YI  N GI TE  YPY+AV G C   ++   A  + Y E+ +G E 
Sbjct: 177 NNGCEGGLMEDAFKYIKANDGIDTEKSYPYEAVDGECRFKKEDVGATDTGYVEIKAGSED 236

Query: 256 ALLKAV-SMQPVSIAIAAYSTEFQSYKEGIFNGV-CGTQ-LDHAVTIVGFGTTEDGANYW 312
            L KAV ++ P+S+AI A  + FQ Y EG+++   C ++ LDH V +VG+G  + G  YW
Sbjct: 237 DLKKAVATVGPISVAIDASHSSFQLYSEGVYDEPECSSEDLDHGVLVVGYG-VKGGKKYW 295

Query: 313 LIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYPL 347
           L+KNSW  +WGD GY+ + RD    CGI +++SYPL
Sbjct: 296 LVKNSWAESWGDQGYILMSRDNNNQCGIASQASYPL 331


>gi|91092014|ref|XP_970644.1| PREDICTED: similar to cathepsin-L-like cysteine peptidase 02
           [Tribolium castaneum]
 gi|270001249|gb|EEZ97696.1| cathepsin L precursor [Tribolium castaneum]
          Length = 337

 Score =  256 bits (655), Expect = 8e-66,   Method: Compositional matrix adjust.
 Identities = 138/342 (40%), Positives = 197/342 (57%), Gaps = 19/342 (5%)

Query: 19  FIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMA---QHGRSYKDELEKEMRLKIFKENL 75
           F++   L    SQ VS           + E+W A    H + Y+ E E+  R+KIF EN 
Sbjct: 3   FLVFVALCVVGSQAVSFFDL-------VQEQWGAFKVTHKKQYESETEERFRMKIFMENA 55

Query: 76  EYIEKANK---EGNRTYKLGTNQFSDLTNDEFRALYTGY-KMPSPSHRSTTSSTFKYQNL 131
             + K NK   +G  ++KLG N++SD+ N EF     GY +  +P        +  +   
Sbjct: 56  HKVAKHNKLYAQGLVSFKLGVNKYSDMLNHEFVHTLNGYNRSKTPLRSGELDESITFIPP 115

Query: 132 SMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDC 191
           +  ++P  +DWR  GAVTP+K+Q +CG CW+F+   ++EG    +S  L+ LSEQ L+DC
Sbjct: 116 ANVELPKQIDWRKLGAVTPVKDQGQCGSCWSFSTTGSLEGQHFRKSKKLVSLSEQNLIDC 175

Query: 192 STN-GNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVP 250
           S   GNNGC GG  + AF YI  N GI TE  YPY+A    C    +   A    + ++ 
Sbjct: 176 SEKYGNNGCNGGLMDNAFRYIKDNGGIDTEQSYPYKAEDEKCHYKPRNKGATDRGFVDIE 235

Query: 251 SGDEQALLKAV-SMQPVSIAIAAYSTEFQSYKEGIF-NGVCGT-QLDHAVTIVGFGTTED 307
           SGDE+ L  AV ++ P+S+AI A    FQ Y EG++    C + QLDH V +VG+GT ED
Sbjct: 236 SGDEEKLKAAVATVGPISVAIDASHPTFQQYSEGVYYEPECSSEQLDHGVLVVGYGTDED 295

Query: 308 GANYWLIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYPLA 348
           G +YWL+KNSWG++WGD GY+K+ R+ +  CGI T++SYPL 
Sbjct: 296 GNDYWLVKNSWGDSWGDQGYIKMARNRDNNCGIATQASYPLV 337


>gi|55740402|gb|AAV63977.1| cathepsin L precursor [Artemia franciscana]
          Length = 338

 Score =  256 bits (655), Expect = 9e-66,   Method: Compositional matrix adjust.
 Identities = 137/335 (40%), Positives = 206/335 (61%), Gaps = 11/335 (3%)

Query: 21  IITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEK 80
           +I LL +   Q+ ++ S       E H  + A H + Y  +LE++ R+KI+ EN   + K
Sbjct: 6   LIFLLGAVLVQLSAALSLTNLLADEWH-LFKATHKKEYPSQLEEKFRMKIYLENKHKVAK 64

Query: 81  AN---KEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVP 137
            N   ++G ++Y++  N+F DL + EFR++  GY+     + S   STF +   +  +VP
Sbjct: 65  HNILYEKGEKSYQVAMNKFGDLLHHEFRSIMNGYQHKK-QNSSRAESTFTFMEPANVEVP 123

Query: 138 TSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GN 196
            S+DWR+KGA+TP+K+Q +CG CWAF++  A+EG T  ++G LI LSEQ L+DCS   GN
Sbjct: 124 ESVDWREKGAITPVKDQGQCGSCWAFSSTGALEGQTFRKTGKLISLSEQNLIDCSGKYGN 183

Query: 197 NGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQA 256
            GC GG  ++AF YI  N+GI TE+ YPY+A    C    +   A    + ++PSG+E  
Sbjct: 184 EGCNGGLMDQAFQYIKDNKGIDTENTYPYEAEDDVCRYNPRNRGAVDRGFVDIPSGEEDK 243

Query: 257 LLKAV-SMQPVSIAIAAYSTEFQSYKEGI-FNGVCGT-QLDHAVTIVGFGTTEDGANYWL 313
           L  AV ++ PVS+AI A    FQ Y +G+ +   C +  LDH V +VG+G +++G +YWL
Sbjct: 244 LKAAVATVGPVSVAIDASHESFQFYSKGVYYEPSCDSDDLDHGVLVVGYG-SDNGKDYWL 302

Query: 314 IKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYPL 347
           +KNSW   WGD GY+KI R+ +  CG+ T +SYPL
Sbjct: 303 VKNSWSEHWGDEGYIKIARNRKNHCGVATAASYPL 337


>gi|402770511|gb|AFQ98390.1| cathepsin L [Rhipicephalus microplus]
 gi|402770513|gb|AFQ98391.1| cathepsin L [Rhipicephalus microplus]
          Length = 332

 Score =  256 bits (655), Expect = 9e-66,   Method: Compositional matrix adjust.
 Identities = 138/335 (41%), Positives = 204/335 (60%), Gaps = 12/335 (3%)

Query: 21  IITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEK 80
           ++ L V CA   V+  ++ ++ +    E +   H ++Y+  +E+ +R KIF EN   I K
Sbjct: 1   MLRLSVLCAIVAVTVAASSQEILRTQWEAFKTTHKKTYQSHMEELLRFKIFTENSLIIAK 60

Query: 81  ANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVP 137
            N +   G  +YKLG NQF DL   EF  ++ G+     + ++  SS     N++ + +P
Sbjct: 61  HNAKYAKGLVSYKLGMNQFGDLLAHEFARIFNGHH---GTRKTGGSSFLPPANVNDSSLP 117

Query: 138 TSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GN 196
             +DWR KGAVTP+K+Q +CG CWAF+A  ++EG   +++G L+ LSEQ L+DCS + GN
Sbjct: 118 KVVDWRKKGAVTPVKDQGQCGSCWAFSATGSLEGQHFLKNGELVSLSEQNLVDCSQSFGN 177

Query: 197 NGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQA 256
           NGC GG  E AF YI  N GI TE  YPY+AV G C   ++   A  + Y E+ +G E  
Sbjct: 178 NGCEGGLMEDAFKYIKANDGIDTEKSYPYEAVDGECRFKKEDVGATDTGYVEIKAGSEVD 237

Query: 257 LLKAV-SMQPVSIAIAAYSTEFQSYKEGIFNGV-CGTQ-LDHAVTIVGFGTTEDGANYWL 313
           L KAV ++ P+S+AI A  + FQ Y EG+++   C ++ LDH V +VG+G  + G  YWL
Sbjct: 238 LKKAVATVGPISVAIDASHSSFQLYSEGVYDEPECSSEDLDHGVLVVGYG-VKGGKKYWL 296

Query: 314 IKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYPL 347
           +KNSW  +WGD GY+ + RD    CGI +++SYPL
Sbjct: 297 VKNSWAESWGDQGYILMSRDNNNQCGIASQASYPL 331


>gi|224116884|ref|XP_002317418.1| predicted protein [Populus trichocarpa]
 gi|222860483|gb|EEE98030.1| predicted protein [Populus trichocarpa]
          Length = 503

 Score =  256 bits (655), Expect = 9e-66,   Method: Compositional matrix adjust.
 Identities = 138/333 (41%), Positives = 203/333 (60%), Gaps = 18/333 (5%)

Query: 24  LLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYI-EKAN 82
           ++V+  S++VS     E+S++EI ++W  +H + Y+   E E R + FK NL+YI EKA 
Sbjct: 32  IVVNDFSELVS-----EESIIEIFQQWRDRHQKVYEHAAESEKRYRNFKRNLKYIIEKAG 86

Query: 83  KE-GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLD 141
           K+     + +G N+F+DL+N+EF+ LY        + + +T+  ++ +NL   D P+SLD
Sbjct: 87  KKTAALGHSVGLNKFADLSNEEFKELYLSKVKKPINIKRSTARDWRQRNLQTCDAPSSLD 146

Query: 142 WRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLG 201
           WR KG VT +K+Q +CG CW+F+   A+EGI  I +G+LI LSEQ+L+DC T  N GC G
Sbjct: 147 WRKKGVVTAVKDQGDCGSCWSFSTTGAIEGINAIVTGDLISLSEQELVDCDTT-NYGCEG 205

Query: 202 GSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKP-AAAKISNYEEVPSGDEQALLKA 260
           G  + AF ++I N GI TE  YPY  V GTC+  ++      I  Y +V   D  ALL A
Sbjct: 206 GYMDYAFEWVINNGGIDTEANYPYTGVDGTCNTTKEEIKVVSIDGYTDVDETD-SALLCA 264

Query: 261 VSMQPVSIAIAAYSTEFQSYKEGIFNGVCG---TQLDHAVTIVGFGTTEDGANYWLIKNS 317
              QP+S+ +   + +FQ Y  GI++G C      +DHAV IVG+G +E+G +YW++KNS
Sbjct: 265 TVQQPISVGMDGSALDFQLYTGGIYDGDCSDDPNDIDHAVLIVGYG-SENGEDYWIVKNS 323

Query: 318 WGNTWGDAGYMKIVRDE----GLCGIGTRSSYP 346
           WG  WG  GY  I R+     G+C I   +SYP
Sbjct: 324 WGTEWGMEGYFYIKRNTDLPYGVCAINAEASYP 356


>gi|391338876|ref|XP_003743781.1| PREDICTED: cathepsin L-like isoform 4 [Metaseiulus occidentalis]
          Length = 336

 Score =  256 bits (654), Expect = 1e-65,   Method: Compositional matrix adjust.
 Identities = 141/338 (41%), Positives = 208/338 (61%), Gaps = 19/338 (5%)

Query: 19  FIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYI 78
           F+I+ +LV  AS  +    T EQ      + +   H + Y+    +  R KIF +N   I
Sbjct: 8   FLILAVLVGAASAAL----TLEQLFDAEWQNFKVHHNKKYEGSTVEAFRKKIFLQNTHLI 63

Query: 79  EKAN---KEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTF-KYQNLSMT 134
            + N    +G  TYKL  NQF D+ + EF +   G      S+R+   ST+ + +++S+ 
Sbjct: 64  ARHNIKHAKGETTYKLKMNQFGDMLHHEFVSTMNGLLR---SNRTYFGSTWIEPESVSL- 119

Query: 135 DVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN 194
             P S+DWR+KGAVTP+KNQ  CG CW+F+   A+EG    ++G L+ LSEQ L+DCST+
Sbjct: 120 --PKSVDWREKGAVTPVKNQGHCGSCWSFSTTGALEGQLFRKTGELVSLSEQNLIDCSTS 177

Query: 195 -GNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGD 253
            GNNGC GG  + AF YI +N GI TE+ YPY+   G C   ++ +A + + + ++PSG+
Sbjct: 178 YGNNGCGGGLMDNAFTYIKENHGIDTEESYPYEGKQGKCRYHKEDSAGRDTGFVDIPSGN 237

Query: 254 EQALLKAV-SMQPVSIAIAAYSTEFQSYKEGIFNGV-CGTQ-LDHAVTIVGFGTTEDGAN 310
           E+AL KA+ ++ PVS+AI A    FQ Y EG++N   C +  LDH V  VG+GTT+DG +
Sbjct: 238 ERALAKALATIGPVSVAIDASHESFQFYHEGVYNPPDCDSHSLDHGVLAVGYGTTDDGQD 297

Query: 311 YWLIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYPL 347
           Y++IKNSWG  WG  GY+ + R+ +  CG+ T++SYPL
Sbjct: 298 YYIIKNSWGERWGQEGYVLMARNSKNECGVATQASYPL 335


>gi|413953665|gb|AFW86314.1| hypothetical protein ZEAMMB73_546353 [Zea mays]
          Length = 233

 Score =  256 bits (654), Expect = 1e-65,   Method: Compositional matrix adjust.
 Identities = 115/226 (50%), Positives = 155/226 (68%), Gaps = 6/226 (2%)

Query: 126 FKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSE 185
           F+Y+N+S   +PT++DWR KGAVTPIK+Q +CGCCWAF+AVAA EGI KI +G L+ L+E
Sbjct: 7   FRYENVSADALPTTIDWRTKGAVTPIKDQGQCGCCWAFSAVAATEGIVKISTGKLVSLAE 66

Query: 186 QQLLDCSTNG-NNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKIS 244
           Q+L+DC  +  + GC GG  + AF +II+N G+ TE  YPY A  G C +    +AA I 
Sbjct: 67  QELVDCDVHDEDQGCEGGLMDDAFKFIIKNGGLTTESSYPYTAADGKCKSGSN-SAATIK 125

Query: 245 NYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGT 304
            YE+VP+ DE AL+KAV+ QPVS+A+      FQ Y  G+  G CGT LDH +  +G+G 
Sbjct: 126 GYEDVPANDEAALMKAVANQPVSVAVDGGDMTFQFYSGGVMTGSCGTDLDHGIAAIGYGK 185

Query: 305 TEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYP 346
           T DG  YWL+KNSWG TWG+ GY+++ +D     G+CG+    SYP
Sbjct: 186 TSDGTKYWLMKNSWGTTWGENGYLRMEKDISDKRGMCGLAMEPSYP 231


>gi|330805277|ref|XP_003290611.1| hypothetical protein DICPUDRAFT_81345 [Dictyostelium purpureum]
 gi|325079250|gb|EGC32859.1| hypothetical protein DICPUDRAFT_81345 [Dictyostelium purpureum]
          Length = 330

 Score =  256 bits (654), Expect = 1e-65,   Method: Compositional matrix adjust.
 Identities = 137/333 (41%), Positives = 197/333 (59%), Gaps = 21/333 (6%)

Query: 22  ITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKA 81
            +  V  AS  V S  T++ S +     WM +H RSY    E   + + FK+N+++I   
Sbjct: 12  FSFNVCFASNSVYSAQTYQTSFL----GWMKKHDRSYHHH-EFNNKYQAFKDNMDFIHNW 66

Query: 82  NKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDV--PTS 139
           N   N    LG  QF+DLTN+E+R +Y G K+     +          N +M     P S
Sbjct: 67  NTNKNSKTVLGLTQFADLTNEEYRKIYLGTKVNVAPEK---------HNFNMIHFTGPDS 117

Query: 140 LDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GNNG 198
           +DWR KGAV+ +K+Q +CG CW+F+   +VEG  +I++GN++ LSEQ L+DCS   GNNG
Sbjct: 118 IDWRTKGAVSHVKDQGQCGSCWSFSTTGSVEGAHQIKTGNMVTLSEQNLVDCSGKFGNNG 177

Query: 199 CLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALL 258
           C GG    AF +I+   G+ATED YPY AV G C   +    A IS Y+E+  G E  L 
Sbjct: 178 CDGGLMVNAFKFIMSQGGVATEDSYPYNAVQGKCKFTKSMVGANISGYKEITQGSELELQ 237

Query: 259 KAVSMQPVSIAIAAYSTEFQSYKEGIFNGV-CGT-QLDHAVTIVGFGTTEDGANYWLIKN 316
            A++ QPVSIAI A    FQ YK G+++   C + QLDH V  VG+G TE+G +Y+++KN
Sbjct: 238 AALTKQPVSIAIDASQQSFQLYKSGVYDEPECSSYQLDHGVLAVGYG-TENGKDYYIVKN 296

Query: 317 SWGNTWGDAGYMKIVRD-EGLCGIGTRSSYPLA 348
           SW ++WG  GY+ + R+ +  CG+ T +SYP++
Sbjct: 297 SWADSWGQDGYIFMSRNAKNQCGVATMASYPIS 329


>gi|125564712|gb|EAZ10092.1| hypothetical protein OsI_32402 [Oryza sativa Indica Group]
          Length = 382

 Score =  256 bits (654), Expect = 1e-65,   Method: Compositional matrix adjust.
 Identities = 135/336 (40%), Positives = 191/336 (56%), Gaps = 31/336 (9%)

Query: 42  SVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTN 101
           +++E+ ++W A++ RSY    E+  RL+++  N+ YIE  N      Y+LG   ++DLTN
Sbjct: 47  TMMEMFQRWKAEYNRSYATPEEERRRLRVYARNVRYIEATNAAAGLAYELGETAYTDLTN 106

Query: 102 DEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDV----------------PTSLDWRDK 145
           DEF A+YT   + S +     ++T          V                P S+DWR  
Sbjct: 107 DEFMAMYTAPPLRSAADDDDDAATTTIITTRAGPVDEHQQPEVYFNESAGAPASVDWRAS 166

Query: 146 GAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSRE 205
           GAVT +K+Q  CG CWAF+ VA VEGI KI+ G L+ LSEQ+L+DC T  ++GC GG   
Sbjct: 167 GAVTEVKDQGRCGSCWAFSTVAVVEGIQKIKKGKLVSLSEQELVDCDTL-DSGCDGGVSY 225

Query: 206 KAFAYIIQNQGIATEDEYPYQA-VPGTCSAAQ-KPAAAKISNYEEVPSGDEQALLKAVSM 263
           +A  +I  N GI T D+YPY       C  A+    AA I+    V +  E +L  A + 
Sbjct: 226 RALEWITANGGITTRDDYPYTGAAAAACDRAKLGHHAATIAGLRRVATRSEASLQNAAAA 285

Query: 264 QPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTE-------DGANYWLIKN 316
           QPV+++I A    FQ Y++G+++G CGT+L+H VT+VG+G  E        G  YW+IKN
Sbjct: 286 QPVAVSIEAGGDNFQHYRKGVYDGPCGTRLNHGVTVVGYGQEEAPVDGSAAGDKYWIIKN 345

Query: 317 SWGNTWGDAGYMKIVRD-----EGLCGIGTRSSYPL 347
           SWG  WGD GY+K+ +D     EGLCGI  R S+PL
Sbjct: 346 SWGKNWGDQGYIKMKKDVAGKPEGLCGIAIRPSFPL 381


>gi|330805273|ref|XP_003290609.1| hypothetical protein DICPUDRAFT_92519 [Dictyostelium purpureum]
 gi|325079248|gb|EGC32857.1| hypothetical protein DICPUDRAFT_92519 [Dictyostelium purpureum]
          Length = 333

 Score =  256 bits (654), Expect = 1e-65,   Method: Compositional matrix adjust.
 Identities = 137/320 (42%), Positives = 196/320 (61%), Gaps = 14/320 (4%)

Query: 33  VSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLG 92
           V S  T++ S +     WM +H R+Y  E E   R + FKEN+++I K N + + T  LG
Sbjct: 23  VFSSQTYQTSFI----GWMRKHDRAYSHE-EFTDRYQAFKENMDFIHKWNSQESDTV-LG 76

Query: 93  TNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIK 152
             +F+DLTN+E++  Y G K+    + +      K+   +    P S+DWR+KGAV+ +K
Sbjct: 77  LTKFADLTNEEYKKHYLGIKVNVKKNLNAAQKGLKFFKFTG---PDSIDWREKGAVSQVK 133

Query: 153 NQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GNNGCLGGSREKAFAYI 211
           +Q +CG CW+F+   AVEG  +I+SGN++ LSEQ L+DCS   GN GC GG    AF YI
Sbjct: 134 DQGQCGSCWSFSTTGAVEGAHQIKSGNMVSLSEQNLVDCSGQYGNQGCEGGLMVNAFEYI 193

Query: 212 IQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIA 271
           I N GIATE  YPY A  G C   +    A I  Y+E+P G+E +L  A++ QPVS+AI 
Sbjct: 194 IDNGGIATESSYPYTAAQGRCKFTKSMNGANIIGYKEIPQGEEDSLTAALAKQPVSVAID 253

Query: 272 AYSTEFQSYKEGIFN-GVCGTQ-LDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMK 329
           A    FQ Y  G+++   C ++ LDH V  VG+GT E G +Y++IKNSWG TWG  GY+ 
Sbjct: 254 ASHMSFQLYSSGVYDEPACSSEALDHGVLAVGYGTLE-GKDYYIIKNSWGPTWGQDGYIF 312

Query: 330 IVRD-EGLCGIGTRSSYPLA 348
           + R+ +  CG+ T +SYP++
Sbjct: 313 MSRNAQNQCGVATMASYPIS 332


>gi|30023547|gb|AAO48766.2| cathepsin L-like cysteine proteinase [Tenebrio molitor]
          Length = 337

 Score =  256 bits (654), Expect = 1e-65,   Method: Compositional matrix adjust.
 Identities = 140/341 (41%), Positives = 195/341 (57%), Gaps = 19/341 (5%)

Query: 19  FIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMA---QHGRSYKDELEKEMRLKIFKENL 75
           F+I   +    SQ VS           + E+W A    H + Y+ E E+  R+KIF EN 
Sbjct: 3   FLIFLAICVAGSQAVSFFDL-------VQEQWGAFKMTHNKQYQSETEERFRMKIFMENS 55

Query: 76  EYIEKANK---EGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSS-TFKYQNL 131
             + K NK   +G  ++KLG N+++D+ + EF  +  G+       RS  S  +  +   
Sbjct: 56  HTVAKHNKLYAQGLVSFKLGINKYADMLHHEFVQVLNGFNRTKSGLRSGESDDSVTFLPP 115

Query: 132 SMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDC 191
           +   +P  +DWRDKGAVTP+K+Q +CG CW+F+A  ++EG    +SG L+ LSEQ L+DC
Sbjct: 116 ANVQLPGQIDWRDKGAVTPVKDQGQCGSCWSFSATGSLEGQHFRQSGKLVSLSEQNLVDC 175

Query: 192 STN-GNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVP 250
           S   GNNGC GG  + AF YI  N GI TE  YPY+A    C    K   A    Y ++ 
Sbjct: 176 SEKFGNNGCNGGLMDNAFRYIKANGGIDTEQAYPYKAEDEKCHYKPKNKGATDRGYVDIE 235

Query: 251 SGDEQALLKAV-SMQPVSIAIAAYSTEFQSYKEGIF--NGVCGTQLDHAVTIVGFGTTED 307
           SG+E  L  AV ++ PVS+AI A    FQ Y  G++       +QLDH V +VG+GT +D
Sbjct: 236 SGNEDKLQSAVATVGPVSVAIDASHQSFQLYSGGVYYEPDCSASQLDHGVLVVGYGTEDD 295

Query: 308 GANYWLIKNSWGNTWGDAGYMKIVRDE-GLCGIGTRSSYPL 347
           G +YWL+KNSWG +WGD GY+K+ R+    CGI T +SYPL
Sbjct: 296 GTDYWLVKNSWGKSWGDQGYIKMARNRNNNCGIATEASYPL 336


>gi|449683741|ref|XP_002155462.2| PREDICTED: cathepsin L-like [Hydra magnipapillata]
          Length = 324

 Score =  256 bits (654), Expect = 1e-65,   Method: Compositional matrix adjust.
 Identities = 138/329 (41%), Positives = 201/329 (61%), Gaps = 18/329 (5%)

Query: 24  LLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANK 83
           LL+      +  R T + S +    +W   H ++Y  + E+ +R  I+K+N   I + N 
Sbjct: 8   LLLGVTLAYIIERPTEDDSWI----RWKMAHNKAYSHDGEETVRYTIWKDNERRIREHNL 63

Query: 84  EGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWR 143
           +G   + L  NQF D+TN+EF+  + GY     SH+  + STF   N  +   P S+DWR
Sbjct: 64  QGG-DFLLEMNQFGDMTNNEFKD-FNGY----LSHKHVSGSTFLTPNSFV--APDSVDWR 115

Query: 144 DKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GNNGCLGG 202
           ++G VTP+K+Q +CG CWAF+   ++EG    ++G L+ LSEQ L+DCST  GNNGC GG
Sbjct: 116 NEGYVTPVKDQGQCGSCWAFSTTGSLEGQNFKKTGKLVSLSEQNLVDCSTAYGNNGCNGG 175

Query: 203 SREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAV- 261
             + AF YI +N GI +E  YPY A  G C+  +   AA  + + ++PSGDE  L +AV 
Sbjct: 176 LMDNAFTYIKENNGIDSEASYPYTAKDGKCAFTKPNVAATDTGFVDIPSGDENKLKEAVA 235

Query: 262 SMQPVSIAIAAYSTEFQSYKEGIFN--GVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWG 319
           S+ P+S+AI A    FQ Y++G++N      T+LDH V +VG+G TE G +YWL+KNSW 
Sbjct: 236 SVGPISVAIDASHFSFQFYRKGVYNERKCSSTELDHGVLVVGYG-TESGKDYWLVKNSWN 294

Query: 320 NTWGDAGYMKIVRD-EGLCGIGTRSSYPL 347
            +WGD GY+K+ R+ +  CGI T +SYPL
Sbjct: 295 TSWGDKGYIKMSRNAKNQCGIATNASYPL 323


>gi|33112583|gb|AAP94047.1| cathepsin-L-like cysteine peptidase 03 [Tenebrio molitor]
          Length = 337

 Score =  256 bits (654), Expect = 1e-65,   Method: Compositional matrix adjust.
 Identities = 139/341 (40%), Positives = 196/341 (57%), Gaps = 19/341 (5%)

Query: 19  FIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMA---QHGRSYKDELEKEMRLKIFKENL 75
           F+I   +    SQ VS           + E+W A    H + Y+ + E+  R+KIF EN 
Sbjct: 3   FLIFLAICVAGSQAVSFFDL-------VQEQWGAFKMTHNKQYQSDTEERFRMKIFMENS 55

Query: 76  EYIEKANK---EGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSS-TFKYQNL 131
             + K NK   +G  ++KLG N+++D+ + EF  +  G+       RS  S  +  +   
Sbjct: 56  HTVAKHNKLYAQGLVSFKLGINKYADMLHHEFVQVLNGFNRTKSGLRSGESDDSVTFLPP 115

Query: 132 SMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDC 191
           +   +P  +DWRDKGAVTP+K+Q +CG CW+F+A  ++EG    +SG L+ LSEQ L+DC
Sbjct: 116 ANVQLPGQIDWRDKGAVTPVKDQGQCGSCWSFSATGSLEGQHFRKSGKLVSLSEQNLVDC 175

Query: 192 STN-GNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVP 250
           S   GNNGC GG  + AF YI  N GI TE  YPY+A    C    K   A    Y ++ 
Sbjct: 176 SEKFGNNGCNGGLMDNAFRYIKANGGIDTEQAYPYKAEDEKCHYKPKNKGATDRGYVDIE 235

Query: 251 SGDEQALLKAV-SMQPVSIAIAAYSTEFQSYKEGIF--NGVCGTQLDHAVTIVGFGTTED 307
           SG+E  L  AV ++ PVS+AI A    FQ Y  G++       +QLDH V +VG+GT +D
Sbjct: 236 SGNEDKLQSAVATVGPVSVAIDASHQSFQLYSGGVYYEPDCSASQLDHGVLVVGYGTEDD 295

Query: 308 GANYWLIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYPL 347
           G +YWL+KNSWG +WGD GY+K+ R+ +  CGI T +SYPL
Sbjct: 296 GTDYWLVKNSWGKSWGDQGYIKMARNRDNNCGIATEASYPL 336


>gi|391338870|ref|XP_003743778.1| PREDICTED: cathepsin L-like isoform 1 [Metaseiulus occidentalis]
 gi|391338872|ref|XP_003743779.1| PREDICTED: cathepsin L-like isoform 2 [Metaseiulus occidentalis]
 gi|391338874|ref|XP_003743780.1| PREDICTED: cathepsin L-like isoform 3 [Metaseiulus occidentalis]
          Length = 331

 Score =  256 bits (653), Expect = 1e-65,   Method: Compositional matrix adjust.
 Identities = 141/338 (41%), Positives = 208/338 (61%), Gaps = 19/338 (5%)

Query: 19  FIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYI 78
           F+I+ +LV  AS  +    T EQ      + +   H + Y+    +  R KIF +N   I
Sbjct: 3   FLILAVLVGAASAAL----TLEQLFDAEWQNFKVHHNKKYEGSTVEAFRKKIFLQNTHLI 58

Query: 79  EKAN---KEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTF-KYQNLSMT 134
            + N    +G  TYKL  NQF D+ + EF +   G      S+R+   ST+ + +++S+ 
Sbjct: 59  ARHNIKHAKGETTYKLKMNQFGDMLHHEFVSTMNGLLR---SNRTYFGSTWIEPESVSL- 114

Query: 135 DVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN 194
             P S+DWR+KGAVTP+KNQ  CG CW+F+   A+EG    ++G L+ LSEQ L+DCST+
Sbjct: 115 --PKSVDWREKGAVTPVKNQGHCGSCWSFSTTGALEGQLFRKTGELVSLSEQNLIDCSTS 172

Query: 195 -GNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGD 253
            GNNGC GG  + AF YI +N GI TE+ YPY+   G C   ++ +A + + + ++PSG+
Sbjct: 173 YGNNGCGGGLMDNAFTYIKENHGIDTEESYPYEGKQGKCRYHKEDSAGRDTGFVDIPSGN 232

Query: 254 EQALLKAV-SMQPVSIAIAAYSTEFQSYKEGIFNGV-CGTQ-LDHAVTIVGFGTTEDGAN 310
           E+AL KA+ ++ PVS+AI A    FQ Y EG++N   C +  LDH V  VG+GTT+DG +
Sbjct: 233 ERALAKALATIGPVSVAIDASHESFQFYHEGVYNPPDCDSHSLDHGVLAVGYGTTDDGQD 292

Query: 311 YWLIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYPL 347
           Y++IKNSWG  WG  GY+ + R+ +  CG+ T++SYPL
Sbjct: 293 YYIIKNSWGERWGQEGYVLMARNSKNECGVATQASYPL 330


>gi|297733654|emb|CBI14901.3| unnamed protein product [Vitis vinifera]
          Length = 273

 Score =  256 bits (653), Expect = 1e-65,   Method: Compositional matrix adjust.
 Identities = 127/259 (49%), Positives = 164/259 (63%), Gaps = 14/259 (5%)

Query: 99  LTNDEFRALYTGYKMPSPSHR-----STTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKN 153
           +TN EFR+ Y G K+    HR        + +F Y+ +    VP S+DWR KGAVTPIK+
Sbjct: 1   MTNHEFRSTYAGSKVNH--HRMFRGSQHAAGSFMYEKVK--SVPPSVDWRKKGAVTPIKD 56

Query: 154 QKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQ 213
           Q +CG CWAF+ V AVEGI  I++  L+ LSEQ+L+DC T+ N GC GG    AF +I +
Sbjct: 57  QGQCGSCWAFSTVVAVEGINHIKTNKLVSLSEQELVDCDTSENQGCNGGLMGYAFEFIKE 116

Query: 214 NQGIATEDEYPYQAVPGTCSAAQ-KPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAA 272
             GI TE  YPY A  GTC  ++       I  +E VP  +E ALLKA + QP+S+AI A
Sbjct: 117 KGGITTEQSYPYTAEDGTCDVSKVNSPVVSIDGHETVPPNNEDALLKAAANQPISVAIDA 176

Query: 273 YSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR 332
             + FQ Y EG+F G CGT LDH V IVG+GTT DG  YW++KNSWG  WG+ GY+++ R
Sbjct: 177 GGSAFQFYSEGVFAGRCGTDLDHGVAIVGYGTTLDGTKYWIVKNSWGTDWGENGYIRMKR 236

Query: 333 ----DEGLCGIGTRSSYPL 347
                EGLCGI   +SYP+
Sbjct: 237 GISAKEGLCGIAVEASYPI 255


>gi|324983200|gb|ADY68475.1| stem bromelain [Ananas comosus]
          Length = 291

 Score =  256 bits (653), Expect = 2e-65,   Method: Compositional matrix adjust.
 Identities = 128/287 (44%), Positives = 183/287 (63%), Gaps = 6/287 (2%)

Query: 18  MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
           +F+ + L V  AS   +SR      +++  E+WMA++GR YKD  EK  R +IFK N+ +
Sbjct: 8   VFLFLFLCVMWASPSAASRDEPSDPMMKRFEEWMAEYGRVYKDNDEKMRRFQIFKNNVNH 67

Query: 78  IEKANKEGNRTYKLGTNQFSDLTNDEFRALYTG-YKMPSPSHRSTTSSTFKYQNLSMTDV 136
           IE  N     +Y LG N+F+D+TN+EF A YTG    P    +    S   + +++++ V
Sbjct: 68  IETFNNRNGNSYTLGINKFTDMTNNEFVAQYTGGISRPLNIEKEPVVS---FDDVNISAV 124

Query: 137 PTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGN 196
             S+DWRD GAVT +K+Q  CG CWAF+A+A VEGI KI +G L+ LSEQ++LDC+ +  
Sbjct: 125 GQSIDWRDYGAVTEVKDQNPCGSCWAFSAIATVEGIYKIVTGYLVSLSEQEVLDCAVS-- 182

Query: 197 NGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQA 256
           NGC GG  + A+ +II N G+A+E +YPYQA  G C+A   P +A I+ Y  V S DE +
Sbjct: 183 NGCDGGFVDNAYDFIISNNGVASEADYPYQAYQGDCAANSWPNSAYITGYSYVRSNDESS 242

Query: 257 LLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFG 303
           +  AV  QP++ AI A    FQ Y  G+F+G CGT L+HA+TI+G+G
Sbjct: 243 MKYAVWNQPIAAAIDASGDNFQYYNGGVFSGPCGTSLNHAITIIGYG 289


>gi|326430490|gb|EGD76060.1| cysteine proteinase [Salpingoeca sp. ATCC 50818]
          Length = 448

 Score =  255 bits (652), Expect = 2e-65,   Method: Compositional matrix adjust.
 Identities = 134/313 (42%), Positives = 187/313 (59%), Gaps = 23/313 (7%)

Query: 46  IHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNR---TYKLGTNQFSDLTND 102
           + + +  +  + Y+   E+  R  +F +N+++I + N E  R   T+ +  NQF+DLTN+
Sbjct: 29  LFDAFKTKFNKVYESAEEEARRFSVFSQNIDFINRHNAEAARGVHTHTVDVNQFADLTNE 88

Query: 103 EFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPT--SLDWRDKGAVTPIKNQKECGCC 160
           E+R LY     P P     T    + +     D P   S+DWR KGAVTPIKNQ +CG C
Sbjct: 89  EYRQLYL---RPYP-----TELLGRERQEVWLDGPNAGSVDWRQKGAVTPIKNQGQCGSC 140

Query: 161 WAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GNNGCLGGSREKAFAYIIQNQGIAT 219
           W+F+   +VEG   I +GNL+ LSEQQL+DCS + GN GC GG  + AF YII N G+ T
Sbjct: 141 WSFSTTGSVEGAHAIATGNLVSLSEQQLVDCSGSFGNQGCNGGLMDNAFKYIISNGGLDT 200

Query: 220 EDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQ 278
           E +YPY A  G C  +++   A  IS Y++VP  +E  L  AV   PVS+AI A    FQ
Sbjct: 201 EQDYPYTARDGVCDKSKESKHAVSISGYKDVPQNNEDQLAAAVEKGPVSVAIEADQQSFQ 260

Query: 279 SYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR---DEG 335
            Y  G+F+G CGT LDH V +VG+ +     +YW++KNSWG +WGD GY+ + R     G
Sbjct: 261 MYSSGVFSGPCGTNLDHGVLVVGYTS-----DYWIVKNSWGASWGDQGYIMMKRGVSSAG 315

Query: 336 LCGIGTRSSYPLA 348
           +CGI  + SYP+A
Sbjct: 316 ICGIAMQPSYPIA 328


>gi|33348834|gb|AAQ16117.1| cathepsin L-like cysteine proteinase A [Rhipicephalus
           haemaphysaloides haemaphysaloides]
          Length = 332

 Score =  255 bits (652), Expect = 2e-65,   Method: Compositional matrix adjust.
 Identities = 137/336 (40%), Positives = 204/336 (60%), Gaps = 14/336 (4%)

Query: 21  IITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEK 80
           ++ L + CA   V+  +   + +    E +   H +SY+  +E+ +R KIF EN   I K
Sbjct: 1   MLRLSLLCAIVAVTVAANSHEILRTQWEAFKTTHKKSYESHMEELLRFKIFTENSLIIAK 60

Query: 81  ANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTF-KYQNLSMTDV 136
            N +   G  +YKLG NQF DL   EF  ++ GY+      R++  STF    N++ + +
Sbjct: 61  HNAKYAKGLVSYKLGMNQFGDLLAHEFAKIFNGYR----GQRTSRGSTFMPPANVNDSSL 116

Query: 137 PTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN-G 195
           P+++DWR KGAVTP+K+Q +CG CWAF+A  ++EG   ++ G L+ LSEQ L+DCS + G
Sbjct: 117 PSTVDWRKKGAVTPVKDQGQCGSCWAFSATGSLEGQHFLKDGELVSLSEQNLVDCSQSFG 176

Query: 196 NNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQ 255
           NNGC GG  + AF YI  N GI  E+ YPY+A+   C   ++   A  + + ++  G E 
Sbjct: 177 NNGCEGGLMDNAFKYIKANDGIDAEESYPYEAMDDKCRFKKEDVGATDTGFVDIEGGSED 236

Query: 256 ALLKAV-SMQPVSIAIAAYSTEFQSYKEGIFNGV-CGT-QLDHAVTIVGFGTTEDGANYW 312
            L KAV ++ P+S+AI A  + FQ Y EG+++   C + +LDH V  VG+G  +DG  YW
Sbjct: 237 DLKKAVATVGPISVAIDAGHSSFQLYSEGVYDEPECSSEELDHGVLAVGYG-VKDGKKYW 295

Query: 313 LIKNSWGNTWGDAGYMKIVRDE-GLCGIGTRSSYPL 347
           L+KNSWG +WGD GY+ + RD+   CGI + +SYPL
Sbjct: 296 LVKNSWGGSWGDNGYILMSRDKNNQCGIASAASYPL 331


>gi|33112581|gb|AAP94046.1| cathepsin-L-like cysteine peptidase 02 [Tenebrio molitor]
          Length = 337

 Score =  255 bits (651), Expect = 2e-65,   Method: Compositional matrix adjust.
 Identities = 140/341 (41%), Positives = 197/341 (57%), Gaps = 19/341 (5%)

Query: 19  FIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMA---QHGRSYKDELEKEMRLKIFKENL 75
           F+I   +    SQ VS           + E+W A    H + Y+ + E+  R+KIF EN 
Sbjct: 3   FLIFLAICVAGSQAVSFFDL-------VQEQWGAFKMTHNKQYQSDTEERFRMKIFMENS 55

Query: 76  EYIEKANK---EGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSS-TFKYQNL 131
             + K NK   +G  ++KLG N+++D+ + EF  +  G+       RS  S  +  +   
Sbjct: 56  HTVAKHNKLYAQGLVSFKLGINKYADMLHHEFVQVLNGFNRTKSGLRSGESDDSVTFLPP 115

Query: 132 SMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDC 191
           +   +P  +DWRDKGAVTP+K+Q +CG CW+F+A  ++EG    +SG L+ LSEQ L+DC
Sbjct: 116 ANVQLPGQIDWRDKGAVTPVKDQGQCGSCWSFSATGSLEGQHFRKSGKLVSLSEQNLVDC 175

Query: 192 STN-GNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVP 250
           S   GNNGC GG  + AF YI  N GI TE  YPY+A    C    K   A    Y ++ 
Sbjct: 176 SEKFGNNGCNGGLMDNAFRYIKANGGIDTEQAYPYKAEDEKCHYKPKNKGATDRGYVDIE 235

Query: 251 SGDEQALLKAV-SMQPVSIAIAAYSTEFQSYKEGI-FNGVCG-TQLDHAVTIVGFGTTED 307
           SG+E  L  AV ++ PVS+AI A    FQ Y  G+ +   C  +QLDH V +VG+GT +D
Sbjct: 236 SGNEDKLQSAVATVGPVSVAIDASHQSFQLYSGGVYYEPECSPSQLDHGVLVVGYGTEDD 295

Query: 308 GANYWLIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYPL 347
           G +YWL+KNSWG +WGD GY+K+ R+ +  CGI T +SYPL
Sbjct: 296 GTDYWLVKNSWGKSWGDQGYIKMARNRDNNCGIATEASYPL 336


>gi|348687948|gb|EGZ27762.1| papain-like cysteine protease C1 [Phytophthora sojae]
          Length = 533

 Score =  255 bits (651), Expect = 2e-65,   Method: Compositional matrix adjust.
 Identities = 134/305 (43%), Positives = 185/305 (60%), Gaps = 11/305 (3%)

Query: 50  WMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRT-YKLGTNQFSDLTNDEFRALY 108
           WM+ HG ++ D LE   RL+ +  N  YI + N E   T  KLG N FS ++ DEF+   
Sbjct: 31  WMSAHGVTFSDALEFARRLENYIANDMYILEHNAENAWTGVKLGHNAFSHMSFDEFKFKM 90

Query: 109 TGYKMPSPSHRSTTSSTFKYQNL-SMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVA 167
           TG  +P        +S  +   L S  +VP+++DW DKG VTP+KNQ  CG CWAF+   
Sbjct: 91  TGLVLPEGYLEQRLAS--RVDGLWSDVEVPSAVDWVDKGGVTPVKNQGMCGSCWAFSTTG 148

Query: 168 AVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQA 227
           AVEG T + SG L+ LSEQ+L+DC  NG+ GC GG  + AF +I  + GI +ED+Y Y+A
Sbjct: 149 AVEGATFVSSGKLLSLSEQELVDCDHNGDMGCNGGLMDHAFQWIEDHGGICSEDDYEYKA 208

Query: 228 VPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNG 287
               C      +  K++ +++V   DE AL  AV+ QPVS+AI A    FQ YK G+FN 
Sbjct: 209 KAQVCRKCD--SVVKVTGFQDVNPQDEHALKVAVAQQPVSVAIEADQKAFQFYKSGVFNL 266

Query: 288 VCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRDE----GLCGIGTRS 343
            CGT+LDH V  VG+G  ++G  +W +KNSWG +WG+ GY+++ R+E    G CGI +  
Sbjct: 267 TCGTRLDHGVLAVGYG-NDNGQKFWKVKNSWGASWGEQGYIRLAREENGPAGQCGIASVP 325

Query: 344 SYPLA 348
           SYP A
Sbjct: 326 SYPFA 330


>gi|442754503|gb|JAA69411.1| Putative cathepsin l-like cysteine proteinase b [Ixodes ricinus]
          Length = 335

 Score =  255 bits (651), Expect = 2e-65,   Method: Compositional matrix adjust.
 Identities = 131/304 (43%), Positives = 189/304 (62%), Gaps = 9/304 (2%)

Query: 52  AQHGRSYKDELEKEMRLKIFKENLEYIEKANKE---GNRTYKLGTNQFSDLTNDEFRALY 108
           A+HG+SY  E E+  RLKI+ EN   I K N++   G   Y +  N+F D+ + EF +  
Sbjct: 32  AKHGKSYVSETEEVFRLKIYMENRHKIAKHNEKYARGEVPYSMAMNEFGDMLHHEFVSTR 91

Query: 109 TGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAA 168
            G+K          S+  + +N+    +P ++DWR KGAVTP+KNQ +CG CWAF+A  +
Sbjct: 92  NGFKRNYKDQPREGSTYLEPENIEDFSLPKTVDWRTKGAVTPVKNQGQCGSCWAFSATGS 151

Query: 169 VEGITKIRSGNLIQLSEQQLLDCSTN-GNNGCLGGSREKAFAYIIQNQGIATEDEYPYQA 227
           +EG    +SG+++ LSEQ L+DCST+ GNNGC GG  + AF YI  N+GI TE  YPY  
Sbjct: 152 LEGQHFRKSGSMVSLSEQNLVDCSTDFGNNGCEGGLMDNAFKYIRANKGIDTEKSYPYNG 211

Query: 228 VPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAV-SMQPVSIAIAAYSTEFQSYKEGIFN 286
             GTC   +    A  S + ++  G E  L KAV ++ P+S+AI A    FQ Y +G+++
Sbjct: 212 TDGTCHFKKSTVGATDSGFVDIKEGSETQLKKAVATVGPISVAIDASHESFQFYSDGVYD 271

Query: 287 GV-CGTQ-LDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRS 343
              C ++ LDH V +VG+GT  +G +YWL+KNSWG TWGD GY+++ R+ +  CGI + +
Sbjct: 272 EPECDSESLDHGVLVVGYGTL-NGTDYWLVKNSWGTTWGDEGYIRMSRNKKNQCGIASSA 330

Query: 344 SYPL 347
           SYPL
Sbjct: 331 SYPL 334


>gi|260516654|gb|ACX43954.1| cysteine protease 1 [Brachiaria hybrid cultivar]
 gi|260516656|gb|ACX43955.1| cysteine protease 1 [Brachiaria hybrid cultivar]
 gi|260516658|gb|ACX43956.1| cysteine protease 1 [Brachiaria hybrid cultivar]
 gi|260516660|gb|ACX43957.1| cysteine protease 1 [Brachiaria hybrid cultivar]
 gi|260516662|gb|ACX43958.1| cysteine protease 2 [Brachiaria hybrid cultivar]
 gi|260516664|gb|ACX43959.1| cysteine protease 2 [Brachiaria hybrid cultivar]
 gi|260516666|gb|ACX43960.1| cysteine protease 2 [Brachiaria hybrid cultivar]
 gi|260516668|gb|ACX43961.1| cysteine protease 2 [Brachiaria hybrid cultivar]
 gi|260516670|gb|ACX43962.1| cysteine protease 2 [Brachiaria hybrid cultivar]
          Length = 338

 Score =  255 bits (651), Expect = 3e-65,   Method: Compositional matrix adjust.
 Identities = 137/320 (42%), Positives = 192/320 (60%), Gaps = 11/320 (3%)

Query: 30  SQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTY 89
           S + S     E  + ++   +M Q+ ++Y    E   R   FK N+E I   N   N +Y
Sbjct: 25  SALFSEEVPSEVMLQDMFTAFMKQYSKAYS-HAEFSSRFNQFKANVETIRLHNTLANASY 83

Query: 90  KLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVT 149
            +G N+F+DL+ +EF+  Y GYK      R    S   +Q +     PTS+DWR   AVT
Sbjct: 84  TMGLNEFADLSFEEFKGKYFGYKHV---EREFARSNNLHQEVEA--APTSIDWRTSNAVT 138

Query: 150 PIKNQKECGCCWAFAAVAAVEGITKIRSGN-LIQLSEQQLLDCSTN-GNNGCLGGSREKA 207
           PIK+Q +CG CWAF+A  ++EG   ++  + L  LSEQQL+DCST+ GN GC GG  + A
Sbjct: 139 PIKDQGQCGSCWAFSATGSIEGAWVLQGKHTLTSLSEQQLVDCSTSYGNAGCNGGLMDYA 198

Query: 208 FAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAV-SMQPV 266
           F YII N+GI  E  YPY+ V G C  +       IS Y++V SGDE +LL AV ++ PV
Sbjct: 199 FEYIIANKGICAESAYPYKGVGGLCQKSCTKVVT-ISGYKDVASGDEASLLNAVGTVGPV 257

Query: 267 SIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAG 326
           S+AI A    FQ Y  G+F+G CG  LDH V  VG+GTT    +YW++KNSWG +WG++G
Sbjct: 258 SVAIEADQAGFQFYSSGVFSGTCGHNLDHGVLAVGYGTT-GSQDYWIVKNSWGTSWGESG 316

Query: 327 YMKIVRDEGLCGIGTRSSYP 346
           Y++++R++  CGI  + SYP
Sbjct: 317 YIRMIRNKNQCGIAIQPSYP 336


>gi|357167707|ref|XP_003581294.1| PREDICTED: actinidain-like [Brachypodium distachyon]
          Length = 358

 Score =  254 bits (650), Expect = 3e-65,   Method: Compositional matrix adjust.
 Identities = 134/317 (42%), Positives = 183/317 (57%), Gaps = 22/317 (6%)

Query: 47  HEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRA 106
           HE+WMA+ GRSY D  EK  R ++F  N  +++  N+ GNRTY LG NQFSDLT+ EF  
Sbjct: 42  HERWMARFGRSYTDAGEKARRQEVFGANARHVDAVNRAGNRTYTLGLNQFSDLTDHEFLQ 101

Query: 107 LYTGYK-------MPSPSHRSTTSST-FKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECG 158
            + GY        +  P       +T   Y      D+P S+DWR KGAVT IKNQ+ CG
Sbjct: 102 QHLGYGRHHGQRGLLLPEEEVMPKATALGYGQ----DMPYSVDWRAKGAVTEIKNQRSCG 157

Query: 159 CCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIA 218
            CWAFAAVAA EG+ KI +GNLI +SEQQ+LDC T   + C  G    A  Y++ + G+ 
Sbjct: 158 SCWAFAAVAATEGLVKIATGNLISMSEQQVLDC-TGDRSSCDSGYISDALRYVVTSGGLQ 216

Query: 219 TEDEYPYQAVPGTCSA---AQKPAAAKISN-YEEVPSGDEQALLKAVSMQPVSIAIAAYS 274
            E  Y Y    G C +   A+  +AA +   +    +GDE AL    + QPV++ + A  
Sbjct: 217 REAAYAYTGQKGACGSRRFARPNSAASVGGVHMATLNGDEGALQGLAARQPVAVIVEASE 276

Query: 275 TEFQSYKEGIFNG--VCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR 332
            +F+ Y  G++ G   CG +L+HA+T+VG+GT      YWL+KN WG  WG+ GYM++ R
Sbjct: 277 PDFRHYSSGVYAGSASCGRELNHALTVVGYGTENGAGEYWLVKNQWGTWWGENGYMRVAR 336

Query: 333 DEGL---CGIGTRSSYP 346
             G    CGI + + YP
Sbjct: 337 RNGAGANCGIASVAFYP 353


>gi|22661|emb|CAA49504.1| papaya proteinase omega [Carica papaya]
          Length = 367

 Score =  254 bits (650), Expect = 3e-65,   Method: Compositional matrix adjust.
 Identities = 136/315 (43%), Positives = 195/315 (61%), Gaps = 12/315 (3%)

Query: 38  THEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFS 97
           T  + ++++   WM  H + Y++  EK  R +IFK+NL YI++ NK+ N +Y+LG N+F+
Sbjct: 39  TSTERLIQLFNSWMLNHNKFYENVDEKLYRFEIFKDNLNYIDETNKK-NNSYRLGLNEFA 97

Query: 98  DLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKEC 157
           DL+NDEF   Y G  + +   +S      ++ N  + ++P ++DWR KGAVTP+++Q  C
Sbjct: 98  DLSNDEFNEKYVGSLIDATIEQSYDE---EFINEDIVNLPENVDWRKKGAVTPVRHQGSC 154

Query: 158 GCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGI 217
           G CWAF+AVA VEGI KIR+G L++LSEQ+L+DC    ++GC GG    A  Y+ +N GI
Sbjct: 155 GSCWAFSAVATVEGINKIRTGKLVELSEQELVDCERR-SHGCKGGYPPYALEYVAKN-GI 212

Query: 218 ATEDEYPYQAVPGTCSAAQKPAA-AKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTE 276
               +YPY+A  GTC A Q      K S    V   +E  LL A++ QPVS+ + +    
Sbjct: 213 HLRSKYPYKAKQGTCRAKQVGGPIVKTSGVGRVQPNNEGNLLNAIAKQPVSVVVESKGRP 272

Query: 277 FQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR---- 332
           FQ YK GIF G CGT++DHAVT VG+G +       LIKNSWG  WG+ GY++I R    
Sbjct: 273 FQLYKGGIFEGPCGTKVDHAVTAVGYGKSGGKGYI-LIKNSWGTAWGEKGYIRIKRAPGN 331

Query: 333 DEGLCGIGTRSSYPL 347
             G+CG+   S YP+
Sbjct: 332 SPGVCGLYKSSYYPI 346


>gi|189525868|ref|XP_001341714.2| PREDICTED: cathepsin L1-like isoform 1 [Danio rerio]
          Length = 336

 Score =  254 bits (650), Expect = 3e-65,   Method: Compositional matrix adjust.
 Identities = 136/340 (40%), Positives = 206/340 (60%), Gaps = 17/340 (5%)

Query: 20  IIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIE 79
           ++  LLV+ +   V + S+ +  + +    W +QHG+SY +++E   R+ I++ENL  IE
Sbjct: 1   MMFALLVTLSISAVFAASSIDIQLDDHWNSWKSQHGKSYHEDVEVGRRM-IWEENLRKIE 59

Query: 80  KANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDV 136
           + N E   GN T+K+G NQF D+TN+EFR    GYK       + TS    +   S    
Sbjct: 60  QHNFEYSYGNHTFKMGMNQFGDMTNEEFRQAMNGYKHDP----NQTSQGPLFMEPSFFAA 115

Query: 137 PTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS-TNG 195
           P  +DWR +G VTP+K+QK+CG CW+F++  A+EG    ++G LI +SEQ L+DCS   G
Sbjct: 116 PQQVDWRQRGYVTPVKDQKQCGSCWSFSSTGALEGQLFRKTGKLISMSEQNLVDCSRPQG 175

Query: 196 NNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPG-TCSAAQKPAAAKISNYEEVPSGDE 254
           N GC GG  ++AF Y+ +N+G+ +E  YPY A     C    +   AKI+ + ++PSG+E
Sbjct: 176 NQGCNGGLMDQAFQYVKENKGLDSEQSYPYLARDDLPCRYDPRFNVAKITGFVDIPSGNE 235

Query: 255 QALLKAV-SMQPVSIAIAAYSTEFQSYKEGIF--NGVCGTQLDHAVTIVGF---GTTEDG 308
            AL+ AV ++ PVS+AI A     Q Y+ GI+       ++LDHAV +VG+   G    G
Sbjct: 236 LALMNAVAAVGPVSVAIDASHQSLQFYQSGIYYERACSSSRLDHAVLVVGYGYQGADVAG 295

Query: 309 ANYWLIKNSWGNTWGDAGYMKIVRDE-GLCGIGTRSSYPL 347
             YW++KNSW + WGD GY+ + +D+   CG+ T++SYPL
Sbjct: 296 NRYWIVKNSWSDKWGDKGYIYMAKDKNNHCGVATKASYPL 335


>gi|5081735|gb|AAD39513.1|AF147207_1 cathepsin L-like protease precursor [Artemia franciscana]
          Length = 338

 Score =  254 bits (650), Expect = 3e-65,   Method: Compositional matrix adjust.
 Identities = 138/335 (41%), Positives = 204/335 (60%), Gaps = 11/335 (3%)

Query: 21  IITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEK 80
           +I LL +   Q+ ++ S       E H  + A H + Y  +LE++ R+KI+ EN   + K
Sbjct: 6   LIFLLGAVLVQLSAALSLTNLLADEWH-LFKATHKKEYPSQLEEKFRMKIYLENKHKVAK 64

Query: 81  AN---KEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVP 137
            N   ++G ++Y++  N+F DL + EFR++  GY+     + S   STF +   +  +VP
Sbjct: 65  HNILYEKGEKSYQVAMNKFGDLLHHEFRSIMNGYQHKK-QNSSRAESTFTFMEPANVEVP 123

Query: 138 TSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GN 196
            S+DWR KGA+TP+K+Q +CG CWAF++  A+EG T  ++G LI LSEQ L+DCS   GN
Sbjct: 124 ESVDWRVKGAITPVKDQGQCGSCWAFSSTGALEGQTFRKTGKLISLSEQNLIDCSGKYGN 183

Query: 197 NGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQA 256
            GC GG  ++AF YI  N+GI TE+ YPY+A    C    +   A    +  +PSG+E  
Sbjct: 184 EGCNGGLMDQAFQYIKDNKGIDTENTYPYEAEDNVCRYNPRNRGAIDRGFVHIPSGEEDK 243

Query: 257 LLKAV-SMQPVSIAIAAYSTEFQSYKEGI-FNGVCGT-QLDHAVTIVGFGTTEDGANYWL 313
           L  AV ++ PVS+AI A    FQ Y +G+ +   C +  LDH V +VG+G +++G +YWL
Sbjct: 244 LKAAVATVGPVSVAIDASHESFQFYSKGVYYEPSCDSDDLDHGVLVVGYG-SDNGKDYWL 302

Query: 314 IKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYPL 347
           +KNSW   WGD GY+KI R+ +  CGI T +SYPL
Sbjct: 303 VKNSWSEHWGDEGYIKIARNRKNHCGIATAASYPL 337


>gi|156739275|ref|NP_001096585.1| cathepsin L1-like precursor [Danio rerio]
 gi|156230123|gb|AAI52285.1| MGC174857 protein [Danio rerio]
          Length = 335

 Score =  254 bits (650), Expect = 3e-65,   Method: Compositional matrix adjust.
 Identities = 142/341 (41%), Positives = 207/341 (60%), Gaps = 20/341 (5%)

Query: 20  IIITLLVS-CASQVVSSRSTHEQSVVEIH-EKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
           ++  LLV+ C S V ++ S   Q  ++ H   W +QHG+SY ++LE   R+ I++ENL  
Sbjct: 1   MMFALLVTLCISAVFTAPSIDIQ--LDDHWNSWKSQHGKSYHEDLEVGRRM-IWEENLRK 57

Query: 78  IEKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMT 134
           IE+ N E   GN T+K+G NQF D+TN+EFR    GYK       + TS    +   S  
Sbjct: 58  IEQHNFEYSYGNHTFKMGMNQFGDMTNEEFRQAMNGYKHDP----NRTSQGPLFMEPSFF 113

Query: 135 DVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS-T 193
             P  +DWR +G VTP+K+QK+CG CW+F++  A+EG    ++G LI +SEQ L+DCS  
Sbjct: 114 AAPQQVDWRQRGYVTPVKDQKQCGSCWSFSSTGALEGQLFRKTGKLISMSEQNLVDCSRP 173

Query: 194 NGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPG-TCSAAQKPAAAKISNYEEVPSG 252
            GN GC GG  ++AF Y+ +N+G+ +E  YPY A     C    +   AKI+ + ++P G
Sbjct: 174 QGNQGCNGGIMDQAFQYVKENKGLDSEQSYPYLARDDLPCRYDPRFNVAKITGFVDIPRG 233

Query: 253 DEQALLKAV-SMQPVSIAIAAYSTEFQSYKEGI-FNGVCGTQLDHAVTIVGF---GTTED 307
           +E AL+ AV ++ PVS+AI A     Q Y+ GI +   C ++LDHAV +VG+   G    
Sbjct: 234 NELALMNAVAAVGPVSVAIDASHQSLQFYQSGIYYERACTSRLDHAVLVVGYGYQGADVA 293

Query: 308 GANYWLIKNSWGNTWGDAGYMKIVRDE-GLCGIGTRSSYPL 347
           G  YW++KNSW + WGD GY+ + +D+   CGI T +SYPL
Sbjct: 294 GNRYWIVKNSWSDKWGDKGYIYMAKDKNNHCGIATMASYPL 334


>gi|242093944|ref|XP_002437462.1| hypothetical protein SORBIDRAFT_10g027570 [Sorghum bicolor]
 gi|241915685|gb|EER88829.1| hypothetical protein SORBIDRAFT_10g027570 [Sorghum bicolor]
          Length = 366

 Score =  254 bits (649), Expect = 4e-65,   Method: Compositional matrix adjust.
 Identities = 135/329 (41%), Positives = 193/329 (58%), Gaps = 26/329 (7%)

Query: 40  EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDL 99
           E+S+  ++E+W A +  + +D  EK  R  +FKEN   I + N +GN TY LG N+FSD+
Sbjct: 41  EESLWALYERWCAHYNMA-RDHGEKTRRFDLFKENARRIYEHNHQGNATYTLGLNRFSDM 99

Query: 100 TNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTD----------------VPTSLDWR 143
           T++EF     G  + +P           + +    D                 P ++DWR
Sbjct: 100 TDEEFNRSPYGGCLTAPRMSDDEIEELHHHHHQQEDDGSFNLTHGSGGGKLGAPPAVDWR 159

Query: 144 DKGAVTPIKNQ-KECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGG 202
            + AVT +K+Q   CG CWAF+A+AAVEGI  IR+ NL+ LSEQQL+DC    N+GC GG
Sbjct: 160 GR-AVTRVKDQGPTCGSCWAFSAIAAVEGINAIRTRNLVPLSEQQLVDCDKL-NHGCNGG 217

Query: 203 SREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVS 262
               AF+++++N+G+  E  YPY    G C     P    I  Y+ VP  D  AL+ AV+
Sbjct: 218 LMTTAFSFVVRNRGVVPEGAYPYMGREGRCKHVMAPPVT-IYGYQRVPRFDANALMNAVA 276

Query: 263 MQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTW 322
            QPVS+AI A S EF+ Y+ G+FNG CG +L HA T VG+G  + G  +W++KNSWG  W
Sbjct: 277 AQPVSVAIEASSFEFRHYQGGVFNGNCGGRLGHAATAVGYG-ADAGGPFWIVKNSWGPGW 335

Query: 323 GDAGYMKIVRD----EGLCGIGTRSSYPL 347
           G+ GY++I R+    +G+CGI T +SYP+
Sbjct: 336 GEGGYVRISRNTPVRQGVCGILTENSYPV 364


>gi|156739281|ref|NP_001096588.1| cathepsin L1-like precursor [Danio rerio]
 gi|166158351|ref|NP_001107526.1| uncharacterized protein LOC100135391 precursor [Xenopus (Silurana)
           tropicalis]
 gi|326672305|ref|XP_003199634.1| PREDICTED: cathepsin L1-like [Danio rerio]
 gi|156230096|gb|AAI52237.1| MGC174155 protein [Danio rerio]
 gi|163916362|gb|AAI57707.1| LOC100135391 protein [Xenopus (Silurana) tropicalis]
          Length = 335

 Score =  254 bits (649), Expect = 4e-65,   Method: Compositional matrix adjust.
 Identities = 137/339 (40%), Positives = 204/339 (60%), Gaps = 16/339 (4%)

Query: 20  IIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIE 79
           ++  LLV+     V + S+ +  + +    W +QHG+SY +++E   R+ I++ENL  IE
Sbjct: 1   MMFALLVTLCISAVFAASSIDIQLDDHWNSWKSQHGKSYHEDVEVGRRM-IWEENLRKIE 59

Query: 80  KANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDV 136
           + N E   GN T+K+G NQF D+TN+EFR    GYK       + TS    +   S    
Sbjct: 60  QHNFEYSYGNHTFKMGMNQFGDMTNEEFRQAMNGYKHDP----NRTSQGPLFMEPSFFAA 115

Query: 137 PTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS-TNG 195
           P  +DWR +G VTP+K+QK+CG CW+F++  A+EG    ++G LI +SEQ L+DCS   G
Sbjct: 116 PQQVDWRQRGYVTPVKDQKQCGSCWSFSSTGALEGQLFRKTGKLISMSEQNLVDCSRPQG 175

Query: 196 NNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPG-TCSAAQKPAAAKISNYEEVPSGDE 254
           N GC GG  ++AF Y+ +N+G+ +E  YPY A     C    +   AKI+ + ++P G+E
Sbjct: 176 NQGCNGGIMDQAFQYVKENKGLDSEQSYPYLARDDLPCRYDPRFNVAKITGFVDIPRGNE 235

Query: 255 QALLKAV-SMQPVSIAIAAYSTEFQSYKEGI-FNGVCGTQLDHAVTIVGF---GTTEDGA 309
            AL+ AV ++ PVS+AI A     Q Y+ GI +   C ++LDHAV +VG+   G    G 
Sbjct: 236 LALMNAVAAVGPVSVAIDASHQSLQFYQSGIYYERACTSRLDHAVLVVGYGYQGADVAGN 295

Query: 310 NYWLIKNSWGNTWGDAGYMKIVRDE-GLCGIGTRSSYPL 347
            YW++KNSW + WGD GY+ + +D+   CGI T +SYPL
Sbjct: 296 RYWIVKNSWSDKWGDKGYIYMAKDKNNHCGIATMASYPL 334


>gi|1483570|emb|CAA68066.1| cathepsin l [Litopenaeus vannamei]
          Length = 328

 Score =  254 bits (649), Expect = 4e-65,   Method: Compositional matrix adjust.
 Identities = 133/313 (42%), Positives = 188/313 (60%), Gaps = 16/313 (5%)

Query: 46  IHEKWM---AQHGRSYKDELEKEMRLKIFKENLEYIEKAN---KEGNRTYKLGTNQFSDL 99
           + ++W    A+HGR Y    E+  RL +F++N ++I+  N   + G  T+ L  NQF D+
Sbjct: 20  LRQQWRDFKAEHGRRYASVQEERYRLSVFEQNQQFIDDHNARFENGEVTFTLQMNQFGDM 79

Query: 100 TNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGC 159
           T++EF A   G+ +  PS R T              +P  +DWR KGAVTP+K+QK+CG 
Sbjct: 80  TSEEFTATMNGF-LNVPSRRPTAILRADPDET----LPKEVDWRTKGAVTPVKDQKQCGS 134

Query: 160 CWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GNNGCLGGSREKAFAYIIQNQGIA 218
           CWAF+   ++EG   ++ G L+ LSEQ L+DCS   GN GC+GG  ++AF YI  N+GI 
Sbjct: 135 CWAFSTTGSLEGQHFLKDGKLVSLSEQNLVDCSDKFGNMGCMGGLMDQAFRYIKANKGID 194

Query: 219 TEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAV-SMQPVSIAIAAYSTEF 277
           TED YPY+A  G C        A  + Y +V  G E AL KAV ++ P+S+AI A    F
Sbjct: 195 TEDSYPYEAQDGKCRFDASNVGATDTGYVDVEHGSESALKKAVATIGPISVAIDASQPSF 254

Query: 278 QSYKEGIF--NGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD-E 334
           Q Y +G++   G   T LDH V  VG+G TE G  YWL+KNSW  +WG+ GY+++ RD +
Sbjct: 255 QFYHDGVYYEEGCSSTMLDHGVLAVGYGETEKGEAYWLVKNSWNTSWGNKGYIQMSRDKK 314

Query: 335 GLCGIGTRSSYPL 347
             CGI +++SYPL
Sbjct: 315 NNCGIASQASYPL 327


>gi|326672302|ref|XP_003199633.1| PREDICTED: cathepsin L1-like [Danio rerio]
 gi|157423549|gb|AAI53506.1| Im:6910535 [Danio rerio]
          Length = 335

 Score =  254 bits (649), Expect = 4e-65,   Method: Compositional matrix adjust.
 Identities = 141/341 (41%), Positives = 207/341 (60%), Gaps = 20/341 (5%)

Query: 20  IIITLLVS-CASQVVSSRSTHEQSVVEIH-EKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
           ++  LLV+ C S V ++ S   Q  ++ H   W +QHG+SY +++E   R+ I++ENL  
Sbjct: 1   MMFALLVTLCISAVFTAPSIDIQ--LDDHWNSWKSQHGKSYHEDVEVGRRM-IWEENLRK 57

Query: 78  IEKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMT 134
           IE+ N E   GN T+K+G NQF D+TN+EFR    GYK       + TS    +   S  
Sbjct: 58  IEQHNFEYSYGNHTFKMGMNQFGDMTNEEFRQAMNGYKQDP----NRTSKGALFMEPSFF 113

Query: 135 DVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS-T 193
             P  +DWR +G VTP+K+QK+CG CW+F++  A+EG    ++G LI +SEQ L+DCS  
Sbjct: 114 AAPQQVDWRQRGYVTPVKDQKQCGSCWSFSSTGALEGQLFRKTGKLISMSEQNLVDCSRP 173

Query: 194 NGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPG-TCSAAQKPAAAKISNYEEVPSG 252
            GN GC GG  ++AF Y+ +N+G+ +E  YPY A     C    +   AKI+ + ++P G
Sbjct: 174 QGNQGCNGGIMDQAFQYVKENKGLDSEQSYPYLARDDLPCRYDPRFNVAKITGFVDIPKG 233

Query: 253 DEQALLKAV-SMQPVSIAIAAYSTEFQSYKEGI-FNGVCGTQLDHAVTIVGF---GTTED 307
           +E AL+ AV ++ PVS+AI A     Q Y+ GI +   C ++LDHAV +VG+   G    
Sbjct: 234 NELALMNAVAAVGPVSVAIDASHQSLQFYQSGIYYERACTSRLDHAVLVVGYGYQGADVA 293

Query: 308 GANYWLIKNSWGNTWGDAGYMKIVRDE-GLCGIGTRSSYPL 347
           G  YW++KNSW + WGD GY+ + +D+   CGI T +SYPL
Sbjct: 294 GNRYWIVKNSWSDKWGDKGYIYMAKDKNNHCGIATMASYPL 334


>gi|356545112|ref|XP_003540989.1| PREDICTED: LOW QUALITY PROTEIN: KDEL-tailed cysteine endopeptidase
           CEP1-like [Glycine max]
          Length = 400

 Score =  254 bits (648), Expect = 5e-65,   Method: Compositional matrix adjust.
 Identities = 130/305 (42%), Positives = 189/305 (61%), Gaps = 4/305 (1%)

Query: 26  VSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEG 85
           V+C  Q   S+S  E    E HEKWMAQ+G+ Y+D  E E R +IFK N+++IE  N  G
Sbjct: 95  VTCGRQC-RSKSRLEACTSERHEKWMAQYGKVYEDAAEMEKRFQIFKNNVQFIESFNVAG 153

Query: 86  NRTYKLGTNQFSDLTNDEFRALY-TGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRD 144
           ++ + +  NQF DL ++EF+AL   G +  S    +T  ++F+Y ++ +T++P ++D R 
Sbjct: 154 DKPFNIRINQFPDLHDEEFKALLINGQRKVSGVETATEETSFRYGSV-VTNIPATMDGRK 212

Query: 145 KGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSR 204
           KG VTPIK+Q   G CWA +AVAA+EGI +I +  L+ LS+Q+L+D     + GC+GG  
Sbjct: 213 KGVVTPIKDQGIIGSCWALSAVAAIEGIHQITTSKLMFLSKQKLVDSVKGESEGCIGGYV 272

Query: 205 EKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVSMQ 264
           E AF +I++  GI +E  YPY+ V       +  + A I  YE+VPS +++ALLK V+ Q
Sbjct: 273 EDAFEFIVKKGGILSETHYPYKGVNXCKVEKETHSVAHIKGYEKVPSNNKKALLKVVANQ 332

Query: 265 PVSIAIAAYSTEFQSYKEGIFNGV-CGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWG 323
           PVS+ I   +  F+ Y   IFN   CG+  +H V +VG+G   DGA YW +KNSWG  WG
Sbjct: 333 PVSVYIDVGAHAFKYYSSEIFNARNCGSDPNHVVAVVGYGKALDGAKYWPVKNSWGTEWG 392

Query: 324 DAGYM 328
              YM
Sbjct: 393 GKWYM 397


>gi|325303202|tpg|DAA34687.1| TPA_inf: cathepsin L-like cysteine proteinase B [Amblyomma
           variegatum]
          Length = 337

 Score =  254 bits (648), Expect = 6e-65,   Method: Compositional matrix adjust.
 Identities = 134/337 (39%), Positives = 203/337 (60%), Gaps = 12/337 (3%)

Query: 19  FIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYI 78
           F+++  L  CA+ + ++  TH++ V      + A HG+ Y+ E E+  RLKI+ EN   I
Sbjct: 4   FVVLCFL--CAA-MTAAAITHQELVGAEWSAFKALHGKEYQSETEEYYRLKIYMENRMMI 60

Query: 79  EKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTD 135
            + N++      +YKL  N++ D+ + EF +   G++    S     S   + + +    
Sbjct: 61  ARHNEKYANNKVSYKLAMNEYGDMLHHEFVSTRNGFRRDYRSKPRQGSFYIEPEGIEDKH 120

Query: 136 VPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN- 194
           +P ++DWR KGAVTP+KNQ +CG CWAF+   ++EG    +SG+++ LSEQ L+DCST  
Sbjct: 121 LPKTVDWRKKGAVTPVKNQGQCGSCWAFSTTGSLEGQHFRKSGDMVSLSEQNLVDCSTAF 180

Query: 195 GNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDE 254
           GNNGC GG  + AF YI  N GI TE  YPY    GTC   +    A  + + ++P G+E
Sbjct: 181 GNNGCEGGLMDNAFKYIKANGGIDTEKSYPYNGTDGTCHFKKSDVGATDTGFVDIPEGNE 240

Query: 255 QALLKAV-SMQPVSIAIAAYSTEFQSYKEGIFNGV-CGTQ-LDHAVTIVGFGTTEDGANY 311
             L KAV ++ P+S+AI A    FQ Y +G+++   C ++ LDH V +VG+GT +D  +Y
Sbjct: 241 HLLKKAVATVGPISVAIDASHQSFQFYSQGVYDEPECSSENLDHGVLVVGYGTKDD-QDY 299

Query: 312 WLIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYPL 347
           WL+KNSWG TWGD GY+ + R+ +  CGI + +SYPL
Sbjct: 300 WLVKNSWGTTWGDGGYIYMTRNKDNQCGIASSASYPL 336


>gi|125606204|gb|EAZ45240.1| hypothetical protein OsJ_29883 [Oryza sativa Japonica Group]
          Length = 350

 Score =  254 bits (648), Expect = 6e-65,   Method: Compositional matrix adjust.
 Identities = 136/327 (41%), Positives = 192/327 (58%), Gaps = 26/327 (7%)

Query: 43  VVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTND 102
           +++  E+WM +HGR+Y D  EK+ R ++++ N+E +E  N   N  YKL  N+F+DLTN+
Sbjct: 27  MLDRFEQWMIRHGRAYTDAGEKQRRFEVYRRNVELVETFNSMSN-GYKLADNKFADLTNE 85

Query: 103 EFRALYTGYK--MPSPSHRSTTSSTFKYQNLSMTDV-PTSLDWRDKGAVTPIKNQKEC-- 157
           EFRA   G++  +  P   +T S+       S  D+ P S+DWR+KGAV  I   K C  
Sbjct: 86  EFRAKMLGFRPHVTIPQISNTCSADIAMPGESSDDILPKSVDWRNKGAV--INRWKICVD 143

Query: 158 -GCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQG 216
            G CWAF+AVAA+EGI +I++G L+ LSEQ+L+DC      GC GG    AF +++ N G
Sbjct: 144 AGSCWAFSAVAAIEGINQIKNGELVSLSEQELVDCDDEAV-GCGGGYMSWAFEFVVGNHG 202

Query: 217 IATEDEYPYQAVPGTCSAAQ-KPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYST 275
           + TE  YPY A  G C AA+   +A  I+ Y  V    E  L +A + QPVS+A+   S 
Sbjct: 203 LTTEASYPYHAANGACQAAKLNQSAVAIAGYRNVTPSSEPDLARAAAAQPVSVAVDGGSF 262

Query: 276 EFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGAN----------YWLIKNSWGNTWGDA 325
            FQ Y  G++ G C   ++H VT+VG+G +E   +          YW++KNSWG  WGDA
Sbjct: 263 MFQLYGSGVYTGPCTADVNHGVTVVGYGESEPKTDGGGAAKGGEKYWIVKNSWGAEWGDA 322

Query: 326 GYMKIVRD-----EGLCGIGTRSSYPL 347
           GY+ + RD      GLCGI    SYP+
Sbjct: 323 GYILMQRDVAGLASGLCGIALLPSYPV 349


>gi|345320664|ref|XP_001521690.2| PREDICTED: cathepsin L1-like [Ornithorhynchus anatinus]
          Length = 388

 Score =  253 bits (647), Expect = 6e-65,   Method: Compositional matrix adjust.
 Identities = 150/355 (42%), Positives = 213/355 (60%), Gaps = 23/355 (6%)

Query: 10  SFKINTTP----MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKE 65
           S +++  P    M +++ LL  C    VS+    +  + +  E W   H +SY  + E+ 
Sbjct: 39  SLQVSPGPWGQAMKLLVCLLSLCWGLAVSA-PLGDSELDKHWELWKNWHQKSYH-KAEEG 96

Query: 66  MRLKIFKENLEYIEKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTT 122
            R  +++ENL+ IE  N E   G  TY+LG NQF DLTN+EF+ +    +  S  +R   
Sbjct: 97  WRRMVWEENLKVIELHNLEQSLGLHTYQLGMNQFGDLTNEEFQQMLISERHFSEGNRING 156

Query: 123 SSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQ 182
           S+   +  ++   VPTS+DWRD G VTP+KNQ  CG CWAF+   A+EG    +SG L+ 
Sbjct: 157 SA---FLEVNYVQVPTSVDWRDHGYVTPVKNQGHCGSCWAFSTTGALEGQLFRKSGRLVS 213

Query: 183 LSEQQLLDCS-TNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKP--A 239
           LSEQ L+DCS   GN GC GG  + AF YI++N+GI +ED YPY A   T   A KP  A
Sbjct: 214 LSEQNLVDCSWQQGNQGCNGGIVDFAFQYILENRGIDSEDCYPYTA-KDTAQCAFKPECA 272

Query: 240 AAKISNYEEVPSGDEQALLKAV-SMQPVSIAIAAYSTEFQSYKEGIF-NGVCGTQ-LDHA 296
            A+++ + ++P   E+AL+KAV ++ PVS+AI A+ T F+ Y+ GIF    C ++ L+HA
Sbjct: 273 TARVTGFVDIPPHSEEALMKAVATVGPVSVAIDAHPTSFRFYQSGIFYEPKCSSERLNHA 332

Query: 297 VTIVGF---GTTEDGANYWLIKNSWGNTWGDAGYMKIVRDEG-LCGIGTRSSYPL 347
           V +VG+   G  E G  YW++KNSWG  WGD GY  + +D G  CGI T +SYPL
Sbjct: 333 VLVVGYGYEGEDEAGKKYWIVKNSWGKQWGDHGYFYLSKDRGNHCGIATTASYPL 387


>gi|115464789|ref|NP_001055994.1| Os05g0508300 [Oryza sativa Japonica Group]
 gi|48475189|gb|AAT44258.1| hypothetical protein [Oryza sativa Japonica Group]
 gi|113579545|dbj|BAF17908.1| Os05g0508300 [Oryza sativa Japonica Group]
          Length = 450

 Score =  253 bits (647), Expect = 7e-65,   Method: Compositional matrix adjust.
 Identities = 129/303 (42%), Positives = 188/303 (62%), Gaps = 7/303 (2%)

Query: 48  EKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRAL 107
           E W A+HGRSY    E+  RL  F +N  ++  A+     +Y L  N F+DLT+DEFRA 
Sbjct: 39  EAWCAEHGRSYATPGERAARLAAFADNAAFV-AAHNGAPASYALALNAFADLTHDEFRAA 97

Query: 108 YTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVA 167
             G    +        + +   +  +  VP ++DWR  GAVT +K+Q  CG CW+F+A  
Sbjct: 98  RLGRLAAAGGPGRDGGAPYLGVDGGVGAVPDAVDWRQSGAVTKVKDQGSCGACWSFSATG 157

Query: 168 AVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQA 227
           A+EGI KI++G+LI LSEQ+L+DC  + N+GC GG  + A+ ++++N GI TE +YPY+ 
Sbjct: 158 AMEGINKIKTGSLISLSEQELIDCDRSYNSGCGGGLMDYAYKFVVKNGGIDTEADYPYRE 217

Query: 228 VPGTCSAAQ-KPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFN 286
             GTC+  + K     I  Y++VP+ +E  LL+AV+ QPVS+ I   +  FQ Y +GIF+
Sbjct: 218 TDGTCNKNKLKRRVVTIDGYKDVPANNEDMLLQAVAQQPVSVGICGSARAFQLYSKGIFD 277

Query: 287 GVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTR 342
           G C T LDHA+ IVG+G +E G +YW++KNSWG +WG  GYM + R+     G+CGI   
Sbjct: 278 GPCPTSLDHAILIVGYG-SEGGKDYWIVKNSWGESWGMKGYMYMHRNTGNSNGVCGINQM 336

Query: 343 SSY 345
            S+
Sbjct: 337 PSF 339


>gi|151573016|gb|ABS17683.1| cathepsin L-1 [Artemia persimilis]
          Length = 334

 Score =  253 bits (647), Expect = 8e-65,   Method: Compositional matrix adjust.
 Identities = 135/335 (40%), Positives = 205/335 (61%), Gaps = 11/335 (3%)

Query: 21  IITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEK 80
           +I LL +   Q+ ++ S       E H  + A H + Y  +LE++ R+KI+ EN   + K
Sbjct: 2   LIFLLGAVFVQLSAALSLTNLLADEWH-LFKATHKKEYPSQLEEKFRMKIYLENKHKVAK 60

Query: 81  AN---KEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVP 137
            N   ++G ++Y++  N+F DL + EFR++  GY+     + S   STF +   +  +VP
Sbjct: 61  HNILFEKGEKSYQVAMNKFGDLLHHEFRSIMNGYQHKK-QNSSRAESTFTFMEPANVEVP 119

Query: 138 TSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GN 196
            S+DWR+KGA+TP+K+Q +CG CWAF++  A+EG T  ++G L+ L EQ L+DCS   GN
Sbjct: 120 ESVDWREKGAITPVKDQGQCGPCWAFSSTGALEGQTFRKTGKLVSLREQNLIDCSGKYGN 179

Query: 197 NGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQA 256
            GC GG  ++AF YI  N+GI TE+ YPY+A    C    +   A    + ++PSG+E  
Sbjct: 180 EGCNGGLMDQAFQYIKDNKGIDTENTYPYEAEDDVCRYNPRNRGAVDRGFVDIPSGEEDK 239

Query: 257 LLKAV-SMQPVSIAIAAYSTEFQSYKEGI-FNGVCGT-QLDHAVTIVGFGTTEDGANYWL 313
           L  AV ++ PVS+AI A    FQ Y +G+ +   C +  LDH V +VG+G +++G +YWL
Sbjct: 240 LKAAVATVGPVSVAIDASHESFQFYSKGVYYEPSCDSDDLDHGVLVVGYG-SDNGKDYWL 298

Query: 314 IKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYPL 347
           +KNSW   WGD GY+KI R+ +  CG+ T +SYPL
Sbjct: 299 VKNSWSEHWGDQGYIKIARNRKNHCGVATAASYPL 333


>gi|156739289|ref|NP_001096592.1| uncharacterized protein LOC569326 precursor [Danio rerio]
 gi|156230119|gb|AAI52283.1| Im:6910535 protein [Danio rerio]
          Length = 335

 Score =  253 bits (647), Expect = 8e-65,   Method: Compositional matrix adjust.
 Identities = 141/342 (41%), Positives = 206/342 (60%), Gaps = 21/342 (6%)

Query: 18  MFIIITLLVSCASQVVSSRSTHEQSVVEIH-EKWMAQHGRSYKDELEKEMRLKIFKENLE 76
           MF ++  L  C S V ++ S   Q  ++ H   W +QHG+SY +++E   R+ I++ENL 
Sbjct: 2   MFALLITL--CISAVFTAPSIDIQ--LDDHWNSWKSQHGKSYHEDVEVGRRM-IWEENLR 56

Query: 77  YIEKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSM 133
            IE+ N E   GN T+K+G NQF D+TN+EFR    GYK       + TS    +   S 
Sbjct: 57  KIEQHNFEYSLGNHTFKMGMNQFGDMTNEEFRQAMNGYKQDP----NRTSKGALFMEPSF 112

Query: 134 TDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS- 192
              P  +DWR +G VTP+K+QK+CG CW+F++  A+EG    ++G LI +SEQ L+DCS 
Sbjct: 113 FAAPQQVDWRQRGYVTPVKDQKQCGSCWSFSSTGALEGQLFRKTGKLISMSEQNLVDCSR 172

Query: 193 TNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPG-TCSAAQKPAAAKISNYEEVPS 251
             GN GC GG  ++AF Y+ +N+G+ +E  YPY A     C    +   AKI+ + ++P 
Sbjct: 173 PQGNQGCNGGIMDQAFQYVKENKGLDSEQSYPYLARDDLPCRYDPRFNVAKITGFVDIPR 232

Query: 252 GDEQALLKAV-SMQPVSIAIAAYSTEFQSYKEGI-FNGVCGTQLDHAVTIVGF---GTTE 306
           G+E AL+ AV ++ PVS+AI A     Q Y+ GI +   C ++LDHAV +VG+   G   
Sbjct: 233 GNELALMNAVAAVGPVSVAIDASHQSLQFYQSGIYYERACTSRLDHAVLVVGYGYQGADV 292

Query: 307 DGANYWLIKNSWGNTWGDAGYMKIVRDE-GLCGIGTRSSYPL 347
            G  YW++KNSW + WGD GY+ + +D+   CGI T +SYPL
Sbjct: 293 AGNRYWIVKNSWSDKWGDKGYIYMAKDKNNHCGIATMASYPL 334


>gi|293342579|ref|XP_001065885.2| PREDICTED: cathepsin L1 [Rattus norvegicus]
 gi|293354415|ref|XP_225137.5| PREDICTED: cathepsin L1 [Rattus norvegicus]
 gi|149039747|gb|EDL93863.1| rCG24278 [Rattus norvegicus]
          Length = 330

 Score =  253 bits (647), Expect = 8e-65,   Method: Compositional matrix adjust.
 Identities = 130/338 (38%), Positives = 199/338 (58%), Gaps = 18/338 (5%)

Query: 16  TPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENL 75
           TP+F++ TL +     ++S+  TH+ S   + E+W  +HG++Y    E + R  +++ N+
Sbjct: 2   TPIFLLATLCLG----MISAAPTHDPSFDTVWEEWKTKHGKTYNTNEEGQKR-AVWENNM 56

Query: 76  EYIEKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLS 132
           + I   N++   G   + L  N F DLTN EFR L TG++   P   +     F      
Sbjct: 57  KMINLHNEDYLKGKHGFSLEMNAFGDLTNTEFRELMTGFQSMGPKETTIFREPF------ 110

Query: 133 MTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS 192
           + D+P SLDWR+ G VTP+KNQ +CG CWAF+AV ++EG    ++G L+ LSEQ L+DCS
Sbjct: 111 LGDIPKSLDWREHGYVTPVKNQGQCGSCWAFSAVGSLEGQIFKKTGKLVSLSEQNLVDCS 170

Query: 193 -TNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPS 251
            + GN GC GG  E AF Y+ +N+G+ T + Y Y+A  G C    K +AA ++ + +VP 
Sbjct: 171 WSYGNLGCNGGLMEFAFQYVKENRGLDTGESYAYEAQDGLCRYNPKYSAANVTGFVKVPL 230

Query: 252 GDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIF--NGVCGTQLDHAVTIVGFGTTEDGA 309
            ++  +    S+ PVS+ I ++   F+ Y  G++       T++DHAV +VG+G   DG 
Sbjct: 231 SEDDLMSAVASVGPVSVGIDSHHQSFRFYSGGMYYEPDCSSTEMDHAVLVVGYGEESDGG 290

Query: 310 NYWLIKNSWGNTWGDAGYMKIVRDE-GLCGIGTRSSYP 346
            YWL+KNSWG  WG  GY+K+ +D+   CGI T + YP
Sbjct: 291 KYWLVKNSWGEDWGMDGYIKMAKDQNNNCGIATYAIYP 328


>gi|33348836|gb|AAQ16118.1| cathepsin L-like cysteine proteinase B [Rhipicephalus
           haemaphysaloides haemaphysaloides]
          Length = 335

 Score =  253 bits (646), Expect = 8e-65,   Method: Compositional matrix adjust.
 Identities = 140/334 (41%), Positives = 200/334 (59%), Gaps = 15/334 (4%)

Query: 25  LVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKE 84
           +V C   V ++  TH++ V      + A HG+ Y  + E+  RLKI+ EN   I + N++
Sbjct: 5   IVLCCLFVTAAAITHQELVGAEWSAFKALHGKDYASDTEEYYRLKIYMENRLKIARHNEK 64

Query: 85  GNRT---YKLGTNQFSDLTNDEFRALYTGYKM---PSPSHRSTTSSTFKYQNLSMTDVPT 138
             ++   YKL  N+F DL + EF +   G+K     SP   S       +++L +   P 
Sbjct: 65  YAKSQVSYKLAMNEFGDLLHHEFVSTRNGFKRNYRDSPREGSFFVEPEGFEDLQL---PK 121

Query: 139 SLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GNN 197
           ++DWR KGAVTP+KNQ +CG CWAF+   ++EG    ++  L+ LSEQ L+DCS + GNN
Sbjct: 122 TVDWRKKGAVTPVKNQGQCGSCWAFSTTGSLEGPHFRKTRKLVSLSEQNLVDCSRSFGNN 181

Query: 198 GCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQAL 257
           GC GG  + AF YI  N+GI TE  YPY A  G C   +    A  + + ++P GDE  L
Sbjct: 182 GCEGGLMDNAFKYIKSNKGIDTEWSYPYNATDGVCHFNRSDVGATDTGFVDIPEGDENKL 241

Query: 258 LKAV-SMQPVSIAIAAYSTEFQSYKEGIFNGV-CGT-QLDHAVTIVGFGTTEDGANYWLI 314
            KAV ++ PVS+AI A    FQ Y EG+++   C + QLDH V +VG+G T+DG +YWL+
Sbjct: 242 KKAVAAVGPVSVAIDASHESFQFYSEGVYDEPECSSEQLDHGVLVVGYG-TKDGQDYWLV 300

Query: 315 KNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYPL 347
           KNSWG TWGD GY+ + R+ +  CGI + +SYPL
Sbjct: 301 KNSWGTTWGDEGYIYMTRNKDNQCGIASSASYPL 334


>gi|189525870|ref|XP_001923796.1| PREDICTED: cathepsin L1 [Danio rerio]
          Length = 335

 Score =  253 bits (646), Expect = 9e-65,   Method: Compositional matrix adjust.
 Identities = 136/339 (40%), Positives = 203/339 (59%), Gaps = 16/339 (4%)

Query: 20  IIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIE 79
           ++  LLV+     V +  + +  + +    W +QHG+SY +++E   R+ I++ENL  IE
Sbjct: 1   MMFALLVTLYISAVFAAPSIDIQLDDHWNSWKSQHGKSYHEDVEVGRRM-IWEENLRKIE 59

Query: 80  KANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDV 136
           + N E   GN T+K+G NQF D+TN+EFR    GYK       + TS    +        
Sbjct: 60  QHNFEYSLGNHTFKMGMNQFGDMTNEEFRQAMNGYKHDP----NRTSQGPLFMEPKFFAA 115

Query: 137 PTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS-TNG 195
           P  +DWR +G VTP+K+QK+CG CW+F++  A+EG    ++G LI +SEQ L+DCS  +G
Sbjct: 116 PQQVDWRQRGYVTPVKDQKQCGSCWSFSSTGALEGQLFRKTGKLISMSEQNLVDCSRPHG 175

Query: 196 NNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPG-TCSAAQKPAAAKISNYEEVPSGDE 254
           N GC GG  ++AF Y+ +N+G+ +E  YPY A     C    +   AKI+ + ++P G+E
Sbjct: 176 NQGCNGGLMDQAFQYVKENKGLDSEQSYPYLARDDLPCRYDPRFNVAKITGFVDIPKGNE 235

Query: 255 QALLKAV-SMQPVSIAIAAYSTEFQSYKEGI-FNGVCGTQLDHAVTIVGF---GTTEDGA 309
            AL+ AV ++ PVS+AI A     Q Y+ GI +   C +QLDHAV +VG+   G    G 
Sbjct: 236 LALMNAVAAVGPVSVAIDASHQSLQFYQSGIYYERACTSQLDHAVLVVGYGYQGADVAGN 295

Query: 310 NYWLIKNSWGNTWGDAGYMKIVRDE-GLCGIGTRSSYPL 347
            YW++KNSW + WGD GY+ + +D+   CGI T +SYPL
Sbjct: 296 RYWIVKNSWSDKWGDKGYIYMAKDKNNHCGIATMASYPL 334


>gi|260516678|gb|ACX43965.1| cysteine protease 1 [Brachiaria hybrid cultivar]
          Length = 338

 Score =  253 bits (646), Expect = 9e-65,   Method: Compositional matrix adjust.
 Identities = 136/320 (42%), Positives = 192/320 (60%), Gaps = 11/320 (3%)

Query: 30  SQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTY 89
           S + S     E  + ++   +M Q+ ++Y    E   R   FK N+E I   N   N +Y
Sbjct: 25  SALFSEEVPSEVMLQDMFTAFMKQYSKAYS-HAEFSSRFNQFKANVETIRLHNTLANASY 83

Query: 90  KLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVT 149
            +G N+F+DL+ +EF+  Y GYK      R    S   +Q +     PTS+DWR   AVT
Sbjct: 84  TMGLNEFADLSFEEFKGKYFGYKHV---EREFARSNNLHQEVEA--APTSIDWRTSNAVT 138

Query: 150 PIKNQKECGCCWAFAAVAAVEGITKIRSGN-LIQLSEQQLLDCSTN-GNNGCLGGSREKA 207
           PIK+Q +CG CWAF+A  ++EG   ++  + L  LSEQQL+DCST+ G+ GC GG  + A
Sbjct: 139 PIKDQGQCGSCWAFSATGSIEGAWVLQGKHTLTSLSEQQLVDCSTSYGDAGCNGGLMDYA 198

Query: 208 FAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAV-SMQPV 266
           F YII N+GI  E  YPY+ V G C  +       IS Y++V SGDE +LL AV ++ PV
Sbjct: 199 FEYIIANKGICAESAYPYKGVGGLCQKSCTKVVT-ISGYKDVASGDEASLLNAVGTVGPV 257

Query: 267 SIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAG 326
           S+AI A    FQ Y  G+F+G CG  LDH V  VG+GTT    +YW++KNSWG +WG++G
Sbjct: 258 SVAIEADQAGFQFYSSGVFSGTCGHNLDHGVLAVGYGTT-GSQDYWIVKNSWGTSWGESG 316

Query: 327 YMKIVRDEGLCGIGTRSSYP 346
           Y++++R++  CGI  + SYP
Sbjct: 317 YIRMIRNKNQCGIAIQPSYP 336


>gi|226509942|ref|NP_001146834.1| cysteine protease precursor [Zea mays]
 gi|159506725|gb|ABW97700.1| cysteine protease [Zea mays]
 gi|414867308|tpg|DAA45865.1| TPA: cysteine protease [Zea mays]
          Length = 352

 Score =  253 bits (646), Expect = 9e-65,   Method: Compositional matrix adjust.
 Identities = 137/310 (44%), Positives = 187/310 (60%), Gaps = 22/310 (7%)

Query: 50  WMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYT 109
           W A + RSY    E++ R ++++ N+E+IE  N+ GN TY LG NQF+DLT +EF  LYT
Sbjct: 52  WQATYNRSYPTAEERQRRFQVYRRNIEHIEATNRAGNLTYTLGENQFADLTEEEFLDLYT 111

Query: 110 GYKMP----SPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQ-KECGCCWAFA 164
              MP    +   R+  SS+      +  D PTS+DWR KGAVTPIKNQ   C  CWAF 
Sbjct: 112 MKGMPVRRDAGKKRANVSSS-----AAAVDAPTSVDWRSKGAVTPIKNQGPSCSSCWAFV 166

Query: 165 AVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYP 224
             A +E ITKI +G L+ LSEQ+L+DC    + GC  G     + ++IQN G+ TE  YP
Sbjct: 167 TAATIESITKITTGKLVSLSEQELIDCDPY-DGGCNLGYFVNGYRWVIQNGGLTTEANYP 225

Query: 225 YQAVPGTCS---AAQKPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYK 281
           YQA    CS   AAQ   AA IS+Y ++P+G+ Q  L+    Q    A        Q Y 
Sbjct: 226 YQARRYACSRSRAAQH--AATISDYVQLPAGEGQ--LQQAVAQQPVAAAIEMGGSLQFYS 281

Query: 282 EGIFNGVCGTQLDHAVTIVGFGT-TEDGANYWLIKNSWGNTWGDAGYMKIVRD---EGLC 337
            G+F+G CGT+++HA+T+VG+G  +  G  YWL+KNSWG +WG+ GY+++ RD    GLC
Sbjct: 282 GGVFSGQCGTRMNHAITVVGYGADSSSGLKYWLVKNSWGQSWGERGYLRMRRDVGRGGLC 341

Query: 338 GIGTRSSYPL 347
           GI    +YP+
Sbjct: 342 GIALDLAYPV 351


>gi|237844793|ref|XP_002371694.1| cathepsin L-like thiolproteinase, putative [Toxoplasma gondii ME49]
 gi|50313163|gb|AAT74529.1| toxopain-2 [Toxoplasma gondii]
 gi|89242977|gb|ABD64744.1| cathepsin L [Toxoplasma gondii]
 gi|95007485|emb|CAJ20707.1| toxopain-2 [Toxoplasma gondii RH]
 gi|211969358|gb|EEB04554.1| cathepsin L-like thiolproteinase, putative [Toxoplasma gondii ME49]
 gi|221480879|gb|EEE19300.1| cysteine protease, putative [Toxoplasma gondii GT1]
 gi|221501596|gb|EEE27366.1| cysteine protease, putative [Toxoplasma gondii VEG]
          Length = 422

 Score =  253 bits (646), Expect = 9e-65,   Method: Compositional matrix adjust.
 Identities = 126/301 (41%), Positives = 176/301 (58%), Gaps = 6/301 (1%)

Query: 52  AQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGY 111
           A + +SY  E EK+ R  IFK NL YI   N++G  +Y L  N F DL+ DEFR  Y G+
Sbjct: 122 AMYAKSYATEEEKQRRYAIFKNNLVYIHTHNQQGY-SYSLKMNHFGDLSRDEFRRKYLGF 180

Query: 112 KMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEG 171
           K              +  N+  +++P  +DWR +G VTP+K+Q++CG CWAF+   A+EG
Sbjct: 181 KKSRNLKSHHLGVATELLNVLPSELPAGVDWRSRGCVTPVKDQRDCGSCWAFSTTGALEG 240

Query: 172 ITKIRSGNLIQLSEQQLLDCS-TNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPG 230
               ++G L+ LSEQ+L+DCS   GN  C GG    AF Y++ + GI +ED YPY A   
Sbjct: 241 AHCAKTGKLVSLSEQELMDCSRAEGNQSCSGGEMNDAFQYVLDSGGICSEDAYPYLARDE 300

Query: 231 TCSAAQKPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCG 290
            C A       KI  +++VP   E A+  A++  PVSIAI A    FQ Y EG+F+  CG
Sbjct: 301 ECRAQSCEKVVKILGFKDVPRRSEAAMKAALAKSPVSIAIEADQMPFQFYHEGVFDASCG 360

Query: 291 TQLDHAVTIVGFGT-TEDGANYWLIKNSWGNTWGDAGYMKIVR---DEGLCGIGTRSSYP 346
           T LDH V +VG+GT  E   ++W++KNSWG  WG  GYM +     +EG CG+   +S+P
Sbjct: 361 TDLDHGVLLVGYGTDKESKKDFWIMKNSWGTGWGRDGYMYMAMHKGEEGQCGLLLDASFP 420

Query: 347 L 347
           +
Sbjct: 421 V 421


>gi|255544115|ref|XP_002513120.1| cysteine protease, putative [Ricinus communis]
 gi|223548131|gb|EEF49623.1| cysteine protease, putative [Ricinus communis]
          Length = 362

 Score =  253 bits (646), Expect = 1e-64,   Method: Compositional matrix adjust.
 Identities = 134/254 (52%), Positives = 176/254 (69%), Gaps = 12/254 (4%)

Query: 35  SRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTN 94
           +R+  E S+ E HE+WMA + R YKD  EK+MR KIFKEN++ I+  N E +++YKL  N
Sbjct: 27  ARTLQEASMYERHEQWMASYARVYKDANEKQMRYKIFKENVQRIDSFNSESDKSYKLAVN 86

Query: 95  QFSDLTNDEFRALYTGYKMPSPSHR-STTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKN 153
           QF+DLTN+EF++L  G+K     H  S  +  F+Y+N+  T VP S+DWR KGAVT IK 
Sbjct: 87  QFADLTNEEFKSLRNGFK----GHMCSAQAGHFRYENV--TAVPASIDWRKKGAVTQIKE 140

Query: 154 QKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-NNGCLGGSREKAFAYII 212
           Q +CG CWAF+AVAAVEGIT+I++G LI LSEQ+L+DC TN  + GC GG  + AF +I 
Sbjct: 141 QGQCGSCWAFSAVAAVEGITEIKTGKLISLSEQELVDCDTNSEDQGCQGGLMDDAFKFIE 200

Query: 213 QNQGIATEDEYPYQAVPGTCSAAQ--KPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAI 270
           Q+ G+A+E  YPY A   TC   +  KP +AKI+ YE+VP+ DE AL  AV+ QPVS+AI
Sbjct: 201 QH-GLASEATYPYDAADSTCKTKEEAKP-SAKITGYEDVPANDEAALKNAVANQPVSVAI 258

Query: 271 AAYSTEFQSYKEGI 284
            A   EFQ Y  GI
Sbjct: 259 DAGGFEFQFYSSGI 272


>gi|125552927|gb|EAY98636.1| hypothetical protein OsI_20560 [Oryza sativa Indica Group]
          Length = 449

 Score =  253 bits (646), Expect = 1e-64,   Method: Compositional matrix adjust.
 Identities = 130/303 (42%), Positives = 187/303 (61%), Gaps = 8/303 (2%)

Query: 48  EKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRAL 107
           E W A+HGRSY    E+  RL  F +N  ++  A+     +Y L  N F+DLT+DEFRA 
Sbjct: 39  EAWCAEHGRSYATPGERAARLAAFADNAAFV-AAHNGAPASYALALNAFADLTHDEFRAA 97

Query: 108 YTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVA 167
             G    +   R   +         +  VP ++DWR  GAVT +K+Q  CG CW+F+A  
Sbjct: 98  RLGRLAAAGPGRDGGAPYLGVDG-GVGAVPDAVDWRQSGAVTKVKDQGSCGACWSFSATG 156

Query: 168 AVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQA 227
           A+EGI KI++G+LI LSEQ+L+DC  + N+GC GG  + A+ ++++N GI TE +YPY+ 
Sbjct: 157 AMEGINKIKTGSLISLSEQELIDCDRSYNSGCGGGLMDYAYKFVVKNGGIDTEADYPYRE 216

Query: 228 VPGTCSAAQ-KPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFN 286
             GTC+  + K     I  Y++VP+ +E  LL+AV+ QPVS+ I   +  FQ Y +GIF+
Sbjct: 217 TDGTCNKNKLKRRVVTIDGYKDVPANNEDMLLQAVAQQPVSVGICGSARAFQLYSKGIFD 276

Query: 287 GVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTR 342
           G C T LDHA+ IVG+G +E G +YW++KNSWG +WG  GYM + R+     G+CGI   
Sbjct: 277 GPCPTSLDHAILIVGYG-SEGGKDYWIVKNSWGESWGMKGYMYMHRNTGNSNGVCGINQM 335

Query: 343 SSY 345
            S+
Sbjct: 336 PSF 338


>gi|225709022|gb|ACO10357.1| Cathepsin L precursor [Caligus rogercresseyi]
          Length = 332

 Score =  253 bits (645), Expect = 1e-64,   Method: Compositional matrix adjust.
 Identities = 135/310 (43%), Positives = 188/310 (60%), Gaps = 14/310 (4%)

Query: 48  EKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKE---GNRTYKLGTNQFSDLTNDEF 104
           E W   HG+SY+  +E+++RLKI  EN   I + N E   G  +Y +  N + DL + EF
Sbjct: 28  ESWKLTHGKSYESSIEEKLRLKIHMENSLKISRHNAEAINGKHSYYMKMNHYGDLLHHEF 87

Query: 105 RALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFA 164
            A+  GY+  + +  S   S    +N+ +   PT +DWR+ GAVTP+KNQ +CG CWAF+
Sbjct: 88  VAMVNGYEYVNKT--SLGGSFIPSKNVKL---PTHVDWREDGAVTPVKNQGQCGSCWAFS 142

Query: 165 AVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GNNGCLGGSREKAFAYIIQNQGIATEDEY 223
           +  ++EG T  ++G LI LSEQ L+DCS   GNNGC GG  + AF YI  N+GI TE  Y
Sbjct: 143 STGSLEGQTFRKTGKLIPLSEQNLVDCSRKYGNNGCEGGLMDFAFTYIRDNKGIDTEGSY 202

Query: 224 PYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAV-SMQPVSIAIAAYSTEFQSYKE 282
           PY+ V G C        +    + +V  G E+ LLKAV S+ PVS+AI A    FQ Y  
Sbjct: 203 PYEGVGGRCHYDPSKKGSSDIGFVDVKKGSEEELLKAVASVGPVSVAIDASHMSFQFYSH 262

Query: 283 GI-FNGVCGTQ-LDHAVTIVGFGTTED-GANYWLIKNSWGNTWGDAGYMKIVRD-EGLCG 338
           G+ F   C  + LDH V +VG+GT E+ G +YWL+KNSW   WGD GY+K+ R+ + +CG
Sbjct: 263 GVYFESKCSPENLDHGVLVVGYGTDENSGEDYWLVKNSWSENWGDQGYIKMARNKKNMCG 322

Query: 339 IGTRSSYPLA 348
           I + +SYP+ 
Sbjct: 323 IASSASYPVV 332


>gi|27728675|gb|AAO18731.1| cysteine protease [Gossypium hirsutum]
          Length = 389

 Score =  253 bits (645), Expect = 1e-64,   Method: Compositional matrix adjust.
 Identities = 131/318 (41%), Positives = 195/318 (61%), Gaps = 15/318 (4%)

Query: 40  EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYI--EKANKEGNR-TYKLGTNQF 96
           E+ V+EI ++W  +H + Y+   E E R + FK NL+YI    A ++ N+  + +G N+F
Sbjct: 42  EERVLEIFQQWKEKHRKVYRHAEEAEKRFENFKGNLKYILERNAKRKANKWEHHVGLNKF 101

Query: 97  SDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKE 156
           +D++N+EFR  Y   K+  P ++  T S    + +   D P+SLDWR+ G VT +K+Q  
Sbjct: 102 ADMSNEEFRKAYLS-KVKKPINKGITLSRNMRRKVQSCDAPSSLDWRNYGVVTAVKDQGS 160

Query: 157 CGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQG 216
           CG CWAF++  A+EGI  + +G+LI LSEQ+L++C T+ N GC GG  + AF ++I N G
Sbjct: 161 CGSCWAFSSTGAMEGINALVTGDLISLSEQELVECDTS-NYGCEGGYMDYAFEWVINNGG 219

Query: 217 IATEDEYPYQAVPGTC-SAAQKPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYST 275
           I +E +YPY  V GTC +  ++     I  Y++V   D  ALL AV+ QPVS+ I   + 
Sbjct: 220 IDSESDYPYTGVDGTCNTTKEETKVVSIDGYQDVEQSD-SALLCAVAQQPVSVGIDGSAI 278

Query: 276 EFQSYKEGIFNGVCG---TQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR 332
           +FQ Y  GI++G C      +DHAV IVG+G +ED   YW++KNSWG +WG  GY  + R
Sbjct: 279 DFQLYTGGIYDGSCSDDPDDIDHAVLIVGYG-SEDSEEYWIVKNSWGTSWGIDGYFYLKR 337

Query: 333 DE----GLCGIGTRSSYP 346
           D     G+C +   +SYP
Sbjct: 338 DTDLPYGVCAVNAMASYP 355


>gi|222636309|gb|EEE66441.1| hypothetical protein OsJ_22818 [Oryza sativa Japonica Group]
          Length = 318

 Score =  253 bits (645), Expect = 1e-64,   Method: Compositional matrix adjust.
 Identities = 136/343 (39%), Positives = 193/343 (56%), Gaps = 46/343 (13%)

Query: 18  MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
           + ++   L + A   ++SR+   ++    H+KWMA+HGR+YKD  EK  R ++FK N++ 
Sbjct: 6   LLVVAGGLSTMAKVTMASRAGTMEAR---HDKWMAEHGRTYKDAAEKARRFRVFKANVDL 62

Query: 78  IEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTD-- 135
           I+++N  GN+ Y+L TN+F+DLT+ EF A+YTGY   +  + +  ++T     LS  D  
Sbjct: 63  IDRSNAAGNKRYRLATNRFTDLTDAEFAAMYTGYNPANTMYAAANATT----RLSSEDDQ 118

Query: 136 VPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG 195
            P  +DWR +GAVT +KNQ+ CGCCWAF+ VAAVEGI +I +G L+ L+           
Sbjct: 119 QPAEVDWRQQGAVTGVKNQRSCGCCWAFSTVAAVEGIHQITTGELVSLTWPTAAASP--- 175

Query: 196 NNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTC----SAAQKPAAAKISNYEEVPS 251
                                      Y YQ   G C    S++    AA IS Y+ V  
Sbjct: 176 -----------------------PRRAYAYQGAQGACQFDASSSASGVAATISGYQRVNP 212

Query: 252 GDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNG-VCGTQLDHAVTIVGFGTTEDGA- 309
            DE +L  AV+ QPVS+AI      F+ Y  G+F    CGT+LDHAV +VG+G   DG+ 
Sbjct: 213 NDEGSLAAAVASQPVSVAIEGSGAMFRHYGSGVFTADSCGTKLDHAVAVVGYGAEADGSG 272

Query: 310 --NYWLIKNSWGNTWGDAGYMKIVRD---EGLCGIGTRSSYPL 347
              YW+IKNSWG TWGD GYMK+ +D   +G CG+    SYP+
Sbjct: 273 GGGYWIIKNSWGTTWGDGGYMKLEKDVGSQGACGVAMAPSYPV 315


>gi|164472556|gb|ABY58967.1| cathepsin L [Toxoplasma gondii]
          Length = 421

 Score =  253 bits (645), Expect = 1e-64,   Method: Compositional matrix adjust.
 Identities = 126/301 (41%), Positives = 176/301 (58%), Gaps = 6/301 (1%)

Query: 52  AQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGY 111
           A + +SY  E EK+ R  IFK NL YI   N++G  +Y L  N F DL+ DEFR  Y G+
Sbjct: 121 AMYAKSYATEEEKQRRYAIFKNNLVYIHTHNQQGY-SYSLKMNHFGDLSRDEFRRKYLGF 179

Query: 112 KMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEG 171
           K              +  N+  +++P  +DWR +G VTP+K+Q++CG CWAF+   A+EG
Sbjct: 180 KKSRNLKSHHLGVATELLNVLPSELPAGVDWRSRGCVTPVKDQRDCGSCWAFSTTGALEG 239

Query: 172 ITKIRSGNLIQLSEQQLLDCS-TNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPG 230
               ++G L+ LSEQ+L+DCS   GN  C GG    AF Y++ + GI +ED YPY A   
Sbjct: 240 AHCAKTGKLVSLSEQELMDCSRAEGNQSCSGGEMNDAFQYVLDSGGICSEDAYPYLARDE 299

Query: 231 TCSAAQKPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCG 290
            C A       KI  +++VP   E A+  A++  PVSIAI A    FQ Y EG+F+  CG
Sbjct: 300 ECRAQSCEKVVKILGFKDVPRRSEAAMKAALAKSPVSIAIEADQMPFQFYHEGVFDASCG 359

Query: 291 TQLDHAVTIVGFGT-TEDGANYWLIKNSWGNTWGDAGYMKIVR---DEGLCGIGTRSSYP 346
           T LDH V +VG+GT  E   ++W++KNSWG  WG  GYM +     +EG CG+   +S+P
Sbjct: 360 TDLDHGVLLVGYGTDKESKKDFWIMKNSWGTGWGRDGYMYMAMHKGEEGQCGLLLDASFP 419

Query: 347 L 347
           +
Sbjct: 420 V 420


>gi|224116880|ref|XP_002317417.1| predicted protein [Populus trichocarpa]
 gi|118488173|gb|ABK95906.1| unknown [Populus trichocarpa]
 gi|222860482|gb|EEE98029.1| predicted protein [Populus trichocarpa]
          Length = 498

 Score =  252 bits (644), Expect = 2e-64,   Method: Compositional matrix adjust.
 Identities = 131/317 (41%), Positives = 190/317 (59%), Gaps = 16/317 (5%)

Query: 40  EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKAN--KEGNRTYKLGTNQFS 97
           E+ + E+ + W  +H + YK   E E R+  FK NL+YI + N  ++    +K+G N+F+
Sbjct: 43  EEGITEVFKLWKEKHQKVYKHAEEAERRIGNFKRNLKYIIEKNGKRKSGLEHKVGLNKFA 102

Query: 98  DLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKEC 157
           DL+N+EFR +Y   K+  P    T     K+++L   D P+SLDWR+KG VT +K+Q +C
Sbjct: 103 DLSNEEFREMYLS-KVKKPI---TIEEKRKHRHLQTCDAPSSLDWRNKGVVTAVKDQGDC 158

Query: 158 GCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGI 217
           G CW+F+   A+E I  I +G+LI LSEQ+L+DC T  N GC GG  + AF ++I N GI
Sbjct: 159 GSCWSFSTTGAIEAINAIVTGDLISLSEQELVDCDTTNNYGCEGGDMDSAFQWVIGNGGI 218

Query: 218 ATEDEYPYQAVPGTC-SAAQKPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTE 276
            TE +YPY  V GTC +A ++     I  Y +V   D  ALL A   QP+S+ +   + +
Sbjct: 219 DTEADYPYTGVDGTCNTAKEEKKVVSIEGYVDVDPSD-SALLCATVQQPISVGMDGSALD 277

Query: 277 FQSYKEGIFNGVCG---TQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD 333
           FQ Y  GI++G C      +DHA+ IVG+G+  D  +YW++KNSWG  WG  GY  I R+
Sbjct: 278 FQLYTGGIYDGDCSGDPNDIDHAILIVGYGSEND-EDYWIVKNSWGTEWGMEGYFYIRRN 336

Query: 334 E----GLCGIGTRSSYP 346
                G+C I   +SYP
Sbjct: 337 TSKPYGVCAINADASYP 353


>gi|8886940|gb|AAF80626.1|AC069251_19 F2D10.37 [Arabidopsis thaliana]
          Length = 315

 Score =  252 bits (643), Expect = 2e-64,   Method: Compositional matrix adjust.
 Identities = 122/276 (44%), Positives = 188/276 (68%), Gaps = 7/276 (2%)

Query: 38  THEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFS 97
           +H++ ++E+ E W++   ++Y+   EK +R ++FK+NL++I++ NK+G ++Y LG N+F+
Sbjct: 43  SHDK-LIELFENWISNFEKAYETVEEKFLRFEVFKDNLKHIDETNKKG-KSYWLGLNEFA 100

Query: 98  DLTNDEFRALYTGYKMPSPSHRSTTS-STFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKE 156
           DL+++EF+ +Y G K          S + F Y+++    VP S+DWR KGAV  +KNQ  
Sbjct: 101 DLSHEEFKKMYLGLKTDIVRRDEERSYAEFAYRDVEA--VPKSVDWRKKGAVAEVKNQGS 158

Query: 157 CGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQG 216
           CG CWAF+ VAAVEGI KI +GNL  LSEQ+L+DC T  NNGC GG  + AF YI++N G
Sbjct: 159 CGSCWAFSTVAAVEGINKIVTGNLTTLSEQELIDCDTTYNNGCNGGLMDYAFEYIVKNGG 218

Query: 217 IATEDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYST 275
           +  E++YPY    GTC   +  +    I+ +++VP+ DE++LLKA++ QP+S+AI A   
Sbjct: 219 LRKEEDYPYSMEEGTCEMQKDESETVTINGHQDVPTNDEKSLLKALAHQPLSVAIDASGR 278

Query: 276 EFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANY 311
           EFQ Y  G+F+G CG  LDH V  VG+G+++ G++Y
Sbjct: 279 EFQFYSGGVFDGRCGVDLDHGVAAVGYGSSK-GSDY 313


>gi|330434686|gb|AEC22811.1| cathepsin L [Macrobrachium nipponense]
          Length = 342

 Score =  252 bits (643), Expect = 2e-64,   Method: Compositional matrix adjust.
 Identities = 135/317 (42%), Positives = 192/317 (60%), Gaps = 12/317 (3%)

Query: 43  VVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANK---EGNRTYKLGTNQFSDL 99
           V+E  E +  +H + Y+ + E+  R+KIF EN + I   NK    G++TYKLG N++ D+
Sbjct: 25  VMEEWESFKFEHSKKYESDTEETFRMKIFAENKQKIAAHNKLYHTGSKTYKLGMNKYGDM 84

Query: 100 TNDEFRALYTGYKMPSPSHRSTTSSTFKYQNL--SMTDV--PTSLDWRDKGAVTPIKNQK 155
            + EF  +  G++  +       +  F+  +      DV  P S+DWR+KGAVT +K+Q 
Sbjct: 85  LHHEFVNMMNGFRANTSGAGYKANRGFQGAHFVEPPEDVVMPKSVDWREKGAVTEVKDQG 144

Query: 156 ECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GNNGCLGGSREKAFAYIIQN 214
            CG CWAF+A  A+EG    ++G+L+ LSEQ L+DCS+  GNNGC GG  + AF YI  N
Sbjct: 145 SCGSCWAFSATGALEGQHYRQTGDLVSLSEQNLVDCSSKFGNNGCNGGLMDNAFQYIKVN 204

Query: 215 QGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAV-SMQPVSIAIAAY 273
            GI TE  YPY+A    C      A A    + +V  G+E AL KA+ ++ PVS+AI A 
Sbjct: 205 GGIDTEKSYPYEAEDEPCRYNPANAGADDRGFVDVREGNENALKKAIATIGPVSVAIDAS 264

Query: 274 STEFQSYKEGIFN--GVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIV 331
              FQ Y+ G+++        LDH V  VG+GTTEDG +YWL+KNSW  +WGD GY+KI 
Sbjct: 265 QDSFQFYQHGVYSDPDCSAENLDHGVLAVGYGTTEDGQDYWLVKNSWSKSWGDQGYIKIA 324

Query: 332 RDE-GLCGIGTRSSYPL 347
           R++  +CGI + +SYPL
Sbjct: 325 RNQNNMCGIASAASYPL 341


>gi|346466067|gb|AEO32878.1| hypothetical protein [Amblyomma maculatum]
          Length = 358

 Score =  252 bits (643), Expect = 2e-64,   Method: Compositional matrix adjust.
 Identities = 137/338 (40%), Positives = 198/338 (58%), Gaps = 14/338 (4%)

Query: 19  FIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYI 78
           F+++  L   A+ +     TH++ V      + A HG+ Y  E E+  RLKI+ EN   I
Sbjct: 27  FVVLGCLFVTAAAI-----THQELVGAEWSAFKALHGKEYHSETEEYYRLKIYMENRLKI 81

Query: 79  EKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTD 135
            + N++      +YKL  N+F DL + EF +   G+K    S     S   + + +    
Sbjct: 82  ARHNEKYANNKASYKLAMNEFGDLLHHEFVSTRNGFKRNYRSTPREGSFYIEPEGIEDKH 141

Query: 136 VPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN- 194
           +P ++DWR KGAVTP+KNQ +CG CWAF+   ++EG    ++G ++ LSEQ L+DCS   
Sbjct: 142 LPKTVDWRKKGAVTPVKNQGQCGSCWAFSTTGSLEGQHFRKTGRMVSLSEQNLVDCSGKF 201

Query: 195 GNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDE 254
           GNNGC GG  + AF YI  N GI TE  YPY    G C   +    A  + + ++P G+E
Sbjct: 202 GNNGCEGGLMDNAFKYIKANGGIDTELSYPYNGTDGICHFEKSDVGATDTGFVDIPEGNE 261

Query: 255 QALLKAV-SMQPVSIAIAAYSTEFQSYKEGIFNGV-CGTQ-LDHAVTIVGFGTTEDGANY 311
           Q L KAV ++ PVS+AI A    FQ Y +G+++   C ++ LDH V +VG+G T+DG +Y
Sbjct: 262 QLLKKAVATVGPVSVAIDASHESFQFYSQGVYDEPECSSESLDHGVLVVGYG-TKDGQDY 320

Query: 312 WLIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYPLA 348
           WL+KNSWG TWGD GY+ + R+ E  CGI + +SYPL 
Sbjct: 321 WLVKNSWGTTWGDDGYIYMTRNKENQCGIASSASYPLV 358


>gi|21483184|gb|AAF86584.1| cathepsin L cysteine protease [Haemonchus contortus]
          Length = 355

 Score =  252 bits (643), Expect = 2e-64,   Method: Compositional matrix adjust.
 Identities = 136/327 (41%), Positives = 198/327 (60%), Gaps = 13/327 (3%)

Query: 33  VSSRSTHEQSVVEIHEKW---MAQHGRSYKDELEKEMRLKIFKENLEYIEKANKE---GN 86
           V  + +  Q + E   KW       G+SY+ E E +  ++ F +N+ +IE+ NKE   G 
Sbjct: 31  VHRQKSLRQKIDEAFNKWDDYKETFGKSYEPEEENDY-MEAFVKNVIHIEEHNKEHRLGR 89

Query: 87  RTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKG 146
           +T+++G N+ +DL   ++R L  GY+M      S  S+  K+       +P S+DWR++G
Sbjct: 90  KTFEMGLNEIADLPFSQYRKL-NGYRMRRQFGDSMQSNGTKFLVPFNVQIPESVDWREEG 148

Query: 147 AVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GNNGCLGGSRE 205
            VTP+KNQ  CG CWAF++  A+EG     +G L+ LSEQ L+DCST  GN+GC GG  +
Sbjct: 149 LVTPVKNQGMCGSCWAFSSTGALEGQHARATGKLVSLSEQNLVDCSTKYGNHGCNGGLMD 208

Query: 206 KAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVSMQ- 264
            AF YI +N G+ TED YPY      C   +    A    + ++P GDE+AL KAV+ Q 
Sbjct: 209 LAFEYIKENHGVDTEDSYPYVGRETKCHFKRNTVGADDKGFVDLPEGDEEALKKAVATQG 268

Query: 265 PVSIAIAAYSTEFQSYKEGI-FNGVCGT-QLDHAVTIVGFGTTEDGANYWLIKNSWGNTW 322
           P+SIAI A    FQ YK+G+ F+  C + +LDH V +VG+GT  +  +YWL+KNSWG TW
Sbjct: 269 PISIAIDAGHRSFQLYKKGVYFDEECSSEELDHGVLLVGYGTDPEAGDYWLVKNSWGPTW 328

Query: 323 GDAGYMKIVRDE-GLCGIGTRSSYPLA 348
           G+ GY++I R+    CG+ T++SYPL 
Sbjct: 329 GEKGYIRIARNRNNHCGVATKASYPLV 355


>gi|34559455|gb|AAQ75437.1| cathepsin L-like protease [Helicoverpa armigera]
 gi|338855117|gb|AEJ31938.1| cathepsin L-like protease [Helicoverpa assulta]
          Length = 341

 Score =  252 bits (643), Expect = 2e-64,   Method: Compositional matrix adjust.
 Identities = 134/320 (41%), Positives = 190/320 (59%), Gaps = 20/320 (6%)

Query: 46  IHEKWMA---QHGRSYKDELEKEMRLKIFKENLEYIEKANK---EGNRTYKLGTNQFSDL 99
           + E+W A   +H + Y  E+E + R+KI+ EN   I K N+   +G  +YKL  N+++D+
Sbjct: 23  VREEWSAFKLEHSKRYDSEVEDKFRMKIYLENKHRIAKHNQRFEQGAVSYKLRPNKYADM 82

Query: 100 TNDEFRALYTGY----KMPSPSH---RSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIK 152
            + EF  +  G+    K P   H   R +  +TF     +    P  +DWR KGAVT +K
Sbjct: 83  LSHEFVHVMNGFNKTLKHPKAVHGKGRESRPATFIAP--AHVTYPDHVDWRKKGAVTEVK 140

Query: 153 NQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GNNGCLGGSREKAFAYI 211
           +Q +CG CWAF+   A+EG    ++G L+ LSEQ L+DCS   GNNGC GG  + AF YI
Sbjct: 141 DQGKCGSCWAFSTTGALEGQHFRKTGYLVSLSEQNLIDCSAAYGNNGCNGGLMDNAFKYI 200

Query: 212 IQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAV-SMQPVSIAI 270
             N GI TE  YPY+ V   C    K + A    + ++P GDE+ L++AV ++ PVS+AI
Sbjct: 201 KDNGGIDTEKAYPYEGVDDKCRYNAKNSGADDVGFVDIPQGDEEKLMQAVATVGPVSVAI 260

Query: 271 AAYSTEFQSYKEGIF--NGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYM 328
            A    FQ Y +G++       T LDH V +VG+GT E G +YWL+KNSWG TWGD GY+
Sbjct: 261 DASQESFQFYSDGVYYDENCSSTDLDHGVMVVGYGTDEQGGDYWLVKNSWGRTWGDLGYI 320

Query: 329 KIVRDE-GLCGIGTRSSYPL 347
           K+ R++   CGI + +SYPL
Sbjct: 321 KMARNKNNHCGIASSASYPL 340


>gi|326672297|ref|XP_003199631.1| PREDICTED: cathepsin L1-like [Danio rerio]
          Length = 336

 Score =  252 bits (643), Expect = 2e-64,   Method: Compositional matrix adjust.
 Identities = 136/340 (40%), Positives = 204/340 (60%), Gaps = 17/340 (5%)

Query: 20  IIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIE 79
           ++  LLV+     V + S+ +  + +    W +QHG+SY +++E   R+ I++ENL  IE
Sbjct: 1   MMFALLVTLCISAVFAASSIDIQLDDHWNSWKSQHGKSYHEDVEVGRRM-IWEENLRKIE 59

Query: 80  KANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDV 136
           + N E   GN T+K+G NQF D+TN+EFR    GYK       + TS    +   S    
Sbjct: 60  QHNFEYSYGNHTFKMGMNQFGDMTNEEFRHAMNGYKHDP----NQTSQGPLFMEPSFFAA 115

Query: 137 PTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS-TNG 195
           P  +DWR +G VTP+K+QK+CG CW+F++  A+EG    ++G LI +SEQ L+DCS  +G
Sbjct: 116 PQQVDWRQRGYVTPVKDQKQCGSCWSFSSTGALEGQLFRKTGKLISMSEQNLVDCSRPHG 175

Query: 196 NNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPG-TCSAAQKPAAAKISNYEEVPSGDE 254
           N GC GG  ++AF Y+ +N+G+ +E  YPY A     C    +   AKI+ + ++P G+E
Sbjct: 176 NQGCNGGLMDQAFQYVKENKGLDSEQSYPYLARDDLPCRYDPRFNVAKITGFVDIPKGNE 235

Query: 255 QALLKAV-SMQPVSIAIAAYSTEFQSYKEGIF--NGVCGTQLDHAVTIVGF---GTTEDG 308
            AL+ AV ++ PVS+AI A     Q Y+ GI+       ++LDHAV +VG+   G    G
Sbjct: 236 LALMNAVAAVGPVSVAIDASHQSLQFYQSGIYYERACSSSRLDHAVLVVGYGYQGADVAG 295

Query: 309 ANYWLIKNSWGNTWGDAGYMKIVRDE-GLCGIGTRSSYPL 347
             YW++KNSW + WGD GY+ + +D+   CGI T +SYPL
Sbjct: 296 NRYWIVKNSWSDKWGDKGYIYMAKDKNNHCGIATMASYPL 335


>gi|151573014|gb|ABS17682.1| cathepsin L-1 [Artemia salina]
          Length = 334

 Score =  252 bits (643), Expect = 2e-64,   Method: Compositional matrix adjust.
 Identities = 134/335 (40%), Positives = 204/335 (60%), Gaps = 11/335 (3%)

Query: 21  IITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEK 80
           +I LL +   Q+ ++ S       E H  + A H + Y  +LE++ R+KI+ EN   + K
Sbjct: 2   LIFLLGAVLVQLSAALSLTNLLADEWH-LFKATHKKEYPSQLEEKFRMKIYLENKHKVAK 60

Query: 81  AN---KEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVP 137
            N   ++G ++Y +  N+F DL + EFR++  GY+     + S   STF +   +   VP
Sbjct: 61  HNILYEKGEKSYHVAMNKFGDLLHHEFRSIMNGYQHKK-QNSSRAESTFTFMEPANVTVP 119

Query: 138 TSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GN 196
            S+DWR+KGA+TP+K+Q +CG CWAF++  A+EG T  ++G L+ LSEQ L+DCS   GN
Sbjct: 120 ESVDWREKGAITPVKDQGQCGSCWAFSSTGALEGQTFRKTGKLVSLSEQNLIDCSGKYGN 179

Query: 197 NGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQA 256
            GC GG  ++AF YI  N+GI TE+ YPY+A    C    +   A    + ++PSG+E  
Sbjct: 180 EGCNGGLMDQAFQYIKDNKGIDTENTYPYEAEDDVCRYNPRNRGAVDRGFVDIPSGEEDK 239

Query: 257 LLKAV-SMQPVSIAIAAYSTEFQSYKEGI-FNGVCGT-QLDHAVTIVGFGTTEDGANYWL 313
           L  AV ++ PVS+AI A    FQ Y +G+ +   C +  LDH V +VG+G +++G +YWL
Sbjct: 240 LKAAVATVGPVSVAIDASHESFQFYSKGVYYEPSCDSDDLDHGVLVVGYG-SDNGKDYWL 298

Query: 314 IKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYPL 347
           +KNSW   WGD GY+K+ R+ +  CG+ + +SYPL
Sbjct: 299 VKNSWSEHWGDEGYIKMARNRKNHCGVASAASYPL 333


>gi|115715524|ref|XP_780580.2| PREDICTED: cathepsin L-like [Strongylocentrotus purpuratus]
          Length = 334

 Score =  252 bits (643), Expect = 2e-64,   Method: Compositional matrix adjust.
 Identities = 141/337 (41%), Positives = 203/337 (60%), Gaps = 14/337 (4%)

Query: 19  FIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYI 78
           ++ + L+ +C   VVSS S       E   +W  +HG+ Y  + E+  R  I+++NL+ +
Sbjct: 3   YLSVLLVAAC---VVSSLSMSFIDFDEDWNQWKNEHGKRYLSDEEEASRRLIWQKNLDIV 59

Query: 79  EKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTD 135
            K N +   G+ TY LG NQF+DL N+EF +L  G++    S ++T  STF   + ++ D
Sbjct: 60  IKHNLKYDLGHFTYDLGMNQFADLKNEEFVSLMNGFR--GNSSKATRGSTFLPPS-NVFD 116

Query: 136 VPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS-TN 194
           +PT +DWR KG VTP+KNQ +CG CWAF+A  ++EG    ++G L+ LSEQ L+DCS   
Sbjct: 117 MPTMVDWRTKGYVTPVKNQLQCGSCWAFSATGSLEGQHFKKTGKLVSLSEQNLVDCSGKE 176

Query: 195 GNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDE 254
           GN GC GG  ++AF YI+   GI TE  YPY A+ G C   +    A  + Y +V +G E
Sbjct: 177 GNMGCEGGLMDQAFQYILDVGGIDTEMSYPYTAMDGQCHFNKANIGATDTGYTDVTTGSE 236

Query: 255 QALLKAV-SMQPVSIAIAAYSTEFQSYKEGIFN--GVCGTQLDHAVTIVGFGTTEDGANY 311
            AL  AV S+ P+S+AI A    FQ YK G++N      T LDH V  VG+GT+ DG +Y
Sbjct: 237 SALQMAVASVGPISVAIDASHQSFQLYKSGVYNEPACSSTLLDHGVLAVGYGTSSDGTDY 296

Query: 312 WLIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYPL 347
           +   +SWG  WG  GY+ + R+ +  CGI T++SYPL
Sbjct: 297 FFFFHSWGAAWGMNGYLWMSRNKDNQCGIATKASYPL 333


>gi|404312774|pdb|3TNX|A Chain A, Structure Of The Precursor Of A Thermostable Variant Of
           Papain At 2.6 Angstroem Resolution
 gi|404312775|pdb|3TNX|C Chain C, Structure Of The Precursor Of A Thermostable Variant Of
           Papain At 2.6 Angstroem Resolution
 gi|428698029|pdb|3USV|A Chain A, Structure Of The Precursor Of A Thermostable Variant Of
           Papain At 3.8 A Resolution From A Crystal Soaked At Ph 4
 gi|428698030|pdb|3USV|C Chain C, Structure Of The Precursor Of A Thermostable Variant Of
           Papain At 3.8 A Resolution From A Crystal Soaked At Ph 4
          Length = 363

 Score =  252 bits (643), Expect = 2e-64,   Method: Compositional matrix adjust.
 Identities = 136/315 (43%), Positives = 195/315 (61%), Gaps = 15/315 (4%)

Query: 38  THEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFS 97
           T  + ++++ E WM +H + YK+  EK  R +IFK+NL+YI++ NK+ N +Y LG N F+
Sbjct: 57  TSTERLIQLFESWMLKHNKIYKNIDEKIYRFEIFKDNLKYIDETNKK-NNSYWLGLNVFA 115

Query: 98  DLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKEC 157
           D++NDEF+  YTG    + ++ +T  S  +  N    ++P  +DWR KGAVTP+KNQ  C
Sbjct: 116 DMSNDEFKEKYTG--SIAGNYTTTELSYEEVLNDGDVNIPEYVDWRQKGAVTPVKNQGSC 173

Query: 158 GCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGI 217
           G  WAF+AV+ +E I KIR+GNL + SEQ+LLDC    + GC GG    A   + Q  GI
Sbjct: 174 GSAWAFSAVSTIESIIKIRTGNLNEYSEQELLDCDRR-SYGCNGGYPWSALQLVAQ-YGI 231

Query: 218 ATEDEYPYQAVPGTCSAAQK-PAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTE 276
              + YPY+ V   C + +K P AAK     +V   +E ALL +++ QPVS+ + A   +
Sbjct: 232 HYRNTYPYEGVQRYCRSREKGPYAAKTDGVRQVQPYNEGALLYSIANQPVSVVLEAAGKD 291

Query: 277 FQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR---- 332
           FQ Y+ GIF G CG ++DHAV  VG+     G NY LI+NSWG  WG+ GY++I R    
Sbjct: 292 FQLYRGGIFVGPCGNKVDHAVAAVGY-----GPNYILIRNSWGTGWGENGYIRIKRGTGN 346

Query: 333 DEGLCGIGTRSSYPL 347
             G+CG+ T S YP+
Sbjct: 347 SYGVCGLYTSSFYPV 361


>gi|47169030|pdb|1S4V|A Chain A, The 2.0 A Crystal Structure Of The Kdel-Tailed Cysteine
           Endopeptidase Functioning In Programmed Cell Death Of
           Ricinus Communis Endosperm
 gi|47169031|pdb|1S4V|B Chain B, The 2.0 A Crystal Structure Of The Kdel-Tailed Cysteine
           Endopeptidase Functioning In Programmed Cell Death Of
           Ricinus Communis Endosperm
          Length = 229

 Score =  251 bits (642), Expect = 2e-64,   Method: Compositional matrix adjust.
 Identities = 118/217 (54%), Positives = 152/217 (70%), Gaps = 5/217 (2%)

Query: 136 VPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG 195
           VP S+DWR KGAVT +K+Q +CG CWAF+ + AVEGI +I++  L+ LSEQ+L+DC T+ 
Sbjct: 2   VPASVDWRKKGAVTSVKDQGQCGSCWAFSTIVAVEGINQIKTNKLVSLSEQELVDCDTDQ 61

Query: 196 NNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDE 254
           N GC GG  + AF +I Q  GI TE  YPY+A  GTC  +++ A A  I  +E VP  DE
Sbjct: 62  NQGCNGGLMDYAFEFIKQRGGITTEANYPYEAYDGTCDVSKENAPAVSIDGHENVPENDE 121

Query: 255 QALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLI 314
            ALLKAV+ QPVS+AI A  ++FQ Y EG+F G CGT+LDH V IVG+GTT DG  YW +
Sbjct: 122 NALLKAVANQPVSVAIDAGGSDFQFYSEGVFTGSCGTELDHGVAIVGYGTTIDGTKYWTV 181

Query: 315 KNSWGNTWGDAGYMKIVR----DEGLCGIGTRSSYPL 347
           KNSWG  WG+ GY+++ R     EGLCGI   +SYP+
Sbjct: 182 KNSWGPEWGEKGYIRMERGISDKEGLCGIAMEASYPI 218


>gi|401397136|ref|XP_003879989.1| cathepsin L, related [Neospora caninum Liverpool]
 gi|325114397|emb|CBZ49954.1| cathepsin L, related [Neospora caninum Liverpool]
          Length = 415

 Score =  251 bits (642), Expect = 3e-64,   Method: Compositional matrix adjust.
 Identities = 125/286 (43%), Positives = 175/286 (61%), Gaps = 4/286 (1%)

Query: 52  AQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGY 111
           A +G+SY  E E + R  IFK NL YI   N++G  +Y L  N F DL+ +EFR  Y GY
Sbjct: 124 ATYGKSYATEEETQKRYAIFKNNLAYIHTHNQQG-YSYSLKMNHFGDLSREEFRRKYLGY 182

Query: 112 KMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEG 171
                   +      +   +S +DVP+++DWR+KG VTP+K+Q++CG CWAF+A  A+EG
Sbjct: 183 NKSRNLKSNNLGVATELLKVSPSDVPSAVDWREKGCVTPVKDQRDCGSCWAFSATGALEG 242

Query: 172 ITKIRSGNLIQLSEQQLLDCS-TNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPG 230
               ++G L+ LSEQ+L+DCS   GN GC GG    AF Y++ + G+ +E+ YPY A  G
Sbjct: 243 AHCAKTGELLSLSEQELVDCSLAEGNQGCSGGEMNDAFQYVVDSGGLCSEEGYPYLARDG 302

Query: 231 TCSAAQKPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCG 290
            C  A K     IS +++VP   E A+  A++  PVSIAI A    FQ Y EG+F+  CG
Sbjct: 303 ECKRACKKVVT-ISGFKDVPRKSETAMKAALAHSPVSIAIEADQLPFQFYHEGVFDASCG 361

Query: 291 TQLDHAVTIVGFGT-TEDGANYWLIKNSWGNTWGDAGYMKIVRDEG 335
           T LDH V +VG+GT  E   ++W++KNSWG+ WG  GYM +   +G
Sbjct: 362 TDLDHGVLLVGYGTDKETKKDFWIMKNSWGSGWGRDGYMYMAMHKG 407


>gi|81542|pir||S02728 actinidain (EC 3.4.22.14) precursor (clone pAC.1) - kiwi fruit
           (fragment)
 gi|15957|emb|CAA31435.1| actinidin precursor [Actinidia chinensis]
 gi|166319|gb|AAA32630.1| actinidin precursor [Actinidia deliciosa]
 gi|226542|prf||1601514A actinidin
          Length = 302

 Score =  251 bits (642), Expect = 3e-64,   Method: Compositional matrix adjust.
 Identities = 132/278 (47%), Positives = 174/278 (62%), Gaps = 10/278 (3%)

Query: 75  LEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMT 134
           L +I++ N + NR+YK+G NQF+DLT +EFR+ Y G+   S    + T  + +Y+     
Sbjct: 1   LRFIDEHNADTNRSYKVGLNQFADLTGEEFRSTYLGFTGGS----NKTKVSNRYEPRVSQ 56

Query: 135 DVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDC-ST 193
            +P+ +DWR  GAV  IK+Q ECG CWAF+A+A VEGI KI +G LI LSEQ+L+ C  T
Sbjct: 57  VLPSYVDWRSAGAVVDIKSQGECGGCWAFSAIATVEGINKIVTGVLISLSEQELIGCGGT 116

Query: 194 NGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSA-AQKPAAAKISNYEEVPSG 252
               GC GG     F +II N GI T + YPY A  G C+   Q      I  Y  VP  
Sbjct: 117 QNTRGCNGGYITDGFQFIINNGGINTGENYPYTAQDGECNLDLQNEKYVTIDTYGNVPYN 176

Query: 253 DEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYW 312
           +E AL  AV+ QPVS+A+ A    F+ Y  GIF G CGT +DHAVTIVG+G TE G +YW
Sbjct: 177 NEWALQTAVTYQPVSVALDAAGDAFKHYSSGIFTGPCGTAIDHAVTIVGYG-TEGGIDYW 235

Query: 313 LIKNSWGNTWGDAGYMKIVRD---EGLCGIGTRSSYPL 347
           +++NSW  TWG+ GYM+I+R+    G CGI T  SYP+
Sbjct: 236 IVENSWDTTWGEEGYMRILRNVGGAGTCGIATMPSYPV 273


>gi|157787177|ref|NP_001099150.1| cathepsin L1-like precursor [Danio rerio]
 gi|157422879|gb|AAI53505.1| MGC174152 protein [Danio rerio]
          Length = 336

 Score =  251 bits (641), Expect = 3e-64,   Method: Compositional matrix adjust.
 Identities = 136/340 (40%), Positives = 203/340 (59%), Gaps = 17/340 (5%)

Query: 20  IIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIE 79
           ++  LLV+     V + S+ +  + +    W +QHG+SY +++E   R+ I++ENL  IE
Sbjct: 1   MMFALLVTLCISAVFAASSIDIQLDDHWNSWKSQHGKSYHEDVEVGRRM-IWEENLRKIE 59

Query: 80  KANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDV 136
           + N E   GN T+K+G NQF D+TN+EFR    GYK       + TS    +   S    
Sbjct: 60  QHNFEYSYGNHTFKMGMNQFGDMTNEEFRHAMNGYKHDP----NQTSQGPLFMEPSFFAA 115

Query: 137 PTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS-TNG 195
           P  +DWR +G VTP+K+QK+CG CW+F++  A+EG    ++G LI +SEQ L+DCS   G
Sbjct: 116 PQQVDWRQRGYVTPVKDQKQCGSCWSFSSTGALEGQLFRKTGKLISMSEQNLVDCSRPQG 175

Query: 196 NNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPG-TCSAAQKPAAAKISNYEEVPSGDE 254
           N GC GG  ++AF Y+ +N+G+ +E  YPY A     C    +   AKI+ + ++P G+E
Sbjct: 176 NQGCNGGLMDQAFQYVKENKGLDSEQSYPYLARDDLPCRYDPRFNVAKITGFVDIPRGNE 235

Query: 255 QALLKAV-SMQPVSIAIAAYSTEFQSYKEGIF--NGVCGTQLDHAVTIVGF---GTTEDG 308
            AL+ AV ++ PVS+AI A     Q Y+ GI+       ++LDHAV +VG+   G    G
Sbjct: 236 LALMNAVAAVGPVSVAIDASHQSLQFYQSGIYYERACSSSRLDHAVLVVGYGYQGADVAG 295

Query: 309 ANYWLIKNSWGNTWGDAGYMKIVRDE-GLCGIGTRSSYPL 347
             YW++KNSW + WGD GY+ + +D+   CGI T +SYPL
Sbjct: 296 NRYWIVKNSWSDKWGDKGYIYMAKDKNNHCGIATMASYPL 335


>gi|356514419|ref|XP_003525903.1| PREDICTED: LOW QUALITY PROTEIN: cysteine proteinase RD21a-like
           [Glycine max]
          Length = 343

 Score =  251 bits (641), Expect = 4e-64,   Method: Compositional matrix adjust.
 Identities = 144/357 (40%), Positives = 204/357 (57%), Gaps = 46/357 (12%)

Query: 9   GSFKINTTPMFIIITLLVSCAS----QVVSSRSTH--------EQSVVEIHEKWMAQHGR 56
           G+ + +   +FI+   +++ +S     ++S   +H        ++ V+ I+E+ +A+HG+
Sbjct: 2   GTNRSSKATIFILFFTVLAVSSALDLSIISYDRSHADKSGWRSDEEVMSIYEEXLAKHGK 61

Query: 57  SYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSP 116
            Y    E E R +I KENL+++E+ N  GNRTYK+G N+F+D +            M  P
Sbjct: 62  VYNAIDEMEERFQISKENLKFVEQHNA-GNRTYKVGLNRFADRSR----------MMTRP 110

Query: 117 SHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIR 176
           S R        Y      ++  S+DWR +GAV  +K Q EC  C  F  +AAVEGI KI 
Sbjct: 111 SSR--------YAPRVSDNLSESVDWRKEGAVVRVKTQSECESCRTFTVIAAVEGINKIV 162

Query: 177 SGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQ 236
           +GNL  LS     DC    N GC GG  + A  +II N GI TE++YP+Q   G C   +
Sbjct: 163 TGNLTALS-----DCDRTVNAGCSGGLADYALEFIINNGGIDTEEDYPFQGAVGICDQYK 217

Query: 237 KPAAAKISNYEEVPSGDEQALLKAVSMQPVSIA-IAAYSTEFQSYKEGIFNGVCGTQLDH 295
             A   +  YE VP+ DE AL KAV+ QPVS+A I AY  EFQ Y+ GIF G CGT +DH
Sbjct: 218 INA---VDGYERVPAYDELALKKAVANQPVSVAYIEAYGKEFQLYESGIFTGKCGTSIDH 274

Query: 296 AVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD-----EGLCGIGTRSSYPL 347
            VT VG+G TE+G +YW++KNSWG  WG+AGY+++ R+      G CGI   + YP+
Sbjct: 275 GVTAVGYG-TENGIDYWIVKNSWGENWGEAGYVRMERNTAEDTAGKCGIAILTLYPI 330


>gi|6650705|gb|AAF21977.1|AF115280_1 thiolproteinase SmTP1 [Sarcocystis muris]
          Length = 394

 Score =  251 bits (641), Expect = 4e-64,   Method: Compositional matrix adjust.
 Identities = 129/305 (42%), Positives = 182/305 (59%), Gaps = 16/305 (5%)

Query: 54  HGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKM 113
           H + Y  E E+  R  IFK NL YI   N +G  +Y L  N+F DLT +EFR  Y GYK 
Sbjct: 96  HNKFYATEEERLKRYAIFKNNLTYIHNHNMQG-YSYVLKMNKFGDLTLEEFRQRYLGYKK 154

Query: 114 PS----PSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAV 169
           P     P    TT      +++   D+PT +DWR +G VT +K+Q +CG CWAF+A  A+
Sbjct: 155 PDLRTPPREVDTT-----LESVEDNDIPTHVDWRQRGCVTSVKDQGDCGSCWAFSATGAM 209

Query: 170 EGITKIRSGNLIQLSEQQLLDCSTN-GNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAV 228
           EG+   ++G L+ LS+QQL+DCS   GN GC GG  E+AF Y+++N GI + + YPY   
Sbjct: 210 EGVYCAKTGKLVNLSQQQLVDCSRFLGNQGCDGGRMEEAFEYVVENGGICSGENYPYMRK 269

Query: 229 PGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVSMQ-PVSIAIAAYSTEFQSYKEGIFNG 287
            G C ++Q  + A I+ Y  VP   E+++  A++++ PVS+AI A    FQ Y +GIF+ 
Sbjct: 270 DGVCKSSQCTSVATITGYRSVPRRSEKSMKTALALRSPVSVAIQANQAAFQFYYDGIFDA 329

Query: 288 VCGTQLDHAVTIVGFGTTEDG-ANYWLIKNSWGNTWGDAGYMKIVRDE---GLCGIGTRS 343
            CGT LDH V +VG+     G  +YW++KNSWG  WG  GYM +   +   G CG+    
Sbjct: 330 PCGTNLDHGVLLVGYSAETAGQGDYWIMKNSWGAAWGKGGYMLMAMHKGPAGQCGVLLDG 389

Query: 344 SYPLA 348
           S+P+A
Sbjct: 390 SFPVA 394


>gi|164420679|ref|NP_001037464.2| fibroinase precursor [Bombyx mori]
 gi|40556818|gb|AAR87763.1| fibroinase precursor [Bombyx mori]
          Length = 341

 Score =  251 bits (641), Expect = 4e-64,   Method: Compositional matrix adjust.
 Identities = 137/341 (40%), Positives = 200/341 (58%), Gaps = 20/341 (5%)

Query: 24  LLVSCASQVVSSRSTHEQSVVEIHEKWMA---QHGRSYKDELEKEMRLKIFKENLEYIEK 80
           +L+ CA   VS+     Q    + E+W A   QH  +Y+ E+E   R+KI+ E+   I K
Sbjct: 5   VLLLCAVAAVSAV----QFFDLVKEEWSAFKLQHRLNYESEVEDNFRMKIYAEHKHIIAK 60

Query: 81  ANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRST-----TSSTFKYQNLS 132
            N++   G  +YKLG N++ D+ + EF     G+   +  +++      +    K+ + +
Sbjct: 61  HNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPA 120

Query: 133 MTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS 192
              +P  +DWR  GAVT IK+Q +CG CW+F+   A+EG    +SG L+ LSEQ L+DCS
Sbjct: 121 NVKLPEQVDWRKHGAVTDIKDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCS 180

Query: 193 TN-GNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPS 251
              GNNGC GG  + AF YI  N GI TE  YPY+ V   C    K   A+   + ++P 
Sbjct: 181 EQYGNNGCNGGLMDNAFKYIKDNGGIDTEQTYPYEGVDDKCRYNPKNTGAEDVGFVDIPE 240

Query: 252 GDEQALLKAV-SMQPVSIAIAAYSTEFQSYKEGIFN--GVCGTQLDHAVTIVGFGTTEDG 308
           GDEQ L++AV ++ PVS+AI A  T FQ Y  G++N      T LDH V +VG+GT E G
Sbjct: 241 GDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQG 300

Query: 309 ANYWLIKNSWGNTWGDAGYMKIVRDE-GLCGIGTRSSYPLA 348
            +YWL+KNSWG +WG+ GY+K++R++   CGI + +SYPL 
Sbjct: 301 VDYWLVKNSWGRSWGELGYIKMIRNKNNRCGIASSASYPLV 341


>gi|293342577|ref|XP_001065834.2| PREDICTED: cathepsin L1 [Rattus norvegicus]
 gi|293354413|ref|XP_573976.3| PREDICTED: cathepsin L1 [Rattus norvegicus]
 gi|149039745|gb|EDL93861.1| rCG24317, isoform CRA_a [Rattus norvegicus]
          Length = 330

 Score =  251 bits (641), Expect = 4e-64,   Method: Compositional matrix adjust.
 Identities = 135/341 (39%), Positives = 198/341 (58%), Gaps = 20/341 (5%)

Query: 16  TPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENL 75
           TP+F++ TL +     ++S+  TH+ S   + E+W  +HG++Y    E + R  +++ N+
Sbjct: 2   TPIFLLATLCLG----MISAAPTHDPSFDTVWEEWKTKHGKTYNTNEEGQKR-AVWENNM 56

Query: 76  EYIEKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLS 132
           + I   N++   G   + L  N F DLTN EFR L TG++             F      
Sbjct: 57  KMINLHNEDYLKGKHGFSLEMNAFGDLTNTEFRELMTGFQGQKTKMMKVFPEPF------ 110

Query: 133 MTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS 192
           + DVP ++DWR  G VTP+KNQ  CG CWAF+AV ++EG    ++G L+ LSEQ L+DCS
Sbjct: 111 LGDVPKTVDWRKHGYVTPVKNQGPCGSCWAFSAVGSLEGQVFRKTGKLVPLSEQNLVDCS 170

Query: 193 -TNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPS 251
            ++GN GC GG  + AF Y+  N G+ T   YPY+A+ GTC    K +AAK+  +  +P 
Sbjct: 171 WSHGNKGCDGGLPDFAFQYVKDNGGLDTSVSYPYEALNGTCRYNPKYSAAKVVGFMSIPP 230

Query: 252 GDEQALLKAV-SMQPVSIAIAAYSTEFQSYKEGIF--NGVCGTQLDHAVTIVGFGTTEDG 308
             E AL+KAV ++ P+S+ I      FQ YK G++       T L+HAV +VG+G   DG
Sbjct: 231 -SENALMKAVATVGPISVGIDIKHKSFQFYKGGMYYEPDCSSTNLNHAVLVVGYGEESDG 289

Query: 309 ANYWLIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYPLA 348
             YWL+KNSWG  WG  GY+K+ +D    CGI + +SYP+ 
Sbjct: 290 RKYWLVKNSWGRDWGMDGYIKMAKDWNNNCGIASDASYPIV 330


>gi|157093563|gb|ABV22436.1| cysteine proteinase [Oxyrrhis marina]
          Length = 329

 Score =  251 bits (641), Expect = 4e-64,   Method: Compositional matrix adjust.
 Identities = 132/315 (41%), Positives = 189/315 (60%), Gaps = 15/315 (4%)

Query: 42  SVVEIHEKW---MAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSD 98
           +V +I  +W    A+ G SY  E E+  R  +F +N++ I + N +G+ TY LG NQF+D
Sbjct: 11  AVADIDAQWEEFKAKFGESYNGEEEEAERKGVFAQNVQLINEENSKGH-TYTLGVNQFAD 69

Query: 99  LTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECG 158
           LT +EF   Y G+K   P+ +   ++       +   +PTS+DW  +GAVTP+KNQ +CG
Sbjct: 70  LTVEEFSKTYMGFK--KPAQKYGDAAYLGRHVYNGEALPTSVDWSSQGAVTPVKNQGQCG 127

Query: 159 CCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS-TNGNNGCLGGSREKAFAYIIQNQGI 217
            CW+F+   ++EG  +I +G L+ LSEQQ +DC+ T GN GC GG  + AF Y   N  +
Sbjct: 128 SCWSFSTTGSLEGANEISTGKLVSLSEQQFVDCAGTYGNQGCNGGLMDSAFKYAEAN-AL 186

Query: 218 ATEDEYPYQAVPGTCSAAQ---KPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYS 274
            TE  YPY+   G+C A+      A   +S Y++V S  EQ ++ AV+ QPVSIAI A  
Sbjct: 187 CTEQSYPYKGTDGSCQASSCSTGLAKGSVSGYKDVSSDSEQDMMSAVAQQPVSIAIEADK 246

Query: 275 TEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRDE 334
           + FQ Y  G+  G CG  LDH V  VG+GT   G +YW +KNSWG+TWG +GY+ + R +
Sbjct: 247 SVFQLYSGGVLTGACGASLDHGVLAVGYGTL-SGTDYWKVKNSWGSTWGMSGYVLLQRGK 305

Query: 335 ---GLCGIGTRSSYP 346
              G CG+ +  SYP
Sbjct: 306 GGSGECGLLSEPSYP 320


>gi|354549232|gb|AER27707.1| putative cysteine protease [Phytophthora sp. SH-2011]
          Length = 533

 Score =  251 bits (640), Expect = 4e-64,   Method: Compositional matrix adjust.
 Identities = 133/305 (43%), Positives = 182/305 (59%), Gaps = 11/305 (3%)

Query: 50  WMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRT-YKLGTNQFSDLTNDEFRALY 108
           WM  HG ++ D LE   RL+ +  N  YI + N E   T   LG N FS ++ DEF+   
Sbjct: 31  WMGAHGVTFSDALEFARRLENYIVNDMYIMEHNAENAWTGVTLGHNAFSHMSFDEFKFKM 90

Query: 109 TGYKMPSPSHRSTTSSTFKYQNL-SMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVA 167
           TG  +P        +S  +   L S  +VP+++DW DKG VTP+KNQ  CG CWAF+   
Sbjct: 91  TGLVLPEGYLEQRLAS--RVDGLWSDVEVPSAVDWVDKGGVTPVKNQGMCGSCWAFSTTG 148

Query: 168 AVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQA 227
           AVEG T + SG L  LSEQ+L+DC  NG+ GC GG  + AF +I  + GI +ED+Y Y+A
Sbjct: 149 AVEGATFVSSGKLPSLSEQELVDCDHNGDMGCNGGLMDHAFQWIEDHGGICSEDDYEYKA 208

Query: 228 VPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNG 287
               C      +  K++ +++V   DE AL  AV+ QPVS+AI A    FQ YK G+FN 
Sbjct: 209 KAQVCRECD--SVVKVTGFQDVNPQDEHALKVAVAQQPVSVAIEADQKAFQFYKSGVFNL 266

Query: 288 VCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRDE----GLCGIGTRS 343
            CGT+LDH V  VG+G  ++G  +W +KNSWG +WG+ GY+++ R+E    G CGI +  
Sbjct: 267 TCGTRLDHGVLAVGYG-NDNGHKFWKVKNSWGASWGEQGYIRLAREENGPAGQCGIASVP 325

Query: 344 SYPLA 348
           SYP A
Sbjct: 326 SYPFA 330


>gi|146152090|gb|ABQ08058.1| cathepsin L [Misgurnus mizolepis]
          Length = 337

 Score =  251 bits (640), Expect = 4e-64,   Method: Compositional matrix adjust.
 Identities = 144/343 (41%), Positives = 200/343 (58%), Gaps = 18/343 (5%)

Query: 18  MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
           M+  + L   C S V ++ S  +Q + +  E+W   HG++Y  E E+  R  I+++NL  
Sbjct: 1   MWTYLALFTLCLSGVFAAPSLDKQ-LDDHWEQWKTWHGKNYH-EKEEGWRRMIWEKNLRK 58

Query: 78  IEKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMT 134
           I+  N E   G  TY+LG N F D+ ++EFR +  GYK    + R    S F   N    
Sbjct: 59  IQFHNLEHSMGIHTYRLGMNHFGDMNHEEFRQVMNGYK--HKTERKFKGSLFMEPNF--L 114

Query: 135 DVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS-T 193
           +VP+ LDWR+KG VTP+K+Q ECG CWAF+   A+EG    + G L+ LSEQ L+DCS  
Sbjct: 115 EVPSKLDWREKGYVTPVKDQGECGSCWAFSTTGAMEGQMFRKQGKLVSLSEQNLVDCSRP 174

Query: 194 NGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGT-CSAAQKPAAAKISNYEEVPSG 252
            GN GC GG  ++AF YI  N G+ +E+ YPY       C    K  AA  + + ++PSG
Sbjct: 175 EGNEGCNGGLMDQAFQYIKDNNGLDSEEAYPYLGTDDQPCHYDPKYNAANDTGFVDIPSG 234

Query: 253 DEQALLKAV-SMQPVSIAIAAYSTEFQSYKEGI-FNGVCGT-QLDHAVTIVGF---GTTE 306
            E AL+KAV S+ PVS+AI A    FQ Y+ GI F   C + +LDH V +VG+   G   
Sbjct: 235 KEHALMKAVASVGPVSVAIDAGHESFQFYQSGIYFEKECSSEELDHGVLVVGYGFEGEDV 294

Query: 307 DGANYWLIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYPLA 348
           DG  YW++KNSW  +WGD GY+ + +D +  CGI T +SYPL 
Sbjct: 295 DGKKYWIVKNSWSESWGDKGYIYMAKDRKNHCGIATAASYPLV 337


>gi|156124998|gb|ABU50817.1| Ale o 1 allergen [Aleuroglyphus ovatus]
          Length = 337

 Score =  251 bits (640), Expect = 5e-64,   Method: Compositional matrix adjust.
 Identities = 130/308 (42%), Positives = 196/308 (63%), Gaps = 13/308 (4%)

Query: 48  EKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKE---GNRTYKLGTNQFSDLTNDEF 104
           E++ +  GR Y     +  R  IF+ NL++I + N +   G+ T+ +  N F+DL+N+EF
Sbjct: 34  EQFKSTFGRVYPSPEIELHRKSIFRANLQFILRHNIDYFNGDSTFSVSVNNFTDLSNEEF 93

Query: 105 RALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFA 164
           RA + GY+  +    + + +   + +  +  +P ++DW  KG VTPIKNQ++CG CWAF+
Sbjct: 94  RATFNGYRRLA----AVSLADSVHADNDVEALPATVDWTTKGVVTPIKNQQQCGSCWAFS 149

Query: 165 AVAAVEGITKIRSGNLIQLSEQQLLDCS-TNGNNGCLGGSREKAFAYIIQNQGIATEDEY 223
           AVA++EG   +++G L+ LSEQ L+DCS   G+ GC GG  + AF Y+IQN+GI TE  Y
Sbjct: 150 AVASMEGQHALKTGKLVSLSEQNLVDCSAAEGDMGCSGGWMDYAFKYVIQNRGIDTEASY 209

Query: 224 PYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAV-SMQPVSIAIAAYSTEFQSYKE 282
           PY+A+  +C   +    A I ++ +V +GDE AL  AV S+ P+S+AI A    FQ Y  
Sbjct: 210 PYKAIDESCEFKRNSVGATIHSFVDVKTGDESALQNAVASIGPISVAIDAAQPSFQFYSS 269

Query: 283 GIFNGV-CGTQ-LDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD-EGLCGI 339
           G++N   C T+ LDH VT VG+GT  +GA YW +KNSWG +WG  GY+ + R+ +  CGI
Sbjct: 270 GVYNEPDCSTEILDHGVTAVGYGTL-NGAPYWKVKNSWGTSWGRKGYIFMSRNKQNQCGI 328

Query: 340 GTRSSYPL 347
            T++SYP+
Sbjct: 329 ATKASYPV 336


>gi|157311713|ref|NP_001098585.1| uncharacterized protein LOC564979 precursor [Danio rerio]
 gi|156230121|gb|AAI52284.1| Wu:fa26c03 protein [Danio rerio]
          Length = 336

 Score =  251 bits (640), Expect = 5e-64,   Method: Compositional matrix adjust.
 Identities = 135/340 (39%), Positives = 203/340 (59%), Gaps = 17/340 (5%)

Query: 20  IIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIE 79
           ++  LLV+     V + S+ +  + +    W +QHG+SY +++E   R+ I++ENL  IE
Sbjct: 1   MMFALLVTLCISAVFAASSIDIQLDDHWNSWKSQHGKSYHEDVEVGRRM-IWEENLRKIE 59

Query: 80  KANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDV 136
           + N E   GN T+K+G NQF D+TN+EFR    GYK       + TS    +   S    
Sbjct: 60  QHNFEYSYGNHTFKMGMNQFGDMTNEEFRQAMNGYKHDP----NRTSQGPLFMEPSFFAA 115

Query: 137 PTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS-TNG 195
           P  +DWR +G VTP+K+QK+CG CW+F++  A+EG    ++G LI +SEQ L+DCS   G
Sbjct: 116 PQQVDWRQRGFVTPVKDQKQCGSCWSFSSTGALEGQLFRKTGKLISMSEQNLVDCSRPQG 175

Query: 196 NNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPG-TCSAAQKPAAAKISNYEEVPSGDE 254
           N GC GG  ++AF Y+ +N+G+ +E  YPY A     C    +   AKI+ + ++P G+E
Sbjct: 176 NQGCNGGLMDQAFQYVKENKGLDSEQSYPYLARDDLPCRYDPRFNVAKITGFVDIPRGNE 235

Query: 255 QALLKAV-SMQPVSIAIAAYSTEFQSYKEGIF--NGVCGTQLDHAVTIVGF---GTTEDG 308
            AL+ AV ++ PVS+AI A     Q Y+ GI+       ++LDHAV +VG+   G    G
Sbjct: 236 LALMNAVAAVGPVSVAIDASHQSLQFYQSGIYYERACSSSRLDHAVLVVGYGYQGADVAG 295

Query: 309 ANYWLIKNSWGNTWGDAGYMKIVRDE-GLCGIGTRSSYPL 347
             YW++KNSW + WGD GY+ + +D+   CG+ T +SYPL
Sbjct: 296 NRYWIVKNSWSDKWGDKGYIYMAKDKNNHCGVATSASYPL 335


>gi|1709574|sp|P10056.2|PAPA3_CARPA RecName: Full=Caricain; AltName: Full=Papaya peptidase A; AltName:
           Full=Papaya proteinase III; Short=PPIII; AltName:
           Full=Papaya proteinase omega; Flags: Precursor
 gi|18098|emb|CAA46862.1| proteinase omega [Carica papaya]
          Length = 348

 Score =  251 bits (640), Expect = 5e-64,   Method: Compositional matrix adjust.
 Identities = 136/314 (43%), Positives = 192/314 (61%), Gaps = 12/314 (3%)

Query: 38  THEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFS 97
           T  + ++++   WM  H + Y++  EK  R +IFK+NL YI++ NK+ N +Y LG N+F+
Sbjct: 39  TSTERLIQLFNSWMLNHNKFYENVDEKLYRFEIFKDNLNYIDETNKK-NNSYWLGLNEFA 97

Query: 98  DLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKEC 157
           DL+NDEF   Y G  + +   +S      ++ N    ++P ++DWR KGAVTP+++Q  C
Sbjct: 98  DLSNDEFNEKYVGSLIDATIEQSYDE---EFINEDTVNLPENVDWRKKGAVTPVRHQGSC 154

Query: 158 GCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGI 217
           G CWAF+AVA VEGI KIR+G L++LSEQ+L+DC    ++GC GG    A  Y+ +N GI
Sbjct: 155 GSCWAFSAVATVEGINKIRTGKLVELSEQELVDCERR-SHGCKGGYPPYALEYVAKN-GI 212

Query: 218 ATEDEYPYQAVPGTCSAAQKPAA-AKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTE 276
               +YPY+A  GTC A Q      K S    V   +E  LL A++ QPVS+ + +    
Sbjct: 213 HLRSKYPYKAKQGTCRAKQVGGPIVKTSGVGRVQPNNEGNLLNAIAKQPVSVVVESKGRP 272

Query: 277 FQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR---- 332
           FQ YK GIF G CGT++DHAVT VG+G +       LIKNSWG  WG+ GY++I R    
Sbjct: 273 FQLYKGGIFEGPCGTKVDHAVTAVGYGKSGGKGYI-LIKNSWGTAWGEKGYIRIKRAPGN 331

Query: 333 DEGLCGIGTRSSYP 346
             G+CG+   S YP
Sbjct: 332 SPGVCGLYKSSYYP 345


>gi|402770499|gb|AFQ98384.1| cathepsin L, partial [Hyalomma anatolicum anatolicum]
          Length = 312

 Score =  250 bits (639), Expect = 6e-64,   Method: Compositional matrix adjust.
 Identities = 133/309 (43%), Positives = 189/309 (61%), Gaps = 14/309 (4%)

Query: 48  EKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKE---GNRTYKLGTNQFSDLTNDEF 104
           E +   H +SY+ ++E+ +R KIF EN   I K N +   G  +YKLG NQF DL   EF
Sbjct: 8   EAFKTTHKKSYQSKMEELLRYKIFTENSLLIAKHNAKYAKGLVSYKLGMNQFGDLLPHEF 67

Query: 105 RALYTGYKMPSPSHRSTTSSTF-KYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAF 163
             ++ GY       R    STF    N++ + +P ++DWR KGAVTP+K+Q +CG CWAF
Sbjct: 68  AKMFNGYH----GERKGRGSTFLPPANVNDSSLPKTVDWRKKGAVTPVKDQGQCGSCWAF 123

Query: 164 AAVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GNNGCLGGSREKAFAYIIQNQGIATEDE 222
           +A  ++EG   ++SG L+ LSEQ L+DCS + GN GC GG  + AF YI  N GI TE+ 
Sbjct: 124 SATGSLEGQHFLKSGKLVSLSEQNLIDCSGSFGNEGCGGGLMDNAFKYIKANDGIDTEES 183

Query: 223 YPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAV-SMQPVSIAIAAYSTEFQSYK 281
           YPY+A+ G C   ++   A  + + ++  G E  L KAV ++ P+S+AI A  + FQ Y 
Sbjct: 184 YPYEAMDGDCRFKKEDVGATDTGFVDIQQGSEDDLQKAVATVGPISVAIDASHSSFQLYS 243

Query: 282 EGIFN--GVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD-EGLCG 338
           EG+++       +LDH V  VG+G  ++G  YWL+KNSW  TWGD GY+ + RD +  CG
Sbjct: 244 EGVYDEPNCSSEELDHGVLAVGYG-VKNGKKYWLVKNSWAETWGDNGYILMSRDKDNQCG 302

Query: 339 IGTRSSYPL 347
           I + +SYPL
Sbjct: 303 IASSASYPL 311


>gi|238481789|gb|ACR43934.1| cathepsin L-like cysteine proteinase [Haliotis diversicolor
           supertexta]
          Length = 347

 Score =  250 bits (639), Expect = 6e-64,   Method: Compositional matrix adjust.
 Identities = 140/303 (46%), Positives = 189/303 (62%), Gaps = 12/303 (3%)

Query: 53  QHGRSYKDELEKEMRLKIFKENLEYIEKANKE---GNRTYKLGTNQFSDLTNDEFRALYT 109
           QHGR Y+   E+E R +IFK+NL+YIE+ NK+   G ++Y LG NQF+D+ N+EFR +Y 
Sbjct: 48  QHGRLYEKHEEEEERFEIFKQNLQYIEEHNKKFSLGQKSYYLGINQFADMKNEEFR-MYN 106

Query: 110 GYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAV 169
           G +      R    S   +        P  +DWR KG VT +KNQ +CG CW+F+   ++
Sbjct: 107 GLRRDYNYSREVQCSN--HLTPEYLVAPDEVDWRKKGYVTAVKNQGQCGSCWSFSTTGSL 164

Query: 170 EGITKIRSGNLIQLSEQQLLDCSTN-GNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAV 228
           EG    +SG L+ LSEQQL+DCS   GN GC GG  ++AF YII N GI TE+EYPY A 
Sbjct: 165 EGQHFHKSGKLVSLSEQQLVDCSGKFGNEGCNGGLMDQAFEYIITNGGIETEEEYPYDAR 224

Query: 229 PGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVS-MQPVSIAIAAYSTEFQSYKEGIFN- 286
              C   +   AA  S   +V SGDE  L  +V+ + PVSIAI A    FQ Y  G+++ 
Sbjct: 225 QERCHFKKSEVAATASGCVDVKSGDETDLKNSVAEVGPVSIAIDASHQSFQLYSGGVYDE 284

Query: 287 -GVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSS 344
                T+LDH V +VG+G T+DG +YWL+KNSWG TWG  GY+K+ R+ +  CG+ T++S
Sbjct: 285 PKCSSTELDHGVLVVGYG-TDDGQDYWLVKNSWGTTWGLEGYVKMSRNQDNQCGVATQAS 343

Query: 345 YPL 347
           YPL
Sbjct: 344 YPL 346


>gi|21489677|gb|AAM55195.1|AF412313_1 cathepsin L cysteine protease [Haemonchus contortus]
 gi|21483192|gb|AAL14224.1| cathepsin L [Haemonchus contortus]
          Length = 354

 Score =  250 bits (639), Expect = 7e-64,   Method: Compositional matrix adjust.
 Identities = 135/326 (41%), Positives = 198/326 (60%), Gaps = 13/326 (3%)

Query: 33  VSSRSTHEQSVVEIHEKW---MAQHGRSYKDELEKEMRLKIFKENLEYIEKANKE---GN 86
           V  + +  Q + E   KW       G+SY+ + E +  ++ F +N+ +IE+ NKE   G 
Sbjct: 30  VHRQKSLRQKIDEAFNKWDDYKETFGKSYEPDEENDY-MEAFVKNVIHIEEHNKEHRLGR 88

Query: 87  RTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKG 146
           +T+++G N+ +DL   ++R L  GY+M      S  S+  K+       +P S+DWR++G
Sbjct: 89  KTFEMGLNEIADLPFSQYRKL-NGYRMRRQFGDSLQSNGTKFLVPFNVQIPESVDWREEG 147

Query: 147 AVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GNNGCLGGSRE 205
            VTP+KNQ  CG CWAF++  A+EG     +G L+ LSEQ L+DCST  GN+GC GG  +
Sbjct: 148 LVTPVKNQGMCGSCWAFSSTGALEGQHARATGKLVSLSEQNLVDCSTKYGNHGCNGGLMD 207

Query: 206 KAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVSMQ- 264
            AF YI +N G+ TED YPY      C   +    A    + ++P GDE+AL KAV+ Q 
Sbjct: 208 LAFEYIKENHGVDTEDSYPYVGRETKCHFKRNAVGADDKGFVDLPEGDEEALKKAVATQG 267

Query: 265 PVSIAIAAYSTEFQSYKEGI-FNGVCGT-QLDHAVTIVGFGTTEDGANYWLIKNSWGNTW 322
           P+SIAI A    FQ YK+G+ F+  C + +LDH V +VG+GT  +  +YWL+KNSWG TW
Sbjct: 268 PISIAIDAGHRSFQLYKKGVYFDEECSSEELDHGVLLVGYGTDPEAGDYWLVKNSWGPTW 327

Query: 323 GDAGYMKIVRDE-GLCGIGTRSSYPL 347
           G+ GY++I R+    CG+ T++SYPL
Sbjct: 328 GEKGYIRIARNRNNHCGVATKASYPL 353


>gi|395535909|ref|XP_003769963.1| PREDICTED: cathepsin S [Sarcophilus harrisii]
          Length = 347

 Score =  250 bits (638), Expect = 8e-64,   Method: Compositional matrix adjust.
 Identities = 139/353 (39%), Positives = 206/353 (58%), Gaps = 17/353 (4%)

Query: 3   LIFERSGSFKINTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDEL 62
           +IF+ S S   N   M ++I + ++CAS     R  H+  +    E W   +G+ Y+++ 
Sbjct: 1   MIFQDSKSSPANLLRMKVVIWMFLACASTTAYLR--HDPMLDNHWELWKKTYGKQYEEQN 58

Query: 63  EKEMRLKIFKENLEYIEKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHR 119
           ++  R  I+++NL+++   N E   G  +Y L  N  SD+T++E  +L +  ++P+   R
Sbjct: 59  QEVTRRLIWEKNLKFVTLHNLEHSMGLHSYDLSMNHLSDMTSEEVASLMSSLRIPNQWSR 118

Query: 120 STTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGN 179
           +TT     Y+  S   +P S+DWRDKG VT +K Q  CG CWAF+AV A+E   K+++G 
Sbjct: 119 NTT-----YRLNSNQKLPDSVDWRDKGCVTEVKYQGTCGSCWAFSAVGALEAQLKLKTGK 173

Query: 180 LIQLSEQQLLDCSTN---GNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQ 236
           L+ LS Q L+DCSTN    N+GC GG   +AF YII N GI ++  YPY+A  G C    
Sbjct: 174 LVSLSAQNLVDCSTNEKYENHGCNGGCMTEAFQYIIDNNGIDSDASYPYKAKDGKCQYNP 233

Query: 237 KPAAAKISNYEEVPSGDEQALLKAVSMQ-PVSIAIAAYSTEFQSYKEGI-FNGVCGTQLD 294
              AA  S Y E+P G E AL +AV+ + PVS+ I A    F  YK G+ ++  C   ++
Sbjct: 234 ANRAATCSRYTELPYGSEDALKEAVANKGPVSVGIDASLPSFFLYKSGVYYDPSCTQNVN 293

Query: 295 HAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRDEG-LCGIGTRSSYP 346
           H V + G+G   DG +YWL+KNSWG ++GD GY++I R+ G  CGI    SYP
Sbjct: 294 HGVLVTGYGNL-DGKDYWLVKNSWGLSFGDKGYIRIARNRGNHCGIANFPSYP 345


>gi|357627452|gb|EHJ77132.1| cathepsin L-like protease [Danaus plexippus]
          Length = 341

 Score =  249 bits (637), Expect = 9e-64,   Method: Compositional matrix adjust.
 Identities = 132/337 (39%), Positives = 198/337 (58%), Gaps = 13/337 (3%)

Query: 24  LLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANK 83
           LLV CA     +  +    V E    +  +H + Y  E E++ R+KI+ EN   + K N+
Sbjct: 4   LLVLCAVVAAGTAVSFFDLVREEWNTFKLEHKKQYDSETEEKFRMKIYAENKHKVAKHNQ 63

Query: 84  ---EGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTD----- 135
              +G  +Y+L TN++SD+ + EF     G+      ++   +     +  +        
Sbjct: 64  RYQKGLVSYRLKTNKYSDMLHHEFVNTMNGFNKTVKHNKGLYAKGNDIRGATFVSPANVA 123

Query: 136 VPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN- 194
            P ++DWR  GAVTP+K+Q +CG CW+F+   A+EG    +SG L+ LSEQ L+DCS+  
Sbjct: 124 APPTVDWRQHGAVTPVKDQGKCGSCWSFSTTGALEGQHFRKSGFLVSLSEQNLIDCSSAY 183

Query: 195 GNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDE 254
           GNNGC GG  + AF YI  N GI TE  YPY+AV   C    K + A+   + ++P+GDE
Sbjct: 184 GNNGCNGGLMDNAFKYIKDNDGIDTEKTYPYEAVDDKCRYNPKNSGAEDVGFVDIPAGDE 243

Query: 255 QALLKAV-SMQPVSIAIAAYSTEFQSYKEGI-FNGVCGTQ-LDHAVTIVGFGTTEDGANY 311
             L+ A+ ++ PVS+AI A    FQ Y +G+ ++  C ++ LDH V +VG+GT EDG +Y
Sbjct: 244 HKLMLALATVGPVSVAIDASQESFQLYSDGVYYDENCSSENLDHGVLVVGYGTDEDGGDY 303

Query: 312 WLIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYPL 347
           WL+KNSWG +WGD GY+K+ R+ +  CGI + +SYPL
Sbjct: 304 WLVKNSWGPSWGDEGYIKMARNRDNHCGIASSASYPL 340


>gi|119433808|gb|ABL74967.1| cysteine protease [Acanthamoeba castellanii]
          Length = 330

 Score =  249 bits (637), Expect = 1e-63,   Method: Compositional matrix adjust.
 Identities = 132/321 (41%), Positives = 186/321 (57%), Gaps = 9/321 (2%)

Query: 32  VVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKL 91
           V S+ +     +  +   WM  H +SY +E E   R  +++EN  +I++ N++ N +Y L
Sbjct: 15  VASTLAYKHDPLTGVFADWMRTHTKSYSNE-EFVFRWNVWRENYNFIQEENRK-NNSYYL 72

Query: 92  GTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPI 151
             N+F DLTN EF  +Y G      +H     +           +P + DWR KGAVT +
Sbjct: 73  TMNKFGDLTNAEFNKVYKGLAFDYSAH--ILKAKAATPAAPAPGLPANFDWRQKGAVTHV 130

Query: 152 KNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS-TNGNNGCLGGSREKAFAY 210
           KNQ +CG CW+F+   + EG   ++ G L+ LSEQ L+DCS + GNNGC GG  + AF Y
Sbjct: 131 KNQGQCGSCWSFSTTGSTEGANFLKRGTLVSLSEQNLIDCSGSYGNNGCNGGLMDYAFEY 190

Query: 211 IIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAI 270
           II N+GI TE  YPY+     C      +   +++Y +V SGDE ALL AV+++P S+AI
Sbjct: 191 IINNKGIDTEASYPYETAQYNCRYNPANSGGSLTSYTDVSSGDENALLNAVAIEPTSVAI 250

Query: 271 AAYSTEFQSYKEGIF--NGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYM 328
            A    FQ Y  G++  +    TQLDH V  VG+G TE+G +YWL+KNSWG  WG  GY+
Sbjct: 251 DASHNSFQFYSGGVYYESSCSSTQLDHGVLAVGWG-TENGQDYWLVKNSWGADWGLQGYI 309

Query: 329 KIVRD-EGLCGIGTRSSYPLA 348
           K+ R+    CGI T +SYP A
Sbjct: 310 KMARNRHNNCGIATAASYPTA 330


>gi|162138968|ref|NP_001104662.1| uncharacterized protein LOC567623 precursor [Danio rerio]
 gi|158254065|gb|AAI54241.1| Zgc:174153 protein [Danio rerio]
          Length = 336

 Score =  249 bits (637), Expect = 1e-63,   Method: Compositional matrix adjust.
 Identities = 140/343 (40%), Positives = 205/343 (59%), Gaps = 22/343 (6%)

Query: 18  MFIIITLLVSCASQVVSSRSTHEQSVVEIH-EKWMAQHGRSYKDELEKEMRLKIFKENLE 76
           MF +I  L  C S V ++ S   Q  ++ H   W +QHG+SY +++E   R+ I++ENL 
Sbjct: 2   MFALIITL--CISAVFTAPSIDIQ--LDDHWNSWKSQHGKSYHEDVEVGRRM-IWEENLR 56

Query: 77  YIEKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSM 133
            IE+ N E   GN T+K+G NQF D+TN+EFR    GYK       + TS    +   S 
Sbjct: 57  KIEQHNFEYSYGNHTFKMGMNQFGDMTNEEFRQAMNGYKHDP----NRTSQGPLFMEPSF 112

Query: 134 TDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS- 192
              P  +DWR +G VTP+K+QK+CG CW+F++  A+EG    ++G LI +SEQ L+DCS 
Sbjct: 113 FAAPQQVDWRQRGYVTPVKDQKQCGSCWSFSSTGALEGQLFRKTGKLISMSEQNLVDCSR 172

Query: 193 TNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPG-TCSAAQKPAAAKISNYEEVPS 251
             GN GC GG  + AF Y+ +N+G+ +E  YPY A     C    +   AK + + ++PS
Sbjct: 173 PQGNQGCNGGLMDLAFQYVKENKGLDSEQSYPYLARDDLPCRYDPRFNVAKSTGFVDIPS 232

Query: 252 GDEQALLKAV-SMQPVSIAIAAYSTEFQSYKEGIF--NGVCGTQLDHAVTIVGF---GTT 305
           G+E AL+ AV ++ PVS+AI A     Q Y+ GI+       ++LDHAV +VG+   G  
Sbjct: 233 GNEPALMNAVAAVGPVSVAIDASHQSLQFYQSGIYYERACSSSRLDHAVLVVGYGYQGAD 292

Query: 306 EDGANYWLIKNSWGNTWGDAGYMKIVRDE-GLCGIGTRSSYPL 347
             G  YW++KNSW + WGD GY+ + +D+   CG+ T++SYPL
Sbjct: 293 VAGNRYWIVKNSWSDKWGDKGYIYMAKDKNNHCGVATKASYPL 335


>gi|18858809|ref|NP_571273.1| cathepsin L, 1 b precursor [Danio rerio]
 gi|1752664|emb|CAA69623.1| cathepsin L [Danio rerio]
          Length = 336

 Score =  249 bits (637), Expect = 1e-63,   Method: Compositional matrix adjust.
 Identities = 134/340 (39%), Positives = 203/340 (59%), Gaps = 17/340 (5%)

Query: 20  IIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIE 79
           ++  LLV+     V +  + +  + +    W +QHG+SY +++E   R+ I++ENL  IE
Sbjct: 1   MMFALLVTLYISAVFAAPSIDIQLDDHWNSWKSQHGKSYHEDVEVGRRM-IWEENLRKIE 59

Query: 80  KANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDV 136
           + N E   GN T+K+G NQF D+TN+EFR    GY        + TS    +   S    
Sbjct: 60  QHNFEYSYGNHTFKMGMNQFGDMTNEEFRQAMNGYTHDP----NQTSQGPLFMEPSFFAA 115

Query: 137 PTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS-TNG 195
           P  +DWR +G VTP+K+QK+CG CW+F++  A+EG    ++G LI +SEQ L+DCS   G
Sbjct: 116 PQQVDWRQRGYVTPVKDQKQCGSCWSFSSTGALEGQLFRKTGKLISMSEQNLVDCSRPQG 175

Query: 196 NNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPG-TCSAAQKPAAAKISNYEEVPSGDE 254
           N GC GG  ++AF Y+ +N+G+ +E  YPY A     C    +   AKI+ + ++PSG+E
Sbjct: 176 NQGCNGGLMDQAFQYVKENKGLDSEQSYPYLARDDLPCRYDPRFNVAKITGFVDIPSGNE 235

Query: 255 QALLKAV-SMQPVSIAIAAYSTEFQSYKEGIF--NGVCGTQLDHAVTIVGF---GTTEDG 308
            AL+ AV ++ PVS+AI A     Q Y+ GI+       ++LDHAV +VG+   G    G
Sbjct: 236 LALMNAVAAVGPVSVAIDASHQSLQFYQSGIYYERACSSSRLDHAVLVVGYGYQGADVAG 295

Query: 309 ANYWLIKNSWGNTWGDAGYMKIVRDE-GLCGIGTRSSYPL 347
             YW++KNSW + WGD GY+ + +D+   CG+ T++SYPL
Sbjct: 296 NRYWIVKNSWSDKWGDKGYIYMAKDKNNHCGVATKASYPL 335


>gi|356545079|ref|XP_003540973.1| PREDICTED: LOW QUALITY PROTEIN: vignain-like [Glycine max]
          Length = 330

 Score =  249 bits (636), Expect = 1e-63,   Method: Compositional matrix adjust.
 Identities = 131/296 (44%), Positives = 182/296 (61%), Gaps = 12/296 (4%)

Query: 29  ASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRT 88
           ASQV + R+  + S+ E HE+WM+++G+ YKD  E+E R +IFKEN+ YIE +N    + 
Sbjct: 5   ASQV-TCRTLQDASMYERHEEWMSRYGKVYKDPREREKRFRIFKENMNYIETSNNVAIKP 63

Query: 89  YKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAV 148
            KL  NQF+DL N+EF A    +K        +   TF +        P       KGAV
Sbjct: 64  XKLVINQFADLNNEEFIAPRNIFKGMILCRFLSRKHTFPF--------PYVFLGHKKGAV 115

Query: 149 TPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-NNGCLGGSREKA 207
           TP+K+Q  CG CWAF  VA+ EGI  + +G LI LSEQ+L+DC T G + GC  G  + A
Sbjct: 116 TPVKDQGHCGFCWAFYDVASTEGILALTAGKLISLSEQELVDCDTKGVDQGCECGLMDDA 175

Query: 208 FAYIIQNQGIATEDEYPYQAVPGTCSAAQKP-AAAKISNYEEVPSGDEQALLKAVSMQPV 266
           F +IIQN G+   + YPY+ V G C+A ++   AA I+  E+VP+ +E+AL K V+ QPV
Sbjct: 176 FKFIIQNHGVXDAN-YPYKGVDGKCNANEEANPAATITGXEDVPANNEKALQKVVANQPV 234

Query: 267 SIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTW 322
            +AI A  ++FQ YK G+F G C T+L+H VT +G+G + DG  YWL+KNS    W
Sbjct: 235 FVAIDACDSDFQFYKSGVFTGSCETELNHGVTTMGYGVSHDGTQYWLVKNSXETEW 290


>gi|254746340|emb|CAX16635.1| putative C1A cysteine protease precursor [Manduca sexta]
          Length = 342

 Score =  249 bits (636), Expect = 1e-63,   Method: Compositional matrix adjust.
 Identities = 132/319 (41%), Positives = 186/319 (58%), Gaps = 16/319 (5%)

Query: 46  IHEKWMA---QHGRSYKDELEKEMRLKIFKENLEYIEKANK---EGNRTYKLGTNQFSDL 99
           + E+W+A   QH + Y  E+E   R+KI+ EN   I K N+   +G  +YKLG N+++D+
Sbjct: 24  VKEEWVAFKMQHDKKYDSEVEDRFRMKIYAENKHKIAKHNQLYEQGLVSYKLGPNKYTDM 83

Query: 100 TNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSM-----TDVPTSLDWRDKGAVTPIKNQ 154
            + EF     GY   +  ++         +  +         P  +DW  KGAVT +K+Q
Sbjct: 84  LHHEFIQAMNGYNRTAKHNKGLYGKKHDVRGATFIPPAHVKYPDHVDWTKKGAVTEVKDQ 143

Query: 155 KECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDC-STNGNNGCLGGSREKAFAYIIQ 213
            +CG CWAF+   A+EG    +SG L+ LSEQ L+DC ST GNNGC GG  + AF YI  
Sbjct: 144 GKCGSCWAFSTTGALEGQHFRKSGYLVSLSEQNLIDCSSTYGNNGCNGGLMDNAFKYIKD 203

Query: 214 NQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAV-SMQPVSIAIAA 272
           N GI TE  YPY+ V   C    K + A+   + ++PSGDE+ L++AV ++ PVS+AI A
Sbjct: 204 NGGIDTEKTYPYEGVDDKCRYNPKNSGAEDVGFVDIPSGDEEKLMQAVATVGPVSVAIDA 263

Query: 273 YSTEFQSYKEGIFNGV--CGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKI 330
               FQ Y  G++       T LDH V +VG+GT E G +YWL+KNSW  TWG+ GY+K+
Sbjct: 264 SQNSFQFYSGGVYYDTECSSTDLDHGVLVVGYGTDEAGGDYWLVKNSWSRTWGELGYIKM 323

Query: 331 VRD-EGLCGIGTRSSYPLA 348
            R+ +  CGI T +SYPL 
Sbjct: 324 ARNRDNHCGIATDASYPLV 342


>gi|161408097|dbj|BAF94152.1| cathepsin L-like cysteine protease 2 [Plautia stali]
          Length = 334

 Score =  249 bits (636), Expect = 1e-63,   Method: Compositional matrix adjust.
 Identities = 131/303 (43%), Positives = 185/303 (61%), Gaps = 10/303 (3%)

Query: 54  HGRSYKDELEKEMRLKIFKENLEYIEKAN---KEGNRTYKLGTNQFSDLTNDEFRALYTG 110
           H + Y +ELE+  R KIF EN + IEK N   K+G  ++KL  N  +D+   E+  +Y G
Sbjct: 34  HRKEYDNELEESYRKKIFLENKKRIEKHNSRYKQGKVSFKLKLNHLADMLIHEYSDVYLG 93

Query: 111 YKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVE 170
           +   S ++ +   S + +   +   +   +DWR KGAVTP+KNQ  CG CWAF+   A+E
Sbjct: 94  FNKSSKANNNKLQS-YTFIPPAHVTLNKEVDWRTKGAVTPVKNQGHCGSCWAFSTTGALE 152

Query: 171 GITKIRSGNLIQLSEQQLLDCS-TNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVP 229
           G    ++G L+ LSEQ L+DCS + GNNGC GG  + AF YI +N GI TE  YPY+   
Sbjct: 153 GQNFRKTGKLVSLSEQNLVDCSGSYGNNGCEGGLMDNAFQYIKENHGIDTEKSYPYEGED 212

Query: 230 GTCSAAQKPAAAKISNYEEVPSGDEQALLKAV-SMQPVSIAIAAYSTEFQSYKEGI-FNG 287
            TC   +    A  S + ++  GDE+AL++AV ++ P+S+AI A    FQ Y EG+ +  
Sbjct: 213 ETCRFRKTSIGATDSGFVDITQGDEEALMQAVATIGPISVAIDASHQSFQFYSEGVYYEP 272

Query: 288 VCGTQ-LDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSY 345
            C ++ LDH V +VG+G  ED   YWL+KNSWG  WGD GY+K+ RD +  CGI T++SY
Sbjct: 273 ECSSENLDHGVLVVGYG-VEDNQKYWLVKNSWGTQWGDGGYIKMARDQDNNCGIATQASY 331

Query: 346 PLA 348
           PL 
Sbjct: 332 PLV 334


>gi|262410743|gb|ACY66807.1| cathepsin L [Aphis gossypii]
          Length = 341

 Score =  249 bits (636), Expect = 1e-63,   Method: Compositional matrix adjust.
 Identities = 141/345 (40%), Positives = 199/345 (57%), Gaps = 18/345 (5%)

Query: 18  MFIIITL-LVSCASQVVSSRSTHEQSVVEIHEKW---MAQHGRSYKDELEKEMRLKIFKE 73
           M ++I L LV  A   VSS + +E     I E+W    AQ  + Y+D  E+  R K++ +
Sbjct: 1   MKVVIVLGLVVFAISSVSSINLNEV----IEEEWSLFKAQFKKIYEDVKEEAFRKKVYLD 56

Query: 74  NLEYIEKANK---EGNRTYKLGTNQFSDLTNDEFRALYTGYK--MPSPSHRSTTSSTFKY 128
           N   I + NK    G  TY L  N F DL   E++ +  G+K  +       T      +
Sbjct: 57  NKLKIARHNKLYETGEETYALEMNHFGDLMQHEYKKMMNGFKPSLAGGDKNFTDDDAVTF 116

Query: 129 QNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQL 188
                  VP ++DWR KG VTP+KNQ +CG CW+F+A  ++EG    ++G L+ LSEQ L
Sbjct: 117 LKSENVVVPKAIDWRKKGYVTPVKNQGQCGSCWSFSATGSLEGQHFRKTGVLVSLSEQNL 176

Query: 189 LDCSTN-GNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYE 247
           +DCS   GNNGC GG  + AF YI  N+G+ TE  YPY+A    C    + + A    + 
Sbjct: 177 IDCSRKYGNNGCEGGLMDLAFKYIKSNKGLDTEKSYPYEAEDDKCRYNPENSGATDKGFV 236

Query: 248 EVPSGDEQALLKAV-SMQPVSIAIAAYSTEFQSYKEGIF-NGVC-GTQLDHAVTIVGFGT 304
           ++P GDE AL+ A+ ++ PVSIAI A S +FQ YK+G+F N  C  T+LDH V  VG+GT
Sbjct: 237 DIPEGDEDALMHALATVGPVSIAIDASSEKFQFYKKGVFYNPRCSSTELDHGVLAVGYGT 296

Query: 305 TEDGANYWLIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYPLA 348
              G +YW++KNSWG TWGD GY+ + R+ +  CG+ + +SYPL 
Sbjct: 297 DHKGGDYWIVKNSWGKTWGDQGYIMMARNKKNNCGVASSASYPLV 341


>gi|340371596|ref|XP_003384331.1| PREDICTED: digestive cysteine proteinase 2-like [Amphimedon
           queenslandica]
          Length = 327

 Score =  249 bits (636), Expect = 2e-63,   Method: Compositional matrix adjust.
 Identities = 128/333 (38%), Positives = 196/333 (58%), Gaps = 17/333 (5%)

Query: 19  FIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYI 78
           F+ + LL+   S  V+          E    W  ++G++Y+   E  MR KI+ +N +Y+
Sbjct: 9   FVAVLLLIGLVSAAVND--------AEEWRLWKGKYGKTYRSIYEDNMRQKIWLQNRDYV 60

Query: 79  EKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPT 138
            + N   + +++L  N+F+DLT +EF ++Y GY           ++ ++Y   +   +P 
Sbjct: 61  NEHNSM-DSSFQLEVNEFADLTAEEFSSIYNGYGKGRNRENHENTTIYRYTGGA---IPD 116

Query: 139 SLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNG 198
           S+DWR KG VTP+KNQK+CG CWAF+   ++EG    ++G L+ LSEQ L+DC    ++G
Sbjct: 117 SVDWRTKGLVTPVKNQKQCGSCWAFSTTGSLEGAHAKKTGKLVSLSEQNLVDCDKK-DHG 175

Query: 199 CLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALL 258
           C GG    AF YI +N+GI TE+ YPY+A  G C   +    A +  +  + + D +AL 
Sbjct: 176 CQGGLMTTAFKYIEENKGIDTEESYPYKAKNGRCEFKKDDIGATVERHVSILTTDCEALK 235

Query: 259 KAVS-MQPVSIAIAAYSTEFQSYKEGIFN-GVCGT-QLDHAVTIVGFGTTEDGANYWLIK 315
           KAV+ + P+S+A+ A  + FQ YK GI++  +C + +LDH V +VG+G  EDG  YWL+K
Sbjct: 236 KAVAEIGPISVAMDASHSSFQLYKSGIYDPKICSSRKLDHGVLVVGYG-KEDGEEYWLVK 294

Query: 316 NSWGNTWGDAGYMKIVRDEGLCGIGTRSSYPLA 348
           NSWG  WG  GY KI   + LCGI T + YP+ 
Sbjct: 295 NSWGKNWGMEGYFKIASKKNLCGICTSACYPVV 327


>gi|156124996|gb|ABU50816.1| Ale o 1 allergen [Aleuroglyphus ovatus]
          Length = 337

 Score =  249 bits (635), Expect = 2e-63,   Method: Compositional matrix adjust.
 Identities = 129/308 (41%), Positives = 195/308 (63%), Gaps = 13/308 (4%)

Query: 48  EKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKE---GNRTYKLGTNQFSDLTNDEF 104
           E++ +  GR Y     +  R  IF+ NL++I + N +   G+ T+ +  N F+DL+N+EF
Sbjct: 34  EQFKSTFGRVYPSPEIELHRKSIFRANLQFILRHNIDYFNGDSTFSVSVNNFTDLSNEEF 93

Query: 105 RALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFA 164
           RA + GY+  +    + + +   + +  +  +P ++DW  KG VTPIKNQ++CG CWAF+
Sbjct: 94  RATFNGYRRLA----AVSLADSVHADNDVEALPATVDWTTKGVVTPIKNQQQCGSCWAFS 149

Query: 165 AVAAVEGITKIRSGNLIQLSEQQLLDCS-TNGNNGCLGGSREKAFAYIIQNQGIATEDEY 223
           AVA++EG   +++G L+ LSEQ L+DCS   G+ GC GG  + AF Y+IQN+GI TE  Y
Sbjct: 150 AVASMEGQHALKTGKLVSLSEQNLVDCSAAEGDMGCSGGWMDYAFKYVIQNRGIDTEASY 209

Query: 224 PYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAV-SMQPVSIAIAAYSTEFQSYKE 282
           PY+A+  +C   +    A I ++ +V +GDE AL  AV S+ P+S+AI A    FQ Y  
Sbjct: 210 PYKAIDESCEFKRNSIGATIHSFVDVKTGDESALQNAVASIGPISVAIDASQPSFQFYSS 269

Query: 283 GIFNGV-CGTQ-LDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD-EGLCGI 339
           G++N   C T+ LDH VT VG+GT  +G  YW +KNSWG +WG  GY+ + R+ +  CGI
Sbjct: 270 GVYNEPDCSTEILDHGVTAVGYGTL-NGVPYWKVKNSWGTSWGQKGYIFMSRNKQNQCGI 328

Query: 340 GTRSSYPL 347
            T++SYP+
Sbjct: 329 ATKASYPV 336


>gi|427797099|gb|JAA64001.1| Putative cathepsin l cathepsin l, partial [Rhipicephalus
           pulchellus]
          Length = 331

 Score =  249 bits (635), Expect = 2e-63,   Method: Compositional matrix adjust.
 Identities = 134/318 (42%), Positives = 191/318 (60%), Gaps = 9/318 (2%)

Query: 38  THEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRT---YKLGTN 94
           THE+ V      + A HG+ Y+ + E+  RLKI+ EN   I + N++  ++   YKL  N
Sbjct: 14  THEELVGAEWSAFKALHGKEYESDTEEYYRLKIYMENRLKIARHNEKYAKSQVSYKLAMN 73

Query: 95  QFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQ 154
           +F D+ + EF +   G+K          S   + + L    +P ++DWR KGAVTP+KNQ
Sbjct: 74  EFGDMLHHEFVSTRNGFKRNYRDTPREGSFFVEPEGLEDFHLPKTVDWRKKGAVTPVKNQ 133

Query: 155 KECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GNNGCLGGSREKAFAYIIQ 213
            +CG CW+F+   ++EG    +   L+ LSEQ L+DCS + GNNGC GG  + AF YI  
Sbjct: 134 GQCGSCWSFSTTGSLEGQHFRKLHKLVSLSEQNLIDCSRSFGNNGCEGGLMDYAFKYIKA 193

Query: 214 NQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAV-SMQPVSIAIAA 272
           N+GI TE  YPY A  G C   +    A  + + ++P GDE  L KAV ++ PVS+AI A
Sbjct: 194 NKGIDTEQSYPYNATDGVCHFNKSAVGATDTGFVDIPEGDENKLKKAVATVGPVSVAIDA 253

Query: 273 YSTEFQSYKEGIFNGV-CGT-QLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKI 330
               FQ Y EG+++   C + QLDH V +VG+G T+DG +YWL+KNSWG TWGD GY+ +
Sbjct: 254 SHESFQFYSEGVYDEPECDSEQLDHGVLVVGYG-TKDGQDYWLVKNSWGTTWGDGGYIYM 312

Query: 331 VRD-EGLCGIGTRSSYPL 347
            R+ +  CGI + +SYPL
Sbjct: 313 SRNKDNQCGIASAASYPL 330


>gi|224079085|ref|XP_002305743.1| predicted protein [Populus trichocarpa]
 gi|222848707|gb|EEE86254.1| predicted protein [Populus trichocarpa]
          Length = 494

 Score =  249 bits (635), Expect = 2e-63,   Method: Compositional matrix adjust.
 Identities = 134/317 (42%), Positives = 195/317 (61%), Gaps = 14/317 (4%)

Query: 40  EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYI-EKANKEGNRTYKLGTNQFSD 98
           ++S++EI ++W  +H ++YK   E E R   FK NL+YI EK  KE    +++G N+F+D
Sbjct: 36  DESIIEIFQQWRDRHQKAYKHAEEAEKRFGNFKRNLKYIIEKTGKETTLRHRVGLNKFAD 95

Query: 99  LTNDEFRALYTGYKMPSPSHRSTTSSTFK-YQNLSMTDVPTSLDWRDKGAVTPIKNQKEC 157
           L+N+EF+ LY   K+  P +++   +  +  +NL   D P+SLDWR KG VT +K+Q +C
Sbjct: 96  LSNEEFKQLYLS-KVKKPINKTRIDAEDRSRRNLQSCDAPSSLDWRKKGVVTAVKDQGDC 154

Query: 158 GCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGI 217
           G CW+F+   A+EGI  I + +LI LSEQ+L+DC T  N GC GG  + AF ++I N GI
Sbjct: 155 GSCWSFSTTGAIEGINAIVTSDLISLSEQELVDCDTT-NYGCEGGYMDYAFEWVINNGGI 213

Query: 218 ATEDEYPYQAVPGTCSAAQKP-AAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTE 276
            TE  YPY  V GTC+ A++      I  Y++V   D  ALL A + QP+S+ I   + +
Sbjct: 214 DTEANYPYTGVDGTCNTAKEEIKVVSIDGYKDVDETD-SALLCAAAQQPISVGIDGSAID 272

Query: 277 FQSYKEGIF---NGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD 333
           FQ Y  GI+          +DHAV IVG+G +E+G +YW++KNSWG +WG  GY  I R+
Sbjct: 273 FQLYTGGIYDGDCSDDPDDIDHAVLIVGYG-SENGEDYWIVKNSWGTSWGIEGYFYIKRN 331

Query: 334 E----GLCGIGTRSSYP 346
                G+C I   +SYP
Sbjct: 332 TDLPYGVCAINAMASYP 348


>gi|110625773|ref|NP_081620.2| cathepsin L-like 3 precursor [Mus musculus]
 gi|74208432|dbj|BAE26401.1| unnamed protein product [Mus musculus]
 gi|187955662|gb|AAI47425.1| RIKEN cDNA 2310051M13 gene [Mus musculus]
 gi|187957686|gb|AAI47424.1| RIKEN cDNA 2310051M13 gene [Mus musculus]
          Length = 331

 Score =  249 bits (635), Expect = 2e-63,   Method: Compositional matrix adjust.
 Identities = 138/341 (40%), Positives = 199/341 (58%), Gaps = 19/341 (5%)

Query: 16  TPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENL 75
           TP+F++ TL +     VVS+   H  S+  + E+W  +H ++Y    E + R  +++ N 
Sbjct: 2   TPVFLLATLCLG----VVSAAPAHNPSLDAVWEEWKTKHKKTYNMNDEGQKR-AVWENNK 56

Query: 76  EYIEKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLS 132
           + I+  N++   G   + L  N F DLTN EFR L TG++      + T      +Q   
Sbjct: 57  KMIDLHNEDYLKGKHGFSLEMNAFGDLTNTEFRELMTGFQ-----GQKTKMMMKVFQEPL 111

Query: 133 MTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS 192
           + DVP S+DWRD G VTP+K+Q  CG CWAF+AV ++EG    ++G L+ LS Q L+DCS
Sbjct: 112 LGDVPKSVDWRDHGYVTPVKDQGSCGSCWAFSAVGSLEGQMFRKTGKLVPLSVQNLVDCS 171

Query: 193 -TNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPS 251
            + GN GC GG  + AF Y+  N G+ T   YPY+A+ GTC    K +AA ++ +  V S
Sbjct: 172 WSQGNQGCDGGLPDLAFQYVKDNGGLDTSVSYPYEALNGTCRYNPKNSAATVTGFVNVQS 231

Query: 252 GDEQALLKAV-SMQPVSIAIAAYSTEFQSYKEGIF--NGVCGTQLDHAVTIVGFGTTEDG 308
             E AL+KAV ++ P+S+ I      FQ YKEG++       T LDHAV +VG+G   DG
Sbjct: 232 -SEDALMKAVATVGPISVGIDTKHKSFQFYKEGMYYEPDCSSTVLDHAVLVVGYGEESDG 290

Query: 309 ANYWLIKNSWGNTWGDAGYMKIVRDE-GLCGIGTRSSYPLA 348
             YWL+KNSWG  WG  GY+K+ +D    CGI + +SYP+ 
Sbjct: 291 RKYWLVKNSWGRDWGMNGYIKMAKDRNNNCGIASDASYPVV 331


>gi|156371477|ref|XP_001628790.1| predicted protein [Nematostella vectensis]
 gi|156215775|gb|EDO36727.1| predicted protein [Nematostella vectensis]
          Length = 330

 Score =  248 bits (634), Expect = 2e-63,   Method: Compositional matrix adjust.
 Identities = 136/335 (40%), Positives = 193/335 (57%), Gaps = 13/335 (3%)

Query: 18  MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
           + +   LL + AS  V      EQ      + W   H + Y    E+  R  I+++NL+ 
Sbjct: 3   LLVAACLLFAVASGFVVKFDEDEQQW----QAWKLFHTKKYTTVTEEGARKAIWRDNLKK 58

Query: 78  IEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVP 137
           I+K N EG+ ++ L  N   DLT DEFR  YTG +    ++     S F     S   VP
Sbjct: 59  IQKHNAEGH-SFTLAMNHLGDLTQDEFRYFYTGMRSHYSNYTKKQGSAFLAP--SHVQVP 115

Query: 138 TSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GN 196
            ++DWR +G VTP+KNQ +CG CWAF+   ++EG    ++G L+ LSEQ L+DCST  GN
Sbjct: 116 DTVDWRKEGYVTPVKNQGQCGSCWAFSTTGSLEGQNFKKTGKLVSLSEQNLVDCSTAYGN 175

Query: 197 NGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQA 256
           NGC GG  + AF YI +N GI TE+ YPY+A    C   +    A  + + +V  GDE+A
Sbjct: 176 NGCQGGLMDYAFKYIKENGGIDTEESYPYEARNDRCRFQKSNIGAVDTGFVDVTHGDEEA 235

Query: 257 LLKAV-SMQPVSIAIAAYSTEFQSYKEGIFN--GVCGTQLDHAVTIVGFGTTEDGANYWL 313
           L  A  ++ P+S+AI A    FQ Y  G++N  G   T LDH V +VG+GT + G++YWL
Sbjct: 236 LKTAAGTVGPISVAIDAGHMSFQFYHSGVYNNAGCSSTSLDHGVLVVGYGTYQ-GSDYWL 294

Query: 314 IKNSWGNTWGDAGYMKIVRDE-GLCGIGTRSSYPL 347
           +KNSWG  WG  GY+ + R++   CG+ T++SYPL
Sbjct: 295 VKNSWGERWGMEGYIMMSRNKNNQCGVATQASYPL 329


>gi|209693435|ref|NP_001129410.1| cathepsin L precursor [Acyrthosiphon pisum]
 gi|251823771|ref|NP_001156569.1| cathepsin L precursor [Acyrthosiphon pisum]
          Length = 341

 Score =  248 bits (634), Expect = 2e-63,   Method: Compositional matrix adjust.
 Identities = 138/342 (40%), Positives = 197/342 (57%), Gaps = 17/342 (4%)

Query: 20  IIITLLVSCASQVVSSRSTHEQSVVEIHEKW---MAQHGRSYKDELEKEMRLKIFKENLE 76
           +I+  LV+ A   VSS + +E     I E+W     Q  + Y+D  E+  R K++ +N  
Sbjct: 4   VIVLGLVAFAISTVSSINLNEV----IEEEWSLFKIQFKKLYEDIKEETFRKKVYLDNKL 59

Query: 77  YIEKANK---EGNRTYKLGTNQFSDLTNDEFRALYTGYK--MPSPSHRSTTSSTFKYQNL 131
            I + NK    G  TY L  N F DL   E+  +  G+K  +       T      +   
Sbjct: 60  KIARHNKLYESGEETYALEMNHFGDLMQHEYTKMMNGFKPSLAGGDRNFTNDEAVTFLKS 119

Query: 132 SMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDC 191
               +P S+DWR KG VTP+KNQ +CG CW+F+A  ++EG    ++G L+ LSEQ L+DC
Sbjct: 120 ENVVIPKSVDWRKKGYVTPVKNQGQCGSCWSFSATGSLEGQHFRKTGVLVSLSEQNLIDC 179

Query: 192 STN-GNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVP 250
           S   GNNGC GG  + AF YI  N+G+ TE  YPY+A    C    + + A    + ++P
Sbjct: 180 SRKYGNNGCEGGLMDLAFKYIKSNKGLDTEKSYPYEAEDDKCRYNPENSGATDKGFVDIP 239

Query: 251 SGDEQALLKAV-SMQPVSIAIAAYSTEFQSYKEGIF-NGVC-GTQLDHAVTIVGFGTTED 307
            GDE AL+ A+ ++ PVSIAI A S +FQ YK+G+F N  C  T+LDH V  VGFG+ + 
Sbjct: 240 EGDEDALMHALATVGPVSIAIDASSEKFQFYKKGVFYNPRCSSTELDHGVLAVGFGSDKK 299

Query: 308 GANYWLIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYPLA 348
           G +YW++KNSWG TWGD GY+ + R+ +  CG+ + +SYPL 
Sbjct: 300 GGDYWIVKNSWGKTWGDEGYIMMARNKKNNCGVASSASYPLV 341


>gi|330803820|ref|XP_003289900.1| hypothetical protein DICPUDRAFT_80649 [Dictyostelium purpureum]
 gi|325080011|gb|EGC33585.1| hypothetical protein DICPUDRAFT_80649 [Dictyostelium purpureum]
          Length = 328

 Score =  248 bits (634), Expect = 2e-63,   Method: Compositional matrix adjust.
 Identities = 139/339 (41%), Positives = 200/339 (58%), Gaps = 26/339 (7%)

Query: 19  FIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYI 78
            +   L+V+C S   ++R   ++      + WM +H +SY ++ E   R  IF++N++++
Sbjct: 7   LVFCFLIVNCIS---AARVFSQKQYQTAFQNWMVKHQKSYTND-EFGSRYTIFQDNMDFV 62

Query: 79  EKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNL--SMTDV 136
            K N++G+ T  LG N  +DLTN E++ +Y G           T +T K  NL   +TDV
Sbjct: 63  TKWNQKGSDTI-LGLNSMADLTNQEYQRIYLG-----------TKTTVKKPNLIIGVTDV 110

Query: 137 ---PTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS- 192
              P S+DWR  GAVT +KNQ +CG C++F+   +VEGI +I S  L+ LSEQQ+LDCS 
Sbjct: 111 SKAPASVDWRANGAVTAVKNQGQCGGCYSFSTTGSVEGIHEITSKQLVSLSEQQILDCSG 170

Query: 193 TNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSG 252
           + GNNGC GG    +F YII   G+ TE  YPY+ V G C   +    A I+ Y+ V SG
Sbjct: 171 SEGNNGCDGGLMTNSFEYIIAVGGLDTEASYPYEGVVGKCKFNKANIGATITGYKNVKSG 230

Query: 253 DEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFN--GVCGTQLDHAVTIVGFGTTEDGAN 310
            E  L  AV+ QPVS+AI A    FQ Y  G++       TQLDH V  VG+G ++ G +
Sbjct: 231 SESDLQTAVAAQPVSVAIDASQNSFQLYSSGVYYEPACSSTQLDHGVLAVGYG-SQSGQD 289

Query: 311 YWLIKNSWGNTWGDAGYMKIVRDE-GLCGIGTRSSYPLA 348
           YW++KNSWG  WG+ G++ + R++   CGI T +SYP A
Sbjct: 290 YWIVKNSWGADWGEKGFILMARNKHNNCGIATMASYPTA 328


>gi|392884266|gb|AFM90965.1| cathepsin L [Callorhinchus milii]
          Length = 338

 Score =  248 bits (634), Expect = 2e-63,   Method: Compositional matrix adjust.
 Identities = 135/312 (43%), Positives = 190/312 (60%), Gaps = 16/312 (5%)

Query: 48  EKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKE---GNRTYKLGTNQFSDLTNDEF 104
           E+W + HG+SY ++ E+  R  +++E+L  IE  N E   G  +++LG N F D+ N+EF
Sbjct: 30  EQWKSWHGKSY-EQKEETWRRMVWEEHLRVIEIHNLEHSLGKHSFRLGMNHFGDMPNEEF 88

Query: 105 RALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFA 164
           R L  GYK    +H+    S F   N    +VP  +DWRD+G VTP+K+Q +CG CWAF+
Sbjct: 89  RQLMNGYKYKQ-THKKLQGSHFLEPNF--LEVPKHVDWRDEGYVTPVKDQGQCGSCWAFS 145

Query: 165 AVAAVEGITKIRSGNLIQLSEQQLLDCST-NGNNGCLGGSREKAFAYIIQNQGIATEDEY 223
              A+EG    R+G L+ LSEQ L++CS   GN GC GG  ++AF Y+  N GI +ED Y
Sbjct: 146 TTGALEGQHFRRTGQLVSLSEQNLVECSKPEGNEGCNGGLMDQAFQYVKDNGGIDSEDSY 205

Query: 224 PYQAVPGT-CSAAQKPAAAKISNYEEVPSGDEQALLKAV-SMQPVSIAIAAYSTEFQSYK 281
           PY     T C    +  AA  + + ++PSG E+AL+KA+ ++ PVS+AI A  T FQ Y+
Sbjct: 206 PYVGTDDTPCHYNPQYNAANDTGFVDIPSGKERALMKAIAAVGPVSVAIDAGHTSFQFYQ 265

Query: 282 EGI-FNGVC-GTQLDHAVTIVGFGTTE---DGANYWLIKNSWGNTWGDAGYMKIVRD-EG 335
            GI F   C  T LDH V +VG+G  +   DG  YW++KNSW   WG  GY+ + +D + 
Sbjct: 266 SGIYFEAECSSTDLDHGVLVVGYGVEKRDTDGKKYWIVKNSWSEKWGQNGYILMAKDKDN 325

Query: 336 LCGIGTRSSYPL 347
            CGI T +SYPL
Sbjct: 326 HCGIATAASYPL 337


>gi|81294188|gb|AAI08032.1| Cathepsin L, 1 b [Danio rerio]
          Length = 336

 Score =  248 bits (633), Expect = 3e-63,   Method: Compositional matrix adjust.
 Identities = 134/340 (39%), Positives = 202/340 (59%), Gaps = 17/340 (5%)

Query: 20  IIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIE 79
           ++  LLV+     V +  + +  + +    W +QHG+SY +++E   R+ I++ENL  IE
Sbjct: 1   MMFALLVTLYISAVFAAPSIDIQLDDHWNSWKSQHGKSYHEDVEVGRRM-IWEENLRKIE 59

Query: 80  KANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDV 136
           + N E   GN T+K+G NQF D+TN+EFR    GY        + TS    +   S    
Sbjct: 60  QHNFEYSYGNHTFKMGMNQFGDMTNEEFRQAMNGYTHDP----NQTSQGPLFMEPSFFAA 115

Query: 137 PTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS-TNG 195
           P  +DWR +G VTP+K+QK+CG CW+F++  A+EG    ++G LI +SEQ L+DCS   G
Sbjct: 116 PQQVDWRQRGYVTPVKDQKQCGSCWSFSSTGALEGQLFRKTGKLISMSEQNLVDCSRPQG 175

Query: 196 NNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPG-TCSAAQKPAAAKISNYEEVPSGDE 254
           N GC GG  + AF Y+ +N+G+ +E  YPY A     C    +   AKI+ + ++PSG+E
Sbjct: 176 NQGCNGGLMDLAFQYVKENKGLDSEQSYPYLARDDLPCRYDPRFNVAKITGFVDIPSGNE 235

Query: 255 QALLKAV-SMQPVSIAIAAYSTEFQSYKEGIF--NGVCGTQLDHAVTIVGF---GTTEDG 308
            AL+ AV ++ PVS+AI A     Q Y+ GI+       ++LDHAV +VG+   G    G
Sbjct: 236 LALMNAVAAVGPVSVAIDASHQSLQFYQSGIYYERACSSSRLDHAVLVVGYGYQGADVAG 295

Query: 309 ANYWLIKNSWGNTWGDAGYMKIVRDE-GLCGIGTRSSYPL 347
             YW++KNSW + WGD GY+ + +D+   CG+ T++SYPL
Sbjct: 296 NRYWIVKNSWSDKWGDKGYIYMAKDKNNHCGVATKASYPL 335


>gi|728637|emb|CAA59441.1| cathepsin l [Litopenaeus vannamei]
          Length = 326

 Score =  248 bits (633), Expect = 3e-63,   Method: Compositional matrix adjust.
 Identities = 127/313 (40%), Positives = 191/313 (61%), Gaps = 17/313 (5%)

Query: 46  IHEKWM---AQHGRSYKDELEKEMRLKIFKENLEYIEKAN---KEGNRTYKLGTNQFSDL 99
           + ++W    A+HGR Y    E+  RL +F++N ++I+  N   + G  T+ L  NQF D+
Sbjct: 19  LRQQWQNFKAEHGRRYASVQEERYRLSVFEQNQQFIDDHNARFENGEVTFTLQMNQFGDM 78

Query: 100 TNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGC 159
           T++E  A   G+ + +P+ R   ++  K  + ++   P  +DWR KGAVTP+K+QK+CG 
Sbjct: 79  TSEEIVATMNGF-LGAPTRRP--AAVLKADDETL---PEKVDWRTKGAVTPVKDQKQCGS 132

Query: 160 CWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GNNGCLGGSREKAFAYIIQNQGIA 218
           CWAF+   ++EG   ++ G L+ LSEQ L+DCS   GN GC+GG  ++AF YI  N+GI 
Sbjct: 133 CWAFSTTGSLEGQHFLKDGKLVSLSEQNLVDCSDKFGNMGCMGGLMDQAFRYIKANKGID 192

Query: 219 TEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAV-SMQPVSIAIAAYSTEF 277
           TED YPY+A  G C        A  + Y +V  G E AL KAV ++ P+S+ I A  + F
Sbjct: 193 TEDSYPYEAQDGKCRFDASNVGATDTGYVDVEHGSESALKKAVATIGPISVGIDASQSTF 252

Query: 278 QSYKEGIFNG--VCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRDE- 334
             Y  G+++      T LDH V  VG+G+ E+G ++WL+KNSW  +WGD GY+K+ R+  
Sbjct: 253 HFYHTGVYHDDHCSSTMLDHGVLAVGYGSDENGGDFWLVKNSWNTSWGDKGYIKMSRNRN 312

Query: 335 GLCGIGTRSSYPL 347
             CGI +++SYPL
Sbjct: 313 NNCGIASQASYPL 325


>gi|1709576|sp|P05994.3|PAPA4_CARPA RecName: Full=Papaya proteinase 4; AltName: Full=Glycyl
           endopeptidase; AltName: Full=Papaya peptidase B;
           AltName: Full=Papaya proteinase IV; Short=PPIV; Flags:
           Precursor
 gi|953176|emb|CAA54974.1| proteinase IV [Carica papaya]
          Length = 348

 Score =  248 bits (633), Expect = 3e-63,   Method: Compositional matrix adjust.
 Identities = 137/315 (43%), Positives = 194/315 (61%), Gaps = 12/315 (3%)

Query: 38  THEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFS 97
           T  + ++++   WM +H ++YK+  EK  R +IFK+NL+YI++ NK  N  Y LG N+FS
Sbjct: 39  TSTERLIQLFNSWMLKHNKNYKNVDEKLYRFEIFKDNLKYIDERNKMIN-GYWLGLNEFS 97

Query: 98  DLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKEC 157
           DL+NDEF+  Y G     P   +      ++ N  + D+P S+DWR KGAVTP+K+Q  C
Sbjct: 98  DLSNDEFKEKYVG---SLPEDYTNQPYDEEFVNEDIVDLPESVDWRAKGAVTPVKHQGYC 154

Query: 158 GCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGI 217
             CWAF+ VA VEGI KI++GNL++LSEQ+L+DC    + GC  G +  +  Y+ QN GI
Sbjct: 155 ESCWAFSTVATVEGINKIKTGNLVELSEQELVDCDKQ-SYGCNRGYQSTSLQYVAQN-GI 212

Query: 218 ATEDEYPYQAVPGTCSAAQKPAA-AKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTE 276
               +YPY A   TC A Q      K +    V S +E +LL A++ QPVS+ + +   +
Sbjct: 213 HLRAKYPYIAKQQTCRANQVGGPKVKTNGVGRVQSNNEGSLLNAIAHQPVSVVVESAGRD 272

Query: 277 FQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR---- 332
           FQ+YK GIF G CGT++DHAVT VG+G +       LIKNSWG  WG+ GY++I R    
Sbjct: 273 FQNYKGGIFEGSCGTKVDHAVTAVGYGKSGGKGYI-LIKNSWGPGWGENGYIRIRRASGN 331

Query: 333 DEGLCGIGTRSSYPL 347
             G+CG+   S YP+
Sbjct: 332 SPGVCGVYRSSYYPI 346


>gi|269954686|ref|NP_954599.2| uncharacterized protein LOC218275 precursor [Mus musculus]
          Length = 330

 Score =  248 bits (633), Expect = 3e-63,   Method: Compositional matrix adjust.
 Identities = 138/339 (40%), Positives = 200/339 (58%), Gaps = 20/339 (5%)

Query: 16  TPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENL 75
           TP+F++ TL +     VVS+   H+ S+  + E+W  +H ++Y    E + R  +++ N+
Sbjct: 2   TPVFLLATLCLG----VVSAAPAHDPSLDAVWEEWKTKHRKTYNMNEEAQKR-AVWENNM 56

Query: 76  EYIEKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLS 132
           + I   N++   G   + L  N F DLTN EFR L TG++  S  H+  T     +Q   
Sbjct: 57  KMIGLHNEDYLKGKHGFNLEMNAFGDLTNTEFRELMTGFQ--SMGHKEMTI----FQEPL 110

Query: 133 MTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS 192
           + DVP S+DWRD G VTP+K+Q  CG CWAF+AV ++EG    ++G L+ LSEQ L+DCS
Sbjct: 111 LGDVPKSVDWRDHGYVTPVKDQGHCGSCWAFSAVGSLEGQIFRKTGKLVPLSEQNLMDCS 170

Query: 193 -TNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPS 251
            + GN GC GG  E AF Y+ +N+G+ T + Y Y+A  G C    K +A  I+ + +VP 
Sbjct: 171 WSYGNVGCNGGLMELAFQYVKENRGLDTRESYAYEAWDGPCRYDPKYSAVNITGFVKVPL 230

Query: 252 GDEQALLKAV-SMQPVSIAIAAYSTEFQSYKEGIF--NGVCGTQLDHAVTIVGFGTTEDG 308
             E AL+ AV S+ PVS+ I  +   F+ Y+ G +       T LDHAV +VG+G   DG
Sbjct: 231 -SEDALMNAVASVGPVSVGIDTHHHSFRFYRGGTYYEPDCSSTNLDHAVLVVGYGEESDG 289

Query: 309 ANYWLIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYP 346
             YWL+KNSWG  WG  GY+K+ +D +  CGI T + YP
Sbjct: 290 RKYWLVKNSWGEDWGMDGYIKMAKDRDNNCGIATYAIYP 328


>gi|74211558|dbj|BAE26509.1| unnamed protein product [Mus musculus]
          Length = 338

 Score =  248 bits (633), Expect = 3e-63,   Method: Compositional matrix adjust.
 Identities = 138/339 (40%), Positives = 200/339 (58%), Gaps = 20/339 (5%)

Query: 16  TPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENL 75
           TP+F++ TL +     VVS+   H+ S+  + E+W  +H ++Y    E + R  +++ N+
Sbjct: 10  TPVFLLATLCLG----VVSAAPAHDPSLDAVWEEWKTKHRKTYNMNEEAQKR-AVWENNM 64

Query: 76  EYIEKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLS 132
           + I   N++   G   + L  N F DLTN EFR L TG++  S  H+  T     +Q   
Sbjct: 65  KMIGLHNEDYLKGKHGFNLEMNAFGDLTNTEFRELMTGFQ--SMGHKEMTI----FQEPL 118

Query: 133 MTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS 192
           + DVP S+DWRD G VTP+K+Q  CG CWAF+AV ++EG    ++G L+ LSEQ L+DCS
Sbjct: 119 LGDVPKSVDWRDHGYVTPVKDQGHCGSCWAFSAVGSLEGQIFRKTGKLVPLSEQNLMDCS 178

Query: 193 -TNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPS 251
            + GN GC GG  E AF Y+ +N+G+ T + Y Y+A  G C    K +A  I+ + +VP 
Sbjct: 179 WSYGNVGCNGGLMELAFQYVKENRGLDTRESYAYEAWDGPCRYDPKYSAVNITGFVKVPL 238

Query: 252 GDEQALLKAV-SMQPVSIAIAAYSTEFQSYKEGIF--NGVCGTQLDHAVTIVGFGTTEDG 308
             E AL+ AV S+ PVS+ I  +   F+ Y+ G +       T LDHAV +VG+G   DG
Sbjct: 239 -SEDALMNAVASVGPVSVGIDTHHHSFRFYRGGTYYEPDCSSTNLDHAVLVVGYGEESDG 297

Query: 309 ANYWLIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYP 346
             YWL+KNSWG  WG  GY+K+ +D +  CGI T + YP
Sbjct: 298 RKYWLVKNSWGEDWGMDGYIKMAKDRDNNCGIATYAIYP 336


>gi|186701255|gb|ACC91281.1| putative cysteine proteinase [Capsella rubella]
          Length = 324

 Score =  248 bits (633), Expect = 3e-63,   Method: Compositional matrix adjust.
 Identities = 140/342 (40%), Positives = 200/342 (58%), Gaps = 42/342 (12%)

Query: 15  TTPMFIIITLLVSCASQ--VVSSRSTHEQSVVEIHEKWMAQHGRSYKDEL-EKEMRLKIF 71
           T  + II  L  S A    V S      + V  I + WM++HG++Y + L +KE R + F
Sbjct: 11  TLSLLIIFLLPPSSAMDLSVTSGGLRSNEEVGFIFQTWMSKHGKTYTNALGDKEQRFQNF 70

Query: 72  KENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNL 131
           K+NL +I++ N + N +Y+LG  QF+DLT  E++ L++G  +     +     T +Y  L
Sbjct: 71  KDNLRFIDQHNAK-NLSYRLGLTQFADLTVQEYQDLFSGRPI---QKQKALRVTHRYVPL 126

Query: 132 SMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDC 191
           +   +P S+DWR KGAV+ IK+Q  C           VE I KI +G LI LSEQ+L+DC
Sbjct: 127 AEDQLPQSVDWRQKGAVSEIKDQGRC----------TVESINKIVTGELISLSEQELVDC 176

Query: 192 STNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPA--AAKISNYEEV 249
           S + N+GC GG  + AF ++I N G+  + +YPYQAV G C+  Q  +    KI  YE+V
Sbjct: 177 SID-NHGCNGGLMDSAFQFLINNNGLEYQSDYPYQAVQGYCNHNQNTSKKVIKIDGYEDV 235

Query: 250 PSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGA 309
           P+ +E +L KAV+ QP                 GI+ G CGT LDHAV IVG+GT E+G 
Sbjct: 236 PANNENSLQKAVAHQP-----------------GIYTGPCGTDLDHAVVIVGYGT-ENGQ 277

Query: 310 NYWLIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPL 347
           +YW+++NSWG  WG+AGY KI R+     G+CGI   +SYP+
Sbjct: 278 DYWIVRNSWGTVWGEAGYAKIARNFENPTGVCGIAMVASYPI 319


>gi|148709355|gb|EDL41301.1| cDNA sequence BC051665 [Mus musculus]
          Length = 349

 Score =  248 bits (633), Expect = 3e-63,   Method: Compositional matrix adjust.
 Identities = 138/339 (40%), Positives = 200/339 (58%), Gaps = 20/339 (5%)

Query: 16  TPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENL 75
           TP+F++ TL +     VVS+   H+ S+  + E+W  +H ++Y    E + R  +++ N+
Sbjct: 21  TPVFLLATLCLG----VVSAAPAHDPSLDAVWEEWKTKHRKTYNMNEEAQKR-AVWENNM 75

Query: 76  EYIEKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLS 132
           + I   N++   G   + L  N F DLTN EFR L TG++  S  H+  T     +Q   
Sbjct: 76  KMIGLHNEDYLKGKHGFNLEMNAFGDLTNTEFRELMTGFQ--SMGHKEMTI----FQEPL 129

Query: 133 MTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS 192
           + DVP S+DWRD G VTP+K+Q  CG CWAF+AV ++EG    ++G L+ LSEQ L+DCS
Sbjct: 130 LGDVPKSVDWRDHGYVTPVKDQGHCGSCWAFSAVGSLEGQIFRKTGKLVPLSEQNLMDCS 189

Query: 193 -TNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPS 251
            + GN GC GG  E AF Y+ +N+G+ T + Y Y+A  G C    K +A  I+ + +VP 
Sbjct: 190 WSYGNVGCNGGLMELAFQYVKENRGLDTRESYAYEAWDGPCRYDPKYSAVNITGFVKVPL 249

Query: 252 GDEQALLKAV-SMQPVSIAIAAYSTEFQSYKEGIF--NGVCGTQLDHAVTIVGFGTTEDG 308
             E AL+ AV S+ PVS+ I  +   F+ Y+ G +       T LDHAV +VG+G   DG
Sbjct: 250 -SEDALMNAVASVGPVSVGIDTHHHSFRFYRGGTYYEPDCSSTNLDHAVLVVGYGEESDG 308

Query: 309 ANYWLIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYP 346
             YWL+KNSWG  WG  GY+K+ +D +  CGI T + YP
Sbjct: 309 RKYWLVKNSWGEDWGMDGYIKMAKDRDNNCGIATYAIYP 347


>gi|30388235|gb|AAH51665.1| CDNA sequence BC051665 [Mus musculus]
          Length = 330

 Score =  248 bits (633), Expect = 3e-63,   Method: Compositional matrix adjust.
 Identities = 138/339 (40%), Positives = 200/339 (58%), Gaps = 20/339 (5%)

Query: 16  TPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENL 75
           TP+F++ TL +     VVS+   H+ S+  + E+W  +H ++Y    E + R  +++ N+
Sbjct: 2   TPVFLLATLCLG----VVSAAPAHDPSLDAVWEEWKTKHRKTYSMNEEAQKR-AVWENNM 56

Query: 76  EYIEKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLS 132
           + I   N++   G   + L  N F DLTN EFR L TG++  S  H+  T     +Q   
Sbjct: 57  KMIGLHNEDYLKGKHGFNLEMNAFGDLTNTEFRELMTGFQ--SMGHKEMTI----FQEPL 110

Query: 133 MTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS 192
           + DVP S+DWRD G VTP+K+Q  CG CWAF+AV ++EG    ++G L+ LSEQ L+DCS
Sbjct: 111 LGDVPKSVDWRDHGYVTPVKDQGHCGSCWAFSAVGSLEGQIFRKTGKLVPLSEQNLMDCS 170

Query: 193 -TNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPS 251
            + GN GC GG  E AF Y+ +N+G+ T + Y Y+A  G C    K +A  I+ + +VP 
Sbjct: 171 WSYGNVGCNGGLMELAFQYVKENRGLDTRESYAYEAWDGPCRYDPKYSAVNITGFVKVPL 230

Query: 252 GDEQALLKAV-SMQPVSIAIAAYSTEFQSYKEGIF--NGVCGTQLDHAVTIVGFGTTEDG 308
             E AL+ AV S+ PVS+ I  +   F+ Y+ G +       T LDHAV +VG+G   DG
Sbjct: 231 -SEDALMNAVASVGPVSVGIDTHHHSFRFYRGGTYYEPDCSSTNLDHAVLVVGYGEESDG 289

Query: 309 ANYWLIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYP 346
             YWL+KNSWG  WG  GY+K+ +D +  CGI T + YP
Sbjct: 290 RKYWLVKNSWGEDWGMDGYIKMAKDRDNNCGIATYAIYP 328


>gi|94480716|emb|CAI91577.1| cathepsin L [Aphrocallistes vastus]
          Length = 329

 Score =  248 bits (633), Expect = 3e-63,   Method: Compositional matrix adjust.
 Identities = 138/333 (41%), Positives = 201/333 (60%), Gaps = 10/333 (3%)

Query: 18  MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
           M ++  LL   A   V  +   E+      E W  ++ RSY   L++E+R KI+  N+ Y
Sbjct: 1   MKLVFLLLGLFAGACVCLQCETEEVQDFAWEGWKLKYNRSYG--LDEELRKKIWANNMLY 58

Query: 78  IEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVP 137
           +++ N EG+ +YKL  NQF+DLTN E+R +Y GY   +   R      F+ + +   D+P
Sbjct: 59  VKEFNAEGH-SYKLAANQFADLTNLEYRQIYLGYDNEARLSRKREGKVFQ-RKMKDEDLP 116

Query: 138 TSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GN 196
           T++DWR KG VTP+KNQ +CG CW+F+A  ++EG   I+SG L+  SEQ+L+DCST+ GN
Sbjct: 117 TTVDWRSKGVVTPVKNQGQCGSCWSFSATGSLEGQYAIKSGKLVSFSEQELVDCSTSLGN 176

Query: 197 NGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQA 256
           +GC GG  + AF Y   N     E +Y Y A  G C    +    K S++ ++PS +  A
Sbjct: 177 HGCQGGLMDYAFKYWETNLA-EKESDYTYTAKNGKCKYNAQLGVTKDSSFTDIPSENCDA 235

Query: 257 LLKAVSMQ-PVSIAIAAYSTEFQSYKEGIFNG-VCG-TQLDHAVTIVGFGTTEDGANYWL 313
           L +AV+ + P+++A+ A  T FQ Y  GI+   +C  T+LDH V +VG+G T++G +YWL
Sbjct: 236 LKEAVANKGPIAVAMDASHTSFQMYHSGIYTPFLCSKTKLDHGVLVVGYG-TDNGVDYWL 294

Query: 314 IKNSWGNTWGDAGYMKIVRDEGLCGIGTRSSYP 346
           IKNSWG  WG  GY KI      CGI T++SYP
Sbjct: 295 IKNSWGMAWGMDGYFKIEMKSDKCGICTQASYP 327


>gi|162463334|ref|NP_001104878.1| maize insect resistance2 precursor [Zea mays]
 gi|2425064|gb|AAB88262.1| cysteine proteinase Mir2 [Zea mays]
          Length = 493

 Score =  248 bits (632), Expect = 4e-63,   Method: Compositional matrix adjust.
 Identities = 129/290 (44%), Positives = 184/290 (63%), Gaps = 11/290 (3%)

Query: 67  RLKIFKENLEYIEKANKE---GNRTYKLGTNQFSDLTNDEFRA-LYTGYKMPSPSHRSTT 122
           RL++F++NL YI+  N E   G   ++LG  +F+DLT +E+RA L  G +  + +     
Sbjct: 92  RLEVFRDNLRYIDAHNAEADAGLHGFRLGLTRFADLTLEEYRARLLLGSRGRNGTAVGVV 151

Query: 123 SSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQ 182
               +Y  L+   +P ++DWR++GAV  +K+Q +CG CWAF+AVAAVEGI KI +G+LI 
Sbjct: 152 GRR-RYLPLAGEQLPDAVDWRERGAVAEVKDQGQCGGCWAFSAVAAVEGINKIVTGSLIS 210

Query: 183 LSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQK-PAAA 241
           LSEQ+L+DC    + GC GG  + AF ++I+N GI TE +YP+    GTC    K     
Sbjct: 211 LSEQELIDCDKFQDQGCDGGLMDNAFVFMIKNGGIDTEADYPFTGHDGTCDLKLKNTRVV 270

Query: 242 KISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVG 301
            I ++E VP   E+AL KAV+ QPVS +I A    FQ Y  GIF+G CGT LDH VT+VG
Sbjct: 271 SIDSFERVPINYERALQKAVAHQPVSASIEASRRAFQLYSSGIFDGRCGTYLDHGVTVVG 330

Query: 302 FGTTEDGANYWLIKNSWGNTWGDAGYMKIVRDEGL----CGIGTRSSYPL 347
           +G +E G +YW++KNSWG  WG+AGY+++ R+  +     GI     YP+
Sbjct: 331 YG-SEGGKDYWIVKNSWGTQWGEAGYVRMARNVRVRPPSAGIAMEPLYPV 379


>gi|126681066|gb|ABO26562.1| cathepsin L-like cysteine protease [Ixodes ricinus]
          Length = 335

 Score =  248 bits (632), Expect = 4e-63,   Method: Compositional matrix adjust.
 Identities = 128/304 (42%), Positives = 186/304 (61%), Gaps = 9/304 (2%)

Query: 52  AQHGRSYKDELEKEMRLKIFKENLEYIEKANKE---GNRTYKLGTNQFSDLTNDEFRALY 108
           A+HG+SY  E E+  RLKI+ EN   I K N++   G   Y +  N+F D+ + EF +  
Sbjct: 32  AKHGKSYVSETEEVFRLKIYMENRHKIAKHNEKYARGEVPYSMAMNEFGDMLHHEFVSTR 91

Query: 109 TGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAA 168
            G+K          S+  + +N+    +P ++DWR KGAVTP+KNQ +CG CWAF+A  +
Sbjct: 92  NGFKRNYKDQPREGSTYLEPENIEDFSLPKTVDWRTKGAVTPVKNQGQCGSCWAFSATGS 151

Query: 169 VEGITKIRSGNLIQLSEQQLLDCSTN-GNNGCLGGSREKAFAYIIQNQGIATEDEYPYQA 227
           +EG    +SG+++ LSEQ L+ CST+ GNNGC GG  + AF YI  N+GI TE  YPY  
Sbjct: 152 LEGQHFRKSGSMVSLSEQNLVGCSTDFGNNGCEGGLMDDAFKYIRANKGIDTEKSYPYNG 211

Query: 228 VPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAV-SMQPVSIAIAAYSTEFQSYKEGIFN 286
             GTC   +    A  S + ++  G E  L KAV ++ P+S+AI A    FQ Y +G+++
Sbjct: 212 TDGTCHFKKSTVGATDSGFVDIKEGSETQLKKAVATVGPISVAIDASHESFQFYSDGVYD 271

Query: 287 GV-CGTQ-LDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRS 343
              C ++ LDH V +VG+GT  +G +YW +KNSWG TWGD GY+++ R+ +  CGI + +
Sbjct: 272 EPECDSESLDHGVLVVGYGTL-NGTDYWFVKNSWGTTWGDEGYIRMSRNKKNQCGIASSA 330

Query: 344 SYPL 347
           S PL
Sbjct: 331 SIPL 334


>gi|156397875|ref|XP_001637915.1| predicted protein [Nematostella vectensis]
 gi|156225031|gb|EDO45852.1| predicted protein [Nematostella vectensis]
          Length = 331

 Score =  248 bits (632), Expect = 4e-63,   Method: Compositional matrix adjust.
 Identities = 131/304 (43%), Positives = 186/304 (61%), Gaps = 11/304 (3%)

Query: 50  WMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYT 109
           W + HG+ Y ++ E+ MR  I++ NL+ I   N EG  ++KL  N   D+T+ E      
Sbjct: 32  WKSFHGKEYPNKNEETMRNFIWQNNLKKIVTHN-EGKHSFKLAMNHLGDMTSLEISQTLL 90

Query: 110 GYKMPSPSHRSTTSSTF-KYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAA 168
           G K+   +      +TF    N+ + D   S+DWR KG VTP+KNQ +CG CWAF+   A
Sbjct: 91  GLKLKKHAESQPKGATFLPPANVKVVD---SIDWRSKGYVTPVKNQGQCGSCWAFSTTGA 147

Query: 169 VEGITKIRSGNLIQLSEQQLLDCSTN-GNNGCLGGSREKAFAYIIQNQGIATEDEYPYQA 227
           +EG    ++G L+ LSEQ L+DCS   GNNGC GG  + AF YI +N GI TE  YPY A
Sbjct: 148 LEGQHFRKTGKLVSLSEQNLVDCSGKYGNNGCEGGLMDNAFQYIKENGGIDTEKSYPYLA 207

Query: 228 VPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAV-SMQPVSIAIAAYSTEFQSYKEGIFN 286
             G C   +    AK + + ++P+GDE AL +A+ S+ P+SIAI A  + F  Y +G+++
Sbjct: 208 KDGVCHYNKSAIGAKDTGFVDIPTGDENALQQALASVGPISIAIDASQSTFHFYHQGVYD 267

Query: 287 --GVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR-DEGLCGIGTRS 343
                 T+LDH V  VG+G T+DG +YWL+KNSWG +WG+ GY+KI R D   CG+ +++
Sbjct: 268 DPDCSSTRLDHGVLAVGYG-TDDGKDYWLVKNSWGPSWGEEGYIKIARNDHDKCGVASKA 326

Query: 344 SYPL 347
           SYPL
Sbjct: 327 SYPL 330


>gi|320164780|gb|EFW41679.1| cathepsin L2 [Capsaspora owczarzaki ATCC 30864]
          Length = 334

 Score =  248 bits (632), Expect = 4e-63,   Method: Compositional matrix adjust.
 Identities = 130/305 (42%), Positives = 182/305 (59%), Gaps = 7/305 (2%)

Query: 48  EKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRAL 107
           E W    G+SY D +E+  R  +++ N   ++  N  G  +Y LG N F+DLT++EF+  
Sbjct: 31  EAWKRTFGKSYSDAVEEINRRAVWEANKMLVDAHNGAGIHSYTLGMNIFADLTHEEFKRF 90

Query: 108 YTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVA 167
           Y G K+     RS  SSTF     ++  +P S+DWR  G VTP+K+Q +CG CW+F+   
Sbjct: 91  YLGTKVDLNRPRSNFSSTF-IPTANVGALPDSVDWRTAGIVTPVKDQGQCGSCWSFSTTG 149

Query: 168 AVEGITKIRSGNLIQLSEQQLLDCS-TNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQ 226
           +VEG    ++G L+ LSEQ L+DCS   GN GC GG  + AF YII N+GI TE  YPY 
Sbjct: 150 SVEGQHARKTGQLVSLSEQNLVDCSKAQGNQGCNGGLMDDAFQYIITNKGIDTEASYPYT 209

Query: 227 AVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAV-SMQPVSIAIAAYSTEFQSYKEGIF 285
           A  GTC        A +S+++++  G E  L  AV ++ PVS+AI A    FQ Y  G++
Sbjct: 210 AKDGTCKFNAANVGATLSSFQDITRGSESDLQNAVATVGPVSVAIDASKNSFQLYTSGVY 269

Query: 286 N--GVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD-EGLCGIGTR 342
           N      T LDH V   G+GT+ +G  YWL+KNSWG++WG AGY+ + R+    CGI T 
Sbjct: 270 NEKKCSSTSLDHGVLAAGYGTS-NGTPYWLVKNSWGSSWGQAGYIWMSRNANNQCGIATS 328

Query: 343 SSYPL 347
           +SYP+
Sbjct: 329 ASYPI 333


>gi|94448674|emb|CAI91575.1| cathepsin L2 [Lubomirskia baicalensis]
          Length = 324

 Score =  247 bits (631), Expect = 5e-63,   Method: Compositional matrix adjust.
 Identities = 135/306 (44%), Positives = 188/306 (61%), Gaps = 15/306 (4%)

Query: 50  WMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKE-GNRTYKLGTNQFSDLTNDEFRALY 108
           W A+HG+SY++  E+ +R   ++ N +YI++ N+  G   Y L  NQF DL N EF++LY
Sbjct: 25  WKAEHGKSYRNHKEEMLRHVTWQANKKYIDEHNQHAGVFGYTLKMNQFGDLENSEFKSLY 84

Query: 109 TGYKMP-SPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVA 167
            GY+M  +P          + Q     D+P S+DW  KG VTP+KNQ +CG CW+F+A  
Sbjct: 85  NGYRMSNAPRKGKPFVPAARVQ-----DLPASVDWSKKGWVTPVKNQGQCGSCWSFSATG 139

Query: 168 AVEGITKIRSGNLIQLSEQQLLDCS-TNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQ 226
           ++EG     +G L+ LSEQ L+DCS   GN+GC GG  + AF Y+I+N GI TE  YPY+
Sbjct: 140 SMEGQHFNATGTLMSLSEQNLVDCSAAEGNHGCNGGLMDDAFEYVIKNNGIDTEASYPYR 199

Query: 227 AVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAV-SMQPVSIAIAAYSTEFQSYKEGIF 285
           AV  TC        A IS Y +V    E  L  AV ++ PVS+AI A    FQ Y  G++
Sbjct: 200 AVDSTCKFNTADVGATISGYVDVTKDSESDLQVAVATIGPVSVAIDASHISFQFYSSGVY 259

Query: 286 NG-VC-GTQLDHAVTIVGFGTTEDGA-NYWLIKNSWGNTWGDAGYMKIVRDE-GLCGIGT 341
           +  +C  T LDH V  VG+GT  DG+ +YWL+KNSWG +WG +GY+++VR+    CGI T
Sbjct: 260 DPLICSSTNLDHGVLAVGYGT--DGSKDYWLVKNSWGASWGMSGYIEMVRNHNNKCGIAT 317

Query: 342 RSSYPL 347
            +SYP+
Sbjct: 318 SASYPV 323


>gi|444519959|gb|ELV12909.1| Cathepsin L1 [Tupaia chinensis]
          Length = 333

 Score =  247 bits (631), Expect = 5e-63,   Method: Compositional matrix adjust.
 Identities = 135/341 (39%), Positives = 209/341 (61%), Gaps = 24/341 (7%)

Query: 18  MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
           +F+ I  L      + S+  TH+QS+ E   +W A+HG+ Y    E+ +R  ++++NL+ 
Sbjct: 5   LFLTILCL-----GIASAAPTHDQSLDEQWNQWTAEHGKVYSTG-EESLRRAVWEKNLKM 58

Query: 78  IEKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMT 134
           IE+ N E   G  T+ +G N F D+TN++FR + TG++    + +      F  Q     
Sbjct: 59  IEQHNLEYSQGKHTFTMGMNAFGDMTNEDFRQMMTGFQ----NQKYNKGEVF--QPPQPL 112

Query: 135 DVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS-T 193
           +VP S+DWR+KG VTP+KNQ  CG CWAF+A  A+EG    ++G L+ LSEQ L+DCS  
Sbjct: 113 EVPESVDWREKGYVTPVKNQHRCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSQP 172

Query: 194 NGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGD 253
             N+GC GG   KAF Y+  N G+ +E+ YPY+ +  TC  +   +AA ++ ++ +P+ +
Sbjct: 173 QHNSGCKGGLVIKAFQYVKDNGGLDSEESYPYEEMESTCRYSPGNSAATVTGFKHIPA-E 231

Query: 254 EQALLKAV-SMQPVSIAIAAYSTEFQSYKEGIFNGV-CGTQ-LDHAVTIVGFGTTEDGAN 310
           E+AL KAV S+ P+S+AI A+   FQ Y  GI +   C  + L+HAV +VG+G  ++G+N
Sbjct: 232 EKALEKAVASVGPISVAIDAHHHSFQFYTGGILHEPNCSPKWLNHAVLVVGYGVMQEGSN 291

Query: 311 ---YWLIKNSWGNTWGDAGYMKIVRDE-GLCGIGTRSSYPL 347
              YWL+KNSWG  WG  GY+ + +D+   CGI + + YP+
Sbjct: 292 NNTYWLVKNSWGERWGVGGYIMMAKDKNNHCGIASDALYPI 332


>gi|443724292|gb|ELU12369.1| hypothetical protein CAPTEDRAFT_165495 [Capitella teleta]
          Length = 351

 Score =  247 bits (631), Expect = 6e-63,   Method: Compositional matrix adjust.
 Identities = 131/306 (42%), Positives = 190/306 (62%), Gaps = 16/306 (5%)

Query: 54  HGRSYKDELEKEMRLKIFKENLEYIEKAN---KEGNRTYKLGTNQFSDLTNDEFRALYTG 110
           H R+Y  E E+  R ++F+ NL+ IE  N    +G  +Y++G NQF+D+   EF ++  G
Sbjct: 51  HERNYG-ETEEMQRKEVFRNNLKKIEMHNYLHSQGKSSYRMGINQFADMEVKEFASVVNG 109

Query: 111 YKMPSPSHRSTTSSTFKYQNLSM---TDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVA 167
           ++M   ++R+          +S      +P  +DWR +G VTPIK+Q  CG CW+F+   
Sbjct: 110 FRM---NNRTKVRDHLHSHYISPAIPVSLPAEVDWRKEGYVTPIKDQGHCGSCWSFSTTG 166

Query: 168 AVEGITKIRSGNLIQLSEQQLLDCSTN-GNNGCLGGSREKAFAYIIQNQGIATEDEYPYQ 226
           A+EG    ++G L+ LSEQ L+DCST+ GNNGC GG  + AF YI  N G  TED YPY+
Sbjct: 167 ALEGQHFRKTGKLVSLSEQNLIDCSTSYGNNGCNGGVMDYAFQYIKDNDGDDTEDSYPYE 226

Query: 227 AVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVSM-QPVSIAIAAYSTEFQSYKEGIF 285
           A  G C   ++   A  + Y ++P GDE+ + +AV+M  PVS+AI A  T FQ Y+ G++
Sbjct: 227 AADGPCRFKKEYVGATDTGYTDLPKGDEEKMKEAVAMVGPVSVAIDASHTSFQMYQSGVY 286

Query: 286 NGV-CGTQ-LDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRDE-GLCGIGTR 342
           + V C  + LDH V +VG+G TE G +YWL+KNSWG  WGD GY+K+ R++   CGI + 
Sbjct: 287 DEVECDPEGLDHGVLVVGYG-TELGQDYWLVKNSWGTKWGDEGYIKMSRNKNNQCGISSM 345

Query: 343 SSYPLA 348
           +SYPL 
Sbjct: 346 ASYPLV 351


>gi|443708542|gb|ELU03619.1| hypothetical protein CAPTEDRAFT_17807 [Capitella teleta]
          Length = 350

 Score =  247 bits (630), Expect = 6e-63,   Method: Compositional matrix adjust.
 Identities = 131/305 (42%), Positives = 189/305 (61%), Gaps = 16/305 (5%)

Query: 54  HGRSYKDELEKEMRLKIFKENLEYIEKAN---KEGNRTYKLGTNQFSDLTNDEFRALYTG 110
           H R+Y  E E+  R ++F+ NL+ I+  N   ++G   Y++G NQF+D+  +EF ++  G
Sbjct: 50  HERTYG-ETEESQRKEVFRNNLKKIQAHNHLHEQGKSPYRMGINQFADMEANEFASIMNG 108

Query: 111 YKMPSPSHRSTTSSTFKYQNLSM---TDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVA 167
           ++M   ++R+          +S      VP  +DWR +G VTP+KNQ +CG CWAF+   
Sbjct: 109 FRM---NNRTEVRDHLHANYISPAIPVSVPAEVDWRKEGYVTPVKNQGQCGSCWAFSTTG 165

Query: 168 AVEGITKIRSGNLIQLSEQQLLDCSTN-GNNGCLGGSREKAFAYIIQNQGIATEDEYPYQ 226
           ++EG    ++G L+ LSEQ L+DCST+ GN GC GG  + AF YI  N G  TE  YPY+
Sbjct: 166 SLEGQHFRKTGKLVSLSEQNLVDCSTSYGNEGCNGGIVDYAFQYIKDNDGDDTEACYPYE 225

Query: 227 AVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVSM-QPVSIAIAAYSTEFQSYKEGIF 285
           AV GTC        A  + Y ++P GDE  + +AV++  PVS+AI A  + FQ Y+ GI+
Sbjct: 226 AVDGTCRFKSVCVGATCTGYTDLPKGDEAKMKEAVALVGPVSVAIDASHSSFQMYQSGIY 285

Query: 286 --NGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD-EGLCGIGTR 342
                   QLDHAV +VG+G TE G +YWL+KNSWG TWGD GY+K+ R+ +  CGI ++
Sbjct: 286 VEQECSPKQLDHAVLVVGYG-TEQGQDYWLVKNSWGTTWGDEGYIKMARNMDNQCGIASQ 344

Query: 343 SSYPL 347
           +SYPL
Sbjct: 345 ASYPL 349


>gi|29165304|gb|AAO65603.1| cathepsin L precursor [Hydra vulgaris]
          Length = 324

 Score =  247 bits (630), Expect = 6e-63,   Method: Compositional matrix adjust.
 Identities = 131/305 (42%), Positives = 188/305 (61%), Gaps = 14/305 (4%)

Query: 49  KWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALY 108
           +W   H + Y  + E+ +R  I+K+N   I + N +G   + L  NQF D+TN EF+A +
Sbjct: 29  QWKMYHNKVYSHDGEETVRYTIWKDNERRIREHNLKGG-DFILKMNQFGDMTNSEFKA-F 86

Query: 109 TGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAA 168
            GY     SH+    STF   N  +   P ++DWR++G VTP+K+Q +CG CWAF+   +
Sbjct: 87  NGY----LSHKHVNGSTFLTPNNFV--APDTVDWRNEGYVTPVKDQGQCGSCWAFSTTGS 140

Query: 169 VEGITKIRSGNLIQLSEQQLLDCSTN-GNNGCLGGSREKAFAYIIQNQGIATEDEYPYQA 227
           +EG    ++G L+ LSEQ L+DCST  GNNGC GG  + AF YI +N+GI +E  YPY A
Sbjct: 141 LEGQHFKKTGKLVSLSEQNLVDCSTAYGNNGCDGGLMDNAFTYIKENKGIDSEASYPYTA 200

Query: 228 VPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAV-SMQPVSIAIAAYSTEFQSYKEGIFN 286
             G C   +   AA  + + ++P G+E  L +AV S+ P+S+AI A    FQ Y  G++N
Sbjct: 201 EDGKCVFKKSSVAATDTGFVDIPEGNENKLKEAVASVGPISVAIDASHESFQFYSSGVYN 260

Query: 287 --GVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRS 343
                 T+LDH V +VG+G TE G +YWL+KNSW  +WGD GY+K+ R+ +  CGI T++
Sbjct: 261 EPSCSSTELDHGVLVVGYG-TESGKDYWLVKNSWNTSWGDKGYIKMRRNAKNQCGIATKA 319

Query: 344 SYPLA 348
           SYPL 
Sbjct: 320 SYPLV 324


>gi|391332597|ref|XP_003740719.1| PREDICTED: cathepsin L-like [Metaseiulus occidentalis]
          Length = 330

 Score =  247 bits (630), Expect = 6e-63,   Method: Compositional matrix adjust.
 Identities = 139/337 (41%), Positives = 200/337 (59%), Gaps = 19/337 (5%)

Query: 20  IIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKE--MRLKIFKENLEY 77
           +I++L V+C    VS  +       E  E +  QH ++Y   L+K+   R  IF+ N++ 
Sbjct: 1   MILSLTVACIFVGVSPAAVDAHD--EHWELFKRQHNKTY---LQKQDVGRRAIFEANIKK 55

Query: 78  IEKAN---KEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMT 134
           I   N     G  +Y+LG N F+D+T DEF   Y G +  +   R    S  ++++    
Sbjct: 56  INAHNLLYDLGRSSYRLGLNGFADMTPDEFEK-YRGTRFEANEARV---SKLQHRDNRSM 111

Query: 135 DVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS-T 193
            VP ++DWR +G VTP+KNQ  CG CWAF+   A+EG    RSG+L+ LSEQ L+DCS  
Sbjct: 112 HVPDTVDWRTEGYVTPVKNQGVCGSCWAFSTTGALEGQHFRRSGDLVSLSEQMLVDCSAV 171

Query: 194 NGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGD 253
            GN GC GG  + AF +I    G+ TE  YPY    GTC    +   AK++ + +VPS D
Sbjct: 172 YGNAGCNGGLMDNAFRFIKDAGGLETEKSYPYTGKDGTCHFDARGIGAKLTGFVDVPSRD 231

Query: 254 EQALLKAVS-MQPVSIAIAAYSTEFQSYKEGIFNGVC--GTQLDHAVTIVGFGTTEDGAN 310
           E+AL +A   + PVS+AI A    FQ YK+G+++ +    T LDH V +VG+GTT DG +
Sbjct: 232 EEALKEAAGVVGPVSVAIDASGQNFQFYKDGVYDEITCSSTSLDHGVLVVGYGTTRDGKD 291

Query: 311 YWLIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYP 346
           YWL+KNSWG++WG +GY+++ R+ E  CGI T +SYP
Sbjct: 292 YWLVKNSWGSSWGQSGYIQMSRNKENQCGIATMASYP 328


>gi|391336140|ref|XP_003742440.1| PREDICTED: cathepsin L-like [Metaseiulus occidentalis]
          Length = 330

 Score =  247 bits (630), Expect = 6e-63,   Method: Compositional matrix adjust.
 Identities = 130/300 (43%), Positives = 189/300 (63%), Gaps = 13/300 (4%)

Query: 54  HGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRT--YKLGTNQFSDLTNDEFRALYTGY 111
           H +SY+D  E+ +R  IF++NL  IE+ N+       + LG N+F+D+TN EF  +  G 
Sbjct: 35  HLKSYRDGQEELIRRFIFEDNLHTIEEFNRVNASLAGFTLGVNEFADMTNTEFSNMLLGL 94

Query: 112 KMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEG 171
                  R+  +    +++  + D+P  +DW  KG VT +KNQ +CG CWAF+   ++EG
Sbjct: 95  -----GGRNKIAGDSVFESSHVQDLPAEVDWTQKGYVTEVKNQGQCGSCWAFSTTGSLEG 149

Query: 172 ITKIRSGNLIQLSEQQLLDCSTN-GNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPG 230
               ++G L+ LSEQ L+DCST+ GN GC GG  ++AF YI +N GI TE  YPY    G
Sbjct: 150 QVFKKTGKLVSLSEQNLVDCSTSEGNQGCNGGLMDQAFTYIKKNGGIDTEAAYPYTGSDG 209

Query: 231 TCSAAQKPAAAKISNYEEVPSGDEQALLKAV-SMQPVSIAIAAYSTEFQSYKEGIFNG-- 287
           TC   +    A +S + +V SGDE AL +AV ++ P+S+AI A S  FQ Y+ G++N   
Sbjct: 210 TCRFLENKVGATVSGFVDVKSGDENALKEAVATVGPISVAIDASSIFFQFYRGGVYNPWF 269

Query: 288 VCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYP 346
              T+LDH V +VG+G TE G +YWL+KNSWG++WG  GY+K+VR+ +  CGI T++SYP
Sbjct: 270 CSSTELDHGVLVVGYG-TEGGKDYWLVKNSWGSSWGLKGYIKMVRNKKNRCGIATQASYP 328


>gi|449673497|ref|XP_002169904.2| PREDICTED: cathepsin L-like [Hydra magnipapillata]
          Length = 325

 Score =  247 bits (630), Expect = 6e-63,   Method: Compositional matrix adjust.
 Identities = 133/334 (39%), Positives = 195/334 (58%), Gaps = 18/334 (5%)

Query: 19  FIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYI 78
            I ++L+  C   ++          V     W   H ++Y  E E+ +R  I+K+N+  I
Sbjct: 4   LIFVSLITLCFGYIIEKPIRESSWYV-----WKMAHNKAYSHESEENVRYAIWKDNMNRI 58

Query: 79  EKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPT 138
            + N + ++   L  N F D+TN EFRA   G  +    H+    STF     S T  P 
Sbjct: 59  TEYNSK-SKNVILRMNHFGDMTNTEFRAKMNGLLL----HKHQNGSTFLVP--SHTAAPD 111

Query: 139 SLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GNN 197
           ++DWR +G VTP+KNQ +CG CWAF++  A+EG    ++G L+ LSEQ L+DCST+ GNN
Sbjct: 112 AVDWRSEGYVTPVKNQGQCGSCWAFSSTGALEGQHFKKTGRLVSLSEQNLVDCSTDYGNN 171

Query: 198 GCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQAL 257
           GC GG  + AF+YI  N GI TE  YPY+   GTC  ++    A  + + ++P GDE AL
Sbjct: 172 GCNGGLMDNAFSYIKANGGIDTETGYPYEGQDGTCRYSKSSIGADDTGFVDIPEGDEDAL 231

Query: 258 LKAV-SMQPVSIAIAAYSTEFQSYKEGIFN--GVCGTQLDHAVTIVGFGTTEDGANYWLI 314
            +AV ++ PVS+AI A    FQ Y  G+++      + LDH V +VG+G T++G +YWL+
Sbjct: 232 KQAVATVGPVSVAIDASHMSFQFYHSGVYDEPQCSPSALDHGVLVVGYG-TDNGKDYWLV 290

Query: 315 KNSWGNTWGDAGYMKIVR-DEGLCGIGTRSSYPL 347
           KNSWG  WG  GY+ + R ++  CGI +++SYPL
Sbjct: 291 KNSWGTGWGTEGYIYMSRNNQNQCGIASKASYPL 324


>gi|326531188|dbj|BAK04945.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 360

 Score =  247 bits (630), Expect = 6e-63,   Method: Compositional matrix adjust.
 Identities = 137/350 (39%), Positives = 201/350 (57%), Gaps = 22/350 (6%)

Query: 17  PMFIIITLLVSCASQVVSSRSTHEQSVVEIHE--KWMAQHGRSYKDELEKEMRLKIFKEN 74
           P+    T+L++ A+   S R      ++ +     W A H +SY+   E+  R +++++N
Sbjct: 10  PVITASTILLAWAAAAASGRGVDVGDMLMMDRFLMWQATHNQSYRSAEERLRRFQVYRDN 69

Query: 75  LEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTS----------- 123
           +EYIE  N+ G+ TY+LG NQF+DLT +EF A +T Y           S           
Sbjct: 70  VEYIETTNRRGDLTYQLGENQFADLTREEFIARFTSYNGDDDRTGDDDSVITTAAVGGGD 129

Query: 124 -STFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCC-WAFAAVAAVEGITKIRSGNLI 181
              +      ++  P S+DWR KGAV P K+Q       WAF AVA +E +  I++G L+
Sbjct: 130 PDLWSSGGDDVSLDPPSVDWRAKGAVVPPKSQSSSCSSSWAFVAVATIESLHAIKTGKLV 189

Query: 182 QLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKP-AA 240
            LSEQQL+DC    + GC  G+  +AF ++IQN G+ TE EYPY A  GTC++A+     
Sbjct: 190 ALSEQQLVDCDQY-DGGCNRGTFRRAFHWVIQNGGLTTEAEYPYTAAQGTCNSAKSDHHV 248

Query: 241 AKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIV 300
           A IS +  VP  +E A+  AV+ QPV+ AI    ++ Q YK G+++G CG +L+HAVT+V
Sbjct: 249 AAISGHASVPGSNELAMKHAVATQPVAAAI-ELGSDMQFYKSGVYSGPCGARLEHAVTVV 307

Query: 301 GFGTTED-GANYWLIKNSWGNTWGDAGYMKIVRD---EGLCGIGTRSSYP 346
           G+G  E  G  YW++KNSWG TWG+ GY+++ R     GLCGI    +YP
Sbjct: 308 GYGADESTGDKYWIVKNSWGQTWGERGYIRMQRKILGPGLCGIMLDVAYP 357


>gi|21425246|emb|CAD33266.1| cathepsin L [Aphis gossypii]
          Length = 341

 Score =  247 bits (630), Expect = 7e-63,   Method: Compositional matrix adjust.
 Identities = 138/342 (40%), Positives = 196/342 (57%), Gaps = 17/342 (4%)

Query: 20  IIITLLVSCASQVVSSRSTHEQSVVEIHEKW---MAQHGRSYKDELEKEMRLKIFKENLE 76
           +I+  LV+ A   VSS + +E     I E+W     Q  + Y+D  E+  R K++ +N  
Sbjct: 4   VIVLGLVAFAISTVSSINLNEV----IEEEWSLFKIQFKKLYEDIKEETFRKKVYLDNKL 59

Query: 77  YIEKANK---EGNRTYKLGTNQFSDLTNDEFRALYTGYK--MPSPSHRSTTSSTFKYQNL 131
            I   NK    G  TY L  N F DL   E+  +  G+K  +       T      +   
Sbjct: 60  KIAGHNKLYESGEETYALEMNHFGDLMQHEYTKMMNGFKPSLAGGDRNFTNDEAVTFLKS 119

Query: 132 SMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDC 191
               +P S+DWR KG VTP+KNQ +CG CW+F+A  ++EG    ++G L+ LSEQ L+DC
Sbjct: 120 ENVVIPKSVDWRKKGYVTPVKNQGQCGSCWSFSATGSLEGQHFRKTGVLVSLSEQNLIDC 179

Query: 192 STN-GNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVP 250
           S   GNNGC GG  + AF YI  N+G+ TE  YPY+A    C    + + A    + ++P
Sbjct: 180 SRKYGNNGCEGGLMDLAFKYIKSNKGLDTEKSYPYEAEDDKCRYNPENSGATDKGFVDIP 239

Query: 251 SGDEQALLKAV-SMQPVSIAIAAYSTEFQSYKEGIF-NGVC-GTQLDHAVTIVGFGTTED 307
            GDE AL+ A+ ++ PVSIAI A S +FQ YK+G+F N  C  T+LDH V  VGFG+ + 
Sbjct: 240 EGDEDALMHALATVGPVSIAIDASSEKFQFYKKGVFYNPRCSSTELDHGVLAVGFGSDKK 299

Query: 308 GANYWLIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYPLA 348
           G +YW++KNSWG TWGD GY+ + R+ +  CG+ + +SYPL 
Sbjct: 300 GGDYWIVKNSWGKTWGDEGYIMMARNKKNNCGVASSASYPLV 341


>gi|392881548|gb|AFM89606.1| cathepsin L [Callorhinchus milii]
          Length = 338

 Score =  247 bits (630), Expect = 7e-63,   Method: Compositional matrix adjust.
 Identities = 134/312 (42%), Positives = 190/312 (60%), Gaps = 16/312 (5%)

Query: 48  EKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKE---GNRTYKLGTNQFSDLTNDEF 104
           E+W + HG+SY ++ E+  R  +++++L  IE  N E   G  +++LG N F D+ N+EF
Sbjct: 30  EQWKSWHGKSY-EQKEETWRRMVWEKHLRVIEIHNLEHSLGKHSFRLGMNHFGDMPNEEF 88

Query: 105 RALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFA 164
           R L  GYK    +H+    S F   N    +VP  +DWRD+G VTP+K+Q +CG CWAF+
Sbjct: 89  RQLMNGYKYKQ-THKKLQGSHFLEPNFQ--EVPKHVDWRDEGYVTPVKDQGQCGSCWAFS 145

Query: 165 AVAAVEGITKIRSGNLIQLSEQQLLDCST-NGNNGCLGGSREKAFAYIIQNQGIATEDEY 223
              A+EG    R+G L+ LSEQ L++CS   GN GC GG  ++AF Y+  N GI +ED Y
Sbjct: 146 TTGALEGQHFRRTGQLVSLSEQNLVECSKPEGNEGCNGGLMDQAFQYVKDNGGIDSEDSY 205

Query: 224 PYQAVPGT-CSAAQKPAAAKISNYEEVPSGDEQALLKAV-SMQPVSIAIAAYSTEFQSYK 281
           PY     T C    +  AA  + + ++PSG E+AL+KA+ ++ PVS+AI A  T FQ Y+
Sbjct: 206 PYVGTDDTPCHYNPQYNAANDTGFVDIPSGKERALMKAIAAVGPVSVAIDAGHTSFQFYQ 265

Query: 282 EGI-FNGVC-GTQLDHAVTIVGFGTTE---DGANYWLIKNSWGNTWGDAGYMKIVRD-EG 335
            GI F   C  T LDH V +VG+G  +   DG  YW++KNSW   WG  GY+ + +D + 
Sbjct: 266 SGIYFEAECSSTDLDHGVLVVGYGVEKRDTDGKKYWIVKNSWSEKWGQNGYILMAKDKDN 325

Query: 336 LCGIGTRSSYPL 347
            CGI T +SYPL
Sbjct: 326 HCGIATAASYPL 337


>gi|2098464|pdb|1PCI|A Chain A, Procaricain
 gi|2098465|pdb|1PCI|B Chain B, Procaricain
 gi|2098466|pdb|1PCI|C Chain C, Procaricain
          Length = 322

 Score =  247 bits (630), Expect = 7e-63,   Method: Compositional matrix adjust.
 Identities = 135/314 (42%), Positives = 192/314 (61%), Gaps = 12/314 (3%)

Query: 38  THEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFS 97
           T  + ++++   WM  H + Y++  EK  R +IFK+NL YI++ NK+ N +Y LG N+F+
Sbjct: 13  TSTERLIQLFNSWMLNHNKFYENVDEKLYRFEIFKDNLNYIDETNKK-NNSYWLGLNEFA 71

Query: 98  DLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKEC 157
           DL+NDEF   Y G  + +   +S      ++ N  + ++P ++DWR KGAVTP+++Q  C
Sbjct: 72  DLSNDEFNEKYVGSLIDATIEQSYDE---EFINEDIVNLPENVDWRKKGAVTPVRHQGSC 128

Query: 158 GCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGI 217
           G CWAF+AVA VEGI KIR+G L++LSEQ+L+DC    ++GC GG    A  Y+ +N GI
Sbjct: 129 GSCWAFSAVATVEGINKIRTGKLVELSEQELVDCERR-SHGCKGGYPPYALEYVAKN-GI 186

Query: 218 ATEDEYPYQAVPGTCSAAQKPAA-AKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTE 276
               +YPY+A  GTC A Q      K S    V   +E  LL A++ QPVS+ + +    
Sbjct: 187 HLRSKYPYKAKQGTCRAKQVGGPIVKTSGVGRVQPNNEGNLLNAIAKQPVSVVVESKGRP 246

Query: 277 FQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR---- 332
           FQ YK GIF G CGT++D AVT VG+G +       LIKNSWG  WG+ GY++I R    
Sbjct: 247 FQLYKGGIFEGPCGTKVDGAVTAVGYGKSGGKGYI-LIKNSWGTAWGEKGYIRIKRAPGN 305

Query: 333 DEGLCGIGTRSSYP 346
             G+CG+   S YP
Sbjct: 306 SPGVCGLYKSSYYP 319


>gi|387914010|gb|AFK10614.1| cathepsin L [Callorhinchus milii]
 gi|392873762|gb|AFM85713.1| cathepsin L [Callorhinchus milii]
 gi|392877488|gb|AFM87576.1| cathepsin L [Callorhinchus milii]
          Length = 338

 Score =  247 bits (630), Expect = 7e-63,   Method: Compositional matrix adjust.
 Identities = 134/312 (42%), Positives = 190/312 (60%), Gaps = 16/312 (5%)

Query: 48  EKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKE---GNRTYKLGTNQFSDLTNDEF 104
           E+W + HG+SY ++ E+  R  +++++L  IE  N E   G  +++LG N F D+ N+EF
Sbjct: 30  EQWKSWHGKSY-EQKEETWRRMVWEKHLRVIEIHNLEHSLGKHSFRLGMNHFGDMPNEEF 88

Query: 105 RALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFA 164
           R L  GYK    +H+    S F   N    +VP  +DWRD+G VTP+K+Q +CG CWAF+
Sbjct: 89  RQLMNGYKYKQ-THKKLQGSHFLEPNF--LEVPKHVDWRDEGYVTPVKDQGQCGSCWAFS 145

Query: 165 AVAAVEGITKIRSGNLIQLSEQQLLDCST-NGNNGCLGGSREKAFAYIIQNQGIATEDEY 223
              A+EG    R+G L+ LSEQ L++CS   GN GC GG  ++AF Y+  N GI +ED Y
Sbjct: 146 TTGALEGQHFRRTGQLVSLSEQNLVECSKPEGNEGCNGGLMDQAFQYVKDNGGIDSEDSY 205

Query: 224 PYQAVPGT-CSAAQKPAAAKISNYEEVPSGDEQALLKAV-SMQPVSIAIAAYSTEFQSYK 281
           PY     T C    +  AA  + + ++PSG E+AL+KA+ ++ PVS+AI A  T FQ Y+
Sbjct: 206 PYVGTDDTPCHYNPQYNAANDTGFVDIPSGKERALMKAIAAVGPVSVAIDAGHTSFQFYQ 265

Query: 282 EGI-FNGVC-GTQLDHAVTIVGFGTTE---DGANYWLIKNSWGNTWGDAGYMKIVRD-EG 335
            GI F   C  T LDH V +VG+G  +   DG  YW++KNSW   WG  GY+ + +D + 
Sbjct: 266 SGIYFEAECSSTDLDHGVLVVGYGVEKRDTDGKKYWIVKNSWSEKWGQNGYILMAKDKDN 325

Query: 336 LCGIGTRSSYPL 347
            CGI T +SYPL
Sbjct: 326 HCGIATAASYPL 337


>gi|225718114|gb|ACO14903.1| Cathepsin L precursor [Caligus clemensi]
          Length = 336

 Score =  246 bits (629), Expect = 8e-63,   Method: Compositional matrix adjust.
 Identities = 129/310 (41%), Positives = 186/310 (60%), Gaps = 13/310 (4%)

Query: 48  EKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKE---GNRTYKLGTNQFSDLTNDEF 104
           E W   HG++Y   +E+++RLKI+ EN   I + N E   G   Y +  N + DL + EF
Sbjct: 31  ESWKLMHGKTYSSSIEEKLRLKIYMENSLKISRHNSEALNGIHPYYMKMNHYGDLLHHEF 90

Query: 105 RALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFA 164
            A+  GY+  + +  S   +    +N+ +   PT +DWR++GAVTP+KNQ +CG CW+F+
Sbjct: 91  VAMVNGYQYANKT-ASLGGTYIPNKNIQL---PTHVDWREEGAVTPVKNQGQCGSCWSFS 146

Query: 165 AVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GNNGCLGGSREKAFAYIIQNQGIATEDEY 223
           A  A+EG    ++G LI LSEQ L+DCS   GNNGC GG  + AF YI  N+GI TE  Y
Sbjct: 147 ATGALEGQDFRKTGKLISLSEQNLVDCSRKFGNNGCEGGLMDFAFTYIRDNKGIDTEASY 206

Query: 224 PYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVS-MQPVSIAIAAYSTEFQSYKE 282
           PY+ + G C    K        + ++  G E+ L KAV+ + P+S+AI A    FQ Y  
Sbjct: 207 PYEGIDGHCHYNPKNKGGSDIGFVDIKKGSEKDLKKAVAGVGPISVAIDASHMSFQFYSH 266

Query: 283 GIF-NGVCGT-QLDHAVTIVGFGT-TEDGANYWLIKNSWGNTWGDAGYMKIVRD-EGLCG 338
           G++    C + +LDH V +VGFGT +  G +YWL+KNSW   WGD GY+K+ R+ E +CG
Sbjct: 267 GVYVESKCSSEELDHGVLVVGFGTDSVSGEDYWLVKNSWSEKWGDQGYIKMARNKENMCG 326

Query: 339 IGTRSSYPLA 348
           I + +SYP+ 
Sbjct: 327 IASSASYPVV 336


>gi|299507656|gb|ADJ21807.1| cathepsin L [Oplegnathus fasciatus]
          Length = 336

 Score =  246 bits (629), Expect = 9e-63,   Method: Compositional matrix adjust.
 Identities = 141/341 (41%), Positives = 199/341 (58%), Gaps = 18/341 (5%)

Query: 19  FIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYI 78
            + + +L  C S  +S+ S   Q + E  + W + H + Y  E E+  R  ++++NL+ I
Sbjct: 1   MLPVAVLAVCLSAALSAPSLDPQ-LDEHWDLWKSWHTKKYH-EKEEGWRRMVWEKNLKKI 58

Query: 79  EKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTD 135
           E  N E   G  TY+LG N F D+T++EFR +  GYK    S R    S F   N    +
Sbjct: 59  ELHNLEHSMGEHTYRLGMNHFGDMTHEEFRQIMNGYK--RKSERKFKGSLFMEPNF--LE 114

Query: 136 VPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS-TN 194
            P S+DWRD G VTP+K+Q +CG CWAF+   A+EG    ++G L+ LSEQ L+DCS   
Sbjct: 115 APRSVDWRDNGYVTPVKDQGQCGSCWAFSTTGAMEGQHFRKTGKLVSLSEQNLVDCSRPE 174

Query: 195 GNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPG-TCSAAQKPAAAKISNYEEVPSGD 253
           GN GC GG  ++AF YI  NQG+ +ED YPY       C    K  +A  + + ++PSG 
Sbjct: 175 GNEGCNGGLMDQAFQYIKDNQGLDSEDSYPYLGTDDQPCHYDPKYNSANDTGFIDIPSGK 234

Query: 254 EQALLKAV-SMQPVSIAIAAYSTEFQSYKEGI-FNGVCGT-QLDHAVTIVGF---GTTED 307
           E+AL+KAV ++ PVS+AI A    FQ Y+ GI +   C + +LDH V +VG+   G   D
Sbjct: 235 ERALMKAVAAVGPVSVAIDAGHESFQFYQSGIYYEKECSSEELDHGVLVVGYGFEGEDVD 294

Query: 308 GANYWLIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYPL 347
           G  YW++KNSW   WGD GY+ + +D +  CGI T +SYPL
Sbjct: 295 GKKYWIVKNSWSEKWGDKGYIYMAKDRKNHCGIATAASYPL 335


>gi|410923307|ref|XP_003975123.1| PREDICTED: cathepsin L1-like [Takifugu rubripes]
          Length = 336

 Score =  246 bits (629), Expect = 9e-63,   Method: Compositional matrix adjust.
 Identities = 143/343 (41%), Positives = 199/343 (58%), Gaps = 19/343 (5%)

Query: 18  MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
           MF ++ L + C +  +S+ S   Q + E    W   H + Y  E E+  R  ++++NL+ 
Sbjct: 1   MFPVVVLAL-CVTAALSAPSLDPQ-LDEHWNLWKDWHSKKYH-EKEEGWRRMVWEKNLKK 57

Query: 78  IEKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMT 134
           IE  N E   G  TY LG N F D+T++EFR +  GYK+ S   R    S F   N    
Sbjct: 58  IELHNLEHSMGKHTYSLGMNHFGDMTHEEFRQIMNGYKLKS--QRKLRGSLFMEPNF--L 113

Query: 135 DVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS-T 193
           + P S+DWRDKG VTP+K+Q +CG CWAF+   A+EG    ++G L+ LSEQ L+DCS  
Sbjct: 114 EAPRSVDWRDKGYVTPVKDQGQCGSCWAFSTTGAMEGQHFRKTGTLVSLSEQNLVDCSRP 173

Query: 194 NGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVP-GTCSAAQKPAAAKISNYEEVPSG 252
            GN GC GG  ++AF YI  N G+ +E+ YPY     G C       +A  + + +VPSG
Sbjct: 174 EGNEGCNGGLMDQAFQYIKDNGGLDSEESYPYLGTDEGPCHYDPSYNSANDTGFVDVPSG 233

Query: 253 DEQALLKAV-SMQPVSIAIAAYSTEFQSYKEGI-FNGVCGT-QLDHAVTIVGF---GTTE 306
            E+AL+KAV S+ PVS+AI A    FQ Y  GI ++  C + +LDH V +VG+   G   
Sbjct: 234 SERALMKAVASVGPVSVAIDAGHESFQFYHSGIYYDKECSSEELDHGVLVVGYGFEGKDV 293

Query: 307 DGANYWLIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYPLA 348
           DG  YW++KNSW   WGD GY+ + +D +  CGI T +SYPL 
Sbjct: 294 DGKKYWIVKNSWSENWGDKGYIYMAKDKKNHCGIATAASYPLV 336


>gi|330805275|ref|XP_003290610.1| hypothetical protein DICPUDRAFT_98747 [Dictyostelium purpureum]
 gi|325079249|gb|EGC32858.1| hypothetical protein DICPUDRAFT_98747 [Dictyostelium purpureum]
          Length = 334

 Score =  246 bits (628), Expect = 1e-62,   Method: Compositional matrix adjust.
 Identities = 135/336 (40%), Positives = 194/336 (57%), Gaps = 17/336 (5%)

Query: 18  MFIIITLLV----SCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKE 73
           +F+I++L++     CA+  + S  T++ S +     WM +H ++Y    E   + + FK+
Sbjct: 5   VFLIVSLVILSINVCAATNLFSAQTYQTSFL----GWMKKHNKAYHHH-EFNDKYQTFKD 59

Query: 74  NLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSM 133
           N+++I   N + + T  LG N+F+DLTN+E++  Y G  M    +           N   
Sbjct: 60  NMDFIHNWNSKESDTV-LGLNRFADLTNEEYKKTYLG--MSINVNLRANQVPMNGLNFER 116

Query: 134 TDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCST 193
              P+S+DWR  GAV  +K+Q  CG CWAFA   AVEG  +I++GN++  SEQ L+DCS 
Sbjct: 117 FTGPSSIDWRQNGAVAYVKDQGHCGSCWAFATTGAVEGAHQIKTGNMVTFSEQHLVDCSG 176

Query: 194 N-GNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSG 252
             GNNGC GG    AF YII N GIATE+ YPY A    C          IS Y++VP G
Sbjct: 177 RYGNNGCDGGLMTSAFKYIIDNDGIATEEAYPYTATQNRCVYNTTMLGTAISGYKDVPRG 236

Query: 253 DEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFN-GVCGT-QLDHAVTIVGFGTTEDGAN 310
            E AL  A+S QPV++AI A    FQ YK G++    C + +L+H V  VG+GT E G +
Sbjct: 237 SESALTAAISKQPVAVAIDASPITFQLYKSGVYQEATCSSYRLNHGVLAVGYGTLE-GKD 295

Query: 311 YWLIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSY 345
           Y+++KNSW  TWG+ GY+ + R+    CGI T +SY
Sbjct: 296 YYIVKNSWAETWGNQGYILMARNANNHCGIATMASY 331


>gi|346469447|gb|AEO34568.1| hypothetical protein [Amblyomma maculatum]
          Length = 333

 Score =  246 bits (628), Expect = 1e-62,   Method: Compositional matrix adjust.
 Identities = 131/308 (42%), Positives = 191/308 (62%), Gaps = 11/308 (3%)

Query: 48  EKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKAN---KEGNRTYKLGTNQFSDLTNDEF 104
           E + + H ++YK  +E+ +R KIF EN  +I K N    +G  +YKLG NQF+DL   EF
Sbjct: 28  EAFKSTHKKTYKSNVEELLRFKIFTENSLFIAKHNVKYAKGLVSYKLGINQFADLLPHEF 87

Query: 105 RALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFA 164
             +  GY+    + R +T       NL+ + +P ++DWR KGAVTP+K+Q +CG CWAF+
Sbjct: 88  VKMMNGYQGKRLAGRGST--YLPPANLNDSSLPKTVDWRKKGAVTPVKDQGQCGSCWAFS 145

Query: 165 AVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GNNGCLGGSREKAFAYIIQNQGIATEDEY 223
           +  ++EG   +++G L+ LSEQ L+DCS+  GN GC GG  + +F YI  N GI TED Y
Sbjct: 146 STGSLEGQHFLKTGKLVSLSEQNLVDCSSAYGNQGCNGGLMDNSFNYIKANGGIDTEDSY 205

Query: 224 PYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAV-SMQPVSIAIAAYSTEFQSYKE 282
           PY+A  G C   ++   A  + + ++  G E+ L KAV ++ PVS+AI A    FQ Y E
Sbjct: 206 PYEAEDGDCRYKKEDVGATDTGFVDIKEGSEKDLQKAVATVGPVSVAIDASQQSFQLYSE 265

Query: 283 GIFNGV-CGTQ-LDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRDE-GLCGI 339
           G+++   C ++ LDH V  VG+G  ++G  YWL+KNSW  TWG  GY+ + RD+   CGI
Sbjct: 266 GVYDEPNCSSESLDHGVLAVGYG-VKNGKKYWLVKNSWAETWGQDGYILMSRDKNNQCGI 324

Query: 340 GTRSSYPL 347
            + +SYPL
Sbjct: 325 ASSASYPL 332


>gi|221090861|ref|XP_002167224.1| PREDICTED: cathepsin L-like [Hydra magnipapillata]
          Length = 324

 Score =  246 bits (628), Expect = 1e-62,   Method: Compositional matrix adjust.
 Identities = 135/330 (40%), Positives = 196/330 (59%), Gaps = 18/330 (5%)

Query: 24  LLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANK 83
           LL+         R   ++S ++    W   H + Y  + E+ +R  I+K+N   I + N 
Sbjct: 8   LLLGVTLAYTIERPVKDESWIQ----WKMYHNKVYSHDGEETVRYTIWKDNERRIREHNL 63

Query: 84  EGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWR 143
           +G   + L  NQF D+TN EF+A + GY     SH+    STF   N  +   P ++DWR
Sbjct: 64  KGG-DFLLKMNQFGDMTNSEFKA-FNGYL----SHKHVNGSTFLTPNNFV--APDTVDWR 115

Query: 144 DKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GNNGCLGG 202
           ++G VTP+K+Q +CG CWAF+   ++EG    ++G L+ LSEQ L+DCST  GNNGC GG
Sbjct: 116 NEGYVTPVKDQGQCGSCWAFSTTGSLEGQHFKKTGKLVSLSEQNLVDCSTAYGNNGCNGG 175

Query: 203 SREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAV- 261
             + AF YI +N+GI +E  YPY A  G C   +   AA  + + ++P G+E  L +AV 
Sbjct: 176 LMDNAFTYIKENKGIDSEASYPYTAEDGKCVFKKPSVAATDTGFVDLPEGNENKLKEAVA 235

Query: 262 SMQPVSIAIAAYSTEFQSYKEGIFN--GVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWG 319
           S+ P+S+AI A    FQ Y  G++N      T+LDH V +VG+G TE G +YWL+KNSW 
Sbjct: 236 SVGPISVAIDASHESFQFYSSGVYNEPSCSSTELDHGVLVVGYG-TESGKDYWLVKNSWN 294

Query: 320 NTWGDAGYMKIVRD-EGLCGIGTRSSYPLA 348
            +WGD GY+K+ R+ +  CGI T++SYPL 
Sbjct: 295 TSWGDKGYIKMRRNAKNQCGIATKASYPLV 324


>gi|21953244|emb|CAD42716.1| putative cathepsin L [Myzus persicae]
          Length = 341

 Score =  246 bits (628), Expect = 1e-62,   Method: Compositional matrix adjust.
 Identities = 138/342 (40%), Positives = 197/342 (57%), Gaps = 17/342 (4%)

Query: 20  IIITLLVSCASQVVSSRSTHEQSVVEIHEKW---MAQHGRSYKDELEKEMRLKIFKENLE 76
           +I+  LV+ A   VSS + +E     I E+W     Q  + Y+D  E+  R K++ +N  
Sbjct: 4   VIVLGLVAFAISSVSSINLNEV----IEEEWSLFKMQFKKLYEDIKEETFRKKVYLDNKL 59

Query: 77  YIEKANK---EGNRTYKLGTNQFSDLTNDEFRALYTGYK--MPSPSHRSTTSSTFKYQNL 131
            I + NK    G  TY L  N F DL   E+  +  G+K  +       T      +   
Sbjct: 60  KIARHNKLYESGEETYALEMNHFGDLMQHEYSKMMNGFKPSLAGGDSNFTNDEGVTFLKS 119

Query: 132 SMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDC 191
               +P S+DWR KG VTP+KNQ +CG CW+F+A  ++EG    ++G L+ LSEQ L+DC
Sbjct: 120 ENVVIPKSIDWRKKGYVTPVKNQGQCGSCWSFSATGSLEGQHFRKTGVLVSLSEQNLIDC 179

Query: 192 STN-GNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVP 250
           S   GNNGC GG  + AF YI  N+G+ TE  YPY+A    C      + A  + + ++P
Sbjct: 180 SRKYGNNGCEGGLMDLAFKYIKSNKGLDTEKSYPYEAEDDKCRYNPDNSGATDNGFVDIP 239

Query: 251 SGDEQALLKAV-SMQPVSIAIAAYSTEFQSYKEGIF-NGVC-GTQLDHAVTIVGFGTTED 307
            GDE+AL+ A+ ++ PVSIAI A S +FQ YK+G+F N  C  T+LDH V  VGF T + 
Sbjct: 240 EGDEEALMHALATVGPVSIAIDASSEKFQFYKKGVFYNPRCSSTELDHGVLAVGFRTDKK 299

Query: 308 GANYWLIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYPLA 348
           G +YW++KNSWG TWGD GY+ + R+ +  CG+ + +SYPL 
Sbjct: 300 GGDYWIVKNSWGKTWGDEGYIMMARNKKNNCGVASSASYPLV 341


>gi|330803818|ref|XP_003289899.1| hypothetical protein DICPUDRAFT_154350 [Dictyostelium purpureum]
 gi|325080010|gb|EGC33584.1| hypothetical protein DICPUDRAFT_154350 [Dictyostelium purpureum]
          Length = 326

 Score =  246 bits (628), Expect = 1e-62,   Method: Compositional matrix adjust.
 Identities = 136/335 (40%), Positives = 200/335 (59%), Gaps = 20/335 (5%)

Query: 19  FIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYI 78
            I   L+++C S   ++R   ++      + WM +H +SY ++ E   R  +F++N++ +
Sbjct: 7   LIFCFLIINCCS---AARIFSQKQYQTAFQNWMVKHQKSYTND-EFGSRYSVFQDNMDIV 62

Query: 79  EKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNL-SMTDVP 137
            K N++G+ T  LG N  +DLTN+EF+ LY G K          + T+K + L  ++ +P
Sbjct: 63  AKWNQKGSNTI-LGLNVMADLTNEEFKKLYLGTK---------ANVTYKKKTLVGVSGLP 112

Query: 138 TSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS-TNGN 196
            S+DWR  GAVT +KNQ +CG C+AF+   +VEGI +I S  L+ LSEQQ+LDCS + GN
Sbjct: 113 ASVDWRANGAVTAVKNQGQCGGCYAFSTTGSVEGIHEITSQQLVPLSEQQILDCSGSEGN 172

Query: 197 NGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQA 256
           NGC GG    +F YII   G+ TE  YPY    G C   +K   A I+ Y+ V SG E  
Sbjct: 173 NGCDGGLMTNSFEYIIAVGGLDTEASYPYTGEVGKCKFNKKNIGATITGYKNVESGSESD 232

Query: 257 LLKAVSMQPVSIAIAAYSTEFQSYKEGIF--NGVCGTQLDHAVTIVGFGTTEDGANYWLI 314
           L  AV+ QPVS+AI A  + FQ Y  G++       TQLDH V  VG+G ++ G +YW++
Sbjct: 233 LQTAVAAQPVSVAIDASQSSFQLYASGVYYEPECSSTQLDHGVLAVGYG-SQSGQDYWIV 291

Query: 315 KNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYPLA 348
           KNSWG  WG+ G++ + R+ +  CGI T +S+P A
Sbjct: 292 KNSWGADWGENGFILMARNKDNNCGIATMASFPTA 326


>gi|52630917|gb|AAU84922.1| putative cathepsin L [Toxoptera citricida]
          Length = 341

 Score =  246 bits (628), Expect = 1e-62,   Method: Compositional matrix adjust.
 Identities = 140/345 (40%), Positives = 197/345 (57%), Gaps = 18/345 (5%)

Query: 18  MFIIITL-LVSCASQVVSSRSTHEQSVVEIHEKW---MAQHGRSYKDELEKEMRLKIFKE 73
           M ++I L LV  A   VSS + +E     I E+W     Q  + Y+D  E+  R K++ +
Sbjct: 1   MKVVIVLGLVVFAISSVSSINLNEI----IEEEWDLFKVQFKKIYEDVKEEAFRKKVYLD 56

Query: 74  NLEYIEKANK---EGNRTYKLGTNQFSDLTNDEFRALYTGYK--MPSPSHRSTTSSTFKY 128
           N   I + NK    G  TY L  N F DL   E+  +  G+K  +       T      +
Sbjct: 57  NKLKIARHNKLYETGEETYALEMNHFGDLMQHEYTKMMNGFKPSLAGGDKNFTDDDAVTF 116

Query: 129 QNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQL 188
                  +P S+DWR KG VTP+KNQ +CG CW+F+A  ++EG    ++G L+ LSEQ L
Sbjct: 117 LKSENVVIPKSIDWRKKGYVTPVKNQGQCGSCWSFSATGSLEGQHFRKTGVLVSLSEQNL 176

Query: 189 LDCSTN-GNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYE 247
           +DCS   GNNGC GG  + AF YI  N+G+ TE  YPY+A    C    + + A    + 
Sbjct: 177 IDCSRKYGNNGCEGGLMDLAFKYIKSNKGLDTEKSYPYEAEDDKCRYNPENSGATDKGFV 236

Query: 248 EVPSGDEQALLKAV-SMQPVSIAIAAYSTEFQSYKEGIF-NGVC-GTQLDHAVTIVGFGT 304
           ++P GDE AL+ A+ ++ PVSIAI A S +FQ YK+G+F N  C  T+LDH V  VG+GT
Sbjct: 237 DIPEGDEDALVHALATVGPVSIAIDASSEKFQFYKKGVFYNPRCSSTELDHGVLAVGYGT 296

Query: 305 TEDGANYWLIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYPLA 348
              G +YW++KNSWG TWGD GY+ + R+ +  CG+ + +SYPL 
Sbjct: 297 DHKGGDYWIVKNSWGKTWGDQGYIMMARNKKNNCGVASSASYPLV 341


>gi|957281|gb|AAB33990.1| cysteine proteinase [Bombyx mori]
          Length = 344

 Score =  246 bits (628), Expect = 1e-62,   Method: Compositional matrix adjust.
 Identities = 138/344 (40%), Positives = 199/344 (57%), Gaps = 23/344 (6%)

Query: 24  LLVSCASQVVSSRSTHEQSVVEIHEKWMA---QHGRSYKDELEKEMRLKIFKENLEYIEK 80
           +L+ CA   VS+     Q    + E+W A   QH  +YK E+E   R+KI+ E+   I K
Sbjct: 5   VLLLCAVAAVSAV----QFFDLVKEEWSAFKLQHRLNYKSEVEDNFRMKIYAEHKHIIAK 60

Query: 81  ANKE---GNRTYKLGTN---QFSDLTNDEFRALYTGYKMPSPSHRST-----TSSTFKYQ 129
            N++   G  +YKLG N   +  D+ + EF     G+   +  +++      +    K+ 
Sbjct: 61  HNQKYEMGLVSYKLGMNSWWEHGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFI 120

Query: 130 NLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLL 189
           + +   +P  +DWR  GAVT IK+Q +CG CW+F+   A+EG    +SG L+ LSEQ L+
Sbjct: 121 SPANVKLPEQVDWRKHGAVTDIKDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLI 180

Query: 190 DCSTN-GNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEE 248
           DCS   GNNGC GG  + AF YI  N GI TE  YPY+ V   C    K   A+   + +
Sbjct: 181 DCSEQYGNNGCNGGLMDNAFKYIKDNGGIDTEQAYPYEGVDDKCRYNPKNTGAEDVGFVD 240

Query: 249 VPSGDEQALLKAV-SMQPVSIAIAAYSTEFQSYKEGIFN--GVCGTQLDHAVTIVGFGTT 305
           +P GDEQ L++AV ++ PVS+AI A  T FQ Y  G++N      T LDH V +VG+GT 
Sbjct: 241 IPEGDEQKLMEAVATVGPVSVAIDASHTHFQLYSSGVYNEEECSSTDLDHGVLVVGYGTD 300

Query: 306 EDGANYWLIKNSWGNTWGDAGYMKIVRDE-GLCGIGTRSSYPLA 348
           E G +YWL+KNSWG +WG+ GY+K++R++   CGI + +SYPL 
Sbjct: 301 EQGVDYWLVKNSWGRSWGELGYIKMIRNKNNRCGIASSASYPLV 344


>gi|332375975|gb|AEE63128.1| unknown [Dendroctonus ponderosae]
          Length = 338

 Score =  246 bits (628), Expect = 1e-62,   Method: Compositional matrix adjust.
 Identities = 136/344 (39%), Positives = 199/344 (57%), Gaps = 22/344 (6%)

Query: 18  MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMA---QHGRSYKDELEKEMRLKIFKEN 74
           + +++T+ V+C  Q VS           + E+W +   QH + Y+ E E+  R+KIF +N
Sbjct: 4   LVLLVTIAVAC--QAVSFSEL-------VQEQWNSFKVQHKKQYESETEERFRMKIFMDN 54

Query: 75  LEYIEKANK---EGNRTYKLGTNQFSDLTNDEFRALYTGY-KMPSPSHRSTTSSTFKYQN 130
              + K NK   +G   YKL  N++ DL + EF  L  G+ +  +   R     +  +  
Sbjct: 55  SHKVAKHNKLFEQGLYPYKLAMNKYGDLLHHEFVGLLNGFNRTKTYLKRGELQDSITFIE 114

Query: 131 LSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLD 190
            +  D+P ++DWR +GAVTP+K+Q  CG CW+F+A  A+EG    ++  L+ LSEQ L+D
Sbjct: 115 PAHVDIPDTVDWRQEGAVTPVKDQGHCGSCWSFSATGALEGQHFRQTKKLVSLSEQNLVD 174

Query: 191 CSTN-GNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEV 249
           CS+  GNNGC GG  + AF YI  N GI TE  YPY         + K   A    + ++
Sbjct: 175 CSSRFGNNGCNGGLMDNAFRYIKNNGGIDTEAAYPYMGEDEKFRYSAKNRGATDKGFVDI 234

Query: 250 PSGDEQALLKAV-SMQPVSIAIAAYSTEFQSYKEGIF-NGVC-GTQLDHAVTIVGFGTTE 306
           PSGDE  L  AV ++ P+SIAI A    FQ Y  G++ +  C  T+LDH V +VG+GT E
Sbjct: 235 PSGDEDKLKAAVATVGPISIAIDASHESFQLYSNGVYSDPTCSSTELDHGVLVVGYGTDE 294

Query: 307 D-GANYWLIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYPLA 348
             G +YWL+KNSWG+TWG  GY+K+ R+ +  CG+ T++SYPL 
Sbjct: 295 KTGMDYWLVKNSWGDTWGLDGYIKMARNQDNQCGVATQASYPLV 338


>gi|41688064|dbj|BAD08618.1| cathepsin L preproprotein [Cyprinus carpio]
          Length = 337

 Score =  246 bits (628), Expect = 1e-62,   Method: Compositional matrix adjust.
 Identities = 141/343 (41%), Positives = 199/343 (58%), Gaps = 20/343 (5%)

Query: 18  MFIIITLLVSCASQVVSSRSTHEQSVVEIH-EKWMAQHGRSYKDELEKEMRLKIFKENLE 76
           M + +     C S V ++ +  +Q  ++ H E+W   HG+ Y  E E+  R  ++++NL+
Sbjct: 1   MRVFLAAFALCLSAVFAAPTLDKQ--LDNHWEQWKNWHGKKYH-EKEEGWRRMVWEKNLQ 57

Query: 77  YIEKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSM 133
            IE  N E   G  TY+LG N+F D+T++EFR +  GYK      R    S F   N   
Sbjct: 58  KIELHNLEHSMGTHTYRLGMNRFGDMTHEEFRQVMNGYK--HKKERRFRGSLFMEPNF-- 113

Query: 134 TDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCST 193
            +VP SLDWR+KG VTP+K+Q ECG CWAF+   A+EG    ++G L+ LSEQ L+DCS 
Sbjct: 114 LEVPNSLDWREKGYVTPVKDQGECGSCWAFSTTGAMEGQMFRKTGKLVSLSEQNLVDCSR 173

Query: 194 -NGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGT-CSAAQKPAAAKISNYEEVPS 251
             GN GC GG  ++AF YI    G+ +E+ YPY       C    K +AA  + + ++PS
Sbjct: 174 PEGNEGCNGGLMDQAFQYIKDQNGLDSEESYPYVGTDDQPCHYDPKYSAANDTGFVDIPS 233

Query: 252 GDEQALLKAV-SMQPVSIAIAAYSTEFQSYKEGI-FNGVCGT-QLDHAVTIVGF---GTT 305
           G E AL+KA+ ++ PVS+AI A    FQ Y+ GI +   C + +LDH V  VG+   G  
Sbjct: 234 GKEHALMKAIAAVGPVSVAIDAGHESFQFYQSGIYYEKECSSEELDHGVLAVGYGFEGED 293

Query: 306 EDGANYWLIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYPL 347
            DG  YW++KNSW   WGD GY+ + +D    CGI T +SYPL
Sbjct: 294 VDGKKYWIVKNSWSENWGDKGYVYMAKDRHNHCGIATAASYPL 336


>gi|342305188|dbj|BAK55648.1| cathepsin L [Oplegnathus fasciatus]
          Length = 336

 Score =  246 bits (627), Expect = 1e-62,   Method: Compositional matrix adjust.
 Identities = 141/341 (41%), Positives = 199/341 (58%), Gaps = 18/341 (5%)

Query: 19  FIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYI 78
            + + +L  C S  +S+ S   Q + E  + W + H + Y  E E+  R  ++++NL+ I
Sbjct: 1   MLPVAVLAVCLSAALSAPSLDPQ-LDEHWDLWKSWHTKKYH-EKEEGWRRMVWEKNLKKI 58

Query: 79  EKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTD 135
           E  N E   G  TY+LG N F D+T++EFR +  GYK    S R    S F   N    +
Sbjct: 59  ELHNLEHSMGEHTYRLGMNHFGDMTHEEFRQIMYGYK--RKSERKFKGSLFMEPNF--LE 114

Query: 136 VPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS-TN 194
            P S+DWRD G VTP+K+Q +CG CWAF+   A+EG    ++G L+ LSEQ L+DCS   
Sbjct: 115 APRSVDWRDNGYVTPVKDQGQCGSCWAFSTTGAMEGQHFRKTGKLVSLSEQNLVDCSRPE 174

Query: 195 GNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPG-TCSAAQKPAAAKISNYEEVPSGD 253
           GN GC GG  ++AF YI  NQG+ +ED YPY       C    K  +A  + + ++PSG 
Sbjct: 175 GNEGCNGGLMDQAFQYIKDNQGLDSEDSYPYLGTDDQPCHYDPKYNSANDTGFIDIPSGK 234

Query: 254 EQALLKAV-SMQPVSIAIAAYSTEFQSYKEGI-FNGVCGT-QLDHAVTIVGF---GTTED 307
           E+AL+KAV ++ PVS+AI A    FQ Y+ GI +   C + +LDH V +VG+   G   D
Sbjct: 235 ERALMKAVAAVGPVSVAIDAGHESFQFYQSGIYYEKECSSEELDHGVLVVGYGFEGEDVD 294

Query: 308 GANYWLIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYPL 347
           G  YW++KNSW   WGD GY+ + +D +  CGI T +SYPL
Sbjct: 295 GKKYWIVKNSWSEKWGDKGYIYMAKDRKNHCGIATAASYPL 335


>gi|390347681|ref|XP_801784.2| PREDICTED: cathepsin L1-like isoform 2 [Strongylocentrotus
           purpuratus]
          Length = 336

 Score =  246 bits (627), Expect = 1e-62,   Method: Compositional matrix adjust.
 Identities = 134/338 (39%), Positives = 204/338 (60%), Gaps = 15/338 (4%)

Query: 19  FIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYI 78
           F++   LV+CA+      +  +     +  +W   H +SY +++ +  R  +++EN++ I
Sbjct: 6   FLVAIGLVACATAAFVKPTNPDLDSRWL--EWKIAHTKSYTNDMHELERRLVWEENVKMI 63

Query: 79  EKANKEGN---RTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTD 135
              N + +   + ++LG N++ D+   E R+   GYK  S +      STF     S   
Sbjct: 64  NMHNLDHSLHKKGFRLGMNEYGDMRLHEVRSTMNGYK--SSNVTKVQGSTF--LTPSNIQ 119

Query: 136 VPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS-TN 194
           VP ++DWR KG VTP+KNQ +CG CWAF+   ++EG T  ++  L+ LSEQ L+DCS T 
Sbjct: 120 VPDTVDWRTKGYVTPVKNQGQCGSCWAFSTTGSLEGQTFKKTSKLVSLSEQNLVDCSRTE 179

Query: 195 GNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDE 254
           GN GC GG  ++ F Y+I N GI +ED YPY A   TC       +A+++ + +V SGDE
Sbjct: 180 GNMGCEGGLMDQGFQYVIDNHGIDSEDCYPYDAEDETCHYKASCDSAEVTGFTDVTSGDE 239

Query: 255 QALLKAV-SMQPVSIAIAAYSTEFQSYKEGIFN--GVCGTQLDHAVTIVGFGTTEDGANY 311
           QAL++AV S+ PVS+AI A    FQ Y+ G+++      ++LDH V +VG+G T+ G +Y
Sbjct: 240 QALMEAVASVGPVSVAIDASHQSFQLYESGVYDEPECSSSELDHGVLVVGYG-TDGGKDY 298

Query: 312 WLIKNSWGNTWGDAGYMKIVRDE-GLCGIGTRSSYPLA 348
           WL+KNSWG TWG +GY+K+ R++   CGI T +SYPL 
Sbjct: 299 WLVKNSWGETWGLSGYIKMSRNKSNQCGIATSASYPLV 336


>gi|302763127|ref|XP_002964985.1| hypothetical protein SELMODRAFT_406652 [Selaginella moellendorffii]
 gi|300167218|gb|EFJ33823.1| hypothetical protein SELMODRAFT_406652 [Selaginella moellendorffii]
          Length = 320

 Score =  246 bits (627), Expect = 1e-62,   Method: Compositional matrix adjust.
 Identities = 129/326 (39%), Positives = 188/326 (57%), Gaps = 29/326 (8%)

Query: 14  NTTPMFIIITLLVSCASQVVSSRSTHEQS----VVEIHEKWMAQHGRSYKDELEKEMRLK 69
           N   + +I+ ++V  A   ++  +  E      +  + E W A+HG+SY  + EK  R+ 
Sbjct: 4   NMIALILILLVVVGAAPFAIARPAALEDDRALEIKNMFEDWAAKHGKSYSSDWEKARRMT 63

Query: 70  IFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQ 129
           IF + L YIEK N   N T+ LG N+FSDLTN EFRA Y G K   P ++    +  K  
Sbjct: 64  IFSDTLAYIEKHNALPNTTFTLGLNKFSDLTNAEFRANYVG-KFKPPRYQDRRPA--KDV 120

Query: 130 NLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLL 189
           ++ ++ +PTSLDWR +GAVTPIK+Q +CG CWAF+A+A++E    + +  L+ LSEQQL+
Sbjct: 121 DVDVSSLPTSLDWRQEGAVTPIKDQGQCGSCWAFSAIASIESAHFLATNQLVSLSEQQLI 180

Query: 190 DCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEV 249
           DC T  + GC                    E+ YPY  + G+C+ A K   A+I+ +  V
Sbjct: 181 DCDTV-DEGC-------------------QEEAYPYTGLAGSCN-ANKNKVAEITGFNVV 219

Query: 250 PSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGA 309
                 AL+KAVS  PV++ I      FQ+Y+ GI +G C    DH V ++G+G TE G 
Sbjct: 220 TKDKADALMKAVSKTPVTVGICGSDQNFQNYRSGILSGQCCNSRDHVVLVIGYG-TEGGM 278

Query: 310 NYWLIKNSWGNTWGDAGYMKIVRDEG 335
            YW+IKNSWG +WG+ G+MKI + +G
Sbjct: 279 PYWIIKNSWGTSWGEDGFMKIEKKDG 304


>gi|224460525|gb|ACN43674.1| cathepsin L [Paralichthys olivaceus]
          Length = 334

 Score =  246 bits (627), Expect = 1e-62,   Method: Compositional matrix adjust.
 Identities = 132/311 (42%), Positives = 184/311 (59%), Gaps = 19/311 (6%)

Query: 50  WMAQHGRSYKDELEKEMRLKIFKENLEYIEKAN---KEGNRTYKLGTNQFSDLTNDEFRA 106
           W  + GRSY    E++ R++I+  N E +   N    +G+ TY+LG   ++DL ++EF+ 
Sbjct: 29  WKLKFGRSYNSSSEEDKRMQIWLRNREIVMAHNAMADQGHSTYRLGMTFYADLEHEEFKQ 88

Query: 107 LYTG-----YKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCW 161
              G     +    P   S+     ++ NL     P ++DWR  G VTP+KNQ  CG CW
Sbjct: 89  TVFGVCLGSFNASKPRGGSSFLKMHRFYNL-----PQTIDWRQWGFVTPVKNQGSCGSCW 143

Query: 162 AFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GNNGCLGGSREKAFAYIIQNQGIATE 220
           +F++  A+EG    ++G L+ LSEQ+L+DCS N GN GC GG  + AF YI+   GI TE
Sbjct: 144 SFSSTGALEGQNFRKTGRLVSLSEQELVDCSGNYGNYGCNGGWMDNAFRYIVNKGGIHTE 203

Query: 221 DEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAV-SMQPVSIAIAAYSTEFQS 279
           D YPY+   G C A      A  + Y ++PSG+E AL +AV +  PVS+AI A    FQ 
Sbjct: 204 DSYPYEGQVGQCRANYGEIGATCTGYYDIPSGNEHALKEAVATFGPVSVAIHASDQSFQL 263

Query: 280 YKEGIFNG--VCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRDE-GL 336
           Y  G++N     GT LDHAV IVG+G TE G +YWL+KNSWG  WGD GY+K+ R+    
Sbjct: 264 YHSGVYNNPYCSGTALDHAVLIVGYG-TEYGQDYWLVKNSWGPAWGDQGYIKMSRNRYNQ 322

Query: 337 CGIGTRSSYPL 347
           CGI + +S+PL
Sbjct: 323 CGIASAASFPL 333


>gi|391328505|ref|XP_003738729.1| PREDICTED: cathepsin L-like [Metaseiulus occidentalis]
          Length = 323

 Score =  246 bits (627), Expect = 2e-62,   Method: Compositional matrix adjust.
 Identities = 131/301 (43%), Positives = 189/301 (62%), Gaps = 13/301 (4%)

Query: 54  HGRSYKDELEKEMRLKIFKENLEYIEKANKE---GNRTYKLGTNQFSDLTNDEFRALYTG 110
           HG+SY  + E+  R ++F +++  I   N     G  TY++G N+F+D+T++EFR  + G
Sbjct: 26  HGKSYGHD-EEHFRRQLFYKSVAKINAHNLRHDLGLTTYRMGLNKFTDMTSEEFRN-FKG 83

Query: 111 YKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVE 170
            K  +   ++  + T   + L    +PT +DWR+KG VTP+KNQ +CG CWAF+   ++E
Sbjct: 84  LKFDAT--KTKRNGTRFQKELLGEALPTQVDWREKGYVTPVKNQGQCGSCWAFSTTGSLE 141

Query: 171 GITKIRSGNLIQLSEQQLLDCS-TNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVP 229
           G     +G L+ LSEQ L+DCS   GNNGC GG  +  F YI QN GI TE+ YPY    
Sbjct: 142 GQHFKATGKLVSLSEQNLVDCSRVEGNNGCNGGLMDNGFTYIQQNGGIDTEESYPYTGKD 201

Query: 230 GTCSAAQKPAAAKISNYEEVPSGDEQALLKAV-SMQPVSIAIAAYSTEFQSYKEGIFN-- 286
           G C+  +    A++  + +VP  DE AL  AV S+ PVS+AI A +  FQ YKEG+++  
Sbjct: 202 GDCAFNENSVGARVKGFVDVPQRDEAALQAAVASVGPVSVAIDASNDSFQYYKEGVYDEP 261

Query: 287 GVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSY 345
               +QLDH V +VG+G TE+G +YWL+KNSWG TWG  GY+K++R+ E  CGI + +SY
Sbjct: 262 SCSFSQLDHGVLVVGYG-TENGVDYWLVKNSWGPTWGQDGYIKMMRNKENQCGIASMASY 320

Query: 346 P 346
           P
Sbjct: 321 P 321


>gi|269784818|ref|NP_001161481.1| cathepsin L1 precursor [Gallus gallus]
          Length = 353

 Score =  246 bits (627), Expect = 2e-62,   Method: Compositional matrix adjust.
 Identities = 147/361 (40%), Positives = 204/361 (56%), Gaps = 26/361 (7%)

Query: 2   VLIFERSGSFKINTTPMFIIITL---LVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSY 58
           VL   R  S  +N      I++L   L   A +V     +H Q        W + H + Y
Sbjct: 3   VLFLARRLSRFVNMNVCLTILSLCLGLAFAAPRVDPDLDSHWQL-------WKSWHSKDY 55

Query: 59  KDELEKEMRLKIFKENLEYIEKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPS 115
             E E+  R  ++++NL+ IE  N +   G  +YKLG NQF D+T +EFR L  GYK   
Sbjct: 56  H-EREESWRRVVWEKNLKMIELHNLDHSLGKHSYKLGMNQFGDMTAEEFRQLMNGYKHKK 114

Query: 116 PSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKI 175
            S R    S F     S  + P S+DWR+KG VTP+K+Q +CG CWAF+   A+EG    
Sbjct: 115 -SERKYRGSQF--LEPSFLEAPRSVDWREKGYVTPVKDQGQCGSCWAFSTTGALEGQHFR 171

Query: 176 RSGNLIQLSEQQLLDCST-NGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPG-TCS 233
           ++G L+ LSEQ L+DCS   GN GC GG  ++AF Y+  N GI +E+ YPY A     C 
Sbjct: 172 KTGKLVSLSEQNLVDCSRPEGNQGCNGGLMDQAFQYVQDNGGIDSEESYPYTAKDDEDCR 231

Query: 234 AAQKPAAAKISNYEEVPSGDEQALLKAV-SMQPVSIAIAAYSTEFQSYKEGI-FNGVCGT 291
              +  AA  + + ++P G E+AL+KAV S+ PVS+AI A  + FQ Y+ GI +   C +
Sbjct: 232 YKAEYNAANDTGFVDIPQGHERALMKAVASVGPVSVAIDAGHSSFQFYQSGIYYEPDCSS 291

Query: 292 Q-LDHAVTIVGF---GTTEDGANYWLIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYP 346
           + LDH V +VG+   G   DG  YW++KNSWG  WGD GY+ + +D +  CGI T +SYP
Sbjct: 292 EDLDHGVLVVGYGFEGEDVDGKKYWIVKNSWGEKWGDKGYIYMAKDRKNHCGIATAASYP 351

Query: 347 L 347
           L
Sbjct: 352 L 352


>gi|112490572|pdb|2FO5|A Chain A, Crystal Structure Of Recombinant Barley Cysteine
           Endoprotease B Isoform 2 (Ep-B2) In Complex With
           Leupeptin
 gi|112490573|pdb|2FO5|B Chain B, Crystal Structure Of Recombinant Barley Cysteine
           Endoprotease B Isoform 2 (Ep-B2) In Complex With
           Leupeptin
 gi|112490574|pdb|2FO5|C Chain C, Crystal Structure Of Recombinant Barley Cysteine
           Endoprotease B Isoform 2 (Ep-B2) In Complex With
           Leupeptin
 gi|112490575|pdb|2FO5|D Chain D, Crystal Structure Of Recombinant Barley Cysteine
           Endoprotease B Isoform 2 (Ep-B2) In Complex With
           Leupeptin
          Length = 262

 Score =  246 bits (627), Expect = 2e-62,   Method: Compositional matrix adjust.
 Identities = 110/223 (49%), Positives = 152/223 (68%), Gaps = 8/223 (3%)

Query: 133 MTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS 192
           ++D+P S+DWR KGAVT +K+Q +CG CWAF+ V +VEGI  IR+G+L+ LSEQ+L+DC 
Sbjct: 1   VSDLPPSVDWRQKGAVTGVKDQGKCGSCWAFSTVVSVEGINAIRTGSLVSLSEQELIDCD 60

Query: 193 TNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQ----KPAAAKISNYEE 248
           T  N+GC GG  + AF YI  N G+ TE  YPY+A  GTC+ A+     P    I  +++
Sbjct: 61  TADNDGCQGGLMDNAFEYIKNNGGLITEAAYPYRAARGTCNVARAAQNSPVVVHIDGHQD 120

Query: 249 VPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDG 308
           VP+  E+ L +AV+ QPVS+A+ A    F  Y EG+F G CGT+LDH V +VG+G  EDG
Sbjct: 121 VPANSEEDLARAVANQPVSVAVEASGKAFMFYSEGVFTGECGTELDHGVAVVGYGVAEDG 180

Query: 309 ANYWLIKNSWGNTWGDAGYMKIVRDE----GLCGIGTRSSYPL 347
             YW +KNSWG +WG+ GY+++ +D     GLCGI   +SYP+
Sbjct: 181 KAYWTVKNSWGPSWGEQGYIRVEKDSGASGGLCGIAMEASYPV 223


>gi|158268253|gb|ABW25046.1| cathepsin L-like protease [Strongylus vulgaris]
          Length = 354

 Score =  246 bits (627), Expect = 2e-62,   Method: Compositional matrix adjust.
 Identities = 141/356 (39%), Positives = 206/356 (57%), Gaps = 27/356 (7%)

Query: 18  MFIIITLLVSCASQVVS----SRSTH----------EQSVVEIHEKW---MAQHGRSYKD 60
           MF +++L++ CAS   S    SR  H           Q + E  + W       G+SY  
Sbjct: 1   MFRLLSLVLLCASVFASIDSGSRHDHTIRLHRVKSLRQKIDEAFKLWDDYKESFGKSYNK 60

Query: 61  ELEKEMRLKIFKENLEYIEKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPS 117
           + E +  ++ F +N+ +I++ N+E   G +T+++G N  +DL   ++R L  GY+     
Sbjct: 61  DEENDY-MEAFVKNVIHIDEHNQEHRLGRKTFEMGLNSIADLPFSQYRKL-NGYRHRRNF 118

Query: 118 HRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRS 177
             S  S+  K+      ++P S+DWRDKG VT +KNQ  CG CWAF+A  A+EG     S
Sbjct: 119 GDSMQSNGTKWLAPFNVEIPDSVDWRDKGLVTDVKNQGMCGSCWAFSATGALEGQHARAS 178

Query: 178 GNLIQLSEQQLLDCSTN-GNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQ 236
           G ++ LSEQ L+DCST  GN+GC GG  + AF YI  N GI TE+ YPY      C   +
Sbjct: 179 GKMVSLSEQNLVDCSTKYGNHGCNGGLMDLAFEYIKDNHGIDTEESYPYVGRETKCHFKK 238

Query: 237 KPAAAKISNYEEVPSGDEQALLKAVSMQ-PVSIAIAAYSTEFQSYKEGI-FNGVCGT-QL 293
           K   A+   + ++P GDE+AL  AV+ Q P+SIAI A    FQ YK+G+ ++  C + +L
Sbjct: 239 KDIGAEDKGFVDLPEGDEEALKVAVATQGPISIAIDAGHRTFQLYKKGVYYDEECSSEEL 298

Query: 294 DHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRDE-GLCGIGTRSSYPLA 348
           DH V +VG+GT  +  +YWLIKNSWG  WG+ GY++I R+    CG+ T++SYPL 
Sbjct: 299 DHGVLLVGYGTDPEAGDYWLIKNSWGPGWGEKGYIRIARNRSNHCGVATKASYPLV 354


>gi|38147395|gb|AAR12010.1| cathepsin L-like proteinase [Triatoma infestans]
          Length = 328

 Score =  246 bits (627), Expect = 2e-62,   Method: Compositional matrix adjust.
 Identities = 132/311 (42%), Positives = 190/311 (61%), Gaps = 18/311 (5%)

Query: 48  EKWMA---QHGRSYKDELEKEMRLKIFKENLEYIEKANKE---GNRTYKLGTNQFSDLTN 101
           E+W+A   Q G+SYK+  E+  R+ ++KEN   I++ NK    G  +YKL  N F DL  
Sbjct: 24  EEWLAFKAQFGKSYKNSFEELFRMNVYKENQRKIDEHNKRYENGEVSYKLKMNHFGDLMQ 83

Query: 102 DEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCW 161
            EF+AL    K+   + +  +   F+    +   +P  +DWR KGAVTP+K+  +CG CW
Sbjct: 84  HEFKALN---KLKRSAKQQNSGEVFR---ATGGKLPAKVDWRQKGAVTPVKDPGQCGSCW 137

Query: 162 AFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GNNGCLGGSREKAFAYIIQNQGIATE 220
           AF++  ++ G   +++  L+ LSEQQL+DCS N GN+GC GG   +AF YI  N GI TE
Sbjct: 138 AFSSTGSLGGQLFLKNKKLVSLSEQQLVDCSGNYGNDGCDGGIMVQAFQYIKGNGGIDTE 197

Query: 221 DEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVS-MQPVSIAIAAYSTEFQS 279
             YPY+A    C    K  A     Y ++  GDE AL +AV+ + P+S+AI A +  FQ 
Sbjct: 198 GSYPYEAEDDKCRYKTKSVAGTDKGYVDIAQGDENALKEAVAEIGPISVAIDAGNLSFQF 257

Query: 280 YKEGIFNG--VCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRDE-GL 336
           Y EGI++      T+LDH V +VG+G TE+G +YWL+KNSWG +WG+ GY+KI R+    
Sbjct: 258 YSEGIYDEPFCSNTELDHGVLVVGYG-TENGQDYWLVKNSWGPSWGENGYIKIARNHNNH 316

Query: 337 CGIGTRSSYPL 347
           CGI + +SYP+
Sbjct: 317 CGIASMASYPI 327


>gi|340368362|ref|XP_003382721.1| PREDICTED: cathepsin L1-like [Amphimedon queenslandica]
          Length = 329

 Score =  245 bits (626), Expect = 2e-62,   Method: Compositional matrix adjust.
 Identities = 129/306 (42%), Positives = 186/306 (60%), Gaps = 12/306 (3%)

Query: 48  EKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRT-YKLGTNQFSDLTNDEFRA 106
           E W A +G+SY    E++ R   ++EN   I+  N + ++  Y L  N F DLT+ EF +
Sbjct: 28  ELWKATYGKSYLTLEEEKYRRDTWEENSLLIKTHNTDSDKHGYTLEMNSFGDLTSAEFSS 87

Query: 107 LYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAV 166
           LY GY+    +  S  SS+ +        +P+SLDWRDK  VT +KNQ +CG CWAF+  
Sbjct: 88  LYNGYRQNLETSGSVFSSSLR------NAMPSSLDWRDKKVVTDVKNQGKCGSCWAFSTT 141

Query: 167 AAVEGITKIRSGNLIQLSEQQLLDCSTN-GNNGCLGGSREKAFAYIIQNQGIATEDEYPY 225
            ++EG+  +++G+L+ LSEQQL+DCS   GNNGC GG+   AF YI    G  TE+ YPY
Sbjct: 142 GSLEGLHALKTGHLVSLSEQQLMDCSVKYGNNGCDGGNMRSAFQYIKDAGGDDTEESYPY 201

Query: 226 QAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAV-SMQPVSIAIAAYSTEFQSYKEGI 284
            A   +C    K   A    Y  +PSGDE +L+ A+  + P+S+A+ A    FQ YK+GI
Sbjct: 202 TAKNESCRFDPKKVGATDEGYVRIPSGDEVSLMHALYEVGPISVAMDAGLKTFQFYKKGI 261

Query: 285 FNG-VC-GTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRDEG-LCGIGT 341
           ++  +C  T L+H VT++G+G + DG+ YWL+KNSWG  WG  GY  + R  G +CG+ T
Sbjct: 262 YSDYLCSNTHLNHGVTLIGYGESSDGSPYWLVKNSWGKDWGIDGYFMLARYVGNMCGVAT 321

Query: 342 RSSYPL 347
            +SYP+
Sbjct: 322 DASYPI 327


>gi|2765358|emb|CAA74241.1| cathepsin L [Litopenaeus vannamei]
          Length = 325

 Score =  245 bits (626), Expect = 2e-62,   Method: Compositional matrix adjust.
 Identities = 126/316 (39%), Positives = 191/316 (60%), Gaps = 17/316 (5%)

Query: 43  VVEIHEKWM---AQHGRSYKDELEKEMRLKIFKENLEYIEKAN---KEGNRTYKLGTNQF 96
           +  + ++W    A+HGR Y    E+  RL +F++N ++I+  N   + G  T+ L  NQF
Sbjct: 15  IPSLRQQWQNFKAEHGRRYASVQEERYRLSVFEQNQQFIDDHNARFENGEVTFTLQMNQF 74

Query: 97  SDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKE 156
            D+T++E  A   G+ + +P+ R   ++  K  + ++   P  +DWR KGAVTP+K+QK+
Sbjct: 75  GDMTSEEIVATMNGF-LGAPTRRP--AAVLKADDETL---PEKVDWRTKGAVTPVKDQKQ 128

Query: 157 CGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNN-GCLGGSREKAFAYIIQNQ 215
           CG CWAF+   ++EG   ++ G L+ LSEQ L+DCS    N GC+GG  ++AF YI  N+
Sbjct: 129 CGSCWAFSTTGSLEGQHFLKDGKLVSLSEQNLVDCSDKFRNMGCMGGLMDQAFRYIKANK 188

Query: 216 GIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAV-SMQPVSIAIAAYS 274
           GI TED YPY+A  G C        A  + Y +V  G E AL KAV ++ P+S+ I A  
Sbjct: 189 GIDTEDSYPYEAQDGKCRFDASNVGATDTGYVDVEHGSESALKKAVATIGPISVGIDASQ 248

Query: 275 TEFQSYKEGIFNG--VCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR 332
           + F  Y  G+++      T LDH V  VG+G+ E+G ++WL+KNSW  +WGD GY+K+ R
Sbjct: 249 STFHFYHTGVYHDDHCSSTMLDHGVLAVGYGSDENGGDFWLVKNSWNTSWGDKGYIKMSR 308

Query: 333 DE-GLCGIGTRSSYPL 347
           +    CGI +++SYPL
Sbjct: 309 NRNNNCGIASQASYPL 324


>gi|303283194|ref|XP_003060888.1| predicted protein [Micromonas pusilla CCMP1545]
 gi|226457239|gb|EEH54538.1| predicted protein [Micromonas pusilla CCMP1545]
          Length = 422

 Score =  245 bits (626), Expect = 2e-62,   Method: Compositional matrix adjust.
 Identities = 130/314 (41%), Positives = 194/314 (61%), Gaps = 17/314 (5%)

Query: 48  EKWMAQHGRSYKDELEKEMRLKIFKENLEYIE---KANKEGNRTYKLGTNQFSDLTNDEF 104
           ++W+A HG++Y    E+  RL IF +N E++    +A+  G +++ L  N  +DLT +EF
Sbjct: 71  DRWLATHGKAYACPKERAKRLAIFADNAEFVRVHNEAHAAGKKSHWLRLNHLADLTREEF 130

Query: 105 RALYTGYKMPSPSHRSTTSSTFKYQNLSMTDV--PTSLDWRDKGAVTPIKNQKECGCCWA 162
           + +  GY   S     ++S      N    DV  P ++DW  +GAVTP+KNQ +CG CWA
Sbjct: 131 KHML-GYDA-SKKRVESSSPPVDAANWEYADVTPPETMDWVSRGAVTPVKNQGQCGSCWA 188

Query: 163 FAAVAAVEGITKIRSGNLIQLSEQQLLDCST-NGNNGCLGGSREKAFAYIIQNQGIATED 221
           F+ V AVEG+  +++G+LI LSEQ+L+ C+   GNNGC GG  +  F +I++N+G+  E+
Sbjct: 189 FSTVGAVEGVVAVKTGDLISLSEQELVSCAKIGGNNGCKGGLMDNGFEWIVENRGVDDEE 248

Query: 222 EYPYQAVPGTCS--AAQKPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQS 279
           ++ Y A    C+    ++  AA I  +++VP  DE AL KAVS QPV++AI A   EFQ 
Sbjct: 249 DWGYLAKDRRCNWFKKRRAKAASIDGFKDVPRNDEDALKKAVSQQPVAVAIEADHREFQL 308

Query: 280 YKEGIFNGVCGTQLDHAVTIVGFGTTEDGA---NYWLIKNSWGNTWGDAGYMKIVR---- 332
           Y  G+F+G CGT LDH V +VG+G   + A   +YW +KNSWG  WG+ GY++I R    
Sbjct: 309 YSGGVFDGECGTNLDHGVLVVGYGYDGESAGHKHYWTVKNSWGAKWGEEGYIRIARGGMG 368

Query: 333 DEGLCGIGTRSSYP 346
             G CG+  ++SYP
Sbjct: 369 PAGQCGVAMQASYP 382


>gi|158268255|gb|ABW25047.1| cathepsin L-like protease [Strongylus vulgaris]
          Length = 354

 Score =  245 bits (626), Expect = 2e-62,   Method: Compositional matrix adjust.
 Identities = 141/355 (39%), Positives = 206/355 (58%), Gaps = 27/355 (7%)

Query: 18  MFIIITLLVSCASQVVS----SRSTH----------EQSVVEIHEKW---MAQHGRSYKD 60
           MF +++L++ CAS   S    SR  H           Q + E  + W       G+SY  
Sbjct: 1   MFRLLSLVLLCASVFASIDSGSRRDHTIRLHRVKSLRQKIDEAFKLWDDYKEAFGKSYNK 60

Query: 61  ELEKEMRLKIFKENLEYIEKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPS 117
           + E +  ++ F +N+ +I++ N+E   G +T+++G N  +DL   ++R L  GY+     
Sbjct: 61  DEENDY-MEAFVKNVIHIDEHNQEHRLGRKTFEMGLNSIADLPFSQYRKL-NGYRHRRNF 118

Query: 118 HRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRS 177
             S  S+  K+      ++P S+DWRDKG VT +KNQ  CG CWAF+A  A+EG     S
Sbjct: 119 GDSMQSNGTKWLAPFNVEIPDSVDWRDKGLVTDVKNQGMCGSCWAFSATGALEGQHARAS 178

Query: 178 GNLIQLSEQQLLDCSTN-GNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQ 236
           G ++ LSEQ L+DCST  GN+GC GG  + AF YI  N GI TE+ YPY      C   +
Sbjct: 179 GKMVSLSEQNLVDCSTKYGNHGCNGGLMDLAFEYIKDNHGIDTEESYPYVGRETKCHFKK 238

Query: 237 KPAAAKISNYEEVPSGDEQALLKAVSMQ-PVSIAIAAYSTEFQSYKEGI-FNGVCGT-QL 293
           K   A+   + ++P GDE+AL  AV+ Q P+SIAI A    FQ YK+G+ ++  C + +L
Sbjct: 239 KDIGAEDKGFVDLPEGDEEALKVAVATQGPISIAIDAGHRTFQLYKKGVYYDEECSSEEL 298

Query: 294 DHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRDE-GLCGIGTRSSYPL 347
           DH V +VG+GT  +  +YWLIKNSWG  WG+ GY++I R+    CG+ T++SYPL
Sbjct: 299 DHGVLLVGYGTDPEAGDYWLIKNSWGPGWGEKGYIRIARNRSNHCGVATKASYPL 353


>gi|110349475|gb|ABG73218.1| cathepsin L 2 precursor [Diaprepes abbreviatus]
          Length = 348

 Score =  245 bits (625), Expect = 2e-62,   Method: Compositional matrix adjust.
 Identities = 142/349 (40%), Positives = 194/349 (55%), Gaps = 26/349 (7%)

Query: 21  IITLLVSCASQVVSSRSTHEQSVV-EIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIE 79
           + +L+V  A+ V  S +   Q +V E  E++  +HG+ Y+ E E E R  +F ENL  I 
Sbjct: 1   MYSLVVLLATLVAYSHAISYQVLVQEQWEQFKLEHGKVYESESENEYRQSVFMENLFQIN 60

Query: 80  KANK---EGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKY-------- 128
           + NK    G  +Y++  N   DLT DEF  +YT   MP        S +  +        
Sbjct: 61  EHNKLYEMGLSSYQMAMNHLGDLTKDEFMRIYT-VNMPQLPQSENLSDSEPWLDLPQDLQ 119

Query: 129 --------QNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNL 180
                    NL   D+PT +DWR KGAVTP+KNQ+ CG CW+F+A  A+E     ++  L
Sbjct: 120 GFVTYALPTNLDEVDLPTDIDWRQKGAVTPVKNQRNCGSCWSFSATGALEAQWFKKTNKL 179

Query: 181 IQLSEQQLLDCSTN-GNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPA 239
           I LSEQQL+DCS   GN+GC GG    AF YI +N GI TE  YPY A  G C+      
Sbjct: 180 ISLSEQQLVDCSGRYGNHGCHGGWMHWAFGYIKENGGIDTEQSYPYTAKDGRCAYKPGNK 239

Query: 240 AAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFN-GVCGTQLDHAVT 298
           AA +S    VP G+ Q   K  S+ P+SIA A  S +FQ Y  G+++   CG  L+HA+ 
Sbjct: 240 AATVSQVIMVPRGENQLAAKVSSVGPISIA-AEVSHKFQFYHSGVYDEPQCGHSLNHAML 298

Query: 299 IVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRDE-GLCGIGTRSSYP 346
            VG+G+   G N+WL+KNSWG  WGD GY+++ +D+   CGI   +SYP
Sbjct: 299 AVGYGSM-GGKNFWLVKNSWGTGWGDQGYIRMAKDKNNQCGIALMASYP 346


>gi|389608655|dbj|BAM17937.1| cathepsin L [Papilio xuthus]
          Length = 341

 Score =  245 bits (625), Expect = 3e-62,   Method: Compositional matrix adjust.
 Identities = 131/344 (38%), Positives = 198/344 (57%), Gaps = 23/344 (6%)

Query: 20  IIITLLVSCASQVVSSRSTHEQSVVEIHEKWMA---QHGRSYKDELEKEMRLKIFKENLE 76
           ++I L V  A+  VS           + E+W A   +H + Y  E+E + R+KI+ EN  
Sbjct: 4   LVILLCVVAAASAVSFFDL-------VKEEWNAFKMEHQKQYDSEVEDKFRMKIYAENKH 56

Query: 77  YIEKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSM 133
            I K N++   G  +++L  N++ D+ + EF     G+   + + +     +   +  + 
Sbjct: 57  NIAKHNQKYARGEVSFRLKQNKYGDMLHHEFVHTMNGFNKTTKNSKGLFGKSAGERGATF 116

Query: 134 -----TDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQL 188
                  +P  +DWR  GAVT +K+Q +CG CW+F++  A+EG    R+  L+ LSEQ L
Sbjct: 117 ITPANVHLPDHVDWRKHGAVTEVKDQGKCGSCWSFSSTGALEGQHYRRTNILVSLSEQNL 176

Query: 189 LDCSTN-GNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYE 247
           +DCS   GNNGC GG  + AF YI  N+GI TE  YPY+ +   C    K   A  + + 
Sbjct: 177 IDCSAAYGNNGCNGGLMDNAFKYIKDNRGIDTEKSYPYEGIDDKCRYNPKNTGADDNGFV 236

Query: 248 EVPSGDEQALLKAV-SMQPVSIAIAAYSTEFQSYKEGI-FNGVCGTQ-LDHAVTIVGFGT 304
           ++PSGDE  L+ AV ++ PVS+AI A  + FQ Y +G+ F+  C +  LDH V +VG+GT
Sbjct: 237 DIPSGDEGKLMAAVATVGPVSVAIDASQSSFQFYSDGVYFDENCSSSSLDHGVLVVGYGT 296

Query: 305 TEDGANYWLIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYPL 347
            E+G +YWL+KNSWG +WGD GY+K+ R+ +  CGI T +SYPL
Sbjct: 297 DENGGDYWLVKNSWGRSWGDLGYIKMARNRDNHCGIATAASYPL 340


>gi|417409876|gb|JAA51427.1| Putative cathepsin s, partial [Desmodus rotundus]
          Length = 342

 Score =  244 bits (624), Expect = 3e-62,   Method: Compositional matrix adjust.
 Identities = 133/338 (39%), Positives = 200/338 (59%), Gaps = 18/338 (5%)

Query: 18  MFIIITLLVSCASQVVSSRSTHEQSVVEIH-EKWMAQHGRSYKDELEKEMRLKIFKENLE 76
           M  ++ +L+ C+S +      H+   ++ H + W   +G+ YK++ E+ +R  I+++NL+
Sbjct: 12  MKWLVLVLLGCSSAMAQ---LHKDPTLDRHWDLWKKTYGKQYKEKNEEGVRRLIWEKNLK 68

Query: 77  YIEKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSM 133
           ++   N E   G  +Y LG N   D+T++E  AL +  ++PS   R+ T  +   Q L  
Sbjct: 69  FVMLHNLEHSMGMHSYDLGMNHLGDMTSEEVTALMSSLRVPSQWQRNVTYKSNPNQKL-- 126

Query: 134 TDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCST 193
              P S+DWRDKG VT +K Q  CG CWAF+AV A+E   K+++G L+ LS Q L+DCS 
Sbjct: 127 ---PDSVDWRDKGCVTDVKYQGSCGSCWAFSAVGALEAQVKLKTGKLVSLSAQNLVDCSV 183

Query: 194 N--GNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPS 251
               N GC GG   +AF YII N GI +E  YPY+A+ G C    K  AA  S Y E+P 
Sbjct: 184 GKYSNRGCNGGFMTEAFQYIIDNNGIESEASYPYKAMDGKCQYDSKYRAATCSRYTELPE 243

Query: 252 GDEQALLKAVSMQ-PVSIAIAAYSTEFQSYKEGI-FNGVCGTQLDHAVTIVGFGTTEDGA 309
             E AL +AV+ + PVS+AI A    F  Y+ G+ ++  C   ++H V +VG+G   +G 
Sbjct: 244 DSEDALKEAVANKGPVSVAIDASHPSFFLYRSGVYYDPACTLHVNHGVLVVGYGNL-NGK 302

Query: 310 NYWLIKNSWGNTWGDAGYMKIVRDEG-LCGIGTRSSYP 346
           +YWL+KNSWG  +GD GY+++ R+ G  CGI + +SYP
Sbjct: 303 DYWLVKNSWGLHFGDQGYIRMARNSGNHCGIASYASYP 340


>gi|244539471|dbj|BAH82657.1| cysteine protease [Lotus japonicus]
          Length = 286

 Score =  244 bits (624), Expect = 3e-62,   Method: Compositional matrix adjust.
 Identities = 119/252 (47%), Positives = 174/252 (69%), Gaps = 6/252 (2%)

Query: 43  VVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTND 102
           ++E+ E WM++HG+ Y+   EK +R +IFK+NL++I++ NK  +  Y LG N+F+DL++ 
Sbjct: 4   LIELFESWMSRHGKIYESIEEKLLRFEIFKDNLKHIDETNKVVS-NYWLGLNEFADLSHH 62

Query: 103 EFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWA 162
           EF+  Y G K+   + R + S  F Y+++   D+P S+DWR KGAVT IKNQ  CG CWA
Sbjct: 63  EFKKQYLGLKVDFSTRRES-SEEFTYRDV---DLPKSVDWRKKGAVTNIKNQGSCGSCWA 118

Query: 163 FAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDE 222
           F+ VAAVEGI +I +GNL  LSEQ+L+DC    N+GC GG  + AF++I++N G+  ED+
Sbjct: 119 FSTVAAVEGINQIVTGNLTSLSEQELIDCDRTYNSGCNGGLMDYAFSFIVENGGLHKEDD 178

Query: 223 YPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYK 281
           YPY    GTC  +++ +    IS Y +VP  +EQ+LLKA++ QP+S+AI A   +FQ Y 
Sbjct: 179 YPYIMEEGTCEMSKEESQVVTISGYHDVPQNNEQSLLKALANQPLSVAIEASGRDFQFYS 238

Query: 282 EGIFNGVCGTQL 293
            G+F+G CGTQL
Sbjct: 239 GGVFDGHCGTQL 250


>gi|327289213|ref|XP_003229319.1| PREDICTED: cathepsin S-like [Anolis carolinensis]
          Length = 333

 Score =  244 bits (624), Expect = 4e-62,   Method: Compositional matrix adjust.
 Identities = 132/317 (41%), Positives = 198/317 (62%), Gaps = 14/317 (4%)

Query: 40  EQSVVEIH-EKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKE---GNRTYKLGTNQ 95
           + S+++ H E W  ++ + Y+++ E+ +R  I+++NL ++   N E   G  +Y+LG N 
Sbjct: 21  KDSMLDGHWELWKKKYNKEYQNKEEEGVRRVIWEKNLRFVMLHNLEQSLGLHSYELGMNH 80

Query: 96  FSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQK 155
             D+T++E  AL TG K+P    R++T     Y        P ++DWR+KG VT +KNQ 
Sbjct: 81  LGDMTSEEVTALMTGLKIPVSQSRNST----LYWARQGASAPDTVDWREKGCVTNVKNQG 136

Query: 156 ECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GNNGCLGGSREKAFAYIIQN 214
            CG CWAF+AV A+E   K+++GNL+ LS Q L+DCS+  GN+GC GG    AF Y+I N
Sbjct: 137 SCGSCWAFSAVGALECQLKLKTGNLVSLSPQNLVDCSSAFGNHGCNGGYISAAFQYVIYN 196

Query: 215 QGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVS-MQPVSIAIAAY 273
            GI +E  YPY    GTC    +  AA  S Y ++PSG+E AL  AV+   PVS+AI A 
Sbjct: 197 NGIDSEASYPYTGQSGTCRYNLQGRAATCSRYVDLPSGNEAALKDAVANFGPVSVAIDAS 256

Query: 274 STEFQSYKEGIFNGVCGT--QLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIV 331
              F  +++G+++    T   ++H V +VG+G TEDG +YWL+KNSWG ++GD GY+KI 
Sbjct: 257 RPSFFLFRKGVYDDPSCTSAHINHGVLVVGYG-TEDGIDYWLVKNSWGVSFGDQGYIKIA 315

Query: 332 RD-EGLCGIGTRSSYPL 347
           R+ +  CGI ++ +YPL
Sbjct: 316 RNHDNRCGIASQCTYPL 332


>gi|6630974|gb|AAF19631.1|AF194427_1 cysteine proteinase precursor [Myxine glutinosa]
          Length = 324

 Score =  244 bits (624), Expect = 4e-62,   Method: Compositional matrix adjust.
 Identities = 131/308 (42%), Positives = 187/308 (60%), Gaps = 12/308 (3%)

Query: 48  EKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKAN---KEGNRTYKLGTNQFSDLTNDEF 104
           E W  ++G+SY    E+ +R ++++ NL+ +++ N    +G   Y+LG N ++DL N+EF
Sbjct: 20  ESWKGKYGKSYLGRGEEVLRKRVWESNLQIVQQHNVLADQGQANYRLGMNTYADLYNEEF 79

Query: 105 RALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFA 164
            AL     +     +S+T  TFK   L    +P+S+DWR++G VTP+K+Q +CG CW F+
Sbjct: 80  MALKGSGGLLQAKDKSSTQ-TFK--PLVGVTLPSSVDWRNQGYVTPVKDQGQCGSCWTFS 136

Query: 165 AVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GNNGCLGGSREKAFAYIIQNQGIATEDEY 223
           A  ++EG    ++GNL+ LSEQQL+DC+   GN GC GG  E A+ YI    G+  E  Y
Sbjct: 137 ATGSLEGQHFAKTGNLLSLSEQQLVDCAGRYGNYGCNGGLMESAYDYIKGVGGVELESAY 196

Query: 224 PYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAV-SMQPVSIAIAAYSTEFQSYKE 282
           PY A  G C   +    A    Y  +P GDEQAL++AV ++ PV+++I A    FQ Y+ 
Sbjct: 197 PYTARDGRCKFDRSKVVATCKGYVVIPVGDEQALMQAVGTIGPVAVSIDASGYSFQLYES 256

Query: 283 GI--FNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRDE-GLCGI 339
           G+  F     T LDH V  VG+G TE G NYWL+KNSWG  WGD GY+K+ +D+   CGI
Sbjct: 257 GVYDFRRCSSTNLDHGVLAVGYG-TEGGQNYWLVKNSWGPGWGDQGYIKMSKDKNNQCGI 315

Query: 340 GTRSSYPL 347
            T S YPL
Sbjct: 316 ATDSCYPL 323


>gi|7523482|dbj|BAA94210.1| putative cysteine protease [Oryza sativa Japonica Group]
 gi|10800060|dbj|BAB16480.1| putative cysteine protease [Oryza sativa Japonica Group]
          Length = 349

 Score =  244 bits (624), Expect = 4e-62,   Method: Compositional matrix adjust.
 Identities = 133/320 (41%), Positives = 182/320 (56%), Gaps = 29/320 (9%)

Query: 45  EIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEF 104
           ++ E+WMA+ G+ Y    EKE R  +F++N+ +I            L  NQF+DLTNDEF
Sbjct: 39  QMFEEWMAKFGKKYPCHGEKEYRFGVFRDNVRFIRSYRPPAGYNSALRVNQFADLTNDEF 98

Query: 105 RALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFA 164
            + +TG K P P            + +    +P  +DWR KGAVT +K+Q  CG CWAFA
Sbjct: 99  VSTHTGAKPPCPKDAP--------RGVDPIWLPCCIDWRYKGAVTDVKDQGACGSCWAFA 150

Query: 165 AVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYP 224
           AVAA+EG+T+IR+G L  LSEQ+L+DC T G++GC GG  ++AF  +    GI  E  Y 
Sbjct: 151 AVAAIEGLTQIRTGKLTPLSEQELVDCDT-GSSGCAGGHTDRAFELVAAKGGITAESGYR 209

Query: 225 YQAVPGTCSA--AQKPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKE 282
           Y+   G C A  A    AA+I  +  VP GDE+ L  AV+ QPV+  I A    FQ Y  
Sbjct: 210 YEGYRGKCRADDALFNHAARIGGHRAVPPGDERQLATAVARQPVTAYIDASGPAFQFYGS 269

Query: 283 GIFNGVCGTQL---------DHAVTIVGFGTTEDGAN---YWLIKNSWGNTWGDAGYMKI 330
           G+F G CG+           +HAVT+VG+   +DGA+   YW+ KNSWG TWG+ GY+ +
Sbjct: 270 GVFPGPCGSGSGAAAAAPTTNHAVTLVGY--CQDGASGKKYWVAKNSWGKTWGEKGYILL 327

Query: 331 VRD----EGLCGIGTRSSYP 346
            +D     G CG+     YP
Sbjct: 328 EKDVASPHGTCGVAVSPFYP 347


>gi|307111936|gb|EFN60170.1| hypothetical protein CHLNCDRAFT_59551 [Chlorella variabilis]
          Length = 364

 Score =  244 bits (623), Expect = 4e-62,   Method: Compositional matrix adjust.
 Identities = 134/350 (38%), Positives = 194/350 (55%), Gaps = 29/350 (8%)

Query: 23  TLLVSCASQVVSSRSTHE----------QSVVEIHEKWM----AQHGRSYKDELE-KEMR 67
            LLV+C+   V++    E          +S  E  + W+        R+Y    E  E R
Sbjct: 12  VLLVACSCLAVAAGFRFENHRLFIQQAIESPREAFDFWVHTVKPPSNRAYASSAEVYERR 71

Query: 68  LKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFK 127
             I+ +NL +  + N   + ++ L    ++DL+ DE+R+   GY       R   ++ F 
Sbjct: 72  FNIWLDNLRFAHEYNAR-HTSHWLSMGVYADLSQDEYRSKALGYNAHLHKKRPLRAAPFL 130

Query: 128 YQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQ 187
           Y+    T  P  +DW   GAVTP+K+Q  CG CWAF+   AVEG   I +G L+ LSEQ 
Sbjct: 131 YKG---TVPPEEVDWVAGGAVTPVKDQLLCGSCWAFSTTGAVEGANAIATGKLVSLSEQM 187

Query: 188 LLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQ-KPAAAKISNY 246
           L+DC    + GC GG  + AF +I+ N GI TED+YPY+A  G C   + +     I  Y
Sbjct: 188 LVDCDREYDTGCRGGFMDSAFDFIVNNGGIDTEDDYPYRAEDGICQDNRTRRHVVTIDGY 247

Query: 247 EEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTE 306
           ++VP  DE AL+KAV+ QPVS+AI A    FQ Y  G+F+  CGT LDHAV +VG+GT  
Sbjct: 248 QDVPPNDENALMKAVAHQPVSVAIEADQLAFQLYGGGVFDAECGTALDHAVLVVGYGTAS 307

Query: 307 DGAN---YWLIKNSWGNTWGDAGYMKIVRD------EGLCGIGTRSSYPL 347
           +G +   YWL+KNSWG  WG+ GY++++R+      EG CG+   +S+P+
Sbjct: 308 NGTHNLPYWLVKNSWGAEWGEKGYIRLLRNLGKDAPEGQCGLAMYASFPI 357


>gi|116794072|gb|ABK26996.1| unknown [Picea sitchensis]
          Length = 367

 Score =  244 bits (623), Expect = 4e-62,   Method: Compositional matrix adjust.
 Identities = 137/347 (39%), Positives = 195/347 (56%), Gaps = 25/347 (7%)

Query: 22  ITLLVSCASQVVSSRSTHE----QSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
           I  LVS A  V  S    +      +V + ++W+ +HG+ Y    EK  RL+IF+ NL+Y
Sbjct: 14  IICLVSAAKAVQHSYEVGDINSGNGLVRLFDRWLGRHGKLYGSHEEKARRLQIFRTNLQY 73

Query: 78  IEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGY--KMPSPSHRSTTSSTFKYQNLSMT- 134
           I   NK  N +++LG N+F+DLTN+EF+  Y G   K      R+          L  T 
Sbjct: 74  IHAHNKNSNSSFRLGLNKFADLTNEEFKTRYFGKNSKQWRDRRRTELEGAELRPVLKQTV 133

Query: 135 -------DVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQ 187
                   + +SLDWR KGAVT +K+Q +CG CWAF+   A+EG+  I +G L+ LSEQ+
Sbjct: 134 GSQSSSCSIASSLDWRKKGAVTGVKDQAQCGSCWAFSTTGAIEGVNFISTGKLVSLSEQE 193

Query: 188 LLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAA-AKISNY 246
           L+ C    N GC GG  + AF ++IQN GI TE +Y Y  V  TC+  ++      I  Y
Sbjct: 194 LVACDAT-NYGCEGGDMDYAFTWVIQNGGIDTEKDYSYTGVDSTCNTNKEAKKIVSIDGY 252

Query: 247 EEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCG---TQLDHAVTIVGFG 303
            +V S D+ ALL A   QPVS+ I   + +FQ Y  GI++G C      +DHAV +VG+ 
Sbjct: 253 TDV-SPDDSALLCAAGSQPVSVGIDGSAIDFQLYTGGIYDGDCSGNPDDIDHAVLVVGY- 310

Query: 304 TTEDGANYWLIKNSWGNTWGDAGYMKIVRDE----GLCGIGTRSSYP 346
           + ++G +YW++KNSWG  WG  GY  I+R+     G+C I   +SYP
Sbjct: 311 SAKNGKDYWIVKNSWGTDWGLEGYFYILRNTELPYGVCAINAMASYP 357


>gi|151176971|gb|ABR88030.1| digestive cysteine protease [Dermestes frischii]
          Length = 339

 Score =  244 bits (623), Expect = 4e-62,   Method: Compositional matrix adjust.
 Identities = 132/343 (38%), Positives = 198/343 (57%), Gaps = 21/343 (6%)

Query: 19  FIIITLLVSCASQVVSSRSTHEQSVVEIHEKW---MAQHGRSYKDELEKEMRLKIFKENL 75
           F ++ L+    +Q VS           + E+W     QH + YK + E++ R+KIF EN 
Sbjct: 3   FFVLALVFIVGAQAVSFFDL-------VQEQWGTFKLQHKKQYKSDTEEKFRMKIFMENS 55

Query: 76  EYIEKANK---EGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNL- 131
             + K NK    G  +YKL  N+++D+ + EF     G+     +    TS   +     
Sbjct: 56  HKVAKXNKLYEMGLVSYKLKINKYADMLHHEFVHTVNGFNRTKNTPLLGTSEDEQGATFI 115

Query: 132 --SMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLL 189
             +    P ++DWR+ GAVT +K+Q  CG CW+F+A  A+EG    ++  L+ LSEQ L+
Sbjct: 116 APANVKFPENVDWREHGAVTXVKDQGHCGSCWSFSATGALEGQHFRKTNKLVSLSEQNLV 175

Query: 190 DCSTN-GNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEE 248
           DCST  GN+GC GG  + AF Y+  N GI TE  YPY A    C    K + A    + +
Sbjct: 176 DCSTKFGNDGCNGGLMDNAFKYVKYNHGIDTEASYPYHADDEKCHYNPKTSGATDRGFVD 235

Query: 249 VPSGDEQALLKAV-SMQPVSIAIAAYSTEFQSYKEGI-FNGVCGT-QLDHAVTIVGFGTT 305
           +P+GDE+ L+ AV ++ PVS+AI A    FQ Y EG+ ++  C + +LDH V +VG+GT 
Sbjct: 236 IPTGDEEKLMAAVATVGPVSVAIDASHESFQLYSEGVYYDPECSSEELDHGVLVVGYGTD 295

Query: 306 EDGANYWLIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYPL 347
           E+G +YW++KNSWG +WG+ GY+K+ R+ +  CGI T++SYPL
Sbjct: 296 ENGQDYWIVKNSWGESWGEQGYIKMARNRDNNCGIATQASYPL 338


>gi|356552228|ref|XP_003544471.1| PREDICTED: LOW QUALITY PROTEIN: cysteine proteinase RD21a-like
           [Glycine max]
          Length = 351

 Score =  244 bits (623), Expect = 5e-62,   Method: Compositional matrix adjust.
 Identities = 131/339 (38%), Positives = 197/339 (58%), Gaps = 16/339 (4%)

Query: 18  MFIIITLLVSCASQVVSSRSTH--------EQSVVEIHEKWMAQHGRSYKDELEKEMRLK 69
           +F++  +  +    ++S  + H        +  V+ + E+W+ +H + Y    EKE R +
Sbjct: 8   LFMVFAVSSALDMSIISHDNAHADRATRRTDDEVMSMFEEWLVKHDKVYNALGEKEKRFQ 67

Query: 70  IFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQ 129
           IFK NL +I++ N   NRTYKLG N F+DLTN E+RA+Y       P     T     Y 
Sbjct: 68  IFKNNLRFIDERNSL-NRTYKLGLNVFADLTNAEYRAMYLRTWDDGPRLDLDTPPRNHYV 126

Query: 130 NLSMTDVPTSLDWRDKGAVTPIKNQ-KECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQL 188
                 +P S+DWR +GAVTP+KNQ   C  CWAF AV AVE + KI++G+LI LSEQ++
Sbjct: 127 PRVGDTIPKSVDWRKEGAVTPVKNQGATCNSCWAFTAVGAVESLVKIKTGDLISLSEQEV 186

Query: 189 LDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEE 248
           +DC+T+ + GC GG  +  + YI +N GI+ E +YPY+   G C + +K A   I  +  
Sbjct: 187 VDCTTSSSRGCGGGDIQHGYIYIRKN-GISLEKDYPYRGDEGKCDSNKKNAIVTIDGHGW 245

Query: 249 VPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDG 308
           VP+  E+AL +A+       A   Y  +F    +G+F G CGT+L+HA+ +VG+GT +DG
Sbjct: 246 VPTQLEEALNRALF---CYCAYFLYVDKF-FLCQGVFKGKCGTELNHALLLVGYGTEKDG 301

Query: 309 ANYWLIKNSWGNTWGDAGYMKIVRDEGLCGIGTRSSYPL 347
            +YW+ KNS+ + WG+ GY++I R    C  G    YP+
Sbjct: 302 -DYWIAKNSYSDKWGENGYIRIQRKLSTCKFGNGGYYPI 339


>gi|116563690|gb|ABJ99858.1| cathepsin L [Hippoglossus hippoglossus]
          Length = 336

 Score =  244 bits (623), Expect = 5e-62,   Method: Compositional matrix adjust.
 Identities = 141/341 (41%), Positives = 204/341 (59%), Gaps = 18/341 (5%)

Query: 19  FIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYI 78
            + + +L +C S V+S+     Q + E  + W + H + Y  E E+  R  ++++NL+ I
Sbjct: 1   MLPLLVLTACLSSVLSAPVLDAQ-LNEHWDLWKSWHSKKYH-EKEEGWRRMVWEKNLQKI 58

Query: 79  EKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTD 135
           E  N E   G  +++LG N F D+T++EFR +  GYK+ +   R  T S F   N  MT 
Sbjct: 59  ELHNLEHSMGTHSFRLGMNHFGDMTHEEFRQIMNGYKLKT--QRKFTGSLFMEPNF-MT- 114

Query: 136 VPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS-TN 194
            P+++DWR+KG VTP+K+Q +CG CWAF+   A+EG    ++G L+ LSEQ L+DCS   
Sbjct: 115 APSAVDWREKGYVTPVKDQGQCGSCWAFSTTGALEGQQFRKTGKLVSLSEQNLVDCSRPE 174

Query: 195 GNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPG-TCSAAQKPAAAKISNYEEVPSGD 253
           GN GC GG  ++AF Y+  NQG+ +ED YPY       C       +A  + + +VPSG 
Sbjct: 175 GNEGCGGGLMDQAFQYVTDNQGLDSEDSYPYTGTDDQPCHYDPLYNSANDTGFVDVPSGK 234

Query: 254 EQALLKAV-SMQPVSIAIAAYSTEFQSYKEGI-FNGVCGT-QLDHAVTIVGFG-TTED-- 307
           E AL+KAV S+ PVS+AI A    FQ Y+ GI +   C + +LDH V  VG+G   ED  
Sbjct: 235 EHALMKAVASVGPVSVAIDAGHESFQFYQSGIYYEKECSSEELDHGVLAVGYGFEGEDKM 294

Query: 308 GANYWLIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYPL 347
           G  +W++KNSWG  WGD GY+ + +D +  CGI T +SYPL
Sbjct: 295 GKKFWIVKNSWGEKWGDKGYIYMAKDRKNHCGIATAASYPL 335


>gi|157644745|gb|ABV59078.1| cathepsin L [Lates calcarifer]
          Length = 337

 Score =  244 bits (622), Expect = 5e-62,   Method: Compositional matrix adjust.
 Identities = 137/341 (40%), Positives = 196/341 (57%), Gaps = 17/341 (4%)

Query: 19  FIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYI 78
            + + +L  C S  +S+ S   Q + +  + W + H + Y  E E+  R  ++++NL+ I
Sbjct: 1   MLPLAVLAVCLSAALSAPSLDPQ-LDDHWDLWKSWHSKKYH-EKEEGWRRMVWEKNLKKI 58

Query: 79  EKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTD 135
           E  N E   G   Y+LG N F D+T++EFR +  GYK    + R    S F   N    +
Sbjct: 59  ELHNLEHSMGKHPYRLGMNHFGDMTHEEFRQIMNGYKQ-RKTERKFKGSLFMEPNF--LE 115

Query: 136 VPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS-TN 194
            P +LDWRDKG VTP+K+Q +CG CWAF+   A+EG    ++G L+ LSEQ L+DCS   
Sbjct: 116 APRALDWRDKGYVTPVKDQGQCGSCWAFSTTGALEGQQFRKTGKLVSLSEQNLVDCSRPE 175

Query: 195 GNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPG-TCSAAQKPAAAKISNYEEVPSGD 253
           GN GC GG  ++AF Y+  NQG+ +ED YPY       C       +A  + + +VPSG 
Sbjct: 176 GNEGCNGGLMDQAFQYVKDNQGLDSEDSYPYLGTDDQPCHYDPNYNSANDTGFVDVPSGK 235

Query: 254 EQALLKAV-SMQPVSIAIAAYSTEFQSYKEGIF--NGVCGTQLDHAVTIVGF---GTTED 307
           E+AL+KAV ++ PVS+AI A    FQ Y+ GI+        +LDH V +VG+   G   D
Sbjct: 236 ERALMKAVAAVGPVSVAIDAGHESFQFYQSGIYYEKDCSSEELDHGVLVVGYGYEGEDVD 295

Query: 308 GANYWLIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYPL 347
           G  YW++KNSW   WGD GY+ + +D +  CGI T +SYPL
Sbjct: 296 GKKYWIVKNSWSEKWGDKGYIYMAKDRKNHCGIATAASYPL 336


>gi|357216861|gb|AET71138.1| cysteine peptidase isoform b [Sphenophorus levis]
          Length = 324

 Score =  244 bits (622), Expect = 6e-62,   Method: Compositional matrix adjust.
 Identities = 136/336 (40%), Positives = 197/336 (58%), Gaps = 23/336 (6%)

Query: 19  FIIITLLV-SCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
           FI+ +LLV + ++ ++     H QS       +  +HG++YK++ E+  R  IF+ENL  
Sbjct: 4   FILASLLVVAVSATLLKEDGAHFQS-------FKLKHGKTYKNQAEETKRFAIFRENLRK 56

Query: 78  IEKAN---KEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMT 134
           IE  N   K+G  +Y  G N+F+D+T  EF+A+        PS  +T +    +Q     
Sbjct: 57  IEAHNAEYKQGIHSYTQGINKFADMTRAEFKAMLATQVKTKPSIVATKT----FQLADGV 112

Query: 135 DVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN 194
            VP S+DWR +  VTPIK+Q +CG CWAFA V + EG   + +G L + SEQQL+DC+T+
Sbjct: 113 SVPESIDWRSRNVVTPIKDQAQCGSCWAFAVVGSTEGAYALSTGKLTRFSEQQLVDCTTD 172

Query: 195 GNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDE 254
            N GC GG  +  F Y IQ  G+  E +YPY    G CS        K+S+Y  VP+ +E
Sbjct: 173 LNYGCDGGYLDDTFPY-IQTNGLELESDYPYTGYDGYCSYESSKVVTKVSSYVSVPA-NE 230

Query: 255 QALLKAV-SMQPVSIAIAAYSTEFQSYKEGIFNG-VCGTQ-LDHAVTIVGFGTTEDGANY 311
           QALL+AV +  PV+IAI A   + Q Y  GI +   C  + LDH V  VG+  +E+G +Y
Sbjct: 231 QALLEAVGTAGPVAIAINA--DDLQFYFSGIIDDKYCDPEYLDHGVLAVGY-DSENGRDY 287

Query: 312 WLIKNSWGNTWGDAGYMKIVRDEGLCGIGTRSSYPL 347
           WLIKNSWG  WG++GY + +R + +CG+   + YPL
Sbjct: 288 WLIKNSWGADWGESGYFRFLRGQNICGVKEDAVYPL 323


>gi|217323618|gb|ACK38176.1| midgut cysteine peptidase, partial [Sphenophorus levis]
          Length = 324

 Score =  243 bits (621), Expect = 7e-62,   Method: Compositional matrix adjust.
 Identities = 135/336 (40%), Positives = 198/336 (58%), Gaps = 23/336 (6%)

Query: 19  FIIITLLV-SCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
           FI+ +LLV + ++ ++     H QS       +  +HG++YK++ E+  R  IF+ENL  
Sbjct: 4   FILASLLVVAVSATLLKEDGVHFQS-------FKLKHGKTYKNQAEETKRFAIFRENLRK 56

Query: 78  IEKAN---KEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMT 134
           IE  N   K+G  +Y  G N+F+D+T  EF+A+        PS  +T +    +Q     
Sbjct: 57  IEAHNAEYKQGIHSYTQGINKFADMTRAEFKAMLATQVKTKPSIVATKT----FQLADGV 112

Query: 135 DVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN 194
            VP S+DWR +  VTPIK+Q +CG CW+FA V + EG   + +G L + SEQQL+DC+T+
Sbjct: 113 SVPESIDWRSRNVVTPIKDQAQCGSCWSFAVVGSTEGAYALSTGKLTRFSEQQLVDCTTD 172

Query: 195 GNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDE 254
            N GC GG  +  F Y IQ  G+  E +YPY    G+CS        K+S+Y  VP+ +E
Sbjct: 173 LNYGCDGGYLDDTFPY-IQTNGLELESDYPYTGYDGSCSYDSSKVVTKVSSYVSVPA-NE 230

Query: 255 QALLKAV-SMQPVSIAIAAYSTEFQSYKEGIFNG-VCGTQ-LDHAVTIVGFGTTEDGANY 311
           QALL+AV +  PV+IAI A   + Q Y  GI +   C  + LDH V  VG+  +E+G +Y
Sbjct: 231 QALLEAVGTAGPVAIAINA--DDLQFYFSGIIDDKYCDPEWLDHGVLAVGY-NSENGLDY 287

Query: 312 WLIKNSWGNTWGDAGYMKIVRDEGLCGIGTRSSYPL 347
           WLIKNSWG  WG++GY + +R + +CG+   + YPL
Sbjct: 288 WLIKNSWGADWGESGYFRFLRGQNICGVKEDAVYPL 323


>gi|218187750|gb|EEC70177.1| hypothetical protein OsI_00904 [Oryza sativa Indica Group]
 gi|222617983|gb|EEE54115.1| hypothetical protein OsJ_00884 [Oryza sativa Japonica Group]
          Length = 327

 Score =  243 bits (621), Expect = 7e-62,   Method: Compositional matrix adjust.
 Identities = 133/320 (41%), Positives = 182/320 (56%), Gaps = 29/320 (9%)

Query: 45  EIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEF 104
           ++ E+WMA+ G+ Y    EKE R  +F++N+ +I            L  NQF+DLTNDEF
Sbjct: 17  QMFEEWMAKFGKKYPCHGEKEYRFGVFRDNVRFIRSYRPPAGYNSALRVNQFADLTNDEF 76

Query: 105 RALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFA 164
            + +TG K P P            + +    +P  +DWR KGAVT +K+Q  CG CWAFA
Sbjct: 77  VSTHTGAKPPCPKDAP--------RGVDPIWLPCCIDWRYKGAVTDVKDQGACGSCWAFA 128

Query: 165 AVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYP 224
           AVAA+EG+T+IR+G L  LSEQ+L+DC T G++GC GG  ++AF  +    GI  E  Y 
Sbjct: 129 AVAAIEGLTQIRTGKLTPLSEQELVDCDT-GSSGCAGGHTDRAFELVAAKGGITAESGYR 187

Query: 225 YQAVPGTCSA--AQKPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKE 282
           Y+   G C A  A    AA+I  +  VP GDE+ L  AV+ QPV+  I A    FQ Y  
Sbjct: 188 YEGYRGKCRADDALFNHAARIGGHRAVPPGDERQLATAVARQPVTAYIDASGPAFQFYGS 247

Query: 283 GIFNGVCGTQ---------LDHAVTIVGFGTTEDGAN---YWLIKNSWGNTWGDAGYMKI 330
           G+F G CG+           +HAVT+VG+   +DGA+   YW+ KNSWG TWG+ GY+ +
Sbjct: 248 GVFPGPCGSGSGAAAAAPTTNHAVTLVGY--CQDGASGKKYWVAKNSWGKTWGEKGYILL 305

Query: 331 VRD----EGLCGIGTRSSYP 346
            +D     G CG+     YP
Sbjct: 306 EKDVASPHGTCGVAVSPFYP 325


>gi|324512246|gb|ADY45078.1| Cathepsin L [Ascaris suum]
          Length = 388

 Score =  243 bits (620), Expect = 9e-62,   Method: Compositional matrix adjust.
 Identities = 132/314 (42%), Positives = 188/314 (59%), Gaps = 17/314 (5%)

Query: 47  HEKWM---AQHGRSYKDELEKEMRLKIFKENLEYIEKAN---KEGNRTYKLGTNQFSDLT 100
           +E+W     QHG++Y+DE  +   +  F  NLE I K N   + G  ++++GTN  +DL 
Sbjct: 80  YEQWRLFKEQHGKNYEDEETENDHMLAFLSNLEEIRKHNARYQRGESSFEMGTNHITDLP 139

Query: 101 NDEFRALYTGYK-MPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGC 159
            +E+R L  GYK     SHR+ T     +      +VP   DWRD G VT +KNQ  CG 
Sbjct: 140 FEEYRKL-NGYKPRYDDSHRNGTKFLVPFN----INVPGHWDWRDHGYVTEVKNQGMCGS 194

Query: 160 CWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GNNGCLGGSREKAFAYIIQNQGIA 218
           CWAF+A  A+EG  K + G+L+ LSEQ L+DCS   GNNGC GG  + AF YI  N G+ 
Sbjct: 195 CWAFSATGALEGQHKRKIGSLVSLSEQNLVDCSRKYGNNGCNGGLMDYAFEYIKDNHGVD 254

Query: 219 TEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVSMQ-PVSIAIAAYSTEF 277
           TE  YPY+     C   +K   A+   Y ++P GDE+ L  AV+ Q P+S+AI A    F
Sbjct: 255 TEASYPYKGKEMKCHFNKKTVGAEDEGYVDLPEGDEEKLKIAVATQGPISVAIDAGHPSF 314

Query: 278 QSYKEGI-FNGVCGTQ-LDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD-E 334
           Q Y++G+ +   C ++ LDH V +VG+GT E   +YW++KNSWG  WG+ GY++I R+ +
Sbjct: 315 QMYRKGVYYEPQCSSESLDHGVLVVGYGTDEIDGDYWIVKNSWGPGWGEKGYVRIARNRD 374

Query: 335 GLCGIGTRSSYPLA 348
             CGI +++SYP+ 
Sbjct: 375 NHCGIASKASYPIV 388


>gi|413947586|gb|AFW80235.1| hypothetical protein ZEAMMB73_542371 [Zea mays]
          Length = 264

 Score =  243 bits (620), Expect = 1e-61,   Method: Compositional matrix adjust.
 Identities = 117/254 (46%), Positives = 173/254 (68%), Gaps = 10/254 (3%)

Query: 13  INTTPMFIIITLLVSCA----SQVVSSRS-THEQSVVEIHEKWMAQHGRSYKDELEKEMR 67
           +++    +++ +L+ C     S V+++R  + + ++ E HE+WMA++GR YKD  +K  R
Sbjct: 2   VSSKAFLLLLAVLIGCVCSFPSPVLAARELSDDAAMAERHERWMAEYGRVYKDAADKARR 61

Query: 68  LKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFK 127
            ++FK+N  ++E  N +    + LG NQF+DLT + F+A   G+K  S     TT   FK
Sbjct: 62  FEVFKDNFAFVESFNADKKNKFWLGVNQFADLTTEAFKA-NKGFKPISAEKAPTTG--FK 118

Query: 128 YQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQ 187
           Y+NLS++ +PT++DWR KGAVTPIKNQ +CGCCWAF+AVAAVEGI K+ +GNL+ LSEQ+
Sbjct: 119 YENLSISALPTAVDWRTKGAVTPIKNQGQCGCCWAFSAVAAVEGIVKLSTGNLVSLSEQE 178

Query: 188 LLDCSTNG-NNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNY 246
           L+DC T+  + GC GG  + AF ++I+N G+ATE  YPY+AV G C    K +AA I  +
Sbjct: 179 LVDCDTHSMDEGCEGGWMDSAFEFVIKNGGLATESSYPYKAVDGKCKGGSK-SAATIKGH 237

Query: 247 EEVPSGDEQALLKA 260
           E+VP  +E AL+KA
Sbjct: 238 EDVPPNNEAALMKA 251


>gi|18308182|gb|AAL67857.1|AF462309_1 cysteine proteinase [Acanthamoeba healyi]
          Length = 330

 Score =  243 bits (620), Expect = 1e-61,   Method: Compositional matrix adjust.
 Identities = 134/327 (40%), Positives = 189/327 (57%), Gaps = 13/327 (3%)

Query: 28  CASQVVSSRSTHEQSVVEIHEKWMAQHGRS-YKDELEKEMRLKIFKENLEYIEKANKEGN 86
           C   V S+ +     +  +  KWM ++ +S Y+     E    I++ N+   E+ N++ N
Sbjct: 11  CGLFVASTLAATHDPLTGVFAKWMRENTKSNYRFVYSNEEF--IYRWNVWRDEEHNRQ-N 67

Query: 87  RTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKG 146
           ++Y L  NQF DLTN EF  L+ G       H    ++         T +P+  DWR KG
Sbjct: 68  KSYFLAMNQFGDLTNAEFNRLFKGLAFDYSKHAKIHTAA---PEAPATGIPSEFDWRQKG 124

Query: 147 AVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GNNGCLGGSRE 205
           AVT +KNQ +CG CW+F+   + EG   +++G L+ LSEQ L+DCS + GNNGC GG  +
Sbjct: 125 AVTHVKNQGQCGSCWSFSTTGSTEGANFLKTGRLVSLSEQNLIDCSVSYGNNGCNGGLMD 184

Query: 206 KAFAYIIQNQGIATEDEYPYQ-AVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVSMQ 264
            AF YII N+GI TE  YPYQ A P TC          ++ Y +V SGDE ALL A   +
Sbjct: 185 YAFEYIINNRGIDTEASYPYQTAGPLTCQYNAANKGGSLTGYTDVTSGDENALLNAAVKE 244

Query: 265 PVSIAIAAYSTEFQSYKEGIF--NGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTW 322
           PVS+AI A    FQ Y  G++  +    TQLDH V +VG+G +E+G ++W +KNSWG +W
Sbjct: 245 PVSVAIDASHNSFQFYSGGVYYESACSSTQLDHGVLVVGWG-SENGQDFWWVKNSWGASW 303

Query: 323 GDAGYMKIVRDE-GLCGIGTRSSYPLA 348
           G  GY+K+ R++   CGI T +SYP A
Sbjct: 304 GLNGYIKMSRNQNNNCGIATAASYPTA 330


>gi|2804262|dbj|BAA24442.1| cysteine proteinase [Sitophilus zeamais]
          Length = 338

 Score =  243 bits (620), Expect = 1e-61,   Method: Compositional matrix adjust.
 Identities = 130/339 (38%), Positives = 203/339 (59%), Gaps = 15/339 (4%)

Query: 22  ITLLVSCASQVVSSRSTHEQSVVEIHEKWMA---QHGRSYKDELEKEMRLKIFKENLEYI 78
           + L +  A+ V+S ++     +V+  E+W +   QH ++Y  E E+  R+KIF EN   +
Sbjct: 1   MKLFLILAAVVISCQAVSFYDLVQ--EQWSSFKMQHSKNYDSETEERFRMKIFMENAHKV 58

Query: 79  EKANK---EGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPS--HRSTTSSTFKYQNLSM 133
            K NK   +G   +KLG N+++D+ + EF +   G+     +    S  +   ++ + + 
Sbjct: 59  AKHNKLFSQGFVKFKLGLNKYADMLHHEFVSTLNGFNKTKNNILKGSDLNDAVRFISPAN 118

Query: 134 TDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCST 193
             +P ++DWRDKGAVT +K+Q  CG CW+F+A  ++EG    ++G L+ LSEQ L+DCS 
Sbjct: 119 VKLPDTVDWRDKGAVTEVKDQGHCGSCWSFSATGSLEGQHFRKTGKLVSLSEQNLVDCSG 178

Query: 194 N-GNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSG 252
             GNNGC GG  + AF YI  N GI TE  YPY A    C    + + A    + ++   
Sbjct: 179 RYGNNGCNGGLMDNAFRYIKDNGGIDTEKSYPYLAEDEKCHYKAQNSGATDKGFVDIEEA 238

Query: 253 DEQALLKAV-SMQPVSIAIAAYSTEFQSYKEGIFNGV-CGTQ-LDHAVTIVGFGTTEDGA 309
           +E  L  AV ++ PVSIAI A    FQ Y +G+++   C +Q LDH V +VG+GT++DG 
Sbjct: 239 NEDDLKAAVATVGPVSIAIDASHETFQLYSDGVYSDPECSSQELDHGVLVVGYGTSDDGQ 298

Query: 310 NYWLIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYPL 347
           +YWL+KNSWG +WG  GY+K+ R+ + +CG+ +++SYPL
Sbjct: 299 DYWLVKNSWGPSWGLNGYIKMARNQDNMCGVASQASYPL 337


>gi|254674508|dbj|BAH86062.1| cysteine protease [Haemaphysalis longicornis]
          Length = 333

 Score =  243 bits (619), Expect = 1e-61,   Method: Compositional matrix adjust.
 Identities = 130/338 (38%), Positives = 194/338 (57%), Gaps = 15/338 (4%)

Query: 19  FIIITLLVSCASQVVSSRSTHEQSVVEIH-EKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
            + +  L  C +  +++ S   Q ++    E + +QH ++Y   +E+ +R KIF EN   
Sbjct: 1   MLRLAFLCGCVAAAIAASS---QEILRTEWEAFKSQHNKAYSSHVEELLRFKIFTENTLL 57

Query: 78  IEKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMT 134
           + K N +   G  +YKL  N+F DL   EF  +  GY+     ++    +     NL+ +
Sbjct: 58  VAKHNAKYAKGLVSYKLAMNKFGDLLPHEFAKMVNGYR--GKQNKEQRPTFIPPANLNDS 115

Query: 135 DVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN 194
            +PT++DWR KGAVTP+KNQ +CG CWAF+   ++EG    ++G L+ LSEQ L+DCS +
Sbjct: 116 SLPTTVDWRKKGAVTPVKNQGQCGSCWAFSTTGSLEGQHFRKTGKLVSLSEQNLVDCSDD 175

Query: 195 -GNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGD 253
            GN GC GG  +  F YI  N GI TE+ +PY A  G C   +    A  + + ++  G 
Sbjct: 176 FGNQGCNGGLMDNGFQYIKANGGIDTEESHPYTAQDGDCKFKKADVGATDAGFVDIQQGS 235

Query: 254 EQALLKAV-SMQPVSIAIAAYSTEFQSYKEGIFN--GVCGTQLDHAVTIVGFGTTEDGAN 310
           E  L KAV ++ PVS+AI A    FQ Y +G+++      +QLDH V  VG+G  ++G  
Sbjct: 236 EDDLKKAVATVGPVSVAIDASHGSFQLYSQGVYDEPDCSSSQLDHGVLTVGYG-VKNGKK 294

Query: 311 YWLIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYPL 347
           YWL+KNSWG  WGD GY+ + RD +  CGI + +SYPL
Sbjct: 295 YWLVKNSWGGDWGDNGYILMSRDKDNQCGIASSASYPL 332


>gi|354502595|ref|XP_003513369.1| PREDICTED: cathepsin L1-like [Cricetulus griseus]
          Length = 330

 Score =  242 bits (618), Expect = 2e-61,   Method: Compositional matrix adjust.
 Identities = 132/338 (39%), Positives = 200/338 (59%), Gaps = 20/338 (5%)

Query: 17  PMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLE 76
           P+F + TL +     VV +  TH+ S+ +  ++W  +HG++Y  + E + R  +++ N +
Sbjct: 3   PIFFLATLCLG----VVPAAPTHDPSLDDEWQEWKTRHGKTYSMDEEGQKR-AVWENNRK 57

Query: 77  YIEKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSM 133
            IE  N++   G   + L  N F DLTN EFR L TG++         T     +Q   +
Sbjct: 58  MIELHNEDYTKGKHGFHLEMNAFGDLTNIEFRQLMTGFQ------SMGTKEMNVFQEPLL 111

Query: 134 TDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS- 192
            DVP S+DWR+   VTP+K+Q +C  CWAF+AV ++EG    ++G LI LSEQ L+DCS 
Sbjct: 112 GDVPKSVDWRNLSYVTPVKDQGQCSSCWAFSAVGSLEGQIFRKTGQLISLSEQNLVDCSW 171

Query: 193 TNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSG 252
           + GN GC GG  E AF Y+ +N+G+ T   YPY+A  G C    K +AA ++++ ++P  
Sbjct: 172 SYGNIGCFGGLMEYAFRYVKENRGLDTRVSYPYEARNGPCRYDPKNSAANVTDFVKIPI- 230

Query: 253 DEQALLKAV-SMQPVSIAIAAYSTEFQSYKEGIF--NGVCGTQLDHAVTIVGFGTTEDGA 309
            E AL+KAV ++ P+S+ + ++   F+ YK G++       + LDHAV +VG+G   DG 
Sbjct: 231 SEDALMKAVATVGPISVGVDSHHHSFRFYKGGMYYEPHCSSSNLDHAVLVVGYGEESDGN 290

Query: 310 NYWLIKNSWGNTWGDAGYMKIVRDE-GLCGIGTRSSYP 346
            YW++KNSWG  WG  GY+K+ RD    CGI T + YP
Sbjct: 291 KYWMVKNSWGQGWGMNGYIKMARDRNNNCGIATYAIYP 328


>gi|227018328|gb|ACP18830.1| cysteine proteinase 1 [Chrysomela tremula]
          Length = 323

 Score =  242 bits (618), Expect = 2e-61,   Method: Compositional matrix adjust.
 Identities = 137/334 (41%), Positives = 206/334 (61%), Gaps = 21/334 (6%)

Query: 22  ITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKA 81
           + L+++ A+ VV+  +  +Q   E+   +   HG++YK   E+++R  IF++ L  I   
Sbjct: 1   MKLIIAFAAFVVAINAASDQ---ELWADFKKAHGKTYKSLREEKLRFNIFQDTLREIAAH 57

Query: 82  N---KEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPT 138
           N   + G  TY L  NQFSD+T++EFRA+        PS         +  NL++   P 
Sbjct: 58  NAKYESGESTYYLAINQFSDITDEEFRAMLMKNVESRPSLED-----MEIANLTVGAAPE 112

Query: 139 SLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCST-NGNN 197
           S+DWR +GAV PI+NQ++CG CWAF+AVAAVEG   I+SG+   LS QQL+DCST  GN+
Sbjct: 113 SIDWRTEGAVLPIRNQEDCGSCWAFSAVAAVEGQAAIKSGSKTPLSVQQLVDCSTEGGNS 172

Query: 198 GCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQAL 257
           GC GG    AF YI  N G+ ++ +YPY     +C A +  +  K++ Y++V S  E +L
Sbjct: 173 GCNGGLMNGAFDYIKAN-GLESDAKYPYTGTDDSCKADKSSSLVKLTGYKKVAS-SEASL 230

Query: 258 LKAV-SMQPVSIAIAAYSTEFQSYKEGIFNGV--CGTQLDHAVTIVGFGTTEDGANYWLI 314
            +AV ++ P+S+A+  Y+  ++SY  GIFN +   G  LDH VT VG+G T++G  YW +
Sbjct: 231 KEAVGTVGPISVAV--YADLWRSYGGGIFNNILCLGFGLDHGVTAVGYG-TDNGKKYWPV 287

Query: 315 KNSWGNTWGDAGYMKIVRDE-GLCGIGTRSSYPL 347
           KNSWG +WG+ GY+++ RD    CGI  ++SYP+
Sbjct: 288 KNSWGESWGEEGYIRMARDTLHNCGINQQASYPI 321


>gi|357122137|ref|XP_003562772.1| PREDICTED: fruit bromelain-like [Brachypodium distachyon]
          Length = 358

 Score =  242 bits (618), Expect = 2e-61,   Method: Compositional matrix adjust.
 Identities = 132/315 (41%), Positives = 183/315 (58%), Gaps = 21/315 (6%)

Query: 52  AQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGY 111
           A + R+Y    E+  R ++++ N++YIE  N+ G+ TY+LG NQF+DLT  EFRA+YT  
Sbjct: 45  ATYNRTYASPEERLRRFEVYRRNVDYIEAMNRRGDLTYELGENQFADLTVQEFRAMYTMP 104

Query: 112 ----KMPSPSHRSTTSSTFK----------YQNLSMTDVPTSLDWRDKGAVTPIKNQKEC 157
                 P    R    +T            Y +      PTS+DWR KGAVTP+K+Q  C
Sbjct: 105 ARVDSRPDAWRRRQMITTLAGPVTEDGGSYYSDAWEEAGPTSVDWRSKGAVTPVKDQGGC 164

Query: 158 GCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGI 217
           GCCWAFA VA +EG+ KI++G L+ LSEQ+L+DC    +    G   E A  ++  N G+
Sbjct: 165 GCCWAFATVATIEGLHKIKTGQLVSLSEQELVDCDDADDGCGGGLP-EIAMEWVAHNGGL 223

Query: 218 ATEDEYPYQAVPGTCSAAQKP-AAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTE 276
            TE  YPY    G C   +    AAKI+  + V +  E  L +AV+ QPV++AI A  + 
Sbjct: 224 TTEANYPYTGKAGKCDRGKASNHAAKIAAAQMVRANSEAELERAVARQPVAVAINAPDS- 282

Query: 277 FQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR---- 332
              YK G+++G C  + DHAVT+VG+G    G  YW+IKNSW  TWG+ GY ++ R    
Sbjct: 283 LMFYKSGVYSGPCTAEFDHAVTVVGYGADNKGHKYWIIKNSWAETWGEKGYGRMQRGVAA 342

Query: 333 DEGLCGIGTRSSYPL 347
            EGLCGI T +SYP+
Sbjct: 343 KEGLCGIATHASYPV 357


>gi|326501772|dbj|BAK02675.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 333

 Score =  242 bits (617), Expect = 2e-61,   Method: Compositional matrix adjust.
 Identities = 134/343 (39%), Positives = 197/343 (57%), Gaps = 19/343 (5%)

Query: 13  INTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFK 72
           ++   +  ++ L  SC     +  + H +        W   + + Y D  E+ +R   ++
Sbjct: 1   MHAISVLAVLALAFSCTLAFDAKLNQHWKL-------WKEANNKRYSDA-EEHVRRATWE 52

Query: 73  ENLEYIEKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQ 129
            NL+ +++ N +   G  TY LG N+++D+T  EF  +  GY       R+    TF + 
Sbjct: 53  GNLQKVQEHNLQADLGVHTYWLGMNKYADMTVTEFVKVMNGYNATMRGQRTQDRHTFSFN 112

Query: 130 NLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLL 189
             S   +P ++DWRDKG VT +K+Q +CG CWAF+   A+EG    ++G L+ LSEQ L+
Sbjct: 113 --SKIALPDTVDWRDKGYVTDVKDQGQCGSCWAFSTTGALEGQHFKQTGKLVSLSEQNLV 170

Query: 190 DCS-TNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEE 248
           DCS   GN GC GG  ++AF YI +N GI TED YPY+AV   C        A  + + +
Sbjct: 171 DCSGKQGNMGCNGGLMDQAFEYIKENNGIDTEDSYPYEAVDNQCRFKAANVGATDTGFTD 230

Query: 249 VPSGDEQALLKAV-SMQPVSIAIAAYSTEFQSYKEGIFNG-VCG-TQLDHAVTIVGFGTT 305
           + S DE AL +AV ++ P+S+AI A  T FQ YK G++N   C  T+LDH V  VG+G T
Sbjct: 231 ITSKDESALQQAVATVGPISVAIDAGHTSFQLYKHGVYNEPFCSQTRLDHGVLAVGYG-T 289

Query: 306 EDGANYWLIKNSWGNTWGDAGYMKIVRDE-GLCGIGTRSSYPL 347
           + G +YWL+KNSWG  WGD GY+K+ R++   CGI T +SYPL
Sbjct: 290 DSGKDYWLVKNSWGEGWGDKGYIKMTRNKRNQCGIATAASYPL 332


>gi|113120265|gb|ABI30272.1| VXH-A, partial [Vasconcellea x heilbornii]
          Length = 318

 Score =  242 bits (617), Expect = 2e-61,   Method: Compositional matrix adjust.
 Identities = 131/287 (45%), Positives = 175/287 (60%), Gaps = 18/287 (6%)

Query: 38  THEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFS 97
           T  + ++ + + WM ++ + YKD  EK  R +IFK+NL+YI++ NK+ N TY LG   F+
Sbjct: 39  TSTEKLINLFDSWMVEYDKVYKDIDEKIYRFEIFKDNLKYIDETNKK-NNTYWLGLTSFT 97

Query: 98  DLTNDEFRALYTGYKMPSPSHRSTTSST----FKYQNLSMTDVPTSLDWRDKGAVTPIKN 153
           DLTNDEF+  Y G     P + STT  +    F Y ++   ++P S+DWR KGAVTP++N
Sbjct: 98  DLTNDEFKEKYVG---SIPENWSTTEESNDKEFIYDDV--VNIPASIDWRQKGAVTPVRN 152

Query: 154 QKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQ 213
           Q  CG CW F++VAAVEGI KI +G L+ LSEQ+LLDC    + GC GG    A  Y + 
Sbjct: 153 QGSCGSCWTFSSVAAVEGINKIVTGQLVSLSEQELLDCERR-SYGCRGGFPPYALQY-VA 210

Query: 214 NQGIATEDEYPYQAVPGTCSAAQ-KPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAA 272
           N GI     YPY+ V   C AAQ K    K      V   +EQAL++ +++QPVSI + A
Sbjct: 211 NSGIHLRQYYPYEGVQRQCRAAQAKGPKVKTDGVGRVQRNNEQALIQRIAIQPVSIVVEA 270

Query: 273 YSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWG 319
               FQ+Y+ GIF G CGT +DHAV  VG+G       Y LIKNSWG
Sbjct: 271 KGRAFQNYRGGIFAGPCGTSIDHAVAAVGYGN-----GYILIKNSWG 312


>gi|392873948|gb|AFM85806.1| cathepsin L [Callorhinchus milii]
          Length = 338

 Score =  242 bits (617), Expect = 2e-61,   Method: Compositional matrix adjust.
 Identities = 133/312 (42%), Positives = 189/312 (60%), Gaps = 16/312 (5%)

Query: 48  EKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKE---GNRTYKLGTNQFSDLTNDEF 104
           E+W + HG+SY ++ E+  R  +++++L  IE  N E   G  +++LG N F D+ N+EF
Sbjct: 30  EQWKSWHGKSY-EQKEETWRRMVWEKHLRVIEIHNLEHSLGKHSFRLGMNHFGDMPNEEF 88

Query: 105 RALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFA 164
           R L  GYK    +H+    S F   N    +VP  +DWRD+G VTP+K+Q +CG CWAF+
Sbjct: 89  RQLMNGYKYKQ-THKKLQGSHFLEPNF--LEVPKHVDWRDEGYVTPVKDQGQCGSCWAFS 145

Query: 165 AVAAVEGITKIRSGNLIQLSEQQLLDCST-NGNNGCLGGSREKAFAYIIQNQGIATEDEY 223
              A+EG    R+G L+ LSEQ L++CS   GN GC GG  ++AF Y+  N GI +ED Y
Sbjct: 146 TTGALEGQHFRRTGQLVSLSEQNLVECSKPEGNEGCNGGLMDQAFQYVKDNGGIDSEDSY 205

Query: 224 PYQAVPGT-CSAAQKPAAAKISNYEEVPSGDEQALLKAV-SMQPVSIAIAAYSTEFQSYK 281
           PY     T C    +  AA  + + ++PSG E+AL+KA+ ++ PVS+AI A  T FQ Y+
Sbjct: 206 PYVGTDDTPCHYNPQYNAANDTGFVDIPSGKERALMKAIAAVGPVSVAIDAGHTSFQFYQ 265

Query: 282 EGI-FNGVC-GTQLDHAVTIVGFGTTE---DGANYWLIKNSWGNTWGDAGYMKIVRD-EG 335
            GI F   C  T LDH V +VG+G  +   DG  YW++KNSW    G  GY+ + +D + 
Sbjct: 266 SGIYFEAECSSTDLDHGVLVVGYGVEKRDTDGKKYWIVKNSWSEKLGQNGYILMAKDKDN 325

Query: 336 LCGIGTRSSYPL 347
            CGI T +SYPL
Sbjct: 326 HCGIATAASYPL 337


>gi|395856029|ref|XP_003800445.1| PREDICTED: cathepsin S [Otolemur garnettii]
          Length = 331

 Score =  242 bits (617), Expect = 2e-61,   Method: Compositional matrix adjust.
 Identities = 136/336 (40%), Positives = 196/336 (58%), Gaps = 19/336 (5%)

Query: 20  IIITLLVSCASQVVSSRSTHEQSVVEIH-EKWMAQHGRSYKDELEKEMRLKIFKENLEYI 78
           ++ TLLV C++        H    ++ H   W   +G+ Y ++ E+  R  I+++NL+++
Sbjct: 4   LVWTLLVCCSAMA----QLHRDPALDHHWHLWKKTYGKQYTEKNEETERRLIWEKNLKFV 59

Query: 79  EKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTD 135
              N E   G  +Y LG N   D+T++E  +L T  K+P  S R+ T  +   Q L    
Sbjct: 60  MLHNLEHSMGMHSYDLGMNHLGDMTSEEVVSLMTCLKVPRQSQRNVTYKSSPNQKL---- 115

Query: 136 VPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG 195
            P SLDWR+KG VT +K Q  CG CWAF+AV A+E   K+ +G L+ LS Q L+DCST  
Sbjct: 116 -PDSLDWREKGCVTEVKYQGSCGSCWAFSAVGALEAQLKLTTGKLVSLSAQNLVDCSTEK 174

Query: 196 --NNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGD 253
             N GC GG   +AF YII N GI +E  YPY+A+   C    K  AA  S Y E+P G 
Sbjct: 175 YRNEGCHGGFMTEAFQYIIDNNGIDSEASYPYKAMDEKCQYDSKNRAATCSKYTELPFGS 234

Query: 254 EQALLKAVSMQ-PVSIAIAAYSTEFQSYKEGI-FNGVCGTQLDHAVTIVGFGTTEDGANY 311
           E+AL +AV+ + PVS+AI A  + F  Y+ G+ +   C   ++H V +VG+G   +G +Y
Sbjct: 235 EEALKEAVASKGPVSVAIDASHSSFFLYRSGVYYEPACTQVVNHGVLVVGYGNL-NGNDY 293

Query: 312 WLIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYP 346
           WL+KNSWG  +GD GY+++ R+ E  CGI + SSYP
Sbjct: 294 WLVKNSWGLYFGDKGYIRMARNRENHCGIASYSSYP 329


>gi|305434756|gb|ADM53740.1| cathepsin L1 precursor [Lepeophtheirus salmonis]
          Length = 325

 Score =  242 bits (617), Expect = 2e-61,   Method: Compositional matrix adjust.
 Identities = 133/307 (43%), Positives = 190/307 (61%), Gaps = 15/307 (4%)

Query: 50  WMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKE---GNRTYKLGTNQFSDLTNDEFRA 106
           W   HG++Y     +E+R+KIF+EN   I+K N E   G  TY L  NQ+ DL   EF  
Sbjct: 24  WTKLHGKTYTSFEIEELRVKIFEENRIKIQKHNAEAQNGLHTYSLEMNQYGDLLQSEFLQ 83

Query: 107 LYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAV 166
            YTG    S S  +T          +   VP+ ++W   GAVT +K+QK+CG CWAF+  
Sbjct: 84  GYTGLAKGSYSGDNTVILD------NSAPVPSYVNWTKNGAVTAVKDQKDCGSCWAFSTT 137

Query: 167 AAVEGITKIRSGNLIQLSEQQLLDCSTN-GNNGCLGGSREKAFAYIIQNQGIATEDEYPY 225
            +VEG   I++  L+  SEQQL+DCS++  N GC GG  + AF Y+I N+GIATED YPY
Sbjct: 138 GSVEGQYFIKNKKLLSFSEQQLVDCSSDFRNEGCNGGWMDNAFKYLIANKGIATEDTYPY 197

Query: 226 QAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVS-MQPVSIAIAAYSTEFQSYKEGI 284
            A  G C   +  AA +IS++++V  G E  L  AV+ + P+S+AI A S +FQ YK+G+
Sbjct: 198 TATDGVCVYNKTMAAGRISSFKDVKHGSEDQLKLAVAQIGPISVAIDASSGDFQFYKKGV 257

Query: 285 F-NGVCGTQ-LDHAVTIVGFGTTED-GANYWLIKNSWGNTWGDAGYMKIVRD-EGLCGIG 340
           + +  C ++ LDH V  VG+GT +  G +YWL+KNSW  +WGD GY+K+ R+ + +CGI 
Sbjct: 258 YVDEECSSKYLDHGVLAVGYGTDKGTGLDYWLVKNSWSASWGDQGYIKMARNHKNMCGIA 317

Query: 341 TRSSYPL 347
           + +SYP+
Sbjct: 318 SLASYPV 324


>gi|332376957|gb|AEE63618.1| unknown [Dendroctonus ponderosae]
          Length = 318

 Score =  242 bits (617), Expect = 2e-61,   Method: Compositional matrix adjust.
 Identities = 138/335 (41%), Positives = 195/335 (58%), Gaps = 27/335 (8%)

Query: 19  FIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYI 78
           FI+ +LL+      + +  +  QS       +  +H +SY +++E+  RL IF ENL  I
Sbjct: 4   FILASLLIVAVGASLENVGSTFQS-------FKLKHSKSYSNQVEEAKRLAIFTENLRDI 56

Query: 79  EKANK---EGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTD 135
           E+ N     G  +Y    NQF+DLT DEF+A  T +  P       T +T  Y    +  
Sbjct: 57  EEHNALYAAGLVSYNKSVNQFTDLTIDEFKAYLTLHSKP-------TLNTVPYVRTGL-Q 108

Query: 136 VPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG 195
           VPT+LDWR +G VT +K+Q +CG CWAF+ V + EG     +G L+ LSEQQL+DC+TN 
Sbjct: 109 VPTTLDWRSQGYVTGVKDQGDCGSCWAFSVVGSTEGAYYKSTGKLVSLSEQQLIDCTTNV 168

Query: 196 NNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQ 255
           N+GC GG  E+ F Y +Q  G+ +E  YPY    G C  ++     K+S Y  V  G E 
Sbjct: 169 NDGCDGGYLEETFPY-VQQTGLVSESSYPYTGRDGNCRISESDVVTKVSKY--VLLGGEA 225

Query: 256 ALLKAV-SMQPVSIAIAAYSTEFQSYKEGIF-NGVCGT-QLDHAVTIVGFGTTEDGANYW 312
            LL+AV S+ PVS+A+ A  T   SY  G++ + +C    L+H V +VG+G T+DG +YW
Sbjct: 226 DLLEAVGSVGPVSVAMDA--TYIYSYASGVYESSLCSLYSLNHGVLVVGYG-TQDGKDYW 282

Query: 313 LIKNSWGNTWGDAGYMKIVRDEGLCGIGTRSSYPL 347
           LIKNSWGNTWG+ GY+K++R    CGI     YP+
Sbjct: 283 LIKNSWGNTWGEQGYLKLLRGTNECGIAEDDVYPI 317


>gi|405966498|gb|EKC31776.1| Cathepsin L [Crassostrea gigas]
          Length = 330

 Score =  242 bits (617), Expect = 2e-61,   Method: Compositional matrix adjust.
 Identities = 133/313 (42%), Positives = 188/313 (60%), Gaps = 17/313 (5%)

Query: 45  EIHEKW---MAQHGRSYKDELEKEMRLKIFKENLEYIEKAN---KEGNRTYKLGTNQFSD 98
           E+  +W   +  HG+ Y  E E   R+ I++ NL+YIEK N     G+ ++ LG N++ D
Sbjct: 22  ELDSEWQLYLKAHGKQYGAEEEARRRV-IWEGNLDYIEKHNLAADRGDYSFWLGMNEYGD 80

Query: 99  LTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECG 158
           +TN+EFR+   GYKM       T+  +      ++ D+P ++DWR KG VTPIKNQ +CG
Sbjct: 81  MTNEEFRSTMNGYKM----RNGTSRGSLYLPPSNIGDLPDTVDWRPKGYVTPIKNQGQCG 136

Query: 159 CCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS-TNGNNGCLGGSREKAFAYIIQNQGI 217
            CW+F+A  ++EG T  ++G L  LSEQ L+DCS   GN+GC GG  + AF YI  N GI
Sbjct: 137 SCWSFSATGSLEGQTFKKTGKLPSLSEQNLVDCSQKQGNHGCQGGLMDDAFQYIKDNSGI 196

Query: 218 ATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAV-SMQPVSIAIAAYSTE 276
            TE  YPY+A  G C        A  S + ++ S  E  L  AV ++ P+S+AI A    
Sbjct: 197 DTESSYPYEAKNGKCRFNAANVGATDSGFTDIKSKSESDLQSAVATVGPISVAIDASHMS 256

Query: 277 FQSYKEGIFNG-VCG-TQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRDE 334
           FQ Y+ G+++   C  T+LDH V  VG+G TE G +YWL+KNSWG +WG  GY+ + R++
Sbjct: 257 FQLYRSGVYHEFFCSETRLDHGVLAVGYG-TESGKDYWLVKNSWGESWGQKGYIMMSRNK 315

Query: 335 -GLCGIGTRSSYP 346
              CGI T +SYP
Sbjct: 316 RNNCGIATSASYP 328


>gi|389610697|dbj|BAM18960.1| cathepsin L [Papilio polytes]
          Length = 341

 Score =  242 bits (617), Expect = 2e-61,   Method: Compositional matrix adjust.
 Identities = 131/344 (38%), Positives = 198/344 (57%), Gaps = 23/344 (6%)

Query: 20  IIITLLVSCASQVVSSRSTHEQSVVEIHEKWMA---QHGRSYKDELEKEMRLKIFKENLE 76
           +++ + V  A+  VS           + E+W A   +H + Y  E+E + R+KI+ EN  
Sbjct: 4   LVVLMCVVAAASAVSFFDL-------VKEEWNAFKMEHQKQYDSEVEDKFRMKIYAENKH 56

Query: 77  YIEKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSM 133
            I K N++   G   +++  N++ D+ + EF     G+   + + +     +   +  + 
Sbjct: 57  KIAKHNQKFARGQVPFRVKQNKYGDMLHHEFVHTMNGFNKTTKNGKGLFGKSAGERGATF 116

Query: 134 -----TDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQL 188
                  VP  +DWR  GAVT +K+Q +CG CW+F+A  A+EG    ++  L+ LSEQ L
Sbjct: 117 IPPANVRVPDHVDWRKHGAVTEVKDQGKCGSCWSFSATGALEGQHYRQTNILVSLSEQNL 176

Query: 189 LDCSTN-GNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYE 247
           +DCST  GNNGC GG  + AF YI  N+GI TE  YPY+AV   C    + + A    + 
Sbjct: 177 IDCSTAYGNNGCNGGLMDNAFKYIKDNKGIDTEKSYPYEAVDDKCRYNPRNSGADDVGFI 236

Query: 248 EVPSGDEQALLKAV-SMQPVSIAIAAYSTEFQSYKEGI-FNGVCG-TQLDHAVTIVGFGT 304
           ++PSGDE  L+ AV ++ PVS+AI A    FQ Y +G+ F+  C  T LDH V +VG+GT
Sbjct: 237 DIPSGDEGKLMAAVATVGPVSVAIDASQETFQFYSDGVYFDENCSSTSLDHGVLVVGYGT 296

Query: 305 TEDGANYWLIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYPL 347
            E+G +YWL+KNSWG +WGD GY+K+ R+ +  CGI T +S+PL
Sbjct: 297 DENGGDYWLVKNSWGRSWGDLGYIKMARNRDNHCGIATAASFPL 340


>gi|163658591|gb|ABY28387.1| cathepsin L [Gnathostoma spinigerum]
          Length = 398

 Score =  241 bits (616), Expect = 3e-61,   Method: Compositional matrix adjust.
 Identities = 123/310 (39%), Positives = 190/310 (61%), Gaps = 12/310 (3%)

Query: 48  EKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNR---TYKLGTNQFSDLTNDEF 104
           E +  +HG+++ D   +   +  F +NLEYI++ N++  R   T+++G N  +DL  DE+
Sbjct: 92  EDFKLEHGKAFDDVENEYDHIFAFTKNLEYIKQHNEKFQRGEVTFEMGVNHLTDLPFDEY 151

Query: 105 RALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFA 164
           + L  G++  +   R    STF   +     +P ++DWR+   VT +K+Q +CG CWAF+
Sbjct: 152 KKL-NGFRKNNDDSRPRNGSTFLRPHF--VQIPDTVDWRNSSYVTVVKDQGQCGSCWAFS 208

Query: 165 AVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GNNGCLGGSREKAFAYIIQNQGIATEDEY 223
           A  A+EG    ++  L+ LSEQ L+DCS   GNNGC GG  + AF YI  N GI TE+ Y
Sbjct: 209 ATGALEGQHMRKTHQLVSLSEQNLVDCSRKYGNNGCNGGLMDNAFEYIKDNHGIDTEESY 268

Query: 224 PYQAVPG-TCSAAQKPAAAKISNYEEVPSGDEQALLKAV-SMQPVSIAIAAYSTEFQSYK 281
           PY+ V G  C   +K   A+   Y ++P GDE+AL  AV ++ P+S+AI A    FQ+Y+
Sbjct: 269 PYKGVEGKKCHFRRKFVGAEDYGYTDLPEGDEEALKVAVATIGPISVAIDAGHISFQNYR 328

Query: 282 EGIF--NGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRDE-GLCG 338
           +GI+  N      LDH V +VG+GT E+  +YW++KNSWG  WG+ GY+++ R++   CG
Sbjct: 329 KGIYTENECSPEDLDHGVLVVGYGTDENAGDYWIVKNSWGTRWGEHGYIRMARNKRNQCG 388

Query: 339 IGTRSSYPLA 348
           I +++SYP+ 
Sbjct: 389 IASKASYPIV 398


>gi|405958751|gb|EKC24845.1| Cathepsin L [Crassostrea gigas]
          Length = 330

 Score =  241 bits (616), Expect = 3e-61,   Method: Compositional matrix adjust.
 Identities = 133/313 (42%), Positives = 188/313 (60%), Gaps = 17/313 (5%)

Query: 45  EIHEKW---MAQHGRSYKDELEKEMRLKIFKENLEYIEKAN---KEGNRTYKLGTNQFSD 98
           E+  +W   +  HG+ Y  E E   R+ I++ NL+YIEK N     G+ ++ LG N++ D
Sbjct: 22  ELDSEWQLYLKAHGKQYGAEEEARRRV-IWEGNLDYIEKHNLAADRGDYSFWLGMNEYGD 80

Query: 99  LTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECG 158
           +TN+EFR+   GYKM       T+  +      ++ D+P ++DWR KG VTPIKNQ +CG
Sbjct: 81  MTNEEFRSTMNGYKM----RNGTSRGSLYLPPSNIGDLPDTVDWRPKGYVTPIKNQGQCG 136

Query: 159 CCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS-TNGNNGCLGGSREKAFAYIIQNQGI 217
            CW+F+A  ++EG T  ++G L  LSEQ L+DCS   GN+GC GG  + AF YI  N GI
Sbjct: 137 SCWSFSATGSLEGQTFKKTGKLPSLSEQNLVDCSQKQGNHGCQGGLMDDAFQYIKDNNGI 196

Query: 218 ATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAV-SMQPVSIAIAAYSTE 276
            TE  YPY+A  G C        A  S + ++ S  E  L  AV ++ P+++AI A    
Sbjct: 197 DTESSYPYEAKNGKCRFNAANVGATDSGFTDIKSKSESDLQSAVATVGPIAVAIDASHMS 256

Query: 277 FQSYKEGIFNG-VCG-TQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRDE 334
           FQ YK G+++   C  T+LDH V  VG+G TE G +YWL+KNSWG +WG  GY+ + R++
Sbjct: 257 FQLYKSGVYHEFFCSETRLDHGVLAVGYG-TESGKDYWLVKNSWGESWGQKGYIMMSRNK 315

Query: 335 -GLCGIGTRSSYP 346
              CGI T +SYP
Sbjct: 316 RNNCGIATSASYP 328


>gi|432114311|gb|ELK36239.1| Cathepsin S [Myotis davidii]
          Length = 340

 Score =  241 bits (616), Expect = 3e-61,   Method: Compositional matrix adjust.
 Identities = 134/338 (39%), Positives = 200/338 (59%), Gaps = 18/338 (5%)

Query: 18  MFIIITLLVSCASQVVSSRSTHEQSVVEIH-EKWMAQHGRSYKDELEKEMRLKIFKENLE 76
           M  ++ +L+ C+S +      H+   ++ H + W   +G+ Y +E E+  R  I+++NL+
Sbjct: 10  MKWLLLVLLGCSSAMAQ---LHKDPTLDHHWDLWKKTYGKQYTEENEEVTRRFIWEKNLK 66

Query: 77  YIEKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSM 133
           Y+   N E   G  +Y LG N  +D+T++E   L +  ++PS   R+ T  +   Q L  
Sbjct: 67  YVMLHNLEHSMGMHSYDLGMNHLADMTSEEVMLLMSSLRVPSQWQRNVTFKSNPNQKL-- 124

Query: 134 TDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCST 193
              P S+DWRDKG VT +K Q  CG CWAF+AV A+E   K+++G L+ LS Q L+DCST
Sbjct: 125 ---PDSMDWRDKGCVTEVKYQGSCGSCWAFSAVGALEAQLKLKTGKLVSLSVQNLVDCST 181

Query: 194 N--GNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPS 251
               N GC GG   +AF YII N GI +E  YPY+A+ G C    K  AA  S Y E+P 
Sbjct: 182 GKYSNKGCNGGFMTEAFQYIIDNNGIDSEASYPYKAMDGKCQYDVKNRAATCSKYVELPF 241

Query: 252 GDEQALLKAVSMQ-PVSIAIAAYSTEFQSYKEGI-FNGVCGTQLDHAVTIVGFGTTEDGA 309
           G+E+AL +AV+ + PVS+AI A    F  Y+ G+ ++  C   ++H V  VG+G   +G 
Sbjct: 242 GNEEALKEAVANKGPVSVAIDASHPSFFLYRSGVYYDKACTLNVNHGVLAVGYGNY-NGK 300

Query: 310 NYWLIKNSWGNTWGDAGYMKIVRDEG-LCGIGTRSSYP 346
           +YWL+KNSWG  +G+ GY+++ R+ G  CGI +  SYP
Sbjct: 301 DYWLVKNSWGLHFGEQGYIRMARNSGNHCGIASYPSYP 338


>gi|328776427|ref|XP_625135.3| PREDICTED: cathepsin L-like [Apis mellifera]
          Length = 351

 Score =  241 bits (616), Expect = 3e-61,   Method: Compositional matrix adjust.
 Identities = 133/328 (40%), Positives = 196/328 (59%), Gaps = 20/328 (6%)

Query: 38  THEQSVVE-IHEKWMA---QHGRSYKDELEKEMRLKIFKENLEYIEKANKE---GNRTYK 90
           TH  S  E ++++WM    +H + YK ++E+  R+KIF +N   I K N        +YK
Sbjct: 21  THAVSFFELVNQEWMTFKMEHKKVYKSDVEERFRMKIFMDNKHKIAKHNSNYEMKKVSYK 80

Query: 91  LGTNQFSDLTNDEFRALYTGYKMP----SPSHRSTTSSTF-KYQNLSMTDVPTSLDWRDK 145
           L  N++ D+ + EF  +  G+         S R    ++F +  N+ +   P  +DWR +
Sbjct: 81  LKMNKYGDMLHHEFVNILNGFNKSINTQLRSERLPVGASFIEPANVVL---PKKVDWRKE 137

Query: 146 GAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GNNGCLGGSR 204
           GAVTP+K+Q  CG CW+F+A  A+EG    R+G L+ LSEQ L+DCS   GNNGC GG  
Sbjct: 138 GAVTPVKDQGHCGSCWSFSATGALEGQHFRRTGVLVSLSEQNLIDCSGKYGNNGCNGGLM 197

Query: 205 EKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAV-SM 263
           ++AF YI  N+G+ TE  YPY+A    C      + A    Y ++P+GDE+ L  AV ++
Sbjct: 198 DQAFQYIKDNKGLDTEASYPYEAENDKCRYNPANSGAIDVGYIDIPTGDEKLLKAAVATI 257

Query: 264 QPVSIAIAAYSTEFQSYKEGI-FNGVCGT-QLDHAVTIVGFGTTEDGANYWLIKNSWGNT 321
            PVS+AI A    FQ Y EG+ +   C + +LDH V ++G+GT E+G +YWL+KNSWG T
Sbjct: 258 GPVSVAIDASHQSFQFYSEGVYYEPECSSEELDHGVLVIGYGTNENGQDYWLVKNSWGET 317

Query: 322 WGDAGYMKIVRDE-GLCGIGTRSSYPLA 348
           WG+ GY+K+ R++   CGI + +SYPL 
Sbjct: 318 WGNNGYIKMARNKLNHCGIASSASYPLV 345


>gi|383849553|ref|XP_003700409.1| PREDICTED: cathepsin L-like [Megachile rotundata]
          Length = 343

 Score =  241 bits (616), Expect = 3e-61,   Method: Compositional matrix adjust.
 Identities = 132/349 (37%), Positives = 207/349 (59%), Gaps = 33/349 (9%)

Query: 20  IIITLLVSCASQVVSSRSTHEQSVVEIHEKWM---AQHGRSYKDELEKEMRLKIFKENLE 76
           I++ ++++CA+  V + S  E     ++++W+    +H + YK E E+ +R+KI+ +N  
Sbjct: 4   ILLLIVITCAA--VQAISFFEL----VNQEWINFKMEHKKCYKHEAEERLRMKIYMKNKL 57

Query: 77  YIEKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSM 133
            I + N +      TY+L  N++ D+ N EF+ +  GY         T + T + + L +
Sbjct: 58  QIAQHNCDYELKKVTYRLKINKYGDMLNHEFKNMLNGYN-------RTINHTLRNERLPV 110

Query: 134 ---------TDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLS 184
                     ++P  +DWR  GAVT +K+Q  CG CWAF+A  ++EG    R+G L+ LS
Sbjct: 111 GAAFIEPCNVELPKMVDWRKCGAVTEVKDQGHCGSCWAFSATGSLEGQHFRRTGVLVSLS 170

Query: 185 EQQLLDCS-TNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKI 243
           EQ L+DCS + GNNGC GG  ++AF+YI  N+G+ TE  YPY+     C   ++ + A  
Sbjct: 171 EQNLIDCSGSYGNNGCNGGLMDQAFSYIKDNKGLDTEKTYPYEGEDDKCRYDKRSSGASD 230

Query: 244 SNYEEVPSGDEQALLKAV-SMQPVSIAIAAYSTEFQSYKEGI-FNGVC-GTQLDHAVTIV 300
             + ++P GDEQ L  AV ++ PVS+AI A    FQ Y +GI F   C  T LDH V +V
Sbjct: 231 VGFVDIPVGDEQKLKAAVATVGPVSVAIDASHQSFQFYSDGIYFEPECSSTNLDHGVLVV 290

Query: 301 GFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYPLA 348
           G+GT E+G +YW++KNSWG +WG+ GY+K+ R+ +  CGI + +SYP+ 
Sbjct: 291 GYGTDEEGRDYWIVKNSWGESWGEKGYIKMARNIDNHCGIASSASYPIV 339


>gi|37786769|gb|AAO64471.1| cathepsin L precursor [Fundulus heteroclitus]
          Length = 337

 Score =  241 bits (616), Expect = 3e-61,   Method: Compositional matrix adjust.
 Identities = 136/341 (39%), Positives = 198/341 (58%), Gaps = 17/341 (4%)

Query: 19  FIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYI 78
            + + +L  C S  V S  + +  + +    W + H ++Y    E   RL ++++NL+ I
Sbjct: 1   MLPVAVLTLCLSSAVLSAPSLDPQLDQHWNLWKSWHSKNYHQREEGWRRL-VWEKNLKKI 59

Query: 79  EKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTD 135
           E  N E   G  +Y+LG N F D+T++EF+ +  GYK    + R    S F   N    +
Sbjct: 60  ELHNLEHSMGKHSYRLGMNHFGDMTHEEFKQIMNGYK--HKAERKFKGSLFLEPNF--LE 115

Query: 136 VPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS-TN 194
            P S+DWR+KG VTP+K+Q ECG CWAF+   A+EG    R+G L+ LS Q L++CS   
Sbjct: 116 APRSVDWREKGYVTPVKDQGECGSCWAFSTTGALEGQEFTRTGKLVSLSGQNLVECSRPE 175

Query: 195 GNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPG-TCSAAQKPAAAKISNYEEVPSGD 253
           GN GC GG  ++AF Y+  NQG+ +ED YPY       C    K +AA  + + ++PSG+
Sbjct: 176 GNEGCNGGLMDQAFQYVKDNQGLDSEDSYPYLGTDDQPCHYDPKFSAANDTGFVDIPSGN 235

Query: 254 EQALLKAV-SMQPVSIAIAAYSTEFQSYKEGI-FNGVCGT-QLDHAVTIVGF---GTTED 307
           E+AL+KAV S+ PVS+AI A    FQ Y+ GI +   C + +LDH V  VG+   G   D
Sbjct: 236 ERALMKAVASVGPVSVAIDAGHESFQFYQSGIYYEKECSSEELDHGVLAVGYGFQGEDVD 295

Query: 308 GANYWLIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYPL 347
           G  +W++KNSW   WGD GY+ + +D +  CGI T +SYPL
Sbjct: 296 GKKFWIVKNSWSENWGDKGYIYMAKDRKNHCGIATAASYPL 336


>gi|3850787|emb|CAA05360.1| cathepsin S [Mus musculus]
          Length = 330

 Score =  241 bits (616), Expect = 3e-61,   Method: Compositional matrix adjust.
 Identities = 131/320 (40%), Positives = 194/320 (60%), Gaps = 16/320 (5%)

Query: 37  STHEQSVVEIH-EKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKE---GNRTYKLG 92
           +T E+  ++ H + W   H + YKD+ E+E+R  I+++NL++I   N E   G  TY++G
Sbjct: 15  ATAERPTLDHHWDLWKKTHEKEYKDKNEEEVRRLIWEKNLKFIMIHNLEYSMGMHTYQVG 74

Query: 93  TNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIK 152
            N   D+TN+E        ++P  S ++ T     +++ S   +P ++DWR+KG VT +K
Sbjct: 75  MNDMGDMTNEEILCRMGALRIPRQSPKTVT-----FRSYSNRTLPDTVDWREKGCVTEVK 129

Query: 153 NQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN---GNNGCLGGSREKAFA 209
            Q  CG CWAF+AV A+EG  K+++G LI LS Q L+DCS     GN GC GG   +AF 
Sbjct: 130 YQGSCGACWAFSAVGALEGQLKLKTGKLISLSAQNLVDCSNEEKYGNKGCGGGYMTEAFQ 189

Query: 210 YIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVSMQ-PVSI 268
           YII N GI  +  YPY+A+   C    K  AA  S Y ++P GDE AL +AV+ + PVS+
Sbjct: 190 YIIDNGGIEADASYPYKAMDEKCHYNSKNRAATCSRYIQLPFGDEDALKEAVATKGPVSV 249

Query: 269 AIAAYSTEFQSYKEGIFNG-VCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGY 327
            I A  + F  YK G+++   C   ++H V +VG+GT  DG +YWL+KNSWG  +GD GY
Sbjct: 250 GIDASHSSFFFYKSGVYDDPSCTGNVNHGVLVVGYGTL-DGKDYWLVKNSWGLNFGDQGY 308

Query: 328 MKIVR-DEGLCGIGTRSSYP 346
           +++ R ++  CGI +  SYP
Sbjct: 309 IRMARNNKNHCGIASYCSYP 328


>gi|312306194|gb|ADQ73946.1| cathepsin L [Paralithodes camtschaticus]
          Length = 324

 Score =  241 bits (616), Expect = 3e-61,   Method: Compositional matrix adjust.
 Identities = 129/308 (41%), Positives = 185/308 (60%), Gaps = 15/308 (4%)

Query: 48  EKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKE---GNRTYKLGTNQFSDLTNDEF 104
            ++  Q+GR Y    E+  R  ++ +N+E+IE  N++   G  TY L  NQF D+TN+E 
Sbjct: 23  HQFKVQYGRQYATAQEERYRSSVYDQNMEFIEAHNEQYTNGEVTYMLAINQFGDMTNEEI 82

Query: 105 RALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFA 164
            A+  G  +P+   R       +   L     P  +DWR KGAVTP+K+QK CG CWAF+
Sbjct: 83  NAVMNGL-LPASESRGVAVLGGRDDTL-----PAEVDWRTKGAVTPVKDQKACGSCWAFS 136

Query: 165 AVAAVEGITKIRSGNLIQLSEQQLLDCST-NGNNGCLGGSREKAFAYIIQNQGIATEDEY 223
           A  ++EG   ++ G L+ LSEQ L+DCST  G++GC GG  + AF YI  N GI TE  Y
Sbjct: 137 ATGSLEGQHFLKDGKLVSLSEQNLVDCSTKQGDHGCGGGLMDFAFTYIKDNGGIDTEASY 196

Query: 224 PYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAV-SMQPVSIAIAAYSTEFQSYKE 282
           PY+A  G C      + A ++ Y +V    E AL KAV ++ P+S+AI A  + F  Y +
Sbjct: 197 PYEATDGKCQYNPANSGATVTGYVDVEHDSEDALQKAVATIGPISVAIDASRSTFHFYHK 256

Query: 283 GI-FNGVC-GTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRDE-GLCGI 339
           G+ ++  C  T LDH V  VG+G T+DG +YWL+KNSW  TWG+ G++++ R+    CGI
Sbjct: 257 GVYYDKECSSTSLDHGVLAVGYG-TQDGTDYWLVKNSWNITWGNHGFIEMSRNRNNNCGI 315

Query: 340 GTRSSYPL 347
            T++SYPL
Sbjct: 316 ATQASYPL 323


>gi|66810271|ref|XP_638859.1| cysteine proteinase 3 [Dictyostelium discoideum AX4]
 gi|166201983|sp|Q23894.2|CYSP3_DICDI RecName: Full=Cysteine proteinase 3; AltName: Full=Cysteine
           proteinase II; Flags: Precursor
 gi|60467526|gb|EAL65548.1| cysteine proteinase 3 [Dictyostelium discoideum AX4]
          Length = 337

 Score =  241 bits (616), Expect = 3e-61,   Method: Compositional matrix adjust.
 Identities = 132/344 (38%), Positives = 210/344 (61%), Gaps = 13/344 (3%)

Query: 11  FKINTTPMFIIITLLVSCASQ-VVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLK 69
            +++ T +F +I L +S  S   V S   ++ S ++    WM  + ++Y  + E   R +
Sbjct: 1   MRLSITLIFTLIVLSISFISAGNVFSHKQYQDSFID----WMRSNNKAYTHK-EFMPRYE 55

Query: 70  IFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQ 129
            FK+N++Y+   N +G++T  LG NQ +DL+N+E+R  Y G +     +     +     
Sbjct: 56  EFKKNMDYVHNWNSKGSKTV-LGLNQHADLSNEEYRLNYLGTRAHIKLNGYHKRNLGLRL 114

Query: 130 NLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLL 189
           N      P ++DWR+K AVTP+K+Q +CG C++F+   +VEG+T I++G L+ LSEQ +L
Sbjct: 115 NRPQFKQPLNVDWREKDAVTPVKDQGQCGSCYSFSTTGSVEGVTAIKTGKLVSLSEQNIL 174

Query: 190 DCSTN-GNNGCLGGSREKAFAYIIQNQGIATEDEYPYQA-VPGTCSAAQKPAAAKISNYE 247
           DCS++ GN GC GG    AF YII+N G+ +E++YPY+  V   C   +   AAKI++Y+
Sbjct: 175 DCSSSFGNEGCNGGLMTNAFEYIIKNNGLNSEEQYPYEMKVNDECKFQEGSVAAKITSYK 234

Query: 248 EVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGI-FNGVCGTQ-LDHAVTIVGFGTT 305
           E+ +GDE  L  A+ + PVS+AI A    FQ Y  G+ +   C ++ LDH V  VG G T
Sbjct: 235 EIEAGDENDLQNALLLNPVSVAIDASHNSFQLYTAGVYYEPACSSEDLDHGVLAVGMG-T 293

Query: 306 EDGANYWLIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYPLA 348
           ++G +Y+++KNSWG +WG  GY+ + R+ +  CGI T +SYP+A
Sbjct: 294 DNGEDYYIVKNSWGPSWGLNGYIHMARNKDNNCGISTMASYPIA 337


>gi|113120271|gb|ABI30275.1| VS-A [Vasconcellea stipulata]
          Length = 318

 Score =  241 bits (615), Expect = 4e-61,   Method: Compositional matrix adjust.
 Identities = 131/287 (45%), Positives = 174/287 (60%), Gaps = 18/287 (6%)

Query: 38  THEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFS 97
           T  + ++ + + WM ++ + YKD  EK  R +IFK+NL+YI++ NK+ N TY LG   F+
Sbjct: 39  TSTEKLINLFDSWMVEYDKVYKDIDEKIYRFEIFKDNLKYIDETNKK-NNTYWLGLTSFT 97

Query: 98  DLTNDEFRALYTGYKMPSPSHRSTT----SSTFKYQNLSMTDVPTSLDWRDKGAVTPIKN 153
           DLTNDEF+  Y G     P + STT       F Y ++   ++P S+DWR KGAVTP++N
Sbjct: 98  DLTNDEFKEKYVG---SIPENWSTTEEPNDKEFIYDDV--VNIPASIDWRQKGAVTPVRN 152

Query: 154 QKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQ 213
           Q  CG CW F++VAAVEGI KI +G L+ LSEQ+LLDC    + GC GG    A  Y + 
Sbjct: 153 QGSCGSCWTFSSVAAVEGINKIVTGQLVSLSEQELLDCERR-SYGCRGGFPPYALQY-VA 210

Query: 214 NQGIATEDEYPYQAVPGTCSAAQ-KPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAA 272
           N GI     YPY+ V   C AAQ K    K      V   +EQAL++ +++QPVSI + A
Sbjct: 211 NSGIHLRQYYPYEGVQRQCRAAQAKGPKVKTDGVGRVQRNNEQALIQRIAIQPVSIVVEA 270

Query: 273 YSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWG 319
               FQ+Y+ GIF G CGT +DHAV  VG+G       Y LIKNSWG
Sbjct: 271 KGRAFQNYRGGIFAGPCGTSIDHAVAAVGYGN-----GYILIKNSWG 312


>gi|52076128|dbj|BAD46641.1| putative cysteine proteinase [Oryza sativa Japonica Group]
 gi|52076135|dbj|BAD46648.1| putative cysteine proteinase [Oryza sativa Japonica Group]
          Length = 374

 Score =  241 bits (615), Expect = 4e-61,   Method: Compositional matrix adjust.
 Identities = 139/330 (42%), Positives = 192/330 (58%), Gaps = 27/330 (8%)

Query: 40  EQSVVEIHEKWMAQHG---RSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQF 96
           E+S+  ++++W   +G    S +D  +K  R ++FK+N  YI   N++   +YKLG N+F
Sbjct: 36  EESMWSLYQRWRHVYGAASSSPRDLADKGSRFEVFKKNARYIHDFNRKKGMSYKLGLNKF 95

Query: 97  SDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKE 156
           +DLT +EF A YTG   P P       +          D P + DWR+ GAVT +K+Q  
Sbjct: 96  ADLTLEEFTAKYTGAN-PGPITGLKNGTGSPPLAAVAGDAPPAWDWREHGAVTRVKDQGP 154

Query: 157 CGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQG 216
           CG CWAF+ V AVEGI  I +GNL+ LSEQQ+LDCS  G+  C GG    AF Y + N G
Sbjct: 155 CGSCWAFSVVEAVEGINAIMTGNLLTLSEQQVLDCSGAGD--CSGGYTSYAFDYAVSN-G 211

Query: 217 IATED------------EYP-YQAVPGTCS-AAQKPAAAKISNYEEVPSGDEQALLKAVS 262
           I  +              YP Y+AV   C     K    KI +Y  V   DE+AL +AV 
Sbjct: 212 ITLDQCFSPPTTGENYFYYPAYEAVQEPCRFDPNKAPIVKIDSYSFVDPNDEEALKQAVY 271

Query: 263 MQ-PVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNT 321
            Q PVS+ I A S EF  Y+ G+F+G CGT+L+HAV +VG+  TEDG  YW++KNSWG  
Sbjct: 272 SQGPVSVLIEA-SYEFMIYQGGVFSGPCGTELNHAVLVVGYDETEDGTPYWIVKNSWGAG 330

Query: 322 WGDAGYMKIVRD----EGLCGIGTRSSYPL 347
           WG++GY++++R+    EG+CGI     YP+
Sbjct: 331 WGESGYIRMIRNIPAPEGICGIAMYPIYPI 360


>gi|307192137|gb|EFN75465.1| Cathepsin L [Harpegnathos saltator]
          Length = 339

 Score =  241 bits (615), Expect = 4e-61,   Method: Compositional matrix adjust.
 Identities = 131/338 (38%), Positives = 196/338 (57%), Gaps = 14/338 (4%)

Query: 20  IIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIE 79
           I + L +  A+Q +S  +     V E    +   H ++Y  ++E+  R+KIF EN   I 
Sbjct: 5   IFLLLGILAAAQAISFFNL----VTEEWNTFKVTHRKAYDSKIEESFRMKIFMENWHKIA 60

Query: 80  KANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTF--KYQNLSMT 134
             N++      +YKLG N++ D+ + EF     G+     +           ++   +  
Sbjct: 61  LHNQKYELNEVSYKLGMNKYGDMLHHEFINTLNGFNKSVSAQLRAQRRPIGSRFIEPANV 120

Query: 135 DVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN 194
           ++P+S+DWR  GAVTPIK+Q  CG CW+F+A  A+EG     +G L+ LSEQ L+DCS  
Sbjct: 121 EIPSSVDWRTHGAVTPIKDQGHCGSCWSFSATGALEGQHYRITGKLVSLSEQNLIDCSGR 180

Query: 195 -GNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGD 253
            GNNGC GG  ++AF YI  N G+ TE  YPY+A    C    +   A  S Y ++P G+
Sbjct: 181 YGNNGCNGGLMDQAFQYIKDNHGLDTEISYPYEAENDKCRYNPRNNGATDSGYVDIPEGN 240

Query: 254 EQALLKAV-SMQPVSIAIAAYSTEFQSYKEGI-FNGVCGTQ-LDHAVTIVGFGTTEDGAN 310
           E+ L  AV ++ PVS+AI A +  FQ Y+EG+ +   C ++ LDH V +VG+GT ++  +
Sbjct: 241 EKKLKAAVATIGPVSVAIDASAESFQFYREGVYYEPRCSSENLDHGVLVVGYGTDDNDQD 300

Query: 311 YWLIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYPL 347
           YWL+KNSWG TWGD GY+K+ R+ +  CGI + +SYPL
Sbjct: 301 YWLVKNSWGVTWGDEGYIKMARNKDNHCGIASSASYPL 338


>gi|261289783|ref|XP_002611753.1| hypothetical protein BRAFLDRAFT_236364 [Branchiostoma floridae]
 gi|229297125|gb|EEN67763.1| hypothetical protein BRAFLDRAFT_236364 [Branchiostoma floridae]
          Length = 307

 Score =  241 bits (615), Expect = 4e-61,   Method: Compositional matrix adjust.
 Identities = 129/294 (43%), Positives = 180/294 (61%), Gaps = 11/294 (3%)

Query: 63  EKEMRLKIFKENLEYIEKANKE---GNRTYKLGTNQFSDLTNDEFRA-LYTGYKMPSPSH 118
           E+  R++IF+ N + I   N E   G  TY LG NQF+ +TNDEF A +  G  +   + 
Sbjct: 15  EESRRMEIFENNTKLINLHNNEADLGMHTYWLGHNQFAHMTNDEFVANVIGGCLLDRNAS 74

Query: 119 RSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSG 178
           +ST     +Y + ++ ++P ++DWR KG VTP+KNQ++CG CWAF+   ++EG T  ++G
Sbjct: 75  KSTADRVHQYDS-NLVELPDTVDWRTKGYVTPVKNQEQCGSCWAFSTTGSLEGQTFKKTG 133

Query: 179 NLIQLSEQQLLDCSTN-GNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQK 237
            L+ LSEQ L+DCS   GN GC GG  + AF YI  N GI TED YPY+A  G C     
Sbjct: 134 KLVSLSEQNLVDCSGEFGNQGCNGGLMDDAFKYIKANGGIDTEDSYPYEARDGKCRFKPA 193

Query: 238 PAAAKISNYEEVPSGDEQALLKAV-SMQPVSIAIAAYSTEFQSYKEGIF--NGVCGTQLD 294
              A ++ Y ++  GDE AL +AV ++ P+S+AI A    FQ Y  G++       T+LD
Sbjct: 194 DVGATVTGYTDISEGDEGALTQAVATVGPISVAIDASHHTFQMYSHGVYYEPQCSSTELD 253

Query: 295 HAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRDE-GLCGIGTRSSYPL 347
           H V  VG+G TE G +YWL+KNSWG  WG  GY+ + R++   CGI T +SYPL
Sbjct: 254 HGVLAVGYG-TEGGKDYWLVKNSWGEVWGQNGYIMMSRNKNNQCGIATSASYPL 306


>gi|340381055|ref|XP_003389037.1| PREDICTED: cathepsin L1-like [Amphimedon queenslandica]
          Length = 329

 Score =  241 bits (615), Expect = 4e-61,   Method: Compositional matrix adjust.
 Identities = 135/338 (39%), Positives = 194/338 (57%), Gaps = 24/338 (7%)

Query: 22  ITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKEN----LEY 77
           + LL++ A+ +V + +    +  E+   W   +G+ Y  E E+  R  I++ N    LE+
Sbjct: 1   MKLLIAVAALIVCATAFEYTAEWEL---WKRTNGKDYSSEKEELYRQTIWEANKKIVLEH 57

Query: 78  IEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVP 137
              A+K G   + L  N F+DL + EF A+Y GY+  +    +T     +Y   +   +P
Sbjct: 58  NANADKWG---WTLEMNAFADLESSEFAAMYNGYRRSARKSNAT-----RYHVPTGNALP 109

Query: 138 TSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GN 196
            ++DWR KGAVTP+KNQK+CG CWAF+   ++EG T ++ G L  LSEQQL+DCS   GN
Sbjct: 110 DTVDWRTKGAVTPVKNQKQCGSCWAFSTTGSLEGQTFLKKGTLPSLSEQQLVDCSDKYGN 169

Query: 197 NGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQA 256
           +GC GG  + AF YI  N GI +E  YPY+A  G C   Q   AA  + Y+++P  D   
Sbjct: 170 HGCQGGLMDNAFKYIEANGGIDSEASYPYEAKNGKCRFQQSAVAATCTGYKDIPHDDIDG 229

Query: 257 LLKAVS-MQPVSIAIAAYSTEFQSYKEGIFNGVC--GTQLDHAVTIVGFGTTEDG----- 308
           L  AV+ + P+S+A+ A  + FQ Y  G+++ +    T+LDH V  VG+GT   G     
Sbjct: 230 LQDAVANVGPISVAMDASHSSFQLYAAGVYDPLLCSSTRLDHGVLAVGYGTEPSGLFHEE 289

Query: 309 ANYWLIKNSWGNTWGDAGYMKIVRDEGLCGIGTRSSYP 346
             YWL+KNSWG  WG  GY KIVR +  CGI T +SYP
Sbjct: 290 KPYWLVKNSWGPDWGQQGYFKIVRKDNKCGIATDASYP 327


>gi|440906716|gb|ELR56945.1| Cathepsin S, partial [Bos grunniens mutus]
          Length = 342

 Score =  241 bits (614), Expect = 4e-61,   Method: Compositional matrix adjust.
 Identities = 135/338 (39%), Positives = 200/338 (59%), Gaps = 18/338 (5%)

Query: 18  MFIIITLLVSCASQVVSSRSTHEQSVVEIH-EKWMAQHGRSYKDELEKEMRLKIFKENLE 76
           M  ++  L+ C+S +      H    ++ H + W   +G+ YK++ E+  R  I+++NL+
Sbjct: 12  MNWLVWALLLCSSAMAQ---VHRDPTLDHHWDLWKKTYGKQYKEKNEEVARRLIWEKNLK 68

Query: 77  YIEKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSM 133
            +   N E   G  +Y+LG N   D+T++E  +L +  ++PS   R+ T  +   Q L  
Sbjct: 69  TVTLHNLEHSMGMHSYELGMNHLGDMTSEEVISLMSSLRVPSQWPRNVTYKSDPNQKL-- 126

Query: 134 TDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCST 193
              P S+DWR+KG VT +K Q  CG CWAF+AV A+E   K+++G L+ LS Q L+DCST
Sbjct: 127 ---PDSMDWREKGCVTEVKYQGACGSCWAFSAVGALEAQVKLKTGKLVSLSAQNLVDCST 183

Query: 194 N--GNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPS 251
              GN GC GG   +AF YII N GI +E  YPY+A+ G C    K  AA  S Y E+P 
Sbjct: 184 AKYGNKGCNGGFMTEAFQYIIDNNGIDSEASYPYKAMDGKCQYDVKNRAATCSRYIELPF 243

Query: 252 GDEQALLKAVSMQ-PVSIAIAAYSTEFQSYKEGI-FNGVCGTQLDHAVTIVGFGTTEDGA 309
           G E+AL +AV+ + PVS+ I A  + F  YK G+ ++  C   ++H V +VG+G   DG 
Sbjct: 244 GSEEALKEAVANKGPVSVGIDASHSSFFLYKTGVYYDPSCTQNVNHGVLVVGYGNL-DGK 302

Query: 310 NYWLIKNSWGNTWGDAGYMKIVRDEG-LCGIGTRSSYP 346
           +YWL+KNSWG  +GD GY+++ R+ G  CGI +  SYP
Sbjct: 303 DYWLVKNSWGLHFGDQGYIRMARNSGNHCGIASYPSYP 340


>gi|428186189|gb|EKX55040.1| hypothetical protein GUITHDRAFT_63227 [Guillardia theta CCMP2712]
          Length = 344

 Score =  241 bits (614), Expect = 4e-61,   Method: Compositional matrix adjust.
 Identities = 131/299 (43%), Positives = 182/299 (60%), Gaps = 16/299 (5%)

Query: 63  EKEMRLKIFKENLEYIEKANKEGNR---TYKLGTNQFSDLTNDEFRALYTGYKMPSPSHR 119
           E     ++F++NL+ I K N+E N+   +Y++G N F+ LT +EF A Y GY   +   +
Sbjct: 47  ESTRAFEVFQKNLDMIMKHNEEYNQGLQSYEMGLNGFAHLTFEEFSAQYLGYG-GAEVEQ 105

Query: 120 STTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGN 179
             T    K++  S +++P S+DWR+KGAV  +KNQ  CG CWAF+AVAA+EG   + SG 
Sbjct: 106 PKTRRAGKHERKSRSEIPASVDWREKGAVAEVKNQGACGSCWAFSAVAALEGAHFLNSGE 165

Query: 180 LIQLSEQQLLDCSTN-GNNGCLGGSREKAFAYIIQNQGIA--TEDEYPYQAVPGTCSAAQ 236
           LI LSEQQL+DCS   GN+GC GG  + AF Y + N G    +E +YPY+ + G C  + 
Sbjct: 166 LISLSEQQLVDCSKKFGNHGCAGGYMDNAFEYWMNNTGHGDDSEKDYPYKGMDGKCKFSA 225

Query: 237 KPAAAKISNYEEVPSGDEQALLKAVS-MQPVSIAIAAYSTEFQSYKEGIFNGVCGT---Q 292
               A IS Y +V  G+E  LL AV+ + PVS+AI A     Q Y  G+FNGV GT    
Sbjct: 226 DGVRATISGYNDVKQGNETDLLDAVANVGPVSVAIHA-GAALQFYLRGVFNGVAGTCFGP 284

Query: 293 LDHAVTIVGFGTTE----DGANYWLIKNSWGNTWGDAGYMKIVRDEGLCGIGTRSSYPL 347
           L+H VT VG+GT         +YW+IKNSWG  WG+ G+++  R + LCG+   +SYPL
Sbjct: 285 LNHGVTAVGYGTASLRFGRKMDYWIIKNSWGMGWGEKGFVRFARGKNLCGVANGASYPL 343


>gi|42573181|ref|NP_974687.1| Xylem cysteine proteinase 1 [Arabidopsis thaliana]
 gi|332661102|gb|AEE86502.1| Xylem cysteine proteinase 1 [Arabidopsis thaliana]
          Length = 288

 Score =  241 bits (614), Expect = 5e-61,   Method: Compositional matrix adjust.
 Identities = 116/250 (46%), Positives = 169/250 (67%), Gaps = 5/250 (2%)

Query: 38  THEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFS 97
           T+   ++E+ E WM++H ++YK   EK  R ++F+ENL +I++ N E N +Y LG N+F+
Sbjct: 42  TNTDKLLELFESWMSEHSKAYKSVEEKVHRFEVFRENLMHIDQRNNEIN-SYWLGLNEFA 100

Query: 98  DLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKEC 157
           DLT++EF+  Y G   P  S +   S+ F+Y+++  TD+P S+DWR KGAV P+K+Q +C
Sbjct: 101 DLTHEEFKGRYLGLAKPQFSRKRQPSANFRYRDI--TDLPKSVDWRKKGAVAPVKDQGQC 158

Query: 158 GCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGI 217
           G CWAF+ VAAVEGI +I +GNL  LSEQ+L+DC T  N+GC GG  + AF YII   G+
Sbjct: 159 GSCWAFSTVAAVEGINQITTGNLSSLSEQELIDCDTTFNSGCNGGLMDYAFQYIISTGGL 218

Query: 218 ATEDEYPYQAVPGTCSAAQKPAA-AKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTE 276
             ED+YPY    G C   ++      IS YE+VP  D+++L+KA++ QPVS+AI A   +
Sbjct: 219 HKEDDYPYLMEEGICQEQKEDVERVTISGYEDVPENDDESLVKALAHQPVSVAIEASGRD 278

Query: 277 FQSYKEGIFN 286
           FQ YK G++N
Sbjct: 279 FQFYK-GVYN 287


>gi|354473025|ref|XP_003498737.1| PREDICTED: cathepsin S-like [Cricetulus griseus]
          Length = 341

 Score =  241 bits (614), Expect = 5e-61,   Method: Compositional matrix adjust.
 Identities = 130/306 (42%), Positives = 185/306 (60%), Gaps = 15/306 (4%)

Query: 50  WMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGN---RTYKLGTNQFSDLTNDEFRA 106
           W   HG+ YK++ E+E R  I+++NL+ +   N E +    +Y LG N   D+T++E   
Sbjct: 40  WKKFHGKQYKEKNEEEARRLIWEKNLKLVMLHNLEYSLEMHSYSLGMNHMGDMTSEEVLG 99

Query: 107 LYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAV 166
                ++PS  HR++T  +   Q L     P S+DWR+KG VT +K Q  CG CWAF+AV
Sbjct: 100 QMRPLRVPSQRHRNSTYKSNPNQKL-----PDSMDWREKGCVTEVKYQGSCGSCWAFSAV 154

Query: 167 AAVEGITKIRSGNLIQLSEQQLLDCSTN---GNNGCLGGSREKAFAYIIQNQGIATEDEY 223
            A+E   K+++G L+ LS Q L+DCST    GN GC GG   +AF YII N GI ++  Y
Sbjct: 155 GALEAQLKLKTGKLVSLSAQNLVDCSTEEKYGNKGCDGGFMTRAFQYIIDNGGIDSDASY 214

Query: 224 PYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVSMQ-PVSIAIAAYSTEFQSYKE 282
           PY+AV   C    K  AA  S Y E+PSGDE+AL +AV+ + PVS+ I A    F  YK 
Sbjct: 215 PYKAVAEKCHYDSKSRAATCSRYMELPSGDEEALKEAVANKGPVSVGIDASHPSFFLYKS 274

Query: 283 GIFN-GVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR-DEGLCGIG 340
           G+++   C   ++H V +VG+G   DG +YWL+KNSWG  +GD GY+++ R ++  CGI 
Sbjct: 275 GVYDEPSCTENVNHGVLVVGYGNL-DGKDYWLVKNSWGLHFGDQGYIRMARNNKNQCGIA 333

Query: 341 TRSSYP 346
           +  SYP
Sbjct: 334 SYGSYP 339


>gi|261289785|ref|XP_002611754.1| hypothetical protein BRAFLDRAFT_284341 [Branchiostoma floridae]
 gi|229297126|gb|EEN67764.1| hypothetical protein BRAFLDRAFT_284341 [Branchiostoma floridae]
          Length = 327

 Score =  241 bits (614), Expect = 5e-61,   Method: Compositional matrix adjust.
 Identities = 132/310 (42%), Positives = 191/310 (61%), Gaps = 12/310 (3%)

Query: 48  EKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKE---GNRTYKLGTNQFSDLTNDEF 104
           E +   HG+ YK   E+ +R  IF++N + I++ N+E   G R+Y +G NQF DL + E+
Sbjct: 21  EAFKLTHGKQYKSPDEENVRRAIFRDNNQMIKEHNQEAAMGRRSYFMGMNQFGDLAHSEY 80

Query: 105 RALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFA 164
             L  G  +  P + ST S    +++     V  ++DWR KGAVTPIK+Q  CG CWAF+
Sbjct: 81  LELVVGPGLL-PLNLSTPSENV-FESTPGLQVDDTVDWRQKGAVTPIKDQGHCGSCWAFS 138

Query: 165 AVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GNNGCLGGSREKAFAYIIQNQGIATEDEY 223
              ++EG   +++G L+ LSEQ LLDCS   GN GC GG  ++AF YI  N GI TE+ Y
Sbjct: 139 TTGSLEGQHFMKTGKLVSLSEQNLLDCSRRFGNKGCEGGLMDQAFRYIKSNGGIDTEECY 198

Query: 224 PYQAVP-GTCSAAQKPAAAKISNYEEVPSGDEQALLKAV-SMQPVSIAIAAYSTEFQSYK 281
           PY A     C      + A +S+Y ++ + DE AL++AV ++ PVS+AI A     + YK
Sbjct: 199 PYMAKDEKVCDYKTSCSGATLSSYTDIKAMDEMALMQAVGTVGPVSVAIDASHKSLRFYK 258

Query: 282 EGIFN--GVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRDE-GLCG 338
            GI++      T+LDH V  VG+G+  DG +YWL+KNSWG+ WGD GY+K+ R++   CG
Sbjct: 259 SGIYDEPECSRTKLDHGVLAVGYGSM-DGMDYWLVKNSWGSAWGDMGYVKMTRNKNNQCG 317

Query: 339 IGTRSSYPLA 348
           I T++SYP+ 
Sbjct: 318 IATKASYPVV 327


>gi|403300975|ref|XP_003941187.1| PREDICTED: cathepsin L1-like isoform 1 [Saimiri boliviensis
           boliviensis]
 gi|403300977|ref|XP_003941188.1| PREDICTED: cathepsin L1-like isoform 2 [Saimiri boliviensis
           boliviensis]
 gi|403300979|ref|XP_003941189.1| PREDICTED: cathepsin L1-like isoform 3 [Saimiri boliviensis
           boliviensis]
          Length = 333

 Score =  240 bits (613), Expect = 6e-61,   Method: Compositional matrix adjust.
 Identities = 132/342 (38%), Positives = 199/342 (58%), Gaps = 23/342 (6%)

Query: 16  TPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENL 75
            P  I+    +  AS  ++   + E   +    KW A H R Y    E+E R  ++++N+
Sbjct: 2   NPTLILAAFCLGLASAALTFNHSLEAQWI----KWKAMHNRLYGKN-EEEWRRAVWEKNM 56

Query: 76  EYIEKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLS 132
           + IE  N E   G  ++ +  N F D+TN+EFR +  G++   P +         +Q   
Sbjct: 57  KTIELHNHEYNQGKHSFTMAMNTFGDMTNEEFRQVMNGFQNRKPRNGKV------FQEPL 110

Query: 133 MTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS 192
           + + P S+DWR+KG VTP+KNQ +CG CWAF+A  A+EG    ++G L+ LSEQ L+DCS
Sbjct: 111 LHEAPRSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCS 170

Query: 193 -TNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPS 251
              GN GC GG  + AF Y+ +N G+ +E+ YPY+A   +C    K + A  + + ++P 
Sbjct: 171 GPQGNQGCNGGLMDYAFQYVQENGGLDSEESYPYEATEESCKYNPKYSVANDTGFVDIPK 230

Query: 252 GDEQALLKAV-SMQPVSIAIAAYSTEFQSYKEGI-FNGVCGTQ-LDHAVTIVGFG---TT 305
             E+AL+KAV ++ P+S+AI A    FQ YKEGI F   C ++ +DH V +VG+G   T 
Sbjct: 231 -LEKALMKAVATVGPISVAIDAGHESFQFYKEGIYFEPECSSEDMDHGVLVVGYGFERTG 289

Query: 306 EDGANYWLIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYP 346
            D + YWL+KNSWG  WG  GY+K+ +D +  CGI + +SYP
Sbjct: 290 SDNSKYWLVKNSWGEEWGMDGYIKMAKDRKNHCGIASAASYP 331


>gi|380014284|ref|XP_003691169.1| PREDICTED: cathepsin L-like [Apis florea]
          Length = 345

 Score =  240 bits (613), Expect = 6e-61,   Method: Compositional matrix adjust.
 Identities = 134/348 (38%), Positives = 205/348 (58%), Gaps = 26/348 (7%)

Query: 18  MFIIITLLVSCASQVVSSRSTHEQSVVE-IHEKWMA---QHGRSYKDELEKEMRLKIFKE 73
           M + + L ++  + V      H  S  E ++++WM    +H ++YK ++E+  R+KIF +
Sbjct: 1   MKLFLILFITIFATV------HAVSFFELVNQEWMTFKMEHKKAYKSDVEERFRMKIFMD 54

Query: 74  NLEYIEKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMP----SPSHRSTTSSTF 126
           N   I K N        +YKL  N++ D+ + EF  +  G+         S R    ++F
Sbjct: 55  NKHKIAKHNSNYEMKKVSYKLKMNKYGDMLHHEFVNILNGFNKSINTQLRSERMPIGASF 114

Query: 127 -KYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSE 185
            +  N+++   P  +DWR +GAVTP+K+Q  CG CW+F+A  A+EG    R+G L+ LSE
Sbjct: 115 IEPANVAL---PKKVDWRKEGAVTPVKDQGHCGSCWSFSATGALEGQHFRRTGVLVSLSE 171

Query: 186 QQLLDCSTN-GNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKIS 244
           Q L+DCS   GNNGC GG  ++AF YI  N+G+ TE  YPY+A    C      + A   
Sbjct: 172 QNLIDCSGKYGNNGCNGGLMDQAFQYIKDNKGLDTEASYPYEAENDKCRYNPANSGAIDV 231

Query: 245 NYEEVPSGDEQALLKAV-SMQPVSIAIAAYSTEFQSYKEGI-FNGVCGT-QLDHAVTIVG 301
            Y ++P+G+E+ L  AV ++ PVS+AI A    FQ Y EG+ +   C + +LDH V ++G
Sbjct: 232 GYIDIPTGNEKLLKAAVATIGPVSVAIDASHQSFQFYSEGVYYEPECSSEELDHGVLVIG 291

Query: 302 FGTTEDGANYWLIKNSWGNTWGDAGYMKIVRDE-GLCGIGTRSSYPLA 348
           +GT E+G +YWL+KNSWG TWG+ GY+K+ R++   CGI + +SYPL 
Sbjct: 292 YGTNENGEDYWLVKNSWGETWGNNGYIKMARNKLNHCGIASSASYPLV 339


>gi|357153071|ref|XP_003576329.1| PREDICTED: fruit bromelain-like [Brachypodium distachyon]
          Length = 398

 Score =  240 bits (613), Expect = 6e-61,   Method: Compositional matrix adjust.
 Identities = 141/331 (42%), Positives = 185/331 (55%), Gaps = 34/331 (10%)

Query: 48  EKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEG---NRTYKLGTNQFSDLTNDEF 104
           + WMA  GRSY    E   R +++K N+ YIE  N E      T++LG   F+DLT++EF
Sbjct: 63  QGWMAAQGRSYWTAEETARRFEVYKSNVRYIEAVNAEAATTGLTFELGEGPFTDLTHEEF 122

Query: 105 RALYTGYKMPSPSHRSTTSSTFKYQNLSMTDV----------------------PTSLDW 142
            ALY G  MP P          + + +  T V                      P S DW
Sbjct: 123 SALYNG-SMPPPEEEEGDDIQEEDEQVIATVVDGVDVNVAVHTNLSAGGPRPWPPRSRDW 181

Query: 143 RDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGG 202
           R  GAVTPIK+Q  CG CWAF  VA +EG  KI  GNL+ LSEQQL+DC    N+GC GG
Sbjct: 182 RKHGAVTPIKDQGRCGSCWAFPTVATIEGKHKIVRGNLVSLSEQQLIDCDYT-NSGCKGG 240

Query: 203 SREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVS 262
              +A+ +I +  G+ T   YPY+   G C   ++ AAA+I+ +  V S  E AL+ AV+
Sbjct: 241 FVIRAYRWIRKIGGLTTSSAYPYKGARGKCMKRRR-AAARIAGWRSVRSRSEVALVNAVA 299

Query: 263 MQPVSIAIAAYSTEFQSYKEGIFNGVCGT-QLDHAVTIVGFGTTED-GANYWLIKNSWGN 320
            QPV++ I+A    FQ YK+GI NG C T +L+HAVT+VG+G   D GA YW++KNSWG 
Sbjct: 300 GQPVAVYISASGKNFQHYKKGILNGPCDTARLNHAVTVVGYGRQADTGAKYWIVKNSWGT 359

Query: 321 TWGDAGYMKIVR----DEGLCGIGTRSSYPL 347
           TWG  GY+ + R      G CGI T   +PL
Sbjct: 360 TWGQEGYILMKRGTRNPRGQCGIATSPVFPL 390


>gi|291383488|ref|XP_002708302.1| PREDICTED: cathepsin L1 [Oryctolagus cuniculus]
          Length = 344

 Score =  240 bits (613), Expect = 6e-61,   Method: Compositional matrix adjust.
 Identities = 133/341 (39%), Positives = 209/341 (61%), Gaps = 20/341 (5%)

Query: 18  MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
           M + + L V C S ++S+  T + S+     +W+A H R Y    E+E R  ++++N++ 
Sbjct: 1   MHLPLFLAVLC-SGMISAAPTPDHSLDTRWRQWLAAHKRRYGVR-EEEWRRAVWEKNMQM 58

Query: 78  IEKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMT 134
           IEK N+E   G   + +  N + D+TN+EFR +  G++  + +H+       ++ N  + 
Sbjct: 59  IEKHNREYSQGKHGFTMAMNAYGDMTNEEFRLMMNGFE--NQNHKRGE----EFHNSLLF 112

Query: 135 DVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS-T 193
            +P  LDWR++G VTP+KNQ+ CG  WAF+A  A+EG    ++G L+ LSEQ L+DCS  
Sbjct: 113 KIPAFLDWRERGYVTPVKNQELCGSSWAFSATGALEGQMFRKTGRLVSLSEQNLVDCSWP 172

Query: 194 NGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGD 253
            GN GC GG  + AF Y+  N+G+ +E+ YPY+   G+C    + +AA ++ + +V S D
Sbjct: 173 QGNQGCSGGLMDYAFQYVKDNRGLDSEESYPYEQRKGSCKYNPRFSAANVTGFVDV-SKD 231

Query: 254 EQALLKAV-SMQPVSIAIAAYSTEFQSYKEGI-FNGVCGTQ-LDHAVTIVGFGTTEDGA- 309
           E+AL++AV ++ PVS+ IA     F  Y+ GI ++  C ++ ++HAV +VG+G  E G+ 
Sbjct: 232 EKALMEAVATVGPVSVGIATTPESFLFYEGGIYYDPKCSSENVNHAVLVVGYGFEEVGSK 291

Query: 310 --NYWLIKNSWGNTWGDAGYMKIVRDE-GLCGIGTRSSYPL 347
              YWLIKNSWG  WG  GYMK+ +D+   CGI T +SYPL
Sbjct: 292 NNKYWLIKNSWGKDWGMGGYMKMAKDQNNHCGIATAASYPL 332


>gi|159792912|gb|ABW98676.1| cathepsin L [Apostichopus japonicus]
          Length = 332

 Score =  240 bits (613), Expect = 7e-61,   Method: Compositional matrix adjust.
 Identities = 128/307 (41%), Positives = 182/307 (59%), Gaps = 13/307 (4%)

Query: 48  EKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKE---GNRTYKLGTNQFSDLTNDEF 104
           E W   H + Y  E E++ R KI+++NL+ + K N E   G  +Y LG N+++DL  +EF
Sbjct: 29  EAWKQTHSKQYTKE-EEDNRRKIWEDNLQKVSKHNTEHSLGLHSYTLGMNKYADLRGEEF 87

Query: 105 RALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFA 164
             +  G K  +   R       K+ + +    P S+DWRD+G VTP+K+Q +CG CWAF+
Sbjct: 88  VQMMNGLKFDASRERQG----IKFLSYAKFQAPDSVDWRDEGYVTPVKDQGQCGSCWAFS 143

Query: 165 AVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GNNGCLGGSREKAFAYIIQNQGIATEDEY 223
              ++EG     +G L  LSEQ L+DCS + GNNGC GG  + AF YI  N GI TED+Y
Sbjct: 144 TTGSLEGQHFRSTGVLTSLSEQNLVDCSISYGNNGCEGGLMDYAFQYIKDNLGIDTEDKY 203

Query: 224 PYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVSMQ-PVSIAIAAYSTEFQSYKE 282
           PY+A   TC  +     A  S Y +V SGDE AL +A +   P+S+AI A    FQ Y+ 
Sbjct: 204 PYEAEDDTCRFSPDNVGATDSGYVDVDSGDEDALKEACAANGPISVAIDASHESFQLYES 263

Query: 283 GIFN--GVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD-EGLCGI 339
           G+++       +LDH V +VG+GT   G +YW++KNSWG +WG  GY+ + R+ +  CGI
Sbjct: 264 GVYDEESCSSIELDHGVLVVGYGTDSVGGDYWIVKNSWGLSWGQEGYIWMSRNKDNQCGI 323

Query: 340 GTRSSYP 346
            T +SYP
Sbjct: 324 ATSASYP 330


>gi|75812934|ref|NP_001028787.1| cathepsin S precursor [Bos taurus]
 gi|115503669|sp|P25326.2|CATS_BOVIN RecName: Full=Cathepsin S; Flags: Precursor
 gi|74353837|gb|AAI02246.1| Cathepsin S [Bos taurus]
 gi|296489535|tpg|DAA31648.1| TPA: cathepsin S precursor [Bos taurus]
          Length = 331

 Score =  240 bits (613), Expect = 7e-61,   Method: Compositional matrix adjust.
 Identities = 135/338 (39%), Positives = 199/338 (58%), Gaps = 18/338 (5%)

Query: 18  MFIIITLLVSCASQVVSSRSTHEQSVVEIH-EKWMAQHGRSYKDELEKEMRLKIFKENLE 76
           M  ++  L+ C+S +      H    ++ H + W   +G+ YK++ E+  R  I+++NL+
Sbjct: 1   MNWLVWALLLCSSAMAH---VHRDPTLDHHWDLWKKTYGKQYKEKNEEVARRLIWEKNLK 57

Query: 77  YIEKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSM 133
            +   N E   G  +Y+LG N   D+T++E  +L +  ++PS   R+ T  +   Q L  
Sbjct: 58  TVTLHNLEHSMGMHSYELGMNHLGDMTSEEVISLMSSLRVPSQWPRNVTYKSDPNQKL-- 115

Query: 134 TDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCST 193
              P S+DWR+KG VT +K Q  CG CWAF+AV A+E   K+++G L+ LS Q L+DCST
Sbjct: 116 ---PDSMDWREKGCVTEVKYQGACGSCWAFSAVGALEAQVKLKTGKLVSLSAQNLVDCST 172

Query: 194 N--GNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPS 251
              GN GC GG   +AF YII N GI +E  YPY+A+ G C    K  AA  S Y E+P 
Sbjct: 173 AKYGNKGCNGGFMTEAFQYIIDNNGIDSEASYPYKAMDGKCQYDVKNRAATCSRYIELPF 232

Query: 252 GDEQALLKAVSMQ-PVSIAIAAYSTEFQSYKEGI-FNGVCGTQLDHAVTIVGFGTTEDGA 309
           G E+AL +AV+ + PVS+ I A  + F  YK G+ ++  C   ++H V +VG+G   DG 
Sbjct: 233 GSEEALKEAVANKGPVSVGIDASHSSFFLYKTGVYYDPSCTQNVNHGVLVVGYGNL-DGK 291

Query: 310 NYWLIKNSWGNTWGDAGYMKIVRDEG-LCGIGTRSSYP 346
           +YWL+KNSWG  +GD GY+++ R+ G  CGI    SYP
Sbjct: 292 DYWLVKNSWGLHFGDQGYIRMARNSGNHCGIANYPSYP 329


>gi|326514800|dbj|BAJ99761.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 291

 Score =  240 bits (613), Expect = 7e-61,   Method: Compositional matrix adjust.
 Identities = 118/219 (53%), Positives = 152/219 (69%), Gaps = 4/219 (1%)

Query: 133 MTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS 192
           + DVP+S+DWR KGAVT +K+Q +CG CWAF+ +AAVEGI  IR+ NL  LSEQQL+DC 
Sbjct: 58  VRDVPSSVDWRQKGAVTAVKDQGQCGSCWAFSTIAAVEGINAIRTKNLTSLSEQQLVDCD 117

Query: 193 TNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSG 252
           T  N GC GG  + AF YI ++ G+A ED YPY+A   +    +  A   I  YE+VP+ 
Sbjct: 118 TKSNAGCNGGLMDYAFQYIAKHGGVAAEDAYPYKARQASSCNKKPSAVVTIDGYEDVPAN 177

Query: 253 DEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYW 312
           DE AL KAV+ QPV++AI A  + FQ Y EG+F G CGT+LDH V  VG+GTT DG  YW
Sbjct: 178 DETALKKAVAAQPVAVAIEASGSHFQFYSEGVFAGKCGTELDHGVAAVGYGTTVDGTKYW 237

Query: 313 LIKNSWGNTWGDAGYMKIVRD----EGLCGIGTRSSYPL 347
           ++KNSWG  WG+ GY+++ RD    EGLCGI   +SYP+
Sbjct: 238 IVKNSWGPEWGEKGYIRMKRDVEDKEGLCGIAMEASYPV 276


>gi|62510452|sp|Q8HY81.1|CATS_CANFA RecName: Full=Cathepsin S; Flags: Precursor
 gi|27497538|gb|AAO13009.1| cathepsin S preproprotein [Canis lupus familiaris]
          Length = 331

 Score =  240 bits (613), Expect = 7e-61,   Method: Compositional matrix adjust.
 Identities = 135/338 (39%), Positives = 195/338 (57%), Gaps = 18/338 (5%)

Query: 18  MFIIITLLVSCASQVVSSRSTHEQSVVEIH-EKWMAQHGRSYKDELEKEMRLKIFKENLE 76
           M  ++ LL  C+  V      H+   ++ H   W   + + YK+E E+  R  I+++NL+
Sbjct: 1   MKWLVGLLPLCSYAVAQ---VHKDPTLDHHWNLWKKTYSKQYKEENEEVARRLIWEKNLK 57

Query: 77  YIEKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSM 133
           ++   N E   G  +Y LG N   D+T +E  +L    ++PS   R+ T     Y++ S 
Sbjct: 58  FVMLHNLEHSMGMHSYDLGMNHLGDMTGEEVISLMGSLRVPSQWQRNVT-----YRSNSN 112

Query: 134 TDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCST 193
             +P S+DWR+KG VT +K Q  CG CWAF+AV A+E   K+++G L+ LS Q L+DCST
Sbjct: 113 QKLPDSVDWREKGCVTEVKYQGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCST 172

Query: 194 N--GNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPS 251
              GN GC GG    AF YII N GI +E  YPY+A+ G C    K  AA  S Y E+P 
Sbjct: 173 EKYGNKGCNGGFMTTAFQYIIDNNGIDSEASYPYKAMNGKCRYDSKKRAATCSKYTELPF 232

Query: 252 GDEQALLKAVSMQ-PVSIAIAAYSTEFQSYKEGI-FNGVCGTQLDHAVTIVGFGTTEDGA 309
           G E AL +AV+ + PVS+AI A    F  Y+ G+ +   C   ++H V +VG+G   +G 
Sbjct: 233 GSEDALKEAVANKGPVSVAIDASHYSFFLYRSGVYYEPSCTQNVNHGVLVVGYGNL-NGK 291

Query: 310 NYWLIKNSWGNTWGDAGYMKIVRDEG-LCGIGTRSSYP 346
           +YWL+KNSWG  +GD GY+++ R+ G  CGI +  SYP
Sbjct: 292 DYWLVKNSWGLNFGDQGYIRMARNSGNHCGIASYPSYP 329


>gi|157278115|ref|NP_001098156.1| cathepsin L precursor [Oryzias latipes]
 gi|50251128|dbj|BAD27581.1| cathepsin L [Oryzias latipes]
          Length = 336

 Score =  240 bits (613), Expect = 7e-61,   Method: Compositional matrix adjust.
 Identities = 131/310 (42%), Positives = 186/310 (60%), Gaps = 17/310 (5%)

Query: 50  WMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKE---GNRTYKLGTNQFSDLTNDEFRA 106
           W   H ++Y ++ E   RL ++++NL  IE  N E   G  +Y+LG N F D+T++EFR 
Sbjct: 31  WKGWHSKNYHEKEEGWRRL-VWEKNLRKIELHNLEHSMGKHSYRLGMNHFGDMTHEEFRQ 89

Query: 107 LYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAV 166
           +  GYK      R  + S F   N    + P ++DWRDKG VTP+K+Q +CG CWAF+  
Sbjct: 90  IMNGYK--RREQRKYSGSLFMEPNF--LEAPRAVDWRDKGYVTPVKDQGQCGSCWAFSTT 145

Query: 167 AAVEGITKIRSGNLIQLSEQQLLDCS-TNGNNGCLGGSREKAFAYIIQNQGIATEDEYPY 225
            A+EG    ++G L+ LSEQ L+DCS   GN GC GG  ++AF Y+  NQG+ +ED YPY
Sbjct: 146 GALEGQQFRKTGKLVSLSEQNLVDCSRPEGNEGCNGGLMDQAFQYVKDNQGLDSEDFYPY 205

Query: 226 QAVPGT-CSAAQKPAAAKISNYEEVPSGDEQALLKAV-SMQPVSIAIAAYSTEFQSYKEG 283
           +      C    + +A   + + ++PSG E+AL+KAV S+ PVS+AI A    FQ Y+ G
Sbjct: 206 KGTDDQPCQYNAQYSAVNDTGFVDIPSGKERALMKAVASVGPVSVAIDAGHESFQFYQSG 265

Query: 284 I-FNGVCGT-QLDHAVTIVGF---GTTEDGANYWLIKNSWGNTWGDAGYMKIVRD-EGLC 337
           I F   C + +LDH V +VG+   G   DG  YW++KNSW   WGD G++ + +D    C
Sbjct: 266 IYFEKECSSDELDHGVLVVGYGFEGEDVDGKKYWIVKNSWSEKWGDKGFIYMAKDRHNHC 325

Query: 338 GIGTRSSYPL 347
           GI T +SYPL
Sbjct: 326 GIATAASYPL 335


>gi|355681664|gb|AER96818.1| cathepsin S [Mustela putorius furo]
          Length = 338

 Score =  240 bits (613), Expect = 7e-61,   Method: Compositional matrix adjust.
 Identities = 129/305 (42%), Positives = 188/305 (61%), Gaps = 14/305 (4%)

Query: 50  WMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKE---GNRTYKLGTNQFSDLTNDEFRA 106
           W   +GR Y+++ E+  R  I+++NL+ +   N E   G  +Y LG N  +D+T++E  +
Sbjct: 39  WKKTYGRQYQEKNEEVARRLIWEKNLKSVMLHNLEYSMGMHSYDLGMNHLADMTSEEVSS 98

Query: 107 LYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAV 166
           L +  ++PS    + T     Y++ S   +P S+DWR+KG VT +K Q  CG CWAF+AV
Sbjct: 99  LMSSLRVPSQWQANVT-----YKSNSNQKLPDSVDWREKGCVTEVKYQGACGACWAFSAV 153

Query: 167 AAVEGITKIRSGNLIQLSEQQLLDCSTN--GNNGCLGGSREKAFAYIIQNQGIATEDEYP 224
            A+E   K+++GNL+ LS Q L+DCST   GN GC GG   KAF YII N GI +E  YP
Sbjct: 154 GALEAQLKLKTGNLVSLSAQNLVDCSTERYGNKGCNGGFMTKAFQYIIDNNGIDSEVSYP 213

Query: 225 YQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVSMQ-PVSIAIAAYSTEFQSYKEG 283
           Y+A+ G C    K  AA  S Y E+P G E AL +AV+ + PVS+AI A  + F  YK G
Sbjct: 214 YKAMDGNCRYDSKHRAATCSKYTELPFGSEDALKEAVANKGPVSVAIDAKHSSFFLYKSG 273

Query: 284 I-FNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRDEG-LCGIGT 341
           + ++  C   ++H V +VG+G   +G +YWL+KNSWG  +G+ GY+++ R+ G  CGI +
Sbjct: 274 VYYDPSCTQNVNHGVLVVGYGNL-NGRDYWLVKNSWGLNFGEQGYIRMARNSGNHCGIAS 332

Query: 342 RSSYP 346
             SYP
Sbjct: 333 YPSYP 337


>gi|225706086|gb|ACO08889.1| Cathepsin S precursor [Osmerus mordax]
          Length = 333

 Score =  240 bits (613), Expect = 7e-61,   Method: Compositional matrix adjust.
 Identities = 126/312 (40%), Positives = 188/312 (60%), Gaps = 13/312 (4%)

Query: 44  VEIH-EKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKE---GNRTYKLGTNQFSDL 99
           +++H + W  QHG++YK E+E+  R ++++ NL+ I   N E   G  TY LG N   D+
Sbjct: 26  LDLHWQMWKKQHGKNYKTEVEELGRREVWERNLQLISLHNLEASMGMHTYDLGMNHMGDM 85

Query: 100 TNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGC 159
           T +E    +   K+P+   R  ++    +   S T VP ++DWR KG VT +KNQ  CG 
Sbjct: 86  TEEEILQSFASLKVPADLKREPSA----FVASSGTPVPDTVDWRQKGYVTQVKNQGSCGS 141

Query: 160 CWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GNNGCLGGSREKAFAYIIQNQGIA 218
           CWAF++V A+EG     +G L+ LS Q L+DCS+  GN GC GG   +AF Y+I N+GI 
Sbjct: 142 CWAFSSVGALEGQLMRTTGKLLDLSPQNLVDCSSKYGNKGCNGGFMSEAFQYVIDNKGID 201

Query: 219 TEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVSM-QPVSIAIAAYSTEF 277
           ++  YPYQ V GTC       +A  + Y  +P GDE  L +AV+M  P+S+AI A    F
Sbjct: 202 SDTSYPYQGVQGTCHYNPSYRSANCTRYSFLPEGDETTLKQAVAMIGPISVAIDATRPSF 261

Query: 278 QSYKEGIFNGV-CGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRDE-G 335
             ++ G++N + C  +++HAV +VG+GT  DG +YWL+KNSWG  +G+ GY+++ R+   
Sbjct: 262 ILWRSGVYNDLTCTQKINHAVLVVGYGTL-DGQDYWLVKNSWGTRFGENGYIRMSRNRNN 320

Query: 336 LCGIGTRSSYPL 347
            CGI     YP+
Sbjct: 321 QCGIALYGCYPI 332


>gi|50657027|emb|CAH04631.1| cathepsin H [Suberites domuncula]
          Length = 335

 Score =  240 bits (612), Expect = 7e-61,   Method: Compositional matrix adjust.
 Identities = 131/337 (38%), Positives = 194/337 (57%), Gaps = 13/337 (3%)

Query: 18  MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
           +F+ + +LV   S  +           +  ++W  +HG+ Y  E E + RLK+F +N+ Y
Sbjct: 6   LFLGLCVLVHVCSAFIPLVLPIPGLYEDYFKEWQEKHGKVYSTEEESQSRLKVFMKNVIY 65

Query: 78  IEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVP 137
           I+  NK+G+ +Y+L  N+++D+T DEF+  Y    +  P H S T S  K       D P
Sbjct: 66  IDNHNKQGH-SYELEVNEYADMTLDEFKDQY----LMEPQHCSATHS-LKSDPPKYRDPP 119

Query: 138 TSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GN 196
            ++DWR KGAVTP+KNQ +CG CW F+    +E    +++G L+ LSEQQL+DC+    N
Sbjct: 120 KAIDWRSKGAVTPVKNQGQCGSCWTFSTTGCLESHHFLKTGQLVSLSEQQLVDCAQAFNN 179

Query: 197 NGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQA 256
           NGC GG   +AF YI  N G+ +E+ YPY+A    C       +A +SN   + S DE  
Sbjct: 180 NGCNGGLPSQAFEYIHYNGGLDSEESYPYRAHDEKCHFVPSEVSATVSNVVNITSKDEMQ 239

Query: 257 LLKAV-SMQPVSIAIAAYSTEFQSYKEGIFNGV-CGT---QLDHAVTIVGFGTTEDGANY 311
           L  AV ++ PVSIA    S +F+ YK+G++    C T    ++HAV  VG+ TTE G +Y
Sbjct: 240 LYNAVGTVGPVSIAYDV-SADFRFYKKGVYKSKECKTDPEHVNHAVLAVGYNTTESGEDY 298

Query: 312 WLIKNSWGNTWGDAGYMKIVRDEGLCGIGTRSSYPLA 348
           W++KNSWG  +G  GY  I R E +CG+   +SYP+ 
Sbjct: 299 WIVKNSWGTKFGINGYFWIARGENMCGLADCASYPIV 335


>gi|6630972|gb|AAF19630.1|AF194426_1 cysteine proteinase precursor [Myxine glutinosa]
          Length = 324

 Score =  240 bits (612), Expect = 7e-61,   Method: Compositional matrix adjust.
 Identities = 127/308 (41%), Positives = 191/308 (62%), Gaps = 12/308 (3%)

Query: 48  EKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKAN---KEGNRTYKLGTNQFSDLTNDEF 104
           E W  ++G+SY    E+ +R ++++ NL+ +++ N    +G   Y+LG N ++DL N+EF
Sbjct: 20  ESWKGKYGKSYLGRGEEVLRKRVWESNLQIVQQHNVLADQGQANYRLGMNTYADLYNEEF 79

Query: 105 RALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFA 164
            AL     +     +S+T  TFK   L    +P+S+DWR++G VTP+K+Q +CG CW+F+
Sbjct: 80  MALKGSSGILQAKDQSSTQ-TFK--PLVGVTLPSSVDWRNQGYVTPVKDQGQCGSCWSFS 136

Query: 165 AVAAVEGITKIRSGNLIQLSEQQLLDCS-TNGNNGCLGGSREKAFAYIIQNQGIATEDEY 223
           A  ++EG    ++G L+ LSEQQL+DCS + GN GC GG  E A+ YI    G+  E  Y
Sbjct: 137 ATGSLEGQHFAKTGTLVSLSEQQLVDCSWSYGNYGCSGGLMESAYDYIRDAGGVQLESAY 196

Query: 224 PYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAV-SMQPVSIAIAAYSTEFQSYKE 282
           PY A  G C   Q  A A  + +  +PSGDEQ+L++AV ++ PV++AI A   +FQ Y+ 
Sbjct: 197 PYTAQNGRCHFDQSKAVATCTGHVAIPSGDEQSLMQAVGTVGPVAVAIDASGYDFQLYES 256

Query: 283 GIFN--GVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRDEG-LCGI 339
           G+++      + LDH V   G+G TE G +YWL+KNSWG  WG  GY+K+ R++   CGI
Sbjct: 257 GVYDRSRCSSSSLDHGVLAAGYG-TEGGNDYWLVKNSWGPGWGAQGYIKMSRNKSNQCGI 315

Query: 340 GTRSSYPL 347
            T + YPL
Sbjct: 316 ATMACYPL 323


>gi|348565223|ref|XP_003468403.1| PREDICTED: cathepsin L1-like [Cavia porcellus]
          Length = 333

 Score =  240 bits (612), Expect = 8e-61,   Method: Compositional matrix adjust.
 Identities = 135/342 (39%), Positives = 197/342 (57%), Gaps = 21/342 (6%)

Query: 16  TPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENL 75
           TP F++  L +     +VS+    +Q++    ++W A HGR Y    E+  R  ++++NL
Sbjct: 2   TPSFVLAALCLG----IVSALPKLDQTLDAQWDQWKAAHGRLYGLN-EEGWRRAVWEKNL 56

Query: 76  EYIEKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLS 132
             IE  N E   G  ++ LG N F D+TN+EFR +  G++     H+   +    YQ   
Sbjct: 57  RMIELHNGEYSQGRHSFTLGMNHFGDMTNEEFRQVMNGFQ-----HQKHKTGKM-YQEPL 110

Query: 133 MTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS 192
           +  +P S+DWR+KG VT +KNQ +CG CWAF+A  ++EG    ++GNL+ LSEQ L+DCS
Sbjct: 111 LLQLPKSVDWREKGYVTEVKNQGQCGSCWAFSATGSLEGQMFHKTGNLVSLSEQNLVDCS 170

Query: 193 -TNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPS 251
              GN GC GG  + AF Y+  N+G+  E  YPY    G C    + +AA  + + +VP 
Sbjct: 171 RPQGNQGCNGGLMDFAFQYVKDNKGLEAEKSYPYVGKDGECKYKPELSAANDTGFVDVPQ 230

Query: 252 GDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIF--NGVCGTQLDHAVTIVGFGT--TED 307
            ++       ++ P+S+AI A    FQ YKEGI+   G     L+H V +VG+GT  +E 
Sbjct: 231 REKVVQKALATVGPLSVAIDAGLQSFQFYKEGIYYDPGCSSRDLNHGVLLVGYGTDASET 290

Query: 308 G-ANYWLIKNSWGNTWGDAGYMKIVRDE-GLCGIGTRSSYPL 347
           G  +YWLIKNSWG TWG  GY+KI R+    CG+ T +SYPL
Sbjct: 291 GKGDYWLIKNSWGTTWGADGYVKIARNRNNHCGVATAASYPL 332


>gi|340727787|ref|XP_003402217.1| PREDICTED: cathepsin L-like [Bombus terrestris]
          Length = 343

 Score =  240 bits (612), Expect = 8e-61,   Method: Compositional matrix adjust.
 Identities = 133/345 (38%), Positives = 199/345 (57%), Gaps = 24/345 (6%)

Query: 18  MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMA---QHGRSYKDELEKEMRLKIFKEN 74
           +F+ + + V   +Q +S           ++++W     +H + YK+++E+  R+KIF +N
Sbjct: 3   LFLFLIVAVLATAQAISFFEL-------VNQEWTTFKMEHNKVYKNDVEERFRMKIFMDN 55

Query: 75  LEYIEKANKEGNR-----TYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQ 129
              I K N  GN      +YKL  N++ D+ + EF     G+     +   +        
Sbjct: 56  KHKIAKHN--GNYEMKKVSYKLKMNKYGDMLHHEFVNTLNGFNKSINTQLRSERLPIAAS 113

Query: 130 NLSMTDV--PTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQ 187
            +   +V  P ++DWR+ GAVTP+K+Q  CG CW+F+A  A+EG    R+G LI LSEQ 
Sbjct: 114 FIEPANVVLPKTVDWREHGAVTPVKDQGHCGSCWSFSATGALEGQHFRRTGILIPLSEQN 173

Query: 188 LLDCSTN-GNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNY 246
           L+DCS   GNNGC GG  ++AF YI  N+G+ TE  YPY+A    C      + A+   Y
Sbjct: 174 LIDCSGKYGNNGCNGGLMDQAFQYIKDNKGLDTEVTYPYEAENDKCRYNAANSGARDVGY 233

Query: 247 EEVPSGDEQALLKAV-SMQPVSIAIAAYSTEFQSYKEGI-FNGVCGTQ-LDHAVTIVGFG 303
            ++P G+E+ L  AV ++ PVS+AI A    FQ Y EG+ +   C ++ LDH V  VG+G
Sbjct: 234 VDIPQGNEKKLKAAVATIGPVSVAIDASHQSFQFYSEGVYYEPECSSENLDHGVLAVGYG 293

Query: 304 TTEDGANYWLIKNSWGNTWGDAGYMKIVRDE-GLCGIGTRSSYPL 347
           T E+G +YWL+KNSWG TWGD GY+K+ R++   CGI + +SYPL
Sbjct: 294 TDENGQDYWLVKNSWGETWGDNGYIKMARNKLNHCGIASTASYPL 338


>gi|306992173|gb|ADN19567.1| cathepsin L-like proteinase [Spodoptera frugiperda]
          Length = 344

 Score =  240 bits (612), Expect = 8e-61,   Method: Compositional matrix adjust.
 Identities = 128/323 (39%), Positives = 187/323 (57%), Gaps = 23/323 (7%)

Query: 46  IHEKWMA---QHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNR---TYKLGTNQFSDL 99
           + E+W A   +H + Y  E+E + R+KI+ EN   I K N+   +   +YKL  N+++D+
Sbjct: 23  VREEWNAFKMEHSKQYDSEVEDKFRMKIYVENKHRIAKHNQRFEQRLVSYKLKPNKYADM 82

Query: 100 TNDEFRALYTGY----------KMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVT 149
            + EF     G+          K      R   ++TF     +    P  +DWR KGAVT
Sbjct: 83  LHHEFVHTMNGFNKTAKHGGRNKAVHSKGRDGRAATFIAP--AHVSYPDHVDWRKKGAVT 140

Query: 150 PIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GNNGCLGGSREKAF 208
            +K+Q +CG CWAF+   A+EG    ++G L+ LSEQ L+DCS   GNNGC GG  + AF
Sbjct: 141 DVKDQGKCGSCWAFSTTGALEGQHFRKTGYLVSLSEQNLVDCSAAYGNNGCNGGLMDNAF 200

Query: 209 AYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAV-SMQPVS 267
            YI  N GI TE  YPY+AV   C    K + A    + ++P GDE+ L++AV ++ P+S
Sbjct: 201 KYIKDNGGIDTEKSYPYEAVDDKCRYNPKNSGADDVGFVDIPQGDEEKLMQAVATVGPIS 260

Query: 268 IAIAAYSTEFQSYKEGIF--NGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDA 325
           +AI A    FQ Y +G++       T LDH V +VG+GT E+G +YWL+KNSWG +WG+ 
Sbjct: 261 VAIDASQETFQFYSKGVYYDENCSSTDLDHGVMVVGYGTEEEGGDYWLVKNSWGRSWGEL 320

Query: 326 GYMKIVRDE-GLCGIGTRSSYPL 347
           GY+K+  ++   CGI + +SYPL
Sbjct: 321 GYIKMAHNKNNHCGIASSASYPL 343


>gi|288548566|gb|ADC52431.1| cathepsin L2 cysteine protease [Pinctada fucata]
          Length = 330

 Score =  240 bits (612), Expect = 8e-61,   Method: Compositional matrix adjust.
 Identities = 127/302 (42%), Positives = 189/302 (62%), Gaps = 14/302 (4%)

Query: 53  QHGRSYKDELEKEMRLKIFKENLEYIEKAN---KEGNRTYKLGTNQFSDLTNDEFRALYT 109
           Q+ + Y++E E   RL +++ NL++I   N     G  T+ +G N++ D+TN+EF     
Sbjct: 33  QYNKLYQNEEEARRRL-VWESNLDFITLHNLAADRGEHTFWVGMNEYGDMTNEEFTKTMN 91

Query: 110 GYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAV 169
           GY+M    ++++ +  F   N +M D+P ++DWR KG VTPIKNQ +CG CW+F+A  ++
Sbjct: 92  GYRM---RNKTSNAPVFMPPN-NMGDLPDTVDWRPKGYVTPIKNQGQCGSCWSFSATGSL 147

Query: 170 EGITKIRSGNLIQLSEQQLLDCS-TNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAV 228
           EG T  ++G L+ LSEQ L+DCS   GN+GC GG  + AF YI  N GI TE  YPY+A 
Sbjct: 148 EGQTFKKTGKLVSLSEQNLVDCSKKQGNHGCEGGLMDDAFTYIKANNGIDTEASYPYKAR 207

Query: 229 PGTCSAAQKPAAAKISNYEEVPSGDEQALLKAV-SMQPVSIAIAAYSTEFQSYKEGIFNG 287
            G C        A  + + ++ + DE+AL +AV ++ P+S+AI A    FQ Y+ G+++ 
Sbjct: 208 DGKCEFKSADVGATDTGFVDIKTKDEEALKQAVATVGPISVAIDASHMSFQLYRTGVYHD 267

Query: 288 -VCG-TQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSS 344
             C  T+LDH V  VG+G TED  +YWL+KNSWG +WG  GY+++ R+    CGI T +S
Sbjct: 268 WFCSQTKLDHGVLAVGYG-TEDSKDYWLVKNSWGESWGQKGYIQMSRNRRNNCGIATSAS 326

Query: 345 YP 346
           YP
Sbjct: 327 YP 328


>gi|110349473|gb|ABG73217.1| cathepsin L 1 precursor [Diaprepes abbreviatus]
          Length = 322

 Score =  240 bits (612), Expect = 8e-61,   Method: Compositional matrix adjust.
 Identities = 129/336 (38%), Positives = 204/336 (60%), Gaps = 23/336 (6%)

Query: 18  MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
           +FI   LLV+ ++ V+       Q+       +  +HG++YK+++E+  R  IFK+NL  
Sbjct: 3   VFIAACLLVAVSATVLEETGVKFQA-------FKLKHGKTYKNQVEETARFNIFKDNLRA 55

Query: 78  IEKAN---KEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMT 134
           IE+ N   ++G  +YK G N+F+D+T +EFRA  T      P H +TT        L+  
Sbjct: 56  IEQHNVLYEQGLVSYKKGINRFTDMTQEEFRAFLTLSSSKKP-HFNTTEHV-----LTGL 109

Query: 135 DVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN 194
            VP S+DWR KG VT +K+Q  CG CWAF+   + E     ++G L+ LSEQQL+DCST+
Sbjct: 110 AVPDSIDWRTKGQVTGVKDQGNCGSCWAFSVTGSTEAAYYRKAGKLVSLSEQQLVDCSTD 169

Query: 195 GNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDE 254
            N GC GG  ++ F Y ++++G+  E  YPY+   G+C  +      K+S ++ + S DE
Sbjct: 170 INAGCNGGYLDETFTY-VKSKGLEAESTYPYKGTDGSCKYSASKVVTKVSGHKSLKSEDE 228

Query: 255 QALLKAV-SMQPVSIAIAAYSTEFQSYKEGIF-NGVCG-TQLDHAVTIVGFGTTEDGANY 311
            ALL AV ++ PVS+AI A  T   SY+ GI+ +  C  ++L+H V +VG+GT+ +G  Y
Sbjct: 229 NALLDAVGNVGPVSVAIDA--TYLSSYESGIYEDDWCSPSELNHGVLVVGYGTS-NGKKY 285

Query: 312 WLIKNSWGNTWGDAGYMKIVRDEGLCGIGTRSSYPL 347
           W++KNSWG ++G++GY +++R +  CG+   + YP+
Sbjct: 286 WIVKNSWGGSFGESGYFRLLRGKNECGVAEDTVYPI 321


>gi|354622947|ref|NP_001002938.2| cathepsin S precursor [Canis lupus familiaris]
          Length = 339

 Score =  240 bits (612), Expect = 9e-61,   Method: Compositional matrix adjust.
 Identities = 138/348 (39%), Positives = 199/348 (57%), Gaps = 24/348 (6%)

Query: 8   SGSFKINTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIH-EKWMAQHGRSYKDELEKEM 66
           +GSF      M  ++ LL  C+  V      H+   ++ H   W   + + YK+E E+  
Sbjct: 5   AGSF------MKWLVGLLPLCSYAVAQ---VHKDPTLDHHWNLWKKTYSKQYKEENEEVA 55

Query: 67  RLKIFKENLEYIEKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTS 123
           R  I+++NL+++   N E   G  +Y LG N   D+T +E  +L    ++PS   R+ T 
Sbjct: 56  RRLIWEKNLKFVMLHNLEHSMGMHSYDLGMNHLGDMTGEEVISLMGSLRVPSQWQRNVT- 114

Query: 124 STFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQL 183
               Y++ S   +P S+DWR+KG VT +K Q  CG CWAF+AV A+E   K+++G L+ L
Sbjct: 115 ----YRSNSNQKLPDSVDWREKGCVTEVKYQGSCGACWAFSAVGALEAQLKLKTGKLVSL 170

Query: 184 SEQQLLDCSTN--GNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAA 241
           S Q L+DCST   GN GC GG    AF YII N GI +E  YPY+A+ G C    K  AA
Sbjct: 171 SAQNLVDCSTEKYGNKGCNGGFMTTAFQYIIDNNGIDSEASYPYKAMNGKCRYDSKKRAA 230

Query: 242 KISNYEEVPSGDEQALLKAVSMQ-PVSIAIAAYSTEFQSYKEGI-FNGVCGTQLDHAVTI 299
             S Y E+P G E AL +AV+ + PVS+AI A    F  Y+ G+ +   C   ++H V +
Sbjct: 231 TCSKYTELPFGSEDALKEAVANKGPVSVAIDASHYSFFLYRSGVYYEPSCTQNVNHGVLV 290

Query: 300 VGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRDEG-LCGIGTRSSYP 346
           VG+G   +G +YWL+KNSWG  +GD GY+++ R+ G  CGI +  SYP
Sbjct: 291 VGYGNL-NGKDYWLVKNSWGLNFGDQGYIRMARNSGNHCGIASYPSYP 337


>gi|32394728|gb|AAM96000.1| cathepsin L precursor [Metapenaeus ensis]
          Length = 322

 Score =  240 bits (612), Expect = 9e-61,   Method: Compositional matrix adjust.
 Identities = 122/308 (39%), Positives = 184/308 (59%), Gaps = 14/308 (4%)

Query: 48  EKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKAN---KEGNRTYKLGTNQFSDLTNDEF 104
           + +  Q+GR Y    E   R  +F++N ++IE  N   + G  T+ L  NQF D+T++EF
Sbjct: 20  QDFKVQYGRHYGTAREDLYRQSVFEQNQQFIEDHNAKFENGEVTFTLKMNQFGDMTSEEF 79

Query: 105 RALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFA 164
            A   G+ +  P+           + L     P  +DWR KGAVTP+K+QK+CG CWAF+
Sbjct: 80  AATMNGF-LNVPTRHPVAILEADDETL-----PKHVDWRTKGAVTPVKDQKQCGSCWAFS 133

Query: 165 AVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GNNGCLGGSREKAFAYIIQNQGIATEDEY 223
              ++EG   ++ G L+ LSEQ L+DCS   GN GC GG  ++AF YI +N+GI TE+ Y
Sbjct: 134 TTGSLEGQHFLKDGKLVSLSEQNLVDCSGKFGNMGCCGGLMDQAFKYIKENKGIDTEESY 193

Query: 224 PYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVS-MQPVSIAIAAYSTEFQSYKE 282
           PY+A  G C        A  + + ++  G+E +L+KAV+ + P+S+AI A    FQ Y +
Sbjct: 194 PYEAQDGKCRFDSSNVGATDTGFVDIAHGEENSLMKAVANIGPISVAIDASHPSFQFYHQ 253

Query: 283 GIF--NGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD-EGLCGI 339
           G++       T LDH V  +G+G T+DG  YWL+KNSW  +WGD G++++ R+ +  CGI
Sbjct: 254 GVYYEKECSSTMLDHGVLAIGYGETDDGKEYWLVKNSWNTSWGDKGFIQMSRNKKNNCGI 313

Query: 340 GTRSSYPL 347
            +++SYPL
Sbjct: 314 ASQASYPL 321


>gi|350412176|ref|XP_003489564.1| PREDICTED: cathepsin L-like [Bombus impatiens]
          Length = 343

 Score =  240 bits (612), Expect = 9e-61,   Method: Compositional matrix adjust.
 Identities = 132/346 (38%), Positives = 200/346 (57%), Gaps = 24/346 (6%)

Query: 18  MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMA---QHGRSYKDELEKEMRLKIFKEN 74
           +F+++ + +   +Q +S           ++++W     +H + YK+++E+  R+KIF +N
Sbjct: 3   LFLLLIVAILATAQAISFFEL-------VNQEWTTFKMEHNKVYKNDIEERFRMKIFMDN 55

Query: 75  LEYIEKANKEGNR-----TYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQ 129
              I K N  GN      +YKL  N++ D+ + EF     G+     +   +        
Sbjct: 56  KHKIAKHN--GNYEMKKVSYKLKMNKYGDMLHHEFVNTLNGFNKSINTQLRSERLPIGAS 113

Query: 130 NLSMTDV--PTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQ 187
            +   +V  P ++DWR+ GAVTP+K+Q  CG CW+F+A  A+EG    R+G LI LSEQ 
Sbjct: 114 FIEPANVVLPKTVDWREHGAVTPVKDQGHCGSCWSFSATGALEGQHFRRTGILIPLSEQN 173

Query: 188 LLDCSTN-GNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNY 246
           L+DCS   GNNGC GG  ++AF YI  N+G+ TE  YPY+A    C      + A+   Y
Sbjct: 174 LIDCSGKYGNNGCNGGLMDQAFQYIKDNKGLDTEVTYPYEAENDKCRYNAANSGARDVGY 233

Query: 247 EEVPSGDEQALLKAV-SMQPVSIAIAAYSTEFQSYKEGI-FNGVCGTQ-LDHAVTIVGFG 303
            ++P G+E+ L  AV ++ PVS+AI A    FQ Y EG+ +   C ++ LDH V  VG+G
Sbjct: 234 VDIPQGNEKKLKAAVATIGPVSVAIDASHQSFQFYSEGVYYEPECSSENLDHGVLAVGYG 293

Query: 304 TTEDGANYWLIKNSWGNTWGDAGYMKIVRDE-GLCGIGTRSSYPLA 348
           T E+G +YWL+KNSWG TWGD GY+K+ R++   CGI + +SYPL 
Sbjct: 294 TDENGQDYWLVKNSWGETWGDNGYIKMARNKLNHCGIASTASYPLV 339


>gi|32394730|gb|AAM96001.1| cathepsin L precursor [Metapenaeus ensis]
          Length = 306

 Score =  239 bits (611), Expect = 9e-61,   Method: Compositional matrix adjust.
 Identities = 123/310 (39%), Positives = 183/310 (59%), Gaps = 18/310 (5%)

Query: 48  EKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKAN---KEGNRTYKLGTNQFSDLTNDEF 104
           + +  Q+GR Y    E   R  +F++N ++IE  N   + G  T+ L  NQF D+T++EF
Sbjct: 4   QDFKVQYGRHYGTAREDLYRQSVFEQNQQFIEDHNAKFENGEVTFTLKMNQFGDMTSEEF 63

Query: 105 RALYTGYKMPSPSHRSTTSSTFKYQNLSMTD--VPTSLDWRDKGAVTPIKNQKECGCCWA 162
            A   G+      H            L   D  +P  +DWR KGAVTP+K+QK+CG CWA
Sbjct: 64  AATMNGFLNVPTRHPVAI--------LEADDETLPKHVDWRTKGAVTPVKDQKQCGSCWA 115

Query: 163 FAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GNNGCLGGSREKAFAYIIQNQGIATED 221
           F+   ++EG   ++ G L+ LSEQ L+DCS   GN GC GG  ++AF YI +N+GI TE+
Sbjct: 116 FSTTGSLEGQHFLKDGKLVSLSEQNLVDCSGKFGNMGCCGGLMDQAFKYIKENKGIDTEE 175

Query: 222 EYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVS-MQPVSIAIAAYSTEFQSY 280
            YPY+A  G C        A  + + ++  G+E +L+KAV+ + P+S+AI A    FQ Y
Sbjct: 176 SYPYEAQDGKCRFDSSNVGATDTGFVDIAHGEENSLMKAVANIGPISVAIDASHPSFQFY 235

Query: 281 KEGIF--NGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD-EGLC 337
            +G++       T LDH V  +G+G T+DG  YWL+KNSW  +WGD G++++ R+ +  C
Sbjct: 236 HQGVYYEKECSSTMLDHGVLAIGYGETDDGKEYWLVKNSWNTSWGDKGFIQMSRNKKNNC 295

Query: 338 GIGTRSSYPL 347
           GI +++SYPL
Sbjct: 296 GIASQASYPL 305


>gi|301767946|ref|XP_002919405.1| PREDICTED: cathepsin S-like [Ailuropoda melanoleuca]
          Length = 340

 Score =  239 bits (611), Expect = 1e-60,   Method: Compositional matrix adjust.
 Identities = 128/305 (41%), Positives = 187/305 (61%), Gaps = 14/305 (4%)

Query: 50  WMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKE---GNRTYKLGTNQFSDLTNDEFRA 106
           W   +G+ YK++ E+  R  I+++NL+++   N E   G  +Y LG N   D+T++E  +
Sbjct: 40  WKKTYGKQYKEKNEEVARRLIWEKNLKFVTLHNLEHSMGMHSYDLGMNHLGDMTSEEVIS 99

Query: 107 LYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAV 166
           L +  ++PS   R+ T     Y++ S   +P S+DWR+KG VT +K Q  CG CWAF+AV
Sbjct: 100 LMSSLRVPSQWPRNVT-----YKSNSNQKLPDSVDWREKGCVTKVKYQGACGACWAFSAV 154

Query: 167 AAVEGITKIRSGNLIQLSEQQLLDCSTN--GNNGCLGGSREKAFAYIIQNQGIATEDEYP 224
            A+E   K+++G L+ LS Q L+DCST   GN GC GG   +AF YII N GI +E  YP
Sbjct: 155 GALEAQLKLKTGKLVSLSAQNLVDCSTEKYGNKGCNGGFMTEAFQYIIDNNGIDSEASYP 214

Query: 225 YQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVSMQ-PVSIAIAAYSTEFQSYKEG 283
           Y+A  G C    K  AA  S Y E+PSG E  L +AV+ + PVS+AI A  + F  Y+ G
Sbjct: 215 YKATDGKCRYDSKNRAATCSKYTELPSGSEDDLKEAVANKGPVSVAIDARHSSFFLYRSG 274

Query: 284 I-FNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRDEG-LCGIGT 341
           + ++  C   ++H V +VG+G   +G +YWL+KNSWG  +GD GY+++ R+ G  CGI +
Sbjct: 275 VYYDPSCTQNVNHGVLVVGYGNL-NGKDYWLVKNSWGLNFGDQGYIRMARNSGNHCGIAS 333

Query: 342 RSSYP 346
             SYP
Sbjct: 334 YPSYP 338


>gi|281352890|gb|EFB28474.1| hypothetical protein PANDA_008012 [Ailuropoda melanoleuca]
          Length = 328

 Score =  239 bits (611), Expect = 1e-60,   Method: Compositional matrix adjust.
 Identities = 128/305 (41%), Positives = 187/305 (61%), Gaps = 14/305 (4%)

Query: 50  WMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKE---GNRTYKLGTNQFSDLTNDEFRA 106
           W   +G+ YK++ E+  R  I+++NL+++   N E   G  +Y LG N   D+T++E  +
Sbjct: 28  WKKTYGKQYKEKNEEVARRLIWEKNLKFVTLHNLEHSMGMHSYDLGMNHLGDMTSEEVIS 87

Query: 107 LYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAV 166
           L +  ++PS   R+ T     Y++ S   +P S+DWR+KG VT +K Q  CG CWAF+AV
Sbjct: 88  LMSSLRVPSQWPRNVT-----YKSNSNQKLPDSVDWREKGCVTKVKYQGACGACWAFSAV 142

Query: 167 AAVEGITKIRSGNLIQLSEQQLLDCSTN--GNNGCLGGSREKAFAYIIQNQGIATEDEYP 224
            A+E   K+++G L+ LS Q L+DCST   GN GC GG   +AF YII N GI +E  YP
Sbjct: 143 GALEAQLKLKTGKLVSLSAQNLVDCSTEKYGNKGCNGGFMTEAFQYIIDNNGIDSEASYP 202

Query: 225 YQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVSMQ-PVSIAIAAYSTEFQSYKEG 283
           Y+A  G C    K  AA  S Y E+PSG E  L +AV+ + PVS+AI A  + F  Y+ G
Sbjct: 203 YKATDGKCRYDSKNRAATCSKYTELPSGSEDDLKEAVANKGPVSVAIDARHSSFFLYRSG 262

Query: 284 I-FNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRDEG-LCGIGT 341
           + ++  C   ++H V +VG+G   +G +YWL+KNSWG  +GD GY+++ R+ G  CGI +
Sbjct: 263 VYYDPSCTQNVNHGVLVVGYGNL-NGKDYWLVKNSWGLNFGDQGYIRMARNSGNHCGIAS 321

Query: 342 RSSYP 346
             SYP
Sbjct: 322 YPSYP 326


>gi|290462225|gb|ADD24160.1| Cathepsin L [Lepeophtheirus salmonis]
          Length = 334

 Score =  239 bits (611), Expect = 1e-60,   Method: Compositional matrix adjust.
 Identities = 132/340 (38%), Positives = 199/340 (58%), Gaps = 21/340 (6%)

Query: 19  FIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYI 78
            +++++++S AS V     +    V+   E W   H + Y   +E+++RLKIF EN   I
Sbjct: 6   ILLLSVIISTASAV-----SFFDVVLSDWESWKLTHQKGYDSSVEEKLRLKIFMENSLRI 60

Query: 79  EKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTF-KYQNLSMT 134
            + N E   G  TY +  N + DL + EF A+  GY     ++++T   TF   +N+++ 
Sbjct: 61  SRHNAEAIQGRHTYFMKMNHYGDLLHHEFVAMVNGYIY---NNKTTLGGTFIPSKNINL- 116

Query: 135 DVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN 194
             P  +DWR++GAVTP+KNQ +CG CW+F+A  ++EG    ++G LI LSEQ L+DCS  
Sbjct: 117 --PEHVDWREEGAVTPVKNQGQCGSCWSFSATGSLEGQDFRKTGKLISLSEQNLVDCSRK 174

Query: 195 -GNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGD 253
            GNNGC GG  + AF YI  N GI TE  YPY+ + G C    K        + ++  G 
Sbjct: 175 YGNNGCEGGLMDYAFKYIQDNNGIDTEASYPYEGIDGHCHYDPKNKGGSDIGFVDIKKGS 234

Query: 254 EQALLKAV-SMQPVSIAIAAYSTEFQSYKEGIFN-GVCGTQ-LDHAVTIVGFGTTE-DGA 309
           E+ L KA+ ++ P+S+AI A    FQ Y  G+++   C  + LDH V  VG+GT E  G 
Sbjct: 235 EKDLQKALATVGPISVAIDASHMSFQFYSHGVYSEKKCSPENLDHGVLAVGYGTDEVTGE 294

Query: 310 NYWLIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYPLA 348
           +YWL+KNSW   WG+ GY+K+ R+ + +CGI + +SYP+ 
Sbjct: 295 DYWLVKNSWSEKWGEDGYIKMARNKDNMCGIASSASYPVV 334


>gi|12805315|gb|AAH02125.1| Ctss protein [Mus musculus]
          Length = 340

 Score =  239 bits (610), Expect = 1e-60,   Method: Compositional matrix adjust.
 Identities = 128/306 (41%), Positives = 186/306 (60%), Gaps = 15/306 (4%)

Query: 50  WMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKE---GNRTYKLGTNQFSDLTNDEFRA 106
           W   H + YKD+ E+E+R  I+++NL++I   N E   G  TY++G N   D+TN+E   
Sbjct: 39  WKKTHEKEYKDKNEEEVRRLIWEKNLKFIMIHNLEYSMGMHTYQVGMNDMGDMTNEEILC 98

Query: 107 LYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAV 166
                ++P  S ++ T     +++ S   +P ++DWR+KG VT +K Q  CG CWAF+AV
Sbjct: 99  RMGALRIPRQSPKTVT-----FRSYSNRTLPDTVDWREKGCVTEVKYQGSCGACWAFSAV 153

Query: 167 AAVEGITKIRSGNLIQLSEQQLLDCSTN---GNNGCLGGSREKAFAYIIQNQGIATEDEY 223
            A+EG  K+++G LI LS Q L+DCS     GN GC GG   +AF YII N GI  +  Y
Sbjct: 154 GALEGQLKLKTGKLISLSAQNLVDCSNEEKYGNKGCGGGYMTEAFQYIIDNGGIEADASY 213

Query: 224 PYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVSMQ-PVSIAIAAYSTEFQSYKE 282
           PY+A+   C    K  AA  S Y ++P GDE AL +AV+ + PVS+ I A  + F  YK 
Sbjct: 214 PYKAMDEKCHYNSKNRAATCSRYIQLPFGDEDALKEAVATKGPVSVGIDASHSSFFFYKS 273

Query: 283 GIFNG-VCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR-DEGLCGIG 340
           G+++   C   ++H V +VG+GT  DG +YWL+KNSWG  +GD GY+++ R ++  CGI 
Sbjct: 274 GVYDDPSCTGNVNHGVLVVGYGTL-DGKDYWLVKNSWGLNFGDQGYIRMARNNKNHCGIA 332

Query: 341 TRSSYP 346
           +  SYP
Sbjct: 333 SDCSYP 338


>gi|391340505|ref|XP_003744580.1| PREDICTED: digestive cysteine proteinase 1-like [Metaseiulus
           occidentalis]
          Length = 469

 Score =  239 bits (610), Expect = 1e-60,   Method: Compositional matrix adjust.
 Identities = 131/308 (42%), Positives = 188/308 (61%), Gaps = 16/308 (5%)

Query: 48  EKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKE--GNRTYKLGTNQFSDLTNDEFR 105
           E +    G++Y+ + E  +R  IF+ NL +IEK N E   +R Y LG  QF+D++  EFR
Sbjct: 167 EHFKEHFGKTYEGD-EHALRQGIFQRNLAHIEKFNAEKAASRGYTLGITQFADMSTAEFR 225

Query: 106 ALYTGYKMPSPSHRSTTSSTFKYQNLSMTD---VPTSLDWRDKGAVTPIKNQKECGCCWA 162
             Y G +M +    ST +   K Q   + D   +P ++DWRDKGAV+P+K+Q +CG CWA
Sbjct: 226 QTYLGLRMNA----STIAKLRKLQREVVADDRDLPEAVDWRDKGAVSPVKDQGQCGSCWA 281

Query: 163 FAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDE 222
           F+   A+EG   +++G L+ LSEQQ++DCS   + GC GG    A  Y+  N G+  E  
Sbjct: 282 FSTSGAIEGQHFLKNGELLSLSEQQMVDCSWL-DFGCNGGQPMLAMEYVRFNGGLELETA 340

Query: 223 YPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVS-MQPVSIAIAAYSTEFQSYK 281
           YPY+ V G+C + +K AAAKI+ +       E AL KAV+ + P+S+ + A   +FQ YK
Sbjct: 341 YPYKGVGGSCHSDKKSAAAKITGFWMAGFYSESALQKAVAKVGPISVGMDASGEDFQHYK 400

Query: 282 EGIFN--GVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRDEG-LCG 338
            GI+N        LDHAV  VG+GT++DG +YWL+KNSW  +WG+ GY K+ R++G  CG
Sbjct: 401 SGIYNPESCSSIGLDHAVLAVGYGTSDDG-DYWLVKNSWNTSWGEKGYFKLPRNKGNKCG 459

Query: 339 IGTRSSYP 346
           I T   YP
Sbjct: 460 IATTPIYP 467


>gi|291383517|ref|XP_002708299.1| PREDICTED: cathepsin L1 [Oryctolagus cuniculus]
          Length = 333

 Score =  239 bits (610), Expect = 1e-60,   Method: Compositional matrix adjust.
 Identities = 132/334 (39%), Positives = 199/334 (59%), Gaps = 19/334 (5%)

Query: 24  LLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANK 83
           LL +    + S+    +Q++     +W A H R Y    E+  R  ++++N+  IE  N 
Sbjct: 6   LLAAVCWGIASAIPKFDQNLDTQWYQWKATHKRLYGLN-EEGWRRAVWEKNMRMIELHNG 64

Query: 84  E---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSL 140
           E   G   + +G N + D+TN+EFR +  G++  +  H+        +++  +   P S+
Sbjct: 65  EYSQGKHGFTMGMNAYGDMTNEEFRQVMNGFQ--NQKHKKGK----MFRDPLLLQYPKSV 118

Query: 141 DWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS-TNGNNGC 199
           DWR+KG VTP+KNQ +CG CWAF+A  A+EG    ++G LI LSEQ L+DCS   GN GC
Sbjct: 119 DWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFQKTGKLISLSEQNLVDCSHPQGNQGC 178

Query: 200 LGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLK 259
            GG  + AF Y+  N G+ +E+ YPY+ + GTC    + + A  + + ++P G E+ALL+
Sbjct: 179 NGGLMDYAFQYVKDNSGLDSEESYPYEGMDGTCKYKPECSVANDTGFVDIP-GHEKALLR 237

Query: 260 AV-SMQPVSIAIAAYSTEFQSYKEGI-FNGVCGTQ-LDHAVTIVGF---GTTEDGANYWL 313
           AV ++ P+S AI A    FQ YK GI ++  C ++ LDH + +VG+   GT  +   YWL
Sbjct: 238 AVATVGPISAAIDAGHMSFQFYKSGIYYDPDCSSKDLDHGILVVGYGFEGTNSNATKYWL 297

Query: 314 IKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYP 346
           +KNSWG TWGD GY+KI+RD +  CGI T +SYP
Sbjct: 298 VKNSWGTTWGDEGYVKIIRDKDNHCGIATAASYP 331


>gi|350583407|ref|XP_003481511.1| PREDICTED: cathepsin S [Sus scrofa]
          Length = 331

 Score =  239 bits (610), Expect = 1e-60,   Method: Compositional matrix adjust.
 Identities = 134/338 (39%), Positives = 198/338 (58%), Gaps = 18/338 (5%)

Query: 18  MFIIITLLVSCASQVVSSRSTHEQSVVEIH-EKWMAQHGRSYKDELEKEMRLKIFKENLE 76
           M  ++ +L+ C+S +      H    ++ H + W   +G+ YK++ E+  R  I+++NL+
Sbjct: 1   MKCLVWVLLLCSSAMAQ---LHRDPTLDRHWDLWKKTYGKQYKEKNEEVARRLIWEKNLK 57

Query: 77  YIEKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSM 133
            +   N E   G  +Y LG N   D+T++E  +L +  ++PS   R+ T  +   Q L  
Sbjct: 58  TVMLHNLEHSMGMHSYDLGMNHLGDMTSEEVISLMSCVRVPSQWPRNVTYKSNPNQKL-- 115

Query: 134 TDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCST 193
              P S+DWR+KG VT +K Q  CG CWAF+AV A+E   K+++G L+ LS Q L+DCST
Sbjct: 116 ---PDSMDWREKGCVTEVKYQGSCGSCWAFSAVGALEAQVKMKTGRLVSLSAQNLVDCST 172

Query: 194 NG--NNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPS 251
               N GC GG   +AF YII N GI +E  YPY+AV G C    K  AA  S Y E+P 
Sbjct: 173 EKYRNKGCNGGFMTEAFQYIIDNNGIDSEASYPYKAVDGKCKYDSKNRAATCSRYTELPF 232

Query: 252 GDEQALLKAVSMQ-PVSIAIAAYSTEFQSYKEGI-FNGVCGTQLDHAVTIVGFGTTEDGA 309
            DE AL +AV+ + PVS+AI A  + F  Y+ G+ ++  C   ++H V +VG+G   +G 
Sbjct: 233 ADEYALKEAVANKGPVSVAIDAKHSSFFFYRSGVYYDPSCTQNVNHGVLVVGYGNL-NGK 291

Query: 310 NYWLIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYP 346
           +YWL+KNSWG  +GD GY+++ R+ E  CGI    SYP
Sbjct: 292 DYWLVKNSWGLNFGDGGYIRMARNSENHCGIANYPSYP 329


>gi|74178074|dbj|BAE29827.1| unnamed protein product [Mus musculus]
 gi|74178231|dbj|BAE29900.1| unnamed protein product [Mus musculus]
 gi|74220784|dbj|BAE31361.1| unnamed protein product [Mus musculus]
          Length = 326

 Score =  239 bits (610), Expect = 2e-60,   Method: Compositional matrix adjust.
 Identities = 128/306 (41%), Positives = 185/306 (60%), Gaps = 15/306 (4%)

Query: 50  WMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKE---GNRTYKLGTNQFSDLTNDEFRA 106
           W   H + YKD+ E+E+R  I+++NL++I   N E   G  TY++G N   D+TN+E   
Sbjct: 25  WKKTHEKEYKDKNEEEVRRLIWEKNLKFIMIHNLEYSMGMHTYQVGMNDMGDMTNEEILC 84

Query: 107 LYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAV 166
                ++P  S ++ T     +++ S   +P ++DWR+KG VT +K Q  CG CWAF+AV
Sbjct: 85  RMGALRIPRQSPKTVT-----FRSYSNRTLPDTVDWREKGCVTEVKYQGSCGACWAFSAV 139

Query: 167 AAVEGITKIRSGNLIQLSEQQLLDCSTN---GNNGCLGGSREKAFAYIIQNQGIATEDEY 223
            A+EG  K+++G LI LS Q L+DCS     GN GC GG   +AF YII N GI  +  Y
Sbjct: 140 GALEGQLKLKTGKLISLSAQNLVDCSNEEKYGNKGCGGGYMTEAFQYIIDNGGIEADASY 199

Query: 224 PYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVSMQ-PVSIAIAAYSTEFQSYKE 282
           PY+A    C    K  AA  S Y ++P GDE AL +AV+ + PVS+ I A  + F  YK 
Sbjct: 200 PYKATDEKCHYNSKNRAATCSRYIQLPFGDEDALKEAVATKGPVSVGIDASHSSFFFYKS 259

Query: 283 GIFNG-VCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR-DEGLCGIG 340
           G+++   C   ++H V +VG+GT  DG +YWL+KNSWG  +GD GY+++ R ++  CGI 
Sbjct: 260 GVYDDPSCTGNVNHGVLVVGYGTL-DGKDYWLVKNSWGLNFGDQGYIRMARNNKNHCGIA 318

Query: 341 TRSSYP 346
           +  SYP
Sbjct: 319 SYCSYP 324


>gi|443685370|gb|ELT89004.1| hypothetical protein CAPTEDRAFT_95613, partial [Capitella teleta]
          Length = 295

 Score =  239 bits (609), Expect = 2e-60,   Method: Compositional matrix adjust.
 Identities = 127/299 (42%), Positives = 181/299 (60%), Gaps = 15/299 (5%)

Query: 61  ELEKEMRLKIFKENLEYIEKAN---KEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPS 117
           E E+  R ++F+ N++ I+  N   ++G   + +G NQFSD+   EF  +  G++M   +
Sbjct: 1   ETEENQRKEVFRNNIKKIQMHNYLHEQGKSPFTMGINQFSDMDEKEFSTIMNGFRM---N 57

Query: 118 HRSTTSSTFKYQNLSM---TDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITK 174
           +R+          +S      VP  +DWR KG VTP+KNQ +CG CWAF+A+ A+EG   
Sbjct: 58  NRTKVRDHLHSHYISPAIPVSVPAEVDWRKKGYVTPVKNQGQCGSCWAFSAIGALEGQHF 117

Query: 175 IRSGNLIQLSEQQLLDCSTN-GNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCS 233
            ++G L+ LSEQ L+DCS + GNNGC GG  + AF YI  N G  TE  YPY+AV G C 
Sbjct: 118 RKTGKLVSLSEQNLVDCSKSYGNNGCNGGVMDYAFKYIKDNDGDDTEACYPYEAVDGMCR 177

Query: 234 AAQKPAAAKISNYEEVPSGDEQALLKAVSM-QPVSIAIAAYSTEFQSYKEGIF-NGVCGT 291
             ++   A    Y ++P G+E  + +AV++  PVS+AI A  + F SYK G++    C  
Sbjct: 178 FKRECVGATCRGYTDLPWGNEVKMKEAVALVGPVSVAIDASHSSFMSYKGGVYVEKECSP 237

Query: 292 -QLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYPLA 348
            QLDH V +VG+G TE G +YWL+KNSWG TWGD GY+K+ R+    CGI + + YPL 
Sbjct: 238 YQLDHGVLVVGYG-TEQGLDYWLVKNSWGTTWGDQGYIKMARNMHNHCGIASMACYPLV 295


>gi|4731372|gb|AAD28476.1|AF133838_1 papain-like cysteine protease [Sandersonia aurantiaca]
          Length = 370

 Score =  239 bits (609), Expect = 2e-60,   Method: Compositional matrix adjust.
 Identities = 116/252 (46%), Positives = 164/252 (65%), Gaps = 6/252 (2%)

Query: 101 NDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCC 160
           N   R   T + +     R+   ++ +Y+  +   +P S+DWR+KGAV PIK+Q  CG C
Sbjct: 6   NSRPRRRTTYFGVRGAGRRTPGLASDRYRYRAGDALPDSVDWREKGAVVPIKDQGGCGSC 65

Query: 161 WAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATE 220
           WAF+ +A+VEGI KI +G+LI LSEQ+L+DC    N+GC GG  + AF +II N GI TE
Sbjct: 66  WAFSTIASVEGINKIVTGDLISLSEQELVDCDKTYNDGCNGGLMDYAFQFIIDNGGIDTE 125

Query: 221 DEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQS 279
            +YPY    G C + +K A    I++YE+VP  DEQAL KA + QP+++AI      FQ 
Sbjct: 126 KDYPYTEQDGRCDSYRKNAKVVSINSYEDVPVNDEQALKKAAASQPIAVAIDGGGRSFQL 185

Query: 280 YKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EG 335
           Y  GIF G CGT LDH VT+VG+G +E G +YW+++NSWG +WG+ GY+++ R+     G
Sbjct: 186 YNSGIFTGKCGTSLDHGVTVVGYG-SESGKDYWIVRNSWGESWGEKGYIRMARNIDSPSG 244

Query: 336 LCGIGTRSSYPL 347
           +CGI   +SYP+
Sbjct: 245 ICGIAMEASYPI 256


>gi|348514005|ref|XP_003444531.1| PREDICTED: cathepsin L1-like [Oreochromis niloticus]
          Length = 338

 Score =  239 bits (609), Expect = 2e-60,   Method: Compositional matrix adjust.
 Identities = 131/311 (42%), Positives = 185/311 (59%), Gaps = 17/311 (5%)

Query: 50  WMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKE---GNRTYKLGTNQFSDLTNDEFRA 106
           W + H + Y  E E+  R  ++++NL+ IE  N +   G  TY+LG N F D+TN+EFR 
Sbjct: 33  WKSWHTKKYH-EKEEGWRRMVWEKNLKKIELHNLDHSMGKHTYRLGMNHFGDMTNEEFRQ 91

Query: 107 LYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAV 166
           L  GYK    + R    S F   N    + P SLDWRDKG VTP+K+Q +CG CWAF+A 
Sbjct: 92  LMNGYK--HKAERKVKGSLFLEPNF--LEAPRSLDWRDKGYVTPVKDQGQCGSCWAFSAT 147

Query: 167 AAVEGITKIRSGNLIQLSEQQLLDCST-NGNNGCLGGSREKAFAYIIQNQGIATEDEYPY 225
            A+EG    ++G ++QLSEQ L++CS   GN GC GG  ++AF Y+  NQG+ +E+ YPY
Sbjct: 148 GALEGQQFRKTGKMVQLSEQNLVECSRPEGNEGCNGGLMDQAFQYVKDNQGLDSEESYPY 207

Query: 226 QAVPG-TCSAAQKPAAAKISNYEEVPSGDEQALLKAVS-MQPVSIAIAAYSTEFQSYKEG 283
                  C    +  A   + + ++ SG E AL+KAV+ + P+S+AI A    FQ Y+ G
Sbjct: 208 LGTDDQKCHYDPRYNAVNDTGFVDIKSGSEHALMKAVTAVGPISVAIDAGHESFQFYQSG 267

Query: 284 I-FNGVCGT-QLDHAVTIVGF---GTTEDGANYWLIKNSWGNTWGDAGYMKIVRD-EGLC 337
           I +   C + +LDH V +VG+   G   DG  YW++KNSW   WGD GY+ + +D +  C
Sbjct: 268 IYYEPECSSEELDHGVLLVGYGFEGEDVDGKKYWIVKNSWSEKWGDKGYVYMAKDRQNHC 327

Query: 338 GIGTRSSYPLA 348
           GI T +SYPL 
Sbjct: 328 GIATAASYPLV 338


>gi|221117518|ref|XP_002157675.1| PREDICTED: cathepsin L-like [Hydra magnipapillata]
          Length = 340

 Score =  239 bits (609), Expect = 2e-60,   Method: Compositional matrix adjust.
 Identities = 142/340 (41%), Positives = 202/340 (59%), Gaps = 15/340 (4%)

Query: 20  IIITLLVSCASQVVSSRSTHEQSVVEIHE--KWMAQHGRSYKDELEKEMRLKIFKENLEY 77
           I+I +LV   S  + S   + Q+ ++  E  ++  + G+ Y   +E+  +   +K N E 
Sbjct: 5   ILIGILVQSYSFELQSFLNNSQTPMKDPEWRRFKIKFGKFYSSNIEETSKYLNWKINNEK 64

Query: 78  IEKANKEGNRTYKLGTNQFSDLTNDEFRALYTG-YKMPSPSHRSTTSSTFKYQNLSMTDV 136
           I+  N E NR +K+G NQFSDLT++EF  +Y G +K+P      T  STF     S  ++
Sbjct: 65  IKNHNSE-NRFFKIGMNQFSDLTHEEFIKIYGGCFKLPKSFINITKGSTF--LPPSNVNI 121

Query: 137 PTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN-G 195
           P  +DWR KG V P+KNQ +CG CWAF+   A+EG T  ++G L  LSEQ L+DC+ + G
Sbjct: 122 PDEVDWRTKGYVNPVKNQGQCGSCWAFSTTGALEGQTFRKTGVLPDLSEQNLVDCTQSYG 181

Query: 196 NNGCLGGSREKAFAYIIQNQGIATEDEYPYQA-VPGTCSAAQKPAAAKISNYEEVPSGDE 254
           N  C GG  + AF YI  N+GI +E  YPY A   G C   Q+   A  + + ++ SGDE
Sbjct: 182 NEACNGGWMDNAFKYISDNKGIDSEAGYPYYAKALGYCYYNQQFNVASDTGFVDIASGDE 241

Query: 255 QALLKAV-SMQPVSIAIAAYSTEFQSYKEGI-FNGVCGT---QLDHAVTIVGFGTTEDGA 309
            AL  AV ++ P+S+AI A    F  Y+ G+ +   CG     LDHAV +VG+G TEDG 
Sbjct: 242 DALKVAVATVGPISVAIDATKDSFMRYQSGVYYEPTCGNGLENLDHAVLVVGYG-TEDGR 300

Query: 310 NYWLIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYPLA 348
           ++WL+KNSW  TWGD GY+K+ R+    CGI T++SYPL 
Sbjct: 301 DFWLVKNSWDITWGDQGYIKMSRNMSNQCGIATKASYPLV 340


>gi|392306967|ref|NP_067256.3| cathepsin S isoform 2 preproprotein [Mus musculus]
 gi|26390492|dbj|BAC25906.1| unnamed protein product [Mus musculus]
 gi|148706872|gb|EDL38819.1| cathepsin S [Mus musculus]
          Length = 342

 Score =  239 bits (609), Expect = 2e-60,   Method: Compositional matrix adjust.
 Identities = 128/306 (41%), Positives = 185/306 (60%), Gaps = 15/306 (4%)

Query: 50  WMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKE---GNRTYKLGTNQFSDLTNDEFRA 106
           W   H + YKD+ E+E+R  I+++NL++I   N E   G  TY++G N   D+TN+E   
Sbjct: 41  WKKTHEKEYKDKNEEEVRRLIWEKNLKFIMIHNLEYSMGMHTYQVGMNDMGDMTNEEILC 100

Query: 107 LYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAV 166
                ++P  S ++ T     +++ S   +P ++DWR+KG VT +K Q  CG CWAF+AV
Sbjct: 101 RMGALRIPRQSPKTVT-----FRSYSNRTLPDTVDWREKGCVTEVKYQGSCGACWAFSAV 155

Query: 167 AAVEGITKIRSGNLIQLSEQQLLDCSTN---GNNGCLGGSREKAFAYIIQNQGIATEDEY 223
            A+EG  K+++G LI LS Q L+DCS     GN GC GG   +AF YII N GI  +  Y
Sbjct: 156 GALEGQLKLKTGKLISLSAQNLVDCSNEEKYGNKGCGGGYMTEAFQYIIDNGGIEADASY 215

Query: 224 PYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVSMQ-PVSIAIAAYSTEFQSYKE 282
           PY+A    C    K  AA  S Y ++P GDE AL +AV+ + PVS+ I A  + F  YK 
Sbjct: 216 PYKATDEKCHYNSKNRAATCSRYIQLPFGDEDALKEAVATKGPVSVGIDASHSSFFFYKS 275

Query: 283 GIFNG-VCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR-DEGLCGIG 340
           G+++   C   ++H V +VG+GT  DG +YWL+KNSWG  +GD GY+++ R ++  CGI 
Sbjct: 276 GVYDDPSCTGNVNHGVLVVGYGTL-DGKDYWLVKNSWGLNFGDQGYIRMARNNKNHCGIA 334

Query: 341 TRSSYP 346
           +  SYP
Sbjct: 335 SYCSYP 340


>gi|390608645|ref|NP_001254624.1| cathepsin S isoform 1 preproprotein [Mus musculus]
 gi|74214026|dbj|BAE29430.1| unnamed protein product [Mus musculus]
          Length = 343

 Score =  239 bits (609), Expect = 2e-60,   Method: Compositional matrix adjust.
 Identities = 128/306 (41%), Positives = 185/306 (60%), Gaps = 15/306 (4%)

Query: 50  WMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKE---GNRTYKLGTNQFSDLTNDEFRA 106
           W   H + YKD+ E+E+R  I+++NL++I   N E   G  TY++G N   D+TN+E   
Sbjct: 42  WKKTHEKEYKDKNEEEVRRLIWEKNLKFIMIHNLEYSMGMHTYQVGMNDMGDMTNEEILC 101

Query: 107 LYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAV 166
                ++P  S ++ T     +++ S   +P ++DWR+KG VT +K Q  CG CWAF+AV
Sbjct: 102 RMGALRIPRQSPKTVT-----FRSYSNRTLPDTVDWREKGCVTEVKYQGSCGACWAFSAV 156

Query: 167 AAVEGITKIRSGNLIQLSEQQLLDCSTN---GNNGCLGGSREKAFAYIIQNQGIATEDEY 223
            A+EG  K+++G LI LS Q L+DCS     GN GC GG   +AF YII N GI  +  Y
Sbjct: 157 GALEGQLKLKTGKLISLSAQNLVDCSNEEKYGNKGCGGGYMTEAFQYIIDNGGIEADASY 216

Query: 224 PYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVSMQ-PVSIAIAAYSTEFQSYKE 282
           PY+A    C    K  AA  S Y ++P GDE AL +AV+ + PVS+ I A  + F  YK 
Sbjct: 217 PYKATDEKCHYNSKNRAATCSRYIQLPFGDEDALKEAVATKGPVSVGIDASHSSFFFYKS 276

Query: 283 GIFNG-VCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR-DEGLCGIG 340
           G+++   C   ++H V +VG+GT  DG +YWL+KNSWG  +GD GY+++ R ++  CGI 
Sbjct: 277 GVYDDPSCTGNVNHGVLVVGYGTL-DGKDYWLVKNSWGLNFGDQGYIRMARNNKNHCGIA 335

Query: 341 TRSSYP 346
           +  SYP
Sbjct: 336 SYCSYP 341


>gi|198427748|ref|XP_002130282.1| PREDICTED: similar to predicted protein [Ciona intestinalis]
          Length = 340

 Score =  239 bits (609), Expect = 2e-60,   Method: Compositional matrix adjust.
 Identities = 131/343 (38%), Positives = 199/343 (58%), Gaps = 19/343 (5%)

Query: 18  MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
           + + + L   C S +        Q  V + + W     + Y+   E+E ++  +  N   
Sbjct: 5   VLLAVVLFAGCCSAM-----QLNQQHVSLFQTWKNLWKKVYQTVEEEEQKMATWFNNWNK 59

Query: 78  IEKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNL--- 131
           I + N +     ++Y+L  N++ DLT++EF ++  GY+      R +T  +  Y NL   
Sbjct: 60  ISEHNMQYSLKQKSYRLEMNEYGDLTSEEFSSMMNGYRNDIRLKRKSTGGS-TYLNLLSF 118

Query: 132 -SMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLD 190
            S   +PT +DWR  G VTP+KNQ +CG CW+F+A  ++EG  K ++G L+ LSEQ L+D
Sbjct: 119 GSQIQLPTLVDWRKHGLVTPVKNQGQCGSCWSFSATGSLEGQHKKKTGKLVSLSEQNLID 178

Query: 191 CST-NGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEV 249
           CST  GN+GC GG  ++AF YI    GI TE  YPY+A   TC      + A  + + ++
Sbjct: 179 CSTPEGNDGCNGGLMDQAFKYIKIQGGIDTEAYYPYEAKDDTCRFNITDSGATDTGFVDI 238

Query: 250 PSGDEQALLK-AVSMQPVSIAIAAYSTEFQSYKEGIFN--GVCGTQLDHAVTIVGFGTTE 306
            SGDE+ L + A ++ P+S+AI A  T FQ Y  G+++      T LDH V +VG+G TE
Sbjct: 239 KSGDEEMLKEAAATVGPISVAIDASHTSFQFYSNGVYSETACSSTMLDHGVLVVGYG-TE 297

Query: 307 DGANYWLIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYPLA 348
           +G +YWL+KNSWG  WG+AGY+K+ R+ +  CGI T++SYPL 
Sbjct: 298 NGKDYWLVKNSWGEGWGEAGYIKMSRNADNQCGIATQASYPLV 340


>gi|47086859|ref|NP_997749.1| cathepsin L, 1 a precursor [Danio rerio]
 gi|42542930|gb|AAH66490.1| Cathepsin L1, a [Danio rerio]
          Length = 337

 Score =  239 bits (609), Expect = 2e-60,   Method: Compositional matrix adjust.
 Identities = 136/342 (39%), Positives = 195/342 (57%), Gaps = 18/342 (5%)

Query: 18  MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
           M + +     C S V ++  T +Q + +  ++W   H + Y    E+  R  I+++NL+ 
Sbjct: 1   MRVFLAAFTLCLSAVFAA-PTLDQQLNDHWDQWKKWHSKKYH-ATEEGWRRVIWEKNLKK 58

Query: 78  IEKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMT 134
           IE  N E   G  TY+LG N F D+T++EFR +  G+K      R    S F   N    
Sbjct: 59  IEMHNLEHSMGIHTYRLGMNHFGDMTHEEFRQVMNGFKHKKD--RRFRGSLFMEPNF--I 114

Query: 135 DVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCST- 193
           +VP  LDWR+KG VTP+K+Q ECG CWAF+   A+EG    ++G L+ LSEQ L+DCS  
Sbjct: 115 EVPNKLDWREKGYVTPVKDQGECGSCWAFSTTGALEGQMFRKTGKLVSLSEQNLVDCSRP 174

Query: 194 NGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGT-CSAAQKPAAAKISNYEEVPSG 252
            GN GC GG  ++AF Y+    G+ +E+ YPY       C    K +AA  + + ++PSG
Sbjct: 175 EGNEGCNGGLMDQAFQYVKDQNGLDSEESYPYLGTDDQPCHFDPKNSAANDTGFVDIPSG 234

Query: 253 DEQALLKAV-SMQPVSIAIAAYSTEFQSYKEGI-FNGVCGT-QLDHAVTIVGF---GTTE 306
            E+AL+KA+ ++ PVS+AI A    FQ Y+ GI +   C + +LDH V  VG+   G   
Sbjct: 235 KERALMKAIAAVGPVSVAIDAGHESFQFYQSGIYYEKECSSEELDHGVLAVGYGFEGEDV 294

Query: 307 DGANYWLIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYPL 347
           DG  YW++KNSW   WGD GY+ + +D    CGI T +SYPL
Sbjct: 295 DGKKYWIVKNSWSENWGDKGYIYMAKDRHNHCGIATAASYPL 336


>gi|226443040|ref|NP_001140018.1| Cathepsin L1 precursor [Salmo salar]
 gi|221221188|gb|ACM09255.1| Cathepsin L1 precursor [Salmo salar]
          Length = 338

 Score =  239 bits (609), Expect = 2e-60,   Method: Compositional matrix adjust.
 Identities = 138/343 (40%), Positives = 194/343 (56%), Gaps = 24/343 (6%)

Query: 20  IIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIE 79
           + + +LV C S V ++     Q     H  W   H +SY  E E+  R  ++++NL+ IE
Sbjct: 4   LYLAVLVLCVSAVCAAPRFDSQLEDHWH-LWKNWHSKSYH-ESEEGWRRMVWEKNLKKIE 61

Query: 80  KANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFK---YQNLSM 133
             N E   G  +Y+LG N F D+TN+EFR    GYK        TT   FK   +   + 
Sbjct: 62  MHNLEHTMGKHSYRLGMNHFGDMTNEEFRQTMNGYK-------QTTERKFKGSLFMEPNY 114

Query: 134 TDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS- 192
              P ++DWR+KG VTP+K+Q  CG CWAF+   A+EG    ++G L+ LSEQ L+DCS 
Sbjct: 115 LQAPKAVDWREKGYVTPVKDQGSCGSCWAFSTTGAMEGQQFRKTGKLVSLSEQNLVDCSR 174

Query: 193 TNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAV-PGTCSAAQKPAAAKISNYEEVPS 251
             GN GC GG  ++AF YI  N G+ TE+ YPY       C    + + A  + + ++PS
Sbjct: 175 PEGNEGCNGGLMDQAFQYIQDNAGLDTEESYPYVGTDEDPCHYKPEFSGANETGFVDIPS 234

Query: 252 GDEQALLKAV-SMQPVSIAIAAYSTEFQSYKEGI-FNGVCGT-QLDHAVTIVGF---GTT 305
           G E A++KAV ++ PVS+AI A    FQ Y+ GI +   C + +LDH V +VG+   G  
Sbjct: 235 GKEHAMMKAVAAVGPVSVAIDAGHESFQFYESGIYYEKECSSEELDHGVLVVGYGFEGED 294

Query: 306 EDGANYWLIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYPL 347
            DG  YW++KNSW   WGD GY+ + +D +  CGI T SSYPL
Sbjct: 295 VDGKKYWIVKNSWSEKWGDKGYIYMAKDRKNHCGIATASSYPL 337


>gi|348546019|ref|XP_003460476.1| PREDICTED: cathepsin L-like [Oreochromis niloticus]
 gi|348546143|ref|XP_003460538.1| PREDICTED: cathepsin L-like [Oreochromis niloticus]
          Length = 334

 Score =  238 bits (608), Expect = 2e-60,   Method: Compositional matrix adjust.
 Identities = 133/313 (42%), Positives = 189/313 (60%), Gaps = 12/313 (3%)

Query: 44  VEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKAN---KEGNRTYKLGTNQFSDLT 100
           +E H  W  +  RSY    E+  R +I+  N +++   N    +G ++Y+LG   F+D+ 
Sbjct: 24  LEFH-AWKLKFERSYHSPSEEAHRRQIWLNNRKFVLVHNILADQGLKSYRLGMTYFADME 82

Query: 101 NDEF-RALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGC 159
           N+E+ R +  G      +      STF ++    TD+P ++DWRDKG VT +K+QK+CG 
Sbjct: 83  NEEYKRVISQGCLHSFNASLPRRGSTF-FRLPEGTDLPDAVDWRDKGYVTDVKDQKQCGS 141

Query: 160 CWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GNNGCLGGSREKAFAYIIQNQGIA 218
           CWAF+A  ++EG    ++G L+ LSEQQL+DCS + GN GC+GG  + AF YI  N GI 
Sbjct: 142 CWAFSATGSLEGQHFRKTGTLVSLSEQQLVDCSGDYGNMGCMGGLMDYAFQYIQANGGID 201

Query: 219 TEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAV-SMQPVSIAIAAYSTEF 277
           TE+ YPY+A  G C        A  + Y EV  GDE AL +AV ++ P+S+ I A    F
Sbjct: 202 TEESYPYEAENGKCRYNPDNIGATSTGYTEVSQGDEDALKEAVATIGPISVGIDASQMSF 261

Query: 278 QSYKEGIFN--GVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRDE- 334
           Q Y+ G++N       +LDH V  VG+G TEDG +YWL+KNSWG  WGD GY+K+ R++ 
Sbjct: 262 QFYESGVYNEPDCSSLELDHGVLAVGYG-TEDGNDYWLVKNSWGLEWGDKGYIKMSRNKS 320

Query: 335 GLCGIGTRSSYPL 347
             CGI T +SYPL
Sbjct: 321 NQCGIATAASYPL 333


>gi|341940310|sp|O70370.2|CATS_MOUSE RecName: Full=Cathepsin S; Flags: Precursor
          Length = 340

 Score =  238 bits (608), Expect = 2e-60,   Method: Compositional matrix adjust.
 Identities = 128/306 (41%), Positives = 185/306 (60%), Gaps = 15/306 (4%)

Query: 50  WMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKE---GNRTYKLGTNQFSDLTNDEFRA 106
           W   H + YKD+ E+E+R  I+++NL++I   N E   G  TY++G N   D+TN+E   
Sbjct: 39  WKKTHEKEYKDKNEEEVRRLIWEKNLKFIMIHNLEYSMGMHTYQVGMNDMGDMTNEEILC 98

Query: 107 LYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAV 166
                ++P  S ++ T     +++ S   +P ++DWR+KG VT +K Q  CG CWAF+AV
Sbjct: 99  RMGALRIPRQSPKTVT-----FRSYSNRTLPDTVDWREKGCVTEVKYQGSCGACWAFSAV 153

Query: 167 AAVEGITKIRSGNLIQLSEQQLLDCSTN---GNNGCLGGSREKAFAYIIQNQGIATEDEY 223
            A+EG  K+++G LI LS Q L+DCS     GN GC GG   +AF YII N GI  +  Y
Sbjct: 154 GALEGQLKLKTGKLISLSAQNLVDCSNEEKYGNKGCGGGYMTEAFQYIIDNGGIEADASY 213

Query: 224 PYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVSMQ-PVSIAIAAYSTEFQSYKE 282
           PY+A    C    K  AA  S Y ++P GDE AL +AV+ + PVS+ I A  + F  YK 
Sbjct: 214 PYKATDEKCHYNSKNRAATCSRYIQLPFGDEDALKEAVATKGPVSVGIDASHSSFFFYKS 273

Query: 283 GIFNG-VCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR-DEGLCGIG 340
           G+++   C   ++H V +VG+GT  DG +YWL+KNSWG  +GD GY+++ R ++  CGI 
Sbjct: 274 GVYDDPSCTGNVNHGVLVVGYGTL-DGKDYWLVKNSWGLNFGDQGYIRMARNNKNHCGIA 332

Query: 341 TRSSYP 346
           +  SYP
Sbjct: 333 SYCSYP 338


>gi|296189340|ref|XP_002742739.1| PREDICTED: cathepsin L1 [Callithrix jacchus]
          Length = 333

 Score =  238 bits (608), Expect = 2e-60,   Method: Compositional matrix adjust.
 Identities = 133/343 (38%), Positives = 199/343 (58%), Gaps = 25/343 (7%)

Query: 16  TPMFIIITLLVSCASQVVS-SRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKEN 74
            P  I+    +  AS  ++  RS   Q +     KW A H R Y    E+E R  ++++N
Sbjct: 2   NPTLILTAFCLGLASSALTFDRSLEAQWI-----KWKAMHNRLYGMN-EEEWRRAVWEKN 55

Query: 75  LEYIEKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNL 131
           ++ IE  N E   G  ++ +  N F D+TN+EFR +  G++   P +         +Q  
Sbjct: 56  MKMIELHNHEYNQGKHSFTMAMNAFGDMTNEEFRQVMNGFQNRKPRNGKV------FQEP 109

Query: 132 SMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDC 191
              + P S+DWR+KG VTP+KNQ +CG CWAF+A  A+EG    ++G L+ LSEQ L+DC
Sbjct: 110 LFHEAPRSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDC 169

Query: 192 S-TNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVP 250
           S   GN GC GG  + AF Y+ +N G+ +E+ YPY+A   +C    + + A  + + ++P
Sbjct: 170 SGPQGNQGCDGGLMDYAFQYVQENGGLDSEESYPYEATEESCKYNPEYSVANDTGFVDIP 229

Query: 251 SGDEQALLKAV-SMQPVSIAIAAYSTEFQSYKEGI-FNGVCGTQ-LDHAVTIVGFG---T 304
              E+AL+KAV ++ P+S+AI A    FQ YKEGI F   C ++ +DH V +VG+G   T
Sbjct: 230 K-LEKALMKAVATVGPISVAIDAGHESFQFYKEGIYFEPECSSEDMDHGVLVVGYGFERT 288

Query: 305 TEDGANYWLIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYP 346
             D + YWL+KNSWG  WG  GY+K+ +D +  CGI + +SYP
Sbjct: 289 GSDNSKYWLVKNSWGEKWGMDGYIKMAKDRKNHCGIASAASYP 331


>gi|334332718|ref|XP_001367502.2| PREDICTED: cathepsin L1-like [Monodelphis domestica]
          Length = 333

 Score =  238 bits (608), Expect = 2e-60,   Method: Compositional matrix adjust.
 Identities = 124/307 (40%), Positives = 193/307 (62%), Gaps = 13/307 (4%)

Query: 48  EKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKE---GNRTYKLGTNQFSDLTNDEF 104
            +W AQHG+SY+   E  +R   +++NL+ IE+ N+E   G  +++L  N+F D++ +EF
Sbjct: 30  HQWKAQHGKSYEAN-EDSLRRATWEKNLKMIERHNQEYSAGKHSFQLRMNKFGDMSTEEF 88

Query: 105 RALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFA 164
           + +  GYK  + S R T  S ++   L+   +P S+DWR+KG VTP+K Q +CG CW+F+
Sbjct: 89  KQVMNGYK-SNGSQRRTKGSLYRESLLAQ--LPESVDWREKGYVTPVKEQGDCGACWSFS 145

Query: 165 AVAAVEGITKIRSGNLIQLSEQQLLDCST-NGNNGCLGGSREKAFAYIIQNQGIATEDEY 223
           AV A+EG    ++G L+ LS Q L+DC+   GNNGC GG  + AF Y+  N GI TE+ Y
Sbjct: 146 AVGAIEGQWFRKTGKLVSLSIQNLIDCTIPEGNNGCDGGFMDNAFQYVQDNGGIDTEECY 205

Query: 224 PYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAV-SMQPVSIAIAAYSTEFQSYKE 282
           PY A    C    + + A I+ + ++PS DE+AL++AV ++ P+S+ I + +  F+ Y+ 
Sbjct: 206 PYVAQDTECKYKPECSGANITGFVDIPSMDERALMEAVATVGPISVGIDSANPSFKFYQS 265

Query: 283 GIF--NGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD-EGLCGI 339
           G++       +QLDH V +VG+G+      YW++KNSWG  WGD GY+ + +D +  CGI
Sbjct: 266 GVYYEPDCSSSQLDHGVLVVGYGSIGKD-EYWIVKNSWGEAWGDNGYILMAKDKDNHCGI 324

Query: 340 GTRSSYP 346
            T +SYP
Sbjct: 325 ATEASYP 331


>gi|118119|sp|P13277.2|CYSP1_HOMAM RecName: Full=Digestive cysteine proteinase 1; Flags: Precursor
 gi|11051|emb|CAA45127.1| cysteine proteinase preproenzyme [Homarus americanus]
 gi|228243|prf||1801240A Cys protease 1
          Length = 322

 Score =  238 bits (608), Expect = 3e-60,   Method: Compositional matrix adjust.
 Identities = 129/309 (41%), Positives = 181/309 (58%), Gaps = 19/309 (6%)

Query: 48  EKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKE---GNRTYKLGTNQFSDLTNDEF 104
           E++  + GR Y D  E+  RL +F +NL+YIE+ NK+   G  TY L  NQFSD+TN++F
Sbjct: 21  EEFKGKFGRKYVDLEEERYRLNVFLDNLQYIEEFNKKYERGEVTYNLAINQFSDMTNEKF 80

Query: 105 RALYTGYKM-PSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAF 163
            A+  GYK  P P+   T++              T +DWR KGAVTP+K+Q +CG CWAF
Sbjct: 81  NAVMKGYKKGPRPAAVFTSTDA--------APESTEVDWRTKGAVTPVKDQGQCGSCWAF 132

Query: 164 AAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG--NNGCLGGSREKAFAYIIQNQGIATED 221
           +    +EG   +++G L+ LSEQQL+DC+     N GC GG  E+A  Y+  N G+ TE 
Sbjct: 133 STTGGIEGQHFLKTGRLVSLSEQQLVDCAGGSYYNQGCNGGWVERAIMYVRDNGGVDTES 192

Query: 222 EYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAV-SMQPVSIAIAAYSTEFQSY 280
            YPY+A   TC        A  + Y  +  G E AL  A   + P+S+AI A    FQSY
Sbjct: 193 SYPYEARDNTCRFNSNTIGATCTGYVGIAQGSESALKTATRDIGPISVAIDASHRSFQSY 252

Query: 281 KEGIF--NGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRDE-GLC 337
             G++       +QLDHAV  VG+G +E G ++WL+KNSW  +WG++GY+K+ R+    C
Sbjct: 253 YTGVYYEPSCSSSQLDHAVLAVGYG-SEGGQDFWLVKNSWATSWGESGYIKMARNRNNNC 311

Query: 338 GIGTRSSYP 346
           GI T + YP
Sbjct: 312 GIATDACYP 320


>gi|449275508|gb|EMC84350.1| Cathepsin L1, partial [Columba livia]
          Length = 319

 Score =  238 bits (607), Expect = 3e-60,   Method: Compositional matrix adjust.
 Identities = 133/310 (42%), Positives = 186/310 (60%), Gaps = 16/310 (5%)

Query: 50  WMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKE---GNRTYKLGTNQFSDLTNDEFRA 106
           W + H + Y  E E+  R  ++++NL+ IE  N +   G  +YKLG NQF D+T +EFR 
Sbjct: 13  WKSWHNKDYH-EREESWRRVVWEKNLKMIELHNLDHTLGKHSYKLGMNQFGDMTTEEFRQ 71

Query: 107 LYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAV 166
           L  GY     S R    S F     S  + P S+DWR+KG VTP+K+Q +CG CWAF+  
Sbjct: 72  LMNGYAHKK-SERKYRGSQF--LEPSFLEAPRSVDWREKGYVTPVKDQGQCGSCWAFSTT 128

Query: 167 AAVEGITKIRSGNLIQLSEQQLLDCST-NGNNGCLGGSREKAFAYIIQNQGIATEDEYPY 225
            A+EG    ++G L+ LSEQ L+DCS   GN GC GG  ++AF Y+  N GI +E+ YPY
Sbjct: 129 GALEGQHFRKTGKLVSLSEQNLVDCSRPEGNQGCNGGLMDQAFQYVQDNGGIDSEESYPY 188

Query: 226 QAVPG-TCSAAQKPAAAKISNYEEVPSGDEQALLKAV-SMQPVSIAIAAYSTEFQSYKEG 283
            A     C    +  AA  + + ++P G E+AL+KAV ++ PVS+AI A  + FQ Y+ G
Sbjct: 189 TAKDDEDCRYKAEYNAANDTGFVDIPQGHERALMKAVAAVGPVSVAIDAGHSSFQFYQSG 248

Query: 284 I-FNGVCGTQ-LDHAVTIVGF---GTTEDGANYWLIKNSWGNTWGDAGYMKIVRD-EGLC 337
           I +   C ++ LDH V +VG+   G   DG  YW++KNSWG  WGD GY+ + +D +  C
Sbjct: 249 IYYEPDCSSEDLDHGVLVVGYGFEGEDVDGKKYWIVKNSWGEKWGDKGYIYMAKDRKNHC 308

Query: 338 GIGTRSSYPL 347
           GI T +SYPL
Sbjct: 309 GIATAASYPL 318


>gi|185135439|ref|NP_001117777.1| procathepsin L precursor [Oncorhynchus mykiss]
 gi|14582899|gb|AAK69706.1|AF358668_1 procathepsin L [Oncorhynchus mykiss]
          Length = 338

 Score =  238 bits (607), Expect = 3e-60,   Method: Compositional matrix adjust.
 Identities = 138/343 (40%), Positives = 194/343 (56%), Gaps = 24/343 (6%)

Query: 20  IIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIE 79
           + + +LV C S V ++     Q     H  W   H + Y  E E+  R  ++++NL+ IE
Sbjct: 4   LYLAVLVLCVSAVCAAPRFDSQLEDHWH-LWKNWHSKHYH-ESEEGWRRMVWEKNLKKIE 61

Query: 80  KANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFK---YQNLSM 133
             N E   G  +Y+LG N F D+TN+EFR    GYK        TT   FK   +   + 
Sbjct: 62  IHNLEHTMGKHSYRLGMNHFGDMTNEEFRQTMNGYK-------QTTERKFKGSLFMEPNY 114

Query: 134 TDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS- 192
              P ++DWR+KG VTP+K+Q  CG CWAF+   A+EG    ++G L+ LSEQ L+DCS 
Sbjct: 115 LQAPKAVDWREKGYVTPVKDQGSCGSCWAFSTTGAMEGQQFRKTGKLVSLSEQNLVDCSR 174

Query: 193 TNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAV-PGTCSAAQKPAAAKISNYEEVPS 251
             GN GC GG  ++AF YI  N G+ TE+ YPY       C    + +AA  + + ++PS
Sbjct: 175 PEGNEGCNGGLMDQAFQYIQDNAGLDTEESYPYVGTDEDPCHYKPEFSAANETGFVDIPS 234

Query: 252 GDEQALLKAV-SMQPVSIAIAAYSTEFQSYKEGI-FNGVCGT-QLDHAVTIVGF---GTT 305
           G E A++KAV ++ PVS+AI A    FQ Y+ GI +   C + +LDH V +VG+   G  
Sbjct: 235 GKEHAMMKAVAAVGPVSVAIDAGHESFQFYESGIYYEKECSSEELDHGVLVVGYGFEGED 294

Query: 306 EDGANYWLIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYPL 347
            DG  YW++KNSW   WGD GY+ + +D +  CGI T SSYPL
Sbjct: 295 VDGKKYWIVKNSWSEKWGDKGYIYMAKDRKNHCGIATASSYPL 337


>gi|118429523|gb|ABK91809.1| cathepsin L-like proteinase precursor [Clonorchis sinensis]
          Length = 373

 Score =  238 bits (607), Expect = 3e-60,   Method: Compositional matrix adjust.
 Identities = 142/342 (41%), Positives = 196/342 (57%), Gaps = 31/342 (9%)

Query: 29  ASQVVSSRSTHEQSVV---------EIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIE 79
           AS + S  S H Q V+          I + +M  + R+Y D  E E R KIF  N   I 
Sbjct: 39  ASPLTSLDSMHMQDVIGVDWNFTLSSIWKHFMTTYKRNYIDPSEHERRFKIFANNFVRIS 98

Query: 80  KANK---EGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDV 136
           K N    +G  +Y +G N+FSD T++E + L   ++    + R  +    KY  ++    
Sbjct: 99  KHNVRFIQGQVSYTMGINEFSDKTDEELKRLRC-FRGSLNASRDGS----KYITIAAPP- 152

Query: 137 PTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN-G 195
           P+ +DWR+KGAVTP+KNQ  CG CWAF+A  A+EG   + +GNL+ LSEQQL+DCS+  G
Sbjct: 153 PSEIDWRNKGAVTPVKNQGNCGSCWAFSATGAIEGQNFLATGNLVSLSEQQLVDCSSEYG 212

Query: 196 NNGCLGGSREKAFAYIIQNQGIATEDEYPY------QAVPGTCSAAQKPAAAKISNYEEV 249
           NN C GG  + AF Y+  + GI TE  YPY       A P TC    K A  +++ Y ++
Sbjct: 213 NNACNGGLMDNAFKYVKDSNGIDTEASYPYVSGETGDANP-TCRFNLKEAVVRVTGYIDL 271

Query: 250 PSGDEQALLKAVSMQ-PVSIAIAAYSTEFQSYKEGIF-NGVCGT-QLDHAVTIVGFGTTE 306
           P G    L +AV    P+S+AI A    F SYK G++ +  C +  LDH V +VG+G  E
Sbjct: 272 PRGQVSELKQAVGHYGPISVAINAGLPSFMSYKSGVYSDDQCSSDDLDHGVLLVGYG-EE 330

Query: 307 DGANYWLIKNSWGNTWGDAGYMKIVRDE-GLCGIGTRSSYPL 347
           +G  YWLIKNSWG  WG+ GY+KI+RD   LCG+ + +SYPL
Sbjct: 331 NGIPYWLIKNSWGPHWGENGYVKILRDHNNLCGVASMASYPL 372


>gi|28194647|gb|AAO33585.1|AF479267_1 cathepsin L [Mesocricetus auratus]
          Length = 333

 Score =  238 bits (607), Expect = 3e-60,   Method: Compositional matrix adjust.
 Identities = 131/310 (42%), Positives = 187/310 (60%), Gaps = 19/310 (6%)

Query: 48  EKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKAN---KEGNRTYKLGTNQFSDLTNDEF 104
            KW + H R Y D  E+E R  ++++N++ IE  N    EG   + +  N F D+TN+EF
Sbjct: 30  HKWKSTHRRLY-DTNEEEWRRAVWEKNMKMIELHNGEYSEGKHGFTMEMNAFGDMTNEEF 88

Query: 105 RALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFA 164
           R L  GYK     HR        +Q   M  +P S+DWR+KG VTP+KNQ +CG CWAF+
Sbjct: 89  RQLVNGYK--HQKHRKGKL----FQEPLMLQLPKSVDWREKGCVTPVKNQGQCGSCWAFS 142

Query: 165 AVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GNNGCLGGSREKAFAYIIQNQGIATEDEY 223
           A  A+EG   +++G L+ LSEQ L+DCS   GN GC GG  + AF Y++ N+G+ +E+ Y
Sbjct: 143 ACGALEGQMCLKTGVLVSLSEQNLVDCSRGEGNQGCNGGLMDFAFQYVLNNKGLDSEESY 202

Query: 224 PYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAV-SMQPVSIAIAAYSTEFQSYKE 282
           PY+A  GTC    + AAA  + Y ++P   E+AL+KAV ++ P+++AI A    FQ Y  
Sbjct: 203 PYEAKDGTCKYKPEFAAANDTGYVDIPQ-LEKALMKAVATVGPIAVAIDASHPSFQFYSS 261

Query: 283 GI-FNGVCGTQ-LDHAVTIVGF---GTTEDGANYWLIKNSWGNTWGDAGYMKIVRDE-GL 336
           GI F   C ++ LDH V ++G+   GT  +   YW++KNSWG  WG  G+  I +D+   
Sbjct: 262 GIYFEPNCSSKDLDHGVLVIGYGFEGTDSNKKKYWIVKNSWGTGWGMGGFFHIAKDKNNH 321

Query: 337 CGIGTRSSYP 346
           CGI T +SYP
Sbjct: 322 CGIATAASYP 331


>gi|410519429|gb|AFV73398.1| cathepsin L [Haliotis discus hannai]
          Length = 326

 Score =  238 bits (607), Expect = 3e-60,   Method: Compositional matrix adjust.
 Identities = 126/302 (41%), Positives = 185/302 (61%), Gaps = 13/302 (4%)

Query: 53  QHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNR---TYKLGTNQFSDLTNDEFRALYT 109
           +H + YKD  E+  R  +F + +EYI++ N E +R   ++++G N+++D+ N+EF  +  
Sbjct: 28  RHNKQYKDNQEEAYRKGVFMKAVEYIQQHNLEADRGVHSFRVGINEYADMPNEEFVRVMN 87

Query: 110 GYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAV 169
           GYKM     R    +     N+   D+P ++DWR KG VT +KNQ +CG CWAF++  ++
Sbjct: 88  GYKMQE--QRPKAPTYMPPSNVG--DLPATVDWRTKGYVTEVKNQGQCGSCWAFSSTGSL 143

Query: 170 EGITKIRSGNLIQLSEQQLLDCST-NGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAV 228
           EG T  +   LI LSEQ L+DCST  GN GC GG  ++AF YI  N GI TE  YPY+A 
Sbjct: 144 EGQTFKKYNKLISLSEQNLVDCSTEQGNMGCGGGLMDQAFTYIKVNDGIDTETSYPYEAA 203

Query: 229 PGTCSAAQKPAAAKISNYEEVPSGDEQALLKAV-SMQPVSIAIAAYSTEFQSYKEGIFNG 287
            G C   +    A  + Y ++ S  E  L  AV ++ P+++AI A    FQ YK G+++ 
Sbjct: 204 SGKCRFNKANVGANDTGYTDIKSKSESDLQSAVATVGPIAVAIDASHMSFQLYKSGVYHY 263

Query: 288 V-CG-TQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSS 344
           + C  T+LDH V  VG+G T+ G +YWL+KNSWG TWG  GY+ + R+ +  CGI T++S
Sbjct: 264 IFCSQTRLDHGVLAVGYG-TDSGKDYWLVKNSWGATWGQQGYIMMSRNRDNNCGIATQAS 322

Query: 345 YP 346
           YP
Sbjct: 323 YP 324


>gi|225706370|gb|ACO09031.1| Cathepsin L precursor [Osmerus mordax]
          Length = 337

 Score =  238 bits (607), Expect = 3e-60,   Method: Compositional matrix adjust.
 Identities = 134/342 (39%), Positives = 197/342 (57%), Gaps = 18/342 (5%)

Query: 18  MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
           M + + +LV C    +++     Q   E  + W + H ++Y+ E E+  R  ++++NL+ 
Sbjct: 1   MTLYLVVLVLCTGAALAAPRFDAQ-FDEHWDLWKSWHSKNYQHEKEEGWRRMVWEKNLKK 59

Query: 78  IEKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMT 134
           IE  N E   G  +Y LG N F D+TN+EFR +  GYK+     R    S F   N    
Sbjct: 60  IEMHNLEHSLGKHSYSLGMNHFGDMTNEEFRQVMNGYKL---QQRKFKGSLFLEPN--NM 114

Query: 135 DVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCST- 193
           + P  +DWR++G VTP+K+Q +CG CWAF+   A+EG    ++  L+ LSEQ L+DCS  
Sbjct: 115 EAPKQVDWREEGYVTPVKDQGQCGSCWAFSTTGAMEGQMFRKTQKLVSLSEQNLVDCSRP 174

Query: 194 NGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGT-CSAAQKPAAAKISNYEEVPSG 252
            GN GC GG  ++AF YI  N G+ +E+ YPY       C+   + +AA  + + ++PSG
Sbjct: 175 EGNEGCNGGLMDQAFQYIQDNSGLDSEEAYPYLGTDDQPCNYKAEFSAANDTGFMDIPSG 234

Query: 253 DEQALLKAV-SMQPVSIAIAAYSTEFQSYKEGI-FNGVCGT-QLDHAVTIVGF---GTTE 306
            E AL+KA+ S+ PVS+AI A    FQ Y+ GI +   C + +LDH V  VG+   G   
Sbjct: 235 KEHALMKAIASVGPVSVAIDAGHESFQFYQSGIYYEKECSSEELDHGVLAVGYGFEGEDV 294

Query: 307 DGANYWLIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYPL 347
           DG  YW++KNSW   WGD GY+ + +D +  CGI T +SYPL
Sbjct: 295 DGKKYWIVKNSWSEKWGDKGYILMAKDRKNHCGIATAASYPL 336


>gi|449513868|ref|XP_002191976.2| PREDICTED: cathepsin L1-like [Taeniopygia guttata]
          Length = 443

 Score =  238 bits (607), Expect = 3e-60,   Method: Compositional matrix adjust.
 Identities = 133/311 (42%), Positives = 187/311 (60%), Gaps = 16/311 (5%)

Query: 50  WMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKE---GNRTYKLGTNQFSDLTNDEFRA 106
           W + H + Y  E E+  R  ++++NL+ IE  N +   G  +YKLG NQF D+T +EFR 
Sbjct: 137 WKSWHRKDYH-EREEGWRRVVWEKNLKMIEIHNLDHALGKHSYKLGMNQFGDMTTEEFRQ 195

Query: 107 LYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAV 166
           L  GY +   S R    S F   N    + P S+DWR+KG VTP+K+Q +CG CWAF+  
Sbjct: 196 LMNGY-VHKKSERKYRGSQFLEPNF--LEAPRSVDWREKGYVTPVKDQGQCGSCWAFSTT 252

Query: 167 AAVEGITKIRSGNLIQLSEQQLLDCST-NGNNGCLGGSREKAFAYIIQNQGIATEDEYPY 225
            A+EG    ++G L+ LSEQ L+DCS   GN GC GG  ++AF Y+  N GI +E+ YPY
Sbjct: 253 GALEGQHFRKTGKLVSLSEQNLVDCSRPEGNQGCNGGLMDQAFQYVQDNGGIDSEESYPY 312

Query: 226 QAVPG-TCSAAQKPAAAKISNYEEVPSGDEQALLKAV-SMQPVSIAIAAYSTEFQSYKEG 283
            A     C    +  AA  + + ++P G E+AL+KAV ++ PVS+AI A  + FQ Y+ G
Sbjct: 313 TAKDDEDCRYKAEYNAANDTGFVDIPQGHERALMKAVAAVGPVSVAIDAGHSSFQFYQSG 372

Query: 284 I-FNGVCGTQ-LDHAVTIVGF---GTTEDGANYWLIKNSWGNTWGDAGYMKIVRD-EGLC 337
           I +   C ++ LDH V +VG+   G   DG  YW++KNSWG  WGD GY+ + +D +  C
Sbjct: 373 IYYEPDCSSEDLDHGVLVVGYGFEGEDVDGKKYWIVKNSWGEKWGDKGYIYMAKDRKNHC 432

Query: 338 GIGTRSSYPLA 348
           GI T +SYPL 
Sbjct: 433 GIATAASYPLV 443


>gi|72005575|ref|XP_783218.1| PREDICTED: cathepsin L2-like isoform 2 [Strongylocentrotus
           purpuratus]
 gi|390337647|ref|XP_003724610.1| PREDICTED: cathepsin L2-like isoform 1 [Strongylocentrotus
           purpuratus]
          Length = 334

 Score =  238 bits (607), Expect = 3e-60,   Method: Compositional matrix adjust.
 Identities = 138/338 (40%), Positives = 195/338 (57%), Gaps = 17/338 (5%)

Query: 19  FIIITLLVSCA-SQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
           FII+ L V+ A +  + SR   E+      ++W+  HG+ Y    E+  R  I+++NL  
Sbjct: 4   FIIVLLSVAGALATRLPSRDFDEE-----WKEWVDYHGKEYSAMGEEMERRMIWEDNLRI 58

Query: 78  IEKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMT 134
           I K N E   G  TY+LG N+F D+TN EF A  T  KM         S+    + L + 
Sbjct: 59  ITKHNLEHSQGKTTYRLGMNEFGDMTNAEFVATRTMKKMSGVPKVGQGSTFLPSEFLQL- 117

Query: 135 DVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS-T 193
             P S+DWR +G VTP+K+Q +CG CWAF+ V A+EG   +++G L+ LSEQ L+DCS  
Sbjct: 118 --PDSVDWRTEGYVTPVKDQGQCGSCWAFSTVGALEGQHFVKTGTLVSLSEQNLVDCSQA 175

Query: 194 NGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGD 253
            GN+GC GG    A  YI  N GI TE  YPY+ V  +C        A I+ + EV +  
Sbjct: 176 EGNDGCNGGWPAWADEYIKSNGGIDTEVGYPYEGVDDSCHYRTSDVGATITGFAEVEADS 235

Query: 254 EQALLKAVS-MQPVSIAIAAYSTEFQSYKEGIFN--GVCGTQLDHAVTIVGFGTTEDGAN 310
           E+AL KA++ + P+S+ I A    FQ Y+ G+++      T LDH VT VG+ +T DG  
Sbjct: 236 EKALEKALAQVGPISVCIDATQPSFQLYESGVYDEPDCSSTALDHCVTAVGYDSTADGDK 295

Query: 311 YWLIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYPL 347
           Y+++KNSWG TWG  GY+ + RD +  CGI T ++YPL
Sbjct: 296 YYIVKNSWGTTWGQEGYIWMSRDKQKQCGIATNATYPL 333


>gi|334324655|ref|XP_001370975.2| PREDICTED: cathepsin S-like isoform 1 [Monodelphis domestica]
          Length = 331

 Score =  238 bits (607), Expect = 4e-60,   Method: Compositional matrix adjust.
 Identities = 128/317 (40%), Positives = 191/317 (60%), Gaps = 15/317 (4%)

Query: 39  HEQSVVEIH-EKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKE---GNRTYKLGTN 94
           H   +++ H + W   HG+ YK + E+  R  I+++NL+Y+   N E   G  +Y L  N
Sbjct: 19  HRDPMLDGHWDLWKKTHGKQYKGQNEEIARRLIWEKNLKYVTLHNLEHSMGLHSYDLSMN 78

Query: 95  QFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQ 154
              D+T++E  +L +  ++P+  +R+TT     Y+  S   +P S+DWR+KG VT +K Q
Sbjct: 79  HLGDMTSEEVISLMSSLRIPNQWNRNTT-----YRLSSNQKLPDSVDWREKGCVTEVKYQ 133

Query: 155 KECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN--GNNGCLGGSREKAFAYII 212
             CG CWAF+AV A+E   K+++G L+ LS Q L+DCST+   N+GC GG    AF Y+I
Sbjct: 134 GSCGSCWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTDKYDNHGCNGGFMTSAFQYVI 193

Query: 213 QNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVSMQ-PVSIAIA 271
            N GI ++  YPY+A  G C       AA  S Y E+P G E+AL +AV+ + PVS+ I 
Sbjct: 194 DNNGIDSDVSYPYKATDGKCQYNPASRAATCSKYTELPYGSEEALKEAVANKGPVSVGID 253

Query: 272 AYSTEFQSYKEGI-FNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKI 330
           A +  F  YK G+ ++  C  +++H V ++G+G   DG +YWL+KNSWG  +GD GY++I
Sbjct: 254 AKTPSFFLYKSGVYYDPSCTQKVNHGVLVIGYGNL-DGQDYWLVKNSWGLHFGDKGYVRI 312

Query: 331 VRDEG-LCGIGTRSSYP 346
            R+ G  CGI    SYP
Sbjct: 313 ARNRGNHCGIANFPSYP 329


>gi|410898132|ref|XP_003962552.1| PREDICTED: cathepsin L-like [Takifugu rubripes]
          Length = 335

 Score =  238 bits (607), Expect = 4e-60,   Method: Compositional matrix adjust.
 Identities = 129/312 (41%), Positives = 195/312 (62%), Gaps = 10/312 (3%)

Query: 44  VEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKAN---KEGNRTYKLGTNQFSDLT 100
           +E H  W  + GRSY+   E+  R++I+  N + +   N    +G ++Y+LG  QF+D+ 
Sbjct: 25  MEFH-AWKLKFGRSYRTPSEEVQRMQIWLNNRKLVLVHNILADQGIKSYRLGMTQFADMD 83

Query: 101 NDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCC 160
           N+E+++L +   + + +  +    +  ++    T +PT++DWRDKG VT +K+QK+CG C
Sbjct: 84  NEEYKSLISLGCLRAFNTSAPRRGSAFFRLAEGTHLPTTVDWRDKGYVTGVKDQKQCGSC 143

Query: 161 WAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GNNGCLGGSREKAFAYIIQNQGIAT 219
           WAF+A  ++EG    ++G L+ LSEQQL+DCS + GN GC GG  + AF YI +N GI T
Sbjct: 144 WAFSATGSLEGQNFRKTGKLVSLSEQQLVDCSGDYGNMGCNGGLMDYAFKYIQENGGIDT 203

Query: 220 EDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAV-SMQPVSIAIAAYSTEFQ 278
           E  YPY+A  G C    +   AK + Y +V  GDE AL +AV ++ PVS+ I A  + FQ
Sbjct: 204 EKSYPYEAEDGQCRFKPENVGAKCTGYVDVTVGDEDALKEAVATIGPVSVGIDASHSSFQ 263

Query: 279 SYKEGIFNGV-CGTQ-LDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD-EG 335
            Y  G+++   C +Q LDH V  VG+G T++G +YWL+KNSWG  WG  GY+ + R+ + 
Sbjct: 264 LYDSGVYDEQDCSSQDLDHGVLAVGYG-TDNGQDYWLVKNSWGLGWGQEGYIMMSRNKDN 322

Query: 336 LCGIGTRSSYPL 347
            CGI T +SYPL
Sbjct: 323 QCGIATAASYPL 334


>gi|194352766|emb|CAQ00111.1| papain-like cysteine proteinase [Hordeum vulgare subsp. vulgare]
          Length = 384

 Score =  238 bits (607), Expect = 4e-60,   Method: Compositional matrix adjust.
 Identities = 134/331 (40%), Positives = 185/331 (55%), Gaps = 35/331 (10%)

Query: 50  WMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYT 109
           WMA HGRSY    EK  R ++++ N+E+IE AN++   +Y LG   F+DLT+DEF A+Y+
Sbjct: 55  WMAAHGRSYPTVEEKLRRFEVYRSNMEFIEAANRDSRMSYSLGETPFTDLTHDEFMAMYS 114

Query: 110 GYKMPS-------------PSHRSTTS--STFKYQNLSMTDV-PTSLDWRDKGAVTPIKN 153
                S             P H  T +     +  NL++T V P S+DWR KG VTP KN
Sbjct: 115 SNDDSSEWEEATVITTRAGPVHEGTAAVEEPPRRTNLNVTAVLPPSVDWRAKGVVTPAKN 174

Query: 154 Q-KECGCCWAFAAVAAVEGITKIRSGNLIQ-LSEQQLLDCSTNGNNGCLGGSREKAFAYI 211
           Q   C  CWAF +VA +E    I +G     LSEQQL+DCST  ++GC  G  + AF ++
Sbjct: 175 QGATCFSCWAFTSVATMESAQAISTGGSPPVLSEQQLVDCSTL-HHGCGRGWMDDAFKWV 233

Query: 212 IQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEV-PSGDEQALLKAVSMQPVSIAI 270
           I N GI TE  YPY    G C    KP A ++ +Y++V P G+E  L +AV+ QPV+++ 
Sbjct: 234 IMNGGITTEAAYPYTGKAGNCQTG-KPVAVRLRSYKKVTPPGNEAGLKEAVAQQPVAVSF 292

Query: 271 AAYSTEFQSYKEGIFN-----------GVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWG 319
                 FQ Y  G++N           G C T  +HA+ +VG+GT  DG  YW+ KNSW 
Sbjct: 293 DYSDPCFQHYIGGVYNAGCSRSGVYIKGACKTAQNHAMALVGYGTKPDGTKYWIGKNSWT 352

Query: 320 NTWGDAGYMKIVRDE---GLCGIGTRSSYPL 347
             WGD G++ ++RD    GLCG+     YP+
Sbjct: 353 AKWGDKGFIYLLRDSPPLGLCGLAKLPVYPI 383


>gi|325185016|emb|CCA19507.1| cysteine protease family C01A putative [Albugo laibachii Nc14]
          Length = 492

 Score =  238 bits (606), Expect = 4e-60,   Method: Compositional matrix adjust.
 Identities = 124/305 (40%), Positives = 171/305 (56%), Gaps = 32/305 (10%)

Query: 50  WMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYT 109
           W+  H  ++ D  E   RL+ +  N  YI   N +   ++KLG N FS LTN+EFR  + 
Sbjct: 36  WLKTHHLTFSDAFEYAKRLETYIANDIYILTHNLQ-ESSFKLGHNAFSHLTNEEFRQRFN 94

Query: 110 GYKMPSP--SHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVA 167
           G+K      + R   S+     N    D+P S+DW +KGAVT +KNQ  CG CWAF+   
Sbjct: 95  GFKASDDYLTKRLAQSNVASSTNFQYIDLPESVDWVEKGAVTGVKNQGMCGSCWAFSTTG 154

Query: 168 AVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQA 227
           A+EG T I SG L+ LSEQ+L+DC  NG++GC GG  + AF++I ++ GI +E++Y Y  
Sbjct: 155 AIEGATFISSGKLVSLSEQELVDCDHNGDHGCNGGLMDHAFSWISEHDGICSEEDYAYIH 214

Query: 228 VPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTEFQSYKEGIFNG 287
               C +  KP                        + PV++AI A    FQ Y+ G++N 
Sbjct: 215 SQSLCRSC-KPV-----------------------VSPVAVAIDAGDRSFQFYQSGVYNK 250

Query: 288 VCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRDE----GLCGIGTRS 343
            CGTQLDH V  VG+G  EDG  YW +KNSWGN+WG+ GY+++ RD+    G CGI    
Sbjct: 251 TCGTQLDHGVLTVGYG-VEDGQKYWKVKNSWGNSWGEKGYIRLSRDQNGRSGQCGIAMVP 309

Query: 344 SYPLA 348
           SYP A
Sbjct: 310 SYPTA 314


>gi|223646726|gb|ACN10121.1| Cathepsin L1 precursor [Salmo salar]
 gi|223672581|gb|ACN12472.1| Cathepsin L1 precursor [Salmo salar]
          Length = 338

 Score =  238 bits (606), Expect = 4e-60,   Method: Compositional matrix adjust.
 Identities = 138/343 (40%), Positives = 194/343 (56%), Gaps = 24/343 (6%)

Query: 20  IIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIE 79
           + + +LV C S V ++     Q     H  W   H +SY  E E+  R  ++++NL+ IE
Sbjct: 4   LYLAVLVLCVSAVCAAPRFDSQLEDHWH-LWKNWHSKSYH-ESEEGWRRMVWEKNLKKIE 61

Query: 80  KANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFK---YQNLSM 133
             N E   G  +Y+LG N F D+TN+EFR    GYK        TT   FK   +   + 
Sbjct: 62  MHNLEHTMGKHSYRLGMNHFGDMTNEEFRQTMNGYK-------QTTERKFKGSLFMEPNY 114

Query: 134 TDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS- 192
              P ++DWR+KG VTP+K+Q  CG CWAF+   A+EG    ++G L+ LSEQ L+DCS 
Sbjct: 115 LQAPKAVDWREKGYVTPVKDQGSCGSCWAFSTTGAMEGQQFRKTGKLVSLSEQNLVDCSR 174

Query: 193 TNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAV-PGTCSAAQKPAAAKISNYEEVPS 251
             GN GC GG  ++AF YI  N G+ TE+ YPY       C    + + A  + + ++PS
Sbjct: 175 PEGNEGCNGGLMDQAFQYIQDNAGLDTEESYPYVGTDEDPCHYKPEFSGANETGFVDIPS 234

Query: 252 GDEQALLKAV-SMQPVSIAIAAYSTEFQSYKEGI-FNGVCGT-QLDHAVTIVGF---GTT 305
           G E A++KAV ++ PVS+AI A    FQ Y+ GI +   C + +LDH V +VG+   G  
Sbjct: 235 GKEHAMMKAVAAVGPVSVAIDAGHESFQFYEFGIYYEKECSSEELDHGVLVVGYGFEGED 294

Query: 306 EDGANYWLIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYPL 347
            DG  YW++KNSW   WGD GY+ + +D +  CGI T SSYPL
Sbjct: 295 VDGKKYWIVKNSWSEKWGDKGYIYMAKDRKNHCGIATASSYPL 337


>gi|219884655|gb|ACL52702.1| unknown [Zea mays]
 gi|413916718|gb|AFW56650.1| thiol protease SEN102 [Zea mays]
          Length = 349

 Score =  238 bits (606), Expect = 4e-60,   Method: Compositional matrix adjust.
 Identities = 135/326 (41%), Positives = 194/326 (59%), Gaps = 24/326 (7%)

Query: 43  VVEIH-------EKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQ 95
           +V IH       + W A++ R+Y    E + R  ++ EN+++IE  N+ G+ +Y+LG NQ
Sbjct: 26  IVPIHIPLLDRFQAWQAEYNRTYATPEEFQQRFMVYSENVKFIETMNQPGS-SYELGENQ 84

Query: 96  FSDLTNDEFRALYTGYKM----PSPSHRSTTSSTFKYQNLS----MTDVPTSLDWRDKGA 147
           F+DLT +EF+  Y   K+     SP   + T  T      S      + P S+DWR KGA
Sbjct: 85  FADLTEEEFKDTYL-MKLDNVASSPEAMALTVDTMNRAGTSGGSNTNEAPNSVDWRTKGA 143

Query: 148 VTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS-TNGNNGCLGGSREK 206
           VTP+K+Q+ CG CWAFAAVA++EG+ KI++G L+ LSEQ+++DC     N+GC GG    
Sbjct: 144 VTPVKSQQHCGSCWAFAAVASIEGVHKIKTGRLVSLSEQEIVDCDRGGNNHGCHGGHSSS 203

Query: 207 AFAYIIQNQGIATEDEYPYQAVPGTC-SAAQKPAAAKISNYEEVPSGDEQALLKAVSMQP 265
           A  ++ +N G+ TE +YPY    G C S      AAKI   + V   +E AL  AV+ +P
Sbjct: 204 AMEWVTRNGGLTTESDYPYVGRQGQCMSDKLGHHAAKIRGRQAVQGKNEGALQHAVAGRP 263

Query: 266 VSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDA 325
           V+++I A S  FQ YK GIF+G C T  +HAVT+VG+G    G  YW++KNSWG  WG+ 
Sbjct: 264 VAVSINA-SRAFQFYKRGIFSGPCNTTRNHAVTVVGYGANASGHKYWIVKNSWGERWGEK 322

Query: 326 GYMKIVRD----EGLCGIGTRSSYPL 347
           GY+++ R     EG+CGI     Y +
Sbjct: 323 GYVRMQRGVRAREGVCGIAIAPFYAV 348


>gi|431896621|gb|ELK06033.1| Cathepsin S [Pteropus alecto]
          Length = 331

 Score =  238 bits (606), Expect = 4e-60,   Method: Compositional matrix adjust.
 Identities = 132/337 (39%), Positives = 197/337 (58%), Gaps = 16/337 (4%)

Query: 18  MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
           M  +  +L+ C++ V  ++   + ++    + W   + + Y++++E+  R  I+++NL++
Sbjct: 1   MKWLACVLLGCSAAV--AQLQRDPTLDRHWDLWKKTYSKHYREKIEEVARRLIWEKNLKF 58

Query: 78  IEKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMT 134
           +   N E   G  +Y LG N   D+T++E  +L     +PS   R+ T  +   Q L   
Sbjct: 59  VMLHNLEHSMGMHSYDLGMNHLGDMTSEEVISLMGSLTVPSQWQRNVTYKSNPNQKL--- 115

Query: 135 DVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN 194
             P SLDWRDKG VT +K Q  CG CWAF+AV A+E   K+++G L+ LS Q L+DCST 
Sbjct: 116 --PDSLDWRDKGCVTEVKYQGSCGSCWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTE 173

Query: 195 --GNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSG 252
              N GC GG    AF YII N GI +E  YPY+A  G C    K  AA  S Y E+P G
Sbjct: 174 KYSNKGCNGGFMTSAFQYIIDNNGIDSEASYPYKAQDGKCQYDSKFRAATCSKYTELPFG 233

Query: 253 DEQALLKAVSMQ-PVSIAIAAYSTEFQSYKEGI-FNGVCGTQLDHAVTIVGFGTTEDGAN 310
            E+AL +AV+ + PVS+AI A    F  Y+ G+ ++  C  +++H V +VG+G   DG +
Sbjct: 234 SEEALKEAVANKGPVSVAIDASHPSFFLYRSGVYYDQSCTLKVNHGVLVVGYGNL-DGKD 292

Query: 311 YWLIKNSWGNTWGDAGYMKIVRDEG-LCGIGTRSSYP 346
           YWL+KNSWG  +GD GY+++ R+ G  CGI +  SYP
Sbjct: 293 YWLVKNSWGLNFGDKGYIRMARNSGNHCGIASYPSYP 329


>gi|405971603|gb|EKC36430.1| Cathepsin L [Crassostrea gigas]
          Length = 360

 Score =  238 bits (606), Expect = 4e-60,   Method: Compositional matrix adjust.
 Identities = 137/303 (45%), Positives = 186/303 (61%), Gaps = 15/303 (4%)

Query: 54  HGRSYKDELEKE-MRLKIFKENLEYIEKANKE---GNRTYKLGTNQFSDLTNDEFRALYT 109
           H ++Y D LE+E  R +IF+EN++ IE+ NK    G ++Y LG NQFSDL ++EF   Y 
Sbjct: 63  HDKTY-DALEEESRRFEIFRENVQKIEEHNKLYHLGKKSYYLGVNQFSDLKHEEF-VKYN 120

Query: 110 GYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAV 169
           G K  S       SS     NL     P S+DWR KG VT +KNQ +CG CW+F+   ++
Sbjct: 121 GLKKTSLK-DGGCSSYLAANNLVE---PDSVDWRKKGYVTDVKNQGQCGSCWSFSTTGSL 176

Query: 170 EGITKIRSGNLIQLSEQQLLDCSTN-GNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAV 228
           EG    +SG L+ LSE QL+DCS + GN GC GG  + AF YI    G+ +E++YPY+  
Sbjct: 177 EGQHFRKSGKLVSLSESQLVDCSQSFGNEGCNGGLMDNAFKYIKSVGGLESEEDYPYKPK 236

Query: 229 PGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVS-MQPVSIAIAAYSTEFQSYKEGIFN- 286
            GTC       AA  +   +V SG E AL KAVS + PVS+AI A  + FQSY  G+++ 
Sbjct: 237 QGTCKFDDTKVAATDTGCVDVESGSESALKKAVSEVGPVSVAIDASHSSFQSYAGGVYDE 296

Query: 287 -GVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSS 344
                 QLDH V  VG+GT + G +YW++KNSWG  WG+ GY+K+ R+ +  CGI T++S
Sbjct: 297 PECSSEQLDHGVLCVGYGTDDQGQDYWIVKNSWGAEWGEDGYVKMSRNKKNQCGIATQAS 356

Query: 345 YPL 347
           YPL
Sbjct: 357 YPL 359


>gi|413919735|gb|AFW59667.1| hypothetical protein ZEAMMB73_680472 [Zea mays]
          Length = 344

 Score =  238 bits (606), Expect = 4e-60,   Method: Compositional matrix adjust.
 Identities = 116/266 (43%), Positives = 164/266 (61%), Gaps = 7/266 (2%)

Query: 32  VVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKAN---KEGNRT 88
           +VS     E+    ++ +WMA HGR+Y    E+E R ++F++NL Y++  N     G  +
Sbjct: 31  IVSYGERSEEEARRMYAEWMAAHGRTYNAVGEEERRFEVFRDNLRYVDAHNAAADAGVHS 90

Query: 89  YKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAV 148
           ++LG N+F+DLTNDE+RA Y G +      R          N    D+P S+DWR KGAV
Sbjct: 91  FRLGLNRFADLTNDEYRATYLGVRSRPQRERRLGDRYLAGDN---EDLPESVDWRAKGAV 147

Query: 149 TPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAF 208
             +K+Q  CG CWAF+ +AAVEGI +I +G++I LSEQ+L+DC T+ N GC GG  + AF
Sbjct: 148 AEVKDQGSCGSCWAFSTIAAVEGINQIVTGDMISLSEQELVDCDTSYNQGCNGGLMDYAF 207

Query: 209 AYIIQNQGIATEDEYPYQAVPGTCSAAQKPA-AAKISNYEEVPSGDEQALLKAVSMQPVS 267
            +II N GI TE++YPY+   G C   +K A    I +YE+VP+  E++L KAV+ QP+S
Sbjct: 208 EFIINNGGIDTEEDYPYKGTDGRCDVNRKNAKVVTIDSYEDVPANSEKSLQKAVANQPIS 267

Query: 268 IAIAAYSTEFQSYKEGIFNGVCGTQL 293
           +AI A    FQ Y  GIF G CG  +
Sbjct: 268 VAIEAGGRAFQLYNSGIFTGTCGNSV 293


>gi|395514298|ref|XP_003761356.1| PREDICTED: cathepsin L1-like [Sarcophilus harrisii]
          Length = 365

 Score =  238 bits (606), Expect = 4e-60,   Method: Compositional matrix adjust.
 Identities = 134/360 (37%), Positives = 200/360 (55%), Gaps = 41/360 (11%)

Query: 25  LVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKE 84
           LVS    +V++    ++++     +W AQH R Y +   ++ R  I+++NL  IE  N E
Sbjct: 7   LVSLCLGLVAAIPKLDRTLDAQWYQWKAQHRRDYGEN--EDWRRAIWEKNLRSIEMHNLE 64

Query: 85  ---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFK-------------- 127
              G  ++++  N+F D+TN+EFR +  G+       R T    F+              
Sbjct: 65  YSAGKHSFQMEMNKFGDMTNEEFRQVMNGFSTHR-VQRRTKGRLFREPLLVQIPKSVDWR 123

Query: 128 ----------------YQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEG 171
                           ++   +  +P S+DWRDKG VTP+KNQ +CG CWAF+A  ++EG
Sbjct: 124 DKGYVTPVKNQLVRRLFREPLLVQIPKSVDWRDKGYVTPVKNQGQCGSCWAFSATGSLEG 183

Query: 172 ITKIRSGNLIQLSEQQLLDCST-NGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPG 230
               ++G L+ LSEQ L+DCST  GN+GC GG  + AF Y+ +N GI TE+ YPY A   
Sbjct: 184 QWFRKTGKLVSLSEQNLVDCSTAQGNSGCQGGLMDNAFEYVKENGGIDTEESYPYIAADD 243

Query: 231 TCSAAQKPAAAKISNYEEVPSGDEQALLKAV-SMQPVSIAIAAYSTEFQSYKEGI-FNGV 288
           TC    + + A I+ Y ++PS  E+AL KAV ++ P+S+AI A  + FQ Y+ G+ +   
Sbjct: 244 TCQYKPQYSGANITGYVDIPSRMEKALEKAVATVGPISVAIDAGHSSFQFYRSGVYYEPE 303

Query: 289 CGTQ-LDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRDE-GLCGIGTRSSYP 346
           C ++ LDH V  VG+G       YW++KNSWG  WGD+GY+ + RD    CGI T +SYP
Sbjct: 304 CSSEDLDHGVLAVGYGVQGKNGKYWIVKNSWGEEWGDSGYILMARDRNNHCGIATAASYP 363


>gi|118424553|gb|ABK90824.1| cathepsin L-like cysteine proteinase [Spodoptera exigua]
          Length = 344

 Score =  238 bits (606), Expect = 4e-60,   Method: Compositional matrix adjust.
 Identities = 125/311 (40%), Positives = 183/311 (58%), Gaps = 16/311 (5%)

Query: 53  QHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNR---TYKLGTNQFSDLTNDEFRALYT 109
           +H + Y  E+E + R+KI+ EN   I K N+   +   +YKL  N+++D+ + EF     
Sbjct: 33  EHSKQYDSEVEDKFRMKIYVENKHRITKHNQRFEQRLVSYKLKPNKYADMLHHEFVHTMN 92

Query: 110 GY-KMPSPSHRSTTSSTFKYQNLSMTDV-------PTSLDWRDKGAVTPIKNQKECGCCW 161
           G+ K      R+       +   + T +       P  +DWR KGAVT +K+Q +CG CW
Sbjct: 93  GFNKTAKHGGRNKNVHGKGHDGRAATFIAPAHVSYPDHVDWRKKGAVTDVKDQGKCGSCW 152

Query: 162 AFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GNNGCLGGSREKAFAYIIQNQGIATE 220
           AF+   A+EG    ++G L+ LSEQ L+DCS   GNNGC GG  + AF YI  N GI TE
Sbjct: 153 AFSTTGALEGQHFRKTGYLVSLSEQNLIDCSAAYGNNGCNGGLMDNAFKYIKDNGGIDTE 212

Query: 221 DEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAV-SMQPVSIAIAAYSTEFQS 279
             YPY+AV   C    K + A    + ++P GDE+ L++AV ++ P+S+AI A    FQ 
Sbjct: 213 KSYPYEAVDDKCRYNPKESGADDVGFVDIPQGDEEKLMQAVATVGPISVAIDASQETFQF 272

Query: 280 YKEGIF--NGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRDE-GL 336
           Y +G++       T LDH V +VG+GT EDG++ WL+KNSWG +WG+ GY+K+ R++   
Sbjct: 273 YSKGVYYDENCSSTDLDHGVMVVGYGTEEDGSDDWLVKNSWGRSWGELGYIKMARNKNNH 332

Query: 337 CGIGTRSSYPL 347
           CGI + +SYPL
Sbjct: 333 CGIASSASYPL 343


>gi|309380130|gb|ADO65978.1| cathepsin L [Eriocheir sinensis]
 gi|309380134|gb|ADO65980.1| cathepsin L [Eriocheir sinensis]
          Length = 325

 Score =  238 bits (606), Expect = 4e-60,   Method: Compositional matrix adjust.
 Identities = 130/313 (41%), Positives = 192/313 (61%), Gaps = 24/313 (7%)

Query: 48  EKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKE---GNRTYKLGTNQFSDLTNDEF 104
           +++ A++G+ Y+   E   R  ++++N E+I   N++   G  ++ L  NQF D+T +E 
Sbjct: 23  QQFKARYGKQYRSTKEDSYRQSVYEQNQEFINSHNEQYENGLVSFTLAMNQFGDMTTEEI 82

Query: 105 RALYTGY-----KMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGC 159
            A   G+     K+P    R T      YQ L + ++P ++DWRDKGAVTP+K+QK CG 
Sbjct: 83  NAAMNGFLSAGKKVP----RGTM-----YQPL-VDELPDTVDWRDKGAVTPVKDQKACGS 132

Query: 160 CWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GNNGCLGGSREKAFAYIIQNQGIA 218
           CWAF+A  ++EG   + +G L+ LSEQ L+DCS   GN GC GG  + AF YI  N GI 
Sbjct: 133 CWAFSATGSLEGQHFLSTGKLVSLSEQNLVDCSDKYGNFGCGGGLMDNAFRYIKDNNGID 192

Query: 219 TEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVSMQ-PVSIAIAAYSTEF 277
           TE+ YPY+A  G C        A +S+Y ++  G E  L KAV+ + PVS+AI A ++ F
Sbjct: 193 TEESYPYEAKNGPCRFNSDNVGATLSSYVDIQHGSEDDLQKAVAEKGPVSVAIDASTSTF 252

Query: 278 QSYKEGI-FNGVCGTQ-LDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRDE- 334
             Y  GI ++  C +  LDH V  VG+G T+D ++YWL+KNSW  TWGD+GY+K+ R+  
Sbjct: 253 HFYSRGIYYDEKCSSSFLDHGVLAVGYG-TDDSSDYWLVKNSWNETWGDSGYIKMSRNRN 311

Query: 335 GLCGIGTRSSYPL 347
             CGI +++SYP+
Sbjct: 312 NNCGIASQASYPV 324


>gi|344953542|gb|AEN28617.1| cathepsin L-like cysteine protease [Epinephelus coioides]
          Length = 336

 Score =  237 bits (605), Expect = 5e-60,   Method: Compositional matrix adjust.
 Identities = 135/341 (39%), Positives = 197/341 (57%), Gaps = 18/341 (5%)

Query: 19  FIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYI 78
            + + ++  C S  +S+ S   Q + +  E W + H + Y  E E+  R  ++++NL+ I
Sbjct: 1   MLPLAVVALCLSAALSAPSLDPQ-LDDHWELWKSWHSKKYH-EKEEGWRRMVWEKNLKKI 58

Query: 79  EKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTD 135
           E  N E   G  +Y+LG N F D+T++EFR L  GYK  + +      S F   N    +
Sbjct: 59  ELHNLEHSMGTHSYRLGMNHFGDMTHEEFRQLMNGYKRKAET--KARGSLFLEPNF--LE 114

Query: 136 VPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS-TN 194
            P S+DWRD G VTP+K+Q +CG CWAF+   A+EG    ++G L+ LSEQ L+DCS   
Sbjct: 115 APKSVDWRDNGYVTPVKDQGQCGSCWAFSTTGALEGQHFRKTGKLVSLSEQNLVDCSRPE 174

Query: 195 GNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPG-TCSAAQKPAAAKISNYEEVPSGD 253
           GN GC GG  ++AF Y+  NQG+ +ED YPY       C       +   + + ++PSG 
Sbjct: 175 GNEGCNGGLMDQAFQYVKDNQGLDSEDSYPYLGTDDQPCHYDPTYNSVNDTGFVDIPSGK 234

Query: 254 EQALLKAV-SMQPVSIAIAAYSTEFQSYKEGI-FNGVCGT-QLDHAVTIVGF---GTTED 307
           E+AL+KAV ++ PVS+AI A    FQ Y+ GI +   C + +LDH V +VG+   G   D
Sbjct: 235 ERALMKAVAAVGPVSVAIDAGHESFQFYQSGIYYEKECSSEELDHGVLVVGYGFQGEDVD 294

Query: 308 GANYWLIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYPL 347
           G  YW++KNSW   WGD GY+ + +D +  CGI T +SYPL
Sbjct: 295 GKKYWIVKNSWSEKWGDKGYIYMAKDRKNHCGIATAASYPL 335


>gi|148224022|ref|NP_001087489.1| cathepsin L2 precursor [Xenopus laevis]
 gi|51258284|gb|AAH80004.1| MGC81823 protein [Xenopus laevis]
          Length = 335

 Score =  237 bits (605), Expect = 5e-60,   Method: Compositional matrix adjust.
 Identities = 138/342 (40%), Positives = 200/342 (58%), Gaps = 20/342 (5%)

Query: 18  MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
           M + +     C + V ++ +T + ++ +    W   H +SY  + E+  R  ++++NL  
Sbjct: 1   MALYLVAAALCLTTVFAAPTT-DPALDDHWHLWKNWHKKSYLPK-EEGWRRVLWEKNLRT 58

Query: 78  IEKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMT 134
           IE  N +   G  +Y+LG NQF D+TN+EFR L  GYK    + +    STF   N    
Sbjct: 59  IEFHNLDHSLGKHSYRLGMNQFGDMTNEEFRQLMNGYK----NQKMIKGSTFLAPN--NF 112

Query: 135 DVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS-T 193
           + P ++DWR+KG VTP+K+Q +CG CWAF+   A+EG    ++G LI LSEQ L+DCS  
Sbjct: 113 EAPKTVDWREKGYVTPVKDQGQCGSCWAFSTTGALEGQHYRKAGKLISLSEQNLVDCSRA 172

Query: 194 NGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGT-CSAAQKPAAAKISNYEEVPSG 252
            GN GC GG  ++AF Y+  N GI +ED YPY A     C       +A  + + +VPSG
Sbjct: 173 QGNQGCNGGLMDQAFQYVKDNGGIDSEDSYPYTAKDDQECHYDPNYNSANDTGFVDVPSG 232

Query: 253 DEQALLKAV-SMQPVSIAIAAYSTEFQSYKEGI-FNGVCGTQ-LDHAVTIVGF---GTTE 306
            E+ L+KAV S+ PVS+A+ A    FQ Y+ GI ++  C ++ LDH V +VG+   G   
Sbjct: 233 SEKDLMKAVASVGPVSVAVDAGHKSFQFYQSGIYYDPECSSEDLDHGVLVVGYGFEGEDV 292

Query: 307 DGANYWLIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYPL 347
           DG  YW++KNSW   WG+ GY+KI +D    CGI T +SYPL
Sbjct: 293 DGKRYWIVKNSWSEKWGNNGYIKIAKDRHNHCGIATAASYPL 334


>gi|17062058|gb|AAL34984.1|AF320565_1 cathepsine L-like cysteine protease [Rhodnius prolixus]
          Length = 316

 Score =  237 bits (605), Expect = 5e-60,   Method: Compositional matrix adjust.
 Identities = 124/312 (39%), Positives = 189/312 (60%), Gaps = 17/312 (5%)

Query: 48  EKWMA---QHGRSYKDELEKEMRLKIFKENLEYIEKANKE---GNRTYKLGTNQFSDLTN 101
           ++W+A    HG++Y+++ E+  R+K+F +N + I++ N +   G  +YK+  N   DL  
Sbjct: 11  QEWLAFKAMHGKNYRNQFEEIFRMKVFIDNKKKIDEHNAKYELGEASYKMKMNHLGDLMV 70

Query: 102 DEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCW 161
            EF+AL  G+K    + R+        +NL     P S+DWR +GAVTP+K+Q  CG CW
Sbjct: 71  HEFKALMNGFKKTPNAERNGKIYVPSNENL-----PKSVDWRQRGAVTPVKDQGHCGSCW 125

Query: 162 AFAAVAAVEGITKIRSGNLIQLSEQQLLDCS-TNGNNGCLGGSREKAFAYIIQNQGIATE 220
           +F+A  ++EG   +++G L+ LSEQ L+DCS T GN+GC GG   +AF Y+  N+GI TE
Sbjct: 126 SFSATGSLEGQLFLKTGRLVSLSEQNLVDCSKTYGNSGCEGGLMNQAFQYVRDNKGIDTE 185

Query: 221 DEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAV-SMQPVSIAIAAYSTEFQS 279
             YPY+A    C   +         Y ++    E+ L  AV ++ P+S+ I A    FQ 
Sbjct: 186 ASYPYEARENNCRFKEDKVGGTDKGYVDILEASEKDLQSAVATVGPISVRIDASHESFQF 245

Query: 280 YKEGIFN-GVCG-TQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD-EGL 336
           Y EG++    C  +QLDH V  VG+G TE+G +YWL+KNSWG +WG++GY+KI R+ +  
Sbjct: 246 YSEGVYKEQYCSPSQLDHGVLTVGYG-TENGQDYWLVKNSWGPSWGESGYIKIARNHKNH 304

Query: 337 CGIGTRSSYPLA 348
           CGI + +SYP+ 
Sbjct: 305 CGIASMASYPVV 316


>gi|149751225|ref|XP_001490531.1| PREDICTED: cathepsin S-like [Equus caballus]
          Length = 332

 Score =  237 bits (605), Expect = 5e-60,   Method: Compositional matrix adjust.
 Identities = 129/317 (40%), Positives = 187/317 (58%), Gaps = 15/317 (4%)

Query: 39  HEQSVVEIH-EKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKE---GNRTYKLGTN 94
           H    ++ H + W   +G+ YK++ E+  R  I++ NL+++   N E   G  +Y LG N
Sbjct: 20  HRDPTLDNHWDLWKKTYGKQYKEKNEEVARRLIWERNLKFVMLHNLEHSMGMHSYDLGMN 79

Query: 95  QFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQ 154
              D+T++E  +L +  ++PS   R+ T     Y++     +P SLDWR+KG VT +K Q
Sbjct: 80  HLGDMTSEEVTSLMSSLRVPSQWQRNVT-----YKSNPNEKLPDSLDWREKGCVTEVKYQ 134

Query: 155 KECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN--GNNGCLGGSREKAFAYII 212
             CG CWAF+AV A+E   K+++GNL+ LS Q L+DCST    N GC GG    AF YII
Sbjct: 135 GSCGACWAFSAVGALEAQLKLKTGNLVSLSAQNLVDCSTEKYSNKGCNGGFMTAAFQYII 194

Query: 213 QNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVSMQ-PVSIAIA 271
            N GI ++  YPY+A+ G C    K  AA  S Y E+P G E  L +AV+ + PVS+AI 
Sbjct: 195 DNNGIDSDASYPYKAMDGKCRYDSKNRAATCSKYTELPFGSEDDLKEAVANKGPVSVAID 254

Query: 272 AYSTEFQSYKEGI-FNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKI 330
           A    F  YK G+ ++  C   ++H V +VG+G   +G +YWL+KNSWG  +GD GY+++
Sbjct: 255 ASHPSFFLYKSGVYYDPSCTQNVNHGVLVVGYGNL-NGKDYWLVKNSWGINFGDKGYIRM 313

Query: 331 VRDEG-LCGIGTRSSYP 346
            R+ G  CGI    SYP
Sbjct: 314 ARNSGNHCGIANYCSYP 330


>gi|356549192|ref|XP_003542981.1| PREDICTED: cysteine proteinase RD21a-like [Glycine max]
          Length = 517

 Score =  237 bits (605), Expect = 6e-60,   Method: Compositional matrix adjust.
 Identities = 126/317 (39%), Positives = 190/317 (59%), Gaps = 17/317 (5%)

Query: 40  EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTY--KLGTNQFS 97
           E+ V+E+ ++W  ++ + Y+   ++++R + FK NL+YI + N +    Y   LG N+F+
Sbjct: 43  EEGVIELFQRWKEENKKIYRSPDQEKLRFENFKRNLKYIAEKNSKRISPYGQSLGLNRFA 102

Query: 98  DLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKEC 157
           D++N+EF++ +T       S R+  S     ++ S  D P SLDWR KG VT +K+Q  C
Sbjct: 103 DMSNEEFKSKFTSKVKKPFSKRNGLSG----KDHSCEDAPYSLDWRKKGVVTAVKDQGYC 158

Query: 158 GCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGI 217
           GCCWAF++  A+EGI  I SG+LI LSE +L+DC    N+GC GG  + AF +++ N GI
Sbjct: 159 GCCWAFSSTGAIEGINAIVSGDLISLSEPELVDCDRT-NDGCDGGHMDYAFEWVMHNGGI 217

Query: 218 ATEDEYPYQAVPGTCSAAQKPAAA-KISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTE 276
            TE  YPY    GTC+ A++      I  Y  V   D ++LL A   QP+S  I   S +
Sbjct: 218 DTETNYPYSGADGTCNVAKEETKVIGIDGYYNVEQSD-RSLLCATVKQPISAGIDGSSWD 276

Query: 277 FQSYKEGIFNGVCGT---QLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD 333
           FQ Y  GI++G C +    +DHA+ +VG+G+  D  +YW++KNSWG +WG  GY+ I R+
Sbjct: 277 FQLYIGGIYDGDCSSDPDDIDHAILVVGYGSEGD-EDYWIVKNSWGTSWGMEGYIYIRRN 335

Query: 334 E----GLCGIGTRSSYP 346
                G+C I   +SYP
Sbjct: 336 TNLKYGVCAINYMASYP 352


>gi|242074968|ref|XP_002447420.1| hypothetical protein SORBIDRAFT_06g000780 [Sorghum bicolor]
 gi|241938603|gb|EES11748.1| hypothetical protein SORBIDRAFT_06g000780 [Sorghum bicolor]
          Length = 381

 Score =  237 bits (605), Expect = 6e-60,   Method: Compositional matrix adjust.
 Identities = 127/313 (40%), Positives = 188/313 (60%), Gaps = 21/313 (6%)

Query: 39  HEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSD 98
           HE  ++E    WMA HGRSY    EK  R +I+++N+++IE  N++  +T+  G NQF+D
Sbjct: 51  HELLMMERFHAWMAAHGRSYPTAEEKLRRFQIYRDNVKFIEAINRDTTKTFTCGENQFTD 110

Query: 99  LTNDEFRALYTGYKMPS-PSHRSTTSSTFKYQNLSM-----------TDVPTSLDWRDKG 146
           LT+ EF A YT     S P   S++  T +  +++            TD+P  +DWR++ 
Sbjct: 111 LTHQEFLARYTMASHDSVPLDLSSSVITTRAGDITESDSGTTMQVEDTDLPEHVDWREQD 170

Query: 147 AVTPIKNQKE-CGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSRE 205
           AVTP++NQ + C  CW FA+VA +E   KI++G+L++LSEQQ++DC+      C GG+ +
Sbjct: 171 AVTPVQNQLQGCHACWVFASVATIESANKIKNGDLLKLSEQQIVDCTA---EKCGGGTLQ 227

Query: 206 KAFAYIIQNQGIATEDEY-PYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVSMQ 264
           +AF Y+ +N GIATE+EY  Y A  G+C A     A +I  Y+ +P  +E AL + V  Q
Sbjct: 228 EAFKYVQKNGGIATEEEYGAYTAKAGSCHAGNVRKAVRIQTYDFLPRENETALAEKVVQQ 287

Query: 265 PVSIAIAAYSTEFQSYKEGIFNG---VCGTQLDHAVTIVGFGTTED-GANYWLIKNSWGN 320
           PV++   A+   F  YK GI++G        L+HA+ IVG+G  E  G  YW+ KNSWG 
Sbjct: 288 PVAVLFDAHDPAFAYYKGGIYSGGQPRTRYVLNHAMAIVGYGKNESTGQKYWIAKNSWGT 347

Query: 321 TWGDAGYMKIVRD 333
            WGD GY+ I +D
Sbjct: 348 GWGDGGYVYIAKD 360


>gi|342675481|gb|AEL31666.1| cathepsin L [Cynoglossus semilaevis]
          Length = 336

 Score =  237 bits (604), Expect = 6e-60,   Method: Compositional matrix adjust.
 Identities = 136/341 (39%), Positives = 195/341 (57%), Gaps = 18/341 (5%)

Query: 19  FIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYI 78
            + + LL    S V+S+ S  +  + +  E W   H + Y  E E+  R  I+++NL  I
Sbjct: 1   MLPLALLALGVSAVLSAPSL-DARLSDHWELWKNWHSKKYH-EKEEGWRRMIWEKNLNKI 58

Query: 79  EKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTD 135
           E  N E   G  +Y+LG N F D+T++EFR +  GY+    + R    S F   N  +  
Sbjct: 59  ELHNLEHSMGKHSYRLGMNHFGDMTHEEFRQIMNGYQ--RKTERKAIGSLFMEPNFMVA- 115

Query: 136 VPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS-TN 194
            P+++DWR+KG VTP+K+Q +CG CWAF+   A+ZG    + G L+ LSEQ L+DCS   
Sbjct: 116 -PSAVDWREKGYVTPVKDQGQCGSCWAFSTTGALZGQNFRKMGKLVSLSEQNLVDCSRPE 174

Query: 195 GNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPG-TCSAAQKPAAAKISNYEEVPSGD 253
           GN GC GG  ++AF Y+  NQG+ +ED YPY       C    K  +   + + ++PSG 
Sbjct: 175 GNEGCGGGLMDQAFQYVKDNQGLDSEDSYPYLGTDDQPCHYDPKYNSVNDTGFVDIPSGK 234

Query: 254 EQALLKAV-SMQPVSIAIAAYSTEFQSYKEGI-FNGVCGT-QLDHAVTIVGF---GTTED 307
           E AL+KAV S+ PVS+AI A    FQ Y+ GI +   C + +LDH V  VG+   G   D
Sbjct: 235 EHALMKAVASVGPVSVAIDAGHESFQFYQSGIYYEKECSSEELDHGVLAVGYGFEGEDVD 294

Query: 308 GANYWLIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYPL 347
           G  YW++KNSW   WGD GY+ + +D +  CGI T +SYPL
Sbjct: 295 GKKYWIVKNSWSEKWGDKGYIYMAKDRKNHCGIATAASYPL 335


>gi|530734|emb|CAA56914.1| cathepsin l [Nephrops norvegicus]
 gi|1582620|prf||2119193A cathepsin L-related Cys protease
          Length = 324

 Score =  237 bits (604), Expect = 8e-60,   Method: Compositional matrix adjust.
 Identities = 133/311 (42%), Positives = 181/311 (58%), Gaps = 21/311 (6%)

Query: 48  EKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKE---GNRTYKLGTNQFSDLTNDEF 104
           E++  + GR Y D  E+  RL +F +NL+YIE+ NK+   G  TY L  NQFSDLTNDEF
Sbjct: 21  EEFKGKFGRKYVDLEEERYRLNVFLDNLQYIEEFNKKYESGEVTYNLAINQFSDLTNDEF 80

Query: 105 RALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVP---TSLDWRDKGAVTPIKNQKECGCCW 161
            ++  GYK    S R    + F     + TD     T +DWR KG VT +K+Q +CG CW
Sbjct: 81  NSMMKGYKT---SLRPKPVAVF-----TSTDAAPETTEVDWRTKGCVTHVKDQGQCGSCW 132

Query: 162 AFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN--GNNGCLGGSREKAFAYIIQNQGIAT 219
           AF+A  ++EG   ++ G L+ L+EQQL+DC+     N GC GG   +AF YI  N GI T
Sbjct: 133 AFSATGSLEGQHFLKYGELVSLAEQQLVDCAGGIYYNQGCNGGWVNQAFKYIKANGGIDT 192

Query: 220 EDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQA-LLKAVSMQPVSIAIAAYSTEFQ 278
           E  YPY+A   TC       AA  S +  +  G E   + +  +  P+S+AI A    FQ
Sbjct: 193 ESSYPYEARDNTCRFNSNSVAATCSGFVSIAQGSESPEVRRTTNTGPISVAIDAAHRSFQ 252

Query: 279 SYKEGIF--NGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRDE-G 335
           SY  G++       +QLDHAV  VG+G +E G ++WL+KNSWG +WG AGY+ + R+   
Sbjct: 253 SYSSGVYYEPSCSSSQLDHAVLAVGYG-SEGGQDFWLVKNSWGTSWGSAGYINMARNRNN 311

Query: 336 LCGIGTRSSYP 346
            CGI T +SYP
Sbjct: 312 NCGIATDASYP 322


>gi|298705581|emb|CBJ28832.1| Cathepsin L-like proteinase [Ectocarpus siliculosus]
          Length = 553

 Score =  236 bits (603), Expect = 9e-60,   Method: Compositional matrix adjust.
 Identities = 124/335 (37%), Positives = 191/335 (57%), Gaps = 24/335 (7%)

Query: 36  RSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGN-RTYKLGTN 94
           R+  +  V      WMAQHG ++  + E + RLKIF EN + I+  N   +  T+ L  N
Sbjct: 32  RTAEDAKVANRFRAWMAQHGVTFGTKGEFDRRLKIFAENSDLIDTHNTANDGSTFTLSHN 91

Query: 95  QFSDLTNDEFRALYTGYKM----PSPSHRSTTSSTF--------KYQNLSMTDVPTSLDW 142
           +FS L+ DEF+  + GYK     P P+ ++              +   L+ +++P  +DW
Sbjct: 92  EFSHLSWDEFKETHFGYKRSSDKPKPARQTPERRPMEKVAGGRRRLVELTGSEIPDEVDW 151

Query: 143 RDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGG 202
             +GAVTP++NQ  CG CWAF+ + A+EG   + + +LI+ SE+QL+DC    + GC GG
Sbjct: 152 VREGAVTPVQNQGMCGSCWAFSTIGAMEGAYYLATDDLIKFSEEQLVDCDKV-DKGCFGG 210

Query: 203 SREKAFAYIIQNQGIATEDEYPYQAV-PG--TCSAAQKPA-AAKISNYEEVPSGDEQALL 258
             E+AF +I +N G+  EDEYPY  + P   TC+    P   +++  + +V + DE  + 
Sbjct: 211 DMEQAFDWIKENGGVCPEDEYPYVGLWPPFKTCATTCTPVEGSQVKEWAQVKATDEALMT 270

Query: 259 KAVSMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSW 318
              ++ P++IAI A    FQ Y +G++   CG +LDH V  VG+GT EDG +YW +KNSW
Sbjct: 271 ALATVGPIAIAIEADQMAFQFYSDGVYTAPCGDKLDHGVLAVGYGTWEDGTDYWKVKNSW 330

Query: 319 GNTWGDAGYMKIVR-----DE-GLCGIGTRSSYPL 347
           G++WG  GY+ + R     DE G CG+   + YP+
Sbjct: 331 GDSWGQGGYILLERADSEEDEGGQCGLLIEAIYPI 365


>gi|354507493|ref|XP_003515790.1| PREDICTED: cathepsin L1-like [Cricetulus griseus]
 gi|344259154|gb|EGW15258.1| Cathepsin L1 [Cricetulus griseus]
          Length = 333

 Score =  236 bits (603), Expect = 9e-60,   Method: Compositional matrix adjust.
 Identities = 129/310 (41%), Positives = 186/310 (60%), Gaps = 19/310 (6%)

Query: 48  EKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKAN---KEGNRTYKLGTNQFSDLTNDEF 104
            KW + + R Y    E+E R  ++++N++ IE  N    EG   Y +  N F D+TN+EF
Sbjct: 30  HKWKSTYRRLYGTN-EEEWRRAVWEKNMKMIELHNGEYSEGKHGYTMEMNAFGDMTNEEF 88

Query: 105 RALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFA 164
           R L  GYK     HR        +Q   M  +P S+DWR+KG VTP+KNQ +CG CWAF+
Sbjct: 89  RQLVNGYK--HQKHRKGKV----FQEPLMLQLPKSVDWREKGCVTPVKNQGQCGSCWAFS 142

Query: 165 AVAAVEGITKIRSGNLIQLSEQQLLDCS-TNGNNGCLGGSREKAFAYIIQNQGIATEDEY 223
           A  A+EG   +++G L+ LSEQ L+DCS   GN GC GG  + AF Y++ N+G+ +E+ Y
Sbjct: 143 ACGALEGQMCLKTGVLVSLSEQNLVDCSQAEGNQGCNGGLMDFAFQYVLNNKGLDSEESY 202

Query: 224 PYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAV-SMQPVSIAIAAYSTEFQSYKE 282
           PY+A  GTC    + AAA  + Y ++P   E+AL+KAV ++ P++IAI A    FQ Y  
Sbjct: 203 PYEAKDGTCKYKPEFAAANDTGYVDIPQ-LEKALMKAVATVGPIAIAIDASHPSFQFYSS 261

Query: 283 GIF--NGVCGTQLDHAVTIVGF---GTTEDGANYWLIKNSWGNTWGDAGYMKIVRDE-GL 336
           GI+        +LDH V +VG+   GT  +   YW++KNSWG++WG  G+  I +D+   
Sbjct: 262 GIYYEPNCSSKELDHGVLVVGYGFEGTDSNKKKYWIVKNSWGSSWGMGGFFHIAKDKNNH 321

Query: 337 CGIGTRSSYP 346
           CG+ T +SYP
Sbjct: 322 CGVATAASYP 331


>gi|355753449|gb|EHH57495.1| Cathepsin L1 [Macaca fascicularis]
          Length = 333

 Score =  236 bits (603), Expect = 9e-60,   Method: Compositional matrix adjust.
 Identities = 131/341 (38%), Positives = 195/341 (57%), Gaps = 23/341 (6%)

Query: 17  PMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLE 76
           P FI+    +  AS  +    T   S+     KW A H R Y    E+  R  ++++N++
Sbjct: 3   PTFILAAFCLGIASATL----TFNHSLEAQWTKWKAMHNRLYGMN-EEGWRRAVWEKNMK 57

Query: 77  YIEKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSM 133
            IE  N+E   G  ++ +  N F D+T++EFR +  G++   P           +Q L  
Sbjct: 58  MIELHNQEYSQGKHSFTMAMNTFGDMTSEEFRQVMNGFQNRKPRKGKV------FQELLF 111

Query: 134 TDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS- 192
            + P S+DWR+KG VTP+KNQ +CG CWAF+A  A+EG    ++G L+ LSEQ L+DCS 
Sbjct: 112 YEAPRSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSW 171

Query: 193 TNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSG 252
             GN GC GG  + AF Y+  N G+ +E+ YPY+A   +C    + + A  + + ++P  
Sbjct: 172 PQGNEGCNGGLMDYAFQYVADNGGLDSEESYPYEATEESCKYNPEYSVANDTGFVDIPK- 230

Query: 253 DEQALLKAV-SMQPVSIAIAAYSTEFQSYKEGI-FNGVCGTQ-LDHAVTIVGFG---TTE 306
            E+AL+KAV ++ P+S+AI A    F  YKEGI F   C ++ +DH V +VG+G   T  
Sbjct: 231 QEKALMKAVATVGPISVAIDAGHESFMFYKEGIYFEPDCSSEDMDHGVLVVGYGFESTES 290

Query: 307 DGANYWLIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYP 346
           D + YWL+KNSWG  WG  GY+K+ +D    CGI + +SYP
Sbjct: 291 DNSKYWLVKNSWGEEWGMGGYIKMAKDRRNHCGIASAASYP 331


>gi|351705687|gb|EHB08606.1| Cathepsin S [Heterocephalus glaber]
          Length = 331

 Score =  236 bits (602), Expect = 1e-59,   Method: Compositional matrix adjust.
 Identities = 128/328 (39%), Positives = 195/328 (59%), Gaps = 15/328 (4%)

Query: 28  CASQVVSSRSTHEQSVVEIH-EKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKE-- 84
           C +  ++     +  +++ H   W   +G+ Y+++ E+++R  I+++NL+++   N E  
Sbjct: 8   CVTCSLAGAQLQQDPMLDYHWHLWKKTYGKHYQEKNEEQVRRLIWEKNLKFVMLHNLEHS 67

Query: 85  -GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWR 143
            G  +Y LG N   D+T++E R+L +  ++P    R+ T  +   Q L     P S+DWR
Sbjct: 68  MGMHSYDLGMNHLGDMTSEEVRSLMSSLRVPRQWLRNVTYKSDPNQKL-----PDSVDWR 122

Query: 144 DKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG--NNGCLG 201
           +KG VT +K Q  CG CWAF+AV A+EG  K+++G L+ LS Q L+DCST    N GC G
Sbjct: 123 EKGCVTEVKYQGACGSCWAFSAVGALEGQLKLKTGKLVSLSAQNLVDCSTEKYRNKGCSG 182

Query: 202 GSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAV 261
           G   +AF Y+I N GI +E  YPY+A    C    K  AA  S Y E+P G E+AL +AV
Sbjct: 183 GFMTEAFQYVIDNNGIDSETSYPYKATDEKCHYDSKNRAATCSRYTELPYGSEEALKEAV 242

Query: 262 SMQ-PVSIAIAAYSTEFQSYKEGIFNG-VCGTQLDHAVTIVGFGTTEDGANYWLIKNSWG 319
           + + PVS+A+ A    F  YK G+++   C   + H V  VG+G   +G +YWL+KNSWG
Sbjct: 243 ANKGPVSVAVDASRPSFFLYKNGVYDDPSCTQNVTHGVLAVGYGNL-NGKDYWLVKNSWG 301

Query: 320 NTWGDAGYMKIVRDEG-LCGIGTRSSYP 346
             +GD GY+++ R++G  CGI + SSYP
Sbjct: 302 LYFGDQGYIRMARNKGNHCGIASYSSYP 329


>gi|2746723|gb|AAB94925.1| cathepsin S precursor [Mus musculus]
          Length = 340

 Score =  236 bits (602), Expect = 1e-59,   Method: Compositional matrix adjust.
 Identities = 127/306 (41%), Positives = 185/306 (60%), Gaps = 15/306 (4%)

Query: 50  WMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKE---GNRTYKLGTNQFSDLTNDEFRA 106
           W   H + YKD+ E+E+R  I+++NL++I   N E   G  TY++G N   D+TN+E   
Sbjct: 39  WKKTHEKEYKDKNEEEVRRLIWEKNLKFIMIHNLEYSMGMHTYQVGMNDMGDMTNEEISC 98

Query: 107 LYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAV 166
                ++   S ++ T     +++ S   +P ++DWR+KG VT +K Q  CG CWAF+AV
Sbjct: 99  RMGALRISRQSPKTVT-----FRSYSNRTLPDTVDWREKGCVTEVKYQGSCGACWAFSAV 153

Query: 167 AAVEGITKIRSGNLIQLSEQQLLDCSTN---GNNGCLGGSREKAFAYIIQNQGIATEDEY 223
            A+EG  K+++G LI LS Q L+DCS     GN GC GG   +AF YII N GI  +  Y
Sbjct: 154 GALEGQLKLKTGKLISLSAQNLVDCSNEEKYGNKGCGGGYMTEAFQYIIDNGGIEADASY 213

Query: 224 PYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVSMQ-PVSIAIAAYSTEFQSYKE 282
           PY+A+   C    K  AA  S Y ++P GDE AL +AV+ + PVS+ I A  + F  YK 
Sbjct: 214 PYKAMDEKCHYNSKNRAATCSRYIQLPFGDEDALKEAVATKGPVSVGIDASHSSFFFYKS 273

Query: 283 GIFNG-VCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR-DEGLCGIG 340
           G+++   C   ++H V +VG+GT  DG +YWL+KNSWG  +GD GY+++ R ++  CGI 
Sbjct: 274 GVYDDPSCTGNVNHGVLVVGYGTL-DGKDYWLVKNSWGLNFGDQGYIRMARNNKNHCGIA 332

Query: 341 TRSSYP 346
           +  SYP
Sbjct: 333 SYCSYP 338


>gi|384941728|gb|AFI34469.1| cathepsin L2 preproprotein [Macaca mulatta]
          Length = 334

 Score =  236 bits (602), Expect = 1e-59,   Method: Compositional matrix adjust.
 Identities = 133/340 (39%), Positives = 196/340 (57%), Gaps = 19/340 (5%)

Query: 18  MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
           M + + L   C   + S+    +Q++     +W A H R Y    E+  R  ++++N++ 
Sbjct: 1   MNLSLVLAAFCLG-IASAVPKFDQNLDTKWYQWKATHRRLYGAS-EEGWRRAVWEKNMKM 58

Query: 78  IEKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMT 134
           IE  N E   G   + +  N F D+TN+EFR +   ++    + +      F+       
Sbjct: 59  IELHNGEYSQGKHGFAMAMNAFGDMTNEEFRQVMGCFR----NQKLRKGKLFR--EPLFL 112

Query: 135 DVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCST- 193
           D+P S+DWR KG VTP+KNQK+CG CWAF+A  A+EG    ++G L+ LSEQ L+DCS  
Sbjct: 113 DLPKSVDWRKKGYVTPVKNQKQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRP 172

Query: 194 NGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGD 253
            GN GC GG    AF Y+ +N G+ +E+ YPY A+ G C    + + A  + +E VP+G 
Sbjct: 173 QGNQGCNGGFMNSAFRYVKENGGLDSEESYPYVAMDGICKYRSENSVANDTGFEVVPAGK 232

Query: 254 EQALLKAV-SMQPVSIAIAAYSTEFQSYKEGI-FNGVCGTQ-LDHAVTIVGF---GTTED 307
           E+AL+KAV ++ P+S+A+ A  + FQ YK GI F   C ++ LDH V +VG+   G   D
Sbjct: 233 EKALMKAVATVGPISVAMDAGHSSFQFYKSGIYFEPDCSSKNLDHGVLVVGYGFEGANSD 292

Query: 308 GANYWLIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYP 346
              YWL+KNSWG  WG  GY+KI +D +  CGI T +SYP
Sbjct: 293 NNKYWLVKNSWGPEWGSNGYVKIAKDKDNHCGIATAASYP 332


>gi|355567966|gb|EHH24307.1| Cathepsin L2 [Macaca mulatta]
 gi|355753494|gb|EHH57540.1| Cathepsin L2 [Macaca fascicularis]
 gi|380790509|gb|AFE67130.1| cathepsin L2 preproprotein [Macaca mulatta]
          Length = 334

 Score =  236 bits (602), Expect = 1e-59,   Method: Compositional matrix adjust.
 Identities = 133/340 (39%), Positives = 196/340 (57%), Gaps = 19/340 (5%)

Query: 18  MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
           M + + L   C   + S+    +Q++     +W A H R Y    E+  R  ++++N++ 
Sbjct: 1   MNLSLVLAAFCLG-IASAVPKFDQNLDTKWYQWKATHRRLYGAS-EEGWRRAVWEKNMKM 58

Query: 78  IEKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMT 134
           IE  N E   G   + +  N F D+TN+EFR +   ++    + +      F+       
Sbjct: 59  IELHNGEYSQGKHGFAMAMNAFGDMTNEEFRQVMGCFR----NQKLRKGKLFR--EPLFL 112

Query: 135 DVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS-T 193
           D+P S+DWR KG VTP+KNQK+CG CWAF+A  A+EG    ++G L+ LSEQ L+DCS  
Sbjct: 113 DLPKSVDWRKKGYVTPVKNQKQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSHP 172

Query: 194 NGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGD 253
            GN GC GG    AF Y+ +N G+ +E+ YPY A+ G C    + + A  + +E VP+G 
Sbjct: 173 QGNQGCNGGFMNSAFRYVKENGGLDSEESYPYVAMDGICKYRPENSVANDTGFEVVPAGK 232

Query: 254 EQALLKAV-SMQPVSIAIAAYSTEFQSYKEGI-FNGVCGTQ-LDHAVTIVGF---GTTED 307
           E+AL+KAV ++ P+S+A+ A  + FQ YK GI F   C ++ LDH V +VG+   G   D
Sbjct: 233 EKALMKAVATVGPISVAMDAGHSSFQFYKSGIYFEPDCSSKNLDHGVLVVGYGFEGANSD 292

Query: 308 GANYWLIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYP 346
              YWL+KNSWG  WG  GY+KI +D +  CGI T +SYP
Sbjct: 293 NNKYWLVKNSWGPEWGSNGYVKIAKDKDNHCGIATAASYP 332


>gi|84660246|emb|CAI43320.1| cathepsin L [Lubomirskia baicalensis]
 gi|85677150|emb|CAI46307.1| cathepsin L [Lubomirskia baicalensis]
          Length = 327

 Score =  236 bits (602), Expect = 1e-59,   Method: Compositional matrix adjust.
 Identities = 134/307 (43%), Positives = 186/307 (60%), Gaps = 12/307 (3%)

Query: 48  EKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNR-TYKLGTNQFSDLTNDEFRA 106
           E W  +HG+ Y  + E+  R  I++ N +Y+++ N    +  + +G NQF+DL + EF  
Sbjct: 23  ESWKKEHGKVYNSDREELTRHIIWQANRKYVDEHNAHAEKFGFTVGMNQFADLESSEFGR 82

Query: 107 LYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAV 166
           LY GY    PS +   S  F   +  + D+PTS+DWR KG VT IKNQ +CG CWAF+AV
Sbjct: 83  LYNGYN-NKPSMKKAQSKVF---STKVGDLPTSVDWRTKGFVTAIKNQGQCGSCWAFSAV 138

Query: 167 AAVEGITKIRSGNLIQLSEQQLLDCST-NGNNGCLGGSREKAFAYIIQNQGIATEDEYPY 225
           A +EG     +G L+ LSEQ L+DCST  GN GC GG  + AF Y+I+N GI TE  YPY
Sbjct: 139 AGLEGQHFNATGTLVSLSEQNLVDCSTAEGNQGCNGGLMDNAFQYVIKNGGIDTEASYPY 198

Query: 226 QAVPGTCSAAQKPAAAKISNYEEV-PSGDEQALLKAVSMQ-PVSIAIAAYSTEFQSYKEG 283
           +AV   C        +  S + ++ P   E AL  AV++  P+S+AI A  T FQ YK G
Sbjct: 199 KAVDQKCKFNAANVGSTCSGFSDILPHKSEAALQVAVAVVGPISVAIDASHTSFQLYKSG 258

Query: 284 IFN-GVCG-TQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRDE-GLCGIG 340
           +++   C  T LDH VT VG+ ++  G  YW++KNSWG TWG AGY+ + R++   CGI 
Sbjct: 259 VYSESACSQTSLDHGVTAVGYDSSS-GVAYWIVKNSWGTTWGQAGYIWMSRNKNNQCGIA 317

Query: 341 TRSSYPL 347
           T +SYP+
Sbjct: 318 TAASYPI 324


>gi|395819351|ref|XP_003783057.1| PREDICTED: cathepsin L1-like [Otolemur garnettii]
          Length = 333

 Score =  236 bits (602), Expect = 1e-59,   Method: Compositional matrix adjust.
 Identities = 132/340 (38%), Positives = 202/340 (59%), Gaps = 20/340 (5%)

Query: 18  MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
           M +++ L   C   + S+ S  + S+     +W A+H + Y    E+  R  ++++N++ 
Sbjct: 1   MNLLLILAAFCVG-ITSATSMFDGSLNAHWYRWKAKHRKLYGMR-EEGWRRAVWEKNMKM 58

Query: 78  IEKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMT 134
           IE  N+E   G   + +  N F D+TN+EFR +  G++  +  H+        +Q  S  
Sbjct: 59  IEVHNQEYSQGKHGFTMAMNAFGDMTNEEFRQVMNGFR--NQKHKKGKV----FQEPSFL 112

Query: 135 DVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCST- 193
           +VP S+DWR+KG VTP+KNQ +CG CWAF+A  A+EG    ++G LI LSEQ L+DCS  
Sbjct: 113 EVPKSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLISLSEQNLVDCSRP 172

Query: 194 NGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGD 253
            GN GC GG  + AF YI +N G+ +E+ YPY A+  +C    + + A  + + ++P  +
Sbjct: 173 QGNEGCDGGLMDYAFQYIKENGGLDSEESYPYDAMDESCKYRPEYSVANDTGFVDIPK-E 231

Query: 254 EQALLKAV-SMQPVSIAIAAYSTEFQSYKEGI-FNGVCGTQ-LDHAVTIVGFG---TTED 307
           E+AL+KAV ++ P+S+AI A    FQ YKEG+ F   C +  +DH V +VG+G   T  D
Sbjct: 232 EKALMKAVATVGPISVAIDAGHESFQFYKEGVYFEPECSSDNVDHGVLVVGYGYEETESD 291

Query: 308 GANYWLIKNSWGNTWGDAGYMKIVRDE-GLCGIGTRSSYP 346
              +WL+KNSWG  WG  GY+K+ +D+   CGI T +SYP
Sbjct: 292 NNKFWLVKNSWGEEWGLGGYIKMTKDQKNHCGIATAASYP 331


>gi|334332714|ref|XP_001367224.2| PREDICTED: cathepsin L1-like [Monodelphis domestica]
          Length = 335

 Score =  236 bits (602), Expect = 1e-59,   Method: Compositional matrix adjust.
 Identities = 126/309 (40%), Positives = 192/309 (62%), Gaps = 13/309 (4%)

Query: 48  EKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKE---GNRTYKLGTNQFSDLTNDEF 104
            +W AQHG+SY    E   R   +++NL+ IE+ N+E   G  +++L  N+F D++ +EF
Sbjct: 30  HQWKAQHGKSYAAN-EDSWRRATWEKNLKMIERHNQEYSAGKHSFQLRMNKFGDMSTEEF 88

Query: 105 RALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFA 164
           + +  GYK      R+  S    Y+   +  +P S+DWR+KG VTP+K Q+ C  CWAF+
Sbjct: 89  KQVMNGYKSNGSQKRTKGS---LYRESLLAQLPESVDWREKGYVTPVKEQRGCYSCWAFS 145

Query: 165 AVAAVEGITKIRSGNLIQLSEQQLLDCST-NGNNGCLGGSREKAFAYIIQNQGIATEDEY 223
           A  A+EG    ++G L+ LS Q L+DCS   GNNGC GG    AF Y+  N GI TE+ Y
Sbjct: 146 AAGAIEGQWFRKTGKLVSLSVQNLVDCSIPEGNNGCDGGLMGNAFQYVQDNGGIDTEECY 205

Query: 224 PYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVS-MQPVSIAIAAYSTEFQSYKE 282
           PY A    C    + + A ++ + ++PS DE+AL+KAV+ + P+S+AI A +  F+ Y+ 
Sbjct: 206 PYVAQDNECKYQPECSGANVTGFVKIPSTDERALMKAVANVGPISVAIDAGNPSFKFYQS 265

Query: 283 GI-FNGVC-GTQLDHAVTIVGFGTT-EDGANYWLIKNSWGNTWGDAGYMKIVRDE-GLCG 338
           G+ ++  C  +QL+H V +VG+G+  ++G  YW++KNSWG  WGD GY+ + +DE   CG
Sbjct: 266 GVYYDPQCSSSQLNHGVLVVGYGSEGKNGRKYWIVKNSWGENWGDNGYVLMAKDEDNHCG 325

Query: 339 IGTRSSYPL 347
           I T +SYP+
Sbjct: 326 IITDASYPI 334


>gi|71897043|ref|NP_001026516.1| cathepsin S precursor [Gallus gallus]
 gi|53126701|emb|CAG30977.1| hypothetical protein RCJMB04_1f23 [Gallus gallus]
          Length = 328

 Score =  236 bits (602), Expect = 1e-59,   Method: Compositional matrix adjust.
 Identities = 130/332 (39%), Positives = 195/332 (58%), Gaps = 18/332 (5%)

Query: 25  LVSCASQVVSSRST--HEQSVVEIH-EKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKA 81
           L+ C + +V+  +   H    ++ H + W   HG+ Y+ + E+  R   +++NL  +   
Sbjct: 3   LLRCMAVLVTLVAVMGHPDPTLDQHWQLWKKAHGKEYRHQAEEGQRRATWEKNLRLVMLH 62

Query: 82  NKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPT 138
           N E   G  +Y+LG N   D+T+++  AL TG ++P   H  T  ST++ +       P 
Sbjct: 63  NLEHSLGLHSYQLGMNHMGDMTSEDVAALLTGLRVPY-GHNQT--STYRRRG----GAPD 115

Query: 139 SLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GNN 197
           ++DWR+KG VT +KNQ  CG CWAF+AV A+E   K+++G L+ LS Q L+DCS   GN 
Sbjct: 116 AMDWREKGCVTEVKNQGACGACWAFSAVGALEAQVKLKTGKLVSLSAQNLVDCSMMYGNK 175

Query: 198 GCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQAL 257
           GC GG   +AF YII N GI +E+ YPY A  GTC       AA  S Y E+P  DE AL
Sbjct: 176 GCGGGFMTRAFQYIIDNNGIDSEESYPYMAQNGTCQYNVSTRAATCSKYVELPYADEAAL 235

Query: 258 LKAVS-MQPVSIAIAAYSTEFQSYKEGIFNGV-CGTQLDHAVTIVGFGTTEDGANYWLIK 315
             AV+ + PVS+AI A    F  Y+ G+++   C  +++H V +VG+GT  +  ++WL+K
Sbjct: 236 KDAVANVGPVSVAIDATQPTFFLYRSGVYDDPRCTQEVNHGVLVVGYGTLNE-KDFWLVK 294

Query: 316 NSWGNTWGDAGYMKIVRDEG-LCGIGTRSSYP 346
           NSWG  +GD GY+++ R+    CGI + +SYP
Sbjct: 295 NSWGERFGDGGYIRMSRNHANHCGIASYASYP 326


>gi|402898110|ref|XP_003912074.1| PREDICTED: cathepsin L2 [Papio anubis]
          Length = 334

 Score =  236 bits (602), Expect = 1e-59,   Method: Compositional matrix adjust.
 Identities = 133/340 (39%), Positives = 196/340 (57%), Gaps = 19/340 (5%)

Query: 18  MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
           M + + L   C   + S+    +Q++     +W A H R Y    E+  R  ++++N++ 
Sbjct: 1   MNLSLVLAAFCLG-IASAVPKFDQNLDTKWYQWKATHRRLYGAS-EEGWRRAVWEKNMKM 58

Query: 78  IEKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMT 134
           IE  N E   G   + +  N F D+TN+EFR +   ++    + +      F+       
Sbjct: 59  IELHNGEYSQGKHGFTMAMNAFGDMTNEEFRQVMGCFR----NQKLRKGKLFR--EPLFL 112

Query: 135 DVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS-T 193
           D+P S+DWR KG VTP+KNQK+CG CWAF+A  A+EG    ++G L+ LSEQ L+DCS  
Sbjct: 113 DLPKSVDWRKKGYVTPVKNQKQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRP 172

Query: 194 NGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGD 253
            GN GC GG    AF Y+ +N G+ +E+ YPY A+ G C    + + A  + +E VP+G 
Sbjct: 173 QGNQGCNGGFMNSAFRYVKENGGLDSEESYPYVAMDGICKYRPENSVANDTGFEVVPAGK 232

Query: 254 EQALLKAV-SMQPVSIAIAAYSTEFQSYKEGI-FNGVCGTQ-LDHAVTIVGF---GTTED 307
           E+AL+KAV ++ P+S+A+ A  + FQ YK GI F   C ++ LDH V +VG+   G   D
Sbjct: 233 EKALMKAVATVGPISVAMDAGHSSFQFYKSGIYFEPDCSSKNLDHGVLVVGYGFEGANSD 292

Query: 308 GANYWLIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYP 346
              YWL+KNSWG  WG  GY+KI +D +  CGI T +SYP
Sbjct: 293 NNKYWLVKNSWGPEWGSNGYVKIAKDKDNHCGIATAASYP 332


>gi|194741252|ref|XP_001953103.1| GF17600 [Drosophila ananassae]
 gi|190626162|gb|EDV41686.1| GF17600 [Drosophila ananassae]
          Length = 333

 Score =  236 bits (602), Expect = 1e-59,   Method: Compositional matrix adjust.
 Identities = 136/340 (40%), Positives = 203/340 (59%), Gaps = 22/340 (6%)

Query: 19  FIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMA---QHGRSYKDELEKEMRLKIFKENL 75
            I+  LL++ A  V      + Q ++E  E+WMA   ++ + Y+DE E+++R KIF  N 
Sbjct: 6   LILFMLLLAIAHAV-----PYAQDILE--EEWMAFKLEYNKVYQDETEEQLRFKIFNYNK 58

Query: 76  EYIEKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLS 132
             I + N +   G  ++ L  N+F+DL + EF+ L  G KM SPS  +  SSTF    ++
Sbjct: 59  LLIARHNLKWAAGKVSFNLAVNKFADLLDHEFQDLMLG-KM-SPSGSNFGSSTF-LPPVN 115

Query: 133 MTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS 192
           +T +P ++DWR  G VTP+K+Q  CG CWAF+   ++EG    ++G LI LSEQ L+DCS
Sbjct: 116 LT-LPDAVDWRKYGFVTPVKDQGSCGSCWAFSTTGSLEGQHFRKTGQLISLSEQNLIDCS 174

Query: 193 TNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSG 252
             GNNGC  G+ E AF YI  N+GI TE  YPY+A    C   +    A  + + ++  G
Sbjct: 175 P-GNNGCKNGAVEYAFRYIQSNKGIDTEISYPYEAAQNQCRFRRDTIGATSTGFVKLNPG 233

Query: 253 DEQALLKAV-SMQPVSIAIAAYSTEFQSYKEGIFNG-VCG-TQLDHAVTIVGFGTTEDGA 309
           DE  L +AV ++ P+S+ I +    F+ Y +G++N   C   +L HAV +VG+GT + G 
Sbjct: 234 DEMELAQAVATVGPISVLINSSLDSFKFYHDGVYNDPSCNPNKLTHAVLVVGYGTDDRGG 293

Query: 310 NYWLIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYPLA 348
           ++WL+KNSW   WG+ GY+KI R+   LCGI + + YPL 
Sbjct: 294 DFWLVKNSWSTHWGEQGYVKIKRNANNLCGIASNALYPLV 333


>gi|229893789|gb|ACQ90252.1| cathepsin L [Pinctada fucata]
          Length = 362

 Score =  236 bits (602), Expect = 1e-59,   Method: Compositional matrix adjust.
 Identities = 134/315 (42%), Positives = 189/315 (60%), Gaps = 22/315 (6%)

Query: 47  HEKWM---AQHGRSYKDELEKEM-RLKIFKENLEYIEKANKE---GNRTYKLGTNQFSDL 99
           HE W       G+ Y D +E+E+ R  IF++ LE IE+ N++   G ++Y +G NQFSD+
Sbjct: 51  HETWKEFKTLFGKVY-DTVEEEIKRFDIFRDTLERIEEHNRKYHMGQKSYYMGVNQFSDM 109

Query: 100 TNDEF---RALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKE 156
           ++DE+     L  G +  S      + +       S   +   +DWRDKG VTP+KNQ +
Sbjct: 110 SHDEYLRHNGLRRGNRKYSKGEGCDSYTK------SGKQLDDKVDWRDKGYVTPVKNQGQ 163

Query: 157 CGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS-TNGNNGCLGGSREKAFAYIIQNQ 215
           CG CW+F+   ++EG    ++G LI LSEQQL+DCS T GN GC GG  + AF YI    
Sbjct: 164 CGSCWSFSTTGSLEGQHFRQTGKLISLSEQQLVDCSGTFGNEGCNGGLMDNAFEYIKSIG 223

Query: 216 GIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAV-SMQPVSIAIAAYS 274
           G+  ED+YPY A  G C   +    A  +   +V SGDE AL  A+ S+ P+S+AI A  
Sbjct: 224 GLEGEDDYPYTAKQGKCHLKKSLFKANDTGCTDVESGDEDALKDALASVGPISVAIDASH 283

Query: 275 TEFQSYKEGIFN-GVCGTQ-LDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR 332
             FQSY  G+++   C +Q LDH V  VG+GT E+G +YWL+KNSWG  WG+ GY+K+ R
Sbjct: 284 ASFQSYDGGVYDEEECSSQNLDHGVLTVGYGTEENGGDYWLVKNSWGEMWGEEGYIKMSR 343

Query: 333 D-EGLCGIGTRSSYP 346
           + +  CGI T++SYP
Sbjct: 344 NKDNQCGIATQASYP 358


>gi|530736|emb|CAA56915.1| cathepsin l [Nephrops norvegicus]
 gi|1582621|prf||2119193B cathepsin L-related Cys protease
          Length = 313

 Score =  236 bits (601), Expect = 1e-59,   Method: Compositional matrix adjust.
 Identities = 132/307 (42%), Positives = 184/307 (59%), Gaps = 16/307 (5%)

Query: 48  EKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKE---GNRTYKLGTNQFSDLTNDEF 104
           E +  Q+GR Y D  E+  R ++F++N + +E  NK+   G  T+K+  NQF D+TN+EF
Sbjct: 13  EHFKTQYGRKYGDAKEELYRQRVFQQNEQLVEAFNKKFENGEVTFKVAMNQFGDMTNEEF 72

Query: 105 RALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFA 164
            A+  GYK  S   R   ++ F  +   M      +DWR KGAVTP+K+Q +CG CWAF+
Sbjct: 73  NAVMKGYKKGS---RGEPTTVFTAEGRPMA---ADVDWRTKGAVTPVKDQGQCGSCWAFS 126

Query: 165 AVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GNNGCLGGSREKAFAYIIQNQGIATEDEY 223
           A  ++EG   +++  L+ LSEQ+L+DCST  GN+GC GG    AF YI  N GI TE  Y
Sbjct: 127 ATGSLEGQHFLKNNELVSLSEQELVDCSTEYGNDGCGGGWMTSAFDYIKDNGGIDTESSY 186

Query: 224 PYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVS-MQPVSIAIAAYSTEFQSYKE 282
           PY+A   +C        A  + + EV    E+AL +AVS + P+S+AI A    FQ Y  
Sbjct: 187 PYEAQDRSCRFDANSIGATCTGFVEVQH-TEEALHEAVSDIGPISVAIDASHFSFQFYSS 245

Query: 283 GIF--NGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD-EGLCGI 339
           G++       T LDH V  VG+G TE   +YWL+KNSWG+ WGDAGY+K+ R+ +  CGI
Sbjct: 246 GVYYEKKCSPTNLDHGVLAVGYG-TESTEDYWLVKNSWGSGWGDAGYIKMSRNRDNNCGI 304

Query: 340 GTRSSYP 346
            +  SYP
Sbjct: 305 ASEPSYP 311


>gi|32396018|gb|AAP41846.1| cysteine protease [Anthurium andraeanum]
          Length = 502

 Score =  236 bits (601), Expect = 2e-59,   Method: Compositional matrix adjust.
 Identities = 131/317 (41%), Positives = 181/317 (57%), Gaps = 18/317 (5%)

Query: 45  EIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRT----YKLGTNQFSDLT 100
           E+ E+WM +H + Y    EK  R   F  NL ++ K N EG R       +G N F+DL+
Sbjct: 49  ELFERWMEKHRKVYAHPGEKARRYANFLSNLAFVRKRNAEGRRAPSSGQGVGMNVFADLS 108

Query: 101 NDEFRALYTG--YKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECG 158
           N+EFR +Y+    +  +   R       + + ++  D P SLDWR +GAVT +KNQ +CG
Sbjct: 109 NEEFREVYSSRVLRKKAAEGRGARRRAGEGRVVAGCDAPASLDWRKRGAVTAVKNQGDCG 168

Query: 159 CCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIA 218
            CWAF++  A+EGI  I +G LI LSEQ+L+DC T  N GC GG  + AF ++I N GI 
Sbjct: 169 SCWAFSSTGAMEGINAITTGELISLSEQELVDCDTT-NEGCDGGYMDYAFEWVINNGGID 227

Query: 219 TEDEYPY--QAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVSMQPVSIAIAAYSTE 276
           +E  YPY  QA     +  ++     I  YE+V +  E ALL A   QPVS+ I   S +
Sbjct: 228 SEANYPYTGQADSVCNTTKEEIKVVSIDGYEDVAT-SESALLCAAVQQPVSVGIDGSSLD 286

Query: 277 FQSYKEGIFNGVCG---TQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD 333
           FQ Y  GI++G C      +DHAV +VG+G  + G +YW++KNSWG  WG  GY+ I R+
Sbjct: 287 FQLYAGGIYDGDCSGNPDDIDHAVLVVGYG-QQGGTDYWIVKNSWGTDWGMQGYIYIRRN 345

Query: 334 EGL----CGIGTRSSYP 346
            GL    C I   +SYP
Sbjct: 346 TGLPYGVCAIDAMASYP 362


>gi|2961621|gb|AAC05781.1| cathepsin S [Mus musculus]
          Length = 340

 Score =  236 bits (601), Expect = 2e-59,   Method: Compositional matrix adjust.
 Identities = 127/306 (41%), Positives = 184/306 (60%), Gaps = 15/306 (4%)

Query: 50  WMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKE---GNRTYKLGTNQFSDLTNDEFRA 106
           W   H + YKD+ E+E+R  I+++NL++I   N E   G  TY++G N   D+TN+E   
Sbjct: 39  WKKTHEKEYKDKNEEEVRRLIWEKNLKFIMIHNLEYSMGMHTYQVGMNDMGDMTNEEISC 98

Query: 107 LYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAV 166
                ++   S ++ T     +++ S   +P ++DWR+KG VT +K Q  CG CWAF+AV
Sbjct: 99  RMGALRISRQSPKTVT-----FRSYSNRTLPDTVDWREKGCVTEVKYQGSCGACWAFSAV 153

Query: 167 AAVEGITKIRSGNLIQLSEQQLLDCSTN---GNNGCLGGSREKAFAYIIQNQGIATEDEY 223
            A+EG  K+++G LI LS Q L+DCS     GN GC GG   +AF YII N GI  +  Y
Sbjct: 154 GALEGQLKLKTGKLISLSAQNLVDCSNEEKYGNKGCGGGYMTEAFQYIIDNGGIEADASY 213

Query: 224 PYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAVSMQ-PVSIAIAAYSTEFQSYKE 282
           PY+A    C    K  AA  S Y ++P GDE AL +AV+ + PVS+ I A  + F  YK 
Sbjct: 214 PYKATDEKCHYNSKNRAATCSRYIQLPFGDEDALKEAVATKGPVSVGIDASHSSFFFYKS 273

Query: 283 GIFNG-VCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR-DEGLCGIG 340
           G+++   C   ++H V +VG+GT  DG +YWL+KNSWG  +GD GY+++ R ++  CGI 
Sbjct: 274 GVYDDPSCTGNVNHGVLVVGYGTL-DGKDYWLVKNSWGLNFGDQGYIRMARNNKNHCGIA 332

Query: 341 TRSSYP 346
           +  SYP
Sbjct: 333 SYCSYP 338


>gi|405966499|gb|EKC31777.1| Cathepsin L [Crassostrea gigas]
          Length = 331

 Score =  236 bits (601), Expect = 2e-59,   Method: Compositional matrix adjust.
 Identities = 126/301 (41%), Positives = 184/301 (61%), Gaps = 14/301 (4%)

Query: 54  HGRSYKDELEKEMRLKIFKENLEYIEKAN---KEGNRTYKLGTNQFSDLTNDEFRALYTG 110
           H ++Y  + E++MR  I+++N+ YI+K N     G  TY LG N+++D+T  EFRA+  G
Sbjct: 35  HKKTYSQD-EEQMRRLIWEDNVNYIQKHNLAADRGEHTYWLGQNEYADMTIFEFRAIMNG 93

Query: 111 YKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVE 170
           YKM +    + T         ++ D+P S+DWR +G VT IKNQ  CG CW+F+A  ++E
Sbjct: 94  YKMSA----NRTKGDLYMSPSNIGDLPDSVDWRKEGYVTDIKNQGHCGSCWSFSATGSLE 149

Query: 171 GITKIRSGNLIQLSEQQLLDCSTN-GNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVP 229
           G     S  L+ LSEQ L+DCS   GN+GC GG  + AF YI  N+GI TE+ YPY A  
Sbjct: 150 GQHFKASKKLVSLSEQNLVDCSKKEGNHGCQGGLMDNAFRYIESNKGIDTEESYPYTAKN 209

Query: 230 GTCSAAQKPAAAKISNYEEVPSGDEQALLKAV-SMQPVSIAIAAYSTEFQSYKEGIFN-- 286
           G C    +   A  + Y ++P   E  L +AV ++ P+S+ I A    FQ Y+EG+++  
Sbjct: 210 GFCHFKAENVGATDTGYVDIPHMQEDKLQEAVATVGPISVGIDAGHKSFQLYREGVYSEP 269

Query: 287 GVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRDE-GLCGIGTRSSY 345
               ++LDH V  VG+G TE G +YWL+KNSWG +WG  GY+ + R++  +CGI T++SY
Sbjct: 270 ACSSSKLDHGVLAVGYG-TESGDDYWLVKNSWGTSWGMQGYVMMARNKHNMCGIATQASY 328

Query: 346 P 346
           P
Sbjct: 329 P 329


>gi|226821421|gb|ACO82386.1| cathepsin L-like protein [Lutjanus argentimaculatus]
          Length = 301

 Score =  235 bits (600), Expect = 2e-59,   Method: Compositional matrix adjust.
 Identities = 128/299 (42%), Positives = 178/299 (59%), Gaps = 16/299 (5%)

Query: 61  ELEKEMRLKIFKENLEYIEKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPS 117
           E E+  R  ++++NL+ IE  N E   G  +Y+LG N F D+T++EFR +  GYK     
Sbjct: 6   EKEEGWRRMVWEKNLKKIEMHNLEHSMGTHSYRLGMNHFGDMTHEEFRQIMNGYK--RKP 63

Query: 118 HRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRS 177
            R  T S F   N    + P ++DWRD G VTP+K+Q +CG CWAF+   A+EG    ++
Sbjct: 64  QRKFTGSLFMEPNF--LEAPRAVDWRDNGYVTPVKDQGQCGSCWAFSTTGALEGQHFRKT 121

Query: 178 GNLIQLSEQQLLDCS-TNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPG-TCSAA 235
           G L+ LSEQ L+DCS   GN GC GG  ++AF YI  NQG+ +ED YPY       C   
Sbjct: 122 GKLVSLSEQNLVDCSRPEGNEGCNGGLMDQAFQYIKDNQGLDSEDSYPYLGTDDQPCHYD 181

Query: 236 QKPAAAKISNYEEVPSGDEQALLKAV-SMQPVSIAIAAYSTEFQSYKEGIF--NGVCGTQ 292
            K  +A  + + ++PSG E+AL+KAV ++ PVS+AI A    FQ Y+ GI+        +
Sbjct: 182 PKYNSANDTGFVDIPSGKERALMKAVAAVGPVSVAIDAGHESFQFYQSGIYYEKDCSSEE 241

Query: 293 LDHAVTIVGF---GTTEDGANYWLIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYPL 347
           LDH V +VG+   G   DG  YW++KNSW   WGD GY+ + +D +  CGI T +SYPL
Sbjct: 242 LDHGVLVVGYGFEGEDVDGKKYWIVKNSWSEKWGDKGYIYMAKDRKNHCGIATAASYPL 300


>gi|242020003|ref|XP_002430447.1| Cathepsin L precursor, putative [Pediculus humanus corporis]
 gi|212515585|gb|EEB17709.1| Cathepsin L precursor, putative [Pediculus humanus corporis]
          Length = 345

 Score =  235 bits (600), Expect = 2e-59,   Method: Compositional matrix adjust.
 Identities = 130/321 (40%), Positives = 192/321 (59%), Gaps = 17/321 (5%)

Query: 43  VVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKAN---KEGNRTYKLGTNQFSDL 99
           V+E  + + A+H ++Y +++E++ R+KIF +N + I K N   + G   YKLG N++SD+
Sbjct: 23  VMEEWQLFKAEHKKNYNNDVEEKFRMKIFMDNKQKITKHNTKYQRGEVGYKLGLNKYSDM 82

Query: 100 TNDEFRALYTGYK--MPSPSHRSTTSSTFKYQNLSM----TDVPTSLDWRDKGAVTPIKN 153
            + EF   + G+   +  P  RS    T    +  +      +P  +DW   GAVTP+K+
Sbjct: 83  LHHEFINTFNGFNKSIIPPHLRSNNGKTHLKGSFFIPPANVKLPKHVDWVKLGAVTPVKD 142

Query: 154 QKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCST-NGNNGCLGGSREKAFAYII 212
           Q  CG CWAF+A  A+EG+   ++  L+ LSEQ L+DCST  GNNGC GG  ++AF Y+ 
Sbjct: 143 QGHCGSCWAFSATGALEGLHFRKTKVLVSLSEQNLIDCSTEEGNNGCNGGLMDQAFQYVR 202

Query: 213 QNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAV-SMQPVSIAIA 271
            N GI TE  YPY+     C    + + A  + Y +VP GDE AL  AV ++ PVS+AI 
Sbjct: 203 INGGIDTERSYPYEGNNDVCRYEPENSGAIDTGYTDVPLGDEDALKSAVATVGPVSVAID 262

Query: 272 AYSTEFQSYKEGI-FNGVCGTQ---LDHAVTIVGFGTTEDG-ANYWLIKNSWGNTWGDAG 326
           A    FQ Y  G+ F   C  +   LDH V +VG+GT E+   +YWL+KNSWG++WG+ G
Sbjct: 263 ASQESFQLYSSGVYFEPNCKNEPESLDHGVLVVGYGTDEETQQDYWLVKNSWGDSWGENG 322

Query: 327 YMKIVRD-EGLCGIGTRSSYP 346
           Y+K+ R+ +  CGI T+ S+P
Sbjct: 323 YIKMARNADNQCGIATQPSFP 343


>gi|94421564|gb|ABF18889.1| cathepsin-L [Lygus lineolaris]
          Length = 314

 Score =  235 bits (600), Expect = 2e-59,   Method: Compositional matrix adjust.
 Identities = 122/294 (41%), Positives = 181/294 (61%), Gaps = 14/294 (4%)

Query: 48  EKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKAN---KEGNRTYKLGTNQFSDLTNDEF 104
           E + A++G++Y+    +  R  I+    E + + N   ++G  +YKLG N F+D+ N EF
Sbjct: 28  ESYKAKYGKTYESNENEAARRTIYFMAKEKVMEHNARFEQGLVSYKLGLNSFADMHNGEF 87

Query: 105 RALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFA 164
           R +  GY+  +P +     S   +   ++T +P S+DWR KGAVTPIKNQ +CG CWAF+
Sbjct: 88  RKMMNGYRRGTPRN-----SVVVHVESNIT-LPASVDWRTKGAVTPIKNQGQCGSCWAFS 141

Query: 165 AVAAVEGITKIRSGNLIQLSEQQLLDCS-TNGNNGCLGGSREKAFAYIIQNQGIATEDEY 223
              ++EG   ++ G L+ LSEQ+L+DCS   GN+GC GG  + AF YI +N GI TE  Y
Sbjct: 142 TTGSLEGQHALKKGKLVSLSEQELVDCSAAEGNDGCDGGLMDDAFTYIKKNNGIDTEQSY 201

Query: 224 PYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKA-VSMQPVSIAIAAYSTEFQSYKE 282
           PY    GTCS  +   AA ++ + +V SG E  L  A  ++ P+S+AI A S +FQ Y+ 
Sbjct: 202 PYTGEDGTCSFKKSDVAATVTGFVDVTSGSESGLQDASATIGPISVAIDASSWDFQLYES 261

Query: 283 GIFN--GVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRDE 334
           G+++      T+LDH V +VG+G T+DG  YWL+KNSWG  WG  GY+++ R +
Sbjct: 262 GVYDVSDCSTTELDHGVLVVGYG-TDDGTAYWLVKNSWGTDWGHHGYIQMSRKQ 314


>gi|414591039|tpg|DAA41610.1| TPA: hypothetical protein ZEAMMB73_356414 [Zea mays]
          Length = 376

 Score =  235 bits (600), Expect = 2e-59,   Method: Compositional matrix adjust.
 Identities = 136/330 (41%), Positives = 189/330 (57%), Gaps = 28/330 (8%)

Query: 40  EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDL 99
           E+S+  ++E+W + H  S +D  EK+ R + FK N  +I + NK  +  YKLG N+F+DL
Sbjct: 38  EESMWSLYERWRSVHTVS-RDLREKQSRFEAFKANARHIGEFNKRKDVPYKLGLNKFADL 96

Query: 100 TNDEFRALYTGYKMPSPSHRSTTSSTFKYQN---------LSMTDVPTSLDWRDKGAVTP 150
           T +EF + YTG K+      +  +S  +  +          S+ D P + DWRD GAVT 
Sbjct: 97  TQEEFVSKYTGAKVVDSEAAARLASGVRVSSSDESPPQLAASVGDAPDAWDWRDHGAVTA 156

Query: 151 IKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAY 210
           +K+Q +CG CWAF+AV AVE +  I +GNL+ LSEQQ+LDCS  G+    GG    A  Y
Sbjct: 157 VKDQGQCGSCWAFSAVGAVESVNAIVTGNLLTLSEQQMLDCSGAGDC-TYGGYTYYAMLY 215

Query: 211 IIQNQGIATED--EYPY-------QAVPGTCSAAQKPAAAKISNYEEVPSGDEQALLKAV 261
            I N G+  +   + PY       Q +P     A+KP   KI +   + + DE AL +AV
Sbjct: 216 AISN-GLTLDQCGKTPYYQRYDAQQHLPCRFD-AKKPPVVKIDSMYVMNNADEAALKRAV 273

Query: 262 SMQPVSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNT 321
             QPVS+ I A    +  Y EG+F G CGT L+HAV +VG+G T DG  YW++KNSWG  
Sbjct: 274 YKQPVSVLIDAGGIGY--YSEGVFTGPCGTSLNHAVLLVGYGATADGTKYWIVKNSWGAD 331

Query: 322 WGDAGYMKIVRD----EGLCGIGTRSSYPL 347
           WG+ GY ++ RD     GLCGI     YP+
Sbjct: 332 WGEKGYFRLKRDVGTQGGLCGITMYPIYPI 361


>gi|332260024|ref|XP_003279085.1| PREDICTED: cathepsin L1 isoform 3 [Nomascus leucogenys]
 gi|441593306|ref|XP_004087072.1| PREDICTED: cathepsin L1 [Nomascus leucogenys]
 gi|441593309|ref|XP_004087073.1| PREDICTED: cathepsin L1 [Nomascus leucogenys]
          Length = 333

 Score =  235 bits (600), Expect = 2e-59,   Method: Compositional matrix adjust.
 Identities = 130/342 (38%), Positives = 195/342 (57%), Gaps = 23/342 (6%)

Query: 16  TPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENL 75
            P  I+    +  AS  +    T + S+     KW A H R Y    E+  R  ++++N+
Sbjct: 2   NPTLILAAFCLGIASATL----TFDHSLEAQWTKWKAMHNRLYGMN-EEGWRRAVWEKNM 56

Query: 76  EYIEKAN---KEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLS 132
           + IE+ N   +EG  ++ +  N F D+T++EFR +  G++   P           +Q   
Sbjct: 57  KMIEQHNQEYREGKHSFTMAMNAFGDMTSEEFRQVMNGFQNRKPRKGKV------FQEPL 110

Query: 133 MTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS 192
             + P S+DWR+KG VTP+KNQ +CG CWAF+A  A+EG    ++G L+ LSEQ L+DCS
Sbjct: 111 FYEAPRSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCS 170

Query: 193 -TNGNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPS 251
              GN GC GG  + AF Y+  N G+ +E+ YPY+A   +C    K + A  + + ++P 
Sbjct: 171 GPQGNEGCNGGLMDYAFQYVQDNGGLDSEESYPYEATEESCKYNPKYSVANDTGFVDIPK 230

Query: 252 GDEQALLKAV-SMQPVSIAIAAYSTEFQSYKEGI-FNGVCGTQ-LDHAVTIVGFG---TT 305
             E+AL+KAV ++ P+S+A+ A    FQ YKEGI F   C ++ +DH V +VG+G   T 
Sbjct: 231 -QEKALMKAVATVGPISVAVDAGHQSFQFYKEGIYFEPDCSSEDMDHGVLVVGYGFESTE 289

Query: 306 EDGANYWLIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYP 346
            D   YWL+KNSWG  WG  GY+K+ +D    CGI + +SYP
Sbjct: 290 SDNNKYWLVKNSWGEEWGMGGYIKMAKDRRNHCGIASAASYP 331


>gi|61368403|gb|AAX43172.1| cathepsin S [synthetic construct]
          Length = 332

 Score =  235 bits (600), Expect = 2e-59,   Method: Compositional matrix adjust.
 Identities = 128/338 (37%), Positives = 199/338 (58%), Gaps = 18/338 (5%)

Query: 18  MFIIITLLVSCASQVVSSRSTHEQSVVEIH-EKWMAQHGRSYKDELEKEMRLKIFKENLE 76
           M  ++ +L+ C+S V      H+   ++ H   W   +G+ YK++ E+ +R  I+++NL+
Sbjct: 1   MKRLVCVLLVCSSAVAQ---LHKDPTLDHHWHLWKKTYGKQYKEKNEEAVRRLIWEKNLK 57

Query: 77  YIEKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSM 133
           ++   N E   G  +Y LG N   D+T++E  +L +  ++PS   R+ T     Y++   
Sbjct: 58  FVMLHNLEHSMGMHSYDLGMNHLGDMTSEEVMSLMSSLRVPSQWQRNIT-----YKSNPN 112

Query: 134 TDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCST 193
             +P S+DWR+KG VT +K Q  CG CWAF+AV A+E   K+++G L+ LS Q L+DCST
Sbjct: 113 RILPDSVDWREKGCVTEVKYQGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCST 172

Query: 194 N--GNNGCLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPS 251
              GN GC GG    AF YII N+GI ++  YPY+A+   C    K  AA  S Y E+P 
Sbjct: 173 EKYGNKGCNGGFMTTAFQYIIDNKGIDSDASYPYKAMDQKCQYDSKYRAATCSKYTELPY 232

Query: 252 GDEQALLKAVSMQ-PVSIAIAAYSTEFQSYKEGI-FNGVCGTQLDHAVTIVGFGTTEDGA 309
           G E  L +AV+ + PVS+ + A    F  Y+ G+ +   C   ++H V +VG+G   +G 
Sbjct: 233 GREDVLKEAVANKGPVSVGVDARHPSFFLYRSGVYYEPSCTQNVNHGVLVVGYGDL-NGK 291

Query: 310 NYWLIKNSWGNTWGDAGYMKIVRDEG-LCGIGTRSSYP 346
            YWL+KNSWG+ +G+ GY+++ R++G  CGI +  SYP
Sbjct: 292 EYWLVKNSWGHNFGEEGYIRMARNKGNHCGIASFPSYP 329


>gi|167526493|ref|XP_001747580.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163774026|gb|EDQ87660.1| predicted protein [Monosiga brevicollis MX1]
          Length = 330

 Score =  235 bits (600), Expect = 2e-59,   Method: Compositional matrix adjust.
 Identities = 133/324 (41%), Positives = 186/324 (57%), Gaps = 24/324 (7%)

Query: 30  SQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEK-EMRLKIFKENLEYIEKANKEGNRT 88
           SQ +  R+ H   V++    +   HG  Y  +L   E   +    NL  IE A+  GN +
Sbjct: 11  SQFLPRRNLH--LVLKGPTAFRRIHGVFYSSQLGLCEPAFRCHLANLRVIE-AHNAGNSS 67

Query: 89  YKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTS-LDWRDKGA 147
           + +G  QF+DLT  EF A    + M         + T     + +T+ P   +DWR K A
Sbjct: 68  FTMGITQFADLTAAEFSAYVKRFPM---------NVTRPRNEVWITEAPLQEVDWRQKNA 118

Query: 148 VTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GNNGCLGGSREK 206
           VT IKNQ +CG CW+F+   +VEG   I +G L+ LSEQQL+DCST  GN+GC GG  + 
Sbjct: 119 VTEIKNQGQCGSCWSFSTTGSVEGAHAIATGKLVSLSEQQLMDCSTRYGNHGCNGGLMDY 178

Query: 207 AFAYIIQNQGIATEDEYPYQAVPGTCSA-AQKPAAAKISNYEEVPSGDEQALLKAVSMQP 265
           AF Y+I N G+ TE++YPY A  G C+   +K  AA+I  +  VP   E  L  AVS+ P
Sbjct: 179 AFEYVIANGGLDTEEDYPYTAEDGKCNTEKEKKHAAEIHGFRNVPKEHEDQLAAAVSIGP 238

Query: 266 VSIAIAAYSTEFQSYKEGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDA 325
           VS+AI A    FQ Y  G+F+G CGT LDH V +VG+       +YW++KNSWG +WG+ 
Sbjct: 239 VSVAIEADQAGFQHYTSGVFDGKCGTSLDHGVLVVGYSD-----DYWIVKNSWGKSWGEE 293

Query: 326 GYMKIVR---DEGLCGIGTRSSYP 346
           GY+++ R    +G+CGI  ++SYP
Sbjct: 294 GYIRLKRGVDKKGMCGITMQASYP 317


>gi|426216524|ref|XP_004002512.1| PREDICTED: cathepsin S isoform 1 [Ovis aries]
          Length = 331

 Score =  235 bits (600), Expect = 2e-59,   Method: Compositional matrix adjust.
 Identities = 133/331 (40%), Positives = 196/331 (59%), Gaps = 18/331 (5%)

Query: 25  LVSCASQVVSSRSTHEQSVVEIH-EKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANK 83
           L+ C+S +      H    ++ H + W   +G+ Y+++ E+  R  I+++NL+ +   N 
Sbjct: 8   LLLCSSAMAQ---VHRDPTLDHHWDLWKKTYGKQYEEKNEEVARRLIWEKNLKTVMLHNL 64

Query: 84  E---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSL 140
           E   G  +Y+LG N   D+T++E  +  +  ++PS   R+ T  +   Q L     P SL
Sbjct: 65  EHSMGMHSYELGMNHLGDMTSEEVISSMSSLRVPSQWPRNVTYKSSPNQKL-----PDSL 119

Query: 141 DWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCST--NGNNG 198
           DWR+KG VT +K Q  CG CWAF+AV A+E   K+++G L+ LS Q L+DCST   GN G
Sbjct: 120 DWREKGCVTEVKYQGACGSCWAFSAVGALEAQVKLKTGKLVSLSAQNLVDCSTVKYGNKG 179

Query: 199 CLGGSREKAFAYIIQNQGIATEDEYPYQAVPGTCSAAQKPAAAKISNYEEVPSGDEQALL 258
           C GG   +AF YII N GI +E  YPY+A+ G C    K  AA  S Y E+P G E+AL 
Sbjct: 180 CNGGFMTEAFQYIIDNNGIDSEASYPYKAMDGRCQYDVKNRAATCSRYIELPFGSEEALK 239

Query: 259 KAVSMQ-PVSIAIAAYSTEFQSYKEGI-FNGVCGTQLDHAVTIVGFGTTEDGANYWLIKN 316
           +AV+ + PVS+ I A  T F  YK G+ ++  C   ++H V +VG+G+  +G +YWL+KN
Sbjct: 240 EAVANKGPVSVGIDAKQTSFFLYKTGVYYDPSCTQNVNHGVLVVGYGSL-NGKDYWLVKN 298

Query: 317 SWGNTWGDAGYMKIVRDEG-LCGIGTRSSYP 346
           SWG  +GD GY+++ R+ G  CGI    SYP
Sbjct: 299 SWGLNFGDQGYIRMARNSGNHCGIANFPSYP 329


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.315    0.130    0.387 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 5,388,770,322
Number of Sequences: 23463169
Number of extensions: 222062658
Number of successful extensions: 646414
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 6581
Number of HSP's successfully gapped in prelim test: 1124
Number of HSP's that attempted gapping in prelim test: 614430
Number of HSP's gapped (non-prelim): 9535
length of query: 348
length of database: 8,064,228,071
effective HSP length: 143
effective length of query: 205
effective length of database: 9,003,962,200
effective search space: 1845812251000
effective search space used: 1845812251000
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 77 (34.3 bits)