BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 043883
         (348 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|255564908|ref|XP_002523447.1| cysteine protease, putative [Ricinus communis]
 gi|223537275|gb|EEF38906.1| cysteine protease, putative [Ricinus communis]
          Length = 342

 Score =  358 bits (918), Expect = 3e-96,   Method: Compositional matrix adjust.
 Identities = 180/339 (53%), Positives = 240/339 (70%), Gaps = 15/339 (4%)

Query: 5   FLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVA 64
            + V L++ G  ASQA  R+  + ++ E+ E W A+YGR YK+++E  +RFEIF++N+  
Sbjct: 9   LMFVALLVVGLWASQAWSRSLHDAAMNERHEMWMAKYGRVYKDNSEKERRFEIFRNNVEF 68

Query: 65  VERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKS-SQV 123
           +E FN   +GNR Y L +N+FADLT +EF  S+ G+K S  S       + F Y + + V
Sbjct: 69  IESFNK--LGNRPYKLDINEFADLTNEEFKVSKNGYKRS--SGVGLTEKSSFRYANVTAV 124

Query: 124 PPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDN 176
           P S++W + GAVTP+K QGQC       AVAA+EGI  +   +L+SLSEQ+LVDC T+  
Sbjct: 125 PTSMDWRQNGAVTPIKDQGQCGCCWAFSAVAAMEGITKLSTGKLISLSEQELVDCDTSGE 184

Query: 177 NNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPND 236
           + GC GG MDDAF++I QN G+T +A Y Y+G + G C++ KA + AA+IT YEDVP N 
Sbjct: 185 DQGCEGGLMDDAFEFIKQNGGLTTEANYPYQG-TDGTCNTNKAGNDAAKITGYEDVPANS 243

Query: 237 EESLLKAVANQPVSVAIDAS--ALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWL 294
           E++LLKAVA+QPVSVAIDAS  A QFYSGGVF G C T L+HGVTAVGYGTS++G KYWL
Sbjct: 244 EDALLKAVASQPVSVAIDASGSAFQFYSGGVFTGDCGTELDHGVTAVGYGTSDDGTKYWL 303

Query: 295 IKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVS 333
           +KNSWG  WGEDGY R++RDI+  +G CGIAM  S+P +
Sbjct: 304 VKNSWGTSWGEDGYIRMERDIEAKEGLCGIAMQPSYPTA 342


>gi|225446583|ref|XP_002280204.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1 [Vitis vinifera]
          Length = 341

 Score =  355 bits (912), Expect = 1e-95,   Method: Compositional matrix adjust.
 Identities = 181/341 (53%), Positives = 239/341 (70%), Gaps = 17/341 (4%)

Query: 3   KYFLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNL 62
           +Y  + +L +  + ASQAT R   E S+ E+ E W  QYGR YK++ E SKR++IFKDN+
Sbjct: 8   QYICLALLFVLAAWASQATARNLHEASMYERHEDWMVQYGREYKDADEKSKRYKIFKDNV 67

Query: 63  VAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKS-S 121
             +E FN A   ++SY L +N+FADLT +EF AS+  FK   H  S +A  T F Y++ +
Sbjct: 68  ARIESFNKAM--DKSYKLSINEFADLTNEEFRASRNRFKA--HICSTEA--TSFKYENVT 121

Query: 122 QVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATN 174
            VP +V+W +KGAVTP+K QGQC       AVAA+EGI  +   +L+SLSEQ+LVDC T+
Sbjct: 122 AVPSTVDWRKKGAVTPIKDQGQCGSCWAFSAVAAMEGITQLSTGKLISLSEQELVDCDTS 181

Query: 175 DNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPP 234
             + GC GG MDDAFK+I QN G+T +A Y Y G + G C+  KA   AA+I  YEDVP 
Sbjct: 182 GEDQGCSGGLMDDAFKFIEQNHGLTTEANYPYAG-TDGTCNRKKAAHPAAKINGYEDVPA 240

Query: 235 NDEESLLKAVANQPVSVAIDA--SALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKY 292
           N+E++L KAVA+QP++VAIDA  S  QFYS GVF G C T L+HGV+AVGYGTS++G+KY
Sbjct: 241 NNEKALQKAVAHQPIAVAIDAGGSEFQFYSSGVFTGQCGTELDHGVSAVGYGTSDDGMKY 300

Query: 293 WLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVS 333
           WL+KNSWG  WGE+GY R+QRD+   +G CGIAM AS+P +
Sbjct: 301 WLVKNSWGTGWGEEGYIRMQRDVTAKEGLCGIAMQASYPTA 341


>gi|147788834|emb|CAN64655.1| hypothetical protein VITISV_005140 [Vitis vinifera]
          Length = 341

 Score =  355 bits (911), Expect = 2e-95,   Method: Compositional matrix adjust.
 Identities = 180/341 (52%), Positives = 236/341 (69%), Gaps = 17/341 (4%)

Query: 3   KYFLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNL 62
           +Y  + +L +  + AS A  R   E S+ E+ E W AQYGR YK++ E SKR++IFKDN+
Sbjct: 8   RYICLALLFVLAAWASHAKARNLHEASMYERHEDWMAQYGRVYKDAGEKSKRYKIFKDNV 67

Query: 63  VAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKS-S 121
             +E FN A   N+SY L +N+FADLT +EF AS+  FK   H  S +A  T F Y+   
Sbjct: 68  ARIESFNKAM--NKSYKLSINEFADLTNEEFRASRNRFKA--HICSTEA--TSFKYEHVX 121

Query: 122 QVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATN 174
            VP +V+W +KGAVTP+K QGQC       AVAA+EGI  +   +L+SLSEQ+LVDC T+
Sbjct: 122 AVPSTVDWRKKGAVTPIKDQGQCGSCWAFSAVAAMEGITQLSTGKLISLSEQELVDCDTS 181

Query: 175 DNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPP 234
             + GC GG MDDAFK+I QN G+T +A Y Y G + G C+  KA   AA+I  YEDVP 
Sbjct: 182 GEDQGCSGGLMDDAFKFIEQNHGLTTEANYPYAG-TDGTCNRKKAAHPAAKINGYEDVPA 240

Query: 235 NDEESLLKAVANQPVSVAIDASAL--QFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKY 292
           N+E++L KAVA+QP++VAIDA     QFYS GVF G C T L+HGV+AVGYGTS++G+KY
Sbjct: 241 NNEKALQKAVAHQPIAVAIDAGGFEFQFYSSGVFTGQCGTELDHGVSAVGYGTSDDGMKY 300

Query: 293 WLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVS 333
           WL+KNSWG  WGE+GY R+QRD+ + +G CGIAM AS+P +
Sbjct: 301 WLVKNSWGTGWGEEGYIRMQRDVTEKEGLCGIAMQASYPTA 341


>gi|37780045|gb|AAP32195.1| cysteine protease 5 [Trifolium repens]
          Length = 343

 Score =  354 bits (909), Expect = 3e-95,   Method: Compositional matrix adjust.
 Identities = 185/342 (54%), Positives = 230/342 (67%), Gaps = 19/342 (5%)

Query: 1   MAKYFLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKD 60
           +A +F + +L I      Q T RT  + SI E+ EQW   YG+ YK   E  KR  IF +
Sbjct: 12  LALFFCLGLLAI------QVTSRTLQDDSIFERHEQWMTHYGKVYKNPQEREKRLRIFTE 65

Query: 61  NLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKS 120
           NL  +E  NNA   N+ Y L +N+FADLT +EFIAS+  FK    SS ++   T F Y++
Sbjct: 66  NLKYIEASNNAG-NNKPYKLGINQFADLTNEEFIASRNKFKGHMCSSIIRT--TTFKYEN 122

Query: 121 SQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCAT 173
           + VP +V+W +KGAVTPVK QGQC       A+AA EGI+ I   +LVSLSEQ+LVDC T
Sbjct: 123 TSVPSTVDWRKKGAVTPVKNQGQCGCCWAFSAIAATEGIHKISTGKLVSLSEQELVDCDT 182

Query: 174 NDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVP 233
           N  + GC GG MDDAFK+IIQN GI+ +A Y Y+G+  G C + +A   AA IT YEDVP
Sbjct: 183 NGVDQGCEGGLMDDAFKFIIQNNGISTEAGYPYQGVD-GTCKANEASTSAATITGYEDVP 241

Query: 234 PNDEESLLKAVANQPVSVAIDASA--LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIK 291
            N+E +L KAVANQP+SVAIDAS    QFY  GVF G C T L+HGVTAVGYG S +G K
Sbjct: 242 ANNENALQKAVANQPISVAIDASGSDFQFYKSGVFTGSCGTELDHGVTAVGYGISNDGTK 301

Query: 292 YWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVS 333
           YWL+KNSWG DWGE+GY R+QR ID  +G CGIAM AS+P +
Sbjct: 302 YWLVKNSWGTDWGEEGYIRMQRSIDAAEGLCGIAMQASYPTA 343


>gi|225446581|ref|XP_002280246.1| PREDICTED: vignain [Vitis vinifera]
          Length = 341

 Score =  353 bits (907), Expect = 5e-95,   Method: Compositional matrix adjust.
 Identities = 180/341 (52%), Positives = 238/341 (69%), Gaps = 17/341 (4%)

Query: 3   KYFLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNL 62
           +Y  + +L +  + ASQAT R+  E S+ E+ E W  QYGR YK++ E SKR++IFKDN+
Sbjct: 8   QYICLALLFVLAAWASQATARSLHEASMYERHEDWMVQYGREYKDADEKSKRYKIFKDNV 67

Query: 63  VAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKS-S 121
             +E FN A   ++SY L +N+FADLT +EF AS+  FK   H  S +A  T F Y++ +
Sbjct: 68  ARIESFNKAM--DKSYKLSINEFADLTNEEFRASRNRFKA--HICSTEA--TSFKYENVT 121

Query: 122 QVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATN 174
            VP +V+W +KGAVTP+K QGQC       AVAA+EGI  +   +L+SLSEQ+LVDC T+
Sbjct: 122 AVPSTVDWRKKGAVTPIKDQGQCGSCWAFSAVAAMEGITQLSTGKLISLSEQELVDCDTS 181

Query: 175 DNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPP 234
             + GC GG MDDAFK+I QN G+T +A Y Y G + G C+  KA   AA+I  YEDVP 
Sbjct: 182 GEDQGCSGGLMDDAFKFIEQNHGLTTEANYPYAG-TDGTCNRKKAAHPAAKINGYEDVPA 240

Query: 235 NDEESLLKAVANQPVSVAIDASA--LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKY 292
           N+E++L KAVA+QP++VAIDAS    QFYS GVF G C T L+HGV AVGYGTS++G+KY
Sbjct: 241 NNEKALQKAVAHQPIAVAIDASGSEFQFYSSGVFTGQCGTELDHGVAAVGYGTSDDGMKY 300

Query: 293 WLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVS 333
           WL+KNSW   WGE+GY R+QRD+   +G CGIAM AS+P +
Sbjct: 301 WLVKNSWSTGWGEEGYIRMQRDVTAKEGLCGIAMQASYPTA 341


>gi|255564910|ref|XP_002523448.1| cysteine protease, putative [Ricinus communis]
 gi|223537276|gb|EEF38907.1| cysteine protease, putative [Ricinus communis]
          Length = 341

 Score =  353 bits (906), Expect = 7e-95,   Method: Compositional matrix adjust.
 Identities = 180/341 (52%), Positives = 242/341 (70%), Gaps = 16/341 (4%)

Query: 3   KYFLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNL 62
           +  + V L++ G   SQA  R+  + ++ E+ E W  +YGR YK+++E  +RFEIF++N+
Sbjct: 7   RKLMFVALLVVGLWVSQAWSRSLHDAAMNERHEMWMVKYGRVYKDNSEKERRFEIFRNNV 66

Query: 63  VAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKS-S 121
             +E FN    GNR Y L +N+FADLT +EF AS+ G+K S  S+   +  + F Y + +
Sbjct: 67  EFIESFNKP--GNRPYKLDINEFADLTNEEFKASRNGYKRS--SNVGLSEKSSFRYGNVT 122

Query: 122 QVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATN 174
            VP S++W +KGAVTP+K QGQC       AVAA+EGI  +   +L+SLSEQ+LVDC T+
Sbjct: 123 AVPTSMDWRQKGAVTPIKDQGQCGCCWAFSAVAAMEGITKLSTGKLISLSEQELVDCDTS 182

Query: 175 DNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPP 234
             + GC GG MDDAF++I QN G+T +A Y Y+G + G C++ KA + AA+IT YEDVP 
Sbjct: 183 GEDQGCEGGLMDDAFEFIKQNGGLTTEANYPYQG-TDGTCNTNKAGNDAAKITGYEDVPA 241

Query: 235 NDEESLLKAVANQPVSVAIDA--SALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKY 292
           N E++LLKAVA+QPVSVAIDA  SA QFYSGGVF G C T L+HGVTAVGYGTS +G KY
Sbjct: 242 NSEDALLKAVASQPVSVAIDASGSAFQFYSGGVFTGDCGTELDHGVTAVGYGTS-DGTKY 300

Query: 293 WLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVS 333
           WL+KNSWG  WGEDGY R++RDI+  +G CGIAM +S+P +
Sbjct: 301 WLVKNSWGTSWGEDGYIRMERDIEAKEGLCGIAMQSSYPTA 341


>gi|147839728|emb|CAN70559.1| hypothetical protein VITISV_032465 [Vitis vinifera]
          Length = 341

 Score =  353 bits (905), Expect = 9e-95,   Method: Compositional matrix adjust.
 Identities = 180/341 (52%), Positives = 237/341 (69%), Gaps = 17/341 (4%)

Query: 3   KYFLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNL 62
           +Y  + +L +  + ASQAT R   E S+ E+ E W  QYGR YK++ E SKR++IFKDN+
Sbjct: 8   QYICLALLFVLAAWASQATARXLHEASMYERHEDWMVQYGREYKDADEKSKRYKIFKDNV 67

Query: 63  VAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKS-S 121
             +E FN A   ++SY L +N+FADLT +EF AS+  FK   H  S +A  T F Y++ +
Sbjct: 68  ARIESFNKAM--DKSYKLSINEFADLTNEEFRASRNRFKA--HICSTEA--TSFKYENVT 121

Query: 122 QVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATN 174
            VP +V+W +KGAVTP+K QGQC       AVAA+EGI  +   +L+SLSEQ+LVDC T+
Sbjct: 122 AVPSTVDWRKKGAVTPIKDQGQCGSCWAFSAVAAMEGITQLSTGKLISLSEQELVDCDTS 181

Query: 175 DNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPP 234
             + GC GG MDDAFK+I QN G+T +A Y Y G + G C+  KA   AA+I  YEDVP 
Sbjct: 182 GEDQGCSGGLMDDAFKFIEQNHGLTTEANYPYAG-TDGTCNRKKAAHPAAKINGYEDVPA 240

Query: 235 NDEESLLKAVANQPVSVAIDASA--LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKY 292
           N+E++L KAVA+QP++VAIDAS    QFYS GVF G C T L+HGV AVGYGTS++G+KY
Sbjct: 241 NNEKALQKAVAHQPIAVAIDASGSEFQFYSSGVFTGQCGTELDHGVAAVGYGTSDDGMKY 300

Query: 293 WLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVS 333
           WL+KNSW   WGE+GY R+QRD+   +G CGIAM AS+P +
Sbjct: 301 WLVKNSWSTGWGEEGYIRMQRDVTVKEGLCGIAMQASYPTA 341


>gi|37780051|gb|AAP32198.1| cysteine protease 12 [Trifolium repens]
          Length = 343

 Score =  352 bits (904), Expect = 1e-94,   Method: Compositional matrix adjust.
 Identities = 186/343 (54%), Positives = 231/343 (67%), Gaps = 21/343 (6%)

Query: 1   MAKYFLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKD 60
           +A +F + +L I      Q T RT  + SI E+ EQW   YG+ YK   E  KR  IF +
Sbjct: 12  LALFFCLGLLAI------QVTSRTLQDDSIFERHEQWMTHYGKVYKNPQEREKRLRIFTE 65

Query: 61  NLVAVERFNNAAIGNRS-YTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYK 119
           NL  +E  NNA  GN+  Y L +N+FADLT +EFIAS+  FK    SS ++   T F Y+
Sbjct: 66  NLKYIEASNNA--GNKKPYKLGINQFADLTNEEFIASRNKFKGHMCSSIIRT--TTFKYE 121

Query: 120 SSQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCA 172
           ++ VP +V+W +KGAVTPVK QGQC       A+AA EGI+ I   +LVSLSEQ+LVDC 
Sbjct: 122 NTSVPSTVDWRKKGAVTPVKNQGQCGCCWAFSAIAATEGIHKISTGKLVSLSEQELVDCD 181

Query: 173 TNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDV 232
           TN  + GC GG MDDAFK+IIQN GI+ +A Y Y+G+  G C + +A   AA IT YEDV
Sbjct: 182 TNGVDQGCEGGLMDDAFKFIIQNNGISTEAGYPYQGVD-GTCKANEASTSAATITGYEDV 240

Query: 233 PPNDEESLLKAVANQPVSVAIDASA--LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGI 290
           P N+E +L KAVANQP+SVAIDAS    QFY  GVF G C T L+HGVTAVGYG S +G 
Sbjct: 241 PANNENALQKAVANQPISVAIDASGSDFQFYKSGVFTGSCGTELDHGVTAVGYGISNDGT 300

Query: 291 KYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVS 333
           KYWL+KNSWG DWGE+GY R+QR ID  +G CGIAM AS+P +
Sbjct: 301 KYWLVKNSWGTDWGEEGYIRMQRSIDAAEGLCGIAMQASYPTA 343


>gi|255563110|ref|XP_002522559.1| cysteine protease, putative [Ricinus communis]
 gi|223538250|gb|EEF39859.1| cysteine protease, putative [Ricinus communis]
          Length = 344

 Score =  351 bits (901), Expect = 3e-94,   Method: Compositional matrix adjust.
 Identities = 182/343 (53%), Positives = 236/343 (68%), Gaps = 18/343 (5%)

Query: 3   KYFLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNL 62
           K  L+ +L+++   ASQ+  R+  E S+  + + W  QYGR YK + E  KRF+IFK+N+
Sbjct: 8   KLVLMAMLLVT-LWASQSWSRSLHEASMELRHKTWMTQYGRVYKGNVEKEKRFKIFKENV 66

Query: 63  VAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGF--KMSDHSSSLKANGTPFLYKS 120
             +E FNN   GN+ Y L +N F DLT +EF AS  G+   MS H SS +     F Y++
Sbjct: 67  EFIESFNNN--GNKPYKLGINAFTDLTNEEFRASHNGYTMSMSSHQSSYRTK--SFRYEN 122

Query: 121 -SQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCA 172
            + VPPS++W  KGAVT +K QGQC       AVAA+EGI  +    L+SLSEQ+LVDC 
Sbjct: 123 VTAVPPSLDWRTKGAVTHIKDQGQCGCCWAFSAVAAMEGITKLSTGTLISLSEQELVDCD 182

Query: 173 TNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDV 232
           T+  + GC GG MDDAF++II+N G+T +A Y YEG+  G C++ KA +HAA+IT YE+V
Sbjct: 183 TSGMDQGCEGGLMDDAFEFIIENNGLTTEANYPYEGVD-GSCNTRKAANHAAKITGYENV 241

Query: 233 PPNDEESLLKAVANQPVSVAIDA--SALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGI 290
           P  DEE+L KAVANQPVSVAIDA  SA Q YS G+F G C T L+HGVT VGYGTS++G 
Sbjct: 242 PAYDEEALRKAVANQPVSVAIDAGESAFQHYSSGIFTGDCGTELDHGVTVVGYGTSDDGT 301

Query: 291 KYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVS 333
           KYWL+KNSWG  WGEDGY R++RDID  +G CGIAM  S+P +
Sbjct: 302 KYWLVKNSWGTSWGEDGYIRMERDIDAKEGLCGIAMEPSYPTA 344


>gi|359485281|ref|XP_002280230.2| PREDICTED: LOW QUALITY PROTEIN: KDEL-tailed cysteine endopeptidase
           CEP1 [Vitis vinifera]
          Length = 341

 Score =  350 bits (899), Expect = 4e-94,   Method: Compositional matrix adjust.
 Identities = 178/341 (52%), Positives = 236/341 (69%), Gaps = 17/341 (4%)

Query: 3   KYFLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNL 62
           +Y  + +L +  + ASQAT R   E S+ E+ E W AQYGR YK++ E SKR++IFKDN+
Sbjct: 8   QYICLALLFVLAAWASQATARNLHEASMYERHEDWMAQYGRVYKDADEKSKRYKIFKDNV 67

Query: 63  VAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKS-S 121
             +E FN A   ++SY L +N+FADLT +EF  S+  FK   H  S +A  T F Y++ +
Sbjct: 68  ARIESFNKAM--DKSYKLSINEFADLTNEEFGTSRNRFKA--HICSTEA--TSFKYENVT 121

Query: 122 QVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATN 174
            VP +++W +KGAVTP+K QGQC       AVAA+EGI  +   +L+SLSEQ+LVDC T+
Sbjct: 122 AVPSTIDWRKKGAVTPIKDQGQCGSCWAFSAVAAMEGITQLSTGKLISLSEQELVDCDTS 181

Query: 175 DNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPP 234
             + GC GG MDDAFK+I QN G+T +A Y Y G + G C+  KA   AA+I  YEDVP 
Sbjct: 182 GEDQGCNGGLMDDAFKFIKQNHGLTTEANYPYAG-TDGTCNRKKAAHPAAKINGYEDVPA 240

Query: 235 NDEESLLKAVANQPVSVAIDASAL--QFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKY 292
           N+E++L KAV +QP++VAIDA     QFYS GVF G C T L+HGV AVGYGTS++G+KY
Sbjct: 241 NNEKALQKAVVHQPIAVAIDAGGFEFQFYSSGVFTGQCGTELDHGVAAVGYGTSDDGMKY 300

Query: 293 WLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVS 333
           WL+KNSWG  WGE+GY R+QRD+   +G CGIAM AS+P +
Sbjct: 301 WLVKNSWGTGWGEEGYIRMQRDVTAKEGLCGIAMQASYPTA 341


>gi|357477459|ref|XP_003609015.1| Cysteine proteinase [Medicago truncatula]
 gi|355510070|gb|AES91212.1| Cysteine proteinase [Medicago truncatula]
          Length = 345

 Score =  350 bits (897), Expect = 6e-94,   Method: Compositional matrix adjust.
 Identities = 183/342 (53%), Positives = 232/342 (67%), Gaps = 16/342 (4%)

Query: 4   YFLIVVLII--SGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDN 61
           Y+ I +  I   G CA Q T R+    S+ E+ EQW +QY + YK+  E  +R +IF  N
Sbjct: 8   YYSIALTFIFCLGLCAIQVTSRSLQVDSMYERHEQWMSQYSKVYKDPQEREERHKIFTAN 67

Query: 62  LVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKS- 120
           +  +E FNN A  N+ Y L +N+FADLT +EFIAS+  FK   H  S  A  T F Y++ 
Sbjct: 68  VNYIEVFNNDA-NNKLYKLGINQFADLTNEEFIASRNKFK--GHMCSSIAKTTTFKYENV 124

Query: 121 SQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCAT 173
           S +P +V+W +KGAVTPVK QGQC       AVAA EGI  +   +LVSLSEQ+LVDC T
Sbjct: 125 SAIPSTVDWRKKGAVTPVKNQGQCGCCWAFSAVAATEGITKLSTGKLVSLSEQELVDCDT 184

Query: 174 NDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVP 233
              + GC GG MDDAFK+IIQN G++ +A Y Y+G+  G C++ KA  HAA IT YEDVP
Sbjct: 185 KGVDQGCEGGLMDDAFKFIIQNHGLSTEAAYPYQGVD-GTCNANKASIHAATITGYEDVP 243

Query: 234 PNDEESLLKAVANQPVSVAIDASA--LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIK 291
            N+E++L KAVANQP+SVAIDAS    QFY  GVF+G C T L+HGVTAVGYG   +G K
Sbjct: 244 ANNEQALQKAVANQPISVAIDASGSDFQFYKSGVFSGSCGTELDHGVTAVGYGVGNDGTK 303

Query: 292 YWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVS 333
           YWL+KNSWG DWGE+GY R+QR +D  +G CGIAM AS+P +
Sbjct: 304 YWLVKNSWGTDWGEEGYIRMQRGVDAAEGLCGIAMQASYPTA 345


>gi|225446585|ref|XP_002280215.1| PREDICTED: vignain [Vitis vinifera]
          Length = 341

 Score =  348 bits (893), Expect = 2e-93,   Method: Compositional matrix adjust.
 Identities = 180/341 (52%), Positives = 234/341 (68%), Gaps = 17/341 (4%)

Query: 3   KYFLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNL 62
           +Y  + +L    + ASQAT R   E S+ E+ E W AQYGR YK++ E SKR++IFKDN+
Sbjct: 8   QYICLALLFFLAAWASQATARNLLEASMYERHEDWMAQYGRVYKDADEKSKRYKIFKDNV 67

Query: 63  VAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKS-S 121
             +E FN A   ++SY L +N+FADLT +EF AS+  FK   H  S +A  T F Y+  +
Sbjct: 68  ARIESFNKAM--DKSYKLSINEFADLTNEEFRASRNRFKA--HICSTEA--TSFKYEHVA 121

Query: 122 QVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATN 174
            VP +V+W +KGAVTP+K QGQC       AVAA+EGI  +   +L+SLSEQ+LVDC T+
Sbjct: 122 AVPSTVDWRKKGAVTPIKDQGQCGSCWAFSAVAAMEGITQLSTGKLISLSEQELVDCDTS 181

Query: 175 DNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPP 234
             + GC GG MDDAFK+I QN G+  +A Y Y G + G C+  KA   AA+I  YEDVP 
Sbjct: 182 GEDQGCNGGLMDDAFKFIEQNHGLATEANYPYAG-TDGTCNRKKAAHPAAKINGYEDVPA 240

Query: 235 NDEESLLKAVANQPVSVAIDASAL--QFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKY 292
           N+E++L KAVA+QP++VAIDA     QFYS GVF G C T L+HGV AVGYGTS++G+KY
Sbjct: 241 NNEKALQKAVAHQPIAVAIDAGGFEFQFYSSGVFTGQCGTELDHGVAAVGYGTSDDGMKY 300

Query: 293 WLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVS 333
           WL+KNSWG  WGE GY R+QRD+   +G CGIAM AS+P +
Sbjct: 301 WLVKNSWGTGWGEVGYIRMQRDVTAKEGLCGIAMQASYPTA 341


>gi|224114698|ref|XP_002316833.1| predicted protein [Populus trichocarpa]
 gi|222859898|gb|EEE97445.1| predicted protein [Populus trichocarpa]
          Length = 305

 Score =  348 bits (892), Expect = 3e-93,   Method: Compositional matrix adjust.
 Identities = 179/312 (57%), Positives = 224/312 (71%), Gaps = 18/312 (5%)

Query: 32  EKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQ 91
           E+ E W AQYGR YK   E  +R  IFK+N+  +E FN   +G + Y L +N+FADLT +
Sbjct: 2   ERHETWMAQYGRAYKGHVEKERRLNIFKNNVEFIESFNK--VGKKPYKLSVNEFADLTNE 59

Query: 92  EFIASQTGFKMSDHSSSLKANGTPFLYKS-SQVPPSVNWIEKGAVTPVKYQGQC------ 144
           EF AS+ G+KMS H SS  ++  PF Y++ S VP +++W +KGAVTP+K QGQC      
Sbjct: 60  EFQASRNGYKMSAHLSS--SSTKPFRYENVSAVPSTMDWRKKGAVTPIKDQGQCGCCWAF 117

Query: 145 -AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAV 203
            AVAA EGI  +   +L+SLSEQ+LVDC T+  + GC GG MDDAF +IIQNKG+T +A 
Sbjct: 118 SAVAATEGITQLSTGKLISLSEQELVDCDTSGEDQGCNGGLMDDAFDFIIQNKGLTTEAN 177

Query: 204 YSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDA--SALQFY 261
           Y Y+G + G C+S KA   AA+IT YEDVP N E +LLKAVANQPVSVAIDA  SA QFY
Sbjct: 178 YPYQG-ADGACNSGKA---AAKITGYEDVPANSEAALLKAVANQPVSVAIDAGGSAFQFY 233

Query: 262 SGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQ 321
           S GVF G C T L+HGVTAVGYG S++G KYWL+KNSWG  WGE+GY R++RDID  +G 
Sbjct: 234 SSGVFTGDCGTDLDHGVTAVGYGMSDDGTKYWLVKNSWGTSWGENGYIRMERDIDAQEGL 293

Query: 322 CGIAMFASFPVS 333
           CGIAM AS+P +
Sbjct: 294 CGIAMEASYPTA 305


>gi|357483847|ref|XP_003612210.1| Cysteine proteinase [Medicago truncatula]
 gi|355513545|gb|AES95168.1| Cysteine proteinase [Medicago truncatula]
          Length = 344

 Score =  346 bits (888), Expect = 9e-93,   Method: Compositional matrix adjust.
 Identities = 179/341 (52%), Positives = 231/341 (67%), Gaps = 15/341 (4%)

Query: 4   YFLIVVLIIS-GSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNL 62
           Y + + L+   G  A Q T RT  +GS+ E+ E+W   YG+ YK+  E  KRF+IF +N+
Sbjct: 8   YHISLALVFCLGLWAIQVTSRTLQDGSMHERHERWMNHYGKVYKDHQEREKRFKIFTENM 67

Query: 63  VAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKS-S 121
             +E FNN    N SY L +N+FADLT +EF+AS+  FK    SS ++   T F Y++ S
Sbjct: 68  KYIEAFNNGD-NNESYKLGINQFADLTNEEFVASRNKFKGHMCSSIIRT--TTFKYENVS 124

Query: 122 QVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATN 174
            +P +V+W +KGAVTPVK QGQC       AVAA EGI+ +   +LVSLSEQ+LVDC T 
Sbjct: 125 AIPSTVDWRKKGAVTPVKNQGQCGCCWAFSAVAATEGIHKLSTGKLVSLSEQELVDCDTK 184

Query: 175 DNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPP 234
             + GC GG MDDAFK+IIQN G+  +A Y Y+G+  G C++ KA   A  IT YEDVP 
Sbjct: 185 GVDQGCEGGLMDDAFKFIIQNHGLNTEAQYPYQGVD-GTCNANKASIQATTITGYEDVPA 243

Query: 235 NDEESLLKAVANQPVSVAIDASA--LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKY 292
           N+E++L KAVANQP+SVAIDAS    QFY  GVF G C T L+HGVTAVGYG S +G KY
Sbjct: 244 NNEQALQKAVANQPISVAIDASGSDFQFYKSGVFTGSCGTELDHGVTAVGYGVSNDGTKY 303

Query: 293 WLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVS 333
           WL+KNSWG DWGE+GY  +QR ++  +G CGIAM AS+P +
Sbjct: 304 WLVKNSWGTDWGEEGYIMMQRGVEAAEGLCGIAMQASYPTA 344


>gi|118627554|emb|CAL64936.1| putative cysteine protease 8 [Trifolium pratense]
          Length = 344

 Score =  346 bits (887), Expect = 1e-92,   Method: Compositional matrix adjust.
 Identities = 180/337 (53%), Positives = 230/337 (68%), Gaps = 14/337 (4%)

Query: 7   IVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVE 66
           + +L   G  A Q T RT  + S+ E+  QW +QYG+ YK+  E   RF+IFK+N+  +E
Sbjct: 12  LALLFCLGLFAIQVTSRTLQDDSMYERHGQWMSQYGKIYKDHQERETRFKIFKENVNYIE 71

Query: 67  RFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKS-SQVPP 125
            FNNA    +SY L +N+FADLT +EFIAS+  FK    SS ++   T F Y++ S +P 
Sbjct: 72  TFNNAD-DTKSYKLGINQFADLTNEEFIASRNKFKGHMCSSIMRT--TSFKYENVSGIPS 128

Query: 126 SVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNN 178
           +V+W +KGAVTPVK QGQC       AVAA EGI+ +   +L+SLSEQ+LVDC T   + 
Sbjct: 129 TVDWRKKGAVTPVKNQGQCGCCWAFSAVAATEGIHKLSTGKLISLSEQELVDCDTKGVDQ 188

Query: 179 GCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEE 238
           GC GG MDDAFK+IIQN G++ +A Y YEG+  G C++ KA   A  IT YEDVP N E+
Sbjct: 189 GCEGGLMDDAFKFIIQNHGLSTEAQYPYEGVD-GTCNANKASVQAVTITGYEDVPANSEQ 247

Query: 239 SLLKAVANQPVSVAIDASA--LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIK 296
           +L KAVANQP+SVAIDAS    QFY  GVF G C T L+HGVTAVGYG S +G KYWL+K
Sbjct: 248 ALQKAVANQPISVAIDASGSDFQFYKSGVFTGACGTELDHGVTAVGYGVSNDGTKYWLVK 307

Query: 297 NSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVS 333
           NSWG DWGE+GY  +QR I+  +G CGIAM AS+P +
Sbjct: 308 NSWGTDWGEEGYIMMQRGIEAAEGICGIAMQASYPTA 344


>gi|356517348|ref|XP_003527349.1| PREDICTED: vignain-like [Glycine max]
          Length = 343

 Score =  346 bits (887), Expect = 1e-92,   Method: Compositional matrix adjust.
 Identities = 175/337 (51%), Positives = 231/337 (68%), Gaps = 15/337 (4%)

Query: 7   IVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVE 66
           + +L  SG  A Q T RT  + S+ E+ E+W  +Y + YK+  E  +RF+IFK+N+  +E
Sbjct: 12  LALLFCSGFLAFQVTCRTLQDASMYERHEEWMGRYAKVYKDPQERERRFKIFKENVNYIE 71

Query: 67  RFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKS-SQVPP 125
            FNNAA  N+ YTL +N+FADLT +EFIA +  FK   H  S     T F Y++ + +P 
Sbjct: 72  AFNNAA--NKPYTLGINQFADLTNEEFIAPRNRFK--GHMCSSITRTTTFKYENVTAIPS 127

Query: 126 SVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNN 178
           +V+W +KGAVTP+K QGQC       AVAA EGI+A+   +L+SLSEQ++VDC T   + 
Sbjct: 128 TVDWRQKGAVTPIKDQGQCGCCWAFSAVAATEGIHALSAGKLISLSEQEVVDCDTKGEDQ 187

Query: 179 GCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEE 238
           GC GGFMD AFK+IIQN G+ N+  Y Y+ +  G C++  A +H A IT YEDVP N+E+
Sbjct: 188 GCAGGFMDGAFKFIIQNHGLNNEPNYPYKAVD-GKCNAKAAANHVATITGYEDVPVNNEK 246

Query: 239 SLLKAVANQPVSVAIDASA--LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIK 296
           +L KAVANQPVSVAIDAS    QFY  GVF G C T L+HGVTAVGYG S +G +YWL+K
Sbjct: 247 ALQKAVANQPVSVAIDASGSDFQFYQSGVFTGSCGTELDHGVTAVGYGVSADGTEYWLVK 306

Query: 297 NSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVS 333
           NSWG +WGE+GY R+QR +   +G CGIAM AS+P +
Sbjct: 307 NSWGTEWGEEGYIRMQRGVKAEEGLCGIAMMASYPTA 343


>gi|50355621|dbj|BAD29959.1| cysteine protease [Daucus carota]
          Length = 361

 Score =  345 bits (885), Expect = 2e-92,   Method: Compositional matrix adjust.
 Identities = 178/341 (52%), Positives = 231/341 (67%), Gaps = 15/341 (4%)

Query: 3   KYFLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNL 62
           K+F+I  LI+ G+ A QAT RT  E S+ E+ EQW  QYGR YK+ AE S RF+IF DN+
Sbjct: 26  KHFMIAALILLGAWACQATSRTLPEASMFERHEQWMIQYGRVYKDEAEKSVRFQIFMDNV 85

Query: 63  VAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKS-S 121
             +E FN    G +SY L +N+FAD T +EF AS+ G+KM+   SS  +  T F Y++ +
Sbjct: 86  KFIEEFNKD--GRQSYKLAVNEFADQTNEEFQASRNGYKMA--VSSRPSQTTLFRYENVT 141

Query: 122 QVPPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATN 174
            VP S++W +KGAVTPVK QGQC        +AA EGI  +K  +L+SLSEQ+LVDC   
Sbjct: 142 AVPSSMDWRKKGAVTPVKDQGQCGSCWAFSTIAATEGITKLKTGKLISLSEQELVDCDKT 201

Query: 175 DNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPP 234
             + GC GG+M+D F++I++NKGI  +A Y Y   + G C+S +    AA+I+ YE VP 
Sbjct: 202 GEDQGCEGGYMEDGFEFIVKNKGIALEASYPYTA-ADGTCNSKEEASRAAKISGYEKVPA 260

Query: 235 NDEESLLKAVANQPVSVAIDAS--ALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKY 292
           N E +LLKAVANQPVSV+IDAS  A QFYS GVF G C T L+HGVTAVGYG + +G KY
Sbjct: 261 NSETALLKAVANQPVSVSIDASGVAFQFYSSGVFTGECGTDLDHGVTAVGYGKTSDGTKY 320

Query: 293 WLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVS 333
           WL+KNSWG  WG+ GY  +QR +    G CGIAM AS+P +
Sbjct: 321 WLVKNSWGASWGDSGYIMMQRGVAAKGGLCGIAMDASYPTA 361


>gi|357474527|ref|XP_003607548.1| Cysteine protease [Medicago truncatula]
 gi|358347211|ref|XP_003637653.1| Cysteine protease [Medicago truncatula]
 gi|355503588|gb|AES84791.1| Cysteine protease [Medicago truncatula]
 gi|355508603|gb|AES89745.1| Cysteine protease [Medicago truncatula]
          Length = 345

 Score =  344 bits (883), Expect = 3e-92,   Method: Compositional matrix adjust.
 Identities = 177/327 (54%), Positives = 222/327 (67%), Gaps = 14/327 (4%)

Query: 17  ASQATYRTF-DEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGN 75
           A Q T RT  D+ +I EK EQW   YG+ YK+  E   R +IFK+N+  +E  NNA   N
Sbjct: 23  AIQVTSRTLQDDSNIYEKHEQWMVHYGKVYKDLQERENRLKIFKENVNYIEASNNAG-NN 81

Query: 76  RSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAV 135
           + Y L +N+FADLT +EFIAS+  FK   H  S     + F Y+++ VP +V+W +KGAV
Sbjct: 82  KLYKLGINQFADLTNEEFIASRNKFK--GHMCSSITKTSTFKYENASVPSTVDWRKKGAV 139

Query: 136 TPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDA 188
           TPVK QGQC       AVAA EGI+ +   +LVSLSEQ+LVDC T   + GC GG MDDA
Sbjct: 140 TPVKNQGQCGCCWAFSAVAATEGIHKLSTGKLVSLSEQELVDCDTKGVDQGCEGGLMDDA 199

Query: 189 FKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQP 248
           FK+IIQN G+  +A Y Y+G+  G C + KA  HA  IT YEDVP N+E++L KAVANQP
Sbjct: 200 FKFIIQNHGLNTEAQYPYQGVD-GTCSANKASIHAVTITGYEDVPANNEQALQKAVANQP 258

Query: 249 VSVAIDASA--LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGED 306
           +SVAIDAS    QFY  GVF G C T L+HGVTAVGYG   +G KYWL+KNSWG DWGE+
Sbjct: 259 ISVAIDASGSDFQFYKSGVFTGSCGTELDHGVTAVGYGVGNDGTKYWLVKNSWGTDWGEE 318

Query: 307 GYFRLQRDIDQPQGQCGIAMFASFPVS 333
           GY ++QR +D  +G CGIAM AS+P +
Sbjct: 319 GYIKMQRGVDAAEGLCGIAMEASYPTA 345


>gi|356577813|ref|XP_003557017.1| PREDICTED: uncharacterized protein LOC100801364 [Glycine max]
          Length = 890

 Score =  344 bits (883), Expect = 3e-92,   Method: Compositional matrix adjust.
 Identities = 175/337 (51%), Positives = 230/337 (68%), Gaps = 15/337 (4%)

Query: 7   IVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVE 66
           + +L+     A Q T R+  + S+ E+ EQW  +YG+ YK+  E  KRF IFK+N+  +E
Sbjct: 559 LAMLLCMAFLAFQVTCRSLQDASMYERHEQWMTRYGKVYKDPQEREKRFRIFKENVNYIE 618

Query: 67  RFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKS-SQVPP 125
            FNNAA  N+ Y L +N+FADLT +EFIA +  FK   H  S     T F Y++ + VP 
Sbjct: 619 AFNNAA--NKRYKLAINQFADLTNEEFIAPRNRFK--GHMCSSIIRTTTFKYENVTAVPS 674

Query: 126 SVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNN 178
           +V+W +KGAVTP+K QGQC       AVAA EGI+A+   +L+SLSEQ+LVDC T   + 
Sbjct: 675 TVDWRQKGAVTPIKDQGQCGCCWAFSAVAATEGIHALTSGKLISLSEQELVDCDTKGVDQ 734

Query: 179 GCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEE 238
           GC GG MDDAFK++IQN G+  +A Y Y+G+  G C++ +A +    IT YEDVP N+E+
Sbjct: 735 GCEGGLMDDAFKFVIQNHGLNTEANYPYKGVD-GKCNANEAANDVVTITGYEDVPANNEK 793

Query: 239 SLLKAVANQPVSVAIDASA--LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIK 296
           +L KAVANQPVSVAIDAS    QFY  GVF G C T L+HGVTAVGYG S +G +YWL+K
Sbjct: 794 ALQKAVANQPVSVAIDASGSDFQFYKSGVFTGSCGTELDHGVTAVGYGVSNDGTEYWLVK 853

Query: 297 NSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVS 333
           NSWG +WGE+GY R+QR +D  +G CGIAM AS+P +
Sbjct: 854 NSWGTEWGEEGYIRMQRGVDSEEGLCGIAMQASYPTA 890


>gi|356577763|ref|XP_003556992.1| PREDICTED: vignain-like [Glycine max]
          Length = 343

 Score =  344 bits (882), Expect = 4e-92,   Method: Compositional matrix adjust.
 Identities = 174/337 (51%), Positives = 230/337 (68%), Gaps = 15/337 (4%)

Query: 7   IVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVE 66
           + +L  SG    Q T RT  + S+ E+ E+W  +Y + YK+  E  +RF+IFK+N+  +E
Sbjct: 12  LALLFCSGFLTFQVTCRTLQDASMYERHEEWMGRYAKVYKDPQERERRFKIFKENVNYIE 71

Query: 67  RFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKS-SQVPP 125
            FNNAA  N+ YTL +N+FADLT +EFIA +  FK   H  S     T F Y++ + +P 
Sbjct: 72  AFNNAA--NKPYTLGINQFADLTNEEFIAPRNRFK--GHMCSSITRTTTFKYENVTAIPS 127

Query: 126 SVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNN 178
           +V+W +KGAVTP+K QGQC       AVAA EGI+A+   +L+SLSEQ++VDC T   + 
Sbjct: 128 TVDWRQKGAVTPIKDQGQCGCCWAFSAVAATEGIHALSAGKLISLSEQEVVDCDTKGEDQ 187

Query: 179 GCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEE 238
           GC GGFMD AFK+IIQN G+ N+  Y Y+ +  G C++  A +H A IT YEDVP N+E+
Sbjct: 188 GCAGGFMDGAFKFIIQNHGLNNEPNYPYKAVD-GKCNAKAAANHVATITGYEDVPVNNEK 246

Query: 239 SLLKAVANQPVSVAIDASA--LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIK 296
           +L KAVANQPVSVAIDAS    QFY  GVF G C T L+HGVTAVGYG S +G +YWL+K
Sbjct: 247 ALQKAVANQPVSVAIDASGSDFQFYQSGVFTGSCGTELDHGVTAVGYGVSADGTEYWLVK 306

Query: 297 NSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVS 333
           NSWG +WGE+GY R+QR +   +G CGIAM AS+P +
Sbjct: 307 NSWGTEWGEEGYIRMQRGVKAEEGLCGIAMMASYPTA 343


>gi|357474573|ref|XP_003607571.1| Cysteine proteinase EP-B [Medicago truncatula]
 gi|34329348|gb|AAQ63885.1| putative cysteine proteinase [Medicago truncatula]
 gi|355508626|gb|AES89768.1| Cysteine proteinase EP-B [Medicago truncatula]
          Length = 345

 Score =  344 bits (882), Expect = 4e-92,   Method: Compositional matrix adjust.
 Identities = 178/337 (52%), Positives = 224/337 (66%), Gaps = 14/337 (4%)

Query: 7   IVVLIISGSCASQATYRTF-DEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAV 65
           + +    G  A Q T RT  D+  I EK EQW   YG+ YK+  E   R +IFK+N+  +
Sbjct: 13  LALFFCLGLFAIQVTSRTLQDDSIIYEKHEQWMVHYGKVYKDLQERENRLKIFKENVNYI 72

Query: 66  ERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPP 125
           E  NNA   N+ Y L +N+FADLT +EFIAS+  FK   H  S     + F Y+++ VP 
Sbjct: 73  EASNNAG-NNKLYKLGINQFADLTNEEFIASRNKFK--GHMCSSITKTSTFKYENASVPS 129

Query: 126 SVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNN 178
           +V+W +KGAVTPVK QGQC       AVAA EGI+ +   +LVSLSEQ+LVDC T   + 
Sbjct: 130 TVDWRKKGAVTPVKNQGQCGCCWAFSAVAATEGIHKLSTGKLVSLSEQELVDCDTKGVDQ 189

Query: 179 GCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEE 238
           GC GG MDDAFK+IIQN G+  +A Y Y+G+  G C + KA  HA  IT YEDVP N+E+
Sbjct: 190 GCEGGLMDDAFKFIIQNHGLNTEAQYPYQGVD-GTCSANKASIHAVTITGYEDVPANNEQ 248

Query: 239 SLLKAVANQPVSVAIDASA--LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIK 296
           +L KAVANQP+SVAIDAS    QFY  GVF G C T L+HGVTAVGYG   +G KYWL+K
Sbjct: 249 ALQKAVANQPISVAIDASGSDFQFYKSGVFTGSCGTELDHGVTAVGYGVGNDGTKYWLVK 308

Query: 297 NSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVS 333
           NSWG DWGE+GY ++QR +D  +G CGIAM AS+P +
Sbjct: 309 NSWGTDWGEEGYIKMQRGVDAAEGLCGIAMEASYPTA 345


>gi|409190991|gb|AFV30165.1| cysteine proteinase [Lotus japonicus]
          Length = 342

 Score =  343 bits (881), Expect = 5e-92,   Method: Compositional matrix adjust.
 Identities = 174/338 (51%), Positives = 229/338 (67%), Gaps = 16/338 (4%)

Query: 5   FLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVA 64
           F +V+ +  G  A Q + RT  + S+ E+ EQW A+YGR YK+  E  KRF IFK+N+  
Sbjct: 12  FALVLCL--GLWAFQVSSRTLQDASMQERHEQWMARYGRVYKDLQEKEKRFSIFKENVNY 69

Query: 65  VERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVP 124
           +E  NNA  G++ Y L +N+FADLT +EFIA++  FK   H SS     T F Y++   P
Sbjct: 70  IEASNNA--GDKPYKLGVNQFADLTNEEFIATRNKFK--GHMSSSITRTTTFKYENVTAP 125

Query: 125 PSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNN 177
            +V+W ++GAVTPVK QG C       AVAA EGI+ +    LVSLSEQ+LVDC T+  +
Sbjct: 126 STVDWRQEGAVTPVKNQGTCGCCWAFSAVAATEGIHKLSTGNLVSLSEQELVDCDTSGAD 185

Query: 178 NGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDE 237
            GC GG MDDAFK+IIQN G+  +A Y Y+G+  G C++ +   H A IT YEDVP N+E
Sbjct: 186 QGCQGGLMDDAFKFIIQNGGLNTEAQYPYQGVD-GTCNTNEEATHVATITGYEDVPSNNE 244

Query: 238 ESLLKAVANQPVSVAIDASALQF--YSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLI 295
           ++L +AVANQP+S+AIDAS   F  Y  GVF G C T L+HGV  VGYG S++G KYWL+
Sbjct: 245 QALQQAVANQPISIAIDASGSDFQNYQSGVFTGSCGTQLDHGVAVVGYGVSDDGTKYWLV 304

Query: 296 KNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVS 333
           KNSWG DWGE+GY R+QRD+D P+G CG+AM  S+P +
Sbjct: 305 KNSWGADWGEEGYIRMQRDVDAPEGLCGLAMQPSYPTA 342


>gi|356577811|ref|XP_003557016.1| PREDICTED: vignain-like [Glycine max]
          Length = 343

 Score =  343 bits (881), Expect = 6e-92,   Method: Compositional matrix adjust.
 Identities = 176/340 (51%), Positives = 234/340 (68%), Gaps = 15/340 (4%)

Query: 4   YFLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLV 63
           +  + +L+     A Q T R+  + S+ E+ EQW  +YG+ YK+  E  KRF IFK+N+ 
Sbjct: 9   HISLAMLLCMAFLAFQVTCRSLQDASMYERHEQWMTRYGKVYKDPQEREKRFRIFKENVN 68

Query: 64  AVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKS-SQ 122
            +E FNNAA  N+ Y L +N+FADLT +EFIA +  FK    SS ++   T F Y++ + 
Sbjct: 69  YIEAFNNAA--NKRYKLAINQFADLTNEEFIAPRNRFKGHMCSSIIRT--TTFKYENVTA 124

Query: 123 VPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATND 175
           VP +V+W +KGAVTP+K QGQC       AVAA EGI+A+   +L+SLSEQ+LVDC T  
Sbjct: 125 VPSTVDWRQKGAVTPIKDQGQCGCCWAFSAVAATEGIHALTSGKLISLSEQELVDCDTKG 184

Query: 176 NNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPN 235
            + GC GG MDDAFK++IQN G+  +A Y Y+G+  G C+  +A + AA IT YEDVP N
Sbjct: 185 VDQGCEGGLMDDAFKFVIQNHGLNTEANYPYKGVD-GKCNVNEAANDAATITGYEDVPAN 243

Query: 236 DEESLLKAVANQPVSVAIDASA--LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYW 293
           +E++L KAVANQPVSVAIDAS    QFY  GVF G C T L+HGVTAVGYG S +G +YW
Sbjct: 244 NEKALQKAVANQPVSVAIDASGSDFQFYKSGVFTGSCGTELDHGVTAVGYGVSNDGTEYW 303

Query: 294 LIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVS 333
           L+KNSWG +WGE+GY R+QR ++  +G CGIAM AS+P +
Sbjct: 304 LVKNSWGTEWGEEGYIRMQRGVNSEEGLCGIAMQASYPTA 343


>gi|224093956|ref|XP_002310053.1| predicted protein [Populus trichocarpa]
 gi|224147016|ref|XP_002336386.1| predicted protein [Populus trichocarpa]
 gi|222834869|gb|EEE73318.1| predicted protein [Populus trichocarpa]
 gi|222852956|gb|EEE90503.1| predicted protein [Populus trichocarpa]
          Length = 340

 Score =  343 bits (880), Expect = 8e-92,   Method: Compositional matrix adjust.
 Identities = 178/341 (52%), Positives = 237/341 (69%), Gaps = 18/341 (5%)

Query: 3   KYFLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNL 62
           ++  + +L I G+  S++T RT  +  + E+ EQW  QYGR YK+  E + R+ IFK+N+
Sbjct: 8   QFVCLALLFILGAWPSKSTARTLLDAPMYERHEQWMTQYGRVYKDDNERATRYSIFKENV 67

Query: 63  VAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKS-S 121
             ++ FN+     +SY L +N+FADLT +EF AS+  FK   H  S +A   PF Y++ S
Sbjct: 68  ARIDAFNSQT--GKSYKLGVNQFADLTNEEFKASRNRFK--GHMCSPQAG--PFRYENVS 121

Query: 122 QVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATN 174
            VP +V+W ++GAVTPVK QGQC       AVAA+EGIN +   +L+SLSEQ++VDC T 
Sbjct: 122 AVPSTVDWRKEGAVTPVKDQGQCGCCWAFSAVAAMEGINKLTTGKLISLSEQEVVDCDTK 181

Query: 175 DNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPP 234
             + GC GG MDDAFK+I QNKG+T +A Y Y+G + G C++ KA  HAA+IT +EDVP 
Sbjct: 182 GEDQGCNGGLMDDAFKFIEQNKGLTTEANYPYKG-TDGTCNTNKAAIHAAKITGFEDVPA 240

Query: 235 NDEESLLKAVANQPVSVAIDA--SALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKY 292
           N E +L+KAVA QPVSVAIDA  S  QFYS G+F G C+T L+HGVTAVGYG S +G KY
Sbjct: 241 NSEAALMKAVAKQPVSVAIDAGGSDFQFYSSGIFTGSCDTQLDHGVTAVGYGVS-DGSKY 299

Query: 293 WLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVS 333
           WL+KNSWG  WGE+GY R+Q+DI   +G CGIAM AS+P +
Sbjct: 300 WLVKNSWGAQWGEEGYIRMQKDISAKEGLCGIAMQASYPTA 340


>gi|356545063|ref|XP_003540965.1| PREDICTED: thiol protease SEN102-like [Glycine max]
          Length = 361

 Score =  343 bits (880), Expect = 8e-92,   Method: Compositional matrix adjust.
 Identities = 175/337 (51%), Positives = 232/337 (68%), Gaps = 15/337 (4%)

Query: 7   IVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVE 66
           + +L+     A Q T R+  + S+ E+ EQW  +YG+ YK+  E  KRF IFK+N+  +E
Sbjct: 30  LAMLLCMAFLAFQVTCRSLQDASMYERHEQWMTRYGKVYKDPQEREKRFRIFKENVNYIE 89

Query: 67  RFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKS-SQVPP 125
            FNNAA  N+ Y L +N+FADLT +EFIA +  FK    SS ++   T F Y++ + VP 
Sbjct: 90  AFNNAA--NKRYKLAINQFADLTNEEFIAPRNRFKGHMCSSIIRT--TTFKYENVTAVPS 145

Query: 126 SVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNN 178
           +V+W +KGAVTP+K QGQC       AVAA EGI+A+   +L+SLSEQ+LVDC T   + 
Sbjct: 146 TVDWRQKGAVTPIKDQGQCGCCWAFSAVAATEGIHALTSGKLISLSEQELVDCDTKGVDQ 205

Query: 179 GCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEE 238
           GC GG MDDAFK++IQN G+  +A Y Y+G+  G C++ +A +    IT YEDVP N+E+
Sbjct: 206 GCEGGLMDDAFKFVIQNHGLNTEANYPYKGVD-GKCNANEAANDVVTITGYEDVPANNEK 264

Query: 239 SLLKAVANQPVSVAIDASA--LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIK 296
           +L KAVANQPVSVAIDAS    QFY  GVF G C T L+HGVTAVGYG S +G +YWL+K
Sbjct: 265 ALQKAVANQPVSVAIDASGSDFQFYKSGVFTGSCGTELDHGVTAVGYGVSNDGTEYWLVK 324

Query: 297 NSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVS 333
           NSWG +WGE+GY R+QR +D  +G CGIAM AS+P +
Sbjct: 325 NSWGTEWGEEGYIRMQRGVDSEEGLCGIAMQASYPTA 361


>gi|356542633|ref|XP_003539771.1| PREDICTED: LOW QUALITY PROTEIN: vignain-like [Glycine max]
          Length = 341

 Score =  343 bits (879), Expect = 9e-92,   Method: Compositional matrix adjust.
 Identities = 170/335 (50%), Positives = 224/335 (66%), Gaps = 16/335 (4%)

Query: 7   IVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVE 66
           + + +I   CA +A  RT ++  + E+ EQW A +G+ YK S E  ++++IF +N+  +E
Sbjct: 11  LALFLIFAFCAFEANARTLEDAPMRERHEQWMATHGKVYKHSYEKEQKYQIFMENVQRIE 70

Query: 67  RFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKS-SQVPP 125
            FNNA  G + Y L +N FADLT +EF A     +   H  S +   T F Y++ + VP 
Sbjct: 71  AFNNA--GXKPYKLGINHFADLTNEEFKAIN---RFKGHVCSKRTRTTTFRYENVTAVPA 125

Query: 126 SVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNN 178
           S++W +KGAVTP+K QGQC       AVAA EGI  ++  +L+SLSEQ+LVDC T   + 
Sbjct: 126 SLDWRQKGAVTPIKDQGQCGCCWAFSAVAATEGITKLRTGKLISLSEQELVDCDTKGVDQ 185

Query: 179 GCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEE 238
           GC GG MDDAFK+I+QNKG+  +A+Y YEG   G C++    +HA  I  YEDVP N E 
Sbjct: 186 GCEGGLMDDAFKFILQNKGLATEAIYPYEGFD-GTCNAKADGNHAGSIKGYEDVPANSES 244

Query: 239 SLLKAVANQPVSVAIDASAL--QFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIK 296
           +LLKAVANQPVSVAI+AS    QFYSGGVF G C T L+HGVT+VGYG  ++G KYWL+K
Sbjct: 245 ALLKAVANQPVSVAIEASGFKFQFYSGGVFTGSCGTNLDHGVTSVGYGVGDDGTKYWLVK 304

Query: 297 NSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFP 331
           NSWG  WGE GY R+QRD+   +G CGIAM AS+P
Sbjct: 305 NSWGVKWGEKGYIRMQRDVAAKEGLCGIAMLASYP 339


>gi|224081320|ref|XP_002306369.1| predicted protein [Populus trichocarpa]
 gi|222855818|gb|EEE93365.1| predicted protein [Populus trichocarpa]
          Length = 340

 Score =  342 bits (877), Expect = 2e-91,   Method: Compositional matrix adjust.
 Identities = 176/340 (51%), Positives = 234/340 (68%), Gaps = 18/340 (5%)

Query: 2   AKYFLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDN 61
           +++  + +L + G+  S++  RT  + S+ E+ EQW AQYGR YK+ AE   R+ IFK+N
Sbjct: 7   SQFICLALLFVLGAWPSKSAARTLQDVSMYERHEQWMAQYGRVYKDDAEKETRYNIFKEN 66

Query: 62  LVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKS- 120
           +  ++ FN+     +SY L +N+FADL+ +EF AS+  FK   H  S +A   PF Y++ 
Sbjct: 67  VARIDAFNSQT--GKSYKLGVNQFADLSNEEFKASRNRFK--GHMCSPQAG--PFRYENV 120

Query: 121 SQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCAT 173
           S VP +++W +KGAVTPVK QGQC       AVAA+EGIN +   +L+SLSEQ++VDC T
Sbjct: 121 SAVPATMDWRKKGAVTPVKDQGQCGCCWAFSAVAAMEGINQLTTGKLISLSEQEVVDCDT 180

Query: 174 NDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVP 233
              + GC GG MDDAFK+I QNKG+T +A Y Y G + G C++ K   HAA+IT +EDVP
Sbjct: 181 KGEDQGCNGGLMDDAFKFIEQNKGLTTEANYPYTG-TDGTCNTQKEATHAAKITGFEDVP 239

Query: 234 PNDEESLLKAVANQPVSVAIDASA--LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIK 291
            N E +L+KAVA QPVSVAIDA     QFYS G+F G C T L+HGVTAVGYG S +G K
Sbjct: 240 ANSEAALMKAVAKQPVSVAIDAGGFEFQFYSSGIFTGSCGTQLDHGVTAVGYGIS-DGTK 298

Query: 292 YWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFP 331
           YWL+KNSWG  WGE+GY R+Q+DI   +G CGIAM AS+P
Sbjct: 299 YWLVKNSWGAQWGEEGYIRMQKDISAKEGLCGIAMQASYP 338


>gi|356517358|ref|XP_003527354.1| PREDICTED: LOW QUALITY PROTEIN: vignain-like [Glycine max]
 gi|356577767|ref|XP_003556994.1| PREDICTED: LOW QUALITY PROTEIN: vignain-like [Glycine max]
          Length = 343

 Score =  342 bits (876), Expect = 2e-91,   Method: Compositional matrix adjust.
 Identities = 174/337 (51%), Positives = 230/337 (68%), Gaps = 15/337 (4%)

Query: 7   IVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVE 66
           + +L  SG  A Q T RT  + S+ E+ E+W  +Y + YK+  E  +RF+IFK+N+  +E
Sbjct: 12  LALLFCSGFLAFQVTCRTLQDASMYERHEEWMGRYAKVYKDPQERERRFKIFKENVNYIE 71

Query: 67  RFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKS-SQVPP 125
            FNNAA  N+ YTL +N+FADLT +EFIA +  FK   H  S     T F Y++ + +P 
Sbjct: 72  AFNNAA--NKPYTLGINQFADLTNEEFIAPRNRFK--GHMCSSITRTTTFKYENVTAIPS 127

Query: 126 SVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNN 178
           +V+W +KGAVTP+K QGQC       AVAA EGI+A+   +L+SLSEQ++VDC T   + 
Sbjct: 128 TVDWRQKGAVTPIKDQGQCGCCWAFSAVAATEGIHALSAGKLISLSEQEVVDCDTKGEDQ 187

Query: 179 GCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEE 238
           GC GGFMD AFK+IIQN G+ N+  Y Y+ +  G C++  A +H A IT YEDVP N+E+
Sbjct: 188 GCAGGFMDGAFKFIIQNHGLNNEPNYPYKAVD-GKCNAKAAANHVATITGYEDVPVNNEK 246

Query: 239 SLLKAVANQPVSVAIDASA--LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIK 296
           +L KAVANQPVSVAIDAS    QFY  GVF G C T L+HGVTAVGYG S +G +YWL+K
Sbjct: 247 ALQKAVANQPVSVAIDASGSDFQFYQSGVFTGSCGTELDHGVTAVGYGVSADGTEYWLVK 306

Query: 297 NSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVS 333
           NSWG +WGE+GY R+QR +   +G  GIAM AS+P +
Sbjct: 307 NSWGTEWGEEGYIRMQRGVKAEEGLXGIAMMASYPTA 343


>gi|356515086|ref|XP_003526232.1| PREDICTED: vignain-like [Glycine max]
          Length = 343

 Score =  341 bits (875), Expect = 2e-91,   Method: Compositional matrix adjust.
 Identities = 178/337 (52%), Positives = 235/337 (69%), Gaps = 15/337 (4%)

Query: 7   IVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVE 66
           + +L+     A Q T RT  + S+ E+ EQW  +YG+ YK+  E  KRF +FK+N+  +E
Sbjct: 12  LAMLLCMTFLAFQVTCRTLQDASMYERHEQWMTRYGKVYKDPQEREKRFRVFKENVNYIE 71

Query: 67  RFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKS-SQVPP 125
            FNNAA  N+SY L +N+FADLT +EFIA + GFK    SS ++   T F +++ +  P 
Sbjct: 72  AFNNAA--NKSYKLGINQFADLTNKEFIAPRNGFKGHMCSSIIRT--TTFKFENVTATPS 127

Query: 126 SVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNN 178
           +V+W +KGAVTP+K QGQC       AVAA EGI+A+   +L+SLSEQ+LVDC T   + 
Sbjct: 128 TVDWRQKGAVTPIKDQGQCGCCWAFSAVAATEGIHALSAGKLISLSEQELVDCDTKGVDQ 187

Query: 179 GCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEE 238
           GC GG MDDAFK+IIQN G+  +A Y Y+G+  G C++ +A  +AA IT YEDVP N+E 
Sbjct: 188 GCEGGLMDDAFKFIIQNHGLNTEANYPYKGVD-GKCNANEAAKNAATITGYEDVPANNEM 246

Query: 239 SLLKAVANQPVSVAIDASA--LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIK 296
           +L KAVANQPVSVAIDAS    QFY  GVF G C T L+HGVTAVGYG S++G +YWL+K
Sbjct: 247 ALQKAVANQPVSVAIDASGSDFQFYKSGVFTGSCGTELDHGVTAVGYGVSDDGTEYWLVK 306

Query: 297 NSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVS 333
           NSWG +WGE+GY R+QR +D  +G CGIAM AS+P +
Sbjct: 307 NSWGTEWGEEGYIRMQRGVDSEEGLCGIAMQASYPTA 343


>gi|144905104|dbj|BAF56427.1| cysteine proteinase [Lotus japonicus]
          Length = 342

 Score =  341 bits (874), Expect = 3e-91,   Method: Compositional matrix adjust.
 Identities = 173/338 (51%), Positives = 229/338 (67%), Gaps = 16/338 (4%)

Query: 5   FLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVA 64
           F +V+ +  G  A Q + RT  + S+ E+ EQW A+YG+ YK+  E  KRF IF++N+  
Sbjct: 12  FALVLCL--GLWAFQVSSRTLQDASMHERHEQWMARYGKVYKDLQEKEKRFNIFQENVKY 69

Query: 65  VERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVP 124
           +E  NNA  GN+ Y L +N+F DLT +EFIA++  FK   H SS     T F Y++   P
Sbjct: 70  IEASNNA--GNKPYKLGVNQFTDLTNKEFIATRNKFK--GHMSSSITRTTTFKYENVTAP 125

Query: 125 PSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNN 177
            +V+W ++GAVTPVK QG C       AVAA EGI+ +    LVSLSEQ+LVDC T+  +
Sbjct: 126 STVDWRQEGAVTPVKNQGTCGCCWAFSAVAATEGIHKLSTGNLVSLSEQELVDCDTSGAD 185

Query: 178 NGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDE 237
            GC GG MDDAFK+IIQN G+  +A Y Y+G+  G C++ +   H A IT YEDVP N+E
Sbjct: 186 QGCQGGLMDDAFKFIIQNGGLNTEAQYPYQGVD-GTCNTNEEVTHVATITGYEDVPSNNE 244

Query: 238 ESLLKAVANQPVSVAIDASALQF--YSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLI 295
           ++L +AVANQP+SVAIDAS   F  Y  GVF G C T L+HGV  VGYG S++G KYWL+
Sbjct: 245 QALQQAVANQPISVAIDASGSDFQNYQSGVFTGSCGTQLDHGVAVVGYGVSDDGTKYWLV 304

Query: 296 KNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVS 333
           KNSWG+DWGE+GY R+QRD++ P+G CGIAM  S+P +
Sbjct: 305 KNSWGEDWGEEGYIRMQRDVEAPEGLCGIAMQPSYPTA 342


>gi|357474579|ref|XP_003607574.1| Cysteine protease [Medicago truncatula]
 gi|355508629|gb|AES89771.1| Cysteine protease [Medicago truncatula]
          Length = 345

 Score =  340 bits (871), Expect = 7e-91,   Method: Compositional matrix adjust.
 Identities = 176/327 (53%), Positives = 219/327 (66%), Gaps = 14/327 (4%)

Query: 17  ASQATYRTF-DEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGN 75
           A Q T RT  D+  I EK EQW   YG+ YK+  E   R +IFK+N+  +E  NNA   N
Sbjct: 23  AIQVTSRTLQDDSIIYEKHEQWMVHYGKVYKDLQERENRLKIFKENVNYIEASNNAG-NN 81

Query: 76  RSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAV 135
           + Y L +N+FAD+T +EFIAS+  FK   H  S     + F Y+++ VP +V+W +KGAV
Sbjct: 82  KLYKLGINQFADITNEEFIASRNKFK--GHMCSSITKTSTFKYENASVPSTVDWRKKGAV 139

Query: 136 TPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDA 188
           TPVK QGQC       AVAA EGI+ +   +LVSLSEQ+LVDC T   + GC GG MDDA
Sbjct: 140 TPVKNQGQCGCCWAFSAVAATEGIHKLSTGKLVSLSEQELVDCDTKGVDQGCEGGLMDDA 199

Query: 189 FKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQP 248
           FK+IIQN G+  +A Y Y+G+  G C + +    AA I  YEDVP N+E +L KAVANQP
Sbjct: 200 FKFIIQNHGLHTEAQYPYQGVD-GTCSANETSTPAATIAGYEDVPANNENALQKAVANQP 258

Query: 249 VSVAIDASA--LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGED 306
           +SVAIDAS    QFY  GVF G C T L+HGVTAVGYG S +G KYWL+KNSWG DWGE+
Sbjct: 259 ISVAIDASGSDFQFYKSGVFTGSCGTQLDHGVTAVGYGISNDGTKYWLVKNSWGNDWGEE 318

Query: 307 GYFRLQRDIDQPQGQCGIAMFASFPVS 333
           GY R+QR +D  QG CGIAM AS+P +
Sbjct: 319 GYIRMQRSVDAAQGLCGIAMMASYPTA 345


>gi|225443827|ref|XP_002274223.1| PREDICTED: vignain-like [Vitis vinifera]
          Length = 340

 Score =  338 bits (866), Expect = 3e-90,   Method: Compositional matrix adjust.
 Identities = 179/339 (52%), Positives = 231/339 (68%), Gaps = 15/339 (4%)

Query: 5   FLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVA 64
            + + L+I G  ASQA  RT  E S++E+ E W   YGRTYK+ AE  +RF+IFK+N+  
Sbjct: 7   IICITLLIMGVWASQALSRTLHEVSMSERHEDWMGLYGRTYKDIAEKERRFKIFKENVEY 66

Query: 65  VERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKS-SQV 123
           +E  N+A  GNR Y L +N+FAD T +EF AS+ G+ MS    S +   T F Y++ + V
Sbjct: 67  IESVNSA--GNRRYKLSINEFADQTNEEFKASRNGYNMSSRPRSSEI--TSFRYENVAAV 122

Query: 124 PPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDN 176
           P S++W +KGAVTP+K QGQC       AVAA+EG+  +K   L+SLSEQ+LVDC T+  
Sbjct: 123 PSSMDWRKKGAVTPIKDQGQCGCCWAFSAVAAMEGVTQLKTGELISLSEQELVDCDTSGE 182

Query: 177 NNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPND 236
           + GC GG MD AF++II N G+T +A Y Y+G+    C+  KA   AA+I NYEDVP N 
Sbjct: 183 DQGCGGGLMDSAFEFIIGNGGLTTEANYPYKGVDA-TCNKKKAASSAAKIKNYEDVPANS 241

Query: 237 EESLLKAVANQPVSVAIDA--SALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWL 294
           E +LLKAVA  PVSVAIDA  S  QFYS GVF G C T L+HGVTAVGYG +++G KYWL
Sbjct: 242 EAALLKAVAQHPVSVAIDAGGSDFQFYSSGVFTGQCGTELDHGVTAVGYGKTDDGTKYWL 301

Query: 295 IKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVS 333
           +KNSWG  WGEDGY  ++RDI   +G CGIAM AS+P +
Sbjct: 302 VKNSWGTGWGEDGYIWMERDIGADEGLCGIAMEASYPTA 340


>gi|535454|gb|AAA50755.1| cysteine proteinase [Alnus glutinosa]
          Length = 340

 Score =  338 bits (866), Expect = 3e-90,   Method: Compositional matrix adjust.
 Identities = 178/344 (51%), Positives = 235/344 (68%), Gaps = 18/344 (5%)

Query: 1   MAKYFLIVVLIISGSCASQ-ATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFK 59
           +++ F +VV++  G+ ASQ A  R+  + S+ E+ E+W A YGR YK+  E  KR++IF+
Sbjct: 4   VSQCFCLVVMVTLGALASQLAAARSLQDASMRERHEEWMASYGRVYKDINEKQKRYKIFE 63

Query: 60  DNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYK 119
           +N+  +E  N  A  N+ Y L +N+FADLT +EF AS+  FK   H  S K+  T F Y 
Sbjct: 64  ENVALIESSNKDA--NKPYKLSVNQFADLTNEEFKASRNRFK--GHICSTKS--TSFKYG 117

Query: 120 S-SQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDC 171
           + S VP +++W  KGAVTPVK QGQC       AVAA EGI  +    L+SLSEQ+LVDC
Sbjct: 118 NVSAVPSAMDWRMKGAVTPVKDQGQCGCCWAFSAVAATEGITKLTTGELISLSEQELVDC 177

Query: 172 ATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYED 231
            T+  + GC GG MD+AF +I  N G+ ++A Y Y+G+  G C++ K   HAA+I  +ED
Sbjct: 178 DTSGVDQGCEGGLMDNAFTFIQHNHGLASEANYPYKGVD-GTCNTNKQAIHAAEINGFED 236

Query: 232 VPPNDEESLLKAVANQPVSVAIDA--SALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEG 289
           VP N EE+LL AVA+QPVSVAIDA  S  QFYS GVF G C T L+HGVTAVGYGTS++G
Sbjct: 237 VPANSEEALLNAVAHQPVSVAIDAGGSGFQFYSKGVFIGACGTQLDHGVTAVGYGTSDDG 296

Query: 290 IKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVS 333
            KYWL+KNSWG  WGE+GY R+QRD+D  +G CGIAM AS+P +
Sbjct: 297 TKYWLVKNSWGTQWGEEGYIRMQRDVDAKEGLCGIAMKASYPTA 340


>gi|10336513|dbj|BAB13759.1| cysteine proteinase [Astragalus sinicus]
          Length = 343

 Score =  337 bits (864), Expect = 5e-90,   Method: Compositional matrix adjust.
 Identities = 174/340 (51%), Positives = 232/340 (68%), Gaps = 15/340 (4%)

Query: 4   YFLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLV 63
           Y  + +L+  G  A Q T RT  + S+ E+ +QW  QY + Y +  E  KRF+IFK+N+ 
Sbjct: 9   YISLALLMCLGLWAVQVTSRTLQDASMYERHQQWMGQYAKIYNDHQEWEKRFQIFKENVN 68

Query: 64  AVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKS-SQ 122
            +E  N    G R Y L +N+F DLT +EFIA +  FK    SS ++ N   + Y++ + 
Sbjct: 69  YIETSNKE--GGRFYKLGVNQFVDLTNEEFIAPRNRFKGHMCSSIIRTN--TYKYENVTT 124

Query: 123 VPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATND 175
           VP +V+W +KGAVTPVK QGQC       AVAA EGI+ +   +L+SLSEQ+LVDC T  
Sbjct: 125 VPSNVDWRQKGAVTPVKDQGQCGCCWAFSAVAATEGIHQLSTGKLISLSEQELVDCDTKG 184

Query: 176 NNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPN 235
            + GC GG MDDAFK+IIQN G+  +A Y Y+G+  G C++ +A  +AA IT+YEDVP N
Sbjct: 185 VDQGCEGGLMDDAFKFIIQNHGLDTEAKYPYQGVD-GTCNANEASINAATITSYEDVPTN 243

Query: 236 DEESLLKAVANQPVSVAIDASA--LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYW 293
           +E++L KAVANQP+SVAIDAS    QFY+ GVF G C T L+HGVTAVGYG S++G KYW
Sbjct: 244 NEQALQKAVANQPISVAIDASGSDFQFYTSGVFTGSCGTELDHGVTAVGYGVSDDGTKYW 303

Query: 294 LIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVS 333
           L+KNSWG  WGE+GY R+QR +D  +G CGIAM AS+P++
Sbjct: 304 LVKNSWGTSWGEEGYIRMQRGVDAVEGLCGIAMQASYPIA 343


>gi|124484401|dbj|BAF46311.1| cysteine proteinase precursor [Ipomoea nil]
          Length = 339

 Score =  337 bits (864), Expect = 5e-90,   Method: Compositional matrix adjust.
 Identities = 179/337 (53%), Positives = 228/337 (67%), Gaps = 17/337 (5%)

Query: 5   FLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVA 64
            LI + ++  + A  AT RT  +  +A + EQW AQYGR YK   E +KR+ IFK+N+  
Sbjct: 8   LLIALALVFATSAYLATSRTLLDSLMAVRHEQWMAQYGRVYKNEVEKTKRYNIFKENVEY 67

Query: 65  VERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKS-SQV 123
           +E FN A  G + Y L +N FADLT +EFIAS+ G+ +    SS     TPF Y++ S V
Sbjct: 68  IESFNKA--GTKPYKLGINAFADLTNKEFIASRNGYILPHECSS----NTPFRYENVSAV 121

Query: 124 PPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDN 176
           P +V+W +KGAVTPVK QGQC       AVAA+EGI  +    L+SLSEQ+LVDC     
Sbjct: 122 PTTVDWRKKGAVTPVKDQGQCGCCWAFSAVAAMEGITKLSTGNLISLSEQELVDCDVKGI 181

Query: 177 NNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPND 236
           + GC GG MDDAF +II NKG+T ++ Y Y+G + G C   K+ + AA+I+ YEDVP N 
Sbjct: 182 DQGCEGGLMDDAFTFIINNKGLTTESNYPYQG-TDGSCKKSKSSNSAAKISGYEDVPANS 240

Query: 237 EESLLKAVANQPVSVAIDA--SALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWL 294
           E +L KAVANQPVSVAIDA  S  QFYS GVF G C T L+HGVTAVGYG +E+G KYWL
Sbjct: 241 ESALEKAVANQPVSVAIDAGGSDFQFYSSGVFTGECGTELDHGVTAVGYGIAEDGSKYWL 300

Query: 295 IKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFP 331
           +KNSWG  WGE GY R+Q+DI+  +G CGIAM +S+P
Sbjct: 301 VKNSWGTSWGEKGYIRMQKDIEAKEGLCGIAMQSSYP 337


>gi|13491750|gb|AAK27968.1|AF242372_1 cysteine protease [Ipomoea batatas]
          Length = 339

 Score =  336 bits (862), Expect = 7e-90,   Method: Compositional matrix adjust.
 Identities = 178/322 (55%), Positives = 220/322 (68%), Gaps = 17/322 (5%)

Query: 20  ATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYT 79
           AT RT  +  +  + EQW AQYGR YK  AE +KRF IFK+N+  +E FN A  G + Y 
Sbjct: 23  ATSRTLSDSLMVVRHEQWMAQYGRVYKTEAEKTKRFNIFKENVEYIESFNKA--GTKPYK 80

Query: 80  LRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKS-SQVPPSVNWIEKGAVTPV 138
           L +N FADLT QEF AS+ G+K+    SS     TPF Y++ S VP +V+W  KGAVTPV
Sbjct: 81  LGINAFADLTNQEFKASRNGYKLPHDCSS----NTPFRYENVSSVPTTVDWRTKGAVTPV 136

Query: 139 KYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKY 191
           K QGQC       AVAA+EGI  +    L+SLSEQ+LVDC     + GC GG MDDAF +
Sbjct: 137 KDQGQCGCCWAFSAVAAMEGITKLSTGNLISLSEQELVDCDVKGTDQGCEGGLMDDAFSF 196

Query: 192 IIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSV 251
           II NKG+T ++ Y Y+G + G C   K+ + AA+I+ YEDVP N E +L KAVANQPVSV
Sbjct: 197 IINNKGLTTESNYPYQG-TDGSCKKSKSSNSAAKISGYEDVPANSESALEKAVANQPVSV 255

Query: 252 AIDA--SALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYF 309
           AIDA  S  QFYS GVF G C T L+HGVTAVGYG +E+G KYWL+KNSWG  WGE GY 
Sbjct: 256 AIDAGGSDFQFYSSGVFTGECGTELDHGVTAVGYGIAEDGSKYWLVKNSWGTSWGEKGYI 315

Query: 310 RLQRDIDQPQGQCGIAMFASFP 331
           R+Q+DI+  +G CGIAM +S+P
Sbjct: 316 RMQKDIEAKEGLCGIAMQSSYP 337


>gi|37780047|gb|AAP32196.1| cysteine protease 8 [Trifolium repens]
          Length = 343

 Score =  336 bits (862), Expect = 8e-90,   Method: Compositional matrix adjust.
 Identities = 177/341 (51%), Positives = 227/341 (66%), Gaps = 16/341 (4%)

Query: 4   YFLIVVLIIS-GSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNL 62
           Y + + L+   G  A Q T RT  + S+ E+  QW +QYG+ YK+  E   RF+IF +N+
Sbjct: 8   YHISLALVFCLGLFAIQVTSRTLQDDSMYERHGQWMSQYGKIYKDHQERETRFKIFTENV 67

Query: 63  VAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKS-S 121
             VE  N  A   +SY L +N+FADLT +EF+AS+  FK   H  S     T F Y++ S
Sbjct: 68  NYVEASN--ADDTKSYKLGINQFADLTNEEFVASRNKFK--GHMCSSITRTTTFKYENVS 123

Query: 122 QVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATN 174
            +P +V+W +KGAVTPVK QGQC       AVAA EGI+ +   +L+SLSEQ+LVDC T 
Sbjct: 124 AIPSTVDWRKKGAVTPVKNQGQCGCCWAFSAVAATEGIHKLSTGKLISLSEQELVDCDTK 183

Query: 175 DNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPP 234
             + GC GG MDDAFK+IIQN G++ +A Y YEG+  G C++ KA   A  IT YEDVP 
Sbjct: 184 GVDQGCEGGLMDDAFKFIIQNHGLSTEAQYPYEGVD-GTCNANKASVQAVTITGYEDVPA 242

Query: 235 NDEESLLKAVANQPVSVAIDASA--LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKY 292
           N E++L KAVANQP+SVAIDAS    QFY  GVF G C T L+HGVTAVGYG S +G KY
Sbjct: 243 NSEQALQKAVANQPISVAIDASGSDFQFYKSGVFTGSCGTELDHGVTAVGYGVSNDGTKY 302

Query: 293 WLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVS 333
           WL+KNSWG DWGE+GY  +QR ++  +G CGIAM AS+P +
Sbjct: 303 WLVKNSWGTDWGEEGYIMMQRGVEAAEGLCGIAMQASYPTA 343


>gi|356539398|ref|XP_003538185.1| PREDICTED: vignain-like [Glycine max]
          Length = 343

 Score =  336 bits (862), Expect = 8e-90,   Method: Compositional matrix adjust.
 Identities = 167/341 (48%), Positives = 226/341 (66%), Gaps = 16/341 (4%)

Query: 1   MAKYFLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKD 60
           + +YF + + ++   CA +   RT ++  + E+ EQW A +G+ Y  S E  ++++ FK+
Sbjct: 7   LFQYFTLALCLVFAFCAFEGNARTLEDAPMRERHEQWMAIHGKVYTHSYEKEQKYQTFKE 66

Query: 61  NLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKS 120
           N+  +E FN+A  GN+ Y L +N FADLT +EF A     +   H  S       F Y++
Sbjct: 67  NVQRIEAFNHA--GNKPYKLGINHFADLTNEEFKAIN---RFKGHVCSKITRTPTFRYEN 121

Query: 121 -SQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCA 172
            + VP +++W ++GAVTP+K QGQC       AVAA EGI  +   +L+SLSEQ+LVDC 
Sbjct: 122 MTAVPATLDWRQEGAVTPIKDQGQCGCCWAFSAVAATEGITKLSTGKLISLSEQELVDCD 181

Query: 173 TNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDV 232
           T   + GC GG MDDAFK+I+QNKG+  +A+Y YEG+  G C++    +HA  I  YEDV
Sbjct: 182 TKGVDQGCEGGLMDDAFKFILQNKGLAAEAIYPYEGVD-GTCNAKAEGNHATSIKGYEDV 240

Query: 233 PPNDEESLLKAVANQPVSVAIDASAL--QFYSGGVFNGYCETFLNHGVTAVGYGTSEEGI 290
           P N E +LLKAVANQPVSVAI+AS    QFYSGGVF G C T L+HGVTAVGYG S++G 
Sbjct: 241 PANSESALLKAVANQPVSVAIEASGFEFQFYSGGVFTGSCGTNLDHGVTAVGYGVSDDGT 300

Query: 291 KYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFP 331
           KYWL+KNSWG  WG+ GY R+QRD+   +G CGIAM AS+P
Sbjct: 301 KYWLVKNSWGVKWGDKGYIRMQRDVAAKEGLCGIAMLASYP 341


>gi|144905108|dbj|BAF56428.1| cysteine proteinase [Lotus japonicus]
          Length = 342

 Score =  336 bits (861), Expect = 1e-89,   Method: Compositional matrix adjust.
 Identities = 171/337 (50%), Positives = 223/337 (66%), Gaps = 16/337 (4%)

Query: 7   IVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVE 66
           + +L++ G  A +A  RT ++ S+ E+ EQW  QYG+ Y +S E   R  IFK+N+  +E
Sbjct: 12  LALLLVFGFLAFEANARTLEDVSLKERHEQWMTQYGKVYTDSYEKELRSNIFKENVQRIE 71

Query: 67  RFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKS-SQVPP 125
            FNNA  GN+ Y L +N+FADLT +EF A     +   H  S       F Y+  S VP 
Sbjct: 72  AFNNA--GNKPYKLGINQFADLTNEEFKARN---RFKGHMCSNSTRTPTFKYEDVSSVPA 126

Query: 126 SVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNN 178
           S++W +KGAVTP+K QGQC       AVAA EGI  +   +L+SLSEQ+LVDC T   + 
Sbjct: 127 SLDWRQKGAVTPIKDQGQCGCCWAFSAVAATEGITKLSTGKLISLSEQELVDCDTKGVDQ 186

Query: 179 GCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEE 238
           GC GG MDDAFK+I+QNKG+  +A Y Y+G+    C++      AA I  +EDVP N E 
Sbjct: 187 GCEGGLMDDAFKFIMQNKGLNTEAKYPYQGVD-ATCNANAEAKDAASIKGFEDVPANSES 245

Query: 239 SLLKAVANQPVSVAIDASA--LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIK 296
           +LLKAVANQP+SVAIDAS    QFYS G+F G C T L+HGVTAVGYG S++G KYWL+K
Sbjct: 246 ALLKAVANQPISVAIDASGSEFQFYSSGLFTGSCGTELDHGVTAVGYGVSDDGTKYWLVK 305

Query: 297 NSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVS 333
           NSWG+ WGE+GY R+QRD+   +G CGIAM AS+P +
Sbjct: 306 NSWGEQWGEEGYIRMQRDVAAEEGLCGIAMQASYPTA 342


>gi|144905116|dbj|BAF56430.1| cysteine proteinase [Lotus japonicus]
          Length = 341

 Score =  333 bits (855), Expect = 5e-89,   Method: Compositional matrix adjust.
 Identities = 172/337 (51%), Positives = 224/337 (66%), Gaps = 17/337 (5%)

Query: 7   IVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVE 66
           + +L++ G  + +A  RT ++ S+ E+ EQW AQYG+ YK+S E   R +IFK+N+  +E
Sbjct: 12  LTLLLVFGFLSFEANARTLEDASMHERHEQWMAQYGKVYKDSYEKELRSKIFKENVQRIE 71

Query: 67  RFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKS-SQVPP 125
            FNNA  GN+SY L +N+FADLT +EF A     +   H  S       F Y+  + VP 
Sbjct: 72  AFNNA--GNKSYKLGINQFADLTNEEFKARN---RFKGHMCSNSTRTPTFKYEHVTSVPA 126

Query: 126 SVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNN 178
           S++W +KGAVTP+K QGQC       AVAA EGI  +   +L+SLSEQ+LVDC T   + 
Sbjct: 127 SLDWRQKGAVTPIKDQGQCGCCWAFSAVAATEGITKLSTGKLISLSEQELVDCDTKGVDQ 186

Query: 179 GCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEE 238
           GC GG MDDAFK+I+QNKG+  +A Y Y+G+    C++      AA I  +EDVP N E 
Sbjct: 187 GCEGGLMDDAFKFIMQNKGLNTEAKYPYQGVD-ATCNANAEAKDAASIKGFEDVPANSES 245

Query: 239 SLLKAVANQPVSVAIDASA--LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIK 296
           +LLKAVANQP+SVAIDAS    QFYS GVF G C T L+HGVTAVGYG S+ G KYWL+K
Sbjct: 246 ALLKAVANQPISVAIDASGSEFQFYSSGVFTGSCGTELDHGVTAVGYG-SDGGTKYWLVK 304

Query: 297 NSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVS 333
           NSWG+ WGE GY R+QRD+   +G CG AM AS+P +
Sbjct: 305 NSWGEQWGEQGYIRMQRDVAAEEGLCGFAMQASYPTA 341


>gi|356517350|ref|XP_003527350.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like [Glycine
           max]
 gi|356577765|ref|XP_003556993.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like [Glycine
           max]
          Length = 343

 Score =  332 bits (851), Expect = 2e-88,   Method: Compositional matrix adjust.
 Identities = 172/337 (51%), Positives = 225/337 (66%), Gaps = 15/337 (4%)

Query: 7   IVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVE 66
           + +    G  A Q   RT  + S+ E+ EQW A+YG+ YK+  E  KRF +FK+N+  +E
Sbjct: 12  LALFFCLGFLAFQVASRTLQDASMYERHEQWMARYGKVYKDPEEKEKRFRVFKENVNYIE 71

Query: 67  RFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQV-PP 125
            FNNAA  N+ Y L +N+FADLT +EFI  +  F  + H+ S     T F Y++  V P 
Sbjct: 72  AFNNAA--NKPYKLGINQFADLTSEEFIVPRNRF--NGHTRSSNTRTTTFKYENVTVLPD 127

Query: 126 SVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNN 178
           S++W +KGAVTP+K QG C       A+AA EGI+ I   +LVSLSEQ++VDC T   ++
Sbjct: 128 SIDWRQKGAVTPIKNQGSCGCCWAFSAIAATEGIHKISTGKLVSLSEQEVVDCDTKGTDH 187

Query: 179 GCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEE 238
           GC GG+MD AFK+IIQN GI  +A Y Y+G+  G C+  +   HAA IT YEDVP N+E+
Sbjct: 188 GCEGGYMDGAFKFIIQNHGINTEASYPYKGVD-GKCNIKEEAVHAATITGYEDVPINNEK 246

Query: 239 SLLKAVANQPVSVAIDASA--LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIK 296
           +L KAVANQPVSVAIDAS    QFY  G+F G C T L+HGVTAVGYG + EG KYWL+K
Sbjct: 247 ALQKAVANQPVSVAIDASGADFQFYKSGIFTGSCGTELDHGVTAVGYGENNEGTKYWLVK 306

Query: 297 NSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVS 333
           NSWG +WGE+GY  +QR +   +G CGIAM AS+P +
Sbjct: 307 NSWGTEWGEEGYIMMQRGVKAVEGICGIAMMASYPTA 343


>gi|255580657|ref|XP_002531151.1| cysteine protease, putative [Ricinus communis]
 gi|223529264|gb|EEF31236.1| cysteine protease, putative [Ricinus communis]
          Length = 340

 Score =  332 bits (851), Expect = 2e-88,   Method: Compositional matrix adjust.
 Identities = 170/337 (50%), Positives = 230/337 (68%), Gaps = 18/337 (5%)

Query: 7   IVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVE 66
           + ++ + G+  SQA  RT  + S+ EK E+W +++GR Y +  E   R++IFK+N+  +E
Sbjct: 12  LALIFLLGALVSQAMARTLQDASMHEKHEEWMSRFGRVYNDGNEKEIRYKIFKENVQRIE 71

Query: 67  RFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKS-SQVPP 125
            FN A+   +SY L +N+FADLT +EF  S+  FK   H  S +A   PF Y++ +  P 
Sbjct: 72  SFNKAS--GKSYKLGINQFADLTNEEFKTSRNRFK--GHMCSSQAG--PFRYENLTAAPS 125

Query: 126 SVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNN 178
           S++W +KGAVT +K QGQC       AVAAVEGI  +  ++L+SLSEQ+LVDC T   + 
Sbjct: 126 SMDWRKKGAVTAIKDQGQCGSCWAFSAVAAVEGITQLATSKLISLSEQELVDCDTKGEDQ 185

Query: 179 GCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEE 238
           GC GG MDDAFK+I QN+G+T +A Y YEG S G C++ +  +HAA+I  +EDVP N+E 
Sbjct: 186 GCQGGLMDDAFKFIEQNQGLTTEANYPYEG-SDGTCNTKQEANHAAKINGFEDVPANNEG 244

Query: 239 SLLKAVANQPVSVAIDAS--ALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIK 296
           +L+KAVA QPVSVAIDA     QFYS G+F G C T L+HGV AVGYG S  G+ YWL+K
Sbjct: 245 ALMKAVAKQPVSVAIDAGGFGFQFYSSGIFTGDCGTELDHGVAAVGYGES-NGMNYWLVK 303

Query: 297 NSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVS 333
           NSWG  WGE+GY R+Q+DID  +G CGIAM AS+P +
Sbjct: 304 NSWGTQWGEEGYIRMQKDIDAKEGLCGIAMQASYPTA 340


>gi|24285904|gb|AAL14199.1| cysteine proteinase precursor [Ipomoea batatas]
 gi|56961686|gb|AAK15148.2| cysteine proteinase-like protein [Ipomoea batatas]
          Length = 341

 Score =  332 bits (851), Expect = 2e-88,   Method: Compositional matrix adjust.
 Identities = 176/322 (54%), Positives = 219/322 (68%), Gaps = 17/322 (5%)

Query: 20  ATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYT 79
           AT RT  +  +  + EQW AQYGR Y+   E +KRF IFK+N+  +E FN A  G + Y 
Sbjct: 25  ATSRTLSDSLMVVRHEQWMAQYGRVYENEVEKTKRFNIFKENVEYIESFNKA--GTKPYK 82

Query: 80  LRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKS-SQVPPSVNWIEKGAVTPV 138
           L +N FADLT QEF AS+ G+K+    SS     TPF Y++ S VP +V+W  KGAVTPV
Sbjct: 83  LGINAFADLTNQEFKASRNGYKLPHDCSS----NTPFRYENVSSVPTTVDWRTKGAVTPV 138

Query: 139 KYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKY 191
           K QGQC       AVAA+EGI  +    L+SLSEQ+LVDC     + GC GG MDDAF +
Sbjct: 139 KDQGQCGCCWAFSAVAAMEGITKLSTGNLISLSEQELVDCDVKGIDQGCEGGLMDDAFSF 198

Query: 192 IIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSV 251
           II NKG+T ++ Y Y+G + G C   K+ + AA+I+ YEDVP N E +L KAVANQPVSV
Sbjct: 199 IINNKGLTTESNYPYQG-TDGSCKKSKSSNSAAKISGYEDVPANSESALEKAVANQPVSV 257

Query: 252 AIDA--SALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYF 309
           AIDA  S  QFYS GVF G C T L+HGVTAVGYG +E+G KYWL+KNSWG  WGE GY 
Sbjct: 258 AIDAGGSDFQFYSSGVFTGECGTELDHGVTAVGYGIAEDGSKYWLVKNSWGTSWGEKGYI 317

Query: 310 RLQRDIDQPQGQCGIAMFASFP 331
           R+Q+DI+  +G CGIAM +S+P
Sbjct: 318 RMQKDIEAKEGLCGIAMQSSYP 339


>gi|357471211|ref|XP_003605890.1| Cysteine proteinase [Medicago truncatula]
 gi|355506945|gb|AES88087.1| Cysteine proteinase [Medicago truncatula]
          Length = 343

 Score =  332 bits (850), Expect = 2e-88,   Method: Compositional matrix adjust.
 Identities = 172/327 (52%), Positives = 221/327 (67%), Gaps = 15/327 (4%)

Query: 17  ASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNR 76
           A Q T RT  +  + E+  QW +QYG+ YK+S E  KRF+IF +N+  +E FN     N+
Sbjct: 22  AIQVTSRTLQD-DMYERHRQWMSQYGKVYKDSQEREKRFKIFTENVNYIEAFNKGD-NNK 79

Query: 77  SYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYK-SSQVPPSVNWIEKGAV 135
            YTL +N+FADLT  EF +S+  FK   H  S     + F Y+ +S +P SV+W +KGAV
Sbjct: 80  LYTLGVNQFADLTNDEFTSSRNKFK--GHMCSSITRTSTFKYENASAIPSSVDWRKKGAV 137

Query: 136 TPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDA 188
           TPVK QGQC       AVAA EGI+ +   +L+SLSEQ+LVDC T   + GC GG MDDA
Sbjct: 138 TPVKNQGQCGCCWAFSAVAATEGIHKLSTGKLISLSEQELVDCDTKGVDQGCEGGLMDDA 197

Query: 189 FKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQP 248
           FK+IIQN G+  +A Y Y+G+  G C++ K   +A  IT YEDVP N+E++L KAVANQP
Sbjct: 198 FKFIIQNHGLNTEANYPYQGVD-GTCNANKGSINAVTITGYEDVPTNNEQALQKAVANQP 256

Query: 249 VSVAIDASA--LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGED 306
           +SVAIDAS    QFY  GVF G C T L+HGVTAVGYG S +G KYWL+KNSWG +WGE+
Sbjct: 257 ISVAIDASGSDFQFYKSGVFTGSCGTELDHGVTAVGYGVSNDGTKYWLVKNSWGTEWGEE 316

Query: 307 GYFRLQRDIDQPQGQCGIAMFASFPVS 333
           GY  +QR +D  +G CGIAM AS+P +
Sbjct: 317 GYIMMQRGVDAAEGLCGIAMQASYPTA 343


>gi|356554921|ref|XP_003545789.1| PREDICTED: LOW QUALITY PROTEIN: thiol protease SEN102-like [Glycine
           max]
          Length = 439

 Score =  331 bits (849), Expect = 3e-88,   Method: Compositional matrix adjust.
 Identities = 177/337 (52%), Positives = 232/337 (68%), Gaps = 15/337 (4%)

Query: 7   IVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVE 66
           + +L+ +   A Q T  T  + S+ E+ EQW  ++G+ YK+  E  KRF IF +N+  VE
Sbjct: 108 LAMLLCTAFLAFQVTCCTLQDASMYERHEQWMTRHGKVYKDPREREKRFRIFNENVNYVE 167

Query: 67  RFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKS-SQVPP 125
            FNNAA  N+ Y L +N+F DLT QEFIA +  FK    SS ++   T F Y++ + VP 
Sbjct: 168 AFNNAA--NKPYKLGINQFXDLTNQEFIAPRNRFKGHMCSSIIRT--TTFKYENVTTVPS 223

Query: 126 SVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNN 178
           +V+W + GAVTPVK QGQC       AVAA EGI+A+   +L+SLSEQ+LVDC T   + 
Sbjct: 224 TVDWRQNGAVTPVKDQGQCGCCWAFSAVAATEGIHALSGGKLISLSEQELVDCDTKGVDQ 283

Query: 179 GCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEE 238
           GC GG MDDA+K+IIQN G+  +A Y Y+G+  G C++ +A +HAA IT YEDVP N+E+
Sbjct: 284 GCEGGLMDDAYKFIIQNHGLNTEANYPYKGVD-GKCNANEAANHAATITGYEDVPANNEK 342

Query: 239 SLLKAVANQPVSVAIDASA--LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIK 296
           +L KAVANQPVSVAIDAS+   QFY  G F G C T L+HGVTAVGYG S+ G KYWL+K
Sbjct: 343 ALQKAVANQPVSVAIDASSSDFQFYKSGAFTGSCGTELDHGVTAVGYGVSDHGTKYWLVK 402

Query: 297 NSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVS 333
           NSWG +WGE+GY R+QR +D  +G CGIAM AS+P +
Sbjct: 403 NSWGTEWGEEGYIRMQRGVDSEEGVCGIAMQASYPTA 439


>gi|255580659|ref|XP_002531152.1| cysteine protease, putative [Ricinus communis]
 gi|223529265|gb|EEF31237.1| cysteine protease, putative [Ricinus communis]
          Length = 340

 Score =  331 bits (849), Expect = 3e-88,   Method: Compositional matrix adjust.
 Identities = 171/337 (50%), Positives = 230/337 (68%), Gaps = 18/337 (5%)

Query: 7   IVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVE 66
           + ++   G+ ASQA  RT  + SI EK E+W  ++ R Y ++ E   R++IFK+N+  +E
Sbjct: 12  LALIFFLGALASQAIARTLQDASIHEKHEEWMTRFKRVYSDAKEKEIRYKIFKENVQRIE 71

Query: 67  RFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKS-SQVPP 125
            FN A+   +SY L +N+FADLT +EF  S+  FK   H  S +A   PF Y++ + VP 
Sbjct: 72  SFNKAS--EKSYKLGINQFADLTNEEFKTSRNRFK--GHMCSSQAG--PFRYENITAVPS 125

Query: 126 SVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNN 178
           S++W ++GAVT +K QGQC       AVAAVEGI  +  ++L+SLSEQ+LVDC T   + 
Sbjct: 126 SMDWRKEGAVTAIKDQGQCGSCWAFSAVAAVEGITQLATSKLISLSEQELVDCDTKGEDQ 185

Query: 179 GCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEE 238
           GC GG MDDAFK+I QN+G+T +A Y YEG S G C++ +  +HAA+I  +EDVP N+E 
Sbjct: 186 GCQGGLMDDAFKFIEQNQGLTTEANYPYEG-SDGTCNTKQEANHAAKINGFEDVPANNEG 244

Query: 239 SLLKAVANQPVSVAIDASA--LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIK 296
           +L+KAVA QPVSVAIDA     QFYS G+F G C T L+HGV AVGYG S  G+ YWL+K
Sbjct: 245 ALMKAVAKQPVSVAIDAGGFEFQFYSSGIFTGDCGTELDHGVAAVGYGES-NGMNYWLVK 303

Query: 297 NSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVS 333
           NSWG  WGE+GY R+Q+DID  +G CGIAM AS+P +
Sbjct: 304 NSWGTQWGEEGYIRMQKDIDAKEGLCGIAMQASYPTA 340


>gi|356515048|ref|XP_003526213.1| PREDICTED: vignain-like [Glycine max]
          Length = 350

 Score =  331 bits (849), Expect = 3e-88,   Method: Compositional matrix adjust.
 Identities = 176/349 (50%), Positives = 229/349 (65%), Gaps = 22/349 (6%)

Query: 3   KYFLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNL 62
           ++ L +VL++S  C SQ   R   E S++E+ EQW  +YG+ YK++AE  KR  IFKDN+
Sbjct: 8   QHILALVLLLS-ICTSQVMSRNLHEASMSERHEQWMKKYGKVYKDAAEKQKRLLIFKDNV 66

Query: 63  VAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKS-S 121
             +E FN  A GN+ Y L +N  AD T +EF+AS  G+K     S      TPF Y + +
Sbjct: 67  EFIESFN--AAGNKPYKLSINHLADQTNEEFVASHNGYKYKGSHSQ-----TPFKYGNVT 119

Query: 122 QVPPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATN 174
            +P +V+W + GAVT VK QGQC        VAA EGI  I    L+SLSEQ+LVDC + 
Sbjct: 120 DIPTAVDWRQNGAVTAVKDQGQCGSCWAFSTVAATEGIYQISTGMLMSLSEQELVDCDSV 179

Query: 175 DNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPP 234
           D+  GC GG M+D F++II+N GI+++A Y Y  +  G CD+ K    AAQI  YE VP 
Sbjct: 180 DH--GCDGGLMEDGFEFIIKNGGISSEANYPYTAVD-GTCDASKEASPAAQIKGYETVPA 236

Query: 235 NDEESLLKAVANQPVSVAIDA--SALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGI-K 291
           N EE+L +AVANQPVSV+IDA  S  QFYS GVF G C T L+HGVT VGYGT+++G  +
Sbjct: 237 NSEEALQQAVANQPVSVSIDAGGSGFQFYSSGVFTGQCGTQLDHGVTVVGYGTTDDGTHE 296

Query: 292 YWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVSKESAQPS 340
           YW++KNSWG  WGE+GY R+QR ID  +G CGIAM AS+P+ K S  PS
Sbjct: 297 YWIVKNSWGTQWGEEGYIRMQRGIDAQEGLCGIAMDASYPMGKSSDSPS 345


>gi|225446589|ref|XP_002280263.1| PREDICTED: vignain [Vitis vinifera]
          Length = 339

 Score =  331 bits (848), Expect = 3e-88,   Method: Compositional matrix adjust.
 Identities = 169/341 (49%), Positives = 229/341 (67%), Gaps = 19/341 (5%)

Query: 3   KYFLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNL 62
           +Y  + +L I  + ASQAT R+  E S+ E+ E W A+YGR YK++ E  KRF+IFKDN+
Sbjct: 8   QYVSMALLFILAAWASQATSRSLHEASMYERHEDWMARYGRMYKDANEKEKRFKIFKDNV 67

Query: 63  VAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKS-S 121
             +E FN A   +++Y L +N+FADLT +EF + +  FK     + + +  T F Y++ +
Sbjct: 68  ARIESFNKAM--DKTYKLSINEFADLTNEEFRSLRNRFK-----AHICSEATTFKYENVT 120

Query: 122 QVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATN 174
            VP +++W +KGAVTP+K Q QC       AVAA EGI  I   +L+SLSEQ+LVDC T 
Sbjct: 121 AVPSTIDWRKKGAVTPIKDQQQCGCCWAFSAVAATEGITQITTGKLISLSEQELVDCDTG 180

Query: 175 DNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPP 234
             N GC GG MDDAF++I +  G+ ++A Y YEG   G C+S K    AA+I  YEDVP 
Sbjct: 181 GENQGCSGGLMDDAFRFI-KIHGLASEATYPYEG-DDGTCNSKKEAHPAAKIKGYEDVPA 238

Query: 235 NDEESLLKAVANQPVSVAIDASAL--QFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKY 292
           N+E++L KAVA+QPV+VAIDA     QFY+ GVF G C T L+HGV AVGYG  ++G+ Y
Sbjct: 239 NNEKALQKAVAHQPVAVAIDAGGFEFQFYTSGVFTGQCGTELDHGVAAVGYGIGDDGMMY 298

Query: 293 WLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVS 333
           WL+KNSWG  WGE+GY R+QRD+   +G CGIAM AS+P +
Sbjct: 299 WLVKNSWGTGWGEEGYIRMQRDVTAKEGLCGIAMQASYPTA 339


>gi|50355613|dbj|BAD29955.1| cysteine protease [Daucus carota]
          Length = 365

 Score =  330 bits (846), Expect = 5e-88,   Method: Compositional matrix adjust.
 Identities = 170/337 (50%), Positives = 226/337 (67%), Gaps = 17/337 (5%)

Query: 7   IVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVE 66
           + +L   G  AS A  R+ +E S+ E  +QW A+YGR YK + E ++R  IF++NL  ++
Sbjct: 12  LALLFTIGVLASLAAARSLNEASMTETHDQWMARYGRVYKTANEKNRRSTIFQENLKYIQ 71

Query: 67  RFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKS-SQVPP 125
            FN A   N+ Y L +N+FADLT +EF  S+  FK   H  +   N   F Y++ + VP 
Sbjct: 72  TFNKA--NNKPYKLGVNEFADLTNEEFTTSRNKFK--SHVCATVTN--VFRYENVTAVPA 125

Query: 126 SVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNN 178
           +++W +KGAVTP+K QGQC       AVAA+EGI  +K  +L+SLSEQ+LVDC TN  + 
Sbjct: 126 TMDWRKKGAVTPIKNQGQCGCCWAFSAVAAMEGITQLKTGKLISLSEQELVDCDTNGEDQ 185

Query: 179 GCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEE 238
           GC GG MD AF +I QN G++ +  Y Y G + G C++ K  +HAA IT +EDVP N E 
Sbjct: 186 GCEGGLMDYAFDFIQQNHGLSTETNYPYSG-TDGTCNANKEANHAATITGHEDVPANSES 244

Query: 239 SLLKAVANQPVSVAIDASA--LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIK 296
           +LLKAVANQP+SVAIDAS    QFYS GVF G C T L+HGVTAVGYGT+ +G KYWL+K
Sbjct: 245 ALLKAVANQPISVAIDASGSDFQFYSSGVFTGECGTELDHGVTAVGYGTAADGTKYWLVK 304

Query: 297 NSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVS 333
           NSWG  WGE+GY ++QR +   +G CGIAM AS+P +
Sbjct: 305 NSWGTSWGEEGYIQMQRGVAAAEGLCGIAMQASYPTA 341


>gi|224135841|ref|XP_002327317.1| predicted protein [Populus trichocarpa]
 gi|222835687|gb|EEE74122.1| predicted protein [Populus trichocarpa]
          Length = 342

 Score =  330 bits (846), Expect = 6e-88,   Method: Compositional matrix adjust.
 Identities = 167/339 (49%), Positives = 225/339 (66%), Gaps = 16/339 (4%)

Query: 5   FLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVA 64
           F   +LI+ G  A +   R   E S++ + EQW   +G+ Y ++AE  +RFEIFKDN+  
Sbjct: 10  FFAFILIL-GMWAYEVASRELQEPSMSARHEQWMETFGKVYADAAEKERRFEIFKDNVEY 68

Query: 65  VERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKS-SQV 123
           +E FN A  GN+ Y L +NKFADLT +E   ++ G++    +  +K   T F Y++ + V
Sbjct: 69  IESFNTA--GNKPYKLSVNKFADLTNEELKVARNGYRRPLQTRPMKV--TSFKYENVTAV 124

Query: 124 PPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDN 176
           P +++W +KGAVTP+K QGQC        VAA EGIN +   +LVSLSEQ+LVDC T   
Sbjct: 125 PATMDWRKKGAVTPIKDQGQCGSCWAFSTVAATEGINQLTTGKLVSLSEQELVDCDTQGE 184

Query: 177 NNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPND 236
           + GC GG M+D F++II+N GIT +A Y Y+  + G C+S K     A+IT YE VP N 
Sbjct: 185 DQGCEGGLMEDGFEFIIKNHGITTEANYPYQA-ADGTCNSKKEASRIAKITGYESVPANS 243

Query: 237 EESLLKAVANQPVSVAIDA--SALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWL 294
           E +LLKAVA+QP+SV+IDA  S  QFYS GVF G C T L+HGVTAVGYG + +G KYWL
Sbjct: 244 EAALLKAVASQPISVSIDAGGSDFQFYSSGVFTGQCGTELDHGVTAVGYGETSDGTKYWL 303

Query: 295 IKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVS 333
           +KNSWG  WGE+GY R+QRD +  +G CGIAM +S+P +
Sbjct: 304 VKNSWGTSWGEEGYIRMQRDTEAEEGLCGIAMDSSYPTA 342


>gi|356543124|ref|XP_003540013.1| PREDICTED: vignain-like [Glycine max]
 gi|356543126|ref|XP_003540014.1| PREDICTED: vignain-like [Glycine max]
          Length = 337

 Score =  330 bits (846), Expect = 7e-88,   Method: Compositional matrix adjust.
 Identities = 174/341 (51%), Positives = 227/341 (66%), Gaps = 21/341 (6%)

Query: 3   KYFLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNL 62
           ++ L +VL++S  C SQ   R   E S++E+ EQW  +YG+ YK++AE  KR  IFKDN+
Sbjct: 8   QHILALVLLLS-ICTSQVMSRNLHEASMSERHEQWMKKYGKVYKDAAEKQKRLLIFKDNV 66

Query: 63  VAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKS-S 121
             +E FN  A GNR Y L +N  AD T +EF+AS  G+K     S      TPF Y++ +
Sbjct: 67  EFIESFN--AAGNRPYKLSINHLADQTNEEFVASHNGYKHKGSHSQ-----TPFKYENVT 119

Query: 122 QVPPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATN 174
            VP +V+W E GAVT VK QGQC        VAA EGI  I  + L+SLSEQ+LVDC + 
Sbjct: 120 GVPNAVDWRENGAVTAVKDQGQCGSCWAFSTVAATEGIYQITTSMLMSLSEQELVDCDSV 179

Query: 175 DNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPP 234
           D  +GC GG+M+  F++II+N GI+++A Y Y  +  G CD+ K    AAQI  YE VP 
Sbjct: 180 D--HGCDGGYMEGGFEFIIKNGGISSEANYPYTAVD-GTCDANKEASPAAQIKGYETVPA 236

Query: 235 NDEESLLKAVANQPVSVAIDA--SALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKY 292
           N E++L KAVANQPVSV IDA  SA QFYS GVF G C T L+HGVTAVGYG++++G +Y
Sbjct: 237 NSEDALQKAVANQPVSVTIDAGGSAFQFYSSGVFTGQCGTQLDHGVTAVGYGSTDDGTQY 296

Query: 293 WLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVS 333
           W++KNSWG  WGE+GY R+QR  D  +G CGIAM AS+P +
Sbjct: 297 WIVKNSWGTQWGEEGYIRMQRGTDAQEGLCGIAMDASYPTA 337


>gi|356543038|ref|XP_003539970.1| PREDICTED: vignain-like [Glycine max]
          Length = 343

 Score =  330 bits (845), Expect = 8e-88,   Method: Compositional matrix adjust.
 Identities = 175/337 (51%), Positives = 231/337 (68%), Gaps = 15/337 (4%)

Query: 7   IVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVE 66
           + +L   G  A Q T RT  + S+ E+ E+W A+Y + YK+  E  KRF+IFK+N+  +E
Sbjct: 12  LALLFCLGFWAFQVTSRTLQDASMYERHEEWMARYAKVYKDPEEREKRFKIFKENVNYIE 71

Query: 67  RFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKS-SQVPP 125
            FNNAA  N+ Y L +N+FADLT +EFIA +  FK   H  S     T F Y++ + +P 
Sbjct: 72  AFNNAA--NKPYKLGINQFADLTNEEFIAPRNRFK--GHMCSSITRTTTFKYENVTALPS 127

Query: 126 SVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNN 178
           +V+W +KGAVTP+K QGQC       AVAA EGI+A+   +L+SLSEQ++VDC T   + 
Sbjct: 128 TVDWRQKGAVTPIKDQGQCGCCWAFSAVAATEGIHALNSGKLISLSEQEVVDCDTKGEDQ 187

Query: 179 GCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEE 238
           GC GGFMD AFK+IIQN G+  +A Y Y+ +  G C++ +A +HAA IT YEDVP N+E+
Sbjct: 188 GCAGGFMDGAFKFIIQNHGLNTEANYPYKAVD-GKCNANEAANHAATITGYEDVPVNNEK 246

Query: 239 SLLKAVANQPVSVAIDASA--LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIK 296
           +L KAVANQPVSVAIDAS    QFY  GVF G C T L+HGVTAVGYG S +G +YWL+K
Sbjct: 247 ALQKAVANQPVSVAIDASGSDFQFYKTGVFTGSCGTQLDHGVTAVGYGVSADGTQYWLVK 306

Query: 297 NSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVS 333
           NSWG +WGE+GY  +QR +   +G CGIAM AS+P +
Sbjct: 307 NSWGTEWGEEGYIMMQRGVKAQEGLCGIAMMASYPTA 343


>gi|224162986|ref|XP_002338508.1| predicted protein [Populus trichocarpa]
 gi|222872535|gb|EEF09666.1| predicted protein [Populus trichocarpa]
          Length = 306

 Score =  329 bits (843), Expect = 1e-87,   Method: Compositional matrix adjust.
 Identities = 170/312 (54%), Positives = 221/312 (70%), Gaps = 18/312 (5%)

Query: 32  EKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQ 91
           E+ EQW  QYGR YK+  E + R+ IFK+N+  ++ FN+     +SY L +N+FADLT +
Sbjct: 3   ERHEQWMTQYGRVYKDDNERATRYSIFKENVARIDAFNSQT--GKSYKLGVNQFADLTNE 60

Query: 92  EFIASQTGFKMSDHSSSLKANGTPFLYKS-SQVPPSVNWIEKGAVTPVKYQGQC------ 144
           EF AS+  FK   H  S +A   PF Y++ S VP +V+W ++GAVTPVK QGQC      
Sbjct: 61  EFKASRNRFK--GHMCSPQAG--PFRYENVSAVPSTVDWRKEGAVTPVKDQGQCGCCWAF 116

Query: 145 -AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAV 203
            AVAA+EGIN +   +L+SLSEQ++VDC T   + GC GG MDDAFK+I QNKG+T +A 
Sbjct: 117 SAVAAMEGINKLTTGKLISLSEQEVVDCDTKGEDQGCNGGLMDDAFKFIEQNKGLTTEAN 176

Query: 204 YSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDA--SALQFY 261
           Y Y+G + G C++ K+  HAA+IT +EDVP N E +L+KAVA QPVSVAIDA  S  QFY
Sbjct: 177 YPYKG-TDGTCNTKKSAIHAAKITGFEDVPANSEAALMKAVAKQPVSVAIDAGGSDFQFY 235

Query: 262 SGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQ 321
           S G+F G C+T L+HGVTAVGYG S +G KYWL+KNSWG  WGE+GY R+Q+DI   +G 
Sbjct: 236 SSGIFTGSCDTQLDHGVTAVGYGVS-DGSKYWLVKNSWGAQWGEEGYIRMQKDISAKEGL 294

Query: 322 CGIAMFASFPVS 333
           CGIAM AS+P +
Sbjct: 295 CGIAMQASYPTA 306


>gi|356517426|ref|XP_003527388.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like [Glycine
           max]
          Length = 343

 Score =  329 bits (843), Expect = 1e-87,   Method: Compositional matrix adjust.
 Identities = 176/337 (52%), Positives = 227/337 (67%), Gaps = 15/337 (4%)

Query: 7   IVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVE 66
           + +L   G  A Q T RT  + S+ E+  QW A+Y + YK+  E  KRF IFK+N+  +E
Sbjct: 12  LALLFCMGFLAFQVTCRTLQDASMYERHAQWMARYAKVYKDPQEREKRFRIFKENVNYIE 71

Query: 67  RFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPS 126
            FN+A   N+SY L +N+FADLT +EFIA +  FK   H  S     T F Y++  V PS
Sbjct: 72  TFNSA--DNKSYKLDINQFADLTNEEFIAPRNRFK--GHMCSSITRTTTFKYENVTVIPS 127

Query: 127 -VNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNN 178
            V+W +KGAVTP+K QGQC       AVAA EGI+A+   +L+SLSEQ++VDC T   + 
Sbjct: 128 TVDWRQKGAVTPIKDQGQCGCCWAFSAVAATEGIHALNAGKLISLSEQEVVDCDTKGQDQ 187

Query: 179 GCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEE 238
           GC GGFMD AFK+IIQN G+  +  Y Y+  + G C++  A +HAA IT YEDVP N+E+
Sbjct: 188 GCAGGFMDGAFKFIIQNHGLNTEPNYPYKA-ADGKCNAKAAANHAATITGYEDVPVNNEK 246

Query: 239 SLLKAVANQPVSVAIDASA--LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIK 296
           +L KAVANQPVSVAIDAS    QFY  GVF G C T L+HGVTAVGYG S +G +YWL+K
Sbjct: 247 ALQKAVANQPVSVAIDASGSDFQFYKSGVFTGSCGTELDHGVTAVGYGVSADGTEYWLVK 306

Query: 297 NSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVS 333
           NSWG +WGE+GY R+QR +   +G CGIAM AS+P +
Sbjct: 307 NSWGTEWGEEGYIRMQRGVKAEEGLCGIAMMASYPTA 343


>gi|356543076|ref|XP_003539989.1| PREDICTED: vignain-like [Glycine max]
          Length = 343

 Score =  328 bits (841), Expect = 2e-87,   Method: Compositional matrix adjust.
 Identities = 174/337 (51%), Positives = 231/337 (68%), Gaps = 15/337 (4%)

Query: 7   IVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVE 66
           + +L   G  A Q T RT  + S+ E+ E+W A+Y + YK+  E  KRF+IFK+N+  +E
Sbjct: 12  LALLFCLGFWAFQVTSRTLQDASMYERHEEWMARYAKVYKDPEEREKRFKIFKENVNYIE 71

Query: 67  RFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKS-SQVPP 125
            FNNAA  ++ Y L +N+FADLT +EFIA +  FK   H  S     T F Y++ + +P 
Sbjct: 72  AFNNAA--DKPYKLGINQFADLTNEEFIAPRNKFK--GHMCSSITRTTTFKYENVTALPS 127

Query: 126 SVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNN 178
           +V+W +KGAVTP+K QGQC       AVAA EGI+A+   +L+SLSEQ++VDC T   + 
Sbjct: 128 TVDWRQKGAVTPIKDQGQCGCCWAFSAVAATEGIHALNSGKLISLSEQEVVDCDTKGEDQ 187

Query: 179 GCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEE 238
           GC GGFMD AFK+IIQN G+  +A Y Y+ +  G C++ +A +HAA IT YEDVP N+E+
Sbjct: 188 GCAGGFMDGAFKFIIQNHGLNTEANYPYKAVD-GKCNANEAANHAATITGYEDVPVNNEK 246

Query: 239 SLLKAVANQPVSVAIDASA--LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIK 296
           +L KAVANQPVSVAIDAS    QFY  GVF G C T L+HGVTAVGYG S +G +YWL+K
Sbjct: 247 ALQKAVANQPVSVAIDASGSDFQFYKTGVFTGSCGTQLDHGVTAVGYGVSADGTQYWLVK 306

Query: 297 NSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVS 333
           NSWG +WGE+GY  +QR +   +G CGIAM AS+P +
Sbjct: 307 NSWGTEWGEEGYIMMQRGVKAQEGLCGIAMMASYPTA 343


>gi|224083868|ref|XP_002307151.1| predicted protein [Populus trichocarpa]
 gi|222856600|gb|EEE94147.1| predicted protein [Populus trichocarpa]
          Length = 298

 Score =  328 bits (841), Expect = 2e-87,   Method: Compositional matrix adjust.
 Identities = 168/305 (55%), Positives = 217/305 (71%), Gaps = 12/305 (3%)

Query: 32  EKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQ 91
           E+ EQW AQYGR YK+ AE   R+ IFK+N+  ++ FN+     +SY L +N+FADL+ +
Sbjct: 3   ERHEQWMAQYGRVYKDDAEKETRYNIFKENVARIDAFNSQT--GKSYNLGVNQFADLSNE 60

Query: 92  EFIASQTGFKMSDHSSSLKANGTPFLYKS-SQVPPSVNWIEKGAVTPVKYQGQCAVAAVE 150
           EF AS+  FK   H  S +A   PF Y++ S VP +++W +KGAVTPVK QGQC VAA+E
Sbjct: 61  EFKASRNRFK--GHMCSPQAG--PFRYENVSAVPATMDWRKKGAVTPVKDQGQC-VAAME 115

Query: 151 GINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMS 210
           GIN +   +L+SLSEQ++VDC T   + GC GG MDDAFK+I QNKG+T +A Y Y G +
Sbjct: 116 GINQLTTGKLISLSEQEVVDCDTKGEDQGCNGGLMDDAFKFIEQNKGLTTEANYPYTG-T 174

Query: 211 TGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA--LQFYSGGVFNG 268
            G C++ K   HAA+IT ++DVP N E +L+KAVA QPVSVAIDA     QFYS G+F G
Sbjct: 175 DGTCNTQKEVSHAAKITGFQDVPANSEAALMKAVAKQPVSVAIDAGGFEFQFYSSGIFTG 234

Query: 269 YCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFA 328
            C T L+HGVTAVGYG S +G KYWL+KNSWG  WGE+GY R+Q+DI   +G CGIAM A
Sbjct: 235 SCGTELDHGVTAVGYGGS-DGTKYWLVKNSWGAQWGEEGYIRMQKDISAKEGLCGIAMQA 293

Query: 329 SFPVS 333
           S+P +
Sbjct: 294 SYPTA 298


>gi|356543116|ref|XP_003540009.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like [Glycine
           max]
          Length = 337

 Score =  327 bits (838), Expect = 5e-87,   Method: Compositional matrix adjust.
 Identities = 174/342 (50%), Positives = 229/342 (66%), Gaps = 23/342 (6%)

Query: 3   KYFLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNL 62
           ++ L +VL++S  C SQ   R   E S++E+ EQW  +YG+ YK++AE  KR  IFKDN+
Sbjct: 8   QHILALVLLLS-ICTSQVMSRYLHEASMSERHEQWMKKYGKVYKDAAEKQKRLLIFKDNV 66

Query: 63  VAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKM-SDHSSSLKANGTPFLYKS- 120
             +E FN  A GN+ Y L +N  AD T +EF+AS  G+K  + HS       TPF Y++ 
Sbjct: 67  EFIESFN--AAGNKPYKLGINHLADQTNEEFVASHNGYKHKASHSQ------TPFKYENV 118

Query: 121 SQVPPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCAT 173
           + VP +V+W E GAVT VK QGQC        VAA EGI  I  + L+SLSEQ+LVDC +
Sbjct: 119 TGVPNAVDWRENGAVTAVKDQGQCGSCWAFSTVAATEGIYQITTSMLMSLSEQELVDCDS 178

Query: 174 NDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVP 233
            D  +GC GG+M+  F++II+N GI+++A Y Y  +  G CD+ K    AAQI  YE VP
Sbjct: 179 VD--HGCDGGYMEGGFEFIIKNGGISSEANYPYTAVD-GTCDANKEASPAAQIKGYETVP 235

Query: 234 PNDEESLLKAVANQPVSVAIDA--SALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIK 291
            N E++L KAVANQPVSV IDA  SA QFYS GVF G C T L+HGVTAVGYG++++G +
Sbjct: 236 ANSEDALQKAVANQPVSVTIDAGGSAFQFYSSGVFTGQCGTQLDHGVTAVGYGSTDDGTQ 295

Query: 292 YWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVS 333
           YW++KNSWG  WGE+GY R+QR  D  +G CGIAM AS+P +
Sbjct: 296 YWIVKNSWGTQWGEEGYIRMQRGTDAQEGLCGIAMDASYPTA 337


>gi|224099295|ref|XP_002334495.1| predicted protein [Populus trichocarpa]
 gi|222872550|gb|EEF09681.1| predicted protein [Populus trichocarpa]
          Length = 342

 Score =  326 bits (835), Expect = 1e-86,   Method: Compositional matrix adjust.
 Identities = 166/339 (48%), Positives = 223/339 (65%), Gaps = 16/339 (4%)

Query: 5   FLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVA 64
           F   +LI+ G  A +   R   E  ++ + EQW A YG+ Y ++AE  +RF+IFK+N+  
Sbjct: 10  FFAFILIL-GMWAFEVASRELQESYMSARHEQWMATYGKVYVDAAEKERRFKIFKNNVEY 68

Query: 65  VERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKS-SQV 123
           +E FN A  GN+ Y L +NKFAD T ++F  ++ G++    +  +K   T F Y++ + V
Sbjct: 69  IESFNTA--GNKPYKLSVNKFADQTNEKFKGARNGYRRPFQTRPMKV--TSFKYENVTAV 124

Query: 124 PPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDN 176
           P +++W +KGAVTP+K QGQC        VAA EGIN +   +LVSLSEQ+LVDC     
Sbjct: 125 PATMDWRKKGAVTPIKDQGQCGSCWAFSTVAATEGINQLTTGKLVSLSEQELVDCDNQGE 184

Query: 177 NNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPND 236
           + GC GG M+D F++II+N GIT +A Y Y+  + G C+S K   H A+IT YE VP N 
Sbjct: 185 DQGCEGGLMEDGFEFIIKNHGITTEANYPYQA-ADGTCNSKKQASHIAKITGYESVPANS 243

Query: 237 EESLLKAVANQPVSVAIDA--SALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWL 294
           E  LLK VANQP+SV+IDA  S  QFYS GVF G C T L+HGVTAVGYG + +G KYWL
Sbjct: 244 EAELLKVVANQPISVSIDAGGSDFQFYSSGVFTGKCGTELDHGVTAVGYGETSDGTKYWL 303

Query: 295 IKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVS 333
           +KNSW   WGE+GY R+QRDID  +G CGIAM +S+P +
Sbjct: 304 VKNSWXTSWGEEGYIRMQRDIDAEEGLCGIAMDSSYPTA 342


>gi|255568297|ref|XP_002525123.1| cysteine protease, putative [Ricinus communis]
 gi|223535582|gb|EEF37250.1| cysteine protease, putative [Ricinus communis]
          Length = 349

 Score =  325 bits (833), Expect = 2e-86,   Method: Compositional matrix adjust.
 Identities = 167/346 (48%), Positives = 222/346 (64%), Gaps = 16/346 (4%)

Query: 2   AKYFLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDN 61
            K   I +  +   CA QA  R   E  +  + E+W A++G+ YK+  E  +RF+IFK N
Sbjct: 7   GKILPIALFFVLAMCADQAASRELHELEMTGRHEKWMAKHGKVYKDDKEKLRRFQIFKSN 66

Query: 62  LVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKS- 120
           +V +E FN A  GN+SY L +NKFADLT +EF A   G+K    +S      TPF Y++ 
Sbjct: 67  VVFIESFNTA--GNKSYMLGINKFADLTNEEFRAFWNGYKRPLGASR---KITPFKYENV 121

Query: 121 SQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCAT 173
           + +P S++W  KGAVTP+K QG C       AVAA EGI+ ++  +LVSLSEQ+LVDC  
Sbjct: 122 TALPSSIDWRSKGAVTPIKDQGVCGSCWAFSAVAATEGIHKLRTGKLVSLSEQELVDCDV 181

Query: 174 NDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVP 233
              + GC GG M DAFK+I ++ G+T++A Y Y+G   G CD+ K    A +IT Y+ VP
Sbjct: 182 KGQDKGCQGGLMVDAFKFIKRHGGMTSEANYPYQGRD-GKCDTKKEASRAVKITGYQAVP 240

Query: 234 PNDEESLLKAVANQPVSVAIDASAL--QFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIK 291
            N E +LLKAVANQPVSVAIDA +L  QFY  G+F G C   +NHGV AVGYG S  G K
Sbjct: 241 KNSEAALLKAVANQPVSVAIDAGSLSFQFYRSGIFTGICGKDINHGVAAVGYGRSNSGSK 300

Query: 292 YWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVSKESA 337
           YW++KNSWG +WGE GY R++RD+   +G CGIAM  S+P ++  A
Sbjct: 301 YWIVKNSWGTEWGEKGYIRMKRDVRSKEGLCGIAMECSYPTAQVQA 346


>gi|224121800|ref|XP_002330656.1| predicted protein [Populus trichocarpa]
 gi|222872260|gb|EEF09391.1| predicted protein [Populus trichocarpa]
          Length = 342

 Score =  325 bits (833), Expect = 2e-86,   Method: Compositional matrix adjust.
 Identities = 166/339 (48%), Positives = 223/339 (65%), Gaps = 16/339 (4%)

Query: 5   FLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVA 64
           F   +LI+ G  A +   R   E  ++ + EQW A YG+ Y ++AE  +RF+IFK+N+  
Sbjct: 10  FFAFILIL-GMWAFEVASRELQESYMSARHEQWMATYGKVYVDAAEKERRFKIFKNNVEY 68

Query: 65  VERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKS-SQV 123
           +E FN A  GN+ Y L +NKFAD T ++F  ++ G++    +  +K   T F Y++ + V
Sbjct: 69  IESFNTA--GNKPYKLSVNKFADQTNEKFKGARNGYRRPFQTRPMKV--TSFKYENVTAV 124

Query: 124 PPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDN 176
           P +++W +KGAVT +K QGQC        VAA EGIN +   +LVSLSEQ+LVDC     
Sbjct: 125 PATMDWRKKGAVTLIKDQGQCGSCWAFSTVAATEGINQLTTGKLVSLSEQELVDCDIQGE 184

Query: 177 NNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPND 236
           + GC GG M+D F++II+N GIT +A Y Y+  + G C+S K   H A+IT YE VP N 
Sbjct: 185 DQGCEGGLMEDGFEFIIKNHGITTEANYPYQA-ADGTCNSKKQASHIAKITGYESVPANS 243

Query: 237 EESLLKAVANQPVSVAIDA--SALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWL 294
           E  LLK VANQP+SV+IDA  S  QFYS GVF G C T L+HGVTAVGYG + +G KYWL
Sbjct: 244 EAELLKVVANQPISVSIDAGGSDFQFYSSGVFTGKCGTELDHGVTAVGYGETSDGTKYWL 303

Query: 295 IKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVS 333
           +KNSWG  WGE+GY R+QRDID  +G CGIAM +S+P +
Sbjct: 304 VKNSWGTSWGEEGYIRMQRDIDTEEGLCGIAMDSSYPTA 342


>gi|40806498|gb|AAR92154.1| putative cysteine protease 1 [Iris x hollandica]
          Length = 340

 Score =  323 bits (828), Expect = 7e-86,   Method: Compositional matrix adjust.
 Identities = 172/343 (50%), Positives = 223/343 (65%), Gaps = 22/343 (6%)

Query: 3   KYFLIVVLIISGSCASQATY-RTFDEG-SIAEKFEQWKAQYGRTYKESAENSKRFEIFKD 60
           K    + L+I    ASQ    R+  E  S+ E+ EQW AQ+GR YK +AE + RFEIF+ 
Sbjct: 8   KLLPALALLIVAIWASQGEAGRSLGENKSMLERHEQWMAQHGRVYKNAAEKAHRFEIFRA 67

Query: 61  NLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKS 120
           N+  +E FN     N  + L +N+FADLT +EF    T  K S  +S+       F Y++
Sbjct: 68  NVERIESFNAE---NHKFKLGVNQFADLTNEEFKTRNT-LKPSKMAST-----KSFKYEN 118

Query: 121 -SQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCA 172
            + VP +++W  KGAVTP+K QGQC       AVAA EGI  +   +L+SLSEQ++VDC 
Sbjct: 119 VTAVPATMDWRTKGAVTPIKDQGQCGSCWAFSAVAATEGITKLSTGKLISLSEQEVVDCD 178

Query: 173 TNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDV 232
              ++ GC GG MDDAF+YII+NKGIT +A Y Y+  + G C++ KA  HAA IT YEDV
Sbjct: 179 VTSDDQGCNGGEMDDAFEYIIKNKGITTEANYPYKA-ADGTCNTKKAASHAASITGYEDV 237

Query: 233 PPNDEESLLKAVANQPVSVAIDAS--ALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGI 290
             N E +LLKA ANQP++VAIDA   A Q YS GVF G C T L+HGVT VGYG + +G 
Sbjct: 238 TVNSEAALLKAAANQPIAVAIDAGDFAFQMYSSGVFTGDCGTDLDHGVTLVGYGATSDGT 297

Query: 291 KYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVS 333
           KYWL+KNSWG  WGEDGY R++RD+D  +G CGIAM AS+P +
Sbjct: 298 KYWLVKNSWGTSWGEDGYIRMERDVDAKEGLCGIAMDASYPTA 340


>gi|255568299|ref|XP_002525124.1| cysteine protease, putative [Ricinus communis]
 gi|223535583|gb|EEF37251.1| cysteine protease, putative [Ricinus communis]
          Length = 342

 Score =  323 bits (827), Expect = 9e-86,   Method: Compositional matrix adjust.
 Identities = 163/341 (47%), Positives = 226/341 (66%), Gaps = 16/341 (4%)

Query: 3   KYFLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNL 62
           ++ LI +  +    A QA+ R   E ++ E+ E+W A++G+ YK+  E  +RF+IFK+N+
Sbjct: 8   QFLLIALFFVLAMWADQASTRELHESTMVERHEKWMAKHGKVYKDDEEKLRRFQIFKNNV 67

Query: 63  VAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKS-S 121
             +E  N  A GN SY L +N+FADLT +EF AS  G+K    +S +    TPF Y++ +
Sbjct: 68  EFIESSN--AAGNNSYMLGINRFADLTNEEFRASWNGYKRPLDASRIV---TPFKYENVT 122

Query: 122 QVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATN 174
            +P S++W  KGAVT +K Q +C       AVAA EG++ ++  +LVSLSEQ+LVDC   
Sbjct: 123 ALPYSMDWRRKGAVTSIKDQRECGSCWAFSAVAATEGVHKLRTGKLVSLSEQELVDCDVK 182

Query: 175 DNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPP 234
             + GC GG M+DAFK+I +N GIT +A Y+Y G   G CD+ K   H A+IT Y+ VP 
Sbjct: 183 GEDKGCQGGLMEDAFKFIKRNGGITTEANYAYRGRD-GKCDTKKEASHVAKITGYQVVPE 241

Query: 235 NDEESLLKAVANQPVSVAIDASAL--QFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKY 292
           N E +LLKAVA+QPVSV+IDA ++  QFY  G++ G C + LNHGV AVGYGTS  G KY
Sbjct: 242 NSEAALLKAVAHQPVSVSIDAGSMSFQFYQSGIYAGSCGSDLNHGVAAVGYGTSSSGSKY 301

Query: 293 WLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVS 333
           W++KNSWG +WGE GY R++RDI   +G CGIAM  S+P +
Sbjct: 302 WIVKNSWGPEWGERGYVRMKRDITSRKGLCGIAMDCSYPTA 342


>gi|47524507|gb|AAT34987.1| putative cysteine protease [Gossypium hirsutum]
          Length = 344

 Score =  319 bits (818), Expect = 1e-84,   Method: Compositional matrix adjust.
 Identities = 174/349 (49%), Positives = 229/349 (65%), Gaps = 23/349 (6%)

Query: 1   MAKYFLIVVLIISGSCASQA---TYRTFDEGSIAEKFEQWKAQYGRTYKESAEN--SKRF 55
           + + FL V L++S   + Q    +    DE S+  + E+W +Q+GR Y +  E+  +KRF
Sbjct: 3   LLQIFLFVALVLSFCFSIQLAGLSRPLLDEDSM--RHEEWMSQHGRVYADEQEDHKNKRF 60

Query: 56  EIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTP 115
            +FK+N+  +E FN+     +++ L +N+FADLT +EF AS  GFK     SS     TP
Sbjct: 61  NVFKENVERIEEFNDG----KTFKLAINQFADLTNEEFRASYNGFKGPMVLSSQITKPTP 116

Query: 116 FLYK--SSQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQ 166
           F Y+  SS +P SV+W +KGAVTPVK QGQC       AVAA+EGI  I   +L+SLSEQ
Sbjct: 117 FRYENVSSALPVSVDWRKKGAVTPVKNQGQCGCCWAFSAVAAIEGITQISTGKLISLSEQ 176

Query: 167 QLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQI 226
           +LVDC T   ++GC GG MD AF++II N G+T ++ Y Y+G   G C+  K    A  I
Sbjct: 177 ELVDCDTKGIDHGCEGGLMDTAFEFIINNGGLTTESNYPYKG-EDGTCNFNKTNPIAVSI 235

Query: 227 TNYEDVPPNDEESLLKAVANQPVSVAIDA--SALQFYSGGVFNGYCETFLNHGVTAVGYG 284
           T YEDVP NDE++L+KAVA+QPVSVAI+A  S  QFYS GVF G C T L+H VTAVGYG
Sbjct: 236 TGYEDVPANDEQALMKAVAHQPVSVAIEAGGSDFQFYSSGVFTGECGTELDHAVTAVGYG 295

Query: 285 TSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVS 333
            SE+G KYW++KNSWG  WGE GY  +Q+DI   QG CGIAM AS+P +
Sbjct: 296 ESEDGSKYWIVKNSWGTKWGESGYIEMQKDIKVKQGLCGIAMQASYPTA 344


>gi|357452075|ref|XP_003596314.1| Cysteine proteinase [Medicago truncatula]
 gi|355485362|gb|AES66565.1| Cysteine proteinase [Medicago truncatula]
          Length = 341

 Score =  318 bits (816), Expect = 2e-84,   Method: Compositional matrix adjust.
 Identities = 174/341 (51%), Positives = 222/341 (65%), Gaps = 17/341 (4%)

Query: 3   KYFLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNL 62
            Y    + +  G  + QAT RT     + E  EQW  Q+G+ YK + E  KRF IFK+N+
Sbjct: 8   HYIPFALFLCLGLLSFQATSRTLQNDPMYEMHEQWMVQHGKVYKAAHEKQKRFGIFKENV 67

Query: 63  VAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKS-S 121
             +E FNN  +GN+SY L LN FADLT  EFIA++  F    H S +    T F YK+ S
Sbjct: 68  NYIEAFNN--VGNKSYKLGLNHFADLTNHEFIAARNKFNGYLHGSII----TTFKYKNVS 121

Query: 122 QVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATN 174
            VP +V+W ++GAVTPVK QGQC       AVA+ EGI+ +    LVSLSEQ+LVDC TN
Sbjct: 122 DVPSAVDWRQEGAVTPVKNQGQCGCCWAFSAVASTEGIHKLTTGNLVSLSEQELVDCDTN 181

Query: 175 DNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPP 234
             + GC GG MDDAF++IIQN G++ +A Y Y+G+  G C+  +    AA I+ YE+VP 
Sbjct: 182 GEDQGCEGGLMDDAFEFIIQNNGLSTEAEYPYQGVD-GTCNKTEVGSSAATISGYENVPV 240

Query: 235 NDEESLLKAVANQPVSVAIDASA--LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKY 292
           NDE++L KAVANQPVSVAIDAS    QFY  GVF G C T L+HGV  VGYG  E+  +Y
Sbjct: 241 NDEQALQKAVANQPVSVAIDASGSDFQFYKSGVFTGSCGTELDHGVAVVGYGVGEDETEY 300

Query: 293 WLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVS 333
           WL+KNSWG  WGE+GY R+QR +D  +G CGIAM  S+P +
Sbjct: 301 WLVKNSWGTQWGEEGYIRMQRGVDASEGLCGIAMQPSYPTA 341


>gi|102140014|gb|ABF70145.1| cysteine protease, putative [Musa acuminata]
          Length = 373

 Score =  317 bits (813), Expect = 4e-84,   Method: Compositional matrix adjust.
 Identities = 170/339 (50%), Positives = 218/339 (64%), Gaps = 17/339 (5%)

Query: 6   LIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAV 65
           L + L+  G  A         + S+AE+  +W A++GRTYK++AE  +R  IFK N+  +
Sbjct: 7   LWMALLALGLGACSPAAAELGDASMAERHVEWMARHGRTYKDAAEKEQRLGIFKSNVEYI 66

Query: 66  ERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKS-SQVP 124
           E FN    G R Y L  N+FADLT +EF A  TGFK S   +    NG  F + S S VP
Sbjct: 67  ESFN---AGKRKYQLAANQFADLTHEEFKAMHTGFKPSGTGAKKAGNG--FRHGSLSSVP 121

Query: 125 PSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNN 177
            SV+W  KGAVTPVK QG C        VAAVEGI  I   +L+SLSEQQLVDC  +  +
Sbjct: 122 DSVDWRSKGAVTPVKDQGLCGSCWAFTVVAAVEGITKIVTGKLISLSEQQLVDCDVHGKD 181

Query: 178 NGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDE 237
            GC GG MD AF++I+ N GIT++A Y YE +   +C++  A    A I ++EDVP NDE
Sbjct: 182 QGCQGGDMDAAFEFIVNNGGITSEANYPYEEVQR-LCNAHNASFVVATIESHEDVPTNDE 240

Query: 238 ESLLKAVANQPVSVAIDASA---LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWL 294
           ++L KAVANQPVSV IDA +    Q YSGGVF+G C T L+H VT VGYGT+ +G KYWL
Sbjct: 241 KALRKAVANQPVSVGIDAGSSLDFQLYSGGVFSGECGTDLDHAVTVVGYGTTSDGTKYWL 300

Query: 295 IKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVS 333
            KNSWG+ WGE+GY R++RD+   +G CGIAM AS+P +
Sbjct: 301 AKNSWGETWGENGYIRMERDVAAKEGLCGIAMQASYPTA 339


>gi|297819568|ref|XP_002877667.1| hypothetical protein ARALYDRAFT_348033 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297323505|gb|EFH53926.1| hypothetical protein ARALYDRAFT_348033 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 341

 Score =  317 bits (812), Expect = 5e-84,   Method: Compositional matrix adjust.
 Identities = 170/343 (49%), Positives = 222/343 (64%), Gaps = 21/343 (6%)

Query: 5   FLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVA 64
           F ++ +I+S   +   +     E S  EK EQW +++ R Y + +E + RFEIFK NL  
Sbjct: 6   FFLLAIILSSRTSGATSRGGLFEASAIEKHEQWMSRFHRVYSDDSEKTSRFEIFKKNLKF 65

Query: 65  VERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGF----KMSDHSSSLKANGTPFLYKS 120
           VE FN     N++YTL +N+F+DLT +EF A  TG      M+  S++       F Y++
Sbjct: 66  VESFNMNT--NKTYTLDVNEFSDLTDEEFKARYTGLVVPEGMTRMSTTDSHETVSFRYEN 123

Query: 121 -SQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCA 172
             +   S++W E+GAVT VK+Q QC       AVAAVEG+  I    LVSLSEQQL+DC+
Sbjct: 124 VGETGESMDWREEGAVTSVKHQQQCGCCWAFSAVAAVEGMTKIAKGELVSLSEQQLLDCS 183

Query: 173 TNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDV 232
           T   N+GC GG M  AF YI++N+GIT +  Y Y+G +   C+S      AA I+ YE V
Sbjct: 184 TE--NDGCDGGIMWKAFDYIVENQGITAEDNYPYQG-AQQTCESNHVA--AATISGYETV 238

Query: 233 PPNDEESLLKAVANQPVSVAIDASALQF--YSGGVFNGYCETFLNHGVTAVGYGTSEEGI 290
           P NDEE+LLKAV+ QPVSVAI+ S  +F  YSGG+FNG C T LNH VT VGYG SEEGI
Sbjct: 239 PQNDEEALLKAVSQQPVSVAIEGSGYEFIHYSGGIFNGECGTHLNHAVTIVGYGVSEEGI 298

Query: 291 KYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVS 333
           KYWL+KNSWG+ WGEDGY R+ RD+D PQG CG+A  A +PV+
Sbjct: 299 KYWLLKNSWGESWGEDGYMRIMRDVDAPQGMCGLASLAYYPVA 341


>gi|356543122|ref|XP_003540012.1| PREDICTED: LOW QUALITY PROTEIN: vignain-like [Glycine max]
          Length = 342

 Score =  317 bits (811), Expect = 8e-84,   Method: Compositional matrix adjust.
 Identities = 167/328 (50%), Positives = 217/328 (66%), Gaps = 15/328 (4%)

Query: 16  CASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGN 75
           C SQ   R   + S+ E+ EQW  +YG+ YK+SAE  KRF IF++N+  +E FN  A GN
Sbjct: 20  CTSQVKSRKLHDASMYERHEQWMEKYGKVYKDSAEXEKRFLIFENNVEFIESFN--AAGN 77

Query: 76  RSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKS-SQVPPSVNWIEKGA 134
           + Y L +N  AD T +EF+AS  G+K S          TPF Y++ + +P +V+W +KG 
Sbjct: 78  KPYKLSINHLADQTNEEFMASHKGYKGSHWQGLRITTQTPFKYENVTDIPWAVDWRQKGD 137

Query: 135 VTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDD 187
            T +K QGQC       AVAA EGI  I    LVSLSEQ+LVDC + D+  GC GG M+ 
Sbjct: 138 ATSIKDQGQCGICWAFSAVAATEGIYQITTGNLVSLSEQELVDCDSVDH--GCDGGLMEH 195

Query: 188 AFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQ 247
            F++II+N GI+++A Y Y  ++ G CD+ K     AQI  YE VP N EE L KAVANQ
Sbjct: 196 GFEFIIKNGGISSEANYPYTAVN-GTCDTNKEASPGAQIKGYETVPVNCEEELQKAVANQ 254

Query: 248 PVSVAIDA--SALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGE 305
           PVSV+IDA  SA QFYS GVF G C T L+HGVTAVGYG++++GI+YW++KNSWG  WGE
Sbjct: 255 PVSVSIDAGGSAFQFYSSGVFTGQCGTQLDHGVTAVGYGSTDDGIQYWIVKNSWGTQWGE 314

Query: 306 DGYFRLQRDIDQPQGQCGIAMFASFPVS 333
           +GY R+ R ID  +G CGIAM AS+P +
Sbjct: 315 EGYIRMLRGIDAQEGLCGIAMDASYPTA 342


>gi|319826926|gb|ADV74756.1| cysteine protease [Lactuca sativa]
          Length = 363

 Score =  315 bits (808), Expect = 2e-83,   Method: Compositional matrix adjust.
 Identities = 163/324 (50%), Positives = 220/324 (67%), Gaps = 12/324 (3%)

Query: 18  SQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRS 77
           S+AT RT ++ ++  + EQW A +GR Y +  E   RF+IFK+N+  ++  N  A  ++S
Sbjct: 39  SRATSRTLNDPTMIARHEQWMAHHGRIYTDENEKQLRFQIFKNNVAYIDAHN--ARSDQS 96

Query: 78  YTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTP 137
           YTL +NKFADLT  EF AS+ G+K    S S   +G       S VP  V+W ++GAVTP
Sbjct: 97  YTLEVNKFADLTNDEFRASRNGYKKQPDSDSHVVSGLFRYANVSAVPDEVDWRKEGAVTP 156

Query: 138 VKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFK 190
           VK QG C       AVAA+EGIN ++  +LVSLSEQ+LVDC  +  + GC GG M++AF+
Sbjct: 157 VKDQGDCGCCWAFSAVAAMEGINKLENGKLVSLSEQELVDCDIDGIDQGCEGGLMENAFQ 216

Query: 191 YIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVS 250
           +I + KG+  ++VY Y G   GIC++ KA   AA+I+ +E VP N+E++LL+AVANQPVS
Sbjct: 217 FIEKRKGLAAESVYPYTG-EDGICNTKKAAIPAAKISGHEKVPANNEKALLQAVANQPVS 275

Query: 251 VAIDASA--LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGY 308
           +AIDAS    QFYSGGVF G C T L+H +TAVGYG + +G KYWL+KNSWG  WGE+GY
Sbjct: 276 IAIDASGYEFQFYSGGVFTGSCGTELDHAITAVGYGATMDGTKYWLMKNSWGASWGENGY 335

Query: 309 FRLQRDIDQPQGQCGIAMFASFPV 332
            R++RD    +G CGIAM  S+PV
Sbjct: 336 IRIKRDSLAKEGLCGIAMDPSYPV 359


>gi|356515036|ref|XP_003526207.1| PREDICTED: thiol protease SEN102-like [Glycine max]
          Length = 336

 Score =  314 bits (804), Expect = 4e-83,   Method: Compositional matrix adjust.
 Identities = 165/326 (50%), Positives = 215/326 (65%), Gaps = 21/326 (6%)

Query: 18  SQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRS 77
           SQ   R   E S+ E+ EQW  +YG+ YK++AE  KRF+IFKDN+  +E FN  A GN+ 
Sbjct: 22  SQVMCRKLHETSMRERHEQWMTEYGKVYKDAAEKDKRFQIFKDNVEFIESFN--ADGNKP 79

Query: 78  YTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKS-SQVPPSVNWIEKGAVT 136
           Y L +N  ADLT +EF AS+ GFK     S+     T F Y++ + +P +++W  KGAVT
Sbjct: 80  YKLGVNHLADLTVEEFKASRNGFKRPHEFST-----TTFKYENVTAIPAAIDWRTKGAVT 134

Query: 137 PVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAF 189
           P+K QGQC        +AA EGI+ I   +LVSLSEQ+LVDC T   + GC GG+M+D F
Sbjct: 135 PIKDQGQCGSCWAFSTIAATEGIHQITTGKLVSLSEQELVDCDTKGVDQGCEGGYMEDGF 194

Query: 190 KYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPV 249
           ++II+N GIT++  Y Y+ +  G C+  KA    AQI  YE VPPN E +L KAVANQPV
Sbjct: 195 EFIIKNGGITSETNYPYKAVD-GKCN--KATSPVAQIKGYEKVPPNSETALQKAVANQPV 251

Query: 250 SVAIDA--SALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDG 307
           SV+IDA  +   FYS G++NG C T L+HGVTAVGYGT+  G  YW++KNSWG  WGE G
Sbjct: 252 SVSIDADGAGFMFYSSGIYNGECGTELDHGVTAVGYGTA-NGTDYWIVKNSWGTQWGEKG 310

Query: 308 YFRLQRDIDQPQGQCGIAMFASFPVS 333
           Y R+QR I    G CGIA+ +S+P S
Sbjct: 311 YVRMQRGIAAKHGLCGIALDSSYPTS 336


>gi|302143412|emb|CBI21973.3| unnamed protein product [Vitis vinifera]
          Length = 320

 Score =  314 bits (804), Expect = 4e-83,   Method: Compositional matrix adjust.
 Identities = 169/341 (49%), Positives = 224/341 (65%), Gaps = 38/341 (11%)

Query: 3   KYFLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNL 62
           +Y  + +L +  + ASQAT R   E S+ E+ E W  QYGR YK++ E SKR++IFKDN+
Sbjct: 8   QYICLALLFVLAAWASQATARNLHEASMYERHEDWMVQYGREYKDADEKSKRYKIFKDNV 67

Query: 63  VAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKS-S 121
             +E FN A   ++SY L +N+FADLT +EF AS+  FK   H  S +A  T F Y++ +
Sbjct: 68  ARIESFNKAM--DKSYKLSINEFADLTNEEFRASRNRFKA--HICSTEA--TSFKYENVT 121

Query: 122 QVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATN 174
            VP +V+W +KGAVTP+K QGQC       AVAA+EGI  +   +L+SLSEQ+LVDC T+
Sbjct: 122 AVPSTVDWRKKGAVTPIKDQGQCGSCWAFSAVAAMEGITQLSTGKLISLSEQELVDCDTS 181

Query: 175 DNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPP 234
             + GC                  TN   Y Y G + G C+  KA   AA+I  YEDVP 
Sbjct: 182 GEDQGC------------------TN---YPYAG-TDGTCNRKKAAHPAAKINGYEDVPA 219

Query: 235 NDEESLLKAVANQPVSVAIDA--SALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKY 292
           N+E++L KAVA+QP++VAIDA  S  QFYS GVF G C T L+HGV+AVGYGTS++G+KY
Sbjct: 220 NNEKALQKAVAHQPIAVAIDAGGSEFQFYSSGVFTGQCGTELDHGVSAVGYGTSDDGMKY 279

Query: 293 WLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVS 333
           WL+KNSWG  WGE+GY R+QRD+   +G CGIAM AS+P +
Sbjct: 280 WLVKNSWGTGWGEEGYIRMQRDVTAKEGLCGIAMQASYPTA 320


>gi|302143411|emb|CBI21972.3| unnamed protein product [Vitis vinifera]
          Length = 320

 Score =  313 bits (801), Expect = 1e-82,   Method: Compositional matrix adjust.
 Identities = 168/341 (49%), Positives = 223/341 (65%), Gaps = 38/341 (11%)

Query: 3   KYFLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNL 62
           +Y  + +L +  + ASQAT R+  E S+ E+ E W  QYGR YK++ E SKR++IFKDN+
Sbjct: 8   QYICLALLFVLAAWASQATARSLHEASMYERHEDWMVQYGREYKDADEKSKRYKIFKDNV 67

Query: 63  VAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKS-S 121
             +E FN A   ++SY L +N+FADLT +EF AS+  FK   H  S +A  T F Y++ +
Sbjct: 68  ARIESFNKAM--DKSYKLSINEFADLTNEEFRASRNRFKA--HICSTEA--TSFKYENVT 121

Query: 122 QVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATN 174
            VP +V+W +KGAVTP+K QGQC       AVAA+EGI  +   +L+SLSEQ+LVDC T+
Sbjct: 122 AVPSTVDWRKKGAVTPIKDQGQCGSCWAFSAVAAMEGITQLSTGKLISLSEQELVDCDTS 181

Query: 175 DNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPP 234
             + GC                  TN   Y Y G + G C+  KA   AA+I  YEDVP 
Sbjct: 182 GEDQGC------------------TN---YPYAG-TDGTCNRKKAAHPAAKINGYEDVPA 219

Query: 235 NDEESLLKAVANQPVSVAIDASA--LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKY 292
           N+E++L KAVA+QP++VAIDAS    QFYS GVF G C T L+HGV AVGYGTS++G+KY
Sbjct: 220 NNEKALQKAVAHQPIAVAIDASGSEFQFYSSGVFTGQCGTELDHGVAAVGYGTSDDGMKY 279

Query: 293 WLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVS 333
           WL+KNSW   WGE+GY R+QRD+   +G CGIAM AS+P +
Sbjct: 280 WLVKNSWSTGWGEEGYIRMQRDVTAKEGLCGIAMQASYPTA 320


>gi|37780043|gb|AAP32194.1| cysteine protease 1 [Trifolium repens]
          Length = 292

 Score =  312 bits (800), Expect = 1e-82,   Method: Compositional matrix adjust.
 Identities = 162/294 (55%), Positives = 204/294 (69%), Gaps = 14/294 (4%)

Query: 50  ENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSL 109
           E  KR  IF  N+  +E  +N+A+ N+ Y L +NKFADLT +EFIAS+  FK    SS +
Sbjct: 3   EREKRLRIFNKNVNYIEA-SNSAVNNKLYKLSINKFADLTNEEFIASRNKFKGHMCSSII 61

Query: 110 KANGTPFLYK-SSQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLV 161
           +   T F Y+ +S +P +V+W +KGAVTPVK QGQC       AVAA EGI+ +   +LV
Sbjct: 62  RT--TTFKYENASAIPSTVDWRKKGAVTPVKNQGQCGSCWAFSAVAATEGIHQLSTGKLV 119

Query: 162 SLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAED 221
           SLSEQ+L+DC T   + GC GG MDDAFK+IIQN G++ +  Y YEG+  G C++ KA  
Sbjct: 120 SLSEQELIDCDTKGVDQGCEGGLMDDAFKFIIQNHGLSTEVQYPYEGVD-GTCNANKASI 178

Query: 222 HAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA--LQFYSGGVFNGYCETFLNHGVT 279
           HA  IT YEDVP N+E +L KAVANQP+SVAIDAS    QFY+ GVF G C T L+HGVT
Sbjct: 179 HAVTITGYEDVPANNELALQKAVANQPISVAIDASGSDFQFYNSGVFTGSCGTELDHGVT 238

Query: 280 AVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVS 333
           AVGYG   +G KYWL+KNSWG DWGE+GY R+QR I   +G CGIAM AS+P +
Sbjct: 239 AVGYGVGNDGTKYWLVKNSWGADWGEEGYIRMQRGIAAAEGLCGIAMQASYPTA 292


>gi|302143415|emb|CBI21976.3| unnamed protein product [Vitis vinifera]
          Length = 322

 Score =  312 bits (800), Expect = 1e-82,   Method: Compositional matrix adjust.
 Identities = 166/341 (48%), Positives = 221/341 (64%), Gaps = 36/341 (10%)

Query: 3   KYFLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNL 62
           +Y  + +L +  + ASQAT R   E S+ E+ E W AQYGR YK++ E SKR++IFKDN+
Sbjct: 8   QYICLALLFVLAAWASQATARNLHEASMYERHEDWMAQYGRVYKDADEKSKRYKIFKDNV 67

Query: 63  VAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKS-S 121
             +E FN A   ++SY L +N+FADLT +EF  S+  FK   H  S +A  T F Y++ +
Sbjct: 68  ARIESFNKAM--DKSYKLSINEFADLTNEEFGTSRNRFKA--HICSTEA--TSFKYENVT 121

Query: 122 QVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATN 174
            VP +++W +KGAVTP+K QGQC       AVAA+EGI  +   +L+SLSEQ+LVDC T+
Sbjct: 122 AVPSTIDWRKKGAVTPIKDQGQCGSCWAFSAVAAMEGITQLSTGKLISLSEQELVDCDTS 181

Query: 175 DNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPP 234
             + GC                   N A Y Y G + G C+  KA   AA+I  YEDVP 
Sbjct: 182 GEDQGC-------------------NGANYPYAG-TDGTCNRKKAAHPAAKINGYEDVPA 221

Query: 235 NDEESLLKAVANQPVSVAIDASAL--QFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKY 292
           N+E++L KAV +QP++VAIDA     QFYS GVF G C T L+HGV AVGYGTS++G+KY
Sbjct: 222 NNEKALQKAVVHQPIAVAIDAGGFEFQFYSSGVFTGQCGTELDHGVAAVGYGTSDDGMKY 281

Query: 293 WLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVS 333
           WL+KNSWG  WGE+GY R+QRD+   +G CGIAM AS+P +
Sbjct: 282 WLVKNSWGTGWGEEGYIRMQRDVTAKEGLCGIAMQASYPTA 322


>gi|413944253|gb|AFW76902.1| hypothetical protein ZEAMMB73_056195 [Zea mays]
          Length = 340

 Score =  312 bits (799), Expect = 2e-82,   Method: Compositional matrix adjust.
 Identities = 164/342 (47%), Positives = 218/342 (63%), Gaps = 25/342 (7%)

Query: 6   LIVVLIISGSCASQATYRTFDEGS-IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVA 64
           ++ VL  +  C +    R  +E S +  + EQW AQY R YK++AE ++RFE+FK N+  
Sbjct: 8   ILAVLSFAFFCGAALAARDLNEDSAMVARHEQWMAQYSRVYKDAAEKARRFEVFKANVKF 67

Query: 65  VERFNNAAIGNRSYTLRLNKFADLTPQEFIASQT--GFKMSDHSSSLKANGTPFLYKSSQ 122
           +E FN    GNR + L +N+FADLT  EF  ++T  GFK      SL    T F Y++  
Sbjct: 68  IESFNTG--GNRKFWLGINQFADLTNDEFRTTKTNKGFK-----PSLDKVSTGFRYENVS 120

Query: 123 V---PPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCA 172
           V   P +++W   GAVTP+K QGQC       AVAA EGI  I   +L+SLSEQ+LVDC 
Sbjct: 121 VDAIPATIDWRTNGAVTPIKDQGQCGCCWAFSAVAATEGIVKISTGKLISLSEQELVDCD 180

Query: 173 TNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDV 232
            +  + GC GG MDDAFK+II+N G+T ++ Y Y   + G C S    + AA I  YEDV
Sbjct: 181 VHGEDQGCEGGLMDDAFKFIIKNGGLTTESNYPYTA-ADGKCKS--GSNSAANIKGYEDV 237

Query: 233 PPNDEESLLKAVANQPVSVAIDASAL--QFYSGGVFNGYCETFLNHGVTAVGYGTSEEGI 290
           P NDE +L+KAVANQPVSVA+D   +  QFYSGGV  G C T L+HG+ A+GYG + +G 
Sbjct: 238 PTNDEAALMKAVANQPVSVAVDGGDMTFQFYSGGVMTGSCGTDLDHGIAAIGYGKTSDGT 297

Query: 291 KYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
           KYWL+KNSWG  WGE+GY R+++DI   +G CG+AM  S+P 
Sbjct: 298 KYWLMKNSWGTTWGENGYLRMEKDISDKKGMCGLAMEPSYPT 339


>gi|356542631|ref|XP_003539770.1| PREDICTED: vignain-like [Glycine max]
          Length = 343

 Score =  311 bits (798), Expect = 2e-82,   Method: Compositional matrix adjust.
 Identities = 158/338 (46%), Positives = 223/338 (65%), Gaps = 17/338 (5%)

Query: 7   IVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVE 66
           + +L++ G  A  A  RT ++ S+ E+ EQW AQ+G+ YK+  E   R++IF+ N+  +E
Sbjct: 12  LALLLLFGFWAFSANTRTLEDASMHERHEQWMAQHGKVYKDHHEKELRYKIFQQNVKGIE 71

Query: 67  RFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKS-SQVPP 125
            FNNA  GN+S+ L +N+FADLT +EF A     K+  +  S  +  + F Y+  ++VP 
Sbjct: 72  GFNNA--GNKSHKLGVNQFADLTEEEFKAIN---KLKGYMWSKISRTSTFKYEHVTKVPA 126

Query: 126 SVNWIEKGAVTPVKYQG-QC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNN 177
           +++W +KGAVTP+K QG +C       AVAA EGI  +    L+SLSEQ+L+DC TN +N
Sbjct: 127 TLDWRQKGAVTPIKSQGLKCGSCWAFAAVAATEGITKLTTGELISLSEQELIDCDTNGDN 186

Query: 178 NGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDE 237
            GC  G + +AFK+I+QNKG+  +A Y Y+ +  G C++     H A I  YEDVP N+E
Sbjct: 187 GGCKWGIIQEAFKFIVQNKGLATEASYPYQAVD-GTCNAKVESKHVASIKGYEDVPANNE 245

Query: 238 ESLLKAVANQPVSVAIDASA--LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLI 295
            +LL AVANQPVSV +D+S    +FYS GV +G C T  +H VT VGYG S++G KYWLI
Sbjct: 246 TALLNAVANQPVSVLVDSSDYDFRFYSSGVLSGSCGTTFDHAVTVVGYGVSDDGTKYWLI 305

Query: 296 KNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVS 333
           KNSWG  WGE GY R++RD+   +G CGIAM AS+P++
Sbjct: 306 KNSWGVYWGEQGYIRIKRDVAAKEGMCGIAMQASYPIA 343


>gi|356545118|ref|XP_003540992.1| PREDICTED: thiol protease SEN102-like [Glycine max]
          Length = 337

 Score =  311 bits (797), Expect = 3e-82,   Method: Compositional matrix adjust.
 Identities = 166/342 (48%), Positives = 220/342 (64%), Gaps = 22/342 (6%)

Query: 3   KYFLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNL 62
           K + I + ++      Q   R   E S+ E+ EQW A+YG+ YK++AE  KRF IFK N+
Sbjct: 7   KQYTIALFLLLALGIPQMMSRKLHETSMRERHEQWMAEYGKVYKDAAEKEKRFLIFKHNV 66

Query: 63  VAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKS-S 121
             +E FN AA  N+ Y L +N  ADLT +EF AS+ G K     S+     TPF Y++ +
Sbjct: 67  EFIESFNAAA--NKPYKLGVNHLADLTVEEFKASRNGLKRPYELST-----TPFKYENVT 119

Query: 122 QVPPSVNWIEKGAVTPVKYQGQCA--------VAAVEGINAIKINRLVSLSEQQLVDCAT 173
            +P +++W  KGAVT +K QGQCA        VAA EGI+ I   +LVSLSEQ+LVDC T
Sbjct: 120 AIPAAIDWRTKGAVTSIKDQGQCAGSCWAFSTVAATEGIHQITTGKLVSLSEQELVDCDT 179

Query: 174 NDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVP 233
              + GC GG+M+D F++II+N GIT++A Y Y+ +  G C+  KA    AQI  YE VP
Sbjct: 180 KGVDQGCEGGYMEDGFEFIIKNGGITSEANYPYKAVD-GKCN--KATSPVAQIKGYEKVP 236

Query: 234 PNDEESLLKAVANQPVSVAIDAS--ALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIK 291
           PN E++L KAVANQPVSV+IDA+     FYS G++NG C T L+HGVTAVGYG +  G  
Sbjct: 237 PNSEKTLQKAVANQPVSVSIDANGEGFMFYSSGIYNGECGTELDHGVTAVGYGIA-NGTD 295

Query: 292 YWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVS 333
           YWL+KNSWG  WGE GY R+QR +    G CGIA+ +S+P +
Sbjct: 296 YWLVKNSWGTQWGEKGYVRMQRGVAAKHGLCGIALDSSYPTA 337


>gi|125551397|gb|EAY97106.1| hypothetical protein OsI_19029 [Oryza sativa Indica Group]
          Length = 350

 Score =  311 bits (797), Expect = 3e-82,   Method: Compositional matrix adjust.
 Identities = 165/347 (47%), Positives = 219/347 (63%), Gaps = 20/347 (5%)

Query: 1   MAKYFLIVVLIISGSCASQATYRTFD-EGSIAEKFEQWKAQYGRTYKESAENSKRFEIFK 59
           +A    IV L  S   A  A  R    + ++A + E+W AQ+GR YK++AE ++R E+FK
Sbjct: 10  LAILCCIVCLYSSSGGAIVAAARELGGDAAMAARHERWMAQHGRVYKDAAEKARRLEVFK 69

Query: 60  DNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQT---GFKMSDHSSSLKANGTPF 116
            N+  +E FN  A G   Y L +N+FADLT +EF A+ T   GF   ++   +    T F
Sbjct: 70  ANVAFIESFN--AGGKNRYWLGVNQFADLTSEEFKATMTNSKGFSTPNNGVRVS---TGF 124

Query: 117 LYK---SSQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQ 166
            Y+   +  +P SV+W  KGAVT +K QGQC       AVAA+EGI  +   +L+SLSEQ
Sbjct: 125 KYENVSADALPASVDWRTKGAVTRIKDQGQCGCCWAFSAVAAMEGIVKLSTGKLISLSEQ 184

Query: 167 QLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQI 226
           +LVDC  + N+ GC GG +D AF++I+ N G+T +A Y Y     G C +  A D AA I
Sbjct: 185 ELVDCDVDGNDQGCEGGEIDGAFQFILSNGGLTAEANYPYTA-EDGRCKTTAAADVAASI 243

Query: 227 TNYEDVPPNDEESLLKAVANQPVSVAIDASALQFYSGGVFNGYCETFLNHGVTAVGYGTS 286
             YEDVP NDE SL+KAVA QPVSVA+DAS  QFY GGV  G C T L+HGVT +GYG +
Sbjct: 244 RGYEDVPANDEPSLMKAVAGQPVSVAVDASKFQFYGGGVMAGECGTSLDHGVTVIGYGAA 303

Query: 287 EEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVS 333
            +G KYWL+KNSWG  WGE GY R+++DID  +G CG+AM  S+P +
Sbjct: 304 SDGTKYWLVKNSWGTTWGEAGYLRMEKDIDDKRGMCGLAMQPSYPTA 350


>gi|334185815|ref|NP_680113.3| putative cysteine proteinase [Arabidopsis thaliana]
 gi|75313879|sp|Q9STL4.1|CEP2_ARATH RecName: Full=KDEL-tailed cysteine endopeptidase CEP2; Flags:
           Precursor
 gi|4678354|emb|CAB41164.1| cysteine endopeptidase-like protein [Arabidopsis thaliana]
 gi|332644882|gb|AEE78403.1| putative cysteine proteinase [Arabidopsis thaliana]
          Length = 361

 Score =  309 bits (791), Expect = 1e-81,   Method: Compositional matrix adjust.
 Identities = 169/354 (47%), Positives = 224/354 (63%), Gaps = 22/354 (6%)

Query: 5   FLIVVLIISGSCASQATYRTFD-EGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLV 63
           FL  ++I+  +C      +  + E  ++  +++W++ +    +   E  KRF +F+ N++
Sbjct: 8   FLFSLVILQTACGFDYDDKEIESEEGLSTLYDRWRSHHS-VPRSLNEREKRFNVFRHNVM 66

Query: 64  AVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHS--SSLKANGTPFLYKS- 120
            V   N     NRSY L+LNKFADLT  EF  + TG  +  H      K     F+Y   
Sbjct: 67  HVHNTNKK---NRSYKLKLNKFADLTINEFKNAYTGSNIKHHRMLQGPKRGSKQFMYDHE 123

Query: 121 --SQVPPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDC 171
             S++P SV+W +KGAVT +K QG+C        VAAVEGIN IK N+LVSLSEQ+LVDC
Sbjct: 124 NLSKLPSSVDWRKKGAVTEIKNQGKCGSCWAFSTVAAVEGINKIKTNKLVSLSEQELVDC 183

Query: 172 ATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYED 231
            T   N GC GG M+ AF++I +N GIT +  Y YEG+  G CD+ K       I  +ED
Sbjct: 184 DTK-QNEGCNGGLMEIAFEFIKKNGGITTEDSYPYEGID-GKCDASKDNGVLVTIDGHED 241

Query: 232 VPPNDEESLLKAVANQPVSVAIDA--SALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEG 289
           VP NDE +LLKAVANQPVSVAIDA  S  QFYS GVF G C T LNHGV AVGYG SE G
Sbjct: 242 VPENDENALLKAVANQPVSVAIDAGSSDFQFYSEGVFTGSCGTELNHGVAAVGYG-SERG 300

Query: 290 IKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVSKESAQPSSAD 343
            KYW+++NSWG +WGE GY +++R+ID+P+G+CGIAM AS+P+   S+ P+  D
Sbjct: 301 KKYWIVRNSWGAEWGEGGYIKIEREIDEPEGRCGIAMEASYPIKLSSSNPTPKD 354


>gi|242072572|ref|XP_002446222.1| hypothetical protein SORBIDRAFT_06g005410 [Sorghum bicolor]
 gi|241937405|gb|EES10550.1| hypothetical protein SORBIDRAFT_06g005410 [Sorghum bicolor]
          Length = 340

 Score =  309 bits (791), Expect = 1e-81,   Method: Compositional matrix adjust.
 Identities = 163/342 (47%), Positives = 222/342 (64%), Gaps = 25/342 (7%)

Query: 6   LIVVLIISGSCASQATYRTF-DEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVA 64
           ++ +L ++  C +    R   D+ ++  + EQW AQY R YK++ E ++RFE+FK N+  
Sbjct: 8   ILAILGLALFCGAALAARDLNDDSAMVARHEQWMAQYNRVYKDATEKAQRFEVFKANVKF 67

Query: 65  VERFNNAAIGNRSYTLRLNKFADLTPQEFIASQT--GFKMSDHSSSLKANGTPFLYKSSQ 122
           +E FN  A GNR + L +N+FADLT  EF A++T  GFK     S +K   T F Y++  
Sbjct: 68  IESFN--AGGNRKFWLGVNQFADLTNDEFRATKTNKGFK----PSPVKVP-TGFRYENVS 120

Query: 123 V---PPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCA 172
           V   P S++W  KGAVTP+K QGQC       AVAA EGI  I  ++L+SLSEQ+LVDC 
Sbjct: 121 VDALPASIDWRTKGAVTPIKDQGQCGCCWAFSAVAATEGIVKISTDKLISLSEQELVDCD 180

Query: 173 TNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDV 232
            +  + GC GG MDDAFK+II+N G+T ++ Y Y   + G C S    + AA I  +EDV
Sbjct: 181 VHGEDQGCEGGLMDDAFKFIIKNGGLTTESSYPYTA-TDGKCKS--GTNSAANIKGFEDV 237

Query: 233 PPNDEESLLKAVANQPVSVAIDASAL--QFYSGGVFNGYCETFLNHGVTAVGYGTSEEGI 290
           P NDE +L+KAVANQPVSVA+D   +  Q YSGGV  G C T L+HG+ A+GYG + +G 
Sbjct: 238 PANDEAALMKAVANQPVSVAVDGGDMTFQLYSGGVMTGSCGTDLDHGIAAIGYGQTSDGT 297

Query: 291 KYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
           KYWL+KNSWG  WGE+GY R+++DI   +G CG+AM  S+P 
Sbjct: 298 KYWLLKNSWGTTWGENGYLRMEKDISDKRGMCGLAMEPSYPT 339


>gi|77554625|gb|ABA97421.1| Vignain precursor, putative [Oryza sativa Japonica Group]
 gi|222630746|gb|EEE62878.1| hypothetical protein OsJ_17681 [Oryza sativa Japonica Group]
          Length = 350

 Score =  309 bits (791), Expect = 2e-81,   Method: Compositional matrix adjust.
 Identities = 164/346 (47%), Positives = 217/346 (62%), Gaps = 20/346 (5%)

Query: 1   MAKYFLIVVLIISGSCASQATYRTFD-EGSIAEKFEQWKAQYGRTYKESAENSKRFEIFK 59
           +A    IV L  S   A  A  R    + ++A + E+W AQ+GR YK++AE ++R E+FK
Sbjct: 10  LAILCCIVCLYSSSGGAIVAAARELGGDAAMAARHERWMAQHGRVYKDAAEKARRLEVFK 69

Query: 60  DNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQT---GFKMSDHSSSLKANGTPF 116
            N+  +E FN  A G   Y L +N+FADLT +EF A+ T   GF   ++   +    T F
Sbjct: 70  ANVAFIESFN--AGGKNRYWLGVNQFADLTSEEFKATMTNSKGFSTPNNGVRVS---TGF 124

Query: 117 LYK---SSQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQ 166
            Y+   +  +P SV+W  KGAVT +K QGQC       AVAA+EG   +   +L+SLSEQ
Sbjct: 125 KYENVSADALPASVDWRTKGAVTRIKDQGQCGCCWAFSAVAAMEGFVKLSTGKLISLSEQ 184

Query: 167 QLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQI 226
           +LVDC  + N+ GC GG +D AF++I+ N G+T +A Y Y     G C +  A D AA I
Sbjct: 185 ELVDCDVDGNDQGCEGGEIDGAFQFILSNGGLTAEANYPYTA-EDGRCKTTAAADVAASI 243

Query: 227 TNYEDVPPNDEESLLKAVANQPVSVAIDASALQFYSGGVFNGYCETFLNHGVTAVGYGTS 286
             YEDVP NDE SL+KAVA QPVSVA+DAS  QFY GGV  G C T L+HGVT +GYG +
Sbjct: 244 RGYEDVPANDEPSLMKAVAGQPVSVAVDASKFQFYGGGVMAGECGTSLDHGVTVIGYGAA 303

Query: 287 EEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
            +G KYWL+KNSWG  WGE GY R+++DID  +G CG+AM  S+P 
Sbjct: 304 SDGTKYWLVKNSWGTTWGEAGYLRMEKDIDDKRGMCGLAMQPSYPT 349


>gi|18408828|ref|NP_566920.1| putative cysteine proteinase [Arabidopsis thaliana]
 gi|12324451|gb|AAG52191.1|AC012329_18 putative cysteine proteinase; 15366-14136 [Arabidopsis thaliana]
 gi|6723404|emb|CAB66413.1| cysteine protease-like protein [Arabidopsis thaliana]
 gi|332645009|gb|AEE78530.1| putative cysteine proteinase [Arabidopsis thaliana]
          Length = 341

 Score =  308 bits (790), Expect = 2e-81,   Method: Compositional matrix adjust.
 Identities = 170/346 (49%), Positives = 224/346 (64%), Gaps = 26/346 (7%)

Query: 4   YFLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLV 63
           +FL+ +L+ S +    +    F E S  EK EQW +++ R Y + +E + RFEIF +NL 
Sbjct: 6   FFLLAILLSSRTSGVTSRGGLF-EASAVEKHEQWMSRFNRVYSDDSEKTSRFEIFTNNLK 64

Query: 64  AVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGF----KMSDHSSSLKANGTPFLYK 119
            VE  N     N++YTL +N+F+DLT +EF A  TG      M+  S++       F Y+
Sbjct: 65  FVESINMNT--NKTYTLDVNEFSDLTDEEFKARYTGLVVPEGMTRISTTDSHETVSFRYE 122

Query: 120 S-SQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDC 171
           +  +   S++WI++GAVT VK+Q QC       AVAAVEG+  I    LVSLSEQQL+DC
Sbjct: 123 NVGETGESMDWIQEGAVTSVKHQQQCGCCWAFSAVAAVEGMTKIANGELVSLSEQQLLDC 182

Query: 172 ATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDH--AAQITNY 229
           +T   NNGC GG M  AF YI +N+GIT +  Y Y+G +   C+S    +H  AA I+ Y
Sbjct: 183 STE--NNGCGGGIMWKAFDYIKENQGITTEDNYPYQG-AQQTCES----NHLAAATISGY 235

Query: 230 EDVPPNDEESLLKAVANQPVSVAIDASALQF--YSGGVFNGYCETFLNHGVTAVGYGTSE 287
           E VP NDEE+LLKAV+ QPVSVAI+ S  +F  YSGG+FNG C T L H VT VGYG SE
Sbjct: 236 ETVPQNDEEALLKAVSQQPVSVAIEGSGYEFIHYSGGIFNGECGTQLTHAVTIVGYGVSE 295

Query: 288 EGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVS 333
           EGIKYWL+KNSWG+ WGE+GY R+ RD+D PQG CG+A  A +PV+
Sbjct: 296 EGIKYWLLKNSWGESWGENGYMRIMRDVDSPQGMCGLASLAYYPVA 341


>gi|356515050|ref|XP_003526214.1| PREDICTED: vignain-like [Glycine max]
          Length = 344

 Score =  308 bits (790), Expect = 2e-81,   Method: Compositional matrix adjust.
 Identities = 167/345 (48%), Positives = 226/345 (65%), Gaps = 21/345 (6%)

Query: 3   KYFLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNL 62
           K  ++ + +      SQ   R   + ++ E+ E W A+YG+ YK++AE  KRF+IFKDN+
Sbjct: 7   KQHMLALFLFLAVGISQVMPRKLHQTALRERHENWMAEYGKIYKDAAEKEKRFQIFKDNV 66

Query: 63  VAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDH--SSSLKANGTPFLYKS 120
             +E FN  A GN+ Y L +N  ADLT +EF  S+ G K +    +++ K NG  F Y++
Sbjct: 67  EFIESFN--AAGNKPYKLGVNHLADLTLEEFKDSRNGLKRTYEFSTTTFKLNG--FKYEN 122

Query: 121 -SQVPPSVNWIEKGAVTPVKYQG-QCA-------VAAVEGINAIKINRLVSLSEQQLVDC 171
            + +P +++W  KGAVTP+K QG QC        VAA EGI  I    L+SLSEQ+LVDC
Sbjct: 123 VTDIPEAIDWRVKGAVTPIKDQGDQCGSCWAFSTVAATEGIYQISTGMLMSLSEQELVDC 182

Query: 172 ATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYED 231
            + D+  GC GG M+D F++II+N GI+++A Y Y  +  G CD+ K    AAQI  YE 
Sbjct: 183 DSVDH--GCDGGLMEDGFEFIIKNGGISSEANYPYTAVD-GTCDASKEASPAAQIKGYET 239

Query: 232 VPPNDEESLLKAVANQPVSVAIDA--SALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEG 289
           VP N EE+L +AVANQPVSV+IDA  S  QFYS GVF G C T L+HGVT VGYGT+++G
Sbjct: 240 VPANSEEALQQAVANQPVSVSIDAGGSGFQFYSSGVFTGQCGTQLDHGVTVVGYGTTDDG 299

Query: 290 I-KYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVS 333
             +YW++KNSWG  WGE+GY R+QR ID  +G CGIAM AS+P +
Sbjct: 300 THEYWIVKNSWGTQWGEEGYIRMQRGIDALEGLCGIAMDASYPTA 344


>gi|242072398|ref|XP_002446135.1| hypothetical protein SORBIDRAFT_06g002170 [Sorghum bicolor]
 gi|241937318|gb|EES10463.1| hypothetical protein SORBIDRAFT_06g002170 [Sorghum bicolor]
          Length = 338

 Score =  308 bits (788), Expect = 3e-81,   Method: Compositional matrix adjust.
 Identities = 160/342 (46%), Positives = 220/342 (64%), Gaps = 21/342 (6%)

Query: 2   AKYFLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDN 61
           +K FL+ +L  +  C+S    R   + ++ E+ E W  +YGR YK++AE ++RFE+FKDN
Sbjct: 4   SKAFLLAILGCASLCSSVLAARELSDAAMVERHENWMVEYGRVYKDAAEKARRFEVFKDN 63

Query: 62  LVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKS- 120
           +  VE FN     N  + L +N+FADLT +EF A++ GFK     S+ K   T F Y++ 
Sbjct: 64  VAFVESFNTNK--NNKFWLGINQFADLTIEEFKANK-GFK---PISAEKVPTTGFKYENL 117

Query: 121 --SQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDC 171
             S +P +V+W  KGAVTP+K QGQC       AVAA+EGI  +    L+SLSEQ+LVDC
Sbjct: 118 SVSALPTAVDWRTKGAVTPIKNQGQCGCCWAFSAVAAMEGIVKLSTGNLISLSEQELVDC 177

Query: 172 ATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYED 231
            T+  + GC GG+MD AF+++I+N G+   + Y Y+ +  G C        AA I  +ED
Sbjct: 178 DTHSMDEGCEGGWMDSAFEFVIKNGGLATVSSYPYKAVD-GKCKG--GSKSAATIKGHED 234

Query: 232 VPPNDEESLLKAVANQPVSVAIDASALQF--YSGGVFNGYCETFLNHGVTAVGYGTSEEG 289
           VP NDE +L+KAVANQPVSVA+DAS   F  YSGGV  G C T L+HG+ A+GYG   +G
Sbjct: 235 VPVNDEAALMKAVANQPVSVAVDASDRTFMLYSGGVMTGSCGTELDHGIAAIGYGVESDG 294

Query: 290 IKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFP 331
            KYW++KNSWG  WGE G+ R+++DI   QG CG+AM  S+P
Sbjct: 295 TKYWILKNSWGTTWGEKGFLRMEKDISDKQGMCGLAMKPSYP 336


>gi|318136892|gb|ADV41672.1| cysteine protease [Nicotiana tabacum]
          Length = 349

 Score =  308 bits (788), Expect = 3e-81,   Method: Compositional matrix adjust.
 Identities = 159/342 (46%), Positives = 227/342 (66%), Gaps = 18/342 (5%)

Query: 6   LIVVLIISGSCASQ-ATYRTFD-EGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLV 63
           L +  I  G   SQ A+ R  + E S+  + +QW A + + YK+  E   RF+IFK+N+ 
Sbjct: 12  LALFFIFLGVWRSQVASSRPINYEASMRARHDQWIAHHDKVYKDLNEKEMRFKIFKENVE 71

Query: 64  AVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANG--TPFLYKS- 120
            +E FN  A  ++ Y L +NKF+DLT ++F    TG+K S H   + ++   T F Y + 
Sbjct: 72  RIEAFN--AGEDKGYKLGVNKFSDLTNEKFRVLHTGYKRS-HPKVMSSSKPKTHFRYANV 128

Query: 121 SQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCAT 173
           + +PP+++W +KGAVTP+K Q +C       AVAA EG++ +K  +L+ LSEQ+LVDC  
Sbjct: 129 TDIPPTMDWRKKGAVTPIKDQKECGCCWAFSAVAATEGLHQLKTGKLIPLSEQELVDCDV 188

Query: 174 NDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVP 233
              + GC GG +D AF +I++NKG+T +A Y Y+G   G+C+  K+   AA+I  YEDVP
Sbjct: 189 EGEDEGCSGGLLDTAFDFILKNKGLTTEANYPYKG-EDGVCNKKKSALSAAKIAGYEDVP 247

Query: 234 PNDEESLLKAVANQPVSVAIDASAL--QFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIK 291
            N E++LL+AVANQPVSVAID S+   QFYS GVF+G C T+LNH VTAVGYG + +G K
Sbjct: 248 ANSEKALLQAVANQPVSVAIDGSSFDFQFYSSGVFSGSCSTWLNHAVTAVGYGATTDGTK 307

Query: 292 YWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVS 333
           YW+IKNSWG  WG+ GY R++RD+ + +G CG+AM AS+P +
Sbjct: 308 YWIIKNSWGSKWGDSGYMRIKRDVHEKEGLCGLAMDASYPTA 349


>gi|297816028|ref|XP_002875897.1| hypothetical protein ARALYDRAFT_347926 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297321735|gb|EFH52156.1| hypothetical protein ARALYDRAFT_347926 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 361

 Score =  308 bits (788), Expect = 4e-81,   Method: Compositional matrix adjust.
 Identities = 168/354 (47%), Positives = 227/354 (64%), Gaps = 22/354 (6%)

Query: 5   FLIVVLIISGSCASQATYRTFD-EGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLV 63
           FL  ++I+  +C      +  + E  +++ +++W++ +    +   E  KRF +F+ N++
Sbjct: 8   FLFSLVILETACGFDYEDKEIESEEGLSKLYDRWRSHHS-VPRSLHEREKRFNVFRHNVM 66

Query: 64  AVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHS--SSLKANGTPFLYKS- 120
            V   +N+   NRSY L+LNKFADLT  EF  + TG K+  H      K     F+Y   
Sbjct: 67  HV---HNSNKKNRSYKLKLNKFADLTIHEFKNAYTGSKIKHHRMLQGPKRGSKQFMYDHE 123

Query: 121 --SQVPPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDC 171
             S++P SV+W +KGAVT +K QG+C        VAAVEGIN IK N+LVSLSEQ+LVDC
Sbjct: 124 NVSKLPSSVDWRKKGAVTEIKNQGKCGSCWAFSTVAAVEGINKIKTNKLVSLSEQELVDC 183

Query: 172 ATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYED 231
            TN  N GC GG M+ AF++I +N GIT +  Y YEG+  G CD+ K       I  +E+
Sbjct: 184 DTN-QNEGCNGGLMEIAFEFIKKNGGITTEDSYPYEGID-GKCDASKDNGVLVTIDGHEN 241

Query: 232 VPPNDEESLLKAVANQPVSVAIDA--SALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEG 289
           VP NDE +LLKAVANQPVSVAIDA  S  QFYS GVF G C T LNHGV  VGYG S+ G
Sbjct: 242 VPENDENALLKAVANQPVSVAIDAGSSDFQFYSEGVFTGDCGTELNHGVATVGYG-SQGG 300

Query: 290 IKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVSKESAQPSSAD 343
            KYW+++NSWG +WGE GY +++R ID+P+G+CGIAM AS+P+   S+ P+  D
Sbjct: 301 KKYWIVRNSWGTEWGEGGYIKIERGIDEPEGRCGIAMEASYPIKLSSSNPTPKD 354


>gi|144905112|dbj|BAF56429.1| cysteine proteinase [Lotus japonicus]
          Length = 341

 Score =  307 bits (786), Expect = 6e-81,   Method: Compositional matrix adjust.
 Identities = 169/339 (49%), Positives = 222/339 (65%), Gaps = 22/339 (6%)

Query: 5   FLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVA 64
           FL++ + IS    S+  + T  E S+ E+ EQW A+Y + YK++AE  KRF IFKDN+  
Sbjct: 15  FLLLAVGIS-RVISRELHET--ETSLIERHEQWMAKYDKVYKDAAEKEKRFLIFKDNVEF 71

Query: 65  VERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKS-SQV 123
           +E FN  A GN+ Y L +N  ADLT +EF AS+ G K    S   +   T F Y++ + +
Sbjct: 72  IESFN--AAGNKPYKLGVNHLADLTIEEFKASRNGLK---RSYDYEVGTTSFKYENVTAI 126

Query: 124 PPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDN 176
           P SV+W +KGAVTP+K QGQC        VAA EGI+ I   +LVSLSEQ+LVDC     
Sbjct: 127 PASVDWRKKGAVTPIKDQGQCGSCWAFSTVAATEGIHKISTGKLVSLSEQELVDCDRKGT 186

Query: 177 NNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPND 236
           + GC GG+M+D F++II+N GIT +A Y Y+ +  G C +  A   AAQI  YE VP N 
Sbjct: 187 DQGCEGGYMEDGFEFIIKNGGITTEANYPYKAVD-GSCKNATAP--AAQIKGYEKVPVNS 243

Query: 237 EESLLKAVANQPVSVAIDAS--ALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWL 294
           E++LLKAVANQPVSV+IDA+  +  FYS G+F G C T L+HGVTAVGYG +  G  YW+
Sbjct: 244 EKALLKAVANQPVSVSIDAADGSFMFYSSGIFTGECGTELDHGVTAVGYGRA-NGTDYWI 302

Query: 295 IKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVS 333
           +KNSWG  WGE GY R+QR I   +G CGIAM +S+P +
Sbjct: 303 VKNSWGTVWGEQGYIRMQRGIAAKEGLCGIAMDSSYPTA 341


>gi|84181681|gb|AAW78661.2| senescence-specific cysteine protease [Nicotiana tabacum]
          Length = 349

 Score =  307 bits (786), Expect = 6e-81,   Method: Compositional matrix adjust.
 Identities = 158/347 (45%), Positives = 227/347 (65%), Gaps = 17/347 (4%)

Query: 1   MAKYFLIVVLIIS-GSCASQ-ATYRTFD-EGSIAEKFEQWKAQYGRTYKESAENSKRFEI 57
           +++Y  + +  I  G  +SQ A  R  + E ++  + +QW   + + YK+  E   RF+I
Sbjct: 6   LSQYLCLALFFICLGLWSSQVALSRPINYEATMRARHDQWIVHHEKVYKDLNEKEVRFQI 65

Query: 58  FKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANG-TPF 116
           FK+N+  +E FN  A  ++ Y L  NKF+DLT +EF    TG+K S       + G T F
Sbjct: 66  FKENVERIEAFN--AGEDKGYKLGFNKFSDLTNEEFRVLHTGYKRSHPKVMTSSKGKTHF 123

Query: 117 LYKS-SQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQL 168
            Y + + +PP+++W +KGAVTP+K Q +C       AVAA+EG++ +K   L+ LSEQ+L
Sbjct: 124 RYTNVTDIPPTMDWRKKGAVTPIKDQKECGCCWAFSAVAAMEGLHQLKTGELIPLSEQEL 183

Query: 169 VDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITN 228
           VDC     + GC GG +D AF +I++NKG+T +  Y Y+G   G+C+  K+   AA+IT 
Sbjct: 184 VDCDVEGEDEGCSGGLLDTAFDFILKNKGLTTEVNYPYKG-EDGVCNKKKSALSAAKITG 242

Query: 229 YEDVPPNDEESLLKAVANQPVSVAIDASAL--QFYSGGVFNGYCETFLNHGVTAVGYGTS 286
           YEDVP N E++LL+AVANQPVSVAID S+   QFYS GVF+G C T+LNH VTAVGYG +
Sbjct: 243 YEDVPANSEKALLQAVANQPVSVAIDGSSFDFQFYSSGVFSGSCSTWLNHAVTAVGYGAT 302

Query: 287 EEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVS 333
            +G KYW+IKNSWG  WG+ GY R++RD+ + +G CG+AM AS+P +
Sbjct: 303 TDGTKYWIIKNSWGSKWGDSGYMRIKRDVHEKEGLCGLAMDASYPTA 349


>gi|224076972|ref|XP_002305074.1| predicted protein [Populus trichocarpa]
 gi|224106329|ref|XP_002333698.1| predicted protein [Populus trichocarpa]
 gi|222837984|gb|EEE76349.1| predicted protein [Populus trichocarpa]
 gi|222848038|gb|EEE85585.1| predicted protein [Populus trichocarpa]
          Length = 307

 Score =  306 bits (785), Expect = 7e-81,   Method: Compositional matrix adjust.
 Identities = 154/314 (49%), Positives = 209/314 (66%), Gaps = 17/314 (5%)

Query: 30  IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLT 89
           + ++ E+W AQ+GR Y +  E  KR+ IFK+N+  +E FNN +  +R Y L +NKFADLT
Sbjct: 1   MLKRHEEWMAQHGRVYGDMKEKEKRYLIFKENIERIEAFNNGS--DRGYKLGVNKFADLT 58

Query: 90  PQEFIASQTGFKMSDHSSSLKANGTPFLYKS-SQVPPSVNWIEKGAVTPVKYQGQC---- 144
            +EF A   G+K      S K   + F +++ S +P S++W + GAVTPVK QG C    
Sbjct: 59  NEEFRAMHHGYK----RQSSKLMSSSFRHENLSAIPTSMDWRKAGAVTPVKDQGTCGCCW 114

Query: 145 ---AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITND 201
              AVAA+EGI  +K  +L+SLSEQQLVDC     + GC GG MD+AF++I++N G+T++
Sbjct: 115 AFSAVAAIEGIIKLKTGKLISLSEQQLVDCDVKGVDQGCGGGLMDNAFQFILRNGGLTSE 174

Query: 202 AVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA--LQ 259
           A Y Y+G+  G C S K     A+IT YEDVP N+E +LL+AVA QPVSVA++      Q
Sbjct: 175 ATYPYQGVD-GTCKSKKTASIEAKITGYEDVPVNNENALLQAVAKQPVSVAVEGGGYDFQ 233

Query: 260 FYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQ 319
           FY  GVF G C T+L+H VTA+GYGT+ +G  YWL+KNSWG  WGE GY R+QR I   +
Sbjct: 234 FYKSGVFKGDCGTYLDHAVTAIGYGTNSDGTNYWLVKNSWGTSWGESGYMRMQRGIGARE 293

Query: 320 GQCGIAMFASFPVS 333
           G CG+AM AS+P +
Sbjct: 294 GLCGVAMDASYPTA 307


>gi|357160300|ref|XP_003578721.1| PREDICTED: oryzain beta chain-like [Brachypodium distachyon]
          Length = 349

 Score =  306 bits (785), Expect = 7e-81,   Method: Compositional matrix adjust.
 Identities = 160/344 (46%), Positives = 221/344 (64%), Gaps = 17/344 (4%)

Query: 4   YFLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLV 63
           + L VVL     C++  + R   + ++ E+ EQW AQ+GR YK+ AE ++RFE F++N+V
Sbjct: 7   FLLAVVLGCICLCSTVLSARELGDAAMVERHEQWMAQHGRVYKDGAEKARRFEAFRNNVV 66

Query: 64  AVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQT--GFKMSDHSSSLKANGT-PFLYK- 119
            +E FN AA   R + L +N+F DLT  EF A++T  GF   + ++  KA+ T  F Y  
Sbjct: 67  FIESFN-AAGNRRKFWLGVNQFTDLTNDEFRATKTNKGFIKRNAAAVNKASPTGTFRYSN 125

Query: 120 --SSQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVD 170
             +  +P +V+W  KGAVTP+K QGQC       AVAA EGI  +   +LV LSEQ+LVD
Sbjct: 126 VSADALPAAVDWRAKGAVTPIKNQGQCGCCWAFSAVAATEGIVQLSTGKLVPLSEQELVD 185

Query: 171 CATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYE 230
           C  N  ++GC GG MDDAF++II+N G+T++  Y Y     G C +    +  A I  YE
Sbjct: 186 CDANGADHGCEGGEMDDAFEFIIKNGGLTSETNYPYTAQD-GQCKAKNTINSVATIKGYE 244

Query: 231 DVPPNDEESLLKAVANQPVSVAIDASAL--QFYSGGVFNGYCETFLNHGVTAVGYGTSEE 288
           DVP NDE SL+KAVA QPVSVA+D   +  Q Y+GGV +G C T L+HG+ AVGYG +++
Sbjct: 245 DVPANDEASLMKAVAAQPVSVAVDGGDMVFQHYAGGVLSGSCGTSLDHGIVAVGYGAADD 304

Query: 289 GIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
           G K+WL+KNSWG  WGEDGY R+++D+    G CG+AM  S+P 
Sbjct: 305 GTKFWLMKNSWGTTWGEDGYIRMEKDVADAGGMCGLAMQPSYPT 348


>gi|297740489|emb|CBI30671.3| unnamed protein product [Vitis vinifera]
          Length = 320

 Score =  306 bits (783), Expect = 1e-80,   Method: Compositional matrix adjust.
 Identities = 167/339 (49%), Positives = 216/339 (63%), Gaps = 35/339 (10%)

Query: 5   FLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVA 64
            + + L+I G  ASQA  RT  E S++E+ E W   YGRTYK+ AE  +RF+IFK+N+  
Sbjct: 7   IICITLLIMGVWASQALSRTLHEVSMSERHEDWMGLYGRTYKDIAEKERRFKIFKENVEY 66

Query: 65  VERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKS-SQV 123
           +E  N                      +F AS+ G+ MS    S +   T F Y++ + V
Sbjct: 67  IESVN----------------------KFKASRNGYNMSSRPRSSEI--TSFRYENVAAV 102

Query: 124 PPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDN 176
           P S++W +KGAVTP+K QGQC       AVAA+EG+  +K   L+SLSEQ+LVDC T+  
Sbjct: 103 PSSMDWRKKGAVTPIKDQGQCGCCWAFSAVAAMEGVTQLKTGELISLSEQELVDCDTSGE 162

Query: 177 NNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPND 236
           + GC GG MD AF++II N G+T +A Y Y+G+    C+  KA   AA+I NYEDVP N 
Sbjct: 163 DQGCGGGLMDSAFEFIIGNGGLTTEANYPYKGVDA-TCNKKKAASSAAKIKNYEDVPANS 221

Query: 237 EESLLKAVANQPVSVAIDA--SALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWL 294
           E +LLKAVA  PVSVAIDA  S  QFYS GVF G C T L+HGVTAVGYG +++G KYWL
Sbjct: 222 EAALLKAVAQHPVSVAIDAGGSDFQFYSSGVFTGQCGTELDHGVTAVGYGKTDDGTKYWL 281

Query: 295 IKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVS 333
           +KNSWG  WGEDGY  ++RDI   +G CGIAM AS+P +
Sbjct: 282 VKNSWGTGWGEDGYIWMERDIGADEGLCGIAMEASYPTA 320


>gi|224076970|ref|XP_002305073.1| predicted protein [Populus trichocarpa]
 gi|222848037|gb|EEE85584.1| predicted protein [Populus trichocarpa]
          Length = 340

 Score =  305 bits (782), Expect = 1e-80,   Method: Compositional matrix adjust.
 Identities = 165/342 (48%), Positives = 218/342 (63%), Gaps = 21/342 (6%)

Query: 3   KYFLIVVLIISGSCASQATYRTFDEGS-IAEKFEQWKAQYGRTYKESAENSKRFEIFKDN 61
           + FL  +LI++ + A++   R  DE   + ++ E+W AQ+GR Y +  E  KR+ IFK+N
Sbjct: 9   RIFLPFLLILA-AWATKIACRPLDEQEYMLKRHEEWMAQHGRVYGDMKEKEKRYLIFKEN 67

Query: 62  LVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKS- 120
           +  +E FNN +  +R Y L +NKFADLT +EF A   G+K      S K   + F Y++ 
Sbjct: 68  IERIEAFNNGS--DRGYKLGVNKFADLTNEEFRAMYHGYK----RQSSKLMSSSFRYENL 121

Query: 121 SQVPPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCAT 173
           S +P S++W   GAVTPVK QG C        VAA+EGI  ++   L+SLSEQQLVDC  
Sbjct: 122 SDIPTSMDWRNDGAVTPVKDQGTCGCCWAFSTVAAIEGIIKLQTGNLISLSEQQLVDCTA 181

Query: 174 NDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVP 233
              N GC GG MD AF+YII+N G+T++  Y Y+G+  G C S KA    AQIT YEDVP
Sbjct: 182 G--NKGCQGGLMDTAFQYIIRNGGLTSEDNYPYQGVD-GTCSSEKAASTEAQITGYEDVP 238

Query: 234 PNDEESLLKAVANQPVSVAIDASA--LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIK 291
            N+E +LL+AVA QPVSV +D      QFY  GVFNG C T  NH VTA+GYGT  +G  
Sbjct: 239 QNNENALLQAVAKQPVSVGVDGGGNDFQFYKSGVFNGDCGTQQNHAVTAIGYGTDIDGTD 298

Query: 292 YWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVS 333
           YWL+KNSWG  WGE+GY R++R I   +G CG+AM AS+P +
Sbjct: 299 YWLVKNSWGTSWGENGYMRMRRGIGSSEGLCGVAMDASYPTA 340


>gi|413953668|gb|AFW86317.1| hypothetical protein ZEAMMB73_339067 [Zea mays]
          Length = 433

 Score =  305 bits (782), Expect = 2e-80,   Method: Compositional matrix adjust.
 Identities = 160/330 (48%), Positives = 216/330 (65%), Gaps = 24/330 (7%)

Query: 17  ASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNR 76
           A+ A     D+  +  + EQW AQY R YK+++E ++RFE+FK N+  +E FN  A GN 
Sbjct: 113 AAMAARDLSDDSVMVARHEQWMAQYSRVYKDASEKARRFEVFKANVQFIESFN--AGGNN 170

Query: 77  SYTLRLNKFADLTPQEFIASQT--GFKMSDHSSSLKANGTPFLYK---SSQVPPSVNWIE 131
            + L +N+FADLT  EF +++T  G K    SS++K   T F Y+   +  +P +++W  
Sbjct: 171 KFWLGVNQFADLTNDEFRSTKTNKGLK----SSNMKIP-TGFRYENVSADALPTTIDWRT 225

Query: 132 KGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGF 184
           KGAVTP+K QGQC       AVAA EGI  I   +LVSL+EQ+LVDC  +  + GC GG 
Sbjct: 226 KGAVTPIKDQGQCGCCWAFSAVAATEGIVKISTGKLVSLAEQELVDCDVHGEDQGCEGGL 285

Query: 185 MDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAV 244
           MDDAFK+II+N G+T ++ Y Y   + G C S    + AA I  YEDVP NDE +L+KAV
Sbjct: 286 MDDAFKFIIKNGGLTTESSYPYTA-ADGKCKS--GSNSAATIKGYEDVPANDEAALMKAV 342

Query: 245 ANQPVSVAIDASAL--QFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQD 302
           ANQPVSVA+D   +  QFYSGGV  G C T L+HG+ A+GYG + +G KYWL+KNSWG  
Sbjct: 343 ANQPVSVAVDGGDMTFQFYSGGVMTGSCGTDLDHGIAAIGYGKTSDGTKYWLMKNSWGTT 402

Query: 303 WGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
           WGE+GY R+++DI   +G CG+AM  S+P 
Sbjct: 403 WGENGYLRMEKDISDKRGMCGLAMEPSYPT 432


>gi|224076968|ref|XP_002305072.1| predicted protein [Populus trichocarpa]
 gi|222848036|gb|EEE85583.1| predicted protein [Populus trichocarpa]
          Length = 305

 Score =  305 bits (782), Expect = 2e-80,   Method: Compositional matrix adjust.
 Identities = 161/314 (51%), Positives = 206/314 (65%), Gaps = 19/314 (6%)

Query: 30  IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLT 89
           + ++ E+W AQ+GR Y +  E  KR+ IFK+N+  +E FNN +  +R Y L +NKFADLT
Sbjct: 1   MLKRHEEWMAQHGRVYGDMKEKEKRYLIFKENIERIEAFNNGS--DRGYKLGVNKFADLT 58

Query: 90  PQEFIASQTGFKMSDHSSSLKANGTPFLYKS-SQVPPSVNWIEKGAVTPVKYQGQCA--- 145
            +EF A   G+K    SS L +  + F Y++ S +P S++W   GAVTPVK QG C    
Sbjct: 59  NEEFRAMYHGYKR--QSSKLMS--SSFRYENLSDIPTSMDWRNDGAVTPVKDQGTCGCCW 114

Query: 146 ----VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITND 201
               VAA+EGI  ++   L+SLSEQQLVDC     N GC GG MD AF+YII+N G+T++
Sbjct: 115 AFSTVAAIEGIIKLQTGNLISLSEQQLVDCTAG--NKGCQGGLMDTAFQYIIRNGGLTSE 172

Query: 202 AVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA--LQ 259
             Y Y+G+  G C S KA    AQIT YEDVP N+E +LL+AVA QPVSVA+D      +
Sbjct: 173 DNYPYQGVD-GTCSSEKAASTEAQITGYEDVPQNNENALLQAVAKQPVSVAVDGGGNDFR 231

Query: 260 FYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQ 319
           FY  GVF G C T LNHGVTA+GYGT  +G  YWL+KNSWG  WGE GY R+QR I   +
Sbjct: 232 FYKSGVFEGDCGTNLNHGVTAIGYGTDSDGTDYWLVKNSWGTSWGESGYTRMQRGIGASE 291

Query: 320 GQCGIAMFASFPVS 333
           G CG+AM AS+P S
Sbjct: 292 GLCGVAMDASYPTS 305


>gi|413953667|gb|AFW86316.1| hypothetical protein ZEAMMB73_635707 [Zea mays]
          Length = 340

 Score =  305 bits (781), Expect = 2e-80,   Method: Compositional matrix adjust.
 Identities = 163/345 (47%), Positives = 220/345 (63%), Gaps = 25/345 (7%)

Query: 3   KYFLIVVLIISGSCASQATYRTF-DEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDN 61
           K  ++ +L  +  C +    R   D+ ++  + EQW AQY R YK+++E ++RFE+FK N
Sbjct: 5   KASILAILGFAFFCGAALAARDLSDDSAMVARHEQWMAQYSRVYKDASEKARRFEVFKAN 64

Query: 62  LVAVERFNNAAIGNRSYTLRLNKFADLTPQEF--IASQTGFKMSDHSSSLKANGTPFLYK 119
           +  +E FN  A GN  + L +N+FADLT  EF  I +  GFK    SS++K   T F Y+
Sbjct: 65  VKFIESFN--AGGNNKFWLGVNQFADLTNDEFRSIKTNKGFK----SSNMKIP-TGFRYE 117

Query: 120 SSQV---PPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLV 169
           +  V   P +++W  KGAVTP+K QGQC       AVAA EGI  I   +LVSL+EQ+LV
Sbjct: 118 NVSVDALPTTIDWRTKGAVTPIKDQGQCGCCWAFSAVAATEGIVKISTGKLVSLAEQELV 177

Query: 170 DCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNY 229
           DC  +  + GC GG MDDAFK+II N G+T ++ Y Y   + G C S    + AA I  Y
Sbjct: 178 DCDVHGEDQGCEGGLMDDAFKFIINNGGLTTESSYPYTA-ADGKCKS--GSNSAATIKGY 234

Query: 230 EDVPPNDEESLLKAVANQPVSVAIDASAL--QFYSGGVFNGYCETFLNHGVTAVGYGTSE 287
           EDVP NDE +L+KAVANQPVSVA+D   +  QFYS GV  G C T L+HG+ A+GYG + 
Sbjct: 235 EDVPANDEAALMKAVANQPVSVAVDGGDMTFQFYSSGVMTGSCGTDLDHGIAAIGYGKTS 294

Query: 288 EGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
           +G KYWL+KNSWG  WGE+GY R+++DI   +G CG+AM  S+P 
Sbjct: 295 DGTKYWLMKNSWGTTWGENGYLRMEKDISDKRGMCGLAMEPSYPT 339


>gi|58531896|gb|AAW78660.1| cysteine protease [Nicotiana tabacum]
          Length = 361

 Score =  305 bits (781), Expect = 2e-80,   Method: Compositional matrix adjust.
 Identities = 166/324 (51%), Positives = 209/324 (64%), Gaps = 20/324 (6%)

Query: 32  EKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQ 91
           E +E+W++ +  + +   E  KRF +FK N+  V  FN     ++ Y L+LNKFAD+T  
Sbjct: 36  ELYERWRSHHTVS-RSLDEKDKRFNVFKANVHYVHNFNKK---DKPYKLKLNKFADMTNH 91

Query: 92  EFIASQTGFKMSDHSSSL---KANGTPFLYKSSQ-VPPSVNWIEKGAVTPVKYQGQCA-- 145
           EF     G K+  H S L   +ANGT F+Y + + VPPSV+W +KGAVTPVK QG+C   
Sbjct: 92  EFRHHYAGSKIKHHRSFLGASRANGT-FMYANVEDVPPSVDWRKKGAVTPVKDQGKCGSC 150

Query: 146 -----VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITN 200
                V AVEGIN IK N LVSLSEQ+LVDC T+  N GC GG MD AF++I +  GI  
Sbjct: 151 WAFSTVVAVEGINQIKTNELVSLSEQELVDCDTS-QNQGCNGGLMDMAFEFIKKKGGINT 209

Query: 201 DAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA--L 258
           +  Y Y     G CD  K       I  YEDVPPNDE+SLLKAVANQPVSVAI AS    
Sbjct: 210 EENYPYMA-EGGECDIQKRNSPVVSIDGYEDVPPNDEDSLLKAVANQPVSVAIQASGSDF 268

Query: 259 QFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQP 318
           QFYS GVF G C T L+HGV  VGYGT+ +G KYW+++NSWG +WGE GY R+QR+ID  
Sbjct: 269 QFYSEGVFTGDCGTELDHGVAIVGYGTTLDGTKYWIVRNSWGPEWGEKGYIRMQREIDAE 328

Query: 319 QGQCGIAMFASFPVSKESAQPSSA 342
           +G CGIAM  S+P+   S+ P+ +
Sbjct: 329 EGLCGIAMQPSYPIKTSSSNPTGS 352


>gi|351721126|ref|NP_001237199.1| cysteine proteinase precursor [Glycine max]
 gi|31559530|dbj|BAC77523.1| cysteine proteinase [Glycine max]
 gi|31559532|dbj|BAC77524.1| cysteine proteinase [Glycine max]
          Length = 362

 Score =  305 bits (780), Expect = 3e-80,   Method: Compositional matrix adjust.
 Identities = 168/360 (46%), Positives = 219/360 (60%), Gaps = 23/360 (6%)

Query: 3   KYFLIVVLIISGSCASQATYRTFD-----EGSIAEKFEQWKAQYGRTYKESAENSKRFEI 57
           K FL VVL +S       ++   D     E S+ + +E+W++ +    +   +  KRF +
Sbjct: 4   KKFLWVVLSLSLVLGVANSFDFHDKDLESEESLWDLYERWRSHH-TVSRSLGDKHKRFNV 62

Query: 58  FKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHS---SSLKANGT 114
           FK N++ V   N     ++ Y L+LNKFAD+T  EF ++  G K++ H       + NGT
Sbjct: 63  FKANMMHVHNTNKM---DKPYKLKLNKFADMTNHEFRSTYAGSKVNHHRMFRDMPRGNGT 119

Query: 115 PFLYKSSQVPPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQ 167
               K   VP SV+W +KGAVT VK QG C        V AVEGIN IK N+LVSLSEQ+
Sbjct: 120 FMYEKVGSVPASVDWRKKGAVTDVKDQGHCGSCWAFSTVVAVEGINQIKTNKLVSLSEQE 179

Query: 168 LVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQIT 227
           LVDC T +N  GC GG M+ AF++I Q  GIT ++ Y Y     G CD+ KA D A  I 
Sbjct: 180 LVDCDTEENA-GCNGGLMESAFQFIKQKGGITTESYYPYTAQD-GTCDASKANDLAVSID 237

Query: 228 NYEDVPPNDEESLLKAVANQPVSVAIDA--SALQFYSGGVFNGYCETFLNHGVTAVGYGT 285
            +E+VP NDE +LLKAVANQPVSVAIDA  S  QFYS GVF G C T LNHGV  VGYG 
Sbjct: 238 GHENVPGNDENALLKAVANQPVSVAIDAGGSDFQFYSEGVFTGDCSTELNHGVAIVGYGA 297

Query: 286 SEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVSKESAQPSSADKS 345
           + +G  YW+++NSWG +WGE GY R+QR+I + +G CGIAM AS+P+   S  P+    S
Sbjct: 298 TVDGTSYWIVRNSWGPEWGELGYIRMQRNISKKEGLCGIAMLASYPIKNSSNNPTGPSSS 357


>gi|242092702|ref|XP_002436841.1| hypothetical protein SORBIDRAFT_10g009840 [Sorghum bicolor]
 gi|241915064|gb|EER88208.1| hypothetical protein SORBIDRAFT_10g009840 [Sorghum bicolor]
          Length = 328

 Score =  304 bits (779), Expect = 3e-80,   Method: Compositional matrix adjust.
 Identities = 158/338 (46%), Positives = 217/338 (64%), Gaps = 23/338 (6%)

Query: 3   KYFLIVVLIISGSCASQATYRTF-DEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDN 61
           K  ++ +L ++  C +    R   D+ ++  + EQW  QY R YK++ E ++RFE+FK N
Sbjct: 5   KASILAILGLAFFCGAALAARDLNDDSAMVARHEQWMVQYSRVYKDTTEKARRFEVFKAN 64

Query: 62  LVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQT--GFKMSDHSSSLKANGTPFLYK 119
           +  +E FN  A GNR + L +N+FADLT  EF A++T  GFK S    S     T F Y+
Sbjct: 65  VKFIESFN--AGGNRKFWLGVNQFADLTNDEFRATKTNKGFKPSPVKVS-----TGFRYE 117

Query: 120 SSQV---PPSVNWIEKGAVTPVKYQGQCAVAAVEGINAIKINRLVSLSEQQLVDCATNDN 176
           +  V   P +++W  KGAVTP+K QGQC     EGI  I   +L+SLSEQ+LVDC  +  
Sbjct: 118 NVSVDALPATIDWRTKGAVTPIKDQGQC-----EGIVKISTGKLISLSEQELVDCDVHGE 172

Query: 177 NNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPND 236
           + GC GG MDDAFK+II+N G+T ++ Y Y   + G C S    + AA +  +EDVP ND
Sbjct: 173 DQGCEGGLMDDAFKFIIKNGGLTTESSYPYTA-ADGKCKS--GSNSAATVKGFEDVPAND 229

Query: 237 EESLLKAVANQPVSVAIDASAL--QFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWL 294
           E +L+KAVANQPVSVA+D   +  QFYSGGV  G C T L+HG+ A+GYG + +G KYWL
Sbjct: 230 EAALMKAVANQPVSVAVDGGDMTFQFYSGGVMTGSCGTDLDHGIAAIGYGQTSDGTKYWL 289

Query: 295 IKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
           +KNSWG  WGE+GY R+++DI   +G CG+AM  S+P 
Sbjct: 290 LKNSWGTTWGENGYLRMEKDISDKRGMCGLAMEPSYPT 327


>gi|1345573|emb|CAA40073.1| endopeptidase (EP-C1) [Phaseolus vulgaris]
          Length = 361

 Score =  304 bits (779), Expect = 4e-80,   Method: Compositional matrix adjust.
 Identities = 169/361 (46%), Positives = 221/361 (61%), Gaps = 29/361 (8%)

Query: 2   AKYFLIVVLIISGSCASQATYRTFD-----EGSIAEKFEQWKAQYGRTYKESAENSKRFE 56
            K  L VVL  S       ++   D     E S+ + +E+W++ +    +   E  KRF 
Sbjct: 2   TKKLLWVVLSFSLVLGVANSFDFHDKDLASEESLWDLYERWRSHH-TVSRSLGEKHKRFN 60

Query: 57  IFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTP- 115
           +FK NL+ V   N     ++ Y L+LNKFAD+T  EF ++  G K++ H       GTP 
Sbjct: 61  VFKANLMHVHNTNKM---DKPYKLKLNKFADMTNHEFRSTYAGSKVNHHR---MFRGTPH 114

Query: 116 ----FLY-KSSQVPPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSL 163
               F+Y K   VPPSV+W +KGAVT VK QGQC        V AVEGIN IK N+LV+L
Sbjct: 115 ENGAFMYEKVVSVPPSVDWRKKGAVTDVKDQGQCGSCWAFSTVVAVEGINQIKTNKLVAL 174

Query: 164 SEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHA 223
           SEQ+LVDC   + N GC GG M+ AF++I Q  GIT ++ Y Y+    G CD+ K  D A
Sbjct: 175 SEQELVDC-DKEENQGCNGGLMESAFEFIKQKGGITTESNYPYKAQE-GTCDASKVNDLA 232

Query: 224 AQITNYEDVPPNDEESLLKAVANQPVSVAIDA--SALQFYSGGVFNGYCETFLNHGVTAV 281
             I  +E+VP NDE++LLKAVANQPVSVAIDA  S  QFYS GVF G C T LNHGV  V
Sbjct: 233 VSIDGHENVPANDEDALLKAVANQPVSVAIDAGGSDFQFYSEGVFTGDCSTDLNHGVAIV 292

Query: 282 GYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVSKESAQPSS 341
           GYGT+ +G  YW+++NSWG +WGE GY R+QR+I + +G CGIAM  S+P+   S  P+ 
Sbjct: 293 GYGTTVDGTNYWIVRNSWGPEWGEHGYIRMQRNISKKEGLCGIAMLPSYPIKNSSDNPTG 352

Query: 342 A 342
           +
Sbjct: 353 S 353


>gi|414587996|tpg|DAA38567.1| TPA: hypothetical protein ZEAMMB73_390779 [Zea mays]
          Length = 343

 Score =  304 bits (779), Expect = 4e-80,   Method: Compositional matrix adjust.
 Identities = 161/348 (46%), Positives = 226/348 (64%), Gaps = 26/348 (7%)

Query: 2   AKYFLIVVLIISGSCAS-----QATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFE 56
           ++ FL+++ I++G   S      A     D+ ++AE+ E+W A YGR YK++AE ++RFE
Sbjct: 4   SRAFLLLLAILTGCACSFPSPVLAARELSDDAAMAERHERWMAVYGRVYKDAAEKARRFE 63

Query: 57  IFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPF 116
           +FKDNL  VE FN  A     + L +N+FADLT +EF A++ GFK     S+ +   T F
Sbjct: 64  VFKDNLAFVESFN--ADKKNKFWLGVNQFADLTTEEFKANK-GFK---PISAEEVPTTGF 117

Query: 117 LYKS---SQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQ 166
            Y++   S +P +V+W  KGAVTP+K QGQC       AVAA+EGI  +  + LVSLSEQ
Sbjct: 118 KYENLSVSALPTAVDWRTKGAVTPIKNQGQCGCCWAFSAVAAMEGIVKLSTDNLVSLSEQ 177

Query: 167 QLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQI 226
           +LVDC T+  + GC GG+MD AF+++I+N G+  ++ Y Y+ +  G C        AA I
Sbjct: 178 ELVDCDTHSMDEGCEGGWMDSAFEFVIKNGGLATESSYPYKAVD-GKCKG--GSKSAATI 234

Query: 227 TNYEDVPPNDEESLLKAVANQPVSVAIDASALQF--YSGGVFNGYCETFLNHGVTAVGYG 284
             +EDVPPN+E +L+KAVA+QPVSVA+DAS   F  YSGGV  G C T L+HG+ A+GYG
Sbjct: 235 KGHEDVPPNNEAALMKAVASQPVSVAVDASDRTFMLYSGGVMTGSCGTQLDHGIAAIGYG 294

Query: 285 TSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
              +G KYW++KNSWG  WGE  + R+++DI   QG CG+AM  S+P 
Sbjct: 295 VESDGTKYWILKNSWGTTWGEKRFLRMEKDISDKQGMCGLAMKPSYPT 342


>gi|351726339|ref|NP_001237379.1| cysteine proteinase precursor [Glycine max]
 gi|31559526|dbj|BAC77521.1| cysteine proteinase [Glycine max]
 gi|31559528|dbj|BAC77522.1| cysteine proteinase [Glycine max]
          Length = 362

 Score =  304 bits (779), Expect = 4e-80,   Method: Compositional matrix adjust.
 Identities = 162/331 (48%), Positives = 207/331 (62%), Gaps = 18/331 (5%)

Query: 27  EGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFA 86
           E S  + +E+W++ +    +   +  KRF +FK N++ V   N     ++ Y L+LNKFA
Sbjct: 33  EESFWDLYERWRSHH-TVSRSLGDKHKRFNVFKANVMHVHNTNKM---DKPYKLKLNKFA 88

Query: 87  DLTPQEFIASQTGFKMSDHS---SSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQ 143
           D+T  EF ++  G K++ H     + + NGT    K   VPPSV+W + GAVT VK QGQ
Sbjct: 89  DMTNHEFRSTYAGSKVNHHRMFQGTPRGNGTFMYEKVGSVPPSVDWRKNGAVTGVKDQGQ 148

Query: 144 CA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNK 196
           C        V AVEGIN IK N+LVSLSEQ+LVDC T   N GC GG M+ AF++I Q  
Sbjct: 149 CGSCWAFSTVVAVEGINQIKTNKLVSLSEQELVDCDTK-KNAGCNGGLMESAFEFIKQKG 207

Query: 197 GITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDA- 255
           GIT ++ Y Y     G CD+ KA D A  I  +E+VP NDE +LLKAVANQPVSVAIDA 
Sbjct: 208 GITTESNYPYTAQD-GTCDASKANDLAVSIDGHENVPANDENALLKAVANQPVSVAIDAG 266

Query: 256 -SALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRD 314
            S  QFYS GVF G C T LNHGV  VGYGT+ +G  YW ++NSWG +WGE GY R+QR 
Sbjct: 267 GSDFQFYSEGVFTGDCSTELNHGVAIVGYGTTVDGTNYWTVRNSWGPEWGEQGYIRMQRS 326

Query: 315 IDQPQGQCGIAMFASFPVSKESAQPSSADKS 345
           I + +G CGIAM AS+P+   S  P+    S
Sbjct: 327 ISKKEGLCGIAMMASYPIKNSSNNPTGPSSS 357


>gi|1046373|gb|AAC49135.1| SAG12 protein [Arabidopsis thaliana]
          Length = 346

 Score =  304 bits (778), Expect = 4e-80,   Method: Compositional matrix adjust.
 Identities = 163/342 (47%), Positives = 213/342 (62%), Gaps = 18/342 (5%)

Query: 5   FLIVVLIISGSCASQATYRTFDEGSIAEKFE-QWKAQYGRTYKESAENSKRFEIFKDNLV 63
             + V I S  C S    R  D   I +K   +W  ++GR Y +  E + R+ +FK+N+ 
Sbjct: 8   IFLFVAIFSSFCFSITLSRPLDNELIMQKRHIEWMTKHGRVYADVKEENNRYVVFKNNVE 67

Query: 64  AVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFK-MSDHSSSLKANGTPFLYK--- 119
            +E  N+   G R++ L +N+FADLT  EF +  TGFK +S  SS  +   +PF Y+   
Sbjct: 68  RIEHLNSIPAG-RTFKLAVNQFADLTNDEFCSMYTGFKGVSALSSQSQTKMSPFRYQNVS 126

Query: 120 SSQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCA 172
           S  +P SV+W +KGAVTP+K QG C       AVAA+EG   IK  +L+SLSEQQLVDC 
Sbjct: 127 SGALPVSVDWRKKGAVTPIKNQGSCGCCWAFSAVAAIEGATQIKKGKLISLSEQQLVDCD 186

Query: 173 TNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDV 232
           TND   GC GG MD AF++I    G+T ++ Y Y+G     C+S K    A  IT YEDV
Sbjct: 187 TNDF--GCEGGLMDTAFEHIKATGGLTTESDYPYKG-EDATCNSKKTNPKATSITGYEDV 243

Query: 233 PPNDEESLLKAVANQPVSVAIDASAL--QFYSGGVFNGYCETFLNHGVTAVGYGTSEEGI 290
           P NDE++L+KAVA+QPVSV I+      QFYS GVF G C T+L+H VTA+GYG S  G 
Sbjct: 244 PVNDEQALMKAVAHQPVSVGIEGGGFDFQFYSSGVFTGECTTYLDHAVTAIGYGESTNGS 303

Query: 291 KYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
           KYW+IKNSWG  WGE GY R+Q+D+   QG CG+AM AS+P 
Sbjct: 304 KYWIIKNSWGTKWGESGYMRIQKDVKDKQGLCGLAMKASYPT 345


>gi|18422605|ref|NP_568651.1| senescence-associated protein 12 [Arabidopsis thaliana]
 gi|13877737|gb|AAK43946.1|AF370131_1 putative senescence-specific cysteine protease SAG12 [Arabidopsis
           thaliana]
 gi|9758936|dbj|BAB09317.1| senescence-specific cysteine protease [Arabidopsis thaliana]
 gi|14532898|gb|AAK64131.1| putative senescence-specific cysteine protease SAG12 [Arabidopsis
           thaliana]
 gi|332007929|gb|AED95312.1| senescence-associated protein 12 [Arabidopsis thaliana]
          Length = 346

 Score =  303 bits (777), Expect = 6e-80,   Method: Compositional matrix adjust.
 Identities = 163/342 (47%), Positives = 213/342 (62%), Gaps = 18/342 (5%)

Query: 5   FLIVVLIISGSCASQATYRTFDEGSIAEKFE-QWKAQYGRTYKESAENSKRFEIFKDNLV 63
             + V I S  C S    R  D   I +K   +W  ++GR Y +  E + R+ +FK+N+ 
Sbjct: 8   IFLFVAIFSSFCFSITLSRPLDNELIMQKRHIEWMTKHGRVYADVKEENNRYVVFKNNVE 67

Query: 64  AVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFK-MSDHSSSLKANGTPFLYK--- 119
            +E  N+   G R++ L +N+FADLT  EF +  TGFK +S  SS  +   +PF Y+   
Sbjct: 68  RIEHLNSIPAG-RTFKLAVNQFADLTNDEFRSMYTGFKGVSALSSQSQTKMSPFRYQNVS 126

Query: 120 SSQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCA 172
           S  +P SV+W +KGAVTP+K QG C       AVAA+EG   IK  +L+SLSEQQLVDC 
Sbjct: 127 SGALPVSVDWRKKGAVTPIKNQGSCGCCWAFSAVAAIEGATQIKKGKLISLSEQQLVDCD 186

Query: 173 TNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDV 232
           TND   GC GG MD AF++I    G+T ++ Y Y+G     C+S K    A  IT YEDV
Sbjct: 187 TNDF--GCEGGLMDTAFEHIKATGGLTTESNYPYKG-EDATCNSKKTNPKATSITGYEDV 243

Query: 233 PPNDEESLLKAVANQPVSVAIDASAL--QFYSGGVFNGYCETFLNHGVTAVGYGTSEEGI 290
           P NDE++L+KAVA+QPVSV I+      QFYS GVF G C T+L+H VTA+GYG S  G 
Sbjct: 244 PVNDEQALMKAVAHQPVSVGIEGGGFDFQFYSSGVFTGECTTYLDHAVTAIGYGESTNGS 303

Query: 291 KYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
           KYW+IKNSWG  WGE GY R+Q+D+   QG CG+AM AS+P 
Sbjct: 304 KYWIIKNSWGTKWGESGYMRIQKDVKDKQGLCGLAMKASYPT 345


>gi|242072390|ref|XP_002446131.1| hypothetical protein SORBIDRAFT_06g002140 [Sorghum bicolor]
 gi|241937314|gb|EES10459.1| hypothetical protein SORBIDRAFT_06g002140 [Sorghum bicolor]
          Length = 328

 Score =  303 bits (777), Expect = 7e-80,   Method: Compositional matrix adjust.
 Identities = 154/336 (45%), Positives = 219/336 (65%), Gaps = 17/336 (5%)

Query: 2   AKYFLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDN 61
           +K FL+ +L  +  C+S    R   + ++ E+ E W  +YGR YK++AE ++RF++FKDN
Sbjct: 4   SKAFLLAILGCASLCSSVLAARELSDAAMVERHENWMVEYGRVYKDAAEKARRFQVFKDN 63

Query: 62  LVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKS- 120
           +  VE FN     N  + L +N+FADLT +EF A++ GFK     ++ K   T F Y++ 
Sbjct: 64  VAFVESFNTNK--NNKFWLGVNQFADLTTEEFKANK-GFK----PTAEKVPTTGFKYENL 116

Query: 121 --SQVPPSVNWIEKGAVTPVKYQGQCAVAAVEGINAIKINRLVSLSEQQLVDCATNDNNN 178
             S +P +V+W  KGAVTP+K QGQCA  A+EGI  +    L+SLSEQ+LVDC T+  + 
Sbjct: 117 SVSALPTAVDWRTKGAVTPIKNQGQCA--AMEGIVKLSTGNLISLSEQELVDCDTHSMDE 174

Query: 179 GCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEE 238
           GC GG+MD AF+++I+N G+  ++ Y Y+ +  G C        AA I  +EDVP N+E 
Sbjct: 175 GCEGGWMDSAFEFVIKNGGLATESNYPYKAVD-GKCKG--GSKSAATIKGHEDVPVNNEA 231

Query: 239 SLLKAVANQPVSVAIDASALQF--YSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIK 296
           +L+KAVANQPVSVA+DAS   F  YSGGV  G C T L+HG+ A+GYG   +G KYW++K
Sbjct: 232 ALMKAVANQPVSVAVDASDRTFMLYSGGVMTGSCGTELDHGIAAIGYGMESDGTKYWILK 291

Query: 297 NSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
           NSWG  WGE G+ R+++DI   +G CG+AM  S+P 
Sbjct: 292 NSWGTTWGEKGFLRMEKDITDKRGMCGLAMKPSYPT 327


>gi|302143416|emb|CBI21977.3| unnamed protein product [Vitis vinifera]
          Length = 297

 Score =  303 bits (777), Expect = 7e-80,   Method: Compositional matrix adjust.
 Identities = 155/305 (50%), Positives = 208/305 (68%), Gaps = 19/305 (6%)

Query: 39  AQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQT 98
           A+YGR YK++ E  KRF+IFKDN+  +E FN A   +++Y L +N+FADLT +EF + + 
Sbjct: 2   ARYGRMYKDANEKEKRFKIFKDNVARIESFNKAM--DKTYKLSINEFADLTNEEFRSLRN 59

Query: 99  GFKMSDHSSSLKANGTPFLYKS-SQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVE 150
            FK     + + +  T F Y++ + VP +++W +KGAVTP+K Q QC       AVAA E
Sbjct: 60  RFK-----AHICSEATTFKYENVTAVPSTIDWRKKGAVTPIKDQQQCGCCWAFSAVAATE 114

Query: 151 GINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMS 210
           GI  I   +L+SLSEQ+LVDC T   N GC GG MDDAF++I +  G+ ++A Y YEG  
Sbjct: 115 GITQITTGKLISLSEQELVDCDTGGENQGCSGGLMDDAFRFI-KIHGLASEATYPYEG-D 172

Query: 211 TGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASAL--QFYSGGVFNG 268
            G C+S K    AA+I  YEDVP N+E++L KAVA+QPV+VAIDA     QFY+ GVF G
Sbjct: 173 DGTCNSKKEAHPAAKIKGYEDVPANNEKALQKAVAHQPVAVAIDAGGFEFQFYTSGVFTG 232

Query: 269 YCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFA 328
            C T L+HGV AVGYG  ++G+ YWL+KNSWG  WGE+GY R+QRD+   +G CGIAM A
Sbjct: 233 QCGTELDHGVAAVGYGIGDDGMMYWLVKNSWGTGWGEEGYIRMQRDVTAKEGLCGIAMQA 292

Query: 329 SFPVS 333
           S+P +
Sbjct: 293 SYPTA 297


>gi|388497270|gb|AFK36701.1| unknown [Lotus japonicus]
          Length = 343

 Score =  303 bits (776), Expect = 8e-80,   Method: Compositional matrix adjust.
 Identities = 175/340 (51%), Positives = 220/340 (64%), Gaps = 17/340 (5%)

Query: 6   LIVVLIISGSCASQATYRT-FDEGS--IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNL 62
           +I +  +  +CA  A  RT +DE S  +A+  +QW  QYGR+Y   AE  KRF+IF +NL
Sbjct: 7   IIALCTMLWACAYTAMSRTLYDETSSVVAKTHQQWMLQYGRSYTNDAEMEKRFKIFMENL 66

Query: 63  VAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKM-SDHSSSLKANGTPFLYKSS 121
             +E+FNNA  GN+SY L LN+F+DLT +EFIAS TG  +     SS     +P     S
Sbjct: 67  EYIEKFNNAP-GNKSYKLDLNQFSDLTNEEFIASHTGLMIDPSKPSSSSKRASPASLDLS 125

Query: 122 QVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATN 174
             P S++W E+GAVT VK QG C       AVAAVEGI  IK   L+SLSEQQLVDCA+N
Sbjct: 126 DTPTSLDWREQGAVTDVKNQGNCGSCWAFSAVAAVEGIVKIKNGNLISLSEQQLVDCASN 185

Query: 175 DNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPP 234
           + N GC GGFMD+AF YI +N GI ++  Y Y G   G C + +    AA+I+ YEDVP 
Sbjct: 186 EQNQGCGGGFMDNAFSYITEN-GIASENDYQYRG-GAGTCQNNEMITPAARISGYEDVPA 243

Query: 235 NDEESLLKAVANQPVSVAID-ASALQFYSGGVFNGYCETFLNHGVTAVGYGTSEE-GIKY 292
             E+ LL AV+ QPVSVAI    +   Y  G+++G C + LNHGVT VGYGTSEE G KY
Sbjct: 244 G-EDQLLLAVSQQPVSVAIAVGQSFHLYKEGIYSGPCGSSLNHGVTLVGYGTSEEDGTKY 302

Query: 293 WLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
           WLIKNSWG+ WGE+GY RL R+  Q +G CGIA+ AS P 
Sbjct: 303 WLIKNSWGESWGENGYMRLLRESGQSEGHCGIAVKASHPT 342


>gi|297794671|ref|XP_002865220.1| senescence-associated gene 12 [Arabidopsis lyrata subsp. lyrata]
 gi|297311055|gb|EFH41479.1| senescence-associated gene 12 [Arabidopsis lyrata subsp. lyrata]
          Length = 346

 Score =  303 bits (775), Expect = 1e-79,   Method: Compositional matrix adjust.
 Identities = 165/341 (48%), Positives = 211/341 (61%), Gaps = 18/341 (5%)

Query: 6   LIVVLIISGSCASQATYRTFDEGSIAEKFE-QWKAQYGRTYKESAENSKRFEIFKDNLVA 64
            + V I S    S +  R  D   I +K   +W  ++GR Y +  E S R+ +FK N+  
Sbjct: 9   FLFVAIFSSFYFSISLSRPLDNELIMQKRHIEWMTKHGRVYADVKEKSNRYVVFKSNVER 68

Query: 65  VERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFK-MSDHSSSLKANGTPFLYK---S 120
           +E  NN   G R++ L +N+FADLT  EF +  TGFK +S  SS  +   T F Y+   S
Sbjct: 69  IEHLNNIPAG-RTFKLAVNQFADLTNDEFRSMYTGFKGVSSLSSQSQTKTTSFRYQNVSS 127

Query: 121 SQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCAT 173
             +P SV+W  KGAVTP+K QG C       AVAA+EG   IK  +L+SLSEQQLVDC T
Sbjct: 128 GALPISVDWRTKGAVTPIKNQGSCGCCWAFSAVAAIEGATQIKKGKLISLSEQQLVDCDT 187

Query: 174 NDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVP 233
           ND   GC GG MD AF++I+   G+T ++ Y Y+G     C+S K    A  IT YEDVP
Sbjct: 188 NDF--GCEGGLMDTAFEHIMATGGLTTESNYPYKG-EDATCNSKKTNPKATSITGYEDVP 244

Query: 234 PNDEESLLKAVANQPVSVAIDASAL--QFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIK 291
            NDE++L+KAVA+QPVSV I+      QFYS GVF G C T+L+H VTA+GYG S  G K
Sbjct: 245 VNDEQALMKAVAHQPVSVGIEGGGFDFQFYSSGVFTGECTTYLDHAVTAIGYGQSTNGSK 304

Query: 292 YWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
           YW+IKNSWG  WGE GY R+Q+DI   QG CG+AM AS+P 
Sbjct: 305 YWIIKNSWGTKWGESGYMRIQKDIKDKQGLCGLAMKASYPT 345


>gi|125547236|gb|EAY93058.1| hypothetical protein OsI_14861 [Oryza sativa Indica Group]
          Length = 339

 Score =  303 bits (775), Expect = 1e-79,   Method: Compositional matrix adjust.
 Identities = 155/346 (44%), Positives = 220/346 (63%), Gaps = 22/346 (6%)

Query: 1   MAKYFLIVVLIISGSCASQATYRTF-DEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFK 59
           MAK  L  +L     C++    R   D+ ++A + E+W AQYGR Y++ AE ++RFE+FK
Sbjct: 3   MAKALLFAILGCLCLCSAVLAARELSDDAAMAARHERWMAQYGRVYRDDAEKARRFEVFK 62

Query: 60  DNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYK 119
            N+  +E FN    GN ++ L +N+FADLT  EF  ++T       ++ +    T F Y+
Sbjct: 63  ANVAFIESFN---AGNHNFWLGVNQFADLTNDEFRWTKTNKGFIPSTTRVP---TGFRYE 116

Query: 120 SSQV---PPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLV 169
           +  +   P +V+W  KGAVTP+K QGQC       AVAA+EGI  +   +L+SLSEQ+LV
Sbjct: 117 NVNIDALPATVDWRTKGAVTPIKDQGQCGCCWAFSAVAAMEGIVKLSTGKLISLSEQELV 176

Query: 170 DCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNY 229
           DC  +  + GC GG MDDAFK+II+N G+T ++ Y Y   +   C S+   +  A I  Y
Sbjct: 177 DCDVHGEDQGCEGGLMDDAFKFIIKNGGLTTESNYPYAA-ADDKCKSV--SNSVASIKGY 233

Query: 230 EDVPPNDEESLLKAVANQPVSVAIDASAL--QFYSGGVFNGYCETFLNHGVTAVGYGTSE 287
           EDVP N+E +L+KAVANQPVSVA+D   +  QFY GGV  G C T L+HG+ A+GYG + 
Sbjct: 234 EDVPANNEAALMKAVANQPVSVAVDGGDMTFQFYKGGVMTGSCGTDLDHGIVAIGYGKAS 293

Query: 288 EGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVS 333
           +G KYWL+KNSWG  WGE+G+ R+++DI   +G CG+AM  S+P +
Sbjct: 294 DGTKYWLLKNSWGTTWGENGFLRMEKDISDKRGMCGLAMEPSYPTA 339


>gi|544129|sp|P25803.2|CYSEP_PHAVU RecName: Full=Vignain; AltName: Full=Bean endopeptidase; AltName:
           Full=Cysteine proteinase EP-C1; Flags: Precursor
 gi|20994|emb|CAA44816.1| endopeptidase [Phaseolus vulgaris]
          Length = 362

 Score =  302 bits (774), Expect = 1e-79,   Method: Compositional matrix adjust.
 Identities = 169/361 (46%), Positives = 222/361 (61%), Gaps = 29/361 (8%)

Query: 2   AKYFLIVVLIISGSCASQATYRTFD-----EGSIAEKFEQWKAQYGRTYKESAENSKRFE 56
            K  L VVL  S       ++   D     E S+ + +E+W++ +    +   E  KRF 
Sbjct: 3   TKKLLWVVLSFSLVLGVANSFDFHDKDLASEESLWDLYERWRSHH-TVSRSLGEKHKRFN 61

Query: 57  IFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTP- 115
           +FK NL+ V   N     ++ Y L+LNKFAD+T  EF ++  G K+ +H    +  GTP 
Sbjct: 62  VFKANLMHVHNTNKM---DKPYKLKLNKFADMTNHEFRSTYAGSKV-NHPRMFR--GTPH 115

Query: 116 ----FLY-KSSQVPPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSL 163
               F+Y K   VPPSV+W +KGAVT VK QGQC        V AVEGIN IK N+LV+L
Sbjct: 116 ENGAFMYEKVVSVPPSVDWRKKGAVTDVKDQGQCGSCWAFSTVVAVEGINQIKTNKLVAL 175

Query: 164 SEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHA 223
           SEQ+LVDC   + N GC GG M+ AF++I Q  GIT ++ Y Y+    G CD+ K  D A
Sbjct: 176 SEQELVDC-DKEENQGCNGGLMESAFEFIKQKGGITTESNYPYKAQE-GTCDASKVNDLA 233

Query: 224 AQITNYEDVPPNDEESLLKAVANQPVSVAIDA--SALQFYSGGVFNGYCETFLNHGVTAV 281
             I  +E+VP NDE++LLKAVANQPVSVAIDA  S  QFYS GVF G C T LNHGV  V
Sbjct: 234 VSIDGHENVPANDEDALLKAVANQPVSVAIDAGGSDFQFYSEGVFTGDCSTDLNHGVAIV 293

Query: 282 GYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVSKESAQPSS 341
           GYGT+ +G  YW+++NSWG +WGE GY R+QR+I + +G CGIAM  S+P+   S  P+ 
Sbjct: 294 GYGTTVDGTNYWIVRNSWGPEWGEHGYIRMQRNISKKEGLCGIAMLPSYPIKNSSDNPTG 353

Query: 342 A 342
           +
Sbjct: 354 S 354


>gi|242092700|ref|XP_002436840.1| hypothetical protein SORBIDRAFT_10g009830 [Sorghum bicolor]
 gi|241915063|gb|EER88207.1| hypothetical protein SORBIDRAFT_10g009830 [Sorghum bicolor]
          Length = 328

 Score =  302 bits (774), Expect = 1e-79,   Method: Compositional matrix adjust.
 Identities = 157/338 (46%), Positives = 219/338 (64%), Gaps = 23/338 (6%)

Query: 3   KYFLIVVLIISGSCASQATYRTF-DEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDN 61
           K  ++ +L ++  C +    R   D+ ++  + EQW  QY R YK++ E ++RFE+FK N
Sbjct: 5   KASILAILGLAFFCGAALAARDLNDDSAMVARHEQWMVQYSRVYKDTTEKARRFEVFKAN 64

Query: 62  LVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQT--GFKMSDHSSSLKANGTPFLYK 119
           +  +E FN  A GNR + L +N+FADLT  EF A++T  GFK     S +K   T F Y+
Sbjct: 65  VKFIESFN--AGGNRKFWLGVNQFADLTNDEFRATKTNKGFK----PSPVKVP-TGFRYE 117

Query: 120 SSQV---PPSVNWIEKGAVTPVKYQGQCAVAAVEGINAIKINRLVSLSEQQLVDCATNDN 176
           +  V   P +++W  KGAVTP+K QGQC     EGI  I   +L+SLSEQ+LVDC  +  
Sbjct: 118 NVSVDALPATIDWRTKGAVTPIKDQGQC-----EGIVKISTGKLISLSEQELVDCDVHGE 172

Query: 177 NNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPND 236
           + GC GG MDDAF++II+N G+T ++ Y Y   + G C S    + AA +  +EDVP ND
Sbjct: 173 DQGCEGGLMDDAFQFIIKNGGLTTESSYPYTA-ADGKCKS--GSNSAATVKGFEDVPAND 229

Query: 237 EESLLKAVANQPVSVAIDASAL--QFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWL 294
           E +L+KAVANQPVSVA+D   +  QFYSGGV  G C T L+HG+ A+GYG + +G KYWL
Sbjct: 230 EAALMKAVANQPVSVAVDGGDMTFQFYSGGVMTGSCGTDLDHGIAAIGYGQTSDGTKYWL 289

Query: 295 IKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
           +KNSWG  WGE+GY R+++DI   +G CG+AM  S+P+
Sbjct: 290 LKNSWGTTWGENGYLRMEKDISDKRGMCGLAMEPSYPI 327


>gi|157093728|gb|ABV22590.1| KDEL-tailed cysteine endopeptidase [Solanum lycopersicum]
          Length = 360

 Score =  302 bits (774), Expect = 1e-79,   Method: Compositional matrix adjust.
 Identities = 167/358 (46%), Positives = 215/358 (60%), Gaps = 23/358 (6%)

Query: 1   MAKYFLIV----VLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFE 56
           M K FL++    +++  G            E    E +E+W++ +  + +   E  KRF 
Sbjct: 1   MKKLFLVLFTLALVLRLGESFDFHEKELETEEKFWELYERWRSHHTVS-RSLDEKHKRFN 59

Query: 57  IFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSL---KANG 113
           +FK N+  V  FN     ++ Y L+LNKFAD+T  EF     G K+  H + L   +ANG
Sbjct: 60  VFKANVHYVHNFNKK---DKPYKLKLNKFADMTNHEFRQHYAGSKIKHHRTLLGASRANG 116

Query: 114 TPFLYKSSQVPPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQ 166
           T        VPPS++W +KGAVTPVK QGQC        V AVEGIN IK  +LVSLSEQ
Sbjct: 117 TFMYANEDNVPPSIDWRKKGAVTPVKDQGQCGSCWAFSTVVAVEGINQIKTKKLVSLSEQ 176

Query: 167 QLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQI 226
           +LVDC T +N  GC GG MD AF +I +  GIT +  Y Y+      CD  K       I
Sbjct: 177 ELVDCDTTENQ-GCNGGLMDPAFDFIKKRGGITTEERYPYKAEDDK-CDIQKRNTPVVSI 234

Query: 227 TNYEDVPPNDEESLLKAVANQPVSVAIDASA--LQFYSGGVFNGYCETFLNHGVTAVGYG 284
             +EDVPPNDE++LLKAVANQP+SVAIDAS    QFYS GVF G C T L+HGV  VGYG
Sbjct: 235 DGHEDVPPNDEDALLKAVANQPISVAIDASGSQFQFYSEGVFTGECGTELDHGVAIVGYG 294

Query: 285 TSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVSKESAQPSSA 342
           T+ +G KYW++KNSWG  WGE GY R+QR +D  +G CGIAM  S+P+ K S+ P+ +
Sbjct: 295 TTVDGTKYWIVKNSWGAGWGEKGYIRMQRKVDAEEGLCGIAMQPSYPI-KTSSNPTGS 351


>gi|356543118|ref|XP_003540010.1| PREDICTED: LOW QUALITY PROTEIN: vignain-like [Glycine max]
          Length = 339

 Score =  302 bits (774), Expect = 1e-79,   Method: Compositional matrix adjust.
 Identities = 164/343 (47%), Positives = 219/343 (63%), Gaps = 23/343 (6%)

Query: 3   KYFLIVVLIISGSCASQATYRTFDEGS--IAEKFEQWKAQYGRTYKESAENSKRFEIFKD 60
           ++ L +VL++   C SQ   R   E S  ++E+ EQW  +YG+ YK++AE  KR  IFKD
Sbjct: 8   QHILALVLLLP-ICISQVMSRNLHEASXCMSERHEQWTKKYGKVYKDAAEKQKRLLIFKD 66

Query: 61  NLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKS 120
           N+  +E FN  A GN+ Y L +N   D T +EF+AS  G+K     S      TPF Y++
Sbjct: 67  NVEFIESFN--AAGNKPYKLSINHLTDQTNEEFVASHNGYKHKGSHSQ-----TPFKYEN 119

Query: 121 -SQVPPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCA 172
            + VP +V+W E GAV  +K QGQC        VA  EGI  I  + L+SLSEQ+LVDC 
Sbjct: 120 ITGVPNAVDWRENGAVXAMKDQGQCGNCWAFSTVATTEGIYQITTSMLMSLSEQELVDCD 179

Query: 173 TNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDV 232
           + D+  GC GG+M+  F++I +N GI+++A Y Y  +  G  D+ K    AAQI  YE V
Sbjct: 180 SVDH--GCDGGYMEGGFEFIXKNGGISSEANYPYTAVD-GTYDANKEASPAAQIKGYETV 236

Query: 233 PPNDEESLLKAVANQPVSVAID--ASALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGI 290
           P N E++L KAVANQPVSV ID   SA QF S GVF G C T L+HGVTAVGYG++++G 
Sbjct: 237 PANSEDALQKAVANQPVSVTIDVGGSAFQFNSSGVFTGQCGTQLDHGVTAVGYGSTDDGT 296

Query: 291 KYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVS 333
           +YW++KNSWG  WGE+GY R+QR  D  +G CGIAM AS+P +
Sbjct: 297 QYWIVKNSWGTQWGEEGYIRMQRGTDAQEGLCGIAMDASYPTA 339


>gi|445927|prf||1910332A Cys endopeptidase
          Length = 362

 Score =  302 bits (773), Expect = 2e-79,   Method: Compositional matrix adjust.
 Identities = 159/328 (48%), Positives = 208/328 (63%), Gaps = 18/328 (5%)

Query: 27  EGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFA 86
           E S+ + +E+W++ +    +   E  KRF +FK N++ V   N     ++ Y L+LNKFA
Sbjct: 33  EESLWDLYERWRSHH-TVSRSLGEKHKRFNVFKANVMHVHNTNKM---DKPYKLKLNKFA 88

Query: 87  DLTPQEFIASQTGFKMSDHS---SSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQ 143
           D+T  EF ++  G K++ H     S   +GT    K   VP SV+W +KGAVT VK QGQ
Sbjct: 89  DMTNHEFRSTYAGSKVNHHKMFRGSQHGSGTFMYEKVGSVPASVDWRKKGAVTDVKDQGQ 148

Query: 144 CA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNK 196
           C        + AVEGIN IK N+LVSLSEQ+LVDC   + N GC GG M+ AF++I Q  
Sbjct: 149 CGSCWAFSTIVAVEGINQIKTNKLVSLSEQELVDC-DKEENQGCNGGLMESAFEFIKQKG 207

Query: 197 GITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDA- 255
           GIT ++ Y Y+    G CD  K  D A  I  +E+VP NDE +LLKAVANQPVSVAIDA 
Sbjct: 208 GITTESNYPYKAQE-GTCDESKVNDLAVSIDGHENVPVNDENALLKAVANQPVSVAIDAG 266

Query: 256 -SALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRD 314
            S  QFYS GVF G C T LNHGV  VGYGT+ +G  YW+++NSWG +WGE GY R+QR+
Sbjct: 267 GSDFQFYSEGVFTGDCNTDLNHGVAIVGYGTTVDGTNYWIVRNSWGPEWGEQGYIRMQRN 326

Query: 315 IDQPQGQCGIAMFASFPVSKESAQPSSA 342
           I + +G CGIAM AS+P+   S  P+ +
Sbjct: 327 ISKKEGLCGIAMMASYPIKNSSDNPTGS 354


>gi|116309178|emb|CAH66275.1| OSIGBa0147O06.5 [Oryza sativa Indica Group]
          Length = 339

 Score =  302 bits (773), Expect = 2e-79,   Method: Compositional matrix adjust.
 Identities = 154/346 (44%), Positives = 219/346 (63%), Gaps = 22/346 (6%)

Query: 1   MAKYFLIVVLIISGSCASQATYRTF-DEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFK 59
           MAK  L  +L     C++    R   D+ ++A + E+W AQYGR YK+ AE ++RFE+FK
Sbjct: 3   MAKALLFAILGCLCLCSAVLAARELSDDAAMAARHERWMAQYGRMYKDDAEKARRFEVFK 62

Query: 60  DNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYK 119
            N+  +E FN    GN  + L +N+FADLT  EF +++T       ++ +    T F Y+
Sbjct: 63  ANVAFIESFN---AGNHKFWLGVNQFADLTNDEFRSTKTNKGFIPSTTRVP---TGFRYE 116

Query: 120 SSQV---PPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLV 169
           +  +   P +++W  KG VTP+K QGQC       AVAA+EGI  +   +L+SLSEQ+LV
Sbjct: 117 NVNIDALPATMDWRTKGVVTPIKDQGQCGCCWAFSAVAAMEGIVKLSTGKLISLSEQELV 176

Query: 170 DCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNY 229
           DC  +  + GC GG MDDAFK+II+N G+T ++ Y Y   +   C S+   +  A I  Y
Sbjct: 177 DCDVHGEDQGCEGGLMDDAFKFIIKNGGLTTESNYPYAA-ADDKCKSV--SNSVASIKGY 233

Query: 230 EDVPPNDEESLLKAVANQPVSVAIDASAL--QFYSGGVFNGYCETFLNHGVTAVGYGTSE 287
           EDVP N+E +L+KAVANQPVSVA+D   +  QFY GGV  G C T L+HG+ A+GYG + 
Sbjct: 234 EDVPANNEAALMKAVANQPVSVAVDGGDMTFQFYKGGVMTGSCGTDLDHGIVAIGYGKAS 293

Query: 288 EGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVS 333
           +G KYWL+KNSWG  WGE+G+ R+++DI   +G CG+AM  S+P +
Sbjct: 294 DGTKYWLLKNSWGTTWGENGFLRMEKDISDKRGMCGLAMEPSYPTA 339


>gi|297826061|ref|XP_002880913.1| hypothetical protein ARALYDRAFT_481640 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297326752|gb|EFH57172.1| hypothetical protein ARALYDRAFT_481640 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 347

 Score =  301 bits (772), Expect = 2e-79,   Method: Compositional matrix adjust.
 Identities = 168/350 (48%), Positives = 221/350 (63%), Gaps = 20/350 (5%)

Query: 1   MAKYFLIVVLIISGSCASQATYRT-FDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFK 59
           M+   + ++ I      S AT R    E S  EK EQW A++ R Y + +E   RF IFK
Sbjct: 1   MSSTIIFILTIFLSYRTSLATSRGGLFEASPIEKHEQWMARFNRVYSDESEKRNRFNIFK 60

Query: 60  DNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHS---SSLKANGT-P 115
            NL  V+ FN     N +Y L +N+F+DLT +EF A+ TG  + +     S+L ++ T P
Sbjct: 61  KNLEFVQSFNMNK--NITYKLDVNEFSDLTDEEFRATHTGLVVPEEITGISTLSSDKTVP 118

Query: 116 FLYKS-SQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQ 167
           F Y + S    S++W ++GAVTPVKYQG+C       AVAAVEGI  I    LVSLSEQQ
Sbjct: 119 FRYGNVSDTGESMDWRQEGAVTPVKYQGRCGGCWAFSAVAAVEGITKITKGELVSLSEQQ 178

Query: 168 LVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAED--HAAQ 225
           L+DC T D N GC+GG M  AF+YII+N+GIT +  Y Y+        S        AA 
Sbjct: 179 LLDCDT-DYNQGCHGGIMSKAFEYIIKNQGITTEDNYPYQESQQTCSSSTTLSSSFRAAT 237

Query: 226 ITNYEDVPPNDEESLLKAVANQPVSVAIDASALQF--YSGGVFNGYCETFLNHGVTAVGY 283
           I+ YE VP N+EE+LL+AV+ QPVSV I+ +   F  YSGG+FNG C T L+H VT VGY
Sbjct: 238 ISGYETVPMNNEEALLQAVSQQPVSVGIEGTGAGFRHYSGGIFNGECGTDLHHAVTIVGY 297

Query: 284 GTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVS 333
           G SEEG KYW++KNSWG+ WGEDG+ R++RD+D PQG CG+AM A +P++
Sbjct: 298 GMSEEGTKYWVVKNSWGETWGEDGFMRIKRDVDAPQGMCGLAMLAFYPLA 347


>gi|242072394|ref|XP_002446133.1| hypothetical protein SORBIDRAFT_06g002160 [Sorghum bicolor]
 gi|241937316|gb|EES10461.1| hypothetical protein SORBIDRAFT_06g002160 [Sorghum bicolor]
          Length = 338

 Score =  301 bits (772), Expect = 2e-79,   Method: Compositional matrix adjust.
 Identities = 157/343 (45%), Positives = 217/343 (63%), Gaps = 21/343 (6%)

Query: 2   AKYFLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDN 61
           +K FL+ +L  +  C+S    R   + ++ E+ E W  +YGR YK++AE ++RFE FK N
Sbjct: 4   SKAFLLAILGCASLCSSVLAARELSDAAMVERHENWMVEYGRVYKDAAEKARRFEAFKHN 63

Query: 62  LVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKS- 120
           +  VE FN        + L +N+FADLT +EF A++ GFK     S+     T F Y++ 
Sbjct: 64  VAFVESFNTNK--KNKFWLGVNQFADLTTEEFKANK-GFK---PISAEMVPTTGFKYENL 117

Query: 121 --SQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDC 171
             S +P +V+W  KGAVTP+K QGQC       AVAA+EGI  +    L+SLSEQ+LVDC
Sbjct: 118 SVSALPTAVDWRTKGAVTPIKNQGQCGCCWAFSAVAAMEGIVKLSTGNLISLSEQELVDC 177

Query: 172 ATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYED 231
            T+  + GC GG+MD AF+++I+N G+  ++ Y Y+ +  G C        AA I  +ED
Sbjct: 178 DTHSMDEGCEGGWMDSAFEFVIKNGGLATESSYPYKAVD-GKCKG--GSKSAATIKGHED 234

Query: 232 VPPNDEESLLKAVANQPVSVAIDASALQF--YSGGVFNGYCETFLNHGVTAVGYGTSEEG 289
           VP NDE +L+KAVANQPVSVA+DAS   F  YSGGV  G C T L+HG+ A+GYG   +G
Sbjct: 235 VPVNDEAALMKAVANQPVSVAVDASDRTFMLYSGGVMTGSCGTELDHGIAAIGYGVESDG 294

Query: 290 IKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
            KYW++KNSWG  WGE G+ R+++DI   QG CG+AM  S+P 
Sbjct: 295 TKYWILKNSWGTTWGEKGFLRMEKDISDKQGMCGLAMKPSYPT 337


>gi|357167190|ref|XP_003581045.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like
           [Brachypodium distachyon]
          Length = 415

 Score =  301 bits (772), Expect = 2e-79,   Method: Compositional matrix adjust.
 Identities = 154/341 (45%), Positives = 213/341 (62%), Gaps = 19/341 (5%)

Query: 5   FLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVA 64
           FLI +L  + + ++ A     D+ S+  + EQW A+YGR Y + AE ++R E+FK N+  
Sbjct: 82  FLIAILACTCAVSALAARDLTDDLSMVARHEQWMAKYGRVYNDVAEKAQRLEVFKANVAF 141

Query: 65  VERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKS---S 121
           +E  N    GN  ++L  N+FAD+T  EF A+ TG+K    +   K   T F Y +    
Sbjct: 142 IELVN---AGNDKFSLEANQFADMTVDEFRAAHTGYKPVPAN---KGRTTQFKYANVSLD 195

Query: 122 QVPPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATN 174
            +P S++W  KGAVTP+K QGQC        VA+VEGI  +   +L+SLSEQ+LVDC  +
Sbjct: 196 ALPASMDWRAKGAVTPIKDQGQCGCCWAFSTVASVEGIVKLSTGKLISLSEQELVDCDVD 255

Query: 175 DNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPP 234
             + GC GG MD+AF++II N G+T +  Y Y G     C+S K  +  A I  YEDVP 
Sbjct: 256 GMDQGCEGGLMDNAFEFIIDNGGLTTEGNYPYTGTDDS-CNSNKESNDVASIKGYEDVPS 314

Query: 235 NDEESLLKAVANQPVSVAIDA--SALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKY 292
           NDE SLLKAVA QPVS+A+D   +  +FY GGV +G C T L+HG+ AVGYG + +G K+
Sbjct: 315 NDETSLLKAVAAQPVSIAVDGGDNLFRFYKGGVLSGACGTELDHGIAAVGYGITSDGTKF 374

Query: 293 WLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVS 333
           WL+KNSWG  WGE G+ R++RDI   +G CG+AM  S+P +
Sbjct: 375 WLMKNSWGTSWGEKGFIRMERDIADEEGLCGLAMQPSYPTA 415


>gi|356543114|ref|XP_003540008.1| PREDICTED: LOW QUALITY PROTEIN: KDEL-tailed cysteine endopeptidase
           CEP1-like [Glycine max]
          Length = 343

 Score =  301 bits (772), Expect = 2e-79,   Method: Compositional matrix adjust.
 Identities = 163/329 (49%), Positives = 214/329 (65%), Gaps = 16/329 (4%)

Query: 16  CASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGN 75
           C SQ   R   + S+ E+ EQW  +YG+ YK+SAE  KRF IF++N+  +E FN  A GN
Sbjct: 20  CTSQVKSRKLHDASMYERHEQWMEKYGKVYKDSAEMQKRFLIFENNVEFIESFN--AAGN 77

Query: 76  RSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKS-SQVPPSVNWIEKGA 134
           + Y L +N  AD T +EF+AS  G+K S          TPF Y++ + +P +V+W +KG 
Sbjct: 78  KPYKLSINHLADQTNEEFMASHKGYKGSHWQGLRITTQTPFKYENVTDIPWAVDWRQKGD 137

Query: 135 VTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDD 187
           VT +K Q QC       AVAA EGI  I    LVSLSE++LVDC + D+  GC GG M+ 
Sbjct: 138 VTSIKDQAQCGNCWAFSAVAATEGIYQITTGNLVSLSEKELVDCDSVDH--GCDGGLMEH 195

Query: 188 AFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQ 247
            F++II+N GI+++A Y Y  ++ G CD+ K     AQIT YE VP N EE L KAVANQ
Sbjct: 196 GFEFIIKNGGISSEANYPYTAVN-GTCDTNKEASPVAQITGYETVPVNCEEELQKAVANQ 254

Query: 248 -PVSVAIDA--SALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWG 304
             +SV+IDA  SA QFY  GVF G C T L+HGVTAVGYG+++ G +YW++KNSWG  WG
Sbjct: 255 LTMSVSIDAGGSAFQFYPSGVFTGQCGTQLDHGVTAVGYGSTDYGTQYWIVKNSWGTQWG 314

Query: 305 EDGYFRLQRDIDQPQGQCGIAMFASFPVS 333
           E+GY R+ R ID  +G CGIAM AS+P +
Sbjct: 315 EEGYIRMLRGIDAQEGLCGIAMDASYPTA 343


>gi|38345008|emb|CAD40026.2| OSJNBa0052O21.11 [Oryza sativa Japonica Group]
 gi|125589414|gb|EAZ29764.1| hypothetical protein OsJ_13822 [Oryza sativa Japonica Group]
          Length = 339

 Score =  301 bits (772), Expect = 2e-79,   Method: Compositional matrix adjust.
 Identities = 155/346 (44%), Positives = 219/346 (63%), Gaps = 22/346 (6%)

Query: 1   MAKYFLIVVLIISGSCASQATYRTF-DEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFK 59
           MAK  L  +L     C++    R   D+ ++A + E+W AQYGR Y++ AE ++RFE+FK
Sbjct: 3   MAKALLFAILGCLCLCSAVLAARELSDDAAMAARHERWMAQYGRVYRDDAEKARRFEVFK 62

Query: 60  DNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYK 119
            N+  +E FN    GN ++ L +N+FADLT  EF   +T       ++ +    T F Y+
Sbjct: 63  ANVAFIESFN---AGNHNFWLGVNQFADLTNDEFRWMKTNKGFIPSTTRVP---TGFRYE 116

Query: 120 SSQV---PPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLV 169
           +  +   P +V+W  KGAVTP+K QGQC       AVAA+EGI  +   +L+SLSEQ+LV
Sbjct: 117 NVNIDALPATVDWRTKGAVTPIKDQGQCGCCWAFSAVAAMEGIVKLSTGKLISLSEQELV 176

Query: 170 DCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNY 229
           DC  +  + GC GG MDDAFK+II+N G+T ++ Y Y   +   C S+   +  A I  Y
Sbjct: 177 DCDVHGEDQGCEGGLMDDAFKFIIKNGGLTTESNYPYAA-ADDKCKSV--SNSVASIKGY 233

Query: 230 EDVPPNDEESLLKAVANQPVSVAIDASAL--QFYSGGVFNGYCETFLNHGVTAVGYGTSE 287
           EDVP N+E +L+KAVANQPVSVA+D   +  QFY GGV  G C T L+HG+ A+GYG + 
Sbjct: 234 EDVPANNEAALMKAVANQPVSVAVDGGDMTFQFYKGGVMTGSCGTDLDHGIVAIGYGKAS 293

Query: 288 EGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVS 333
           +G KYWL+KNSWG  WGE+G+ R+++DI   +G CG+AM  S+P +
Sbjct: 294 DGTKYWLLKNSWGTTWGENGFLRMEKDISDKRGMCGLAMEPSYPTA 339


>gi|118158|sp|P12412.1|CYSEP_VIGMU RecName: Full=Vignain; AltName: Full=Bean endopeptidase; AltName:
           Full=Cysteine proteinase; AltName:
           Full=Sulfhydryl-endopeptidase; Short=SH-EP; Contains:
           RecName: Full=Vignain-1; Contains: RecName:
           Full=Vignain-2; Flags: Precursor
 gi|22062|emb|CAA33753.1| sulfhydryl-pre-endopeptidase (AA -20 to 342) [Vigna mungo]
 gi|22066|emb|CAA36181.1| sulfhydryl-endopeptidase [Vigna mungo]
          Length = 362

 Score =  301 bits (771), Expect = 3e-79,   Method: Compositional matrix adjust.
 Identities = 159/328 (48%), Positives = 207/328 (63%), Gaps = 18/328 (5%)

Query: 27  EGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFA 86
           E S+ + +E+W++ +    +   E  KRF +FK N++ V   N     ++ Y L+LNKFA
Sbjct: 33  EESLWDLYERWRSHH-TVSRSLGEKHKRFNVFKANVMHVHNTNKM---DKPYKLKLNKFA 88

Query: 87  DLTPQEFIASQTGFKMSDHS---SSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQ 143
           D+T  EF ++  G K++ H     S   +GT    K   VP SV+W +KGAVT VK QGQ
Sbjct: 89  DMTNHEFRSTYAGSKVNHHKMFRGSQHGSGTFMYEKVGSVPASVDWRKKGAVTDVKDQGQ 148

Query: 144 CA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNK 196
           C        + AVEGIN IK N+LVSLSEQ+LVDC   + N GC GG M+ AF++I Q  
Sbjct: 149 CGSCWAFSTIVAVEGINQIKTNKLVSLSEQELVDC-DKEENQGCNGGLMESAFEFIKQKG 207

Query: 197 GITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDA- 255
           GIT ++ Y Y     G CD  K  D A  I  +E+VP NDE +LLKAVANQPVSVAIDA 
Sbjct: 208 GITTESNYPYTAQE-GTCDESKVNDLAVSIDGHENVPVNDENALLKAVANQPVSVAIDAG 266

Query: 256 -SALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRD 314
            S  QFYS GVF G C T LNHGV  VGYGT+ +G  YW+++NSWG +WGE GY R+QR+
Sbjct: 267 GSDFQFYSEGVFTGDCNTDLNHGVAIVGYGTTVDGTNYWIVRNSWGPEWGEQGYIRMQRN 326

Query: 315 IDQPQGQCGIAMFASFPVSKESAQPSSA 342
           I + +G CGIAM AS+P+   S  P+ +
Sbjct: 327 ISKKEGLCGIAMMASYPIKNSSDNPTGS 354


>gi|1223922|gb|AAA92063.1| cysteinyl endopeptidase [Vigna radiata]
          Length = 362

 Score =  301 bits (770), Expect = 3e-79,   Method: Compositional matrix adjust.
 Identities = 158/328 (48%), Positives = 208/328 (63%), Gaps = 18/328 (5%)

Query: 27  EGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFA 86
           E S+ + +E+W++ +    +   E  KRF +FK+N++ V   N     ++ Y L+LNKFA
Sbjct: 33  EESLWDLYERWRSHH-TVSRSLTEKHKRFNVFKENVMHVHNTNKM---DKPYKLKLNKFA 88

Query: 87  DLTPQEFIASQTGFKMSDHS---SSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQ 143
           D+T  EF ++  G K++ H     +   NGT    K   VP SV+W +KGAVT VK QGQ
Sbjct: 89  DMTNHEFRSTYAGSKVNHHKMFRGTQHGNGTFMYEKVGSVPASVDWRKKGAVTDVKDQGQ 148

Query: 144 CA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNK 196
           C        V AVEGIN IK ++LVSLSEQ+LVDC   + N GC GG M+ AF++I Q  
Sbjct: 149 CGSCWAFSTVVAVEGINQIKTDKLVSLSEQELVDC-DKEENQGCNGGLMESAFEFIKQKG 207

Query: 197 GITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDA- 255
           GIT ++ Y Y     G CD+ K  D A  I  +E+VP NDE +LLKAVANQPVSVAIDA 
Sbjct: 208 GITTESNYPYTAQE-GTCDASKVNDLAVSIDGHENVPVNDENALLKAVANQPVSVAIDAG 266

Query: 256 -SALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRD 314
            S  QFYS GV  G C T LNHGV  VGYGT+ +G  YW+++NSWG +WGE GY R+QR+
Sbjct: 267 GSDFQFYSEGVLTGDCNTDLNHGVAIVGYGTTVDGTNYWIVRNSWGPEWGEQGYIRMQRN 326

Query: 315 IDQPQGQCGIAMFASFPVSKESAQPSSA 342
           I + +G CGIAM AS+P+   S  P+ +
Sbjct: 327 ISKKEGLCGIAMMASYPIKNSSDNPTGS 354


>gi|255540425|ref|XP_002511277.1| cysteine protease, putative [Ricinus communis]
 gi|46395620|sp|O65039.1|CYSEP_RICCO RecName: Full=Vignain; AltName: Full=Cysteine endopeptidase; Flags:
           Precursor
 gi|2944446|gb|AAC62396.1| cysteine endopeptidase precursor [Ricinus communis]
 gi|223550392|gb|EEF51879.1| cysteine protease, putative [Ricinus communis]
          Length = 360

 Score =  300 bits (769), Expect = 5e-79,   Method: Compositional matrix adjust.
 Identities = 165/324 (50%), Positives = 203/324 (62%), Gaps = 18/324 (5%)

Query: 34  FEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEF 93
           +E+W++ +    +   E  KRF +FK N + V   +NA   ++ Y L+LNKFAD+T  EF
Sbjct: 38  YERWRSHH-TVSRSLHEKQKRFNVFKHNAMHV---HNANKMDKPYKLKLNKFADMTNHEF 93

Query: 94  IASQTGFKMSDHS---SSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQCA----- 145
             + +G K+  H       + NGT    K   VP SV+W +KGAVT VK QGQC      
Sbjct: 94  RNTYSGSKVKHHRMFRGGPRGNGTFMYEKVDTVPASVDWRKKGAVTSVKDQGQCGSCWAF 153

Query: 146 --VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAV 203
             + AVEGIN IK N+LVSLSEQ+LVDC T D N GC GG MD AF++I Q  GIT +A 
Sbjct: 154 STIVAVEGINQIKTNKLVSLSEQELVDCDT-DQNQGCNGGLMDYAFEFIKQRGGITTEAN 212

Query: 204 YSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDA--SALQFY 261
           Y YE    G CD  K    A  I  +E+VP NDE +LLKAVANQPVSVAIDA  S  QFY
Sbjct: 213 YPYEAYD-GTCDVSKENAPAVSIDGHENVPENDENALLKAVANQPVSVAIDAGGSDFQFY 271

Query: 262 SGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQ 321
           S GVF G C T L+HGV  VGYGT+ +G KYW +KNSWG +WGE GY R++R I   +G 
Sbjct: 272 SEGVFTGSCGTELDHGVAIVGYGTTIDGTKYWTVKNSWGPEWGEKGYIRMERGISDKEGL 331

Query: 322 CGIAMFASFPVSKESAQPSSADKS 345
           CGIAM AS+P+ K S  PS    S
Sbjct: 332 CGIAMEASYPIKKSSNNPSGIKSS 355


>gi|18403438|ref|NP_565780.1| cysteine proteinase-like protein [Arabidopsis thaliana]
 gi|2342728|gb|AAB67626.1| cysteine proteinase [Arabidopsis thaliana]
 gi|330253821|gb|AEC08915.1| cysteine proteinase-like protein [Arabidopsis thaliana]
          Length = 345

 Score =  300 bits (768), Expect = 6e-79,   Method: Compositional matrix adjust.
 Identities = 169/345 (48%), Positives = 223/345 (64%), Gaps = 23/345 (6%)

Query: 6   LIVVLII--SGSCASQATYRT--FDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDN 61
           L+ VLII  +G   SQAT RT  F E S+ +K EQW A++ R Y++  E + R ++FK N
Sbjct: 7   LVTVLIILFTGFRISQATSRTVIFREQSMVDKHEQWMARFSREYRDELEKNMRRDVFKKN 66

Query: 62  LVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFK----MSDHSSSLKANGTPFL 117
           L  +E FN    GN+SY L +N+FAD T +EF+A  TG K    +S      K   +   
Sbjct: 67  LKFIENFNKK--GNKSYKLGVNEFADWTNEEFLAIHTGLKGLTEVSPSKVVAKTISSQTW 124

Query: 118 YKSSQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVD 170
             S  V  S +W  +GAVTPVKYQGQC       AVAAVEG+  I    LVSLSEQQL+D
Sbjct: 125 NVSDMVVESKDWRAEGAVTPVKYQGQCGCCWAFSAVAAVEGVAKIAGGNLVSLSEQQLLD 184

Query: 171 CATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYE 230
           C   + + GC GG M DAF Y++QN+GI ++  YSY+G S G C S      AA+I+ ++
Sbjct: 185 C-DREYDRGCDGGIMSDAFNYVVQNRGIASENDYSYQG-SDGGCRS--NARPAARISGFQ 240

Query: 231 DVPPNDEESLLKAVANQPVSVAIDASALQF--YSGGVFNGYCETFLNHGVTAVGYGTSEE 288
            VP N+E +LL+AV+ QPVSV++DA+   F  YSGGV++G C T  NH VT VGYGTS++
Sbjct: 241 TVPSNNERALLEAVSRQPVSVSMDATGDGFMHYSGGVYDGPCGTSSNHAVTFVGYGTSQD 300

Query: 289 GIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVS 333
           G KYWL KNSWG+ WGE GY R++RD+  PQG CG+A +A +PV+
Sbjct: 301 GTKYWLAKNSWGETWGEKGYIRIRRDVAWPQGMCGVAQYAFYPVA 345


>gi|388512155|gb|AFK44139.1| unknown [Medicago truncatula]
          Length = 340

 Score =  300 bits (768), Expect = 7e-79,   Method: Compositional matrix adjust.
 Identities = 159/321 (49%), Positives = 210/321 (65%), Gaps = 19/321 (5%)

Query: 23  RTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRL 82
           + ++  S+ E+ EQW ++YG+ YK++ E  KRF IFKDN+  +E FN  A  N+ Y L +
Sbjct: 29  KLYESPSLQERHEQWMSEYGKLYKDAIEKEKRFMIFKDNVEFIESFN--AADNKPYKLSV 86

Query: 83  NKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKS-SQVPPSVNWIEKGAVTPVKYQ 141
           N  ADLT  EF AS+ G+K  D   +     T F Y++ + +P +V+W  KGAVTP+K Q
Sbjct: 87  NHLADLTLDEFKASRNGYKKIDREFAT----TSFKYENVTAIPEAVDWRVKGAVTPIKDQ 142

Query: 142 GQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQ 194
           GQC        VAA+EGIN I   +L+SLSEQ+LVDC T   + GC GG M+D F++II+
Sbjct: 143 GQCGSCWAFSTVAAIEGINQITTGKLISLSEQELVDCDTKGEDQGCEGGLMEDGFEFIIK 202

Query: 195 NKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAID 254
           N GIT++  Y Y+  + G C S       A+IT YE VP N E SLLKAVANQP+SV+ID
Sbjct: 203 NGGITSETNYPYKA-ADGSC-SAATTAPVAKITGYEKVPVNSEISLLKAVANQPISVSID 260

Query: 255 A--SALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQ 312
           A  S+  FYS G++ G C T L+HGVTAVGYG S  G  YW++KNSWG  WGE GY R+Q
Sbjct: 261 ASDSSFMFYSSGIYTGECGTELDHGVTAVGYG-SANGTDYWIVKNSWGTVWGEKGYIRMQ 319

Query: 313 RDIDQPQGQCGIAMFASFPVS 333
           R I   +G CGIAM +S+P +
Sbjct: 320 RGIADKEGLCGIAMDSSYPTA 340


>gi|357167196|ref|XP_003581047.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like
           [Brachypodium distachyon]
          Length = 338

 Score =  300 bits (768), Expect = 7e-79,   Method: Compositional matrix adjust.
 Identities = 156/320 (48%), Positives = 200/320 (62%), Gaps = 18/320 (5%)

Query: 26  DEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKF 85
           D+  IA + EQW A+YGR Y + AE ++R E+FK N+  +E  N    GN  + L  N+F
Sbjct: 25  DDWLIAARHEQWMARYGRVYSDVAEKARRLEVFKANVGFIESVN---AGNHKFWLEANQF 81

Query: 86  ADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQV---PPSVNWIEKGAVTPVKYQG 142
           AD+T  EF A   G+KM    S  KA  T F Y +  +   P SV+W   GAVTPVK QG
Sbjct: 82  ADITKDEFRAMHKGYKMQVIGS--KARATGFRYANVSIDDLPASVDWRANGAVTPVKDQG 139

Query: 143 QCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQN 195
           QC        VA++EGI  +   +L+SLSEQ+LVDC     N GC GG MD+AF++I+ N
Sbjct: 140 QCGCCWAFSTVASMEGIVKVSTGKLISLSEQELVDCDVGMQNKGCGGGLMDNAFEFIVNN 199

Query: 196 KGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDA 255
            G+  +A Y Y G + G C+S K  + AA I  YEDVP NDE SL KAVA QPVS+A+D 
Sbjct: 200 GGLDTEADYPYTG-ADGTCNSNKESNIAASIKGYEDVPANDEASLQKAVAAQPVSIAVDG 258

Query: 256 S--ALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQR 313
                +FY GGV  G C T L+HGV AVGYG + +G KYWL+KNSWG  WGEDG+ RL+R
Sbjct: 259 GDDLFRFYKGGVLTGACGTELDHGVAAVGYGVAGDGTKYWLVKNSWGTSWGEDGFIRLER 318

Query: 314 DIDQPQGQCGIAMFASFPVS 333
           D+    G CG+AM  S+P +
Sbjct: 319 DVADEAGMCGLAMKPSYPTA 338


>gi|357474725|ref|XP_003607647.1| Cysteine proteinase [Medicago truncatula]
 gi|355508702|gb|AES89844.1| Cysteine proteinase [Medicago truncatula]
          Length = 340

 Score =  300 bits (768), Expect = 7e-79,   Method: Compositional matrix adjust.
 Identities = 158/321 (49%), Positives = 211/321 (65%), Gaps = 19/321 (5%)

Query: 23  RTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRL 82
           + ++  S+ E+ EQW ++YG+ YK++ E  KRF IFKDN+  +E FN  A  N+ Y L +
Sbjct: 29  KLYESPSLQERHEQWMSEYGKLYKDAIEKEKRFMIFKDNVEFIESFN--AADNKPYKLSV 86

Query: 83  NKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKS-SQVPPSVNWIEKGAVTPVKYQ 141
           N  ADLT  EF AS+ G+K  D   +     T F Y++ + +P +V+W  KGAVTP+K Q
Sbjct: 87  NHLADLTLDEFKASRNGYKKIDREFAT----TSFKYENVTAIPEAVDWRVKGAVTPIKDQ 142

Query: 142 GQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQ 194
           GQC        VAA+EGIN I   +L+SLSEQ+LVDC T   + GC GG M+D F++II+
Sbjct: 143 GQCGSCWAFSTVAAIEGINQITTGKLISLSEQELVDCDTKGEDQGCEGGLMEDGFEFIIK 202

Query: 195 NKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAID 254
           N GIT++  Y Y+  + G C++       A+IT YE VP N E SLLKAVANQP+SV+ID
Sbjct: 203 NGGITSETNYPYKA-ADGSCNTATTAP-VAKITGYEKVPVNSEISLLKAVANQPISVSID 260

Query: 255 A--SALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQ 312
           A  S+  FYS G++ G C T L+HGVTAVGYG S  G  YW++KNSWG  WGE GY R+Q
Sbjct: 261 ASDSSFMFYSSGIYTGECGTELDHGVTAVGYG-SANGTDYWIVKNSWGTVWGEKGYIRMQ 319

Query: 313 RDIDQPQGQCGIAMFASFPVS 333
           R I   +G CGIAM +S+P +
Sbjct: 320 RGIADKEGLCGIAMDSSYPTA 340


>gi|30141021|dbj|BAC75924.1| cysteine protease-2 [Helianthus annuus]
          Length = 362

 Score =  300 bits (767), Expect = 9e-79,   Method: Compositional matrix adjust.
 Identities = 161/333 (48%), Positives = 209/333 (62%), Gaps = 21/333 (6%)

Query: 27  EGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFA 86
           E ++ + +E+W+ +    +    E  +RF +FK N++ V   N     ++ Y L+LNKFA
Sbjct: 33  EDNLWDMYERWRHKVATNH---GEKLRRFNVFKSNVLHVHETNKM---DKPYKLKLNKFA 86

Query: 87  DLTPQEFIASQTGFKMSDHSSSL---KANGTPFLYKSSQ-VPPSVNWIEKGAVTPVKYQG 142
           D+T  EF +   G K+  H  SL   ++    F+Y + + VP SV+W +KGAV PVK QG
Sbjct: 87  DMTNHEFRSVYAGSKIHHHDRSLQGDRSGSKTFMYANVESVPTSVDWRKKGAVAPVKDQG 146

Query: 143 QCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQN 195
           QC        VAAVEGIN IK N LVSLSEQ+LVDC T +N  GC GG MD AF +I + 
Sbjct: 147 QCGSCWAFSTVAAVEGINKIKTNELVSLSEQELVDCDTLENQ-GCNGGLMDLAFDFIKKT 205

Query: 196 KGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDA 255
            G+T +  Y Y     G CDS K       I  +EDVP NDE+SL+KAVANQPV+VAIDA
Sbjct: 206 GGLTREDAYPY-AAEDGKCDSNKMNSPVVSIDGHEDVPKNDEQSLMKAVANQPVAVAIDA 264

Query: 256 --SALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQR 313
             S  QFYS GVF G C T L+HGV AVGYGT+ +G KYW+++NSWG +WGE GY R++R
Sbjct: 265 GSSDFQFYSEGVFTGKCGTQLDHGVAAVGYGTTLDGTKYWIVRNSWGSEWGEKGYIRMER 324

Query: 314 DIDQPQGQCGIAMFASFPVSKESAQPSSADKSS 346
            I   +G CGIAM AS+P+   S  P S+  SS
Sbjct: 325 GISDKRGLCGIAMEASYPIKNSSNNPKSSPTSS 357


>gi|224133760|ref|XP_002321654.1| predicted protein [Populus trichocarpa]
 gi|222868650|gb|EEF05781.1| predicted protein [Populus trichocarpa]
          Length = 362

 Score =  300 bits (767), Expect = 9e-79,   Method: Compositional matrix adjust.
 Identities = 161/332 (48%), Positives = 210/332 (63%), Gaps = 20/332 (6%)

Query: 27  EGSIAEKFEQWKAQYGRTYKESA-ENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKF 85
           E S+ + +E+W++ +  T   S  E  KRF +FK+N++ V + N      + Y L+LNKF
Sbjct: 33  EESLWDLYERWRSHH--TVSTSLDEKHKRFNVFKENVMHVHKTNKMG---KPYKLKLNKF 87

Query: 86  ADLTPQEFIASQTGFKMSDHS---SSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQG 142
           AD+T  EF +   G K+  H     + + NG+    K  +VP SV+W +KGAVT VK QG
Sbjct: 88  ADMTNHEFRSVYAGSKVKHHRMFRGTTRGNGSFMYGKVEKVPTSVDWRKKGAVTAVKDQG 147

Query: 143 QCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQN 195
           QC        + AVEGIN IK N LVSLSEQ+LVDC T +N  GC GG M+ AF++I + 
Sbjct: 148 QCGSCWAFSTIVAVEGINYIKTNELVSLSEQELVDCDTTENQ-GCNGGLMEYAFEFIKKK 206

Query: 196 KGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDA 255
           +GIT ++ Y Y+    G CD+ K  + A  I  YE VP NDE++LLKA ANQPVSVAIDA
Sbjct: 207 RGITTESTYPYKA-EDGHCDAAKENNPAVSIDGYEKVPENDEDALLKAAANQPVSVAIDA 265

Query: 256 --SALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQR 313
             S  QFYS GVF G C T L+HGV  VGYGT+ +G KYW+++NSWG +WGE GY R+QR
Sbjct: 266 GGSDFQFYSEGVFIGECGTELDHGVAVVGYGTTLDGTKYWIVRNSWGPEWGEKGYIRMQR 325

Query: 314 DIDQPQGQCGIAMFASFPVSKESAQPSSADKS 345
            I   +G CGIAM AS+P+   S  PS    S
Sbjct: 326 GISDKEGLCGIAMEASYPIKNSSTNPSGTKSS 357


>gi|242072392|ref|XP_002446132.1| hypothetical protein SORBIDRAFT_06g002150 [Sorghum bicolor]
 gi|241937315|gb|EES10460.1| hypothetical protein SORBIDRAFT_06g002150 [Sorghum bicolor]
          Length = 337

 Score =  299 bits (766), Expect = 1e-78,   Method: Compositional matrix adjust.
 Identities = 155/343 (45%), Positives = 218/343 (63%), Gaps = 22/343 (6%)

Query: 2   AKYFLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDN 61
           +K FL+ +L  +  C+S    R   + ++ E+ E W  +YGR YK++AE ++RFE FK N
Sbjct: 4   SKAFLLAILGCASLCSSVLAARELSDAAMVERHENWMVEYGRVYKDAAEKARRFEAFKHN 63

Query: 62  LVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKS- 120
           +  VE FN        + L +N+FADLT +EF A++ GFK     ++ K   T F Y++ 
Sbjct: 64  VAFVESFNTNK--KNKFWLGVNQFADLTTEEFKANK-GFK----PTAEKVPTTGFKYENL 116

Query: 121 --SQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDC 171
             S +P +V+W  KGAVTP+K QGQC       AVAA+EGI  +    L+SLSEQ+LVDC
Sbjct: 117 SVSALPTAVDWRTKGAVTPIKNQGQCGCCWAFSAVAAMEGIVKLSTGNLISLSEQELVDC 176

Query: 172 ATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYED 231
            T+  + GC GG+MD AF+++I+N G+  ++ Y Y+ +  G C        AA I  +ED
Sbjct: 177 DTHSMDEGCEGGWMDSAFEFVIKNGGLATESNYPYKAVD-GKCKG--GSKSAATIKGHED 233

Query: 232 VPPNDEESLLKAVANQPVSVAIDASALQF--YSGGVFNGYCETFLNHGVTAVGYGTSEEG 289
           VP N+E +L+KAVANQPVSVA+DAS   F  YSGGV  G C T L+HG+ A+GYG   +G
Sbjct: 234 VPVNNEAALMKAVANQPVSVAVDASDRTFMLYSGGVMTGSCGTELDHGIAAIGYGMESDG 293

Query: 290 IKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
            KYW++KNSWG  WGE G+ R+++DI   +G CG+AM  S+P 
Sbjct: 294 TKYWILKNSWGTTWGEKGFLRMEKDITDKRGMCGLAMKPSYPT 336


>gi|37780049|gb|AAP32197.1| cysteine protease 10 [Trifolium repens]
          Length = 272

 Score =  299 bits (766), Expect = 1e-78,   Method: Compositional matrix adjust.
 Identities = 153/274 (55%), Positives = 193/274 (70%), Gaps = 13/274 (4%)

Query: 70  NAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYK-SSQVPPSVN 128
           N+ + N+ Y L +NKFADLT +EF AS+  FK    SS ++   T F Y+ +S +P +V+
Sbjct: 2   NSNVNNKLYKLGINKFADLTNEEFKASRNKFKGHMCSSIIRT--TTFKYENASAIPSTVD 59

Query: 129 WIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCY 181
           W +KGAVTPVK QGQC       AVAA EGI+ +   +LVSLSEQ+L+DC T   + GC 
Sbjct: 60  WRKKGAVTPVKNQGQCGSCWAFSAVAATEGIHQLSTGKLVSLSEQELIDCDTKGVDQGCE 119

Query: 182 GGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLL 241
           GG MDDAFK+IIQN G++ +  Y YEG+  G C++ +A  HA  IT YEDVP N+E +L 
Sbjct: 120 GGLMDDAFKFIIQNHGLSTEVQYPYEGVD-GTCNTNEASIHAVTITGYEDVPANNELALQ 178

Query: 242 KAVANQPVSVAIDASA--LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSW 299
           KAVANQP+SVAIDAS    QFY+ GVF G C T L+HGVTAVGYG   +G KYWL+KNSW
Sbjct: 179 KAVANQPISVAIDASGSDFQFYNSGVFTGSCGTELDHGVTAVGYGVGNDGTKYWLVKNSW 238

Query: 300 GQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVS 333
           G DWGE+GY R+QR ID  +G CGIAM AS+P +
Sbjct: 239 GADWGEEGYIRMQRGIDAAEGLCGIAMQASYPTA 272


>gi|312281697|dbj|BAJ33714.1| unnamed protein product [Thellungiella halophila]
          Length = 347

 Score =  299 bits (766), Expect = 1e-78,   Method: Compositional matrix adjust.
 Identities = 161/343 (46%), Positives = 213/343 (62%), Gaps = 20/343 (5%)

Query: 5   FLIVVLIISGSCASQATYRTFDEGSIA--EKFEQWKAQYGRTYKESAENSKRFEIFKDNL 62
           FLIV LI S  C S    R  D+  +   ++ ++W A++GR Y +  E + R+ +FK N+
Sbjct: 9   FLIVSLI-SSFCLSITLSRPLDDNELIMQKRHDEWMAKHGRVYADMKEKNNRYVVFKRNV 67

Query: 63  VAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTP-FLYK-- 119
             +ER NN   G R++ L +N+FADLT  EF +  TG+K     SS     T  F Y+  
Sbjct: 68  ERIERLNNVPAG-RTFKLAVNQFADLTNDEFRSMYTGYKGGSVLSSQSGTKTSSFRYQNV 126

Query: 120 -SSQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDC 171
            S  +P SV+W +KGAVTP+K QG C       AVAA+EG   IK  +L+SLSEQQLVDC
Sbjct: 127 SSGALPVSVDWRKKGAVTPIKNQGTCGCCWAFSAVAAIEGATKIKKGKLISLSEQQLVDC 186

Query: 172 ATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYED 231
            TND   GC GG MD AF++I+   G+T ++ Y Y+G     C     +  A  IT YED
Sbjct: 187 DTNDF--GCSGGLMDTAFEHIMATGGLTTESNYPYKG-KDATCKIKNTKPTATSITGYED 243

Query: 232 VPPNDEESLLKAVANQPVSVAIDASAL--QFYSGGVFNGYCETFLNHGVTAVGYGTSEEG 289
           VP NDE++L+KAVA+QPVS+ I+      QFY  GVF G C T+L+H VTAVGYG S  G
Sbjct: 244 VPVNDEKALMKAVAHQPVSIGIEGGGFDFQFYGSGVFTGECTTYLDHAVTAVGYGQSSNG 303

Query: 290 IKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
            KYW+IKNSWG  WGE GY R+++D+   +G CG+AM AS+P 
Sbjct: 304 SKYWIIKNSWGTKWGESGYMRIKKDVKDKKGLCGLAMKASYPT 346


>gi|225456820|ref|XP_002278323.1| PREDICTED: vignain [Vitis vinifera]
          Length = 360

 Score =  299 bits (766), Expect = 1e-78,   Method: Compositional matrix adjust.
 Identities = 161/331 (48%), Positives = 207/331 (62%), Gaps = 18/331 (5%)

Query: 27  EGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFA 86
           E S+   +E+W++ +    +   E  KRF +FK+N+  V  FN     +  Y L+LNKFA
Sbjct: 31  EESLWNLYERWRSHH-TVSRSLDEKHKRFNVFKENVNFVHEFNKK---DEPYKLKLNKFA 86

Query: 87  DLTPQEFIASQTGFKMSDHS---SSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQ 143
           D+T  EF ++  G K++ H     S  A G+    K   VPPSV+W +KGAVTP+K QGQ
Sbjct: 87  DMTNHEFRSTYAGSKVNHHRMFRGSQHAAGSFMYEKVKSVPPSVDWRKKGAVTPIKDQGQ 146

Query: 144 CA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNK 196
           C        V AVEGIN IK N+LVSLSEQ+LVDC T++N  GC GG M  AF++I +  
Sbjct: 147 CGSCWAFSTVVAVEGINHIKTNKLVSLSEQELVDCDTSENQ-GCNGGLMGYAFEFIKEKG 205

Query: 197 GITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDA- 255
           GIT +  Y Y     G CD  K       I  +E VPPN+E++LLKA ANQP+SVAIDA 
Sbjct: 206 GITTEQSYPYTA-EDGTCDVSKVNSPVVSIDGHETVPPNNEDALLKAAANQPISVAIDAG 264

Query: 256 -SALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRD 314
            SA QFYS GVF G C T L+HGV  VGYGT+ +G KYW++KNSWG DWGE+GY R++R 
Sbjct: 265 GSAFQFYSEGVFAGRCGTDLDHGVAIVGYGTTLDGTKYWIVKNSWGTDWGENGYIRMKRG 324

Query: 315 IDQPQGQCGIAMFASFPVSKESAQPSSADKS 345
           I   +G CGIA+ AS+P+   S  P  A  S
Sbjct: 325 ISAKEGLCGIAVEASYPIKNSSTNPVGAPSS 355


>gi|172052260|gb|ACB70409.1| cysteine protease [Nicotiana tabacum]
          Length = 361

 Score =  298 bits (764), Expect = 2e-78,   Method: Compositional matrix adjust.
 Identities = 163/325 (50%), Positives = 203/325 (62%), Gaps = 22/325 (6%)

Query: 32  EKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQ 91
           E +E+W++ +    +   E  KRF +FK N+  V  FN     ++ Y L+LNKFAD+T  
Sbjct: 36  ELYERWRSHH-TVSRSLDEKDKRFNVFKANVHYVHNFNKK---DKPYKLKLNKFADMTNH 91

Query: 92  EFIASQTGFKMSDHSSSL---KANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQCA--- 145
           EF     G K+  H + L   +ANGT        VPP+V+W +KGAVTPVK QG+C    
Sbjct: 92  EFRHHYAGSKIKHHRTFLGASRANGTFMYAHEDSVPPTVDWRKKGAVTPVKDQGKCGSCW 151

Query: 146 ----VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITND 201
               V AVEGIN IK N LVSLSEQ+LVDC T+  N GC GG MD AF++I +  GI  +
Sbjct: 152 AFSTVVAVEGINQIKTNELVSLSEQELVDCDTS-QNQGCNGGLMDMAFEFIKKKGGINTE 210

Query: 202 AVYSY--EGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA-- 257
             Y Y  EG   G CD  K       I  +EDVPPNDE SLLKAVANQPVSVAI AS   
Sbjct: 211 ENYPYMAEG---GECDIQKRNSPVVSIDGHEDVPPNDEGSLLKAVANQPVSVAIQASGSD 267

Query: 258 LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQ 317
            QFYS GVF G C T L+HGV  VGYGT+ +  KYW++KNSWG +WGE GY R+QR+ID 
Sbjct: 268 FQFYSEGVFTGDCGTELDHGVAIVGYGTTLDRTKYWIVKNSWGPEWGEKGYIRMQREIDA 327

Query: 318 PQGQCGIAMFASFPVSKESAQPSSA 342
            +G CGIAM  S+P+   S+ P+ +
Sbjct: 328 EEGLCGIAMQPSYPIKTSSSNPTGS 352


>gi|310656789|gb|ADP02218.1| Peptidase_C1 domain-containing protein [Triticum aestivum]
          Length = 341

 Score =  298 bits (763), Expect = 3e-78,   Method: Compositional matrix adjust.
 Identities = 157/347 (45%), Positives = 212/347 (61%), Gaps = 24/347 (6%)

Query: 1   MAKYFLIVVLIISGSCASQA-TYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFK 59
           + K  L+ ++     C+S   + R   + ++ E+ EQW A++ R YK+  E ++RFE+FK
Sbjct: 3   IPKALLLAIVGCICLCSSAVLSARELGDTAMVERHEQWMAKFNRVYKDGTEKAQRFEVFK 62

Query: 60  DNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQT--GFKMSDHSSSLKANGTPFL 117
            N+  +E FN     NR + L +N+F DLT  EF A++T  G KMS   +      T F 
Sbjct: 63  ANVAFIESFNAE---NRKFWLGVNQFTDLTNDEFRATKTNKGLKMSGGRAP-----TGFK 114

Query: 118 YKSSQV---PPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQ 167
           Y +  +   P +V+W  KG VTP+K QGQC       AV A EGI  +   +L+SLSEQ+
Sbjct: 115 YSNVSIDALPTAVDWRTKGVVTPIKDQGQCGCCWAFSAVVATEGIVKLSTGKLISLSEQE 174

Query: 168 LVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQIT 227
           LVDC  +  + GC GG MDDAFK+II+N G+T +A Y Y     G C +  A +  A I 
Sbjct: 175 LVDCDVHGVDQGCEGGEMDDAFKFIIKNGGLTTEANYPYTAQD-GQCKTSIASNSVATIK 233

Query: 228 NYEDVPPNDEESLLKAVANQPVSVAIDASAL--QFYSGGVFNGYCETFLNHGVTAVGYGT 285
            YEDVP NDE SL+KAVANQPVSVA+D   +  Q YSGGV  G C T L+HG+ A+GYG 
Sbjct: 234 GYEDVPANDESSLMKAVANQPVSVAVDGGDVIFQHYSGGVMTGSCGTDLDHGIAAIGYGM 293

Query: 286 SEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
           + +G KYWL+KNSWG  WGE GY R+++DI    G CG+AM  S+P 
Sbjct: 294 TSDGTKYWLLKNSWGTTWGESGYLRMEKDISDKSGMCGLAMQPSYPT 340


>gi|297826875|ref|XP_002881320.1| hypothetical protein ARALYDRAFT_321132 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297327159|gb|EFH57579.1| hypothetical protein ARALYDRAFT_321132 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 341

 Score =  298 bits (762), Expect = 3e-78,   Method: Compositional matrix adjust.
 Identities = 165/338 (48%), Positives = 217/338 (64%), Gaps = 17/338 (5%)

Query: 7   IVVLIISGSCASQATYRT--FDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVA 64
           I  ++ +    SQAT RT  F E S  EK EQW A++ R Y++  E   R ++FK NL  
Sbjct: 10  IFTILFTTFSISQATSRTVTFHEPSSLEKHEQWMARFSRVYRDELEKQMRRDVFKKNLKF 69

Query: 65  VERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVP 124
           +E FN    GN+SY L +N+FAD T +EF+A  TG K        +   +     S  V 
Sbjct: 70  IENFNKK--GNKSYKLGVNEFADWTNEEFLAIHTGLKGLSSKVVDETISSRSWNISDMVG 127

Query: 125 PSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNN 177
            S +W  +GAVTPVKYQGQC       AVAAVEG+  I    LVSLSEQQL+DC   + +
Sbjct: 128 VSKDWRAEGAVTPVKYQGQCGCCWAFSAVAAVEGVTKIAGGNLVSLSEQQLLDC-DREYD 186

Query: 178 NGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDE 237
            GC GG M DAF YIIQN+GI ++  YSY+G S G C S  +   AA+I+ ++ VP N+E
Sbjct: 187 RGCDGGIMSDAFNYIIQNRGIASENDYSYQG-SDGRCRS--SARPAARISGFQTVPSNNE 243

Query: 238 ESLLKAVANQPVSVAIDASALQF--YSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLI 295
           ++LL+AV+ QPVSV++DA+   F  YSGGV++G C T  NH VT VGYGTS++G KYWL 
Sbjct: 244 QALLEAVSRQPVSVSMDANGDGFMHYSGGVYDGPCGTSSNHAVTFVGYGTSQDGTKYWLA 303

Query: 296 KNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVS 333
           KNSWG+ WGE GY R++RD+  PQG CG+A +A +PV+
Sbjct: 304 KNSWGETWGEKGYIRIRRDVAWPQGMCGVAQYAFYPVA 341


>gi|356515046|ref|XP_003526212.1| PREDICTED: thiol protease SEN102-like [Glycine max]
          Length = 342

 Score =  298 bits (762), Expect = 3e-78,   Method: Compositional matrix adjust.
 Identities = 161/344 (46%), Positives = 220/344 (63%), Gaps = 21/344 (6%)

Query: 3   KYFLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNL 62
           K  ++ + +      SQ   R   + ++ E+ E W A+YG+ YK++AE  KRF+IFKDN+
Sbjct: 7   KQHMLALFLFLAVGISQVMPRKLHQTALRERHENWMAEYGKMYKDAAEKEKRFQIFKDNV 66

Query: 63  VAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDH--SSSLKANGTPFLYKS 120
             +E FN  A GN+ Y L +N  ADLT +EF  S+ G K +    +++ K NG  F Y++
Sbjct: 67  EFIESFN--AAGNKPYKLGVNHLADLTLEEFKDSRNGLKRTYEFSTTTFKLNG--FKYEN 122

Query: 121 -SQVPPSVNWIEKGAVTPVKYQG-QCA-------VAAVEGINAIKINRLVSLSEQQLVDC 171
            + +P +++W  KGAVTP+K QG QC        +AA EGI+ I    LVSLSEQ+LVDC
Sbjct: 123 VTDIPEAIDWRVKGAVTPIKDQGDQCGRFWAFSTIAATEGIHQISTGNLVSLSEQELVDC 182

Query: 172 ATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYED 231
            + D+  GC GGFM+D F++II+N GIT++  Y Y+G+  G C++  A    AQI  YE 
Sbjct: 183 DSVDD--GCEGGFMEDGFEFIIKNGGITSETNYPYKGVD-GTCNTTIAASPVAQIKGYEI 239

Query: 232 VPPNDEESLLKAVANQPVSVAIDAS--ALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEG 289
           VP   EE+L KAVANQPVSV+I A+     FYS G++NG C T L+HGVTAVGYGT E G
Sbjct: 240 VPSYSEEALKKAVANQPVSVSIHATNATFMFYSSGIYNGECGTDLDHGVTAVGYGT-ENG 298

Query: 290 IKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVS 333
             YW++KNSWG  WGE GY R+ R I    G CGIA+ +S+P +
Sbjct: 299 TDYWIVKNSWGTQWGEKGYIRMHRGIAAKHGICGIALDSSYPTA 342


>gi|357458911|ref|XP_003599736.1| Cysteine proteinase [Medicago truncatula]
 gi|357474719|ref|XP_003607644.1| Cysteine proteinase [Medicago truncatula]
 gi|355488784|gb|AES69987.1| Cysteine proteinase [Medicago truncatula]
 gi|355508699|gb|AES89841.1| Cysteine proteinase [Medicago truncatula]
          Length = 340

 Score =  298 bits (762), Expect = 4e-78,   Method: Compositional matrix adjust.
 Identities = 157/321 (48%), Positives = 210/321 (65%), Gaps = 19/321 (5%)

Query: 23  RTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRL 82
           + ++  S+ E+ EQW  ++G+ Y+++ E  KRF IFKDN+  +E FN  A  N+ Y L +
Sbjct: 29  KLYESLSLQERHEQWMTEHGKVYEDAIEKEKRFMIFKDNVEFIESFN--AADNQPYKLSV 86

Query: 83  NKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKS-SQVPPSVNWIEKGAVTPVKYQ 141
           N  ADLT  EF AS+ G+K  D   +     T F Y++ + +P +V+W  KGAVTP+K Q
Sbjct: 87  NHLADLTLDEFKASRNGYKKIDREFTT----TSFKYENVTAIPAAVDWRVKGAVTPIKDQ 142

Query: 142 GQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQ 194
           GQC        VAA EGIN I   +LVSLSEQ+LVDC T   + GC GG M+D F++II+
Sbjct: 143 GQCGSCWAFSTVAATEGINQITTGKLVSLSEQELVDCDTKGEDQGCEGGLMEDGFEFIIK 202

Query: 195 NKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAID 254
           N GIT++  Y Y+  + G C++       A+IT YE VP N E+SLLKAVANQP+SV+ID
Sbjct: 203 NGGITSETNYPYKA-ADGSCNTATTTP-VAKITGYEKVPVNSEKSLLKAVANQPISVSID 260

Query: 255 A--SALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQ 312
           A  S+  FYS G++ G C T L+HGVTAVGYG S  G  YW++KNSWG  WGE GY R+Q
Sbjct: 261 ASDSSFMFYSSGIYTGECGTELDHGVTAVGYG-SANGTDYWIVKNSWGTVWGEKGYIRMQ 319

Query: 313 RDIDQPQGQCGIAMFASFPVS 333
           R I   +G CGIAM +S+P +
Sbjct: 320 RGIAAKEGLCGIAMDSSYPTA 340


>gi|356515040|ref|XP_003526209.1| PREDICTED: thiol protease SEN102-like [Glycine max]
          Length = 342

 Score =  297 bits (761), Expect = 5e-78,   Method: Compositional matrix adjust.
 Identities = 161/344 (46%), Positives = 220/344 (63%), Gaps = 21/344 (6%)

Query: 3   KYFLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNL 62
           K  ++ + +      SQ   R   + ++ E+ E W A+YG+ YK++AE  KRF+IFKDN+
Sbjct: 7   KQHMLALFLFLAVGISQVMPRKLHQTALRERHENWMAEYGKMYKDAAEKEKRFQIFKDNV 66

Query: 63  VAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDH--SSSLKANGTPFLYKS 120
             +E FN  A GN+ Y L +N  ADLT +EF  S+ G K +    +++ K NG  F Y++
Sbjct: 67  EFIESFN--AAGNKPYKLGVNHLADLTLEEFKDSRNGLKRTYEFSTTTFKLNG--FKYEN 122

Query: 121 -SQVPPSVNWIEKGAVTPVKYQG-QCA-------VAAVEGINAIKINRLVSLSEQQLVDC 171
            + +P +++W  KGAVTP+K QG QC        +AA EGI+ I    LVSLSEQ+LVDC
Sbjct: 123 VTDIPEAIDWRVKGAVTPIKDQGDQCGSCWAFSTIAATEGIHQISTGNLVSLSEQELVDC 182

Query: 172 ATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYED 231
            + D+  GC GGFM+D F++II+N GIT++  Y Y+G+  G C++  A    AQI  YE 
Sbjct: 183 DSVDD--GCEGGFMEDGFEFIIKNGGITSETNYPYKGVD-GTCNTTIAASPVAQIKGYEI 239

Query: 232 VPPNDEESLLKAVANQPVSVAIDAS--ALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEG 289
           VP   EE+L KAVANQPVSV+I A+     FYS G++NG C T L+HGVTAVGYGT E G
Sbjct: 240 VPSYSEEALQKAVANQPVSVSIHATNATFMFYSSGIYNGECGTDLDHGVTAVGYGT-ENG 298

Query: 290 IKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVS 333
             YW++KNSWG  WGE GY R+ R I    G CGIA+ +S+P +
Sbjct: 299 TDYWIVKNSWGTQWGEKGYIRMHRGIAAKHGICGIALDSSYPTA 342


>gi|116309130|emb|CAH66233.1| H0825G02.10 [Oryza sativa Indica Group]
          Length = 339

 Score =  297 bits (761), Expect = 5e-78,   Method: Compositional matrix adjust.
 Identities = 158/348 (45%), Positives = 219/348 (62%), Gaps = 26/348 (7%)

Query: 1   MAKYFLIVVLIISGSC-ASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFK 59
           + K  L  +L     C A  A     D  ++  + E+W  QYGR YK++ E ++RFEIFK
Sbjct: 3   IPKALLFAILSCLCLCSAVLAAREQSDHAAMVARHERWMEQYGRVYKDATEKARRFEIFK 62

Query: 60  DNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQT--GFKMSDHSSSLKANGTPFL 117
            N+  +E FN    GN  + L +N+FADLT  EF A++T  GF      S+++   T F 
Sbjct: 63  ANVAFIESFN---AGNHKFWLSVNQFADLTNYEFRATKTNKGFI----PSTVRVP-TTFR 114

Query: 118 YKSSQV---PPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQ 167
           Y++  +   P +V+W  KGAVTP+K QGQC       AVAA+EGI  +   +L+SLSEQ+
Sbjct: 115 YENVSIDTLPATVDWRTKGAVTPIKDQGQCGCCWAFSAVAAMEGIVKLSTGKLISLSEQE 174

Query: 168 LVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQIT 227
           LVDC  +  + GC GG MDDAFK+II+N G+T ++ Y Y   + G C+     + AA I 
Sbjct: 175 LVDCDVHGEDQGCEGGLMDDAFKFIIKNGGLTTESKYPYTA-ADGKCNG--GSNSAATIK 231

Query: 228 NYEDVPPNDEESLLKAVANQPVSVAIDASAL--QFYSGGVFNGYCETFLNHGVTAVGYGT 285
            YEDVP N+E +L+KAVANQPVSVA+D   +  QFYSGGV  G C T L+HG+ A+GYG 
Sbjct: 232 GYEDVPANNEAALMKAVANQPVSVAVDGGDMTFQFYSGGVMTGSCGTDLDHGIVAIGYGK 291

Query: 286 SEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVS 333
             +G +YWL+KNSWG  WGE+G+ R+++DI   +G CG+AM  S+P +
Sbjct: 292 DGDGTQYWLLKNSWGTTWGENGFLRMEKDISDKRGMCGLAMEPSYPTA 339


>gi|18401420|ref|NP_565649.1| cysteine proteinase-like protein [Arabidopsis thaliana]
 gi|4314384|gb|AAD15594.1| cysteine proteinase [Arabidopsis thaliana]
 gi|17381154|gb|AAL36389.1| putative cysteine proteinase [Arabidopsis thaliana]
 gi|20465849|gb|AAM20029.1| putative cysteine proteinase [Arabidopsis thaliana]
 gi|330252901|gb|AEC07995.1| cysteine proteinase-like protein [Arabidopsis thaliana]
          Length = 348

 Score =  297 bits (760), Expect = 6e-78,   Method: Compositional matrix adjust.
 Identities = 165/352 (46%), Positives = 219/352 (62%), Gaps = 23/352 (6%)

Query: 1   MAKYFLIVVLIISGSCASQATYR-TFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFK 59
           MA   + ++ I      S AT R +  E S  EK EQW A++ R Y +  E   RF IFK
Sbjct: 1   MASTIIFILTIFLSYRTSLATSRGSLFEASAIEKHEQWMARFNRVYSDETEKRNRFNIFK 60

Query: 60  DNLVAVERFNNAAIGNR-SYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKA-----NG 113
            NL  V+ FN   + N+ +Y + +N+F+DLT +EF A+ TG  + +  + +       N 
Sbjct: 61  KNLEFVQNFN---MNNKITYKVDINEFSDLTDEEFRATHTGLVVPEAITRISTLSSGKNT 117

Query: 114 TPFLYKS-SQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSE 165
            PF Y + S    S++W ++GAVTPVKYQG+C       AVAAVEGI  I    LVSLSE
Sbjct: 118 VPFRYGNVSDNGESMDWRQEGAVTPVKYQGRCGGCWAFSAVAAVEGITKITKGELVSLSE 177

Query: 166 QQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAED--HA 223
           QQL+DC   D N GC GG M  AF+YII+N+GIT +  Y Y+        S        A
Sbjct: 178 QQLLDC-DRDYNQGCRGGIMSKAFEYIIKNQGITTEDNYPYQESQQTCSSSTTLSSSFRA 236

Query: 224 AQITNYEDVPPNDEESLLKAVANQPVSVAIDAS--ALQFYSGGVFNGYCETFLNHGVTAV 281
           A I+ YE VP N+EE+LL+AV+ QPVSV I+ +  A + YSGGVFNG C T L+H VT V
Sbjct: 237 ATISGYETVPMNNEEALLQAVSQQPVSVGIEGTGAAFRHYSGGVFNGECGTDLHHAVTIV 296

Query: 282 GYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVS 333
           GYG SEEG KYW++KNSWG+ WGE+GY R++RD+D PQG CG+A+ A +P++
Sbjct: 297 GYGMSEEGTKYWVVKNSWGETWGENGYMRIKRDVDAPQGMCGLAILAFYPLA 348


>gi|38346003|emb|CAD40112.2| OSJNBa0035O13.5 [Oryza sativa Japonica Group]
 gi|125589427|gb|EAZ29777.1| hypothetical protein OsJ_13835 [Oryza sativa Japonica Group]
          Length = 339

 Score =  297 bits (760), Expect = 6e-78,   Method: Compositional matrix adjust.
 Identities = 158/348 (45%), Positives = 219/348 (62%), Gaps = 26/348 (7%)

Query: 1   MAKYFLIVVLIISGSC-ASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFK 59
           + K  L  +L     C A  A     D  ++  + E+W  QYGR YK++ E ++RFEIFK
Sbjct: 3   IPKALLFAILSCLCLCSAVLAAREQSDHAAMVARHERWMEQYGRVYKDATEKARRFEIFK 62

Query: 60  DNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQT--GFKMSDHSSSLKANGTPFL 117
            N+  +E FN    GN  + L +N+FADLT  EF A++T  GF      S+++   T F 
Sbjct: 63  ANVAFIESFN---AGNHKFWLGVNQFADLTNYEFRATKTNKGFI----PSTVRVP-TTFR 114

Query: 118 YKSSQV---PPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQ 167
           Y++  +   P +V+W  KGAVTP+K QGQC       AVAA+EGI  +   +L+SLSEQ+
Sbjct: 115 YENVSIDTLPATVDWRTKGAVTPIKDQGQCGCCWAFSAVAAMEGIVKLSTGKLISLSEQE 174

Query: 168 LVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQIT 227
           LVDC  +  + GC GG MDDAFK+II+N G+T ++ Y Y   + G C+     + AA I 
Sbjct: 175 LVDCDVHGEDQGCEGGLMDDAFKFIIKNGGLTTESKYPYTA-ADGKCNG--GSNSAATIK 231

Query: 228 NYEDVPPNDEESLLKAVANQPVSVAIDASAL--QFYSGGVFNGYCETFLNHGVTAVGYGT 285
            YEDVP N+E +L+KAVANQPVSVA+D   +  QFYSGGV  G C T L+HG+ A+GYG 
Sbjct: 232 GYEDVPANNEAALMKAVANQPVSVAVDGGDMTFQFYSGGVMTGSCGTDLDHGIVAIGYGK 291

Query: 286 SEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVS 333
             +G +YWL+KNSWG  WGE+G+ R+++DI   +G CG+AM  S+P +
Sbjct: 292 DGDGTQYWLLKNSWGTTWGENGFLRMEKDISDKRGMCGLAMEPSYPTA 339


>gi|357160572|ref|XP_003578808.1| PREDICTED: vignain-like [Brachypodium distachyon]
          Length = 339

 Score =  296 bits (759), Expect = 7e-78,   Method: Compositional matrix adjust.
 Identities = 154/346 (44%), Positives = 220/346 (63%), Gaps = 22/346 (6%)

Query: 1   MAKYFLIVVLIISGSCASQATYRTF-DEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFK 59
           + K  L+ +L     C S    R   D+ S+  + E W  QYGR YK++AE +++FE+FK
Sbjct: 3   IPKASLLAILGCLCLCGSVLAARELNDDLSMVARHENWMLQYGRVYKDAAEKAQKFEVFK 62

Query: 60  DNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYK 119
            N   +  FN    GN  + L +N+FAD+T +EF A++T      +   +    T F+Y+
Sbjct: 63  ANAEFINSFN---AGNHKFWLGINQFADITNEEFKATKTNKGFISNKVRVP---TGFMYE 116

Query: 120 S---SQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLV 169
           +     +P +++W  KGAVTP+K QGQC       AVAA+EGI  +   +LVSLSEQ+LV
Sbjct: 117 NMSFDALPATIDWRTKGAVTPIKDQGQCGCCWAFSAVAAMEGIVKLSTGKLVSLSEQELV 176

Query: 170 DCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNY 229
           DC  +  + GC GG MDDAFK+II+N G+T ++ Y Y+  + G C S      AA I +Y
Sbjct: 177 DCDVHGEDQGCEGGLMDDAFKFIIKNGGLTQESNYPYDA-ADGKCKS--GSSSAATIKSY 233

Query: 230 EDVPPNDEESLLKAVANQPVSVAIDASAL--QFYSGGVFNGYCETFLNHGVTAVGYGTSE 287
           EDVP N+E +L+KAVANQPVSVA+D   +  QFYSGGV  G C T L+HG+ A+GYGT+ 
Sbjct: 234 EDVPANNEGALMKAVANQPVSVAVDGGDMTFQFYSGGVMTGSCGTDLDHGIAAIGYGTTS 293

Query: 288 EGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVS 333
           +G K+W++KNSWG  WGE+G+ R+++DI   +G CG+AM  S+P +
Sbjct: 294 DGTKFWIMKNSWGTSWGENGFLRMEKDIADKKGMCGLAMEPSYPTA 339


>gi|357160599|ref|XP_003578815.1| PREDICTED: vignain-like [Brachypodium distachyon]
          Length = 339

 Score =  296 bits (759), Expect = 8e-78,   Method: Compositional matrix adjust.
 Identities = 156/346 (45%), Positives = 219/346 (63%), Gaps = 22/346 (6%)

Query: 1   MAKYFLIVVLIISGSCASQATYRTF-DEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFK 59
           + K  ++ +L     C+S    R   D+ S+A + E W AQYGR YK++AE +++FE+FK
Sbjct: 3   IPKASILAILGCLCFCSSVLAARELNDDLSMAARHETWMAQYGRVYKDAAEKAQKFEVFK 62

Query: 60  DNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYK 119
            N   ++ FN     N  + L +N+FADLT +EF A++T      + + +    T F Y+
Sbjct: 63  ANARFIDSFNAE---NHKFWLGINQFADLTNEEFKATKTNKGFISNKARVS---TGFKYE 116

Query: 120 SSQV---PPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLV 169
           + ++   P S++W  KGAVTPVK QGQC       AVAA EGI  +   +LVSLSEQ+LV
Sbjct: 117 NLKIEALPTSIDWRTKGAVTPVKDQGQCGCCWAFSAVAATEGIVKLSTGKLVSLSEQELV 176

Query: 170 DCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNY 229
           DC  +  + GC GG MDDAFK+II N G+T ++ Y Y+    G C S      A  I +Y
Sbjct: 177 DCDVHGEDQGCEGGLMDDAFKFIITNGGLTQESSYPYDA-EDGKCKS--GSKSAGTIKSY 233

Query: 230 EDVPPNDEESLLKAVANQPVSVAIDASAL--QFYSGGVFNGYCETFLNHGVTAVGYGTSE 287
           EDVP N+E +L+KAVANQPVSVA+D   +  QFYSGGV  G C T L+HG+ A+GYG + 
Sbjct: 234 EDVPANNEGALMKAVANQPVSVAVDGGDMTFQFYSGGVMTGSCGTDLDHGIAAIGYGVTS 293

Query: 288 EGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVS 333
           +G K+WL+KNSWG  WGE+G+ R+++DI   +G CG+AM  S+P +
Sbjct: 294 DGTKFWLMKNSWGTTWGENGFLRMEKDIADKKGMCGLAMEPSYPTA 339


>gi|255646088|gb|ACU23531.1| unknown [Glycine max]
          Length = 362

 Score =  296 bits (759), Expect = 8e-78,   Method: Compositional matrix adjust.
 Identities = 159/326 (48%), Positives = 203/326 (62%), Gaps = 18/326 (5%)

Query: 27  EGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFA 86
           E S  + +E+W++ Y    +   +  KRF +FK N++ V   N     ++ Y L+LNKFA
Sbjct: 33  EESFWDLYERWRS-YRTVSRSLGDKHKRFNVFKANVMHVHNTNKM---DKPYKLKLNKFA 88

Query: 87  DLTPQEFIASQTGFKMSDHS---SSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQ 143
           D+T  EF ++  G K++ H     + + NGT    K   VPPS +W + GAVT VK QGQ
Sbjct: 89  DMTNHEFRSTYAGSKVNHHRMFQGTPRGNGTFMYEKVGSVPPSADWRKNGAVTGVKDQGQ 148

Query: 144 CA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNK 196
           C        V AVEGIN IK N+LVSLSEQ+LVDC T   N GC GG M+ AF++I Q  
Sbjct: 149 CGSCWAFSTVVAVEGINQIKTNKLVSLSEQELVDCDTK-KNAGCNGGLMESAFEFIKQKG 207

Query: 197 GITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDAS 256
           GIT ++ Y Y     G CD+ KA D A  I  +E+VP NDE +LLKAVANQPVSVAIDA 
Sbjct: 208 GITTESNYPYTAQD-GTCDASKANDLAVSIDGHENVPANDENALLKAVANQPVSVAIDAG 266

Query: 257 AL--QFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRD 314
               QFY  GVF G C T LNHGV  VGYGT+ +G  YW ++NSWG +WGE GY R+QR 
Sbjct: 267 GFDFQFYFEGVFTGDCSTELNHGVAIVGYGTTVDGTNYWTVRNSWGPEWGEQGYIRMQRS 326

Query: 315 IDQPQGQCGIAMFASFPVSKESAQPS 340
           I + +G CGIAM AS+P+   S  P+
Sbjct: 327 IFKKEGLCGIAMMASYPIKNSSNNPT 352


>gi|5823018|gb|AAD53011.1|AF089848_1 senescence-specific cysteine protease [Brassica napus]
          Length = 346

 Score =  296 bits (758), Expect = 9e-78,   Method: Compositional matrix adjust.
 Identities = 165/345 (47%), Positives = 215/345 (62%), Gaps = 21/345 (6%)

Query: 3   KYFLIVVLIISGSCASQATYRTFDEGSIAEK-FEQWKAQYGRTYKESAENSKRFEIFKDN 61
           K FLIV L+ S  C S    R  D+  I +K  ++W A++GRTY +  E + R+ +FK N
Sbjct: 7   KIFLIVSLV-SSFCFSTTLSRLLDDELIMQKKHDEWMAEHGRTYADMNEKNNRYVVFKRN 65

Query: 62  LVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHS--SSLKANGTPFLYK 119
           +  +ER NN   G R++ L +N+FADLT  EF    TG+K  D    S  +   T F Y+
Sbjct: 66  VERIERLNNVPAG-RTFKLAVNQFADLTNDEFRFMYTGYK-GDFVLFSQSQTKSTSFRYQ 123

Query: 120 S---SQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLV 169
           +     +P +V+W +KGAVTP+K QG C       AVAA+EG   IK  +L+SLSEQQLV
Sbjct: 124 NVFFGALPIAVDWRKKGAVTPIKNQGSCGCCWAFSAVAAIEGATQIKKGKLISLSEQQLV 183

Query: 170 DCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNY 229
           DC TND   GC GG MD AF++I+   G+T ++ Y Y+G     C     +  AA IT Y
Sbjct: 184 DCDTNDF--GCSGGLMDTAFEHIMATGGLTTESNYPYKGEDAN-CKIKSTKPSAASITGY 240

Query: 230 EDVPPNDEESLLKAVANQPVSVAIDASAL--QFYSGGVFNGYCETFLNHGVTAVGYGTSE 287
           EDVP NDE +L+KAVA+QPVSV I+      QFYS GVF G C T+L+H VTAVGY  S 
Sbjct: 241 EDVPVNDENALMKAVAHQPVSVGIEGGGFDFQFYSSGVFTGECTTYLDHAVTAVGYSQSS 300

Query: 288 EGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
            G KYW+IKNSWG  WGE GY R+++DI   +G CG+AM AS+P 
Sbjct: 301 AGSKYWIIKNSWGTKWGEGGYMRIKKDIKDKEGLCGLAMKASYPT 345


>gi|18402225|ref|NP_566633.1| Granulin repeat cysteine protease family protein [Arabidopsis
           thaliana]
 gi|11994461|dbj|BAB02463.1| cysteine proteinase [Arabidopsis thaliana]
 gi|17065298|gb|AAL32803.1| cysteine proteinase [Arabidopsis thaliana]
 gi|20260004|gb|AAM13349.1| cysteine proteinase [Arabidopsis thaliana]
 gi|332642713|gb|AEE76234.1| Granulin repeat cysteine protease family protein [Arabidopsis
           thaliana]
          Length = 452

 Score =  296 bits (758), Expect = 1e-77,   Method: Compositional matrix adjust.
 Identities = 162/339 (47%), Positives = 210/339 (61%), Gaps = 17/339 (5%)

Query: 5   FLIVVLIISGSCAS-QATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLV 63
            +  VL+IS S  S  AT  T +E      +E+W  +  + Y    E  +RFEIFKDNL 
Sbjct: 13  LIFSVLLISLSLGSVTATETTRNEAEARRMYERWLVENRKNYNGLGEKERRFEIFKDNLK 72

Query: 64  AVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYK-SSQ 122
            VE   +++I NR+Y + L +FADLT  EF A     KM    + +   G  +LYK    
Sbjct: 73  FVEE--HSSIPNRTYEVGLTRFADLTNDEFRAIYLRSKM--ERTRVPVKGEKYLYKVGDS 128

Query: 123 VPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATND 175
           +P +++W  KGAV PVK QG C       A+ AVEGIN IK   L+SLSEQ+LVDC T+ 
Sbjct: 129 LPDAIDWRAKGAVNPVKDQGSCGSCWAFSAIGAVEGINQIKTGELISLSEQELVDCDTS- 187

Query: 176 NNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPN 235
            N+GC GG MD AFK+II+N GI  +  Y Y      +C+S K       I  YEDVP N
Sbjct: 188 YNDGCGGGLMDYAFKFIIENGGIDTEEDYPYIATDVNVCNSDKKNTRVVTIDGYEDVPQN 247

Query: 236 DEESLLKAVANQPVSVAIDAS--ALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYW 293
           DE+SL KA+ANQP+SVAI+A   A Q Y+ GVF G C T L+HGV AVGYG SE G  YW
Sbjct: 248 DEKSLKKALANQPISVAIEAGGRAFQLYTSGVFTGTCGTSLDHGVVAVGYG-SEGGQDYW 306

Query: 294 LIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
           +++NSWG +WGE GYF+L+R+I +  G+CG+AM AS+P 
Sbjct: 307 IVRNSWGSNWGESGYFKLERNIKESSGKCGVAMMASYPT 345


>gi|357160569|ref|XP_003578807.1| PREDICTED: vignain-like [Brachypodium distachyon]
          Length = 339

 Score =  296 bits (757), Expect = 1e-77,   Method: Compositional matrix adjust.
 Identities = 160/348 (45%), Positives = 220/348 (63%), Gaps = 26/348 (7%)

Query: 1   MAKYFLIVVLIISGSCASQATYRTF-DEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFK 59
           + K  L+ +L     C+S    R   D+ S+  + E W  QYGR YK++AE + +FE+FK
Sbjct: 3   IPKASLLAILGCLCFCSSVLAARELNDDLSMVARHESWMLQYGRVYKDAAEKASKFEVFK 62

Query: 60  DNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQT--GFKMSDHSSSLKANGTPFL 117
            N   ++ FN    GN  + L +N+FAD+T +EF A++T  GF     S+ ++A  T F 
Sbjct: 63  ANAGFIDSFN---AGNHKFWLGINQFADITNKEFKATKTNKGF----ISNKVRAP-TGFS 114

Query: 118 YKS---SQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQ 167
           Y++     +P S++W  KGAVTPVK QGQC       AVAA EGI  +   +LVSLSEQ+
Sbjct: 115 YENVSFDALPASIDWRTKGAVTPVKDQGQCGCCWAFSAVAATEGIVKLSTGKLVSLSEQE 174

Query: 168 LVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQIT 227
           LVDC  +  + GC GG MDDAFK+II N G+T ++ Y Y+    G C S      A  I 
Sbjct: 175 LVDCDVHGEDQGCEGGLMDDAFKFIISNGGLTQESSYPYDA-EDGKCKS--GSKSAGTIK 231

Query: 228 NYEDVPPNDEESLLKAVANQPVSVAIDASAL--QFYSGGVFNGYCETFLNHGVTAVGYGT 285
           +YEDVP N+E +L+KAVANQPVSVA+D   +  QFYSGGV  G C T L+HG+ A+GYG 
Sbjct: 232 SYEDVPANNEGALMKAVANQPVSVAVDGGDMTFQFYSGGVMTGSCGTDLDHGIAAIGYGV 291

Query: 286 SEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVS 333
           + +G KYWL+KNSWG  WGE+G+ R+++DI   +G CG+AM  S+P +
Sbjct: 292 TSDGTKYWLMKNSWGTSWGENGFLRMEKDIADKKGMCGLAMEPSYPTA 339


>gi|125547256|gb|EAY93078.1| hypothetical protein OsI_14879 [Oryza sativa Indica Group]
          Length = 339

 Score =  295 bits (755), Expect = 2e-77,   Method: Compositional matrix adjust.
 Identities = 157/348 (45%), Positives = 219/348 (62%), Gaps = 26/348 (7%)

Query: 1   MAKYFLIVVLIISGSC-ASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFK 59
           + K  L  +L     C A  A     D  ++  + E+W  QYGR YK++ E ++RFEIFK
Sbjct: 3   IPKALLFAILSCLCLCSAVLAAREQSDHAAMVARHERWMEQYGRVYKDATEKARRFEIFK 62

Query: 60  DNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQT--GFKMSDHSSSLKANGTPFL 117
            N+  +E FN    GN  + L +N+FADLT  EF A++T  GF      S+++   T F 
Sbjct: 63  ANVAFIESFN---AGNHKFWLGVNQFADLTNYEFRATKTNKGFI----PSTVRVP-TTFR 114

Query: 118 YKSSQV---PPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQ 167
           Y++  +   P +V+W  KGAVTP+K QGQC       AVAA+EGI  +   +L+SLSEQ+
Sbjct: 115 YENVSIDTLPATVDWRTKGAVTPIKDQGQCGCCWAFSAVAAMEGIVKLSTGKLISLSEQE 174

Query: 168 LVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQIT 227
           LVDC  +  + GC GG MDDAFK+II+N G+T ++ Y Y   + G C+     + AA I 
Sbjct: 175 LVDCDVHGEDQGCEGGLMDDAFKFIIKNGGLTTESKYPYTA-ADGKCNG--GSNSAATIK 231

Query: 228 NYEDVPPNDEESLLKAVANQPVSVAIDASAL--QFYSGGVFNGYCETFLNHGVTAVGYGT 285
            YE+VP N+E +L+KAVANQPVSVA+D   +  QFYSGGV  G C T L+HG+ A+GYG 
Sbjct: 232 GYEEVPANNEAALMKAVANQPVSVAVDGGDMTFQFYSGGVMTGSCGTDLDHGIVAIGYGK 291

Query: 286 SEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVS 333
             +G +YWL+KNSWG  WGE+G+ R+++DI   +G CG+AM  S+P +
Sbjct: 292 DGDGTQYWLLKNSWGTTWGENGFLRMEKDISDKRGMCGLAMEPSYPTA 339


>gi|414588010|tpg|DAA38581.1| TPA: hypothetical protein ZEAMMB73_156486 [Zea mays]
          Length = 347

 Score =  295 bits (754), Expect = 3e-77,   Method: Compositional matrix adjust.
 Identities = 157/347 (45%), Positives = 215/347 (61%), Gaps = 25/347 (7%)

Query: 6   LIVVLIISGSCASQATYRTF------DEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFK 59
           L++ ++  G C   A           DE ++  + EQW  Q+GR YK+  + + RF +FK
Sbjct: 7   LLLAILGCGVCLCSAAVLAARELGGDDELAMVARHEQWMVQHGRVYKDETDKAHRFLVFK 66

Query: 60  DNLVAVERFNNAAI-GNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLY 118
            N+  +E FN AA  GNR + L +N+FADLT  EF A++T    + +   +    T F Y
Sbjct: 67  ANVKFIESFNAAAAAGNRKFWLGVNQFADLTNDEFRATKTNKGFNPNVVKVP---TGFRY 123

Query: 119 KSSQV---PPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQL 168
           ++  +   P +V+W  KGAVTP+K QGQC       AVAA EGI  I   +L SLSEQ+L
Sbjct: 124 QNLSIDALPQTVDWRTKGAVTPIKDQGQCGCCWAFSAVAATEGIVKISTGKLTSLSEQEL 183

Query: 169 VDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITN 228
           VDC  +  + GC GG MDDAFK+II+N G+T ++ Y Y     G C S    + AA I  
Sbjct: 184 VDCDVHGEDQGCNGGEMDDAFKFIIKNGGLTTESNYPYTAQD-GQCKS--GSNGAATIKG 240

Query: 229 YEDVPPNDEESLLKAVANQPVSVAIDASAL--QFYSGGVFNGYCETFLNHGVTAVGYGTS 286
           YEDVP NDE +L+KAVA+QPVSVA+D   +  QFYSGGV  G C T L+HG+ A+GYG +
Sbjct: 241 YEDVPANDEAALMKAVASQPVSVAVDGGDMTFQFYSGGVMTGSCGTDLDHGIAAIGYGKT 300

Query: 287 EEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVS 333
            +G KYWL+KNSWG  WGE+G+ R+++DI   +G CG+AM  S+P +
Sbjct: 301 SDGTKYWLMKNSWGTTWGENGFLRMEKDIADKKGMCGLAMQPSYPTA 347


>gi|5823020|gb|AAD53012.1|AF089849_1 senescence-specific cysteine protease [Brassica napus]
          Length = 344

 Score =  294 bits (753), Expect = 4e-77,   Method: Compositional matrix adjust.
 Identities = 151/320 (47%), Positives = 209/320 (65%), Gaps = 17/320 (5%)

Query: 25  FDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNK 84
            DE ++ ++  +W  ++GR Y ++ E + R+ +FK N+  +ER N+   G  ++ L +N+
Sbjct: 29  LDEVAMQKRHAEWMTEHGRVYADANEKNNRYAVFKRNVERIERLNDVQSG-LTFKLAVNQ 87

Query: 85  FADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYK---SSQVPPSVNWIEKGAVTPVKYQ 141
           FADLT +EF +  TGFK +   SS +   T F Y+   S  +P SV+W +KGAVTP+K Q
Sbjct: 88  FADLTNEEFRSMYTGFKGNSVLSS-RTKPTSFRYQNVSSDALPVSVDWRKKGAVTPIKDQ 146

Query: 142 GQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQ 194
           G C       AVAA+EG+  IK  +L+SLSEQ+LVDC TND   GC GG MD AF Y I 
Sbjct: 147 GLCGSCWAFSAVAAIEGVAQIKKGKLISLSEQELVDCDTNDG--GCMGGLMDTAFNYTIT 204

Query: 195 NKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAID 254
             G+T+++ Y Y+  + G C+  K +  A  I  +EDVP NDE++L+KAVA+ PVS+ I 
Sbjct: 205 IGGLTSESNYPYKS-TNGTCNFNKTKQIATSIKGFEDVPANDEKALMKAVAHHPVSIGIA 263

Query: 255 AS--ALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQ 312
                 QFYS GVF+G C T L+HGVTAVGYG S+ G+KYW++KNSWG  WGE GY R++
Sbjct: 264 GGDIGFQFYSSGVFSGECTTHLDHGVTAVGYGRSKNGLKYWILKNSWGPKWGERGYMRIK 323

Query: 313 RDIDQPQGQCGIAMFASFPV 332
           +DI    GQCG+AM AS+P 
Sbjct: 324 KDIKPKHGQCGLAMNASYPT 343


>gi|110737404|dbj|BAF00646.1| putative cysteine proteinase [Arabidopsis thaliana]
          Length = 345

 Score =  294 bits (753), Expect = 4e-77,   Method: Compositional matrix adjust.
 Identities = 167/345 (48%), Positives = 221/345 (64%), Gaps = 23/345 (6%)

Query: 6   LIVVLII--SGSCASQATYRT--FDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDN 61
           L+ VLII  +G   SQAT RT  F E S+ +K EQW A++ R Y++  E + R ++FK N
Sbjct: 7   LVTVLIILFTGFRISQATSRTVIFREQSMVDKHEQWMARFSREYRDELEKNMRRDVFKKN 66

Query: 62  LVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFK----MSDHSSSLKANGTPFL 117
           L  +E FN    GN+SY L +N+FAD T +EF+A  TG K    +S      K   +   
Sbjct: 67  LKFIENFNKK--GNKSYKLGVNEFADWTNEEFLAIHTGLKGLTEVSPSKVVAKTISSQTW 124

Query: 118 YKSSQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVD 170
             S  V  S +W  +GAVTPVKYQGQC       AVAAVEG+  I    LVSLSEQQL+D
Sbjct: 125 NVSDMVVESKDWRAEGAVTPVKYQGQCGCCWAFSAVAAVEGVAKIAGGNLVSLSEQQLLD 184

Query: 171 CATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYE 230
           C   + +  C GG M DAF Y++QN+GI ++  YSY+G S G C S      AA+I+ ++
Sbjct: 185 C-DREYDRDCDGGIMSDAFNYVVQNRGIASENDYSYQG-SDGGCRS--NARPAARISGFQ 240

Query: 231 DVPPNDEESLLKAVANQPVSVAIDASALQF--YSGGVFNGYCETFLNHGVTAVGYGTSEE 288
            VP N+E +LL+AV+ QPVSV++DA+   F  YSGGV++G C T  NH VT VGYGTS++
Sbjct: 241 TVPSNNERALLEAVSRQPVSVSMDATGDGFMHYSGGVYDGPCGTSSNHAVTFVGYGTSQD 300

Query: 289 GIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVS 333
           G KYWL KNSWG+ W E GY R++RD+  PQG CG+A +A +PV+
Sbjct: 301 GTKYWLAKNSWGETWEEKGYIRIRRDVAWPQGMCGVAQYAFYPVA 345


>gi|218202087|gb|EEC84514.1| hypothetical protein OsI_31214 [Oryza sativa Indica Group]
          Length = 348

 Score =  294 bits (752), Expect = 5e-77,   Method: Compositional matrix adjust.
 Identities = 152/339 (44%), Positives = 213/339 (62%), Gaps = 22/339 (6%)

Query: 1   MAKYFLIVVLIISGSCASQATYRTF-DEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFK 59
           MAK  L  +L     C++    R   D+ ++A + E+W AQYGR YK+ AE ++RFE+FK
Sbjct: 3   MAKALLFAILGCLCLCSAVLAARELSDDAAMAARHERWMAQYGRMYKDDAEKARRFEVFK 62

Query: 60  DNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYK 119
            N   +E FN    GN  + L +N+FADLT  EF  ++T       ++ +    T F Y+
Sbjct: 63  ANAAFIESFN---AGNHKFWLGVNQFADLTNDEFRLTKTNKGFIPSTTRVP---TGFRYE 116

Query: 120 SSQV---PPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLV 169
           +  +   P +++W  KG VTP+K QGQC       AVAA+EGI  +   +L+SLSEQ+LV
Sbjct: 117 NVNIDALPATMDWRTKGVVTPIKDQGQCGCCWAFSAVAAMEGIVKLSTGKLISLSEQELV 176

Query: 170 DCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNY 229
           DC  +  + GC GG MDDAFK+II+N G+T ++ Y Y   +   C S+   +  A I  Y
Sbjct: 177 DCDVHGEDQGCEGGLMDDAFKFIIKNGGLTTESNYPYAA-ADDKCKSV--SNSVASIKGY 233

Query: 230 EDVPPNDEESLLKAVANQPVSVAIDASAL--QFYSGGVFNGYCETFLNHGVTAVGYGTSE 287
           EDVP N+E +L+KAVANQPVSVA+D   +  QFY GGV  G C T L+HG+ A+GYG + 
Sbjct: 234 EDVPANNEAALMKAVANQPVSVAVDGDDMTFQFYKGGVMIGSCGTDLDHGIVAIGYGKAS 293

Query: 288 EGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAM 326
           +G KYWL+KNSWG  WGE+G+ R+++DI   +G CG+AM
Sbjct: 294 DGTKYWLLKNSWGMTWGENGFLRMEKDISDKRGMCGLAM 332


>gi|18423124|ref|NP_568722.1| putative cysteine proteinase [Arabidopsis thaliana]
 gi|75309064|sp|Q9FGR9.1|CEP1_ARATH RecName: Full=KDEL-tailed cysteine endopeptidase CEP1; AltName:
           Full=Cysteine proteinase CP56; Short=AtCP56; Flags:
           Precursor
 gi|9759028|dbj|BAB09397.1| cysteine endopeptidase [Arabidopsis thaliana]
 gi|20258850|gb|AAM13907.1| putative cysteine proteinase [Arabidopsis thaliana]
 gi|308097832|gb|ADO14465.1| papain [Arabidopsis thaliana]
 gi|332008536|gb|AED95919.1| putative cysteine proteinase [Arabidopsis thaliana]
          Length = 361

 Score =  294 bits (752), Expect = 5e-77,   Method: Compositional matrix adjust.
 Identities = 157/328 (47%), Positives = 206/328 (62%), Gaps = 22/328 (6%)

Query: 27  EGSIAEKFEQWKAQY--GRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNK 84
           E S+ E +E+W++ +   R+ +E A   KRF +FK N+  +   N     ++SY L+LNK
Sbjct: 31  ENSLWELYERWRSHHTVARSLEEKA---KRFNVFKHNVKHIHETNKK---DKSYKLKLNK 84

Query: 85  FADLTPQEFIASQTGFKMSDHS--SSLKANGTPFLYKS-SQVPPSVNWIEKGAVTPVKYQ 141
           F D+T +EF  +  G  +  H      K     F+Y + + +P SV+W + GAVTPVK Q
Sbjct: 85  FGDMTSEEFRRTYAGSNIKHHRMFQGEKKATKSFMYANVNTLPTSVDWRKNGAVTPVKNQ 144

Query: 142 GQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQ 194
           GQC        V AVEGIN I+  +L SLSEQ+LVDC TN  N GC GG MD AF++I +
Sbjct: 145 GQCGSCWAFSTVVAVEGINQIRTKKLTSLSEQELVDCDTN-QNQGCNGGLMDLAFEFIKE 203

Query: 195 NKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAID 254
             G+T++ VY Y+  S   CD+ K       I  +EDVP N E+ L+KAVANQPVSVAID
Sbjct: 204 KGGLTSELVYPYKA-SDETCDTNKENAPVVSIDGHEDVPKNSEDDLMKAVANQPVSVAID 262

Query: 255 A--SALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQ 312
           A  S  QFYS GVF G C T LNHGV  VGYGT+ +G KYW++KNSWG++WGE GY R+Q
Sbjct: 263 AGGSDFQFYSEGVFTGRCGTELNHGVAVVGYGTTIDGTKYWIVKNSWGEEWGEKGYIRMQ 322

Query: 313 RDIDQPQGQCGIAMFASFPVSKESAQPS 340
           R I   +G CGIAM AS+P+   +  PS
Sbjct: 323 RGIRHKEGLCGIAMEASYPLKNSNTNPS 350


>gi|242092704|ref|XP_002436842.1| hypothetical protein SORBIDRAFT_10g009850 [Sorghum bicolor]
 gi|241915065|gb|EER88209.1| hypothetical protein SORBIDRAFT_10g009850 [Sorghum bicolor]
          Length = 296

 Score =  294 bits (752), Expect = 5e-77,   Method: Compositional matrix adjust.
 Identities = 152/310 (49%), Positives = 203/310 (65%), Gaps = 22/310 (7%)

Query: 30  IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLT 89
           +  + EQW  QY R YK++ E ++RFE+FK N+  +E FN  A GNR + L +N+FADLT
Sbjct: 1   MVARHEQWMVQYSRVYKDATEKAQRFEVFKSNVKFIESFN--AGGNRKFWLGVNQFADLT 58

Query: 90  PQEFIASQT--GFKMSDHSSSLKANGTPFLYKSSQV---PPSVNWIEKGAVTPVKYQGQC 144
             EF A++T  GFK     S +K   T F Y++  V   P +++W  KGAVTP+K QGQC
Sbjct: 59  NDEFRATKTNKGFK----PSPVKVP-TGFRYENISVDALPATIDWRTKGAVTPIKDQGQC 113

Query: 145 AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVY 204
                EGI  I   +L+SLSEQ+LVDC  +  + GC GG MDDAFK+II+  G+T ++ Y
Sbjct: 114 -----EGIVKISTGKLISLSEQELVDCDVHGEDQGCEGGLMDDAFKFIIKKGGLTTESSY 168

Query: 205 SYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASAL--QFYS 262
            Y   + G C S    +  A +  +EDVP NDE SL+KAVANQPVSVA+D   +  QFYS
Sbjct: 169 PYTA-ADGKCKS--GSNSVATVKGFEDVPANDEASLMKAVANQPVSVAVDGGDMTFQFYS 225

Query: 263 GGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQC 322
           GGV  G C T L+HG+ A+GYG + +G KYWL+KNSWG  WGE+GY R+++DI   +G C
Sbjct: 226 GGVMTGSCGTDLDHGIAAIGYGQTSDGTKYWLLKNSWGTTWGENGYLRMEKDISDKRGMC 285

Query: 323 GIAMFASFPV 332
           G+AM  S+P 
Sbjct: 286 GLAMEPSYPT 295


>gi|374530932|gb|AEP83812.2| cysteine endopeptidase EP8 [Secale cereale x Triticum durum]
          Length = 364

 Score =  293 bits (751), Expect = 6e-77,   Method: Compositional matrix adjust.
 Identities = 156/314 (49%), Positives = 202/314 (64%), Gaps = 22/314 (7%)

Query: 34  FEQWKAQYG---RTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTP 90
           +E+W++ Y    R     AE  +RF +FK+N   +   N     +R + L LNKFAD+T 
Sbjct: 40  YERWRSHYTVSRRGLGADAE-ERRFNVFKENARYIHEGNKK---DRPFRLALNKFADMTT 95

Query: 91  QEFIASQTGFKMSDH---SSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQCA-- 145
            EF  +  G ++  H   S   + +G+     +  +PP+V+W +KGAVT +K QGQC   
Sbjct: 96  DEFRRTYAGSRVRHHLSLSGGRRGDGSFRYGDADNLPPAVDWRQKGAVTAIKDQGQCGSC 155

Query: 146 -----VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITN 200
                + AVEGIN I+  +LVSLSEQ+L+DC  N NN GC GG MD AF++I +N GIT 
Sbjct: 156 WAFSTIVAVEGINKIRTGKLVSLSEQELMDC-DNVNNQGCDGGLMDYAFQFIHKN-GITT 213

Query: 201 DAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA--L 258
           ++ Y Y+G   G CD  K + HA  I  YEDVP NDE +L KAVA QPVSVAIDAS    
Sbjct: 214 ESNYPYQG-EQGSCDLAKEKAHAVTIDGYEDVPANDESALQKAVAGQPVSVAIDASGNDF 272

Query: 259 QFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQP 318
           QFYS GVF G C T L+HGV AVGYGT+ +G KYW++KNSWG+DWGE GY R+QR + Q 
Sbjct: 273 QFYSEGVFTGECSTDLDHGVAAVGYGTTRDGTKYWIVKNSWGEDWGEKGYIRMQRGVSQA 332

Query: 319 QGQCGIAMFASFPV 332
           +GQCGIAM AS+P 
Sbjct: 333 EGQCGIAMQASYPT 346


>gi|356563584|ref|XP_003550041.1| PREDICTED: cysteine proteinase RD21a-like [Glycine max]
          Length = 366

 Score =  293 bits (751), Expect = 7e-77,   Method: Compositional matrix adjust.
 Identities = 160/339 (47%), Positives = 201/339 (59%), Gaps = 18/339 (5%)

Query: 7   IVVLIISGSCA-SQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAV 65
           ++ L  + SCA   +T   + +  +   +E+W  ++ + Y    E  KRF++FKDNL  +
Sbjct: 12  LLFLSFTLSCAIDTSTITNYTDNEVMTMYEEWLVKHQKVYNGLGEKDKRFQVFKDNLGFI 71

Query: 66  ERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSS---Q 122
           +  NN    N +Y L LNKFAD+T +E+     G K       +K   T   Y  S   Q
Sbjct: 72  QEHNNNQ--NNTYKLGLNKFADMTNEEYRVMYFGTKSDAKRRLMKTKSTGHRYAYSAGDQ 129

Query: 123 VPPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATND 175
           +P  V+W  KGAV P+K QG C        VA VE IN I   + VSLSEQ+LVDC    
Sbjct: 130 LPVHVDWRVKGAVAPIKDQGSCGSCWAFSTVATVEAINKIVTGKFVSLSEQELVDC-DRA 188

Query: 176 NNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPN 235
            N GC GG MD AF++IIQN GI  D  Y Y G   GICD  K    A  I  YEDVPP 
Sbjct: 189 YNQGCNGGLMDYAFEFIIQNGGIDTDKDYPYRGFD-GICDPTKKNAKAVNIDGYEDVPPY 247

Query: 236 DEESLLKAVANQPVSVAIDAS--ALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYW 293
           DE +L KAVA QPVS+AI+AS  ALQ Y  GVF G C T L+HGV  VGYG SE G+ YW
Sbjct: 248 DENALKKAVARQPVSIAIEASGRALQLYQSGVFTGECGTSLDHGVVVVGYG-SENGVDYW 306

Query: 294 LIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
           L++NSWG  WGEDGYF++QR++  P G+CGI M AS+PV
Sbjct: 307 LVRNSWGTGWGEDGYFKMQRNVRTPTGKCGITMEASYPV 345


>gi|30690594|ref|NP_564321.2| cysteine proteinase-like protein [Arabidopsis thaliana]
 gi|28393492|gb|AAO42167.1| putative cysteine proteinase [Arabidopsis thaliana]
 gi|332192920|gb|AEE31041.1| cysteine proteinase-like protein [Arabidopsis thaliana]
          Length = 355

 Score =  293 bits (751), Expect = 7e-77,   Method: Compositional matrix adjust.
 Identities = 167/355 (47%), Positives = 221/355 (62%), Gaps = 39/355 (10%)

Query: 4   YFLIVVLIISGSC-ASQATYR-TFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDN 61
           + L+ + I+S +   SQAT R TF E  +AE  +QW  ++ R Y +  E   RF++FK N
Sbjct: 15  FMLVSLTILSMNLKVSQATSRVTFHEPIVAEHHQQWMTRFSRVYSDELEKQMRFDVFKKN 74

Query: 62  LVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSS 121
           L  +E+FN    G+R+Y L +N+FAD T +EFIA+ TG K          NG P      
Sbjct: 75  LKFIEKFNKK--GDRTYKLGVNEFADWTREEFIATHTGLK--------GVNGIPSSEFVD 124

Query: 122 QVPPSVNW-------------IEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLV 161
           ++ PS NW               +GAVTPVKYQGQC       +VAAVEG+  I  N LV
Sbjct: 125 EMIPSWNWNVSDVAGRETKDWRYEGAVTPVKYQGQCGCCWAFSSVAAVEGLTKIVGNNLV 184

Query: 162 SLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAED 221
           SLSEQQL+DC   + +NGC GG M DAF YII+N+GI ++A Y Y+  + G C       
Sbjct: 185 SLSEQQLLDC-DRERDNGCNGGIMSDAFSYIIKNRGIASEASYPYQA-AEGTCRY--NGK 240

Query: 222 HAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASALQF--YSGGVFN-GYCETFLNHGV 278
            +A I  ++ VP N+E +LL+AV+ QPVSV+IDA    F  YSGGV++  YC T +NH V
Sbjct: 241 PSAWIRGFQTVPSNNERALLEAVSKQPVSVSIDADGPGFMHYSGGVYDEPYCGTNVNHAV 300

Query: 279 TAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVS 333
           T VGYGTS EGIKYWL KNSWG+ WGE+GY R++RD+  PQG CG+A +A +PV+
Sbjct: 301 TFVGYGTSPEGIKYWLAKNSWGETWGENGYIRIRRDVAWPQGMCGVAQYAFYPVA 355


>gi|358348957|ref|XP_003638507.1| Cysteine proteinase [Medicago truncatula]
 gi|355504442|gb|AES85645.1| Cysteine proteinase [Medicago truncatula]
          Length = 362

 Score =  293 bits (750), Expect = 8e-77,   Method: Compositional matrix adjust.
 Identities = 166/356 (46%), Positives = 222/356 (62%), Gaps = 26/356 (7%)

Query: 3   KYFLIV-----VLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEI 57
           K  LIV     VL++S S        + DE S+ + +E+W++ +  + +   E  KRF +
Sbjct: 5   KLLLIVLSIALVLVVSESFDFHDKDVSSDE-SLWDLYERWRSHHTVS-RNLNEKQKRFNV 62

Query: 58  FKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHS---SSLKANGT 114
           FK N++ V   N     ++ Y L+LNKFAD+T  EF  +  G K++ H     + + +GT
Sbjct: 63  FKSNVMHVHNTNKM---DKPYKLKLNKFADMTNHEFKTTYAGSKVNHHRMFRGTPRVSGT 119

Query: 115 PFLYKS-SQVPPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQ 166
            F+Y++ ++ P SV+W +KGAVT VK QGQC        V AVEGIN IK NRLV LSEQ
Sbjct: 120 -FMYENFTKAPASVDWRKKGAVTDVKDQGQCGSCWAFSTVVAVEGINQIKTNRLVPLSEQ 178

Query: 167 QLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQI 226
           +L+DC  N  N GC GG M+ AF+YI Q  GIT ++ Y Y   + G CD+ K    A  I
Sbjct: 179 ELIDC-DNQENQGCNGGLMEYAFEYIKQKGGITTESYYPYTA-NDGSCDATKENVPAVSI 236

Query: 227 TNYEDVPPNDEESLLKAVANQPVSVAIDA--SALQFYSGGVFNGYCETFLNHGVTAVGYG 284
             +E VP NDE++LLKAVANQPVSVAIDA  S  QFYS GVF G C   LNHGV  VGYG
Sbjct: 237 DGHETVPANDEDALLKAVANQPVSVAIDAGGSDFQFYSEGVFTGDCGKELNHGVAIVGYG 296

Query: 285 TSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVSKESAQPS 340
           T+ +G  YW+++NSWG +WGE GY R++R++   +G CGIAM AS+PV   S  P+
Sbjct: 297 TTVDGTNYWIVRNSWGAEWGEQGYIRMKRNVSNKEGLCGIAMEASYPVKNSSKNPA 352


>gi|357160591|ref|XP_003578813.1| PREDICTED: vignain-like [Brachypodium distachyon]
          Length = 339

 Score =  292 bits (748), Expect = 1e-76,   Method: Compositional matrix adjust.
 Identities = 151/322 (46%), Positives = 211/322 (65%), Gaps = 25/322 (7%)

Query: 26  DEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKF 85
           D+ S+  + E W +QYGR+YK++AE  ++FE+FK N   ++ FN     N  + L +N+F
Sbjct: 29  DDLSMVARHESWMSQYGRSYKDAAEKDRKFEVFKANAAFIDSFNAK---NHKFWLGINQF 85

Query: 86  ADLTPQEFIASQT--GFKMSDHSSSLKANGTPFLYKSSQV---PPSVNWIEKGAVTPVKY 140
           AD+T +EF  ++T  GF     S+ ++A+ T F Y++  +   P +++W  KGAVTPVK 
Sbjct: 86  ADITNEEFKVTKTNKGF----ISNKVRAS-TGFSYENVSIDALPATIDWRTKGAVTPVKD 140

Query: 141 QGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYII 193
           QGQC       AVAA EGI  +   +LVSLSEQ+LVDC  +  + GC GG MDDAFK+II
Sbjct: 141 QGQCGCCWAFSAVAATEGIVKLSTGKLVSLSEQELVDCDVHGEDQGCEGGLMDDAFKFII 200

Query: 194 QNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAI 253
            N G+T ++ Y Y+    G C S      A  I +YEDVP N+E +L+KAVANQPVSVA+
Sbjct: 201 TNGGLTQESSYPYDA-EDGKCKS--GSKSAGTIKSYEDVPANNEGALMKAVANQPVSVAV 257

Query: 254 DASAL--QFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRL 311
           D   +  QFYSGGV  G C T L+HG+ A+GYG + +G KYWL+KNSWG  WGE+G+ R+
Sbjct: 258 DGGDMTFQFYSGGVMTGSCGTDLDHGIAAIGYGVTSDGTKYWLMKNSWGTSWGENGFLRM 317

Query: 312 QRDIDQPQGQCGIAMFASFPVS 333
           ++DI   +G CG+AM  S+P +
Sbjct: 318 EKDIADKKGMCGLAMEPSYPTA 339


>gi|255539310|ref|XP_002510720.1| cysteine protease, putative [Ricinus communis]
 gi|223551421|gb|EEF52907.1| cysteine protease, putative [Ricinus communis]
          Length = 349

 Score =  292 bits (747), Expect = 2e-76,   Method: Compositional matrix adjust.
 Identities = 159/315 (50%), Positives = 200/315 (63%), Gaps = 17/315 (5%)

Query: 30  IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLT 89
           + + FE W +++GR Y+ + E  +RFEIFKDNL  ++  N      R+Y L LN+FADL+
Sbjct: 43  LIDLFESWISRFGRVYESAEEKLERFEIFKDNLFHIDDTNKKV---RNYWLGLNEFADLS 99

Query: 90  PQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQCA---- 145
            +EF     G K  D S   +     F YK   +P SV+W +KGAVTPVK QG C     
Sbjct: 100 HEEFKNKYLGLK-PDLSKRAQCP-EEFTYKDVAIPKSVDWRKKGAVTPVKNQGSCGSCWA 157

Query: 146 ---VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDA 202
              VAAVEGIN I    L SLSEQ+L+DC T   NNGC GG MD AF YI+ N G+  + 
Sbjct: 158 FSTVAAVEGINQIVTGNLTSLSEQELIDCDTT-YNNGCNGGLMDYAFAYIVANGGLHKEE 216

Query: 203 VYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA--LQF 260
            Y Y  M  G CD  K E  A  I+ Y DVP N EESLLKA+ANQP+S+AI+AS    QF
Sbjct: 217 DYPYI-MEEGTCDMRKEESDAVTISGYHDVPQNSEESLLKALANQPLSIAIEASGRDFQF 275

Query: 261 YSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQG 320
           YSGGVF+G+C T L+HGV AVGYGTS +G+ Y ++KNSWG  WGE GY R++R   +P+G
Sbjct: 276 YSGGVFDGHCGTELDHGVAAVGYGTS-KGLDYIIVKNSWGPKWGEKGYIRMKRKTSKPEG 334

Query: 321 QCGIAMFASFPVSKE 335
            CGI   AS+P  K+
Sbjct: 335 ICGIYKMASYPTKKK 349


>gi|537437|gb|AAC35211.1| cysteine proteinase [Hemerocallis hybrid cultivar]
          Length = 359

 Score =  292 bits (747), Expect = 2e-76,   Method: Compositional matrix adjust.
 Identities = 172/363 (47%), Positives = 228/363 (62%), Gaps = 30/363 (8%)

Query: 1   MAK--YFLIVVLIISGSCASQATYRTFDEGSIAEK------FEQWKAQYGRTYKESAENS 52
           MAK  Y L+ V+++ GS A  A    FDE  +A +      +E+W+A +  + ++  +  
Sbjct: 1   MAKLSYALLSVVLVLGSVA-LAQSIPFDEKDLASEESLWSLYEKWRAHHAVS-RDLDDTD 58

Query: 53  KRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKA- 111
           KRF +FK+N+  +  FN     + +Y L LNKF D+T QEF ++  G K+ DH  +L+  
Sbjct: 59  KRFNVFKENVKFIHEFNQKK--DATYKLALNKFGDMTNQEFRSTYAGSKI-DHHMTLRGV 115

Query: 112 -NGTPFLY-KSSQVPPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVS 162
            +   F Y K   +P SV+W EKGAVT VK QGQC        V AVEGIN IK N LVS
Sbjct: 116 KDAGEFSYEKFHDLPTSVDWREKGAVTGVKDQGQCGSCWAFSTVVAVEGINQIKTNELVS 175

Query: 163 LSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDH 222
           LSEQQLVDC T   N+GC GG MD AF +I  N G++++  Y Y       C S +A   
Sbjct: 176 LSEQQLVDCDTK--NSGCNGGLMDYAFDFIKNNGGLSSEDSYPYLAEQKS-CGS-EANSA 231

Query: 223 AAQITNYEDVPPNDEESLLKAVANQPVSVAIDAS--ALQFYSGGVFNGYCETFLNHGVTA 280
              I  Y+DVP N+E +L+KAVANQPVSVAI+AS  A QFYS GVF+G+C T L+HGV A
Sbjct: 232 VVTIDGYQDVPRNNEAALMKAVANQPVSVAIEASGYAFQFYSQGVFSGHCGTELDHGVAA 291

Query: 281 VGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVSKESAQPS 340
           VGYG  ++G KYW++KNSWG+ WGE GY R++R I   +G+CGIAM AS+P+ K S  P 
Sbjct: 292 VGYGVDDDGKKYWIVKNSWGEGWGESGYIRMERGIKDKRGKCGIAMEASYPI-KSSPNPK 350

Query: 341 SAD 343
            A+
Sbjct: 351 KAE 353


>gi|363807062|ref|NP_001242584.1| uncharacterized protein LOC100804015 precursor [Glycine max]
 gi|255640677|gb|ACU20623.1| unknown [Glycine max]
          Length = 366

 Score =  291 bits (745), Expect = 3e-76,   Method: Compositional matrix adjust.
 Identities = 166/364 (45%), Positives = 211/364 (57%), Gaps = 23/364 (6%)

Query: 1   MAKYFLIVVLIISGSCASQA----TYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFE 56
           MA   +I  L+      S A    T   + +  +   +E+W  ++ + Y E  +  KRF+
Sbjct: 1   MASMTMIYTLLFLSFTLSYAIKTSTIINYTDNEVMAMYEEWLVRHQKGYNELGKKDKRFQ 60

Query: 57  IFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPF 116
           +FKDNL  ++  NN    N +Y L LNKFAD+T +E+ A   G K +     +K   T  
Sbjct: 61  VFKDNLGFIQEHNNNL--NNTYKLGLNKFADMTNEEYRAMYLGTKSNAKRRLMKTKSTGH 118

Query: 117 LYKSS---QVPPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQ 166
            Y  S   ++P  V+W  KGAV P+K QG C        VA VE IN I   + VSLSEQ
Sbjct: 119 RYAFSARDRLPVHVDWRMKGAVAPIKDQGSCGSCWAFSTVATVEAINKIVTGKFVSLSEQ 178

Query: 167 QLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQI 226
           +LVDC     N GC GG MD AF++IIQN GI  D  Y Y G   GICD  K       I
Sbjct: 179 ELVDC-DRAYNEGCNGGLMDYAFEFIIQNGGIDTDKDYPYRGFD-GICDPTKKNAKVVNI 236

Query: 227 TNYEDVPPNDEESLLKAVANQPVSVAIDAS--ALQFYSGGVFNGYCETFLNHGVTAVGYG 284
             YEDVPP DE +L KAVA+QPVSVAI+AS  ALQ Y  GVF G C T L+HGV  VGYG
Sbjct: 237 DGYEDVPPYDENALKKAVAHQPVSVAIEASGRALQLYQSGVFTGKCGTSLDHGVVVVGYG 296

Query: 285 TSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVSK--ESAQPSSA 342
            SE G+ YWL++NSWG  WGEDGYF++QR++    G+CGI M AS+PV     SA P+S 
Sbjct: 297 -SENGVDYWLVRNSWGTGWGEDGYFKMQRNVRTSTGKCGITMEASYPVKNGLNSAVPNSV 355

Query: 343 DKSS 346
            +S+
Sbjct: 356 YEST 359


>gi|9502421|gb|AAF88120.1|AC021043_13 Putative cysteine proteinase [Arabidopsis thaliana]
          Length = 331

 Score =  291 bits (745), Expect = 3e-76,   Method: Compositional matrix adjust.
 Identities = 164/340 (48%), Positives = 213/340 (62%), Gaps = 38/340 (11%)

Query: 18  SQATYR-TFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNR 76
           SQAT R TF E  +AE  +QW  ++ R Y +  E   RF++FK NL  +E+FN    G+R
Sbjct: 6   SQATSRVTFHEPIVAEHHQQWMTRFSRVYSDELEKQMRFDVFKKNLKFIEKFNKK--GDR 63

Query: 77  SYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNW------- 129
           +Y L +N+FAD T +EFIA+ TG K          NG P      ++ PS NW       
Sbjct: 64  TYKLGVNEFADWTREEFIATHTGLK--------GVNGIPSSEFVDEMIPSWNWNVSDVAG 115

Query: 130 ------IEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDN 176
                   +GAVTPVKYQGQC       +VAAVEG+  I  N LVSLSEQQL+DC   + 
Sbjct: 116 RETKDWRYEGAVTPVKYQGQCGCCWAFSSVAAVEGLTKIVGNNLVSLSEQQLLDC-DRER 174

Query: 177 NNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPND 236
           +NGC GG M DAF YII+N+GI ++A Y Y+  + G C        +A I  ++ VP N+
Sbjct: 175 DNGCNGGIMSDAFSYIIKNRGIASEASYPYQA-AEGTCRYNGKP--SAWIRGFQTVPSNN 231

Query: 237 EESLLKAVANQPVSVAIDASALQF--YSGGVFN-GYCETFLNHGVTAVGYGTSEEGIKYW 293
           E +LL+AV+ QPVSV+IDA    F  YSGGV++  YC T +NH VT VGYGTS EGIKYW
Sbjct: 232 ERALLEAVSKQPVSVSIDADGPGFMHYSGGVYDEPYCGTNVNHAVTFVGYGTSPEGIKYW 291

Query: 294 LIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVS 333
           L KNSWG+ WGE+GY R++RD+  PQG CG+A +A +PV+
Sbjct: 292 LAKNSWGETWGENGYIRIRRDVAWPQGMCGVAQYAFYPVA 331


>gi|356515052|ref|XP_003526215.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like [Glycine
           max]
          Length = 339

 Score =  291 bits (744), Expect = 4e-76,   Method: Compositional matrix adjust.
 Identities = 152/337 (45%), Positives = 214/337 (63%), Gaps = 14/337 (4%)

Query: 4   YFLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLV 63
           ++LI+ LI++         R   E   +E+ E+W AQYG+ Y ++AE  KRF+IFK+N+ 
Sbjct: 8   HYLILFLILT-VWTFHVMSRRLSEVCTSERHEKWMAQYGKLYTDAAEKEKRFQIFKNNVQ 66

Query: 64  AVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKS-SQ 122
            +E FN  A G++ + L +N+FADL  +EF AS    +  + S    A  T F Y+S ++
Sbjct: 67  FIESFN--AAGDKPFNLSINQFADLHNEEFKASLINVQKKE-SGVETATETSFRYESITK 123

Query: 123 VPPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATND 175
           +P +++W ++GAVTP+K QG C        VAA+EGI+ I   +LVSLSEQ+LVDC    
Sbjct: 124 IPVTMDWRKRGAVTPIKDQGNCGSCWAFSTVAAIEGIHQITTGKLVSLSEQELVDC-VKG 182

Query: 176 NNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPN 235
            + GC  G+ ++AF+++ +N G+ ++  Y Y+  +   C   K     AQI  YE+VP N
Sbjct: 183 KSEGCNFGYKEEAFEFVAKNGGLASEISYPYKA-NNKTCMVKKETQGVAQIKGYENVPSN 241

Query: 236 DEESLLKAVANQPVSVAIDASALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLI 295
            E++LLKAVANQPVSV IDA ALQFYS G+F G C T  NH VT +GYG +  G KYWL+
Sbjct: 242 SEKALLKAVANQPVSVYIDAGALQFYSSGIFTGKCGTAPNHAVTVIGYGKARGGAKYWLV 301

Query: 296 KNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
           KNSWG  WGE GY +++RDI   +G CGIA  AS+P 
Sbjct: 302 KNSWGTKWGEKGYIKMKRDIRAKEGLCGIATNASYPT 338


>gi|255557851|ref|XP_002519955.1| cysteine protease, putative [Ricinus communis]
 gi|223541001|gb|EEF42559.1| cysteine protease, putative [Ricinus communis]
          Length = 321

 Score =  290 bits (743), Expect = 5e-76,   Method: Compositional matrix adjust.
 Identities = 162/344 (47%), Positives = 216/344 (62%), Gaps = 41/344 (11%)

Query: 1   MAKYFLIVVLIISGSCASQATYRTF-DEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFK 59
           + K   I +L++  + ASQA  R   +E ++ EK EQW A++GRTY++S E  +RF+IFK
Sbjct: 5   LEKKLAIALLVVFSTWASQAMARQLINEDALVEKHEQWMARHGRTYQDSEEKERRFQIFK 64

Query: 60  DNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYK 119
            NL  ++ FN A+  N++Y L LN FADL+ +E++A+ T  KM                 
Sbjct: 65  SNLEYIDNFNKAS--NQTYQLGLNNFADLSHEEYVATYTARKMP---------------- 106

Query: 120 SSQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCA 172
             +VP S++W + GAVTP+K Q QC       A AAVEGI    +   VSLS QQL+DC 
Sbjct: 107 -VEVPESIDWRDHGAVTPIKNQYQCGCCWAFSAAAAVEGI----VANGVSLSAQQLLDCV 161

Query: 173 TNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDV 232
           ++  N GC GG+M++AF YIIQN+GI  +  Y Y+ M   +C S  A   AAQI+ +EDV
Sbjct: 162 SD--NQGCKGGWMNNAFNYIIQNQGIALETDYPYQQMQQ-MCSSRMA---AAQISGFEDV 215

Query: 233 PPNDEESLLKAVANQPVSVAIDASA---LQFYSGGVFNGY-CETFLNHGVTAVGYGTSEE 288
            P DEE+L++AVA QPVSV IDA++    + Y  GVF    C    +H VT VGYGTSE+
Sbjct: 216 TPKDEEALMRAVAKQPVSVTIDATSNPNFKLYKEGVFTAAGCGNGHSHAVTLVGYGTSED 275

Query: 289 GIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
           G KYWL KNSWG+ WGE GY RLQRDI    G CGIA++AS+P 
Sbjct: 276 GTKYWLAKNSWGETWGESGYMRLQRDIGLEGGPCGIALYASYPT 319


>gi|356515038|ref|XP_003526208.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like [Glycine
           max]
          Length = 339

 Score =  290 bits (743), Expect = 6e-76,   Method: Compositional matrix adjust.
 Identities = 152/337 (45%), Positives = 213/337 (63%), Gaps = 14/337 (4%)

Query: 4   YFLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLV 63
           ++LI+ LI++         R   E   +E+ E+W AQYG+ Y ++AE  KRF+IFK+N+ 
Sbjct: 8   HYLILFLILT-VWTFHVMSRRLSEVCTSERHEKWMAQYGKLYTDAAEKEKRFQIFKNNVQ 66

Query: 64  AVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKS-SQ 122
            +E FN  A G++ + L +N+FADL  +EF AS    +  + S    A  T F Y+S ++
Sbjct: 67  FIESFN--AAGDKPFNLSINQFADLHNEEFKASLINVQKKE-SGVETATETSFRYESITK 123

Query: 123 VPPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATND 175
           +P +++W ++GAVTP+K QG C        VAA+EGI+ I   +LVSLSEQ+LVDC    
Sbjct: 124 IPVTMDWRKRGAVTPIKDQGNCGSCWAFSIVAAIEGIHQITTGKLVSLSEQELVDC-VKG 182

Query: 176 NNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPN 235
            + GC  G+ ++AF+++ +N G+ ++  Y Y+  +   C   K     AQI  YE+VP N
Sbjct: 183 KSEGCNFGYKEEAFEFVAKNGGLASEISYPYKA-NNKTCMVKKETQGVAQIKGYENVPSN 241

Query: 236 DEESLLKAVANQPVSVAIDASALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLI 295
            E++LLKAVANQPVSV IDA ALQFYS G+F G C T  NH  T +GYG +  G KYWL+
Sbjct: 242 SEKALLKAVANQPVSVYIDAGALQFYSSGIFTGKCGTAPNHAATVIGYGKARGGAKYWLV 301

Query: 296 KNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
           KNSWG  WGE GY R++RDI   +G CGIA  AS+P 
Sbjct: 302 KNSWGTKWGEKGYIRMKRDIRAKEGLCGIATNASYPT 338


>gi|357113934|ref|XP_003558756.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like
           [Brachypodium distachyon]
          Length = 346

 Score =  290 bits (742), Expect = 6e-76,   Method: Compositional matrix adjust.
 Identities = 157/346 (45%), Positives = 209/346 (60%), Gaps = 27/346 (7%)

Query: 6   LIVVLIISGSC-ASQATYRTFDEG----SIAEKFEQWKAQYGRTYKESAENSKRFEIFKD 60
           L++V I+   C  S A     + G    ++A + EQW AQ+GR YK+ AE + R E+FK 
Sbjct: 8   LLLVAIVGCLCLCSTAVLAARELGDADNAMAARHEQWMAQFGRVYKDPAEKAHRLEVFKA 67

Query: 61  NLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQT--GFKMSDHSSSLKANGTPFLY 118
           N+  +E FN     N  + L  N+FADLT  EF AS+T  G K       ++   T F Y
Sbjct: 68  NVAFIESFNAE---NHEFWLGANQFADLTNDEFRASKTNKGIK----QGGVRDAPTGFKY 120

Query: 119 KSSQV---PPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQL 168
               +   P SV+W  KGAVTP+K QGQC       AVAA EG+  +   +LVSLSEQ+L
Sbjct: 121 SDVSIDALPASVDWRTKGAVTPIKNQGQCGSCWAFSAVAATEGVVKLSTGKLVSLSEQEL 180

Query: 169 VDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITN 228
           VDC  +  + GC GG+MDDAFK+II+N G+T +A Y Y G     C S +  + AA I  
Sbjct: 181 VDCDVHGVDQGCMGGWMDDAFKFIIKNGGLTTEANYPYTG-EDDKCKSNETVNVAATIKG 239

Query: 229 YEDVPPNDEESLLKAVANQPVSVAIDASAL--QFYSGGVFNGYCETFLNHGVTAVGYGTS 286
           YEDVP NDE +L+KAVA+QPVSV +D   +  Q Y+GGV  G C   ++HG+ A+GYG +
Sbjct: 240 YEDVPANDESALMKAVAHQPVSVVVDGGDMTFQLYAGGVMTGSCGVEMDHGIAAIGYGAT 299

Query: 287 EEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
             G KYWL+KNSWG  WGE G+ R+ +DI   +G CG+AM  S+P 
Sbjct: 300 SNGTKYWLMKNSWGTTWGEKGFLRMAKDIPDKRGMCGLAMKPSYPT 345


>gi|4731374|gb|AAD28477.1|AF133839_1 papain-like cysteine protease [Sandersonia aurantiaca]
          Length = 357

 Score =  290 bits (742), Expect = 7e-76,   Method: Compositional matrix adjust.
 Identities = 155/327 (47%), Positives = 204/327 (62%), Gaps = 23/327 (7%)

Query: 27  EGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFA 86
           E S+   +E+W++ +  + ++  +  KRF +FK+N+  +  FN     + ++ L LNKF 
Sbjct: 31  EDSLWSLYERWRSHHAVS-RDLDQKQKRFNVFKENVKFIHEFNKNK--DVTFKLALNKFG 87

Query: 87  DLTPQEFIASQTGFKMSDH-----SSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQ 141
           D+T QEF A   G K+  H     S     +G  F+Y+++  PPS++W E+GAV  VK Q
Sbjct: 88  DMTNQEFRAKYAGSKVHHHRTMKGSRHGSGSGAKFMYENAVAPPSIDWRERGAVAAVKNQ 147

Query: 142 GQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQ 194
           GQC       A+AAVEGIN I    LV LSEQ+L+DC T D N GC GG MD AF++I  
Sbjct: 148 GQCGSCWAFSAIAAVEGINQIVTKELVPLSEQELIDCDT-DQNQGCSGGLMDYAFEFIKN 206

Query: 195 NKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAID 254
           N GIT + VY Y+        + K    A  I  YEDVP NDE++L+KAVANQPV+VAI+
Sbjct: 207 NGGITTEDVYPYQAEDA----TCKKNSPAVVIDGYEDVPTNDEDALMKAVANQPVAVAIE 262

Query: 255 ASA--LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQ 312
           AS    QFYS GVF G C T L+HGV  VGYGT+++G KYW ++NSWG DWGE GY R+Q
Sbjct: 263 ASGYVFQFYSEGVFTGRCGTELDHGVAVVGYGTTQDGTKYWTVRNSWGADWGESGYVRMQ 322

Query: 313 RDIDQPQGQCGIAMFASFPVSKESAQP 339
           R I    G CGIAM AS+P+ K S  P
Sbjct: 323 RGIKATHGLCGIAMQASYPI-KTSLNP 348


>gi|297851334|ref|XP_002893548.1| peptidase C1A papain family protein [Arabidopsis lyrata subsp.
           lyrata]
 gi|297339390|gb|EFH69807.1| peptidase C1A papain family protein [Arabidopsis lyrata subsp.
           lyrata]
          Length = 346

 Score =  290 bits (742), Expect = 7e-76,   Method: Compositional matrix adjust.
 Identities = 166/355 (46%), Positives = 219/355 (61%), Gaps = 39/355 (10%)

Query: 4   YFLIVVLIISGSC-ASQATYR-TFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDN 61
           +  + + I+S S   SQAT R TF E  +AE  +QW  ++ R Y +  E   RF++FK N
Sbjct: 6   FMFVSLTILSMSLKVSQATSRVTFHEPIVAEHHQQWMTRFSRVYSDELEKQMRFDVFKKN 65

Query: 62  LVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSS 121
           L  +E+FN    G+R+Y L +N+FAD T +EFIA+ TG K          NG P      
Sbjct: 66  LKFIEKFNKK--GDRTYKLGVNEFADWTKEEFIATHTGLK--------GFNGIPSSEFVD 115

Query: 122 QVPPSVNW-------------IEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLV 161
           ++ PS NW               +GAVTPVKYQGQC       +VAAVEG+  I    LV
Sbjct: 116 EMIPSWNWNVSDVAGPEIKDWRYEGAVTPVKYQGQCGCCWAFSSVAAVEGLTKIVGGNLV 175

Query: 162 SLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAED 221
           SLSEQQL+DC   + +NGC GG M DAF YII+N+GI ++A Y Y+  + G C       
Sbjct: 176 SLSEQQLLDC-DRERDNGCNGGIMSDAFSYIIKNRGIASEASYPYQ-ETEGTCR--YNAK 231

Query: 222 HAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASALQF--YSGGVFN-GYCETFLNHGV 278
            +A I  ++ VP N+E +LL+AV+ QPVSV+IDA    F  YSGGV++  YC T +NH V
Sbjct: 232 PSAWIRGFQTVPSNNERALLEAVSRQPVSVSIDADGPGFMHYSGGVYDEPYCGTDVNHAV 291

Query: 279 TAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVS 333
           T VGYGTS EGIKYWL KNSWG+ WGE+GY R++RD+  PQG CG+A +A +PV+
Sbjct: 292 TFVGYGTSPEGIKYWLAKNSWGETWGENGYIRIRRDVAWPQGMCGVAQYAFYPVA 346


>gi|224133764|ref|XP_002321655.1| predicted protein [Populus trichocarpa]
 gi|222868651|gb|EEF05782.1| predicted protein [Populus trichocarpa]
          Length = 360

 Score =  290 bits (742), Expect = 7e-76,   Method: Compositional matrix adjust.
 Identities = 160/327 (48%), Positives = 211/327 (64%), Gaps = 22/327 (6%)

Query: 27  EGSIAEKFEQWKAQYGRTYKESA-ENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKF 85
           E S+ + +E+W++ +  T   S  E  KRF +F+ N++ V   N     ++ Y L+LNKF
Sbjct: 31  EESLWDLYEKWRSHH--TVSTSLDEKRKRFNVFRANVLHVHNTNKM---DKPYKLKLNKF 85

Query: 86  ADLTPQEFIASQTGFKMSDHSSSLKA---NGTPFLYKS-SQVPPSVNWIEKGAVTPVKYQ 141
           AD+T  EF  +    K+  H+    A   NG+ F+Y +  +VP S++W +KGAVTPVK Q
Sbjct: 86  ADMTNHEFRTAYASSKVKHHTMFRGAPLGNGS-FMYGNIDKVPASIDWRKKGAVTPVKDQ 144

Query: 142 GQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQ 194
           G+C        + AVEGIN IK N+L+SLSEQ+LVDC T +N+ GC GG MD AF++I +
Sbjct: 145 GKCGSCWAFSTIVAVEGINFIKTNKLISLSEQELVDCNTGENH-GCNGGLMDYAFEFITK 203

Query: 195 NKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAID 254
            KGIT +A Y Y     G CD+ KA   A  I  +EDV  N+E +LLKAVANQPVSVAID
Sbjct: 204 QKGITTEANYPYRAQD-GHCDANKANQPAVSIDGHEDVLHNNENALLKAVANQPVSVAID 262

Query: 255 A--SALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQ 312
           A  S  QFYS GVF G C   L+HGV  VGYGT+ +G KYW+++NSWG +WGE GY R+Q
Sbjct: 263 AGGSDFQFYSEGVFTGECGKELDHGVAIVGYGTTVDGTKYWIVRNSWGPEWGERGYIRMQ 322

Query: 313 RDIDQPQGQCGIAMFASFPVSKESAQP 339
           R I   +G CGIAM AS+P+ K S  P
Sbjct: 323 RGISDRRGLCGIAMEASYPIKKSSTNP 349


>gi|115468686|ref|NP_001057942.1| Os06g0582600 [Oryza sativa Japonica Group]
 gi|55296512|dbj|BAD68726.1| putative cysteine proteinase [Oryza sativa Japonica Group]
 gi|113595982|dbj|BAF19856.1| Os06g0582600 [Oryza sativa Japonica Group]
 gi|215695236|dbj|BAG90427.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 357

 Score =  290 bits (741), Expect = 8e-76,   Method: Compositional matrix adjust.
 Identities = 160/355 (45%), Positives = 222/355 (62%), Gaps = 29/355 (8%)

Query: 1   MAKYFLIVVL------IISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKR 54
           +A   LI+++      ++  +    A     D+ ++ E++E+W A +GRTYK+S E ++R
Sbjct: 10  LAAILLIIIMYCCPTGLVEAARKGPAAAGGGDDSAMRERYEKWAADHGRTYKDSLEKARR 69

Query: 55  FEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGT 114
           FE+F+ N + ++ FN AA G +S  L  NKFADLT +EF A   G   S    +    G+
Sbjct: 70  FEVFRTNALFIDSFN-AAGGKKSPRLTTNKFADLTNEEF-AEYYGRPFS----TPVIGGS 123

Query: 115 PFLY---KSSQVPPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLS 164
            F+Y   ++S VP ++NW ++GAVT VK Q  CA       VAAVEGI+ I+ + LV+LS
Sbjct: 124 GFMYGNVRTSDVPANINWRDRGAVTQVKNQKDCASCWAFSAVAAVEGIHQIRSHNLVALS 183

Query: 165 EQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAA 224
            QQL+DC+T  NN+GC  G MD+AF+YI  N GI  ++ Y YE  + G C +   +  AA
Sbjct: 184 TQQLLDCSTGRNNHGCNRGDMDEAFRYITSNGGIAAESDYPYEDRALGTCRA-SGKPVAA 242

Query: 225 QITNYEDVPPNDEESLLKAVANQPVSVAIDA--SALQFYSGGVF----NGYCETFLNHGV 278
            I  ++ VPPN+E +LL AVA+QPVSVA+D      QF+S GVF    N  C T LNH +
Sbjct: 243 SIRGFQYVPPNNETALLLAVAHQPVSVALDGVGKVSQFFSSGVFGAMQNETCTTDLNHAM 302

Query: 279 TAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVS 333
           TAVGYGT E G KYWL+KNSWG DWGE GY ++ RD+    G CG+AM  S+PV+
Sbjct: 303 TAVGYGTDEHGTKYWLMKNSWGTDWGEGGYMKIARDVASNTGLCGLAMQPSYPVA 357


>gi|357507505|ref|XP_003624041.1| Cysteine proteinase [Medicago truncatula]
 gi|355499056|gb|AES80259.1| Cysteine proteinase [Medicago truncatula]
          Length = 342

 Score =  289 bits (740), Expect = 1e-75,   Method: Compositional matrix adjust.
 Identities = 150/313 (47%), Positives = 204/313 (65%), Gaps = 21/313 (6%)

Query: 29  SIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADL 88
           S++E+FE WK +YG  YK+ AE  K F+IFK N+  ++ FN  A GN+ Y L +N+F D 
Sbjct: 37  SLSERFEYWKTKYGVVYKDVAEQKKHFQIFKHNVAYIDYFN--AAGNKPYKLAINRFVDK 94

Query: 89  TPQEFIASQTGFKMSDHSSSLKANGTPFLYKS-SQVPPSVNWIEKGAVTPVKYQGQC--- 144
             ++   S  GF+ +  ++        F Y++ + +P +V+W ++GAVTP+K QG+C   
Sbjct: 95  PIED---SDDGFERTTTTTPTTT----FKYENVTDIPATVDWRKRGAVTPIKNQGKCGSC 147

Query: 145 ----AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITN 200
               AVAA+EGI  I    LVSLSEQQLVDC  +    GC  G M +AFK+I++N GI  
Sbjct: 148 WAFSAVAAIEGIQKITSGNLVSLSEQQLVDCDRSGRTKGCDNGNMINAFKFILENGGIAT 207

Query: 201 DAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASAL-Q 259
           +A Y Y+ +  G C  +    H  QI +YE+VP N E+SLLKAVANQPVSV ID   + +
Sbjct: 208 EANYPYKRVVKGTCKKV---SHKVQIKSYEEVPSNSEDSLLKAVANQPVSVGIDMRGMFK 264

Query: 260 FYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQ 319
           FYS G+F G C T  NH +T VGYGTS++GIKYWL+KNSW + WGE GY R++RDID  +
Sbjct: 265 FYSSGIFTGECGTKPNHALTIVGYGTSKDGIKYWLVKNSWSKRWGEKGYIRIKRDIDAKE 324

Query: 320 GQCGIAMFASFPV 332
           G CGIAM  S+P+
Sbjct: 325 GLCGIAMKPSYPI 337


>gi|356515044|ref|XP_003526211.1| PREDICTED: LOW QUALITY PROTEIN: thiol protease SEN102-like [Glycine
           max]
          Length = 337

 Score =  289 bits (740), Expect = 1e-75,   Method: Compositional matrix adjust.
 Identities = 160/343 (46%), Positives = 218/343 (63%), Gaps = 26/343 (7%)

Query: 1   MAKYFLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKD 60
           +A + L+ + I      SQ   R   E S+ E+ E W A+YG+ YK +AE  + F+IFK+
Sbjct: 11  LALFLLLSIEI------SQVMSRKLHETSLREEHENWIARYGQVYKVAAEK-ETFQIFKE 63

Query: 61  NLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKS 120
           N+  +E FN AA  N+ Y L +N FADLT +EF   + G K + H  S+    TPF Y++
Sbjct: 64  NVEFIESFNAAA--NKPYKLGVNLFADLTLEEFKDFRFGLKKT-HEFSI----TPFKYEN 116

Query: 121 -SQVPPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCA 172
            + +P +++W EKGAVTP+K QGQC        VAA EGI+ I    LVSL EQ+LV C 
Sbjct: 117 VTDIPEALDWREKGAVTPIKDQGQCGSCWAFSTVAATEGIHQITTGNLVSLXEQELVSCD 176

Query: 173 TNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDV 232
           T   + GC GG+M+D F++II+N GIT  A Y Y+G++ G C++  A    AQI  YE V
Sbjct: 177 TKGVDQGCEGGYMEDGFEFIIKNGGITTKANYPYKGVN-GTCNTTIAASTVAQIKGYETV 235

Query: 233 PPNDEESLLKAVANQPVSVAIDAS--ALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGI 290
           P   EE+L KAVANQPVSV+IDA+     FY+GG++ G C T L+HGVTAVGYGT+ E  
Sbjct: 236 PSYSEEALQKAVANQPVSVSIDANNGHFMFYAGGIYTGECGTDLDHGVTAVGYGTTNE-T 294

Query: 291 KYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVS 333
            YW++KNSWG  W E G+ R+QR I    G CG+A+ +S+P +
Sbjct: 295 DYWIVKNSWGTGWDEKGFIRMQRGITVKHGLCGVALDSSYPTT 337


>gi|60100207|gb|AAX13273.1| putative cysteine protease [Oryza sativa Japonica Group]
          Length = 349

 Score =  289 bits (740), Expect = 1e-75,   Method: Compositional matrix adjust.
 Identities = 154/324 (47%), Positives = 202/324 (62%), Gaps = 22/324 (6%)

Query: 26  DEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKF 85
           D  ++A++ E+W A++GR Y + AE ++R E+F+DN+  +E  N AA     + L  N+F
Sbjct: 32  DAAAMAQRHERWMAKHGRAYADDAEKARRLEVFRDNVAFIESVN-AAASQHKFWLEENQF 90

Query: 86  ADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQV-----PPSVNWIEKGAVTPVKY 140
           ADLT  EF A++TG +     SS + N  P  ++ + V     P SV+W  KGAV PVK 
Sbjct: 91  ADLTNAEFRATRTGLR----PSSSRGNRAPTSFRYANVSTGDLPASVDWRGKGAVNPVKD 146

Query: 141 QGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYII 193
           QG C       AVAA+EG   +   +LVSLSEQQLV C     + GC GG MDDAF +II
Sbjct: 147 QGDCGCCWAFSAVAAMEGAVKLATGKLVSLSEQQLVSCDVKGEDQGCEGGLMDDAFDFII 206

Query: 194 QNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAI 253
           +N G+  ++ Y Y   S   C +  A   AA I  YEDVP NDE +LLKAVANQPVSVAI
Sbjct: 207 KNGGLAAESDYPYTA-SDDKCATAGAGAAAATIKGYEDVPANDEAALLKAVANQPVSVAI 265

Query: 254 DAS--ALQFYSGGVFNGY--CETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYF 309
           D      QFY GGV +G   C T L+H +TAVGYG + +G KYWL+KNSWG  WGEDGY 
Sbjct: 266 DGGDRHFQFYKGGVLSGAAGCATELDHAITAVGYGVASDGTKYWLMKNSWGTSWGEDGYV 325

Query: 310 RLQRDIDQPQGQCGIAMFASFPVS 333
           R++R +   +G CG+AM AS+P +
Sbjct: 326 RMERGVADKEGVCGLAMMASYPTA 349


>gi|413917937|gb|AFW57869.1| hypothetical protein ZEAMMB73_830006 [Zea mays]
          Length = 443

 Score =  289 bits (739), Expect = 1e-75,   Method: Compositional matrix adjust.
 Identities = 152/338 (44%), Positives = 211/338 (62%), Gaps = 27/338 (7%)

Query: 1   MAKYF---LIVVLIISGSCASQATYRTFD----EGSIAEKFEQWKAQYGRTYKESAENSK 53
           MA ++    +++ +++ +CA   +    D    + ++  + E+W A+Y R Y ++AE ++
Sbjct: 1   MATHYSSAFVLLSVVAWACALSGSLAARDLADQDQAMVARHEEWMAKYDRVYSDAAEKAR 60

Query: 54  RFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANG 113
           RFE+FK N+  +E  N    GN  + L  N+FADLT  EF A+ TG++    ++S K   
Sbjct: 61  RFEVFKANMALIESVN---AGNHKFWLEANRFADLTDDEFRATWTGYRPKTAAASSKGRS 117

Query: 114 ----TPFLYKS---SQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINR 159
               T F Y +     VP SV+W  KGAVTP+K QG+C       AVA++EG+  +   +
Sbjct: 118 RTATTGFKYANVSLDDVPASVDWRTKGAVTPIKNQGECGCCWAFSAVASMEGVVKLSTGK 177

Query: 160 LVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKA 219
           LVSLSEQ+LVDC  N  + GC GG MDDAF +I+ N G+T ++ Y Y   S G C+S +A
Sbjct: 178 LVSLSEQELVDCDVNGMDQGCEGGEMDDAFDFIVGNGGLTTESRYPYTA-SDGTCNSNEA 236

Query: 220 EDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDA--SALQFYSGGVFNGYCETFLNHG 277
              AA I  YEDVP NDE SL KAVANQPVSVA+D   S  +FY GGV +G C T L+HG
Sbjct: 237 SGDAASIKGYEDVPANDEASLRKAVANQPVSVAVDGGDSHFRFYKGGVLSGACGTELDHG 296

Query: 278 VTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDI 315
           + AVGYG + +G KYW++KNSWG  WGE GY R++RDI
Sbjct: 297 IAAVGYGVASDGTKYWVMKNSWGTSWGEAGYIRMERDI 334


>gi|297792329|ref|XP_002864049.1| hypothetical protein ARALYDRAFT_495086 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297309884|gb|EFH40308.1| hypothetical protein ARALYDRAFT_495086 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 361

 Score =  289 bits (739), Expect = 2e-75,   Method: Compositional matrix adjust.
 Identities = 156/328 (47%), Positives = 202/328 (61%), Gaps = 22/328 (6%)

Query: 27  EGSIAEKFEQWKAQY--GRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNK 84
           E S+ E +E+WK+ +   R+ +E A   KRF +FK N+  +   N       SY L+LNK
Sbjct: 31  EDSLWELYERWKSHHTIARSLEEKA---KRFNVFKHNVKHIHETNKK---ENSYKLKLNK 84

Query: 85  FADLTPQEFIASQTGFKMSDHS--SSLKANGTPFLYKS-SQVPPSVNWIEKGAVTPVKYQ 141
           F D+T +EF  +  G  +  H      +     F+Y +   +P SV+W + GAVTPVK Q
Sbjct: 85  FGDMTSEEFRRTYAGSNIKHHRMFQGERQTTKSFMYANVDTLPTSVDWRKNGAVTPVKNQ 144

Query: 142 GQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQ 194
           GQC        V AVEGIN I+  +L SLSEQ+LVDC TN  N GC GG MD AF++I +
Sbjct: 145 GQCGSCWAFSTVVAVEGINQIRTKKLTSLSEQELVDCDTN-KNQGCNGGLMDLAFEFIKE 203

Query: 195 NKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAID 254
             G+T++ VY Y+  S   CD+ K       I  +EDVP N E  L+KAVA+QPVSVAID
Sbjct: 204 KGGLTSELVYPYKA-SDETCDTNKENAPVVSIDGHEDVPKNSEVDLMKAVAHQPVSVAID 262

Query: 255 A--SALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQ 312
           A  S  QFYS GVF G C T LNHGV  VGYGT+ +G KYW++KNSWG++WGE GY R+Q
Sbjct: 263 AGGSDFQFYSEGVFTGRCGTELNHGVAVVGYGTTIDGTKYWIVKNSWGEEWGEKGYIRMQ 322

Query: 313 RDIDQPQGQCGIAMFASFPVSKESAQPS 340
           R I   +G CGIAM AS+P+   +  PS
Sbjct: 323 RGIRHKEGLCGIAMEASYPLKNSNTNPS 350


>gi|171702841|dbj|BAG16376.1| cysteine protease [Brassica rapa var. perviridis]
          Length = 333

 Score =  288 bits (737), Expect = 2e-75,   Method: Compositional matrix adjust.
 Identities = 149/316 (47%), Positives = 206/316 (65%), Gaps = 17/316 (5%)

Query: 25  FDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNK 84
            DE ++ ++  +W  ++GR Y ++ E + R+ +FK N+  +ER N+   G  ++ L +N+
Sbjct: 23  LDEVAMQKRHAEWMTEHGRVYADANEKNNRYAVFKRNVERIERLNDVQSG-LTFKLAVNQ 81

Query: 85  FADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYK---SSQVPPSVNWIEKGAVTPVKYQ 141
           FADLT +EF +  TGFK +   SS +   T F Y+   S  +P SV+W +KGAVTP+K Q
Sbjct: 82  FADLTNEEFRSMYTGFKGNSVLSS-RTKPTSFRYQNVSSDALPVSVDWRKKGAVTPIKDQ 140

Query: 142 GQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQ 194
           G C       AVAA+EG+  IK  +L+SLSEQ+LVDC TND   GC GG MD AF Y I 
Sbjct: 141 GLCGSCWAFSAVAAIEGVAQIKKGKLISLSEQELVDCDTNDG--GCMGGLMDTAFNYTIT 198

Query: 195 NKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAID 254
             G+T+++ Y Y+  + G C+  K +  A  I  +EDVP NDE++L+KAVA+ PVS+ I 
Sbjct: 199 IGGLTSESNYPYKS-TNGTCNFNKTKQIATSIKGFEDVPANDEKALMKAVAHHPVSIGIA 257

Query: 255 AS--ALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQ 312
                 QFYS GVF+G C T L+HGVTAVGYG S+ G+KYW++KNSWG  WGE GY R++
Sbjct: 258 GGDIGFQFYSSGVFSGECTTHLDHGVTAVGYGRSKNGLKYWILKNSWGPKWGERGYMRIK 317

Query: 313 RDIDQPQGQCGIAMFA 328
           +DI    GQCG+AM A
Sbjct: 318 KDIKPKHGQCGLAMNA 333


>gi|148907299|gb|ABR16787.1| unknown [Picea sitchensis]
          Length = 372

 Score =  288 bits (737), Expect = 3e-75,   Method: Compositional matrix adjust.
 Identities = 156/317 (49%), Positives = 204/317 (64%), Gaps = 22/317 (6%)

Query: 34  FEQWKAQYGRTYK-ESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQE 92
           +++W  Q+  T   +S E+++RFEIFK+N+  ++  N     +  Y L LNKFADL+ +E
Sbjct: 45  YDKWALQHRSTRSLDSDEHARRFEIFKENVKHIDSVNKK---DGPYKLGLNKFADLSNEE 101

Query: 93  FIASQTGFKMSDHSSSLKANGT---PFLYKSS-QVPPSVNWIEKGAVTPVKYQGQCA--- 145
           F A     KM  H S     G     F+Y++S ++P S++W +KGAVTPVK QGQC    
Sbjct: 102 FKAMHMTTKMEKHKSLRGDRGVESGSFMYQNSKRLPASIDWRKKGAVTPVKNQGQCGSCW 161

Query: 146 ----VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITND 201
               +A+VEGIN IK  +LVSLSEQQLVDC+    N GC GG MD+AF+YII N GI  +
Sbjct: 162 AFSTIASVEGINYIKTGKLVSLSEQQLVDCSKE--NAGCNGGLMDNAFQYIIDNGGIVTE 219

Query: 202 AVYSYEGMSTGICDSIKAEDH--AAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA-- 257
             Y Y     G C + K E    A  I  +EDVP N+E +L KAVA+QPVS+AI+AS   
Sbjct: 220 DEYPYTA-EAGECSTTKIESKSIATIIDGFEDVPANNEGALKKAVAHQPVSIAIEASGHD 278

Query: 258 LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQ 317
            QFYS GVF G C T L+HGV  VGYG S EGI YW+++NSWG +WGE GY R+QR I+ 
Sbjct: 279 FQFYSTGVFTGKCGTELDHGVVVVGYGKSPEGINYWIVRNSWGPEWGEQGYIRMQRGIEA 338

Query: 318 PQGQCGIAMFASFPVSK 334
            +G+CGI+M AS+P  K
Sbjct: 339 TEGKCGISMQASYPTKK 355


>gi|356521444|ref|XP_003529366.1| PREDICTED: thiol protease SEN102-like [Glycine max]
          Length = 340

 Score =  288 bits (737), Expect = 3e-75,   Method: Compositional matrix adjust.
 Identities = 163/344 (47%), Positives = 215/344 (62%), Gaps = 28/344 (8%)

Query: 1   MAKYFLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKD 60
           +  +F++ +     +C  +A+ RT  E SIA + E+W A + R Y +SAE  +R +IFK+
Sbjct: 10  VGTFFMLFL-----TCICRASSRTLSESSIATQHEEWMAMHDRVYADSAEKDRRQQIFKE 64

Query: 61  NLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTG--FKMSDHSSSLKANGTPFLY 118
           NL  +E+ NN   G + Y L LN FADLT +EF+AS TG  +K      S K N +   +
Sbjct: 65  NLEFIEKHNNE--GKKRYNLSLNSFADLTNEEFVASHTGALYKPPTQLGSFKINHSLGFH 122

Query: 119 KSS--QVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLV 169
           K S   +  S++W ++GAV  +K QG+C       AVAAVEGIN IK  +LVSLSEQ LV
Sbjct: 123 KMSVGDIEASLDWRKRGAVNDIKNQGRCGSCWAFSAVAAVEGINQIKNGQLVSLSEQNLV 182

Query: 170 DCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNY 229
           DCA+ND   GC+G +++ AF YI ++ G+ N+  Y Y   + G C      + A QI  Y
Sbjct: 183 DCASND---GCHGQYVEKAFDYI-RDYGLANEEEYPYV-ETVGTCSG--NSNPAIQIRGY 235

Query: 230 EDVPPNDEESLLKAVANQPVSVAIDASA--LQFYSGGVFNGYCETFLNHGVTAVGYGTSE 287
           + V P +EE LL AVA+QPVSV ++A     QFYSGGVF+G C T LNH VT VGYG   
Sbjct: 236 QSVTPQNEEQLLTAVASQPVSVLLEAKGQGFQFYSGGVFSGECGTELNHAVTIVGYGEEA 295

Query: 288 EGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFP 331
           EG KYWLI+NSWG+ WGE GY +L RD   PQG CGI M AS+P
Sbjct: 296 EG-KYWLIRNSWGKSWGEGGYMKLMRDTGNPQGLCGINMQASYP 338


>gi|388517427|gb|AFK46775.1| unknown [Medicago truncatula]
          Length = 362

 Score =  288 bits (736), Expect = 3e-75,   Method: Compositional matrix adjust.
 Identities = 163/356 (45%), Positives = 220/356 (61%), Gaps = 26/356 (7%)

Query: 3   KYFLIV-----VLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEI 57
           K  LIV     VL++S S        + DE S+ + +E+W++ +  + +   E  KRF +
Sbjct: 5   KLLLIVLSIALVLVVSESFDFHDKDVSSDE-SLWDLYERWRSHHTVS-RNLNEKQKRFNV 62

Query: 58  FKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHS---SSLKANGT 114
           FK N++ V   N     ++ Y L+LNKFAD+T  EF  +  G K++ H     + + +GT
Sbjct: 63  FKSNVMHVHNTNKM---DKPYKLKLNKFADMTNHEFKTTYAGTKVNHHRMFRGTPRVSGT 119

Query: 115 PFLYKS-SQVPPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQ 166
            F+Y++ ++ P SV+W +KGAVT VK QGQC        V AVEGIN IK NRLV LSEQ
Sbjct: 120 -FMYENFTKAPASVDWRKKGAVTDVKDQGQCGSCWAFSTVVAVEGINQIKTNRLVPLSEQ 178

Query: 167 QLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQI 226
           +L+DC  N  N GC GG M+ AF+YI Q  G+T ++ Y Y   + G CD+ K       I
Sbjct: 179 ELIDC-DNQENQGCNGGLMEYAFEYIKQKGGVTTESYYPYTA-NDGSCDATKENVPTVSI 236

Query: 227 TNYEDVPPNDEESLLKAVANQPVSVAIDA--SALQFYSGGVFNGYCETFLNHGVTAVGYG 284
             +E VP NDE++LLKAVANQPVSVAIDA  S  QFYS GVF G C   LNHGV  VGYG
Sbjct: 237 DGHETVPANDEDALLKAVANQPVSVAIDAGGSDFQFYSEGVFTGDCGKELNHGVAIVGYG 296

Query: 285 TSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVSKESAQPS 340
           T+ +G  YW+++NSWG +WGE G  R++R++   +G CGIAM AS+PV   S  P+
Sbjct: 297 TTVDGTNYWIVRNSWGAEWGEQGCIRMKRNVSNKEGLCGIAMEASYPVKNSSKNPA 352


>gi|21666724|gb|AAM73806.1|AF448505_1 cysteine proteinase [Brassica napus]
 gi|21666726|gb|AAM73807.1|AF448506_1 cysteine proteinase [Brassica napus]
          Length = 343

 Score =  288 bits (736), Expect = 4e-75,   Method: Compositional matrix adjust.
 Identities = 146/322 (45%), Positives = 207/322 (64%), Gaps = 17/322 (5%)

Query: 23  RTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRL 82
           R  DE ++ ++   W  ++GR Y ++ E + R+ +FK N+ ++ER N    G  ++ L +
Sbjct: 26  RPLDEVTMQKRHAAWMTEHGRVYADANEKNNRYVVFKRNVESIERLNEVQYG-LTFKLAV 84

Query: 83  NKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYK---SSQVPPSVNWIEKGAVTPVK 139
           N+FADLT +EF +  TG+K +   SS +   T F Y+   S  +P SV+W +KGAVTP+K
Sbjct: 85  NQFADLTNEEFRSMYTGYKGNSVLSS-RTKPTSFRYQHVSSDALPISVDWRKKGAVTPIK 143

Query: 140 YQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYI 192
            QG C       AVAA+EG+  IK  +L+SLSEQ+LVDC TND+  GC GG+M+ AF Y 
Sbjct: 144 DQGSCGSCWAFSAVAAIEGVAQIKKGKLISLSEQELVDCDTNDD--GCMGGYMNSAFNYT 201

Query: 193 IQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVA 252
           +   G+T+++ Y Y+  + G C+  K +  A  I  +EDVP NDE++L+KAVA+ PVS+ 
Sbjct: 202 MTTGGLTSESNYPYKS-TDGTCNINKTKQIATSIKGFEDVPANDEKALMKAVAHHPVSIG 260

Query: 253 I--DASALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFR 310
           I    +  QFYS GVF+G C T L+HGV  VGYG S  G KYW++KNSWG  WGE GY R
Sbjct: 261 IAGGGTGFQFYSSGVFSGECSTHLDHGVAVVGYGKSSNGSKYWILKNSWGPKWGERGYMR 320

Query: 311 LQRDIDQPQGQCGIAMFASFPV 332
           +++D     GQCG+AM AS+P 
Sbjct: 321 IKKDTKAKHGQCGLAMNASYPT 342


>gi|37780041|gb|AAP32193.1| cysteine protease 14 [Trifolium repens]
          Length = 351

 Score =  288 bits (736), Expect = 4e-75,   Method: Compositional matrix adjust.
 Identities = 151/315 (47%), Positives = 201/315 (63%), Gaps = 15/315 (4%)

Query: 30  IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLT 89
           + E FE W +++G+ Y+   E   RFE+FKDNL  ++  N       +Y L LN+FADL+
Sbjct: 43  LIELFESWMSRHGKIYETIEEKLLRFEVFKDNLKHIDERNKIV---SNYWLGLNEFADLS 99

Query: 90  PQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQCA---- 145
            QEF     G K++       +N   F Y+   +P SV+W +KGAVTPVK QGQC     
Sbjct: 100 HQEFKNKYLGLKVNLSQRRESSNEEEFTYRDVDLPKSVDWRKKGAVTPVKNQGQCGSCWA 159

Query: 146 ---VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDA 202
              VAAVEGIN I    L SLSEQ+L+DC T   NNGC GG MD AF +I+QN G+  + 
Sbjct: 160 FSTVAAVEGINQIVTGNLTSLSEQELIDCDTT-YNNGCNGGLMDYAFSFIVQNGGLHKED 218

Query: 203 VYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA--LQF 260
            Y Y  M    C+  K E     I  Y DVP N+E+SLLKA+ANQP+SVAI+AS+   QF
Sbjct: 219 DYPYI-MEESTCEMKKEETQVVTINGYHDVPQNNEQSLLKALANQPLSVAIEASSRDFQF 277

Query: 261 YSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQG 320
           YSGGVF+G+C + L+HGV+AVGYGTS + + Y ++KNSWG  WGE G+ R++R+I +P+G
Sbjct: 278 YSGGVFDGHCGSDLDHGVSAVGYGTS-KNLDYIIVKNSWGAKWGEKGFIRMKRNIGKPEG 336

Query: 321 QCGIAMFASFPVSKE 335
            CG+   AS+P  K+
Sbjct: 337 ICGLYKMASYPTKKK 351


>gi|217073894|gb|ACJ85307.1| unknown [Medicago truncatula]
 gi|388507498|gb|AFK41815.1| unknown [Medicago truncatula]
          Length = 362

 Score =  288 bits (736), Expect = 4e-75,   Method: Compositional matrix adjust.
 Identities = 163/356 (45%), Positives = 220/356 (61%), Gaps = 26/356 (7%)

Query: 3   KYFLIV-----VLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEI 57
           K  LIV     VL++S S        + DE S+ + +E+W++ +  + +   E  KRF +
Sbjct: 5   KLLLIVLSIALVLVVSESFDFHDKDVSSDE-SLWDLYERWRSHHTVS-RNLNEKQKRFNV 62

Query: 58  FKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHS---SSLKANGT 114
           FK N++ V   N     ++ Y L+LNKFAD+T  EF  +  G K++ H     + + +GT
Sbjct: 63  FKSNVMHVHNTNKM---DKPYKLKLNKFADMTNHEFKTTYAGSKVNHHRMFRGTPRVSGT 119

Query: 115 PFLYKS-SQVPPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQ 166
            F+Y++ ++ P SV+W +KGAVT VK QGQC        V AVEGIN IK NRLV LSEQ
Sbjct: 120 -FMYENFTKAPASVDWRKKGAVTDVKDQGQCGSCWAFSTVVAVEGINQIKTNRLVPLSEQ 178

Query: 167 QLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQI 226
           +L+DC  N  N GC GG M+ AF+YI Q  G+T ++ Y Y   + G CD+ K       I
Sbjct: 179 ELIDC-DNQENQGCNGGLMEYAFEYIKQKGGVTTESYYPYTA-NDGSCDATKENVPTVSI 236

Query: 227 TNYEDVPPNDEESLLKAVANQPVSVAIDA--SALQFYSGGVFNGYCETFLNHGVTAVGYG 284
             +E VP NDE++LLKAVANQPVSVAIDA  S  QFYS GVF G C   LNHGV  VGYG
Sbjct: 237 DGHETVPANDEDALLKAVANQPVSVAIDAGGSDFQFYSEGVFTGDCGKELNHGVAIVGYG 296

Query: 285 TSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVSKESAQPS 340
           T+ +G  YW+++NSWG +WGE G  R++R++   +G CGIAM AS+PV   S  P+
Sbjct: 297 TTVDGTNYWIVRNSWGAEWGEQGCIRMKRNVSNKEGLCGIAMEASYPVKNSSKNPA 352


>gi|356517308|ref|XP_003527330.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like [Glycine
           max]
          Length = 342

 Score =  287 bits (735), Expect = 4e-75,   Method: Compositional matrix adjust.
 Identities = 153/342 (44%), Positives = 221/342 (64%), Gaps = 19/342 (5%)

Query: 4   YFLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLV 63
           ++LI+ L+++    S    R   E   +E+ E+W AQYGR YK++AE  KRF++FK+N+ 
Sbjct: 8   HYLILFLVLA-VWTSHVMSRRLSEACTSERHEKWMAQYGRVYKDAAEKEKRFQVFKNNVH 66

Query: 64  AVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKS-SQ 122
            +E FN  A G++ + L +N+FADL  +EF A     +    S    +  T F Y+S ++
Sbjct: 67  FIESFN--AAGDKPFNLSINQFADLNDEEFKALLINVQ-KKASWVETSTETSFRYESVTK 123

Query: 123 VPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATND 175
           +P +++W ++GAVTP+K QG+C       AVAA EGI+ I   +LV LSEQ+LVDC   +
Sbjct: 124 IPATIDWRKRGAVTPIKDQGRCGSCWAFSAVAATEGIHQITTGKLVPLSEQELVDCVKGE 183

Query: 176 NNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHA-AQITNYEDVPP 234
           +  GC GG++DDAF++I +  GI ++  Y Y+G++   C  +K E H  A+I  YE VP 
Sbjct: 184 SE-GCIGGYVDDAFEFIAKKGGIASETHYPYKGVNK-TC-KVKKETHGVAEIKGYEKVPS 240

Query: 235 NDEESLLKAVANQPVSVAIDA--SALQFYSGGVFNGY-CETFLNHGVTAVGYGTSEEGIK 291
           N+E++LLKAVANQPVSV IDA   A ++YS G+FN   C T  NH V  VGYG + +G K
Sbjct: 241 NNEKALLKAVANQPVSVYIDAGTHAFKYYSSGIFNARNCGTDPNHAVAVVGYGKALDGSK 300

Query: 292 YWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVS 333
           YWL+KNSWG +WGE GY R++RDI   +G CGIA +  +P +
Sbjct: 301 YWLVKNSWGTEWGERGYIRIKRDIRAKEGLCGIAKYPYYPTA 342


>gi|356515056|ref|XP_003526217.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like [Glycine
           max]
          Length = 342

 Score =  287 bits (735), Expect = 5e-75,   Method: Compositional matrix adjust.
 Identities = 154/342 (45%), Positives = 221/342 (64%), Gaps = 19/342 (5%)

Query: 4   YFLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLV 63
           ++LI+ L++S    S    R   E   +E+ E+W AQYGR YK++AE  KRF++FK+N+ 
Sbjct: 8   HYLILFLVLS-VWTSHVMSRRLSEACTSERHEKWMAQYGRVYKDAAEKEKRFQVFKNNVH 66

Query: 64  AVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKS-SQ 122
            +E FN  A G++ + L +N+FADL  +EF A     +    S    +  T F Y+S ++
Sbjct: 67  FIESFN--AAGDKPFNLSINQFADLNDEEFKALLINVQ-KKASWVETSTQTSFRYESVTK 123

Query: 123 VPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATND 175
           +P +++W ++GAVTP+K QG+C       AVAA EGI+ I   +LV LSEQ+LVDC   +
Sbjct: 124 IPATIDWRKRGAVTPIKDQGRCGSCWAFSAVAATEGIHQITTGKLVPLSEQELVDCVKGE 183

Query: 176 NNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHA-AQITNYEDVPP 234
           +  GC GG++DDAF++I +  GI ++  Y Y+G++   C  +K E H  A+I  YE VP 
Sbjct: 184 SE-GCIGGYVDDAFEFIAKKGGIASETHYPYKGVNK-TC-KVKKETHGVAEIKGYEKVPS 240

Query: 235 NDEESLLKAVANQPVSVAIDA--SALQFYSGGVFNGY-CETFLNHGVTAVGYGTSEEGIK 291
           N+E++LLKAVANQPVSV IDA   A ++YS G+FN   C T  NH V  VGYG + +G K
Sbjct: 241 NNEKALLKAVANQPVSVYIDAGTHAFKYYSSGIFNVRNCGTDPNHAVAVVGYGKALDGSK 300

Query: 292 YWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVS 333
           YWL+KNSWG +WGE GY R++RDI   +G CGIA +  +P +
Sbjct: 301 YWLVKNSWGTEWGERGYIRIKRDIRAKEGLCGIAKYPYYPTA 342


>gi|374713649|gb|AEZ65082.1| cysteine protease [Carica papaya]
          Length = 471

 Score =  287 bits (735), Expect = 5e-75,   Method: Compositional matrix adjust.
 Identities = 161/329 (48%), Positives = 201/329 (61%), Gaps = 20/329 (6%)

Query: 19  QATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSY 78
           Q+T RT  E  + + +E W  ++G+ Y    E  +RFEIFKDNL  V+  N  ++  R+Y
Sbjct: 39  QSTERT--EAHMMKMYEHWLVKHGKNYNAIGEKERRFEIFKDNLRFVDEQN--SVPGRTY 94

Query: 79  TLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKS---SQVPPSVNWIEKGAV 135
            L L KFADLT +E+ A   G KM             +L+K+     +P  V+W EKGAV
Sbjct: 95  KLGLTKFADLTNEEYRAMYLGAKMEKKEKLRTERSQRYLHKAGNDDDLPSHVDWREKGAV 154

Query: 136 TPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDA 188
           T VK QGQC        V +VEGIN I    L+SLSEQ+LVDC     N GC GG MD A
Sbjct: 155 TEVKDQGQCGSCWAFSTVGSVEGINQIVTGDLISLSEQELVDC-DKAYNQGCNGGLMDYA 213

Query: 189 FKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQP 248
           F++II+N GI ++A Y Y   S  +CDS +   H   I  YEDVP NDEESL KAVANQP
Sbjct: 214 FEFIIKNGGIDSEADYPYRA-SDNMCDSNRKNAHVVTIDGYEDVPENDEESLKKAVANQP 272

Query: 249 VSVAIDASA--LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGED 306
           VSVAI+A     Q Y  GVF G C T L+HGV AVGYGT E GI YW+++NSWG  WGE 
Sbjct: 273 VSVAIEAGGREFQLYQSGVFTGRCGTNLDHGVVAVGYGT-ENGIDYWIVRNSWGPKWGES 331

Query: 307 GYFRLQRDI-DQPQGQCGIAMFASFPVSK 334
           GY R++R++     G+CGIAM AS+P  K
Sbjct: 332 GYIRMERNVASTDTGKCGIAMEASYPTKK 360


>gi|1256830|gb|AAB68374.1| cysteine endopeptidase 1 [Phaseolus vulgaris]
 gi|2959418|emb|CAA12118.1| cysteine protease [Phaseolus vulgaris]
          Length = 364

 Score =  287 bits (734), Expect = 5e-75,   Method: Compositional matrix adjust.
 Identities = 160/355 (45%), Positives = 215/355 (60%), Gaps = 21/355 (5%)

Query: 7   IVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVE 66
           +++L  + S A+  +   + E  + + +E+W  ++ + Y    E  KRF++FKDNL  ++
Sbjct: 9   LLLLSFTFSHATAMSIINYSENEVMDMYEEWLVKHRKVYNGLDEKEKRFQVFKDNLGFIQ 68

Query: 67  RFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLY---KSSQV 123
             N     N +YTL LNKFAD+T +E+ A   G +       +K   T   Y      Q+
Sbjct: 69  DHNAQ---NNTYTLGLNKFADITNKEYRAMYLGTRTDAKRRVMKTQNTGHRYAYNSGDQL 125

Query: 124 PPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDN 176
           P  V+W  KGAV P+K QG C        VAAVEGIN I     VSLSEQ+LVDC   + 
Sbjct: 126 PVHVDWRLKGAVGPIKDQGNCGSCWAFSTVAAVEGINNIVTGEFVSLSEQELVDC-DREY 184

Query: 177 NNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPND 236
           + GC GG MD AF++IIQN GI  +  Y Y+G+  G CD  K +    QI  YEDVP N+
Sbjct: 185 DEGCNGGLMDYAFQFIIQNGGIDTEEDYPYQGID-GTCDETKKKTKVVQIDGYEDVPSNN 243

Query: 237 EESLLKAVANQPVSVAIDAS--ALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWL 294
           E +L KAV++QPVSVAI+AS  ALQ Y  GVF G C T L+HGV  VGYGT E G+ YWL
Sbjct: 244 ENALKKAVSHQPVSVAIEASGRALQLYQSGVFTGKCGTALDHGVVVVGYGT-ENGVDYWL 302

Query: 295 IKNSWGQDWGEDGYFRLQRDI-DQPQGQCGIAMFASFPVS--KESAQPSSADKSS 346
           ++NSWG  WGEDGYF+++R++    +G+CGIAM  S+PV     SA PSS  +S+
Sbjct: 303 VRNSWGTGWGEDGYFKMERNVRSTSEGKCGIAMDCSYPVKYGLNSAVPSSVYEST 357


>gi|255563134|ref|XP_002522571.1| cysteine protease, putative [Ricinus communis]
 gi|223538262|gb|EEF39871.1| cysteine protease, putative [Ricinus communis]
          Length = 343

 Score =  287 bits (734), Expect = 5e-75,   Method: Compositional matrix adjust.
 Identities = 164/344 (47%), Positives = 217/344 (63%), Gaps = 28/344 (8%)

Query: 5   FLIVVLIISGSCASQATYR-TFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLV 63
            +I +L+I G+  SQA  R   +  +IAEK EQW A++GRTY ++AE  +RF+IFK+NL 
Sbjct: 10  LVITLLMILGTWVSQAMPRPLLNAEAIAEKHEQWMARHGRTYHDNAEKERRFQIFKNNLD 69

Query: 64  AVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFL----YK 119
            +E FN A   N++Y L LNKF+DL+ +EF+ +  G++M     +      P      Y 
Sbjct: 70  YIENFNKAF--NKTYKLGLNKFSDLSEEEFVTTYNGYEMPTTLPTANTTVKPTFFSNYYN 127

Query: 120 SSQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCA 172
             +VP S++W E G VT VK QG+C       AVAAVEGI         SLS QQL+DC 
Sbjct: 128 QDEVPESIDWRENGVVTSVKNQGECGCCWAFSAVAAVEGIAG----NGASLSAQQLLDCV 183

Query: 173 TNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDV 232
              +N+GC GG M  AF+YI+QN+GI +D  Y YE  +  +C S    + AA+IT YE V
Sbjct: 184 --GDNSGCGGGTMIKAFEYIVQNQGIVSDTDYPYE-QTQEMCRS--GSNVAARITGYESV 238

Query: 233 PPNDEESLLKAVANQPVSVAIDASA---LQFYSGGVFNGY-CETFLNHGVTAVGYGTSEE 288
               EE+L +AVA QP+SVAIDAS+    + Y  GVF+   C T L H VT VGYGT+E+
Sbjct: 239 I-QSEEALKRAVAKQPISVAIDASSGPNFKSYISGVFSAEDCGTHLTHAVTLVGYGTTED 297

Query: 289 GIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
           G KYWL+KNSWG++WGE GY RLQRD+   +G CGIAM AS+P 
Sbjct: 298 GTKYWLVKNSWGEEWGESGYMRLQRDVGAMEGPCGIAMQASYPT 341


>gi|2511689|emb|CAB17074.1| cysteine proteinase precursor [Phaseolus vulgaris]
          Length = 364

 Score =  287 bits (734), Expect = 5e-75,   Method: Compositional matrix adjust.
 Identities = 160/355 (45%), Positives = 215/355 (60%), Gaps = 21/355 (5%)

Query: 7   IVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVE 66
           +++L  + S A+  +   + E  + + +E+W  ++ + Y    E  KRF++FKDNL  ++
Sbjct: 9   LLLLSFTFSHATAMSIINYSENEVMDMYEEWLVKHRKVYNGLDEKEKRFQVFKDNLGFIQ 68

Query: 67  RFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLY---KSSQV 123
             N     N +YTL LNKFAD+T +E+ A   G +       +K   T   Y      Q+
Sbjct: 69  DHNAQ---NNTYTLGLNKFADITNEEYRAMYLGTRTDAKRRVMKTQNTGHRYAYNSGDQL 125

Query: 124 PPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDN 176
           P  V+W  KGAV P+K QG C        VAAVEGIN I     VSLSEQ+LVDC   + 
Sbjct: 126 PVHVDWRLKGAVGPIKDQGNCGSCWAFSTVAAVEGINNIVTGEFVSLSEQELVDC-DREY 184

Query: 177 NNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPND 236
           + GC GG MD AF++IIQN GI  +  Y Y+G+  G CD  K +    QI  YEDVP N+
Sbjct: 185 DEGCNGGLMDYAFQFIIQNGGIDTEEDYPYQGID-GTCDQTKKKTKVVQIDGYEDVPSNN 243

Query: 237 EESLLKAVANQPVSVAIDAS--ALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWL 294
           E +L KAV++QPVSVAI+AS  ALQ Y  GVF G C T L+HGV  VGYGT E G+ YWL
Sbjct: 244 ENALKKAVSHQPVSVAIEASGRALQLYQSGVFTGKCGTALDHGVVVVGYGT-ENGVDYWL 302

Query: 295 IKNSWGQDWGEDGYFRLQRDI-DQPQGQCGIAMFASFPVS--KESAQPSSADKSS 346
           ++NSWG  WGEDGYF+++R++    +G+CGIAM  S+PV     SA PSS  +S+
Sbjct: 303 VRNSWGTGWGEDGYFKMERNVRSTSEGKCGIAMDCSYPVKYGLNSAVPSSVYEST 357


>gi|262360187|gb|ACY38051.2| cysteine proteinase C1A [Dactylis glomerata]
          Length = 365

 Score =  287 bits (734), Expect = 6e-75,   Method: Compositional matrix adjust.
 Identities = 154/321 (47%), Positives = 202/321 (62%), Gaps = 20/321 (6%)

Query: 27  EGSIAEKFEQWKAQY--GRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNK 84
           E S+   +E W++ +   R    +   ++RF +FK+N+  +   N     +R + L LNK
Sbjct: 33  EESLRGLYETWRSHHTVSRRGLGAEAEARRFNVFKENVRYIHEANKK---DRPFRLALNK 89

Query: 85  FADLTPQEFIASQTGFKMSDHSS---SLKANGTPFLYKSSQ-VPPSVNWIEKGAVTPVKY 140
           FAD+T  EF  +  G ++  H S     +  G  F+Y  ++ +P +V+W +KGAVTP+K 
Sbjct: 90  FADMTTDEFRRTYAGSRVRHHRSLSGGRRQGGGSFMYADAENLPAAVDWRQKGAVTPIKD 149

Query: 141 QGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYII 193
           QGQC        + AVEGIN I+  RLVSLSEQ+L+DC   +N+ GC GG MD AF++I 
Sbjct: 150 QGQCGSCWAFSTIVAVEGINKIRTGRLVSLSEQELMDCNIGEND-GCNGGLMDVAFQFIQ 208

Query: 194 QNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAI 253
           QN GIT +A Y Y+G     CD  K   H   I  YEDVP NDE +L KAVANQPVSVAI
Sbjct: 209 QNGGITTEASYPYQGEQNS-CDQSKENSHDVSIDGYEDVPANDESALQKAVANQPVSVAI 267

Query: 254 DASA--LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRL 311
           DAS    QFYS GVF     T L+HGV AVGYGT+ +G KYW++KNSWG+DWGE GY R+
Sbjct: 268 DASGNDFQFYSEGVFTTDGGTDLDHGVAAVGYGTTRDGTKYWIVKNSWGEDWGEKGYIRM 327

Query: 312 QRDIDQPQGQCGIAMFASFPV 332
           QR + Q +G CGIAM AS+P 
Sbjct: 328 QRGVKQAEGLCGIAMEASYPT 348


>gi|148927394|gb|ABR19828.1| cysteine proteinase [Elaeis guineensis]
          Length = 469

 Score =  286 bits (733), Expect = 7e-75,   Method: Compositional matrix adjust.
 Identities = 158/324 (48%), Positives = 207/324 (63%), Gaps = 20/324 (6%)

Query: 23  RTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAA-IGNRSYTLR 81
           R+ DE  +   ++ WKAQ+ R+Y    E+ +R EIF+DNL  +++ N AA  G  S+ L 
Sbjct: 38  RSDDE--VHRLYQAWKAQHARSYNALDEDEQRLEIFRDNLRFIDQHNAAANAGKYSFRLG 95

Query: 82  LNKFADLTPQEFIASQTGFKMS----DHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTP 137
           L +FADLT +E+ ++  G + +      +S++ +N   F   S  +P S++W +KGAV  
Sbjct: 96  LTRFADLTNEEYRSTYLGVRTAGSRRRRNSTVGSNRYRFR-SSDDLPDSIDWRDKGAVVD 154

Query: 138 VKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFK 190
           VK QG C        +AAVEGIN I    L+SLSEQ+LVDC T   N GC GG MD AF+
Sbjct: 155 VKDQGSCGSCWAFSTIAAVEGINHIVTGDLISLSEQELVDCDTY-YNQGCNGGLMDYAFE 213

Query: 191 YIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVS 250
           +II N GI  D  Y Y G   G CD  +   H   I +YEDVP NDE+SL KAVANQPVS
Sbjct: 214 FIISNGGIDTDEDYPYTGRD-GSCDQYRKNAHVVTIDSYEDVPINDEKSLQKAVANQPVS 272

Query: 251 VAIDAS--ALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGY 308
           VAI+A   A Q Y  G+F GYC T L+HGVTA+GYG SE G  YW++KNSWG DWGE GY
Sbjct: 273 VAIEAGGRAFQLYESGIFTGYCGTELDHGVTAIGYG-SENGKYYWIVKNSWGSDWGESGY 331

Query: 309 FRLQRDIDQPQGQCGIAMFASFPV 332
            R++R+I+   G+CGIAM AS+P+
Sbjct: 332 IRMERNINSATGKCGIAMEASYPI 355


>gi|37780039|gb|AAP32192.1| cysteine protease 14 [Trifolium repens]
          Length = 351

 Score =  286 bits (733), Expect = 7e-75,   Method: Compositional matrix adjust.
 Identities = 152/315 (48%), Positives = 199/315 (63%), Gaps = 15/315 (4%)

Query: 30  IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLT 89
           + E FE W +++G+ Y+   E   RFE+FKDNL  ++  N       +Y L LN+FADL+
Sbjct: 43  LIELFESWMSRHGKIYETIEEKLLRFEVFKDNLKHIDDRNKIV---SNYWLGLNEFADLS 99

Query: 90  PQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQCA---- 145
            QEF     G K+        +N   F Y+   +P SV+W +KGAVTPVK QGQC     
Sbjct: 100 HQEFKNKYLGLKVDLSQRRESSNEEEFTYRDVDLPKSVDWRKKGAVTPVKNQGQCGSCWA 159

Query: 146 ---VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDA 202
              VAAVEGIN I    L SLSEQ+L+DC T   NNGC GG MD AF +I QN G+  + 
Sbjct: 160 FSTVAAVEGINQIVTGNLTSLSEQELIDCDTT-YNNGCNGGLMDYAFSFIGQNGGLHKEE 218

Query: 203 VYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA--LQF 260
            Y Y  M    C+  K E     I  Y DVP N+E+SLLKA+ANQP+SVAI+AS+   QF
Sbjct: 219 DYPYI-MEESTCEMKKEETQVVTINGYHDVPQNNEQSLLKALANQPLSVAIEASSRDFQF 277

Query: 261 YSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQG 320
           YSGGVF+G+C + L+HGV+AVGYGTS + + Y ++KNSWG  WGE G+ R++RDI +P+G
Sbjct: 278 YSGGVFDGHCGSDLDHGVSAVGYGTS-KNLDYIIVKNSWGAKWGEKGFIRMKRDIGKPEG 336

Query: 321 QCGIAMFASFPVSKE 335
            CG+   AS+P  K+
Sbjct: 337 ICGLYKMASYPTKKK 351


>gi|356508490|ref|XP_003522989.1| PREDICTED: xylem cysteine proteinase 2-like [Glycine max]
          Length = 349

 Score =  286 bits (733), Expect = 8e-75,   Method: Compositional matrix adjust.
 Identities = 153/315 (48%), Positives = 202/315 (64%), Gaps = 17/315 (5%)

Query: 30  IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLT 89
           + E FE W +++G+ Y+   E   RFEIFKDNL  ++  N       +Y L LN+FADL+
Sbjct: 43  LIELFESWMSKHGKIYQSIEEKLLRFEIFKDNLKHIDERNKVV---SNYWLGLNEFADLS 99

Query: 90  PQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQCA---- 145
            QEF     G K+ D+S   + +   F YK  ++P SV+W +KGAV PVK QG C     
Sbjct: 100 HQEFKNKYLGLKV-DYSRR-RESPEEFTYKDVELPKSVDWRKKGAVAPVKNQGSCGSCWA 157

Query: 146 ---VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDA 202
              VAAVEGIN I    L SLSEQ+L+DC     NNGC GG MD AF +I++N G+  + 
Sbjct: 158 FSTVAAVEGINQIVTGNLTSLSEQELIDC-DRTYNNGCNGGLMDYAFSFIVENGGLHKEE 216

Query: 203 VYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA--LQF 260
            Y Y  M  G C+  K E     I+ Y DVP N+E+SLLKA+ANQP+SVAI+AS    QF
Sbjct: 217 DYPYI-MEEGTCEMTKEETEVVTISGYHDVPQNNEQSLLKALANQPLSVAIEASGRDFQF 275

Query: 261 YSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQG 320
           YSGGVF+G+C + L+HGV AVGYGT++ G+ Y ++KNSWG  WGE GY R++R+I +P+G
Sbjct: 276 YSGGVFDGHCGSDLDHGVAAVGYGTAK-GVDYIIVKNSWGSKWGEKGYIRMRRNIGKPEG 334

Query: 321 QCGIAMFASFPVSKE 335
            CGI   AS+P  K+
Sbjct: 335 ICGIYKMASYPTKKK 349


>gi|357446975|ref|XP_003593763.1| Cysteine proteinase [Medicago truncatula]
 gi|355482811|gb|AES64014.1| Cysteine proteinase [Medicago truncatula]
          Length = 350

 Score =  286 bits (733), Expect = 8e-75,   Method: Compositional matrix adjust.
 Identities = 161/339 (47%), Positives = 207/339 (61%), Gaps = 16/339 (4%)

Query: 3   KYFLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNL 62
           K+ +   +I+  +CA     RT  E S+ E  +QW  +Y RTY  S+E  KR +IFK+NL
Sbjct: 2   KHLIGFCIILLWACAYPTMSRTLTESSVVEAHQQWMMKYERTYTNSSEMEKRKKIFKENL 61

Query: 63  VAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDH--SSSLKANGTPFLYKS 120
             +E FNN  +GN+SY L LN+++DLT +EFIAS TGFK+SD    S +++   PF   +
Sbjct: 62  EYIENFNN--VGNKSYKLGLNRYSDLTSEEFIASHTGFKVSDQLSDSKMRSVAIPFNL-N 118

Query: 121 SQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCAT 173
             VP + +W EKG VT VK Q QC       AVAAVEGI  IK   L+SLSEQQLVDC  
Sbjct: 119 DDVPTNFDWREKGVVTDVKNQRQCGCCWAFTAVAAVEGIVKIKNGNLISLSEQQLVDC-- 176

Query: 174 NDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVP 233
           +  ++GC GG    AF  II+++GI  +  Y Y+      C  +     AAQI  Y  VP
Sbjct: 177 DRQSSGCGGGDFVLAFDSIIKSRGIVKEDDYPYKANDVQTCQ-LGQIPGAAQINGYFKVP 235

Query: 234 PNDEESLLKAVANQPVSVAIDASA-LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKY 292
            NDE+ LL+AV  QPVSVAI  S     Y GGV+ G C   LNH VT +GYG SE G KY
Sbjct: 236 ANDEQQLLRAVLQQPVSVAISTSYDFHHYMGGVYEGSCGPKLNHAVTIIGYGVSEAGKKY 295

Query: 293 WLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFP 331
           WLIKNSWG+ WGE GY ++ R+     GQC IA+ A++P
Sbjct: 296 WLIKNSWGETWGEKGYMKVLRESSATGGQCSIAVHAAYP 334


>gi|297816030|ref|XP_002875898.1| hypothetical protein ARALYDRAFT_485194 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297321736|gb|EFH52157.1| hypothetical protein ARALYDRAFT_485194 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 363

 Score =  286 bits (733), Expect = 8e-75,   Method: Compositional matrix adjust.
 Identities = 162/361 (44%), Positives = 221/361 (61%), Gaps = 30/361 (8%)

Query: 3   KYFLIVVLIISGSCASQATYRTFD--------EGSIAEKFEQWKAQYGRTYKESAENSKR 54
           K F IV+  +   C  QA+ + FD        E ++ + +E+W+  +  T + S E  KR
Sbjct: 2   KLFFIVLSFL---CLLQAS-KGFDFDEKELETEENVWKLYERWRDHHSVT-RASHEALKR 56

Query: 55  FEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHS--SSLKAN 112
           F +F+ N++ V R N     N+ Y L++N+FAD+T  EF +S  G  +  H      K  
Sbjct: 57  FNVFRHNVLHVHRTNKK---NKPYKLKVNRFADITHHEFRSSYAGSNVKHHRMLRGPKRG 113

Query: 113 GTPFLYKS-SQVPPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLS 164
              F+Y++ ++VP SV+W EKGAVT VK Q  C        VAAVEGIN I+ N+LVSLS
Sbjct: 114 SGGFMYENVTRVPSSVDWREKGAVTEVKNQQDCGSCWAFSTVAAVEGINKIRTNKLVSLS 173

Query: 165 EQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAA 224
           EQ+LVDC T +N  GC GG M+ AF++I  N GI  +  Y Y+      C +   +    
Sbjct: 174 EQELVDCDTEENQ-GCAGGLMEPAFEFIKNNGGIKTEETYPYDSNDVQFCRAKSIDGETV 232

Query: 225 QITNYEDVPPNDEESLLKAVANQPVSVAIDA--SALQFYSGGVFNGYCETFLNHGVTAVG 282
            I  +E VP NDEE+LLKAVA+QPVSVAIDA  S  Q YS GVF G C T LNHGV  VG
Sbjct: 233 TIDGHEHVPENDEEALLKAVAHQPVSVAIDAGSSDFQLYSEGVFIGECGTQLNHGVVIVG 292

Query: 283 YGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVSKESAQPSSA 342
           YG ++ G KYW+++NSWG +WGE GY R++R I + +G+CGIAM AS+P +K S+ PS+ 
Sbjct: 293 YGETKNGTKYWIVRNSWGPEWGEGGYVRIERGISENEGRCGIAMEASYP-TKVSSTPSTP 351

Query: 343 D 343
           +
Sbjct: 352 E 352


>gi|38346007|emb|CAD40110.2| OSJNBa0035O13.9 [Oryza sativa Japonica Group]
 gi|125589429|gb|EAZ29779.1| hypothetical protein OsJ_13837 [Oryza sativa Japonica Group]
          Length = 314

 Score =  286 bits (733), Expect = 8e-75,   Method: Compositional matrix adjust.
 Identities = 153/320 (47%), Positives = 200/320 (62%), Gaps = 22/320 (6%)

Query: 30  IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLT 89
           +A++ E+W A++GR Y + AE ++R E+F+DN+  +E  N AA     + L  N+FADLT
Sbjct: 1   MAQRHERWMAKHGRAYADDAEKARRLEVFRDNVAFIESVN-AAASQHKFWLEENQFADLT 59

Query: 90  PQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQV-----PPSVNWIEKGAVTPVKYQGQC 144
             EF A++TG +     SS + N  P  ++ + V     P SV+W  KGAV PVK QG C
Sbjct: 60  NAEFRATRTGLR----PSSSRGNRAPTSFRYANVSTGDLPASVDWRGKGAVNPVKDQGDC 115

Query: 145 -------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKG 197
                  AVAA+EG   +   +LVSLSEQQLV C     + GC GG MDDAF +II+N G
Sbjct: 116 GCCWAFSAVAAMEGAVKLATGKLVSLSEQQLVSCDVKGEDQGCEGGLMDDAFDFIIKNGG 175

Query: 198 ITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDAS- 256
           +  ++ Y Y   S   C +  A   AA I  YEDVP NDE +LLKAVANQPVSVAID   
Sbjct: 176 LAAESDYPYTA-SDDKCATAGAGAAAATIKGYEDVPANDEAALLKAVANQPVSVAIDGGD 234

Query: 257 -ALQFYSGGVFNGY--CETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQR 313
              QFY GGV +G   C T L+H +TAVGYG + +G KYWL+KNSWG  WGEDGY R++R
Sbjct: 235 RHFQFYKGGVLSGAAGCATELDHAITAVGYGVASDGTKYWLMKNSWGTSWGEDGYVRMER 294

Query: 314 DIDQPQGQCGIAMFASFPVS 333
            +   +G CG+AM AS+P +
Sbjct: 295 GVADKEGVCGLAMMASYPTA 314


>gi|357156854|ref|XP_003577598.1| PREDICTED: thiol protease SEN102-like [Brachypodium distachyon]
          Length = 368

 Score =  286 bits (733), Expect = 9e-75,   Method: Compositional matrix adjust.
 Identities = 153/328 (46%), Positives = 202/328 (61%), Gaps = 27/328 (8%)

Query: 27  EGSIAEKFEQWKAQYG----------RTYKESAENSKRFEIFKDNLVAVERFNNAAIGNR 76
           E S+   +E+W+++Y           R      + ++RF +FK+N+  +   N     +R
Sbjct: 31  EESLRGLYERWRSRYTVSPSTPGSGLRGKLADHDPARRFNVFKENVKYIHEANKK---DR 87

Query: 77  SYTLRLNKFADLTPQEFIASQTGFKMSDH---SSSLKANGTPFLYKSSQVPPSVNWIEKG 133
            + L LNKFAD+T  E   S  G ++  H   S   +A G      +  +PP+V+W EKG
Sbjct: 88  PFRLALNKFADMTTDELRHSYAGSRVRHHRALSGGRRAQGNFTYSDAENLPPAVDWREKG 147

Query: 134 AVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMD 186
           AVT +K QGQC        +AAVE IN I+  +LVSLSEQ+L+DC  N N+ GC GG MD
Sbjct: 148 AVTGIKDQGQCGSCWAFSTIAAVESINKIRTGKLVSLSEQELMDC-DNVNDQGCDGGLMD 206

Query: 187 DAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVAN 246
            AF++I +N G+T++A Y Y+G     CD  K   H   I  YEDVP NDE +L KAVA 
Sbjct: 207 YAFQFIQKNGGVTSEANYPYQGQQN-TCDQAKENTHDVAIDGYEDVPANDESALQKAVAY 265

Query: 247 QPVSVAIDASA--LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWG 304
           QPVSVAI+AS    QFYS GVF G C T L+HGV AVGYGT+ +G KYW++KNSWG DWG
Sbjct: 266 QPVSVAIEASGQDFQFYSEGVFTGQCTTDLDHGVAAVGYGTARDGTKYWIVKNSWGLDWG 325

Query: 305 EDGYFRLQRDIDQPQGQCGIAMFASFPV 332
           E GY R+QR + Q +G CGIAM AS+P+
Sbjct: 326 EKGYIRMQRGVSQAEGLCGIAMQASYPI 353


>gi|356515080|ref|XP_003526229.1| PREDICTED: vignain-like [Glycine max]
          Length = 284

 Score =  286 bits (733), Expect = 9e-75,   Method: Compositional matrix adjust.
 Identities = 149/285 (52%), Positives = 192/285 (67%), Gaps = 15/285 (5%)

Query: 59  KDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLY 118
           K+N+  +E FNNAA  N+ Y L +N+FADLT +EFI  +  F  + H        T F Y
Sbjct: 5   KENVNYIEAFNNAA--NKPYKLGINQFADLTSEEFIVPRNRF--NGHMRFSNTRTTTFKY 60

Query: 119 KSSQV-PPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVD 170
           ++  V P S++W +KGAVTP+K QG C       A+AA EGI+ I   +LVSLSEQ++VD
Sbjct: 61  ENVTVLPDSIDWRQKGAVTPIKNQGSCGCCWAFSAIAATEGIHKISTGKLVSLSEQEVVD 120

Query: 171 CATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYE 230
           C T   ++GC GG+MD AFK+IIQN GI  +A Y Y+G+  G C+  +   HA  IT YE
Sbjct: 121 CDTKGTDHGCEGGYMDGAFKFIIQNHGINTEASYPYKGVD-GKCNIKEEAVHATTITGYE 179

Query: 231 DVPPNDEESLLKAVANQPVSVAIDASA--LQFYSGGVFNGYCETFLNHGVTAVGYGTSEE 288
           DVP N+E++L KAVANQPVSVAIDA     QFY  G+F G C T L+HGVTAVGYG + E
Sbjct: 180 DVPINNEKALQKAVANQPVSVAIDARGADFQFYKSGIFTGSCGTELDHGVTAVGYGENNE 239

Query: 289 GIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVS 333
           G KYWL+KNSWG +WGE+GY  +QR +   +G CGIAM AS+P +
Sbjct: 240 GTKYWLVKNSWGTEWGEEGYTMMQRGVKAVEGICGIAMLASYPTA 284


>gi|357458909|ref|XP_003599735.1| Cysteine proteinase [Medicago truncatula]
 gi|357474677|ref|XP_003607623.1| Cysteine proteinase [Medicago truncatula]
 gi|355488783|gb|AES69986.1| Cysteine proteinase [Medicago truncatula]
 gi|355508678|gb|AES89820.1| Cysteine proteinase [Medicago truncatula]
          Length = 342

 Score =  286 bits (732), Expect = 1e-74,   Method: Compositional matrix adjust.
 Identities = 148/317 (46%), Positives = 203/317 (64%), Gaps = 14/317 (4%)

Query: 27  EGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFA 86
           E  ++ K E+W  Q+G++YK++AE  KRF+IFK+N+  +E FN  A+GN+ + L +N FA
Sbjct: 30  EPYLSNKHEKWMTQFGKSYKDAAEKEKRFQIFKNNVEFIELFN--AVGNKPFNLSINHFA 87

Query: 87  DLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKS-SQVPPSVNWIEKGAVTPVKYQGQCA 145
           DLT +EF AS  G K       +    T F Y + + VP S++W ++GAVTP+K QG C 
Sbjct: 88  DLTNEEFKASLNGNKKLHDKFDILNETTSFRYHNVTSVPASMDWRKRGAVTPIKNQGSCG 147

Query: 146 -------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGI 198
                  VA++EGI+ I    LVSLSEQ+L+DC    N++GC GG+++DAFK+I +  G+
Sbjct: 148 SCWAFSTVASIEGIHQITTGELVSLSEQELIDCV-RGNSSGCSGGYLEDAFKFIAKKGGM 206

Query: 199 TNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDAS-- 256
            ++  Y Y+      C   K   H A+I  YE VP N E  LLKAVANQPVSV +DA   
Sbjct: 207 ASETNYPYKETDEK-CKFKKESKHVAEIKGYEKVPSNSENDLLKAVANQPVSVYVDAGDY 265

Query: 257 ALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDID 316
             QFYSGG+F G C T  +H VT VGYG S +  +YWL+KNSWG  WGE GY +L+R++D
Sbjct: 266 VFQFYSGGIFTGKCGTDTDHVVTIVGYGVSLDYTEYWLVKNSWGTGWGEKGYMKLKRNVD 325

Query: 317 QPQGQCGIAMFASFPVS 333
             +G CGIA   S+PV+
Sbjct: 326 SKKGLCGIATNPSYPVA 342


>gi|356559055|ref|XP_003547817.1| PREDICTED: cysteine proteinase RD21a [Glycine max]
          Length = 366

 Score =  286 bits (732), Expect = 1e-74,   Method: Compositional matrix adjust.
 Identities = 155/339 (45%), Positives = 200/339 (58%), Gaps = 18/339 (5%)

Query: 7   IVVLIISGSCA-SQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAV 65
           ++ L  + SCA   +T   + +  +   +E+W  ++ + Y    E  KRF++FKDNL  +
Sbjct: 12  LLFLSFTLSCAIDTSTITNYTDNEVMTMYEEWLVKHQKVYNGLREKDKRFQVFKDNLGFI 71

Query: 66  ERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSS---Q 122
           +  NN    N +Y L LN+FAD+T +E+     G K       +K   T   Y  S   +
Sbjct: 72  QEHNNNQ--NNTYKLGLNQFADMTNEEYRVMYFGTKSDAKRRLMKTKSTGHRYAYSAGDR 129

Query: 123 VPPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATND 175
           +P  V+W  KGAV P+K QG C        VA VE IN I   + VSLSEQ+LVDC    
Sbjct: 130 LPVHVDWRVKGAVAPIKDQGSCGSCWAFSTVATVEAINKIVTGKFVSLSEQELVDC-DRA 188

Query: 176 NNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPN 235
            N GC GG MD AF++IIQN GI  D  Y Y G   GICD  K       I  +EDVPP 
Sbjct: 189 YNEGCNGGLMDYAFEFIIQNGGIDTDKDYPYRGFD-GICDPTKKNAKVVNIDGFEDVPPY 247

Query: 236 DEESLLKAVANQPVSVAIDASA--LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYW 293
           DE +L KAVA+QPVS+AI+AS   LQ Y  GVF G C T L+HGV  VGYG SE G+ YW
Sbjct: 248 DENALKKAVAHQPVSIAIEASGRDLQLYQSGVFTGKCGTSLDHGVVVVGYG-SENGVDYW 306

Query: 294 LIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
           L++NSWG  WGEDGYF++QR++  P G+CGI M AS+PV
Sbjct: 307 LVRNSWGTGWGEDGYFKMQRNVRTPTGKCGITMEASYPV 345


>gi|297830592|ref|XP_002883178.1| hypothetical protein ARALYDRAFT_479457 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297329018|gb|EFH59437.1| hypothetical protein ARALYDRAFT_479457 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 452

 Score =  286 bits (731), Expect = 1e-74,   Method: Compositional matrix adjust.
 Identities = 158/339 (46%), Positives = 203/339 (59%), Gaps = 17/339 (5%)

Query: 5   FLIVVLIISGSCAS-QATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLV 63
            +  +L+IS S  S  A   T +E      +EQW  +  + Y    E   RFEIF DNL 
Sbjct: 13  LIFSMLLISLSLGSVTAADTTRNEAEARRMYEQWLVENRKNYNGLGEKETRFEIFTDNLK 72

Query: 64  AVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYK-SSQ 122
            +E  N  ++ N+++ + L +FADLT  EF A     KM    + +   G  +LYK    
Sbjct: 73  YIEEHN--SVPNQTFEVGLTRFADLTNDEFRAIYLRSKM--ERTRVPVKGERYLYKVGDT 128

Query: 123 VPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATND 175
           +P  ++W  KGAV PVK QG C       A+ AVEGIN IK   L+SLSEQ+LVDC T+ 
Sbjct: 129 LPDQIDWRAKGAVNPVKDQGNCGSCWAFSAIGAVEGINQIKTGELISLSEQELVDCDTS- 187

Query: 176 NNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPN 235
            N GC GG MD AFK+II+N GI  +  Y Y      IC+S K       I  YEDVP N
Sbjct: 188 YNGGCGGGLMDYAFKFIIENGGIDTEEDYPYTATDDNICNSDKKNSRVVTIDGYEDVPQN 247

Query: 236 DEESLLKAVANQPVSVAIDAS--ALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYW 293
           DE+SL KA+ANQP+SVAI+A   A Q Y  GVF G C T L+HGV AVGYG SE G  YW
Sbjct: 248 DEKSLKKALANQPISVAIEAGGRAFQLYKSGVFTGTCGTSLDHGVVAVGYG-SEGGQDYW 306

Query: 294 LIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
           +++NSWG +WGE GYF+L+R+I +  G+CG+AM AS+P 
Sbjct: 307 IVRNSWGSNWGESGYFKLERNIKESSGKCGVAMMASYPT 345


>gi|116787404|gb|ABK24495.1| unknown [Picea sitchensis]
 gi|224286306|gb|ACN40861.1| unknown [Picea sitchensis]
          Length = 452

 Score =  286 bits (731), Expect = 1e-74,   Method: Compositional matrix adjust.
 Identities = 158/320 (49%), Positives = 199/320 (62%), Gaps = 17/320 (5%)

Query: 26  DEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKF 85
           ++ +I E +E W A++ R Y    E  KRF +FKDN + +   N    GNRSY L LN+F
Sbjct: 34  EDDAIMELYELWLAEHKRAYNGLDEKQKRFSVFKDNFLYIHEHNQ---GNRSYKLGLNQF 90

Query: 86  ADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQ-VPPSVNWIEKGAVTPVKYQGQC 144
           ADL+ +EF A+  G K+       +     + Y   + +P S++W EKGAVT VK QG C
Sbjct: 91  ADLSHEEFKATYLGAKLDTKKRLSRPPSRRYQYSDGEDLPESIDWREKGAVTSVKDQGSC 150

Query: 145 A-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKG 197
                   VAAVEGIN I    L+SLSEQ+LVDC T+  N GC GG MD AF++II N G
Sbjct: 151 GSCWAFSTVAAVEGINQIVTGDLISLSEQELVDCDTS-YNQGCNGGLMDYAFEFIINNGG 209

Query: 198 ITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA 257
           + ++  Y Y     G CDS +   H   I +YEDVP NDE+SL KA ANQP+SVAI+AS 
Sbjct: 210 LDSEEDYPYTAYD-GSCDSYRKNAHVVTIDDYEDVPENDEKSLKKAAANQPISVAIEASG 268

Query: 258 --LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDI 315
              QFY  GVF   C T L+HGVT VGYG SE G  YW +KNSWG+ WGE+G+ RLQR+I
Sbjct: 269 REFQFYDSGVFTSTCGTQLDHGVTLVGYG-SESGTDYWTVKNSWGKSWGEEGFIRLQRNI 327

Query: 316 D-QPQGQCGIAMFASFPVSK 334
           +    G CGIAM AS+PV K
Sbjct: 328 EVASTGMCGIAMEASYPVKK 347


>gi|224102377|ref|XP_002312656.1| predicted protein [Populus trichocarpa]
 gi|222852476|gb|EEE90023.1| predicted protein [Populus trichocarpa]
          Length = 358

 Score =  286 bits (731), Expect = 1e-74,   Method: Compositional matrix adjust.
 Identities = 154/333 (46%), Positives = 209/333 (62%), Gaps = 23/333 (6%)

Query: 27  EGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFA 86
           E  + + +E+W++ +    +  AE  +RF +FK+NL  + + N+    +R Y L+LN FA
Sbjct: 33  EERLRDLYERWRSHH-TVSRSLAEKQERFNVFKENLKHIHKVNHK---DRPYKLKLNSFA 88

Query: 87  DLTPQEFIASQTGFKMSDHSSSLKAN--GTPFLYK-SSQVPPSVNWIEKGAVTPVKYQGQ 143
           D+T  EF+    G K+S H   L+    GT  +++ +S++P SV+W + GAVT +K QG+
Sbjct: 89  DMTNHEFLQHYGGSKVS-HYRVLRGQRQGTGSMHEDTSKLPSSVDWRKNGAVTGIKDQGK 147

Query: 144 CA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNK 196
           C        VAAVEGIN IK   L+SLSEQ+LVDC  + +N+GC GG M+DAF +I Q  
Sbjct: 148 CGSCWAFSTVAAVEGINKIKTGELISLSEQELVDC--DSDNHGCNGGLMEDAFNFIKQIG 205

Query: 197 GITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDAS 256
           G+T++  Y Y       CDS K       I  YE VP NDE +L+KAVANQPV++A+DA 
Sbjct: 206 GLTSENTYPYRAKEEP-CDSNKMNSPVVNIDGYEMVPENDENALMKAVANQPVAIAMDAG 264

Query: 257 A--LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRD 314
              LQFYS  +F G C T LNHGV  VGYGT+++G KYW++KNSWG DWGE GY R+QR 
Sbjct: 265 GKDLQFYSEAIFTGDCGTELNHGVALVGYGTTQDGTKYWIVKNSWGTDWGEKGYIRMQRG 324

Query: 315 IDQPQGQCGIAMFASFPV---SKESAQPSSADK 344
           ID  +G CGI M AS+PV   S     PS  D+
Sbjct: 325 IDAEEGLCGITMEASYPVKLRSDNKKAPSRKDE 357


>gi|224103643|ref|XP_002313136.1| predicted protein [Populus trichocarpa]
 gi|222849544|gb|EEE87091.1| predicted protein [Populus trichocarpa]
          Length = 477

 Score =  285 bits (730), Expect = 2e-74,   Method: Compositional matrix adjust.
 Identities = 166/349 (47%), Positives = 209/349 (59%), Gaps = 25/349 (7%)

Query: 4   YFLIVVLIISGSC------ASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEI 57
           YFL V L I  S         Q   RT  E      +E W  +YG+ Y    E  +RFEI
Sbjct: 15  YFLSVCLAIDMSIIDYNLKHGQVPERT--EAETLRLYEMWLVKYGKAYNALGEKERRFEI 72

Query: 58  FKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKA-NGTPF 116
           FKDNL  V++ N  ++GN SY L LNKFADL+ +E+ A+  G +M      L       +
Sbjct: 73  FKDNLKFVDQHN--SVGNPSYKLGLNKFADLSNEEYRAAYLGTRMDGKRRLLGGPKSARY 130

Query: 117 LYKSSQ-VPPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQL 168
           L+K    +P SV+W EKGAV PVK QGQC        V AVEGIN I    L SLSEQ+L
Sbjct: 131 LFKDGDDLPESVDWREKGAVAPVKDQGQCGSCWAFSTVGAVEGINQIVTGNLTSLSEQEL 190

Query: 169 VDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITN 228
           VDC     N GC GG MD AF++I++N GI  +  Y Y+ + + +CD  +       I  
Sbjct: 191 VDC-DKVYNQGCNGGLMDYAFEFIMKNGGIDTEEDYPYKAVDS-MCDPNRKNARVVTIDG 248

Query: 229 YEDVPPNDEESLLKAVANQPVSVAIDAS--ALQFYSGGVFNGYCETFLNHGVTAVGYGTS 286
           YEDVP NDE+SL KAVANQPVSVAI+A   A Q Y  GVF G C T L+HGV AVGYGT 
Sbjct: 249 YEDVPQNDEKSLRKAVANQPVSVAIEAGGRAFQLYQSGVFTGSCGTQLDHGVVAVGYGT- 307

Query: 287 EEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQ-GQCGIAMFASFPVSK 334
           E G+ YW+++NSWG  WGE+GY R++R++   + G+CGIAM AS+P  K
Sbjct: 308 ENGVDYWVVRNSWGPAWGENGYIRMERNVASTETGKCGIAMEASYPTKK 356


>gi|125533982|gb|EAY80530.1| hypothetical protein OsI_35710 [Oryza sativa Indica Group]
          Length = 378

 Score =  285 bits (730), Expect = 2e-74,   Method: Compositional matrix adjust.
 Identities = 157/342 (45%), Positives = 205/342 (59%), Gaps = 28/342 (8%)

Query: 27  EGSIAEKFEQWKAQY--------GRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSY 78
           E S+   +E+W+++Y        G    +  E  +RF +F +N   +   N    G R +
Sbjct: 35  EESLRALYERWRSRYTVSRPAASGGVGNDDGEARRRFNVFVENARYIHEANRR--GGRPF 92

Query: 79  TLRLNKFADLTPQEFIASQTGFKMSDHSS---SLKANGTPFLY---KSSQVPPSVNWIEK 132
            L LNKFAD+T  EF  +  G +   H S        G  F Y       +PP+V+W E+
Sbjct: 93  RLALNKFADMTTDEFRRTYAGSRARHHRSLRGGRGGEGGSFRYGGDDEDNLPPAVDWRER 152

Query: 133 GAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFM 185
           GAVT +K QGQC       AVAAVEG+N IK  RLV+LSEQ+LVDC T DN  GC GG M
Sbjct: 153 GAVTGIKDQGQCGSCWAFSAVAAVEGVNKIKTGRLVTLSEQELVDCDTGDNQ-GCDGGLM 211

Query: 186 DDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVA 245
           D AF++I +N GIT ++ Y Y     G C+  KA  H   I  YEDVP NDE +L KAVA
Sbjct: 212 DYAFQFIKRNGGITTESNYPYR-AEQGRCNKAKASSHDVTIDGYEDVPANDESALQKAVA 270

Query: 246 NQPVSVAIDASA--LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDW 303
           NQPV+VA++AS    QFYS GVF G C T L+HGV AVGYG + +G KYW++KNSWG+DW
Sbjct: 271 NQPVAVAVEASGQDFQFYSEGVFTGECGTDLDHGVAAVGYGITRDGTKYWIVKNSWGEDW 330

Query: 304 GEDGYFRLQRDI-DQPQGQCGIAMFASFPVSKESAQPSSADK 344
           GE GY R+QR +     G CGIAM AS+PV   +   +++++
Sbjct: 331 GERGYIRMQRGVSSDSNGLCGIAMEASYPVKSGARNAAASNR 372


>gi|125547258|gb|EAY93080.1| hypothetical protein OsI_14881 [Oryza sativa Indica Group]
          Length = 314

 Score =  285 bits (730), Expect = 2e-74,   Method: Compositional matrix adjust.
 Identities = 153/320 (47%), Positives = 199/320 (62%), Gaps = 22/320 (6%)

Query: 30  IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLT 89
           +A++ E+W A++GR Y + AE  +R E+F+DN+  +E  N AA     + L  N+FADLT
Sbjct: 1   MAQRHERWMAKHGRAYADDAEKVRRLEVFRDNVAFIESVN-AAASQHKFWLEENQFADLT 59

Query: 90  PQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQV-----PPSVNWIEKGAVTPVKYQGQC 144
             EF A++TG +     SS + N  P  ++ + V     P SV+W  KGAV PVK QG C
Sbjct: 60  NAEFRATRTGLR----PSSSRGNRAPTSFRYANVSTGDLPASVDWRGKGAVNPVKDQGDC 115

Query: 145 -------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKG 197
                  AVAA+EG   +   +LVSLSEQQLV C     + GC GG MDDAF +II+N G
Sbjct: 116 GCCWAFSAVAAMEGAVKLATGKLVSLSEQQLVSCDVKGEDQGCEGGLMDDAFDFIIKNGG 175

Query: 198 ITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDAS- 256
           +  ++ Y Y   S   C +  A   AA I  YEDVP NDE +LLKAVANQPVSVAID   
Sbjct: 176 LAAESDYPYTA-SDDKCATAGAGAAAATIKGYEDVPANDEAALLKAVANQPVSVAIDGGD 234

Query: 257 -ALQFYSGGVFNGY--CETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQR 313
              QFY GGV +G   C T L+H +TAVGYG + +G KYWL+KNSWG  WGEDGY R++R
Sbjct: 235 RHFQFYKGGVLSGAAGCATELDHAITAVGYGVASDGTKYWLMKNSWGTSWGEDGYVRMER 294

Query: 314 DIDQPQGQCGIAMFASFPVS 333
            +   +G CG+AM AS+P +
Sbjct: 295 GVADKEGVCGLAMMASYPTA 314


>gi|115477767|ref|NP_001062479.1| Os08g0556900 [Oryza sativa Japonica Group]
 gi|42407937|dbj|BAD09076.1| putative cysteine proteinase [Oryza sativa Japonica Group]
 gi|113624448|dbj|BAF24393.1| Os08g0556900 [Oryza sativa Japonica Group]
 gi|125562525|gb|EAZ07973.1| hypothetical protein OsI_30231 [Oryza sativa Indica Group]
 gi|215701458|dbj|BAG92882.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 385

 Score =  285 bits (729), Expect = 2e-74,   Method: Compositional matrix adjust.
 Identities = 154/327 (47%), Positives = 207/327 (63%), Gaps = 20/327 (6%)

Query: 27  EGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFA 86
           E ++ E +E+W+ Q+ R  ++  E ++RF +FKDN+  +  FN     +  Y LRLN+F 
Sbjct: 41  EEALWELYERWRGQH-RVARDLGEKARRFNVFKDNVRLIHEFNRR---DEPYKLRLNRFG 96

Query: 87  DLTPQEFIASQTGFKMSDHSSSLKANG---TPFLYKSSQ-VPPSVNWIEKGAVTPVKYQG 142
           D+T  EF  +    ++S H    +  G   + F+Y  ++ +P +V+W EKGAV  VK QG
Sbjct: 97  DMTADEFRRAYASSRVS-HHRMFRGRGERRSGFMYAGARDLPAAVDWREKGAVGAVKDQG 155

Query: 143 QCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQN 195
           QC        +AAVEGINAI+ + L +LSEQQLVDC T   N GC GG MD+AF+YI ++
Sbjct: 156 QCGSCWAFSTIAAVEGINAIRTSNLTALSEQQLVDCDTKTGNAGCDGGLMDNAFQYIAKH 215

Query: 196 KGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDA 255
            G+   + Y Y    +  C S  A   A  I  YEDVP N E +L KAVANQPVSVAI+A
Sbjct: 216 GGVAASSAYPYRARQS-SCKSSAASSPAVTIDGYEDVPANSESALKKAVANQPVSVAIEA 274

Query: 256 --SALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQR 313
             S  QFYS GVF G C T L+HGV AVGYGT+ +G KYW+++NSWG DWGE GY R++R
Sbjct: 275 GGSHFQFYSEGVFAGKCGTELDHGVAAVGYGTTVDGTKYWIVRNSWGADWGEKGYIRMKR 334

Query: 314 DIDQPQGQCGIAMFASFPVSKESAQPS 340
           D+   +G CGIAM AS+P+ K S  P+
Sbjct: 335 DVSAKEGLCGIAMEASYPI-KTSPNPA 360


>gi|18408616|ref|NP_566901.1| putative cysteine proteinase [Arabidopsis thaliana]
 gi|75313880|sp|Q9STL5.1|CEP3_ARATH RecName: Full=KDEL-tailed cysteine endopeptidase CEP3; Flags:
           Precursor
 gi|4678353|emb|CAB41163.1| cysteine endopeptidase precursor-like protein [Arabidopsis
           thaliana]
 gi|26453052|dbj|BAC43602.1| putative cysteine endopeptidase precursor [Arabidopsis thaliana]
 gi|332644885|gb|AEE78406.1| putative cysteine proteinase [Arabidopsis thaliana]
          Length = 364

 Score =  285 bits (729), Expect = 2e-74,   Method: Compositional matrix adjust.
 Identities = 159/361 (44%), Positives = 219/361 (60%), Gaps = 26/361 (7%)

Query: 1   MAKYFLIVVLIISGSCASQATYRTFDEGSIAEK------FEQWKAQYGRTYKESAENSKR 54
           M  +F++++  +S   AS+     FDE  +  +      +E+W+  +  + + S E  KR
Sbjct: 1   MKLFFIVLISFLSLLQASKGF--DFDEKELETEENVWKLYERWRGHHSVS-RASHEAIKR 57

Query: 55  FEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHS--SSLKAN 112
           F +F+ N++ V R N     N+ Y L++N+FAD+T  EF +S  G  +  H      K  
Sbjct: 58  FNVFRHNVLHVHRTNKK---NKPYKLKINRFADITHHEFRSSYAGSNVKHHRMLRGPKRG 114

Query: 113 GTPFLYKS-SQVPPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLS 164
              F+Y++ ++VP SV+W EKGAVT VK Q  C        VAAVEGIN I+ N+LVSLS
Sbjct: 115 SGGFMYENVTRVPSSVDWREKGAVTEVKNQQDCGSCWAFSTVAAVEGINKIRTNKLVSLS 174

Query: 165 EQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAA 224
           EQ+LVDC T +N  GC GG M+ AF++I  N GI  +  Y Y+      C +        
Sbjct: 175 EQELVDCDTEENQ-GCAGGLMEPAFEFIKNNGGIKTEETYPYDSSDVQFCRANSIGGETV 233

Query: 225 QITNYEDVPPNDEESLLKAVANQPVSVAIDA--SALQFYSGGVFNGYCETFLNHGVTAVG 282
            I  +E VP NDEE LLKAVA+QPVSVAIDA  S  Q YS GVF G C T LNHGV  VG
Sbjct: 234 TIDGHEHVPENDEEELLKAVAHQPVSVAIDAGSSDFQLYSEGVFIGECGTQLNHGVVIVG 293

Query: 283 YGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVSKESAQPSSA 342
           YG ++ G KYW+++NSWG +WGE GY R++R I + +G+CGIAM AS+P +K S+ PS+ 
Sbjct: 294 YGETKNGTKYWIVRNSWGPEWGEGGYVRIERGISENEGRCGIAMEASYP-TKLSSTPSTH 352

Query: 343 D 343
           +
Sbjct: 353 E 353


>gi|255563136|ref|XP_002522572.1| cysteine protease, putative [Ricinus communis]
 gi|223538263|gb|EEF39872.1| cysteine protease, putative [Ricinus communis]
          Length = 340

 Score =  285 bits (728), Expect = 3e-74,   Method: Compositional matrix adjust.
 Identities = 168/344 (48%), Positives = 224/344 (65%), Gaps = 30/344 (8%)

Query: 6   LIVVLIISGSCASQATYRTF-DEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVA 64
           L +VL+I  +  SQA  R   DE ++AEK EQW A++GRTY++  E  +RF IFK NL  
Sbjct: 9   LAIVLMILVTWVSQAMPRPLIDEDAVAEKHEQWMARHGRTYQDDEEKERRFHIFKKNLKH 68

Query: 65  VERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKM-----SDHSSSLKANGTPFLYK 119
           +E FNNA   NR+Y L LN FADLT +EF+A+ TG+KM     + + ++     +  LY+
Sbjct: 69  IENFNNAF--NRTYKLGLNHFADLTDEEFLATYTGYKMPKVLPTANITTKTTQSSDVLYE 126

Query: 120 SSQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCA 172
           ++ VP S++W  +G VTPVK QG+C       A AAVEGI    I   VSLS QQL+DC 
Sbjct: 127 AN-VPESIDWRTRGVVTPVKNQGRCGCCWAFSAAAAVEGI----IGNGVSLSAQQLLDCV 181

Query: 173 TNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDV 232
              ++NGC GGFMD+AF+YIIQN+G+ +   Y Y+ M     +  +  ++AA+I+ Y DV
Sbjct: 182 --PDSNGCNGGFMDNAFRYIIQNQGLASATYYPYQLMR----EMCRPSNNAARISGYVDV 235

Query: 233 PPNDEESLLKAVANQPVSVAIDASA---LQFYSGGVFNGY-CETFLNHGVTAVGYGTSEE 288
            P DEE+L  AVA QPVS A+DA++    ++Y GG+F    C + L H +T VGYGTS E
Sbjct: 236 TPADEETLKSAVARQPVSAAVDATSELNFKYYGGGIFPPQDCGSTLTHAITIVGYGTSAE 295

Query: 289 GIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
           G KYWLIKNSWG+ WGE GY RLQRD+    G CGIA+ AS+P 
Sbjct: 296 GTKYWLIKNSWGEGWGEGGYMRLQRDVGSYGGACGIALRASYPT 339


>gi|1208549|gb|AAC49455.1| Pseudotzain [Pseudotsuga menziesii]
          Length = 454

 Score =  284 bits (727), Expect = 4e-74,   Method: Compositional matrix adjust.
 Identities = 168/356 (47%), Positives = 217/356 (60%), Gaps = 28/356 (7%)

Query: 1   MAKYFLIVVLIISGSCASQA-------TYRTFD---EGSIAEKFEQWKAQYGRTYKESAE 50
           M    L  VL +S    S +       +Y + D   + +I E +E W AQ+ + Y    E
Sbjct: 1   MGILLLFAVLALSAMAGSASRADFSIISYDSQDLIGDDAIMELYELWLAQHKKAYNGLDE 60

Query: 51  NSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLK 110
             K+F +FKDN + + + NN   GN SY L LN+FADL+ +EF A+  G K+ D    L 
Sbjct: 61  KQKKFSVFKDNFLYIHQHNNQ--GNPSYKLGLNQFADLSHEEFKAAYLGTKL-DAKKRLS 117

Query: 111 ANGTP-FLYKSSQ-VPPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLV 161
            + +P + Y   + +P S++W EKGAVT VK QG C        VAAVEGIN I    L 
Sbjct: 118 RSPSPRYQYSVGEDLPESIDWREKGAVTAVKNQGSCGSCWAFSTVAAVEGINQIVTGNLT 177

Query: 162 SLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAED 221
           SLSEQ+LVDC T+  N GC GG MD AF++II N G+ ++  Y Y+  + G CD+ +   
Sbjct: 178 SLSEQELVDCDTS-YNQGCNGGLMDYAFQFIISNGGLDSEDDYPYKA-NNGSCDAYRKNA 235

Query: 222 HAAQITNYEDVPPNDEESLLKAVANQPVSVAIDAS--ALQFYSGGVFNGYCETFLNHGVT 279
           H   I +YEDVP NDE+SL KA ANQP+SVAI+AS  A QFY  GVF   C T L+HGVT
Sbjct: 236 HVVTIDDYEDVPENDEKSLKKAAANQPISVAIEASGRAFQFYESGVFTSNCGTQLDHGVT 295

Query: 280 AVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDID-QPQGQCGIAMFASFPVSK 334
            VGYG SE GI YWL+KNSWG  WGE G+ +LQR+++    G CGIAM AS+PV K
Sbjct: 296 LVGYG-SESGIDYWLVKNSWGNSWGEKGFIKLQRNLEGASTGMCGIAMEASYPVKK 350


>gi|115484973|ref|NP_001067630.1| Os11g0255300 [Oryza sativa Japonica Group]
 gi|530335|emb|CAA56844.1| cysteine protease [Oryza sativa Japonica Group]
 gi|5761322|dbj|BAA83472.1| cysteine endopeptidase [Oryza sativa Japonica Group]
 gi|62732672|gb|AAX94791.1| cysteine proteinase (EC 3.4.22.-) - rice [Oryza sativa Japonica
           Group]
 gi|62732673|gb|AAX94792.1| cysteine proteinase (EC 3.4.22.-) - rice [Oryza sativa Japonica
           Group]
 gi|62732674|gb|AAX94793.1| cysteine proteinase (EC 3.4.22.-) - rice [Oryza sativa Japonica
           Group]
 gi|77549615|gb|ABA92412.1| Thiol protease SEN102 precursor, putative, expressed [Oryza sativa
           Japonica Group]
 gi|77549616|gb|ABA92413.1| Thiol protease SEN102 precursor, putative, expressed [Oryza sativa
           Japonica Group]
 gi|77549617|gb|ABA92414.1| Thiol protease SEN102 precursor, putative, expressed [Oryza sativa
           Japonica Group]
 gi|113644852|dbj|BAF27993.1| Os11g0255300 [Oryza sativa Japonica Group]
 gi|125576789|gb|EAZ18011.1| hypothetical protein OsJ_33558 [Oryza sativa Japonica Group]
 gi|215701098|dbj|BAG92522.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 378

 Score =  284 bits (727), Expect = 4e-74,   Method: Compositional matrix adjust.
 Identities = 156/342 (45%), Positives = 204/342 (59%), Gaps = 28/342 (8%)

Query: 27  EGSIAEKFEQWKAQY--------GRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSY 78
           E S+   +E+W+++Y        G    +  E  +RF +F +N   +   N    G R +
Sbjct: 35  EESLRALYERWRSRYTVSRPAASGGVGNDDGEARRRFNVFVENARYIHEANRR--GGRPF 92

Query: 79  TLRLNKFADLTPQEFIASQTGFKMSDHSS---SLKANGTPFLY---KSSQVPPSVNWIEK 132
            L LNKFAD+T  EF  +  G +   H S        G  F Y       +PP+V+W E+
Sbjct: 93  RLALNKFADMTTDEFRRTYAGSRARHHRSLSGGRGGEGGSFRYGGDDEDNLPPAVDWRER 152

Query: 133 GAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFM 185
           GAVT +K QGQC        VAAVEG+N IK  RLV+LSEQ+LVDC T DN  GC GG M
Sbjct: 153 GAVTGIKDQGQCGSCWAFSTVAAVEGVNKIKTGRLVTLSEQELVDCDTGDNQ-GCDGGLM 211

Query: 186 DDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVA 245
           D AF++I +N GIT ++ Y Y     G C+  KA  H   I  YEDVP NDE +L KAVA
Sbjct: 212 DYAFQFIKRNGGITTESNYPYR-AEQGRCNKAKASSHDVTIDGYEDVPANDESALQKAVA 270

Query: 246 NQPVSVAIDASA--LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDW 303
           NQPV+VA++AS    QFYS GVF G C T L+HGV AVGYG + +G KYW++KNSWG+DW
Sbjct: 271 NQPVAVAVEASGQDFQFYSEGVFTGECGTDLDHGVAAVGYGITRDGTKYWIVKNSWGEDW 330

Query: 304 GEDGYFRLQRDI-DQPQGQCGIAMFASFPVSKESAQPSSADK 344
           GE GY R+QR +     G CGIAM AS+PV   +   +++++
Sbjct: 331 GERGYIRMQRGVSSDSNGLCGIAMEASYPVKSGARNAAASNR 372


>gi|356517184|ref|XP_003527269.1| PREDICTED: xylem cysteine proteinase 2-like [Glycine max]
          Length = 350

 Score =  284 bits (727), Expect = 4e-74,   Method: Compositional matrix adjust.
 Identities = 153/315 (48%), Positives = 201/315 (63%), Gaps = 17/315 (5%)

Query: 30  IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLT 89
           + E FE W +++G+ Y+   E   RFEIFKDNL  ++  N       +Y L LN+FADL+
Sbjct: 44  LIELFESWISRHGKIYQSIEEKLHRFEIFKDNLKHIDERNKVV---SNYWLGLNEFADLS 100

Query: 90  PQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQCA---- 145
            QEF     G K+ D+S   + +   F YK  ++P SV+W +KGAVT VK QG C     
Sbjct: 101 HQEFKNKYLGLKV-DYSRR-RESPEEFTYKDVELPKSVDWRKKGAVTQVKNQGSCGSCWA 158

Query: 146 ---VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDA 202
              VAAVEGIN I    L SLSEQ+L+DC     NNGC GG MD AF +I++N G+  + 
Sbjct: 159 FSTVAAVEGINQIVTGNLTSLSEQELIDC-DRTYNNGCNGGLMDYAFSFIVENDGLHKEE 217

Query: 203 VYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA--LQF 260
            Y Y  M  G C+  K E     I+ Y DVP N+E+SLLKA+ANQP+SVAI+AS    QF
Sbjct: 218 DYPYI-MEEGTCEMAKEETEVVTISGYHDVPQNNEQSLLKALANQPLSVAIEASGRDFQF 276

Query: 261 YSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQG 320
           YSGGVF+G+C + L+HGV AVGYGT++ G+ Y  +KNSWG  WGE GY R++R+I +P+G
Sbjct: 277 YSGGVFDGHCGSDLDHGVAAVGYGTAK-GVDYITVKNSWGSKWGEKGYIRMRRNIGKPEG 335

Query: 321 QCGIAMFASFPVSKE 335
            CGI   AS+P  K+
Sbjct: 336 ICGIYKMASYPTKKK 350


>gi|147772785|emb|CAN62838.1| hypothetical protein VITISV_003391 [Vitis vinifera]
          Length = 298

 Score =  284 bits (727), Expect = 4e-74,   Method: Compositional matrix adjust.
 Identities = 153/341 (44%), Positives = 201/341 (58%), Gaps = 60/341 (17%)

Query: 3   KYFLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNL 62
           +Y  + +L I  + ASQAT R+  E S+ E+ E W A+YGR YK++ E  KRF+IFKDN+
Sbjct: 8   QYVSMALLFILAAWASQATSRSLHEASMYERHEDWMARYGRMYKDANEKEKRFKIFKDNV 67

Query: 63  VAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKS-S 121
                                                           A  T F Y++ +
Sbjct: 68  ------------------------------------------------AQATTFKYENVT 79

Query: 122 QVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATN 174
            VP +++W +KGAVTP+K Q QC       AVAA EGI  I   +L+SLSEQ+LVDC T 
Sbjct: 80  AVPSTIDWRKKGAVTPIKDQQQCGSCWAFSAVAATEGITQITTGKLISLSEQELVDCDTG 139

Query: 175 DNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPP 234
             N GC GG  DDAF++I  + G+ ++A Y YEG   G C+S K    AA+I  YEDVP 
Sbjct: 140 GENQGCSGGLXDDAFRFIXIH-GLASEATYPYEG-DDGTCNSKKEAHPAAKIKGYEDVPA 197

Query: 235 NDEESLLKAVANQPVSVAIDASAL--QFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKY 292
           N+E++L KAVA+QPV+VAIDA     QFY+ GVF G C T L+HGV AVGYG  ++G+ Y
Sbjct: 198 NNEKALQKAVAHQPVAVAIDAGGFEFQFYTSGVFTGQCGTELDHGVAAVGYGIGDDGMXY 257

Query: 293 WLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVS 333
           WL+KNSWG  WGE+GY R+QRD+   +G CGIAM AS+P +
Sbjct: 258 WLVKNSWGTGWGEEGYIRMQRDVTAKEGLCGIAMQASYPTA 298


>gi|356508487|ref|XP_003522988.1| PREDICTED: xylem cysteine proteinase 2-like [Glycine max]
          Length = 349

 Score =  284 bits (727), Expect = 4e-74,   Method: Compositional matrix adjust.
 Identities = 152/315 (48%), Positives = 201/315 (63%), Gaps = 17/315 (5%)

Query: 30  IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLT 89
           + E FE W +++G+ Y+   E   RF+IFKDNL  ++  N       +Y L LN+FADL+
Sbjct: 43  LIELFESWMSRHGKIYQSIEEKLHRFDIFKDNLKHIDERNKVV---SNYWLGLNEFADLS 99

Query: 90  PQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQCA---- 145
            QEF     G K+ D+S   + +   F YK  ++P SV+W +KGAVT VK QG C     
Sbjct: 100 HQEFKNKYLGLKV-DYSRR-RESPEEFTYKDFELPKSVDWRKKGAVTQVKNQGSCGSCWA 157

Query: 146 ---VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDA 202
              VAAVEGIN I    L SLSEQ+L+DC     NNGC GG MD AF +I++N G+  + 
Sbjct: 158 FSTVAAVEGINQIVTGNLTSLSEQELIDC-DRTYNNGCNGGLMDYAFSFIVENGGLHKEE 216

Query: 203 VYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA--LQF 260
            Y Y  M  G C+  K E     I+ Y DVP N+E+SLLKA+ NQP+SVAI+AS    QF
Sbjct: 217 DYPYI-MEEGTCEMTKEETEVVTISGYHDVPQNNEQSLLKALVNQPLSVAIEASGRDFQF 275

Query: 261 YSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQG 320
           YSGGVF+G+C + L+HGV AVGYGTS+ G+ Y ++KNSWG  WGE GY R++R+I +P+G
Sbjct: 276 YSGGVFDGHCGSDLDHGVAAVGYGTSK-GVNYIIVKNSWGSKWGEKGYIRMRRNIGKPEG 334

Query: 321 QCGIAMFASFPVSKE 335
            CGI   AS+P  K+
Sbjct: 335 ICGIYKMASYPTKKK 349


>gi|116786779|gb|ABK24233.1| unknown [Picea sitchensis]
          Length = 463

 Score =  284 bits (726), Expect = 5e-74,   Method: Compositional matrix adjust.
 Identities = 167/356 (46%), Positives = 216/356 (60%), Gaps = 28/356 (7%)

Query: 1   MAKYFLIVVLIISG--SCASQATYRTF--------DEGSIAEKFEQWKAQYGRTYKESAE 50
           M    L  VL +S     AS+A +           ++ +I E +E W AQ+ + Y    E
Sbjct: 1   MGILLLFAVLALSAMAGSASRADFSIIGYDSKDLREDDAIMELYELWLAQHKKAYNGLGE 60

Query: 51  NSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLK 110
              RF +FKDN + + + NN   GN SY L LN+FADL+ +EF A+  G K+ D    L 
Sbjct: 61  KQNRFSVFKDNFLYIHQHNNQ--GNPSYKLGLNQFADLSHEEFKATYLGAKL-DTKKRLS 117

Query: 111 ANGTP-FLYKSSQ-VPPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLV 161
            + +P + Y   + +P S++W EKGAVT VK QG C        VAAVEGIN I    L 
Sbjct: 118 NSPSPRYQYSDGEDLPESIDWREKGAVTAVKDQGSCGSCWAFSTVAAVEGINQIVTGNLT 177

Query: 162 SLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAED 221
           SLSEQ+LVDC T+  N GC GG MD AF++II N G+ ++  Y Y+  + G CD+ +   
Sbjct: 178 SLSEQELVDCDTS-YNQGCNGGLMDYAFQFIINNGGLDSEDDYPYKA-NDGSCDAYRKNA 235

Query: 222 HAAQITNYEDVPPNDEESLLKAVANQPVSVAIDAS--ALQFYSGGVFNGYCETFLNHGVT 279
           H   I +YEDVP NDE+SL KA ANQP+SVAI+AS  A QFY  GVF   C T L+HGVT
Sbjct: 236 HVVTIDDYEDVPENDEKSLKKAAANQPISVAIEASGRAFQFYESGVFTSTCGTQLDHGVT 295

Query: 280 AVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQ-PQGQCGIAMFASFPVSK 334
            VGYG SE G  YW++KNSWG+ WGE G+ RLQR+I+    G CGIAM AS+P+ K
Sbjct: 296 LVGYG-SESGTDYWIVKNSWGKSWGEKGFIRLQRNIEGVSTGMCGIAMEASYPLKK 350


>gi|356517188|ref|XP_003527271.1| PREDICTED: xylem cysteine proteinase 2-like [Glycine max]
          Length = 350

 Score =  284 bits (726), Expect = 5e-74,   Method: Compositional matrix adjust.
 Identities = 152/315 (48%), Positives = 201/315 (63%), Gaps = 17/315 (5%)

Query: 30  IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLT 89
           + E FE W +++G+ Y+   E   RFEIFKDNL  ++  N       +Y L LN+FADL+
Sbjct: 44  LIELFESWMSRHGKIYENIEEKLLRFEIFKDNLKHIDERNKVV---SNYWLGLNEFADLS 100

Query: 90  PQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQCA---- 145
            +EF     G K+ D+S   + +   F YK  ++P SV+W +KGAV PVK QG C     
Sbjct: 101 HREFNNKYLGLKV-DYSRR-RESPEEFTYKDVELPKSVDWRKKGAVAPVKNQGSCGSCWA 158

Query: 146 ---VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDA 202
              VAAVEGIN I    L SLSEQ+L+DC     NNGC GG MD AF +I++N G+  + 
Sbjct: 159 FSTVAAVEGINQIVTGNLTSLSEQELIDC-DRTYNNGCNGGLMDYAFSFIVENGGLHKEE 217

Query: 203 VYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA--LQF 260
            Y Y  M  G C+  K E     I+ Y DVP N+E+SLLKA+ANQP+SVAI+AS    QF
Sbjct: 218 DYPYI-MEEGTCEMTKEETQVVTISGYHDVPQNNEQSLLKALANQPLSVAIEASGRDFQF 276

Query: 261 YSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQG 320
           YSGGVF+G+C + L+HGV AVGYGT++ G+ Y  +KNSWG  WGE GY R++R+I +P+G
Sbjct: 277 YSGGVFDGHCGSDLDHGVAAVGYGTAK-GVDYITVKNSWGSKWGEKGYIRMRRNIGKPEG 335

Query: 321 QCGIAMFASFPVSKE 335
            CGI   AS+P  K+
Sbjct: 336 ICGIYKMASYPTKKK 350


>gi|296081395|emb|CBI16828.3| unnamed protein product [Vitis vinifera]
          Length = 359

 Score =  283 bits (725), Expect = 6e-74,   Method: Compositional matrix adjust.
 Identities = 154/319 (48%), Positives = 209/319 (65%), Gaps = 20/319 (6%)

Query: 27  EGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFA 86
           E S+ + +E+W++ Y    ++  E +KRF +FK+N   V + N     ++ Y L+LNKFA
Sbjct: 33  EESLWDLYERWRS-YHTVSRDLEEKNKRFNVFKENTKHVHKVNQM---DKPYKLKLNKFA 88

Query: 87  DLTPQEFIASQTGFKMSDHSSSLKAN--GTP-FLY-KSSQVPPSVNWIEKGAVTPVKYQG 142
           D+T  EF +S  G K+  H   L+ +  GT  F++ K++ +PPSV+W +KGAVT +K QG
Sbjct: 89  DMTNHEFRSSYGGSKVK-HYRMLRGDRRGTGGFMHEKTTYLPPSVDWRKKGAVTGIKDQG 147

Query: 143 QCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQN 195
           +C        V  VEGIN IK   L+SLSEQQL+DC  +D++ GC GG M+ AF++I +N
Sbjct: 148 KCGSCWAFSTVVGVEGINQIKTKELLSLSEQQLIDCDRSDDH-GCNGGLMESAFEFIKKN 206

Query: 196 KGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDA 255
            GIT +  Y Y+      CD +K       I  +E VP NDE +L+KAVA+QPVSVAIDA
Sbjct: 207 GGITTENNYPYKAKDER-CDMLKMNAPVVTIDGHESVPVNDERALMKAVAHQPVSVAIDA 265

Query: 256 --SALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQR 313
             S LQFYS GVF+G C T L+HGV  VGYGT+ +G KYW++KNSWG +WGE GY R+ R
Sbjct: 266 GGSDLQFYSEGVFDGECGTELDHGVAIVGYGTTLDGTKYWIVKNSWGAEWGEKGYIRMAR 325

Query: 314 DIDQPQGQCGIAMFASFPV 332
            I   +GQCGIAM AS+PV
Sbjct: 326 GIQAAEGQCGIAMEASYPV 344


>gi|359473128|ref|XP_002285397.2| PREDICTED: vignain-like [Vitis vinifera]
          Length = 357

 Score =  283 bits (725), Expect = 7e-74,   Method: Compositional matrix adjust.
 Identities = 154/319 (48%), Positives = 209/319 (65%), Gaps = 20/319 (6%)

Query: 27  EGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFA 86
           E S+ + +E+W++ Y    ++  E +KRF +FK+N   V + N     ++ Y L+LNKFA
Sbjct: 31  EESLWDLYERWRS-YHTVSRDLEEKNKRFNVFKENTKHVHKVNQM---DKPYKLKLNKFA 86

Query: 87  DLTPQEFIASQTGFKMSDHSSSLKAN--GTP-FLY-KSSQVPPSVNWIEKGAVTPVKYQG 142
           D+T  EF +S  G K+  H   L+ +  GT  F++ K++ +PPSV+W +KGAVT +K QG
Sbjct: 87  DMTNHEFRSSYGGSKVK-HYRMLRGDRRGTGGFMHEKTTYLPPSVDWRKKGAVTGIKDQG 145

Query: 143 QCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQN 195
           +C        V  VEGIN IK   L+SLSEQQL+DC  +D++ GC GG M+ AF++I +N
Sbjct: 146 KCGSCWAFSTVVGVEGINQIKTKELLSLSEQQLIDCDRSDDH-GCNGGLMESAFEFIKKN 204

Query: 196 KGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDA 255
            GIT +  Y Y+      CD +K       I  +E VP NDE +L+KAVA+QPVSVAIDA
Sbjct: 205 GGITTENNYPYKAKDE-RCDMLKMNAPVVTIDGHESVPVNDERALMKAVAHQPVSVAIDA 263

Query: 256 --SALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQR 313
             S LQFYS GVF+G C T L+HGV  VGYGT+ +G KYW++KNSWG +WGE GY R+ R
Sbjct: 264 GGSDLQFYSEGVFDGECGTELDHGVAIVGYGTTLDGTKYWIVKNSWGAEWGEKGYIRMAR 323

Query: 314 DIDQPQGQCGIAMFASFPV 332
            I   +GQCGIAM AS+PV
Sbjct: 324 GIQAAEGQCGIAMEASYPV 342


>gi|2224808|emb|CAB09697.1| cysteine endopeptidase EP-A [Hordeum vulgare subsp. vulgare]
 gi|326502180|dbj|BAK06781.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 365

 Score =  283 bits (724), Expect = 8e-74,   Method: Compositional matrix adjust.
 Identities = 156/321 (48%), Positives = 199/321 (61%), Gaps = 22/321 (6%)

Query: 27  EGSIAEKFEQWKAQYG---RTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLN 83
           E S+   +E+W++ Y    R     AE  +RF +FK+N   V   N     +R + L LN
Sbjct: 34  EESLRGLYERWRSHYTVSRRGLGADAEE-RRFNVFKENARYVHEGNKR---DRPFRLALN 89

Query: 84  KFADLTPQEFIASQTGFKMSDH---SSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKY 140
           KFAD+T  EF  +  G ++  H   S   + +G      +  +PP+V+W +KGAVT +K 
Sbjct: 90  KFADMTTDEFRRTYAGSRVRHHLSLSGGRRGDGGFRYADADNLPPAVDWRQKGAVTAIKD 149

Query: 141 QGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYII 193
           QGQC        + AVEGIN I+  +LVSLSEQ+L+DC  N NN GC GG MD AF++I 
Sbjct: 150 QGQCGSCWAFSTIVAVEGINKIRTGKLVSLSEQELMDC-DNVNNQGCEGGLMDYAFQFI- 207

Query: 194 QNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAI 253
           Q  GIT ++ Y Y+G   G CD  K    A  I  YEDVP NDE +L KAVA QPVSVAI
Sbjct: 208 QKNGITTESNYPYQG-EQGSCDQAKENAQAVTIDGYEDVPANDESALQKAVAGQPVSVAI 266

Query: 254 DASA--LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRL 311
           DAS    QFYS GVF G C T L+HGV AVGYG + +G KYW++KNSWG+DWGE GY R+
Sbjct: 267 DASGQDFQFYSEGVFTGECSTDLDHGVAAVGYGATRDGTKYWIVKNSWGEDWGEKGYIRM 326

Query: 312 QRDIDQPQGQCGIAMFASFPV 332
           QR + Q +G CGIAM AS+P 
Sbjct: 327 QRGVSQTEGLCGIAMQASYPT 347


>gi|356545116|ref|XP_003540991.1| PREDICTED: vignain-like [Glycine max]
          Length = 342

 Score =  283 bits (723), Expect = 1e-73,   Method: Compositional matrix adjust.
 Identities = 149/340 (43%), Positives = 215/340 (63%), Gaps = 19/340 (5%)

Query: 6   LIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAV 65
           ++VV ++     SQ   R   E   + K E+W AQYG+ YK++AE  KRF+IFK+N+  +
Sbjct: 10  ILVVFLVLTVWTSQVMSRRLSEAYSSVKHEKWMAQYGKVYKDAAEKEKRFQIFKNNVHFI 69

Query: 66  ERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHS-SSLKANGTPFLYKS-SQV 123
           E F+  A G++ + L +N+FADL   +F A     +  +H+  +  A    F Y S +++
Sbjct: 70  ESFH--AAGDKPFNLSINQFADL--HKFKALLINGQKKEHNVRTATATEASFKYDSVTRI 125

Query: 124 PPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDN 176
           P S++W ++GAVTP+K QG C        VA +EG++ I    LVSLSEQ+LVDC   D+
Sbjct: 126 PSSLDWRKRGAVTPIKDQGTCRSCWAFSTVATIEGLHQITKGELVSLSEQELVDCVKGDS 185

Query: 177 NNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAA-QITNYEDVPPN 235
             GCYGG+++DAF++I +  G+ ++  Y Y+G++   C  +K E H   QI  YE VP N
Sbjct: 186 E-GCYGGYVEDAFEFIAKKGGVASETHYPYKGVNK-TCK-VKKETHGVVQIKGYEQVPSN 242

Query: 236 DEESLLKAVANQPVSVAIDAS--ALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYW 293
            E++LLKAVA+QPVS  ++A   A QFYS G+F G C T ++H VT VGYG +  G KYW
Sbjct: 243 SEKALLKAVAHQPVSAYVEAGGYAFQFYSSGIFTGKCGTDIDHSVTVVGYGKARGGNKYW 302

Query: 294 LIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVS 333
           L+KNSWG +WGE GY R++RDI   +G CGIA  A +P +
Sbjct: 303 LVKNSWGTEWGEKGYIRMKRDIRAKEGLCGIATGALYPTA 342


>gi|310656790|gb|ADP02219.1| Peptidase_C1 domain-containing protein [Triticum aestivum]
          Length = 419

 Score =  282 bits (722), Expect = 2e-73,   Method: Compositional matrix adjust.
 Identities = 153/339 (45%), Positives = 203/339 (59%), Gaps = 30/339 (8%)

Query: 1   MAKYFLIVVLIISGSCASQATY---RTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEI 57
           MA    +++ II   C   +T    R   + ++ EK EQW A++ R YK+S E ++RF+ 
Sbjct: 1   MAIPKALLLAIIGSICLCSSTVLSARELGDAAMVEKHEQWMAKFNRVYKDSTEKAQRFKA 60

Query: 58  FKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANG---- 113
           FK N+  +E FN    GN  + L +N+F DLT  EF A++T       +  LK NG    
Sbjct: 61  FKANVAFIESFNT---GNHKFWLGVNQFTDLTNDEFRATKT-------NKGLKRNGARAP 110

Query: 114 TPFLYK---SSQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSL 163
           T F Y    +  +P +V+W  KG VTP+K QGQC       AVAA EGI  +   +LVSL
Sbjct: 111 TRFKYNNVSTDALPAAVDWRTKGVVTPIKDQGQCGCCWAFSAVAATEGIVKLSTGKLVSL 170

Query: 164 SEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHA 223
           SEQ+LVDC  +  + GC GG MD+AFK+II+N G+T +A Y Y     G C +    +  
Sbjct: 171 SEQELVDCDVHGVDQGCEGGEMDNAFKFIIKNGGLTTEANYPYTAQD-GQCKTSTTSNSV 229

Query: 224 AQITNYEDVPPNDEESLLKAVANQPVSVAIDAS--ALQFYSGGVFNGYCETFLNHGVTAV 281
           A I  YEDVP NDE SL+KAVANQPVSVA+D      Q YSGGV  G C T L+HG+ A+
Sbjct: 230 ATIKGYEDVPANDESSLMKAVANQPVSVAVDGGDVIFQHYSGGVMTGSCGTDLDHGIVAI 289

Query: 282 GYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQG 320
           GYG + +G K+WL+KNSWG  WGE GY R+++DI    G
Sbjct: 290 GYGMTSDGTKFWLLKNSWGTTWGESGYLRMEKDISDKSG 328


>gi|118486542|gb|ABK95110.1| unknown [Populus trichocarpa]
          Length = 465

 Score =  282 bits (721), Expect = 2e-73,   Method: Compositional matrix adjust.
 Identities = 153/344 (44%), Positives = 215/344 (62%), Gaps = 19/344 (5%)

Query: 1   MAKYFLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKD 60
           ++  F + ++    + A+++++RT DE  +   +E+W  ++G+ Y    E  KRFEIFKD
Sbjct: 20  LSSAFDMSIISYHQTHATKSSWRTDDE--VMAMYEEWLVKHGKNYNALGEKEKRFEIFKD 77

Query: 61  NLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYK- 119
           NL+ +++ N+    NR+YT+ LN+FADLT +EF +   G + + H   L      +  + 
Sbjct: 78  NLMFIDQHNSE---NRTYTVGLNRFADLTNEEFRSMYLGTR-TGHKKRLPKTSDRYAPRV 133

Query: 120 SSQVPPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCA 172
              +P SV+W ++GAV  VK QG C        +AAVEGIN I    L++LSEQ+LVDC 
Sbjct: 134 GDSLPDSVDWRKEGAVAEVKDQGGCGSCWAFSTIAAVEGINKIVTGDLIALSEQELVDCD 193

Query: 173 TNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDV 232
           T+  N GC GG MD AF++II N GI  +  Y Y G   G CD+ +       I +YEDV
Sbjct: 194 TS-YNEGCNGGLMDYAFEFIINNGGIDTEDDYPYLGRD-GRCDTYRKNAKVVSIDSYEDV 251

Query: 233 PPNDEESLLKAVANQPVSVAIDASA--LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGI 290
           P NDE +L KAVANQPVSVAI+      Q Y+ GVF G C T L+HGV AVGYGT E+G 
Sbjct: 252 PENDETALKKAVANQPVSVAIEGGGRNFQLYNSGVFTGECGTSLDHGVAAVGYGT-EKGK 310

Query: 291 KYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVSK 334
            YW+++NSWG+ WGE GY R++R+I  P G+CGIA+  S+P+ K
Sbjct: 311 DYWIVRNSWGKSWGESGYIRMERNIASPTGKCGIAIEPSYPIKK 354


>gi|356563155|ref|XP_003549830.1| PREDICTED: vignain-like [Glycine max]
          Length = 361

 Score =  281 bits (720), Expect = 2e-73,   Method: Compositional matrix adjust.
 Identities = 153/326 (46%), Positives = 211/326 (64%), Gaps = 21/326 (6%)

Query: 27  EGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFA 86
           E  + + +E+W++ +  + +   E   RF +FK N++ V   N     ++ Y L+LN+FA
Sbjct: 33  EEGLWDLYERWRSHHTVS-RSLDEKHNRFNVFKGNVMHVHSSNKM---DKPYKLKLNRFA 88

Query: 87  DLTPQEFIASQTGFKMSDHS---SSLKANGTPFLYKS-SQVPPSVNWIEKGAVTPVKYQG 142
           D+T  EF +   G K++ H     + + NGT F+Y++  +VP SV+W +KGAVT VK QG
Sbjct: 89  DMTNHEFRSIYAGSKVNHHRMFRGTPRGNGT-FMYQNVDRVPSSVDWRKKGAVTDVKDQG 147

Query: 143 QCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQN 195
           QC        + AVEGIN IK ++LV LSEQ+LVDC T   N GC GG M+ AF++I Q 
Sbjct: 148 QCGSCWAFSTIVAVEGINQIKTHKLVPLSEQELVDCDTT-QNQGCNGGLMESAFEFIKQ- 205

Query: 196 KGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDA 255
            GIT  + Y YE    G CD+ K  + A  I  +E+VP N+E +LLKAVA+QPVSVAI+A
Sbjct: 206 YGITTASNYPYEA-KDGTCDASKVNEPAVSIDGHENVPVNNEAALLKAVAHQPVSVAIEA 264

Query: 256 SAL--QFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQR 313
             +  QFYS GVF G C T L+HGV  VGYGT+++G KYW +KNSWG +WGE GY R++R
Sbjct: 265 GGIDFQFYSEGVFTGNCGTALDHGVAIVGYGTTQDGTKYWTVKNSWGSEWGEKGYIRMKR 324

Query: 314 DIDQPQGQCGIAMFASFPVSKESAQP 339
            I   +G CGIAM AS+P+ K S++P
Sbjct: 325 SISVKKGLCGIAMEASYPIKKSSSKP 350


>gi|224136808|ref|XP_002326950.1| predicted protein [Populus trichocarpa]
 gi|222835265|gb|EEE73700.1| predicted protein [Populus trichocarpa]
          Length = 456

 Score =  281 bits (720), Expect = 2e-73,   Method: Compositional matrix adjust.
 Identities = 153/344 (44%), Positives = 215/344 (62%), Gaps = 19/344 (5%)

Query: 1   MAKYFLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKD 60
           ++  F + ++    + A+++++RT DE  +   +E+W  ++G+ Y    E  KRFEIFKD
Sbjct: 11  LSSAFDMSIISYHQTHATKSSWRTDDE--VMAMYEEWLVKHGKNYNALGEKEKRFEIFKD 68

Query: 61  NLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYK- 119
           NL+ +++ N+    NR+YT+ LN+FADLT +EF +   G + + H   L      +  + 
Sbjct: 69  NLMFIDQHNSE---NRTYTVGLNRFADLTNEEFRSMYLGTR-TGHKKRLPKTSDRYAPRV 124

Query: 120 SSQVPPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCA 172
              +P SV+W ++GAV  VK QG C        +AAVEGIN I    L++LSEQ+LVDC 
Sbjct: 125 GDSLPDSVDWRKEGAVAEVKDQGGCGSCWAFSTIAAVEGINKIVTGDLIALSEQELVDCD 184

Query: 173 TNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDV 232
           T+  N GC GG MD AF++II N GI  +  Y Y G   G CD+ +       I +YEDV
Sbjct: 185 TS-YNEGCNGGLMDYAFEFIINNGGIDTEDDYPYLGRD-GRCDTYRKNAKVVSIDSYEDV 242

Query: 233 PPNDEESLLKAVANQPVSVAIDASA--LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGI 290
           P NDE +L KAVANQPVSVAI+      Q Y+ GVF G C T L+HGV AVGYGT E+G 
Sbjct: 243 PENDETALKKAVANQPVSVAIEGGGRNFQLYNSGVFTGECGTSLDHGVAAVGYGT-EKGK 301

Query: 291 KYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVSK 334
            YW+++NSWG+ WGE GY R++R+I  P G+CGIA+  S+P+ K
Sbjct: 302 DYWIVRNSWGKSWGESGYIRMERNIASPTGKCGIAIEPSYPIKK 345


>gi|414588007|tpg|DAA38578.1| TPA: hypothetical protein ZEAMMB73_159244 [Zea mays]
          Length = 307

 Score =  281 bits (720), Expect = 3e-73,   Method: Compositional matrix adjust.
 Identities = 147/315 (46%), Positives = 207/315 (65%), Gaps = 21/315 (6%)

Query: 30  IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLT 89
           +AE+ E+W A+Y R YK++AE ++RFE+FKDN   VE FN  A     + L +N+FADLT
Sbjct: 1   MAERHERWMAEYDRVYKDAAEKARRFEVFKDNFAFVESFN--ADKKNKFWLGVNQFADLT 58

Query: 90  PQEFIASQTGFKMSDHSSSLKANGTPFLYKS---SQVPPSVNWIEKGAVTPVKYQGQC-- 144
            +EF A++ GFK     S+ +   T F Y++   S +P +V+W  KGAVTP+K QGQC  
Sbjct: 59  TEEFKANK-GFK---PISAEEVPTTGFKYENLSVSALPTAVDWRTKGAVTPIKNQGQCGC 114

Query: 145 -----AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGIT 199
                A+AA+EGI  +    LVSLSEQ+ VDC T++ + GC GG+MD+AF+++I+N G+ 
Sbjct: 115 CWAFSAIAAMEGIVKLSTGNLVSLSEQEPVDCDTHNMDEGCEGGWMDNAFEFVIKNGGLA 174

Query: 200 NDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASALQ 259
            ++ Y Y+ +  G C        AA I  +EDVPPN+E +L+K VA+QPVSVA+DAS   
Sbjct: 175 TESSYPYK-VVDGKCKG--GSKSAATIKGHEDVPPNNEAALMKVVASQPVSVAVDASDRT 231

Query: 260 F--YSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQ 317
           F  YSGGV  G C T L+HG+ A+GYG   +  KYW++KNSWG  WGE G+ R+++DI  
Sbjct: 232 FMLYSGGVMTGSCGTQLDHGIAAIGYGVESDDTKYWILKNSWGTTWGEKGFLRMEKDISD 291

Query: 318 PQGQCGIAMFASFPV 332
            +G C +AM  S+P 
Sbjct: 292 KRGMCDLAMKPSYPT 306


>gi|171702829|dbj|BAG16370.1| cysteine protease [Brassica oleracea var. italica]
          Length = 332

 Score =  281 bits (720), Expect = 3e-73,   Method: Compositional matrix adjust.
 Identities = 144/318 (45%), Positives = 204/318 (64%), Gaps = 17/318 (5%)

Query: 23  RTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRL 82
           R  DE ++ ++   W  ++GR Y ++ E + R+ +FK N+ ++ER N    G  ++ L +
Sbjct: 20  RPLDEVTMQKRHAAWMTEHGRVYADANEKNNRYVVFKRNVESIERLNEVQYG-LTFKLAV 78

Query: 83  NKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYK---SSQVPPSVNWIEKGAVTPVK 139
           N+FADLT +EF +  TG+K +   SS +   T F Y+   S  +P SV+W +KGAVTP+K
Sbjct: 79  NQFADLTNEEFRSMYTGYKGNSVLSS-RTKPTSFRYQHVSSDALPISVDWRKKGAVTPIK 137

Query: 140 YQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYI 192
            QG C       AVAA+EG+  IK  +L+SLSEQ+LVDC TND+  GC GG+M+ AF Y 
Sbjct: 138 DQGSCGSCWAFSAVAAIEGVAQIKKGKLISLSEQELVDCDTNDD--GCMGGYMNSAFNYT 195

Query: 193 IQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVA 252
           +   G+T+++ Y Y+  + G C+  K +  A  I  +EDVP NDE++L+KAVA+ PVS+ 
Sbjct: 196 MTTGGLTSESNYPYKS-TDGTCNINKTKQIATSIKGFEDVPANDEKALMKAVAHHPVSIG 254

Query: 253 I--DASALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFR 310
           I    +  QFYS GVF+G C T L+HGV  VGYG S  G KYW++KNSWG  WGE GY R
Sbjct: 255 IAGGGTGFQFYSSGVFSGECSTHLDHGVAVVGYGKSSNGSKYWILKNSWGPKWGERGYMR 314

Query: 311 LQRDIDQPQGQCGIAMFA 328
           +++D     GQCG+AM A
Sbjct: 315 IKKDTKAKHGQCGLAMNA 332


>gi|255646767|gb|ACU23856.1| unknown [Glycine max]
          Length = 350

 Score =  281 bits (720), Expect = 3e-73,   Method: Compositional matrix adjust.
 Identities = 151/315 (47%), Positives = 201/315 (63%), Gaps = 17/315 (5%)

Query: 30  IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLT 89
           + E FE W +++G+ Y+   E   RFEIFKDNL  ++  N       +Y L L++FADL+
Sbjct: 44  LIELFESWMSRHGKIYENIEEKLLRFEIFKDNLKHIDERNKVV---SNYWLGLSEFADLS 100

Query: 90  PQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQCA---- 145
            +EF     G K+ D+S   + +   F YK  ++P SV+W +KGAV PVK QG C     
Sbjct: 101 HREFNNKYLGLKV-DYSRR-RESPEEFTYKDVELPKSVDWRKKGAVAPVKNQGSCGSCWA 158

Query: 146 ---VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDA 202
              VAAVEGIN I    L SLSEQ+L+DC     NNGC GG MD AF +I++N G+  + 
Sbjct: 159 FSTVAAVEGINQIVTGNLTSLSEQELIDC-DRTYNNGCNGGLMDYAFSFIVENGGLHKEE 217

Query: 203 VYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA--LQF 260
            Y Y  M  G C+  K E     I+ Y DVP N+E+SLLKA+ANQP+SVAI+AS    QF
Sbjct: 218 DYPYI-MEEGACEMTKEETQVVTISGYHDVPQNNEQSLLKALANQPLSVAIEASGRDFQF 276

Query: 261 YSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQG 320
           YSGGVF+G+C + L+HGV AVGYGT++ G+ Y  +KNSWG  WGE GY R++R+I +P+G
Sbjct: 277 YSGGVFDGHCGSDLDHGVAAVGYGTAK-GVDYITVKNSWGSKWGEKGYIRMRRNIGKPEG 335

Query: 321 QCGIAMFASFPVSKE 335
            CGI   AS+P  K+
Sbjct: 336 ICGIYKMASYPTKKK 350


>gi|357467173|ref|XP_003603871.1| Cysteine proteinase [Medicago truncatula]
 gi|355492919|gb|AES74122.1| Cysteine proteinase [Medicago truncatula]
 gi|388499154|gb|AFK37643.1| unknown [Medicago truncatula]
          Length = 350

 Score =  281 bits (719), Expect = 3e-73,   Method: Compositional matrix adjust.
 Identities = 150/315 (47%), Positives = 201/315 (63%), Gaps = 16/315 (5%)

Query: 30  IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLT 89
           + E FE W +++G+ Y+   E   RFE+FKDNL  ++  N       +Y L LN+FADL+
Sbjct: 43  LIELFESWMSRHGKIYETIEEKLLRFEVFKDNLKHIDDRNKVV---SNYWLGLNEFADLS 99

Query: 90  PQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQCA---- 145
            QEF     G K+ D S   +++   F Y+   +P SV+W +KGAVTPVK QGQC     
Sbjct: 100 HQEFKNKYLGLKV-DLSQRRESSEEEFTYRDVDLPKSVDWRKKGAVTPVKNQGQCGSCWA 158

Query: 146 ---VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDA 202
              VAAVEGIN I    L SLSEQ+L+DC T   NNGC GG MD AF +I++N G+  + 
Sbjct: 159 FSTVAAVEGINQIVTGNLTSLSEQELIDCDTT-YNNGCNGGLMDYAFSFIVKNGGLHKEE 217

Query: 203 VYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA--LQF 260
            Y Y  M    C+  K       I  Y DVP N+E+SLLKA+ANQP+SVAI+AS    QF
Sbjct: 218 DYPYI-MEESTCEMKKEVSEVVTINGYHDVPQNNEQSLLKALANQPLSVAIEASGRDFQF 276

Query: 261 YSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQG 320
           YSGGVF+G+C + L+HGV+AVGYGTS+ G+ Y ++KNSWG  WGE G+ R++R+I + +G
Sbjct: 277 YSGGVFDGHCGSELDHGVSAVGYGTSK-GLDYIIVKNSWGAKWGEKGFIRMKRNIGKSEG 335

Query: 321 QCGIAMFASFPVSKE 335
            CG+   AS+P  K+
Sbjct: 336 ICGLYKMASYPTKKK 350


>gi|359483514|ref|XP_003632971.1| PREDICTED: LOW QUALITY PROTEIN: oryzain beta chain-like [Vitis
           vinifera]
          Length = 340

 Score =  280 bits (717), Expect = 5e-73,   Method: Compositional matrix adjust.
 Identities = 148/327 (45%), Positives = 213/327 (65%), Gaps = 14/327 (4%)

Query: 17  ASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNR 76
           AS+AT R   E S+ E+ EQW A+Y R YK+ AE  +RF +FKDN+  ++ F+ A  GN 
Sbjct: 18  ASEATSRPLHEASMYERHEQWMARYSRNYKDDAEEERRFXMFKDNVDFIQTFDTA--GNM 75

Query: 77  SYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKS-SQVPPSVNWIEKGAV 135
              L +N  AD+T +EF AS   FK+   +  L++  T F +++ +++P +++W +K  V
Sbjct: 76  PNKLGVNALADMTHEEFRASGNTFKIPP-NLGLRSETTSFRHQNVTRIPSTMDWRKKRTV 134

Query: 136 TPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDA 188
           T +K Q QC       AVAA+EGI  ++ ++ +SLSEQ+LVDC    +N GC GG MDDA
Sbjct: 135 THIKNQLQCGGCWAFSAVAAMEGIAKLQTSKSISLSEQELVDCDIFGSNIGCEGGCMDDA 194

Query: 189 FKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQP 248
           FK+IIQN+G+ ++A Y Y+G+  G C+  K    AA+I +YE++P   E++LLK VA+QP
Sbjct: 195 FKFIIQNRGLNSEARYLYKGVE-GHCNKKKESSRAARINDYENMPEFSEKALLKVVAHQP 253

Query: 249 VSVAIDA--SALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGED 306
           +SVAIDA  SA QFY  G+        L++GVT  GYG S +G K+WL+KNSWG DWGE+
Sbjct: 254 ISVAIDAGGSAFQFYEIGIITXESGNDLDYGVTTDGYGRSADGKKHWLVKNSWGTDWGEN 313

Query: 307 GYFRLQRDIDQPQGQCGIAMFASFPVS 333
           GY R++R +    G CG  M AS+P +
Sbjct: 314 GYTRMERGVKATTGLCGFTMQASYPTA 340


>gi|356517310|ref|XP_003527331.1| PREDICTED: vignain-like [Glycine max]
          Length = 342

 Score =  280 bits (717), Expect = 5e-73,   Method: Compositional matrix adjust.
 Identities = 151/342 (44%), Positives = 220/342 (64%), Gaps = 19/342 (5%)

Query: 4   YFLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLV 63
           ++LI+ L+++    S    R   E   +E+ E+W AQYGR YK++AE  KRF++FK+N+ 
Sbjct: 8   HYLILFLVLA-VWTSHVMSRRLSEACTSERHEKWMAQYGRVYKDAAEKEKRFQVFKNNVH 66

Query: 64  AVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKS-SQ 122
            +E FN  A G++ + L +N+FADL  +EF A     +    S    +  T F Y+S ++
Sbjct: 67  FIESFN--AAGDKPFNLSINQFADLNDEEFKALLINVQ-KKASWVETSTETSFRYESVTK 123

Query: 123 VPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATND 175
           +P +++  ++GAVTP+K QG+C       AVAA EGI+ I   +LV LSEQ+LVDC   +
Sbjct: 124 IPATIDRRKRGAVTPIKDQGRCGSCWAFSAVAATEGIHQITTGKLVPLSEQELVDCVKGE 183

Query: 176 NNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHA-AQITNYEDVPP 234
           +  GC GG++DDAF++I +  GI ++  Y Y+G++   C  +K E H  A+I  YE VP 
Sbjct: 184 SE-GCIGGYVDDAFEFIAKKGGIASETHYPYKGVNK-TC-KVKKETHGVAEIKGYEKVPS 240

Query: 235 NDEESLLKAVANQPVSVAIDA--SALQFYSGGVFNGY-CETFLNHGVTAVGYGTSEEGIK 291
           N+E++LLKAVANQPVSV IDA   A ++YS G+FN   C T  NH V  VGYG + +  K
Sbjct: 241 NNEKALLKAVANQPVSVYIDAGTHAFKYYSSGIFNARNCGTDPNHAVAVVGYGKALDDSK 300

Query: 292 YWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVS 333
           YWL+KNSWG +WGE GY R++RDI   +G CGIA +  +P++
Sbjct: 301 YWLVKNSWGTEWGERGYIRIKRDIRAKEGLCGIAKYPYYPIA 342


>gi|414584879|tpg|DAA35450.1| TPA: cysteine protease 1 [Zea mays]
          Length = 522

 Score =  280 bits (716), Expect = 8e-73,   Method: Compositional matrix adjust.
 Identities = 145/312 (46%), Positives = 195/312 (62%), Gaps = 14/312 (4%)

Query: 34  FEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEF 93
           +E W A++GR Y    E  +RF +F DNL  V+  N  A     + L +N+FADLT  EF
Sbjct: 109 YELWLAEHGRAYNALGERDRRFRVFWDNLRFVDAHNERA-AEHGFRLGMNQFADLTNDEF 167

Query: 94  IASQTGFKMSDHSSSLKANGTPFLYK--SSQVPPSVNWIEKGAVTPVKYQGQC------- 144
            A+  G ++        A G  + +   + ++P SV+W EKGAV PVK QGQC       
Sbjct: 168 RAAYLGARIPASRRRGTAVGERYRHGGGAEELPESVDWREKGAVAPVKNQGQCGSCWAFS 227

Query: 145 AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVY 204
           AV++VE +N I    +V+LSEQ+LV+C+T+  N+GC GG MD AF +II+N GI  +  Y
Sbjct: 228 AVSSVESVNQIVTGEMVTLSEQELVECSTDGGNSGCNGGLMDAAFDFIIKNGGIDTEGDY 287

Query: 205 SYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASALQF--YS 262
            Y+ +  G CD  +       I  +EDVP NDE+SL KAVA+QPVSVAI+A   +F  Y 
Sbjct: 288 PYKAVD-GKCDINRENAKVVSIDGFEDVPENDEKSLQKAVAHQPVSVAIEAGGREFQLYK 346

Query: 263 GGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQC 322
            GVF G C T L+HGV AVGYGT E G  YW+++NSWG  WGEDGY R++R+++   G+C
Sbjct: 347 AGVFTGTCTTNLDHGVVAVGYGT-ENGKDYWIVRNSWGAKWGEDGYIRMERNVNATTGKC 405

Query: 323 GIAMFASFPVSK 334
           GIAM AS+P  K
Sbjct: 406 GIAMMASYPTKK 417


>gi|242074728|ref|XP_002447300.1| hypothetical protein SORBIDRAFT_06g032360 [Sorghum bicolor]
 gi|241938483|gb|EES11628.1| hypothetical protein SORBIDRAFT_06g032360 [Sorghum bicolor]
          Length = 471

 Score =  280 bits (716), Expect = 8e-73,   Method: Compositional matrix adjust.
 Identities = 151/318 (47%), Positives = 195/318 (61%), Gaps = 15/318 (4%)

Query: 27  EGSIAEKFEQWKAQYGRTYKES-AENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKF 85
           E  +   +EQW A++G+    +  E+ +RF  F DNL  V+  +NA  G R Y L +N+F
Sbjct: 45  EAQVRAMYEQWMARHGKAASNALGEHDRRFRAFWDNLRFVD-AHNARAGARGYRLGINRF 103

Query: 86  ADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQ-VPPSVNWIEKGAVTPVKYQGQC 144
           ADLT  EF A+      + + ++  A G  + +   + +P  V+W +KGAV PVK QGQC
Sbjct: 104 ADLTNAEFRAAYLS-AGARNGTATAATGERYRHDGVEALPEFVDWRQKGAVAPVKNQGQC 162

Query: 145 -------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKG 197
                  AV AVEGIN I    LV+LSEQ+LVDC+ N  N GC GG MDDAF +I+ N G
Sbjct: 163 GSCWAFSAVGAVEGINQIVTGELVTLSEQELVDCSKNGQNGGCDGGMMDDAFAFIVGNGG 222

Query: 198 ITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA 257
           I  D  Y Y     G CD  K   H   I  +E VP NDE+SL KAVA+QPV+VAI+A  
Sbjct: 223 IDTDKDYPYTARD-GKCDVAKRSRHVVSIDGFEGVPRNDEKSLQKAVAHQPVAVAIEAGG 281

Query: 258 --LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIK-YWLIKNSWGQDWGEDGYFRLQRD 314
              Q Y  GVF G C T L+HGV AVGYGT  +G + YWL++NSWG DWGE GY R++R+
Sbjct: 282 REFQLYQSGVFTGRCGTSLDHGVVAVGYGTEADGGRDYWLVRNSWGADWGEGGYIRMERN 341

Query: 315 IDQPQGQCGIAMFASFPV 332
           +    G+CGIAM AS+PV
Sbjct: 342 VGARAGKCGIAMEASYPV 359


>gi|40806500|gb|AAR92155.1| putative cysteine protease 2 [Iris x hollandica]
          Length = 359

 Score =  280 bits (716), Expect = 8e-73,   Method: Compositional matrix adjust.
 Identities = 160/360 (44%), Positives = 218/360 (60%), Gaps = 32/360 (8%)

Query: 1   MAKYFLIVVLIISGSCASQATYRTFDEGSIAEK------FEQWKAQYGRTYKESAENSKR 54
           MA   +++ L+++ +    A    F+E  +A +      +E+W++ +    ++ +E +KR
Sbjct: 1   MATKSMLLALVVALAFVGVARTIPFNEKDLASEESLWGLYERWRSHH-TVSRDLSEKNKR 59

Query: 55  FEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGT 114
           F +FK+N   +  FN     +  Y L LNKFAD+T QEF ++  G K+  H +     GT
Sbjct: 60  FNVFKENAKFIHEFNKK---DAPYKLGLNKFADMTNQEFRSTYAGSKIHHHRTQ---RGT 113

Query: 115 P-----FLYKSSQ-VPPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLV 161
           P     F+Y++   +P SV+W  +GAV PVK QGQC        +A+VEGIN IK N+LV
Sbjct: 114 PRATGSFMYENVHSIPASVDWRTQGAVAPVKDQGQCGSCWAFSTIASVEGINKIKTNQLV 173

Query: 162 SLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAED 221
            LS QQLVDC T D N GC GG MD AF++I  N GIT+++ Y Y     G C S ++  
Sbjct: 174 PLSGQQLVDCDT-DQNEGCNGGLMDYAFEFIKSNGGITSESAYPYTA-EQGSCAS-ESSA 230

Query: 222 HAAQITNYEDVPPNDEESLLKAVANQPVSVAIDAS--ALQFYSGGVFNGYCETFLNHGVT 279
               I  YEDVP N+E +L+KAVANQ VSVAI+AS  A QFYS GVF G C   L+HGV 
Sbjct: 231 PVVTIDGYEDVPANNEAALMKAVANQVVSVAIEASGMAFQFYSEGVFTGSCGNELDHGVA 290

Query: 280 AVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVSKESAQP 339
            VGYG + +G KYW+++NSWG +WGE GY R+QR I    G CGIAM  S+P+ K S  P
Sbjct: 291 VVGYGATRDGTKYWIVRNSWGAEWGEKGYIRMQRGIRARHGLCGIAMEPSYPL-KTSPNP 349


>gi|242081867|ref|XP_002445702.1| hypothetical protein SORBIDRAFT_07g024430 [Sorghum bicolor]
 gi|241942052|gb|EES15197.1| hypothetical protein SORBIDRAFT_07g024430 [Sorghum bicolor]
          Length = 372

 Score =  280 bits (716), Expect = 8e-73,   Method: Compositional matrix adjust.
 Identities = 150/319 (47%), Positives = 199/319 (62%), Gaps = 22/319 (6%)

Query: 34  FEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEF 93
           +E+W+ ++    ++  + ++RF +FK+N+  +  FN     +  Y LRLN+F D+T  EF
Sbjct: 47  YERWRGRHA-VARDLGDKARRFNVFKENVRLIHDFNQR---DEPYKLRLNRFGDMTADEF 102

Query: 94  IASQTGFKMSDHS---SSLKANGTPFLYKSSQ-VPPSVNWIEKGAVTPVKYQGQCA---- 145
                G +++ H       + + + F+Y  ++ +P SV+W +KGAVT VK QGQC     
Sbjct: 103 RRHYAGSRVAHHRMFRGDRQGSASSFMYAGARDLPTSVDWRQKGAVTDVKDQGQCGSCWA 162

Query: 146 ---VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDA 202
              +AAVEGINAIK   L SLSEQQLVDC T   N GC GG MD AF+YI ++ G+  + 
Sbjct: 163 FSTIAAVEGINAIKTKNLTSLSEQQLVDCDTK-GNAGCDGGLMDYAFQYIAKHGGVAAED 221

Query: 203 VYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA--LQF 260
            Y Y+      C    A   A  I  YEDVP NDE +L KAVA+QPVSVAI+AS    QF
Sbjct: 222 AYPYKARQAS-CKKSPAP--AVTIDGYEDVPANDESALKKAVAHQPVSVAIEASGSHFQF 278

Query: 261 YSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQG 320
           YS GVF G C T L+HGVTAVGYG + +G KYW++KNSWG +WGE GY R+ RD+   +G
Sbjct: 279 YSEGVFAGRCGTELDHGVTAVGYGVAADGTKYWVVKNSWGPEWGEKGYIRMARDVAAKEG 338

Query: 321 QCGIAMFASFPVSKESAQP 339
            CGIAM AS+PV K S  P
Sbjct: 339 HCGIAMEASYPV-KTSPNP 356


>gi|414870137|tpg|DAA48694.1| TPA: vignain [Zea mays]
          Length = 484

 Score =  280 bits (715), Expect = 9e-73,   Method: Compositional matrix adjust.
 Identities = 151/322 (46%), Positives = 197/322 (61%), Gaps = 25/322 (7%)

Query: 34  FEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEF 93
           +E+W+ ++    ++  + ++RF +FK N+  +  FN     +  Y LRLN+F D+T  EF
Sbjct: 156 YERWRGRHA-LARDLGDKARRFNVFKANVRLIHEFNRR---DEPYKLRLNRFGDMTADEF 211

Query: 94  IASQTGFKMSDHS------SSLKANGTPFLYKSSQ-VPPSVNWIEKGAVTPVKYQGQCA- 145
                G +++ H           A+ + F+Y  ++ VP SV+W +KGAVT VK QGQC  
Sbjct: 212 RRHYAGSRVAHHRMFRGDRQGSSASASSFMYADARDVPASVDWRQKGAVTDVKDQGQCGS 271

Query: 146 ------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGIT 199
                 +AAVEGINAIK   L SLSEQQLVDC T   N GC GG MD AF+YI ++ G+ 
Sbjct: 272 CWAFSTIAAVEGINAIKTKNLTSLSEQQLVDCDTK-ANAGCNGGLMDYAFQYIAKHGGVA 330

Query: 200 NDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA-- 257
            +  Y Y       C   K+      I  YEDVP NDE +L KAVA+QPVSVAI+AS   
Sbjct: 331 AEDAYPYRARQAS-CK--KSPAPVVTIDGYEDVPANDESALKKAVAHQPVSVAIEASGSH 387

Query: 258 LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQ 317
            QFYS GVF+G C T L+HGV AVGYG + +G KYWL+KNSWG +WGE GY R+ RD+  
Sbjct: 388 FQFYSEGVFSGRCGTELDHGVAAVGYGVTADGTKYWLVKNSWGPEWGEKGYIRMARDVAA 447

Query: 318 PQGQCGIAMFASFPVSKESAQP 339
            +G CGIAM AS+PV K S  P
Sbjct: 448 KEGHCGIAMEASYPV-KTSPNP 468


>gi|34223513|gb|AAQ62999.1| oil palm polygalacturonase allergen PEST472 [Elaeis guineensis]
          Length = 525

 Score =  280 bits (715), Expect = 9e-73,   Method: Compositional matrix adjust.
 Identities = 148/312 (47%), Positives = 194/312 (62%), Gaps = 14/312 (4%)

Query: 34  FEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAA-IGNRSYTLRLNKFADLTPQE 92
           +E W A++GR      E  +RFEIFKDN+  ++  N AA  G+RS+ L LN+FAD+T +E
Sbjct: 50  YEGWLAKHGRADNALGEKERRFEIFKDNVRFIDAHNAAADSGHRSFRLGLNRFADMTNEE 109

Query: 93  FIASQTGFKMSDHSSSLKANGTPFLYKSSQ-VPPSVNWIEKGAVTPVKYQGQCA------ 145
           +     G + + H    +     + Y + + +P SV+W +KGAVT VK QG C       
Sbjct: 110 YRTVYLGTRPASHRRRARLGSDRYRYNAGEELPESVDWRDKGAVTTVKDQGSCGSCWAFS 169

Query: 146 -VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVY 204
            +AAVEGIN I    L+SLSEQ+LVDC  N  N GC GG MD AF++II N GI  +  Y
Sbjct: 170 TIAAVEGINKIVTGDLISLSEQELVDC-DNGQNQGCNGGLMDYAFEFIINNGGIDTEEDY 228

Query: 205 SYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA--LQFYS 262
            Y+    G CD  +       I  YEDVP NDE++L KAVANQPVSVAI+A     Q Y 
Sbjct: 229 PYKARD-GKCDQYRKNAKVVSIDGYEDVPVNDEKALQKAVANQPVSVAIEAGGREFQLYH 287

Query: 263 GGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQC 322
            G+F G C T L+HGV AVGYGT E G  YW+++NSWG DWGE GY R++R+++   G+C
Sbjct: 288 SGIFTGRCGTDLDHGVVAVGYGT-ENGKDYWIVRNSWGGDWGESGYIRMERNVNASTGKC 346

Query: 323 GIAMFASFPVSK 334
           GIAM +S+P  K
Sbjct: 347 GIAMESSYPTKK 358


>gi|302816222|ref|XP_002989790.1| hypothetical protein SELMODRAFT_184826 [Selaginella moellendorffii]
 gi|300142356|gb|EFJ09057.1| hypothetical protein SELMODRAFT_184826 [Selaginella moellendorffii]
          Length = 358

 Score =  280 bits (715), Expect = 9e-73,   Method: Compositional matrix adjust.
 Identities = 153/313 (48%), Positives = 198/313 (63%), Gaps = 22/313 (7%)

Query: 34  FEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEF 93
           +E+W   +GR Y    E  +RF+IF+DN   +E  N     N++Y L LN FAD+T  EF
Sbjct: 34  YEKWMVDHGRVYNGIGEKERRFQIFRDNAEYIEEHNRQV--NQTYWLGLNNFADMTHDEF 91

Query: 94  IASQTGFKMSDHSSSLKANGTPFLYK-SSQVPPSVNWIEKGAVTPVKYQGQCA------- 145
            A   G K+   S+++K+    F YK ++ +P   +W  KGAV  VK QG C        
Sbjct: 92  KALYFGTKVP-LSNTIKSG---FRYKDATNLPLDTDWRSKGAVATVKNQGACGSCWAFST 147

Query: 146 VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYS 205
           VAAVEG+N I    LVSLSEQ+LVDC     N GC GG MD AF++IIQN G+ ++A Y 
Sbjct: 148 VAAVEGVNQIVTGELVSLSEQELVDC-DKQKNQGCNGGLMDSAFEFIIQNGGLDSEADYP 206

Query: 206 YEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA--LQFYSG 263
           Y+ +S G CD  +   H   I  +EDVP   E  LLKAVANQPVSVAI+AS    Q YSG
Sbjct: 207 YKAVS-GSCDESRRNSHVVTIDGFEDVPAESEADLLKAVANQPVSVAIEASGRNFQLYSG 265

Query: 264 GVFNGYCETFLNHGVTAVGYGTSE--EGI--KYWLIKNSWGQDWGEDGYFRLQRDIDQPQ 319
           GV+ G+C   L+HGV AVGYGTS+  +G+   YW+++NSWG  WGE GY RLQR++  P+
Sbjct: 266 GVYTGHCGYELDHGVVAVGYGTSKTPDGVATDYWIVRNSWGDAWGESGYIRLQRNVASPR 325

Query: 320 GQCGIAMFASFPV 332
           G+CGIAM AS+PV
Sbjct: 326 GKCGIAMMASYPV 338


>gi|4100157|gb|AAD10337.1| cysteine proteinase precursor [Hordeum vulgare]
          Length = 365

 Score =  280 bits (715), Expect = 1e-72,   Method: Compositional matrix adjust.
 Identities = 152/320 (47%), Positives = 198/320 (61%), Gaps = 20/320 (6%)

Query: 27  EGSIAEKFEQWKAQYGRTYKESAENS--KRFEIFKDNLVAVERFNNAAIGNRSYTLRLNK 84
           E S+   +E+W++ Y  + +    ++  +RF +FK N   V   N   +    + L LNK
Sbjct: 34  EESLRGLYERWRSHYTVSRRGLGADAGERRFNVFKQNARYVHEGNKRDM---PFRLALNK 90

Query: 85  FADLTPQEFIASQTGFKMSDH---SSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQ 141
           FAD+T  EF  +  G ++  H   S   + +G      +  +PP+V+W +KGAVT +K Q
Sbjct: 91  FADMTTDEFRRTYAGSRVRHHLSLSGGRRGDGGFRYGDADNLPPAVDWRQKGAVTAIKDQ 150

Query: 142 GQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQ 194
           GQC        + AVEGIN I+  +LVSLSEQ+L+DC  N NN GC GG MD AF++I Q
Sbjct: 151 GQCGSCWAFSTIVAVEGINKIRTGKLVSLSEQELMDC-DNVNNQGCDGGLMDYAFQFI-Q 208

Query: 195 NKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAID 254
             GIT ++ Y Y+G   G CD  K    A  I  YEDVP NDE +L KAVA QPVSVAID
Sbjct: 209 KNGITTESNYPYQG-EQGSCDQAKENAQAVTIDGYEDVPANDESALQKAVAGQPVSVAID 267

Query: 255 ASA--LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQ 312
           AS    QFYS GVF G C T L+HGV AVGYG + +G KYW++KNSWG+DWGE GY R+Q
Sbjct: 268 ASGQDFQFYSEGVFTGECSTDLDHGVAAVGYGATRDGTKYWIVKNSWGEDWGEKGYIRMQ 327

Query: 313 RDIDQPQGQCGIAMFASFPV 332
           R + Q +G CGIAM AS+P 
Sbjct: 328 RGVSQTEGLCGIAMQASYPT 347


>gi|357154164|ref|XP_003576692.1| PREDICTED: vignain-like [Brachypodium distachyon]
          Length = 427

 Score =  280 bits (715), Expect = 1e-72,   Method: Compositional matrix adjust.
 Identities = 150/317 (47%), Positives = 197/317 (62%), Gaps = 25/317 (7%)

Query: 33  KFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQE 92
           +FEQW  ++GR Y    E  +RFE++K+NL  +E FN+   G   YTL  NKFADLT +E
Sbjct: 118 RFEQWMGKHGRAYANGGEKQRRFEVYKENLALIEEFNS---GGHGYTLTDNKFADLTNEE 174

Query: 93  FIASQTGFKMSD--------HSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQC 144
           F A   G   +D        H+S+  A   P    S+ +P  V+W +KGAV  VK QG C
Sbjct: 175 FRAKMLGGLGADPDRRRRARHASN--ALELPGNDNSTDLPKDVDWRKKGAVVEVKNQGSC 232

Query: 145 -------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKG 197
                  AVAA+EG+N IK  +LVSLSEQ+LVDC  +    GC GGFM  AF++++ N G
Sbjct: 233 GSCWAFSAVAAMEGLNQIKNGKLVSLSEQELVDC--DAEAVGCAGGFMSWAFEFVMANHG 290

Query: 198 ITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA 257
           +T +A Y Y+G++ G C + K  + +  IT Y +V  N E  LLK  A QPVSVA+DA  
Sbjct: 291 LTTEASYPYKGIN-GACQTAKLNESSVSITGYVNVTVNSEAELLKVAAVQPVSVAVDAGG 349

Query: 258 L--QFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDI 315
              Q Y+GGVF+G C   +NHGVT VGYG +++  KYW++KNSWG +WGE GY  +QRD 
Sbjct: 350 FLFQLYAGGVFSGPCTAQINHGVTVVGYGETDKAEKYWIVKNSWGPEWGEAGYMLMQRDA 409

Query: 316 DQPQGQCGIAMFASFPV 332
             P G CGIAM AS+PV
Sbjct: 410 GVPTGLCGIAMLASYPV 426


>gi|226501480|ref|NP_001150266.1| cysteine protease 1 precursor [Zea mays]
 gi|195637948|gb|ACG38442.1| cysteine protease 1 precursor [Zea mays]
          Length = 462

 Score =  279 bits (714), Expect = 1e-72,   Method: Compositional matrix adjust.
 Identities = 145/312 (46%), Positives = 195/312 (62%), Gaps = 14/312 (4%)

Query: 34  FEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEF 93
           +E W A++GR Y    E  +RF +F DNL  V+  N  A     + L +N+FADLT  EF
Sbjct: 49  YELWLAEHGRAYNALGERDRRFRVFWDNLRFVDAHNERA-AEHGFRLGMNQFADLTNDEF 107

Query: 94  IASQTGFKMSDHSSSLKANGTPFLYK--SSQVPPSVNWIEKGAVTPVKYQGQC------- 144
            A+  G ++        A G  + +   + ++P SV+W EKGAV PVK QGQC       
Sbjct: 108 RAAYLGARIPAARRRGTAVGERYRHGGGAEELPESVDWREKGAVAPVKNQGQCGSCWAFS 167

Query: 145 AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVY 204
           AV++VE +N I    +V+LSEQ+LV+C+T+  N+GC GG MD AF +II+N GI  +  Y
Sbjct: 168 AVSSVESVNQIVTGEMVTLSEQELVECSTDGGNSGCNGGLMDAAFDFIIKNGGIDTEGDY 227

Query: 205 SYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA--LQFYS 262
            Y+ +  G CD  +       I  +EDVP NDE+SL KAVA+QPVSVAI+A     Q Y 
Sbjct: 228 PYKAVD-GKCDINRENAKVVSIDGFEDVPENDEKSLQKAVAHQPVSVAIEAGGREFQLYK 286

Query: 263 GGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQC 322
            GVF+G C T L+HGV AVGYGT E G  YW+++NSWG  WGEDGY R++R+++   G+C
Sbjct: 287 AGVFSGTCTTNLDHGVVAVGYGT-ENGKDYWIVRNSWGAKWGEDGYIRMERNVNATTGKC 345

Query: 323 GIAMFASFPVSK 334
           GIAM AS+P  K
Sbjct: 346 GIAMMASYPTKK 357


>gi|116781957|gb|ABK22314.1| unknown [Picea sitchensis]
          Length = 369

 Score =  279 bits (714), Expect = 1e-72,   Method: Compositional matrix adjust.
 Identities = 155/321 (48%), Positives = 205/321 (63%), Gaps = 20/321 (6%)

Query: 27  EGSIAEKFEQWKAQYGRTYK-ESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKF 85
           E S+   ++ W  Q+  +   +S E+++RFEIFK+N+  ++  N     +  Y L LNKF
Sbjct: 39  EKSLRSLYDNWALQHRSSRSLDSEEHAERFEIFKENVKYIDSVNKK---DSPYKLGLNKF 95

Query: 86  ADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQ-VPPSVNWIEKGAVTPVKYQGQC 144
           ADL+ +EF A   G KM D     +     F+Y++S+ +P S++W +KGAV  VK QG C
Sbjct: 96  ADLSNEEFKAIYMGTKM-DLRGDREVQSGSFMYQNSEPLPASIDWRQKGAVAAVKNQGHC 154

Query: 145 A-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKG 197
                   VA+VEGIN I    LVSLSEQQLVDC+T   N+GC GG MD AF+YII N G
Sbjct: 155 GSCWAFSTVASVEGINYITTGNLVSLSEQQLVDCSTE--NSGCNGGLMDTAFQYIINNGG 212

Query: 198 ITNDAVYSYEGMSTGICDSIKAEDHAAQ--ITNYEDVPPNDEESLLKAVANQPVSVAIDA 255
           I  +  Y Y   +T  C S K      +  I  +EDVP N+E++L +AVA+QPVSVAI+A
Sbjct: 213 IVTEDNYPYTAEATE-CSSTKINSQTTRVVIDGFEDVPANNEQALKEAVAHQPVSVAIEA 271

Query: 256 SA--LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQR 313
           S    QFYS GVF G C T L+HGV AVGYGTS EGI YW+++NSWG  WGE+GY R+Q+
Sbjct: 272 SGQDFQFYSTGVFTGKCGTALDHGVVAVGYGTSPEGINYWIVRNSWGPKWGEEGYIRMQQ 331

Query: 314 DIDQPQGQCGIAMFASFPVSK 334
            I+  +G+CGIAM AS+P  K
Sbjct: 332 GIEAAEGKCGIAMQASYPTKK 352


>gi|238006338|gb|ACR34204.1| unknown [Zea mays]
          Length = 465

 Score =  279 bits (714), Expect = 1e-72,   Method: Compositional matrix adjust.
 Identities = 145/312 (46%), Positives = 194/312 (62%), Gaps = 14/312 (4%)

Query: 34  FEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEF 93
           +E W A++GR Y    E  +RF +F DNL  V+  N  A     + L +N+FADLT  EF
Sbjct: 52  YELWLAEHGRAYNALGERDRRFRVFWDNLRFVDAHNERA-AEHGFRLGMNQFADLTNDEF 110

Query: 94  IASQTGFKMSDHSSSLKANGTPFLYK--SSQVPPSVNWIEKGAVTPVKYQGQC------- 144
            A+  G ++        A G  + +   + ++P SV+W EKGAV PVK QGQC       
Sbjct: 111 RAAYLGARIPASRRRGTAVGERYRHGGGAEELPESVDWREKGAVAPVKNQGQCGSCWAFS 170

Query: 145 AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVY 204
           AV++VE +N I    +V+LSEQ+LV+C+T+  N+GC GG MD AF +II+N GI  +  Y
Sbjct: 171 AVSSVESVNQIVTGEMVTLSEQELVECSTDGGNSGCNGGLMDAAFDFIIKNGGIDTEGDY 230

Query: 205 SYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA--LQFYS 262
            Y+ +  G CD  +       I  +EDVP NDE+SL KAVA+QPVSVAI+A     Q Y 
Sbjct: 231 PYKAVD-GKCDINRENAKVVSIDGFEDVPENDEKSLQKAVAHQPVSVAIEAGGREFQLYK 289

Query: 263 GGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQC 322
            GVF G C T L+HGV AVGYGT E G  YW+++NSWG  WGEDGY R++R+++   G+C
Sbjct: 290 AGVFTGTCTTNLDHGVVAVGYGT-ENGKDYWIVRNSWGAKWGEDGYIRMERNVNATTGKC 348

Query: 323 GIAMFASFPVSK 334
           GIAM AS+P  K
Sbjct: 349 GIAMMASYPTKK 360


>gi|356543112|ref|XP_003540007.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like [Glycine
           max]
          Length = 345

 Score =  279 bits (714), Expect = 1e-72,   Method: Compositional matrix adjust.
 Identities = 148/329 (44%), Positives = 207/329 (62%), Gaps = 18/329 (5%)

Query: 18  SQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRS 77
           S    R   E   +E+ E W AQYG+ YK++AE  KRF+IFK+N+  +E FN A  G++ 
Sbjct: 22  SHIMSRRLFEACTSERHENWMAQYGKVYKDAAEKKKRFQIFKNNVHFIESFNTA--GDKP 79

Query: 78  YTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPP---SVNWIEKGA 134
           + L +N+FADL  +EF A  T       S    A  T   +K ++V     +++W ++GA
Sbjct: 80  FNLSINQFADLHDEEFKALLTNGNKKVRSVVGTATETETSFKYNRVTKLLATMDWRKRGA 139

Query: 135 VTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDD 187
           VTP+K Q +C       AVAA+EGI+ I  ++LVSLSEQ+LVDC   ++  GC GG+M+D
Sbjct: 140 VTPIKDQRRCGSCWAFSAVAAIEGIHQITTSKLVSLSEQELVDCVKGESE-GCNGGYMED 198

Query: 188 AFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHA-AQITNYEDVPPNDEESLLKAVAN 246
           AF+++ +  GI +++ Y Y+G        +K E H  +QI  YE VP N E++L KAVA+
Sbjct: 199 AFEFVAKKGGIASESYYPYKGKDKSC--KVKKETHGVSQIKGYEKVPSNSEKALQKAVAH 256

Query: 247 QPVSVAIDA--SALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWG 304
           QPVSV ++A  +A QFYS G+F G C T  +H +T VGYG S  G KYWL+KNSWG  WG
Sbjct: 257 QPVSVYVEAGGNAFQFYSSGIFTGKCGTNTDHAITVVGYGKSRGGTKYWLVKNSWGAGWG 316

Query: 305 EDGYFRLQRDIDQPQGQCGIAMFASFPVS 333
           E GY R++RDI   +G CGIAM A +P +
Sbjct: 317 EKGYIRMKRDIRAKEGLCGIAMNAFYPTA 345


>gi|2224812|emb|CAB09699.1| cysteine endopeptidase EP-A [Hordeum vulgare subsp. vulgare]
          Length = 365

 Score =  279 bits (714), Expect = 1e-72,   Method: Compositional matrix adjust.
 Identities = 155/321 (48%), Positives = 197/321 (61%), Gaps = 22/321 (6%)

Query: 27  EGSIAEKFEQWKAQYG---RTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLN 83
           E S+   +E+W++ Y    R     AE  +RF +FK N   V   N   +    + L LN
Sbjct: 34  EESLRGLYERWRSHYTVSRRGLGADAEE-RRFNVFKQNARYVHEGNKRDM---PFRLALN 89

Query: 84  KFADLTPQEFIASQTGFKMSDH---SSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKY 140
           KFAD+T  EF  +  G ++  H   S   + +G      +  +PP+V+W +KGAVT +K 
Sbjct: 90  KFADMTTDEFRRTYAGSRVRHHLSLSGGRRGDGGFRYGDADNLPPAVDWRQKGAVTAIKD 149

Query: 141 QGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYII 193
           QGQC        + AVEGIN I+  +LVSLSEQ+L+DC  N NN GC GG MD AF++I 
Sbjct: 150 QGQCGSCWAFSTIVAVEGINKIRTGKLVSLSEQELMDC-DNVNNQGCDGGLMDYAFQFI- 207

Query: 194 QNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAI 253
           Q  GIT ++ Y Y+G   G CD  K    A  I  YEDVP NDE +L KAVA QPVSVAI
Sbjct: 208 QKNGITTESNYPYQG-EQGSCDQAKENAQAVTIDGYEDVPANDESALQKAVAGQPVSVAI 266

Query: 254 DASA--LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRL 311
           DAS    QFYS GVF G C T L+HGV AVGYG + +G KYW++KNSWG+DWGE GY R+
Sbjct: 267 DASGQDFQFYSEGVFTGECSTDLDHGVAAVGYGATRDGTKYWIVKNSWGEDWGEKGYIRM 326

Query: 312 QRDIDQPQGQCGIAMFASFPV 332
           QR + Q +G CGIAM AS+P 
Sbjct: 327 QRGVSQTEGLCGIAMQASYPT 347


>gi|148927382|gb|ABR19827.1| cysteine proteinase [Elaeis guineensis]
          Length = 470

 Score =  279 bits (713), Expect = 1e-72,   Method: Compositional matrix adjust.
 Identities = 148/312 (47%), Positives = 192/312 (61%), Gaps = 14/312 (4%)

Query: 34  FEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAA-IGNRSYTLRLNKFADLTPQE 92
           +E W A++GR Y    E  +RFEIFKDN++ ++  N AA  G+RS+ L LN+FAD+T +E
Sbjct: 50  YEGWLAKHGRAYNALGEKERRFEIFKDNVLFIDAHNAAADAGHRSFRLGLNRFADMTNEE 109

Query: 93  FIASQTGFKMSDHSSSLKANGTPFLYKSSQ-VPPSVNWIEKGAVTPVKYQGQCA------ 145
           + A   G + + H    +     + Y + + +P SV+W  KGAV  VK QG C       
Sbjct: 110 YRAVYLGTRPAGHRRRARVGSDRYRYNAGEDLPESVDWRAKGAVAAVKDQGSCGSCWAFS 169

Query: 146 -VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVY 204
            VAAVEGIN I    L+SLSEQ+LVDC  N  N GC GG MD  F++II N GI  +  Y
Sbjct: 170 TVAAVEGINKIVTGDLISLSEQELVDC-DNGYNQGCNGGLMDYGFEFIINNGGIDTEEDY 228

Query: 205 SYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA--LQFYS 262
            Y     G CD  +       I  YEDVP NDE++L KAVANQPVSVAI+A     Q Y 
Sbjct: 229 PYTARD-GKCDQYRKNAKVVSIDGYEDVPVNDEKALQKAVANQPVSVAIEAGGREFQLYH 287

Query: 263 GGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQC 322
            G+F G C T L+HGV AVGYGT E G  YW+++NSWG DWGE GY R++R+++   G+C
Sbjct: 288 SGIFTGRCGTDLDHGVVAVGYGT-ENGKDYWIVRNSWGGDWGESGYIRMERNVNTSTGKC 346

Query: 323 GIAMFASFPVSK 334
           GIA+  S+P  K
Sbjct: 347 GIAIEPSYPTKK 358


>gi|449448298|ref|XP_004141903.1| PREDICTED: germination-specific cysteine protease 1-like [Cucumis
           sativus]
 gi|449531757|ref|XP_004172852.1| PREDICTED: germination-specific cysteine protease 1-like [Cucumis
           sativus]
          Length = 365

 Score =  279 bits (713), Expect = 2e-72,   Method: Compositional matrix adjust.
 Identities = 156/340 (45%), Positives = 207/340 (60%), Gaps = 24/340 (7%)

Query: 10  LIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFN 69
           L IS S  S+ +     +G + E ++ W A++G+ Y    E  KRF+IFK+NL  ++  N
Sbjct: 16  LSISASALSRRS-----DGEVREIYDLWLAKHGKAYNGIDEREKRFQIFKENLKFIDDHN 70

Query: 70  NAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSS---QVPPS 126
           +    NR+Y + LN FADLT +E+ A   G +       +KA      Y  +   ++P S
Sbjct: 71  SE---NRTYKVGLNMFADLTNEEYRALYLGTRSPPARRVMKAKTASRRYAVNNLDRLPES 127

Query: 127 VNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNG 179
           ++W  +GAV PVK QG C        +AAVEGIN I    L+SLSEQ+LV C     N+G
Sbjct: 128 MDWRTRGAVAPVKNQGSCGSCWAFSTIAAVEGINQIVTGELISLSEQELVSC-DKKYNSG 186

Query: 180 CYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEES 239
           C GG MD AF++II N G+  +  Y YE    G CD  +       I  YEDVP NDEES
Sbjct: 187 CNGGLMDYAFQFIIDNGGLDTEEDYPYEAFD-GQCDPTRKNAKVVSIDAYEDVPANDEES 245

Query: 240 LLKAVANQPVSVAIDAS--ALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKN 297
           L KAVA+QPVSVAI+AS  ALQ Y  GVF G C + L+HGV AVGYG  E G+ YWL++N
Sbjct: 246 LKKAVAHQPVSVAIEASGLALQLYQSGVFTGKCGSALDHGVVAVGYG-KENGVDYWLVRN 304

Query: 298 SWGQDWGEDGYFRLQRDIDQ-PQGQCGIAMFASFPVSKES 336
           SWG  WGEDGYF+L+R++    +G+CGIAM AS+PV  ++
Sbjct: 305 SWGTSWGEDGYFKLERNVKHITEGKCGIAMQASYPVKNDN 344


>gi|224096714|ref|XP_002310708.1| predicted protein [Populus trichocarpa]
 gi|222853611|gb|EEE91158.1| predicted protein [Populus trichocarpa]
          Length = 356

 Score =  279 bits (713), Expect = 2e-72,   Method: Compositional matrix adjust.
 Identities = 162/345 (46%), Positives = 219/345 (63%), Gaps = 25/345 (7%)

Query: 19  QATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSY 78
           Q+++R+ DE  +   ++ W  ++G+ Y    E +KRFEIFK+NL  ++  N+    NR+Y
Sbjct: 15  QSSWRSDDE--VMSIYKWWLQKHGKAYNRLGEKAKRFEIFKNNLRFIDEHNSQ---NRTY 69

Query: 79  TLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTP---FLYKS-SQVPPSVNWIEKGA 134
            + L KFADLT QE+ A   G + SD    L  +  P   + YK+  ++P SV+W  KGA
Sbjct: 70  KVGLTKFADLTNQEYRAMFLGTR-SDPKRRLMKSKNPSERYAYKAGDKLPESVDWRGKGA 128

Query: 135 VTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDD 187
           V P+K QG C        VAAVEGIN I    L+SLSEQ+LVDC     N GC GG MD 
Sbjct: 129 VNPIKDQGSCGSCWAFSTVAAVEGINQIVTGELISLSEQELVDC-DRFYNAGCNGGLMDY 187

Query: 188 AFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQ 247
           AF++II N G+  +  Y Y G +   CD  K +  A  I  +EDV P DE++L KAVA+Q
Sbjct: 188 AFQFIINNGGLDTEKDYPYLG-NDDTCDRDKMKTKAVSIDGFEDVLPFDEKALQKAVAHQ 246

Query: 248 PVSVAIDAS--ALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGE 305
           PVSVAI+AS  ALQFY  GVF G C T L+HGV  VGYGT E+G+ YWL++NSWG +WGE
Sbjct: 247 PVSVAIEASGMALQFYQSGVFTGECGTALDHGVVVVGYGT-EKGLDYWLVRNSWGTEWGE 305

Query: 306 DGYFRLQRDI-DQPQGQCGIAMFASFPVS--KESAQPSSADKSSA 347
            GY ++QR++ D   G+CGIAM +S+PV   + +A+P  AD+S+ 
Sbjct: 306 HGYIKMQRNVRDTYTGRCGIAMESSYPVKNGQNTAKPYLADESAG 350


>gi|255567869|ref|XP_002524912.1| cysteine protease, putative [Ricinus communis]
 gi|223535747|gb|EEF37409.1| cysteine protease, putative [Ricinus communis]
          Length = 366

 Score =  279 bits (713), Expect = 2e-72,   Method: Compositional matrix adjust.
 Identities = 155/335 (46%), Positives = 216/335 (64%), Gaps = 23/335 (6%)

Query: 19  QATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSY 78
           Q+++R+ +E  +   +  W A++ +TY +  E  KRFEIFK+NL  ++  NN+   NR+Y
Sbjct: 35  QSSWRSDNE--VISMYNWWLAKHSKTYNKLGEREKRFEIFKNNLRFIDEHNNSK--NRTY 90

Query: 79  TLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTP---FLYKSSQV-PPSVNWIEKGA 134
            + L +FADLT +E+ A   G K SD    L  +  P   + +K+  V P S++W + GA
Sbjct: 91  KVGLTRFADLTNEEYRAKFLGTK-SDPKRRLMKSKNPSQRYAFKAGDVLPESIDWRQSGA 149

Query: 135 VTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDD 187
           V+ +K QG C        +AAVEG+N I    L+SLSEQ+LVDC     N GC GG MD+
Sbjct: 150 VSAIKDQGSCGSCWAFSTIAAVEGVNKIVTGELISLSEQELVDC-DRSYNAGCNGGLMDN 208

Query: 188 AFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQ 247
           AF++II N GI  D  Y Y+ +  G CD+ K ++ A  I  +EDV   DE +L KAVA+Q
Sbjct: 209 AFQFIINNGGIDTDKDYPYQAVD-GKCDTTKVKNKAVTIDGFEDVMAFDEMALQKAVAHQ 267

Query: 248 PVSVAIDAS--ALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGE 305
           PVSVAI+AS  ALQFY  GVF G C + L+HGV  VGYGT E+GI YWL++NSWG+DWGE
Sbjct: 268 PVSVAIEASGMALQFYQSGVFTGECGSALDHGVVIVGYGT-EDGIDYWLVRNSWGRDWGE 326

Query: 306 DGYFRLQRD-IDQPQGQCGIAMFASFPVSKESAQP 339
           +GY ++QR+ +D   G+CGIAM +S+P+ K +  P
Sbjct: 327 NGYIKMQRNVVDTFTGKCGIAMESSYPI-KNTQNP 360


>gi|255547982|ref|XP_002515048.1| cysteine protease, putative [Ricinus communis]
 gi|223546099|gb|EEF47602.1| cysteine protease, putative [Ricinus communis]
          Length = 359

 Score =  278 bits (712), Expect = 2e-72,   Method: Compositional matrix adjust.
 Identities = 155/346 (44%), Positives = 213/346 (61%), Gaps = 34/346 (9%)

Query: 23  RTFD--------EGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIG 74
           R+FD        E S+   +E+W++ +    +   E ++RF +FK+NL  + + N     
Sbjct: 21  RSFDYKEEDLASEESLWNLYERWRSHH-TVSRSLTEKNQRFNVFKENLKHIHKVNQK--- 76

Query: 75  NRSYTLRLNKFADLTPQEFIASQTGFKMSD----HSSSLKANGTPFLYK-SSQVPPSVNW 129
           +R Y LRLNKFAD+T  EF+    G K+S     H S  +   T F ++ +S +P S++W
Sbjct: 77  DRPYKLRLNKFADMTNHEFLQHYGGSKVSHYRMFHGSRRQ---TGFAHENTSNLPSSIDW 133

Query: 130 IEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYG 182
            ++GAVT VK QG+C       +VAAVEGIN IK   L+SLSEQ+LVDC  N  N+GC G
Sbjct: 134 RKQGAVTGVKDQGKCGSCWAFSSVAAVEGINKIKTGELISLSEQELVDC--NSVNHGCDG 191

Query: 183 GFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLK 242
           G M+ AF +I +  G+T +  Y Y     G CDS K       I  YE VP NDE +L++
Sbjct: 192 GLMEQAFSFIEKTGGLTTENNYPYRA-KDGYCDSAKMNTPMVTIDGYEMVPENDEHALMQ 250

Query: 243 AVANQPVSVAIDASA--LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWG 300
           AVANQPVS+AIDA     QFYS GV+ G C T LNHGV  VGYG +++G KYW++KNSWG
Sbjct: 251 AVANQPVSIAIDAGGQDFQFYSEGVYTGDCGTELNHGVALVGYGATQDGTKYWIVKNSWG 310

Query: 301 QDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVSKES--AQPSSADK 344
            +WGE+G+ R+QR+ D  +G CGI + AS+P+ + S   QP S+ K
Sbjct: 311 SEWGENGFIRMQRENDVEEGLCGITLEASYPIKQRSDIKQPPSSGK 356


>gi|357452869|ref|XP_003596711.1| Cysteine proteinase [Medicago truncatula]
 gi|355485759|gb|AES66962.1| Cysteine proteinase [Medicago truncatula]
          Length = 344

 Score =  278 bits (711), Expect = 2e-72,   Method: Compositional matrix adjust.
 Identities = 143/316 (45%), Positives = 207/316 (65%), Gaps = 16/316 (5%)

Query: 30  IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLT 89
           + EK EQW  ++G+ YK++AE  +RF+IFK+NL  +E FN  A G+  + L +N+F D T
Sbjct: 31  LLEKHEQWMEEHGKFYKDAAEKEQRFQIFKENLEFIESFN--AAGDNGFNLSINQFGDQT 88

Query: 90  PQEFIAS-QTGFKMSDHSSSLKA--NGTPFLYKS-SQVPPSVNWIEKGAVTPVKYQGQCA 145
             EF A+   G K       + A    + F Y++ ++VP +++W E+GAVTP+K+Q  C 
Sbjct: 89  NDEFKANYLNGKKKPLIGVGIAAIEEESVFRYENVTEVPATMDWRERGAVTPIKHQHLCG 148

Query: 146 -------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGI 198
                  VAA+EGI+ I   RLVSLSEQ+LVDC   +  +GC GG+++DA  +I++  GI
Sbjct: 149 SCWAFATVAAIEGIHQITTGRLVSLSEQELVDCVKTNTTDGCNGGYVEDACDFIVKKGGI 208

Query: 199 TNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDAS-- 256
           T++  Y Y  +  G C+  K   + A+I  YE VP N+E++LLKAVANQP++V I A+  
Sbjct: 209 TSETNYPYTRVD-GKCNVRKGTYNVAKIKGYEHVPANNEKALLKAVANQPIAVYIAATKR 267

Query: 257 ALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDID 316
           A QFYS G+  G C   L+H VT VGYGTS++G+KYWL+KNSWG  WGE GY +++RD+ 
Sbjct: 268 AFQFYSSGILKGKCGIDLDHTVTIVGYGTSDDGVKYWLVKNSWGTKWGEKGYIKIKRDVH 327

Query: 317 QPQGQCGIAMFASFPV 332
             +G CGIAM  ++P+
Sbjct: 328 AKEGSCGIAMVPTYPI 343


>gi|30141019|dbj|BAC75923.1| cysteine protease-1 [Helianthus annuus]
          Length = 461

 Score =  278 bits (711), Expect = 2e-72,   Method: Compositional matrix adjust.
 Identities = 153/324 (47%), Positives = 201/324 (62%), Gaps = 19/324 (5%)

Query: 20  ATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYT 79
           A  RT DE  +   +E W  ++G+TY    E  +RF+IFKDNL  ++  N+   G+ +Y 
Sbjct: 40  APLRTDDE--VNALYESWLVKHGKTYNALGEKDRRFQIFKDNLRFIDEHNS---GDHTYK 94

Query: 80  LRLNKFADLTPQEFIASQTGFKMSDHSSSL-KANGTPFLYKSSQ-VPPSVNWIEKGAVTP 137
           L LNKFADLT +E+  + TG K  D    L K     + Y+S   +P  V+W E+GAVT 
Sbjct: 95  LGLNKFADLTNEEYRMTYTGIKTIDDKKKLSKMKSDRYAYRSGDSLPEYVDWREQGAVTD 154

Query: 138 VKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFK 190
           VK QG C          +VEG+N I    L+S+SEQ+LV+C T+  N GC GG MD AF+
Sbjct: 155 VKDQGSCGSCWAFSTTGSVEGVNKIVTGDLISVSEQELVNCDTS-YNQGCNGGLMDYAFE 213

Query: 191 YIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVS 250
           +II+N GI  +  Y Y G   G CD  K       I +YEDVP NDE SL KAV+NQPV+
Sbjct: 214 FIIKNGGIDTEEDYPYTGKD-GKCDKNKKNAKVVTIDSYEDVPVNDESSLKKAVSNQPVA 272

Query: 251 VAIDASA--LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGY 308
           VAI+A     QFY+ G+F G C T L+HGV A GYGT E+G  YWL+KNSWG +WGE GY
Sbjct: 273 VAIEAGGRDFQFYTSGIFTGSCGTALDHGVLAAGYGT-EDGKDYWLVKNSWGAEWGEGGY 331

Query: 309 FRLQRDIDQPQGQCGIAMFASFPV 332
            +++R+I    G+CGIAM AS+P+
Sbjct: 332 LKMERNIADKSGKCGIAMEASYPI 355


>gi|147790682|emb|CAN61026.1| hypothetical protein VITISV_001146 [Vitis vinifera]
          Length = 469

 Score =  278 bits (711), Expect = 3e-72,   Method: Compositional matrix adjust.
 Identities = 154/327 (47%), Positives = 206/327 (62%), Gaps = 19/327 (5%)

Query: 19  QATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSY 78
           ++++RT DE  +A  +E W A++G++Y    E  +RF+IFKDNL  ++  N     NR+Y
Sbjct: 40  KSSWRT-DEDVMA-VYEAWLAKHGKSYNALGEKERRFQIFKDNLRFIDEHNAE---NRTY 94

Query: 79  TLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYK-SSQVPPSVNWIEKGAVTP 137
            + LN+FADLT +E+ +   G + +    S       + ++    +P SV+W +KGAV  
Sbjct: 95  KVGLNRFADLTNEEYRSMYLGTRTAAKRRSSNKISDRYAFRVGDSLPESVDWRKKGAVVE 154

Query: 138 VKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFK 190
           VK QG C        +AAVEGIN I    L+SLSEQ+LVDC T+  N GC GG MD AF+
Sbjct: 155 VKDQGSCGSCWAFSTIAAVEGINKIVTGGLISLSEQELVDCDTS-YNEGCNGGLMDYAFE 213

Query: 191 YIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVS 250
           +II N GI ++  Y Y+  S G CD  +       I  YEDVP NDE+SL KAVANQPVS
Sbjct: 214 FIINNGGIDSEEDYPYKA-SDGRCDQYRKNAXVVTIDGYEDVPENDEKSLEKAVANQPVS 272

Query: 251 VAIDASA--LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGY 308
           VAI+A     Q Y  G+F G C T L+HGVTAVGYGT E G+ YW++KNSWG  WGE+GY
Sbjct: 273 VAIEAGGREFQLYQSGIFTGRCGTALDHGVTAVGYGT-ENGVDYWIVKNSWGASWGEEGY 331

Query: 309 FRLQRDI-DQPQGQCGIAMFASFPVSK 334
            R++RD+     G+CGIAM AS+P+ K
Sbjct: 332 IRMERDLATSATGKCGIAMEASYPIKK 358


>gi|255568345|ref|XP_002525147.1| cysteine protease, putative [Ricinus communis]
 gi|223535606|gb|EEF37274.1| cysteine protease, putative [Ricinus communis]
          Length = 347

 Score =  278 bits (711), Expect = 3e-72,   Method: Compositional matrix adjust.
 Identities = 151/309 (48%), Positives = 195/309 (63%), Gaps = 17/309 (5%)

Query: 33  KFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQE 92
           ++++W  QYGR Y    E   RF I+  N+  +E  N+    N S+ L  NKFADLT  E
Sbjct: 45  RYDKWLEQYGRKYDTKDEYLLRFGIYHSNIQFIEYINSQ---NLSFKLTDNKFADLTNDE 101

Query: 93  FIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQC-------A 145
           F +   G+++  +    + N +     S+ +P +V+W E GAVTP+K QGQC       A
Sbjct: 102 FNSIYLGYQIRSYK---RRNLSHMHENSTDLPDAVDWRENGAVTPIKDQGQCGSCWAFSA 158

Query: 146 VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYS 205
           VAAVEGIN IK   LVSLSEQ+LVDC  N +N GC GGFM+ AF +I    G+T +  Y 
Sbjct: 159 VAAVEGINKIKTGNLVSLSEQELVDCDVNGDNKGCNGGFMEKAFTFIKSIGGLTTENDYP 218

Query: 206 YEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASALQF--YSG 263
           Y+G + G C+  K ++HA  I  YE VP N+E SL  AV+ QPVSVAIDAS  +F  YS 
Sbjct: 219 YKG-TDGSCEKAKTDNHAVIIGGYETVPANNENSLKVAVSKQPVSVAIDASGYEFQLYSE 277

Query: 264 GVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCG 323
           GVF+GYC   LNHGVT VGYG +  G KYWL+KNSWG+ WGE GY R++RD    +G CG
Sbjct: 278 GVFSGYCGIQLNHGVTIVGYGDN-NGQKYWLVKNSWGKGWGESGYIRMKRDSSDTKGMCG 336

Query: 324 IAMFASFPV 332
           IAM  S+P+
Sbjct: 337 IAMEPSYPI 345


>gi|242071345|ref|XP_002450949.1| hypothetical protein SORBIDRAFT_05g021550 [Sorghum bicolor]
 gi|241936792|gb|EES09937.1| hypothetical protein SORBIDRAFT_05g021550 [Sorghum bicolor]
          Length = 371

 Score =  278 bits (711), Expect = 3e-72,   Method: Compositional matrix adjust.
 Identities = 153/325 (47%), Positives = 203/325 (62%), Gaps = 24/325 (7%)

Query: 27  EGSIAEKFEQWKAQYGRT----YKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRL 82
           E S+   +EQW++ Y  +     +E  + ++ F +FK+N+  +   N      RS+ L L
Sbjct: 35  EESLRALYEQWRSHYMVSRPAGLQEQDDKARWFNVFKENVRYIHEANKKG---RSFRLAL 91

Query: 83  NKFADLTPQEFI-ASQTGFKMSDH---SSSLKANGT-PFLY-KSSQVPPSVNWIEKGAVT 136
           NKFAD+T  EF  A   G +   H   SS ++ +G   F+Y ++  +P +V+W ++GAVT
Sbjct: 92  NKFADMTTDEFRRAYAAGSRTRHHRALSSGIRRHGDGSFMYAQAGNLPLAVDWRQRGAVT 151

Query: 137 PVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAF 189
            +K QGQC        +AAVEGIN I+  +LVSLSEQ+LVDC   DN  GC GG MD AF
Sbjct: 152 GIKDQGQCGSCWAFSTIAAVEGINKIRTGKLVSLSEQELVDCDDVDNQ-GCNGGLMDYAF 210

Query: 190 KYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPV 249
           +YI +N GIT ++ Y Y       C+  K   H   I  YEDVP N+E++L KAVANQPV
Sbjct: 211 QYIKRNGGITTESNYPYLAEQRS-CNKAKERSHDVTIDGYEDVPANNEDALQKAVANQPV 269

Query: 250 SVAIDASA--LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDG 307
           S+AI+AS    QFYS GVF G C T L+HGV AVGYG + +G KYW++KNSWG+DWGE G
Sbjct: 270 SIAIEASGQDFQFYSEGVFTGSCGTELDHGVAAVGYGITRDGTKYWIVKNSWGEDWGERG 329

Query: 308 YFRLQRDIDQPQGQCGIAMFASFPV 332
           Y R+QR I   QG CGIAM  S+P 
Sbjct: 330 YIRMQRGISDSQGLCGIAMEPSYPT 354


>gi|225458701|ref|XP_002284973.1| PREDICTED: cysteine proteinase RD21a-like [Vitis vinifera]
          Length = 467

 Score =  278 bits (711), Expect = 3e-72,   Method: Compositional matrix adjust.
 Identities = 154/327 (47%), Positives = 206/327 (62%), Gaps = 19/327 (5%)

Query: 19  QATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSY 78
           ++++RT DE  +A  +E W A++G++Y    E  +RF+IFKDNL  ++  N     NR+Y
Sbjct: 38  KSSWRT-DEDVMA-VYEAWLAKHGKSYNALGEKERRFQIFKDNLRFIDEHNAE---NRTY 92

Query: 79  TLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYK-SSQVPPSVNWIEKGAVTP 137
            + LN+FADLT +E+ +   G + +    S       + ++    +P SV+W +KGAV  
Sbjct: 93  KVGLNRFADLTNEEYRSMYLGTRTAAKRRSSNKISDRYAFRVGDSLPESVDWRKKGAVVE 152

Query: 138 VKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFK 190
           VK QG C        +AAVEGIN I    L+SLSEQ+LVDC T+  N GC GG MD AF+
Sbjct: 153 VKDQGSCGSCWAFSTIAAVEGINKIVTGGLISLSEQELVDCDTS-YNEGCNGGLMDYAFE 211

Query: 191 YIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVS 250
           +II N GI ++  Y Y+  S G CD  +       I  YEDVP NDE+SL KAVANQPVS
Sbjct: 212 FIINNGGIDSEEDYPYKA-SDGRCDQYRKNAKVVTIDGYEDVPENDEKSLEKAVANQPVS 270

Query: 251 VAIDASA--LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGY 308
           VAI+A     Q Y  G+F G C T L+HGVTAVGYGT E G+ YW++KNSWG  WGE+GY
Sbjct: 271 VAIEAGGREFQLYQSGIFTGRCGTALDHGVTAVGYGT-ENGVDYWIVKNSWGASWGEEGY 329

Query: 309 FRLQRDI-DQPQGQCGIAMFASFPVSK 334
            R++RD+     G+CGIAM AS+P+ K
Sbjct: 330 IRMERDLATSATGKCGIAMEASYPIKK 356


>gi|224083362|ref|XP_002306996.1| predicted protein [Populus trichocarpa]
 gi|222856445|gb|EEE93992.1| predicted protein [Populus trichocarpa]
          Length = 336

 Score =  278 bits (711), Expect = 3e-72,   Method: Compositional matrix adjust.
 Identities = 155/316 (49%), Positives = 196/316 (62%), Gaps = 18/316 (5%)

Query: 30  IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLT 89
           I + FE W +++ + Y+   E   RFEIFKDNL  ++  N   +   +Y L LN+FADL+
Sbjct: 29  IIDLFESWISKHQKIYESIEEKWHRFEIFKDNLFHIDETNKKVV---NYWLGLNEFADLS 85

Query: 90  PQEFIASQTGFKMSDHSSSLKANGTPFLYKS-SQVPPSVNWIEKGAVTPVKYQGQCA--- 145
            +EF     G  +    S+ +     F YK  S +P SV+W +KGAVT VK QG C    
Sbjct: 86  HEEFKNKYLGLNVD--LSNRRECSEEFTYKDVSSIPKSVDWRKKGAVTDVKNQGSCGSCW 143

Query: 146 ----VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITND 201
               VAAVEGIN I    L SLSEQ+LVDC T   NNGC GG MD AF YII N G+  +
Sbjct: 144 AFSTVAAVEGINQIVTGNLTSLSEQELVDCDTT-YNNGCNGGLMDYAFAYIISNGGLHKE 202

Query: 202 AVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA--LQ 259
             Y Y  M  G C+  KAE     I+ Y DVP N EESLLKA+ANQP+SVAIDAS    Q
Sbjct: 203 EDYPYI-MEEGTCEMRKAESEVVTISGYHDVPQNSEESLLKALANQPLSVAIDASGRDFQ 261

Query: 260 FYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQ 319
           FYSGGVF+G+C T L+HGV AVGYG S +G+ + ++KNSWG  WGE G+ R++R+  +P 
Sbjct: 262 FYSGGVFDGHCGTELDHGVAAVGYG-SAKGLDFIVVKNSWGSKWGEKGFIRMKRNTGKPA 320

Query: 320 GQCGIAMFASFPVSKE 335
           G CGI   AS+P  K+
Sbjct: 321 GLCGINKMASYPTKKK 336


>gi|18141281|gb|AAL60578.1|AF454956_1 senescence-associated cysteine protease [Brassica oleracea]
          Length = 445

 Score =  278 bits (710), Expect = 3e-72,   Method: Compositional matrix adjust.
 Identities = 157/325 (48%), Positives = 198/325 (60%), Gaps = 20/325 (6%)

Query: 18  SQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRS 77
           ++A +R  +E    + FE+W  +  + Y    E  KRFEIF DNL  V+  N  ++ N+S
Sbjct: 24  AKADHRNPEE---VKMFERWLVENHKNYNGLGEKDKRFEIFMDNLKFVQEHN--SVPNQS 78

Query: 78  YTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYK-SSQVPPSVNWIEKGAVT 136
           Y L L +FADLT +EF A     KM     S+K+    +L+    ++P  V+W  KGAV 
Sbjct: 79  YELGLTRFADLTNEEFRAIYLRSKMERTRDSVKSE--RYLHNVGDKLPDEVDWRAKGAVV 136

Query: 137 PVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAF 189
           PVK QG C       A+ AVEGIN IK   LVSLSEQ+LVDC T+  NNGC GG MD AF
Sbjct: 137 PVKDQGSCGSCWAFSAIGAVEGINQIKTGELVSLSEQELVDCDTS-YNNGCGGGLMDYAF 195

Query: 190 KYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPV 249
           ++II N GI  +  Y Y      IC++ K       I  YEDVP N E SL KA+ANQP+
Sbjct: 196 QFIISNGGIDTEEDYPYTATDDNICNTDKKNTRVVTIDGYEDVPEN-ENSLKKALANQPI 254

Query: 250 SVAIDASA--LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDG 307
           SVAI+A     Q Y  GVF G C T L+HGV AVGYGTSE G  YW+I+NSWG +WGE G
Sbjct: 255 SVAIEAGGRGFQLYKSGVFTGTCGTALDHGVVAVGYGTSE-GQDYWIIRNSWGSNWGESG 313

Query: 308 YFRLQRDIDQPQGQCGIAMFASFPV 332
           Y +LQR+I    G+CG+AM AS+P 
Sbjct: 314 YIKLQRNIKDSSGKCGVAMMASYPT 338


>gi|224065647|ref|XP_002301901.1| predicted protein [Populus trichocarpa]
 gi|222843627|gb|EEE81174.1| predicted protein [Populus trichocarpa]
          Length = 336

 Score =  278 bits (710), Expect = 3e-72,   Method: Compositional matrix adjust.
 Identities = 154/316 (48%), Positives = 195/316 (61%), Gaps = 18/316 (5%)

Query: 30  IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLT 89
           I + FE W +++G+ Y+   E   RFEIFKDNL  ++  N   +   +Y L LN+F+DL+
Sbjct: 29  IIDLFESWISKHGKIYESIEEKWLRFEIFKDNLFHIDETNKKVV---NYWLGLNEFSDLS 85

Query: 90  PQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQ-VPPSVNWIEKGAVTPVKYQGQCA--- 145
            +EF     G K+    S  +     F YK    +P SV+W +KGAVT VK QG C    
Sbjct: 86  HEEFKNKYLGLKVD--MSERRECSQEFNYKDVMSIPKSVDWRKKGAVTDVKNQGSCGSCW 143

Query: 146 ----VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITND 201
               VAAVEGIN I    L SLSEQ+LVDC T  NN GC GG MD AF YII N G+  +
Sbjct: 144 AFSTVAAVEGINQIVTGNLTSLSEQELVDCDTT-NNYGCNGGLMDYAFSYIISNGGLHKE 202

Query: 202 AVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA--LQ 259
             Y Y  M  G C+  K E     I+ Y DVP N EESLLKA+ANQP+SVAI+AS    Q
Sbjct: 203 VDYPYI-MEEGTCEMRKEESEVVTISGYHDVPQNSEESLLKALANQPLSVAIEASGRDFQ 261

Query: 260 FYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQ 319
           FYSGGVF+G+C T L+HGV AVGYG++  G+ Y ++KNSWG  WGE GY R++R+  +P 
Sbjct: 262 FYSGGVFDGHCGTQLDHGVAAVGYGSTN-GLDYIIVKNSWGSKWGEKGYIRMKRNTGKPA 320

Query: 320 GQCGIAMFASFPVSKE 335
           G CGI   AS+P  K+
Sbjct: 321 GLCGINKMASYPTKKK 336


>gi|255555337|ref|XP_002518705.1| cysteine protease, putative [Ricinus communis]
 gi|223542086|gb|EEF43630.1| cysteine protease, putative [Ricinus communis]
          Length = 471

 Score =  278 bits (710), Expect = 3e-72,   Method: Compositional matrix adjust.
 Identities = 152/316 (48%), Positives = 194/316 (61%), Gaps = 17/316 (5%)

Query: 30  IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLT 89
           +   +E W  ++G+ Y    E  KRFEIFKDNL  ++  N+    +RSY + LN+FADLT
Sbjct: 47  VRRMYEMWLVEHGKAYNALGEKEKRFEIFKDNLRFIDEHNSV---DRSYKVGLNRFADLT 103

Query: 90  PQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQ-VPPSVNWIEKGAVTPVKYQGQCA--- 145
            +E+ A   G KM   +  L      +L+K    +P +V+W EKGAV PVK QGQC    
Sbjct: 104 NEEYKAMFLGTKMERKNRFLGTRSQRYLFKDGDDLPENVDWREKGAVVPVKDQGQCGSCW 163

Query: 146 ----VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITND 201
               V AVEGIN I    L+SLSEQ+LVDC  +  N GC GG MD AF++II N GI  +
Sbjct: 164 AFSTVGAVEGINQIVTGELISLSEQELVDCDKS-YNQGCNGGLMDYAFEFIINNGGIDTE 222

Query: 202 AVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDAS--ALQ 259
             Y Y+  S  ICD  +       I  YEDVP NDE SL KAVA+QPVSVAI+A   A Q
Sbjct: 223 EDYPYKA-SDNICDPNRKNAKVVTIDGYEDVPENDENSLKKAVAHQPVSVAIEAGGRAFQ 281

Query: 260 FYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDI-DQP 318
            Y  GVF G C T L+HGV AVGYGT E G+ YW+++NSWG  WGE GY R++R++ +  
Sbjct: 282 LYKSGVFTGRCGTELDHGVVAVGYGT-ENGVNYWIVRNSWGSAWGESGYIRMERNVANTK 340

Query: 319 QGQCGIAMFASFPVSK 334
            G+CGIA+  S+P  K
Sbjct: 341 TGKCGIAIQPSYPTKK 356


>gi|356533293|ref|XP_003535200.1| PREDICTED: LOW QUALITY PROTEIN: cysteine proteinase RD21a-like
           [Glycine max]
          Length = 466

 Score =  278 bits (710), Expect = 4e-72,   Method: Compositional matrix adjust.
 Identities = 151/322 (46%), Positives = 199/322 (61%), Gaps = 19/322 (5%)

Query: 25  FDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNK 84
           +DE      +E W  ++G+ Y    E  +RF+IFKDNL  +E  N A  G++SY L LNK
Sbjct: 39  YDESHTRHVYEAWLVKHGKAYNALGEKERRFKIFKDNLRFIEEHNGA--GDKSYKLGLNK 96

Query: 85  FADLTPQEFIASQTGFKM---SDHSSSLKANGTPFLYKSSQ-VPPSVNWIEKGAVTPVKY 140
           FADLT +E+ A   G +     + ++ +      + Y++ + +P  V+W EKGAVTP+K 
Sbjct: 97  FADLTNEEYRAMFLGTRTRGPKNKAAVVAKKTDRYAYRAGEELPAMVDWREKGAVTPIKD 156

Query: 141 QGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYII 193
           QGQC        V AVEGIN I    L SLSEQ+LVDC     N GC GG MD AF++I+
Sbjct: 157 QGQCGSCWAFSTVGAVEGINQIVTGNLTSLSEQELVDC-DRGYNMGCNGGLMDYAFEFIV 215

Query: 194 QNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAI 253
           QN GI  +  Y Y       CD  +       I  YEDVP NDE+SL+KAVANQPVSVAI
Sbjct: 216 QNGGIDTEEDYPYHAKDN-TCDPNRKNARVVTIDGYEDVPTNDEKSLMKAVANQPVSVAI 274

Query: 254 DASALQF--YSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRL 311
           +A  ++F  Y  GVF G C T L+HGV AVGYGT E G  YWL++NSWG  WGE+GY +L
Sbjct: 275 EAGGMEFQLYQSGVFTGRCGTNLDHGVVAVGYGT-ENGTDYWLVRNSWGSAWGENGYIKL 333

Query: 312 QRDIDQPQ-GQCGIAMFASFPV 332
           +R++   + G+CGIA+ AS+P+
Sbjct: 334 ERNVQNTETGKCGIAIEASYPI 355


>gi|30141027|dbj|BAC75927.1| cysteine protease-5 [Helianthus annuus]
          Length = 365

 Score =  278 bits (710), Expect = 4e-72,   Method: Compositional matrix adjust.
 Identities = 148/336 (44%), Positives = 206/336 (61%), Gaps = 20/336 (5%)

Query: 20  ATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYT 79
            T  T  +  +   +E W A++G+TY    E   RF IF DNL  ++  N +  GNRSY 
Sbjct: 22  VTSNTRTDEEVRNTYELWLARHGKTYNALGEKESRFRIFADNLKFIDEHNLS--GNRSYK 79

Query: 80  LRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPF-----LYKSSQVPPSVNWIEKGA 134
           + LN+FADLT +E+ +   G K+  +    K           + ++   P  V+W E+GA
Sbjct: 80  VGLNQFADLTNEEYRSMYLGTKVDPYRRIAKMQRGEISRRYAVQENEMFPAKVDWRERGA 139

Query: 135 VTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDD 187
           V+PVK QG C        VA+VEGIN I    L+SLSEQ+LVDC  N  N+GC GG MD 
Sbjct: 140 VSPVKNQGGCGSCWAFSTVASVEGINKIVTGDLISLSEQELVDC-DNKYNSGCNGGSMDY 198

Query: 188 AFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQ 247
           AF++I+ N GI +++ Y Y+G+   +CD ++ +     I  YEDVPP +E++L+KAVA+Q
Sbjct: 199 AFQFIVSNGGIDSESDYPYKGVGA-VCDPVRNKAKIVSIDGYEDVPPMNEKALMKAVAHQ 257

Query: 248 PVSVAIDAS--ALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGE 305
           PVSV I+AS  A Q Y+ GV  G C T L+HGV  VGYG SE G  YW+++NSWG +WGE
Sbjct: 258 PVSVGIEASGRAFQLYTSGVLTGSCGTNLDHGVVVVGYG-SENGKDYWIVRNSWGPEWGE 316

Query: 306 DGYFRLQRD-IDQPQGQCGIAMFASFPVSKESAQPS 340
           DGY R++R+ +D P G CGI + AS+P+   +  PS
Sbjct: 317 DGYIRMERNMVDTPVGMCGITLMASYPIKYGNKNPS 352


>gi|297851332|ref|XP_002893547.1| peptidase C1A papain family protein [Arabidopsis lyrata subsp.
           lyrata]
 gi|297339389|gb|EFH69806.1| peptidase C1A papain family protein [Arabidopsis lyrata subsp.
           lyrata]
          Length = 345

 Score =  277 bits (709), Expect = 4e-72,   Method: Compositional matrix adjust.
 Identities = 154/331 (46%), Positives = 214/331 (64%), Gaps = 21/331 (6%)

Query: 18  SQATYRT-FDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNR 76
           S+AT R    E +I    ++W   + R Y +  E   R E+F +NL  +E FNN  +G++
Sbjct: 21  SEATSRVALHEPTIFYYHQKWMINFSRVYDDEFEKQMRLEVFTENLKFIENFNN--MGSQ 78

Query: 77  SYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKA--NGTP-FLYKSSQVPPSV-NWIEK 132
           SY L +NKF D T +EF+A+ TG    + +S  +     TP + +  S V  +  +W  +
Sbjct: 79  SYKLGVNKFTDWTKEEFLATHTGLSGINVTSPFEVVNETTPAWNWTVSDVLGTTKDWRNE 138

Query: 133 GAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFM 185
           GAVTPVKYQG+C       A+AAVEG+  I    L+SLSEQQL+DCA  + NNGC GG M
Sbjct: 139 GAVTPVKYQGECGGCWAFSAIAAVEGLTKIARGNLISLSEQQLLDCA-REQNNGCKGGTM 197

Query: 186 DDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVA 245
            +AF YI++N G++++  Y Y+ +  G C S   +  A  I  +E+VP N+E +LL+AV+
Sbjct: 198 IEAFNYIVKNGGVSSENAYPYQ-VKEGPCRS--NDIPAIVIRGFENVPSNNERALLEAVS 254

Query: 246 NQPVSVAIDASALQF--YSGGVFNGY-CETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQD 302
            QPV+V IDAS   F  YSGGV+N   C T +NH VT VGYGTS+EGIKYWL KNSWG+ 
Sbjct: 255 RQPVAVDIDASETGFIHYSGGVYNARDCGTSVNHAVTLVGYGTSQEGIKYWLAKNSWGKT 314

Query: 303 WGEDGYFRLQRDIDQPQGQCGIAMFASFPVS 333
           WGE+GY R++RD++ PQG CG+A +AS+PV+
Sbjct: 315 WGENGYIRIRRDVEWPQGMCGVAQYASYPVA 345


>gi|226507950|ref|NP_001151278.1| LOC100284911 precursor [Zea mays]
 gi|195645488|gb|ACG42212.1| vignain precursor [Zea mays]
          Length = 376

 Score =  277 bits (708), Expect = 5e-72,   Method: Compositional matrix adjust.
 Identities = 154/324 (47%), Positives = 198/324 (61%), Gaps = 30/324 (9%)

Query: 34  FEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEF 93
           +E+W+ ++    ++  + ++RF +FK N+  +  FN     +  Y LRLN+F D+T  EF
Sbjct: 49  YERWRGRHALA-RDLGDKARRFNVFKANVRLIHEFNRR---DEPYKLRLNRFGDMTADEF 104

Query: 94  IASQTGFKMSDH--------SSSLKANGTPFLYKSSQ-VPPSVNWIEKGAVTPVKYQGQC 144
                G +++ H         SS  A+   F+Y  ++ VP SV+W +KGAVT VK QGQC
Sbjct: 105 RRHYAGSRVAHHRMFRGDRQGSSASAS---FMYADARDVPASVDWRQKGAVTDVKDQGQC 161

Query: 145 A-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKG 197
                   +AAVEGINAIK   L SLSEQQLVDC T   N GC GG MD AF+YI ++ G
Sbjct: 162 GSCWAFSTIAAVEGINAIKTKNLTSLSEQQLVDCDTK-ANAGCNGGLMDYAFQYIAKHGG 220

Query: 198 ITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA 257
           +  +  Y Y       C    A      I  YEDVP NDE +L KAVA+QPVSVAI+AS 
Sbjct: 221 VAAEDAYPYRARQAS-CKKSPAP--VVTIDGYEDVPANDESALKKAVAHQPVSVAIEASG 277

Query: 258 --LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDI 315
              QFYS GVF+G C T L+HGVTAVGYG + +G KYWL+KNSWG +WGE GY R+ RD+
Sbjct: 278 SHFQFYSEGVFSGRCGTELDHGVTAVGYGVTADGTKYWLVKNSWGPEWGEKGYIRMARDV 337

Query: 316 DQPQGQCGIAMFASFPVSKESAQP 339
              +G CGIAM AS+PV K S  P
Sbjct: 338 AAKEGHCGIAMEASYPV-KTSPNP 360


>gi|225446523|ref|XP_002275891.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP2 [Vitis vinifera]
          Length = 358

 Score =  277 bits (708), Expect = 5e-72,   Method: Compositional matrix adjust.
 Identities = 159/343 (46%), Positives = 211/343 (61%), Gaps = 22/343 (6%)

Query: 4   YFLIVVLI---ISGSCASQATYRTFDEGSIAEK-FEQWKAQYGRTYKESAENSKRFEIFK 59
           YF ++++    +S S  S+       E S  EK +E+W  Q+GR YK   E  + F I++
Sbjct: 11  YFALLIMWTVGVSWSAFSEEHEPMESEMSDMEKRYERWLVQHGRRYKNRDEWQRHFGIYQ 70

Query: 60  DNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYK 119
            N+  +   N     N S+TL  N+FAD+T +E+ A   G   S+ S   + N + F  +
Sbjct: 71  SNVRFINYINAQ---NFSFTLTDNQFADMTNEEYKALYMGLGTSETS---RKNQSSFKRE 124

Query: 120 SSQVPP-SVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDC 171
            S+V P SV+W + GAVTPV+ QG+C        VAAVEGIN I+  +LVSLSEQ+L+DC
Sbjct: 125 RSKVLPISVDWRKMGAVTPVRNQGECGSCWAFSTVAAVEGINKIRTGKLVSLSEQELLDC 184

Query: 172 ATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYED 231
             +  N GC GG+M +AFK+I QN GIT    Y Y G   GIC+  KA +H  +I+ YE 
Sbjct: 185 DIDSGNEGCNGGYMVNAFKFIKQNGGITTARNYPYIG-EQGICNKDKAANHVVKISGYET 243

Query: 232 VPPNDEESLLKAVANQPVSVAIDASALQF--YSGGVFNGYCETFLNHGVTAVGYGTSEEG 289
           VPPN+E+ L  AVA QPVSVAIDA   +F  YS G+FNG+C   LNH VT +GYG  + G
Sbjct: 244 VPPNNEKILQAAVAKQPVSVAIDAGGYEFQLYSKGIFNGFCGKQLNHAVTVIGYG-EDNG 302

Query: 290 IKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
            KYWL+KNSWG  WGE GY R+ RD    +G CGIAM AS+P+
Sbjct: 303 KKYWLVKNSWGTGWGEAGYARMIRDSRDDEGICGIAMEASYPI 345


>gi|449460678|ref|XP_004148072.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like [Cucumis
           sativus]
          Length = 317

 Score =  277 bits (708), Expect = 6e-72,   Method: Compositional matrix adjust.
 Identities = 152/314 (48%), Positives = 200/314 (63%), Gaps = 24/314 (7%)

Query: 30  IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLT 89
           I +++++W  +YGR YK   E  +RF I++ N+  ++ FN+    N S+TL  N FADLT
Sbjct: 15  IQDRYQKWMDKYGRQYKSREEWERRFTIYQANVQYIDNFNSM---NHSHTLAENNFADLT 71

Query: 90  PQEFIASQTGFKMSDHSSSLKANGTPFLYKSS-QVPPSVNWIEKGAVTPVKYQGQC---- 144
            +EF A+  G+K      ++    T F Y +   +P +V+W ++GAVTP+K QGQC    
Sbjct: 72  NEEFKATYLGYK------TVSIPDTCFRYGNMVNLPTNVDWRQEGAVTPIKNQGQCGSCW 125

Query: 145 ---AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITND 201
              AVAAVEGIN IK  +L+SLSEQ+LVDC     N GC GG+M  AF++I +  G+T +
Sbjct: 126 AFSAVAAVEGINKIKAGKLISLSEQELVDCDVTSGNQGCNGGYMYKAFEFI-KRTGLTTE 184

Query: 202 AVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA--LQ 259
             Y Y+G  +  C+  K +     I+ YE VP NDE+SL  AVANQPVSVAIDA     Q
Sbjct: 185 IEYPYQGAESA-CNEQKEKYQFVSISGYEKVPVNDEKSLKAAVANQPVSVAIDAEGNNFQ 243

Query: 260 FYSGGVFNGYCETFLNHGVTAVGYG-TSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQP 318
           FYSGG+F+G C   LNHGV  VGYG TS +   YWL+KNSWG DWGE GY R++RD    
Sbjct: 244 FYSGGIFSGNCGNQLNHGVAIVGYGETSNQA--YWLVKNSWGTDWGESGYIRMKRDSTDR 301

Query: 319 QGQCGIAMFASFPV 332
           QG CGIAM AS+P 
Sbjct: 302 QGTCGIAMMASYPT 315


>gi|302816909|ref|XP_002990132.1| hypothetical protein SELMODRAFT_428615 [Selaginella moellendorffii]
 gi|300142145|gb|EFJ08849.1| hypothetical protein SELMODRAFT_428615 [Selaginella moellendorffii]
          Length = 358

 Score =  277 bits (708), Expect = 6e-72,   Method: Compositional matrix adjust.
 Identities = 153/320 (47%), Positives = 200/320 (62%), Gaps = 22/320 (6%)

Query: 27  EGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFA 86
           +GS    +E+W   +GR Y    E  +RF+IF+DN   +E  N     N++Y L LN FA
Sbjct: 27  DGSFRALYEKWMVDHGRVYNGIGEKERRFQIFRDNAEYIEEHNRQV--NQTYWLGLNNFA 84

Query: 87  DLTPQEFIASQTGFKMSDHSSSLKANGTPFLYK-SSQVPPSVNWIEKGAVTPVKYQGQCA 145
           D+T  EF A   G K+   S+++K+    F Y+ ++ +P   +W  KGAV  VK QG C 
Sbjct: 85  DMTHDEFKALYFGTKVP-LSNTIKSG---FRYEDATNLPLDTDWRSKGAVATVKNQGACG 140

Query: 146 -------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGI 198
                  VAAVEG+N I    LVSLSEQ+LVDC     N GC GG MD AF++IIQN G+
Sbjct: 141 SCWAFSTVAAVEGVNQIVTGELVSLSEQELVDC-DKQKNQGCNGGLMDSAFEFIIQNGGL 199

Query: 199 TNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA- 257
            ++A Y Y+ +S G CD  +   H   I  +EDVP   E  LLKAVANQPVSVAI+AS  
Sbjct: 200 DSEADYPYKAVS-GSCDESRRNSHVVTIDGFEDVPAESEADLLKAVANQPVSVAIEASGR 258

Query: 258 -LQFYSGGVFNGYCETFLNHGVTAVGYGTSE--EGI--KYWLIKNSWGQDWGEDGYFRLQ 312
             Q YSGGV+ G+C   L+HGV AVGYGTS+  +G+   YW+++NSWG  WGE GY RLQ
Sbjct: 259 NFQLYSGGVYTGHCGYELDHGVVAVGYGTSKTPDGVATDYWIVRNSWGDAWGESGYIRLQ 318

Query: 313 RDIDQPQGQCGIAMFASFPV 332
           R++   +G+CGIAM AS+PV
Sbjct: 319 RNVASSRGKCGIAMMASYPV 338


>gi|146216000|gb|ABQ10202.1| cysteine protease Cp4 [Actinidia deliciosa]
          Length = 463

 Score =  277 bits (708), Expect = 6e-72,   Method: Compositional matrix adjust.
 Identities = 150/310 (48%), Positives = 193/310 (62%), Gaps = 18/310 (5%)

Query: 34  FEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEF 93
           +E+W   +G+ Y    E  +RFEIFKDNL  V+  N  A    SY + LN+FADLT +E+
Sbjct: 47  YEKWLTTHGKAYNAIGEKERRFEIFKDNLRFVDEHNAVA---GSYRVGLNRFADLTNEEY 103

Query: 94  IASQTG--FKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQCA------ 145
            +   G   +M + S+S K++   F     ++P SV+W EKGAV+PVK QGQC       
Sbjct: 104 RSMFLGGNMEMKERSASTKSDRYAFR-AGDKLPGSVDWREKGAVSPVKDQGQCGSCWAFS 162

Query: 146 -VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVY 204
            ++AVEGIN I    L+SLSEQ+LVDC     N GC GG MD  F++II N GI  +  Y
Sbjct: 163 TISAVEGINQIVTGELISLSEQELVDC-DKSYNMGCNGGLMDYGFQFIINNGGIDTEEDY 221

Query: 205 SYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDAS--ALQFYS 262
            Y  +  G CD  +       I  YEDVP +DE SL KAVANQPVSVAI+A   A Q Y 
Sbjct: 222 PYRAVD-GTCDQFRKNARVVSINGYEDVPEDDENSLKKAVANQPVSVAIEAGGRAFQLYE 280

Query: 263 GGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQC 322
            GVF G+C T L+HGV AVGYGT E G+ YW ++NSWG  WGE+GY +L+R+I+   G+C
Sbjct: 281 SGVFTGHCGTNLDHGVVAVGYGT-ENGVDYWTVRNSWGPKWGENGYIKLERNINATSGKC 339

Query: 323 GIAMFASFPV 332
           GIA  AS+P 
Sbjct: 340 GIASMASYPT 349


>gi|224056176|ref|XP_002298740.1| predicted protein [Populus trichocarpa]
 gi|222845998|gb|EEE83545.1| predicted protein [Populus trichocarpa]
          Length = 455

 Score =  277 bits (708), Expect = 6e-72,   Method: Compositional matrix adjust.
 Identities = 158/333 (47%), Positives = 199/333 (59%), Gaps = 25/333 (7%)

Query: 19  QATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSY 78
           Q   RT  E      +E W  ++GR Y    E  +RFEIFKDNL  ++  N  ++GN SY
Sbjct: 12  QVPERT--EAETRRIYEMWLVKHGRAYNALGEKERRFEIFKDNLKFIDEHN--SVGNPSY 67

Query: 79  TLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTP----FLYKSSQ-VPPSVNWIEKG 133
            L LNKFADL+  E+ +   G +M      L   G P    +L+K    +P +V+W EKG
Sbjct: 68  KLGLNKFADLSNDEYRSVYLGTRMDGKGRLL---GGPKSERYLFKEGDDLPETVDWREKG 124

Query: 134 AVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMD 186
           AV PVK QGQC        V AVEGIN I    L SLSEQ+LVDC     N GC GG MD
Sbjct: 125 AVAPVKDQGQCGSCWAFSTVGAVEGINQIVTGNLTSLSEQELVDCDKT-YNLGCNGGLMD 183

Query: 187 DAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVAN 246
            AF +II+N GI  +  Y Y+ + + +CD  +       I  YEDVP NDE+SL KAVAN
Sbjct: 184 YAFDFIIENGGIDTEEDYPYKAIDS-MCDPNRKNARVVTIDGYEDVPQNDEKSLKKAVAN 242

Query: 247 QPVSVAIDAS--ALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWG 304
           QPVSVAI+A     Q Y  GVF G C T L+HGV  VGYGT E G+ YW+++NSWG  WG
Sbjct: 243 QPVSVAIEAGGRGFQLYQSGVFTGSCGTQLDHGVVTVGYGT-EHGVDYWIVRNSWGPAWG 301

Query: 305 EDGYFRLQRDIDQPQ-GQCGIAMFASFPVSKES 336
           E+GY R++RD+   + G+CGIAM AS+P  K +
Sbjct: 302 ENGYIRMERDVASTETGKCGIAMEASYPTKKSA 334


>gi|449524070|ref|XP_004169046.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like, partial
           [Cucumis sativus]
          Length = 314

 Score =  277 bits (708), Expect = 6e-72,   Method: Compositional matrix adjust.
 Identities = 152/313 (48%), Positives = 200/313 (63%), Gaps = 24/313 (7%)

Query: 30  IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLT 89
           I +++++W  +YGR YK   E  +RF I++ N+  ++ FN+    N S+TL  N FADLT
Sbjct: 15  IQDRYQKWMDKYGRQYKSREEWERRFTIYQANVQYIDNFNSM---NHSHTLAENNFADLT 71

Query: 90  PQEFIASQTGFKMSDHSSSLKANGTPFLYKSS-QVPPSVNWIEKGAVTPVKYQGQC---- 144
            +EF A+  G+K      ++    T F Y +   +P +V+W ++GAVTP+K QGQC    
Sbjct: 72  NEEFKATYLGYK------TVSIPDTCFRYGNMVNLPTNVDWRQEGAVTPIKNQGQCGSCW 125

Query: 145 ---AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITND 201
              AVAAVEGIN IK  +L+SLSEQ+LVDC     N GC GG+M  AF++I +  G+T +
Sbjct: 126 AFSAVAAVEGINKIKAGKLISLSEQELVDCDVTSGNQGCNGGYMYKAFEFI-KRTGLTTE 184

Query: 202 AVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA--LQ 259
             Y Y+G  +  C+  K +     I+ YE VP NDE+SL  AVANQPVSVAIDA     Q
Sbjct: 185 IEYPYQGAESA-CNEQKEKYQFVSISGYEKVPVNDEKSLKAAVANQPVSVAIDAEGNNFQ 243

Query: 260 FYSGGVFNGYCETFLNHGVTAVGYG-TSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQP 318
           FYSGG+F+G C   LNHGV  VGYG TS +   YWL+KNSWG DWGE GY R++RD    
Sbjct: 244 FYSGGIFSGNCGNQLNHGVAIVGYGETSNQA--YWLVKNSWGTDWGESGYIRMKRDSTDK 301

Query: 319 QGQCGIAMFASFP 331
           QG CGIAM AS+P
Sbjct: 302 QGTCGIAMMASYP 314


>gi|302143380|emb|CBI21941.3| unnamed protein product [Vitis vinifera]
          Length = 354

 Score =  276 bits (707), Expect = 7e-72,   Method: Compositional matrix adjust.
 Identities = 159/343 (46%), Positives = 211/343 (61%), Gaps = 22/343 (6%)

Query: 4   YFLIVVLI---ISGSCASQATYRTFDEGSIAEK-FEQWKAQYGRTYKESAENSKRFEIFK 59
           YF ++++    +S S  S+       E S  EK +E+W  Q+GR YK   E  + F I++
Sbjct: 7   YFALLIMWTVGVSWSAFSEEHEPMESEMSDMEKRYERWLVQHGRRYKNRDEWQRHFGIYQ 66

Query: 60  DNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYK 119
            N+  +   N     N S+TL  N+FAD+T +E+ A   G   S+ S   + N + F  +
Sbjct: 67  SNVRFINYINAQ---NFSFTLTDNQFADMTNEEYKALYMGLGTSETS---RKNQSSFKRE 120

Query: 120 SSQVPP-SVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDC 171
            S+V P SV+W + GAVTPV+ QG+C        VAAVEGIN I+  +LVSLSEQ+L+DC
Sbjct: 121 RSKVLPISVDWRKMGAVTPVRNQGECGSCWAFSTVAAVEGINKIRTGKLVSLSEQELLDC 180

Query: 172 ATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYED 231
             +  N GC GG+M +AFK+I QN GIT    Y Y G   GIC+  KA +H  +I+ YE 
Sbjct: 181 DIDSGNEGCNGGYMVNAFKFIKQNGGITTARNYPYIG-EQGICNKDKAANHVVKISGYET 239

Query: 232 VPPNDEESLLKAVANQPVSVAIDASALQF--YSGGVFNGYCETFLNHGVTAVGYGTSEEG 289
           VPPN+E+ L  AVA QPVSVAIDA   +F  YS G+FNG+C   LNH VT +GYG  + G
Sbjct: 240 VPPNNEKILQAAVAKQPVSVAIDAGGYEFQLYSKGIFNGFCGKQLNHAVTVIGYG-EDNG 298

Query: 290 IKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
            KYWL+KNSWG  WGE GY R+ RD    +G CGIAM AS+P+
Sbjct: 299 KKYWLVKNSWGTGWGEAGYARMIRDSRDDEGICGIAMEASYPI 341


>gi|194352762|emb|CAQ00109.1| papain-like cysteine proteinase [Hordeum vulgare subsp. vulgare]
 gi|326517250|dbj|BAJ99991.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 367

 Score =  276 bits (707), Expect = 7e-72,   Method: Compositional matrix adjust.
 Identities = 159/330 (48%), Positives = 208/330 (63%), Gaps = 22/330 (6%)

Query: 27  EGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFA 86
           E S+   +E+W+ Q+    ++  E ++RF +F++N+  +  FN    G+  Y LRLN+F 
Sbjct: 40  EDSLWALYERWREQH-TVARDLGEKARRFNVFRENVRLIHEFNR---GDAPYKLRLNRFG 95

Query: 87  DLTPQEFIASQTGFKMSDHSS-SLKANGTPFLYKSS----QVPPSVNWIEKGAVTPVKYQ 141
           D+T  EF  +    ++S H   SLK  G  F++ S+     VPPSV+W +KGAVT VK Q
Sbjct: 96  DMTADEFRRAYASSRVSHHRMFSLKEGGGGFMHGSAASVRDVPPSVDWRQKGAVTAVKDQ 155

Query: 142 GQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQ 194
           GQC        +AAVEGINAI+   L SLSEQQLVDC T  +N GC GG MD AF+YI +
Sbjct: 156 GQCGSCWAFSTIAAVEGINAIRSKNLTSLSEQQLVDCDTK-SNAGCNGGLMDYAFQYIAK 214

Query: 195 NKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAID 254
           + G+  +  Y Y+      C+  K       I  YEDVP NDE +L KAVA QPV+VAI+
Sbjct: 215 HGGVAAEDAYPYKARQASSCN--KKPSAVVTIDGYEDVPANDETALKKAVAAQPVAVAIE 272

Query: 255 ASA--LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQ 312
           AS    QFYS GVF G C T L+HGV AVGYGT+ +G KYW++KNSWG +WGE GY R++
Sbjct: 273 ASGSHFQFYSEGVFAGKCGTELDHGVAAVGYGTTVDGTKYWIVKNSWGPEWGEKGYIRMK 332

Query: 313 RDIDQPQGQCGIAMFASFPVSKESAQPSSA 342
           RD+   +G CGIAM AS+PV K SA P  A
Sbjct: 333 RDVKDKEGLCGIAMEASYPV-KTSANPKHA 361


>gi|357507617|ref|XP_003624097.1| Cysteine protease [Medicago truncatula]
 gi|355499112|gb|AES80315.1| Cysteine protease [Medicago truncatula]
          Length = 340

 Score =  276 bits (707), Expect = 8e-72,   Method: Compositional matrix adjust.
 Identities = 155/339 (45%), Positives = 212/339 (62%), Gaps = 29/339 (8%)

Query: 5   FLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVA 64
            LIV+ ++  S  +Q   ++    +++E+++ WK +Y   YK+ AE  K  +IFK N+  
Sbjct: 13  ILIVIWVMFPSNQNQENDQSL---TLSERYKHWKIKYRVIYKDDAEEEKHIQIFKHNVAY 69

Query: 65  VERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFK---MSDHSSSLKANGTPFLYKS- 120
           ++ FN  A GN+SY L +N+FADL P E   S  GFK   +   +SSL      F YK+ 
Sbjct: 70  IDSFN--AAGNKSYKLTINRFADL-PTE--PSDDGFKKRKLEPTTSSL------FKYKNI 118

Query: 121 SQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCAT 173
           + +P +V+W ++GAVTPVK Q +C       AV A+EGI  I    LVSLSEQ+LVD   
Sbjct: 119 TDIPAAVDWRKRGAVTPVKNQRECGSCWAFSAVGALEGIQQITSGNLVSLSEQELVDRVR 178

Query: 174 NDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVP 233
           ++  NGC GG++ DAF+++++N GI  +A Y Y G+     ++ K      QI +YE VP
Sbjct: 179 SNWTNGCNGGYLIDAFEFVLENGGIATEASYPYRGVKG---NNSKKVSRQVQIKSYEQVP 235

Query: 234 PNDEESLLKAVANQPVSVAIDASAL-QFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKY 292
            N E+SLLK VANQPVSV ID S + +FYS G+F G C T  NH V  VGYGTS +G KY
Sbjct: 236 RNSEDSLLKVVANQPVSVGIDISGMIRFYSSGIFTGECGTKPNHAVIIVGYGTSNDGTKY 295

Query: 293 WLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFP 331
           WL+KNSWG  WGE  Y R++RDID  +G CGI M AS+P
Sbjct: 296 WLVKNSWGIRWGEKRYIRMKRDIDAKEGLCGIPMDASYP 334


>gi|357129125|ref|XP_003566217.1| PREDICTED: thiol protease SEN102-like [Brachypodium distachyon]
          Length = 380

 Score =  276 bits (707), Expect = 9e-72,   Method: Compositional matrix adjust.
 Identities = 161/335 (48%), Positives = 206/335 (61%), Gaps = 25/335 (7%)

Query: 27  EGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFA 86
           E S+   +E+W+A++    ++ AE S+RF +F++N   V  FN     +  Y LRLN+FA
Sbjct: 42  EESLWALYERWRARH-TVSRDLAEKSRRFNVFRENARLVHEFN--LRRDAPYKLRLNRFA 98

Query: 87  DLTPQEFIASQTGFKMSDH----------SSSLKANGTPFLYKSSQVPPSVNWIEKGAVT 136
           DLT  EF  S    ++S H          +      G+ F +  + +P SV+W EKGAVT
Sbjct: 99  DLTSDEFRRSYASSRVSHHRMFKPRAANNNDDDDDKGSSFTHGGA-LPTSVDWREKGAVT 157

Query: 137 PVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAF 189
            VK QGQC        +AAVEGINAI+ N L SLSEQQLVDC T   N GC GG MDDAF
Sbjct: 158 GVKDQGQCGSCWAFSTIAAVEGINAIRTNNLTSLSEQQLVDCDTK-TNAGCDGGLMDDAF 216

Query: 190 KYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPV 249
            YI ++ G+  +  Y Y    +  C+S KA      I  YEDVP NDE +L KAVA QPV
Sbjct: 217 SYIAKHGGVAAEKSYPYRARQSSSCNSKKAAAAVVSIDGYEDVPRNDETALKKAVAAQPV 276

Query: 250 SVAIDA--SALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDG 307
           +VAI+A  S  QFYS GVF G C T L+HGV AVGYG + +G KYW++KNSWG++WGE G
Sbjct: 277 AVAIEAGGSHFQFYSEGVFAGKCGTELDHGVAAVGYGVTVDGTKYWIVKNSWGEEWGEKG 336

Query: 308 YFRLQRDIDQPQGQCGIAMFASFPVSKESAQPSSA 342
           Y R++RD+   +G CGIAM AS+PV K S  P  A
Sbjct: 337 YIRMKRDVADKEGLCGIAMEASYPV-KTSPNPKHA 370


>gi|62526575|gb|AAX84673.1| cysteine protease CP1 [Manihot esculenta]
          Length = 467

 Score =  276 bits (706), Expect = 9e-72,   Method: Compositional matrix adjust.
 Identities = 150/328 (45%), Positives = 205/328 (62%), Gaps = 18/328 (5%)

Query: 17  ASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNR 76
           A+++++RT DE  +   +E+W  + G+ Y    E  KRF++FKDNL  ++  N+    NR
Sbjct: 37  ATKSSWRTDDE--VMAIYEEWLVKQGKVYNALGEREKRFQVFKDNLRFIDEHNSE---NR 91

Query: 77  SYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQ-VPPSVNWIEKGAV 135
           +Y L LN FADLT +E+ ++  G +     + L+     +  +  + +P SV+W ++GAV
Sbjct: 92  TYKLGLNGFADLTNEEYRSTYLGARGGMKRNRLRKTSDRYAPRVGESLPDSVDWRKEGAV 151

Query: 136 TPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDA 188
             VK QG C        +AAVEGIN I    L+SLSEQ+LVDC T+  N GC GG MD A
Sbjct: 152 AEVKDQGSCGSCWAFSTIAAVEGINKIVTGDLISLSEQELVDCDTS-YNEGCNGGLMDYA 210

Query: 189 FKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQP 248
           F++II N GI  +  Y Y     G CD+ +       I +YEDVP N E +L KAVANQP
Sbjct: 211 FEFIINNGGIDTEEDYPYLARD-GRCDTYRKNAKVVTIDDYEDVPVNSETALQKAVANQP 269

Query: 249 VSVAIDASA--LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGED 306
           VSVAI+A     QFY+ G+F+G C T L+HGV AVGYGT E G  YW+++NSWG+ WGE+
Sbjct: 270 VSVAIEAGGRDFQFYASGIFSGRCGTQLDHGVAAVGYGT-ENGKDYWIVRNSWGKSWGEN 328

Query: 307 GYFRLQRDIDQPQGQCGIAMFASFPVSK 334
           GY R+ R I+ P G CGIAM AS+P+ K
Sbjct: 329 GYLRMARSINSPTGICGIAMEASYPIKK 356


>gi|1173630|gb|AAB37233.1| cysteine proteinase [Phalaenopsis sp. SM9108]
          Length = 359

 Score =  276 bits (706), Expect = 1e-71,   Method: Compositional matrix adjust.
 Identities = 156/334 (46%), Positives = 211/334 (63%), Gaps = 24/334 (7%)

Query: 27  EGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFA 86
           E S+   +E+W++ +  + ++  E  KRF +FK+N   +  FN     +  Y LRLNKFA
Sbjct: 31  EDSLWNLYERWRSHHTVS-RDLDEKQKRFNVFKENPRYIHDFNKRK--DIPYKLRLNKFA 87

Query: 87  DLTPQEFIASQTGFKMSDHSS---SLKANGT-PFLYKS---SQVPPSVNWIEKGAVTPVK 139
           DLT  EF ++  G +++ H S   S +   T  F+Y+S     +P S++W +KGAVT VK
Sbjct: 88  DLTNHEFRSTYAGSRINHHRSLRGSRRGGATNSFMYQSLDSRSLPASIDWRQKGAVTAVK 147

Query: 140 YQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYI 192
            QGQC        VAAVEGIN IK  +L+SLSEQ+L+DC T D NNGC GG MD AF +I
Sbjct: 148 DQGQCGSCWAFSTVAAVEGINQIKTKKLLSLSEQELIDCDT-DENNGCNGGLMDYAFDFI 206

Query: 193 IQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVA 252
            +N GI+++A Y Y    +  C + K + H   I  +EDVP NDE+SLLKAVANQPVS+A
Sbjct: 207 KKNGGISSEAEYPYAAEDS-YCATEK-KSHVVSIDGHEDVPANDEDSLLKAVANQPVSIA 264

Query: 253 IDASA--LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFR 310
           I+AS    QFYS GVF G   T L+HGV  VGYG +++G KYW+++NSWG +WGE GY R
Sbjct: 265 IEASGYDFQFYSEGVFTGRSGTELDHGVAIVGYGKTQQGTKYWIVRNSWGAEWGEKGYIR 324

Query: 311 LQRDIDQPQGQCGIAMFASFPVSKESAQPSSADK 344
           +    D  +  CG+AM AS+P+ K S  PS   +
Sbjct: 325 ISAASDSKR-LCGLAMEASYPI-KTSPNPSHKSR 356


>gi|359359066|gb|AEV40973.1| putative oryzain beta chain precursor [Oryza punctata]
          Length = 461

 Score =  276 bits (705), Expect = 1e-71,   Method: Compositional matrix adjust.
 Identities = 144/309 (46%), Positives = 196/309 (63%), Gaps = 15/309 (4%)

Query: 34  FEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEF 93
           ++ W A+ GR+Y    E  +RF +F DNL  V+  N  A  +  + L +N+FADLT  EF
Sbjct: 49  YDLWLAENGRSYNALGERERRFRVFWDNLKFVDAHNARADEHGGFRLGMNRFADLTNDEF 108

Query: 94  IASQTGFKMSDHSSSLKANGTPFLYKS-SQVPPSVNWIEKGAVTPVKYQGQC-------A 145
            ++  G K+ + S   +A G  + +    ++P SV+W EKGAV PVK QGQC       A
Sbjct: 109 RSTFLGAKVVERS---RAAGERYRHDGVEELPESVDWREKGAVAPVKNQGQCGSCWAFSA 165

Query: 146 VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYS 205
           V+ VE IN +    +++LSEQ+LV+C+TN  N+GC GG MDDAF +II+N GI  +  Y 
Sbjct: 166 VSTVESINQLVTGEMITLSEQELVECSTNGQNSGCNGGLMDDAFDFIIKNGGIDTEDDYP 225

Query: 206 YEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA--LQFYSG 263
           Y+ +  G CD  +       I  +EDVP NDE+SL KAVA+QPVSVAI+A     Q Y  
Sbjct: 226 YKAVD-GKCDINRENAKVVSIDGFEDVPQNDEKSLQKAVAHQPVSVAIEAGGREFQLYHS 284

Query: 264 GVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCG 323
           GVF+G C T L+HGV AVGYGT + G  YW+++NSWG  WGE GY R++R+I+   G+CG
Sbjct: 285 GVFSGRCGTSLDHGVVAVGYGT-DNGKDYWIVRNSWGPKWGESGYVRMERNINATTGKCG 343

Query: 324 IAMFASFPV 332
           IAM AS+P 
Sbjct: 344 IAMMASYPT 352


>gi|449500145|ref|XP_004161017.1| PREDICTED: xylem cysteine proteinase 1-like [Cucumis sativus]
          Length = 349

 Score =  275 bits (704), Expect = 2e-71,   Method: Compositional matrix adjust.
 Identities = 150/314 (47%), Positives = 196/314 (62%), Gaps = 19/314 (6%)

Query: 32  EKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQ 91
           E FE W +++ +TY+   E   RFEIF DNL  ++  N       SY L LN+FADL+ +
Sbjct: 45  ELFESWMSKHSKTYRSIEEKLHRFEIFLDNLKHIDETNKKV---SSYWLGLNEFADLSHE 101

Query: 92  EFIASQTGFKMSDHSSSLKANGTPFLYKSSQ-VPPSVNWIEKGAVTPVKYQGQCA----- 145
           EF +   G ++       K +   F Y   + +P SV+W  KGAVTPVK QG C      
Sbjct: 102 EFKSKYLGLRVE---FPRKRSSRGFSYGDVEDLPESVDWRTKGAVTPVKNQGSCGSCWAF 158

Query: 146 --VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAV 203
             VAAVEGIN I    L SLSEQ+L+DC     NNGCYGG MD AF+YI+ N G+  +  
Sbjct: 159 STVAAVEGINQIVTGNLTSLSEQELIDC-DRSFNNGCYGGLMDYAFQYIMSNSGLRKEED 217

Query: 204 YSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA--LQFY 261
           Y Y  M  G C   K +     I+ YEDVP NDE+SLLKA+++QPVSVAI+AS+   QFY
Sbjct: 218 YPYL-MEEGRCIREKEQFEVVTISGYEDVPANDEQSLLKALSHQPVSVAIEASSRNFQFY 276

Query: 262 SGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQ 321
            GG+F G C T ++HGVTAVGYG+SE G  Y ++KNSWG  WGE+GY R++R+  +P+G 
Sbjct: 277 KGGIFTGRCGTQMDHGVTAVGYGSSE-GTDYIIVKNSWGPKWGENGYIRMKRNTGKPEGL 335

Query: 322 CGIAMFASFPVSKE 335
           CGI   AS+P  ++
Sbjct: 336 CGINQMASYPTKEK 349


>gi|359359213|gb|AEV41117.1| putative oryzain beta chain precursor [Oryza officinalis]
          Length = 465

 Score =  275 bits (704), Expect = 2e-71,   Method: Compositional matrix adjust.
 Identities = 146/312 (46%), Positives = 199/312 (63%), Gaps = 22/312 (7%)

Query: 34  FEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERF---NNAAIGNRSYTLRLNKFADLTP 90
           ++ W A+ GR+Y    E+ +RF +F DNL    RF   +NA   +  + L +N+FADLT 
Sbjct: 54  YDLWLAENGRSYNALGEHERRFRVFWDNL----RFADAHNARADDHGFRLGMNRFADLTN 109

Query: 91  QEFIASQTGFKMSDHSSSLKANGTPFLYKS-SQVPPSVNWIEKGAVTPVKYQGQC----- 144
           +EF A+  G K+ + S   +A G  + +    ++P SV+W EKGAV PVK QGQC     
Sbjct: 110 EEFRATFLGAKVVERS---RAAGERYRHDGVEELPESVDWREKGAVAPVKNQGQCGSCWA 166

Query: 145 --AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDA 202
             AV+ VE IN +    +++LSEQ+LV+C+TN  N+GC GG MDDAF +II+N GI  + 
Sbjct: 167 FSAVSTVESINQLVTGEMITLSEQELVECSTNGQNSGCNGGLMDDAFDFIIKNGGIDTED 226

Query: 203 VYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA--LQF 260
            Y Y+ +  G CD  +       I  +EDVP NDE+SL KAVA+QPVSVAI+A     Q 
Sbjct: 227 DYPYKAVD-GKCDINRENAKVVSIDGFEDVPQNDEKSLQKAVAHQPVSVAIEAGGREFQL 285

Query: 261 YSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQG 320
           Y  GVF+G C T L+HGV AVGYGT + G  YW+++NSWG  WGE GY R++R+I+   G
Sbjct: 286 YHSGVFSGRCGTSLDHGVVAVGYGT-DNGKDYWIVRNSWGPKWGESGYVRMERNINVTTG 344

Query: 321 QCGIAMFASFPV 332
           +CGIAM AS+P 
Sbjct: 345 KCGIAMMASYPT 356


>gi|30141023|dbj|BAC75925.1| cysteine protease-3 [Helianthus annuus]
          Length = 348

 Score =  275 bits (704), Expect = 2e-71,   Method: Compositional matrix adjust.
 Identities = 142/319 (44%), Positives = 198/319 (62%), Gaps = 17/319 (5%)

Query: 29  SIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADL 88
           S+ + +E+W +Q+    +   E  KRF +FK N+  + R N      + Y L+LN+FAD+
Sbjct: 35  SLWDLYERWGSQH-MVSRAPDEKKKRFNVFKYNVNHINRVNQLG---KPYKLKLNEFADM 90

Query: 89  TPQEFIASQTGFKMSDHSSSLKANGTPFLY-KSSQVPPSVNWIEKGAVTPVKYQGQCA-- 145
           T  EF A      +       K   TPF + K++  PPS++W   GAV P+K QG+C   
Sbjct: 91  TNHEFKAGFDSKILHFRMLKGKRRQTPFTHAKTTDPPPSIDWRTNGAVNPIKNQGRCGSC 150

Query: 146 -----VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITN 200
                +  VEGIN IK N+LVSLSEQ+LVDC T+    GC GG M++ +++I +  G+T 
Sbjct: 151 WAFSTIVGVEGINKIKTNQLVSLSEQELVDCETD--CEGCNGGLMENGYEFIKETGGVTT 208

Query: 201 DAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASAL-- 258
           + +Y Y   + G CD  K      +I  +E+VP NDE ++L+AVANQPVS+AIDA  L  
Sbjct: 209 EQIYPYFARN-GRCDISKRNSPVVKIDGFENVPANDESAMLRAVANQPVSIAIDAGGLNF 267

Query: 259 QFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQP 318
           QFYS GVFNG C T LNHGV  VGYGT+++G  YW+++NSWG  WGE GY R+QR ++ P
Sbjct: 268 QFYSQGVFNGACGTELNHGVAIVGYGTTQDGTNYWIVRNSWGTGWGEQGYVRMQRGVNVP 327

Query: 319 QGQCGIAMFASFPVSKESA 337
           +G CG+AM AS+P+   S 
Sbjct: 328 EGLCGLAMDASYPIKASSV 346


>gi|414591545|tpg|DAA42116.1| TPA: hypothetical protein ZEAMMB73_388689 [Zea mays]
          Length = 384

 Score =  275 bits (704), Expect = 2e-71,   Method: Compositional matrix adjust.
 Identities = 149/333 (44%), Positives = 198/333 (59%), Gaps = 26/333 (7%)

Query: 27  EGSIAEKFEQWKAQYGRTYKESAEN----SKRFEIFKDNLVAVERFNNAAIGNRSYTLRL 82
           E S+   +E+W++ Y R      ++    ++RF +FK+N   V   N      R + L L
Sbjct: 34  EESLRALYERWRSHYHRVSPRDGDDKQQQARRFNVFKENARYVHEANRKD--GRPFRLAL 91

Query: 83  NKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYK--------SSQVPPSVNWIEKGA 134
           NKFAD+T  EF  +  G +   H + L      F +         ++ +PP+V+W  +GA
Sbjct: 92  NKFADMTTDEFRRTYAGSRTRHHRAQL-GEARSFAHAQHGRGGSGTTNLPPAVDWRLRGA 150

Query: 135 VTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDD 187
           VT VK QGQC       A+AAVEG+N I   +LVSLSEQ+LVDC   DN  GC GG MD 
Sbjct: 151 VTGVKDQGQCGSCWAFSAIAAVEGVNKIMTGKLVSLSEQELVDCDDVDNQ-GCDGGLMDY 209

Query: 188 AFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQ 247
           AF+YI +N G+T ++ Y Y       C+  K   H   I  YEDVP N+E++L KAVA+Q
Sbjct: 210 AFQYIQRNGGVTTESNYPYLAEQRS-CNKAKERSHDVTIDGYEDVPANNEDALQKAVASQ 268

Query: 248 PVSVAIDASA--LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGE 305
           PV+VAI+AS    QFYS GVF G C T L+HGV AVGYGT+ +G KYW +KNSWG+DWGE
Sbjct: 269 PVAVAIEASGQDFQFYSEGVFTGSCGTDLDHGVAAVGYGTTGDGTKYWTVKNSWGEDWGE 328

Query: 306 DGYFRLQRDIDQPQGQCGIAMFASFPVSKESAQ 338
            GY R+QR +   +G CGIAM  S+P  K +  
Sbjct: 329 RGYIRMQRGVPDSRGLCGIAMEPSYPTKKPAGH 361


>gi|297802418|ref|XP_002869093.1| hypothetical protein ARALYDRAFT_491113 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297314929|gb|EFH45352.1| hypothetical protein ARALYDRAFT_491113 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 355

 Score =  275 bits (703), Expect = 2e-71,   Method: Compositional matrix adjust.
 Identities = 150/313 (47%), Positives = 196/313 (62%), Gaps = 17/313 (5%)

Query: 30  IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLT 89
           + E FE W +++ + YK   E   RFE+F++NL+ +++ NN      SY L LN+FADLT
Sbjct: 47  LLELFESWMSEHSKVYKSVEEKVHRFEVFRENLMHIDQRNNEI---NSYWLGLNEFADLT 103

Query: 90  PQEFIASQTGFKMSDHSSSLKANGTPFLYKS-SQVPPSVNWIEKGAVTPVKYQGQCA--- 145
            +EF     G      S   + +   F Y+  + +P SV+W +KGAV PVK QGQC    
Sbjct: 104 HEEFKGRYLGLAKPQFSRKRQPSAN-FRYRDITDLPKSVDWRKKGAVAPVKDQGQCGSCW 162

Query: 146 ----VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITND 201
               VAAVEGIN I    L SLSEQ+L+DC T   N+GC GG MD AF+YII   G+  +
Sbjct: 163 AFSTVAAVEGINQITTGNLSSLSEQELIDCDTT-FNSGCNGGLMDYAFQYIISTGGLHKE 221

Query: 202 AVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA--LQ 259
             Y Y  M  GIC   K +     I+ YEDVP ND+ESL+KA+A+QPVSVAI+AS    Q
Sbjct: 222 DDYPYL-MEEGICQEQKEDVERVTISGYEDVPENDDESLVKALAHQPVSVAIEASGRDFQ 280

Query: 260 FYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQ 319
           FY GGVFNG C T L+HGV AVGYG+S +G  Y ++KNSWG  WGE G+ R++R+  +P+
Sbjct: 281 FYKGGVFNGQCGTDLDHGVAAVGYGSS-KGSDYVIVKNSWGPRWGEKGFIRMKRNTGKPE 339

Query: 320 GQCGIAMFASFPV 332
           G CGI   AS+P 
Sbjct: 340 GLCGINKMASYPT 352


>gi|109119897|dbj|BAE96008.1| cysteine proteinase [Triticum aestivum]
          Length = 377

 Score =  275 bits (703), Expect = 2e-71,   Method: Compositional matrix adjust.
 Identities = 153/330 (46%), Positives = 203/330 (61%), Gaps = 27/330 (8%)

Query: 27  EGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFA 86
           E ++ + +E+W+  + R  +  AE  +RF  FK N+  +   N    G+R Y LRLN+F 
Sbjct: 39  EEALWDLYERWQTAH-RVPRHHAEKHRRFGTFKSNVHFIHSHNKR--GDRPYRLRLNRFG 95

Query: 87  DLTPQEFIASQTGFKMSDHSSSLKANGTP-----FLYKS---SQVPPSVNWIEKGAVTPV 138
           D++  EF A+  G ++SD      A  TP     F+Y +   S +P SV+W +KGAVT V
Sbjct: 96  DMSQAEFRATFAGSRVSDRRRDGPA--TPPSVPGFMYAAVNVSDLPRSVDWRQKGAVTGV 153

Query: 139 KYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKY 191
           K QG+C        V +VEGINAI+  +LVSLSEQ+L+DC T DN+ GC GG MD+AF+Y
Sbjct: 154 KNQGKCGSCWAFSTVVSVEGINAIRTGKLVSLSEQELIDCDTADND-GCEGGLMDNAFEY 212

Query: 192 IIQNKGITNDAVYSYEGMSTGICDSIK---AEDHAAQITNYEDVPPNDEESLLKAVANQP 248
           I +N G+T +A Y Y   + G C + K   +      I  ++DVP N EE+L KAVANQP
Sbjct: 213 IKKNGGLTTEAAYPYRA-ANGTCKAAKVAKSSPMVVHIDGHQDVPANSEEALAKAVANQP 271

Query: 249 VSVAIDAS--ALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGED 306
           VSV IDAS  A  FYS GVF G C T L+HGV  VGYG +E+G  YW +KNSWG  WGE 
Sbjct: 272 VSVGIDASGKAFMFYSEGVFTGECGTELDHGVAVVGYGVAEDGKAYWTVKNSWGPSWGEK 331

Query: 307 GYFRLQRDIDQPQGQCGIAMFASFPVSKES 336
           GY R+++D     G CGIAM AS+ V  +S
Sbjct: 332 GYIRVEKDSGAEGGLCGIAMEASYAVKTDS 361


>gi|18396939|ref|NP_564320.1| Papain family cysteine protease [Arabidopsis thaliana]
 gi|9502427|gb|AAF88126.1|AC021043_19 Putative cysteine proteinase [Arabidopsis thaliana]
 gi|67633400|gb|AAY78625.1| peptidase C1A papain family protein [Arabidopsis thaliana]
 gi|332192919|gb|AEE31040.1| Papain family cysteine protease [Arabidopsis thaliana]
          Length = 346

 Score =  275 bits (703), Expect = 3e-71,   Method: Compositional matrix adjust.
 Identities = 157/347 (45%), Positives = 219/347 (63%), Gaps = 24/347 (6%)

Query: 5   FLIVVLII--SGSCASQATYRT--FDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKD 60
           F+ VVL I       S+AT R   +   SI +  +QW  Q+ R Y +  E   R ++  +
Sbjct: 6   FVCVVLTIFFMDLKISEATSRVALYKPSSIVDYHQQWMIQFSRVYDDEFEKQLRLQVLTE 65

Query: 61  NLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKA-NGTPFLYK 119
           NL  +E FNN  +GN+SY L +N+F D T +EF+A+ TG +  + +S  +  N T   + 
Sbjct: 66  NLKFIESFNN--MGNQSYKLGVNEFTDWTKEEFLATYTGLRGVNVTSPFEVVNETKPAWN 123

Query: 120 ---SSQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLV 169
              S  +  + +W  +GAVTPVK QG+C       A+AAVEG+  I    L+SLSEQQL+
Sbjct: 124 WTVSDVLGTNKDWRNEGAVTPVKSQGECGGCWAFSAIAAVEGLTKIARGNLISLSEQQLL 183

Query: 170 DCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNY 229
           DC T + NNGC GG   +AF YII+++GI+++  Y Y+ +  G C S      A  I  +
Sbjct: 184 DC-TREQNNGCKGGTFVNAFNYIIKHRGISSENEYPYQ-VKEGPCRS--NARPAILIRGF 239

Query: 230 EDVPPNDEESLLKAVANQPVSVAIDASALQF--YSGGVFNGY-CETFLNHGVTAVGYGTS 286
           E+VP N+E +LL+AV+ QPV+VAIDAS   F  YSGGV+N   C T +NH VT VGYGTS
Sbjct: 240 ENVPSNNERALLEAVSRQPVAVAIDASEAGFVHYSGGVYNARNCGTSVNHAVTLVGYGTS 299

Query: 287 EEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVS 333
            EG+KYWL KNSWG+ WGE+GY R++RD++ PQG CG+A +AS+PV+
Sbjct: 300 PEGMKYWLAKNSWGKTWGENGYIRIRRDVEWPQGMCGVAQYASYPVA 346


>gi|50355623|dbj|BAD29960.1| cysteine protease [Daucus carota]
          Length = 460

 Score =  275 bits (703), Expect = 3e-71,   Method: Compositional matrix adjust.
 Identities = 149/315 (47%), Positives = 192/315 (60%), Gaps = 19/315 (6%)

Query: 30  IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLT 89
           I   +E W  ++G++Y    E  +RF+IFKDN + ++  N A   +RS+ L LN+FADLT
Sbjct: 40  IMAAYESWLVKHGKSYNALGEKEQRFQIFKDNFLYIDEQNAAK--DRSFKLGLNRFADLT 97

Query: 90  PQEFIASQTGFKMSDHSSSLKANGTPFLYKS---SQVPPSVNWIEKGAVTPVKYQGQCA- 145
            +E+ +  TG +  D  S  K +G    Y S     +P SV+W E GAV  VK QGQC  
Sbjct: 98  NEEYRSKYTGIRTKD--SRKKVSGKSQRYASLAGESLPESVDWREHGAVASVKDQGQCGS 155

Query: 146 ------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGIT 199
                 ++AVEGIN I   +L++LSEQ+LVDC     N GC GG MDDAF++II N GI 
Sbjct: 156 CWAFSTISAVEGINQIATGKLITLSEQELVDC-DRSYNEGCNGGLMDDAFQFIINNGGID 214

Query: 200 NDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA-- 257
           +DA Y Y G   G CD  +       I +YEDVP  DE++L KA ANQP+SVAI+AS   
Sbjct: 215 SDADYPYTGRD-GQCDQYRKNAKVVTIDSYEDVPEYDEKALQKAAANQPISVAIEASGRD 273

Query: 258 LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQ 317
            QFY  G+F G C T L+HGV  VGYGT E G  YW+++NSWG DWGE GY R++R I  
Sbjct: 274 FQFYDSGIFTGKCGTDLDHGVVVVGYGT-ENGKDYWIVRNSWGADWGEKGYLRMERGISS 332

Query: 318 PQGQCGIAMFASFPV 332
             G CGI    S+PV
Sbjct: 333 KAGICGITSEPSYPV 347


>gi|297602242|ref|NP_001052232.2| Os04g0203500 [Oryza sativa Japonica Group]
 gi|255675217|dbj|BAF14146.2| Os04g0203500 [Oryza sativa Japonica Group]
          Length = 336

 Score =  275 bits (703), Expect = 3e-71,   Method: Compositional matrix adjust.
 Identities = 149/351 (42%), Positives = 211/351 (60%), Gaps = 35/351 (9%)

Query: 1   MAKYFLIVVLIISGSCASQATYRTF-DEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFK 59
           MAK  L  +L     C++    R   D+ ++A + E+W AQYGR YK+ AE ++RFE+FK
Sbjct: 3   MAKALLFAILGCLCLCSAVLAARELSDDAAMAARHERWMAQYGRMYKDDAEKARRFEVFK 62

Query: 60  DNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQT--GFKMSDHSSSLKANGTPFL 117
            N+  +E FN    GN  + L +N+FADLT  EF +++T  GF  S           P  
Sbjct: 63  ANVAFIESFN---AGNHKFWLGVNQFADLTNDEFRSTKTNKGFIPSTTR-------VPTG 112

Query: 118 YKSSQV-----PPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSE 165
           +++  V     P +++W  KG VTP+K QGQC       AVAA+EGI  +   +L+S S 
Sbjct: 113 FRNENVNIDALPATMDWRTKGVVTPIKDQGQCGCCWAFSAVAAMEGIVKLSTGKLISHSL 172

Query: 166 QQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKA-EDHAA 224
            + +    +    GC GG MDDAFK+II+N G+T ++ Y Y  +     D  K+  +  A
Sbjct: 173 NKSLLTVMS---MGCEGGLMDDAFKFIIKNGGLTTESNYPYAAVD----DKFKSVSNSVA 225

Query: 225 QITNYEDVPPNDEESLLKAVANQPVSVAIDASAL--QFYSGGVFNGYCETFLNHGVTAVG 282
            I  YEDVP N+E +L+KAVANQPVSVA+D   +  QFY GGV  G C T L+HG+ A+G
Sbjct: 226 SIKGYEDVPANNEAALMKAVANQPVSVAVDGGDMTFQFYKGGVMTGSCGTDLDHGIVAIG 285

Query: 283 YGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVS 333
           YG + +G KYWL+KNSWG  WGE+G+ R+++DI   +G CG+AM  S+P +
Sbjct: 286 YGKASDGTKYWLLKNSWGMTWGENGFLRMEKDISDKRGMCGLAMEPSYPTA 336


>gi|357474523|ref|XP_003607546.1| Cysteine proteinase [Medicago truncatula]
 gi|358347207|ref|XP_003637651.1| Cysteine proteinase [Medicago truncatula]
 gi|355503586|gb|AES84789.1| Cysteine proteinase [Medicago truncatula]
 gi|355508601|gb|AES89743.1| Cysteine proteinase [Medicago truncatula]
          Length = 345

 Score =  275 bits (702), Expect = 3e-71,   Method: Compositional matrix adjust.
 Identities = 150/334 (44%), Positives = 205/334 (61%), Gaps = 20/334 (5%)

Query: 9   VLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERF 68
           V+     C      ++ D  ++ ++F+ W  ++GR YK + E   RF I++ N+  ++  
Sbjct: 21  VIASESECPPTHKQKSSDVEAMKKRFDGWVKRHGRKYKHNDEREVRFGIYQANVQYIQCK 80

Query: 69  NNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLY-KSSQVPPSV 127
           N       SY L  NKFADLT +EF ++  G      S+ L+++ T F Y +   +P S 
Sbjct: 81  NAQ---KNSYNLTDNKFADLTNEEFQSTYMGL-----STRLRSHNTGFRYDEHGDLPESK 132

Query: 128 NWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGC 180
           +W ++GAVT +  QGQC       AVAAVEGIN IK  +L+SLSEQ+L+DC     N GC
Sbjct: 133 DWRKEGAVTEIMDQGQCGGCWAFAAVAAVEGINKIKSGKLISLSEQELIDCDVKSGNQGC 192

Query: 181 YGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESL 240
            GG M+ A+ +II+N G+T +  Y YEG+  G C   KA  +AA I+ YE+VP ++E  L
Sbjct: 193 QGGLMETAYTFIIENGGLTTEQDYPYEGVD-GTCKMEKAAHYAASISGYEEVPADNEAKL 251

Query: 241 LKAVANQPVSVAIDAS--ALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNS 298
             A A+QPVSVAIDA   + QFYS GVF+G C   LNHGVT VGYG  E   KYW++KNS
Sbjct: 252 KAAAAHQPVSVAIDAGGYSFQFYSEGVFSGICGKQLNHGVTVVGYG-KETINKYWIVKNS 310

Query: 299 WGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
           WG DWGE GY R++RD    +G CGIAM AS+P+
Sbjct: 311 WGADWGESGYIRMKRDTLSKEGMCGIAMQASYPL 344


>gi|242070333|ref|XP_002450443.1| hypothetical protein SORBIDRAFT_05g005530 [Sorghum bicolor]
 gi|241936286|gb|EES09431.1| hypothetical protein SORBIDRAFT_05g005530 [Sorghum bicolor]
          Length = 351

 Score =  275 bits (702), Expect = 3e-71,   Method: Compositional matrix adjust.
 Identities = 165/349 (47%), Positives = 220/349 (63%), Gaps = 36/349 (10%)

Query: 6   LIVVLIISG--SCASQA----TYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFK 59
           L+ VL I+    CA  A    +   + E ++  + E+W  ++GRTYK+ AE ++RF++FK
Sbjct: 18  LLTVLAIANCIGCAVAARDLSSSTGYGEEAMTARHEKWMVEHGRTYKDEAEKARRFQVFK 77

Query: 60  DNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTP---F 116
            N   V+  +NAA G + Y L +N+FAD+T  EF+A  TGFK       L A G     F
Sbjct: 78  ANAAFVDT-SNAAAGGKKYHLAINRFADMTHDEFMARYTGFK------PLPATGKKMPGF 130

Query: 117 LYK----SSQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSE 165
            Y     SS+   +V+W +KGAVT VK Q +C       AVAA+EG++ I    LVSLSE
Sbjct: 131 KYANVTLSSEDQQAVDWRKKGAVTDVKNQQKCGCCWAFSAVAAIEGMHQINTGELVSLSE 190

Query: 166 QQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQ 225
           QQLVDC+TN NNNGC GG M+DAF+Y+I N GI  +A Y Y  M  G+C +++    A  
Sbjct: 191 QQLVDCSTNGNNNGCGGGTMEDAFQYVIGNNGIATEAAYPYTAMQ-GMCQNVQP---AVA 246

Query: 226 ITNYEDVPPNDEESLLKAVANQPVSVAIDASALQFYSGGVFNG-YCETFLNHGVTAVGYG 284
           + +Y+ VP +DE++L  AVA QPVSVA+DA+  QFY GGV     C T LNH VTAVGYG
Sbjct: 247 VRSYQQVPRDDEDALAAAVAGQPVSVAVDANNFQFYKGGVMTADSCGTNLNHAVTAVGYG 306

Query: 285 TSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVS 333
           T+E+G  YWL+KN WG  WGE+GY RLQR +    G CG+A  AS+PV+
Sbjct: 307 TAEDGTPYWLLKNQWGSTWGEEGYLRLQRGV----GACGVAKDASYPVA 351


>gi|1169186|sp|P43156.1|CYSP_HEMSP RecName: Full=Thiol protease SEN102; Flags: Precursor
 gi|396568|emb|CAA52425.1| thiol-protease [Hemerocallis hybrid cultivar]
          Length = 360

 Score =  274 bits (701), Expect = 4e-71,   Method: Compositional matrix adjust.
 Identities = 158/361 (43%), Positives = 217/361 (60%), Gaps = 26/361 (7%)

Query: 1   MAKYFLIVVLIISGSCASQATYRTFDEGSIAEK------FEQWKAQYGRTYKESAENSKR 54
           MAK   I + +++ S  S A    F E  +A +      +E+W+  +    ++  E ++R
Sbjct: 1   MAKPKFIALALVALSFLSIAQSIPFTEKDLASEDSLWNLYEKWRTHH-TVARDLDEKNRR 59

Query: 55  FEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSS--SLKAN 112
           F +FK+N+  +  FN     +  Y L LNKF D+T QEF +   G K+  H S   ++ N
Sbjct: 60  FNVFKENVKFIHEFNQKK--DAPYKLALNKFGDMTNQEFRSKYAGSKIQHHRSQRGIQKN 117

Query: 113 GTPFLYKSSQVPP--SVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSL 163
              F+Y++    P  S++W  KGAVT VK QGQC        +A+VEGIN IK   LVSL
Sbjct: 118 TGSFMYENVGSLPAASIDWRAKGAVTGVKDQGQCGSCWAFSTIASVEGINQIKTGELVSL 177

Query: 164 SEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHA 223
           SEQ+LVDC T+  N GC GG MD AF++I Q  GIT +  Y Y     G C S       
Sbjct: 178 SEQELVDCDTS-YNEGCNGGLMDYAFEFI-QKNGITTEDSYPY-AEQDGTCASNLLNSPV 234

Query: 224 AQITNYEDVPPNDEESLLKAVANQPVSVAIDASA--LQFYSGGVFNGYCETFLNHGVTAV 281
             I  ++DVP N+E +L++AVANQP+SV+I+AS    QFYS GVF G C T L+HGV  V
Sbjct: 235 VSIDGHQDVPANNENALMQAVANQPISVSIEASGYGFQFYSEGVFTGRCGTELDHGVAIV 294

Query: 282 GYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVSKESAQPSS 341
           GYG + +G KYW++KNSWG++WGE GY R+QR I   +G+CGIAM AS+P+ K SA P +
Sbjct: 295 GYGATRDGTKYWIVKNSWGEEWGESGYIRMQRGISDKRGKCGIAMEASYPI-KTSANPKN 353

Query: 342 A 342
           +
Sbjct: 354 S 354


>gi|18418684|ref|NP_567983.1| Xylem cysteine proteinase 1 [Arabidopsis thaliana]
 gi|71153408|sp|O65493.1|XCP1_ARATH RecName: Full=Xylem cysteine proteinase 1; Short=AtXCP1; Flags:
           Precursor
 gi|6708181|gb|AAF25831.1|AF191027_1 papain-type cysteine endopeptidase XCP1 [Arabidopsis thaliana]
 gi|3080415|emb|CAA18734.1| cysteine proteinase-like protein [Arabidopsis thaliana]
 gi|7270487|emb|CAB80252.1| cysteine proteinase-like protein [Arabidopsis thaliana]
 gi|26449881|dbj|BAC42063.1| putative cysteine proteinase [Arabidopsis thaliana]
 gi|28827736|gb|AAO50712.1| unknown protein [Arabidopsis thaliana]
 gi|332661101|gb|AEE86501.1| Xylem cysteine proteinase 1 [Arabidopsis thaliana]
          Length = 355

 Score =  274 bits (701), Expect = 4e-71,   Method: Compositional matrix adjust.
 Identities = 150/313 (47%), Positives = 196/313 (62%), Gaps = 17/313 (5%)

Query: 30  IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLT 89
           + E FE W +++ + YK   E   RFE+F++NL+ +++ NN      SY L LN+FADLT
Sbjct: 47  LLELFESWMSEHSKAYKSVEEKVHRFEVFRENLMHIDQRNNEI---NSYWLGLNEFADLT 103

Query: 90  PQEFIASQTGFKMSDHSSSLKANGTPFLYKS-SQVPPSVNWIEKGAVTPVKYQGQCA--- 145
            +EF     G      S   + +   F Y+  + +P SV+W +KGAV PVK QGQC    
Sbjct: 104 HEEFKGRYLGLAKPQFSRKRQPSAN-FRYRDITDLPKSVDWRKKGAVAPVKDQGQCGSCW 162

Query: 146 ----VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITND 201
               VAAVEGIN I    L SLSEQ+L+DC T   N+GC GG MD AF+YII   G+  +
Sbjct: 163 AFSTVAAVEGINQITTGNLSSLSEQELIDCDTT-FNSGCNGGLMDYAFQYIISTGGLHKE 221

Query: 202 AVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA--LQ 259
             Y Y  M  GIC   K +     I+ YEDVP ND+ESL+KA+A+QPVSVAI+AS    Q
Sbjct: 222 DDYPYL-MEEGICQEQKEDVERVTISGYEDVPENDDESLVKALAHQPVSVAIEASGRDFQ 280

Query: 260 FYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQ 319
           FY GGVFNG C T L+HGV AVGYG+S +G  Y ++KNSWG  WGE G+ R++R+  +P+
Sbjct: 281 FYKGGVFNGKCGTDLDHGVAAVGYGSS-KGSDYVIVKNSWGPRWGEKGFIRMKRNTGKPE 339

Query: 320 GQCGIAMFASFPV 332
           G CGI   AS+P 
Sbjct: 340 GLCGINKMASYPT 352


>gi|242086591|ref|XP_002439128.1| hypothetical protein SORBIDRAFT_09g000960 [Sorghum bicolor]
 gi|241944413|gb|EES17558.1| hypothetical protein SORBIDRAFT_09g000960 [Sorghum bicolor]
          Length = 371

 Score =  274 bits (701), Expect = 4e-71,   Method: Compositional matrix adjust.
 Identities = 155/316 (49%), Positives = 195/316 (61%), Gaps = 24/316 (7%)

Query: 34  FEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEF 93
           FE+W A+Y + Y    E   RFE+FKDNL  ++  N       +Y L LN FADLT  EF
Sbjct: 66  FEEWVAKYRKAYASFEEKLHRFEVFKDNLHHIDEANKKVT---TYWLGLNAFADLTHDEF 122

Query: 94  IASQTGFKMSDHSSSLKANGTPFLY---KSSQVPPSVNWIEKGAVTPVKYQGQCA----- 145
            A+  G +  +   + K   + F Y       VP SV+W +KGAVT VK QGQC      
Sbjct: 123 KATYLGLRQPE---TKKTTDSRFRYGGVADDDVPASVDWRKKGAVTDVKNQGQCGSCWAF 179

Query: 146 --VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAV 203
             VAAVEGIN I    L SLSEQ+LVDC+T D NNGC GG MD+AF YI  + G+  +  
Sbjct: 180 STVAAVEGINQIVTGNLTSLSEQELVDCST-DGNNGCNGGVMDNAFSYIASSGGLRTEEA 238

Query: 204 YSYEGMSTGICDSIKAED--HAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA--LQ 259
           Y Y  M  G CD  KA D      I+ YEDVP NDE++L+KA+A+QP+SVAI+AS    Q
Sbjct: 239 YPYL-MEEGDCDD-KARDGEQVVTISGYEDVPANDEQALVKALAHQPLSVAIEASGRHFQ 296

Query: 260 FYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQ 319
           FYSGGVFNG C + L+HGV AVGYG+S +G  Y ++KNSWG  WGE GY R++R   +P+
Sbjct: 297 FYSGGVFNGPCGSELDHGVAAVGYGSS-KGQDYIIVKNSWGSHWGEKGYIRMKRGTGKPE 355

Query: 320 GQCGIAMFASFPVSKE 335
           G CGI   AS+P   +
Sbjct: 356 GLCGINKMASYPTKDQ 371


>gi|414589857|tpg|DAA40428.1| TPA: Vignain [Zea mays]
          Length = 377

 Score =  274 bits (701), Expect = 4e-71,   Method: Compositional matrix adjust.
 Identities = 150/333 (45%), Positives = 205/333 (61%), Gaps = 36/333 (10%)

Query: 30  IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLT 89
           + E+FEQW  ++GR Y ++ E  +R E+++ N+  VE FN+  +GN  Y L  NKFADLT
Sbjct: 50  MLERFEQWMGRHGRLYADAGEKQRRLEVYRRNVELVETFNS--MGN-GYRLADNKFADLT 106

Query: 90  PQEFIASQTGFKM------SDHS---SSLKANGTPFLYKS--SQVPPSVNWIEKGAVTPV 138
            +EF A   GF        + HS   S++   G+  + +   S +P SV+W EKGAV PV
Sbjct: 107 NEEFRAKMLGFGRPRSGGGAGHSTAPSTVACIGSGLMGRQGYSDLPKSVDWREKGAVAPV 166

Query: 139 KYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKY 191
           K QG C       AVAA+EGIN IK  +LVSLSEQ+LVDC T     GC GG+M  AF++
Sbjct: 167 KSQGDCGSCWAFSAVAAIEGINQIKNGKLVSLSEQELVDCDTK--AIGCAGGYMSWAFEF 224

Query: 192 IIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSV 251
           +++N+G+T +  Y Y+G++ G C + K ++ A  I+ Y +V P+ E  LL+A A QPVSV
Sbjct: 225 VMKNRGLTTERNYPYQGLN-GACQTPKLKESAVSISGYMNVTPSSEPDLLRAAAAQPVSV 283

Query: 252 AIDASAL--QFYSGGVFNGYCETFLNHGVTAVGYGTSE----------EGIKYWLIKNSW 299
           A+DA +   Q Y GGVF G C   LNHGVT VGYG ++           G KYW++KNSW
Sbjct: 284 AVDAGSFVWQLYGGGVFTGPCTAELNHGVTVVGYGETQGDTDGDGSGVPGKKYWIVKNSW 343

Query: 300 GQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
           G +WG+ GY  +QR+     G CGIAM  S+PV
Sbjct: 344 GPEWGDAGYILMQREASVASGLCGIAMLPSYPV 376


>gi|109390302|gb|ABG33750.1| cysteine protease [Hevea brasiliensis]
          Length = 457

 Score =  274 bits (700), Expect = 5e-71,   Method: Compositional matrix adjust.
 Identities = 150/328 (45%), Positives = 202/328 (61%), Gaps = 18/328 (5%)

Query: 17  ASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNR 76
            +++++RT DE  +   +E W  ++G+ Y    E  +RFE+FKDNL  ++  N+    NR
Sbjct: 27  GTKSSWRTDDE--VMAIYEDWLVKHGKAYNSLGEKERRFEVFKDNLRFIDEHNSE---NR 81

Query: 77  SYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYK-SSQVPPSVNWIEKGAV 135
           +Y + LN+FADLT +E+ +   G       + L+     +  +    +P SV+W ++GAV
Sbjct: 82  TYRVGLNRFADLTNEEYRSMYLGALSGIRRNKLRKISDRYTPRVGDSLPDSVDWRKEGAV 141

Query: 136 TPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDA 188
             VK QG C       AVAAVEGIN I    L+SLSEQ+LVDC  N  N GC GG MD  
Sbjct: 142 VGVKDQGSCGSCWAFSAVAAVEGINKIVTGDLISLSEQELVDC-DNSYNEGCNGGLMDYG 200

Query: 189 FKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQP 248
           F++II N GI ++  Y Y     G CD+ +       I +YEDVP N+E +L KAVANQP
Sbjct: 201 FEFIINNGGIDSEEDYPYLARD-GRCDTYRKNARVVSIDSYEDVPVNNEAALQKAVANQP 259

Query: 249 VSVAIDASA--LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGED 306
           VSVAI+A     Q YS GVF+G C T L+HGV AVGYGT E G  YW+++NSWG+ WGE 
Sbjct: 260 VSVAIEAGGRDFQLYSSGVFSGRCGTALDHGVVAVGYGT-ENGQDYWIVRNSWGKSWGES 318

Query: 307 GYFRLQRDIDQPQGQCGIAMFASFPVSK 334
           GY R+ R+I +P G CGIAM AS+P+ K
Sbjct: 319 GYLRMARNIRKPTGICGIAMEASYPIKK 346


>gi|226507844|ref|NP_001148894.1| LOC100282514 precursor [Zea mays]
 gi|194703250|gb|ACF85709.1| unknown [Zea mays]
 gi|195622994|gb|ACG33327.1| vignain precursor [Zea mays]
          Length = 356

 Score =  274 bits (700), Expect = 5e-71,   Method: Compositional matrix adjust.
 Identities = 150/333 (45%), Positives = 205/333 (61%), Gaps = 36/333 (10%)

Query: 30  IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLT 89
           + E+FEQW  ++GR Y ++ E  +R E+++ N+  VE FN+  +GN  Y L  NKFADLT
Sbjct: 29  MLERFEQWMGRHGRLYADAGEKQRRLEVYRRNVELVETFNS--MGN-GYRLADNKFADLT 85

Query: 90  PQEFIASQTGFKM------SDHS---SSLKANGTPFLYKS--SQVPPSVNWIEKGAVTPV 138
            +EF A   GF        + HS   S++   G+  + +   S +P SV+W EKGAV PV
Sbjct: 86  NEEFRAKMLGFGRPRSGGGAGHSTAPSTVACIGSGLMGRQGYSDLPKSVDWREKGAVAPV 145

Query: 139 KYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKY 191
           K QG C       AVAA+EGIN IK  +LVSLSEQ+LVDC T     GC GG+M  AF++
Sbjct: 146 KSQGDCGSCWAFSAVAAIEGINQIKNGKLVSLSEQELVDCDTK--AIGCAGGYMSWAFEF 203

Query: 192 IIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSV 251
           +++N+G+T +  Y Y+G++ G C + K ++ A  I+ Y +V P+ E  LL+A A QPVSV
Sbjct: 204 VMKNRGLTTERNYPYQGLN-GACQTPKLKESAVSISGYMNVTPSSEPDLLRAAAAQPVSV 262

Query: 252 AIDASAL--QFYSGGVFNGYCETFLNHGVTAVGYGTSE----------EGIKYWLIKNSW 299
           A+DA +   Q Y GGVF G C   LNHGVT VGYG ++           G KYW++KNSW
Sbjct: 263 AVDAGSFVWQLYGGGVFTGPCTAELNHGVTVVGYGETQGDTDGDGSGVPGKKYWIVKNSW 322

Query: 300 GQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
           G +WG+ GY  +QR+     G CGIAM  S+PV
Sbjct: 323 GPEWGDAGYILMQREASVASGLCGIAMLPSYPV 355


>gi|356564154|ref|XP_003550321.1| PREDICTED: cysteine proteinase RD21a-like [Glycine max]
          Length = 476

 Score =  274 bits (700), Expect = 5e-71,   Method: Compositional matrix adjust.
 Identities = 152/324 (46%), Positives = 202/324 (62%), Gaps = 18/324 (5%)

Query: 20  ATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYT 79
           AT RT +E  +   +EQW  ++G+ Y    E  KRF+IFKDNL  ++  N+A   +R+Y 
Sbjct: 47  ATLRTEEE--LMSMYEQWLVKHGKVYNALGEKEKRFQIFKDNLRFIDDHNSAE--DRTYK 102

Query: 80  LRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYK-SSQVPPSVNWIEKGAVTPV 138
           L LN+FADLT +E+ A   G K+  +    K     +  +   ++P SV+W ++GAV PV
Sbjct: 103 LGLNRFADLTNEEYRAKYLGTKIDPNRRLGKTPSNRYAPRVGDKLPDSVDWRKEGAVPPV 162

Query: 139 KYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKY 191
           K QG C       A+ AVEGIN I    L+SLSEQ+LVDC T   N GC GG MD AF++
Sbjct: 163 KDQGGCGSCWAFSAIGAVEGINKIVTGELISLSEQELVDCDTG-YNQGCNGGLMDYAFEF 221

Query: 192 IIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSV 251
           II N GI +D  Y Y G+  G CD+ +       I +YEDVP  DE +L KAVANQPVSV
Sbjct: 222 IINNGGIDSDEDYPYRGVD-GRCDTYRKNAKVVSIDDYEDVPAYDELALKKAVANQPVSV 280

Query: 252 AIDASA--LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYF 309
           AI+      Q Y  GVF G C T L+HGV AVGYGT+ +G  YW+++NSWG  WGEDGY 
Sbjct: 281 AIEGGGREFQLYVSGVFTGRCGTALDHGVVAVGYGTA-KGHDYWIVRNSWGSSWGEDGYI 339

Query: 310 RLQRDI-DQPQGQCGIAMFASFPV 332
           RL+R++ +   G+CGIA+  S+P+
Sbjct: 340 RLERNLANSRSGKCGIAIEPSYPL 363


>gi|307110445|gb|EFN58681.1| hypothetical protein CHLNCDRAFT_56822 [Chlorella variabilis]
          Length = 466

 Score =  273 bits (699), Expect = 6e-71,   Method: Compositional matrix adjust.
 Identities = 152/351 (43%), Positives = 209/351 (59%), Gaps = 30/351 (8%)

Query: 6   LIVVLIISGSCASQAT------YRTFDEGSIA---EKFEQWKAQYGRTYKESAENSKRFE 56
           L  VL+++ SC + A       +R F + ++    E F+ W     R Y  + E  +RF+
Sbjct: 3   LSCVLLVACSCLAVAAGFPFENHRLFIQQAVESPREAFDFWVQTLKRAYASAEEYERRFD 62

Query: 57  IFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSS-SLKANGTP 115
           ++ DNL  V  +N    G+ S+ L +  +ADL+  E+ +   G+    H    L+A   P
Sbjct: 63  VWLDNLRFVHEYN---AGHTSHWLSMGVYADLSQDEYRSKALGYNADLHEERPLRA--AP 117

Query: 116 FLYKSSQVPPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQL 168
           FLY+ +  P  V+W+ KGAVTPVK Q  C          AVEG +AI   +L SLSEQ L
Sbjct: 118 FLYEGTVPPKEVDWVAKGAVTPVKNQLLCGSCWAFSTTGAVEGASAIATGKLASLSEQML 177

Query: 169 VDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITN 228
           VDC   + +NGC+GG MD AF++I++N GI  +  Y Y     G+C   K   H   I +
Sbjct: 178 VDC-DRERDNGCHGGLMDFAFEFIMKNGGIDTEDDYPYTA-EEGMCQDNKMRRHVVTIDD 235

Query: 229 YEDVPPNDEESLLKAVANQPVSVAIDAS--ALQFYSGGVFNGYCETFLNHGVTAVGYGTS 286
           Y+DVPPNDE +L+KAVANQPVSVAI+A   A Q Y GGVF+  C T L+HGV  VGYGT+
Sbjct: 236 YQDVPPNDEHALMKAVANQPVSVAIEADQRAFQLYGGGVFDAECGTALDHGVLVVGYGTA 295

Query: 287 EEG---IKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVSK 334
             G   + YWL+KNSWG +WG+ GY RL R++ + +GQCG+AM ASFP+ K
Sbjct: 296 SNGTHHLPYWLVKNSWGAEWGDKGYIRLLRNLGE-EGQCGVAMQASFPIKK 345


>gi|449454309|ref|XP_004144898.1| PREDICTED: xylem cysteine proteinase 1-like [Cucumis sativus]
 gi|449471311|ref|XP_004153272.1| PREDICTED: xylem cysteine proteinase 1-like [Cucumis sativus]
          Length = 349

 Score =  273 bits (699), Expect = 6e-71,   Method: Compositional matrix adjust.
 Identities = 149/314 (47%), Positives = 195/314 (62%), Gaps = 19/314 (6%)

Query: 32  EKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQ 91
           E FE W +++ + Y+   E   RFEIF DNL  ++  N       SY L LN+FADL+ +
Sbjct: 45  ELFESWMSKHSKAYRSIEEKLHRFEIFLDNLKHIDETNKKV---SSYWLGLNEFADLSHE 101

Query: 92  EFIASQTGFKMSDHSSSLKANGTPFLYKSSQ-VPPSVNWIEKGAVTPVKYQGQCA----- 145
           EF +   G ++       K +   F Y   + +P SV+W  KGAVTPVK QG C      
Sbjct: 102 EFKSKYLGLRVE---FPRKRSSRGFSYGDVEDLPESVDWRTKGAVTPVKNQGSCGSCWAF 158

Query: 146 --VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAV 203
             VAAVEGIN I    L SLSEQ+L+DC     NNGCYGG MD AF+YI+ N G+  +  
Sbjct: 159 STVAAVEGINQIVTGNLTSLSEQELIDC-DRSFNNGCYGGLMDYAFQYIMSNSGLRKEED 217

Query: 204 YSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA--LQFY 261
           Y Y  M  G C   K +     I+ YEDVP NDE+SLLKA+++QPVSVAI+AS+   QFY
Sbjct: 218 YPYL-MEEGRCIREKEQFEVVTISGYEDVPANDEQSLLKALSHQPVSVAIEASSRNFQFY 276

Query: 262 SGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQ 321
            GG+F G C T ++HGVTAVGYG+SE G  Y ++KNSWG  WGE+GY R++R+  +P+G 
Sbjct: 277 KGGIFTGRCGTQMDHGVTAVGYGSSE-GTDYIIVKNSWGPKWGENGYIRMKRNTGKPEGL 335

Query: 322 CGIAMFASFPVSKE 335
           CGI   AS+P  ++
Sbjct: 336 CGINQMASYPTKEK 349


>gi|194352750|emb|CAQ00103.1| papain-like cysteine proteinase [Hordeum vulgare subsp. vulgare]
 gi|326514262|dbj|BAJ92281.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326519402|dbj|BAJ96700.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326524351|dbj|BAK00559.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326531998|dbj|BAK01375.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 356

 Score =  273 bits (699), Expect = 6e-71,   Method: Compositional matrix adjust.
 Identities = 157/348 (45%), Positives = 211/348 (60%), Gaps = 31/348 (8%)

Query: 8   VVLIISGSCASQAT------YRTFDEGS---IAEKFEQWKAQYGRTYKESAENSKRFEIF 58
           ++L+  G+C ++ +      Y   D  S   + E FE+W A++ + Y    E   RFE+F
Sbjct: 14  LLLLCVGACVARNSDFSIVGYSEEDLSSNERLVELFEKWLAKHQKAYASFEEKLHRFEVF 73

Query: 59  KDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLY 118
           KDNL  +++ N       SY L LN+FADLT  EF A+  G    D + + + +   F Y
Sbjct: 74  KDNLKHIDKINREVT---SYWLGLNEFADLTHDEFKAAYLGL---DAAPARRGSSRSFRY 127

Query: 119 K---SSQVPPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQL 168
           +   +S +P SV+W +KGAVT VK QGQC        VAAVEGINAI    L +LSEQ+L
Sbjct: 128 EDVSASDLPKSVDWRKKGAVTEVKNQGQCGSCWAFSTVAAVEGINAIVTGNLTALSEQEL 187

Query: 169 VDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGIC-DSIKAEDHAAQIT 227
           +DC+  D N+GC GG MD AF YI  + G+  +  Y Y  M  G C D  KAE  A  I+
Sbjct: 188 IDCSV-DGNSGCNGGLMDYAFSYIASSGGLHTEEAYPYL-MEEGSCGDGKKAESEAVTIS 245

Query: 228 NYEDVPPNDEESLLKAVANQPVSVAIDASA--LQFYSGGVFNGYCETFLNHGVTAVGYGT 285
            YEDVP NDE++L+KA+A+QPVSVAI+AS    QFYSGGVF+G C   L+HGV AVGYG+
Sbjct: 246 GYEDVPANDEQALIKALAHQPVSVAIEASGRHFQFYSGGVFDGPCGAQLDHGVAAVGYGS 305

Query: 286 SE-EGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
            + +G  Y +++NSWG  WGE GY R++R     +G CGI   AS+P 
Sbjct: 306 DKGKGHDYIIVRNSWGAQWGEKGYIRMKRGTSNGEGLCGINKMASYPT 353


>gi|224131910|ref|XP_002328138.1| predicted protein [Populus trichocarpa]
 gi|222837653|gb|EEE76018.1| predicted protein [Populus trichocarpa]
          Length = 349

 Score =  273 bits (699), Expect = 7e-71,   Method: Compositional matrix adjust.
 Identities = 151/316 (47%), Positives = 196/316 (62%), Gaps = 19/316 (6%)

Query: 30  IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLT 89
           + E FE W + +G+ Y    E   RFE+FK+NL  +++ N       SY L LN+FADL+
Sbjct: 43  LVELFESWISGHGKAYNSLEEKLHRFEVFKENLKHIDQRNKEVT---SYWLGLNEFADLS 99

Query: 90  PQEFIASQTGFKMSDHSSSLKANGTPFLYKSS-QVPPSVNWIEKGAVTPVKYQGQCA--- 145
            +EF +   G          K +   F Y+    +P S++W +KGAVTPVK QG C    
Sbjct: 100 HEEFKSKFLGLYPE---FPRKKSSEDFSYRDVVDLPKSIDWRKKGAVTPVKNQGSCGSCW 156

Query: 146 ----VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITND 201
               VAAVEGIN I    L SLSEQQL+DC T+  NNGC GG MD AF++I+ N G+  +
Sbjct: 157 AFSTVAAVEGINQIVAGNLTSLSEQQLIDCDTS-FNNGCNGGLMDYAFEFIVNNGGLHKE 215

Query: 202 AVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA--LQ 259
             Y Y  M  G CD  + E     I+ Y DVP NDE+SLLKA+A+QP+SVAIDAS    Q
Sbjct: 216 EDYPYL-MEEGTCDEKREEMEVVTISGYHDVPRNDEQSLLKALAHQPLSVAIDASGRDFQ 274

Query: 260 FYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQ 319
           FYSGGVF+G C T L+HGV AVGYG+S  GI Y ++KNSWG  WGE GY R++R+  +P+
Sbjct: 275 FYSGGVFSGPCGTDLDHGVAAVGYGSSS-GIDYIIVKNSWGPKWGERGYLRMKRNTGKPE 333

Query: 320 GQCGIAMFASFPVSKE 335
           G CGI   AS+P  ++
Sbjct: 334 GLCGINKMASYPTKQK 349


>gi|226495425|ref|NP_001148706.1| cysteine protease 1 precursor [Zea mays]
 gi|195621544|gb|ACG32602.1| cysteine protease 1 precursor [Zea mays]
          Length = 463

 Score =  273 bits (698), Expect = 9e-71,   Method: Compositional matrix adjust.
 Identities = 149/311 (47%), Positives = 193/311 (62%), Gaps = 14/311 (4%)

Query: 34  FEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAA-IGNRSYTLRLNKFADLTPQE 92
           + +W A +GRTY    E  +R+++F+DNL  ++  N AA  G  S+ L LN+FADLT  E
Sbjct: 41  YAEWMAAHGRTYNAVGEEERRYQVFRDNLRYIDAHNAAADAGVHSFRLGLNRFADLTNDE 100

Query: 93  FIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQCA------- 145
           + A+  G +        K         +  +P SV+W  KGAV  VK QG C        
Sbjct: 101 YRATYLGARTRPQRER-KLGARYHAADNEDLPESVDWRAKGAVAEVKDQGSCGSCWAFST 159

Query: 146 VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYS 205
           +AAVEGIN I    L+SLSEQ+LVDC T+  N GC GG MD AF++II N GI  +  Y 
Sbjct: 160 IAAVEGINQIVTGDLISLSEQELVDCDTS-YNQGCNGGLMDYAFEFIINNGGIDTEKDYP 218

Query: 206 YEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDA--SALQFYSG 263
           Y+G + G CD  +       I +YEDVP NDE+SL KAVANQPVSVAI+A  +A Q YS 
Sbjct: 219 YKG-TDGRCDVNRKNAKVVTIDSYEDVPANDEKSLQKAVANQPVSVAIEAAGTAFQLYSS 277

Query: 264 GVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCG 323
           G+F G C T L+HGVTAVGYGT E G  YW++KNSWG  WGE GY R++R+I    G+CG
Sbjct: 278 GIFTGSCGTALDHGVTAVGYGT-ENGKDYWIVKNSWGSSWGESGYVRMERNIKASSGKCG 336

Query: 324 IAMFASFPVSK 334
           IA+  S+P+ +
Sbjct: 337 IAVEPSYPLKE 347


>gi|351629615|gb|AEQ54771.1| KDDL-tailed cysteine proteinase CP4 [Coffea canephora]
          Length = 359

 Score =  273 bits (698), Expect = 9e-71,   Method: Compositional matrix adjust.
 Identities = 158/358 (44%), Positives = 223/358 (62%), Gaps = 29/358 (8%)

Query: 1   MAKYFL---IVVLIISGSCASQATYRTF-DEGSIAEKFEQWKAQYGRTYKESAENSKRFE 56
           M K FL   ++ +I+  + + + T R    E S+ + +E+W++ +    ++ +E  KRF 
Sbjct: 3   MGKAFLFAVVLAVILVAAMSMEITERDLASEESLWDLYERWRSHH-TVSRDLSEKRKRFN 61

Query: 57  IFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTP---QEFIASQTGFKMSDHSSSLKANG 113
           +FK N+  + + N     ++ Y L+LN FAD+T    +EF +S+       H S  +AN 
Sbjct: 62  VFKANVHHIHKVNQK---DKPYKLKLNSFADMTNHEFREFYSSKVKHYRMLHGS--RAN- 115

Query: 114 TPFLY-KSSQVPPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSE 165
           T F++ K+  +P SV+W ++GAVT VK QG+C        V  VEGIN IK  +LVSLSE
Sbjct: 116 TGFMHGKTESLPASVDWRKQGAVTGVKNQGKCGSCWAFSTVVGVEGINKIKTGQLVSLSE 175

Query: 166 QQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQ 225
           Q+LVDC T+  N GC GG M++A+++I ++ GIT + +Y Y+    G CDS K    A  
Sbjct: 176 QELVDCETD--NEGCNGGLMENAYEFIKKSGGITTERLYPYKARD-GSCDSSKMNAPAVT 232

Query: 226 ITNYEDVPPNDEESLLKAVANQPVSVAIDASA--LQFYSGGVFNG-YCETFLNHGVTAVG 282
           I  +E VP NDE +L+KAVANQPVSVAIDAS   +QFYS GV+ G  C   L+HGV  VG
Sbjct: 233 IDGHEMVPANDENALMKAVANQPVSVAIDASGSDMQFYSEGVYAGDSCGNELDHGVAVVG 292

Query: 283 YGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQ-CGIAMFASFPVSKESAQP 339
           YGT+ +G KYW++KNSWG  WGE GY R+QR +D  +G  CGIAM AS+P+   S  P
Sbjct: 293 YGTALDGTKYWIVKNSWGTGWGEQGYIRMQRGVDAAEGGVCGIAMEASYPLKLSSHNP 350


>gi|414585111|tpg|DAA35682.1| TPA: cysteine proteinase Mir3 [Zea mays]
          Length = 468

 Score =  273 bits (698), Expect = 1e-70,   Method: Compositional matrix adjust.
 Identities = 149/311 (47%), Positives = 193/311 (62%), Gaps = 14/311 (4%)

Query: 34  FEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAA-IGNRSYTLRLNKFADLTPQE 92
           + +W A +GRTY    E  +R+++F+DNL  ++  N AA  G  S+ L LN+FADLT  E
Sbjct: 46  YAEWMAAHGRTYNAVGEEERRYQVFRDNLRYIDAHNAAADAGVHSFRLGLNRFADLTNDE 105

Query: 93  FIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQCA------- 145
           + A+  G +        K         +  +P SV+W  KGAV  VK QG C        
Sbjct: 106 YRATYLGARTRPQRER-KLGARYHAADNEDLPESVDWRAKGAVAEVKDQGSCGSCWAFST 164

Query: 146 VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYS 205
           +AAVEGIN I    L+SLSEQ+LVDC T+  N GC GG MD AF++II N GI  +  Y 
Sbjct: 165 IAAVEGINQIVTGDLISLSEQELVDCDTS-YNQGCNGGLMDYAFEFIINNGGIDTEKDYP 223

Query: 206 YEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDA--SALQFYSG 263
           Y+G + G CD  +       I +YEDVP NDE+SL KAVANQPVSVAI+A  +A Q YS 
Sbjct: 224 YKG-TDGRCDVNRKNAKVVTIDSYEDVPANDEKSLQKAVANQPVSVAIEAAGTAFQLYSS 282

Query: 264 GVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCG 323
           G+F G C T L+HGVTAVGYGT E G  YW++KNSWG  WGE GY R++R+I    G+CG
Sbjct: 283 GIFTGSCGTALDHGVTAVGYGT-ENGKDYWIVKNSWGSSWGESGYVRMERNIKASSGKCG 341

Query: 324 IAMFASFPVSK 334
           IA+  S+P+ +
Sbjct: 342 IAVEPSYPLKE 352


>gi|297598407|ref|NP_001045533.2| Os01g0971400 [Oryza sativa Japonica Group]
 gi|15289977|dbj|BAB63672.1| putative cysteine protease CP1 [Oryza sativa Japonica Group]
 gi|125529282|gb|EAY77396.1| hypothetical protein OsI_05384 [Oryza sativa Indica Group]
 gi|125573472|gb|EAZ14987.1| hypothetical protein OsJ_04922 [Oryza sativa Japonica Group]
 gi|215740756|dbj|BAG97412.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|215741010|dbj|BAG97505.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|215765325|dbj|BAG87022.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|215767338|dbj|BAG99566.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|255674119|dbj|BAF07447.2| Os01g0971400 [Oryza sativa Japonica Group]
          Length = 365

 Score =  273 bits (698), Expect = 1e-70,   Method: Compositional matrix adjust.
 Identities = 154/322 (47%), Positives = 198/322 (61%), Gaps = 26/322 (8%)

Query: 30  IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLT 89
           + E FE++ A+Y + Y    E  +RFE+FKDNL  ++  N    G   Y L LN+FADLT
Sbjct: 48  LMELFEKFMAKYRKAYSSLEEKLRRFEVFKDNLNHIDEENKKITG---YWLGLNEFADLT 104

Query: 90  PQEFIASQTGFKMSDHSSSLKANGTPFLYK---SSQVPPSVNWIEKGAVTPVKYQGQCA- 145
             EF A+  G  ++   +   +N   F Y+   ++ +P  V+W +KGAVT VK QGQC  
Sbjct: 105 HDEFKAAYLGLTLT--PARRNSNDQLFRYEEVEAASLPKEVDWRKKGAVTEVKNQGQCGS 162

Query: 146 ------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGIT 199
                 VAAVEGINAI    L  LSEQ+L+DC T D NNGC GG MD AF YI  N G+ 
Sbjct: 163 CWAFSTVAAVEGINAIVTGNLTRLSEQELIDCDT-DGNNGCSGGLMDYAFSYIAANGGLH 221

Query: 200 NDAVYSYEGMSTGIC--DSIKAEDH-----AAQITNYEDVPPNDEESLLKAVANQPVSVA 252
            +  Y Y  M  G C   S + +D      A  I+ YEDVP N+E++LLKA+A+QPVSVA
Sbjct: 222 TEESYPYL-MEEGTCRRGSTEGDDDGEAAAAVTISGYEDVPRNNEQALLKALAHQPVSVA 280

Query: 253 IDASA--LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFR 310
           I+AS    QFYSGGVF+G C T L+HGVTAVGYGT+ +G  Y ++KNSWG  WGE GY R
Sbjct: 281 IEASGRNFQFYSGGVFDGPCGTRLDHGVTAVGYGTASKGHDYIIVKNSWGSHWGEKGYIR 340

Query: 311 LQRDIDQPQGQCGIAMFASFPV 332
           ++R   +  G CGI   AS+P 
Sbjct: 341 MRRGTGKHDGLCGINKMASYPT 362


>gi|168006315|ref|XP_001755855.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162693174|gb|EDQ79528.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 454

 Score =  273 bits (697), Expect = 1e-70,   Method: Compositional matrix adjust.
 Identities = 152/319 (47%), Positives = 197/319 (61%), Gaps = 20/319 (6%)

Query: 26  DEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKF 85
           +E  ++E+F  W  ++G+ Y    E++ R+ ++KDNL  ++R +     NRSY L L KF
Sbjct: 38  NERLLSEQFGAWAHKHGKVYSSLEEHAHRYMVWKDNLEYIQRHSEK---NRSYWLGLTKF 94

Query: 86  ADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQC- 144
           AD+T  EF    TG ++     S +  G  F Y  S+ P SV+W +KGAVT VK QG C 
Sbjct: 95  ADITNDEFRRQYTGTRIDRSKRSKRKTG--FRYADSEAPESVDWRKKGAVTTVKDQGSCG 152

Query: 145 ------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGI 198
                 A+ +VEGINAI+    VSLSEQ+LVDC   + N GC GG MD AF +I++N GI
Sbjct: 153 SCWAFSAIGSVEGINAIRTGEAVSLSEQELVDCDL-EYNQGCNGGLMDYAFDFILENGGI 211

Query: 199 TNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA- 257
             +  Y Y+G+  G CD+ K   H   I  YEDVP NDEE+L KAVA QPVSVAI+A   
Sbjct: 212 DTENDYPYKGLD-GRCDNNKKNAHVVTIDGYEDVPENDEEALKKAVAGQPVSVAIEAGGR 270

Query: 258 -LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDID 316
             Q YSGGVF G C T L+HGV AVGYG SE  + YW++KNSWG+ WGE GY R+QR+I 
Sbjct: 271 DFQLYSGGVFTGECGTDLDHGVLAVGYG-SEGSLDYWIVKNSWGEYWGESGYLRMQRNIK 329

Query: 317 QPQ---GQCGIAMFASFPV 332
                 G CGI +  S+ V
Sbjct: 330 DSNHQFGLCGINIEPSYAV 348


>gi|38345188|emb|CAE03344.2| OSJNBb0005B05.11 [Oryza sativa Japonica Group]
 gi|125589403|gb|EAZ29753.1| hypothetical protein OsJ_13812 [Oryza sativa Japonica Group]
          Length = 323

 Score =  273 bits (697), Expect = 1e-70,   Method: Compositional matrix adjust.
 Identities = 140/338 (41%), Positives = 202/338 (59%), Gaps = 22/338 (6%)

Query: 1   MAKYFLIVVLIISGSCASQATYRTF-DEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFK 59
           MAK  L  +L     C++    R   D+ ++A + E+W AQYGR YK+ AE ++RFE+FK
Sbjct: 3   MAKALLFAILGCLCLCSAVLAARELSDDAAMAARHERWMAQYGRMYKDDAEKARRFEVFK 62

Query: 60  DNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYK 119
            N+  +E FN    GN  + L +N+FADLT  EF +++T       ++ +          
Sbjct: 63  ANVAFIESFN---AGNHKFWLGVNQFADLTNDEFRSTKTNKGFIPSTTRVPTGFRNENVN 119

Query: 120 SSQVPPSVNWIEKGAVTPVKYQGQCAVA-AVEGINAIKINRLVSLSEQQLVDCATNDNNN 178
              +P +++W  KG VTP+K QGQC    A   + A+          ++LVDC  +  + 
Sbjct: 120 IDALPATMDWRTKGVVTPIKDQGQCGCCWAFSAVAAM----------EELVDCDVHGEDQ 169

Query: 179 GCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKA-EDHAAQITNYEDVPPNDE 237
           GC GG MDDAFK+II+N G+T ++ Y Y  +     D  K+  +  A I  YEDVP N+E
Sbjct: 170 GCEGGLMDDAFKFIIKNGGLTTESNYPYAAVD----DKFKSVSNSVASIKGYEDVPANNE 225

Query: 238 ESLLKAVANQPVSVAIDASAL--QFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLI 295
            +L+KAVANQPVSVA+D   +  QFY GGV  G C T L+HG+ A+GYG + +G KYWL+
Sbjct: 226 AALMKAVANQPVSVAVDGGDMTFQFYKGGVMTGSCGTDLDHGIVAIGYGKASDGTKYWLL 285

Query: 296 KNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVS 333
           KNSWG  WGE+G+ R+++DI   +G CG+AM  S+P +
Sbjct: 286 KNSWGMTWGENGFLRMEKDISDKRGMCGLAMEPSYPTA 323


>gi|242038089|ref|XP_002466439.1| hypothetical protein SORBIDRAFT_01g007820 [Sorghum bicolor]
 gi|241920293|gb|EER93437.1| hypothetical protein SORBIDRAFT_01g007820 [Sorghum bicolor]
          Length = 353

 Score =  273 bits (697), Expect = 1e-70,   Method: Compositional matrix adjust.
 Identities = 155/352 (44%), Positives = 216/352 (61%), Gaps = 28/352 (7%)

Query: 5   FLIVVLIISGSCASQA------------TYRTFDEGSIAEKFEQWKAQYGRTYKESAENS 52
           F++ VL+++  C + A                    ++  + E+W A++GRTY + AE +
Sbjct: 6   FVLTVLVVASVCTAAAPRALAVRELAGEEESAAVAAAMVSRHEKWMAEHGRTYTDEAEKA 65

Query: 53  KRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKAN 112
           +R EIF+ N   ++ FN+A  G  S+ L  N+FADLT +EF A++TGF+     ++   +
Sbjct: 66  RRLEIFRANAEFIDSFNDA--GKHSHRLATNRFADLTDEEFRAARTGFRPRPAPAAAAGS 123

Query: 113 GTPFLYKS---SQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVS 162
           G  F Y++   +    SV+W   GAVT VK QG+C       AVAAVEG+N I+  RLVS
Sbjct: 124 GGRFRYENFSLADAAQSVDWRAMGAVTGVKDQGECGCCWAFSAVAAVEGLNKIRTGRLVS 183

Query: 163 LSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDH 222
           LSEQ+LVDC  N  + GC GG MDDAF++I +  G+ +++ Y Y+G   G C S  A   
Sbjct: 184 LSEQELVDCDVNGEDQGCEGGLMDDAFQFIERRGGLASESGYPYQG-DDGSCRSSAAAAR 242

Query: 223 AAQITNYEDVPPNDEESLLKAVANQPVSVAIDAS--ALQFYSGGVFNGYCETFLNHGVTA 280
           AA I  +EDVP N+E +L  AVANQPVSVAI+    A +FY  GV  G C T LNH +TA
Sbjct: 243 AASIRGHEDVPRNNEAALAAAVANQPVSVAINGEDYAFRFYDSGVLGGECGTDLNHAITA 302

Query: 281 VGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
           VGYGT+ +G KYWL+KNSWG  WGE GY R++R + + +G CG+A   S+PV
Sbjct: 303 VGYGTAADGSKYWLMKNSWGTSWGEGGYVRIRRGV-RGEGVCGLAKLPSYPV 353


>gi|413953666|gb|AFW86315.1| hypothetical protein ZEAMMB73_539008 [Zea mays]
          Length = 314

 Score =  273 bits (697), Expect = 1e-70,   Method: Compositional matrix adjust.
 Identities = 148/345 (42%), Positives = 206/345 (59%), Gaps = 51/345 (14%)

Query: 3   KYFLIVVLIISGSCASQATYRTF-DEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDN 61
           K  ++ +L  +  C +    R   D+ ++  + EQW AQY R YK+++E ++RF      
Sbjct: 5   KASILAILGFAFFCGAALAARDLSDDSAMVARHEQWMAQYSRVYKDASEKARRF------ 58

Query: 62  LVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQT--GFKMSDHSSSLKANGTPFLYK 119
                                 KFADLT  EF + +T  GFK    SS++K   T F Y+
Sbjct: 59  ----------------------KFADLTNHEFRSVKTNKGFK----SSNMKIL-TGFRYE 91

Query: 120 ---SSQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLV 169
              +  +P +++W  KG VTP+K QGQC       AVAA EGI  I   +LVSL++Q+LV
Sbjct: 92  NVSADALPTTIDWRTKGVVTPIKDQGQCGCCSAFSAVAATEGIVKISTGKLVSLADQELV 151

Query: 170 DCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNY 229
           DC  +  + GC GG MDDAFK+II+N G+T ++ Y Y   + G C+S    + AA I  Y
Sbjct: 152 DCDVHGEDQGCEGGLMDDAFKFIIKNGGLTTESSYPYTA-ADGKCNS--GSNSAATIKGY 208

Query: 230 EDVPPNDEESLLKAVANQPVSVAIDASAL--QFYSGGVFNGYCETFLNHGVTAVGYGTSE 287
           EDVP NDE +L+KA+ANQPVSVA+D   +  +FYSGGV  G C T L+HG+ A+GYG + 
Sbjct: 209 EDVPANDEAALMKAMANQPVSVAVDGGDMTFRFYSGGVMTGSCGTDLDHGIAAIGYGKTS 268

Query: 288 EGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
           +G KYWL+KNSWG  WGE+GY R+++DI   +G CG+AM  S+P 
Sbjct: 269 DGTKYWLMKNSWGTTWGENGYLRMEKDISDKRGMCGLAMEPSYPT 313


>gi|600111|emb|CAA84378.1| cysteine proteinase [Vicia sativa]
          Length = 359

 Score =  273 bits (697), Expect = 1e-70,   Method: Compositional matrix adjust.
 Identities = 167/356 (46%), Positives = 218/356 (61%), Gaps = 31/356 (8%)

Query: 9   VLIISGSCASQATY-RTFD--------EGSIAEKFEQWKAQYGRTYKESAENSKRFEIFK 59
           +L IS S A   T   TFD        E S+   +E+W++ +  T +   E   RF +FK
Sbjct: 6   LLFISLSLALIFTVANTFDFNEHDLESEKSLWNLYERWRSHHTVT-RNLDEKHNRFNVFK 64

Query: 60  DNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHS---SSLKANGTPF 116
            N++ V   N     ++ Y L+LNKF D+T  EF       K+S H         NGT F
Sbjct: 65  ANVMHVHNTNKL---DKPYKLKLNKFGDMTNYEFRRIYADSKISHHRMFRGMSHENGT-F 120

Query: 117 LYKSS-QVPPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQL 168
           +Y+++  VP S++W  KGAVT VK QGQC        +AAVEGIN IK  +LVSLSEQQL
Sbjct: 121 MYENAVDVPSSIDWRNKGAVTGVKDQGQCGSCWAFSTIAAVEGINQIKTQKLVSLSEQQL 180

Query: 169 VDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITN 228
           VDC T +N  GC GG M+ AF++I QN GIT ++ Y Y     G CD ++ ED A  I  
Sbjct: 181 VDCDTEENE-GCNGGLMEYAFEFIKQN-GITTESNYPY-AAKDGTCD-VEKEDKAVSIDG 236

Query: 229 YEDVPPNDEESLLKAVANQPVSVAIDASA--LQFYSGGVFNGYCETFLNHGVTAVGYGTS 286
           +E+VP N+E +LLKA A QPVSVAIDA     QFYS GVF G+C+T LNHGV  VGYG +
Sbjct: 237 HENVPINNEAALLKAAAKQPVSVAIDAGGYNFQFYSEGVFTGHCDTDLNHGVAIVGYGVT 296

Query: 287 EEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVSKESAQPSSA 342
           ++  KYW++KNSWG +WGE GY R+QR I   +G CGIAM AS+P+ K S +P+ +
Sbjct: 297 QDRTKYWIMKNSWGSEWGEQGYIRMQRGISSREGLCGIAMEASYPIKKSSTKPTES 352


>gi|413919736|gb|AFW59668.1| cysteine protease 1 [Zea mays]
          Length = 469

 Score =  273 bits (697), Expect = 1e-70,   Method: Compositional matrix adjust.
 Identities = 153/328 (46%), Positives = 199/328 (60%), Gaps = 16/328 (4%)

Query: 18  SQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAA-IGNR 76
           S  +Y    E      + +W A +GRTY    E  +RFE+F+DNL  V+  N AA  G  
Sbjct: 30  SIVSYGERSEEEARRMYAEWMAAHGRTYNAVGEEERRFEVFRDNLRYVDAHNAAADAGVH 89

Query: 77  SYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQ-VPPSVNWIEKGAV 135
           S+ L LN+FADLT  E+ A+  G +        +  G  +L   ++ +P SV+W  KGAV
Sbjct: 90  SFRLGLNRFADLTNDEYRATYLGVRSRPQRE--RRLGDRYLAGDNEDLPESVDWRAKGAV 147

Query: 136 TPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDA 188
             VK QG C        +AAVEGIN I    ++SLSEQ+LVDC T+  N GC GG MD A
Sbjct: 148 AEVKDQGSCGSCWAFSTIAAVEGINQIVTGDMISLSEQELVDCDTS-YNQGCNGGLMDYA 206

Query: 189 FKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQP 248
           F++II N GI  +  Y Y+G + G CD  +       I +YEDVP N E+SL KAVANQP
Sbjct: 207 FEFIINNGGIDTEEDYPYKG-TDGRCDVNRKNAKVVTIDSYEDVPANSEKSLQKAVANQP 265

Query: 249 VSVAIDAS--ALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGED 306
           +SVAI+A   A Q Y+ G+F G C T L+HGVTAVGYGT E G  YW++KNSWG  WGE 
Sbjct: 266 ISVAIEAGGRAFQLYNSGIFTGTCGTALDHGVTAVGYGT-ENGKDYWIVKNSWGSSWGES 324

Query: 307 GYFRLQRDIDQPQGQCGIAMFASFPVSK 334
           GY R++R+I    G+CGIA+  S+P+ K
Sbjct: 325 GYVRMERNIKASSGKCGIAVEPSYPLKK 352


>gi|18394919|ref|NP_564126.1| Xylem cysteine proteinase 2 [Arabidopsis thaliana]
 gi|71153409|sp|Q9LM66.2|XCP2_ARATH RecName: Full=Xylem cysteine proteinase 2; Short=AtXCP2; Flags:
           Precursor
 gi|4836904|gb|AAD30607.1|AC007369_17 Putative cysteine proteinase [Arabidopsis thaliana]
 gi|6708183|gb|AAF25832.1|AF191028_1 papain-type cysteine endopeptidase XCP2 [Arabidopsis thaliana]
 gi|28466959|gb|AAO44088.1| At1g20850 [Arabidopsis thaliana]
 gi|110743795|dbj|BAE99733.1| putative cysteine proteinase [Arabidopsis thaliana]
 gi|332191910|gb|AEE30031.1| Xylem cysteine proteinase 2 [Arabidopsis thaliana]
          Length = 356

 Score =  272 bits (696), Expect = 2e-70,   Method: Compositional matrix adjust.
 Identities = 149/313 (47%), Positives = 192/313 (61%), Gaps = 16/313 (5%)

Query: 30  IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLT 89
           + E FE W + + + Y+   E   RFE+FKDNL  ++  N      +SY L LN+FADL+
Sbjct: 47  LIELFENWISNFEKAYETVEEKFLRFEVFKDNLKHIDETNKKG---KSYWLGLNEFADLS 103

Query: 90  PQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQ-VPPSVNWIEKGAVTPVKYQGQCA--- 145
            +EF     G K        + +   F Y+  + VP SV+W +KGAV  VK QG C    
Sbjct: 104 HEEFKKMYLGLKTDIVRRDEERSYAEFAYRDVEAVPKSVDWRKKGAVAEVKNQGSCGSCW 163

Query: 146 ----VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITND 201
               VAAVEGIN I    L +LSEQ+L+DC T   NNGC GG MD AF+YI++N G+  +
Sbjct: 164 AFSTVAAVEGINKIVTGNLTTLSEQELIDCDTT-YNNGCNGGLMDYAFEYIVKNGGLRKE 222

Query: 202 AVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA--LQ 259
             Y Y  M  G C+  K E     I  ++DVP NDE+SLLKA+A+QP+SVAIDAS    Q
Sbjct: 223 EDYPY-SMEEGTCEMQKDESETVTINGHQDVPTNDEKSLLKALAHQPLSVAIDASGREFQ 281

Query: 260 FYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQ 319
           FYSGGVF+G C   L+HGV AVGYG+S+ G  Y ++KNSWG  WGE GY RL+R+  +P+
Sbjct: 282 FYSGGVFDGRCGVDLDHGVAAVGYGSSK-GSDYIIVKNSWGPKWGEKGYIRLKRNTGKPE 340

Query: 320 GQCGIAMFASFPV 332
           G CGI   ASFP 
Sbjct: 341 GLCGINKMASFPT 353


>gi|226496089|ref|NP_001149658.1| cysteine protease 1 precursor [Zea mays]
 gi|195629242|gb|ACG36262.1| cysteine protease 1 precursor [Zea mays]
          Length = 469

 Score =  272 bits (695), Expect = 2e-70,   Method: Compositional matrix adjust.
 Identities = 149/312 (47%), Positives = 195/312 (62%), Gaps = 16/312 (5%)

Query: 34  FEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAA-IGNRSYTLRLNKFADLTPQE 92
           + +W A +GRTY    E  +RFE+F+DNL  V+  N AA  G  S+ L LN+FADLT  E
Sbjct: 46  YAEWMAAHGRTYNAVGEEERRFEVFRDNLRYVDAHNAAADAGVHSFRLGLNRFADLTNDE 105

Query: 93  FIASQTGFKMSDHSSSLKANGTPFLYKSSQ-VPPSVNWIEKGAVTPVKYQGQCA------ 145
           + A+  G +        +  G  +L   ++ +P SV+W  KGAV  +K QG C       
Sbjct: 106 YRATYLGVRSRPQRE--RRLGDRYLAGDNEDLPESVDWRAKGAVAEIKDQGSCGSCWAFS 163

Query: 146 -VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVY 204
            +AAVEGIN I    ++SLSEQ+LVDC T+  N GC GG MD AF++II N GI  +  Y
Sbjct: 164 TIAAVEGINQIVTGDMISLSEQELVDCDTS-YNQGCNGGLMDYAFEFIINNGGIDTEEDY 222

Query: 205 SYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDAS--ALQFYS 262
            Y+G + G CD  +       I +YEDVP N E+SL KAVANQP+SVAI+A   A Q Y+
Sbjct: 223 PYKG-TDGRCDVNRKNAKVVTIDSYEDVPANSEKSLQKAVANQPISVAIEAGGRAFQLYN 281

Query: 263 GGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQC 322
            G+F G C T L+HGVTAVGYGT E G  YW++KNSWG  WGE GY R++R+I    G+C
Sbjct: 282 SGIFTGTCGTALDHGVTAVGYGT-ENGKDYWIVKNSWGSSWGESGYVRMERNIKASSGKC 340

Query: 323 GIAMFASFPVSK 334
           GIA+  S+P+ K
Sbjct: 341 GIAVEPSYPLKK 352


>gi|357162587|ref|XP_003579458.1| PREDICTED: oryzain beta chain-like [Brachypodium distachyon]
          Length = 470

 Score =  272 bits (695), Expect = 2e-70,   Method: Compositional matrix adjust.
 Identities = 149/312 (47%), Positives = 191/312 (61%), Gaps = 17/312 (5%)

Query: 37  WKAQYGRTYKES-AENSKRFEIFKDNLVAVERFN-NAAIGNRSYTLRLNKFADLTPQEFI 94
           W+A++G     S  E  +RF  F DNL  V+  N  AA G   + L +N+FADLT  EF 
Sbjct: 55  WRAEHGSGNSNSLGEEERRFRAFWDNLRFVDAHNARAAAGEEGFRLGMNRFADLTNDEFR 114

Query: 95  ASQTGFKMSDHSSSLKANGTPFLYKS---SQVPPSVNWIEKGAVTPVKYQGQC------- 144
           A+  G K +    S +A G    Y+     ++P +V+W EKGAV PVK QGQC       
Sbjct: 115 AAYLGVKGAGQRRSARA-GVGERYRHDGVEELPEAVDWREKGAVAPVKNQGQCGSCWAFS 173

Query: 145 AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVY 204
           AV+AVE IN +    LV+LSEQ+LV+C  N  +NGC GG MDDAF +II N GI  +  Y
Sbjct: 174 AVSAVESINQLVTGELVTLSEQELVECDINGQSNGCNGGLMDDAFDFIINNGGIDTEDDY 233

Query: 205 SYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA--LQFYS 262
            Y+ +  G CD  +       I  +EDVP NDE+SL KAVA+QPVSVAI+A     Q Y 
Sbjct: 234 PYKALD-GKCDINRRNAKVVSIDGFEDVPENDEKSLQKAVAHQPVSVAIEAGGREFQLYH 292

Query: 263 GGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQC 322
            GVF G C T L+HGV AVGYGT E G  YW+++NSWG  WGE GY R++R+I+   G+C
Sbjct: 293 SGVFTGRCGTELDHGVVAVGYGT-ENGKDYWIVRNSWGPKWGEAGYLRMERNINATTGKC 351

Query: 323 GIAMFASFPVSK 334
           GIAM +S+P  K
Sbjct: 352 GIAMMSSYPTKK 363


>gi|2414570|emb|CAB16317.1| cysteine proteinase precursor [Nicotiana tabacum]
          Length = 374

 Score =  272 bits (695), Expect = 2e-70,   Method: Compositional matrix adjust.
 Identities = 149/325 (45%), Positives = 201/325 (61%), Gaps = 20/325 (6%)

Query: 26  DEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKF 85
           DE  +  ++E W A++GR Y    E  KRFEIFKDNL  +E  NN+  GNR+Y + LN+F
Sbjct: 42  DEDQVKNRYEMWLAEHGRAYNALGEKEKRFEIFKDNLRFIEGHNNS--GNRTYKVGLNQF 99

Query: 86  ADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQ---VPPSVNWIEKGAVTPVKYQG 142
           ADLT +E+     G K       +K+      Y S     +P SV+W ++GAV P+K QG
Sbjct: 100 ADLTNEEYRTMYLGTKSDARRRFVKSKNPSQRYASRPNELMPHSVDWRKRGAVAPIKNQG 159

Query: 143 QCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQN 195
            C        VAAVEGIN I    +++LSEQ+LVDC     N+GC GG MD AF++II N
Sbjct: 160 SCGSCWAFSTVAAVEGINQIVTGEMITLSEQELVDC-DRVQNSGCNGGLMDYAFEFIISN 218

Query: 196 KGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDA 255
            G+  +  Y Y G+  G CD ++       I  YEDVP N E +L KAVA+QPV VAI+A
Sbjct: 219 GGMDTEKHYPYRGVE-GRCDPVRKNYKVVSIDGYEDVPRN-ERALQKAVAHQPVCVAIEA 276

Query: 256 S--ALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQR 313
           S  A Q YS GVF G C   ++HGV  VGYG SE+G+ YW+++NSWG  WGE+GY +++R
Sbjct: 277 SGRAFQLYSSGVFTGECGEEVDHGVVVVGYG-SEDGVDYWIVRNSWGTKWGENGYVKMER 335

Query: 314 DIDQPQ-GQCGIAMFASFPVSKESA 337
           ++ +   G+CGI   AS+P +K+SA
Sbjct: 336 NVKKSHLGKCGIMTEASYP-TKDSA 359


>gi|162463464|ref|NP_001104879.1| cysteine proteinase Mir3 precursor [Zea mays]
 gi|2425066|gb|AAB88263.1| cysteine proteinase Mir3 [Zea mays]
          Length = 480

 Score =  271 bits (694), Expect = 3e-70,   Method: Compositional matrix adjust.
 Identities = 148/311 (47%), Positives = 192/311 (61%), Gaps = 14/311 (4%)

Query: 34  FEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAA-IGNRSYTLRLNKFADLTPQE 92
           + +W A +GRTY       +R+++F+DNL  ++  N AA  G  S+ L LN+FADLT  E
Sbjct: 44  YAEWMAAHGRTYNAVGAEERRYQVFRDNLRYIDAHNAAADAGVHSFRLGLNRFADLTNDE 103

Query: 93  FIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQCA------- 145
           + A+  G +        K         +  +P SV+W  KGAV  VK QG C        
Sbjct: 104 YPATYLGARTRPQRDR-KLGARYHAADNEDLPESVDWRAKGAVAEVKDQGSCGTCWAFST 162

Query: 146 VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYS 205
           +AAVEGIN I    L+SLSEQ+LVDC T+  N GC GG MD AF++II N GI  +  Y 
Sbjct: 163 IAAVEGINQIVTGDLISLSEQELVDCDTS-YNQGCNGGLMDYAFEFIINNGGIDTEKDYP 221

Query: 206 YEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDA--SALQFYSG 263
           Y+G + G CD  +       I +YEDVP NDE+SL KAVANQPVSVAI+A  +A Q YS 
Sbjct: 222 YKG-TDGRCDVNRKNAKVVTIDSYEDVPANDEKSLQKAVANQPVSVAIEAAGTAFQLYSS 280

Query: 264 GVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCG 323
           G+F G C T L+HGVTAVGYGT E G  YW++KNSWG  WGE GY R++R+I    G+CG
Sbjct: 281 GIFTGSCGTRLDHGVTAVGYGT-ENGKDYWIVKNSWGSSWGESGYVRMERNIKASSGKCG 339

Query: 324 IAMFASFPVSK 334
           IA+  S+P+ +
Sbjct: 340 IAVEPSYPLKE 350


>gi|358343350|ref|XP_003635767.1| Cysteine proteinase [Medicago truncatula]
 gi|355501702|gb|AES82905.1| Cysteine proteinase [Medicago truncatula]
          Length = 338

 Score =  271 bits (693), Expect = 3e-70,   Method: Compositional matrix adjust.
 Identities = 150/338 (44%), Positives = 205/338 (60%), Gaps = 22/338 (6%)

Query: 7   IVVL---IISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLV 63
           IV+L   II+ +C    T  + +   + +++E W  +YGR Y++  E   RF+I++ N+ 
Sbjct: 9   IVILNLWIIASACPEIHTKNSTNPAVMKKRYETWLKRYGRHYRDREEWEVRFDIYQSNVQ 68

Query: 64  AVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQV 123
            +E +N+    N SY L  N+FAD+T +EF ++  G+         +       +K  ++
Sbjct: 69  YIEFYNSQ---NYSYKLIDNRFADITNEEFKSTYLGY-----LPRFRVQTEFRYHKHGEL 120

Query: 124 PPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDN 176
           P S++W +KGAVT VK QG+C       AVAAVEGIN IK   LVSLSEQQL+DC     
Sbjct: 121 PKSIDWRKKGAVTHVKDQGRCGSCWAFSAVAAVEGINKIKTENLVSLSEQQLIDCDIKSG 180

Query: 177 NNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPND 236
           N GC GG M  AF YI ++ GI     Y Y+G   G C+  KA+++A  I+ YE VP  +
Sbjct: 181 NEGCEGGDMYIAFNYIKKHGGIATAKEYPYKGRD-GNCNKSKAKNNAVTISGYESVPARN 239

Query: 237 EESLLKAVANQPVSVAIDAS--ALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWL 294
           E+ L  AVA+QPVS+A DA   A QFYS G+F+G C   LNHG+T VGYG  E G KYW+
Sbjct: 240 EKMLKAAVAHQPVSIATDAGGYAFQFYSKGIFSGSCGKNLNHGMTIVGYG-EENGDKYWI 298

Query: 295 IKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
           +KNSW  DWGE GY R++RD     G CGIAM A++PV
Sbjct: 299 VKNSWANDWGESGYVRMKRDTKDKDGTCGIAMDATYPV 336


>gi|124484387|dbj|BAF46304.1| cysteine proteinase precursor [Ipomoea nil]
          Length = 474

 Score =  271 bits (693), Expect = 4e-70,   Method: Compositional matrix adjust.
 Identities = 150/319 (47%), Positives = 197/319 (61%), Gaps = 16/319 (5%)

Query: 27  EGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFA 86
           E  + E FE W  ++G++Y    E  KRF+IF+DNL  ++  N  ++ NRSY L LN+FA
Sbjct: 43  EDEVKEMFESWLVKHGKSYNAVDEKDKRFKIFRDNLKYIDEKN--SLENRSYKLGLNRFA 100

Query: 87  DLTPQEFIASQTGFKMSDHSSSLKANGTPFL-YKSSQVPPSVNWIEKGAVTPVKYQGQCA 145
           D+T +E+     G K     + +K+    +       +P S++W EKGAVT VK QG C 
Sbjct: 101 DITNEEYRTGYLGAKRDASRNMVKSKSDRYAPVAGDSLPDSIDWREKGAVTGVKDQGSCG 160

Query: 146 -------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGI 198
                  +AAVEG+N +    L+SLSEQ+LVDC     N GC GG M  AF++II+N GI
Sbjct: 161 SCWAFSTIAAVEGVNQLATGNLISLSEQELVDC-DRKINQGCNGGDMGYAFQFIIKNGGI 219

Query: 199 TNDAVYSYEGMSTGICDSIKAED-HAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA 257
            ++  Y Y G   G CDS +  +   A I  YE+VP N+E+SL KAVANQPVSVAI+A  
Sbjct: 220 DSEEDYPYTG-KDGKCDSYRQNNAKVASIDGYEEVPVNNEKSLQKAVANQPVSVAIEAGG 278

Query: 258 --LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDI 315
              Q YS G+F G C T L+HGV AVGYGT E G+ YW++KNSWG  WGE GY R+QR++
Sbjct: 279 YDFQLYSSGIFTGSCGTDLDHGVAAVGYGT-ENGVDYWIVKNSWGDYWGEKGYVRMQRNV 337

Query: 316 DQPQGQCGIAMFASFPVSK 334
               G CGIAM AS+P  K
Sbjct: 338 KAKTGLCGIAMEASYPTKK 356


>gi|242077600|ref|XP_002448736.1| hypothetical protein SORBIDRAFT_06g032320 [Sorghum bicolor]
 gi|241939919|gb|EES13064.1| hypothetical protein SORBIDRAFT_06g032320 [Sorghum bicolor]
          Length = 467

 Score =  271 bits (692), Expect = 4e-70,   Method: Compositional matrix adjust.
 Identities = 147/321 (45%), Positives = 198/321 (61%), Gaps = 19/321 (5%)

Query: 27  EGSIAEKFEQWKAQYGRTYKES-AENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKF 85
           E  +   +E W  ++GR       E+  RF +F DNL  V+  N  A G   + L +N+F
Sbjct: 49  EAEVRAMYELWLVEHGRRVSNVLGEHDSRFRVFWDNLRFVDAHNERA-GEHGFRLGMNQF 107

Query: 86  ADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYK---SSQVPPSVNWIEKGAVTPVKYQG 142
           ADLT  EF A+  G ++    ++   N    +Y+   + ++P SV+W EKGAV PVK QG
Sbjct: 108 ADLTNDEFRAAYLGARIP---AARSGNAVGEMYRHDGAEELPESVDWREKGAVAPVKNQG 164

Query: 143 QC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQN 195
           QC       AV++VE IN I    +V+LSEQ+LV+C+T+  N+GC GG MD AF +II+N
Sbjct: 165 QCGSCWAFSAVSSVESINQIVTGEMVTLSEQELVECSTDGGNSGCNGGLMDAAFNFIIKN 224

Query: 196 KGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDA 255
            GI  +  Y Y+ +  G CD  +       I  +EDVP NDE+SL KAVA+QPVSVAI+A
Sbjct: 225 GGIDTEDDYPYKAVD-GKCDINRRNAKVVSIDAFEDVPENDEKSLQKAVAHQPVSVAIEA 283

Query: 256 SALQF--YSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQR 313
              QF  Y  GVF+G C T L+HGV AVGYGT E G  YW+++NSWG  WGE GY R++R
Sbjct: 284 GGRQFQLYKSGVFSGSCTTNLDHGVVAVGYGT-ENGKDYWIVRNSWGPKWGEAGYIRMER 342

Query: 314 DIDQPQGQCGIAMFASFPVSK 334
           +I+   G+CGIAM AS+P  K
Sbjct: 343 NINATTGKCGIAMMASYPTKK 363


>gi|358248896|ref|NP_001239703.1| uncharacterized protein LOC100799247 precursor [Glycine max]
 gi|255636729|gb|ACU18700.1| unknown [Glycine max]
          Length = 341

 Score =  271 bits (692), Expect = 4e-70,   Method: Compositional matrix adjust.
 Identities = 148/325 (45%), Positives = 206/325 (63%), Gaps = 20/325 (6%)

Query: 23  RTFDEGSI-AEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLR 81
           R    G I +E+ E+W AQYG+ YK++AE  KRF++FK+N+  +E FN  A G++ + L 
Sbjct: 23  RVMSRGLITSERHEKWMAQYGKVYKDAAEKEKRFQVFKNNVQFIESFN--AAGDKPFNLS 80

Query: 82  LNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKS-SQVPPSVNWIEKGAVTPVKY 140
           +N+FADL  +EF A     +    S    A  T F Y++ +++P +++W ++GAVTP+K 
Sbjct: 81  INQFADLHDEEFKALLNNVQ-KKASRVETATETSFRYENVTKIPSTMDWRKRGAVTPIKD 139

Query: 141 QG----QC----AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYI 192
           QG     C     VA VE ++ I    LVSLSEQ+LVDC   D+  GC GG++++AF++I
Sbjct: 140 QGYTCGSCWAFATVATVESLHQITTGELVSLSEQELVDCVRGDSE-GCRGGYVENAFEFI 198

Query: 193 IQNKGITNDAVYSYEGMSTGICDSIKAEDHA-AQITNYEDVPPNDEESLLKAVANQPVSV 251
               GIT++A Y Y+G        +K E H  A+I  YE VP N E++LLKAVANQPVSV
Sbjct: 199 ANKGGITSEAYYPYKGKDRSC--KVKKETHGVARIIGYESVPSNSEKALLKAVANQPVSV 256

Query: 252 AIDASAL--QFYSGGVFNGY-CETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGY 308
            IDA A+  +FYS G+F    C T L+H V  VGYG   +G KYWL+KNSW   WGE GY
Sbjct: 257 YIDAGAIAFKFYSSGIFEARNCGTHLDHAVAVVGYGKLRDGTKYWLVKNSWSTAWGEKGY 316

Query: 309 FRLQRDIDQPQGQCGIAMFASFPVS 333
            R++RDI   +G CGIA  AS+P++
Sbjct: 317 MRIKRDIRAKKGLCGIASNASYPIA 341


>gi|355344587|gb|AER60490.1| cysteine proteases [Gossypium hirsutum]
          Length = 371

 Score =  271 bits (692), Expect = 4e-70,   Method: Compositional matrix adjust.
 Identities = 151/338 (44%), Positives = 207/338 (61%), Gaps = 19/338 (5%)

Query: 7   IVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVE 66
           I  L ++ +  S +++R+ DE  +   ++ W  Q+G+ Y    E  KRFEIFKDNL  ++
Sbjct: 20  ISTLTLNQNHPSSSSWRSDDE--VMGLYKSWVIQHGKAYNGIGEEEKRFEIFKDNLRFID 77

Query: 67  RFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKAN--GTPFLYKS-SQV 123
             N+    N +Y L LNKFADLT QE+ A   G +       +K+    + + +++   +
Sbjct: 78  EHNSN--NNTTYKLGLNKFADLTNQEYRAKFLGTRTDPRRRLMKSKIPSSRYAHRAGDNL 135

Query: 124 PPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDN 176
           P SV+W + GAV+PVK QG C        +A VEGIN I    LVSLSEQ+LVDC  +  
Sbjct: 136 PDSVDWRDHGAVSPVKDQGSCGSCWAFSTIATVEGINKIVSGELVSLSEQELVDCDRS-Y 194

Query: 177 NNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPND 236
           + GC GG MD AF++I+ N GI  +  Y Y G +   CD  K       I  YEDVP N+
Sbjct: 195 DAGCNGGLMDYAFQFIMDNGGIDTEKDYPYLGFNNQ-CDPTKKNAKVVSIDGYEDVP-NN 252

Query: 237 EESLLKAVANQPVSVAIDAS--ALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWL 294
           E +L KAVA+QPVS+AI+A   A Q Y  GVFNG C   L+HGV AVGYGT + G  YW+
Sbjct: 253 ENALKKAVAHQPVSIAIEAGGRAFQLYESGVFNGECGLALDHGVVAVGYGTDDNGQDYWI 312

Query: 295 IKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
           ++NSWG +WGE+GY R++R+I+   G+CGIAM AS+PV
Sbjct: 313 VRNSWGSNWGENGYIRMERNINANTGKCGIAMEASYPV 350


>gi|357166359|ref|XP_003580684.1| PREDICTED: oryzain alpha chain-like [Brachypodium distachyon]
          Length = 456

 Score =  271 bits (692), Expect = 4e-70,   Method: Compositional matrix adjust.
 Identities = 153/327 (46%), Positives = 200/327 (61%), Gaps = 14/327 (4%)

Query: 18  SQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAA-IGNR 76
           S  +Y    E  +   + +W A+ GRTY    E  +RFE+F+DNL  V++ N AA  G  
Sbjct: 26  SIVSYGERSEEEVRRMYVEWMAENGRTYNAIGEEERRFEVFRDNLRYVDQHNAAADAGLH 85

Query: 77  SYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVT 136
           S+ L LN+FADLT +E+  +  G + +      + +G      + ++P SV+W EKGAV 
Sbjct: 86  SFRLGLNRFADLTNEEYRDTYLGVR-TKPVRERRLSGRYQAADNEELPESVDWREKGAVA 144

Query: 137 PVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAF 189
            VK QG C       A+AAVEGIN I    +++LSEQ+LVDC T+  N GC GG MD AF
Sbjct: 145 KVKDQGGCGSCWAFSAIAAVEGINQIVTGDMIALSEQELVDCDTS-YNQGCNGGLMDYAF 203

Query: 190 KYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPV 249
           ++II N GI ++  Y Y+      CD+ K       I  YEDVP N E SL KAVANQP+
Sbjct: 204 EFIINNGGIDSEEDYPYKERDN-RCDANKKNAKVVTIDGYEDVPVNSELSLKKAVANQPI 262

Query: 250 SVAIDAS--ALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDG 307
           SVAI+A   A Q Y  G+F G C T L+HGVTAVGYG SE G  YW++KNSWG  WGEDG
Sbjct: 263 SVAIEAGGRAFQLYKSGIFTGRCGTALDHGVTAVGYG-SENGKDYWIVKNSWGTVWGEDG 321

Query: 308 YFRLQRDIDQPQGQCGIAMFASFPVSK 334
           Y RL+R+I    G+CGIA+  S+P+ K
Sbjct: 322 YVRLERNIKATSGKCGIAIEPSYPLKK 348


>gi|168058022|ref|XP_001781010.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162667567|gb|EDQ54194.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 457

 Score =  271 bits (692), Expect = 5e-70,   Method: Compositional matrix adjust.
 Identities = 156/317 (49%), Positives = 197/317 (62%), Gaps = 21/317 (6%)

Query: 30  IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLT 89
           +A +F  W  ++G+ Y  + E + RF ++KDNL  ++R +     N SY L L KFADLT
Sbjct: 41  LAGQFAAWAHKHGKVYSAAEERAHRFLVWKDNLEYIQRHSEK---NLSYWLGLTKFADLT 97

Query: 90  PQEFIASQTGFKMSDHSSSLKA--NGT-PFLYKSSQVPPSVNWIEKGAVTPVKYQGQC-- 144
            +EF    TG ++ D S  LK   N T  F Y +S+ P S++W EKGAVT VK QG C  
Sbjct: 98  NEEFRRQYTGTRI-DRSRRLKKGRNATGSFRYANSEAPKSIDWREKGAVTSVKDQGSCGS 156

Query: 145 -----AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGIT 199
                AV +VEGINAI+    +SLS Q+LVDC     N GC GG MD AF ++IQN GI 
Sbjct: 157 CWAFSAVGSVEGINAIRTGDAISLSVQELVDC-DKKYNQGCNGGLMDYAFDFVIQNGGID 215

Query: 200 NDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA-- 257
            +  Y Y+G   G CD  K       I +YEDVP NDEE+L KAVA QPVSVAI+A    
Sbjct: 216 TEKDYPYQGYD-GRCDVNKMNARVVTIDSYEDVPENDEEALKKAVAGQPVSVAIEAGGRD 274

Query: 258 LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDI-- 315
            Q YSGGVF G C T L+HGV AVGYG SE+G+ YW++KNSWG+ WGE GY R+QR++  
Sbjct: 275 FQLYSGGVFTGRCGTDLDHGVLAVGYG-SEKGLDYWIVKNSWGEYWGESGYLRMQRNLKD 333

Query: 316 DQPQGQCGIAMFASFPV 332
           D   G CGI +  S+ V
Sbjct: 334 DNGYGLCGINIEPSYAV 350


>gi|307111936|gb|EFN60170.1| hypothetical protein CHLNCDRAFT_59551 [Chlorella variabilis]
          Length = 364

 Score =  270 bits (691), Expect = 6e-70,   Method: Compositional matrix adjust.
 Identities = 158/362 (43%), Positives = 207/362 (57%), Gaps = 36/362 (9%)

Query: 7   IVVLIISGSCASQAT------YRTFDEGSI---AEKFEQW----KAQYGRTYKESAE-NS 52
           + VL+++ SC + A       +R F + +I    E F+ W    K    R Y  SAE   
Sbjct: 10  LSVLLVACSCLAVAAGFRFENHRLFIQQAIESPREAFDFWVHTVKPPSNRAYASSAEVYE 69

Query: 53  KRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSS-LKA 111
           +RF I+ DNL     +N     + S+ L +  +ADL+  E+ +   G+    H    L+A
Sbjct: 70  RRFNIWLDNLRFAHEYNAR---HTSHWLSMGVYADLSQDEYRSKALGYNAHLHKKRPLRA 126

Query: 112 NGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLS 164
              PFLYK +  P  V+W+  GAVTPVK Q  C          AVEG NAI   +LVSLS
Sbjct: 127 --APFLYKGTVPPEEVDWVAGGAVTPVKDQLLCGSCWAFSTTGAVEGANAIATGKLVSLS 184

Query: 165 EQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAA 224
           EQ LVDC   + + GC GGFMD AF +I+ N GI  +  Y Y     GIC   +   H  
Sbjct: 185 EQMLVDC-DREYDTGCRGGFMDSAFDFIVNNGGIDTEDDYPYRA-EDGICQDNRTRRHVV 242

Query: 225 QITNYEDVPPNDEESLLKAVANQPVSVAIDAS--ALQFYSGGVFNGYCETFLNHGVTAVG 282
            I  Y+DVPPNDE +L+KAVA+QPVSVAI+A   A Q Y GGVF+  C T L+H V  VG
Sbjct: 243 TIDGYQDVPPNDENALMKAVAHQPVSVAIEADQLAFQLYGGGVFDAECGTALDHAVLVVG 302

Query: 283 YGTSEEG---IKYWLIKNSWGQDWGEDGYFRLQRDI--DQPQGQCGIAMFASFPVSKESA 337
           YGT+  G   + YWL+KNSWG +WGE GY RL R++  D P+GQCG+AM+ASFP+ K + 
Sbjct: 303 YGTASNGTHNLPYWLVKNSWGAEWGEKGYIRLLRNLGKDAPEGQCGLAMYASFPIKKGAN 362

Query: 338 QP 339
            P
Sbjct: 363 PP 364


>gi|115441717|ref|NP_001045138.1| Os01g0907600 [Oryza sativa Japonica Group]
 gi|5761329|dbj|BAA83473.1| cysteine endopeptidase [Oryza sativa]
 gi|20804884|dbj|BAB92565.1| cysteine endopeptidase [Oryza sativa Japonica Group]
 gi|56785107|dbj|BAD82745.1| putative cysteine proteinase [Oryza sativa Japonica Group]
 gi|113534669|dbj|BAF07052.1| Os01g0907600 [Oryza sativa Japonica Group]
 gi|119395242|gb|ABL74582.1| cysteine endopeptidase [Oryza sativa Japonica Group]
 gi|125528777|gb|EAY76891.1| hypothetical protein OsI_04850 [Oryza sativa Indica Group]
 gi|125573036|gb|EAZ14551.1| hypothetical protein OsJ_04473 [Oryza sativa Japonica Group]
          Length = 371

 Score =  270 bits (690), Expect = 8e-70,   Method: Compositional matrix adjust.
 Identities = 147/317 (46%), Positives = 198/317 (62%), Gaps = 18/317 (5%)

Query: 29  SIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADL 88
           ++ + +E+W+  +    +   E  +RF  FKDN+  +   N    G R Y LRLN+F D+
Sbjct: 41  ALWDLYERWQEHH-HVPRHHGEKHRRFGAFKDNVRYIHEHNKR--GGRGYRLRLNRFGDM 97

Query: 89  TPQEFIASQTGFKMSD-HSSSLKANGTP-FLYKSSQ-VPPSVNWIEKGAVTPVKYQGQCA 145
             +EF A+  G   +D     L A   P F+Y+  + +P +V+W  KGAVT VK QG+C 
Sbjct: 98  GREEFRATFAGSHANDLRRDGLAAPPLPGFMYEGVRDLPRAVDWRRKGAVTGVKDQGKCG 157

Query: 146 -------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGI 198
                  V +VEGINAI+  RLVSLSEQ+L+DC T DN+ GC GG M++AF+YI  + GI
Sbjct: 158 SCWAFSTVVSVEGINAIRTGRLVSLSEQELIDCDTADNS-GCQGGLMENAFEYIKHSGGI 216

Query: 199 TNDAVYSYEGMSTGICDSIKAEDHA-AQITNYEDVPPNDEESLLKAVANQPVSVAIDAS- 256
           T ++ Y Y   + G CD+++A       I  +++VP N E +L KAVANQPVSVAIDA  
Sbjct: 217 TTESAYPYR-AANGTCDAVRARRAPLVVIDGHQNVPANSEAALAKAVANQPVSVAIDAGD 275

Query: 257 -ALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDI 315
            + QFYS GVF G C T L+HGV  VGYG + +G +YW++KNSWG  WGE GY R+QRD 
Sbjct: 276 QSFQFYSDGVFAGDCGTDLDHGVAVVGYGETNDGTEYWIVKNSWGTAWGEGGYIRMQRDS 335

Query: 316 DQPQGQCGIAMFASFPV 332
               G CGIAM AS+PV
Sbjct: 336 GYDGGLCGIAMEASYPV 352


>gi|50355611|dbj|BAD29954.1| cysteine protease [Daucus carota]
          Length = 474

 Score =  270 bits (690), Expect = 8e-70,   Method: Compositional matrix adjust.
 Identities = 153/323 (47%), Positives = 198/323 (61%), Gaps = 20/323 (6%)

Query: 23  RTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRL 82
           RT D+  +   +E W  ++ + Y    E   RF IFKDN+  V+R N  ++ N+SY L L
Sbjct: 51  RTHDQ--LLSLYESWLVKHHKNYNALGEKETRFGIFKDNVGFVDRHN--SMRNQSYKLGL 106

Query: 83  NKFADLTPQEFIASQTGFKM--SDHSSSLKANGTPFLYKS-SQVPPSVNWIEKGAVTPVK 139
           NKFADLT  E+ +     KM   +  +        F+++    +P SV+W ++GAV PVK
Sbjct: 107 NKFADLTNDEYRSLYLSGKMMKRERKNEDGFRSDRFVFEDGDHLPESVDWRDRGAVAPVK 166

Query: 140 YQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYI 192
            QGQC        V AVEGIN I    L+SLSEQ+LVDC  N  N GC GG MD AF++I
Sbjct: 167 DQGQCGSCWAFSTVGAVEGINKIVTGELISLSEQELVDC-DNGYNQGCNGGLMDYAFEFI 225

Query: 193 IQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVA 252
           ++N GI  +  Y Y+G+  G+CD  +       I  YEDVP NDE+SL KAVA+QPVSVA
Sbjct: 226 VKNGGIDTEDDYPYKGVD-GLCDQNRKNAKVVTINGYEDVPHNDEKSLKKAVAHQPVSVA 284

Query: 253 IDAS--ALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFR 310
           I+A   A Q Y  GVF G C T L+HGV AVGYG SE G  YW+++NSWG DWGE GY R
Sbjct: 285 IEAGGRAFQLYESGVFTGQCGTELDHGVVAVGYG-SENGKDYWIVRNSWGPDWGESGYIR 343

Query: 311 LQRDI-DQPQGQCGIAMFASFPV 332
           L+R++     G+CGIAM AS+P 
Sbjct: 344 LERNVASTSTGKCGIAMQASYPT 366


>gi|226506492|ref|NP_001140873.1| uncharacterized protein LOC100272949 precursor [Zea mays]
 gi|194701540|gb|ACF84854.1| unknown [Zea mays]
          Length = 379

 Score =  270 bits (689), Expect = 9e-70,   Method: Compositional matrix adjust.
 Identities = 157/365 (43%), Positives = 215/365 (58%), Gaps = 33/365 (9%)

Query: 1   MAKYFLIVVLIISGSCASQATYRT-FDEGSIA------EKFEQWKAQYGRTYKESAENSK 53
           ++K  L+V L+   S A +      FDE  +A      + +E+W+  + R ++   E  +
Sbjct: 4   VSKTLLLVALVFVSSAAVELCRAIDFDERDLASDEALWDLYERWQTHH-RVHRHHGEKGR 62

Query: 54  RFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSD----HSSSL 109
           RF  FK+N+  +   N    G+R Y LRLN+F D+  +EF ++    +++D     S + 
Sbjct: 63  RFGTFKENVRFIHAHNKR--GDRPYRLRLNRFGDMGREEFRSTFADSRINDLRRQDSPAA 120

Query: 110 KANGTP-FLYKSSQVPP-SVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRL 160
           +A   P F+Y S+  PP SV+W ++GAVT VK QG C        V AVEGINAI+   L
Sbjct: 121 RAGAVPGFMYDSAADPPRSVDWRQEGAVTGVKVQGHCGSCWAFSTVVAVEGINAIRTGSL 180

Query: 161 VSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAE 220
            SLSEQ+L+DC T++N  GC GG M++AF++I    GIT +A Y Y   S G CD  +A 
Sbjct: 181 ASLSEQELIDCDTDEN--GCQGGLMENAFEFIKSFGGITTEAAYPYR-ASNGTCDGDRAR 237

Query: 221 ---DHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDAS--ALQFYSGGVFNGYCETFLN 275
                   I  ++ VP   E++L KAVA+QPVSVA+DA   A QFYS GVF G C T L+
Sbjct: 238 RGGGVVVVIDGHQMVPAGSEDALAKAVAHQPVSVAVDAGGQAFQFYSEGVFTGDCGTDLD 297

Query: 276 HGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVSKE 335
           HGV AVGYG  ++G  YW++KNSWG  WGE GY R+QR      G CGIAM ASFP+ K 
Sbjct: 298 HGVAAVGYGVGDDGTPYWIVKNSWGTSWGEGGYIRMQRGAGN-GGLCGIAMEASFPI-KT 355

Query: 336 SAQPS 340
           S  P+
Sbjct: 356 SPNPA 360


>gi|28192373|gb|AAK07730.1| CPR1-like cysteine proteinase [Nicotiana tabacum]
          Length = 374

 Score =  270 bits (689), Expect = 9e-70,   Method: Compositional matrix adjust.
 Identities = 148/325 (45%), Positives = 200/325 (61%), Gaps = 20/325 (6%)

Query: 26  DEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKF 85
           DE  +  ++E W A++GR Y    E  KRFEIFKDNL  +E  NN+  GNR+Y + LN+F
Sbjct: 42  DEDQVKNRYEMWLAEHGRAYNALGEKEKRFEIFKDNLRFIEEHNNS--GNRTYKVGLNQF 99

Query: 86  ADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQ---VPPSVNWIEKGAVTPVKYQG 142
           ADLT +E+     G K       +K+      Y S     +P SV+W ++GAV P+K QG
Sbjct: 100 ADLTNEEYRTMYLGTKSDARRRFVKSKNPSQRYASRPNELMPHSVDWRKRGAVAPIKNQG 159

Query: 143 QCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQN 195
            C        VAAV GIN I    +++LSEQ+LVDC     N+GC GG MD AF++II N
Sbjct: 160 SCGSCWAFSTVAAVGGINQIVTGEMITLSEQELVDC-DRVQNSGCNGGLMDYAFEFIISN 218

Query: 196 KGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDA 255
            G+  +  Y Y G+  G CD ++       I  YEDVP N E +L KAVA+QPV VAI+A
Sbjct: 219 GGMDTEKHYPYRGVE-GRCDPVRKNYKVVSIDGYEDVPRN-ERALQKAVAHQPVCVAIEA 276

Query: 256 S--ALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQR 313
           S  A Q YS GVF G C   ++HGV  VGYG SE+G+ YW+++NSWG  WGE+GY +++R
Sbjct: 277 SGRAFQLYSSGVFTGECGEEVDHGVVVVGYG-SEDGVDYWIVRNSWGTKWGENGYVKMER 335

Query: 314 DIDQPQ-GQCGIAMFASFPVSKESA 337
           ++ +   G+CGI   AS+P +K+SA
Sbjct: 336 NVKKSHLGKCGIMTEASYP-TKDSA 359


>gi|226503129|ref|NP_001149806.1| LOC100283433 precursor [Zea mays]
 gi|195634783|gb|ACG36860.1| xylem cysteine proteinase 2 precursor [Zea mays]
 gi|219884977|gb|ACL52863.1| unknown [Zea mays]
          Length = 377

 Score =  270 bits (689), Expect = 1e-69,   Method: Compositional matrix adjust.
 Identities = 153/314 (48%), Positives = 195/314 (62%), Gaps = 18/314 (5%)

Query: 30  IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLT 89
           +   FE+W A+Y + Y    E  +RFE+FKDNL  ++  N   +   SY L LN FADLT
Sbjct: 68  LVRLFEEWVAKYRKAYGSFEEKLRRFEVFKDNLHHIDEANRKEV--TSYWLGLNAFADLT 125

Query: 90  PQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQCA---- 145
             EF A+  G  +   +S  +           +VP SV+W +KGAVT VK QGQC     
Sbjct: 126 HDEFKATYLGL-LPKRTSGGRFRYGGVGDGGDEVPASVDWRKKGAVTEVKNQGQCGSCWA 184

Query: 146 ---VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDA 202
              VAAVEGIN I    L SLSEQQLVDC+T D NNGC GG MD+AF +I    G+ ++ 
Sbjct: 185 FSTVAAVEGINQIVTGNLTSLSEQQLVDCST-DGNNGCSGGVMDNAFSFIATGAGLRSEE 243

Query: 203 VYSYEGMSTGICDSIKAEDHAAQIT--NYEDVPPNDEESLLKAVANQPVSVAIDASA--L 258
            Y Y  M  G CD  +A D    +T   YEDVP NDE++L+KA+A+QPVSVAI+AS    
Sbjct: 244 AYPYL-MEEGDCDD-RARDGEVLVTISGYEDVPANDEQALVKALAHQPVSVAIEASGRHF 301

Query: 259 QFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQP 318
           QFYSGGVF+G C + L+HGV AVGYG+S+ G  Y ++KNSWG  WGE GY R++R   +P
Sbjct: 302 QFYSGGVFDGPCGSELDHGVAAVGYGSSK-GQDYIIVKNSWGTHWGEKGYIRMKRGTGKP 360

Query: 319 QGQCGIAMFASFPV 332
           +G CGI   AS+P 
Sbjct: 361 EGLCGINKMASYPT 374


>gi|414879123|tpg|DAA56254.1| TPA: hypothetical protein ZEAMMB73_708930 [Zea mays]
          Length = 368

 Score =  270 bits (689), Expect = 1e-69,   Method: Compositional matrix adjust.
 Identities = 153/341 (44%), Positives = 206/341 (60%), Gaps = 27/341 (7%)

Query: 25  FDEGSIA------EKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSY 78
           FDE  +A      + +E+W+  + R ++   E  +RF  FK+N   +   N    G+R Y
Sbjct: 27  FDERDLASDEALWDLYERWQTHH-RVHRHHGEKGRRFGTFKENARFIHAHNKR--GDRPY 83

Query: 79  TLRLNKFADLTPQEFIASQTGFKMSD-HSSSLKANGTP-FLYK-SSQVPPSVNWIEKGAV 135
            LRLN+F D+  +EF +     +++D       A   P F+Y  ++ +P SV+W +KGAV
Sbjct: 84  RLRLNRFGDMGREEFRSGFADSRINDLRREPTAAPAVPGFMYDDATDLPRSVDWRQKGAV 143

Query: 136 TPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDA 188
           T VK QG+C        V AVEGINAI+   LVSLSEQ+L+DC T++N  GC GG M++A
Sbjct: 144 TAVKNQGRCGSCWAFSTVVAVEGINAIRTGSLVSLSEQELIDCDTDEN--GCQGGLMENA 201

Query: 189 FKYIIQNKGITNDAVYSYEGMSTGICDSIKAE-DHAAQITNYEDVPPNDEESLLKAVANQ 247
           F++I  + GIT ++ Y Y   S G CD  +A       I  ++ VP   E++L KAVA+Q
Sbjct: 202 FEFIKSHGGITTESAYPYH-ASNGTCDGARARRGRVVAIDGHQAVPAGSEDALAKAVAHQ 260

Query: 248 PVSVAIDAS--ALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGE 305
           PVSVAIDA   ALQFYS GVF G C T L+HGV AVGYG S++G  YW++KNSWG  WGE
Sbjct: 261 PVSVAIDAGGQALQFYSEGVFTGDCGTDLDHGVAAVGYGVSDDGTPYWIVKNSWGPSWGE 320

Query: 306 DGYFRLQRDIDQPQGQCGIAMFASFPVSKESAQPSSADKSS 346
            GY R+QR      G CGIAM ASFP+ K S  PS   + +
Sbjct: 321 GGYIRMQRGTGN-GGLCGIAMEASFPI-KTSPNPSRKPRRA 359


>gi|356553978|ref|XP_003545327.1| PREDICTED: cysteine proteinase RD21a-like [Glycine max]
          Length = 496

 Score =  270 bits (689), Expect = 1e-69,   Method: Compositional matrix adjust.
 Identities = 153/333 (45%), Positives = 204/333 (61%), Gaps = 18/333 (5%)

Query: 11  IISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNN 70
           IIS   A  AT R+ +E  +   +EQW  ++G+ Y    E  KRF+IFKDNL  ++  N+
Sbjct: 58  IISYDNAHAATSRSDEE--LMSMYEQWLVKHGKVYNALGEKEKRFQIFKDNLRFIDDHNS 115

Query: 71  AAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYK-SSQVPPSVNW 129
               +R+Y L LN+FADLT +E+ A   G K+  +    K     +  +   ++P SV+W
Sbjct: 116 QE--DRTYKLGLNRFADLTNEEYRAKYLGTKIDPNRRLGKTPSNRYAPRVGDKLPESVDW 173

Query: 130 IEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYG 182
            ++GAV PVK QG C       A+ AVEGIN I    L+SLSEQ+LVDC T   N GC G
Sbjct: 174 RKEGAVPPVKDQGGCGSCWAFSAIGAVEGINKIVTGELISLSEQELVDCDTG-YNEGCNG 232

Query: 183 GFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLK 242
           G MD AF++II N GI ++  Y Y G+  G CD+ +       I +YEDVP  DE +L K
Sbjct: 233 GLMDYAFEFIINNGGIDSEEDYPYRGVD-GRCDTYRKNAKVVSIDDYEDVPAYDELALKK 291

Query: 243 AVANQPVSVAIDASA--LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWG 300
           AVANQPVSVAI+      Q Y  GVF G C T L+HGV AVGYGT+  G  YW+++NSWG
Sbjct: 292 AVANQPVSVAIEGGGREFQLYVSGVFTGRCGTALDHGVVAVGYGTA-NGHDYWIVRNSWG 350

Query: 301 QDWGEDGYFRLQRDI-DQPQGQCGIAMFASFPV 332
             WGEDGY RL+R++ +   G+CGIA+  S+P+
Sbjct: 351 PSWGEDGYIRLERNLANSRSGKCGIAIEPSYPL 383


>gi|225458143|ref|XP_002280937.1| PREDICTED: cysteine proteinase RD21a [Vitis vinifera]
 gi|302142569|emb|CBI19772.3| unnamed protein product [Vitis vinifera]
          Length = 436

 Score =  269 bits (688), Expect = 1e-69,   Method: Compositional matrix adjust.
 Identities = 150/341 (43%), Positives = 202/341 (59%), Gaps = 24/341 (7%)

Query: 1   MAKYFLIVVLIISGSCASQATYRTFDEGS-IAEKFEQWKAQYGRTYKESAENSKRFEIFK 59
           M  +   V ++I       A + +  E S  A+ FE W  QYG+TY    E + R ++F+
Sbjct: 1   MGSWLWAVSILI------LAVHSSVSEASSTADLFEAWCEQYGKTYSSEEEKASRLKVFE 54

Query: 60  DNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYK 119
           +N   V + N+ A  N SYTL LN FADLT  EF AS+ GF     + S+++ GTP   +
Sbjct: 55  ENHAFVTQHNSMA--NASYTLALNAFADLTHHEFKASRLGFS-PGRAQSIRSVGTPV--Q 109

Query: 120 SSQVPPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCA 172
              VPP+V+W + GAVT VK QG C          A+EGIN I    LVSLSEQ+LVDC 
Sbjct: 110 ELHVPPAVDWRKSGAVTGVKDQGNCGGCWSFSTTGAIEGINKIVTGSLVSLSEQELVDC- 168

Query: 173 TNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDV 232
               N+GC GG MD A++++I+N+GI ++A Y Y GM    C+  K + H   I  Y D+
Sbjct: 169 DRSYNSGCEGGLMDYAYQFVIKNQGIDSEADYPYVGMDKP-CNKEKLKKHIVTIDGYTDI 227

Query: 233 PPNDEESLLKAVANQPVSVAIDAS--ALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGI 290
           PPNDE+ LL+ VA QPVSV I  S    Q YS GV+ G C + L+H V  VGYGT E+G+
Sbjct: 228 PPNDEKQLLQVVAKQPVSVGICGSEKTFQLYSKGVYTGPCSSTLDHAVLIVGYGT-EDGV 286

Query: 291 KYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFP 331
            +W++KNSWG+ WG  GY  + R+    +G CGI M AS+P
Sbjct: 287 DFWIVKNSWGEHWGMRGYIHMLRNNGTAEGICGINMLASYP 327


>gi|400180422|gb|AFP73349.1| cysteine protease [Solanum chmielewskii]
          Length = 344

 Score =  269 bits (688), Expect = 1e-69,   Method: Compositional matrix adjust.
 Identities = 149/339 (43%), Positives = 207/339 (61%), Gaps = 19/339 (5%)

Query: 5   FLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVA 64
            LI +  +     +Q   R+  + S++E+ E W +++GR YK+  E  +RF IFK+N+  
Sbjct: 10  ILITLFFVISIFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFMIFKENMKF 69

Query: 65  VERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSD-HSSSLKANGTPFL---YKS 120
           +E  N A  GN SY L +N+FAD+T QEF+A  TG  + + + S    + T F       
Sbjct: 70  IESVNKA--GNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEFKTNDLSD 127

Query: 121 SQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCAT 173
             +P +++W E GAVT VK+QGQC       AV ++EG   I    L+  SEQ+L+DC T
Sbjct: 128 DDMPSNLDWRESGAVTQVKHQGQCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCTT 187

Query: 174 NDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVP 233
           N  N GC GGFM +AF +II+N GI+ ++ Y Y G     C S + +  A QI++Y+ V 
Sbjct: 188 N--NYGCNGGFMTNAFDFIIENGGISRESDYEYLGQQY-TCRS-QEKTAAVQISSYQ-VV 242

Query: 234 PNDEESLLKAVANQPVSVAIDASA-LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKY 292
           P  E SLL+AV  QPVS+ I AS  LQFYSGG ++G C   +NH VTA+GYGT EEG KY
Sbjct: 243 PEGETSLLQAVTKQPVSIGIAASQDLQFYSGGTYDGSCADRINHAVTAIGYGTDEEGQKY 302

Query: 293 WLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFP 331
           WL+KNSWG  WGE+G+ ++ RD   P G C IA  +S+P
Sbjct: 303 WLLKNSWGTSWGENGFMKIIRDSGDPSGLCDIAKMSSYP 341


>gi|168063167|ref|XP_001783545.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162664932|gb|EDQ51634.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 461

 Score =  269 bits (687), Expect = 1e-69,   Method: Compositional matrix adjust.
 Identities = 154/319 (48%), Positives = 197/319 (61%), Gaps = 23/319 (7%)

Query: 27  EGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFA 86
           E  + E+F  W  ++G+ Y ++ +   RF ++KDNL  +         NR+Y+L L KFA
Sbjct: 47  ENLLLEQFAAWAHKHGKAYHDAEQCLHRFAVWKDNLAYIRHSET----NRTYSLGLTKFA 102

Query: 87  DLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQC-- 144
           DLT +EF    TG ++ D S   K   T F Y  S+ P SV+W + GAVT VK QG C  
Sbjct: 103 DLTNEEFRRMYTGTRI-DRSRRAKRR-TGFRYADSEAPESVDWRKNGAVTSVKDQGSCGS 160

Query: 145 -----AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGIT 199
                AV +VEGINAI+    VSLSEQ+LVDC   + N GC GG MD AF +IIQN GI 
Sbjct: 161 CWAFSAVGSVEGINAIRNGEAVSLSEQELVDCDL-EYNQGCNGGLMDYAFDFIIQNGGID 219

Query: 200 NDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA-- 257
            +  Y Y+G   G CD+ K   H   I  YEDVP NDEE+L KAVA QPVSVAI+A    
Sbjct: 220 TEKDYPYKGFD-GRCDNSKKNAHVVTIDGYEDVPENDEEALKKAVAGQPVSVAIEAGGRD 278

Query: 258 LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDI-- 315
            Q Y+ GVF+G C T L+HGV AVGYGT E+G+ YW++KNSWG+ WGE GY R++R++  
Sbjct: 279 FQLYAQGVFSGECGTDLDHGVLAVGYGT-EDGVDYWIVKNSWGEYWGESGYLRMKRNMKD 337

Query: 316 --DQPQGQCGIAMFASFPV 332
             D P G CGI +  S+ V
Sbjct: 338 SNDGP-GLCGINIEPSYAV 355


>gi|449522968|ref|XP_004168497.1| PREDICTED: xylem cysteine proteinase 1-like [Cucumis sativus]
          Length = 348

 Score =  269 bits (687), Expect = 2e-69,   Method: Compositional matrix adjust.
 Identities = 149/312 (47%), Positives = 197/312 (63%), Gaps = 18/312 (5%)

Query: 30  IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLT 89
           + E FE+W + +G+ Y+   E   RFE+FKDNL  ++  N       SY L +N+FADLT
Sbjct: 41  LIELFEEWISNHGKIYETIEEKWHRFEVFKDNLKHIDETNKKVT---SYWLGVNEFADLT 97

Query: 90  PQEFIASQTGFKMSDHSSSLKANGTPFLYKSS-QVPPSVNWIEKGAVTPVKYQGQCA--- 145
            QEF     G K+   SS  + +   F YK    +P SV+W +KGAVT VK QG C    
Sbjct: 98  HQEFKNMYLGLKV--ESSRTRQSPEEFTYKDVVDLPKSVDWRKKGAVTRVKNQGSCGSCW 155

Query: 146 ----VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITND 201
               VAAVEGIN I    L SLSEQ+L+DC     NNGC+GG MD AF +I+ + G+  +
Sbjct: 156 AFSTVAAVEGINKIVGGNLTSLSEQELIDC-DRPYNNGCHGGLMDYAFSFIVSSGGLHKE 214

Query: 202 AVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA--LQ 259
             Y Y  + +  CD+ K E     I+ Y+DVP N+E SL+KA+A+QP+SVAI+AS    Q
Sbjct: 215 EDYPYLEVES-TCDNKKGELEVVTISGYKDVPENNEASLIKALAHQPLSVAIEASGRDFQ 273

Query: 260 FYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQ 319
           FYSGGVF+G C T L+HGVTAVGYG+S+ G+ Y ++KNSWG  WGE GY R++R+  +P 
Sbjct: 274 FYSGGVFDGPCGTQLDHGVTAVGYGSSK-GVDYIIVKNSWGPKWGEKGYIRMKRNTGKPA 332

Query: 320 GQCGIAMFASFP 331
           G CGI   AS+P
Sbjct: 333 GLCGINKMASYP 344


>gi|115461667|ref|NP_001054433.1| Os05g0108600 [Oryza sativa Japonica Group]
 gi|14719319|gb|AAK73137.1|AC079022_10 putative cysteine proteinase [Oryza sativa]
 gi|33151125|gb|AAP97431.1| cysteine protease CP1 [Oryza sativa]
 gi|52353572|gb|AAU44138.1| cysteine proteinase CP1 [Oryza sativa Japonica Group]
 gi|113577984|dbj|BAF16347.1| Os05g0108600 [Oryza sativa Japonica Group]
 gi|125550541|gb|EAY96250.1| hypothetical protein OsI_18148 [Oryza sativa Indica Group]
          Length = 358

 Score =  269 bits (687), Expect = 2e-69,   Method: Compositional matrix adjust.
 Identities = 154/316 (48%), Positives = 193/316 (61%), Gaps = 20/316 (6%)

Query: 30  IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLT 89
           + E FE+W A+Y + Y    E  +RFE+FKDNL  ++  N       SY L LN+FADLT
Sbjct: 47  LIELFEKWVAKYRKAYASFEEKVRRFEVFKDNLNHIDDINKKVT---SYWLGLNEFADLT 103

Query: 90  PQEFIASQTGFKMS-DHSSSLKANGTPFLY---KSSQVPPSVNWIEKGAVTPVKYQGQCA 145
             EF A+  G       S+S   +   F Y    + +VP  ++W +K AVT VK QGQC 
Sbjct: 104 HDEFKATYLGLTPPPTRSNSKHYSSEEFRYGKMSNGEVPKEMDWRKKNAVTEVKNQGQCG 163

Query: 146 -------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGI 198
                  VAAVEGINAI    L SLSEQ+L+DC+T D NNGC GG MD AF YI    G+
Sbjct: 164 SCWAFSTVAAVEGINAIVTGNLTSLSEQELIDCST-DGNNGCNGGLMDYAFSYIASTGGL 222

Query: 199 TNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA- 257
             +  Y Y  M  G CD  K       I+ YEDVP NDE++L+KA+A+QPVSVAI+AS  
Sbjct: 223 RTEEAYPY-AMEEGDCDEGKGA-AVVTISGYEDVPANDEQALVKALAHQPVSVAIEASGR 280

Query: 258 -LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDID 316
             QFYSGGVF+G C   L+HGVTAVGYGTS+ G  Y ++KNSWG  WGE GY R++R   
Sbjct: 281 HFQFYSGGVFDGPCGEQLDHGVTAVGYGTSK-GQDYIIVKNSWGPHWGEKGYIRMKRGTG 339

Query: 317 QPQGQCGIAMFASFPV 332
           + +G CGI   AS+P 
Sbjct: 340 KGEGLCGINKMASYPT 355


>gi|413951606|gb|AFW84255.1| hypothetical protein ZEAMMB73_933931 [Zea mays]
          Length = 379

 Score =  269 bits (687), Expect = 2e-69,   Method: Compositional matrix adjust.
 Identities = 157/365 (43%), Positives = 215/365 (58%), Gaps = 33/365 (9%)

Query: 1   MAKYFLIVVLIISGSCASQATYRT-FDEGSIA------EKFEQWKAQYGRTYKESAENSK 53
           ++K  L+V L+   S A +      FDE  +A      + +E+W+  + R ++   E  +
Sbjct: 4   VSKTLLLVALVFVSSAAVELCRAIDFDERDLASDEALWDLYERWQTHH-RVHRHHGEKGR 62

Query: 54  RFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSD----HSSSL 109
           RF  FK+N+  +   N    G+R Y LRLN+F D+  +EF ++    +++D     S + 
Sbjct: 63  RFGTFKENVRFIHAHNKR--GDRPYRLRLNRFGDMGREEFRSTFADSRINDLRRQDSPAA 120

Query: 110 KANGTP-FLYKSSQVPP-SVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRL 160
           +A   P F+Y S+  PP SV+W ++GAVT VK QG C        V AVEGINAI+   L
Sbjct: 121 RAGAVPGFMYDSAADPPRSVDWRQEGAVTGVKDQGHCGSCWAFSTVVAVEGINAIRTGSL 180

Query: 161 VSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAE 220
            SLSEQ+L+DC T++N  GC GG M++AF++I    GIT +A Y Y   S G CD  +A 
Sbjct: 181 ASLSEQELIDCDTDEN--GCQGGLMENAFEFIKSFGGITTEAAYPYR-ASNGTCDGDRAR 237

Query: 221 ---DHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDAS--ALQFYSGGVFNGYCETFLN 275
                   I  ++ VP   E++L KAVA+QPVSVA+DA   A QFYS GVF G C T L+
Sbjct: 238 RGGGVVVVIDGHQMVPAGSEDALAKAVAHQPVSVAVDAGGQAFQFYSEGVFTGDCGTDLD 297

Query: 276 HGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVSKE 335
           HGV AVGYG  ++G  YW++KNSWG  WGE GY R+QR      G CGIAM ASFP+ K 
Sbjct: 298 HGVAAVGYGVGDDGTPYWIVKNSWGTSWGEGGYIRMQRGAGN-GGLCGIAMEASFPI-KT 355

Query: 336 SAQPS 340
           S  P+
Sbjct: 356 SPNPA 360


>gi|413942348|gb|AFW74997.1| Xylem cysteine proteinase 2 [Zea mays]
          Length = 391

 Score =  269 bits (687), Expect = 2e-69,   Method: Compositional matrix adjust.
 Identities = 153/314 (48%), Positives = 195/314 (62%), Gaps = 18/314 (5%)

Query: 30  IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLT 89
           +   FE+W A+Y + Y    E  +RFE+FKDNL  ++  N   +   SY L LN FADLT
Sbjct: 82  LVRLFEEWVAKYRKAYGSFEEKLRRFEVFKDNLHHIDEANRKEV--TSYWLGLNAFADLT 139

Query: 90  PQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQCA---- 145
             EF A+  G  +   +S  +           +VP SV+W +KGAVT VK QGQC     
Sbjct: 140 HDEFKATYLGL-LPKRTSGGRFRYGGVGDGGDEVPASVDWRKKGAVTEVKNQGQCGSCWA 198

Query: 146 ---VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDA 202
              VAAVEGIN I    L SLSEQQLVDC+T D NNGC GG MD+AF +I    G+ ++ 
Sbjct: 199 FSTVAAVEGINQIVTGNLTSLSEQQLVDCST-DGNNGCSGGVMDNAFSFIATGAGLRSEE 257

Query: 203 VYSYEGMSTGICDSIKAEDHAAQIT--NYEDVPPNDEESLLKAVANQPVSVAIDASA--L 258
            Y Y  M  G CD  +A D    +T   YEDVP NDE++L+KA+A+QPVSVAI+AS    
Sbjct: 258 AYPYL-MEEGDCDD-RARDGEVLVTISGYEDVPANDEQALVKALAHQPVSVAIEASGRHF 315

Query: 259 QFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQP 318
           QFYSGGVF+G C + L+HGV AVGYG+S+ G  Y ++KNSWG  WGE GY R++R   +P
Sbjct: 316 QFYSGGVFDGPCGSELDHGVAAVGYGSSK-GQDYIIVKNSWGTHWGEKGYIRMKRGTGKP 374

Query: 319 QGQCGIAMFASFPV 332
           +G CGI   AS+P 
Sbjct: 375 EGLCGINKMASYPT 388


>gi|413951605|gb|AFW84254.1| hypothetical protein ZEAMMB73_933931 [Zea mays]
          Length = 423

 Score =  269 bits (687), Expect = 2e-69,   Method: Compositional matrix adjust.
 Identities = 157/365 (43%), Positives = 215/365 (58%), Gaps = 33/365 (9%)

Query: 1   MAKYFLIVVLIISGSCASQATYRT-FDEGSIA------EKFEQWKAQYGRTYKESAENSK 53
           ++K  L+V L+   S A +      FDE  +A      + +E+W+  + R ++   E  +
Sbjct: 48  VSKTLLLVALVFVSSAAVELCRAIDFDERDLASDEALWDLYERWQTHH-RVHRHHGEKGR 106

Query: 54  RFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSD----HSSSL 109
           RF  FK+N+  +   N    G+R Y LRLN+F D+  +EF ++    +++D     S + 
Sbjct: 107 RFGTFKENVRFIHAHNKR--GDRPYRLRLNRFGDMGREEFRSTFADSRINDLRRQDSPAA 164

Query: 110 KANGTP-FLYKSSQVPP-SVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRL 160
           +A   P F+Y S+  PP SV+W ++GAVT VK QG C        V AVEGINAI+   L
Sbjct: 165 RAGAVPGFMYDSAADPPRSVDWRQEGAVTGVKDQGHCGSCWAFSTVVAVEGINAIRTGSL 224

Query: 161 VSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAE 220
            SLSEQ+L+DC T++N  GC GG M++AF++I    GIT +A Y Y   S G CD  +A 
Sbjct: 225 ASLSEQELIDCDTDEN--GCQGGLMENAFEFIKSFGGITTEAAYPYRA-SNGTCDGDRAR 281

Query: 221 ---DHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDAS--ALQFYSGGVFNGYCETFLN 275
                   I  ++ VP   E++L KAVA+QPVSVA+DA   A QFYS GVF G C T L+
Sbjct: 282 RGGGVVVVIDGHQMVPAGSEDALAKAVAHQPVSVAVDAGGQAFQFYSEGVFTGDCGTDLD 341

Query: 276 HGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVSKE 335
           HGV AVGYG  ++G  YW++KNSWG  WGE GY R+QR      G CGIAM ASFP+ K 
Sbjct: 342 HGVAAVGYGVGDDGTPYWIVKNSWGTSWGEGGYIRMQRGAGN-GGLCGIAMEASFPI-KT 399

Query: 336 SAQPS 340
           S  P+
Sbjct: 400 SPNPA 404


>gi|3688528|emb|CAA06243.1| pre-pro-TPE4A protein [Pisum sativum]
          Length = 360

 Score =  269 bits (687), Expect = 2e-69,   Method: Compositional matrix adjust.
 Identities = 159/330 (48%), Positives = 208/330 (63%), Gaps = 23/330 (6%)

Query: 27  EGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFA 86
           E S+ + +E+W++ +  T +   E   RF +FK N++ V   N     ++ Y L+LNKFA
Sbjct: 33  EKSLWDLYERWRSHHTVT-RSLDEKHNRFNVFKANVMHVHNTNKL---DKPYKLKLNKFA 88

Query: 87  DLTPQEFIASQTGFKMSDHS---SSLKANGTPFLYKSSQ-VPPSVNWIEKGAVTPVKYQG 142
           D+T  EF       K+S H         NGT F+Y++ + VP S++W +KGAVT VK QG
Sbjct: 89  DMTNYEFRRIYADSKVSHHRMFRGMSNENGT-FMYENVKNVPSSIDWRKKGAVTDVKDQG 147

Query: 143 QCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQN 195
           QC        + AVEGIN IK  +LVSLSEQ+LVDC T   N GC GG M+ AF++I QN
Sbjct: 148 QCGSCWAFSTIVAVEGINQIKTQKLVSLSEQELVDCDTG-GNEGCNGGLMEYAFEFIKQN 206

Query: 196 KGITNDAVYSYEGMSTGICDSIKAEDHA-AQITNYEDVPPNDEESLLKAVANQPVSVAID 254
            GIT ++ Y Y     G CD +K ED A   I  YE+VP N+E +LLKA A QPVSVAID
Sbjct: 207 -GITTESNYPY-AAKDGTCD-LKKEDKAEVSIDGYENVPINNEAALLKAAAKQPVSVAID 263

Query: 255 ASA--LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQ 312
           A     QFYS GVF+G+C T LNHGV  VGYG +++  KYW++KNSWG +WGE GY R+Q
Sbjct: 264 AGGYNFQFYSEGVFSGHCGTDLNHGVAVVGYGVTQDRTKYWIVKNSWGSEWGEQGYIRMQ 323

Query: 313 RDIDQPQGQCGIAMFASFPVSKESAQPSSA 342
           R I   +G CGIAM AS+P+ K S  P+ +
Sbjct: 324 RGISHKEGLCGIAMEASYPIKKSSTNPTES 353


>gi|50355619|dbj|BAD29958.1| cysteine protease [Daucus carota]
          Length = 496

 Score =  268 bits (686), Expect = 2e-69,   Method: Compositional matrix adjust.
 Identities = 147/321 (45%), Positives = 191/321 (59%), Gaps = 17/321 (5%)

Query: 22  YRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLR 81
           ++T DE +    FE W   +G++Y    E  KRF+IFK+NL  ++  N   + +R + L 
Sbjct: 35  FKTDDEATTL--FESWLVTHGKSYNALGEEEKRFQIFKNNLRYIDEQN--LVEDRGFKLG 90

Query: 82  LNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQ-VPPSVNWIEKGAVTPVKY 140
           LNKFADLT +E+ +  TG K  D    + A    +   S + +P SV+W E GAV  VK 
Sbjct: 91  LNKFADLTNEEYRSKYTGIKSKDLRKKVSAKSGRYATLSGESLPESVDWRESGAVATVKD 150

Query: 141 QGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYII 193
           QG C        ++AVEGIN I   +L++LSEQ+LVDC     N GC GG MD AF++II
Sbjct: 151 QGSCGSCWAFSTISAVEGINQIATGKLITLSEQELVDC-DRSYNEGCNGGLMDYAFEFII 209

Query: 194 QNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAI 253
            N GI  D  Y Y G   G CD  +       I +YEDVP  DE +L KA ANQP+SVAI
Sbjct: 210 NNGGIDTDVDYPYTGRD-GKCDQYRKNAKVVTIDSYEDVPAYDELALKKAAANQPISVAI 268

Query: 254 DASA--LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRL 311
           +AS    QFY  G+F G C   L+HGV  VGYGT E G  YW+++NSWG DWGE+GY R+
Sbjct: 269 EASGRDFQFYDSGIFTGKCGIALDHGVVVVGYGT-ENGKDYWIVRNSWGADWGENGYLRM 327

Query: 312 QRDIDQPQGQCGIAMFASFPV 332
           +R I    G CGIA+  S+PV
Sbjct: 328 ERGISSKTGICGIAIEPSYPV 348


>gi|400180419|gb|AFP73348.1| cysteine protease [Solanum lycopersicoides]
          Length = 343

 Score =  268 bits (686), Expect = 2e-69,   Method: Compositional matrix adjust.
 Identities = 146/338 (43%), Positives = 206/338 (60%), Gaps = 18/338 (5%)

Query: 5   FLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVA 64
            LI +  +     SQ T R+  + S++E+ E W +++GR YK+  E  +RF IFK+N+  
Sbjct: 10  ILITLFFVISMFNSQTTARSQPKLSVSERHELWMSRHGRVYKDEVEKGERFMIFKENMKF 69

Query: 65  VERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFL---YKSS 121
           +E  N A  GN SY L +N+FAD+T +EF+   TG  +  + S    + T F        
Sbjct: 70  IESVNKA--GNLSYKLGINEFADITSEEFLTKFTGINIPSYLSPSPMSSTEFKINDLSDD 127

Query: 122 QVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATN 174
            +P +++W E GAVT VK QGQC       AV ++EG   I    L+  SEQ+L+DC TN
Sbjct: 128 DMPSNLDWRESGAVTQVKNQGQCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCTTN 187

Query: 175 DNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPP 234
             N GC GGFM +AF +I +N GI++++ Y Y+G     C S + +  A QI++Y+ V P
Sbjct: 188 --NYGCNGGFMTNAFDFIKENGGISSESDYEYQGQQY-TCRS-QEKTAAVQISSYQ-VVP 242

Query: 235 NDEESLLKAVANQPVSVAIDASA-LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYW 293
             E SLL+AV  QPVS+ I AS  LQFY+GG ++G C   +NH VTA+GYGT E+G KYW
Sbjct: 243 EGETSLLQAVTKQPVSIGIAASQDLQFYAGGTYDGSCADRINHAVTAIGYGTDEKGQKYW 302

Query: 294 LIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFP 331
           L+KNSWG  WGE+G+ ++ RD   P G C IA  +S+P
Sbjct: 303 LLKNSWGTSWGENGFMKIIRDSGNPGGHCDIAKMSSYP 340


>gi|162459393|ref|NP_001105993.1| cysteine protease component of protease-inhibitor complex precursor
           [Zea mays]
 gi|6682829|dbj|BAA88898.1| cysteine protease component of protease-inhibitor complex [Zea
           mays]
          Length = 465

 Score =  268 bits (686), Expect = 2e-69,   Method: Compositional matrix adjust.
 Identities = 149/311 (47%), Positives = 193/311 (62%), Gaps = 14/311 (4%)

Query: 34  FEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAA-IGNRSYTLRLNKFADLTPQE 92
           + +W A +GRTY    E  +R+++F+DNL  ++  N AA  G  S+ L LN+FADLT  E
Sbjct: 44  YAEWMAAHGRTYNAVGEEERRYQVFRDNLRYIDAHNAAADAGVHSFRLGLNRFADLTNDE 103

Query: 93  FIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQ---C----A 145
           + A+  G +        K         +  +P SV+W  KGAV  VK QG    C     
Sbjct: 104 YRATYLGARTRPQRER-KLGARYHAADNEDLPESVDWRAKGAVAEVKDQGSYGSCWAFST 162

Query: 146 VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYS 205
           +AAVEGIN I    L+SLSEQ+LVDC T+  N GC GG MD AF++II N GI  +  Y 
Sbjct: 163 IAAVEGINQIVTGDLISLSEQELVDCDTS-YNQGCNGGLMDYAFEFIINNGGIDTEKDYP 221

Query: 206 YEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASALQF--YSG 263
           Y+G + G CD  +       I +YEDVP NDE+SL KAVANQPVSVAI+A+  QF  YS 
Sbjct: 222 YKG-TDGRCDVNRKNAKVVTIDSYEDVPANDEKSLQKAVANQPVSVAIEAAGTQFQLYSS 280

Query: 264 GVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCG 323
           G+F G C T L+HGVTAVGYGT E G  YW++KNSWG  WGE GY R++R+I    G+CG
Sbjct: 281 GIFTGSCGTALDHGVTAVGYGT-ENGKDYWIVKNSWGSSWGESGYVRMERNIKASSGKCG 339

Query: 324 IAMFASFPVSK 334
           IA+  S+P+ +
Sbjct: 340 IAVEPSYPLKE 350


>gi|350535639|ref|NP_001233949.1| phytophthora-inhibited protease 1 [Solanum lycopersicum]
 gi|108937128|gb|ABG23376.1| phytophthora-inhibited protease 1 [Solanum lycopersicum]
          Length = 345

 Score =  268 bits (685), Expect = 3e-69,   Method: Compositional matrix adjust.
 Identities = 151/327 (46%), Positives = 196/327 (59%), Gaps = 21/327 (6%)

Query: 20  ATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYT 79
            T R   E S+ E+ E W   +GR YK+  E   RF+ FK+N+  +E FN    G + Y 
Sbjct: 27  VTSRNLKELSMLERHENWMVHHGRVYKDDIEKEHRFKTFKENVEFIESFNKN--GTQRYK 84

Query: 80  LRLNKFADLTPQEFIASQTGFKMSDHSS-SLKANGTPFLYKS-SQVPPSVNWIEKGAVTP 137
           L +NK+ADLT +EF  S  G   S  S     A  T F Y S ++VP S++W ++G+VT 
Sbjct: 85  LAVNKYADLTTEEFTTSFMGLDTSLLSQQESTATTTSFKYDSVTEVPNSMDWRKRGSVTG 144

Query: 138 VKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFK 190
           VK QG C       A AA+EG   I  N L+SLSEQQL+DC+T   N GC GG M  A+ 
Sbjct: 145 VKDQGVCGCCWAFSAAAAIEGAYQIANNELISLSEQQLLDCSTQ--NKGCEGGLMTVAYD 202

Query: 191 YIIQNKG--ITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQP 248
           +++QN G  IT +  Y YE  +  +C   K E  AA   N  +V P+DE SLLKAV NQP
Sbjct: 203 FLLQNNGGGITTETNYPYE-EAQNVC---KTEQPAAVTINGYEVVPSDESSLLKAVVNQP 258

Query: 249 VSVAIDAS-ALQFYSGGVFNGYCETFLNHGVTAVGYGTSEE-GIKYWLIKNSWGQDWGED 306
           +SV I A+     Y  G+++G C + LNH VT +GYGTSEE G KYW++KNSWG DWGE+
Sbjct: 259 ISVGIAANDEFHMYGSGIYDGSCNSRLNHAVTVIGYGTSEEDGTKYWIVKNSWGSDWGEE 318

Query: 307 GYFRLQRDIDQPQGQCGIAMFASFPVS 333
           GY R+ RD+    G CGIA  ASFP +
Sbjct: 319 GYMRIARDVGVDGGHCGIAKVASFPTA 345


>gi|449455625|ref|XP_004145553.1| PREDICTED: xylem cysteine proteinase 1-like [Cucumis sativus]
          Length = 351

 Score =  268 bits (685), Expect = 3e-69,   Method: Compositional matrix adjust.
 Identities = 149/313 (47%), Positives = 197/313 (62%), Gaps = 18/313 (5%)

Query: 30  IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLT 89
           + E FE+W + +G+ Y+   E   RFE+FKDNL  ++  N       SY L +N+FADLT
Sbjct: 44  LIELFEEWISNHGKIYETIEEKWHRFEVFKDNLKHIDETNKKVT---SYWLGVNEFADLT 100

Query: 90  PQEFIASQTGFKMSDHSSSLKANGTPFLYKSS-QVPPSVNWIEKGAVTPVKYQGQCA--- 145
            QEF     G K+   SS  + +   F YK    +P SV+W +KGAVT VK QG C    
Sbjct: 101 HQEFKNMYLGLKV--ESSRTRQSPEEFTYKDVVDLPKSVDWRKKGAVTRVKNQGSCGSCW 158

Query: 146 ----VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITND 201
               VAAVEGIN I    L SLSEQ+L+DC     NNGC+GG MD AF +I+ + G+  +
Sbjct: 159 AFSTVAAVEGINKIVGGNLTSLSEQELIDC-DRPYNNGCHGGLMDYAFSFIVSSGGLHKE 217

Query: 202 AVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA--LQ 259
             Y Y  + +  CD+ K E     I+ Y+DVP N+E SL+KA+A+QP+SVAI+AS    Q
Sbjct: 218 EDYPYLEVES-TCDNKKGELEVVTISGYKDVPENNEASLIKALAHQPLSVAIEASGRDFQ 276

Query: 260 FYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQ 319
           FYSGGVF+G C T L+HGVTAVGYG+S+ G+ Y ++KNSWG  WGE GY R++R+  +P 
Sbjct: 277 FYSGGVFDGPCGTQLDHGVTAVGYGSSK-GVDYIIVKNSWGPKWGEKGYIRMKRNTGKPA 335

Query: 320 GQCGIAMFASFPV 332
           G CGI   AS+P 
Sbjct: 336 GLCGINKMASYPT 348


>gi|111073717|dbj|BAF02547.1| triticain beta [Triticum aestivum]
          Length = 472

 Score =  268 bits (685), Expect = 3e-69,   Method: Compositional matrix adjust.
 Identities = 146/316 (46%), Positives = 197/316 (62%), Gaps = 18/316 (5%)

Query: 34  FEQWKAQYGRTYKESA----ENSKRFEIFKDNLVAVERFN-NAAIGNRSYTLRLNKFADL 88
           ++ W A+ G     +A    E  +RF  F DNL  V+  N  AA G   Y L +N+FADL
Sbjct: 53  YDLWLAENGGGSSPNANSIPERERRFRAFWDNLNFVDAHNARAAAGEEGYRLGMNRFADL 112

Query: 89  TPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQ-VPPSVNWIEKGAVTPVKYQGQC--- 144
           T  EF A+  G K +  +   +  G  + +  ++ +P +V+W EKGAV PVK QGQC   
Sbjct: 113 TNDEFRAAYLGVK-AQRARPGRMVGERYRHDGAEELPEAVDWREKGAVAPVKNQGQCGSC 171

Query: 145 ----AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITN 200
               AV+ VE IN I    +V+LSEQ+LV+C TN  ++GC GG MDDAF++II+N GI  
Sbjct: 172 WAFSAVSTVESINQIVTGEMVTLSEQELVECDTNGQSSGCNGGLMDDAFEFIIKNGGIDT 231

Query: 201 DAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA--L 258
           +  Y Y+ +  G CD ++       I  +EDVP NDE+SL KAVA+QPVSVAI+A     
Sbjct: 232 EDDYPYKAID-GRCDVLRKNAKVVSIDGFEDVPENDEKSLQKAVAHQPVSVAIEAGGREF 290

Query: 259 QFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQP 318
           Q Y  GVF+G C T L+HGV AVGYGT E G  YW+++NSWG +WGE GY R++R+I+  
Sbjct: 291 QLYHSGVFSGRCGTQLDHGVVAVGYGT-ENGKDYWIVRNSWGPNWGESGYLRMERNINVT 349

Query: 319 QGQCGIAMFASFPVSK 334
            G+CGIAM +S+P  K
Sbjct: 350 SGKCGIAMMSSYPTKK 365


>gi|400180377|gb|AFP73327.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score =  268 bits (685), Expect = 3e-69,   Method: Compositional matrix adjust.
 Identities = 147/339 (43%), Positives = 208/339 (61%), Gaps = 19/339 (5%)

Query: 5   FLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVA 64
            LI +  +     +Q   R+  E S++E+ E W +++GR YK+  E  +RF IFK+N+  
Sbjct: 10  ILITLFFVISMFNTQTRGRSQPELSVSERHELWMSRHGRVYKDEVEKGERFMIFKENMKF 69

Query: 65  VERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSD-HSSSLKANGTPFL---YKS 120
           +E  N A  GN SY L +N+FAD+T QEF+A  TG  + + + S    + T F       
Sbjct: 70  IESVNKA--GNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEFKINDLSD 127

Query: 121 SQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCAT 173
             +P +++W E GAVT VK+QG+C       AV ++EG   I    L+  SEQ+L+DC T
Sbjct: 128 DDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCTT 187

Query: 174 NDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVP 233
           N  N GC GGFM +AF +II+N GI+ ++ Y Y+G     C S + +  A QI++Y+ V 
Sbjct: 188 N--NYGCNGGFMTNAFDFIIENGGISRESDYEYQGEQY-TCRS-QEKTAAVQISSYQ-VV 242

Query: 234 PNDEESLLKAVANQPVSVAIDASA-LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKY 292
           P  E SLL+AV  QPVS+ I AS  LQFY+GG ++G C   +NH VTA+GYGT E+G KY
Sbjct: 243 PEGETSLLQAVTKQPVSIGIAASQDLQFYAGGTYDGSCADRINHAVTAIGYGTDEKGQKY 302

Query: 293 WLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFP 331
           WL+KNSWG  WGE+G+ ++ RD   P G C IA  +S+P
Sbjct: 303 WLLKNSWGTSWGENGFMKIIRDSGNPSGLCDIAKMSSYP 341


>gi|13897890|gb|AAK48495.1|AF259983_1 putative cysteine protease [Ipomoea batatas]
          Length = 462

 Score =  268 bits (684), Expect = 4e-69,   Method: Compositional matrix adjust.
 Identities = 151/319 (47%), Positives = 192/319 (60%), Gaps = 33/319 (10%)

Query: 34  FEQWKAQYGRTYKE-SAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQE 92
           +E W  ++G++Y     E  KRFEIFKDNL  ++  N+   G+RSY L LN+FADLT +E
Sbjct: 49  YESWLVEHGKSYNGLGGEKDKRFEIFKDNLRYIDEQNSR--GDRSYKLGLNRFADLTNEE 106

Query: 93  FIASQTGFKM----------SDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQG 142
           + ++  G K           SD   + KA G+        +P S++W EKGAV  VK QG
Sbjct: 107 YRSTYLGAKTDARRRIAKTKSDRRYAPKAGGS--------LPDSIDWREKGAVAEVKDQG 158

Query: 143 QCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQN 195
            C        +AAVEGIN I    L+SLSEQ+LVDC T+  N GC GG MD AF++II+N
Sbjct: 159 SCGSCWAFSTIAAVEGINQIVTGELISLSEQELVDCDTS-YNEGCNGGLMDYAFEFIIKN 217

Query: 196 KGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDA 255
            GI  +A Y Y G   G CD  +       I  YEDV P DE +L +AVA QPVSVAI+A
Sbjct: 218 GGIDTEADYPYTG-RYGRCDQTRKNAKVVSIDGYEDVTPYDEAALKEAVAGQPVSVAIEA 276

Query: 256 SA--LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQR 313
                Q YS G+F G C T L+HGVTAVGYGT E G+ YW++KNSW   WGE GY R+QR
Sbjct: 277 GGRDFQLYSSGIFTGSCGTDLDHGVTAVGYGT-ENGVDYWIVKNSWAASWGEKGYLRMQR 335

Query: 314 DIDQPQGQCGIAMFASFPV 332
           ++    G CGIA+  S+P 
Sbjct: 336 NVKDKNGLCGIAIEPSYPT 354


>gi|359491865|ref|XP_002273243.2| PREDICTED: xylem cysteine proteinase 1-like [Vitis vinifera]
          Length = 351

 Score =  268 bits (684), Expect = 4e-69,   Method: Compositional matrix adjust.
 Identities = 145/313 (46%), Positives = 194/313 (61%), Gaps = 18/313 (5%)

Query: 30  IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLT 89
           + + FE W +++G++Y+   E   RFE+F+DNL  ++  N       SY L LN+FADL+
Sbjct: 44  LTDLFESWMSKHGKSYRSFEEKLHRFEVFQDNLKHIDETNKKV---SSYWLGLNEFADLS 100

Query: 90  PQEFIASQTGFKMSDHSSSLKANGTPFLYKS-SQVPPSVNWIEKGAVTPVKYQGQCA--- 145
            +EF     G K+       + +   F YK  + +P SV+W +KGAV  VK QG C    
Sbjct: 101 HEEFKRKYLGLKI--ELPKRRDSPEEFSYKDVADLPKSVDWRKKGAVAHVKNQGACGSCW 158

Query: 146 ----VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITND 201
               VAAVEGIN I    L +LSEQ+L+DC     NNGC GG MD AF +II N G+  +
Sbjct: 159 AFSTVAAVEGINQIVTGNLTALSEQELIDC-DKPFNNGCNGGLMDYAFAFIISNGGLRKE 217

Query: 202 AVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA--LQ 259
             Y Y  M  G C   K E     I+ Y DVP ++E+S LKA+ANQP+SVAI+AS+   Q
Sbjct: 218 EDYPYV-MEEGTCGEKKEELEVVTISGYHDVPEDNEQSFLKALANQPLSVAIEASSRGFQ 276

Query: 260 FYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQ 319
           FYSGG+FNG+C T L+HGV AVGYGTS+ G+ Y  +KNSWG  WGE GY R++R++ +P+
Sbjct: 277 FYSGGIFNGHCGTELDHGVAAVGYGTSK-GVDYITVKNSWGSKWGEKGYIRMKRNVGKPE 335

Query: 320 GQCGIAMFASFPV 332
           G CGI   AS+P 
Sbjct: 336 GICGIYKMASYPT 348


>gi|226533314|ref|NP_001150119.1| xylem cysteine proteinase 2 [Zea mays]
 gi|195636886|gb|ACG37911.1| xylem cysteine proteinase 2 precursor [Zea mays]
 gi|223946183|gb|ACN27175.1| unknown [Zea mays]
 gi|413951209|gb|AFW83858.1| Xylem cysteine proteinase 2 [Zea mays]
          Length = 385

 Score =  267 bits (683), Expect = 4e-69,   Method: Compositional matrix adjust.
 Identities = 156/335 (46%), Positives = 194/335 (57%), Gaps = 36/335 (10%)

Query: 29  SIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADL 88
           S+AE FE+W +++ R Y    E  +RF++FKDNL  ++  N       SY L LN+FADL
Sbjct: 54  SLAELFERWLSRHRRAYASLEEKLRRFQVFKDNLHHIDETNRKV---SSYWLGLNEFADL 110

Query: 89  TPQEFIASQTGFKMS--------DHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKY 140
           T  EF A+  G + S        D     +          + +P SV+W  KGAVT VK 
Sbjct: 111 THDEFKATYLGLRSSVGDGGSGIDDDDEPEEEEGYEGVDGASLPKSVDWRSKGAVTGVKN 170

Query: 141 QGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYII 193
           QGQC        VAAVEGIN I    L +LSEQ+L+DC T D NNGC GG MD AF YI 
Sbjct: 171 QGQCGSCWAFSTVAAVEGINQIVTGNLTALSEQELIDCDT-DGNNGCNGGLMDYAFSYIA 229

Query: 194 QNKGITNDAVYSYEGMSTGIC------------DSIKAEDHAAQIT--NYEDVPPNDEES 239
            N G+  +  Y Y  M  G C             S  A D AA +T   YEDVP N+E++
Sbjct: 230 HNGGLHTEEAYPYL-MEEGTCQRSSSSEKKWPGSSEDANDDAAVVTISGYEDVPRNNEQA 288

Query: 240 LLKAVANQPVSVAIDASA--LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKN 297
           LLKA+A QPVSVAI+AS    QFYSGGVF+G C T L+HGV AVGYGT+ +G  Y ++KN
Sbjct: 289 LLKALAQQPVSVAIEASGRNFQFYSGGVFDGPCGTQLDHGVAAVGYGTAAKGHDYIIVKN 348

Query: 298 SWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
           SWG  WGE GY R++R   + QG CGI   AS+P 
Sbjct: 349 SWGPSWGEKGYIRMRRGTGKRQGLCGINKMASYPT 383


>gi|357130141|ref|XP_003566711.1| PREDICTED: xylem cysteine proteinase 1-like [Brachypodium
           distachyon]
          Length = 457

 Score =  267 bits (683), Expect = 4e-69,   Method: Compositional matrix adjust.
 Identities = 150/317 (47%), Positives = 202/317 (63%), Gaps = 22/317 (6%)

Query: 30  IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLT 89
           I E FE+W A++ + Y    E   RFE+FKDNL  +++ N       SY L LN+FADLT
Sbjct: 146 IIELFEKWLAKHQKAYASFEEKLHRFEVFKDNLKHIDKVNREVT---SYWLGLNEFADLT 202

Query: 90  PQEFIASQTGFKMSDHSSSLKANGTPFLYK---SSQVPPSVNWIEKGAVTPVKYQGQCA- 145
            +EF A+  G  ++  + + ++ G+ F Y+   +  +P SV+W  KGAVT VK QGQC  
Sbjct: 203 HEEFKATYLG--LAPPAPARESRGS-FKYEDVSADDLPKSVDWRTKGAVTEVKNQGQCGS 259

Query: 146 ------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGIT 199
                 VAAVEGINAI    L +LSEQ+L+DC+  D NNGC GG MD AF YI  + G+ 
Sbjct: 260 CWAFSTVAAVEGINAIVTGNLTALSEQELIDCSV-DGNNGCNGGLMDYAFSYIASSGGLH 318

Query: 200 NDAVYSYEGMSTGIC-DSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA- 257
            +  Y Y  M  G C D  K+E  A  I+ YEDVP ++E++L+KA+A+QPVSVAI+AS  
Sbjct: 319 TEEAYPYL-MEEGSCGDGKKSESEAVTISGYEDVPAHNEQALIKALAHQPVSVAIEASGR 377

Query: 258 -LQFYSGGVFNGYCETFLNHGVTAVGYGTSE-EGIKYWLIKNSWGQDWGEDGYFRLQRDI 315
             QFYSGGVF+G C T L+HGV AVGYG+ + +G  Y +++NSWG  WGE GY R++R  
Sbjct: 378 HFQFYSGGVFDGPCGTQLDHGVAAVGYGSDKGKGHDYIIVRNSWGAKWGEKGYIRMKRGT 437

Query: 316 DQPQGQCGIAMFASFPV 332
            + +G CGI   AS+P 
Sbjct: 438 GKGEGLCGINKMASYPT 454


>gi|26452046|dbj|BAC43113.1| putative cysteine proteinase RD21A precursor [Arabidopsis thaliana]
          Length = 362

 Score =  267 bits (683), Expect = 4e-69,   Method: Compositional matrix adjust.
 Identities = 151/329 (45%), Positives = 199/329 (60%), Gaps = 18/329 (5%)

Query: 26  DEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKF 85
           +E  +   +EQW  +  + Y    E  +RF+IFKDNL  V+  N  ++ +R++ + L +F
Sbjct: 36  NETEVRLMYEQWLVENRKNYNGLGEKERRFKIFKDNLKFVDEHN--SVPDRTFEVGLTRF 93

Query: 86  ADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQV-PPSVNWIEKGAVTPVKYQGQC 144
           ADLT +EF A     KM  +  S+K     +LYK   V P  V+W   GAV  VK QG C
Sbjct: 94  ADLTNEEFRAIYLRKKMERNKDSVKTE--RYLYKEGDVLPDEVDWRANGAVVSVKDQGNC 151

Query: 145 -------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKG 197
                  AV AVEGIN I    L+SLSEQ+LVDC     N GC GG M+ AF++I++N G
Sbjct: 152 GSCWAFSAVGAVEGINQITTGELISLSEQELVDCDRGFVNAGCDGGIMNYAFEFIMKNGG 211

Query: 198 ITNDAVYSYEGMSTGICDSIKAED-HAAQITNYEDVPPNDEESLLKAVANQPVSVAIDAS 256
           I  D  Y Y     G+C++ K  +     I  YEDVP +DE+SL KAVA+QPVSVAI+AS
Sbjct: 212 IETDQDYPYNANDLGLCNADKNNNTRVVTIDGYEDVPRDDEKSLKKAVAHQPVSVAIEAS 271

Query: 257 --ALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRD 314
             A Q Y  GV  G C   L+HGV  VGYG++  G  YW+I+NSWG +WG+ GY +LQR+
Sbjct: 272 SQAFQLYKSGVMTGTCGISLDHGVVVVGYGST-SGEDYWIIRNSWGLNWGDSGYVKLQRN 330

Query: 315 IDQPQGQCGIAMFASFPVSKESAQPSSAD 343
           ID P G+CGIAM  S+P   +S+ PSS D
Sbjct: 331 IDDPFGKCGIAMMPSYPT--KSSFPSSFD 357


>gi|118124|sp|P25250.1|CYSP2_HORVU RecName: Full=Cysteine proteinase EP-B 2; Flags: Precursor
 gi|1146118|gb|AAA85036.1| cysteine proteinase EPB2 precursor [Hordeum vulgare subsp. vulgare]
          Length = 373

 Score =  267 bits (683), Expect = 4e-69,   Method: Compositional matrix adjust.
 Identities = 152/335 (45%), Positives = 200/335 (59%), Gaps = 26/335 (7%)

Query: 27  EGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFA 86
           E ++ + +E+W++ + R  +  AE  +RF  FK N   +   N    G+  Y L LN+F 
Sbjct: 39  EEALWDLYERWQSAH-RVRRHHAEKHRRFGTFKSNAHFIHSHNKR--GDHPYRLHLNRFG 95

Query: 87  DLTPQEFIASQTGFKMSDHSSSLKANGTP-FLYKS---SQVPPSVNWIEKGAVTPVKYQG 142
           D+   EF A+  G    D  S  K    P F+Y +   S +PPSV+W +KGAVT VK QG
Sbjct: 96  DMDQAEFRATFVGDLRRDTPS--KPPSVPGFMYAALNVSDLPPSVDWRQKGAVTGVKDQG 153

Query: 143 QCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQN 195
           +C        V +VEGINAI+   LVSLSEQ+L+DC T DN+ GC GG MD+AF+YI  N
Sbjct: 154 KCGSCWAFSTVVSVEGINAIRTGSLVSLSEQELIDCDTADND-GCQGGLMDNAFEYIKNN 212

Query: 196 KGITNDAVYSYEGMSTGICDSIKAEDHA---AQITNYEDVPPNDEESLLKAVANQPVSVA 252
            G+  +A Y Y   + G C+  +A  ++     I  ++DVP N EE L +AVANQPVSVA
Sbjct: 213 GGLITEAAYPYRA-ARGTCNVARAAQNSPVVVHIDGHQDVPANSEEDLARAVANQPVSVA 271

Query: 253 IDAS--ALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFR 310
           ++AS  A  FYS GVF G C T L+HGV  VGYG +E+G  YW +KNSWG  WGE GY R
Sbjct: 272 VEASGKAFMFYSEGVFTGECGTELDHGVAVVGYGVAEDGKAYWTVKNSWGPSWGEQGYIR 331

Query: 311 LQRDIDQPQGQCGIAMFASFPV---SKESAQPSSA 342
           +++D     G CGIAM AS+PV   SK    P  A
Sbjct: 332 VEKDSGASGGLCGIAMEASYPVKTYSKPKPTPRRA 366


>gi|255538210|ref|XP_002510170.1| cysteine protease, putative [Ricinus communis]
 gi|223550871|gb|EEF52357.1| cysteine protease, putative [Ricinus communis]
          Length = 469

 Score =  267 bits (683), Expect = 5e-69,   Method: Compositional matrix adjust.
 Identities = 146/330 (44%), Positives = 203/330 (61%), Gaps = 21/330 (6%)

Query: 18  SQATYRTFDEGSIAEKFEQWKAQYGRTYKES---AENSKRFEIFKDNLVAVERFNNAAIG 74
           +++++RT DE  +   +E+W  + G+ +  +    E  +RF++FKDNL  ++  N+    
Sbjct: 37  TKSSWRTDDE--VMAIYEEWLVKNGKAHSNNNALGEKERRFQVFKDNLRFIDEHNSE--- 91

Query: 75  NRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYK-SSQVPPSVNWIEKG 133
           NRSY + LN+FADLT +E+ +   G +     + L  +   +L +    +P SV+W ++G
Sbjct: 92  NRSYKVGLNRFADLTNEEYRSMYLGARSGAKRNRLSRSSNRYLPRVGDSLPDSVDWRKEG 151

Query: 134 AVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMD 186
           AV  VK QG C        +AAVEGIN I    L+SLSEQ+LVDC     N GC GG MD
Sbjct: 152 AVAEVKDQGSCGSCWAFSTIAAVEGINKIVTGDLISLSEQELVDC-DRSYNEGCNGGLMD 210

Query: 187 DAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVAN 246
            AF++II N GI ++  Y Y     G CD+ +       I NYEDVP NDE++L KAVAN
Sbjct: 211 YAFQFIINNGGIDSEEDYPYLARD-GTCDTYRKNAKVVTIDNYEDVPVNDEKALQKAVAN 269

Query: 247 QPVSVAIDASA--LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWG 304
           QPVSVAI+A     QFY  G+F G C T L+HGV AVGYGT E G  YW+++NSWG+ WG
Sbjct: 270 QPVSVAIEAGGREFQFYQSGIFTGRCGTALDHGVAAVGYGT-ENGKDYWIVRNSWGKSWG 328

Query: 305 EDGYFRLQRDIDQPQGQCGIAMFASFPVSK 334
           E GY R++R+I    G+CGIA+  S+P+ K
Sbjct: 329 ESGYIRMERNIATATGKCGIAIEPSYPIKK 358


>gi|20334377|gb|AAM19209.1|AF493234_1 cysteine protease [Solanum lycopersicum]
 gi|400180431|gb|AFP73353.1| cysteine protease [Solanum lycopersicum]
          Length = 345

 Score =  267 bits (683), Expect = 5e-69,   Method: Compositional matrix adjust.
 Identities = 148/340 (43%), Positives = 207/340 (60%), Gaps = 20/340 (5%)

Query: 5   FLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVA 64
            LI +  +     +Q   R+  + S++E+ E W +++GR YK+  E  +RF IFK+N+  
Sbjct: 10  ILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFMIFKENMKF 69

Query: 65  VERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSD-HSSSLKANGTPFL----YK 119
           +E  N A  GN SY L +N+FAD+T QEF+A  TG  + + + S    + T F       
Sbjct: 70  IESVNKA--GNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEFKKINDLS 127

Query: 120 SSQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCA 172
              +P +++W E GAVT VK+QG+C       AV ++EG   I    L+  SEQ+L+DC 
Sbjct: 128 DDYMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCT 187

Query: 173 TNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDV 232
           TN  N GC GGFM +AF +II+N GI+ ++ Y Y G     C S + +  A QI++Y+ V
Sbjct: 188 TN--NYGCNGGFMTNAFDFIIENGGISRESDYEYLGQQY-TCRS-QEKTAAVQISSYQ-V 242

Query: 233 PPNDEESLLKAVANQPVSVAIDASA-LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIK 291
            P  E SLL+AV  QPVS+ I AS  LQFY+GG ++G C   +NH VTA+GYGT EEG K
Sbjct: 243 VPEGETSLLQAVTKQPVSIGIAASQDLQFYAGGTYDGNCADRINHAVTAIGYGTDEEGQK 302

Query: 292 YWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFP 331
           YWL+KNSWG  WGE+GY ++ RD   P G C IA  +S+P
Sbjct: 303 YWLLKNSWGTSWGENGYMKIIRDSGDPSGLCDIAKMSSYP 342


>gi|326493368|dbj|BAJ85145.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 436

 Score =  267 bits (682), Expect = 6e-69,   Method: Compositional matrix adjust.
 Identities = 151/338 (44%), Positives = 203/338 (60%), Gaps = 22/338 (6%)

Query: 9   VLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERF 68
           V + + +  S  +Y    E  +   + +W A++G TY    E  +RFE F+DNL  +++ 
Sbjct: 18  VSLAAAADMSIVSYGERSEEEVRRMYAEWMAEHGSTYNAIGEEERRFEAFRDNLRYIDQH 77

Query: 69  NNAA-IGNRSYTLRLNKFADLTPQEFIASQTGFKMS-DHSSSLKANGTPFLYKSS---QV 123
           N AA  G  S+ L LN+FADLT +E+ ++  G +   D    L A      Y+++   ++
Sbjct: 78  NAAADAGVHSFRLGLNRFADLTNEEYRSTYLGARTKPDRERKLSAR-----YQAADNDEL 132

Query: 124 PPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDN 176
           P SV+W +KGAV  VK QG C       A+AAVEGIN I    ++ LSEQ+LVDC T+  
Sbjct: 133 PESVDWRKKGAVGAVKDQGGCGSCWAFSAIAAVEGINQIVTGDMIPLSEQELVDCDTS-Y 191

Query: 177 NNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPND 236
           N GC GG MD AF++II N GI ++  Y Y+      CD+ K       I  YEDVP N 
Sbjct: 192 NQGCNGGLMDYAFEFIINNGGIDSEEDYPYKERDNR-CDANKKNAKVVTIDGYEDVPVNS 250

Query: 237 EESLLKAVANQPVSVAIDA--SALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWL 294
           E+SL KAVANQP+SVAI+A   A Q Y  G+F G C T L+HGV AVGYGT E G  YWL
Sbjct: 251 EKSLQKAVANQPISVAIEAGGRAFQLYKSGIFTGTCGTALDHGVAAVGYGT-ENGKDYWL 309

Query: 295 IKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
           ++NSWG  WGEDGY R++R+I    G+CGIA+  S+P 
Sbjct: 310 VRNSWGSVWGEDGYIRMERNIKASSGKCGIAVEPSYPT 347


>gi|400180447|gb|AFP73360.1| cysteine protease [Solanum chilense]
          Length = 345

 Score =  267 bits (682), Expect = 6e-69,   Method: Compositional matrix adjust.
 Identities = 147/340 (43%), Positives = 208/340 (61%), Gaps = 20/340 (5%)

Query: 5   FLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVA 64
            LI +  +     +Q   R+  + S++E+ E W +++GR YK+  E  +RF IFK+N+  
Sbjct: 10  ILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFMIFKENMKF 69

Query: 65  VERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSD-HSSSLKANGTPFL----YK 119
           +E  N A  GN SY L +N+FAD+T QEF+A  TG  + + + S    + T F       
Sbjct: 70  IESVNKA--GNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEFKKINDLS 127

Query: 120 SSQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCA 172
              +P +++W E GAVT VK+QGQC       AV ++EG   I   +L+  SEQ+L+DC 
Sbjct: 128 DDDMPSNLDWRESGAVTQVKHQGQCGCCWAFSAVGSLEGAYKIATGKLMEFSEQELLDCT 187

Query: 173 TNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDV 232
           TN  N GC GGFM +AF +II+N GI+ ++ Y Y G     C S + +  A QI++Y+ V
Sbjct: 188 TN--NYGCNGGFMTNAFDFIIENGGISRESDYEYLGEQY-TCRS-QEKTAAVQISSYQ-V 242

Query: 233 PPNDEESLLKAVANQPVSVAIDASA-LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIK 291
            P  E SLL+AV  QPVS+ I AS  LQFY+GG ++G C   +NH VTA+GYGT E+G K
Sbjct: 243 VPEGETSLLQAVTKQPVSIGIAASQDLQFYAGGTYDGSCADRINHAVTAIGYGTDEKGQK 302

Query: 292 YWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFP 331
           YWL+KNSWG  WGE+G+ ++ RD   P G C IA  +S+P
Sbjct: 303 YWLLKNSWGTSWGENGFMKIIRDSGNPSGLCDIAKMSSYP 342


>gi|160858205|dbj|BAF93840.1| triticain beta 2 [Triticum aestivum]
          Length = 469

 Score =  267 bits (682), Expect = 7e-69,   Method: Compositional matrix adjust.
 Identities = 144/316 (45%), Positives = 197/316 (62%), Gaps = 18/316 (5%)

Query: 34  FEQWKAQYGRTYKESA----ENSKRFEIFKDNLVAVERFN-NAAIGNRSYTLRLNKFADL 88
           ++ W A++G     +A    E  +RF  F DNL  V+  N  AA G   + L +N+FADL
Sbjct: 50  YDLWLAEHGGGSYPNANSIPERERRFRAFWDNLRFVDAHNARAAAGEEGFRLAMNRFADL 109

Query: 89  TPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQ-VPPSVNWIEKGAVTPVKYQGQC--- 144
           T  EF A+  G K    +   +  G  + +  ++ +P +V+W EKGAV PVK QGQC   
Sbjct: 110 TNDEFRAAYLGVK-GQRARPGRVVGERYRHDGAEELPEAVDWREKGAVAPVKNQGQCGSC 168

Query: 145 ----AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITN 200
               A++ VE IN I    +V+LSEQ+LV+C TN  ++GC GG MDDAF++II+N GI  
Sbjct: 169 WAFSAISTVESINQIVTGEMVTLSEQELVECDTNGQSSGCNGGLMDDAFEFIIKNGGIDT 228

Query: 201 DAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA--L 258
           +  Y Y+ +  G CD ++       I  +EDVP NDE+SL KAVA+QPVSVAI+A     
Sbjct: 229 EDDYPYKAID-GRCDVLRKNAKVVSIDGFEDVPENDEKSLQKAVAHQPVSVAIEAGGREF 287

Query: 259 QFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQP 318
           Q Y  GVF+G C T L+HGV AVGYGT E G  YW+++NSWG +WGE GY R++R+I+  
Sbjct: 288 QLYHSGVFSGRCGTQLDHGVVAVGYGT-ENGKDYWIVRNSWGPNWGEAGYLRMERNINVT 346

Query: 319 QGQCGIAMFASFPVSK 334
            G+CGIAM +S+P  K
Sbjct: 347 SGKCGIAMMSSYPTKK 362


>gi|400180357|gb|AFP73317.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score =  267 bits (682), Expect = 7e-69,   Method: Compositional matrix adjust.
 Identities = 147/339 (43%), Positives = 207/339 (61%), Gaps = 19/339 (5%)

Query: 5   FLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVA 64
            LI +  +     SQ   R+  + S++E+ E W +++GR YK+  E  +RF IFK+N+  
Sbjct: 10  ILITLFFVISMFNSQTRARSQPKLSVSERHELWMSRHGRVYKDEVEKGERFMIFKENMKF 69

Query: 65  VERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSD-HSSSLKANGTPFL---YKS 120
           +E  N A  GN SY L +N+FAD+T QEF+A  TG  + + + S    + T F       
Sbjct: 70  IESVNKA--GNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEFKINDLSD 127

Query: 121 SQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCAT 173
             +P +++W E GAVT VK+QG+C       AV ++EG   I    L+  SEQ+L+DC T
Sbjct: 128 DDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCTT 187

Query: 174 NDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVP 233
           N  N GC GGFM +AF +II+N GI+ ++ Y Y G     C S + +  A QI++Y+ V 
Sbjct: 188 N--NYGCNGGFMTNAFDFIIENGGISRESDYEYLGQQY-TCRS-QEKTAAVQISSYK-VV 242

Query: 234 PNDEESLLKAVANQPVSVAIDASA-LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKY 292
           P  E SLL+AV  QPVS+ I AS  LQFY+GG ++G C   +NH VTA+GYGT E+G KY
Sbjct: 243 PEGETSLLQAVTKQPVSIGIAASQDLQFYAGGTYDGSCADRINHAVTAIGYGTDEKGQKY 302

Query: 293 WLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFP 331
           WL+KNSWG  WGE+G+ ++ RD   P G C IA  +S+P
Sbjct: 303 WLLKNSWGTSWGENGFMKIIRDYGNPSGLCDIAKMSSYP 341


>gi|146215996|gb|ABQ10200.1| cysteine protease Cp2 [Actinidia deliciosa]
          Length = 376

 Score =  267 bits (682), Expect = 7e-69,   Method: Compositional matrix adjust.
 Identities = 144/312 (46%), Positives = 190/312 (60%), Gaps = 19/312 (6%)

Query: 34  FEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEF 93
           + +W A++G+ Y    E  +RFEIFKDNL  V+  N+    NRSY + LN+FADLT +E+
Sbjct: 47  YAEWLAKHGKAYNGIGERERRFEIFKDNLKFVDEHNSE---NRSYKVGLNRFADLTNEEY 103

Query: 94  IASQTGFKMSDHSSSLKANGTPFLY---KSSQVPPSVNWIEKGAVTPVKYQGQCA----- 145
            +   G K       +K+      Y    S  +P SV+W E GAV P+K QG C      
Sbjct: 104 RSMFLGTKTDSKRRFMKSKSASRRYAVQDSDMLPESVDWRESGAVAPIKDQGSCGSCWAF 163

Query: 146 --VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAV 203
             VAAVEG+N I    ++ LSEQ+LVDC     + GC GG MD AF++II N GI  +  
Sbjct: 164 STVAAVEGVNQIATGEMIQLSEQELVDCDRT-YDAGCNGGLMDYAFEFIINNGGIDTEED 222

Query: 204 YSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDAS--ALQFY 261
           Y Y G+  G CD  +       I +YEDVPP DE +L KAVA+QPVSVAI+AS  A Q Y
Sbjct: 223 YPYRGVD-GTCDPERKNTKVVSINDYEDVPPYDEMALKKAVAHQPVSVAIEASGRAFQLY 281

Query: 262 SGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRD-IDQPQG 320
             GVF G C   L+HGV  VGYGT + G  +W+++NSWG  WGE+GY R++R+ +D   G
Sbjct: 282 LSGVFTGECGRALDHGVVVVGYGT-DNGADHWIVRNSWGTSWGENGYIRMERNVVDNFGG 340

Query: 321 QCGIAMFASFPV 332
           +CGIAM AS+P+
Sbjct: 341 KCGIAMQASYPI 352


>gi|400180443|gb|AFP73358.1| cysteine protease, partial [Solanum habrochaites]
          Length = 345

 Score =  266 bits (681), Expect = 8e-69,   Method: Compositional matrix adjust.
 Identities = 148/343 (43%), Positives = 205/343 (59%), Gaps = 27/343 (7%)

Query: 5   FLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVA 64
            LI +  +     SQ   R+  + S++E+ E W +++GR YK+  E  +RF IFK+N+  
Sbjct: 10  ILITLFFVISMFNSQTRARSQPKLSVSERHELWMSRHGRVYKDEVEKGERFMIFKENMKF 69

Query: 65  VERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDH--------SSSLKANGTPF 116
           +E  N A  GN SY L +N+FAD+T +EF+A  TG  + +         S+  K N    
Sbjct: 70  IESVNKA--GNLSYKLGMNEFADITSEEFLAKFTGLNIPNSYLSPSPMPSTEFKIND--- 124

Query: 117 LYKSSQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLV 169
                 +P +++W E GAVT VK QGQC       AV ++EG   I    L+  SEQ+L+
Sbjct: 125 -LSDDDMPSNLDWRESGAVTQVKNQGQCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELL 183

Query: 170 DCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNY 229
           DC TN  N GC GGFM +AF +II+N GI+ ++ Y Y G     C S + +  A QI+NY
Sbjct: 184 DCTTN--NYGCNGGFMTNAFDFIIENGGISRESDYEYLGQQY-TCRS-QGKTAAVQISNY 239

Query: 230 EDVPPNDEESLLKAVANQPVSVAIDASA-LQFYSGGVFNGYCETFLNHGVTAVGYGTSEE 288
           + V P  E SLL+AV  QPVS+ I AS  LQFY+GG ++G C   +NH VTA+GYGT E+
Sbjct: 240 Q-VVPEGETSLLQAVTKQPVSIGIAASHDLQFYAGGTYDGSCANRINHAVTAIGYGTDEK 298

Query: 289 GIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFP 331
           G KYWL+KNSWG  WGE+G+ ++ RD   P G C IA  +S+P
Sbjct: 299 GQKYWLLKNSWGTSWGENGFMKIIRDSGNPAGLCDIAKMSSYP 341


>gi|226529105|ref|NP_001150196.1| cysteine protease 1 precursor [Zea mays]
 gi|194701798|gb|ACF84983.1| unknown [Zea mays]
 gi|194704800|gb|ACF86484.1| unknown [Zea mays]
 gi|195637480|gb|ACG38208.1| cysteine protease 1 precursor [Zea mays]
 gi|413919895|gb|AFW59827.1| cysteine protease 1 [Zea mays]
          Length = 470

 Score =  266 bits (681), Expect = 8e-69,   Method: Compositional matrix adjust.
 Identities = 145/323 (44%), Positives = 197/323 (60%), Gaps = 19/323 (5%)

Query: 27  EGSIAEKFEQWKAQYGRTY----KESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRL 82
           E  +   ++ W A++GR Y    +   E  +RF +F DNL  V+  N  A G R + L +
Sbjct: 50  EPEVRAMYDLWLAEHGRAYNALGEGEGERDRRFLVFWDNLRFVDAHNERA-GARGFRLGM 108

Query: 83  NKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSS--QVPPSVNWIEKGAVTPVKY 140
           N+FADLT  EF A+  G  M   +      G  + +  +  ++P SV+W EKGAV PVK 
Sbjct: 109 NQFADLTNDEFRAAYLG-AMVPAARRGAVVGERYRHDGAAEELPESVDWREKGAVAPVKN 167

Query: 141 QGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYII 193
           QGQC       AV++VE +N I    +V+LSEQ+LV+C+T+  N+GC GG MD AF +II
Sbjct: 168 QGQCGSCWAFSAVSSVESVNQIVTGEMVTLSEQELVECSTDGGNSGCNGGLMDAAFDFII 227

Query: 194 QNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAI 253
           +N GI  +  Y Y  +  G CD  +       I  +EDVP NDE+SL KAVA+QPVSVAI
Sbjct: 228 KNGGIDTEDDYPYRAVD-GKCDMNRKNARVVSIDGFEDVPENDEKSLQKAVAHQPVSVAI 286

Query: 254 DASA--LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRL 311
           +A     Q Y  GVF+G C T L+HGV AVGYG +E G  YW+++NSWG  WGE GY R+
Sbjct: 287 EAGGREFQLYKSGVFSGSCTTNLDHGVVAVGYG-AENGKDYWIVRNSWGPKWGEAGYIRM 345

Query: 312 QRDIDQPQGQCGIAMFASFPVSK 334
           +R+++   G+CGIAM AS+P  K
Sbjct: 346 ERNVNASTGKCGIAMMASYPTKK 368


>gi|195637152|gb|ACG38044.1| vignain precursor [Zea mays]
          Length = 377

 Score =  266 bits (681), Expect = 8e-69,   Method: Compositional matrix adjust.
 Identities = 150/303 (49%), Positives = 185/303 (61%), Gaps = 29/303 (9%)

Query: 55  FEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDH--------S 106
           F +FK N+  +  FN     +  Y LRLN+F D+T  EF     G +++ H         
Sbjct: 70  FNVFKANVRLIHEFNRR---DEPYKLRLNRFGDMTADEFRRHYAGSRVAHHRMFRGDRQG 126

Query: 107 SSLKANGTPFLYKSSQ-VPPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKIN 158
           SS  A+   F+Y  ++ VP SV+W +KGAVT VK QGQC        +AAVEGINAIK  
Sbjct: 127 SSASAS---FMYADARDVPASVDWRQKGAVTDVKDQGQCGSCWAFSTIAAVEGINAIKTK 183

Query: 159 RLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIK 218
            L SLSEQQLVDC T   N GC GG MD AF+YI ++ G+  +  Y Y       C   K
Sbjct: 184 NLTSLSEQQLVDCDTK-ANAGCNGGLMDYAFQYIAKHGGVAAEDAYPYRARQAS-CK--K 239

Query: 219 AEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA--LQFYSGGVFNGYCETFLNH 276
           +      I  YEDVP NDE +L KAVA+QPVSVAI+AS    QFYS GVF+G C T L+H
Sbjct: 240 SPAPVVTIDGYEDVPANDESALKKAVAHQPVSVAIEASGSHFQFYSEGVFSGRCGTELDH 299

Query: 277 GVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVSKES 336
           GV AVGYG + +G KYWL+KNSWG +WGE GY R+ RD+   +G CGIAM AS+PV K S
Sbjct: 300 GVAAVGYGVTADGTKYWLVKNSWGPEWGEKGYIRMARDVAAKEGHCGIAMEASYPV-KTS 358

Query: 337 AQP 339
             P
Sbjct: 359 PNP 361


>gi|400180383|gb|AFP73330.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score =  266 bits (681), Expect = 8e-69,   Method: Compositional matrix adjust.
 Identities = 146/339 (43%), Positives = 208/339 (61%), Gaps = 19/339 (5%)

Query: 5   FLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVA 64
            LI +  +     +Q   R+  + S++E+ E W +++GR YK+  E  +RF IFK+N+  
Sbjct: 10  ILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFMIFKENMKF 69

Query: 65  VERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSD-HSSSLKANGTPFL---YKS 120
           +E  N A  GN SY L +N+FAD+T QEF+A  TG  + + + S    + T F+      
Sbjct: 70  IESVNKA--GNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEFIINDLSD 127

Query: 121 SQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCAT 173
             +P +++W E GAVT VK+QG+C       AV ++EG   I    L+  SEQ+L+DC T
Sbjct: 128 DDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCTT 187

Query: 174 NDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVP 233
           N  N GC GGFM +AF +II+N GI+ ++ Y Y G     C S + +  A QI++Y+ V 
Sbjct: 188 N--NYGCNGGFMTNAFDFIIENGGISRESDYEYLGQQY-TCRS-QEKTAAVQISSYK-VV 242

Query: 234 PNDEESLLKAVANQPVSVAIDASA-LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKY 292
           P  E SLL+AV  QPVS+ I AS  LQFY+GG ++G C   +NH VTA+GYGT E+G KY
Sbjct: 243 PEGETSLLQAVTKQPVSIGIAASQDLQFYAGGTYDGSCADRINHAVTAIGYGTDEKGQKY 302

Query: 293 WLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFP 331
           WL+KNSWG  WGE+G+ ++ RD   P G C IA  +S+P
Sbjct: 303 WLLKNSWGTSWGENGFMKIIRDSGNPSGLCDIAKMSSYP 341


>gi|30685308|ref|NP_566634.2| putative cysteine proteinase [Arabidopsis thaliana]
 gi|30315949|sp|Q9LT77.1|CPR1_ARATH RecName: Full=Probable cysteine proteinase At3g19400; Flags:
           Precursor
 gi|11994462|dbj|BAB02464.1| cysteine proteinase [Arabidopsis thaliana]
 gi|332642715|gb|AEE76236.1| putative cysteine proteinase [Arabidopsis thaliana]
          Length = 362

 Score =  266 bits (681), Expect = 8e-69,   Method: Compositional matrix adjust.
 Identities = 151/329 (45%), Positives = 198/329 (60%), Gaps = 18/329 (5%)

Query: 26  DEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKF 85
           +E  +   +EQW  +  + Y    E  +RF+IFKDNL  V+  N  ++ +R++ + L +F
Sbjct: 36  NETEVRLMYEQWLVENRKNYNGLGEKERRFKIFKDNLKFVDEHN--SVPDRTFEVGLTRF 93

Query: 86  ADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQV-PPSVNWIEKGAVTPVKYQGQC 144
           ADLT +EF A     KM     S+K     +LYK   V P  V+W   GAV  VK QG C
Sbjct: 94  ADLTNEEFRAIYLRKKMERTKDSVKTE--RYLYKEGDVLPDEVDWRANGAVVSVKDQGNC 151

Query: 145 -------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKG 197
                  AV AVEGIN I    L+SLSEQ+LVDC     N GC GG M+ AF++I++N G
Sbjct: 152 GSCWAFSAVGAVEGINQITTGELISLSEQELVDCDRGFVNAGCDGGIMNYAFEFIMKNGG 211

Query: 198 ITNDAVYSYEGMSTGICDSIKAED-HAAQITNYEDVPPNDEESLLKAVANQPVSVAIDAS 256
           I  D  Y Y     G+C++ K  +     I  YEDVP +DE+SL KAVA+QPVSVAI+AS
Sbjct: 212 IETDQDYPYNANDLGLCNADKNNNTRVVTIDGYEDVPRDDEKSLKKAVAHQPVSVAIEAS 271

Query: 257 --ALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRD 314
             A Q Y  GV  G C   L+HGV  VGYG++  G  YW+I+NSWG +WG+ GY +LQR+
Sbjct: 272 SQAFQLYKSGVMTGTCGISLDHGVVVVGYGST-SGEDYWIIRNSWGLNWGDSGYVKLQRN 330

Query: 315 IDQPQGQCGIAMFASFPVSKESAQPSSAD 343
           ID P G+CGIAM  S+P   +S+ PSS D
Sbjct: 331 IDDPFGKCGIAMMPSYPT--KSSFPSSFD 357


>gi|226508570|ref|NP_001141984.1| uncharacterized protein LOC100274134 precursor [Zea mays]
 gi|194706676|gb|ACF87422.1| unknown [Zea mays]
 gi|413920745|gb|AFW60677.1| vignain [Zea mays]
          Length = 363

 Score =  266 bits (680), Expect = 1e-68,   Method: Compositional matrix adjust.
 Identities = 139/322 (43%), Positives = 201/322 (62%), Gaps = 25/322 (7%)

Query: 26  DEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKF 85
           DE  +  ++++W AQY R YK+ AE + RF++FK N   ++R N  A G + Y L  N+F
Sbjct: 51  DEAMMMARYKKWMAQYRRKYKDDAEKAHRFQVFKANAEFIDRSN--AGGKKKYVLGTNQF 108

Query: 86  ADLTPQEFIASQTGFK----MSDHSSSLKANGTPFL-YKSSQVPPSVNWIEKGAVTPVKY 140
           ADLT +EF A  TG +    +   +  + A G+ +  +        V+W ++GAVTPVK 
Sbjct: 109 ADLTSKEFAAMYTGLRKPAAVPSGAKQIPAAGSKYQNFTRLDDDVQVDWRQQGAVTPVKN 168

Query: 141 QGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYII 193
           QGQC       AV A+EG+  I    LVSLSEQQ++DC  +D N GC GG+MD+AF+Y+I
Sbjct: 169 QGQCGCCWAFSAVGAMEGLIMITTGNLVSLSEQQILDCDESDGNQGCNGGYMDNAFQYVI 228

Query: 194 QNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAI 253
            N G+T +  Y Y  +  G C +++    AA I+ ++D+P  DE +L  AVANQPVSV +
Sbjct: 229 NNGGVTTEDAYPYSAVQ-GTCQNVQP---AATISGFQDLPSGDENALANAVANQPVSVGV 284

Query: 254 D--ASALQFYSGGVFNG-YCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFR 310
           D  +S  QFY GG+++G  C T +NH VTA+GYG  ++G +YW++KNSWG  WGE+G+ +
Sbjct: 285 DGGSSPFQFYQGGIYDGDGCGTDMNHAVTAIGYGADDQGTQYWILKNSWGTGWGENGFMQ 344

Query: 311 LQRDIDQPQGQCGIAMFASFPV 332
           LQ  +    G CGI+  AS+P 
Sbjct: 345 LQMGV----GACGISTMASYPT 362


>gi|22759715|dbj|BAC10906.1| cysteine proteinase [Zinnia elegans]
          Length = 352

 Score =  266 bits (680), Expect = 1e-68,   Method: Compositional matrix adjust.
 Identities = 147/316 (46%), Positives = 192/316 (60%), Gaps = 18/316 (5%)

Query: 30  IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLT 89
           +   FE W  ++ + Y+   E   RFEIF DNL  ++  N       +Y L LN+FADLT
Sbjct: 45  VIHLFESWLVKHSKFYESLDEKLHRFEIFMDNLKHIDETNKKV---SNYWLGLNEFADLT 101

Query: 90  PQEFIASQTGFKMSDHSSSLKANGTPFLYKS-SQVPPSVNWIEKGAVTPVKYQGQCA--- 145
            +EF     GFK  + +     +   F Y+    +P SV+W +KGAV PVK QGQC    
Sbjct: 102 HEEFKHKFLGFK-GELAERKDESSKEFGYRDFVDLPKSVDWRKKGAVAPVKNQGQCGSCW 160

Query: 146 ----VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITND 201
               VAAVEGIN I    L  LSEQ+L+DC T   NNGC GG MD AF Y++++ G+  +
Sbjct: 161 AFSTVAAVEGINQIVTGNLTMLSEQELIDCDTT-FNNGCNGGLMDYAFAYVMRS-GLHKE 218

Query: 202 AVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA--LQ 259
             Y Y  MS G CD  K       I+ Y DVP NDE S LKA+ANQP+SVAI+AS    Q
Sbjct: 219 EEYPYI-MSEGTCDEKKDVSEKVTISGYHDVPRNDEASFLKALANQPISVAIEASGRDFQ 277

Query: 260 FYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQ 319
           FYSGGVF+G+C T L+HGV AVGYGT++ G+ Y +++NSWG  WGE GY R++R   +P 
Sbjct: 278 FYSGGVFDGHCGTELDHGVAAVGYGTTK-GLDYVIVRNSWGPKWGEKGYIRMKRGSGKPH 336

Query: 320 GQCGIAMFASFPVSKE 335
           G CG+ M AS+P  ++
Sbjct: 337 GMCGLYMMASYPTKQK 352


>gi|400180353|gb|AFP73315.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score =  266 bits (680), Expect = 1e-68,   Method: Compositional matrix adjust.
 Identities = 146/339 (43%), Positives = 208/339 (61%), Gaps = 19/339 (5%)

Query: 5   FLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVA 64
            LI +  +     +Q   R+  + S++E+ E W +++GR YK+  E  +RF IFK+N+  
Sbjct: 10  ILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFMIFKENMKF 69

Query: 65  VERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSD-HSSSLKANGTPFL---YKS 120
           +E  N A  GN SY L +N+FAD+T QEF+A  TG  + + + S    + T F+      
Sbjct: 70  IESVNKA--GNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEFIINDLSD 127

Query: 121 SQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCAT 173
             +P +++W E GAVT VK+QG+C       AV ++EG   I    L+  SEQ+L+DC T
Sbjct: 128 DDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCTT 187

Query: 174 NDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVP 233
           N  N GC GGFM +AF +II+N GI+ ++ Y Y G     C S + +  A QI++Y+ V 
Sbjct: 188 N--NYGCNGGFMTNAFDFIIENGGISRESDYEYLGEQY-TCRS-QEKTAAVQISSYQ-VV 242

Query: 234 PNDEESLLKAVANQPVSVAIDASA-LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKY 292
           P  E SLL+AV  QPVS+ I AS  LQFY+GG ++G C   +NH VTA+GYGT E+G KY
Sbjct: 243 PEGETSLLQAVTKQPVSIGIAASQDLQFYAGGTYDGSCADRINHAVTAIGYGTDEKGQKY 302

Query: 293 WLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFP 331
           WL+KNSWG  WGE+G+ ++ RD   P G C IA  +S+P
Sbjct: 303 WLLKNSWGTSWGENGFMKIIRDSGNPAGLCDIAKMSSYP 341


>gi|225428879|ref|XP_002285299.1| PREDICTED: cysteine proteinase RD21a-like [Vitis vinifera]
          Length = 469

 Score =  266 bits (680), Expect = 1e-68,   Method: Compositional matrix adjust.
 Identities = 147/312 (47%), Positives = 190/312 (60%), Gaps = 20/312 (6%)

Query: 34  FEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEF 93
           +E W  ++G++Y    E  +RFEIFKDNL  +E  N     NR+Y + LN+FADLT +E+
Sbjct: 54  YEAWLVKHGKSYNALGERERRFEIFKDNLRFIEEHNAV---NRTYKVGLNRFADLTNEEY 110

Query: 94  IASQTGFKMSDHSSSLKANGTPFLYK---SSQVPPSVNWIEKGAVTPVKYQGQCA----- 145
            +   G +  +    L+A+     Y       +P SV+W EKGAV PVK QG C      
Sbjct: 111 RSRYLG-RRDETRRGLRASRVSDRYSFRAGEDLPESVDWREKGAVVPVKDQGNCGSCWAF 169

Query: 146 --VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAV 203
             +AAVEGIN I    L+SLSEQ+LVDC     N GC GG MD AF++II N GI ++  
Sbjct: 170 STIAAVEGINQIATGDLISLSEQELVDC-DKSYNQGCNGGLMDYAFEFIINNGGIDSEED 228

Query: 204 YSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDA--SALQFY 261
           Y Y    T  CD  +       I  YEDVP NDE SL KAVANQPVSVAI+A   A Q Y
Sbjct: 229 YPYRAADT-TCDPNRKNARVVSIDGYEDVPQNDERSLKKAVANQPVSVAIEAGGRAFQLY 287

Query: 262 SGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQ-G 320
             GVF G C T L+HGV AVGYGT E  + YW+++NSWG +WGE GY +L+R++   + G
Sbjct: 288 QSGVFTGQCGTQLDHGVVAVGYGT-ENSVDYWIVRNSWGPNWGESGYIKLERNLAGTETG 346

Query: 321 QCGIAMFASFPV 332
           +CGIA+  S+P+
Sbjct: 347 KCGIAIEPSYPI 358


>gi|400180457|gb|AFP73365.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score =  266 bits (680), Expect = 1e-68,   Method: Compositional matrix adjust.
 Identities = 146/339 (43%), Positives = 208/339 (61%), Gaps = 19/339 (5%)

Query: 5   FLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVA 64
            LI +  +     +Q   R+  + S++E+ E W +++GR YK+  E  +RF IFK+N+  
Sbjct: 10  ILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFMIFKENMKF 69

Query: 65  VERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSD-HSSSLKANGTPFL---YKS 120
           +E  N A  GN SY L +N+FAD+T QEF+A  TG  + + + S    + T F+      
Sbjct: 70  IESVNKA--GNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEFIINDLSD 127

Query: 121 SQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCAT 173
             +P +++W E GAVT VK+QG+C       AV ++EG   I    L+  SEQ+L+DC T
Sbjct: 128 DDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCTT 187

Query: 174 NDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVP 233
           N  N GC GGFM +AF +II+N GI+ ++ Y Y G     C S + +  A QI++Y+ V 
Sbjct: 188 N--NYGCNGGFMTNAFDFIIENGGISRESDYEYLGEQY-TCRS-QEKTAAVQISSYK-VV 242

Query: 234 PNDEESLLKAVANQPVSVAIDASA-LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKY 292
           P  E SLL+AV  QPVS+ I AS  LQFY+GG ++G C   +NH VTA+GYGT E+G KY
Sbjct: 243 PEGETSLLQAVTKQPVSIGIAASQDLQFYAGGTYDGSCADRINHAVTAIGYGTDEKGQKY 302

Query: 293 WLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFP 331
           WL+KNSWG  WGE+G+ ++ RD   P G C IA  +S+P
Sbjct: 303 WLLKNSWGTSWGENGFMKIIRDSGNPSGLCDIAKMSSYP 341


>gi|297603535|ref|NP_001054211.2| Os04g0670200 [Oryza sativa Japonica Group]
 gi|109939735|sp|P25777.2|ORYB_ORYSJ RecName: Full=Oryzain beta chain; Flags: Precursor
 gi|32488398|emb|CAE02823.1| OSJNBa0043A12.28 [Oryza sativa Japonica Group]
 gi|90399163|emb|CAJ86092.1| H0818H01.14 [Oryza sativa Indica Group]
 gi|125550169|gb|EAY95991.1| hypothetical protein OsI_17862 [Oryza sativa Indica Group]
 gi|215766596|dbj|BAG98700.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|255675868|dbj|BAF16125.2| Os04g0670200 [Oryza sativa Japonica Group]
          Length = 466

 Score =  266 bits (679), Expect = 1e-68,   Method: Compositional matrix adjust.
 Identities = 143/311 (45%), Positives = 196/311 (63%), Gaps = 17/311 (5%)

Query: 34  FEQWKAQYGRTYKES--AENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQ 91
           ++ W A+ G     +   E+ +RF +F DNL  V+  N  A     + L +N+FADLT +
Sbjct: 52  YDLWLAENGGGSPNALGGEHERRFLVFWDNLKFVDAHNARADERGGFRLGMNRFADLTNE 111

Query: 92  EFIASQTGFKMSDHSSSLKANGTPFLYKS-SQVPPSVNWIEKGAVTPVKYQGQC------ 144
           EF A+  G K+++ S   +A G  + +    ++P SV+W EKGAV PVK QGQC      
Sbjct: 112 EFRATFLGAKVAERS---RAAGERYRHDGVEELPESVDWREKGAVAPVKNQGQCGSCWAF 168

Query: 145 -AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAV 203
            AV+ VE IN +    +++LSEQ+LV+C+TN  N+GC GG MDDAF +II+N GI  +  
Sbjct: 169 SAVSTVESINQLVTGEMITLSEQELVECSTNGQNSGCNGGLMDDAFDFIIKNGGIDTEDD 228

Query: 204 YSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA--LQFY 261
           Y Y+ +  G CD  +       I  +EDVP NDE+SL KAVA+QPVSVAI+A     Q Y
Sbjct: 229 YPYKAVD-GKCDINRENAKVVSIDGFEDVPQNDEKSLQKAVAHQPVSVAIEAGGREFQLY 287

Query: 262 SGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQ 321
             GVF+G C T L+HGV AVGYGT + G  YW+++NSWG  WGE GY R++R+I+   G+
Sbjct: 288 HSGVFSGRCGTSLDHGVVAVGYGT-DNGKDYWIVRNSWGPKWGESGYVRMERNINVTTGK 346

Query: 322 CGIAMFASFPV 332
           CGIAM AS+P 
Sbjct: 347 CGIAMMASYPT 357


>gi|357446979|ref|XP_003593765.1| Cysteine proteinase [Medicago truncatula]
 gi|355482813|gb|AES64016.1| Cysteine proteinase [Medicago truncatula]
          Length = 364

 Score =  266 bits (679), Expect = 1e-68,   Method: Compositional matrix adjust.
 Identities = 158/296 (53%), Positives = 199/296 (67%), Gaps = 21/296 (7%)

Query: 49  AENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDH--S 106
           +E  KR  IFK+NL  +E FNNA  GN+SY L LN+++DLT  EF+AS TG K+S    S
Sbjct: 77  SELEKRKRIFKNNLEYIENFNNA--GNKSYKLGLNQYSDLTSDEFLASHTGLKVSKQLSS 134

Query: 107 SSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINR 159
           S +++   PF   +  VP + +W ++GAVT VK QG C        VAAVEG   I    
Sbjct: 135 SKMRSAAVPFNL-NDDVPTNFDWRQQGAVTDVKDQGSCGCCWAFSVVAAVEGAVKINTGE 193

Query: 160 LVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSY-EGMST-GICDSI 217
           L+SLSEQQLVDC  ++ N+GC+GG MD AFKYIIQ KGI ++A Y Y EG  T  + D +
Sbjct: 194 LISLSEQQLVDC--DERNSGCHGGNMDSAFKYIIQ-KGIVSEADYPYQEGSQTCQLNDQM 250

Query: 218 KAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAID-ASALQFYSGGVFNGYCETFLNH 276
           K E   AQITN+ DVP NDE+ LL+AVA QPVSV I+     Q Y G V++G C   +NH
Sbjct: 251 KFE---AQITNFIDVPANDEQQLLQAVAQQPVSVGIEVGDEFQHYMGDVYSGTCGQSMNH 307

Query: 277 GVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
            VTAVGYG SE+G KYWLIKNSWG+ WGE+GY +L R+  +P GQCGIA  AS+P+
Sbjct: 308 AVTAVGYGVSEDGTKYWLIKNSWGKGWGEEGYMKLLRESGEPGGQCGIAAHASYPI 363


>gi|195628596|gb|ACG36128.1| vignain precursor [Zea mays]
          Length = 362

 Score =  266 bits (679), Expect = 1e-68,   Method: Compositional matrix adjust.
 Identities = 148/360 (41%), Positives = 209/360 (58%), Gaps = 39/360 (10%)

Query: 1   MAKYFLIVVLIISGSCASQATY-----------RTF--DEGSIAEKFEQWKAQYGRTYKE 47
           M    L++ +I    C  QA             RT   DE  +  ++++W AQY R YK+
Sbjct: 13  MTTLMLLLCVIAIADCICQAAVAARVEPSTTVGRTTGGDEAMMMARYKKWMAQYRRKYKD 72

Query: 48  SAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSS 107
            AE + RF++FK N   ++R N  A G + Y L  N+FADLT +EF A  TG +      
Sbjct: 73  DAEKAHRFQVFKANAEFIDRSN--AGGKKKYVLGTNQFADLTSKEFAAMYTGLRKPAAVP 130

Query: 108 SLKANGTPFLYKSSQVPP-----SVNWIEKGAVTPVKYQGQC-------AVAAVEGINAI 155
           S  A   P  +K            V+W ++GAVTPVK QGQC       AV A+EG+  I
Sbjct: 131 S-GAKQIPAGFKYQNFTRLDDDVQVDWRQQGAVTPVKNQGQCGCCWAFSAVGAMEGLIMI 189

Query: 156 KINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICD 215
               LVSLSEQQ++DC  +D N GC GG+MD+AF+Y++ N G+T +  Y Y  +  G C 
Sbjct: 190 TTGNLVSLSEQQILDCDESDGNQGCNGGYMDNAFQYVVNNGGVTTEDAYPYSAVQ-GTCQ 248

Query: 216 SIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAID--ASALQFYSGGVFNG-YCET 272
           +++    AA I+ ++D+P  DE +L  AVANQPVSV +D  +S  QFY GG+++G  C T
Sbjct: 249 NVQP---AATISGFQDLPSGDENALANAVANQPVSVGVDGGSSPFQFYQGGIYDGDGCGT 305

Query: 273 FLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
            +NH VTA+GYG  ++G +YW++KNSWG  WGE+G+ +LQ  +    G CGI+  AS+P 
Sbjct: 306 DMNHAVTAIGYGADDQGTQYWILKNSWGTGWGENGFMQLQMGV----GACGISTMASYPT 361


>gi|121308860|dbj|BAF43527.1| cysteine proteinase [Zinnia elegans]
          Length = 352

 Score =  266 bits (679), Expect = 1e-68,   Method: Compositional matrix adjust.
 Identities = 147/316 (46%), Positives = 192/316 (60%), Gaps = 18/316 (5%)

Query: 30  IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLT 89
           +   FE W  ++ + Y+   E   RFEIF DNL  ++  N       +Y L LN+FADLT
Sbjct: 45  VIHLFESWLVKHSKFYESLDEKLHRFEIFMDNLKHIDETNKKV---SNYWLGLNEFADLT 101

Query: 90  PQEFIASQTGFKMSDHSSSLKANGTPFLYKS-SQVPPSVNWIEKGAVTPVKYQGQCA--- 145
            +EF     GFK  + +     +   F Y+    +P SV+W +KGAV PVK QGQC    
Sbjct: 102 HEEFKHKFLGFK-GELAERKDESSKEFGYRDFVDLPKSVDWRKKGAVAPVKNQGQCGNCW 160

Query: 146 ----VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITND 201
               VAAVEGIN I    L  LSEQ+L+DC T   NNGC GG MD AF Y++++ G+  +
Sbjct: 161 AFSTVAAVEGINQIVTGNLTMLSEQELIDCDTT-FNNGCNGGLMDYAFAYVMRS-GLHKE 218

Query: 202 AVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA--LQ 259
             Y Y  MS G CD  K       I+ Y DVP NDE S LKA+ANQP+SVAI+AS    Q
Sbjct: 219 EEYPYI-MSEGTCDEKKDVSEKVTISGYHDVPRNDEASFLKALANQPISVAIEASGRDFQ 277

Query: 260 FYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQ 319
           FYSGGVF+G+C T L+HGV AVGYGT++ G+ Y +++NSWG  WGE GY R++R   +P 
Sbjct: 278 FYSGGVFDGHCGTELDHGVAAVGYGTTK-GLDYVIVRNSWGPKWGEKGYIRMKRGSGKPH 336

Query: 320 GQCGIAMFASFPVSKE 335
           G CG+ M AS+P  ++
Sbjct: 337 GMCGLYMMASYPTKQK 352


>gi|297845064|ref|XP_002890413.1| hypothetical protein ARALYDRAFT_472321 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297336255|gb|EFH66672.1| hypothetical protein ARALYDRAFT_472321 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 357

 Score =  266 bits (679), Expect = 1e-68,   Method: Compositional matrix adjust.
 Identities = 148/314 (47%), Positives = 191/314 (60%), Gaps = 17/314 (5%)

Query: 30  IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLT 89
           + E FE W + + + Y+   E   RFE+FKDNL  ++  N      +SY L LN+FADL+
Sbjct: 47  LIELFENWISNFEKAYETVEEKLLRFEVFKDNLKHIDETNKKV---KSYWLGLNEFADLS 103

Query: 90  PQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQ-VPPSVNWIEKGAVTPVKYQGQCA--- 145
            +EF     G K        + +   F Y+  + VP SV+W +KGAV  VK QG C    
Sbjct: 104 HEEFKKMYLGLKTDIVRRDEERSYAEFAYRDVEAVPKSVDWRKKGAVAEVKNQGSCGSCW 163

Query: 146 ----VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITND 201
               VAAVEGIN I    L +LSEQ+L+DC T   NNGC GG MD AF+YI++N G+  +
Sbjct: 164 AFSTVAAVEGINKIVTGNLTTLSEQELIDCDTT-YNNGCNGGLMDYAFEYIVKNGGLRKE 222

Query: 202 AVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA--LQ 259
             Y Y  M  G C+  K E     I  ++DVP NDE+SLLKA+A+QP+SVAIDAS    Q
Sbjct: 223 EDYPY-SMEEGTCEMQKDESETVTIDGHQDVPTNDEKSLLKALAHQPLSVAIDASGREFQ 281

Query: 260 FYSG-GVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQP 318
           FYSG  VF+G C   L+HGV AVGYG+S+ G  Y ++KNSWG  WGE GY RL+R+  +P
Sbjct: 282 FYSGVSVFDGRCGVDLDHGVAAVGYGSSK-GSDYIIVKNSWGPKWGEKGYIRLKRNTGKP 340

Query: 319 QGQCGIAMFASFPV 332
           +G CGI   ASFP 
Sbjct: 341 EGLCGINKMASFPT 354


>gi|146216004|gb|ABQ10204.1| cysteine protease Cp6 [Actinidia deliciosa]
          Length = 461

 Score =  266 bits (679), Expect = 2e-68,   Method: Compositional matrix adjust.
 Identities = 151/323 (46%), Positives = 198/323 (61%), Gaps = 24/323 (7%)

Query: 23  RTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRL 82
           RT DE  +   +E W  ++G++Y    E  KRF+IFKDNL  ++  N  +   R+Y + L
Sbjct: 37  RTDDE--VMAMYESWLVKHGKSYNAIGEKEKRFQIFKDNLRFIDEHNAES---RTYKVGL 91

Query: 83  NKFADLTPQEF----IASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPV 138
           N+FADLT  E+    + ++TG +    +        P   +S  +P SV+W EKGAV  V
Sbjct: 92  NRFADLTNDEYRSMYLGARTGSRRRLSTQKRSDRYVPVAGES--LPDSVDWREKGAVVGV 149

Query: 139 KYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKY 191
           K QG C        +AAVEGIN I    L+SLSEQ+LVDC T+  N GC GG MD AF++
Sbjct: 150 KDQGSCGSCWAFSTIAAVEGINQIVTGDLISLSEQELVDCDTS-YNEGCNGGLMDYAFEF 208

Query: 192 IIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSV 251
           II+N GI  +  Y Y     G CD  +       I +YEDVP N+E++L KAVANQPVSV
Sbjct: 209 IIKNGGIDTEEDYPYNARD-GRCDQYRKNAKVVTIDDYEDVPVNNEQALQKAVANQPVSV 267

Query: 252 AIDAS--ALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYF 309
           AI+AS  A QFY  GVF G C T L+HGVTAVGYGT E  + YW++KNSWG  WGE GY 
Sbjct: 268 AIEASGMAFQFYESGVFTGNCGTALDHGVTAVGYGT-ENSVDYWIVKNSWGSSWGESGYI 326

Query: 310 RLQRDIDQPQGQCGIAMFASFPV 332
           R++R+     G+CGIA+  S+P+
Sbjct: 327 RMERNT-GATGKCGIAVEPSYPI 348


>gi|57282619|emb|CAE54307.1| cysteine proteinase [Gossypium hirsutum]
          Length = 372

 Score =  266 bits (679), Expect = 2e-68,   Method: Compositional matrix adjust.
 Identities = 149/323 (46%), Positives = 199/323 (61%), Gaps = 19/323 (5%)

Query: 22  YRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLR 81
           +R+ DE  +   ++ W  Q+G+ Y    E  KRFEIFKDNL  ++  N+    N +Y L 
Sbjct: 36  WRSDDE--VMGLYKSWVIQHGKAYNGIGEEEKRFEIFKDNLRFIDEHNSN--NNTTYKLG 91

Query: 82  LNKFADLTPQEFIASQTGFKMSDHSSSLKAN--GTPFLYKS-SQVPPSVNWIEKGAVTPV 138
           LNKFADLT QE+ A   G +       +K+    + + +++   +P SVNW + GAV+ V
Sbjct: 92  LNKFADLTNQEYRAKFLGTRTDPRRRLMKSKIPSSRYAHRAGDNLPDSVNWRDHGAVSRV 151

Query: 139 KYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKY 191
           K QG C       A+AAVEGIN I    L+SLSEQ+LVDC     + GC GG MD AF++
Sbjct: 152 KDQGSCGSCWAFSAIAAVEGINKIVSGELISLSEQELVDC-DRSYDAGCNGGLMDYAFQF 210

Query: 192 IIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSV 251
           II N GI  +  Y Y G +   CD  K       I  YEDVP N+E +L KAVA+QPVS+
Sbjct: 211 IIDNGGIDTEKDYPYLGFNNQ-CDPTKKNAKVVSIDGYEDVP-NNENALKKAVAHQPVSI 268

Query: 252 AIDAS--ALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYF 309
           AI+A   A Q Y  GVFNG C   L+HGV AVGYG+ + G  YW+++NSWG +WGE+GY 
Sbjct: 269 AIEAGGRAFQLYESGVFNGECGLALDHGVVAVGYGSDDNGQDYWIVRNSWGGNWGENGYI 328

Query: 310 RLQRDIDQPQGQCGIAMFASFPV 332
           R++R+I+   G+CGIAM AS+PV
Sbjct: 329 RMERNINANTGKCGIAMEASYPV 351


>gi|400180387|gb|AFP73332.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score =  265 bits (678), Expect = 2e-68,   Method: Compositional matrix adjust.
 Identities = 147/343 (42%), Positives = 207/343 (60%), Gaps = 27/343 (7%)

Query: 5   FLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVA 64
            LI +  +     +Q   R+  + S++E+ E W +++GR YK+  E  +RF IFK+N+  
Sbjct: 10  ILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKVERFMIFKENMKF 69

Query: 65  VERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDH--------SSSLKANGTPF 116
           +E  N A  GN SY L +N+FAD+T QEF+A  TG  + +         S+ LK N    
Sbjct: 70  IESVNKA--GNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTELKIND--- 124

Query: 117 LYKSSQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLV 169
                 +P +++WIE GAVT VK+QG+C       AV ++EG   I    L+  SEQ+L+
Sbjct: 125 -LSDDDMPSNLDWIESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELL 183

Query: 170 DCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNY 229
           DC TN  N GC GGFM +AF +I +N GI+ ++ Y Y G     C S + +  A QI++Y
Sbjct: 184 DCTTN--NYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQY-TCRS-QEKTAAVQISSY 239

Query: 230 EDVPPNDEESLLKAVANQPVSVAIDASA-LQFYSGGVFNGYCETFLNHGVTAVGYGTSEE 288
           + V P  E SLL+AV  QPVS+ I AS  LQFY+GG ++G C   +NH VTA+GYGT E+
Sbjct: 240 Q-VVPEGETSLLQAVTKQPVSIGIAASQDLQFYAGGTYDGSCADRINHAVTAIGYGTDEK 298

Query: 289 GIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFP 331
           G KYWL+KNSWG  WGE+G+ ++ RD   P G C IA  +S+P
Sbjct: 299 GQKYWLLKNSWGTSWGENGFMKIIRDYGNPAGLCDIAKMSSYP 341


>gi|212275830|ref|NP_001130503.1| cysteine protease 1 [Zea mays]
 gi|194689328|gb|ACF78748.1| unknown [Zea mays]
 gi|219886279|gb|ACL53514.1| unknown [Zea mays]
 gi|238010470|gb|ACR36270.1| unknown [Zea mays]
 gi|413920875|gb|AFW60807.1| cysteine protease 1 [Zea mays]
          Length = 354

 Score =  265 bits (678), Expect = 2e-68,   Method: Compositional matrix adjust.
 Identities = 150/322 (46%), Positives = 200/322 (62%), Gaps = 22/322 (6%)

Query: 25  FDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNK 84
           + E ++  + +QW A++GRTY++ AE + RF++FK N   V+  N A    +SY L LN+
Sbjct: 42  YGEEAMKVRHQQWMAEHGRTYRDEAEKAHRFQVFKANADFVDASNAAGDDKKSYRLELNE 101

Query: 85  FADLTPQEFIASQTGFKMSDHSSSLKAN---GTPFLYKSSQVPPSVNWIEKGAVTPVKYQ 141
           FAD+T  EF+A  TG +     +   A    G   L  +     +V+W +KGAVT +K Q
Sbjct: 102 FADMTNDEFMAMYTGLRPVPAGAKKMAGFKYGNVTLSDADDDQQTVDWRQKGAVTGIKNQ 161

Query: 142 GQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQ 194
           GQC       AVAAVEGI+ I    LVSLSEQQ++DC T D NNGC GG++D+AF+YI+ 
Sbjct: 162 GQCGCCWAFAAVAAVEGIHQITTGNLVSLSEQQVLDCDT-DGNNGCNGGYIDNAFQYIVG 220

Query: 195 NKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAID 254
           N G+  +  Y Y   +  +C S++     A I+ Y+DVP  DE +L  AVANQPVSVAID
Sbjct: 221 NGGLGTEDAYPYTA-AQAMCQSVQP---VAAISGYQDVPSGDEAALAAAVANQPVSVAID 276

Query: 255 ASALQFYSGGVFNGY-CET--FLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRL 311
           A   Q Y GGV     C T   LNH VTAVGYGT+E+G  YWL+KN WGQ+WGE GY RL
Sbjct: 277 AHNFQLYGGGVMTAASCSTPPNLNHAVTAVGYGTAEDGTPYWLLKNQWGQNWGEGGYLRL 336

Query: 312 QRDIDQPQGQCGIAMFASFPVS 333
           +R  +     CG+A  AS+PV+
Sbjct: 337 ERGAN----ACGVAQQASYPVA 354


>gi|400180453|gb|AFP73363.1| cysteine protease [Solanum chilense]
          Length = 344

 Score =  265 bits (678), Expect = 2e-68,   Method: Compositional matrix adjust.
 Identities = 146/339 (43%), Positives = 207/339 (61%), Gaps = 19/339 (5%)

Query: 5   FLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVA 64
            LI +  +     +Q   R+  + S++E+ E W +++GR YK+  E  +RF IFK+N+  
Sbjct: 10  ILITLFFVISMFNTQTRARSQPKLSVSERHELWMSRHGRVYKDEVEKGERFMIFKENMKF 69

Query: 65  VERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSD-HSSSLKANGTPFL---YKS 120
           +E  N A  GN SY L +N+FAD+T QEF+A  TG  + + + S    + T F       
Sbjct: 70  IESVNKA--GNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPVSSTEFKINDLSD 127

Query: 121 SQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCAT 173
             +P +++W E GAVT VK+QG+C       AV ++EG   I    L+  SEQ+L+DC T
Sbjct: 128 DDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCTT 187

Query: 174 NDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVP 233
           N  N GC GGFM +AF +II+N GI+ ++ Y Y G     C S + +  A QI++Y+ V 
Sbjct: 188 N--NYGCNGGFMTNAFDFIIENGGISRESDYEYLGEQY-TCRS-QEKTAAVQISSYQ-VV 242

Query: 234 PNDEESLLKAVANQPVSVAIDASA-LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKY 292
           P  E SLL+AV  QPVS+ I AS  LQFY+GG ++G C   +NH VTA+GYGT E+G KY
Sbjct: 243 PEGETSLLQAVTKQPVSIGIAASQDLQFYAGGTYDGSCADRINHAVTAIGYGTDEKGQKY 302

Query: 293 WLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFP 331
           WL+KNSWG  WGE+G+ ++ RD   P G C IA  +S+P
Sbjct: 303 WLLKNSWGTSWGENGFMKIIRDSGNPSGLCDIAKMSSYP 341


>gi|297819566|ref|XP_002877666.1| hypothetical protein ARALYDRAFT_906213 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297323504|gb|EFH53925.1| hypothetical protein ARALYDRAFT_906213 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 304

 Score =  265 bits (677), Expect = 2e-68,   Method: Compositional matrix adjust.
 Identities = 149/316 (47%), Positives = 196/316 (62%), Gaps = 31/316 (9%)

Query: 27  EGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFA 86
           E S  EK EQW +++ R Y + +E + RFEIFK NL  VE FN     N +Y L +NKF+
Sbjct: 11  EASAIEKHEQWMSRFNRVYSDDSEKTSRFEIFKKNLKFVESFNMNT--NNTYKLDVNKFS 68

Query: 87  DLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKS-SQVPPSVNWIEKGAVTPVKYQGQC- 144
           DLT +EF A   G  + +  +        F Y++ S+   S++W  +GAVTPVK QGQC 
Sbjct: 69  DLTDEEFQARYMGL-VPEGMTGDSQKTVSFRYENVSETGESMDWRLEGAVTPVKDQGQCG 127

Query: 145 ------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGI 198
                 AVAAVEG+  I    LVSLSEQQLVDC+T +NN GC GG    A+ YI +N+GI
Sbjct: 128 CCWAFAAVAAVEGVTKIANGELVSLSEQQLVDCSTANNNMGCDGGLALTAYDYIKENQGI 187

Query: 199 TNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASAL 258
           T++  Y Y+ +    C S   +  AA I+ YE VP +DEE+LLKAV+             
Sbjct: 188 TSEENYPYQAVQQ-TCKS--TDPAAATISGYEAVPKDDEEALLKAVSQH----------- 233

Query: 259 QFYSGGVF-NGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQ 317
                G+F + YC T  +H VT VGYGTSEEGIKYWL+KNSWG+ WGE+GY R++RD+D+
Sbjct: 234 -----GIFEDEYCGTDSHHAVTIVGYGTSEEGIKYWLLKNSWGESWGENGYMRIKRDVDE 288

Query: 318 PQGQCGIAMFASFPVS 333
           PQG CG+A  A +PV+
Sbjct: 289 PQGMCGLAHRAYYPVA 304


>gi|400180428|gb|AFP73352.1| cysteine protease [Solanum corneliomuelleri]
          Length = 344

 Score =  265 bits (677), Expect = 2e-68,   Method: Compositional matrix adjust.
 Identities = 146/339 (43%), Positives = 207/339 (61%), Gaps = 19/339 (5%)

Query: 5   FLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVA 64
            LI +  +     +Q   R+  + S++E+ E W +++GR YK+  E  +RF IFK+N+  
Sbjct: 10  ILITLFFVISMFNTQTRARSQPKLSVSERHELWMSRHGRVYKDEVEKGERFMIFKENMKF 69

Query: 65  VERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSD-HSSSLKANGTPFL---YKS 120
           +E  N A  GN SY L +N+FAD+T QEF+A  TG  + + + S    + T F       
Sbjct: 70  IESVNKA--GNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEFKINDLSD 127

Query: 121 SQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCAT 173
             +P +++W E GAVT VK+QG+C       AV ++EG   I    L+  SEQ+L+DC T
Sbjct: 128 DDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCTT 187

Query: 174 NDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVP 233
           N  N GC GGFM +AF +II+N GI+ ++ Y Y G     C S + +  A QI++Y+ V 
Sbjct: 188 N--NYGCNGGFMTNAFDFIIENGGISRESDYEYLGEQY-TCRS-QEKTAAVQISSYQ-VV 242

Query: 234 PNDEESLLKAVANQPVSVAIDASA-LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKY 292
           P  E SLL+AV  QPVS+ I AS  LQFY+GG ++G C   +NH VTA+GYGT E+G KY
Sbjct: 243 PEGETSLLQAVTKQPVSIGIAASQDLQFYAGGTYDGSCADRINHAVTAIGYGTDEKGQKY 302

Query: 293 WLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFP 331
           WL+KNSWG  WGE+G+ ++ RD   P G C IA  +S+P
Sbjct: 303 WLLKNSWGTSWGENGFMKIIRDSGNPSGLCDIAKMSSYP 341


>gi|400180393|gb|AFP73335.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score =  265 bits (677), Expect = 2e-68,   Method: Compositional matrix adjust.
 Identities = 147/343 (42%), Positives = 207/343 (60%), Gaps = 27/343 (7%)

Query: 5   FLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVA 64
            LI +  +     +Q   R+  + S++E+ E W +++GR YK+  E  +RF IFK+N+  
Sbjct: 10  ILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKVERFMIFKENMKF 69

Query: 65  VERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDH--------SSSLKANGTPF 116
           +E  N A  GN SY L +N+FAD+T QEF+A  TG  + +         S+ LK N    
Sbjct: 70  IESVNKA--GNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTELKIND--- 124

Query: 117 LYKSSQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLV 169
                 +P +++W E GAVT VK+QG+C       AV ++EG   I    L+  SEQ+L+
Sbjct: 125 -LSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELL 183

Query: 170 DCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNY 229
           DC TN  N GC GGFM +AF +II+N GI+ ++ Y Y G     C S + +  A QI++Y
Sbjct: 184 DCTTN--NYGCNGGFMTNAFDFIIENGGISRESDYEYLGEQY-TCRS-QEKTAAVQISSY 239

Query: 230 EDVPPNDEESLLKAVANQPVSVAIDASA-LQFYSGGVFNGYCETFLNHGVTAVGYGTSEE 288
           + V P  E SLL+AV  QPVS+ I AS  LQFY+GG ++G C   +NH VTA+GYGT E+
Sbjct: 240 K-VVPEGETSLLQAVTKQPVSIGIAASQDLQFYAGGTYDGSCADRINHAVTAIGYGTDEK 298

Query: 289 GIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFP 331
           G KYWL+KNSWG  WGE+G+ ++ RD   P G C IA  +S+P
Sbjct: 299 GQKYWLLKNSWGTSWGENGFMKIIRDSGNPSGLCDIAKMSSYP 341


>gi|400180365|gb|AFP73321.1| cysteine protease [Solanum peruvianum]
 gi|400180395|gb|AFP73336.1| cysteine protease [Solanum peruvianum]
 gi|400180405|gb|AFP73341.1| cysteine protease [Solanum peruvianum]
 gi|400180409|gb|AFP73343.1| cysteine protease [Solanum peruvianum]
 gi|400180411|gb|AFP73344.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score =  265 bits (677), Expect = 2e-68,   Method: Compositional matrix adjust.
 Identities = 146/339 (43%), Positives = 207/339 (61%), Gaps = 19/339 (5%)

Query: 5   FLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVA 64
            LI +  +     +Q   R+  + S++E+ E W +++GR YK+  E  +RF IFK+N+  
Sbjct: 10  ILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFMIFKENMKF 69

Query: 65  VERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSD-HSSSLKANGTPFL---YKS 120
           +E  N A  GN SY L +N+FAD+T QEF+A  TG  + + + S    + T F       
Sbjct: 70  IESVNKA--GNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEFKINDLSD 127

Query: 121 SQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCAT 173
             +P +++W E GAVT VK+QG+C       AV ++EG   I    L+  SEQ+L+DC T
Sbjct: 128 DDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCTT 187

Query: 174 NDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVP 233
           N  N GC GGFM +AF +II+N GI+ ++ Y Y G     C S + +  A QI++Y+ V 
Sbjct: 188 N--NYGCNGGFMTNAFDFIIENGGISRESDYEYLGQQY-TCRS-QEKTAAVQISSYK-VV 242

Query: 234 PNDEESLLKAVANQPVSVAIDASA-LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKY 292
           P  E SLL+AV  QPVS+ I AS  LQFY+GG ++G C   +NH VTA+GYGT E+G KY
Sbjct: 243 PEGETSLLQAVTKQPVSIGIAASQDLQFYAGGTYDGSCADRINHAVTAIGYGTDEKGQKY 302

Query: 293 WLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFP 331
           WL+KNSWG  WGE+G+ ++ RD   P G C IA  +S+P
Sbjct: 303 WLLKNSWGTSWGENGFMKIIRDSGNPSGLCDIAKMSSYP 341


>gi|400180347|gb|AFP73312.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score =  265 bits (677), Expect = 3e-68,   Method: Compositional matrix adjust.
 Identities = 146/339 (43%), Positives = 207/339 (61%), Gaps = 19/339 (5%)

Query: 5   FLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVA 64
            LI +  +     +Q   R+  + S++E+ E W +++GR YK+  E  +RF IFK+N+  
Sbjct: 10  ILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFMIFKENMKF 69

Query: 65  VERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSD-HSSSLKANGTPFL---YKS 120
           +E  N A  GN SY L +N+FAD+T QEF+A  TG  + + + S    + T F       
Sbjct: 70  IESVNKA--GNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEFKINDLSD 127

Query: 121 SQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCAT 173
             +P +++W E GAVT VK+QG+C       AV ++EG   I    L+  SEQ+L+DC T
Sbjct: 128 DDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCTT 187

Query: 174 NDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVP 233
           N  N GC GGFM +AF +II+N GI+ ++ Y Y G     C S + +  A QI++Y+ V 
Sbjct: 188 N--NYGCDGGFMTNAFDFIIENGGISRESDYEYLGQQY-TCRS-QEKTAAVQISSYQ-VV 242

Query: 234 PNDEESLLKAVANQPVSVAIDASA-LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKY 292
           P  E SLL+AV  QPVS+ I AS  LQFY+GG ++G C   +NH VTA+GYGT E+G KY
Sbjct: 243 PEGETSLLQAVTKQPVSIGIAASQDLQFYAGGTYDGSCADRINHAVTAIGYGTDEKGQKY 302

Query: 293 WLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFP 331
           WL+KNSWG  WGE+G+ ++ RD   P G C IA  +S+P
Sbjct: 303 WLLKNSWGTSWGENGFMKIIRDSGNPSGLCDIAKMSSYP 341


>gi|400180426|gb|AFP73351.1| cysteine protease [Solanum corneliomuelleri]
          Length = 344

 Score =  265 bits (677), Expect = 3e-68,   Method: Compositional matrix adjust.
 Identities = 146/339 (43%), Positives = 206/339 (60%), Gaps = 19/339 (5%)

Query: 5   FLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVA 64
            LI +  +     +Q   R+  + S++E+ E W +++GR YK+  E  +RF IFK+N+  
Sbjct: 10  ILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFMIFKENMKF 69

Query: 65  VERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSD-HSSSLKANGTPFL---YKS 120
           +E  N A  GN SY L +N+FAD+T QEF+A  TG  + + + S    + T F       
Sbjct: 70  IESVNKA--GNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEFKINDLSD 127

Query: 121 SQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCAT 173
             +P +++W E GAVT VK+QG+C       AV ++EG   I    L+  SEQ+L+DC T
Sbjct: 128 DDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCTT 187

Query: 174 NDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVP 233
           N  N GC GGFM +AF +II+N GI+ ++ Y Y G     C S + +  A QI++Y+ V 
Sbjct: 188 N--NYGCNGGFMTNAFDFIIENGGISRESDYEYLGQQY-TCRS-QEKTAAVQISSYQ-VV 242

Query: 234 PNDEESLLKAVANQPVSVAIDASA-LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKY 292
           P  E SLL+AV  QPVS+ I AS  LQFY+GG ++G C   +NH VTA+GYGT E G KY
Sbjct: 243 PEGETSLLQAVTKQPVSIGIAASQDLQFYAGGTYDGSCADRINHAVTAIGYGTDENGQKY 302

Query: 293 WLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFP 331
           WL+KNSWG  WGE+G+ ++ RD   P G C IA  +S+P
Sbjct: 303 WLLKNSWGTSWGENGFMKIIRDSGNPSGLCDIAKMSSYP 341


>gi|400180355|gb|AFP73316.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score =  265 bits (677), Expect = 3e-68,   Method: Compositional matrix adjust.
 Identities = 146/339 (43%), Positives = 207/339 (61%), Gaps = 19/339 (5%)

Query: 5   FLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVA 64
            LI +  +     +Q   R+  + S++E+ E W +++GR YK+  E  +RF IFK+N+  
Sbjct: 10  ILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFMIFKENMKF 69

Query: 65  VERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSD-HSSSLKANGTPFL---YKS 120
           +E  N A  GN SY L +N+FAD+T QEF+A  TG  + + + S    + T F       
Sbjct: 70  IESVNKA--GNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEFKINDLSD 127

Query: 121 SQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCAT 173
             +P +++W E GAVT VK+QG+C       AV ++EG   I    L+  SEQ+L+DC T
Sbjct: 128 DDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCTT 187

Query: 174 NDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVP 233
           N  N GC GGFM +AF +II+N GI+ ++ Y Y G     C S + +  A QI++Y+ V 
Sbjct: 188 N--NYGCNGGFMTNAFDFIIENGGISRESDYEYLGEQY-TCRS-QEKTAAVQISSYQ-VV 242

Query: 234 PNDEESLLKAVANQPVSVAIDASA-LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKY 292
           P  E SLL+AV  QPVS+ I AS  LQFY+GG ++G C   +NH VTA+GYGT E+G KY
Sbjct: 243 PEGETSLLQAVTKQPVSIGIAASQDLQFYAGGTYDGSCADRINHAVTAIGYGTDEKGQKY 302

Query: 293 WLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFP 331
           WL+KNSWG  WGE+G+ ++ RD   P G C IA  +S+P
Sbjct: 303 WLLKNSWGTSWGENGFMKIIRDSGNPSGLCDIAKMSSYP 341


>gi|118120|sp|P25249.1|CYSP1_HORVU RecName: Full=Cysteine proteinase EP-B 1; Flags: Precursor
 gi|1146116|gb|AAA85035.1| cysteine proteinase EPB1 precursor [Hordeum vulgare subsp. vulgare]
          Length = 371

 Score =  265 bits (676), Expect = 3e-68,   Method: Compositional matrix adjust.
 Identities = 147/322 (45%), Positives = 196/322 (60%), Gaps = 23/322 (7%)

Query: 27  EGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFA 86
           E ++ + +E+W++ + R  +  AE  +RF  FK N   +   N    G+  Y L LN+F 
Sbjct: 39  EEALWDLYERWQSAH-RVRRHHAEKHRRFGTFKSNAHFIHSHNKR--GDHPYRLHLNRFG 95

Query: 87  DLTPQEFIASQTGFKMSDHSSSLKANGTP-FLYKS---SQVPPSVNWIEKGAVTPVKYQG 142
           D+   EF A+  G    D  +  K    P F+Y +   S +PPSV+W +KGAVT VK QG
Sbjct: 96  DMDQAEFRATFVGDLRRD--TPAKPPSVPGFMYAALNVSDLPPSVDWRQKGAVTGVKDQG 153

Query: 143 QCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQN 195
           +C        V +VEGINAI+   LVSLSEQ+L+DC T DN+ GC GG MD+AF+YI  N
Sbjct: 154 KCGSCWAFSTVVSVEGINAIRTGSLVSLSEQELIDCDTADND-GCQGGLMDNAFEYIKNN 212

Query: 196 KGITNDAVYSYEGMSTGICDSIKAEDHA---AQITNYEDVPPNDEESLLKAVANQPVSVA 252
            G+  +A Y Y   + G C+  +A  ++     I  ++DVP N EE L +AVANQPVSVA
Sbjct: 213 GGLITEAAYPYRA-ARGTCNVARAAQNSPVVVHIDGHQDVPANSEEDLARAVANQPVSVA 271

Query: 253 IDAS--ALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFR 310
           ++AS  A  FYS GVF G C T L+HGV  VGYG +E+G  YW +KNSWG  WGE GY R
Sbjct: 272 VEASGKAFMFYSEGVFTGDCGTELDHGVAVVGYGVAEDGKAYWTVKNSWGPSWGEQGYIR 331

Query: 311 LQRDIDQPQGQCGIAMFASFPV 332
           +++D     G CGIAM AS+PV
Sbjct: 332 VEKDSGASGGLCGIAMEASYPV 353


>gi|326507362|dbj|BAK03074.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 473

 Score =  265 bits (676), Expect = 3e-68,   Method: Compositional matrix adjust.
 Identities = 142/317 (44%), Positives = 198/317 (62%), Gaps = 18/317 (5%)

Query: 34  FEQWKAQYGRTYKESA----ENSKRFEIFKDNLVAVERFN-NAAIGNRSYTLRLNKFADL 88
           ++ W A++G     +A    +  +RF  F DNL  V+  N  AA G   + L +N+FADL
Sbjct: 52  YDLWLAEHGGGSSPNANSIADRERRFSAFWDNLRFVDAHNARAAAGEEGFRLAMNRFADL 111

Query: 89  TPQEFIASQTGFK-MSDHSSSLKANGTPFLYKSSQ-VPPSVNWIEKGAVTPVKYQGQC-- 144
           T  EF A+  G K  ++ + + +  G  + +  ++ +P +V+W EKGAV PVK QGQC  
Sbjct: 112 TNDEFRAAYLGVKGAAERNRAGRVVGERYRHDGAEELPEAVDWREKGAVAPVKNQGQCGS 171

Query: 145 -----AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGIT 199
                AV+ VE IN I    +V+LSEQ+LV+C  N  ++GC GG MDDAF++II+N GI 
Sbjct: 172 CWAFSAVSTVESINQIVTGEMVTLSEQELVECDINGQSSGCNGGLMDDAFEFIIKNGGID 231

Query: 200 NDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA-- 257
            +  Y Y+ +  G CD ++       I  +EDVP NDE+SL KAVA+ PVSVAI+A    
Sbjct: 232 TEDDYPYKAVD-GRCDVLRKNAKVVSIDGFEDVPENDEKSLQKAVAHHPVSVAIEAGGRE 290

Query: 258 LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQ 317
            Q Y  GVF+G C T L+HGV AVGYGT E G  YW+++NSWG +WGE GY R++R+I+ 
Sbjct: 291 FQLYHSGVFSGRCGTQLDHGVVAVGYGT-ENGKDYWIVRNSWGPNWGEAGYLRMERNINV 349

Query: 318 PQGQCGIAMFASFPVSK 334
             G+CGIAM +S+P  K
Sbjct: 350 TSGKCGIAMMSSYPTKK 366


>gi|204307508|gb|ACI00280.1| triticain beta 2 [Hordeum vulgare]
          Length = 473

 Score =  265 bits (676), Expect = 3e-68,   Method: Compositional matrix adjust.
 Identities = 142/317 (44%), Positives = 198/317 (62%), Gaps = 18/317 (5%)

Query: 34  FEQWKAQYGRTYKESA----ENSKRFEIFKDNLVAVERFN-NAAIGNRSYTLRLNKFADL 88
           ++ W A++G     +A    +  +RF  F DNL  V+  N  AA G   + L +N+FADL
Sbjct: 52  YDLWLAEHGGGSSPNANSIADRERRFSAFWDNLRFVDAHNARAAAGEEGFRLAMNRFADL 111

Query: 89  TPQEFIASQTGFK-MSDHSSSLKANGTPFLYKSSQ-VPPSVNWIEKGAVTPVKYQGQC-- 144
           T  EF A+  G K  ++ + + +  G  + +  ++ +P +V+W EKGAV PVK QGQC  
Sbjct: 112 TNDEFRAAYLGVKGAAERNRAGRVVGERYRHDGAEELPEAVDWREKGAVAPVKNQGQCGS 171

Query: 145 -----AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGIT 199
                AV+ VE IN I    +V+LSEQ+LV+C  N  ++GC GG MDDAF++II+N GI 
Sbjct: 172 CWAFSAVSTVESINQIVTGEMVTLSEQELVECDINGQSSGCNGGLMDDAFEFIIKNGGID 231

Query: 200 NDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA-- 257
            +  Y Y+ +  G CD ++       I  +EDVP NDE+SL KAVA+ PVSVAI+A    
Sbjct: 232 TEDDYPYKAVD-GRCDVLRKNAKVVSIDGFEDVPENDEKSLQKAVAHHPVSVAIEAGGRE 290

Query: 258 LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQ 317
            Q Y  GVF+G C T L+HGV AVGYGT E G  YW+++NSWG +WGE GY R++R+I+ 
Sbjct: 291 FQLYHSGVFSGRCGTQLDHGVVAVGYGT-ENGKDYWIVRNSWGPNWGEAGYLRMERNINV 349

Query: 318 PQGQCGIAMFASFPVSK 334
             G+CGIAM +S+P  K
Sbjct: 350 TSGKCGIAMMSSYPTKK 366


>gi|400180389|gb|AFP73333.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score =  264 bits (675), Expect = 4e-68,   Method: Compositional matrix adjust.
 Identities = 146/339 (43%), Positives = 207/339 (61%), Gaps = 19/339 (5%)

Query: 5   FLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVA 64
            LI +  +     +Q   R+  + S++E+ E W +++GR YK+  E  +RF IFK+N+  
Sbjct: 10  ILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFMIFKENMKF 69

Query: 65  VERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSD-HSSSLKANGTPFL---YKS 120
           +E  N A  GN SY L +N+FAD+T QEF+A  TG  + + + S    + T F       
Sbjct: 70  IESVNKA--GNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEFKINDLSD 127

Query: 121 SQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCAT 173
             +P +++W E GAVT VK+QG+C       AV ++EG   I    L+  SEQ+L+DC T
Sbjct: 128 DDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCTT 187

Query: 174 NDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVP 233
           N  N GC GGFM +AF +II+N GI+ ++ Y Y G     C S + +  A QI++Y+ V 
Sbjct: 188 N--NYGCNGGFMTNAFDFIIENGGISRESDYEYLGEQY-TCRS-QEKTAAVQISSYK-VV 242

Query: 234 PNDEESLLKAVANQPVSVAIDASA-LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKY 292
           P  E SLL+AV  QPVS+ I AS  LQFY+GG ++G C   +NH VTA+GYGT E+G KY
Sbjct: 243 PEGETSLLQAVTKQPVSIGIAASQDLQFYAGGTYDGSCADRINHAVTAIGYGTDEKGQKY 302

Query: 293 WLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFP 331
           WL+KNSWG  WGE+G+ ++ RD   P G C IA  +S+P
Sbjct: 303 WLLKNSWGTSWGENGFMKIIRDSGNPSGLCDIAKMSSYP 341


>gi|317106666|dbj|BAJ53169.1| JHL18I08.3 [Jatropha curcas]
          Length = 441

 Score =  264 bits (675), Expect = 4e-68,   Method: Compositional matrix adjust.
 Identities = 152/313 (48%), Positives = 185/313 (59%), Gaps = 17/313 (5%)

Query: 30  IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLT 89
           IA  FE W  Q+G+TY    E   R ++F+DN   V   N+   GN SYTL LN FADLT
Sbjct: 26  IAHLFETWCQQHGKTYASQEEKLFRLKVFQDNYDFVTEHNSQ--GNSSYTLSLNAFADLT 83

Query: 90  PQEFIASQTGFKMSDHSSSLKANGTPFLYKS--SQVPPSVNWIEKGAVTPVKYQGQC--- 144
             EF AS+ G   S  S+SL  + +        + VP SV+W + GAVT VK QG C   
Sbjct: 84  HHEFKASRLGLS-SAASASLNVDRSNRQIPDFVADVPASVDWRKNGAVTQVKDQGNCGAC 142

Query: 145 ----AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITN 200
               A  A+EGIN I    LVSLSEQ+LVDC     NNGC GG MD AF+++I N GI  
Sbjct: 143 WSFSATGAIEGINKIVTGSLVSLSEQELVDC-DKSYNNGCEGGIMDYAFQFVIDNHGIDT 201

Query: 201 DAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDAS--AL 258
           +  Y Y+G     C+  K + H   I  Y DVP N+E+ LLKAVANQPVSV I  S  A 
Sbjct: 202 EEDYPYQGRDRS-CNKEKLKRHVVTIDGYVDVPQNNEKELLKAVANQPVSVGICGSERAF 260

Query: 259 QFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQP 318
           Q YS G+F G C T L+H V  VGYG SE G+ YW++KNSWG  WG DGY  +QR+    
Sbjct: 261 QLYSKGIFTGPCSTSLDHAVLIVGYG-SENGVDYWIVKNSWGSYWGMDGYMHMQRNSGSS 319

Query: 319 QGQCGIAMFASFP 331
           +G CGI M AS+P
Sbjct: 320 RGLCGINMLASYP 332


>gi|400180435|gb|AFP73355.1| cysteine protease [Solanum pennellii]
          Length = 344

 Score =  264 bits (675), Expect = 4e-68,   Method: Compositional matrix adjust.
 Identities = 147/339 (43%), Positives = 205/339 (60%), Gaps = 19/339 (5%)

Query: 5   FLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVA 64
            LI +  +     +Q   R+  + S++E+ E W +++GR YK+  E  +RF IFK+N+  
Sbjct: 10  ILITLFFVISMFNTQTRARSQPKLSVSERHELWMSRHGRVYKDEVEKGERFMIFKENMKF 69

Query: 65  VERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSD-HSSSLKANGTPFL---YKS 120
           +E  N A  GN SY L +N+FAD+T QEF+A  TG  + + + S    + T F       
Sbjct: 70  IESVNKA--GNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYVSPSPMSSTEFKINDLSD 127

Query: 121 SQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCAT 173
             +P +++W E GAVT VK QGQC       AV ++EG   I    L+  SEQ+L+DC T
Sbjct: 128 DDMPSNLDWRESGAVTQVKNQGQCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCTT 187

Query: 174 NDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVP 233
           N  N GC GGFM +AF +I +N GI+ ++ Y Y G     C S + +  A QI++Y+ V 
Sbjct: 188 N--NYGCNGGFMTNAFDFIKENGGISRESDYEYLGQQY-TCRS-QEKTAAVQISSYQ-VV 242

Query: 234 PNDEESLLKAVANQPVSVAIDASA-LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKY 292
           P  E SLL+AV  QPVS+ I AS  LQFY+GG ++G C   +NH VTA+GYGT E+G KY
Sbjct: 243 PEGETSLLQAVTKQPVSIGIAASQDLQFYAGGTYDGSCANRINHAVTAIGYGTDEKGQKY 302

Query: 293 WLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFP 331
           WL+KNSWG  WGEDG+ ++ RD   P G C IA  +S+P
Sbjct: 303 WLLKNSWGTSWGEDGFMKIIRDSGNPAGLCDIAKVSSYP 341


>gi|400180417|gb|AFP73347.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score =  264 bits (675), Expect = 4e-68,   Method: Compositional matrix adjust.
 Identities = 147/339 (43%), Positives = 205/339 (60%), Gaps = 19/339 (5%)

Query: 5   FLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVA 64
            LI V  +     +Q   R+  + S++E+ E W +++GR YK+  E  +RF IFK+N+  
Sbjct: 10  ILITVFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFMIFKENMKF 69

Query: 65  VERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANG-TPFL---YKS 120
           +E  N A  GN SY L +N+FAD+T QEF+A  TG  + +   S      T F       
Sbjct: 70  IESVNKA--GNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPLSSTEFKINDLSD 127

Query: 121 SQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCAT 173
             +P +++W E GAVT VK+QG+C       AV ++EG   I    L+  SEQ+L+DC T
Sbjct: 128 DDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCTT 187

Query: 174 NDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVP 233
           N  N GC GGFM +AF +II+N GI+ ++ Y Y G     C S + +  A QI++Y+ V 
Sbjct: 188 N--NYGCNGGFMTNAFDFIIENGGISRESDYEYLGQQY-TCRS-QEKTAAVQISSYK-VV 242

Query: 234 PNDEESLLKAVANQPVSVAIDASA-LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKY 292
           P  E SLL+AV  QPVS+ I AS  LQFY+GG ++G C   +NH VTA+GYGT E+G KY
Sbjct: 243 PEGETSLLQAVTKQPVSIGIAASQDLQFYAGGTYDGSCADRINHAVTAIGYGTDEKGQKY 302

Query: 293 WLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFP 331
           WL+KNSWG  WGE+G+ ++ RD   P G C IA  +S+P
Sbjct: 303 WLLKNSWGTSWGENGFMKIIRDSGNPSGLCDIAKMSSYP 341


>gi|359359120|gb|AEV41026.1| putative cysteine protease [Oryza minuta]
          Length = 464

 Score =  264 bits (675), Expect = 4e-68,   Method: Compositional matrix adjust.
 Identities = 146/300 (48%), Positives = 182/300 (60%), Gaps = 20/300 (6%)

Query: 49  AENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSS 108
            E  +RF +F DNL  V+  N  A  +  + L +N+FADLT  EF A+  G   +     
Sbjct: 85  GEYERRFRVFWDNLKFVDAHNAHADEHGGFRLGMNRFADLTNDEFRAAYLGTTPAGRGRH 144

Query: 109 LKANGTPFLYKSSQV---PPSVNWIEKGAV-TPVKYQGQC-------AVAAVEGINAIKI 157
           +       +Y+   V   P SV+W +KGAV +PVK QGQC       AVAAVEGIN I  
Sbjct: 145 VGE-----MYRHDGVEALPDSVDWRDKGAVVSPVKNQGQCGSCWAFSAVAAVEGINKIVT 199

Query: 158 NRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSI 217
             LVSLSEQ+LV+CA N  N+GC GG MDDAF +I +N G+  +  Y Y  M  G CD  
Sbjct: 200 GELVSLSEQELVECARNRGNSGCNGGIMDDAFAFITRNGGLDTEEDYPYTAMD-GKCDLA 258

Query: 218 KAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA--LQFYSGGVFNGYCETFLN 275
           K       I  +EDVP NDE SL KAVA+QPVSVAIDA     Q Y  GVF G C T L+
Sbjct: 259 KKSRKVVSIDGFEDVPENDELSLQKAVAHQPVSVAIDAGGREFQLYDSGVFTGRCGTSLD 318

Query: 276 HGVTAVGYGT-SEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVSK 334
           HGV AVGYGT +  G  YW ++NSWG DWGE+GY R++R++    G+CGIAM AS+P+ K
Sbjct: 319 HGVVAVGYGTDAATGTDYWTVRNSWGPDWGENGYIRMERNVTARTGKCGIAMMASYPIKK 378


>gi|194352754|emb|CAQ00105.1| papain-like cysteine proteinase [Hordeum vulgare subsp. vulgare]
 gi|326513690|dbj|BAJ87864.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326514532|dbj|BAJ96253.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 463

 Score =  264 bits (675), Expect = 4e-68,   Method: Compositional matrix adjust.
 Identities = 150/329 (45%), Positives = 199/329 (60%), Gaps = 22/329 (6%)

Query: 18  SQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAA-IGNR 76
           S  +Y    E  +   + +W A++G TY    E  +RFE F+DNL  +++ N AA  G  
Sbjct: 27  SIVSYGERSEEEVRRMYAEWMAEHGSTYNAIGEEERRFEAFRDNLRYIDQHNAAADAGVH 86

Query: 77  SYTLRLNKFADLTPQEFIASQTGFKMS-DHSSSLKANGTPFLYKSS---QVPPSVNWIEK 132
           S+ L LN+FADLT +E+ ++  G +   D    L A      Y+++   ++P SV+W +K
Sbjct: 87  SFRLGLNRFADLTNEEYRSTYLGARTKPDRERKLSAR-----YQAADNDELPESVDWRKK 141

Query: 133 GAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFM 185
           GAV  VK QG C       A+AAVEGIN I    ++ LSEQ+LVDC T+  N GC GG M
Sbjct: 142 GAVGAVKDQGGCGSCWAFSAIAAVEGINQIVTGDMIPLSEQELVDCDTS-YNQGCNGGLM 200

Query: 186 DDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVA 245
           D AF++II N GI ++  Y Y+      CD+ K       I  YEDVP N E+SL KAVA
Sbjct: 201 DYAFEFIINNGGIDSEEDYPYKERDN-RCDANKKNAKVVTIDGYEDVPVNSEKSLQKAVA 259

Query: 246 NQPVSVAIDAS--ALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDW 303
           NQP+SVAI+A   A Q Y  G+F G C T L+HGV AVGYGT E G  YWL++NSWG  W
Sbjct: 260 NQPISVAIEAGGRAFQLYKSGIFTGTCGTALDHGVAAVGYGT-ENGKDYWLVRNSWGSVW 318

Query: 304 GEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
           GEDGY R++R+I    G+CGIA+  S+P 
Sbjct: 319 GEDGYIRMERNIKASSGKCGIAVEPSYPT 347


>gi|400180345|gb|AFP73311.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score =  264 bits (675), Expect = 5e-68,   Method: Compositional matrix adjust.
 Identities = 146/339 (43%), Positives = 206/339 (60%), Gaps = 19/339 (5%)

Query: 5   FLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVA 64
            LI +  +     +Q   R+  + S++E+ E W +++GR YK+  E  +RF IFK+N+  
Sbjct: 10  ILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFMIFKENMKF 69

Query: 65  VERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSD-HSSSLKANGTPFL---YKS 120
           +E  N A  GN SY L +N+FAD+T QEF+A  TG  + + + S    + T F       
Sbjct: 70  IESVNKA--GNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEFKINDLSD 127

Query: 121 SQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCAT 173
             +P +++W E GAVT VK+QG+C       AV ++EG   I    L+  SEQ+L+DC T
Sbjct: 128 DDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCTT 187

Query: 174 NDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVP 233
           N  N GC GGFM +AF +II+N GI+ ++ Y Y G     C S + +  A QI++Y+ V 
Sbjct: 188 N--NYGCDGGFMTNAFDFIIENGGISRESDYEYLGQQY-TCRS-QEKTAAVQISSYQ-VV 242

Query: 234 PNDEESLLKAVANQPVSVAIDASA-LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKY 292
           P  E SLL+AV  QPVS+ I AS  LQFY+GG ++G C   +NH VTA+GYGT E G KY
Sbjct: 243 PEGETSLLQAVTKQPVSIGIAASQDLQFYAGGTYDGSCADRINHAVTAIGYGTDENGQKY 302

Query: 293 WLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFP 331
           WL+KNSWG  WGE+G+ ++ RD   P G C IA  +S+P
Sbjct: 303 WLLKNSWGTSWGENGFMKIIRDYGNPAGLCDIAKMSSYP 341


>gi|400180445|gb|AFP73359.1| cysteine protease, partial [Solanum chilense]
          Length = 345

 Score =  264 bits (674), Expect = 5e-68,   Method: Compositional matrix adjust.
 Identities = 146/339 (43%), Positives = 207/339 (61%), Gaps = 19/339 (5%)

Query: 5   FLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVA 64
            LI +  +     +Q   R+  E S++E+ E W +++GR YK+  E  +RF IFK+N+  
Sbjct: 10  ILITLFFVISMFNTQTRGRSQPELSVSERHELWMSRHGRVYKDEVEKGERFMIFKENMKF 69

Query: 65  VERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSD-HSSSLKANGTPFL---YKS 120
           +E  N A  GN SY L +N+FAD+T QEF+A  TG  + + + S    + T F       
Sbjct: 70  IESVNKA--GNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEFKINDLSD 127

Query: 121 SQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCAT 173
             +P +++W E GAVT VK+QG+C       AV ++EG   I    L+  SEQ+L+DC T
Sbjct: 128 DDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCTT 187

Query: 174 NDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVP 233
           N  N GC GGFM +AF +I +N GI++++ Y Y G     C S + +  A QI++Y+ V 
Sbjct: 188 N--NYGCDGGFMTNAFDFIKENGGISSESDYEYLGQQY-TCRS-QEKTAAVQISSYQ-VV 242

Query: 234 PNDEESLLKAVANQPVSVAIDASA-LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKY 292
           P  E SLL+AV  QPVS+ I AS  LQFY+GG ++G C   +NH VTA+GYGT E+G KY
Sbjct: 243 PEGETSLLQAVTKQPVSIGIAASQDLQFYAGGTYDGSCADRINHAVTAIGYGTDEKGQKY 302

Query: 293 WLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFP 331
           WL+KNSWG  WGE+G+ ++ RD   P G C IA  +S+P
Sbjct: 303 WLLKNSWGTSWGENGFMKIIRDSGDPSGLCDIAKMSSYP 341


>gi|400180451|gb|AFP73362.1| cysteine protease [Solanum chilense]
          Length = 344

 Score =  264 bits (674), Expect = 5e-68,   Method: Compositional matrix adjust.
 Identities = 146/339 (43%), Positives = 207/339 (61%), Gaps = 19/339 (5%)

Query: 5   FLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVA 64
            LI +  +     +Q   R+  E S++E+ E W +++GR YK+  E  +RF IFK+N+  
Sbjct: 10  ILITLFFVISMFNTQTRGRSQPELSVSERHELWMSRHGRVYKDEVEKGERFMIFKENMKF 69

Query: 65  VERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSD-HSSSLKANGTPFL---YKS 120
           +E  N A  GN SY L +N+FAD+T QEF+A  TG  + + + S    + T F       
Sbjct: 70  IESVNKA--GNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEFKINDLSD 127

Query: 121 SQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCAT 173
             +P +++W E GAVT VK+QG+C       AV ++EG   I    L+  SEQ+L+DC T
Sbjct: 128 DDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCTT 187

Query: 174 NDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVP 233
           N  N GC GGFM +AF +I +N GI++++ Y Y G     C S + +  A QI++Y+ V 
Sbjct: 188 N--NYGCDGGFMTNAFDFIKENGGISSESDYEYLGQQY-TCRS-QEKTAAVQISSYQ-VV 242

Query: 234 PNDEESLLKAVANQPVSVAIDASA-LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKY 292
           P  E SLL+AV  QPVS+ I AS  LQFY+GG ++G C   +NH VTA+GYGT E+G KY
Sbjct: 243 PEGETSLLQAVTKQPVSIGIAASQDLQFYAGGTYDGSCADRINHAVTAIGYGTDEKGQKY 302

Query: 293 WLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFP 331
           WL+KNSWG  WGE+G+ ++ RD   P G C IA  +S+P
Sbjct: 303 WLLKNSWGTSWGENGFMKIIRDSGDPSGLCDIAKMSSYP 341


>gi|225438807|ref|XP_002283263.1| PREDICTED: germination-specific cysteine protease 1-like isoform 1
           [Vitis vinifera]
          Length = 374

 Score =  264 bits (674), Expect = 5e-68,   Method: Compositional matrix adjust.
 Identities = 153/349 (43%), Positives = 207/349 (59%), Gaps = 22/349 (6%)

Query: 5   FLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVA 64
            L +  + S +    +++R+ +E  +   ++ W A++G+ Y    E  KRFEIFKDNL  
Sbjct: 19  LLFLFFVASSAADLSSSWRSEEE--VMGMYQWWMAKHGKAYNGLGEKEKRFEIFKDNLKF 76

Query: 65  VERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLK-ANGTPF--LYKSS 121
           ++  N     NR+Y + LN+FADLT +E+ A   G +        K  N +P   +    
Sbjct: 77  IDEHNAQ---NRTYKVGLNRFADLTNEEYRAIYLGTRSDPKRRFAKLKNASPRYAVMPGE 133

Query: 122 QVPPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATN 174
            +P SV+W E GAV PVK Q  C        VAAVEGIN I    L+SLSEQ+LVDC T 
Sbjct: 134 VLPESVDWRETGAVNPVKDQRSCGSCWAFSTVAAVEGINQIVTGELISLSEQELVDCDT- 192

Query: 175 DNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPP 234
           + + GC GG MD AF +II+N G+  +  Y Y G   G C+          I  YEDVPP
Sbjct: 193 EYDMGCNGGLMDYAFDFIIKNGGLDTEKDYPYTGFD-GECNLSGKSSKVVSIDGYEDVPP 251

Query: 235 NDEESLLKAVANQPVSVAIDAS--ALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKY 292
            DE++L KAVA+QPVSVA++A   ALQ Y  G+F G C T L+HG+ AVGYGT E G  Y
Sbjct: 252 FDEKALQKAVAHQPVSVAVEAGGRALQLYVSGIFTGECGTALDHGIVAVGYGT-ENGTDY 310

Query: 293 WLIKNSWGQDWGEDGYFRLQRDI-DQPQGQCGIAMFASFPVSKESAQPS 340
           W+++NSWG  WGE+GY R++R++ D   G+CGIAM AS+P+ K    PS
Sbjct: 311 WIVRNSWGSSWGENGYIRMERNMADAFSGKCGIAMEASYPI-KNGENPS 358


>gi|400180367|gb|AFP73322.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score =  264 bits (674), Expect = 5e-68,   Method: Compositional matrix adjust.
 Identities = 146/339 (43%), Positives = 207/339 (61%), Gaps = 19/339 (5%)

Query: 5   FLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVA 64
            LI +  +     +Q   R+  + S++E+ E W +++GR YK+  E  +RF IFK+N+  
Sbjct: 10  ILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKVERFMIFKENMKF 69

Query: 65  VERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSD-HSSSLKANGTPFL---YKS 120
           +E  N A  GN SY L +N+FAD+T QEF+A  TG  + + + S    + T F       
Sbjct: 70  IESVNKA--GNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEFKINDLSD 127

Query: 121 SQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCAT 173
             +P +++W E GAVT VK+QG+C       AV ++EG   I    L+  SEQ+L+DC T
Sbjct: 128 DDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCTT 187

Query: 174 NDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVP 233
           N  N GC GGFM +AF +II+N GI+ ++ Y Y G     C S + +  A QI++Y+ V 
Sbjct: 188 N--NYGCNGGFMTNAFDFIIENGGISRESDYEYLGEQY-TCRS-QEKTAAVQISSYK-VV 242

Query: 234 PNDEESLLKAVANQPVSVAIDASA-LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKY 292
           P  E SLL+AV  QPVS+ I AS  LQFY+GG ++G C   +NH VTA+GYGT E+G KY
Sbjct: 243 PEGETSLLQAVTKQPVSIGIAASQDLQFYAGGTYDGSCADRINHAVTAIGYGTDEKGQKY 302

Query: 293 WLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFP 331
           WL+KNSWG  WGE+G+ ++ RD   P G C IA  +S+P
Sbjct: 303 WLLKNSWGTSWGENGFMKIIRDYGNPAGLCDIAKMSSYP 341


>gi|359359118|gb|AEV41024.1| putative oryzain beta chain precursor [Oryza minuta]
          Length = 493

 Score =  264 bits (674), Expect = 6e-68,   Method: Compositional matrix adjust.
 Identities = 145/341 (42%), Positives = 195/341 (57%), Gaps = 47/341 (13%)

Query: 34  FEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEF 93
           ++ W A+ GR+Y    E  +RF +F DNL  V+  N  A  +  + L +N+FADLT  EF
Sbjct: 49  YDLWLAENGRSYNALGERERRFRVFWDNLKFVDAHNARADEHGGFRLGMNRFADLTNDEF 108

Query: 94  IASQTGFKMSDHSSSLKANGTPFLYKS-SQVPPSVNWIEKGAVTPVKYQGQC-------- 144
            A+  G K  + S   +A G  + +    ++P SV+W EKGAV PVK QGQC        
Sbjct: 109 RATFLGAKFVERS---RAAGERYRHDGVEELPESVDWREKGAVAPVKNQGQCVDRIIVWN 165

Query: 145 -------------------------------AVAAVEGINAIKINRLVSLSEQQLVDCAT 173
                                          AV+ VE IN +    +++LSEQ+LV+C+T
Sbjct: 166 SMVRIYVVDAGCMLENPLMGLTVQGSCWAFSAVSTVESINQLVTGEMITLSEQELVECST 225

Query: 174 NDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVP 233
           N  N+GC GG MDDAF +II+N GI  +  Y Y+ +  G CD  +       I  +EDVP
Sbjct: 226 NGQNSGCNGGLMDDAFDFIIKNGGIDTEDDYPYKAVD-GKCDINRENAKVVSIDGFEDVP 284

Query: 234 PNDEESLLKAVANQPVSVAIDASA--LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIK 291
            NDE+SL KAVA+QPVSVAI+A     Q Y  GVF+G C T L+HGV AVGYGT + G  
Sbjct: 285 QNDEKSLQKAVAHQPVSVAIEAGGREFQLYHSGVFSGRCGTSLDHGVVAVGYGT-DNGKD 343

Query: 292 YWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
           YW+++NSWG  WGE GY R++R+I+   G+CGIAM AS+P 
Sbjct: 344 YWIVRNSWGPKWGESGYVRMERNINATTGKCGIAMMASYPT 384


>gi|20334373|gb|AAM19207.1|AF493232_1 cysteine protease [Solanum pimpinellifolium]
 gi|400180424|gb|AFP73350.1| cysteine protease [Solanum pimpinellifolium]
 gi|400180433|gb|AFP73354.1| cysteine protease [Solanum lycopersicum]
          Length = 344

 Score =  263 bits (673), Expect = 6e-68,   Method: Compositional matrix adjust.
 Identities = 147/343 (42%), Positives = 206/343 (60%), Gaps = 27/343 (7%)

Query: 5   FLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVA 64
            LI +  +     +Q   R+  + S++E+ E W +++GR YK+  E  +RF IFK+N+  
Sbjct: 10  ILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFMIFKENMKF 69

Query: 65  VERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDH--------SSSLKANGTPF 116
           +E  N A  GN SY L +N+FAD+T QEF+A  TG  + +         S+  K N    
Sbjct: 70  IESVNKA--GNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEFKINDLSD 127

Query: 117 LYKSSQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLV 169
            Y    +P +++W E GAVT VK+QG+C       AV ++EG   I    L+  SEQ+L+
Sbjct: 128 DY----MPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELL 183

Query: 170 DCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNY 229
           DC TN  N GC GG M +AF +II+N GI+ ++ Y Y G     C S + +  A QI++Y
Sbjct: 184 DCTTN--NYGCNGGLMTNAFDFIIENGGISRESDYEYLGEQY-TCRS-REKTAAVQISSY 239

Query: 230 EDVPPNDEESLLKAVANQPVSVAIDASA-LQFYSGGVFNGYCETFLNHGVTAVGYGTSEE 288
           + V P  E SLL+AV  QPVS+ I AS  LQFY+GG ++G C   +NH VTA+GYGT EE
Sbjct: 240 K-VVPEGETSLLQAVTKQPVSIGIAASQDLQFYAGGTYDGNCADQINHAVTAIGYGTDEE 298

Query: 289 GIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFP 331
           G KYWL+KNSWG  WGE+G+ ++ RD   P G C IA  +S+P
Sbjct: 299 GQKYWLLKNSWGTSWGENGFMKIIRDSGDPSGLCDIAKMSSYP 341


>gi|30141025|dbj|BAC75926.1| cysteine protease-4 [Helianthus annuus]
          Length = 352

 Score =  263 bits (673), Expect = 6e-68,   Method: Compositional matrix adjust.
 Identities = 146/316 (46%), Positives = 192/316 (60%), Gaps = 18/316 (5%)

Query: 30  IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLT 89
           +   FE W A++ + Y+   E   RFEIF DNL  ++  N       +Y L LN+FADLT
Sbjct: 45  VIHLFESWLAKHSKIYESLDEKLHRFEIFMDNLKHIDDTNKKV---SNYWLGLNEFADLT 101

Query: 90  PQEFIASQTGFKMSDHSSSLKANGTPFLYKS-SQVPPSVNWIEKGAVTPVKYQGQCA--- 145
            +EF     G K  +       +   F Y+    +P SV+W +KGAV PVK QGQC    
Sbjct: 102 HEEFKNKFLGLK-GELPERKDESIEEFSYRDFVDLPKSVDWRKKGAVAPVKNQGQCGSCW 160

Query: 146 ----VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITND 201
               VAAVEGIN I    L  LSEQ+L+DC T   NNGC GG MD AF Y++++ G+  +
Sbjct: 161 AFSTVAAVEGINQIVTGNLTMLSEQELIDCDTT-FNNGCNGGLMDYAFAYVMRS-GLHKE 218

Query: 202 AVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA--LQ 259
             Y Y  MS G CD  K       I+ Y DVP N+E+S LKA+ANQP+SVAI+AS    Q
Sbjct: 219 EEYPYI-MSEGTCDEKKDVSETVTISGYHDVPRNNEDSFLKALANQPISVAIEASGRDFQ 277

Query: 260 FYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQ 319
           FYSGGVF+G+C T L+HGV AVGYGT++ G+ Y +++NSWG  WGE GY R++R   +P 
Sbjct: 278 FYSGGVFDGHCGTELDHGVAAVGYGTTK-GLDYVIVRNSWGPKWGEKGYIRMKRKTGKPH 336

Query: 320 GQCGIAMFASFPVSKE 335
           G CG+ M AS+P  ++
Sbjct: 337 GMCGLYMMASYPTKQK 352


>gi|350538043|ref|NP_001234324.1| cysteine protease TDI-65 precursor [Solanum lycopersicum]
 gi|5726641|gb|AAD48496.1|AF172856_1 cysteine protease TDI-65 [Solanum lycopersicum]
 gi|2828252|emb|CAA05894.1| CYP1 [Solanum lycopersicum]
          Length = 466

 Score =  263 bits (673), Expect = 6e-68,   Method: Compositional matrix adjust.
 Identities = 145/347 (41%), Positives = 209/347 (60%), Gaps = 24/347 (6%)

Query: 5   FLIVVLIISGSCASQATYRTFDEGSIAEK--------FEQWKAQYGRTYKESAENSKRFE 56
            L++++  + S AS  +  ++DE  I  +        +E W  ++G++Y    E  KRF+
Sbjct: 12  ILLMLIFSTLSSASDMSIISYDETHIHRRTDDEVSALYESWLIEHGKSYNALGEKDKRFQ 71

Query: 57  IFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTP- 115
           IFKDNL  ++  N  ++ N+SY L L KFADLT +E+ +   G K S     L  N +  
Sbjct: 72  IFKDNLRYIDEQN--SVPNQSYKLGLTKFADLTNEEYRSIYLGTKSSGDRKKLSKNKSDR 129

Query: 116 FLYK-SSQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQ 167
           +L K    +P S++W EKG +  VK QG C       AVAA+E INAI    L+SLSEQ+
Sbjct: 130 YLPKVGDSLPESIDWREKGVLVGVKDQGSCGSCWAFSAVAAMESINAIVTGNLISLSEQE 189

Query: 168 LVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQIT 227
           LVDC     N GC GG MD AF+++I+N GI  +  Y Y+    G+CD  +      +I 
Sbjct: 190 LVDC-DRSYNEGCDGGLMDYAFEFVIKNGGIDTEEDYPYK-ERNGVCDQYRKNAKVVKID 247

Query: 228 NYEDVPPNDEESLLKAVANQPVSVAIDASA--LQFYSGGVFNGYCETFLNHGVTAVGYGT 285
           +YEDVP N+E++L KAVA+QPVS+A++A     Q Y  G+F G C T ++HGV   GYGT
Sbjct: 248 SYEDVPVNNEKALQKAVAHQPVSIALEAGGRDFQHYKSGIFTGKCGTAVDHGVVIAGYGT 307

Query: 286 SEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
            E G+ YW+++NSWG +WGE+GY R+QR++    G CG+A+  S+PV
Sbjct: 308 -ENGMDYWIVRNSWGANWGENGYLRVQRNVASSSGLCGLAIEPSYPV 353


>gi|400180399|gb|AFP73338.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score =  263 bits (673), Expect = 6e-68,   Method: Compositional matrix adjust.
 Identities = 145/339 (42%), Positives = 207/339 (61%), Gaps = 19/339 (5%)

Query: 5   FLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVA 64
            LI +  +     +Q   R+  + S++E+ E W +++GR YK+  E  +RF IFK+N+  
Sbjct: 10  ILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFMIFKENMKF 69

Query: 65  VERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSD-HSSSLKANGTPFL---YKS 120
           +E  N A  GN SY L +N+FAD+T QEF+A  TG  + + + S    + T F+      
Sbjct: 70  IESVNKA--GNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEFIINDLSD 127

Query: 121 SQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCAT 173
             +P +++W E GAVT VK+QG+C       AV ++EG   I    L+  SEQ+L+DC T
Sbjct: 128 DDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCTT 187

Query: 174 NDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVP 233
           N  N GC GGFM +AF +I +N GI+ ++ Y Y G     C S + +  A QI++Y+ V 
Sbjct: 188 N--NYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQY-TCRS-QEKTAAVQISSYQ-VV 242

Query: 234 PNDEESLLKAVANQPVSVAIDASA-LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKY 292
           P  E SLL+AV  QPVS+ I AS  LQFY+GG ++G C   +NH VTA+GYGT E+G KY
Sbjct: 243 PEGETSLLQAVTKQPVSIGIAASQDLQFYAGGTYDGSCADRINHAVTAIGYGTDEKGQKY 302

Query: 293 WLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFP 331
           WL+KNSWG  WGE+G+ ++ RD   P G C IA  +S+P
Sbjct: 303 WLLKNSWGTSWGENGFMKIIRDSGDPSGLCDIAKMSSYP 341


>gi|400180461|gb|AFP73367.1| cysteine protease [Solanum peruvianum]
 gi|400180473|gb|AFP73373.1| cysteine protease [Solanum peruvianum]
 gi|400180475|gb|AFP73374.1| cysteine protease [Solanum peruvianum]
 gi|400180479|gb|AFP73376.1| cysteine protease [Solanum peruvianum]
 gi|400180481|gb|AFP73377.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score =  263 bits (673), Expect = 7e-68,   Method: Compositional matrix adjust.
 Identities = 145/339 (42%), Positives = 207/339 (61%), Gaps = 19/339 (5%)

Query: 5   FLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVA 64
            LI +  +     +Q   R+  + S++E+ E W +++GR YK+  E  +RF IFK+N+  
Sbjct: 10  ILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFMIFKENMKF 69

Query: 65  VERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSD-HSSSLKANGTPFL---YKS 120
           +E  N A  GN SY L +N+FAD+T QEF+A  TG  + + + S    + T F+      
Sbjct: 70  IESVNKA--GNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEFIINDLSD 127

Query: 121 SQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCAT 173
             +P +++W E GAVT VK+QG+C       AV ++EG   I    L+  SEQ+L+DC T
Sbjct: 128 DDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCTT 187

Query: 174 NDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVP 233
           N  N GC GGFM +AF +I +N GI+ ++ Y Y G     C S + +  A QI++Y+ V 
Sbjct: 188 N--NYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQY-TCRS-QEKTAAVQISSYQ-VV 242

Query: 234 PNDEESLLKAVANQPVSVAIDASA-LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKY 292
           P  E SLL+AV  QPVS+ I AS  LQFY+GG ++G C   +NH VTA+GYGT E+G KY
Sbjct: 243 PEGETSLLQAVTKQPVSIGIAASQDLQFYAGGTYDGSCADRINHAVTAIGYGTDEKGQKY 302

Query: 293 WLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFP 331
           WL+KNSWG  WGE+G+ ++ RD   P G C IA  +S+P
Sbjct: 303 WLLKNSWGTSWGENGFMKIIRDYGNPAGLCDIAKMSSYP 341


>gi|400180371|gb|AFP73324.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score =  263 bits (673), Expect = 7e-68,   Method: Compositional matrix adjust.
 Identities = 145/339 (42%), Positives = 207/339 (61%), Gaps = 19/339 (5%)

Query: 5   FLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVA 64
            LI +  +     +Q   R+  + S++E+ E W +++GR YK+  E  +RF IFK+N+  
Sbjct: 10  ILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFMIFKENMKF 69

Query: 65  VERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSD-HSSSLKANGTPFL---YKS 120
           +E  N A  GN SY L +N+FAD+T QEF+A  TG  + + + S    + T F+      
Sbjct: 70  IESVNKA--GNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEFIINDLSD 127

Query: 121 SQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCAT 173
             +P +++W E GAVT VK+QG+C       AV ++EG   I    L+  SEQ+L+DC T
Sbjct: 128 DDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCTT 187

Query: 174 NDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVP 233
           N  N GC GGFM +AF +I +N GI+ ++ Y Y G     C S + +  A QI++Y+ V 
Sbjct: 188 N--NYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQY-TCRS-QEKTAAVQISSYQ-VV 242

Query: 234 PNDEESLLKAVANQPVSVAIDASA-LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKY 292
           P  E SLL+AV  QPVS+ I AS  LQFY+GG ++G C   +NH VTA+GYGT E+G KY
Sbjct: 243 PEGETSLLQAVTKQPVSIGIAASQDLQFYAGGTYDGSCADRINHAVTAIGYGTDEKGQKY 302

Query: 293 WLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFP 331
           WL+KNSWG  WGE+G+ ++ RD   P G C IA  +S+P
Sbjct: 303 WLLKNSWGTSWGENGFMKIIRDSGNPSGLCDIAKMSSYP 341


>gi|255538788|ref|XP_002510459.1| cysteine protease, putative [Ricinus communis]
 gi|223551160|gb|EEF52646.1| cysteine protease, putative [Ricinus communis]
          Length = 422

 Score =  263 bits (673), Expect = 7e-68,   Method: Compositional matrix adjust.
 Identities = 145/308 (47%), Positives = 181/308 (58%), Gaps = 14/308 (4%)

Query: 34  FEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEF 93
           FE W  ++G+TY    +   RF+IF++N   V++ N+   GN SYTL LN FADLT  EF
Sbjct: 32  FESWTKEHGKTYTSKEDKLYRFKIFEENYEFVKKHNSQ--GNSSYTLSLNAFADLTHHEF 89

Query: 94  IASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQC-------AV 146
            AS+ G      S  L     P       VP S++W +KGAV+ VK QG C       A 
Sbjct: 90  KASRLGLSAFSTSGKLSRRNFPLHDFVGDVPISIDWRKKGAVSQVKDQGNCGACWSFSAT 149

Query: 147 AAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSY 206
            A+EGIN I    LVSLSEQ+LVDC     NNGC GG MD A++++I+N GI  +  Y Y
Sbjct: 150 GAIEGINKIVTGSLVSLSEQELVDC-DRSYNNGCEGGLMDYAYQFVIENNGIDTEEDYPY 208

Query: 207 EGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDAS--ALQFYSGG 264
           +      C+  K + H   I  Y DVP N+E+ LLKAVA QPVSV I  S  A Q YS G
Sbjct: 209 QAREK-TCNKEKLKRHVVTIDGYTDVPQNNEKELLKAVAAQPVSVGICGSERAFQLYSKG 267

Query: 265 VFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGI 324
           +F G C T L+H V  VGYG SE G+ YW++KNSWG  WG +GY  + R+    QG CGI
Sbjct: 268 IFTGPCSTSLDHAVLIVGYG-SENGVDYWIVKNSWGTHWGINGYMYMLRNSGNSQGLCGI 326

Query: 325 AMFASFPV 332
            M ASFPV
Sbjct: 327 NMLASFPV 334


>gi|400180381|gb|AFP73329.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score =  263 bits (673), Expect = 7e-68,   Method: Compositional matrix adjust.
 Identities = 145/339 (42%), Positives = 206/339 (60%), Gaps = 19/339 (5%)

Query: 5   FLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVA 64
            LI +  +     +Q   R+  + S++E+ E W +++GR YK+  E  +RF IFK+N+  
Sbjct: 10  ILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFMIFKENMKF 69

Query: 65  VERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSD-HSSSLKANGTPFL---YKS 120
           +E  N A  GN SY L +N+FAD+T QEF+A  TG  + + + S    + T F       
Sbjct: 70  IESVNKA--GNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEFKINDLSD 127

Query: 121 SQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCAT 173
             +P +++W E GAVT VK+QG+C       AV ++EG   I    L+  SEQ+L+DC T
Sbjct: 128 DDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCTT 187

Query: 174 NDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVP 233
           N  N GC GGFM +AF +II+N GI+ ++ Y Y G     C S + +  A QI++Y+ V 
Sbjct: 188 N--NYGCNGGFMTNAFDFIIENGGISRESDYEYLGQQY-TCRS-QEKTAAVQISSYK-VV 242

Query: 234 PNDEESLLKAVANQPVSVAIDASA-LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKY 292
           P  E SLL+AV  QPVS+ I AS  LQFY+GG ++G C   +NH VTA+GYGT E+G KY
Sbjct: 243 PEGETSLLQAVTKQPVSIGIAASQDLQFYAGGTYDGSCADRINHAVTAIGYGTDEKGQKY 302

Query: 293 WLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFP 331
           WL+KNSWG  WGE+G+ ++ RD   P G C I   +S+P
Sbjct: 303 WLLKNSWGTSWGENGFMKIIRDSGDPSGLCDITKMSSYP 341


>gi|359359215|gb|AEV41119.1| putative cysteine protease [Oryza officinalis]
          Length = 499

 Score =  263 bits (673), Expect = 7e-68,   Method: Compositional matrix adjust.
 Identities = 145/298 (48%), Positives = 182/298 (61%), Gaps = 16/298 (5%)

Query: 49  AENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSS 108
            E  +RF +F DNL  V+  N  A  +  + L +N+FADLT  EF A+  G   +     
Sbjct: 84  GEYERRFRVFWDNLKFVDAHNARADEHGGFRLGMNRFADLTNDEFRAAYLGTTPAGRGRH 143

Query: 109 LKANGTPFLYKSSQV-PPSVNWIEKGAVT-PVKYQGQC-------AVAAVEGINAIKINR 159
           +   G  + +   +V P SV+W +KGAV  PVK QGQC       AVAAVEGIN I    
Sbjct: 144 V---GEAYRHDGVEVLPDSVDWRDKGAVVAPVKNQGQCGSCWAFSAVAAVEGINKIVTGE 200

Query: 160 LVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKA 219
           LVSLSEQ+LV+CA N  N+GC GG MDDAF +I +N G+  +  Y Y  M  G C+  K 
Sbjct: 201 LVSLSEQELVECARNGANSGCNGGMMDDAFAFIARNGGLDTEEDYPYTAMD-GKCNLAKK 259

Query: 220 EDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA--LQFYSGGVFNGYCETFLNHG 277
                 I  +EDVP NDE SL KAVA+QPVSVAIDA     Q Y  GVF G C T L+HG
Sbjct: 260 SRKVVSIDGFEDVPENDELSLQKAVAHQPVSVAIDAGGREFQLYDSGVFTGRCGTSLDHG 319

Query: 278 VTAVGYGT-SEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVSK 334
           V AVGYGT +  G  YW ++NSWG DWGE+GY R++R++    G+CGIAM AS+P+ K
Sbjct: 320 VVAVGYGTDAATGTDYWTVRNSWGPDWGENGYIRMERNVTARTGKCGIAMMASYPIKK 377


>gi|194352756|emb|CAQ00106.1| papain-like cysteine proteinase [Hordeum vulgare subsp. vulgare]
          Length = 476

 Score =  263 bits (673), Expect = 7e-68,   Method: Compositional matrix adjust.
 Identities = 142/317 (44%), Positives = 198/317 (62%), Gaps = 18/317 (5%)

Query: 34  FEQWKAQYGRTYKESA----ENSKRFEIFKDNLVAVERFN-NAAIGNRSYTLRLNKFADL 88
           ++ W A++G     +A    +  +RF  F DNL  V+  N  AA G   + L +N+FADL
Sbjct: 52  YDLWLAEHGGGSSPNANSIADRERRFSAFWDNLRFVDAHNARAAAGEEGFRLAMNRFADL 111

Query: 89  TPQEFIASQTGFK-MSDHSSSLKANGTPFLYKSSQ-VPPSVNWIEKGAVTPVKYQGQC-- 144
           T  EF A+  G K  ++ + + +  G  + +  ++ +P +V+W EKGAV PVK QGQC  
Sbjct: 112 TNDEFRAAYLGVKGAAERNRAGRVVGDRYRHDGAEELPEAVDWREKGAVAPVKNQGQCGS 171

Query: 145 -----AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGIT 199
                AV+ VE IN I    +V+LSEQ+LV+C  N  ++GC GG MDDAF++II+N GI 
Sbjct: 172 CWAFSAVSTVESINQIVTGEMVTLSEQELVECDINGQSSGCNGGLMDDAFEFIIKNGGID 231

Query: 200 NDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA-- 257
            +  Y Y+ +  G CD ++       I  +EDVP NDE+SL KAVA+ PVSVAI+A    
Sbjct: 232 TEDDYPYKAVD-GRCDVLRKNAKVVSIDGFEDVPENDEKSLQKAVAHHPVSVAIEAGGRE 290

Query: 258 LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQ 317
            Q Y  GVF+G C T L+HGV AVGYGT E G  YW+++NSWG +WGE GY R++R+I+ 
Sbjct: 291 FQLYHSGVFSGRCGTQLDHGVVAVGYGT-ENGKDYWIVRNSWGPNWGEAGYLRMERNINV 349

Query: 318 PQGQCGIAMFASFPVSK 334
             G+CGIAM +S+P  K
Sbjct: 350 TSGKCGIAMMSSYPTKK 366


>gi|400180449|gb|AFP73361.1| cysteine protease [Solanum chilense]
          Length = 344

 Score =  263 bits (673), Expect = 8e-68,   Method: Compositional matrix adjust.
 Identities = 146/339 (43%), Positives = 206/339 (60%), Gaps = 19/339 (5%)

Query: 5   FLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVA 64
            LI +  +     +Q   R+  + S++E+ E W +++GR YK+  E  +RF IFK N+  
Sbjct: 10  ILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFMIFKKNMKF 69

Query: 65  VERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSD-HSSSLKANGTPFL---YKS 120
           +E  N A  GN SY L +N+FAD+T QEF+A  TG  + + + S    + T F       
Sbjct: 70  IESVNKA--GNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEFKINDLSD 127

Query: 121 SQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCAT 173
             +P +++W E GAVT VK+QGQC       AV ++EG   I   +L+  SEQ+L+DC T
Sbjct: 128 DDMPSNLDWRESGAVTQVKHQGQCGCCWAFSAVGSLEGAYKIATGKLMEFSEQELLDCTT 187

Query: 174 NDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVP 233
           N  N GC GGFM +AF +II+N GI+ ++ Y Y G     C S + +  A QI++Y+ V 
Sbjct: 188 N--NYGCNGGFMTNAFDFIIENGGISRESDYEYLGEQY-TCRS-QEKTAAVQISSYQ-VV 242

Query: 234 PNDEESLLKAVANQPVSVAIDASA-LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKY 292
           P  E SLL+AV  QPVS+ I AS  LQFY+ G ++G C   +NH VTA+GYGT E+G KY
Sbjct: 243 PEGETSLLQAVTKQPVSIGIAASQDLQFYAEGTYDGSCADRINHAVTAIGYGTDEKGQKY 302

Query: 293 WLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFP 331
           WL+KNSWG  WGE+G+ ++ RD   P G C IA  +S+P
Sbjct: 303 WLLKNSWGTSWGENGFMKIIRDSGNPSGLCDIAKMSSYP 341


>gi|400180359|gb|AFP73318.1| cysteine protease [Solanum peruvianum]
 gi|400180477|gb|AFP73375.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score =  263 bits (673), Expect = 8e-68,   Method: Compositional matrix adjust.
 Identities = 146/339 (43%), Positives = 206/339 (60%), Gaps = 19/339 (5%)

Query: 5   FLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVA 64
            LI +  +     SQ   R+  + S++E+ E W +++GR YK+  E  +RF IFK+N+  
Sbjct: 10  ILITLFFVISMFNSQTRARSQPKLSVSERHELWMSRHGRVYKDEVEKGERFMIFKENMKF 69

Query: 65  VERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSD-HSSSLKANGTPFL---YKS 120
           +E  N A  GN SY L +N+FAD+T QEF+A  TG  + + + S    + T F       
Sbjct: 70  IESVNKA--GNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEFKINDLSD 127

Query: 121 SQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCAT 173
             +P +++W E GAVT VK+QG+C       AV ++EG   I    L+  SEQ+L+DC T
Sbjct: 128 DDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCTT 187

Query: 174 NDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVP 233
           N  N GC GGFM +AF +I +N GI+ ++ Y Y G     C S + +  A QI++Y+ V 
Sbjct: 188 N--NYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQY-TCRS-QEKTAAVQISSYQ-VV 242

Query: 234 PNDEESLLKAVANQPVSVAIDASA-LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKY 292
           P  E SLL+AV  QPVS+ I AS  LQFY+GG ++G C   +NH VTA+GYGT E+G KY
Sbjct: 243 PEGETSLLQAVTKQPVSIGIAASQDLQFYAGGTYDGSCADRINHAVTAIGYGTDEKGQKY 302

Query: 293 WLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFP 331
           WL+KNSWG  WGE+G+ ++ RD   P G C IA  +S+P
Sbjct: 303 WLLKNSWGTSWGENGFMKIIRDYGNPAGLCDIAKMSSYP 341


>gi|168057475|ref|XP_001780740.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162667829|gb|EDQ54449.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 463

 Score =  263 bits (673), Expect = 8e-68,   Method: Compositional matrix adjust.
 Identities = 145/316 (45%), Positives = 189/316 (59%), Gaps = 19/316 (6%)

Query: 29  SIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADL 88
           +I + F QW   + R Y+  +E   RF+IFK+N + +   N      +SY L LNKF+DL
Sbjct: 44  AILDVFHQWLETHSRVYRSLSEKHHRFQIFKENFLYIHAHNKQ---QKSYWLGLNKFSDL 100

Query: 89  TPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQC---- 144
           T QEF A   G K  +     +AN   F+Y+  +  P V+W  KGAVT VK QG C    
Sbjct: 101 THQEFRAQYLGTKPVNRQRK-EAN---FMYEDVEAEPKVDWRLKGAVTDVKDQGACGSCW 156

Query: 145 ---AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITND 201
              AV +VEG+NAIK   LVSLSEQ+LVDC     N GC GG MD AF++II+N GI  +
Sbjct: 157 AFSAVGSVEGVNAIKTGELVSLSEQELVDC-DRKQNQGCNGGLMDYAFEFIIKNGGIDTE 215

Query: 202 AVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA--LQ 259
             Y Y+    G CD  +       I +Y+DVP   E +L+KA+   PVSVAI+A     Q
Sbjct: 216 KDYPYKARD-GRCDEGRRNSKVVVIDDYQDVPTQSESALMKALTKNPVSVAIEAGGRDFQ 274

Query: 260 FYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQR-DIDQP 318
            Y GGVF G C + L+HGV AVGYGT ++G+ YW++KNSWG  WGE GY R++R   D  
Sbjct: 275 HYQGGVFTGPCGSELDHGVLAVGYGTDDDGVNYWIVKNSWGPGWGEKGYIRMERFGSDST 334

Query: 319 QGQCGIAMFASFPVSK 334
            G+CGI + ASFP+ K
Sbjct: 335 DGKCGINIEASFPIKK 350


>gi|400180363|gb|AFP73320.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score =  263 bits (672), Expect = 8e-68,   Method: Compositional matrix adjust.
 Identities = 146/343 (42%), Positives = 206/343 (60%), Gaps = 27/343 (7%)

Query: 5   FLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVA 64
            LI +  +     +Q   R+  + S++E+ E W +++GR YK+  E  +RF IFK+N+  
Sbjct: 10  ILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFMIFKENMKF 69

Query: 65  VERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDH--------SSSLKANGTPF 116
           +E  N A  GN SY L +N+FAD+T QEF+A  TG  + +         S+ LK N    
Sbjct: 70  IESVNKA--GNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTELKIND--- 124

Query: 117 LYKSSQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLV 169
                 +P +++W E GAVT VK+QG+C       AV ++EG   I    L+  SEQ+L+
Sbjct: 125 -LSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELL 183

Query: 170 DCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNY 229
           DC TN  N GC GGFM +AF +I +N GI+ ++ Y Y G     C S + +  A QI++Y
Sbjct: 184 DCTTN--NYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQY-TCRS-QEKTAAVQISSY 239

Query: 230 EDVPPNDEESLLKAVANQPVSVAIDASA-LQFYSGGVFNGYCETFLNHGVTAVGYGTSEE 288
           + V P  E SLL+AV  QPVS+ I AS  LQFY+GG ++G C   +NH VTA+GYGT E+
Sbjct: 240 Q-VVPEGETSLLQAVTKQPVSIGIAASQDLQFYAGGTYDGSCADRINHAVTAIGYGTDEK 298

Query: 289 GIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFP 331
           G KYWL+KNSWG  WGE+G+ ++ RD   P G C IA  +S+P
Sbjct: 299 GQKYWLLKNSWGTSWGENGFMKIIRDYGNPAGLCDIAKMSSYP 341


>gi|1514953|dbj|BAA11170.1| cysteine proteinase [Oryza sativa (japonica cultivar-group)]
          Length = 368

 Score =  263 bits (672), Expect = 9e-68,   Method: Compositional matrix adjust.
 Identities = 144/316 (45%), Positives = 195/316 (61%), Gaps = 19/316 (6%)

Query: 29  SIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADL 88
           ++ + +E+W+  +    +   E  +RF  FKDN+  +   N  A G       LN+F D+
Sbjct: 41  ALWDLYERWQEHH-HVPRHHGEKHRRFGAFKDNVRYIHEHNKRAPGYAP----LNRFGDM 95

Query: 89  TPQEFIASQTGFKMSD-HSSSLKANGTP-FLYKSSQ-VPPSVNWIEKGAVTPVKYQGQCA 145
             +EF A+  G   +D     L A   P F+Y+  + +P +V+W  KGAVT VK QG+C 
Sbjct: 96  GREEFRATFAGSHANDLRRDGLAAPPLPGFMYEGVRDLPRAVDWRRKGAVTGVKDQGKCG 155

Query: 146 -------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGI 198
                  V +VEGINAI+  RLVSLSEQ+L+DC T DN+ GC GG M++AF+YI  + GI
Sbjct: 156 SCWAFSTVVSVEGINAIRTGRLVSLSEQELIDCDTADNS-GCQGGLMENAFEYIKHSGGI 214

Query: 199 TNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDAS-- 256
           T ++ Y Y   + G CD+++A      I  +++VP N E +L KAVANQPVSVAIDA   
Sbjct: 215 TTESAYPYR-AANGTCDAVRARGGLVVIDGHQNVPANSEAALAKAVANQPVSVAIDAGDQ 273

Query: 257 ALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDID 316
           + QFYS GVF G C T L+HGV  VGYG + +G +YW++KNSWG  WGE GY R+QRD  
Sbjct: 274 SFQFYSDGVFAGDCGTDLDHGVAVVGYGETNDGTEYWIVKNSWGTAWGEGGYIRMQRDSG 333

Query: 317 QPQGQCGIAMFASFPV 332
              G CGIAM AS+PV
Sbjct: 334 YDGGLCGIAMEASYPV 349


>gi|302759380|ref|XP_002963113.1| hypothetical protein SELMODRAFT_270344 [Selaginella moellendorffii]
 gi|300169974|gb|EFJ36576.1| hypothetical protein SELMODRAFT_270344 [Selaginella moellendorffii]
          Length = 479

 Score =  263 bits (672), Expect = 1e-67,   Method: Compositional matrix adjust.
 Identities = 151/319 (47%), Positives = 193/319 (60%), Gaps = 28/319 (8%)

Query: 34  FEQWKAQYGRTYKESA--------ENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKF 85
           F+ W  Q+G++Y E+A        E + R+ IFKDNL  +   N     N+ Y L LN F
Sbjct: 57  FDSWMLQHGKSYAENALSGDSQAGEKATRYGIFKDNLRFIHGENEK---NQGYFLGLNAF 113

Query: 86  ADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQV---PPSVNWIEKGAVTPVKYQG 142
           ADLT +EF A + G +     S  + +   F Y S Q+   P S++W EKGAV  VK QG
Sbjct: 114 ADLTNEEFRAQRHGGRFD--RSRERTSYEEFRYGSVQLKDLPDSIDWREKGAVVGVKDQG 171

Query: 143 QC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQN 195
            C       AVAA+EG+N +    LVSLSEQ+LVDC   ++  GC GG MD AF ++I+N
Sbjct: 172 SCGSCWAFSAVAAIEGVNKLATGELVSLSEQELVDCDKGEDE-GCNGGLMDYAFGFVIKN 230

Query: 196 KGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDA 255
            G+  +A Y Y+G  T  CD  K       I  YEDVP NDE +LLKAVA+QPVSVAIDA
Sbjct: 231 GGLDTEADYPYKGYGT-RCDRSKMNAKVVTIDGYEDVPVNDETALLKAVAHQPVSVAIDA 289

Query: 256 --SALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQR 313
             S++QFY  G+F G C T L+HGVT VGYG  E+G  YW+IKNSWG +WGE GY ++ R
Sbjct: 290 GGSSMQFYRSGIFTGRCGTDLDHGVTNVGYG-KEDGKAYWIIKNSWGSNWGEKGYIKMAR 348

Query: 314 DIDQPQGQCGIAMFASFPV 332
           +     G CGI M AS+P 
Sbjct: 349 NTGLAAGLCGINMEASYPT 367


>gi|109939734|sp|P25776.2|ORYA_ORYSJ RecName: Full=Oryzain alpha chain; Flags: Precursor
 gi|78192122|gb|ABB30151.1| oryzain alpha [Oryza sativa Japonica Group]
          Length = 458

 Score =  263 bits (672), Expect = 1e-67,   Method: Compositional matrix adjust.
 Identities = 144/311 (46%), Positives = 189/311 (60%), Gaps = 14/311 (4%)

Query: 34  FEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAA-IGNRSYTLRLNKFADLTPQE 92
           + +WKA++G++Y    E  +R+  F+DNL  ++  N AA  G  S+ L LN+FADLT +E
Sbjct: 40  YAEWKAEHGKSYNAVGEEERRYAAFRDNLRYIDEHNAAADAGVHSFRLGLNRFADLTNEE 99

Query: 93  FIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQC-------A 145
           +  +  G +        K +       +  +P SV+W  KGAV  +K QG C       A
Sbjct: 100 YRDTYLGLRNKPRRER-KVSDRYLAADNEALPESVDWRTKGAVAEIKDQGGCGSCWAFSA 158

Query: 146 VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYS 205
           +AAVEGIN I    L+SLSEQ+LVDC T+  N GC GG MD AF +II N GI  +  Y 
Sbjct: 159 IAAVEGINQIVTGDLISLSEQELVDCDTS-YNEGCNGGLMDYAFDFIINNGGIDTEDDYP 217

Query: 206 YEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDAS--ALQFYSG 263
           Y+G     CD  +       I +YEDV PN E SL KAVANQPVSVAI+A   A Q YS 
Sbjct: 218 YKGKDE-RCDVNRKNAKVVTIDSYEDVTPNSETSLQKAVANQPVSVAIEAGGRAFQLYSS 276

Query: 264 GVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCG 323
           G+F G C T L+HGV AVGYGT E G  YW+++NSWG+ WGE GY R++R+I    G+CG
Sbjct: 277 GIFTGKCGTALDHGVAAVGYGT-ENGKDYWIVRNSWGKSWGESGYVRMERNIKASSGKCG 335

Query: 324 IAMFASFPVSK 334
           IA+  S+P+ K
Sbjct: 336 IAVEPSYPLKK 346


>gi|218183|dbj|BAA14403.1| oryzain beta precursor [Oryza sativa Japonica Group]
          Length = 471

 Score =  263 bits (671), Expect = 1e-67,   Method: Compositional matrix adjust.
 Identities = 142/311 (45%), Positives = 195/311 (62%), Gaps = 17/311 (5%)

Query: 34  FEQWKAQYGRTYKES--AENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQ 91
           ++ W A+ G     +   E+ +RF +F DNL  V+  N  A     + L +N+FADLT +
Sbjct: 51  YDLWLAENGGGSPNALGGEHERRFLVFWDNLKFVDAHNARADEGGGFRLGMNRFADLTNE 110

Query: 92  EFIASQTGFKMSDHSSSLKANGTPFLYKS-SQVPPSVNWIEKGAVTPVKYQGQC------ 144
           EF A+  G K+++ S   +A G  + +    ++P SV+W EKGAV PVK QGQC      
Sbjct: 111 EFRATFLGAKVAERS---RAAGERYRHDGVEELPESVDWREKGAVAPVKNQGQCGSCWAF 167

Query: 145 -AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAV 203
            AV+ VE IN +    +++LSEQ+LV+C+TN  N+GC GG M DAF +II+N GI  +  
Sbjct: 168 SAVSTVESINQLVTGEMITLSEQELVECSTNGQNSGCNGGLMADAFDFIIKNGGIDTEDD 227

Query: 204 YSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA--LQFY 261
           Y Y+ +  G CD  +       I  +EDVP NDE+SL KAVA+QPVSVAI+A     Q Y
Sbjct: 228 YPYKAVD-GKCDINRENAKVVSIDGFEDVPQNDEKSLQKAVAHQPVSVAIEAGGREFQLY 286

Query: 262 SGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQ 321
             GVF+G C T L+HGV AVGYGT + G  YW+++NSWG  WGE GY R++R+I+   G+
Sbjct: 287 HSGVFSGRCGTSLDHGVVAVGYGT-DNGKDYWIVRNSWGPKWGESGYVRMERNINVTTGK 345

Query: 322 CGIAMFASFPV 332
           CGIAM AS+P 
Sbjct: 346 CGIAMMASYPT 356


>gi|413933049|gb|AFW67600.1| cysteine protease 1 [Zea mays]
          Length = 341

 Score =  263 bits (671), Expect = 1e-67,   Method: Compositional matrix adjust.
 Identities = 152/346 (43%), Positives = 212/346 (61%), Gaps = 22/346 (6%)

Query: 2   AKYFLIVVLIISGSC---ASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIF 58
           A  FL+ VL++  +    A+          ++A + E+W A++GR YK+ AE ++R E+F
Sbjct: 3   ASRFLLAVLVVGSAVLCTAAAPRALAAAAAAMASRHEKWMAEHGRAYKDEAEKARRLEVF 62

Query: 59  KDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLY 118
           + N   ++ FN  A G  S+ L  N+FADLT +EF A++TG +     S   A    F Y
Sbjct: 63  RANAELIDSFN--AAGTHSHRLATNRFADLTVEEFRAARTGLRPRPAPS---AGAGRFRY 117

Query: 119 KS---SQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQL 168
           ++   +    SV+W   GAVT VK QG C       AVAAVEG+N I+  RLVSLSEQ+L
Sbjct: 118 ENFSLADAAQSVDWRAMGAVTGVKDQGACGCCWAFSAVAAVEGLNKIRTGRLVSLSEQEL 177

Query: 169 VDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITN 228
           VDC  +  + GC GG MD+AF+++ +  G+ +++ Y Y+G   G C S  A   AA I  
Sbjct: 178 VDCDVSGVDQGCDGGLMDNAFQFVARRGGLASESGYPYQGRD-GPCRSSAAAARAASIRG 236

Query: 229 YEDVPPNDEESLLKAVANQPVSVAIDAS--ALQFYSGGVFNGYCETFLNHGVTAVGYGTS 286
           +EDVP N+E +L  AVANQPVSVAI+    A +FY  GV  G C T LNH +TAVGYGT+
Sbjct: 237 HEDVPRNNEAALAAAVANQPVSVAINGEDMAFRFYDSGVLGGACGTDLNHAITAVGYGTA 296

Query: 287 EEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
            +G +YWL+KNSWG  WGE GY R++R + + +G CG+A   S+PV
Sbjct: 297 NDGTRYWLMKNSWGASWGEGGYVRIRRGV-RGEGVCGLAKLPSYPV 341


>gi|18141283|gb|AAL60579.1|AF454957_1 senescence-associated cysteine protease [Brassica oleracea]
          Length = 460

 Score =  263 bits (671), Expect = 1e-67,   Method: Compositional matrix adjust.
 Identities = 150/355 (42%), Positives = 205/355 (57%), Gaps = 33/355 (9%)

Query: 5   FLIVVLIISGSCASQATYRTFDE------------GSIAEKFEQWKAQYGRTYKESA--- 49
            L++ ++I  S A+  +  ++DE              +A  +E W  ++G+  + +    
Sbjct: 8   ILLLAMMIGVSYAADMSIISYDEKHHITAENERSDAEVARIYEAWMEKHGKKAQSNGLVG 67

Query: 50  -ENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSS 108
            E  +RFEIFKDNL  ++  NN    N SY L L +FADLT +E+ +   G K       
Sbjct: 68  EEKDQRFEIFKDNLRFIDEHNNK---NLSYKLGLTRFADLTNEEYRSIYLGAKSKKRVLK 124

Query: 109 LKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLV 161
                 P +     +P SV+W ++GAV  VK QG C        + AVEGIN I    L+
Sbjct: 125 TSDRYQPRV--GDAIPDSVDWRKEGAVAAVKDQGSCGSCWAFSTIGAVEGINKIVTGDLI 182

Query: 162 SLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAED 221
           SLSEQ+LVDC T+  N GC GG MD AF++II+N GI  +  Y Y+  + G CD  +   
Sbjct: 183 SLSEQELVDCDTS-YNQGCNGGLMDYAFEFIIKNGGIDTEEDYPYKA-ADGRCDQTRKNA 240

Query: 222 HAAQITNYEDVPPNDEESLLKAVANQPVSVAIDAS--ALQFYSGGVFNGYCETFLNHGVT 279
               I  YEDVP N+E +L K +ANQP+SVAI+A   A Q YS GVF+G C T L+HGV 
Sbjct: 241 KVVTIDAYEDVPENNEAALKKTLANQPISVAIEAGGRAFQLYSSGVFDGICGTELDHGVV 300

Query: 280 AVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVSK 334
           AVGYGT E G  YW+++NSWG  WGE GY ++ R+I +P G+CGIAM AS+P+ K
Sbjct: 301 AVGYGT-ENGKDYWIVRNSWGGSWGESGYIKMARNIAEPTGKCGIAMEASYPIKK 354


>gi|4426617|gb|AAD20453.1| cysteine endopeptidase precursor [Oryza sativa]
          Length = 368

 Score =  263 bits (671), Expect = 1e-67,   Method: Compositional matrix adjust.
 Identities = 144/316 (45%), Positives = 195/316 (61%), Gaps = 19/316 (6%)

Query: 29  SIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADL 88
           ++ + +E+W+  +    +   E  +RF  FKDN+  +   N  A G       LN+F D+
Sbjct: 41  ALWDLYERWQEHH-HVPRHHGEKHRRFGAFKDNVRYIHEHNKRAPGYPP----LNRFGDM 95

Query: 89  TPQEFIASQTGFKMSD-HSSSLKANGTP-FLYKSSQ-VPPSVNWIEKGAVTPVKYQGQCA 145
             +EF A+  G   +D     L A   P F+Y+  + +P +V+W  KGAVT VK QG+C 
Sbjct: 96  GREEFRATFAGSHANDLRRDGLAAPPLPGFMYEGVRDLPRAVDWRRKGAVTGVKDQGKCG 155

Query: 146 -------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGI 198
                  V +VEGINAI+  RLVSLSEQ+L+DC T DN+ GC GG M++AF+YI  + GI
Sbjct: 156 SCWAFSTVVSVEGINAIRTGRLVSLSEQELIDCDTADNS-GCQGGLMENAFEYIKHSGGI 214

Query: 199 TNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDAS-- 256
           T ++ Y Y   + G CD+++A      I  +++VP N E +L KAVANQPVSVAIDA   
Sbjct: 215 TTESAYPYR-AANGTCDAVRARGGLVVIDGHQNVPANSEAALAKAVANQPVSVAIDAGDQ 273

Query: 257 ALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDID 316
           + QFYS GVF G C T L+HGV  VGYG + +G +YW++KNSWG  WGE GY R+QRD  
Sbjct: 274 SFQFYSDGVFAGDCGTDLDHGVAVVGYGETNDGTEYWIVKNSWGTAWGEGGYIRMQRDSG 333

Query: 317 QPQGQCGIAMFASFPV 332
              G CGIAM AS+PV
Sbjct: 334 YDGGLCGIAMEASYPV 349


>gi|194352752|emb|CAQ00104.1| papain-like cysteine proteinase [Hordeum vulgare subsp. vulgare]
          Length = 351

 Score =  263 bits (671), Expect = 1e-67,   Method: Compositional matrix adjust.
 Identities = 154/354 (43%), Positives = 210/354 (59%), Gaps = 31/354 (8%)

Query: 2   AKYFLIVVLIISGSCASQAT------YRTFDEGS---IAEKFEQWKAQYGRTYKESAENS 52
           +K  + V+L+  G+C ++ +      Y   D  S   + E FE+W A++ + Y    E  
Sbjct: 3   SKLSVAVLLLCVGACVARNSDFSIVGYSEEDLSSHDRLVELFEKWLAKHQKAYASFEEKL 62

Query: 53  KRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKAN 112
            RFE+FKDNL  ++  N       SY L LN+FADLT  EF  +  G        + +++
Sbjct: 63  HRFEVFKDNLKLIDEINREVT---SYWLGLNEFADLTHDEFKTTYLGLSPP---PARRSS 116

Query: 113 GTPFLYK---SSQVPPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVS 162
              F Y+   +  +P +V+W +KGAVT VK QGQC        VAAVEGINAI    L +
Sbjct: 117 SRSFRYENVAAHDLPKAVDWRKKGAVTDVKNQGQCGSCWAFSTVAAVEGINAIVTGNLTA 176

Query: 163 LSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGIC-DSIKAED 221
           LSEQ+L+DC+  D N+GC GG MD AF YI  + G+  +  Y Y  M  G C D  K+E 
Sbjct: 177 LSEQELIDCSV-DGNSGCNGGMMDYAFSYIASSGGLHTEEAYPYL-MEEGSCGDGKKSES 234

Query: 222 HAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA--LQFYSGGVFNGYCETFLNHGVT 279
            A  I+ YEDVP  DE++L+KA+A+QPVSVAI+AS    QFYSGGVF+G C   L+HGV 
Sbjct: 235 EAVSISGYEDVPTKDEQALIKALAHQPVSVAIEASGRHFQFYSGGVFDGPCGAQLDHGVA 294

Query: 280 AVGYGTSE-EGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
           AVGYG+ + +G  Y ++KNSWG  WGE GY R++R   + +G CGI   AS+P 
Sbjct: 295 AVGYGSDKGKGHDYIIVKNSWGGKWGEKGYIRMKRGTGKSEGLCGINKMASYPT 348


>gi|400180375|gb|AFP73326.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score =  263 bits (671), Expect = 1e-67,   Method: Compositional matrix adjust.
 Identities = 146/339 (43%), Positives = 207/339 (61%), Gaps = 19/339 (5%)

Query: 5   FLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVA 64
            LI +  +     SQ   R+  + S++E+ E W +++GR YK+  E  +RF IFK+N+  
Sbjct: 10  ILITLFFVISMFNSQTRARSQPKLSVSERHELWMSRHGRVYKDEVEKGERFMIFKENIKF 69

Query: 65  VERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSD-HSSSLKANGTPFL---YKS 120
           +E  N A  GN SY L +N+FAD+T QEF+A  TG  + + + S    + T F       
Sbjct: 70  IESVNKA--GNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEFKINDLSD 127

Query: 121 SQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCAT 173
             +P +++W E GAVT VK+QG+C       AV ++EG   I    L+  SEQ+L+DC T
Sbjct: 128 DDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCTT 187

Query: 174 NDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVP 233
           N  N GC GGFM +AF +I +N GI++++ Y Y G     C S + +  A QI++Y+ V 
Sbjct: 188 N--NYGCDGGFMTNAFDFIKENGGISSESDYEYLGEQY-TCRS-QEKTAAVQISSYQ-VV 242

Query: 234 PNDEESLLKAVANQPVSVAIDASA-LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKY 292
           P  E SLL+AV  QPVS+ I AS  LQFY+GG ++G C   +NH VTA+GYGT E+G KY
Sbjct: 243 PEGETSLLQAVTKQPVSIGIAASQDLQFYAGGTYDGSCADRINHAVTAIGYGTDEKGQKY 302

Query: 293 WLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFP 331
           WL+KNSWG  WGE+G+ ++ RD   P G C IA  +S+P
Sbjct: 303 WLLKNSWGTSWGENGFMKIIRDSGNPAGLCDIAKMSSYP 341


>gi|171702843|dbj|BAG16377.1| cysteine protease [Brassica rapa var. perviridis]
          Length = 431

 Score =  263 bits (671), Expect = 1e-67,   Method: Compositional matrix adjust.
 Identities = 142/315 (45%), Positives = 191/315 (60%), Gaps = 23/315 (7%)

Query: 30  IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLT 89
           ++  +E+W  ++G+      E  +RFEIFKDNL  ++  N     N SY L L KFADLT
Sbjct: 38  VSRLYEEWVVKHGKAQNSLTEKDRRFEIFKDNLRFIDEHNGK---NLSYRLGLTKFADLT 94

Query: 90  PQEFIASQTGFKMSDHSSSLKANGTPFLYKS---SQVPPSVNWIEKGAVTPVKYQGQCA- 145
             E+ +   G ++       KA  T   Y++     +P SV+W ++GAV  VK QG C  
Sbjct: 95  NDEYRSMYLGSRLK-----RKATKTSLRYEARVGDAIPESVDWRKEGAVAEVKDQGSCGS 149

Query: 146 ------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGIT 199
                 + AVEGIN I    L+SLSEQ+LVDC T+  N GC GG MD AF++II+N GI 
Sbjct: 150 CWAFSTIGAVEGINKIVTGDLISLSEQELVDCDTS-YNEGCNGGLMDYAFEFIIKNGGID 208

Query: 200 NDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDAS--A 257
            +  Y Y+G+  G CD  +       I +YEDVP N EESL KA+++QP+SVAI+    A
Sbjct: 209 TEEDYPYKGVD-GRCDQTRKNAKVVTIDSYEDVPANSEESLKKALSHQPISVAIEGGGRA 267

Query: 258 LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQ 317
            Q Y  G+F+G C T L+HGV AVGYGT E G  YW++KNSWG  WGE GY R++R+I  
Sbjct: 268 FQLYDSGIFDGICGTDLDHGVVAVGYGT-ENGKDYWIVKNSWGTSWGESGYIRMERNIAS 326

Query: 318 PQGQCGIAMFASFPV 332
             G+CGIA+  S+P+
Sbjct: 327 SAGKCGIAVEPSYPI 341


>gi|400180349|gb|AFP73313.1| cysteine protease [Solanum peruvianum]
 gi|400180469|gb|AFP73371.1| cysteine protease [Solanum peruvianum]
 gi|400180471|gb|AFP73372.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score =  263 bits (671), Expect = 1e-67,   Method: Compositional matrix adjust.
 Identities = 146/343 (42%), Positives = 206/343 (60%), Gaps = 27/343 (7%)

Query: 5   FLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVA 64
            LI +  +     +Q   R+  + S++E+ E W +++GR YK+  E  +RF IFK+N+  
Sbjct: 10  ILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKVERFMIFKENMKF 69

Query: 65  VERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDH--------SSSLKANGTPF 116
           +E  N A  GN SY L +N+FAD+T QEF+A  TG  + +         S+ LK N    
Sbjct: 70  IESVNKA--GNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTELKIND--- 124

Query: 117 LYKSSQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLV 169
                 +P +++W E GAVT VK+QG+C       AV ++EG   I    L+  SEQ+L+
Sbjct: 125 -LSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELL 183

Query: 170 DCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNY 229
           DC TN  N GC GGFM +AF +I +N GI+ ++ Y Y G     C S + +  A QI++Y
Sbjct: 184 DCTTN--NYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQY-TCRS-QEKTAAVQISSY 239

Query: 230 EDVPPNDEESLLKAVANQPVSVAIDASA-LQFYSGGVFNGYCETFLNHGVTAVGYGTSEE 288
           + V P  E SLL+AV  QPVS+ I AS  LQFY+GG ++G C   +NH VTA+GYGT E+
Sbjct: 240 Q-VVPEGETSLLQAVTKQPVSIGIAASQDLQFYAGGTYDGSCADRINHAVTAIGYGTDEK 298

Query: 289 GIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFP 331
           G KYWL+KNSWG  WGE+G+ ++ RD   P G C IA  +S+P
Sbjct: 299 GQKYWLLKNSWGTSWGENGFMKIIRDYGNPAGLCDIAKMSSYP 341


>gi|357126406|ref|XP_003564878.1| PREDICTED: cysteine proteinase EP-B 1-like [Brachypodium
           distachyon]
          Length = 377

 Score =  263 bits (671), Expect = 1e-67,   Method: Compositional matrix adjust.
 Identities = 145/331 (43%), Positives = 204/331 (61%), Gaps = 23/331 (6%)

Query: 27  EGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVE----RFNNAAIGNR--SYTL 80
           E ++ E + +W++ +    +  AE  +RF  FK N++ +     R N+ +  N   SY L
Sbjct: 35  EEALWELYTRWQSAHRLPPQHHAEKHRRFGTFKSNVLFIHAHNTRLNDTSTNNNGPSYRL 94

Query: 81  RLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTP-FLYKSSQ-VPPSVNWIEKGAVTPV 138
           RLN+F D+   EF   ++ F    H  +  A   P F+Y + + +P +V+W +KGAVT V
Sbjct: 95  RLNRFGDMDQAEF---RSTFAGPLHRHTRPAQSIPGFIYDTVKDIPQAVDWRQKGAVTGV 151

Query: 139 KYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKY 191
           K QG+C       AVA+VEG+NAI+   LVSLSEQ+L+DC T  ++NGC GG M+ AF++
Sbjct: 152 KDQGKCGSCWAFSAVASVEGLNAIRTGSLVSLSEQELIDCDTGGDDNGCQGGLMESAFEF 211

Query: 192 IIQNKG-ITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVS 250
           I  + G +  +A Y Y   S G C++ +    + +I  ++ VP  +EE+L KAVA+QPVS
Sbjct: 212 IAHSAGGLATEAAYPYH-ASNGTCNANRGSSVSVRIDGHQSVPAGNEEALAKAVAHQPVS 270

Query: 251 VAIDAS--ALQFYSGGVFNGYCETFLNHGVTAVGYGTSEE-GIKYWLIKNSWGQDWGEDG 307
           VAIDA   A QFYS GVF G C + L+HGV  VGYG +EE G +YW++KNSWG  WGE G
Sbjct: 271 VAIDAGGQAFQFYSEGVFTGDCGSELDHGVAVVGYGVAEEDGKEYWIVKNSWGPGWGEHG 330

Query: 308 YFRLQRDIDQPQGQCGIAMFASFPVSKESAQ 338
           Y R+QRD     G CGIAM AS+PV  E  +
Sbjct: 331 YVRMQRDSGVDGGLCGIAMEASYPVKNEQTK 361


>gi|21593501|gb|AAM65468.1| cysteine proteinase [Arabidopsis thaliana]
          Length = 376

 Score =  263 bits (671), Expect = 1e-67,   Method: Compositional matrix adjust.
 Identities = 149/329 (45%), Positives = 191/329 (58%), Gaps = 15/329 (4%)

Query: 20  ATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYT 79
           AT    +EG +   +EQW  + G+ Y    E  +RF+IFKDNL  +E  N+    NRSY 
Sbjct: 27  ATESQRNEGGVLTMYEQWLVENGKNYNGLGEKERRFKIFKDNLKRIEEHNSDP--NRSYE 84

Query: 80  LRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQV-PPSVNWIEKGAVTP- 137
             LNKF+DLT  EF AS  G KM   S S  A    + YK   V P  V+W E+GAV P 
Sbjct: 85  RGLNKFSDLTADEFQASYLGGKMEKKSLSDVAE--RYQYKEGDVLPDEVDWRERGAVVPR 142

Query: 138 VKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFK 190
           VK QG+C       A  AVEGIN I    LVSLSEQ+L+DC   ++N GC GG    AF+
Sbjct: 143 VKRQGECGSCWAFAATGAVEGINQITTGELVSLSEQELIDCDRGNDNFGCAGGGAVWAFE 202

Query: 191 YIIQNKGITNDAVYSYEGMSTGICDSIKAE-DHAAQITNYEDVPPNDEESLLKAVANQPV 249
           +I +N GI +D VY Y G  T  C +I+ +      I  +E VP NDE SL KAVA QP+
Sbjct: 203 FIKENGGIVSDEVYGYTGEDTAACKAIEMKTTRVVTINGHEVVPVNDEMSLKKAVAYQPI 262

Query: 250 SVAIDASALQFYSGGVFNGYCETFL-NHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGY 308
           SV I A+ +  Y  GV+ G C     +H V  VGYGTS +   YWLI+NSWG +WGE GY
Sbjct: 263 SVMISAANMSDYKSGVYKGACSNLWGDHNVLIVGYGTSSDEGDYWLIRNSWGPEWGEGGY 322

Query: 309 FRLQRDIDQPQGQCGIAMFASFPVSKESA 337
            RLQR+  +P G+C +A+   +P+   S+
Sbjct: 323 LRLQRNFHEPTGKCAVAVAPVYPIKSNSS 351


>gi|400180467|gb|AFP73370.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score =  263 bits (671), Expect = 1e-67,   Method: Compositional matrix adjust.
 Identities = 145/339 (42%), Positives = 206/339 (60%), Gaps = 19/339 (5%)

Query: 5   FLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVA 64
            LI +  +     +Q   R+  + S++E+ E W +++GR YK+  E  +RF IFK+N+  
Sbjct: 10  ILITLFFVISMFNTQTRARSQPKLSVSERHELWMSRHGRVYKDEVEKGERFMIFKENMKF 69

Query: 65  VERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSD-HSSSLKANGTPFL---YKS 120
           +E  N A  GN SY L +N+FAD+T QEF+A  TG  + + + S    + T F       
Sbjct: 70  IESVNKA--GNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEFKINDLSD 127

Query: 121 SQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCAT 173
             +P +++W E GAVT VK+QG+C       AV ++EG   I    L+  SEQ+L+DC T
Sbjct: 128 DDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCTT 187

Query: 174 NDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVP 233
           N  N GC GGFM +AF +I +N GI+ ++ Y Y G     C S + +  A QI++Y+ V 
Sbjct: 188 N--NYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQY-TCRS-QEKTAAVQISSYQ-VV 242

Query: 234 PNDEESLLKAVANQPVSVAIDASA-LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKY 292
           P  E SLL+AV  QPVS+ I AS  LQFY+GG ++G C   +NH VTA+GYGT E+G KY
Sbjct: 243 PEGETSLLQAVTKQPVSIGIAASQDLQFYAGGTYDGSCADRINHAVTAIGYGTDEKGQKY 302

Query: 293 WLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFP 331
           WL+KNSWG  WGE+G+ ++ RD   P G C IA  +S+P
Sbjct: 303 WLLKNSWGTSWGENGFMKIIRDYGNPAGLCDIAKMSSYP 341


>gi|222629675|gb|EEE61807.1| hypothetical protein OsJ_16426 [Oryza sativa Japonica Group]
          Length = 459

 Score =  263 bits (671), Expect = 1e-67,   Method: Compositional matrix adjust.
 Identities = 144/311 (46%), Positives = 189/311 (60%), Gaps = 14/311 (4%)

Query: 34  FEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAA-IGNRSYTLRLNKFADLTPQE 92
           + +WKA++G++Y    E  +R+  F+DNL  ++  N AA  G  S+ L LN+FADLT +E
Sbjct: 41  YAEWKAEHGKSYNAVGEEERRYAAFRDNLRYIDEHNAAADAGVHSFRLGLNRFADLTNEE 100

Query: 93  FIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQC-------A 145
           +  +  G +        K +       +  +P SV+W  KGAV  +K QG C       A
Sbjct: 101 YRDTYLGLRNKPRRER-KVSDRYLAADNEALPESVDWRTKGAVAEIKDQGGCGSCWAFSA 159

Query: 146 VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYS 205
           +AAVEGIN I    L+SLSEQ+LVDC T+  N GC GG MD AF +II N GI  +  Y 
Sbjct: 160 IAAVEGINQIVTGDLISLSEQELVDCDTS-YNEGCNGGLMDYAFDFIINNGGIDTEDDYP 218

Query: 206 YEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDAS--ALQFYSG 263
           Y+G     CD  +       I +YEDV PN E SL KAVANQPVSVAI+A   A Q YS 
Sbjct: 219 YKGKDE-RCDVNRKNAKVVTIDSYEDVTPNSETSLQKAVANQPVSVAIEAGGRAFQLYSS 277

Query: 264 GVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCG 323
           G+F G C T L+HGV AVGYGT E G  YW+++NSWG+ WGE GY R++R+I    G+CG
Sbjct: 278 GIFTGKCGTALDHGVAAVGYGT-ENGKDYWIVRNSWGKSWGESGYVRMERNIKASSGKCG 336

Query: 324 IAMFASFPVSK 334
           IA+  S+P+ K
Sbjct: 337 IAVEPSYPLKK 347


>gi|111073715|dbj|BAF02546.1| triticain alpha [Triticum aestivum]
 gi|388890585|gb|AFK80346.1| cysteine endopeptidase EP alpha [Secale cereale x Triticum durum]
          Length = 461

 Score =  263 bits (671), Expect = 1e-67,   Method: Compositional matrix adjust.
 Identities = 147/329 (44%), Positives = 199/329 (60%), Gaps = 22/329 (6%)

Query: 18  SQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAA-IGNR 76
           S  +Y    E  +   + +W +++ RTY    E  +RFE+F+DNL  +++ N AA  G  
Sbjct: 25  SIVSYGERSEEEVRRMYAEWMSEHRRTYNAIGEEERRFEVFRDNLRYIDQHNAAADAGLH 84

Query: 77  SYTLRLNKFADLTPQEFIASQTGFKMS-DHSSSLKANGTPFLYKS---SQVPPSVNWIEK 132
           S+ L LN+FADLT +E+ ++  G +   D    L A      Y++    ++P +V+W +K
Sbjct: 85  SFRLGLNRFADLTNEEYRSTYLGARTKPDRERKLSAR-----YQADDNEELPETVDWRKK 139

Query: 133 GAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFM 185
           GAV  +K QG C       A+AAVEGIN I    ++ LSEQ+LVDC T+  N GC GG M
Sbjct: 140 GAVAAIKDQGGCGSCWAFSAIAAVEGINQIVTGDMIPLSEQELVDCDTS-YNEGCNGGLM 198

Query: 186 DDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVA 245
           D AF++II N GI ++  Y Y+      CD+ K       I  YEDVP N E+SL KAVA
Sbjct: 199 DYAFEFIINNGGIDSEEDYPYKERDN-RCDANKKNAKVVTIDGYEDVPVNSEKSLQKAVA 257

Query: 246 NQPVSVAIDAS--ALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDW 303
           NQP+SVAI+A   A Q Y  G+F G C T L+HGV AVGYGT E G  YWL++NSWG  W
Sbjct: 258 NQPISVAIEAGGRAFQLYKSGIFTGTCGTALDHGVAAVGYGT-ENGKDYWLVRNSWGTVW 316

Query: 304 GEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
           GEDGY R++R+I    G+CGIA+  S+P 
Sbjct: 317 GEDGYIRMERNIKASSGKCGIAVEPSYPT 345


>gi|400180437|gb|AFP73356.1| cysteine protease [Solanum pennellii]
          Length = 337

 Score =  262 bits (670), Expect = 1e-67,   Method: Compositional matrix adjust.
 Identities = 147/335 (43%), Positives = 204/335 (60%), Gaps = 18/335 (5%)

Query: 5   FLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVA 64
            LI +  +     +Q   R+  + S++E+ E W +++GR YK+  E  +RF IFK+N+  
Sbjct: 10  ILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFMIFKENMKF 69

Query: 65  VERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVP 124
           +E  N A  GN SY L +N+FAD+T QEF+A  TG  + +  S L  +    L     +P
Sbjct: 70  IESVNKA--GNLSYKLGMNEFADITSQEFLAKFTGLNIPN--SYLSPSPINDL-SDDDMP 124

Query: 125 PSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNN 177
            +++W E GAVT VK QGQC       AV ++EG   I    L+  SEQ+L+DC TN  N
Sbjct: 125 SNLDWRESGAVTQVKNQGQCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCTTN--N 182

Query: 178 NGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDE 237
            GC GGFM +AF +I +N GI+ ++ Y Y G     C S + +  A QI++Y+ V P  E
Sbjct: 183 YGCNGGFMTNAFDFIKENGGISRESDYEYLGQQY-TCRS-QEKTAAVQISSYQ-VVPEGE 239

Query: 238 ESLLKAVANQPVSVAIDASA-LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIK 296
            SLL+AV  QPVS+ I AS  LQFY+GG ++G C   +NH VTA+GYGT E+G KYWL+K
Sbjct: 240 TSLLQAVTKQPVSIGIAASQDLQFYAGGTYDGSCANRINHAVTAIGYGTDEKGQKYWLLK 299

Query: 297 NSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFP 331
           NSWG  WGEDG+ ++ RD   P G C IA  +S+P
Sbjct: 300 NSWGTSWGEDGFMKIIRDSGNPAGLCDIAKVSSYP 334


>gi|125604306|gb|EAZ43631.1| hypothetical protein OsJ_28254 [Oryza sativa Japonica Group]
          Length = 369

 Score =  262 bits (670), Expect = 1e-67,   Method: Compositional matrix adjust.
 Identities = 147/323 (45%), Positives = 196/323 (60%), Gaps = 28/323 (8%)

Query: 27  EGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFA 86
           E ++ E +E+W+ Q+ R  ++  E ++RF +FKDN+  +  FN     +  Y LRLN+F 
Sbjct: 41  EEALWELYERWRGQH-RVARDLGEKARRFNVFKDNVRLIHEFNRR---DEPYKLRLNRFG 96

Query: 87  DLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQCA- 145
           D+T  E   +    ++S H    +  G     K+ ++         GAV  VK QGQC  
Sbjct: 97  DMTADESAGAYASSRVS-HHRMFRGRGE----KAQRL--------HGAVGAVKDQGQCGS 143

Query: 146 ------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGIT 199
                 +AAVEGINAI+ + L +LSEQQLVDC T   N GC GG MD+AF+YI ++ G+ 
Sbjct: 144 CWAFSTIAAVEGINAIRTSNLTALSEQQLVDCDTKTGNAGCDGGLMDNAFQYIAKHGGVA 203

Query: 200 NDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDA--SA 257
             + Y Y    +  C S  A   A  I  YEDVP N E +L KAVANQPVSVAI+A  S 
Sbjct: 204 ASSAYPYRARQS-SCKSSAASSPAVTIDGYEDVPANSESALKKAVANQPVSVAIEAGGSH 262

Query: 258 LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQ 317
            QFYS GVF G C T L+HGV AVGYGT+ +G KYW+++NSWG DWGE GY R++RD+  
Sbjct: 263 FQFYSEGVFAGKCGTELDHGVAAVGYGTTVDGTKYWIVRNSWGADWGEKGYIRMKRDVSA 322

Query: 318 PQGQCGIAMFASFPVSKESAQPS 340
            +G CGIAM AS+P+ K S  P+
Sbjct: 323 KEGLCGIAMEASYPI-KTSPNPA 344


>gi|50355615|dbj|BAD29956.1| cysteine protease [Daucus carota]
          Length = 423

 Score =  262 bits (670), Expect = 1e-67,   Method: Compositional matrix adjust.
 Identities = 147/304 (48%), Positives = 192/304 (63%), Gaps = 18/304 (5%)

Query: 40  QYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTG 99
           ++ + Y       KRFEIFKDNL  ++  N     N+S+ L LNKFADL+ +E+ +   G
Sbjct: 13  KHHKNYNALGAKEKRFEIFKDNLRFIDEHNKGV--NQSFKLGLNKFADLSNEEYKSMFLG 70

Query: 100 FKMSDHSSSLKANGTPFLYK-SSQVPPSVNWIEKGAVTPVKYQGQCA-------VAAVEG 151
            +M       +++   F Y    ++P SV+W EKGAV PVK QGQC        VAAVEG
Sbjct: 71  GRMVRDRKGFESD--RFKYGVGDELPQSVDWREKGAVAPVKDQGQCGSCWAFSTVAAVEG 128

Query: 152 INAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMST 211
           IN I    L+SLSEQ+LVDC     N GC GGFMD AF++I++N GI  +  Y Y+G+  
Sbjct: 129 INQIATGDLISLSEQELVDCDKG-FNQGCNGGFMDYAFEFIVKNGGIDTEDDYPYKGVD- 186

Query: 212 GICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDAS--ALQFYSGGVFNGY 269
           G CD  +       I  +EDVP NDE+SL KAVA+QPVSVAI+A   A Q Y  G+FNG 
Sbjct: 187 GQCDQNRKNAKVVTINGFEDVPQNDEKSLKKAVAHQPVSVAIEAGGRAFQLYESGIFNGL 246

Query: 270 CETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQ-GQCGIAMFA 328
           C T L+HGV AVGYGT E+G  YW+++NSWG +WGE+GY RL+R++     G+CGIAM  
Sbjct: 247 CGTDLDHGVVAVGYGT-EDGKDYWIVRNSWGPNWGENGYIRLERNVASTNTGKCGIAMQP 305

Query: 329 SFPV 332
           S+P 
Sbjct: 306 SYPT 309


>gi|18407678|ref|NP_566867.1| putative cysteine proteinase [Arabidopsis thaliana]
 gi|30315950|sp|Q9LXW3.1|CPR2_ARATH RecName: Full=Probable cysteine proteinase At3g43960; Flags:
           Precursor
 gi|7594557|emb|CAB88124.1| cysteine proteinase-like protein [Arabidopsis thaliana]
 gi|26452289|dbj|BAC43231.1| putative cysteine proteinase [Arabidopsis thaliana]
 gi|332644328|gb|AEE77849.1| putative cysteine proteinase [Arabidopsis thaliana]
          Length = 376

 Score =  262 bits (670), Expect = 1e-67,   Method: Compositional matrix adjust.
 Identities = 149/329 (45%), Positives = 191/329 (58%), Gaps = 15/329 (4%)

Query: 20  ATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYT 79
           AT    +EG +   +EQW  + G+ Y    E  +RF+IFKDNL  +E  N+    NRSY 
Sbjct: 27  ATESQRNEGEVLTMYEQWLVENGKNYNGLGEKERRFKIFKDNLKRIEEHNSDP--NRSYE 84

Query: 80  LRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQV-PPSVNWIEKGAVTP- 137
             LNKF+DLT  EF AS  G KM   S S  A    + YK   V P  V+W E+GAV P 
Sbjct: 85  RGLNKFSDLTADEFQASYLGGKMEKKSLSDVAE--RYQYKEGDVLPDEVDWRERGAVVPR 142

Query: 138 VKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFK 190
           VK QG+C       A  AVEGIN I    LVSLSEQ+L+DC   ++N GC GG    AF+
Sbjct: 143 VKRQGECGSCWAFAATGAVEGINQITTGELVSLSEQELIDCDRGNDNFGCAGGGAVWAFE 202

Query: 191 YIIQNKGITNDAVYSYEGMSTGICDSIKAE-DHAAQITNYEDVPPNDEESLLKAVANQPV 249
           +I +N GI +D VY Y G  T  C +I+ +      I  +E VP NDE SL KAVA QP+
Sbjct: 203 FIKENGGIVSDEVYGYTGEDTAACKAIEMKTTRVVTINGHEVVPVNDEMSLKKAVAYQPI 262

Query: 250 SVAIDASALQFYSGGVFNGYCETFL-NHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGY 308
           SV I A+ +  Y  GV+ G C     +H V  VGYGTS +   YWLI+NSWG +WGE GY
Sbjct: 263 SVMISAANMSDYKSGVYKGACSNLWGDHNVLIVGYGTSSDEGDYWLIRNSWGPEWGEGGY 322

Query: 309 FRLQRDIDQPQGQCGIAMFASFPVSKESA 337
            RLQR+  +P G+C +A+   +P+   S+
Sbjct: 323 LRLQRNFHEPTGKCAVAVAPVYPIKSNSS 351


>gi|400180463|gb|AFP73368.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score =  262 bits (670), Expect = 1e-67,   Method: Compositional matrix adjust.
 Identities = 145/339 (42%), Positives = 206/339 (60%), Gaps = 19/339 (5%)

Query: 5   FLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVA 64
            LI +  +     +Q   R+  + S++E+ E W +++GR YK+  E  +RF IFK+N+  
Sbjct: 10  ILITLFFVISMFNTQTRARSQPKLSVSERHELWMSRHGRVYKDEVEKGERFMIFKENMKF 69

Query: 65  VERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSD-HSSSLKANGTPFL---YKS 120
           +E  N A  GN SY L +N+FAD+T QEF+A  TG  + + + S    + T F       
Sbjct: 70  IESVNKA--GNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEFKINDLSD 127

Query: 121 SQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCAT 173
             +P +++W E GAVT VK+QG+C       AV ++EG   I    L+  SEQ+L+DC T
Sbjct: 128 DDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCTT 187

Query: 174 NDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVP 233
           N  N GC GGFM +AF +I +N GI+ ++ Y Y G     C S + +  A QI++Y+ V 
Sbjct: 188 N--NYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQY-TCRS-QEKTAAVQISSYQ-VV 242

Query: 234 PNDEESLLKAVANQPVSVAIDASA-LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKY 292
           P  E SLL+AV  QPVS+ I AS  LQFY+GG ++G C   +NH VTA+GYGT E+G KY
Sbjct: 243 PEGETSLLQAVTKQPVSIGIAASQDLQFYAGGTYDGSCADRINHAVTAIGYGTDEKGQKY 302

Query: 293 WLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFP 331
           WL+KNSWG  WGE+G+ ++ RD   P G C IA  +S+P
Sbjct: 303 WLLKNSWGTSWGENGFMKIIRDSGDPSGLCDIAKMSSYP 341


>gi|225428328|ref|XP_002279940.1| PREDICTED: cysteine proteinase-like [Vitis vinifera]
          Length = 707

 Score =  262 bits (670), Expect = 1e-67,   Method: Compositional matrix adjust.
 Identities = 146/312 (46%), Positives = 196/312 (62%), Gaps = 18/312 (5%)

Query: 30  IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLT 89
           +  +FE W +++G+ YK   E   RFE+F++NL  ++  N       SY L LN+FADL+
Sbjct: 400 LIARFESWVSKHGKVYKSMEEKLHRFEVFRENLNHIDERNKEV---SSYWLGLNEFADLS 456

Query: 90  PQEFIASQTGFKMSDHSSSLKANGTPFLYKS-SQVPPSVNWIEKGAVTPVKYQGQCA--- 145
            +EF +   G + ++   S   +G  F Y+  + +P SV+W +KGAVT VK QG C    
Sbjct: 457 HEEFKSKYLGLR-AEFPRSRDYSG-EFRYRDVADLPESVDWRKKGAVTHVKNQGACGSCW 514

Query: 146 ----VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITND 201
               VAAVEGIN I    L +LSEQ+L+DC T   N+GC GG MD AF +I  N G+  +
Sbjct: 515 AFSTVAAVEGINQIVTGNLTTLSEQELIDCDTT-FNSGCNGGLMDYAFAFIASNGGLHKE 573

Query: 202 AVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA--LQ 259
             Y Y  M  G C+  K +     I+ YEDVP  DEESLLKA+A+QP+SVAI+AS    Q
Sbjct: 574 DDYPYL-MEEGTCEEQKEDVDIVTISGYEDVPEKDEESLLKALAHQPLSVAIEASGRDFQ 632

Query: 260 FYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQ 319
           FYSGGVFNG C T L+HGV AVGYG+S+ G+ Y ++KNSWG  WGE GY R++R+  + +
Sbjct: 633 FYSGGVFNGPCGTELDHGVAAVGYGSSK-GLDYIIVKNSWGPKWGEKGYIRMKRNTGKTE 691

Query: 320 GQCGIAMFASFP 331
           G CGI   AS+P
Sbjct: 692 GLCGINKMASYP 703


>gi|302796898|ref|XP_002980210.1| hypothetical protein SELMODRAFT_153766 [Selaginella moellendorffii]
 gi|300151826|gb|EFJ18470.1| hypothetical protein SELMODRAFT_153766 [Selaginella moellendorffii]
          Length = 479

 Score =  262 bits (670), Expect = 2e-67,   Method: Compositional matrix adjust.
 Identities = 150/319 (47%), Positives = 193/319 (60%), Gaps = 28/319 (8%)

Query: 34  FEQWKAQYGRTYKESA--------ENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKF 85
           F+ W  Q+G++Y ++A        E + R+ IFKDNL  +   N     N+ Y L LN F
Sbjct: 57  FDSWMLQHGKSYADNALSGDSQAGEKATRYGIFKDNLRFIHGENEK---NQGYFLGLNAF 113

Query: 86  ADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQV---PPSVNWIEKGAVTPVKYQG 142
           ADLT +EF A + G +     S  + +   F Y S Q+   P S++W EKGAV  VK QG
Sbjct: 114 ADLTNEEFRAQRHGGRFD--RSRERTSHEEFRYGSVQLKDLPDSIDWREKGAVVGVKDQG 171

Query: 143 QC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQN 195
            C       AVAA+EG+N +    LVSLSEQ+LVDC   ++  GC GG MD AF ++I+N
Sbjct: 172 SCGSCWAFSAVAAIEGVNKLATGELVSLSEQELVDCDKGEDE-GCNGGLMDYAFGFVIKN 230

Query: 196 KGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDA 255
            G+  +A Y Y+G  T  CD  K       I  YEDVP NDE +LLKAVA+QPVSVAIDA
Sbjct: 231 GGLDTEADYPYKGYGT-RCDRSKMNAKVVTIDGYEDVPVNDETALLKAVAHQPVSVAIDA 289

Query: 256 --SALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQR 313
             S++QFY  G+F G C T L+HGVT VGYG  E+G  YW+IKNSWG +WGE GY ++ R
Sbjct: 290 GGSSMQFYRSGIFTGRCGTDLDHGVTNVGYG-KEDGKAYWIIKNSWGSNWGEKGYVKMAR 348

Query: 314 DIDQPQGQCGIAMFASFPV 332
           +     G CGI M AS+P 
Sbjct: 349 NTGLAAGLCGINMEASYPT 367


>gi|20334375|gb|AAM19208.1|AF493233_1 cysteine protease [Solanum pennellii]
          Length = 337

 Score =  262 bits (670), Expect = 2e-67,   Method: Compositional matrix adjust.
 Identities = 147/335 (43%), Positives = 204/335 (60%), Gaps = 18/335 (5%)

Query: 5   FLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVA 64
            LI +  +     +Q   R+  + S++E+ E W +++GR YK+  E  +RF IFK+N+  
Sbjct: 10  ILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFMIFKENMKF 69

Query: 65  VERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVP 124
           +E  N A  GN SY L +N+FAD+T QEF+A  TG  + +  S L  +    L     +P
Sbjct: 70  IESVNKA--GNLSYKLGMNEFADITSQEFLAKFTGLNIPN--SYLSPSPINDL-SDDDMP 124

Query: 125 PSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNN 177
            +++W E GAVT VK QGQC       AV ++EG   I    L+  SEQ+L+DC TN  N
Sbjct: 125 SNLDWRESGAVTQVKNQGQCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCTTN--N 182

Query: 178 NGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDE 237
            GC GGFM +AF +I +N GI+ ++ Y Y G     C S + +  A QI++Y+ V P  E
Sbjct: 183 YGCNGGFMTNAFDFIKENGGISRESDYEYLGQQY-TCRS-QEKTAAVQISSYQ-VVPEGE 239

Query: 238 ESLLKAVANQPVSVAIDASA-LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIK 296
            SLL+AV  QPVS+ I AS  LQFY+GG ++G C   +NH VTA+GYGT E+G KYWL+K
Sbjct: 240 TSLLQAVTKQPVSIGIAASQDLQFYAGGTYDGSCANRINHAVTAIGYGTDEKGQKYWLLK 299

Query: 297 NSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFP 331
           NSWG  WGEDG+ ++ RD   P G C IA  +S+P
Sbjct: 300 NSWGTSWGEDGFMKIIRDSGNPAGLCDIAKVSSYP 334


>gi|400180369|gb|AFP73323.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score =  262 bits (670), Expect = 2e-67,   Method: Compositional matrix adjust.
 Identities = 145/339 (42%), Positives = 206/339 (60%), Gaps = 19/339 (5%)

Query: 5   FLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVA 64
            LI +  +     +Q   R+  + S++E+ E W +++GR YK+  E  +RF IFK+N+  
Sbjct: 10  ILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFMIFKENMKF 69

Query: 65  VERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSD-HSSSLKANGTPFL---YKS 120
           +E  N A  GN SY L +N+FAD+T QEF+A  TG  + + + S    + T F       
Sbjct: 70  IESVNKA--GNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEFKINDLSD 127

Query: 121 SQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCAT 173
             +P +++W E GAVT VK+QG+C       AV ++EG   I    L+  SEQ+L+DC T
Sbjct: 128 DDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCTT 187

Query: 174 NDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVP 233
           N  N GC GGFM +AF +I +N GI+ ++ Y Y G     C S + +  A QI++Y+ V 
Sbjct: 188 N--NYGCNGGFMTNAFDFIKENGGISRESDYEYLGQQY-TCRS-QEKTAAVQISSYK-VV 242

Query: 234 PNDEESLLKAVANQPVSVAIDASA-LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKY 292
           P  E SLL+AV  QPVS+ I AS  LQFY+GG ++G C   +NH VTA+GYGT E+G KY
Sbjct: 243 PEGETSLLQAVTKQPVSIGIAASQDLQFYAGGTYDGSCADRINHAVTAIGYGTDEKGQKY 302

Query: 293 WLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFP 331
           WL+KNSWG  WGE+G+ ++ RD   P G C IA  +S+P
Sbjct: 303 WLLKNSWGTSWGENGFMKIIRDSGNPSGLCDIAKMSSYP 341


>gi|359359168|gb|AEV41073.1| putative cysteine protease [Oryza minuta]
          Length = 499

 Score =  262 bits (670), Expect = 2e-67,   Method: Compositional matrix adjust.
 Identities = 144/298 (48%), Positives = 182/298 (61%), Gaps = 16/298 (5%)

Query: 49  AENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSS 108
            E  +RF +F DNL  V+  N  A  +  + L +N+FADLT  EF A+  G   +     
Sbjct: 84  GEYERRFRVFWDNLKFVDAHNARADEHGGFRLGMNRFADLTNDEFRAAYLGTTPAGRGRH 143

Query: 109 LKANGTPFLYKSSQ-VPPSVNWIEKGAVT-PVKYQGQC-------AVAAVEGINAIKINR 159
           +   G  + +   + +P SV+W +KGAV  PVK QGQC       AVAAVEGIN I    
Sbjct: 144 V---GEAYRHDGVEALPDSVDWRDKGAVVAPVKNQGQCGSCWAFSAVAAVEGINKIVTGE 200

Query: 160 LVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKA 219
           LVSLSEQ+LV+CA N  N+GC GG MDDAF +I +N G+  +  Y Y  M  G C+  K 
Sbjct: 201 LVSLSEQELVECARNGANSGCNGGMMDDAFAFIARNGGLDTEEDYPYTAMD-GKCNLAKK 259

Query: 220 EDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA--LQFYSGGVFNGYCETFLNHG 277
                 I  +EDVP NDE SL KAVA+QPVSVAIDA     Q Y  GVF G C T L+HG
Sbjct: 260 SRKVVSIDGFEDVPENDELSLQKAVAHQPVSVAIDAGGREFQLYDSGVFTGRCGTSLDHG 319

Query: 278 VTAVGYGT-SEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVSK 334
           V AVGYGT +  G  YW ++NSWG DWGE+GY R++R++    G+CGIAM AS+P+ K
Sbjct: 320 VVAVGYGTDAATGTDYWTVRNSWGPDWGENGYIRMERNVTARTGKCGIAMMASYPIKK 377


>gi|242055323|ref|XP_002456807.1| hypothetical protein SORBIDRAFT_03g043220 [Sorghum bicolor]
 gi|241928782|gb|EES01927.1| hypothetical protein SORBIDRAFT_03g043220 [Sorghum bicolor]
          Length = 369

 Score =  262 bits (670), Expect = 2e-67,   Method: Compositional matrix adjust.
 Identities = 155/360 (43%), Positives = 215/360 (59%), Gaps = 29/360 (8%)

Query: 1   MAKYFLIVVLIISGSCASQATYRTFDEGSIA------EKFEQWKAQYGRTYKESAENSKR 54
           +AK  L+V L+ + S         FDE  +A      + +E+W+  +   ++   E  +R
Sbjct: 4   LAKTLLLVALV-AMSAVELCRAIEFDERDLASDEALWDLYERWQTHH-HVHRHHGEKGRR 61

Query: 55  FEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSD--HSSSLKAN 112
           F  FK+N+  +   N    G+R Y L LN+F D+  +EF ++    +++D   + S  A 
Sbjct: 62  FGTFKENVRFIHAHNKR--GDRPYRLSLNRFGDMGREEFRSTFADSRINDLRRAESPAAP 119

Query: 113 GTP-FLYKS-SQVPPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSL 163
             P F+Y   + +PPSV+W ++GAVT VK QG C        V +VEGINAI+   LVSL
Sbjct: 120 AVPGFMYDGVTDLPPSVDWRKEGAVTAVKDQGHCGSCWAFSTVVSVEGINAIRTGSLVSL 179

Query: 164 SEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAE-DH 222
           SEQ+L+DC T++N  GC GG M++AF++I    G+T ++ Y Y   S G CDS+++    
Sbjct: 180 SEQELIDCDTDEN--GCQGGLMENAFEFIKSYGGVTTESAYPYRA-SNGTCDSVRSRRGQ 236

Query: 223 AAQITNYEDVPPNDEESLLKAVANQPVSVAIDAS--ALQFYSGGVFNGYCETFLNHGVTA 280
              I  ++ VP   E++L KAVANQPVSVAIDA   A QFYS GVF G C T L+HGV A
Sbjct: 237 IVSIDGHQMVPTGSEDALAKAVANQPVSVAIDAGGQAFQFYSEGVFTGDCGTDLDHGVAA 296

Query: 281 VGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVSKESAQPS 340
           VGYG S++G  YW++KNSWG  WGE GY R+QR      G CGIAM ASFP+ K S  P+
Sbjct: 297 VGYGVSDDGTAYWIVKNSWGPSWGEGGYIRMQRGAGN-GGLCGIAMEASFPI-KTSPNPA 354


>gi|400180361|gb|AFP73319.1| cysteine protease [Solanum peruvianum]
 gi|400180397|gb|AFP73337.1| cysteine protease [Solanum peruvianum]
 gi|400180401|gb|AFP73339.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score =  262 bits (670), Expect = 2e-67,   Method: Compositional matrix adjust.
 Identities = 144/339 (42%), Positives = 206/339 (60%), Gaps = 19/339 (5%)

Query: 5   FLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVA 64
            LI +  +     +Q   R+  + S++E+ E W +++GR YK+  E  +RF IFK+N+  
Sbjct: 10  ILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFMIFKENMKF 69

Query: 65  VERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSD-HSSSLKANGTPFL---YKS 120
           +E  N A  GN SY L +N+FAD+T QEF+A  TG  + + + S    + T F+      
Sbjct: 70  IESVNKA--GNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEFIINDLSD 127

Query: 121 SQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCAT 173
             +P +++W E GAVT VK+QG+C       AV ++EG   I    L+  SEQ+L+DC T
Sbjct: 128 DDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCTT 187

Query: 174 NDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVP 233
           N  N GC GGFM +AF +I +N GI+ ++ Y Y G     C S + +  A QI++Y+ V 
Sbjct: 188 N--NYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQY-TCRS-QEKTAAVQISSYQ-VV 242

Query: 234 PNDEESLLKAVANQPVSVAIDASA-LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKY 292
           P  E SLL+AV  QPVS+ I AS  LQFY+GG ++G C   +NH VTA+GYGT E+G KY
Sbjct: 243 PEGETSLLQAVTKQPVSIGIAASQDLQFYAGGTYDGSCADRINHAVTAIGYGTDEKGQKY 302

Query: 293 WLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFP 331
           WL+KNSWG  WGE+G+ ++ RD   P G C I   +S+P
Sbjct: 303 WLLKNSWGTSWGENGFMKIIRDSGDPSGLCDITKMSSYP 341


>gi|2511693|emb|CAB17076.1| cysteine proteinase precursor [Phaseolus vulgaris]
          Length = 455

 Score =  262 bits (669), Expect = 2e-67,   Method: Compositional matrix adjust.
 Identities = 146/325 (44%), Positives = 202/325 (62%), Gaps = 19/325 (5%)

Query: 19  QATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSY 78
           +AT+RT +E  +   +E+W  ++G+ Y    E  KRF+IFKDNL  +++ N     NR+Y
Sbjct: 27  KATWRTDEE--VNSLYEEWLVKHGKLYNALGEKDKRFQIFKDNLRFIDQQNAE---NRTY 81

Query: 79  TLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQ-VPPSVNWIEKGAVTP 137
            L LN+FADLT +E+ A   G K+  +    +     +  +  + +P SV+W ++GAV P
Sbjct: 82  KLGLNRFADLTNEEYRARYLGTKIDPNRRLGRTPSNRYAPRVGETLPDSVDWRKEGAVVP 141

Query: 138 VKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFK 190
           VK Q  C       A+ AVEGIN I    L+SLSEQ+LVDC T   N GC GG MD AF+
Sbjct: 142 VKDQASCGSCWAFSAIGAVEGINKIVTGDLISLSEQELVDCDTG-YNMGCNGGLMDYAFE 200

Query: 191 YIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVS 250
           +II+N GI ++  Y Y+G+  G CD  +       I  YEDV   DE +L KAVANQPVS
Sbjct: 201 FIIKNGGIDSEEDYPYKGVD-GRCDEYRKNAKVVSIDGYEDVNTYDELALKKAVANQPVS 259

Query: 251 VAIDASA--LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGY 308
           VA++      Q YS GVF G C T L+HGV AVGYGT + G  +W+++NSWG DWGE+GY
Sbjct: 260 VAVEGGGREFQLYSSGVFTGRCGTALDHGVVAVGYGT-DNGHDFWIVRNSWGADWGEEGY 318

Query: 309 FRLQRDIDQPQ-GQCGIAMFASFPV 332
            RL+R++   + G+CGIA+  S+P+
Sbjct: 319 IRLERNLGNSRSGKCGIAIEPSYPI 343


>gi|400180351|gb|AFP73314.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score =  262 bits (669), Expect = 2e-67,   Method: Compositional matrix adjust.
 Identities = 145/339 (42%), Positives = 206/339 (60%), Gaps = 19/339 (5%)

Query: 5   FLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVA 64
            LI +  +     +Q   R+  + S++E+ E W +++GR YK+  E  +RF IFK+N+  
Sbjct: 10  ILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFMIFKENMKF 69

Query: 65  VERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSD-HSSSLKANGTPFL---YKS 120
           +E  N A  GN SY L +N+FAD+T QEF+A  TG  + + + S    + T F       
Sbjct: 70  IESVNKA--GNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEFKINDLSD 127

Query: 121 SQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCAT 173
             +P +++W E GAVT VK+QG+C       AV ++EG   I    L+  SEQ+L+DC T
Sbjct: 128 DDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCTT 187

Query: 174 NDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVP 233
           N  N GC GGFM +AF +I +N GI+ ++ Y Y G     C S + +  A QI++Y+ V 
Sbjct: 188 N--NYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQY-TCRS-QEKTAAVQISSYQ-VV 242

Query: 234 PNDEESLLKAVANQPVSVAIDASA-LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKY 292
           P  E SLL+AV  QPVS+ I AS  LQFY+GG ++G C   +NH VTA+GYGT E+G KY
Sbjct: 243 PEGETSLLQAVTKQPVSIGIAASQDLQFYAGGTYDGSCADRINHAVTAIGYGTDEKGQKY 302

Query: 293 WLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFP 331
           WL+KNSWG  WGE+G+ ++ RD   P G C IA  +S+P
Sbjct: 303 WLLKNSWGTSWGENGFMKIIRDYGNPAGLCDIAKMSSYP 341


>gi|5777889|emb|CAB53515.1| cysteine protease [Solanum tuberosum]
          Length = 466

 Score =  262 bits (669), Expect = 2e-67,   Method: Compositional matrix adjust.
 Identities = 150/347 (43%), Positives = 208/347 (59%), Gaps = 25/347 (7%)

Query: 6   LIVVLIISG-SCASQATYRTFDEGSIAEK--------FEQWKAQYGRTYKESAENSKRFE 56
           L+++LI S  S AS  +  ++DE  I  +        +E W  ++G++Y    E  KRF+
Sbjct: 12  LLLMLIFSTLSSASDMSIISYDETHIHHRSDDEVSALYESWLIEHGKSYNALGEKDKRFQ 71

Query: 57  IFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTP- 115
           IFKDNL  ++  N  ++ N+SY L L KFADLT +E+ +   G K S     L  N +  
Sbjct: 72  IFKDNLKYIDEQN--SVPNQSYKLGLTKFADLTNEEYRSIYLGTKSSGDRRKLSKNKSDR 129

Query: 116 FLYK-SSQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQ 167
           +L K    +P SV+W +KG +  VK QG C       AVAA+E INAI    L+SLSEQ+
Sbjct: 130 YLPKVGDSLPESVDWRDKGVLVGVKDQGSCGSCWAFSAVAAMESINAIVTGNLISLSEQE 189

Query: 168 LVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQIT 227
           LVDC     N GC GG MD AF+++I N GI  +  Y Y+  +  +CD  +      +I 
Sbjct: 190 LVDC-DKSYNEGCDGGLMDYAFEFVINNGGIDTEEDYPYKERND-VCDQYRKNAKVVKID 247

Query: 228 NYEDVPPNDEESLLKAVANQPVSVAIDASA--LQFYSGGVFNGYCETFLNHGVTAVGYGT 285
           +YEDVP N+E++L KAVA+QPVS+AI+A    LQ Y  G+F G C T ++HGV A GYG 
Sbjct: 248 SYEDVPVNNEKALQKAVAHQPVSIAIEAGGRDLQHYKSGIFTGKCGTAVDHGVVAAGYG- 306

Query: 286 SEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
           SE G+ YW+++NSWG  WGE GY R+QR++    G CG+A   S+PV
Sbjct: 307 SENGMDYWIVRNSWGAKWGEKGYLRVQRNVASSSGLCGLATEPSYPV 353


>gi|218195711|gb|EEC78138.1| hypothetical protein OsI_17694 [Oryza sativa Indica Group]
          Length = 458

 Score =  262 bits (669), Expect = 2e-67,   Method: Compositional matrix adjust.
 Identities = 144/311 (46%), Positives = 188/311 (60%), Gaps = 14/311 (4%)

Query: 34  FEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAA-IGNRSYTLRLNKFADLTPQE 92
           + +WKA++G+ Y    E  +R+  F+DNL  ++  N AA  G  S+ L LN+FADLT +E
Sbjct: 40  YAEWKAEHGKNYNAVGEEERRYAAFRDNLRYIDEHNAAADAGVHSFRLGLNRFADLTNEE 99

Query: 93  FIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQC-------A 145
           +  +  G +        K +       +  +P SV+W  KGAV  +K QG C       A
Sbjct: 100 YRDTYLGLRNKPRRER-KVSDRYLAADNEALPESVDWRTKGAVAEIKDQGGCGSCWAFSA 158

Query: 146 VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYS 205
           +AAVEGIN I    L+SLSEQ+LVDC T+  N GC GG MD AF +II N GI  +  Y 
Sbjct: 159 IAAVEGINQIVTGDLISLSEQELVDCDTS-YNEGCNGGLMDYAFDFIINNGGIDTEDDYP 217

Query: 206 YEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDAS--ALQFYSG 263
           Y+G     CD  +       I +YEDV PN E SL KAVANQPVSVAI+A   A Q YS 
Sbjct: 218 YKGKDE-RCDVNRKNAKVVTIDSYEDVTPNSETSLQKAVANQPVSVAIEAGGRAFQLYSS 276

Query: 264 GVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCG 323
           G+F G C T L+HGV AVGYGT E G  YW+++NSWG+ WGE GY R++R+I    G+CG
Sbjct: 277 GIFTGKCGTALDHGVAAVGYGT-ENGKDYWIVRNSWGKSWGESGYVRMERNIKASSGKCG 335

Query: 324 IAMFASFPVSK 334
           IA+  S+P+ K
Sbjct: 336 IAVEPSYPLKK 346


>gi|168017893|ref|XP_001761481.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162687165|gb|EDQ73549.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 471

 Score =  262 bits (669), Expect = 2e-67,   Method: Compositional matrix adjust.
 Identities = 145/320 (45%), Positives = 192/320 (60%), Gaps = 18/320 (5%)

Query: 26  DEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKF 85
           D+G + + F QW  ++ R Y   +E  +RF+IFKDNL  +   N      +SY L LNKF
Sbjct: 45  DDGML-DVFHQWLERHSRVYHSLSEKQRRFQIFKDNLHYIHNHNKQ---EKSYWLGLNKF 100

Query: 86  ADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQC- 144
           +DLT  EF A   G + +  +  L+ NG  F+Y+       V+W +KGAV+ VK QG C 
Sbjct: 101 SDLTHDEFRALYLGIRPAGRAHGLR-NGDRFIYEDVVAEEMVDWRKKGAVSDVKDQGSCG 159

Query: 145 ------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGI 198
                 A+ +VEG+NAI    L+SLSEQ+LVDC     N GC GG MD AF +II+N GI
Sbjct: 160 SCWAFSAIGSVEGVNAIVTGELISLSEQELVDCDRG-QNQGCNGGLMDYAFDFIIKNGGI 218

Query: 199 TNDAVYSYEGMSTGICDSIKAE-DHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA 257
             +  Y Y+  + G CD  + E      I +Y+DVP   E SLLKAV+  PVSVAI+A  
Sbjct: 219 DTEEDYPYKA-TDGQCDEARKETSKVVVIDDYQDVPTKSESSLLKAVSKNPVSVAIEAGG 277

Query: 258 --LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQR-D 314
              Q Y GGVF G C T L+HGV AVGYGT ++G+ YW++KNSWG  WGE GY R++R  
Sbjct: 278 RDFQHYQGGVFTGPCGTDLDHGVLAVGYGTDDDGVNYWIVKNSWGPSWGEKGYIRMERMG 337

Query: 315 IDQPQGQCGIAMFASFPVSK 334
            +   G+CGI +  SFP+ K
Sbjct: 338 SNSTSGKCGINIEPSFPIKK 357


>gi|400180455|gb|AFP73364.1| cysteine protease [Solanum peruvianum]
 gi|400180459|gb|AFP73366.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score =  262 bits (669), Expect = 2e-67,   Method: Compositional matrix adjust.
 Identities = 145/339 (42%), Positives = 206/339 (60%), Gaps = 19/339 (5%)

Query: 5   FLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVA 64
            LI +  +     +Q   R+  + S++E+ E W +++GR YK+  E  +RF IFK+N+  
Sbjct: 10  ILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFMIFKENMKF 69

Query: 65  VERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSD-HSSSLKANGTPFL---YKS 120
           +E  N A  GN SY L +N+FAD+T QEF+A  TG  + + + S    + T F       
Sbjct: 70  IESVNKA--GNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEFKINDLSD 127

Query: 121 SQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCAT 173
             +P +++W E GAVT VK+QG+C       AV ++EG   I    L+  SEQ+L+DC T
Sbjct: 128 DDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCTT 187

Query: 174 NDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVP 233
           N  N GC GGFM +AF +I +N GI+ ++ Y Y G     C S + +  A QI++Y+ V 
Sbjct: 188 N--NYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQY-TCRS-QEKTAAVQISSYQ-VV 242

Query: 234 PNDEESLLKAVANQPVSVAIDASA-LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKY 292
           P  E SLL+AV  QPVS+ I AS  LQFY+GG ++G C   +NH VTA+GYGT E+G KY
Sbjct: 243 PEGETSLLQAVTKQPVSIGIAASQDLQFYAGGTYDGSCADRINHAVTAIGYGTDEKGQKY 302

Query: 293 WLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFP 331
           WL+KNSWG  WGE+G+ ++ RD   P G C IA  +S+P
Sbjct: 303 WLLKNSWGTSWGENGFMKIIRDSGDPSGLCDIAKMSSYP 341


>gi|226502454|ref|NP_001140922.1| hypothetical protein [Zea mays]
 gi|223948637|gb|ACN28402.1| unknown [Zea mays]
 gi|413920877|gb|AFW60809.1| hypothetical protein ZEAMMB73_830238 [Zea mays]
          Length = 354

 Score =  262 bits (669), Expect = 2e-67,   Method: Compositional matrix adjust.
 Identities = 148/322 (45%), Positives = 199/322 (61%), Gaps = 22/322 (6%)

Query: 25  FDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNK 84
           + E ++  + +QW A++GRTY++ AE + RF++FK N   V+  N A    +SY + LN+
Sbjct: 42  YGEEAMKVRHQQWMAEHGRTYRDEAEKAHRFQVFKANADFVDASNAAGDDKKSYRMELNE 101

Query: 85  FADLTPQEFIASQTGFKMSDHSSSLKAN---GTPFLYKSSQVPPSVNWIEKGAVTPVKYQ 141
           FAD+T  EF+A  TG +     +   A    G   L  +     +V+W +KGAVT +K Q
Sbjct: 102 FADMTNDEFMAMYTGLRPVPAGAKKMAGFKYGNVTLSDADDNQQTVDWRQKGAVTGIKNQ 161

Query: 142 GQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQ 194
           GQC       AVAAVEGI+ I    LVSLSEQQ++DC T + NNGC GG++D+AF+YI  
Sbjct: 162 GQCGCCWAFAAVAAVEGIHQITTGNLVSLSEQQVLDCDT-EGNNGCNGGYIDNAFQYIAG 220

Query: 195 NKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAID 254
           N G+  +  Y Y   +  +C S++     A I+ Y+DVP  DE +L  AVANQPVSVAID
Sbjct: 221 NGGLATEDAYPYTA-AQAMCQSVQP---VAAISGYQDVPSGDEAALAAAVANQPVSVAID 276

Query: 255 ASALQFYSGGVFNGY-CET--FLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRL 311
           A   Q Y GGV     C T   LNH VTAVGYGT+E+G  YWL+KN WGQ+WGE GY RL
Sbjct: 277 AHNFQLYGGGVMTAASCSTPPNLNHAVTAVGYGTAEDGTPYWLLKNQWGQNWGEGGYLRL 336

Query: 312 QRDIDQPQGQCGIAMFASFPVS 333
           +R  +     CG+A  AS+PV+
Sbjct: 337 ERGAN----ACGVAQQASYPVA 354


>gi|89274062|dbj|BAE80740.1| cysteine proteinase [Platycodon grandiflorus]
          Length = 462

 Score =  262 bits (669), Expect = 2e-67,   Method: Compositional matrix adjust.
 Identities = 149/327 (45%), Positives = 199/327 (60%), Gaps = 20/327 (6%)

Query: 17  ASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNR 76
           AS++++RT DE  +   +E W  ++G++Y    E  KRF+IFKDNL  ++  N  A  N 
Sbjct: 35  ASKSSWRTDDE--VMAMYESWLVKHGKSYNALGEKEKRFQIFKDNLRFIDEHN--AEENL 90

Query: 77  SYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANG-TPFLYKSSQVPPSVNWIEKGAV 135
           SY + LN+FADLT +E+ ++  G K     S +K++   P +  S  +P SV+W  KGAV
Sbjct: 91  SYKVGLNRFADLTNEEYRSTYLGAKSKPKLSKVKSDRYAPRVGDS--LPESVDWRAKGAV 148

Query: 136 TPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDA 188
            P+K QG C        V AVEGIN I    L++LSEQ+LVDC     N GC GG MD  
Sbjct: 149 APIKDQGSCGSCWAFSTVNAVEGINQIVTGELITLSEQELVDC-DKSYNEGCDGGLMDYG 207

Query: 189 FKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQP 248
           F++II N GI  D  Y Y G     CD  +       I +YEDVP N+EE+L KAVA+QP
Sbjct: 208 FEFIINNGGIDTDKDYPYLGRDA-RCDQYRKNAKVVTIDSYEDVPVNNEEALKKAVASQP 266

Query: 249 VSVAIDAS--ALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGED 306
           VSV I+    A QFY  G+F G C T L+HGV  VGYGT E+G  YW+++NSWG  WGE 
Sbjct: 267 VSVGIEGGGRAFQFYDSGIFTGKCGTALDHGVNVVGYGT-EKGKDYWIVRNSWGSSWGEA 325

Query: 307 GYFRLQRDI-DQPQGQCGIAMFASFPV 332
           GY R++R++     G+CGIAM  S+P+
Sbjct: 326 GYIRMERNLAGTSVGKCGIAMEPSYPL 352


>gi|400180403|gb|AFP73340.1| cysteine protease [Solanum peruvianum]
 gi|400180413|gb|AFP73345.1| cysteine protease [Solanum peruvianum]
 gi|400180415|gb|AFP73346.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score =  262 bits (669), Expect = 2e-67,   Method: Compositional matrix adjust.
 Identities = 145/339 (42%), Positives = 206/339 (60%), Gaps = 19/339 (5%)

Query: 5   FLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVA 64
            LI +  +     +Q   R+  + S++E+ E W +++GR YK+  E  +RF IFK+N+  
Sbjct: 10  ILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFMIFKENMKF 69

Query: 65  VERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSD-HSSSLKANGTPFL---YKS 120
           +E  N A  GN SY L +N+FAD+T QEF+A  TG  + + + S    + T F       
Sbjct: 70  IESVNKA--GNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEFKINDLSD 127

Query: 121 SQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCAT 173
             +P +++W E GAVT VK+QG+C       AV ++EG   I    L+  SEQ+L+DC T
Sbjct: 128 DDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCTT 187

Query: 174 NDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVP 233
           N  N GC GGFM +AF +I +N GI+ ++ Y Y G     C S + +  A QI++Y+ V 
Sbjct: 188 N--NYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQY-TCRS-QEKTAAVQISSYQ-VV 242

Query: 234 PNDEESLLKAVANQPVSVAIDASA-LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKY 292
           P  E SLL+AV  QPVS+ I AS  LQFY+GG ++G C   +NH VTA+GYGT E+G KY
Sbjct: 243 PEGETSLLQAVTKQPVSIGIAASQDLQFYAGGTYDGSCADRINHAVTAIGYGTDEKGQKY 302

Query: 293 WLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFP 331
           WL+KNSWG  WGE+G+ ++ RD   P G C IA  +S+P
Sbjct: 303 WLLKNSWGTSWGENGFMKIIRDSGNPAGLCDIAKMSSYP 341


>gi|400180441|gb|AFP73357.1| cysteine protease [Solanum habrochaites]
          Length = 344

 Score =  261 bits (668), Expect = 2e-67,   Method: Compositional matrix adjust.
 Identities = 146/339 (43%), Positives = 203/339 (59%), Gaps = 19/339 (5%)

Query: 5   FLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVA 64
            LI +  +     SQ   R+  + S++E+ E W +++GR YK+  E  +RF IFK+N+  
Sbjct: 10  ILITLFFVISMFNSQTRARSQPKLSVSERHELWMSRHGRVYKDEVEKGERFMIFKENMKF 69

Query: 65  VERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSD-HSSSLKANGTPFLYKS--- 120
           +E  N A  GN SY L +N+FAD+T +EF+A  TG  + + + S    + T F       
Sbjct: 70  IESVNKA--GNLSYKLGMNEFADITSEEFLAKFTGLNIPNSYLSPSPMSSTEFKINDISD 127

Query: 121 SQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCAT 173
             +P +++W E GAVT VK QGQC       AV ++EG   I    L+  SEQ+L+DC T
Sbjct: 128 DDMPSNLDWRESGAVTQVKNQGQCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCTT 187

Query: 174 NDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVP 233
           N  N GC GGFM +AF +I +N GI+ ++ Y Y G     C S + +  A QI++Y+ V 
Sbjct: 188 N--NYGCNGGFMTNAFDFIRENGGISRESDYEYLGQQY-TCRS-QEKTAAVQISSYQ-VV 242

Query: 234 PNDEESLLKAVANQPVSVAIDASA-LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKY 292
           P  E SLL+AV  QPVS+ I AS  LQFY+GG ++G C   +NH VTA+GYGT E G KY
Sbjct: 243 PEGETSLLQAVTKQPVSIGIAASQDLQFYAGGTYDGSCANRINHAVTAIGYGTDENGQKY 302

Query: 293 WLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFP 331
           WL+KNSWG  WGE G+ ++ RD   P G C IA  +S+P
Sbjct: 303 WLLKNSWGTSWGEKGFMKIIRDYGNPSGLCDIAKLSSYP 341


>gi|400180407|gb|AFP73342.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score =  261 bits (668), Expect = 3e-67,   Method: Compositional matrix adjust.
 Identities = 145/339 (42%), Positives = 206/339 (60%), Gaps = 19/339 (5%)

Query: 5   FLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVA 64
            LI +  +     +Q   R+  + S++E+ E W +++GR YK+  E  +RF IFK+N+  
Sbjct: 10  ILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKVERFMIFKENMKF 69

Query: 65  VERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSD-HSSSLKANGTPFL---YKS 120
           +E  N A  GN SY L +N+FAD+T QEF+A  TG  + + + S    + T F       
Sbjct: 70  IESVNKA--GNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEFKINDLSD 127

Query: 121 SQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCAT 173
             +P +++W E GAVT VK+QG+C       AV ++EG   I    L+  SEQ+L+DC T
Sbjct: 128 DDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCTT 187

Query: 174 NDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVP 233
           N  N GC GGFM +AF +I +N GI+ ++ Y Y G     C S + +  A QI++Y+ V 
Sbjct: 188 N--NYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQY-TCRS-QEKTAAVQISSYQ-VV 242

Query: 234 PNDEESLLKAVANQPVSVAIDASA-LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKY 292
           P  E SLL+AV  QPVS+ I AS  LQFY+GG ++G C   +NH VTA+GYGT E+G KY
Sbjct: 243 PEGETSLLQAVTKQPVSIGIAASQDLQFYAGGTYDGSCADRINHAVTAIGYGTDEKGQKY 302

Query: 293 WLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFP 331
           WL+KNSWG  WGE+G+ ++ RD   P G C IA  +S+P
Sbjct: 303 WLLKNSWGTSWGENGFMKIIRDSGNPSGLCDIAKMSSYP 341


>gi|14517542|gb|AAK62661.1| F2G19.31/F2G19.31 [Arabidopsis thaliana]
 gi|19548039|gb|AAL87383.1| F2G19.31/F2G19.31 [Arabidopsis thaliana]
          Length = 462

 Score =  261 bits (668), Expect = 3e-67,   Method: Compositional matrix adjust.
 Identities = 152/355 (42%), Positives = 205/355 (57%), Gaps = 33/355 (9%)

Query: 1   MAKYFLIVVLIISGSCASQATYRT---------FDEGSIAEKFEQWKAQYGRTYKESA-- 49
           MA  FL +V + S    S  +Y             E  +   +E W  ++G+   +++  
Sbjct: 8   MAILFLAMVTVSSAVDMSIISYDEKHGVSTTGGRSEAEVMSIYEAWLVKHGKAQSQNSLV 67

Query: 50  ENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMS---DHS 106
           E  +RFEIFKDNL  V+  N     N SY L L +FADLT  E+ +   G KM    +  
Sbjct: 68  EKDRRFEIFKDNLRFVDEHNEK---NLSYRLGLTRFADLTNDEYRSKYLGAKMEKKGERR 124

Query: 107 SSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINR 159
           +SL+           ++P S++W +KGAV  VK QG C        + AVEGIN I    
Sbjct: 125 TSLRYEARV----GDELPESIDWRKKGAVAEVKDQGGCGSCWAFSTIGAVEGINQIVTGD 180

Query: 160 LVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKA 219
           L++LSEQ+LVDC T+  N GC GG MD AF++II+N GI  D  Y Y+G+  G CD I+ 
Sbjct: 181 LITLSEQELVDCDTS-YNEGCNGGLMDYAFEFIIKNGGIDTDKDYPYKGVD-GTCDQIRK 238

Query: 220 EDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDAS--ALQFYSGGVFNGYCETFLNHG 277
                 I +YEDVP   EESL KAVA+QP+S+AI+A   A Q Y  G+F+G C T L+HG
Sbjct: 239 NAKVVTIDSYEDVPTYSEESLKKAVAHQPISIAIEAGGRAFQLYDSGIFDGSCGTQLDHG 298

Query: 278 VTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
           V AVGYGT E G  YW+++NSWG+ WGE GY R+ R+I    G+CGIA+  S+P+
Sbjct: 299 VVAVGYGT-ENGKDYWIVRNSWGKSWGESGYLRMARNIASSSGKCGIAIEPSYPI 352


>gi|595986|gb|AAA79915.1| cysteine proteinase, partial [Dianthus caryophyllus]
          Length = 427

 Score =  261 bits (668), Expect = 3e-67,   Method: Compositional matrix adjust.
 Identities = 143/313 (45%), Positives = 195/313 (62%), Gaps = 14/313 (4%)

Query: 31  AEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNR--SYTLRLNKFADL 88
           A   + W  ++ + Y    E  KRF IF+DNL  +++ NN   G     + L LNKFADL
Sbjct: 2   AYHLQSWLVKHRKNYNALGEKEKRFAIFRDNLEFIDQHNNNNNGGGGGEFELGLNKFADL 61

Query: 89  TPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQC---- 144
           T  EF     G K  + + S+K++    + +  ++P SV+W +KGAV+ VK QGQC    
Sbjct: 62  TNDEFRRIYFGVKRPEKAESVKSDRYA-VKEGDELPESVDWRKKGAVSHVKDQGQCGSCW 120

Query: 145 ---AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITND 201
              A+ AVEGIN I    L++LSEQ+LVDC T+  N+GC GG MD AF++II N GI  D
Sbjct: 121 AFSAIGAVEGINKIVTGDLITLSEQELVDCDTS-YNSGCDGGLMDYAFRFIINNGGIDTD 179

Query: 202 AVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA--LQ 259
             Y Y+  + G CDS +       I   EDVP N+E++L KAVA+QPV +AI+A     Q
Sbjct: 180 KDYPYKA-TDGSCDSNRKNAKVVTIDGLEDVPANNEKALQKAVAHQPVRLAIEAGGRDFQ 238

Query: 260 FYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQ 319
            Y  GVF G C T L+HGV AVGYGT+++G  YW+++NSWG DWGEDGY R++R+ +   
Sbjct: 239 LYKSGVFTGSCGTSLDHGVVAVGYGTTDDGKDYWIVRNSWGDDWGEDGYIRMERNTESKS 298

Query: 320 GQCGIAMFASFPV 332
           G+CGIA+  S+PV
Sbjct: 299 GKCGIAIEPSYPV 311


>gi|18401614|ref|NP_564497.1| cysteine proteinase RD21a [Arabidopsis thaliana]
 gi|1172873|sp|P43297.1|RD21A_ARATH RecName: Full=Cysteine proteinase RD21a; Short=RD21; Flags:
           Precursor
 gi|12321010|gb|AAG50628.1|AC083835_13 cysteine protease, putative [Arabidopsis thaliana]
 gi|435619|dbj|BAA02374.1| thiol protease [Arabidopsis thaliana]
 gi|18175926|gb|AAL59952.1| putative cysteine proteinase RD21A [Arabidopsis thaliana]
 gi|22136972|gb|AAM91715.1| putative cysteine proteinase RD21A [Arabidopsis thaliana]
 gi|332194014|gb|AEE32135.1| cysteine proteinase RD21a [Arabidopsis thaliana]
          Length = 462

 Score =  261 bits (668), Expect = 3e-67,   Method: Compositional matrix adjust.
 Identities = 144/320 (45%), Positives = 194/320 (60%), Gaps = 24/320 (7%)

Query: 27  EGSIAEKFEQWKAQYGRTYKESA--ENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNK 84
           E  +   +E W  ++G+   +++  E  +RFEIFKDNL  V+  N     N SY L L +
Sbjct: 43  EAEVMSIYEAWLVKHGKAQSQNSLVEKDRRFEIFKDNLRFVDEHNEK---NLSYRLGLTR 99

Query: 85  FADLTPQEFIASQTGFKMS---DHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQ 141
           FADLT  E+ +   G KM    +  +SL+           ++P S++W +KGAV  VK Q
Sbjct: 100 FADLTNDEYRSKYLGAKMEKKGERRTSLRYEARV----GDELPESIDWRKKGAVAEVKDQ 155

Query: 142 GQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQ 194
           G C        + AVEGIN I    L++LSEQ+LVDC T+  N GC GG MD AF++II+
Sbjct: 156 GGCGSCWAFSTIGAVEGINQIVTGDLITLSEQELVDCDTS-YNEGCNGGLMDYAFEFIIK 214

Query: 195 NKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAID 254
           N GI  D  Y Y+G+  G CD I+       I +YEDVP   EESL KAVA+QP+S+AI+
Sbjct: 215 NGGIDTDKDYPYKGVD-GTCDQIRKNAKVVTIDSYEDVPTYSEESLKKAVAHQPISIAIE 273

Query: 255 AS--ALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQ 312
           A   A Q Y  G+F+G C T L+HGV AVGYGT E G  YW+++NSWG+ WGE GY R+ 
Sbjct: 274 AGGRAFQLYDSGIFDGSCGTQLDHGVVAVGYGT-ENGKDYWIVRNSWGKSWGESGYLRMA 332

Query: 313 RDIDQPQGQCGIAMFASFPV 332
           R+I    G+CGIA+  S+P+
Sbjct: 333 RNIASSSGKCGIAIEPSYPI 352


>gi|400180379|gb|AFP73328.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score =  261 bits (668), Expect = 3e-67,   Method: Compositional matrix adjust.
 Identities = 145/339 (42%), Positives = 206/339 (60%), Gaps = 19/339 (5%)

Query: 5   FLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVA 64
            LI +  +     +Q   R+  + S++E+ E W +++GR YK+  E  +RF IFK+N+  
Sbjct: 10  ILITLFFVITMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFMIFKENMKF 69

Query: 65  VERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSD-HSSSLKANGTPFL---YKS 120
           +E  N A  GN SY L +N+FAD+T QEF+A  TG  + + + S    + T F       
Sbjct: 70  IESVNKA--GNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEFKINDLSD 127

Query: 121 SQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCAT 173
             +P +++W E GAVT VK+QG+C       AV ++EG   I    L+  SEQ+L+DC T
Sbjct: 128 DDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCTT 187

Query: 174 NDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVP 233
           N  N GC GGFM +AF +I +N GI+ ++ Y Y G     C S + +  A QI++Y+ V 
Sbjct: 188 N--NYGCDGGFMTNAFDFIKENGGISRESDYEYLGEQY-TCRS-QEKTAAVQISSYQ-VV 242

Query: 234 PNDEESLLKAVANQPVSVAIDASA-LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKY 292
           P  E SLL+AV  QPVS+ I AS  LQFY+GG ++G C   +NH VTA+GYGT E+G KY
Sbjct: 243 PEGETSLLQAVTKQPVSIGIAASQDLQFYAGGTYDGSCADRINHAVTAIGYGTDEKGQKY 302

Query: 293 WLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFP 331
           WL+KNSWG  WGE+G+ ++ RD   P G C IA  +S+P
Sbjct: 303 WLLKNSWGTSWGENGFMKIIRDSGNPSGLCDIAKMSSYP 341


>gi|62320725|dbj|BAD95392.1| cysteine proteinase RD21A [Arabidopsis thaliana]
          Length = 433

 Score =  261 bits (668), Expect = 3e-67,   Method: Compositional matrix adjust.
 Identities = 144/320 (45%), Positives = 194/320 (60%), Gaps = 24/320 (7%)

Query: 27  EGSIAEKFEQWKAQYGRTYKESA--ENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNK 84
           E  +   +E W  ++G+   +++  E  +RFEIFKDNL  V+  N     N SY L L +
Sbjct: 43  EAEVMSIYEAWLVKHGKAQSQNSLVEKDRRFEIFKDNLRFVDEHNEK---NLSYRLGLTR 99

Query: 85  FADLTPQEFIASQTGFKMS---DHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQ 141
           FADLT  E+ +   G KM    +  +SL+           ++P S++W +KGAV  VK Q
Sbjct: 100 FADLTNDEYRSKYLGAKMEKKGERRTSLRYEARV----GDELPESIDWRKKGAVAEVKDQ 155

Query: 142 GQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQ 194
           G C        + AVEGIN I    L++LSEQ+LVDC T+  N GC GG MD AF++II+
Sbjct: 156 GGCGSCWAFSTIGAVEGINQIVTGDLITLSEQELVDCDTS-YNEGCNGGLMDYAFEFIIK 214

Query: 195 NKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAID 254
           N GI  D  Y Y+G+  G CD I+       I +YEDVP   EESL KAVA+QP+S+AI+
Sbjct: 215 NGGIDTDKDYPYKGVD-GTCDQIRKNAKVVTIDSYEDVPTYSEESLKKAVAHQPISIAIE 273

Query: 255 AS--ALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQ 312
           A   A Q Y  G+F+G C T L+HGV AVGYGT E G  YW+++NSWG+ WGE GY R+ 
Sbjct: 274 AGGRAFQLYDSGIFDGSCGTQLDHGVVAVGYGT-ENGKDYWIVRNSWGKSWGESGYLRMA 332

Query: 313 RDIDQPQGQCGIAMFASFPV 332
           R+I    G+CGIA+  S+P+
Sbjct: 333 RNIASSSGKCGIAIEPSYPI 352


>gi|242032709|ref|XP_002463749.1| hypothetical protein SORBIDRAFT_01g005350 [Sorghum bicolor]
 gi|241917603|gb|EER90747.1| hypothetical protein SORBIDRAFT_01g005350 [Sorghum bicolor]
          Length = 381

 Score =  261 bits (667), Expect = 3e-67,   Method: Compositional matrix adjust.
 Identities = 154/347 (44%), Positives = 207/347 (59%), Gaps = 33/347 (9%)

Query: 25  FDEGSIA------EKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNR-S 77
           FDE  +A      + +E+W+  + R ++   E  +RF  FK+N+  +   N    G+R S
Sbjct: 31  FDERDLASDEALWDLYERWQTHH-RVHRHHGEKGRRFGTFKENVRFIHAHNKR--GDRPS 87

Query: 78  YTLRLNKFADLTPQEFIASQTGFKMSD----HSSSLKANGTP-FLYK-SSQVPPSVNWIE 131
           Y LRLN+F D+ P+EF ++    +++D      SS  A   P F+Y  ++ VP SV+W +
Sbjct: 88  YRLRLNRFGDMGPEEFRSTFADSRINDLRRYRESSPAATAVPGFMYDDATDVPRSVDWRQ 147

Query: 132 KGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGF 184
            GAVT VK QG+C        V AVEGINAI+   LVSLSEQ+LVDC T +N  GC GG 
Sbjct: 148 HGAVTAVKNQGRCGSCWAFSTVVAVEGINAIRTGSLVSLSEQELVDCDTAEN--GCQGGL 205

Query: 185 MDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQIT--NYEDVPPNDEESLLK 242
           M++AF +I    GIT ++ Y Y   S G CD ++A      ++   ++ VP   E++L K
Sbjct: 206 MENAFDFIKSYGGITTESAYPYRA-SNGTCDGMRARRGRVHVSIDGHQMVPTGSEDALAK 264

Query: 243 AVANQPVSVAIDAS--ALQFYSGGVFNGYCETFLNHGVTAVGYGTSE-EGIKYWLIKNSW 299
           AVA QPVSVAIDA   A QFYS GVF G C T L+HGV  VGYG S+ +G  YW++KNSW
Sbjct: 265 AVARQPVSVAIDAGGQAFQFYSEGVFTGDCGTDLDHGVAVVGYGVSDVDGTPYWIVKNSW 324

Query: 300 GQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVSKESAQPSSADKSS 346
           G  WGE GY R+QR      G CGIAM ASFP+ K S  P+   + +
Sbjct: 325 GPSWGEGGYIRMQRGAGN-GGLCGIAMEASFPI-KTSHNPARKPRRA 369


>gi|297733654|emb|CBI14901.3| unnamed protein product [Vitis vinifera]
          Length = 273

 Score =  261 bits (667), Expect = 3e-67,   Method: Compositional matrix adjust.
 Identities = 139/270 (51%), Positives = 173/270 (64%), Gaps = 14/270 (5%)

Query: 88  LTPQEFIASQTGFKMSDHS---SSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQC 144
           +T  EF ++  G K++ H     S  A G+    K   VPPSV+W +KGAVTP+K QGQC
Sbjct: 1   MTNHEFRSTYAGSKVNHHRMFRGSQHAAGSFMYEKVKSVPPSVDWRKKGAVTPIKDQGQC 60

Query: 145 A-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKG 197
                   V AVEGIN IK N+LVSLSEQ+LVDC T++N  GC GG M  AF++I +  G
Sbjct: 61  GSCWAFSTVVAVEGINHIKTNKLVSLSEQELVDCDTSENQ-GCNGGLMGYAFEFIKEKGG 119

Query: 198 ITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDA-- 255
           IT +  Y Y     G CD  K       I  +E VPPN+E++LLKA ANQP+SVAIDA  
Sbjct: 120 ITTEQSYPYTA-EDGTCDVSKVNSPVVSIDGHETVPPNNEDALLKAAANQPISVAIDAGG 178

Query: 256 SALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDI 315
           SA QFYS GVF G C T L+HGV  VGYGT+ +G KYW++KNSWG DWGE+GY R++R I
Sbjct: 179 SAFQFYSEGVFAGRCGTDLDHGVAIVGYGTTLDGTKYWIVKNSWGTDWGENGYIRMKRGI 238

Query: 316 DQPQGQCGIAMFASFPVSKESAQPSSADKS 345
              +G CGIA+ AS+P+   S  P  A  S
Sbjct: 239 SAKEGLCGIAVEASYPIKNSSTNPVGAPSS 268


>gi|400180373|gb|AFP73325.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score =  261 bits (666), Expect = 4e-67,   Method: Compositional matrix adjust.
 Identities = 145/339 (42%), Positives = 206/339 (60%), Gaps = 19/339 (5%)

Query: 5   FLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVA 64
            LI +  +     +Q   R+  + S++E+ E W +++G  YK+  E  +RF IFK+N+  
Sbjct: 10  ILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGHVYKDEVEKGERFMIFKENMKF 69

Query: 65  VERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSD-HSSSLKANGTPFL---YKS 120
           +E  N A  GN SY L +N+FAD+T QEF+A  TG  + + + S    + T F       
Sbjct: 70  IESVNKA--GNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEFKINDLSD 127

Query: 121 SQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCAT 173
             +P +++W E GAVT VK+QGQC       AV ++EG   I    L+  SEQ+L+DC T
Sbjct: 128 DDMPSNLDWRESGAVTQVKHQGQCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCTT 187

Query: 174 NDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVP 233
           N  N GC GGFM +AF +I +N GI++++ Y Y G     C S + +  A QI++Y+ V 
Sbjct: 188 N--NYGCDGGFMTNAFDFIKENGGISSESDYEYLGEQY-TCRS-QEKTAAVQISSYQ-VV 242

Query: 234 PNDEESLLKAVANQPVSVAIDASA-LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKY 292
           P  E SLL+AV  QPVS+ I AS  LQFY+GG ++G C   +NH VTA+GYGT E+G KY
Sbjct: 243 PEGETSLLQAVTKQPVSIGIAASQDLQFYAGGTYDGSCADRINHAVTAIGYGTDEKGQKY 302

Query: 293 WLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFP 331
           WL+KNSWG  WGE+G+ ++ RD   P G C IA  +S+P
Sbjct: 303 WLLKNSWGTSWGENGFMKIIRDSGNPAGLCDIAKMSSYP 341


>gi|255635584|gb|ACU18142.1| unknown [Glycine max]
          Length = 345

 Score =  261 bits (666), Expect = 5e-67,   Method: Compositional matrix adjust.
 Identities = 146/312 (46%), Positives = 194/312 (62%), Gaps = 18/312 (5%)

Query: 30  IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLT 89
           + E FE W +++G+ Y+   E   RFEIFKDNL  ++   N  + N  Y L LN+FADL+
Sbjct: 43  LIELFESWMSKHGKIYQSIEEKLLRFEIFKDNLKHIDE-RNKVVSN--YWLGLNEFADLS 99

Query: 90  PQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQCA---- 145
            QEF     G K+ D+S   + +   F YK  ++P SV+W +KGAV PVK QG C     
Sbjct: 100 HQEFKNKYLGLKV-DYSRR-RESPEEFTYKDVELPKSVDWRKKGAVAPVKNQGSCGSCWA 157

Query: 146 ---VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDA 202
              VAAVEGIN I    L SLSEQ+L+DC     +NGC GG MD AF +I++N G+  + 
Sbjct: 158 FSTVAAVEGINQIVTGNLTSLSEQELIDC-DRTYSNGCNGGLMDYAFSFIVENGGLHKEE 216

Query: 203 VYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA--LQF 260
            Y Y  M  G C+  K E     I+ Y DVP N+E+SLLKA+ANQ +SVAI+AS    QF
Sbjct: 217 DYPYI-MEEGTCEMTKEETEVVTISGYHDVPQNNEQSLLKALANQSLSVAIEASGRDFQF 275

Query: 261 YSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQG 320
           YSGGVF+G+C + L+HGV AVGYGT+ +G+ Y ++KNSWG  WGE GY R+ R   + +G
Sbjct: 276 YSGGVFDGHCGSDLDHGVAAVGYGTA-KGVDYIIVKNSWGSKWGEKGYIRM-RGTLETRG 333

Query: 321 QCGIAMFASFPV 332
                  AS+P+
Sbjct: 334 NLRYLQMASYPL 345


>gi|50355617|dbj|BAD29957.1| cysteine protease [Daucus carota]
          Length = 437

 Score =  260 bits (665), Expect = 5e-67,   Method: Compositional matrix adjust.
 Identities = 145/322 (45%), Positives = 193/322 (59%), Gaps = 20/322 (6%)

Query: 23  RTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRL 82
           RT DE  +   +  W  ++G++Y    E   RF+IFKDNL  ++  N  A  +RSY L L
Sbjct: 40  RTDDE--VMTMYNSWLVKHGKSYNALGEKETRFQIFKDNLRYIDNHN--ADPDRSYELGL 95

Query: 83  NKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLY---KSSQVPPSVNWIEKGAVTPVK 139
           N+FADLT +E+ A   G K  +    L + G    Y   +  ++P S++W EKGAV  VK
Sbjct: 96  NRFADLTNEEYRAKYLGTKSRESRPKL-SKGPSDRYAPVEGEELPDSIDWREKGAVAAVK 154

Query: 140 YQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYI 192
            QG C       A+ AVEGIN I    L++LSEQ+LVDC     N GC GG MD AF +I
Sbjct: 155 DQGSCGSCWAFSAIGAVEGINQITTGELITLSEQELVDC-DRSYNEGCEGGLMDYAFNFI 213

Query: 193 IQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVA 252
           I+N GI +D  Y Y G   G C+  K       I +YEDVP  DE++L KA ANQP+SVA
Sbjct: 214 IKNGGIDSDLDYPYTGRD-GTCNQNKENAKVVTIDSYEDVPVYDEKALQKAAANQPISVA 272

Query: 253 IDASAL--QFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFR 310
           I+A  +  Q Y  G+F G C T ++HGV  VGYG SEEG+ YW+++NSWG  WGE GY +
Sbjct: 273 IEAGGMDFQLYVSGIFTGKCGTAVDHGVVVVGYG-SEEGMDYWIVRNSWGAAWGEAGYLK 331

Query: 311 LQRDIDQPQGQCGIAMFASFPV 332
           +QR++ +  G CGI +  S+PV
Sbjct: 332 MQRNVGKSSGLCGITIEPSYPV 353


>gi|400180385|gb|AFP73331.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score =  260 bits (665), Expect = 5e-67,   Method: Compositional matrix adjust.
 Identities = 144/339 (42%), Positives = 205/339 (60%), Gaps = 19/339 (5%)

Query: 5   FLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVA 64
            LI +  +     +Q   R+  + S++E+ E W +++GR YK+  E  +RF IFK+N+  
Sbjct: 10  ILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFMIFKENMKF 69

Query: 65  VERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSD-HSSSLKANGTPFL---YKS 120
           +E  N A  GN SY L +N+FAD+T QEF+A  TG  + + + S    + T F       
Sbjct: 70  IESVNKA--GNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEFKINDLSD 127

Query: 121 SQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCAT 173
             +P +++W E GAVT VK+QG+C       AV ++EG   I    L+  SEQ+L+DC T
Sbjct: 128 DDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCTT 187

Query: 174 NDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVP 233
           N  N GC GGFM +AF +I +N GI+ ++ Y Y G     C S + +  A QI++Y+ V 
Sbjct: 188 N--NYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQY-TCRS-QEKTAAVQISSYQ-VV 242

Query: 234 PNDEESLLKAVANQPVSVAIDASA-LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKY 292
           P  E SLL+AV  QPVS+ I AS  LQFY+GG ++G C   +NH VTA+GYGT E+G KY
Sbjct: 243 PEGETSLLQAVTKQPVSIGIAASQDLQFYAGGTYDGSCADRINHAVTAIGYGTDEKGQKY 302

Query: 293 WLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFP 331
           WL+KNSWG  WGE+G+ ++ RD   P G C I   +S+P
Sbjct: 303 WLLKNSWGTSWGENGFMKIIRDSGDPSGLCDITKMSSYP 341


>gi|45738078|gb|AAS75836.1| fastuosain precursor [Bromelia fastuosa]
          Length = 324

 Score =  260 bits (665), Expect = 5e-67,   Method: Compositional matrix adjust.
 Identities = 143/320 (44%), Positives = 194/320 (60%), Gaps = 24/320 (7%)

Query: 30  IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLT 89
           + E+FE+W A+YGR Y ++AE  +RF+IFK+N+  +E FNN + GN SYTL +N+F D+T
Sbjct: 6   MMERFEEWMAEYGRVYNDNAEKMRRFQIFKNNVNHIETFNNRS-GN-SYTLGVNQFTDMT 63

Query: 90  PQEFIASQTGFKMSDHSSSLKANGTPFL----YKSSQVPPSVNWIEKGAVTPVKYQGQC- 144
             EF+A  TG  +      L     P +       S VP S++W + GAVT VK QG C 
Sbjct: 64  NNEFLARYTGASLP-----LNIERDPVVSFDDVDISAVPQSIDWRDYGAVTSVKNQGSCG 118

Query: 145 ------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGI 198
                 A+A VEGI  IK   L+SLSEQ+++DCA +    GC GG+++ A+ +II N G+
Sbjct: 119 SCWAFSAIATVEGIYKIKAGNLISLSEQEVLDCALS---YGCDGGWVNKAYDFIISNNGV 175

Query: 199 TNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA- 257
           T+ A   Y+G   G C+     + A  IT Y  V  N+E S++ AVANQP++  IDA   
Sbjct: 176 TSFANLPYKGYK-GPCNHNDLPNKA-YITGYTYVQSNNERSMMIAVANQPIAALIDAGGD 233

Query: 258 LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQ 317
            Q+Y  GVF G C T LNH +T +GYG +  G KYW++KNSWG  WGE GY R+ RD+  
Sbjct: 234 FQYYKSGVFTGSCGTSLNHAITVIGYGQTSSGTKYWIVKNSWGTSWGERGYIRMARDVSS 293

Query: 318 PQGQCGIAMFASFPVSKESA 337
           P G CGIAM   FP  +  A
Sbjct: 294 PYGLCGIAMAPLFPTLQSGA 313


>gi|115461226|ref|NP_001054213.1| Os04g0670500 [Oryza sativa Japonica Group]
 gi|62510688|sp|Q7XR52.2|CYSP1_ORYSJ RecName: Full=Cysteine protease 1; AltName: Full=OsCP1; Flags:
           Precursor
 gi|38345300|emb|CAE02828.2| OSJNBa0043A12.33 [Oryza sativa Japonica Group]
 gi|113565784|dbj|BAF16127.1| Os04g0670500 [Oryza sativa Japonica Group]
 gi|215741575|dbj|BAG98070.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 490

 Score =  260 bits (665), Expect = 6e-67,   Method: Compositional matrix adjust.
 Identities = 143/295 (48%), Positives = 181/295 (61%), Gaps = 16/295 (5%)

Query: 50  ENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSL 109
           E+ +RF +F DNL  V+  N  A     + L +N+FADLT  EF A+  G   +     +
Sbjct: 84  EHERRFRVFWDNLKFVDAHNARADERGGFRLGMNRFADLTNGEFRATYLGTTPAGRGRRV 143

Query: 110 KANGTPFLYKSSQ-VPPSVNWIEKGAVT-PVKYQGQC-------AVAAVEGINAIKINRL 160
              G  + +   + +P SV+W +KGAV  PVK QGQC       AVAAVEGIN I    L
Sbjct: 144 ---GEAYRHDGVEALPDSVDWRDKGAVVAPVKNQGQCGSCWAFSAVAAVEGINKIVTGEL 200

Query: 161 VSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAE 220
           VSLSEQ+LV+CA N  N+GC GG MDDAF +I +N G+  +  Y Y  M  G C+  K  
Sbjct: 201 VSLSEQELVECARNGQNSGCNGGIMDDAFAFIARNGGLDTEEDYPYTAMD-GKCNLAKRS 259

Query: 221 DHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA--LQFYSGGVFNGYCETFLNHGV 278
                I  +EDVP NDE SL KAVA+QPVSVAIDA     Q Y  GVF G C T L+HGV
Sbjct: 260 RKVVSIDGFEDVPENDELSLQKAVAHQPVSVAIDAGGREFQLYDSGVFTGRCGTNLDHGV 319

Query: 279 TAVGYGT-SEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
            AVGYGT +  G  YW ++NSWG DWGE+GY R++R++    G+CGIAM AS+P+
Sbjct: 320 VAVGYGTDAATGAAYWTVRNSWGPDWGENGYIRMERNVTARTGKCGIAMMASYPI 374


>gi|46401612|dbj|BAD16614.1| cysteine proteinase [Dianthus caryophyllus]
          Length = 459

 Score =  260 bits (665), Expect = 6e-67,   Method: Compositional matrix adjust.
 Identities = 148/342 (43%), Positives = 205/342 (59%), Gaps = 20/342 (5%)

Query: 5   FLIVVLIISGSCASQATYRTFD--EGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNL 62
           F I+ ++ S +       R F+  +  IA  +E W  ++G+ Y    E   RF IFKDNL
Sbjct: 12  FSIIFIVSSSALDLSIIDRAFNRPDDEIASLYETWLVKHGKNYNGLGEKQLRFNIFKDNL 71

Query: 63  VAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHS--SSLKANGTPFLYKS 120
             V+  N+    N S+ L LN+FADLT +E+ +   G +    +   S ++    + +++
Sbjct: 72  RFVDERNSE---NLSFKLGLNRFADLTNEEYRSVYLGTRPRSVAVARSGRSKSDRYAFRA 128

Query: 121 SQ-VPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCA 172
              +P SV+W +KGAV  +K QG C       A+AAVEG+N I    L+SLSEQ+LV+C 
Sbjct: 129 GDTLPESVDWRKKGAVAGIKDQGSCGSCWAFSAIAAVEGVNQIVTGDLISLSEQELVECD 188

Query: 173 TNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDV 232
           T+  N+GC GG MD AF++II+N+GI +D  Y Y G   G CD+ +       I +YED 
Sbjct: 189 TS-YNDGCDGGLMDYAFEFIIKNEGIDSDEDYPYTGRD-GRCDTNRKNAKVVTIDDYEDS 246

Query: 233 PPNDEESLLKAVANQPVSVAIDASA--LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGI 290
           P  DE+SL KAVANQPVSVAI+      Q Y  GVF G C T L+HGV  VGYGT E+G+
Sbjct: 247 PVYDEKSLQKAVANQPVSVAIEGGGRDFQLYDSGVFTGKCGTALDHGVAVVGYGT-EDGL 305

Query: 291 KYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
            YW+++NSWG  WGE GY R+QR+   P G CGIA+  S+P+
Sbjct: 306 DYWIVRNSWGDTWGEGGYIRMQRNTKLPSGICGIAIEPSYPI 347


>gi|363814535|ref|NP_001242660.1| uncharacterized protein LOC100807362 precursor [Glycine max]
 gi|255636658|gb|ACU18666.1| unknown [Glycine max]
          Length = 367

 Score =  260 bits (665), Expect = 6e-67,   Method: Compositional matrix adjust.
 Identities = 148/367 (40%), Positives = 211/367 (57%), Gaps = 28/367 (7%)

Query: 1   MAKYFLIVVLIISGSCASQATYRTFDEG-----------SIAEKFEQWKAQYGRTYKESA 49
           MA   +++  +++ S A   +  ++D              +   +E+W  ++G+ Y    
Sbjct: 8   MATILIVLFTVLAVSSALDMSIISYDRSHADKSGWKSDEEVMSIYEEWLVKHGKVYNAVE 67

Query: 50  ENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSL 109
           E  KRF+IFKDNL  +E  N     NR+Y + LN+F+DL+ +E+ +   G K+       
Sbjct: 68  EKEKRFQIFKDNLNFIEEHNAV---NRTYKVGLNRFSDLSNEEYRSKYLGTKIDPSRMMA 124

Query: 110 KANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVS 162
           + +       +  +P SV+W ++GAV  VK Q +C       A+AAVEGIN I    L +
Sbjct: 125 RPSRRYSPRVADNLPESVDWRKEGAVVRVKNQSECEGCWAFSAIAAVEGINKIVTGNLTA 184

Query: 163 LSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDH 222
           LSEQ+L+DC     N GC GG +D AF++II N GI  +  Y ++G + GICD  K    
Sbjct: 185 LSEQELLDC-DRTVNAGCSGGLVDYAFEFIINNGGIDTEEDYPFQG-ADGICDQYKINAR 242

Query: 223 AAQITNYEDVPPNDEESLLKAVANQPVSVAIDA--SALQFYSGGVFNGYCETFLNHGVTA 280
           A  I  YE VP  DE +L KAVANQPVSVAI+A     Q Y  G+F G C T ++HGVTA
Sbjct: 243 AVTIDGYERVPAYDELALKKAVANQPVSVAIEAYGKEFQLYESGIFTGTCGTSIDHGVTA 302

Query: 281 VGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDI-DQPQGQCGIAMFASFPVSKESAQP 339
           VGYGT E GI YW++KNSWG++WGE GY  ++R+I +   G+CGIA+   +P+ K    P
Sbjct: 303 VGYGT-ENGIDYWIVKNSWGENWGEAGYVGMERNIAEDTAGKCGIAILTLYPI-KIGQNP 360

Query: 340 SSADKSS 346
           S+ D SS
Sbjct: 361 SNPDNSS 367


>gi|90265242|emb|CAH67695.1| H0624F09.3 [Oryza sativa Indica Group]
          Length = 494

 Score =  260 bits (665), Expect = 6e-67,   Method: Compositional matrix adjust.
 Identities = 143/295 (48%), Positives = 181/295 (61%), Gaps = 16/295 (5%)

Query: 50  ENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSL 109
           E+ +RF +F DNL  V+  N  A     + L +N+FADLT  EF A+  G   +     +
Sbjct: 84  EHERRFRVFWDNLKFVDAHNARADERGGFRLGMNRFADLTNGEFRATYLGTTPAGRGRRV 143

Query: 110 KANGTPFLYKSSQ-VPPSVNWIEKGAVT-PVKYQGQC-------AVAAVEGINAIKINRL 160
              G  + +   + +P SV+W +KGAV  PVK QGQC       AVAAVEGIN I    L
Sbjct: 144 ---GEAYRHDGVEALPDSVDWRDKGAVVAPVKNQGQCGSCWAFSAVAAVEGINKIVTGEL 200

Query: 161 VSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAE 220
           VSLSEQ+LV+CA N  N+GC GG MDDAF +I +N G+  +  Y Y  M  G C+  K  
Sbjct: 201 VSLSEQELVECARNGQNSGCNGGIMDDAFAFIARNGGLDTEEDYPYTAMD-GKCNLAKRS 259

Query: 221 DHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA--LQFYSGGVFNGYCETFLNHGV 278
                I  +EDVP NDE SL KAVA+QPVSVAIDA     Q Y  GVF G C T L+HGV
Sbjct: 260 RKVVSIDGFEDVPENDELSLQKAVAHQPVSVAIDAGGREFQLYDSGVFTGRCGTNLDHGV 319

Query: 279 TAVGYGT-SEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
            AVGYGT +  G  YW ++NSWG DWGE+GY R++R++    G+CGIAM AS+P+
Sbjct: 320 VAVGYGTDAATGAAYWTVRNSWGPDWGENGYIRMERNVTARTGKCGIAMMASYPI 374


>gi|21070926|gb|AAM34401.1|AF377947_7 putative cysteine proteinase [Oryza sativa Japonica Group]
 gi|31712050|gb|AAP68356.1| putative cysteine protease [Oryza sativa Japonica Group]
 gi|40538988|gb|AAR87245.1| putative cysteine protease [Oryza sativa Japonica Group]
 gi|108711126|gb|ABF98921.1| Papain family cysteine protease containing protein, expressed
           [Oryza sativa Japonica Group]
 gi|125545747|gb|EAY91886.1| hypothetical protein OsI_13535 [Oryza sativa Indica Group]
          Length = 350

 Score =  260 bits (664), Expect = 8e-67,   Method: Compositional matrix adjust.
 Identities = 149/350 (42%), Positives = 209/350 (59%), Gaps = 27/350 (7%)

Query: 5   FLIVVLIISGSCA-------SQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEI 57
           F++ VL++SG+ A         A      + ++A + E+W A++G+TYK+  E ++R E+
Sbjct: 6   FVLAVLVMSGAAALGRELAGDGAAAAAAADVAMASRHEKWMAKHGKTYKDEEEKARRLEV 65

Query: 58  FKDNLVAVERFNNAA--IGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTP 115
           F+ N   ++ FN AA   G   + L  N+FADLT  EF A++TG++    + +       
Sbjct: 66  FRANAKLIDSFNAAAEKDGGGGHRLATNRFADLTDDEFRAARTGYQRPPAAVAGAG--GG 123

Query: 116 FLYKS---SQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSE 165
           FLY++   +  P S++W   GAVT VK QG C       AVAAVEG+  I+  +LVSLSE
Sbjct: 124 FLYENFSLAAAPQSMDWRAMGAVTGVKDQGSCGCCWAFSAVAAVEGLAKIRTGQLVSLSE 183

Query: 166 QQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQ 225
           Q+LVDC     + GC GG MD AF+YI +  G+  ++ Y Y G+         A   AA 
Sbjct: 184 QELVDCDVRGEDQGCEGGLMDTAFQYIARRGGLAAESSYPYRGVDG--ACRAAAGRAAAS 241

Query: 226 ITNYEDVPPNDEESLLKAVANQPVSVAIDASA--LQFYSGGVFNGY-CETFLNHGVTAVG 282
           I  ++DVP NDE +L+ AVA QPVSVAI+ +    +FY  GV  G  C T LNH VTAVG
Sbjct: 242 IRGFQDVPSNDEGALMAAVARQPVSVAINGAGYVFRFYDRGVLGGAGCGTELNHAVTAVG 301

Query: 283 YGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
           YGT+ +G  YWL+KNSWG  WGE GY R++R + + +G CGIA  AS+PV
Sbjct: 302 YGTASDGTGYWLMKNSWGASWGEGGYVRIRRGVGR-EGACGIAQMASYPV 350


>gi|18396952|ref|NP_564322.1| cysteine proteinase-like protein [Arabidopsis thaliana]
 gi|332192922|gb|AEE31043.1| cysteine proteinase-like protein [Arabidopsis thaliana]
          Length = 334

 Score =  260 bits (664), Expect = 9e-67,   Method: Compositional matrix adjust.
 Identities = 140/337 (41%), Positives = 203/337 (60%), Gaps = 19/337 (5%)

Query: 4   YFLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLV 63
           +  + +L +    +    + T +E SI +  +QW  Q+ R YK+ +E   R ++FK NL 
Sbjct: 8   FVALTILSMDLRISQARPHVTLNEQSIVDYHQQWMTQFSRVYKDESEKEMRLKVFKKNLK 67

Query: 64  AVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTP-----FLY 118
            +E FNN  +GN+SYTL +N+F D   +EF+A+ TG +++  S S   N T       + 
Sbjct: 68  FIENFNN--MGNQSYTLGVNEFTDWKTEEFLATHTGLRVNVTSLSELFNKTKPSRNWNMS 125

Query: 119 KSSQVPPSVNWIEKGAVTPVKYQGQCAVAAVEGINAIKINRLVSLSEQQLVDCATNDNNN 178
                  S +W ++GAVTPVKYQG C +  + G N      L++LSEQQL+DC   + N 
Sbjct: 126 DIDMEDESKDWRDEGAVTPVKYQGACRLTKISGKN------LLTLSEQQLIDCDI-EKNG 178

Query: 179 GCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEE 238
           GC GG  ++AFKYII+N G++ +  Y Y+        + +   H  QI  ++ VP ++E 
Sbjct: 179 GCNGGEFEEAFKYIIKNGGVSLETEYPYQVKKESCRANARRAPHT-QIRGFQMVPSHNER 237

Query: 239 SLLKAVANQPVSVAIDASALQF--YSGGVFNGY-CETFLNHGVTAVGYGTSEEGIKYWLI 295
           +LL+AV  QPVSV IDA A  F  Y GGV+ G  C T +NH VT VGYGT   G+ YW++
Sbjct: 238 ALLEAVRRQPVSVLIDARADSFGHYKGGVYAGLDCGTDVNHAVTIVGYGTMS-GLNYWVL 296

Query: 296 KNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
           KNSWG+ WGE+GY R++RD++ PQG CGIA  A++PV
Sbjct: 297 KNSWGESWGENGYMRIRRDVEWPQGMCGIAQVAAYPV 333


>gi|146215988|gb|ABQ10196.1| actinidin Act3a [Actinidia eriantha]
          Length = 380

 Score =  260 bits (664), Expect = 9e-67,   Method: Compositional matrix adjust.
 Identities = 156/353 (44%), Positives = 210/353 (59%), Gaps = 25/353 (7%)

Query: 1   MAKYFLIVVLIISGSC-ASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFK 59
           M+  F    LI S +  A  +  RT DE  +   +E W  +YG++Y    E   R EIFK
Sbjct: 10  MSLLFFSTFLIFSFAIDAKISPLRTNDE--VMALYESWLVKYGKSYNSLGEREMRIEIFK 67

Query: 60  DNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKAN-GTPFLY 118
           +NL  ++  N  A  NRSYT+ LN+FADLT +E+ ++  GFK     SSLK+     ++ 
Sbjct: 68  ENLRFIDEHN--ADPNRSYTVGLNQFADLTDEEYRSTYLGFK-----SSLKSKVSNRYMP 120

Query: 119 KSSQVPPS-VNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVD 170
           +  +V P  V+W   GAV  VK QG C+       +A VE IN I    L+SLSEQ+LVD
Sbjct: 121 QVGEVLPDYVDWRTTGAVVDVKNQGLCSSCWAFATIATVESINQIITGDLISLSEQELVD 180

Query: 171 CATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYE 230
           C     N GC GGFMDDA+++II N GI  +  Y Y G     CD  K   +   I +YE
Sbjct: 181 CNRTPINEGCKGGFMDDAYEFIINNGGINTEENYPYIGQDDQ-CDEPKKNQNYVTIDSYE 239

Query: 231 DVPPNDEESLLKAVANQPVSVAIDASAL--QFYSGGVFN-GYCETFLNHGVTAVGYGTSE 287
            VPPNDE ++ +AVA QPVSVAIDA  L  +FY  G+F  G C T LNH VT +GYGT E
Sbjct: 240 QVPPNDELAMKRAVAYQPVSVAIDAYCLGFRFYQSGIFTGGSCGTTLNHAVTIIGYGT-E 298

Query: 288 EGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVSKESAQPS 340
            GI YW++KNS+G  WGE GY ++QR++   +G+CGIA +  +PV   +++P+
Sbjct: 299 NGIDYWIVKNSYGTQWGESGYGKVQRNVGG-EGRCGIASYPFYPVKNYTSKPA 350


>gi|297745594|emb|CBI40759.3| unnamed protein product [Vitis vinifera]
          Length = 300

 Score =  259 bits (663), Expect = 9e-67,   Method: Compositional matrix adjust.
 Identities = 142/304 (46%), Positives = 189/304 (62%), Gaps = 18/304 (5%)

Query: 39  AQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQT 98
           +++G++Y+   E   RFE+F+DNL  ++  N       SY L LN+FADL+ +EF     
Sbjct: 2   SKHGKSYRSFEEKLHRFEVFQDNLKHIDETNKKV---SSYWLGLNEFADLSHEEFKRKYL 58

Query: 99  GFKMSDHSSSLKANGTPFLYKS-SQVPPSVNWIEKGAVTPVKYQGQCA-------VAAVE 150
           G K+       + +   F YK  + +P SV+W +KGAV  VK QG C        VAAVE
Sbjct: 59  GLKIE--LPKRRDSPEEFSYKDVADLPKSVDWRKKGAVAHVKNQGACGSCWAFSTVAAVE 116

Query: 151 GINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMS 210
           GIN I    L +LSEQ+L+DC     NNGC GG MD AF +II N G+  +  Y Y  M 
Sbjct: 117 GINQIVTGNLTALSEQELIDC-DKPFNNGCNGGLMDYAFAFIISNGGLRKEEDYPYV-ME 174

Query: 211 TGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA--LQFYSGGVFNG 268
            G C   K E     I+ Y DVP ++E+S LKA+ANQP+SVAI+AS+   QFYSGG+FNG
Sbjct: 175 EGTCGEKKEELEVVTISGYHDVPEDNEQSFLKALANQPLSVAIEASSRGFQFYSGGIFNG 234

Query: 269 YCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFA 328
           +C T L+HGV AVGYGTS+ G+ Y  +KNSWG  WGE GY R++R++ +P+G CGI   A
Sbjct: 235 HCGTELDHGVAAVGYGTSK-GVDYITVKNSWGSKWGEKGYIRMKRNVGKPEGICGIYKMA 293

Query: 329 SFPV 332
           S+P 
Sbjct: 294 SYPT 297


>gi|400180391|gb|AFP73334.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score =  259 bits (663), Expect = 1e-66,   Method: Compositional matrix adjust.
 Identities = 145/339 (42%), Positives = 205/339 (60%), Gaps = 19/339 (5%)

Query: 5   FLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVA 64
            LI +  +     SQ   R+  + S++E+ E W +++GR YK+  E  +RF IFK+N+  
Sbjct: 10  ILITLFFVISMFNSQTRARSQPKLSVSERHELWMSRHGRVYKDEVEKGERFMIFKENMKF 69

Query: 65  VERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSD-HSSSLKANGTPFL---YKS 120
           +E  N A  GN SY L +N+FAD+T QEF+A  TG  + + + S    + T F       
Sbjct: 70  IESVNKA--GNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEFKINDLSD 127

Query: 121 SQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCAT 173
             +P +++W E GAVT VK+QG+C       AV ++EG   I    L+  SEQ+L+DC T
Sbjct: 128 DDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCTT 187

Query: 174 NDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVP 233
           N  N GC GGFM +AF +I +N GI+ ++ Y Y G     C S + +  A QI++Y+ V 
Sbjct: 188 N--NYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQY-TCRS-QEKTAAVQISSYQ-VV 242

Query: 234 PNDEESLLKAVANQPVSVAIDASA-LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKY 292
           P  E SLL+AV  QPVS+ I AS  LQF +GG ++G C   +NH VTA+GYGT E+G KY
Sbjct: 243 PEGETSLLQAVTKQPVSIGIAASQDLQFCAGGTYDGSCADRINHAVTAIGYGTDEKGQKY 302

Query: 293 WLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFP 331
           WL+KNSWG  WGE+G+ ++ RD   P G C IA  +S+P
Sbjct: 303 WLLKNSWGTSWGENGFMKIIRDYGNPAGLCDIAKMSSYP 341


>gi|18422289|ref|NP_568620.1| Granulin repeat cysteine protease family protein [Arabidopsis
           thaliana]
 gi|9757832|dbj|BAB08269.1| cysteine protease component of protease-inhibitor complex
           [Arabidopsis thaliana]
 gi|17065064|gb|AAL32686.1| cysteine protease component of protease-inhibitor complex
           [Arabidopsis thaliana]
 gi|21387153|gb|AAM47980.1| cysteine protease component of protease-inhibitor complex
           [Arabidopsis thaliana]
 gi|332007522|gb|AED94905.1| Granulin repeat cysteine protease family protein [Arabidopsis
           thaliana]
          Length = 463

 Score =  259 bits (663), Expect = 1e-66,   Method: Compositional matrix adjust.
 Identities = 146/314 (46%), Positives = 191/314 (60%), Gaps = 21/314 (6%)

Query: 34  FEQWKAQYGRTYKE----SAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLT 89
           +E W  ++G+         AE  +RFEIFKDNL  ++  N     N SY L L +FADLT
Sbjct: 50  YEAWMVEHGKKKMNQNGLGAEKDQRFEIFKDNLRFIDEHNTK---NLSYKLGLTRFADLT 106

Query: 90  PQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQCA---- 145
            +E+ +   G K +     LK +          +P SV+W ++GAV  VK QG C     
Sbjct: 107 NEEYRSMYLGAKPTKRV--LKTSDRYQARVGDALPDSVDWRKEGAVADVKDQGSCGSCWA 164

Query: 146 ---VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDA 202
              + AVEGIN I    L+SLSEQ+LVDC T+  N GC GG MD AF++II+N GI  +A
Sbjct: 165 FSTIGAVEGINKIVTGDLISLSEQELVDCDTS-YNQGCNGGLMDYAFEFIIKNGGIDTEA 223

Query: 203 VYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDAS--ALQF 260
            Y Y+  + G CD  +       I +YEDVP N E SL KA+A+QP+SVAI+A   A Q 
Sbjct: 224 DYPYKA-ADGRCDQNRKNAKVVTIDSYEDVPENSEASLKKALAHQPISVAIEAGGRAFQL 282

Query: 261 YSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQG 320
           YS GVF+G C T L+HGV AVGYGT E G  YW+++NSWG  WGE GY ++ R+I+ P G
Sbjct: 283 YSSGVFDGLCGTELDHGVVAVGYGT-ENGKDYWIVRNSWGNRWGESGYIKMARNIEAPTG 341

Query: 321 QCGIAMFASFPVSK 334
           +CGIAM AS+P+ K
Sbjct: 342 KCGIAMEASYPIKK 355


>gi|38345906|emb|CAE04498.2| OSJNBb0059K02.8 [Oryza sativa Japonica Group]
          Length = 458

 Score =  259 bits (662), Expect = 1e-66,   Method: Compositional matrix adjust.
 Identities = 144/311 (46%), Positives = 189/311 (60%), Gaps = 14/311 (4%)

Query: 34  FEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAA-IGNRSYTLRLNKFADLTPQE 92
           + +WKA++G++Y    E  +R+  F+DNL  ++  N AA  G  S+ L LN+FADLT +E
Sbjct: 40  YAEWKAEHGKSYNAVGEEERRYAAFRDNLRYIDEHNAAADAGVHSFRLGLNRFADLTNEE 99

Query: 93  FIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQ---GQC----A 145
           +  +  G +        K +       +  +P SV+W  KGAV  +K Q   G C    A
Sbjct: 100 YRDTYLGLRNKPRRER-KVSDRYLAADNEALPESVDWRTKGAVAEIKDQEVAGSCWAFSA 158

Query: 146 VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYS 205
           +AAVEGIN I    L+SLSEQ+LVDC T+  N GC GG MD AF +II N GI  +  Y 
Sbjct: 159 IAAVEGINQIVTGDLISLSEQELVDCDTS-YNEGCNGGLMDYAFDFIINNGGIDTEDDYP 217

Query: 206 YEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDAS--ALQFYSG 263
           Y+G     CD  +       I +YEDV PN E SL KAVANQPVSVAI+A   A Q YS 
Sbjct: 218 YKGKDE-RCDVNRKNAKVVTIDSYEDVTPNSETSLQKAVANQPVSVAIEAGGRAFQLYSS 276

Query: 264 GVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCG 323
           G+F G C T L+HGV AVGYGT E G  YW+++NSWG+ WGE GY R++R+I    G+CG
Sbjct: 277 GIFTGKCGTALDHGVAAVGYGT-ENGKDYWIVRNSWGKSWGESGYVRMERNIKASSGKCG 335

Query: 324 IAMFASFPVSK 334
           IA+  S+P+ K
Sbjct: 336 IAVEPSYPLKK 346


>gi|255546708|ref|XP_002514413.1| cysteine protease, putative [Ricinus communis]
 gi|223546510|gb|EEF48009.1| cysteine protease, putative [Ricinus communis]
          Length = 324

 Score =  259 bits (662), Expect = 1e-66,   Method: Compositional matrix adjust.
 Identities = 143/315 (45%), Positives = 189/315 (60%), Gaps = 42/315 (13%)

Query: 30  IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLT 89
           + E FE W +++G+TY+   E   R E+FKDNL+ ++R N       +Y L LN+FADL+
Sbjct: 43  LTELFESWMSKHGKTYESIEEKLHRLEVFKDNLMHIDRRNRDVT---TYWLALNEFADLS 99

Query: 90  PQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQCA---- 145
            +EF +                       K +Q+      +EKGAV PVK QG C     
Sbjct: 100 HEEFKS-----------------------KLAQI----RRLEKGAVAPVKNQGSCGSCWA 132

Query: 146 ---VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDA 202
              VAAVEGIN I    L SLSEQ+L+DC T+  N+GC GG MD AF YI+ N G+  + 
Sbjct: 133 FSTVAAVEGINQIVTGNLTSLSEQELIDCDTS-FNSGCNGGLMDYAFDYIVNNGGLHKEE 191

Query: 203 VYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA--LQF 260
            Y Y  M  G CD  + E     I+ Y DVP N+EESLLKA+A+QP+S+AI+AS    QF
Sbjct: 192 DYPYL-MEEGTCDEKREEMEVVTISGYHDVPENNEESLLKALAHQPLSIAIEASGRDFQF 250

Query: 261 YSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQG 320
           Y  GVFNG C T L+HGV AVGYG+S+ G+ Y ++KNSWG  WGE GY R++R+  +P+G
Sbjct: 251 YGRGVFNGPCGTDLDHGVAAVGYGSSK-GLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEG 309

Query: 321 QCGIAMFASFPVSKE 335
            CGI   AS+P  K+
Sbjct: 310 LCGINKMASYPTKKK 324


>gi|535473|emb|CAA53377.1| cysteine protease [Vicia sativa]
          Length = 368

 Score =  259 bits (661), Expect = 2e-66,   Method: Compositional matrix adjust.
 Identities = 153/345 (44%), Positives = 205/345 (59%), Gaps = 26/345 (7%)

Query: 5   FLIVVLIISGSCA---SQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDN 61
           F +   +I+ S A      T R+ DE  +   +E+W  ++ + Y    E  +RF+IFKDN
Sbjct: 9   FFLFFSLITFSLALDIQLPTGRSNDE--VMTMYEEWLVKHQKVYNGLREKDQRFQIFKDN 66

Query: 62  LVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKAN---GTPFLY 118
           L  ++  N     N +Y + LNKFAD+T +E+     G + SD    +  N   G  + Y
Sbjct: 67  LNFIDEHNAQ---NYTYIVGLNKFADMTNEEYRDMYLGTR-SDIKRRIMKNKITGHRYAY 122

Query: 119 KSS-QVPPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVD 170
            S  ++P  V+W  KGA+T +K QG C        +A VE IN I   +LVSLSEQ+LVD
Sbjct: 123 NSGDRLPVHVDWRLKGAITHIKDQGSCGSCWAFSTIATVEAINKIVTGKLVSLSEQELVD 182

Query: 171 CATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYE 230
           C     N GC GG MD AF++II N GI  D  Y Y+G   G CD  + +     I  YE
Sbjct: 183 C-DRAFNEGCNGGLMDYAFEFIIGNGGIDTDQHYPYKGFE-GRCDPTRKKAKIVSIDGYE 240

Query: 231 DVPPNDEESLLKAVANQPVSVAIDAS--ALQFYSGGVFNGYCETFLNHGVTAVGYGTSEE 288
           DVP N+E +L KAVA+QPVSVAI+AS  ALQ Y  GVF G C T L+H V  VGYG SE 
Sbjct: 241 DVPSNNENALKKAVAHQPVSVAIEASGRALQLYQSGVFTGKCGTSLDHAVVIVGYG-SEN 299

Query: 289 GIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQ-GQCGIAMFASFPV 332
           G+ YWL++NSWG +WGEDGYF+++R++     G+CGIA+ AS+PV
Sbjct: 300 GLDYWLVRNSWGTNWGEDGYFKMERNVKGTHTGKCGIAVEASYPV 344


>gi|449438381|ref|XP_004136967.1| PREDICTED: cysteine proteinase RD21a-like [Cucumis sativus]
          Length = 479

 Score =  259 bits (661), Expect = 2e-66,   Method: Compositional matrix adjust.
 Identities = 146/314 (46%), Positives = 186/314 (59%), Gaps = 17/314 (5%)

Query: 30  IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLT 89
           +A  +E W   +G+ Y    E  +RFEIFKDNL  ++  N  +   R+Y + L +FADLT
Sbjct: 58  VAALYESWLVHHGKAYNAIGEKERRFEIFKDNLRFIDEHNRES---RTYKVGLTRFADLT 114

Query: 90  PQEFIASQTGFKMSDHSS-SLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQC---- 144
            +E+ A   G + S     S   +G         +P  V+W +KGAV  VK QGQC    
Sbjct: 115 NEEYRARFLGGRFSRKPRLSAAKSGRYAAALGDDLPDDVDWRKKGAVATVKDQGQCGSCW 174

Query: 145 ---AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITND 201
              +VAAVEGIN I    L+ LSEQ+LVDC     N GC GG MD AF++II N GI  +
Sbjct: 175 AFSSVAAVEGINQIVTGELIPLSEQELVDC-DKSFNMGCNGGLMDYAFQFIIGNGGIDTE 233

Query: 202 AVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDAS--ALQ 259
             Y Y+G     CD  +       I  YEDVP NDE SL KAVANQPVSVAI+A   A Q
Sbjct: 234 EDYPYKGRDAA-CDPNRKNAKVVTIDGYEDVPENDESSLKKAVANQPVSVAIEAGGRAFQ 292

Query: 260 FYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDI-DQP 318
            Y  GVF G C T L+HGV AVGYGT + G  YW+++NSWG+DWGE GY RL+R++ +  
Sbjct: 293 LYQSGVFTGRCGTDLDHGVVAVGYGT-DNGTDYWIVRNSWGKDWGESGYIRLERNVANIT 351

Query: 319 QGQCGIAMFASFPV 332
            G+CGIA+  S+P 
Sbjct: 352 TGKCGIAVQPSYPT 365


>gi|356517368|ref|XP_003527359.1| PREDICTED: LOW QUALITY PROTEIN: vignain-like [Glycine max]
          Length = 332

 Score =  258 bits (660), Expect = 2e-66,   Method: Compositional matrix adjust.
 Identities = 151/340 (44%), Positives = 207/340 (60%), Gaps = 30/340 (8%)

Query: 4   YFLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLV 63
           +    +L+     A Q T RT  + S+ E+ EQ   +YG+ YK+  +       FK+N+ 
Sbjct: 9   HIAFAMLLCMAFLAFQVTCRTLQDASMXERHEQRMTRYGKVYKDPPKRX-----FKENVN 63

Query: 64  AVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKS-SQ 122
            +E  NNAA  N+ Y   +N+FA   P+         +   H  S     T F +++ + 
Sbjct: 64  YIEACNNAA--NKPYKRGINQFA---PRN--------RFKGHMCSSIIRITTFKFENVTA 110

Query: 123 VPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATND 175
            P +V+  +KGAVTP+K QGQC       AVAA EGI+A+   +L+SLSEQ+LVDC T  
Sbjct: 111 TPSTVDCRQKGAVTPIKDQGQCGCCWAFSAVAATEGIHALSAGKLISLSEQELVDCDTKG 170

Query: 176 NNNGCYGGFMDDAFKYIIQNKGITNDA-VYSYEGMSTGICDSIKAEDHAAQITNYEDVPP 234
            + GC GG MDDAFK+IIQN G+ + + +  Y G+      +  A++ A  IT YEDVP 
Sbjct: 171 VDXGCEGGLMDDAFKFIIQNHGLKHXSQLPLYMGVDGKCNANEAAKNAATIITGYEDVPA 230

Query: 235 NDEESLL-KAVANQPVSVAIDASA--LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIK 291
           N+E++ L KAVAN PVS AIDAS    QFY  GVF G C T L+HGVTAVGYG S++G +
Sbjct: 231 NNEKAHLQKAVANNPVSEAIDASGSDFQFYKSGVFTGSCGTELDHGVTAVGYGVSDDGTE 290

Query: 292 YWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFP 331
           YWL+KNSWG +WGE+GY R+QR +D  +  CGIA+ AS+P
Sbjct: 291 YWLVKNSWGTEWGEEGYIRMQRGVDSEEALCGIAVQASYP 330


>gi|242068363|ref|XP_002449458.1| hypothetical protein SORBIDRAFT_05g013840 [Sorghum bicolor]
 gi|241935301|gb|EES08446.1| hypothetical protein SORBIDRAFT_05g013840 [Sorghum bicolor]
          Length = 350

 Score =  258 bits (660), Expect = 2e-66,   Method: Compositional matrix adjust.
 Identities = 152/324 (46%), Positives = 205/324 (63%), Gaps = 27/324 (8%)

Query: 25  FDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNK 84
           + E ++  + +QW A++GRTYK+ AE ++RF++FK N   V+R N  A G +SY L +N+
Sbjct: 40  YGEEAMKVRHQQWMAEHGRTYKDEAEKARRFQVFKANADFVDRSN--AAGGKSYELAINE 97

Query: 85  FADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVP----PSVNWIEKGAVTPVKY 140
           FAD+T  EF+A  TG K         A    F Y++  +      +V+W +KGAVT +K 
Sbjct: 98  FADMTNDEFVAMYTGLKPVPAGPKKMAG---FKYENLTLSDVDQQAVDWRQKGAVTGIKN 154

Query: 141 QGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYII 193
           QGQC       AVAAVE I+ I    LVSLSEQQ++DC T D NNGC GG++D+AF+YII
Sbjct: 155 QGQCGCCWAFAAVAAVESIHQITTGNLVSLSEQQVLDCDT-DGNNGCNGGYIDNAFQYII 213

Query: 194 QNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAI 253
            N G+  +  Y Y   + G C S  +   A  I++Y+DVP  DE +L  AVANQPV+VAI
Sbjct: 214 SNGGLATEDAYPY-AAAQGTCQS--SVQPAVTISSYQDVPSGDEAALAAAVANQPVAVAI 270

Query: 254 DA-SALQFYSGGVFNG-YCET-FLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFR 310
           DA +  QFYS GV     C T  LNH VTAVGY T+E+G  YWL+KN WGQ+WGE GY R
Sbjct: 271 DAHNNFQFYSSGVLTADTCGTPSLNHAVTAVGYSTAEDGTPYWLLKNQWGQNWGEGGYLR 330

Query: 311 LQRDIDQPQGQCGIAMFASFPVSK 334
           ++R  +     CG+A  AS+PV++
Sbjct: 331 VERGTN----ACGVAQQASYPVAR 350


>gi|400180465|gb|AFP73369.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score =  258 bits (659), Expect = 3e-66,   Method: Compositional matrix adjust.
 Identities = 144/339 (42%), Positives = 205/339 (60%), Gaps = 19/339 (5%)

Query: 5   FLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVA 64
            LI +  +     +Q   R+  + S++E+ E W +++GR YK+  E  +RF IFK+N+  
Sbjct: 10  ILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFMIFKENMKF 69

Query: 65  VERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSD-HSSSLKANGTPFL---YKS 120
           +E  N A  GN SY L +N+FAD+T QEF+A  TG  + + + S    + T F       
Sbjct: 70  IESVNKA--GNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEFKINDLSD 127

Query: 121 SQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCAT 173
             +P +++W E GAVT VK+QG+C       AV ++E    I    L+  SEQ+L+DC T
Sbjct: 128 DDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEVAYKIATGNLMEFSEQELLDCTT 187

Query: 174 NDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVP 233
           N  N GC GGFM +AF +I +N GI+ ++ Y Y G     C S + +  A QI++Y+ V 
Sbjct: 188 N--NYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQY-TCRS-QEKTAAVQISSYQ-VV 242

Query: 234 PNDEESLLKAVANQPVSVAIDASA-LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKY 292
           P  E SLL+AV  QPVS+ I AS  LQFY+GG ++G C   +NH VTA+GYGT E+G KY
Sbjct: 243 PEGETSLLQAVTKQPVSIGIAASQDLQFYAGGTYDGSCADRINHAVTAIGYGTDEKGQKY 302

Query: 293 WLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFP 331
           WL+KNSWG  WGE+G+ ++ RD   P G C IA  +S+P
Sbjct: 303 WLLKNSWGTSWGENGFMKIIRDSGNPAGLCDIAKMSSYP 341


>gi|297791625|ref|XP_002863697.1| hypothetical protein ARALYDRAFT_917391 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297309532|gb|EFH39956.1| hypothetical protein ARALYDRAFT_917391 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 463

 Score =  258 bits (659), Expect = 3e-66,   Method: Compositional matrix adjust.
 Identities = 147/328 (44%), Positives = 194/328 (59%), Gaps = 21/328 (6%)

Query: 20  ATYRTFDEGSIAEKFEQWKAQYGRTYKE----SAENSKRFEIFKDNLVAVERFNNAAIGN 75
           +T  +  +  +   +E W  ++G+         AE  +RFEIFKDNL  ++  N     N
Sbjct: 36  STVSSRSDAEVERIYEAWMVEHGKKKMNQNGLGAEKDQRFEIFKDNLRYIDEHNTK---N 92

Query: 76  RSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAV 135
            SY L L +FADLT  E+ +   G K       LK +          +P SV+W ++GAV
Sbjct: 93  LSYKLGLTRFADLTNDEYRSMYLGAKPVKRV--LKTSDRYEARVGDALPDSVDWRKEGAV 150

Query: 136 TPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDA 188
             VK QG C        + AVEGIN I    L+SLSEQ+LVDC T+  N GC GG MD A
Sbjct: 151 ADVKDQGSCGSCWAFSTIGAVEGINKIVTGDLISLSEQELVDCDTS-YNQGCNGGLMDYA 209

Query: 189 FKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQP 248
           F++II+N GI  +A Y Y+  + G CD  +       I +YEDVP N E SL KA+A+QP
Sbjct: 210 FEFIIKNGGIDTEADYPYKA-ADGRCDQNRKNAKVVTIDSYEDVPENSEASLKKALAHQP 268

Query: 249 VSVAIDAS--ALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGED 306
           +SVAI+A   A Q YS GVF+G C T L+HGV AVGYGT E G  YW+++NSWG  WGE 
Sbjct: 269 ISVAIEAGGRAFQLYSSGVFDGICGTELDHGVVAVGYGT-ENGKDYWIVRNSWGNRWGES 327

Query: 307 GYFRLQRDIDQPQGQCGIAMFASFPVSK 334
           GY ++ R+I +P G+CGIAM AS+P+ K
Sbjct: 328 GYIKMARNIAEPTGKCGIAMEASYPIKK 355


>gi|218181|dbj|BAA14402.1| oryzain alpha precursor [Oryza sativa Japonica Group]
          Length = 458

 Score =  258 bits (659), Expect = 3e-66,   Method: Compositional matrix adjust.
 Identities = 142/311 (45%), Positives = 187/311 (60%), Gaps = 14/311 (4%)

Query: 34  FEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAA-IGNRSYTLRLNKFADLTPQE 92
           + +WKA++G++Y    E  +R+  F+DNL  ++  N AA  G  S+ L LN+FADLT +E
Sbjct: 40  YAEWKAEHGKSYNAVGEEERRYAAFRDNLRYIDEHNAAADAGVHSFRLGLNRFADLTNEE 99

Query: 93  FIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQC-------A 145
           +  +  G +        K +       +  +P SV+W  KGAV  +K QG C       A
Sbjct: 100 YRDTYLGLRNKPRRER-KVSDRYLAADNEALPESVDWRTKGAVAEIKDQGGCGSCWAFSA 158

Query: 146 VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYS 205
           +AAVE IN I    L+SLSEQ+LVDC T+  N GC GG MD AF +II N GI  +  Y 
Sbjct: 159 IAAVEDINQIVTGDLISLSEQELVDCDTS-YNEGCNGGLMDYAFDFIINNGGIDTEDDYP 217

Query: 206 YEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDAS--ALQFYSG 263
           Y+G     CD  +       I +YEDV PN E SL KAV NQPVSVAI+A   A Q YS 
Sbjct: 218 YKGKDE-RCDVNRKNAKVVTIDSYEDVTPNSETSLQKAVRNQPVSVAIEAGGRAFQLYSS 276

Query: 264 GVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCG 323
           G+F G C T L+HGV AVGYGT E G  YW+++NSWG+ WGE GY R++R+I    G+CG
Sbjct: 277 GIFTGKCGTALDHGVAAVGYGT-ENGKDYWIVRNSWGKSWGESGYVRMERNIKASSGKCG 335

Query: 324 IAMFASFPVSK 334
           IA+  S+P+ K
Sbjct: 336 IAVEPSYPLKK 346


>gi|182375363|gb|ACB87490.1| mucunain [Mucuna pruriens]
          Length = 422

 Score =  258 bits (659), Expect = 3e-66,   Method: Compositional matrix adjust.
 Identities = 142/312 (45%), Positives = 189/312 (60%), Gaps = 19/312 (6%)

Query: 34  FEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEF 93
           +EQW  ++G+ Y    E  KRF+IFKDNL  ++  N     NR+Y L LN+FADLT +E+
Sbjct: 4   YEQWLVKHGKAYNALGEKDKRFDIFKDNLRFIDDHN---ADNRTYKLGLNRFADLTNEEY 60

Query: 94  IASQTGFKMSDHSSSLKANGTPFLYK---SSQVPPSVNWIEKGAVTPVKYQGQCA----- 145
            A   G ++  +   +K       Y       +P SV+W  + AV PVK QG C      
Sbjct: 61  RARYLGTRIDPNRRFVKTKTQSNRYAPRVGDNLPESVDWRNESAVLPVKDQGNCGSCWAF 120

Query: 146 --VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAV 203
             + AVEGIN I    L+SLSEQ+LVDC T+  N GC GG MD A+++II N GI ++  
Sbjct: 121 STIGAVEGINKIVTGDLISLSEQELVDCDTS-YNQGCNGGLMDYAYEFIINNGGIDSEED 179

Query: 204 YSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA--LQFY 261
           Y Y  +  G CD  +       I +YEDVP NDE +L KAVANQPVSVAI+      Q Y
Sbjct: 180 YPYRAVD-GTCDQYRKNAKVVTIDSYEDVPANDELALKKAVANQPVSVAIEGGGREFQLY 238

Query: 262 SGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQ-G 320
             GVF G C T L+HGV AVGYG S +G  YW+++NSWG  WGE+GY RL+R++ + + G
Sbjct: 239 VSGVFTGRCGTALDHGVVAVGYG-SVKGHDYWIVRNSWGASWGEEGYVRLERNLAKSRSG 297

Query: 321 QCGIAMFASFPV 332
           +CGIA+  S+P+
Sbjct: 298 KCGIAIEPSYPI 309


>gi|242049716|ref|XP_002462602.1| hypothetical protein SORBIDRAFT_02g028840 [Sorghum bicolor]
 gi|241925979|gb|EER99123.1| hypothetical protein SORBIDRAFT_02g028840 [Sorghum bicolor]
          Length = 384

 Score =  258 bits (659), Expect = 3e-66,   Method: Compositional matrix adjust.
 Identities = 146/360 (40%), Positives = 196/360 (54%), Gaps = 61/360 (16%)

Query: 30  IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLT 89
           + E+FEQW  ++GR Y ++ E  +R E+++ N+  VE FN  ++ N  Y L  NKFADLT
Sbjct: 28  MLERFEQWMGRHGRLYADAGEKQRRLEVYRRNVALVETFN--SMSNGGYRLADNKFADLT 85

Query: 90  PQEFIASQTGF-KMSDHSSSLKANGTPFLYK----------SSQVPPSVNWIEKGAVTPV 138
            +EF A   GF +   H  +     TP              S ++P SV+W EKGAV PV
Sbjct: 86  NEEFRAKMLGFGRPPPHGRATGHTTTPGTVACIGSGLGRRYSDELPKSVDWREKGAVAPV 145

Query: 139 KYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKY 191
           K QG+C       AVAA+EGIN IK  +LVSLSEQ+LVDC T     GC GG+M  AF++
Sbjct: 146 KNQGECGSCWAFSAVAAIEGINQIKNGKLVSLSEQELVDCDTK--AIGCAGGYMSWAFEF 203

Query: 192 IIQNKGITNDAVYSYEGM---------------------------STGICDSIKAEDHAA 224
           ++ N G+T +  Y Y+G                              G C + K ++ A 
Sbjct: 204 VMNNSGLTTERNYPYQGTYAHGNRKTHALPFDCTKGSSTCDSRAGMNGACQTPKLKESAV 263

Query: 225 QITNYEDVPPNDEESLLKAVANQPVSVAIDASAL--QFYSGGVFNGYCETFLNHGVTAVG 282
            I+ Y +V  + E  LL+A A QPVSVA+DA +   Q Y GGVF G C   LNHGVT VG
Sbjct: 264 SISGYVNVTASSEPDLLRAAAAQPVSVAVDAGSFVWQLYGGGVFTGPCTADLNHGVTVVG 323

Query: 283 YGTSEE----------GIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
           YG ++           G KYW++KNSWG +WG+ GY  +QR+     G CGIA+  S+PV
Sbjct: 324 YGETQRDTDGDGTGVPGQKYWIVKNSWGPEWGDAGYILMQREASVASGLCGIALLPSYPV 383


>gi|171702831|dbj|BAG16371.1| cysteine protease [Brassica oleracea var. italica]
          Length = 441

 Score =  258 bits (659), Expect = 3e-66,   Method: Compositional matrix adjust.
 Identities = 141/327 (43%), Positives = 192/327 (58%), Gaps = 23/327 (7%)

Query: 18  SQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRS 77
           +  T  +  +  ++  +E+W  ++G+      E  +RFEIFKDNL  ++  N     N S
Sbjct: 26  NHHTVSSRSDAEVSRLYEEWLVKHGKAQNSLTEKDRRFEIFKDNLRFIDEHNGK---NLS 82

Query: 78  YTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYK---SSQVPPSVNWIEKGA 134
           Y L L KFADLT  E+ +   G ++       KA  +   Y+      +P SV+W ++GA
Sbjct: 83  YRLGLTKFADLTNDEYRSMYLGSRLK-----RKATKSSLRYEVRVGDAIPESVDWRKEGA 137

Query: 135 VTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDD 187
           V  VK QG C        + AVEGIN I    L++LSEQ+LVDC T+  N GC GG MD 
Sbjct: 138 VAEVKDQGSCGSCWAFSTIGAVEGINKIVTGDLITLSEQELVDCDTS-YNEGCNGGLMDY 196

Query: 188 AFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQ 247
           AF++II N GI  +  Y Y+G+  G CD  +       I  YEDVP N EESL KA+++Q
Sbjct: 197 AFEFIINNGGIDTEEDYPYKGVD-GRCDQTRKNAKVVTIDLYEDVPANSEESLKKALSHQ 255

Query: 248 PVSVAIDAS--ALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGE 305
           P+SVAI+    A Q Y  G+F+G C T L+HGV AVGYGT E G  YW++KNSWG  WGE
Sbjct: 256 PISVAIEGGGRAFQLYDSGIFDGICGTDLDHGVVAVGYGT-ENGKDYWIVKNSWGTSWGE 314

Query: 306 DGYFRLQRDIDQPQGQCGIAMFASFPV 332
            GY R++R+I    G+CGIA+  S+P+
Sbjct: 315 SGYIRMERNIASSAGKCGIAVEPSYPI 341


>gi|297852302|ref|XP_002894032.1| F2G19.31/F2G19.31 [Arabidopsis lyrata subsp. lyrata]
 gi|297339874|gb|EFH70291.1| F2G19.31/F2G19.31 [Arabidopsis lyrata subsp. lyrata]
          Length = 455

 Score =  258 bits (658), Expect = 3e-66,   Method: Compositional matrix adjust.
 Identities = 141/310 (45%), Positives = 190/310 (61%), Gaps = 18/310 (5%)

Query: 34  FEQWKAQYGRTYKESA--ENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQ 91
           +E W  ++G+   +++  E  +RFEIFKDNL  ++  N     N SY L L +FADLT  
Sbjct: 43  YEAWLVKHGKAQNQNSLVEKDRRFEIFKDNLRFIDDHNKK---NLSYRLGLTRFADLTND 99

Query: 92  EFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQCA------ 145
           E+ +   G KM +     + +         ++P S++W +KGAV  VK QG C       
Sbjct: 100 EYRSKYLGAKM-EKKGERRTSQRYEARVGDELPESIDWRKKGAVAEVKDQGSCGSCWAFS 158

Query: 146 -VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVY 204
            + AVEGIN I    L++LSEQ+LVDC T+  N GC GG MD AF++II+N GI  D  Y
Sbjct: 159 TIGAVEGINQIVTGDLITLSEQELVDCDTS-YNEGCNGGLMDYAFEFIIKNGGIDTDKDY 217

Query: 205 SYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDAS--ALQFYS 262
            Y+G+  G CD I+       I +YEDVP   EESL KAVA+QPVSVAI+A   A Q Y 
Sbjct: 218 PYKGVD-GTCDQIRKNAKVVTIDSYEDVPTYSEESLKKAVAHQPVSVAIEAGGRAFQLYD 276

Query: 263 GGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQC 322
            G+F+G C T L+HGV AVGYGT E G  YW+++NSWG+ WGE GY ++ R+I    G+C
Sbjct: 277 SGIFDGTCGTQLDHGVVAVGYGT-ENGKDYWIVRNSWGKSWGESGYLKMARNIASSSGKC 335

Query: 323 GIAMFASFPV 332
           GIA+  S+P+
Sbjct: 336 GIAIEPSYPI 345


>gi|449450419|ref|XP_004142960.1| PREDICTED: vignain-like [Cucumis sativus]
          Length = 345

 Score =  258 bits (658), Expect = 3e-66,   Method: Compositional matrix adjust.
 Identities = 154/327 (47%), Positives = 208/327 (63%), Gaps = 28/327 (8%)

Query: 25  FDEGSIAEKFEQWK--AQYGRTY---KESAENSKRFEIFKDNLVAVERFNNAAIGNRSYT 79
           FDE  +A +   W+   ++G+ +   +   E  KRF +FK+N+  V   N     ++ Y 
Sbjct: 26  FDEKELATEESLWQLYERWGKHHTISRNLKEKHKRFSVFKENVNHVFTVNQM---DKPYK 82

Query: 80  LRLNKFADLTPQEFI----ASQTGFKMSDHSSSLKANGTPFLY-KSSQVPPSVNWIEKGA 134
           L+LNKFAD++  EF+     S        H     A G  F+Y + + +P SV+W E+GA
Sbjct: 83  LKLNKFADMSNYEFVNFYARSNISHYRKLHERRRGAGG--FMYEQDTDLPSSVDWRERGA 140

Query: 135 VTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDD 187
           V  VK QG+C       +VAAVEGIN IK N+L+SLSEQ+L+DC  N  N GC GGFM+ 
Sbjct: 141 VNAVKEQGRCGSCWAFSSVAAVEGINKIKTNQLLSLSEQELLDC--NYRNKGCNGGFMEI 198

Query: 188 AFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQ 247
           AF +I +N GI  +  Y Y G S G+C S +      +I  YE VP N E++L++AVANQ
Sbjct: 199 AFDFIKRNGGIATENSYPYHG-SRGLCRSSRISSPIVKIDGYESVPEN-EDALMQAVANQ 256

Query: 248 PVSVAIDASA--LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGE 305
           PVSVAIDA+    QFYS GVF+GYC T LNHGV A+GYGT+E+G  YWL++NSWG  WGE
Sbjct: 257 PVSVAIDAAGRDFQFYSQGVFDGYCGTELNHGVVAIGYGTTEDGTDYWLVRNSWGVGWGE 316

Query: 306 DGYFRLQRDIDQPQGQCGIAMFASFPV 332
           DGY R++R ++Q +G CGIAM AS+P+
Sbjct: 317 DGYVRMKRGVEQAEGLCGIAMEASYPI 343


>gi|22093636|dbj|BAC06931.1| putative cysteine proteinase [Oryza sativa Japonica Group]
 gi|50510021|dbj|BAD30633.1| putative cysteine proteinase [Oryza sativa Japonica Group]
          Length = 352

 Score =  258 bits (658), Expect = 3e-66,   Method: Compositional matrix adjust.
 Identities = 147/342 (42%), Positives = 206/342 (60%), Gaps = 24/342 (7%)

Query: 8   VVLIISGSCASQATYRTFDE-GSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVE 66
           ++L+++G  ++ A        G++  + ++W A++GRTYK++AE ++RF +FK N+  ++
Sbjct: 15  LLLVVAGGLSTMAKVTMASRAGTMEARHDKWMAEHGRTYKDAAEKARRFRVFKANVDLID 74

Query: 67  RFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPS 126
           R N  A GN+ Y L  N+F DLT  EF A  TG+  ++   +     T    +  Q P  
Sbjct: 75  RSN--AAGNKRYRLATNRFTDLTDAEFAAMYTGYNPANTMYAAANATTRLSSEDDQQPAE 132

Query: 127 VNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNG 179
           V+W ++GAVT VK Q  C        VAAVEGI+ I    LVSLSEQQL+DCA   +N G
Sbjct: 133 VDWRQQGAVTGVKNQRSCGCCWAFSTVAAVEGIHQITTGELVSLSEQQLLDCA---DNGG 189

Query: 180 CYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICD---SIKAEDHAAQITNYEDVPPND 236
           C GG +D+AF+Y+  + G+T +A Y+Y+G + G C    S  A   AA I+ Y+ V PND
Sbjct: 190 CTGGSLDNAFQYMANSGGVTTEAAYAYQG-AQGACQFDASSSASGVAATISGYQRVNPND 248

Query: 237 EESLLKAVANQPVSVAIDASALQF--YSGGVFNG-YCETFLNHGVTAVGYGTSEEGI--- 290
           E SL  AVA+QPVSVAI+ S   F  Y  GVF    C T L+H V  VGYG   +G    
Sbjct: 249 EGSLAAAVASQPVSVAIEGSGAMFRHYGSGVFTADSCGTKLDHAVAVVGYGAEADGSGGG 308

Query: 291 KYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
            YW+IKNSWG  WG+ GY +L++D+   QG CG+AM  S+PV
Sbjct: 309 GYWIIKNSWGTTWGDGGYMKLEKDVG-SQGACGVAMAPSYPV 349


>gi|224081756|ref|XP_002306486.1| predicted protein [Populus trichocarpa]
 gi|222855935|gb|EEE93482.1| predicted protein [Populus trichocarpa]
          Length = 352

 Score =  258 bits (658), Expect = 4e-66,   Method: Compositional matrix adjust.
 Identities = 152/326 (46%), Positives = 206/326 (63%), Gaps = 23/326 (7%)

Query: 34  FEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEF 93
           ++ W A++G+ Y    E ++RFEIFK+NL  ++  N+    N +Y + L KFADLT +E+
Sbjct: 4   YKWWLAKHGKAYNGLGEEAERFEIFKNNLRFIDEHNSQ---NHTYKVGLTKFADLTNEEY 60

Query: 94  IASQTGFKMSDHSSSLKANGTP---FLYKSS-QVPPSVNWIEKGAVTPVKYQGQCA---- 145
            A   G + SD    L  + +P   + +K+  ++P SV+W  KGAV P+K QG C     
Sbjct: 61  RAMFLGTR-SDAKRRLMKSKSPSERYAFKAGDKLPESVDWRAKGAVNPIKDQGSCGSCWA 119

Query: 146 ---VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDA 202
              VAAVEGIN I    L+SLSEQ+LVDC     N GC GG MD AF++II N G+  + 
Sbjct: 120 FSTVAAVEGINQIVTGELISLSEQELVDC-DRTYNAGCNGGLMDYAFQFIINNGGLDTEK 178

Query: 203 VYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDAS--ALQF 260
            Y Y G     CD  K +  A  I  +EDV P DE++L KAVA+QPVSVAI+AS  ALQF
Sbjct: 179 DYPYVGDDD-KCDKDKMKTKAVSIDGFEDVLPYDEKALQKAVAHQPVSVAIEASGMALQF 237

Query: 261 YSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDI-DQPQ 319
           Y  GVF G C T L+HGV  VGY  SE G+ YWL++NSWG +WGE GY ++QR++ D   
Sbjct: 238 YQSGVFTGECGTALDHGVVVVGY-ASENGLDYWLVRNSWGTEWGEHGYIKMQRNVGDTYT 296

Query: 320 GQCGIAMFASFPVS--KESAQPSSAD 343
           G+CGIAM +S+PV   + +A+P+ A+
Sbjct: 297 GRCGIAMESSYPVKNGENTAKPNLAE 322


>gi|146215986|gb|ABQ10195.1| actinidin Act2d [Actinidia eriantha]
          Length = 381

 Score =  258 bits (658), Expect = 4e-66,   Method: Compositional matrix adjust.
 Identities = 147/350 (42%), Positives = 203/350 (58%), Gaps = 20/350 (5%)

Query: 1   MAKYFLIVVLIISGSC-ASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFK 59
           M+  F   +LI+S +     +  RT D+  +   +E W  + G++Y    E   RFEIFK
Sbjct: 12  MSLLFFSTLLILSSALDIKNSVQRTNDQ--VMAMYESWLVEQGKSYNSLDEKEMRFEIFK 69

Query: 60  DNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYK 119
           +NL  ++  N  A  NRSY+L LN+FADLT +E+ ++  GFK S   + +     P +  
Sbjct: 70  ENLRIIDDHN--ADANRSYSLGLNRFADLTDEEYRSTYLGFK-SGPKAKVSNRYVPKV-- 124

Query: 120 SSQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCA 172
              +P  V+W   GAV  VK QG C       AVAAVEGIN I    L+SLSEQ+LVDC 
Sbjct: 125 GVVLPNYVDWRTVGAVVGVKDQGLCSSCWAFSAVAAVEGINKIVTGNLISLSEQELVDCG 184

Query: 173 TNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDV 232
                 GC  G+M+DAF++II N GI  +  Y Y     G CD  +       I NYE +
Sbjct: 185 RTQRTRGCNRGYMNDAFQFIIDNGGINTEDNYPYTAQD-GQCDWYRKNQRYVTIDNYEQL 243

Query: 233 PPNDEESLLKAVANQPVSVAIDASALQF--YSGGVFNGYCETFLNHGVTAVGYGTSEEGI 290
           P N+E  L  AVA QP++V +++   +F  Y+ G++ GYC T ++HGVT VGYGT E G+
Sbjct: 244 PANNEWVLQNAVAYQPITVGLESEGGKFKLYTSGIYTGYCGTAIDHGVTIVGYGT-ERGL 302

Query: 291 KYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVSKESAQPS 340
            YW++KNSWG +WGE+GY R+QR+I    G+CGIAM  S+PV      P+
Sbjct: 303 DYWIVKNSWGTNWGENGYIRIQRNIGGA-GKCGIAMVPSYPVKYSYQNPN 351


>gi|359359166|gb|AEV41071.1| putative oryzain beta chain precursor [Oryza minuta]
          Length = 464

 Score =  258 bits (658), Expect = 4e-66,   Method: Compositional matrix adjust.
 Identities = 146/312 (46%), Positives = 198/312 (63%), Gaps = 22/312 (7%)

Query: 34  FEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERF---NNAAIGNRSYTLRLNKFADLTP 90
           ++ W A+ GR+Y    E+ +RF +F DNL    RF   +NA   +  + L +N+FADLT 
Sbjct: 53  YDLWLAENGRSYNALGEHERRFRVFWDNL----RFADAHNARADDHGFRLGMNRFADLTN 108

Query: 91  QEFIASQTGFKMSDHSSSLKANGTPFLYKS-SQVPPSVNWIEKGAVTPVKYQGQC----- 144
           +EF A+  G K+ + S   +A G  + +    ++P SV+W EKGAV PVK QGQC     
Sbjct: 109 EEFRATFLGAKVVERS---RAAGERYRHDGVEELPESVDWREKGAVAPVKNQGQCGSCWA 165

Query: 145 --AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDA 202
             AV+ VE IN +    +++LSEQ+LV+C+TN  N GC GG MDDAF +II+N GI  + 
Sbjct: 166 FSAVSTVESINQLVTGEMITLSEQELVECSTNGQNGGCNGGLMDDAFDFIIKNGGIDTED 225

Query: 203 VYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA--LQF 260
            Y Y+ +  G CD  +       I  +EDVP NDE+SL KAVA+QPVSVAI+A     Q 
Sbjct: 226 DYPYKAVD-GKCDINRENAKVVSIDGFEDVPQNDEKSLQKAVAHQPVSVAIEAGGREFQL 284

Query: 261 YSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQG 320
           Y  GVF+G C T L+HGV AVGYGT + G  YW+++NSWG  WGE GY R++R+I+   G
Sbjct: 285 YHSGVFSGRCGTSLDHGVVAVGYGT-DNGKDYWIVRNSWGPKWGESGYVRMERNINVTTG 343

Query: 321 QCGIAMFASFPV 332
           +CGIAM AS+P 
Sbjct: 344 KCGIAMMASYPT 355


>gi|944916|gb|AAA74430.1| cysteine proteinase [Mesembryanthemum crystallinum]
          Length = 367

 Score =  258 bits (658), Expect = 4e-66,   Method: Compositional matrix adjust.
 Identities = 142/321 (44%), Positives = 199/321 (61%), Gaps = 24/321 (7%)

Query: 29  SIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADL 88
           ++ + +E+W++ Y  + +   E   RF +FK+N+  +   N     ++ Y LRLN+F DL
Sbjct: 39  TLWDLYERWRSVY-TSARSFGEKQNRFHVFKENVKYINEVNKM---DKPYKLRLNQFGDL 94

Query: 89  TPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQC---- 144
           TP EF  +    K+ + + +       F+Y++ +VP S++W  KGAVTPVK QG+C    
Sbjct: 95  TPSEFARTYANSKIIEGTRNESGG---FMYENVEVPRSIDWRVKGAVTPVKNQGRCGGCW 151

Query: 145 ---AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITND 201
              A AAVEGIN I   +L+SLSEQQL+DC T   N+GC GG M  AF+YI Q  GIT++
Sbjct: 152 AFSAAAAVEGINQITTGQLISLSEQQLIDCDTQ--NSGCRGGTMGRAFEYIKQRGGITSE 209

Query: 202 AVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASAL--- 258
           A Y Y+  + G+C +   +     I  Y ++    E+++LK +A+QPVSVA+DA+     
Sbjct: 210 ANYPYKAQA-GMCKNNLIQRPTVSIDGYYNIR-RSEDAVLKILAHQPVSVAVDATTWSSL 267

Query: 259 --QFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDID 316
              FY  GVF G C T LNHGVTAVGYGT+ +G  YW+IKNSWG+ WGE GY R+ R + 
Sbjct: 268 DWMFYFQGVFTGPCGTKLNHGVTAVGYGTTNDGYDYWIIKNSWGETWGERGYMRMLRGVS 327

Query: 317 QPQGQCGIAMFASFPVSKESA 337
            P G CGIAM ASFP+ + SA
Sbjct: 328 -PYGLCGIAMQASFPIKRVSA 347


>gi|18141285|gb|AAL60580.1|AF454958_1 senescence-associated cysteine protease [Brassica oleracea]
          Length = 485

 Score =  257 bits (657), Expect = 5e-66,   Method: Compositional matrix adjust.
 Identities = 141/327 (43%), Positives = 192/327 (58%), Gaps = 23/327 (7%)

Query: 18  SQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRS 77
           +  T  +  +  ++  +E+W  ++G+      E  +RFEIFKDNL  ++  N     N S
Sbjct: 32  NHHTVSSRSDAEVSRLYEEWLVKHGKAQNSLTEKDRRFEIFKDNLRFIDEHNGK---NLS 88

Query: 78  YTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYK---SSQVPPSVNWIEKGA 134
           Y L L KFADLT  E+ +   G ++       KA  +   Y+      +P SV+W ++GA
Sbjct: 89  YRLGLTKFADLTNDEYRSMYLGSRLKR-----KATKSSLRYEVRVGDAIPESVDWRKEGA 143

Query: 135 VTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDD 187
           V  VK QG C        + AVEGIN I    L++LSEQ+LVDC T+  N GC GG MD 
Sbjct: 144 VAEVKDQGSCGSCWAFSTIGAVEGINKIVTGDLITLSEQELVDCDTS-YNEGCNGGLMDY 202

Query: 188 AFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQ 247
           AF++II N GI  +  Y Y+G+  G CD  +       I  YEDVP N EESL KA+++Q
Sbjct: 203 AFEFIINNGGIDTEEDYPYKGVD-GRCDQTRKNAKVVTIDLYEDVPANSEESLKKALSHQ 261

Query: 248 PVSVAIDAS--ALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGE 305
           P+SVAI+    A Q Y  G+F+G C T L+HGV AVGYGT E G  YW++KNSWG  WGE
Sbjct: 262 PISVAIEGGGRAFQLYDSGIFDGICGTDLDHGVVAVGYGT-ENGKDYWIVKNSWGTSWGE 320

Query: 306 DGYFRLQRDIDQPQGQCGIAMFASFPV 332
            GY R++R+I    G+CGIA+  S+P+
Sbjct: 321 SGYIRMERNIASSAGKCGIAVEPSYPI 347


>gi|218198967|gb|EEC81394.1| hypothetical protein OsI_24614 [Oryza sativa Indica Group]
          Length = 342

 Score =  257 bits (657), Expect = 5e-66,   Method: Compositional matrix adjust.
 Identities = 147/342 (42%), Positives = 206/342 (60%), Gaps = 24/342 (7%)

Query: 8   VVLIISGSCASQATYRTFDE-GSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVE 66
           ++L+++G  ++ A        G++  + ++W A++GRTYK++AE ++RF +FK N+  ++
Sbjct: 5   LLLVVAGGLSTMAKVTMASRAGTMEARHDKWMAEHGRTYKDAAEKARRFRVFKANVDLID 64

Query: 67  RFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPS 126
           R N  A GN+ Y L  N+F DLT  EF A  TG+  ++   +     T    +  Q P  
Sbjct: 65  RSN--AAGNKRYRLATNRFTDLTDAEFAAMYTGYNPANTMYAAANATTRLSSEDDQQPAE 122

Query: 127 VNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNG 179
           V+W ++GAVT VK Q  C        VAAVEGI+ I    LVSLSEQQL+DCA   +N G
Sbjct: 123 VDWRQQGAVTGVKNQRSCGCCWAFSTVAAVEGIHQITTGELVSLSEQQLLDCA---DNGG 179

Query: 180 CYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICD---SIKAEDHAAQITNYEDVPPND 236
           C GG +D+AF+Y+  + G+T +A Y+Y+G + G C    S  A   AA I+ Y+ V PND
Sbjct: 180 CTGGSLDNAFQYMANSGGVTTEAAYAYQG-AQGACQFDASSSASGVAATISGYQRVNPND 238

Query: 237 EESLLKAVANQPVSVAIDASALQF--YSGGVFNG-YCETFLNHGVTAVGYGTSEEGI--- 290
           E SL  AVA+QPVSVAI+ S   F  Y  GVF    C T L+H V  VGYG   +G    
Sbjct: 239 EGSLAAAVASQPVSVAIEGSGAMFRHYGSGVFTADSCGTKLDHAVAVVGYGAEADGSGGG 298

Query: 291 KYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
            YW+IKNSWG  WG+ GY +L++D+   QG CG+AM  S+PV
Sbjct: 299 GYWIIKNSWGTTWGDGGYMKLEKDVG-SQGACGVAMAPSYPV 339


>gi|222629922|gb|EEE62054.1| hypothetical protein OsJ_16838 [Oryza sativa Japonica Group]
          Length = 336

 Score =  257 bits (657), Expect = 5e-66,   Method: Compositional matrix adjust.
 Identities = 149/305 (48%), Positives = 185/305 (60%), Gaps = 20/305 (6%)

Query: 41  YGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGF 100
           Y + Y    E  +RFE+FKDNL  ++  N       SY L LN+FADLT  EF A+  G 
Sbjct: 36  YRKAYASFEEKVRRFEVFKDNLNHIDDINKKVT---SYWLGLNEFADLTHDEFKATYLGL 92

Query: 101 KMS-DHSSSLKANGTPFLY---KSSQVPPSVNWIEKGAVTPVKYQGQCA-------VAAV 149
                 S+S   +   F Y    + +VP  ++W +K AVT VK QGQC        VAAV
Sbjct: 93  TPPPTRSNSKHYSSEEFRYGKMSNGEVPKEMDWRKKNAVTEVKNQGQCGSCWAFSTVAAV 152

Query: 150 EGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGM 209
           EGINAI    L SLSEQ+L+DC+T D NNGC GG MD AF YI    G+  +  Y Y  M
Sbjct: 153 EGINAIVTGNLTSLSEQELIDCST-DGNNGCNGGLMDYAFSYIASTGGLRTEEAYPY-AM 210

Query: 210 STGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA--LQFYSGGVFN 267
             G CD  K       I+ YEDVP NDE++L+KA+A+QPVSVAI+AS    QFYSGGVF+
Sbjct: 211 EEGDCDEGKGA-AVVTISGYEDVPANDEQALVKALAHQPVSVAIEASGRHFQFYSGGVFD 269

Query: 268 GYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMF 327
           G C   L+HGVTAVGYGTS+ G  Y ++KNSWG  WGE GY R++R   + +G CGI   
Sbjct: 270 GPCGEQLDHGVTAVGYGTSK-GQDYIIVKNSWGPHWGEKGYIRMKRGTGKGEGLCGINKM 328

Query: 328 ASFPV 332
           AS+P 
Sbjct: 329 ASYPT 333


>gi|18391078|ref|NP_563855.1| xylem bark cysteine peptidase 3 [Arabidopsis thaliana]
 gi|110741821|dbj|BAE98853.1| papain-like cysteine peptidase XBCP3 [Arabidopsis thaliana]
 gi|111074448|gb|ABH04597.1| At1g09850 [Arabidopsis thaliana]
 gi|332190386|gb|AEE28507.1| xylem bark cysteine peptidase 3 [Arabidopsis thaliana]
          Length = 437

 Score =  257 bits (657), Expect = 5e-66,   Method: Compositional matrix adjust.
 Identities = 142/312 (45%), Positives = 189/312 (60%), Gaps = 15/312 (4%)

Query: 30  IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLT 89
           I+E F+ W  ++G+TY    E  +R +IFKDN   V + N   I N +Y+L LN FADLT
Sbjct: 28  ISELFDDWCQKHGKTYGSEEERQQRIQIFKDNHDFVTQHN--LITNATYSLSLNAFADLT 85

Query: 90  PQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQC----- 144
             EF AS+ G  +S  S  + + G   L  S +VP SV+W +KGAVT VK QG C     
Sbjct: 86  HHEFKASRLGLSVSAPSVIMASKGQS-LGGSVKVPDSVDWRKKGAVTNVKDQGSCGACWS 144

Query: 145 --AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDA 202
             A  A+EGIN I    L+SLSEQ+L+DC     N GC GG MD AF+++I+N GI  + 
Sbjct: 145 FSATGAMEGINQIVTGDLISLSEQELIDC-DKSYNAGCNGGLMDYAFEFVIKNHGIDTEK 203

Query: 203 VYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDAS--ALQF 260
            Y Y+    G C   K +     I +Y  V  NDE++L++AVA QPVSV I  S  A Q 
Sbjct: 204 DYPYQERD-GTCKKDKLKQKVVTIDSYAGVKSNDEKALMEAVAAQPVSVGICGSERAFQL 262

Query: 261 YSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQG 320
           YS G+F+G C T L+H V  VGYG S+ G+ YW++KNSWG+ WG DG+  +QR+ +   G
Sbjct: 263 YSSGIFSGPCSTSLDHAVLIVGYG-SQNGVDYWIVKNSWGKSWGMDGFMHMQRNTENSDG 321

Query: 321 QCGIAMFASFPV 332
            CGI M AS+P+
Sbjct: 322 VCGINMLASYPI 333


>gi|146215984|gb|ABQ10194.1| actinidin Act2c [Actinidia arguta]
          Length = 378

 Score =  256 bits (655), Expect = 8e-66,   Method: Compositional matrix adjust.
 Identities = 151/348 (43%), Positives = 206/348 (59%), Gaps = 24/348 (6%)

Query: 1   MAKYFLIVVLIISGSC-ASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFK 59
           M+  F   +LI+S +     +  RT D+  + + +E W  + G++Y    E   RFEIFK
Sbjct: 10  MSLLFFSTLLILSSALDIVNSAQRTNDQ--VRDMYESWLVEQGKSYNSLDEKEMRFEIFK 67

Query: 60  DNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKAN-GTPFLY 118
           DNL  ++  N  A  NRS++L LN+FADLT +E+ ++  GFK     S  KA     ++ 
Sbjct: 68  DNLRIIDDHN--ADANRSFSLGLNRFADLTDEEYRSTYLGFK-----SGPKAKVSNRYVP 120

Query: 119 KSSQVPPS-VNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVD 170
           K   V P+ V+W   GAV  VK QG C       AVAAVEGIN I    L+SLSEQ+LVD
Sbjct: 121 KVGDVLPNYVDWRTVGAVVGVKNQGLCSSCWAFSAVAAVEGINKIMTGNLLSLSEQELVD 180

Query: 171 CATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYE 230
           C    +  GC  G+M DAF++II N GI  +  Y Y     G C+          I +YE
Sbjct: 181 CGRTQSTRGCNRGYMTDAFQFIINNGGINTEDNYPYTAQD-GQCNRYLQNQKYVTIDDYE 239

Query: 231 DVPPNDEESLLKAVANQPVSVAIDASALQF--YSGGVFNGYCETFLNHGVTAVGYGTSEE 288
           +VP N+E +L  AVA+QPVSV +++   +F  Y+ G+F  YC T ++HGVT VGYGT E 
Sbjct: 240 NVPSNNEWALQNAVAHQPVSVGLESEGGKFKLYTSGIFTQYCGTAIDHGVTIVGYGT-ER 298

Query: 289 GIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVSKES 336
           G+ YW++KNSWG +WGE+GY R+QR+I    G+CGIA  AS+PV   S
Sbjct: 299 GLDYWIVKNSWGTNWGENGYIRIQRNIGGA-GKCGIARMASYPVKYNS 345


>gi|14600257|gb|AAK71314.1|AF388175_1 papain-like cysteine peptidase XBCP3 [Arabidopsis thaliana]
          Length = 437

 Score =  256 bits (655), Expect = 9e-66,   Method: Compositional matrix adjust.
 Identities = 142/312 (45%), Positives = 189/312 (60%), Gaps = 15/312 (4%)

Query: 30  IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLT 89
           I+E F+ W  ++G+TY    E  +R +IFKDN   V + N   I N +Y+L LN FADLT
Sbjct: 28  ISELFDDWCQKHGKTYGSEEERQQRIQIFKDNHDFVTQHN--LITNATYSLSLNAFADLT 85

Query: 90  PQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQC----- 144
             EF AS+ G  +S  S  + + G   L  S +VP SV+W +KGAVT VK QG C     
Sbjct: 86  HHEFKASRLGLSVSAPSVIMASKGQS-LGGSVKVPDSVDWRKKGAVTNVKDQGSCGACWS 144

Query: 145 --AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDA 202
             A  A+EGIN I    L+SLSEQ+L+DC     N GC GG MD AF+++I+N GI  + 
Sbjct: 145 FSATGAMEGINQIVTGDLISLSEQELIDC-DKSYNAGCNGGLMDYAFEFVIKNHGIDTEK 203

Query: 203 VYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDAS--ALQF 260
            Y Y+    G C   K +     I +Y  V  NDE++L++AVA QPVSV I  S  A Q 
Sbjct: 204 DYPYQERD-GTCKKDKLKQKVVTIDSYAGVKSNDEKALMEAVAAQPVSVGICGSERAFQL 262

Query: 261 YSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQG 320
           YS G+F+G C T L+H V  VGYG S+ G+ YW++KNSWG+ WG DG+  +QR+ +   G
Sbjct: 263 YSRGIFSGPCSTSLDHAVLIVGYG-SQNGVDYWIVKNSWGKSWGMDGFMHMQRNTENSDG 321

Query: 321 QCGIAMFASFPV 332
            CGI M AS+P+
Sbjct: 322 VCGINMLASYPI 333


>gi|220983358|dbj|BAH11164.1| cysteine protease [Hordeum vulgare]
          Length = 462

 Score =  256 bits (654), Expect = 1e-65,   Method: Compositional matrix adjust.
 Identities = 146/325 (44%), Positives = 196/325 (60%), Gaps = 22/325 (6%)

Query: 22  YRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAA-IGNRSYTL 80
           Y    E  +   + +W A++  TY    E  +RFE F++NL  +++ N AA  G  S+ L
Sbjct: 30  YGERSEEEVRRMYAEWMAEHHSTYNPIGEEERRFEAFRNNLRYIDQHNAAADAGVHSFRL 89

Query: 81  RLNKFADLTPQEFIASQTGFKMS-DHSSSLKANGTPFLYKSS---QVPPSVNWIEKGAVT 136
            LN+FADLT +E+ ++  G +   D    L A      Y+++   ++P SV+W +KGAV 
Sbjct: 90  GLNRFADLTNEEYRSTYLGARTKPDRERKLSAR-----YQAADNDELPESVDWRKKGAVG 144

Query: 137 PVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAF 189
            VK QG C       A+AAVEGIN I    ++ LSEQ+LVDC T+  N GC GG MD AF
Sbjct: 145 AVKDQGGCGSCWAFSAIAAVEGINQIVTGDMIPLSEQELVDCDTS-YNQGCNGGLMDYAF 203

Query: 190 KYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPV 249
           ++II N GI ++  Y Y+      CD+ K       I  YEDVP N E+SL KAVANQP+
Sbjct: 204 EFIINNGGIDSEEDYPYKERDN-RCDANKKNAKVVTIDGYEDVPVNSEKSLQKAVANQPI 262

Query: 250 SVAIDAS--ALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDG 307
           SVAI+A   A Q Y  G+F G C T L+HGV AVGYGT E G  YWL++NSWG  WGE+G
Sbjct: 263 SVAIEAGGRAFQLYKSGIFTGTCGTALDHGVAAVGYGT-ENGKDYWLVRNSWGSVWGENG 321

Query: 308 YFRLQRDIDQPQGQCGIAMFASFPV 332
           Y R++R+I    G+CGIA+  S+P 
Sbjct: 322 YIRMERNIKASSGKCGIAVEPSYPT 346


>gi|388519351|gb|AFK47737.1| unknown [Medicago truncatula]
          Length = 359

 Score =  256 bits (654), Expect = 1e-65,   Method: Compositional matrix adjust.
 Identities = 147/341 (43%), Positives = 200/341 (58%), Gaps = 22/341 (6%)

Query: 6   LIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAV 65
           L+   +I+ S A   + R+ +E  +   +E+W  ++ + Y    E  +RFEIFKDNL  +
Sbjct: 9   LLFFSLITLSLAMDTSMRSNEE--VMTMYEEWLVKHHKVYNGLGEKDQRFEIFKDNLGFI 66

Query: 66  ERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLK---ANGTPFLYKSS- 121
           +  N     N +Y + LNKFAD T +E+     G K     + +K     G  + + S  
Sbjct: 67  DEHNAQ---NYTYKVGLNKFADTTNEEYRNMYLGTKNDAKRNVMKIKITTGHRYAFNSGD 123

Query: 122 QVPPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATN 174
           ++P  V+W  KGAV  +K QG C        +A VE IN I   +LVSLSEQ+LVDC   
Sbjct: 124 RLPVHVDWRSKGAVAHIKDQGSCGSCWAFSTIATVEAINKIVTGKLVSLSEQELVDC-DR 182

Query: 175 DNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPP 234
             N GC GG MD AF++I++N GI  +  Y Y+G   G CD  +       I  YEDVP 
Sbjct: 183 AFNEGCNGGLMDYAFEFIVENGGIDTEQDYPYKGFE-GRCDPTRKNAKVVSIDGYEDVPA 241

Query: 235 NDEESLLKAVANQPVSVAIDAS--ALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKY 292
            +E +L KAV +QPVSVAI+A   ALQ Y  GVF G C T L+HGV  VGYG  E G+ Y
Sbjct: 242 YNENALKKAVFHQPVSVAIEAGGRALQLYQSGVFTGRCGTNLDHGVVVVGYGF-ENGVDY 300

Query: 293 WLIKNSWGQDWGEDGYFRLQRDIDQPQ-GQCGIAMFASFPV 332
           WL++NSWG +WGEDGYF+L+R++ +   G+CGIAM AS+PV
Sbjct: 301 WLVRNSWGTNWGEDGYFKLERNVKKINTGKCGIAMQASYPV 341


>gi|297843430|ref|XP_002889596.1| hypothetical protein ARALYDRAFT_887827 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297335438|gb|EFH65855.1| hypothetical protein ARALYDRAFT_887827 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 343

 Score =  256 bits (654), Expect = 1e-65,   Method: Compositional matrix adjust.
 Identities = 145/338 (42%), Positives = 194/338 (57%), Gaps = 20/338 (5%)

Query: 5   FLIVVLIISGSCASQATYRTFD-EGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLV 63
            +  VLI S  C+  ++   +D   ++ ++FE+W   + + Y    E   RF I++ N+ 
Sbjct: 15  LICFVLIASKLCSVNSS--VYDPHKTLKQRFEKWLKTHSKLYGGRDEWMLRFGIYQSNVQ 72

Query: 64  AVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQV 123
            ++  N+  +    + L  N+FAD+T  EF A   G   S  S  L     P    +  V
Sbjct: 73  LIDYINSLHL---PFKLTDNRFADMTNSEFKAHFLGLNTS--SLRLHKKQRPVCDPAGNV 127

Query: 124 PPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDN 176
           P +V+W  +GAVTP++ QG+C       AVAA+EGIN IK   LVSLSEQQL+DC     
Sbjct: 128 PDAVDWRTQGAVTPIRNQGKCGGCWAFSAVAAIEGINKIKTGNLVSLSEQQLIDCDVGTY 187

Query: 177 NNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPND 236
           N GC GG M+ AF++I  N G+T +  Y Y G+  G CD  KA++    I  Y+ V  N 
Sbjct: 188 NKGCSGGLMETAFEFIKSNGGLTTETDYPYTGIE-GTCDQEKAKNKVVTIQGYQKVAQN- 245

Query: 237 EESLLKAVANQPVSVAIDASA--LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWL 294
           E SL  A A QPVSV IDA     Q YS GVF  YC T LNHGVT VGYG  E   KYW+
Sbjct: 246 EASLQIAAAQQPVSVGIDAGGFIFQLYSSGVFTSYCGTNLNHGVTVVGYGV-EGDQKYWI 304

Query: 295 IKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
           +KNSWG  WGE+GY R++R I +  G+CGIAM AS+P+
Sbjct: 305 VKNSWGTGWGEEGYIRMERGISEDTGKCGIAMLASYPL 342


>gi|334904467|gb|AEH26024.1| cysteine peptidase [Ananas comosus]
          Length = 352

 Score =  256 bits (653), Expect = 2e-65,   Method: Compositional matrix adjust.
 Identities = 145/345 (42%), Positives = 204/345 (59%), Gaps = 24/345 (6%)

Query: 5   FLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVA 64
           FL + L +  +  S A+ R      + ++FE+W A+YGR YK++ E  +RF+IFK+N+  
Sbjct: 9   FLFLFLCVMWASPSAAS-RDEPSDPMMKRFEEWMAEYGRVYKDNDEKMRRFQIFKNNVNH 67

Query: 65  VERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFL----YKS 120
           +E FN+   GN SYTL +N+F D+T  EF+A  TG      S  L     P +       
Sbjct: 68  IETFNSHN-GN-SYTLGINQFTDMTKSEFVAQYTG----GISRPLNIEREPVVSFDDVNI 121

Query: 121 SQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCAT 173
           S VP S++W + GAV  VK Q  C       A+A VEGI  IK   LVSLSEQ+++DCA 
Sbjct: 122 SAVPQSIDWRDYGAVNEVKNQNPCGSCWAFAAIATVEGIYKIKTGYLVSLSEQEVLDCAV 181

Query: 174 NDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVP 233
           +    GC GG+++ A+ +II N G+T +  Y Y+    G C++  +  ++A IT Y  V 
Sbjct: 182 S---YGCKGGWVNKAYDFIISNNGVTTEENYPYQAYQ-GTCNA-NSFPNSAYITGYSYVR 236

Query: 234 PNDEESLLKAVANQPVSVAIDASA-LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKY 292
            NDE S++ AV+NQP++  IDAS   Q+Y+GGVF+G C T LNH +T +GYG    G KY
Sbjct: 237 RNDERSMMYAVSNQPIAALIDASENFQYYNGGVFSGPCGTSLNHAITIIGYGQDSSGTKY 296

Query: 293 WLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVSKESA 337
           W+++NSWG  WGE GY R+ R +    G CGIAM   FP  +  A
Sbjct: 297 WIVRNSWGSSWGEGGYVRMARGVSSSSGACGIAMSPLFPTLQSGA 341


>gi|222425026|dbj|BAH20463.1| cysteine protease [Spinacia oleracea]
          Length = 473

 Score =  255 bits (652), Expect = 2e-65,   Method: Compositional matrix adjust.
 Identities = 146/327 (44%), Positives = 201/327 (61%), Gaps = 24/327 (7%)

Query: 23  RTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRL 82
           R+ DE  +   +E W  Q+ + Y    E  KRF IFKDNL  +++ N+    ++++ + L
Sbjct: 44  RSDDE--VMRIYESWLVQHRKNYNALGEKEKRFAIFKDNLEFIDQHNSD--DSQTFKVGL 99

Query: 83  NKFADLTPQEFIASQTG------FKMSDHSSSLKANGTPFLYKS-SQVPPSVNWIEKGAV 135
           NKFADLT +EF +   G            S+  K     +L+K   ++P +V+W + GAV
Sbjct: 100 NKFADLTNEEFRSVYLGRKKSSSSSPLLSSAKSKVKSDRYLFKEGDELPEAVDWRKNGAV 159

Query: 136 TPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDA 188
             VK QGQC        +AAVEGIN I    L+SLSEQ+LVDC T+  N+GC GG MD A
Sbjct: 160 AKVKDQGQCGSCWAFSTIAAVEGINQIVTGELLSLSEQELVDCDTS-YNSGCDGGLMDYA 218

Query: 189 FKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQP 248
           +++II N GI  DA Y Y     G CD  +       I ++EDVP NDE++L KAVA+QP
Sbjct: 219 YEFIINNGGIDTDADYPYTAKD-GKCDQYRKNAKVVTIDDFEDVPENDEKALQKAVAHQP 277

Query: 249 VSVAIDA--SALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGED 306
           VSVAI+A  S  QFY  GVF G C   L+HGV AVGYG S++G  YW+++NSWG DWGE 
Sbjct: 278 VSVAIEAGGSTFQFYQSGVFTGKCGADLDHGVVAVGYG-SDDGKDYWIVRNSWGADWGES 336

Query: 307 GYFRLQRDIDQPQ-GQCGIAMFASFPV 332
           GY R++R+++  + G+CGIA+  S+P+
Sbjct: 337 GYIRMERNLETVKTGKCGIAIEPSYPI 363


>gi|146215990|gb|ABQ10197.1| actinidin Act4a [Actinidia eriantha]
          Length = 385

 Score =  255 bits (652), Expect = 2e-65,   Method: Compositional matrix adjust.
 Identities = 147/331 (44%), Positives = 198/331 (59%), Gaps = 20/331 (6%)

Query: 23  RTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRL 82
           RT DE  +   FE W  +YG++Y    E  +RFEIFKDNL  V+  N  A  NRSY + L
Sbjct: 39  RTNDE--VIAMFESWLVEYGKSYNALGEKERRFEIFKDNLRFVDEHN--ADVNRSYKVGL 94

Query: 83  NKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQG 142
           N+F+DLT  E+ +   G K +   +++     P +    Q+P SV+W +KGAV  VK QG
Sbjct: 95  NQFSDLTDAEYSSIYLGTKFNIRMTNVSDRYEPRV--GDQLPDSVDWRKKGAVLGVKNQG 152

Query: 143 QC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQN 195
            C       ++AAVEGIN I    L+SLSEQ++VDC     NNGC GG +  A+++II N
Sbjct: 153 NCGSCWTFASIAAVEGINKIVTGNLISLSEQEIVDCQRKYPNNGCNGGTLSGAYQFIINN 212

Query: 196 KGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAI-- 253
            GI  +A Y Y G   G+CD  K       I  YE+VP N+E++L KAVA QPVSV I  
Sbjct: 213 GGINTEANYPYTGRD-GVCDQNKKNKKYVTIDRYENVPSNNEKALQKAVAFQPVSVVIAS 271

Query: 254 DASALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQR 313
           +++A + Y  G+FNG C   ++HGVT VGYGT E G  YW+++NSWG +WGE GY R+QR
Sbjct: 272 NSTAFKSYKSGIFNGPCGPRIDHGVTIVGYGT-EGGKDYWIVRNSWGPNWGESGYVRMQR 330

Query: 314 DIDQPQGQCGIAMFASFPV--SKESAQPSSA 342
           ++    G+C IA    +PV       +P SA
Sbjct: 331 NVGG-SGKCFIARAPVYPVKYGPNPTKPRSA 360


>gi|297744465|emb|CBI37727.3| unnamed protein product [Vitis vinifera]
          Length = 331

 Score =  254 bits (650), Expect = 3e-65,   Method: Compositional matrix adjust.
 Identities = 141/309 (45%), Positives = 185/309 (59%), Gaps = 37/309 (11%)

Query: 33  KFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQE 92
           +FE W +++G+ YK   E   RFE+F++NL  ++  N       SY L LN+FADL+ +E
Sbjct: 48  RFESWVSKHGKVYKSMEEKLHRFEVFRENLNHIDERNKEV---SSYWLGLNEFADLSHEE 104

Query: 93  FIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQCA------- 145
           F +                         + +P SV+W +KGAVT VK QG C        
Sbjct: 105 FKSKDV----------------------ADLPESVDWRKKGAVTHVKNQGACGSCWAFST 142

Query: 146 VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYS 205
           VAAVEGIN I    L +LSEQ+L+DC T   N+GC GG MD AF +I  N G+  +  Y 
Sbjct: 143 VAAVEGINQIVTGNLTTLSEQELIDCDTT-FNSGCNGGLMDYAFAFIASNGGLHKEDDYP 201

Query: 206 YEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA--LQFYSG 263
           Y  M  G C+  K +     I+ YEDVP  DEESLLKA+A+QP+SVAI+AS    QFYSG
Sbjct: 202 YL-MEEGTCEEQKEDVDIVTISGYEDVPEKDEESLLKALAHQPLSVAIEASGRDFQFYSG 260

Query: 264 GVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCG 323
           GVFNG C T L+HGV AVGYG+S+ G+ Y ++KNSWG  WGE GY R++R+  + +G CG
Sbjct: 261 GVFNGPCGTELDHGVAAVGYGSSK-GLDYIIVKNSWGPKWGEKGYIRMKRNTGKTEGLCG 319

Query: 324 IAMFASFPV 332
           I   AS+P 
Sbjct: 320 INKMASYPT 328


>gi|384253406|gb|EIE26881.1| hypothetical protein COCSUDRAFT_21961 [Coccomyxa subellipsoidea
           C-169]
          Length = 481

 Score =  254 bits (649), Expect = 4e-65,   Method: Compositional matrix adjust.
 Identities = 141/317 (44%), Positives = 185/317 (58%), Gaps = 17/317 (5%)

Query: 27  EGSIAEKFEQWKAQYGRTYKESAENSKR-FEIFKDNLVAVERFNNAAIGNRSYTLRLNKF 85
           +G+    F  W     + YK++ E  +R F ++ DNL  V   N     + ++ L L  F
Sbjct: 41  KGNPRAAFSDWVEHLQKAYKDNVEEYERKFSVWLDNLEFVHSHNEK---DSTFKLGLTNF 97

Query: 86  ADLTPQEFIASQTGFKMSDHSSSL-KANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQC 144
           ADLT  E+     G++     + L     T F Y   + PPS++W +KGAVT VK Q QC
Sbjct: 98  ADLTHDEYRQHALGYRPELKGTGLGTGKSTGFQYADYEAPPSIDWRKKGAVTDVKNQQQC 157

Query: 145 A-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKG 197
                     +VEG NAI    LVSLSEQ+LVDC     ++GC+GG MD AF +II+N G
Sbjct: 158 GSCWAFSTTGSVEGANAIYSGELVSLSEQELVDCDVT-QDHGCHGGLMDFAFSFIIRNGG 216

Query: 198 ITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDAS- 256
           I  +  Y Y+    G+C+  K + H   I +YEDVPPNDE +L KA ANQP+SVAI+A  
Sbjct: 217 IDTEKDYKYKAQD-GVCNIAKEKRHVVTIDSYEDVPPNDESALKKAAANQPISVAIEADQ 275

Query: 257 -ALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDI 315
              Q Y+GGVF+  C T L+HGV  VGYG S+ G  YW++KNSWG  WG+ GY RL R I
Sbjct: 276 REFQLYAGGVFDAPCGTALDHGVLVVGYG-SDNGTDYWIVKNSWGDFWGDSGYIRLARGI 334

Query: 316 DQPQGQCGIAMFASFPV 332
               GQCGIAM AS+P+
Sbjct: 335 SNSAGQCGIAMQASYPI 351


>gi|2463584|dbj|BAA22544.1| FBSB precursor [Ananas comosus]
          Length = 356

 Score =  254 bits (648), Expect = 5e-65,   Method: Compositional matrix adjust.
 Identities = 148/347 (42%), Positives = 204/347 (58%), Gaps = 28/347 (8%)

Query: 5   FLIVVLIISGSCASQATYRTFDEGS--IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNL 62
           FL + L +  +  S A+    DE S  + ++FE+W  +YGR YK++ E  +RF+IFK+N+
Sbjct: 9   FLFLFLCVMWASPSAASA---DEPSDPMMKRFEEWMVEYGRVYKDNDEKMRRFQIFKNNV 65

Query: 63  VAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFL----Y 118
             +E FN+      SYTL +N+F D+T  EFIA  TG      S  L     P +     
Sbjct: 66  NHIETFNSR--NENSYTLGINQFTDMTNNEFIAQYTG----GISRPLNIEREPVVSFDDV 119

Query: 119 KSSQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDC 171
             S VP S++W + GAVT VK Q  C       A+A VE I  IK   L  LSEQQ++DC
Sbjct: 120 DISAVPQSIDWRDYGAVTSVKNQNPCGACWAFAAIATVESIYKIKKGILEPLSEQQVLDC 179

Query: 172 ATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYED 231
           A      GC GG+   AF++II NKG+ + A+Y Y+  + G C +     ++A IT Y  
Sbjct: 180 A---KGYGCKGGWEFRAFEFIISNKGVASGAIYPYKA-AKGTCKT-NGVPNSAYITGYAR 234

Query: 232 VPPNDEESLLKAVANQPVSVAIDASA-LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGI 290
           VP N+E S++ AV+ QP++VA+DA+A  Q+Y  GVFNG C T LNH VTA+GYG    G 
Sbjct: 235 VPRNNESSMMYAVSKQPITVAVDANANFQYYKSGVFNGPCGTSLNHAVTAIGYGQDSNGK 294

Query: 291 KYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVSKESA 337
           KYW++KNSWG  WGE GY R+ RD+    G CGIA+ + +P  +  A
Sbjct: 295 KYWIVKNSWGARWGEAGYIRMARDVSSSSGICGIAIDSLYPTLESRA 341


>gi|374713651|gb|AEZ65083.1| cysteine protease [Carica papaya]
          Length = 467

 Score =  254 bits (648), Expect = 5e-65,   Method: Compositional matrix adjust.
 Identities = 145/329 (44%), Positives = 196/329 (59%), Gaps = 21/329 (6%)

Query: 17  ASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNR 76
           A ++++RT DE  +   +E W  ++G+ Y    E  KRF IFKDNL  ++  N+    N 
Sbjct: 34  ADKSSWRTDDE--VMAMYEAWLVKHGKAYNALGEKEKRFGIFKDNLRFIDEHNSQ---NL 88

Query: 77  SYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKS---SQVPPSVNWIEKG 133
           +Y L LN+FADLT +E+ +   G K      + K +     + +     +P  ++W ++G
Sbjct: 89  TYRLGLNRFADLTNEEYRSMYLGVKPGATRVTRKVSRKSDRFAARVGDALPDFIDWRKEG 148

Query: 134 AVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMD 186
           AV  VK QG C        +AAVEGIN I    L+SLSEQ+LVDC T+  N GC GG MD
Sbjct: 149 AVVGVKDQGSCGSCWAFSTIAAVEGINQIVTGDLISLSEQELVDCDTS-YNEGCNGGLMD 207

Query: 187 DAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVAN 246
            AF++II N GI ++  Y Y       CD  +   +   I  YEDVP NDE +L KAVA 
Sbjct: 208 YAFEFIINNGGIDSEEDYPYRAADQK-CDQYRKNANVVSIDGYEDVPENDEAALKKAVAK 266

Query: 247 QPVSVAIDAS--ALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWG 304
           QPVSVAI+A   A Q Y  GVF G C T L+HGV AVGYGT E G  YW++ NSWG++WG
Sbjct: 267 QPVSVAIEAGGRAFQLYQSGVFTGKCGTSLDHGVAAVGYGT-ENGQDYWIVGNSWGKNWG 325

Query: 305 EDGYFRLQRDI-DQPQGQCGIAMFASFPV 332
           EDGY R++R++     G+CGIA+  S+P+
Sbjct: 326 EDGYIRMERNLAGSSSGKCGIAIGPSYPI 354


>gi|57118005|gb|AAW34134.1| cysteine protease gp2a [Zingiber officinale]
          Length = 381

 Score =  254 bits (648), Expect = 5e-65,   Method: Compositional matrix adjust.
 Identities = 140/312 (44%), Positives = 189/312 (60%), Gaps = 16/312 (5%)

Query: 34  FEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAA-IGNRSYTLRLNKFADLTPQE 92
           + +W+ +     K    N  R E+FK+NL  V+  N AA  G  ++ L +N+FADLT +E
Sbjct: 53  YLEWRVKNHPAEKYLDLNEYRLEVFKENLQFVDEHNAAADRGEHTFLLGMNRFADLTNEE 112

Query: 93  FIAS-QTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQCA------ 145
           +       F     S+S K +    L +   +P S++W E GAV PVK QG C       
Sbjct: 113 YRTRFLRDFSRLRRSASGKISSRYRLREGDDLPDSIDWRENGAVVPVKNQGGCGSCWAFS 172

Query: 146 -VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVY 204
            VAAVEGIN I    L+SLSEQQLVDC T   N+GC GG+M+ AF++I+ N GI ++  Y
Sbjct: 173 TVAAVEGINQIVTGDLISLSEQQLVDCTTA--NHGCRGGWMNPAFQFIVNNGGINSEETY 230

Query: 205 SYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA--LQFYS 262
            Y G + GIC+S         I +YE+VP ++E+SL KAVANQPVSV +DA+    Q Y 
Sbjct: 231 PYRGQN-GICNST-VNAPVVSIDSYENVPSHNEQSLQKAVANQPVSVTMDAAGRDFQLYR 288

Query: 263 GGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQC 322
            G+F G C    NH +T VGYGT E    +W++KNSWG++WGE GY R +R+I+ P G+C
Sbjct: 289 SGIFTGSCNISANHALTVVGYGT-ENDKDFWIVKNSWGKNWGESGYIRAERNIENPNGKC 347

Query: 323 GIAMFASFPVSK 334
           GI  FAS+PV K
Sbjct: 348 GITRFASYPVKK 359


>gi|57118007|gb|AAW34135.1| cysteine protease gp2b [Zingiber officinale]
          Length = 379

 Score =  254 bits (648), Expect = 6e-65,   Method: Compositional matrix adjust.
 Identities = 142/312 (45%), Positives = 190/312 (60%), Gaps = 16/312 (5%)

Query: 34  FEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAA-IGNRSYTLRLNKFADLTPQE 92
           + +W+A+     K    N  R E+FK+NL  V++ N AA  G  ++ L +N+FADLT +E
Sbjct: 51  YLEWRAKNHPAEKYLDLNEYRLEVFKENLQFVDKHNAAADRGEHTFRLGMNRFADLTNEE 110

Query: 93  FIAS-QTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQCA------ 145
           +       F     S+S K +    L +   +P S++W EKGAV PVK QG C       
Sbjct: 111 YRTRFLRDFSRLRRSASGKISSRYRLREGDDLPDSIDWREKGAVVPVKNQGGCGSCWAFS 170

Query: 146 -VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVY 204
            VAAVEGIN I    L+SLSEQQLVDC T   N+GC GG+M+ AF++I+ N GI ++  Y
Sbjct: 171 TVAAVEGINQIVTGDLISLSEQQLVDCTTA--NHGCRGGWMNPAFQFIVNNGGINSEETY 228

Query: 205 SYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA--LQFYS 262
            Y G + GIC+S         I +YE+VP ++E+SL KAVANQPVSV +DA+    Q Y 
Sbjct: 229 PYRGQN-GICNST-VNAPVVSIDSYENVPSHNEQSLQKAVANQPVSVTMDAAGRDFQLYR 286

Query: 263 GGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQC 322
            G+F G C    NH +T VGYGT E    Y  +KNSWG++WGE GY R++R+I  P G+C
Sbjct: 287 SGIFTGSCNISANHALTVVGYGT-ENDKDYRTVKNSWGKNWGESGYIRVERNIGNPNGKC 345

Query: 323 GIAMFASFPVSK 334
           GI  FAS+PV K
Sbjct: 346 GITRFASYPVKK 357


>gi|90399361|emb|CAJ86180.1| H0212B02.7 [Oryza sativa Indica Group]
          Length = 470

 Score =  253 bits (647), Expect = 7e-65,   Method: Compositional matrix adjust.
 Identities = 144/323 (44%), Positives = 188/323 (58%), Gaps = 26/323 (8%)

Query: 34  FEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAA-IGNRSYTLRLNKFADLTPQE 92
           + +WKA++G+ Y    E  +R+  F+DNL  ++  N AA  G  S+ L LN+FADLT +E
Sbjct: 40  YAEWKAEHGKNYNAVGEEERRYAAFRDNLRYIDEHNAAADAGVHSFRLGLNRFADLTNEE 99

Query: 93  FIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQC-------A 145
           +  +  G +        K +       +  +P SV+W  KGAV  +K QG C       A
Sbjct: 100 YRDTYLGLRNKPRRER-KVSDRYLAADNEALPESVDWRTKGAVAEIKDQGGCGSCWAFSA 158

Query: 146 VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYS 205
           +AAVEGIN I    L+SLSEQ+LVDC T+  N GC GG MD AF +II N GI  +  Y 
Sbjct: 159 IAAVEGINQIVTGDLISLSEQELVDCDTS-YNEGCNGGLMDYAFDFIINNGGIDTEDDYP 217

Query: 206 YEGMSTGICDS------------IKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAI 253
           Y+G     CD              +       I +YEDV PN E SL KAVANQPVSVAI
Sbjct: 218 YKGKDE-RCDVNRVSFVFFAPLVFQKNAKVVTIDSYEDVTPNSETSLQKAVANQPVSVAI 276

Query: 254 DAS--ALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRL 311
           +A   A Q YS G+F G C T L+HGV AVGYGT E G  YW+++NSWG+ WGE GY R+
Sbjct: 277 EAGGRAFQLYSSGIFTGKCGTALDHGVAAVGYGT-ENGKDYWIVRNSWGKSWGESGYVRM 335

Query: 312 QRDIDQPQGQCGIAMFASFPVSK 334
           +R+I    G+CGIA+  S+P+ K
Sbjct: 336 ERNIKASSGKCGIAVEPSYPLKK 358


>gi|357465603|ref|XP_003603086.1| Cysteine proteinase [Medicago truncatula]
 gi|355492134|gb|AES73337.1| Cysteine proteinase [Medicago truncatula]
          Length = 474

 Score =  253 bits (646), Expect = 9e-65,   Method: Compositional matrix adjust.
 Identities = 143/317 (45%), Positives = 192/317 (60%), Gaps = 22/317 (6%)

Query: 34  FEQWKAQYGRTYK--ESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQ 91
           +E+W+ ++G+     + +E  KRFEIFKDNL  ++  N     NR+Y + LN+FADL+ +
Sbjct: 53  YEEWRVKHGKLNNNIDGSEKDKRFEIFKDNLKFIDEHNAE---NRTYKVGLNRFADLSNE 109

Query: 92  EFIASQTGFKMSDHSSSLKANGTPF-LYKSS---QVPPSVNWIEKGAVTPVKYQGQCA-- 145
           E+ +   G K+      +    T    Y  S   ++P SV+W  +GAV  VK QG C   
Sbjct: 110 EYRSRYLGTKIDPIGMMMARTKTRSNRYAPSVGDKLPKSVDWRSQGAVVQVKDQGSCGSC 169

Query: 146 -----VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITN 200
                +AAVEGIN I    LVSLSEQ+LVDC     N GC GG M+ AF++II N GI +
Sbjct: 170 WAFSTIAAVEGINKIVTGELVSLSEQELVDC-DRTVNAGCDGGLMEYAFEFIINNGGIDS 228

Query: 201 DAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA--L 258
           D  Y Y G+  G CD  K       I +YE VP  DE +L KAVANQP+SVAI+A     
Sbjct: 229 DEDYPYRGVD-GKCDQYKKNARVVSIDDYEQVPAYDELALKKAVANQPISVAIEAGGREF 287

Query: 259 QFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQP 318
           Q Y  G+F G C T L+HGVTAVGYGT E G+ YW+++NSWG+ WGE GY R++R++   
Sbjct: 288 QLYVSGIFTGKCGTALDHGVTAVGYGT-ENGVDYWIVRNSWGKSWGESGYVRMERNLAAS 346

Query: 319 -QGQCGIAMFASFPVSK 334
             G+CGI M +S+P+ K
Sbjct: 347 VAGKCGIVMQSSYPIKK 363


>gi|159485468|ref|XP_001700766.1| cysteine endopeptidase [Chlamydomonas reinhardtii]
 gi|158281265|gb|EDP07020.1| cysteine endopeptidase [Chlamydomonas reinhardtii]
          Length = 498

 Score =  253 bits (646), Expect = 9e-65,   Method: Compositional matrix adjust.
 Identities = 161/356 (45%), Positives = 206/356 (57%), Gaps = 34/356 (9%)

Query: 2   AKYFLIVVLIISGSCASQATYRTFDEGSIAE-----KFEQWKAQYGRTYKE-SAENSKRF 55
           AK+  + +  + G   + A   + D  ++A+      F  W  Q+ RTY E S E ++R 
Sbjct: 3   AKFLALALAGLVGLSCAHALLSSADMLALAQVEPERAFGLWATQHARTYSEGSPEYTRRL 62

Query: 56  EIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKAN--- 112
            +F DN+ A+   N     N   TL LN++AD T +EF A + G K+S     LKA    
Sbjct: 63  GVFADNVRAIAEQNRR---NTGITLALNEYADETWEEFAAKRLGLKISQEQ--LKAREAR 117

Query: 113 -----GTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRL 160
                 + + Y   Q P +V+W  K AVT VK QGQC       AV ++EG NA+   +L
Sbjct: 118 SSSSSSSSWRYAQVQTPAAVDWRAKNAVTQVKNQGQCGSCWAFSAVGSIEGANALATGQL 177

Query: 161 VSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSY-EGMSTGI-CDSIK 218
           V+LSEQQLVDC T  +N GC GG MDDAFKY++ N GI  +  YSY  G   G  C+  K
Sbjct: 178 VALSEQQLVDCDTA-SNMGCSGGLMDDAFKYVLDNGGIDTEEDYSYWSGYGFGFWCNKRK 236

Query: 219 AEDH-AAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA-LQFYSGGVFNGYCETFLNH 276
             D  A  I  YEDVP   E +LLKAVA QPV+VAI ASA +QFYS GV N  CE  LNH
Sbjct: 237 QTDRPAVSIDGYEDVP-TSEPALLKAVAGQPVAVAICASANMQFYSSGVINSCCEG-LNH 294

Query: 277 GVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
           GV AVGY TS++   YW++KNSWG  WGE GYFRL+   + P+G CGIA  AS+ V
Sbjct: 295 GVLAVGYDTSDKAQPYWIVKNSWGGSWGEQGYFRLKMG-EGPKGLCGIASAASYAV 349


>gi|2463588|dbj|BAA22546.1| FB1035 precursor [Ananas comosus]
          Length = 324

 Score =  253 bits (646), Expect = 1e-64,   Method: Compositional matrix adjust.
 Identities = 138/320 (43%), Positives = 194/320 (60%), Gaps = 24/320 (7%)

Query: 30  IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLT 89
           + ++FE+W A+YGR YK++ E  +RF+IFK+N+  +E FN+   GN SYTL +N+F D+T
Sbjct: 6   MMKRFEEWMAEYGRIYKDNDEKMRRFQIFKNNVKHIETFNSRN-GN-SYTLGINQFTDMT 63

Query: 90  PQEFIASQTGFKMSDHSSSLKANGTPFL----YKSSQVPPSVNWIEKGAVTPVKYQGQC- 144
             EF+A  TG  +      L     P +       S VP S++W + GAV  VK Q  C 
Sbjct: 64  KSEFVAQYTGVSLP-----LNIEREPVVSFDDVNISAVPQSIDWRDYGAVNEVKNQNPCG 118

Query: 145 ------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGI 198
                 A+A VEGI  IK   LVSLSEQ+++DCA +    GC GG+++ A+ +II N G+
Sbjct: 119 SCWAFAAIATVEGIYKIKTGYLVSLSEQEVLDCAVS---YGCKGGWVNKAYDFIISNNGV 175

Query: 199 TNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA- 257
           T +  Y Y+    G C++  +  ++A IT Y  V  NDE S++ AV+NQP++  IDAS  
Sbjct: 176 TTEENYPYQAYQ-GTCNA-NSFPNSAYITGYSYVRRNDERSMMYAVSNQPIAALIDASEN 233

Query: 258 LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQ 317
            Q+Y+GGVF+G C T LNH +T +GYG    G KYW+++NSWG  WGE GY R+ R +  
Sbjct: 234 FQYYNGGVFSGPCGTSLNHAITIIGYGQDSSGTKYWIVRNSWGSSWGEGGYVRMARGVSS 293

Query: 318 PQGQCGIAMFASFPVSKESA 337
             G CGIAM   FP  +  A
Sbjct: 294 SSGACGIAMSPLFPTLQSGA 313


>gi|146215982|gb|ABQ10193.1| actinidin Act2b [Actinidia eriantha]
          Length = 378

 Score =  253 bits (645), Expect = 1e-64,   Method: Compositional matrix adjust.
 Identities = 145/343 (42%), Positives = 199/343 (58%), Gaps = 22/343 (6%)

Query: 1   MAKYFLIVVLIISGSC-ASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFK 59
           M+  F   +LI+S +     +  RT D+  +   +E W  + G++Y    E   RFEIFK
Sbjct: 10  MSLLFFSTLLILSLALDIENSVQRTNDQ--VMAMYESWLVEQGKSYNSLDEKEMRFEIFK 67

Query: 60  DNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYK 119
           +NL  ++  N  A  NRSY+L LN+FADLT +E+ ++  G KM   +         ++ K
Sbjct: 68  ENLRIIDDHN--ADANRSYSLGLNRFADLTDEEYRSTYLGLKMGPKTDV----SNEYMPK 121

Query: 120 SSQ-VPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDC 171
             + +P  V+W   GAV  VK QG C       AV AVEGIN I    L+SLSEQ+LVDC
Sbjct: 122 VGEALPDYVDWRTVGAVVGVKNQGLCSSCWAFSAVTAVEGINKIVTGNLISLSEQELVDC 181

Query: 172 ATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYED 231
                  GC  G M DAF++II N GI  +  Y Y     G C+          I NY++
Sbjct: 182 GRTQRTKGCNRGLMTDAFQFIINNGGINTEDNYPYTA-KDGQCNLSLKNQKYVTIDNYKN 240

Query: 232 VPPNDEESLLKAVANQPVSVAIDASALQF--YSGGVFNGYCETFLNHGVTAVGYGTSEEG 289
           VP N+E +L KAVA QPVSV +++   +F  Y+ G+F G+C T ++HGVT VGYGT E G
Sbjct: 241 VPSNNEMALKKAVAYQPVSVGVESEGGKFKLYTSGIFTGFCGTAVDHGVTIVGYGT-ERG 299

Query: 290 IKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
           + YW++KNSWG +WGE+GY R+QR+I    G+CGIA   S+PV
Sbjct: 300 MDYWIVKNSWGTNWGENGYIRIQRNIGG-AGKCGIARMPSYPV 341


>gi|449500383|ref|XP_004161083.1| PREDICTED: vignain-like [Cucumis sativus]
          Length = 345

 Score =  253 bits (645), Expect = 1e-64,   Method: Compositional matrix adjust.
 Identities = 153/327 (46%), Positives = 207/327 (63%), Gaps = 28/327 (8%)

Query: 25  FDEGSIAEKFEQWK--AQYGRTY---KESAENSKRFEIFKDNLVAVERFNNAAIGNRSYT 79
           FDE  +A +   W+   ++G+ +   +   E  KRF +FK+N+  V   N     ++ Y 
Sbjct: 26  FDEKELATEESLWQLYERWGKHHTISRNLKEKHKRFSVFKENVNHVFTVNQM---DKPYK 82

Query: 80  LRLNKFADLTPQEFI----ASQTGFKMSDHSSSLKANGTPFLY-KSSQVPPSVNWIEKGA 134
           L+LNKFAD++  EF+     S        H     A G  F+Y + + +P SV+  E+GA
Sbjct: 83  LKLNKFADMSNYEFVNFYARSNISHYRKLHERRRGAGG--FMYEQDTDLPSSVDGRERGA 140

Query: 135 VTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDD 187
           V  VK QG+C       +VAAVEGIN IK N+L+SLSEQ+L+DC  N  N GC GGFM+ 
Sbjct: 141 VNAVKEQGRCGSCWAFSSVAAVEGINKIKTNQLLSLSEQELLDC--NYRNKGCNGGFMEI 198

Query: 188 AFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQ 247
           AF +I +N GI  +  Y Y G S G+C S +      +I  YE VP N E++L++AVANQ
Sbjct: 199 AFDFIKRNGGIATENSYPYHG-SRGLCRSSRISSPIVKIDGYESVPEN-EDALMQAVANQ 256

Query: 248 PVSVAIDASA--LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGE 305
           PVSVAIDA+    QFYS GVF+GYC T LNHGV A+GYGT+E+G  YWL++NSWG  WGE
Sbjct: 257 PVSVAIDAAGRDFQFYSQGVFDGYCGTELNHGVVAIGYGTTEDGTDYWLVRNSWGVGWGE 316

Query: 306 DGYFRLQRDIDQPQGQCGIAMFASFPV 332
           DGY R++R ++Q +G CGIAM AS+P+
Sbjct: 317 DGYVRMKRGVEQAEGLCGIAMEASYPI 343


>gi|47169030|pdb|1S4V|A Chain A, The 2.0 A Crystal Structure Of The Kdel-Tailed Cysteine
           Endopeptidase Functioning In Programmed Cell Death Of
           Ricinus Communis Endosperm
 gi|47169031|pdb|1S4V|B Chain B, The 2.0 A Crystal Structure Of The Kdel-Tailed Cysteine
           Endopeptidase Functioning In Programmed Cell Death Of
           Ricinus Communis Endosperm
          Length = 229

 Score =  253 bits (645), Expect = 1e-64,   Method: Compositional matrix adjust.
 Identities = 134/227 (59%), Positives = 155/227 (68%), Gaps = 11/227 (4%)

Query: 123 VPPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATND 175
           VP SV+W +KGAVT VK QGQC        + AVEGIN IK N+LVSLSEQ+LVDC T D
Sbjct: 2   VPASVDWRKKGAVTSVKDQGQCGSCWAFSTIVAVEGINQIKTNKLVSLSEQELVDCDT-D 60

Query: 176 NNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPN 235
            N GC GG MD AF++I Q  GIT +A Y YE    G CD  K    A  I  +E+VP N
Sbjct: 61  QNQGCNGGLMDYAFEFIKQRGGITTEANYPYEAYD-GTCDVSKENAPAVSIDGHENVPEN 119

Query: 236 DEESLLKAVANQPVSVAIDA--SALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYW 293
           DE +LLKAVANQPVSVAIDA  S  QFYS GVF G C T L+HGV  VGYGT+ +G KYW
Sbjct: 120 DENALLKAVANQPVSVAIDAGGSDFQFYSEGVFTGSCGTELDHGVAIVGYGTTIDGTKYW 179

Query: 294 LIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVSKESAQPS 340
            +KNSWG +WGE GY R++R I   +G CGIAM AS+P+ K S  PS
Sbjct: 180 TVKNSWGPEWGEKGYIRMERGISDKEGLCGIAMEASYPIKKSSNNPS 226


>gi|115448287|ref|NP_001047923.1| Os02g0715000 [Oryza sativa Japonica Group]
 gi|42408029|dbj|BAD09165.1| putative cysteine proteinase [Oryza sativa Japonica Group]
 gi|113537454|dbj|BAF09837.1| Os02g0715000 [Oryza sativa Japonica Group]
 gi|215737450|dbj|BAG96580.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|215765786|dbj|BAG87483.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|222623551|gb|EEE57683.1| hypothetical protein OsJ_08138 [Oryza sativa Japonica Group]
          Length = 366

 Score =  253 bits (645), Expect = 1e-64,   Method: Compositional matrix adjust.
 Identities = 145/310 (46%), Positives = 185/310 (59%), Gaps = 17/310 (5%)

Query: 34  FEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEF 93
           F  W  ++ + Y    E  KR+EIFK NL  +   N     N SY L LN FAD+  +EF
Sbjct: 55  FTSWSVKHSKIYASPKEKVKRYEIFKRNLRHIVETNRR---NGSYWLGLNHFADIAHEEF 111

Query: 94  IASQTGFKMSDHSSSLKANG-TPFLYKSS-QVPPSVNWIEKGAVTPVKYQGQCA------ 145
            AS  G K        + +G T F Y ++  +P +V+W +KGAVTPVK QG+C       
Sbjct: 112 KASYLGLKPGLARRDAQPHGSTTFRYANAVNLPWAVDWRKKGAVTPVKNQGECGSCWAFS 171

Query: 146 -VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVY 204
            VAAVEGIN I   +LVSLSEQ+L+DC  N  N+GC GG MD AF YI+ N+GI  +  Y
Sbjct: 172 TVAAVEGINQIVTGKLVSLSEQELMDC-DNTFNHGCRGGLMDFAFAYIMGNQGIYTEEDY 230

Query: 205 SYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA--LQFYS 262
            Y  M  G C   +       IT YEDVP N E SLLKA+A+QPVSV I A +   QFY 
Sbjct: 231 PYL-MEEGYCREKQPHSKVITITGYEDVPANSETSLLKALAHQPVSVGIAAGSRDFQFYK 289

Query: 263 GGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQC 322
           GG+F+G C    +H +TAVGYG S  G  Y ++KNSWG++WGE GYFR++R   +P+G C
Sbjct: 290 GGIFDGECGIQPDHALTAVGYG-SYYGQDYIIMKNSWGKNWGEQGYFRIRRGTGKPEGVC 348

Query: 323 GIAMFASFPV 332
            I   AS+P 
Sbjct: 349 DIYKIASYPT 358


>gi|357143305|ref|XP_003572875.1| PREDICTED: xylem cysteine proteinase 1-like [Brachypodium
           distachyon]
          Length = 473

 Score =  253 bits (645), Expect = 1e-64,   Method: Compositional matrix adjust.
 Identities = 143/313 (45%), Positives = 188/313 (60%), Gaps = 18/313 (5%)

Query: 30  IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLT 89
           + + F  W  ++ + Y    E  KR+E+FK NL  +   N     N SY L LN+FAD+ 
Sbjct: 44  LVDLFSSWSVKHSKIYVSPEEKVKRYEVFKQNLKHIVETNRR---NGSYWLGLNQFADVA 100

Query: 90  PQEFIASQTGFKMSDHSSSLKANGTPFLYKSS-QVPPSVNWIEKGAVTPVKYQGQCA--- 145
            +EF ++  G K      +     T F Y++S  +P SV+W +KGAVTPVK QG+C    
Sbjct: 101 HEEFKSTYLGLKTGMDGPARAP--TAFRYENSVNLPWSVDWRKKGAVTPVKNQGECGSCW 158

Query: 146 ----VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITND 201
               VAAVEGIN I   +L SLSEQ+L+DC T   ++GC GGFMD AF YI+ N GI  D
Sbjct: 159 AFSTVAAVEGINQIATGKLESLSEQELMDCDTT-FDHGCGGGFMDFAFAYIMGNLGIHTD 217

Query: 202 AVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA--LQ 259
             Y Y  M  G C   + +     I+ YEDVP N E SLLKA+A+QP+SV I A +   Q
Sbjct: 218 DDYPYL-MEEGYCKEKQPQSKVVTISGYEDVPENSEVSLLKALAHQPISVGIAAGSKDFQ 276

Query: 260 FYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQ 319
           FY  GVF G C T L+H +TAVGYG+S+ G  Y ++KNSWG+ WGE GYFR++R   +P+
Sbjct: 277 FYKRGVFEGSCGTELDHALTAVGYGSSD-GQDYIIMKNSWGKSWGEQGYFRIKRGTGKPE 335

Query: 320 GQCGIAMFASFPV 332
           G C I   AS+P 
Sbjct: 336 GVCSIYSMASYPT 348


>gi|75277440|sp|O23791.1|BROM1_ANACO RecName: Full=Fruit bromelain; AltName: Allergen=Ana c 2; Flags:
           Precursor
 gi|2342496|dbj|BAA21849.1| bromelain [Ananas comosus]
          Length = 351

 Score =  252 bits (644), Expect = 1e-64,   Method: Compositional matrix adjust.
 Identities = 142/345 (41%), Positives = 199/345 (57%), Gaps = 25/345 (7%)

Query: 5   FLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVA 64
           FL + L    +  S A+ R      + ++FE+W A+YGR YK+  E  +RF+IFK+N+  
Sbjct: 9   FLFLFLCAMWASPSAAS-RDEPNDPMMKRFEEWMAEYGRVYKDDDEKMRRFQIFKNNVKH 67

Query: 65  VERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFL----YKS 120
           +E FN+      SYTL +N+F D+T  EF+A  TG  +      L     P +       
Sbjct: 68  IETFNSR--NENSYTLGINQFTDMTKSEFVAQYTGVSLP-----LNIEREPVVSFDDVNI 120

Query: 121 SQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCAT 173
           S VP S++W + GAV  VK Q  C       A+A VEGI  IK   LVSLSEQ+++DCA 
Sbjct: 121 SAVPQSIDWRDYGAVNEVKNQNPCGSCWSFAAIATVEGIYKIKTGYLVSLSEQEVLDCAV 180

Query: 174 NDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVP 233
           +    GC GG+++ A+ +II N G+T +  Y Y     G C++  +  ++A IT Y  V 
Sbjct: 181 S---YGCKGGWVNKAYDFIISNNGVTTEENYPYLAYQ-GTCNA-NSFPNSAYITGYSYVR 235

Query: 234 PNDEESLLKAVANQPVSVAIDASA-LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKY 292
            NDE S++ AV+NQP++  IDAS   Q+Y+GGVF+G C T LNH +T +GYG    G KY
Sbjct: 236 RNDERSMMYAVSNQPIAALIDASENFQYYNGGVFSGPCGTSLNHAITIIGYGQDSSGTKY 295

Query: 293 WLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVSKESA 337
           W+++NSWG  WGE GY R+ R +    G CGIAM   FP  +  A
Sbjct: 296 WIVRNSWGSSWGEGGYVRMARGVSSSSGVCGIAMAPLFPTLQSGA 340


>gi|125540888|gb|EAY87283.1| hypothetical protein OsI_08685 [Oryza sativa Indica Group]
          Length = 357

 Score =  252 bits (644), Expect = 2e-64,   Method: Compositional matrix adjust.
 Identities = 145/310 (46%), Positives = 185/310 (59%), Gaps = 17/310 (5%)

Query: 34  FEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEF 93
           F  W  ++ + Y    E  KR+EIFK NL  +   N     N SY L LN FAD+  +EF
Sbjct: 46  FTSWSVKHSKIYASPKEKVKRYEIFKRNLRHIVETNRR---NGSYWLGLNHFADIAHEEF 102

Query: 94  IASQTGFKMSDHSSSLKANG-TPFLYKSS-QVPPSVNWIEKGAVTPVKYQGQCA------ 145
            AS  G K        + +G T F Y ++  +P +V+W +KGAVTPVK QG+C       
Sbjct: 103 KASYLGLKPGLARRDAQPHGSTTFRYANAVNLPWAVDWRKKGAVTPVKNQGECGSCWAFS 162

Query: 146 -VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVY 204
            VAAVEGIN I   +LVSLSEQ+L+DC  N  N+GC GG MD AF YI+ N+GI  +  Y
Sbjct: 163 TVAAVEGINQIVTGKLVSLSEQELMDC-DNTFNHGCRGGLMDFAFAYIMGNQGIYTEEDY 221

Query: 205 SYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA--LQFYS 262
            Y  M  G C   +       IT YEDVP N E SLLKA+A+QPVSV I A +   QFY 
Sbjct: 222 PYL-MEEGYCREKQPHSKVITITGYEDVPENSETSLLKALAHQPVSVGIAAGSRDFQFYK 280

Query: 263 GGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQC 322
           GG+F+G C    +H +TAVGYG S  G  Y ++KNSWG++WGE GYFR++R   +P+G C
Sbjct: 281 GGIFDGECGIQPDHALTAVGYG-SYYGQDYIIMKNSWGKNWGEQGYFRIRRGTGKPEGVC 339

Query: 323 GIAMFASFPV 332
            I   AS+P 
Sbjct: 340 DIYKIASYPT 349


>gi|356545079|ref|XP_003540973.1| PREDICTED: LOW QUALITY PROTEIN: vignain-like [Glycine max]
          Length = 330

 Score =  252 bits (643), Expect = 2e-64,   Method: Compositional matrix adjust.
 Identities = 143/299 (47%), Positives = 185/299 (61%), Gaps = 25/299 (8%)

Query: 17  ASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNR 76
           ASQ T RT  + S+ E+ E+W ++YG+ YK+  E  KRF IFK+N+  +E  NN AI  +
Sbjct: 5   ASQVTCRTLQDASMYERHEEWMSRYGKVYKDPREREKRFRIFKENMNYIETSNNVAI--K 62

Query: 77  SYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGT---PFLYKSSQVPPSVNWIEKG 133
              L +N+FADL  +EFIA +  FK       L    T   P+++   +         KG
Sbjct: 63  PXKLVINQFADLNNEEFIAPRNIFKGMILCRFLSRKHTFPFPYVFLGHK---------KG 113

Query: 134 AVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMD 186
           AVTPVK QG C        VA+ EGI A+   +L+SLSEQ+LVDC T   + GC  G MD
Sbjct: 114 AVTPVKDQGHCGFCWAFYDVASTEGILALTAGKLISLSEQELVDCDTKGVDQGCECGLMD 173

Query: 187 DAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVAN 246
           DAFK+IIQN G+  DA Y Y+G+  G C++ +  + AA IT  EDVP N+E++L K VAN
Sbjct: 174 DAFKFIIQNHGVX-DANYPYKGVD-GKCNANEEANPAATITGXEDVPANNEKALQKVVAN 231

Query: 247 QPVSVAIDA--SALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDW 303
           QPV VAIDA  S  QFY  GVF G CET LNHGVT +GYG S +G +YWL+KNS   +W
Sbjct: 232 QPVFVAIDACDSDFQFYKSGVFTGSCETELNHGVTTMGYGVSHDGTQYWLVKNSXETEW 290


>gi|226504984|ref|NP_001151293.1| cysteine protease 1 precursor [Zea mays]
 gi|195645596|gb|ACG42266.1| cysteine protease 1 precursor [Zea mays]
          Length = 340

 Score =  252 bits (643), Expect = 2e-64,   Method: Compositional matrix adjust.
 Identities = 150/346 (43%), Positives = 210/346 (60%), Gaps = 23/346 (6%)

Query: 2   AKYFLIVVLIISGSC---ASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIF 58
           A  FL+ VL++  +    A+          ++A + E+W A++GR YK+ AE ++R E+F
Sbjct: 3   ASRFLLAVLVVGSAVLCTAAAPRALAAAAAAMASRHEKWMAEHGRAYKDEAEKARRLEVF 62

Query: 59  KDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLY 118
           + N   ++ FN  A G  S+ L  N+FADLT QEF A++TG +     S   A    F Y
Sbjct: 63  RANAELIDSFN--AAGTHSHRLATNRFADLTVQEFRAARTGLRPRPAPS---AGAGRFRY 117

Query: 119 KS---SQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQL 168
           ++   +    SV+W   GAVT VK QG         AVAAVEG+N I+  RLVSLSEQ+L
Sbjct: 118 ENFSLADAAQSVDWRAMGAVTGVKDQGASGCCWAFSAVAAVEGLNKIRTGRLVSLSEQEL 177

Query: 169 VDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITN 228
           VDC  +  + GC GG MD+AF+++ +  G+ +++ Y Y+    G C S  A   AA I  
Sbjct: 178 VDCDVSGVDQGCDGGLMDNAFQFVARRGGLASESGYPYQ-CRDGPCRSSAAAA-AASIRG 235

Query: 229 YEDVPPNDEESLLKAVANQPVSVAIDAS--ALQFYSGGVFNGYCETFLNHGVTAVGYGTS 286
           +EDVP N+E +L  AVA+QPVSVAI+    A +FY  GV  G C T LNH +TAVGYGT+
Sbjct: 236 HEDVPRNNEAALAAAVAHQPVSVAINGEDMAFRFYDSGVLGGACGTDLNHAITAVGYGTA 295

Query: 287 EEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
            +G +YWL+KNSWG  WGE GY R++R + + +G CG+A   S+PV
Sbjct: 296 ADGTRYWLMKNSWGASWGEGGYVRIRRGV-RGEGVCGLAKLPSYPV 340


>gi|18390634|ref|NP_563764.1| cysteine proteinase-like protein [Arabidopsis thaliana]
 gi|8844131|gb|AAF80223.1|AC025290_12 Contains similarity to a cysteine endopeptidase 1 from Phaseolus
           vulgaris gb|U52970 and is a member of the papain
           cysteine protease family PF|00112 [Arabidopsis thaliana]
 gi|332189848|gb|AEE27969.1| cysteine proteinase-like protein [Arabidopsis thaliana]
          Length = 343

 Score =  251 bits (642), Expect = 2e-64,   Method: Compositional matrix adjust.
 Identities = 142/338 (42%), Positives = 193/338 (57%), Gaps = 20/338 (5%)

Query: 5   FLIVVLIISGSCASQATYRTFD-EGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLV 63
            +  VLI S  C+  ++   +D   ++ ++FE+W   + + Y    E   RF I++ N+ 
Sbjct: 15  LICFVLIASKLCSVDSS--VYDPHKTLKQRFEKWLKTHSKLYGGRDEWMLRFGIYQSNVQ 72

Query: 64  AVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQV 123
            ++  N+  +    + L  N+FAD+T  EF A   G   S  S  L     P    +  V
Sbjct: 73  LIDYINSLHL---PFKLTDNRFADMTNSEFKAHFLGLNTS--SLRLHKKQRPVCDPAGNV 127

Query: 124 PPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDN 176
           P +V+W  +GAVTP++ QG+C       AVAA+EGIN IK   LVSLSEQQL+DC     
Sbjct: 128 PDAVDWRTQGAVTPIRNQGKCGGCWAFSAVAAIEGINKIKTGNLVSLSEQQLIDCDVGTY 187

Query: 177 NNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPND 236
           N GC GG M+ AF++I  N G+  +  Y Y G+  G CD  K+++    I  Y+ V  N 
Sbjct: 188 NKGCSGGLMETAFEFIKTNGGLATETDYPYTGIE-GTCDQEKSKNKVVTIQGYQKVAQN- 245

Query: 237 EESLLKAVANQPVSVAIDASA--LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWL 294
           E SL  A A QPVSV IDA     Q YS GVF  YC T LNHGVT VGYG  E   KYW+
Sbjct: 246 EASLQIAAAQQPVSVGIDAGGFIFQLYSSGVFTNYCGTNLNHGVTVVGYGV-EGDQKYWI 304

Query: 295 IKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
           +KNSWG  WGE+GY R++R + +  G+CGIAM AS+P+
Sbjct: 305 VKNSWGTGWGEEGYIRMERGVSEDTGKCGIAMMASYPL 342


>gi|3377948|emb|CAA08860.1| cysteine proteinase precursor, AN8 [Ananas comosus]
          Length = 356

 Score =  251 bits (642), Expect = 3e-64,   Method: Compositional matrix adjust.
 Identities = 147/347 (42%), Positives = 205/347 (59%), Gaps = 28/347 (8%)

Query: 5   FLIVVLIISGSCASQATYRTFDEGS--IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNL 62
           FL + L +  +  S A+    DE S  + ++FE+W  +YGR YK++ E  +RF+IFK+N+
Sbjct: 9   FLFLFLCVMWASPSAASA---DEPSDPMMKRFEEWMVEYGRVYKDNDEKMRRFQIFKNNV 65

Query: 63  VAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFL----Y 118
             +E FN+      SYTL +N+F D+T  EF+A  TG      S  L     P +     
Sbjct: 66  NHIETFNSR--NKDSYTLGINQFTDMTNNEFVAQYTG----GISRPLNIEREPVVSFDDV 119

Query: 119 KSSQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDC 171
             S VP S++W + GAVT VK Q  C       A+A VE I  IK   L  LSEQQ++DC
Sbjct: 120 DISAVPQSIDWRDYGAVTSVKNQNPCGACWAFAAIATVESIYKIKKGILEPLSEQQVLDC 179

Query: 172 ATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYED 231
           A      GC GG+   AF++II NKG+ + A+Y Y+  + G C +     ++A IT Y  
Sbjct: 180 A---KGYGCKGGWEFRAFEFIISNKGVASVAIYPYKA-AKGTCKT-NGVPNSAYITGYAR 234

Query: 232 VPPNDEESLLKAVANQPVSVAIDASA-LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGI 290
           VP N+E S++ AV+ QP++VA+DA+A  Q+Y+ GVFNG C T LNH VTA+GYG    G 
Sbjct: 235 VPRNNESSMMYAVSKQPITVAVDANANSQYYNSGVFNGPCGTSLNHAVTAIGYGQDSNGK 294

Query: 291 KYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVSKESA 337
           KYW++KNSWG  WGE GY R+ RD+    G CGIA+ + +P  +  A
Sbjct: 295 KYWIVKNSWGARWGEAGYIRMARDVSSSSGICGIAIDSLYPTLESRA 341


>gi|13432122|sp|P80884.2|ANAN_ANACO RecName: Full=Ananain; Flags: Precursor
 gi|2623956|emb|CAA05487.1| Ananain precursor [Ananas comosus]
          Length = 345

 Score =  251 bits (641), Expect = 3e-64,   Method: Compositional matrix adjust.
 Identities = 144/341 (42%), Positives = 203/341 (59%), Gaps = 29/341 (8%)

Query: 5   FLIVVLIISGSCASQATYRTFDEGS--IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNL 62
           FL + L +  +  S A+    DE S  + ++FE+W A+YGR YK++ E   RF+IFK+N+
Sbjct: 9   FLFLFLCVMWASPSAAS---CDEPSDPMMKQFEEWMAEYGRVYKDNDEKMLRFQIFKNNV 65

Query: 63  VAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFL----Y 118
             +E FNN   GN SYTL +N+F D+T  EF+A  TG  +      L     P +     
Sbjct: 66  NHIETFNNRN-GN-SYTLGINQFTDMTNNEFVAQYTGLSLP-----LNIKREPVVSFDDV 118

Query: 119 KSSQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDC 171
             S VP S++W + GAVT VK QG+C       ++A VE I  IK   LVSLSEQQ++DC
Sbjct: 119 DISSVPQSIDWRDSGAVTSVKNQGRCGSCWAFASIATVESIYKIKRGNLVSLSEQQVLDC 178

Query: 172 ATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYED 231
           A +    GC GG+++ A+ +II NKG+ + A+Y Y+  + G C +     ++A IT Y  
Sbjct: 179 AVS---YGCKGGWINKAYSFIISNKGVASAAIYPYKA-AKGTCKT-NGVPNSAYITRYTY 233

Query: 232 VPPNDEESLLKAVANQPVSVAIDASA-LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGI 290
           V  N+E +++ AV+NQP++ A+DAS   Q Y  GVF G C T LNH +  +GYG    G 
Sbjct: 234 VQRNNERNMMYAVSNQPIAAALDASGNFQHYKRGVFTGPCGTRLNHAIVIIGYGQDSSGK 293

Query: 291 KYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFP 331
           K+W+++NSWG  WGE GY RL RD+    G CGIAM   +P
Sbjct: 294 KFWIVRNSWGAGWGEGGYIRLARDVSSSFGLCGIAMDPLYP 334


>gi|242094000|ref|XP_002437490.1| hypothetical protein SORBIDRAFT_10g028000 [Sorghum bicolor]
 gi|241915713|gb|EER88857.1| hypothetical protein SORBIDRAFT_10g028000 [Sorghum bicolor]
          Length = 372

 Score =  251 bits (641), Expect = 3e-64,   Method: Compositional matrix adjust.
 Identities = 142/330 (43%), Positives = 199/330 (60%), Gaps = 21/330 (6%)

Query: 23  RTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFN-NAAIGNRSYTLR 81
           R  DE  +   +E WK+++G  +   +++  R E+F+DNL  ++  N  A  G  ++ L 
Sbjct: 43  RADDE--VRRMYEAWKSEHG--HGHGSDDRLRLEVFRDNLRYIDAHNAEADAGLHTFRLG 98

Query: 82  LNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYK--SSQVPPSVNWIEKGAVTPVK 139
           L  FADLT +E+     GF+     +S   +G+ +  +     +P +++W E GAVT VK
Sbjct: 99  LTPFADLTLEEYRGRALGFRARRGGASRVGSGSSYRPRPRGGDLPDAIDWRELGAVTGVK 158

Query: 140 YQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYI 192
            Q QC       AVAA+EGIN I    LVSLSEQ+++DC T D   GC GG M +AF+++
Sbjct: 159 NQEQCGGCWAFSAVAAIEGINEIVTGNLVSLSEQEIIDCDTQDG--GCNGGEMQNAFQFV 216

Query: 193 IQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVA 252
           I N GI  +A Y Y G     CD+ +  +    I  +  V   +E +L +AVANQPVSVA
Sbjct: 217 INNGGIDTEADYPYLGTDAA-CDANRVNERVVTIDGFVSVATENETALQEAVANQPVSVA 275

Query: 253 IDASALQF--YSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFR 310
           IDAS  +F  Y+ G+FNG C T L+HGVTAVGYG SE G  YW++KNSW   WGE GY R
Sbjct: 276 IDASGRKFQHYTSGIFNGPCGTQLDHGVTAVGYG-SENGKDYWIVKNSWSSSWGEAGYIR 334

Query: 311 LQRDIDQPQGQCGIAMFASFPVSKESAQPS 340
           ++R++    G+CGIAM AS+PV K S+ P+
Sbjct: 335 IRRNVAAATGKCGIAMDASYPV-KSSSNPA 363


>gi|357437715|ref|XP_003589133.1| Cysteine proteinase [Medicago truncatula]
 gi|87240770|gb|ABD32628.1| Granulin; Peptidase C1A, papain [Medicago truncatula]
 gi|355478181|gb|AES59384.1| Cysteine proteinase [Medicago truncatula]
          Length = 474

 Score =  251 bits (641), Expect = 3e-64,   Method: Compositional matrix adjust.
 Identities = 151/356 (42%), Positives = 209/356 (58%), Gaps = 35/356 (9%)

Query: 6   LIVVLIISGSCASQA------TY-RTFDEGSIAEK--------FEQWKAQYGRTYKESAE 50
           L++VLIIS    S A      +Y +T  + S +++        +E+W  ++G++Y    E
Sbjct: 12  LMIVLIISSFTVSLALDMSIISYDKTHPDKSTSKRTNKEVLTMYEEWLVKHGKSYNGLGE 71

Query: 51  NSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLK 110
             KRFEIFKDNL  ++  N     N +Y L L +FADLT +E+ +   G K+  +    K
Sbjct: 72  KDKRFEIFKDNLKFIDEHNGL---NSTYRLGLTRFADLTNEEYRSKFLGTKIDPNRRMKK 128

Query: 111 ANGTPFLYKSSQV----PPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINR 159
             G+     + +V    P SV+W ++GAV  VK Q  C       A+AAVEGIN I    
Sbjct: 129 LGGSKSNRYAPRVGDKLPESVDWRKEGAVVGVKDQASCGSCWAFSAIAAVEGINKIVTGD 188

Query: 160 LVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKA 219
           L+SLSEQ+LVDC T+  N GC GG MD AF++II N GI ++  Y Y+ +  G CD  + 
Sbjct: 189 LISLSEQELVDCDTS-YNEGCNGGLMDYAFEFIISNGGIDSEDDYPYKAVD-GRCDQNRK 246

Query: 220 EDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA--LQFYSGGVFNGYCETFLNHG 277
                 I +YEDVP  DE +L KAVANQP++VA++      Q Y  GVF G C T L+HG
Sbjct: 247 NAKVVTIDDYEDVPAYDELALQKAVANQPIAVAVEGGGREFQLYEYGVFTGRCGTALDHG 306

Query: 278 VTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQ-GQCGIAMFASFPV 332
           V AVGYGT E G  YW+++NSWG  WGE GY RL+R++   + G+CGIA+  S+P+
Sbjct: 307 VAAVGYGT-ENGKDYWIVRNSWGGSWGEQGYIRLERNLASSRAGKCGIAIEPSYPI 361


>gi|449525012|ref|XP_004169515.1| PREDICTED: cysteine proteinase RD21a-like [Cucumis sativus]
          Length = 459

 Score =  251 bits (641), Expect = 4e-64,   Method: Compositional matrix adjust.
 Identities = 144/342 (42%), Positives = 195/342 (57%), Gaps = 18/342 (5%)

Query: 1   MAKYFLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKE-SAENSKRFEIFK 59
           MA  F + + + + S +S    RT DE  +   ++QW+A++G+ +    AE   RF IFK
Sbjct: 10  MALLFFLFIALSAASPSSIIPQRTDDE--VMALYDQWRAKHGKLHNNLGAEPENRFHIFK 67

Query: 60  DNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYK 119
           DNL  ++  N     N  Y L LN FADLT +E+ +   G K +  S   + +       
Sbjct: 68  DNLKFIDEINAQ---NLPYRLGLNVFADLTNEEYRSRYLGGKFASGSRRNRTSNRYLPRL 124

Query: 120 SSQVPPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCA 172
              +P S++W  KGAV PVK QG C        VA+VE IN I    L++LSEQ+LVDC 
Sbjct: 125 GDDLPDSIDWRAKGAVAPVKDQGSCGSCWAFSTVASVEAINQIVTGDLIALSEQELVDC- 183

Query: 173 TNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDV 232
               N GC GG MD AF++II+N G+  +  Y Y G  +  C   K       I +YEDV
Sbjct: 184 DRSYNEGCNGGLMDYAFEFIIENGGLDTEEDYPYYGFDSS-CIQYKKNAKVVAIDSYEDV 242

Query: 233 PPNDEESLLKAVANQPVSVAIDAS--ALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGI 290
           P N+E++L KAV+ Q VSVAI+    + Q Y  G+F G C T L+HGV  VGYG SE G+
Sbjct: 243 PVNNEKALQKAVSKQVVSVAIEGGGRSFQLYQSGIFTGRCGTDLDHGVNVVGYG-SEGGV 301

Query: 291 KYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
            YW+++NSWG  WGE GY ++QR+I  P G CGIAM  S+P 
Sbjct: 302 DYWIVRNSWGGSWGESGYVKMQRNIASPTGLCGIAMEPSYPT 343


>gi|2160175|gb|AAB60738.1| Strong similarity to Dianthus cysteine proteinase (gb|U17135)
           [Arabidopsis thaliana]
          Length = 416

 Score =  251 bits (641), Expect = 4e-64,   Method: Compositional matrix adjust.
 Identities = 142/319 (44%), Positives = 189/319 (59%), Gaps = 22/319 (6%)

Query: 30  IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLT 89
           I+E F+ W  ++G+TY    E  +R +IFKDN   V + N   I N +Y+L LN FADLT
Sbjct: 26  ISELFDDWCQKHGKTYGSEEERQQRIQIFKDNHDFVTQHN--LITNATYSLSLNAFADLT 83

Query: 90  PQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQC----- 144
             EF AS+ G  +S  S  + + G   L  S +VP SV+W +KGAVT VK QG C     
Sbjct: 84  HHEFKASRLGLSVSAPSVIMASKGQS-LGGSVKVPDSVDWRKKGAVTNVKDQGSCGACWS 142

Query: 145 --AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDA 202
             A  A+EGIN I    L+SLSEQ+L+DC     N GC GG MD AF+++I+N GI  + 
Sbjct: 143 FSATGAMEGINQIVTGDLISLSEQELIDC-DKSYNAGCNGGLMDYAFEFVIKNHGIDTEK 201

Query: 203 VYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDAS--ALQF 260
            Y Y+    G C   K +     I +Y  V  NDE++L++AVA QPVSV I  S  A Q 
Sbjct: 202 DYPYQERD-GTCKKDKLKQKVVTIDSYAGVKSNDEKALMEAVAAQPVSVGICGSERAFQL 260

Query: 261 YSG-------GVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQR 313
           YS        G+F+G C T L+H V  VGYG S+ G+ YW++KNSWG+ WG DG+  +QR
Sbjct: 261 YSSKFYLLMQGIFSGPCSTSLDHAVLIVGYG-SQNGVDYWIVKNSWGKSWGMDGFMHMQR 319

Query: 314 DIDQPQGQCGIAMFASFPV 332
           + +   G CGI M AS+P+
Sbjct: 320 NTENSDGVCGINMLASYPI 338


>gi|2342494|dbj|BAA21848.1| bromelain [Ananas comosus]
 gi|2463582|dbj|BAA22543.1| FB31 precursor [Ananas comosus]
          Length = 352

 Score =  251 bits (641), Expect = 4e-64,   Method: Compositional matrix adjust.
 Identities = 149/346 (43%), Positives = 204/346 (58%), Gaps = 26/346 (7%)

Query: 5   FLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVA 64
           FL + L +  +  S A+ R      + ++FE+W A+YGR YK++ E  +RF+IFK+N+  
Sbjct: 9   FLFLFLCVMWASPSAAS-RDEPSDPMMKRFEEWMAEYGRVYKDNDEKMRRFQIFKNNVNH 67

Query: 65  VERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFL----YKS 120
           +E FNN   GN SYTL +NKF D+T  EF+A  TG      S  L     P +       
Sbjct: 68  IETFNNRN-GN-SYTLGINKFTDMTNNEFVAQYTG----GISRPLNIEKEPVVSFDDVNI 121

Query: 121 SQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCAT 173
           S V  S++W + GAVT VK Q  C       A+A VEGI  I    LVSLSEQ+++DCA 
Sbjct: 122 SAVGQSIDWRDYGAVTEVKDQNPCGSCWAFSAIATVEGIYKIVTGYLVSLSEQEVLDCAV 181

Query: 174 NDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVP 233
           +   NGC GGF+D+A+ +II N G+ ++A Y Y+    G C +  +  ++A IT Y  V 
Sbjct: 182 S---NGCDGGFVDNAYDFIISNNGVASEADYPYQAYQ-GDC-AANSWPNSAYITGYSYVR 236

Query: 234 PNDEESLLKAVANQPVSVAIDASA--LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIK 291
            NDE S+  AV NQP++ AIDAS    Q+Y+GGVF+G C T LNH +T +GYG    G +
Sbjct: 237 SNDESSMKYAVWNQPIAAAIDASGDNFQYYNGGVFSGPCGTSLNHAITIIGYGQDSSGTQ 296

Query: 292 YWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVSKESA 337
           YW++KNSWG  WGE GY R+ R +    G CGIAM   +P  +  A
Sbjct: 297 YWIVKNSWGSSWGERGYIRMARGVSS-SGLCGIAMDPLYPTLQSGA 341


>gi|218202389|gb|EEC84816.1| hypothetical protein OsI_31898 [Oryza sativa Indica Group]
          Length = 350

 Score =  251 bits (640), Expect = 4e-64,   Method: Compositional matrix adjust.
 Identities = 141/328 (42%), Positives = 190/328 (57%), Gaps = 31/328 (9%)

Query: 30  IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLT 89
           + ++FEQW  ++GR Y +S E  +RFE+++ N+  VE FN+ + G   Y L  NKFADLT
Sbjct: 28  MLDRFEQWMIRHGRAYTDSGEKQRRFEVYRRNVELVETFNSMSNG---YKLADNKFADLT 84

Query: 90  PQEFIASQTGFK----MSDHSSSLKAN-GTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQC 144
            +EF A   GF+    +   S++  A+   P       +P SV+W +KGAV  VK QG C
Sbjct: 85  NEEFRAKMLGFRPHVTIPQISNTCSADIAMPGESSDDILPKSVDWRKKGAVVEVKNQGDC 144

Query: 145 -------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKG 197
                  AVAA+EGIN IK   LVSLSEQ+LVDC  +D   GC GG+M  AF++++ N G
Sbjct: 145 GSCWAFSAVAAIEGINQIKNGELVSLSEQELVDC--DDEAVGCGGGYMSWAFEFVVGNHG 202

Query: 198 ITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA 257
           +T +A Y Y   + G C + K    A  I  Y +V P+ E  L +A A QPVSVA+D  +
Sbjct: 203 LTTEASYPYHA-ANGACQAAKLNQSAVAIAGYRNVTPSSEPDLARAAAAQPVSVAVDGGS 261

Query: 258 LQF--YSGGVFNGYCETFLNHGVTAVGYGTSEEGIK----------YWLIKNSWGQDWGE 305
             F  Y  GV+ G C   +NHGVT VGYG SE              YW++KNSWG +WG+
Sbjct: 262 FMFQLYGSGVYTGPCTADVNHGVTVVGYGESEPKTDGGGAAKGGEKYWIVKNSWGAEWGD 321

Query: 306 DGYFRLQRDI-DQPQGQCGIAMFASFPV 332
            GY  +QRD+     G CGIA+  S+PV
Sbjct: 322 AGYILMQRDVAGLASGLCGIALLPSYPV 349


>gi|357437719|ref|XP_003589135.1| Cysteine proteinase [Medicago truncatula]
 gi|355478183|gb|AES59386.1| Cysteine proteinase [Medicago truncatula]
          Length = 457

 Score =  250 bits (639), Expect = 5e-64,   Method: Compositional matrix adjust.
 Identities = 151/356 (42%), Positives = 209/356 (58%), Gaps = 35/356 (9%)

Query: 6   LIVVLIISGSCASQA------TY-RTFDEGSIAEK--------FEQWKAQYGRTYKESAE 50
           L++VLIIS    S A      +Y +T  + S +++        +E+W  ++G++Y    E
Sbjct: 12  LMIVLIISSFTVSLALDMSIISYDKTHPDKSTSKRTNKEVLTMYEEWLVKHGKSYNGLGE 71

Query: 51  NSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLK 110
             KRFEIFKDNL  ++  N     N +Y L L +FADLT +E+ +   G K+  +    K
Sbjct: 72  KDKRFEIFKDNLKFIDEHNGL---NSTYRLGLTRFADLTNEEYRSKFLGTKIDPNRRMKK 128

Query: 111 ANGTPFLYKSSQV----PPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINR 159
             G+     + +V    P SV+W ++GAV  VK Q  C       A+AAVEGIN I    
Sbjct: 129 LGGSKSNRYAPRVGDKLPESVDWRKEGAVVGVKDQASCGSCWAFSAIAAVEGINKIVTGD 188

Query: 160 LVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKA 219
           L+SLSEQ+LVDC T+  N GC GG MD AF++II N GI ++  Y Y+ +  G CD  + 
Sbjct: 189 LISLSEQELVDCDTS-YNEGCNGGLMDYAFEFIISNGGIDSEDDYPYKAVD-GRCDQNRK 246

Query: 220 EDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA--LQFYSGGVFNGYCETFLNHG 277
                 I +YEDVP  DE +L KAVANQP++VA++      Q Y  GVF G C T L+HG
Sbjct: 247 NAKVVTIDDYEDVPAYDELALQKAVANQPIAVAVEGGGREFQLYEYGVFTGRCGTALDHG 306

Query: 278 VTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQ-GQCGIAMFASFPV 332
           V AVGYGT E G  YW+++NSWG  WGE GY RL+R++   + G+CGIA+  S+P+
Sbjct: 307 VAAVGYGT-ENGKDYWIVRNSWGGSWGEQGYIRLERNLASSRAGKCGIAIEPSYPI 361


>gi|302142276|emb|CBI19479.3| unnamed protein product [Vitis vinifera]
          Length = 388

 Score =  250 bits (639), Expect = 6e-64,   Method: Compositional matrix adjust.
 Identities = 142/311 (45%), Positives = 183/311 (58%), Gaps = 47/311 (15%)

Query: 34  FEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEF 93
           +E W A++G++Y    E  +RF+IFKDNL  ++  N     NR+Y               
Sbjct: 4   YEAWLAKHGKSYNALGEKERRFQIFKDNLRFIDEHNAE---NRTYK-------------- 46

Query: 94  IASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQCA-------V 146
           I+ +  F++ D                  +P SV+W +KGAV  VK QG C        +
Sbjct: 47  ISDRYAFRVGD-----------------SLPESVDWRKKGAVVEVKDQGSCGSCWAFSTI 89

Query: 147 AAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSY 206
           AAVEGIN I    L+SLSEQ+LVDC T+  N GC GG MD AF++II N GI ++  Y Y
Sbjct: 90  AAVEGINKIVTGGLISLSEQELVDCDTS-YNEGCNGGLMDYAFEFIINNGGIDSEEDYPY 148

Query: 207 EGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA--LQFYSGG 264
           +  S G CD  +       I  YEDVP NDE+SL KAVANQPVSVAI+A     Q Y  G
Sbjct: 149 KA-SDGRCDQYRKNAKVVTIDGYEDVPENDEKSLEKAVANQPVSVAIEAGGREFQLYQSG 207

Query: 265 VFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDI-DQPQGQCG 323
           +F G C T L+HGVTAVGYGT E G+ YW++KNSWG  WGE+GY R++RD+     G+CG
Sbjct: 208 IFTGRCGTALDHGVTAVGYGT-ENGVDYWIVKNSWGASWGEEGYIRMERDLATSATGKCG 266

Query: 324 IAMFASFPVSK 334
           IAM AS+P+ K
Sbjct: 267 IAMEASYPIKK 277


>gi|356517384|ref|XP_003527367.1| PREDICTED: LOW QUALITY PROTEIN: vignain-like [Glycine max]
          Length = 332

 Score =  250 bits (639), Expect = 6e-64,   Method: Compositional matrix adjust.
 Identities = 150/342 (43%), Positives = 211/342 (61%), Gaps = 33/342 (9%)

Query: 4   YFLIVVLIISGS-CASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNL 62
           Y +   +++S +  A Q T RT  + S+ E   Q   +Y +  K+  +      +FK+N+
Sbjct: 8   YHIAFAMLLSMAFLAFQVTCRTLQDASMYESHGQRMTRYSKVDKDPPD-----XVFKENV 62

Query: 63  VAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKS-S 121
             +E  NNAA  ++ Y   +N+FA   P++        +   H  S     T F +++ +
Sbjct: 63  NYIEACNNAA--DKPYKRDINQFA---PKK--------RFKGHMCSSIIRITTFKFENVT 109

Query: 122 QVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLS-EQQLVDCAT 173
             P +V+  +K AVTP+K QGQC       AVAA EGI+A+   +L+ LS EQ+LVDC T
Sbjct: 110 ATPSTVDCRQKVAVTPIKDQGQCGCFWALSAVAATEGIHALXAGKLILLSSEQELVDCDT 169

Query: 174 NDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQI-TNYEDV 232
              +  C GG MDDAFK+IIQN G+  +A Y Y+G+  G C++ +A+ +AA I T YEDV
Sbjct: 170 KGVDQDCQGGLMDDAFKFIIQNHGLNTEANYPYKGVD-GKCNAYEADKNAATIITGYEDV 228

Query: 233 PPNDEESLL-KAVANQPVSVAIDASA--LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEG 289
           P N+E++ L KAVAN PVSVAIDAS    QFY  GVF G C T L+HGVTAVGYG S++G
Sbjct: 229 PANNEKAHLQKAVANNPVSVAIDASGSDFQFYKSGVFTGSCGTELDHGVTAVGYGVSDDG 288

Query: 290 IKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFP 331
            +YWL+KNS G +WGE+GY R+QR +D  +  CGIA+ AS+P
Sbjct: 289 TEYWLVKNSRGTEWGEEGYIRMQRGVDSEEALCGIAVQASYP 330


>gi|357115272|ref|XP_003559414.1| PREDICTED: thiol protease SEN102-like [Brachypodium distachyon]
          Length = 360

 Score =  250 bits (638), Expect = 7e-64,   Method: Compositional matrix adjust.
 Identities = 143/327 (43%), Positives = 200/327 (61%), Gaps = 27/327 (8%)

Query: 29  SIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFN---NAAIGNR--SYTLRLN 83
           ++A + E W A++GRTY ++ E ++R EIF+ N   ++ FN   +AA G    S+ L  N
Sbjct: 38  AMASRHESWMAEHGRTYADAEEKARRLEIFRANAERIDSFNSKADAAAGESVDSHRLATN 97

Query: 84  KFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKS----SQVPPSVNWIEKGAVTPVK 139
           +FADLT +EF A++TG +     +        F Y++    +    S++W   GAVT VK
Sbjct: 98  RFADLTDEEFRAARTGLRRPAAVAGAVG--GGFRYENFSLQADAAGSMDWRAMGAVTGVK 155

Query: 140 YQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYI 192
            QG C       AVAA+EG+  I+  RLVSLSEQQLVDC    ++ GC GG MD+AF+YI
Sbjct: 156 DQGSCGCCWAFSAVAAMEGLTKIRTGRLVSLSEQQLVDCDVYGDDQGCEGGLMDNAFQYI 215

Query: 193 IQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVA 252
            +  G+ +++ Y Y G   G C S +A+  AA I  +EDVP N+E +L+ AVA+QPVSVA
Sbjct: 216 SRQGGLASESAYPYSGEDGGSCRSGRAQP-AASIRGHEDVPANNEGALMAAVAHQPVSVA 274

Query: 253 IDAS--ALQFY----SGGVFNGYCE-TFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGE 305
           I+      +FY     G   NG CE T L+H +TAVGYG + +G  YWL+KNSWG  WGE
Sbjct: 275 INGGDYVFRFYDRGVLGAGGNGGCESTELDHAITAVGYGMAGDGTGYWLMKNSWGSGWGE 334

Query: 306 DGYFRLQRDIDQPQGQCGIAMFASFPV 332
            GY R++R   + +G CG+A  AS+PV
Sbjct: 335 SGYVRIRRG-SRGEGVCGLAKLASYPV 360


>gi|193806686|sp|A5HII1.1|ACTN_ACTDE RecName: Full=Actinidain; Short=Actinidin; AltName: Full=Allergen
           Act d 1; AltName: Allergen=Act d 1; Flags: Precursor
 gi|146215974|gb|ABQ10189.1| actinidin Act1a [Actinidia deliciosa]
          Length = 380

 Score =  250 bits (638), Expect = 7e-64,   Method: Compositional matrix adjust.
 Identities = 146/343 (42%), Positives = 198/343 (57%), Gaps = 21/343 (6%)

Query: 1   MAKYFLIVVLIISGSC-ASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFK 59
           M+  F   +LI+S +  A   T RT DE  +   +E W  +YG++Y    E  +RFEIFK
Sbjct: 10  MSLLFFSTLLILSLAFNAKNLTQRTNDE--VKAMYESWLIKYGKSYNSLGEWERRFEIFK 67

Query: 60  DNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYK 119
           + L  ++  N  A  NRSY + LN+FADLT +EF ++  GF    + + +     P   +
Sbjct: 68  ETLRFIDEHN--ADTNRSYKVGLNQFADLTDEEFRSTYLGFTSGSNKTKVSNRYEP---R 122

Query: 120 SSQVPPS-VNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDC 171
             QV PS V+W   GAV  +K QG+C       A+A VEGIN I    L+SLSEQ+L+DC
Sbjct: 123 VGQVLPSYVDWRSAGAVVDIKSQGECGGCWAFSAIATVEGINKIVTGVLISLSEQELIDC 182

Query: 172 ATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYED 231
               N  GC GG++ D F++II N GI  +  Y Y     G C+     +    I  YE+
Sbjct: 183 GRTQNTRGCNGGYITDGFQFIINNGGINTEENYPYTAQD-GECNLDLQNEKYVTIDTYEN 241

Query: 232 VPPNDEESLLKAVANQPVSVAIDAS--ALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEG 289
           VP N+E +L  AV  QPVSVA+DA+  A + YS G+F G C T ++H VT VGYGT E G
Sbjct: 242 VPYNNEWALQTAVTYQPVSVALDAAGDAFKHYSSGIFTGPCGTAIDHAVTIVGYGT-EGG 300

Query: 290 IKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
           I YW++KNSW   WGE+GY R+ R++    G CGIA   S+PV
Sbjct: 301 IDYWIVKNSWDTTWGEEGYMRILRNVGG-AGTCGIATMPSYPV 342


>gi|9502426|gb|AAF88125.1|AC021043_18 Putative cysteine proteinase [Arabidopsis thaliana]
          Length = 365

 Score =  250 bits (638), Expect = 9e-64,   Method: Compositional matrix adjust.
 Identities = 141/362 (38%), Positives = 203/362 (56%), Gaps = 38/362 (10%)

Query: 4   YFLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLV 63
           +  + +L +    +    + T +E SI +  +QW  Q+ R YK+ +E   R ++FK NL 
Sbjct: 8   FVALTILSMDLRISQARPHVTLNEQSIVDYHQQWMTQFSRVYKDESEKEMRLKVFKKNLK 67

Query: 64  AVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTP-----FLY 118
            +E FNN  +GN+SYTL +N+F D   +EF+A+ TG +++  S S   N T       + 
Sbjct: 68  FIENFNN--MGNQSYTLGVNEFTDWKTEEFLATHTGLRVNVTSLSELFNKTKPSRNWNMS 125

Query: 119 KSSQVPPSVNWIEKGAVTPVKYQGQCAVAAV-------------------------EGIN 153
                  S +W ++GAVTPVKYQG C                              EG+ 
Sbjct: 126 DIDMEDESKDWRDEGAVTPVKYQGACPEFPTKQIRRNSLVGKQYTKLLGVLSDWGDEGLT 185

Query: 154 AIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGI 213
            I    L++LSEQQL+DC   + N GC GG  ++AFKYII+N G++ +  Y Y+      
Sbjct: 186 KISGKNLLTLSEQQLIDCDI-EKNGGCNGGEFEEAFKYIIKNGGVSLETEYPYQVKKESC 244

Query: 214 CDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASALQF--YSGGVFNGY-C 270
             + +   H  QI  ++ VP ++E +LL+AV  QPVSV IDA A  F  Y GGV+ G  C
Sbjct: 245 RANARRAPHT-QIRGFQMVPSHNERALLEAVRRQPVSVLIDARADSFGHYKGGVYAGLDC 303

Query: 271 ETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASF 330
            T +NH VT VGYGT   G+ YW++KNSWG+ WGE+GY R++RD++ PQG CGIA  A++
Sbjct: 304 GTDVNHAVTIVGYGTMS-GLNYWVLKNSWGESWGENGYMRIRRDVEWPQGMCGIAQVAAY 362

Query: 331 PV 332
           PV
Sbjct: 363 PV 364


>gi|15984|emb|CAA34486.1| unnamed protein product [Actinidia deliciosa]
          Length = 380

 Score =  249 bits (637), Expect = 9e-64,   Method: Compositional matrix adjust.
 Identities = 146/343 (42%), Positives = 198/343 (57%), Gaps = 21/343 (6%)

Query: 1   MAKYFLIVVLIISGSC-ASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFK 59
           M+  F   +LI+S +  A   T RT DE  +   +E W  +YG++Y    E  +RFEIFK
Sbjct: 10  MSLLFFSTLLILSLAFNAKNLTQRTNDE--VKAMYESWLIKYGKSYNSLGEWERRFEIFK 67

Query: 60  DNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYK 119
           + L  ++  N  A  NRSY + LN+FADLT +EF ++  GF    + + +     P   +
Sbjct: 68  ETLRFIDEHN--ADTNRSYKVGLNQFADLTDEEFRSTYLGFTSGSNKTKVSNRYEP---R 122

Query: 120 SSQVPPS-VNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDC 171
             QV PS V+W   GAV  +K QG+C       A+A VEGIN I    L+SLSEQ+L+DC
Sbjct: 123 FGQVLPSYVDWRSAGAVVDIKSQGECGGCWAFSAIATVEGINKIVTGVLISLSEQELIDC 182

Query: 172 ATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYED 231
               N  GC GG++ D F++II N GI  +  Y Y     G C+     +    I  YE+
Sbjct: 183 GRTQNTRGCNGGYITDGFQFIINNGGINTEENYPYTAQD-GECNLDLQNEKYVTIDTYEN 241

Query: 232 VPPNDEESLLKAVANQPVSVAIDAS--ALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEG 289
           VP N+E +L  AV  QPVSVA+DA+  A + YS G+F G C T ++H VT VGYGT E G
Sbjct: 242 VPYNNEWALQTAVTYQPVSVALDAAGDAFKHYSSGIFTGPCGTAIDHAVTIVGYGT-EGG 300

Query: 290 IKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
           I YW++KNSW   WGE+GY R+ R++    G CGIA   S+PV
Sbjct: 301 IDYWIVKNSWDTTWGEEGYMRILRNVGG-AGTCGIATMPSYPV 342


>gi|115479933|ref|NP_001063560.1| Os09g0497500 [Oryza sativa Japonica Group]
 gi|113631793|dbj|BAF25474.1| Os09g0497500 [Oryza sativa Japonica Group]
 gi|215704298|dbj|BAG93138.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 349

 Score =  249 bits (637), Expect = 1e-63,   Method: Compositional matrix adjust.
 Identities = 140/328 (42%), Positives = 190/328 (57%), Gaps = 31/328 (9%)

Query: 30  IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLT 89
           + ++FEQW  ++GR Y ++ E  +RFE+++ N+  VE FN+ + G   Y L  NKFADLT
Sbjct: 27  MLDRFEQWMIRHGRAYTDAGEKQRRFEVYRRNVELVETFNSMSNG---YKLADNKFADLT 83

Query: 90  PQEFIASQTGFK----MSDHSSSLKAN-GTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQC 144
            +EF A   GF+    +   S++  A+   P       +P SV+W +KGAV  VK QG C
Sbjct: 84  NEEFRAKMLGFRPHVTIPQISNTCSADIAMPGESSDDILPKSVDWRKKGAVVEVKNQGDC 143

Query: 145 -------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKG 197
                  AVAA+EGIN IK   LVSLSEQ+LVDC  +D   GC GG+M  AF++++ N G
Sbjct: 144 GSCWAFSAVAAIEGINQIKNGELVSLSEQELVDC--DDEAVGCGGGYMSWAFEFVVGNHG 201

Query: 198 ITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA 257
           +T +A Y Y   + G C + K    A  I  Y +V P+ E  L +A A QPVSVA+D  +
Sbjct: 202 LTTEASYPYHA-ANGACQAAKLNQSAVAIAGYRNVTPSSEPDLARAAAAQPVSVAVDGGS 260

Query: 258 LQF--YSGGVFNGYCETFLNHGVTAVGYGTSEEGIK----------YWLIKNSWGQDWGE 305
             F  Y  GV+ G C   +NHGVT VGYG SE              YW++KNSWG +WG+
Sbjct: 261 FMFQLYGSGVYTGPCTADVNHGVTVVGYGESEPKTDGGGAAKGGEKYWIVKNSWGAEWGD 320

Query: 306 DGYFRLQRDI-DQPQGQCGIAMFASFPV 332
            GY  +QRD+     G CGIA+  S+PV
Sbjct: 321 AGYILMQRDVAGLASGLCGIALLPSYPV 348


>gi|359359068|gb|AEV40975.1| putative cysteine protease [Oryza punctata]
          Length = 464

 Score =  249 bits (637), Expect = 1e-63,   Method: Compositional matrix adjust.
 Identities = 147/300 (49%), Positives = 184/300 (61%), Gaps = 20/300 (6%)

Query: 49  AENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSS 108
            E  +RF +F DNL  V+  N  A G+  + L +N+FADLT  EF A+  G      + +
Sbjct: 85  GEYERRFRVFWDNLKFVDAHNAHADGHGGFRLGMNRFADLTNDEFRAAYLGT-----TPA 139

Query: 109 LKANGTPFLYKSSQV---PPSVNWIEKGAV-TPVKYQGQC-------AVAAVEGINAIKI 157
            +      +Y+   V   P SV+W +KGAV +PVK QGQC       AVAAVEGIN I  
Sbjct: 140 GRGRHVGEMYRHDGVEALPDSVDWRDKGAVVSPVKNQGQCGSCWAFSAVAAVEGINKIVT 199

Query: 158 NRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSI 217
             LVSLSEQ+LV+CA N  N+GC GG MDDAF +I +N G+  +  Y Y  M  G CD  
Sbjct: 200 GELVSLSEQELVECARNGGNSGCNGGIMDDAFAFITRNGGLDTEEDYPYTAMD-GKCDLA 258

Query: 218 KAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA--LQFYSGGVFNGYCETFLN 275
           K       I  +EDVP NDE SL KAVA+QPVSVAIDA     Q Y  GVF G C T L+
Sbjct: 259 KKSRKVVSIDGFEDVPENDELSLQKAVAHQPVSVAIDAGGREFQLYDSGVFTGRCGTSLD 318

Query: 276 HGVTAVGYGT-SEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVSK 334
           HGV AVGYGT +  G  YW ++NSWG DWGE+GY R++R++    G+CGIAM AS+P+ K
Sbjct: 319 HGVVAVGYGTDAATGTDYWTVRNSWGPDWGENGYIRMERNVTARTGKCGIAMMASYPIKK 378


>gi|2463586|dbj|BAA22545.1| FB22 precursor [Ananas comosus]
          Length = 340

 Score =  249 bits (637), Expect = 1e-63,   Method: Compositional matrix adjust.
 Identities = 147/346 (42%), Positives = 203/346 (58%), Gaps = 27/346 (7%)

Query: 5   FLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVA 64
           FL + L +  +  S A+ R      + ++FE+W A+YGR YK++ E  +RF+IFK+N+  
Sbjct: 9   FLFLFLCVMWASPSAAS-RDEPSDPMMKRFEEWMAEYGRVYKDNDEKMRRFQIFKNNVNH 67

Query: 65  VERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFL----YKS 120
           +E FNN   GN SYTL +NKF D+T  EF+   TG  +      L     P +       
Sbjct: 68  IETFNNRN-GN-SYTLGINKFTDMTNNEFVTQYTGVSLP-----LNFKREPVVSFDDVNI 120

Query: 121 SQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCAT 173
           S V  S++W + GAVT VK Q  C       A+A VEGI  I    LVSLSEQ+++DCA 
Sbjct: 121 SAVGQSIDWRDYGAVTEVKDQNPCGSCWAFSAIATVEGIYKIVTGYLVSLSEQEVLDCAV 180

Query: 174 NDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVP 233
           +   NGC GGF+D+A+ +II N G+ ++A Y Y+    G C +  +  ++A IT Y  V 
Sbjct: 181 S---NGCDGGFVDNAYDFIISNNGVASEADYPYQAYE-GDC-TANSWPNSAYITGYSYVR 235

Query: 234 PNDEESLLKAVANQPVSVAIDASA--LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIK 291
            NDE S+  AV NQP++ AIDAS    Q+Y+GGVF+G C T LNH +T +GYG    G +
Sbjct: 236 SNDESSMKYAVWNQPIAAAIDASGDNFQYYNGGVFSGPCGTSLNHAITIIGYGQDSSGTQ 295

Query: 292 YWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVSKESA 337
           YW++KNSWG  WGE GY R+ R +    G CGIAM   +P  +  A
Sbjct: 296 YWIVKNSWGSSWGERGYVRMARGVSS-SGLCGIAMDPLYPTLQSGA 340


>gi|2144501|pir||TAGB actinidain (EC 3.4.22.14) precursor - kiwi fruit
 gi|166317|gb|AAA32629.1| actinidin [Actinidia deliciosa]
          Length = 380

 Score =  249 bits (636), Expect = 1e-63,   Method: Compositional matrix adjust.
 Identities = 146/343 (42%), Positives = 198/343 (57%), Gaps = 21/343 (6%)

Query: 1   MAKYFLIVVLIISGSC-ASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFK 59
           M+  F   +LI+S +  A   T RT DE  +   +E W  +YG++Y    E  +RFEIFK
Sbjct: 10  MSLLFFSTLLILSLAFNAKNLTQRTNDE--VKAMYESWLIKYGKSYNSLGEWERRFEIFK 67

Query: 60  DNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYK 119
           + L  ++  N  A  NRSY + LN+FADLT +EF ++  GF    + + +     P   +
Sbjct: 68  ETLRFIDEHN--ADTNRSYKVGLNQFADLTDEEFRSTYLGFTSGSNKTKVSNRYEP---R 122

Query: 120 SSQVPPS-VNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDC 171
             QV PS V+W   GAV  +K QG+C       A+A VEGIN I    L+SLSEQ+L+DC
Sbjct: 123 VGQVLPSYVDWRSAGAVVDIKSQGECGGCWAFSAIATVEGINKIVTGVLISLSEQELIDC 182

Query: 172 ATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYED 231
               N  GC GG++ D F++II N GI  +  Y Y     G C+     +    I  YE+
Sbjct: 183 GRTQNTRGCNGGYITDGFQFIINNGGINTEENYPYTAQD-GECNVELQNEKYVTIDTYEN 241

Query: 232 VPPNDEESLLKAVANQPVSVAIDAS--ALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEG 289
           VP N+E +L  AV  QPVSVA+DA+  A + YS G+F G C T ++H VT VGYGT E G
Sbjct: 242 VPYNNEWALQTAVTYQPVSVALDAAGDAFKQYSSGIFTGPCGTAIDHAVTIVGYGT-EGG 300

Query: 290 IKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
           I YW++KNSW   WGE+GY R+ R++    G CGIA   S+PV
Sbjct: 301 IDYWIVKNSWDTTWGEEGYMRILRNVGG-AGTCGIATMPSYPV 342


>gi|242066206|ref|XP_002454392.1| hypothetical protein SORBIDRAFT_04g029960 [Sorghum bicolor]
 gi|241934223|gb|EES07368.1| hypothetical protein SORBIDRAFT_04g029960 [Sorghum bicolor]
          Length = 356

 Score =  249 bits (636), Expect = 1e-63,   Method: Compositional matrix adjust.
 Identities = 143/310 (46%), Positives = 189/310 (60%), Gaps = 17/310 (5%)

Query: 34  FEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEF 93
           F+ W  ++ + Y    E  KR+ IFK NL+ +   N     N SY L LN+FAD+T +EF
Sbjct: 45  FKSWSVKHRKIYVSPKEKLKRYGIFKQNLMHIAETNRK---NGSYWLGLNQFADITHEEF 101

Query: 94  IASQTGFK--MSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQC------- 144
            A+  G K  +S   +  +   T     ++ +P SV+W  KGAVTPVK QG+C       
Sbjct: 102 KANHLGLKQGLSRMGAQTRTPTTFRYAAAANLPWSVDWRYKGAVTPVKNQGKCGSCWAFS 161

Query: 145 AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVY 204
           +VAAVEGIN I   +LVSLSEQ+L+DC T   ++GC GG MD AF YI+ ++GI  +  Y
Sbjct: 162 SVAAVEGINQIVTGKLVSLSEQELMDCDTM-LDHGCEGGLMDFAFAYIMGSQGIHAEDDY 220

Query: 205 SYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA--LQFYS 262
            Y  M  G C   +   +   IT YEDVP N E SLLKA+A+QPVSV I A +   QFY 
Sbjct: 221 PYL-MEEGYCKEKQPYANVVTITGYEDVPENSEISLLKALAHQPVSVGIAAGSRDFQFYK 279

Query: 263 GGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQC 322
           GGVF+G C   L+H +TAVGYG+S  G  Y  +KNSWG++WGE GY R++    +P+G C
Sbjct: 280 GGVFDGSCSDELDHALTAVGYGSSY-GQNYITMKNSWGKNWGEQGYVRIKMGTGKPEGVC 338

Query: 323 GIAMFASFPV 332
           GI   AS+PV
Sbjct: 339 GIYTMASYPV 348


>gi|297830594|ref|XP_002883179.1| hypothetical protein ARALYDRAFT_318695 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297329019|gb|EFH59438.1| hypothetical protein ARALYDRAFT_318695 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 308

 Score =  249 bits (636), Expect = 1e-63,   Method: Compositional matrix adjust.
 Identities = 140/324 (43%), Positives = 194/324 (59%), Gaps = 36/324 (11%)

Query: 34  FEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLT---P 90
           +E+W  +  + Y    E  +R +IFK+NL  ++  N  ++ N+++ + L +FADLT   P
Sbjct: 2   YERWLVENRKNYNGLGEKERRCKIFKENLKFIDEHN--SLPNQTFEVGLTRFADLTNDEP 59

Query: 91  QEFIASQTGFKMSDHSSSLKANGTPFLYKSSQV-PPSVNWIEKGAVTPVKYQGQC----- 144
           ++F+ +                   +LYK   + P  ++W  KGAV PVK QG C     
Sbjct: 60  KDFMKADR-----------------YLYKEGDILPDEIDWRAKGAVVPVKDQGNCGSCWA 102

Query: 145 --AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDA 202
             AV AVEGIN IK   L+SLS+Q+L+DC     N GC GG M+ AF++II N GI +D 
Sbjct: 103 FSAVGAVEGINQIKTGELISLSDQELIDCDRGFVNAGCEGGVMNYAFEFIINNGGIESDQ 162

Query: 203 VYSYEGMSTGICDSIKAED-HAAQITNYEDVPPNDEESLLKAVANQPVSVAIDAS--ALQ 259
            Y Y     G+C++ K  +    +I  YE V  NDE+SL KAVA+QPV VAI+AS  A +
Sbjct: 163 DYPYTATDLGVCNADKKNNTRVVKIDGYEYVAQNDEKSLKKAVAHQPVGVAIEASSQAFK 222

Query: 260 FYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQ 319
            Y  GVF G C  +L+HGV  VGYGTS  G  YW+I+NSWG +WGE+GY +LQR+ID   
Sbjct: 223 LYKSGVFTGTCGIYLDHGVVVVGYGTS-SGEDYWIIRNSWGLNWGENGYVKLQRNIDDSF 281

Query: 320 GQCGIAMFASFPVSKESAQPSSAD 343
           G+CG+AM  S+P   +S+ PSS D
Sbjct: 282 GKCGVAMMPSYPT--KSSFPSSFD 303


>gi|146215978|gb|ABQ10191.1| actinidin Act1c [Actinidia eriantha]
          Length = 368

 Score =  249 bits (635), Expect = 2e-63,   Method: Compositional matrix adjust.
 Identities = 145/342 (42%), Positives = 199/342 (58%), Gaps = 23/342 (6%)

Query: 1   MAKYFLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKD 60
           M+  F   +LI+S +  ++   RT DE  +   +E W  ++G++Y    E  +RFEIFK+
Sbjct: 10  MSLLFFSTLLILSLALDAK---RTNDE--VKAMYESWLIKHGKSYNSLGERERRFEIFKE 64

Query: 61  NLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKS 120
            L  ++  N  A  +RSY + LN+FADLT +EF ++  GF    + + +     P   + 
Sbjct: 65  TLRFIDEHN--ADTSRSYKVGLNQFADLTNEEFRSTYLGFTRGSNKTKVSNRYEP---RV 119

Query: 121 SQVPPS-VNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCA 172
            QV P  V+W  +GAV  +K QGQC       A+AAVEGIN I    L+SLSEQ+LVDC 
Sbjct: 120 GQVLPDYVDWRSEGAVVDIKNQGQCGSCWAFSAIAAVEGINKIVTGNLISLSEQELVDCG 179

Query: 173 TNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDV 232
              +  GC GG+M D F++II N GI  +  Y Y     G CD     +    I NYE+V
Sbjct: 180 RTQSTKGCDGGYMTDGFEFIINNGGINTEENYPYTAQE-GQCDLNLQNEKYVTIDNYENV 238

Query: 233 PPNDEESLLKAVANQPVSVAIDAS--ALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGI 290
           P  +E +L  AVA QPVSVA++++  A Q YS G+F G C T  +H VT VGYGT E GI
Sbjct: 239 PYYNEWALQTAVAYQPVSVALESAGDAFQHYSSGIFTGPCGTATDHAVTIVGYGT-EGGI 297

Query: 291 KYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
            YW++KNSW   WGE+GY R+ R++    G CGIA   S+PV
Sbjct: 298 DYWIVKNSWDTTWGEEGYMRILRNVGGA-GTCGIATMPSYPV 338


>gi|3451077|emb|CAA20473.1| cysteine proteinase-like protein [Arabidopsis thaliana]
 gi|7269200|emb|CAB79307.1| cysteine proteinase-like protein [Arabidopsis thaliana]
          Length = 355

 Score =  248 bits (634), Expect = 2e-63,   Method: Compositional matrix adjust.
 Identities = 144/355 (40%), Positives = 208/355 (58%), Gaps = 28/355 (7%)

Query: 1   MAKYFLIVVLIISGSCA-------SQATYRTFDEGSIAEKFEQWKAQYGRTYKES-AENS 52
           M   FL++V ++S   +       S    R+ +E  +   F+ W +++G+TY  +  E  
Sbjct: 9   MTILFLLIVFVLSAPSSAMDLPATSGGHNRSNEE--VEFIFQMWMSKHGKTYTNALGEKE 66

Query: 53  KRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKAN 112
           +RF+ FKDNL  +++ N     N SY L L +FADLT QE+     G        +LK +
Sbjct: 67  RRFQNFKDNLRFIDQHNAK---NLSYQLGLTRFADLTVQEYRDLFPG-SPKPKQRNLKTS 122

Query: 113 GTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSE 165
                    Q+P SV+W ++GAV+ +K QG C        VAAVEG+N I    L+SLSE
Sbjct: 123 RRYVPLAGDQLPESVDWRQEGAVSEIKDQGTCNSCWAFSTVAAVEGLNKIVTGELISLSE 182

Query: 166 QQLVDCATNDNNNGCYG-GFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAA 224
           Q+LVDC  N  NNGCYG G MD AF+++I N G+ ++  Y Y+G + G C+  +      
Sbjct: 183 QELVDC--NLVNNGCYGSGLMDTAFQFLINNNGLDSEKDYPYQG-TQGSCNRKQVHLLVI 239

Query: 225 QITNYEDVPPNDEESLLKAVANQPVSVAIDASALQF--YSGGVFNGYCETFLNHGVTAVG 282
            I +YEDVP NDE SL KAVA+QPVSV +D  + +F  Y   ++NG C T L+H +  VG
Sbjct: 240 TIDSYEDVPANDEISLQKAVAHQPVSVGVDKKSQEFMLYRSCIYNGPCGTNLDHALVIVG 299

Query: 283 YGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVSKESA 337
           YG SE G  YW+++NSWG  WG+ GY ++ R+ + P+G CGIAM AS+P+   ++
Sbjct: 300 YG-SENGQDYWIVRNSWGTTWGDAGYIKIARNFEDPKGLCGIAMLASYPIKNSAS 353


>gi|312451836|gb|ADQ85985.1| actinidin [Actinidia chinensis]
          Length = 380

 Score =  248 bits (634), Expect = 2e-63,   Method: Compositional matrix adjust.
 Identities = 145/343 (42%), Positives = 199/343 (58%), Gaps = 21/343 (6%)

Query: 1   MAKYFLIVVLIISGSCASQ-ATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFK 59
           M+  F   +LI+S +  ++  T RT DE  +   +E W  +YG++Y    E  +RFEIFK
Sbjct: 10  MSLLFFSTLLILSLAFNTKNLTQRTNDE--VKAMYESWLIKYGKSYNSLGEWERRFEIFK 67

Query: 60  DNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYK 119
           + L  ++  N  A  NRSY + LN+FADLT +EF ++  GF    + + +     P   +
Sbjct: 68  ETLRFIDEHN--ADTNRSYKVGLNQFADLTDEEFRSTYLGFTSGSNKTKVSNRYEP---R 122

Query: 120 SSQVPPS-VNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDC 171
             QV PS V+W   GAV  +K QG+C       A+A VEGIN I    L+SLSEQ+L+DC
Sbjct: 123 VGQVLPSYVDWRSAGAVVDIKSQGECGGCWAFSAIATVEGINKIVTGVLISLSEQELIDC 182

Query: 172 ATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYED 231
               N  GC GG++ D F++II N GI  +  Y Y     G C+     +    I  YE+
Sbjct: 183 GRTQNTRGCNGGYITDGFQFIINNGGINTEENYPYTAQD-GECNVDLQNEKYVTIDTYEN 241

Query: 232 VPPNDEESLLKAVANQPVSVAIDAS--ALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEG 289
           VP N+E +L  AV  QPVSVA+DA+  A + YS G+F G C T ++H VT VGYGT E G
Sbjct: 242 VPYNNEWALQTAVTYQPVSVALDAAGDAFKQYSSGIFTGPCGTAIDHAVTIVGYGT-EGG 300

Query: 290 IKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
           I YW++KNSW   WGE+GY R+ R++    G CGIA   S+PV
Sbjct: 301 IDYWIVKNSWDTTWGEEGYMRILRNVGGA-GTCGIATMPSYPV 342


>gi|303283194|ref|XP_003060888.1| predicted protein [Micromonas pusilla CCMP1545]
 gi|226457239|gb|EEH54538.1| predicted protein [Micromonas pusilla CCMP1545]
          Length = 422

 Score =  248 bits (634), Expect = 2e-63,   Method: Compositional matrix adjust.
 Identities = 145/334 (43%), Positives = 193/334 (57%), Gaps = 17/334 (5%)

Query: 29  SIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNA-AIGNRSYTLRLNKFAD 87
           +I  +F++W A +G+ Y    E +KR  IF DN   V   N A A G +S+ LRLN  AD
Sbjct: 65  TIEARFDRWLATHGKAYACPKERAKRLAIFADNAEFVRVHNEAHAAGKKSHWLRLNHLAD 124

Query: 88  LTPQEFIASQTGFKMSD---HSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQC 144
           LT +EF     G+  S     SSS   +   + Y     P +++W+ +GAVTPVK QGQC
Sbjct: 125 LTREEF-KHMLGYDASKKRVESSSPPVDAANWEYADVTPPETMDWVSRGAVTPVKNQGQC 183

Query: 145 A-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKG 197
                   V AVEG+ A+K   L+SLSEQ+LV CA    NNGC GG MD+ F++I++N+G
Sbjct: 184 GSCWAFSTVGAVEGVVAVKTGDLISLSEQELVSCAKIGGNNGCKGGLMDNGFEWIVENRG 243

Query: 198 ITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDAS- 256
           + ++  + Y           K    AA I  ++DVP NDE++L KAV+ QPV+VAI+A  
Sbjct: 244 VDDEEDWGYLAKDRRCNWFKKRRAKAASIDGFKDVPRNDEDALKKAVSQQPVAVAIEADH 303

Query: 257 -ALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGI---KYWLIKNSWGQDWGEDGYFRLQ 312
              Q YSGGVF+G C T L+HGV  VGYG   E      YW +KNSWG  WGE+GY R+ 
Sbjct: 304 REFQLYSGGVFDGECGTNLDHGVLVVGYGYDGESAGHKHYWTVKNSWGAKWGEEGYIRIA 363

Query: 313 RDIDQPQGQCGIAMFASFPVSKESAQPSSADKSS 346
           R    P GQCG+AM AS+P    SA     D+ +
Sbjct: 364 RGGMGPAGQCGVAMQASYPTKSSSAPLEDGDEPT 397


>gi|3980198|emb|CAA46863.1| thiolprotease [Pisum sativum]
          Length = 464

 Score =  248 bits (634), Expect = 2e-63,   Method: Compositional matrix adjust.
 Identities = 140/327 (42%), Positives = 196/327 (59%), Gaps = 21/327 (6%)

Query: 19  QATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSY 78
           ++T RT D+  +   +E+W  ++G+ Y    E  KRFEIFKDNL  ++  N+    N S+
Sbjct: 34  KSTPRTNDQ--VLTMYEEWLVKHGKNYNALGEKEKRFEIFKDNLGFIDEHNSK---NLSF 88

Query: 79  TLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKS---SQVPPSVNWIEKGAV 135
            L LN+FADLT +E+     G +++ +  + K N     Y +    ++P SV+W ++GAV
Sbjct: 89  RLGLNRFADLTNEEYRTRFLGTRINPNRRNRKVNSQTNRYATRVGDKLPESVDWRKEGAV 148

Query: 136 TPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDA 188
             VK QG C       A+AAVEG+N +    L+SLSEQ+LVDC T+  N GC GG MD A
Sbjct: 149 VGVKDQGSCGSCWAFSAIAAVEGVNKLATGDLISLSEQELVDCDTS-YNEGCNGGLMDYA 207

Query: 189 FKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQP 248
           F++II    +T +  Y Y  +  G CD  +       I  YEDVP  DE +L KAVANQ 
Sbjct: 208 FEFIINMVALTPEEDYPYRAID-GRCDQNRKNAKVVSIDQYEDVPAYDEGALKKAVANQV 266

Query: 249 VSVAIDASA--LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGED 306
           ++VA++      Q Y  GVF G C T L+HGV AVGYGT E G  YW+++NSWG  WGE 
Sbjct: 267 IAVAVEGGGREFQLYDSGVFTGRCGTALDHGVAAVGYGT-ENGKDYWIVRNSWGGSWGEA 325

Query: 307 GYFRLQRDIDQPQ-GQCGIAMFASFPV 332
           GY RL+R++   + G+CGIA+  S+P+
Sbjct: 326 GYIRLERNLATSKSGKCGIAIEPSYPI 352


>gi|297818854|ref|XP_002877310.1| hypothetical protein ARALYDRAFT_484828 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297323148|gb|EFH53569.1| hypothetical protein ARALYDRAFT_484828 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 376

 Score =  248 bits (634), Expect = 3e-63,   Method: Compositional matrix adjust.
 Identities = 142/329 (43%), Positives = 187/329 (56%), Gaps = 15/329 (4%)

Query: 20  ATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYT 79
           AT    +E  +   +E+W  ++G+ Y    E  +RF+IFKDNL  +E  N+    NRSY 
Sbjct: 27  ATESHRNEAEVRTIYERWLVEHGKNYNGLGEKERRFKIFKDNLKHIEEHNSDP--NRSYD 84

Query: 80  LRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQV-PPSVNWIEKGAVTP- 137
             LN+F+DLT  EF AS  G K+   S S  A    + YK   + P  V+W E+GAV P 
Sbjct: 85  RGLNQFSDLTVDEFQASYLGGKIEKKSLSDVAE--RYQYKEGDILPDEVDWRERGAVVPR 142

Query: 138 VKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFK 190
           VK QG C       A  AVEGIN I    L+SLSEQ+L+DC    +N GC GG    AF+
Sbjct: 143 VKRQGDCGSCWAFAATGAVEGINQITTGELLSLSEQELIDCDRGKDNFGCAGGGAVWAFE 202

Query: 191 YIIQNKGITNDAVYSYEGMSTGICDSIKAED-HAAQITNYEDVPPNDEESLLKAVANQPV 249
           +I +N GI  D  Y Y G  T  C +I+ +      I  +E VP NDE SL KAV+ QP+
Sbjct: 203 FIKENGGIVTDEDYGYTGDDTAACKAIEMKTTRVVTINGHEVVPVNDEMSLKKAVSYQPI 262

Query: 250 SVAIDASALQFYSGGVFNGYCETFL-NHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGY 308
           SV I A+ +  Y  GV+ G C     +H V  VGYGTS +   YWLI+NSWG  WGE GY
Sbjct: 263 SVMISAANMSDYKSGVYKGPCSNLWGDHNVLIVGYGTSSDEGDYWLIRNSWGPGWGEGGY 322

Query: 309 FRLQRDIDQPQGQCGIAMFASFPVSKESA 337
            RLQR+ ++P G+C +A+   +P+   SA
Sbjct: 323 LRLQRNFNEPTGKCAVAVAPVYPIKTNSA 351


>gi|356543010|ref|XP_003539956.1| PREDICTED: LOW QUALITY PROTEIN: vignain-like [Glycine max]
          Length = 306

 Score =  248 bits (634), Expect = 3e-63,   Method: Compositional matrix adjust.
 Identities = 149/314 (47%), Positives = 184/314 (58%), Gaps = 27/314 (8%)

Query: 33  KFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQE 92
           +FE+W  Q  R YK+  E   RF I++ NL  +E  N+      SY L  NKFADLT +E
Sbjct: 4   RFERWLKQNDRXYKDKEEWEVRFGIYQANLEYIECKNSQ---EXSYNLTDNKFADLTNEE 60

Query: 93  FIASQTGF--KMSDHSSSLKANGTPFLYKSSQ-VPPSVNWIEKGAVTPVKYQGQC----- 144
           F++   GF  +   H        T F+Y   + +P S +W ++GAV+ +K QG C     
Sbjct: 61  FVSPYLGFGTRFLPH--------TGFMYHEHEDLPESKDWRKEGAVSDIKDQGNCGSCWA 112

Query: 145 --AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDA 202
             AVAAVEGIN IK  +LVSLSEQ+  DC   D N GC GG MD AF +I +N G+T   
Sbjct: 113 FSAVAAVEGINKIKSGKLVSLSEQEFRDCDVEDGNQGCEGGLMDTAFAFIKKNGGLTTSK 172

Query: 203 VYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESL--LKAVANQPVSVAIDAS--AL 258
            Y YEG+  G C+  KA  HAA I+ +  VP NDE  L    A ANQ  SVAIDA   A 
Sbjct: 173 DYPYEGVD-GTCNKEKALHHAANISGHVKVPANDEAMLKAKAAAANQXESVAIDAGGHAF 231

Query: 259 QFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQP 318
           Q Y  GVF+G C   LNHGVT VGYG      KYW++KNSWG DWGE GY R++RD    
Sbjct: 232 QLYLKGVFSGICGKQLNHGVTIVGYGKGTSD-KYWIVKNSWGADWGESGYIRMKRDAFDK 290

Query: 319 QGQCGIAMFASFPV 332
            G CGIAM AS+P+
Sbjct: 291 AGTCGIAMQASYPL 304


>gi|242055753|ref|XP_002457022.1| hypothetical protein SORBIDRAFT_03g047290 [Sorghum bicolor]
 gi|241928997|gb|EES02142.1| hypothetical protein SORBIDRAFT_03g047290 [Sorghum bicolor]
          Length = 378

 Score =  248 bits (633), Expect = 3e-63,   Method: Compositional matrix adjust.
 Identities = 152/339 (44%), Positives = 197/339 (58%), Gaps = 42/339 (12%)

Query: 29  SIAEKFEQWKAQYGR-TYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFAD 87
           S+AE FE+W +++ +  Y    E  +RFE+FKDNL  ++  N       SY L LN+FAD
Sbjct: 43  SLAELFERWLSRHRKGAYASLEEKLRRFEVFKDNLHHIDETNRKV---SSYWLGLNEFAD 99

Query: 88  LTPQEFIASQTGFKMSDHSSSL----------------KANGTPFLYK-----SSQVPPS 126
           LT  EF A+  G   S     +                 ++ + F ++     ++++P S
Sbjct: 100 LTHDEFKATYLGLSPSGGGGDVVHMHHDDDDEEPEEEGSSSSSSFRFRYEGVDAARLPKS 159

Query: 127 VNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNG 179
           V+W  KGAVT VK QGQC        VAAVEGIN I    L +LSEQ+LVDC T D NNG
Sbjct: 160 VDWRSKGAVTGVKNQGQCGSCWAFSTVAAVEGINQIVTGNLTALSEQELVDCDT-DGNNG 218

Query: 180 CYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEES 239
           C GG MD AF YI  N G+  +  Y Y  M  G C S  +      I+ YEDVP N+E++
Sbjct: 219 CNGGLMDYAFSYIAHNGGLHTEEAYPYL-MEEGTC-SRGSSAAVVTISGYEDVPRNNEQA 276

Query: 240 LLKAVANQPVSVAIDASA--LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEG-----IKY 292
           LLKA+A+QPVSVAI+AS   LQFYSGGVF+G C T L+HGV AVGYGT+ +        Y
Sbjct: 277 LLKALAHQPVSVAIEASGRNLQFYSGGVFDGPCGTQLDHGVAAVGYGTAGKDNGHVVADY 336

Query: 293 WLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFP 331
            ++KNSWG  WGE GY R++R   + QG CGI    S+P
Sbjct: 337 IIVKNSWGPSWGEKGYIRMRRGTGKRQGLCGINKMPSYP 375


>gi|297809385|ref|XP_002872576.1| hypothetical protein ARALYDRAFT_489965 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297318413|gb|EFH48835.1| hypothetical protein ARALYDRAFT_489965 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 371

 Score =  248 bits (632), Expect = 4e-63,   Method: Compositional matrix adjust.
 Identities = 139/316 (43%), Positives = 191/316 (60%), Gaps = 29/316 (9%)

Query: 34  FEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERF-NNAAIGNRSYTLRLNKFADLTPQE 92
           FE W  ++G+ Y+  AE  +R  IF+DNL    RF  N    N SY L LN+FADL+  E
Sbjct: 56  FESWMVKHGKVYESVAEKERRLTIFEDNL----RFITNRNAENLSYRLGLNRFADLSLHE 111

Query: 93  FIASQTGFKMS---DHSSSLKANGTPFLYKSSQ---VPPSVNWIEKGAVTPVKYQGQC-- 144
           +     G       +H     +N     YK+S    +P SV+W  +GAVT VK QGQC  
Sbjct: 112 YAQICHGADPRPPRNHVFMTSSN----RYKTSDGDVLPKSVDWRNEGAVTEVKDQGQCRS 167

Query: 145 -----AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGIT 199
                 V AVEG+N I    LV+LSEQ L++C  N  NNGC GG ++ A+++I+ N G+ 
Sbjct: 168 CWAFSTVGAVEGLNKIVTGELVTLSEQDLINC--NKENNGCGGGKVETAYEFIMNNGGLG 225

Query: 200 NDAVYSYEGMSTGIC-DSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA- 257
            D  Y Y+ ++ G+C D +K  +    I  YE++P NDE +L+KAVA+QPV+  +D+S+ 
Sbjct: 226 TDNDYPYKALN-GVCNDRLKENNKNVMIDGYENLPANDESALMKAVAHQPVTAVVDSSSR 284

Query: 258 -LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDID 316
             Q Y+ GVF+G C T LNHGV  VGYGT E G  YW+++NS G  WGE GY ++ R+I 
Sbjct: 285 EFQLYASGVFDGTCGTNLNHGVVVVGYGT-ENGRDYWIVRNSRGNTWGEAGYMKMARNIA 343

Query: 317 QPQGQCGIAMFASFPV 332
            P+G CGIAM AS+P+
Sbjct: 344 NPRGLCGIAMRASYPL 359


>gi|297843784|ref|XP_002889773.1| hypothetical protein ARALYDRAFT_471096 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297335615|gb|EFH66032.1| hypothetical protein ARALYDRAFT_471096 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 439

 Score =  248 bits (632), Expect = 4e-63,   Method: Compositional matrix adjust.
 Identities = 140/312 (44%), Positives = 187/312 (59%), Gaps = 17/312 (5%)

Query: 32  EKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQ 91
           E F+ W  ++G+TY    E  +R +IFKDN   V + N   I N +Y+L LN FADLT  
Sbjct: 30  ELFDDWCQRHGKTYGSEEERQQRIQIFKDNHDFVTQHN--LITNATYSLSLNAFADLTHH 87

Query: 92  EFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQC------- 144
           EF AS+ G  +S  S  + + G   L  +++VP SV+W +KGAVT VK QG C       
Sbjct: 88  EFKASRLGLSVSASSLIMASKGQS-LGGNAKVPDSVDWRKKGAVTNVKDQGSCGACWSFS 146

Query: 145 AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVY 204
           A  A+EGIN I    L+SLSEQ+L+DC     N GC GG MD AF+++I+N GI  +  Y
Sbjct: 147 ATGAMEGINQIVTGDLISLSEQELIDC-DKSYNAGCNGGLMDYAFEFVIKNHGIDTEKDY 205

Query: 205 SYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDAS--ALQFYS 262
            Y+    G C   K +     I +Y  V  NDE++L +AVA QPVSV I  S  A Q YS
Sbjct: 206 PYQ-ERDGTCKKDKLKQKVVTIDSYAGVKSNDEKALREAVAAQPVSVGICGSERAFQLYS 264

Query: 263 --GGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQG 320
              G+F+G C T L+H V  VGYG S+ G+ YW++KNSWG+ WG DG+  +QR+    +G
Sbjct: 265 RVSGIFSGPCSTSLDHAVLIVGYG-SQNGVDYWIVKNSWGKSWGMDGFMHMQRNTGNSEG 323

Query: 321 QCGIAMFASFPV 332
            CGI M AS+P+
Sbjct: 324 ICGINMLASYPI 335


>gi|146215980|gb|ABQ10192.1| actinidin Act2a [Actinidia deliciosa]
          Length = 378

 Score =  248 bits (632), Expect = 4e-63,   Method: Compositional matrix adjust.
 Identities = 146/346 (42%), Positives = 199/346 (57%), Gaps = 23/346 (6%)

Query: 5   FLIVVLIISGSC-ASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLV 63
           F   +LI+S +     +  RT D+  +   +E W  ++G++Y    E   RFEIFK+NL 
Sbjct: 14  FFSTLLILSSAIDIENSVQRTNDQ--VMAMYESWLVEHGKSYNSLDEKEMRFEIFKENLR 71

Query: 64  AVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYK-SSQ 122
            ++  N  A  NRSY+L LN+FADLT +E+ ++  G K    +         ++ K    
Sbjct: 72  IIDDHN--ADANRSYSLGLNRFADLTDEEYRSTYLGLKRGPKTDV----SNQYMPKVGDA 125

Query: 123 VPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATND 175
           +P  V+W   GAV  VK QG C       AVAAVEGIN I    L+SLSEQ+LVDC    
Sbjct: 126 LPDYVDWRTVGAVVGVKNQGLCSSCWAFSAVAAVEGINKIVTGNLISLSEQELVDCGRTQ 185

Query: 176 NNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPN 235
              GC  G M DAFK+II N GI  +  Y Y     G C+          I +Y++VP N
Sbjct: 186 ITKGCNRGLMTDAFKFIINNGGINTENNYPYTA-KDGQCNLSLKNQKYVTIDSYKNVPSN 244

Query: 236 DEESLLKAVANQPVSVAIDASALQF--YSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYW 293
           +E +L KAVA QPVSV +++   +F  Y+ G+F G C T ++HGVT VGYGT E G+ YW
Sbjct: 245 NEMALKKAVAYQPVSVGVESEGGKFKLYTSGIFTGSCGTAVDHGVTIVGYGT-ERGMDYW 303

Query: 294 LIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVSKESAQP 339
           ++KNSWG +WGE GY R+QR+I    G+CGIA   S+PV K ++ P
Sbjct: 304 IVKNSWGTNWGESGYIRIQRNIGG-AGKCGIAKMPSYPV-KYTSNP 347


>gi|302764466|ref|XP_002965654.1| hypothetical protein SELMODRAFT_230713 [Selaginella moellendorffii]
 gi|300166468|gb|EFJ33074.1| hypothetical protein SELMODRAFT_230713 [Selaginella moellendorffii]
          Length = 345

 Score =  247 bits (631), Expect = 5e-63,   Method: Compositional matrix adjust.
 Identities = 140/308 (45%), Positives = 185/308 (60%), Gaps = 18/308 (5%)

Query: 34  FEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEF 93
           +++W  ++G+ Y  + E  KRF+IFK+N+  +   N  A  N S++L LNKFADLT  EF
Sbjct: 38  YQKWIQEHGKAYNSAHEYKKRFQIFKENVNYINSHN--ARRNNSHSLGLNKFADLTNSEF 95

Query: 94  IASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQC-------AV 146
                G ++   +   +      +   +    SV+W +KG VT +K QG C       AV
Sbjct: 96  RGLYVG-RLQRPAPFHEVGDIALV---ADTATSVDWRKKGGVTEIKDQGDCGSCWAFSAV 151

Query: 147 AAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSY 206
           AAVEG+  +    LVSLSEQ+LVDC T   N GC GG MD AF+Y+I+N GIT+ + Y Y
Sbjct: 152 AAVEGLTFLSTGTLVSLSEQELVDCDTT-VNQGCDGGIMDYAFQYMIRNGGITSQSNYPY 210

Query: 207 EGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA--LQFYSGG 264
             +  G CD  K + HAA I  ++ +PP  EE LL+AVANQPVSVAI+A     Q YS G
Sbjct: 211 RALR-GACDKDKVKYHAATINGFQAIPPQSEELLLRAVANQPVSVAIEAGGQDFQLYSSG 269

Query: 265 VFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGI 324
           VF G C + L+HGV  VGYGT   G +YWL+KNSWG  WGE GY R++R      G CGI
Sbjct: 270 VFTGECGSNLDHGVAIVGYGTDAGGRQYWLVKNSWGSGWGESGYVRMERQ-GPGAGVCGI 328

Query: 325 AMFASFPV 332
            + AS+P 
Sbjct: 329 NLDASYPT 336


>gi|18413505|ref|NP_567376.1| putative cysteine proteinase [Arabidopsis thaliana]
 gi|30315954|sp|Q9SUT0.1|CPR3_ARATH RecName: Full=Probable cysteine proteinase At4g11310; Flags:
           Precursor
 gi|5596477|emb|CAB51415.1| drought-inducible cysteine proteinase RD21A precursor-like protein
           [Arabidopsis thaliana]
 gi|7267830|emb|CAB81232.1| drought-inducible cysteine proteinase RD21A precursor-like protein
           [Arabidopsis thaliana]
 gi|332657595|gb|AEE82995.1| putative cysteine proteinase [Arabidopsis thaliana]
          Length = 364

 Score =  247 bits (631), Expect = 5e-63,   Method: Compositional matrix adjust.
 Identities = 155/372 (41%), Positives = 210/372 (56%), Gaps = 43/372 (11%)

Query: 2   AKYFLIVVLIISGSCAS------------QATYRTFD-EGSIAEKFEQWKAQYGRTYKES 48
           A   L+V ++I+ SCA+               +  FD E S+   FE W  ++G+ Y   
Sbjct: 7   AMLILLVAMVIA-SCATAIDMSVVSYDDNNRLHSVFDAEASLI--FESWMVKHGKVYGSV 63

Query: 49  AENSKRFEIFKDNLVAVERF-NNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSS 107
           AE  +R  IF+DNL    RF NN    N SY L L  FADL+  E+     G       +
Sbjct: 64  AEKERRLTIFEDNL----RFINNRNAENLSYRLGLTGFADLSLHEYKEVCHGADPRPPRN 119

Query: 108 SLKANGTPFLYKSSQ---VPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKI 157
            +    +   YK+S    +P SV+W  +GAVT VK QG C        V AVEG+N I  
Sbjct: 120 HVFMTSSD-RYKTSADDVLPKSVDWRNEGAVTEVKDQGHCRSCWAFSTVGAVEGLNKIVT 178

Query: 158 NRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDS- 216
             LV+LSEQ L++C  N  NNGC GG ++ A+++I++N G+  D  Y Y+ ++ G+CD  
Sbjct: 179 GELVTLSEQDLINC--NKENNGCGGGKLETAYEFIMKNGGLGTDNDYPYKAVN-GVCDGR 235

Query: 217 IKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA--LQFYSGGVFNGYCETFL 274
           +K  +    I  YE++P NDE +L+KAVA+QPV+  ID+S+   Q Y  GVF+G C T L
Sbjct: 236 LKENNKNVMIDGYENLPANDESALMKAVAHQPVTAVIDSSSREFQLYESGVFDGSCGTNL 295

Query: 275 NHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVSK 334
           NHGV  VGYGT E G  YWL+KNS G  WGE GY ++ R+I  P+G CGIAM AS+P+  
Sbjct: 296 NHGVVVVGYGT-ENGRDYWLVKNSRGITWGEAGYMKMARNIANPRGLCGIAMRASYPLK- 353

Query: 335 ESAQPSSADKSS 346
                 S DKSS
Sbjct: 354 ---NSFSTDKSS 362


>gi|224085750|ref|XP_002307688.1| predicted protein [Populus trichocarpa]
 gi|222857137|gb|EEE94684.1| predicted protein [Populus trichocarpa]
          Length = 436

 Score =  247 bits (631), Expect = 6e-63,   Method: Compositional matrix adjust.
 Identities = 137/308 (44%), Positives = 176/308 (57%), Gaps = 16/308 (5%)

Query: 34  FEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEF 93
           FE W  ++G++Y    E S R ++F+DN   V + N+   GN SY+L LN FADLT  EF
Sbjct: 29  FETWCKEHGKSYTSQEERSHRLKVFEDNYDFVTKHNSK--GNSSYSLALNAFADLTHHEF 86

Query: 94  IASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQC-------AV 146
             S+ G  +S    +L             +P S++W  KG VT VK QG C       A 
Sbjct: 87  KTSRLG--LSAAPLNLAHRNLEITGVVGDIPASIDWRNKGVVTNVKDQGSCGACWSFSAT 144

Query: 147 AAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSY 206
            A+EGIN I    LVSLSEQ+L++C     N+GC GG MD AF+++I N GI  +  Y Y
Sbjct: 145 GAIEGINKIVTGSLVSLSEQELIEC-DKSYNDGCGGGLMDYAFQFVINNHGIDTEEDYPY 203

Query: 207 EGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDAS--ALQFYSGG 264
                G C+  + +     I  Y DVP N+E+ LL+AVA QPVSV I  S  A Q YS G
Sbjct: 204 RARD-GTCNKDRMKRRVVTIDKYVDVPENNEKQLLQAVAAQPVSVGICGSERAFQMYSKG 262

Query: 265 VFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGI 324
           +F G C T L+H V  VGYG SE G+ YW++KNSWG  WG  GY  +QR+    QG CGI
Sbjct: 263 IFTGPCSTSLDHAVLIVGYG-SENGVDYWIVKNSWGTGWGMRGYMHMQRNSGNSQGVCGI 321

Query: 325 AMFASFPV 332
            M AS+PV
Sbjct: 322 NMLASYPV 329


>gi|32396020|gb|AAP41847.1| senescence-associated cysteine protease [Anthurium andraeanum]
          Length = 460

 Score =  247 bits (631), Expect = 6e-63,   Method: Compositional matrix adjust.
 Identities = 139/315 (44%), Positives = 182/315 (57%), Gaps = 18/315 (5%)

Query: 34  FEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEF 93
           +E W    G+ Y    E  +RFEIF DNL  ++  N A   N SYTL L +FADLT +E+
Sbjct: 38  YEGWLVGNGKAYNLLGEKERRFEIFWDNLRYIDDHNRAE-NNHSYTLGLTRFADLTNEEY 96

Query: 94  IASQTGFK----MSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQCA---- 145
            ++  G K        ++     G         +P  V+W EKGAV P+K QG C     
Sbjct: 97  RSTYLGVKPGQVRPRRANRAPGRGRDLSANGDDLPQKVDWREKGAVAPIKDQGGCGSCWA 156

Query: 146 ---VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDA 202
              VAAVEGIN I    L+ LSEQ+LVDC T   N GC GG MD AF++II N GI  + 
Sbjct: 157 FSTVAAVEGINQIVTGDLIVLSEQELVDCDTA-YNEGCNGGLMDYAFQFIISNGGIDTEE 215

Query: 203 VYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDAS--ALQF 260
            Y Y+    G+CD  +       I +YEDV  NDE +L  AVA+QPVSVAI+    + Q 
Sbjct: 216 DYPYK-ERDGLCDPNRKNAKVVSIDSYEDVLENDEHALKTAVAHQPVSVAIEGGGRSFQL 274

Query: 261 YSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDI-DQPQ 319
           Y  G+F+G C   L+HGV AVGYGT E G  YW+++NSWG+ WGE GY R++R++     
Sbjct: 275 YKSGIFDGRCGIDLDHGVVAVGYGT-ESGKDYWIVRNSWGKSWGEAGYIRMERNLPSSSS 333

Query: 320 GQCGIAMFASFPVSK 334
           G+CGIA+  S+P+ K
Sbjct: 334 GKCGIAIEPSYPIKK 348


>gi|162459488|ref|NP_001105571.1| maize insect resistance1 precursor [Zea mays]
 gi|5731354|gb|AAB70820.2| cysteine protease Mir1 [Zea mays]
          Length = 398

 Score =  247 bits (630), Expect = 6e-63,   Method: Compositional matrix adjust.
 Identities = 143/335 (42%), Positives = 201/335 (60%), Gaps = 31/335 (9%)

Query: 30  IAEKFEQWKAQYGR--------------TYKESAENSKRFEIFKDNLVAVERFN-NAAIG 74
           +   +E WK+++GR                +E  +   R E+F+DNL  ++  N  A  G
Sbjct: 50  VRRMYEAWKSKHGRGGSSNDDCDMAPGDDEQEEEDRRLRLEVFRDNLRYIDAHNAEADAG 109

Query: 75  NRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGA 134
             ++ L L  FADLT +E+     GF+      S    G+ +  +   +P +++W + GA
Sbjct: 110 LHTFRLGLTPFADLTLEEYRGRVLGFRAR-GRRSGARYGSGYSVRGGDLPDAIDWRQLGA 168

Query: 135 VTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDD 187
           VT VK Q QC       AVAA+EG+NAI    LVSLSEQ+++DC   D+  GC GG M++
Sbjct: 169 VTEVKDQQQCGGCWAFSAVAAIEGVNAIATGNLVSLSEQEIIDCDAQDS--GCDGGQMEN 226

Query: 188 AFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDH-AAQITNYEDVPPNDEESLLKAVAN 246
           AF+++I N GI  +A Y + G + G CD+ K ++   A I    +V  N+E +L +AVA 
Sbjct: 227 AFRFVIGNGGIDTEADYPFIG-TDGTCDASKEKNEKVATIDGLVEVASNNETALQEAVAI 285

Query: 247 QPVSVAIDAS--ALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWG 304
           QPVSVAIDAS  A Q YS G+FNG C T L+HGVTAVGYG SE G  YW++KNSW   WG
Sbjct: 286 QPVSVAIDASGRAFQHYSSGIFNGPCGTSLDHGVTAVGYG-SESGKDYWIVKNSWSASWG 344

Query: 305 EDGYFRLQRDIDQPQGQCGIAMFASFPVSKESAQP 339
           E GY R++R++ +P G+CGIAM AS+PV K++  P
Sbjct: 345 EAGYIRMRRNVPRPTGKCGIAMDASYPV-KDTYHP 378


>gi|147769019|emb|CAN62459.1| hypothetical protein VITISV_015168 [Vitis vinifera]
          Length = 246

 Score =  247 bits (630), Expect = 8e-63,   Method: Compositional matrix adjust.
 Identities = 134/269 (49%), Positives = 175/269 (65%), Gaps = 34/269 (12%)

Query: 75  NRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKS-SQVPPSVNWIEKG 133
           ++SY L +N+FADLT +EF  S+  FK   H  S +A  T F Y++ + VP + +W +KG
Sbjct: 2   DKSYKLSINEFADLTNEEFGTSRNRFKA--HICSTEA--TSFKYENVTAVPSTXDWRKKG 57

Query: 134 AVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMD 186
           AVTP+K QGQC       AVAA+EGI  +   +L+SLSEQ+LVDC T+  + GC G    
Sbjct: 58  AVTPIKDQGQCGSCWAFSAVAAMEGITQLSTGKLISLSEQELVDCDTSGEDQGCXG---- 113

Query: 187 DAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVAN 246
                          A Y Y G + G C+  KA   AA+I  YEDVP N+E++L KAVA+
Sbjct: 114 ---------------ANYPYAG-TDGTCNRKKAAHPAAKINGYEDVPANNEKALQKAVAH 157

Query: 247 QPVSVAIDASA--LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWG 304
           QP++VAIDA     QFYS GVF G C T L+HGV AVGYGTS++G+KYWL+KNSWG  WG
Sbjct: 158 QPIAVAIDAGGXEFQFYSSGVFTGQCGTELDHGVXAVGYGTSDDGMKYWLVKNSWGTGWG 217

Query: 305 EDGYFRLQRDIDQPQGQCGIAMFASFPVS 333
           E+GY R+QRD+   +G CGIAM AS+P +
Sbjct: 218 EEGYIRMQRDVTAKEGLCGIAMQASYPTA 246


>gi|146215976|gb|ABQ10190.1| actinidin Act1b [Actinidia arguta]
          Length = 380

 Score =  246 bits (629), Expect = 8e-63,   Method: Compositional matrix adjust.
 Identities = 148/343 (43%), Positives = 197/343 (57%), Gaps = 21/343 (6%)

Query: 1   MAKYFLIVVLIISGSC-ASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFK 59
           M+  F   +L++S +  A   T RT DE  +   +E W  +YG++Y    E  +RFEIFK
Sbjct: 10  MSLLFFSTLLVLSLAFNAKNLTKRTNDE--LKAMYESWLTKYGKSYNSLGEWERRFEIFK 67

Query: 60  DNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYK 119
           + L  ++  N  A  NRSY + LN+FAD T +EF ++  GF    +   +     P   +
Sbjct: 68  ETLRFIDEHN--ADTNRSYRVGLNQFADQTNEEFQSTYLGFTSGSNKMKVSNRYEP---R 122

Query: 120 SSQVPPS-VNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDC 171
             QV P  V+W   GAV  +K QGQC       A+A VEGIN I    L+SLSEQ+LVDC
Sbjct: 123 VGQVLPDYVDWRSAGAVVDIKSQGQCGSCWAFSAIATVEGINKIVTGDLISLSEQELVDC 182

Query: 172 ATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYED 231
               N  GC GG + D F++II N GI  +A Y Y     G C+     +  A I  YE+
Sbjct: 183 GRTQNTRGCDGGSITDGFQFIINNGGINTEANYPYTA-EDGQCNLDLQNEKYASIDTYEN 241

Query: 232 VPPNDEESLLKAVANQPVSVAIDAS--ALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEG 289
           VP N+E +L  AVA QPVSVA++A+  A Q YS G+F G C T ++H VT VGYGT E G
Sbjct: 242 VPYNNEWALQTAVAYQPVSVALEAAGDAFQHYSSGIFTGPCGTAVDHAVTIVGYGT-EGG 300

Query: 290 IKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
           I YW++KNSW   WGE+GY R+ R++    G CGIA   S+PV
Sbjct: 301 IDYWIVKNSWDTTWGEEGYIRILRNVGGA-GTCGIATKPSYPV 342


>gi|42567068|ref|NP_567686.2| putative cysteine proteinase [Arabidopsis thaliana]
 gi|332659371|gb|AEE84771.1| putative cysteine proteinase [Arabidopsis thaliana]
          Length = 356

 Score =  246 bits (629), Expect = 8e-63,   Method: Compositional matrix adjust.
 Identities = 144/356 (40%), Positives = 210/356 (58%), Gaps = 29/356 (8%)

Query: 1   MAKYFLIVVLIISGSCA-------SQATYRTFDEGSIAEKFEQWKAQYGRTYKES-AENS 52
           M   FL++V ++S   +       S    R+ +E  +   F+ W +++G+TY  +  E  
Sbjct: 9   MTILFLLIVFVLSAPSSAMDLPATSGGHNRSNEE--VEFIFQMWMSKHGKTYTNALGEKE 66

Query: 53  KRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKAN 112
           +RF+ FKDNL  +++ N     N SY L L +FADLT QE+     G        +LK +
Sbjct: 67  RRFQNFKDNLRFIDQHNAK---NLSYQLGLTRFADLTVQEYRDLFPG-SPKPKQRNLKTS 122

Query: 113 GTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSE 165
                    Q+P SV+W ++GAV+ +K QG C        VAAVEG+N I    L+SLSE
Sbjct: 123 RRYVPLAGDQLPESVDWRQEGAVSEIKDQGTCNSCWAFSTVAAVEGLNKIVTGELISLSE 182

Query: 166 QQLVDCATNDNNNGCYG-GFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKA-EDHA 223
           Q+LVDC  N  NNGCYG G MD AF+++I N G+ ++  Y Y+G + G C+  ++  +  
Sbjct: 183 QELVDC--NLVNNGCYGSGLMDTAFQFLINNNGLDSEKDYPYQG-TQGSCNRKQSTSNKV 239

Query: 224 AQITNYEDVPPNDEESLLKAVANQPVSVAIDASALQF--YSGGVFNGYCETFLNHGVTAV 281
             I +YEDVP NDE SL KAVA+QPVSV +D  + +F  Y   ++NG C T L+H +  V
Sbjct: 240 ITIDSYEDVPANDEISLQKAVAHQPVSVGVDKKSQEFMLYRSCIYNGPCGTNLDHALVIV 299

Query: 282 GYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVSKESA 337
           GYG SE G  YW+++NSWG  WG+ GY ++ R+ + P+G CGIAM AS+P+   ++
Sbjct: 300 GYG-SENGQDYWIVRNSWGTTWGDAGYIKIARNFEDPKGLCGIAMLASYPIKNSAS 354


>gi|3377950|emb|CAA08861.1| cysteine proteinase precursor, AN11 [Ananas comosus]
          Length = 357

 Score =  246 bits (629), Expect = 1e-62,   Method: Compositional matrix adjust.
 Identities = 140/346 (40%), Positives = 201/346 (58%), Gaps = 25/346 (7%)

Query: 5   FLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVA 64
           FL + L +  +  S A+ R      + ++FE+W A+YGR YK++ E  +RF+IFK+N+  
Sbjct: 9   FLFLFLCVMWASPSAAS-RDEPSDPMMKRFEEWMAEYGRVYKDNDEKMRRFQIFKNNVNH 67

Query: 65  VERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFL----YKS 120
           +E FN+   GN SYTL +N+F D+T  EF+A  TG  +      L     P +       
Sbjct: 68  IETFNSRN-GN-SYTLGINQFTDMTNNEFVAQYTGVSLP-----LNIEREPVVSFDDVDI 120

Query: 121 SQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCAT 173
           S VP S++W   GAVT VK    C       A+A VE I  IK   L+SLSEQQ++DCA 
Sbjct: 121 SAVPQSIDWRNYGAVTSVKNHIPCGSCWAFAAIATVESIYKIKRGYLISLSEQQVLDCAV 180

Query: 174 NDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEG-MSTGICDSIKAEDHAAQITNYEDV 232
           +    GC GG+++ A+ +II NKG+ + A+Y Y+     G C  I    ++A IT Y  V
Sbjct: 181 S---YGCDGGWVNKAYDFIISNKGVASAAIYPYKASQGQGTC-RINGVPNSAYITGYTRV 236

Query: 233 PPNDEESLLKAVANQPVSVAIDASA-LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIK 291
             N+E S++ AV+NQP++ +I+AS   Q Y  GVF+G C T LNH +T +GYG    G K
Sbjct: 237 QSNNERSMMYAVSNQPIAASIEASGDFQHYKRGVFSGPCGTSLNHAITIIGYGQDSSGKK 296

Query: 292 YWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVSKESA 337
           +W+++NSWG  WGE GY R+ RD+    G CGIA+   +P  +  A
Sbjct: 297 FWIVRNSWGASWGERGYIRMARDVSSSSGLCGIAIRPLYPTLQSGA 342


>gi|20260334|gb|AAM13065.1| drought-inducible cysteine proteinase RD21A precursor-like protein
           [Arabidopsis thaliana]
 gi|23197782|gb|AAN15418.1| drought-inducible cysteine proteinase RD21A precursor-like protein
           [Arabidopsis thaliana]
          Length = 357

 Score =  246 bits (628), Expect = 1e-62,   Method: Compositional matrix adjust.
 Identities = 152/369 (41%), Positives = 208/369 (56%), Gaps = 42/369 (11%)

Query: 5   FLIVVLIISGSCAS------------QATYRTFD-EGSIAEKFEQWKAQYGRTYKESAEN 51
            +++V ++  SCA+               +  FD E S+   FE W  ++G+ Y   AE 
Sbjct: 2   LILLVAMVIASCATAIDMSVVSYDDNNRLHSVFDAEASLI--FESWMVKHGKVYGSVAEK 59

Query: 52  SKRFEIFKDNLVAVERF-NNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLK 110
            +R  IF+DNL    RF NN    N SY L L  FADL+  E+     G       + + 
Sbjct: 60  ERRLTIFEDNL----RFINNRNAENLSYRLGLTGFADLSLHEYKEVCHGADPRPPRNHVF 115

Query: 111 ANGTPFLYKSSQ---VPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRL 160
              +   YK+S    +P SV+W  +GAVT VK QG C        V AVEG+N I    L
Sbjct: 116 MTSSD-RYKTSADDVLPKSVDWRNEGAVTEVKDQGHCRSCWAFSTVGAVEGLNKIVTGEL 174

Query: 161 VSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDS-IKA 219
           V+LSEQ L++C  N  NNGC GG ++ A+++I++N G+  D  Y Y+ ++ G+CD  +K 
Sbjct: 175 VTLSEQDLINC--NKENNGCGGGKLETAYEFIMKNGGLGTDNDYPYKAVN-GVCDGRLKE 231

Query: 220 EDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA--LQFYSGGVFNGYCETFLNHG 277
            +    I  YE++P NDE +L+KAVA+QPV+  ID+S+   Q Y  GVF+G C T LNHG
Sbjct: 232 NNKNVMIDGYENLPANDESALMKAVAHQPVTAVIDSSSREFQLYESGVFDGSCGTNLNHG 291

Query: 278 VTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVSKESA 337
           V  VGYGT E G  YWL+KNS G  WGE GY ++ R+I  P+G CGIAM AS+P+     
Sbjct: 292 VVVVGYGT-ENGRDYWLVKNSRGITWGEAGYMKMARNIANPRGLCGIAMRASYPLK---- 346

Query: 338 QPSSADKSS 346
              S DKSS
Sbjct: 347 NSFSTDKSS 355


>gi|312451845|gb|ADQ85986.1| actinidin [Actinidia chinensis]
          Length = 380

 Score =  246 bits (628), Expect = 1e-62,   Method: Compositional matrix adjust.
 Identities = 145/343 (42%), Positives = 196/343 (57%), Gaps = 21/343 (6%)

Query: 1   MAKYFLIVVLIISGSC-ASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFK 59
           M+  F   +LI+S +  A   T RT DE  +   +E W  +YG++Y    E  +RFEIFK
Sbjct: 10  MSLLFFSTLLILSLAFNAKNLTQRTNDE--VKAMYESWLIKYGKSYNSLGEWERRFEIFK 67

Query: 60  DNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYK 119
           + L  ++  N  A  NRSY + LN+FADLT +EF ++  GF    + + +     P   +
Sbjct: 68  ETLRFIDEHN--ADTNRSYKVGLNQFADLTDEEFRSTYLGFTSGSNKTKVSNRYEP---R 122

Query: 120 SSQVPPS-VNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDC 171
             QV PS V+W   GAV  +K QG+C       A+A VEGIN I    L+SLSEQ+L+DC
Sbjct: 123 VGQVLPSYVDWRSAGAVVDIKSQGECGGCWAFSAIATVEGINKIVTGVLISLSEQELIDC 182

Query: 172 ATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYED 231
               N  GC G ++ D F +II N GI  +  Y Y     G C+     +    I  YE+
Sbjct: 183 GRTQNTRGCNGSYITDGFPFIINNGGINTEENYPYTAQD-GECNVDLQNEKYVTIDTYEN 241

Query: 232 VPPNDEESLLKAVANQPVSVAIDAS--ALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEG 289
           VP N+E +L  AV  QPVSVA+DA+  A + YS G+F G C T ++H VT VGYGT E G
Sbjct: 242 VPYNNEWALQTAVTYQPVSVALDAAGDAFKQYSSGIFTGPCGTAIDHAVTIVGYGT-EGG 300

Query: 290 IKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
           I YW++KNSW   WGE+GY R+ R++    G CGIA   S+PV
Sbjct: 301 IDYWIVKNSWDTTWGEEGYMRILRNVGG-AGTCGIATMPSYPV 342


>gi|413953665|gb|AFW86314.1| hypothetical protein ZEAMMB73_546353 [Zea mays]
          Length = 233

 Score =  246 bits (628), Expect = 1e-62,   Method: Compositional matrix adjust.
 Identities = 121/231 (52%), Positives = 157/231 (67%), Gaps = 15/231 (6%)

Query: 114 TPFLYK---SSQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSL 163
           T F Y+   +  +P +++W  KGAVTP+K QGQC       AVAA EGI  I   +LVSL
Sbjct: 5   TGFRYENVSADALPTTIDWRTKGAVTPIKDQGQCGCCWAFSAVAATEGIVKISTGKLVSL 64

Query: 164 SEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHA 223
           +EQ+LVDC  +D + GC GG MDDAFK+II+N G+T ++ Y Y   + G C S    + A
Sbjct: 65  AEQELVDCDVHDEDQGCEGGLMDDAFKFIIKNGGLTTESSYPYTA-ADGKCKS--GSNSA 121

Query: 224 AQITNYEDVPPNDEESLLKAVANQPVSVAIDASAL--QFYSGGVFNGYCETFLNHGVTAV 281
           A I  YEDVP NDE +L+KAVANQPVSVA+D   +  QFYSGGV  G C T L+HG+ A+
Sbjct: 122 ATIKGYEDVPANDEAALMKAVANQPVSVAVDGGDMTFQFYSGGVMTGSCGTDLDHGIAAI 181

Query: 282 GYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
           GYG + +G KYWL+KNSWG  WGE+GY R+++DI   +G CG+AM  S+P 
Sbjct: 182 GYGKTSDGTKYWLMKNSWGTTWGENGYLRMEKDISDKRGMCGLAMEPSYPT 232


>gi|356517398|ref|XP_003527374.1| PREDICTED: LOW QUALITY PROTEIN: vignain-like [Glycine max]
          Length = 333

 Score =  246 bits (628), Expect = 1e-62,   Method: Compositional matrix adjust.
 Identities = 150/343 (43%), Positives = 207/343 (60%), Gaps = 35/343 (10%)

Query: 4   YFLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLV 63
           +    +L+     A Q T RT  + S+ E+ EQ   +Y + YK+  E+      F  N+ 
Sbjct: 9   HIAFAMLLCMAFLAFQVTCRTLQDASMYERHEQRMTRYSKVYKDPPES------FXGNVN 62

Query: 64  AVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKS-SQ 122
            +E  NNAA  ++ Y   +N+F    P+         +   H  S     T F +++ + 
Sbjct: 63  YIEACNNAA--DKPYKXGINQFP---PRN--------RFKGHMCSSIIRITTFKFENVTA 109

Query: 123 VPPSVNWIEKGAVTP--VKYQGQC-------AVAAVEGINAIKINRLVSLS-EQQLVDCA 172
            P +V+  +KGAVTP  VK QGQC       AVAA EGI+A+   +L+ LS E +LVDC 
Sbjct: 110 TPSTVDCRQKGAVTPYTVKDQGQCGCFWALSAVAATEGIHALXAGKLILLSXEPELVDCD 169

Query: 173 TNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQI-TNYED 231
           T   + GC GG  DDAFK+IIQN G+  +A Y Y+G+  G C++ +A+ +AA I T Y+D
Sbjct: 170 TKGVDQGCEGGLTDDAFKFIIQNHGLNTEANYPYKGVD-GKCNANEADKNAATIITGYDD 228

Query: 232 VPPNDEESLL-KAVANQPVSVAIDASA--LQFYSGGVFNGYCETFLNHGVTAVGYGTSEE 288
           VP N+E++ L KAVAN PVSVAIDAS    QFY  GVF G C T L+HGVTAVGYG S++
Sbjct: 229 VPANNEKAHLQKAVANNPVSVAIDASGSDFQFYKSGVFTGSCGTELDHGVTAVGYGVSDD 288

Query: 289 GIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFP 331
           G +YWL+KNS G +WGE+GY R+QR +D  +  CGIA+ AS+P
Sbjct: 289 GTEYWLVKNSRGPEWGEEGYIRMQRGVDSEEALCGIAVQASYP 331


>gi|190358935|sp|P00785.4|ACTN_ACTCH RecName: Full=Actinidain; Short=Actinidin; AltName: Allergen=Act c
           1; Flags: Precursor
 gi|12744965|gb|AAK06862.1|AF343446_1 actinidin protease [Actinidia chinensis]
          Length = 380

 Score =  246 bits (627), Expect = 2e-62,   Method: Compositional matrix adjust.
 Identities = 145/343 (42%), Positives = 197/343 (57%), Gaps = 21/343 (6%)

Query: 1   MAKYFLIVVLIISGSC-ASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFK 59
           M+  F   +LI+S +  A   T RT DE  +   +E W  +YG++Y    E  +RFEIFK
Sbjct: 10  MSLLFFSTLLILSLAFNAKNLTQRTNDE--VKAMYESWLIKYGKSYNSLGEWERRFEIFK 67

Query: 60  DNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYK 119
           + L  ++  N  A  NRSY + LN+FADLT +EF ++   F    + + +     P   +
Sbjct: 68  ETLRFIDEHN--ADTNRSYKVGLNQFADLTDEEFRSTYLRFTSGSNKTKVSNRYEP---R 122

Query: 120 SSQVPPS-VNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDC 171
             QV PS V+W   GAV  +K QG+C       A+A VEGIN I    L+SLSEQ+L+DC
Sbjct: 123 VGQVLPSYVDWRSAGAVVDIKSQGECGGCWAFSAIATVEGINKIVTGVLISLSEQELIDC 182

Query: 172 ATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYED 231
               N  GC GG++ D F++II N GI  +  Y Y     G C+     +    I  YE+
Sbjct: 183 GRTQNTRGCNGGYITDGFQFIINNGGINTEENYPYTAQD-GECNVDLQNEKYVTIDTYEN 241

Query: 232 VPPNDEESLLKAVANQPVSVAIDAS--ALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEG 289
           VP N+E +L  AV  QPVSVA+DA+  A + YS G+F G C T ++H VT VGYGT E G
Sbjct: 242 VPYNNEWALQTAVTYQPVSVALDAAGDAFKQYSSGIFTGPCGTAVDHAVTIVGYGT-EGG 300

Query: 290 IKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
           I YW++KNSW   WGE+GY R+ R++    G CGIA   S+PV
Sbjct: 301 IDYWIVKNSWDTTWGEEGYMRILRNVGG-AGTCGIATMPSYPV 342


>gi|449469929|ref|XP_004152671.1| PREDICTED: cysteine proteinase RD21a-like [Cucumis sativus]
 gi|449529596|ref|XP_004171784.1| PREDICTED: cysteine proteinase RD21a-like [Cucumis sativus]
          Length = 431

 Score =  246 bits (627), Expect = 2e-62,   Method: Compositional matrix adjust.
 Identities = 140/314 (44%), Positives = 183/314 (58%), Gaps = 24/314 (7%)

Query: 30  IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLT 89
           ++E FE W  ++G++Y  + E   R  +F DN   V   NN  + N SYTL LN +ADLT
Sbjct: 25  VSELFEIWCTEHGKSYSSAEEKLYRLGVFADNYEFVTHHNN--LDNSSYTLSLNSYADLT 82

Query: 90  PQEFIASQTGFKMSDHSSSLKANGTPFLYKS----SQVPPSVNWIEKGAVTPVKYQGQC- 144
             EF  S+ GF     S +L+ N  P L +       VP S++W +KGAVT VK QG C 
Sbjct: 83  HHEFKVSRLGF-----SPALR-NFRPVLPQEPSLPRDVPDSLDWRKKGAVTAVKDQGSCG 136

Query: 145 ------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGI 198
                 A  A+EGIN I    L+SLSEQ+L+DC     N+GC GG MD A++++I N GI
Sbjct: 137 ACWSFSATGAMEGINQIMTGSLISLSEQELIDC-DRSYNSGCGGGLMDYAYQFVISNHGI 195

Query: 199 TNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDAS-- 256
             +  Y Y+    G C   K + +   I  Y D+P NDE  LL+AVA QPVSV I  S  
Sbjct: 196 DTENDYPYQARD-GSCRKDKLQRNVVTIDGYADIPSNDEGKLLQAVAAQPVSVGICGSER 254

Query: 257 ALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDID 316
           A Q YS G+F+G C T L+H V  VGYG SE G+ YW++KNSWG+ WG DGY  +QR+  
Sbjct: 255 AFQLYSKGIFSGPCSTSLDHAVLIVGYG-SENGVDYWIVKNSWGKSWGMDGYMHMQRNSG 313

Query: 317 QPQGQCGIAMFASF 330
             +G CGI   AS+
Sbjct: 314 NSEGVCGINKLASY 327


>gi|356515062|ref|XP_003526220.1| PREDICTED: LOW QUALITY PROTEIN: vignain-like [Glycine max]
          Length = 337

 Score =  246 bits (627), Expect = 2e-62,   Method: Compositional matrix adjust.
 Identities = 131/311 (42%), Positives = 193/311 (62%), Gaps = 18/311 (5%)

Query: 35  EQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFI 94
           E+W AQ+G+ YK++AE  +  +IF++N+  +E F+    G++S+ L  N+FADL  +EF 
Sbjct: 33  EKWMAQHGKVYKDAAEKERCLQIFENNMEFIESFD--VCGDKSFNLSTNQFADLHDEEFK 90

Query: 95  ASQTGFKMSDHSSSLKANGTPFLYKS-SQVPPSVNWIEKGAVTPVKYQGQC--------A 145
           A  T     +HS       T F Y + +++P S++W ++G VTP+K QG+C         
Sbjct: 91  ALLTNGHKKEHSL-WTTTETLFRYDNVTKIPASMDWRKRGVVTPIKDQGKCLSCWAFSLC 149

Query: 146 VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYS 205
           VA +EG++ I  + LV LSEQ+LVD    ++  GCYG +++DAFK+I +   I ++  Y 
Sbjct: 150 VATIEGLHQIITSELVPLSEQELVDFVKGESE-GCYGDYVEDAFKFITKKGRIESETHYP 208

Query: 206 YEGMSTGICDSIKAEDH-AAQITNYEDVPPNDEESLLKAVANQPVSVAIDA--SALQFYS 262
           Y+G++      +K E H  AQI  Y+ VP   E +LLKAVANQ VSV+++A  SA QFYS
Sbjct: 209 YKGVNNTC--KVKKETHGVAQIKGYKKVPSKSENALLKAVANQLVSVSVEARDSAFQFYS 266

Query: 263 GGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQC 322
            G+F G C T  +H V    YG S +G KYWL KNSWG +WGE GY R++ DI   +G C
Sbjct: 267 SGIFTGKCGTDTDHRVALASYGESGDGTKYWLAKNSWGTEWGEKGYIRIKXDIPAKEGLC 326

Query: 323 GIAMFASFPVS 333
           GIA +  +P++
Sbjct: 327 GIAKYPYYPIA 337


>gi|641905|gb|AAC49406.1| cysteine proteinase [Zinnia violacea]
          Length = 342

 Score =  245 bits (626), Expect = 2e-62,   Method: Compositional matrix adjust.
 Identities = 140/301 (46%), Positives = 183/301 (60%), Gaps = 18/301 (5%)

Query: 30  IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLT 89
           +   FE    ++ + Y+   E   RFEIF DNL  ++  N       +Y L LN+FADLT
Sbjct: 45  VIHLFESSLVKHSKIYESFDEKLHRFEIFMDNLKHIDETNKKV---SNYWLGLNEFADLT 101

Query: 90  PQEFIASQTGFKMSDHSSSLKANGTPFLYKS-SQVPPSVNWIEKGAVTPVKYQGQCA--- 145
            +EF     GFK  + +     +   F Y+    +P SV+W +KGAV+PVK QGQC    
Sbjct: 102 HEEFKNKFLGFK-GELAERKDESIEQFRYRDFVDLPKSVDWRKKGAVSPVKNQGQCGSCW 160

Query: 146 ----VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITND 201
               VAAVEGIN I    L  LSEQ+L+DC T   NNGC GG MD AF Y+ +N G+  +
Sbjct: 161 AFSTVAAVEGINQIVTGNLTVLSEQELIDCDTT-FNNGCNGGLMDYAFAYVTRN-GLHKE 218

Query: 202 AVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA--LQ 259
             Y Y  MS G CD  +       I+ Y DVP N+E+S LKA+ANQP+SVAI+AS    Q
Sbjct: 219 EEYPYI-MSEGTCDEKRDASEKVTISGYHDVPRNNEDSFLKALANQPISVAIEASGRDFQ 277

Query: 260 FYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQ 319
           FYSGGVF+G+C T L+HGV AVGYGTS +G+ Y +++NSWG  WGE GY R++R+  +P 
Sbjct: 278 FYSGGVFDGHCGTELDHGVAAVGYGTS-KGLDYVIVRNSWGPKWGEKGYIRMKRNTGKPM 336

Query: 320 G 320
           G
Sbjct: 337 G 337


>gi|413938554|gb|AFW73105.1| hypothetical protein ZEAMMB73_931917 [Zea mays]
          Length = 361

 Score =  245 bits (625), Expect = 2e-62,   Method: Compositional matrix adjust.
 Identities = 144/320 (45%), Positives = 190/320 (59%), Gaps = 31/320 (9%)

Query: 34  FEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEF 93
           F  W  ++G+ Y    E  +R+EIFK NL+ +   N     N SY L LN+FAD+  +EF
Sbjct: 44  FRSWSVKHGKLYASPTEKLERYEIFKQNLMHIAETNRK---NGSYWLGLNQFADVAHEEF 100

Query: 94  IASQTGFKMSDHSSSLKANGTPFLYK-----SSQVPPSVNWIEKGAVTPVKYQGQC---- 144
            AS  G K +   +      TP  ++     +  +P SV+W  KGAVTPVK QG+C    
Sbjct: 101 KASYLGLKRALPRAGAPQTRTPTAFRYAAAAAGSLPWSVDWRYKGAVTPVKNQGKCGSCW 160

Query: 145 ---AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITND 201
              +VAAVEGIN I   +LVSLSEQ+LVDC T   ++GC GG MD AF Y++ ++GI  +
Sbjct: 161 AFSSVAAVEGINQIVTGKLVSLSEQELVDCDTT-LDHGCEGGTMDLAFAYMMGSQGIHAE 219

Query: 202 AVYSYEGMSTGICD-------SIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAID 254
             Y Y  M  G C         I  +D    +T +EDVP N E SLLKA+A+QPVSV I 
Sbjct: 220 DDYPYL-MEEGYCKEKQPCVLGITEQD----LTGFEDVPENSEISLLKALAHQPVSVGIA 274

Query: 255 ASA--LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQ 312
           A +   QFY GGVF+G C   L+H +TAVGYG+S  G  Y  +KNSWG++WGE GY R++
Sbjct: 275 AGSRDFQFYRGGVFDGACSVELDHALTAVGYGSSY-GQNYITMKNSWGKNWGEQGYVRIK 333

Query: 313 RDIDQPQGQCGIAMFASFPV 332
               +P+G CGI   AS+PV
Sbjct: 334 MGTGKPEGVCGIYTMASYPV 353


>gi|326502440|dbj|BAJ95283.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 349

 Score =  245 bits (625), Expect = 2e-62,   Method: Compositional matrix adjust.
 Identities = 141/320 (44%), Positives = 201/320 (62%), Gaps = 21/320 (6%)

Query: 27  EGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFA 86
           + ++  + E+W A++GRTY    E ++R E+F+ N   ++ FN+A   + ++ L  N+FA
Sbjct: 37  DSAMVSRHEKWMAEHGRTYANEEEKARRLEVFRANAKLIDSFNSAE--DSTHRLATNRFA 94

Query: 87  DLTPQEFIASQTGFKMS-DHSSSLKANGTPFLYKS---SQVPPSVNWIEKGAVTPVKYQG 142
           DLT +EF A++TG +     ++   +    F Y++   +    S++W   GAVT VK QG
Sbjct: 95  DLTDEEFRAARTGLRRPPAAAAGAGSGAGGFRYENFSLADAAGSMDWRAMGAVTGVKDQG 154

Query: 143 QC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQN 195
            C       AVAAVEG+  I+  RLVSLSEQQLVDC    ++ GC GG MD+AF+Y+I  
Sbjct: 155 SCGCCWAFSAVAAVEGLTKIRTGRLVSLSEQQLVDCDVYGDDEGCAGGLMDNAFEYMINR 214

Query: 196 KGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDA 255
            G+T ++ Y Y G + G C   +    AA I  YEDVP N+E +L+ AVA+QPVSVAI+ 
Sbjct: 215 GGLTTESSYPYRG-TDGSC---RRSASAASIRGYEDVPANNEAALMAAVAHQPVSVAING 270

Query: 256 --SALQFYSGGVFNGY-CETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQ 312
             S  +FY  GV  G  C T LNH +TAVGYGT+ +G KYW++KNSWG  WGE GY R++
Sbjct: 271 GDSVFRFYDSGVLGGSGCGTELNHAITAVGYGTASDGTKYWIMKNSWGGSWGEGGYVRIR 330

Query: 313 RDIDQPQGQCGIAMFASFPV 332
           R + + +G CG+A  AS+PV
Sbjct: 331 RGV-RGEGVCGLAQLASYPV 349


>gi|297799636|ref|XP_002867702.1| hypothetical protein ARALYDRAFT_329301 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297313538|gb|EFH43961.1| hypothetical protein ARALYDRAFT_329301 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 357

 Score =  245 bits (625), Expect = 2e-62,   Method: Compositional matrix adjust.
 Identities = 141/316 (44%), Positives = 193/316 (61%), Gaps = 20/316 (6%)

Query: 34  FEQWKAQYGRTYKES-AENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQE 92
           F+ W +++G+TY  +  E  +RF+ FKDNL  +++ N     N SY L L +FADLT QE
Sbjct: 48  FQMWMSKHGKTYTNALGEKERRFQNFKDNLRFIDQHNAK---NLSYQLGLTRFADLTVQE 104

Query: 93  FIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQC-------A 145
           +     G        +L+ +         Q+P SV+W  +GAV+ +K QG C        
Sbjct: 105 YRDLFPG-SPKPKQRNLRISRRYVPLDGDQLPESVDWRNEGAVSAIKDQGTCNSCWAFST 163

Query: 146 VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYG-GFMDDAFKYIIQNKGITNDAVY 204
           VAAVEGIN I    LVSLSEQ+LVDC  N  NNGCYG G MD AF+++I N G+ +D  Y
Sbjct: 164 VAAVEGINKIVTGELVSLSEQELVDC--NLVNNGCYGSGTMDAAFQFLINNGGLDSDTDY 221

Query: 205 SYEGMSTGICDSIKA-EDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASALQF--Y 261
            Y+G S G C+  ++  +    I +YEDVP NDE SL KAVA+QPVSV +D  + +F  Y
Sbjct: 222 PYQG-SQGYCNRKESTSNKIITIDSYEDVPANDEISLQKAVAHQPVSVGVDKKSQEFMLY 280

Query: 262 SGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQ 321
             G++NG C T L+H +  VGYG SE G  YW+++NSWG  WG+ GY ++ R+ + P G 
Sbjct: 281 RSGIYNGPCGTDLDHALVIVGYG-SENGQDYWIVRNSWGTTWGDAGYAKMARNFEYPSGV 339

Query: 322 CGIAMFASFPVSKESA 337
           CGIAM AS+PV   ++
Sbjct: 340 CGIAMLASYPVKNSAS 355


>gi|357124027|ref|XP_003563708.1| PREDICTED: germination-specific cysteine protease 1-like
           [Brachypodium distachyon]
          Length = 334

 Score =  245 bits (625), Expect = 3e-62,   Method: Compositional matrix adjust.
 Identities = 144/327 (44%), Positives = 198/327 (60%), Gaps = 23/327 (7%)

Query: 26  DEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAI--GNRSYTLRLN 83
           D+ ++ E++E+W A+ GRTYK+S E ++RFE+FK N   ++  N A    G     L  N
Sbjct: 12  DDKAMRERYEKWMAEQGRTYKDSTEKARRFEVFKSNAHFIDSHNAATGPGGKSRPKLTTN 71

Query: 84  KFADLTPQEFI-ASQTGFKMSDHSSSLKANGTPFLYKS---SQVPPSVNWIEKGAVTPVK 139
           KFADLT  EF     TG +++   +SL  + T F + +   S VPPS++W  +GAVT VK
Sbjct: 72  KFADLTEDEFRNIYVTGHRVNYRPTSLVTD-TVFKFGAVSLSDVPPSIDWRARGAVTSVK 130

Query: 140 YQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYI 192
            Q  CA        AAVEGI+ I     VSLS QQLVDC +N  N  C  G +D A++YI
Sbjct: 131 DQHLCACCWAFSSAAAVEGIHQITTGNQVSLSVQQLVDC-SNAANEKCKAGEIDKAYEYI 189

Query: 193 IQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVA 252
            ++ G+  D  Y YEG S G C  +  +   A+I+ ++ VP  +E +LL AVA+QPVSVA
Sbjct: 190 ARSGGLVADQDYPYEGHS-GTC-RVYGKQAVARISGFQYVPARNETALLLAVAHQPVSVA 247

Query: 253 ID--ASALQFYSGGVFNGY---CETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDG 307
           +D  + ALQ    G+F      C T LNH +T VGYGT E G +YWL+KNSWG DWG+ G
Sbjct: 248 LDGLSRALQHIGTGIFGSAGEPCTTNLNHAMTIVGYGTDEHGTRYWLMKNSWGSDWGDKG 307

Query: 308 YFRLQRDI-DQPQGQCGIAMFASFPVS 333
           Y +  RD+  +  G CG+A+ AS+PV+
Sbjct: 308 YVKFARDVASEINGVCGLALEASYPVA 334


>gi|312282059|dbj|BAJ33895.1| unnamed protein product [Thellungiella halophila]
          Length = 379

 Score =  244 bits (624), Expect = 3e-62,   Method: Compositional matrix adjust.
 Identities = 135/312 (43%), Positives = 186/312 (59%), Gaps = 21/312 (6%)

Query: 34  FEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEF 93
           FE W  ++G+ Y   AE  +R  IFKDNL  +   N+  +G   Y L LN+FADL+  E+
Sbjct: 64  FESWIVKHGKVYDSVAEKERRLTIFKDNLRFITNRNSENLG---YRLGLNRFADLSLHEY 120

Query: 94  IASQTGFKMSDHSSSLKANGTPFLYKSSQ---VPPSVNWIEKGAVTPVKYQGQC------ 144
                G       + +  + +   YK+S    +P SV+W  +GAVT VK QG C      
Sbjct: 121 KEICHGADPKPPRNHVFMSSSD-RYKTSAGDVLPKSVDWRNEGAVTEVKDQGHCRSCWAF 179

Query: 145 -AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAV 203
             V AVEG+N I    LV+LSEQ L++C  N  NNGC GG ++ A+++I+ N G+  D  
Sbjct: 180 STVGAVEGLNKIVTGELVTLSEQDLINC--NKENNGCGGGKVETAYEFIVSNGGLGTDND 237

Query: 204 YSYEGMSTGICDS-IKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA--LQF 260
           Y Y+ ++ G CD  +K       I  YE++P NDE +L+KAVA+QPV+  ID+S+   Q 
Sbjct: 238 YPYKAVN-GACDGRLKENIKNVMIDGYENLPANDELALMKAVAHQPVTAVIDSSSREFQL 296

Query: 261 YSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQG 320
           Y  GVF+G C T LNHGV  VGYGT E G  YW+++NSWG  WGE GY ++ R+I  P+G
Sbjct: 297 YESGVFDGRCGTNLNHGVVVVGYGT-ENGRNYWIVRNSWGNTWGEAGYMKMARNIANPRG 355

Query: 321 QCGIAMFASFPV 332
            CGIAM  S+P+
Sbjct: 356 LCGIAMRVSYPL 367


>gi|186516984|ref|NP_195406.2| cysteine proteinase1 [Arabidopsis thaliana]
 gi|15290508|gb|AAK92229.1| cysteine proteinase [Arabidopsis thaliana]
 gi|332661313|gb|AEE86713.1| cysteine proteinase1 [Arabidopsis thaliana]
          Length = 376

 Score =  244 bits (624), Expect = 3e-62,   Method: Compositional matrix adjust.
 Identities = 144/340 (42%), Positives = 194/340 (57%), Gaps = 25/340 (7%)

Query: 18  SQATYRTFDEGSIAEKFEQWKAQYGRTYKESA----ENSKRFEIFKDNLVAVERFNNAAI 73
           S   +RT +E  +   + QW A++G+T   +     +  KRF IFKDNL  ++  +N   
Sbjct: 35  SDGKWRTDEE--VRSIYLQWSAEHGKTNNNNNGIINDQDKRFNIFKDNLRFID-LHNENN 91

Query: 74  GNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSS----QVPPSVNW 129
            N +Y L L KF DLT  E+     G +        KA      Y ++    +VP +V+W
Sbjct: 92  KNATYKLGLTKFTDLTNDEYRKLYLGARTEPARRIAKAKNVNQKYSAAVNGKEVPETVDW 151

Query: 130 IEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYG 182
            +KGAV P+K QG C         AAVEGIN I    L+SLSEQ+LVDC     N GC G
Sbjct: 152 RQKGAVNPIKDQGTCGSCWAFSTTAAVEGINKIVTGELISLSEQELVDC-DKSYNQGCNG 210

Query: 183 GFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLK 242
           G MD AF++I++N G+  +  Y Y G   G C+S         I  YEDVP  DE +L K
Sbjct: 211 GLMDYAFQFIMKNGGLNTEKDYPYRGFG-GKCNSFLKNSRVVSIDGYEDVPTKDETALKK 269

Query: 243 AVANQPVSVAIDASA--LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWG 300
           A++ QPVSVAI+A     Q Y  G+F G C T L+H V AVGYG SE G+ YW+++NSWG
Sbjct: 270 AISYQPVSVAIEAGGRIFQHYQSGIFTGSCGTNLDHAVVAVGYG-SENGVDYWIVRNSWG 328

Query: 301 QDWGEDGYFRLQRDIDQPQ-GQCGIAMFASFPVSKESAQP 339
             WGE+GY R++R++   + G+CGIA+ AS+PV K S  P
Sbjct: 329 PRWGEEGYIRMERNLAASKSGKCGIAVEASYPV-KYSPNP 367


>gi|320169652|gb|EFW46551.1| cathepsin L2 [Capsaspora owczarzaki ATCC 30864]
          Length = 325

 Score =  244 bits (624), Expect = 4e-62,   Method: Compositional matrix adjust.
 Identities = 148/342 (43%), Positives = 192/342 (56%), Gaps = 31/342 (9%)

Query: 3   KYFLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNL 62
           K F  V L+   +CA+            A  F +WKA + R Y  + E + R EI+  NL
Sbjct: 2   KAFTAVALLALVACAT------------AMPFAEWKALHNRQYASAQEEALRQEIYLSNL 49

Query: 63  VAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQ 122
             +   N  A G  SYTL +N+F DL   EF A   G + +  +++     + +L +   
Sbjct: 50  ELINEHN--AAGRHSYTLGMNEFGDLAHHEFAAKYLGVRFNGVNATKSFASSTYLPRMVS 107

Query: 123 VPPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATND 175
           +P SV+W   G VTPVK QGQC          +VEG +A K   LVSLSEQ LVDC++ +
Sbjct: 108 LPDSVDWRTAGIVTPVKNQGQCGSCWSFSTTGSVEGQHARKTGTLVSLSEQNLVDCSSQE 167

Query: 176 NNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPN 235
            N GC GG MDDAF+YII+N GI  +A Y Y   +TG C    A +  A + +Y+D+   
Sbjct: 168 GNEGCNGGLMDDAFEYIIKNGGIDTEASYPYTA-TTGTCK-FNAANIGATVASYQDIITG 225

Query: 236 DEESLLKAVAN-QPVSVAIDASAL--QFYSGGVFN-GYCETF-LNHGVTAVGYGTSEEGI 290
            E  L  AVA   PVSVAIDAS +  QFY  GV+N   C T  L+HGV AVGYGTS EG 
Sbjct: 226 SESDLQNAVATVGPVSVAIDASHINFQFYFTGVYNEKKCSTTQLDHGVLAVGYGTSTEGK 285

Query: 291 KYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
            YWL+KNSWG  WG+ GY  + R+ D    QCGIA  AS+P+
Sbjct: 286 DYWLVKNSWGATWGKAGYIWMSRNADN---QCGIATSASYPL 324


>gi|46395939|sp|Q94B08.2|GCP1_ARATH RecName: Full=Germination-specific cysteine protease 1; Flags:
           Precursor
 gi|4006883|emb|CAB16767.1| cysteine proteinase [Arabidopsis thaliana]
 gi|7270637|emb|CAB80354.1| cysteine proteinase [Arabidopsis thaliana]
          Length = 376

 Score =  244 bits (624), Expect = 4e-62,   Method: Compositional matrix adjust.
 Identities = 144/340 (42%), Positives = 195/340 (57%), Gaps = 25/340 (7%)

Query: 18  SQATYRTFDEGSIAEKFEQWKAQYGRTYKESA----ENSKRFEIFKDNLVAVERFNNAAI 73
           S   +RT +E  +   + QW A++G+T   +     +  KRF IFKDNL  ++  +N   
Sbjct: 35  SDGKWRTDEE--VRSIYLQWSAEHGKTNNNNNGIINDQDKRFNIFKDNLRFID-LHNEDN 91

Query: 74  GNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSS----QVPPSVNW 129
            N +Y L L KF DLT  E+     G +        KA      Y ++    +VP +V+W
Sbjct: 92  KNATYKLGLTKFTDLTNDEYRKLYLGARTEPARRIAKAKNVNQKYSAAVNGKEVPETVDW 151

Query: 130 IEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYG 182
            +KGAV P+K QG C         AAVEGIN I    L+SLSEQ+LVDC  +  N GC G
Sbjct: 152 RQKGAVNPIKDQGTCGSCWAFSTTAAVEGINKIVTGELISLSEQELVDCDKS-YNQGCNG 210

Query: 183 GFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLK 242
           G MD AF++I++N G+  +  Y Y G   G C+S         I  YEDVP  DE +L K
Sbjct: 211 GLMDYAFQFIMKNGGLNTEKDYPYRGFG-GKCNSFLKNSRVVSIDGYEDVPTKDETALKK 269

Query: 243 AVANQPVSVAIDASA--LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWG 300
           A++ QPVSVAI+A     Q Y  G+F G C T L+H V AVGYG SE G+ YW+++NSWG
Sbjct: 270 AISYQPVSVAIEAGGRIFQHYQSGIFTGSCGTNLDHAVVAVGYG-SENGVDYWIVRNSWG 328

Query: 301 QDWGEDGYFRLQRDIDQPQ-GQCGIAMFASFPVSKESAQP 339
             WGE+GY R++R++   + G+CGIA+ AS+PV K S  P
Sbjct: 329 PRWGEEGYIRMERNLAASKSGKCGIAVEASYPV-KYSPNP 367


>gi|57118011|gb|AAW34137.1| cysteine protease gp3b [Zingiber officinale]
          Length = 466

 Score =  244 bits (622), Expect = 5e-62,   Method: Compositional matrix adjust.
 Identities = 139/315 (44%), Positives = 192/315 (60%), Gaps = 16/315 (5%)

Query: 34  FEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAA-IGNRSYTLRLNKFADLTPQE 92
           +++W+A++     +      R E+FK+NL  V+  N AA  G  +Y L +N+FADLT +E
Sbjct: 43  YQEWRAKHRPAENDQYVGDYRLEVFKENLRFVDEHNAAADRGEHAYRLGMNRFADLTNEE 102

Query: 93  FIAS-QTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQC------- 144
           + A           S+S + +    L +   +P S++W EKGAV  VK QG+C       
Sbjct: 103 YRARFLRDLSRLGRSTSGEISNQYRLREGDVLPDSIDWREKGAVVAVKSQGRCGSCWAFA 162

Query: 145 AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVY 204
           A+A VEGIN I    L+SLSEQQLVDC+T   N+GC GG+   AF+YII N G+ ++  Y
Sbjct: 163 AIATVEGINQIVTGDLISLSEQQLVDCSTR--NHGCEGGWPYRAFQYIINNGGVNSEEHY 220

Query: 205 SYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA--LQFYS 262
            Y G +    ++ K   H   I +Y +VP NDE+SL KAVANQP+SV I+AS    Q Y 
Sbjct: 221 PYTGTNGTC-NTTKGNAHVVSIDSYRNVPSNDEKSLQKAVANQPISVGINASGRNFQLYH 279

Query: 263 GGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQC 322
            G+F G C T LNHGVT VGYGT   G  YW++KNSWG+ WG+ GY  ++R+I +  G+C
Sbjct: 280 SGIFTGSCNTSLNHGVTVVGYGTVN-GNDYWIVKNSWGESWGDSGYILMERNIAESSGKC 338

Query: 323 GIAMFASFPVSKESA 337
           GIA+  S+P+ KE A
Sbjct: 339 GIAISPSYPI-KEGA 352


>gi|57118009|gb|AAW34136.1| cysteine protease gp3a [Zingiber officinale]
          Length = 475

 Score =  244 bits (622), Expect = 6e-62,   Method: Compositional matrix adjust.
 Identities = 137/310 (44%), Positives = 188/310 (60%), Gaps = 15/310 (4%)

Query: 34  FEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAA-IGNRSYTLRLNKFADLTPQE 92
           +++W+ ++     +      R E+FK+NL  V+  N AA  G  +Y L +N+FADLT +E
Sbjct: 52  YQEWRVKHRPAENDQYVGDYRLEVFKENLRFVDEHNAAADRGEHAYRLGMNRFADLTNEE 111

Query: 93  FIAS-QTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQC------- 144
           + A           S+S + +    L +   +P S++W EKGAV  VK QG+C       
Sbjct: 112 YRARFLRDLSRLGRSTSGEISNQYRLREGDVLPDSIDWREKGAVVAVKNQGRCGSCWAFA 171

Query: 145 AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVY 204
           A+AAVEGIN I    L+SLSEQQLVDC+T   N GC GG+   AF+YII N G+ ++  Y
Sbjct: 172 AIAAVEGINQIVTGDLISLSEQQLVDCSTR--NYGCEGGWPYRAFQYIINNGGVNSEEHY 229

Query: 205 SYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA--LQFYS 262
            Y G +    ++ K   H   I +Y +VP NDE+SL KA ANQP+SV IDAS    Q Y 
Sbjct: 230 PYTGTNGTC-NTTKENAHVVSIDSYRNVPSNDEKSLQKAAANQPISVGIDASGRNFQLYH 288

Query: 263 GGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQC 322
            G+F G C T LNHGVT VGYGT E G  YW++KNSWG++WG  GY  ++R+I +  G+C
Sbjct: 289 SGIFTGSCNTSLNHGVTVVGYGT-ENGNDYWIVKNSWGENWGNSGYILMERNIAESSGKC 347

Query: 323 GIAMFASFPV 332
           GIA+  S+P+
Sbjct: 348 GIAISPSYPI 357


>gi|146215992|gb|ABQ10198.1| actinidin Act4b [Actinidia eriantha]
          Length = 379

 Score =  244 bits (622), Expect = 6e-62,   Method: Compositional matrix adjust.
 Identities = 143/333 (42%), Positives = 198/333 (59%), Gaps = 20/333 (6%)

Query: 23  RTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRL 82
           RT DE  +   FE W  +YG++Y    E  +RFEIFKDNL  V+  N  A  NRSY + L
Sbjct: 39  RTNDE--VMAMFESWLVEYGKSYNALGEKERRFEIFKDNLRFVDEHN--ADVNRSYKVGL 94

Query: 83  NKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQG 142
           N+F+DLT +E+ +   G K     +++     P +    Q+P S++W +KGAV  VK QG
Sbjct: 95  NQFSDLTLEEYSSIYLGTKFDMRMTNVSDRYEPRV--GDQLPNSIDWRKKGAVLGVKNQG 152

Query: 143 QCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQN 195
            C        +AAVE IN I    L+SLSEQQ+VDC     NNGC GG    A+++II N
Sbjct: 153 NCGSCWTFAPIAAVEAINQIVTGNLISLSEQQIVDCQRKSPNNGCKGGSRAGAYQFIIDN 212

Query: 196 KGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDA 255
            GI  +A Y Y+    G CD  K + +   I  YE+VP  +E++L KAV+NQ VSV I +
Sbjct: 213 GGINTEANYPYKAQD-GECDEQKNQKYVT-IDRYENVPRKNEKALQKAVSNQLVSVGIAS 270

Query: 256 SALQF--YSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQR 313
           ++ +F  Y  G+F G C   ++H VT VGYGT E G+ YW+++NSWG +WGE+GY R+QR
Sbjct: 271 NSSEFKAYKSGIFTGPCGAKIDHAVTIVGYGT-EGGMDYWIVRNSWGSNWGENGYVRMQR 329

Query: 314 DIDQPQGQCGIAMFASFPVSKESAQPSSADKSS 346
           ++    G C IA   ++PV K    P++A  SS
Sbjct: 330 NVGNA-GTCFIATSPNYPV-KYGPNPTNAHLSS 360


>gi|2224810|emb|CAB09698.1| cysteine proteinase [Hordeum vulgare subsp. vulgare]
          Length = 349

 Score =  244 bits (622), Expect = 6e-62,   Method: Compositional matrix adjust.
 Identities = 140/320 (43%), Positives = 200/320 (62%), Gaps = 21/320 (6%)

Query: 27  EGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFA 86
           + ++  + E+W A++GRTY    E ++R E+F+ N   ++ FN+A   + ++ L  N+FA
Sbjct: 37  DAAMVSRHEKWMAEHGRTYANEEEKARRLEVFRANAKLIDSFNSAE--DSTHRLATNRFA 94

Query: 87  DLTPQEFIASQTGFKMS-DHSSSLKANGTPFLYKS---SQVPPSVNWIEKGAVTPVKYQG 142
           DLT +EF A++TG +     ++   +    F Y++   +    S++W   GAVT VK QG
Sbjct: 95  DLTDEEFRAARTGLRRPPAAAAGAGSGAGGFRYENFSLADAAGSMDWRAMGAVTGVKDQG 154

Query: 143 QC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQN 195
            C       AVAAVEG+  I+  RLVSLSEQQLVDC    ++ GC GG MD+AF+Y+I  
Sbjct: 155 SCGCCWAFSAVAAVEGLTKIRTGRLVSLSEQQLVDCDVYGDDEGCAGGLMDNAFEYMINR 214

Query: 196 KGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDA 255
            G+T ++ Y Y G + G C   +    AA I  YEDVP N+E +L+ AVA+QPVSVAI+ 
Sbjct: 215 GGLTTESSYPYRG-TDGSC---RRSASAASIRGYEDVPANNEAALMAAVAHQPVSVAING 270

Query: 256 --SALQFYSGGVFNGY-CETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQ 312
             S  +FY  GV  G  C T LNH +TA GYGT+ +G KYW++KNSWG  WGE GY R++
Sbjct: 271 GDSVFRFYDSGVLGGSGCGTELNHAITAAGYGTASDGTKYWIMKNSWGGSWGEGGYVRIR 330

Query: 313 RDIDQPQGQCGIAMFASFPV 332
           R + + +G CG+A  AS+PV
Sbjct: 331 RGV-RGEGVCGLAQLASYPV 349


>gi|242094002|ref|XP_002437491.1| hypothetical protein SORBIDRAFT_10g028010 [Sorghum bicolor]
 gi|241915714|gb|EER88858.1| hypothetical protein SORBIDRAFT_10g028010 [Sorghum bicolor]
          Length = 397

 Score =  243 bits (621), Expect = 7e-62,   Method: Compositional matrix adjust.
 Identities = 145/342 (42%), Positives = 204/342 (59%), Gaps = 35/342 (10%)

Query: 30  IAEKFEQWKAQYGRTYKE----SAENSKRFEIFKDNLVAVERFN-NAAIGNRSYTLRLNK 84
           +   +E WK+++GR          E+  R E+F+DNL  ++  N  A  G  ++ L L  
Sbjct: 50  VRRMYEAWKSKHGRPRGNCDMAGDEDRLRLEVFRDNLRYIDAHNAEADAGLHTFRLGLTP 109

Query: 85  FADLTPQEFIASQTGFKMSDH--------SSSLKANGTPFLYKS-------SQVPPSVNW 129
           FADLT +E+     GF+            +S + + GT   ++          +P +++W
Sbjct: 110 FADLTLEEYRGRALGFRARHRGGPSARAAASRVGSGGTRSHHRRPRPRPRCGDLPDAIDW 169

Query: 130 IEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYG 182
            + GAVT VK Q QC       AVAA+EGINAI    LVSLSEQ+++DC T D+  GC G
Sbjct: 170 RQLGAVTDVKNQEQCGGCWAFSAVAAIEGINAIVTGNLVSLSEQEIIDCDTQDS--GCNG 227

Query: 183 GFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDH-AAQITNYEDVPPNDEESLL 241
           G M++AF+++I N GI ++A Y +   + G CD+ KA D   A I  + +V  N+E +L 
Sbjct: 228 GQMENAFQFVIDNGGIDSEADYPFIA-TDGTCDANKANDEKVAAIDGFVEVASNNETALQ 286

Query: 242 KAVANQPVSVAIDAS--ALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSW 299
           +AVA QPVSVAIDA   A Q YS G+FNG C T L+HGVT VGYG SE G  YW++KNSW
Sbjct: 287 EAVAIQPVSVAIDAGGRAFQHYSSGIFNGPCGTNLDHGVTVVGYG-SENGKAYWIVKNSW 345

Query: 300 GQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVSKESAQPSS 341
              WGE GY R++R++  P G+CGIAM AS+PV K++  P++
Sbjct: 346 SDSWGEAGYIRIRRNVFLPVGKCGIAMDASYPV-KDTYGPAA 386


>gi|297802228|ref|XP_002868998.1| cysteine proteinase [Arabidopsis lyrata subsp. lyrata]
 gi|297314834|gb|EFH45257.1| cysteine proteinase [Arabidopsis lyrata subsp. lyrata]
          Length = 375

 Score =  243 bits (621), Expect = 8e-62,   Method: Compositional matrix adjust.
 Identities = 146/347 (42%), Positives = 198/347 (57%), Gaps = 25/347 (7%)

Query: 18  SQATYRTFDEGSIAEKFEQWKAQYGRTYKESA----ENSKRFEIFKDNLVAVERFNNAAI 73
           S + +RT +E  +   + QW A +G+T   +     +  KRF IFKDNL  ++  +N   
Sbjct: 35  SDSWWRTDEE--VRSIYLQWSADHGKTNNNNNGIINDQDKRFNIFKDNLRFID-LHNEKN 91

Query: 74  GNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSS----QVPPSVNW 129
            N +Y L L KF DLT +E+ +   G +        KA      Y ++    +VP +V+W
Sbjct: 92  KNATYKLGLTKFTDLTNEEYRSLYLGARTEPVRRIAKAKNVNQKYSAAVDGKEVPETVDW 151

Query: 130 IEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYG 182
             KGAV P+K QG C         AAVEGIN I    L+SLSEQ+LVDC  N  N GC G
Sbjct: 152 RLKGAVNPIKDQGTCGSCWAFSTAAAVEGINKIVTGELISLSEQELVDC-DNSYNQGCNG 210

Query: 183 GFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLK 242
           G MD AF++I++N G+  +  Y Y G   G C+S         I  YEDVP  DE +L +
Sbjct: 211 GLMDYAFQFIMKNGGLKTEKDYPYRGFG-GKCNSFLKNAKVVSIDGYEDVPTKDETALKR 269

Query: 243 AVANQPVSVAIDASA--LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWG 300
           A++ QPVSVAI+A     Q Y  G+F G C T L+H V AVGYG SE G+ YW+++NSWG
Sbjct: 270 AISLQPVSVAIEAGGRIFQHYQTGIFTGNCGTNLDHAVVAVGYG-SENGVDYWIVRNSWG 328

Query: 301 QDWGEDGYFRLQRDIDQPQ-GQCGIAMFASFPVSKESAQPSSADKSS 346
             WGE+GY R++R++   + G+CGIA+ AS+PV K S  P     SS
Sbjct: 329 PRWGEEGYIRMERNLASSKSGKCGIAVEASYPV-KYSPNPVRGSISS 374


>gi|125525815|gb|EAY73929.1| hypothetical protein OsI_01813 [Oryza sativa Indica Group]
          Length = 336

 Score =  243 bits (621), Expect = 8e-62,   Method: Compositional matrix adjust.
 Identities = 148/349 (42%), Positives = 195/349 (55%), Gaps = 31/349 (8%)

Query: 1   MAKYFLIVV--LIISGSCASQATYRT-FDEGSIAEKFEQWKAQYGRTYKESAENSKRFEI 57
           MA  FL+VV  L+   + A+ A Y    D+G   + FE+W A++G+TYK   E   RF I
Sbjct: 1   MASAFLLVVCTLMALQAMAASAYYNNGSDDGVTMQMFEEWMAKFGKTYKCHGEKEHRFGI 60

Query: 58  FKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFL 117
           F+DN+  +  +      + +  + +N+FADLT  EF+A+ TG K      +      P  
Sbjct: 61  FRDNVHFIRGYKPQVTYDSA--VGINQFADLTNDEFVATYTGAKPPHPKEA------PRP 112

Query: 118 YKSSQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVD 170
                 P  ++W  +GAVT VK QG C       AVAA+EG+  I+  +L  LSEQ+LVD
Sbjct: 113 VDPIWTPCCIDWRFRGAVTGVKDQGACGSCWAFAAVAAIEGLTKIRTGQLTPLSEQELVD 172

Query: 171 CATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAED----HAAQI 226
           C TN  +NGC GG  D AF+ +    GIT ++ Y YEG   G C   + +D    HAA I
Sbjct: 173 CDTN--SNGCGGGHTDRAFELVASKGGITAESDYRYEGFQ-GKC---RVDDMLFNHAASI 226

Query: 227 TNYEDVPPNDEESLLKAVANQPVSVAIDAS--ALQFYSGGVFNGYCETFLNHGVTAVGYG 284
             Y  VPPNDE  L  AVA QPV+V IDAS  A QFY  GVF G C    NH VT VGY 
Sbjct: 227 GGYRAVPPNDERQLATAVARQPVTVYIDASGPAFQFYKSGVFPGPCGASSNHAVTLVGYC 286

Query: 285 T-SEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
                G KYW+ KNSWG+ WG+ GY  L++D+ QP G CG+A+   +P 
Sbjct: 287 QDGASGKKYWVAKNSWGKTWGQQGYILLEKDVLQPHGTCGLAVSPFYPT 335


>gi|255032|gb|AAB23155.1| COT44=cysteine proteinase homolog [Brassica napus, seedling, rapid
           cycling base population CrGC5, Peptide, 328 aa]
          Length = 328

 Score =  243 bits (621), Expect = 8e-62,   Method: Compositional matrix adjust.
 Identities = 138/323 (42%), Positives = 188/323 (58%), Gaps = 22/323 (6%)

Query: 34  FEQWKAQYGRTYKESA----ENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLT 89
           + +W  ++G++   S     +  +RF IFKDNL  ++  +N    N +Y L L  FA+LT
Sbjct: 4   YLRWSLEHGKSNSNSNGIINQQDERFNIFKDNLRFID-LHNENNKNATYKLGLTIFANLT 62

Query: 90  PQEFIASQTGFKMSDHSSSLKANGTPFLYKSS----QVPPSVNWIEKGAVTPVKYQGQCA 145
             E+ +   G +        KA      Y ++    +VP +V+W +KGAV  +K QG C 
Sbjct: 63  NDEYRSLYLGARTEPVRRITKAKNVNMKYSAAVNDVEVPVTVDWRQKGAVNAIKDQGTCG 122

Query: 146 -------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGI 198
                   AAVEGIN I    LVSLSEQ+LVDC     N GC GG MD AF++I++N G+
Sbjct: 123 SCWAFSTAAAVEGINKIVTGELVSLSEQELVDC-DKSYNQGCNGGLMDYAFQFIMKNGGL 181

Query: 199 TNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDAS-- 256
             +  Y Y G + G C+S+        I  YEDVP  DE +L +AV+ QPVSVAIDA   
Sbjct: 182 NTEKDYPYHG-TNGKCNSLLKNSRVVTIDGYEDVPSKDETALKRAVSYQPVSVAIDAGGR 240

Query: 257 ALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDID 316
           A Q Y  G+F G C T ++H V AVGYG SE G+ YW+++NSWG  WGEDGY R++R++ 
Sbjct: 241 AFQHYQSGIFTGKCGTNMDHAVVAVGYG-SENGVDYWIVRNSWGTRWGEDGYIRMERNVA 299

Query: 317 QPQGQCGIAMFASFPVSKESAQP 339
              G+CGIA+ AS+PV K S  P
Sbjct: 300 SKSGKCGIAIEASYPV-KYSPNP 321


>gi|118127|sp|P25251.1|CYSP4_BRANA RecName: Full=Cysteine proteinase COT44; Flags: Precursor
          Length = 328

 Score =  243 bits (621), Expect = 8e-62,   Method: Compositional matrix adjust.
 Identities = 138/323 (42%), Positives = 188/323 (58%), Gaps = 22/323 (6%)

Query: 34  FEQWKAQYGRTYKESA----ENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLT 89
           + +W  ++G++   S     +  +RF IFKDNL  ++  +N    N +Y L L  FA+LT
Sbjct: 4   YLRWSLEHGKSNSNSNGIINQQDERFNIFKDNLRFID-LHNENNKNATYKLGLTIFANLT 62

Query: 90  PQEFIASQTGFKMSDHSSSLKANGTPFLYKSS----QVPPSVNWIEKGAVTPVKYQGQCA 145
             E+ +   G +        KA      Y ++    +VP +V+W +KGAV  +K QG C 
Sbjct: 63  NDEYRSLYLGARTEPVRRITKAKNVNMKYSAAVNVDEVPVTVDWRQKGAVNAIKDQGTCG 122

Query: 146 -------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGI 198
                   AAVEGIN I    LVSLSEQ+LVDC     N GC GG MD AF++I++N G+
Sbjct: 123 SCWAFSTAAAVEGINKIVTGELVSLSEQELVDC-DKSYNQGCNGGLMDYAFQFIMKNGGL 181

Query: 199 TNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDAS-- 256
             +  Y Y G + G C+S+        I  YEDVP  DE +L +AV+ QPVSVAIDA   
Sbjct: 182 NTEKDYPYHG-TNGKCNSLLKNSRVVTIDGYEDVPSKDETALKRAVSYQPVSVAIDAGGR 240

Query: 257 ALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDID 316
           A Q Y  G+F G C T ++H V AVGYG SE G+ YW+++NSWG  WGEDGY R++R++ 
Sbjct: 241 AFQHYQSGIFTGKCGTNMDHAVVAVGYG-SENGVDYWIVRNSWGTRWGEDGYIRMERNVA 299

Query: 317 QPQGQCGIAMFASFPVSKESAQP 339
              G+CGIA+ AS+PV K S  P
Sbjct: 300 SKSGKCGIAIEASYPV-KYSPNP 321


>gi|356542171|ref|XP_003539543.1| PREDICTED: LOW QUALITY PROTEIN: KDEL-tailed cysteine endopeptidase
           CEP2-like [Glycine max]
          Length = 342

 Score =  243 bits (621), Expect = 8e-62,   Method: Compositional matrix adjust.
 Identities = 140/336 (41%), Positives = 188/336 (55%), Gaps = 22/336 (6%)

Query: 6   LIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAV 65
           L  + I + +C ++    + D   +  ++E W  +YG+ Y+   E   RFEI++ N+  +
Sbjct: 16  LCNLWITASACPAKHNDNSSDSEVMRMRYESWLKKYGQKYRNKDEWEFRFEIYRANVQFI 75

Query: 66  ERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLY-KSSQVP 124
           E +N+    N SY L  NKF DLT +EF      ++   H        T F+Y K   +P
Sbjct: 76  EVYNSQ---NYSYKLMDNKFVDLTNEEFRRMYLVYQPRSHLQ------TRFMYQKHGDLP 126

Query: 125 PSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNN 177
             ++W  +GAVT +K QG C       AVA VE IN IK  +LVSLSEQQL+DC   + N
Sbjct: 127 KRIDWRTRGAVTXIKDQGHCGSCWSFSAVATVEDINKIKTGKLVSLSEQQLIDCDNRNGN 186

Query: 178 NGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDE 237
            GC GG M+  F +I +  G+T D  Y Y+G S G  +  K  +HA  I  YE++P ++E
Sbjct: 187 EGCNGGHME-TFTFITKRGGLTTDKNYPYQG-SDGDXNKAKVRNHAVAICGYENLPAHNE 244

Query: 238 ESLLKAVANQPVSVAIDAS--ALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLI 295
             L  AVA+QP SVA DA   A Q YS G F+G C   LNH +T VGYG  E G KYWL+
Sbjct: 245 NMLKAAVAHQPASVATDAGGYAFQLYSKGTFSGSCGKDLNHRMTIVGYG-EENGEKYWLV 303

Query: 296 KNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFP 331
           KNSW  D G  GY R++RD     G CG AM AS+P
Sbjct: 304 KNSWANDXGVSGYIRMKRDPKDKDGTCGTAMEASYP 339


>gi|302790828|ref|XP_002977181.1| hypothetical protein SELMODRAFT_106402 [Selaginella moellendorffii]
 gi|300155157|gb|EFJ21790.1| hypothetical protein SELMODRAFT_106402 [Selaginella moellendorffii]
          Length = 337

 Score =  243 bits (620), Expect = 1e-61,   Method: Compositional matrix adjust.
 Identities = 136/346 (39%), Positives = 204/346 (58%), Gaps = 23/346 (6%)

Query: 1   MAKYFLIVVLIISGSCASQATYRTFDEG---SIAEKFEQWKAQYGRTYKESAENSKRFEI 57
           M    LI+++++  +  + A     ++G    I   FE W A++G++Y    E ++R  I
Sbjct: 1   MIASTLILLVVVGATPFAIARPAALEDGRALEIKNMFEDWAAKHGKSYSSDWEKARRLMI 60

Query: 58  FKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTG-FKMSDHSSSLKANGTPF 116
           F D L  +E+ N  A  N ++TL LNKF+DLT  EF A   G FK   +   L A     
Sbjct: 61  FSDTLAYIEKHN--AQPNTTFTLGLNKFSDLTNAEFRAMHVGKFKRPRYQDRLPAEDEDV 118

Query: 117 LYKSSQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLV 169
               S +P S++W +KGAVTP+K QG C       A+A++E  + +    LVSLSEQQL+
Sbjct: 119 --DVSSLPTSLDWRQKGAVTPIKDQGDCGSCWAFSAIASIESAHFLATKELVSLSEQQLM 176

Query: 170 DCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNY 229
           DC T D   GC GG M+ AFK++++N G+T +A Y Y G S G C++ KA++  A+IT +
Sbjct: 177 DCDTVDA--GCDGGLMETAFKFVVKNGGVTTEAAYPYTG-SVGSCNANKAKNKVAEITGF 233

Query: 230 EDVPPNDEESLLKAVANQPVSVAIDASALQF--YSGGVFNGYCETFLNHGVTAVGYGTSE 287
           + V  +  ++L+KAV+  PV+V+I  S   F  Y  G+ +G C+  L+HGV  +GYGT E
Sbjct: 234 KVVTEDSADALMKAVSKTPVTVSICGSDENFQNYKSGILSGKCDDSLDHGVLLIGYGT-E 292

Query: 288 EGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVS 333
            G+ YW+IKNSWG  WGEDG+ +++R      G CG+   +S+P +
Sbjct: 293 GGMPYWIIKNSWGTSWGEDGFMKIER--KDGDGMCGMNGDSSYPTT 336


>gi|110737959|dbj|BAF00916.1| cysteine proteinase [Arabidopsis thaliana]
          Length = 376

 Score =  243 bits (620), Expect = 1e-61,   Method: Compositional matrix adjust.
 Identities = 143/340 (42%), Positives = 193/340 (56%), Gaps = 25/340 (7%)

Query: 18  SQATYRTFDEGSIAEKFEQWKAQYGRTYKESA----ENSKRFEIFKDNLVAVERFNNAAI 73
           S   +RT +E  +   + QW A++G+T   +     +  KRF IFKDNL  ++  +N   
Sbjct: 35  SDGKWRTDEE--VRSIYLQWSAEHGKTNNNNNGIINDQDKRFNIFKDNLRFID-LHNENN 91

Query: 74  GNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSS----QVPPSVNW 129
            N +Y L L KF DLT  E+     G +        KA      Y ++    +VP +V+W
Sbjct: 92  KNATYKLGLTKFTDLTNDEYRKLYLGARTEPARRIAKAKNVNQKYSAAVNGKEVPETVDW 151

Query: 130 IEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYG 182
            +KGAV P+K QG C         AAVEGIN I    L+SLSEQ+LVDC     N GC G
Sbjct: 152 RQKGAVNPIKDQGTCGSCWAFSTTAAVEGINKIVTGELISLSEQELVDC-DKSYNQGCNG 210

Query: 183 GFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLK 242
           G MD AF++I++N G+  +  Y Y G   G C+S         I  YEDVP  DE +L K
Sbjct: 211 GLMDYAFQFIMKNGGLNTEKDYPYRGFG-GKCNSFLKNSRVVSIDGYEDVPTKDETALKK 269

Query: 243 AVANQPVSVAIDASA--LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWG 300
           A++ QPV VAI+A     Q Y  G+F G C T L+H V AVGYG SE G+ YW+++NSWG
Sbjct: 270 AISYQPVRVAIEAGGRIFQHYQSGIFTGSCGTNLDHAVVAVGYG-SENGVDYWIVRNSWG 328

Query: 301 QDWGEDGYFRLQRDIDQPQ-GQCGIAMFASFPVSKESAQP 339
             WGE+GY R++R++   + G+CGIA+ AS+PV K S  P
Sbjct: 329 PRWGEEGYIRMERNLAASKSGKCGIAVEASYPV-KYSPNP 367


>gi|223946391|gb|ACN27279.1| unknown [Zea mays]
          Length = 279

 Score =  243 bits (619), Expect = 1e-61,   Method: Compositional matrix adjust.
 Identities = 134/268 (50%), Positives = 165/268 (61%), Gaps = 21/268 (7%)

Query: 88  LTPQEFIASQTGFKMSDHS------SSLKANGTPFLYKSSQ-VPPSVNWIEKGAVTPVKY 140
           +T  EF     G +++ H           A+ + F+Y  ++ VP SV+W +KGAVT VK 
Sbjct: 1   MTADEFRRHYAGSRVAHHRMFRGDRQGSSASASSFMYADARDVPASVDWRQKGAVTDVKD 60

Query: 141 QGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYII 193
           QGQC        +AAVEGINAIK   L SLSEQQLVDC T   N GC GG MD AF+YI 
Sbjct: 61  QGQCGSCWAFSTIAAVEGINAIKTKNLTSLSEQQLVDCDTK-ANAGCNGGLMDYAFQYIA 119

Query: 194 QNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAI 253
           ++ G+  +  Y Y           K+      I  YEDVP NDE +L KAVA+QPVSVAI
Sbjct: 120 KHGGVAAEDAYPYRARQASC---KKSPAPVVTIDGYEDVPANDESALKKAVAHQPVSVAI 176

Query: 254 DASA--LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRL 311
           +AS    QFYS GVF+G C T L+HGV AVGYG + +G KYWL+KNSWG +WGE GY R+
Sbjct: 177 EASGSHFQFYSEGVFSGRCGTELDHGVAAVGYGVTADGTKYWLVKNSWGPEWGEKGYIRM 236

Query: 312 QRDIDQPQGQCGIAMFASFPVSKESAQP 339
            RD+   +G CGIAM AS+PV K S  P
Sbjct: 237 ARDVAAKEGHCGIAMEASYPV-KTSPNP 263


>gi|302845628|ref|XP_002954352.1| hypothetical protein VOLCADRAFT_76255 [Volvox carteri f.
           nagariensis]
 gi|300260282|gb|EFJ44502.1| hypothetical protein VOLCADRAFT_76255 [Volvox carteri f.
           nagariensis]
          Length = 489

 Score =  243 bits (619), Expect = 1e-61,   Method: Compositional matrix adjust.
 Identities = 144/357 (40%), Positives = 198/357 (55%), Gaps = 31/357 (8%)

Query: 1   MAKYFLIVVLIISGS----CASQATYRTFDEGSIAEK-------FEQWKAQYGRTY-KES 48
           MA  FLI  L+++ S     A +   R   E  + +        F+QW  QY + Y  + 
Sbjct: 1   MAVRFLIAALLVAASGGVGAAPELQLREQHEKLLLDAKANPMAAFQQWMMQYTKAYANDI 60

Query: 49  AENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSS 108
            E   RF ++ +NL  +  +N       S+ L LN FADLT  EF  ++ G+      +S
Sbjct: 61  KELETRFSVWLENLNYILAYNARTT---SHWLHLNAFADLTTDEF-RNRLGYDFKARQAS 116

Query: 109 LKANGTPFLYK---SSQVPPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKIN 158
            +   +PF+Y    ++Q+P  ++W +KGAVT VK QGQC          +VEGINAI   
Sbjct: 117 NRLQSSPFIYDNVDANQLPTEIDWRKKGAVTEVKNQGQCGSCWAFATTGSVEGINAIVTG 176

Query: 159 RLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIK 218
            L SLSEQ+LVDC T D + GC GG MD A+++II+N G+  +  Y Y     G+C + K
Sbjct: 177 ELASLSEQELVDCDT-DEDRGCSGGLMDYAYQWIIKNGGLDTEDDYPYTA-EDGVCVAAK 234

Query: 219 AEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAI--DASALQFYSGGVFNG-YCETFLN 275
                  I  Y D+P NDE +L KA A+QP++VAI  DA + Q Y GGV++   C T LN
Sbjct: 235 KNRRVVTIDGYVDIPENDEVALKKAAAHQPIAVAIEADAKSFQLYGGGVYDDPTCGTSLN 294

Query: 276 HGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
           HGV  VGYG       YW++KNSWG +WG++GY RL+   +  QG CGIAM  SFP 
Sbjct: 295 HGVLVVGYGKDPHFGNYWIVKNSWGPEWGDNGYIRLRMGAEDVQGMCGIAMAPSFPT 351


>gi|326430490|gb|EGD76060.1| cysteine proteinase [Salpingoeca sp. ATCC 50818]
          Length = 448

 Score =  242 bits (618), Expect = 2e-61,   Method: Compositional matrix adjust.
 Identities = 136/338 (40%), Positives = 198/338 (58%), Gaps = 24/338 (7%)

Query: 6   LIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAV 65
           L++V  + G+  ++    T ++G +   F+ +K ++ + Y+ + E ++RF +F  N+  +
Sbjct: 5   LVLVCALVGAAMAEPLSLTVNKGRL---FDAFKTKFNKVYESAEEEARRFSVFSQNIDFI 61

Query: 66  ERFN-NAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVP 124
            R N  AA G  ++T+ +N+FADLT +E+        +  + + L       ++      
Sbjct: 62  NRHNAEAARGVHTHTVDVNQFADLTNEEY----RQLYLRPYPTELLGRERQEVWLDGPNA 117

Query: 125 PSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNN 177
            SV+W +KGAVTP+K QGQC          +VEG +AI    LVSLSEQQLVDC+ +  N
Sbjct: 118 GSVDWRQKGAVTPIKNQGQCGSCWSFSTTGSVEGAHAIATGNLVSLSEQQLVDCSGSFGN 177

Query: 178 NGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDE 237
            GC GG MD+AFKYII N G+  +  Y Y     G+CD  K   HA  I+ Y+DVP N+E
Sbjct: 178 QGCNGGLMDNAFKYIISNGGLDTEQDYPYTARD-GVCDKSKESKHAVSISGYKDVPQNNE 236

Query: 238 ESLLKAVANQPVSVAIDAS--ALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLI 295
           + L  AV   PVSVAI+A   + Q YS GVF+G C T L+HGV  VGY TS+    YW++
Sbjct: 237 DQLAAAVEKGPVSVAIEADQQSFQMYSSGVFSGPCGTNLDHGVLVVGY-TSD----YWIV 291

Query: 296 KNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVS 333
           KNSWG  WG+ GY  ++R +    G CGIAM  S+P++
Sbjct: 292 KNSWGASWGDQGYIMMKRGVSS-AGICGIAMQPSYPIA 328


>gi|53791858|dbj|BAD53944.1| putative cysteine protease [Oryza sativa Japonica Group]
          Length = 335

 Score =  242 bits (618), Expect = 2e-61,   Method: Compositional matrix adjust.
 Identities = 145/343 (42%), Positives = 190/343 (55%), Gaps = 29/343 (8%)

Query: 5   FLIVVLIISGSCASQATYRT-FDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLV 63
            L+  L+   + A+ A Y    D+G   + FE+W A++G+TYK   E   RF IF+DN+ 
Sbjct: 6   LLVCTLMALQAMAASAYYNNGSDDGVTMQMFEEWMAKFGKTYKCHGEKEHRFGIFRDNVH 65

Query: 64  AVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQV 123
            +  +      + +  + +N+FADLT  EF+A+ TG K      +      P        
Sbjct: 66  FIRGYKPQVTYDSA--VGINQFADLTNDEFVATYTGAKPPHPKEA------PRPVDPIWT 117

Query: 124 PPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDN 176
           P  ++W  +GAVT VK QG C       AVAA+EG+  I+  +L  LSEQ+LVDC TN  
Sbjct: 118 PCCIDWRFRGAVTGVKDQGACGSCWAFAAVAAIEGLTKIRTGQLTPLSEQELVDCDTN-- 175

Query: 177 NNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAED----HAAQITNYEDV 232
           +NGC GG  D AF+ +    GIT ++ Y YEG   G C   + +D    HAA I  Y  V
Sbjct: 176 SNGCGGGHTDRAFELVASKGGITAESDYRYEGFQ-GKC---RVDDMLFNHAASIGGYRAV 231

Query: 233 PPNDEESLLKAVANQPVSVAIDAS--ALQFYSGGVFNGYCETFLNHGVTAVGYGT-SEEG 289
           PPNDE  L  AVA QPV+V IDAS  A QFY  GVF G C    NH VT VGY      G
Sbjct: 232 PPNDERQLATAVARQPVTVYIDASGPAFQFYKSGVFPGPCGASSNHAVTLVGYCQDGASG 291

Query: 290 IKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
            KYWL KNSWG+ WG+ GY  L++DI QP G CG+A+   +P 
Sbjct: 292 KKYWLAKNSWGKTWGQQGYILLEKDIVQPHGTCGLAVSPFYPT 334


>gi|18413507|ref|NP_567377.1| putative cysteine proteinase [Arabidopsis thaliana]
 gi|30315953|sp|Q9SUS9.1|CPR4_ARATH RecName: Full=Probable cysteine proteinase At4g11320; Flags:
           Precursor
 gi|5596478|emb|CAB51416.1| drought-inducible cysteine proteinase RD21A precursor-like protein
           [Arabidopsis thaliana]
 gi|7267831|emb|CAB81233.1| drought-inducible cysteine proteinase RD21A precursor-like protein
           [Arabidopsis thaliana]
 gi|14334764|gb|AAK59560.1| putative cysteine proteinase [Arabidopsis thaliana]
 gi|15293257|gb|AAK93739.1| putative cysteine proteinase [Arabidopsis thaliana]
 gi|332657596|gb|AEE82996.1| putative cysteine proteinase [Arabidopsis thaliana]
          Length = 371

 Score =  242 bits (617), Expect = 2e-61,   Method: Compositional matrix adjust.
 Identities = 138/316 (43%), Positives = 189/316 (59%), Gaps = 29/316 (9%)

Query: 34  FEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERF-NNAAIGNRSYTLRLNKFADLTPQE 92
           FE W  ++G+ Y   AE  +R  IF+DNL    RF  N    N SY L LN+FADL+  E
Sbjct: 56  FESWMVKHGKVYDSVAEKERRLTIFEDNL----RFITNRNAENLSYRLGLNRFADLSLHE 111

Query: 93  FIASQTGFKMS---DHSSSLKANGTPFLYKSSQ---VPPSVNWIEKGAVTPVKYQGQC-- 144
           +     G       +H     +N     YK+S    +P SV+W  +GAVT VK QG C  
Sbjct: 112 YGEICHGADPRPPRNHVFMTSSN----RYKTSDGDVLPKSVDWRNEGAVTEVKDQGLCRS 167

Query: 145 -----AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGIT 199
                 V AVEG+N I    LV+LSEQ L++C  N  NNGC GG ++ A+++I+ N G+ 
Sbjct: 168 CWAFSTVGAVEGLNKIVTGELVTLSEQDLINC--NKENNGCGGGKVETAYEFIMNNGGLG 225

Query: 200 NDAVYSYEGMSTGICDS-IKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA- 257
            D  Y Y+ ++ G+C+  +K ++    I  YE++P NDE +L+KAVA+QPV+  +D+S+ 
Sbjct: 226 TDNDYPYKALN-GVCEGRLKEDNKNVMIDGYENLPANDEAALMKAVAHQPVTAVVDSSSR 284

Query: 258 -LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDID 316
             Q Y  GVF+G C T LNHGV  VGYGT E G  YW++KNS G  WGE GY ++ R+I 
Sbjct: 285 EFQLYESGVFDGTCGTNLNHGVVVVGYGT-ENGRDYWIVKNSRGDTWGEAGYMKMARNIA 343

Query: 317 QPQGQCGIAMFASFPV 332
            P+G CGIAM AS+P+
Sbjct: 344 NPRGLCGIAMRASYPL 359


>gi|302831223|ref|XP_002947177.1| hypothetical protein VOLCADRAFT_103269 [Volvox carteri f.
           nagariensis]
 gi|300267584|gb|EFJ51767.1| hypothetical protein VOLCADRAFT_103269 [Volvox carteri f.
           nagariensis]
          Length = 514

 Score =  241 bits (615), Expect = 3e-61,   Method: Compositional matrix adjust.
 Identities = 152/349 (43%), Positives = 197/349 (56%), Gaps = 49/349 (14%)

Query: 34  FEQWKAQYGRTYKE-SAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQE 92
           F  W  QYGRTY E S E ++R  IF DN+ A++  +    G    TL LN++ADLT +E
Sbjct: 38  FTLWSRQYGRTYVEQSPEYTRRLSIFSDNVRAIQESHEKDPG---VTLALNEYADLTWEE 94

Query: 93  FIASQTGFK-----MSDHSSSLKANGTPFLYKSS-QVPPSVNWIEKGAVTPVKYQGQCA- 145
           F +++ G +     +   S    +    + Y ++   P +++W EKGAV  VK QGQC  
Sbjct: 95  FSSTRLGLRIDQDQLDRRSRRSASRRNAWRYAAAVDNPKAIDWREKGAVAEVKNQGQCGS 154

Query: 146 ------VAAVEGINAIKINRLVSLSEQQLVDCAT-------------------------N 174
                   A+EGINAI   +L SLSEQQLVDC T                         N
Sbjct: 155 CWAFSTTGAIEGINAIVTGQLQSLSEQQLVDCDTGKRTVTRSKRSCTVILPSYSSNSCRN 214

Query: 175 DNNNGCYGGFMDDAFKYIIQNKGITNDAVYSY-EGMSTGI-CDSIKAEDH-AAQITNYED 231
           ++N GC GG MDDAFKY+IQN G+  +  Y+Y  G   G  C+  K  D  A  I  YED
Sbjct: 215 ESNMGCSGGLMDDAFKYVIQNGGLDTEQDYAYWSGYGLGFWCNKRKQTDRPAVSIDGYED 274

Query: 232 VPPNDEESLLKAVANQPVSVAIDASA-LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGI 290
           VP   E++LLKAVA+QPV+VAI A A +QFYS GV +  CE  LNHGV  VGY  S++G 
Sbjct: 275 VP-QGEDNLLKAVAHQPVAVAICAGASMQFYSRGVISTCCEG-LNHGVLTVGYNVSQDGE 332

Query: 291 KYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVSKESAQP 339
           KYW++KNSWG  WGE GYFRL+  + +  G CGIA  AS+P      +P
Sbjct: 333 KYWIVKNSWGAGWGEQGYFRLKMGVGE-TGLCGIASAASYPTKTSPNKP 380


>gi|413943290|gb|AFW75939.1| maize insect resistance1 [Zea mays]
          Length = 435

 Score =  241 bits (615), Expect = 4e-61,   Method: Compositional matrix adjust.
 Identities = 145/344 (42%), Positives = 196/344 (56%), Gaps = 44/344 (12%)

Query: 30  IAEKFEQWKAQYGRTYK------------ESAENSK-RFEIFKDNLVAVERFN-NAAIGN 75
           +   +E WK+++GR               E  E+ + R E+F+DNL  +++ N  A  G 
Sbjct: 80  VRRMYEAWKSKHGRGGSSNDDCDMAPGDDEQEEDRRLRLEVFRDNLRYIDKHNAEADAGL 139

Query: 76  RSYTLRLNKFADLTPQEFIASQTGFKMSD----------HSSSLKANGTPFLYKSSQVPP 125
            ++ L L  FADLT  E+     GF+             H    +  G   L      P 
Sbjct: 140 HTFRLGLTPFADLTLDEYRGRVLGFRARARRSGARYGHGHGYRARPRGGDLL------PD 193

Query: 126 SVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNN 178
           +++W + GAVT VK Q QC       AVAA+EGINAI    LVSLSEQ+++DC   D+  
Sbjct: 194 AIDWRQLGAVTEVKDQQQCGGCWAFSAVAAIEGINAIATGNLVSLSEQEIIDCDAQDS-- 251

Query: 179 GCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDH-AAQITNYEDVPPNDE 237
           GC GG M++AF+++I N GI  +A Y + G + G CD+ K  +   A I    +V  N+E
Sbjct: 252 GCDGGQMENAFRFVIGNGGIDTEADYPFIG-TDGTCDASKENNEKVATIDGLVEVASNNE 310

Query: 238 ESLLKAVANQPVSVAIDAS--ALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLI 295
            +L +AVA QPVSVAIDAS  A Q YS G+FNG C T L+HGVTAVGYG SE G  YW++
Sbjct: 311 TALQEAVAIQPVSVAIDASGRAFQHYSSGIFNGPCGTSLDHGVTAVGYG-SESGKDYWIV 369

Query: 296 KNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVSKESAQP 339
           KNSW   WGE GY R++R++ +P G+CGIAM AS+PV      P
Sbjct: 370 KNSWSASWGEAGYIRMRRNVPRPTGKCGIAMDASYPVKDTYHDP 413


>gi|424513619|emb|CCO66241.1| predicted protein [Bathycoccus prasinos]
          Length = 396

 Score =  241 bits (614), Expect = 4e-61,   Method: Compositional matrix adjust.
 Identities = 134/333 (40%), Positives = 187/333 (56%), Gaps = 26/333 (7%)

Query: 23  RTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAI-GNRSYTLR 81
           R   E  I + F+ W  +Y +    + E  KR +IF +N + V   N   + G  S+ + 
Sbjct: 61  RVLRESKIEDAFDAWLVKYDKEIANAEERLKRLKIFGENYLFVLEHNAKYVAGKVSHYVE 120

Query: 82  LNKFADLTPQEFIASQTGFKMS----DHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTP 137
           +NKFA  T +E+     GFK S      S     + + + Y+  + P S++W+++G +T 
Sbjct: 121 MNKFAAHTREEY-RKMLGFKKSLRRKKDSGEAAKDVSLWEYEGVEAPESIDWVDEGVITT 179

Query: 138 VKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFK 190
            K QG C       A+ AVEGINAI+  +LVSLSEQ+LV CA    N GC GG MD+AF+
Sbjct: 180 PKNQGSCGSCWAFSAIGAVEGINAIRTGKLVSLSEQELVSCAREGGNQGCNGGLMDNAFE 239

Query: 191 YIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVS 250
           +I++N G+ ++  Y Y+  S   C + K   H A I  + DVP NDE +L KAV+ QPVS
Sbjct: 240 WIVENGGVDSEKQYQYKA-SFDDCKTRKTLLHIASIDGFNDVPSNDETALKKAVSQQPVS 298

Query: 251 VAIDAS--ALQFYSGGVFNGY-CETFLNHGVTAVGYGTSEEG---------IKYWLIKNS 298
           VAI+A   + Q Y GGV++   C T L+HGV  VGYG               KYW IKNS
Sbjct: 299 VAIEADQRSFQLYGGGVYHAEDCGTQLDHGVLVVGYGIDHNSSNVIIPGATKKYWKIKNS 358

Query: 299 WGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFP 331
           W + WGE GY R+ RD++ P G CG+A  AS+P
Sbjct: 359 WSEQWGEGGYIRIARDVESPSGMCGVAEMASYP 391


>gi|255078398|ref|XP_002502779.1| cysteine endopeptidase [Micromonas sp. RCC299]
 gi|226518045|gb|ACO64037.1| cysteine endopeptidase [Micromonas sp. RCC299]
          Length = 414

 Score =  241 bits (614), Expect = 5e-61,   Method: Compositional matrix adjust.
 Identities = 130/323 (40%), Positives = 186/323 (57%), Gaps = 17/323 (5%)

Query: 24  TFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFN-NAAIGNRSYTLRL 82
           T + GS+++ F +W  ++G+TY    E   R +IF DN   V++ N     G  ++ + L
Sbjct: 58  TKEVGSLSDLFHEWTQKHGKTYDSEEEKELRLKIFADNHEFVQKHNAEYENGEHTHFVGL 117

Query: 83  NKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQG 142
           N  ADLT  EF     G+  +  +S    + + + Y     P  ++W+  GAVTPVK Q 
Sbjct: 118 NHLADLTKDEF-KKMLGYNAALRASRAPVDASTWEYADVTPPEEIDWVASGAVTPVKNQK 176

Query: 143 QCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQN 195
           QC          AVEG+NAIK  +L+SLSE++L+ C+TN  N GC GG MD+ F++I+ N
Sbjct: 177 QCGSCWAFSTTGAVEGVNAIKTGKLISLSEEELISCSTN-GNMGCNGGLMDNGFEWIVNN 235

Query: 196 KGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDA 255
           +GI  +  + Y       C   +    A  I  ++DVP NDE+SL+KAV+ QPVSVAI+A
Sbjct: 236 RGIDTEDGWEYVAKEEK-CGFFRRHHRAVAIDGFKDVPSNDEDSLMKAVSQQPVSVAIEA 294

Query: 256 S--ALQFYSGGVFNGY-CETFLNHGVTAVGYGTSEEGIK---YWLIKNSWGQDWGEDGYF 309
              + Q Y+GGV++   C T L+HGV  VGYG   +  K   +W IKNSWG  WGEDGY 
Sbjct: 295 DHQSFQLYAGGVYSAKDCGTELDHGVLLVGYGVDPKSTKHKHFWKIKNSWGPAWGEDGYI 354

Query: 310 RLQRDIDQPQGQCGIAMFASFPV 332
           R+ +     +GQCG+AM  S+P 
Sbjct: 355 RIAKGGSGVEGQCGVAMQPSYPT 377


>gi|242093994|ref|XP_002437487.1| hypothetical protein SORBIDRAFT_10g027980 [Sorghum bicolor]
 gi|241915710|gb|EER88854.1| hypothetical protein SORBIDRAFT_10g027980 [Sorghum bicolor]
          Length = 341

 Score =  241 bits (614), Expect = 5e-61,   Method: Compositional matrix adjust.
 Identities = 138/325 (42%), Positives = 195/325 (60%), Gaps = 33/325 (10%)

Query: 18  SQATYRTFDEGSIAEKFEQWKAQYGRTYKE-SAENSKRFEIFKDNLVAVERFN-NAAIGN 75
           S A     DE  + + ++ WK+++GR     S  +  R ++F+DNL  ++  N  A  G 
Sbjct: 36  SAAPLERADE-EVRQLYKTWKSEHGRPRDGISVADGLRLKVFRDNLRYIDAHNAEADAGL 94

Query: 76  RSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKS-SQVPPSVNWIEKGA 134
            ++ L L  F DLT +EF A   GF    +S+  +     +L ++   +P +V+W ++GA
Sbjct: 95  HTFRLGLTPFTDLTLEEFRAHALGFL---NSTLPRVASDRYLPRAGDDLPDAVDWRQQGA 151

Query: 135 VTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDD 187
           VT VK Q  C       AVAA+EGIN I  N L+SLSEQ+L+DC T D   GC GG M  
Sbjct: 152 VTGVKNQLDCGGCWAFSAVAAMEGINKIVTNNLISLSEQELIDCDTEDY--GCQGGEMQK 209

Query: 188 AFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQ 247
           AF+++I N GI  +A Y + G + G CD+I+ +     I +YE+VP NDEE+L KAVANQ
Sbjct: 210 AFQFVIDNGGIDTEADYPFIG-TNGTCDAIREKRKVVSIDSYENVPTNDEEALQKAVANQ 268

Query: 248 PVSVAIDASALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDG 307
           P               G+FNG C   L+HGVTAVGYG S+ G  +W++KNSWG +WGE G
Sbjct: 269 P---------------GIFNGPCGFILDHGVTAVGYG-SDNGEDFWIVKNSWGAEWGESG 312

Query: 308 YFRLQRDIDQPQGQCGIAMFASFPV 332
           Y R++R++  P G+CGIAM+AS+PV
Sbjct: 313 YIRMKRNVLLPMGKCGIAMYASYPV 337


>gi|357166364|ref|XP_003580686.1| PREDICTED: oryzain alpha chain-like [Brachypodium distachyon]
          Length = 360

 Score =  241 bits (614), Expect = 5e-61,   Method: Compositional matrix adjust.
 Identities = 140/312 (44%), Positives = 187/312 (59%), Gaps = 19/312 (6%)

Query: 34  FEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAA-IGNRSYTLRLNKFADLTPQE 92
           + +W AQ+G     + E   R+E F+DNL  ++  N AA  G  S+ L LN+FA LT +E
Sbjct: 43  YAEWTAQHGSPI--TNEEEGRYEAFRDNLRYIDEHNAAADAGIHSFRLGLNRFAGLTNEE 100

Query: 93  FIASQTGFKM-SDHSSSLKANGTPFLYKSSQ-VPPSVNWIEKGAVTPVKYQGQC------ 144
           + A+  G ++ S     L+     +     + +P SV+W EKGAV  VK QG+       
Sbjct: 101 YRAAYLGLRLRSGAVGDLRKPSARYEAADGEALPESVDWREKGAVGKVKDQGRSCGSAWA 160

Query: 145 --AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDA 202
             A+AAVE IN I    L+SLSEQ+L+DC T+  N GC GG MDDAF++II N GI  D 
Sbjct: 161 FSAIAAVESINQIVTGELISLSEQELMDCDTS-YNAGCDGGLMDDAFEFIISNGGIDTDE 219

Query: 203 VYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA--LQF 260
            Y Y+  +   CD+ K    A  I +YED+  N E+SL KAV+NQPVSVAI+A     Q 
Sbjct: 220 DYPYKARNDS-CDANKRNRKAVTIDDYEDLRMN-EKSLQKAVSNQPVSVAIEAGGRDFQL 277

Query: 261 YSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQG 320
           Y  G+F G C T L+H  T VGYG SE G  YW++K S+G  WGE GY R++R+I +  G
Sbjct: 278 YKSGIFTGTCGTDLDHATTIVGYG-SENGTDYWIVKESYGTSWGESGYARMERNIKETSG 336

Query: 321 QCGIAMFASFPV 332
           +CGIAM  S+PV
Sbjct: 337 KCGIAMLPSYPV 348


>gi|15290195|dbj|BAB63884.1| putative cysteine protease [Oryza sativa Japonica Group]
 gi|125525813|gb|EAY73927.1| hypothetical protein OsI_01811 [Oryza sativa Indica Group]
          Length = 342

 Score =  240 bits (613), Expect = 7e-61,   Method: Compositional matrix adjust.
 Identities = 148/354 (41%), Positives = 196/354 (55%), Gaps = 41/354 (11%)

Query: 1   MAKYFLIVV--LIISGSCASQATYRT-FDEGSIAEKFEQWKAQYGRTYKESAENSKRFEI 57
           MA   L+VV  L+   +  + A Y    D+G   + FE+W A++G+TYK   E   RF I
Sbjct: 7   MASAVLLVVCTLMALQAMGADAYYNNGSDDGVTMQMFEEWMAKFGKTYKCHGEKEHRFGI 66

Query: 58  FKDNLVAVERFN-----NAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKAN 112
           F+DN+  +  +      ++A+G       +N+FADLT  EF+A+ TG K      +    
Sbjct: 67  FRDNVHFIRGYKPQVTYDSAVG-------INQFADLTNDEFVATYTGAKPPHPKEA---- 115

Query: 113 GTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSE 165
             P        P  ++W  +GAVT VK QG C       AVAA+EG+  I+  +L  LSE
Sbjct: 116 --PRPVDPIWTPCCIDWRFRGAVTGVKDQGACGSCWAFAAVAAIEGLTKIRTGQLTPLSE 173

Query: 166 QQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAED---- 221
           Q+LVDC TN  +NGC GG  D AF+ +    GIT ++ Y YEG   G C   + +D    
Sbjct: 174 QELVDCDTN--SNGCGGGHTDRAFELVASKGGITAESDYRYEGFQ-GKC---RVDDMLFN 227

Query: 222 HAAQITNYEDVPPNDEESLLKAVANQPVSVAIDAS--ALQFYSGGVFNGYCETFLNHGVT 279
           HAA+I  Y  VPPNDE  L  AVA QPV+V IDAS  A QFY  GVF G C    NH VT
Sbjct: 228 HAARIGGYRAVPPNDERQLATAVARQPVTVYIDASGPAFQFYKSGVFPGPCGASSNHAVT 287

Query: 280 AVGYGT-SEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
            VGY      G KYW+ KNSWG+ WG+ GY  L++D+ QP G CG+A+   +P 
Sbjct: 288 LVGYCQDGASGKKYWVAKNSWGKTWGQQGYILLEKDVLQPHGTCGLAVSPFYPT 341


>gi|297809383|ref|XP_002872575.1| hypothetical protein ARALYDRAFT_911472 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297318412|gb|EFH48834.1| hypothetical protein ARALYDRAFT_911472 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 371

 Score =  240 bits (612), Expect = 8e-61,   Method: Compositional matrix adjust.
 Identities = 141/327 (43%), Positives = 193/327 (59%), Gaps = 27/327 (8%)

Query: 34  FEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERF-NNAAIGNRSYTLRLNKFADLTPQE 92
           F+ W  ++G+ Y   AE  +R  IF+DNL    RF +N    N SY L L +FADL+  E
Sbjct: 56  FDSWMVKHGKVYGSVAEKERRLTIFEDNL----RFISNRNAENLSYRLGLTQFADLSLHE 111

Query: 93  FIASQTGFKMSDHSSSLKANGTPFLYKSSQ---VPPSVNWIEKGAVTPVKYQGQC----- 144
           +     G       + +    +   YK+S    +P SV+W  +GAVT VK QG C     
Sbjct: 112 YGEVCHGADPRPPRNHVFMTSSD-RYKTSAGDVLPKSVDWRNEGAVTEVKDQGHCRSCWA 170

Query: 145 --AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDA 202
              V AVEG+N I    LV+LSEQ L++C  N  NNGC GG ++ A+++I++N G+  D 
Sbjct: 171 FSTVGAVEGLNKIVTGELVTLSEQDLINC--NKENNGCGGGKVETAYEFIMKNGGLGTDN 228

Query: 203 VYSYEGMSTGICDS-IKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA--LQ 259
            Y Y+ ++ G+CD  +K  +    I  +E++P NDE +L+KAVA+QPV+  ID+S+   Q
Sbjct: 229 DYPYKAVN-GVCDGRLKENNKNVMIDGFENLPANDEFALMKAVAHQPVTAVIDSSSREFQ 287

Query: 260 FYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQ 319
            Y  GVF+G C T LNHGV  VGYGT E G  YWL+KNS G  WGE GY ++ R+I  P+
Sbjct: 288 LYESGVFDGSCGTNLNHGVVVVGYGT-ENGRDYWLVKNSRGNTWGEAGYMKMARNIANPR 346

Query: 320 GQCGIAMFASFPVSKESAQPSSADKSS 346
           G CGIAM AS+P+        S DKSS
Sbjct: 347 GLCGIAMRASYPLK----NSFSTDKSS 369


>gi|125570286|gb|EAZ11801.1| hypothetical protein OsJ_01675 [Oryza sativa Japonica Group]
          Length = 319

 Score =  239 bits (611), Expect = 1e-60,   Method: Compositional matrix adjust.
 Identities = 140/321 (43%), Positives = 181/321 (56%), Gaps = 28/321 (8%)

Query: 26  DEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKF 85
           D+G   + FE+W A++G+TYK   E   RF IF+DN+  +  +      + +  + +N+F
Sbjct: 12  DDGVTMQMFEEWMAKFGKTYKCHGEKEHRFGIFRDNVHFIRGYKPQVTYDSA--VGINQF 69

Query: 86  ADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQC- 144
           ADLT  EF+A+ TG K      +      P        P  ++W  +GAVT VK QG C 
Sbjct: 70  ADLTNDEFVATYTGAKPPHPKEA------PRPVDPIWTPCCIDWRFRGAVTGVKDQGACG 123

Query: 145 ------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGI 198
                 AVAA+EG+  I+  +L  LSEQ+LVDC TN  +NGC GG  D AF+ +    GI
Sbjct: 124 SCWAFAAVAAIEGLTKIRTGQLTPLSEQELVDCDTN--SNGCGGGHTDRAFELVASKGGI 181

Query: 199 TNDAVYSYEGMSTGICDSIKAED----HAAQITNYEDVPPNDEESLLKAVANQPVSVAID 254
           T ++ Y YEG   G C   + +D    HAA I  Y  VPPNDE  L  AVA QPV+V ID
Sbjct: 182 TAESDYRYEGFQ-GKC---RVDDMLFNHAASIGGYRAVPPNDERQLATAVARQPVTVYID 237

Query: 255 AS--ALQFYSGGVFNGYCETFLNHGVTAVGYGT-SEEGIKYWLIKNSWGQDWGEDGYFRL 311
           AS  A QFY  GVF G C    NH VT VGY      G KYWL KNSWG+ WG+ GY  L
Sbjct: 238 ASGPAFQFYKSGVFPGPCGASSNHAVTLVGYCQDGASGKKYWLAKNSWGKTWGQQGYILL 297

Query: 312 QRDIDQPQGQCGIAMFASFPV 332
           ++DI QP G CG+A+   +P 
Sbjct: 298 EKDIVQPHGTCGLAVSPFYPT 318


>gi|194320502|gb|ACF48469.1| cathepsin L [Triatoma brasiliensis]
          Length = 330

 Score =  239 bits (611), Expect = 1e-60,   Method: Compositional matrix adjust.
 Identities = 149/341 (43%), Positives = 197/341 (57%), Gaps = 28/341 (8%)

Query: 5   FLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVA 64
            L+ V II+ S A    + +FD     E++  +KA +G+TYK   E   R +IF DN   
Sbjct: 4   LLVAVAIIALSYA----HPSFD--IYPEEWHVFKAMHGKTYKNQFEEMFRMKIFMDNKKK 57

Query: 65  VERFN-NAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQV 123
           +E  N     G  SY + +N F DL   EF A   GFKMS  +   K NG  +   +S +
Sbjct: 58  IEAHNAKYEQGEVSYKMMMNHFGDLMVHEFKALMNGFKMSPDT---KRNGELYFPSNSNL 114

Query: 124 PPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDN 176
           P +V+W +KGAVTPVK QGQC       A  ++EG   +K  +LVSLSEQ LVDC+T+  
Sbjct: 115 PKTVDWRQKGAVTPVKDQGQCGSCWSFSATGSLEGQVFLKTGKLVSLSEQNLVDCSTSYG 174

Query: 177 NNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPND 236
           NNGC GG MD AF+Y+  NKGI  +A Y YE      C   K          + D+P  D
Sbjct: 175 NNGCEGGLMDQAFQYVSDNKGIDTEASYPYEAREN-TC-RFKKNKVGGTDKGHVDIPAGD 232

Query: 237 EESLLKAVAN-QPVSVAIDAS--ALQFYSGGVFN-GYCETF-LNHGVTAVGYGTSEEGIK 291
           E++L  A+A   P+SVAIDA+  + QFYS GV+N   C ++ L+HGV AVGYGT E G  
Sbjct: 233 EKALQNALATVGPISVAIDANHGSFQFYSKGVYNEPNCSSYDLDHGVLAVGYGT-ENGQD 291

Query: 292 YWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
           YWL+KNSWG  WGE+GY ++ R+       CGIA  AS+P+
Sbjct: 292 YWLVKNSWGPSWGENGYIKIARNHSN---HCGIASMASYPL 329


>gi|242088413|ref|XP_002440039.1| hypothetical protein SORBIDRAFT_09g024940 [Sorghum bicolor]
 gi|241945324|gb|EES18469.1| hypothetical protein SORBIDRAFT_09g024940 [Sorghum bicolor]
          Length = 463

 Score =  239 bits (609), Expect = 2e-60,   Method: Compositional matrix adjust.
 Identities = 140/318 (44%), Positives = 186/318 (58%), Gaps = 26/318 (8%)

Query: 34  FEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFN---NAAIGN---RSYTLRLNKFAD 87
           F+ W A++G+ Y    E + R  +F DN   V   N   NAA G     SYTL LN FAD
Sbjct: 41  FDAWCAEHGKAYATPEERAARLAVFADNAAFVAAHNARVNAAGGGGAPPSYTLALNAFAD 100

Query: 88  LTPQEFIASQTGFKMSDHSSSLKANGTPFLYKS-----SQVPPSVNWIEKGAVTPVKYQG 142
           LT +EF A++ G +++  +++L++   P +Y+        VP +++W E GAVT VK QG
Sbjct: 101 LTHEEFRAARLG-RIAAGAAALRSPAAP-VYRGLDGGLGAVPDALDWRENGAVTKVKDQG 158

Query: 143 QC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQN 195
            C       A  A+EGIN IK   LVSLSEQ+L+DC     N+GC GG MD A+K++++N
Sbjct: 159 SCGACWSFSATGAMEGINKIKTGSLVSLSEQELIDC-DRSYNSGCGGGLMDYAYKFVVKN 217

Query: 196 KGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAI-- 253
            GI  +  Y Y   + G C+  K +     I  Y DVP N E+ LL+AVA QPVSV I  
Sbjct: 218 GGIDTEEDYPYR-EADGTCNKNKLKKRIVTIDGYSDVPSNKEDLLLQAVAQQPVSVGICG 276

Query: 254 DASALQFYSG-GVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQ 312
            A A Q YS  G+F+G C T L+H V  VGYG SE G  YW++KNSWG+ WG  GY  + 
Sbjct: 277 SARAFQLYSQQGIFDGPCPTSLDHAVLIVGYG-SEGGKDYWIVKNSWGESWGMKGYMHMH 335

Query: 313 RDIDQPQGQCGIAMFASF 330
           R+    +G CGI M ASF
Sbjct: 336 RNTGDSKGVCGINMMASF 353


>gi|558563|emb|CAA57538.1| cysteine proteinase [Cicer arietinum]
          Length = 325

 Score =  238 bits (608), Expect = 3e-60,   Method: Compositional matrix adjust.
 Identities = 141/311 (45%), Positives = 175/311 (56%), Gaps = 19/311 (6%)

Query: 34  FEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEF 93
           +E+W  ++ + Y    E   RF+IFKDNL  ++  N     N SY + LNKFAD+  +E+
Sbjct: 4   YEKWLVKHQKMYNGLGEKDTRFQIFKDNLRFIDEHNAQ---NYSYKVGLNKFADINNEEY 60

Query: 94  IASQTGFKMSDHSSSLKA--NGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQCA------ 145
                G K       +K    G    Y S  V   V+W  KGAVT +K QG C       
Sbjct: 61  RDMYLGTKSDAKRRVMKTKITGHRITYNSVIVTVKVDWRLKGAVTHIKDQGSCGSCWAFS 120

Query: 146 -VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVY 204
            +A VE IN I   + VSLSEQ+LVDC     N GC GG MD AF++II+N GI  D  Y
Sbjct: 121 TIATVEAINKIVTGKFVSLSEQELVDC-DRAFNEGCNGGLMDYAFEFIIRNGGIDTDQDY 179

Query: 205 SYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDA--SALQFYS 262
            Y G     CD  K       I  YEDVP     +L KAVA+QPVSVAI     ALQ Y 
Sbjct: 180 PYNGFERK-CDPTKKNAKVVSIDGYEDVPSY-MNALKKAVAHQPVSVAIAGLGRALQLYQ 237

Query: 263 GGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRL-QRDIDQPQGQ 321
            GVF G C T L+HGV  VGYG SE G+ YWL++NSWG +WGEDGYF++  R++     +
Sbjct: 238 SGVFTGKCGTDLDHGVVVVGYG-SENGVDYWLVRNSWGTNWGEDGYFKIASRNVKSLYRK 296

Query: 322 CGIAMFASFPV 332
           CGIAM AS+PV
Sbjct: 297 CGIAMEASYPV 307


>gi|49387634|dbj|BAD25828.1| putative cysteine proteinase [Oryza sativa Japonica Group]
 gi|49388888|dbj|BAD26098.1| putative cysteine proteinase [Oryza sativa Japonica Group]
          Length = 358

 Score =  238 bits (607), Expect = 3e-60,   Method: Compositional matrix adjust.
 Identities = 134/319 (42%), Positives = 188/319 (58%), Gaps = 24/319 (7%)

Query: 30  IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLT 89
           + ++F  W+  + R+Y  + E  +RF++++ N   ++  N    G+ +Y L  N+FADLT
Sbjct: 43  MMDRFRAWQGAHNRSYPSAEEALQRFDVYRRNAEFIDAVN--LRGDLTYQLAENEFADLT 100

Query: 90  PQEFIASQTGFKMSDHS--SSLKANGT-----PFLYKSSQVPPSVNWIEKGAVTPVKYQ- 141
            +EF+A+ TG+   D     S+   G       F Y+   VP SV+W  +GAV P K Q 
Sbjct: 101 EEEFLATYTGYYAGDGPVDDSVITTGAGDVDASFSYRV-DVPASVDWRAQGAVVPPKSQT 159

Query: 142 GQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQ 194
             C+        A +E +N IK  +LVSLSEQQLVDC + D   GC  G    A+K++++
Sbjct: 160 STCSSCWAFVTAATIESLNMIKTGKLVSLSEQQLVDCDSYDG--GCNLGSYGRAYKWVVE 217

Query: 195 NKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAID 254
           N G+T +A Y Y     G C+  K+  HAA+IT +  VPP +E +L  AVA QPV+VAI+
Sbjct: 218 NGGLTTEADYPYTARR-GPCNRAKSAHHAAKITGFGKVPPRNEAALQAAVARQPVAVAIE 276

Query: 255 -ASALQFYSGGVFNGYCETFLNHGVTAVGYGT-SEEGIKYWLIKNSWGQDWGEDGYFRLQ 312
             S +QFY GGV+ G C T L H VT VGYGT +  G KYW IKNSWGQ WGE GY R+ 
Sbjct: 277 VGSGMQFYKGGVYTGPCGTRLAHAVTVVGYGTDASSGAKYWTIKNSWGQSWGERGYIRIL 336

Query: 313 RDIDQPQGQCGIAMFASFP 331
           RD+  P G CG+ +  ++P
Sbjct: 337 RDVGGP-GLCGVTLDIAYP 354


>gi|218202077|gb|EEC84504.1| hypothetical protein OsI_31195 [Oryza sativa Indica Group]
          Length = 362

 Score =  238 bits (607), Expect = 3e-60,   Method: Compositional matrix adjust.
 Identities = 134/319 (42%), Positives = 188/319 (58%), Gaps = 24/319 (7%)

Query: 30  IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLT 89
           + ++F  W+  + R+Y  + E  +RF++++ N   ++  N    G+ +Y L  N+FADLT
Sbjct: 47  MMDRFRAWQGAHNRSYPSAEEALQRFDVYRRNAEFIDAVN--LRGDLTYRLAENEFADLT 104

Query: 90  PQEFIASQTGFKMSDHS--SSLKANGT-----PFLYKSSQVPPSVNWIEKGAVTPVKYQ- 141
            +EF+A+ TG+   D     S+   G       F Y+   VP SV+W  +GAV P K Q 
Sbjct: 105 EEEFLATYTGYYAGDGPVDDSVITTGAGDVDASFSYRV-DVPASVDWRAQGAVVPPKSQT 163

Query: 142 GQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQ 194
             C+        A +E +N IK  +LVSLSEQQLVDC + D   GC  G    A+K++++
Sbjct: 164 STCSSCWAFVTAATIESLNMIKTGKLVSLSEQQLVDCDSYDG--GCNLGSYGRAYKWVVE 221

Query: 195 NKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAID 254
           N G+T +A Y Y     G C+  K+  HAA+IT +  VPP +E +L  AVA QPV+VAI+
Sbjct: 222 NGGLTTEADYPYTARR-GPCNRAKSAHHAAKITGFGKVPPRNEAALQAAVARQPVAVAIE 280

Query: 255 -ASALQFYSGGVFNGYCETFLNHGVTAVGYGT-SEEGIKYWLIKNSWGQDWGEDGYFRLQ 312
             S +QFY GGV+ G C T L H VT VGYGT +  G KYW IKNSWGQ WGE GY R+ 
Sbjct: 281 VGSGMQFYKGGVYTGPCGTRLAHAVTVVGYGTDASSGAKYWTIKNSWGQSWGERGYIRIL 340

Query: 313 RDIDQPQGQCGIAMFASFP 331
           RD+  P G CG+ +  ++P
Sbjct: 341 RDVGGP-GLCGVTLDIAYP 358


>gi|413944252|gb|AFW76901.1| hypothetical protein ZEAMMB73_101481 [Zea mays]
          Length = 232

 Score =  238 bits (606), Expect = 4e-60,   Method: Compositional matrix adjust.
 Identities = 119/231 (51%), Positives = 154/231 (66%), Gaps = 15/231 (6%)

Query: 114 TPFLYKSSQV---PPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSL 163
           T F Y++  V   P +++W   GAVTP+K QGQC       AVAA EGI  I   +L+SL
Sbjct: 4   TGFRYENVSVDAIPATIDWRTNGAVTPIKDQGQCGCCWAFSAVAATEGIVKISTGKLISL 63

Query: 164 SEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHA 223
           SEQ+LVDC     + GC GG MDDAFK+II+N G+T ++ Y Y   + G C S    + A
Sbjct: 64  SEQELVDCDVYGEDQGCEGGLMDDAFKFIIKNGGLTTESNYPYTA-ADGKCKS--GSNSA 120

Query: 224 AQITNYEDVPPNDEESLLKAVANQPVSVAIDASAL--QFYSGGVFNGYCETFLNHGVTAV 281
           A I  YEDVP NDE +L+KAVANQPVSVA+D   +  QFYSGGV  G C T L+HG+ A+
Sbjct: 121 ANIKGYEDVPTNDEAALMKAVANQPVSVAVDGGDMTFQFYSGGVMTGSCGTDLDHGIAAI 180

Query: 282 GYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
           GYG + +G KYWL+KNSWG  WGE+GY R+++DI   +G CG+A+  S+P 
Sbjct: 181 GYGKTSDGTKYWLMKNSWGTTWGENGYLRMEKDISDKKGMCGLAIEPSYPT 231


>gi|115478933|ref|NP_001063060.1| Os09g0381400 [Oryza sativa Japonica Group]
 gi|113631293|dbj|BAF24974.1| Os09g0381400 [Oryza sativa Japonica Group]
 gi|215678649|dbj|BAG92304.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|218202075|gb|EEC84502.1| hypothetical protein OsI_31193 [Oryza sativa Indica Group]
          Length = 362

 Score =  238 bits (606), Expect = 4e-60,   Method: Compositional matrix adjust.
 Identities = 134/319 (42%), Positives = 188/319 (58%), Gaps = 24/319 (7%)

Query: 30  IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLT 89
           + ++F  W+  + R+Y  + E  +RF++++ N   ++  N    G+ +Y L  N+FADLT
Sbjct: 47  MMDRFRAWQGAHNRSYPSAEEALQRFDVYRRNAEFIDAVN--LRGDLTYQLAENEFADLT 104

Query: 90  PQEFIASQTGFKMSDHS--SSLKANGT-----PFLYKSSQVPPSVNWIEKGAVTPVKYQ- 141
            +EF+A+ TG+   D     S+   G       F Y+   VP SV+W  +GAV P K Q 
Sbjct: 105 EEEFLATYTGYYAGDGPVDDSVITTGAGDVDASFSYRV-DVPASVDWRAQGAVVPPKSQT 163

Query: 142 GQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQ 194
             C+        A +E +N IK  +LVSLSEQQLVDC + D   GC  G    A+K++++
Sbjct: 164 STCSSCWAFVTAATIESLNMIKTGKLVSLSEQQLVDCDSYDG--GCNLGSYGRAYKWVVE 221

Query: 195 NKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAID 254
           N G+T +A Y Y     G C+  K+  HAA+IT +  VPP +E +L  AVA QPV+VAI+
Sbjct: 222 NGGLTTEADYPYTARR-GPCNRAKSAHHAAKITGFGKVPPRNEAALQAAVARQPVAVAIE 280

Query: 255 -ASALQFYSGGVFNGYCETFLNHGVTAVGYGT-SEEGIKYWLIKNSWGQDWGEDGYFRLQ 312
             S +QFY GGV+ G C T L H VT VGYGT +  G KYW IKNSWGQ WGE GY R+ 
Sbjct: 281 VGSGMQFYKGGVYTGPCGTRLAHAVTVVGYGTDASSGAKYWTIKNSWGQSWGERGYIRIL 340

Query: 313 RDIDQPQGQCGIAMFASFP 331
           RD+  P G CG+ +  ++P
Sbjct: 341 RDVGGP-GLCGVTLDIAYP 358


>gi|125525812|gb|EAY73926.1| hypothetical protein OsI_01810 [Oryza sativa Indica Group]
          Length = 319

 Score =  238 bits (606), Expect = 4e-60,   Method: Compositional matrix adjust.
 Identities = 138/321 (42%), Positives = 181/321 (56%), Gaps = 28/321 (8%)

Query: 26  DEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKF 85
           D+G   + FE+W A++G+TYK   E   RF IF+DN+  +  +      + +  + +N+F
Sbjct: 12  DDGVTMQMFEEWMAKFGKTYKCHGEKEHRFGIFRDNVHFIRGYKPQVTYDSA--VGINQF 69

Query: 86  ADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQC- 144
           ADLT  EF+A+ TG K      +      P        P  ++W  +GAVT VK QG C 
Sbjct: 70  ADLTNDEFVATYTGAKPPHPKEA------PRPVDPIWTPCCIDWRFRGAVTGVKDQGACG 123

Query: 145 ------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGI 198
                 AVAA+EG+  I+  +L  LSEQ+LVDC TN  +NGC GG  D AF+ +    GI
Sbjct: 124 SCWAFAAVAAIEGLTKIRTGQLTPLSEQELVDCDTN--SNGCGGGHTDRAFELVASKGGI 181

Query: 199 TNDAVYSYEGMSTGICDSIKAED----HAAQITNYEDVPPNDEESLLKAVANQPVSVAID 254
           T ++ Y YEG   G C   + +D    HAA I  Y  VPPNDE  L  AVA QPV+V ID
Sbjct: 182 TAESDYRYEGFQ-GKC---RVDDMLFNHAASIGGYRAVPPNDERQLATAVARQPVTVYID 237

Query: 255 AS--ALQFYSGGVFNGYCETFLNHGVTAVGYGT-SEEGIKYWLIKNSWGQDWGEDGYFRL 311
           AS  A QFY  GVF G C    NH VT VGY      G KYW+ KNSWG+ WG+ GY  L
Sbjct: 238 ASGPAFQFYKSGVFPGPCGASSNHAVTLVGYCQDGASGKKYWVAKNSWGKTWGQQGYILL 297

Query: 312 QRDIDQPQGQCGIAMFASFPV 332
           ++D+ QP G CG+A+   +P 
Sbjct: 298 EKDVLQPHGTCGLAVSPFYPT 318


>gi|341850671|gb|AEK97329.1| chromoplast senescence-associated protein 12 [Brassica rapa var.
           parachinensis]
          Length = 260

 Score =  238 bits (606), Expect = 4e-60,   Method: Compositional matrix adjust.
 Identities = 132/261 (50%), Positives = 168/261 (64%), Gaps = 16/261 (6%)

Query: 84  KFADLTPQEFIASQTGFKM-SDHSSSLKANGTPFLYK---SSQVPPSVNWIEKGAVTPVK 139
           +FA++T  EF +  TG+K  S  SS  +   T F Y+   S  +P +V+W +KGAVTP+K
Sbjct: 1   QFAEITNDEFRSMYTGYKGDSVLSSQSQTKSTSFRYQNVSSGALPIAVDWRKKGAVTPIK 60

Query: 140 YQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYI 192
            QG C       AVAA+EG   IK  +L+SLSEQQLVDC TND   GC GG +D AF++I
Sbjct: 61  NQGSCGCCWAFSAVAAIEGATQIKKGKLISLSEQQLVDCDTNDF--GCSGGLIDTAFEHI 118

Query: 193 IQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVA 252
           +   G+T ++ Y Y+G     C        AA IT YEDVP NDE +L+KAVA+QPVSV 
Sbjct: 119 MATGGLTTESNYPYKG-EDATCKIKSTXPSAASITGYEDVPVNDENALMKAVAHQPVSVG 177

Query: 253 IDASAL--QFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFR 310
           I+      QFYS GVF G C T+L+H VTAVGY  S  G KYW+IKNSWG  WGE GY R
Sbjct: 178 IEGGGFDFQFYSSGVFTGECTTYLDHAVTAVGYSQSSAGSKYWIIKNSWGTKWGEGGYMR 237

Query: 311 LQRDIDQPQGQCGIAMFASFP 331
           +++DI   +G CG+AM AS+P
Sbjct: 238 IKKDIKDKEGLCGLAMKASYP 258


>gi|2351107|dbj|BAA21929.1| bromelain [Ananas comosus]
          Length = 312

 Score =  237 bits (605), Expect = 5e-60,   Method: Compositional matrix adjust.
 Identities = 140/312 (44%), Positives = 187/312 (59%), Gaps = 25/312 (8%)

Query: 39  AQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQT 98
           A+YGR YK++ E  +RF+IFK+N+  +E FNN   GN SYTL +NKF D+T  EF+A  T
Sbjct: 2   AEYGRVYKDNDEKMRRFQIFKNNVNHIETFNNRN-GN-SYTLGINKFTDMTNNEFVAQYT 59

Query: 99  GFKMSDHSSSLKANGTPFL----YKSSQVPPSVNWIEKGAVTPVKYQGQC-------AVA 147
           G      S  L     P +       S V  S++W + GAVT VK Q  C       A+A
Sbjct: 60  G----GISRPLNIEKEPVVSFDDVNISAVGQSIDWRDYGAVTEVKDQNPCGSCWAFSAIA 115

Query: 148 AVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYE 207
            VEGI  I    LVSLSEQ+++DCA +   NGC GGF+D+A+ +II N G+ ++A Y Y+
Sbjct: 116 TVEGIYKIVTGYLVSLSEQEVLDCAVS---NGCDGGFVDNAYDFIISNNGVASEADYPYQ 172

Query: 208 GMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA--LQFYSGGV 265
               G C +  +  ++A IT Y  V  NDE S+  AV NQP++ AIDAS    Q+Y+GGV
Sbjct: 173 AYQ-GDC-AANSWPNSAYITGYSYVRSNDESSMKYAVWNQPIAAAIDASGDNFQYYNGGV 230

Query: 266 FNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIA 325
           F+G C T LNH +T +GYG    G +YW++KNSWG  WGE GY R+ R +    G CGIA
Sbjct: 231 FSGPCGTSLNHAITIIGYGQDSSGTQYWIVKNSWGSSWGERGYIRMARGVSS-SGLCGIA 289

Query: 326 MFASFPVSKESA 337
           M   +P  +  A
Sbjct: 290 MDPLYPTLQSGA 301


>gi|296090463|emb|CBI40282.3| unnamed protein product [Vitis vinifera]
          Length = 386

 Score =  237 bits (604), Expect = 7e-60,   Method: Compositional matrix adjust.
 Identities = 138/309 (44%), Positives = 173/309 (55%), Gaps = 47/309 (15%)

Query: 34  FEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEF 93
           +E W  ++G++Y    E  +RFEIFKDNL  +E  N     NR+Y               
Sbjct: 4   YEAWLVKHGKSYNALGERERRFEIFKDNLRFIEEHNAV---NRTY--------------- 45

Query: 94  IASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQCA-------V 146
                  K+ D  S               +P SV+W EKGAV PVK QG C        +
Sbjct: 46  -------KVGDRYS---------FRAGEDLPESVDWREKGAVVPVKDQGNCGSCWAFSTI 89

Query: 147 AAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSY 206
           AAVEGIN I    L+SLSEQ+LVDC     N GC GG MD AF++II N GI ++  Y Y
Sbjct: 90  AAVEGINQIATGDLISLSEQELVDC-DKSYNQGCNGGLMDYAFEFIINNGGIDSEEDYPY 148

Query: 207 EGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDAS--ALQFYSGG 264
               T  CD  +       I  YEDVP NDE SL KAVANQPVSVAI+A   A Q Y  G
Sbjct: 149 RAADT-TCDPNRKNARVVSIDGYEDVPQNDERSLKKAVANQPVSVAIEAGGRAFQLYQSG 207

Query: 265 VFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQ-GQCG 323
           VF G C T L+HGV AVGYGT E  + YW+++NSWG +WGE GY +L+R++   + G+CG
Sbjct: 208 VFTGQCGTQLDHGVVAVGYGT-ENSVDYWIVRNSWGPNWGESGYIKLERNLAGTETGKCG 266

Query: 324 IAMFASFPV 332
           IA+  S+P+
Sbjct: 267 IAIEPSYPI 275


>gi|125606204|gb|EAZ45240.1| hypothetical protein OsJ_29883 [Oryza sativa Japonica Group]
          Length = 350

 Score =  237 bits (604), Expect = 7e-60,   Method: Compositional matrix adjust.
 Identities = 137/330 (41%), Positives = 189/330 (57%), Gaps = 34/330 (10%)

Query: 30  IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLT 89
           + ++FEQW  ++GR Y ++ E  +RFE+++ N+  VE FN+ + G   Y L  NKFADLT
Sbjct: 27  MLDRFEQWMIRHGRAYTDAGEKQRRFEVYRRNVELVETFNSMSNG---YKLADNKFADLT 83

Query: 90  PQEFIASQTGFK----MSDHSSSLKAN-GTPFLYKSSQVPPSVNWIEKGAVTPVKYQ--- 141
            +EF A   GF+    +   S++  A+   P       +P SV+W  KGAV   +++   
Sbjct: 84  NEEFRAKMLGFRPHVTIPQISNTCSADIAMPGESSDDILPKSVDWRNKGAVIN-RWKICV 142

Query: 142 --GQC----AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQN 195
             G C    AVAA+EGIN IK   LVSLSEQ+LVDC  +D   GC GG+M  AF++++ N
Sbjct: 143 DAGSCWAFSAVAAIEGINQIKNGELVSLSEQELVDC--DDEAVGCGGGYMSWAFEFVVGN 200

Query: 196 KGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDA 255
            G+T +A Y Y   + G C + K    A  I  Y +V P+ E  L +A A QPVSVA+D 
Sbjct: 201 HGLTTEASYPYHA-ANGACQAAKLNQSAVAIAGYRNVTPSSEPDLARAAAAQPVSVAVDG 259

Query: 256 SALQF--YSGGVFNGYCETFLNHGVTAVGYGTSEEGIK----------YWLIKNSWGQDW 303
            +  F  Y  GV+ G C   +NHGVT VGYG SE              YW++KNSWG +W
Sbjct: 260 GSFMFQLYGSGVYTGPCTADVNHGVTVVGYGESEPKTDGGGAAKGGEKYWIVKNSWGAEW 319

Query: 304 GEDGYFRLQRDI-DQPQGQCGIAMFASFPV 332
           G+ GY  +QRD+     G CGIA+  S+PV
Sbjct: 320 GDAGYILMQRDVAGLASGLCGIALLPSYPV 349


>gi|167521499|ref|XP_001745088.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163776702|gb|EDQ90321.1| predicted protein [Monosiga brevicollis MX1]
          Length = 294

 Score =  236 bits (603), Expect = 8e-60,   Method: Compositional matrix adjust.
 Identities = 138/306 (45%), Positives = 182/306 (59%), Gaps = 23/306 (7%)

Query: 37  WKAQYGRTYKESAENSKRFEIFKDNLVAVERFN-NAAIGNRSYTLRLNKFADLTPQEFIA 95
           +K+ Y ++Y+  A  +KR   F+ NL  + + N   A G  SYT+ +N+FADLT  EF+A
Sbjct: 1   FKSDYSKSYESEAVEAKRLAAFEANLEFINKHNAEHAQGLHSYTVGVNEFADLTIDEFMA 60

Query: 96  SQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQCA-------VAA 148
               +  S  + ++  N T +L  +S+   SV+W  KGAVTP+K QGQC          +
Sbjct: 61  L---YVPSKFNRTMPYN-TVYLPATSE--DSVDWRTKGAVTPIKNQGQCGSCWSFSTTGS 114

Query: 149 VEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEG 208
            EG +AI    LVSLSEQQLVDC+ +  N GC GG MDDAFKYII NKG+  +  Y Y  
Sbjct: 115 TEGAHAIATGNLVSLSEQQLVDCSGSFGNQGCNGGLMDDAFKYIISNKGLDTEEDYPYTA 174

Query: 209 MSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDA--SALQFYSGGVF 266
              G C+  K   HAA I++Y DVP N+E+ L  AVA  PVSVAI+A  S  Q Y  GVF
Sbjct: 175 QD-GTCNKEKEAKHAATISSYSDVPKNNEDQLAAAVAKGPVSVAIEADQSGFQLYKSGVF 233

Query: 267 NGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAM 326
           +G C T L+HGV  VGY        YW++KNSWG  WG +GY  ++R +    G CGIAM
Sbjct: 234 DGNCGTNLDHGVLVVGYTDD-----YWIVKNSWGTTWGVEGYINMKRGV-SASGICGIAM 287

Query: 327 FASFPV 332
             S+P+
Sbjct: 288 QPSYPI 293


>gi|326531188|dbj|BAK04945.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 360

 Score =  236 bits (602), Expect = 1e-59,   Method: Compositional matrix adjust.
 Identities = 135/326 (41%), Positives = 191/326 (58%), Gaps = 30/326 (9%)

Query: 30  IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLT 89
           + ++F  W+A + ++Y+ + E  +RF++++DN+  +E  N    G+ +Y L  N+FADLT
Sbjct: 38  MMDRFLMWQATHNQSYRSAEERLRRFQVYRDNVEYIETTNRR--GDLTYQLGENQFADLT 95

Query: 90  PQEFIASQTGFKM---------SDHSSSLKANGTPFLYKS-----SQVPPSVNWIEKGAV 135
            +EFIA  T +           S  +++    G P L+ S     S  PPSV+W  KGAV
Sbjct: 96  REEFIARFTSYNGDDDRTGDDDSVITTAAVGGGDPDLWSSGGDDVSLDPPSVDWRAKGAV 155

Query: 136 TPVKYQGQ--------CAVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDD 187
            P K Q           AVA +E ++AIK  +LV+LSEQQLVDC   D   GC  G    
Sbjct: 156 VPPKSQSSSCSSSWAFVAVATIESLHAIKTGKLVALSEQQLVDCDQYDG--GCNRGTFRR 213

Query: 188 AFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQ 247
           AF ++IQN G+T +A Y Y   + G C+S K++ H A I+ +  VP ++E ++  AVA Q
Sbjct: 214 AFHWVIQNGGLTTEAEYPYTA-AQGTCNSAKSDHHVAAISGHASVPGSNELAMKHAVATQ 272

Query: 248 PVSVAID-ASALQFYSGGVFNGYCETFLNHGVTAVGYGTSEE-GIKYWLIKNSWGQDWGE 305
           PV+ AI+  S +QFY  GV++G C   L H VT VGYG  E  G KYW++KNSWGQ WGE
Sbjct: 273 PVAAAIELGSDMQFYKSGVYSGPCGARLEHAVTVVGYGADESTGDKYWIVKNSWGQTWGE 332

Query: 306 DGYFRLQRDIDQPQGQCGIAMFASFP 331
            GY R+QR I  P G CGI +  ++P
Sbjct: 333 RGYIRMQRKILGP-GLCGIMLDVAYP 357


>gi|357133074|ref|XP_003568153.1| PREDICTED: cysteine proteinase RD21a-like [Brachypodium distachyon]
          Length = 565

 Score =  236 bits (602), Expect = 1e-59,   Method: Compositional matrix adjust.
 Identities = 136/315 (43%), Positives = 172/315 (54%), Gaps = 19/315 (6%)

Query: 34  FEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNR------SYTLRLNKFAD 87
           FE W A++G+ Y    E + R   F DN   V   N    G        SYTL LN FAD
Sbjct: 42  FEAWCAEHGKAYASPGERAARLAAFADNAAFVAAHNAGGGGAGGSNAAPSYTLALNAFAD 101

Query: 88  LTPQEFIASQTG-FKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQC-- 144
           LT  EF A++ G   +    +     G         VP +++W + GAVT VK QG C  
Sbjct: 102 LTHAEFRAARLGRLAVGGARAPPSEGGFAGSVGVGAVPEALDWRQSGAVTKVKDQGSCGA 161

Query: 145 -----AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGIT 199
                A  A+EGIN IK   L+SLSEQ+L+DC     N GC GG MD A++++I+N GI 
Sbjct: 162 CWSFSATGAIEGINKIKTGSLISLSEQELIDC-DRSYNAGCGGGLMDYAYRFVIKNGGID 220

Query: 200 NDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAI--DASA 257
            +  Y Y   + G C+  K + H   I  Y DVP N E+SLL+AVA QP+SV I   A A
Sbjct: 221 TEDDYPYR-EADGTCNKNKLKRHVVTIDGYSDVPANKEDSLLQAVAQQPISVGICGSARA 279

Query: 258 LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQ 317
            Q YS G+F+G C T L+H V  VGYG SE G  YW++KNSWG+ WG  GY  + R+   
Sbjct: 280 FQLYSQGIFDGPCPTSLDHAVLIVGYG-SEGGKDYWIVKNSWGERWGMKGYMHMHRNTGS 338

Query: 318 PQGQCGIAMFASFPV 332
             G CGI M ASFP 
Sbjct: 339 SSGICGINMMASFPT 353


>gi|226499884|ref|NP_001148278.1| thiol protease SEN102 precursor [Zea mays]
 gi|195617112|gb|ACG30386.1| thiol protease SEN102 precursor [Zea mays]
          Length = 374

 Score =  236 bits (601), Expect = 1e-59,   Method: Compositional matrix adjust.
 Identities = 138/335 (41%), Positives = 189/335 (56%), Gaps = 31/335 (9%)

Query: 26  DEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIG-NRSYTLRLNK 84
           D+ S+ E+F++WKA Y ++Y   AE  +RF ++  N+  +E  N  A     +Y L    
Sbjct: 42  DDSSMIERFQRWKAAYNKSYATVAEERRRFRVYARNMAYIEATNAEAEAAGLTYELGETA 101

Query: 85  FADLTPQEFIASQTG---FKMSDHSSSLKANGTP-------------FLYKSSQVPPSVN 128
           + DLT QEF+A  T     ++    S +     P             ++  S+  P SV+
Sbjct: 102 YTDLTNQEFMAMYTAPALAQLPADESVITTRAGPVDAVGGAPGQLPVYVNLSASAPASVD 161

Query: 129 WIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCY 181
           W   GAVTPVK QG+C        VA VEGI  I+  +LVSLSEQ+LVDC T D+  GC 
Sbjct: 162 WRASGAVTPVKNQGRCGSCWAFSTVAVVEGIYQIRTGKLVSLSEQELVDCDTLDD--GCD 219

Query: 182 GGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLL 241
           GG    A ++I  N GIT +A Y Y G +T  C+  K   +A  I     V    E SL 
Sbjct: 220 GGISYRALRWIASNGGITTEADYPYTG-TTDACNRAKLSHNAVSIAGLRRVATRSEASLA 278

Query: 242 KAVANQPVSVAIDASA--LQFYSGGVFNGYCETFLNHGVTAVGYGT-SEEGIKYWLIKNS 298
            AVA QPV+V+I+A     Q Y  GV+NG C T LNHGVT VGYG  +  G +YW++KNS
Sbjct: 279 NAVAGQPVAVSIEAGGDNFQHYKKGVYNGPCGTNLNHGVTVVGYGQEAAAGDRYWIVKNS 338

Query: 299 WGQDWGEDGYFRLQRDI-DQPQGQCGIAMFASFPV 332
           WGQ WG+DGY R+++D+  +P+G CGIA+  S+P+
Sbjct: 339 WGQGWGDDGYIRMKKDVAGKPEGLCGIAIRPSYPL 373


>gi|356509992|ref|XP_003523725.1| PREDICTED: oryzain alpha chain-like [Glycine max]
          Length = 439

 Score =  235 bits (600), Expect = 2e-59,   Method: Compositional matrix adjust.
 Identities = 138/323 (42%), Positives = 183/323 (56%), Gaps = 36/323 (11%)

Query: 31  AEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNR---SYTLRLNKFAD 87
           +E FE+W  ++ +TY    E   R ++F+DN   V + N  A  N    SYTL LN FAD
Sbjct: 30  SELFEKWCKEHSKTYSSEEEKLYRLKVFEDNYAFVAQHNQNANNNNNNSSYTLSLNAFAD 89

Query: 88  LTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQ---------VPPSVNWIEKGAVTPV 138
           LT  EF  ++ G  +           T   +K  Q         +P  ++W + GAVTPV
Sbjct: 90  LTHHEFKTTRLGLPL-----------TLLRFKRPQNQQSRDLLHIPSQIDWRQSGAVTPV 138

Query: 139 KYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKY 191
           K Q  C       A  A+EGIN I    LVSLSEQ+L+DC T+  N+GC GG MD A+++
Sbjct: 139 KDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDTS-YNSGCGGGLMDFAYQF 197

Query: 192 IIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSV 251
           +I NKGI  +  Y Y+      C   K +  A  I +Y DVPP++EE +LKAVA+QPVSV
Sbjct: 198 VIDNKGIDTEDDYPYQARQRS-CSKDKLKRRAVTIEDYVDVPPSEEE-ILKAVASQPVSV 255

Query: 252 AIDASA--LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYF 309
            I  S    Q YS G+F G C TFL+H V  VGYG SE G+ YW++KNSWG+ WG +GY 
Sbjct: 256 GICGSEREFQLYSKGIFTGPCSTFLDHAVLIVGYG-SENGVDYWIVKNSWGKYWGMNGYI 314

Query: 310 RLQRDIDQPQGQCGIAMFASFPV 332
            + R+    +G CGI   AS+PV
Sbjct: 315 HMIRNSGNSKGICGINTLASYPV 337


>gi|226505708|ref|NP_001141813.1| uncharacterized protein LOC100273952 precursor [Zea mays]
 gi|194706024|gb|ACF87096.1| unknown [Zea mays]
 gi|413945958|gb|AFW78607.1| hypothetical protein ZEAMMB73_489507 [Zea mays]
          Length = 460

 Score =  235 bits (600), Expect = 2e-59,   Method: Compositional matrix adjust.
 Identities = 140/330 (42%), Positives = 183/330 (55%), Gaps = 28/330 (8%)

Query: 26  DEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNR--------- 76
           D  +I  +F+ W A++G+ Y    E + R  +F DN   V   N  A  N          
Sbjct: 28  DPPAIEAQFDAWCAEHGKAYATPEERAARLAVFADNAAFVAAHNARAGANAAGGGGGGAA 87

Query: 77  --SYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPF---LYKSSQVPPSVNWIE 131
             SYTL LN FADLT +EF A++ G       ++L++   P    L   + VP +++W +
Sbjct: 88  PPSYTLALNAFADLTHEEFRAARLGRIAP--GAALRSRAAPVYWGLGGGAAVPDALDWRK 145

Query: 132 KGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGF 184
            GAVT VK QG C       A  A+EGIN IK   LVSLSEQ+L+DC     N+GC GG 
Sbjct: 146 SGAVTKVKDQGSCGACWSFSATGAMEGINKIKTGSLVSLSEQELIDC-DRSYNSGCGGGL 204

Query: 185 MDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAV 244
           MD A+K++I+N GI  +  Y Y   + G C+  K +     I  Y DVP N E+ LL+AV
Sbjct: 205 MDYAYKFVIKNGGIDTEEDYPYR-EADGTCNKNKLKKRVVTIDGYTDVPSNKEDLLLQAV 263

Query: 245 ANQPVSVAI--DASALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQD 302
           A QPVSV I   A A Q Y  G+F+G C T L+H V  VGYG SE G  YW++KNSWG+ 
Sbjct: 264 AQQPVSVGICGSARAFQLYYQGIFDGPCPTSLDHAVLIVGYG-SEGGKDYWIVKNSWGES 322

Query: 303 WGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
           WG  GY  + R+    +G CGI M ASFP 
Sbjct: 323 WGMKGYMHMHRNTGDSKGVCGINMMASFPT 352


>gi|390337645|ref|XP_001199228.2| PREDICTED: cathepsin L-like [Strongylocentrotus purpuratus]
          Length = 333

 Score =  235 bits (599), Expect = 2e-59,   Method: Compositional matrix adjust.
 Identities = 143/345 (41%), Positives = 204/345 (59%), Gaps = 29/345 (8%)

Query: 3   KYFLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNL 62
           KY  ++++ +    +   ++  FDE      +++WK ++G+ Y    E + R  I++ NL
Sbjct: 2   KYLSVLLVAVCVVSSLSMSFTDFDE-----DWKEWKNEHGKRYLSDEEEASRRLIWQKNL 56

Query: 63  VAVERFN-NAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSS 121
             V R N    +G+ +Y L +N+FADL  +EF+A  TGF++  + +S  A G+ FL  ++
Sbjct: 57  DIVIRHNLKYDLGHFTYDLGMNQFADLQNKEFVAMMTGFRV--NGTSKAAKGSTFLPPNN 114

Query: 122 --QVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCA 172
             ++P +V+W  KG VTPVK QGQC       A  ++EG +  K  +LVSLSEQ LVDC+
Sbjct: 115 VGKLPKTVDWRTKGYVTPVKDQGQCGSCWAFSATGSLEGQHFKKTGKLVSLSEQNLVDCS 174

Query: 173 TNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDV 232
             D N GC GG MD AF+YII   GI  +  Y Y  M  G C   K  +  A +T Y DV
Sbjct: 175 --DKNYGCNGGLMDRAFQYIIDAGGIDTEESYPYIAMD-GNC-HFKTANVGATVTGYTDV 230

Query: 233 PPNDEESLLKAVAN-QPVSVAIDAS--ALQFYSGGVFN--GYCETFLNHGVTAVGYGTSE 287
               E++L KAVA+  P+SVAIDAS  + Q Y  GV+N  G   T L+HGV AVGYGT+ 
Sbjct: 231 TSGSEKALQKAVAHIGPISVAIDASHFSFQLYQSGVYNEPGCSSTLLDHGVLAVGYGTTI 290

Query: 288 EGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
           +G  YW++KNSW + WG +GY  + R+ D    QCGIA  AS+P+
Sbjct: 291 DGTDYWIVKNSWAETWGMNGYIWMSRNKDN---QCGIATQASYPL 332


>gi|238007404|gb|ACR34737.1| unknown [Zea mays]
 gi|413943289|gb|AFW75938.1| cysteine proteinase Mir2 [Zea mays]
          Length = 484

 Score =  235 bits (599), Expect = 3e-59,   Method: Compositional matrix adjust.
 Identities = 136/334 (40%), Positives = 194/334 (58%), Gaps = 27/334 (8%)

Query: 23  RTFDEGSIAEKFEQWKAQYGRTYKESA----------ENSKRFEIFKDNLVAVERFN-NA 71
           RT +E  +   +E+W++++    +  A          ++++R E+F+ NL  ++  N  A
Sbjct: 44  RTDEE--VRRLYEEWRSEHDAGPRRGATGGSLGPGEDDDARRLEVFRYNLRYIDAHNAEA 101

Query: 72  AIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTP--FLYKSSQVPPSVNW 129
             G   + L L +FADLT +E+ A         + +++   G+         Q+P +V+W
Sbjct: 102 DAGLHGFRLGLTRFADLTLEEYRARLLLGSRGRNGTAVGVVGSRRYLPLAGEQLPDAVDW 161

Query: 130 IEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYG 182
            E+GAV  VK QGQC       AVAAVEGIN I    L+SLSEQ+L+DC     + GC G
Sbjct: 162 RERGAVAEVKDQGQCGACWAFSAVAAVEGINKIVTGSLISLSEQELIDC-DKFQDQGCDG 220

Query: 183 GFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLK 242
           G MD+AF ++I+N GI  +A Y + G   G CD          I ++E VP N E +L K
Sbjct: 221 GLMDNAFVFMIKNGGIDTEADYPFTGHD-GTCDLKLKNTRVVSIDSFERVPINYERALQK 279

Query: 243 AVANQPVSVAIDAS--ALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWG 300
           AVA+QPVS +I+AS  A Q YS G+F+G C T+L+HGVT VGYG SE G  YW++KNSWG
Sbjct: 280 AVAHQPVSASIEASRRAFQLYSSGIFDGRCGTYLDHGVTVVGYG-SEGGKDYWIVKNSWG 338

Query: 301 QDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVSK 334
             WGE GY R+ R++    G+CGIAM   +PV +
Sbjct: 339 TQWGEAGYVRMARNVRVRAGKCGIAMEPLYPVKE 372


>gi|194701748|gb|ACF84958.1| unknown [Zea mays]
 gi|414589103|tpg|DAA39674.1| TPA: thiol protease SEN102 [Zea mays]
          Length = 374

 Score =  235 bits (599), Expect = 3e-59,   Method: Compositional matrix adjust.
 Identities = 141/365 (38%), Positives = 198/365 (54%), Gaps = 40/365 (10%)

Query: 5   FLIVVLIISGSCASQATYR---------TFDEGSIAEKFEQWKAQYGRTYKESAENSKRF 55
            L+++ +    C+S   +R         + D+ S+ E+F++WKA Y ++Y   AE  +RF
Sbjct: 12  VLLLLAVFHHGCSSARAHRRAGDMERSMSTDDSSMIERFQRWKAAYNKSYATVAEERRRF 71

Query: 56  EIFKDNLVAVERFNNAAIG-NRSYTLRLNKFADLTPQEFIASQTG---FKMSDHSSSLKA 111
            +   N+  +E  N  A     +Y L    + DLT QEF+A  T     ++    S +  
Sbjct: 72  RVCARNMAYIEATNAEAEAAGLTYELGETAYTDLTNQEFMAMYTAPAPAQLPADESVITT 131

Query: 112 NGTP-------------FLYKSSQVPPSVNWIEKGAVTPVKYQGQCA-------VAAVEG 151
              P             ++  S+  P SV+W   GAVTPVK QG+C        VA VEG
Sbjct: 132 RAGPVDAVGGAPGQLPVYVNLSTSAPASVDWRASGAVTPVKNQGRCGSCWAFSTVAVVEG 191

Query: 152 INAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMST 211
           I  I+  +LVSLSEQ+LVDC T D+  GC GG    A ++I  N GIT +  Y Y G +T
Sbjct: 192 IYQIRTGKLVSLSEQELVDCDTLDD--GCDGGISYRALRWIASNGGITTETDYPYTG-TT 248

Query: 212 GICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA--LQFYSGGVFNGY 269
             C+  K   +A  I     V    E SL  AVA QPV+V+I+A     Q Y  GV+NG 
Sbjct: 249 DACNRAKLSHNAVSIAGLRRVATRSEASLANAVAGQPVAVSIEAGGDNFQHYKKGVYNGP 308

Query: 270 CETFLNHGVTAVGYGT-SEEGIKYWLIKNSWGQDWGEDGYFRLQRDI-DQPQGQCGIAMF 327
           C T LNHGVT VGYG  +  G +YW++KNSWGQ WG+DGY R+++D+  +P+G CGIA+ 
Sbjct: 309 CGTNLNHGVTVVGYGQEAAGGDRYWIVKNSWGQGWGDDGYIRMKKDVAGKPEGLCGIAIR 368

Query: 328 ASFPV 332
            S+P+
Sbjct: 369 PSYPL 373


>gi|194352760|emb|CAQ00108.1| papain-like cysteine proteinase [Hordeum vulgare subsp. vulgare]
 gi|326510977|dbj|BAJ91836.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326523875|dbj|BAJ96948.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326528631|dbj|BAJ97337.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 368

 Score =  234 bits (596), Expect = 5e-59,   Method: Compositional matrix adjust.
 Identities = 141/354 (39%), Positives = 190/354 (53%), Gaps = 34/354 (9%)

Query: 9   VLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERF 68
           V  + GS A+     T D   +A++F +WKA++ RTY    E   R  ++  N+  +E  
Sbjct: 18  VFFLHGSSATSRP-ATEDADPMAQRFRRWKAEHSRTYATPEEERHRLRVYARNMRYIEAT 76

Query: 69  NNAAIGNRSYTLRLNKFADLTPQEFIASQTGF--KMSDHSSSLKANGTP----------- 115
           N  A    +Y L    + DLT  EF A  T     +SD    L                 
Sbjct: 77  NGDAGAGLTYELGETAYTDLTSDEFTAMYTSRAPPLSDDDDDLPMTMITTRAGPVAAAGG 136

Query: 116 ------FLYKSSQVPPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVS 162
                 ++ +S+  P SV+W E+GAVT VK QGQC        VA +EGI+ IK  +L S
Sbjct: 137 GGWLQVYVNESAGAPASVDWRERGAVTAVKNQGQCGSCWAFSTVAVIEGIHQIKTGKLAS 196

Query: 163 LSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDH 222
           LSEQ+LVDC   D  +GC GG    A ++I  N GIT+   Y Y       CD+ K   H
Sbjct: 197 LSEQELVDCDKLD--HGCNGGVSYRALQWITSNGGITSQDDYPYTAKDD-TCDTKKLSHH 253

Query: 223 AAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA--LQFYSGGVFNGYCETFLNHGVTA 280
           AA I+ ++ V    E SL  AVA QPV+V+I+A     Q Y  GV+NG C T LNHGVT 
Sbjct: 254 AASISGFQRVATRSELSLTNAVAMQPVAVSIEAGGANFQHYRNGVYNGPCGTRLNHGVTV 313

Query: 281 VGYGTSE-EGIKYWLIKNSWGQDWGEDGYFRLQRD-IDQPQGQCGIAMFASFPV 332
           VGYG  E  G  YW++KNSWG+ WG++GY R+++  ID+P+G CGIA+  SFP+
Sbjct: 314 VGYGEDEVTGESYWIVKNSWGEKWGDNGYLRMKKGIIDKPEGICGIAIRPSFPL 367


>gi|302763831|ref|XP_002965337.1| hypothetical protein SELMODRAFT_230602 [Selaginella moellendorffii]
 gi|300167570|gb|EFJ34175.1| hypothetical protein SELMODRAFT_230602 [Selaginella moellendorffii]
          Length = 343

 Score =  234 bits (596), Expect = 7e-59,   Method: Compositional matrix adjust.
 Identities = 135/348 (38%), Positives = 201/348 (57%), Gaps = 25/348 (7%)

Query: 1   MAKYFLIVVLIISGSCASQATYRTFDEG---SIAEKFEQWKAQYGRTYKESAENSKRFEI 57
           M    LI+++++  +  + A     ++G    I   FE W A++G++Y    E ++R  I
Sbjct: 5   MIASTLILLVVVGATPFAIARPAALEDGRALEIKNMFEDWAAKHGKSYSSDLEKARRLMI 64

Query: 58  FKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTG-FKMSDHSSSLKANGTPF 116
           F D L  +E+ N  A  N ++TL LNKF+DLT  EF A   G FK   +   L A     
Sbjct: 65  FSDTLAYIEKHN--AQPNTTFTLGLNKFSDLTNAEFRAMHVGKFKRPRYQDRLPAEDEDV 122

Query: 117 LYKSSQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLV 169
               S +P S++W +KGAVTP+K QG C       A+A++E  + +    LVSLSEQQL+
Sbjct: 123 --DVSSLPTSLDWRQKGAVTPIKDQGDCGSCWAFSAIASIESAHFLATKELVSLSEQQLM 180

Query: 170 DCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAE--DHAAQIT 227
           DC T D   GC GG M+ AFK++++N G+T +A Y Y G S G C++ K    +  A+IT
Sbjct: 181 DCDTVDA--GCDGGLMETAFKFVVKNGGVTTEASYPYTG-SVGSCNANKVAIINKVAEIT 237

Query: 228 NYEDVPPNDEESLLKAVANQPVSVAIDASALQF--YSGGVFNGYCETFLNHGVTAVGYGT 285
            ++ V  +  ++L+KAV+  PV+V+I  S   F  Y  G+ +G C   L+HGV  +GYGT
Sbjct: 238 GFKVVTEDSADALMKAVSKTPVTVSICGSDENFQNYKSGILSGQCGDSLDHGVLLIGYGT 297

Query: 286 SEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVS 333
            E G+ YW+IKNSWG  WGEDG+ +++R      G CG+   +S+P +
Sbjct: 298 -EGGMPYWIIKNSWGTSWGEDGFMKIER--KDGDGICGMNGDSSYPTT 342


>gi|390368662|ref|XP_780781.2| PREDICTED: cathepsin L-like [Strongylocentrotus purpuratus]
          Length = 333

 Score =  234 bits (596), Expect = 7e-59,   Method: Compositional matrix adjust.
 Identities = 143/345 (41%), Positives = 203/345 (58%), Gaps = 29/345 (8%)

Query: 3   KYFLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNL 62
           KY  ++++ +    +   ++  FDE      + QWK ++G+ Y    E + R  I++ NL
Sbjct: 2   KYLSVLLVAVCVVSSLSMSFTDFDE-----DWNQWKNEHGKRYLSDEEEASRKLIWEKNL 56

Query: 63  VAVERFN-NAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSS 121
             V + N    +G+ +Y L +N+FADL  +EF+A  TGF++  + +S  A G+ FL  ++
Sbjct: 57  DIVIKHNLKYDLGHFTYALGMNQFADLQNEEFVAMMTGFRV--NGTSKAAKGSTFLPSNN 114

Query: 122 --QVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCA 172
             ++P +V+W  KG VTPVK QGQC       A  ++EG    K  +LVSLSEQ LVDC+
Sbjct: 115 VDKLPKTVDWRTKGYVTPVKDQGQCGSCWAFSATGSLEGQQFKKTGKLVSLSEQNLVDCS 174

Query: 173 TNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDV 232
               N GC+GGFMD AF+YII   GI  +A YSY  +  G C   KA +  A +T Y DV
Sbjct: 175 YR--NYGCHGGFMDRAFQYIIDAGGIDTEATYSYRAVD-GNCHFKKA-NVGATVTGYTDV 230

Query: 233 PPNDEESLLKAVAN-QPVSVAIDASA--LQFYSGGVFN--GYCETFLNHGVTAVGYGTSE 287
               E++L KAVA+  P+SVAIDAS    +FY  GV+N  G   T L H V  VGYGT+ 
Sbjct: 231 TSGSEKALQKAVAHIGPISVAIDASHKFFKFYKSGVYNEPGCSTTRLGHAVLVVGYGTTS 290

Query: 288 EGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
           +G  YW++KNSW + WG +GY  + R+ D    QCGIA  AS+P+
Sbjct: 291 DGTDYWIVKNSWAKTWGMNGYLWMSRNKDN---QCGIASEASYPM 332


>gi|326490904|dbj|BAJ90119.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 457

 Score =  233 bits (594), Expect = 9e-59,   Method: Compositional matrix adjust.
 Identities = 133/316 (42%), Positives = 177/316 (56%), Gaps = 20/316 (6%)

Query: 33  KFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAI----GNRSYTLRLNKFADL 88
           +FE W A++G+ Y    E + R   F +N   V   N+A      G  SYTL LN FADL
Sbjct: 38  QFEAWCAEHGKAYATPGERAARLAAFAENAAFVAAHNDAVASSGPGGPSYTLALNAFADL 97

Query: 89  TPQEFIASQTGFKMSDHSSSLKANGTP---FLYKSSQVPPSVNWIEKGAVTPVKYQGQC- 144
           T  EF A++ G +++     L A       F  +   VP +++W + GAVT VK QG C 
Sbjct: 98  THDEFRAARLG-RLAVGPGPLGAPSPSDGGFEGRVGAVPDALDWRQSGAVTKVKDQGSCG 156

Query: 145 ------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGI 198
                 A  A+EGIN I    L+SLSEQ+L+DC     N GC GG M  A+K++I+N GI
Sbjct: 157 ACWSFSATGAMEGINKITTGSLLSLSEQELIDC-DRSYNTGCGGGLMTYAYKFVIKNGGI 215

Query: 199 TNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAI--DAS 256
             +  Y +   + G C+  K + H   I  Y++VP + E+ LL+AVA QP+SV I   A 
Sbjct: 216 DTEDDYPFR-EADGTCNKNKLKKHVVTIDGYKEVPSSKEDLLLQAVAQQPISVGICGSAR 274

Query: 257 ALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDID 316
           A Q YS G+F+G C T L+H V  VGYG SE G  YW++KNSWG+ WG  GY  + R+  
Sbjct: 275 AFQLYSQGIFDGPCPTSLDHAVLIVGYG-SEGGKDYWIVKNSWGERWGMKGYMHMHRNTG 333

Query: 317 QPQGQCGIAMFASFPV 332
              G CGI M ASFP 
Sbjct: 334 SSSGICGINMMASFPT 349


>gi|343978787|gb|AEM76722.1| cathepsin L-like proteinase [Triatoma brasiliensis]
          Length = 330

 Score =  233 bits (594), Expect = 9e-59,   Method: Compositional matrix adjust.
 Identities = 142/343 (41%), Positives = 195/343 (56%), Gaps = 28/343 (8%)

Query: 3   KYFLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNL 62
           K  L+ V +I+ SCA+    R ++     E++E +K  +G+ YK   E   R +IF +N 
Sbjct: 2   KVLLVAVAVIAVSCAN----RFYNIN--PEEWETFKVVHGKNYKNQFEEMFRRKIFMNNK 55

Query: 63  VAVERFN-NAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSS 121
             +E  N     G  SY +++N F DL   E  A   GFKM+ ++   K  G  +   + 
Sbjct: 56  KRIEAHNAKYEQGEVSYKMKMNHFGDLMSHEIKALMNGFKMTPNT---KREGKIYFPSND 112

Query: 122 QVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATN 174
           ++P SV+W +KGAVTPVK QGQC       A  ++EG   +K  +LVSLSEQ L+DC+  
Sbjct: 113 KLPKSVDWRQKGAVTPVKDQGQCGSCWSFSATGSLEGQIFLKKGKLVSLSEQNLMDCSKE 172

Query: 175 DNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPP 234
             NNGC GG MD AF+Y+  NKGI  ++ Y YE          K +        Y D+P 
Sbjct: 173 YGNNGCEGGLMDKAFQYVSDNKGIDTESSYPYEARDYAC--RFKKDKVGGTDKGYVDIPE 230

Query: 235 NDEESLLKAVAN-QPVSVAIDAS--ALQFYSGGVFN-GYCETF-LNHGVTAVGYGTSEEG 289
            DE++L  A+A   P+SVAIDAS  +  FYS GV+N  YC ++ L+HGV AVGYGT E G
Sbjct: 231 GDEKALQNALATVGPISVAIDASHESFHFYSEGVYNEPYCSSYDLDHGVLAVGYGT-ENG 289

Query: 290 IKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
             YWL+KNSWG  WGE GY ++ R+       CGIA  AS+P+
Sbjct: 290 QDYWLVKNSWGPSWGESGYIKIARN---HSNHCGIASMASYPI 329


>gi|386648112|gb|AFJ15103.1| mexicain-like cystein protease, partial [Jacaratia mexicana]
          Length = 348

 Score =  233 bits (594), Expect = 1e-58,   Method: Compositional matrix adjust.
 Identities = 132/307 (42%), Positives = 180/307 (58%), Gaps = 21/307 (6%)

Query: 34  FEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEF 93
           FE W  ++ R Y    E   RFEIFKDNL+ ++  N     N SY L LN+F DLT  EF
Sbjct: 48  FESWMLKHDRVYNNIEEKIHRFEIFKDNLMYIDETNKK---NNSYWLGLNEFVDLTHDEF 104

Query: 94  IASQTGFKMSDHSSSLKANGTPFLYKSS-QVPPSVNWIEKGAVTPVKYQ--GQC----AV 146
                G    D  +  ++N   F YK     P S++W +KGAVTPVK    G C     V
Sbjct: 105 KEKYVGSIGEDFVTIEQSNDEEFPYKHVVDYPESIDWRDKGAVTPVKPNPCGSCWAFSTV 164

Query: 147 AAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSY 206
           A VEGIN I   +L+SLSEQ+L+DC  +  ++GC GG+   + +Y++ N G+  +  Y Y
Sbjct: 165 ATVEGINKIVTGKLISLSEQELLDC--DRRSHGCKGGYQTTSLQYVVDN-GVHTEKEYPY 221

Query: 207 EGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDAS--ALQFYSGG 264
           E    G C + + +    QIT Y+ VP NDE SL++A+ANQPVSV +++   A Q Y GG
Sbjct: 222 E-KKQGKCRAKEKKGTKVQITGYKRVPANDEISLIQAIANQPVSVLLESKGRAFQLYKGG 280

Query: 265 VFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGI 324
           +FNG C T L+H VTA+GYG +     Y LIKNSWG +WGE GY +++R   + +G CG+
Sbjct: 281 IFNGPCGTKLDHAVTAIGYGKT-----YILIKNSWGPNWGEKGYLKIKRASGKSEGTCGV 335

Query: 325 AMFASFP 331
              + FP
Sbjct: 336 YKSSYFP 342


>gi|72008176|ref|XP_780713.1| PREDICTED: cathepsin L-like [Strongylocentrotus purpuratus]
          Length = 335

 Score =  233 bits (593), Expect = 1e-58,   Method: Compositional matrix adjust.
 Identities = 140/343 (40%), Positives = 203/343 (59%), Gaps = 26/343 (7%)

Query: 5   FLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVA 64
           +L V+L+   +C   +   +F +    E + QWK ++G+ Y    E + R  I++ NL  
Sbjct: 3   YLSVLLV--AACVVSSLSMSFTD--FDEDWNQWKNEHGKRYLSDEEEASRKLIWEKNLDI 58

Query: 65  VERFN-NAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSS-- 121
           V + N    +G+ +Y L +N+FADL  +EF+A  TGF++  + +S  A G+ FL  ++  
Sbjct: 59  VIKHNLKYDLGHFTYALGMNQFADLKNEEFVAMMTGFRV--NGTSKAAKGSTFLPSNNIG 116

Query: 122 QVPPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATN 174
           ++P +V+W  KG VTPVK QGQC          ++EG +     +LVSLSEQ LVDC+  
Sbjct: 117 ELPKTVDWRTKGYVTPVKDQGQCGSCWAFSTTGSLEGQHFKATGKLVSLSEQNLVDCSGK 176

Query: 175 DNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPP 234
           + N GC GG MD AF+YII+  GI  +  Y Y+ +  G C   KA +  A +T Y DV  
Sbjct: 177 EGNEGCDGGLMDQAFQYIIKAGGIDTEESYPYKAVD-GECHFKKA-NIGATVTGYTDVTS 234

Query: 235 NDEESLLKAVAN-QPVSVAIDASAL--QFYSGGVFN--GYCETFLNHGVTAVGYGTSEEG 289
           + E +L KAVA+  P+SVAIDAS +  Q Y  GV+N      T L+HGV AVGYGT+ +G
Sbjct: 235 DSETALQKAVAHIGPISVAIDASHMSFQLYKSGVYNEPDCSSTLLDHGVLAVGYGTTSDG 294

Query: 290 IKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
             YW++KNSW + WG +GY  + R+ D    QCGIA  AS+P+
Sbjct: 295 TDYWIVKNSWAETWGMNGYLWMSRNKDN---QCGIATQASYPL 334


>gi|357114837|ref|XP_003559200.1| PREDICTED: fruit bromelain-like [Brachypodium distachyon]
          Length = 371

 Score =  233 bits (593), Expect = 1e-58,   Method: Compositional matrix adjust.
 Identities = 130/331 (39%), Positives = 189/331 (57%), Gaps = 34/331 (10%)

Query: 25  FDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNK 84
            D+  + ++F +W+A + RTY ++ E  +RF++++ N+  +E  N    G  +Y L  N+
Sbjct: 50  LDDMLMLDRFVRWQAAHNRTYGDAEERLRRFQVYRANIEYIEATNRR--GGLTYELGENQ 107

Query: 85  FADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKS--------------SQVPPSVNWI 130
           FADLT +EF++       S + +  +A+    L  +              +  PPS +W 
Sbjct: 108 FADLTSEEFLS----MYASSYDAGDRADDEAALITTDVAGDGAWSDGDLEALPPPSWDWR 163

Query: 131 EKGAVTPVKYQGQ--------CAVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYG 182
            KGAVTP K QG           VA +EG+  IK  +L+SLSEQQLVDC   D   GC  
Sbjct: 164 AKGAVTPPKNQGPTCSSCWAFVTVATIEGLTFIKTGKLISLSEQQLVDCDMYDG--GCNT 221

Query: 183 GFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLK 242
           G     F+++++N G+T +A Y Y   + G C+  K+  HAA+IT    +PP +E  + K
Sbjct: 222 GSYSRGFRWVLENGGLTTEAEYPYTA-ARGPCNRAKSAHHAAKITGQGRIPPQNELVMQK 280

Query: 243 AVANQPVSVAID-ASALQFYSGGVFNGYCETFLNHGVTAVGYGTSE-EGIKYWLIKNSWG 300
           AVA QPV VAI+  S +QFY  GV++G C T L H VT VGYG     G KYW++KNSWG
Sbjct: 281 AVAGQPVGVAIEVGSGMQFYKTGVYSGPCGTNLAHAVTVVGYGVDPASGAKYWIVKNSWG 340

Query: 301 QDWGEDGYFRLQRDIDQPQGQCGIAMFASFP 331
           Q WGE G+ R++RD+  P G CGIA+  ++P
Sbjct: 341 QAWGERGFIRMRRDVGGP-GLCGIALDVAYP 370


>gi|159479072|ref|XP_001697622.1| cysteine endopeptidase [Chlamydomonas reinhardtii]
 gi|158274232|gb|EDP00016.1| cysteine endopeptidase [Chlamydomonas reinhardtii]
          Length = 469

 Score =  232 bits (592), Expect = 2e-58,   Method: Compositional matrix adjust.
 Identities = 138/317 (43%), Positives = 186/317 (58%), Gaps = 26/317 (8%)

Query: 34  FEQWKAQYGRTY-KESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQE 92
           F++W   + R+Y  + AE   RF+++ +NL  V  +N       S+ L LN  ADL+  E
Sbjct: 13  FKEWAQTHSRSYVNDVAEFENRFKVWLENLEYVLAYNARTT---SHWLTLNHLADLSTPE 69

Query: 93  FIASQTGFKMSDHSSSLKANG--TPFLYK---SSQVPPSVNWIEKGAVTPVKYQGQCA-- 145
           + +   GF   D+ + +  N   T F Y+   +  +PP+++W +K AV  VK QGQC   
Sbjct: 70  YKSKLLGF---DNQARVARNKLKTGFRYEDVDAEALPPAIDWRKKNAVAEVKNQGQCGSC 126

Query: 146 -----VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITN 200
                  +VEGINAI    LVSLSEQ+LVDC T + + GC GG MD A+ +II+NKGI  
Sbjct: 127 WAFATTGSVEGINAIVTGSLVSLSEQELVDCDT-EQDKGCSGGLMDYAYAWIIKNKGINT 185

Query: 201 DAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAI--DASAL 258
           +  Y Y  M  G CD  K +     I +YEDVP NDE +L KA A+QPV+VAI  DA + 
Sbjct: 186 EEDYPYTAMD-GQCDVAKMKRRVVTIDSYEDVPENDEVALKKAAAHQPVAVAIEADAKSF 244

Query: 259 QFYSGGVFNG-YCETFLNHGVTAVGYG--TSEEGIKYWLIKNSWGQDWGEDGYFRLQRDI 315
           Q Y GGV++   C T LNHGV  VGYG   +  G  YW++KNSWG +WG+ GY RL+   
Sbjct: 245 QLYGGGVYDDPTCGTSLNHGVLVVGYGKDVTGSGSNYWIVKNSWGAEWGDAGYIRLKMGS 304

Query: 316 DQPQGQCGIAMFASFPV 332
              +G CGIAM  S+PV
Sbjct: 305 TDAEGLCGIAMAPSYPV 321


>gi|115464789|ref|NP_001055994.1| Os05g0508300 [Oryza sativa Japonica Group]
 gi|48475189|gb|AAT44258.1| hypothetical protein [Oryza sativa Japonica Group]
 gi|113579545|dbj|BAF17908.1| Os05g0508300 [Oryza sativa Japonica Group]
          Length = 450

 Score =  232 bits (592), Expect = 2e-58,   Method: Compositional matrix adjust.
 Identities = 135/310 (43%), Positives = 175/310 (56%), Gaps = 19/310 (6%)

Query: 33  KFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQE 92
           +FE W A++GR+Y    E + R   F DN   V   N A     SY L LN FADLT  E
Sbjct: 37  QFEAWCAEHGRSYATPGERAARLAAFADNAAFVAAHNGAPA---SYALALNAFADLTHDE 93

Query: 93  FIASQTGFKMSDHSSSLKANGTPFLYKSS---QVPPSVNWIEKGAVTPVKYQGQC----- 144
           F A++ G   +      +  G P+L        VP +V+W + GAVT VK QG C     
Sbjct: 94  FRAARLGRLAAAGGPG-RDGGAPYLGVDGGVGAVPDAVDWRQSGAVTKVKDQGSCGACWS 152

Query: 145 --AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDA 202
             A  A+EGIN IK   L+SLSEQ+L+DC     N+GC GG MD A+K++++N GI  +A
Sbjct: 153 FSATGAMEGINKIKTGSLISLSEQELIDC-DRSYNSGCGGGLMDYAYKFVVKNGGIDTEA 211

Query: 203 VYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAI--DASALQF 260
            Y Y   + G C+  K +     I  Y+DVP N+E+ LL+AVA QPVSV I   A A Q 
Sbjct: 212 DYPYR-ETDGTCNKNKLKRRVVTIDGYKDVPANNEDMLLQAVAQQPVSVGICGSARAFQL 270

Query: 261 YSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQG 320
           YS G+F+G C T L+H +  VGYG SE G  YW++KNSWG+ WG  GY  + R+     G
Sbjct: 271 YSKGIFDGPCPTSLDHAILIVGYG-SEGGKDYWIVKNSWGESWGMKGYMYMHRNTGNSNG 329

Query: 321 QCGIAMFASF 330
            CGI    SF
Sbjct: 330 VCGINQMPSF 339


>gi|225718114|gb|ACO14903.1| Cathepsin L precursor [Caligus clemensi]
          Length = 336

 Score =  232 bits (592), Expect = 2e-58,   Method: Compositional matrix adjust.
 Identities = 142/342 (41%), Positives = 200/342 (58%), Gaps = 26/342 (7%)

Query: 5   FLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVA 64
            L+ VL+I    AS A   +F +  +++ +E WK  +G+TY  S E   R +I+ +N + 
Sbjct: 6   LLLSVLVI----ASTANAVSFFDVVLSD-WESWKLMHGKTYSSSIEEKLRLKIYMENSLK 60

Query: 65  VERFNNAAI-GNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQV 123
           + R N+ A+ G   Y +++N + DL   EF+A   G++ ++ ++SL   GT    K+ Q+
Sbjct: 61  ISRHNSEALNGIHPYYMKMNHYGDLLHHEFVAMVNGYQYANKTASL--GGTYIPNKNIQL 118

Query: 124 PPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDN 176
           P  V+W E+GAVTPVK QGQC       A  A+EG +  K  +L+SLSEQ LVDC+    
Sbjct: 119 PTHVDWREEGAVTPVKNQGQCGSCWSFSATGALEGQDFRKTGKLISLSEQNLVDCSRKFG 178

Query: 177 NNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPND 236
           NNGC GG MD AF YI  NKGI  +A Y YEG+  G C     ++       + D+    
Sbjct: 179 NNGCEGGLMDFAFTYIRDNKGIDTEASYPYEGID-GHCH-YNPKNKGGSDIGFVDIKKGS 236

Query: 237 EESLLKAVAN-QPVSVAIDASAL--QFYSGGVF-NGYCET-FLNHGVTAVGYGT-SEEGI 290
           E+ L KAVA   P+SVAIDAS +  QFYS GV+    C +  L+HGV  VG+GT S  G 
Sbjct: 237 EKDLKKAVAGVGPISVAIDASHMSFQFYSHGVYVESKCSSEELDHGVLVVGFGTDSVSGE 296

Query: 291 KYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
            YWL+KNSW + WG+ GY ++ R+    +  CGIA  AS+PV
Sbjct: 297 DYWLVKNSWSEKWGDQGYIKMARN---KENMCGIASSASYPV 335


>gi|125552927|gb|EAY98636.1| hypothetical protein OsI_20560 [Oryza sativa Indica Group]
          Length = 449

 Score =  232 bits (592), Expect = 2e-58,   Method: Compositional matrix adjust.
 Identities = 135/310 (43%), Positives = 175/310 (56%), Gaps = 20/310 (6%)

Query: 33  KFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQE 92
           +FE W A++GR+Y    E + R   F DN   V   N A     SY L LN FADLT  E
Sbjct: 37  QFEAWCAEHGRSYATPGERAARLAAFADNAAFVAAHNGAPA---SYALALNAFADLTHDE 93

Query: 93  FIASQTGFKMSDHSSSLKANGTPFLYKSS---QVPPSVNWIEKGAVTPVKYQGQC----- 144
           F A++ G   +      +  G P+L        VP +V+W + GAVT VK QG C     
Sbjct: 94  FRAARLGRLAAAGPG--RDGGAPYLGVDGGVGAVPDAVDWRQSGAVTKVKDQGSCGACWS 151

Query: 145 --AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDA 202
             A  A+EGIN IK   L+SLSEQ+L+DC     N+GC GG MD A+K++++N GI  +A
Sbjct: 152 FSATGAMEGINKIKTGSLISLSEQELIDC-DRSYNSGCGGGLMDYAYKFVVKNGGIDTEA 210

Query: 203 VYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAI--DASALQF 260
            Y Y   + G C+  K +     I  Y+DVP N+E+ LL+AVA QPVSV I   A A Q 
Sbjct: 211 DYPYR-ETDGTCNKNKLKRRVVTIDGYKDVPANNEDMLLQAVAQQPVSVGICGSARAFQL 269

Query: 261 YSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQG 320
           YS G+F+G C T L+H +  VGYG SE G  YW++KNSWG+ WG  GY  + R+     G
Sbjct: 270 YSKGIFDGPCPTSLDHAILIVGYG-SEGGKDYWIVKNSWGESWGMKGYMYMHRNTGNSNG 328

Query: 321 QCGIAMFASF 330
            CGI    SF
Sbjct: 329 VCGINQMPSF 338


>gi|112490572|pdb|2FO5|A Chain A, Crystal Structure Of Recombinant Barley Cysteine
           Endoprotease B Isoform 2 (Ep-B2) In Complex With
           Leupeptin
 gi|112490573|pdb|2FO5|B Chain B, Crystal Structure Of Recombinant Barley Cysteine
           Endoprotease B Isoform 2 (Ep-B2) In Complex With
           Leupeptin
 gi|112490574|pdb|2FO5|C Chain C, Crystal Structure Of Recombinant Barley Cysteine
           Endoprotease B Isoform 2 (Ep-B2) In Complex With
           Leupeptin
 gi|112490575|pdb|2FO5|D Chain D, Crystal Structure Of Recombinant Barley Cysteine
           Endoprotease B Isoform 2 (Ep-B2) In Complex With
           Leupeptin
          Length = 262

 Score =  232 bits (591), Expect = 2e-58,   Method: Compositional matrix adjust.
 Identities = 126/254 (49%), Positives = 160/254 (62%), Gaps = 29/254 (11%)

Query: 121 SQVPPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCAT 173
           S +PPSV+W +KGAVT VK QG+C        V +VEGINAI+   LVSLSEQ+L+DC T
Sbjct: 2   SDLPPSVDWRQKGAVTGVKDQGKCGSCWAFSTVVSVEGINAIRTGSLVSLSEQELIDCDT 61

Query: 174 NDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHA---AQITNYE 230
            DN+ GC GG MD+AF+YI  N G+  +A Y Y   + G C+  +A  ++     I  ++
Sbjct: 62  ADND-GCQGGLMDNAFEYIKNNGGLITEAAYPYR-AARGTCNVARAAQNSPVVVHIDGHQ 119

Query: 231 DVPPNDEESLLKAVANQPVSVAIDAS--ALQFYSGGVFNGYCETFLNHGVTAVGYGTSEE 288
           DVP N EE L +AVANQPVSVA++AS  A  FYS GVF G C T L+HGV  VGYG +E+
Sbjct: 120 DVPANSEEDLARAVANQPVSVAVEASGKAFMFYSEGVFTGECGTELDHGVAVVGYGVAED 179

Query: 289 GIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV---------------S 333
           G  YW +KNSWG  WGE GY R+++D     G CGIAM AS+PV               +
Sbjct: 180 GKAYWTVKNSWGPSWGEQGYIRVEKDSGASGGLCGIAMEASYPVKTYSKPKPTPRRALGA 239

Query: 334 KESAQPSSADKSSA 347
           +ES   SS DK +A
Sbjct: 240 RESLNSSSVDKLAA 253


>gi|302763837|ref|XP_002965340.1| hypothetical protein SELMODRAFT_143126 [Selaginella moellendorffii]
 gi|302790566|ref|XP_002977050.1| hypothetical protein SELMODRAFT_232903 [Selaginella moellendorffii]
 gi|300155026|gb|EFJ21659.1| hypothetical protein SELMODRAFT_232903 [Selaginella moellendorffii]
 gi|300167573|gb|EFJ34178.1| hypothetical protein SELMODRAFT_143126 [Selaginella moellendorffii]
          Length = 300

 Score =  232 bits (591), Expect = 2e-58,   Method: Compositional matrix adjust.
 Identities = 126/310 (40%), Positives = 188/310 (60%), Gaps = 22/310 (7%)

Query: 34  FEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEF 93
           FE W A++G++Y    E ++R  IF D L  +E+ N  A+ N ++TL LNKF+DLT  EF
Sbjct: 2   FEGWAAKHGKSYSSDWEKARRLMIFSDTLAYIEKHN--ALPNTTFTLGLNKFSDLTNAEF 59

Query: 94  IASQTG-FKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQC-------A 145
            A+  G FK   +     A         S +P S++W ++GAVTP+K QGQC       A
Sbjct: 60  RANYVGKFKPPRYQDRRPAKDVDV--DVSSLPTSLDWRQEGAVTPIKDQGQCGSCWAFSA 117

Query: 146 VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYS 205
           +A++E  + +    LVSLSEQQL+DC T D   GC GGF +DAFK++++N G+T +  Y 
Sbjct: 118 IASIESAHFLATKELVSLSEQQLIDCDTVDQ--GCQGGFPEDAFKFVVENGGVTTEEAYP 175

Query: 206 YEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASALQF--YSG 263
           Y G + G C++ K  +   +IT Y+DV  +  ++L+KAV+  PV+V I  S   F  Y  
Sbjct: 176 YTGFA-GSCNANK--NKVVEITGYKDVTKDSADALMKAVSKTPVTVGICGSDQNFQNYRS 232

Query: 264 GVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCG 323
           G+ +G+C    +H V  +GYGT E G+ YW+IKNSWG  WGEDG+ R+++  +  +G CG
Sbjct: 233 GILSGHCSNSRDHAVLVIGYGT-EGGMPYWIIKNSWGTSWGEDGFMRIKK--EDGEGMCG 289

Query: 324 IAMFASFPVS 333
           +   +S+P +
Sbjct: 290 MNGQSSYPTT 299


>gi|449447027|ref|XP_004141271.1| PREDICTED: cysteine proteinase RD21a-like [Cucumis sativus]
          Length = 458

 Score =  231 bits (590), Expect = 3e-58,   Method: Compositional matrix adjust.
 Identities = 143/345 (41%), Positives = 192/345 (55%), Gaps = 25/345 (7%)

Query: 1   MAKYFLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKE-SAENSKRFEIFK 59
           MA  F + + + + S +S    RT DE  +   ++QW+A++G+ +    AE   RF IFK
Sbjct: 10  MALLFFLFIALSAASPSSIIPQRTDDE--VMALYDQWRAKHGKLHNNLGAEPENRFHIFK 67

Query: 60  DNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYK 119
           DNL  ++  N     N  Y L LN FADLT +E+ +   G K +  S   + +       
Sbjct: 68  DNLKFIDEINAQ---NLPYRLGLNVFADLTNEEYRSRYLGGKFASGSRRNRTSNRYLPRL 124

Query: 120 SSQVPPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCA 172
              +P S++W  KGAV PVK QG C        VA+VE IN I    L++LSEQ+LVDC 
Sbjct: 125 GDDLPDSIDWRAKGAVAPVKDQGSCGSCWAFSTVASVEAINQIVTGDLIALSEQELVDC- 183

Query: 173 TNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDV 232
               N GC GG MD AF++II+N G+  +  Y Y G      DS   +     I  YEDV
Sbjct: 184 DRSYNEGCNGGLMDYAFEFIIENGGLDTEEDYPYYGF-----DSSCIQYKKNAIDGYEDV 238

Query: 233 PPNDEESLLKAVANQPVSVAIDA-----SALQFYSGGVFNGYCETFLNHGVTAVGYGTSE 287
           P N+E++L KAV+ Q VSV   A      + Q Y  G+F G C T L+HGV  VGYG SE
Sbjct: 239 PVNNEKALQKAVSKQVVSVVSVAIEGGGRSFQLYQSGIFTGRCGTDLDHGVNVVGYG-SE 297

Query: 288 EGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
            G+ YW+++NSWG  WGE GY ++QR+I  P G CGIAM  S+P 
Sbjct: 298 GGVDYWIVRNSWGGSWGESGYVKMQRNIASPTGLCGIAMEPSYPT 342


>gi|1174171|gb|AAB41816.1| NTH1 [Pisum sativum]
          Length = 367

 Score =  231 bits (590), Expect = 3e-58,   Method: Compositional matrix adjust.
 Identities = 136/342 (39%), Positives = 203/342 (59%), Gaps = 25/342 (7%)

Query: 4   YFLIVVLIISGSCA-SQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNL 62
           Y LI+  +I+ S +   ++ R+  E  +   +E+W  ++ + Y    E ++RF+IFKDNL
Sbjct: 6   YSLILFGLITLSLSLDMSSGRSNKE--VMTMYEKWLVKHQKVYYGLGEKNQRFQIFKDNL 63

Query: 63  VAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKS-- 120
           + ++  N     N SY + LN+F+D+T +E+  +    + S+++   K     + YK+  
Sbjct: 64  IFIDEHNAP---NHSYRVGLNEFSDITNKEYRDTYLS-RWSNNNIKNKITSVRYAYKAGH 119

Query: 121 -SQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCA 172
            +++P SV+W  +GA+TP+K QG C       AVAAVE IN I    LVSLSEQ+LVDC 
Sbjct: 120 NNKLPVSVDW--RGALTPIKNQGSCGACWAFSAVAAVEAINKIVTGSLVSLSEQELVDC- 176

Query: 173 TNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDV 232
               N GC GG   +A+++I++N G+ +   Y Y G  +  C+  K       I  Y++V
Sbjct: 177 DRTKNKGCNGGNQVNAYRFIVENGGLDSQIDYPYLGRQS-TCNQAKKNTKVVSINGYKNV 235

Query: 233 PPNDEESLLKAVANQPVSVAIDASA--LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGI 290
             N E +L++AVANQPVSV I+A     Q Y  GVF G C T L+H V  VGYG SE G 
Sbjct: 236 QRNSESALMEAVANQPVSVGIEAYGKDFQLYQSGVFTGSCGTSLDHAVVVVGYG-SENGK 294

Query: 291 KYWLIKNSWGQDWGEDGYFRLQRDIDQPQ-GQCGIAMFASFP 331
            YWL+KNSWG +WGE GY +++R++     G+CGIAM A++P
Sbjct: 295 DYWLVKNSWGTNWGERGYLKIERNLKNTNTGKCGIAMDATYP 336


>gi|222641485|gb|EEE69617.1| hypothetical protein OsJ_29194 [Oryza sativa Japonica Group]
          Length = 360

 Score =  231 bits (590), Expect = 3e-58,   Method: Compositional matrix adjust.
 Identities = 130/307 (42%), Positives = 181/307 (58%), Gaps = 23/307 (7%)

Query: 30  IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLT 89
           + ++F  W+  + R+Y  + E  +RF++++ N   ++  N    G+ +Y L  N+FADLT
Sbjct: 47  MMDRFRAWQGAHNRSYPSAEEALQRFDVYRRNAEFIDAVN--LRGDLTYQLAENEFADLT 104

Query: 90  PQEFIASQTGFKMSDHS--SSLKANGT-----PFLYKSSQVPPSVNWIEKGAVTPVKYQ- 141
            +EF+A+ TG+   D     S+   G       F Y+   VP SV+W  +GAV P K Q 
Sbjct: 105 EEEFLATYTGYYAGDGPVDDSVITTGAGDVDASFSYRV-DVPASVDWRAQGAVVPPKSQT 163

Query: 142 GQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQ 194
             C+        A +E +N IK  +LVSLSEQQLVDC + D   GC  G    A+K++++
Sbjct: 164 STCSSCWAFVTAATIESLNMIKTGKLVSLSEQQLVDCDSYDG--GCNLGSYGRAYKWVVE 221

Query: 195 NKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAID 254
           N G+T +A Y Y     G C+  K+  HAA+IT +  VPP +E +L  AVA QPV+VAI+
Sbjct: 222 NGGLTTEADYPYTARR-GPCNRAKSAHHAAKITGFGKVPPRNEAALQAAVARQPVAVAIE 280

Query: 255 -ASALQFYSGGVFNGYCETFLNHGVTAVGYGT-SEEGIKYWLIKNSWGQDWGEDGYFRLQ 312
             S +QFY GGV+ G C T L H VT VGYGT +  G KYW IKNSWGQ WGE GY R+ 
Sbjct: 281 VGSGMQFYKGGVYTGPCGTRLAHAVTVVGYGTDASSGAKYWTIKNSWGQSWGERGYIRIL 340

Query: 313 RDIDQPQ 319
           RD+  P+
Sbjct: 341 RDVGGPR 347


>gi|91992508|gb|ABE72970.1| cathepsin L [Aedes aegypti]
          Length = 339

 Score =  231 bits (590), Expect = 3e-58,   Method: Compositional matrix adjust.
 Identities = 138/321 (42%), Positives = 187/321 (58%), Gaps = 23/321 (7%)

Query: 30  IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNA-AIGNRSYTLRLNKFADL 88
           + E++  +K Q+ + Y    E   R +I+  N   + + N    +G   Y LR+NK+ADL
Sbjct: 23  VKEEWNAFKLQHRKNYDSETEERIRLKIYVQNKHKIAKHNQRFDLGQEKYRLRVNKYADL 82

Query: 89  TPQEFIASQTGFKMSDHSSSLKA----NGTPFLYKSS-QVPPSVNWIEKGAVTPVKYQGQ 143
             +EF+ +  GF  +D   SLK         F+  ++ +VP +V+W +KGAVTPVK QG 
Sbjct: 83  LHEEFVQTVNGFNRTDSKKSLKGVRIEEPVTFIEPANVEVPTTVDWRKKGAVTPVKDQGH 142

Query: 144 C-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNK 196
           C       A  A+EG +  K  +LVSLSEQ LVDC+    NNGC GG MD AF+YI  N 
Sbjct: 143 CGSCWSFSATGALEGQHFRKTGKLVSLSEQNLVDCSGKYGNNGCNGGMMDYAFQYIKDNG 202

Query: 197 GITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVAN-QPVSVAIDA 255
           GI  +  Y YE +      + KA    A    Y D+P  DEE+L KA+A   PVS+AIDA
Sbjct: 203 GIDTEKSYPYEAIDDTCHFNPKAV--GATDKGYVDIPQGDEEALKKALATVGPVSIAIDA 260

Query: 256 S--ALQFYSGGVF-NGYCETF-LNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRL 311
           S  + QFYS GV+    C++  L+HGV AVGYGTSEEG  YWL+KNSWG  WG+ GY ++
Sbjct: 261 SHESFQFYSEGVYYEPQCDSENLDHGVLAVGYGTSEEGEDYWLVKNSWGTTWGDQGYVKM 320

Query: 312 QRDIDQPQGQCGIAMFASFPV 332
            R+ D     CG+A  AS+P+
Sbjct: 321 ARNHDN---HCGVATCASYPL 338


>gi|297845822|ref|XP_002890792.1| hypothetical protein ARALYDRAFT_473117 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297336634|gb|EFH67051.1| hypothetical protein ARALYDRAFT_473117 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 322

 Score =  231 bits (590), Expect = 3e-58,   Method: Compositional matrix adjust.
 Identities = 138/337 (40%), Positives = 201/337 (59%), Gaps = 33/337 (9%)

Query: 6   LIVVLIISGSCA-SQAT-YRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLV 63
           L+ + I+S     SQA  + T +E SI +  +QW  Q+ R Y++ +E   R ++FK NL 
Sbjct: 8   LVALTILSMDLRISQARPHVTLNEQSIVDYHQQWMTQFSRVYQDESEKEMRLQVFKKNLK 67

Query: 64  AVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGT--PFLYKSS 121
            +E FNN  +GN+SYT+ +N+F D T +EF+A+ TG +++  + S   N T     +  S
Sbjct: 68  FIENFNN--MGNQSYTVGVNEFTDWTIEEFLATHTGLRVNVTTLSELFNETMPSRNWNIS 125

Query: 122 QVP---PSVNWIEKGAVTPVKYQGQCAVAAVEGINAIKINRLVSLSEQQLVDCATNDNNN 178
            +     S +W ++GAV PVK QG C +  + G N      L++LSEQQL+DC T + N 
Sbjct: 126 DIDIDDESKDWRDEGAVIPVKVQGACGLTKISGKN------LLTLSEQQLIDCDT-EKNT 178

Query: 179 GCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEE 238
           GC GG +++AFKYII+N G++ +  Y Y+ +  G C +        QI  +E VP ++E 
Sbjct: 179 GCDGGGIEEAFKYIIKNGGVSLETEYPYQ-VKKGSCRANARSATQTQIRGFEMVPSHNER 237

Query: 239 SLLKAVANQPVSVAIDASALQF--YSGGVFNGY-CETFLNHGVTAVGYGTSEEGIKYWLI 295
           +LL+AV  QPVSV IDA A  F  Y GGV+ G  C T +NH VT VGYGT        +I
Sbjct: 238 ALLEAVRRQPVSVLIDARADSFKTYKGGVYAGLDCGTDVNHAVTFVGYGT--------MI 289

Query: 296 KNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
                Q WGE+GY R++RD++ PQG CGIA  A++P+
Sbjct: 290 -----QSWGENGYMRIRRDVEWPQGMCGIAQVAAYPI 321


>gi|157132324|ref|XP_001655999.1| cathepsin l [Aedes aegypti]
 gi|108881694|gb|EAT45919.1| AAEL002833-PA [Aedes aegypti]
          Length = 339

 Score =  231 bits (589), Expect = 3e-58,   Method: Compositional matrix adjust.
 Identities = 138/321 (42%), Positives = 187/321 (58%), Gaps = 23/321 (7%)

Query: 30  IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNA-AIGNRSYTLRLNKFADL 88
           + E++  +K Q+ + Y    E   R +I+  N   + + N    +G   Y LR+NK+ADL
Sbjct: 23  VKEEWNAFKLQHRKNYDSETEERIRLKIYVQNKHKIAKHNQRFDLGQEKYRLRVNKYADL 82

Query: 89  TPQEFIASQTGFKMSDHSSSLKA----NGTPFLYKSS-QVPPSVNWIEKGAVTPVKYQGQ 143
             +EF+ +  GF  +D   SLK         F+  ++ +VP +V+W +KGAVTPVK QG 
Sbjct: 83  LHEEFVQTVNGFNRTDSKKSLKGVRIEEPVTFIEPANVEVPTTVDWRKKGAVTPVKDQGH 142

Query: 144 C-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNK 196
           C       A  A+EG +  K  +LVSLSEQ LVDC+    NNGC GG MD AF+YI  N 
Sbjct: 143 CGSCWSFSATGALEGQHFRKTGKLVSLSEQNLVDCSGKYGNNGCNGGMMDYAFQYIKDNG 202

Query: 197 GITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVAN-QPVSVAIDA 255
           GI  +  Y YE +      + KA    A    Y D+P  DEE+L KA+A   PVS+AIDA
Sbjct: 203 GIDTEKSYPYEAIDDTCHFNPKAV--GATDKGYVDIPQGDEEALKKALATVGPVSIAIDA 260

Query: 256 S--ALQFYSGGVF-NGYCETF-LNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRL 311
           S  + QFYS GV+    C++  L+HGV AVGYGTSEEG  YWL+KNSWG  WG+ GY ++
Sbjct: 261 SHESFQFYSEGVYYEPQCDSENLDHGVLAVGYGTSEEGEDYWLVKNSWGTTWGDQGYVKM 320

Query: 312 QRDIDQPQGQCGIAMFASFPV 332
            R+ D     CG+A  AS+P+
Sbjct: 321 ARNRDN---HCGVATCASYPL 338


>gi|302790570|ref|XP_002977052.1| hypothetical protein SELMODRAFT_268054 [Selaginella moellendorffii]
 gi|300155028|gb|EFJ21661.1| hypothetical protein SELMODRAFT_268054 [Selaginella moellendorffii]
          Length = 300

 Score =  231 bits (589), Expect = 4e-58,   Method: Compositional matrix adjust.
 Identities = 126/310 (40%), Positives = 187/310 (60%), Gaps = 22/310 (7%)

Query: 34  FEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEF 93
           FE W A++G++Y    E ++R  IF D L  +E+ N  A+ N ++TL LNKF+DLT  EF
Sbjct: 2   FEGWAAKHGKSYSSDWEKARRLMIFSDTLAYIEKHN--ALPNTTFTLGLNKFSDLTNAEF 59

Query: 94  IASQTG-FKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQC-------A 145
            A+  G FK   +     A         S +P S++W ++GAVTP+K QGQC       A
Sbjct: 60  RANYVGKFKPPRYQDRRPAKDVDV--DVSSLPTSLDWRQEGAVTPIKDQGQCGSCWAFSA 117

Query: 146 VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYS 205
           +A++E  + +    LVSLSEQQL+DC T D   GC GGF +DAFK++++N G+T +  Y 
Sbjct: 118 IASIESAHFLATKELVSLSEQQLIDCDTVDQ--GCQGGFPEDAFKFVVENGGVTTEEAYP 175

Query: 206 YEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASALQF--YSG 263
           Y G + G C++ K  +   +IT Y+DV  +  ++L+KAV+  PV+V I  S   F  Y  
Sbjct: 176 YTGFA-GSCNANK--NKVVEITGYKDVTKDSADALMKAVSKTPVTVGICGSDQNFQNYRS 232

Query: 264 GVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCG 323
           G+ +G+C    +H V  +GYGT E G+ YW+IKNSWG  WGEDG+ R+++     +G CG
Sbjct: 233 GILSGHCSNSRDHAVLVIGYGT-EGGMPYWIIKNSWGTSWGEDGFMRIKK--KDGEGMCG 289

Query: 324 IAMFASFPVS 333
           +   +S+P +
Sbjct: 290 MNGQSSYPTT 299


>gi|356517306|ref|XP_003527329.1| PREDICTED: LOW QUALITY PROTEIN: thiol protease SEN102-like [Glycine
           max]
          Length = 333

 Score =  231 bits (589), Expect = 4e-58,   Method: Compositional matrix adjust.
 Identities = 140/338 (41%), Positives = 202/338 (59%), Gaps = 21/338 (6%)

Query: 4   YFLIVVLIISGSCASQATYRTFDEGSI-AEKFEQWKAQYGRTYKESAENSKRFEIFKDNL 62
           Y L++ LI++   +     R    G I +E+ E+W AQYG+ YK++ E  KRF++FK+N+
Sbjct: 9   YVLVLFLILTVWIS-----RVMSRGLIRSERHEKWIAQYGKVYKDAVE-EKRFQVFKNNV 62

Query: 63  VAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFL--YKS 120
             +E FN  A G++ + L +N+F DL  +EF A      +   +S ++    P +   K 
Sbjct: 63  QFIESFN--AAGDKPFNLSINQFVDLHDEEFKA--LLINVQKKASGVETVKEPAMDIQKL 118

Query: 121 SQVPPSVNWIEKGAVTPVKYQGQCAVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGC 180
           ++     N  +K    P+   G   +A +E ++ I I  LV LSEQ+LVDC   D+   C
Sbjct: 119 TEEACRENXKKKNEKKPMWDLGFFLIATIESLHQITIGELVFLSEQELVDCVRGDSE-AC 177

Query: 181 YGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDH-AAQITNYEDVPPND-EE 238
           +GGF+++AF++I    GIT++A Y Y+G     C  +K E H  A+   YE VP N+ E+
Sbjct: 178 HGGFVENAFEFIANKGGITSEAYYPYKGKDRS-C-KVKKETHGVARNIGYEKVPSNNSEK 235

Query: 239 SLLKAVANQPVSVAIDASA--LQFYSGGVFNGY-CETFLNHGVTAVGYGTSEEGIKYWLI 295
           +LLKAVANQPVSV IDA A   +FYS G+FN   C T L+H  T VGYG   +G KYWL+
Sbjct: 236 ALLKAVANQPVSVYIDAGAPAYKFYSSGIFNARNCGTHLDHAATVVGYGKLHDGTKYWLV 295

Query: 296 KNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVS 333
           KNSW   WGE GY R++RDI   +G CGIA  AS+P++
Sbjct: 296 KNSWSTAWGEKGYIRMKRDIHSKKGLCGIASNASYPIA 333


>gi|390337642|ref|XP_780653.3| PREDICTED: cathepsin L-like [Strongylocentrotus purpuratus]
          Length = 333

 Score =  231 bits (588), Expect = 5e-58,   Method: Compositional matrix adjust.
 Identities = 144/343 (41%), Positives = 203/343 (59%), Gaps = 28/343 (8%)

Query: 5   FLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVA 64
           +L V+L+   +C   +   +F +    E + +WK ++G+ Y    E + R  I++ NL  
Sbjct: 3   YLSVLLV--AACVVSSLSMSFTD--FDEDWNEWKNEHGKRYLSDEEEASRRLIWQKNLDI 58

Query: 65  VERFN-NAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSS-- 121
           V + N    +G+ +Y L +N+F DL  +EF+A  TGF++S   +S  A G+ FL  ++  
Sbjct: 59  VIKHNLKYDLGHFTYDLGINQFTDLQNEEFVAMMTGFRVS--GTSKAAKGSTFLPPNNVG 116

Query: 122 QVPPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATN 174
           ++P +V+W  KG VTPVK QGQC          +VEG +     +LVSLSEQ LVDC+  
Sbjct: 117 ELPKTVDWRTKGYVTPVKDQGQCGSCWAFSTTGSVEGQHFKATGKLVSLSEQNLVDCSGR 176

Query: 175 DNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPP 234
           D   GC GGFMD AF+YII   GI  +A Y Y+ +  G C   KA +  A +T Y DV  
Sbjct: 177 DA--GCDGGFMDRAFQYIIDAGGIDTEASYPYKAVD-GKCHFKKA-NVGATVTGYTDVTS 232

Query: 235 NDEESLLKAVAN-QPVSVAIDASALQF--YSGGVFN--GYCETFLNHGVTAVGYGTSEEG 289
             E++L KAVA+  P+SVAIDAS + F  Y  GV+N  G   T L+HGV AVGYGTS +G
Sbjct: 233 GSEKALQKAVAHVGPISVAIDASHMSFQHYKSGVYNEPGCDSTVLDHGVLAVGYGTSSDG 292

Query: 290 IKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
             YW++KNSW + WG +GY  + R+ D    QCGIA  AS+P+
Sbjct: 293 TDYWIVKNSWAETWGMNGYVWMSRNKDN---QCGIATNASYPL 332


>gi|307103885|gb|EFN52142.1| hypothetical protein CHLNCDRAFT_139276 [Chlorella variabilis]
          Length = 388

 Score =  231 bits (588), Expect = 6e-58,   Method: Compositional matrix adjust.
 Identities = 138/319 (43%), Positives = 186/319 (58%), Gaps = 24/319 (7%)

Query: 31  AEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTP 90
            + F QW+  +GR+YK ++E  KR  +F +N   V   N     N    L LN+FADLT 
Sbjct: 43  GQAFSQWQMTHGRSYKSASEARKRQAVFVENAKHVAEQNAR---NSGLVLALNQFADLTL 99

Query: 91  QEFIASQTGFKMSDHSSSLKANGTPFLY-KSSQVPPSVNWIEKGAVTPVKYQGQC----- 144
           +EF A+  G+  S      +   T F Y  ++ +P +V+W +K AVTPVK Q  C     
Sbjct: 100 EEFAATHLGYNPSLREGK-EHTTTSFQYADANDLPSTVDWRKKNAVTPVKNQAMCGSCWA 158

Query: 145 --AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDA 202
             A  AVEGINAI+  +LVSLSEQQLVDC  ++ + GC GG MD AF YI +N GI ++ 
Sbjct: 159 FSATGAVEGINAIRTGKLVSLSEQQLVDC-DSEKDLGCGGGLMDFAFDYITKNGGIDSED 217

Query: 203 VYSYEGMSTGICDSIK-AEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASALQFY 261
            YSY G    IC   K A+ H   I  +EDVP ND E+L KA+A+QPVS+        ++
Sbjct: 218 DYSYWGYGL-ICQRRKEADRHVVTIDGFEDVPKNDGEALKKAIAHQPVSL--------YH 268

Query: 262 SGGVFNGYCETFLNHGVTAVGYGT-SEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQG 320
           SG V +  C   LNHGV AVGY   S+ G  +++IKNSWG+ WGE G+FRL     +  G
Sbjct: 269 SGVVGDDACCQDLNHGVLAVGYDDGSKGGTPHYVIKNSWGEGWGEQGFFRLAAKSSEASG 328

Query: 321 QCGIAMFASFPVSKESAQP 339
            CG+   AS+P+ K++  P
Sbjct: 329 ACGVYKAASYPLKKDATNP 347


>gi|391338876|ref|XP_003743781.1| PREDICTED: cathepsin L-like isoform 4 [Metaseiulus occidentalis]
          Length = 336

 Score =  230 bits (587), Expect = 7e-58,   Method: Compositional matrix adjust.
 Identities = 139/350 (39%), Positives = 204/350 (58%), Gaps = 36/350 (10%)

Query: 1   MAKYFLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKD 60
           ++  FLI+ +++  + A+    + FD      +++ +K  + + Y+ S   + R +IF  
Sbjct: 4   LSMKFLILAVLVGAASAALTLEQLFDA-----EWQNFKVHHNKKYEGSTVEAFRKKIFLQ 58

Query: 61  NLVAVERFN-NAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLY- 118
           N   + R N   A G  +Y L++N+F D+   EF+++  G         L++N T F   
Sbjct: 59  NTHLIARHNIKHAKGETTYKLKMNQFGDMLHHEFVSTMNGL--------LRSNRTYFGST 110

Query: 119 ----KSSQVPPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQ 167
               +S  +P SV+W EKGAVTPVK QG C          A+EG    K   LVSLSEQ 
Sbjct: 111 WIEPESVSLPKSVDWREKGAVTPVKNQGHCGSCWSFSTTGALEGQLFRKTGELVSLSEQN 170

Query: 168 LVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQIT 227
           L+DC+T+  NNGC GG MD+AF YI +N GI  +  Y YEG   G C   K ED A + T
Sbjct: 171 LIDCSTSYGNNGCGGGLMDNAFTYIKENHGIDTEESYPYEG-KQGKCRYHK-EDSAGRDT 228

Query: 228 NYEDVPPNDEESLLKAVAN-QPVSVAIDAS--ALQFYSGGVFNGY-CETF-LNHGVTAVG 282
            + D+P  +E +L KA+A   PVSVAIDAS  + QFY  GV+N   C++  L+HGV AVG
Sbjct: 229 GFVDIPSGNERALAKALATIGPVSVAIDASHESFQFYHEGVYNPPDCDSHSLDHGVLAVG 288

Query: 283 YGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
           YGT+++G  Y++IKNSWG+ WG++GY  + R+    + +CG+A  AS+P+
Sbjct: 289 YGTTDDGQDYYIIKNSWGERWGQEGYVLMARN---SKNECGVATQASYPL 335


>gi|194352758|emb|CAQ00107.1| papain-like cysteine proteinase [Hordeum vulgare subsp. vulgare]
          Length = 457

 Score =  230 bits (586), Expect = 8e-58,   Method: Compositional matrix adjust.
 Identities = 132/314 (42%), Positives = 176/314 (56%), Gaps = 20/314 (6%)

Query: 33  KFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAI----GNRSYTLRLNKFADL 88
           +FE W A++G+ Y    E + R   F +N   V   N+A      G  SYTL LN FADL
Sbjct: 38  QFEAWCAEHGKAYATPGERAARLAAFAENAAFVAAHNDAVASSGPGGPSYTLALNAFADL 97

Query: 89  TPQEFIASQTGFKMSDHSSSLKANGTP---FLYKSSQVPPSVNWIEKGAVTPVKYQGQC- 144
           T  EF A++ G +++     L A       F  +   VP +++W + GAVT VK QG C 
Sbjct: 98  THDEFRAARLG-RLAVGPGPLGAPSPSDGGFEGRVGAVPDALDWRQSGAVTKVKDQGSCG 156

Query: 145 ------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGI 198
                 A  A+EGIN I    L+SLSEQ+L+DC     N GC GG M  A+K++I+N GI
Sbjct: 157 ACWSFSATGAMEGINKITTGSLLSLSEQELIDC-DRSYNTGCGGGLMTYAYKFVIKNGGI 215

Query: 199 TNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAI--DAS 256
             +  Y +   + G C+  K + H   I  Y++VP + E+ LL+AVA QP+SV I   A 
Sbjct: 216 DTEDDYPFR-EADGTCNKNKLKKHVVTIDGYKEVPSSKEDLLLQAVAQQPISVGICGSAR 274

Query: 257 ALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDID 316
           A Q YS G+F+G C T L+H V  VGYG SE G  YW++KNSWG+ WG  GY  + R+  
Sbjct: 275 AFQLYSQGIFDGPCPTSLDHAVLIVGYG-SEGGKDYWIVKNSWGERWGMKGYMHMHRNTG 333

Query: 317 QPQGQCGIAMFASF 330
              G CGI M ASF
Sbjct: 334 SSSGICGINMMASF 347


>gi|242048430|ref|XP_002461961.1| hypothetical protein SORBIDRAFT_02g011230 [Sorghum bicolor]
 gi|241925338|gb|EER98482.1| hypothetical protein SORBIDRAFT_02g011230 [Sorghum bicolor]
          Length = 380

 Score =  229 bits (585), Expect = 1e-57,   Method: Compositional matrix adjust.
 Identities = 142/358 (39%), Positives = 193/358 (53%), Gaps = 44/358 (12%)

Query: 16  CASQATYRTF---------DEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVE 66
           C+S   +R +         D   + E+F++WKA Y ++Y   AE+ +RF ++  N+  +E
Sbjct: 25  CSSATAHRPYAGDMGSSTDDNSPMIERFQRWKAAYNKSYATVAEDRRRFLVYARNMAYIE 84

Query: 67  RFNNAAIG-NRSYTLRLNKFADLTPQEFIASQTGFK------------------MSDHSS 107
             N  A     +Y L    + DLT QEF+A  T                     ++  + 
Sbjct: 85  ATNAEAEAAGLTYELGETAYTDLTNQEFMAMYTAAPSPAQLPADEDEDDAAEAVITTRAG 144

Query: 108 SLKANGTPFLY--KSSQVPPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKIN 158
            + A G   +Y   S+  P SV+W   GAVTPVK QG+C        VA VEGI  I+  
Sbjct: 145 PVDAVGQLPVYVNLSTAAPASVDWRASGAVTPVKNQGRCGSCWAFSTVAVVEGIYQIRTG 204

Query: 159 RLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIK 218
           +LVSLSEQ+LVDC T D   GC GG    A ++I  N G+T +  Y Y G +T  C+  K
Sbjct: 205 KLVSLSEQELVDCDTLDA--GCDGGISYRALRWITSNGGLTTEEDYPYTG-TTDACNRAK 261

Query: 219 AEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA--LQFYSGGVFNGYCETFLNH 276
              +AA I     V    E SL  AVA QPV+V+I+A     Q Y  GV+NG C T LNH
Sbjct: 262 LAHNAASIAGLRRVATRSEASLANAVAGQPVAVSIEAGGDNFQHYKRGVYNGPCGTSLNH 321

Query: 277 GVTAVGYGTSEE-GIKYWLIKNSWGQDWGEDGYFRLQRDI-DQPQGQCGIAMFASFPV 332
           GVT VGYG  EE G KYW+IKNSWG  WG+ GY ++++D+  +P+G CGIA+  SFP+
Sbjct: 322 GVTVVGYGQEEEDGDKYWIIKNSWGASWGDGGYIKMRKDVAGKPEGLCGIAIRPSFPL 379


>gi|386648114|gb|AFJ15104.1| mexicain-like cystein protease, partial [Jacaratia mexicana]
          Length = 323

 Score =  229 bits (585), Expect = 1e-57,   Method: Compositional matrix adjust.
 Identities = 132/309 (42%), Positives = 179/309 (57%), Gaps = 22/309 (7%)

Query: 34  FEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEF 93
           FE W  +  + YK   E   RFEIFKDNL+ ++  N     N SY L LN+FADLT  EF
Sbjct: 22  FESWTLENDKIYKNIDEKIYRFEIFKDNLMYIDETNKK---NSSYWLGLNEFADLTHDEF 78

Query: 94  IASQTGFKMSDHSSSLKANGTPFLYKSS-QVPPSVNWIEKGAVTPVKYQGQCA------- 145
            A   G    D +   +++   F YK     P S++W +KGAVTPVK Q  C        
Sbjct: 79  KAKYVGSLGEDSTIIEQSDDEEFPYKHVVDYPESIDWRQKGAVTPVKNQNPCGSCWAFST 138

Query: 146 VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYS 205
           VA VEGIN I   +L+SLSEQ+L+DC  +  ++GC GG+   + +Y+  N G+  +  Y 
Sbjct: 139 VATVEGINKIVTGKLISLSEQELLDC--DRRSHGCKGGYQTTSLQYVADN-GVHTEKEYP 195

Query: 206 YEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDAS--ALQFYSG 263
           YE    G C +   +    +IT Y+ VP N+E SL++A+ANQPVSV +++   A QFY G
Sbjct: 196 YE-KKQGKCRAKDKKGSKVKITGYKRVPANNEVSLIQAIANQPVSVVVESKGRAFQFYKG 254

Query: 264 GVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCG 323
           G+F G C T ++H VTAVGYG +     Y LIKNSWG  WGE GY R++R   + +G CG
Sbjct: 255 GIFEGPCGTKVDHAVTAVGYGKN-----YILIKNSWGPKWGEKGYIRIKRASGKSKGTCG 309

Query: 324 IAMFASFPV 332
           +   + FP 
Sbjct: 310 VYSSSYFPT 318


>gi|391338870|ref|XP_003743778.1| PREDICTED: cathepsin L-like isoform 1 [Metaseiulus occidentalis]
 gi|391338872|ref|XP_003743779.1| PREDICTED: cathepsin L-like isoform 2 [Metaseiulus occidentalis]
 gi|391338874|ref|XP_003743780.1| PREDICTED: cathepsin L-like isoform 3 [Metaseiulus occidentalis]
          Length = 331

 Score =  229 bits (585), Expect = 1e-57,   Method: Compositional matrix adjust.
 Identities = 139/346 (40%), Positives = 202/346 (58%), Gaps = 36/346 (10%)

Query: 5   FLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVA 64
           FLI+ +++  + A+    + FD      +++ +K  + + Y+ S   + R +IF  N   
Sbjct: 3   FLILAVLVGAASAALTLEQLFDA-----EWQNFKVHHNKKYEGSTVEAFRKKIFLQNTHL 57

Query: 65  VERFN-NAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLY----- 118
           + R N   A G  +Y L++N+F D+   EF+++  G         L++N T F       
Sbjct: 58  IARHNIKHAKGETTYKLKMNQFGDMLHHEFVSTMNGL--------LRSNRTYFGSTWIEP 109

Query: 119 KSSQVPPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDC 171
           +S  +P SV+W EKGAVTPVK QG C          A+EG    K   LVSLSEQ L+DC
Sbjct: 110 ESVSLPKSVDWREKGAVTPVKNQGHCGSCWSFSTTGALEGQLFRKTGELVSLSEQNLIDC 169

Query: 172 ATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYED 231
           +T+  NNGC GG MD+AF YI +N GI  +  Y YEG   G C   K ED A + T + D
Sbjct: 170 STSYGNNGCGGGLMDNAFTYIKENHGIDTEESYPYEG-KQGKCRYHK-EDSAGRDTGFVD 227

Query: 232 VPPNDEESLLKAVAN-QPVSVAIDAS--ALQFYSGGVFNGY-CETF-LNHGVTAVGYGTS 286
           +P  +E +L KA+A   PVSVAIDAS  + QFY  GV+N   C++  L+HGV AVGYGT+
Sbjct: 228 IPSGNERALAKALATIGPVSVAIDASHESFQFYHEGVYNPPDCDSHSLDHGVLAVGYGTT 287

Query: 287 EEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
           ++G  Y++IKNSWG+ WG++GY  + R+    + +CG+A  AS+P+
Sbjct: 288 DDGQDYYIIKNSWGERWGQEGYVLMARN---SKNECGVATQASYPL 330


>gi|449469176|ref|XP_004152297.1| PREDICTED: vignain-like [Cucumis sativus]
          Length = 340

 Score =  229 bits (584), Expect = 2e-57,   Method: Compositional matrix adjust.
 Identities = 140/345 (40%), Positives = 204/345 (59%), Gaps = 22/345 (6%)

Query: 1   MAKYFLIVVLII---SGSCASQATYRT-FD-EGSIAEKFEQWKAQYGRTYKESAENSKRF 55
           M K+ ++ V++I   S  C      R  F+ E S+ + +++W + + R  + + E  KRF
Sbjct: 3   MMKFLIVFVVLIAFASHLCEGFDLERKDFESEKSLMQLYKRWSSHH-RISRNAHEMHKRF 61

Query: 56  EIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSS-SLKANGT 114
           +IF+DN   V + N+     +S  LRLN+FADL+  EF +   G  ++ +++   KA G 
Sbjct: 62  KIFQDNAKRVFKVNHMG---KSLKLRLNQFADLSDDEF-SMMYGSNITHYNNLHAKAGGR 117

Query: 115 P--FLY-KSSQVPPSVNWIEKGAVTPVKYQGQCAVAAVEGINAIKINRLVSLSEQQLVDC 171
              F+Y ++  +P S++W EKGAV  +K QG CAVAAVE I+ IK N LVSLSEQ++VDC
Sbjct: 118 VGGFMYERAMNIPFSIDWREKGAVNAIKNQGLCAVAAVESIHQIKTNELVSLSEQEVVDC 177

Query: 172 ATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYED 231
             +    GC GG  D AF++I+QN GIT +  Y Y     G C           I  YE 
Sbjct: 178 --DYKVGGCRGGNYDSAFEFIMQNGGITIEENYPYFA-GNGYCRRRGPNSERVTIDGYEC 234

Query: 232 VPPNDEESLLKAVANQPVSVAIDASA--LQFYSGGVFN--GYCETFLNHGVTAVGYGTSE 287
           VP N+E +L+KAVA+QPV+V++ +S    +FY  G+     +C   ++H V  VGYG+ E
Sbjct: 235 VPQNNEYALMKAVAHQPVAVSVASSGSDFRFYGEGMLREGSFCGYRIDHTVVVVGYGSDE 294

Query: 288 EGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
           EG  YW+I+N +G  WG +GY ++QR    PQG CG+AM  SFPV
Sbjct: 295 EG-DYWIIRNQYGTQWGMNGYMKMQRGTRNPQGVCGMAMQPSFPV 338


>gi|242040563|ref|XP_002467676.1| hypothetical protein SORBIDRAFT_01g032090 [Sorghum bicolor]
 gi|241921530|gb|EER94674.1| hypothetical protein SORBIDRAFT_01g032090 [Sorghum bicolor]
          Length = 358

 Score =  229 bits (583), Expect = 2e-57,   Method: Compositional matrix adjust.
 Identities = 123/315 (39%), Positives = 183/315 (58%), Gaps = 22/315 (6%)

Query: 30  IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLT 89
           + ++F +W+A Y R+Y  + E  +RF++++ N+  +E  N A  GN +YTL  N+FADLT
Sbjct: 53  MMDRFLRWQATYNRSYPTAEERQRRFQVYRRNMEHIEATNRA--GNLTYTLGENQFADLT 110

Query: 90  PQEFIASQTGFKM----SDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQG-QC 144
            +EF+   T   M     D     +AN +  +      P SV+W  +GAVTP+K QG  C
Sbjct: 111 EEEFLDLYTMKGMPPVRRDAGKKQQANFSSVV----DAPTSVDWRSRGAVTPIKNQGPSC 166

Query: 145 A-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKG 197
           +        A +E I  I+  +LVSLSEQ+L+DC   D   GC  G+  + +K++IQN G
Sbjct: 167 SSCWAFVTAATIESITQIRTGKLVSLSEQELIDCDPYDG--GCNLGYFVNGYKWVIQNGG 224

Query: 198 ITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA 257
           +T +A Y Y+      C+  KA   AA+I+NY  +P  + +           +      +
Sbjct: 225 LTTEANYPYQARRYQ-CNRSKAGQRAARISNYRQLPQGEAQLQQAVAQQPVAAAIEMGGS 283

Query: 258 LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQ 317
           LQFYSGGV++G C T +NH +T VGYG    G+KYWL+KNSWGQ WGE GY R+++D+ Q
Sbjct: 284 LQFYSGGVWSGQCGTRMNHAITVVGYGADSSGVKYWLVKNSWGQTWGERGYLRMRKDVRQ 343

Query: 318 PQGQCGIAMFASFPV 332
             G CGIA+  ++P+
Sbjct: 344 G-GLCGIALDLAYPI 357


>gi|33348836|gb|AAQ16118.1| cathepsin L-like cysteine proteinase B [Rhipicephalus
           haemaphysaloides haemaphysaloides]
          Length = 335

 Score =  228 bits (581), Expect = 3e-57,   Method: Compositional matrix adjust.
 Identities = 141/347 (40%), Positives = 200/347 (57%), Gaps = 28/347 (8%)

Query: 1   MAKYFLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKD 60
           M  Y ++  L ++   A+  T++      +  ++  +KA +G+ Y    E   R +I+ +
Sbjct: 1   MRGYIVLCCLFVT---AAAITHQEL----VGAEWSAFKALHGKDYASDTEEYYRLKIYME 53

Query: 61  NLVAVERFNNA-AIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKAN--GTPFL 117
           N + + R N   A    SY L +N+F DL   EF++++ GFK +   S  + +    P  
Sbjct: 54  NRLKIARHNEKYAKSQVSYKLAMNEFGDLLHHEFVSTRNGFKRNYRDSPREGSFFVEPEG 113

Query: 118 YKSSQVPPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVD 170
           ++  Q+P +V+W +KGAVTPVK QGQC          ++EG +  K  +LVSLSEQ LVD
Sbjct: 114 FEDLQLPKTVDWRKKGAVTPVKNQGQCGSCWAFSTTGSLEGPHFRKTRKLVSLSEQNLVD 173

Query: 171 CATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYE 230
           C+ +  NNGC GG MD+AFKYI  NKGI  +  Y Y   + G+C      D  A  T + 
Sbjct: 174 CSRSFGNNGCEGGLMDNAFKYIKSNKGIDTEWSYPYNA-TDGVC-HFNRSDVGATDTGFV 231

Query: 231 DVPPNDEESLLKAVAN-QPVSVAIDAS--ALQFYSGGVFN-GYCETF-LNHGVTAVGYGT 285
           D+P  DE  L KAVA   PVSVAIDAS  + QFYS GV++   C +  L+HGV  VGYGT
Sbjct: 232 DIPEGDENKLKKAVAAVGPVSVAIDASHESFQFYSEGVYDEPECSSEQLDHGVLVVGYGT 291

Query: 286 SEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
            ++G  YWL+KNSWG  WG++GY  + R+ D    QCGIA  AS+P+
Sbjct: 292 -KDGQDYWLVKNSWGTTWGDEGYIYMTRNKDN---QCGIASSASYPL 334


>gi|346466067|gb|AEO32878.1| hypothetical protein [Amblyomma maculatum]
          Length = 358

 Score =  228 bits (581), Expect = 3e-57,   Method: Compositional matrix adjust.
 Identities = 136/318 (42%), Positives = 190/318 (59%), Gaps = 21/318 (6%)

Query: 30  IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNR-SYTLRLNKFADL 88
           +  ++  +KA +G+ Y    E   R +I+ +N + + R N     N+ SY L +N+F DL
Sbjct: 46  VGAEWSAFKALHGKEYHSETEEYYRLKIYMENRLKIARHNEKYANNKASYKLAMNEFGDL 105

Query: 89  TPQEFIASQTGFKMSDHSSSLKANG--TPFLYKSSQVPPSVNWIEKGAVTPVKYQGQCA- 145
              EF++++ GFK +  S+  + +    P   +   +P +V+W +KGAVTPVK QGQC  
Sbjct: 106 LHHEFVSTRNGFKRNYRSTPREGSFYIEPEGIEDKHLPKTVDWRKKGAVTPVKNQGQCGS 165

Query: 146 ------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGIT 199
                   ++EG +  K  R+VSLSEQ LVDC+    NNGC GG MD+AFKYI  N GI 
Sbjct: 166 CWAFSTTGSLEGQHFRKTGRMVSLSEQNLVDCSGKFGNNGCEGGLMDNAFKYIKANGGID 225

Query: 200 NDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVAN-QPVSVAIDAS-- 256
            +  Y Y G + GIC   K+ D  A  T + D+P  +E+ L KAVA   PVSVAIDAS  
Sbjct: 226 TELSYPYNG-TDGICHFEKS-DVGATDTGFVDIPEGNEQLLKKAVATVGPVSVAIDASHE 283

Query: 257 ALQFYSGGVFN-GYCET-FLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRD 314
           + QFYS GV++   C +  L+HGV  VGYGT ++G  YWL+KNSWG  WG+DGY  + R+
Sbjct: 284 SFQFYSQGVYDEPECSSESLDHGVLVVGYGT-KDGQDYWLVKNSWGTTWGDDGYIYMTRN 342

Query: 315 IDQPQGQCGIAMFASFPV 332
               + QCGIA  AS+P+
Sbjct: 343 ---KENQCGIASSASYPL 357


>gi|226509942|ref|NP_001146834.1| cysteine protease precursor [Zea mays]
 gi|159506725|gb|ABW97700.1| cysteine protease [Zea mays]
 gi|414867308|tpg|DAA45865.1| TPA: cysteine protease [Zea mays]
          Length = 352

 Score =  228 bits (581), Expect = 3e-57,   Method: Compositional matrix adjust.
 Identities = 124/314 (39%), Positives = 182/314 (57%), Gaps = 18/314 (5%)

Query: 30  IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLT 89
           + ++F  W+A Y R+Y  + E  +RF++++ N+  +E  N A  GN +YTL  N+FADLT
Sbjct: 45  MMDRFLSWQATYNRSYPTAEERQRRFQVYRRNIEHIEATNRA--GNLTYTLGENQFADLT 102

Query: 90  PQEFIASQT--GFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQG-QCA- 145
            +EF+   T  G  +   +   +AN +     +   P SV+W  KGAVTP+K QG  C+ 
Sbjct: 103 EEEFLDLYTMKGMPVRRDAGKKRANVSSSA-AAVDAPTSVDWRSKGAVTPIKNQGPSCSS 161

Query: 146 ------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGIT 199
                  A +E I  I   +LVSLSEQ+L+DC   D   GC  G+  + ++++IQN G+T
Sbjct: 162 CWAFVTAATIESITKITTGKLVSLSEQELIDCDPYDG--GCNLGYFVNGYRWVIQNGGLT 219

Query: 200 NDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASALQ 259
            +A Y Y+      C   +A  HAA I++Y  +P  + +           +      +LQ
Sbjct: 220 TEANYPYQARRYA-CSRSRAAQHAATISDYVQLPAGEGQLQQAVAQQPVAAAIEMGGSLQ 278

Query: 260 FYSGGVFNGYCETFLNHGVTAVGYGT-SEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQP 318
           FYSGGVF+G C T +NH +T VGYG  S  G+KYWL+KNSWGQ WGE GY R++RD+ + 
Sbjct: 279 FYSGGVFSGQCGTRMNHAITVVGYGADSSSGLKYWLVKNSWGQSWGERGYLRMRRDVGRG 338

Query: 319 QGQCGIAMFASFPV 332
            G CGIA+  ++PV
Sbjct: 339 -GLCGIALDLAYPV 351


>gi|225719058|gb|ACO15375.1| Cathepsin L1 precursor [Caligus clemensi]
          Length = 326

 Score =  228 bits (581), Expect = 4e-57,   Method: Compositional matrix adjust.
 Identities = 142/347 (40%), Positives = 198/347 (57%), Gaps = 37/347 (10%)

Query: 1   MAKYFLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKD 60
           M K+FLI+ L   G+  S A + +        ++ +WKA +G+ Y  + E S RF+IF++
Sbjct: 1   MYKFFLILSL---GAFVSGAEFSS--------EWLKWKATHGKVYNSADEESLRFKIFQE 49

Query: 61  NLVAVERFNNA-AIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYK 119
           N + + + N     G  +Y L +N F DL   EF+    GF+         + G  F + 
Sbjct: 50  NSLMITQHNEEYRQGFHTYILGMNHFGDLLHSEFLERSNGFQGG------VSGGDVFTFD 103

Query: 120 S-SQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDC 171
           + + VP   NW  KGAVTPVK QG+C       A  +VEG   +K  +L+SLSEQQLVDC
Sbjct: 104 TNAPVPSYANWTAKGAVTPVKDQGKCGSCWAFSATGSVEGQIFLKKKKLMSLSEQQLVDC 163

Query: 172 ATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYED 231
           + ++ N GC GG MD+AFKY I NKGI N+  Y Y           K     A I++++D
Sbjct: 164 SGDEGNLGCGGGLMDNAFKYFIANKGIANEKSYPYTAKDNDC--KYKKSMSVATISSFKD 221

Query: 232 VPPNDEESLLKAVAN-QPVSVAIDASA--LQFYSGGV-FNGYCET-FLNHGVTAVGYGTS 286
           V   DE+ L  AVAN  PVSVAIDAS+   QFY  GV ++  C +  L+HGV AVGYGT 
Sbjct: 222 VKHKDEDQLKMAVANVGPVSVAIDASSSKFQFYESGVYYDENCSSEVLDHGVLAVGYGTD 281

Query: 287 EE-GIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
           ++ G+ +WL+KNSW   WG +GY ++ R+ D     CGIA  AS+P+
Sbjct: 282 KKSGMDFWLVKNSWAASWGLNGYIKMARNKDN---NCGIATMASYPI 325


>gi|225709022|gb|ACO10357.1| Cathepsin L precursor [Caligus rogercresseyi]
          Length = 332

 Score =  228 bits (580), Expect = 4e-57,   Method: Compositional matrix adjust.
 Identities = 136/319 (42%), Positives = 185/319 (57%), Gaps = 22/319 (6%)

Query: 28  GSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAI-GNRSYTLRLNKFA 86
           G +   +E WK  +G++Y+ S E   R +I  +N + + R N  AI G  SY +++N + 
Sbjct: 21  GVVLSDWESWKLTHGKSYESSIEEKLRLKIHMENSLKISRHNAEAINGKHSYYMKMNHYG 80

Query: 87  DLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQC-- 144
           DL   EF+A   G++  + +S     G+    K+ ++P  V+W E GAVTPVK QGQC  
Sbjct: 81  DLLHHEFVAMVNGYEYVNKTS---LGGSFIPSKNVKLPTHVDWREDGAVTPVKNQGQCGS 137

Query: 145 -----AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGIT 199
                +  ++EG    K  +L+ LSEQ LVDC+    NNGC GG MD AF YI  NKGI 
Sbjct: 138 CWAFSSTGSLEGQTFRKTGKLIPLSEQNLVDCSRKYGNNGCEGGLMDFAFTYIRDNKGID 197

Query: 200 NDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVAN-QPVSVAIDASAL 258
            +  Y YEG+  G C    ++  ++ I  + DV    EE LLKAVA+  PVSVAIDAS +
Sbjct: 198 TEGSYPYEGVG-GRCHYDPSKKGSSDI-GFVDVKKGSEEELLKAVASVGPVSVAIDASHM 255

Query: 259 --QFYSGGV-FNGYCE-TFLNHGVTAVGYGTSE-EGIKYWLIKNSWGQDWGEDGYFRLQR 313
             QFYS GV F   C    L+HGV  VGYGT E  G  YWL+KNSW ++WG+ GY ++ R
Sbjct: 256 SFQFYSHGVYFESKCSPENLDHGVLVVGYGTDENSGEDYWLVKNSWSENWGDQGYIKMAR 315

Query: 314 DIDQPQGQCGIAMFASFPV 332
           +    +  CGIA  AS+PV
Sbjct: 316 N---KKNMCGIASSASYPV 331


>gi|449530091|ref|XP_004172030.1| PREDICTED: vignain-like [Cucumis sativus]
          Length = 351

 Score =  228 bits (580), Expect = 5e-57,   Method: Compositional matrix adjust.
 Identities = 132/348 (37%), Positives = 208/348 (59%), Gaps = 30/348 (8%)

Query: 6   LIVVLIISGSCAS-QATYRTFD-EGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLV 63
           L+++  +   C S +   + F+ E S+ + +++W + + R  + + E   RF++FK+N  
Sbjct: 11  LVLIAFLCNICESFELERKDFESEKSLMQLYKRWSSHH-RISRNANEMHNRFKVFKNNAK 69

Query: 64  AVERFNNAAIGNRSYTLRLNKFADLTPQEF---IASQTGFKMSDHSSSLKANGTP---FL 117
            V + N   +  +S  L+LN+FAD++  EF    +S   +    H+  ++A G     F+
Sbjct: 70  HVFKVN---LMGKSLKLKLNQFADMSDDEFRNMYSSNITYYKDLHAKKIEATGGRIGGFM 126

Query: 118 YK-SSQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLV 169
           Y+ ++ +P S++W +KGAV  +K QG+C       AVAAVE I+ IK N LVSLSE++++
Sbjct: 127 YEHANNIPSSIDWRKKGAVNAIKNQGRCGSCWAFAAVAAVESIHQIKTNELVSLSEEEVL 186

Query: 170 DCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSY-EGMSTGICDSIKAEDHAAQITN 228
           DC   D   GC GGF + AF++++ N G+T +  Y Y EG   G C      +   +I  
Sbjct: 187 DCDYRDG--GCRGGFYNSAFEFMMDNDGVTIEDNYPYYEG--NGYCRRRGGRNKRVRIDG 242

Query: 229 YEDVPPNDEESLLKAVANQPVSVAI--DASALQFYSGGVF--NGYCETFLNHGVTAVGYG 284
           YE+VP N+E +L+KAVA+QPV+VAI    S  +FY GG+F  N +C   ++H V  VGYG
Sbjct: 243 YENVPRNNEYALMKAVAHQPVAVAIASGGSDFKFYGGGMFTENDFCGFNIDHTVVVVGYG 302

Query: 285 TSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
           T E+G  YW+I+N +G  WG +GY ++QR    PQG CG+AM  ++PV
Sbjct: 303 TDEDG-DYWIIRNQYGHRWGMNGYMKMQRGAHSPQGVCGMAMQPAYPV 349


>gi|156124996|gb|ABU50816.1| Ale o 1 allergen [Aleuroglyphus ovatus]
          Length = 337

 Score =  227 bits (578), Expect = 6e-57,   Method: Compositional matrix adjust.
 Identities = 139/322 (43%), Positives = 193/322 (59%), Gaps = 27/322 (8%)

Query: 27  EGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFN-NAAIGNRSYTLRLNKF 85
           EG +  +FEQ+K+ +GR Y        R  IF+ NL  + R N +   G+ ++++ +N F
Sbjct: 26  EGELEAQFEQFKSTFGRVYPSPEIELHRKSIFRANLQFILRHNIDYFNGDSTFSVSVNNF 85

Query: 86  ADLTPQEFIASQTGFKMSDHSS---SLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQG 142
            DL+ +EF A+  G++     S   S+ A+          +P +V+W  KG VTP+K Q 
Sbjct: 86  TDLSNEEFRATFNGYRRLAAVSLADSVHADN-----DVEALPATVDWTTKGVVTPIKNQQ 140

Query: 143 QC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQN 195
           QC       AVA++EG +A+K  +LVSLSEQ LVDC+  + + GC GG+MD AFKY+IQN
Sbjct: 141 QCGSCWAFSAVASMEGQHALKTGKLVSLSEQNLVDCSAAEGDMGCSGGWMDYAFKYVIQN 200

Query: 196 KGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVAN-QPVSVAID 254
           +GI  +A Y Y+ +    C+  K     A I ++ DV   DE +L  AVA+  P+SVAID
Sbjct: 201 RGIDTEASYPYKAIDES-CE-FKRNSIGATIHSFVDVKTGDESALQNAVASIGPISVAID 258

Query: 255 AS--ALQFYSGGVFNGY-CET-FLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFR 310
           AS  + QFYS GV+N   C T  L+HGVTAVGYGT   G+ YW +KNSWG  WG+ GY  
Sbjct: 259 ASQPSFQFYSSGVYNEPDCSTEILDHGVTAVGYGTL-NGVPYWKVKNSWGTSWGQKGYIF 317

Query: 311 LQRDIDQPQGQCGIAMFASFPV 332
           + R+    Q QCGIA  AS+PV
Sbjct: 318 MSRN---KQNQCGIATKASYPV 336


>gi|242020003|ref|XP_002430447.1| Cathepsin L precursor, putative [Pediculus humanus corporis]
 gi|212515585|gb|EEB17709.1| Cathepsin L precursor, putative [Pediculus humanus corporis]
          Length = 345

 Score =  227 bits (578), Expect = 7e-57,   Method: Compositional matrix adjust.
 Identities = 141/326 (43%), Positives = 187/326 (57%), Gaps = 29/326 (8%)

Query: 30  IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNA-AIGNRSYTLRLNKFADL 88
           + E+++ +KA++ + Y    E   R +IF DN   + + N     G   Y L LNK++D+
Sbjct: 23  VMEEWQLFKAEHKKNYNNDVEEKFRMKIFMDNKQKITKHNTKYQRGEVGYKLGLNKYSDM 82

Query: 89  TPQEFIASQTGFKMSDHSSSLKAN-GTPFLYKSSQVPPS-------VNWIEKGAVTPVKY 140
              EFI +  GF  S     L++N G   L  S  +PP+       V+W++ GAVTPVK 
Sbjct: 83  LHHEFINTFNGFNKSIIPPHLRSNNGKTHLKGSFFIPPANVKLPKHVDWVKLGAVTPVKD 142

Query: 141 QGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYII 193
           QG C       A  A+EG++  K   LVSLSEQ L+DC+T + NNGC GG MD AF+Y+ 
Sbjct: 143 QGHCGSCWAFSATGALEGLHFRKTKVLVSLSEQNLIDCSTEEGNNGCNGGLMDQAFQYVR 202

Query: 194 QNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVAN-QPVSVA 252
            N GI  +  Y YEG +  +C   + E+  A  T Y DVP  DE++L  AVA   PVSVA
Sbjct: 203 INGGIDTERSYPYEG-NNDVC-RYEPENSGAIDTGYTDVPLGDEDALKSAVATVGPVSVA 260

Query: 253 IDAS--ALQFYSGGV-FNGYCET---FLNHGVTAVGYGTSEEGIK-YWLIKNSWGQDWGE 305
           IDAS  + Q YS GV F   C+     L+HGV  VGYGT EE  + YWL+KNSWG  WGE
Sbjct: 261 IDASQESFQLYSSGVYFEPNCKNEPESLDHGVLVVGYGTDEETQQDYWLVKNSWGDSWGE 320

Query: 306 DGYFRLQRDIDQPQGQCGIAMFASFP 331
           +GY ++ R+ D    QCGIA   SFP
Sbjct: 321 NGYIKMARNADN---QCGIATQPSFP 343


>gi|170041165|ref|XP_001848344.1| cathepsin l [Culex quinquefasciatus]
 gi|167864709|gb|EDS28092.1| cathepsin l [Culex quinquefasciatus]
          Length = 340

 Score =  227 bits (578), Expect = 7e-57,   Method: Compositional matrix adjust.
 Identities = 135/322 (41%), Positives = 184/322 (57%), Gaps = 24/322 (7%)

Query: 30  IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNA-AIGNRSYTLRLNKFADL 88
           + E++  +K Q+ + Y    E   R +I+  N   + + N     G   + LR+NK+ DL
Sbjct: 23  VKEEWNAYKLQHRKKYDSETEERLRLKIYVQNKHKIAKHNQRFEQGQEKFRLRVNKYTDL 82

Query: 89  TPQEFIASQTGFKMSDHSSSLKAN---GTPFLY---KSSQVPPSVNWIEKGAVTPVKYQG 142
             +EF+ +  GF  ++    +        P  Y    + +VP +V+W EKGAVTPVK QG
Sbjct: 83  LHEEFVQTLNGFNRTNAKKPMLKGVKIDEPVTYIEPANVEVPKTVDWREKGAVTPVKDQG 142

Query: 143 QC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQN 195
            C       A  A+EG +  K  +LVSLSEQ LVDC+T   NNGC GG MD AF+YI  N
Sbjct: 143 HCGSCWSFSATGALEGQHFRKTGKLVSLSEQNLVDCSTKYGNNGCNGGMMDFAFQYIKDN 202

Query: 196 KGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQ-PVSVAID 254
            GI  +  Y YE +      + KA    A    + D+P  DE++L+KA+A   PVSVAID
Sbjct: 203 GGIDTEKAYPYEAIDDTCHYNPKAV--GATDKGFVDIPQGDEKALMKAIATAGPVSVAID 260

Query: 255 AS--ALQFYSGGV-FNGYCETF-LNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFR 310
           AS  + QFYS GV +   C++  L+HGV AVGYGTSEEG  YWL+KNSWG  WG+ GY +
Sbjct: 261 ASHESFQFYSEGVYYEPQCDSENLDHGVLAVGYGTSEEGEDYWLVKNSWGTTWGDQGYVK 320

Query: 311 LQRDIDQPQGQCGIAMFASFPV 332
           + R+ D     CGIA  AS+P+
Sbjct: 321 MARNRDN---HCGIATAASYPL 339


>gi|290462225|gb|ADD24160.1| Cathepsin L [Lepeophtheirus salmonis]
          Length = 334

 Score =  227 bits (578), Expect = 7e-57,   Method: Compositional matrix adjust.
 Identities = 142/340 (41%), Positives = 194/340 (57%), Gaps = 26/340 (7%)

Query: 7   IVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVE 66
           I++L +  S AS  ++  FD   +   +E WK  + + Y  S E   R +IF +N + + 
Sbjct: 6   ILLLSVIISTASAVSF--FD--VVLSDWESWKLTHQKGYDSSVEEKLRLKIFMENSLRIS 61

Query: 67  RFNNAAI-GNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPP 125
           R N  AI G  +Y +++N + DL   EF+A   G+  ++ ++     GT    K+  +P 
Sbjct: 62  RHNAEAIQGRHTYFMKMNHYGDLLHHEFVAMVNGYIYNNKTT---LGGTFIPSKNINLPE 118

Query: 126 SVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNN 178
            V+W E+GAVTPVK QGQC       A  ++EG +  K  +L+SLSEQ LVDC+    NN
Sbjct: 119 HVDWREEGAVTPVKNQGQCGSCWSFSATGSLEGQDFRKTGKLISLSEQNLVDCSRKYGNN 178

Query: 179 GCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEE 238
           GC GG MD AFKYI  N GI  +A Y YEG+  G C         + I  + D+    E+
Sbjct: 179 GCEGGLMDYAFKYIQDNNGIDTEASYPYEGID-GHCHYDPKNKGGSDI-GFVDIKKGSEK 236

Query: 239 SLLKAVAN-QPVSVAIDASAL--QFYSGGVFN-GYCE-TFLNHGVTAVGYGTSE-EGIKY 292
            L KA+A   P+SVAIDAS +  QFYS GV++   C    L+HGV AVGYGT E  G  Y
Sbjct: 237 DLQKALATVGPISVAIDASHMSFQFYSHGVYSEKKCSPENLDHGVLAVGYGTDEVTGEDY 296

Query: 293 WLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
           WL+KNSW + WGEDGY ++ R+ D     CGIA  AS+PV
Sbjct: 297 WLVKNSWSEKWGEDGYIKMARNKDN---MCGIASSASYPV 333


>gi|302790836|ref|XP_002977185.1| hypothetical protein SELMODRAFT_106228 [Selaginella moellendorffii]
 gi|300155161|gb|EFJ21794.1| hypothetical protein SELMODRAFT_106228 [Selaginella moellendorffii]
          Length = 299

 Score =  227 bits (578), Expect = 8e-57,   Method: Compositional matrix adjust.
 Identities = 124/310 (40%), Positives = 186/310 (60%), Gaps = 22/310 (7%)

Query: 34  FEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEF 93
           FE W A++G++Y   +E ++R  IF D L  +E+ N  A  N ++TL LNKF+DLT  EF
Sbjct: 2   FEDWAAKHGKSYSSDSEKARRLMIFSDTLAYIEKHN--AQPNTTFTLGLNKFSDLTNAEF 59

Query: 94  IASQTG-FKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQC-------A 145
            A+  G FK   +     A         S +P S++W ++GAVTP+K QGQC       A
Sbjct: 60  RANYVGKFKSPRYQDRRPAKDVDV--DVSSLPTSLDWRQEGAVTPIKDQGQCGSCWAFSA 117

Query: 146 VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYS 205
           +A++E  + +    LVSLSEQQL+DC T D   GC GGF +DAFK++++N G+T +  Y 
Sbjct: 118 IASIESAHFLATKELVSLSEQQLIDCDTVDQ--GCQGGFPEDAFKFVVENGGVTTEEAYP 175

Query: 206 YEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASALQF--YSG 263
           Y G + G C++ K  +   +IT Y+DV  +  ++L+KAV+  PV+V I  S   F  Y  
Sbjct: 176 YTGFA-GSCNANK--NKVVEITGYKDVTKDSADALMKAVSKTPVTVGICGSDQNFQNYRS 232

Query: 264 GVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCG 323
           G+ +G C    +H V  +GYGT E G+ YW+IKNSWG  WGE+G+ ++++     +G CG
Sbjct: 233 GILSGQCSNSRDHAVLVIGYGT-EGGMPYWIIKNSWGTSWGENGFMKIKK--KDGEGMCG 289

Query: 324 IAMFASFPVS 333
           +   +S+P +
Sbjct: 290 MNGQSSYPTT 299


>gi|27728675|gb|AAO18731.1| cysteine protease [Gossypium hirsutum]
          Length = 389

 Score =  226 bits (577), Expect = 9e-57,   Method: Compositional matrix adjust.
 Identities = 132/322 (40%), Positives = 187/322 (58%), Gaps = 19/322 (5%)

Query: 27  EGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNR-SYTLRLNKF 85
           E  + E F+QWK ++ + Y+ + E  KRFE FK NL  +   N     N+  + + LNKF
Sbjct: 42  EERVLEIFQQWKEKHRKVYRHAEEAEKRFENFKGNLKYILERNAKRKANKWEHHVGLNKF 101

Query: 86  ADLTPQEFI-ASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQC 144
           AD++ +EF  A  +  K   +     +       +S   P S++W   G VT VK QG C
Sbjct: 102 ADMSNEEFRKAYLSKVKKPINKGITLSRNMRRKVQSCDAPSSLDWRNYGVVTAVKDQGSC 161

Query: 145 -------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKG 197
                  +  A+EGINA+    L+SLSEQ+LV+C T+  N GC GG+MD AF+++I N G
Sbjct: 162 GSCWAFSSTGAMEGINALVTGDLISLSEQELVECDTS--NYGCEGGYMDYAFEWVINNGG 219

Query: 198 ITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA 257
           I +++ Y Y G+  G C++ K E     I  Y+DV  +D  +LL AVA QPVSV ID SA
Sbjct: 220 IDSESDYPYTGVD-GTCNTTKEETKVVSIDGYQDVEQSDS-ALLCAVAQQPVSVGIDGSA 277

Query: 258 L--QFYSGGVFNGYCETF---LNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQ 312
           +  Q Y+GG+++G C      ++H V  VGYG SE+  +YW++KNSWG  WG DGYF L+
Sbjct: 278 IDFQLYTGGIYDGSCSDDPDDIDHAVLIVGYG-SEDSEEYWIVKNSWGTSWGIDGYFYLK 336

Query: 313 RDIDQPQGQCGIAMFASFPVSK 334
           RD D P G C +   AS+P  +
Sbjct: 337 RDTDLPYGVCAVNAMASYPTKQ 358


>gi|162463334|ref|NP_001104878.1| maize insect resistance2 precursor [Zea mays]
 gi|2425064|gb|AAB88262.1| cysteine proteinase Mir2 [Zea mays]
          Length = 493

 Score =  226 bits (577), Expect = 1e-56,   Method: Compositional matrix adjust.
 Identities = 129/294 (43%), Positives = 173/294 (58%), Gaps = 15/294 (5%)

Query: 53  KRFEIFKDNLVAVERFN-NAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKA 111
           +R E+F+DNL  ++  N  A  G   + L L +FADLT +E+ A         + +++  
Sbjct: 91  RRLEVFRDNLRYIDAHNAEADAGLHGFRLGLTRFADLTLEEYRARLLLGSRGRNGTAVGV 150

Query: 112 NGTP--FLYKSSQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVS 162
            G          Q+P +V+W E+GAV  VK QGQC       AVAAVEGIN I    L+S
Sbjct: 151 VGRRRYLPLAGEQLPDAVDWRERGAVAEVKDQGQCGGCWAFSAVAAVEGINKIVTGSLIS 210

Query: 163 LSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDH 222
           LSEQ+L+DC     + GC GG MD+AF ++I+N GI  +A Y + G   G CD       
Sbjct: 211 LSEQELIDC-DKFQDQGCDGGLMDNAFVFMIKNGGIDTEADYPFTGHD-GTCDLKLKNTR 268

Query: 223 AAQITNYEDVPPNDEESLLKAVANQPVSVAIDAS--ALQFYSGGVFNGYCETFLNHGVTA 280
              I ++E VP N E +L KAVA+QPVS +I+AS  A Q YS G+F+G C T+L+HGVT 
Sbjct: 269 VVSIDSFERVPINYERALQKAVAHQPVSASIEASRRAFQLYSSGIFDGRCGTYLDHGVTV 328

Query: 281 VGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVSK 334
           VGYG SE G  YW++KNSWG  WGE GY R+ R++       GIAM   +PV +
Sbjct: 329 VGYG-SEGGKDYWIVKNSWGTQWGEAGYVRMARNVRVRPPSAGIAMEPLYPVKE 381


>gi|326503122|dbj|BAJ99186.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326512552|dbj|BAJ99631.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 389

 Score =  226 bits (576), Expect = 1e-56,   Method: Compositional matrix adjust.
 Identities = 133/332 (40%), Positives = 183/332 (55%), Gaps = 35/332 (10%)

Query: 30  IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFN-NAAIGNRSYTLRLNKFADL 88
           +  +F  W     R+Y  S+E + RF++++ N+  +E  N  A     +Y L    F DL
Sbjct: 56  MMARFHVWMTVQNRSYPTSSEKAHRFKVYRSNMRYIEALNAEATTSGFTYELGEGPFTDL 115

Query: 89  TPQEFIASQTGFKMSD----------------HSSSLKANGTPFLYK--SSQVPPSVNWI 130
           T +EFI+  TG K+ D                H+ S+       +Y   S+  P  ++W 
Sbjct: 116 TDEEFISLYTG-KIPDDDHREDGVHDEQIITTHAGSVNGAEGVTVYANFSAGAPIRMDWR 174

Query: 131 EKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGG 183
           ++GAVTPVK QG+C        VA +EGI+ IK  RLVSLSEQQLVDC   D   GC GG
Sbjct: 175 KRGAVTPVKDQGKCGSCWAFPTVATIEGIHKIKRGRLVSLSEQQLVDCDFLDG--GCNGG 232

Query: 184 FMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKA 243
           +  +AF++IIQN GIT  + Y+Y+  + G C   +    AA+IT Y  V  N E S++  
Sbjct: 233 WPRNAFQWIIQNGGITTTSSYTYKA-AEGQCKGNRKP--AAKITGYRKVKSNSEVSMVNI 289

Query: 244 VANQPV--SVAIDASALQFYSGGVFNGYCETF-LNHGVTAVGYGTSEEGIKYWLIKNSWG 300
           VANQP+  S+ +     Q Y GG++NG C T  LNH +T VGYG    G KYW++KNSWG
Sbjct: 290 VANQPIAASIVVHGGQFQHYKGGIYNGPCATSKLNHVITIVGYGQQAYGAKYWIVKNSWG 349

Query: 301 QDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
             WG  GY  ++R    P GQCGIA+   FP+
Sbjct: 350 AAWGNKGYMLMKRGTKNPLGQCGIAVRPIFPL 381


>gi|242093944|ref|XP_002437462.1| hypothetical protein SORBIDRAFT_10g027570 [Sorghum bicolor]
 gi|241915685|gb|EER88829.1| hypothetical protein SORBIDRAFT_10g027570 [Sorghum bicolor]
          Length = 366

 Score =  226 bits (576), Expect = 1e-56,   Method: Compositional matrix adjust.
 Identities = 147/366 (40%), Positives = 202/366 (55%), Gaps = 42/366 (11%)

Query: 1   MAKYFLIVVLIIS-GSCASQATYRTFD---EGSIAEKFEQWKAQYGRTYKESAENSKRFE 56
           MA   ++V + +S    AS   Y   D   E S+   +E+W A Y    ++  E ++RF+
Sbjct: 11  MAATLVVVGMALSIAPVASAIDYTERDLASEESLWALYERWCAHYNMA-RDHGEKTRRFD 69

Query: 57  IFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQ-----TGFKMSD------- 104
           +FK+N   +   N+   GN +YTL LN+F+D+T +EF  S      T  +MSD       
Sbjct: 70  LFKENARRIYEHNHQ--GNATYTLGLNRFSDMTDEEFNRSPYGGCLTAPRMSDDEIEELH 127

Query: 105 -HSSSLKANGTPFLYKSSQ-----VPPSVNWIEKGAVTPVKYQGQC--------AVAAVE 150
            H    + +G+  L   S       PP+V+W  + AVT VK QG          A+AAVE
Sbjct: 128 HHHHQQEDDGSFNLTHGSGGGKLGAPPAVDWRGR-AVTRVKDQGPTCGSCWAFSAIAAVE 186

Query: 151 GINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMS 210
           GINAI+   LV LSEQQLVDC  +  N+GC GG M  AF ++++N+G+  +  Y Y G  
Sbjct: 187 GINAIRTRNLVPLSEQQLVDC--DKLNHGCNGGLMTTAFSFVVRNRGVVPEGAYPYMGRE 244

Query: 211 TGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASALQF--YSGGVFNG 268
            G C  + A      I  Y+ VP  D  +L+ AVA QPVSVAI+AS+ +F  Y GGVFNG
Sbjct: 245 -GRCKHVMAP--PVTIYGYQRVPRFDANALMNAVAAQPVSVAIEASSFEFRHYQGGVFNG 301

Query: 269 YCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFA 328
            C   L H  TAVGYG ++ G  +W++KNSWG  WGE GY R+ R+    QG CGI    
Sbjct: 302 NCGGRLGHAATAVGYG-ADAGGPFWIVKNSWGPGWGEGGYVRISRNTPVRQGVCGILTEN 360

Query: 329 SFPVSK 334
           S+PV +
Sbjct: 361 SYPVKR 366


>gi|449673497|ref|XP_002169904.2| PREDICTED: cathepsin L-like [Hydra magnipapillata]
          Length = 325

 Score =  226 bits (576), Expect = 1e-56,   Method: Compositional matrix adjust.
 Identities = 137/313 (43%), Positives = 183/313 (58%), Gaps = 35/313 (11%)

Query: 37  WKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIAS 96
           WK  + + Y   +E + R+ I+KDN+  +  +N+ +   ++  LR+N F D+T  EF A 
Sbjct: 30  WKMAHNKAYSHESEENVRYAIWKDNMNRITEYNSKS---KNVILRMNHFGDMTNTEFRAK 86

Query: 97  QTGFKMSDHSSSLKANGTPFLYKS-SQVPPSVNWIEKGAVTPVKYQGQC-------AVAA 148
             G  +  H      NG+ FL  S +  P +V+W  +G VTPVK QGQC       +  A
Sbjct: 87  MNGLLLHKHQ-----NGSTFLVPSHTAAPDAVDWRSEGYVTPVKNQGQCGSCWAFSSTGA 141

Query: 149 VEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEG 208
           +EG +  K  RLVSLSEQ LVDC+T+  NNGC GG MD+AF YI  N GI  +  Y YEG
Sbjct: 142 LEGQHFKKTGRLVSLSEQNLVDCSTDYGNNGCNGGLMDNAFSYIKANGGIDTETGYPYEG 201

Query: 209 MSTGIC----DSIKAEDHAAQITNYEDVPPNDEESLLKAVAN-QPVSVAIDASAL--QFY 261
              G C     SI A+D     T + D+P  DE++L +AVA   PVSVAIDAS +  QFY
Sbjct: 202 QD-GTCRYSKSSIGADD-----TGFVDIPEGDEDALKQAVATVGPVSVAIDASHMSFQFY 255

Query: 262 SGGVFN-GYCE-TFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQ 319
             GV++   C  + L+HGV  VGYGT + G  YWL+KNSWG  WG +GY  + R+    Q
Sbjct: 256 HSGVYDEPQCSPSALDHGVLVVGYGT-DNGKDYWLVKNSWGTGWGTEGYIYMSRN---NQ 311

Query: 320 GQCGIAMFASFPV 332
            QCGIA  AS+P+
Sbjct: 312 NQCGIASKASYPL 324


>gi|333069454|gb|AEF13978.1| chymopapain [Carica papaya]
          Length = 352

 Score =  226 bits (576), Expect = 1e-56,   Method: Compositional matrix adjust.
 Identities = 139/344 (40%), Positives = 186/344 (54%), Gaps = 25/344 (7%)

Query: 5   FLIVVLIISGSCAS----QATYRTFDEGSIA---EKFEQWKAQYGRTYKESAENSKRFEI 57
           FL   LII  S +S       Y   D  SI    + F+ W  ++ + Y+   E   RFEI
Sbjct: 12  FLATCLIIHMSLSSADFYTVGYSQDDLTSIERLIQLFDSWMLKHNKIYESIDEKIYRFEI 71

Query: 58  FKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFL 117
           F+DNL+ ++  N     N SY L LN FADL+  EF     G    D +     +   F 
Sbjct: 72  FRDNLMYIDETNKK---NNSYWLGLNGFADLSNDEFKKKYVGSVAEDFTGLEHFDNEDFT 128

Query: 118 YKS-SQVPPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLV 169
           YK  +  P S++W  KGAVTPVK QG C        +A VEG+N I    L+ LSEQ+LV
Sbjct: 129 YKHVTNYPQSIDWRAKGAVTPVKNQGSCGSCWAFSTIATVEGVNKIVTGNLLELSEQELV 188

Query: 170 DCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNY 229
           DC  + N++GC GG+   + +Y+  N G+    VY Y+  +   C +        +IT Y
Sbjct: 189 DC--DKNSHGCKGGYQTTSLQYVADN-GVHTSKVYPYQAKAMQ-CRATDKPGPKVKITGY 244

Query: 230 EDVPPNDEESLLKAVANQPVSVAIDASA--LQFYSGGVFNGYCETFLNHGVTAVGYGTSE 287
           + VP N E S L A+ANQP+SV ++A     Q Y  GVF+G C T L+H VTAVGYGTS+
Sbjct: 245 KRVPSNCETSFLGALANQPLSVLVEAGGKPFQLYKSGVFDGPCGTKLDHAVTAVGYGTSD 304

Query: 288 EGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFP 331
            G  Y +IKNSWG +WGE GY RL+R     QG CG+   + +P
Sbjct: 305 -GKNYIIIKNSWGPNWGEKGYMRLKRQSGNSQGTCGVYKSSYYP 347


>gi|125564712|gb|EAZ10092.1| hypothetical protein OsI_32402 [Oryza sativa Indica Group]
          Length = 382

 Score =  226 bits (575), Expect = 2e-56,   Method: Compositional matrix adjust.
 Identities = 135/339 (39%), Positives = 182/339 (53%), Gaps = 39/339 (11%)

Query: 29  SIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADL 88
           ++ E F++WKA+Y R+Y    E  +R  ++  N+  +E  N AA    +Y L    + DL
Sbjct: 47  TMMEMFQRWKAEYNRSYATPEEERRRLRVYARNVRYIEATNAAA--GLAYELGETAYTDL 104

Query: 89  TPQEFIASQTGFKMSDHSSS----------------LKANGTPFLY--KSSQVPPSVNWI 130
           T  EF+A  T   +   +                  +  +  P +Y  +S+  P SV+W 
Sbjct: 105 TNDEFMAMYTAPPLRSAADDDDDAATTTIITTRAGPVDEHQQPEVYFNESAGAPASVDWR 164

Query: 131 EKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGG 183
             GAVT VK QG+C        VA VEGI  IK  +LVSLSEQ+LVDC T D+  GC GG
Sbjct: 165 ASGAVTEVKDQGRCGSCWAFSTVAVVEGIQKIKKGKLVSLSEQELVDCDTLDS--GCDGG 222

Query: 184 FMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKA 243
               A ++I  N GIT    Y Y G +   CD  K   HAA I     V    E SL  A
Sbjct: 223 VSYRALEWITANGGITTRDDYPYTGAAAAACDRAKLGHHAATIAGLRRVATRSEASLQNA 282

Query: 244 VANQPVSVAIDASA--LQFYSGGVFNGYCETFLNHGVTAVGYGTSE-------EGIKYWL 294
            A QPV+V+I+A     Q Y  GV++G C T LNHGVT VGYG  E        G KYW+
Sbjct: 283 AAAQPVAVSIEAGGDNFQHYRKGVYDGPCGTRLNHGVTVVGYGQEEAPVDGSAAGDKYWI 342

Query: 295 IKNSWGQDWGEDGYFRLQRDI-DQPQGQCGIAMFASFPV 332
           IKNSWG++WG+ GY ++++D+  +P+G CGIA+  SFP+
Sbjct: 343 IKNSWGKNWGDQGYIKMKKDVAGKPEGLCGIAIRPSFPL 381


>gi|326497561|dbj|BAK05870.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 340

 Score =  226 bits (575), Expect = 2e-56,   Method: Compositional matrix adjust.
 Identities = 138/339 (40%), Positives = 199/339 (58%), Gaps = 28/339 (8%)

Query: 7   IVVLIISGSCAS--QATYRTFDEGSI--AEKFEQWKAQYGRTYKESAENSKRFEIFKDNL 62
           I+VL+  G  A+   A+    D G +   ++F QW+A + R+Y  + E  +RFE+++ N+
Sbjct: 14  ILVLLTGGLFAAFPAASGGRVDAGDMLMMDRFRQWQATHNRSYLSAEERLRRFEVYRTNV 73

Query: 63  VAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKM-SDHSSSLKANGTPFLYKSS 121
             ++  N    G  +Y L  N+FADLT +EF+A   G    S  +++ +A+G+      +
Sbjct: 74  EYIDATNRR--GGLTYELGENQFADLTGEEFLARYAGGHTGSAITTAAEADGS----LEA 127

Query: 122 QVPPSVNWIEKGAVTPVKYQG-QC-------AVAAVEGINAIKINRLVSLSEQQLVDCAT 173
             P SV+W  KGAVTPVK QG QC       AVA +E +  IK  +LV+LSEQQLVDC  
Sbjct: 128 DPPASVDWRAKGAVTPVKNQGSQCYSCWAFSAVATMESLYFIKTGKLVALSEQQLVDC-- 185

Query: 174 NDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVP 233
           +  + GC  G+   AF++I++N GIT  A Y Y+ +  G C + K    A  IT +  V 
Sbjct: 186 DKYDGGCNKGYYHRAFQWIMENGGITTAAQYPYKAVR-GACSAAKP---AVTITGHLAVA 241

Query: 234 PNDEESLLKAVANQPVSVAIDAS-ALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKY 292
            N E +L  AVA QP+ VAI+   ++QFY  GVF+  C   ++H V  VGYG    G+KY
Sbjct: 242 KN-ELALQSAVARQPIGVAIEVPISMQFYKSGVFSAACGIQMSHAVVTVGYGADASGLKY 300

Query: 293 WLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFP 331
           WL+KNSWGQ WGE GY R++RD+    G CGIA+  ++P
Sbjct: 301 WLVKNSWGQTWGEAGYIRMRRDVGG-GGLCGIALDTAYP 338


>gi|22653679|sp|Q26636.1|CATL_SARPE RecName: Full=Cathepsin L; Contains: RecName: Full=Cathepsin L
           heavy chain; Contains: RecName: Full=Cathepsin L light
           chain; Flags: Precursor
 gi|505140|dbj|BAA03970.1| cathepsin L precursor [Sarcophaga peregrina]
          Length = 339

 Score =  225 bits (574), Expect = 2e-56,   Method: Compositional matrix adjust.
 Identities = 140/322 (43%), Positives = 181/322 (56%), Gaps = 26/322 (8%)

Query: 30  IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNA-AIGNRSYTLRLNKFADL 88
           I E++  +K Q+ + Y    E   R +IF +N   + + N   A G  SY L LNK+AD+
Sbjct: 24  IKEEWHTYKLQHRKNYANEVEERFRMKIFNENRHKIAKHNQLFAQGKVSYKLGLNKYADM 83

Query: 89  TPQEFIASQTGFK------MSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQG 142
              EF  +  G+       M + +  + A   P  + +  VP SV+W E GAVT VK QG
Sbjct: 84  LHHEFKETMNGYNHTLRQLMRERTGLVGATYIPPAHVT--VPKSVDWREHGAVTGVKDQG 141

Query: 143 QC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQN 195
            C       +  A+EG +  K   LVSLSEQ LVDC+T   NNGC GG MD+AF+YI  N
Sbjct: 142 HCGSCWAFSSTGALEGQHFRKAGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKDN 201

Query: 196 KGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQ-PVSVAID 254
            GI  +  Y YEG+    C   KA    A  T + D+P  DEE + KAVA   PVSVAID
Sbjct: 202 GGIDTEKSYPYEGIDDS-CHFNKATI-GATDTGFVDIPEGDEEKMKKAVATMGPVSVAID 259

Query: 255 AS--ALQFYSGGVFN-GYC-ETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFR 310
           AS  + Q YS GV+N   C E  L+HGV  VGYGT E G+ YWL+KNSWG  WGE GY +
Sbjct: 260 ASHESFQLYSEGVYNEPECDEQNLDHGVLVVGYGTDESGMDYWLVKNSWGTTWGEQGYIK 319

Query: 311 LQRDIDQPQGQCGIAMFASFPV 332
           + R+ +    QCGIA  +S+P 
Sbjct: 320 MARNQNN---QCGIATASSYPT 338


>gi|219884655|gb|ACL52702.1| unknown [Zea mays]
 gi|413916718|gb|AFW56650.1| thiol protease SEN102 [Zea mays]
          Length = 349

 Score =  225 bits (574), Expect = 2e-56,   Method: Compositional matrix adjust.
 Identities = 135/321 (42%), Positives = 192/321 (59%), Gaps = 23/321 (7%)

Query: 30  IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLT 89
           + ++F+ W+A+Y RTY    E  +RF ++ +N+  +E  N       SY L  N+FADLT
Sbjct: 33  LLDRFQAWQAEYNRTYATPEEFQQRFMVYSENVKFIETMNQPG---SSYELGENQFADLT 89

Query: 90  PQEFIASQTGFKMSDHSSSLKA----------NGTPFLYKSSQVPPSVNWIEKGAVTPVK 139
            +EF  +    K+ + +SS +A           GT     +++ P SV+W  KGAVTPVK
Sbjct: 90  EEEFKDTYL-MKLDNVASSPEAMALTVDTMNRAGTSGGSNTNEAPNSVDWRTKGAVTPVK 148

Query: 140 YQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYI 192
            Q  C       AVA++EG++ IK  RLVSLSEQ++VDC    NN+GC+GG    A +++
Sbjct: 149 SQQHCGSCWAFAAVASIEGVHKIKTGRLVSLSEQEIVDCDRGGNNHGCHGGHSSSAMEWV 208

Query: 193 IQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVA 252
            +N G+T ++ Y Y G   G C S K   HAA+I   + V   +E +L  AVA +PV+V+
Sbjct: 209 TRNGGLTTESDYPYVGRQ-GQCMSDKLGHHAAKIRGRQAVQGKNEGALQHAVAGRPVAVS 267

Query: 253 IDAS-ALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRL 311
           I+AS A QFY  G+F+G C T  NH VT VGYG +  G KYW++KNSWG+ WGE GY R+
Sbjct: 268 INASRAFQFYKRGIFSGPCNTTRNHAVTVVGYGANASGHKYWIVKNSWGERWGEKGYVRM 327

Query: 312 QRDIDQPQGQCGIAMFASFPV 332
           QR +   +G CGIA+   + V
Sbjct: 328 QRGVRAREGVCGIAIAPFYAV 348


>gi|357477225|ref|XP_003608898.1| Cysteine proteinase, partial [Medicago truncatula]
 gi|355509953|gb|AES91095.1| Cysteine proteinase, partial [Medicago truncatula]
          Length = 260

 Score =  225 bits (573), Expect = 2e-56,   Method: Compositional matrix adjust.
 Identities = 130/274 (47%), Positives = 166/274 (60%), Gaps = 33/274 (12%)

Query: 81  RLNKFADLTPQEFIASQTGFKMSDHS--SSLKANGTPFLYKSSQ-VPPSVNWIEKGAVTP 137
           +LNKFAD+T  EF +     K++ H     +  +  PF+Y++ + VP S++W + GAVT 
Sbjct: 1   KLNKFADMTNYEFRSIYADSKVNHHRMFRGMSHDNGPFMYENVEGVPSSIDWRKIGAVTG 60

Query: 138 VKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFK 190
           VK QGQC        + AVEGIN IK  +LVSLSEQ+LVDC T + N GC GG M+ AF+
Sbjct: 61  VKDQGQCGSCWAFSTIVAVEGINQIKTQKLVSLSEQELVDCDT-EVNQGCNGGLMEYAFE 119

Query: 191 YIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVS 250
           +I QN GIT +  Y Y     G C+  K    A  I  +E+VP N+E++LLKA ANQP+S
Sbjct: 120 FIKQN-GITTETNYPY-AAKDGTCNIQKENKPAVSIDGHENVPANNEKALLKAAANQPIS 177

Query: 251 VAIDA--SALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGY 308
           VAIDA  S  QFYS GVF G+C T LNHGV                  NSWG +WGE GY
Sbjct: 178 VAIDAGGSDFQFYSEGVFTGHCGTELNHGV------------------NSWGSEWGEQGY 219

Query: 309 FRLQRDIDQPQGQCGIAMFASFPVSKESAQPSSA 342
            R+QR I   QG CGIAM AS+P+ K S  P+ +
Sbjct: 220 IRMQRAISHKQGLCGIAMEASYPIKKSSKNPTKS 253


>gi|154183745|gb|ABS70713.1| cathepsin L-like cysteine proteinase [Dermacentor variabilis]
          Length = 333

 Score =  225 bits (573), Expect = 3e-56,   Method: Compositional matrix adjust.
 Identities = 141/345 (40%), Positives = 196/345 (56%), Gaps = 35/345 (10%)

Query: 5   FLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVA 64
            L   ++++ + +S    RT        ++E +KA + ++Y+ + E   RF+IF +N + 
Sbjct: 6   LLCAFVVVTTAASSHEILRT--------QWEAFKATHKKSYQSNMEELLRFKIFSENSLL 57

Query: 65  VERFNNA-AIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKS--- 120
           V R N   A G  SY L +N+F DL P EF     G++     +     G+ FL  +   
Sbjct: 58  VARHNEKYARGLVSYKLGMNQFGDLLPHEFARMFNGYR----GARTAGRGSTFLPPANVN 113

Query: 121 -SQVPPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCA 172
            S +P S++W EKGAVTPVK QGQC          ++EG + +K   LVSLSEQ LVDC+
Sbjct: 114 YSSLPQSMDWREKGAVTPVKNQGQCGSCWAFSTTGSLEGQHFLKTGVLVSLSEQNLVDCS 173

Query: 173 TNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDV 232
               N+GC GG MD+AF+YI  N GI  +  Y YE    G C   K ++  A  T + D+
Sbjct: 174 ETFGNHGCEGGLMDNAFQYIKANGGIDTEKSYPYEA-EDGEC-RFKKQNVGATDTGFVDI 231

Query: 233 PPNDEESLLKAVAN-QPVSVAIDA--SALQFYSGGVFN-GYCETF-LNHGVTAVGYGTSE 287
               E+ L KAVA   PVSVAIDA  S+ Q YS GV++   C +  L+HGV  VGYG  E
Sbjct: 232 EQGSEDDLKKAVATVGPVSVAIDASHSSFQLYSEGVYDETECSSEQLDHGVLVVGYGV-E 290

Query: 288 EGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
           +G KYWL+KNSW + WG++GY ++ RD D    QCGIA  AS+P+
Sbjct: 291 DGKKYWLVKNSWAESWGDNGYIKMSRDKDN---QCGIASAASYPL 332


>gi|320169658|gb|EFW46557.1| cathepsin L2 [Capsaspora owczarzaki ATCC 30864]
          Length = 324

 Score =  225 bits (573), Expect = 3e-56,   Method: Compositional matrix adjust.
 Identities = 131/311 (42%), Positives = 177/311 (56%), Gaps = 21/311 (6%)

Query: 34  FEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEF 93
           F+ WKA +G +Y    E + R  I++ NL  +E+ N+      SY L +NKFADLT  EF
Sbjct: 22  FDSWKATHGVSYATVGEETARRGIYRANLDFIEKHNSEG---HSYKLAVNKFADLTYPEF 78

Query: 94  IASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQCA-------V 146
            A   G +    +++     + +L +   +P SV+W   G VTP+K QGQC         
Sbjct: 79  AAKYLGLRFDATNATKSFAASTYLPRMVSLPDSVDWRTAGIVTPIKDQGQCGSCWSFSTT 138

Query: 147 AAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSY 206
            +VEG +A K  +LVSLSEQ LVDC++   N GC GG MD AF+YII N GI  ++ Y Y
Sbjct: 139 GSVEGQHARKTGQLVSLSEQNLVDCSSAQGNAGCNGGLMDQAFQYIISNNGIDTESSYPY 198

Query: 207 EGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVAN-QPVSVAIDAS--ALQFYSG 263
                G C    + +  A + +Y+D+    E  L  AVA   P+SVAIDAS  + QFYS 
Sbjct: 199 TAQD-GTCQ-FNSANVGATVASYQDIASGSESDLQNAVATVGPISVAIDASQPSFQFYSS 256

Query: 264 GVFN--GYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQ 321
           GV+N      + L+HGV AVGYGTS     YWL+KNSWG  WG+ GY  + R+ +    Q
Sbjct: 257 GVYNEPACSSSQLDHGVLAVGYGTSGSS-DYWLVKNSWGTSWGQSGYIWMTRNSNN---Q 312

Query: 322 CGIAMFASFPV 332
           CGIA  AS+P+
Sbjct: 313 CGIATAASYPL 323


>gi|345320664|ref|XP_001521690.2| PREDICTED: cathepsin L1-like [Ornithorhynchus anatinus]
          Length = 388

 Score =  224 bits (572), Expect = 3e-56,   Method: Compositional matrix adjust.
 Identities = 138/344 (40%), Positives = 194/344 (56%), Gaps = 26/344 (7%)

Query: 6   LIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAV 65
           L+V L+    C   A      +  + + +E WK  + ++Y ++ E  +R  ++++NL  +
Sbjct: 53  LLVCLL--SLCWGLAVSAPLGDSELDKHWELWKNWHQKSYHKAEEGWRRM-VWEENLKVI 109

Query: 66  ERFN-NAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFL-YKSSQV 123
           E  N   ++G  +Y L +N+F DLT +EF   Q        S   + NG+ FL     QV
Sbjct: 110 ELHNLEQSLGLHTYQLGMNQFGDLTNEEF--QQMLISERHFSEGNRINGSAFLEVNYVQV 167

Query: 124 PPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDN 176
           P SV+W + G VTPVK QG C          A+EG    K  RLVSLSEQ LVDC+    
Sbjct: 168 PTSVDWRDHGYVTPVKNQGHCGSCWAFSTTGALEGQLFRKSGRLVSLSEQNLVDCSWQQG 227

Query: 177 NNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPND 236
           N GC GG +D AF+YI++N+GI ++  Y Y    T  C + K E   A++T + D+PP+ 
Sbjct: 228 NQGCNGGIVDFAFQYILENRGIDSEDCYPYTAKDTAQC-AFKPECATARVTGFVDIPPHS 286

Query: 237 EESLLKAVAN-QPVSVAIDA--SALQFYSGGVF-NGYCET-FLNHGVTAVGY---GTSEE 288
           EE+L+KAVA   PVSVAIDA  ++ +FY  G+F    C +  LNH V  VGY   G  E 
Sbjct: 287 EEALMKAVATVGPVSVAIDAHPTSFRFYQSGIFYEPKCSSERLNHAVLVVGYGYEGEDEA 346

Query: 289 GIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
           G KYW++KNSWG+ WG+ GYF L +D       CGIA  AS+P+
Sbjct: 347 GKKYWIVKNSWGKQWGDHGYFYLSKDRGN---HCGIATTASYPL 387


>gi|255544115|ref|XP_002513120.1| cysteine protease, putative [Ricinus communis]
 gi|223548131|gb|EEF49623.1| cysteine protease, putative [Ricinus communis]
          Length = 362

 Score =  224 bits (572), Expect = 3e-56,   Method: Compositional matrix adjust.
 Identities = 126/262 (48%), Positives = 171/262 (65%), Gaps = 18/262 (6%)

Query: 14  GSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAI 73
           G+  SQ   RT  E S+ E+ EQW A Y R YK++ E   R++IFK+N+  ++ FN+ + 
Sbjct: 19  GAWTSQCMARTLQEASMYERHEQWMASYARVYKDANEKQMRYKIFKENVQRIDSFNSES- 77

Query: 74  GNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKS-SQVPPSVNWIEK 132
            ++SY L +N+FADLT +EF + + GFK   H  S +A    F Y++ + VP S++W +K
Sbjct: 78  -DKSYKLAVNQFADLTNEEFKSLRNGFK--GHMCSAQAG--HFRYENVTAVPASIDWRKK 132

Query: 133 GAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFM 185
           GAVT +K QGQC       AVAAVEGI  IK  +L+SLSEQ+LVDC TN  + GC GG M
Sbjct: 133 GAVTQIKEQGQCGSCWAFSAVAAVEGITEIKTGKLISLSEQELVDCDTNSEDQGCQGGLM 192

Query: 186 DDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVA 245
           DDAFK+I Q+ G+ ++A Y Y+   +  C + +    +A+IT YEDVP NDE +L  AVA
Sbjct: 193 DDAFKFIEQH-GLASEATYPYDAADS-TCKTKEEAKPSAKITGYEDVPANDEAALKNAVA 250

Query: 246 NQPVSVAIDASA--LQFYSGGV 265
           NQPVSVAIDA     QFYS G+
Sbjct: 251 NQPVSVAIDAGGFEFQFYSSGI 272


>gi|357139514|ref|XP_003571326.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP2-like
           [Brachypodium distachyon]
          Length = 363

 Score =  224 bits (572), Expect = 3e-56,   Method: Compositional matrix adjust.
 Identities = 131/335 (39%), Positives = 188/335 (56%), Gaps = 33/335 (9%)

Query: 26  DEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFN------NAAIGN---- 75
           D+  + +++ +W+A+Y + Y    E  KRF +F+DN  ++  F+      +A +G+    
Sbjct: 35  DDSELRQRWSKWQAKYSKRYPSHEEQEKRFGVFRDNSNSIGAFSAPQTTTSAVVGSFGAP 94

Query: 76  ---RSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEK 132
               +  + +N+F DL P+E +   TGF  ++ ++ LK      L   S+ P  V+W   
Sbjct: 95  QTVTTVRVGMNRFGDLQPREVLDQFTGF--NNTAAVLKTPPPTRLPHHSRKPCCVDWRSS 152

Query: 133 GAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFM 185
           GAVT VK+QG C       AVAA+EG+N I+   LVSLSEQQLVDC  ++ ++GC GG  
Sbjct: 153 GAVTGVKFQGSCQSCWAFAAVAAIEGMNKIRTGTLVSLSEQQLVDC--DNGSSGCAGGRT 210

Query: 186 DDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAE-DHAAQITNYEDVPPNDEESLLKAV 244
           D A   + +  GIT+   Y+Y G + G C   K   DH A +  ++ VPPNDE  L  AV
Sbjct: 211 DTALDLVARRGGITSGERYAYGGFN-GRCKVDKLLFDHGAAVGGFKAVPPNDEHQLAMAV 269

Query: 245 ANQPVSVAIDASA--LQFYSGGVFNGYCE---TFLNHGVTAVGYGTSEEGIKYWLIKNSW 299
           A QPV+  +DAS    QFYSGG+F G C      +NH VT VGY   E G K+W+ KNSW
Sbjct: 270 ARQPVTAYVDASTWEFQFYSGGIFRGPCSGDPARVNHAVTIVGY-CEEFGDKFWIAKNSW 328

Query: 300 GQDWGEDGYFRLQRDI-DQPQGQCGIAMFASFPVS 333
             DWG+ GY  L +D+   P G CG+A    +P +
Sbjct: 329 SDDWGDQGYILLAKDVLSSPNGTCGLATSPFYPTA 363


>gi|7523482|dbj|BAA94210.1| putative cysteine protease [Oryza sativa Japonica Group]
 gi|10800060|dbj|BAB16480.1| putative cysteine protease [Oryza sativa Japonica Group]
          Length = 349

 Score =  224 bits (572), Expect = 3e-56,   Method: Compositional matrix adjust.
 Identities = 132/321 (41%), Positives = 178/321 (55%), Gaps = 31/321 (9%)

Query: 32  EKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQ 91
           + FE+W A++G+ Y    E   RF +F+DN+  +  +   A  N +  LR+N+FADLT  
Sbjct: 39  QMFEEWMAKFGKKYPCHGEKEYRFGVFRDNVRFIRSYRPPAGYNSA--LRVNQFADLTND 96

Query: 92  EFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQC------- 144
           EF+++ TG K      + +     +L      P  ++W  KGAVT VK QG C       
Sbjct: 97  EFVSTHTGAKPPCPKDAPRGVDPIWL------PCCIDWRYKGAVTDVKDQGACGSCWAFA 150

Query: 145 AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVY 204
           AVAA+EG+  I+  +L  LSEQ+LVDC T   ++GC GG  D AF+ +    GIT ++ Y
Sbjct: 151 AVAAIEGLTQIRTGKLTPLSEQELVDCDTG--SSGCAGGHTDRAFELVAAKGGITAESGY 208

Query: 205 SYEGMSTGICDSIKAE-DHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDAS--ALQFY 261
            YEG   G C +  A  +HAA+I  +  VPP DE  L  AVA QPV+  IDAS  A QFY
Sbjct: 209 RYEGYR-GKCRADDALFNHAARIGGHRAVPPGDERQLATAVARQPVTAYIDASGPAFQFY 267

Query: 262 SGGVFNGYCETF---------LNHGVTAVGYGT-SEEGIKYWLIKNSWGQDWGEDGYFRL 311
             GVF G C +           NH VT VGY      G KYW+ KNSWG+ WGE GY  L
Sbjct: 268 GSGVFPGPCGSGSGAAAAAPTTNHAVTLVGYCQDGASGKKYWVAKNSWGKTWGEKGYILL 327

Query: 312 QRDIDQPQGQCGIAMFASFPV 332
           ++D+  P G CG+A+   +P 
Sbjct: 328 EKDVASPHGTCGVAVSPFYPT 348


>gi|218187750|gb|EEC70177.1| hypothetical protein OsI_00904 [Oryza sativa Indica Group]
 gi|222617983|gb|EEE54115.1| hypothetical protein OsJ_00884 [Oryza sativa Japonica Group]
          Length = 327

 Score =  224 bits (572), Expect = 4e-56,   Method: Compositional matrix adjust.
 Identities = 132/321 (41%), Positives = 178/321 (55%), Gaps = 31/321 (9%)

Query: 32  EKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQ 91
           + FE+W A++G+ Y    E   RF +F+DN+  +  +   A  N +  LR+N+FADLT  
Sbjct: 17  QMFEEWMAKFGKKYPCHGEKEYRFGVFRDNVRFIRSYRPPAGYNSA--LRVNQFADLTND 74

Query: 92  EFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQC------- 144
           EF+++ TG K      + +     +L      P  ++W  KGAVT VK QG C       
Sbjct: 75  EFVSTHTGAKPPCPKDAPRGVDPIWL------PCCIDWRYKGAVTDVKDQGACGSCWAFA 128

Query: 145 AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVY 204
           AVAA+EG+  I+  +L  LSEQ+LVDC T   ++GC GG  D AF+ +    GIT ++ Y
Sbjct: 129 AVAAIEGLTQIRTGKLTPLSEQELVDCDTG--SSGCAGGHTDRAFELVAAKGGITAESGY 186

Query: 205 SYEGMSTGICDSIKAE-DHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDAS--ALQFY 261
            YEG   G C +  A  +HAA+I  +  VPP DE  L  AVA QPV+  IDAS  A QFY
Sbjct: 187 RYEGYR-GKCRADDALFNHAARIGGHRAVPPGDERQLATAVARQPVTAYIDASGPAFQFY 245

Query: 262 SGGVFNGYCETF---------LNHGVTAVGYGT-SEEGIKYWLIKNSWGQDWGEDGYFRL 311
             GVF G C +           NH VT VGY      G KYW+ KNSWG+ WGE GY  L
Sbjct: 246 GSGVFPGPCGSGSGAAAAAPTTNHAVTLVGYCQDGASGKKYWVAKNSWGKTWGEKGYILL 305

Query: 312 QRDIDQPQGQCGIAMFASFPV 332
           ++D+  P G CG+A+   +P 
Sbjct: 306 EKDVASPHGTCGVAVSPFYPT 326


>gi|312381833|gb|EFR27483.1| hypothetical protein AND_05794 [Anopheles darlingi]
          Length = 344

 Score =  224 bits (571), Expect = 4e-56,   Method: Compositional matrix adjust.
 Identities = 140/353 (39%), Positives = 200/353 (56%), Gaps = 34/353 (9%)

Query: 3   KYFLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNL 62
           K FL++V  ++ + A       F+   + E++  +K Q+ + Y   +E   R +I+  N 
Sbjct: 2   KLFLLLVSFLAAANAVS----IFN--LVKEEWNAFKLQHRKKYDSESEERIRMKIYVQNK 55

Query: 63  VAVERFNNA-AIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSS-------LKANGT 114
             + + N    +G   + LR+NK+ADL  +EF+ +  GF  S  + S       L     
Sbjct: 56  HKIAKHNQRYDLGQEKFRLRVNKYADLLHEEFVHTLNGFNRSAAAGSKLLGREQLMTIEE 115

Query: 115 PFLY---KSSQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLS 164
           P  +    +  VP +++W EKGAVTPVK QG C       A  A+EG +  K  +LVSLS
Sbjct: 116 PITWIEPANVDVPTTIDWREKGAVTPVKDQGHCGSCWSFSATGALEGQHFRKTGKLVSLS 175

Query: 165 EQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAA 224
           EQ LVDC+T   NNGC GG MD+AF+Y+  NKGI  +  Y YE +      + KA    A
Sbjct: 176 EQNLVDCSTKYGNNGCNGGLMDNAFQYVKDNKGIDTEKAYPYEAIDDECHYNPKAI--GA 233

Query: 225 QITNYEDVPPNDEESLLKAVAN-QPVSVAIDAS--ALQFYSGGV-FNGYCETF-LNHGVT 279
               + D+P  DE++L KA+A   PVSVAIDAS  + QFYS GV +   C++  L+HGV 
Sbjct: 234 TDKGFVDIPQGDEKALKKALATVGPVSVAIDASHESFQFYSEGVYYEPQCDSEQLDHGVL 293

Query: 280 AVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
           AVGYGT+E+G  YWL+KNSWG  WG+ GY ++ R+    +  CGIA  AS+P+
Sbjct: 294 AVGYGTTEDGEDYWLVKNSWGTTWGDQGYVKMARN---RENHCGIATTASYPL 343


>gi|2507252|sp|P14080.2|PAPA2_CARPA RecName: Full=Chymopapain; AltName: Full=Papaya proteinase II;
           Short=PPII; Flags: Precursor
 gi|1332461|emb|CAA66378.1| chymopapain [Carica papaya]
          Length = 352

 Score =  224 bits (571), Expect = 4e-56,   Method: Compositional matrix adjust.
 Identities = 138/344 (40%), Positives = 186/344 (54%), Gaps = 25/344 (7%)

Query: 5   FLIVVLIISGSCASQATYR---TFDEGSIAEK----FEQWKAQYGRTYKESAENSKRFEI 57
           FL   LII    +S   Y    + D+ +  E+    F+ W  ++ + Y+   E   RFEI
Sbjct: 12  FLATCLIIHMGLSSADFYTVGYSQDDLTSIERLIQLFDSWMLKHNKIYESIDEKIYRFEI 71

Query: 58  FKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFL 117
           F+DNL+ ++  N     N SY L LN FADL+  EF     GF   D +     +   F 
Sbjct: 72  FRDNLMYIDETNKK---NNSYWLGLNGFADLSNDEFKKKYVGFVAEDFTGLEHFDNEDFT 128

Query: 118 YKS-SQVPPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLV 169
           YK  +  P S++W  KGAVTPVK QG C        +A VEGIN I    L+ LSEQ+LV
Sbjct: 129 YKHVTNYPQSIDWRAKGAVTPVKNQGACGSCWAFSTIATVEGINKIVTGNLLELSEQELV 188

Query: 170 DCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNY 229
           DC  + ++ GC GG+   + +Y+  N G+    VY Y+      C +        +IT Y
Sbjct: 189 DC--DKHSYGCKGGYQTTSLQYVANN-GVHTSKVYPYQAKQYK-CRATDKPGPKVKITGY 244

Query: 230 EDVPPNDEESLLKAVANQPVSVAIDASA--LQFYSGGVFNGYCETFLNHGVTAVGYGTSE 287
           + VP N E S L A+ANQP+SV ++A     Q Y  GVF+G C T L+H VTAVGYGTS+
Sbjct: 245 KRVPSNCETSFLGALANQPLSVLVEAGGKPFQLYKSGVFDGPCGTKLDHAVTAVGYGTSD 304

Query: 288 EGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFP 331
            G  Y +IKNSWG +WGE GY RL+R     QG CG+   + +P
Sbjct: 305 -GKNYIIIKNSWGPNWGEKGYMRLKRQSGNSQGTCGVYKSSYYP 347


>gi|384247445|gb|EIE20932.1| hypothetical protein COCSUDRAFT_18161 [Coccomyxa subellipsoidea
           C-169]
          Length = 387

 Score =  224 bits (571), Expect = 4e-56,   Method: Compositional matrix adjust.
 Identities = 140/336 (41%), Positives = 177/336 (52%), Gaps = 41/336 (12%)

Query: 41  YGRTYKESAENSKRFEIFKDNLVAVERFNNAA----------------------IGNRSY 78
           + + Y    E + R  IFK N+  +   N+A                       +   ++
Sbjct: 7   FNKKYSNEEEAALRLNIFKTNVDYITSVNSAQQSYQASKHFSENTQQTALSSLFLSQLAH 66

Query: 79  T-----LRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKG 133
           T     L LN+FAD T +EF ++  G    +  S   +  T F +       S+NW+E G
Sbjct: 67  TDLLPQLGLNEFADQTWEEFSSTHLGLNAGEDGSFRSSANTGFRHADVTPANSINWVEAG 126

Query: 134 AVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMD 186
           AVTPVK Q  C          +VEG N +    LVSLSEQQLVDC T   + GC GG MD
Sbjct: 127 AVTPVKNQAFCGSCWAFSTTGSVEGANFLATGDLVSLSEQQLVDCDTK-KDQGCGGGLMD 185

Query: 187 DAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVAN 246
            AF YII+N G+  +  YSY  +  G C+ ++ E     I  YEDVP NDE +L KAV+ 
Sbjct: 186 YAFDYIIKNGGLDTEEDYSYWSVG-GFCNKLREERTVVSIDGYEDVPVNDEVALAKAVSK 244

Query: 247 QPVSVAIDAS-ALQFYSGGVF--NGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDW 303
           QPVSVAI AS A+QFYS GV    G C   LNHGV A GY   E G  YWL+KNSWG  W
Sbjct: 245 QPVSVAICASEAMQFYSSGVIAAKGSC-IGLNHGVLAAGYDVDESGKPYWLVKNSWGGTW 303

Query: 304 GEDGYFRLQRDIDQPQGQCGIAMFASFPVSKESAQP 339
           G  GY +L++D    +G CGIAM AS+PV K S  P
Sbjct: 304 GMQGYMKLEKDSSVKEGACGIAMAASYPV-KSSPNP 338


>gi|302763109|ref|XP_002964976.1| hypothetical protein SELMODRAFT_83176 [Selaginella moellendorffii]
 gi|302763113|ref|XP_002964978.1| hypothetical protein SELMODRAFT_83554 [Selaginella moellendorffii]
 gi|300167209|gb|EFJ33814.1| hypothetical protein SELMODRAFT_83176 [Selaginella moellendorffii]
 gi|300167211|gb|EFJ33816.1| hypothetical protein SELMODRAFT_83554 [Selaginella moellendorffii]
          Length = 300

 Score =  224 bits (570), Expect = 6e-56,   Method: Compositional matrix adjust.
 Identities = 124/310 (40%), Positives = 184/310 (59%), Gaps = 22/310 (7%)

Query: 34  FEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEF 93
           FE W A++ ++Y    E ++R  +F D L  +E+ N  A  N ++TL LNKF+DLT  EF
Sbjct: 2   FEDWAAKHDKSYSSDWEKARRLMVFSDTLAYIEKHN--AQPNTTFTLGLNKFSDLTNAEF 59

Query: 94  IASQTG-FKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQC-------A 145
            A+  G FK   +     A         S +P S++W ++GAVTP+K QGQC       A
Sbjct: 60  RANYVGKFKPPRYQDRRPAKDVDV--DVSSLPTSLDWRQEGAVTPIKDQGQCGSCWAFSA 117

Query: 146 VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYS 205
           +A++E  + +    LVSLSEQQL+DC T D   GC GGF DDAFK++++N G+T +  Y 
Sbjct: 118 IASIESAHFLATKELVSLSEQQLIDCDTVDQ--GCQGGFPDDAFKFVVENGGVTTEEAYP 175

Query: 206 YEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASALQF--YSG 263
           Y G + G C++ K  +   +IT Y+DV  +  ++L+KAV+  PV+V I  S   F  Y  
Sbjct: 176 YTGFA-GSCNTNK--NKVVEITGYKDVTKDSADALMKAVSKTPVTVGICGSDQNFQNYRS 232

Query: 264 GVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCG 323
           G+ +G C    +H V  +GYGT E G+ YW+IKNSWG  WGEDG+ ++++     +G CG
Sbjct: 233 GILSGQCCNSRDHAVLVIGYGT-EGGMPYWIIKNSWGTSWGEDGFMKIKK--KDGEGMCG 289

Query: 324 IAMFASFPVS 333
           +   +S+P +
Sbjct: 290 MNGQSSYPTT 299


>gi|157093563|gb|ABV22436.1| cysteine proteinase [Oxyrrhis marina]
          Length = 329

 Score =  224 bits (570), Expect = 6e-56,   Method: Compositional matrix adjust.
 Identities = 129/313 (41%), Positives = 174/313 (55%), Gaps = 18/313 (5%)

Query: 30  IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLT 89
           I  ++E++KA++G +Y    E ++R  +F  N+  +   N+      +YTL +N+FADLT
Sbjct: 15  IDAQWEEFKAKFGESYNGEEEEAERKGVFAQNVQLINEENSKG---HTYTLGVNQFADLT 71

Query: 90  PQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQCA---- 145
            +EF  +  GFK         A     +Y    +P SV+W  +GAVTPVK QGQC     
Sbjct: 72  VEEFSKTYMGFKKPAQKYGDAAYLGRHVYNGEALPTSVDWSSQGAVTPVKNQGQCGSCWS 131

Query: 146 ---VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDA 202
                ++EG N I   +LVSLSEQQ VDCA    N GC GG MD AFKY   N  +  + 
Sbjct: 132 FSTTGSLEGANEISTGKLVSLSEQQFVDCAGTYGNQGCNGGLMDSAFKYAEANA-LCTEQ 190

Query: 203 VYSYEGMSTGICDSIKAEDHAAQ--ITNYEDVPPNDEESLLKAVANQPVSVAIDA--SAL 258
            Y Y+G + G C +       A+  ++ Y+DV  + E+ ++ AVA QPVS+AI+A  S  
Sbjct: 191 SYPYKG-TDGSCQASSCSTGLAKGSVSGYKDVSSDSEQDMMSAVAQQPVSIAIEADKSVF 249

Query: 259 QFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQP 318
           Q YSGGV  G C   L+HGV AVGYGT   G  YW +KNSWG  WG  GY  LQR     
Sbjct: 250 QLYSGGVLTGACGASLDHGVLAVGYGTL-SGTDYWKVKNSWGSTWGMSGYVLLQRG-KGG 307

Query: 319 QGQCGIAMFASFP 331
            G+CG+    S+P
Sbjct: 308 SGECGLLSEPSYP 320


>gi|156124998|gb|ABU50817.1| Ale o 1 allergen [Aleuroglyphus ovatus]
          Length = 337

 Score =  223 bits (569), Expect = 7e-56,   Method: Compositional matrix adjust.
 Identities = 138/322 (42%), Positives = 191/322 (59%), Gaps = 27/322 (8%)

Query: 27  EGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFN-NAAIGNRSYTLRLNKF 85
           EG +  +FEQ+K+ +GR Y        R  IF+ NL  + R N +   G+ ++++ +N F
Sbjct: 26  EGELEAQFEQFKSTFGRVYPSPEIELHRKSIFRANLQFILRHNIDYFNGDSTFSVSVNNF 85

Query: 86  ADLTPQEFIASQTGFKMSDHSS---SLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQG 142
            DL+ +EF A+  G++     S   S+ A+          +P +V+W  KG VTP+K Q 
Sbjct: 86  TDLSNEEFRATFNGYRRLAAVSLADSVHADN-----DVEALPATVDWTTKGVVTPIKNQQ 140

Query: 143 QC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQN 195
           QC       AVA++EG +A+K  +LVSLSEQ LVDC+  + + GC GG+MD AFKY+IQN
Sbjct: 141 QCGSCWAFSAVASMEGQHALKTGKLVSLSEQNLVDCSAAEGDMGCSGGWMDYAFKYVIQN 200

Query: 196 KGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVAN-QPVSVAID 254
           +GI  +A Y Y+ +    C+  K     A I ++ DV   DE +L  AVA+  P+SVAID
Sbjct: 201 RGIDTEASYPYKAIDES-CE-FKRNSVGATIHSFVDVKTGDESALQNAVASIGPISVAID 258

Query: 255 AS--ALQFYSGGVFNGY-CET-FLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFR 310
           A+  + QFYS GV+N   C T  L+HGVTAVGYGT   G  YW +KNSWG  WG  GY  
Sbjct: 259 AAQPSFQFYSSGVYNEPDCSTEILDHGVTAVGYGTL-NGAPYWKVKNSWGTSWGRKGYIF 317

Query: 311 LQRDIDQPQGQCGIAMFASFPV 332
           + R+    Q QCGIA  AS+PV
Sbjct: 318 MSRN---KQNQCGIATKASYPV 336


>gi|4731372|gb|AAD28476.1|AF133838_1 papain-like cysteine protease [Sandersonia aurantiaca]
          Length = 370

 Score =  223 bits (569), Expect = 7e-56,   Method: Compositional matrix adjust.
 Identities = 115/221 (52%), Positives = 145/221 (65%), Gaps = 12/221 (5%)

Query: 123 VPPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATND 175
           +P SV+W EKGAV P+K QG C        +A+VEGIN I    L+SLSEQ+LVDC    
Sbjct: 41  LPDSVDWREKGAVVPIKDQGGCGSCWAFSTIASVEGINKIVTGDLISLSEQELVDCDKT- 99

Query: 176 NNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPN 235
            N+GC GG MD AF++II N GI  +  Y Y     G CDS +       I +YEDVP N
Sbjct: 100 YNDGCNGGLMDYAFQFIIDNGGIDTEKDYPYT-EQDGRCDSYRKNAKVVSINSYEDVPVN 158

Query: 236 DEESLLKAVANQPVSVAIDAS--ALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYW 293
           DE++L KA A+QP++VAID    + Q Y+ G+F G C T L+HGVT VGYG SE G  YW
Sbjct: 159 DEQALKKAAASQPIAVAIDGGGRSFQLYNSGIFTGKCGTSLDHGVTVVGYG-SESGKDYW 217

Query: 294 LIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVSK 334
           +++NSWG+ WGE GY R+ R+ID P G CGIAM AS+P+ K
Sbjct: 218 IVRNSWGESWGEKGYIRMARNIDSPSGICGIAMEASYPIKK 258


>gi|149617838|ref|XP_001521715.1| PREDICTED: cathepsin L1-like [Ornithorhynchus anatinus]
          Length = 338

 Score =  223 bits (569), Expect = 7e-56,   Method: Compositional matrix adjust.
 Identities = 136/344 (39%), Positives = 196/344 (56%), Gaps = 26/344 (7%)

Query: 6   LIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAV 65
           L+V L+    C   A      +  +   ++ WK  + ++Y E+ E  +R  ++++NL A+
Sbjct: 3   LLVCLV--SLCWGLAVSAPLGDSELDRHWKLWKNWHQKSYHEAEEGWRR-TVWEENLKAI 59

Query: 66  ERFN-NAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKS-SQV 123
           +  N   ++G  +Y L +N+F DLT +EF    TG +    S   + NG+ FL  +  QV
Sbjct: 60  QLHNLEQSLGLHTYRLGMNQFGDLTNEEFQEILTGER--HFSKGNRINGSAFLEANFVQV 117

Query: 124 PPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDN 176
           P SV+W + G VTPVK QG C          A+EG    K  RL+SLSEQ LVDC+    
Sbjct: 118 PTSVDWRDHGYVTPVKNQGHCGSCWAFSTTGALEGQLFRKSGRLISLSEQNLVDCSWQQG 177

Query: 177 NNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPND 236
           N GC+GG +D AF+YI+QN+GI ++  Y Y    T  C + K E   A +T + D+PP+ 
Sbjct: 178 NQGCHGGIVDLAFQYILQNQGIDSEDCYPYTAKDTAQC-TFKPECATAPVTGFVDIPPHS 236

Query: 237 EESLLKAVAN-QPVSVAIDAS--ALQFYSGGVF-NGYCET-FLNHGVTAVGYGTSEE--- 288
           EE+L+KAVA   PVSV IDAS  + +FY  G+F +  C +  L+H V  VGYG   E   
Sbjct: 237 EEALMKAVATVGPVSVGIDASSTSFRFYQSGIFYDPKCSSESLDHAVLVVGYGYEREDEA 296

Query: 289 GIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
           G KYW++KNSWG+ WG+ GY  + +D       CGIA  AS+P+
Sbjct: 297 GKKYWIVKNSWGKHWGDRGYVYMSKDRGN---HCGIATVASYPL 337


>gi|346469447|gb|AEO34568.1| hypothetical protein [Amblyomma maculatum]
          Length = 333

 Score =  223 bits (569), Expect = 7e-56,   Method: Compositional matrix adjust.
 Identities = 143/346 (41%), Positives = 196/346 (56%), Gaps = 37/346 (10%)

Query: 5   FLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVA 64
            L  ++  + +  SQ   RT        ++E +K+ + +TYK + E   RF+IF +N + 
Sbjct: 6   LLCAIVAAATAATSQEILRT--------EWEAFKSTHKKTYKSNVEELLRFKIFTENSLF 57

Query: 65  VERFN-NAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFL----YK 119
           + + N   A G  SY L +N+FADL P EF+    G++       L   G+ +L      
Sbjct: 58  IAKHNVKYAKGLVSYKLGINQFADLLPHEFVKMMNGYQ----GKRLAGRGSTYLPPANLN 113

Query: 120 SSQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCA 172
            S +P +V+W +KGAVTPVK QGQC       +  ++EG + +K  +LVSLSEQ LVDC+
Sbjct: 114 DSSLPKTVDWRKKGAVTPVKDQGQCGSCWAFSSTGSLEGQHFLKTGKLVSLSEQNLVDCS 173

Query: 173 TNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDV 232
           +   N GC GG MD++F YI  N GI  +  Y YE    G C   K ED  A  T + D+
Sbjct: 174 SAYGNQGCNGGLMDNSFNYIKANGGIDTEDSYPYEA-EDGDC-RYKKEDVGATDTGFVDI 231

Query: 233 PPNDEESLLKAVAN-QPVSVAIDAS--ALQFYSGGVF---NGYCETFLNHGVTAVGYGTS 286
               E+ L KAVA   PVSVAIDAS  + Q YS GV+   N   E+ L+HGV AVGYG  
Sbjct: 232 KEGSEKDLQKAVATVGPVSVAIDASQQSFQLYSEGVYDEPNCSSES-LDHGVLAVGYGV- 289

Query: 287 EEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
           + G KYWL+KNSW + WG+DGY  + RD +    QCGIA  AS+P+
Sbjct: 290 KNGKKYWLVKNSWAETWGQDGYILMSRDKNN---QCGIASSASYPL 332


>gi|224116884|ref|XP_002317418.1| predicted protein [Populus trichocarpa]
 gi|222860483|gb|EEE98030.1| predicted protein [Populus trichocarpa]
          Length = 503

 Score =  223 bits (569), Expect = 8e-56,   Method: Compositional matrix adjust.
 Identities = 131/327 (40%), Positives = 182/327 (55%), Gaps = 29/327 (8%)

Query: 27  EGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFA 86
           E SI E F+QW+ ++ + Y+ +AE+ KR+  FK NL  +            +++ LNKFA
Sbjct: 43  EESIIEIFQQWRDRHQKVYEHAAESEKRYRNFKRNLKYIIEKAGKKTAALGHSVGLNKFA 102

Query: 87  DLTPQEF-------IASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVK 139
           DL+ +EF       +      K S      + N      ++   P S++W +KG VT VK
Sbjct: 103 DLSNEEFKELYLSKVKKPINIKRSTARDWRQRN-----LQTCDAPSSLDWRKKGVVTAVK 157

Query: 140 YQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYI 192
            QG C          A+EGINAI    L+SLSEQ+LVDC T   N GC GG+MD AF+++
Sbjct: 158 DQGDCGSCWSFSTTGAIEGINAIVTGDLISLSEQELVDCDTT--NYGCEGGYMDYAFEWV 215

Query: 193 IQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVA 252
           I N GI  +A Y Y G+  G C++ K E     I  Y DV   D  +LL A   QP+SV 
Sbjct: 216 INNGGIDTEANYPYTGVD-GTCNTTKEEIKVVSIDGYTDVDETDS-ALLCATVQQPISVG 273

Query: 253 IDASAL--QFYSGGVFNGYCE---TFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDG 307
           +D SAL  Q Y+GG+++G C      ++H V  VGYG SE G  YW++KNSWG +WG +G
Sbjct: 274 MDGSALDFQLYTGGIYDGDCSDDPNDIDHAVLIVGYG-SENGEDYWIVKNSWGTEWGMEG 332

Query: 308 YFRLQRDIDQPQGQCGIAMFASFPVSK 334
           YF ++R+ D P G C I   AS+P  +
Sbjct: 333 YFYIKRNTDLPYGVCAINAEASYPTKE 359


>gi|260516654|gb|ACX43954.1| cysteine protease 1 [Brachiaria hybrid cultivar]
 gi|260516656|gb|ACX43955.1| cysteine protease 1 [Brachiaria hybrid cultivar]
 gi|260516658|gb|ACX43956.1| cysteine protease 1 [Brachiaria hybrid cultivar]
 gi|260516660|gb|ACX43957.1| cysteine protease 1 [Brachiaria hybrid cultivar]
 gi|260516662|gb|ACX43958.1| cysteine protease 2 [Brachiaria hybrid cultivar]
 gi|260516664|gb|ACX43959.1| cysteine protease 2 [Brachiaria hybrid cultivar]
 gi|260516666|gb|ACX43960.1| cysteine protease 2 [Brachiaria hybrid cultivar]
 gi|260516668|gb|ACX43961.1| cysteine protease 2 [Brachiaria hybrid cultivar]
 gi|260516670|gb|ACX43962.1| cysteine protease 2 [Brachiaria hybrid cultivar]
          Length = 338

 Score =  223 bits (569), Expect = 9e-56,   Method: Compositional matrix adjust.
 Identities = 137/314 (43%), Positives = 180/314 (57%), Gaps = 25/314 (7%)

Query: 30  IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLT 89
           + + F  +  QY + Y   AE S RF  FK N+  +   N  A  N SYT+ LN+FADL+
Sbjct: 38  LQDMFTAFMKQYSKAYSH-AEFSSRFNQFKANVETIRLHNTLA--NASYTMGLNEFADLS 94

Query: 90  PQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQC----- 144
            +EF     G+K   H     A       +    P S++W    AVTP+K QGQC     
Sbjct: 95  FEEFKGKYFGYK---HVEREFARSNNLHQEVEAAPTSIDWRTSNAVTPIKDQGQCGSCWA 151

Query: 145 --AVAAVEGINAIK-INRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITND 201
             A  ++EG   ++  + L SLSEQQLVDC+T+  N GC GG MD AF+YII NKGI  +
Sbjct: 152 FSATGSIEGAWVLQGKHTLTSLSEQQLVDCSTSYGNAGCNGGLMDYAFEYIIANKGICAE 211

Query: 202 AVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVAN-QPVSVAIDA--SAL 258
           + Y Y+G+  G+C   K+      I+ Y+DV   DE SLL AV    PVSVAI+A  +  
Sbjct: 212 SAYPYKGVG-GLCQ--KSCTKVVTISGYKDVASGDEASLLNAVGTVGPVSVAIEADQAGF 268

Query: 259 QFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQP 318
           QFYS GVF+G C   L+HGV AVGYGT+     YW++KNSWG  WGE GY R+ R+    
Sbjct: 269 QFYSSGVFSGTCGHNLDHGVLAVGYGTTGSQ-DYWIVKNSWGTSWGESGYIRMIRN---- 323

Query: 319 QGQCGIAMFASFPV 332
           + QCGIA+  S+P 
Sbjct: 324 KNQCGIAIQPSYPT 337


>gi|326520387|dbj|BAK07452.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 349

 Score =  223 bits (568), Expect = 9e-56,   Method: Compositional matrix adjust.
 Identities = 138/345 (40%), Positives = 195/345 (56%), Gaps = 29/345 (8%)

Query: 7   IVVLIISGSCAS--QATYRTFDEGSI--AEKFEQWKAQYGRTYKESAENSKRFEIFKDNL 62
           I+VL+  G  A+   A+    D G +   ++F QW+A + R+Y  + E  +RFE+++ N+
Sbjct: 14  ILVLLTGGLFAAFPAASGGRVDAGDMLMMDRFRQWQATHNRSYLSAEERLRRFEVYRTNV 73

Query: 63  VAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTG------FKMSDHSSSLKANGTPF 116
             ++  N    G  +Y L  N+FADLT +EF+A   G         +  +  L ++G   
Sbjct: 74  EYIDATNRR--GGLTYELGENQFADLTGEEFLARYAGGHTGSAITTAAEADGLWSSGGSD 131

Query: 117 LYKSSQVPPSVNWIEKGAVTPVKYQG-QC-------AVAAVEGINAIKINRLVSLSEQQL 168
               +  P SV+W  KGAVTPVK QG QC       AVA +E +  IK  +LV+LSEQQL
Sbjct: 132 GSLEADPPASVDWRAKGAVTPVKNQGSQCYSCWAFSAVATMESLYFIKTGKLVALSEQQL 191

Query: 169 VDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITN 228
           VDC   D   GC  G+   AF++I++N GIT  A Y Y+ +  G C + K    A  IT 
Sbjct: 192 VDCDKYDG--GCNKGYYHRAFQWIMENGGITTAAQYPYKAVR-GACSAAKP---AVTITG 245

Query: 229 YEDVPPNDEESLLKAVANQPVSVAIDAS-ALQFYSGGVFNGYCETFLNHGVTAVGYGTSE 287
           +  V  N E +L  AVA QP+ VAI+   ++QFY  GVF+  C   ++H V  VGYG   
Sbjct: 246 HLAVAKN-ELALQSAVARQPIGVAIEVPISMQFYKSGVFSAACGIQMSHAVVTVGYGADA 304

Query: 288 EGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
            G+KYWL+KNSWGQ WGE GY R++RD+    G CGIA+  ++P 
Sbjct: 305 SGLKYWLVKNSWGQTWGEAGYIRMRRDVGG-GGLCGIALDTAYPT 348


>gi|151176971|gb|ABR88030.1| digestive cysteine protease [Dermestes frischii]
          Length = 339

 Score =  223 bits (568), Expect = 1e-55,   Method: Compositional matrix adjust.
 Identities = 140/348 (40%), Positives = 197/348 (56%), Gaps = 29/348 (8%)

Query: 3   KYFLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNL 62
           K+F++ ++ I G+ A       FD   + E++  +K Q+ + YK   E   R +IF +N 
Sbjct: 2   KFFVLALVFIVGAQAVSF----FD--LVQEQWGTFKLQHKKQYKSDTEEKFRMKIFMENS 55

Query: 63  VAVERFNNA-AIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKAN----GTPFL 117
             V + N    +G  SY L++NK+AD+   EF+ +  GF  + ++  L  +    G  F+
Sbjct: 56  HKVAKXNKLYEMGLVSYKLKINKYADMLHHEFVHTVNGFNRTKNTPLLGTSEDEQGATFI 115

Query: 118 YKSS-QVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLV 169
             ++ + P +V+W E GAVT VK QG C       A  A+EG +  K N+LVSLSEQ LV
Sbjct: 116 APANVKFPENVDWREHGAVTXVKDQGHCGSCWSFSATGALEGQHFRKTNKLVSLSEQNLV 175

Query: 170 DCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNY 229
           DC+T   N+GC GG MD+AFKY+  N GI  +A Y Y         + K     A    +
Sbjct: 176 DCSTKFGNDGCNGGLMDNAFKYVKYNHGIDTEASYPYHADDEKCHYNPKTS--GATDRGF 233

Query: 230 EDVPPNDEESLLKAVAN-QPVSVAIDAS--ALQFYSGGV-FNGYCET-FLNHGVTAVGYG 284
            D+P  DEE L+ AVA   PVSVAIDAS  + Q YS GV ++  C +  L+HGV  VGYG
Sbjct: 234 VDIPTGDEEKLMAAVATVGPVSVAIDASHESFQLYSEGVYYDPECSSEELDHGVLVVGYG 293

Query: 285 TSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
           T E G  YW++KNSWG+ WGE GY ++ R+ D     CGIA  AS+P+
Sbjct: 294 TDENGQDYWIVKNSWGESWGEQGYIKMARNRDN---NCGIATQASYPL 338


>gi|357127811|ref|XP_003565571.1| PREDICTED: fruit bromelain-like [Brachypodium distachyon]
          Length = 364

 Score =  223 bits (568), Expect = 1e-55,   Method: Compositional matrix adjust.
 Identities = 133/361 (36%), Positives = 193/361 (53%), Gaps = 39/361 (10%)

Query: 3   KYFLIVVLIISGSCASQATYRT------FDEGSIAEKFEQWKAQYGRTYKESAENSKRFE 56
           + +L+++L ++G                  E  + +++  W+A+Y +TY    E  KRF 
Sbjct: 9   RPYLVLLLCLTGVLEQALQAAAAPPSWELPESELRQRWTNWQAKYSKTYPSHEEQEKRFG 68

Query: 57  IFKDNLVAVERFN------NAAIGN-------RSYTLRLNKFADLTPQEFIASQTGFKMS 103
           +F+ N+  +  F+       A +G+        +  + +N+F DL P E +   TGF  +
Sbjct: 69  VFRGNINNIGAFSAAQTTTTAVVGSFGAPQTVTTVRVGMNRFGDLQPSEVLEQFTGFNST 128

Query: 104 DHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIK 156
               + K    P+    S+ P  V+W   GAVT VK+QG C       AVAA+EG+N I+
Sbjct: 129 VVLKTPKPTRLPY---HSRKPCCVDWRSSGAVTGVKFQGSCLSCWAFAAVAAIEGMNKIR 185

Query: 157 INRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDS 216
              LVSLSEQQLVDC  +  ++GC GG  D A   + +  GIT++  Y Y G + G C+ 
Sbjct: 186 TGTLVSLSEQQLVDC--DKGSSGCAGGRTDTALDLVAKRGGITSEEKYPYGGFN-GKCNV 242

Query: 217 IKAE-DHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA--LQFYSGGVFNGYCET- 272
            K   +HAA +  ++ VPPNDE  L  AVA QPV+V +DAS    QFYSGG+F G C T 
Sbjct: 243 DKLLFEHAAIVKGFKAVPPNDEHQLALAVAQQPVTVYVDASTWEFQFYSGGIFRGPCSTD 302

Query: 273 --FLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASF 330
              +NH VT VGY   + G K+W+ KNSW  DWG+ GY  L +D+  P G C +A    +
Sbjct: 303 PARVNHAVTIVGY-CEDFGEKFWIAKNSWSNDWGDQGYIYLAKDVAWPTGTCSLASSPFY 361

Query: 331 P 331
           P
Sbjct: 362 P 362


>gi|115436422|ref|NP_001042969.1| Os01g0347500 [Oryza sativa Japonica Group]
 gi|115436426|ref|NP_001042971.1| Os01g0348000 [Oryza sativa Japonica Group]
 gi|15290194|dbj|BAB63883.1| putative SAG12 protein [Oryza sativa Japonica Group]
 gi|15290200|dbj|BAB63889.1| putative SAG12 protein [Oryza sativa Japonica Group]
 gi|21104809|dbj|BAB93394.1| putative SAG12 protein [Oryza sativa Japonica Group]
 gi|113532500|dbj|BAF04883.1| Os01g0347500 [Oryza sativa Japonica Group]
 gi|113532502|dbj|BAF04885.1| Os01g0348000 [Oryza sativa Japonica Group]
 gi|125570283|gb|EAZ11798.1| hypothetical protein OsJ_01672 [Oryza sativa Japonica Group]
          Length = 361

 Score =  223 bits (567), Expect = 1e-55,   Method: Compositional matrix adjust.
 Identities = 133/321 (41%), Positives = 174/321 (54%), Gaps = 29/321 (9%)

Query: 34  FEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNN--------AAIGNRSYT---LRL 82
           F QW A+Y + Y    E  KR++++K N   +  F +         A   ++ T   + +
Sbjct: 47  FSQWMAKYAKHYSCPEEQEKRYQVWKGNTNFIGAFRSQTQLSSGVGAFAPQTITDSVVGM 106

Query: 83  NKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQG 142
           N+F DLT  EF+   TGF  S   S      +P  ++    P  V+W   GAVT VK+QG
Sbjct: 107 NRFGDLTSTEFVQQFTGFNASGFHSPPPTPISPHSWQ----PCCVDWRSSGAVTGVKFQG 162

Query: 143 QCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQN 195
            CA        AA+EG++ IK   LVSLSEQ +VDC T   + GC GG  D A   +   
Sbjct: 163 NCASCWAFASAAAIEGLHKIKTGELVSLSEQVMVDCDTG--SFGCSGGHSDTALNLVASR 220

Query: 196 KGITNDAVYSYEGMSTGICDSIKAE-DHAAQITNYEDVPPNDEESLLKAVANQPVSVAID 254
            GIT++  Y Y G+  G CD  K   DH+A ++ +  VPPNDE  L  AVA QPV+V ID
Sbjct: 221 GGITSEEKYPYTGVQ-GSCDVGKLLFDHSASVSGFAAVPPNDERQLALAVARQPVTVYID 279

Query: 255 ASA--LQFYSGGVFNGYCET-FLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRL 311
           ASA   QFY GGV+ G C    +NH VT VGY  +  G KYW+ KNSW  DWGE GY  L
Sbjct: 280 ASAQEFQFYKGGVYKGPCNPGSVNHAVTIVGYCENFGGEKYWIAKNSWSNDWGEQGYVYL 339

Query: 312 QRDIDQPQGQCGIAMFASFPV 332
            +D+  PQG CG+A    +P 
Sbjct: 340 AKDVWWPQGTCGLATSPFYPT 360


>gi|226503205|ref|NP_001150062.1| thiol protease SEN102 precursor [Zea mays]
 gi|195636390|gb|ACG37663.1| thiol protease SEN102 precursor [Zea mays]
          Length = 349

 Score =  223 bits (567), Expect = 1e-55,   Method: Compositional matrix adjust.
 Identities = 134/321 (41%), Positives = 191/321 (59%), Gaps = 23/321 (7%)

Query: 30  IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLT 89
           + ++F+ W+A+Y RTY    E  +RF ++ +N+  +E  N       SY L  N+FADLT
Sbjct: 33  LLDRFQAWQAEYNRTYATPEEFQQRFMVYSENVKFIETMNQPG---SSYELGENRFADLT 89

Query: 90  PQEFIASQTGFKMSDHSSSLKA----------NGTPFLYKSSQVPPSVNWIEKGAVTPVK 139
            +EF  +    K+ + +SS +A           GT     +++ P SV+W  KGAVTPVK
Sbjct: 90  EEEFKDTYL-MKLDNVASSPEAMALTVDTMNRAGTSGGSNTNEAPNSVDWRTKGAVTPVK 148

Query: 140 YQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYI 192
            Q  C       AVA++EG++ IK   LVSLSEQ++VDC    NN+GC+GG    A +++
Sbjct: 149 SQQHCGSCWAFAAVASIEGVHKIKTGLLVSLSEQEIVDCDRGGNNHGCHGGHSSSAMEWV 208

Query: 193 IQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVA 252
            +N G+T ++ Y Y G   G C S K   HAA+I   + V   +E +L  AVA +PV+V+
Sbjct: 209 TRNGGLTTESDYPYVGRQ-GQCMSDKLGHHAAKIRGRQAVQGKNEGALQHAVAGRPVAVS 267

Query: 253 IDAS-ALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRL 311
           I+AS A QFY  G+F+G C T  NH VT VGYG +  G KYW++KNSWG+ WGE GY R+
Sbjct: 268 INASRAFQFYKRGIFSGPCNTTRNHAVTVVGYGANASGHKYWIVKNSWGERWGEKGYVRM 327

Query: 312 QRDIDQPQGQCGIAMFASFPV 332
           QR +   +G CGIA+   + V
Sbjct: 328 QRGVRAREGVCGIAIAPFYAV 348


>gi|195429415|ref|XP_002062758.1| GK19626 [Drosophila willistoni]
 gi|194158843|gb|EDW73744.1| GK19626 [Drosophila willistoni]
          Length = 341

 Score =  223 bits (567), Expect = 1e-55,   Method: Compositional matrix adjust.
 Identities = 138/350 (39%), Positives = 200/350 (57%), Gaps = 34/350 (9%)

Query: 5   FLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVA 64
           F ++ L+I+    +QA   ++ E  + E++  +K ++ + Y +S E + R +IF +N   
Sbjct: 3   FALITLLIALVAMTQAV--SYSE-LVREEWNTFKLEHRKNYADSTEETFRMKIFNENKHH 59

Query: 65  VERFNNA-AIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDH----SSSLKANGTPFLY- 118
           + + N   A G  SY L LNK+AD+   EF  +  GF  + H    S+     G  F+  
Sbjct: 60  IAKHNQRYATGEVSYKLALNKYADMLHHEFRETMNGFNYTLHKQLRSTDESFTGVTFISP 119

Query: 119 KSSQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDC 171
           +  ++P +V+W  KGAVT VK QG C       +  A+EG +  K   LVSLSEQ LVDC
Sbjct: 120 EHVKLPTAVDWRTKGAVTEVKDQGHCGSCWAFSSTGAIEGQHFRKSGTLVSLSEQNLVDC 179

Query: 172 ATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGIC---DSIKAEDHAAQITN 228
           +T   NNGC GG MD+AF+Y+  N GI  +  Y+YEG+        +SI A D       
Sbjct: 180 STKYGNNGCNGGLMDNAFRYVKDNGGIDTEKSYAYEGIDDSCHFDKNSIGATDRG----- 234

Query: 229 YEDVPPNDEESLLKAVAN-QPVSVAIDAS--ALQFYSGGVF---NGYCETFLNHGVTAVG 282
           + D+P  +E+ L +AVA   PVSVAIDAS  + QFYS GV+   N   E  L+HGV  VG
Sbjct: 235 FADIPQGNEKKLAQAVATIGPVSVAIDASQQSFQFYSEGVYDEPNCSAEN-LDHGVLVVG 293

Query: 283 YGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
           YGT ++G  YWL+KNSWG  WG+ G+ ++ R+    + QCGIA  +S+P+
Sbjct: 294 YGTEKDGSDYWLVKNSWGTTWGDKGFIKMSRN---KENQCGIASASSYPL 340


>gi|320164780|gb|EFW41679.1| cathepsin L2 [Capsaspora owczarzaki ATCC 30864]
          Length = 334

 Score =  223 bits (567), Expect = 1e-55,   Method: Compositional matrix adjust.
 Identities = 136/315 (43%), Positives = 181/315 (57%), Gaps = 25/315 (7%)

Query: 33  KFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQE 92
           +FE WK  +G++Y ++ E   R  +++ N + V+  N A I   SYTL +N FADLT +E
Sbjct: 29  EFEAWKRTFGKSYSDAVEEINRRAVWEANKMLVDAHNGAGI--HSYTLGMNIFADLTHEE 86

Query: 93  FIASQTGFKMSDHSSSLKANGTPFLYKSSQV---PPSVNWIEKGAVTPVKYQGQCA---- 145
           F     G K+  +    ++N +     ++ V   P SV+W   G VTPVK QGQC     
Sbjct: 87  FKRFYLGTKVDLNRP--RSNFSSTFIPTANVGALPDSVDWRTAGIVTPVKDQGQCGSCWS 144

Query: 146 ---VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDA 202
                +VEG +A K  +LVSLSEQ LVDC+    N GC GG MDDAF+YII NKGI  +A
Sbjct: 145 FSTTGSVEGQHARKTGQLVSLSEQNLVDCSKAQGNQGCNGGLMDDAFQYIITNKGIDTEA 204

Query: 203 VYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVAN-QPVSVAIDAS--ALQ 259
            Y Y     G C    A +  A +++++D+    E  L  AVA   PVSVAIDAS  + Q
Sbjct: 205 SYPYTAKD-GTC-KFNAANVGATLSSFQDITRGSESDLQNAVATVGPVSVAIDASKNSFQ 262

Query: 260 FYSGGVFN--GYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQ 317
            Y+ GV+N      T L+HGV A GYGTS  G  YWL+KNSWG  WG+ GY  + R+ + 
Sbjct: 263 LYTSGVYNEKKCSSTSLDHGVLAAGYGTS-NGTPYWLVKNSWGSSWGQAGYIWMSRNANN 321

Query: 318 PQGQCGIAMFASFPV 332
              QCGIA  AS+P+
Sbjct: 322 ---QCGIATSASYPI 333


>gi|4469153|emb|CAB38314.1| chymopapain isoform II [Carica papaya]
          Length = 352

 Score =  222 bits (566), Expect = 2e-55,   Method: Compositional matrix adjust.
 Identities = 137/344 (39%), Positives = 185/344 (53%), Gaps = 25/344 (7%)

Query: 5   FLIVVLIISGSCASQATYR---TFDEGSIAEK----FEQWKAQYGRTYKESAENSKRFEI 57
           FL   LII    +S   Y    + D+ +  E+    F+ W  ++ + Y+   E   RFEI
Sbjct: 12  FLATCLIIHMGLSSADFYTVGYSQDDLTSIERLIQLFDSWMLKHNKIYESIDEKIYRFEI 71

Query: 58  FKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFL 117
           F+DNL+ ++  N     N SY L LN FADL+  EF     GF   D +     +   F 
Sbjct: 72  FRDNLMYIDETNKK---NNSYWLGLNGFADLSNDEFKKKYVGFVAEDFTGLEHFDNEDFT 128

Query: 118 YKS-SQVPPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLV 169
           YK  +  P S++W  KGAVTPVK QG C        +A VEGIN I    L+ LSEQ+LV
Sbjct: 129 YKHVTNYPQSIDWRAKGAVTPVKNQGACGSCWAFSTIATVEGINKIVTGNLLELSEQELV 188

Query: 170 DCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNY 229
           DC  + ++ GC GG+   + +Y+  N G+    VY Y+      C +        +IT Y
Sbjct: 189 DC--DKHSYGCKGGYQTTSLQYVANN-GVHTSKVYPYQAKQYK-CRATDKPGPKVKITGY 244

Query: 230 EDVPPNDEESLLKAVANQPVSVAIDASA--LQFYSGGVFNGYCETFLNHGVTAVGYGTSE 287
           + VP N E S L A+ANQP+S  ++A     Q Y  GVF+G C T L+H VTAVGYGTS+
Sbjct: 245 KRVPSNCETSFLGALANQPLSFLVEAGGKPFQLYKSGVFDGPCGTKLDHAVTAVGYGTSD 304

Query: 288 EGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFP 331
            G  Y +IKNSWG +WGE GY RL+R     QG CG+   + +P
Sbjct: 305 -GKNYIIIKNSWGPNWGEKGYMRLKRQSGNSQGTCGVYKSSYYP 347


>gi|225706370|gb|ACO09031.1| Cathepsin L precursor [Osmerus mordax]
          Length = 337

 Score =  222 bits (565), Expect = 2e-55,   Method: Compositional matrix adjust.
 Identities = 135/349 (38%), Positives = 193/349 (55%), Gaps = 30/349 (8%)

Query: 1   MAKYFLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKD 60
           M  Y +++VL    + A+      FDE      ++ WK+ + + Y+   E   R  +++ 
Sbjct: 1   MTLYLVVLVLCTGAALAAPRFDAQFDE-----HWDLWKSWHSKNYQHEKEEGWRRMVWEK 55

Query: 61  NLVAVERFN-NAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYK 119
           NL  +E  N   ++G  SY+L +N F D+T +EF     G+K+       K  G+ FL  
Sbjct: 56  NLKKIEMHNLEHSLGKHSYSLGMNHFGDMTNEEFRQVMNGYKLQQR----KFKGSLFLEP 111

Query: 120 SS-QVPPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDC 171
           ++ + P  V+W E+G VTPVK QGQC          A+EG    K  +LVSLSEQ LVDC
Sbjct: 112 NNMEAPKQVDWREEGYVTPVKDQGQCGSCWAFSTTGAMEGQMFRKTQKLVSLSEQNLVDC 171

Query: 172 ATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYED 231
           +  + N GC GG MD AF+YI  N G+ ++  Y Y G     C+  KAE  AA  T + D
Sbjct: 172 SRPEGNEGCNGGLMDQAFQYIQDNSGLDSEEAYPYLGTDDQPCN-YKAEFSAANDTGFMD 230

Query: 232 VPPNDEESLLKAVAN-QPVSVAIDA--SALQFYSGGV-FNGYCET-FLNHGVTAVGYGTS 286
           +P   E +L+KA+A+  PVSVAIDA   + QFY  G+ +   C +  L+HGV AVGYG  
Sbjct: 231 IPSGKEHALMKAIASVGPVSVAIDAGHESFQFYQSGIYYEKECSSEELDHGVLAVGYGFE 290

Query: 287 EE---GIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
            E   G KYW++KNSW + WG+ GY  + +D    +  CGIA  AS+P+
Sbjct: 291 GEDVDGKKYWIVKNSWSEKWGDKGYILMAKD---RKNHCGIATAASYPL 336


>gi|326514800|dbj|BAJ99761.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 291

 Score =  222 bits (565), Expect = 2e-55,   Method: Compositional matrix adjust.
 Identities = 126/229 (55%), Positives = 153/229 (66%), Gaps = 13/229 (5%)

Query: 123 VPPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATND 175
           VP SV+W +KGAVT VK QGQC        +AAVEGINAI+   L SLSEQQLVDC T  
Sbjct: 61  VPSSVDWRQKGAVTAVKDQGQCGSCWAFSTIAAVEGINAIRTKNLTSLSEQQLVDCDTK- 119

Query: 176 NNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPN 235
           +N GC GG MD AF+YI ++ G+  +  Y Y+      C+  K       I  YEDVP N
Sbjct: 120 SNAGCNGGLMDYAFQYIAKHGGVAAEDAYPYKARQASSCN--KKPSAVVTIDGYEDVPAN 177

Query: 236 DEESLLKAVANQPVSVAIDASA--LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYW 293
           DE +L KAVA QPV+VAI+AS    QFYS GVF G C T L+HGV AVGYGT+ +G KYW
Sbjct: 178 DETALKKAVAAQPVAVAIEASGSHFQFYSEGVFAGKCGTELDHGVAAVGYGTTVDGTKYW 237

Query: 294 LIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVSKESAQPSSA 342
           ++KNSWG +WGE GY R++RD++  +G CGIAM AS+PV K S  P  A
Sbjct: 238 IVKNSWGPEWGEKGYIRMKRDVEDKEGLCGIAMEASYPV-KTSTNPKHA 285


>gi|310975577|gb|ADP55137.1| cathepsin S [Miichthys miiuy]
          Length = 338

 Score =  222 bits (565), Expect = 2e-55,   Method: Compositional matrix adjust.
 Identities = 135/341 (39%), Positives = 191/341 (56%), Gaps = 24/341 (7%)

Query: 5   FLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVA 64
            ++  L++   CA  A    FD   +   +E WK  +G+TY+   E+  R E+++ NLV 
Sbjct: 8   LVLGSLLLFSLCAGAAA--MFDS-KLDGHWELWKKMHGKTYRNYVEDESRRELWEKNLVL 64

Query: 65  VERFN-NAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKS-SQ 122
           +   N  A++G  +Y L +N   DLTP+E + S   F      + ++   +PF   S + 
Sbjct: 65  ITMHNLEASMGLHTYKLSMNHMGDLTPEEIMQS---FATLTPPTDIQRAPSPFAGTSGAA 121

Query: 123 VPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATND 175
           VP +++W EKG VT VK QG C       A  A+EG  A    +LV LS Q LVDC+T  
Sbjct: 122 VPDTMDWREKGCVTSVKMQGACGSCWAFSAAGALEGQLAKTTGKLVDLSPQNLVDCSTKY 181

Query: 176 NNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPN 235
            N+GC GGFM  AF+Y+I N GI +DA Y Y G  +  C     +  AA  + Y  +P  
Sbjct: 182 GNHGCNGGFMHKAFQYVIDNHGIDSDAAYPYTGRQSQEC-HYSPKFRAANCSQYSFLPEG 240

Query: 236 DEESLLKAVAN-QPVSVAIDA--SALQFYSGGVFNG-YCETFLNHGVTAVGYGTSEEGIK 291
           DE +L +A+A   P+SVAIDA      FYS GV++   C   +NHGV AVGYGT   G  
Sbjct: 241 DEGALKQALATIGPISVAIDARRPRFAFYSSGVYDDPSCSQDVNHGVLAVGYGTL-NGQD 299

Query: 292 YWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
           YWL+KNSWGQ +G++GY R+ R+ +    QCGIA +  +P+
Sbjct: 300 YWLVKNSWGQTFGDNGYIRMARNKND---QCGIARYGCYPI 337


>gi|356545071|ref|XP_003540969.1| PREDICTED: LOW QUALITY PROTEIN: vignain-like [Glycine max]
          Length = 317

 Score =  222 bits (565), Expect = 2e-55,   Method: Compositional matrix adjust.
 Identities = 132/300 (44%), Positives = 171/300 (57%), Gaps = 40/300 (13%)

Query: 17  ASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNR 76
           ASQ T RT  + S+ E+ E+W ++YG+ YK+  E  KRF IFK+N+  +E   NAAI  +
Sbjct: 5   ASQVTCRTLQDASMYERHEEWMSRYGKVYKDPWEREKRFRIFKENMNYIETSKNAAI--K 62

Query: 77  SYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVT 136
            Y L +N+FADL  +EFIA Q  FK           G       S+           AVT
Sbjct: 63  PYKLVINQFADLNNEEFIAPQNIFK-----------GMIICRLLSR-----------AVT 100

Query: 137 PVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAF 189
           PVK QG C        VA+ EGI A+   +L+SLSEQ+LVDC T   + GC G  MDDAF
Sbjct: 101 PVKDQGHCGFCWAFYDVASTEGILALTAGKLISLSEQELVDCDTKGVDQGCEGDLMDDAF 160

Query: 190 KYIIQNKGITNDAVYSYEGMST----GICDSIKAEDHAAQITNYEDVPPNDEESLLKAVA 245
              +    ++N +    E        G C++ +  + A  IT  EDVP N+E++L K VA
Sbjct: 161 FMAVT---LSNSSFKILESRCQLGVDGKCNANEEVNPATTITGXEDVPANNEKALQKVVA 217

Query: 246 NQPVSVAIDA--SALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDW 303
           NQPVS+AIDA  S  QFY  GVF G C T L+HGVT VGYG S +G +YWL+KNSW  +W
Sbjct: 218 NQPVSIAIDACDSDFQFYKRGVFTGSCGTELDHGVTIVGYGVSHDGTQYWLVKNSWETEW 277


>gi|325303202|tpg|DAA34687.1| TPA_inf: cathepsin L-like cysteine proteinase B [Amblyomma
           variegatum]
          Length = 337

 Score =  222 bits (565), Expect = 2e-55,   Method: Compositional matrix adjust.
 Identities = 134/344 (38%), Positives = 196/344 (56%), Gaps = 27/344 (7%)

Query: 5   FLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVA 64
           F+++  + +   A+  T++      +  ++  +KA +G+ Y+   E   R +I+ +N + 
Sbjct: 4   FVVLCFLCAAMTAAAITHQEL----VGAEWSAFKALHGKEYQSETEEYYRLKIYMENRMM 59

Query: 65  VERFNNAAIGNR-SYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANG---TPFLYKS 120
           + R N     N+ SY L +N++ D+   EF++++ GF+  D+ S  +       P   + 
Sbjct: 60  IARHNEKYANNKVSYKLAMNEYGDMLHHEFVSTRNGFR-RDYRSKPRQGSFYIEPEGIED 118

Query: 121 SQVPPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCAT 173
             +P +V+W +KGAVTPVK QGQC          ++EG +  K   +VSLSEQ LVDC+T
Sbjct: 119 KHLPKTVDWRKKGAVTPVKNQGQCGSCWAFSTTGSLEGQHFRKSGDMVSLSEQNLVDCST 178

Query: 174 NDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVP 233
              NNGC GG MD+AFKYI  N GI  +  Y Y G + G C   K  D  A  T + D+P
Sbjct: 179 AFGNNGCEGGLMDNAFKYIKANGGIDTEKSYPYNG-TDGTC-HFKKSDVGATDTGFVDIP 236

Query: 234 PNDEESLLKAVAN-QPVSVAIDAS--ALQFYSGGVFN-GYCET-FLNHGVTAVGYGTSEE 288
             +E  L KAVA   P+SVAIDAS  + QFYS GV++   C +  L+HGV  VGYGT ++
Sbjct: 237 EGNEHLLKKAVATVGPISVAIDASHQSFQFYSQGVYDEPECSSENLDHGVLVVGYGTKDD 296

Query: 289 GIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
              YWL+KNSWG  WG+ GY  + R+ D    QCGIA  AS+P+
Sbjct: 297 -QDYWLVKNSWGTTWGDGGYIYMTRNKDN---QCGIASSASYPL 336


>gi|16304178|gb|AAL16954.1|AF426414_1 cathepsin L-like cysteine protease precursor [Delia radicum]
          Length = 337

 Score =  222 bits (565), Expect = 3e-55,   Method: Compositional matrix adjust.
 Identities = 135/320 (42%), Positives = 187/320 (58%), Gaps = 23/320 (7%)

Query: 30  IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNA-AIGNRSYTLRLNKFADL 88
           I E+++ +K ++ + +    E   R +IF +N   + + N   A G  S+ L LNK++D+
Sbjct: 23  IKEEWQTFKMEHRKNFLSEVEERFRMKIFNENRHKIAKHNQLYAQGKVSFKLGLNKYSDM 82

Query: 89  TPQEFIASQTGFKMSDHSSSLKANG-TPFLY---KSSQVPPSVNWIEKGAVTPVKYQGQC 144
              EF  +  G+  +     L+A G +  +Y    + Q+P SV+W + GAVT VK QG C
Sbjct: 83  LYHEFKETMNGYNHT-MRKVLRAQGFSGIIYIPPANVQIPKSVDWRQHGAVTAVKDQGHC 141

Query: 145 -------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKG 197
                  + AA+EG +  K   LVSLSEQ LVDC+T   NNGC GG MD+AF+YI  N G
Sbjct: 142 GSCWAFSSTAALEGQHFRKAGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKDNGG 201

Query: 198 ITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQ-PVSVAIDAS 256
           I  +  Y YEG+    C   K+   A   T + D+P  DEE+L+KAVA   PVSVAIDAS
Sbjct: 202 IDTEKSYPYEGIDDS-CHFTKSGVGATD-TGFVDIPQGDEEALMKAVATMGPVSVAIDAS 259

Query: 257 --ALQFYSGGVFN-GYCET-FLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQ 312
             + Q YS GV+N   C+   L+HGV  VGYGT + G+ YWL+KNSWG  WG+ GY ++ 
Sbjct: 260 HESFQLYSEGVYNEPECDAQNLDHGVLVVGYGTDKTGLDYWLVKNSWGTTWGDQGYIKMA 319

Query: 313 RDIDQPQGQCGIAMFASFPV 332
           R+ D    QCGIA  +S+P 
Sbjct: 320 RNQDN---QCGIATASSYPT 336


>gi|359483753|ref|XP_002266308.2| PREDICTED: oryzain alpha chain-like [Vitis vinifera]
          Length = 501

 Score =  221 bits (564), Expect = 3e-55,   Method: Compositional matrix adjust.
 Identities = 139/357 (38%), Positives = 196/357 (54%), Gaps = 40/357 (11%)

Query: 6   LIVVLIISGSCASQATYRTF--------DEGSIAEKFEQWKAQYGRTYKESAENSKRFEI 57
           L++ +  S +C S +    F         E  + E F  WK ++ R YK + E +KRFEI
Sbjct: 10  LVLFIWASLACLSSSLPTEFYITGEEFASEERVRELFHLWKERHKRVYKHAEETAKRFEI 69

Query: 58  FKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSD--------HSSSL 109
           FK+NL  V   N+   G+R +TL +NKFAD++ +EF                     S  
Sbjct: 70  FKENLKYVIERNSK--GHR-HTLGMNKFADMSNEEFKEKYLSKIKKPINKKNNYLRRSMQ 126

Query: 110 KANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVS 162
           +  GT     S + P S++W +KG VT +K QG C       +  A+EGINAI    L+S
Sbjct: 127 QKKGTA----SCEAPSSLDWRKKGVVTGIKDQGDCGSCWAFSSTGAMEGINAIVTGDLIS 182

Query: 163 LSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDH 222
           LSEQ+LVDC T   N GC GG+MD AF+++I N GI +++ Y Y G + G C++ K +  
Sbjct: 183 LSEQELVDCDTT--NYGCEGGYMDYAFEWVISNGGIDSESDYPYTG-TDGTCNTTKEDTK 239

Query: 223 AAQITNYEDVPPNDEESLLKAVANQPVSVAIDASAL--QFYSGGVFNGYCETFLN---HG 277
              I  Y+DV  +D  +LL A  NQP+SV +D SAL  Q Y+ G++ G C    +   H 
Sbjct: 240 VVSIDGYKDVDESDS-ALLCAAVNQPISVGMDGSALDFQLYTSGIYAGDCSDDPDDIDHA 298

Query: 278 VTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVSK 334
           V  VGYG SE+   YW+ KNSWG  WG +GYF ++R+ D P G+C I   AS+P  +
Sbjct: 299 VLIVGYG-SEDSEDYWICKNSWGTSWGMEGYFYIKRNTDLPYGECAINAMASYPTKE 354


>gi|3097321|dbj|BAA25899.1| Bd 30K [Glycine max]
 gi|84371705|gb|ABC56139.1| 34 kDa maturing seed protein [Glycine max]
 gi|195957142|gb|ACG59282.1| major allergen Gly m Bd 30K [Glycine max]
 gi|223452512|gb|ACM89583.1| maturing seed protein [Glycine max]
 gi|226432468|gb|ACO55749.1| Gly m Bd 30K allergen [Glycine max]
 gi|320090153|gb|ADW08728.1| P34 allergen [Glycine max]
 gi|320090155|gb|ADW08729.1| P34 allergen [Glycine max]
 gi|320090157|gb|ADW08730.1| P34 allergen [Glycine max]
 gi|320090159|gb|ADW08731.1| P34 allergen [Glycine max]
          Length = 379

 Score =  221 bits (564), Expect = 3e-55,   Method: Compositional matrix adjust.
 Identities = 129/332 (38%), Positives = 180/332 (54%), Gaps = 24/332 (7%)

Query: 30  IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLT 89
           ++  F+ WK+++GR Y    E +KR EIFK+NL  +   N       S+ L LNKFAD+T
Sbjct: 40  VSSLFQLWKSEHGRVYHNHEEEAKRLEIFKNNLNYIRDMNANRKSPHSHRLGLNKFADIT 99

Query: 90  PQEFIAS--QTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQC--- 144
           PQEF     Q    +S              Y     P S +W +KG +T VKYQG C   
Sbjct: 100 PQEFSKKYLQAPKDVSQQIKMANKKMKKEQYSCDHPPASWDWRKKGVITQVKYQGGCGSG 159

Query: 145 ----AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITN 200
               A  A+E  +AI    LVSLSEQ+LVDC   + + GCY G+   +F++++++ GI  
Sbjct: 160 WAFSATGAIEAAHAIATGDLVSLSEQELVDCV--EESEGCYNGWHYQSFEWVLEHGGIAT 217

Query: 201 DAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDE-------ESLLKAVANQPVSVAI 253
           D  Y Y     G C + K +D    I  YE +  +DE       ++ L A+  QP+SV+I
Sbjct: 218 DDDYPYRA-KEGRCKANKIQDKVT-IDGYETLIMSDESTESETEQAFLSAILEQPISVSI 275

Query: 254 DASALQFYSGGVFNGYCETF---LNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFR 310
           DA     Y+GG+++G   T    +NH V  VGYG S +G+ YW+ KNSWG+DWGEDGY  
Sbjct: 276 DAKDFHLYTGGIYDGENCTSPYGINHFVLLVGYG-SADGVDYWIAKNSWGEDWGEDGYIW 334

Query: 311 LQRDIDQPQGQCGIAMFASFPVSKESAQPSSA 342
           +QR+     G CG+  FAS+P  +ES    SA
Sbjct: 335 IQRNTGNLLGVCGMNYFASYPTKEESETLVSA 366


>gi|326493706|dbj|BAJ85314.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 365

 Score =  221 bits (564), Expect = 3e-55,   Method: Compositional matrix adjust.
 Identities = 133/334 (39%), Positives = 181/334 (54%), Gaps = 32/334 (9%)

Query: 25  FDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFN----NAAIG------ 74
             E  + E+F +W  +Y + Y    E   RF++FK+N  ++ + +    N  +G      
Sbjct: 39  LPESEVRERFSKWMIKYSKHYSCKQEEEMRFQVFKNNTNSIGQLDRQNPNPGVGGALGPS 98

Query: 75  -NRSYTLR---LNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWI 130
            ++ +T +   +N+F DL+P+E I   TG     +++S +     +L   S  P  V+W 
Sbjct: 99  GSQVHTFQKVSMNRFGDLSPREVIQQYTGL----NTTSFRTASPTYLPYHSFKPCCVDWR 154

Query: 131 EKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGG 183
             GAVT VK+QG C       AVAA+EG+N I+   LVSLSEQ LVDC T   + GC GG
Sbjct: 155 SSGAVTGVKHQGTCGSCWAFAAVAAIEGMNKIRTGELVSLSEQVLVDCDTV--STGCGGG 212

Query: 184 FMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAE-DHAAQITNYEDVPPNDEESLLK 242
             D A   +    GIT++  Y Y G   G CD  K   DH A I  ++ VP N+E  L  
Sbjct: 213 HSDSAMALVAARGGITSEERYPYAGFQ-GKCDVDKLMFDHQASIKGFKAVPSNNEAQLAI 271

Query: 243 AVANQPVSVAIDAS--ALQFYSGGVFNGYCETFLNHGVTAVGYGTSE-EGIKYWLIKNSW 299
           AVA QPV+V IDAS  A QFYSGG++ G C   +NH VT VGY     EG KYW+ KNSW
Sbjct: 272 AVAMQPVTVYIDASGSAFQFYSGGIYRGPCSANVNHAVTIVGYCEGPGEGNKYWIAKNSW 331

Query: 300 GQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVS 333
             DWGE GY  L +D+    G CG+A    +P +
Sbjct: 332 SNDWGEQGYVYLAKDVAWSTGTCGLATSPFYPTA 365


>gi|146217394|gb|ABQ10739.1| cathepsin L [Penaeus monodon]
          Length = 341

 Score =  221 bits (564), Expect = 3e-55,   Method: Compositional matrix adjust.
 Identities = 132/322 (40%), Positives = 183/322 (56%), Gaps = 25/322 (7%)

Query: 30  IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNA-AIGNRSYTLRLNKFADL 88
           + E++E +K ++ + Y    E S R +IF +N   +   N   A G+ +Y L +NK+ D+
Sbjct: 25  VLEEWEAFKLEHSKKYDSEVEESFRMKIFTENKHKIANHNKGFAQGHHTYKLSMNKYGDM 84

Query: 89  TPQEFIASQTGFKMSDHSSSLKAN----GTPFLYKSS--QVPPSVNWIEKGAVTPVKYQG 142
              EF+++  GF+  +H+   K N    G  F+      Q+P +V+W  KGAVTP+K QG
Sbjct: 85  LHHEFVSTMNGFR-GNHTGGYKNNRAYTGATFIEPDDDVQLPKNVDWRTKGAVTPIKDQG 143

Query: 143 QC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQN 195
           QC       A  A+EG    K  +LVSLSEQ LVDC+    NNGC GG MD+AF+Y+ +N
Sbjct: 144 QCGSCWAFSATGALEGQTFRKTGQLVSLSEQNLVDCSRKFGNNGCNGGLMDNAFEYVKEN 203

Query: 196 KGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVAN-QPVSVAID 254
            GI  +  Y Y+        + +A    A+   + DV    E +L KAVA   PVSVAID
Sbjct: 204 GGIDTEESYPYDAEDEKCHYNPRAA--GAEDKGFVDVREGSEHALKKAVATVGPVSVAID 261

Query: 255 AS--ALQFYSGGVF-NGYCE-TFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFR 310
           AS  + QFYS GV+    C    L+HGV  VGYG  ++G  YWL+KNSWG  WG+ GY +
Sbjct: 262 ASHESFQFYSHGVYIEPECSPEMLDHGVLVVGYGIDDDGTDYWLVKNSWGTTWGDQGYVK 321

Query: 311 LQRDIDQPQGQCGIAMFASFPV 332
           + R+ D    QCGIA  ASFP+
Sbjct: 322 MARNRDN---QCGIASSASFPL 340


>gi|156398078|ref|XP_001638016.1| predicted protein [Nematostella vectensis]
 gi|156225133|gb|EDO45953.1| predicted protein [Nematostella vectensis]
          Length = 326

 Score =  221 bits (564), Expect = 3e-55,   Method: Compositional matrix adjust.
 Identities = 134/340 (39%), Positives = 195/340 (57%), Gaps = 26/340 (7%)

Query: 3   KYFLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNL 62
           K FL++ ++++   +S+     F + S   ++  WK+ +G++Y +  E   R  I++ NL
Sbjct: 2   KVFLVLCVLVA---SSRGWSVRFGQDS---EWVAWKSYHGKSYSDVHEERTRMAIWQQNL 55

Query: 63  VAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQ 122
             ++R N     + SY + +N   DLT  EF     G + + H+S+ +   T     + +
Sbjct: 56  EKIKRHNAE---DHSYKMAMNHLGDLTEDEFRYFYLGVR-AHHNSTKRGWATYMPPSNVK 111

Query: 123 VPPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATND 175
           +P SV+W +KG VT VK QGQC          +VEG +  K   LVSLSEQ L+DC+ + 
Sbjct: 112 IPSSVDWSQKGYVTGVKNQGQCGSCWAFSTTGSVEGQHFRKTGSLVSLSEQNLIDCSGSY 171

Query: 176 NNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPN 235
            NNGC GG MD+AF+YI  N GI  ++ Y Y G   G C    +    A++T Y+D+P  
Sbjct: 172 GNNGCQGGLMDNAFRYIESNGGIDTESSYPYLGQQ-GSCH-FSSSHVGARVTGYQDIPQG 229

Query: 236 DEESLLKAVAN-QPVSVAIDASALQFYSGGVF-NGYC-ETFLNHGVTAVGYGTSEEGIKY 292
            E++L  AVA   PVSVA+DAS  QFYS GV+ N YC  T L+HGV  +GYG +  G  Y
Sbjct: 230 SEQALQSAVATVGPVSVAVDASQWQFYSSGVYDNPYCSSTQLDHGVLVIGYG-NYNGQDY 288

Query: 293 WLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
           WL+KNSWG  WG +GY  + R+ +    QCGIA  AS+P+
Sbjct: 289 WLVKNSWGYSWGVEGYIMMSRNKNN---QCGIASSASYPL 325


>gi|260516678|gb|ACX43965.1| cysteine protease 1 [Brachiaria hybrid cultivar]
          Length = 338

 Score =  221 bits (564), Expect = 3e-55,   Method: Compositional matrix adjust.
 Identities = 136/314 (43%), Positives = 180/314 (57%), Gaps = 25/314 (7%)

Query: 30  IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLT 89
           + + F  +  QY + Y   AE S RF  FK N+  +   N  A  N SYT+ LN+FADL+
Sbjct: 38  LQDMFTAFMKQYSKAYSH-AEFSSRFNQFKANVETIRLHNTLA--NASYTMGLNEFADLS 94

Query: 90  PQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQC----- 144
            +EF     G+K   H     A       +    P S++W    AVTP+K QGQC     
Sbjct: 95  FEEFKGKYFGYK---HVEREFARSNNLHQEVEAAPTSIDWRTSNAVTPIKDQGQCGSCWA 151

Query: 145 --AVAAVEGINAIK-INRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITND 201
             A  ++EG   ++  + L SLSEQQLVDC+T+  + GC GG MD AF+YII NKGI  +
Sbjct: 152 FSATGSIEGAWVLQGKHTLTSLSEQQLVDCSTSYGDAGCNGGLMDYAFEYIIANKGICAE 211

Query: 202 AVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVAN-QPVSVAIDA--SAL 258
           + Y Y+G+  G+C   K+      I+ Y+DV   DE SLL AV    PVSVAI+A  +  
Sbjct: 212 SAYPYKGVG-GLCQ--KSCTKVVTISGYKDVASGDEASLLNAVGTVGPVSVAIEADQAGF 268

Query: 259 QFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQP 318
           QFYS GVF+G C   L+HGV AVGYGT+     YW++KNSWG  WGE GY R+ R+    
Sbjct: 269 QFYSSGVFSGTCGHNLDHGVLAVGYGTTGSQ-DYWIVKNSWGTSWGESGYIRMIRN---- 323

Query: 319 QGQCGIAMFASFPV 332
           + QCGIA+  S+P 
Sbjct: 324 KNQCGIAIQPSYPT 337


>gi|334332718|ref|XP_001367502.2| PREDICTED: cathepsin L1-like [Monodelphis domestica]
          Length = 333

 Score =  221 bits (564), Expect = 3e-55,   Method: Compositional matrix adjust.
 Identities = 139/347 (40%), Positives = 197/347 (56%), Gaps = 32/347 (9%)

Query: 1   MAKYFLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKD 60
           M  Y  +  L +  + A     R  D      ++ QWKAQ+G++Y E+ E+S R   ++ 
Sbjct: 1   MNFYLCLASLCLGLAAAIPPFDRALDS-----QWHQWKAQHGKSY-EANEDSLRRATWEK 54

Query: 61  NLVAVERFNNA-AIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYK 119
           NL  +ER N   + G  S+ LR+NKF D++ +EF     G+K   + S  +  G+  LY+
Sbjct: 55  NLKMIERHNQEYSAGKHSFQLRMNKFGDMSTEEFKQVMNGYK--SNGSQRRTKGS--LYR 110

Query: 120 SS---QVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLV 169
            S   Q+P SV+W EKG VTPVK QG C       AV A+EG    K  +LVSLS Q L+
Sbjct: 111 ESLLAQLPESVDWREKGYVTPVKEQGDCGACWSFSAVGAIEGQWFRKTGKLVSLSIQNLI 170

Query: 170 DCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNY 229
           DC   + NNGC GGFMD+AF+Y+  N GI  +  Y Y    T      K E   A IT +
Sbjct: 171 DCTIPEGNNGCDGGFMDNAFQYVQDNGGIDTEECYPYVAQDTEC--KYKPECSGANITGF 228

Query: 230 EDVPPNDEESLLKAVAN-QPVSVAIDAS--ALQFYSGGV-FNGYCETF-LNHGVTAVGYG 284
            D+P  DE +L++AVA   P+SV ID++  + +FY  GV +   C +  L+HGV  VGYG
Sbjct: 229 VDIPSMDERALMEAVATVGPISVGIDSANPSFKFYQSGVYYEPDCSSSQLDHGVLVVGYG 288

Query: 285 TSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFP 331
           +  +  +YW++KNSWG+ WG++GY  + +D D     CGIA  AS+P
Sbjct: 289 SIGKD-EYWIVKNSWGEAWGDNGYILMAKDKDN---HCGIATEASYP 331


>gi|413953051|gb|AFW85700.1| hypothetical protein ZEAMMB73_033873 [Zea mays]
          Length = 359

 Score =  221 bits (564), Expect = 3e-55,   Method: Compositional matrix adjust.
 Identities = 141/361 (39%), Positives = 202/361 (55%), Gaps = 34/361 (9%)

Query: 1   MAKYFLIVVLIISGSCASQATYRTFDEGSIA----EKFEQWKAQYGRTYKESAENSKRFE 56
           MA     + L++  +C+       F + +IA    E+F+ W+A+Y RTY    E  +RF 
Sbjct: 3   MATASASLALVMLFACSLLLAGTAFSDDTIAIPLLERFKAWQAEYNRTYATPEEFQQRFM 62

Query: 57  IFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPF 116
           ++ +NL  ++  N  + G+ SY L  N+F DLT +EF   +  + M        A   P 
Sbjct: 63  VYSENLRFIKTMNQLSTGS-SYELGENQFTDLTEEEF---KDTYLMKLDEQPPAAEAMPP 118

Query: 117 LY------------KSSQVPPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKI 157
           +              + + P SV+W  KGAVTPVK Q QC        VA++EG++ IK 
Sbjct: 119 IVGTMSTAGMSNGDNTGEAPNSVDWRTKGAVTPVKNQQQCGSCWAFATVASIEGVHQIKT 178

Query: 158 NRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSI 217
            RLVSLSEQ++VDC    N++GC GG+   A +++ +N G+T ++ Y Y G S   C S 
Sbjct: 179 GRLVSLSEQEIVDCDRGGNDHGCRGGYPRSAMEWVTRNGGLTTESDYPYVG-SQRQCMSG 237

Query: 218 KAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDAS-ALQFYSGGVFNGYCE-TFLN 275
           K   HAA+I  Y+ V   +E  L +AVA +PV+V IDAS A QFY  GVF+G C  T +N
Sbjct: 238 KLGHHAARIRGYQAVQRKNEAELERAVAGRPVAVVIDASRAFQFYKRGVFSGPCNTTTVN 297

Query: 276 HGVTAVGYGTSEEGI----KYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFP 331
           H VT VGYG++        KYW++KNSWGQ WGE+GY R+ R +   +G C IA+   +P
Sbjct: 298 HAVTVVGYGSAGSDSGGGRKYWIVKNSWGQRWGENGYVRMARRVRAREGMCAIAIEPYYP 357

Query: 332 V 332
           V
Sbjct: 358 V 358


>gi|8886940|gb|AAF80626.1|AC069251_19 F2D10.37 [Arabidopsis thaliana]
          Length = 315

 Score =  221 bits (563), Expect = 4e-55,   Method: Compositional matrix adjust.
 Identities = 125/268 (46%), Positives = 162/268 (60%), Gaps = 15/268 (5%)

Query: 30  IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLT 89
           + E FE W + + + Y+   E   RFE+FKDNL  ++  N      +SY L LN+FADL+
Sbjct: 47  LIELFENWISNFEKAYETVEEKFLRFEVFKDNLKHIDETNKKG---KSYWLGLNEFADLS 103

Query: 90  PQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQ-VPPSVNWIEKGAVTPVKYQGQCA--- 145
            +EF     G K        + +   F Y+  + VP SV+W +KGAV  VK QG C    
Sbjct: 104 HEEFKKMYLGLKTDIVRRDEERSYAEFAYRDVEAVPKSVDWRKKGAVAEVKNQGSCGSCW 163

Query: 146 ----VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITND 201
               VAAVEGIN I    L +LSEQ+L+DC T   NNGC GG MD AF+YI++N G+  +
Sbjct: 164 AFSTVAAVEGINKIVTGNLTTLSEQELIDCDTT-YNNGCNGGLMDYAFEYIVKNGGLRKE 222

Query: 202 AVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA--LQ 259
             Y Y  M  G C+  K E     I  ++DVP NDE+SLLKA+A+QP+SVAIDAS    Q
Sbjct: 223 EDYPY-SMEEGTCEMQKDESETVTINGHQDVPTNDEKSLLKALAHQPLSVAIDASGREFQ 281

Query: 260 FYSGGVFNGYCETFLNHGVTAVGYGTSE 287
           FYSGGVF+G C   L+HGV AVGYG+S+
Sbjct: 282 FYSGGVFDGRCGVDLDHGVAAVGYGSSK 309


>gi|238816977|gb|ACR56863.1| cathepsin L-like cysteine proteinase [Delia coarctata]
          Length = 338

 Score =  221 bits (563), Expect = 4e-55,   Method: Compositional matrix adjust.
 Identities = 134/321 (41%), Positives = 187/321 (58%), Gaps = 24/321 (7%)

Query: 30  IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNA-AIGNRSYTLRLNKFADL 88
           I E+++ +K ++ + Y    E   R +IF +N   + + N   A G  S+ L LNK+AD+
Sbjct: 23  IKEEWQTFKMEHRKNYLSEVEERFRMKIFNENRHKIAKHNQLYAQGKVSFKLGLNKYADM 82

Query: 89  TPQEFIASQTGFKMSDHSSSLKA----NGTPFLYKSS-QVPPSVNWIEKGAVTPVKYQGQ 143
              EF  +  G+  +     L+A    NG  ++  ++ QVP +V+W + GAVT VK QG 
Sbjct: 83  LHHEFKETMNGYNHT-MRKELRAQEGFNGITYISPANVQVPKAVDWRQHGAVTSVKDQGH 141

Query: 144 C-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNK 196
           C       +  ++EG +  K   LVSLSEQ LVDC+T   NNGC GG MD+AF+YI  N 
Sbjct: 142 CGSCWSFSSTGSLEGQHFRKAGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKDNG 201

Query: 197 GITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQ-PVSVAIDA 255
           G+  +  Y YEG+    C   KA   A   T + D+P  DEE+++KAVA   PV+VAIDA
Sbjct: 202 GVDTEKSYPYEGIDDS-CHFNKATVGATD-TGFVDIPQGDEEAMMKAVATMGPVAVAIDA 259

Query: 256 S--ALQFYSGGVFNG-YCET-FLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRL 311
           S  + Q YS GV+N   C +  L+HGV  VGYGT ++G  YWL+KNSWG  WG+ GY ++
Sbjct: 260 SNESFQLYSEGVYNDPNCSSDNLDHGVLVVGYGTDKDGQDYWLVKNSWGTTWGDQGYIKM 319

Query: 312 QRDIDQPQGQCGIAMFASFPV 332
            R+ D    QCGIA  +SFP 
Sbjct: 320 ARNQDN---QCGIATASSFPT 337


>gi|52076128|dbj|BAD46641.1| putative cysteine proteinase [Oryza sativa Japonica Group]
 gi|52076135|dbj|BAD46648.1| putative cysteine proteinase [Oryza sativa Japonica Group]
          Length = 374

 Score =  221 bits (563), Expect = 4e-55,   Method: Compositional matrix adjust.
 Identities = 141/347 (40%), Positives = 189/347 (54%), Gaps = 34/347 (9%)

Query: 27  EGSIAEKFEQWKAQYG---RTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLN 83
           E S+   +++W+  YG    + ++ A+   RFE+FK N   +  FN       SY L LN
Sbjct: 36  EESMWSLYQRWRHVYGAASSSPRDLADKGSRFEVFKKNARYIHDFNRKK--GMSYKLGLN 93

Query: 84  KFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYK-SSQVPPSVNWIEKGAVTPVKYQG 142
           KFADLT +EF A  TG      +      G+P L   +   PP+ +W E GAVT VK QG
Sbjct: 94  KFADLTLEEFTAKYTGANPGPITGLKNGTGSPPLAAVAGDAPPAWDWREHGAVTRVKDQG 153

Query: 143 QCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQN 195
            C        V AVEGINAI    L++LSEQQ++DC+   +   C GG+   AF Y + N
Sbjct: 154 PCGSCWAFSVVEAVEGINAIMTGNLLTLSEQQVLDCSGAGD---CSGGYTSYAFDYAVSN 210

Query: 196 KGITNDAVYS-------------YEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLK 242
            GIT D  +S             YE +    C     +    +I +Y  V PNDEE+L +
Sbjct: 211 -GITLDQCFSPPTTGENYFYYPAYEAVQE-PCRFDPNKAPIVKIDSYSFVDPNDEEALKQ 268

Query: 243 AVANQ-PVSVAIDAS-ALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWG 300
           AV +Q PVSV I+AS     Y GGVF+G C T LNH V  VGY  +E+G  YW++KNSWG
Sbjct: 269 AVYSQGPVSVLIEASYEFMIYQGGVFSGPCGTELNHAVLVVGYDETEDGTPYWIVKNSWG 328

Query: 301 QDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVSKESAQPSSADKSSA 347
             WGE GY R+ R+I  P+G CGIAM+  +P+ K    P +A  ++A
Sbjct: 329 AGWGESGYIRMIRNIPAPEGICGIAMYPIYPI-KSCPCPITAASAAA 374


>gi|302781881|ref|XP_002972714.1| hypothetical protein SELMODRAFT_98707 [Selaginella moellendorffii]
 gi|300159315|gb|EFJ25935.1| hypothetical protein SELMODRAFT_98707 [Selaginella moellendorffii]
          Length = 446

 Score =  221 bits (563), Expect = 4e-55,   Method: Compositional matrix adjust.
 Identities = 122/317 (38%), Positives = 183/317 (57%), Gaps = 19/317 (5%)

Query: 30  IAEKFEQWKAQYGRTYKES-AENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADL 88
           ++ ++  W A++G+    S +   +RFE FK+N   +E  N A  G  SY L LN+F+DL
Sbjct: 9   LSGEYASWCAKFGKECASSNSLGDRRFETFKENFRYIEEHNRA--GKHSYRLGLNQFSDL 66

Query: 89  TPQEFIASQTGFKMSDHSSSL----KANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQC 144
           T +EF     G +     S +    + +     +++  +P SV+W + GAVT  K QG C
Sbjct: 67  TSEEFRQRFLGLRPDLIDSPVLKMPRDSDIEEGFQNVDLPASVDWRKHGAVTAPKDQGSC 126

Query: 145 A-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKG 197
                     A+EGIN I   +L+SLSEQ+L+DC     + GC GG M++A+++I++N G
Sbjct: 127 GGCWAFATTGAIEGINQIVTGQLMSLSEQELIDC-DKKADKGCDGGLMENAYQFIVENGG 185

Query: 198 ITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA 257
           +  +  Y Y   S   C+  K       I  YE +P  DE++LL+AVA QPVSVAI+ ++
Sbjct: 186 LDTETDYPYHA-SESHCNMKKLNSRVVAIDGYEAIPDGDEQALLRAVAKQPVSVAIEGAS 244

Query: 258 --LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDI 315
              Q Y+ GVF G+C   +NHGV  VGYGT E+G+ YW++KNSW   WG+ G+ ++QR+ 
Sbjct: 245 KDFQHYASGVFTGHCGEEINHGVLIVGYGT-EDGLDYWIVKNSWAATWGDGGFVKMQRNT 303

Query: 316 DQPQGQCGIAMFASFPV 332
            +  G C I   AS+PV
Sbjct: 304 GKRGGLCSINTLASYPV 320


>gi|402770499|gb|AFQ98384.1| cathepsin L, partial [Hyalomma anatolicum anatolicum]
          Length = 312

 Score =  221 bits (563), Expect = 4e-55,   Method: Compositional matrix adjust.
 Identities = 136/320 (42%), Positives = 185/320 (57%), Gaps = 28/320 (8%)

Query: 30  IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFN-NAAIGNRSYTLRLNKFADL 88
           +  ++E +K  + ++Y+   E   R++IF +N + + + N   A G  SY L +N+F DL
Sbjct: 3   LRTQWEAFKTTHKKSYQSKMEELLRYKIFTENSLLIAKHNAKYAKGLVSYKLGMNQFGDL 62

Query: 89  TPQEFIASQTGFKMSDHSSSLKANGTPFL----YKSSQVPPSVNWIEKGAVTPVKYQGQC 144
            P EF     G+         K  G+ FL       S +P +V+W +KGAVTPVK QGQC
Sbjct: 63  LPHEFAKMFNGYH-----GERKGRGSTFLPPANVNDSSLPKTVDWRKKGAVTPVKDQGQC 117

Query: 145 -------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKG 197
                  A  ++EG + +K  +LVSLSEQ L+DC+ +  N GC GG MD+AFKYI  N G
Sbjct: 118 GSCWAFSATGSLEGQHFLKSGKLVSLSEQNLIDCSGSFGNEGCGGGLMDNAFKYIKANDG 177

Query: 198 ITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVAN-QPVSVAIDA- 255
           I  +  Y YE M  G C   K ED  A  T + D+    E+ L KAVA   P+SVAIDA 
Sbjct: 178 IDTEESYPYEAMD-GDC-RFKKEDVGATDTGFVDIQQGSEDDLQKAVATVGPISVAIDAS 235

Query: 256 -SALQFYSGGVFN-GYCET-FLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQ 312
            S+ Q YS GV++   C +  L+HGV AVGYG  + G KYWL+KNSW + WG++GY  + 
Sbjct: 236 HSSFQLYSEGVYDEPNCSSEELDHGVLAVGYGV-KNGKKYWLVKNSWAETWGDNGYILMS 294

Query: 313 RDIDQPQGQCGIAMFASFPV 332
           RD D    QCGIA  AS+P+
Sbjct: 295 RDKDN---QCGIASSASYPL 311


>gi|156142226|gb|ABU51882.1| ervatamin-C precursor [Tabernaemontana divaricata]
          Length = 365

 Score =  221 bits (563), Expect = 4e-55,   Method: Compositional matrix adjust.
 Identities = 142/358 (39%), Positives = 196/358 (54%), Gaps = 45/358 (12%)

Query: 1   MAKYFLIVVLIISGSCA-------------SQATYRTFDEGSIAEKFEQWKAQYGRTYKE 47
           M+  F+I +L+   S +               + +RT +E  + E +E W A++ + Y  
Sbjct: 1   MSTLFIISILLFLASFSYAMDISTIEYKYDKSSAWRTDEE--VKEIYELWLAKHDKVYSG 58

Query: 48  SAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSS 107
             E  KRFEIFKDNL  ++  N+    N +Y + L  + DLT +EF A   G + SD   
Sbjct: 59  LVEYEKRFEIFKDNLKFIDEHNSE---NHTYKMGLTPYTDLTNEEFQAIYLGTR-SDTIH 114

Query: 108 SLKAN---GTPFLYKSS-QVPPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIK 156
            LK        + Y++   +P  ++W +KGAVTPVK QG+C        V+ VE IN I+
Sbjct: 115 RLKRTINISERYAYEAGDNLPEQIDWRKKGAVTPVKNQGKCGSCWAFSTVSTVESINQIR 174

Query: 157 INRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDS 216
              L+SLSEQQLVDC  N  N+GC GG    A++YII N GI  +A Y Y+ +  G C  
Sbjct: 175 TGNLISLSEQQLVDC--NKKNHGCKGGAFVYAYQYIIDNGGIDTEANYPYKAVQ-GPC-- 229

Query: 217 IKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASALQF--YSGGVFNGYCETFL 274
            +A     +I  Y+ VP  +E +L KAVA+QP  VAIDAS+ QF  Y  G+F+G C T L
Sbjct: 230 -RAAKKVVRIDGYKGVPHCNENALKKAVASQPSVVAIDASSKQFQHYKSGIFSGPCGTKL 288

Query: 275 NHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
           NHGV  VGY        YW+++NSWG+ WGE GY R++R      G CGIA    +P 
Sbjct: 289 NHGVVIVGYWKD-----YWIVRNSWGRYWGEQGYIRMKR--VGGCGLCGIARLPYYPT 339


>gi|194352764|emb|CAQ00110.1| papain-like cysteine proteinase [Hordeum vulgare subsp. vulgare]
          Length = 406

 Score =  221 bits (563), Expect = 4e-55,   Method: Compositional matrix adjust.
 Identities = 132/347 (38%), Positives = 188/347 (54%), Gaps = 47/347 (13%)

Query: 30  IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFN-NAAIGNRSYTLRLNKFADL 88
           + ++F  W   + R+Y  + E ++RFE+++ N+  +E  N  AA    +Y L    F DL
Sbjct: 59  MMDRFHVWMTVHNRSYSTAGEKARRFEVYRSNMRFIEAVNAEAATSGLTYELGEGPFTDL 118

Query: 89  TPQEFIASQTGFKM---------------SDHSSSLKANGTP-----FLYKSSQVPPSVN 128
           T +EF+   TG  +               + H+ S+   GT      +   S+  P S++
Sbjct: 119 TNEEFMELYTGQILEDDQSEDGDDDEQIITTHAGSIDGLGTHKGATVYANFSASAPTSID 178

Query: 129 WIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCY 181
           W ++G VTPVK Q QC        VA +EGI+ IK   LVSLSEQQL+DC   DN  GC 
Sbjct: 179 WRKRGVVTPVKNQKQCGSCWAFPTVATIEGIHKIKRGTLVSLSEQQLIDCDYLDN--GCK 236

Query: 182 GGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLL 241
           GG +  AF++I +N GIT+ + Y Y+ +  G C  ++    AA+I  +  V  N E SL+
Sbjct: 237 GGLVTRAFQWIKKNGGITSTSSYKYKAVR-GRC--LRNRKPAAKIVGFRKVKSNSEVSLM 293

Query: 242 KAVANQPVSVAIDASALQF--YSGGVFNGYCETF-LNHGVTAVGYGTSEE---------- 288
            AVANQPV+V+I + +  F  Y GG++NG C T  LNH VT VGYG  ++          
Sbjct: 294 NAVANQPVAVSISSHSSHFHHYKGGIYNGPCSTTKLNHAVTVVGYGQQQQNGADSVHASA 353

Query: 289 -GIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVSK 334
            G KYW++KNSWG  WG+ GY  ++R      GQCGIA    FP+ K
Sbjct: 354 PGAKYWIVKNSWGTTWGDKGYILMKRGTKHSSGQCGIATRPVFPLMK 400


>gi|402770503|gb|AFQ98386.1| cathepsin L [Rhipicephalus microplus]
          Length = 332

 Score =  221 bits (562), Expect = 5e-55,   Method: Compositional matrix adjust.
 Identities = 142/344 (41%), Positives = 193/344 (56%), Gaps = 36/344 (10%)

Query: 6   LIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAV 65
           L  ++ ++ + +SQ   RT        ++E +K  + +TY+   E   RF+IF +N + +
Sbjct: 7   LCAIVAVTVAASSQEILRT--------QWEAFKTTHKKTYQSHMEELLRFKIFTENSLII 58

Query: 66  ERFN-NAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFL----YKS 120
            + N   A G  SY L +N+F DL   EF     G     H  + K  G+ FL       
Sbjct: 59  AKHNAKYAKGLVSYKLGMNQFGDLLAHEFARIFNG-----HHGTRKTGGSTFLPPANVND 113

Query: 121 SQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCAT 173
           S +P  V+W +KGAVTPVK QGQC       A  ++EG + +K   LVSLSEQ LVDC+ 
Sbjct: 114 SSLPKVVDWRKKGAVTPVKDQGQCGSCWAFSATGSLEGRHFLKNGELVSLSEQNLVDCSQ 173

Query: 174 NDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVP 233
           +  NNGC GG M+DAFKYI +N GI  +  Y YE +  G C   K ED  A  T Y ++ 
Sbjct: 174 SFGNNGCEGGLMEDAFKYIKENDGIDTEKSYPYEAVD-GEC-RFKKEDVGATDTGYVEIK 231

Query: 234 PNDEESLLKAVAN-QPVSVAIDA--SALQFYSGGVFN-GYCETF-LNHGVTAVGYGTSEE 288
              E+ L KAVA   P+SVAIDA  S+ Q YS GV++   C +  L+HGV  VGYG  + 
Sbjct: 232 AGSEDDLKKAVATVGPISVAIDASHSSFQLYSEGVYDEPECSSEDLDHGVLVVGYGV-KG 290

Query: 289 GIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
           G KYWL+KNSW + WG+ GY  + RD +    QCGIA  AS+P+
Sbjct: 291 GKKYWLVKNSWAESWGDQGYILMSRDNNN---QCGIASQASYPL 331


>gi|427797099|gb|JAA64001.1| Putative cathepsin l cathepsin l, partial [Rhipicephalus
           pulchellus]
          Length = 331

 Score =  221 bits (562), Expect = 5e-55,   Method: Compositional matrix adjust.
 Identities = 132/318 (41%), Positives = 189/318 (59%), Gaps = 21/318 (6%)

Query: 30  IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNA-AIGNRSYTLRLNKFADL 88
           +  ++  +KA +G+ Y+   E   R +I+ +N + + R N   A    SY L +N+F D+
Sbjct: 19  VGAEWSAFKALHGKEYESDTEEYYRLKIYMENRLKIARHNEKYAKSQVSYKLAMNEFGDM 78

Query: 89  TPQEFIASQTGFKMSDHSSSLKAN--GTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQCA- 145
              EF++++ GFK +   +  + +    P   +   +P +V+W +KGAVTPVK QGQC  
Sbjct: 79  LHHEFVSTRNGFKRNYRDTPREGSFFVEPEGLEDFHLPKTVDWRKKGAVTPVKNQGQCGS 138

Query: 146 ------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGIT 199
                   ++EG +  K+++LVSLSEQ L+DC+ +  NNGC GG MD AFKYI  NKGI 
Sbjct: 139 CWSFSTTGSLEGQHFRKLHKLVSLSEQNLIDCSRSFGNNGCEGGLMDYAFKYIKANKGID 198

Query: 200 NDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVAN-QPVSVAIDAS-- 256
            +  Y Y   + G+C   K+    A  T + D+P  DE  L KAVA   PVSVAIDAS  
Sbjct: 199 TEQSYPYNA-TDGVCHFNKSAV-GATDTGFVDIPEGDENKLKKAVATVGPVSVAIDASHE 256

Query: 257 ALQFYSGGVFN-GYCETF-LNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRD 314
           + QFYS GV++   C++  L+HGV  VGYGT ++G  YWL+KNSWG  WG+ GY  + R+
Sbjct: 257 SFQFYSEGVYDEPECDSEQLDHGVLVVGYGT-KDGQDYWLVKNSWGTTWGDGGYIYMSRN 315

Query: 315 IDQPQGQCGIAMFASFPV 332
            D    QCGIA  AS+P+
Sbjct: 316 KDN---QCGIASAASYPL 330


>gi|72005575|ref|XP_783218.1| PREDICTED: cathepsin L2-like isoform 2 [Strongylocentrotus
           purpuratus]
 gi|390337647|ref|XP_003724610.1| PREDICTED: cathepsin L2-like isoform 1 [Strongylocentrotus
           purpuratus]
          Length = 334

 Score =  221 bits (562), Expect = 5e-55,   Method: Compositional matrix adjust.
 Identities = 135/344 (39%), Positives = 200/344 (58%), Gaps = 26/344 (7%)

Query: 3   KYFLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNL 62
           K F+IV+L ++G+ A++   R FDE     ++++W   +G+ Y    E  +R  I++DNL
Sbjct: 2   KTFIIVLLSVAGALATRLPSRDFDE-----EWKEWVDYHGKEYSAMGEEMERRMIWEDNL 56

Query: 63  VAVERFN-NAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKS- 120
             + + N   + G  +Y L +N+F D+T  EF+A++T  KMS         G+ FL    
Sbjct: 57  RIITKHNLEHSQGKTTYRLGMNEFGDMTNAEFVATRTMKKMS--GVPKVGQGSTFLPSEF 114

Query: 121 SQVPPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCAT 173
            Q+P SV+W  +G VTPVK QGQC        V A+EG + +K   LVSLSEQ LVDC+ 
Sbjct: 115 LQLPDSVDWRTEGYVTPVKDQGQCGSCWAFSTVGALEGQHFVKTGTLVSLSEQNLVDCSQ 174

Query: 174 NDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVP 233
            + N+GC GG+   A +YI  N GI  +  Y YEG+        +  D  A IT + +V 
Sbjct: 175 AEGNDGCNGGWPAWADEYIKSNGGIDTEVGYPYEGVDDSC--HYRTSDVGATITGFAEVE 232

Query: 234 PNDEESLLKAVAN-QPVSVAIDAS--ALQFYSGGVFN--GYCETFLNHGVTAVGYGTSEE 288
            + E++L KA+A   P+SV IDA+  + Q Y  GV++      T L+H VTAVGY ++ +
Sbjct: 233 ADSEKALEKALAQVGPISVCIDATQPSFQLYESGVYDEPDCSSTALDHCVTAVGYDSTAD 292

Query: 289 GIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
           G KY+++KNSWG  WG++GY  + RD    Q QCGIA  A++P+
Sbjct: 293 GDKYYIVKNSWGTTWGQEGYIWMSRD---KQKQCGIATNATYPL 333


>gi|356515116|ref|XP_003526247.1| PREDICTED: LOW QUALITY PROTEIN: vignain-like [Glycine max]
          Length = 333

 Score =  221 bits (562), Expect = 5e-55,   Method: Compositional matrix adjust.
 Identities = 140/340 (41%), Positives = 178/340 (52%), Gaps = 52/340 (15%)

Query: 33  KFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQE 92
           +F++W    G  Y++  E   RF I++ N   VE          SY L  NKFADLT +E
Sbjct: 4   RFDRWLKXNGXNYEDKEEWEIRFVIYQAN---VEYIGCKKSQKNSYNLTDNKFADLTNEE 60

Query: 93  FIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQCA------- 145
           F+++  GF     ++ L  +     ++   +P S +W ++GAVT +K QG C        
Sbjct: 61  FVSTYLGF-----ATRLIPHTRFKYHEHGNLPXSKDWRKEGAVTDIKDQGNCGKHSTWFS 115

Query: 146 -----------------------------VAAVEGINAIKINRLVSLSEQQLVDCATNDN 176
                                        VAAVE IN IK  +LVSLSEQ+LVD    + 
Sbjct: 116 PEISHNLRNILTNYNTINFRDISFWAFSVVAAVERINKIKSGKLVSLSEQELVDYDVANK 175

Query: 177 NNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPND 236
           N GC GG MD  F +I +N G+T    Y YEG+  G C+  KA  HA  I+ YE  P  D
Sbjct: 176 NQGCEGGLMDTTFAFIKKNGGLTTSKDYPYEGVD-GSCNKEKALHHAVNISGYERAPSKD 234

Query: 237 EESLLKAVANQPVSVAIDAS--ALQFYSGGVFNGYCETFLNHGVTAVGY--GTSEEGIKY 292
           E  L  A ANQP+SVAIDA   A Q YS GVF+G C   LNHGVT VGY  GT +   KY
Sbjct: 235 EAMLKVAAANQPISVAIDAGGYAFQLYSQGVFSGVCGKKLNHGVTIVGYDKGTFD---KY 291

Query: 293 WLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
             +KNS G DWGE GY R++RD     G CGIAM AS+P+
Sbjct: 292 RTVKNSXGADWGESGYIRMKRDAFDKAGTCGIAMKASYPL 331


>gi|357160095|ref|XP_003578656.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP2-like
           [Brachypodium distachyon]
          Length = 377

 Score =  221 bits (562), Expect = 6e-55,   Method: Compositional matrix adjust.
 Identities = 134/332 (40%), Positives = 182/332 (54%), Gaps = 31/332 (9%)

Query: 29  SIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADL 88
           ++A +F++WKA++GR Y    E  +R  ++  N+  +E  N       +Y L    + DL
Sbjct: 48  TMAPRFQRWKAEHGRAYATRDEELRRLRVYARNVRYIEAANGDPAAGLTYQLGETAYTDL 107

Query: 89  TPQEFIASQT---------------GFKMSDHSSSLKANGTPFLYKSSQV--PPSVNWIE 131
           T  EF A  T                  ++  + ++ A G    +  S    P SV+W  
Sbjct: 108 TADEFTAMYTSPSPVLSAHDDEAAGAMMITTRAGAVDAGGQQVYFNVSTAGAPASVDWRA 167

Query: 132 KGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGF 184
           KGAVT VK QG+C        VA VEGI+ I+   L+SLSEQ+LVDC T D   GC GG 
Sbjct: 168 KGAVTEVKNQGRCGSCWAFSTVAVVEGIHQIRTGNLISLSEQELVDCDTLDY--GCDGGV 225

Query: 185 MDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAV 244
              A ++I  N GI  +A Y Y G   G C + K   HAA I+ +  V    E SL  AV
Sbjct: 226 SYHALEWIASNGGIATEADYPYTGKD-GACVANKLPLHAAAISGFARVATRSEPSLANAV 284

Query: 245 ANQPVSVAIDASA--LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGI-KYWLIKNSWGQ 301
           A QPV+V+I+A     Q Y  GV+NG C T LNHGVT VGYG  E    KYW++KNSWG+
Sbjct: 285 AAQPVAVSIEAGGANFQHYVKGVYNGPCGTRLNHGVTVVGYGEEEGDGEKYWIVKNSWGK 344

Query: 302 DWGEDGYFRLQRDI-DQPQGQCGIAMFASFPV 332
            WG+ GYFR+++D+  +P+G CGIA+  SFP+
Sbjct: 345 KWGDGGYFRMKKDVAGKPEGLCGIAIRPSFPL 376


>gi|301116794|ref|XP_002906125.1| cysteine protease family C01A, putative [Phytophthora infestans
           T30-4]
 gi|262107474|gb|EEY65526.1| cysteine protease family C01A, putative [Phytophthora infestans
           T30-4]
          Length = 535

 Score =  220 bits (561), Expect = 6e-55,   Method: Compositional matrix adjust.
 Identities = 133/312 (42%), Positives = 175/312 (56%), Gaps = 21/312 (6%)

Query: 32  EKFEQWKAQYGRTYKESAENSKRFE--IFKDNLVAVERFNNAAIGNRSYTLRLNKFADLT 89
            +F  W   +  ++ ++ E +KR E  I  D  +      NA  G +   L  N+F+ ++
Sbjct: 27  HEFSAWMKTHSVSFSDALEFAKRLENYIANDMYIMEHNLENAWTGVK---LDHNEFSSMS 83

Query: 90  PQEFIASQTGFKMSD-HSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQCA--- 145
            +EF    TG+ M + +     A+    L+   QVP SV+W +KG VTPVK QG C    
Sbjct: 84  FEEFKFKMTGYVMPEGYLEQRLASRVDNLWSDVQVPDSVDWQDKGGVTPVKNQGMCGSCW 143

Query: 146 ----VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITND 201
                 AVEG   +   +LVSLSEQ+LVDC  N  + GC GG MD AF +I  N GI ++
Sbjct: 144 AFSTTGAVEGAAFVSSGKLVSLSEQELVDCDHN-GDMGCNGGLMDHAFAWIEDNGGICSE 202

Query: 202 AVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDA--SALQ 259
             Y Y+  +    D  K      +I+ ++DV P DE +L  AVA QPVSVAI+A   A Q
Sbjct: 203 DDYEYKAKAQVCRDCEKV----VKISGFQDVNPQDEHALKVAVAQQPVSVAIEADQKAFQ 258

Query: 260 FYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQ 319
           FY  GVFN  C T L+HGV AVGYG SE G K+W +KNSWG  WGE GY RL R+ + P 
Sbjct: 259 FYKSGVFNLTCGTRLDHGVLAVGYG-SENGQKFWKVKNSWGSSWGEKGYIRLAREENGPA 317

Query: 320 GQCGIAMFASFP 331
           GQCGIA   S+P
Sbjct: 318 GQCGIASVPSYP 329


>gi|195124431|ref|XP_002006696.1| GI21205 [Drosophila mojavensis]
 gi|193911764|gb|EDW10631.1| GI21205 [Drosophila mojavensis]
          Length = 339

 Score =  220 bits (561), Expect = 6e-55,   Method: Compositional matrix adjust.
 Identities = 131/324 (40%), Positives = 186/324 (57%), Gaps = 29/324 (8%)

Query: 30  IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNA-AIGNRSYTLRLNKFADL 88
           I E++  +K ++ +TY++  E   R +IF +N   + + N   A G  ++ + +NK+AD+
Sbjct: 23  IKEEWHTFKLEHRKTYQDETEERFRLKIFNENKHKIAKHNQRYATGEVTFKMAVNKYADM 82

Query: 89  TPQEFIASQTGFKMSDH----SSSLKANGTPFLYKSS-QVPPSVNWIEKGAVTPVKYQGQ 143
              EF  +  GF  + H    +S     G  F+  +  ++P SV+W EKGAVT VK QG 
Sbjct: 83  LHHEFRETMNGFNYTLHKELRASDPSFTGITFISPAHVKLPKSVDWREKGAVTAVKDQGH 142

Query: 144 C-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNK 196
           C       +  A+EG +  K   LVSLSEQ LVDC+    NNGC GG MD+AF+YI  N 
Sbjct: 143 CGSCWAFSSTGALEGQHFRKTGTLVSLSEQNLVDCSAKYGNNGCNGGLMDNAFRYIKDNG 202

Query: 197 GITNDAVYSYEGMSTGIC---DSIKAEDHAAQITNYEDVPPNDEESLLKAVAN-QPVSVA 252
           GI  +  Y YEG+        DS+ A D       + D+P  +E+ + +AVA   PVSVA
Sbjct: 203 GIDTEKSYPYEGIDDSCHFNKDSVGATDRG-----FADIPQGNEKKMAEAVATIGPVSVA 257

Query: 253 IDAS--ALQFYSGGVFN-GYCET-FLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGY 308
           IDAS  + QFYS G++N   C +  L+HGV  VGYGT E G  YWL+KNSWG  WG+ G+
Sbjct: 258 IDASHESFQFYSEGIYNEPECNSQNLDHGVLVVGYGTDESGKDYWLVKNSWGTTWGDKGF 317

Query: 309 FRLQRDIDQPQGQCGIAMFASFPV 332
            ++ R+ D    QCGIA  +S+P+
Sbjct: 318 IKMARNEDN---QCGIASASSYPL 338


>gi|66270077|gb|AAY43368.1| cysteine protease [Phytophthora infestans]
          Length = 510

 Score =  220 bits (560), Expect = 8e-55,   Method: Compositional matrix adjust.
 Identities = 133/312 (42%), Positives = 175/312 (56%), Gaps = 21/312 (6%)

Query: 32  EKFEQWKAQYGRTYKESAENSKRFE--IFKDNLVAVERFNNAAIGNRSYTLRLNKFADLT 89
            +F  W   +  ++ ++ E +KR E  I  D  +      NA  G +   L  N+F+ ++
Sbjct: 27  HEFSAWMKTHSVSFSDALEFAKRLENYIANDMYIMEHNLENAWTGVK---LDHNEFSSMS 83

Query: 90  PQEFIASQTGFKMSD-HSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQCA--- 145
            +EF    TG+ M + +     A+    L+   QVP SV+W +KG VTPVK QG C    
Sbjct: 84  FEEFKFKMTGYVMPEGYLEQRLASRVDNLWSDVQVPDSVDWQDKGGVTPVKNQGMCGSCW 143

Query: 146 ----VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITND 201
                 AVEG   +   +LVSLSEQ+LVDC  N  + GC GG MD AF +I  N GI ++
Sbjct: 144 AFSTTGAVEGAAFVSSGKLVSLSEQELVDCDHN-GDMGCNGGLMDHAFAWIEDNGGICSE 202

Query: 202 AVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDA--SALQ 259
             Y Y+  +    D  K      +I+ ++DV P DE +L  AVA QPVSVAI+A   A Q
Sbjct: 203 DDYEYKAKAQVCRDCEKV----VKISGFQDVNPQDEHALKVAVAQQPVSVAIEADQKAFQ 258

Query: 260 FYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQ 319
           FY  GVFN  C T L+HGV AVGYG SE G K+W +KNSWG  WGE GY RL R+ + P 
Sbjct: 259 FYKSGVFNLTCGTRLDHGVLAVGYG-SENGQKFWKVKNSWGSSWGEKGYIRLAREENGPA 317

Query: 320 GQCGIAMFASFP 331
           GQCGIA   S+P
Sbjct: 318 GQCGIASVPSYP 329


>gi|195379496|ref|XP_002048514.1| GJ14012 [Drosophila virilis]
 gi|194155672|gb|EDW70856.1| GJ14012 [Drosophila virilis]
          Length = 327

 Score =  220 bits (560), Expect = 1e-54,   Method: Compositional matrix adjust.
 Identities = 132/337 (39%), Positives = 194/337 (57%), Gaps = 27/337 (8%)

Query: 8   VVLIISG-SCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVE 66
           ++LI++G  C    +Y    E  +A +FE +K +Y ++Y++  E   R +IFKDN   ++
Sbjct: 6   LLLIVAGVGCNRALSY----EDVLASEFESFKVEYEKSYEDDGEEQLRMQIFKDNKQLID 61

Query: 67  RFNNA-AIGNRSYTLRLNKFADLTPQEFIASQ-TGFKMSDHSSSLKANGTPFLYKSSQVP 124
           R N   A G  +Y + +N+F D+   EF         +SD +SS++   +P    ++++P
Sbjct: 62  RHNERYAAGEETYEMGVNQFTDMLATEFRKIMLVNLNISDFTSSIEYIYSP---ANAEIP 118

Query: 125 PSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNN 177
             V+W EKGAVTPVK QG+C       A  A+EG + I+  +L+ LSEQ L+DC++  NN
Sbjct: 119 SQVDWREKGAVTPVKNQGRCGSCWAFSAAGALEGQHFIQTKQLIPLSEQNLLDCSSRYNN 178

Query: 178 NGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDE 237
           +GC GG+   A  Y+  N+G+ ND  Y YEG   G C   +    +A +T    V   DE
Sbjct: 179 HGCGGGWPAAALMYVRDNRGMDNDRAYPYEG-HVGRC-RFRRYSVSATVTQVMQVR-RDE 235

Query: 238 ESLLKAVANQ-PVSVAIDASALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIK 296
            +L  AVA + PVSVA+DA+  Q Y GGV++  C    NH +  VGYG+ + G  +WLIK
Sbjct: 236 VALANAVATKGPVSVAVDATYFQHYRGGVYSHRCRQQANHAMLVVGYGSDQRGGDFWLIK 295

Query: 297 NSWGQDWGEDGYFRLQRDIDQPQGQ-CGIAMFASFPV 332
           NSWG  WGE GY RL R+    QG  C +A +A FP+
Sbjct: 296 NSWG-GWGEQGYMRLARN----QGNLCHVASYAVFPI 327


>gi|269784818|ref|NP_001161481.1| cathepsin L1 precursor [Gallus gallus]
          Length = 353

 Score =  219 bits (559), Expect = 1e-54,   Method: Compositional matrix adjust.
 Identities = 138/342 (40%), Positives = 191/342 (55%), Gaps = 24/342 (7%)

Query: 8   VVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVER 67
           V L I   C   A      +  +   ++ WK+ + + Y E  E+ +R  +++ NL  +E 
Sbjct: 18  VCLTILSLCLGLAFAAPRVDPDLDSHWQLWKSWHSKDYHEREESWRRV-VWEKNLKMIEL 76

Query: 68  FN-NAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKS-SQVPP 125
            N + ++G  SY L +N+F D+T +EF     G+K     S  K  G+ FL  S  + P 
Sbjct: 77  HNLDHSLGKHSYKLGMNQFGDMTAEEFRQLMNGYKHK--KSERKYRGSQFLEPSFLEAPR 134

Query: 126 SVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNN 178
           SV+W EKG VTPVK QGQC          A+EG +  K  +LVSLSEQ LVDC+  + N 
Sbjct: 135 SVDWREKGYVTPVKDQGQCGSCWAFSTTGALEGQHFRKTGKLVSLSEQNLVDCSRPEGNQ 194

Query: 179 GCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEE 238
           GC GG MD AF+Y+  N GI ++  Y Y       C   KAE +AA  T + D+P   E 
Sbjct: 195 GCNGGLMDQAFQYVQDNGGIDSEESYPYTAKDDEDC-RYKAEYNAANDTGFVDIPQGHER 253

Query: 239 SLLKAVAN-QPVSVAIDA--SALQFYSGGV-FNGYCETF-LNHGVTAVGYGTSEE---GI 290
           +L+KAVA+  PVSVAIDA  S+ QFY  G+ +   C +  L+HGV  VGYG   E   G 
Sbjct: 254 ALMKAVASVGPVSVAIDAGHSSFQFYQSGIYYEPDCSSEDLDHGVLVVGYGFEGEDVDGK 313

Query: 291 KYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
           KYW++KNSWG+ WG+ GY  + +D    +  CGIA  AS+P+
Sbjct: 314 KYWIVKNSWGEKWGDKGYIYMAKD---RKNHCGIATAASYPL 352


>gi|402770501|gb|AFQ98385.1| cathepsin L [Rhipicephalus microplus]
          Length = 332

 Score =  219 bits (559), Expect = 1e-54,   Method: Compositional matrix adjust.
 Identities = 142/344 (41%), Positives = 192/344 (55%), Gaps = 36/344 (10%)

Query: 6   LIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAV 65
           L  +  ++ + +SQ   RT        ++E +K  + +TY+   E   RF+IF +N + +
Sbjct: 7   LCAIAAVTVAASSQEILRT--------QWEAFKTTHKKTYQSHMEELLRFKIFTENSLII 58

Query: 66  ERFN-NAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFL----YKS 120
            + N   A G  SY L +N+F DL   EF     G     H  + K  G+ FL       
Sbjct: 59  AKHNAKYAKGLVSYKLGMNQFGDLLAHEFARIFNG-----HHGTRKTGGSTFLPPANVND 113

Query: 121 SQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCAT 173
           S +P +V+W +KGAVTPVK QGQC       A  ++EG + +K   LVSLSEQ LVDC+ 
Sbjct: 114 SSLPKAVDWRKKGAVTPVKDQGQCGSCWAFSATGSLEGQHFLKNGELVSLSEQNLVDCSQ 173

Query: 174 NDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVP 233
           +  NNGC GG M+DAFKYI  N GI  +  Y YE +  G C   K ED  A  T Y ++ 
Sbjct: 174 SFGNNGCEGGLMEDAFKYIKANDGIDTEKSYPYEAVD-GEC-RFKKEDVGATDTGYVEIK 231

Query: 234 PNDEESLLKAVAN-QPVSVAIDA--SALQFYSGGVFN-GYCETF-LNHGVTAVGYGTSEE 288
              E+ L KAVA   P+SVAIDA  S+ Q YS GV++   C +  L+HGV  VGYG  + 
Sbjct: 232 AGSEDDLKKAVATVGPISVAIDASHSSFQLYSEGVYDEPECSSEDLDHGVLVVGYGV-KG 290

Query: 289 GIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
           G KYWL+KNSW + WG+ GY  + RD +    QCGIA  AS+P+
Sbjct: 291 GKKYWLVKNSWAESWGDQGYILMSRDNNN---QCGIASQASYPL 331


>gi|357122137|ref|XP_003562772.1| PREDICTED: fruit bromelain-like [Brachypodium distachyon]
          Length = 358

 Score =  219 bits (559), Expect = 1e-54,   Method: Compositional matrix adjust.
 Identities = 133/335 (39%), Positives = 185/335 (55%), Gaps = 45/335 (13%)

Query: 30  IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLT 89
           + ++F  ++A Y RTY    E  +RFE+++ N+  +E  N    G+ +Y L  N+FADLT
Sbjct: 36  MMDRFRAFQATYNRTYASPEERLRRFEVYRRNVDYIEAMNRR--GDLTYELGENQFADLT 93

Query: 90  PQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQV------------------------PP 125
            QEF A         ++   + +  P  ++  Q+                        P 
Sbjct: 94  VQEFRAM--------YTMPARVDSRPDAWRRRQMITTLAGPVTEDGGSYYSDAWEEAGPT 145

Query: 126 SVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNN 178
           SV+W  KGAVTPVK QG C        VA +EG++ IK  +LVSLSEQ+LVDC   D+  
Sbjct: 146 SVDWRSKGAVTPVKDQGGCGCCWAFATVATIEGLHKIKTGQLVSLSEQELVDCDDADDGC 205

Query: 179 GCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEE 238
           G      + A +++  N G+T +A Y Y G + G CD  KA +HAA+I   + V  N E 
Sbjct: 206 GGG--LPEIAMEWVAHNGGLTTEANYPYTGKA-GKCDRGKASNHAAKIAAAQMVRANSEA 262

Query: 239 SLLKAVANQPVSVAIDA-SALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKN 297
            L +AVA QPV+VAI+A  +L FY  GV++G C    +H VT VGYG   +G KYW+IKN
Sbjct: 263 ELERAVARQPVAVAINAPDSLMFYKSGVYSGPCTAEFDHAVTVVGYGADNKGHKYWIIKN 322

Query: 298 SWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
           SW + WGE GY R+QR +   +G CGIA  AS+PV
Sbjct: 323 SWAETWGEKGYGRMQRGVAAKEGLCGIATHASYPV 357


>gi|356545112|ref|XP_003540989.1| PREDICTED: LOW QUALITY PROTEIN: KDEL-tailed cysteine endopeptidase
           CEP1-like [Glycine max]
          Length = 400

 Score =  219 bits (558), Expect = 1e-54,   Method: Compositional matrix adjust.
 Identities = 123/311 (39%), Positives = 187/311 (60%), Gaps = 17/311 (5%)

Query: 11  IISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNN 70
           ++  +C  Q   ++  E   +E+ E+W AQYG+ Y+++AE  KRF+IFK+N+  +E FN 
Sbjct: 92  LVGVTCGRQCRSKSRLEACTSERHEKWMAQYGKVYEDAAEMEKRFQIFKNNVQFIESFNV 151

Query: 71  AAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKS--SQVPPSVN 128
           A  G++ + +R+N+F DL  +EF A     +            T F Y S  + +P +++
Sbjct: 152 A--GDKPFNIRINQFPDLHDEEFKALLINGQRKVSGVETATEETSFRYGSVVTNIPATMD 209

Query: 129 WIEKGAVTPVKYQ---GQC----AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCY 181
             +KG VTP+K Q   G C    AVAA+EGI+ I  ++L+ LS+Q+LVD    + + GC 
Sbjct: 210 GRKKGVVTPIKDQGIIGSCWALSAVAAIEGIHQITTSKLMFLSKQKLVDSVKGE-SEGCI 268

Query: 182 GGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLL 241
           GG+++DAF++I++  GI ++  Y Y+G++   C   K     A I  YE VP N++++LL
Sbjct: 269 GGYVEDAFEFIVKKGGILSETHYPYKGVNX--CKVEKETHSVAHIKGYEKVPSNNKKALL 326

Query: 242 KAVANQPVSVAID--ASALQFYSGGVFNGY-CETFLNHGVTAVGYGTSEEGIKYWLIKNS 298
           K VANQPVSV ID  A A ++YS  +FN   C +  NH V  VGYG + +G KYW +KNS
Sbjct: 327 KVVANQPVSVYIDVGAHAFKYYSSEIFNARNCGSDPNHVVAVVGYGKALDGAKYWPVKNS 386

Query: 299 WGQDWGEDGYF 309
           WG +WG   Y 
Sbjct: 387 WGTEWGGKWYM 397


>gi|261289785|ref|XP_002611754.1| hypothetical protein BRAFLDRAFT_284341 [Branchiostoma floridae]
 gi|229297126|gb|EEN67764.1| hypothetical protein BRAFLDRAFT_284341 [Branchiostoma floridae]
          Length = 327

 Score =  219 bits (558), Expect = 1e-54,   Method: Compositional matrix adjust.
 Identities = 136/346 (39%), Positives = 192/346 (55%), Gaps = 40/346 (11%)

Query: 5   FLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVA 64
           FLI+VL ++ + A    +  F            K  +G+ YK   E + R  IF+DN   
Sbjct: 3   FLILVLSVTMATAMDVEWEAF------------KLTHGKQYKSPDEENVRRAIFRDNNQM 50

Query: 65  VERFNN-AAIGNRSYTLRLNKFADLTPQEFIASQTG-----FKMSDHSSSLKANGTPFLY 118
           ++  N  AA+G RSY + +N+F DL   E++    G       +S  S ++    TP L 
Sbjct: 51  IKEHNQEAAMGRRSYFMGMNQFGDLAHSEYLELVVGPGLLPLNLSTPSENV-FESTPGL- 108

Query: 119 KSSQVPPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDC 171
              QV  +V+W +KGAVTP+K QG C          ++EG + +K  +LVSLSEQ L+DC
Sbjct: 109 ---QVDDTVDWRQKGAVTPIKDQGHCGSCWAFSTTGSLEGQHFMKTGKLVSLSEQNLLDC 165

Query: 172 ATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYED 231
           +    N GC GG MD AF+YI  N GI  +  Y Y      +CD  K     A +++Y D
Sbjct: 166 SRRFGNKGCEGGLMDQAFRYIKSNGGIDTEECYPYMAKDEKVCD-YKTSCSGATLSSYTD 224

Query: 232 VPPNDEESLLKAVAN-QPVSVAIDAS--ALQFYSGGVFN--GYCETFLNHGVTAVGYGTS 286
           +   DE +L++AV    PVSVAIDAS  +L+FY  G+++      T L+HGV AVGYG S
Sbjct: 225 IKAMDEMALMQAVGTVGPVSVAIDASHKSLRFYKSGIYDEPECSRTKLDHGVLAVGYG-S 283

Query: 287 EEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
            +G+ YWL+KNSWG  WG+ GY ++ R+ +    QCGIA  AS+PV
Sbjct: 284 MDGMDYWLVKNSWGSAWGDMGYVKMTRNKNN---QCGIATKASYPV 326


>gi|402770505|gb|AFQ98387.1| cathepsin L [Rhipicephalus microplus]
          Length = 332

 Score =  219 bits (558), Expect = 2e-54,   Method: Compositional matrix adjust.
 Identities = 142/344 (41%), Positives = 192/344 (55%), Gaps = 36/344 (10%)

Query: 6   LIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAV 65
           L  ++ ++ + +SQ   RT        ++E +K  + +TY+   E   RF+IF +N + +
Sbjct: 7   LCAIVAVTVAASSQEILRT--------QWEAFKTTHKKTYQSHMEELLRFKIFTENSLII 58

Query: 66  ERFN-NAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFL----YKS 120
            + N   A G  SY L +N+F DL   EF     G     H  + K  G+ FL       
Sbjct: 59  AKHNAKYAKGLVSYKLGMNQFGDLLAHEFARIFNG-----HRGTRKTGGSTFLPPANVND 113

Query: 121 SQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCAT 173
           S +P +V+W +KGAVTPVK QGQC       A  ++EG + +K   LVSLSEQ LVDC+ 
Sbjct: 114 SSLPKAVDWRKKGAVTPVKDQGQCGSCWAFSATGSLEGQHFLKNGELVSLSEQNLVDCSQ 173

Query: 174 NDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVP 233
           +  NNGC GG M+DAFKYI  N GI  +  Y YE +  G C   K ED  A  T Y ++ 
Sbjct: 174 SFGNNGCEGGLMEDAFKYIKANDGIDTEKSYPYEAVD-GEC-RFKKEDVGATDTGYVEIK 231

Query: 234 PNDEESLLKAVAN-QPVSVAIDA--SALQFYSGGVFN-GYCETF-LNHGVTAVGYGTSEE 288
              E  L KAVA   P+SVAIDA  S+ Q YS GV++   C +  L+HGV  VGYG  + 
Sbjct: 232 AGSEVDLKKAVATVGPISVAIDASHSSFQLYSEGVYDEPECSSEDLDHGVLVVGYGV-KG 290

Query: 289 GIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
           G KYWL+KNSW + WG+ GY  + RD +    QCGIA  AS+P+
Sbjct: 291 GKKYWLVKNSWAESWGDQGYILMSRDNNN---QCGIASQASYPL 331


>gi|413953050|gb|AFW85699.1| hypothetical protein ZEAMMB73_033873 [Zea mays]
          Length = 361

 Score =  219 bits (557), Expect = 2e-54,   Method: Compositional matrix adjust.
 Identities = 140/360 (38%), Positives = 200/360 (55%), Gaps = 34/360 (9%)

Query: 1   MAKYFLIVVLIISGSCASQATYRTFDEGSIA----EKFEQWKAQYGRTYKESAENSKRFE 56
           MA     + L++  +C+       F + +IA    E+F+ W+A+Y RTY    E  +RF 
Sbjct: 3   MATASASLALVMLFACSLLLAGTAFSDDTIAIPLLERFKAWQAEYNRTYATPEEFQQRFM 62

Query: 57  IFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPF 116
           ++ +NL  ++  N  + G+ SY L  N+F DLT +EF   +  + M        A   P 
Sbjct: 63  VYSENLRFIKTMNQLSTGS-SYELGENQFTDLTEEEF---KDTYLMKLDEQPPAAEAMPP 118

Query: 117 LY------------KSSQVPPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKI 157
           +              + + P SV+W  KGAVTPVK Q QC        VA++EG++ IK 
Sbjct: 119 IVGTMSTAGMSNGDNTGEAPNSVDWRTKGAVTPVKNQQQCGSCWAFATVASIEGVHQIKT 178

Query: 158 NRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSI 217
            RLVSLSEQ++VDC    N++GC GG+   A +++ +N G+T ++ Y Y G S   C S 
Sbjct: 179 GRLVSLSEQEIVDCDRGGNDHGCRGGYPRSAMEWVTRNGGLTTESDYPYVG-SQRQCMSG 237

Query: 218 KAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDAS-ALQFYSGGVFNGYCE-TFLN 275
           K   HAA+I  Y+ V   +E  L +AVA +PV+V IDAS A QFY  GVF+G C  T +N
Sbjct: 238 KLGHHAARIRGYQAVQRKNEAELERAVAGRPVAVVIDASRAFQFYKRGVFSGPCNTTTVN 297

Query: 276 HGVTAVGYGTSEEGI----KYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFP 331
           H VT VGYG++        KYW++KNSWGQ WGE+GY R+ R +   +G C IA+    P
Sbjct: 298 HAVTVVGYGSAGSDSGGGRKYWIVKNSWGQRWGENGYVRMARRVRAREGMCAIAIEPLLP 357


>gi|224460525|gb|ACN43674.1| cathepsin L [Paralichthys olivaceus]
          Length = 334

 Score =  219 bits (557), Expect = 2e-54,   Method: Compositional matrix adjust.
 Identities = 135/316 (42%), Positives = 181/316 (57%), Gaps = 21/316 (6%)

Query: 32  EKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAA-IGNRSYTLRLNKFADLTP 90
             F  WK ++GR+Y  S+E  KR +I+  N   V   N  A  G+ +Y L +  +ADL  
Sbjct: 24  HDFHAWKLKFGRSYNSSSEEDKRMQIWLRNREIVMAHNAMADQGHSTYRLGMTFYADLEH 83

Query: 91  QEFIASQTGFKMSDHSSSLKANGTPFL--YKSSQVPPSVNWIEKGAVTPVKYQGQC---- 144
           +EF  +  G  +   ++S    G+ FL  ++   +P +++W + G VTPVK QG C    
Sbjct: 84  EEFKQTVFGVCLGSFNASKPRGGSSFLKMHRFYNLPQTIDWRQWGFVTPVKNQGSCGSCW 143

Query: 145 ---AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITND 201
              +  A+EG N  K  RLVSLSEQ+LVDC+ N  N GC GG+MD+AF+YI+   GI  +
Sbjct: 144 SFSSTGALEGQNFRKTGRLVSLSEQELVDCSGNYGNYGCNGGWMDNAFRYIVNKGGIHTE 203

Query: 202 AVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVAN-QPVSVAIDAS--AL 258
             Y YEG   G C +   E   A  T Y D+P  +E +L +AVA   PVSVAI AS  + 
Sbjct: 204 DSYPYEGQ-VGQCRANYGEI-GATCTGYYDIPSGNEHALKEAVATFGPVSVAIHASDQSF 261

Query: 259 QFYSGGVFNG-YCE-TFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDID 316
           Q Y  GV+N  YC  T L+H V  VGYGT E G  YWL+KNSWG  WG+ GY ++ R+  
Sbjct: 262 QLYHSGVYNNPYCSGTALDHAVLIVGYGT-EYGQDYWLVKNSWGPAWGDQGYIKMSRN-- 318

Query: 317 QPQGQCGIAMFASFPV 332
               QCGIA  ASFP+
Sbjct: 319 -RYNQCGIASAASFPL 333


>gi|402770509|gb|AFQ98389.1| cathepsin L [Rhipicephalus microplus]
          Length = 332

 Score =  219 bits (557), Expect = 2e-54,   Method: Compositional matrix adjust.
 Identities = 142/344 (41%), Positives = 192/344 (55%), Gaps = 36/344 (10%)

Query: 6   LIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAV 65
           L  +  ++ + +SQ   RT        ++E +K  + +TY+   E   RF+IF ++ + +
Sbjct: 7   LCAIAAVTVAASSQEILRT--------QWEAFKTTHKKTYQSHMEELLRFKIFTESSLII 58

Query: 66  ERFN-NAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFL----YKS 120
            R N   A G  SY L +N+F DL   EF     G     H  + K  G+ FL       
Sbjct: 59  ARHNAKYAKGLVSYKLGMNQFGDLLAHEFARIFNG-----HHGTRKTGGSTFLPPANVND 113

Query: 121 SQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCAT 173
           S +P +V+W +KGAVTPVK QGQC       A  ++EG + +K   LVSLSEQ LVDC+ 
Sbjct: 114 SSLPKAVDWRKKGAVTPVKDQGQCGSCWAFSATGSLEGQHFLKNGELVSLSEQNLVDCSQ 173

Query: 174 NDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVP 233
           +  NNGC GG M+DAFKYI  N GI  +  Y YE +  G C   K ED  A  T Y ++ 
Sbjct: 174 SFGNNGCEGGLMEDAFKYIKANDGIDTEKSYPYEAVD-GEC-RFKKEDVGATDTGYVEIK 231

Query: 234 PNDEESLLKAVAN-QPVSVAIDA--SALQFYSGGVFN-GYCETF-LNHGVTAVGYGTSEE 288
              E+ L KAVA   P+SVAIDA  S+ Q YS GV++   C +  L+HGV  VGYG  + 
Sbjct: 232 AGSEDDLKKAVATVGPISVAIDASHSSFQLYSEGVYDEPECSSEDLDHGVLVVGYGV-KG 290

Query: 289 GIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
           G KYWL+KNSW + WG+ GY  + RD +    QCGIA  AS+P+
Sbjct: 291 GKKYWLVKNSWAESWGDQGYILMSRDNNN---QCGIASQASYPL 331


>gi|443698586|gb|ELT98517.1| hypothetical protein CAPTEDRAFT_128252 [Capitella teleta]
          Length = 324

 Score =  219 bits (557), Expect = 2e-54,   Method: Compositional matrix adjust.
 Identities = 140/330 (42%), Positives = 196/330 (59%), Gaps = 28/330 (8%)

Query: 17  ASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFN-NAAIGN 75
           A+ A+   FDE ++ E +  +K  + +TY   AE+ +RF I++ +L  + + N  A +G 
Sbjct: 8   ATLASPLVFDE-ALDEMWTLFKTTHSKTYATEAEDMRRF-IWERHLNMINQHNIEADLGK 65

Query: 76  RSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSS-QVPPSVNWIEKGA 134
            +++L +N++ DLT  E+ A+ +G+KM+  S      G+ FL   + QVP +V+W EKG 
Sbjct: 66  HTFSLGMNEYGDLTQHEY-AAMSGYKMAKSSV-----GSSFLEPENLQVPKTVDWREKGY 119

Query: 135 VTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDD 187
           VTPVK QGQC       +  ++EG    K  RL S+SEQ LVDC+ ++ N GC GG MD+
Sbjct: 120 VTPVKNQGQCGSCWAFSSTGSLEGQVFRKTGRLPSISEQNLVDCSRDEGNMGCSGGLMDN 179

Query: 188 AFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVAN- 246
           AF YI +N GI ++  Y YE +  G C   K  D     + + D+P  DE +L  AVA+ 
Sbjct: 180 AFTYIKKNMGIDSEKSYPYEAVD-GEC-RYKKSDSVTTDSGFVDIPHGDETALRTAVASV 237

Query: 247 QPVSVAIDAS--ALQFYSGGVFN-GYC-ETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQD 302
            PVSVAIDAS  + QFY  GV+    C  T L+HGV  VGYG  E G  YWL+KNSWG  
Sbjct: 238 GPVSVAIDASHTSFQFYKTGVYTEANCSSTQLDHGVLVVGYGV-ENGQDYWLVKNSWGAS 296

Query: 303 WGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
           WGE GY +L R+      QCGIA  AS+P+
Sbjct: 297 WGEAGYIKLARNHGN---QCGIASQASYPL 323


>gi|116794072|gb|ABK26996.1| unknown [Picea sitchensis]
          Length = 367

 Score =  219 bits (557), Expect = 2e-54,   Method: Compositional matrix adjust.
 Identities = 135/360 (37%), Positives = 197/360 (54%), Gaps = 34/360 (9%)

Query: 8   VVLIISGSCASQATYRTFDEGS---IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVA 64
           ++ ++S + A Q +Y   D  S   +   F++W  ++G+ Y    E ++R +IF+ NL  
Sbjct: 14  IICLVSAAKAVQHSYEVGDINSGNGLVRLFDRWLGRHGKLYGSHEEKARRLQIFRTNLQY 73

Query: 65  VERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTG-----FKMSDHSSSLKANGTPFLYK 119
           +   N  +  N S+ L LNKFADLT +EF     G     ++    +    A   P L +
Sbjct: 74  IHAHNKNS--NSSFRLGLNKFADLTNEEFKTRYFGKNSKQWRDRRRTELEGAELRPVLKQ 131

Query: 120 S-------SQVPPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSE 165
           +         +  S++W +KGAVT VK Q QC          A+EG+N I   +LVSLSE
Sbjct: 132 TVGSQSSSCSIASSLDWRKKGAVTGVKDQAQCGSCWAFSTTGAIEGVNFISTGKLVSLSE 191

Query: 166 QQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQ 225
           Q+LV C  +  N GC GG MD AF ++IQN GI  +  YSY G+ +  C++ K       
Sbjct: 192 QELVAC--DATNYGCEGGDMDYAFTWVIQNGGIDTEKDYSYTGVDS-TCNTNKEAKKIVS 248

Query: 226 ITNYEDVPPNDEESLLKAVANQPVSVAIDASAL--QFYSGGVFNGYCE---TFLNHGVTA 280
           I  Y DV P D+ +LL A  +QPVSV ID SA+  Q Y+GG+++G C      ++H V  
Sbjct: 249 IDGYTDVSP-DDSALLCAAGSQPVSVGIDGSAIDFQLYTGGIYDGDCSGNPDDIDHAVLV 307

Query: 281 VGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVSKESAQPS 340
           VGY +++ G  YW++KNSWG DWG +GYF + R+ + P G C I   AS+P   ES+  S
Sbjct: 308 VGY-SAKNGKDYWIVKNSWGTDWGLEGYFYILRNTELPYGVCAINAMASYPTKTESSVQS 366


>gi|332375975|gb|AEE63128.1| unknown [Dendroctonus ponderosae]
          Length = 338

 Score =  219 bits (557), Expect = 2e-54,   Method: Compositional matrix adjust.
 Identities = 136/342 (39%), Positives = 191/342 (55%), Gaps = 25/342 (7%)

Query: 8   VVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVER 67
           +VL+++ + A QA   +F E  + E++  +K Q+ + Y+   E   R +IF DN   V +
Sbjct: 4   LVLLVTIAVACQAV--SFSE-LVQEQWNSFKVQHKKQYESETEERFRMKIFMDNSHKVAK 60

Query: 68  FNNA-AIGNRSYTLRLNKFADLTPQEFIASQTGFKMSD---HSSSLKANGTPFLYKSSQV 123
            N     G   Y L +NK+ DL   EF+    GF  +        L+ + T        +
Sbjct: 61  HNKLFEQGLYPYKLAMNKYGDLLHHEFVGLLNGFNRTKTYLKRGELQDSITFIEPAHVDI 120

Query: 124 PPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDN 176
           P +V+W ++GAVTPVK QG C       A  A+EG +  +  +LVSLSEQ LVDC++   
Sbjct: 121 PDTVDWRQEGAVTPVKDQGHCGSCWSFSATGALEGQHFRQTKKLVSLSEQNLVDCSSRFG 180

Query: 177 NNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPND 236
           NNGC GG MD+AF+YI  N GI  +A Y Y G          A++  A    + D+P  D
Sbjct: 181 NNGCNGGLMDNAFRYIKNNGGIDTEAAYPYMGEDEKF--RYSAKNRGATDKGFVDIPSGD 238

Query: 237 EESLLKAVAN-QPVSVAIDAS--ALQFYSGGVF-NGYC-ETFLNHGVTAVGYGTSEE-GI 290
           E+ L  AVA   P+S+AIDAS  + Q YS GV+ +  C  T L+HGV  VGYGT E+ G+
Sbjct: 239 EDKLKAAVATVGPISIAIDASHESFQLYSNGVYSDPTCSSTELDHGVLVVGYGTDEKTGM 298

Query: 291 KYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
            YWL+KNSWG  WG DGY ++ R+ D    QCG+A  AS+P+
Sbjct: 299 DYWLVKNSWGDTWGLDGYIKMARNQDN---QCGVATQASYPL 337


>gi|18202415|sp|P82474.1|CPGP2_ZINOF RecName: Full=Zingipain-2; AltName: Full=Cysteine proteinase GP-II
 gi|6137410|pdb|1CQD|A Chain A, The 2.1 Angstrom Structure Of A Cysteine Protease With
           Proline Specificity From Ginger Rhizome, Zingiber
           Officinale
 gi|6137411|pdb|1CQD|B Chain B, The 2.1 Angstrom Structure Of A Cysteine Protease With
           Proline Specificity From Ginger Rhizome, Zingiber
           Officinale
 gi|6137412|pdb|1CQD|C Chain C, The 2.1 Angstrom Structure Of A Cysteine Protease With
           Proline Specificity From Ginger Rhizome, Zingiber
           Officinale
 gi|6137413|pdb|1CQD|D Chain D, The 2.1 Angstrom Structure Of A Cysteine Protease With
           Proline Specificity From Ginger Rhizome, Zingiber
           Officinale
          Length = 221

 Score =  219 bits (557), Expect = 2e-54,   Method: Compositional matrix adjust.
 Identities = 113/221 (51%), Positives = 145/221 (65%), Gaps = 14/221 (6%)

Query: 123 VPPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATND 175
           +P S++W E GAV PVK QG C        VAAVEGIN I    L+SLSEQQLVDC T  
Sbjct: 3   LPDSIDWRENGAVVPVKNQGGCGSCWAFSTVAAVEGINQIVTGDLISLSEQQLVDCTTA- 61

Query: 176 NNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPN 235
            N+GC GG+M+ AF++I+ N GI ++  Y Y G   GIC+S         I +YE+VP +
Sbjct: 62  -NHGCRGGWMNPAFQFIVNNGGINSEETYPYRGQD-GICNS-TVNAPVVSIDSYENVPSH 118

Query: 236 DEESLLKAVANQPVSVAIDASA--LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYW 293
           +E+SL KAVANQPVSV +DA+    Q Y  G+F G C    NH +T VGYGT E    +W
Sbjct: 119 NEQSLQKAVANQPVSVTMDAAGRDFQLYRSGIFTGSCNISANHALTVVGYGT-ENDKDFW 177

Query: 294 LIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVSK 334
           ++KNSWG++WGE GY R +R+I+ P G+CGI  FAS+PV K
Sbjct: 178 IVKNSWGKNWGESGYIRAERNIENPDGKCGITRFASYPVKK 218


>gi|4469155|emb|CAB38315.1| chymopapain isoform III [Carica papaya]
          Length = 361

 Score =  219 bits (557), Expect = 2e-54,   Method: Compositional matrix adjust.
 Identities = 136/344 (39%), Positives = 184/344 (53%), Gaps = 25/344 (7%)

Query: 5   FLIVVLIISGSCASQATYR---TFDEGSIAEK----FEQWKAQYGRTYKESAENSKRFEI 57
           FL   LII    +S   Y    + D+ +  E+    F+ W  ++ + Y+   E   RFEI
Sbjct: 12  FLATCLIIHMGLSSADFYTVGYSQDDLTSIERLIQLFDSWMLKHNKIYESIDEKIYRFEI 71

Query: 58  FKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFL 117
           F+DNL+ ++  N     N SY L LN FADL+  EF     GF   D +     +   F 
Sbjct: 72  FRDNLMYIDETNKK---NNSYWLGLNGFADLSNDEFKKKYVGFVAEDFTGLEHFDNEDFT 128

Query: 118 YKS-SQVPPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLV 169
           YK  +  P S++W  KGAVTPVK QG C        +A VEGIN I    L+ LSEQ+LV
Sbjct: 129 YKHVTNYPQSIDWRAKGAVTPVKNQGACGSCWAFSTIATVEGINKIVTGNLLELSEQELV 188

Query: 170 DCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNY 229
           DC  + ++ GC GG+   + +Y+  N G+    VY  +      C +        +IT Y
Sbjct: 189 DC--DKHSYGCKGGYQTTSLQYVANN-GVHTSKVYPCQAKQYK-CRATDKPGPKVKITGY 244

Query: 230 EDVPPNDEESLLKAVANQPVSVAIDASA--LQFYSGGVFNGYCETFLNHGVTAVGYGTSE 287
           + VP N E S L A+ANQP+S  ++A     Q Y  GVF+G C T L+H VTAVGYGTS+
Sbjct: 245 KRVPSNCETSFLGALANQPLSFLVEAGGKPFQLYKSGVFDGPCGTKLDHAVTAVGYGTSD 304

Query: 288 EGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFP 331
            G  Y +IKNSWG +WGE GY RL+R     QG CG+   + +P
Sbjct: 305 -GKNYIIIKNSWGPNWGEKGYMRLKRQSGNSQGTCGVYKSSYYP 347


>gi|195056367|ref|XP_001995082.1| GH22826 [Drosophila grimshawi]
 gi|193899288|gb|EDV98154.1| GH22826 [Drosophila grimshawi]
          Length = 340

 Score =  218 bits (556), Expect = 2e-54,   Method: Compositional matrix adjust.
 Identities = 133/325 (40%), Positives = 186/325 (57%), Gaps = 31/325 (9%)

Query: 30  IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNA-AIGNRSYTLRLNKFADL 88
           I E+++ +K ++ + Y++  E   R +IF +N   + + N   A G  S+ + LNK+AD+
Sbjct: 24  IKEEWQTFKLEHRKQYQDETEERFRLKIFNENKHKIAKHNQLYAAGEVSFKMGLNKYADM 83

Query: 89  TPQEFIASQTGFKMSDH----SSSLKANGTPFLY-KSSQVPPSVNWIEKGAVTPVKYQGQ 143
              EF  +  GF  + H    +S     G  F+  +  ++P SV+W  KGAVT VK QG 
Sbjct: 84  LHHEFHETMNGFNYTLHKQLRASDATFTGVTFISPEHVKLPQSVDWRNKGAVTGVKDQGH 143

Query: 144 C-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNK 196
           C       +  A+EG +  K   L+SLSEQ LVDC+T   NNGC GG MD+AF+YI  N 
Sbjct: 144 CGSCWAFSSTGALEGQHFRKTGTLISLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKDNG 203

Query: 197 GITNDAVYSYEGMSTGICD----SIKAEDHAAQITNYEDVPPNDEESLLKAVAN-QPVSV 251
           GI  +  Y YEG+    C     +I A D       + D+P  DE+ L +AVA   PVSV
Sbjct: 204 GIDTEKSYPYEGIDDS-CHFNKGTIGATDRG-----FTDIPQGDEKKLAQAVATIGPVSV 257

Query: 252 AIDAS--ALQFYSGGVFN-GYCE-TFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDG 307
           AIDAS  + QFYS GV++   C+   L+HGV  VGYGT E G  YWL+KNSWG  WG+ G
Sbjct: 258 AIDASHESFQFYSTGVYDEPQCDPQNLDHGVLVVGYGTDENGKDYWLVKNSWGTTWGDKG 317

Query: 308 YFRLQRDIDQPQGQCGIAMFASFPV 332
           + ++ R+ D    QCGIA  +S+P+
Sbjct: 318 FIKMARNDDN---QCGIATASSYPL 339


>gi|302812789|ref|XP_002988081.1| hypothetical protein SELMODRAFT_183539 [Selaginella moellendorffii]
 gi|300144187|gb|EFJ10873.1| hypothetical protein SELMODRAFT_183539 [Selaginella moellendorffii]
          Length = 425

 Score =  218 bits (556), Expect = 2e-54,   Method: Compositional matrix adjust.
 Identities = 122/317 (38%), Positives = 181/317 (57%), Gaps = 19/317 (5%)

Query: 30  IAEKFEQWKAQYGRTYKES-AENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADL 88
           ++ ++  W A++G+    S +    RFE FK+N   +E  N A  G  SY L LN+F+DL
Sbjct: 9   LSGEYASWCAKFGKECASSNSLGDHRFETFKENFRYIEEHNRA--GKHSYRLGLNQFSDL 66

Query: 89  TPQEFIASQTGFKMSDHSSSL----KANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQC 144
           T +EF     G +     S +    + +     +++  +P SV+W + GAVT  K QG C
Sbjct: 67  TSEEFRQRFLGLRPDLIDSPVLKMPRDSDIEEGFQNVDLPASVDWRQHGAVTAPKDQGSC 126

Query: 145 A-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKG 197
                     A+EGIN I   +LVSLSEQ+L+DC     + GC GG M++A+++I++N G
Sbjct: 127 GGCWAFATTGAIEGINQIVTGQLVSLSEQELIDC-DKKADKGCDGGLMENAYQFIVENGG 185

Query: 198 ITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA 257
           +  +  Y Y   S   C+  K       I  Y+ +P  DE++LL AVA QPVSVAI+ ++
Sbjct: 186 LDTETDYPYHA-SESHCNMKKLNSRVVAIDGYKAIPEGDEQALLLAVAKQPVSVAIEGAS 244

Query: 258 --LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDI 315
              Q Y+ GVF G+C   +NHGV  VGYGT E+G+ YW++KNSW   WG+ G+ ++QR+ 
Sbjct: 245 KDFQHYASGVFTGHCGEEINHGVLIVGYGT-EDGLDYWIVKNSWAATWGDGGFVKMQRNT 303

Query: 316 DQPQGQCGIAMFASFPV 332
            +  G C I   AS+PV
Sbjct: 304 GKRGGLCSINTLASYPV 320


>gi|61661067|gb|AAX51229.1| cathepsin S cysteine protease [Paralichthys olivaceus]
          Length = 337

 Score =  218 bits (556), Expect = 3e-54,   Method: Compositional matrix adjust.
 Identities = 126/311 (40%), Positives = 177/311 (56%), Gaps = 21/311 (6%)

Query: 34  FEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFN-NAAIGNRSYTLRLNKFADLTPQE 92
           +E WK  +G+TY    E+ +R E+++ NL+ + + N  A++G ++Y L +N   DLT +E
Sbjct: 35  WELWKKSHGKTYPNEVEDVRRRELWERNLMLITKHNLEASMGLQTYDLSMNHMGDLTTEE 94

Query: 93  FIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQC-------A 145
            + S   +      + ++    PF+   + VP SV+W  +G VT VK QG C       A
Sbjct: 95  IMQS---YATLTPPADIQRAPAPFVGSGADVPVSVDWRLQGCVTSVKMQGSCGSCWAFSA 151

Query: 146 VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYS 205
             A+EG  A    +LV LS Q LVDC+    N GC GGFMD AF+Y+I NKGI ++A Y 
Sbjct: 152 AGALEGQLAKTTGKLVDLSPQNLVDCSLKYGNKGCNGGFMDRAFQYVIDNKGIDSEASYP 211

Query: 206 YEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVAN-QPVSVAIDAS--ALQFYS 262
           Y G       S      AA  + Y  +P  DE +L  A+A   P+SVAIDA+     FY 
Sbjct: 212 YRGQLQQC--SYNPSYRAANCSRYSFLPEGDEGALKNALATIGPISVAIDATRPTFAFYR 269

Query: 263 GGVFNG-YCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQ 321
            GV+N   C   +NHGV AVGYGT E G  YWL+KNSWG  +G+ GY R+ R+ +    Q
Sbjct: 270 SGVYNDPTCTQRVNHGVLAVGYGT-ESGQDYWLVKNSWGTSFGDKGYIRMSRNKND---Q 325

Query: 322 CGIAMFASFPV 332
           CGIA++ S+P+
Sbjct: 326 CGIALYCSYPI 336


>gi|186701255|gb|ACC91281.1| putative cysteine proteinase [Capsella rubella]
          Length = 324

 Score =  218 bits (556), Expect = 3e-54,   Method: Compositional matrix adjust.
 Identities = 129/339 (38%), Positives = 191/339 (56%), Gaps = 35/339 (10%)

Query: 1   MAKYFLIVVLIISGSCA-----SQATYRTFDEGSIAEKFEQWKAQYGRTYKES-AENSKR 54
           M    L+++ ++  S A     +    R+ +E  +   F+ W +++G+TY  +  +  +R
Sbjct: 9   MITLSLLIIFLLPPSSAMDLSVTSGGLRSNEE--VGFIFQTWMSKHGKTYTNALGDKEQR 66

Query: 55  FEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGT 114
           F+ FKDNL  +++ N     N SY L L +FADLT QE+    +G  +    + L+    
Sbjct: 67  FQNFKDNLRFIDQHNAK---NLSYRLGLTQFADLTVQEYQDLFSGRPIQKQKA-LRVTHR 122

Query: 115 PFLYKSSQVPPSVNWIEKGAVTPVKYQGQCAVAAVEGINAIKINRLVSLSEQQLVDCATN 174
                  Q+P SV+W +KGAV+ +K QG+C V   E IN I    L+SLSEQ+LVDC+ +
Sbjct: 123 YVPLAEDQLPQSVDWRQKGAVSEIKDQGRCTV---ESINKIVTGELISLSEQELVDCSID 179

Query: 175 DNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIK-AEDHAAQITNYEDVP 233
             N+GC GG MD AF+++I N G+   + Y Y+ +  G C+  +       +I  YEDVP
Sbjct: 180 --NHGCNGGLMDSAFQFLINNNGLEYQSDYPYQAVQ-GYCNHNQNTSKKVIKIDGYEDVP 236

Query: 234 PNDEESLLKAVANQPVSVAIDASALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYW 293
            N+E SL KAVA+QP               G++ G C T L+H V  VGYGT E G  YW
Sbjct: 237 ANNENSLQKAVAHQP---------------GIYTGPCGTDLDHAVVIVGYGT-ENGQDYW 280

Query: 294 LIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
           +++NSWG  WGE GY ++ R+ + P G CGIAM AS+P+
Sbjct: 281 IVRNSWGTVWGEAGYAKIARNFENPTGVCGIAMVASYPI 319


>gi|121543825|gb|ABM55577.1| putative cathepsin L-like protease [Maconellicoccus hirsutus]
          Length = 341

 Score =  218 bits (556), Expect = 3e-54,   Method: Compositional matrix adjust.
 Identities = 137/345 (39%), Positives = 185/345 (53%), Gaps = 27/345 (7%)

Query: 5   FLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVA 64
           FL  VLI   +  +  ++       IAE++E +K Q+ + Y    E   R ++F DN   
Sbjct: 6   FLCCVLIYHSNSVTAVSFNDL----IAEEWELFKTQFSKAYNTEIEEKFRMKVFMDNKHK 61

Query: 65  VERFNNA-AIGNRSYTLRLNKFADLTPQEFIASQTGFKMS-DHSSSLKANGTPFL-YKSS 121
           + R N     G  SY L +N F DL   EF+ +  G++ S    +  + +   F+   + 
Sbjct: 62  IARHNKLFQNGEVSYELEMNHFGDLLHHEFVKTVNGYRHSLRRVTGDEIDSVTFIPAYNV 121

Query: 122 QVPPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATN 174
            VP SV+W  +GAVT VK QGQC          ++EG +     +L SLSEQ L+DC+  
Sbjct: 122 TVPDSVDWRTEGAVTEVKNQGQCGSCWAFSTTGSLEGQHFRNTKQLTSLSEQNLIDCSGK 181

Query: 175 DNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPP 234
             NNGC GG MD+AF YI  NKGI  +  Y YEG+        K ++  A    + D+P 
Sbjct: 182 YGNNGCSGGLMDNAFAYIKSNKGIDTEQSYPYEGIDDKC--RYKPQESGATDKGFVDIPQ 239

Query: 235 NDEESLLKAVAN-QPVSVAIDAS--ALQFYSGGVFN----GYCETFLNHGVTAVGYGTSE 287
            DEE L  AVA   P+SVAIDAS  + QFY  GV+     G  E  L+HGV AVGYGT E
Sbjct: 240 GDEEKLKLAVATVGPISVAIDASHQSFQFYKKGVYYDKGCGNGEEDLDHGVLAVGYGT-E 298

Query: 288 EGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
            G  YWL+KNSWG+ WG DGY ++ R+       CGIA  AS+P+
Sbjct: 299 NGKDYWLVKNSWGKRWGLDGYIKMARN---KHNHCGIATSASYPL 340


>gi|344271925|ref|XP_003407787.1| PREDICTED: cathepsin L1-like [Loxodonta africana]
          Length = 333

 Score =  218 bits (555), Expect = 3e-54,   Method: Compositional matrix adjust.
 Identities = 132/340 (38%), Positives = 189/340 (55%), Gaps = 25/340 (7%)

Query: 8   VVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVER 67
           + L ++  C   A+     + S+  ++ QW++ Y + Y  + E+ +R  +++ N+  +ER
Sbjct: 3   LSLFLAALCLGIASAAPKFDQSLDAQWNQWRSTYKKVYAVNEEDWRR-AVWEKNMKMIER 61

Query: 68  FNNA-AIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPS 126
            N   + G   +T+ +N F D T +EF     GF+   H    K    P       +P S
Sbjct: 62  HNQEYSQGKHGFTMAMNAFGDKTNEEFRQLMNGFQSQKHKKG-KLFYEPVF---GHIPTS 117

Query: 127 VNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNG 179
           V+W +KG VTPVK QGQC       A  A+EG    K  +LVSLSEQ LVDC+  + N G
Sbjct: 118 VDWTQKGYVTPVKDQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSWREGNEG 177

Query: 180 CYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEES 239
           C GG MD+AF+Y+  N G+ ++  Y Y    T  C     +  AA  T + D+PP  E++
Sbjct: 178 CNGGLMDNAFQYVKDNGGLDSEESYPYTATDTQDC-RYNPKYSAANDTGFVDIPPQ-EKA 235

Query: 240 LLKAVAN-QPVSVAIDAS--ALQFYSGGV-FNGYCETFLNHGVTAVGY---GTSEEGIKY 292
           L+KAVA   P+SVAIDA   + QFYS G+ F+  C   +NHGV AVGY   GT  +  KY
Sbjct: 236 LMKAVATVGPISVAIDAGQVSFQFYSSGIYFDPACRLTVNHGVLAVGYGFEGTDPDKNKY 295

Query: 293 WLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
           WL+KNSWG+ WG DGY ++ +D +     CGIA  AS+P 
Sbjct: 296 WLVKNSWGKSWGADGYIKIAKDRNN---HCGIARAASYPT 332


>gi|357130488|ref|XP_003566880.1| PREDICTED: fruit bromelain-like [Brachypodium distachyon]
          Length = 356

 Score =  218 bits (555), Expect = 3e-54,   Method: Compositional matrix adjust.
 Identities = 126/325 (38%), Positives = 182/325 (56%), Gaps = 34/325 (10%)

Query: 33  KFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQE 92
           + E+W A++GR Y ++ E ++R E+F  N   V+  N A  GNR+YTL LNKF+DLT  E
Sbjct: 38  RHEEWMAKFGRVYTDAQEKARRQEVFGANARYVDAVNRA--GNRTYTLGLNKFSDLTDDE 95

Query: 93  FIASQTGFK---------MSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQ 143
           F+ +  G++           ++ S + A G    Y  + +P SV+W  +GAVT VK QG 
Sbjct: 96  FVQTHLGYRGHQQGGLRPEEENVSKVAALG----YGQADMPESVDWRAQGAVTGVKNQGS 151

Query: 144 C-------AVAAVEGINAIKINRLVSLSEQQLVDCATND----NNNGCYGGFMDDAFKYI 192
           C       AVAA EG+  I    L+S+SEQQ++DC        N N C GG +DDA +Y+
Sbjct: 152 CGCCWAFAAVAATEGLVKIATGNLISMSEQQVLDCTGQSPGMGNTNTCDGGHIDDALRYV 211

Query: 193 IQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKA-VANQPVSV 251
             ++G+  +A Y+Y G+  G C S    + AA     + V    +E  L+  VA QP++V
Sbjct: 212 AASRGLQPEAAYAYTGLQ-GACQSGFTPNSAASFGEPQTVTLQGDEGRLQGLVAGQPIAV 270

Query: 252 AIDASA-LQFYSGGVFNG---YCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDG 307
           +++AS   + Y  GVF      C   LNH VT VGYG+++ G +YWL+KN WG  WGE G
Sbjct: 271 SVEASDDFRHYMSGVFTAGTSSCGQRLNHAVTVVGYGSADGGQEYWLVKNQWGTSWGEGG 330

Query: 308 YFRLQRDIDQPQGQCGIAMFASFPV 332
           Y R+ R    P   CGI+ +A +P 
Sbjct: 331 YMRIARGNGAP--NCGISAYAYYPT 353


>gi|354549232|gb|AER27707.1| putative cysteine protease [Phytophthora sp. SH-2011]
          Length = 533

 Score =  218 bits (555), Expect = 3e-54,   Method: Compositional matrix adjust.
 Identities = 128/312 (41%), Positives = 176/312 (56%), Gaps = 21/312 (6%)

Query: 32  EKFEQWKAQYGRTYKESAENSKRFE--IFKDNLVAVERFNNAAIGNRSYTLRLNKFADLT 89
            +F  W   +G T+ ++ E ++R E  I  D  +      NA  G    TL  N F+ ++
Sbjct: 26  HEFSAWMGAHGVTFSDALEFARRLENYIVNDMYIMEHNAENAWTG---VTLGHNAFSHMS 82

Query: 90  PQEFIASQTGFKMSD-HSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQCA--- 145
             EF    TG  + + +     A+    L+   +VP +V+W++KG VTPVK QG C    
Sbjct: 83  FDEFKFKMTGLVLPEGYLEQRLASRVDGLWSDVEVPSAVDWVDKGGVTPVKNQGMCGSCW 142

Query: 146 ----VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITND 201
                 AVEG   +   +L SLSEQ+LVDC  N  + GC GG MD AF++I  + GI ++
Sbjct: 143 AFSTTGAVEGATFVSSGKLPSLSEQELVDCDHN-GDMGCNGGLMDHAFQWIEDHGGICSE 201

Query: 202 AVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDAS--ALQ 259
             Y Y+  +  +C   +  D   ++T ++DV P DE +L  AVA QPVSVAI+A   A Q
Sbjct: 202 DDYEYKAKAQ-VC---RECDSVVKVTGFQDVNPQDEHALKVAVAQQPVSVAIEADQKAFQ 257

Query: 260 FYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQ 319
           FY  GVFN  C T L+HGV AVGYG ++ G K+W +KNSWG  WGE GY RL R+ + P 
Sbjct: 258 FYKSGVFNLTCGTRLDHGVLAVGYG-NDNGHKFWKVKNSWGASWGEQGYIRLAREENGPA 316

Query: 320 GQCGIAMFASFP 331
           GQCGIA   S+P
Sbjct: 317 GQCGIASVPSYP 328


>gi|402770511|gb|AFQ98390.1| cathepsin L [Rhipicephalus microplus]
 gi|402770513|gb|AFQ98391.1| cathepsin L [Rhipicephalus microplus]
          Length = 332

 Score =  218 bits (555), Expect = 4e-54,   Method: Compositional matrix adjust.
 Identities = 142/344 (41%), Positives = 191/344 (55%), Gaps = 36/344 (10%)

Query: 6   LIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAV 65
           L  ++ ++ + +SQ   RT        ++E +K  + +TY+   E   RF+IF +N + +
Sbjct: 7   LCAIVAVTVAASSQEILRT--------QWEAFKTTHKKTYQSHMEELLRFKIFTENSLII 58

Query: 66  ERFN-NAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFL----YKS 120
            + N   A G  SY L +N+F DL   EF     G     H  + K  G+ FL       
Sbjct: 59  AKHNAKYAKGLVSYKLGMNQFGDLLAHEFARIFNG-----HHGTRKTGGSSFLPPANVND 113

Query: 121 SQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCAT 173
           S +P  V+W +KGAVTPVK QGQC       A  ++EG + +K   LVSLSEQ LVDC+ 
Sbjct: 114 SSLPKVVDWRKKGAVTPVKDQGQCGSCWAFSATGSLEGQHFLKNGELVSLSEQNLVDCSQ 173

Query: 174 NDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVP 233
           +  NNGC GG M+DAFKYI  N GI  +  Y YE +  G C   K ED  A  T Y ++ 
Sbjct: 174 SFGNNGCEGGLMEDAFKYIKANDGIDTEKSYPYEAVD-GEC-RFKKEDVGATDTGYVEIK 231

Query: 234 PNDEESLLKAVAN-QPVSVAIDA--SALQFYSGGVFN-GYCETF-LNHGVTAVGYGTSEE 288
              E  L KAVA   P+SVAIDA  S+ Q YS GV++   C +  L+HGV  VGYG  + 
Sbjct: 232 AGSEVDLKKAVATVGPISVAIDASHSSFQLYSEGVYDEPECSSEDLDHGVLVVGYGV-KG 290

Query: 289 GIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
           G KYWL+KNSW + WG+ GY  + RD +    QCGIA  AS+P+
Sbjct: 291 GKKYWLVKNSWAESWGDQGYILMSRDNNN---QCGIASQASYPL 331


>gi|308810026|ref|XP_003082322.1| cysteine protease-1 (ISS) [Ostreococcus tauri]
 gi|116060790|emb|CAL57268.1| cysteine protease-1 (ISS) [Ostreococcus tauri]
          Length = 430

 Score =  218 bits (554), Expect = 4e-54,   Method: Compositional matrix adjust.
 Identities = 133/339 (39%), Positives = 184/339 (54%), Gaps = 39/339 (11%)

Query: 29  SIAEKFEQWKAQYG--RTYKESAENSKRFEIFKDNLVAVERFNNA-AIGNRSYTLRLNKF 85
           ++A  FE+W +++G  R  +++ E +KR   F +N   V   N   AIG  S+ + LN  
Sbjct: 93  ALARHFERWCSEHGLERYLRDTEEYAKRLATFAENAAYVVEHNALYAIGEVSHWVGLNSL 152

Query: 86  ADLTPQEFIASQTGFKMSDHSSS----LKANGT--------PFLYKSSQVPPSVNWIEKG 133
           A  T +E+ A   G+K    SS     L+A  T         + Y S   P +++W+E G
Sbjct: 153 AATTREEYRA-LLGYKPELRSSGDAEMLEATSTDKVEQYKASWEYASVDPPEAIDWVELG 211

Query: 134 AVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMD 186
           AVTP K QGQC          AVEGI  I+  RLVSLSEQ++V C+    N GC GG MD
Sbjct: 212 AVTPPKNQGQCGSCWAFSTTGAVEGITKIRTGRLVSLSEQEMVSCS--KQNMGCNGGLMD 269

Query: 187 DAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVAN 246
            AF++I++N GI ++  Y Y   +   C+  K + H A I  ++DVPP DE+ L KAV+ 
Sbjct: 270 YAFRWIVKNGGIDSEFQYPYSAEALA-CNRWKLQLHVATIDGFKDVPPGDEKELEKAVSQ 328

Query: 247 QPVSVAI--DASALQFYSGGVFNGY-CETFLNHGVTAVGYG---TSEEGIK-------YW 293
           QPVS+AI  D  + Q Y GGV++   C + ++HGV  VGYG   T     K       +W
Sbjct: 329 QPVSIAIEADTKSFQLYDGGVYDSKECGSQVDHGVLVVGYGFDDTHHNATKHHKRHRHFW 388

Query: 294 LIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
            +KNSWG  WGE G+ R+ R I    GQCGI    S+P 
Sbjct: 389 KVKNSWGGTWGEGGFIRMARRISDETGQCGITTAPSYPT 427


>gi|94448674|emb|CAI91575.1| cathepsin L2 [Lubomirskia baicalensis]
          Length = 324

 Score =  218 bits (554), Expect = 4e-54,   Method: Compositional matrix adjust.
 Identities = 137/332 (41%), Positives = 187/332 (56%), Gaps = 30/332 (9%)

Query: 16  CASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGN 75
           C   A    FD    +++   WKA++G++Y+   E   R   ++ N   ++  N  A G 
Sbjct: 7   CTLIAAVAAFD---FSKELRAWKAEHGKSYRNHKEEMLRHVTWQANKKYIDEHNQHA-GV 62

Query: 76  RSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFL--YKSSQVPPSVNWIEKG 133
             YTL++N+F DL   EF +   G++MS+        G PF+   +   +P SV+W +KG
Sbjct: 63  FGYTLKMNQFGDLENSEFKSLYNGYRMSN----APRKGKPFVPAARVQDLPASVDWSKKG 118

Query: 134 AVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMD 186
            VTPVK QGQC       A  ++EG +      L+SLSEQ LVDC+  + N+GC GG MD
Sbjct: 119 WVTPVKNQGQCGSCWSFSATGSMEGQHFNATGTLMSLSEQNLVDCSAAEGNHGCNGGLMD 178

Query: 187 DAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVAN 246
           DAF+Y+I+N GI  +A Y Y  + +  C      D  A I+ Y DV  + E  L  AVA 
Sbjct: 179 DAFEYVIKNNGIDTEASYPYRAVDS-TC-KFNTADVGATISGYVDVTKDSESDLQVAVAT 236

Query: 247 -QPVSVAIDAS--ALQFYSGGVFNGYC--ETFLNHGVTAVGYGTSEEGIK-YWLIKNSWG 300
             PVSVAIDAS  + QFYS GV++      T L+HGV AVGYGT  +G K YWL+KNSWG
Sbjct: 237 IGPVSVAIDASHISFQFYSSGVYDPLICSSTNLDHGVLAVGYGT--DGSKDYWLVKNSWG 294

Query: 301 QDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
             WG  GY  + R+ +    +CGIA  AS+PV
Sbjct: 295 ASWGMSGYIEMVRNHNN---KCGIATSASYPV 323


>gi|66378053|gb|AAY45871.1| cathepsin L-like cysteine proteinase [Longidorus elongatus]
          Length = 358

 Score =  218 bits (554), Expect = 5e-54,   Method: Compositional matrix adjust.
 Identities = 130/318 (40%), Positives = 187/318 (58%), Gaps = 24/318 (7%)

Query: 37  WKAQYGRTYKESAENSKRFEIFKDNLVAVERFN-NAAIGNRSYTLRLNKFADLTPQEFIA 95
           +K ++ ++YK   E   RF++F  N   +E+ N     G  S+ L LNKFAD+T  EF  
Sbjct: 46  FKLKHAKSYKTKDEELLRFQVFASNHKVIEQHNIEYEAGQHSFALSLNKFADMTNAEFRQ 105

Query: 96  SQTGFKMSDH-----SSSLKANGTPF-LYKSSQVPPSVNWIEKGAVTPVKYQGQC----- 144
              GFK+        S  LK +G  F +  +  +P SV+W ++G VT VK QG C     
Sbjct: 106 RMNGFKLPAKRKLAKSQPLKEDGMIFEMPDNVTIPDSVDWRKEGYVTKVKDQGSCGSCWA 165

Query: 145 --AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDA 202
             A  ++EG +  +  +LVSLSEQ LVDC  N ++ GC GG+MD AF+Y+  NKGI  +A
Sbjct: 166 FSATGSLEGQHYKQTGKLVSLSEQNLVDCDVNGDDEGCNGGYMDGAFQYVETNKGIDTEA 225

Query: 203 VYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVAN-QPVSVAIDASA--LQ 259
            Y Y+G   G C   K+ED  A  T + D+P  +E  L  A+A   PVSVAIDA++   Q
Sbjct: 226 SYPYKGRD-GRC-RFKSEDVGATDTGFVDIPEGNETLLEAAIATVGPVSVAIDAASFKFQ 283

Query: 260 FYSGGV-FNGYCET-FLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQ 317
           FYS GV ++  C   +L+HGV AVGY ++++G +Y+++KNSW +DWG+DGY  + R   +
Sbjct: 284 FYSHGVYYDRSCSPEYLDHGVLAVGYNSTKDGKQYYIVKNSWSEDWGDDGYILMSR---R 340

Query: 318 PQGQCGIAMFASFPVSKE 335
               CGIA  AS+P  ++
Sbjct: 341 KNNNCGIATMASYPFVQQ 358


>gi|242072388|ref|XP_002446130.1| hypothetical protein SORBIDRAFT_06g002130 [Sorghum bicolor]
 gi|241937313|gb|EES10458.1| hypothetical protein SORBIDRAFT_06g002130 [Sorghum bicolor]
          Length = 276

 Score =  218 bits (554), Expect = 5e-54,   Method: Compositional matrix adjust.
 Identities = 126/286 (44%), Positives = 170/286 (59%), Gaps = 41/286 (14%)

Query: 59  KDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLY 118
           +DN+  VE FN  A  N  + L +N+FADLT +EF A++ GFK    +S+ K   T F Y
Sbjct: 19  RDNVAFVESFN--ANKNNKFWLGVNQFADLTTEEFKANK-GFK---PTSAEKVPTTGFKY 72

Query: 119 KS---SQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQL 168
           ++   S +P +V+W  KGAVTP+K QGQC       AVAA+EGI  +    L+SLS+Q+L
Sbjct: 73  ENLSVSALPTAVDWRTKGAVTPIKNQGQCGCCWAFSAVAAMEGIVKLSTGNLISLSKQEL 132

Query: 169 VDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITN 228
           VDC T+  + GC                    +    Y+ +  G C        AA I  
Sbjct: 133 VDCDTHSMDEGC--------------------EVQLPYKAVD-GKCKG--GSKSAATIKG 169

Query: 229 YEDVPPNDEESLLKAVANQPVSVAIDASALQF--YSGGVFNGYCETFLNHGVTAVGYGTS 286
           +EDVP N+E +L+KAVANQPVSVA+DAS   F  YSGGV  G C T L+HG+ A+GYG  
Sbjct: 170 HEDVPVNNEAALMKAVANQPVSVAVDASDRTFMLYSGGVMTGSCGTELDHGIAAIGYGME 229

Query: 287 EEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
            +G KYW++KNSWG  WGE G+ R+++DI   +G CG+AM  S+P 
Sbjct: 230 SDGTKYWILKNSWGTTWGEKGFLRMEKDITDKRGMCGLAMKPSYPT 275


>gi|334332714|ref|XP_001367224.2| PREDICTED: cathepsin L1-like [Monodelphis domestica]
          Length = 335

 Score =  217 bits (553), Expect = 5e-54,   Method: Compositional matrix adjust.
 Identities = 136/349 (38%), Positives = 193/349 (55%), Gaps = 32/349 (9%)

Query: 1   MAKYFLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKD 60
           M  Y  +  L +  + A     R  D      ++ QWKAQ+G++Y  + E+S R   ++ 
Sbjct: 1   MNFYLCLASLCLGLAAAIPPFDRALDS-----QWHQWKAQHGKSYAAN-EDSWRRATWEK 54

Query: 61  NLVAVERFNNA-AIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYK 119
           NL  +ER N   + G  S+ LR+NKF D++ +EF     G+K +      K +    LY+
Sbjct: 55  NLKMIERHNQEYSAGKHSFQLRMNKFGDMSTEEFKQVMNGYKSNGSQKRTKGS----LYR 110

Query: 120 SS---QVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLV 169
            S   Q+P SV+W EKG VTPVK Q  C       A  A+EG    K  +LVSLS Q LV
Sbjct: 111 ESLLAQLPESVDWREKGYVTPVKEQRGCYSCWAFSAAGAIEGQWFRKTGKLVSLSVQNLV 170

Query: 170 DCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNY 229
           DC+  + NNGC GG M +AF+Y+  N GI  +  Y Y           + E   A +T +
Sbjct: 171 DCSIPEGNNGCDGGLMGNAFQYVQDNGGIDTEECYPYVAQDNEC--KYQPECSGANVTGF 228

Query: 230 EDVPPNDEESLLKAVAN-QPVSVAIDAS--ALQFYSGGV-FNGYCETF-LNHGVTAVGYG 284
             +P  DE +L+KAVAN  P+SVAIDA   + +FY  GV ++  C +  LNHGV  VGYG
Sbjct: 229 VKIPSTDERALMKAVANVGPISVAIDAGNPSFKFYQSGVYYDPQCSSSQLNHGVLVVGYG 288

Query: 285 T-SEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
           +  + G KYW++KNSWG++WG++GY  + +D D     CGI   AS+P+
Sbjct: 289 SEGKNGRKYWIVKNSWGENWGDNGYVLMAKDEDN---HCGIITDASYPI 334


>gi|66735056|gb|AAY53767.1| cysteine protease [Saprolegnia parasitica]
          Length = 523

 Score =  217 bits (553), Expect = 5e-54,   Method: Compositional matrix adjust.
 Identities = 129/319 (40%), Positives = 179/319 (56%), Gaps = 20/319 (6%)

Query: 27  EGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFA 86
           + S   KF  W  ++        E   RFE+F  N   +E  N  A  + S+T+  N+++
Sbjct: 21  DASYEAKFLSWMKKFAVKL-NPLEWVHRFEVFILNDQRIEAHNKDA--SSSFTMGHNEYS 77

Query: 87  DLTPQEFIASQTGFKMSD---HSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQ 143
            LT  EF   +TG ++S     S +  A   P +   + VP  ++W+E+G VTPVK QG 
Sbjct: 78  HLTFDEFKKLRTGLRVSPSYIQSRAKYALMAPAV-NMTDVPNEMDWVEQGGVTPVKNQGM 136

Query: 144 CA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNK 196
           C          A+EG   +   +LVS+SEQ+LVDC  N  + GC GG MD+AFK++  +K
Sbjct: 137 CGSCWAFSTTGAIEGAAFVSSKQLVSVSEQELVDCDHN-GDMGCNGGLMDNAFKWVKTHK 195

Query: 197 GITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDAS 256
           G+  +  Y Y     G C ++K      ++T + DVP NDE++L  AVA QPVSVAI+A 
Sbjct: 196 GLCKEEDYPYHA-KEGTC-ALKKCKPVTKVTAFHDVPANDEQALKAAVAKQPVSVAIEAD 253

Query: 257 --ALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRD 314
               QFY  GVF+  C T L+HGV  VGYG  E G KYW +KNSWG DWG+ GY +L R+
Sbjct: 254 QPEFQFYKSGVFDKSCGTKLDHGVLVVGYG-EEGGKKYWKVKNSWGADWGDKGYIKLARE 312

Query: 315 IDQPQGQCGIAMFASFPVS 333
                GQCG+AM  S+P +
Sbjct: 313 FGPETGQCGVAMVPSYPTA 331


>gi|226499806|ref|NP_001151335.1| cysteine protease 1 [Zea mays]
 gi|195645896|gb|ACG42416.1| cysteine protease 1 precursor [Zea mays]
          Length = 258

 Score =  217 bits (553), Expect = 5e-54,   Method: Compositional matrix adjust.
 Identities = 128/267 (47%), Positives = 165/267 (61%), Gaps = 22/267 (8%)

Query: 80  LRLNKFADLTPQEFIASQTGFKMSDHSSSLKAN---GTPFLYKSSQVPPSVNWIEKGAVT 136
           + LN+FAD+T  EF+A  TG +     +   A    G   L  +     +V+W +KGAVT
Sbjct: 1   MELNEFADMTNDEFMAMYTGLRPVPAGAKKMAGFKYGNVTLSDADDDQQTVDWRQKGAVT 60

Query: 137 PVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAF 189
            +K Q QC       AVAAVEGI+ I    LVSLSEQQ++DC T D NNGC GG++D+AF
Sbjct: 61  GIKDQRQCGCCWAFAAVAAVEGIHQITTGNLVSLSEQQVLDCDT-DGNNGCNGGYIDNAF 119

Query: 190 KYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPV 249
           +YI+ N G+  +  Y Y   +  +C S++     A I+ Y+DVP  DE +L  AVANQPV
Sbjct: 120 QYIVGNGGLATEDAYPYTA-AQAMCQSVQP---VAAISGYQDVPSGDEAALAAAVANQPV 175

Query: 250 SVAIDASALQFYSGGVFNGY-CET--FLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGED 306
           SVAIDA   Q Y GGV     C T   LNH VTAVGYGT+E+G  YWL+KN WGQ+WGE 
Sbjct: 176 SVAIDAHNFQLYGGGVMTAASCSTPPNLNHAVTAVGYGTAEDGTPYWLLKNQWGQNWGEG 235

Query: 307 GYFRLQRDIDQPQGQCGIAMFASFPVS 333
           GY RL+R  +     CG+A  AS+PV+
Sbjct: 236 GYLRLERGAN----ACGVAQQASYPVA 258


>gi|33348834|gb|AAQ16117.1| cathepsin L-like cysteine proteinase A [Rhipicephalus
           haemaphysaloides haemaphysaloides]
          Length = 332

 Score =  217 bits (553), Expect = 6e-54,   Method: Compositional matrix adjust.
 Identities = 138/345 (40%), Positives = 192/345 (55%), Gaps = 36/345 (10%)

Query: 5   FLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVA 64
            L  ++ ++ +  S    RT        ++E +K  + ++Y+   E   RF+IF +N + 
Sbjct: 6   LLCAIVAVTVAANSHEILRT--------QWEAFKTTHKKSYESHMEELLRFKIFTENSLI 57

Query: 65  VERFN-NAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFL----YK 119
           + + N   A G  SY L +N+F DL   EF     G++         + G+ F+      
Sbjct: 58  IAKHNAKYAKGLVSYKLGMNQFGDLLAHEFAKIFNGYR-----GQRTSRGSTFMPPANVN 112

Query: 120 SSQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCA 172
            S +P +V+W +KGAVTPVK QGQC       A  ++EG + +K   LVSLSEQ LVDC+
Sbjct: 113 DSSLPSTVDWRKKGAVTPVKDQGQCGSCWAFSATGSLEGQHFLKDGELVSLSEQNLVDCS 172

Query: 173 TNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDV 232
            +  NNGC GG MD+AFKYI  N GI  +  Y YE M        K ED  A  T + D+
Sbjct: 173 QSFGNNGCEGGLMDNAFKYIKANDGIDAEESYPYEAMDDKC--RFKKEDVGATDTGFVDI 230

Query: 233 PPNDEESLLKAVAN-QPVSVAIDA--SALQFYSGGVFN-GYCET-FLNHGVTAVGYGTSE 287
               E+ L KAVA   P+SVAIDA  S+ Q YS GV++   C +  L+HGV AVGYG  +
Sbjct: 231 EGGSEDDLKKAVATVGPISVAIDAGHSSFQLYSEGVYDEPECSSEELDHGVLAVGYGV-K 289

Query: 288 EGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
           +G KYWL+KNSWG  WG++GY  + RD +    QCGIA  AS+P+
Sbjct: 290 DGKKYWLVKNSWGGSWGDNGYILMSRDKNN---QCGIASAASYPL 331


>gi|156371477|ref|XP_001628790.1| predicted protein [Nematostella vectensis]
 gi|156215775|gb|EDO36727.1| predicted protein [Nematostella vectensis]
          Length = 330

 Score =  217 bits (553), Expect = 6e-54,   Method: Compositional matrix adjust.
 Identities = 135/321 (42%), Positives = 181/321 (56%), Gaps = 25/321 (7%)

Query: 25  FDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNK 84
           FDE    ++++ WK  + + Y    E   R  I++DNL  +++ N       S+TL +N 
Sbjct: 21  FDEDE--QQWQAWKLFHTKKYTTVTEEGARKAIWRDNLKKIQKHNAEG---HSFTLAMNH 75

Query: 85  FADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSS-QVPPSVNWIEKGAVTPVKYQGQ 143
             DLT  EF    TG + S +S+  K  G+ FL  S  QVP +V+W ++G VTPVK QGQ
Sbjct: 76  LGDLTQDEFRYFYTGMR-SHYSNYTKKQGSAFLAPSHVQVPDTVDWRKEGYVTPVKNQGQ 134

Query: 144 CA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNK 196
           C          ++EG N  K  +LVSLSEQ LVDC+T   NNGC GG MD AFKYI +N 
Sbjct: 135 CGSCWAFSTTGSLEGQNFKKTGKLVSLSEQNLVDCSTAYGNNGCQGGLMDYAFKYIKENG 194

Query: 197 GITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVAN-QPVSVAIDA 255
           GI  +  Y YE  +       +  +  A  T + DV   DEE+L  A     P+SVAIDA
Sbjct: 195 GIDTEESYPYEARNDRC--RFQKSNIGAVDTGFVDVTHGDEEALKTAAGTVGPISVAIDA 252

Query: 256 SAL--QFYSGGVFN--GYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRL 311
             +  QFY  GV+N  G   T L+HGV  VGYGT  +G  YWL+KNSWG+ WG +GY  +
Sbjct: 253 GHMSFQFYHSGVYNNAGCSSTSLDHGVLVVGYGTY-QGSDYWLVKNSWGERWGMEGYIMM 311

Query: 312 QRDIDQPQGQCGIAMFASFPV 332
            R+ +    QCG+A  AS+P+
Sbjct: 312 SRNKNN---QCGVATQASYPL 329


>gi|254674508|dbj|BAH86062.1| cysteine protease [Haemaphysalis longicornis]
          Length = 333

 Score =  217 bits (553), Expect = 6e-54,   Method: Compositional matrix adjust.
 Identities = 137/341 (40%), Positives = 184/341 (53%), Gaps = 27/341 (7%)

Query: 5   FLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVA 64
           FL   +  + + +SQ   RT        ++E +K+Q+ + Y    E   RF+IF +N + 
Sbjct: 6   FLCGCVAAAIAASSQEILRT--------EWEAFKSQHNKAYSSHVEELLRFKIFTENTLL 57

Query: 65  VERFN-NAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQV 123
           V + N   A G  SY L +NKF DL P EF     G++   +         P     S +
Sbjct: 58  VAKHNAKYAKGLVSYKLAMNKFGDLLPHEFAKMVNGYRGKQNKEQRPTFIPPANLNDSSL 117

Query: 124 PPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDN 176
           P +V+W +KGAVTPVK QGQC          ++EG +  K  +LVSLSEQ LVDC+ +  
Sbjct: 118 PTTVDWRKKGAVTPVKNQGQCGSCWAFSTTGSLEGQHFRKTGKLVSLSEQNLVDCSDDFG 177

Query: 177 NNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPND 236
           N GC GG MD+ F+YI  N GI  +  + Y     G C   K  D  A    + D+    
Sbjct: 178 NQGCNGGLMDNGFQYIKANGGIDTEESHPYTAQD-GDC-KFKKADVGATDAGFVDIQQGS 235

Query: 237 EESLLKAVAN-QPVSVAIDAS--ALQFYSGGVFN--GYCETFLNHGVTAVGYGTSEEGIK 291
           E+ L KAVA   PVSVAIDAS  + Q YS GV++      + L+HGV  VGYG  + G K
Sbjct: 236 EDDLKKAVATVGPVSVAIDASHGSFQLYSQGVYDEPDCSSSQLDHGVLTVGYGV-KNGKK 294

Query: 292 YWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
           YWL+KNSWG DWG++GY  + RD D    QCGIA  AS+P+
Sbjct: 295 YWLVKNSWGGDWGDNGYILMSRDKDN---QCGIASSASYPL 332


>gi|402770515|gb|AFQ98392.1| cathepsin L [Rhipicephalus microplus]
          Length = 332

 Score =  217 bits (553), Expect = 7e-54,   Method: Compositional matrix adjust.
 Identities = 142/344 (41%), Positives = 190/344 (55%), Gaps = 36/344 (10%)

Query: 6   LIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAV 65
           L  +  ++ + +SQ   RT        ++E +K  + +TY+   E   RF+IF +N + +
Sbjct: 7   LCAIAAVTVAASSQEILRT--------QWEAFKTTHKKTYQSHMEELLRFKIFTENSLII 58

Query: 66  ERFN-NAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFL----YKS 120
            + N   A G  SY L +N+F DL   EF     G     H  + K  G+ FL       
Sbjct: 59  AKHNAKYAKGLVSYKLGMNQFGDLLAHEFARIFNG-----HHGTRKTGGSSFLPPANVND 113

Query: 121 SQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCAT 173
           S +P  V+W +KGAVTPVK QGQC       A  ++EG + +K   LVSLSEQ LVDC+ 
Sbjct: 114 SSLPKVVDWRKKGAVTPVKDQGQCGSCWAFSATGSLEGQHFLKNGELVSLSEQNLVDCSQ 173

Query: 174 NDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVP 233
           +  NNGC GG M+DAFKYI  N GI  +  Y YE +  G C   K ED  A  T Y ++ 
Sbjct: 174 SFGNNGCEGGLMEDAFKYIKANDGIDTEKSYPYEAVD-GEC-RFKKEDVGATDTGYVEIK 231

Query: 234 PNDEESLLKAVAN-QPVSVAIDA--SALQFYSGGVFN-GYCETF-LNHGVTAVGYGTSEE 288
              E  L KAVA   P+SVAIDA  S+ Q YS GV++   C +  L+HGV  VGYG  + 
Sbjct: 232 AGSEVDLKKAVATVGPISVAIDASHSSFQLYSEGVYDEPECSSEDLDHGVLVVGYGV-KG 290

Query: 289 GIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
           G KYWL+KNSW + WG+ GY  + RD +    QCGIA  AS+P+
Sbjct: 291 GKKYWLVKNSWAESWGDQGYILMSRDNNN---QCGIASQASYPL 331


>gi|29165304|gb|AAO65603.1| cathepsin L precursor [Hydra vulgaris]
          Length = 324

 Score =  217 bits (552), Expect = 7e-54,   Method: Compositional matrix adjust.
 Identities = 133/314 (42%), Positives = 181/314 (57%), Gaps = 28/314 (8%)

Query: 32  EKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQ 91
           E + QWK  + + Y    E + R+ I+KDN   +   N   +    + L++N+F D+T  
Sbjct: 25  ESWIQWKMYHNKVYSHDGEETVRYTIWKDNERRIREHN---LKGGDFILKMNQFGDMTNS 81

Query: 92  EFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPP-SVNWIEKGAVTPVKYQGQCA----- 145
           EF A       + + S    NG+ FL  ++ V P +V+W  +G VTPVK QGQC      
Sbjct: 82  EFKA------FNGYLSHKHVNGSTFLTPNNFVAPDTVDWRNEGYVTPVKDQGQCGSCWAF 135

Query: 146 --VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAV 203
               ++EG +  K  +LVSLSEQ LVDC+T   NNGC GG MD+AF YI +NKGI ++A 
Sbjct: 136 STTGSLEGQHFKKTGKLVSLSEQNLVDCSTAYGNNGCDGGLMDNAFTYIKENKGIDSEAS 195

Query: 204 YSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVAN-QPVSVAIDAS--ALQF 260
           Y Y     G C   K    AA  T + D+P  +E  L +AVA+  P+SVAIDAS  + QF
Sbjct: 196 YPYTA-EDGKC-VFKKSSVAATDTGFVDIPEGNENKLKEAVASVGPISVAIDASHESFQF 253

Query: 261 YSGGVFN--GYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQP 318
           YS GV+N      T L+HGV  VGYGT E G  YWL+KNSW   WG+ GY +++R+    
Sbjct: 254 YSSGVYNEPSCSSTELDHGVLVVGYGT-ESGKDYWLVKNSWNTSWGDKGYIKMRRN---A 309

Query: 319 QGQCGIAMFASFPV 332
           + QCGIA  AS+P+
Sbjct: 310 KNQCGIATKASYPL 323


>gi|1483570|emb|CAA68066.1| cathepsin l [Litopenaeus vannamei]
          Length = 328

 Score =  217 bits (552), Expect = 7e-54,   Method: Compositional matrix adjust.
 Identities = 131/320 (40%), Positives = 177/320 (55%), Gaps = 27/320 (8%)

Query: 29  SIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVE----RFNNAAIGNRSYTLRLNK 84
           S+ +++  +KA++GR Y    E   R  +F+ N   ++    RF N  +   ++TL++N+
Sbjct: 19  SLRQQWRDFKAEHGRRYASVQEERYRLSVFEQNQQFIDDHNARFENGEV---TFTLQMNQ 75

Query: 85  FADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQC 144
           F D+T +EF A+  GF    +  S +            +P  V+W  KGAVTPVK Q QC
Sbjct: 76  FGDMTSEEFTATMNGFL---NVPSRRPTAILRADPDETLPKEVDWRTKGAVTPVKDQKQC 132

Query: 145 A-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKG 197
                     ++EG + +K  +LVSLSEQ LVDC+    N GC GG MD AF+YI  NKG
Sbjct: 133 GSCWAFSTTGSLEGQHFLKDGKLVSLSEQNLVDCSDKFGNMGCMGGLMDQAFRYIKANKG 192

Query: 198 ITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVAN-QPVSVAIDAS 256
           I  +  Y YE    G C    A +  A  T Y DV    E +L KAVA   P+SVAIDAS
Sbjct: 193 IDTEDSYPYEAQD-GKC-RFDASNVGATDTGYVDVEHGSESALKKAVATIGPISVAIDAS 250

Query: 257 --ALQFYSGGVF--NGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQ 312
             + QFY  GV+   G   T L+HGV AVGYG +E+G  YWL+KNSW   WG  GY ++ 
Sbjct: 251 QPSFQFYHDGVYYEEGCSSTMLDHGVLAVGYGETEKGEAYWLVKNSWNTSWGNKGYIQMS 310

Query: 313 RDIDQPQGQCGIAMFASFPV 332
           RD    +  CGIA  AS+P+
Sbjct: 311 RD---KKNNCGIASQASYPL 327


>gi|198427748|ref|XP_002130282.1| PREDICTED: similar to predicted protein [Ciona intestinalis]
          Length = 340

 Score =  217 bits (552), Expect = 7e-54,   Method: Compositional matrix adjust.
 Identities = 132/350 (37%), Positives = 187/350 (53%), Gaps = 29/350 (8%)

Query: 1   MAKYFLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKD 60
           M    L+ V++ +G C++    +          F+ WK  + + Y+   E  ++   + +
Sbjct: 1   MKVTVLLAVVLFAGCCSAMQLNQQH-----VSLFQTWKNLWKKVYQTVEEEEQKMATWFN 55

Query: 61  NLVAVERFN-NAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYK 119
           N   +   N   ++  +SY L +N++ DLT +EF +   G++           G+ +L  
Sbjct: 56  NWNKISEHNMQYSLKQKSYRLEMNEYGDLTSEEFSSMMNGYRNDIRLKRKSTGGSTYLNL 115

Query: 120 SS-----QVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQ 167
            S     Q+P  V+W + G VTPVK QGQC       A  ++EG +  K  +LVSLSEQ 
Sbjct: 116 LSFGSQIQLPTLVDWRKHGLVTPVKNQGQCGSCWSFSATGSLEGQHKKKTGKLVSLSEQN 175

Query: 168 LVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQIT 227
           L+DC+T + N+GC GG MD AFKYI    GI  +A Y YE      C      D  A  T
Sbjct: 176 LIDCSTPEGNDGCNGGLMDQAFKYIKIQGGIDTEAYYPYEAKDD-TC-RFNITDSGATDT 233

Query: 228 NYEDVPPNDEESLLKAVAN-QPVSVAIDAS--ALQFYSGGVFN--GYCETFLNHGVTAVG 282
            + D+   DEE L +A A   P+SVAIDAS  + QFYS GV++      T L+HGV  VG
Sbjct: 234 GFVDIKSGDEEMLKEAAATVGPISVAIDASHTSFQFYSNGVYSETACSSTMLDHGVLVVG 293

Query: 283 YGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
           YGT E G  YWL+KNSWG+ WGE GY ++ R+ D    QCGIA  AS+P+
Sbjct: 294 YGT-ENGKDYWLVKNSWGEGWGEAGYIKMSRNADN---QCGIATQASYPL 339


>gi|442754503|gb|JAA69411.1| Putative cathepsin l-like cysteine proteinase b [Ixodes ricinus]
          Length = 335

 Score =  217 bits (552), Expect = 7e-54,   Method: Compositional matrix adjust.
 Identities = 130/318 (40%), Positives = 184/318 (57%), Gaps = 21/318 (6%)

Query: 30  IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNA-AIGNRSYTLRLNKFADL 88
           +  ++  +KA++G++Y    E   R +I+ +N   + + N   A G   Y++ +N+F D+
Sbjct: 23  LGAEWSAFKAKHGKSYVSETEEVFRLKIYMENRHKIAKHNEKYARGEVPYSMAMNEFGDM 82

Query: 89  TPQEFIASQTGFKMS--DHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQC-- 144
              EF++++ GFK +  D          P   +   +P +V+W  KGAVTPVK QGQC  
Sbjct: 83  LHHEFVSTRNGFKRNYKDQPREGSTYLEPENIEDFSLPKTVDWRTKGAVTPVKNQGQCGS 142

Query: 145 -----AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGIT 199
                A  ++EG +  K   +VSLSEQ LVDC+T+  NNGC GG MD+AFKYI  NKGI 
Sbjct: 143 CWAFSATGSLEGQHFRKSGSMVSLSEQNLVDCSTDFGNNGCEGGLMDNAFKYIRANKGID 202

Query: 200 NDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVAN-QPVSVAIDAS-- 256
            +  Y Y G + G C   K     A  + + D+    E  L KAVA   P+SVAIDAS  
Sbjct: 203 TEKSYPYNG-TDGTC-HFKKSTVGATDSGFVDIKEGSETQLKKAVATVGPISVAIDASHE 260

Query: 257 ALQFYSGGVFN-GYCET-FLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRD 314
           + QFYS GV++   C++  L+HGV  VGYGT   G  YWL+KNSWG  WG++GY R+ R+
Sbjct: 261 SFQFYSDGVYDEPECDSESLDHGVLVVGYGTL-NGTDYWLVKNSWGTTWGDEGYIRMSRN 319

Query: 315 IDQPQGQCGIAMFASFPV 332
               + QCGIA  AS+P+
Sbjct: 320 ---KKNQCGIASSASYPL 334


>gi|330803820|ref|XP_003289900.1| hypothetical protein DICPUDRAFT_80649 [Dictyostelium purpureum]
 gi|325080011|gb|EGC33585.1| hypothetical protein DICPUDRAFT_80649 [Dictyostelium purpureum]
          Length = 328

 Score =  217 bits (552), Expect = 7e-54,   Method: Compositional matrix adjust.
 Identities = 141/340 (41%), Positives = 186/340 (54%), Gaps = 33/340 (9%)

Query: 5   FLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVA 64
           FLIV  I +    SQ  Y+T         F+ W  ++ ++Y    E   R+ IF+DN+  
Sbjct: 11  FLIVNCISAARVFSQKQYQT--------AFQNWMVKHQKSYTND-EFGSRYTIFQDNMDF 61

Query: 65  VERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVP 124
           V ++N          L LN  ADLT QE+     G K +    +L    T      S+ P
Sbjct: 62  VTKWNQKG---SDTILGLNSMADLTNQEYQRIYLGTKTTVKKPNLIIGVTDV----SKAP 114

Query: 125 PSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNN 177
            SV+W   GAVT VK QGQC          +VEGI+ I   +LVSLSEQQ++DC+ ++ N
Sbjct: 115 ASVDWRANGAVTAVKNQGQCGGCYSFSTTGSVEGIHEITSKQLVSLSEQQILDCSGSEGN 174

Query: 178 NGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDE 237
           NGC GG M ++F+YII   G+  +A Y YEG+  G C   KA +  A IT Y++V    E
Sbjct: 175 NGCDGGLMTNSFEYIIAVGGLDTEASYPYEGV-VGKCKFNKA-NIGATITGYKNVKSGSE 232

Query: 238 ESLLKAVANQPVSVAIDAS--ALQFYSGGVF--NGYCETFLNHGVTAVGYGTSEEGIKYW 293
             L  AVA QPVSVAIDAS  + Q YS GV+       T L+HGV AVGYG S+ G  YW
Sbjct: 233 SDLQTAVAAQPVSVAIDASQNSFQLYSSGVYYEPACSSTQLDHGVLAVGYG-SQSGQDYW 291

Query: 294 LIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVS 333
           ++KNSWG DWGE G+  + R+       CGIA  AS+P +
Sbjct: 292 IVKNSWGADWGEKGFILMARN---KHNNCGIATMASYPTA 328


>gi|348687948|gb|EGZ27762.1| papain-like cysteine protease C1 [Phytophthora sojae]
          Length = 533

 Score =  217 bits (552), Expect = 7e-54,   Method: Compositional matrix adjust.
 Identities = 127/312 (40%), Positives = 180/312 (57%), Gaps = 21/312 (6%)

Query: 32  EKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFN--NAAIGNRSYTLRLNKFADLT 89
            +F  W + +G T+ ++ E ++R E +  N + +   N  NA  G +   L  N F+ ++
Sbjct: 26  HEFSAWMSAHGVTFSDALEFARRLENYIANDMYILEHNAENAWTGVK---LGHNAFSHMS 82

Query: 90  PQEFIASQTGFKMSD-HSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQCA--- 145
             EF    TG  + + +     A+    L+   +VP +V+W++KG VTPVK QG C    
Sbjct: 83  FDEFKFKMTGLVLPEGYLEQRLASRVDGLWSDVEVPSAVDWVDKGGVTPVKNQGMCGSCW 142

Query: 146 ----VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITND 201
                 AVEG   +   +L+SLSEQ+LVDC  N  + GC GG MD AF++I  + GI ++
Sbjct: 143 AFSTTGAVEGATFVSSGKLLSLSEQELVDCDHN-GDMGCNGGLMDHAFQWIEDHGGICSE 201

Query: 202 AVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDAS--ALQ 259
             Y Y+  +  +C   +  D   ++T ++DV P DE +L  AVA QPVSVAI+A   A Q
Sbjct: 202 DDYEYKAKAQ-VC---RKCDSVVKVTGFQDVNPQDEHALKVAVAQQPVSVAIEADQKAFQ 257

Query: 260 FYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQ 319
           FY  GVFN  C T L+HGV AVGYG ++ G K+W +KNSWG  WGE GY RL R+ + P 
Sbjct: 258 FYKSGVFNLTCGTRLDHGVLAVGYG-NDNGQKFWKVKNSWGASWGEQGYIRLAREENGPA 316

Query: 320 GQCGIAMFASFP 331
           GQCGIA   S+P
Sbjct: 317 GQCGIASVPSYP 328


>gi|402770507|gb|AFQ98388.1| cathepsin L [Rhipicephalus microplus]
          Length = 332

 Score =  217 bits (552), Expect = 8e-54,   Method: Compositional matrix adjust.
 Identities = 141/344 (40%), Positives = 193/344 (56%), Gaps = 36/344 (10%)

Query: 6   LIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAV 65
           L  ++ ++ + +SQ   RT        ++E +K  + +TY+   E   RF+IF +N + +
Sbjct: 7   LCAIVAVTVAASSQEILRT--------QWEAFKTTHKKTYQSHMEELLRFKIFTENSLII 58

Query: 66  ERFN-NAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFL----YKS 120
            + N   A G  SY L +N+F DL   EF     G+       S K+ G+ FL       
Sbjct: 59  AKHNAKYAKGLVSYKLGMNQFGDLLAHEFARIFNGYH-----GSRKSGGSTFLPPANVND 113

Query: 121 SQVPPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCAT 173
           S +P +V+W +KGAVTPVK QGQC          ++EG + +K   LVSLSEQ LVDC+ 
Sbjct: 114 SSLPKAVDWRKKGAVTPVKDQGQCGSCWAFSTTGSLEGQHFLKNGELVSLSEQNLVDCSQ 173

Query: 174 NDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVP 233
           +  NNGC GG M+DAFKYI  N GI  +  Y YE +  G C   K ED  A  T Y ++ 
Sbjct: 174 SFGNNGCEGGLMEDAFKYIKANDGIDTEKSYPYEAVD-GEC-RFKKEDVGATDTGYVEIK 231

Query: 234 PNDEESLLKAVAN-QPVSVAIDA--SALQFYSGGVFN-GYCETF-LNHGVTAVGYGTSEE 288
              E+ L KAVA   P+SVAIDA  S+ Q YS GV++   C +  L+HGV  VGYG  + 
Sbjct: 232 AGCEDDLKKAVATVGPISVAIDASHSSFQLYSEGVYDEPECSSEDLDHGVLVVGYGV-KG 290

Query: 289 GIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
           G KYWL+KNSW + WG+ GY  + RD +    QCGIA  AS+P+
Sbjct: 291 GKKYWLVKNSWAESWGDQGYILMSRDNNN---QCGIASQASYPL 331


>gi|21489677|gb|AAM55195.1|AF412313_1 cathepsin L cysteine protease [Haemonchus contortus]
 gi|21483192|gb|AAL14224.1| cathepsin L [Haemonchus contortus]
          Length = 354

 Score =  216 bits (551), Expect = 9e-54,   Method: Compositional matrix adjust.
 Identities = 133/321 (41%), Positives = 183/321 (57%), Gaps = 25/321 (7%)

Query: 30  IAEKFEQW---KAQYGRTYKESAENSKRFEIFKDNLVAVERFNNA-AIGNRSYTLRLNKF 85
           I E F +W   K  +G++Y+   EN    E F  N++ +E  N    +G +++ + LN+ 
Sbjct: 40  IDEAFNKWDDYKETFGKSYEPDEEND-YMEAFVKNVIHIEEHNKEHRLGRKTFEMGLNEI 98

Query: 86  ADLTPQEFIASQTGFKMSDH-SSSLKANGTPFLYK-SSQVPPSVNWIEKGAVTPVKYQGQ 143
           ADL   ++     G++M      SL++NGT FL   + Q+P SV+W E+G VTPVK QG 
Sbjct: 99  ADLPFSQY-RKLNGYRMRRQFGDSLQSNGTKFLVPFNVQIPESVDWREEGLVTPVKNQGM 157

Query: 144 C-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNK 196
           C       +  A+EG +A    +LVSLSEQ LVDC+T   N+GC GG MD AF+YI +N 
Sbjct: 158 CGSCWAFSSTGALEGQHARATGKLVSLSEQNLVDCSTKYGNHGCNGGLMDLAFEYIKENH 217

Query: 197 GITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQ-PVSVAIDA 255
           G+  +  Y Y G  T      K     A    + D+P  DEE+L KAVA Q P+S+AIDA
Sbjct: 218 GVDTEDSYPYVGRETKC--HFKRNAVGADDKGFVDLPEGDEEALKKAVATQGPISIAIDA 275

Query: 256 S--ALQFYSGGV-FNGYCET-FLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRL 311
              + Q Y  GV F+  C +  L+HGV  VGYGT  E   YWL+KNSWG  WGE GY R+
Sbjct: 276 GHRSFQLYKKGVYFDEECSSEELDHGVLLVGYGTDPEAGDYWLVKNSWGPTWGEKGYIRI 335

Query: 312 QRDIDQPQGQCGIAMFASFPV 332
            R+ +     CG+A  AS+P+
Sbjct: 336 ARNRNN---HCGVATKASYPL 353


>gi|7381610|gb|AAF61565.1|AF227957_1 cathepsin L-like proteinase precursor [Rhipicephalus microplus]
          Length = 332

 Score =  216 bits (551), Expect = 1e-53,   Method: Compositional matrix adjust.
 Identities = 141/344 (40%), Positives = 191/344 (55%), Gaps = 36/344 (10%)

Query: 6   LIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAV 65
           L  ++ ++ + +SQ   RT        ++E +K  + ++Y+   E   RF+IF +N + +
Sbjct: 7   LCAIVAVTVAASSQEILRT--------QWEAFKTTHKKSYQSHMEELLRFKIFTENSLII 58

Query: 66  ERFN-NAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFL----YKS 120
            + N   A G  SY L +N+F DL   EF     G     H  + K  G+ FL       
Sbjct: 59  AKHNAKYAKGLVSYKLGMNQFGDLLAHEFARIFNG-----HHGTRKTGGSTFLPPANVND 113

Query: 121 SQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCAT 173
           S +P  V+W +KGAVTPVK QGQC       A  ++EG + +K   LVSLSEQ LVDC+ 
Sbjct: 114 SSLPKVVDWRKKGAVTPVKDQGQCGSCWAFSATGSLEGQHFLKNGELVSLSEQNLVDCSQ 173

Query: 174 NDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVP 233
           +  NNGC GG M+DAFKYI  N GI  +  Y YE +  G C   K ED  A  T Y ++ 
Sbjct: 174 SFGNNGCEGGLMEDAFKYIKANDGIDTEKSYPYEAVD-GEC-RFKKEDVGATDTGYVEIK 231

Query: 234 PNDEESLLKAVAN-QPVSVAIDA--SALQFYSGGVFN-GYCETF-LNHGVTAVGYGTSEE 288
              E  L KAVA   P+SVAIDA  S+ Q YS GV++   C +  L+HGV  VGYG  + 
Sbjct: 232 AGSEVDLKKAVATVGPISVAIDASHSSFQLYSEGVYDEPECSSEDLDHGVLVVGYGV-KG 290

Query: 289 GIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
           G KYWL+KNSW + WG+ GY  + RD +    QCGIA  AS+P+
Sbjct: 291 GKKYWLVKNSWAESWGDQGYILMSRDNNN---QCGIASQASYPL 331


>gi|224116880|ref|XP_002317417.1| predicted protein [Populus trichocarpa]
 gi|118488173|gb|ABK95906.1| unknown [Populus trichocarpa]
 gi|222860482|gb|EEE98029.1| predicted protein [Populus trichocarpa]
          Length = 498

 Score =  216 bits (551), Expect = 1e-53,   Method: Compositional matrix adjust.
 Identities = 127/330 (38%), Positives = 177/330 (53%), Gaps = 18/330 (5%)

Query: 15  SCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIG 74
           S  S   +    E  I E F+ WK ++ + YK + E  +R   FK NL  +   N     
Sbjct: 31  SAVSNDLHEGLTEEGITEVFKLWKEKHQKVYKHAEEAERRIGNFKRNLKYIIEKNGKRKS 90

Query: 75  NRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGA 134
              + + LNKFADL+ +EF   +          +++        ++   P S++W  KG 
Sbjct: 91  GLEHKVGLNKFADLSNEEF--REMYLSKVKKPITIEEKRKHRHLQTCDAPSSLDWRNKGV 148

Query: 135 VTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDD 187
           VT VK QG C          A+E INAI    L+SLSEQ+LVDC T  NN GC GG MD 
Sbjct: 149 VTAVKDQGDCGSCWSFSTTGAIEAINAIVTGDLISLSEQELVDCDTT-NNYGCEGGDMDS 207

Query: 188 AFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQ 247
           AF+++I N GI  +A Y Y G+  G C++ K E     I  Y DV P+D  +LL A   Q
Sbjct: 208 AFQWVIGNGGIDTEADYPYTGVD-GTCNTAKEEKKVVSIEGYVDVDPSDS-ALLCATVQQ 265

Query: 248 PVSVAIDASAL--QFYSGGVFNGYCE---TFLNHGVTAVGYGTSEEGIKYWLIKNSWGQD 302
           P+SV +D SAL  Q Y+GG+++G C      ++H +  VGYG SE    YW++KNSWG +
Sbjct: 266 PISVGMDGSALDFQLYTGGIYDGDCSGDPNDIDHAILIVGYG-SENDEDYWIVKNSWGTE 324

Query: 303 WGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
           WG +GYF ++R+  +P G C I   AS+P 
Sbjct: 325 WGMEGYFYIRRNTSKPYGVCAINADASYPT 354


>gi|158300877|ref|XP_001689282.1| AGAP011828-PA [Anopheles gambiae str. PEST]
 gi|157013372|gb|EDO63348.1| AGAP011828-PA [Anopheles gambiae str. PEST]
          Length = 344

 Score =  216 bits (551), Expect = 1e-53,   Method: Compositional matrix adjust.
 Identities = 140/353 (39%), Positives = 195/353 (55%), Gaps = 32/353 (9%)

Query: 2   AKYFLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDN 61
           A  FLI++L   G  A+      F+   + E++  +K Q+ + Y    E   R +I+  N
Sbjct: 1   AMKFLILIL---GFVAAANAISIFE--LVKEEWTAFKLQHRKKYDSETEERIRMKIYVQN 55

Query: 62  LVAVERFNNA-AIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSS------SLKANGT 114
              + + N    +G   + LR+NK+ADL  +EF+ +  GF  S           LK    
Sbjct: 56  KHKIAKHNQRYDLGQEKFRLRVNKYADLLHEEFVHTLNGFNRSVSGKGQLLRGELKPIEE 115

Query: 115 PFLY---KSSQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLS 164
           P  +    +  VP +++W  KGAVT VK QG C       A  A+EG +  K  +LVSLS
Sbjct: 116 PVTWIEPANVDVPTAMDWRTKGAVTQVKDQGHCGSCWSFSATGALEGQHFRKTGKLVSLS 175

Query: 165 EQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAA 224
           EQ LVDC+    NNGC GG MD AF+YI  NKGI  +  Y YE +      + KA    A
Sbjct: 176 EQNLVDCSQKYGNNGCNGGMMDFAFQYIKDNKGIDTEKSYPYEAIDDECHYNPKAV--GA 233

Query: 225 QITNYEDVPPNDEESLLKAVAN-QPVSVAIDAS--ALQFYSGGV-FNGYCETF-LNHGVT 279
               + D+P  +E++L+KA+A   PVSVAIDAS  + QFYS GV +   C++  L+HGV 
Sbjct: 234 TDKGFVDIPQGNEKALMKALATVGPVSVAIDASHESFQFYSEGVYYEPQCDSEQLDHGVL 293

Query: 280 AVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
           AVGYGT+E+G  YWL+KNSWG  WG+ GY ++ R+ D     CGIA  AS+P+
Sbjct: 294 AVGYGTTEDGEDYWLVKNSWGTTWGDQGYVKMARNRDN---HCGIATTASYPL 343


>gi|195381187|ref|XP_002049336.1| GJ20806 [Drosophila virilis]
 gi|194144133|gb|EDW60529.1| GJ20806 [Drosophila virilis]
          Length = 339

 Score =  216 bits (551), Expect = 1e-53,   Method: Compositional matrix adjust.
 Identities = 131/322 (40%), Positives = 185/322 (57%), Gaps = 25/322 (7%)

Query: 30  IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNA-AIGNRSYTLRLNKFADL 88
           I E+++ +K ++ + Y +  E   R +IF +N   + + N   A G  S+ + +NK+AD+
Sbjct: 23  IKEEWQTFKLEHRKNYVDETEERFRLKIFNENKHKIAKHNQRYASGEVSFKMAVNKYADM 82

Query: 89  TPQEFIASQTGFKMSDHSSSLKANGTPFL------YKSSQVPPSVNWIEKGAVTPVKYQG 142
              EF  +  GF  + H   L+A+   F+       +  ++P SV+W  KGAVT VK QG
Sbjct: 83  LHHEFHTTMNGFNYTLHKQ-LRASDPSFVGVTFISPEHVKIPKSVDWRSKGAVTEVKDQG 141

Query: 143 QC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQN 195
            C       +  A+EG +  K   L+SLSEQ LVDC+T   NNGC GG MD+AF+YI  N
Sbjct: 142 HCGSCWAFSSTGALEGQHFRKAGTLISLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKDN 201

Query: 196 KGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVAN-QPVSVAID 254
            GI  +  Y YEG+    C   KA   A    +  D+P  DE+ + +AVA   PVSVAID
Sbjct: 202 GGIDTEKSYPYEGIDDS-CHFNKATIGATDRGSV-DIPQGDEKKMAEAVATIGPVSVAID 259

Query: 255 AS--ALQFYSGGVFN-GYCE-TFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFR 310
           AS  + QFYS G++N   C+   L+HGV  VGYGT E G  YWL+KNSWG  WG+ G+ +
Sbjct: 260 ASHESFQFYSEGIYNEPQCDPQNLDHGVLVVGYGTDESGQDYWLVKNSWGTTWGDKGFIK 319

Query: 311 LQRDIDQPQGQCGIAMFASFPV 332
           + R+ D    QCGIA  +S+P+
Sbjct: 320 MARNADN---QCGIASASSYPL 338


>gi|221090861|ref|XP_002167224.1| PREDICTED: cathepsin L-like [Hydra magnipapillata]
          Length = 324

 Score =  216 bits (550), Expect = 1e-53,   Method: Compositional matrix adjust.
 Identities = 133/314 (42%), Positives = 181/314 (57%), Gaps = 28/314 (8%)

Query: 32  EKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQ 91
           E + QWK  + + Y    E + R+ I+KDN   +   N   +    + L++N+F D+T  
Sbjct: 25  ESWIQWKMYHNKVYSHDGEETVRYTIWKDNERRIREHN---LKGGDFLLKMNQFGDMTNS 81

Query: 92  EFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPP-SVNWIEKGAVTPVKYQGQCA----- 145
           EF A       + + S    NG+ FL  ++ V P +V+W  +G VTPVK QGQC      
Sbjct: 82  EFKA------FNGYLSHKHVNGSTFLTPNNFVAPDTVDWRNEGYVTPVKDQGQCGSCWAF 135

Query: 146 --VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAV 203
               ++EG +  K  +LVSLSEQ LVDC+T   NNGC GG MD+AF YI +NKGI ++A 
Sbjct: 136 STTGSLEGQHFKKTGKLVSLSEQNLVDCSTAYGNNGCNGGLMDNAFTYIKENKGIDSEAS 195

Query: 204 YSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVAN-QPVSVAIDAS--ALQF 260
           Y Y     G C   K    AA  T + D+P  +E  L +AVA+  P+SVAIDAS  + QF
Sbjct: 196 YPYTA-EDGKC-VFKKPSVAATDTGFVDLPEGNENKLKEAVASVGPISVAIDASHESFQF 253

Query: 261 YSGGVFN--GYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQP 318
           YS GV+N      T L+HGV  VGYGT E G  YWL+KNSW   WG+ GY +++R+    
Sbjct: 254 YSSGVYNEPSCSSTELDHGVLVVGYGT-ESGKDYWLVKNSWNTSWGDKGYIKMRRN---A 309

Query: 319 QGQCGIAMFASFPV 332
           + QCGIA  AS+P+
Sbjct: 310 KNQCGIATKASYPL 323


>gi|146152090|gb|ABQ08058.1| cathepsin L [Misgurnus mizolepis]
          Length = 337

 Score =  216 bits (550), Expect = 1e-53,   Method: Compositional matrix adjust.
 Identities = 134/349 (38%), Positives = 191/349 (54%), Gaps = 30/349 (8%)

Query: 1   MAKYFLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKD 60
           M  Y  +  L +SG  A+ +  +  D+      +EQWK  +G+ Y E  E  +R  I++ 
Sbjct: 1   MWTYLALFTLCLSGVFAAPSLDKQLDD-----HWEQWKTWHGKNYHEKEEGWRRM-IWEK 54

Query: 61  NLVAVERFN-NAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYK 119
           NL  ++  N   ++G  +Y L +N F D+  +EF     G+K   H +  K  G+ F+  
Sbjct: 55  NLRKIQFHNLEHSMGIHTYRLGMNHFGDMNHEEFRQVMNGYK---HKTERKFKGSLFMEP 111

Query: 120 S-SQVPPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDC 171
           +  +VP  ++W EKG VTPVK QG+C          A+EG    K  +LVSLSEQ LVDC
Sbjct: 112 NFLEVPSKLDWREKGYVTPVKDQGECGSCWAFSTTGAMEGQMFRKQGKLVSLSEQNLVDC 171

Query: 172 ATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYED 231
           +  + N GC GG MD AF+YI  N G+ ++  Y Y G     C     + +AA  T + D
Sbjct: 172 SRPEGNEGCNGGLMDQAFQYIKDNNGLDSEEAYPYLGTDDQPC-HYDPKYNAANDTGFVD 230

Query: 232 VPPNDEESLLKAVAN-QPVSVAIDA--SALQFYSGGV-FNGYCET-FLNHGVTAVGYGTS 286
           +P   E +L+KAVA+  PVSVAIDA   + QFY  G+ F   C +  L+HGV  VGYG  
Sbjct: 231 IPSGKEHALMKAVASVGPVSVAIDAGHESFQFYQSGIYFEKECSSEELDHGVLVVGYGFE 290

Query: 287 EE---GIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
            E   G KYW++KNSW + WG+ GY  + +D    +  CGIA  AS+P+
Sbjct: 291 GEDVDGKKYWIVKNSWSESWGDKGYIYMAKD---RKNHCGIATAASYPL 336


>gi|334332720|ref|XP_001367595.2| PREDICTED: cathepsin L1-like [Monodelphis domestica]
          Length = 333

 Score =  216 bits (550), Expect = 1e-53,   Method: Compositional matrix adjust.
 Identities = 133/323 (41%), Positives = 192/323 (59%), Gaps = 28/323 (8%)

Query: 25  FDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFN-NAAIGNRSYTLRLN 83
           FD+ ++  ++ QWKAQ+ RTY  + E+  R   ++ NL  +E  N   + G  S+ L +N
Sbjct: 21  FDQ-TLDSQWHQWKAQHRRTYAAN-EDGWRRATWEKNLKMIEMHNLEYSAGKHSFQLGMN 78

Query: 84  KFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKS---SQVPPSVNWIEKGAVTPVKY 140
           KF D+T +EF     G+  + + S  +  G+  LY+    +Q+P SV+W EKG VTPVK 
Sbjct: 79  KFGDMTTEEFKQVMNGY--NSNGSQKRTKGS--LYREPLLAQLPKSVDWREKGYVTPVKN 134

Query: 141 QGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYII 193
           QGQC       A  ++EG    K  +LVSLSEQ LVDC+T++ NNGC GG MD+AF+Y+ 
Sbjct: 135 QGQCGSCWAFSATGSLEGQWFHKTKKLVSLSEQNLVDCSTSEGNNGCSGGLMDNAFEYVK 194

Query: 194 QNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVAN-QPVSVA 252
            N GI  +  Y Y G         +AE   A +T + D+P  +E +L+KAVAN  P+SVA
Sbjct: 195 NNGGIDTEQAYPYLGQDNEC--KYRAECSGANVTGFVDIPSMNERALMKAVANVGPISVA 252

Query: 253 IDAS--ALQFYSGGV-FNGYCETF-LNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGY 308
           IDA   + QFY  GV +   C +  L+HGV  VGYG+  +  +YW++KNSWG++WG+ GY
Sbjct: 253 IDAGNPSFQFYESGVYYEPQCSSSQLDHGVLVVGYGSIGKD-EYWIVKNSWGEEWGKKGY 311

Query: 309 FRLQRDIDQPQGQCGIAMFASFP 331
             + +  +     CGIA  AS+P
Sbjct: 312 VLMAKFRNN---HCGIATAASYP 331


>gi|21483184|gb|AAF86584.1| cathepsin L cysteine protease [Haemonchus contortus]
          Length = 355

 Score =  216 bits (550), Expect = 1e-53,   Method: Compositional matrix adjust.
 Identities = 132/321 (41%), Positives = 183/321 (57%), Gaps = 25/321 (7%)

Query: 30  IAEKFEQW---KAQYGRTYKESAENSKRFEIFKDNLVAVERFNNA-AIGNRSYTLRLNKF 85
           I E F +W   K  +G++Y+   EN    E F  N++ +E  N    +G +++ + LN+ 
Sbjct: 41  IDEAFNKWDDYKETFGKSYEPEEEND-YMEAFVKNVIHIEEHNKEHRLGRKTFEMGLNEI 99

Query: 86  ADLTPQEFIASQTGFKMSDH-SSSLKANGTPFLYK-SSQVPPSVNWIEKGAVTPVKYQGQ 143
           ADL   ++     G++M      S+++NGT FL   + Q+P SV+W E+G VTPVK QG 
Sbjct: 100 ADLPFSQY-RKLNGYRMRRQFGDSMQSNGTKFLVPFNVQIPESVDWREEGLVTPVKNQGM 158

Query: 144 C-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNK 196
           C       +  A+EG +A    +LVSLSEQ LVDC+T   N+GC GG MD AF+YI +N 
Sbjct: 159 CGSCWAFSSTGALEGQHARATGKLVSLSEQNLVDCSTKYGNHGCNGGLMDLAFEYIKENH 218

Query: 197 GITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQ-PVSVAIDA 255
           G+  +  Y Y G  T      K     A    + D+P  DEE+L KAVA Q P+S+AIDA
Sbjct: 219 GVDTEDSYPYVGRETKC--HFKRNTVGADDKGFVDLPEGDEEALKKAVATQGPISIAIDA 276

Query: 256 S--ALQFYSGGV-FNGYCET-FLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRL 311
              + Q Y  GV F+  C +  L+HGV  VGYGT  E   YWL+KNSWG  WGE GY R+
Sbjct: 277 GHRSFQLYKKGVYFDEECSSEELDHGVLLVGYGTDPEAGDYWLVKNSWGPTWGEKGYIRI 336

Query: 312 QRDIDQPQGQCGIAMFASFPV 332
            R+ +     CG+A  AS+P+
Sbjct: 337 ARNRNN---HCGVATKASYPL 354


>gi|357627452|gb|EHJ77132.1| cathepsin L-like protease [Danaus plexippus]
          Length = 341

 Score =  216 bits (550), Expect = 1e-53,   Method: Compositional matrix adjust.
 Identities = 132/347 (38%), Positives = 192/347 (55%), Gaps = 27/347 (7%)

Query: 6   LIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAV 65
           + ++L++    A+      FD   + E++  +K ++ + Y    E   R +I+ +N   V
Sbjct: 1   MKILLVLCAVVAAGTAVSFFD--LVREEWNTFKLEHKKQYDSETEEKFRMKIYAENKHKV 58

Query: 66  ERFNNA-AIGNRSYTLRLNKFADLTPQEFIASQTGF-KMSDHSSSLKANGT-----PFLY 118
            + N     G  SY L+ NK++D+   EF+ +  GF K   H+  L A G       F+ 
Sbjct: 59  AKHNQRYQKGLVSYRLKTNKYSDMLHHEFVNTMNGFNKTVKHNKGLYAKGNDIRGATFVS 118

Query: 119 KSS-QVPPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVD 170
            ++   PP+V+W + GAVTPVK QG+C          A+EG +  K   LVSLSEQ L+D
Sbjct: 119 PANVAAPPTVDWRQHGAVTPVKDQGKCGSCWSFSTTGALEGQHFRKSGFLVSLSEQNLID 178

Query: 171 CATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYE 230
           C++   NNGC GG MD+AFKYI  N GI  +  Y YE +          ++  A+   + 
Sbjct: 179 CSSAYGNNGCNGGLMDNAFKYIKDNDGIDTEKTYPYEAVDDKC--RYNPKNSGAEDVGFV 236

Query: 231 DVPPNDEESLLKAVAN-QPVSVAIDAS--ALQFYSGGV-FNGYCET-FLNHGVTAVGYGT 285
           D+P  DE  L+ A+A   PVSVAIDAS  + Q YS GV ++  C +  L+HGV  VGYGT
Sbjct: 237 DIPAGDEHKLMLALATVGPVSVAIDASQESFQLYSDGVYYDENCSSENLDHGVLVVGYGT 296

Query: 286 SEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
            E+G  YWL+KNSWG  WG++GY ++ R+ D     CGIA  AS+P+
Sbjct: 297 DEDGGDYWLVKNSWGPSWGDEGYIKMARNRDN---HCGIASSASYPL 340


>gi|159792912|gb|ABW98676.1| cathepsin L [Apostichopus japonicus]
          Length = 332

 Score =  216 bits (550), Expect = 1e-53,   Method: Compositional matrix adjust.
 Identities = 134/332 (40%), Positives = 186/332 (56%), Gaps = 23/332 (6%)

Query: 15  SCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNA-AI 73
           +C + A   T  +    + +E WK  + + Y +  E+++R +I++DNL  V + N   ++
Sbjct: 9   ACVAGALCFTIIDKGFDDTWEAWKQTHSKQYTKEEEDNRR-KIWEDNLQKVSKHNTEHSL 67

Query: 74  GNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFL-YKSSQVPPSVNWIEK 132
           G  SYTL +NK+ADL  +EF+    G K     +S +  G  FL Y   Q P SV+W ++
Sbjct: 68  GLHSYTLGMNKYADLRGEEFVQMMNGLKFD---ASRERQGIKFLSYAKFQAPDSVDWRDE 124

Query: 133 GAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFM 185
           G VTPVK QGQC          ++EG +      L SLSEQ LVDC+ +  NNGC GG M
Sbjct: 125 GYVTPVKDQGQCGSCWAFSTTGSLEGQHFRSTGVLTSLSEQNLVDCSISYGNNGCEGGLM 184

Query: 186 DDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKA-V 244
           D AF+YI  N GI  +  Y YE      C     ++  A  + Y DV   DE++L +A  
Sbjct: 185 DYAFQYIKDNLGIDTEDKYPYEA-EDDTC-RFSPDNVGATDSGYVDVDSGDEDALKEACA 242

Query: 245 ANQPVSVAIDAS--ALQFYSGGVFNG-YCETF-LNHGVTAVGYGTSEEGIKYWLIKNSWG 300
           AN P+SVAIDAS  + Q Y  GV++   C +  L+HGV  VGYGT   G  YW++KNSWG
Sbjct: 243 ANGPISVAIDASHESFQLYESGVYDEESCSSIELDHGVLVVGYGTDSVGGDYWIVKNSWG 302

Query: 301 QDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
             WG++GY  + R+ D    QCGIA  AS+P 
Sbjct: 303 LSWGQEGYIWMSRNKDN---QCGIATSASYPT 331


>gi|389608655|dbj|BAM17937.1| cathepsin L [Papilio xuthus]
          Length = 341

 Score =  216 bits (550), Expect = 1e-53,   Method: Compositional matrix adjust.
 Identities = 133/346 (38%), Positives = 194/346 (56%), Gaps = 29/346 (8%)

Query: 7   IVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVE 66
           +V+L+   + AS  ++  FD   + E++  +K ++ + Y    E+  R +I+ +N   + 
Sbjct: 4   LVILLCVVAAASAVSF--FD--LVKEEWNAFKMEHQKQYDSEVEDKFRMKIYAENKHNIA 59

Query: 67  RFNNA-AIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSS------SLKANGTPFLYK 119
           + N   A G  S+ L+ NK+ D+   EF+ +  GF  +  +S      S    G  F+  
Sbjct: 60  KHNQKYARGEVSFRLKQNKYGDMLHHEFVHTMNGFNKTTKNSKGLFGKSAGERGATFITP 119

Query: 120 SS-QVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDC 171
           ++  +P  V+W + GAVT VK QG+C       +  A+EG +  + N LVSLSEQ L+DC
Sbjct: 120 ANVHLPDHVDWRKHGAVTEVKDQGKCGSCWSFSSTGALEGQHYRRTNILVSLSEQNLIDC 179

Query: 172 ATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYED 231
           +    NNGC GG MD+AFKYI  N+GI  +  Y YEG+          ++  A    + D
Sbjct: 180 SAAYGNNGCNGGLMDNAFKYIKDNRGIDTEKSYPYEGIDDKC--RYNPKNTGADDNGFVD 237

Query: 232 VPPNDEESLLKAVAN-QPVSVAIDA--SALQFYSGGV-FNGYC-ETFLNHGVTAVGYGTS 286
           +P  DE  L+ AVA   PVSVAIDA  S+ QFYS GV F+  C  + L+HGV  VGYGT 
Sbjct: 238 IPSGDEGKLMAAVATVGPVSVAIDASQSSFQFYSDGVYFDENCSSSSLDHGVLVVGYGTD 297

Query: 287 EEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
           E G  YWL+KNSWG+ WG+ GY ++ R+ D     CGIA  AS+P+
Sbjct: 298 ENGGDYWLVKNSWGRSWGDLGYIKMARNRDN---HCGIATAASYPL 340


>gi|161408095|dbj|BAF94151.1| cathepsin L-like cysteine protease 1 [Plautia stali]
          Length = 344

 Score =  216 bits (550), Expect = 1e-53,   Method: Compositional matrix adjust.
 Identities = 126/316 (39%), Positives = 182/316 (57%), Gaps = 20/316 (6%)

Query: 31  AEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNA-AIGNRSYTLRLNKFADLT 89
           A +F ++K+QY + Y   +    R +++K N   V   N     G  +Y + LN  AD+ 
Sbjct: 20  ASEFTRFKSQYRKDYPSDSVERYRKKVYKQNEKFVREHNERYERGEVTYKMALNHLADMH 79

Query: 90  PQEFIASQTGFKMSDHSSSLKANGTPFLY-KSSQVPPSVNWIEKGAVTPVKYQGQC---- 144
           P+EF+A+  GF  S  +++    G PF + K + +   V+W +KGA++PVK QG C    
Sbjct: 80  PREFMATFLGFNRSLRATNKVPEGIPFRHNKDAVIQKEVDWRQKGAISPVKDQGHCGSCW 139

Query: 145 ---AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITND 201
              +  A+E    +K  R VSLSEQ L+DC+ N  NNGC GG M+ AF+Y+  N GI  +
Sbjct: 140 AFSSTGALEAHTFLKKGRRVSLSEQNLIDCSLNYGNNGCEGGLMEQAFQYVRDNDGIDTE 199

Query: 202 AVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQ-PVSVAIDAS--AL 258
             Y YEG  +      K  +  A    +  +P  DE++L++AVA Q P+S+AIDAS  + 
Sbjct: 200 EAYPYEGEDSEC--RFKKNNVGATDAGFVTIPSGDEQALMEAVATQGPLSIAIDASNPSF 257

Query: 259 QFYSGGV-FNGYCETF-LNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDID 316
           QFYS GV +   C +  L+HGV  VGYG  E+  KYWL+KNSW + WGE+GY ++ R+ D
Sbjct: 258 QFYSEGVYYEPECSSAQLDHGVLLVGYGV-EKDQKYWLVKNSWSEQWGENGYIKMARNKD 316

Query: 317 QPQGQCGIAMFASFPV 332
                CGIA  ASFP+
Sbjct: 317 N---NCGIATQASFPI 329


>gi|226443040|ref|NP_001140018.1| Cathepsin L1 precursor [Salmo salar]
 gi|221221188|gb|ACM09255.1| Cathepsin L1 precursor [Salmo salar]
          Length = 338

 Score =  216 bits (550), Expect = 1e-53,   Method: Compositional matrix adjust.
 Identities = 134/348 (38%), Positives = 189/348 (54%), Gaps = 30/348 (8%)

Query: 2   AKYFLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDN 61
           A Y  ++VL +S  CA+      FD   + + +  WK  + ++Y ES E  +R  +++ N
Sbjct: 3   ALYLAVLVLCVSAVCAAP----RFDS-QLEDHWHLWKNWHSKSYHESEEGWRRM-VWEKN 56

Query: 62  LVAVERFN-NAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKS 120
           L  +E  N    +G  SY L +N F D+T +EF  +  G+K    ++  K  G+ F+  +
Sbjct: 57  LKKIEMHNLEHTMGKHSYRLGMNHFGDMTNEEFRQTMNGYK---QTTERKFKGSLFMEPN 113

Query: 121 S-QVPPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCA 172
             Q P +V+W EKG VTPVK QG C          A+EG    K  +LVSLSEQ LVDC+
Sbjct: 114 YLQAPKAVDWREKGYVTPVKDQGSCGSCWAFSTTGAMEGQQFRKTGKLVSLSEQNLVDCS 173

Query: 173 TNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDV 232
             + N GC GG MD AF+YI  N G+  +  Y Y G     C   K E   A  T + D+
Sbjct: 174 RPEGNEGCNGGLMDQAFQYIQDNAGLDTEESYPYVGTDEDPC-HYKPEFSGANETGFVDI 232

Query: 233 PPNDEESLLKAVAN-QPVSVAIDA--SALQFYSGGV-FNGYCET-FLNHGVTAVGYGTSE 287
           P   E +++KAVA   PVSVAIDA   + QFY  G+ +   C +  L+HGV  VGYG   
Sbjct: 233 PSGKEHAMMKAVAAVGPVSVAIDAGHESFQFYESGIYYEKECSSEELDHGVLVVGYGFEG 292

Query: 288 E---GIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
           E   G KYW++KNSW + WG+ GY  + +D    +  CGIA  +S+P+
Sbjct: 293 EDVDGKKYWIVKNSWSEKWGDKGYIYMAKD---RKNHCGIATASSYPL 337


>gi|440793751|gb|ELR14926.1| Cysteine proteinase 5, putative [Acanthamoeba castellanii str.
           Neff]
          Length = 326

 Score =  216 bits (550), Expect = 1e-53,   Method: Compositional matrix adjust.
 Identities = 127/309 (41%), Positives = 177/309 (57%), Gaps = 25/309 (8%)

Query: 34  FEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEF 93
           F  W  ++ ++Y    E   R+ ++++N + +E  N+    N+S+ L +NKF DLT  EF
Sbjct: 30  FADWMQEHQKSYANE-EFVYRWNVWRENYLYIEAHNHQ---NKSFHLAMNKFGDLTNAEF 85

Query: 94  IASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQCA-------V 146
                G  ++   +  +++  P    +  +P   +W +KGAVT VK QGQC         
Sbjct: 86  NKLFKGLSITADQAKQESDIAP----APGLPADFDWRQKGAVTHVKNQGQCGSCWSFSTT 141

Query: 147 AAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSY 206
            + EG N +K  RL SLSEQ LVDC+T+  N+GC GG MD AF+YII+NKGI  +  Y Y
Sbjct: 142 GSTEGANFLKHGRLTSLSEQNLVDCSTSYGNHGCNGGLMDYAFEYIIRNKGIDTEESYPY 201

Query: 207 EGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDA--SALQFYSGG 264
              S G C   K +    ++ +Y +VP  +E +LL AVA QP SVAIDA  S+ QFY GG
Sbjct: 202 HA-SQGTCRYNK-QHSGGELVSYTNVPSGNEGALLNAVATQPTSVAIDASHSSFQFYKGG 259

Query: 265 VFN--GYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQC 322
           V++      + L+HGV AVG+G   +G  YWL+KNSWG DWG  GY  + R+      QC
Sbjct: 260 VYDEPACSSSRLDHGVLAVGWGV-RDGKDYWLVKNSWGADWGLSGYIEMSRN---KHNQC 315

Query: 323 GIAMFASFP 331
           GIA  AS P
Sbjct: 316 GIATAASHP 324


>gi|350412176|ref|XP_003489564.1| PREDICTED: cathepsin L-like [Bombus impatiens]
          Length = 343

 Score =  216 bits (550), Expect = 1e-53,   Method: Compositional matrix adjust.
 Identities = 132/348 (37%), Positives = 195/348 (56%), Gaps = 29/348 (8%)

Query: 3   KYFLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNL 62
           K FL++++ I  +  + + +       + +++  +K ++ + YK   E   R +IF DN 
Sbjct: 2   KLFLLLIVAILATAQAISFFEL-----VNQEWTTFKMEHNKVYKNDIEERFRMKIFMDNK 56

Query: 63  VAVERFN-NAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTP----FL 117
             + + N N  +   SY L++NK+ D+   EF+ +  GF  S  ++ L++   P    F+
Sbjct: 57  HKIAKHNGNYEMKKVSYKLKMNKYGDMLHHEFVNTLNGFNKSI-NTQLRSERLPIGASFI 115

Query: 118 YKSSQV-PPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLV 169
             ++ V P +V+W E GAVTPVK QG C       A  A+EG +  +   L+ LSEQ L+
Sbjct: 116 EPANVVLPKTVDWREHGAVTPVKDQGHCGSCWSFSATGALEGQHFRRTGILIPLSEQNLI 175

Query: 170 DCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNY 229
           DC+    NNGC GG MD AF+YI  NKG+  +  Y YE  +        A +  A+   Y
Sbjct: 176 DCSGKYGNNGCNGGLMDQAFQYIKDNKGLDTEVTYPYEAENDKC--RYNAANSGARDVGY 233

Query: 230 EDVPPNDEESLLKAVAN-QPVSVAIDAS--ALQFYSGGVF-NGYCET-FLNHGVTAVGYG 284
            D+P  +E+ L  AVA   PVSVAIDAS  + QFYS GV+    C +  L+HGV AVGYG
Sbjct: 234 VDIPQGNEKKLKAAVATIGPVSVAIDASHQSFQFYSEGVYYEPECSSENLDHGVLAVGYG 293

Query: 285 TSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
           T E G  YWL+KNSWG+ WG++GY ++ R+       CGIA  AS+P+
Sbjct: 294 TDENGQDYWLVKNSWGETWGDNGYIKMARN---KLNHCGIASTASYPL 338


>gi|34559455|gb|AAQ75437.1| cathepsin L-like protease [Helicoverpa armigera]
 gi|338855117|gb|AEJ31938.1| cathepsin L-like protease [Helicoverpa assulta]
          Length = 341

 Score =  216 bits (550), Expect = 1e-53,   Method: Compositional matrix adjust.
 Identities = 134/353 (37%), Positives = 190/353 (53%), Gaps = 37/353 (10%)

Query: 3   KYFLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNL 62
           K   +++ ++  +CA            + E++  +K ++ + Y    E+  R +I+ +N 
Sbjct: 2   KSIAVLLCVVGAACAVSLL------DLVREEWSAFKLEHSKRYDSEVEDKFRMKIYLENK 55

Query: 63  VAV----ERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGF-KMSDHSSSLKANG---- 113
             +    +RF   A+   SY LR NK+AD+   EF+    GF K   H  ++   G    
Sbjct: 56  HRIAKHNQRFEQGAV---SYKLRPNKYADMLSHEFVHVMNGFNKTLKHPKAVHGKGRESR 112

Query: 114 -TPFLYKSS-QVPPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLS 164
              F+  +    P  V+W +KGAVT VK QG+C          A+EG +  K   LVSLS
Sbjct: 113 PATFIAPAHVTYPDHVDWRKKGAVTEVKDQGKCGSCWAFSTTGALEGQHFRKTGYLVSLS 172

Query: 165 EQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAA 224
           EQ L+DC+    NNGC GG MD+AFKYI  N GI  +  Y YEG+         A++  A
Sbjct: 173 EQNLIDCSAAYGNNGCNGGLMDNAFKYIKDNGGIDTEKAYPYEGVDDKC--RYNAKNSGA 230

Query: 225 QITNYEDVPPNDEESLLKAVAN-QPVSVAIDAS--ALQFYSGGVF--NGYCETFLNHGVT 279
               + D+P  DEE L++AVA   PVSVAIDAS  + QFYS GV+       T L+HGV 
Sbjct: 231 DDVGFVDIPQGDEEKLMQAVATVGPVSVAIDASQESFQFYSDGVYYDENCSSTDLDHGVM 290

Query: 280 AVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
            VGYGT E+G  YWL+KNSWG+ WG+ GY ++ R+ +     CGIA  AS+P+
Sbjct: 291 VVGYGTDEQGGDYWLVKNSWGRTWGDLGYIKMARNKNN---HCGIASSASYPL 340


>gi|52630917|gb|AAU84922.1| putative cathepsin L [Toxoptera citricida]
          Length = 341

 Score =  216 bits (550), Expect = 1e-53,   Method: Compositional matrix adjust.
 Identities = 127/320 (39%), Positives = 179/320 (55%), Gaps = 22/320 (6%)

Query: 30  IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNA-AIGNRSYTLRLNKFADL 88
           I E+++ +K Q+ + Y++  E + R +++ DN + + R N     G  +Y L +N F DL
Sbjct: 26  IEEEWDLFKVQFKKIYEDVKEEAFRKKVYLDNKLKIARHNKLYETGEETYALEMNHFGDL 85

Query: 89  TPQEFIASQTGFK--MSDHSSSLKANGTPFLYKSSQV--PPSVNWIEKGAVTPVKYQGQC 144
              E+     GFK  ++    +   +      KS  V  P S++W +KG VTPVK QGQC
Sbjct: 86  MQHEYTKMMNGFKPSLAGGDKNFTDDDAVTFLKSENVVIPKSIDWRKKGYVTPVKNQGQC 145

Query: 145 -------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKG 197
                  A  ++EG +  K   LVSLSEQ L+DC+    NNGC GG MD AFKYI  NKG
Sbjct: 146 GSCWSFSATGSLEGQHFRKTGVLVSLSEQNLIDCSRKYGNNGCEGGLMDLAFKYIKSNKG 205

Query: 198 ITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVAN-QPVSVAIDAS 256
           +  +  Y YE            E+  A    + D+P  DE++L+ A+A   PVS+AIDAS
Sbjct: 206 LDTEKSYPYEAEDDKC--RYNPENSGATDKGFVDIPEGDEDALVHALATVGPVSIAIDAS 263

Query: 257 A--LQFYSGGVF-NGYC-ETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQ 312
           +   QFY  GVF N  C  T L+HGV AVGYGT  +G  YW++KNSWG+ WG+ GY  + 
Sbjct: 264 SEKFQFYKKGVFYNPRCSSTELDHGVLAVGYGTDHKGGDYWIVKNSWGKTWGDQGYIMMA 323

Query: 313 RDIDQPQGQCGIAMFASFPV 332
           R+    +  CG+A  AS+P+
Sbjct: 324 RN---KKNNCGVASSASYPL 340


>gi|185135439|ref|NP_001117777.1| procathepsin L precursor [Oncorhynchus mykiss]
 gi|14582899|gb|AAK69706.1|AF358668_1 procathepsin L [Oncorhynchus mykiss]
          Length = 338

 Score =  216 bits (549), Expect = 1e-53,   Method: Compositional matrix adjust.
 Identities = 135/348 (38%), Positives = 189/348 (54%), Gaps = 30/348 (8%)

Query: 2   AKYFLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDN 61
           A Y  ++VL +S  CA+      FD   + + +  WK  + + Y ES E  +R  +++ N
Sbjct: 3   ALYLAVLVLCVSAVCAAP----RFD-SQLEDHWHLWKNWHSKHYHESEEGWRRM-VWEKN 56

Query: 62  LVAVERFN-NAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKS 120
           L  +E  N    +G  SY L +N F D+T +EF  +  G+K +   +  K  G+ F+  +
Sbjct: 57  LKKIEIHNLEHTMGKHSYRLGMNHFGDMTNEEFRQTMNGYKQT---TERKFKGSLFMEPN 113

Query: 121 S-QVPPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCA 172
             Q P +V+W EKG VTPVK QG C          A+EG    K  +LVSLSEQ LVDC+
Sbjct: 114 YLQAPKAVDWREKGYVTPVKDQGSCGSCWAFSTTGAMEGQQFRKTGKLVSLSEQNLVDCS 173

Query: 173 TNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDV 232
             + N GC GG MD AF+YI  N G+  +  Y Y G     C   K E  AA  T + D+
Sbjct: 174 RPEGNEGCNGGLMDQAFQYIQDNAGLDTEESYPYVGTDEDPC-HYKPEFSAANETGFVDI 232

Query: 233 PPNDEESLLKAVAN-QPVSVAIDA--SALQFYSGGV-FNGYCET-FLNHGVTAVGYGTSE 287
           P   E +++KAVA   PVSVAIDA   + QFY  G+ +   C +  L+HGV  VGYG   
Sbjct: 233 PSGKEHAMMKAVAAVGPVSVAIDAGHESFQFYESGIYYEKECSSEELDHGVLVVGYGFEG 292

Query: 288 E---GIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
           E   G KYW++KNSW + WG+ GY  + +D    +  CGIA  +S+P+
Sbjct: 293 EDVDGKKYWIVKNSWSEKWGDKGYIYMAKD---RKNHCGIATASSYPL 337


>gi|340727787|ref|XP_003402217.1| PREDICTED: cathepsin L-like [Bombus terrestris]
          Length = 343

 Score =  216 bits (549), Expect = 2e-53,   Method: Compositional matrix adjust.
 Identities = 135/345 (39%), Positives = 195/345 (56%), Gaps = 27/345 (7%)

Query: 6   LIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAV 65
           L + LI++    +QA   +F E  + +++  +K ++ + YK   E   R +IF DN   +
Sbjct: 3   LFLFLIVAVLATAQAI--SFFE-LVNQEWTTFKMEHNKVYKNDVEERFRMKIFMDNKHKI 59

Query: 66  ERFN-NAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTP----FLYKS 120
            + N N  +   SY L++NK+ D+   EF+ +  GF  S  ++ L++   P    F+  +
Sbjct: 60  AKHNGNYEMKKVSYKLKMNKYGDMLHHEFVNTLNGFNKSI-NTQLRSERLPIAASFIEPA 118

Query: 121 SQV-PPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCA 172
           + V P +V+W E GAVTPVK QG C       A  A+EG +  +   L+ LSEQ L+DC+
Sbjct: 119 NVVLPKTVDWREHGAVTPVKDQGHCGSCWSFSATGALEGQHFRRTGILIPLSEQNLIDCS 178

Query: 173 TNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDV 232
               NNGC GG MD AF+YI  NKG+  +  Y YE  +        A +  A+   Y D+
Sbjct: 179 GKYGNNGCNGGLMDQAFQYIKDNKGLDTEVTYPYEAENDKC--RYNAANSGARDVGYVDI 236

Query: 233 PPNDEESLLKAVAN-QPVSVAIDAS--ALQFYSGGVF-NGYCET-FLNHGVTAVGYGTSE 287
           P  +E+ L  AVA   PVSVAIDAS  + QFYS GV+    C +  L+HGV AVGYGT E
Sbjct: 237 PQGNEKKLKAAVATIGPVSVAIDASHQSFQFYSEGVYYEPECSSENLDHGVLAVGYGTDE 296

Query: 288 EGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
            G  YWL+KNSWG+ WG++GY ++ R+       CGIA  AS+P+
Sbjct: 297 NGQDYWLVKNSWGETWGDNGYIKMARN---KLNHCGIASTASYPL 338


>gi|326430491|gb|EGD76061.1| cathepsin [Salpingoeca sp. ATCC 50818]
          Length = 381

 Score =  216 bits (549), Expect = 2e-53,   Method: Compositional matrix adjust.
 Identities = 123/283 (43%), Positives = 165/283 (58%), Gaps = 20/283 (7%)

Query: 31  AEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFN-NAAIGNRSYTLRLNKFADLT 89
           A  F+ +K  + + Y+   E ++RF IF DNL  + R N  AA G  ++T+ +N+FADLT
Sbjct: 17  AMSFDDFKTTFEKQYESPEEEARRFAIFADNLAFIARHNAEAARGLHTHTVGVNQFADLT 76

Query: 90  PQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQCA---- 145
            +E+        +  + + L       ++       SV+W +KGAVTP+K QGQC     
Sbjct: 77  NEEYRQ----LYLRPYPTELLGRERQEVWLDGPNAGSVDWRQKGAVTPIKNQGQCGSCWS 132

Query: 146 ---VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDA 202
                +VEG +AI    LVSLSEQQLVDC+ +  N GC GG MD+AFKYII N G+  + 
Sbjct: 133 FSTTGSVEGAHAIATGNLVSLSEQQLVDCSGSFGNQGCNGGLMDNAFKYIISNGGLDTEQ 192

Query: 203 VYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDAS--ALQF 260
            Y Y     G+CD  K   HA  I+ Y+DVP N+E+ L  AV   PVSVAI+A   + Q 
Sbjct: 193 DYPYTARD-GVCDKSKESKHAVSISGYKDVPQNNEDQLAAAVEKGPVSVAIEADQQSFQM 251

Query: 261 YSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDW 303
           YS GVF+G C T L+HGV  VGY TS+    YW++KNSWG  W
Sbjct: 252 YSSGVFSGPCGTNLDHGVLVVGY-TSD----YWIVKNSWGASW 289


>gi|297729067|ref|NP_001176897.1| Os12g0273900 [Oryza sativa Japonica Group]
 gi|255670225|dbj|BAH95625.1| Os12g0273900 [Oryza sativa Japonica Group]
          Length = 184

 Score =  216 bits (549), Expect = 2e-53,   Method: Compositional matrix adjust.
 Identities = 100/184 (54%), Positives = 127/184 (69%), Gaps = 1/184 (0%)

Query: 149 VEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEG 208
           +EG   +   +L+SLSEQ+LVDC  + N+ GC GG +D AF++I+ N G+T +A Y Y  
Sbjct: 1   MEGFVKLSTGKLISLSEQELVDCDVDGNDQGCEGGEIDGAFQFILSNGGLTAEANYPYT- 59

Query: 209 MSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASALQFYSGGVFNG 268
              G C +  A D AA I  YEDVP NDE SL+KAVA QPVSVA+DAS  QFY GGV  G
Sbjct: 60  AEDGRCKTTAAADVAASIRGYEDVPANDEPSLMKAVAGQPVSVAVDASKFQFYGGGVMAG 119

Query: 269 YCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFA 328
            C T L+HGVT +GYG + +G KYWL+KNSWG  WGE GY R+++DID  +G CG+AM  
Sbjct: 120 ECGTSLDHGVTVIGYGAASDGTKYWLVKNSWGTTWGEAGYLRMEKDIDDKRGMCGLAMQP 179

Query: 329 SFPV 332
           S+P 
Sbjct: 180 SYPT 183


>gi|82796372|gb|ABB91778.1| cathepsin L [Hymeniacidon perlevis]
          Length = 323

 Score =  216 bits (549), Expect = 2e-53,   Method: Compositional matrix adjust.
 Identities = 131/316 (41%), Positives = 177/316 (56%), Gaps = 22/316 (6%)

Query: 29  SIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADL 88
           S ++ +E WK ++ + Y +  E   R++I++ N   +E  +NA      +TL +NKF DL
Sbjct: 17  SFSQDWEDWKNEHNKKYSDDLEELTRYKIWQGNQKIIE-VHNANSDKFGFTLGMNKFGDL 75

Query: 89  TPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQCA--- 145
              EF     G+ M   S+S K       YK+    P+V+W  KGAVT VK QGQC    
Sbjct: 76  ESHEFAEMFNGYMMQARSNSTKVFVADPNYKAD---PTVDWRTKGAVTGVKNQGQCGSCW 132

Query: 146 ----VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITND 201
                 ++EG + +K  +LVSLSEQ LVDC+  + N GC GG MD AF+YI +N GI  +
Sbjct: 133 AFSTTGSLEGQHFLKTGKLVSLSEQNLVDCSGKEGNEGCNGGLMDQAFEYIKKNGGIDTE 192

Query: 202 AVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVAN-QPVSVAIDA--SAL 258
           A Y Y+          KA D  A  T Y D+   DE +L++AV    PVSVAIDA  S+ 
Sbjct: 193 ASYPYQAHDERC--RFKASDVGATCTGYVDIKREDENALMQAVEKIGPVSVAIDASHSSF 250

Query: 259 QFYSGGVF--NGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDID 316
           Q Y  GV+      +T L+HGV A+GYGT E G  YWL+KNSWG DWG +GY  + R+ +
Sbjct: 251 QLYRSGVYYERECSQTALDHGVLAIGYGT-EGGSDYWLVKNSWGTDWGMEGYIMMSRNRN 309

Query: 317 QPQGQCGIAMFASFPV 332
                CGIA  AS+P 
Sbjct: 310 N---NCGIATEASYPT 322


>gi|164420679|ref|NP_001037464.2| fibroinase precursor [Bombyx mori]
 gi|40556818|gb|AAR87763.1| fibroinase precursor [Bombyx mori]
          Length = 341

 Score =  216 bits (549), Expect = 2e-53,   Method: Compositional matrix adjust.
 Identities = 131/345 (37%), Positives = 196/345 (56%), Gaps = 28/345 (8%)

Query: 8   VVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVER 67
           +VL++    A  A  + FD   + E++  +K Q+   Y+   E++ R +I+ ++   + +
Sbjct: 4   LVLLLCAVAAVSAV-QFFD--LVKEEWSAFKLQHRLNYESEVEDNFRMKIYAEHKHIIAK 60

Query: 68  FNNA-AIGNRSYTLRLNKFADLTPQEFIASQTGF-KMSDHSSSL-----KANGTPFLYKS 120
            N    +G  SY L +NK+ D+   EF+ +  GF K + H+ +L        G  F+  +
Sbjct: 61  HNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPA 120

Query: 121 S-QVPPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCA 172
           + ++P  V+W + GAVT +K QG+C          A+EG +  +   LVSLSEQ L+DC+
Sbjct: 121 NVKLPEQVDWRKHGAVTDIKDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCS 180

Query: 173 TNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDV 232
               NNGC GG MD+AFKYI  N GI  +  Y YEG+          ++  A+   + D+
Sbjct: 181 EQYGNNGCNGGLMDNAFKYIKDNGGIDTEQTYPYEGVDDKC--RYNPKNTGAEDVGFVDI 238

Query: 233 PPNDEESLLKAVAN-QPVSVAIDAS--ALQFYSGGVFNGY--CETFLNHGVTAVGYGTSE 287
           P  DE+ L++AVA   PVSVAIDAS  + Q YS GV+N      T L+HGV  VGYGT E
Sbjct: 239 PEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEEECSSTDLDHGVLVVGYGTDE 298

Query: 288 EGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
           +G+ YWL+KNSWG+ WGE GY ++ R+      +CGIA  AS+P+
Sbjct: 299 QGVDYWLVKNSWGRSWGELGYIKMIRN---KNNRCGIASSASYPL 340


>gi|237844793|ref|XP_002371694.1| cathepsin L-like thiolproteinase, putative [Toxoplasma gondii ME49]
 gi|50313163|gb|AAT74529.1| toxopain-2 [Toxoplasma gondii]
 gi|89242977|gb|ABD64744.1| cathepsin L [Toxoplasma gondii]
 gi|95007485|emb|CAJ20707.1| toxopain-2 [Toxoplasma gondii RH]
 gi|211969358|gb|EEB04554.1| cathepsin L-like thiolproteinase, putative [Toxoplasma gondii ME49]
 gi|221480879|gb|EEE19300.1| cysteine protease, putative [Toxoplasma gondii GT1]
 gi|221501596|gb|EEE27366.1| cysteine protease, putative [Toxoplasma gondii VEG]
          Length = 422

 Score =  216 bits (549), Expect = 2e-53,   Method: Compositional matrix adjust.
 Identities = 122/324 (37%), Positives = 177/324 (54%), Gaps = 26/324 (8%)

Query: 25  FDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNK 84
           + E    + F  ++A Y ++Y    E  +R+ IFK+NLV +   N       SY+L++N 
Sbjct: 108 WKEAHFQDAFSSFQAMYAKSYATEEEKQRRYAIFKNNLVYIHTHNQQGY---SYSLKMNH 164

Query: 85  FADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYK--SSQVPPSVNWIEKGAVTPVKYQG 142
           F DL+  EF     GFK S +  S        L     S++P  V+W  +G VTPVK Q 
Sbjct: 165 FGDLSRDEFRRKYLGFKKSRNLKSHHLGVATELLNVLPSELPAGVDWRSRGCVTPVKDQR 224

Query: 143 QCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQN 195
            C          A+EG +  K  +LVSLSEQ+L+DC+  + N  C GG M+DAF+Y++ +
Sbjct: 225 DCGSCWAFSTTGALEGAHCAKTGKLVSLSEQELMDCSRAEGNQSCSGGEMNDAFQYVLDS 284

Query: 196 KGITNDAVYSY----EGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSV 251
            GI ++  Y Y    E      C+ +       +I  ++DVP   E ++  A+A  PVS+
Sbjct: 285 GGICSEDAYPYLARDEECRAQSCEKV------VKILGFKDVPRRSEAAMKAALAKSPVSI 338

Query: 252 AIDASAL--QFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIK-YWLIKNSWGQDWGEDGY 308
           AI+A  +  QFY  GVF+  C T L+HGV  VGYGT +E  K +W++KNSWG  WG DGY
Sbjct: 339 AIEADQMPFQFYHEGVFDASCGTDLDHGVLLVGYGTDKESKKDFWIMKNSWGTGWGRDGY 398

Query: 309 FRLQRDIDQPQGQCGIAMFASFPV 332
             +     + +GQCG+ + ASFPV
Sbjct: 399 MYMAMHKGE-EGQCGLLLDASFPV 421


>gi|226531284|ref|NP_001147086.1| thiol protease SEN102 precursor [Zea mays]
 gi|195607128|gb|ACG25394.1| thiol protease SEN102 precursor [Zea mays]
          Length = 356

 Score =  216 bits (549), Expect = 2e-53,   Method: Compositional matrix adjust.
 Identities = 138/328 (42%), Positives = 193/328 (58%), Gaps = 31/328 (9%)

Query: 30  IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLT 89
           + E+F+ W+A+Y RTY    E  +RF I+ +N+  ++  N  + G+ SY L  N+F DLT
Sbjct: 34  LLERFKAWQAEYNRTYATPEEFQQRFMIYSENVRFIKTMNQLSTGS-SYELGENQFTDLT 92

Query: 90  PQEF-------------IASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVT 136
            +EF              A   G  +   S++  +NG      + + P SV+W  KGAVT
Sbjct: 93  EEEFKDTYLMKLDEQPPAAEAMGPTVGTMSTAGMSNGN----NTGEAPNSVDWRTKGAVT 148

Query: 137 PVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAF 189
            VK Q QC        VA++EG++ IK  RLVSLSEQ++VDC    N+NGC GG    A 
Sbjct: 149 RVKDQQQCGSCWAFATVASIEGVHQIKTGRLVSLSEQEIVDCDRGGNDNGCRGGSPRSAM 208

Query: 190 KYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPV 249
           +++ +N G+T ++ Y Y G S   C S K   HAA+I  Y+ V  N+E  L +AVA +PV
Sbjct: 209 EWVTRNGGLTTESDYPYVG-SQRQCMSGKLGHHAARIRGYQAVQRNNEAELERAVAERPV 267

Query: 250 SVAIDAS-ALQFYSGGVFNGYCE-TFLNHGVTAVGYGTS---EEGIKYWLIKNSWGQDWG 304
           +V IDAS A QFY  GVF+G C+ T +NH VT VGYG++     G KYW++KNSWGQ WG
Sbjct: 268 AVFIDASRAFQFYKSGVFSGPCDTTTVNHVVTVVGYGSTGSDSGGRKYWIVKNSWGQGWG 327

Query: 305 EDGYFRLQRDIDQPQGQCGIAMFASFPV 332
           E+GY R+ R +   +G C IA+   +PV
Sbjct: 328 ENGYVRMARRVRAREGMCAIAIEPYYPV 355


>gi|449275508|gb|EMC84350.1| Cathepsin L1, partial [Columba livia]
          Length = 319

 Score =  216 bits (549), Expect = 2e-53,   Method: Compositional matrix adjust.
 Identities = 132/316 (41%), Positives = 182/316 (57%), Gaps = 24/316 (7%)

Query: 34  FEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFN-NAAIGNRSYTLRLNKFADLTPQE 92
           ++ WK+ + + Y E  E+ +R  +++ NL  +E  N +  +G  SY L +N+F D+T +E
Sbjct: 10  WQLWKSWHNKDYHEREESWRRV-VWEKNLKMIELHNLDHTLGKHSYKLGMNQFGDMTTEE 68

Query: 93  FIASQTGFKMSDHSSSLKANGTPFLYKS-SQVPPSVNWIEKGAVTPVKYQGQCA------ 145
           F     G+  +   S  K  G+ FL  S  + P SV+W EKG VTPVK QGQC       
Sbjct: 69  FRQLMNGY--AHKKSERKYRGSQFLEPSFLEAPRSVDWREKGYVTPVKDQGQCGSCWAFS 126

Query: 146 -VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVY 204
              A+EG +  K  +LVSLSEQ LVDC+  + N GC GG MD AF+Y+  N GI ++  Y
Sbjct: 127 TTGALEGQHFRKTGKLVSLSEQNLVDCSRPEGNQGCNGGLMDQAFQYVQDNGGIDSEESY 186

Query: 205 SYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVAN-QPVSVAIDA--SALQFY 261
            Y       C   KAE +AA  T + D+P   E +L+KAVA   PVSVAIDA  S+ QFY
Sbjct: 187 PYTAKDDEDC-RYKAEYNAANDTGFVDIPQGHERALMKAVAAVGPVSVAIDAGHSSFQFY 245

Query: 262 SGGV-FNGYCETF-LNHGVTAVGYGTSEE---GIKYWLIKNSWGQDWGEDGYFRLQRDID 316
             G+ +   C +  L+HGV  VGYG   E   G KYW++KNSWG+ WG+ GY  + +D  
Sbjct: 246 QSGIYYEPDCSSEDLDHGVLVVGYGFEGEDVDGKKYWIVKNSWGEKWGDKGYIYMAKD-- 303

Query: 317 QPQGQCGIAMFASFPV 332
             +  CGIA  AS+P+
Sbjct: 304 -RKNHCGIATAASYPL 318


>gi|115436338|ref|NP_001042927.1| Os01g0330300 [Oryza sativa Japonica Group]
 gi|13365805|dbj|BAB39243.1| putative SAG12 protein [Oryza sativa Japonica Group]
 gi|14164528|dbj|BAB55777.1| putative SAG12 protein [Oryza sativa Japonica Group]
 gi|113532458|dbj|BAF04841.1| Os01g0330300 [Oryza sativa Japonica Group]
 gi|125570199|gb|EAZ11714.1| hypothetical protein OsJ_01576 [Oryza sativa Japonica Group]
          Length = 367

 Score =  216 bits (549), Expect = 2e-53,   Method: Compositional matrix adjust.
 Identities = 130/322 (40%), Positives = 174/322 (54%), Gaps = 27/322 (8%)

Query: 34  FEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNN--------AAIGNRSYT---LRL 82
           F QW ++Y + Y    E  KR++++K N   +  F +         A   ++ T   + +
Sbjct: 51  FSQWMSKYSKRYSCPEEQEKRYQVWKANTDFIGAFRSQTEISSGVGAFAPQTVTDSFVGM 110

Query: 83  NKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQG 142
           N F DL   EF+   TGF  +   +   +     +   S +P  V+W   GAVT VK QG
Sbjct: 111 NLFGDLASGEFVRQFTGFNATGFVAPPPSPSP--IPPRSWLPCCVDWRSSGAVTGVKLQG 168

Query: 143 QCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQN 195
            CA       VAA+EG++ IK   LVSLSEQ +VDC T   +NGC GG  D A   +   
Sbjct: 169 SCASCWAFAAVAAIEGLHRIKTGELVSLSEQVMVDCDTG--SNGCGGGRSDTALGLVASR 226

Query: 196 KGITNDAVYSYEGMSTGICDSIKA-EDHAAQITNYEDVPPNDEESLLKAVANQPVSVAID 254
            G+T++  Y Y G   G CD  K   DH+A ++ +  VPPNDE  L  AVA QPV+V ID
Sbjct: 227 GGVTSEERYPYAGARGG-CDVGKLLSDHSASVSGFAAVPPNDERQLALAVARQPVTVYID 285

Query: 255 ASA--LQFYSGGVFNGYCET-FLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRL 311
           ASA   QFY GGV+ G C+   +NH VT VGY  +  G KYW+ KNSW  DWGE GY  L
Sbjct: 286 ASAPEFQFYKGGVYRGPCDPGRMNHAVTIVGYCENIGGDKYWIAKNSWSSDWGEQGYVYL 345

Query: 312 QRDIDQPQGQCGIAMFASFPVS 333
            +D+  PQG CG+A    +P +
Sbjct: 346 AKDVWWPQGTCGLATSPFYPTA 367


>gi|261289779|ref|XP_002611751.1| hypothetical protein BRAFLDRAFT_284345 [Branchiostoma floridae]
 gi|229297123|gb|EEN67761.1| hypothetical protein BRAFLDRAFT_284345 [Branchiostoma floridae]
          Length = 330

 Score =  215 bits (548), Expect = 2e-53,   Method: Compositional matrix adjust.
 Identities = 140/349 (40%), Positives = 196/349 (56%), Gaps = 48/349 (13%)

Query: 10  LIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFN 69
           L++  +C + AT  +    S   ++E +K ++ + Y E  E ++R  IF+DNL  +E  N
Sbjct: 3   LLVLLACVAMATAASL---SFESQWEAFKIKHDKVYSEKEEYARRL-IFQDNLKTIESHN 58

Query: 70  NAA-IGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYK---SSQVPP 125
             A  G  SY L +N+FAD+T  E++    G  +   +S+L   G+   Y+   + QV  
Sbjct: 59  QEADTGKHSYWLGVNQFADMTHAEYLNQVIGGCLI--TSNLTKTGSRATYRYMPNMQVND 116

Query: 126 SVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNN 178
           +V+W +KG VT +K QGQC          ++EG +A     LVSLSEQ LVDC+  + N 
Sbjct: 117 TVDWRDKGLVTDIKDQGQCGSCWAFSTTGSLEGQHAKATGTLVSLSEQNLVDCSRQEGNK 176

Query: 179 GCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDH---------AAQITNY 229
           GC GG MD  F+YIIQNKGI  +  Y Y           KA++H          A ++++
Sbjct: 177 GCEGGDMDQGFQYIIQNKGIDTEQCYPY-----------KAKNHRCKFDNSCIGATMSSF 225

Query: 230 EDVPPNDEESLLKAVAN-QPVSVAIDAS--ALQFYSGGVFNGY--CETFLNHGVTAVGYG 284
            DV   DE++L +A AN  P+SV IDAS  + QFYS GV+N +    T L+HGV  VGYG
Sbjct: 226 TDVTSGDEDALKQACANIGPISVGIDASHQSFQFYSSGVYNEFECSSTKLDHGVLVVGYG 285

Query: 285 TSEEGIK-YWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
           T   G K YWL+KNSWG  WG +GY  + R+ D    QCG+A  ASFPV
Sbjct: 286 TY--GSKDYWLVKNSWGTVWGNEGYIMMSRNKDN---QCGVATDASFPV 329


>gi|389610697|dbj|BAM18960.1| cathepsin L [Papilio polytes]
          Length = 341

 Score =  215 bits (548), Expect = 2e-53,   Method: Compositional matrix adjust.
 Identities = 136/346 (39%), Positives = 191/346 (55%), Gaps = 29/346 (8%)

Query: 7   IVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVE 66
           +VVL+   + AS  ++  FD   + E++  +K ++ + Y    E+  R +I+ +N   + 
Sbjct: 4   LVVLMCVVAAASAVSF--FD--LVKEEWNAFKMEHQKQYDSEVEDKFRMKIYAENKHKIA 59

Query: 67  RFNNA-AIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHS------SSLKANGTPFLYK 119
           + N   A G   + ++ NK+ D+   EF+ +  GF  +  +       S    G  F+  
Sbjct: 60  KHNQKFARGQVPFRVKQNKYGDMLHHEFVHTMNGFNKTTKNGKGLFGKSAGERGATFIPP 119

Query: 120 SS-QVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDC 171
           ++ +VP  V+W + GAVT VK QG+C       A  A+EG +  + N LVSLSEQ L+DC
Sbjct: 120 ANVRVPDHVDWRKHGAVTEVKDQGKCGSCWSFSATGALEGQHYRQTNILVSLSEQNLIDC 179

Query: 172 ATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYED 231
           +T   NNGC GG MD+AFKYI  NKGI  +  Y YE +           +  A    + D
Sbjct: 180 STAYGNNGCNGGLMDNAFKYIKDNKGIDTEKSYPYEAVDDKC--RYNPRNSGADDVGFID 237

Query: 232 VPPNDEESLLKAVAN-QPVSVAIDAS--ALQFYSGGV-FNGYC-ETFLNHGVTAVGYGTS 286
           +P  DE  L+ AVA   PVSVAIDAS    QFYS GV F+  C  T L+HGV  VGYGT 
Sbjct: 238 IPSGDEGKLMAAVATVGPVSVAIDASQETFQFYSDGVYFDENCSSTSLDHGVLVVGYGTD 297

Query: 287 EEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
           E G  YWL+KNSWG+ WG+ GY ++ R+ D     CGIA  ASFP+
Sbjct: 298 ENGGDYWLVKNSWGRSWGDLGYIKMARNRDN---HCGIATAASFPL 340


>gi|164472556|gb|ABY58967.1| cathepsin L [Toxoplasma gondii]
          Length = 421

 Score =  215 bits (548), Expect = 2e-53,   Method: Compositional matrix adjust.
 Identities = 122/324 (37%), Positives = 177/324 (54%), Gaps = 26/324 (8%)

Query: 25  FDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNK 84
           + E    + F  ++A Y ++Y    E  +R+ IFK+NLV +   N       SY+L++N 
Sbjct: 107 WKEAHFQDAFSSFQAMYAKSYATEEEKQRRYAIFKNNLVYIHTHNQQGY---SYSLKMNH 163

Query: 85  FADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYK--SSQVPPSVNWIEKGAVTPVKYQG 142
           F DL+  EF     GFK S +  S        L     S++P  V+W  +G VTPVK Q 
Sbjct: 164 FGDLSRDEFRRKYLGFKKSRNLKSHHLGVATELLNVLPSELPAGVDWRSRGCVTPVKDQR 223

Query: 143 QCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQN 195
            C          A+EG +  K  +LVSLSEQ+L+DC+  + N  C GG M+DAF+Y++ +
Sbjct: 224 DCGSCWAFSTTGALEGAHCAKTGKLVSLSEQELMDCSRAEGNQSCSGGEMNDAFQYVLDS 283

Query: 196 KGITNDAVYSY----EGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSV 251
            GI ++  Y Y    E      C+ +       +I  ++DVP   E ++  A+A  PVS+
Sbjct: 284 GGICSEDAYPYLARDEECRAQSCEKV------VKILGFKDVPRRSEAAMKAALAKSPVSI 337

Query: 252 AIDASAL--QFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIK-YWLIKNSWGQDWGEDGY 308
           AI+A  +  QFY  GVF+  C T L+HGV  VGYGT +E  K +W++KNSWG  WG DGY
Sbjct: 338 AIEADQMPFQFYHEGVFDASCGTDLDHGVLLVGYGTDKESKKDFWIMKNSWGTGWGRDGY 397

Query: 309 FRLQRDIDQPQGQCGIAMFASFPV 332
             +     + +GQCG+ + ASFPV
Sbjct: 398 MYMAMHKGE-EGQCGLLLDASFPV 420


>gi|125525718|gb|EAY73832.1| hypothetical protein OsI_01708 [Oryza sativa Indica Group]
          Length = 366

 Score =  215 bits (548), Expect = 2e-53,   Method: Compositional matrix adjust.
 Identities = 130/322 (40%), Positives = 174/322 (54%), Gaps = 27/322 (8%)

Query: 34  FEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNN--------AAIGNRSYT---LRL 82
           F QW ++Y + Y    E  KR++++K N   +  F +         A   ++ T   + +
Sbjct: 50  FSQWMSKYSKRYSCPEEQEKRYQVWKANTDFIGAFRSQTEISSGVGAFAPQTVTDSFVGM 109

Query: 83  NKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQG 142
           N F DL   EF+   TGF  +   +   +     +   S +P  V+W   GAVT VK QG
Sbjct: 110 NLFGDLASGEFVRQFTGFNATGFVAPPPSPSP--IPPRSWLPCCVDWRSSGAVTGVKLQG 167

Query: 143 QCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQN 195
            CA       VAA+EG++ IK   LVSLSEQ +VDC T   +NGC GG  D A   +   
Sbjct: 168 SCASCWAFAAVAAIEGLHRIKTGELVSLSEQVMVDCDTG--SNGCGGGRSDTALGLVASR 225

Query: 196 KGITNDAVYSYEGMSTGICDSIKA-EDHAAQITNYEDVPPNDEESLLKAVANQPVSVAID 254
            G+T++  Y Y G   G CD  K   DH+A ++ +  VPPNDE  L  AVA QPV+V ID
Sbjct: 226 GGVTSEERYPYAGARGG-CDVGKLLSDHSASVSGFAAVPPNDERQLALAVARQPVTVYID 284

Query: 255 ASA--LQFYSGGVFNGYCET-FLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRL 311
           ASA   QFY GGV+ G C+   +NH VT VGY  +  G KYW+ KNSW  DWGE GY  L
Sbjct: 285 ASAPEFQFYKGGVYRGPCDPGRMNHAVTIVGYCENIGGDKYWIAKNSWSSDWGEQGYVYL 344

Query: 312 QRDIDQPQGQCGIAMFASFPVS 333
            +D+  PQG CG+A    +P +
Sbjct: 345 AKDVWWPQGTCGLATSPFYPTA 366


>gi|402770517|gb|AFQ98393.1| cathepsin L [Rhipicephalus microplus]
          Length = 332

 Score =  215 bits (548), Expect = 2e-53,   Method: Compositional matrix adjust.
 Identities = 141/344 (40%), Positives = 190/344 (55%), Gaps = 36/344 (10%)

Query: 6   LIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAV 65
           L  +  ++ + +SQ   RT        ++E +K  + +TY+   E   RF+IF +N + +
Sbjct: 7   LCAIAAVTVAASSQEILRT--------QWEAFKTTHKKTYQSHMEELLRFKIFTENSLII 58

Query: 66  ERFN-NAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFL----YKS 120
            + N   A G  SY L +N+F DL   EF     G     H  + K  G+ FL       
Sbjct: 59  AKHNAKYAKGLVSYKLGMNQFGDLLAHEFARIFNG-----HHGTRKTGGSSFLPPANVND 113

Query: 121 SQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCAT 173
           S +P  V+W +KGAVTPVK QGQC       A  ++EG + +K   LVSLSEQ LVDC+ 
Sbjct: 114 SSLPKVVDWRKKGAVTPVKDQGQCGSCWAFSATGSLEGQHFLKNGELVSLSEQNLVDCSQ 173

Query: 174 NDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVP 233
           +  NNGC GG M+DAFKYI  N GI  +  Y Y+ +  G C   K ED  A  T Y ++ 
Sbjct: 174 SFGNNGCEGGLMEDAFKYIKANDGIDTEKSYPYKAVD-GEC-RFKKEDVGATDTGYVEIK 231

Query: 234 PNDEESLLKAVAN-QPVSVAIDA--SALQFYSGGVFN-GYCETF-LNHGVTAVGYGTSEE 288
              E  L KAVA   P+SVAIDA  S+ Q YS GV++   C +  L+HGV  VGYG  + 
Sbjct: 232 AGSEVDLKKAVATVGPISVAIDASHSSFQLYSEGVYDEPECSSEDLDHGVLVVGYGV-KG 290

Query: 289 GIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
           G KYWL+KNSW + WG+ GY  + RD +    QCGIA  AS+P+
Sbjct: 291 GKKYWLVKNSWAESWGDQGYILMSRDNNN---QCGIASQASYPL 331


>gi|262410743|gb|ACY66807.1| cathepsin L [Aphis gossypii]
          Length = 341

 Score =  215 bits (548), Expect = 2e-53,   Method: Compositional matrix adjust.
 Identities = 127/320 (39%), Positives = 179/320 (55%), Gaps = 22/320 (6%)

Query: 30  IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNA-AIGNRSYTLRLNKFADL 88
           I E++  +KAQ+ + Y++  E + R +++ DN + + R N     G  +Y L +N F DL
Sbjct: 26  IEEEWSLFKAQFKKIYEDVKEEAFRKKVYLDNKLKIARHNKLYETGEETYALEMNHFGDL 85

Query: 89  TPQEFIASQTGFK--MSDHSSSLKANGTPFLYKSSQV--PPSVNWIEKGAVTPVKYQGQC 144
              E+     GFK  ++    +   +      KS  V  P +++W +KG VTPVK QGQC
Sbjct: 86  MQHEYKKMMNGFKPSLAGGDKNFTDDDAVTFLKSENVVVPKAIDWRKKGYVTPVKNQGQC 145

Query: 145 -------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKG 197
                  A  ++EG +  K   LVSLSEQ L+DC+    NNGC GG MD AFKYI  NKG
Sbjct: 146 GSCWSFSATGSLEGQHFRKTGVLVSLSEQNLIDCSRKYGNNGCEGGLMDLAFKYIKSNKG 205

Query: 198 ITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVAN-QPVSVAIDAS 256
           +  +  Y YE            E+  A    + D+P  DE++L+ A+A   PVS+AIDAS
Sbjct: 206 LDTEKSYPYEAEDDKC--RYNPENSGATDKGFVDIPEGDEDALMHALATVGPVSIAIDAS 263

Query: 257 A--LQFYSGGVF-NGYC-ETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQ 312
           +   QFY  GVF N  C  T L+HGV AVGYGT  +G  YW++KNSWG+ WG+ GY  + 
Sbjct: 264 SEKFQFYKKGVFYNPRCSSTELDHGVLAVGYGTDHKGGDYWIVKNSWGKTWGDQGYIMMA 323

Query: 313 RDIDQPQGQCGIAMFASFPV 332
           R+    +  CG+A  AS+P+
Sbjct: 324 RN---KKNNCGVASSASYPL 340


>gi|392881548|gb|AFM89606.1| cathepsin L [Callorhinchus milii]
          Length = 338

 Score =  215 bits (548), Expect = 2e-53,   Method: Compositional matrix adjust.
 Identities = 130/345 (37%), Positives = 197/345 (57%), Gaps = 29/345 (8%)

Query: 5   FLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVA 64
           F+++ L ++G  A+     + D G +   +EQWK+ +G++Y++  E  +R  +++ +L  
Sbjct: 5   FVVLSLCLAGGLAAP----SLDPG-LDTHWEQWKSWHGKSYEQKEETWRRM-VWEKHLRV 58

Query: 65  VERFN-NAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQ- 122
           +E  N   ++G  S+ L +N F D+  +EF     G+K     +  K  G+ FL  + Q 
Sbjct: 59  IEIHNLEHSLGKHSFRLGMNHFGDMPNEEFRQLMNGYKYK--QTHKKLQGSHFLEPNFQE 116

Query: 123 VPPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATND 175
           VP  V+W ++G VTPVK QGQC          A+EG +  +  +LVSLSEQ LV+C+  +
Sbjct: 117 VPKHVDWRDEGYVTPVKDQGQCGSCWAFSTTGALEGQHFRRTGQLVSLSEQNLVECSKPE 176

Query: 176 NNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPN 235
            N GC GG MD AF+Y+  N GI ++  Y Y G     C     + +AA  T + D+P  
Sbjct: 177 GNEGCNGGLMDQAFQYVKDNGGIDSEDSYPYVGTDDTPC-HYNPQYNAANDTGFVDIPSG 235

Query: 236 DEESLLKAVAN-QPVSVAIDA--SALQFYSGGV-FNGYC-ETFLNHGVTAVGYGTSE--- 287
            E +L+KA+A   PVSVAIDA  ++ QFY  G+ F   C  T L+HGV  VGYG  +   
Sbjct: 236 KERALMKAIAAVGPVSVAIDAGHTSFQFYQSGIYFEAECSSTDLDHGVLVVGYGVEKRDT 295

Query: 288 EGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
           +G KYW++KNSW + WG++GY  + +D D     CGIA  AS+P+
Sbjct: 296 DGKKYWIVKNSWSEKWGQNGYILMAKDKDN---HCGIATAASYPL 337


>gi|261289789|ref|XP_002611756.1| hypothetical protein BRAFLDRAFT_236363 [Branchiostoma floridae]
 gi|229297128|gb|EEN67766.1| hypothetical protein BRAFLDRAFT_236363 [Branchiostoma floridae]
          Length = 308

 Score =  215 bits (548), Expect = 2e-53,   Method: Compositional matrix adjust.
 Identities = 129/307 (42%), Positives = 180/307 (58%), Gaps = 23/307 (7%)

Query: 42  GRTYKESAENSKRFEIFKDNLVAVERFNN-AAIGNRSYTLRLNKFADLTPQEF--IASQT 98
           G+ Y   +E + R  IF++N   V++ N  AA+G  ++ +++NKF DLT +EF  I   +
Sbjct: 8   GKQYNSLSEENARHSIFEENSKIVKQHNEEAAMGKHTFFMKMNKFGDLTTEEFRMIVIGS 67

Query: 99  GFKMSDHSSSLKANGTPF-LYKSSQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVE 150
           GF  S+ +   +A G  F      +V  +V+W +KGAVT VK Q QC       A  ++E
Sbjct: 68  GFMQSNKTQ--QAEGGVFESLPGLKVDDTVDWRQKGAVTKVKNQEQCGSCWAFSATGSLE 125

Query: 151 GINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMS 210
           G + +K N LVSLSEQ LVDC+  + N GC GG MD AFKYI  N GI  +  YSY G  
Sbjct: 126 GQHFLKTNNLVSLSEQNLVDCSRREGNKGCKGGSMDQAFKYIKMNGGIDTEECYSYRGRD 185

Query: 211 TGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVAN-QPVSVAIDA--SALQFYSGGVFN 267
             +C   K+    A +++Y D+   DE +L++AV+   P+SVAIDA   + Q Y  GV++
Sbjct: 186 ESMC-RYKSSCSGATLSSYTDIKTGDEMALMQAVSTVGPISVAIDAGHKSFQLYHHGVYD 244

Query: 268 --GYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIA 325
                 T L+HGV AVGYG+S  G  YWL+KNSWG +WG +GY  + R+      QCGIA
Sbjct: 245 EPKCSSTHLDHGVLAVGYGSS-NGSDYWLVKNSWGTEWGMEGYIMMSRN---KHNQCGIA 300

Query: 326 MFASFPV 332
             A +PV
Sbjct: 301 TRAIYPV 307


>gi|305434756|gb|ADM53740.1| cathepsin L1 precursor [Lepeophtheirus salmonis]
          Length = 325

 Score =  215 bits (547), Expect = 3e-53,   Method: Compositional matrix adjust.
 Identities = 128/319 (40%), Positives = 185/319 (57%), Gaps = 23/319 (7%)

Query: 28  GSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFN-NAAIGNRSYTLRLNKFA 86
           G ++ ++  W   +G+TY        R +IF++N + +++ N  A  G  +Y+L +N++ 
Sbjct: 15  GELSGEWTLWTKLHGKTYTSFEIEELRVKIFEENRIKIQKHNAEAQNGLHTYSLEMNQYG 74

Query: 87  DLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQCA- 145
           DL   EF+   TG     +S     + T  L  S+ VP  VNW + GAVT VK Q  C  
Sbjct: 75  DLLQSEFLQGYTGLAKGSYS----GDNTVILDNSAPVPSYVNWTKNGAVTAVKDQKDCGS 130

Query: 146 ------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGIT 199
                   +VEG   IK  +L+S SEQQLVDC+++  N GC GG+MD+AFKY+I NKGI 
Sbjct: 131 CWAFSTTGSVEGQYFIKNKKLLSFSEQQLVDCSSDFRNEGCNGGWMDNAFKYLIANKGIA 190

Query: 200 NDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVAN-QPVSVAIDASA- 257
            +  Y Y   + G+C   K    A +I++++DV    E+ L  AVA   P+SVAIDAS+ 
Sbjct: 191 TEDTYPYTA-TDGVCVYNKTM-AAGRISSFKDVKHGSEDQLKLAVAQIGPISVAIDASSG 248

Query: 258 -LQFYSGGVF-NGYCET-FLNHGVTAVGYGTSE-EGIKYWLIKNSWGQDWGEDGYFRLQR 313
             QFY  GV+ +  C + +L+HGV AVGYGT +  G+ YWL+KNSW   WG+ GY ++ R
Sbjct: 249 DFQFYKKGVYVDEECSSKYLDHGVLAVGYGTDKGTGLDYWLVKNSWSASWGDQGYIKMAR 308

Query: 314 DIDQPQGQCGIAMFASFPV 332
           +    +  CGIA  AS+PV
Sbjct: 309 N---HKNMCGIASLASYPV 324


>gi|291398027|ref|XP_002715626.1| PREDICTED: cathepsin S [Oryctolagus cuniculus]
          Length = 331

 Score =  215 bits (547), Expect = 3e-53,   Method: Compositional matrix adjust.
 Identities = 134/342 (39%), Positives = 188/342 (54%), Gaps = 27/342 (7%)

Query: 3   KYFLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNL 62
           K+ +  +L+ S + A      T D       +  WK  YG+ YKE  E + R  I++ NL
Sbjct: 2   KWLVWALLVCSSTVAQLHRDPTLDH-----HWHLWKKAYGKQYKEKNEEAARRLIWEKNL 56

Query: 63  VAVERFN-NAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSS 121
             V   N   ++G  SY + +N  AD+T +E ++  +  ++         N T  L  + 
Sbjct: 57  KFVTLHNLEHSMGMHSYDVGMNHLADMTSEEVVSLMSSLRIPH---QWPRNVTYKLNPNQ 113

Query: 122 QVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATN 174
           ++P SV+W E+G VT VKYQG C       AV A+E    +K   LVSLS Q LVDC+T 
Sbjct: 114 KLPDSVDWRERGCVTEVKYQGSCGACWAFSAVGALEAQLKLKTGNLVSLSAQNLVDCSTT 173

Query: 175 D-NNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVP 233
              N GC GGFM +AF+YII N GI ++A Y Y+ M         ++  AA  + Y ++P
Sbjct: 174 KYGNKGCNGGFMTEAFQYIIDNNGIDSEASYPYKAMDQKC--HYDSKHRAATCSKYTELP 231

Query: 234 PNDEESLLKAVANQ-PVSVAIDASALQFY---SGGVFNGYCETFLNHGVTAVGYGTSEEG 289
              EE+L +AVAN+ PVSVAIDAS   F+   SG  +   C   +NHGV AVGYG + +G
Sbjct: 232 FGSEEALKEAVANKGPVSVAIDASHSSFFLYRSGVYYEPSCTQNVNHGVLAVGYG-NLKG 290

Query: 290 IKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFP 331
             YWL+KNSWG  +GE GY R+ R+    +  CGIA + S+P
Sbjct: 291 KDYWLVKNSWGIHFGEQGYIRMARN---SKNHCGIANYPSYP 329


>gi|32396018|gb|AAP41846.1| cysteine protease [Anthurium andraeanum]
          Length = 502

 Score =  215 bits (547), Expect = 3e-53,   Method: Compositional matrix adjust.
 Identities = 125/323 (38%), Positives = 179/323 (55%), Gaps = 26/323 (8%)

Query: 32  EKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYT----LRLNKFAD 87
           E FE+W  ++ + Y    E ++R+  F  NL  V + N  A G R+ +    + +N FAD
Sbjct: 49  ELFERWMEKHRKVYAHPGEKARRYANFLSNLAFVRKRN--AEGRRAPSSGQGVGMNVFAD 106

Query: 88  LTPQEFIASQTGF----KMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQ 143
           L+ +EF    +      K ++   + +  G   +      P S++W ++GAVT VK QG 
Sbjct: 107 LSNEEFREVYSSRVLRKKAAEGRGARRRAGEGRVVAGCDAPASLDWRKRGAVTAVKNQGD 166

Query: 144 C-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNK 196
           C       +  A+EGINAI    L+SLSEQ+LVDC T   N GC GG+MD AF+++I N 
Sbjct: 167 CGSCWAFSSTGAMEGINAITTGELISLSEQELVDCDTT--NEGCDGGYMDYAFEWVINNG 224

Query: 197 GITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDAS 256
           GI ++A Y Y G +  +C++ K E     I  YEDV    E +LL A   QPVSV ID S
Sbjct: 225 GIDSEANYPYTGQADSVCNTTKEEIKVVSIDGYEDV-ATSESALLCAAVQQPVSVGIDGS 283

Query: 257 AL--QFYSGGVFNGYCE---TFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRL 311
           +L  Q Y+GG+++G C      ++H V  VGYG  + G  YW++KNSWG DWG  GY  +
Sbjct: 284 SLDFQLYAGGIYDGDCSGNPDDIDHAVLVVGYG-QQGGTDYWIVKNSWGTDWGMQGYIYI 342

Query: 312 QRDIDQPQGQCGIAMFASFPVSK 334
           +R+   P G C I   AS+P  +
Sbjct: 343 RRNTGLPYGVCAIDAMASYPTKQ 365


>gi|348525618|ref|XP_003450319.1| PREDICTED: cathepsin S-like [Oreochromis niloticus]
          Length = 330

 Score =  215 bits (547), Expect = 3e-53,   Method: Compositional matrix adjust.
 Identities = 132/342 (38%), Positives = 189/342 (55%), Gaps = 31/342 (9%)

Query: 6   LIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAV 65
           L+ VL+++ S   +     FDE S+  ++E+WK+ + R Y    E   R  I++ N+  +
Sbjct: 4   LVCVLLLATSALGR-----FDESSLDAQWEEWKSTHRREYNGLGEEGIRRAIWEKNMRMI 58

Query: 66  ERFNN-AAIGNRSYTLRLNKFADLTPQEFIASQTGFKM---SDHSSSLKANGTPFLYKSS 121
           E  N  AA+G  S+ + +N   D+T +E +   TG ++    + S +L  +  P     S
Sbjct: 59  EAHNEEAALGIHSFEMGMNHLGDMTSEEVVEKMTGLQIPMNQERSFTLAMDDMP-----S 113

Query: 122 QVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATN 174
           ++P SV++ +KG VT VK QG C       A  A+EG  A    +LV LS Q LVDC+  
Sbjct: 114 KIPKSVDYRKKGMVTSVKNQGACGSCWAFSAAGALEGQLAKSTGKLVDLSPQNLVDCSGK 173

Query: 175 DNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPP 234
             N+GC GGFM  AF+Y+I N GI +DA Y Y G              AA  ++Y+ +P 
Sbjct: 174 YGNHGCNGGFMTRAFQYVIDNHGIDSDASYPYTGRDEQC--RYNPATRAANCSSYQFLPE 231

Query: 235 NDEESLLKAVAN-QPVSVAIDAS--ALQFYSGGVFNG-YCETFLNHGVTAVGYGTSEEGI 290
            DE +L +A+A   P+SVAIDA      FY  GV+N   C   +NHGV AVGYG S  G 
Sbjct: 232 GDENALKQALATIGPISVAIDARRPRFSFYRSGVYNDPSCTQEVNHGVLAVGYG-SLNGQ 290

Query: 291 KYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
            YWL+KNSWG  +G+ GY R+ R+      QCGIA++A +PV
Sbjct: 291 DYWLVKNSWGSTFGDQGYIRMARNTGN---QCGIALYACYPV 329


>gi|392884266|gb|AFM90965.1| cathepsin L [Callorhinchus milii]
          Length = 338

 Score =  215 bits (547), Expect = 3e-53,   Method: Compositional matrix adjust.
 Identities = 130/349 (37%), Positives = 199/349 (57%), Gaps = 29/349 (8%)

Query: 1   MAKYFLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKD 60
           M   F+++ L ++G  A+     + D G +   +EQWK+ +G++Y++  E  +R  ++++
Sbjct: 1   MRLPFVVLSLCLAGGLAAP----SLDPG-LDTHWEQWKSWHGKSYEQKEETWRRM-VWEE 54

Query: 61  NLVAVERFN-NAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYK 119
           +L  +E  N   ++G  S+ L +N F D+  +EF     G+K     +  K  G+ FL  
Sbjct: 55  HLRVIEIHNLEHSLGKHSFRLGMNHFGDMPNEEFRQLMNGYKYK--QTHKKLQGSHFLEP 112

Query: 120 S-SQVPPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDC 171
           +  +VP  V+W ++G VTPVK QGQC          A+EG +  +  +LVSLSEQ LV+C
Sbjct: 113 NFLEVPKHVDWRDEGYVTPVKDQGQCGSCWAFSTTGALEGQHFRRTGQLVSLSEQNLVEC 172

Query: 172 ATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYED 231
           +  + N GC GG MD AF+Y+  N GI ++  Y Y G     C     + +AA  T + D
Sbjct: 173 SKPEGNEGCNGGLMDQAFQYVKDNGGIDSEDSYPYVGTDDTPC-HYNPQYNAANDTGFVD 231

Query: 232 VPPNDEESLLKAVAN-QPVSVAIDA--SALQFYSGGV-FNGYC-ETFLNHGVTAVGYGTS 286
           +P   E +L+KA+A   PVSVAIDA  ++ QFY  G+ F   C  T L+HGV  VGYG  
Sbjct: 232 IPSGKERALMKAIAAVGPVSVAIDAGHTSFQFYQSGIYFEAECSSTDLDHGVLVVGYGVE 291

Query: 287 E---EGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
           +   +G KYW++KNSW + WG++GY  + +D D     CGIA  AS+P+
Sbjct: 292 KRDTDGKKYWIVKNSWSEKWGQNGYILMAKDKDN---HCGIATAASYPL 337


>gi|223646726|gb|ACN10121.1| Cathepsin L1 precursor [Salmo salar]
 gi|223672581|gb|ACN12472.1| Cathepsin L1 precursor [Salmo salar]
          Length = 338

 Score =  215 bits (547), Expect = 3e-53,   Method: Compositional matrix adjust.
 Identities = 134/348 (38%), Positives = 189/348 (54%), Gaps = 30/348 (8%)

Query: 2   AKYFLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDN 61
           A Y  ++VL +S  CA+      FD   + + +  WK  + ++Y ES E  +R  +++ N
Sbjct: 3   ALYLAVLVLCVSAVCAAP----RFDS-QLEDHWHLWKNWHSKSYHESEEGWRRM-VWEKN 56

Query: 62  LVAVERFN-NAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKS 120
           L  +E  N    +G  SY L +N F D+T +EF  +  G+K    ++  K  G+ F+  +
Sbjct: 57  LKKIEMHNLEHTMGKHSYRLGMNHFGDMTNEEFRQTMNGYK---QTTERKFKGSLFMEPN 113

Query: 121 S-QVPPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCA 172
             Q P +V+W EKG VTPVK QG C          A+EG    K  +LVSLSEQ LVDC+
Sbjct: 114 YLQAPKAVDWREKGYVTPVKDQGSCGSCWAFSTTGAMEGQQFRKTGKLVSLSEQNLVDCS 173

Query: 173 TNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDV 232
             + N GC GG MD AF+YI  N G+  +  Y Y G     C   K E   A  T + D+
Sbjct: 174 RPEGNEGCNGGLMDQAFQYIQDNAGLDTEESYPYVGTDEDPC-HYKPEFSGANETGFVDI 232

Query: 233 PPNDEESLLKAVAN-QPVSVAIDA--SALQFYSGGV-FNGYCET-FLNHGVTAVGYGTSE 287
           P   E +++KAVA   PVSVAIDA   + QFY  G+ +   C +  L+HGV  VGYG   
Sbjct: 233 PSGKEHAMMKAVAAVGPVSVAIDAGHESFQFYEFGIYYEKECSSEELDHGVLVVGYGFEG 292

Query: 288 E---GIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
           E   G KYW++KNSW + WG+ GY  + +D    +  CGIA  +S+P+
Sbjct: 293 EDVDGKKYWIVKNSWSEKWGDKGYIYMAKD---RKNHCGIATASSYPL 337


>gi|254746340|emb|CAX16635.1| putative C1A cysteine protease precursor [Manduca sexta]
          Length = 342

 Score =  215 bits (547), Expect = 3e-53,   Method: Compositional matrix adjust.
 Identities = 131/323 (40%), Positives = 182/323 (56%), Gaps = 25/323 (7%)

Query: 30  IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNA-AIGNRSYTLRLNKFADL 88
           + E++  +K Q+ + Y    E+  R +I+ +N   + + N     G  SY L  NK+ D+
Sbjct: 24  VKEEWVAFKMQHDKKYDSEVEDRFRMKIYAENKHKIAKHNQLYEQGLVSYKLGPNKYTDM 83

Query: 89  TPQEFIASQTGF-KMSDHSSSL-----KANGTPFLYKSS-QVPPSVNWIEKGAVTPVKYQ 141
              EFI +  G+ + + H+  L        G  F+  +  + P  V+W +KGAVT VK Q
Sbjct: 84  LHHEFIQAMNGYNRTAKHNKGLYGKKHDVRGATFIPPAHVKYPDHVDWTKKGAVTEVKDQ 143

Query: 142 GQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQ 194
           G+C          A+EG +  K   LVSLSEQ L+DC++   NNGC GG MD+AFKYI  
Sbjct: 144 GKCGSCWAFSTTGALEGQHFRKSGYLVSLSEQNLIDCSSTYGNNGCNGGLMDNAFKYIKD 203

Query: 195 NKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVAN-QPVSVAI 253
           N GI  +  Y YEG+          ++  A+   + D+P  DEE L++AVA   PVSVAI
Sbjct: 204 NGGIDTEKTYPYEGVDDKC--RYNPKNSGAEDVGFVDIPSGDEEKLMQAVATVGPVSVAI 261

Query: 254 DAS--ALQFYSGGV-FNGYC-ETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYF 309
           DAS  + QFYSGGV ++  C  T L+HGV  VGYGT E G  YWL+KNSW + WGE GY 
Sbjct: 262 DASQNSFQFYSGGVYYDTECSSTDLDHGVLVVGYGTDEAGGDYWLVKNSWSRTWGELGYI 321

Query: 310 RLQRDIDQPQGQCGIAMFASFPV 332
           ++ R+ D     CGIA  AS+P+
Sbjct: 322 KMARNRDN---HCGIATDASYPL 341


>gi|156399477|ref|XP_001638528.1| predicted protein [Nematostella vectensis]
 gi|156225649|gb|EDO46465.1| predicted protein [Nematostella vectensis]
          Length = 325

 Score =  215 bits (547), Expect = 3e-53,   Method: Compositional matrix adjust.
 Identities = 142/340 (41%), Positives = 190/340 (55%), Gaps = 36/340 (10%)

Query: 6   LIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAV 65
           L+V ++I+         + F E S   ++  WK  +G+TY    E+ +R  I+ DNL  V
Sbjct: 8   LLVAVLIA---------QCFSELSQDRQWHAWKDFHGKTYTGEEEDLRR-AIWNDNLEIV 57

Query: 66  ERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSS-QVP 124
           ++ N     N SY L +N FADLT  EF     G++ + +S+     G+ FL  S+ Q+P
Sbjct: 58  KKHNAE---NHSYKLDMNHFADLTVTEFKQRFMGYRAASNSTG----GSTFLPLSNVQLP 110

Query: 125 PSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNN 177
             V+W +KG VT VK QGQC       +  ++EG +  K  +LVSLSEQ LVDC+    N
Sbjct: 111 AEVDWRDKGFVTAVKNQGQCGSCWAFSSTGSLEGQHFRKTGKLVSLSEQNLVDCSKKYGN 170

Query: 178 NGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDE 237
           NGC GG MD AFKYI  N GI  +  Y Y     G C   K     A +T Y DV    E
Sbjct: 171 NGCEGGLMDYAFKYIKNNDGIDTEQSYPYTARD-GQC-HFKPGSVGATVTGYTDVQRGSE 228

Query: 238 ESLLKAVAN-QPVSVAIDA--SALQFYSGGVFN--GYCETFLNHGVTAVGYGTSEEGIKY 292
             L  AVA   P+SVAIDA  S+ Q Y  GV++      T L+HGV AVGYG +E+G  Y
Sbjct: 229 GDLQSAVATVGPISVAIDAGHSSFQLYKTGVYSEPDCSSTQLDHGVLAVGYG-AEDGKDY 287

Query: 293 WLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
           WL+KNSWG+ WG +GY ++ R+ D    QCGIA  AS+P+
Sbjct: 288 WLVKNSWGEGWGMNGYIKMSRNKDN---QCGIATQASYPL 324


>gi|217323618|gb|ACK38176.1| midgut cysteine peptidase, partial [Sphenophorus levis]
          Length = 324

 Score =  215 bits (547), Expect = 3e-53,   Method: Compositional matrix adjust.
 Identities = 135/313 (43%), Positives = 181/313 (57%), Gaps = 29/313 (9%)

Query: 34  FEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFN-NAAIGNRSYTLRLNKFADLTPQE 92
           F+ +K ++G+TYK  AE +KRF IF++NL  +E  N     G  SYT  +NKFAD+T  E
Sbjct: 26  FQSFKLKHGKTYKNQAEETKRFAIFRENLRKIEAHNAEYKQGIHSYTQGINKFADMTRAE 85

Query: 93  F---IASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQCA---- 145
           F   +A+Q   K      S+ A  T  L     VP S++W  +  VTP+K Q QC     
Sbjct: 86  FKAMLATQVKTK-----PSIVATKTFQLADGVSVPESIDWRSRNVVTPIKDQAQCGSCWS 140

Query: 146 ---VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDA 202
              V + EG  A+   +L   SEQQLVDC T D N GC GG++DD F YI Q  G+  ++
Sbjct: 141 FAVVGSTEGAYALSTGKLTRFSEQQLVDC-TTDLNYGCDGGYLDDTFPYI-QTNGLELES 198

Query: 203 VYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQ-PVSVAIDASALQFY 261
            Y Y G   G C S  +     ++++Y  VP N E++LL+AV    PV++AI+A  LQFY
Sbjct: 199 DYPYTGYD-GSC-SYDSSKVVTKVSSYVSVPAN-EQALLEAVGTAGPVAIAINADDLQFY 255

Query: 262 SGGVFNG-YCET-FLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQ 319
             G+ +  YC+  +L+HGV AVGY  SE G+ YWLIKNSWG DWGE GYFR  R     Q
Sbjct: 256 FSGIIDDKYCDPEWLDHGVLAVGY-NSENGLDYWLIKNSWGADWGESGYFRFLRG----Q 310

Query: 320 GQCGIAMFASFPV 332
             CG+   A +P+
Sbjct: 311 NICGVKEDAVYPL 323


>gi|219687002|dbj|BAH08632.1| daikon cysteine protease RD21 [Raphanus sativus]
          Length = 289

 Score =  215 bits (547), Expect = 3e-53,   Method: Compositional matrix adjust.
 Identities = 114/221 (51%), Positives = 146/221 (66%), Gaps = 12/221 (5%)

Query: 123 VPPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATND 175
           +P SV+W ++GAV  VK QG C        + AVEGIN I    L+SLSEQ+LVDC T+ 
Sbjct: 3   IPESVDWRKEGAVAAVKDQGSCGSCWAFSTIGAVEGINKIVTGDLISLSEQELVDCDTS- 61

Query: 176 NNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPN 235
            N GC GG MD AF++II+N GI  +  Y Y+  + G CD  +       I  YEDVP N
Sbjct: 62  YNQGCNGGLMDYAFEFIIKNGGIDTEEDYPYK-AADGRCDQNRKNAKVVTIDAYEDVPEN 120

Query: 236 DEESLLKAVANQPVSVAIDAS--ALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYW 293
           +E +L KA+ANQP+SVAI+A   A Q YS GVF+G C T L+HGV AVGYGT E G  YW
Sbjct: 121 NEAALKKALANQPISVAIEAGGRAFQLYSSGVFDGTCGTELDHGVVAVGYGT-ENGKDYW 179

Query: 294 LIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVSK 334
           +++NSWG  WGE GY ++ R+I +  G+CGIAM AS+P+ K
Sbjct: 180 IVRNSWGGSWGESGYIKMARNIAEATGKCGIAMEASYPIKK 220


>gi|351721011|ref|NP_001238219.1| P34 probable thiol protease precursor [Glycine max]
 gi|1199563|gb|AAB09252.1| 34 kDa maturing seed vacuolar thiol protease precursor [Glycine
           max]
          Length = 379

 Score =  214 bits (546), Expect = 3e-53,   Method: Compositional matrix adjust.
 Identities = 127/332 (38%), Positives = 178/332 (53%), Gaps = 24/332 (7%)

Query: 30  IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLT 89
           ++  F+ WK+++GR Y    E +KR EIFK+N   +   N       S+ L LNKFAD+T
Sbjct: 40  VSSLFQLWKSEHGRVYHNHEEEAKRLEIFKNNSNYIRDMNANRKSPHSHRLGLNKFADIT 99

Query: 90  PQEFIAS--QTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQC--- 144
           PQEF     Q    +S              Y     P S +W +KG +T VKYQG C   
Sbjct: 100 PQEFSKKYLQAPKDVSQQIKMANKKMKKEQYSCDHPPASWDWRKKGVITQVKYQGGCGRG 159

Query: 145 ----AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITN 200
               A  A+E  +AI    LVSLSEQ+LVDC   + + G Y G+   +F++++++ GI  
Sbjct: 160 WAFSATGAIEAAHAIATGDLVSLSEQELVDCV--EESEGSYNGWQYQSFEWVLEHGGIAT 217

Query: 201 DAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDE-------ESLLKAVANQPVSVAI 253
           D  Y Y     G C + K +D    I  YE +  +DE       ++ L A+  QP+SV+I
Sbjct: 218 DDDYPYRA-KEGRCKANKIQDKVT-IDGYETLIMSDESTESETEQAFLSAILEQPISVSI 275

Query: 254 DASALQFYSGGVFNGYCETF---LNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFR 310
           DA     Y+GG+++G   T    +NH V  VGYG S +G+ YW+ KNSWG+DWGEDGY  
Sbjct: 276 DAKDFHLYTGGIYDGENCTSPYGINHFVLLVGYG-SADGVDYWIAKNSWGEDWGEDGYIW 334

Query: 311 LQRDIDQPQGQCGIAMFASFPVSKESAQPSSA 342
           +QR+     G CG+  FAS+P  +ES    SA
Sbjct: 335 IQRNTGNLLGVCGMNYFASYPTKEESETLVSA 366


>gi|332220183|ref|XP_003259237.1| PREDICTED: cathepsin S isoform 1 [Nomascus leucogenys]
          Length = 331

 Score =  214 bits (546), Expect = 3e-53,   Method: Compositional matrix adjust.
 Identities = 136/343 (39%), Positives = 192/343 (55%), Gaps = 32/343 (9%)

Query: 5   FLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVA 64
           +L+ VL++  S  +Q       + ++   +  WK  YG+ YKE  E + R  I++ NL  
Sbjct: 3   WLVCVLLVCSSAVAQ----LHKDPTLDHHWHLWKKTYGKQYKEKNEEAVRRLIWEKNLKF 58

Query: 65  VERFN-NAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQ- 122
           V   N   ++G  SY L +N   D+T +E ++  +  ++    S  + N T   YKS+  
Sbjct: 59  VMLHNLEHSMGMHSYDLGMNHLGDMTSEEVMSLMSSLRVP---SQWQRNIT---YKSNPN 112

Query: 123 --VPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCAT 173
             +P SV+W EKG VT VKYQG C       AV A+E    +K  +LVSLS Q LVDC+T
Sbjct: 113 QILPDSVDWREKGCVTEVKYQGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCST 172

Query: 174 ND-NNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDV 232
               N GC GGFM  AF+YII NKGI +DA Y Y+ M         ++  AA  + Y ++
Sbjct: 173 EKYGNKGCNGGFMTTAFQYIIDNKGIDSDASYPYKAMDQKC--QYDSKYRAATCSKYTEL 230

Query: 233 PPNDEESLLKAVANQ-PVSVAIDASALQFY---SGGVFNGYCETFLNHGVTAVGYGTSEE 288
           P + E+ L +AVAN+ PVSV +DAS   F+   SG  +   C   +NHGV  VGYG    
Sbjct: 231 PYSREDVLKEAVANKGPVSVGVDASHPSFFLYRSGVYYEPSCTQNVNHGVLVVGYG-DLN 289

Query: 289 GIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFP 331
           G +YWL+KNSWG+++GE+GY R+ R+       CGIA F S+P
Sbjct: 290 GKEYWLVKNSWGRNFGEEGYIRMARN---KGNHCGIASFPSYP 329


>gi|440799058|gb|ELR20119.1| cysteine proteinase [Acanthamoeba castellanii str. Neff]
          Length = 401

 Score =  214 bits (546), Expect = 3e-53,   Method: Compositional matrix adjust.
 Identities = 132/308 (42%), Positives = 176/308 (57%), Gaps = 21/308 (6%)

Query: 40  QYGRTYKESAENSK---RFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIAS 96
           ++ RT+++S  +     RFEI+K N   +  +N       S+T+ +N+F DLT  EF   
Sbjct: 97  EWMRTHRKSYHHDHFLPRFEIWKTNNRWITHWNKKHANASSFTVAINQFGDLTSDEFNRL 156

Query: 97  QTGFKM-SDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQCA-------VAA 148
             G  + S   +S K         ++ +P S +W +KG V+ VK QG C          +
Sbjct: 157 YNGLHVFSAPKASEKVERPRQWANTAGIPESGDWRQKGVVSRVKDQGMCGSCWAFSTTGS 216

Query: 149 VEGINAIKINRLVSLSEQQLVDCATNDNNN-GCYGGFMDDAFKYIIQNKGITNDAVYSYE 207
            EGINAI  +RLV LSEQ LVDCAT   +N GC GGFMD+AF+YII NKGI ++A Y Y 
Sbjct: 217 TEGINAITTSRLVPLSEQNLVDCATAAYDNYGCNGGFMDNAFRYIIDNKGIDSEASYPYV 276

Query: 208 GMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDAS--ALQFYSGGV 265
             + G C       +  +    + +P  DE++LL A A QP+SV IDA   + QFYS GV
Sbjct: 277 A-ADGQCRFNPKTVYGGKGGTLKSLPKGDEKALLVAAARQPISVGIDAGRPSFQFYSKGV 335

Query: 266 FN-GYCE-TFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCG 323
           +N   C  T LNHGV  VG+G  E G  YWL+KNSWGQ WG DGY ++ RD +    QCG
Sbjct: 336 YNEPECSSTELNHGVLIVGWGV-ERGQAYWLVKNSWGQTWGMDGYIKMSRDKNN---QCG 391

Query: 324 IAMFASFP 331
           IA  AS+P
Sbjct: 392 IATLASYP 399


>gi|296082368|emb|CBI21373.3| unnamed protein product [Vitis vinifera]
          Length = 245

 Score =  214 bits (546), Expect = 4e-53,   Method: Compositional matrix adjust.
 Identities = 117/228 (51%), Positives = 147/228 (64%), Gaps = 14/228 (6%)

Query: 123 VPPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATND 175
           +P SV+W E GAV PVK Q  C        VAAVEGIN I    L+SLSEQ+LVDC T +
Sbjct: 6   LPESVDWRETGAVNPVKDQRSCGSCWAFSTVAAVEGINQIVTGELISLSEQELVDCDT-E 64

Query: 176 NNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPN 235
            + GC GG MD AF +II+N G+  +  Y Y G   G C+          I  YEDVPP 
Sbjct: 65  YDMGCNGGLMDYAFDFIIKNGGLDTEKDYPYTGFD-GECNLSGKSSKVVSIDGYEDVPPF 123

Query: 236 DEESLLKAVANQPVSVAIDAS--ALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYW 293
           DE++L KAVA+QPVSVA++A   ALQ Y  G+F G C T L+HG+ AVGYGT E G  YW
Sbjct: 124 DEKALQKAVAHQPVSVAVEAGGRALQLYVSGIFTGECGTALDHGIVAVGYGT-ENGTDYW 182

Query: 294 LIKNSWGQDWGEDGYFRLQRDI-DQPQGQCGIAMFASFPVSKESAQPS 340
           +++NSWG  WGE+GY R++R++ D   G+CGIAM AS+P+ K    PS
Sbjct: 183 IVRNSWGSSWGENGYIRMERNMADAFSGKCGIAMEASYPI-KNGENPS 229


>gi|356514419|ref|XP_003525903.1| PREDICTED: LOW QUALITY PROTEIN: cysteine proteinase RD21a-like
           [Glycine max]
          Length = 343

 Score =  214 bits (545), Expect = 4e-53,   Method: Compositional matrix adjust.
 Identities = 141/343 (41%), Positives = 194/343 (56%), Gaps = 45/343 (13%)

Query: 15  SCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIG 74
           S A ++ +R+ +E  +   +E+  A++G+ Y    E  +RF+I K+NL  VE+ N    G
Sbjct: 35  SHADKSGWRSDEE--VMSIYEEXLAKHGKVYNAIDEMEERFQISKENLKFVEQHN---AG 89

Query: 75  NRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGA 134
           NR+Y + LN+FAD +           +M    SS  A   P +  S  +  SV+W ++GA
Sbjct: 90  NRTYKVGLNRFADRS-----------RMMTRPSSRYA---PRV--SDNLSESVDWRKEGA 133

Query: 135 VTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDD 187
           V  VK Q +C        +AAVEGIN I    L +LS     DC     N GC GG  D 
Sbjct: 134 VVRVKTQSECESCRTFTVIAAVEGINKIVTGNLTALS-----DC-DRTVNAGCSGGLADY 187

Query: 188 AFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQ 247
           A ++II N GI  +  Y ++G + GICD  K       +  YE VP  DE +L KAVANQ
Sbjct: 188 ALEFIINNGGIDTEEDYPFQG-AVGICDQYKIN----AVDGYERVPAYDELALKKAVANQ 242

Query: 248 PVSVA-IDA--SALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWG 304
           PVSVA I+A     Q Y  G+F G C T ++HGVTAVGYGT E GI YW++KNSWG++WG
Sbjct: 243 PVSVAYIEAYGKEFQLYESGIFTGKCGTSIDHGVTAVGYGT-ENGIDYWIVKNSWGENWG 301

Query: 305 EDGYFRLQRDI-DQPQGQCGIAMFASFPVSKESAQPSSADKSS 346
           E GY R++R+  +   G+CGIA+   +P+ K    PS+ D SS
Sbjct: 302 EAGYVRMERNTAEDTAGKCGIAILTLYPI-KSGQNPSNPDNSS 343


>gi|357216861|gb|AET71138.1| cysteine peptidase isoform b [Sphenophorus levis]
          Length = 324

 Score =  214 bits (545), Expect = 4e-53,   Method: Compositional matrix adjust.
 Identities = 135/313 (43%), Positives = 181/313 (57%), Gaps = 29/313 (9%)

Query: 34  FEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFN-NAAIGNRSYTLRLNKFADLTPQE 92
           F+ +K ++G+TYK  AE +KRF IF++NL  +E  N     G  SYT  +NKFAD+T  E
Sbjct: 26  FQSFKLKHGKTYKNQAEETKRFAIFRENLRKIEAHNAEYKQGIHSYTQGINKFADMTRAE 85

Query: 93  F---IASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQC----- 144
           F   +A+Q   K      S+ A  T  L     VP S++W  +  VTP+K Q QC     
Sbjct: 86  FKAMLATQVKTK-----PSIVATKTFQLADGVSVPESIDWRSRNVVTPIKDQAQCGSCWA 140

Query: 145 --AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDA 202
              V + EG  A+   +L   SEQQLVDC T D N GC GG++DD F YI Q  G+  ++
Sbjct: 141 FAVVGSTEGAYALSTGKLTRFSEQQLVDC-TTDLNYGCDGGYLDDTFPYI-QTNGLELES 198

Query: 203 VYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQ-PVSVAIDASALQFY 261
            Y Y G   G C S ++     ++++Y  VP N E++LL+AV    PV++AI+A  LQFY
Sbjct: 199 DYPYTGYD-GYC-SYESSKVVTKVSSYVSVPAN-EQALLEAVGTAGPVAIAINADDLQFY 255

Query: 262 SGGVFNG-YCET-FLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQ 319
             G+ +  YC+  +L+HGV AVGY  SE G  YWLIKNSWG DWGE GYFR  R     Q
Sbjct: 256 FSGIIDDKYCDPEYLDHGVLAVGY-DSENGRDYWLIKNSWGADWGESGYFRFLRG----Q 310

Query: 320 GQCGIAMFASFPV 332
             CG+   A +P+
Sbjct: 311 NICGVKEDAVYPL 323


>gi|5231178|gb|AAD41105.1|AF157961_1 cysteine proteinase [Hypera postica]
          Length = 324

 Score =  214 bits (545), Expect = 5e-53,   Method: Compositional matrix adjust.
 Identities = 135/343 (39%), Positives = 192/343 (55%), Gaps = 34/343 (9%)

Query: 1   MAKYFLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKD 60
           +A +F ++V+ IS S + +          +  KF+ +K ++G+TY   AE SKRF IF D
Sbjct: 3   VAIFFSLLVVAISASISEE----------LGAKFQAFKLEHGKTYLNQAEESKRFNIFTD 52

Query: 61  NLVAVERFNNA-AIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYK 119
           N+ A+E  N     G  SY   +NKF D++ +EF   +T   +S  S       T ++  
Sbjct: 53  NVRAIEAHNALYEQGKVSYKKGINKFTDMSQEEF---KTMLTLS-ASRKPTLETTSYVKT 108

Query: 120 SSQVPPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCA 172
             ++P SV+W ++G VT VK QG C          + EG  A K  +LVSLSEQQL+DC 
Sbjct: 109 GVEIPSSVDWRKEGRVTGVKDQGDCGSCWAFSITGSTEGAYARKSGKLVSLSEQQLIDCC 168

Query: 173 TNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDV 232
           T D + GC GG +DD FKY++++ G+ ++  Y+Y+G   G C          +++ Y  +
Sbjct: 169 T-DTSAGCDGGSLDDNFKYVMKD-GLQSEESYTYKG-EDGAC-KYNVASVVTKVSKYTSI 224

Query: 233 PPNDEESLLKAVAN-QPVSVAIDASALQFYSGGVFNGY-CE-TFLNHGVTAVGYGTSEEG 289
           P  DE++LL+AVA   PVSV +DAS L  Y  G++    C    LNH + AVGYGT E G
Sbjct: 225 PAEDEDALLEAVATVGPVSVGMDASYLSSYDSGIYEDQDCSPAGLNHAILAVGYGT-ENG 283

Query: 290 IKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
             YW+IKNSWG  WGE GYFRL R     + QCGI+    +P 
Sbjct: 284 KDYWIIKNSWGASWGEQGYFRLAR----GKNQCGISEDTVYPT 322


>gi|414887427|tpg|DAA63441.1| TPA: hypothetical protein ZEAMMB73_713985 [Zea mays]
          Length = 355

 Score =  214 bits (545), Expect = 5e-53,   Method: Compositional matrix adjust.
 Identities = 126/332 (37%), Positives = 181/332 (54%), Gaps = 34/332 (10%)

Query: 6   LIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAV 65
           L++VL+   +   +      ++  + ++F  W+A Y R+Y  +AE  +RFE+++ N+  +
Sbjct: 15  LMLVLMAGAASGGRVD---VEDMLMMDRFRAWQATYNRSYLTAAERLRRFEVYRQNMELI 71

Query: 66  ERFNNAAIGNRSYTLRLNKFADLTPQEFIASQT-------GFKMSDHSSSLKANGTPFLY 118
           E  N  A    SY L    F DLT +EF+A+ T             H   +  +  P   
Sbjct: 72  EATNRRA--ELSYQLSETPFTDLTSEEFLATHTMSTRLHASEAARRHRELITTHAGPVSD 129

Query: 119 KSSQ-----------VPPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRL 160
              Q           VP SV+W  KGAVT VK QG C        VAA+EG++ I+  +L
Sbjct: 130 GGRQWNRRNYTTDLDVPESVDWRTKGAVTTVKDQGACGGCWSFATVAAIEGLHKIRTGQL 189

Query: 161 VSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAE 220
           VSLSEQ+++DC++  NN GC+GG    A  ++  N G+T ++ Y YEG   G C   KA 
Sbjct: 190 VSLSEQEVLDCSSPPNN-GCHGGNPAAAIDWVSANGGLTTESDYPYEGRQ-GKCKLDKAR 247

Query: 221 DHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASALQ-FYSGGVFNGYCETF-LNHGV 278
           +H A+I   + V  N+E +L  AVA QPV+V ++   +Q  Y  GVF+G C+   LNH V
Sbjct: 248 NHVAKIRGRKLVDQNNEAALEVAVAQQPVAVGMNVHPIQQHYKSGVFHGPCDPEDLNHAV 307

Query: 279 TAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFR 310
           T VGYG    G KYW++KNSWG+ WGE GYFR
Sbjct: 308 TMVGYGAESGGRKYWIVKNSWGEKWGEKGYFR 339


>gi|387914010|gb|AFK10614.1| cathepsin L [Callorhinchus milii]
 gi|392873762|gb|AFM85713.1| cathepsin L [Callorhinchus milii]
 gi|392877488|gb|AFM87576.1| cathepsin L [Callorhinchus milii]
          Length = 338

 Score =  214 bits (545), Expect = 5e-53,   Method: Compositional matrix adjust.
 Identities = 129/345 (37%), Positives = 197/345 (57%), Gaps = 29/345 (8%)

Query: 5   FLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVA 64
           F+++ L ++G  A+     + D G +   +EQWK+ +G++Y++  E  +R  +++ +L  
Sbjct: 5   FVVLSLCLAGGLAAP----SLDPG-LDTHWEQWKSWHGKSYEQKEETWRRM-VWEKHLRV 58

Query: 65  VERFN-NAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKS-SQ 122
           +E  N   ++G  S+ L +N F D+  +EF     G+K     +  K  G+ FL  +  +
Sbjct: 59  IEIHNLEHSLGKHSFRLGMNHFGDMPNEEFRQLMNGYKYK--QTHKKLQGSHFLEPNFLE 116

Query: 123 VPPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATND 175
           VP  V+W ++G VTPVK QGQC          A+EG +  +  +LVSLSEQ LV+C+  +
Sbjct: 117 VPKHVDWRDEGYVTPVKDQGQCGSCWAFSTTGALEGQHFRRTGQLVSLSEQNLVECSKPE 176

Query: 176 NNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPN 235
            N GC GG MD AF+Y+  N GI ++  Y Y G     C     + +AA  T + D+P  
Sbjct: 177 GNEGCNGGLMDQAFQYVKDNGGIDSEDSYPYVGTDDTPC-HYNPQYNAANDTGFVDIPSG 235

Query: 236 DEESLLKAVAN-QPVSVAIDA--SALQFYSGGV-FNGYC-ETFLNHGVTAVGYGTSE--- 287
            E +L+KA+A   PVSVAIDA  ++ QFY  G+ F   C  T L+HGV  VGYG  +   
Sbjct: 236 KERALMKAIAAVGPVSVAIDAGHTSFQFYQSGIYFEAECSSTDLDHGVLVVGYGVEKRDT 295

Query: 288 EGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
           +G KYW++KNSW + WG++GY  + +D D     CGIA  AS+P+
Sbjct: 296 DGKKYWIVKNSWSEKWGQNGYILMAKDKDN---HCGIATAASYPL 337


>gi|387015022|gb|AFJ49630.1| Cathepsin L1-like [Crotalus adamanteus]
          Length = 338

 Score =  214 bits (544), Expect = 6e-53,   Method: Compositional matrix adjust.
 Identities = 137/347 (39%), Positives = 190/347 (54%), Gaps = 30/347 (8%)

Query: 4   YFLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLV 63
           Y  I+ L    S A+        + ++ + +  WK+ + + Y E  E  +R  I++ NL 
Sbjct: 3   YLCILALSFGASFAAPGL-----DPALNDHWLSWKSWHSKKYHEKEEGWRRM-IWEKNLK 56

Query: 64  AVERFN-NAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKS-S 121
            +E  N + ++G  SY L +N F D+T +EF     GFK S   S  K  G+ FL  +  
Sbjct: 57  MIELHNLDHSLGKHSYRLGMNHFGDMTNEEFRQVMNGFKQS--RSQRKYKGSQFLEPNFL 114

Query: 122 QVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATN 174
           Q P SV+W EKG VTPVK QGQC       A  A+EG +  K  +LVSLSEQ L+DC+  
Sbjct: 115 QAPKSVDWREKGYVTPVKDQGQCGSCWAFSATGALEGQHFRKTGKLVSLSEQNLIDCSGP 174

Query: 175 DNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPP 234
           + N GC GG MD AF+YI  N GI ++  Y Y G     C   K E ++A  T + D+P 
Sbjct: 175 EGNQGCNGGLMDQAFQYIKDNNGIDSEESYPYIGKDDEDC-LYKPEYNSANDTGFVDIPE 233

Query: 235 NDEESLLKAVAN-QPVSVAIDAS--ALQFYSGGV-FNGYCET-FLNHGVTAVGYG----T 285
             E +L+KAVA   P+SVAIDAS  + QFY  GV +   C +  L+HGV  VGYG     
Sbjct: 234 GRERALMKAVAAVGPISVAIDASHTSFQFYESGVYYEPQCNSEELDHGVLVVGYGYEGTD 293

Query: 286 SEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
            +   +YW++KNSW + WG+ GY  + +D       CGIA  AS+P+
Sbjct: 294 DDNKKRYWIVKNSWSEKWGDQGYIHMAKD---RSNNCGIASAASYPM 337


>gi|224079085|ref|XP_002305743.1| predicted protein [Populus trichocarpa]
 gi|222848707|gb|EEE86254.1| predicted protein [Populus trichocarpa]
          Length = 494

 Score =  214 bits (544), Expect = 6e-53,   Method: Compositional matrix adjust.
 Identities = 135/323 (41%), Positives = 185/323 (57%), Gaps = 20/323 (6%)

Query: 29  SIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADL 88
           SI E F+QW+ ++ + YK + E  KRF  FK NL  +          R + + LNKFADL
Sbjct: 38  SIIEIFQQWRDRHQKAYKHAEEAEKRFGNFKRNLKYIIEKTGKETTLR-HRVGLNKFADL 96

Query: 89  TPQEFI-ASQTGFKMSDHSSSLKA-NGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQCA- 145
           + +EF     +  K   + + + A + +    +S   P S++W +KG VT VK QG C  
Sbjct: 97  SNEEFKQLYLSKVKKPINKTRIDAEDRSRRNLQSCDAPSSLDWRKKGVVTAVKDQGDCGS 156

Query: 146 ------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGIT 199
                   A+EGINAI  + L+SLSEQ+LVDC T   N GC GG+MD AF+++I N GI 
Sbjct: 157 CWSFSTTGAIEGINAIVTSDLISLSEQELVDCDTT--NYGCEGGYMDYAFEWVINNGGID 214

Query: 200 NDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASAL- 258
            +A Y Y G+  G C++ K E     I  Y+DV   D  +LL A A QP+SV ID SA+ 
Sbjct: 215 TEANYPYTGVD-GTCNTAKEEIKVVSIDGYKDVDETDS-ALLCAAAQQPISVGIDGSAID 272

Query: 259 -QFYSGGVFNGYCETFLN---HGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRD 314
            Q Y+GG+++G C    +   H V  VGYG SE G  YW++KNSWG  WG +GYF ++R+
Sbjct: 273 FQLYTGGIYDGDCSDDPDDIDHAVLIVGYG-SENGEDYWIVKNSWGTSWGIEGYFYIKRN 331

Query: 315 IDQPQGQCGIAMFASFPVSKESA 337
            D P G C I   AS+P  + SA
Sbjct: 332 TDLPYGVCAINAMASYPTKEASA 354


>gi|116563690|gb|ABJ99858.1| cathepsin L [Hippoglossus hippoglossus]
          Length = 336

 Score =  214 bits (544), Expect = 6e-53,   Method: Compositional matrix adjust.
 Identities = 129/343 (37%), Positives = 190/343 (55%), Gaps = 25/343 (7%)

Query: 7   IVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVE 66
           ++ L++  +C S        +  + E ++ WK+ + + Y E  E  +R  +++ NL  +E
Sbjct: 1   MLPLLVLTACLSSVLSAPVLDAQLNEHWDLWKSWHSKKYHEKEEGWRRM-VWEKNLQKIE 59

Query: 67  RFN-NAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPP 125
             N   ++G  S+ L +N F D+T +EF     G+K+    +  K  G+ F+  +    P
Sbjct: 60  LHNLEHSMGTHSFRLGMNHFGDMTHEEFRQIMNGYKLK---TQRKFTGSLFMEPNFMTAP 116

Query: 126 S-VNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNN 177
           S V+W EKG VTPVK QGQC          A+EG    K  +LVSLSEQ LVDC+  + N
Sbjct: 117 SAVDWREKGYVTPVKDQGQCGSCWAFSTTGALEGQQFRKTGKLVSLSEQNLVDCSRPEGN 176

Query: 178 NGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDE 237
            GC GG MD AF+Y+  N+G+ ++  Y Y G     C      + +A  T + DVP   E
Sbjct: 177 EGCGGGLMDQAFQYVTDNQGLDSEDSYPYTGTDDQPCHYDPLYN-SANDTGFVDVPSGKE 235

Query: 238 ESLLKAVAN-QPVSVAIDA--SALQFYSGGV-FNGYCET-FLNHGVTAVGYGTSEE---G 289
            +L+KAVA+  PVSVAIDA   + QFY  G+ +   C +  L+HGV AVGYG   E   G
Sbjct: 236 HALMKAVASVGPVSVAIDAGHESFQFYQSGIYYEKECSSEELDHGVLAVGYGFEGEDKMG 295

Query: 290 IKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
            K+W++KNSWG+ WG+ GY  + +D    +  CGIA  AS+P+
Sbjct: 296 KKFWIVKNSWGEKWGDKGYIYMAKD---RKNHCGIATAASYPL 335


>gi|293342577|ref|XP_001065834.2| PREDICTED: cathepsin L1 [Rattus norvegicus]
 gi|293354413|ref|XP_573976.3| PREDICTED: cathepsin L1 [Rattus norvegicus]
 gi|149039745|gb|EDL93861.1| rCG24317, isoform CRA_a [Rattus norvegicus]
          Length = 330

 Score =  214 bits (544), Expect = 6e-53,   Method: Compositional matrix adjust.
 Identities = 131/345 (37%), Positives = 188/345 (54%), Gaps = 29/345 (8%)

Query: 1   MAKYFLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKD 60
           M   FL+  L +    A+     +FD       +E+WK ++G+TY  + E  KR  ++++
Sbjct: 1   MTPIFLLATLCLGMISAAPTHDPSFDT-----VWEEWKTKHGKTYNTNEEGQKR-AVWEN 54

Query: 61  NLVAVERFNNAAI-GNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYK 119
           N+  +   N   + G   ++L +N F DLT  EF    TGF+    +  +K    PFL  
Sbjct: 55  NMKMINLHNEDYLKGKHGFSLEMNAFGDLTNTEFRELMTGFQ-GQKTKMMKVFPEPFL-- 111

Query: 120 SSQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCA 172
              VP +V+W + G VTPVK QG C       AV ++EG    K  +LV LSEQ LVDC+
Sbjct: 112 -GDVPKTVDWRKHGYVTPVKNQGPCGSCWAFSAVGSLEGQVFRKTGKLVPLSEQNLVDCS 170

Query: 173 TNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDV 232
            +  N GC GG  D AF+Y+  N G+     Y YE ++ G C     +  AA++  +  +
Sbjct: 171 WSHGNKGCDGGLPDFAFQYVKDNGGLDTSVSYPYEALN-GTC-RYNPKYSAAKVVGFMSI 228

Query: 233 PPNDEESLLKAVAN-QPVSVAID--ASALQFYSGGVF--NGYCETFLNHGVTAVGYGTSE 287
           PP+ E +L+KAVA   P+SV ID    + QFY GG++       T LNH V  VGYG   
Sbjct: 229 PPS-ENALMKAVATVGPISVGIDIKHKSFQFYKGGMYYEPDCSSTNLNHAVLVVGYGEES 287

Query: 288 EGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
           +G KYWL+KNSWG+DWG DGY ++ +D +     CGIA  AS+P+
Sbjct: 288 DGRKYWLVKNSWGRDWGMDGYIKMAKDWNN---NCGIASDASYPI 329


>gi|449513868|ref|XP_002191976.2| PREDICTED: cathepsin L1-like [Taeniopygia guttata]
          Length = 443

 Score =  214 bits (544), Expect = 6e-53,   Method: Compositional matrix adjust.
 Identities = 132/316 (41%), Positives = 181/316 (57%), Gaps = 24/316 (7%)

Query: 34  FEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFN-NAAIGNRSYTLRLNKFADLTPQE 92
           ++ WK+ + + Y E  E  +R  +++ NL  +E  N + A+G  SY L +N+F D+T +E
Sbjct: 134 WQLWKSWHRKDYHEREEGWRRV-VWEKNLKMIEIHNLDHALGKHSYKLGMNQFGDMTTEE 192

Query: 93  FIASQTGFKMSDHSSSLKANGTPFLYKS-SQVPPSVNWIEKGAVTPVKYQGQCA------ 145
           F     G+      S  K  G+ FL  +  + P SV+W EKG VTPVK QGQC       
Sbjct: 193 FRQLMNGYVHK--KSERKYRGSQFLEPNFLEAPRSVDWREKGYVTPVKDQGQCGSCWAFS 250

Query: 146 -VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVY 204
              A+EG +  K  +LVSLSEQ LVDC+  + N GC GG MD AF+Y+  N GI ++  Y
Sbjct: 251 TTGALEGQHFRKTGKLVSLSEQNLVDCSRPEGNQGCNGGLMDQAFQYVQDNGGIDSEESY 310

Query: 205 SYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVAN-QPVSVAIDA--SALQFY 261
            Y       C   KAE +AA  T + D+P   E +L+KAVA   PVSVAIDA  S+ QFY
Sbjct: 311 PYTAKDDEDC-RYKAEYNAANDTGFVDIPQGHERALMKAVAAVGPVSVAIDAGHSSFQFY 369

Query: 262 SGGV-FNGYCETF-LNHGVTAVGYGTSEE---GIKYWLIKNSWGQDWGEDGYFRLQRDID 316
             G+ +   C +  L+HGV  VGYG   E   G KYW++KNSWG+ WG+ GY  + +D  
Sbjct: 370 QSGIYYEPDCSSEDLDHGVLVVGYGFEGEDVDGKKYWIVKNSWGEKWGDKGYIYMAKD-- 427

Query: 317 QPQGQCGIAMFASFPV 332
             +  CGIA  AS+P+
Sbjct: 428 -RKNHCGIATAASYPL 442


>gi|413953046|gb|AFW85695.1| thiol protease SEN102 [Zea mays]
          Length = 382

 Score =  214 bits (544), Expect = 7e-53,   Method: Compositional matrix adjust.
 Identities = 135/324 (41%), Positives = 189/324 (58%), Gaps = 23/324 (7%)

Query: 30  IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLT 89
           + E+F+ W+A+Y RTY    E  +RF I+ +N+  ++  N  + G+ SY L  N+F DLT
Sbjct: 60  LLERFKAWQAEYNRTYATPEEFQQRFMIYSENVRFIKTMNQLSTGS-SYELGENQFTDLT 118

Query: 90  PQEFIAS---------QTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKY 140
            +EF  +              M     ++   G      + + P SV+W  KGAVT VK 
Sbjct: 119 EEEFKDTYLMKLDEQPPAAEAMPPTVGTMSTAGMSNGNNTGEAPNSVDWRTKGAVTRVKD 178

Query: 141 QGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYII 193
           Q QC        VA++EG++ IK  RLVSLSEQ++VDC    N+NGC GG    A +++ 
Sbjct: 179 QQQCGSCWAFATVASIEGVHQIKTGRLVSLSEQEIVDCDRGGNDNGCRGGSPRSAMEWVT 238

Query: 194 QNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAI 253
           +N G+T ++ Y Y G S   C S K   HAA+I  Y+ V  N+E  L +AVA QPV+V +
Sbjct: 239 RNGGLTTESDYPYVG-SQRQCMSGKLGHHAARIRGYQAVQRNNEAELERAVAGQPVAVFV 297

Query: 254 DAS-ALQFYSGGVFNGYCE-TFLNHGVTAVGYGTS---EEGIKYWLIKNSWGQDWGEDGY 308
           DAS A QFY  GVF+G C+ T +NH VT VGYG++     G KYW++KNSWGQ WGE+GY
Sbjct: 298 DASRAFQFYKSGVFSGPCDTTTVNHVVTVVGYGSTGSDSGGRKYWIVKNSWGQGWGENGY 357

Query: 309 FRLQRDIDQPQGQCGIAMFASFPV 332
            R+ R +   +G C IA+   +PV
Sbjct: 358 VRMARRVRAREGMCAIAIEPYYPV 381


>gi|21953244|emb|CAD42716.1| putative cathepsin L [Myzus persicae]
          Length = 341

 Score =  214 bits (544), Expect = 7e-53,   Method: Compositional matrix adjust.
 Identities = 132/344 (38%), Positives = 190/344 (55%), Gaps = 23/344 (6%)

Query: 6   LIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAV 65
           +++VL +     S  +    +E  I E++  +K Q+ + Y++  E + R +++ DN + +
Sbjct: 3   VVIVLGLVAFAISSVSSINLNE-VIEEEWSLFKMQFKKLYEDIKEETFRKKVYLDNKLKI 61

Query: 66  ERFNNA-AIGNRSYTLRLNKFADLTPQEFIASQTGFKMS---DHSSSLKANGTPFLYKSS 121
            R N     G  +Y L +N F DL   E+     GFK S     S+     G  FL   +
Sbjct: 62  ARHNKLYESGEETYALEMNHFGDLMQHEYSKMMNGFKPSLAGGDSNFTNDEGVTFLKSEN 121

Query: 122 QV-PPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCAT 173
            V P S++W +KG VTPVK QGQC       A  ++EG +  K   LVSLSEQ L+DC+ 
Sbjct: 122 VVIPKSIDWRKKGYVTPVKNQGQCGSCWSFSATGSLEGQHFRKTGVLVSLSEQNLIDCSR 181

Query: 174 NDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVP 233
              NNGC GG MD AFKYI  NKG+  +  Y YE            ++  A    + D+P
Sbjct: 182 KYGNNGCEGGLMDLAFKYIKSNKGLDTEKSYPYEAEDDKC--RYNPDNSGATDNGFVDIP 239

Query: 234 PNDEESLLKAVAN-QPVSVAIDASA--LQFYSGGVF-NGYC-ETFLNHGVTAVGYGTSEE 288
             DEE+L+ A+A   PVS+AIDAS+   QFY  GVF N  C  T L+HGV AVG+ T ++
Sbjct: 240 EGDEEALMHALATVGPVSIAIDASSEKFQFYKKGVFYNPRCSSTELDHGVLAVGFRTDKK 299

Query: 289 GIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
           G  YW++KNSWG+ WG++GY  + R+    +  CG+A  AS+P+
Sbjct: 300 GGDYWIVKNSWGKTWGDEGYIMMARN---KKNNCGVASSASYPL 340


>gi|344271892|ref|XP_003407771.1| PREDICTED: cathepsin L1-like [Loxodonta africana]
          Length = 334

 Score =  214 bits (544), Expect = 7e-53,   Method: Compositional matrix adjust.
 Identities = 133/343 (38%), Positives = 191/343 (55%), Gaps = 34/343 (9%)

Query: 10  LIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFN 69
           L ++  C   A+     + S+ E++ QWK+ Y + Y  + E+ +R  +++ N+  +ER N
Sbjct: 5   LFLTALCLGIASAAQKHDESLDEQWYQWKSLYKKPYAANEEDWRR-AVWEKNMKMIERHN 63

Query: 70  NA-AIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKS--SQVPPS 126
              + G   +T+ +N F D+T +EF     GF+      + K      LY+     +P S
Sbjct: 64  QEYSQGKHGFTMTMNAFGDMTNEEFRQVMNGFQ------NQKRIQGKLLYEPVFGHIPKS 117

Query: 127 VNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNG 179
           V+W +KG VTPVK QGQC       A  A+EG    K  +LVSLSEQ LVDC+  + N G
Sbjct: 118 VDWTQKGYVTPVKDQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRREGNEG 177

Query: 180 CYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEES 239
           C GG MD+AF+YI  N G+ ++  Y Y  M    C     +  AA  T + D+PP  E++
Sbjct: 178 CNGGLMDNAFQYIKDNGGLDSEESYPYTAMDKQDC-RYNPKYSAANDTGFVDIPPQ-EKA 235

Query: 240 LLKAVAN-QPVSVAIDA--SALQFYSGGV-FNGYCETF-LNHGVTAVGYGTSEEGI---- 290
           L+KAVA   P+SVA+DA   + QFY  G+ ++  C +  LNHGV  VGYG   EGI    
Sbjct: 236 LMKAVATVGPISVAVDAGHESFQFYKSGIYYDSNCSSKDLNHGVLVVGYGF--EGIDSAN 293

Query: 291 -KYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
            +YWL+KNSWG  WG DGY ++ +D +     CGIA  AS+P 
Sbjct: 294 NRYWLVKNSWGTGWGTDGYIKMAKDRNN---HCGIATAASYPT 333


>gi|293334761|ref|NP_001168296.1| uncharacterized protein LOC100382061 [Zea mays]
 gi|223947281|gb|ACN27724.1| unknown [Zea mays]
          Length = 322

 Score =  213 bits (543), Expect = 7e-53,   Method: Compositional matrix adjust.
 Identities = 123/308 (39%), Positives = 171/308 (55%), Gaps = 31/308 (10%)

Query: 30  IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLT 89
           + ++F  W+A Y R+Y  +AE  +RFE+++ N+  +E  N  A    SY L    F DLT
Sbjct: 3   MMDRFRAWQATYNRSYLTAAERLRRFEVYRQNMELIEATNRRA--ELSYQLSETPFTDLT 60

Query: 90  PQEFIASQT-------GFKMSDHSSSLKANGTPFLYKSSQ-----------VPPSVNWIE 131
            +EF+A+ T             H   +  +  P      Q           VP SV+W  
Sbjct: 61  SEEFLATHTMSTRLHASEAARRHRELITTHAGPVSDGGRQWNRRNYTTDLDVPESVDWRT 120

Query: 132 KGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGF 184
           KGAVT VK QG C        VAA+EG++ I+  +LVSLSEQ+++DC++  NN GC+GG 
Sbjct: 121 KGAVTTVKDQGACGGCWSFATVAAIEGLHKIRTGQLVSLSEQEVLDCSSPPNN-GCHGGN 179

Query: 185 MDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAV 244
              A  ++  N G+T ++ Y YEG   G C   KA +H A+I   + V  N+E +L  AV
Sbjct: 180 PAAAIDWVSANGGLTTESDYPYEGRQ-GKCKLDKARNHVAKIRGRKLVDQNNEAALEVAV 238

Query: 245 ANQPVSVAIDASALQ-FYSGGVFNGYCETF-LNHGVTAVGYGTSEEGIKYWLIKNSWGQD 302
           A QPV+V ++   +Q  Y  GVF+G C+   LNH VT VGYG    G KYW++KNSWG+ 
Sbjct: 239 AQQPVAVGMNVHPIQQHYKSGVFHGPCDPEDLNHAVTMVGYGAESGGRKYWIVKNSWGEK 298

Query: 303 WGEDGYFR 310
           WGE GYFR
Sbjct: 299 WGEKGYFR 306


>gi|311265493|ref|XP_003130681.1| PREDICTED: cathepsin L1-like [Sus scrofa]
          Length = 332

 Score =  213 bits (543), Expect = 7e-53,   Method: Compositional matrix adjust.
 Identities = 136/339 (40%), Positives = 189/339 (55%), Gaps = 28/339 (8%)

Query: 10  LIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFN 69
           L+++  C   A+     + S+   + +WKA + + Y  + E  +R  I++ N+  +ER N
Sbjct: 5   LLLAAFCLGIASAAPRHDHSLDADWYKWKATHRKLYGLNEEGRRR-AIWEKNMKMIERHN 63

Query: 70  -NAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPP-SV 127
                G  S+T+ +N F D+T +EF  +  GF+   H       G  FL   S + P SV
Sbjct: 64  WEHRQGKHSFTMAMNAFGDMTNEEFRKTMNGFQNQKHKK-----GKVFLDAGSALTPHSV 118

Query: 128 NWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGC 180
           +W EKG VT VK QG C       A  A+EG    K ++L+SLSEQ LVDC+  + N GC
Sbjct: 119 DWREKGYVTAVKNQGHCGSCWAFSATGALEGQMFRKTSKLISLSEQNLVDCSWPEGNEGC 178

Query: 181 YGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESL 240
            GG MD+AF+YI  N G+ ++  Y Y G   G C   K +  AA  T Y D+ P  E++L
Sbjct: 179 NGGLMDNAFQYIKDNGGLDSEESYPYFG-KDGSC-KYKPQSSAANDTGYVDI-PKQEKAL 235

Query: 241 LKAVAN-QPVSVAIDAS--ALQFYSGGV-FNGYCETF-LNHGVTAVGYGT--SEEGIKYW 293
           +KAVA   P+SV IDAS  + QFYS G+ F   C +  L+HGV  VGYG   +    KYW
Sbjct: 236 MKAVATVGPISVGIDASHESFQFYSTGIYFEPQCSSEDLDHGVLVVGYGVEGAHSNNKYW 295

Query: 294 LIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
           L+KNSWG  WG DGY ++ +D +     CGIA  AS+PV
Sbjct: 296 LVKNSWGNTWGMDGYIKMTKDQNN---HCGIATMASYPV 331


>gi|209693435|ref|NP_001129410.1| cathepsin L precursor [Acyrthosiphon pisum]
 gi|251823771|ref|NP_001156569.1| cathepsin L precursor [Acyrthosiphon pisum]
          Length = 341

 Score =  213 bits (543), Expect = 8e-53,   Method: Compositional matrix adjust.
 Identities = 130/344 (37%), Positives = 191/344 (55%), Gaps = 23/344 (6%)

Query: 6   LIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAV 65
           +++VL +     S  +    +E  I E++  +K Q+ + Y++  E + R +++ DN + +
Sbjct: 3   VVIVLGLVAFAISTVSSINLNE-VIEEEWSLFKIQFKKLYEDIKEETFRKKVYLDNKLKI 61

Query: 66  ERFNNA-AIGNRSYTLRLNKFADLTPQEFIASQTGFKMS----DHSSSLKANGTPFLYKS 120
            R N     G  +Y L +N F DL   E+     GFK S    D + +     T    ++
Sbjct: 62  ARHNKLYESGEETYALEMNHFGDLMQHEYTKMMNGFKPSLAGGDRNFTNDEAVTFLKSEN 121

Query: 121 SQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCAT 173
             +P SV+W +KG VTPVK QGQC       A  ++EG +  K   LVSLSEQ L+DC+ 
Sbjct: 122 VVIPKSVDWRKKGYVTPVKNQGQCGSCWSFSATGSLEGQHFRKTGVLVSLSEQNLIDCSR 181

Query: 174 NDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVP 233
              NNGC GG MD AFKYI  NKG+  +  Y YE            E+  A    + D+P
Sbjct: 182 KYGNNGCEGGLMDLAFKYIKSNKGLDTEKSYPYEAEDDKC--RYNPENSGATDKGFVDIP 239

Query: 234 PNDEESLLKAVAN-QPVSVAIDASA--LQFYSGGVF-NGYC-ETFLNHGVTAVGYGTSEE 288
             DE++L+ A+A   PVS+AIDAS+   QFY  GVF N  C  T L+HGV AVG+G+ ++
Sbjct: 240 EGDEDALMHALATVGPVSIAIDASSEKFQFYKKGVFYNPRCSSTELDHGVLAVGFGSDKK 299

Query: 289 GIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
           G  YW++KNSWG+ WG++GY  + R+    +  CG+A  AS+P+
Sbjct: 300 GGDYWIVKNSWGKTWGDEGYIMMARN---KKNNCGVASSASYPL 340


>gi|432114311|gb|ELK36239.1| Cathepsin S [Myotis davidii]
          Length = 340

 Score =  213 bits (543), Expect = 9e-53,   Method: Compositional matrix adjust.
 Identities = 137/342 (40%), Positives = 192/342 (56%), Gaps = 27/342 (7%)

Query: 3   KYFLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNL 62
           K+ L+V+L  S + A      T D       ++ WK  YG+ Y E  E   R  I++ NL
Sbjct: 11  KWLLLVLLGCSSAMAQLHKDPTLDH-----HWDLWKKTYGKQYTEENEEVTRRFIWEKNL 65

Query: 63  VAVERFN-NAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSS 121
             V   N   ++G  SY L +N  AD+T +E +   +  ++    S  + N T     + 
Sbjct: 66  KYVMLHNLEHSMGMHSYDLGMNHLADMTSEEVMLLMSSLRVP---SQWQRNVTFKSNPNQ 122

Query: 122 QVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATN 174
           ++P S++W +KG VT VKYQG C       AV A+E    +K  +LVSLS Q LVDC+T 
Sbjct: 123 KLPDSMDWRDKGCVTEVKYQGSCGSCWAFSAVGALEAQLKLKTGKLVSLSVQNLVDCSTG 182

Query: 175 D-NNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVP 233
             +N GC GGFM +AF+YII N GI ++A Y Y+ M  G C     ++ AA  + Y ++P
Sbjct: 183 KYSNKGCNGGFMTEAFQYIIDNNGIDSEASYPYKAMD-GKCQ-YDVKNRAATCSKYVELP 240

Query: 234 PNDEESLLKAVANQ-PVSVAIDASALQFY---SGGVFNGYCETFLNHGVTAVGYGTSEEG 289
             +EE+L +AVAN+ PVSVAIDAS   F+   SG  ++  C   +NHGV AVGYG +  G
Sbjct: 241 FGNEEALKEAVANKGPVSVAIDASHPSFFLYRSGVYYDKACTLNVNHGVLAVGYG-NYNG 299

Query: 290 IKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFP 331
             YWL+KNSWG  +GE GY R+ R+       CGIA + S+P
Sbjct: 300 KDYWLVKNSWGLHFGEQGYIRMARN---SGNHCGIASYPSYP 338


>gi|126681066|gb|ABO26562.1| cathepsin L-like cysteine protease [Ixodes ricinus]
          Length = 335

 Score =  213 bits (543), Expect = 9e-53,   Method: Compositional matrix adjust.
 Identities = 129/318 (40%), Positives = 181/318 (56%), Gaps = 21/318 (6%)

Query: 30  IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNA-AIGNRSYTLRLNKFADL 88
           +  ++  +KA++G++Y    E   R +I+ +N   + + N   A G   Y++ +N+F D+
Sbjct: 23  LGAEWSAFKAKHGKSYVSETEEVFRLKIYMENRHKIAKHNEKYARGEVPYSMAMNEFGDM 82

Query: 89  TPQEFIASQTGFKMS--DHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQC-- 144
              EF++++ GFK +  D          P   +   +P +V+W  KGAVTPVK QGQC  
Sbjct: 83  LHHEFVSTRNGFKRNYKDQPREGSTYLEPENIEDFSLPKTVDWRTKGAVTPVKNQGQCGS 142

Query: 145 -----AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGIT 199
                A  ++EG +  K   +VSLSEQ LV C+T+  NNGC GG MDDAFKYI  NKGI 
Sbjct: 143 CWAFSATGSLEGQHFRKSGSMVSLSEQNLVGCSTDFGNNGCEGGLMDDAFKYIRANKGID 202

Query: 200 NDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVAN-QPVSVAIDAS-- 256
            +  Y Y G + G C   K     A  + + D+    E  L KAVA   P+SVAIDAS  
Sbjct: 203 TEKSYPYNG-TDGTC-HFKKSTVGATDSGFVDIKEGSETQLKKAVATVGPISVAIDASHE 260

Query: 257 ALQFYSGGVFN-GYCET-FLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRD 314
           + QFYS GV++   C++  L+HGV  VGYGT   G  YW +KNSWG  WG++GY R+ R+
Sbjct: 261 SFQFYSDGVYDEPECDSESLDHGVLVVGYGTL-NGTDYWFVKNSWGTTWGDEGYIRMSRN 319

Query: 315 IDQPQGQCGIAMFASFPV 332
               + QCGIA  AS P+
Sbjct: 320 ---KKNQCGIASSASIPL 334


>gi|375340657|emb|CBJ56264.1| cathepsin S protein [Dicentrarchus labrax]
          Length = 337

 Score =  213 bits (542), Expect = 1e-52,   Method: Compositional matrix adjust.
 Identities = 125/319 (39%), Positives = 184/319 (57%), Gaps = 22/319 (6%)

Query: 27  EGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFN-NAAIGNRSYTLRLNKF 85
           E  +   ++ WK  +G+ Y+   E+  R E+++ NL+ +   N  A++G  +Y L +N  
Sbjct: 27  EPKLDAHWKLWKMTHGKKYQTEVEDVSRRELWEKNLMLITMHNLEASMGLHTYELSMNHM 86

Query: 86  ADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKS-SQVPPSVNWIEKGAVTPVKYQGQC 144
            DLT +E + S   F      + ++   +PF   + + VP +++W EKG VT VK QG C
Sbjct: 87  GDLTQEEIMQS---FATLSPPTDIQRAASPFAGTTGADVPDTMDWREKGCVTSVKMQGSC 143

Query: 145 -------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKG 197
                  A  A+EG  A    +LV LS Q LVDC+T   N+GC GGFM  AF+Y+I N+G
Sbjct: 144 GSCWAFSAAGALEGQLAKTTGKLVDLSPQNLVDCSTKYGNHGCNGGFMHQAFQYVIDNQG 203

Query: 198 ITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVAN-QPVSVAIDAS 256
           I +DA Y Y G   G C    ++  AA  + Y  +P  +E +L +A+AN  P+SVAIDA+
Sbjct: 204 IDSDASYPYTG-RNGEC-RYNSKFRAANCSQYSFLPEGNEGALKEALANIGPISVAIDAT 261

Query: 257 --ALQFYSGGVFNG-YCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQR 313
                FY  GV+N   C   +NHGV AVGYGT  +G  YWL+KNSWG+ +G+ GY R+ R
Sbjct: 262 RPTFTFYRSGVYNDPNCSQKVNHGVLAVGYGTL-DGQDYWLVKNSWGKTFGDQGYIRMSR 320

Query: 314 DIDQPQGQCGIAMFASFPV 332
           + +    QCGIA++  +P+
Sbjct: 321 NKND---QCGIALYGCYPI 336


>gi|33333704|gb|AAQ11970.1| putative gut cathepsin L-like cysteine protease [Callosobruchus
           maculatus]
          Length = 326

 Score =  213 bits (542), Expect = 1e-52,   Method: Compositional matrix adjust.
 Identities = 129/330 (39%), Positives = 186/330 (56%), Gaps = 49/330 (14%)

Query: 29  SIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNA-AIGNRSYTLRLNKFAD 87
           S+ E+++Q+K  +G+TY+   E  +RF +F+ NLV ++  N     G  S+  ++ +FAD
Sbjct: 18  SVYEEWQQFKLDHGKTYRSLVEEKRRFSVFQKNLVDIQEHNKKYERGEESFAKKVTQFAD 77

Query: 88  LTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSS-----------QVPPSVNWIEKGAVT 136
           +T +EF+              LK  G P L  ++           +   +++W E+GAVT
Sbjct: 78  MTHEEFL------------DLLKLQGVPALPSNAVHFDNFEDIDMEEKDAIDWREEGAVT 125

Query: 137 PVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATND-NNNGCYGGFMDDA 188
           PVK Q  C       AV A+EG    K   LVSLS Q+LVDCAT D  NNGC GG M  A
Sbjct: 126 PVKDQANCGSCWAFSAVGAIEGQFFKKNGTLVSLSAQELVDCATEDYGNNGCKGGLMGQA 185

Query: 189 FKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQ- 247
           F ++ Q++GI  +  Y YEG  +  C   K+ ++  ++  Y  V P DE+ + + VA + 
Sbjct: 186 FDFV-QDEGIQTEESYPYEGRRSS-CK--KSGEYVTKVKTY--VFPLDEQEMARTVAAKG 239

Query: 248 PVSVAIDASALQFYSGGVFNGYCETF-----LNHGVTAVGYGTSEEGIKYWLIKNSWGQD 302
           PV+VAI+AS L FY  G+ +  C        LNHGV  VGYG SE G+ YW++KNSWG D
Sbjct: 240 PVAVAIEASQLSFYDKGIVDERCRCSNKREDLNHGVLVVGYG-SENGVDYWIVKNSWGAD 298

Query: 303 WGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
           WGE GYFRL++D+      CGI  + ++PV
Sbjct: 299 WGEKGYFRLKKDVK----ACGIGTYNTYPV 324


>gi|74927078|sp|Q86GF7.1|CRUST_PANBO RecName: Full=Crustapain; AltName: Full=NsCys; Flags: Precursor
 gi|28971811|dbj|BAC65417.1| crustapain [Pandalus borealis]
          Length = 323

 Score =  213 bits (542), Expect = 1e-52,   Method: Compositional matrix adjust.
 Identities = 125/313 (39%), Positives = 180/313 (57%), Gaps = 22/313 (7%)

Query: 33  KFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNA-AIGNRSYTLRLNKFADLTPQ 91
           ++E +K ++G+ Y  S E S R  +F D L  ++  N     G  +Y L++N F+DLT +
Sbjct: 19  EWENFKTKFGKKYANSEEESHRMSVFMDKLKFIQEHNERYDKGEVTYWLKINNFSDLTHE 78

Query: 92  EFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQC------- 144
           E +A++TG     H  S+     P    ++ +   V+W  KGAVTPVK QGQC       
Sbjct: 79  EVLATKTGMTRRRHPLSVLPKSAP----TTPMAADVDWRNKGAVTPVKDQGQCGSCWAFS 134

Query: 145 AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVY 204
           AVAA+EG + +K   LVSLSEQ LVDC+++  N GC GG+   A++YII N+GI  ++ Y
Sbjct: 135 AVAALEGAHFLKTGDLVSLSEQNLVDCSSSYGNQGCNGGWPYQAYQYIIANRGIDTESSY 194

Query: 205 SYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQ-PVSVAIDASALQF--Y 261
            Y+ +         A +  A +++Y +    DE +L  AV N+ PVSV IDA    F  Y
Sbjct: 195 PYKAIDDNC--RYDAGNIGATVSSYVEPASGDESALQHAVQNEGPVSVCIDAGQSSFGSY 252

Query: 262 SGGV-FNGYCET-FLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQ 319
            GGV +   C++ + NH VTAVGYGT   G  YW++KNSWG  WGE GY ++ R+ D   
Sbjct: 253 GGGVYYEPNCDSWYANHAVTAVGYGTDANGGDYWIVKNSWGAWWGESGYIKMARNRDN-- 310

Query: 320 GQCGIAMFASFPV 332
             C IA ++ +PV
Sbjct: 311 -NCAIATYSVYPV 322


>gi|428186189|gb|EKX55040.1| hypothetical protein GUITHDRAFT_63227 [Guillardia theta CCMP2712]
          Length = 344

 Score =  213 bits (542), Expect = 1e-52,   Method: Compositional matrix adjust.
 Identities = 136/305 (44%), Positives = 178/305 (58%), Gaps = 26/305 (8%)

Query: 48  SAENSKRFEIFKDNLVAVERFNNA-AIGNRSYTLRLNKFADLTPQEFIASQTGFKMSD-H 105
           S E+++ FE+F+ NL  + + N     G +SY + LN FA LT +EF A   G+  ++  
Sbjct: 45  SPESTRAFEVFQKNLDMIMKHNEEYNQGLQSYEMGLNGFAHLTFEEFSAQYLGYGGAEVE 104

Query: 106 SSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKIN 158
               +  G       S++P SV+W EKGAV  VK QG C       AVAA+EG + +   
Sbjct: 105 QPKTRRAGKHERKSRSEIPASVDWREKGAVAEVKNQGACGSCWAFSAVAALEGAHFLNSG 164

Query: 159 RLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAV--YSYEGMSTGICDS 216
            L+SLSEQQLVDC+    N+GC GG+MD+AF+Y + N G  +D+   Y Y+GM  G C  
Sbjct: 165 ELISLSEQQLVDCSKKFGNHGCAGGYMDNAFEYWMNNTGHGDDSEKDYPYKGMD-GKC-K 222

Query: 217 IKAEDHAAQITNYEDVPPNDEESLLKAVAN-QPVSVAIDA-SALQFYSGGVFNGY---CE 271
             A+   A I+ Y DV   +E  LL AVAN  PVSVAI A +ALQFY  GVFNG    C 
Sbjct: 223 FSADGVRATISGYNDVKQGNETDLLDAVANVGPVSVAIHAGAALQFYLRGVFNGVAGTCF 282

Query: 272 TFLNHGVTAVGYGTSE----EGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMF 327
             LNHGVTAVGYGT+       + YW+IKNSWG  WGE G+ R  R     +  CG+A  
Sbjct: 283 GPLNHGVTAVGYGTASLRFGRKMDYWIIKNSWGMGWGEKGFVRFARG----KNLCGVANG 338

Query: 328 ASFPV 332
           AS+P+
Sbjct: 339 ASYPL 343


>gi|417409876|gb|JAA51427.1| Putative cathepsin s, partial [Desmodus rotundus]
          Length = 342

 Score =  213 bits (541), Expect = 1e-52,   Method: Compositional matrix adjust.
 Identities = 138/347 (39%), Positives = 195/347 (56%), Gaps = 33/347 (9%)

Query: 1   MAKYFLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKD 60
           + K+ ++V+L  S + A      T D       ++ WK  YG+ YKE  E   R  I++ 
Sbjct: 11  IMKWLVLVLLGCSSAMAQLHKDPTLDR-----HWDLWKKTYGKQYKEKNEEGVRRLIWEK 65

Query: 61  NLVAVERFN-NAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYK 119
           NL  V   N   ++G  SY L +N   D+T +E  A  +  ++    S  + N T   YK
Sbjct: 66  NLKFVMLHNLEHSMGMHSYDLGMNHLGDMTSEEVTALMSSLRVP---SQWQRNVT---YK 119

Query: 120 SS---QVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLV 169
           S+   ++P SV+W +KG VT VKYQG C       AV A+E    +K  +LVSLS Q LV
Sbjct: 120 SNPNQKLPDSVDWRDKGCVTDVKYQGSCGSCWAFSAVGALEAQVKLKTGKLVSLSAQNLV 179

Query: 170 DCATND-NNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITN 228
           DC+    +N GC GGFM +AF+YII N GI ++A Y Y+ M  G C    ++  AA  + 
Sbjct: 180 DCSVGKYSNRGCNGGFMTEAFQYIIDNNGIESEASYPYKAMD-GKC-QYDSKYRAATCSR 237

Query: 229 YEDVPPNDEESLLKAVANQ-PVSVAIDASALQFY---SGGVFNGYCETFLNHGVTAVGYG 284
           Y ++P + E++L +AVAN+ PVSVAIDAS   F+   SG  ++  C   +NHGV  VGYG
Sbjct: 238 YTELPEDSEDALKEAVANKGPVSVAIDASHPSFFLYRSGVYYDPACTLHVNHGVLVVGYG 297

Query: 285 TSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFP 331
            +  G  YWL+KNSWG  +G+ GY R+ R+       CGIA +AS+P
Sbjct: 298 -NLNGKDYWLVKNSWGLHFGDQGYIRMARN---SGNHCGIASYASYP 340


>gi|194883222|ref|XP_001975702.1| GG20414 [Drosophila erecta]
 gi|190658889|gb|EDV56102.1| GG20414 [Drosophila erecta]
          Length = 341

 Score =  213 bits (541), Expect = 1e-52,   Method: Compositional matrix adjust.
 Identities = 133/323 (41%), Positives = 182/323 (56%), Gaps = 27/323 (8%)

Query: 30  IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNA-AIGNRSYTLRLNKFADL 88
           + E++  +K ++ + Y++  E   R +IF +N   + + N   A G  S+ L +NK+ADL
Sbjct: 25  VMEEWHTFKLEHRKNYQDDTEERFRLKIFNENKHKIAKHNQRYAEGKVSFKLAVNKYADL 84

Query: 89  TPQEFIASQTGFKMSDH----SSSLKANGTPFLYKSS-QVPPSVNWIEKGAVTPVKYQGQ 143
              EF     GF  + H    S+     G  F+  +   +P SV+W  KGAVT VK QG 
Sbjct: 85  LHHEFRQLMNGFNYTLHKQLRSTDDSFKGVTFISPAHVTLPKSVDWRTKGAVTAVKDQGH 144

Query: 144 C-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNK 196
           C       +  A+EG +  K   LVSLSEQ LVDC+T   NNGC GG MD+AF+YI  N 
Sbjct: 145 CGSCWAFSSTGALEGQHFRKSGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKDNG 204

Query: 197 GITNDAVYSYEGMSTGICDSIKAEDHAAQITN--YEDVPPNDEESLLKAVAN-QPVSVAI 253
           GI  +  Y YE     I DS      A   T+  + D+P  DE+ + +AVA   PV+VAI
Sbjct: 205 GIDTEKSYPYE----AIDDSCHFNKGAIGATDRGFTDIPQGDEKKMAEAVATVGPVAVAI 260

Query: 254 DAS--ALQFYSGGVFN-GYCET-FLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYF 309
           DAS  + QFYS GV+N   C+   L+HGV  VGYGT E G  YWL+KNSWG  WG+ G+ 
Sbjct: 261 DASHESFQFYSEGVYNEPQCDAQNLDHGVLVVGYGTDESGDDYWLVKNSWGTTWGDKGFI 320

Query: 310 RLQRDIDQPQGQCGIAMFASFPV 332
           ++ R+ D    QCGIA  +S+P+
Sbjct: 321 KMLRNKDN---QCGIASASSYPL 340


>gi|428170119|gb|EKX39047.1| hypothetical protein GUITHDRAFT_154556 [Guillardia theta CCMP2712]
          Length = 352

 Score =  213 bits (541), Expect = 1e-52,   Method: Compositional matrix adjust.
 Identities = 132/351 (37%), Positives = 201/351 (57%), Gaps = 27/351 (7%)

Query: 6   LIVVLIISGSCA---SQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNL 62
           ++++  I+ +CA   S A+  +  +  I   F  WK ++ + Y + AE+  RF +FK N+
Sbjct: 4   ILLLAAIAATCAIPTSPASKTSSVDDEIHLAFISWKNKFEKVY-DGAEHLARFAVFKANM 62

Query: 63  VAVERFNNAA--IGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKA---NGTPFL 117
             + R +NA   +G  ++++  N+FAD+T +EF  +  G+K       L     +G    
Sbjct: 63  EII-RAHNALYELGEETFSMAANQFADMTAEEFKRTVLGYKPELKGKRLLQGLNSGKNCT 121

Query: 118 YKS--SQVPPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQL 168
           ++S  S  P +++W  K AVTPVK QGQC          AVEG   +  + L+SLSE++L
Sbjct: 122 HRSNNSTRPKAIDWRTKSAVTPVKNQGQCGSCWSFSTTGAVEGAWVVAGHPLISLSEEEL 181

Query: 169 VDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSY--EGMSTGICDSIKAEDHAAQI 226
           V C T  ++ GC GG MD+A+ +IIQN GI  + VY Y     +TG+C         A I
Sbjct: 182 VQCDTK-SDQGCNGGLMDNAYAWIIQNGGIAAEDVYPYISGNGTTGVCHVAFLSKKVASI 240

Query: 227 TNYEDVPPNDEESLLKAVANQPVSVAIDA--SALQFYSGGVFNGY-CETFLNHGVTAVGY 283
           +++ D+ P DE  L  A+  QPV+VAI+A  S+ QFY+GGV     C T L+HGV AVGY
Sbjct: 241 SDWCDLKPEDESDLELALVQQPVAVAIEADQSSFQFYNGGVLPAKKCGTKLDHGVLAVGY 300

Query: 284 G-TSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQ-GQCGIAMFASFPV 332
           G   +  + YW++KNSWG +WG++GY RL++   + +   CGIA  AS+P 
Sbjct: 301 GYDKKHKMHYWIVKNSWGAEWGDEGYIRLEKMPKKTKHSACGIAKAASYPT 351


>gi|261289783|ref|XP_002611753.1| hypothetical protein BRAFLDRAFT_236364 [Branchiostoma floridae]
 gi|229297125|gb|EEN67763.1| hypothetical protein BRAFLDRAFT_236364 [Branchiostoma floridae]
          Length = 307

 Score =  213 bits (541), Expect = 1e-52,   Method: Compositional matrix adjust.
 Identities = 134/301 (44%), Positives = 173/301 (57%), Gaps = 21/301 (6%)

Query: 47  ESAENSKRFEIFKDNLVAVERFNNAA-IGNRSYTLRLNKFADLTPQEFIASQTGFKMSDH 105
           E  E S+R EIF++N   +   NN A +G  +Y L  N+FA +T  EF+A+  G  + D 
Sbjct: 12  EGKEESRRMEIFENNTKLINLHNNEADLGMHTYWLGHNQFAHMTNDEFVANVIGGCLLDR 71

Query: 106 SSSLKANGTPFLYKSS--QVPPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIK 156
           ++S         Y S+  ++P +V+W  KG VTPVK Q QC          ++EG    K
Sbjct: 72  NASKSTADRVHQYDSNLVELPDTVDWRTKGYVTPVKNQEQCGSCWAFSTTGSLEGQTFKK 131

Query: 157 INRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDS 216
             +LVSLSEQ LVDC+    N GC GG MDDAFKYI  N GI  +  Y YE    G C  
Sbjct: 132 TGKLVSLSEQNLVDCSGEFGNQGCNGGLMDDAFKYIKANGGIDTEDSYPYEARD-GKC-R 189

Query: 217 IKAEDHAAQITNYEDVPPNDEESLLKAVAN-QPVSVAIDAS--ALQFYSGGV-FNGYC-E 271
            K  D  A +T Y D+   DE +L +AVA   P+SVAIDAS    Q YS GV +   C  
Sbjct: 190 FKPADVGATVTGYTDISEGDEGALTQAVATVGPISVAIDASHHTFQMYSHGVYYEPQCSS 249

Query: 272 TFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFP 331
           T L+HGV AVGYGT E G  YWL+KNSWG+ WG++GY  + R+ +    QCGIA  AS+P
Sbjct: 250 TELDHGVLAVGYGT-EGGKDYWLVKNSWGEVWGQNGYIMMSRNKNN---QCGIATSASYP 305

Query: 332 V 332
           +
Sbjct: 306 L 306


>gi|129353|sp|P22895.1|P34_SOYBN RecName: Full=P34 probable thiol protease; Flags: Precursor
          Length = 379

 Score =  213 bits (541), Expect = 1e-52,   Method: Compositional matrix adjust.
 Identities = 127/332 (38%), Positives = 177/332 (53%), Gaps = 24/332 (7%)

Query: 30  IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLT 89
           ++  F+ WK+++GR Y    E +KR EIFK+N   +   N       S+ L LNKFAD+T
Sbjct: 40  VSSLFQLWKSEHGRVYHNHEEEAKRLEIFKNNSNYIRDMNANRKSPHSHRLGLNKFADIT 99

Query: 90  PQEFIAS--QTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQC--- 144
           PQEF     Q    +S              Y     P S +W +KG +T VKYQG C   
Sbjct: 100 PQEFSKKYLQAPKDVSQQIKMANKKMKKEQYSCDHPPASWDWRKKGVITQVKYQGGCGRG 159

Query: 145 ----AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITN 200
               A  A+E  +AI    LVSLSEQ+LVDC   + + G Y G+   +F++++++ GI  
Sbjct: 160 WAFSATGAIEAAHAIATGDLVSLSEQELVDCV--EESEGSYNGWQYQSFEWVLEHGGIAT 217

Query: 201 DAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDE-------ESLLKAVANQPVSVAI 253
           D  Y Y     G C + K +D    I  YE +  +DE       ++ L A+  QP+SV+I
Sbjct: 218 DDDYPYRA-KEGRCKANKIQDKVT-IDGYETLIMSDESTESETEQAFLSAILEQPISVSI 275

Query: 254 DASALQFYSGGVFNGYCETF---LNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFR 310
           DA     Y+GG+++G   T    +NH V  VGYG S +G+ YW+ KNSWG DWGEDGY  
Sbjct: 276 DAKDFHLYTGGIYDGENCTSPYGINHFVLLVGYG-SADGVDYWIAKNSWGFDWGEDGYIW 334

Query: 311 LQRDIDQPQGQCGIAMFASFPVSKESAQPSSA 342
           +QR+     G CG+  FAS+P  +ES    SA
Sbjct: 335 IQRNTGNLLGVCGMNYFASYPTKEESETLVSA 366


>gi|410923307|ref|XP_003975123.1| PREDICTED: cathepsin L1-like [Takifugu rubripes]
          Length = 336

 Score =  213 bits (541), Expect = 1e-52,   Method: Compositional matrix adjust.
 Identities = 132/345 (38%), Positives = 192/345 (55%), Gaps = 28/345 (8%)

Query: 5   FLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVA 64
           F +VVL +   C + A      +  + E +  WK  + + Y E  E  +R  +++ NL  
Sbjct: 2   FPVVVLAL---CVTAALSAPSLDPQLDEHWNLWKDWHSKKYHEKEEGWRRM-VWEKNLKK 57

Query: 65  VERFN-NAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKS-SQ 122
           +E  N   ++G  +Y+L +N F D+T +EF     G+K+    S  K  G+ F+  +  +
Sbjct: 58  IELHNLEHSMGKHTYSLGMNHFGDMTHEEFRQIMNGYKLK---SQRKLRGSLFMEPNFLE 114

Query: 123 VPPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATND 175
            P SV+W +KG VTPVK QGQC          A+EG +  K   LVSLSEQ LVDC+  +
Sbjct: 115 APRSVDWRDKGYVTPVKDQGQCGSCWAFSTTGAMEGQHFRKTGTLVSLSEQNLVDCSRPE 174

Query: 176 NNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPN 235
            N GC GG MD AF+YI  N G+ ++  Y Y G   G C    + + +A  T + DVP  
Sbjct: 175 GNEGCNGGLMDQAFQYIKDNGGLDSEESYPYLGTDEGPCHYDPSYN-SANDTGFVDVPSG 233

Query: 236 DEESLLKAVAN-QPVSVAIDA--SALQFYSGGV-FNGYCET-FLNHGVTAVGY---GTSE 287
            E +L+KAVA+  PVSVAIDA   + QFY  G+ ++  C +  L+HGV  VGY   G   
Sbjct: 234 SERALMKAVASVGPVSVAIDAGHESFQFYHSGIYYDKECSSEELDHGVLVVGYGFEGKDV 293

Query: 288 EGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
           +G KYW++KNSW ++WG+ GY  + +D    +  CGIA  AS+P+
Sbjct: 294 DGKKYWIVKNSWSENWGDKGYIYMAKD---KKNHCGIATAASYPL 335


>gi|317135059|gb|ADV03094.1| cathepsin L [Hyriopsis cumingii]
 gi|372126672|gb|AEX88474.1| cathepsin L [Hyriopsis schlegelii]
          Length = 333

 Score =  213 bits (541), Expect = 1e-52,   Method: Compositional matrix adjust.
 Identities = 129/306 (42%), Positives = 179/306 (58%), Gaps = 24/306 (7%)

Query: 41  YGRTYKESAENSKRFEIFKDNLVAVERFNNAA-IGNRSYTLRLNKFADLTPQEFIASQTG 99
           Y +TY+ + E   R+ ++KDN +A+ R N+ A  G  +Y L +N++ DLT +E+   +TG
Sbjct: 37  YNKTYR-AHEEPVRYSVWKDNFLAINRHNSKADQGFHTYWLAMNEYGDLTNEEYFRLRTG 95

Query: 100 FKMSDHSSSLKANGTPFLYKS-SQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEG 151
            K++   ++++  G  F Y + S+ P  V+W  KG VTPVK QG C       A  AVEG
Sbjct: 96  LKIN---ANIERRGLVFKYTNLSEYPSEVDWRSKGYVTPVKNQGGCGSCYAFSATGAVEG 152

Query: 152 INAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMST 211
            +  K  +LVSLSEQ +VDC+  + N GC GG MD +F YI  N GI  +  Y YE    
Sbjct: 153 QHFRKTGKLVSLSEQNIVDCSFKEGNKGCRGGLMDKSFTYIKDNNGIDTEEAYPYEARD- 211

Query: 212 GICDSIKAEDHAAQITNYEDVPPNDEESLLKAVAN-QPVSVAIDASA--LQFYSGGVF-N 267
           G C   ++E   A +  Y D+P NDE +L  AV    P+SVAID      +FY  GVF N
Sbjct: 212 GPCRFRRSEV-GATVRGYVDLPENDEIALQHAVTTIGPISVAIDGHHFNFRFYHHGVFDN 270

Query: 268 GYC-ETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAM 326
             C +T +NHGV  VGYGT  +G+ YWL+KNSWG+ WG +GY  + R+ D    QC I  
Sbjct: 271 PNCSKTKINHGVLVVGYGT-RDGLDYWLVKNSWGERWGAEGYILMSRNNDN---QCCITC 326

Query: 327 FASFPV 332
            AS+P+
Sbjct: 327 AASYPI 332


>gi|242072384|ref|XP_002446128.1| hypothetical protein SORBIDRAFT_06g002110 [Sorghum bicolor]
 gi|241937311|gb|EES10456.1| hypothetical protein SORBIDRAFT_06g002110 [Sorghum bicolor]
          Length = 186

 Score =  213 bits (541), Expect = 2e-52,   Method: Compositional matrix adjust.
 Identities = 102/187 (54%), Positives = 133/187 (71%), Gaps = 3/187 (1%)

Query: 149 VEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEG 208
           +EG   I   +LVSLSEQ+LVDC  N  + GC GG MDDAF++++ N G+T ++ Y Y G
Sbjct: 1   MEGAVKISTGKLVSLSEQELVDCDVNGMDQGCEGGEMDDAFEFVVDNGGLTTESKYPYTG 60

Query: 209 MSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDA--SALQFYSGGVF 266
            S G C+S +A++ AA IT YEDVP NDE SL KAVANQPVSVA+D   +  +FY GGV 
Sbjct: 61  -SDGNCNSDEAKNDAASITGYEDVPANDETSLRKAVANQPVSVAVDGGDNLFRFYKGGVL 119

Query: 267 NGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAM 326
           +G C T L+HG+ AVGYG + +G K+WL+KNSWG  WGE GY R++RDI   +G CG+AM
Sbjct: 120 SGACGTELDHGIAAVGYGVAGDGTKFWLMKNSWGTSWGEAGYIRMERDIADDEGLCGLAM 179

Query: 327 FASFPVS 333
             S+P +
Sbjct: 180 QPSYPTA 186


>gi|33333712|gb|AAQ11974.1| putative gut cathepsin L-like cysteine protease [Callosobruchus
           maculatus]
          Length = 326

 Score =  213 bits (541), Expect = 2e-52,   Method: Compositional matrix adjust.
 Identities = 129/330 (39%), Positives = 186/330 (56%), Gaps = 49/330 (14%)

Query: 29  SIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNA-AIGNRSYTLRLNKFAD 87
           S+ E+++Q+K  +G+TY+   E  +RF +F+ NLV ++  N     G  S+  ++ +FAD
Sbjct: 18  SVYEEWQQFKLDHGKTYRSLVEEKRRFSVFQKNLVDIQEHNKKYERGEESFAKKVTQFAD 77

Query: 88  LTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSS-----------QVPPSVNWIEKGAVT 136
           +T +EF+              LK  G P L  ++           +   +V+W E+GAVT
Sbjct: 78  MTHEEFL------------DLLKLQGVPALPSNAVHFDNFEDIDMEEKDAVDWREEGAVT 125

Query: 137 PVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATND-NNNGCYGGFMDDA 188
           PVK Q  C       AV A+EG    K   LVSLS Q+LVDCAT D  NNGC GG M  A
Sbjct: 126 PVKDQANCGSCWAFSAVGAIEGQFFKKNGTLVSLSAQELVDCATEDYGNNGCKGGLMGQA 185

Query: 189 FKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQ- 247
           F ++ Q++GI  +  Y YEG  +  C   K+ ++  ++  Y  V P DE+ + + VA + 
Sbjct: 186 FDFV-QDEGIQTEESYPYEGRRSS-CK--KSGEYVTKVKTY--VFPLDEQEMARTVAAKG 239

Query: 248 PVSVAIDASALQFYSGGVFNGYCETF-----LNHGVTAVGYGTSEEGIKYWLIKNSWGQD 302
           PV+VAI+AS L FY  G+ +  C        LNHGV  VGYG SE G+ YW++KNSWG D
Sbjct: 240 PVAVAIEASQLSFYDKGIVDERCRCSNKREDLNHGVLVVGYG-SENGVDYWIVKNSWGAD 298

Query: 303 WGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
           WGE GYFRL++D+      CGI  + ++P+
Sbjct: 299 WGEKGYFRLKKDVK----ACGIGYYNTYPI 324


>gi|115435294|ref|NP_001042405.1| Os01g0217300 [Oryza sativa Japonica Group]
 gi|7523481|dbj|BAA94209.1| putative cysteine proteinase Mir3 [Oryza sativa Japonica Group]
 gi|10800061|dbj|BAB16481.1| putative cysteine proteinase Mir3 [Oryza sativa Japonica Group]
 gi|113531936|dbj|BAF04319.1| Os01g0217300 [Oryza sativa Japonica Group]
 gi|125524918|gb|EAY73032.1| hypothetical protein OsI_00905 [Oryza sativa Indica Group]
          Length = 366

 Score =  213 bits (541), Expect = 2e-52,   Method: Compositional matrix adjust.
 Identities = 129/325 (39%), Positives = 172/325 (52%), Gaps = 32/325 (9%)

Query: 34  FEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNN--------AAIGNRSYT---LRL 82
           F +W AQYG+ Y    E+ KR++I+KDN   +  F +         A   ++ T   + +
Sbjct: 49  FSRWMAQYGKAYSWPIEHEKRYQIWKDNSNFIGSFRSETEISSGVGAFAPQTVTDSFVGM 108

Query: 83  NKFADLTPQEFIASQTGFKMSD---HSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVK 139
           N+F DLTP EF    TGF  +    H++             S +P  V+W   GAVT VK
Sbjct: 109 NRFGDLTPGEFAEQFTGFNATGGLLHAAPPPCPIP----PDSWLPCCVDWRSSGAVTGVK 164

Query: 140 YQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYI 192
           +Q  CA        AA+EG+N I+   LVSLSEQ +VDC T   ++GC GG  D A   +
Sbjct: 165 FQRSCASCWAFAAAAAIEGLNKIRTGELVSLSEQVMVDCDTG--SSGCSGGRADTALGLV 222

Query: 193 IQNKGITNDAVYSYEGMSTGICDSIKA-EDHAAQITNYEDVPPNDEESLLKAVANQPVSV 251
               G+ ++  Y Y G+  G CD  K    H+A ++ +  VPPNDE  L  AVA QPV+ 
Sbjct: 223 AARGGVASEEEYPYTGVRGG-CDVGKLLSGHSASLSGFRAVPPNDERQLALAVARQPVTA 281

Query: 252 AIDASA--LQFYSGGVFNGYCET-FLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGY 308
            IDA A    FY GGV+ G C    +NH V  VGY     G KYW+ KNSWG DWGE GY
Sbjct: 282 YIDAGAREFMFYKGGVYRGPCSAERVNHAVAIVGYCEGFGGDKYWIAKNSWGSDWGEQGY 341

Query: 309 FRLQRDIDQPQGQCGIAMFASFPVS 333
             L +D+  PQG CG+A    +P +
Sbjct: 342 VYLAKDVWWPQGTCGLATSPFYPTA 366


>gi|33333706|gb|AAQ11971.1| putative gut cathepsin L-like cysteine protease [Callosobruchus
           maculatus]
          Length = 326

 Score =  212 bits (540), Expect = 2e-52,   Method: Compositional matrix adjust.
 Identities = 129/330 (39%), Positives = 186/330 (56%), Gaps = 49/330 (14%)

Query: 29  SIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNA-AIGNRSYTLRLNKFAD 87
           S+ E+++Q+K  +G+TY+   E  +RF +F+ NLV ++  N     G  S+  ++ +FAD
Sbjct: 18  SVYEEWQQFKLDHGKTYRSLVEEKRRFSVFQKNLVDIQEHNKKYERGEESFAKKVTQFAD 77

Query: 88  LTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSS-----------QVPPSVNWIEKGAVT 136
           +T +EF+              LK  G P L  ++           +   +V+W E+GAVT
Sbjct: 78  MTHEEFL------------DLLKLQGVPALPSNAVHFDNFEDIDMEEKDAVDWREEGAVT 125

Query: 137 PVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATND-NNNGCYGGFMDDA 188
           PVK Q  C       AV A+EG    K   LVSLS Q+LVDCAT D  NNGC GG M  A
Sbjct: 126 PVKDQANCGSCWAFSAVGAIEGQFFKKNGTLVSLSAQELVDCATEDYGNNGCKGGLMGQA 185

Query: 189 FKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQ- 247
           F ++ Q++GI  +  Y YEG  +  C   K+ ++  ++  Y  V P DE+ + + VA + 
Sbjct: 186 FDFV-QDEGIQTEESYPYEGRRSS-CK--KSGEYVTKVKTY--VFPLDEQEMARTVAAKG 239

Query: 248 PVSVAIDASALQFYSGGVFNGYCETF-----LNHGVTAVGYGTSEEGIKYWLIKNSWGQD 302
           PV+VAI+AS L FY  G+ +  C        LNHGV  VGYG SE G+ YW++KNSWG D
Sbjct: 240 PVAVAIEASQLSFYDKGIVDERCRCSNKREDLNHGVLVVGYG-SENGVDYWIVKNSWGAD 298

Query: 303 WGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
           WGE GYFRL++D+      CGI  + ++P+
Sbjct: 299 WGEKGYFRLKKDVK----ACGIGYYNTYPI 324


>gi|125569526|gb|EAZ11041.1| hypothetical protein OsJ_00885 [Oryza sativa Japonica Group]
          Length = 366

 Score =  212 bits (540), Expect = 2e-52,   Method: Compositional matrix adjust.
 Identities = 129/325 (39%), Positives = 172/325 (52%), Gaps = 32/325 (9%)

Query: 34  FEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNN--------AAIGNRSYT---LRL 82
           F +W AQYG+ Y    E+ KR++I+KDN   +  F +         A   ++ T   + +
Sbjct: 49  FSRWMAQYGKAYSWPIEHEKRYQIWKDNSNFIGSFRSETEISSGVCAFAPQTVTDSFVGM 108

Query: 83  NKFADLTPQEFIASQTGFKMSD---HSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVK 139
           N+F DLTP EF    TGF  +    H++             S +P  V+W   GAVT VK
Sbjct: 109 NRFGDLTPGEFAEQFTGFNATGGLLHAAPPPCPIP----PDSWLPCCVDWRSSGAVTGVK 164

Query: 140 YQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYI 192
           +Q  CA        AA+EG+N I+   LVSLSEQ +VDC T   ++GC GG  D A   +
Sbjct: 165 FQRSCASCWAFAAAAAIEGLNKIRTGELVSLSEQVMVDCDTG--SSGCSGGRADTALGLV 222

Query: 193 IQNKGITNDAVYSYEGMSTGICDSIKA-EDHAAQITNYEDVPPNDEESLLKAVANQPVSV 251
               G+ ++  Y Y G+  G CD  K    H+A ++ +  VPPNDE  L  AVA QPV+ 
Sbjct: 223 AARGGVASEEEYPYTGVRGG-CDVGKLLSGHSASLSGFRAVPPNDERQLALAVARQPVTA 281

Query: 252 AIDASA--LQFYSGGVFNGYCET-FLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGY 308
            IDA A    FY GGV+ G C    +NH V  VGY     G KYW+ KNSWG DWGE GY
Sbjct: 282 YIDAGAREFMFYKGGVYRGPCSAERVNHAVAIVGYCEGFGGDKYWIAKNSWGSDWGEQGY 341

Query: 309 FRLQRDIDQPQGQCGIAMFASFPVS 333
             L +D+  PQG CG+A    +P +
Sbjct: 342 VYLAKDVWWPQGTCGLATSPFYPTA 366


>gi|17062058|gb|AAL34984.1|AF320565_1 cathepsine L-like cysteine protease [Rhodnius prolixus]
          Length = 316

 Score =  212 bits (540), Expect = 2e-52,   Method: Compositional matrix adjust.
 Identities = 127/309 (41%), Positives = 173/309 (55%), Gaps = 22/309 (7%)

Query: 37  WKAQYGRTYKESAENSKRFEIFKDNLVAVERFN-NAAIGNRSYTLRLNKFADLTPQEFIA 95
           +KA +G+ Y+   E   R ++F DN   ++  N    +G  SY +++N   DL   EF A
Sbjct: 16  FKAMHGKNYRNQFEEIFRMKVFIDNKKKIDEHNAKYELGEASYKMKMNHLGDLMVHEFKA 75

Query: 96  SQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQC-------AVAA 148
              GFK + ++   + NG  ++  +  +P SV+W ++GAVTPVK QG C       A  +
Sbjct: 76  LMNGFKKTPNA---ERNGKIYVPSNENLPKSVDWRQRGAVTPVKDQGHCGSCWSFSATGS 132

Query: 149 VEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEG 208
           +EG   +K  RLVSLSEQ LVDC+    N+GC GG M+ AF+Y+  NKGI  +A Y YE 
Sbjct: 133 LEGQLFLKTGRLVSLSEQNLVDCSKTYGNSGCEGGLMNQAFQYVRDNKGIDTEASYPYEA 192

Query: 209 MSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVAN-QPVSVAIDAS--ALQFYSGGV 265
                    K +        Y D+    E+ L  AVA   P+SV IDAS  + QFYS GV
Sbjct: 193 RENNC--RFKEDKVGGTDKGYVDILEASEKDLQSAVATVGPISVRIDASHESFQFYSEGV 250

Query: 266 FN-GYCE-TFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCG 323
           +   YC  + L+HGV  VGYGT E G  YWL+KNSWG  WGE GY ++ R+    +  CG
Sbjct: 251 YKEQYCSPSQLDHGVLTVGYGT-ENGQDYWLVKNSWGPSWGESGYIKIARN---HKNHCG 306

Query: 324 IAMFASFPV 332
           IA  AS+PV
Sbjct: 307 IASMASYPV 315


>gi|118136313|gb|ABK62794.1| cathepsin L-like cysteine protease [Neobenedenia melleni]
 gi|118136315|gb|ABK62795.1| cathepsin L-like cysteine protease [Neobenedenia melleni]
          Length = 335

 Score =  212 bits (540), Expect = 2e-52,   Method: Compositional matrix adjust.
 Identities = 131/342 (38%), Positives = 182/342 (53%), Gaps = 25/342 (7%)

Query: 5   FLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVA 64
           F + +L+   +  S  TY  F++ S      QWK +Y + Y  S +   +   +  NL  
Sbjct: 4   FSVFLLLCVATALSVPTYPLFNQWS------QWKVKYQKDYLSSEDELNKLLTWSKNLET 57

Query: 65  VERFNNA-AIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQV 123
           V + N   A G +SYTL +N  ADL+ +EF A     K        K        +    
Sbjct: 58  VRKHNELYAQGKKSYTLAMNHMADLSSEEFKALYLVPKFDATKVPRKGKAAGEHRQIKND 117

Query: 124 PPS-VNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATND 175
           PPS ++W+ KG VT VK Q QC       +  ++EG       +L+S SEQQLVDC+T  
Sbjct: 118 PPSEIDWVRKGHVTAVKNQAQCGSCWAFSSTGSIEGAVKRATGKLISFSEQQLVDCSTAF 177

Query: 176 NNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPN 235
            N+GC GG MD++F Y+I NKG+ ++A Y YE      C   KA      I+++ DV   
Sbjct: 178 GNHGCNGGIMDNSFNYLIHNKGLESEASYPYEAQKKE-CRYKKALSKGT-ISSFTDVSQF 235

Query: 236 DEESLLKAVA-NQPVSVAIDASALQF--YSGGVFN--GYCETFLNHGVTAVGYGTSEEGI 290
           DE+ L +AV    PVS+AIDAS   F  Y  GV++     +T LNHGV AVGYGT+ EG+
Sbjct: 236 DEKDLKRAVGLVGPVSIAIDASQFSFHLYDSGVYDEEDCSQTMLNHGVLAVGYGTTPEGL 295

Query: 291 KYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
            YW +KNSW   WG +GY  + R+ D    QCG+A  AS+P+
Sbjct: 296 DYWKVKNSWTNTWGMEGYILMSRNKDN---QCGVATVASYPI 334


>gi|179957|gb|AAC37592.1| cathepsin S [Homo sapiens]
          Length = 331

 Score =  212 bits (540), Expect = 2e-52,   Method: Compositional matrix adjust.
 Identities = 135/342 (39%), Positives = 188/342 (54%), Gaps = 32/342 (9%)

Query: 6   LIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAV 65
           L+ VL++  S  +Q       + ++   +  WK  YG+ YKE  E + R  I++ NL  V
Sbjct: 4   LVCVLLVCSSAVAQ----LHKDPTLDHHWHLWKKTYGKQYKEKNEEAVRRLIWEKNLKFV 59

Query: 66  ERFN-NAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQ-- 122
              N   ++G  SY L +N   D+T +E ++  +  ++    S  + N T   YKS+   
Sbjct: 60  MLHNLEHSMGMHSYDLGMNHLGDMTSEEVMSLMSSLRVP---SQWQRNIT---YKSNPNR 113

Query: 123 -VPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATN 174
            +P SV+W EKG VT VKYQG C       AV A+E    +K  +LVSLS Q LVDC+T 
Sbjct: 114 ILPDSVDWREKGCVTEVKYQGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTE 173

Query: 175 D-NNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVP 233
              N GC GGFM  AF+YII NKGI +DA Y Y+ M         ++  AA  + Y ++P
Sbjct: 174 KYGNKGCNGGFMTTAFQYIIDNKGIDSDASYPYKAMDLKC--QYDSKYRAATCSKYTELP 231

Query: 234 PNDEESLLKAVANQ-PVSVAIDASALQFY---SGGVFNGYCETFLNHGVTAVGYGTSEEG 289
              E+ L +AVAN+ PVSV +DA    F+   SG  +   C   +NHGV  VGYG    G
Sbjct: 232 YGREDVLKEAVANKGPVSVGVDARHPSFFLYRSGVYYEPSCTQNVNHGVLVVGYG-DLNG 290

Query: 290 IKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFP 331
            +YWL+KNSWG ++GE+GY R+ R+       CGIA F S+P
Sbjct: 291 KEYWLVKNSWGHNFGEEGYIRMARN---KGNHCGIASFPSYP 329


>gi|61368403|gb|AAX43172.1| cathepsin S [synthetic construct]
          Length = 332

 Score =  212 bits (539), Expect = 2e-52,   Method: Compositional matrix adjust.
 Identities = 135/342 (39%), Positives = 188/342 (54%), Gaps = 32/342 (9%)

Query: 6   LIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAV 65
           L+ VL++  S  +Q       + ++   +  WK  YG+ YKE  E + R  I++ NL  V
Sbjct: 4   LVCVLLVCSSAVAQ----LHKDPTLDHHWHLWKKTYGKQYKEKNEEAVRRLIWEKNLKFV 59

Query: 66  ERFN-NAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQ-- 122
              N   ++G  SY L +N   D+T +E ++  +  ++    S  + N T   YKS+   
Sbjct: 60  MLHNLEHSMGMHSYDLGMNHLGDMTSEEVMSLMSSLRVP---SQWQRNIT---YKSNPNR 113

Query: 123 -VPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATN 174
            +P SV+W EKG VT VKYQG C       AV A+E    +K  +LVSLS Q LVDC+T 
Sbjct: 114 ILPDSVDWREKGCVTEVKYQGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTE 173

Query: 175 D-NNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVP 233
              N GC GGFM  AF+YII NKGI +DA Y Y+ M         ++  AA  + Y ++P
Sbjct: 174 KYGNKGCNGGFMTTAFQYIIDNKGIDSDASYPYKAMDQKC--QYDSKYRAATCSKYTELP 231

Query: 234 PNDEESLLKAVANQ-PVSVAIDASALQFY---SGGVFNGYCETFLNHGVTAVGYGTSEEG 289
              E+ L +AVAN+ PVSV +DA    F+   SG  +   C   +NHGV  VGYG    G
Sbjct: 232 YGREDVLKEAVANKGPVSVGVDARHPSFFLYRSGVYYEPSCTQNVNHGVLVVGYG-DLNG 290

Query: 290 IKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFP 331
            +YWL+KNSWG ++GE+GY R+ R+       CGIA F S+P
Sbjct: 291 KEYWLVKNSWGHNFGEEGYIRMARN---KGNHCGIASFPSYP 329


>gi|167526493|ref|XP_001747580.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163774026|gb|EDQ87660.1| predicted protein [Monosiga brevicollis MX1]
          Length = 330

 Score =  212 bits (539), Expect = 2e-52,   Method: Compositional matrix adjust.
 Identities = 121/286 (42%), Positives = 166/286 (58%), Gaps = 25/286 (8%)

Query: 55  FEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGT 114
           F     NL  +E  N    GN S+T+ + +FADLT  EF A    F M+      +    
Sbjct: 48  FRCHLANLRVIEAHN---AGNSSFTMGITQFADLTAAEFSAYVKRFPMNVTRPRNE---- 100

Query: 115 PFLYKSSQVPPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQ 167
             ++ +      V+W +K AVT +K QGQC          +VEG +AI   +LVSLSEQQ
Sbjct: 101 --VWITEAPLQEVDWRQKNAVTEIKNQGQCGSCWSFSTTGSVEGAHAIATGKLVSLSEQQ 158

Query: 168 LVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQIT 227
           L+DC+T   N+GC GG MD AF+Y+I N G+  +  Y Y     G C++ K + HAA+I 
Sbjct: 159 LMDCSTRYGNHGCNGGLMDYAFEYVIANGGLDTEEDYPYTA-EDGKCNTEKEKKHAAEIH 217

Query: 228 NYEDVPPNDEESLLKAVANQPVSVAIDA--SALQFYSGGVFNGYCETFLNHGVTAVGYGT 285
            + +VP   E+ L  AV+  PVSVAI+A  +  Q Y+ GVF+G C T L+HGV  VGY  
Sbjct: 218 GFRNVPKEHEDQLAAAVSIGPVSVAIEADQAGFQHYTSGVFDGKCGTSLDHGVLVVGYSD 277

Query: 286 SEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFP 331
                 YW++KNSWG+ WGE+GY RL+R +D+ +G CGI M AS+P
Sbjct: 278 D-----YWIVKNSWGKSWGEEGYIRLKRGVDK-KGMCGITMQASYP 317


>gi|23110962|ref|NP_004070.3| cathepsin S isoform 1 preproprotein [Homo sapiens]
 gi|88984046|sp|P25774.3|CATS_HUMAN RecName: Full=Cathepsin S; Flags: Precursor
 gi|60816153|gb|AAX36372.1| cathepsin S [synthetic construct]
 gi|61358282|gb|AAX41541.1| cathepsin S [synthetic construct]
 gi|119573903|gb|EAW53518.1| cathepsin S, isoform CRA_b [Homo sapiens]
 gi|119573904|gb|EAW53519.1| cathepsin S, isoform CRA_b [Homo sapiens]
          Length = 331

 Score =  212 bits (539), Expect = 2e-52,   Method: Compositional matrix adjust.
 Identities = 135/342 (39%), Positives = 188/342 (54%), Gaps = 32/342 (9%)

Query: 6   LIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAV 65
           L+ VL++  S  +Q       + ++   +  WK  YG+ YKE  E + R  I++ NL  V
Sbjct: 4   LVCVLLVCSSAVAQ----LHKDPTLDHHWHLWKKTYGKQYKEKNEEAVRRLIWEKNLKFV 59

Query: 66  ERFN-NAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQ-- 122
              N   ++G  SY L +N   D+T +E ++  +  ++    S  + N T   YKS+   
Sbjct: 60  MLHNLEHSMGMHSYDLGMNHLGDMTSEEVMSLMSSLRVP---SQWQRNIT---YKSNPNR 113

Query: 123 -VPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATN 174
            +P SV+W EKG VT VKYQG C       AV A+E    +K  +LVSLS Q LVDC+T 
Sbjct: 114 ILPDSVDWREKGCVTEVKYQGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTE 173

Query: 175 D-NNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVP 233
              N GC GGFM  AF+YII NKGI +DA Y Y+ M         ++  AA  + Y ++P
Sbjct: 174 KYGNKGCNGGFMTTAFQYIIDNKGIDSDASYPYKAMDQKC--QYDSKYRAATCSKYTELP 231

Query: 234 PNDEESLLKAVANQ-PVSVAIDASALQFY---SGGVFNGYCETFLNHGVTAVGYGTSEEG 289
              E+ L +AVAN+ PVSV +DA    F+   SG  +   C   +NHGV  VGYG    G
Sbjct: 232 YGREDVLKEAVANKGPVSVGVDARHPSFFLYRSGVYYEPSCTQNVNHGVLVVGYG-DLNG 290

Query: 290 IKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFP 331
            +YWL+KNSWG ++GE+GY R+ R+       CGIA F S+P
Sbjct: 291 KEYWLVKNSWGHNFGEEGYIRMARN---KGNHCGIASFPSYP 329


>gi|330805273|ref|XP_003290609.1| hypothetical protein DICPUDRAFT_92519 [Dictyostelium purpureum]
 gi|325079248|gb|EGC32857.1| hypothetical protein DICPUDRAFT_92519 [Dictyostelium purpureum]
          Length = 333

 Score =  212 bits (539), Expect = 2e-52,   Method: Compositional matrix adjust.
 Identities = 130/311 (41%), Positives = 178/311 (57%), Gaps = 22/311 (7%)

Query: 34  FEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEF 93
           F  W  ++ R Y    E + R++ FK+N+  + ++N+         L L KFADLT +E+
Sbjct: 33  FIGWMRKHDRAYSHE-EFTDRYQAFKENMDFIHKWNSQ---ESDTVLGLTKFADLTNEEY 88

Query: 94  IASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQCA-------V 146
                G K+ +   +L A      +     P S++W EKGAV+ VK QGQC         
Sbjct: 89  KKHYLGIKV-NVKKNLNAAQKGLKFFKFTGPDSIDWREKGAVSQVKDQGQCGSCWSFSTT 147

Query: 147 AAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSY 206
            AVEG + IK   +VSLSEQ LVDC+    N GC GG M +AF+YII N GI  ++ Y Y
Sbjct: 148 GAVEGAHQIKSGNMVSLSEQNLVDCSGQYGNQGCEGGLMVNAFEYIIDNGGIATESSYPY 207

Query: 207 EGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASAL--QFYSGG 264
              + G C   K+ +  A I  Y+++P  +E+SL  A+A QPVSVAIDAS +  Q YS G
Sbjct: 208 TA-AQGRCKFTKSMN-GANIIGYKEIPQGEEDSLTAALAKQPVSVAIDASHMSFQLYSSG 265

Query: 265 VFN--GYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQC 322
           V++        L+HGV AVGYGT  EG  Y++IKNSWG  WG+DGY  + R+    Q QC
Sbjct: 266 VYDEPACSSEALDHGVLAVGYGTL-EGKDYYIIKNSWGPTWGQDGYIFMSRN---AQNQC 321

Query: 323 GIAMFASFPVS 333
           G+A  AS+P+S
Sbjct: 322 GVATMASYPIS 332


>gi|115715524|ref|XP_780580.2| PREDICTED: cathepsin L-like [Strongylocentrotus purpuratus]
          Length = 334

 Score =  212 bits (539), Expect = 2e-52,   Method: Compositional matrix adjust.
 Identities = 137/341 (40%), Positives = 188/341 (55%), Gaps = 26/341 (7%)

Query: 7   IVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVE 66
           + VL+++    S  +    D     E + QWK ++G+ Y    E + R  I++ NL  V 
Sbjct: 4   LSVLLVAACVVSSLSMSFID---FDEDWNQWKNEHGKRYLSDEEEASRRLIWQKNLDIVI 60

Query: 67  RFN-NAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSS--QV 123
           + N    +G+ +Y L +N+FADL  +EF++   GF+    +SS    G+ FL  S+   +
Sbjct: 61  KHNLKYDLGHFTYDLGMNQFADLKNEEFVSLMNGFR---GNSSKATRGSTFLPPSNVFDM 117

Query: 124 PPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDN 176
           P  V+W  KG VTPVK Q QC       A  ++EG +  K  +LVSLSEQ LVDC+  + 
Sbjct: 118 PTMVDWRTKGYVTPVKNQLQCGSCWAFSATGSLEGQHFKKTGKLVSLSEQNLVDCSGKEG 177

Query: 177 NNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPND 236
           N GC GG MD AF+YI+   GI  +  Y Y  M  G C   KA +  A  T Y DV    
Sbjct: 178 NMGCEGGLMDQAFQYILDVGGIDTEMSYPYTAMD-GQCHFNKA-NIGATDTGYTDVTTGS 235

Query: 237 EESLLKAVAN-QPVSVAIDAS--ALQFYSGGVFN--GYCETFLNHGVTAVGYGTSEEGIK 291
           E +L  AVA+  P+SVAIDAS  + Q Y  GV+N      T L+HGV AVGYGTS +G  
Sbjct: 236 ESALQMAVASVGPISVAIDASHQSFQLYKSGVYNEPACSSTLLDHGVLAVGYGTSSDGTD 295

Query: 292 YWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
           Y+   +SWG  WG +GY  + R+ D    QCGIA  AS+P+
Sbjct: 296 YFFFFHSWGAAWGMNGYLWMSRNKDN---QCGIATKASYPL 333


>gi|350583407|ref|XP_003481511.1| PREDICTED: cathepsin S [Sus scrofa]
          Length = 331

 Score =  212 bits (539), Expect = 2e-52,   Method: Compositional matrix adjust.
 Identities = 135/339 (39%), Positives = 193/339 (56%), Gaps = 26/339 (7%)

Query: 6   LIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAV 65
           L+ VL++  S  +Q       + ++   ++ WK  YG+ YKE  E   R  I++ NL  V
Sbjct: 4   LVWVLLLCSSAMAQ----LHRDPTLDRHWDLWKKTYGKQYKEKNEEVARRLIWEKNLKTV 59

Query: 66  ERFN-NAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVP 124
              N   ++G  SY L +N   D+T +E I+  +  ++    S    N T     + ++P
Sbjct: 60  MLHNLEHSMGMHSYDLGMNHLGDMTSEEVISLMSCVRVP---SQWPRNVTYKSNPNQKLP 116

Query: 125 PSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATND-N 176
            S++W EKG VT VKYQG C       AV A+E    +K  RLVSLS Q LVDC+T    
Sbjct: 117 DSMDWREKGCVTEVKYQGSCGSCWAFSAVGALEAQVKMKTGRLVSLSAQNLVDCSTEKYR 176

Query: 177 NNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPND 236
           N GC GGFM +AF+YII N GI ++A Y Y+ +  G C    +++ AA  + Y ++P  D
Sbjct: 177 NKGCNGGFMTEAFQYIIDNNGIDSEASYPYKAVD-GKC-KYDSKNRAATCSRYTELPFAD 234

Query: 237 EESLLKAVANQ-PVSVAIDA--SALQFYSGGV-FNGYCETFLNHGVTAVGYGTSEEGIKY 292
           E +L +AVAN+ PVSVAIDA  S+  FY  GV ++  C   +NHGV  VGYG +  G  Y
Sbjct: 235 EYALKEAVANKGPVSVAIDAKHSSFFFYRSGVYYDPSCTQNVNHGVLVVGYG-NLNGKDY 293

Query: 293 WLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFP 331
           WL+KNSWG ++G+ GY R+ R+    +  CGIA + S+P
Sbjct: 294 WLVKNSWGLNFGDGGYIRMARN---SENHCGIANYPSYP 329


>gi|414591548|tpg|DAA42119.1| TPA: hypothetical protein ZEAMMB73_388689, partial [Zea mays]
          Length = 229

 Score =  212 bits (539), Expect = 2e-52,   Method: Compositional matrix adjust.
 Identities = 109/204 (53%), Positives = 136/204 (66%), Gaps = 8/204 (3%)

Query: 141 QGQC----AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNK 196
           +G C    A+AAVEG+N I   +LVSLSEQ+LVDC   DN  GC GG MD AF+YI +N 
Sbjct: 12  EGSCWAFSAIAAVEGVNKIMTGKLVSLSEQELVDCDDVDNQ-GCDGGLMDYAFQYIQRNG 70

Query: 197 GITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDAS 256
           G+T ++ Y Y       C+  K   H   I  YEDVP N+E++L KAVA+QPV+VAI+AS
Sbjct: 71  GVTTESNYPYLAEQRS-CNKAKERSHDVTIDGYEDVPANNEDALQKAVASQPVAVAIEAS 129

Query: 257 A--LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRD 314
               QFYS GVF G C T L+HGV AVGYGT+ +G KYW +KNSWG+DWGE GY R+QR 
Sbjct: 130 GQDFQFYSEGVFTGSCGTDLDHGVAAVGYGTTGDGTKYWTVKNSWGEDWGERGYIRMQRG 189

Query: 315 IDQPQGQCGIAMFASFPVSKESAQ 338
           +   +G CGIAM  S+P  K +  
Sbjct: 190 VPDSRGLCGIAMEPSYPTKKPAGH 213


>gi|12803615|gb|AAH02642.1| Cathepsin S [Homo sapiens]
 gi|49456313|emb|CAG46477.1| CTSS [Homo sapiens]
 gi|60821573|gb|AAX36579.1| cathepsin S [synthetic construct]
 gi|189069420|dbj|BAG37086.1| unnamed protein product [Homo sapiens]
 gi|261858586|dbj|BAI45815.1| cathepsin S [synthetic construct]
          Length = 331

 Score =  212 bits (539), Expect = 2e-52,   Method: Compositional matrix adjust.
 Identities = 135/342 (39%), Positives = 188/342 (54%), Gaps = 32/342 (9%)

Query: 6   LIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAV 65
           L+ VL++  S  +Q       + ++   +  WK  YG+ YKE  E + R  I++ NL  V
Sbjct: 4   LVCVLLVCSSAVAQ----LHKDPTLDHHWHLWKKTYGKQYKEKNEEAVRRLIWEKNLKFV 59

Query: 66  ERFN-NAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQ-- 122
              N   ++G  SY L +N   D+T +E ++  +  ++    S  + N T   YKS+   
Sbjct: 60  MLHNLEHSMGMHSYDLGMNHLGDMTSEEVMSLMSSLRVP---SQWQRNIT---YKSNPNW 113

Query: 123 -VPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATN 174
            +P SV+W EKG VT VKYQG C       AV A+E    +K  +LVSLS Q LVDC+T 
Sbjct: 114 ILPDSVDWREKGCVTEVKYQGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTE 173

Query: 175 D-NNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVP 233
              N GC GGFM  AF+YII NKGI +DA Y Y+ M         ++  AA  + Y ++P
Sbjct: 174 KYGNKGCNGGFMTTAFQYIIDNKGIDSDASYPYKAMDQKC--QYDSKYRAATCSKYTELP 231

Query: 234 PNDEESLLKAVANQ-PVSVAIDASALQFY---SGGVFNGYCETFLNHGVTAVGYGTSEEG 289
              E+ L +AVAN+ PVSV +DA    F+   SG  +   C   +NHGV  VGYG    G
Sbjct: 232 YGREDVLKEAVANKGPVSVGVDARHPSFFLYRSGVYYEPSCTQNVNHGVLVVGYG-DLNG 290

Query: 290 IKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFP 331
            +YWL+KNSWG ++GE+GY R+ R+       CGIA F S+P
Sbjct: 291 KEYWLVKNSWGHNFGEEGYIRMARN---KGNHCGIASFPSYP 329


>gi|410990008|ref|XP_004001242.1| PREDICTED: cathepsin L1 isoform 1 [Felis catus]
          Length = 333

 Score =  212 bits (539), Expect = 3e-52,   Method: Compositional matrix adjust.
 Identities = 127/339 (37%), Positives = 186/339 (54%), Gaps = 25/339 (7%)

Query: 9   VLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERF 68
           +L ++G C   A+       S+  ++ QWKA +G+ Y  + E  +R  +++ N+  +E+ 
Sbjct: 4   LLFLAGLCLGVASAAPQLYQSLDARWSQWKATHGKLYGMNDEVWRR-AVWERNMKMIEQH 62

Query: 69  NNA-AIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSV 127
           N   + G  ++T+ +N F D+T +EF     G K+       K    PF     ++P SV
Sbjct: 63  NREHSQGKHTFTMAMNAFGDMTNEEFRQVMNGLKIQKRKK-WKVFQAPFFV---EIPSSV 118

Query: 128 NWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGC 180
           +W EKG VTPVK QG C       A  A+EG    K  +LVSLSEQ LVDC+  + N G 
Sbjct: 119 DWREKGYVTPVKDQGYCLCCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSQTEGNEGY 178

Query: 181 YGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESL 240
            GG +DDAF+Y+  N G+ ++  Y Y   + G     + E+  A +T+Y D+P  + E +
Sbjct: 179 SGGLIDDAFQYVKDNGGLDSEESYPYH--AQGDSCKYRPENSVANVTDYWDIPSKENELM 236

Query: 241 LKAVANQPVSVAIDAS--ALQFYSGGV-FNGYCETF-LNHGVTAVGY---GTSEEGIKYW 293
           +   A  P+S AIDAS    +FY  G+ ++  C +  ++HGV  VGY   GT  E  KYW
Sbjct: 237 ITLAAVGPISAAIDASLDTFRFYKEGIYYDPSCSSEDVDHGVLVVGYGADGTETENKKYW 296

Query: 294 LIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
           +IKNSWG DWG DGY ++ +D D     CGIA  ASFP 
Sbjct: 297 IIKNSWGTDWGMDGYIKMAKDRDN---HCGIASLASFPT 332


>gi|395535909|ref|XP_003769963.1| PREDICTED: cathepsin S [Sarcophilus harrisii]
          Length = 347

 Score =  211 bits (538), Expect = 3e-52,   Method: Compositional matrix adjust.
 Identities = 134/340 (39%), Positives = 191/340 (56%), Gaps = 24/340 (7%)

Query: 6   LIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAV 65
           + VV+ +  +CAS   Y   D   +   +E WK  YG+ Y+E  +   R  I++ NL  V
Sbjct: 16  MKVVIWMFLACASTTAYLRHDP-MLDNHWELWKKTYGKQYEEQNQEVTRRLIWEKNLKFV 74

Query: 66  ERFN-NAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVP 124
              N   ++G  SY L +N  +D+T +E  +  +  ++ +  S    N T  L  + ++P
Sbjct: 75  TLHNLEHSMGLHSYDLSMNHLSDMTSEEVASLMSSLRIPNQWSR---NTTYRLNSNQKLP 131

Query: 125 PSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDN- 176
            SV+W +KG VT VKYQG C       AV A+E    +K  +LVSLS Q LVDC+TN+  
Sbjct: 132 DSVDWRDKGCVTEVKYQGTCGSCWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTNEKY 191

Query: 177 -NNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPN 235
            N+GC GG M +AF+YII N GI +DA Y Y+    G C    A + AA  + Y ++P  
Sbjct: 192 ENHGCNGGCMTEAFQYIIDNNGIDSDASYPYKA-KDGKCQYNPA-NRAATCSRYTELPYG 249

Query: 236 DEESLLKAVANQ-PVSVAIDASALQFY---SGGVFNGYCETFLNHGVTAVGYGTSEEGIK 291
            E++L +AVAN+ PVSV IDAS   F+   SG  ++  C   +NHGV   GYG + +G  
Sbjct: 250 SEDALKEAVANKGPVSVGIDASLPSFFLYKSGVYYDPSCTQNVNHGVLVTGYG-NLDGKD 308

Query: 292 YWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFP 331
           YWL+KNSWG  +G+ GY R+ R+       CGIA F S+P
Sbjct: 309 YWLVKNSWGLSFGDKGYIRIARNRGN---HCGIANFPSYP 345


>gi|414591039|tpg|DAA41610.1| TPA: hypothetical protein ZEAMMB73_356414 [Zea mays]
          Length = 376

 Score =  211 bits (538), Expect = 3e-52,   Method: Compositional matrix adjust.
 Identities = 127/330 (38%), Positives = 177/330 (53%), Gaps = 30/330 (9%)

Query: 27  EGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFA 86
           E S+   +E+W++ +  + ++  E   RFE FK N   +  FN     +  Y L LNKFA
Sbjct: 38  EESMWSLYERWRSVHTVS-RDLREKQSRFEAFKANARHIGEFNKRK--DVPYKLGLNKFA 94

Query: 87  DLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSV-----------NWIEKGAV 135
           DLT +EF++  TG K+ D  ++ +      +  S + PP +           +W + GAV
Sbjct: 95  DLTQEEFVSKYTGAKVVDSEAAARLASGVRVSSSDESPPQLAASVGDAPDAWDWRDHGAV 154

Query: 136 TPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDA 188
           T VK QGQC       AV AVE +NAI    L++LSEQQ++DC+   +    YGG+   A
Sbjct: 155 TAVKDQGQCGSCWAFSAVGAVESVNAIVTGNLLTLSEQQMLDCSGAGDCT--YGGYTYYA 212

Query: 189 FKYIIQNKGITNDAV------YSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLK 242
             Y I N G+T D          Y+      C     +    +I +   +   DE +L +
Sbjct: 213 MLYAISN-GLTLDQCGKTPYYQRYDAQQHLPCRFDAKKPPVVKIDSMYVMNNADEAALKR 271

Query: 243 AVANQPVSVAIDASALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQD 302
           AV  QPVSV IDA  + +YS GVF G C T LNH V  VGYG + +G KYW++KNSWG D
Sbjct: 272 AVYKQPVSVLIDAGGIGYYSEGVFTGPCGTSLNHAVLLVGYGATADGTKYWIVKNSWGAD 331

Query: 303 WGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
           WGE GYFRL+RD+    G CGI M+  +P+
Sbjct: 332 WGEKGYFRLKRDVGTQGGLCGITMYPIYPI 361


>gi|33112581|gb|AAP94046.1| cathepsin-L-like cysteine peptidase 02 [Tenebrio molitor]
          Length = 337

 Score =  211 bits (538), Expect = 3e-52,   Method: Compositional matrix adjust.
 Identities = 135/348 (38%), Positives = 194/348 (55%), Gaps = 31/348 (8%)

Query: 3   KYFLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNL 62
           K+ + + + ++GS A       FD   + E++  +K  + + Y+   E   R +IF +N 
Sbjct: 2   KFLIFLAICVAGSQA----VSFFD--LVQEQWGAFKMTHNKQYQSDTEERFRMKIFMENS 55

Query: 63  VAVERFNNA-AIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKA----NGTPFL 117
             V + N   A G  S+ L +NK+AD+   EF+    GF  +   S L++    +   FL
Sbjct: 56  HTVAKHNKLYAQGLVSFKLGINKYADMLHHEFVQVLNGFNRT--KSGLRSGESDDSVTFL 113

Query: 118 YKSS-QVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLV 169
             ++ Q+P  ++W +KGAVTPVK QGQC       A  ++EG +  K  +LVSLSEQ LV
Sbjct: 114 PPANVQLPGQIDWRDKGAVTPVKDQGQCGSCWSFSATGSLEGQHFRKSGKLVSLSEQNLV 173

Query: 170 DCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNY 229
           DC+    NNGC GG MD+AF+YI  N GI  +  Y Y+          K ++  A    Y
Sbjct: 174 DCSEKFGNNGCNGGLMDNAFRYIKANGGIDTEQAYPYKAEDEKC--HYKPKNKGATDRGY 231

Query: 230 EDVPPNDEESLLKAVAN-QPVSVAIDAS--ALQFYSGGV-FNGYCE-TFLNHGVTAVGYG 284
            D+   +E+ L  AVA   PVSVAIDAS  + Q YSGGV +   C  + L+HGV  VGYG
Sbjct: 232 VDIESGNEDKLQSAVATVGPVSVAIDASHQSFQLYSGGVYYEPECSPSQLDHGVLVVGYG 291

Query: 285 TSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
           T ++G  YWL+KNSWG+ WG+ GY ++ R+ D     CGIA  AS+P+
Sbjct: 292 TEDDGTDYWLVKNSWGKSWGDQGYIKMARNRDN---NCGIATEASYPL 336


>gi|225719768|gb|ACO15730.1| Cathepsin L1 precursor [Caligus clemensi]
          Length = 338

 Score =  211 bits (538), Expect = 3e-52,   Method: Compositional matrix adjust.
 Identities = 133/348 (38%), Positives = 188/348 (54%), Gaps = 30/348 (8%)

Query: 2   AKYFLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDN 61
           A Y  ++VL +S  CA+      FD   + + +  WK  + + Y  S E  +R  +++ N
Sbjct: 3   ALYLAVLVLCVSAVCAAP----RFD-SQLEDHWHLWKNWHSKNYHASEEGWRRM-VWEKN 56

Query: 62  LVAVERFN-NAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKS 120
           L  +E  N    +G  S+ L +N F D+T +EF  +  G+K +   +  K  G+ F+  +
Sbjct: 57  LKKIEIHNLEHTMGKHSHRLGMNHFGDMTNEEFRQTMNGYKQT---TERKFKGSLFMEPN 113

Query: 121 S-QVPPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCA 172
             Q P +V+W EKG VTPVK QG C          A+EG    K  +LVSLSEQ LVDC+
Sbjct: 114 YLQAPKAVDWREKGYVTPVKDQGSCGSCWAFSTTGAMEGQPFRKTGKLVSLSEQNLVDCS 173

Query: 173 TNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDV 232
             + N GC GG MD AF+YI  N G+  +  Y Y G     C   K E  AA  T + D+
Sbjct: 174 RPEGNEGCNGGLMDQAFQYIQDNAGLDTEESYPYVGTDEDPC-HYKPEFSAANETGFVDI 232

Query: 233 PPNDEESLLKAVAN-QPVSVAIDA--SALQFYSGGV-FNGYCET-FLNHGVTAVGYGTSE 287
           P   E +++KAVA   PVSVAIDA   + QFY  G+ +   C +  L+HGV  VGYG   
Sbjct: 233 PSGKEHAMMKAVAAVGPVSVAIDAGHESFQFYESGIYYEKECSSEELDHGVLVVGYGFEG 292

Query: 288 E---GIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
           E   G KYW++KNSW + WG+ GY  + +D    +  CGIA  +S+P+
Sbjct: 293 EDVDGKKYWIVKNSWSEKWGDKGYIYMAKD---RKNHCGIATASSYPL 337


>gi|55740406|gb|AAV63979.1| cathepsin L1 precursor [Artemia parthenogenetica]
          Length = 338

 Score =  211 bits (538), Expect = 3e-52,   Method: Compositional matrix adjust.
 Identities = 131/342 (38%), Positives = 190/342 (55%), Gaps = 26/342 (7%)

Query: 5   FLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVA 64
           FL+  +++  S A   T        +A+++  +KA + + Y    E   R +I+ +N   
Sbjct: 8   FLLAAVLVQLSAALSLT------NLLADEWHLFKATHKKEYPSQLEEKLRMKIYLENKHK 61

Query: 65  VERFNNA-AIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSS-Q 122
           V + N     G +SY + +NKF DL   EF +   G++    +SS   +   F+  ++ +
Sbjct: 62  VAKHNILYEKGEKSYQVAMNKFGDLLHHEFRSIMNGYQHKKQNSSRAESTFTFMEPANVE 121

Query: 123 VPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATND 175
           VP SV+W EKGA+TPVK QGQC       +  A+EG    K  +LVSLSEQ L+DC+   
Sbjct: 122 VPESVDWREKGAITPVKDQGQCGSCWAFSSTGALEGQTFRKTGKLVSLSEQNLIDCSGKY 181

Query: 176 NNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPN 235
            N GC GG MD AF+YI  NKGI  +  Y YE    G+C      +  A    + D+P  
Sbjct: 182 GNEGCNGGLMDQAFQYIKDNKGIDTENTYPYEA-EDGVC-RYNPRNRGAVDRGFVDIPSG 239

Query: 236 DEESLLKAVAN-QPVSVAIDAS--ALQFYSGGV-FNGYCET-FLNHGVTAVGYGTSEEGI 290
           +E+ L  AVA   PVSVAIDAS  + QFYS G  +   C++  L+HGV  VGYG S+ G 
Sbjct: 240 EEDKLKAAVATVGPVSVAIDASHESFQFYSKGXYYEPSCDSDDLDHGVLVVGYG-SDNGE 298

Query: 291 KYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
            YWL+KNSW + WG++GY ++ R+    +  CG+A  AS+P+
Sbjct: 299 DYWLVKNSWSEHWGDEGYIKIARN---RKNHCGVATAASYPL 337


>gi|297663703|ref|XP_002810310.1| PREDICTED: LOW QUALITY PROTEIN: cathepsin S [Pongo abelii]
          Length = 330

 Score =  211 bits (538), Expect = 4e-52,   Method: Compositional matrix adjust.
 Identities = 138/342 (40%), Positives = 188/342 (54%), Gaps = 33/342 (9%)

Query: 6   LIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAV 65
           L+ VL++  S  +Q       + ++   +  WK  YG+ YKE  E + R  I++ NL  V
Sbjct: 4   LVCVLLVCSSAVAQ----LHKDPTLDHHWHLWKKTYGKQYKEKNEEAVRRLIWEKNLKFV 59

Query: 66  ERFN-NAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQ-- 122
              N   ++G  SY L +N   D+T +E ++  +  ++    S  + N T   YKS+   
Sbjct: 60  MIHNLEHSMGMHSYDLGMNHLGDMTSEEVMSLMSSLRVP---SQWQRNIT---YKSNPNR 113

Query: 123 -VPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATN 174
            +P SV+W EKG VT VKYQG C       AV A+E    +K  +LVSLS Q LVDC+T 
Sbjct: 114 ILPDSVDWREKGCVTEVKYQGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTE 173

Query: 175 D-NNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVP 233
              N GC GGFM  AF+YII NKGI +DA Y Y+ M     DS   +  AA  + Y D  
Sbjct: 174 KYGNKGCNGGFMTTAFQYIIDNKGIDSDASYPYKAMVKCQYDS---KYRAATCSKYTDFX 230

Query: 234 PNDEESLLKAVANQ-PVSVAIDASALQFY---SGGVFNGYCETFLNHGVTAVGYGTSEEG 289
              E+ L +AVAN+ PVSV +DA    F+   SG  +   C   +NHGV  VGYG    G
Sbjct: 231 YGREDVLKEAVANKGPVSVGVDARHPSFFLYRSGVYYEPSCTQNVNHGVLVVGYG-DLNG 289

Query: 290 IKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFP 331
            +YWL+KNSWG+++GE+GY R+ R+       CGIA F SFP
Sbjct: 290 KEYWLVKNSWGRNFGEEGYIRMARN---KGNHCGIASFPSFP 328


>gi|33112583|gb|AAP94047.1| cathepsin-L-like cysteine peptidase 03 [Tenebrio molitor]
          Length = 337

 Score =  211 bits (537), Expect = 4e-52,   Method: Compositional matrix adjust.
 Identities = 135/348 (38%), Positives = 194/348 (55%), Gaps = 31/348 (8%)

Query: 3   KYFLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNL 62
           K+ + + + ++GS A       FD   + E++  +K  + + Y+   E   R +IF +N 
Sbjct: 2   KFLIFLAICVAGSQA----VSFFD--LVQEQWGAFKMTHNKQYQSDTEERFRMKIFMENS 55

Query: 63  VAVERFNNA-AIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKA----NGTPFL 117
             V + N   A G  S+ L +NK+AD+   EF+    GF  +   S L++    +   FL
Sbjct: 56  HTVAKHNKLYAQGLVSFKLGINKYADMLHHEFVQVLNGFNRT--KSGLRSGESDDSVTFL 113

Query: 118 YKSS-QVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLV 169
             ++ Q+P  ++W +KGAVTPVK QGQC       A  ++EG +  K  +LVSLSEQ LV
Sbjct: 114 PPANVQLPGQIDWRDKGAVTPVKDQGQCGSCWSFSATGSLEGQHFRKSGKLVSLSEQNLV 173

Query: 170 DCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNY 229
           DC+    NNGC GG MD+AF+YI  N GI  +  Y Y+          K ++  A    Y
Sbjct: 174 DCSEKFGNNGCNGGLMDNAFRYIKANGGIDTEQAYPYKAEDEKC--HYKPKNKGATDRGY 231

Query: 230 EDVPPNDEESLLKAVAN-QPVSVAIDAS--ALQFYSGGV-FNGYCE-TFLNHGVTAVGYG 284
            D+   +E+ L  AVA   PVSVAIDAS  + Q YSGGV +   C  + L+HGV  VGYG
Sbjct: 232 VDIESGNEDKLQSAVATVGPVSVAIDASHQSFQLYSGGVYYEPDCSASQLDHGVLVVGYG 291

Query: 285 TSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
           T ++G  YWL+KNSWG+ WG+ GY ++ R+ D     CGIA  AS+P+
Sbjct: 292 TEDDGTDYWLVKNSWGKSWGDQGYIKMARNRDN---NCGIATEASYPL 336


>gi|195484843|ref|XP_002090843.1| GE12574 [Drosophila yakuba]
 gi|194176944|gb|EDW90555.1| GE12574 [Drosophila yakuba]
          Length = 341

 Score =  211 bits (537), Expect = 4e-52,   Method: Compositional matrix adjust.
 Identities = 133/326 (40%), Positives = 183/326 (56%), Gaps = 33/326 (10%)

Query: 30  IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNA-AIGNRSYTLRLNKFADL 88
           + E++  +K ++ + Y++  E   R +IF +N   + + N   A G  S+ L +NK+ADL
Sbjct: 25  VMEEWHTFKLEHRKNYQDDTEERFRLKIFNENKHKIAKHNQRFAEGKVSFKLAVNKYADL 84

Query: 89  TPQEFIASQTGFKMSDHSSSLKAN-----GTPFLYKSS-QVPPSVNWIEKGAVTPVKYQG 142
              EF     GF  + H   L+A      G  F+  +   +P SV+W  KGAVT VK QG
Sbjct: 85  LHHEFRQLMNGFNYTLHKQ-LRATDDSFKGVTFISPAHVTLPKSVDWRSKGAVTAVKDQG 143

Query: 143 QC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQN 195
            C       +  A+EG +  K   LVSLSEQ LVDC+T   NNGC GG MD+AF+YI  N
Sbjct: 144 HCGSCWAFSSTGALEGQHFRKSGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKDN 203

Query: 196 KGITNDAVYSYEGMSTGICD----SIKAEDHAAQITNYEDVPPNDEESLLKAVAN-QPVS 250
            GI  +  Y YE +    C     +I A D       + D+P  DE+ + +AVA   PVS
Sbjct: 204 GGIDTEKSYPYEAIDDS-CHFNKGTIGATDRG-----FTDIPQGDEKKMAEAVATVGPVS 257

Query: 251 VAIDAS--ALQFYSGGVFN-GYCET-FLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGED 306
           VAIDAS  + QFYS GV+N   C+   L+HGV  VG+GT E G  YWL+KNSWG  WG+ 
Sbjct: 258 VAIDASHESFQFYSEGVYNEPQCDAQNLDHGVLVVGFGTDESGDDYWLVKNSWGTTWGDK 317

Query: 307 GYFRLQRDIDQPQGQCGIAMFASFPV 332
           G+ ++ R+ D    QCGIA  +S+P+
Sbjct: 318 GFIKMLRNKDN---QCGIASASSYPL 340


>gi|344271939|ref|XP_003407794.1| PREDICTED: cathepsin L1-like [Loxodonta africana]
          Length = 335

 Score =  211 bits (537), Expect = 4e-52,   Method: Compositional matrix adjust.
 Identities = 130/350 (37%), Positives = 188/350 (53%), Gaps = 34/350 (9%)

Query: 1   MAKYFLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKD 60
           M     +  L +  + A+    +T D      ++ QWK+ Y + Y  + E   R  +++ 
Sbjct: 1   MTPSVFLAALCLGIASAAPKLDQTLDV-----QWNQWKSTYKKVYAANEEGLTR-AVWEK 54

Query: 61  NLVAVERFNN-AAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYK 119
           N+  +ER N   + G   +T+ +N F D T +EF     GF+   H       G  F + 
Sbjct: 55  NMKMIERHNQEHSQGKHGFTMAMNAFGDKTNEEFRQLMNGFQSQKHKK-----GKLFHFH 109

Query: 120 S---SQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLV 169
                 +P SVNW ++G VTPVK QG C       A  A+EG    K  +LVSLSEQ LV
Sbjct: 110 EPVFGHIPTSVNWTQRGYVTPVKDQGSCHSCWAFSATGALEGQMFRKTGKLVSLSEQNLV 169

Query: 170 DCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNY 229
           DC+  ++NNGC GG MD AF+Y+  N G+ ++  Y Y    +  C   K E  AA  T +
Sbjct: 170 DCSRPESNNGCSGGLMDKAFQYVKNNGGLDSEESYPYTAKESRNC-LYKPEFSAANNTGF 228

Query: 230 EDVPPNDEESLLKAVAN-QPVSVAIDAS--ALQFYSGGV-FNGYCETFLNHGVTAVGY-- 283
            ++PP  E++L+ AVA+  P+SVA+DAS  + +FY  G+ F+  C   +NHGV  VGY  
Sbjct: 229 VNIPP-QEKALMNAVASVGPISVAVDASLKSFRFYKSGIYFDPACRLAVNHGVLVVGYGF 287

Query: 284 -GTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
            GT  +  KYWL+KNSWG+ WG DGY ++ +D +     CGIA  AS+P 
Sbjct: 288 EGTDPDKNKYWLVKNSWGKSWGADGYIKIAKDRNN---HCGIARAASYPT 334


>gi|383849553|ref|XP_003700409.1| PREDICTED: cathepsin L-like [Megachile rotundata]
          Length = 343

 Score =  211 bits (537), Expect = 4e-52,   Method: Compositional matrix adjust.
 Identities = 130/342 (38%), Positives = 191/342 (55%), Gaps = 24/342 (7%)

Query: 8   VVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVER 67
           ++L+I  +CA+      F+   + +++  +K ++ + YK  AE   R +I+  N + + +
Sbjct: 4   ILLLIVITCAAVQAISFFE--LVNQEWINFKMEHKKCYKHEAEERLRMKIYMKNKLQIAQ 61

Query: 68  FN-NAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKAN---GTPFLYKSS-Q 122
            N +  +   +Y L++NK+ D+   EF     G+  + + +        G  F+   + +
Sbjct: 62  HNCDYELKKVTYRLKINKYGDMLNHEFKNMLNGYNRTINHTLRNERLPVGAAFIEPCNVE 121

Query: 123 VPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATND 175
           +P  V+W + GAVT VK QG C       A  ++EG +  +   LVSLSEQ L+DC+ + 
Sbjct: 122 LPKMVDWRKCGAVTEVKDQGHCGSCWAFSATGSLEGQHFRRTGVLVSLSEQNLIDCSGSY 181

Query: 176 NNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPN 235
            NNGC GG MD AF YI  NKG+  +  Y YEG     C   K    A+ +  + D+P  
Sbjct: 182 GNNGCNGGLMDQAFSYIKDNKGLDTEKTYPYEGEDDK-CRYDKRSSGASDV-GFVDIPVG 239

Query: 236 DEESLLKAVAN-QPVSVAIDAS--ALQFYSGGV-FNGYCE-TFLNHGVTAVGYGTSEEGI 290
           DE+ L  AVA   PVSVAIDAS  + QFYS G+ F   C  T L+HGV  VGYGT EEG 
Sbjct: 240 DEQKLKAAVATVGPVSVAIDASHQSFQFYSDGIYFEPECSSTNLDHGVLVVGYGTDEEGR 299

Query: 291 KYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
            YW++KNSWG+ WGE GY ++ R+ID     CGIA  AS+P+
Sbjct: 300 DYWIVKNSWGESWGEKGYIKMARNIDN---HCGIASSASYPI 338


>gi|395514298|ref|XP_003761356.1| PREDICTED: cathepsin L1-like [Sarcophilus harrisii]
          Length = 365

 Score =  211 bits (537), Expect = 4e-52,   Method: Compositional matrix adjust.
 Identities = 142/377 (37%), Positives = 195/377 (51%), Gaps = 60/377 (15%)

Query: 1   MAKYFLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKD 60
           M  Y  +V L +    A     RT D      ++ QWKAQ+ R Y E+ +   R  I++ 
Sbjct: 1   MNYYLCLVSLCLGLVAAIPKLDRTLDA-----QWYQWKAQHRRDYGENED--WRRAIWEK 53

Query: 61  NLVAVERFN-NAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPF--- 116
           NL ++E  N   + G  S+ + +NKF D+T +EF     GF  S H    +  G  F   
Sbjct: 54  NLRSIEMHNLEYSAGKHSFQMEMNKFGDMTNEEFRQVMNGF--STHRVQRRTKGRLFREP 111

Query: 117 ---------------------------LYKSS---QVPPSVNWIEKGAVTPVKYQGQC-- 144
                                      L++     Q+P SV+W +KG VTPVK QGQC  
Sbjct: 112 LLVQIPKSVDWRDKGYVTPVKNQLVRRLFREPLLVQIPKSVDWRDKGYVTPVKNQGQCGS 171

Query: 145 -----AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGIT 199
                A  ++EG    K  +LVSLSEQ LVDC+T   N+GC GG MD+AF+Y+ +N GI 
Sbjct: 172 CWAFSATGSLEGQWFRKTGKLVSLSEQNLVDCSTAQGNSGCQGGLMDNAFEYVKENGGID 231

Query: 200 NDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVAN-QPVSVAIDA--S 256
            +  Y Y  ++       K +   A IT Y D+P   E++L KAVA   P+SVAIDA  S
Sbjct: 232 TEESYPY--IAADDTCQYKPQYSGANITGYVDIPSRMEKALEKAVATVGPISVAIDAGHS 289

Query: 257 ALQFYSGGV-FNGYCETF-LNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRD 314
           + QFY  GV +   C +  L+HGV AVGYG   +  KYW++KNSWG++WG+ GY  + RD
Sbjct: 290 SFQFYRSGVYYEPECSSEDLDHGVLAVGYGVQGKNGKYWIVKNSWGEEWGDSGYILMARD 349

Query: 315 IDQPQGQCGIAMFASFP 331
            +     CGIA  AS+P
Sbjct: 350 RNN---HCGIATAASYP 363


>gi|380236892|emb|CBK52289.1| cathepsin S protein [Dicentrarchus labrax]
          Length = 337

 Score =  211 bits (537), Expect = 4e-52,   Method: Compositional matrix adjust.
 Identities = 124/319 (38%), Positives = 183/319 (57%), Gaps = 22/319 (6%)

Query: 27  EGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFN-NAAIGNRSYTLRLNKF 85
           E  +   ++ WK  +G+ Y+   E+  R E+++ NL+ +   N  A++G  +Y L +N  
Sbjct: 27  EPKLDAHWKLWKMTHGKKYQTEVEDVSRRELWEKNLMLITMHNLEASMGLHTYELSMNHM 86

Query: 86  ADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKS-SQVPPSVNWIEKGAVTPVKYQGQC 144
            DLT +E + S   F      + ++   +PF   + + VP +++W EKG VT VK QG C
Sbjct: 87  GDLTQEEIMQS---FATLSPPTDIQRAASPFAGTTGADVPDTMDWREKGCVTSVKMQGSC 143

Query: 145 -------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKG 197
                  A  A+EG  A    +LV LS Q LVDC+T   N+GC GG M  AF+Y+I N+G
Sbjct: 144 GSCWAFSAAGALEGQLAKTTGKLVDLSPQNLVDCSTKYGNHGCNGGLMHHAFQYVIDNQG 203

Query: 198 ITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVAN-QPVSVAIDAS 256
           I +DA Y Y G   G C    ++  AA  + Y  +P  +E +L +A+AN  P+SVAIDA+
Sbjct: 204 IDSDASYPYTG-RNGEC-RYNSKFRAANCSQYSFLPEGNEGALKEALANIGPISVAIDAT 261

Query: 257 --ALQFYSGGVFNG-YCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQR 313
                FY  GV+N   C   +NHGV AVGYGT  +G  YWL+KNSWG+ +G+ GY R+ R
Sbjct: 262 RPTFTFYRSGVYNDPNCSQKVNHGVLAVGYGTL-DGQDYWLVKNSWGKTFGDQGYIRMSR 320

Query: 314 DIDQPQGQCGIAMFASFPV 332
           + +    QCGIA++  +P+
Sbjct: 321 NKND---QCGIALYGCYPI 336


>gi|255522980|gb|ACU12382.1| RE21773p [Drosophila melanogaster]
          Length = 375

 Score =  211 bits (537), Expect = 4e-52,   Method: Compositional matrix adjust.
 Identities = 131/323 (40%), Positives = 181/323 (56%), Gaps = 27/323 (8%)

Query: 30  IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNA-AIGNRSYTLRLNKFADL 88
           + E++  +K ++ + Y++  E   R +IF +N   + + N   A G  S+ L +NK+ADL
Sbjct: 59  VMEEWHTFKLEHRKNYQDETEERFRLKIFNENKHKIAKHNQRFAEGKVSFKLAVNKYADL 118

Query: 89  TPQEFIASQTGFKMSDHSSSLKAN----GTPFLYKSS-QVPPSVNWIEKGAVTPVKYQGQ 143
              EF     GF  + H     A+    G  F+  +   +P SV+W  KGAVT VK QG 
Sbjct: 119 LHHEFRQLMNGFNYTLHKQLRAADESFKGVTFISPAHVTLPKSVDWRTKGAVTAVKDQGH 178

Query: 144 C-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNK 196
           C       +  A+EG +  K   LVSLSEQ LVDC+T   NNGC GG MD+AF+YI  N 
Sbjct: 179 CGSCWAFSSTGALEGQHFRKSGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKDNG 238

Query: 197 GITNDAVYSYEGMSTGICDSIKAEDHAAQITN--YEDVPPNDEESLLKAVAN-QPVSVAI 253
           GI  +  Y YE     I DS          T+  + D+P  DE+ + +AVA   PVSVAI
Sbjct: 239 GIDTEKSYPYE----AIDDSCHFNKGTVGATDRGFTDIPQGDEKKMAEAVATVGPVSVAI 294

Query: 254 DAS--ALQFYSGGVFN-GYCET-FLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYF 309
           DAS  + QFYS GV+N   C+   L+HGV  VG+GT E G  YWL+KNSWG  WG+ G+ 
Sbjct: 295 DASHESFQFYSEGVYNEPQCDAQNLDHGVLVVGFGTDESGEDYWLVKNSWGTTWGDKGFI 354

Query: 310 RLQRDIDQPQGQCGIAMFASFPV 332
           ++ R+    + QCGIA  +S+P+
Sbjct: 355 KMLRN---KENQCGIASASSYPL 374


>gi|33333698|gb|AAQ11967.1| putative gut cathepsin L-like cysteine protease [Callosobruchus
           maculatus]
          Length = 326

 Score =  211 bits (537), Expect = 4e-52,   Method: Compositional matrix adjust.
 Identities = 128/330 (38%), Positives = 186/330 (56%), Gaps = 49/330 (14%)

Query: 29  SIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNA-AIGNRSYTLRLNKFAD 87
           S+ E+++Q+K  +G+TY+   E  +RF +F+ NL+ ++  N     G  S+  ++ +FAD
Sbjct: 18  SVYEEWQQFKLDHGKTYRSVVEEKRRFSVFQKNLIDIQEHNKKYERGEVSFAKKVTQFAD 77

Query: 88  LTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSS-----------QVPPSVNWIEKGAVT 136
           +T +EF+              LK  G P L  ++           +   +V+W E+GAVT
Sbjct: 78  MTHEEFL------------DLLKLQGVPALPSNAVHFDNFEDTDMEEKDAVDWREEGAVT 125

Query: 137 PVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATND-NNNGCYGGFMDDA 188
           PVK Q  C       AV A+EG    K   LVSLS Q+LVDCAT +  NNGC GG M  A
Sbjct: 126 PVKDQANCGSCWAFSAVGAIEGQFFKKNGTLVSLSAQELVDCATEEYGNNGCRGGLMGQA 185

Query: 189 FKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQ- 247
           F ++ Q++GI  +  Y YEG  +  C   K+ D+  ++  Y  V P DE+ + + VA + 
Sbjct: 186 FDFV-QDEGIQTEESYPYEGRRSS-CK--KSGDYVTKVKTY--VFPLDEQEMARTVAAKG 239

Query: 248 PVSVAIDASALQFYSGGVFNGYCETF-----LNHGVTAVGYGTSEEGIKYWLIKNSWGQD 302
           PV+VAI+AS L FY  G+ +  C        LNHGV  VGYG SE G+ YW++KNSWG D
Sbjct: 240 PVAVAIEASQLSFYDKGIVDEKCRCSNKREDLNHGVLVVGYG-SENGVDYWIVKNSWGAD 298

Query: 303 WGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
           WGE GYFRL++D+      CGI  + ++P+
Sbjct: 299 WGEKGYFRLKKDVK----ACGIGYYNTYPI 324


>gi|226821425|gb|ACO82388.1| cathepsin S [Lutjanus argentimaculatus]
          Length = 337

 Score =  211 bits (537), Expect = 4e-52,   Method: Compositional matrix adjust.
 Identities = 126/319 (39%), Positives = 180/319 (56%), Gaps = 22/319 (6%)

Query: 27  EGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFN-NAAIGNRSYTLRLNKF 85
           E  +   ++ WK  + + Y+   E   R  +++ NL+ +   N  A++G  +Y L +N  
Sbjct: 27  ESRLDAHWDLWKKTHEKKYQNEVEEFSRRRLWEKNLMLITMHNLEASMGLHTYELGMNHM 86

Query: 86  ADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKS-SQVPPSVNWIEKGAVTPVKYQGQC 144
            D+TP+E   S   F      + ++   +PF   S + +P +++W EKG VT VK QG C
Sbjct: 87  GDMTPEEIWQS---FATLTPPTDIQRAPSPFAGSSGADIPDTMDWREKGCVTSVKTQGSC 143

Query: 145 -------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKG 197
                  AV A+EG  A K  +LV LS Q LVDC+T   N+GC GGFMD AF+Y+I N+G
Sbjct: 144 GSCWAFSAVGALEGQLAKKTGKLVDLSPQNLVDCSTKYGNHGCNGGFMDHAFQYVIDNQG 203

Query: 198 ITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVAN-QPVSVAIDAS 256
           I +DA Y Y G S            AA  ++Y  +P  DE +L +A+A   P+SVAIDA+
Sbjct: 204 IDSDASYPYTGRSDQC--HYNPSYRAANCSSYNFLPEGDEGALKQALATIGPISVAIDAT 261

Query: 257 ALQ--FYSGGVFNG-YCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQR 313
             +  FY  GV+N   C   +NHGV AVGYGT   G  YWL+KNSWG  +G+ GY R+ R
Sbjct: 262 RPRFIFYRSGVYNDPSCSQEVNHGVLAVGYGTL-NGQDYWLVKNSWGTKFGDQGYIRMAR 320

Query: 314 DIDQPQGQCGIAMFASFPV 332
           + +    QCGIAM+  +P+
Sbjct: 321 NQND---QCGIAMYGCYPI 336


>gi|33333708|gb|AAQ11972.1| putative gut cathepsin L-like cysteine protease [Callosobruchus
           maculatus]
          Length = 326

 Score =  211 bits (537), Expect = 4e-52,   Method: Compositional matrix adjust.
 Identities = 128/330 (38%), Positives = 186/330 (56%), Gaps = 49/330 (14%)

Query: 29  SIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNA-AIGNRSYTLRLNKFAD 87
           S+ E+++Q+K  +G+TY+   E  +RF +F+ NL+ ++  N     G  S+  ++ +FAD
Sbjct: 18  SVYEEWQQFKLDHGKTYRSLVEEKRRFSVFQKNLIDIQEHNKKYERGEVSFAKKVTQFAD 77

Query: 88  LTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSS-----------QVPPSVNWIEKGAVT 136
           +T +EF+              LK  G P L  ++           +   +V+W E+GAVT
Sbjct: 78  MTHEEFL------------DLLKLQGVPALPSNAVHFDNFEDTDMEEKDAVDWREEGAVT 125

Query: 137 PVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATND-NNNGCYGGFMDDA 188
           PVK Q  C       AV A+EG    K   LVSLS Q+LVDCAT +  NNGC GG M  A
Sbjct: 126 PVKDQANCGSCWAFSAVGAIEGQFFKKNGTLVSLSAQELVDCATEEYGNNGCRGGLMGQA 185

Query: 189 FKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQ- 247
           F ++ Q++GI  +  Y YEG  +  C   K+ D+  ++  Y  V P DE+ + + VA + 
Sbjct: 186 FDFV-QDEGIQTEESYPYEGRRSS-CK--KSGDYVTKVKTY--VFPLDEQEMARTVAAKG 239

Query: 248 PVSVAIDASALQFYSGGVFNGYCETF-----LNHGVTAVGYGTSEEGIKYWLIKNSWGQD 302
           PV+VAI+AS L FY  G+ +  C        LNHGV  VGYG SE G+ YW++KNSWG D
Sbjct: 240 PVAVAIEASQLSFYDKGIVDETCRCSNKREDLNHGVLVVGYG-SENGVDYWIVKNSWGAD 298

Query: 303 WGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
           WGE GYFRL++D+      CGI  + ++P+
Sbjct: 299 WGEKGYFRLKKDVK----ACGIGYYNTYPI 324


>gi|52546918|gb|AAU81592.1| cysteine proteinase, partial [Petunia x hybrida]
          Length = 196

 Score =  211 bits (537), Expect = 4e-52,   Method: Compositional matrix adjust.
 Identities = 107/185 (57%), Positives = 126/185 (68%), Gaps = 4/185 (2%)

Query: 158 NRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSI 217
           N+LVSLSEQ+LVDC  N  N GC GG MD AF +I +  GIT +  Y Y   + G CD  
Sbjct: 3   NKLVSLSEQELVDC-DNGENQGCNGGLMDLAFDFIKKKGGITTEENYPYMA-ADGKCDLK 60

Query: 218 KAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA--LQFYSGGVFNGYCETFLN 275
           K       I  +EDVPPNDEESLLKAVANQPVSVAI+AS    QFYS GVF G C T L+
Sbjct: 61  KRNTPVVSIDGHEDVPPNDEESLLKAVANQPVSVAIEASGSDFQFYSEGVFTGDCGTELD 120

Query: 276 HGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVSKE 335
           HGV  VGYGT+ +G KYW ++NSWG +WGE GY R+QRDID  +G CGIAM  S+P+   
Sbjct: 121 HGVAIVGYGTTLDGTKYWTVRNSWGPEWGEKGYIRMQRDIDAEEGLCGIAMQPSYPIKTS 180

Query: 336 SAQPS 340
           S  P+
Sbjct: 181 SDNPT 185


>gi|388890776|gb|AFK80364.1| cysteine proteinase 3, partial [Acanthamoeba castellanii]
          Length = 329

 Score =  211 bits (537), Expect = 4e-52,   Method: Compositional matrix adjust.
 Identities = 133/342 (38%), Positives = 188/342 (54%), Gaps = 30/342 (8%)

Query: 6   LIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAV 65
           + ++++++  C +     T D   +   F +W     ++Y    E   R+ ++++N   +
Sbjct: 4   ITILVLLAAICVASTLATTHDP--LTGVFAEWMRDNSKSYSNE-EFVFRWNVWRENQQLI 60

Query: 66  ERFNNAAIGNRSYTLRLNKFADLTPQEF--IASQTGFKMSDHSSSLKA-NGTPFLYKSSQ 122
           E  N +   N++  L +NKF DLT  EF  +     F  S H++   A    P    +  
Sbjct: 61  EEHNRS---NKTSFLAMNKFGDLTNAEFNKLFKGLAFDYSFHANKAAAEKAVP----APG 113

Query: 123 VPPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATND 175
           +    +W +KGAVT VK QGQC          + EG N +K  RL SLSEQ L+DC+ + 
Sbjct: 114 LSADFDWRQKGAVTHVKNQGQCGSCWSFSTTGSTEGANFLKTGRLTSLSEQNLIDCSGSY 173

Query: 176 NNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPN 235
            NNGC GG MD AF+YII NKGI  +A Y Y+  +   C    A +    +T+Y DV   
Sbjct: 174 GNNGCNGGLMDYAFEYIINNKGIDTEASYPYQ-TAQYTCQYNPA-NSGGSLTSYTDVSSG 231

Query: 236 DEESLLKAVANQPVSVAIDAS--ALQFYSGGVF--NGYCETFLNHGVTAVGYGTSEEGIK 291
           DE +LL AVA +P SVAIDAS  + QFYSGGV+  +    T L+HGV AVG+GT E+G  
Sbjct: 232 DENALLNAVATEPTSVAIDASHNSFQFYSGGVYYESACSSTQLDHGVLAVGWGT-EDGQD 290

Query: 292 YWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVS 333
           YWL+KNSWG DWG  GY ++ R+       CGIA  AS+P +
Sbjct: 291 YWLVKNSWGADWGLAGYIKMARN---RSNNCGIATSASYPTA 329


>gi|342675481|gb|AEL31666.1| cathepsin L [Cynoglossus semilaevis]
          Length = 336

 Score =  211 bits (537), Expect = 4e-52,   Method: Compositional matrix adjust.
 Identities = 127/323 (39%), Positives = 184/323 (56%), Gaps = 25/323 (7%)

Query: 27  EGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFN-NAAIGNRSYTLRLNKF 85
           +  +++ +E WK  + + Y E  E  +R  I++ NL  +E  N   ++G  SY L +N F
Sbjct: 21  DARLSDHWELWKNWHSKKYHEKEEGWRRM-IWEKNLNKIELHNLEHSMGKHSYRLGMNHF 79

Query: 86  ADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPS-VNWIEKGAVTPVKYQGQC 144
            D+T +EF     G++     +  KA G+ F+  +  V PS V+W EKG VTPVK QGQC
Sbjct: 80  GDMTHEEFRQIMNGYQ---RKTERKAIGSLFMEPNFMVAPSAVDWREKGYVTPVKDQGQC 136

Query: 145 A-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKG 197
                     A+ZG N  K+ +LVSLSEQ LVDC+  + N GC GG MD AF+Y+  N+G
Sbjct: 137 GSCWAFSTTGALZGQNFRKMGKLVSLSEQNLVDCSRPEGNEGCGGGLMDQAFQYVKDNQG 196

Query: 198 ITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVAN-QPVSVAIDA- 255
           + ++  Y Y G     C     + ++   T + D+P   E +L+KAVA+  PVSVAIDA 
Sbjct: 197 LDSEDSYPYLGTDDQPC-HYDPKYNSVNDTGFVDIPSGKEHALMKAVASVGPVSVAIDAG 255

Query: 256 -SALQFYSGGV-FNGYCET-FLNHGVTAVGYGTSEE---GIKYWLIKNSWGQDWGEDGYF 309
             + QFY  G+ +   C +  L+HGV AVGYG   E   G KYW++KNSW + WG+ GY 
Sbjct: 256 HESFQFYQSGIYYEKECSSEELDHGVLAVGYGFEGEDVDGKKYWIVKNSWSEKWGDKGYI 315

Query: 310 RLQRDIDQPQGQCGIAMFASFPV 332
            + +D    +  CGIA  AS+P+
Sbjct: 316 YMAKD---RKNHCGIATAASYPL 335


>gi|449683741|ref|XP_002155462.2| PREDICTED: cathepsin L-like [Hydra magnipapillata]
          Length = 324

 Score =  211 bits (537), Expect = 4e-52,   Method: Compositional matrix adjust.
 Identities = 133/310 (42%), Positives = 175/310 (56%), Gaps = 28/310 (9%)

Query: 36  QWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIA 95
           +WK  + + Y    E + R+ I+KDN   +   N   +    + L +N+F D+T  EF  
Sbjct: 29  RWKMAHNKAYSHDGEETVRYTIWKDNERRIREHN---LQGGDFLLEMNQFGDMTNNEF-K 84

Query: 96  SQTGFKMSDHSSSLKANGTPFLYKSSQVPP-SVNWIEKGAVTPVKYQGQCA-------VA 147
              G+    H S     G+ FL  +S V P SV+W  +G VTPVK QGQC          
Sbjct: 85  DFNGYLSHKHVS-----GSTFLTPNSFVAPDSVDWRNEGYVTPVKDQGQCGSCWAFSTTG 139

Query: 148 AVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYE 207
           ++EG N  K  +LVSLSEQ LVDC+T   NNGC GG MD+AF YI +N GI ++A Y Y 
Sbjct: 140 SLEGQNFKKTGKLVSLSEQNLVDCSTAYGNNGCNGGLMDNAFTYIKENNGIDSEASYPYT 199

Query: 208 GMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVAN-QPVSVAIDAS--ALQFYSGG 264
               G C   K  + AA  T + D+P  DE  L +AVA+  P+SVAIDAS  + QFY  G
Sbjct: 200 AKD-GKCAFTKP-NVAATDTGFVDIPSGDENKLKEAVASVGPISVAIDASHFSFQFYRKG 257

Query: 265 VFNGY--CETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQC 322
           V+N      T L+HGV  VGYGT E G  YWL+KNSW   WG+ GY ++ R+    + QC
Sbjct: 258 VYNERKCSSTELDHGVLVVGYGT-ESGKDYWLVKNSWNTSWGDKGYIKMSRN---AKNQC 313

Query: 323 GIAMFASFPV 332
           GIA  AS+P+
Sbjct: 314 GIATNASYPL 323


>gi|24653514|ref|NP_523735.2| cysteine proteinase-1, isoform C [Drosophila melanogaster]
 gi|118572624|sp|Q95029.2|CATL_DROME RecName: Full=Cathepsin L; AltName: Full=Cysteine proteinase 1;
           Contains: RecName: Full=Cathepsin L heavy chain;
           Contains: RecName: Full=Cathepsin L light chain; Flags:
           Precursor
 gi|21627209|gb|AAM68565.1| cysteine proteinase-1, isoform C [Drosophila melanogaster]
          Length = 371

 Score =  211 bits (537), Expect = 5e-52,   Method: Compositional matrix adjust.
 Identities = 131/323 (40%), Positives = 181/323 (56%), Gaps = 27/323 (8%)

Query: 30  IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNA-AIGNRSYTLRLNKFADL 88
           + E++  +K ++ + Y++  E   R +IF +N   + + N   A G  S+ L +NK+ADL
Sbjct: 55  VMEEWHTFKLEHRKNYQDETEERFRLKIFNENKHKIAKHNQRFAEGKVSFKLAVNKYADL 114

Query: 89  TPQEFIASQTGFKMSDHSSSLKAN----GTPFLYKSS-QVPPSVNWIEKGAVTPVKYQGQ 143
              EF     GF  + H     A+    G  F+  +   +P SV+W  KGAVT VK QG 
Sbjct: 115 LHHEFRQLMNGFNYTLHKQLRAADESFKGVTFISPAHVTLPKSVDWRTKGAVTAVKDQGH 174

Query: 144 C-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNK 196
           C       +  A+EG +  K   LVSLSEQ LVDC+T   NNGC GG MD+AF+YI  N 
Sbjct: 175 CGSCWAFSSTGALEGQHFRKSGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKDNG 234

Query: 197 GITNDAVYSYEGMSTGICDSIKAEDHAAQITN--YEDVPPNDEESLLKAVAN-QPVSVAI 253
           GI  +  Y YE     I DS          T+  + D+P  DE+ + +AVA   PVSVAI
Sbjct: 235 GIDTEKSYPYE----AIDDSCHFNKGTVGATDRGFTDIPQGDEKKMAEAVATVGPVSVAI 290

Query: 254 DAS--ALQFYSGGVFN-GYCET-FLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYF 309
           DAS  + QFYS GV+N   C+   L+HGV  VG+GT E G  YWL+KNSWG  WG+ G+ 
Sbjct: 291 DASHESFQFYSEGVYNEPQCDAQNLDHGVLVVGFGTDESGEDYWLVKNSWGTTWGDKGFI 350

Query: 310 RLQRDIDQPQGQCGIAMFASFPV 332
           ++ R+    + QCGIA  +S+P+
Sbjct: 351 KMLRN---KENQCGIASASSYPL 370


>gi|410990010|ref|XP_004001243.1| PREDICTED: cathepsin L1 isoform 2 [Felis catus]
          Length = 337

 Score =  211 bits (537), Expect = 5e-52,   Method: Compositional matrix adjust.
 Identities = 128/341 (37%), Positives = 185/341 (54%), Gaps = 25/341 (7%)

Query: 9   VLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERF 68
           +L ++G C   A+       S+  ++ QWKA +G+ Y  + E  +R  +++ N+  +E+ 
Sbjct: 4   LLFLAGLCLGVASAAPQLYQSLDARWSQWKATHGKLYGMNDEVWRR-AVWERNMKMIEQH 62

Query: 69  NNA-AIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSV 127
           N   + G  ++T+ +N F D+T +EF     G K+       K    PF     ++P SV
Sbjct: 63  NREHSQGKHTFTMAMNAFGDMTNEEFRQVMNGLKIQKRKK-WKVFQAPFFV---EIPSSV 118

Query: 128 NWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGC 180
           +W EKG VTPVK QG C       A  A+EG    K  +LVSLSEQ LVDC+  + N G 
Sbjct: 119 DWREKGYVTPVKDQGYCLCCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSQTEGNEGY 178

Query: 181 YGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIK--AEDHAAQITNYEDVPPNDEE 238
            GG +DDAF+Y+  N G+ ++  Y Y         S K   E+  A +T+Y D+P  + E
Sbjct: 179 SGGLIDDAFQYVKDNGGLDSEESYPYHAQVKRASYSCKYRPENSVANVTDYWDIPSKENE 238

Query: 239 SLLKAVANQPVSVAIDAS--ALQFYSGGV-FNGYCETF-LNHGVTAVGY---GTSEEGIK 291
            ++   A  P+S AIDAS    +FY  G+ ++  C +  ++HGV  VGY   GT  E  K
Sbjct: 239 LMITLAAVGPISAAIDASLDTFRFYKEGIYYDPSCSSEDVDHGVLVVGYGADGTETENKK 298

Query: 292 YWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
           YW+IKNSWG DWG DGY ++ +D D     CGIA  ASFP 
Sbjct: 299 YWIIKNSWGTDWGMDGYIKMAKDRDN---HCGIASLASFPT 336


>gi|410898132|ref|XP_003962552.1| PREDICTED: cathepsin L-like [Takifugu rubripes]
          Length = 335

 Score =  211 bits (537), Expect = 5e-52,   Method: Compositional matrix adjust.
 Identities = 133/315 (42%), Positives = 186/315 (59%), Gaps = 21/315 (6%)

Query: 33  KFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAA-IGNRSYTLRLNKFADLTPQ 91
           +F  WK ++GR+Y+  +E  +R +I+ +N   V   N  A  G +SY L + +FAD+  +
Sbjct: 26  EFHAWKLKFGRSYRTPSEEVQRMQIWLNNRKLVLVHNILADQGIKSYRLGMTQFADMDNE 85

Query: 92  EFIASQTGFKMSDHSSSLKANGTPF--LYKSSQVPPSVNWIEKGAVTPVKYQGQC----- 144
           E+ +  +   +   ++S    G+ F  L + + +P +V+W +KG VT VK Q QC     
Sbjct: 86  EYKSLISLGCLRAFNTSAPRRGSAFFRLAEGTHLPTTVDWRDKGYVTGVKDQKQCGSCWA 145

Query: 145 --AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDA 202
             A  ++EG N  K  +LVSLSEQQLVDC+ +  N GC GG MD AFKYI +N GI  + 
Sbjct: 146 FSATGSLEGQNFRKTGKLVSLSEQQLVDCSGDYGNMGCNGGLMDYAFKYIQENGGIDTEK 205

Query: 203 VYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVAN-QPVSVAIDA--SALQ 259
            Y YE    G C   K E+  A+ T Y DV   DE++L +AVA   PVSV IDA  S+ Q
Sbjct: 206 SYPYEA-EDGQC-RFKPENVGAKCTGYVDVTVGDEDALKEAVATIGPVSVGIDASHSSFQ 263

Query: 260 FYSGGVFNGY-CETF-LNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQ 317
            Y  GV++   C +  L+HGV AVGYGT + G  YWL+KNSWG  WG++GY  + R+ D 
Sbjct: 264 LYDSGVYDEQDCSSQDLDHGVLAVGYGT-DNGQDYWLVKNSWGLGWGQEGYIMMSRNKDN 322

Query: 318 PQGQCGIAMFASFPV 332
              QCGIA  AS+P+
Sbjct: 323 ---QCGIATAASYPL 334


>gi|195583187|ref|XP_002081405.1| GD10995 [Drosophila simulans]
 gi|194193414|gb|EDX06990.1| GD10995 [Drosophila simulans]
          Length = 341

 Score =  211 bits (536), Expect = 5e-52,   Method: Compositional matrix adjust.
 Identities = 131/325 (40%), Positives = 182/325 (56%), Gaps = 31/325 (9%)

Query: 30  IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNA-AIGNRSYTLRLNKFADL 88
           + E++  +K ++ + Y++  E   R +IF +N   + + N   A G  S+ L +NK+ADL
Sbjct: 25  VMEEWHTFKLEHRKNYQDDTEERFRLKIFNENKHKIAKHNQRFAEGKVSFKLAVNKYADL 84

Query: 89  TPQEFIASQTGFKMSDHSSSLKAN----GTPFLYKSS-QVPPSVNWIEKGAVTPVKYQGQ 143
              EF     GF  + H     A+    G  F+  +   +P SV+W  KGAVT VK QG 
Sbjct: 85  LHHEFRQLMNGFNYTLHKQLRAADESFKGVTFISPAHVTLPKSVDWRTKGAVTAVKDQGH 144

Query: 144 C-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNK 196
           C       +  A+EG +  K   LVSLSEQ LVDC+T   NNGC GG MD+AF+YI  N 
Sbjct: 145 CGSCWAFSSTGALEGQHFRKSGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKDNG 204

Query: 197 GITNDAVYSYEGMSTGICD----SIKAEDHAAQITNYEDVPPNDEESLLKAVAN-QPVSV 251
           GI  +  Y YE +    C     +I A D       + D+P  DE+ + +AVA   PVSV
Sbjct: 205 GIDTEKSYPYEAIDDS-CHFNKGTIGATDRG-----FTDIPQGDEKKMAEAVATVGPVSV 258

Query: 252 AIDAS--ALQFYSGGVFN-GYCET-FLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDG 307
           AIDAS  + QFYS GV+N   C+   L+HGV  VG+GT E G  YWL+KNSWG  WG+ G
Sbjct: 259 AIDASHESFQFYSEGVYNEPQCDAQNLDHGVLVVGFGTDESGDDYWLVKNSWGTTWGDKG 318

Query: 308 YFRLQRDIDQPQGQCGIAMFASFPV 332
           + ++ R+    + QCGIA  +S+P+
Sbjct: 319 FIKMLRN---KENQCGIASASSYPL 340


>gi|24653516|ref|NP_725347.1| cysteine proteinase-1, isoform A [Drosophila melanogaster]
 gi|24653518|ref|NP_725348.1| cysteine proteinase-1, isoform B [Drosophila melanogaster]
 gi|1658527|gb|AAB18345.1| cysteine proteinase 1 [Drosophila melanogaster]
 gi|2305221|gb|AAB65749.1| cysteine proteinase-1 [Drosophila melanogaster]
 gi|7303249|gb|AAF58311.1| cysteine proteinase-1, isoform A [Drosophila melanogaster]
 gi|21627210|gb|AAM68566.1| cysteine proteinase-1, isoform B [Drosophila melanogaster]
 gi|54650754|gb|AAV36956.1| LP06554p [Drosophila melanogaster]
 gi|220951982|gb|ACL88534.1| Cp1-PA [synthetic construct]
          Length = 341

 Score =  211 bits (536), Expect = 5e-52,   Method: Compositional matrix adjust.
 Identities = 131/323 (40%), Positives = 181/323 (56%), Gaps = 27/323 (8%)

Query: 30  IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNA-AIGNRSYTLRLNKFADL 88
           + E++  +K ++ + Y++  E   R +IF +N   + + N   A G  S+ L +NK+ADL
Sbjct: 25  VMEEWHTFKLEHRKNYQDETEERFRLKIFNENKHKIAKHNQRFAEGKVSFKLAVNKYADL 84

Query: 89  TPQEFIASQTGFKMSDHSSSLKAN----GTPFLYKSS-QVPPSVNWIEKGAVTPVKYQGQ 143
              EF     GF  + H     A+    G  F+  +   +P SV+W  KGAVT VK QG 
Sbjct: 85  LHHEFRQLMNGFNYTLHKQLRAADESFKGVTFISPAHVTLPKSVDWRTKGAVTAVKDQGH 144

Query: 144 C-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNK 196
           C       +  A+EG +  K   LVSLSEQ LVDC+T   NNGC GG MD+AF+YI  N 
Sbjct: 145 CGSCWAFSSTGALEGQHFRKSGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKDNG 204

Query: 197 GITNDAVYSYEGMSTGICDSIKAEDHAAQITN--YEDVPPNDEESLLKAVAN-QPVSVAI 253
           GI  +  Y YE     I DS          T+  + D+P  DE+ + +AVA   PVSVAI
Sbjct: 205 GIDTEKSYPYE----AIDDSCHFNKGTVGATDRGFTDIPQGDEKKMAEAVATVGPVSVAI 260

Query: 254 DAS--ALQFYSGGVFN-GYCET-FLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYF 309
           DAS  + QFYS GV+N   C+   L+HGV  VG+GT E G  YWL+KNSWG  WG+ G+ 
Sbjct: 261 DASHESFQFYSEGVYNEPQCDAQNLDHGVLVVGFGTDESGEDYWLVKNSWGTTWGDKGFI 320

Query: 310 RLQRDIDQPQGQCGIAMFASFPV 332
           ++ R+    + QCGIA  +S+P+
Sbjct: 321 KMLRN---KENQCGIASASSYPL 340


>gi|33333700|gb|AAQ11968.1| putative gut cathepsin L-like cysteine protease [Callosobruchus
           maculatus]
          Length = 326

 Score =  211 bits (536), Expect = 5e-52,   Method: Compositional matrix adjust.
 Identities = 128/330 (38%), Positives = 185/330 (56%), Gaps = 49/330 (14%)

Query: 29  SIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNA-AIGNRSYTLRLNKFAD 87
           S+ E+++Q+K  +G+TY+   E  +RF +F+ NLV ++  N     G  S+  ++ +FAD
Sbjct: 18  SVYEEWQQFKLDHGKTYRSLVEEKRRFSVFQKNLVDIQEHNKKYERGEESFAKKVTQFAD 77

Query: 88  LTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSS-----------QVPPSVNWIEKGAVT 136
           +T +EF+              LK  G P L  ++           +   +V+W E+GAVT
Sbjct: 78  MTHEEFL------------DLLKLQGVPALPSNAVHFDNSEDIDMEEKDAVDWREEGAVT 125

Query: 137 PVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATND-NNNGCYGGFMDDA 188
           P K Q  C       AV A+EG    K   LVSLS Q+LVDCAT D  NNGC GG M  A
Sbjct: 126 PAKDQANCGSCWAFSAVGAIEGQFFKKNGTLVSLSAQELVDCATEDYGNNGCKGGLMGQA 185

Query: 189 FKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQ- 247
           F ++ Q++GI  +  Y YEG  +  C   K+ ++  ++  Y  V P DE+ + + VA + 
Sbjct: 186 FDFV-QDEGIQTEESYPYEGRRSS-CK--KSGEYVTKVKTY--VFPLDEQEMARTVAAKG 239

Query: 248 PVSVAIDASALQFYSGGVFNGYCETF-----LNHGVTAVGYGTSEEGIKYWLIKNSWGQD 302
           PV+VAI+AS L FY  G+ +  C        LNHGV  VGYG SE G+ YW++KNSWG D
Sbjct: 240 PVAVAIEASQLSFYDKGIVDERCRCSNKREDLNHGVLVVGYG-SENGVDYWIVKNSWGAD 298

Query: 303 WGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
           WGE GYFRL++D+      CGI  + ++P+
Sbjct: 299 WGEKGYFRLKKDVK----ACGIGYYNTYPI 324


>gi|2765358|emb|CAA74241.1| cathepsin L [Litopenaeus vannamei]
          Length = 325

 Score =  211 bits (536), Expect = 5e-52,   Method: Compositional matrix adjust.
 Identities = 128/323 (39%), Positives = 180/323 (55%), Gaps = 34/323 (10%)

Query: 29  SIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVE----RFNNAAIGNRSYTLRLNK 84
           S+ ++++ +KA++GR Y    E   R  +F+ N   ++    RF N  +   ++TL++N+
Sbjct: 17  SLRQQWQNFKAEHGRRYASVQEERYRLSVFEQNQQFIDDHNARFENGEV---TFTLQMNQ 73

Query: 85  FADLTPQEFIASQTGF---KMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQ 141
           F D+T +E +A+  GF        ++ LKA+          +P  V+W  KGAVTPVK Q
Sbjct: 74  FGDMTSEEIVATMNGFLGAPTRRPAAVLKAD-------DETLPEKVDWRTKGAVTPVKDQ 126

Query: 142 GQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQ 194
            QC          ++EG + +K  +LVSLSEQ LVDC+    N GC GG MD AF+YI  
Sbjct: 127 KQCGSCWAFSTTGSLEGQHFLKDGKLVSLSEQNLVDCSDKFRNMGCMGGLMDQAFRYIKA 186

Query: 195 NKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVAN-QPVSVAI 253
           NKGI  +  Y YE    G C    A +  A  T Y DV    E +L KAVA   P+SV I
Sbjct: 187 NKGIDTEDSYPYEAQD-GKC-RFDASNVGATDTGYVDVEHGSESALKKAVATIGPISVGI 244

Query: 254 DA--SALQFYSGGVF-NGYC-ETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYF 309
           DA  S   FY  GV+ + +C  T L+HGV AVGYG+ E G  +WL+KNSW   WG+ GY 
Sbjct: 245 DASQSTFHFYHTGVYHDDHCSSTMLDHGVLAVGYGSDENGGDFWLVKNSWNTSWGDKGYI 304

Query: 310 RLQRDIDQPQGQCGIAMFASFPV 332
           ++ R+ +     CGIA  AS+P+
Sbjct: 305 KMSRNRNN---NCGIASQASYPL 324


>gi|354507493|ref|XP_003515790.1| PREDICTED: cathepsin L1-like [Cricetulus griseus]
 gi|344259154|gb|EGW15258.1| Cathepsin L1 [Cricetulus griseus]
          Length = 333

 Score =  211 bits (536), Expect = 5e-52,   Method: Compositional matrix adjust.
 Identities = 129/316 (40%), Positives = 179/316 (56%), Gaps = 27/316 (8%)

Query: 33  KFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNA-AIGNRSYTLRLNKFADLTPQ 91
           ++ +WK+ Y R Y  + E  +R  +++ N+  +E  N   + G   YT+ +N F D+T +
Sbjct: 28  QWHKWKSTYRRLYGTNEEEWRR-AVWEKNMKMIELHNGEYSEGKHGYTMEMNAFGDMTNE 86

Query: 92  EFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQC------- 144
           EF     G+K   H    K    P +    Q+P SV+W EKG VTPVK QGQC       
Sbjct: 87  EFRQLVNGYKHQKHRKG-KVFQEPLML---QLPKSVDWREKGCVTPVKNQGQCGSCWAFS 142

Query: 145 AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVY 204
           A  A+EG   +K   LVSLSEQ LVDC+  + N GC GG MD AF+Y++ NKG+ ++  Y
Sbjct: 143 ACGALEGQMCLKTGVLVSLSEQNLVDCSQAEGNQGCNGGLMDFAFQYVLNNKGLDSEESY 202

Query: 205 SYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVAN-QPVSVAIDAS--ALQFY 261
            YE    G C   K E  AA  T Y D+ P  E++L+KAVA   P+++AIDAS  + QFY
Sbjct: 203 PYEA-KDGTC-KYKPEFAAANDTGYVDI-PQLEKALMKAVATVGPIAIAIDASHPSFQFY 259

Query: 262 SGGV-FNGYCETF-LNHGVTAVGY---GTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDID 316
           S G+ +   C +  L+HGV  VGY   GT     KYW++KNSWG  WG  G+F + +D +
Sbjct: 260 SSGIYYEPNCSSKELDHGVLVVGYGFEGTDSNKKKYWIVKNSWGSSWGMGGFFHIAKDKN 319

Query: 317 QPQGQCGIAMFASFPV 332
                CG+A  AS+P 
Sbjct: 320 N---HCGVATAASYPT 332


>gi|288548566|gb|ADC52431.1| cathepsin L2 cysteine protease [Pinctada fucata]
          Length = 330

 Score =  211 bits (536), Expect = 5e-52,   Method: Compositional matrix adjust.
 Identities = 132/341 (38%), Positives = 192/341 (56%), Gaps = 28/341 (8%)

Query: 5   FLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVA 64
           F IV  +++ S A        +E +I      +K QY + Y+   E  +R  +++ NL  
Sbjct: 4   FAIVAALVAVSFARVPRVGLDNEWNI------FKKQYNKLYQNEEEARRRL-VWESNLDF 56

Query: 65  VERFNNAA-IGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQV 123
           +   N AA  G  ++ + +N++ D+T +EF  +  G++M + +S+      P       +
Sbjct: 57  ITLHNLAADRGEHTFWVGMNEYGDMTNEEFTKTMNGYRMRNKTSNAPVFMPP--NNMGDL 114

Query: 124 PPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDN 176
           P +V+W  KG VTP+K QGQC       A  ++EG    K  +LVSLSEQ LVDC+    
Sbjct: 115 PDTVDWRPKGYVTPIKNQGQCGSCWSFSATGSLEGQTFKKTGKLVSLSEQNLVDCSKKQG 174

Query: 177 NNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPND 236
           N+GC GG MDDAF YI  N GI  +A Y Y+    G C+  K+ D  A  T + D+   D
Sbjct: 175 NHGCEGGLMDDAFTYIKANNGIDTEASYPYKARD-GKCE-FKSADVGATDTGFVDIKTKD 232

Query: 237 EESLLKAVAN-QPVSVAIDASAL--QFYSGGVFNGY--CETFLNHGVTAVGYGTSEEGIK 291
           EE+L +AVA   P+SVAIDAS +  Q Y  GV++ +   +T L+HGV AVGYGT E+   
Sbjct: 233 EEALKQAVATVGPISVAIDASHMSFQLYRTGVYHDWFCSQTKLDHGVLAVGYGT-EDSKD 291

Query: 292 YWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
           YWL+KNSWG+ WG+ GY ++ R+    +  CGIA  AS+P 
Sbjct: 292 YWLVKNSWGESWGQKGYIQMSRN---RRNNCGIATSASYPT 329


>gi|33520126|gb|AAQ21040.1| cathepsin L precursor [Branchiostoma belcheri tsingtauense]
          Length = 327

 Score =  211 bits (536), Expect = 5e-52,   Method: Compositional matrix adjust.
 Identities = 136/341 (39%), Positives = 193/341 (56%), Gaps = 31/341 (9%)

Query: 6   LIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAV 65
           L++VL+    C + AT       +I  ++E +K  +G+ Y E  E++ R  IF +N   V
Sbjct: 3   LLIVLV----CVAVAT-------AIDNEWEAFKLLHGKQYNE-YEDTARHAIFLENCKIV 50

Query: 66  ERFNN-AAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPF-LYKSSQV 123
           ++ N  AA+G  ++ +R+NKF DLT +EF     G  +   + + +A G  F      +V
Sbjct: 51  KQHNEEAAMGKHTFFMRMNKFGDLTNEEFRMLVIGSGLMQSNRTQQAEGGVFESIPGLKV 110

Query: 124 PPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDN 176
             +V+W +KGAVT VK Q QC          ++EG + +K   LVSLSEQ LVDC+  + 
Sbjct: 111 NDTVDWRQKGAVTKVKNQEQCGSCWAFSTTGSLEGQHFLKSGTLVSLSEQNLVDCSRKEG 170

Query: 177 NNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPND 236
           N GC GG MD AFKYI  N GI  +  Y Y+G     C+  KA    A ++++ DV   D
Sbjct: 171 NKGCKGGLMDQAFKYIKTNGGIDTEECYPYKGRDERKCE-YKASCSGATLSSFVDVKTGD 229

Query: 237 EESLLKAVAN-QPVSVAIDAS--ALQFYSGGVFN-GYCET-FLNHGVTAVGYGTSEEGIK 291
           E++L +A A   P+SV IDAS  + Q Y  GV++   C +  L+HGV  VGYGT +    
Sbjct: 230 EDALKQASATIGPISVGIDASHPSFQLYDHGVYHEKRCSSKKLDHGVLVVGYGT-QSTKD 288

Query: 292 YWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
           YWL+KNSWG DWG +GY  + R+ D    QCGIA  AS+PV
Sbjct: 289 YWLVKNSWGADWGMEGYIMMSRNKDN---QCGIATQASYPV 326


>gi|179959|gb|AAA35655.1| cathepsin [Homo sapiens]
 gi|248406|gb|AAB22005.1| cathepsin S [Homo sapiens]
          Length = 331

 Score =  211 bits (536), Expect = 5e-52,   Method: Compositional matrix adjust.
 Identities = 134/342 (39%), Positives = 188/342 (54%), Gaps = 32/342 (9%)

Query: 6   LIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAV 65
           L+ VL++  S  +Q       + ++   +  WK  YG+ YKE  E + R  I++ NL  V
Sbjct: 4   LVCVLLVCSSAVAQ----LHKDPTLDHHWHLWKKTYGKQYKEKNEEAVRRLIWEKNLKFV 59

Query: 66  ERFN-NAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQ-- 122
              N   ++G  SY L +N   D+T +E ++  +  ++    S  + N T   YKS+   
Sbjct: 60  MLHNLEHSMGMHSYDLGMNHLGDMTSEEVMSLTSSLRVP---SQWQRNIT---YKSNPNR 113

Query: 123 -VPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATN 174
            +P SV+W EKG VT VKYQG C       AV A+E    +K  +LV+LS Q LVDC+T 
Sbjct: 114 ILPDSVDWREKGCVTEVKYQGSCGACWAFSAVGALEAQLKLKTGKLVTLSAQNLVDCSTE 173

Query: 175 D-NNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVP 233
              N GC GGFM  AF+YII NKGI +DA Y Y+ M         ++  AA  + Y ++P
Sbjct: 174 KYGNKGCNGGFMTTAFQYIIDNKGIDSDASYPYKAMDQKC--QYDSKYRAATCSKYTELP 231

Query: 234 PNDEESLLKAVANQ-PVSVAIDASALQFY---SGGVFNGYCETFLNHGVTAVGYGTSEEG 289
              E+ L +AVAN+ PVSV +DA    F+   SG  +   C   +NHGV  VGYG    G
Sbjct: 232 YGREDVLKEAVANKGPVSVGVDARHPSFFLYRSGVYYEPSCTQNVNHGVLVVGYG-DLNG 290

Query: 290 IKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFP 331
            +YWL+KNSWG ++GE+GY R+ R+       CGIA F S+P
Sbjct: 291 KEYWLVKNSWGHNFGEEGYIRMARN---KGNHCGIASFPSYP 329


>gi|33333694|gb|AAQ11965.1| putative gut cathepsin L-like cysteine protease [Callosobruchus
           maculatus]
          Length = 326

 Score =  211 bits (536), Expect = 6e-52,   Method: Compositional matrix adjust.
 Identities = 128/330 (38%), Positives = 186/330 (56%), Gaps = 49/330 (14%)

Query: 29  SIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNA-AIGNRSYTLRLNKFAD 87
           S+ E+++Q+K  +G+TY+   E  +RF +F+ NL+ ++  N     G  S+  ++ +FAD
Sbjct: 18  SVYEEWQQFKLDHGKTYRSVVEEKRRFSVFQKNLIDIQEHNKKYERGEVSFAKKVTQFAD 77

Query: 88  LTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSS-----------QVPPSVNWIEKGAVT 136
           +T +EF+              LK  G P L  ++           +   +V+W E+GAVT
Sbjct: 78  MTHEEFL------------DLLKLQGVPALPSNAVHFDNFEDTDMEEKDAVDWREEGAVT 125

Query: 137 PVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATND-NNNGCYGGFMDDA 188
           PVK Q  C       AV A+EG    K   LVSLS Q+LVDCAT +  NNGC GG M  A
Sbjct: 126 PVKDQANCGSCWAFSAVGAIEGQFFKKNGTLVSLSAQELVDCATEEYGNNGCRGGLMGQA 185

Query: 189 FKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQ- 247
           F ++ Q++GI  +  Y YEG  +  C   K+ D+  ++  Y  V P DE+ + + VA + 
Sbjct: 186 FDFV-QDEGIQTEESYPYEGRRSS-CK--KSGDYVTKVKTY--VFPLDEQEMARTVAAKG 239

Query: 248 PVSVAIDASALQFYSGGVFNGYCETF-----LNHGVTAVGYGTSEEGIKYWLIKNSWGQD 302
           PV+VAI+AS L FY  G+ +  C        LNHGV  VGYG SE G+ YW++KNSWG D
Sbjct: 240 PVAVAIEASQLSFYDKGIVDEKCRCSNKREDLNHGVLVVGYG-SENGVDYWIVKNSWGAD 298

Query: 303 WGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
           WGE GYFRL++D+      CGI  + ++P+
Sbjct: 299 WGEKGYFRLKKDVK----ACGIDYYNTYPI 324


>gi|395856029|ref|XP_003800445.1| PREDICTED: cathepsin S [Otolemur garnettii]
          Length = 331

 Score =  211 bits (536), Expect = 6e-52,   Method: Compositional matrix adjust.
 Identities = 133/342 (38%), Positives = 192/342 (56%), Gaps = 32/342 (9%)

Query: 6   LIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAV 65
           L+  L++  S  +Q       + ++   +  WK  YG+ Y E  E ++R  I++ NL  V
Sbjct: 4   LVWTLLVCCSAMAQ----LHRDPALDHHWHLWKKTYGKQYTEKNEETERRLIWEKNLKFV 59

Query: 66  ERFN-NAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSS--- 121
              N   ++G  SY L +N   D+T +E ++  T  K+   S   + N T   YKSS   
Sbjct: 60  MLHNLEHSMGMHSYDLGMNHLGDMTSEEVVSLMTCLKVPRQS---QRNVT---YKSSPNQ 113

Query: 122 QVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATN 174
           ++P S++W EKG VT VKYQG C       AV A+E    +   +LVSLS Q LVDC+T 
Sbjct: 114 KLPDSLDWREKGCVTEVKYQGSCGSCWAFSAVGALEAQLKLTTGKLVSLSAQNLVDCSTE 173

Query: 175 D-NNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVP 233
              N GC+GGFM +AF+YII N GI ++A Y Y+ M         +++ AA  + Y ++P
Sbjct: 174 KYRNEGCHGGFMTEAFQYIIDNNGIDSEASYPYKAMDEKC--QYDSKNRAATCSKYTELP 231

Query: 234 PNDEESLLKAVANQ-PVSVAIDASALQFY---SGGVFNGYCETFLNHGVTAVGYGTSEEG 289
              EE+L +AVA++ PVSVAIDAS   F+   SG  +   C   +NHGV  VGYG +  G
Sbjct: 232 FGSEEALKEAVASKGPVSVAIDASHSSFFLYRSGVYYEPACTQVVNHGVLVVGYG-NLNG 290

Query: 290 IKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFP 331
             YWL+KNSWG  +G+ GY R+ R+    +  CGIA ++S+P
Sbjct: 291 NDYWLVKNSWGLYFGDKGYIRMARN---RENHCGIASYSSYP 329


>gi|390347681|ref|XP_801784.2| PREDICTED: cathepsin L1-like isoform 2 [Strongylocentrotus
           purpuratus]
          Length = 336

 Score =  211 bits (536), Expect = 6e-52,   Method: Compositional matrix adjust.
 Identities = 126/342 (36%), Positives = 199/342 (58%), Gaps = 26/342 (7%)

Query: 5   FLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVA 64
           FL+ + +++  CA+ A  +  +   +  ++ +WK  + ++Y       +R  ++++N+  
Sbjct: 6   FLVAIGLVA--CATAAFVKPTNP-DLDSRWLEWKIAHTKSYTNDMHELERRLVWEENVKM 62

Query: 65  VERFN-NAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSS-Q 122
           +   N + ++  + + L +N++ D+   E  ++  G+K S+ +   K  G+ FL  S+ Q
Sbjct: 63  INMHNLDHSLHKKGFRLGMNEYGDMRLHEVRSTMNGYKSSNVT---KVQGSTFLTPSNIQ 119

Query: 123 VPPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATND 175
           VP +V+W  KG VTPVK QGQC          ++EG    K ++LVSLSEQ LVDC+  +
Sbjct: 120 VPDTVDWRTKGYVTPVKNQGQCGSCWAFSTTGSLEGQTFKKTSKLVSLSEQNLVDCSRTE 179

Query: 176 NNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPN 235
            N GC GG MD  F+Y+I N GI ++  Y Y+      C   KA   +A++T + DV   
Sbjct: 180 GNMGCEGGLMDQGFQYVIDNHGIDSEDCYPYDAEDE-TC-HYKASCDSAEVTGFTDVTSG 237

Query: 236 DEESLLKAVAN-QPVSVAIDAS--ALQFYSGGVFN-GYCETF-LNHGVTAVGYGTSEEGI 290
           DE++L++AVA+  PVSVAIDAS  + Q Y  GV++   C +  L+HGV  VGYGT + G 
Sbjct: 238 DEQALMEAVASVGPVSVAIDASHQSFQLYESGVYDEPECSSSELDHGVLVVGYGT-DGGK 296

Query: 291 KYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
            YWL+KNSWG+ WG  GY ++ R+      QCGIA  AS+P+
Sbjct: 297 DYWLVKNSWGETWGLSGYIKMSRN---KSNQCGIATSASYPL 335


>gi|728637|emb|CAA59441.1| cathepsin l [Litopenaeus vannamei]
          Length = 326

 Score =  211 bits (536), Expect = 6e-52,   Method: Compositional matrix adjust.
 Identities = 128/323 (39%), Positives = 180/323 (55%), Gaps = 34/323 (10%)

Query: 29  SIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVE----RFNNAAIGNRSYTLRLNK 84
           S+ ++++ +KA++GR Y    E   R  +F+ N   ++    RF N  +   ++TL++N+
Sbjct: 18  SLRQQWQNFKAEHGRRYASVQEERYRLSVFEQNQQFIDDHNARFENGEV---TFTLQMNQ 74

Query: 85  FADLTPQEFIASQTGF---KMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQ 141
           F D+T +E +A+  GF        ++ LKA+          +P  V+W  KGAVTPVK Q
Sbjct: 75  FGDMTSEEIVATMNGFLGAPTRRPAAVLKAD-------DETLPEKVDWRTKGAVTPVKDQ 127

Query: 142 GQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQ 194
            QC          ++EG + +K  +LVSLSEQ LVDC+    N GC GG MD AF+YI  
Sbjct: 128 KQCGSCWAFSTTGSLEGQHFLKDGKLVSLSEQNLVDCSDKFGNMGCMGGLMDQAFRYIKA 187

Query: 195 NKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVAN-QPVSVAI 253
           NKGI  +  Y YE    G C    A +  A  T Y DV    E +L KAVA   P+SV I
Sbjct: 188 NKGIDTEDSYPYEAQD-GKC-RFDASNVGATDTGYVDVEHGSESALKKAVATIGPISVGI 245

Query: 254 DA--SALQFYSGGVF-NGYC-ETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYF 309
           DA  S   FY  GV+ + +C  T L+HGV AVGYG+ E G  +WL+KNSW   WG+ GY 
Sbjct: 246 DASQSTFHFYHTGVYHDDHCSSTMLDHGVLAVGYGSDENGGDFWLVKNSWNTSWGDKGYI 305

Query: 310 RLQRDIDQPQGQCGIAMFASFPV 332
           ++ R+ +     CGIA  AS+P+
Sbjct: 306 KMSRNRNN---NCGIASQASYPL 325


>gi|426216524|ref|XP_004002512.1| PREDICTED: cathepsin S isoform 1 [Ovis aries]
          Length = 331

 Score =  210 bits (535), Expect = 6e-52,   Method: Compositional matrix adjust.
 Identities = 134/330 (40%), Positives = 186/330 (56%), Gaps = 28/330 (8%)

Query: 18  SQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFN-NAAIGNR 76
           S A  +   + ++   ++ WK  YG+ Y+E  E   R  I++ NL  V   N   ++G  
Sbjct: 12  SSAMAQVHRDPTLDHHWDLWKKTYGKQYEEKNEEVARRLIWEKNLKTVMLHNLEHSMGMH 71

Query: 77  SYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSS---QVPPSVNWIEKG 133
           SY L +N   D+T +E I+S +  ++    S    N T   YKSS   ++P S++W EKG
Sbjct: 72  SYELGMNHLGDMTSEEVISSMSSLRVP---SQWPRNVT---YKSSPNQKLPDSLDWREKG 125

Query: 134 AVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCAT-NDNNNGCYGGFM 185
            VT VKYQG C       AV A+E    +K  +LVSLS Q LVDC+T    N GC GGFM
Sbjct: 126 CVTEVKYQGACGSCWAFSAVGALEAQVKLKTGKLVSLSAQNLVDCSTVKYGNKGCNGGFM 185

Query: 186 DDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVA 245
            +AF+YII N GI ++A Y Y+ M  G C     ++ AA  + Y ++P   EE+L +AVA
Sbjct: 186 TEAFQYIIDNNGIDSEASYPYKAMD-GRCQ-YDVKNRAATCSRYIELPFGSEEALKEAVA 243

Query: 246 NQ-PVSVAIDASALQFY---SGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQ 301
           N+ PVSV IDA    F+   +G  ++  C   +NHGV  VGYG S  G  YWL+KNSWG 
Sbjct: 244 NKGPVSVGIDAKQTSFFLYKTGVYYDPSCTQNVNHGVLVVGYG-SLNGKDYWLVKNSWGL 302

Query: 302 DWGEDGYFRLQRDIDQPQGQCGIAMFASFP 331
           ++G+ GY R+ R+       CGIA F S+P
Sbjct: 303 NFGDQGYIRMARN---SGNHCGIANFPSYP 329


>gi|32394728|gb|AAM96000.1| cathepsin L precursor [Metapenaeus ensis]
          Length = 322

 Score =  210 bits (535), Expect = 7e-52,   Method: Compositional matrix adjust.
 Identities = 128/315 (40%), Positives = 181/315 (57%), Gaps = 28/315 (8%)

Query: 34  FEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFN-NAAIGNRSYTLRLNKFADLTPQE 92
           ++ +K QYGR Y  + E+  R  +F+ N   +E  N     G  ++TL++N+F D+T +E
Sbjct: 19  WQDFKVQYGRHYGTAREDLYRQSVFEQNQQFIEDHNAKFENGEVTFTLKMNQFGDMTSEE 78

Query: 93  FIASQTGF---KMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQCA---- 145
           F A+  GF         + L+A+          +P  V+W  KGAVTPVK Q QC     
Sbjct: 79  FAATMNGFLNVPTRHPVAILEAD-------DETLPKHVDWRTKGAVTPVKDQKQCGSCWA 131

Query: 146 ---VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDA 202
                ++EG + +K  +LVSLSEQ LVDC+    N GC GG MD AFKYI +NKGI  + 
Sbjct: 132 FSTTGSLEGQHFLKDGKLVSLSEQNLVDCSGKFGNMGCCGGLMDQAFKYIKENKGIDTEE 191

Query: 203 VYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVAN-QPVSVAIDAS--ALQ 259
            Y YE    G C    + +  A  T + D+   +E SL+KAVAN  P+SVAIDAS  + Q
Sbjct: 192 SYPYEAQD-GKC-RFDSSNVGATDTGFVDIAHGEENSLMKAVANIGPISVAIDASHPSFQ 249

Query: 260 FYSGGV-FNGYC-ETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQ 317
           FY  GV +   C  T L+HGV A+GYG +++G +YWL+KNSW   WG+ G+ ++ R+   
Sbjct: 250 FYHQGVYYEKECSSTMLDHGVLAIGYGETDDGKEYWLVKNSWNTSWGDKGFIQMSRN--- 306

Query: 318 PQGQCGIAMFASFPV 332
            +  CGIA  AS+P+
Sbjct: 307 KKNNCGIASQASYPL 321


>gi|229893789|gb|ACQ90252.1| cathepsin L [Pinctada fucata]
          Length = 362

 Score =  210 bits (535), Expect = 7e-52,   Method: Compositional matrix adjust.
 Identities = 125/317 (39%), Positives = 179/317 (56%), Gaps = 19/317 (5%)

Query: 28  GSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNA-AIGNRSYTLRLNKFA 86
           G   E ++++K  +G+ Y    E  KRF+IF+D L  +E  N    +G +SY + +N+F+
Sbjct: 48  GPYHETWKEFKTLFGKVYDTVEEEIKRFDIFRDTLERIEEHNRKYHMGQKSYYMGVNQFS 107

Query: 87  DLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQCA- 145
           D++  E++    G +  +   S       +     Q+   V+W +KG VTPVK QGQC  
Sbjct: 108 DMSHDEYL-RHNGLRRGNRKYSKGEGCDSYTKSGKQLDDKVDWRDKGYVTPVKNQGQCGS 166

Query: 146 ------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGIT 199
                   ++EG +  +  +L+SLSEQQLVDC+    N GC GG MD+AF+YI    G+ 
Sbjct: 167 CWSFSTTGSLEGQHFRQTGKLISLSEQQLVDCSGTFGNEGCNGGLMDNAFEYIKSIGGLE 226

Query: 200 NDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVAN-QPVSVAIDAS-- 256
            +  Y Y     G C  +K     A  T   DV   DE++L  A+A+  P+SVAIDAS  
Sbjct: 227 GEDDYPYTA-KQGKC-HLKKSLFKANDTGCTDVESGDEDALKDALASVGPISVAIDASHA 284

Query: 257 ALQFYSGGVFNGY-CET-FLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRD 314
           + Q Y GGV++   C +  L+HGV  VGYGT E G  YWL+KNSWG+ WGE+GY ++ R+
Sbjct: 285 SFQSYDGGVYDEEECSSQNLDHGVLTVGYGTEENGGDYWLVKNSWGEMWGEEGYIKMSRN 344

Query: 315 IDQPQGQCGIAMFASFP 331
            D    QCGIA  AS+P
Sbjct: 345 KDN---QCGIATQASYP 358


>gi|41688064|dbj|BAD08618.1| cathepsin L preproprotein [Cyprinus carpio]
          Length = 337

 Score =  210 bits (535), Expect = 7e-52,   Method: Compositional matrix adjust.
 Identities = 126/316 (39%), Positives = 178/316 (56%), Gaps = 25/316 (7%)

Query: 34  FEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFN-NAAIGNRSYTLRLNKFADLTPQE 92
           +EQWK  +G+ Y E  E  +R  +++ NL  +E  N   ++G  +Y L +N+F D+T +E
Sbjct: 29  WEQWKNWHGKKYHEKEEGWRRM-VWEKNLQKIELHNLEHSMGTHTYRLGMNRFGDMTHEE 87

Query: 93  FIASQTGFKMSDHSSSLKANGTPFLYKS-SQVPPSVNWIEKGAVTPVKYQGQCA------ 145
           F     G+K   H    +  G+ F+  +  +VP S++W EKG VTPVK QG+C       
Sbjct: 88  FRQVMNGYK---HKKERRFRGSLFMEPNFLEVPNSLDWREKGYVTPVKDQGECGSCWAFS 144

Query: 146 -VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVY 204
              A+EG    K  +LVSLSEQ LVDC+  + N GC GG MD AF+YI    G+ ++  Y
Sbjct: 145 TTGAMEGQMFRKTGKLVSLSEQNLVDCSRPEGNEGCNGGLMDQAFQYIKDQNGLDSEESY 204

Query: 205 SYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVAN-QPVSVAIDA--SALQFY 261
            Y G     C     +  AA  T + D+P   E +L+KA+A   PVSVAIDA   + QFY
Sbjct: 205 PYVGTDDQPC-HYDPKYSAANDTGFVDIPSGKEHALMKAIAAVGPVSVAIDAGHESFQFY 263

Query: 262 SGGV-FNGYCET-FLNHGVTAVGYGTSEE---GIKYWLIKNSWGQDWGEDGYFRLQRDID 316
             G+ +   C +  L+HGV AVGYG   E   G KYW++KNSW ++WG+ GY  + +D  
Sbjct: 264 QSGIYYEKECSSEELDHGVLAVGYGFEGEDVDGKKYWIVKNSWSENWGDKGYVYMAKD-- 321

Query: 317 QPQGQCGIAMFASFPV 332
                CGIA  AS+P+
Sbjct: 322 -RHNHCGIATAASYPL 336


>gi|348531513|ref|XP_003453253.1| PREDICTED: cathepsin L-like [Oreochromis niloticus]
          Length = 333

 Score =  210 bits (535), Expect = 7e-52,   Method: Compositional matrix adjust.
 Identities = 136/347 (39%), Positives = 198/347 (57%), Gaps = 30/347 (8%)

Query: 1   MAKYFLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKD 60
           M   F++  ++   SCAS +            +F  WK ++ ++Y   +E ++R +I+  
Sbjct: 1   MKLLFVVAAVLAVSSCASISLEDM--------EFHAWKLKFEKSYDSPSEETQRKQIWLS 52

Query: 61  NLVAVERFNNAA-IGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPF--L 117
           N   V + N  A +G +SY L +  FAD+  +E+    +   +   ++SL   G+ F  L
Sbjct: 53  NRKLVLKHNALADLGLKSYHLGMTYFADMENEEYKKLISQGCLGSFNASLPRRGSTFNRL 112

Query: 118 YKSSQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVD 170
            K + +P +V+W +KG VT VK Q QC       A  A+EG +  K  RLV LSEQQLVD
Sbjct: 113 PKGTVLPDTVDWRKKGYVTKVKNQQQCGSCWAFSATGALEGQHFKKTGRLVYLSEQQLVD 172

Query: 171 CATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYE 230
           C+ N  N GC GG+M++AFKYI  N GI  +A Y Y+ M  G+C         A    Y 
Sbjct: 173 CSRNFGNRGCDGGWMNNAFKYIKDNGGIQTEASYPYQAMD-GLCH-YNPNSVGAICNGYV 230

Query: 231 DVPPNDEESLLKAVAN-QPVSVAIDAS--ALQFYSGGVFNGY-C-ETFLNHGVTAVGYGT 285
           DV P DEE+L +AVA   P+S+A+DAS  + Q Y  GV++ + C + +L+HG+  VGYGT
Sbjct: 231 DVSP-DEEALKEAVATIGPISIAMDASHESFQLYQSGVYDEHRCNDYYLSHGMLVVGYGT 289

Query: 286 SEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
            E G+ YWLIKNSWG  WG+ GY ++ R+    + QCGIA  AS+P+
Sbjct: 290 -EGGLDYWLIKNSWGLGWGKMGYIKMVRN---KRNQCGIATAASYPL 332


>gi|391336140|ref|XP_003742440.1| PREDICTED: cathepsin L-like [Metaseiulus occidentalis]
          Length = 330

 Score =  210 bits (535), Expect = 7e-52,   Method: Compositional matrix adjust.
 Identities = 132/311 (42%), Positives = 180/311 (57%), Gaps = 27/311 (8%)

Query: 37  WKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIAS 96
           +K+ + ++Y++  E   R  IF+DNL  +E FN        +TL +N+FAD+T  EF   
Sbjct: 31  FKSTHLKSYRDGQEELIRRFIFEDNLHTIEEFNRVNASLAGFTLGVNEFADMTNTEFSNM 90

Query: 97  QTGFKMSDHSSSLKANGTPFLYKSSQV---PPSVNWIEKGAVTPVKYQGQCA-------V 146
             G    +     K  G   +++SS V   P  V+W +KG VT VK QGQC         
Sbjct: 91  LLGLGGRN-----KIAGDS-VFESSHVQDLPAEVDWTQKGYVTEVKNQGQCGSCWAFSTT 144

Query: 147 AAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSY 206
            ++EG    K  +LVSLSEQ LVDC+T++ N GC GG MD AF YI +N GI  +A Y Y
Sbjct: 145 GSLEGQVFKKTGKLVSLSEQNLVDCSTSEGNQGCNGGLMDQAFTYIKKNGGIDTEAAYPY 204

Query: 207 EGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVAN-QPVSVAIDASAL--QFYSG 263
            G S G C  ++ +   A ++ + DV   DE +L +AVA   P+SVAIDAS++  QFY G
Sbjct: 205 TG-SDGTCRFLENK-VGATVSGFVDVKSGDENALKEAVATVGPISVAIDASSIFFQFYRG 262

Query: 264 GVFNGY--CETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQ 321
           GV+N +    T L+HGV  VGYGT E G  YWL+KNSWG  WG  GY ++ R+    + +
Sbjct: 263 GVYNPWFCSSTELDHGVLVVGYGT-EGGKDYWLVKNSWGSSWGLKGYIKMVRN---KKNR 318

Query: 322 CGIAMFASFPV 332
           CGIA  AS+P 
Sbjct: 319 CGIATQASYPT 329


>gi|32394730|gb|AAM96001.1| cathepsin L precursor [Metapenaeus ensis]
          Length = 306

 Score =  210 bits (535), Expect = 7e-52,   Method: Compositional matrix adjust.
 Identities = 128/318 (40%), Positives = 183/318 (57%), Gaps = 34/318 (10%)

Query: 34  FEQWKAQYGRTYKESAENSKRFEIFKDNLVAVE----RFNNAAIGNRSYTLRLNKFADLT 89
           ++ +K QYGR Y  + E+  R  +F+ N   +E    +F N  +   ++TL++N+F D+T
Sbjct: 3   WQDFKVQYGRHYGTAREDLYRQSVFEQNQQFIEDHNAKFENGEV---TFTLKMNQFGDMT 59

Query: 90  PQEFIASQTGF---KMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQCA- 145
            +EF A+  GF         + L+A+          +P  V+W  KGAVTPVK Q QC  
Sbjct: 60  SEEFAATMNGFLNVPTRHPVAILEAD-------DETLPKHVDWRTKGAVTPVKDQKQCGS 112

Query: 146 ------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGIT 199
                   ++EG + +K  +LVSLSEQ LVDC+    N GC GG MD AFKYI +NKGI 
Sbjct: 113 CWAFSTTGSLEGQHFLKDGKLVSLSEQNLVDCSGKFGNMGCCGGLMDQAFKYIKENKGID 172

Query: 200 NDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVAN-QPVSVAIDAS-- 256
            +  Y YE    G C    + +  A  T + D+   +E SL+KAVAN  P+SVAIDAS  
Sbjct: 173 TEESYPYEAQD-GKC-RFDSSNVGATDTGFVDIAHGEENSLMKAVANIGPISVAIDASHP 230

Query: 257 ALQFYSGGV-FNGYC-ETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRD 314
           + QFY  GV +   C  T L+HGV A+GYG +++G +YWL+KNSW   WG+ G+ ++ R+
Sbjct: 231 SFQFYHQGVYYEKECSSTMLDHGVLAIGYGETDDGKEYWLVKNSWNTSWGDKGFIQMSRN 290

Query: 315 IDQPQGQCGIAMFASFPV 332
               +  CGIA  AS+P+
Sbjct: 291 ---KKNNCGIASQASYPL 305


>gi|422001787|dbj|BAM66994.1| germination-specific cysteine protease 1, partial [Raphanus
           sativus]
          Length = 235

 Score =  210 bits (535), Expect = 8e-52,   Method: Compositional matrix adjust.
 Identities = 112/227 (49%), Positives = 145/227 (63%), Gaps = 14/227 (6%)

Query: 123 VPPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATND 175
           +P +V+W +KGAV  +K QG C         A VEGIN I    L+SLSEQ+LVDC    
Sbjct: 4   LPETVDWRQKGAVNAIKNQGTCGSCWAFSTAAVVEGINKIVTGELISLSEQELVDC-DKS 62

Query: 176 NNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPN 235
            N GC GG MD AF++I++N G+  +  Y Y G S G C+S+        I  YEDVP N
Sbjct: 63  YNQGCNGGLMDYAFQFIMKNGGLNTEQDYPYRG-SDGKCNSLLKNSKVVTIDGYEDVPTN 121

Query: 236 DEESLLKAVANQPVSVAIDASA--LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYW 293
           DE +L +AV+ QPVSVAIDA     Q Y  G+F G C T ++H V AVGYG SE G+ YW
Sbjct: 122 DETALKRAVSYQPVSVAIDAGGRVFQHYQSGIFTGECGTKMDHAVVAVGYG-SENGVDYW 180

Query: 294 LIKNSWGQDWGEDGYFRLQRDIDQPQ-GQCGIAMFASFPVSKESAQP 339
           +++NSWGQ WGEDGY R++R++   + G+CGIA+ AS+PV K S  P
Sbjct: 181 IVRNSWGQKWGEDGYIRIERNLASSKSGKCGIAIEASYPV-KYSPNP 226


>gi|334332716|ref|XP_001367365.2| PREDICTED: cathepsin L1-like [Monodelphis domestica]
          Length = 335

 Score =  210 bits (535), Expect = 8e-52,   Method: Compositional matrix adjust.
 Identities = 137/348 (39%), Positives = 195/348 (56%), Gaps = 32/348 (9%)

Query: 1   MAKYFLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKD 60
           M  Y  +  L +  + A     R  D      ++ QWKAQ+G++Y E+ E+S R  I++ 
Sbjct: 1   MNFYLCLASLCLGLAAAIPPFDRALDS-----QWHQWKAQHGKSY-EANEDSLRRAIWEK 54

Query: 61  NLVAVERFNNA-AIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYK 119
           NL  +ER N     G +S+ L +NKF D+T +EF   Q      + S+S +     +L++
Sbjct: 55  NLKMIERHNQEYRAGKQSFQLGMNKFGDMTTEEF---QEAINFYNSSASQRRT-KRYLHR 110

Query: 120 S---SQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLV 169
               +Q+P SV+W E+G VTPVK QGQC       AV A+EG    K   LVSLS Q LV
Sbjct: 111 EPLLAQLPESVDWREEGYVTPVKNQGQCLSCWAFSAVGAIEGQWFRKTGELVSLSIQNLV 170

Query: 170 DCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNY 229
           DC T+D+ + C+GGFMD AF+Y+  N GI  +  Y Y G     C   + E   A +  +
Sbjct: 171 DCTTSDSISSCHGGFMDRAFQYVQDNGGIDTEECYPYVG-EVNEC-KYQPECSGANVVGF 228

Query: 230 EDVPPNDEESLLKAVAN-QPVSVAIDAS--ALQFYSGGV-FNGYCETF-LNHGVTAVGYG 284
            D+P  DE +L++AVA   P+SVAID    + +FY  GV ++  C +  LNH    VGYG
Sbjct: 229 VDIPSMDERALMEAVATVGPISVAIDGGNPSFKFYESGVYYDPQCSSSQLNHAGLVVGYG 288

Query: 285 TSE-EGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFP 331
           +   +G KYW++KNSWG+ WG +GY  + +D D     CGIA  AS+P
Sbjct: 289 SEGIDGRKYWIVKNSWGELWGNNGYILMAKDEDN---HCGIATEASYP 333


>gi|261289787|ref|XP_002611755.1| hypothetical protein BRAFLDRAFT_284339 [Branchiostoma floridae]
 gi|229297127|gb|EEN67765.1| hypothetical protein BRAFLDRAFT_284339 [Branchiostoma floridae]
          Length = 327

 Score =  210 bits (535), Expect = 8e-52,   Method: Compositional matrix adjust.
 Identities = 132/337 (39%), Positives = 194/337 (57%), Gaps = 27/337 (8%)

Query: 10  LIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFN 69
           L+I   C + AT       +I  ++E +K  +G+ Y E  E+  R+ IF++N   V++ N
Sbjct: 3   LLIFVVCVAVAT-------AIDPQWEAFKLLHGKQYSE-YEDGARYAIFQENSRIVKQHN 54

Query: 70  N-AAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPF-LYKSSQVPPSV 127
             AA+G  ++ +R+NKF D+T +EF     G  +   + + +  G  F      +V  +V
Sbjct: 55  EEAAMGKHTFFMRMNKFGDMTNEEFQMLVIGSGLLYSNKTQQTEGGVFESLPGLKVNDTV 114

Query: 128 NWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGC 180
           +W +KGAVT VK Q QC          ++EG + +K   LVSLSEQ LVDC+  + N GC
Sbjct: 115 DWRQKGAVTKVKNQEQCGSCWAFSTTGSLEGQHFLKSGTLVSLSEQNLVDCSRKEGNKGC 174

Query: 181 YGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESL 240
            GG MD AFKYI  N GI  +  Y Y+G +   C+  K+    A +++Y D+   DE++L
Sbjct: 175 QGGLMDQAFKYIKTNGGIDTEECYPYKGKNERKCE-YKSSCSGATLSSYVDIKTGDEDAL 233

Query: 241 LKAVAN-QPVSVAIDAS--ALQFYSGGVFN-GYCET-FLNHGVTAVGYGTSEEGIKYWLI 295
           ++A A   P+SV IDAS  + Q Y  GV++   C +  L+HGV  VGYGT  E   YWL+
Sbjct: 234 MQASATIGPISVGIDASHPSFQLYDHGVYHEKRCSSKKLDHGVLVVGYGTDGEK-DYWLV 292

Query: 296 KNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
           KNSWG++WG +GY ++ R+ D    QCGIA  AS+PV
Sbjct: 293 KNSWGEEWGMEGYIKMSRNKDN---QCGIATQASYPV 326


>gi|21425246|emb|CAD33266.1| cathepsin L [Aphis gossypii]
          Length = 341

 Score =  210 bits (534), Expect = 8e-52,   Method: Compositional matrix adjust.
 Identities = 129/344 (37%), Positives = 190/344 (55%), Gaps = 23/344 (6%)

Query: 6   LIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAV 65
           +++VL +     S  +    +E  I E++  +K Q+ + Y++  E + R +++ DN + +
Sbjct: 3   VVIVLGLVAFAISTVSSINLNE-VIEEEWSLFKIQFKKLYEDIKEETFRKKVYLDNKLKI 61

Query: 66  ERFNNA-AIGNRSYTLRLNKFADLTPQEFIASQTGFKMS----DHSSSLKANGTPFLYKS 120
              N     G  +Y L +N F DL   E+     GFK S    D + +     T    ++
Sbjct: 62  AGHNKLYESGEETYALEMNHFGDLMQHEYTKMMNGFKPSLAGGDRNFTNDEAVTFLKSEN 121

Query: 121 SQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCAT 173
             +P SV+W +KG VTPVK QGQC       A  ++EG +  K   LVSLSEQ L+DC+ 
Sbjct: 122 VVIPKSVDWRKKGYVTPVKNQGQCGSCWSFSATGSLEGQHFRKTGVLVSLSEQNLIDCSR 181

Query: 174 NDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVP 233
              NNGC GG MD AFKYI  NKG+  +  Y YE            E+  A    + D+P
Sbjct: 182 KYGNNGCEGGLMDLAFKYIKSNKGLDTEKSYPYEAEDDKC--RYNPENSGATDKGFVDIP 239

Query: 234 PNDEESLLKAVAN-QPVSVAIDASA--LQFYSGGVF-NGYC-ETFLNHGVTAVGYGTSEE 288
             DE++L+ A+A   PVS+AIDAS+   QFY  GVF N  C  T L+HGV AVG+G+ ++
Sbjct: 240 EGDEDALMHALATVGPVSIAIDASSEKFQFYKKGVFYNPRCSSTELDHGVLAVGFGSDKK 299

Query: 289 GIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
           G  YW++KNSWG+ WG++GY  + R+    +  CG+A  AS+P+
Sbjct: 300 GGDYWIVKNSWGKTWGDEGYIMMARN---KKNNCGVASSASYPL 340


>gi|330803818|ref|XP_003289899.1| hypothetical protein DICPUDRAFT_154350 [Dictyostelium purpureum]
 gi|325080010|gb|EGC33584.1| hypothetical protein DICPUDRAFT_154350 [Dictyostelium purpureum]
          Length = 326

 Score =  210 bits (534), Expect = 8e-52,   Method: Compositional matrix adjust.
 Identities = 137/342 (40%), Positives = 185/342 (54%), Gaps = 36/342 (10%)

Query: 6   LIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAV 65
           LI   +I   C++    R F +      F+ W  ++ ++Y    E   R+ +F+DN+  V
Sbjct: 7   LIFCFLIINCCSAA---RIFSQKQYQTAFQNWMVKHQKSYTND-EFGSRYSVFQDNMDIV 62

Query: 66  ERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGT---PFLYKSSQ 122
            ++N       +  L LN  ADLT +EF     G          KAN T     L   S 
Sbjct: 63  AKWNQKG---SNTILGLNVMADLTNEEFKKLYLG---------TKANVTYKKKTLVGVSG 110

Query: 123 VPPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATND 175
           +P SV+W   GAVT VK QGQC          +VEGI+ I   +LV LSEQQ++DC+ ++
Sbjct: 111 LPASVDWRANGAVTAVKNQGQCGGCYAFSTTGSVEGIHEITSQQLVPLSEQQILDCSGSE 170

Query: 176 NNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPN 235
            NNGC GG M ++F+YII   G+  +A Y Y G   G C     ++  A IT Y++V   
Sbjct: 171 GNNGCDGGLMTNSFEYIIAVGGLDTEASYPYTG-EVGKC-KFNKKNIGATITGYKNVESG 228

Query: 236 DEESLLKAVANQPVSVAIDA--SALQFYSGGV-FNGYC-ETFLNHGVTAVGYGTSEEGIK 291
            E  L  AVA QPVSVAIDA  S+ Q Y+ GV +   C  T L+HGV AVGYG S+ G  
Sbjct: 229 SESDLQTAVAAQPVSVAIDASQSSFQLYASGVYYEPECSSTQLDHGVLAVGYG-SQSGQD 287

Query: 292 YWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPVS 333
           YW++KNSWG DWGE+G+  + R+ D     CGIA  ASFP +
Sbjct: 288 YWIVKNSWGADWGENGFILMARNKDN---NCGIATMASFPTA 326


>gi|33333696|gb|AAQ11966.1| putative gut cathepsin L-like cysteine protease [Callosobruchus
           maculatus]
          Length = 326

 Score =  210 bits (534), Expect = 8e-52,   Method: Compositional matrix adjust.
 Identities = 128/330 (38%), Positives = 185/330 (56%), Gaps = 49/330 (14%)

Query: 29  SIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNA-AIGNRSYTLRLNKFAD 87
           S+ E+++Q+K  +G+TY+   E  +RF +F+ NL+ ++  N     G  S+  ++ +FAD
Sbjct: 18  SVYEEWQQFKLDHGKTYRSLVEEKRRFSVFQKNLIDIQEHNKKYERGEVSFAKKVTQFAD 77

Query: 88  LTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSS-----------QVPPSVNWIEKGAVT 136
           +T +EF+              LK  G P L  ++           +   +V+W E+GAVT
Sbjct: 78  MTHEEFL------------DLLKLQGVPALPSNAVHFDNFEDTDMEEKDAVDWREEGAVT 125

Query: 137 PVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATND-NNNGCYGGFMDDA 188
           PVK Q  C       AV A+EG    K   LVSLS Q+LVDCAT +  NNGC GG M  A
Sbjct: 126 PVKDQANCGSCWAFSAVGAIEGQFFKKNGTLVSLSAQELVDCATEEYGNNGCRGGLMGQA 185

Query: 189 FKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQ- 247
           F ++ Q++GI  +  Y YEG  +  C   K+ D+  ++  Y  V P DE+ + + VA + 
Sbjct: 186 FDFV-QDEGIQTEESYPYEGRRSS-CK--KSGDYVTKVKTY--VFPLDEQEMARTVAAKG 239

Query: 248 PVSVAIDASALQFYSGGVFNGYCETF-----LNHGVTAVGYGTSEEGIKYWLIKNSWGQD 302
           PV+VAI+AS L FY  G+ +  C        LNHGV  VGYG SE G+ YW++KNSWG D
Sbjct: 240 PVAVAIEASQLSFYDKGIVDETCRCSNKREDLNHGVLVVGYG-SENGVDYWIVKNSWGAD 298

Query: 303 WGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
           WGE GYFRL++D+      CGI  +  +P+
Sbjct: 299 WGEKGYFRLKKDVK----ACGIGYYNPYPI 324


>gi|157278117|ref|NP_001098157.1| cathepsin S precursor [Oryzias latipes]
 gi|50251130|dbj|BAD27582.1| cathepsin S [Oryzias latipes]
          Length = 327

 Score =  210 bits (534), Expect = 8e-52,   Method: Compositional matrix adjust.
 Identities = 131/317 (41%), Positives = 177/317 (55%), Gaps = 25/317 (7%)

Query: 30  IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFN-NAAIGNRSYTLRLNKFADL 88
           + + +  WK  Y +TY    E   R  I+++NL  +   N   ++G  SY L +N   DL
Sbjct: 21  LDQHWNLWKKTYSKTYSHEIEEFGRRRIWEENLEMISVHNLEVSLGLHSYELAMNHLGDL 80

Query: 89  TPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQC---- 144
           T +E IAS TG   +     L+      +  ++ VP SV+W E G VT VK QG+C    
Sbjct: 81  TIEELIASLTG---TVAPVGLERIHYDLVKINTSVPESVDWREGGLVTSVKTQGRCGSCW 137

Query: 145 ---AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITND 201
              AV A+EG        L SLS Q LVDC+T   N GC GGFM +AF+Y+I+N+GI++D
Sbjct: 138 AFSAVGALEGQLKKTTGILTSLSPQNLVDCSTKYGNYGCKGGFMSNAFQYVIKNQGISSD 197

Query: 202 AVYSYEGMSTGICDSIK--AEDHAAQITNYEDVPPNDEESLLKAVAN-QPVSVAIDAS-- 256
           A Y Y G      D  K  ++  AA  T Y  +P  DE +L   VA   P+SVAIDAS  
Sbjct: 198 AAYPYIGKR----DKCKYDSKHRAANCTGYNFLPKGDEFALKVGVATIGPISVAIDASRP 253

Query: 257 ALQFYSGGVFNGY-CETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDI 315
              FY  GV+  + C   +NHGV  VGYGT E G  YWL+KNSWG+ +G+ GY ++ R+ 
Sbjct: 254 KFLFYRHGVYKDHSCSHNVNHGVLVVGYGT-ENGEDYWLVKNSWGERYGDGGYIKMARN- 311

Query: 316 DQPQGQCGIAMFASFPV 332
              + QCGIA++A FPV
Sbjct: 312 --RRNQCGIALYACFPV 326


>gi|306992173|gb|ADN19567.1| cathepsin L-like proteinase [Spodoptera frugiperda]
          Length = 344

 Score =  210 bits (534), Expect = 9e-52,   Method: Compositional matrix adjust.
 Identities = 130/352 (36%), Positives = 185/352 (52%), Gaps = 40/352 (11%)

Query: 7   IVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAV- 65
           +++ +++G+CA            + E++  +K ++ + Y    E+  R +I+ +N   + 
Sbjct: 6   VLLCLVAGACAVSLL------DLVREEWNAFKMEHSKQYDSEVEDKFRMKIYVENKHRIA 59

Query: 66  ---ERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSD---------HSSSLKANG 113
              +RF    +   SY L+ NK+AD+   EF+ +  GF  +          HS       
Sbjct: 60  KHNQRFEQRLV---SYKLKPNKYADMLHHEFVHTMNGFNKTAKHGGRNKAVHSKGRDGRA 116

Query: 114 TPFLYKSS-QVPPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSE 165
             F+  +    P  V+W +KGAVT VK QG+C          A+EG +  K   LVSLSE
Sbjct: 117 ATFIAPAHVSYPDHVDWRKKGAVTDVKDQGKCGSCWAFSTTGALEGQHFRKTGYLVSLSE 176

Query: 166 QQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQ 225
           Q LVDC+    NNGC GG MD+AFKYI  N GI  +  Y YE +          ++  A 
Sbjct: 177 QNLVDCSAAYGNNGCNGGLMDNAFKYIKDNGGIDTEKSYPYEAVDDKC--RYNPKNSGAD 234

Query: 226 ITNYEDVPPNDEESLLKAVAN-QPVSVAIDAS--ALQFYSGGVF--NGYCETFLNHGVTA 280
              + D+P  DEE L++AVA   P+SVAIDAS    QFYS GV+       T L+HGV  
Sbjct: 235 DVGFVDIPQGDEEKLMQAVATVGPISVAIDASQETFQFYSKGVYYDENCSSTDLDHGVMV 294

Query: 281 VGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
           VGYGT EEG  YWL+KNSWG+ WGE GY ++  + +     CGIA  AS+P+
Sbjct: 295 VGYGTEEEGGDYWLVKNSWGRSWGELGYIKMAHNKNN---HCGIASSASYPL 343


>gi|293342579|ref|XP_001065885.2| PREDICTED: cathepsin L1 [Rattus norvegicus]
 gi|293354415|ref|XP_225137.5| PREDICTED: cathepsin L1 [Rattus norvegicus]
 gi|149039747|gb|EDL93863.1| rCG24278 [Rattus norvegicus]
          Length = 330

 Score =  210 bits (534), Expect = 9e-52,   Method: Compositional matrix adjust.
 Identities = 131/346 (37%), Positives = 194/346 (56%), Gaps = 31/346 (8%)

Query: 1   MAKYFLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKD 60
           M   FL+  L +    A+     +FD       +E+WK ++G+TY  + E  KR  ++++
Sbjct: 1   MTPIFLLATLCLGMISAAPTHDPSFDT-----VWEEWKTKHGKTYNTNEEGQKR-AVWEN 54

Query: 61  NLVAVERFNNAAI-GNRSYTLRLNKFADLTPQEFIASQTGFK-MSDHSSSLKANGTPFLY 118
           N+  +   N   + G   ++L +N F DLT  EF    TGF+ M    +++     PFL 
Sbjct: 55  NMKMINLHNEDYLKGKHGFSLEMNAFGDLTNTEFRELMTGFQSMGPKETTIFRE--PFL- 111

Query: 119 KSSQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDC 171
               +P S++W E G VTPVK QGQC       AV ++EG    K  +LVSLSEQ LVDC
Sbjct: 112 --GDIPKSLDWREHGYVTPVKNQGQCGSCWAFSAVGSLEGQIFKKTGKLVSLSEQNLVDC 169

Query: 172 ATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYED 231
           + +  N GC GG M+ AF+Y+ +N+G+     Y+YE    G+C     +  AA +T +  
Sbjct: 170 SWSYGNLGCNGGLMEFAFQYVKENRGLDTGESYAYEAQD-GLC-RYNPKYSAANVTGFVK 227

Query: 232 VPPNDEESLLKAVAN-QPVSVAIDAS--ALQFYSGGVF--NGYCETFLNHGVTAVGYGTS 286
           VP + E+ L+ AVA+  PVSV ID+   + +FYSGG++       T ++H V  VGYG  
Sbjct: 228 VPLS-EDDLMSAVASVGPVSVGIDSHHQSFRFYSGGMYYEPDCSSTEMDHAVLVVGYGEE 286

Query: 287 EEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
            +G KYWL+KNSWG+DWG DGY ++ +D +     CGIA +A +P 
Sbjct: 287 SDGGKYWLVKNSWGEDWGMDGYIKMAKDQNN---NCGIATYAIYPT 329


>gi|33333702|gb|AAQ11969.1| putative gut cathepsin L-like cysteine protease [Callosobruchus
           maculatus]
          Length = 326

 Score =  210 bits (534), Expect = 9e-52,   Method: Compositional matrix adjust.
 Identities = 128/330 (38%), Positives = 185/330 (56%), Gaps = 49/330 (14%)

Query: 29  SIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNA-AIGNRSYTLRLNKFAD 87
           S+ E+++Q+K  +G+TY+   E  +RF +F+ NL+ ++  N     G  S+  ++ +FAD
Sbjct: 18  SVYEEWQQFKLDHGKTYRSLVEEKRRFSVFQKNLIDIQEHNKKYERGEVSFAKKVTQFAD 77

Query: 88  LTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSS-----------QVPPSVNWIEKGAVT 136
           +T +EF+              LK  G P L  ++           +   +V+W E+GAVT
Sbjct: 78  MTHEEFL------------DLLKLQGVPALPSNAVHFDNFEDTDMEEKDAVDWREEGAVT 125

Query: 137 PVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATND-NNNGCYGGFMDDA 188
           PVK Q  C       AV A+EG    K   LVSLS Q+LVDCAT D  NNGC GG M  A
Sbjct: 126 PVKDQANCGSCWAFSAVGAIEGQFFKKNGTLVSLSAQELVDCATEDYGNNGCKGGLMGQA 185

Query: 189 FKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQ- 247
           F ++ Q++GI  +  Y YEG  +  C   K+ ++  ++  Y  V P DE+ + + VA + 
Sbjct: 186 FDFV-QDEGIQTEESYPYEGRRSS-CK--KSGEYVTKVKTY--VFPLDEQEMARTVAAKG 239

Query: 248 PVSVAIDASALQFYSGGVFNGYCETF-----LNHGVTAVGYGTSEEGIKYWLIKNSWGQD 302
           PV+VAI+AS L FY  G+ +  C        LNHGV  VGYG SE G+ YW++KNSWG D
Sbjct: 240 PVAVAIEASQLSFYDKGIVDERCRCSNKREDLNHGVLVVGYG-SENGVDYWIVKNSWGAD 298

Query: 303 WGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
           WGE GYFRL++D+      CGI  +  +P+
Sbjct: 299 WGEKGYFRLKKDVK----ACGIGYYNPYPI 324


>gi|157093355|gb|ABV22332.1| cysteine protease 1 [Noctiluca scintillans]
          Length = 338

 Score =  210 bits (534), Expect = 1e-51,   Method: Compositional matrix adjust.
 Identities = 131/311 (42%), Positives = 173/311 (55%), Gaps = 22/311 (7%)

Query: 34  FEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEF 93
           F  +K +YG+ Y    E++ RF IFK N+  +   N     N ++ L +N+F DLT +E 
Sbjct: 27  FNNFKTKYGKVYNGINEDAVRFGIFKANVDIIYATNAR---NLTFALGVNEFTDLTQEEL 83

Query: 94  IASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQCA-------V 146
            AS TG K +   S L    T   Y  + +  SV+W  +G VTPVK QGQC         
Sbjct: 84  AASYTGLKPASLWSGLPRLST-HEYNGAPLASSVDWTTQGVVTPVKNQGQCGSCWSFSTT 142

Query: 147 AAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSY 206
            A+EG  A+    LVSLSEQQ VDC T D+  GC GG+MD+AF +  +N  I  +  Y Y
Sbjct: 143 GALEGAWALSTGNLVSLSEQQFVDCDTTDS--GCNGGWMDNAFSFAKKNS-ICTEGSYPY 199

Query: 207 EGMSTGICDSIKAEDHAAQ--ITNYEDVPPNDEESLLKAVANQPVSVAIDAS--ALQFYS 262
              + G C+    +    Q  +  Y DV  + E++++ AVA QPVS+AI+A   + Q YS
Sbjct: 200 TA-TDGTCNLSGCQVGIPQGGVVGYTDVSTDSEQAMMSAVAQQPVSIAIEADQYSFQLYS 258

Query: 263 GGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQC 322
            GV    C T L+HGV AVGYG SE G  YW +KNSWG  WGE GY RLQR      G+C
Sbjct: 259 SGVLTASCGTRLDHGVLAVGYG-SEAGTDYWKVKNSWGSSWGEQGYVRLQRG-KGGAGEC 316

Query: 323 G-IAMFASFPV 332
           G +A   S+PV
Sbjct: 317 GLLAGPPSYPV 327


>gi|330434686|gb|AEC22811.1| cathepsin L [Macrobrachium nipponense]
          Length = 342

 Score =  210 bits (534), Expect = 1e-51,   Method: Compositional matrix adjust.
 Identities = 136/349 (38%), Positives = 192/349 (55%), Gaps = 28/349 (8%)

Query: 3   KYFLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNL 62
           KY   +VLI   + AS  ++ T     + E++E +K ++ + Y+   E + R +IF +N 
Sbjct: 2   KYLCALVLIAVAASASAVSFFTV----VMEEWESFKFEHSKKYESDTEETFRMKIFAENK 57

Query: 63  VAVERFNNA-AIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKAN----GTPFL 117
             +   N     G+++Y L +NK+ D+   EF+    GF+ +   +  KAN    G  F+
Sbjct: 58  QKIAAHNKLYHTGSKTYKLGMNKYGDMLHHEFVNMMNGFRANTSGAGYKANRGFQGAHFV 117

Query: 118 YKSSQV--PPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQL 168
                V  P SV+W EKGAVT VK QG C       A  A+EG +  +   LVSLSEQ L
Sbjct: 118 EPPEDVVMPKSVDWREKGAVTEVKDQGSCGSCWAFSATGALEGQHYRQTGDLVSLSEQNL 177

Query: 169 VDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITN 228
           VDC++   NNGC GG MD+AF+YI  N GI  +  Y YE             +  A    
Sbjct: 178 VDCSSKFGNNGCNGGLMDNAFQYIKVNGGIDTEKSYPYEAEDEPC--RYNPANAGADDRG 235

Query: 229 YEDVPPNDEESLLKAVAN-QPVSVAIDAS--ALQFYSGGVF-NGYCET-FLNHGVTAVGY 283
           + DV   +E +L KA+A   PVSVAIDAS  + QFY  GV+ +  C    L+HGV AVGY
Sbjct: 236 FVDVREGNENALKKAIATIGPVSVAIDASQDSFQFYQHGVYSDPDCSAENLDHGVLAVGY 295

Query: 284 GTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
           GT+E+G  YWL+KNSW + WG+ GY ++ R+ +     CGIA  AS+P+
Sbjct: 296 GTTEDGQDYWLVKNSWSKSWGDQGYIKIARNQNN---MCGIASAASYPL 341


>gi|196002275|ref|XP_002111005.1| expressed hypothetical protein [Trichoplax adhaerens]
 gi|190586956|gb|EDV27009.1| expressed hypothetical protein [Trichoplax adhaerens]
          Length = 325

 Score =  210 bits (534), Expect = 1e-51,   Method: Compositional matrix adjust.
 Identities = 123/311 (39%), Positives = 179/311 (57%), Gaps = 23/311 (7%)

Query: 34  FEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEF 93
           +E WK+ +G+ Y    E+  R  +F  N+  +   N       ++ + +N+F+DLT +EF
Sbjct: 25  WEAWKSFHGKKYHNQGEDDFRHYVFLQNIKTIAAHN----AKSTFKMAINEFSDLTRKEF 80

Query: 94  IASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQCA-------V 146
           + +  G+++S   S+ K + T     ++ +P  V+W ++G VTP+K QG+C         
Sbjct: 81  VKTYNGYRLSMKKSTNKPS-TFMAPLNTNMPTEVDWRKEGYVTPIKNQGRCGSCWAFSTT 139

Query: 147 AAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSY 206
            ++EG +  K  +LVSLSEQ L+DC+  + N+GC GGFMDDAF+YI  N GI  +A Y Y
Sbjct: 140 GSLEGQHFRKTGKLVSLSEQNLIDCSAAEGNDGCGGGFMDDAFEYIKLNNGIDTEASYPY 199

Query: 207 EGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVAN-QPVSVAIDAS--ALQFYSG 263
           EG    IC   K  +  A  T Y D+    E+ L  AVA   P+SVAIDAS  +   Y  
Sbjct: 200 EGRDD-IC-RYKKTNKGAIDTGYMDIKQYSEDDLKAAVATVGPISVAIDASHKSFHMYHT 257

Query: 264 GVFN--GYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQ 321
           GV++     +T L+HGV  VGYGT E G  YWL+KNSWG DWG +GY ++ R+       
Sbjct: 258 GVYHEPECSQTVLDHGVLVVGYGT-ENGEDYWLVKNSWGTDWGMNGYIKMSRN---RSNN 313

Query: 322 CGIAMFASFPV 332
           CGIA  AS+P+
Sbjct: 314 CGIATNASYPL 324


>gi|340381055|ref|XP_003389037.1| PREDICTED: cathepsin L1-like [Amphimedon queenslandica]
          Length = 329

 Score =  209 bits (533), Expect = 1e-51,   Method: Compositional matrix adjust.
 Identities = 133/318 (41%), Positives = 177/318 (55%), Gaps = 29/318 (9%)

Query: 33  KFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQE 92
           ++E WK   G+ Y    E   R  I++ N   V   +NA      +TL +N FADL   E
Sbjct: 22  EWELWKRTNGKDYSSEKEELYRQTIWEANKKIVLE-HNANADKWGWTLEMNAFADLESSE 80

Query: 93  FIASQTGFKMSDHSSSLKANGTPFLYKSSQ-VPPSVNWIEKGAVTPVKYQGQCA------ 145
           F A   G++     S+ K+N T +   +   +P +V+W  KGAVTPVK Q QC       
Sbjct: 81  FAAMYNGYR----RSARKSNATRYHVPTGNALPDTVDWRTKGAVTPVKNQKQCGSCWAFS 136

Query: 146 -VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVY 204
              ++EG   +K   L SLSEQQLVDC+    N+GC GG MD+AFKYI  N GI ++A Y
Sbjct: 137 TTGSLEGQTFLKKGTLPSLSEQQLVDCSDKYGNHGCQGGLMDNAFKYIEANGGIDSEASY 196

Query: 205 SYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVAN-QPVSVAIDA--SALQFY 261
            YE    G C   +    AA  T Y+D+P +D + L  AVAN  P+SVA+DA  S+ Q Y
Sbjct: 197 PYEA-KNGKC-RFQQSAVAATCTGYKDIPHDDIDGLQDAVANVGPISVAMDASHSSFQLY 254

Query: 262 SGGVFNGYC--ETFLNHGVTAVGYGTSEEGI-----KYWLIKNSWGQDWGEDGYFRLQRD 314
           + GV++      T L+HGV AVGYGT   G+      YWL+KNSWG DWG+ GYF++ R 
Sbjct: 255 AAGVYDPLLCSSTRLDHGVLAVGYGTEPSGLFHEEKPYWLVKNSWGPDWGQQGYFKIVRK 314

Query: 315 IDQPQGQCGIAMFASFPV 332
                 +CGIA  AS+P 
Sbjct: 315 ----DNKCGIATDASYPT 328


>gi|380014284|ref|XP_003691169.1| PREDICTED: cathepsin L-like [Apis florea]
          Length = 345

 Score =  209 bits (533), Expect = 1e-51,   Method: Compositional matrix adjust.
 Identities = 136/347 (39%), Positives = 189/347 (54%), Gaps = 27/347 (7%)

Query: 3   KYFLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNL 62
           K FLI+ + I  +  + + +   ++  +  K E  KA     YK   E   R +IF DN 
Sbjct: 2   KLFLILFITIFATVHAVSFFELVNQEWMTFKMEHKKA-----YKSDVEERFRMKIFMDNK 56

Query: 63  VAVERFN-NAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHS---SSLKANGTPFLY 118
             + + N N  +   SY L++NK+ D+   EF+    GF  S ++   S     G  F+ 
Sbjct: 57  HKIAKHNSNYEMKKVSYKLKMNKYGDMLHHEFVNILNGFNKSINTQLRSERMPIGASFIE 116

Query: 119 KSS-QVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVD 170
            ++  +P  V+W ++GAVTPVK QG C       A  A+EG +  +   LVSLSEQ L+D
Sbjct: 117 PANVALPKKVDWRKEGAVTPVKDQGHCGSCWSFSATGALEGQHFRRTGVLVSLSEQNLID 176

Query: 171 CATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYE 230
           C+    NNGC GG MD AF+YI  NKG+  +A Y YE      C    A   A  +  Y 
Sbjct: 177 CSGKYGNNGCNGGLMDQAFQYIKDNKGLDTEASYPYEA-ENDKCRYNPANSGAIDV-GYI 234

Query: 231 DVPPNDEESLLKAVAN-QPVSVAIDAS--ALQFYSGGV-FNGYCET-FLNHGVTAVGYGT 285
           D+P  +E+ L  AVA   PVSVAIDAS  + QFYS GV +   C +  L+HGV  +GYGT
Sbjct: 235 DIPTGNEKLLKAAVATIGPVSVAIDASHQSFQFYSEGVYYEPECSSEELDHGVLVIGYGT 294

Query: 286 SEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
           +E G  YWL+KNSWG+ WG +GY ++ R+       CGIA  AS+P+
Sbjct: 295 NENGEDYWLVKNSWGETWGNNGYIKMARN---KLNHCGIASSASYPL 338


>gi|195334204|ref|XP_002033774.1| GM21500 [Drosophila sechellia]
 gi|194125744|gb|EDW47787.1| GM21500 [Drosophila sechellia]
          Length = 341

 Score =  209 bits (533), Expect = 1e-51,   Method: Compositional matrix adjust.
 Identities = 130/325 (40%), Positives = 182/325 (56%), Gaps = 31/325 (9%)

Query: 30  IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNA-AIGNRSYTLRLNKFADL 88
           + E++  +K ++ + Y++  E   R +IF +N   + + N   A G  S+ L +NK+ADL
Sbjct: 25  VMEEWHTFKLEHRKNYQDDTEERFRLKIFNENKHKIAKHNQRFAEGKVSFKLAVNKYADL 84

Query: 89  TPQEFIASQTGFKMSDHSSSLKAN----GTPFLYKSS-QVPPSVNWIEKGAVTPVKYQGQ 143
              EF     GF  + H     A+    G  F+  +   +P SV+W  KGAVT VK QG 
Sbjct: 85  LHHEFRQLMNGFNYTLHKQLRAADESFKGVTFISPAHVTLPKSVDWRTKGAVTAVKDQGH 144

Query: 144 C-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNK 196
           C       +  A+EG +  K   LVSLSEQ LVDC+T   NNGC GG MD+AF+YI  N 
Sbjct: 145 CGSCWAFSSTGALEGQHFRKSGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKDNG 204

Query: 197 GITNDAVYSYEGMSTGICD----SIKAEDHAAQITNYEDVPPNDEESLLKAVAN-QPVSV 251
           GI  +  Y YE +    C     +I A D       + D+P  DE+ + +AVA   PV+V
Sbjct: 205 GIDTEKSYPYEAIDDS-CHFNKGTIGATDRG-----FTDIPQGDEKKMAEAVATVGPVAV 258

Query: 252 AIDAS--ALQFYSGGVFN-GYCET-FLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDG 307
           AIDAS  + QFYS GV+N   C+   L+HGV  VG+GT E G  YWL+KNSWG  WG+ G
Sbjct: 259 AIDASHESFQFYSEGVYNEPQCDAQNLDHGVLVVGFGTDESGEDYWLVKNSWGTTWGDKG 318

Query: 308 YFRLQRDIDQPQGQCGIAMFASFPV 332
           + ++ R+    + QCGIA  +S+P+
Sbjct: 319 FIKMLRN---KENQCGIASASSYPL 340


>gi|391328505|ref|XP_003738729.1| PREDICTED: cathepsin L-like [Metaseiulus occidentalis]
          Length = 323

 Score =  209 bits (533), Expect = 1e-51,   Method: Compositional matrix adjust.
 Identities = 130/320 (40%), Positives = 183/320 (57%), Gaps = 25/320 (7%)

Query: 28  GSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFN-NAAIGNRSYTLRLNKFA 86
           G  +  ++ +K  +G++Y    E+ +R ++F  ++  +   N    +G  +Y + LNKF 
Sbjct: 13  GLASANWDLYKKVHGKSYGHDEEHFRR-QLFYKSVAKINAHNLRHDLGLTTYRMGLNKFT 71

Query: 87  DLTPQEFIASQTGFKMSDHSSSLKANGTPFLYK--SSQVPPSVNWIEKGAVTPVKYQGQC 144
           D+T +EF  +  G K    ++  K NGT F  +     +P  V+W EKG VTPVK QGQC
Sbjct: 72  DMTSEEF-RNFKGLKFD--ATKTKRNGTRFQKELLGEALPTQVDWREKGYVTPVKNQGQC 128

Query: 145 A-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKG 197
                     ++EG +     +LVSLSEQ LVDC+  + NNGC GG MD+ F YI QN G
Sbjct: 129 GSCWAFSTTGSLEGQHFKATGKLVSLSEQNLVDCSRVEGNNGCNGGLMDNGFTYIQQNGG 188

Query: 198 ITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVAN-QPVSVAIDAS 256
           I  +  Y Y G   G C +       A++  + DVP  DE +L  AVA+  PVSVAIDAS
Sbjct: 189 IDTEESYPYTG-KDGDC-AFNENSVGARVKGFVDVPQRDEAALQAAVASVGPVSVAIDAS 246

Query: 257 --ALQFYSGGVFNGYCETF--LNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQ 312
             + Q+Y  GV++    +F  L+HGV  VGYGT E G+ YWL+KNSWG  WG+DGY ++ 
Sbjct: 247 NDSFQYYKEGVYDEPSCSFSQLDHGVLVVGYGT-ENGVDYWLVKNSWGPTWGQDGYIKMM 305

Query: 313 RDIDQPQGQCGIAMFASFPV 332
           R+    + QCGIA  AS+P 
Sbjct: 306 RN---KENQCGIASMASYPT 322


>gi|328776427|ref|XP_625135.3| PREDICTED: cathepsin L-like [Apis mellifera]
          Length = 351

 Score =  209 bits (533), Expect = 1e-51,   Method: Compositional matrix adjust.
 Identities = 130/320 (40%), Positives = 180/320 (56%), Gaps = 22/320 (6%)

Query: 30  IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFN-NAAIGNRSYTLRLNKFADL 88
           + +++  +K ++ + YK   E   R +IF DN   + + N N  +   SY L++NK+ D+
Sbjct: 30  VNQEWMTFKMEHKKVYKSDVEERFRMKIFMDNKHKIAKHNSNYEMKKVSYKLKMNKYGDM 89

Query: 89  TPQEFIASQTGFKMSDHS---SSLKANGTPFLYKSSQV-PPSVNWIEKGAVTPVKYQGQC 144
              EF+    GF  S ++   S     G  F+  ++ V P  V+W ++GAVTPVK QG C
Sbjct: 90  LHHEFVNILNGFNKSINTQLRSERLPVGASFIEPANVVLPKKVDWRKEGAVTPVKDQGHC 149

Query: 145 -------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKG 197
                  A  A+EG +  +   LVSLSEQ L+DC+    NNGC GG MD AF+YI  NKG
Sbjct: 150 GSCWSFSATGALEGQHFRRTGVLVSLSEQNLIDCSGKYGNNGCNGGLMDQAFQYIKDNKG 209

Query: 198 ITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVAN-QPVSVAIDAS 256
           +  +A Y YE      C    A   A  +  Y D+P  DE+ L  AVA   PVSVAIDAS
Sbjct: 210 LDTEASYPYEA-ENDKCRYNPANSGAIDV-GYIDIPTGDEKLLKAAVATIGPVSVAIDAS 267

Query: 257 --ALQFYSGGV-FNGYCET-FLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQ 312
             + QFYS GV +   C +  L+HGV  +GYGT+E G  YWL+KNSWG+ WG +GY ++ 
Sbjct: 268 HQSFQFYSEGVYYEPECSSEELDHGVLVIGYGTNENGQDYWLVKNSWGETWGNNGYIKMA 327

Query: 313 RDIDQPQGQCGIAMFASFPV 332
           R+       CGIA  AS+P+
Sbjct: 328 RN---KLNHCGIASSASYPL 344


>gi|392873948|gb|AFM85806.1| cathepsin L [Callorhinchus milii]
          Length = 338

 Score =  209 bits (533), Expect = 1e-51,   Method: Compositional matrix adjust.
 Identities = 129/349 (36%), Positives = 197/349 (56%), Gaps = 29/349 (8%)

Query: 1   MAKYFLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKD 60
           M   F+++ L ++G  A+     + D G +   +EQWK+ +G++Y++  E  +R  +++ 
Sbjct: 1   MRLPFVVLSLCLAGGLAAP----SLDPG-LDTHWEQWKSWHGKSYEQKEETWRRM-VWEK 54

Query: 61  NLVAVERFN-NAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYK 119
           +L  +E  N   ++G  S+ L +N F D+  +EF     G+K     +  K  G+ FL  
Sbjct: 55  HLRVIEIHNLEHSLGKHSFRLGMNHFGDMPNEEFRQLMNGYKYK--QTHKKLQGSHFLEP 112

Query: 120 S-SQVPPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDC 171
           +  +VP  V+W ++G VTPVK QGQC          A+EG +  +  +LVSLSEQ LV+C
Sbjct: 113 NFLEVPKHVDWRDEGYVTPVKDQGQCGSCWAFSTTGALEGQHFRRTGQLVSLSEQNLVEC 172

Query: 172 ATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYED 231
           +  + N GC GG MD AF+Y+  N GI ++  Y Y G     C     + +AA  T + D
Sbjct: 173 SKPEGNEGCNGGLMDQAFQYVKDNGGIDSEDSYPYVGTDDTPC-HYNPQYNAANDTGFVD 231

Query: 232 VPPNDEESLLKAVAN-QPVSVAIDA--SALQFYSGGV-FNGYC-ETFLNHGVTAVGYGTS 286
           +P   E +L+KA+A   PVSVAIDA  ++ QFY  G+ F   C  T L+HGV  VGYG  
Sbjct: 232 IPSGKERALMKAIAAVGPVSVAIDAGHTSFQFYQSGIYFEAECSSTDLDHGVLVVGYGVE 291

Query: 287 E---EGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
           +   +G KYW++KNSW +  G++GY  + +D D     CGIA  AS+P+
Sbjct: 292 KRDTDGKKYWIVKNSWSEKLGQNGYILMAKDKDN---HCGIATAASYPL 337


>gi|28194647|gb|AAO33585.1|AF479267_1 cathepsin L [Mesocricetus auratus]
          Length = 333

 Score =  209 bits (533), Expect = 1e-51,   Method: Compositional matrix adjust.
 Identities = 129/316 (40%), Positives = 179/316 (56%), Gaps = 27/316 (8%)

Query: 33  KFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNA-AIGNRSYTLRLNKFADLTPQ 91
           ++ +WK+ + R Y  + E  +R  +++ N+  +E  N   + G   +T+ +N F D+T +
Sbjct: 28  QWHKWKSTHRRLYDTNEEEWRR-AVWEKNMKMIELHNGEYSEGKHGFTMEMNAFGDMTNE 86

Query: 92  EFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQC------- 144
           EF     G+K   H    K    P +    Q+P SV+W EKG VTPVK QGQC       
Sbjct: 87  EFRQLVNGYKHQKHRKG-KLFQEPLML---QLPKSVDWREKGCVTPVKNQGQCGSCWAFS 142

Query: 145 AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVY 204
           A  A+EG   +K   LVSLSEQ LVDC+  + N GC GG MD AF+Y++ NKG+ ++  Y
Sbjct: 143 ACGALEGQMCLKTGVLVSLSEQNLVDCSRGEGNQGCNGGLMDFAFQYVLNNKGLDSEESY 202

Query: 205 SYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVAN-QPVSVAIDAS--ALQFY 261
            YE    G C   K E  AA  T Y D+ P  E++L+KAVA   P++VAIDAS  + QFY
Sbjct: 203 PYEA-KDGTC-KYKPEFAAANDTGYVDI-PQLEKALMKAVATVGPIAVAIDASHPSFQFY 259

Query: 262 SGGV-FNGYCETF-LNHGVTAVGY---GTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDID 316
           S G+ F   C +  L+HGV  +GY   GT     KYW++KNSWG  WG  G+F + +D +
Sbjct: 260 SSGIYFEPNCSSKDLDHGVLVIGYGFEGTDSNKKKYWIVKNSWGTGWGMGGFFHIAKDKN 319

Query: 317 QPQGQCGIAMFASFPV 332
                CGIA  AS+P 
Sbjct: 320 N---HCGIATAASYPT 332


>gi|405971603|gb|EKC36430.1| Cathepsin L [Crassostrea gigas]
          Length = 360

 Score =  209 bits (532), Expect = 1e-51,   Method: Compositional matrix adjust.
 Identities = 135/316 (42%), Positives = 190/316 (60%), Gaps = 29/316 (9%)

Query: 34  FEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNA-AIGNRSYTLRLNKFADLTPQE 92
           ++++K  + +TY    E S+RFEIF++N+  +E  N    +G +SY L +N+F+DL  +E
Sbjct: 56  WKEFKILHDKTYDALEEESRRFEIFRENVQKIEEHNKLYHLGKKSYYLGVNQFSDLKHEE 115

Query: 93  FIASQTGFKMSDHSSSLKANG-TPFLYKSSQVPP-SVNWIEKGAVTPVKYQGQCA----- 145
           F+    G K     +SLK  G + +L  ++ V P SV+W +KG VT VK QGQC      
Sbjct: 116 FV-KYNGLK----KTSLKDGGCSSYLAANNLVEPDSVDWRKKGYVTDVKNQGQCGSCWSF 170

Query: 146 --VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAV 203
               ++EG +  K  +LVSLSE QLVDC+ +  N GC GG MD+AFKYI    G+ ++  
Sbjct: 171 STTGSLEGQHFRKSGKLVSLSESQLVDCSQSFGNEGCNGGLMDNAFKYIKSVGGLESEED 230

Query: 204 YSYEGMSTGIC--DSIKAEDHAAQITNYEDVPPNDEESLLKAVAN-QPVSVAIDAS--AL 258
           Y Y+    G C  D  K    AA  T   DV    E +L KAV+   PVSVAIDAS  + 
Sbjct: 231 YPYK-PKQGTCKFDDTKV---AATDTGCVDVESGSESALKKAVSEVGPVSVAIDASHSSF 286

Query: 259 QFYSGGVFNG-YCETF-LNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDID 316
           Q Y+GGV++   C +  L+HGV  VGYGT ++G  YW++KNSWG +WGEDGY ++ R+  
Sbjct: 287 QSYAGGVYDEPECSSEQLDHGVLCVGYGTDDQGQDYWIVKNSWGAEWGEDGYVKMSRN-- 344

Query: 317 QPQGQCGIAMFASFPV 332
             + QCGIA  AS+P+
Sbjct: 345 -KKNQCGIATQASYPL 359


>gi|157093357|gb|ABV22333.1| cysteine protease 1 [Noctiluca scintillans]
          Length = 338

 Score =  209 bits (532), Expect = 1e-51,   Method: Compositional matrix adjust.
 Identities = 131/311 (42%), Positives = 173/311 (55%), Gaps = 22/311 (7%)

Query: 34  FEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEF 93
           F  +K +YG+ Y    E++ RF IFK N+  +   N     N ++ L +N+F DLT +EF
Sbjct: 27  FNNFKTKYGKVYNGINEDAVRFGIFKANVDIIYATNAR---NLTFALGVNEFTDLTQEEF 83

Query: 94  IASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQCA-------V 146
            AS TG K +   S L    T   Y  + +  SV+W  +G VTPVK QGQC         
Sbjct: 84  AASYTGLKPASLWSGLPRLST-HEYNGAPLASSVDWTTQGVVTPVKNQGQCGSCWSFSTT 142

Query: 147 AAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSY 206
            A+EG  A+    LVSLSEQQ  DC T D+  GC GG+MD+AF +  +N  I  +  Y Y
Sbjct: 143 GALEGAWALSTGNLVSLSEQQFEDCDTTDS--GCNGGWMDNAFSFAKKNS-ICTEGSYPY 199

Query: 207 EGMSTGICDSIKAEDHAAQ--ITNYEDVPPNDEESLLKAVANQPVSVAIDAS--ALQFYS 262
              + G C+    +    Q  +  Y DV  + E++++ AVA QPVS+AI+A   + Q YS
Sbjct: 200 TA-TDGTCNLSGCQVGIPQGGVVGYTDVSTDSEQAMMSAVAQQPVSIAIEADQYSFQLYS 258

Query: 263 GGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQC 322
            GV    C T L+HGV AVGYG SE G  YW +KNSWG  WGE GY RLQR      G+C
Sbjct: 259 SGVLTASCGTRLDHGVLAVGYG-SEAGTDYWKVKNSWGSSWGEQGYVRLQRG-KGGAGEC 316

Query: 323 G-IAMFASFPV 332
           G +A   S+PV
Sbjct: 317 GLLAGPPSYPV 327


>gi|119433808|gb|ABL74967.1| cysteine protease [Acanthamoeba castellanii]
          Length = 330

 Score =  209 bits (532), Expect = 1e-51,   Method: Compositional matrix adjust.
 Identities = 131/309 (42%), Positives = 172/309 (55%), Gaps = 23/309 (7%)

Query: 39  AQYGRTYKESAENSK---RFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIA 95
           A + RT+ +S  N +   R+ ++++N   ++  N     N SY L +NKF DLT  EF  
Sbjct: 31  ADWMRTHTKSYSNEEFVFRWNVWRENYNFIQEENRK---NNSYYLTMNKFGDLTNAEFNK 87

Query: 96  SQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQCA-------VAA 148
              G      +  LKA        +  +P + +W +KGAVT VK QGQC          +
Sbjct: 88  VYKGLAFDYSAHILKAKAATPAAPAPGLPANFDWRQKGAVTHVKNQGQCGSCWSFSTTGS 147

Query: 149 VEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEG 208
            EG N +K   LVSLSEQ L+DC+ +  NNGC GG MD AF+YII NKGI  +A Y YE 
Sbjct: 148 TEGANFLKRGTLVSLSEQNLIDCSGSYGNNGCNGGLMDYAFEYIINNKGIDTEASYPYET 207

Query: 209 MSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDAS--ALQFYSGGVF 266
                       +    +T+Y DV   DE +LL AVA +P SVAIDAS  + QFYSGGV+
Sbjct: 208 AQYNC--RYNPANSGGSLTSYTDVSSGDENALLNAVAIEPTSVAIDASHNSFQFYSGGVY 265

Query: 267 --NGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGI 324
             +    T L+HGV AVG+GT E G  YWL+KNSWG DWG  GY ++ R+       CGI
Sbjct: 266 YESSCSSTQLDHGVLAVGWGT-ENGQDYWLVKNSWGADWGLQGYIKMARN---RHNNCGI 321

Query: 325 AMFASFPVS 333
           A  AS+P +
Sbjct: 322 ATAASYPTA 330


>gi|326501772|dbj|BAK02675.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 333

 Score =  209 bits (532), Expect = 2e-51,   Method: Compositional matrix adjust.
 Identities = 129/340 (37%), Positives = 189/340 (55%), Gaps = 26/340 (7%)

Query: 6   LIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAV 65
           ++ VL ++ SC        FD   + + ++ WK    + Y ++ E+ +R   ++ NL  V
Sbjct: 6   VLAVLALAFSCT-----LAFD-AKLNQHWKLWKEANNKRYSDAEEHVRR-ATWEGNLQKV 58

Query: 66  ERFN-NAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVP 124
           +  N  A +G  +Y L +NK+AD+T  EF+    G+  +      +   T        +P
Sbjct: 59  QEHNLQADLGVHTYWLGMNKYADMTVTEFVKVMNGYNATMRGQRTQDRHTFSFNSKIALP 118

Query: 125 PSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNN 177
            +V+W +KG VT VK QGQC          A+EG +  +  +LVSLSEQ LVDC+    N
Sbjct: 119 DTVDWRDKGYVTDVKDQGQCGSCWAFSTTGALEGQHFKQTGKLVSLSEQNLVDCSGKQGN 178

Query: 178 NGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDE 237
            GC GG MD AF+YI +N GI  +  Y YE +        KA +  A  T + D+   DE
Sbjct: 179 MGCNGGLMDQAFEYIKENNGIDTEDSYPYEAVDNQC--RFKAANVGATDTGFTDITSKDE 236

Query: 238 ESLLKAVAN-QPVSVAIDA--SALQFYSGGVFNG-YC-ETFLNHGVTAVGYGTSEEGIKY 292
            +L +AVA   P+SVAIDA  ++ Q Y  GV+N  +C +T L+HGV AVGYGT + G  Y
Sbjct: 237 SALQQAVATVGPISVAIDAGHTSFQLYKHGVYNEPFCSQTRLDHGVLAVGYGT-DSGKDY 295

Query: 293 WLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
           WL+KNSWG+ WG+ GY ++ R+    + QCGIA  AS+P+
Sbjct: 296 WLVKNSWGEGWGDKGYIKMTRN---KRNQCGIATAASYPL 332


>gi|189525870|ref|XP_001923796.1| PREDICTED: cathepsin L1 [Danio rerio]
          Length = 335

 Score =  209 bits (532), Expect = 2e-51,   Method: Compositional matrix adjust.
 Identities = 126/343 (36%), Positives = 191/343 (55%), Gaps = 29/343 (8%)

Query: 6   LIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAV 65
           L+V L IS   A+ +     D+      +  WK+Q+G++Y E  E  +R  I+++NL  +
Sbjct: 5   LLVTLYISAVFAAPSIDIQLDD-----HWNSWKSQHGKSYHEDVEVGRRM-IWEENLRKI 58

Query: 66  ERFN-NAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLY-KSSQV 123
           E+ N   ++GN ++ + +N+F D+T +EF  +  G+K   H  +  + G  F+  K    
Sbjct: 59  EQHNFEYSLGNHTFKMGMNQFGDMTNEEFRQAMNGYK---HDPNRTSQGPLFMEPKFFAA 115

Query: 124 PPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDN 176
           P  V+W ++G VTPVK Q QC       +  A+EG    K  +L+S+SEQ LVDC+    
Sbjct: 116 PQQVDWRQRGYVTPVKDQKQCGSCWSFSSTGALEGQLFRKTGKLISMSEQNLVDCSRPHG 175

Query: 177 NNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPND 236
           N GC GG MD AF+Y+ +NKG+ ++  Y Y       C       + A+IT + D+P  +
Sbjct: 176 NQGCNGGLMDQAFQYVKENKGLDSEQSYPYLARDDLPC-RYDPRFNVAKITGFVDIPKGN 234

Query: 237 EESLLKAVAN-QPVSVAIDAS--ALQFYSGGV-FNGYCETFLNHGVTAVGY---GTSEEG 289
           E +L+ AVA   PVSVAIDAS  +LQFY  G+ +   C + L+H V  VGY   G    G
Sbjct: 235 ELALMNAVAAVGPVSVAIDASHQSLQFYQSGIYYERACTSQLDHAVLVVGYGYQGADVAG 294

Query: 290 IKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
            +YW++KNSW   WG+ GY  + +D +     CGIA  AS+P+
Sbjct: 295 NRYWIVKNSWSDKWGDKGYIYMAKDKNN---HCGIATMASYPL 334


>gi|296228726|ref|XP_002759933.1| PREDICTED: cathepsin S isoform 1 [Callithrix jacchus]
          Length = 330

 Score =  209 bits (531), Expect = 2e-51,   Method: Compositional matrix adjust.
 Identities = 131/341 (38%), Positives = 187/341 (54%), Gaps = 31/341 (9%)

Query: 6   LIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAV 65
           L+ VL +  S  +Q       + ++   +  WK  YG+ YKE  E + R  I++ NL  V
Sbjct: 4   LVCVLFVCSSAVAQ----LLKDPTLDHHWNLWKKTYGKQYKEKNEEAVRRLIWEKNLKFV 59

Query: 66  ERFN-NAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSS--- 121
              N   ++G  SY L +N   D+T +E ++  +  ++    S  + N T   YKS+   
Sbjct: 60  MLHNLEHSMGMHSYDLGMNHLGDMTSEEVMSLMSSLRVP---SQWQRNIT---YKSNPNQ 113

Query: 122 QVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATN 174
            +P SV+W EKG VT VKYQG C       AV A+E    +K  +LVSLS Q LVDC+  
Sbjct: 114 MLPDSVDWREKGCVTEVKYQGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSEK 173

Query: 175 DNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPP 234
             N GC GGFM +AF+YII NKGI ++A Y Y+ M         ++  AA  + Y ++P 
Sbjct: 174 YGNKGCNGGFMTEAFQYIIDNKGIDSEASYPYKAMDQKC--QYDSKYRAATCSKYTELPY 231

Query: 235 NDEESLLKAVANQ-PVSVAIDASALQFY---SGGVFNGYCETFLNHGVTAVGYGTSEEGI 290
             E+ L +AVAN+ PV V +DAS   F+   SG  ++  C   +NHGV  +GYG    G 
Sbjct: 232 GREDVLKEAVANKGPVCVGVDASHSSFFLYRSGVYYDPACTQNVNHGVLVIGYG-DLNGE 290

Query: 291 KYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFP 331
           +YWL+KNSWG ++GE GY R+ R+       CGIA + S+P
Sbjct: 291 EYWLVKNSWGSNFGERGYIRMARN---KGNHCGIASYPSYP 328


>gi|440906716|gb|ELR56945.1| Cathepsin S, partial [Bos grunniens mutus]
          Length = 342

 Score =  209 bits (531), Expect = 2e-51,   Method: Compositional matrix adjust.
 Identities = 134/343 (39%), Positives = 191/343 (55%), Gaps = 32/343 (9%)

Query: 5   FLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVA 64
           +L+  L++  S  +Q       + ++   ++ WK  YG+ YKE  E   R  I++ NL  
Sbjct: 14  WLVWALLLCSSAMAQ----VHRDPTLDHHWDLWKKTYGKQYKEKNEEVARRLIWEKNLKT 69

Query: 65  VERFN-NAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSS-- 121
           V   N   ++G  SY L +N   D+T +E I+  +  ++    S    N T   YKS   
Sbjct: 70  VTLHNLEHSMGMHSYELGMNHLGDMTSEEVISLMSSLRVP---SQWPRNVT---YKSDPN 123

Query: 122 -QVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCAT 173
            ++P S++W EKG VT VKYQG C       AV A+E    +K  +LVSLS Q LVDC+T
Sbjct: 124 QKLPDSMDWREKGCVTEVKYQGACGSCWAFSAVGALEAQVKLKTGKLVSLSAQNLVDCST 183

Query: 174 ND-NNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDV 232
               N GC GGFM +AF+YII N GI ++A Y Y+ M  G C     ++ AA  + Y ++
Sbjct: 184 AKYGNKGCNGGFMTEAFQYIIDNNGIDSEASYPYKAMD-GKCQ-YDVKNRAATCSRYIEL 241

Query: 233 PPNDEESLLKAVANQ-PVSVAIDASALQFY---SGGVFNGYCETFLNHGVTAVGYGTSEE 288
           P   EE+L +AVAN+ PVSV IDAS   F+   +G  ++  C   +NHGV  VGYG + +
Sbjct: 242 PFGSEEALKEAVANKGPVSVGIDASHSSFFLYKTGVYYDPSCTQNVNHGVLVVGYG-NLD 300

Query: 289 GIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFP 331
           G  YWL+KNSWG  +G+ GY R+ R+       CGIA + S+P
Sbjct: 301 GKDYWLVKNSWGLHFGDQGYIRMARN---SGNHCGIASYPSYP 340


>gi|55740402|gb|AAV63977.1| cathepsin L precursor [Artemia franciscana]
          Length = 338

 Score =  209 bits (531), Expect = 2e-51,   Method: Compositional matrix adjust.
 Identities = 128/340 (37%), Positives = 189/340 (55%), Gaps = 20/340 (5%)

Query: 7   IVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVE 66
           I ++ + G+   Q +        +A+++  +KA + + Y    E   R +I+ +N   V 
Sbjct: 4   ITLIFLLGAVLVQLSAALSLTNLLADEWHLFKATHKKEYPSQLEEKFRMKIYLENKHKVA 63

Query: 67  RFNNA-AIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSS-QVP 124
           + N     G +SY + +NKF DL   EF +   G++    +SS   +   F+  ++ +VP
Sbjct: 64  KHNILYEKGEKSYQVAMNKFGDLLHHEFRSIMNGYQHKKQNSSRAESTFTFMEPANVEVP 123

Query: 125 PSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNN 177
            SV+W EKGA+TPVK QGQC       +  A+EG    K  +L+SLSEQ L+DC+    N
Sbjct: 124 ESVDWREKGAITPVKDQGQCGSCWAFSSTGALEGQTFRKTGKLISLSEQNLIDCSGKYGN 183

Query: 178 NGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDE 237
            GC GG MD AF+YI  NKGI  +  Y YE     +C      +  A    + D+P  +E
Sbjct: 184 EGCNGGLMDQAFQYIKDNKGIDTENTYPYEA-EDDVC-RYNPRNRGAVDRGFVDIPSGEE 241

Query: 238 ESLLKAVAN-QPVSVAIDAS--ALQFYSGGV-FNGYCET-FLNHGVTAVGYGTSEEGIKY 292
           + L  AVA   PVSVAIDAS  + QFYS GV +   C++  L+HGV  VGYG S+ G  Y
Sbjct: 242 DKLKAAVATVGPVSVAIDASHESFQFYSKGVYYEPSCDSDDLDHGVLVVGYG-SDNGKDY 300

Query: 293 WLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
           WL+KNSW + WG++GY ++ R+    +  CG+A  AS+P+
Sbjct: 301 WLVKNSWSEHWGDEGYIKIARN---RKNHCGVATAASYPL 337


>gi|114559418|ref|XP_001171268.1| PREDICTED: cathepsin S isoform 3 [Pan troglodytes]
 gi|397492866|ref|XP_003817341.1| PREDICTED: cathepsin S isoform 1 [Pan paniscus]
 gi|410225070|gb|JAA09754.1| cathepsin S [Pan troglodytes]
 gi|410251608|gb|JAA13771.1| cathepsin S [Pan troglodytes]
 gi|410328325|gb|JAA33109.1| cathepsin S [Pan troglodytes]
 gi|410328327|gb|JAA33110.1| cathepsin S [Pan troglodytes]
          Length = 331

 Score =  209 bits (531), Expect = 2e-51,   Method: Compositional matrix adjust.
 Identities = 134/342 (39%), Positives = 187/342 (54%), Gaps = 32/342 (9%)

Query: 6   LIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAV 65
           L+ VL++  S  +Q       + ++   +  WK  YG+ YKE  E + R  I++ NL  V
Sbjct: 4   LVCVLLVCSSAVAQ----LHKDPTLDHHWHLWKKTYGKQYKEKNEEAVRRLIWEKNLKFV 59

Query: 66  ERFN-NAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQ-- 122
              N   ++G  SY L +N   D+T +E ++  +  ++    S  + N T   YKS+   
Sbjct: 60  MLHNLEHSMGMHSYDLGMNHLGDMTSEEVMSLMSSLRVP---SQWQRNIT---YKSNPNQ 113

Query: 123 -VPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATN 174
            +P SV+W EKG VT VKYQG C       AV A+E    +K  +LVSLS Q LVDC+T 
Sbjct: 114 ILPDSVDWREKGCVTEVKYQGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTE 173

Query: 175 D-NNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVP 233
              N GC GGFM  AF+YII NKGI +DA Y Y+           ++  AA  + Y ++P
Sbjct: 174 KYGNKGCNGGFMTTAFQYIIDNKGIDSDASYPYKATDQKC--QYDSKYRAATCSKYTELP 231

Query: 234 PNDEESLLKAVANQ-PVSVAIDASALQFY---SGGVFNGYCETFLNHGVTAVGYGTSEEG 289
              E+ L +AVAN+ PVSV +DA    F+   SG  +   C   +NHGV  VGYG    G
Sbjct: 232 YGREDVLKEAVANKGPVSVGVDALHPSFFLYRSGVYYEPSCTQNVNHGVLVVGYG-DLNG 290

Query: 290 IKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFP 331
            +YWL+KNSWG ++GE+GY R+ R+       CGIA F S+P
Sbjct: 291 KEYWLVKNSWGHNFGEEGYIRMARN---KGNHCGIASFPSYP 329


>gi|74151179|dbj|BAE27712.1| unnamed protein product [Mus musculus]
          Length = 334

 Score =  209 bits (531), Expect = 2e-51,   Method: Compositional matrix adjust.
 Identities = 131/324 (40%), Positives = 185/324 (57%), Gaps = 28/324 (8%)

Query: 25  FDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNA-AIGNRSYTLRLN 83
           FD+   AE + QWK+ + R Y  + E  +R  I++ N+  ++  N   + G   +++ +N
Sbjct: 21  FDQTFSAE-WHQWKSTHRRLYGTNEEEWRR-AIWEKNMRMIQLHNGEYSNGQHGFSMEMN 78

Query: 84  KFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQ 143
            F D+T +EF     G++   H    +    P + K   +P SV+W EKG VTPVK QGQ
Sbjct: 79  AFGDMTNEEFRQVVNGYRHQKHKKG-RLFQEPLMLK---IPKSVDWREKGCVTPVKNQGQ 134

Query: 144 C-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNK 196
           C       A   +EG   +K  +L+SLSEQ LVDC+    N GC GG MD AF+YI +N 
Sbjct: 135 CGSCWAFSASGCLEGQMFLKTGKLISLSEQNLVDCSHAQGNQGCNGGLMDFAFQYIKENG 194

Query: 197 GITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVAN-QPVSVAIDA 255
           G+ ++  Y YE    G C   +AE   A  T + D+ P  EE+L+KAVA   P+SVA+DA
Sbjct: 195 GLDSEESYPYEA-KDGSC-KYRAEFAVANDTGFVDI-PQQEEALMKAVATVGPISVAMDA 251

Query: 256 S--ALQFYSGGV-FNGYCETF-LNHGVTAVGY---GTSEEGIKYWLIKNSWGQDWGEDGY 308
           S  +LQFYS G+ +   C +  L+HGV  VGY   GT     KYWL+KNSWG +WG +GY
Sbjct: 252 SHPSLQFYSSGIYYEPNCSSKNLDHGVLLVGYGYEGTDSNKNKYWLVKNSWGSEWGMEGY 311

Query: 309 FRLQRDIDQPQGQCGIAMFASFPV 332
            ++ +D D     CG+A  AS+PV
Sbjct: 312 IKIAKDRDN---HCGLATAASYPV 332


>gi|395819351|ref|XP_003783057.1| PREDICTED: cathepsin L1-like [Otolemur garnettii]
          Length = 333

 Score =  209 bits (531), Expect = 2e-51,   Method: Compositional matrix adjust.
 Identities = 128/341 (37%), Positives = 188/341 (55%), Gaps = 27/341 (7%)

Query: 8   VVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVER 67
           ++LI++  C    +  +  +GS+   + +WKA++ + Y    E  +R  +++ N+  +E 
Sbjct: 3   LLLILAAFCVGITSATSMFDGSLNAHWYRWKAKHRKLYGMREEGWRR-AVWEKNMKMIEV 61

Query: 68  FNNA-AIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPS 126
            N   + G   +T+ +N F D+T +EF     GF+   H          FL    +VP S
Sbjct: 62  HNQEYSQGKHGFTMAMNAFGDMTNEEFRQVMNGFRNQKHKKGKVFQEPSFL----EVPKS 117

Query: 127 VNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNG 179
           V+W EKG VTPVK QGQC       A  A+EG    K  +L+SLSEQ LVDC+    N G
Sbjct: 118 VDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLISLSEQNLVDCSRPQGNEG 177

Query: 180 CYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEES 239
           C GG MD AF+YI +N G+ ++  Y Y+ M        + E   A  T + D+ P +E++
Sbjct: 178 CDGGLMDYAFQYIKENGGLDSEESYPYDAMDESC--KYRPEYSVANDTGFVDI-PKEEKA 234

Query: 240 LLKAVAN-QPVSVAIDA--SALQFYSGGV-FNGYCET-FLNHGVTAVGYG---TSEEGIK 291
           L+KAVA   P+SVAIDA   + QFY  GV F   C +  ++HGV  VGYG   T  +  K
Sbjct: 235 LMKAVATVGPISVAIDAGHESFQFYKEGVYFEPECSSDNVDHGVLVVGYGYEETESDNNK 294

Query: 292 YWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
           +WL+KNSWG++WG  GY ++ +D    +  CGIA  AS+P 
Sbjct: 295 FWLVKNSWGEEWGLGGYIKMTKD---QKNHCGIATAASYPT 332


>gi|156938919|gb|ABU97481.1| cathepsin L-like cysteine protease [Tyrophagus putrescentiae]
          Length = 333

 Score =  209 bits (531), Expect = 2e-51,   Method: Compositional matrix adjust.
 Identities = 135/347 (38%), Positives = 185/347 (53%), Gaps = 33/347 (9%)

Query: 3   KYFLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNL 62
           K+   V+ ++    AS+       EG +   F  +K ++GR+Y    E   R  +F  NL
Sbjct: 2   KFVFAVLALVFAPTASE----LISEGELEAHFNLFKTRFGRSYANFEEEIFRKRVFASNL 57

Query: 63  VAVERFNNAAI-GNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSS 121
             +   N     GN+++ + +N F D++  EF A   G + S   S+      P ++ +S
Sbjct: 58  EFIFNHNREFFAGNKNFNVAVNNFTDMSNTEFRARFNGLRHSGVQSA------PAIHSAS 111

Query: 122 Q--VPPSVNWIE-KGAVTPVKYQGQC--------AVAAVEGINAIKINRLVSLSEQQLVD 170
              +P +V+W + K  VTP+K Q QC        AVA++EG + +K  +LVSLSEQ LVD
Sbjct: 112 AEGLPATVDWTKVKNVVTPIKNQEQCGSCWAFFSAVASMEGQHGLKTGKLVSLSEQNLVD 171

Query: 171 CATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYE 230
           C+  + N GC GG MD AF+Y+I NKGI  +  Y Y+ +        K     A I +Y 
Sbjct: 172 CSAAEGNMGCEGGLMDQAFQYVIANKGIDTEMSYPYKAIDESW--EFKKNSVGATIKSYV 229

Query: 231 DVPPNDEESLLKAVAN-QPVSVAIDASAL--QFYSGGVFN--GYCETFLNHGVTAVGYGT 285
           DV    E SL  AVA   P+SV IDAS L  QFYS GV+       T L+HGVTAVGYG 
Sbjct: 230 DVKTGSESSLQSAVATVGPISVGIDASQLSFQFYSSGVYEEPACSTTILDHGVTAVGYG- 288

Query: 286 SEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
           +  G  YW +KNSWG  WG  GY  + R+    Q QCGIA  AS+PV
Sbjct: 289 ALNGTPYWKVKNSWGTSWGMSGYIFMSRN---KQNQCGIATAASWPV 332


>gi|414875906|tpg|DAA53037.1| TPA: hypothetical protein ZEAMMB73_586844 [Zea mays]
          Length = 1039

 Score =  209 bits (531), Expect = 2e-51,   Method: Compositional matrix adjust.
 Identities = 112/209 (53%), Positives = 137/209 (65%), Gaps = 10/209 (4%)

Query: 137 PVKYQGQC----AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYI 192
           P    G C     +AAVEGIN I    L+SLSEQ+LVDC T+  N GC GG MD AF++I
Sbjct: 708 PFAVAGSCWAFSTIAAVEGINQIVTGDLISLSEQELVDCDTS-YNQGCNGGLMDYAFEFI 766

Query: 193 IQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVA 252
           I N GI  +  Y Y+G + G CD  +       I +YEDVP NDE+SL KAVANQPVSVA
Sbjct: 767 INNGGIDTEKDYPYKG-TDGRCDVNRKNAKVVTIDSYEDVPANDEKSLQKAVANQPVSVA 825

Query: 253 IDA--SALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFR 310
           I+A  +  Q YS G+F G C T L+HGVT VGYGT E G  YW++KNSWG  WGE GY R
Sbjct: 826 IEAAGTTFQLYSSGIFTGSCGTALDHGVTVVGYGT-ENGKDYWIMKNSWGSSWGESGYVR 884

Query: 311 LQRDIDQPQGQCGIAMFASFPVSKESAQP 339
           ++R+I    G+CGIA+  S+P+ KE A P
Sbjct: 885 MERNIKASSGKCGIAVEPSYPL-KEGANP 912


>gi|156397875|ref|XP_001637915.1| predicted protein [Nematostella vectensis]
 gi|156225031|gb|EDO45852.1| predicted protein [Nematostella vectensis]
          Length = 331

 Score =  209 bits (531), Expect = 2e-51,   Method: Compositional matrix adjust.
 Identities = 130/344 (37%), Positives = 185/344 (53%), Gaps = 26/344 (7%)

Query: 1   MAKYFLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKD 60
           M      V+L IS + A        D       ++ WK+ +G+ Y    E + R  I+++
Sbjct: 1   MEAVIFAVLLCISSALAMPPMEPLQDP-----NWKAWKSFHGKEYPNKNEETMRNFIWQN 55

Query: 61  NLVAVERFNNAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKS 120
           NL  +   N    G  S+ L +N   D+T  E   +  G K+  H+ S     T     +
Sbjct: 56  NLKKIVTHNE---GKHSFKLAMNHLGDMTSLEISQTLLGLKLKKHAESQPKGATFLPPAN 112

Query: 121 SQVPPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCAT 173
            +V  S++W  KG VTPVK QGQC          A+EG +  K  +LVSLSEQ LVDC+ 
Sbjct: 113 VKVVDSIDWRSKGYVTPVKNQGQCGSCWAFSTTGALEGQHFRKTGKLVSLSEQNLVDCSG 172

Query: 174 NDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVP 233
              NNGC GG MD+AF+YI +N GI  +  Y Y     G+C   K+    A+ T + D+P
Sbjct: 173 KYGNNGCEGGLMDNAFQYIKENGGIDTEKSYPYLA-KDGVCHYNKSAI-GAKDTGFVDIP 230

Query: 234 PNDEESLLKAVAN-QPVSVAIDA--SALQFYSGGVFN--GYCETFLNHGVTAVGYGTSEE 288
             DE +L +A+A+  P+S+AIDA  S   FY  GV++      T L+HGV AVGYGT ++
Sbjct: 231 TGDENALQQALASVGPISIAIDASQSTFHFYHQGVYDDPDCSSTRLDHGVLAVGYGT-DD 289

Query: 289 GIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
           G  YWL+KNSWG  WGE+GY ++ R+      +CG+A  AS+P+
Sbjct: 290 GKDYWLVKNSWGPSWGEEGYIKIARN---DHDKCGVASKASYPL 330


>gi|401397136|ref|XP_003879989.1| cathepsin L, related [Neospora caninum Liverpool]
 gi|325114397|emb|CBZ49954.1| cathepsin L, related [Neospora caninum Liverpool]
          Length = 415

 Score =  209 bits (531), Expect = 2e-51,   Method: Compositional matrix adjust.
 Identities = 121/293 (41%), Positives = 170/293 (58%), Gaps = 24/293 (8%)

Query: 34  FEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQEF 93
           F  ++A YG++Y    E  KR+ IFK+NL  +   N       SY+L++N F DL+ +EF
Sbjct: 119 FGSFRATYGKSYATEEETQKRYAIFKNNLAYIHTHNQQGY---SYSLKMNHFGDLSREEF 175

Query: 94  IASQTGFKMSDHSSSLKAN----GTPFLYKS-SQVPPSVNWIEKGAVTPVKYQGQC---- 144
                G+   + S +LK+N     T  L  S S VP +V+W EKG VTPVK Q  C    
Sbjct: 176 RRKYLGY---NKSRNLKSNNLGVATELLKVSPSDVPSAVDWREKGCVTPVKDQRDCGSCW 232

Query: 145 ---AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITND 201
              A  A+EG +  K   L+SLSEQ+LVDC+  + N GC GG M+DAF+Y++ + G+ ++
Sbjct: 233 AFSATGALEGAHCAKTGELLSLSEQELVDCSLAEGNQGCSGGEMNDAFQYVVDSGGLCSE 292

Query: 202 AVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASAL--Q 259
             Y Y     G C   +A      I+ ++DVP   E ++  A+A+ PVS+AI+A  L  Q
Sbjct: 293 EGYPYLARD-GECK--RACKKVVTISGFKDVPRKSETAMKAALAHSPVSIAIEADQLPFQ 349

Query: 260 FYSGGVFNGYCETFLNHGVTAVGYGTSEEGIK-YWLIKNSWGQDWGEDGYFRL 311
           FY  GVF+  C T L+HGV  VGYGT +E  K +W++KNSWG  WG DGY  +
Sbjct: 350 FYHEGVFDASCGTDLDHGVLLVGYGTDKETKKDFWIMKNSWGSGWGRDGYMYM 402


>gi|291224870|ref|XP_002732425.1| PREDICTED: cathepsin L2-like [Saccoglossus kowalevskii]
          Length = 326

 Score =  208 bits (530), Expect = 3e-51,   Method: Compositional matrix adjust.
 Identities = 131/318 (41%), Positives = 183/318 (57%), Gaps = 23/318 (7%)

Query: 29  SIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAI-GNRSYTLRLNKFAD 87
           S+  ++E WK  YG+ Y +  E + R  I+  NL  ++  N   + G  +YT  +N+F D
Sbjct: 17  SVNTEWESWKRTYGKEYTQK-EEALRHMIWNVNLKMIQMHNEKYMSGKSTYTQNMNQFGD 75

Query: 88  LTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQC--- 144
           LT +E+     G+K S+ +   K + T  L  + + P S++W  +G VT VK QG C   
Sbjct: 76  LTNEEYRELMCGYKKSNKTVISKPS-TFLLPSNYRAPASIDWRTQGYVTDVKDQGACGSC 134

Query: 145 ----AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITN 200
               +  ++EG    K  +LV LSEQQLVDC+ +  N GC GG+MD AF YI ++KG  +
Sbjct: 135 WAFSSTGSLEGQTFKKTGKLVPLSEQQLVDCSGDYGNMGCGGGWMDQAFSYI-KDKGEES 193

Query: 201 DAVYSYEGMS-TGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVAN-QPVSVAIDA--S 256
           +  Y Y G   T + D+ K     A  T Y D+P  DE +L +AVA   P+SVAIDA  S
Sbjct: 194 EDGYPYTGTDDTCVYDASKV---VATDTGYTDIPEMDENALQQAVATVGPISVAIDATHS 250

Query: 257 ALQFYSGGVFNG--YCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRD 314
           + QFY  GV++     +T L+H V AVGYGTSEEG+ YW++KNSW   WG  GY  + R+
Sbjct: 251 SFQFYESGVYDEPECSQTNLDHAVLAVGYGTSEEGLDYWIVKNSWSTGWGMQGYIEMSRN 310

Query: 315 IDQPQGQCGIAMFASFPV 332
            D    QCGIA  AS+PV
Sbjct: 311 KDN---QCGIASKASYPV 325


>gi|195153545|ref|XP_002017686.1| GL17172 [Drosophila persimilis]
 gi|194113482|gb|EDW35525.1| GL17172 [Drosophila persimilis]
          Length = 341

 Score =  208 bits (530), Expect = 3e-51,   Method: Compositional matrix adjust.
 Identities = 133/349 (38%), Positives = 192/349 (55%), Gaps = 35/349 (10%)

Query: 6   LIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAV 65
           LI+ L+   + A   +Y       I E++  +K ++ + Y++  E   R +IF +N   +
Sbjct: 5   LILPLLALVAVAQAVSYAEV----IQEEWHTFKLEHRKNYQDETEERFRLKIFNENKHKI 60

Query: 66  ERFNNA-AIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKAN----GTPFLY-K 119
            + N   A G  S+ + +NK+AD+   EF ++  GF  + H     A+    G  F+  +
Sbjct: 61  AKHNQLWATGAVSFKMAVNKYADMLHHEFYSTMNGFNYTLHKQLRNADESFKGVTFISPE 120

Query: 120 SSQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCA 172
              +P  V+W  KGAVT VK QG C       +  A+EG +  K   LVSLSEQ LVDC+
Sbjct: 121 HVTLPKQVDWRTKGAVTDVKDQGHCGSCWAFSSTGALEGQHYRKSGVLVSLSEQNLVDCS 180

Query: 173 TNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICD----SIKAEDHAAQITN 228
           T   NNGC GG MD+AF+YI  N GI  +  Y YE +    C     SI A D       
Sbjct: 181 TKYGNNGCNGGLMDNAFRYIKDNGGIDTEKSYPYEAIDDS-CHFNKGSIGATDRG----- 234

Query: 229 YEDVPPNDEESLLKAVAN-QPVSVAIDAS--ALQFYSGGVFN-GYCET-FLNHGVTAVGY 283
           + D+P  +E+ + +AVA   PV+VAIDAS  + QFYS GV+N   C+   L+HGV  VG+
Sbjct: 235 FVDIPQGNEKKMAEAVATIGPVAVAIDASHESFQFYSEGVYNEPACDAQNLDHGVLVVGF 294

Query: 284 GTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
           GT E G  YWL+KNSWG  WG+ G+ ++ R+    + QCGIA  +S+P+
Sbjct: 295 GTDESGEDYWLVKNSWGTTWGDKGFIKMLRN---KENQCGIASASSYPL 340


>gi|355763133|gb|EHH62119.1| hypothetical protein EGM_20318 [Macaca fascicularis]
          Length = 331

 Score =  208 bits (530), Expect = 3e-51,   Method: Compositional matrix adjust.
 Identities = 134/342 (39%), Positives = 187/342 (54%), Gaps = 32/342 (9%)

Query: 6   LIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAV 65
           LI VL++  S  +Q       + ++   +  WK  YG+ YKE  E + R  I++ NL  V
Sbjct: 4   LICVLLVCSSAVAQ----LHKDPTLDHHWHLWKKTYGKQYKEKNEEAVRRLIWEKNLKFV 59

Query: 66  ERFN-NAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQ-- 122
              N   ++G  SY L +N   D+T +E ++  +  ++    S  + N T   YKS+   
Sbjct: 60  MLHNLEHSMGMHSYDLGMNHLGDMTSEEVMSLMSSLRVP---SQWQRNIT---YKSNANQ 113

Query: 123 -VPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATN 174
            +P SV+W EKG VT VKYQG C       AV A+E    +K  +LVSLS Q LVDC+T 
Sbjct: 114 ILPDSVDWREKGCVTEVKYQGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTE 173

Query: 175 D-NNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVP 233
              N GC GGFM  AF+YII N GI +DA Y Y+           ++  AA  + Y ++P
Sbjct: 174 KYGNKGCNGGFMTRAFQYIIDNNGIDSDASYPYKATDQKC--QYDSKYRAATCSKYTELP 231

Query: 234 PNDEESLLKAVANQ-PVSVAIDASALQFY---SGGVFNGYCETFLNHGVTAVGYGTSEEG 289
              E+ L + VAN+ PVSV +DAS   F+   SG  +   C   +NHGV  VGYG    G
Sbjct: 232 YGREDVLKEVVANKGPVSVGVDASHPSFFLYRSGVYYEPSCTQNVNHGVLVVGYGVL-NG 290

Query: 290 IKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFP 331
            +YWL+KNSWG+++GE+GY R+ R+       CGIA F S+P
Sbjct: 291 KEYWLVKNSWGRNFGEEGYIRMARN---KGNHCGIASFPSYP 329


>gi|402856105|ref|XP_003892640.1| PREDICTED: cathepsin S isoform 1 [Papio anubis]
          Length = 331

 Score =  208 bits (530), Expect = 3e-51,   Method: Compositional matrix adjust.
 Identities = 133/342 (38%), Positives = 187/342 (54%), Gaps = 32/342 (9%)

Query: 6   LIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAV 65
           L+ VL++  S  +Q       + ++   +  WK  YG+ YKE  E + R  I++ NL  V
Sbjct: 4   LVCVLLVCSSAVAQ----LHKDPTLDHHWHLWKKTYGKQYKEKNEEAVRRLIWEKNLKFV 59

Query: 66  ERFN-NAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSS--- 121
              N   ++G  SY L +N   D+T +E ++  +  ++    S  + N T   YKS+   
Sbjct: 60  MLHNLEHSMGMHSYDLGMNHLGDMTSEEVMSLMSSLRVP---SQWQRNIT---YKSNPNQ 113

Query: 122 QVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATN 174
            +P SV+W EKG VT VKYQG C       AV A+E    +K  +LVSLS Q LVDC+T 
Sbjct: 114 MLPDSVDWREKGCVTEVKYQGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTE 173

Query: 175 D-NNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVP 233
              N GC GGFM  AF+YII N GI +DA Y Y+           ++  AA  + Y ++P
Sbjct: 174 KYGNKGCNGGFMTRAFQYIIDNNGIDSDASYPYKATDQKC--QYDSKYRAATCSKYTELP 231

Query: 234 PNDEESLLKAVANQ-PVSVAIDASALQFY---SGGVFNGYCETFLNHGVTAVGYGTSEEG 289
              E+ L + VAN+ PVSV +DAS   F+   SG  +   C   +NHGV  VGYG    G
Sbjct: 232 YGREDVLKEVVANKGPVSVGVDASHPSFFLYRSGVYYEPSCTQNVNHGVLVVGYGVL-NG 290

Query: 290 IKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFP 331
            +YWL+KNSWG+++GE+GY R+ R+       CGIA F S+P
Sbjct: 291 KEYWLVKNSWGRNFGEEGYIRMARN---KGNHCGIASFPSYP 329


>gi|75812934|ref|NP_001028787.1| cathepsin S precursor [Bos taurus]
 gi|115503669|sp|P25326.2|CATS_BOVIN RecName: Full=Cathepsin S; Flags: Precursor
 gi|74353837|gb|AAI02246.1| Cathepsin S [Bos taurus]
 gi|296489535|tpg|DAA31648.1| TPA: cathepsin S precursor [Bos taurus]
          Length = 331

 Score =  208 bits (530), Expect = 3e-51,   Method: Compositional matrix adjust.
 Identities = 134/343 (39%), Positives = 190/343 (55%), Gaps = 32/343 (9%)

Query: 5   FLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVA 64
           +L+  L++     S A      + ++   ++ WK  YG+ YKE  E   R  I++ NL  
Sbjct: 3   WLVWALLL----CSSAMAHVHRDPTLDHHWDLWKKTYGKQYKEKNEEVARRLIWEKNLKT 58

Query: 65  VERFN-NAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSS-- 121
           V   N   ++G  SY L +N   D+T +E I+  +  ++    S    N T   YKS   
Sbjct: 59  VTLHNLEHSMGMHSYELGMNHLGDMTSEEVISLMSSLRVP---SQWPRNVT---YKSDPN 112

Query: 122 -QVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCAT 173
            ++P S++W EKG VT VKYQG C       AV A+E    +K  +LVSLS Q LVDC+T
Sbjct: 113 QKLPDSMDWREKGCVTEVKYQGACGSCWAFSAVGALEAQVKLKTGKLVSLSAQNLVDCST 172

Query: 174 ND-NNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDV 232
               N GC GGFM +AF+YII N GI ++A Y Y+ M  G C     ++ AA  + Y ++
Sbjct: 173 AKYGNKGCNGGFMTEAFQYIIDNNGIDSEASYPYKAMD-GKC-QYDVKNRAATCSRYIEL 230

Query: 233 PPNDEESLLKAVANQ-PVSVAIDASALQFY---SGGVFNGYCETFLNHGVTAVGYGTSEE 288
           P   EE+L +AVAN+ PVSV IDAS   F+   +G  ++  C   +NHGV  VGYG + +
Sbjct: 231 PFGSEEALKEAVANKGPVSVGIDASHSSFFLYKTGVYYDPSCTQNVNHGVLVVGYG-NLD 289

Query: 289 GIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFP 331
           G  YWL+KNSWG  +G+ GY R+ R+       CGIA + S+P
Sbjct: 290 GKDYWLVKNSWGLHFGDQGYIRMARN---SGNHCGIANYPSYP 329


>gi|302779822|ref|XP_002971686.1| hypothetical protein SELMODRAFT_16221 [Selaginella moellendorffii]
 gi|300160818|gb|EFJ27435.1| hypothetical protein SELMODRAFT_16221 [Selaginella moellendorffii]
          Length = 214

 Score =  208 bits (530), Expect = 3e-51,   Method: Compositional matrix adjust.
 Identities = 112/216 (51%), Positives = 138/216 (63%), Gaps = 12/216 (5%)

Query: 126 SVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNN 178
           SV+W +KG VT +K QG C       A+AAVEG+  +    LVSLSEQ+LVDC T   N 
Sbjct: 1   SVDWRKKGGVTEIKDQGDCGNCWAFSAIAAVEGLTFLSTGTLVSLSEQELVDCDTT-VNQ 59

Query: 179 GCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEE 238
           GC GG MD AF+Y+I+N GIT+ + Y Y     G CD  K + HAA I  ++ +PP  EE
Sbjct: 60  GCDGGMMDYAFQYMIRNGGITSQSNYPYR-AQRGACDKDKVKYHAATINGFQAIPPQSEE 118

Query: 239 SLLKAVANQPVSVAIDASA--LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIK 296
            LL+AVANQPVSVAI+A     Q YS GVF G C + L+HGV  VGYGT   G +YWL+K
Sbjct: 119 LLLRAVANQPVSVAIEAGGQDFQLYSSGVFTGECGSNLDHGVAIVGYGTDAGGRQYWLVK 178

Query: 297 NSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
           NSWG  WGE GY R++R      G CGI + AS+P 
Sbjct: 179 NSWGSGWGESGYVRMERQ-GPGAGVCGINLDASYPT 213


>gi|410978262|ref|XP_003995514.1| PREDICTED: cathepsin L1-like [Felis catus]
          Length = 333

 Score =  208 bits (529), Expect = 3e-51,   Method: Compositional matrix adjust.
 Identities = 130/339 (38%), Positives = 184/339 (54%), Gaps = 27/339 (7%)

Query: 10  LIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFN 69
           L ++  C   A+       S+ E + QWKA +G+ Y    E  +R E++K N+  + + N
Sbjct: 5   LFLAALCLGIASAAPQLNQSLDELWSQWKATHGKLYGMDEEGWRR-EVWKKNMKMIRQHN 63

Query: 70  -NAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVN 128
              + G  S+T+ +N F D+T +EF     G +M  H    K    P   K   +P SV+
Sbjct: 64  WEHSQGKHSFTVAMNGFGDMTNEEFKQVMNGLQMQKHKKG-KMFQAPLFAK---IPSSVD 119

Query: 129 WIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCY 181
           W EKG VTPVK QG C       A  A+EG    K  +LVSLSEQ LVDC+  + N GC 
Sbjct: 120 WREKGYVTPVKDQGPCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSQAEGNEGCN 179

Query: 182 GGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLL 241
           GG M++AF+Y+  N G+ ++  Y Y           K +D AA  T + D+ P  E++L+
Sbjct: 180 GGLMNNAFQYVKDNGGLDSEESYPYHAQDESC--KYKPQDSAANDTGFFDI-PQQEKALM 236

Query: 242 KAVANQ-PVSVAIDAS--ALQFYSGGV-FNGYCETF-LNHGVTAVGYGTS---EEGIKYW 293
            AVA + P+SV IDAS    QFY  G+ ++  C +  L+HGV  +GYGT         YW
Sbjct: 237 VAVATKGPISVGIDASHFTFQFYHEGIYYDPDCSSEDLDHGVLVIGYGTEIGQSINKTYW 296

Query: 294 LIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
           ++KNSWG +WG DGY ++ +D    +  CGIA  ASFPV
Sbjct: 297 IVKNSWGANWGIDGYIKMAKD---RKNHCGIATMASFPV 332


>gi|6650705|gb|AAF21977.1|AF115280_1 thiolproteinase SmTP1 [Sarcocystis muris]
          Length = 394

 Score =  208 bits (529), Expect = 3e-51,   Method: Compositional matrix adjust.
 Identities = 124/313 (39%), Positives = 174/313 (55%), Gaps = 18/313 (5%)

Query: 33  KFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTPQE 92
           +F Q++  + + Y    E  KR+ IFK+NL  +   N   +   SY L++NKF DLT +E
Sbjct: 88  QFYQFQRDHNKFYATEEERLKRYAIFKNNLTYIHNHN---MQGYSYVLKMNKFGDLTLEE 144

Query: 93  FIASQTGFKMSD-HSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQC------- 144
           F     G+K  D  +   + + T    + + +P  V+W ++G VT VK QG C       
Sbjct: 145 FRQRYLGYKKPDLRTPPREVDTTLESVEDNDIPTHVDWRQRGCVTSVKDQGDCGSCWAFS 204

Query: 145 AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVY 204
           A  A+EG+   K  +LV+LS+QQLVDC+    N GC GG M++AF+Y+++N GI +   Y
Sbjct: 205 ATGAMEGVYCAKTGKLVNLSQQQLVDCSRFLGNQGCDGGRMEEAFEYVVENGGICSGENY 264

Query: 205 SYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVA-NQPVSVAIDA--SALQFY 261
            Y     G+C S +     A IT Y  VP   E+S+  A+A   PVSVAI A  +A QFY
Sbjct: 265 PYM-RKDGVCKSSQCTS-VATITGYRSVPRRSEKSMKTALALRSPVSVAIQANQAAFQFY 322

Query: 262 SGGVFNGYCETFLNHGVTAVGYGTSEEGI-KYWLIKNSWGQDWGEDGYFRLQRDIDQPQG 320
             G+F+  C T L+HGV  VGY     G   YW++KNSWG  WG+ GY  L      P G
Sbjct: 323 YDGIFDAPCGTNLDHGVLLVGYSAETAGQGDYWIMKNSWGAAWGKGGYM-LMAMHKGPAG 381

Query: 321 QCGIAMFASFPVS 333
           QCG+ +  SFPV+
Sbjct: 382 QCGVLLDGSFPVA 394


>gi|957281|gb|AAB33990.1| cysteine proteinase [Bombyx mori]
          Length = 344

 Score =  208 bits (529), Expect = 3e-51,   Method: Compositional matrix adjust.
 Identities = 131/348 (37%), Positives = 194/348 (55%), Gaps = 31/348 (8%)

Query: 8   VVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVER 67
           +VL++    A  A  + FD   + E++  +K Q+   YK   E++ R +I+ ++   + +
Sbjct: 4   LVLLLCAVAAVSAV-QFFD--LVKEEWSAFKLQHRLNYKSEVEDNFRMKIYAEHKHIIAK 60

Query: 68  FNNA-AIGNRSYTLRLNKF---ADLTPQEFIASQTGF-KMSDHSSSL-----KANGTPFL 117
            N    +G  SY L +N +    D+   EF+ +  GF K + H+ +L        G  F+
Sbjct: 61  HNQKYEMGLVSYKLGMNSWWEHGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFI 120

Query: 118 YKSS-QVPPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLV 169
             ++ ++P  V+W + GAVT +K QG+C          A+EG +  +   LVSLSEQ L+
Sbjct: 121 SPANVKLPEQVDWRKHGAVTDIKDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLI 180

Query: 170 DCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNY 229
           DC+    NNGC GG MD+AFKYI  N GI  +  Y YEG+          ++  A+   +
Sbjct: 181 DCSEQYGNNGCNGGLMDNAFKYIKDNGGIDTEQAYPYEGVDDKC--RYNPKNTGAEDVGF 238

Query: 230 EDVPPNDEESLLKAVAN-QPVSVAIDAS--ALQFYSGGVFNGY--CETFLNHGVTAVGYG 284
            D+P  DE+ L++AVA   PVSVAIDAS    Q YS GV+N      T L+HGV  VGYG
Sbjct: 239 VDIPEGDEQKLMEAVATVGPVSVAIDASHTHFQLYSSGVYNEEECSSTDLDHGVLVVGYG 298

Query: 285 TSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
           T E+G+ YWL+KNSWG+ WGE GY ++ R+      +CGIA  AS+P+
Sbjct: 299 TDEQGVDYWLVKNSWGRSWGELGYIKMIRN---KNNRCGIASSASYPL 343


>gi|110349473|gb|ABG73217.1| cathepsin L 1 precursor [Diaprepes abbreviatus]
          Length = 322

 Score =  208 bits (529), Expect = 3e-51,   Method: Compositional matrix adjust.
 Identities = 129/311 (41%), Positives = 179/311 (57%), Gaps = 24/311 (7%)

Query: 33  KFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNA-AIGNRSYTLRLNKFADLTPQ 91
           KF+ +K ++G+TYK   E + RF IFKDNL A+E+ N     G  SY   +N+F D+T +
Sbjct: 24  KFQAFKLKHGKTYKNQVEETARFNIFKDNLRAIEQHNVLYEQGLVSYKKGINRFTDMTQE 83

Query: 92  EFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQCA------ 145
           EF A    F     S     N T  +     VP S++W  KG VT VK QG C       
Sbjct: 84  EFRA----FLTLSSSKKPHFNTTEHVLTGLAVPDSIDWRTKGQVTGVKDQGNCGSCWAFS 139

Query: 146 -VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVY 204
              + E     K  +LVSLSEQQLVDC+T D N GC GG++D+ F Y +++KG+  ++ Y
Sbjct: 140 VTGSTEAAYYRKAGKLVSLSEQQLVDCST-DINAGCNGGYLDETFTY-VKSKGLEAESTY 197

Query: 205 SYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVAN-QPVSVAIDASALQFYSG 263
            Y+G + G C    A     +++ ++ +   DE +LL AV N  PVSVAIDA+ L  Y  
Sbjct: 198 PYKG-TDGSC-KYSASKVVTKVSGHKSLKSEDENALLDAVGNVGPVSVAIDATYLSSYES 255

Query: 264 GVF-NGYCE-TFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQ 321
           G++ + +C  + LNHGV  VGYGTS  G KYW++KNSWG  +GE GYFRL R     + +
Sbjct: 256 GIYEDDWCSPSELNHGVLVVGYGTS-NGKKYWIVKNSWGGSFGESGYFRLLRG----KNE 310

Query: 322 CGIAMFASFPV 332
           CG+A    +P+
Sbjct: 311 CGVAEDTVYPI 321


>gi|356582227|ref|NP_001239115.1| cathepsin L1 precursor [Canis lupus familiaris]
 gi|62899810|sp|Q9GL24.1|CATL1_CANFA RecName: Full=Cathepsin L1; Contains: RecName: Full=Cathepsin L1
           heavy chain; Contains: RecName: Full=Cathepsin L1 light
           chain; Flags: Precursor
 gi|10185020|emb|CAC08809.1| cathepsin L [Canis lupus familiaris]
          Length = 333

 Score =  208 bits (529), Expect = 4e-51,   Method: Compositional matrix adjust.
 Identities = 130/338 (38%), Positives = 189/338 (55%), Gaps = 25/338 (7%)

Query: 10  LIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFN 69
           L ++  C   A+     + S+  ++ QWKA + R Y  + E  +R  +++ N+  +E  N
Sbjct: 5   LFLTALCLGIASAAPKFDQSLNAQWYQWKATHRRLYGMNEEGWRR-AVWEKNMKMIELHN 63

Query: 70  NA-AIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVN 128
              + G   +T+ +N F D+T +EF     GF+   H    K    P     +++P SV+
Sbjct: 64  REYSQGKHGFTMAMNAFGDMTNEEFRQVMNGFQNQKHKKG-KMFQEPLF---AEIPKSVD 119

Query: 129 WIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCY 181
           W EKG VTPVK QGQC       A  A+EG    K  +LVSLSEQ LVDC+    N GC 
Sbjct: 120 WREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRAQGNEGCN 179

Query: 182 GGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLL 241
           GG MD+AF+Y+  N G+ ++  Y Y G  T  C+  K E  AA  T + D+ P  E++L+
Sbjct: 180 GGLMDNAFRYVKDNGGLDSEESYPYLGRDTETCN-YKPECSAANDTGFVDL-PQREKALM 237

Query: 242 KAVAN-QPVSVAIDAS--ALQFYSGGV-FNGYCETF-LNHGVTAVGYG--TSEEGIKYWL 294
           KAVA   P+SVAIDA   + QFY  G+ F+  C +  L+HGV  VGYG   ++   K+W+
Sbjct: 238 KAVATLGPISVAIDAGHQSFQFYKSGIYFDPDCSSKDLDHGVLVVGYGFEGTDSNNKFWI 297

Query: 295 IKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
           +KNSWG +WG +GY ++ +D +     CGIA  AS+P 
Sbjct: 298 VKNSWGPEWGWNGYVKMAKDQNN---HCGIATAASYPT 332


>gi|355681656|gb|AER96815.1| Cathepsin L precursor [Mustela putorius furo]
          Length = 331

 Score =  208 bits (529), Expect = 4e-51,   Method: Compositional matrix adjust.
 Identities = 132/342 (38%), Positives = 181/342 (52%), Gaps = 32/342 (9%)

Query: 6   LIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAV 65
            +  L +    A+   Y++ D      ++ QWKA +G+ Y E+ E  +R  +++ NL  +
Sbjct: 6   FLAALCLGIVSAAPKLYQSLDA-----RWSQWKAAHGKLYDENEEGWRR-AVWEKNLKVI 59

Query: 66  ERFNNA-AIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVP 124
           ++ N   + G  S+T+ +N F DLT +EF     G K             PF    ++ P
Sbjct: 60  KQHNQEYSQGKHSFTMAMNAFGDLTNEEFKQVMNGLKSQKRKEGNVFQAPPF----AETP 115

Query: 125 PSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNN 177
            SV+W +KG VTPVK QG C       A  A+EG    K  RLVSLSEQ LVDC+  + N
Sbjct: 116 SSVDWRKKGYVTPVKNQGPCGSCWAFSATGALEGQMFRKTKRLVSLSEQNLVDCSQAEGN 175

Query: 178 NGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDE 237
            GC GG MD AF+Y+  N G+ ++  Y Y           K E  AA  T + D+ P +E
Sbjct: 176 EGCSGGLMDYAFQYVKDNGGLDSEESYPYRAQDESC--KYKPEQSAANDTGFMDIHP-EE 232

Query: 238 ESLLKAVAN-QPVSVAIDA--SALQFYSGGV-FNGYCETF-LNHGVTAVGYGT---SEEG 289
           ESL  AVA   P+S AIDA  S  QFY  G+ ++  C +  L+HG+  VGYG+     E 
Sbjct: 233 ESLKLAVATVGPISAAIDASLSTFQFYHKGIYYDPDCSSENLDHGILVVGYGSQGEDSEK 292

Query: 290 IKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFP 331
            KYW++KNSWG DWG  GY  + +D D     CGIA  ASFP
Sbjct: 293 QKYWIVKNSWGTDWGTQGYILMAKDRDN---HCGIATAASFP 331


>gi|355558399|gb|EHH15179.1| hypothetical protein EGK_01236 [Macaca mulatta]
 gi|380809986|gb|AFE76868.1| cathepsin S isoform 1 preproprotein [Macaca mulatta]
 gi|383416071|gb|AFH31249.1| cathepsin S isoform 1 preproprotein [Macaca mulatta]
 gi|383416073|gb|AFH31250.1| cathepsin S isoform 1 preproprotein [Macaca mulatta]
 gi|383416075|gb|AFH31251.1| cathepsin S isoform 1 preproprotein [Macaca mulatta]
 gi|383416077|gb|AFH31252.1| cathepsin S isoform 1 preproprotein [Macaca mulatta]
 gi|383416079|gb|AFH31253.1| cathepsin S isoform 1 preproprotein [Macaca mulatta]
          Length = 331

 Score =  208 bits (529), Expect = 4e-51,   Method: Compositional matrix adjust.
 Identities = 133/342 (38%), Positives = 187/342 (54%), Gaps = 32/342 (9%)

Query: 6   LIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAV 65
           L+ VL++  S  +Q       + ++   +  WK  YG+ YKE  E + R  I++ NL  V
Sbjct: 4   LVCVLLVCSSAVAQ----LHKDPTLDHHWHLWKKTYGKQYKEKNEEAVRRLIWEKNLKFV 59

Query: 66  ERFN-NAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQ-- 122
              N   ++G  SY L +N   D+T +E ++  +  ++    S  + N T   YKS+   
Sbjct: 60  MLHNLEHSMGMHSYDLGMNHLGDMTSEEVMSLMSSLRVP---SQWQRNIT---YKSNANQ 113

Query: 123 -VPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATN 174
            +P SV+W EKG VT VKYQG C       AV A+E    +K  +LVSLS Q LVDC+T 
Sbjct: 114 ILPDSVDWREKGCVTEVKYQGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTE 173

Query: 175 D-NNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVP 233
              N GC GGFM  AF+YII N GI +DA Y Y+           ++  AA  + Y ++P
Sbjct: 174 KYGNKGCNGGFMTRAFQYIIDNNGIDSDASYPYKATDQKC--QYDSKYRAATCSKYTELP 231

Query: 234 PNDEESLLKAVANQ-PVSVAIDASALQFY---SGGVFNGYCETFLNHGVTAVGYGTSEEG 289
              E+ L + VAN+ PVSV +DAS   F+   SG  +   C   +NHGV  VGYG    G
Sbjct: 232 YGREDVLKEVVANKGPVSVGVDASHPSFFLYRSGVYYEPSCTQNVNHGVLVVGYGVL-NG 290

Query: 290 IKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFP 331
            +YWL+KNSWG+++GE+GY R+ R+       CGIA F S+P
Sbjct: 291 KEYWLVKNSWGRNFGEEGYIRMARN---KGNHCGIASFPSYP 329


>gi|225706086|gb|ACO08889.1| Cathepsin S precursor [Osmerus mordax]
          Length = 333

 Score =  208 bits (529), Expect = 4e-51,   Method: Compositional matrix adjust.
 Identities = 123/312 (39%), Positives = 178/312 (57%), Gaps = 22/312 (7%)

Query: 34  FEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFN-NAAIGNRSYTLRLNKFADLTPQE 92
           ++ WK Q+G+ YK   E   R E+++ NL  +   N  A++G  +Y L +N   D+T +E
Sbjct: 30  WQMWKKQHGKNYKTEVEELGRREVWERNLQLISLHNLEASMGMHTYDLGMNHMGDMTEEE 89

Query: 93  FIASQTGFKMSDHSSSLKANGTPFLYKS-SQVPPSVNWIEKGAVTPVKYQGQC------- 144
            + S    K+    + LK   + F+  S + VP +V+W +KG VT VK QG C       
Sbjct: 90  ILQSFASLKVP---ADLKREPSAFVASSGTPVPDTVDWRQKGYVTQVKNQGSCGSCWAFS 146

Query: 145 AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVY 204
           +V A+EG       +L+ LS Q LVDC++   N GC GGFM +AF+Y+I NKGI +D  Y
Sbjct: 147 SVGALEGQLMRTTGKLLDLSPQNLVDCSSKYGNKGCNGGFMSEAFQYVIDNKGIDSDTSY 206

Query: 205 SYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVAN-QPVSVAIDASALQF--Y 261
            Y+G+  G C        +A  T Y  +P  DE +L +AVA   P+SVAIDA+   F  +
Sbjct: 207 PYQGVQ-GTC-HYNPSYRSANCTRYSFLPEGDETTLKQAVAMIGPISVAIDATRPSFILW 264

Query: 262 SGGVFNGY-CETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQG 320
             GV+N   C   +NH V  VGYGT  +G  YWL+KNSWG  +GE+GY R+ R+ +    
Sbjct: 265 RSGVYNDLTCTQKINHAVLVVGYGTL-DGQDYWLVKNSWGTRFGENGYIRMSRNRNN--- 320

Query: 321 QCGIAMFASFPV 332
           QCGIA++  +P+
Sbjct: 321 QCGIALYGCYPI 332


>gi|291383517|ref|XP_002708299.1| PREDICTED: cathepsin L1 [Oryctolagus cuniculus]
          Length = 333

 Score =  208 bits (529), Expect = 4e-51,   Method: Compositional matrix adjust.
 Identities = 131/324 (40%), Positives = 183/324 (56%), Gaps = 28/324 (8%)

Query: 25  FDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNA-AIGNRSYTLRLN 83
           FD+ ++  ++ QWKA + R Y  + E  +R  +++ N+  +E  N   + G   +T+ +N
Sbjct: 21  FDQ-NLDTQWYQWKATHKRLYGLNEEGWRR-AVWEKNMRMIELHNGEYSQGKHGFTMGMN 78

Query: 84  KFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQ 143
            + D+T +EF     GF+   H    K    P L    Q P SV+W EKG VTPVK QGQ
Sbjct: 79  AYGDMTNEEFRQVMNGFQNQKHKKG-KMFRDPLLL---QYPKSVDWREKGYVTPVKNQGQ 134

Query: 144 C-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNK 196
           C       A  A+EG    K  +L+SLSEQ LVDC+    N GC GG MD AF+Y+  N 
Sbjct: 135 CGSCWAFSATGALEGQMFQKTGKLISLSEQNLVDCSHPQGNQGCNGGLMDYAFQYVKDNS 194

Query: 197 GITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVAN-QPVSVAIDA 255
           G+ ++  Y YEGM  G C   K E   A  T + D+P + E++LL+AVA   P+S AIDA
Sbjct: 195 GLDSEESYPYEGMD-GTC-KYKPECSVANDTGFVDIPGH-EKALLRAVATVGPISAAIDA 251

Query: 256 SAL--QFYSGGV-FNGYCETF-LNHGVTAVGY---GTSEEGIKYWLIKNSWGQDWGEDGY 308
             +  QFY  G+ ++  C +  L+HG+  VGY   GT+    KYWL+KNSWG  WG++GY
Sbjct: 252 GHMSFQFYKSGIYYDPDCSSKDLDHGILVVGYGFEGTNSNATKYWLVKNSWGTTWGDEGY 311

Query: 309 FRLQRDIDQPQGQCGIAMFASFPV 332
            ++ RD D     CGIA  AS+P 
Sbjct: 312 VKIIRDKDN---HCGIATAASYPT 332


>gi|125592009|gb|EAZ32359.1| hypothetical protein OsJ_16569 [Oryza sativa Japonica Group]
          Length = 480

 Score =  208 bits (529), Expect = 4e-51,   Method: Compositional matrix adjust.
 Identities = 127/342 (37%), Positives = 178/342 (52%), Gaps = 39/342 (11%)

Query: 23  RTFDEGSIAEK----FEQWKAQYGRTYKES--AENSKRFEIFKDNLVAVERFNNAAIGNR 76
           R  +EG    +    ++ W A+ G     +   E+ +RF +F DNL  V+  N  A    
Sbjct: 37  RGLEEGPTEAEARAAYDLWLAENGGGSPNALGGEHERRFLVFWDNLKFVDAHNARADERG 96

Query: 77  SYTLRLNKFA---------DL---------------TPQEFIASQTGFKMSDHSSSLKAN 112
            + L +N+           DL                P        G +  +     +  
Sbjct: 97  GFRLGMNRLRRSHQRGVPRDLPRRQGRREEPRRRGEVPPRRGGGAAGVRRLEGEGRRRPR 156

Query: 113 GTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQCAVAAVEGINAIKINRLVSLSEQQLVDCA 172
             P   +S  V  SV +  +G+          AV+ VE IN +    +++LSEQ+LV+C+
Sbjct: 157 QEPGPMRSFSVHLSVKYFGQGSCWAFS-----AVSTVESINQLVTGEMITLSEQELVECS 211

Query: 173 TNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDV 232
           TN  N+GC GG MDDAF +II+N GI  +  Y Y+ +  G CD  +       I  +EDV
Sbjct: 212 TNGQNSGCNGGLMDDAFDFIIKNGGIDTEDDYPYKAVD-GKCDINRENAKVVSIDGFEDV 270

Query: 233 PPNDEESLLKAVANQPVSVAIDASA--LQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGI 290
           P NDE+SL KAVA+QPVSVAI+A     Q Y  GVF+G C T L+HGV AVGYGT + G 
Sbjct: 271 PQNDEKSLQKAVAHQPVSVAIEAGGREFQLYHSGVFSGRCGTSLDHGVVAVGYGT-DNGK 329

Query: 291 KYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
            YW+++NSWG  WGE GY R++R+I+   G+CGIAM AS+P 
Sbjct: 330 DYWIVRNSWGPKWGESGYVRMERNINVTTGKCGIAMMASYPT 371


>gi|151573014|gb|ABS17682.1| cathepsin L-1 [Artemia salina]
          Length = 334

 Score =  208 bits (529), Expect = 4e-51,   Method: Compositional matrix adjust.
 Identities = 131/342 (38%), Positives = 189/342 (55%), Gaps = 26/342 (7%)

Query: 5   FLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVA 64
           FL+  +++  S A   T        +A+++  +KA + + Y    E   R +I+ +N   
Sbjct: 4   FLLGAVLVQLSAALSLT------NLLADEWHLFKATHKKEYPSQLEEKFRMKIYLENKHK 57

Query: 65  VERFNNA-AIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSS-Q 122
           V + N     G +SY + +NKF DL   EF +   G++    +SS   +   F+  ++  
Sbjct: 58  VAKHNILYEKGEKSYHVAMNKFGDLLHHEFRSIMNGYQHKKQNSSRAESTFTFMEPANVT 117

Query: 123 VPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATND 175
           VP SV+W EKGA+TPVK QGQC       +  A+EG    K  +LVSLSEQ L+DC+   
Sbjct: 118 VPESVDWREKGAITPVKDQGQCGSCWAFSSTGALEGQTFRKTGKLVSLSEQNLIDCSGKY 177

Query: 176 NNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPN 235
            N GC GG MD AF+YI  NKGI  +  Y YE     +C      +  A    + D+P  
Sbjct: 178 GNEGCNGGLMDQAFQYIKDNKGIDTENTYPYEA-EDDVC-RYNPRNRGAVDRGFVDIPSG 235

Query: 236 DEESLLKAVAN-QPVSVAIDAS--ALQFYSGGV-FNGYCET-FLNHGVTAVGYGTSEEGI 290
           +E+ L  AVA   PVSVAIDAS  + QFYS GV +   C++  L+HGV  VGYG S+ G 
Sbjct: 236 EEDKLKAAVATVGPVSVAIDASHESFQFYSKGVYYEPSCDSDDLDHGVLVVGYG-SDNGK 294

Query: 291 KYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
            YWL+KNSW + WG++GY ++ R+    +  CG+A  AS+P+
Sbjct: 295 DYWLVKNSWSEHWGDEGYIKMARN---RKNHCGVASAASYPL 333


>gi|344953542|gb|AEN28617.1| cathepsin L-like cysteine protease [Epinephelus coioides]
          Length = 336

 Score =  207 bits (528), Expect = 4e-51,   Method: Compositional matrix adjust.
 Identities = 128/343 (37%), Positives = 186/343 (54%), Gaps = 25/343 (7%)

Query: 7   IVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVE 66
           ++ L +   C S A      +  + + +E WK+ + + Y E  E  +R  +++ NL  +E
Sbjct: 1   MLPLAVVALCLSAALSAPSLDPQLDDHWELWKSWHSKKYHEKEEGWRRM-VWEKNLKKIE 59

Query: 67  RFN-NAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKS-SQVP 124
             N   ++G  SY L +N F D+T +EF     G+K     +  KA G+ FL  +  + P
Sbjct: 60  LHNLEHSMGTHSYRLGMNHFGDMTHEEFRQLMNGYK---RKAETKARGSLFLEPNFLEAP 116

Query: 125 PSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNN 177
            SV+W + G VTPVK QGQC          A+EG +  K  +LVSLSEQ LVDC+  + N
Sbjct: 117 KSVDWRDNGYVTPVKDQGQCGSCWAFSTTGALEGQHFRKTGKLVSLSEQNLVDCSRPEGN 176

Query: 178 NGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDE 237
            GC GG MD AF+Y+  N+G+ ++  Y Y G     C       ++   T + D+P   E
Sbjct: 177 EGCNGGLMDQAFQYVKDNQGLDSEDSYPYLGTDDQPC-HYDPTYNSVNDTGFVDIPSGKE 235

Query: 238 ESLLKAVAN-QPVSVAIDA--SALQFYSGGV-FNGYCET-FLNHGVTAVGYGTSEE---G 289
            +L+KAVA   PVSVAIDA   + QFY  G+ +   C +  L+HGV  VGYG   E   G
Sbjct: 236 RALMKAVAAVGPVSVAIDAGHESFQFYQSGIYYEKECSSEELDHGVLVVGYGFQGEDVDG 295

Query: 290 IKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
            KYW++KNSW + WG+ GY  + +D    +  CGIA  AS+P+
Sbjct: 296 KKYWIVKNSWSEKWGDKGYIYMAKD---RKNHCGIATAASYPL 335


>gi|156739281|ref|NP_001096588.1| cathepsin L1-like precursor [Danio rerio]
 gi|166158351|ref|NP_001107526.1| uncharacterized protein LOC100135391 precursor [Xenopus (Silurana)
           tropicalis]
 gi|326672305|ref|XP_003199634.1| PREDICTED: cathepsin L1-like [Danio rerio]
 gi|156230096|gb|AAI52237.1| MGC174155 protein [Danio rerio]
 gi|163916362|gb|AAI57707.1| LOC100135391 protein [Xenopus (Silurana) tropicalis]
          Length = 335

 Score =  207 bits (528), Expect = 4e-51,   Method: Compositional matrix adjust.
 Identities = 126/345 (36%), Positives = 191/345 (55%), Gaps = 29/345 (8%)

Query: 4   YFLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLV 63
           + L+V L IS   A+ +     D+      +  WK+Q+G++Y E  E  +R  I+++NL 
Sbjct: 3   FALLVTLCISAVFAASSIDIQLDD-----HWNSWKSQHGKSYHEDVEVGRRM-IWEENLR 56

Query: 64  AVERFN-NAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSS- 121
            +E+ N   + GN ++ + +N+F D+T +EF  +  G+K   H  +  + G  F+  S  
Sbjct: 57  KIEQHNFEYSYGNHTFKMGMNQFGDMTNEEFRQAMNGYK---HDPNRTSQGPLFMEPSFF 113

Query: 122 QVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATN 174
             P  V+W ++G VTPVK Q QC       +  A+EG    K  +L+S+SEQ LVDC+  
Sbjct: 114 AAPQQVDWRQRGYVTPVKDQKQCGSCWSFSSTGALEGQLFRKTGKLISMSEQNLVDCSRP 173

Query: 175 DNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPP 234
             N GC GG MD AF+Y+ +NKG+ ++  Y Y       C       + A+IT + D+P 
Sbjct: 174 QGNQGCNGGIMDQAFQYVKENKGLDSEQSYPYLARDDLPC-RYDPRFNVAKITGFVDIPR 232

Query: 235 NDEESLLKAVAN-QPVSVAIDAS--ALQFYSGGV-FNGYCETFLNHGVTAVGY---GTSE 287
            +E +L+ AVA   PVSVAIDAS  +LQFY  G+ +   C + L+H V  VGY   G   
Sbjct: 233 GNELALMNAVAAVGPVSVAIDASHQSLQFYQSGIYYERACTSRLDHAVLVVGYGYQGADV 292

Query: 288 EGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
            G +YW++KNSW   WG+ GY  + +D +     CGIA  AS+P+
Sbjct: 293 AGNRYWIVKNSWSDKWGDKGYIYMAKDKNN---HCGIATMASYPL 334


>gi|125811033|ref|XP_001361727.1| GA25021 [Drosophila pseudoobscura pseudoobscura]
 gi|54636904|gb|EAL26307.1| GA25021 [Drosophila pseudoobscura pseudoobscura]
          Length = 341

 Score =  207 bits (528), Expect = 4e-51,   Method: Compositional matrix adjust.
 Identities = 132/349 (37%), Positives = 192/349 (55%), Gaps = 35/349 (10%)

Query: 6   LIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAV 65
           LI+ L+   + A   +Y       I E++  +K ++ + Y++  E   R +IF +N   +
Sbjct: 5   LILPLLALVAVAQAVSYAEV----IQEEWHTFKLEHRKNYQDETEERFRLKIFNENKHKI 60

Query: 66  ERFNNA-AIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKAN----GTPFLY-K 119
            + N   A G  S+ + +NK+AD+   EF ++  GF  + H     A+    G  F+  +
Sbjct: 61  AKHNQLWATGAVSFKMAVNKYADMLHHEFYSTMNGFNYTLHKQLRNADESFKGVTFISPE 120

Query: 120 SSQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCA 172
              +P  V+W  KGAVT VK QG C       +  A+EG +  K   LVSLSEQ LVDC+
Sbjct: 121 HVTLPKQVDWRTKGAVTDVKDQGHCGSCWAFSSTGALEGQHYRKSGVLVSLSEQNLVDCS 180

Query: 173 TNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICD----SIKAEDHAAQITN 228
           T   NNGC GG MD+AF+YI  N GI  +  Y YE +    C     +I A D       
Sbjct: 181 TKYGNNGCNGGLMDNAFRYIKDNGGIDTEKSYPYEAIDDS-CHFNKGTIGATDRG----- 234

Query: 229 YEDVPPNDEESLLKAVAN-QPVSVAIDAS--ALQFYSGGVFN-GYCET-FLNHGVTAVGY 283
           + D+P  +E+ + +AVA   PV+VAIDAS  + QFYS GV+N   C+   L+HGV  VG+
Sbjct: 235 FVDIPQGNEKKMAEAVATIGPVAVAIDASHESFQFYSEGVYNEPACDAQNLDHGVLVVGF 294

Query: 284 GTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
           GT E G  YWL+KNSWG  WG+ G+ ++ R+    + QCGIA  +S+P+
Sbjct: 295 GTDESGQDYWLVKNSWGTTWGDKGFIKMLRN---KENQCGIASASSYPL 340


>gi|342305192|dbj|BAK55650.1| cathepsin S [Oplegnathus fasciatus]
          Length = 337

 Score =  207 bits (528), Expect = 4e-51,   Method: Compositional matrix adjust.
 Identities = 129/341 (37%), Positives = 190/341 (55%), Gaps = 25/341 (7%)

Query: 5   FLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVA 64
            ++  L++   C   A    FD G +   +E WK  + + Y+   E+ +R E+++ NL+ 
Sbjct: 8   LMLGSLLLVSLCVGAAA--MFD-GRLDVHWELWKRTHEKKYQNEGEDVRRRELWEKNLML 64

Query: 65  VERFN-NAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKS-SQ 122
           +   N  A++G  +Y L +N   DLT +E + S   F      + ++   +PF   S + 
Sbjct: 65  ITMHNLEASMGLHTYELSMNHMGDLTQEEILQS---FATLSPPTDIQRAPSPFAGTSGAA 121

Query: 123 VPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATND 175
           VP +V+W EKG VT VK QG C       A  A+EG  A    +L+ LS Q LVDC++  
Sbjct: 122 VPDTVDWREKGCVTSVKMQGSCGSCWAFSAAGALEGQLAKTTGKLLDLSPQNLVDCSSKY 181

Query: 176 NNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPN 235
            N+GC GGFM  AF+Y+I N+GI +DA Y Y G S   C    A   AA  + Y  +P  
Sbjct: 182 GNHGCNGGFMHRAFQYVIDNQGIDSDASYPYTGQSQQ-CHYNPAY-RAANCSRYSFLPEG 239

Query: 236 DEESLLKAVAN-QPVSVAIDAS--ALQFYSGGVFNG-YCETFLNHGVTAVGYGTSEEGIK 291
           DE +L +A+A   P+SVAIDA+  +  FY  GV++   C   +NHGV AVGYGT   G  
Sbjct: 240 DEGALKEALATIGPISVAIDATRPSFTFYRSGVYDDQTCTRNVNHGVLAVGYGTL-NGKD 298

Query: 292 YWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
           YWL+KNSWG  +G+ G+ R+ R+ +    QCGIA++  +P+
Sbjct: 299 YWLVKNSWGSTFGDKGFIRMARNKND---QCGIALYGCYPI 336


>gi|115743|sp|P07154.2|CATL1_RAT RecName: Full=Cathepsin L1; AltName: Full=Cyclic protein 2;
           Short=CP-2; AltName: Full=Major excreted protein;
           Short=MEP; Contains: RecName: Full=Procathepsin L;
           Contains: RecName: Full=Cathepsin L1 heavy chain;
           Contains: RecName: Full=Cathepsin L1 light chain; Flags:
           Precursor
 gi|38648869|gb|AAH63175.1| Cathepsin L1 [Rattus norvegicus]
 gi|149029152|gb|EDL84437.1| cathepsin L, isoform CRA_a [Rattus norvegicus]
 gi|386267881|dbj|BAM14518.1| cathepsin L [Rattus norvegicus]
          Length = 334

 Score =  207 bits (528), Expect = 4e-51,   Method: Compositional matrix adjust.
 Identities = 126/324 (38%), Positives = 187/324 (57%), Gaps = 28/324 (8%)

Query: 25  FDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNA-AIGNRSYTLRLN 83
           FD+ +   ++ QWK+ + R Y  + E  +R  +++ N+  ++  N   + G   +T+ +N
Sbjct: 21  FDQ-TFNAQWHQWKSTHRRLYGTNEEEWRR-AVWEKNMRMIQLHNGEYSNGKHGFTMEMN 78

Query: 84  KFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQ 143
            F D+T +EF     G++   H    +    P +    Q+P +V+W EKG VTPVK QGQ
Sbjct: 79  AFGDMTNEEFRQIVNGYRHQKHKKG-RLFQEPLML---QIPKTVDWREKGCVTPVKNQGQ 134

Query: 144 C-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNK 196
           C       A   +EG   +K  +L+SLSEQ LVDC+ +  N GC GG MD AF+YI +N 
Sbjct: 135 CGSCWAFSASGCLEGQMFLKTGKLISLSEQNLVDCSHDQGNQGCNGGLMDFAFQYIKENG 194

Query: 197 GITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVAN-QPVSVAIDA 255
           G+ ++  Y YE    G C   +AE   A  T + D+ P  E++L+KAVA   P+SVA+DA
Sbjct: 195 GLDSEESYPYEA-KDGSC-KYRAEYAVANDTGFVDI-PQQEKALMKAVATVGPISVAMDA 251

Query: 256 S--ALQFYSGGV-FNGYCETF-LNHGVTAVGY---GTSEEGIKYWLIKNSWGQDWGEDGY 308
           S  +LQFYS G+ +   C +  L+HGV  VGY   GT     KYWL+KNSWG++WG DGY
Sbjct: 252 SHPSLQFYSSGIYYEPNCSSKDLDHGVLVVGYGYEGTDSNKDKYWLVKNSWGKEWGMDGY 311

Query: 309 FRLQRDIDQPQGQCGIAMFASFPV 332
            ++ +D +     CG+A  AS+P+
Sbjct: 312 IKIAKDRNN---HCGLATAASYPI 332


>gi|327263389|ref|XP_003216502.1| PREDICTED: cathepsin L1-like [Anolis carolinensis]
          Length = 339

 Score =  207 bits (528), Expect = 5e-51,   Method: Compositional matrix adjust.
 Identities = 133/350 (38%), Positives = 190/350 (54%), Gaps = 30/350 (8%)

Query: 1   MAKYFLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKD 60
           M  Y   + L +    A+ +      + ++ + ++ WK  + + Y +  E  +R  I++ 
Sbjct: 1   MKVYLCALALFLEACFAAPSL-----DSALDDHWQAWKTWHSKKYHQQEEGWRRM-IWEK 54

Query: 61  NLVAVERFN-NAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYK 119
           NL  ++  N + ++G  SY L +N F D+T +EF     G+K S   +  K  G+ FL  
Sbjct: 55  NLKMIQLHNLDHSLGKHSYRLGMNHFGDMTNEEFRQVMNGYKHS--KTEKKYRGSEFLEP 112

Query: 120 SSQV-PPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDC 171
           +  V P SV+W EKG VTPVK QGQC          ++EG +  K  +LVSLSEQ LVDC
Sbjct: 113 NFLVVPKSVDWREKGYVTPVKDQGQCGSCWAFSTTGSLEGQHFRKTGKLVSLSEQNLVDC 172

Query: 172 ATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYED 231
           +  + N GC GG MD AF+YI  N GI ++  Y Y       C   K+E +AA  T + D
Sbjct: 173 SRPEGNQGCNGGLMDQAFEYIADNGGIDSEESYPYIAKDDEDC-LYKSEFNAANDTGFVD 231

Query: 232 VPPNDEESLLKAVAN-QPVSVAIDA--SALQFYSGGV-FNGYCET-FLNHGVTAVGYG-- 284
           VP   E +L+KAVA   PVSVAIDA  S  QFY  G+ ++  C +  L+HGV  VGYG  
Sbjct: 232 VPEGHERALMKAVAAVGPVSVAIDASHSTFQFYESGIYYDPDCSSEELDHGVLVVGYGFE 291

Query: 285 --TSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
               +   KYW++KNSW   WG+ GY  + +D +     CGIA  AS+P+
Sbjct: 292 GTDDDNKKKYWIVKNSWSDKWGDKGYILMAKDRNN---HCGIATAASYPL 338


>gi|426219875|ref|XP_004004143.1| PREDICTED: cathepsin L1 [Ovis aries]
          Length = 333

 Score =  207 bits (528), Expect = 5e-51,   Method: Compositional matrix adjust.
 Identities = 129/348 (37%), Positives = 197/348 (56%), Gaps = 32/348 (9%)

Query: 1   MAKYFLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKD 60
           M    L+ VL +  + A+      FD  S+  ++E WKA + + Y  + E  ++  ++K 
Sbjct: 1   MNPSLLLTVLCLGIASAAP----KFDH-SLNTQWELWKAVHRKPYDLNEEGWRK-AVWKK 54

Query: 61  NLVAVERFNNA-AIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYK 119
           N+  +E  N   + G  S+++ +N F DLT +EF     GF+  ++      + T F   
Sbjct: 55  NMKMIELHNQEYSQGKHSFSMAMNAFGDLTSEEFRQMMNGFQRQENKKGKVFHETIF--- 111

Query: 120 SSQVPPSVNWIEKGAVTPVKYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCA 172
            + +PPSV+W EKG VTPVK QG+C          A+EG    K  +LVSLSEQ LVDC+
Sbjct: 112 -ASIPPSVDWREKGYVTPVKNQGKCGSCWAFSTTGALEGQMFRKTGKLVSLSEQNLVDCS 170

Query: 173 TNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDV 232
             + N GC+GG MD+AF+Y++   G+ ++  Y Y G+  G C+    ++ AA  T + D+
Sbjct: 171 QPEGNRGCHGGLMDNAFQYVLDVGGLDSEESYPYTGL-VGTCN-YNPKNSAANETGFVDL 228

Query: 233 PPNDEESLLKAVANQ-PVSVAIDAS--ALQFYSGGV-FNGYCET-FLNHGVTAVGY---G 284
            P  E +L+KAVA   P+SVA+DAS  + QFY  G+ +   C++  ++HGV  VGY   G
Sbjct: 229 -PKQENALMKAVATLGPISVAVDASNPSFQFYKSGIYYEPKCKSESVDHGVLVVGYGFEG 287

Query: 285 TSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
              +  KYWL+KNSWG+ WG +GY ++ +D +     CGIA  AS+P 
Sbjct: 288 ADSDDNKYWLVKNSWGKHWGINGYIKMAKDQNN---HCGIATMASYPT 332


>gi|344275470|ref|XP_003409535.1| PREDICTED: cathepsin S-like isoform 1 [Loxodonta africana]
          Length = 331

 Score =  207 bits (528), Expect = 5e-51,   Method: Compositional matrix adjust.
 Identities = 132/342 (38%), Positives = 189/342 (55%), Gaps = 27/342 (7%)

Query: 3   KYFLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNL 62
           K+ L V L+ S + A     R   + ++   ++ WK  Y + YKE  E   R  I++ NL
Sbjct: 2   KWLLWVALVCSSAMA-----RLHKDPTLDNHWDLWKKTYSKQYKEKNEEVARRLIWEKNL 56

Query: 63  VAVERFN-NAAIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSS 121
             V   N   ++G  SY L +N   D+T +E ++  +  ++    S  + N T     + 
Sbjct: 57  KFVMLHNLEHSMGMHSYDLSMNHLGDMTSEEVMSLMSSLRVP---SQWQRNVTFKSNPNQ 113

Query: 122 QVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATN 174
           ++P S++W EKG VT VKYQG C       AV A+E    +K  +LVSLS Q LVDC+  
Sbjct: 114 KLPDSLDWREKGCVTDVKYQGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSGE 173

Query: 175 D-NNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVP 233
             +N GC GGFM  AF+YII N GI ++A Y Y+  + G C     ++ AA  + Y ++P
Sbjct: 174 KYSNKGCNGGFMTRAFQYIIDNNGIDSEASYPYKA-TDGKC-QYDPKNRAATCSKYTELP 231

Query: 234 PNDEESLLKAVANQ-PVSVAIDASALQFY---SGGVFNGYCETFLNHGVTAVGYGTSEEG 289
              E++L +AVAN+ PVSV IDAS   F+   SG  ++  C   +NHGV  VGYG +  G
Sbjct: 232 YGSEDALKEAVANKGPVSVGIDASRPSFFLYKSGVYYDPSCTDNVNHGVLVVGYG-NLNG 290

Query: 290 IKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFP 331
             YWL+KNSWG ++GE GY R+ R+       CGIA F S+P
Sbjct: 291 KDYWLVKNSWGLNFGEQGYIRMARN---SGNHCGIASFPSYP 329


>gi|405966498|gb|EKC31776.1| Cathepsin L [Crassostrea gigas]
          Length = 330

 Score =  207 bits (528), Expect = 5e-51,   Method: Compositional matrix adjust.
 Identities = 127/307 (41%), Positives = 180/307 (58%), Gaps = 26/307 (8%)

Query: 41  YGRTYKESAENSKRFEIFKDNLVAVERFNNAA-IGNRSYTLRLNKFADLTPQEFIASQTG 99
           +G+ Y  + E ++R  I++ NL  +E+ N AA  G+ S+ L +N++ D+T +EF ++  G
Sbjct: 34  HGKQYG-AEEEARRRVIWEGNLDYIEKHNLAADRGDYSFWLGMNEYGDMTNEEFRSTMNG 92

Query: 100 FKMSDHSSSLKANGTPFLYKSS--QVPPSVNWIEKGAVTPVKYQGQC-------AVAAVE 150
           +KM + +S     G+ +L  S+   +P +V+W  KG VTP+K QGQC       A  ++E
Sbjct: 93  YKMRNGTS----RGSLYLPPSNIGDLPDTVDWRPKGYVTPIKNQGQCGSCWSFSATGSLE 148

Query: 151 GINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMS 210
           G    K  +L SLSEQ LVDC+    N+GC GG MDDAF+YI  N GI  ++ Y YE   
Sbjct: 149 GQTFKKTGKLPSLSEQNLVDCSQKQGNHGCQGGLMDDAFQYIKDNSGIDTESSYPYEA-K 207

Query: 211 TGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVAN-QPVSVAIDASAL--QFYSGGVFN 267
            G C    A +  A  + + D+    E  L  AVA   P+SVAIDAS +  Q Y  GV++
Sbjct: 208 NGKC-RFNAANVGATDSGFTDIKSKSESDLQSAVATVGPISVAIDASHMSFQLYRSGVYH 266

Query: 268 GY--CETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIA 325
            +   ET L+HGV AVGYGT E G  YWL+KNSWG+ WG+ GY  + R+    +  CGIA
Sbjct: 267 EFFCSETRLDHGVLAVGYGT-ESGKDYWLVKNSWGESWGQKGYIMMSRN---KRNNCGIA 322

Query: 326 MFASFPV 332
             AS+P 
Sbjct: 323 TSASYPT 329


>gi|244539471|dbj|BAH82657.1| cysteine protease [Lotus japonicus]
          Length = 286

 Score =  207 bits (528), Expect = 5e-51,   Method: Compositional matrix adjust.
 Identities = 116/254 (45%), Positives = 153/254 (60%), Gaps = 16/254 (6%)

Query: 30  IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLT 89
           + E FE W +++G+ Y+   E   RFEIFKDNL  ++  N       +Y L LN+FADL+
Sbjct: 4   LIELFESWMSRHGKIYESIEEKLLRFEIFKDNLKHIDETNKVV---SNYWLGLNEFADLS 60

Query: 90  PQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQCA---- 145
             EF     G K+    S+ + +   F Y+   +P SV+W +KGAVT +K QG C     
Sbjct: 61  HHEFKKQYLGLKVD--FSTRRESSEEFTYRDVDLPKSVDWRKKGAVTNIKNQGSCGSCWA 118

Query: 146 ---VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDA 202
              VAAVEGIN I    L SLSEQ+L+DC     N+GC GG MD AF +I++N G+  + 
Sbjct: 119 FSTVAAVEGINQIVTGNLTSLSEQELIDCDRT-YNSGCNGGLMDYAFSFIVENGGLHKED 177

Query: 203 VYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA--LQF 260
            Y Y  M  G C+  K E     I+ Y DVP N+E+SLLKA+ANQP+SVAI+AS    QF
Sbjct: 178 DYPYI-MEEGTCEMSKEESQVVTISGYHDVPQNNEQSLLKALANQPLSVAIEASGRDFQF 236

Query: 261 YSGGVFNGYCETFL 274
           YSGGVF+G+C T L
Sbjct: 237 YSGGVFDGHCGTQL 250


>gi|81542|pir||S02728 actinidain (EC 3.4.22.14) precursor (clone pAC.1) - kiwi fruit
           (fragment)
 gi|15957|emb|CAA31435.1| actinidin precursor [Actinidia chinensis]
 gi|166319|gb|AAA32630.1| actinidin precursor [Actinidia deliciosa]
 gi|226542|prf||1601514A actinidin
          Length = 302

 Score =  207 bits (527), Expect = 5e-51,   Method: Compositional matrix adjust.
 Identities = 118/268 (44%), Positives = 157/268 (58%), Gaps = 16/268 (5%)

Query: 75  NRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPS-VNWIEKG 133
           NRSY + LN+FADLT +EF ++  GF    + + +     P   + SQV PS V+W   G
Sbjct: 12  NRSYKVGLNQFADLTGEEFRSTYLGFTGGSNKTKVSNRYEP---RVSQVLPSYVDWRSAG 68

Query: 134 AVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMD 186
           AV  +K QG+C       A+A VEGIN I    L+SLSEQ+L+ C    N  GC GG++ 
Sbjct: 69  AVVDIKSQGECGGCWAFSAIATVEGINKIVTGVLISLSEQELIGCGGTQNTRGCNGGYIT 128

Query: 187 DAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVAN 246
           D F++II N GI     Y Y     G C+     +    I  Y +VP N+E +L  AV  
Sbjct: 129 DGFQFIINNGGINTGENYPYTAQD-GECNLDLQNEKYVTIDTYGNVPYNNEWALQTAVTY 187

Query: 247 QPVSVAIDAS--ALQFYSGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWG 304
           QPVSVA+DA+  A + YS G+F G C T ++H VT VGYGT E GI YW+++NSW   WG
Sbjct: 188 QPVSVALDAAGDAFKHYSSGIFTGPCGTAIDHAVTIVGYGT-EGGIDYWIVENSWDTTWG 246

Query: 305 EDGYFRLQRDIDQPQGQCGIAMFASFPV 332
           E+GY R+ R++    G CGIA   S+PV
Sbjct: 247 EEGYMRILRNVGG-AGTCGIATMPSYPV 273


>gi|348565223|ref|XP_003468403.1| PREDICTED: cathepsin L1-like [Cavia porcellus]
          Length = 333

 Score =  207 bits (527), Expect = 6e-51,   Method: Compositional matrix adjust.
 Identities = 142/348 (40%), Positives = 187/348 (53%), Gaps = 32/348 (9%)

Query: 1   MAKYFLIVVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKD 60
           M   F++  L +    A     +T D      +++QWKA +GR Y  + E  +R  +++ 
Sbjct: 1   MTPSFVLAALCLGIVSALPKLDQTLDA-----QWDQWKAAHGRLYGLNEEGWRR-AVWEK 54

Query: 61  NLVAVERFNNA-AIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYK 119
           NL  +E  N   + G  S+TL +N F D+T +EF     GF+   H +  K    P L  
Sbjct: 55  NLRMIELHNGEYSQGRHSFTLGMNHFGDMTNEEFRQVMNGFQHQKHKTG-KMYQEPLLL- 112

Query: 120 SSQVPPSVNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCA 172
             Q+P SV+W EKG VT VK QGQC       A  ++EG    K   LVSLSEQ LVDC+
Sbjct: 113 --QLPKSVDWREKGYVTEVKNQGQCGSCWAFSATGSLEGQMFHKTGNLVSLSEQNLVDCS 170

Query: 173 TNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDV 232
               N GC GG MD AF+Y+  NKG+  +  Y Y G   G C   K E  AA  T + DV
Sbjct: 171 RPQGNQGCNGGLMDFAFQYVKDNKGLEAEKSYPYVG-KDGEC-KYKPELSAANDTGFVDV 228

Query: 233 PPNDEESLLKAVAN-QPVSVAIDAS--ALQFYSGGVF--NGYCETFLNHGVTAVGYGT-- 285
            P  E+ + KA+A   P+SVAIDA   + QFY  G++   G     LNHGV  VGYGT  
Sbjct: 229 -PQREKVVQKALATVGPLSVAIDAGLQSFQFYKEGIYYDPGCSSRDLNHGVLLVGYGTDA 287

Query: 286 SEEGI-KYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
           SE G   YWLIKNSWG  WG DGY ++ R+ +     CG+A  AS+P+
Sbjct: 288 SETGKGDYWLIKNSWGTTWGADGYVKIARNRNN---HCGVATAASYPL 332


>gi|12847813|dbj|BAB27719.1| unnamed protein product [Mus musculus]
          Length = 334

 Score =  207 bits (527), Expect = 6e-51,   Method: Compositional matrix adjust.
 Identities = 130/324 (40%), Positives = 185/324 (57%), Gaps = 28/324 (8%)

Query: 25  FDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNA-AIGNRSYTLRLN 83
           FD+   AE + QWK+ + R Y  + E  +R  I++ N+  ++  N   + G   +++ +N
Sbjct: 21  FDQTFSAE-WHQWKSTHRRLYGTNEEEWRR-AIWEKNMRMIQLHNGEYSNGQHGFSMEMN 78

Query: 84  KFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQ 143
            F D+T +EF     G++   H    +    P + K   +P SV+W EKG VTPVK QGQ
Sbjct: 79  AFGDMTNEEFRQVVNGYRHQKHKKG-RLFQEPLMLK---IPKSVDWREKGCVTPVKNQGQ 134

Query: 144 C-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNK 196
           C       A   +EG   +K  +L+SLSEQ LVDC+    N GC GG MD AF+YI +N 
Sbjct: 135 CGSCWAFSASGCLEGQMFLKTGKLISLSEQNLVDCSHAQGNQGCNGGLMDYAFQYIKENG 194

Query: 197 GITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVAN-QPVSVAIDA 255
           G+ ++  Y YE    G C   +AE   A  T + D+ P  E++L+KAVA   P+SVA+DA
Sbjct: 195 GLDSEESYPYEA-KDGSC-KYRAEFAVANDTGFVDI-PQQEKALMKAVATVGPISVAMDA 251

Query: 256 S--ALQFYSGGV-FNGYCETF-LNHGVTAVGY---GTSEEGIKYWLIKNSWGQDWGEDGY 308
           S  +LQFYS G+ +   C +  L+HGV  VGY   GT     KYWL+KNSWG +WG +GY
Sbjct: 252 SHPSLQFYSSGIYYEPNCSSKNLDHGVLLVGYGYEGTDSNKNKYWLVKNSWGSEWGMEGY 311

Query: 309 FRLQRDIDQPQGQCGIAMFASFPV 332
            ++ +D D     CG+A  AS+PV
Sbjct: 312 IKIAKDRDN---HCGLATAASYPV 332


>gi|6753558|ref|NP_034114.1| cathepsin L1 preproprotein [Mus musculus]
 gi|115742|sp|P06797.2|CATL1_MOUSE RecName: Full=Cathepsin L1; AltName: Full=Major excreted protein;
           Short=MEP; AltName: Full=p39 cysteine proteinase;
           Contains: RecName: Full=Cathepsin L1 heavy chain;
           Contains: RecName: Full=Cathepsin L1 light chain; Flags:
           Precursor
 gi|53047|emb|CAA29470.1| unnamed protein product [Mus musculus]
 gi|309186|gb|AAA37445.1| preprocysteine proteinase [Mus musculus]
 gi|12832050|dbj|BAB21945.1| unnamed protein product [Mus musculus]
 gi|26340196|dbj|BAC33761.1| unnamed protein product [Mus musculus]
 gi|45768760|gb|AAH68163.1| Cathepsin L [Mus musculus]
 gi|74139700|dbj|BAE31701.1| unnamed protein product [Mus musculus]
 gi|74146632|dbj|BAE41323.1| unnamed protein product [Mus musculus]
 gi|74151584|dbj|BAE41141.1| unnamed protein product [Mus musculus]
 gi|74185397|dbj|BAE30172.1| unnamed protein product [Mus musculus]
 gi|74197196|dbj|BAE35143.1| unnamed protein product [Mus musculus]
 gi|74203006|dbj|BAE26206.1| unnamed protein product [Mus musculus]
 gi|74219606|dbj|BAE29572.1| unnamed protein product [Mus musculus]
 gi|148684295|gb|EDL16242.1| cathepsin L [Mus musculus]
          Length = 334

 Score =  207 bits (527), Expect = 7e-51,   Method: Compositional matrix adjust.
 Identities = 130/324 (40%), Positives = 185/324 (57%), Gaps = 28/324 (8%)

Query: 25  FDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNA-AIGNRSYTLRLN 83
           FD+   AE + QWK+ + R Y  + E  +R  I++ N+  ++  N   + G   +++ +N
Sbjct: 21  FDQTFSAE-WHQWKSTHRRLYGTNEEEWRR-AIWEKNMRMIQLHNGEYSNGQHGFSMEMN 78

Query: 84  KFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQ 143
            F D+T +EF     G++   H    +    P + K   +P SV+W EKG VTPVK QGQ
Sbjct: 79  AFGDMTNEEFRQVVNGYRHQKHKKG-RLFQEPLMLK---IPKSVDWREKGCVTPVKNQGQ 134

Query: 144 C-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNK 196
           C       A   +EG   +K  +L+SLSEQ LVDC+    N GC GG MD AF+YI +N 
Sbjct: 135 CGSCWAFSASGCLEGQMFLKTGKLISLSEQNLVDCSHAQGNQGCNGGLMDFAFQYIKENG 194

Query: 197 GITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVAN-QPVSVAIDA 255
           G+ ++  Y YE    G C   +AE   A  T + D+ P  E++L+KAVA   P+SVA+DA
Sbjct: 195 GLDSEESYPYEA-KDGSC-KYRAEFAVANDTGFVDI-PQQEKALMKAVATVGPISVAMDA 251

Query: 256 S--ALQFYSGGV-FNGYCETF-LNHGVTAVGY---GTSEEGIKYWLIKNSWGQDWGEDGY 308
           S  +LQFYS G+ +   C +  L+HGV  VGY   GT     KYWL+KNSWG +WG +GY
Sbjct: 252 SHPSLQFYSSGIYYEPNCSSKNLDHGVLLVGYGYEGTDSNKNKYWLVKNSWGSEWGMEGY 311

Query: 309 FRLQRDIDQPQGQCGIAMFASFPV 332
            ++ +D D     CG+A  AS+PV
Sbjct: 312 IKIAKDRDN---HCGLATAASYPV 332


>gi|74149661|dbj|BAE36450.1| unnamed protein product [Mus musculus]
          Length = 334

 Score =  207 bits (526), Expect = 7e-51,   Method: Compositional matrix adjust.
 Identities = 130/324 (40%), Positives = 185/324 (57%), Gaps = 28/324 (8%)

Query: 25  FDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNA-AIGNRSYTLRLN 83
           FD+   AE + QWK+ + R Y  + E  +R  I++ N+  ++  N   + G   +++ +N
Sbjct: 21  FDQTFSAE-WHQWKSTHRRLYGTNEEEWRR-AIWEKNMRMIQLHNGEYSNGQHGFSMEMN 78

Query: 84  KFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQ 143
            F D+T +EF     G++   H    +    P + K   +P SV+W EKG VTPVK QGQ
Sbjct: 79  AFGDMTNEEFRQVVNGYRHQKHKKG-RLFQEPLMLK---IPKSVDWREKGCVTPVKNQGQ 134

Query: 144 C-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNK 196
           C       A   +EG   +K  +L+SLSEQ LVDC+    N GC GG MD AF+YI +N 
Sbjct: 135 CGSCWAFSASGCLEGQMFLKTGKLISLSEQNLVDCSHAQGNQGCNGGLMDFAFQYIKENG 194

Query: 197 GITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVAN-QPVSVAIDA 255
           G+ ++  Y YE    G C   +AE   A  T + D+ P  E++L+KAVA   P+SVA+DA
Sbjct: 195 GLDSEESYPYEA-KDGSC-KYRAEFAVANGTGFVDI-PQQEKALMKAVATVGPISVAMDA 251

Query: 256 S--ALQFYSGGV-FNGYCETF-LNHGVTAVGY---GTSEEGIKYWLIKNSWGQDWGEDGY 308
           S  +LQFYS G+ +   C +  L+HGV  VGY   GT     KYWL+KNSWG +WG +GY
Sbjct: 252 SHPSLQFYSSGIYYEPNCSSKNLDHGVLLVGYGYEGTDSNKNKYWLVKNSWGSEWGMEGY 311

Query: 309 FRLQRDIDQPQGQCGIAMFASFPV 332
            ++ +D D     CG+A  AS+PV
Sbjct: 312 IKIAKDRDN---HCGLATAASYPV 332


>gi|355681664|gb|AER96818.1| cathepsin S [Mustela putorius furo]
          Length = 338

 Score =  207 bits (526), Expect = 7e-51,   Method: Compositional matrix adjust.
 Identities = 131/328 (39%), Positives = 184/328 (56%), Gaps = 22/328 (6%)

Query: 17  ASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFN-NAAIGN 75
           +S A  +  ++ ++   +  WK  YGR Y+E  E   R  I++ NL +V   N   ++G 
Sbjct: 19  SSYAVAQVQNDPTLDHHWNLWKKTYGRQYQEKNEEVARRLIWEKNLKSVMLHNLEYSMGM 78

Query: 76  RSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAV 135
            SY L +N  AD+T +E  +  +  ++    S  +AN T     + ++P SV+W EKG V
Sbjct: 79  HSYDLGMNHLADMTSEEVSSLMSSLRVP---SQWQANVTYKSNSNQKLPDSVDWREKGCV 135

Query: 136 TPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATND-NNNGCYGGFMDD 187
           T VKYQG C       AV A+E    +K   LVSLS Q LVDC+T    N GC GGFM  
Sbjct: 136 TEVKYQGACGACWAFSAVGALEAQLKLKTGNLVSLSAQNLVDCSTERYGNKGCNGGFMTK 195

Query: 188 AFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQ 247
           AF+YII N GI ++  Y Y+ M  G C    ++  AA  + Y ++P   E++L +AVAN+
Sbjct: 196 AFQYIIDNNGIDSEVSYPYKAMD-GNC-RYDSKHRAATCSKYTELPFGSEDALKEAVANK 253

Query: 248 -PVSVAIDASALQFY---SGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDW 303
            PVSVAIDA    F+   SG  ++  C   +NHGV  VGYG +  G  YWL+KNSWG ++
Sbjct: 254 GPVSVAIDAKHSSFFLYKSGVYYDPSCTQNVNHGVLVVGYG-NLNGRDYWLVKNSWGLNF 312

Query: 304 GEDGYFRLQRDIDQPQGQCGIAMFASFP 331
           GE GY R+ R+       CGIA + S+P
Sbjct: 313 GEQGYIRMARN---SGNHCGIASYPSYP 337


>gi|344271616|ref|XP_003407633.1| PREDICTED: cathepsin L1-like [Loxodonta africana]
          Length = 334

 Score =  207 bits (526), Expect = 7e-51,   Method: Compositional matrix adjust.
 Identities = 127/341 (37%), Positives = 186/341 (54%), Gaps = 26/341 (7%)

Query: 8   VVLIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVER 67
           + L ++  C   A+     + S+  ++ QW++ Y + Y  + E+ +R  +++ N+  +ER
Sbjct: 3   LSLFLAALCLGVASAAPKLDQSLDVQWNQWRSTYKKPYAVNEEDWRR-AVWEKNVKMIER 61

Query: 68  FNNA-AIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPS 126
            N   + G   +T+ +N F D+T +EF     GF+   H    K    P       +P S
Sbjct: 62  HNQEYSQGKHGFTMAMNAFGDMTNEEFRQVMNGFQNQKHKKG-KLFYEPVF---GHIPTS 117

Query: 127 VNWIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNG 179
           V+W +KG VTPVK QGQC       A  A+EG    K  +LVSLSEQ LVDC+  + N G
Sbjct: 118 VDWTQKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRREGNEG 177

Query: 180 CYGGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEES 239
           C GG MD+AF+Y+  N G+ ++  Y Y    T  C+  K E  AA  T + D+P   E++
Sbjct: 178 CNGGLMDNAFQYVQDNGGLDSEESYPYLATDTHTCN-YKPECSAANDTGFVDIPQR-EKA 235

Query: 240 LLKAVAN-QPVSVAIDA--SALQFYSGGVF--NGYCETFLNHGVTAVGY---GTSEEGIK 291
           L+KAVA   P+SVAIDA   + QFY  G++   G     L+HGV  VGY   G   E  K
Sbjct: 236 LMKAVATVGPISVAIDAGHESFQFYKSGIYYEPGCSSKDLDHGVLLVGYGFEGKDSENNK 295

Query: 292 YWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
           +W++KNSWG  WG +GY ++ +D +     CGIA  AS+P 
Sbjct: 296 FWIVKNSWGTSWGTNGYVKMAKDQNN---HCGIATAASYPT 333


>gi|119389039|pdb|2C0Y|A Chain A, The Crystal Structure Of A Cys25ala Mutant Of Human
           Procathepsin S
          Length = 315

 Score =  207 bits (526), Expect = 8e-51,   Method: Compositional matrix adjust.
 Identities = 130/319 (40%), Positives = 178/319 (55%), Gaps = 28/319 (8%)

Query: 29  SIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFN-NAAIGNRSYTLRLNKFAD 87
           ++   +  WK  YG+ YKE  E + R  I++ NL  V   N   ++G  SY L +N   D
Sbjct: 7   TLDHHWHLWKKTYGKQYKEKNEEAVRRLIWEKNLKFVMLHNLEHSMGMHSYDLGMNHLGD 66

Query: 88  LTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQ---VPPSVNWIEKGAVTPVKYQGQC 144
           +T +E ++  +  ++    S  + N T   YKS+    +P SV+W EKG VT VKYQG C
Sbjct: 67  MTSEEVMSLMSSLRVP---SQWQRNIT---YKSNPNRILPDSVDWREKGCVTEVKYQGSC 120

Query: 145 -------AVAAVEGINAIKINRLVSLSEQQLVDCATND-NNNGCYGGFMDDAFKYIIQNK 196
                  AV A+E    +K  +LVSLS Q LVDC+T    N GC GGFM  AF+YII NK
Sbjct: 121 GAAWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTEKYGNKGCNGGFMTTAFQYIIDNK 180

Query: 197 GITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQ-PVSVAIDA 255
           GI +DA Y Y+ M         ++  AA  + Y ++P   E+ L +AVAN+ PVSV +DA
Sbjct: 181 GIDSDASYPYKAMDQKC--QYDSKYRAATCSKYTELPYGREDVLKEAVANKGPVSVGVDA 238

Query: 256 SALQFY---SGGVFNGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQ 312
               F+   SG  +   C   +NHGV  VGYG    G +YWL+KNSWG ++GE+GY R+ 
Sbjct: 239 RHPSFFLYRSGVYYEPSCTQNVNHGVLVVGYG-DLNGKEYWLVKNSWGHNFGEEGYIRMA 297

Query: 313 RDIDQPQGQCGIAMFASFP 331
           R+       CGIA F S+P
Sbjct: 298 RN---KGNHCGIASFPSYP 313


>gi|18308182|gb|AAL67857.1|AF462309_1 cysteine proteinase [Acanthamoeba healyi]
          Length = 330

 Score =  207 bits (526), Expect = 8e-51,   Method: Compositional matrix adjust.
 Identities = 127/326 (38%), Positives = 177/326 (54%), Gaps = 26/326 (7%)

Query: 25  FDEGSIAEKFEQWKAQYGRTYKESAENSKRFE------IFKDNLVAVERFNNAAIGNRSY 78
           F   ++A   +     + +  +E+ +++ RF       I++ N+   E  N     N+SY
Sbjct: 14  FVASTLAATHDPLTGVFAKWMRENTKSNYRFVYSNEEFIYRWNVWRDEEHNRQ---NKSY 70

Query: 79  TLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPV 138
            L +N+F DLT  EF     G    D+S   K +       ++ +P   +W +KGAVT V
Sbjct: 71  FLAMNQFGDLTNAEFNRLFKGLAF-DYSKHAKIHTAAPEAPATGIPSEFDWRQKGAVTHV 129

Query: 139 KYQGQCA-------VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKY 191
           K QGQC          + EG N +K  RLVSLSEQ L+DC+ +  NNGC GG MD AF+Y
Sbjct: 130 KNQGQCGSCWSFSTTGSTEGANFLKTGRLVSLSEQNLIDCSVSYGNNGCNGGLMDYAFEY 189

Query: 192 IIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSV 251
           II N+GI  +A Y Y+      C    A +    +T Y DV   DE +LL A   +PVSV
Sbjct: 190 IINNRGIDTEASYPYQTAGPLTCQ-YNAANKGGSLTGYTDVTSGDENALLNAAVKEPVSV 248

Query: 252 AIDAS--ALQFYSGGVF--NGYCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDG 307
           AIDAS  + QFYSGGV+  +    T L+HGV  VG+G SE G  +W +KNSWG  WG +G
Sbjct: 249 AIDASHNSFQFYSGGVYYESACSSTQLDHGVLVVGWG-SENGQDFWWVKNSWGASWGLNG 307

Query: 308 YFRLQRDIDQPQGQCGIAMFASFPVS 333
           Y ++ R+ +     CGIA  AS+P +
Sbjct: 308 YIKMSRNQNN---NCGIATAASYPTA 330


>gi|351629617|gb|AEQ54772.1| KDEL-tailed cysteine proteinase CP4, partial [Coffea canephora]
          Length = 215

 Score =  207 bits (526), Expect = 8e-51,   Method: Compositional matrix adjust.
 Identities = 108/198 (54%), Positives = 136/198 (68%), Gaps = 7/198 (3%)

Query: 146 VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYS 205
           V  VEGIN IK  +LVSLSEQ+LVDC T+  N GC GG M++A+++I ++ GIT + +Y 
Sbjct: 12  VVGVEGINKIKTGQLVSLSEQELVDCETD--NEGCNGGLMENAYEFIKKSGGITTERLYP 69

Query: 206 YEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQPVSVAIDASA--LQFYSG 263
           Y+    G CDS K    A  I  +E VP NDE +L+KAVANQPVSVAIDAS   +QFYS 
Sbjct: 70  YKARD-GSCDSSKMNAPAVTIDGHEMVPANDENALMKAVANQPVSVAIDASGSDMQFYSE 128

Query: 264 GVFNG-YCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQ- 321
           GV+ G  C   L+HGV  VGYGT+ +G KYW++KNSWG  WGE GY R+QR +D  +G  
Sbjct: 129 GVYTGDSCGNELDHGVAVVGYGTALDGTKYWIVKNSWGTGWGEQGYIRMQRGVDAAEGGV 188

Query: 322 CGIAMFASFPVSKESAQP 339
           CGIAM AS+P+   S  P
Sbjct: 189 CGIAMEASYPLKLSSHNP 206


>gi|309380130|gb|ADO65978.1| cathepsin L [Eriocheir sinensis]
 gi|309380134|gb|ADO65980.1| cathepsin L [Eriocheir sinensis]
          Length = 325

 Score =  207 bits (526), Expect = 9e-51,   Method: Compositional matrix adjust.
 Identities = 125/316 (39%), Positives = 187/316 (59%), Gaps = 28/316 (8%)

Query: 33  KFEQWKAQYGRTYKESAENSKRFEIFKDNLVAV----ERFNNAAIGNRSYTLRLNKFADL 88
           +++Q+KA+YG+ Y+ + E+S R  +++ N   +    E++ N  +   S+TL +N+F D+
Sbjct: 21  EWQQFKARYGKQYRSTKEDSYRQSVYEQNQEFINSHNEQYENGLV---SFTLAMNQFGDM 77

Query: 89  TPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQC---- 144
           T +E  A+  GF  +         GT +     ++P +V+W +KGAVTPVK Q  C    
Sbjct: 78  TTEEINAAMNGFLSAGKKV---PRGTMYQPLVDELPDTVDWRDKGAVTPVKDQKACGSCW 134

Query: 145 ---AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITND 201
              A  ++EG + +   +LVSLSEQ LVDC+    N GC GG MD+AF+YI  N GI  +
Sbjct: 135 AFSATGSLEGQHFLSTGKLVSLSEQNLVDCSDKYGNFGCGGGLMDNAFRYIKDNNGIDTE 194

Query: 202 AVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQ-PVSVAIDA--SAL 258
             Y YE    G C    +++  A +++Y D+    E+ L KAVA + PVSVAIDA  S  
Sbjct: 195 ESYPYEA-KNGPC-RFNSDNVGATLSSYVDIQHGSEDDLQKAVAEKGPVSVAIDASTSTF 252

Query: 259 QFYSGGV-FNGYC-ETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDID 316
            FYS G+ ++  C  +FL+HGV AVGYGT ++   YWL+KNSW + WG+ GY ++ R+ +
Sbjct: 253 HFYSRGIYYDEKCSSSFLDHGVLAVGYGT-DDSSDYWLVKNSWNETWGDSGYIKMSRNRN 311

Query: 317 QPQGQCGIAMFASFPV 332
                CGIA  AS+PV
Sbjct: 312 N---NCGIASQASYPV 324


>gi|4886998|gb|AAD32136.1|AF121837_1 cathepsin L [Mus musculus]
 gi|4887000|gb|AAD32137.1|AF121838_1 cathepsin L [Mus musculus]
 gi|4887002|gb|AAD32138.1|AF121839_1 cathepsin L [Mus musculus]
 gi|200501|gb|AAA39984.1| preprocathepsin L precursor [Mus musculus]
          Length = 334

 Score =  206 bits (525), Expect = 9e-51,   Method: Compositional matrix adjust.
 Identities = 130/324 (40%), Positives = 185/324 (57%), Gaps = 28/324 (8%)

Query: 25  FDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNA-AIGNRSYTLRLN 83
           FD+   AE + QWK+ + R Y  + E  +R  I++ N+  ++  N   + G   +++ +N
Sbjct: 21  FDQTFSAE-WHQWKSTHRRLYGTNEEEWRR-AIWEKNMRIIQLHNGEYSNGQHGFSMEMN 78

Query: 84  KFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQ 143
            F D+T +EF     G++   H    +    P + K   +P SV+W EKG VTPVK QGQ
Sbjct: 79  AFGDMTNEEFRQVVNGYRHQKHKKG-RLFQEPLMLK---IPKSVDWREKGCVTPVKNQGQ 134

Query: 144 C-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNK 196
           C       A   +EG   +K  +L+SLSEQ LVDC+    N GC GG MD AF+YI +N 
Sbjct: 135 CGSCWAFSASGCLEGQMFLKTGKLISLSEQNLVDCSHAQGNQGCNGGLMDFAFQYIKENG 194

Query: 197 GITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVAN-QPVSVAIDA 255
           G+ ++  Y YE    G C   +AE   A  T + D+ P  E++L+KAVA   P+SVA+DA
Sbjct: 195 GLDSEESYPYEA-KDGSC-KYRAEFAVANDTGFVDI-PQQEKALMKAVATVGPISVAMDA 251

Query: 256 S--ALQFYSGGV-FNGYCETF-LNHGVTAVGY---GTSEEGIKYWLIKNSWGQDWGEDGY 308
           S  +LQFYS G+ +   C +  L+HGV  VGY   GT     KYWL+KNSWG +WG +GY
Sbjct: 252 SHPSLQFYSSGIYYEPNCSSKNLDHGVLLVGYGYEGTDSNKNKYWLVKNSWGSEWGMEGY 311

Query: 309 FRLQRDIDQPQGQCGIAMFASFPV 332
            ++ +D D     CG+A  AS+PV
Sbjct: 312 IKIAKDRDN---HCGLATAASYPV 332


>gi|33333710|gb|AAQ11973.1| putative gut cathepsin L-like cysteine protease [Callosobruchus
           maculatus]
          Length = 326

 Score =  206 bits (525), Expect = 9e-51,   Method: Compositional matrix adjust.
 Identities = 128/330 (38%), Positives = 184/330 (55%), Gaps = 49/330 (14%)

Query: 29  SIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNA-AIGNRSYTLRLNKFAD 87
           S+ E+ +Q+K  +G+TY+   E  +RF +F+ NLV ++  N     G  S+  ++ +FAD
Sbjct: 18  SVYEEGQQFKLDHGKTYRSLVEEKRRFSVFQKNLVDIQEHNKKYERGEESFAKKVTQFAD 77

Query: 88  LTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSS-----------QVPPSVNWIEKGAVT 136
           +T +EF+              LK  G P L  ++           +   +V+W E+GAVT
Sbjct: 78  MTHEEFL------------DLLKLQGVPALPSNAVHFDNFEDIDMEEKDAVDWREEGAVT 125

Query: 137 PVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATND-NNNGCYGGFMDDA 188
           PVK Q  C       AV A+EG    K   LVSLS Q+LVDCAT D  NNGC GG M  A
Sbjct: 126 PVKDQANCGSCWAFSAVGAIEGQFFKKNGTLVSLSAQELVDCATEDYGNNGCKGGLMGQA 185

Query: 189 FKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVANQ- 247
           F ++ Q++GI  +  Y YEG  +  C   K+ ++  ++  Y  V P DE+ + + VA + 
Sbjct: 186 FDFV-QDEGIQTEESYPYEGRRSS-CK--KSGEYVTKVKTY--VFPLDEQEMARTVAAKG 239

Query: 248 PVSVAIDASALQFYSGGVFNGYCETF-----LNHGVTAVGYGTSEEGIKYWLIKNSWGQD 302
           PV+VAI+AS L FY  G+ +  C        LN GV  VGYG SE G+ YW++KNSWG D
Sbjct: 240 PVAVAIEASQLSFYDKGIVDERCRCSNKREDLNPGVLVVGYG-SENGVDYWIVKNSWGAD 298

Query: 303 WGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
           WGE GYFRL++D+      CGI  + ++P+
Sbjct: 299 WGEKGYFRLKKDVK----ACGIGYYNTYPI 324


>gi|139947602|ref|NP_001077155.1| cathepsin L1 precursor [Bos taurus]
 gi|134025180|gb|AAI34742.1| CTSL1 protein [Bos taurus]
 gi|296484500|tpg|DAA26615.1| TPA: cathepsin L1 [Bos taurus]
          Length = 333

 Score =  206 bits (525), Expect = 9e-51,   Method: Compositional matrix adjust.
 Identities = 124/340 (36%), Positives = 195/340 (57%), Gaps = 29/340 (8%)

Query: 10  LIISGSCASQATYRTFDEGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFN 69
           L+++  C   A+     + S+  +++ WKA + + Y  + E  ++  ++K N+  +E  N
Sbjct: 5   LLLTALCLGIASAAPKFDHSLDTQWKLWKAAHRKPYDLNEEGWRK-AVWKKNMKMIELHN 63

Query: 70  NA-AIGNRSYTLRLNKFADLTPQEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVN 128
              + G  S+++ +N F D+T +EF  +  GF+   +    + + T F    + +PPSV+
Sbjct: 64  QEYSQGKHSFSMAMNAFGDMTNEEFRHTMNGFQRQKNKKGKEFHETIF----ASIPPSVD 119

Query: 129 WIEKGAVTPVKYQGQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCY 181
           W EKG VTPVK QG+C       A  A+EG    K  +LVSLSEQ LVDC+  + N GC+
Sbjct: 120 WREKGYVTPVKNQGKCGSCWAFSATGALEGQMFQKTGKLVSLSEQNLVDCSQPEGNRGCH 179

Query: 182 GGFMDDAFKYIIQNKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLL 241
           GGF+D+AF+Y++   G+ ++  Y Y G+  G C      + AA  T + D+ P  E++L+
Sbjct: 180 GGFIDNAFQYVLDVGGLDSEESYPYTGL-VGTC-LYNPNNSAANETGFVDL-PKQEKALM 236

Query: 242 KAVANQ-PVSVAIDAS--ALQFYSGGVF---NGYCETFLNHGVTAVGY---GTSEEGIKY 292
           KAVAN  P+SVA+DA   + QFY  G++   N   E+ ++H V  VGY   G   +  KY
Sbjct: 237 KAVANLGPISVAVDAHNPSFQFYKSGIYYEPNCSSES-VDHAVLVVGYGFEGADSDDNKY 295

Query: 293 WLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIAMFASFPV 332
           WL+KNSWG+ WG +GY ++ +D +     CGIA  AS+P 
Sbjct: 296 WLVKNSWGEHWGMNGYIKMAKDRNN---HCGIATMASYPT 332


>gi|443724292|gb|ELU12369.1| hypothetical protein CAPTEDRAFT_165495 [Capitella teleta]
          Length = 351

 Score =  206 bits (525), Expect = 1e-50,   Method: Compositional matrix adjust.
 Identities = 128/314 (40%), Positives = 184/314 (58%), Gaps = 22/314 (7%)

Query: 34  FEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNN-AAIGNRSYTLRLNKFADLTPQE 92
           ++ +K  + R Y E+ E  +R E+F++NL  +E  N   + G  SY + +N+FAD+  +E
Sbjct: 44  WQDFKTVHERNYGET-EEMQRKEVFRNNLKKIEMHNYLHSQGKSSYRMGINQFADMEVKE 102

Query: 93  FIASQTGFKMSDHSSSLKANGTPFLYKSSQV--PPSVNWIEKGAVTPVKYQGQCA----- 145
           F +   GF+M++ +       + ++  +  V  P  V+W ++G VTP+K QG C      
Sbjct: 103 FASVVNGFRMNNRTKVRDHLHSHYISPAIPVSLPAEVDWRKEGYVTPIKDQGHCGSCWSF 162

Query: 146 --VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAV 203
               A+EG +  K  +LVSLSEQ L+DC+T+  NNGC GG MD AF+YI  N G   +  
Sbjct: 163 STTGALEGQHFRKTGKLVSLSEQNLIDCSTSYGNNGCNGGVMDYAFQYIKDNDGDDTEDS 222

Query: 204 YSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVAN-QPVSVAIDAS--ALQF 260
           Y YE  + G C   K E   A  T Y D+P  DEE + +AVA   PVSVAIDAS  + Q 
Sbjct: 223 YPYEA-ADGPC-RFKKEYVGATDTGYTDLPKGDEEKMKEAVAMVGPVSVAIDASHTSFQM 280

Query: 261 YSGGVFNGY-CET-FLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQP 318
           Y  GV++   C+   L+HGV  VGYGT E G  YWL+KNSWG  WG++GY ++ R+ +  
Sbjct: 281 YQSGVYDEVECDPEGLDHGVLVVGYGT-ELGQDYWLVKNSWGTKWGDEGYIKMSRNKNN- 338

Query: 319 QGQCGIAMFASFPV 332
             QCGI+  AS+P+
Sbjct: 339 --QCGISSMASYPL 350


>gi|42564161|gb|AAS20592.1| digestive cysteine proteinase intestain [Leptinotarsa decemlineata]
          Length = 326

 Score =  206 bits (525), Expect = 1e-50,   Method: Compositional matrix adjust.
 Identities = 134/315 (42%), Positives = 179/315 (56%), Gaps = 27/315 (8%)

Query: 35  EQW---KAQYGRTYKESAENSKRFEIFKDNLVAVERFN-NAAIGNRSYTLRLNKFADLTP 90
           +QW   K  +G+TYK   E   RF IF+ NL  +E  N     G  SY L +  FADLT 
Sbjct: 21  DQWVAFKQTHGKTYKSLLEERTRFGIFQSNLRKIEEHNAKYDKGEESYFLGVTPFADLTH 80

Query: 91  QEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQC------ 144
            EF       +      +++A    F  +  +VP S++W +KGAV  VKYQG C      
Sbjct: 81  DEFKDKLR--RQIKTKPNVEATLAVFP-EGLEVPDSIDWTQKGAVLDVKYQGGCGSCWAF 137

Query: 145 -AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGC-YGGFMDDAFKYIIQNKGITNDA 202
            A  A+EG NAI  N  + LSEQQL+DC+    N+ C +GG M  AF Y++ +KGI  D+
Sbjct: 138 SATGALEGQNAIVNNVKIPLSEQQLLDCSKPYGNDDCEHGGLMSFAFDYVL-DKGIEADS 196

Query: 203 VYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVAN-QPVSVAIDASALQFY 261
            Y Y+G+ T       A+    +I  Y +V  ++EE L KAV    PVSVAIDA  +Q Y
Sbjct: 197 SYPYKGIDTPC--QYDAKKTVLKIKGYRNVSISEEE-LKKAVGTVGPVSVAIDADPIQLY 253

Query: 262 SGGVFNG-YCETFLNHGVTAVGYGTSEEGI---KYWLIKNSWGQDWGEDGYFRLQRDIDQ 317
           SGG+ +G +C   LNHGV AVGYG  +      K+W +KNSWG+DWGE GYFR++RD + 
Sbjct: 254 SGGILDGLFCTHNLNHGVLAVGYGEEDHLFGKKKFWKVKNSWGKDWGEQGYFRIKRDANN 313

Query: 318 PQGQCGIAMFASFPV 332
               CGIA  AS+P+
Sbjct: 314 ---LCGIADKASYPI 325


>gi|443708542|gb|ELU03619.1| hypothetical protein CAPTEDRAFT_17807 [Capitella teleta]
          Length = 350

 Score =  206 bits (525), Expect = 1e-50,   Method: Compositional matrix adjust.
 Identities = 132/316 (41%), Positives = 186/316 (58%), Gaps = 26/316 (8%)

Query: 34  FEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNN-AAIGNRSYTLRLNKFADLTPQE 92
           ++ +K  + RTY E+ E S+R E+F++NL  ++  N+    G   Y + +N+FAD+   E
Sbjct: 43  WQDFKTVHERTYGET-EESQRKEVFRNNLKKIQAHNHLHEQGKSPYRMGINQFADMEANE 101

Query: 93  FIASQTGFKMSDHSS---SLKANG-TPFLYKSSQVPPSVNWIEKGAVTPVKYQGQCA--- 145
           F +   GF+M++ +     L AN  +P +  S  VP  V+W ++G VTPVK QGQC    
Sbjct: 102 FASIMNGFRMNNRTEVRDHLHANYISPAIPVS--VPAEVDWRKEGYVTPVKNQGQCGSCW 159

Query: 146 ----VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITND 201
                 ++EG +  K  +LVSLSEQ LVDC+T+  N GC GG +D AF+YI  N G   +
Sbjct: 160 AFSTTGSLEGQHFRKTGKLVSLSEQNLVDCSTSYGNEGCNGGIVDYAFQYIKDNDGDDTE 219

Query: 202 AVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVA-NQPVSVAIDA--SAL 258
           A Y YE +  G C   K+    A  T Y D+P  DE  + +AVA   PVSVAIDA  S+ 
Sbjct: 220 ACYPYEAVD-GTC-RFKSVCVGATCTGYTDLPKGDEAKMKEAVALVGPVSVAIDASHSSF 277

Query: 259 QFYSGGVF-NGYCETF-LNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDID 316
           Q Y  G++    C    L+H V  VGYGT E+G  YWL+KNSWG  WG++GY ++ R++D
Sbjct: 278 QMYQSGIYVEQECSPKQLDHAVLVVGYGT-EQGQDYWLVKNSWGTTWGDEGYIKMARNMD 336

Query: 317 QPQGQCGIAMFASFPV 332
               QCGIA  AS+P+
Sbjct: 337 N---QCGIASQASYPL 349


>gi|157278115|ref|NP_001098156.1| cathepsin L precursor [Oryzias latipes]
 gi|50251128|dbj|BAD27581.1| cathepsin L [Oryzias latipes]
          Length = 336

 Score =  206 bits (525), Expect = 1e-50,   Method: Compositional matrix adjust.
 Identities = 123/320 (38%), Positives = 179/320 (55%), Gaps = 25/320 (7%)

Query: 30  IAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFN-NAAIGNRSYTLRLNKFADL 88
           + + ++ WK  + + Y E  E  +R  +++ NL  +E  N   ++G  SY L +N F D+
Sbjct: 24  LDQHWQLWKGWHSKNYHEKEEGWRRL-VWEKNLRKIELHNLEHSMGKHSYRLGMNHFGDM 82

Query: 89  TPQEFIASQTGFKMSDHSSSLKANGTPFLYKS-SQVPPSVNWIEKGAVTPVKYQGQCA-- 145
           T +EF     G+K  +     K +G+ F+  +  + P +V+W +KG VTPVK QGQC   
Sbjct: 83  THEEFRQIMNGYKRREQR---KYSGSLFMEPNFLEAPRAVDWRDKGYVTPVKDQGQCGSC 139

Query: 146 -----VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITN 200
                  A+EG    K  +LVSLSEQ LVDC+  + N GC GG MD AF+Y+  N+G+ +
Sbjct: 140 WAFSTTGALEGQQFRKTGKLVSLSEQNLVDCSRPEGNEGCNGGLMDQAFQYVKDNQGLDS 199

Query: 201 DAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVAN-QPVSVAIDA--SA 257
           +  Y Y+G     C    A+  A   T + D+P   E +L+KAVA+  PVSVAIDA   +
Sbjct: 200 EDFYPYKGTDDQPC-QYNAQYSAVNDTGFVDIPSGKERALMKAVASVGPVSVAIDAGHES 258

Query: 258 LQFYSGGV-FNGYCET-FLNHGVTAVGYGTSEE---GIKYWLIKNSWGQDWGEDGYFRLQ 312
            QFY  G+ F   C +  L+HGV  VGYG   E   G KYW++KNSW + WG+ G+  + 
Sbjct: 259 FQFYQSGIYFEKECSSDELDHGVLVVGYGFEGEDVDGKKYWIVKNSWSEKWGDKGFIYMA 318

Query: 313 RDIDQPQGQCGIAMFASFPV 332
           +D       CGIA  AS+P+
Sbjct: 319 KD---RHNHCGIATAASYPL 335


>gi|405958751|gb|EKC24845.1| Cathepsin L [Crassostrea gigas]
          Length = 330

 Score =  206 bits (524), Expect = 1e-50,   Method: Compositional matrix adjust.
 Identities = 126/307 (41%), Positives = 180/307 (58%), Gaps = 26/307 (8%)

Query: 41  YGRTYKESAENSKRFEIFKDNLVAVERFNNAA-IGNRSYTLRLNKFADLTPQEFIASQTG 99
           +G+ Y  + E ++R  I++ NL  +E+ N AA  G+ S+ L +N++ D+T +EF ++  G
Sbjct: 34  HGKQYG-AEEEARRRVIWEGNLDYIEKHNLAADRGDYSFWLGMNEYGDMTNEEFRSTMNG 92

Query: 100 FKMSDHSSSLKANGTPFLYKSS--QVPPSVNWIEKGAVTPVKYQGQC-------AVAAVE 150
           +KM + +S     G+ +L  S+   +P +V+W  KG VTP+K QGQC       A  ++E
Sbjct: 93  YKMRNGTS----RGSLYLPPSNIGDLPDTVDWRPKGYVTPIKNQGQCGSCWSFSATGSLE 148

Query: 151 GINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAVYSYEGMS 210
           G    K  +L SLSEQ LVDC+    N+GC GG MDDAF+YI  N GI  ++ Y YE   
Sbjct: 149 GQTFKKTGKLPSLSEQNLVDCSQKQGNHGCQGGLMDDAFQYIKDNNGIDTESSYPYEA-K 207

Query: 211 TGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVAN-QPVSVAIDASAL--QFYSGGVFN 267
            G C    A +  A  + + D+    E  L  AVA   P++VAIDAS +  Q Y  GV++
Sbjct: 208 NGKC-RFNAANVGATDSGFTDIKSKSESDLQSAVATVGPIAVAIDASHMSFQLYKSGVYH 266

Query: 268 GY--CETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQPQGQCGIA 325
            +   ET L+HGV AVGYGT E G  YWL+KNSWG+ WG+ GY  + R+    +  CGIA
Sbjct: 267 EFFCSETRLDHGVLAVGYGT-ESGKDYWLVKNSWGESWGQKGYIMMSRN---KRNNCGIA 322

Query: 326 MFASFPV 332
             AS+P 
Sbjct: 323 TSASYPT 329


>gi|340371596|ref|XP_003384331.1| PREDICTED: digestive cysteine proteinase 2-like [Amphimedon
           queenslandica]
          Length = 327

 Score =  206 bits (524), Expect = 1e-50,   Method: Compositional matrix adjust.
 Identities = 127/314 (40%), Positives = 180/314 (57%), Gaps = 25/314 (7%)

Query: 31  AEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFNNAAIGNRSYTLRLNKFADLTP 90
           AE++  WK +YG+TY+   E++ R +I+  N   V   N+    + S+ L +N+FADLT 
Sbjct: 26  AEEWRLWKGKYGKTYRSIYEDNMRQKIWLQNRDYVNEHNSM---DSSFQLEVNEFADLTA 82

Query: 91  QEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQCA----- 145
           +EF +   G+    +  +   N T + Y    +P SV+W  KG VTPVK Q QC      
Sbjct: 83  EEFSSIYNGYGKGRNREN-HENTTIYRYTGGAIPDSVDWRTKGLVTPVKNQKQCGSCWAF 141

Query: 146 --VAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQNKGITNDAV 203
               ++EG +A K  +LVSLSEQ LVDC   D+  GC GG M  AFKYI +NKGI  +  
Sbjct: 142 STTGSLEGAHAKKTGKLVSLSEQNLVDCDKKDH--GCQGGLMTTAFKYIEENKGIDTEES 199

Query: 204 YSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVAN-QPVSVAIDAS--ALQF 260
           Y Y+    G C+  K +D  A +  +  +   D E+L KAVA   P+SVA+DAS  + Q 
Sbjct: 200 YPYKA-KNGRCE-FKKDDIGATVERHVSILTTDCEALKKAVAEIGPISVAMDASHSSFQL 257

Query: 261 YSGGVFN-GYCETF-LNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFRLQRDIDQP 318
           Y  G+++   C +  L+HGV  VGYG  E+G +YWL+KNSWG++WG +GYF+    I   
Sbjct: 258 YKSGIYDPKICSSRKLDHGVLVVGYG-KEDGEEYWLVKNSWGKNWGMEGYFK----IASK 312

Query: 319 QGQCGIAMFASFPV 332
           +  CGI   A +PV
Sbjct: 313 KNLCGICTSACYPV 326


>gi|42564159|gb|AAS20591.1| digestive cysteine proteinase intestain [Leptinotarsa decemlineata]
          Length = 326

 Score =  206 bits (524), Expect = 1e-50,   Method: Compositional matrix adjust.
 Identities = 134/315 (42%), Positives = 178/315 (56%), Gaps = 27/315 (8%)

Query: 35  EQW---KAQYGRTYKESAENSKRFEIFKDNLVAVERFN-NAAIGNRSYTLRLNKFADLTP 90
           +QW   K  +G+TYK   E   RF IF+ NL  +E  N     G  SY L +  FADLT 
Sbjct: 21  DQWVAFKQTHGKTYKSLLEERTRFGIFQSNLRKIEEHNAKYDKGEESYFLGVTPFADLTH 80

Query: 91  QEFIASQTGFKMSDHSSSLKANGTPFLYKSSQVPPSVNWIEKGAVTPVKYQGQC------ 144
            EF       +      +++A    F  +  +VP S++W +KGAV  VKYQG C      
Sbjct: 81  DEFKDELR--RQIKTKPNVEATLAVFP-EGLEVPDSIDWTQKGAVLDVKYQGGCGSCWAF 137

Query: 145 -AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGC-YGGFMDDAFKYIIQNKGITNDA 202
            A  A+EG NAI  N  + LSEQQL+DC+    N+ C +GG M  AF Y++ +KGI  D+
Sbjct: 138 SATGALEGQNAIVNNVKIPLSEQQLLDCSKPYGNDDCEHGGLMSFAFDYVL-DKGIEADS 196

Query: 203 VYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVAN-QPVSVAIDASALQFY 261
            Y Y+G+ T       A+    +I  Y++V  N EE L KAV    PVSVAIDA  +Q Y
Sbjct: 197 SYPYKGIDTPC--QYDAKKTVLKIKGYKNVS-NSEEELKKAVGTVGPVSVAIDADPIQLY 253

Query: 262 SGGVFNG-YCETFLNHGVTAVGYGTSEEGI---KYWLIKNSWGQDWGEDGYFRLQRDIDQ 317
            GG+ +G +C   LNHGV AVGYG  +      K+W +KNSWG+DWGE GYFR++RD + 
Sbjct: 254 FGGILDGLFCTHNLNHGVLAVGYGEEDHLFGKKKFWKVKNSWGKDWGEQGYFRIKRDANN 313

Query: 318 PQGQCGIAMFASFPV 332
               CGIA  AS+P+
Sbjct: 314 ---LCGIADKASYPI 325


>gi|33242886|gb|AAQ01147.1| cathepsin [Haplochromis chilotes]
          Length = 334

 Score =  206 bits (524), Expect = 1e-50,   Method: Compositional matrix adjust.
 Identities = 128/322 (39%), Positives = 177/322 (54%), Gaps = 28/322 (8%)

Query: 27  EGSIAEKFEQWKAQYGRTYKESAENSKRFEIFKDNLVAVERFN-NAAIGNRSYTLRLNKF 85
           E ++   +E WK  +G++YK   EN+ R E++ +NL  +   N  A++G  +Y L +N  
Sbjct: 24  ESTLDAHWELWKKTHGKSYKNDVENAHRRELWGNNLKMITVHNLEASMGLHTYELGMNHM 83

Query: 86  ADLTPQE---FIASQTGFKMSDHSSSLKANGTPFLYKS-SQVPPSVNWIEKGAVTPVKYQ 141
            DLT +E   F AS T        + ++   +PF   S S +P +++W EKG VT VK Q
Sbjct: 84  GDLTEEEIMQFFASLT------PPTDIQRAPSPFAGASGSGIPDTMDWREKGCVTKVKMQ 137

Query: 142 GQC-------AVAAVEGINAIKINRLVSLSEQQLVDCATNDNNNGCYGGFMDDAFKYIIQ 194
           G C       A  A+EG  A    +LV LS Q LVDC+    N+GC GGFM  AF+Y+I 
Sbjct: 138 GACGSCWAFSAAGALEGQLAKSTGKLVDLSPQNLVDCSGKYGNHGCNGGFMTRAFQYVID 197

Query: 195 NKGITNDAVYSYEGMSTGICDSIKAEDHAAQITNYEDVPPNDEESLLKAVAN-QPVSVAI 253
           N GI +DA Y Y G              AA  ++Y+ +P  DE +L + +A   P+SVAI
Sbjct: 198 NHGIDSDASYPYIGRDDQC--HYNPATRAANCSSYQFLPEGDENALKQGLATVGPISVAI 255

Query: 254 DA--SALQFYSGGVFNG-YCETFLNHGVTAVGYGTSEEGIKYWLIKNSWGQDWGEDGYFR 310
           DA      FY  GV+N   C   +NHGV AVGYGT   G  YWL+KNSWG  +G+ GY R
Sbjct: 256 DARRPRFSFYRSGVYNDPSCTQKVNHGVLAVGYGTL-NGQDYWLVKNSWGTTFGDQGYIR 314

Query: 311 LQRDIDQPQGQCGIAMFASFPV 332
           + R+      QCGIA++  +PV
Sbjct: 315 MARNTGN---QCGIALYPCYPV 333


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.315    0.131    0.388 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 5,400,993,642
Number of Sequences: 23463169
Number of extensions: 222004660
Number of successful extensions: 556233
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 6262
Number of HSP's successfully gapped in prelim test: 1194
Number of HSP's that attempted gapping in prelim test: 526424
Number of HSP's gapped (non-prelim): 8379
length of query: 348
length of database: 8,064,228,071
effective HSP length: 143
effective length of query: 205
effective length of database: 9,003,962,200
effective search space: 1845812251000
effective search space used: 1845812251000
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 77 (34.3 bits)