BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 042468
         (346 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|255564908|ref|XP_002523447.1| cysteine protease, putative [Ricinus communis]
 gi|223537275|gb|EEF38906.1| cysteine protease, putative [Ricinus communis]
          Length = 342

 Score =  534 bits (1375), Expect = e-149,   Method: Compositional matrix adjust.
 Identities = 254/341 (74%), Positives = 290/341 (85%), Gaps = 5/341 (1%)

Query: 7   ENKLVLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKEN 66
           ENKL+  A+LV+G+WA Q+WSR+L+DA MNERHEMWMA+YGRVY+DN+EKE RF+IF+ N
Sbjct: 6   ENKLMFVALLVVGLWASQAWSRSLHDAAMNERHEMWMAKYGRVYKDNSEKERRFEIFRNN 65

Query: 67  VEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENA 126
           VE+I SFN K  N+PYKL INEFAD TNEEF+  +NGYKR   S     T   SFRY N 
Sbjct: 66  VEFIESFN-KLGNRPYKLDINEFADLTNEEFKVSKNGYKR---SSGVGLTEKSSFRYANV 121

Query: 127 S-VPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDT 185
           + VP S+DWR+ GAVT +KDQGQCGCCWAFSAVAAMEGI  ++T KL SLSEQELVDCDT
Sbjct: 122 TAVPTSMDWRQNGAVTPIKDQGQCGCCWAFSAVAAMEGITKLSTGKLISLSEQELVDCDT 181

Query: 186 SGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPS 245
           SGEDQGCEGGLMDDAFEFI  N GL TEA YPY+ +DG+CN  +A   AAKI+GYEDVP+
Sbjct: 182 SGEDQGCEGGLMDDAFEFIKQNGGLTTEANYPYQGTDGTCNTNKAGNDAAKITGYEDVPA 241

Query: 246 NNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKY 305
           N+E AL+KAVA+QPVSVAIDASGS FQFYS GVFTG CGTELDHGVTAVGYGT+DDGTKY
Sbjct: 242 NSEDALLKAVASQPVSVAIDASGSAFQFYSGGVFTGDCGTELDHGVTAVGYGTSDDGTKY 301

Query: 306 WLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPTA 346
           WLVKNSWGT+WGE+GYIRM+RDI+AKEGLCGIAMQ SYPTA
Sbjct: 302 WLVKNSWGTSWGEDGYIRMERDIEAKEGLCGIAMQPSYPTA 342


>gi|255564910|ref|XP_002523448.1| cysteine protease, putative [Ricinus communis]
 gi|223537276|gb|EEF38907.1| cysteine protease, putative [Ricinus communis]
          Length = 341

 Score =  527 bits (1358), Expect = e-147,   Method: Compositional matrix adjust.
 Identities = 254/343 (74%), Positives = 293/343 (85%), Gaps = 6/343 (1%)

Query: 5   LLENKLVLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFK 64
           + E KL+  A+LV+G+W  Q+WSR+L+DA MNERHEMWM +YGRVY+DN+EKE RF+IF+
Sbjct: 4   ISERKLMFVALLVVGLWVSQAWSRSLHDAAMNERHEMWMVKYGRVYKDNSEKERRFEIFR 63

Query: 65  ENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYE 124
            NVE+I SFN K  N+PYKL INEFAD TNEEF+A RNGYKR   +V  SE +  SFRY 
Sbjct: 64  NNVEFIESFN-KPGNRPYKLDINEFADLTNEEFKASRNGYKRS-SNVGLSEKS--SFRYG 119

Query: 125 NAS-VPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDC 183
           N + VP S+DWR+KGAVT +KDQGQCGCCWAFSAVAAMEGI  ++T KL SLSEQELVDC
Sbjct: 120 NVTAVPTSMDWRQKGAVTPIKDQGQCGCCWAFSAVAAMEGITKLSTGKLISLSEQELVDC 179

Query: 184 DTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDV 243
           DTSGEDQGCEGGLMDDAFEFI  N GL TEA YPY+ +DG+CN  +A   AAKI+GYEDV
Sbjct: 180 DTSGEDQGCEGGLMDDAFEFIKQNGGLTTEANYPYQGTDGTCNTNKAGNDAAKITGYEDV 239

Query: 244 PSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGT 303
           P+N+E AL+KAVA+QPVSVAIDASGS FQFYS GVFTG CGTELDHGVTAVGYGT+ DGT
Sbjct: 240 PANSEDALLKAVASQPVSVAIDASGSAFQFYSGGVFTGDCGTELDHGVTAVGYGTS-DGT 298

Query: 304 KYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPTA 346
           KYWLVKNSWGT+WGE+GYIRM+RDI+AKEGLCGIAMQ+SYPTA
Sbjct: 299 KYWLVKNSWGTSWGEDGYIRMERDIEAKEGLCGIAMQSSYPTA 341


>gi|255563110|ref|XP_002522559.1| cysteine protease, putative [Ricinus communis]
 gi|223538250|gb|EEF39859.1| cysteine protease, putative [Ricinus communis]
          Length = 344

 Score =  525 bits (1353), Expect = e-147,   Method: Compositional matrix adjust.
 Identities = 252/344 (73%), Positives = 284/344 (82%), Gaps = 3/344 (0%)

Query: 4   ILLENKLVLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIF 63
           +LL NKLVL A+L++ +WA QSWSR+L++A+M  RH+ WM QYGRVY+ N EKE RFKIF
Sbjct: 3   LLLHNKLVLMAMLLVTLWASQSWSRSLHEASMELRHKTWMTQYGRVYKGNVEKEKRFKIF 62

Query: 64  KENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRY 123
           KENVE+I SFNN   NKPYKLGIN F D TNEEFRA  NGY   + S +SS  T  SFRY
Sbjct: 63  KENVEFIESFNNNG-NKPYKLGINAFTDLTNEEFRASHNGYTMSMSSHQSSYRTK-SFRY 120

Query: 124 ENAS-VPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVD 182
           EN + VP S+DWR KGAVT +KDQGQCGCCWAFSAVAAMEGI  ++T  L SLSEQELVD
Sbjct: 121 ENVTAVPPSLDWRTKGAVTHIKDQGQCGCCWAFSAVAAMEGITKLSTGTLISLSEQELVD 180

Query: 183 CDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYED 242
           CDTSG DQGCEGGLMDDAFEFII N GL TEA YPY+  DGSCN ++A   AAKI+GYE+
Sbjct: 181 CDTSGMDQGCEGGLMDDAFEFIIENNGLTTEANYPYEGVDGSCNTRKAANHAAKITGYEN 240

Query: 243 VPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDG 302
           VP+ +E AL KAVANQPVSVAIDA  S FQ YSSG+FTG CGTELDHGVT VGYGT+DDG
Sbjct: 241 VPAYDEEALRKAVANQPVSVAIDAGESAFQHYSSGIFTGDCGTELDHGVTVVGYGTSDDG 300

Query: 303 TKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPTA 346
           TKYWLVKNSWGT+WGE+GYIRM+RDIDAKEGLCGIAM+ SYPTA
Sbjct: 301 TKYWLVKNSWGTSWGEDGYIRMERDIDAKEGLCGIAMEPSYPTA 344


>gi|225443827|ref|XP_002274223.1| PREDICTED: vignain-like [Vitis vinifera]
          Length = 340

 Score =  505 bits (1300), Expect = e-140,   Method: Compositional matrix adjust.
 Identities = 254/344 (73%), Positives = 285/344 (82%), Gaps = 5/344 (1%)

Query: 4   ILLENKLVLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIF 63
           + LE+K++   +L++GVWA Q+ SRTL++ +M+ERHE WM  YGR Y+D AEKE RFKIF
Sbjct: 1   MALESKIICITLLIMGVWASQALSRTLHEVSMSERHEDWMGLYGRTYKDIAEKERRFKIF 60

Query: 64  KENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRY 123
           KENVEYI S N+ A N+ YKL INEFADQTNEEF+A RNGY       RSSE T  SFRY
Sbjct: 61  KENVEYIESVNS-AGNRRYKLSINEFADQTNEEFKASRNGYNMS-SRPRSSEIT--SFRY 116

Query: 124 EN-ASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVD 182
           EN A+VP+S+DWRKKGAVT +KDQGQCGCCWAFSAVAAMEG+  + T +L SLSEQELVD
Sbjct: 117 ENVAAVPSSMDWRKKGAVTPIKDQGQCGCCWAFSAVAAMEGVTQLKTGELISLSEQELVD 176

Query: 183 CDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYED 242
           CDTSGEDQGC GGLMD AFEFII N GL TEA YPYK  D +CNKK+A  SAAKI  YED
Sbjct: 177 CDTSGEDQGCGGGLMDSAFEFIIGNGGLTTEANYPYKGVDATCNKKKAASSAAKIKNYED 236

Query: 243 VPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDG 302
           VP+N+EAAL+KAVA  PVSVAIDA GSDFQFYSSGVFTGQCGTELDHGVTAVGYG  DDG
Sbjct: 237 VPANSEAALLKAVAQHPVSVAIDAGGSDFQFYSSGVFTGQCGTELDHGVTAVGYGKTDDG 296

Query: 303 TKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPTA 346
           TKYWLVKNSWGT WGE+GYI M+RDI A EGLCGIAM+ASYPTA
Sbjct: 297 TKYWLVKNSWGTGWGEDGYIWMERDIGADEGLCGIAMEASYPTA 340


>gi|225446583|ref|XP_002280204.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1 [Vitis vinifera]
          Length = 341

 Score =  500 bits (1288), Expect = e-139,   Method: Compositional matrix adjust.
 Identities = 241/347 (69%), Positives = 279/347 (80%), Gaps = 7/347 (2%)

Query: 1   MAMILLENKLVLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRF 60
           MA +     + LA + VL  WA Q+ +R L++A+M ERHE WM QYGR Y+D  EK  R+
Sbjct: 1   MASVNQYQYICLALLFVLAAWASQATARNLHEASMYERHEDWMVQYGREYKDADEKSKRY 60

Query: 61  KIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVS 120
           KIFK+NV  I SFN KA +K YKL INEFAD TNEEFRA RN +K  + S  ++     S
Sbjct: 61  KIFKDNVARIESFN-KAMDKSYKLSINEFADLTNEEFRASRNRFKAHICSTEAT-----S 114

Query: 121 FRYENAS-VPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQE 179
           F+YEN + VP+++DWRKKGAVT +KDQGQCG CWAFSAVAAMEGI  ++T KL SLSEQE
Sbjct: 115 FKYENVTAVPSTVDWRKKGAVTPIKDQGQCGSCWAFSAVAAMEGITQLSTGKLISLSEQE 174

Query: 180 LVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISG 239
           LVDCDTSGEDQGC GGLMDDAF+FI  N GL TEA YPY  +DG+CN+K+A   AAKI+G
Sbjct: 175 LVDCDTSGEDQGCSGGLMDDAFKFIEQNHGLTTEANYPYAGTDGTCNRKKAAHPAAKING 234

Query: 240 YEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTA 299
           YEDVP+NNE AL KAVA+QP++VAIDA GS+FQFYSSGVFTGQCGTELDHGV+AVGYGT+
Sbjct: 235 YEDVPANNEKALQKAVAHQPIAVAIDAGGSEFQFYSSGVFTGQCGTELDHGVSAVGYGTS 294

Query: 300 DDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPTA 346
           DDG KYWLVKNSWGT WGE GYIRMQRD+ AKEGLCGIAMQASYPTA
Sbjct: 295 DDGMKYWLVKNSWGTGWGEEGYIRMQRDVTAKEGLCGIAMQASYPTA 341


>gi|225446581|ref|XP_002280246.1| PREDICTED: vignain [Vitis vinifera]
          Length = 341

 Score =  499 bits (1286), Expect = e-139,   Method: Compositional matrix adjust.
 Identities = 241/347 (69%), Positives = 279/347 (80%), Gaps = 7/347 (2%)

Query: 1   MAMILLENKLVLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRF 60
           MA +     + LA + VL  WA Q+ +R+L++A+M ERHE WM QYGR Y+D  EK  R+
Sbjct: 1   MASVNQYQYICLALLFVLAAWASQATARSLHEASMYERHEDWMVQYGREYKDADEKSKRY 60

Query: 61  KIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVS 120
           KIFK+NV  I SFN KA +K YKL INEFAD TNEEFRA RN +K  + S  ++     S
Sbjct: 61  KIFKDNVARIESFN-KAMDKSYKLSINEFADLTNEEFRASRNRFKAHICSTEAT-----S 114

Query: 121 FRYENAS-VPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQE 179
           F+YEN + VP+++DWRKKGAVT +KDQGQCG CWAFSAVAAMEGI  ++T KL SLSEQE
Sbjct: 115 FKYENVTAVPSTVDWRKKGAVTPIKDQGQCGSCWAFSAVAAMEGITQLSTGKLISLSEQE 174

Query: 180 LVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISG 239
           LVDCDTSGEDQGC GGLMDDAF+FI  N GL TEA YPY  +DG+CN+K+A   AAKI+G
Sbjct: 175 LVDCDTSGEDQGCSGGLMDDAFKFIEQNHGLTTEANYPYAGTDGTCNRKKAAHPAAKING 234

Query: 240 YEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTA 299
           YEDVP+NNE AL KAVA+QP++VAIDASGS+FQFYSSGVFTGQCGTELDHGV AVGYGT+
Sbjct: 235 YEDVPANNEKALQKAVAHQPIAVAIDASGSEFQFYSSGVFTGQCGTELDHGVAAVGYGTS 294

Query: 300 DDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPTA 346
           DDG KYWLVKNSW T WGE GYIRMQRD+ AKEGLCGIAMQASYPTA
Sbjct: 295 DDGMKYWLVKNSWSTGWGEEGYIRMQRDVTAKEGLCGIAMQASYPTA 341


>gi|224121800|ref|XP_002330656.1| predicted protein [Populus trichocarpa]
 gi|222872260|gb|EEF09391.1| predicted protein [Populus trichocarpa]
          Length = 342

 Score =  498 bits (1282), Expect = e-138,   Method: Compositional matrix adjust.
 Identities = 237/343 (69%), Positives = 276/343 (80%), Gaps = 5/343 (1%)

Query: 5   LLENKLVLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFK 64
           + + +   A IL+LG+WA +  SR L ++ M+ RHE WMA YG+VY D AEKE RFKIFK
Sbjct: 4   ICKRQCFFAFILILGMWAFEVASRELQESYMSARHEQWMATYGKVYVDAAEKERRFKIFK 63

Query: 65  ENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYE 124
            NVEYI SFN  A NKPYKL +N+FADQTNE+F+  RNGY+R   + R  + T  SF+YE
Sbjct: 64  NNVEYIESFNT-AGNKPYKLSVNKFADQTNEKFKGARNGYRRPFQT-RPMKVT--SFKYE 119

Query: 125 NAS-VPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDC 183
           N + VPA++DWRKKGAVT +KDQGQCG CWAFS VAA EGIN +TT KL SLSEQELVDC
Sbjct: 120 NVTAVPATMDWRKKGAVTLIKDQGQCGSCWAFSTVAATEGINQLTTGKLVSLSEQELVDC 179

Query: 184 DTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDV 243
           D  GEDQGCEGGLM+D FEFII N G+ TEA YPY+A+DG+CN K+     AKI+GYE V
Sbjct: 180 DIQGEDQGCEGGLMEDGFEFIIKNHGITTEANYPYQAADGTCNSKKQASHIAKITGYESV 239

Query: 244 PSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGT 303
           P+N+EA L+K VANQP+SV+IDA GSDFQFYSSGVFTG+CGTELDHGVTAVGYG   DGT
Sbjct: 240 PANSEAELLKVVANQPISVSIDAGGSDFQFYSSGVFTGKCGTELDHGVTAVGYGETSDGT 299

Query: 304 KYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPTA 346
           KYWLVKNSWGT+WGE GYIRMQRDID +EGLCGIAM +SYPTA
Sbjct: 300 KYWLVKNSWGTSWGEEGYIRMQRDIDTEEGLCGIAMDSSYPTA 342


>gi|147788834|emb|CAN64655.1| hypothetical protein VITISV_005140 [Vitis vinifera]
          Length = 341

 Score =  498 bits (1282), Expect = e-138,   Method: Compositional matrix adjust.
 Identities = 240/347 (69%), Positives = 278/347 (80%), Gaps = 7/347 (2%)

Query: 1   MAMILLENKLVLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRF 60
           MA +     + LA + VL  WA  + +R L++A+M ERHE WMAQYGRVY+D  EK  R+
Sbjct: 1   MASVNQYRYICLALLFVLAAWASHAKARNLHEASMYERHEDWMAQYGRVYKDAGEKSKRY 60

Query: 61  KIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVS 120
           KIFK+NV  I SFN KA NK YKL INEFAD TNEEFRA RN +K  + S  ++     S
Sbjct: 61  KIFKDNVARIESFN-KAMNKSYKLSINEFADLTNEEFRASRNRFKAHICSTEAT-----S 114

Query: 121 FRYENA-SVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQE 179
           F+YE+  +VP+++DWRKKGAVT +KDQGQCG CWAFSAVAAMEGI  ++T KL SLSEQE
Sbjct: 115 FKYEHVXAVPSTVDWRKKGAVTPIKDQGQCGSCWAFSAVAAMEGITQLSTGKLISLSEQE 174

Query: 180 LVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISG 239
           LVDCDTSGEDQGC GGLMDDAF+FI  N GL TEA YPY  +DG+CN+K+A   AAKI+G
Sbjct: 175 LVDCDTSGEDQGCSGGLMDDAFKFIEQNHGLTTEANYPYAGTDGTCNRKKAAHPAAKING 234

Query: 240 YEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTA 299
           YEDVP+NNE AL KAVA+QP++VAIDA G +FQFYSSGVFTGQCGTELDHGV+AVGYGT+
Sbjct: 235 YEDVPANNEKALQKAVAHQPIAVAIDAGGFEFQFYSSGVFTGQCGTELDHGVSAVGYGTS 294

Query: 300 DDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPTA 346
           DDG KYWLVKNSWGT WGE GYIRMQRD+  KEGLCGIAMQASYPTA
Sbjct: 295 DDGMKYWLVKNSWGTGWGEEGYIRMQRDVTEKEGLCGIAMQASYPTA 341


>gi|147839728|emb|CAN70559.1| hypothetical protein VITISV_032465 [Vitis vinifera]
          Length = 341

 Score =  498 bits (1281), Expect = e-138,   Method: Compositional matrix adjust.
 Identities = 240/347 (69%), Positives = 277/347 (79%), Gaps = 7/347 (2%)

Query: 1   MAMILLENKLVLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRF 60
           MA +     + LA + VL  WA Q+ +R L++A+M ERHE WM QYGR Y+D  EK  R+
Sbjct: 1   MASVNQYQYICLALLFVLAAWASQATARXLHEASMYERHEDWMVQYGREYKDADEKSKRY 60

Query: 61  KIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVS 120
           KIFK+NV  I SFN KA +K YKL INEFAD TNEEFRA RN +K  + S  ++     S
Sbjct: 61  KIFKDNVARIESFN-KAMDKSYKLSINEFADLTNEEFRASRNRFKAHICSTEAT-----S 114

Query: 121 FRYENAS-VPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQE 179
           F+YEN + VP+++DWRKKGAVT +KDQGQCG CWAFSAVAAMEGI  ++T KL SLSEQE
Sbjct: 115 FKYENVTAVPSTVDWRKKGAVTPIKDQGQCGSCWAFSAVAAMEGITQLSTGKLISLSEQE 174

Query: 180 LVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISG 239
           LVDCDTSGEDQGC GGLMDDAF+FI  N GL TEA YPY  +DG+CN+K+A   AAKI+G
Sbjct: 175 LVDCDTSGEDQGCSGGLMDDAFKFIEQNHGLTTEANYPYAGTDGTCNRKKAAHPAAKING 234

Query: 240 YEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTA 299
           YEDVP+NNE AL KAVA+QP++VAIDASGS+FQFYSSGVFTGQCGTELDHGV AVGYGT+
Sbjct: 235 YEDVPANNEKALQKAVAHQPIAVAIDASGSEFQFYSSGVFTGQCGTELDHGVAAVGYGTS 294

Query: 300 DDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPTA 346
           DDG KYWLVKNSW T WGE GYIRMQRD+  KEGLCGIAMQASYPTA
Sbjct: 295 DDGMKYWLVKNSWSTGWGEEGYIRMQRDVTVKEGLCGIAMQASYPTA 341


>gi|37780045|gb|AAP32195.1| cysteine protease 5 [Trifolium repens]
          Length = 343

 Score =  498 bits (1281), Expect = e-138,   Method: Compositional matrix adjust.
 Identities = 236/335 (70%), Positives = 270/335 (80%), Gaps = 3/335 (0%)

Query: 12  LAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIA 71
           LA    LG+ A Q  SRTL D ++ ERHE WM  YG+VY++  E+E R +IF EN++YI 
Sbjct: 12  LALFFCLGLLAIQVTSRTLQDDSIFERHEQWMTHYGKVYKNPQEREKRLRIFTENLKYIE 71

Query: 72  SFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENASVPAS 131
           + NN   NKPYKLGIN+FAD TNEEF A RN +K  + S     TT   F+YEN SVP++
Sbjct: 72  ASNNAGNNKPYKLGINQFADLTNEEFIASRNKFKGHMCSSIIRTTT---FKYENTSVPST 128

Query: 132 IDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQG 191
           +DWRKKGAVT VK+QGQCGCCWAFSA+AA EGI+ I+T KL SLSEQELVDCDT+G DQG
Sbjct: 129 VDWRKKGAVTPVKNQGQCGCCWAFSAIAATEGIHKISTGKLVSLSEQELVDCDTNGVDQG 188

Query: 192 CEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAAL 251
           CEGGLMDDAF+FII N G++TEA YPY+  DG+C   EA+ SAA I+GYEDVP+NNE AL
Sbjct: 189 CEGGLMDDAFKFIIQNNGISTEAGYPYQGVDGTCKANEASTSAATITGYEDVPANNENAL 248

Query: 252 MKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNS 311
            KAVANQP+SVAIDASGSDFQFY SGVFTG CGTELDHGVTAVGYG ++DGTKYWLVKNS
Sbjct: 249 QKAVANQPISVAIDASGSDFQFYKSGVFTGSCGTELDHGVTAVGYGISNDGTKYWLVKNS 308

Query: 312 WGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPTA 346
           WGT WGE GYIRMQR IDA EGLCGIAMQASYPTA
Sbjct: 309 WGTDWGEEGYIRMQRSIDAAEGLCGIAMQASYPTA 343


>gi|357483847|ref|XP_003612210.1| Cysteine proteinase [Medicago truncatula]
 gi|355513545|gb|AES95168.1| Cysteine proteinase [Medicago truncatula]
          Length = 344

 Score =  497 bits (1280), Expect = e-138,   Method: Compositional matrix adjust.
 Identities = 235/338 (69%), Positives = 273/338 (80%), Gaps = 4/338 (1%)

Query: 10  LVLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEY 69
           + LA +  LG+WA Q  SRTL D +M+ERHE WM  YG+VY+D+ E+E RFKIF EN++Y
Sbjct: 10  ISLALVFCLGLWAIQVTSRTLQDGSMHERHERWMNHYGKVYKDHQEREKRFKIFTENMKY 69

Query: 70  IASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENAS-V 128
           I +FNN   N+ YKLGIN+FAD TNEEF A RN +K  + S     TT   F+YEN S +
Sbjct: 70  IEAFNNGDNNESYKLGINQFADLTNEEFVASRNKFKGHMCSSIIRTTT---FKYENVSAI 126

Query: 129 PASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGE 188
           P+++DWRKKGAVT VK+QGQCGCCWAFSAVAA EGI+ ++T KL SLSEQELVDCDT G 
Sbjct: 127 PSTVDWRKKGAVTPVKNQGQCGCCWAFSAVAATEGIHKLSTGKLVSLSEQELVDCDTKGV 186

Query: 189 DQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNE 248
           DQGCEGGLMDDAF+FII N GL TEA+YPY+  DG+CN  +A+  A  I+GYEDVP+NNE
Sbjct: 187 DQGCEGGLMDDAFKFIIQNHGLNTEAQYPYQGVDGTCNANKASIQATTITGYEDVPANNE 246

Query: 249 AALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLV 308
            AL KAVANQP+SVAIDASGSDFQFY SGVFTG CGTELDHGVTAVGYG ++DGTKYWLV
Sbjct: 247 QALQKAVANQPISVAIDASGSDFQFYKSGVFTGSCGTELDHGVTAVGYGVSNDGTKYWLV 306

Query: 309 KNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPTA 346
           KNSWGT WGE GYI MQR ++A EGLCGIAMQASYPTA
Sbjct: 307 KNSWGTDWGEEGYIMMQRGVEAAEGLCGIAMQASYPTA 344


>gi|225446585|ref|XP_002280215.1| PREDICTED: vignain [Vitis vinifera]
          Length = 341

 Score =  497 bits (1280), Expect = e-138,   Method: Compositional matrix adjust.
 Identities = 242/347 (69%), Positives = 279/347 (80%), Gaps = 7/347 (2%)

Query: 1   MAMILLENKLVLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRF 60
           MA +     + LA +  L  WA Q+ +R L +A+M ERHE WMAQYGRVY+D  EK  R+
Sbjct: 1   MASVNQYQYICLALLFFLAAWASQATARNLLEASMYERHEDWMAQYGRVYKDADEKSKRY 60

Query: 61  KIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVS 120
           KIFK+NV  I SFN KA +K YKL INEFAD TNEEFRA RN +K  + S  ++     S
Sbjct: 61  KIFKDNVARIESFN-KAMDKSYKLSINEFADLTNEEFRASRNRFKAHICSTEAT-----S 114

Query: 121 FRYEN-ASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQE 179
           F+YE+ A+VP+++DWRKKGAVT +KDQGQCG CWAFSAVAAMEGI  ++T KL SLSEQE
Sbjct: 115 FKYEHVAAVPSTVDWRKKGAVTPIKDQGQCGSCWAFSAVAAMEGITQLSTGKLISLSEQE 174

Query: 180 LVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISG 239
           LVDCDTSGEDQGC GGLMDDAF+FI  N GLATEA YPY  +DG+CN+K+A   AAKI+G
Sbjct: 175 LVDCDTSGEDQGCNGGLMDDAFKFIEQNHGLATEANYPYAGTDGTCNRKKAAHPAAKING 234

Query: 240 YEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTA 299
           YEDVP+NNE AL KAVA+QP++VAIDA G +FQFYSSGVFTGQCGTELDHGV AVGYGT+
Sbjct: 235 YEDVPANNEKALQKAVAHQPIAVAIDAGGFEFQFYSSGVFTGQCGTELDHGVAAVGYGTS 294

Query: 300 DDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPTA 346
           DDG KYWLVKNSWGT WGE GYIRMQRD+ AKEGLCGIAMQASYPTA
Sbjct: 295 DDGMKYWLVKNSWGTGWGEVGYIRMQRDVTAKEGLCGIAMQASYPTA 341


>gi|224099295|ref|XP_002334495.1| predicted protein [Populus trichocarpa]
 gi|222872550|gb|EEF09681.1| predicted protein [Populus trichocarpa]
          Length = 342

 Score =  497 bits (1279), Expect = e-138,   Method: Compositional matrix adjust.
 Identities = 237/343 (69%), Positives = 276/343 (80%), Gaps = 5/343 (1%)

Query: 5   LLENKLVLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFK 64
           + + +   A IL+LG+WA +  SR L ++ M+ RHE WMA YG+VY D AEKE RFKIFK
Sbjct: 4   ICKRQCFFAFILILGMWAFEVASRELQESYMSARHEQWMATYGKVYVDAAEKERRFKIFK 63

Query: 65  ENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYE 124
            NVEYI SFN  A NKPYKL +N+FADQTNE+F+  RNGY+R   + R  + T  SF+YE
Sbjct: 64  NNVEYIESFNT-AGNKPYKLSVNKFADQTNEKFKGARNGYRRPFQT-RPMKVT--SFKYE 119

Query: 125 NAS-VPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDC 183
           N + VPA++DWRKKGAVT +KDQGQCG CWAFS VAA EGIN +TT KL SLSEQELVDC
Sbjct: 120 NVTAVPATMDWRKKGAVTPIKDQGQCGSCWAFSTVAATEGINQLTTGKLVSLSEQELVDC 179

Query: 184 DTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDV 243
           D  GEDQGCEGGLM+D FEFII N G+ TEA YPY+A+DG+CN K+     AKI+GYE V
Sbjct: 180 DNQGEDQGCEGGLMEDGFEFIIKNHGITTEANYPYQAADGTCNSKKQASHIAKITGYESV 239

Query: 244 PSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGT 303
           P+N+EA L+K VANQP+SV+IDA GSDFQFYSSGVFTG+CGTELDHGVTAVGYG   DGT
Sbjct: 240 PANSEAELLKVVANQPISVSIDAGGSDFQFYSSGVFTGKCGTELDHGVTAVGYGETSDGT 299

Query: 304 KYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPTA 346
           KYWLVKNSW T+WGE GYIRMQRDIDA+EGLCGIAM +SYPTA
Sbjct: 300 KYWLVKNSWXTSWGEEGYIRMQRDIDAEEGLCGIAMDSSYPTA 342


>gi|224135841|ref|XP_002327317.1| predicted protein [Populus trichocarpa]
 gi|222835687|gb|EEE74122.1| predicted protein [Populus trichocarpa]
          Length = 342

 Score =  495 bits (1275), Expect = e-137,   Method: Compositional matrix adjust.
 Identities = 236/343 (68%), Positives = 277/343 (80%), Gaps = 5/343 (1%)

Query: 5   LLENKLVLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFK 64
           +   +   A IL+LG+WA +  SR L + +M+ RHE WM  +G+VY D AEKE RF+IFK
Sbjct: 4   ICRRQCFFAFILILGMWAYEVASRELQEPSMSARHEQWMETFGKVYADAAEKERRFEIFK 63

Query: 65  ENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYE 124
           +NVEYI SFN  A NKPYKL +N+FAD TNEE +  RNGY+R L + R  + T  SF+YE
Sbjct: 64  DNVEYIESFNT-AGNKPYKLSVNKFADLTNEELKVARNGYRRPLQT-RPMKVT--SFKYE 119

Query: 125 NAS-VPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDC 183
           N + VPA++DWRKKGAVT +KDQGQCG CWAFS VAA EGIN +TT KL SLSEQELVDC
Sbjct: 120 NVTAVPATMDWRKKGAVTPIKDQGQCGSCWAFSTVAATEGINQLTTGKLVSLSEQELVDC 179

Query: 184 DTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDV 243
           DT GEDQGCEGGLM+D FEFII N G+ TEA YPY+A+DG+CN K+     AKI+GYE V
Sbjct: 180 DTQGEDQGCEGGLMEDGFEFIIKNHGITTEANYPYQAADGTCNSKKEASRIAKITGYESV 239

Query: 244 PSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGT 303
           P+N+EAAL+KAVA+QP+SV+IDA GSDFQFYSSGVFTGQCGTELDHGVTAVGYG   DGT
Sbjct: 240 PANSEAALLKAVASQPISVSIDAGGSDFQFYSSGVFTGQCGTELDHGVTAVGYGETSDGT 299

Query: 304 KYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPTA 346
           KYWLVKNSWGT+WGE GYIRMQRD +A+EGLCGIAM +SYPTA
Sbjct: 300 KYWLVKNSWGTSWGEEGYIRMQRDTEAEEGLCGIAMDSSYPTA 342


>gi|37780051|gb|AAP32198.1| cysteine protease 12 [Trifolium repens]
          Length = 343

 Score =  495 bits (1274), Expect = e-137,   Method: Compositional matrix adjust.
 Identities = 235/335 (70%), Positives = 269/335 (80%), Gaps = 3/335 (0%)

Query: 12  LAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIA 71
           LA    LG+ A Q  SRTL D ++ ERHE WM  YG+VY++  E+E R +IF EN++YI 
Sbjct: 12  LALFFCLGLLAIQVTSRTLQDDSIFERHEQWMTHYGKVYKNPQEREKRLRIFTENLKYIE 71

Query: 72  SFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENASVPAS 131
           + NN    KPYKLGIN+FAD TNEEF A RN +K  + S     TT   F+YEN SVP++
Sbjct: 72  ASNNAGNKKPYKLGINQFADLTNEEFIASRNKFKGHMCSSIIRTTT---FKYENTSVPST 128

Query: 132 IDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQG 191
           +DWRKKGAVT VK+QGQCGCCWAFSA+AA EGI+ I+T KL SLSEQELVDCDT+G DQG
Sbjct: 129 VDWRKKGAVTPVKNQGQCGCCWAFSAIAATEGIHKISTGKLVSLSEQELVDCDTNGVDQG 188

Query: 192 CEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAAL 251
           CEGGLMDDAF+FII N G++TEA YPY+  DG+C   EA+ SAA I+GYEDVP+NNE AL
Sbjct: 189 CEGGLMDDAFKFIIQNNGISTEAGYPYQGVDGTCKANEASTSAATITGYEDVPANNENAL 248

Query: 252 MKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNS 311
            KAVANQP+SVAIDASGSDFQFY SGVFTG CGTELDHGVTAVGYG ++DGTKYWLVKNS
Sbjct: 249 QKAVANQPISVAIDASGSDFQFYKSGVFTGSCGTELDHGVTAVGYGISNDGTKYWLVKNS 308

Query: 312 WGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPTA 346
           WGT WGE GYIRMQR IDA EGLCGIAMQASYPTA
Sbjct: 309 WGTDWGEEGYIRMQRSIDAAEGLCGIAMQASYPTA 343


>gi|359485281|ref|XP_002280230.2| PREDICTED: LOW QUALITY PROTEIN: KDEL-tailed cysteine endopeptidase
           CEP1 [Vitis vinifera]
          Length = 341

 Score =  495 bits (1274), Expect = e-137,   Method: Compositional matrix adjust.
 Identities = 240/347 (69%), Positives = 276/347 (79%), Gaps = 7/347 (2%)

Query: 1   MAMILLENKLVLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRF 60
           MA +     + LA + VL  WA Q+ +R L++A+M ERHE WMAQYGRVY+D  EK  R+
Sbjct: 1   MASVNQYQYICLALLFVLAAWASQATARNLHEASMYERHEDWMAQYGRVYKDADEKSKRY 60

Query: 61  KIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVS 120
           KIFK+NV  I SFN KA +K YKL INEFAD TNEEF   RN +K  + S  ++     S
Sbjct: 61  KIFKDNVARIESFN-KAMDKSYKLSINEFADLTNEEFGTSRNRFKAHICSTEAT-----S 114

Query: 121 FRYENAS-VPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQE 179
           F+YEN + VP++IDWRKKGAVT +KDQGQCG CWAFSAVAAMEGI  ++T KL SLSEQE
Sbjct: 115 FKYENVTAVPSTIDWRKKGAVTPIKDQGQCGSCWAFSAVAAMEGITQLSTGKLISLSEQE 174

Query: 180 LVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISG 239
           LVDCDTSGEDQGC GGLMDDAF+FI  N GL TEA YPY  +DG+CN+K+A   AAKI+G
Sbjct: 175 LVDCDTSGEDQGCNGGLMDDAFKFIKQNHGLTTEANYPYAGTDGTCNRKKAAHPAAKING 234

Query: 240 YEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTA 299
           YEDVP+NNE AL KAV +QP++VAIDA G +FQFYSSGVFTGQCGTELDHGV AVGYGT+
Sbjct: 235 YEDVPANNEKALQKAVVHQPIAVAIDAGGFEFQFYSSGVFTGQCGTELDHGVAAVGYGTS 294

Query: 300 DDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPTA 346
           DDG KYWLVKNSWGT WGE GYIRMQRD+ AKEGLCGIAMQASYPTA
Sbjct: 295 DDGMKYWLVKNSWGTGWGEEGYIRMQRDVTAKEGLCGIAMQASYPTA 341


>gi|118627554|emb|CAL64936.1| putative cysteine protease 8 [Trifolium pratense]
          Length = 344

 Score =  493 bits (1269), Expect = e-137,   Method: Compositional matrix adjust.
 Identities = 234/338 (69%), Positives = 273/338 (80%), Gaps = 4/338 (1%)

Query: 10  LVLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEY 69
           + LA +  LG++A Q  SRTL D +M ERH  WM+QYG++Y+D+ E+E RFKIFKENV Y
Sbjct: 10  ISLALLFCLGLFAIQVTSRTLQDDSMYERHGQWMSQYGKIYKDHQERETRFKIFKENVNY 69

Query: 70  IASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENAS-V 128
           I +FNN    K YKLGIN+FAD TNEEF A RN +K  + S   S     SF+YEN S +
Sbjct: 70  IETFNNADDTKSYKLGINQFADLTNEEFIASRNKFKGHMCS---SIMRTTSFKYENVSGI 126

Query: 129 PASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGE 188
           P+++DWRKKGAVT VK+QGQCGCCWAFSAVAA EGI+ ++T KL SLSEQELVDCDT G 
Sbjct: 127 PSTVDWRKKGAVTPVKNQGQCGCCWAFSAVAATEGIHKLSTGKLISLSEQELVDCDTKGV 186

Query: 189 DQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNE 248
           DQGCEGGLMDDAF+FII N GL+TEA+YPY+  DG+CN  +A+  A  I+GYEDVP+N+E
Sbjct: 187 DQGCEGGLMDDAFKFIIQNHGLSTEAQYPYEGVDGTCNANKASVQAVTITGYEDVPANSE 246

Query: 249 AALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLV 308
            AL KAVANQP+SVAIDASGSDFQFY SGVFTG CGTELDHGVTAVGYG ++DGTKYWLV
Sbjct: 247 QALQKAVANQPISVAIDASGSDFQFYKSGVFTGACGTELDHGVTAVGYGVSNDGTKYWLV 306

Query: 309 KNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPTA 346
           KNSWGT WGE GYI MQR I+A EG+CGIAMQASYPTA
Sbjct: 307 KNSWGTDWGEEGYIMMQRGIEAAEGICGIAMQASYPTA 344


>gi|224081320|ref|XP_002306369.1| predicted protein [Populus trichocarpa]
 gi|222855818|gb|EEE93365.1| predicted protein [Populus trichocarpa]
          Length = 340

 Score =  493 bits (1268), Expect = e-137,   Method: Compositional matrix adjust.
 Identities = 237/338 (70%), Positives = 275/338 (81%), Gaps = 8/338 (2%)

Query: 10  LVLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEY 69
           + LA + VLG W  +S +RTL D +M ERHE WMAQYGRVY+D+AEKE R+ IFKENV  
Sbjct: 10  ICLALLFVLGAWPSKSAARTLQDVSMYERHEQWMAQYGRVYKDDAEKETRYNIFKENVAR 69

Query: 70  IASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENAS-V 128
           I +FN++   K YKLG+N+FAD +NEEF+A RN +K  + S ++       FRYEN S V
Sbjct: 70  IDAFNSQT-GKSYKLGVNQFADLSNEEFKASRNRFKGHMCSPQAG-----PFRYENVSAV 123

Query: 129 PASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGE 188
           PA++DWRKKGAVT VKDQGQCGCCWAFSAVAAMEGIN +TT KL SLSEQE+VDCDT GE
Sbjct: 124 PATMDWRKKGAVTPVKDQGQCGCCWAFSAVAAMEGINQLTTGKLISLSEQEVVDCDTKGE 183

Query: 189 DQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNE 248
           DQGC GGLMDDAF+FI  NKGL TEA YPY  +DG+CN ++    AAKI+G+EDVP+N+E
Sbjct: 184 DQGCNGGLMDDAFKFIEQNKGLTTEANYPYTGTDGTCNTQKEATHAAKITGFEDVPANSE 243

Query: 249 AALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLV 308
           AALMKAVA QPVSVAIDA G +FQFYSSG+FTG CGT+LDHGVTAVGYG + DGTKYWLV
Sbjct: 244 AALMKAVAKQPVSVAIDAGGFEFQFYSSGIFTGSCGTQLDHGVTAVGYGIS-DGTKYWLV 302

Query: 309 KNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPTA 346
           KNSWG  WGE GYIRMQ+DI AKEGLCGIAMQASYP+A
Sbjct: 303 KNSWGAQWGEEGYIRMQKDISAKEGLCGIAMQASYPSA 340


>gi|357474573|ref|XP_003607571.1| Cysteine proteinase EP-B [Medicago truncatula]
 gi|34329348|gb|AAQ63885.1| putative cysteine proteinase [Medicago truncatula]
 gi|355508626|gb|AES89768.1| Cysteine proteinase EP-B [Medicago truncatula]
          Length = 345

 Score =  492 bits (1266), Expect = e-136,   Method: Compositional matrix adjust.
 Identities = 235/342 (68%), Positives = 273/342 (79%), Gaps = 4/342 (1%)

Query: 6   LENKLVLAAILVLGVWAPQSWSRTL-NDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFK 64
           L + + LA    LG++A Q  SRTL +D+ + E+HE WM  YG+VY+D  E+E R KIFK
Sbjct: 7   LYHSISLALFFCLGLFAIQVTSRTLQDDSIIYEKHEQWMVHYGKVYKDLQERENRLKIFK 66

Query: 65  ENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYE 124
           ENV YI + NN   NK YKLGIN+FAD TNEEF A RN +K  + S   S T   +F+YE
Sbjct: 67  ENVNYIEASNNAGNNKLYKLGINQFADLTNEEFIASRNKFKGHMCS---SITKTSTFKYE 123

Query: 125 NASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCD 184
           NASVP+++DWRKKGAVT VK+QGQCGCCWAFSAVAA EGI+ ++T KL SLSEQELVDCD
Sbjct: 124 NASVPSTVDWRKKGAVTPVKNQGQCGCCWAFSAVAATEGIHKLSTGKLVSLSEQELVDCD 183

Query: 185 TSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVP 244
           T G DQGCEGGLMDDAF+FII N GL TEA+YPY+  DG+C+  +A+  A  I+GYEDVP
Sbjct: 184 TKGVDQGCEGGLMDDAFKFIIQNHGLNTEAQYPYQGVDGTCSANKASIHAVTITGYEDVP 243

Query: 245 SNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTK 304
           +NNE AL KAVANQP+SVAIDASGSDFQFY SGVFTG CGTELDHGVTAVGYG  +DGTK
Sbjct: 244 ANNEQALQKAVANQPISVAIDASGSDFQFYKSGVFTGSCGTELDHGVTAVGYGVGNDGTK 303

Query: 305 YWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPTA 346
           YWLVKNSWGT WGE GYI+MQR +DA EGLCGIAM+ASYPTA
Sbjct: 304 YWLVKNSWGTDWGEEGYIKMQRGVDAAEGLCGIAMEASYPTA 345


>gi|124484401|dbj|BAF46311.1| cysteine proteinase precursor [Ipomoea nil]
          Length = 339

 Score =  492 bits (1266), Expect = e-136,   Method: Compositional matrix adjust.
 Identities = 248/339 (73%), Positives = 278/339 (82%), Gaps = 7/339 (2%)

Query: 9   KLVLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVE 68
           KL++A  LV    A  + SRTL D+ M  RHE WMAQYGRVY++  EK  R+ IFKENVE
Sbjct: 7   KLLIALALVFATSAYLATSRTLLDSLMAVRHEQWMAQYGRVYKNEVEKTKRYNIFKENVE 66

Query: 69  YIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENAS- 127
           YI SFN KA  KPYKLGIN FAD TN+EF A RNGY   LP   SS T    FRYEN S 
Sbjct: 67  YIESFN-KAGTKPYKLGINAFADLTNKEFIASRNGY--ILPHECSSNT---PFRYENVSA 120

Query: 128 VPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSG 187
           VP ++DWRKKGAVT VKDQGQCGCCWAFSAVAAMEGI  ++T  L SLSEQELVDCD  G
Sbjct: 121 VPTTVDWRKKGAVTPVKDQGQCGCCWAFSAVAAMEGITKLSTGNLISLSEQELVDCDVKG 180

Query: 188 EDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNN 247
            DQGCEGGLMDDAF FII+NKGL TE+ YPY+ +DGSC K +++ SAAKISGYEDVP+N+
Sbjct: 181 IDQGCEGGLMDDAFTFIINNKGLTTESNYPYQGTDGSCKKSKSSNSAAKISGYEDVPANS 240

Query: 248 EAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWL 307
           E+AL KAVANQPVSVAIDA GSDFQFYSSGVFTG+CGTELDHGVTAVGYG A+DG+KYWL
Sbjct: 241 ESALEKAVANQPVSVAIDAGGSDFQFYSSGVFTGECGTELDHGVTAVGYGIAEDGSKYWL 300

Query: 308 VKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPTA 346
           VKNSWGT+WGE GYIRMQ+DI+AKEGLCGIAMQ+SYP+A
Sbjct: 301 VKNSWGTSWGEKGYIRMQKDIEAKEGLCGIAMQSSYPSA 339


>gi|356577813|ref|XP_003557017.1| PREDICTED: uncharacterized protein LOC100801364 [Glycine max]
          Length = 890

 Score =  491 bits (1265), Expect = e-136,   Method: Compositional matrix adjust.
 Identities = 235/338 (69%), Positives = 269/338 (79%), Gaps = 5/338 (1%)

Query: 10  LVLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEY 69
           + LA +L +   A Q   R+L DA+M ERHE WM +YG+VY+D  E+E RF+IFKENV Y
Sbjct: 557 ISLAMLLCMAFLAFQVTCRSLQDASMYERHEQWMTRYGKVYKDPQEREKRFRIFKENVNY 616

Query: 70  IASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENAS-V 128
           I +FNN A NK YKL IN+FAD TNEEF APRN +K  + S     TT   F+YEN + V
Sbjct: 617 IEAFNNAA-NKRYKLAINQFADLTNEEFIAPRNRFKGHMCSSIIRTTT---FKYENVTAV 672

Query: 129 PASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGE 188
           P+++DWR+KGAVT +KDQGQCGCCWAFSAVAA EGI+ +T+ KL SLSEQELVDCDT G 
Sbjct: 673 PSTVDWRQKGAVTPIKDQGQCGCCWAFSAVAATEGIHALTSGKLISLSEQELVDCDTKGV 732

Query: 189 DQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNE 248
           DQGCEGGLMDDAF+F+I N GL TEA YPYK  DG CN  EA      I+GYEDVP+NNE
Sbjct: 733 DQGCEGGLMDDAFKFVIQNHGLNTEANYPYKGVDGKCNANEAANDVVTITGYEDVPANNE 792

Query: 249 AALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLV 308
            AL KAVANQPVSVAIDASGSDFQFY SGVFTG CGTELDHGVTAVGYG ++DGT+YWLV
Sbjct: 793 KALQKAVANQPVSVAIDASGSDFQFYKSGVFTGSCGTELDHGVTAVGYGVSNDGTEYWLV 852

Query: 309 KNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPTA 346
           KNSWGT WGE GYIRMQR +D++EGLCGIAMQASYPTA
Sbjct: 853 KNSWGTEWGEEGYIRMQRGVDSEEGLCGIAMQASYPTA 890


>gi|409190991|gb|AFV30165.1| cysteine proteinase [Lotus japonicus]
          Length = 342

 Score =  491 bits (1263), Expect = e-136,   Method: Compositional matrix adjust.
 Identities = 231/338 (68%), Positives = 269/338 (79%), Gaps = 4/338 (1%)

Query: 9   KLVLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVE 68
           ++  A +L LG+WA Q  SRTL DA+M ERHE WMA+YGRVY+D  EKE RF IFKENV 
Sbjct: 9   QVSFALVLCLGLWAFQVSSRTLQDASMQERHEQWMARYGRVYKDLQEKEKRFSIFKENVN 68

Query: 69  YIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENASV 128
           YI + NN A +KPYKLG+N+FAD TNEEF A RN +K  + S  +  TT   F+YEN + 
Sbjct: 69  YIEASNN-AGDKPYKLGVNQFADLTNEEFIATRNKFKGHMSSSITRTTT---FKYENVTA 124

Query: 129 PASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGE 188
           P+++DWR++GAVT VK+QG CGCCWAFSAVAA EGI+ ++T  L SLSEQELVDCDTSG 
Sbjct: 125 PSTVDWRQEGAVTPVKNQGTCGCCWAFSAVAATEGIHKLSTGNLVSLSEQELVDCDTSGA 184

Query: 189 DQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNE 248
           DQGC+GGLMDDAF+FII N GL TEA+YPY+  DG+CN  E     A I+GYEDVPSNNE
Sbjct: 185 DQGCQGGLMDDAFKFIIQNGGLNTEAQYPYQGVDGTCNTNEEATHVATITGYEDVPSNNE 244

Query: 249 AALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLV 308
            AL +AVANQP+S+AIDASGSDFQ Y SGVFTG CGT+LDHGV  VGYG +DDGTKYWLV
Sbjct: 245 QALQQAVANQPISIAIDASGSDFQNYQSGVFTGSCGTQLDHGVAVVGYGVSDDGTKYWLV 304

Query: 309 KNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPTA 346
           KNSWG  WGE GYIRMQRD+DA EGLCG+AMQ SYPTA
Sbjct: 305 KNSWGADWGEEGYIRMQRDVDAPEGLCGLAMQPSYPTA 342


>gi|13491750|gb|AAK27968.1|AF242372_1 cysteine protease [Ipomoea batatas]
          Length = 339

 Score =  491 bits (1263), Expect = e-136,   Method: Compositional matrix adjust.
 Identities = 244/321 (76%), Positives = 272/321 (84%), Gaps = 7/321 (2%)

Query: 27  SRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGI 86
           SRTL+D+ M  RHE WMAQYGRVY+  AEK  RF IFKENVEYI SFN KA  KPYKLGI
Sbjct: 25  SRTLSDSLMVVRHEQWMAQYGRVYKTEAEKTKRFNIFKENVEYIESFN-KAGTKPYKLGI 83

Query: 87  NEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYEN-ASVPASIDWRKKGAVTGVKD 145
           N FAD TN+EF+A RNGYK  LP   SS T    FRYEN +SVP ++DWR KGAVT VKD
Sbjct: 84  NAFADLTNQEFKASRNGYK--LPHDCSSNT---PFRYENVSSVPTTVDWRTKGAVTPVKD 138

Query: 146 QGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFII 205
           QGQCGCCWAFSAVAAMEGI  ++T  L SLSEQELVDCD  G DQGCEGGLMDDAF FII
Sbjct: 139 QGQCGCCWAFSAVAAMEGITKLSTGNLISLSEQELVDCDVKGTDQGCEGGLMDDAFSFII 198

Query: 206 SNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAID 265
           +NKGL TE+ YPY+ +DGSC K +++ SAAKISGYEDVP+N+E+AL KAVANQPVSVAID
Sbjct: 199 NNKGLTTESNYPYQGTDGSCKKSKSSNSAAKISGYEDVPANSESALEKAVANQPVSVAID 258

Query: 266 ASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQ 325
           A GSDFQFYSSGVFTG+CGTELDHGVTAVGYG A+DG+KYWLVKNSWGT+WGE GYIRMQ
Sbjct: 259 AGGSDFQFYSSGVFTGECGTELDHGVTAVGYGIAEDGSKYWLVKNSWGTSWGEKGYIRMQ 318

Query: 326 RDIDAKEGLCGIAMQASYPTA 346
           +DI+AKEGLCGIAMQ+SYP+A
Sbjct: 319 KDIEAKEGLCGIAMQSSYPSA 339


>gi|10336513|dbj|BAB13759.1| cysteine proteinase [Astragalus sinicus]
          Length = 343

 Score =  490 bits (1262), Expect = e-136,   Method: Compositional matrix adjust.
 Identities = 234/340 (68%), Positives = 277/340 (81%), Gaps = 9/340 (2%)

Query: 10  LVLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEY 69
           + LA ++ LG+WA Q  SRTL DA+M ERH+ WM QY ++Y D+ E E RF+IFKENV Y
Sbjct: 10  ISLALLMCLGLWAVQVTSRTLQDASMYERHQQWMGQYAKIYNDHQEWEKRFQIFKENVNY 69

Query: 70  IASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPS--VRSSETTDVSFRYENAS 127
           I + +NK   + YKLG+N+F D TNEEF APRN +K  + S  +R++     +++YEN +
Sbjct: 70  IET-SNKEGGRFYKLGVNQFVDLTNEEFIAPRNRFKGHMCSSIIRTN-----TYKYENVT 123

Query: 128 -VPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTS 186
            VP+++DWR+KGAVT VKDQGQCGCCWAFSAVAA EGI+ ++T KL SLSEQELVDCDT 
Sbjct: 124 TVPSNVDWRQKGAVTPVKDQGQCGCCWAFSAVAATEGIHQLSTGKLISLSEQELVDCDTK 183

Query: 187 GEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSN 246
           G DQGCEGGLMDDAF+FII N GL TEAKYPY+  DG+CN  EA+ +AA I+ YEDVP+N
Sbjct: 184 GVDQGCEGGLMDDAFKFIIQNHGLDTEAKYPYQGVDGTCNANEASINAATITSYEDVPTN 243

Query: 247 NEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYW 306
           NE AL KAVANQP+SVAIDASGSDFQFY+SGVFTG CGTELDHGVTAVGYG +DDGTKYW
Sbjct: 244 NEQALQKAVANQPISVAIDASGSDFQFYTSGVFTGSCGTELDHGVTAVGYGVSDDGTKYW 303

Query: 307 LVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPTA 346
           LVKNSWGT+WGE GYIRMQR +DA EGLCGIAMQASYP A
Sbjct: 304 LVKNSWGTSWGEEGYIRMQRGVDAVEGLCGIAMQASYPIA 343


>gi|356577811|ref|XP_003557016.1| PREDICTED: vignain-like [Glycine max]
          Length = 343

 Score =  489 bits (1260), Expect = e-136,   Method: Compositional matrix adjust.
 Identities = 236/338 (69%), Positives = 271/338 (80%), Gaps = 5/338 (1%)

Query: 10  LVLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEY 69
           + LA +L +   A Q   R+L DA+M ERHE WM +YG+VY+D  E+E RF+IFKENV Y
Sbjct: 10  ISLAMLLCMAFLAFQVTCRSLQDASMYERHEQWMTRYGKVYKDPQEREKRFRIFKENVNY 69

Query: 70  IASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENAS-V 128
           I +FNN A NK YKL IN+FAD TNEEF APRN +K  + S     TT   F+YEN + V
Sbjct: 70  IEAFNNAA-NKRYKLAINQFADLTNEEFIAPRNRFKGHMCSSIIRTTT---FKYENVTAV 125

Query: 129 PASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGE 188
           P+++DWR+KGAVT +KDQGQCGCCWAFSAVAA EGI+ +T+ KL SLSEQELVDCDT G 
Sbjct: 126 PSTVDWRQKGAVTPIKDQGQCGCCWAFSAVAATEGIHALTSGKLISLSEQELVDCDTKGV 185

Query: 189 DQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNE 248
           DQGCEGGLMDDAF+F+I N GL TEA YPYK  DG CN  EA   AA I+GYEDVP+NNE
Sbjct: 186 DQGCEGGLMDDAFKFVIQNHGLNTEANYPYKGVDGKCNVNEAANDAATITGYEDVPANNE 245

Query: 249 AALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLV 308
            AL KAVANQPVSVAIDASGSDFQFY SGVFTG CGTELDHGVTAVGYG ++DGT+YWLV
Sbjct: 246 KALQKAVANQPVSVAIDASGSDFQFYKSGVFTGSCGTELDHGVTAVGYGVSNDGTEYWLV 305

Query: 309 KNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPTA 346
           KNSWGT WGE GYIRMQR ++++EGLCGIAMQASYPTA
Sbjct: 306 KNSWGTEWGEEGYIRMQRGVNSEEGLCGIAMQASYPTA 343


>gi|50355621|dbj|BAD29959.1| cysteine protease [Daucus carota]
          Length = 361

 Score =  489 bits (1259), Expect = e-136,   Method: Compositional matrix adjust.
 Identities = 234/338 (69%), Positives = 274/338 (81%), Gaps = 5/338 (1%)

Query: 10  LVLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEY 69
            ++AA+++LG WA Q+ SRTL +A+M ERHE WM QYGRVY+D AEK +RF+IF +NV++
Sbjct: 28  FMIAALILLGAWACQATSRTLPEASMFERHEQWMIQYGRVYKDEAEKSVRFQIFMDNVKF 87

Query: 70  IASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENAS-V 128
           I  FN   R + YKL +NEFADQTNEEF+A RNGYK  + S R S+TT   FRYEN + V
Sbjct: 88  IEEFNKDGR-QSYKLAVNEFADQTNEEFQASRNGYKMAVSS-RPSQTT--LFRYENVTAV 143

Query: 129 PASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGE 188
           P+S+DWRKKGAVT VKDQGQCG CWAFS +AA EGI  + T KL SLSEQELVDCD +GE
Sbjct: 144 PSSMDWRKKGAVTPVKDQGQCGSCWAFSTIAATEGITKLKTGKLISLSEQELVDCDKTGE 203

Query: 189 DQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNE 248
           DQGCEGG M+D FEFI+ NKG+A EA YPY A+DG+CN KE    AAKISGYE VP+N+E
Sbjct: 204 DQGCEGGYMEDGFEFIVKNKGIALEASYPYTAADGTCNSKEEASRAAKISGYEKVPANSE 263

Query: 249 AALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLV 308
            AL+KAVANQPVSV+IDASG  FQFYSSGVFTG+CGT+LDHGVTAVGYG   DGTKYWLV
Sbjct: 264 TALLKAVANQPVSVSIDASGVAFQFYSSGVFTGECGTDLDHGVTAVGYGKTSDGTKYWLV 323

Query: 309 KNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPTA 346
           KNSWG +WG++GYI MQR + AK GLCGIAM ASYPTA
Sbjct: 324 KNSWGASWGDSGYIMMQRGVAAKGGLCGIAMDASYPTA 361


>gi|357477459|ref|XP_003609015.1| Cysteine proteinase [Medicago truncatula]
 gi|355510070|gb|AES91212.1| Cysteine proteinase [Medicago truncatula]
          Length = 345

 Score =  489 bits (1258), Expect = e-135,   Method: Compositional matrix adjust.
 Identities = 236/342 (69%), Positives = 268/342 (78%), Gaps = 4/342 (1%)

Query: 6   LENKLVLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKE 65
           L   + L  I  LG+ A Q  SR+L   +M ERHE WM+QY +VY+D  E+E R KIF  
Sbjct: 7   LYYSIALTFIFCLGLCAIQVTSRSLQVDSMYERHEQWMSQYSKVYKDPQEREERHKIFTA 66

Query: 66  NVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYEN 125
           NV YI  FNN A NK YKLGIN+FAD TNEEF A RN +K  + S  +  TT   F+YEN
Sbjct: 67  NVNYIEVFNNDANNKLYKLGINQFADLTNEEFIASRNKFKGHMCSSIAKTTT---FKYEN 123

Query: 126 AS-VPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCD 184
            S +P+++DWRKKGAVT VK+QGQCGCCWAFSAVAA EGI  ++T KL SLSEQELVDCD
Sbjct: 124 VSAIPSTVDWRKKGAVTPVKNQGQCGCCWAFSAVAATEGITKLSTGKLVSLSEQELVDCD 183

Query: 185 TSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVP 244
           T G DQGCEGGLMDDAF+FII N GL+TEA YPY+  DG+CN  +A+  AA I+GYEDVP
Sbjct: 184 TKGVDQGCEGGLMDDAFKFIIQNHGLSTEAAYPYQGVDGTCNANKASIHAATITGYEDVP 243

Query: 245 SNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTK 304
           +NNE AL KAVANQP+SVAIDASGSDFQFY SGVF+G CGTELDHGVTAVGYG  +DGTK
Sbjct: 244 ANNEQALQKAVANQPISVAIDASGSDFQFYKSGVFSGSCGTELDHGVTAVGYGVGNDGTK 303

Query: 305 YWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPTA 346
           YWLVKNSWGT WGE GYIRMQR +DA EGLCGIAMQASYPTA
Sbjct: 304 YWLVKNSWGTDWGEEGYIRMQRGVDAAEGLCGIAMQASYPTA 345


>gi|144905108|dbj|BAF56428.1| cysteine proteinase [Lotus japonicus]
          Length = 342

 Score =  489 bits (1258), Expect = e-135,   Method: Compositional matrix adjust.
 Identities = 238/347 (68%), Positives = 279/347 (80%), Gaps = 6/347 (1%)

Query: 1   MAMILLENKLVLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRF 60
           MA   + N   LA +LV G  A ++ +RTL D ++ ERHE WM QYG+VY D+ EKE+R 
Sbjct: 1   MASKTVLNISSLALLLVFGFLAFEANARTLEDVSLKERHEQWMTQYGKVYTDSYEKELRS 60

Query: 61  KIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVS 120
            IFKENV+ I +FNN A NKPYKLGIN+FAD TNEEF+A RN +K  + S   + T   +
Sbjct: 61  NIFKENVQRIEAFNN-AGNKPYKLGINQFADLTNEEFKA-RNRFKGHMCS---NSTRTPT 115

Query: 121 FRYEN-ASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQE 179
           F+YE+ +SVPAS+DWR+KGAVT +KDQGQCGCCWAFSAVAA EGI  ++T KL SLSEQE
Sbjct: 116 FKYEDVSSVPASLDWRQKGAVTPIKDQGQCGCCWAFSAVAATEGITKLSTGKLISLSEQE 175

Query: 180 LVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISG 239
           LVDCDT G DQGCEGGLMDDAF+FI+ NKGL TEAKYPY+  D +CN       AA I G
Sbjct: 176 LVDCDTKGVDQGCEGGLMDDAFKFIMQNKGLNTEAKYPYQGVDATCNANAEAKDAASIKG 235

Query: 240 YEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTA 299
           +EDVP+N+E+AL+KAVANQP+SVAIDASGS+FQFYSSG+FTG CGTELDHGVTAVGYG +
Sbjct: 236 FEDVPANSESALLKAVANQPISVAIDASGSEFQFYSSGLFTGSCGTELDHGVTAVGYGVS 295

Query: 300 DDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPTA 346
           DDGTKYWLVKNSWG  WGE GYIRMQRD+ A+EGLCGIAMQASYPTA
Sbjct: 296 DDGTKYWLVKNSWGEQWGEEGYIRMQRDVAAEEGLCGIAMQASYPTA 342


>gi|24285904|gb|AAL14199.1| cysteine proteinase precursor [Ipomoea batatas]
 gi|56961686|gb|AAK15148.2| cysteine proteinase-like protein [Ipomoea batatas]
          Length = 341

 Score =  488 bits (1257), Expect = e-135,   Method: Compositional matrix adjust.
 Identities = 243/321 (75%), Positives = 271/321 (84%), Gaps = 7/321 (2%)

Query: 27  SRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGI 86
           SRTL+D+ M  RHE WMAQYGRVY +  EK  RF IFKENVEYI SFN KA  KPYKLGI
Sbjct: 27  SRTLSDSLMVVRHEQWMAQYGRVYENEVEKTKRFNIFKENVEYIESFN-KAGTKPYKLGI 85

Query: 87  NEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYEN-ASVPASIDWRKKGAVTGVKD 145
           N FAD TN+EF+A RNGYK  LP   SS T    FRYEN +SVP ++DWR KGAVT VKD
Sbjct: 86  NAFADLTNQEFKASRNGYK--LPHDCSSNT---PFRYENVSSVPTTVDWRTKGAVTPVKD 140

Query: 146 QGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFII 205
           QGQCGCCWAFSAVAAMEGI  ++T  L SLSEQELVDCD  G DQGCEGGLMDDAF FII
Sbjct: 141 QGQCGCCWAFSAVAAMEGITKLSTGNLISLSEQELVDCDVKGIDQGCEGGLMDDAFSFII 200

Query: 206 SNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAID 265
           +NKGL TE+ YPY+ +DGSC K +++ SAAKISGYEDVP+N+E+AL KAVANQPVSVAID
Sbjct: 201 NNKGLTTESNYPYQGTDGSCKKSKSSNSAAKISGYEDVPANSESALEKAVANQPVSVAID 260

Query: 266 ASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQ 325
           A GSDFQFYSSGVFTG+CGTELDHGVTAVGYG A+DG+KYWLVKNSWGT+WGE GYIRMQ
Sbjct: 261 AGGSDFQFYSSGVFTGECGTELDHGVTAVGYGIAEDGSKYWLVKNSWGTSWGEKGYIRMQ 320

Query: 326 RDIDAKEGLCGIAMQASYPTA 346
           +DI+AKEGLCGIAMQ+SYP+A
Sbjct: 321 KDIEAKEGLCGIAMQSSYPSA 341


>gi|356545063|ref|XP_003540965.1| PREDICTED: thiol protease SEN102-like [Glycine max]
          Length = 361

 Score =  488 bits (1256), Expect = e-135,   Method: Compositional matrix adjust.
 Identities = 235/338 (69%), Positives = 269/338 (79%), Gaps = 5/338 (1%)

Query: 10  LVLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEY 69
           + LA +L +   A Q   R+L DA+M ERHE WM +YG+VY+D  E+E RF+IFKENV Y
Sbjct: 28  ISLAMLLCMAFLAFQVTCRSLQDASMYERHEQWMTRYGKVYKDPQEREKRFRIFKENVNY 87

Query: 70  IASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENAS-V 128
           I +FNN A NK YKL IN+FAD TNEEF APRN +K  + S     TT   F+YEN + V
Sbjct: 88  IEAFNNAA-NKRYKLAINQFADLTNEEFIAPRNRFKGHMCSSIIRTTT---FKYENVTAV 143

Query: 129 PASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGE 188
           P+++DWR+KGAVT +KDQGQCGCCWAFSAVAA EGI+ +T+ KL SLSEQELVDCDT G 
Sbjct: 144 PSTVDWRQKGAVTPIKDQGQCGCCWAFSAVAATEGIHALTSGKLISLSEQELVDCDTKGV 203

Query: 189 DQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNE 248
           DQGCEGGLMDDAF+F+I N GL TEA YPYK  DG CN  EA      I+GYEDVP+NNE
Sbjct: 204 DQGCEGGLMDDAFKFVIQNHGLNTEANYPYKGVDGKCNANEAANDVVTITGYEDVPANNE 263

Query: 249 AALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLV 308
            AL KAVANQPVSVAIDASGSDFQFY SGVFTG CGTELDHGVTAVGYG ++DGT+YWLV
Sbjct: 264 KALQKAVANQPVSVAIDASGSDFQFYKSGVFTGSCGTELDHGVTAVGYGVSNDGTEYWLV 323

Query: 309 KNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPTA 346
           KNSWGT WGE GYIRMQR +D++EGLCGIAMQASYPTA
Sbjct: 324 KNSWGTEWGEEGYIRMQRGVDSEEGLCGIAMQASYPTA 361


>gi|224093956|ref|XP_002310053.1| predicted protein [Populus trichocarpa]
 gi|224147016|ref|XP_002336386.1| predicted protein [Populus trichocarpa]
 gi|222834869|gb|EEE73318.1| predicted protein [Populus trichocarpa]
 gi|222852956|gb|EEE90503.1| predicted protein [Populus trichocarpa]
          Length = 340

 Score =  487 bits (1253), Expect = e-135,   Method: Compositional matrix adjust.
 Identities = 235/338 (69%), Positives = 273/338 (80%), Gaps = 8/338 (2%)

Query: 10  LVLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEY 69
           + LA + +LG W  +S +RTL DA M ERHE WM QYGRVY+D+ E+  R+ IFKENV  
Sbjct: 10  VCLALLFILGAWPSKSTARTLLDAPMYERHEQWMTQYGRVYKDDNERATRYSIFKENVAR 69

Query: 70  IASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENAS-V 128
           I +FN++   K YKLG+N+FAD TNEEF+A RN +K  + S ++       FRYEN S V
Sbjct: 70  IDAFNSQT-GKSYKLGVNQFADLTNEEFKASRNRFKGHMCSPQAG-----PFRYENVSAV 123

Query: 129 PASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGE 188
           P+++DWRK+GAVT VKDQGQCGCCWAFSAVAAMEGIN +TT KL SLSEQE+VDCDT GE
Sbjct: 124 PSTVDWRKEGAVTPVKDQGQCGCCWAFSAVAAMEGINKLTTGKLISLSEQEVVDCDTKGE 183

Query: 189 DQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNE 248
           DQGC GGLMDDAF+FI  NKGL TEA YPYK +DG+CN  +A   AAKI+G+EDVP+N+E
Sbjct: 184 DQGCNGGLMDDAFKFIEQNKGLTTEANYPYKGTDGTCNTNKAAIHAAKITGFEDVPANSE 243

Query: 249 AALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLV 308
           AALMKAVA QPVSVAIDA GSDFQFYSSG+FTG C T+LDHGVTAVGYG + DG+KYWLV
Sbjct: 244 AALMKAVAKQPVSVAIDAGGSDFQFYSSGIFTGSCDTQLDHGVTAVGYGVS-DGSKYWLV 302

Query: 309 KNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPTA 346
           KNSWG  WGE GYIRMQ+DI AKEGLCGIAMQASYPTA
Sbjct: 303 KNSWGAQWGEEGYIRMQKDISAKEGLCGIAMQASYPTA 340


>gi|144905104|dbj|BAF56427.1| cysteine proteinase [Lotus japonicus]
          Length = 342

 Score =  487 bits (1253), Expect = e-135,   Method: Compositional matrix adjust.
 Identities = 229/338 (67%), Positives = 270/338 (79%), Gaps = 4/338 (1%)

Query: 9   KLVLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVE 68
           ++  A +L LG+WA Q  SRTL DA+M+ERHE WMA+YG+VY+D  EKE RF IF+ENV+
Sbjct: 9   QISFALVLCLGLWAFQVSSRTLQDASMHERHEQWMARYGKVYKDLQEKEKRFNIFQENVK 68

Query: 69  YIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENASV 128
           YI + NN A NKPYKLG+N+F D TN+EF A RN +K  + S  +  TT   F+YEN + 
Sbjct: 69  YIEASNN-AGNKPYKLGVNQFTDLTNKEFIATRNKFKGHMSSSITRTTT---FKYENVTA 124

Query: 129 PASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGE 188
           P+++DWR++GAVT VK+QG CGCCWAFSAVAA EGI+ ++T  L SLSEQELVDCDTSG 
Sbjct: 125 PSTVDWRQEGAVTPVKNQGTCGCCWAFSAVAATEGIHKLSTGNLVSLSEQELVDCDTSGA 184

Query: 189 DQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNE 248
           DQGC+GGLMDDAF+FII N GL TEA+YPY+  DG+CN  E     A I+GYEDVPSNNE
Sbjct: 185 DQGCQGGLMDDAFKFIIQNGGLNTEAQYPYQGVDGTCNTNEEVTHVATITGYEDVPSNNE 244

Query: 249 AALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLV 308
            AL +AVANQP+SVAIDASGSDFQ Y SGVFTG CGT+LDHGV  VGYG +DDGTKYWLV
Sbjct: 245 QALQQAVANQPISVAIDASGSDFQNYQSGVFTGSCGTQLDHGVAVVGYGVSDDGTKYWLV 304

Query: 309 KNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPTA 346
           KNSWG  WGE GYIRMQRD++A EGLCGIAMQ SYPTA
Sbjct: 305 KNSWGEDWGEEGYIRMQRDVEAPEGLCGIAMQPSYPTA 342


>gi|356515086|ref|XP_003526232.1| PREDICTED: vignain-like [Glycine max]
          Length = 343

 Score =  486 bits (1252), Expect = e-135,   Method: Compositional matrix adjust.
 Identities = 238/340 (70%), Positives = 273/340 (80%), Gaps = 5/340 (1%)

Query: 8   NKLVLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENV 67
           N + LA +L +   A Q   RTL DA+M ERHE WM +YG+VY+D  E+E RF++FKENV
Sbjct: 8   NHISLAMLLCMTFLAFQVTCRTLQDASMYERHEQWMTRYGKVYKDPQEREKRFRVFKENV 67

Query: 68  EYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENAS 127
            YI +FNN A NK YKLGIN+FAD TN+EF APRNG+K  + S     TT   F++EN +
Sbjct: 68  NYIEAFNNAA-NKSYKLGINQFADLTNKEFIAPRNGFKGHMCSSIIRTTT---FKFENVT 123

Query: 128 -VPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTS 186
             P+++DWR+KGAVT +KDQGQCGCCWAFSAVAA EGI+ ++  KL SLSEQELVDCDT 
Sbjct: 124 ATPSTVDWRQKGAVTPIKDQGQCGCCWAFSAVAATEGIHALSAGKLISLSEQELVDCDTK 183

Query: 187 GEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSN 246
           G DQGCEGGLMDDAF+FII N GL TEA YPYK  DG CN  EA  +AA I+GYEDVP+N
Sbjct: 184 GVDQGCEGGLMDDAFKFIIQNHGLNTEANYPYKGVDGKCNANEAAKNAATITGYEDVPAN 243

Query: 247 NEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYW 306
           NE AL KAVANQPVSVAIDASGSDFQFY SGVFTG CGTELDHGVTAVGYG +DDGT+YW
Sbjct: 244 NEMALQKAVANQPVSVAIDASGSDFQFYKSGVFTGSCGTELDHGVTAVGYGVSDDGTEYW 303

Query: 307 LVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPTA 346
           LVKNSWGT WGE GYIRMQR +D++EGLCGIAMQASYPTA
Sbjct: 304 LVKNSWGTEWGEEGYIRMQRGVDSEEGLCGIAMQASYPTA 343


>gi|224114698|ref|XP_002316833.1| predicted protein [Populus trichocarpa]
 gi|222859898|gb|EEE97445.1| predicted protein [Populus trichocarpa]
          Length = 305

 Score =  486 bits (1250), Expect = e-135,   Method: Compositional matrix adjust.
 Identities = 234/311 (75%), Positives = 262/311 (84%), Gaps = 8/311 (2%)

Query: 37  ERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEE 96
           ERHE WMAQYGR Y+ + EKE R  IFK NVE+I SFN K   KPYKL +NEFAD TNEE
Sbjct: 2   ERHETWMAQYGRAYKGHVEKERRLNIFKNNVEFIESFN-KVGKKPYKLSVNEFADLTNEE 60

Query: 97  FRAPRNGYKRRLPSVRSSETTDVSFRYENAS-VPASIDWRKKGAVTGVKDQGQCGCCWAF 155
           F+A RNGYK    S   S ++   FRYEN S VP+++DWRKKGAVT +KDQGQCGCCWAF
Sbjct: 61  FQASRNGYKM---SAHLSSSSTKPFRYENVSAVPSTMDWRKKGAVTPIKDQGQCGCCWAF 117

Query: 156 SAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAK 215
           SAVAA EGI  ++T KL SLSEQELVDCDTSGEDQGC GGLMDDAF+FII NKGL TEA 
Sbjct: 118 SAVAATEGITQLSTGKLISLSEQELVDCDTSGEDQGCNGGLMDDAFDFIIQNKGLTTEAN 177

Query: 216 YPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYS 275
           YPY+ +DG+CN  +A   AAKI+GYEDVP+N+EAAL+KAVANQPVSVAIDA GS FQFYS
Sbjct: 178 YPYQGADGACNSGKA---AAKITGYEDVPANSEAALLKAVANQPVSVAIDAGGSAFQFYS 234

Query: 276 SGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLC 335
           SGVFTG CGT+LDHGVTAVGYG +DDGTKYWLVKNSWGT+WGENGYIRM+RDIDA+EGLC
Sbjct: 235 SGVFTGDCGTDLDHGVTAVGYGMSDDGTKYWLVKNSWGTSWGENGYIRMERDIDAQEGLC 294

Query: 336 GIAMQASYPTA 346
           GIAM+ASYPTA
Sbjct: 295 GIAMEASYPTA 305


>gi|37780047|gb|AAP32196.1| cysteine protease 8 [Trifolium repens]
          Length = 343

 Score =  485 bits (1248), Expect = e-134,   Method: Compositional matrix adjust.
 Identities = 230/338 (68%), Positives = 272/338 (80%), Gaps = 5/338 (1%)

Query: 10  LVLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEY 69
           + LA +  LG++A Q  SRTL D +M ERH  WM+QYG++Y+D+ E+E RFKIF ENV Y
Sbjct: 10  ISLALVFCLGLFAIQVTSRTLQDDSMYERHGQWMSQYGKIYKDHQERETRFKIFTENVNY 69

Query: 70  IASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENAS-V 128
           + + +N    K YKLGIN+FAD TNEEF A RN +K  + S  +  TT   F+YEN S +
Sbjct: 70  VEA-SNADDTKSYKLGINQFADLTNEEFVASRNKFKGHMCSSITRTTT---FKYENVSAI 125

Query: 129 PASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGE 188
           P+++DWRKKGAVT VK+QGQCGCCWAFSAVAA EGI+ ++T KL SLSEQELVDCDT G 
Sbjct: 126 PSTVDWRKKGAVTPVKNQGQCGCCWAFSAVAATEGIHKLSTGKLISLSEQELVDCDTKGV 185

Query: 189 DQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNE 248
           DQGCEGGLMDDAF+FII N GL+TEA+YPY+  DG+CN  +A+  A  I+GYEDVP+N+E
Sbjct: 186 DQGCEGGLMDDAFKFIIQNHGLSTEAQYPYEGVDGTCNANKASVQAVTITGYEDVPANSE 245

Query: 249 AALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLV 308
            AL KAVANQP+SVAIDASGSDFQFY SGVFTG CGTELDHGVTAVGYG ++DGTKYWLV
Sbjct: 246 QALQKAVANQPISVAIDASGSDFQFYKSGVFTGSCGTELDHGVTAVGYGVSNDGTKYWLV 305

Query: 309 KNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPTA 346
           KNSWGT WGE GYI MQR ++A EGLCGIAMQASYPTA
Sbjct: 306 KNSWGTDWGEEGYIMMQRGVEAAEGLCGIAMQASYPTA 343


>gi|50355613|dbj|BAD29955.1| cysteine protease [Daucus carota]
          Length = 365

 Score =  484 bits (1247), Expect = e-134,   Method: Compositional matrix adjust.
 Identities = 237/348 (68%), Positives = 280/348 (80%), Gaps = 9/348 (2%)

Query: 1   MAMILLENKLVLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRF 60
           MA+ +      LA +  +GV A  + +R+LN+A+M E H+ WMA+YGRVY+   EK  R 
Sbjct: 1   MALTIKHQCTPLALLFTIGVLASLAAARSLNEASMTETHDQWMARYGRVYKTANEKNRRS 60

Query: 61  KIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVS 120
            IF+EN++YI +FN KA NKPYKLG+NEFAD TNEEF   RN +K  +     +  T+V 
Sbjct: 61  TIFQENLKYIQTFN-KANNKPYKLGVNEFADLTNEEFTTSRNKFKSHV----CATVTNV- 114

Query: 121 FRYENAS-VPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQE 179
           FRYEN + VPA++DWRKKGAVT +K+QGQCGCCWAFSAVAAMEGI  + T KL SLSEQE
Sbjct: 115 FRYENVTAVPATMDWRKKGAVTPIKNQGQCGCCWAFSAVAAMEGITQLKTGKLISLSEQE 174

Query: 180 LVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCN-KKEANPSAAKIS 238
           LVDCDT+GEDQGCEGGLMD AF+FI  N GL+TE  YPY  +DG+CN  KEAN  AA I+
Sbjct: 175 LVDCDTNGEDQGCEGGLMDYAFDFIQQNHGLSTETNYPYSGTDGTCNANKEAN-HAATIT 233

Query: 239 GYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGT 298
           G+EDVP+N+E+AL+KAVANQP+SVAIDASGSDFQFYSSGVFTG+CGTELDHGVTAVGYGT
Sbjct: 234 GHEDVPANSESALLKAVANQPISVAIDASGSDFQFYSSGVFTGECGTELDHGVTAVGYGT 293

Query: 299 ADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPTA 346
           A DGTKYWLVKNSWGT+WGE GYI+MQR + A EGLCGIAMQASYPTA
Sbjct: 294 AADGTKYWLVKNSWGTSWGEEGYIQMQRGVAAAEGLCGIAMQASYPTA 341


>gi|144905116|dbj|BAF56430.1| cysteine proteinase [Lotus japonicus]
          Length = 341

 Score =  483 bits (1244), Expect = e-134,   Method: Compositional matrix adjust.
 Identities = 238/347 (68%), Positives = 279/347 (80%), Gaps = 7/347 (2%)

Query: 1   MAMILLENKLVLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRF 60
           MA   + N   L  +LV G  + ++ +RTL DA+M+ERHE WMAQYG+VY+D+ EKE+R 
Sbjct: 1   MASKTVLNITSLTLLLVFGFLSFEANARTLEDASMHERHEQWMAQYGKVYKDSYEKELRS 60

Query: 61  KIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVS 120
           KIFKENV+ I +FNN A NK YKLGIN+FAD TNEEF+A RN +K  + S   + T   +
Sbjct: 61  KIFKENVQRIEAFNN-AGNKSYKLGINQFADLTNEEFKA-RNRFKGHMCS---NSTRTPT 115

Query: 121 FRYENA-SVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQE 179
           F+YE+  SVPAS+DWR+KGAVT +KDQGQCGCCWAFSAVAA EGI  ++T KL SLSEQE
Sbjct: 116 FKYEHVTSVPASLDWRQKGAVTPIKDQGQCGCCWAFSAVAATEGITKLSTGKLISLSEQE 175

Query: 180 LVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISG 239
           LVDCDT G DQGCEGGLMDDAF+FI+ NKGL TEAKYPY+  D +CN       AA I G
Sbjct: 176 LVDCDTKGVDQGCEGGLMDDAFKFIMQNKGLNTEAKYPYQGVDATCNANAEAKDAASIKG 235

Query: 240 YEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTA 299
           +EDVP+N+E+AL+KAVANQP+SVAIDASGS+FQFYSSGVFTG CGTELDHGVTAVGYG+ 
Sbjct: 236 FEDVPANSESALLKAVANQPISVAIDASGSEFQFYSSGVFTGSCGTELDHGVTAVGYGS- 294

Query: 300 DDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPTA 346
           D GTKYWLVKNSWG  WGE GYIRMQRD+ A+EGLCG AMQASYPTA
Sbjct: 295 DGGTKYWLVKNSWGEQWGEQGYIRMQRDVAAEEGLCGFAMQASYPTA 341


>gi|255580657|ref|XP_002531151.1| cysteine protease, putative [Ricinus communis]
 gi|223529264|gb|EEF31236.1| cysteine protease, putative [Ricinus communis]
          Length = 340

 Score =  482 bits (1240), Expect = e-133,   Method: Compositional matrix adjust.
 Identities = 234/347 (67%), Positives = 272/347 (78%), Gaps = 8/347 (2%)

Query: 1   MAMILLENKLVLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRF 60
           MA       + LA I +LG    Q+ +RTL DA+M+E+HE WM+++GRVY D  EKE+R+
Sbjct: 1   MAFTTRNGCISLALIFLLGALVSQAMARTLQDASMHEKHEEWMSRFGRVYNDGNEKEIRY 60

Query: 61  KIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVS 120
           KIFKENV+ I SFN KA  K YKLGIN+FAD TNEEF+  RN +K  + S ++       
Sbjct: 61  KIFKENVQRIESFN-KASGKSYKLGINQFADLTNEEFKTSRNRFKGHMCSSQAG-----P 114

Query: 121 FRYEN-ASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQE 179
           FRYEN  + P+S+DWRKKGAVT +KDQGQCG CWAFSAVAA+EGI  + T KL SLSEQE
Sbjct: 115 FRYENLTAAPSSMDWRKKGAVTAIKDQGQCGSCWAFSAVAAVEGITQLATSKLISLSEQE 174

Query: 180 LVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISG 239
           LVDCDT GEDQGC+GGLMDDAF+FI  N+GL TEA YPY+ SDG+CN K+    AAKI+G
Sbjct: 175 LVDCDTKGEDQGCQGGLMDDAFKFIEQNQGLTTEANYPYEGSDGTCNTKQEANHAAKING 234

Query: 240 YEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTA 299
           +EDVP+NNE ALMKAVA QPVSVAIDA G  FQFYSSG+FTG CGTELDHGV AVGYG +
Sbjct: 235 FEDVPANNEGALMKAVAKQPVSVAIDAGGFGFQFYSSGIFTGDCGTELDHGVAAVGYGES 294

Query: 300 DDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPTA 346
            +G  YWLVKNSWGT WGE GYIRMQ+DIDAKEGLCGIAMQASYPTA
Sbjct: 295 -NGMNYWLVKNSWGTQWGEEGYIRMQKDIDAKEGLCGIAMQASYPTA 340


>gi|356543038|ref|XP_003539970.1| PREDICTED: vignain-like [Glycine max]
          Length = 343

 Score =  481 bits (1239), Expect = e-133,   Method: Compositional matrix adjust.
 Identities = 237/338 (70%), Positives = 269/338 (79%), Gaps = 5/338 (1%)

Query: 10  LVLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEY 69
           + LA +  LG WA Q  SRTL DA+M ERHE WMA+Y +VY+D  E+E RFKIFKENV Y
Sbjct: 10  ISLALLFCLGFWAFQVTSRTLQDASMYERHEEWMARYAKVYKDPEEREKRFKIFKENVNY 69

Query: 70  IASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENAS-V 128
           I +FNN A NKPYKLGIN+FAD TNEEF APRN +K  + S   S T   +F+YEN + +
Sbjct: 70  IEAFNNAA-NKPYKLGINQFADLTNEEFIAPRNRFKGHMCS---SITRTTTFKYENVTAL 125

Query: 129 PASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGE 188
           P+++DWR+KGAVT +KDQGQCGCCWAFSAVAA EGI+ + + KL SLSEQE+VDCDT GE
Sbjct: 126 PSTVDWRQKGAVTPIKDQGQCGCCWAFSAVAATEGIHALNSGKLISLSEQEVVDCDTKGE 185

Query: 189 DQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNE 248
           DQGC GG MD AF+FII N GL TEA YPYKA DG CN  EA   AA I+GYEDVP NNE
Sbjct: 186 DQGCAGGFMDGAFKFIIQNHGLNTEANYPYKAVDGKCNANEAANHAATITGYEDVPVNNE 245

Query: 249 AALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLV 308
            AL KAVANQPVSVAIDASGSDFQFY +GVFTG CGT+LDHGVTAVGYG + DGT+YWLV
Sbjct: 246 KALQKAVANQPVSVAIDASGSDFQFYKTGVFTGSCGTQLDHGVTAVGYGVSADGTQYWLV 305

Query: 309 KNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPTA 346
           KNSWGT WGE GYI MQR + A+EGLCGIAM ASYPTA
Sbjct: 306 KNSWGTEWGEEGYIMMQRGVKAQEGLCGIAMMASYPTA 343


>gi|357474527|ref|XP_003607548.1| Cysteine protease [Medicago truncatula]
 gi|358347211|ref|XP_003637653.1| Cysteine protease [Medicago truncatula]
 gi|355503588|gb|AES84791.1| Cysteine protease [Medicago truncatula]
 gi|355508603|gb|AES89745.1| Cysteine protease [Medicago truncatula]
          Length = 345

 Score =  481 bits (1239), Expect = e-133,   Method: Compositional matrix adjust.
 Identities = 230/327 (70%), Positives = 265/327 (81%), Gaps = 4/327 (1%)

Query: 21  WAPQSWSRTL-NDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARN 79
           +A Q  SRTL +D+ + E+HE WM  YG+VY+D  E+E R KIFKENV YI + NN   N
Sbjct: 22  FAIQVTSRTLQDDSNIYEKHEQWMVHYGKVYKDLQERENRLKIFKENVNYIEASNNAGNN 81

Query: 80  KPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENASVPASIDWRKKGA 139
           K YKLGIN+FAD TNEEF A RN +K  + S   S T   +F+YENASVP+++DWRKKGA
Sbjct: 82  KLYKLGINQFADLTNEEFIASRNKFKGHMCS---SITKTSTFKYENASVPSTVDWRKKGA 138

Query: 140 VTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDD 199
           VT VK+QGQCGCCWAFSAVAA EGI+ ++T KL SLSEQELVDCDT G DQGCEGGLMDD
Sbjct: 139 VTPVKNQGQCGCCWAFSAVAATEGIHKLSTGKLVSLSEQELVDCDTKGVDQGCEGGLMDD 198

Query: 200 AFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQP 259
           AF+FII N GL TEA+YPY+  DG+C+  +A+  A  I+GYEDVP+NNE AL KAVANQP
Sbjct: 199 AFKFIIQNHGLNTEAQYPYQGVDGTCSANKASIHAVTITGYEDVPANNEQALQKAVANQP 258

Query: 260 VSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGEN 319
           +SVAIDASGSDFQFY SGVFTG CGTELDHGVTAVGYG  +DGTKYWLVKNSWGT WGE 
Sbjct: 259 ISVAIDASGSDFQFYKSGVFTGSCGTELDHGVTAVGYGVGNDGTKYWLVKNSWGTDWGEE 318

Query: 320 GYIRMQRDIDAKEGLCGIAMQASYPTA 346
           GYI+MQR +DA EGLCGIAM+ASYPTA
Sbjct: 319 GYIKMQRGVDAAEGLCGIAMEASYPTA 345


>gi|255580659|ref|XP_002531152.1| cysteine protease, putative [Ricinus communis]
 gi|223529265|gb|EEF31237.1| cysteine protease, putative [Ricinus communis]
          Length = 340

 Score =  481 bits (1238), Expect = e-133,   Method: Compositional matrix adjust.
 Identities = 233/347 (67%), Positives = 273/347 (78%), Gaps = 8/347 (2%)

Query: 1   MAMILLENKLVLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRF 60
           MA  +    + LA I  LG  A Q+ +RTL DA+++E+HE WM ++ RVY D  EKE+R+
Sbjct: 1   MAFTIRHGCISLALIFFLGALASQAIARTLQDASIHEKHEEWMTRFKRVYSDAKEKEIRY 60

Query: 61  KIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVS 120
           KIFKENV+ I SFN KA  K YKLGIN+FAD TNEEF+  RN +K  + S ++       
Sbjct: 61  KIFKENVQRIESFN-KASEKSYKLGINQFADLTNEEFKTSRNRFKGHMCSSQAG-----P 114

Query: 121 FRYEN-ASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQE 179
           FRYEN  +VP+S+DWRK+GAVT +KDQGQCG CWAFSAVAA+EGI  + T KL SLSEQE
Sbjct: 115 FRYENITAVPSSMDWRKEGAVTAIKDQGQCGSCWAFSAVAAVEGITQLATSKLISLSEQE 174

Query: 180 LVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISG 239
           LVDCDT GEDQGC+GGLMDDAF+FI  N+GL TEA YPY+ SDG+CN K+    AAKI+G
Sbjct: 175 LVDCDTKGEDQGCQGGLMDDAFKFIEQNQGLTTEANYPYEGSDGTCNTKQEANHAAKING 234

Query: 240 YEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTA 299
           +EDVP+NNE ALMKAVA QPVSVAIDA G +FQFYSSG+FTG CGTELDHGV AVGYG +
Sbjct: 235 FEDVPANNEGALMKAVAKQPVSVAIDAGGFEFQFYSSGIFTGDCGTELDHGVAAVGYGES 294

Query: 300 DDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPTA 346
            +G  YWLVKNSWGT WGE GYIRMQ+DIDAKEGLCGIAMQASYPTA
Sbjct: 295 -NGMNYWLVKNSWGTQWGEEGYIRMQKDIDAKEGLCGIAMQASYPTA 340


>gi|356543076|ref|XP_003539989.1| PREDICTED: vignain-like [Glycine max]
          Length = 343

 Score =  480 bits (1235), Expect = e-133,   Method: Compositional matrix adjust.
 Identities = 236/338 (69%), Positives = 269/338 (79%), Gaps = 5/338 (1%)

Query: 10  LVLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEY 69
           + LA +  LG WA Q  SRTL DA+M ERHE WMA+Y +VY+D  E+E RFKIFKENV Y
Sbjct: 10  ISLALLFCLGFWAFQVTSRTLQDASMYERHEEWMARYAKVYKDPEEREKRFKIFKENVNY 69

Query: 70  IASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENAS-V 128
           I +FNN A +KPYKLGIN+FAD TNEEF APRN +K  + S  +  TT   F+YEN + +
Sbjct: 70  IEAFNNAA-DKPYKLGINQFADLTNEEFIAPRNKFKGHMCSSITRTTT---FKYENVTAL 125

Query: 129 PASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGE 188
           P+++DWR+KGAVT +KDQGQCGCCWAFSAVAA EGI+ + + KL SLSEQE+VDCDT GE
Sbjct: 126 PSTVDWRQKGAVTPIKDQGQCGCCWAFSAVAATEGIHALNSGKLISLSEQEVVDCDTKGE 185

Query: 189 DQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNE 248
           DQGC GG MD AF+FII N GL TEA YPYKA DG CN  EA   AA I+GYEDVP NNE
Sbjct: 186 DQGCAGGFMDGAFKFIIQNHGLNTEANYPYKAVDGKCNANEAANHAATITGYEDVPVNNE 245

Query: 249 AALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLV 308
            AL KAVANQPVSVAIDASGSDFQFY +GVFTG CGT+LDHGVTAVGYG + DGT+YWLV
Sbjct: 246 KALQKAVANQPVSVAIDASGSDFQFYKTGVFTGSCGTQLDHGVTAVGYGVSADGTQYWLV 305

Query: 309 KNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPTA 346
           KNSWGT WGE GYI MQR + A+EGLCGIAM ASYPTA
Sbjct: 306 KNSWGTEWGEEGYIMMQRGVKAQEGLCGIAMMASYPTA 343


>gi|225446589|ref|XP_002280263.1| PREDICTED: vignain [Vitis vinifera]
          Length = 339

 Score =  480 bits (1235), Expect = e-133,   Method: Compositional matrix adjust.
 Identities = 239/337 (70%), Positives = 273/337 (81%), Gaps = 11/337 (3%)

Query: 12  LAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIA 71
           +A + +L  WA Q+ SR+L++A+M ERHE WMA+YGR+Y+D  EKE RFKIFK+NV  I 
Sbjct: 12  MALLFILAAWASQATSRSLHEASMYERHEDWMARYGRMYKDANEKEKRFKIFKDNVARIE 71

Query: 72  SFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENAS-VPA 130
           SFN KA +K YKL INEFAD TNEEFR+ RN +K  +     SE T  +F+YEN + VP+
Sbjct: 72  SFN-KAMDKTYKLSINEFADLTNEEFRSLRNRFKAHI----CSEAT--TFKYENVTAVPS 124

Query: 131 SIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQ 190
           +IDWRKKGAVT +KDQ QCGCCWAFSAVAA EGI  ITT KL SLSEQELVDCDT GE+Q
Sbjct: 125 TIDWRKKGAVTPIKDQQQCGCCWAFSAVAATEGITQITTGKLISLSEQELVDCDTGGENQ 184

Query: 191 GCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCN-KKEANPSAAKISGYEDVPSNNEA 249
           GC GGLMDDAF FI  + GLA+EA YPY+  DG+CN KKEA+P AAKI GYEDVP+NNE 
Sbjct: 185 GCSGGLMDDAFRFIKIH-GLASEATYPYEGDDGTCNSKKEAHP-AAKIKGYEDVPANNEK 242

Query: 250 ALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVK 309
           AL KAVA+QPV+VAIDA G +FQFY+SGVFTGQCGTELDHGV AVGYG  DDG  YWLVK
Sbjct: 243 ALQKAVAHQPVAVAIDAGGFEFQFYTSGVFTGQCGTELDHGVAAVGYGIGDDGMMYWLVK 302

Query: 310 NSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPTA 346
           NSWGT WGE GYIRMQRD+ AKEGLCGIAMQASYPTA
Sbjct: 303 NSWGTGWGEEGYIRMQRDVTAKEGLCGIAMQASYPTA 339


>gi|357474579|ref|XP_003607574.1| Cysteine protease [Medicago truncatula]
 gi|355508629|gb|AES89771.1| Cysteine protease [Medicago truncatula]
          Length = 345

 Score =  479 bits (1234), Expect = e-133,   Method: Compositional matrix adjust.
 Identities = 229/327 (70%), Positives = 264/327 (80%), Gaps = 4/327 (1%)

Query: 21  WAPQSWSRTL-NDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARN 79
           +A Q  SRTL +D+ + E+HE WM  YG+VY+D  E+E R KIFKENV YI + NN   N
Sbjct: 22  FAIQVTSRTLQDDSIIYEKHEQWMVHYGKVYKDLQERENRLKIFKENVNYIEASNNAGNN 81

Query: 80  KPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENASVPASIDWRKKGA 139
           K YKLGIN+FAD TNEEF A RN +K  + S   S T   +F+YENASVP+++DWRKKGA
Sbjct: 82  KLYKLGINQFADITNEEFIASRNKFKGHMCS---SITKTSTFKYENASVPSTVDWRKKGA 138

Query: 140 VTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDD 199
           VT VK+QGQCGCCWAFSAVAA EGI+ ++T KL SLSEQELVDCDT G DQGCEGGLMDD
Sbjct: 139 VTPVKNQGQCGCCWAFSAVAATEGIHKLSTGKLVSLSEQELVDCDTKGVDQGCEGGLMDD 198

Query: 200 AFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQP 259
           AF+FII N GL TEA+YPY+  DG+C+  E +  AA I+GYEDVP+NNE AL KAVANQP
Sbjct: 199 AFKFIIQNHGLHTEAQYPYQGVDGTCSANETSTPAATIAGYEDVPANNENALQKAVANQP 258

Query: 260 VSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGEN 319
           +SVAIDASGSDFQFY SGVFTG CGT+LDHGVTAVGYG ++DGTKYWLVKNSWG  WGE 
Sbjct: 259 ISVAIDASGSDFQFYKSGVFTGSCGTQLDHGVTAVGYGISNDGTKYWLVKNSWGNDWGEE 318

Query: 320 GYIRMQRDIDAKEGLCGIAMQASYPTA 346
           GYIRMQR +DA +GLCGIAM ASYPTA
Sbjct: 319 GYIRMQRSVDAAQGLCGIAMMASYPTA 345


>gi|356517348|ref|XP_003527349.1| PREDICTED: vignain-like [Glycine max]
          Length = 343

 Score =  478 bits (1231), Expect = e-132,   Method: Compositional matrix adjust.
 Identities = 232/339 (68%), Positives = 263/339 (77%), Gaps = 5/339 (1%)

Query: 9   KLVLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVE 68
           ++ LA +   G  A Q   RTL DA+M ERHE WM +Y +VY+D  E+E RFKIFKENV 
Sbjct: 9   QISLALLFCSGFLAFQVTCRTLQDASMYERHEEWMGRYAKVYKDPQERERRFKIFKENVN 68

Query: 69  YIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENAS- 127
           YI +FNN A NKPY LGIN+FAD TNEEF APRN +K  + S   S T   +F+YEN + 
Sbjct: 69  YIEAFNNAA-NKPYTLGINQFADLTNEEFIAPRNRFKGHMCS---SITRTTTFKYENVTA 124

Query: 128 VPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSG 187
           +P+++DWR+KGAVT +KDQGQCGCCWAFSAVAA EGI+ ++  KL SLSEQE+VDCDT G
Sbjct: 125 IPSTVDWRQKGAVTPIKDQGQCGCCWAFSAVAATEGIHALSAGKLISLSEQEVVDCDTKG 184

Query: 188 EDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNN 247
           EDQGC GG MD AF+FII N GL  E  YPYKA DG CN K A    A I+GYEDVP NN
Sbjct: 185 EDQGCAGGFMDGAFKFIIQNHGLNNEPNYPYKAVDGKCNAKAAANHVATITGYEDVPVNN 244

Query: 248 EAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWL 307
           E AL KAVANQPVSVAIDASGSDFQFY SGVFTG CGTELDHGVTAVGYG + DGT+YWL
Sbjct: 245 EKALQKAVANQPVSVAIDASGSDFQFYQSGVFTGSCGTELDHGVTAVGYGVSADGTEYWL 304

Query: 308 VKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPTA 346
           VKNSWGT WGE GYIRMQR + A+EGLCGIAM ASYPTA
Sbjct: 305 VKNSWGTEWGEEGYIRMQRGVKAEEGLCGIAMMASYPTA 343


>gi|356577763|ref|XP_003556992.1| PREDICTED: vignain-like [Glycine max]
          Length = 343

 Score =  477 bits (1227), Expect = e-132,   Method: Compositional matrix adjust.
 Identities = 231/339 (68%), Positives = 262/339 (77%), Gaps = 5/339 (1%)

Query: 9   KLVLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVE 68
           ++ LA +   G    Q   RTL DA+M ERHE WM +Y +VY+D  E+E RFKIFKENV 
Sbjct: 9   QISLALLFCSGFLTFQVTCRTLQDASMYERHEEWMGRYAKVYKDPQERERRFKIFKENVN 68

Query: 69  YIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENAS- 127
           YI +FNN A NKPY LGIN+FAD TNEEF APRN +K  + S   S T   +F+YEN + 
Sbjct: 69  YIEAFNNAA-NKPYTLGINQFADLTNEEFIAPRNRFKGHMCS---SITRTTTFKYENVTA 124

Query: 128 VPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSG 187
           +P+++DWR+KGAVT +KDQGQCGCCWAFSAVAA EGI+ ++  KL SLSEQE+VDCDT G
Sbjct: 125 IPSTVDWRQKGAVTPIKDQGQCGCCWAFSAVAATEGIHALSAGKLISLSEQEVVDCDTKG 184

Query: 188 EDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNN 247
           EDQGC GG MD AF+FII N GL  E  YPYKA DG CN K A    A I+GYEDVP NN
Sbjct: 185 EDQGCAGGFMDGAFKFIIQNHGLNNEPNYPYKAVDGKCNAKAAANHVATITGYEDVPVNN 244

Query: 248 EAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWL 307
           E AL KAVANQPVSVAIDASGSDFQFY SGVFTG CGTELDHGVTAVGYG + DGT+YWL
Sbjct: 245 EKALQKAVANQPVSVAIDASGSDFQFYQSGVFTGSCGTELDHGVTAVGYGVSADGTEYWL 304

Query: 308 VKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPTA 346
           VKNSWGT WGE GYIRMQR + A+EGLCGIAM ASYPTA
Sbjct: 305 VKNSWGTEWGEEGYIRMQRGVKAEEGLCGIAMMASYPTA 343


>gi|357471211|ref|XP_003605890.1| Cysteine proteinase [Medicago truncatula]
 gi|355506945|gb|AES88087.1| Cysteine proteinase [Medicago truncatula]
          Length = 343

 Score =  476 bits (1226), Expect = e-132,   Method: Compositional matrix adjust.
 Identities = 229/327 (70%), Positives = 263/327 (80%), Gaps = 5/327 (1%)

Query: 21  WAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNK 80
           +A Q  SRTL D  M ERH  WM+QYG+VY+D+ E+E RFKIF ENV YI +FN    NK
Sbjct: 21  FAIQVTSRTLQD-DMYERHRQWMSQYGKVYKDSQEREKRFKIFTENVNYIEAFNKGDNNK 79

Query: 81  PYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENAS-VPASIDWRKKGA 139
            Y LG+N+FAD TN+EF + RN +K  + S   S T   +F+YENAS +P+S+DWRKKGA
Sbjct: 80  LYTLGVNQFADLTNDEFTSSRNKFKGHMCS---SITRTSTFKYENASAIPSSVDWRKKGA 136

Query: 140 VTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDD 199
           VT VK+QGQCGCCWAFSAVAA EGI+ ++T KL SLSEQELVDCDT G DQGCEGGLMDD
Sbjct: 137 VTPVKNQGQCGCCWAFSAVAATEGIHKLSTGKLISLSEQELVDCDTKGVDQGCEGGLMDD 196

Query: 200 AFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQP 259
           AF+FII N GL TEA YPY+  DG+CN  + + +A  I+GYEDVP+NNE AL KAVANQP
Sbjct: 197 AFKFIIQNHGLNTEANYPYQGVDGTCNANKGSINAVTITGYEDVPTNNEQALQKAVANQP 256

Query: 260 VSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGEN 319
           +SVAIDASGSDFQFY SGVFTG CGTELDHGVTAVGYG ++DGTKYWLVKNSWGT WGE 
Sbjct: 257 ISVAIDASGSDFQFYKSGVFTGSCGTELDHGVTAVGYGVSNDGTKYWLVKNSWGTEWGEE 316

Query: 320 GYIRMQRDIDAKEGLCGIAMQASYPTA 346
           GYI MQR +DA EGLCGIAMQASYPTA
Sbjct: 317 GYIMMQRGVDAAEGLCGIAMQASYPTA 343


>gi|535454|gb|AAA50755.1| cysteine proteinase [Alnus glutinosa]
          Length = 340

 Score =  476 bits (1224), Expect = e-132,   Method: Compositional matrix adjust.
 Identities = 232/346 (67%), Positives = 272/346 (78%), Gaps = 8/346 (2%)

Query: 3   MILLENKLVLAAILVLGVWAPQ-SWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFK 61
           M  +     L  ++ LG  A Q + +R+L DA+M ERHE WMA YGRVY+D  EK+ R+K
Sbjct: 1   MGFVSQCFCLVVMVTLGALASQLAAARSLQDASMRERHEEWMASYGRVYKDINEKQKRYK 60

Query: 62  IFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSF 121
           IF+ENV  I S +NK  NKPYKL +N+FAD TNEEF+A RN +K  + S +S+     SF
Sbjct: 61  IFEENVALIES-SNKDANKPYKLSVNQFADLTNEEFKASRNRFKGHICSTKST-----SF 114

Query: 122 RYENAS-VPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQEL 180
           +Y N S VP+++DWR KGAVT VKDQGQCGCCWAFSAVAA EGI  +TT +L SLSEQEL
Sbjct: 115 KYGNVSAVPSAMDWRMKGAVTPVKDQGQCGCCWAFSAVAATEGITKLTTGELISLSEQEL 174

Query: 181 VDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGY 240
           VDCDTSG DQGCEGGLMD+AF FI  N GLA+EA YPYK  DG+CN  +    AA+I+G+
Sbjct: 175 VDCDTSGVDQGCEGGLMDNAFTFIQHNHGLASEANYPYKGVDGTCNTNKQAIHAAEINGF 234

Query: 241 EDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTAD 300
           EDVP+N+E AL+ AVA+QPVSVAIDA GS FQFYS GVF G CGT+LDHGVTAVGYGT+D
Sbjct: 235 EDVPANSEEALLNAVAHQPVSVAIDAGGSGFQFYSKGVFIGACGTQLDHGVTAVGYGTSD 294

Query: 301 DGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPTA 346
           DGTKYWLVKNSWGT WGE GYIRMQRD+DAKEGLCGIAM+ASYPTA
Sbjct: 295 DGTKYWLVKNSWGTQWGEEGYIRMQRDVDAKEGLCGIAMKASYPTA 340


>gi|356517358|ref|XP_003527354.1| PREDICTED: LOW QUALITY PROTEIN: vignain-like [Glycine max]
 gi|356577767|ref|XP_003556994.1| PREDICTED: LOW QUALITY PROTEIN: vignain-like [Glycine max]
          Length = 343

 Score =  474 bits (1220), Expect = e-131,   Method: Compositional matrix adjust.
 Identities = 231/339 (68%), Positives = 262/339 (77%), Gaps = 5/339 (1%)

Query: 9   KLVLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVE 68
           ++ LA +   G  A Q   RTL DA+M ERHE WM +Y +VY+D  E+E RFKIFKENV 
Sbjct: 9   QISLALLFCSGFLAFQVTCRTLQDASMYERHEEWMGRYAKVYKDPQERERRFKIFKENVN 68

Query: 69  YIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENAS- 127
           YI +FNN A NKPY LGIN+FAD TNEEF APRN +K  + S   S T   +F+YEN + 
Sbjct: 69  YIEAFNNAA-NKPYTLGINQFADLTNEEFIAPRNRFKGHMCS---SITRTTTFKYENVTA 124

Query: 128 VPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSG 187
           +P+++DWR+KGAVT +KDQGQCGCCWAFSAVAA EGI+ ++  KL SLSEQE+VDCDT G
Sbjct: 125 IPSTVDWRQKGAVTPIKDQGQCGCCWAFSAVAATEGIHALSAGKLISLSEQEVVDCDTKG 184

Query: 188 EDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNN 247
           EDQGC GG MD AF+FII N GL  E  YPYKA DG CN K A    A I+GYEDVP NN
Sbjct: 185 EDQGCAGGFMDGAFKFIIQNHGLNNEPNYPYKAVDGKCNAKAAANHVATITGYEDVPVNN 244

Query: 248 EAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWL 307
           E AL KAVANQPVSVAIDASGSDFQFY SGVFTG CGTELDHGVTAVGYG + DGT+YWL
Sbjct: 245 EKALQKAVANQPVSVAIDASGSDFQFYQSGVFTGSCGTELDHGVTAVGYGVSADGTEYWL 304

Query: 308 VKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPTA 346
           VKNSWGT WGE GYIRMQR + A+EGL GIAM ASYPTA
Sbjct: 305 VKNSWGTEWGEEGYIRMQRGVKAEEGLXGIAMMASYPTA 343


>gi|356517350|ref|XP_003527350.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like [Glycine
           max]
 gi|356577765|ref|XP_003556993.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like [Glycine
           max]
          Length = 343

 Score =  471 bits (1212), Expect = e-130,   Method: Compositional matrix adjust.
 Identities = 234/347 (67%), Positives = 266/347 (76%), Gaps = 5/347 (1%)

Query: 1   MAMILLENKLVLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRF 60
           MA  +  + + LA    LG  A Q  SRTL DA+M ERHE WMA+YG+VY+D  EKE RF
Sbjct: 1   MATKIQFHHISLALFFCLGFLAFQVASRTLQDASMYERHEQWMARYGKVYKDPEEKEKRF 60

Query: 61  KIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVS 120
           ++FKENV YI +FNN A NKPYKLGIN+FAD T+EEF  PRN +       RSS T   +
Sbjct: 61  RVFKENVNYIEAFNNAA-NKPYKLGINQFADLTSEEFIVPRNRFNGHT---RSSNTRTTT 116

Query: 121 FRYENASV-PASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQE 179
           F+YEN +V P SIDWR+KGAVT +K+QG CGCCWAFSA+AA EGI+ I+T KL SLSEQE
Sbjct: 117 FKYENVTVLPDSIDWRQKGAVTPIKNQGSCGCCWAFSAIAATEGIHKISTGKLVSLSEQE 176

Query: 180 LVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISG 239
           +VDCDT G D GCEGG MD AF+FII N G+ TEA YPYK  DG CN KE    AA I+G
Sbjct: 177 VVDCDTKGTDHGCEGGYMDGAFKFIIQNHGINTEASYPYKGVDGKCNIKEEAVHAATITG 236

Query: 240 YEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTA 299
           YEDVP NNE AL KAVANQPVSVAIDASG+DFQFY SG+FTG CGTELDHGVTAVGYG  
Sbjct: 237 YEDVPINNEKALQKAVANQPVSVAIDASGADFQFYKSGIFTGSCGTELDHGVTAVGYGEN 296

Query: 300 DDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPTA 346
           ++GTKYWLVKNSWGT WGE GYI MQR + A EG+CGIAM ASYPTA
Sbjct: 297 NEGTKYWLVKNSWGTEWGEEGYIMMQRGVKAVEGICGIAMMASYPTA 343


>gi|356542633|ref|XP_003539771.1| PREDICTED: LOW QUALITY PROTEIN: vignain-like [Glycine max]
          Length = 341

 Score =  469 bits (1207), Expect = e-130,   Method: Compositional matrix adjust.
 Identities = 230/337 (68%), Positives = 264/337 (78%), Gaps = 6/337 (1%)

Query: 11  VLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYI 70
            LA  L+    A ++ +RTL DA M ERHE WMA +G+VY+ + EKE +++IF ENV+ I
Sbjct: 10  TLALFLIFAFCAFEANARTLEDAPMRERHEQWMATHGKVYKHSYEKEQKYQIFMENVQRI 69

Query: 71  ASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENAS-VP 129
            +FNN A  KPYKLGIN FAD TNEEF+A  N +K  + S R+  TT   FRYEN + VP
Sbjct: 70  EAFNN-AGXKPYKLGINHFADLTNEEFKAI-NRFKGHVCSKRTRTTT---FRYENVTAVP 124

Query: 130 ASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGED 189
           AS+DWR+KGAVT +KDQGQCGCCWAFSAVAA EGI  + T KL SLSEQELVDCDT G D
Sbjct: 125 ASLDWRQKGAVTPIKDQGQCGCCWAFSAVAATEGITKLRTGKLISLSEQELVDCDTKGVD 184

Query: 190 QGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEA 249
           QGCEGGLMDDAF+FI+ NKGLATEA YPY+  DG+CN K     A  I GYEDVP+N+E+
Sbjct: 185 QGCEGGLMDDAFKFILQNKGLATEAIYPYEGFDGTCNAKADGNHAGSIKGYEDVPANSES 244

Query: 250 ALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVK 309
           AL+KAVANQPVSVAI+ASG  FQFYS GVFTG CGT LDHGVT+VGYG  DDGTKYWLVK
Sbjct: 245 ALLKAVANQPVSVAIEASGFKFQFYSGGVFTGSCGTNLDHGVTSVGYGVGDDGTKYWLVK 304

Query: 310 NSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPTA 346
           NSWG  WGE GYIRMQRD+ AKEGLCGIAM ASYP+A
Sbjct: 305 NSWGVKWGEKGYIRMQRDVAAKEGLCGIAMLASYPSA 341


>gi|356554921|ref|XP_003545789.1| PREDICTED: LOW QUALITY PROTEIN: thiol protease SEN102-like [Glycine
           max]
          Length = 439

 Score =  469 bits (1206), Expect = e-129,   Method: Compositional matrix adjust.
 Identities = 231/338 (68%), Positives = 264/338 (78%), Gaps = 5/338 (1%)

Query: 10  LVLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEY 69
           + LA +L     A Q    TL DA+M ERHE WM ++G+VY+D  E+E RF+IF ENV Y
Sbjct: 106 ISLAMLLCTAFLAFQVTCCTLQDASMYERHEQWMTRHGKVYKDPREREKRFRIFNENVNY 165

Query: 70  IASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENAS-V 128
           + +FNN A NKPYKLGIN+F D TN+EF APRN +K  + S     TT   F+YEN + V
Sbjct: 166 VEAFNNAA-NKPYKLGINQFXDLTNQEFIAPRNRFKGHMCSSIIRTTT---FKYENVTTV 221

Query: 129 PASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGE 188
           P+++DWR+ GAVT VKDQGQCGCCWAFSAVAA EGI+ ++  KL SLSEQELVDCDT G 
Sbjct: 222 PSTVDWRQNGAVTPVKDQGQCGCCWAFSAVAATEGIHALSGGKLISLSEQELVDCDTKGV 281

Query: 189 DQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNE 248
           DQGCEGGLMDDA++FII N GL TEA YPYK  DG CN  EA   AA I+GYEDVP+NNE
Sbjct: 282 DQGCEGGLMDDAYKFIIQNHGLNTEANYPYKGVDGKCNANEAANHAATITGYEDVPANNE 341

Query: 249 AALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLV 308
            AL KAVANQPVSVAIDAS SDFQFY SG FTG CGTELDHGVTAVGYG +D GTKYWLV
Sbjct: 342 KALQKAVANQPVSVAIDASSSDFQFYKSGAFTGSCGTELDHGVTAVGYGVSDHGTKYWLV 401

Query: 309 KNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPTA 346
           KNSWGT WGE GYIRMQR +D++EG+CGIAMQASYPTA
Sbjct: 402 KNSWGTEWGEEGYIRMQRGVDSEEGVCGIAMQASYPTA 439


>gi|356517426|ref|XP_003527388.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like [Glycine
           max]
          Length = 343

 Score =  466 bits (1200), Expect = e-129,   Method: Compositional matrix adjust.
 Identities = 231/338 (68%), Positives = 264/338 (78%), Gaps = 5/338 (1%)

Query: 10  LVLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEY 69
           + LA +  +G  A Q   RTL DA+M ERH  WMA+Y +VY+D  E+E RF+IFKENV Y
Sbjct: 10  ISLALLFCMGFLAFQVTCRTLQDASMYERHAQWMARYAKVYKDPQEREKRFRIFKENVNY 69

Query: 70  IASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENASV- 128
           I +FN+ A NK YKL IN+FAD TNEEF APRN +K  + S   S T   +F+YEN +V 
Sbjct: 70  IETFNS-ADNKSYKLDINQFADLTNEEFIAPRNRFKGHMCS---SITRTTTFKYENVTVI 125

Query: 129 PASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGE 188
           P+++DWR+KGAVT +KDQGQCGCCWAFSAVAA EGI+ +   KL SLSEQE+VDCDT G+
Sbjct: 126 PSTVDWRQKGAVTPIKDQGQCGCCWAFSAVAATEGIHALNAGKLISLSEQEVVDCDTKGQ 185

Query: 189 DQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNE 248
           DQGC GG MD AF+FII N GL TE  YPYKA+DG CN K A   AA I+GYEDVP NNE
Sbjct: 186 DQGCAGGFMDGAFKFIIQNHGLNTEPNYPYKAADGKCNAKAAANHAATITGYEDVPVNNE 245

Query: 249 AALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLV 308
            AL KAVANQPVSVAIDASGSDFQFY SGVFTG CGTELDHGVTAVGYG + DGT+YWLV
Sbjct: 246 KALQKAVANQPVSVAIDASGSDFQFYKSGVFTGSCGTELDHGVTAVGYGVSADGTEYWLV 305

Query: 309 KNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPTA 346
           KNSWGT WGE GYIRMQR + A+EGLCGIAM ASYPTA
Sbjct: 306 KNSWGTEWGEEGYIRMQRGVKAEEGLCGIAMMASYPTA 343


>gi|297740489|emb|CBI30671.3| unnamed protein product [Vitis vinifera]
          Length = 320

 Score =  466 bits (1199), Expect = e-129,   Method: Compositional matrix adjust.
 Identities = 238/344 (69%), Positives = 268/344 (77%), Gaps = 25/344 (7%)

Query: 4   ILLENKLVLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIF 63
           + LE+K++   +L++GVWA Q+ SRTL++ +M+ERHE WM  YGR Y+D AEKE RFKIF
Sbjct: 1   MALESKIICITLLIMGVWASQALSRTLHEVSMSERHEDWMGLYGRTYKDIAEKERRFKIF 60

Query: 64  KENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRY 123
           KENVEYI S N                     +F+A RNGY       RSSE T  SFRY
Sbjct: 61  KENVEYIESVN---------------------KFKASRNGYNMS-SRPRSSEIT--SFRY 96

Query: 124 EN-ASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVD 182
           EN A+VP+S+DWRKKGAVT +KDQGQCGCCWAFSAVAAMEG+  + T +L SLSEQELVD
Sbjct: 97  ENVAAVPSSMDWRKKGAVTPIKDQGQCGCCWAFSAVAAMEGVTQLKTGELISLSEQELVD 156

Query: 183 CDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYED 242
           CDTSGEDQGC GGLMD AFEFII N GL TEA YPYK  D +CNKK+A  SAAKI  YED
Sbjct: 157 CDTSGEDQGCGGGLMDSAFEFIIGNGGLTTEANYPYKGVDATCNKKKAASSAAKIKNYED 216

Query: 243 VPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDG 302
           VP+N+EAAL+KAVA  PVSVAIDA GSDFQFYSSGVFTGQCGTELDHGVTAVGYG  DDG
Sbjct: 217 VPANSEAALLKAVAQHPVSVAIDAGGSDFQFYSSGVFTGQCGTELDHGVTAVGYGKTDDG 276

Query: 303 TKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPTA 346
           TKYWLVKNSWGT WGE+GYI M+RDI A EGLCGIAM+ASYPTA
Sbjct: 277 TKYWLVKNSWGTGWGEDGYIWMERDIGADEGLCGIAMEASYPTA 320


>gi|224162986|ref|XP_002338508.1| predicted protein [Populus trichocarpa]
 gi|222872535|gb|EEF09666.1| predicted protein [Populus trichocarpa]
          Length = 306

 Score =  465 bits (1196), Expect = e-128,   Method: Compositional matrix adjust.
 Identities = 224/313 (71%), Positives = 258/313 (82%), Gaps = 8/313 (2%)

Query: 35  MNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTN 94
           M ERHE WM QYGRVY+D+ E+  R+ IFKENV  I +FN++   K YKLG+N+FAD TN
Sbjct: 1   MYERHEQWMTQYGRVYKDDNERATRYSIFKENVARIDAFNSQT-GKSYKLGVNQFADLTN 59

Query: 95  EEFRAPRNGYKRRLPSVRSSETTDVSFRYENAS-VPASIDWRKKGAVTGVKDQGQCGCCW 153
           EEF+A RN +K  + S ++       FRYEN S VP+++DWRK+GAVT VKDQGQCGCCW
Sbjct: 60  EEFKASRNRFKGHMCSPQAG-----PFRYENVSAVPSTVDWRKEGAVTPVKDQGQCGCCW 114

Query: 154 AFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATE 213
           AFSAVAAMEGIN +TT KL SLSEQE+VDCDT GEDQGC GGLMDDAF+FI  NKGL TE
Sbjct: 115 AFSAVAAMEGINKLTTGKLISLSEQEVVDCDTKGEDQGCNGGLMDDAFKFIEQNKGLTTE 174

Query: 214 AKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQF 273
           A YPYK +DG+CN K++   AAKI+G+EDVP+N+EAALMKAVA QPVSVAIDA GSDFQF
Sbjct: 175 ANYPYKGTDGTCNTKKSAIHAAKITGFEDVPANSEAALMKAVAKQPVSVAIDAGGSDFQF 234

Query: 274 YSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEG 333
           YSSG+FTG C T+LDHGVTAVGYG + DG+KYWLVKNSWG  WGE GYIRMQ+DI AKEG
Sbjct: 235 YSSGIFTGSCDTQLDHGVTAVGYGVS-DGSKYWLVKNSWGAQWGEEGYIRMQKDISAKEG 293

Query: 334 LCGIAMQASYPTA 346
           LCGIAMQASYPTA
Sbjct: 294 LCGIAMQASYPTA 306


>gi|356539398|ref|XP_003538185.1| PREDICTED: vignain-like [Glycine max]
          Length = 343

 Score =  461 bits (1187), Expect = e-127,   Method: Compositional matrix adjust.
 Identities = 227/343 (66%), Positives = 262/343 (76%), Gaps = 6/343 (1%)

Query: 5   LLENKLVLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFK 64
           +L     LA  LV    A +  +RTL DA M ERHE WMA +G+VY  + EKE +++ FK
Sbjct: 6   VLFQYFTLALCLVFAFCAFEGNARTLEDAPMRERHEQWMAIHGKVYTHSYEKEQKYQTFK 65

Query: 65  ENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYE 124
           ENV+ I +FN+ A NKPYKLGIN FAD TNEEF+A      R    V S  T   +FRYE
Sbjct: 66  ENVQRIEAFNH-AGNKPYKLGINHFADLTNEEFKA----INRFKGHVCSKITRTPTFRYE 120

Query: 125 N-ASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDC 183
           N  +VPA++DWR++GAVT +KDQGQCGCCWAFSAVAA EGI  ++T KL SLSEQELVDC
Sbjct: 121 NMTAVPATLDWRQEGAVTPIKDQGQCGCCWAFSAVAATEGITKLSTGKLISLSEQELVDC 180

Query: 184 DTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDV 243
           DT G DQGCEGGLMDDAF+FI+ NKGLA EA YPY+  DG+CN K     A  I GYEDV
Sbjct: 181 DTKGVDQGCEGGLMDDAFKFILQNKGLAAEAIYPYEGVDGTCNAKAEGNHATSIKGYEDV 240

Query: 244 PSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGT 303
           P+N+E+AL+KAVANQPVSVAI+ASG +FQFYS GVFTG CGT LDHGVTAVGYG +DDGT
Sbjct: 241 PANSESALLKAVANQPVSVAIEASGFEFQFYSGGVFTGSCGTNLDHGVTAVGYGVSDDGT 300

Query: 304 KYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPTA 346
           KYWLVKNSWG  WG+ GYIRMQRD+ AKEGLCGIAM ASYP A
Sbjct: 301 KYWLVKNSWGVKWGDKGYIRMQRDVAAKEGLCGIAMLASYPNA 343


>gi|356543116|ref|XP_003540009.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like [Glycine
           max]
          Length = 337

 Score =  457 bits (1175), Expect = e-126,   Method: Compositional matrix adjust.
 Identities = 223/338 (65%), Positives = 262/338 (77%), Gaps = 12/338 (3%)

Query: 11  VLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYI 70
           +LA +L+L +   Q  SR L++A+M+ERHE WM +YG+VY+D AEK+ R  IFK+NVE+I
Sbjct: 10  ILALVLLLSICTSQVMSRYLHEASMSERHEQWMKKYGKVYKDAAEKQKRLLIFKDNVEFI 69

Query: 71  ASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENAS-VP 129
            SFN  A NKPYKLGIN  ADQTNEEF A  NGYK +      +  +   F+YEN + VP
Sbjct: 70  ESFN-AAGNKPYKLGINHLADQTNEEFVASHNGYKHK------ASHSQTPFKYENVTGVP 122

Query: 130 ASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGED 189
            ++DWR+ GAVT VKDQGQCG CWAFS VAA EGI  ITT  L SLSEQELVDCD+   D
Sbjct: 123 NAVDWRENGAVTAVKDQGQCGSCWAFSTVAATEGIYQITTSMLMSLSEQELVDCDSV--D 180

Query: 190 QGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCN-KKEANPSAAKISGYEDVPSNNE 248
            GC+GG M+  FEFII N G+++EA YPY A DG+C+  KEA+P AA+I GYE VP+N+E
Sbjct: 181 HGCDGGYMEGGFEFIIKNGGISSEANYPYTAVDGTCDANKEASP-AAQIKGYETVPANSE 239

Query: 249 AALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLV 308
            AL KAVANQPVSV IDA GS FQFYSSGVFTGQCGT+LDHGVTAVGYG+ DDGT+YW+V
Sbjct: 240 DALQKAVANQPVSVTIDAGGSAFQFYSSGVFTGQCGTQLDHGVTAVGYGSTDDGTQYWIV 299

Query: 309 KNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPTA 346
           KNSWGT WGE GYIRMQR  DA+EGLCGIAM ASYPTA
Sbjct: 300 KNSWGTQWGEEGYIRMQRGTDAQEGLCGIAMDASYPTA 337


>gi|47524507|gb|AAT34987.1| putative cysteine protease [Gossypium hirsutum]
          Length = 344

 Score =  456 bits (1172), Expect = e-126,   Method: Compositional matrix adjust.
 Identities = 232/351 (66%), Positives = 268/351 (76%), Gaps = 14/351 (3%)

Query: 3   MILLENKLVLAAILVLGVWAPQ--SWSRTLNDATMNERHEMWMAQYGRVYRDNAE--KEM 58
           M LL+  L +A +L    ++ Q    SR L D   + RHE WM+Q+GRVY D  E  K  
Sbjct: 1   MALLQIFLFVALVLSF-CFSIQLAGLSRPLLDED-SMRHEEWMSQHGRVYADEQEDHKNK 58

Query: 59  RFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTD 118
           RF +FKENVE I  FN+    K +KL IN+FAD TNEEFRA  NG+K   P V SS+ T 
Sbjct: 59  RFNVFKENVERIEEFND---GKTFKLAINQFADLTNEEFRASYNGFKG--PMVLSSQITK 113

Query: 119 -VSFRYENAS--VPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSL 175
              FRYEN S  +P S+DWRKKGAVT VK+QGQCGCCWAFSAVAA+EGI  I+T KL SL
Sbjct: 114 PTPFRYENVSSALPVSVDWRKKGAVTPVKNQGQCGCCWAFSAVAAIEGITQISTGKLISL 173

Query: 176 SEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAA 235
           SEQELVDCDT G D GCEGGLMD AFEFII+N GL TE+ YPYK  DG+CN  + NP A 
Sbjct: 174 SEQELVDCDTKGIDHGCEGGLMDTAFEFIINNGGLTTESNYPYKGEDGTCNFNKTNPIAV 233

Query: 236 KISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVG 295
            I+GYEDVP+N+E ALMKAVA+QPVSVAI+A GSDFQFYSSGVFTG+CGTELDH VTAVG
Sbjct: 234 SITGYEDVPANDEQALMKAVAHQPVSVAIEAGGSDFQFYSSGVFTGECGTELDHAVTAVG 293

Query: 296 YGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPTA 346
           YG ++DG+KYW+VKNSWGT WGE+GYI MQ+DI  K+GLCGIAMQASYPTA
Sbjct: 294 YGESEDGSKYWIVKNSWGTKWGESGYIEMQKDIKVKQGLCGIAMQASYPTA 344


>gi|302143412|emb|CBI21973.3| unnamed protein product [Vitis vinifera]
          Length = 320

 Score =  455 bits (1171), Expect = e-125,   Method: Compositional matrix adjust.
 Identities = 225/347 (64%), Positives = 262/347 (75%), Gaps = 28/347 (8%)

Query: 1   MAMILLENKLVLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRF 60
           MA +     + LA + VL  WA Q+ +R L++A+M ERHE WM QYGR Y+D  EK  R+
Sbjct: 1   MASVNQYQYICLALLFVLAAWASQATARNLHEASMYERHEDWMVQYGREYKDADEKSKRY 60

Query: 61  KIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVS 120
           KIFK+NV  I SFN KA +K YKL INEFAD TNEEFRA RN +K  + S  ++     S
Sbjct: 61  KIFKDNVARIESFN-KAMDKSYKLSINEFADLTNEEFRASRNRFKAHICSTEAT-----S 114

Query: 121 FRYENAS-VPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQE 179
           F+YEN + VP+++DWRKKGAVT +KDQGQCG CWAFSAVAAMEGI  ++T KL SLSEQE
Sbjct: 115 FKYENVTAVPSTVDWRKKGAVTPIKDQGQCGSCWAFSAVAAMEGITQLSTGKLISLSEQE 174

Query: 180 LVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISG 239
           LVDCDTSGEDQGC                       YPY  +DG+CN+K+A   AAKI+G
Sbjct: 175 LVDCDTSGEDQGC---------------------TNYPYAGTDGTCNRKKAAHPAAKING 213

Query: 240 YEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTA 299
           YEDVP+NNE AL KAVA+QP++VAIDA GS+FQFYSSGVFTGQCGTELDHGV+AVGYGT+
Sbjct: 214 YEDVPANNEKALQKAVAHQPIAVAIDAGGSEFQFYSSGVFTGQCGTELDHGVSAVGYGTS 273

Query: 300 DDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPTA 346
           DDG KYWLVKNSWGT WGE GYIRMQRD+ AKEGLCGIAMQASYPTA
Sbjct: 274 DDGMKYWLVKNSWGTGWGEEGYIRMQRDVTAKEGLCGIAMQASYPTA 320


>gi|302143411|emb|CBI21972.3| unnamed protein product [Vitis vinifera]
          Length = 320

 Score =  455 bits (1170), Expect = e-125,   Method: Compositional matrix adjust.
 Identities = 225/347 (64%), Positives = 262/347 (75%), Gaps = 28/347 (8%)

Query: 1   MAMILLENKLVLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRF 60
           MA +     + LA + VL  WA Q+ +R+L++A+M ERHE WM QYGR Y+D  EK  R+
Sbjct: 1   MASVNQYQYICLALLFVLAAWASQATARSLHEASMYERHEDWMVQYGREYKDADEKSKRY 60

Query: 61  KIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVS 120
           KIFK+NV  I SFN KA +K YKL INEFAD TNEEFRA RN +K  + S  ++     S
Sbjct: 61  KIFKDNVARIESFN-KAMDKSYKLSINEFADLTNEEFRASRNRFKAHICSTEAT-----S 114

Query: 121 FRYENAS-VPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQE 179
           F+YEN + VP+++DWRKKGAVT +KDQGQCG CWAFSAVAAMEGI  ++T KL SLSEQE
Sbjct: 115 FKYENVTAVPSTVDWRKKGAVTPIKDQGQCGSCWAFSAVAAMEGITQLSTGKLISLSEQE 174

Query: 180 LVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISG 239
           LVDCDTSGEDQGC                       YPY  +DG+CN+K+A   AAKI+G
Sbjct: 175 LVDCDTSGEDQGC---------------------TNYPYAGTDGTCNRKKAAHPAAKING 213

Query: 240 YEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTA 299
           YEDVP+NNE AL KAVA+QP++VAIDASGS+FQFYSSGVFTGQCGTELDHGV AVGYGT+
Sbjct: 214 YEDVPANNEKALQKAVAHQPIAVAIDASGSEFQFYSSGVFTGQCGTELDHGVAAVGYGTS 273

Query: 300 DDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPTA 346
           DDG KYWLVKNSW T WGE GYIRMQRD+ AKEGLCGIAMQASYPTA
Sbjct: 274 DDGMKYWLVKNSWSTGWGEEGYIRMQRDVTAKEGLCGIAMQASYPTA 320


>gi|302143415|emb|CBI21976.3| unnamed protein product [Vitis vinifera]
          Length = 322

 Score =  455 bits (1170), Expect = e-125,   Method: Compositional matrix adjust.
 Identities = 226/347 (65%), Positives = 261/347 (75%), Gaps = 26/347 (7%)

Query: 1   MAMILLENKLVLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRF 60
           MA +     + LA + VL  WA Q+ +R L++A+M ERHE WMAQYGRVY+D  EK  R+
Sbjct: 1   MASVNQYQYICLALLFVLAAWASQATARNLHEASMYERHEDWMAQYGRVYKDADEKSKRY 60

Query: 61  KIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVS 120
           KIFK+NV  I SFN KA +K YKL INEFAD TNEEF   RN +K  + S  ++     S
Sbjct: 61  KIFKDNVARIESFN-KAMDKSYKLSINEFADLTNEEFGTSRNRFKAHICSTEAT-----S 114

Query: 121 FRYENAS-VPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQE 179
           F+YEN + VP++IDWRKKGAVT +KDQGQCG CWAFSAVAAMEGI  ++T KL SLSEQE
Sbjct: 115 FKYENVTAVPSTIDWRKKGAVTPIKDQGQCGSCWAFSAVAAMEGITQLSTGKLISLSEQE 174

Query: 180 LVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISG 239
           LVDCDTSGEDQGC G                   A YPY  +DG+CN+K+A   AAKI+G
Sbjct: 175 LVDCDTSGEDQGCNG-------------------ANYPYAGTDGTCNRKKAAHPAAKING 215

Query: 240 YEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTA 299
           YEDVP+NNE AL KAV +QP++VAIDA G +FQFYSSGVFTGQCGTELDHGV AVGYGT+
Sbjct: 216 YEDVPANNEKALQKAVVHQPIAVAIDAGGFEFQFYSSGVFTGQCGTELDHGVAAVGYGTS 275

Query: 300 DDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPTA 346
           DDG KYWLVKNSWGT WGE GYIRMQRD+ AKEGLCGIAMQASYPTA
Sbjct: 276 DDGMKYWLVKNSWGTGWGEEGYIRMQRDVTAKEGLCGIAMQASYPTA 322


>gi|356543124|ref|XP_003540013.1| PREDICTED: vignain-like [Glycine max]
 gi|356543126|ref|XP_003540014.1| PREDICTED: vignain-like [Glycine max]
          Length = 337

 Score =  453 bits (1166), Expect = e-125,   Method: Compositional matrix adjust.
 Identities = 221/338 (65%), Positives = 260/338 (76%), Gaps = 12/338 (3%)

Query: 11  VLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYI 70
           +LA +L+L +   Q  SR L++A+M+ERHE WM +YG+VY+D AEK+ R  IFK+NVE+I
Sbjct: 10  ILALVLLLSICTSQVMSRNLHEASMSERHEQWMKKYGKVYKDAAEKQKRLLIFKDNVEFI 69

Query: 71  ASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENAS-VP 129
            SFN  A N+PYKL IN  ADQTNEEF A  NGYK +         +   F+YEN + VP
Sbjct: 70  ESFN-AAGNRPYKLSINHLADQTNEEFVASHNGYKHK------GSHSQTPFKYENVTGVP 122

Query: 130 ASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGED 189
            ++DWR+ GAVT VKDQGQCG CWAFS VAA EGI  ITT  L SLSEQELVDCD+   D
Sbjct: 123 NAVDWRENGAVTAVKDQGQCGSCWAFSTVAATEGIYQITTSMLMSLSEQELVDCDSV--D 180

Query: 190 QGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCN-KKEANPSAAKISGYEDVPSNNE 248
            GC+GG M+  FEFII N G+++EA YPY A DG+C+  KEA+P AA+I GYE VP+N+E
Sbjct: 181 HGCDGGYMEGGFEFIIKNGGISSEANYPYTAVDGTCDANKEASP-AAQIKGYETVPANSE 239

Query: 249 AALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLV 308
            AL KAVANQPVSV IDA GS FQFYSSGVFTGQCGT+LDHGVTAVGYG+ DDGT+YW+V
Sbjct: 240 DALQKAVANQPVSVTIDAGGSAFQFYSSGVFTGQCGTQLDHGVTAVGYGSTDDGTQYWIV 299

Query: 309 KNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPTA 346
           KNSWGT WGE GYIRMQR  DA+EGLCGIAM ASYPTA
Sbjct: 300 KNSWGTQWGEEGYIRMQRGTDAQEGLCGIAMDASYPTA 337


>gi|144905112|dbj|BAF56429.1| cysteine proteinase [Lotus japonicus]
          Length = 341

 Score =  452 bits (1162), Expect = e-124,   Method: Compositional matrix adjust.
 Identities = 227/349 (65%), Positives = 262/349 (75%), Gaps = 11/349 (3%)

Query: 1   MAMILLENKLVLAAILVLGVWAPQSWSRTLNDA--TMNERHEMWMAQYGRVYRDNAEKEM 58
           MA    + + +LA  L+L V   +  SR L++   ++ ERHE WMA+Y +VY+D AEKE 
Sbjct: 1   MASSTRQKQYILALFLLLAVGISRVISRELHETETSLIERHEQWMAKYDKVYKDAAEKEK 60

Query: 59  RFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTD 118
           RF IFK+NVE+I SFN  A NKPYKLG+N  AD T EEF+A RNG KR        E   
Sbjct: 61  RFLIFKDNVEFIESFN-AAGNKPYKLGVNHLADLTIEEFKASRNGLKRSY----DYEVGT 115

Query: 119 VSFRYENAS-VPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSE 177
            SF+YEN + +PAS+DWRKKGAVT +KDQGQCG CWAFS VAA EGI+ I+T KL SLSE
Sbjct: 116 TSFKYENVTAIPASVDWRKKGAVTPIKDQGQCGSCWAFSTVAATEGIHKISTGKLVSLSE 175

Query: 178 QELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKI 237
           QELVDCD  G DQGCEGG M+D FEFII N G+ TEA YPYKA DGSC  K A   AA+I
Sbjct: 176 QELVDCDRKGTDQGCEGGYMEDGFEFIIKNGGITTEANYPYKAVDGSC--KNATAPAAQI 233

Query: 238 SGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYG 297
            GYE VP N+E AL+KAVANQPVSV+IDA+   F FYSSG+FTG+CGTELDHGVTAVGYG
Sbjct: 234 KGYEKVPVNSEKALLKAVANQPVSVSIDAADGSFMFYSSGIFTGECGTELDHGVTAVGYG 293

Query: 298 TADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPTA 346
            A +GT YW+VKNSWGT WGE GYIRMQR I AKEGLCGIAM +SYPTA
Sbjct: 294 RA-NGTDYWIVKNSWGTVWGEQGYIRMQRGIAAKEGLCGIAMDSSYPTA 341


>gi|318136892|gb|ADV41672.1| cysteine protease [Nicotiana tabacum]
          Length = 349

 Score =  452 bits (1162), Expect = e-124,   Method: Compositional matrix adjust.
 Identities = 217/350 (62%), Positives = 265/350 (75%), Gaps = 5/350 (1%)

Query: 1   MAMILLENKLVLAAILV-LGVWAPQ-SWSRTLN-DATMNERHEMWMAQYGRVYRDNAEKE 57
           MA   L   L LA   + LGVW  Q + SR +N +A+M  RH+ W+A + +VY+D  EKE
Sbjct: 1   MAFANLSQYLCLALFFIFLGVWRSQVASSRPINYEASMRARHDQWIAHHDKVYKDLNEKE 60

Query: 58  MRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETT 117
           MRFKIFKENVE I +FN    +K YKLG+N+F+D TNE+FR    GYKR  P V SS   
Sbjct: 61  MRFKIFKENVERIEAFN-AGEDKGYKLGVNKFSDLTNEKFRVLHTGYKRSHPKVMSSSKP 119

Query: 118 DVSFRYENAS-VPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLS 176
              FRY N + +P ++DWRKKGAVT +KDQ +CGCCWAFSAVAA EG++ + T KL  LS
Sbjct: 120 KTHFRYANVTDIPPTMDWRKKGAVTPIKDQKECGCCWAFSAVAATEGLHQLKTGKLIPLS 179

Query: 177 EQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAK 236
           EQELVDCD  GED+GC GGL+D AF+FI+ NKGL TEA YPYK  DG CNKK++  SAAK
Sbjct: 180 EQELVDCDVEGEDEGCSGGLLDTAFDFILKNKGLTTEANYPYKGEDGVCNKKKSALSAAK 239

Query: 237 ISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGY 296
           I+GYEDVP+N+E AL++AVANQPVSVAID S  DFQFYSSGVF+G C T L+H VTAVGY
Sbjct: 240 IAGYEDVPANSEKALLQAVANQPVSVAIDGSSFDFQFYSSGVFSGSCSTWLNHAVTAVGY 299

Query: 297 GTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPTA 346
           G   DGTKYW++KNSWG+ WG++GY+R++RD+  KEGLCG+AM ASYPTA
Sbjct: 300 GATTDGTKYWIIKNSWGSKWGDSGYMRIKRDVHEKEGLCGLAMDASYPTA 349


>gi|357452075|ref|XP_003596314.1| Cysteine proteinase [Medicago truncatula]
 gi|355485362|gb|AES66565.1| Cysteine proteinase [Medicago truncatula]
          Length = 341

 Score =  451 bits (1159), Expect = e-124,   Method: Compositional matrix adjust.
 Identities = 228/347 (65%), Positives = 266/347 (76%), Gaps = 10/347 (2%)

Query: 4   ILLENKL---VLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRF 60
           +++ N+L     A  L LG+ + Q+ SRTL +  M E HE WM Q+G+VY+   EK+ RF
Sbjct: 1   MVMNNQLHYIPFALFLCLGLLSFQATSRTLQNDPMYEMHEQWMVQHGKVYKAAHEKQKRF 60

Query: 61  KIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVS 120
            IFKENV YI +FNN   NK YKLG+N FAD TN EF A RN +   L       +   +
Sbjct: 61  GIFKENVNYIEAFNNVG-NKSYKLGLNHFADLTNHEFIAARNKFNGYLHG-----SIITT 114

Query: 121 FRYENAS-VPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQE 179
           F+Y+N S VP+++DWR++GAVT VK+QGQCGCCWAFSAVA+ EGI+ +TT  L SLSEQE
Sbjct: 115 FKYKNVSDVPSAVDWRQEGAVTPVKNQGQCGCCWAFSAVASTEGIHKLTTGNLVSLSEQE 174

Query: 180 LVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISG 239
           LVDCDT+GEDQGCEGGLMDDAFEFII N GL+TEA+YPY+  DG+CNK E   SAA ISG
Sbjct: 175 LVDCDTNGEDQGCEGGLMDDAFEFIIQNNGLSTEAEYPYQGVDGTCNKTEVGSSAATISG 234

Query: 240 YEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTA 299
           YE+VP N+E AL KAVANQPVSVAIDASGSDFQFY SGVFTG CGTELDHGV  VGYG  
Sbjct: 235 YENVPVNDEQALQKAVANQPVSVAIDASGSDFQFYKSGVFTGSCGTELDHGVAVVGYGVG 294

Query: 300 DDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPTA 346
           +D T+YWLVKNSWGT WGE GYIRMQR +DA EGLCGIAMQ SYPTA
Sbjct: 295 EDETEYWLVKNSWGTQWGEEGYIRMQRGVDASEGLCGIAMQPSYPTA 341


>gi|356545118|ref|XP_003540992.1| PREDICTED: thiol protease SEN102-like [Glycine max]
          Length = 337

 Score =  450 bits (1158), Expect = e-124,   Method: Compositional matrix adjust.
 Identities = 220/342 (64%), Positives = 256/342 (74%), Gaps = 12/342 (3%)

Query: 7   ENKLVLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKEN 66
           + +  +A  L+L +  PQ  SR L++ +M ERHE WMA+YG+VY+D AEKE RF IFK N
Sbjct: 6   QKQYTIALFLLLALGIPQMMSRKLHETSMRERHEQWMAEYGKVYKDAAEKEKRFLIFKHN 65

Query: 67  VEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENA 126
           VE+I SFN  A NKPYKLG+N  AD T EEF+A RNG KR        E +   F+YEN 
Sbjct: 66  VEFIESFN-AAANKPYKLGVNHLADLTVEEFKASRNGLKRPY------ELSTTPFKYENV 118

Query: 127 S-VPASIDWRKKGAVTGVKDQGQC-GCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCD 184
           + +PA+IDWR KGAVT +KDQGQC G CWAFS VAA EGI+ ITT KL SLSEQELVDCD
Sbjct: 119 TAIPAAIDWRTKGAVTSIKDQGQCAGSCWAFSTVAATEGIHQITTGKLVSLSEQELVDCD 178

Query: 185 TSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVP 244
           T G DQGCEGG M+D FEFII N G+ +EA YPYKA DG CNK  A    A+I GYE VP
Sbjct: 179 TKGVDQGCEGGYMEDGFEFIIKNGGITSEANYPYKAVDGKCNK--ATSPVAQIKGYEKVP 236

Query: 245 SNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTK 304
            N+E  L KAVANQPVSV+IDA+G  F FYSSG++ G+CGTELDHGVTAVGYG A +GT 
Sbjct: 237 PNSEKTLQKAVANQPVSVSIDANGEGFMFYSSGIYNGECGTELDHGVTAVGYGIA-NGTD 295

Query: 305 YWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPTA 346
           YWLVKNSWGT WGE GY+RMQR + AK GLCGIA+ +SYPTA
Sbjct: 296 YWLVKNSWGTQWGEKGYVRMQRGVAAKHGLCGIALDSSYPTA 337


>gi|40806498|gb|AAR92154.1| putative cysteine protease 1 [Iris x hollandica]
          Length = 340

 Score =  449 bits (1156), Expect = e-124,   Method: Compositional matrix adjust.
 Identities = 224/349 (64%), Positives = 267/349 (76%), Gaps = 12/349 (3%)

Query: 1   MAMILLENKLVLAAILVLGVWAPQSWS-RTLND-ATMNERHEMWMAQYGRVYRDNAEKEM 58
           MA       L   A+L++ +WA Q  + R+L +  +M ERHE WMAQ+GRVY++ AEK  
Sbjct: 1   MAAFKTVKLLPALALLIVAIWASQGEAGRSLGENKSMLERHEQWMAQHGRVYKNAAEKAH 60

Query: 59  RFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTD 118
           RF+IF+ NVE I SFN  A N  +KLG+N+FAD TNEEF+  RN  K   PS  +S    
Sbjct: 61  RFEIFRANVERIESFN--AENHKFKLGVNQFADLTNEEFKT-RNTLK---PSKMASTK-- 112

Query: 119 VSFRYENAS-VPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSE 177
            SF+YEN + VPA++DWR KGAVT +KDQGQCG CWAFSAVAA EGI  ++T KL SLSE
Sbjct: 113 -SFKYENVTAVPATMDWRTKGAVTPIKDQGQCGSCWAFSAVAATEGITKLSTGKLISLSE 171

Query: 178 QELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKI 237
           QE+VDCD + +DQGC GG MDDAFE+II NKG+ TEA YPYKA+DG+CN K+A   AA I
Sbjct: 172 QEVVDCDVTSDDQGCNGGEMDDAFEYIIKNKGITTEANYPYKAADGTCNTKKAASHAASI 231

Query: 238 SGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYG 297
           +GYEDV  N+EAAL+KA ANQP++VAIDA    FQ YSSGVFTG CGT+LDHGVT VGYG
Sbjct: 232 TGYEDVTVNSEAALLKAAANQPIAVAIDAGDFAFQMYSSGVFTGDCGTDLDHGVTLVGYG 291

Query: 298 TADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPTA 346
              DGTKYWLVKNSWGT+WGE+GYIRM+RD+DAKEGLCGIAM ASYPTA
Sbjct: 292 ATSDGTKYWLVKNSWGTSWGEDGYIRMERDVDAKEGLCGIAMDASYPTA 340


>gi|356515048|ref|XP_003526213.1| PREDICTED: vignain-like [Glycine max]
          Length = 350

 Score =  448 bits (1153), Expect = e-123,   Method: Compositional matrix adjust.
 Identities = 219/337 (64%), Positives = 260/337 (77%), Gaps = 13/337 (3%)

Query: 11  VLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYI 70
           +LA +L+L +   Q  SR L++A+M+ERHE WM +YG+VY+D AEK+ R  IFK+NVE+I
Sbjct: 10  ILALVLLLSICTSQVMSRNLHEASMSERHEQWMKKYGKVYKDAAEKQKRLLIFKDNVEFI 69

Query: 71  ASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENAS-VP 129
            SFN  A NKPYKL IN  ADQTNEEF A  NGYK +         +   F+Y N + +P
Sbjct: 70  ESFN-AAGNKPYKLSINHLADQTNEEFVASHNGYKYK------GSHSQTPFKYGNVTDIP 122

Query: 130 ASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGED 189
            ++DWR+ GAVT VKDQGQCG CWAFS VAA EGI  I+T  L SLSEQELVDCD+   D
Sbjct: 123 TAVDWRQNGAVTAVKDQGQCGSCWAFSTVAATEGIYQISTGMLMSLSEQELVDCDSV--D 180

Query: 190 QGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCN-KKEANPSAAKISGYEDVPSNNE 248
            GC+GGLM+D FEFII N G+++EA YPY A DG+C+  KEA+P AA+I GYE VP+N+E
Sbjct: 181 HGCDGGLMEDGFEFIIKNGGISSEANYPYTAVDGTCDASKEASP-AAQIKGYETVPANSE 239

Query: 249 AALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGT-KYWL 307
            AL +AVANQPVSV+IDA GS FQFYSSGVFTGQCGT+LDHGVT VGYGT DDGT +YW+
Sbjct: 240 EALQQAVANQPVSVSIDAGGSGFQFYSSGVFTGQCGTQLDHGVTVVGYGTTDDGTHEYWI 299

Query: 308 VKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
           VKNSWGT WGE GYIRMQR IDA+EGLCGIAM ASYP
Sbjct: 300 VKNSWGTQWGEEGYIRMQRGIDAQEGLCGIAMDASYP 336


>gi|84181681|gb|AAW78661.2| senescence-specific cysteine protease [Nicotiana tabacum]
          Length = 349

 Score =  447 bits (1151), Expect = e-123,   Method: Compositional matrix adjust.
 Identities = 213/350 (60%), Positives = 264/350 (75%), Gaps = 5/350 (1%)

Query: 1   MAMILLENKLVLAAILV-LGVWAPQ-SWSRTLN-DATMNERHEMWMAQYGRVYRDNAEKE 57
           MA   L   L LA   + LG+W+ Q + SR +N +ATM  RH+ W+  + +VY+D  EKE
Sbjct: 1   MAFANLSQYLCLALFFICLGLWSSQVALSRPINYEATMRARHDQWIVHHEKVYKDLNEKE 60

Query: 58  MRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETT 117
           +RF+IFKENVE I +FN    +K YKLG N+F+D TNEEFR    GYKR  P V +S   
Sbjct: 61  VRFQIFKENVERIEAFN-AGEDKGYKLGFNKFSDLTNEEFRVLHTGYKRSHPKVMTSSKG 119

Query: 118 DVSFRYENAS-VPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLS 176
              FRY N + +P ++DWRKKGAVT +KDQ +CGCCWAFSAVAAMEG++ + T +L  LS
Sbjct: 120 KTHFRYTNVTDIPPTMDWRKKGAVTPIKDQKECGCCWAFSAVAAMEGLHQLKTGELIPLS 179

Query: 177 EQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAK 236
           EQELVDCD  GED+GC GGL+D AF+FI+ NKGL TE  YPYK  DG CNKK++  SAAK
Sbjct: 180 EQELVDCDVEGEDEGCSGGLLDTAFDFILKNKGLTTEVNYPYKGEDGVCNKKKSALSAAK 239

Query: 237 ISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGY 296
           I+GYEDVP+N+E AL++AVANQPVSVAID S  DFQFYSSGVF+G C T L+H VTAVGY
Sbjct: 240 ITGYEDVPANSEKALLQAVANQPVSVAIDGSSFDFQFYSSGVFSGSCSTWLNHAVTAVGY 299

Query: 297 GTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPTA 346
           G   DGTKYW++KNSWG+ WG++GY+R++RD+  KEGLCG+AM ASYPTA
Sbjct: 300 GATTDGTKYWIIKNSWGSKWGDSGYMRIKRDVHEKEGLCGLAMDASYPTA 349


>gi|357458911|ref|XP_003599736.1| Cysteine proteinase [Medicago truncatula]
 gi|357474719|ref|XP_003607644.1| Cysteine proteinase [Medicago truncatula]
 gi|355488784|gb|AES69987.1| Cysteine proteinase [Medicago truncatula]
 gi|355508699|gb|AES89841.1| Cysteine proteinase [Medicago truncatula]
          Length = 340

 Score =  447 bits (1149), Expect = e-123,   Method: Compositional matrix adjust.
 Identities = 216/322 (67%), Positives = 253/322 (78%), Gaps = 10/322 (3%)

Query: 27  SRTLNDA-TMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLG 85
           SR L ++ ++ ERHE WM ++G+VY D  EKE RF IFK+NVE+I SFN  A N+PYKL 
Sbjct: 27  SRKLYESLSLQERHEQWMTEHGKVYEDAIEKEKRFMIFKDNVEFIESFN-AADNQPYKLS 85

Query: 86  INEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENAS-VPASIDWRKKGAVTGVK 144
           +N  AD T +EF+A RNGYK+        E T  SF+YEN + +PA++DWR KGAVT +K
Sbjct: 86  VNHLADLTLDEFKASRNGYKKI-----DREFTTTSFKYENVTAIPAAVDWRVKGAVTPIK 140

Query: 145 DQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFI 204
           DQGQCG CWAFS VAA EGIN ITT KL SLSEQELVDCDT GEDQGCEGGLM+D FEFI
Sbjct: 141 DQGQCGSCWAFSTVAATEGINQITTGKLVSLSEQELVDCDTKGEDQGCEGGLMEDGFEFI 200

Query: 205 ISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAI 264
           I N G+ +E  YPYKA+DGSCN     P  AKI+GYE VP N+E +L+KAVANQP+SV+I
Sbjct: 201 IKNGGITSETNYPYKAADGSCNTATTTP-VAKITGYEKVPVNSEKSLLKAVANQPISVSI 259

Query: 265 DASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRM 324
           DAS S F FYSSG++TG+CGTELDHGVTAVGYG+A +GT YW+VKNSWGT WGE GYIRM
Sbjct: 260 DASDSSFMFYSSGIYTGECGTELDHGVTAVGYGSA-NGTDYWIVKNSWGTVWGEKGYIRM 318

Query: 325 QRDIDAKEGLCGIAMQASYPTA 346
           QR I AKEGLCGIAM +SYPTA
Sbjct: 319 QRGIAAKEGLCGIAMDSSYPTA 340


>gi|357474725|ref|XP_003607647.1| Cysteine proteinase [Medicago truncatula]
 gi|355508702|gb|AES89844.1| Cysteine proteinase [Medicago truncatula]
          Length = 340

 Score =  443 bits (1140), Expect = e-122,   Method: Compositional matrix adjust.
 Identities = 214/322 (66%), Positives = 253/322 (78%), Gaps = 10/322 (3%)

Query: 27  SRTLNDA-TMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLG 85
           SR L ++ ++ ERHE WM++YG++Y+D  EKE RF IFK+NVE+I SFN  A NKPYKL 
Sbjct: 27  SRKLYESPSLQERHEQWMSEYGKLYKDAIEKEKRFMIFKDNVEFIESFN-AADNKPYKLS 85

Query: 86  INEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENAS-VPASIDWRKKGAVTGVK 144
           +N  AD T +EF+A RNGYK+        E    SF+YEN + +P ++DWR KGAVT +K
Sbjct: 86  VNHLADLTLDEFKASRNGYKKI-----DREFATTSFKYENVTAIPEAVDWRVKGAVTPIK 140

Query: 145 DQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFI 204
           DQGQCG CWAFS VAA+EGIN ITT KL SLSEQELVDCDT GEDQGCEGGLM+D FEFI
Sbjct: 141 DQGQCGSCWAFSTVAAIEGINQITTGKLISLSEQELVDCDTKGEDQGCEGGLMEDGFEFI 200

Query: 205 ISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAI 264
           I N G+ +E  YPYKA+DGSCN     P  AKI+GYE VP N+E +L+KAVANQP+SV+I
Sbjct: 201 IKNGGITSETNYPYKAADGSCNTATTAP-VAKITGYEKVPVNSEISLLKAVANQPISVSI 259

Query: 265 DASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRM 324
           DAS S F FYSSG++TG+CGTELDHGVTAVGYG+A +GT YW+VKNSWGT WGE GYIRM
Sbjct: 260 DASDSSFMFYSSGIYTGECGTELDHGVTAVGYGSA-NGTDYWIVKNSWGTVWGEKGYIRM 318

Query: 325 QRDIDAKEGLCGIAMQASYPTA 346
           QR I  KEGLCGIAM +SYPTA
Sbjct: 319 QRGIADKEGLCGIAMDSSYPTA 340


>gi|302143416|emb|CBI21977.3| unnamed protein product [Vitis vinifera]
          Length = 297

 Score =  443 bits (1139), Expect = e-122,   Method: Compositional matrix adjust.
 Identities = 224/306 (73%), Positives = 250/306 (81%), Gaps = 11/306 (3%)

Query: 43  MAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRN 102
           MA+YGR+Y+D  EKE RFKIFK+NV  I SFN KA +K YKL INEFAD TNEEFR+ RN
Sbjct: 1   MARYGRMYKDANEKEKRFKIFKDNVARIESFN-KAMDKTYKLSINEFADLTNEEFRSLRN 59

Query: 103 GYKRRLPSVRSSETTDVSFRYENAS-VPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAM 161
            +K  +     SE T  +F+YEN + VP++IDWRKKGAVT +KDQ QCGCCWAFSAVAA 
Sbjct: 60  RFKAHI----CSEAT--TFKYENVTAVPSTIDWRKKGAVTPIKDQQQCGCCWAFSAVAAT 113

Query: 162 EGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKAS 221
           EGI  ITT KL SLSEQELVDCDT GE+QGC GGLMDDAF FI  + GLA+EA YPY+  
Sbjct: 114 EGITQITTGKLISLSEQELVDCDTGGENQGCSGGLMDDAFRFIKIH-GLASEATYPYEGD 172

Query: 222 DGSCN-KKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFT 280
           DG+CN KKEA+P AAKI GYEDVP+NNE AL KAVA+QPV+VAIDA G +FQFY+SGVFT
Sbjct: 173 DGTCNSKKEAHP-AAKIKGYEDVPANNEKALQKAVAHQPVAVAIDAGGFEFQFYTSGVFT 231

Query: 281 GQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQ 340
           GQCGTELDHGV AVGYG  DDG  YWLVKNSWGT WGE GYIRMQRD+ AKEGLCGIAMQ
Sbjct: 232 GQCGTELDHGVAAVGYGIGDDGMMYWLVKNSWGTGWGEEGYIRMQRDVTAKEGLCGIAMQ 291

Query: 341 ASYPTA 346
           ASYPTA
Sbjct: 292 ASYPTA 297


>gi|388512155|gb|AFK44139.1| unknown [Medicago truncatula]
          Length = 340

 Score =  441 bits (1135), Expect = e-121,   Method: Compositional matrix adjust.
 Identities = 213/322 (66%), Positives = 253/322 (78%), Gaps = 10/322 (3%)

Query: 27  SRTLNDA-TMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLG 85
           SR L ++ ++ ERHE WM++YG++Y+D  EKE RF IFK+NVE+I SFN  A NKPYKL 
Sbjct: 27  SRKLYESPSLQERHEQWMSEYGKLYKDAIEKEKRFMIFKDNVEFIESFN-AADNKPYKLS 85

Query: 86  INEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENAS-VPASIDWRKKGAVTGVK 144
           +N  AD T +EF+A RNGYK+        E    SF+YEN + +P ++DWR KGAVT +K
Sbjct: 86  VNHLADLTLDEFKASRNGYKKI-----DREFATTSFKYENVTAIPEAVDWRVKGAVTPIK 140

Query: 145 DQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFI 204
           DQGQCG CWAFS VAA+EGIN ITT KL SLSEQELVDCDT GEDQGCEGGLM+D FEFI
Sbjct: 141 DQGQCGSCWAFSTVAAIEGINQITTGKLISLSEQELVDCDTKGEDQGCEGGLMEDGFEFI 200

Query: 205 ISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAI 264
           I N G+ +E  YPYKA+DGSC+     P  AKI+GYE VP N+E +L+KAVANQP+SV+I
Sbjct: 201 IKNGGITSETNYPYKAADGSCSAATTAP-VAKITGYEKVPVNSEISLLKAVANQPISVSI 259

Query: 265 DASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRM 324
           DAS S F FYSSG++TG+CGTELDHGVTAVGYG+A +GT YW+VKNSWGT WGE GYIRM
Sbjct: 260 DASDSSFMFYSSGIYTGECGTELDHGVTAVGYGSA-NGTDYWIVKNSWGTVWGEKGYIRM 318

Query: 325 QRDIDAKEGLCGIAMQASYPTA 346
           QR I  KEGLCGIAM +SYPTA
Sbjct: 319 QRGIADKEGLCGIAMDSSYPTA 340


>gi|319826926|gb|ADV74756.1| cysteine protease [Lactuca sativa]
          Length = 363

 Score =  441 bits (1135), Expect = e-121,   Method: Compositional matrix adjust.
 Identities = 213/320 (66%), Positives = 247/320 (77%), Gaps = 4/320 (1%)

Query: 27  SRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGI 86
           SRTLND TM  RHE WMA +GR+Y D  EK++RF+IFK NV YI + N ++ ++ Y L +
Sbjct: 43  SRTLNDPTMIARHEQWMAHHGRIYTDENEKQLRFQIFKNNVAYIDAHNARS-DQSYTLEV 101

Query: 87  NEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENAS-VPASIDWRKKGAVTGVKD 145
           N+FAD TN+EFRA RNGYK++  S   S      FRY N S VP  +DWRK+GAVT VKD
Sbjct: 102 NKFADLTNDEFRASRNGYKKQPDS--DSHVVSGLFRYANVSAVPDEVDWRKEGAVTPVKD 159

Query: 146 QGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFII 205
           QG CGCCWAFSAVAAMEGIN +   KL SLSEQELVDCD  G DQGCEGGLM++AF+FI 
Sbjct: 160 QGDCGCCWAFSAVAAMEGINKLENGKLVSLSEQELVDCDIDGIDQGCEGGLMENAFQFIE 219

Query: 206 SNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAID 265
             KGLA E+ YPY   DG CN K+A   AAKISG+E VP+NNE AL++AVANQPVS+AID
Sbjct: 220 KRKGLAAESVYPYTGEDGICNTKKAAIPAAKISGHEKVPANNEKALLQAVANQPVSIAID 279

Query: 266 ASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQ 325
           ASG +FQFYS GVFTG CGTELDH +TAVGYG   DGTKYWL+KNSWG +WGENGYIR++
Sbjct: 280 ASGYEFQFYSGGVFTGSCGTELDHAITAVGYGATMDGTKYWLMKNSWGASWGENGYIRIK 339

Query: 326 RDIDAKEGLCGIAMQASYPT 345
           RD  AKEGLCGIAM  SYP 
Sbjct: 340 RDSLAKEGLCGIAMDPSYPV 359


>gi|356515036|ref|XP_003526207.1| PREDICTED: thiol protease SEN102-like [Glycine max]
          Length = 336

 Score =  440 bits (1132), Expect = e-121,   Method: Compositional matrix adjust.
 Identities = 211/324 (65%), Positives = 249/324 (76%), Gaps = 11/324 (3%)

Query: 24  QSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYK 83
           Q   R L++ +M ERHE WM +YG+VY+D AEK+ RF+IFK+NVE+I SFN    NKPYK
Sbjct: 23  QVMCRKLHETSMRERHEQWMTEYGKVYKDAAEKDKRFQIFKDNVEFIESFNADG-NKPYK 81

Query: 84  LGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENAS-VPASIDWRKKGAVTG 142
           LG+N  AD T EEF+A RNG+KR        E +  +F+YEN + +PA+IDWR KGAVT 
Sbjct: 82  LGVNHLADLTVEEFKASRNGFKR------PHEFSTTTFKYENVTAIPAAIDWRTKGAVTP 135

Query: 143 VKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFE 202
           +KDQGQCG CWAFS +AA EGI+ ITT KL SLSEQELVDCDT G DQGCEGG M+D FE
Sbjct: 136 IKDQGQCGSCWAFSTIAATEGIHQITTGKLVSLSEQELVDCDTKGVDQGCEGGYMEDGFE 195

Query: 203 FIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSV 262
           FII N G+ +E  YPYKA DG CNK  A    A+I GYE VP N+E AL KAVANQPVSV
Sbjct: 196 FIIKNGGITSETNYPYKAVDGKCNK--ATSPVAQIKGYEKVPPNSETALQKAVANQPVSV 253

Query: 263 AIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYI 322
           +IDA G+ F FYSSG++ G+CGTELDHGVTAVGYGTA +GT YW+VKNSWGT WGE GY+
Sbjct: 254 SIDADGAGFMFYSSGIYNGECGTELDHGVTAVGYGTA-NGTDYWIVKNSWGTQWGEKGYV 312

Query: 323 RMQRDIDAKEGLCGIAMQASYPTA 346
           RMQR I AK GLCGIA+ +SYPT+
Sbjct: 313 RMQRGIAAKHGLCGIALDSSYPTS 336


>gi|224083868|ref|XP_002307151.1| predicted protein [Populus trichocarpa]
 gi|222856600|gb|EEE94147.1| predicted protein [Populus trichocarpa]
          Length = 298

 Score =  440 bits (1132), Expect = e-121,   Method: Compositional matrix adjust.
 Identities = 218/313 (69%), Positives = 249/313 (79%), Gaps = 16/313 (5%)

Query: 35  MNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTN 94
           M ERHE WMAQYGRVY+D+AEKE R+ IFKENV  I +FN++   K Y LG+N+FAD +N
Sbjct: 1   MYERHEQWMAQYGRVYKDDAEKETRYNIFKENVARIDAFNSQT-GKSYNLGVNQFADLSN 59

Query: 95  EEFRAPRNGYKRRLPSVRSSETTDVSFRYENAS-VPASIDWRKKGAVTGVKDQGQCGCCW 153
           EEF+A RN +K  + S ++       FRYEN S VPA++DWRKKGAVT VKDQGQC    
Sbjct: 60  EEFKASRNRFKGHMCSPQAG-----PFRYENVSAVPATMDWRKKGAVTPVKDQGQC---- 110

Query: 154 AFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATE 213
               VAAMEGIN +TT KL SLSEQE+VDCDT GEDQGC GGLMDDAF+FI  NKGL TE
Sbjct: 111 ----VAAMEGINQLTTGKLISLSEQEVVDCDTKGEDQGCNGGLMDDAFKFIEQNKGLTTE 166

Query: 214 AKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQF 273
           A YPY  +DG+CN ++    AAKI+G++DVP+N+EAALMKAVA QPVSVAIDA G +FQF
Sbjct: 167 ANYPYTGTDGTCNTQKEVSHAAKITGFQDVPANSEAALMKAVAKQPVSVAIDAGGFEFQF 226

Query: 274 YSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEG 333
           YSSG+FTG CGTELDHGVTAVGYG   DGTKYWLVKNSWG  WGE GYIRMQ+DI AKEG
Sbjct: 227 YSSGIFTGSCGTELDHGVTAVGYG-GSDGTKYWLVKNSWGAQWGEEGYIRMQKDISAKEG 285

Query: 334 LCGIAMQASYPTA 346
           LCGIAMQASYPTA
Sbjct: 286 LCGIAMQASYPTA 298


>gi|356515050|ref|XP_003526214.1| PREDICTED: vignain-like [Glycine max]
          Length = 344

 Score =  438 bits (1127), Expect = e-120,   Method: Compositional matrix adjust.
 Identities = 221/344 (64%), Positives = 261/344 (75%), Gaps = 9/344 (2%)

Query: 7   ENKLVLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKEN 66
           + + +LA  L L V   Q   R L+   + ERHE WMA+YG++Y+D AEKE RF+IFK+N
Sbjct: 6   QKQHMLALFLFLAVGISQVMPRKLHQTALRERHENWMAEYGKIYKDAAEKEKRFQIFKDN 65

Query: 67  VEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENA 126
           VE+I SFN  A NKPYKLG+N  AD T EEF+  RNG KR      ++   +  F+YEN 
Sbjct: 66  VEFIESFN-AAGNKPYKLGVNHLADLTLEEFKDSRNGLKRTYEFSTTTFKLN-GFKYENV 123

Query: 127 S-VPASIDWRKKGAVTGVKDQG-QCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCD 184
           + +P +IDWR KGAVT +KDQG QCG CWAFS VAA EGI  I+T  L SLSEQELVDCD
Sbjct: 124 TDIPEAIDWRVKGAVTPIKDQGDQCGSCWAFSTVAATEGIYQISTGMLMSLSEQELVDCD 183

Query: 185 TSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCN-KKEANPSAAKISGYEDV 243
           +   D GC+GGLM+D FEFII N G+++EA YPY A DG+C+  KEA+P AA+I GYE V
Sbjct: 184 SV--DHGCDGGLMEDGFEFIIKNGGISSEANYPYTAVDGTCDASKEASP-AAQIKGYETV 240

Query: 244 PSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGT 303
           P+N+E AL +AVANQPVSV+IDA GS FQFYSSGVFTGQCGT+LDHGVT VGYGT DDGT
Sbjct: 241 PANSEEALQQAVANQPVSVSIDAGGSGFQFYSSGVFTGQCGTQLDHGVTVVGYGTTDDGT 300

Query: 304 -KYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPTA 346
            +YW+VKNSWGT WGE GYIRMQR IDA EGLCGIAM ASYPTA
Sbjct: 301 HEYWIVKNSWGTQWGEEGYIRMQRGIDALEGLCGIAMDASYPTA 344


>gi|356542631|ref|XP_003539770.1| PREDICTED: vignain-like [Glycine max]
          Length = 343

 Score =  434 bits (1116), Expect = e-119,   Method: Compositional matrix adjust.
 Identities = 207/337 (61%), Positives = 264/337 (78%), Gaps = 7/337 (2%)

Query: 12  LAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIA 71
           LA +L+ G WA  + +RTL DA+M+ERHE WMAQ+G+VY+D+ EKE+R+KIF++NV+ I 
Sbjct: 12  LALLLLFGFWAFSANTRTLEDASMHERHEQWMAQHGKVYKDHHEKELRYKIFQQNVKGIE 71

Query: 72  SFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENAS-VPA 130
            FNN A NK +KLG+N+FAD T EEF+A  N  K  + S  S  +T   F+YE+ + VPA
Sbjct: 72  GFNN-AGNKSHKLGVNQFADLTEEEFKAI-NKLKGYMWSKISRTST---FKYEHVTKVPA 126

Query: 131 SIDWRKKGAVTGVKDQG-QCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGED 189
           ++DWR+KGAVT +K QG +CG CWAF+AVAA EGI  +TT +L SLSEQEL+DCDT+G++
Sbjct: 127 TLDWRQKGAVTPIKSQGLKCGSCWAFAAVAATEGITKLTTGELISLSEQELIDCDTNGDN 186

Query: 190 QGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEA 249
            GC+ G++ +AF+FI+ NKGLATEA YPY+A DG+CN K  +   A I GYEDVP+NNE 
Sbjct: 187 GGCKWGIIQEAFKFIVQNKGLATEASYPYQAVDGTCNAKVESKHVASIKGYEDVPANNET 246

Query: 250 ALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVK 309
           AL+ AVANQPVSV +D+S  DF+FYSSGV +G CGT  DH VT VGYG +DDGTKYWL+K
Sbjct: 247 ALLNAVANQPVSVLVDSSDYDFRFYSSGVLSGSCGTTFDHAVTVVGYGVSDDGTKYWLIK 306

Query: 310 NSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPTA 346
           NSWG  WGE GYIR++RD+ AKEG+CGIAMQASYP A
Sbjct: 307 NSWGVYWGEQGYIRIKRDVAAKEGMCGIAMQASYPIA 343


>gi|413944253|gb|AFW76902.1| hypothetical protein ZEAMMB73_056195 [Zea mays]
          Length = 340

 Score =  434 bits (1115), Expect = e-119,   Method: Compositional matrix adjust.
 Identities = 212/325 (65%), Positives = 247/325 (76%), Gaps = 15/325 (4%)

Query: 27  SRTLN-DATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLG 85
           +R LN D+ M  RHE WMAQY RVY+D AEK  RF++FK NV++I SFN    N+ + LG
Sbjct: 24  ARDLNEDSAMVARHEQWMAQYSRVYKDAAEKARRFEVFKANVKFIESFNTGG-NRKFWLG 82

Query: 86  INEFADQTNEEFRAPRN--GYKRRLPSVRSSETTDVSFRYENASV---PASIDWRKKGAV 140
           IN+FAD TN+EFR  +   G+K  L  V +       FRYEN SV   PA+IDWR  GAV
Sbjct: 83  INQFADLTNDEFRTTKTNKGFKPSLDKVSTG------FRYENVSVDAIPATIDWRTNGAV 136

Query: 141 TGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDA 200
           T +KDQGQCGCCWAFSAVAA EGI  I+T KL SLSEQELVDCD  GEDQGCEGGLMDDA
Sbjct: 137 TPIKDQGQCGCCWAFSAVAATEGIVKISTGKLISLSEQELVDCDVHGEDQGCEGGLMDDA 196

Query: 201 FEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPV 260
           F+FII N GL TE+ YPY A+DG C  K  + SAA I GYEDVP+N+EAALMKAVANQPV
Sbjct: 197 FKFIIKNGGLTTESNYPYTAADGKC--KSGSNSAANIKGYEDVPTNDEAALMKAVANQPV 254

Query: 261 SVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENG 320
           SVA+D     FQFYS GV TG CGT+LDHG+ A+GYG   DGTKYWL+KNSWGTTWGENG
Sbjct: 255 SVAVDGGDMTFQFYSGGVMTGSCGTDLDHGIAAIGYGKTSDGTKYWLMKNSWGTTWGENG 314

Query: 321 YIRMQRDIDAKEGLCGIAMQASYPT 345
           Y+RM++DI  K+G+CG+AM+ SYPT
Sbjct: 315 YLRMEKDISDKKGMCGLAMEPSYPT 339


>gi|38346003|emb|CAD40112.2| OSJNBa0035O13.5 [Oryza sativa Japonica Group]
 gi|125589427|gb|EAZ29777.1| hypothetical protein OsJ_13835 [Oryza sativa Japonica Group]
          Length = 339

 Score =  434 bits (1115), Expect = e-119,   Method: Compositional matrix adjust.
 Identities = 216/347 (62%), Positives = 256/347 (73%), Gaps = 12/347 (3%)

Query: 4   ILLENKLVLAAILVLGVWAPQSWSRTLND-ATMNERHEMWMAQYGRVYRDNAEKEMRFKI 62
           + +   L+ A +  L + +    +R  +D A M  RHE WM QYGRVY+D  EK  RF+I
Sbjct: 1   MAIPKALLFAILSCLCLCSAVLAAREQSDHAAMVARHERWMEQYGRVYKDATEKARRFEI 60

Query: 63  FKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFR 122
           FK NV +I SFN  A N  + LG+N+FAD TN EFRA +   K  +PS     TT   FR
Sbjct: 61  FKANVAFIESFN--AGNHKFWLGVNQFADLTNYEFRATKTN-KGFIPSTVRVPTT---FR 114

Query: 123 YENASV---PASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQE 179
           YEN S+   PA++DWR KGAVT +KDQGQCGCCWAFSAVAAMEGI  ++T KL SLSEQE
Sbjct: 115 YENVSIDTLPATVDWRTKGAVTPIKDQGQCGCCWAFSAVAAMEGIVKLSTGKLISLSEQE 174

Query: 180 LVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISG 239
           LVDCD  GEDQGCEGGLMDDAF+FII N GL TE+KYPY A+DG CN    + SAA I G
Sbjct: 175 LVDCDVHGEDQGCEGGLMDDAFKFIIKNGGLTTESKYPYTAADGKCNG--GSNSAATIKG 232

Query: 240 YEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTA 299
           YEDVP+NNEAALMKAVANQPVSVA+D     FQFYS GV TG CGT+LDHG+ A+GYG  
Sbjct: 233 YEDVPANNEAALMKAVANQPVSVAVDGGDMTFQFYSGGVMTGSCGTDLDHGIVAIGYGKD 292

Query: 300 DDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPTA 346
            DGT+YWL+KNSWGTTWGENG++RM++DI  K G+CG+AM+ SYPTA
Sbjct: 293 GDGTQYWLLKNSWGTTWGENGFLRMEKDISDKRGMCGLAMEPSYPTA 339


>gi|255568299|ref|XP_002525124.1| cysteine protease, putative [Ricinus communis]
 gi|223535583|gb|EEF37251.1| cysteine protease, putative [Ricinus communis]
          Length = 342

 Score =  433 bits (1114), Expect = e-119,   Method: Compositional matrix adjust.
 Identities = 205/347 (59%), Positives = 261/347 (75%), Gaps = 6/347 (1%)

Query: 1   MAMILLENKLVLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRF 60
           MA++     L++A   VL +WA Q+ +R L+++TM ERHE WMA++G+VY+D+ EK  RF
Sbjct: 1   MALLCKGQFLLIALFFVLAMWADQASTRELHESTMVERHEKWMAKHGKVYKDDEEKLRRF 60

Query: 61  KIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVS 120
           +IFK NVE+I S +N A N  Y LGIN FAD TNEEFRA  NGYKR L + R        
Sbjct: 61  QIFKNNVEFIES-SNAAGNNSYMLGINRFADLTNEEFRASWNGYKRPLDASR----IVTP 115

Query: 121 FRYENAS-VPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQE 179
           F+YEN + +P S+DWR+KGAVT +KDQ +CG CWAFSAVAA EG++ + T KL SLSEQE
Sbjct: 116 FKYENVTALPYSMDWRRKGAVTSIKDQRECGSCWAFSAVAATEGVHKLRTGKLVSLSEQE 175

Query: 180 LVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISG 239
           LVDCD  GED+GC+GGLM+DAF+FI  N G+ TEA Y Y+  DG C+ K+     AKI+G
Sbjct: 176 LVDCDVKGEDKGCQGGLMEDAFKFIKRNGGITTEANYAYRGRDGKCDTKKEASHVAKITG 235

Query: 240 YEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTA 299
           Y+ VP N+EAAL+KAVA+QPVSV+IDA    FQFY SG++ G CG++L+HGV AVGYGT+
Sbjct: 236 YQVVPENSEAALLKAVAHQPVSVSIDAGSMSFQFYQSGIYAGSCGSDLNHGVAAVGYGTS 295

Query: 300 DDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPTA 346
             G+KYW+VKNSWG  WGE GY+RM+RDI +++GLCGIAM  SYPTA
Sbjct: 296 SSGSKYWIVKNSWGPEWGERGYVRMKRDITSRKGLCGIAMDCSYPTA 342


>gi|125547236|gb|EAY93058.1| hypothetical protein OsI_14861 [Oryza sativa Indica Group]
          Length = 339

 Score =  433 bits (1113), Expect = e-119,   Method: Compositional matrix adjust.
 Identities = 214/347 (61%), Positives = 256/347 (73%), Gaps = 12/347 (3%)

Query: 4   ILLENKLVLAAILVLGVWAPQSWSRTL-NDATMNERHEMWMAQYGRVYRDNAEKEMRFKI 62
           + +   L+ A +  L + +    +R L +DA M  RHE WMAQYGRVYRD+AEK  RF++
Sbjct: 1   MAMAKALLFAILGCLCLCSAVLAARELSDDAAMAARHERWMAQYGRVYRDDAEKARRFEV 60

Query: 63  FKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFR 122
           FK NV +I SFN  A N  + LG+N+FAD TN+EFR  +   K  +PS     T    FR
Sbjct: 61  FKANVAFIESFN--AGNHNFWLGVNQFADLTNDEFRWTKTN-KGFIPSTTRVPT---GFR 114

Query: 123 YENASV---PASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQE 179
           YEN ++   PA++DWR KGAVT +KDQGQCGCCWAFSAVAAMEGI  ++T KL SLSEQE
Sbjct: 115 YENVNIDALPATVDWRTKGAVTPIKDQGQCGCCWAFSAVAAMEGIVKLSTGKLISLSEQE 174

Query: 180 LVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISG 239
           LVDCD  GEDQGCEGGLMDDAF+FII N GL TE+ YPY A+D  C  K  + S A I G
Sbjct: 175 LVDCDVHGEDQGCEGGLMDDAFKFIIKNGGLTTESNYPYAAADDKC--KSVSNSVASIKG 232

Query: 240 YEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTA 299
           YEDVP+NNEAALMKAVANQPVSVA+D     FQFY  GV TG CGT+LDHG+ A+GYG A
Sbjct: 233 YEDVPANNEAALMKAVANQPVSVAVDGGDMTFQFYKGGVMTGSCGTDLDHGIVAIGYGKA 292

Query: 300 DDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPTA 346
            DGTKYWL+KNSWGTTWGENG++RM++DI  K G+CG+AM+ SYPTA
Sbjct: 293 SDGTKYWLLKNSWGTTWGENGFLRMEKDISDKRGMCGLAMEPSYPTA 339


>gi|38345008|emb|CAD40026.2| OSJNBa0052O21.11 [Oryza sativa Japonica Group]
 gi|125589414|gb|EAZ29764.1| hypothetical protein OsJ_13822 [Oryza sativa Japonica Group]
          Length = 339

 Score =  432 bits (1111), Expect = e-118,   Method: Compositional matrix adjust.
 Identities = 214/347 (61%), Positives = 256/347 (73%), Gaps = 12/347 (3%)

Query: 4   ILLENKLVLAAILVLGVWAPQSWSRTL-NDATMNERHEMWMAQYGRVYRDNAEKEMRFKI 62
           + +   L+ A +  L + +    +R L +DA M  RHE WMAQYGRVYRD+AEK  RF++
Sbjct: 1   MAMAKALLFAILGCLCLCSAVLAARELSDDAAMAARHERWMAQYGRVYRDDAEKARRFEV 60

Query: 63  FKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFR 122
           FK NV +I SFN  A N  + LG+N+FAD TN+EFR  +   K  +PS     T    FR
Sbjct: 61  FKANVAFIESFN--AGNHNFWLGVNQFADLTNDEFRWMKTN-KGFIPSTTRVPT---GFR 114

Query: 123 YENASV---PASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQE 179
           YEN ++   PA++DWR KGAVT +KDQGQCGCCWAFSAVAAMEGI  ++T KL SLSEQE
Sbjct: 115 YENVNIDALPATVDWRTKGAVTPIKDQGQCGCCWAFSAVAAMEGIVKLSTGKLISLSEQE 174

Query: 180 LVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISG 239
           LVDCD  GEDQGCEGGLMDDAF+FII N GL TE+ YPY A+D  C  K  + S A I G
Sbjct: 175 LVDCDVHGEDQGCEGGLMDDAFKFIIKNGGLTTESNYPYAAADDKC--KSVSNSVASIKG 232

Query: 240 YEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTA 299
           YEDVP+NNEAALMKAVANQPVSVA+D     FQFY  GV TG CGT+LDHG+ A+GYG A
Sbjct: 233 YEDVPANNEAALMKAVANQPVSVAVDGGDMTFQFYKGGVMTGSCGTDLDHGIVAIGYGKA 292

Query: 300 DDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPTA 346
            DGTKYWL+KNSWGTTWGENG++RM++DI  K G+CG+AM+ SYPTA
Sbjct: 293 SDGTKYWLLKNSWGTTWGENGFLRMEKDISDKRGMCGLAMEPSYPTA 339


>gi|125547256|gb|EAY93078.1| hypothetical protein OsI_14879 [Oryza sativa Indica Group]
          Length = 339

 Score =  432 bits (1111), Expect = e-118,   Method: Compositional matrix adjust.
 Identities = 215/347 (61%), Positives = 256/347 (73%), Gaps = 12/347 (3%)

Query: 4   ILLENKLVLAAILVLGVWAPQSWSRTLND-ATMNERHEMWMAQYGRVYRDNAEKEMRFKI 62
           + +   L+ A +  L + +    +R  +D A M  RHE WM QYGRVY+D  EK  RF+I
Sbjct: 1   MAIPKALLFAILSCLCLCSAVLAAREQSDHAAMVARHERWMEQYGRVYKDATEKARRFEI 60

Query: 63  FKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFR 122
           FK NV +I SFN  A N  + LG+N+FAD TN EFRA +   K  +PS     TT   FR
Sbjct: 61  FKANVAFIESFN--AGNHKFWLGVNQFADLTNYEFRATKTN-KGFIPSTVRVPTT---FR 114

Query: 123 YENASV---PASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQE 179
           YEN S+   PA++DWR KGAVT +KDQGQCGCCWAFSAVAAMEGI  ++T KL SLSEQE
Sbjct: 115 YENVSIDTLPATVDWRTKGAVTPIKDQGQCGCCWAFSAVAAMEGIVKLSTGKLISLSEQE 174

Query: 180 LVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISG 239
           LVDCD  GEDQGCEGGLMDDAF+FII N GL TE+KYPY A+DG CN    + SAA I G
Sbjct: 175 LVDCDVHGEDQGCEGGLMDDAFKFIIKNGGLTTESKYPYTAADGKCNG--GSNSAATIKG 232

Query: 240 YEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTA 299
           YE+VP+NNEAALMKAVANQPVSVA+D     FQFYS GV TG CGT+LDHG+ A+GYG  
Sbjct: 233 YEEVPANNEAALMKAVANQPVSVAVDGGDMTFQFYSGGVMTGSCGTDLDHGIVAIGYGKD 292

Query: 300 DDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPTA 346
            DGT+YWL+KNSWGTTWGENG++RM++DI  K G+CG+AM+ SYPTA
Sbjct: 293 GDGTQYWLLKNSWGTTWGENGFLRMEKDISDKRGMCGLAMEPSYPTA 339


>gi|255568297|ref|XP_002525123.1| cysteine protease, putative [Ricinus communis]
 gi|223535582|gb|EEF37250.1| cysteine protease, putative [Ricinus communis]
          Length = 349

 Score =  432 bits (1111), Expect = e-118,   Method: Compositional matrix adjust.
 Identities = 210/347 (60%), Positives = 258/347 (74%), Gaps = 6/347 (1%)

Query: 1   MAMILLENKLVLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRF 60
           MA +     L +A   VL + A Q+ SR L++  M  RHE WMA++G+VY+D+ EK  RF
Sbjct: 1   MAFLCKGKILPIALFFVLAMCADQAASRELHELEMTGRHEKWMAKHGKVYKDDKEKLRRF 60

Query: 61  KIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVS 120
           +IFK NV +I SFN  A NK Y LGIN+FAD TNEEFRA  NGYKR L + R        
Sbjct: 61  QIFKSNVVFIESFNT-AGNKSYMLGINKFADLTNEEFRAFWNGYKRPLGASRKI----TP 115

Query: 121 FRYENAS-VPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQE 179
           F+YEN + +P+SIDWR KGAVT +KDQG CG CWAFSAVAA EGI+ + T KL SLSEQE
Sbjct: 116 FKYENVTALPSSIDWRSKGAVTPIKDQGVCGSCWAFSAVAATEGIHKLRTGKLVSLSEQE 175

Query: 180 LVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISG 239
           LVDCD  G+D+GC+GGLM DAF+FI  + G+ +EA YPY+  DG C+ K+    A KI+G
Sbjct: 176 LVDCDVKGQDKGCQGGLMVDAFKFIKRHGGMTSEANYPYQGRDGKCDTKKEASRAVKITG 235

Query: 240 YEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTA 299
           Y+ VP N+EAAL+KAVANQPVSVAIDA    FQFY SG+FTG CG +++HGV AVGYG +
Sbjct: 236 YQAVPKNSEAALLKAVANQPVSVAIDAGSLSFQFYRSGIFTGICGKDINHGVAAVGYGRS 295

Query: 300 DDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPTA 346
           + G+KYW+VKNSWGT WGE GYIRM+RD+ +KEGLCGIAM+ SYPTA
Sbjct: 296 NSGSKYWIVKNSWGTEWGEKGYIRMKRDVRSKEGLCGIAMECSYPTA 342


>gi|116309130|emb|CAH66233.1| H0825G02.10 [Oryza sativa Indica Group]
          Length = 339

 Score =  431 bits (1109), Expect = e-118,   Method: Compositional matrix adjust.
 Identities = 215/347 (61%), Positives = 255/347 (73%), Gaps = 12/347 (3%)

Query: 4   ILLENKLVLAAILVLGVWAPQSWSRTLND-ATMNERHEMWMAQYGRVYRDNAEKEMRFKI 62
           + +   L+ A +  L + +    +R  +D A M  RHE WM QYGRVY+D  EK  RF+I
Sbjct: 1   MAIPKALLFAILSCLCLCSAVLAAREQSDHAAMVARHERWMEQYGRVYKDATEKARRFEI 60

Query: 63  FKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFR 122
           FK NV +I SFN  A N  + L +N+FAD TN EFRA +   K  +PS     TT   FR
Sbjct: 61  FKANVAFIESFN--AGNHKFWLSVNQFADLTNYEFRATKTN-KGFIPSTVRVPTT---FR 114

Query: 123 YENASV---PASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQE 179
           YEN S+   PA++DWR KGAVT +KDQGQCGCCWAFSAVAAMEGI  ++T KL SLSEQE
Sbjct: 115 YENVSIDTLPATVDWRTKGAVTPIKDQGQCGCCWAFSAVAAMEGIVKLSTGKLISLSEQE 174

Query: 180 LVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISG 239
           LVDCD  GEDQGCEGGLMDDAF+FII N GL TE+KYPY A+DG CN    + SAA I G
Sbjct: 175 LVDCDVHGEDQGCEGGLMDDAFKFIIKNGGLTTESKYPYTAADGKCNG--GSNSAATIKG 232

Query: 240 YEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTA 299
           YEDVP+NNEAALMKAVANQPVSVA+D     FQFYS GV TG CGT+LDHG+ A+GYG  
Sbjct: 233 YEDVPANNEAALMKAVANQPVSVAVDGGDMTFQFYSGGVMTGSCGTDLDHGIVAIGYGKD 292

Query: 300 DDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPTA 346
            DGT+YWL+KNSWGTTWGENG++RM++DI  K G+CG+AM+ SYPTA
Sbjct: 293 GDGTQYWLLKNSWGTTWGENGFLRMEKDISDKRGMCGLAMEPSYPTA 339


>gi|116309178|emb|CAH66275.1| OSIGBa0147O06.5 [Oryza sativa Indica Group]
          Length = 339

 Score =  431 bits (1108), Expect = e-118,   Method: Compositional matrix adjust.
 Identities = 211/347 (60%), Positives = 256/347 (73%), Gaps = 12/347 (3%)

Query: 4   ILLENKLVLAAILVLGVWAPQSWSRTL-NDATMNERHEMWMAQYGRVYRDNAEKEMRFKI 62
           + +   L+ A +  L + +    +R L +DA M  RHE WMAQYGR+Y+D+AEK  RF++
Sbjct: 1   MAMAKALLFAILGCLCLCSAVLAARELSDDAAMAARHERWMAQYGRMYKDDAEKARRFEV 60

Query: 63  FKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFR 122
           FK NV +I SFN  A N  + LG+N+FAD TN+EFR+ +   K  +PS     T    FR
Sbjct: 61  FKANVAFIESFN--AGNHKFWLGVNQFADLTNDEFRSTKTN-KGFIPSTTRVPT---GFR 114

Query: 123 YENASV---PASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQE 179
           YEN ++   PA++DWR KG VT +KDQGQCGCCWAFSAVAAMEGI  ++T KL SLSEQE
Sbjct: 115 YENVNIDALPATMDWRTKGVVTPIKDQGQCGCCWAFSAVAAMEGIVKLSTGKLISLSEQE 174

Query: 180 LVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISG 239
           LVDCD  GEDQGCEGGLMDDAF+FII N GL TE+ YPY A+D  C  K  + S A I G
Sbjct: 175 LVDCDVHGEDQGCEGGLMDDAFKFIIKNGGLTTESNYPYAAADDKC--KSVSNSVASIKG 232

Query: 240 YEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTA 299
           YEDVP+NNEAALMKAVANQPVSVA+D     FQFY  GV TG CGT+LDHG+ A+GYG A
Sbjct: 233 YEDVPANNEAALMKAVANQPVSVAVDGGDMTFQFYKGGVMTGSCGTDLDHGIVAIGYGKA 292

Query: 300 DDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPTA 346
            DGTKYWL+KNSWGTTWGENG++RM++DI  K G+CG+AM+ SYPTA
Sbjct: 293 SDGTKYWLLKNSWGTTWGENGFLRMEKDISDKRGMCGLAMEPSYPTA 339


>gi|224076972|ref|XP_002305074.1| predicted protein [Populus trichocarpa]
 gi|224106329|ref|XP_002333698.1| predicted protein [Populus trichocarpa]
 gi|222837984|gb|EEE76349.1| predicted protein [Populus trichocarpa]
 gi|222848038|gb|EEE85585.1| predicted protein [Populus trichocarpa]
          Length = 307

 Score =  431 bits (1108), Expect = e-118,   Method: Compositional matrix adjust.
 Identities = 207/313 (66%), Positives = 243/313 (77%), Gaps = 7/313 (2%)

Query: 35  MNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTN 94
           M +RHE WMAQ+GRVY D  EKE R+ IFKEN+E I +FNN   ++ YKLG+N+FAD TN
Sbjct: 1   MLKRHEEWMAQHGRVYGDMKEKEKRYLIFKENIERIEAFNN-GSDRGYKLGVNKFADLTN 59

Query: 95  EEFRAPRNGYKRRLPSVRSSETTDVSFRYENAS-VPASIDWRKKGAVTGVKDQGQCGCCW 153
           EEFRA  +GYKR+     SS+    SFR+EN S +P S+DWRK GAVT VKDQG CGCCW
Sbjct: 60  EEFRAMHHGYKRQ-----SSKLMSSSFRHENLSAIPTSMDWRKAGAVTPVKDQGTCGCCW 114

Query: 154 AFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATE 213
           AFSAVAA+EGI  + T KL SLSEQ+LVDCD  G DQGC GGLMD+AF+FI+ N GL +E
Sbjct: 115 AFSAVAAIEGIIKLKTGKLISLSEQQLVDCDVKGVDQGCGGGLMDNAFQFILRNGGLTSE 174

Query: 214 AKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQF 273
           A YPY+  DG+C  K+     AKI+GYEDVP NNE AL++AVA QPVSVA++  G DFQF
Sbjct: 175 ATYPYQGVDGTCKSKKTASIEAKITGYEDVPVNNENALLQAVAKQPVSVAVEGGGYDFQF 234

Query: 274 YSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEG 333
           Y SGVF G CGT LDH VTA+GYGT  DGT YWLVKNSWGT+WGE+GY+RMQR I A+EG
Sbjct: 235 YKSGVFKGDCGTYLDHAVTAIGYGTNSDGTNYWLVKNSWGTSWGESGYMRMQRGIGAREG 294

Query: 334 LCGIAMQASYPTA 346
           LCG+AM ASYPTA
Sbjct: 295 LCGVAMDASYPTA 307


>gi|356543122|ref|XP_003540012.1| PREDICTED: LOW QUALITY PROTEIN: vignain-like [Glycine max]
          Length = 342

 Score =  431 bits (1108), Expect = e-118,   Method: Compositional matrix adjust.
 Identities = 215/323 (66%), Positives = 248/323 (76%), Gaps = 9/323 (2%)

Query: 27  SRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGI 86
           SR L+DA+M ERHE WM +YG+VY+D+AE E RF IF+ NVE+I SFN  A NKPYKL I
Sbjct: 26  SRKLHDASMYERHEQWMEKYGKVYKDSAEXEKRFLIFENNVEFIESFN-AAGNKPYKLSI 84

Query: 87  NEFADQTNEEFRAPRNGYK-RRLPSVRSSETTDVSFRYENAS-VPASIDWRKKGAVTGVK 144
           N  ADQTNEEF A   GYK      +R   TT   F+YEN + +P ++DWR+KG  T +K
Sbjct: 85  NHLADQTNEEFMASHKGYKGSHWQGLRI--TTQTPFKYENVTDIPWAVDWRQKGDATSIK 142

Query: 145 DQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFI 204
           DQGQCG CWAFSAVAA EGI  ITT  L SLSEQELVDCD+   D GC+GGLM+  FEFI
Sbjct: 143 DQGQCGICWAFSAVAATEGIYQITTGNLVSLSEQELVDCDSV--DHGCDGGLMEHGFEFI 200

Query: 205 ISNKGLATEAKYPYKASDGSCN-KKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVA 263
           I N G+++EA YPY A +G+C+  KEA+P  A+I GYE VP N E  L KAVANQPVSV+
Sbjct: 201 IKNGGISSEANYPYTAVNGTCDTNKEASP-GAQIKGYETVPVNCEEELQKAVANQPVSVS 259

Query: 264 IDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIR 323
           IDA GS FQFYSSGVFTGQCGT+LDHGVTAVGYG+ DDG +YW+VKNSWGT WGE GYIR
Sbjct: 260 IDAGGSAFQFYSSGVFTGQCGTQLDHGVTAVGYGSTDDGIQYWIVKNSWGTQWGEEGYIR 319

Query: 324 MQRDIDAKEGLCGIAMQASYPTA 346
           M R IDA+EGLCGIAM ASYPTA
Sbjct: 320 MLRGIDAQEGLCGIAMDASYPTA 342


>gi|242072572|ref|XP_002446222.1| hypothetical protein SORBIDRAFT_06g005410 [Sorghum bicolor]
 gi|241937405|gb|EES10550.1| hypothetical protein SORBIDRAFT_06g005410 [Sorghum bicolor]
          Length = 340

 Score =  430 bits (1106), Expect = e-118,   Method: Compositional matrix adjust.
 Identities = 216/341 (63%), Positives = 253/341 (74%), Gaps = 16/341 (4%)

Query: 12  LAAILVLGVWAPQSWS-RTLND-ATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEY 69
           + AIL L ++   + + R LND + M  RHE WMAQY RVY+D  EK  RF++FK NV++
Sbjct: 8   ILAILGLALFCGAALAARDLNDDSAMVARHEQWMAQYNRVYKDATEKAQRFEVFKANVKF 67

Query: 70  IASFNNKARNKPYKLGINEFADQTNEEFRAPRN--GYKRRLPSVRSSETTDVSFRYENAS 127
           I SFN    N+ + LG+N+FAD TN+EFRA +   G+K   PS     T    FRYEN S
Sbjct: 68  IESFN-AGGNRKFWLGVNQFADLTNDEFRATKTNKGFK---PSPVKVPT---GFRYENVS 120

Query: 128 V---PASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCD 184
           V   PASIDWR KGAVT +KDQGQCGCCWAFSAVAA EGI  I+T KL SLSEQELVDCD
Sbjct: 121 VDALPASIDWRTKGAVTPIKDQGQCGCCWAFSAVAATEGIVKISTDKLISLSEQELVDCD 180

Query: 185 TSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVP 244
             GEDQGCEGGLMDDAF+FII N GL TE+ YPY A+DG C  K    SAA I G+EDVP
Sbjct: 181 VHGEDQGCEGGLMDDAFKFIIKNGGLTTESSYPYTATDGKC--KSGTNSAANIKGFEDVP 238

Query: 245 SNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTK 304
           +N+EAALMKAVANQPVSVA+D     FQ YS GV TG CGT+LDHG+ A+GYG   DGTK
Sbjct: 239 ANDEAALMKAVANQPVSVAVDGGDMTFQLYSGGVMTGSCGTDLDHGIAAIGYGQTSDGTK 298

Query: 305 YWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPT 345
           YWL+KNSWGTTWGENGY+RM++DI  K G+CG+AM+ SYPT
Sbjct: 299 YWLLKNSWGTTWGENGYLRMEKDISDKRGMCGLAMEPSYPT 339


>gi|413953667|gb|AFW86316.1| hypothetical protein ZEAMMB73_635707 [Zea mays]
          Length = 340

 Score =  429 bits (1103), Expect = e-117,   Method: Compositional matrix adjust.
 Identities = 207/320 (64%), Positives = 245/320 (76%), Gaps = 14/320 (4%)

Query: 31  NDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFA 90
           +D+ M  RHE WMAQY RVY+D +EK  RF++FK NV++I SFN    NK + LG+N+FA
Sbjct: 29  DDSAMVARHEQWMAQYSRVYKDASEKARRFEVFKANVKFIESFNAGGNNK-FWLGVNQFA 87

Query: 91  DQTNEEFRAPRN--GYKRRLPSVRSSETTDVSFRYENASV---PASIDWRKKGAVTGVKD 145
           D TN+EFR+ +   G+K       S+      FRYEN SV   P +IDWR KGAVT +KD
Sbjct: 88  DLTNDEFRSIKTNKGFKS------SNMKIPTGFRYENVSVDALPTTIDWRTKGAVTPIKD 141

Query: 146 QGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFII 205
           QGQCGCCWAFSAVAA EGI  I+T KL SL+EQELVDCD  GEDQGCEGGLMDDAF+FII
Sbjct: 142 QGQCGCCWAFSAVAATEGIVKISTGKLVSLAEQELVDCDVHGEDQGCEGGLMDDAFKFII 201

Query: 206 SNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAID 265
           +N GL TE+ YPY A+DG C  K  + SAA I GYEDVP+N+EAALMKAVANQPVSVA+D
Sbjct: 202 NNGGLTTESSYPYTAADGKC--KSGSNSAATIKGYEDVPANDEAALMKAVANQPVSVAVD 259

Query: 266 ASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQ 325
                FQFYSSGV TG CGT+LDHG+ A+GYG   DGTKYWL+KNSWGTTWGENGY+RM+
Sbjct: 260 GGDMTFQFYSSGVMTGSCGTDLDHGIAAIGYGKTSDGTKYWLMKNSWGTTWGENGYLRME 319

Query: 326 RDIDAKEGLCGIAMQASYPT 345
           +DI  K G+CG+AM+ SYPT
Sbjct: 320 KDISDKRGMCGLAMEPSYPT 339


>gi|37780043|gb|AAP32194.1| cysteine protease 1 [Trifolium repens]
          Length = 292

 Score =  429 bits (1103), Expect = e-117,   Method: Compositional matrix adjust.
 Identities = 206/293 (70%), Positives = 236/293 (80%), Gaps = 4/293 (1%)

Query: 55  EKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSS 114
           E+E R +IF +NV YI + N+   NK YKL IN+FAD TNEEF A RN +K  + S    
Sbjct: 3   EREKRLRIFNKNVNYIEASNSAVNNKLYKLSINKFADLTNEEFIASRNKFKGHMCSSIIR 62

Query: 115 ETTDVSFRYENAS-VPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLT 173
            TT   F+YENAS +P+++DWRKKGAVT VK+QGQCG CWAFSAVAA EGI+ ++T KL 
Sbjct: 63  TTT---FKYENASAIPSTVDWRKKGAVTPVKNQGQCGSCWAFSAVAATEGIHQLSTGKLV 119

Query: 174 SLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPS 233
           SLSEQEL+DCDT G DQGCEGGLMDDAF+FII N GL+TE +YPY+  DG+CN  +A+  
Sbjct: 120 SLSEQELIDCDTKGVDQGCEGGLMDDAFKFIIQNHGLSTEVQYPYEGVDGTCNANKASIH 179

Query: 234 AAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTA 293
           A  I+GYEDVP+NNE AL KAVANQP+SVAIDASGSDFQFY+SGVFTG CGTELDHGVTA
Sbjct: 180 AVTITGYEDVPANNELALQKAVANQPISVAIDASGSDFQFYNSGVFTGSCGTELDHGVTA 239

Query: 294 VGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPTA 346
           VGYG  +DGTKYWLVKNSWG  WGE GYIRMQR I A EGLCGIAMQASYPTA
Sbjct: 240 VGYGVGNDGTKYWLVKNSWGADWGEEGYIRMQRGIAAAEGLCGIAMQASYPTA 292


>gi|413953668|gb|AFW86317.1| hypothetical protein ZEAMMB73_339067 [Zea mays]
          Length = 433

 Score =  429 bits (1102), Expect = e-117,   Method: Compositional matrix adjust.
 Identities = 205/318 (64%), Positives = 243/318 (76%), Gaps = 10/318 (3%)

Query: 31  NDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFA 90
           +D+ M  RHE WMAQY RVY+D +EK  RF++FK NV++I SFN    NK + LG+N+FA
Sbjct: 122 DDSVMVARHEQWMAQYSRVYKDASEKARRFEVFKANVQFIESFNAGGNNK-FWLGVNQFA 180

Query: 91  DQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENAS---VPASIDWRKKGAVTGVKDQG 147
           D TN+EFR+ +    + L S  S+      FRYEN S   +P +IDWR KGAVT +KDQG
Sbjct: 181 DLTNDEFRSTKT--NKGLKS--SNMKIPTGFRYENVSADALPTTIDWRTKGAVTPIKDQG 236

Query: 148 QCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISN 207
           QCGCCWAFSAVAA EGI  I+T KL SL+EQELVDCD  GEDQGCEGGLMDDAF+FII N
Sbjct: 237 QCGCCWAFSAVAATEGIVKISTGKLVSLAEQELVDCDVHGEDQGCEGGLMDDAFKFIIKN 296

Query: 208 KGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDAS 267
            GL TE+ YPY A+DG C  K  + SAA I GYEDVP+N+EAALMKAVANQPVSVA+D  
Sbjct: 297 GGLTTESSYPYTAADGKC--KSGSNSAATIKGYEDVPANDEAALMKAVANQPVSVAVDGG 354

Query: 268 GSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRD 327
              FQFYS GV TG CGT+LDHG+ A+GYG   DGTKYWL+KNSWGTTWGENGY+RM++D
Sbjct: 355 DMTFQFYSGGVMTGSCGTDLDHGIAAIGYGKTSDGTKYWLMKNSWGTTWGENGYLRMEKD 414

Query: 328 IDAKEGLCGIAMQASYPT 345
           I  K G+CG+AM+ SYPT
Sbjct: 415 ISDKRGMCGLAMEPSYPT 432


>gi|356543118|ref|XP_003540010.1| PREDICTED: LOW QUALITY PROTEIN: vignain-like [Glycine max]
          Length = 339

 Score =  426 bits (1094), Expect = e-116,   Method: Compositional matrix adjust.
 Identities = 213/340 (62%), Positives = 252/340 (74%), Gaps = 14/340 (4%)

Query: 11  VLAAILVLGVWAPQSWSRTLNDAT--MNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVE 68
           +LA +L+L +   Q  SR L++A+  M+ERHE W  +YG+VY+D AEK+ R  IFK+NVE
Sbjct: 10  ILALVLLLPICISQVMSRNLHEASXCMSERHEQWTKKYGKVYKDAAEKQKRLLIFKDNVE 69

Query: 69  YIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENAS- 127
           +I SFN  A NKPYKL IN   DQTNEEF A  NGYK +         +   F+YEN + 
Sbjct: 70  FIESFN-AAGNKPYKLSINHLTDQTNEEFVASHNGYKHK------GSHSQTPFKYENITG 122

Query: 128 VPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSG 187
           VP ++DWR+ GAV  +KDQGQCG CWAFS VA  EGI  ITT  L SLSEQELVDCD+  
Sbjct: 123 VPNAVDWRENGAVXAMKDQGQCGNCWAFSTVATTEGIYQITTSMLMSLSEQELVDCDSV- 181

Query: 188 EDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCN-KKEANPSAAKISGYEDVPSN 246
            D GC+GG M+  FEFI  N G+++EA YPY A DG+ +  KEA+P AA+I GYE VP+N
Sbjct: 182 -DHGCDGGYMEGGFEFIXKNGGISSEANYPYTAVDGTYDANKEASP-AAQIKGYETVPAN 239

Query: 247 NEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYW 306
           +E AL KAVANQPVSV ID  GS FQF SSGVFTGQCGT+LDHGVTAVGYG+ DDGT+YW
Sbjct: 240 SEDALQKAVANQPVSVTIDVGGSAFQFNSSGVFTGQCGTQLDHGVTAVGYGSTDDGTQYW 299

Query: 307 LVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPTA 346
           +VKNSWGT WGE GYIRMQR  DA+EGLCGIAM ASYPTA
Sbjct: 300 IVKNSWGTQWGEEGYIRMQRGTDAQEGLCGIAMDASYPTA 339


>gi|357160591|ref|XP_003578813.1| PREDICTED: vignain-like [Brachypodium distachyon]
          Length = 339

 Score =  425 bits (1092), Expect = e-116,   Method: Compositional matrix adjust.
 Identities = 209/347 (60%), Positives = 252/347 (72%), Gaps = 12/347 (3%)

Query: 4   ILLENKLVLAAILVLGVWAPQSWSRTLND-ATMNERHEMWMAQYGRVYRDNAEKEMRFKI 62
           + + N  +LA +  L  +A    +R LND  +M  RHE WM+QYGR Y+D AEK+ +F++
Sbjct: 1   MAIPNASLLAILGCLCFFASGLAARELNDDLSMVARHESWMSQYGRSYKDAAEKDRKFEV 60

Query: 63  FKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFR 122
           FK N  +I SFN  A+N  + LGIN+FAD TNEEF+  +         VR+S      F 
Sbjct: 61  FKANAAFIDSFN--AKNHKFWLGINQFADITNEEFKVTKTNKGFISNKVRAS----TGFS 114

Query: 123 YENASV---PASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQE 179
           YEN S+   PA+IDWR KGAVT VKDQGQCGCCWAFSAVAA EGI  ++T KL SLSEQE
Sbjct: 115 YENVSIDALPATIDWRTKGAVTPVKDQGQCGCCWAFSAVAATEGIVKLSTGKLVSLSEQE 174

Query: 180 LVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISG 239
           LVDCD  GEDQGCEGGLMDDAF+FII+N GL  E+ YPY A DG C  K  + SA  I  
Sbjct: 175 LVDCDVHGEDQGCEGGLMDDAFKFIITNGGLTQESSYPYDAEDGKC--KSGSKSAGTIKS 232

Query: 240 YEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTA 299
           YEDVP+NNE ALMKAVANQPVSVA+D     FQFYS GV TG CGT+LDHG+ A+GYG  
Sbjct: 233 YEDVPANNEGALMKAVANQPVSVAVDGGDMTFQFYSGGVMTGSCGTDLDHGIAAIGYGVT 292

Query: 300 DDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPTA 346
            DGTKYWL+KNSWGT+WGENG++RM++DI  K+G+CG+AM+ SYPTA
Sbjct: 293 SDGTKYWLMKNSWGTSWGENGFLRMEKDIADKKGMCGLAMEPSYPTA 339


>gi|357160572|ref|XP_003578808.1| PREDICTED: vignain-like [Brachypodium distachyon]
          Length = 339

 Score =  424 bits (1090), Expect = e-116,   Method: Compositional matrix adjust.
 Identities = 211/345 (61%), Positives = 254/345 (73%), Gaps = 22/345 (6%)

Query: 11  VLAAILVLGVWAPQSWSRTLND-ATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEY 69
           +LA +  L +      +R LND  +M  RHE WM QYGRVY+D AEK  +F++FK N E+
Sbjct: 8   LLAILGCLCLCGSVLAARELNDDLSMVARHENWMLQYGRVYKDAAEKAQKFEVFKANAEF 67

Query: 70  IASFNNKARNKPYKLGINEFADQTNEEFRAPRN--GY---KRRLPSVRSSETTDVSFRYE 124
           I SFN  A N  + LGIN+FAD TNEEF+A +   G+   K R+P+          F YE
Sbjct: 68  INSFN--AGNHKFWLGINQFADITNEEFKATKTNKGFISNKVRVPT---------GFMYE 116

Query: 125 NAS---VPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELV 181
           N S   +PA+IDWR KGAVT +KDQGQCGCCWAFSAVAAMEGI  ++T KL SLSEQELV
Sbjct: 117 NMSFDALPATIDWRTKGAVTPIKDQGQCGCCWAFSAVAAMEGIVKLSTGKLVSLSEQELV 176

Query: 182 DCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYE 241
           DCD  GEDQGCEGGLMDDAF+FII N GL  E+ YPY A+DG C  K  + SAA I  YE
Sbjct: 177 DCDVHGEDQGCEGGLMDDAFKFIIKNGGLTQESNYPYDAADGKC--KSGSSSAATIKSYE 234

Query: 242 DVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADD 301
           DVP+NNE ALMKAVANQPVSVA+D     FQFYS GV TG CGT+LDHG+ A+GYGT  D
Sbjct: 235 DVPANNEGALMKAVANQPVSVAVDGGDMTFQFYSGGVMTGSCGTDLDHGIAAIGYGTTSD 294

Query: 302 GTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPTA 346
           GTK+W++KNSWGT+WGENG++RM++DI  K+G+CG+AM+ SYPTA
Sbjct: 295 GTKFWIMKNSWGTSWGENGFLRMEKDIADKKGMCGLAMEPSYPTA 339


>gi|357160599|ref|XP_003578815.1| PREDICTED: vignain-like [Brachypodium distachyon]
          Length = 339

 Score =  422 bits (1086), Expect = e-116,   Method: Compositional matrix adjust.
 Identities = 208/340 (61%), Positives = 249/340 (73%), Gaps = 12/340 (3%)

Query: 11  VLAAILVLGVWAPQSWSRTLND-ATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEY 69
           +LA +  L   +    +R LND  +M  RHE WMAQYGRVY+D AEK  +F++FK N  +
Sbjct: 8   ILAILGCLCFCSSVLAARELNDDLSMAARHETWMAQYGRVYKDAAEKAQKFEVFKANARF 67

Query: 70  IASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENASV- 128
           I SFN  A N  + LGIN+FAD TNEEF+A +    +   S ++  +T   F+YEN  + 
Sbjct: 68  IDSFN--AENHKFWLGINQFADLTNEEFKATKT--NKGFISNKARVST--GFKYENLKIE 121

Query: 129 --PASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTS 186
             P SIDWR KGAVT VKDQGQCGCCWAFSAVAA EGI  ++T KL SLSEQELVDCD  
Sbjct: 122 ALPTSIDWRTKGAVTPVKDQGQCGCCWAFSAVAATEGIVKLSTGKLVSLSEQELVDCDVH 181

Query: 187 GEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSN 246
           GEDQGCEGGLMDDAF+FII+N GL  E+ YPY A DG C  K  + SA  I  YEDVP+N
Sbjct: 182 GEDQGCEGGLMDDAFKFIITNGGLTQESSYPYDAEDGKC--KSGSKSAGTIKSYEDVPAN 239

Query: 247 NEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYW 306
           NE ALMKAVANQPVSVA+D     FQFYS GV TG CGT+LDHG+ A+GYG   DGTK+W
Sbjct: 240 NEGALMKAVANQPVSVAVDGGDMTFQFYSGGVMTGSCGTDLDHGIAAIGYGVTSDGTKFW 299

Query: 307 LVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPTA 346
           L+KNSWGTTWGENG++RM++DI  K+G+CG+AM+ SYPTA
Sbjct: 300 LMKNSWGTTWGENGFLRMEKDIADKKGMCGLAMEPSYPTA 339


>gi|242072392|ref|XP_002446132.1| hypothetical protein SORBIDRAFT_06g002150 [Sorghum bicolor]
 gi|241937315|gb|EES10460.1| hypothetical protein SORBIDRAFT_06g002150 [Sorghum bicolor]
          Length = 337

 Score =  420 bits (1080), Expect = e-115,   Method: Compositional matrix adjust.
 Identities = 204/322 (63%), Positives = 243/322 (75%), Gaps = 12/322 (3%)

Query: 27  SRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGI 86
           +R L+DA M ERHE WM +YGRVY+D AEK  RF+ FK NV ++ SFN   +NK + LG+
Sbjct: 24  ARELSDAAMVERHENWMVEYGRVYKDAAEKARRFEAFKHNVAFVESFNTNKKNK-FWLGV 82

Query: 87  NEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENASV---PASIDWRKKGAVTGV 143
           N+FAD T EEF+A + G+K   P+     TT   F+YEN SV   P ++DWR KGAVT +
Sbjct: 83  NQFADLTTEEFKANK-GFK---PTAEKVPTT--GFKYENLSVSALPTAVDWRTKGAVTPI 136

Query: 144 KDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEF 203
           K+QGQCGCCWAFSAVAAMEGI  ++T  L SLSEQELVDCDT   D+GCEGG MD AFEF
Sbjct: 137 KNQGQCGCCWAFSAVAAMEGIVKLSTGNLISLSEQELVDCDTHSMDEGCEGGWMDSAFEF 196

Query: 204 IISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVA 263
           +I N GLATE+ YPYKA DG C  K  + SAA I G+EDVP NNEAALMKAVANQPVSVA
Sbjct: 197 VIKNGGLATESNYPYKAVDGKC--KGGSKSAATIKGHEDVPVNNEAALMKAVANQPVSVA 254

Query: 264 IDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIR 323
           +DAS   F  YS GV TG CGTELDHG+ A+GYG   DGTKYW++KNSWGTTWGE G++R
Sbjct: 255 VDASDRTFMLYSGGVMTGSCGTELDHGIAAIGYGMESDGTKYWILKNSWGTTWGEKGFLR 314

Query: 324 MQRDIDAKEGLCGIAMQASYPT 345
           M++DI  K G+CG+AM+ SYPT
Sbjct: 315 MEKDITDKRGMCGLAMKPSYPT 336


>gi|224076970|ref|XP_002305073.1| predicted protein [Populus trichocarpa]
 gi|222848037|gb|EEE85584.1| predicted protein [Populus trichocarpa]
          Length = 340

 Score =  419 bits (1077), Expect = e-115,   Method: Compositional matrix adjust.
 Identities = 207/348 (59%), Positives = 259/348 (74%), Gaps = 10/348 (2%)

Query: 1   MAMILLENKLVLAAILVLGVWAPQSWSRTLNDAT-MNERHEMWMAQYGRVYRDNAEKEMR 59
           MA      ++ L  +L+L  WA +   R L++   M +RHE WMAQ+GRVY D  EKE R
Sbjct: 1   MAAKKCNTRIFLPFLLILAAWATKIACRPLDEQEYMLKRHEEWMAQHGRVYGDMKEKEKR 60

Query: 60  FKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDV 119
           + IFKEN+E I +FNN + ++ YKLG+N+FAD TNEEFRA  +GYKR+   + SS     
Sbjct: 61  YLIFKENIERIEAFNNGS-DRGYKLGVNKFADLTNEEFRAMYHGYKRQSSKLMSS----- 114

Query: 120 SFRYENAS-VPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQ 178
           SFRYEN S +P S+DWR  GAVT VKDQG CGCCWAFS VAA+EGI  + T  L SLSEQ
Sbjct: 115 SFRYENLSDIPTSMDWRNDGAVTPVKDQGTCGCCWAFSTVAAIEGIIKLQTGNLISLSEQ 174

Query: 179 ELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKIS 238
           +LVDC T+G ++GC+GGLMD AF++II N GL +E  YPY+  DG+C+ ++A  + A+I+
Sbjct: 175 QLVDC-TAG-NKGCQGGLMDTAFQYIIRNGGLTSEDNYPYQGVDGTCSSEKAASTEAQIT 232

Query: 239 GYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGT 298
           GYEDVP NNE AL++AVA QPVSV +D  G+DFQFY SGVF G CGT+ +H VTA+GYGT
Sbjct: 233 GYEDVPQNNENALLQAVAKQPVSVGVDGGGNDFQFYKSGVFNGDCGTQQNHAVTAIGYGT 292

Query: 299 ADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPTA 346
             DGT YWLVKNSWGT+WGENGY+RM+R I + EGLCG+AM ASYPTA
Sbjct: 293 DIDGTDYWLVKNSWGTSWGENGYMRMRRGIGSSEGLCGVAMDASYPTA 340


>gi|37780049|gb|AAP32197.1| cysteine protease 10 [Trifolium repens]
          Length = 272

 Score =  418 bits (1075), Expect = e-114,   Method: Compositional matrix adjust.
 Identities = 200/274 (72%), Positives = 226/274 (82%), Gaps = 4/274 (1%)

Query: 74  NNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENAS-VPASI 132
           N+   NK YKLGIN+FAD TNEEF+A RN +K  + S     TT   F+YENAS +P+++
Sbjct: 2   NSNVNNKLYKLGINKFADLTNEEFKASRNKFKGHMCSSIIRTTT---FKYENASAIPSTV 58

Query: 133 DWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGC 192
           DWRKKGAVT VK+QGQCG CWAFSAVAA EGI+ ++T KL SLSEQEL+DCDT G DQGC
Sbjct: 59  DWRKKGAVTPVKNQGQCGSCWAFSAVAATEGIHQLSTGKLVSLSEQELIDCDTKGVDQGC 118

Query: 193 EGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALM 252
           EGGLMDDAF+FII N GL+TE +YPY+  DG+CN  EA+  A  I+GYEDVP+NNE AL 
Sbjct: 119 EGGLMDDAFKFIIQNHGLSTEVQYPYEGVDGTCNTNEASIHAVTITGYEDVPANNELALQ 178

Query: 253 KAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSW 312
           KAVANQP+SVAIDASGSDFQFY+SGVFTG CGTELDHGVTAVGYG  +DGTKYWLVKNSW
Sbjct: 179 KAVANQPISVAIDASGSDFQFYNSGVFTGSCGTELDHGVTAVGYGVGNDGTKYWLVKNSW 238

Query: 313 GTTWGENGYIRMQRDIDAKEGLCGIAMQASYPTA 346
           G  WGE GYIRMQR IDA EGLCGIAMQASYPTA
Sbjct: 239 GADWGEEGYIRMQRGIDAAEGLCGIAMQASYPTA 272


>gi|357160569|ref|XP_003578807.1| PREDICTED: vignain-like [Brachypodium distachyon]
          Length = 339

 Score =  418 bits (1075), Expect = e-114,   Method: Compositional matrix adjust.
 Identities = 206/324 (63%), Positives = 240/324 (74%), Gaps = 12/324 (3%)

Query: 27  SRTLND-ATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLG 85
           +R LND  +M  RHE WM QYGRVY+D AEK  +F++FK N  +I SFN  A N  + LG
Sbjct: 24  ARELNDDLSMVARHESWMLQYGRVYKDAAEKASKFEVFKANAGFIDSFN--AGNHKFWLG 81

Query: 86  INEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENAS---VPASIDWRKKGAVTG 142
           IN+FAD TN+EF+A +         VR+       F YEN S   +PASIDWR KGAVT 
Sbjct: 82  INQFADITNKEFKATKTNKGFISNKVRAP----TGFSYENVSFDALPASIDWRTKGAVTP 137

Query: 143 VKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFE 202
           VKDQGQCGCCWAFSAVAA EGI  ++T KL SLSEQELVDCD  GEDQGCEGGLMDDAF+
Sbjct: 138 VKDQGQCGCCWAFSAVAATEGIVKLSTGKLVSLSEQELVDCDVHGEDQGCEGGLMDDAFK 197

Query: 203 FIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSV 262
           FIISN GL  E+ YPY A DG C  K  + SA  I  YEDVP+NNE ALMKAVANQPVSV
Sbjct: 198 FIISNGGLTQESSYPYDAEDGKC--KSGSKSAGTIKSYEDVPANNEGALMKAVANQPVSV 255

Query: 263 AIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYI 322
           A+D     FQFYS GV TG CGT+LDHG+ A+GYG   DGTKYWL+KNSWGT+WGENG++
Sbjct: 256 AVDGGDMTFQFYSGGVMTGSCGTDLDHGIAAIGYGVTSDGTKYWLMKNSWGTSWGENGFL 315

Query: 323 RMQRDIDAKEGLCGIAMQASYPTA 346
           RM++DI  K+G+CG+AM+ SYPTA
Sbjct: 316 RMEKDIADKKGMCGLAMEPSYPTA 339


>gi|125551397|gb|EAY97106.1| hypothetical protein OsI_19029 [Oryza sativa Indica Group]
          Length = 350

 Score =  418 bits (1075), Expect = e-114,   Method: Compositional matrix adjust.
 Identities = 205/321 (63%), Positives = 237/321 (73%), Gaps = 13/321 (4%)

Query: 32  DATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFAD 91
           DA M  RHE WMAQ+GRVY+D AEK  R ++FK NV +I SFN   +N+ Y LG+N+FAD
Sbjct: 37  DAAMAARHERWMAQHGRVYKDAAEKARRLEVFKANVAFIESFNAGGKNR-YWLGVNQFAD 95

Query: 92  QTNEEFRAP---RNGYKRRLPSVRSSETTDVSFRYENAS---VPASIDWRKKGAVTGVKD 145
            T+EEF+A      G+      VR S      F+YEN S   +PAS+DWR KGAVT +KD
Sbjct: 96  LTSEEFKATMTNSKGFSTPNNGVRVS----TGFKYENVSADALPASVDWRTKGAVTRIKD 151

Query: 146 QGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFII 205
           QGQCGCCWAFSAVAAMEGI  ++T KL SLSEQELVDCD  G DQGCEGG +D AF+FI+
Sbjct: 152 QGQCGCCWAFSAVAAMEGIVKLSTGKLISLSEQELVDCDVDGNDQGCEGGEIDGAFQFIL 211

Query: 206 SNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAID 265
           SN GL  EA YPY A DG C    A   AA I GYEDVP+N+E +LMKAVA QPVSVA+D
Sbjct: 212 SNGGLTAEANYPYTAEDGRCKTTAAADVAASIRGYEDVPANDEPSLMKAVAGQPVSVAVD 271

Query: 266 ASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQ 325
           A  S FQFY  GV  G+CGT LDHGVT +GYG A DGTKYWLVKNSWGTTWGE GY+RM+
Sbjct: 272 A--SKFQFYGGGVMAGECGTSLDHGVTVIGYGAASDGTKYWLVKNSWGTTWGEAGYLRME 329

Query: 326 RDIDAKEGLCGIAMQASYPTA 346
           +DID K G+CG+AMQ SYPTA
Sbjct: 330 KDIDDKRGMCGLAMQPSYPTA 350


>gi|242072394|ref|XP_002446133.1| hypothetical protein SORBIDRAFT_06g002160 [Sorghum bicolor]
 gi|241937316|gb|EES10461.1| hypothetical protein SORBIDRAFT_06g002160 [Sorghum bicolor]
          Length = 338

 Score =  418 bits (1075), Expect = e-114,   Method: Compositional matrix adjust.
 Identities = 200/322 (62%), Positives = 242/322 (75%), Gaps = 11/322 (3%)

Query: 27  SRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGI 86
           +R L+DA M ERHE WM +YGRVY+D AEK  RF+ FK NV ++ SFN   +NK + LG+
Sbjct: 24  ARELSDAAMVERHENWMVEYGRVYKDAAEKARRFEAFKHNVAFVESFNTNKKNK-FWLGV 82

Query: 87  NEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENASV---PASIDWRKKGAVTGV 143
           N+FAD T EEF+A + G+K     + +       F+YEN SV   P ++DWR KGAVT +
Sbjct: 83  NQFADLTTEEFKANK-GFK----PISAEMVPTTGFKYENLSVSALPTAVDWRTKGAVTPI 137

Query: 144 KDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEF 203
           K+QGQCGCCWAFSAVAAMEGI  ++T  L SLSEQELVDCDT   D+GCEGG MD AFEF
Sbjct: 138 KNQGQCGCCWAFSAVAAMEGIVKLSTGNLISLSEQELVDCDTHSMDEGCEGGWMDSAFEF 197

Query: 204 IISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVA 263
           +I N GLATE+ YPYKA DG C  K  + SAA I G+EDVP N+EAALMKAVANQPVSVA
Sbjct: 198 VIKNGGLATESSYPYKAVDGKC--KGGSKSAATIKGHEDVPVNDEAALMKAVANQPVSVA 255

Query: 264 IDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIR 323
           +DAS   F  YS GV TG CGTELDHG+ A+GYG   DGTKYW++KNSWGTTWGE G++R
Sbjct: 256 VDASDRTFMLYSGGVMTGSCGTELDHGIAAIGYGVESDGTKYWILKNSWGTTWGEKGFLR 315

Query: 324 MQRDIDAKEGLCGIAMQASYPT 345
           M++DI  K+G+CG+AM+ SYPT
Sbjct: 316 MEKDISDKQGMCGLAMKPSYPT 337


>gi|102140014|gb|ABF70145.1| cysteine protease, putative [Musa acuminata]
          Length = 373

 Score =  417 bits (1071), Expect = e-114,   Method: Compositional matrix adjust.
 Identities = 207/335 (61%), Positives = 245/335 (73%), Gaps = 7/335 (2%)

Query: 14  AILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASF 73
           A+L LG+ A    +  L DA+M ERH  WMA++GR Y+D AEKE R  IFK NVEYI SF
Sbjct: 10  ALLALGLGACSPAAAELGDASMAERHVEWMARHGRTYKDAAEKEQRLGIFKSNVEYIESF 69

Query: 74  NNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYEN-ASVPASI 132
           N  A  + Y+L  N+FAD T+EEF+A   G+K   PS   ++     FR+ + +SVP S+
Sbjct: 70  N--AGKRKYQLAANQFADLTHEEFKAMHTGFK---PSGTGAKKAGNGFRHGSLSSVPDSV 124

Query: 133 DWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGC 192
           DWR KGAVT VKDQG CG CWAF+ VAA+EGI  I T KL SLSEQ+LVDCD  G+DQGC
Sbjct: 125 DWRSKGAVTPVKDQGLCGSCWAFTVVAAVEGITKIVTGKLISLSEQQLVDCDVHGKDQGC 184

Query: 193 EGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALM 252
           +GG MD AFEFI++N G+ +EA YPY+     CN   A+   A I  +EDVP+N+E AL 
Sbjct: 185 QGGDMDAAFEFIVNNGGITSEANYPYEEVQRLCNAHNASFVVATIESHEDVPTNDEKALR 244

Query: 253 KAVANQPVSVAIDASGS-DFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNS 311
           KAVANQPVSV IDA  S DFQ YS GVF+G+CGT+LDH VT VGYGT  DGTKYWL KNS
Sbjct: 245 KAVANQPVSVGIDAGSSLDFQLYSGGVFSGECGTDLDHAVTVVGYGTTSDGTKYWLAKNS 304

Query: 312 WGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPTA 346
           WG TWGENGYIRM+RD+ AKEGLCGIAMQASYPTA
Sbjct: 305 WGETWGENGYIRMERDVAAKEGLCGIAMQASYPTA 339


>gi|242072398|ref|XP_002446135.1| hypothetical protein SORBIDRAFT_06g002170 [Sorghum bicolor]
 gi|241937318|gb|EES10463.1| hypothetical protein SORBIDRAFT_06g002170 [Sorghum bicolor]
          Length = 338

 Score =  417 bits (1071), Expect = e-114,   Method: Compositional matrix adjust.
 Identities = 205/345 (59%), Positives = 253/345 (73%), Gaps = 12/345 (3%)

Query: 5   LLENKLVLAAIL-VLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIF 63
           ++ +K  L AIL    + +    +R L+DA M ERHE WM +YGRVY+D AEK  RF++F
Sbjct: 1   MVSSKAFLLAILGCASLCSSVLAARELSDAAMVERHENWMVEYGRVYKDAAEKARRFEVF 60

Query: 64  KENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRY 123
           K+NV ++ SFN    NK + LGIN+FAD T EEF+A + G+K     + + +     F+Y
Sbjct: 61  KDNVAFVESFNTNKNNK-FWLGINQFADLTIEEFKANK-GFK----PISAEKVPTTGFKY 114

Query: 124 ENASV---PASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQEL 180
           EN SV   P ++DWR KGAVT +K+QGQCGCCWAFSAVAAMEGI  ++T  L SLSEQEL
Sbjct: 115 ENLSVSALPTAVDWRTKGAVTPIKNQGQCGCCWAFSAVAAMEGIVKLSTGNLISLSEQEL 174

Query: 181 VDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGY 240
           VDCDT   D+GCEGG MD AFEF+I N GLAT + YPYKA DG C  K  + SAA I G+
Sbjct: 175 VDCDTHSMDEGCEGGWMDSAFEFVIKNGGLATVSSYPYKAVDGKC--KGGSKSAATIKGH 232

Query: 241 EDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTAD 300
           EDVP N+EAALMKAVANQPVSVA+DAS   F  YS GV TG CGTELDHG+ A+GYG   
Sbjct: 233 EDVPVNDEAALMKAVANQPVSVAVDASDRTFMLYSGGVMTGSCGTELDHGIAAIGYGVES 292

Query: 301 DGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPT 345
           DGTKYW++KNSWGTTWGE G++RM++DI  K+G+CG+AM+ SYPT
Sbjct: 293 DGTKYWILKNSWGTTWGEKGFLRMEKDISDKQGMCGLAMKPSYPT 337


>gi|357160300|ref|XP_003578721.1| PREDICTED: oryzain beta chain-like [Brachypodium distachyon]
          Length = 349

 Score =  416 bits (1070), Expect = e-114,   Method: Compositional matrix adjust.
 Identities = 195/324 (60%), Positives = 237/324 (73%), Gaps = 5/324 (1%)

Query: 27  SRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGI 86
           +R L DA M ERHE WMAQ+GRVY+D AEK  RF+ F+ NV +I SFN     + + LG+
Sbjct: 25  ARELGDAAMVERHEQWMAQHGRVYKDGAEKARRFEAFRNNVVFIESFNAAGNRRKFWLGV 84

Query: 87  NEFADQTNEEFRAPRN--GYKRRLPSVRSSETTDVSFRYENAS---VPASIDWRKKGAVT 141
           N+F D TN+EFRA +   G+ +R  +  +  +   +FRY N S   +PA++DWR KGAVT
Sbjct: 85  NQFTDLTNDEFRATKTNKGFIKRNAAAVNKASPTGTFRYSNVSADALPAAVDWRAKGAVT 144

Query: 142 GVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAF 201
            +K+QGQCGCCWAFSAVAA EGI  ++T KL  LSEQELVDCD +G D GCEGG MDDAF
Sbjct: 145 PIKNQGQCGCCWAFSAVAATEGIVQLSTGKLVPLSEQELVDCDANGADHGCEGGEMDDAF 204

Query: 202 EFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVS 261
           EFII N GL +E  YPY A DG C  K    S A I GYEDVP+N+EA+LMKAVA QPVS
Sbjct: 205 EFIIKNGGLTSETNYPYTAQDGQCKAKNTINSVATIKGYEDVPANDEASLMKAVAAQPVS 264

Query: 262 VAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGY 321
           VA+D     FQ Y+ GV +G CGT LDHG+ AVGYG ADDGTK+WL+KNSWGTTWGE+GY
Sbjct: 265 VAVDGGDMVFQHYAGGVLSGSCGTSLDHGIVAVGYGAADDGTKFWLMKNSWGTTWGEDGY 324

Query: 322 IRMQRDIDAKEGLCGIAMQASYPT 345
           IRM++D+    G+CG+AMQ SYPT
Sbjct: 325 IRMEKDVADAGGMCGLAMQPSYPT 348


>gi|414588010|tpg|DAA38581.1| TPA: hypothetical protein ZEAMMB73_156486 [Zea mays]
          Length = 347

 Score =  416 bits (1069), Expect = e-114,   Method: Compositional matrix adjust.
 Identities = 211/349 (60%), Positives = 252/349 (72%), Gaps = 17/349 (4%)

Query: 9   KLVLAAILVLGVW---APQSWSRTL---NDATMNERHEMWMAQYGRVYRDNAEKEMRFKI 62
           K +L AIL  GV    A    +R L   ++  M  RHE WM Q+GRVY+D  +K  RF +
Sbjct: 5   KALLLAILGCGVCLCSAAVLAARELGGDDELAMVARHEQWMVQHGRVYKDETDKAHRFLV 64

Query: 63  FKENVEYIASFNNKAR--NKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVS 120
           FK NV++I SFN  A   N+ + LG+N+FAD TN+EFRA +   K   P+V    T    
Sbjct: 65  FKANVKFIESFNAAAAAGNRKFWLGVNQFADLTNDEFRATKTN-KGFNPNVVKVPT---G 120

Query: 121 FRYENASV---PASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSE 177
           FRY+N S+   P ++DWR KGAVT +KDQGQCGCCWAFSAVAA EGI  I+T KLTSLSE
Sbjct: 121 FRYQNLSIDALPQTVDWRTKGAVTPIKDQGQCGCCWAFSAVAATEGIVKISTGKLTSLSE 180

Query: 178 QELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKI 237
           QELVDCD  GEDQGC GG MDDAF+FII N GL TE+ YPY A DG C  K  +  AA I
Sbjct: 181 QELVDCDVHGEDQGCNGGEMDDAFKFIIKNGGLTTESNYPYTAQDGQC--KSGSNGAATI 238

Query: 238 SGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYG 297
            GYEDVP+N+EAALMKAVA+QPVSVA+D     FQFYS GV TG CGT+LDHG+ A+GYG
Sbjct: 239 KGYEDVPANDEAALMKAVASQPVSVAVDGGDMTFQFYSGGVMTGSCGTDLDHGIAAIGYG 298

Query: 298 TADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPTA 346
              DGTKYWL+KNSWGTTWGENG++RM++DI  K+G+CG+AMQ SYPTA
Sbjct: 299 KTSDGTKYWLMKNSWGTTWGENGFLRMEKDIADKKGMCGLAMQPSYPTA 347


>gi|310656789|gb|ADP02218.1| Peptidase_C1 domain-containing protein [Triticum aestivum]
          Length = 341

 Score =  416 bits (1068), Expect = e-113,   Method: Compositional matrix adjust.
 Identities = 194/322 (60%), Positives = 234/322 (72%), Gaps = 9/322 (2%)

Query: 27  SRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGI 86
           +R L D  M ERHE WMA++ RVY+D  EK  RF++FK NV +I SFN  A N+ + LG+
Sbjct: 25  ARELGDTAMVERHEQWMAKFNRVYKDGTEKAQRFEVFKANVAFIESFN--AENRKFWLGV 82

Query: 87  NEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENASV---PASIDWRKKGAVTGV 143
           N+F D TN+EFRA +     ++   R+       F+Y N S+   P ++DWR KG VT +
Sbjct: 83  NQFTDLTNDEFRATKTNKGLKMSGGRAP----TGFKYSNVSIDALPTAVDWRTKGVVTPI 138

Query: 144 KDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEF 203
           KDQGQCGCCWAFSAV A EGI  ++T KL SLSEQELVDCD  G DQGCEGG MDDAF+F
Sbjct: 139 KDQGQCGCCWAFSAVVATEGIVKLSTGKLISLSEQELVDCDVHGVDQGCEGGEMDDAFKF 198

Query: 204 IISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVA 263
           II N GL TEA YPY A DG C    A+ S A I GYEDVP+N+E++LMKAVANQPVSVA
Sbjct: 199 IIKNGGLTTEANYPYTAQDGQCKTSIASNSVATIKGYEDVPANDESSLMKAVANQPVSVA 258

Query: 264 IDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIR 323
           +D     FQ YS GV TG CGT+LDHG+ A+GYG   DGTKYWL+KNSWGTTWGE+GY+R
Sbjct: 259 VDGGDVIFQHYSGGVMTGSCGTDLDHGIAAIGYGMTSDGTKYWLLKNSWGTTWGESGYLR 318

Query: 324 MQRDIDAKEGLCGIAMQASYPT 345
           M++DI  K G+CG+AMQ SYPT
Sbjct: 319 MEKDISDKSGMCGLAMQPSYPT 340


>gi|356543114|ref|XP_003540008.1| PREDICTED: LOW QUALITY PROTEIN: KDEL-tailed cysteine endopeptidase
           CEP1-like [Glycine max]
          Length = 343

 Score =  415 bits (1067), Expect = e-113,   Method: Compositional matrix adjust.
 Identities = 210/324 (64%), Positives = 247/324 (76%), Gaps = 10/324 (3%)

Query: 27  SRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGI 86
           SR L+DA+M ERHE WM +YG+VY+D+AE + RF IF+ NVE+I SFN  A NKPYKL I
Sbjct: 26  SRKLHDASMYERHEQWMEKYGKVYKDSAEMQKRFLIFENNVEFIESFN-AAGNKPYKLSI 84

Query: 87  NEFADQTNEEFRAPRNGYK-RRLPSVRSSETTDVSFRYENAS-VPASIDWRKKGAVTGVK 144
           N  ADQTNEEF A   GYK      +R   TT   F+YEN + +P ++DWR+KG VT +K
Sbjct: 85  NHLADQTNEEFMASHKGYKGSHWQGLRI--TTQTPFKYENVTDIPWAVDWRQKGDVTSIK 142

Query: 145 DQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFI 204
           DQ QCG CWAFSAVAA EGI  ITT  L SLSE+ELVDCD+   D GC+GGLM+  FEFI
Sbjct: 143 DQAQCGNCWAFSAVAATEGIYQITTGNLVSLSEKELVDCDSV--DHGCDGGLMEHGFEFI 200

Query: 205 ISNKGLATEAKYPYKASDGSCN-KKEANPSAAKISGYEDVPSNNEAALMKAVANQ-PVSV 262
           I N G+++EA YPY A +G+C+  KEA+P  A+I+GYE VP N E  L KAVANQ  +SV
Sbjct: 201 IKNGGISSEANYPYTAVNGTCDTNKEASP-VAQITGYETVPVNCEEELQKAVANQLTMSV 259

Query: 263 AIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYI 322
           +IDA GS FQFY SGVFTGQCGT+LDHGVTAVGYG+ D GT+YW+VKNSWGT WGE GYI
Sbjct: 260 SIDAGGSAFQFYPSGVFTGQCGTQLDHGVTAVGYGSTDYGTQYWIVKNSWGTQWGEEGYI 319

Query: 323 RMQRDIDAKEGLCGIAMQASYPTA 346
           RM R IDA+EGLCGIAM ASYPTA
Sbjct: 320 RMLRGIDAQEGLCGIAMDASYPTA 343


>gi|414587996|tpg|DAA38567.1| TPA: hypothetical protein ZEAMMB73_390779 [Zea mays]
          Length = 343

 Score =  415 bits (1067), Expect = e-113,   Method: Compositional matrix adjust.
 Identities = 205/343 (59%), Positives = 252/343 (73%), Gaps = 15/343 (4%)

Query: 10  LVLAAILV---LGVWAPQSWSRTL-NDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKE 65
           L+L AIL        +P   +R L +DA M ERHE WMA YGRVY+D AEK  RF++FK+
Sbjct: 8   LLLLAILTGCACSFPSPVLAARELSDDAAMAERHERWMAVYGRVYKDAAEKARRFEVFKD 67

Query: 66  NVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYEN 125
           N+ ++ SFN   +NK + LG+N+FAD T EEF+A + G+K     + + E     F+YEN
Sbjct: 68  NLAFVESFNADKKNK-FWLGVNQFADLTTEEFKANK-GFK----PISAEEVPTTGFKYEN 121

Query: 126 ASV---PASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVD 182
            SV   P ++DWR KGAVT +K+QGQCGCCWAFSAVAAMEGI  ++T  L SLSEQELVD
Sbjct: 122 LSVSALPTAVDWRTKGAVTPIKNQGQCGCCWAFSAVAAMEGIVKLSTDNLVSLSEQELVD 181

Query: 183 CDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYED 242
           CDT   D+GCEGG MD AFEF+I N GLATE+ YPYKA DG C  K  + SAA I G+ED
Sbjct: 182 CDTHSMDEGCEGGWMDSAFEFVIKNGGLATESSYPYKAVDGKC--KGGSKSAATIKGHED 239

Query: 243 VPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDG 302
           VP NNEAALMKAVA+QPVSVA+DAS   F  YS GV TG CGT+LDHG+ A+GYG   DG
Sbjct: 240 VPPNNEAALMKAVASQPVSVAVDASDRTFMLYSGGVMTGSCGTQLDHGIAAIGYGVESDG 299

Query: 303 TKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPT 345
           TKYW++KNSWGTTWGE  ++RM++DI  K+G+CG+AM+ SYPT
Sbjct: 300 TKYWILKNSWGTTWGEKRFLRMEKDISDKQGMCGLAMKPSYPT 342


>gi|77554625|gb|ABA97421.1| Vignain precursor, putative [Oryza sativa Japonica Group]
 gi|222630746|gb|EEE62878.1| hypothetical protein OsJ_17681 [Oryza sativa Japonica Group]
          Length = 350

 Score =  415 bits (1066), Expect = e-113,   Method: Compositional matrix adjust.
 Identities = 203/320 (63%), Positives = 235/320 (73%), Gaps = 13/320 (4%)

Query: 32  DATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFAD 91
           DA M  RHE WMAQ+GRVY+D AEK  R ++FK NV +I SFN   +N+ Y LG+N+FAD
Sbjct: 37  DAAMAARHERWMAQHGRVYKDAAEKARRLEVFKANVAFIESFNAGGKNR-YWLGVNQFAD 95

Query: 92  QTNEEFRAP---RNGYKRRLPSVRSSETTDVSFRYENAS---VPASIDWRKKGAVTGVKD 145
            T+EEF+A      G+      VR S      F+YEN S   +PAS+DWR KGAVT +KD
Sbjct: 96  LTSEEFKATMTNSKGFSTPNNGVRVS----TGFKYENVSADALPASVDWRTKGAVTRIKD 151

Query: 146 QGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFII 205
           QGQCGCCWAFSAVAAMEG   ++T KL SLSEQELVDCD  G DQGCEGG +D AF+FI+
Sbjct: 152 QGQCGCCWAFSAVAAMEGFVKLSTGKLISLSEQELVDCDVDGNDQGCEGGEIDGAFQFIL 211

Query: 206 SNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAID 265
           SN GL  EA YPY A DG C    A   AA I GYEDVP+N+E +LMKAVA QPVSVA+D
Sbjct: 212 SNGGLTAEANYPYTAEDGRCKTTAAADVAASIRGYEDVPANDEPSLMKAVAGQPVSVAVD 271

Query: 266 ASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQ 325
           A  S FQFY  GV  G+CGT LDHGVT +GYG A DGTKYWLVKNSWGTTWGE GY+RM+
Sbjct: 272 A--SKFQFYGGGVMAGECGTSLDHGVTVIGYGAASDGTKYWLVKNSWGTTWGEAGYLRME 329

Query: 326 RDIDAKEGLCGIAMQASYPT 345
           +DID K G+CG+AMQ SYPT
Sbjct: 330 KDIDDKRGMCGLAMQPSYPT 349


>gi|218202087|gb|EEC84514.1| hypothetical protein OsI_31214 [Oryza sativa Indica Group]
          Length = 348

 Score =  413 bits (1061), Expect = e-113,   Method: Compositional matrix adjust.
 Identities = 203/339 (59%), Positives = 246/339 (72%), Gaps = 12/339 (3%)

Query: 6   LENKLVLAAILVLGVWAPQSWSRTL-NDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFK 64
           +   L+ A +  L + +    +R L +DA M  RHE WMAQYGR+Y+D+AEK  RF++FK
Sbjct: 3   MAKALLFAILGCLCLCSAVLAARELSDDAAMAARHERWMAQYGRMYKDDAEKARRFEVFK 62

Query: 65  ENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYE 124
            N  +I SFN  A N  + LG+N+FAD TN+EFR  +   K  +PS     T    FRYE
Sbjct: 63  ANAAFIESFN--AGNHKFWLGVNQFADLTNDEFRLTKTN-KGFIPSTTRVPT---GFRYE 116

Query: 125 NASV---PASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELV 181
           N ++   PA++DWR KG VT +KDQGQCGCCWAFSAVAAMEGI  ++T KL SLSEQELV
Sbjct: 117 NVNIDALPATMDWRTKGVVTPIKDQGQCGCCWAFSAVAAMEGIVKLSTGKLISLSEQELV 176

Query: 182 DCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYE 241
           DCD  GEDQGCEGGLMDDAF+FII N GL TE+ YPY A+D  C  K  + S A I GYE
Sbjct: 177 DCDVHGEDQGCEGGLMDDAFKFIIKNGGLTTESNYPYAAADDKC--KSVSNSVASIKGYE 234

Query: 242 DVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADD 301
           DVP+NNEAALMKAVANQPVSVA+D     FQFY  GV  G CGT+LDHG+ A+GYG A D
Sbjct: 235 DVPANNEAALMKAVANQPVSVAVDGDDMTFQFYKGGVMIGSCGTDLDHGIVAIGYGKASD 294

Query: 302 GTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQ 340
           GTKYWL+KNSWG TWGENG++RM++DI  K G+CG+AM+
Sbjct: 295 GTKYWLLKNSWGMTWGENGFLRMEKDISDKRGMCGLAME 333


>gi|356515040|ref|XP_003526209.1| PREDICTED: thiol protease SEN102-like [Glycine max]
          Length = 342

 Score =  412 bits (1058), Expect = e-112,   Method: Compositional matrix adjust.
 Identities = 203/342 (59%), Positives = 248/342 (72%), Gaps = 7/342 (2%)

Query: 7   ENKLVLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKEN 66
           + + +LA  L L V   Q   R L+   + ERHE WMA+YG++Y+D AEKE RF+IFK+N
Sbjct: 6   QKQHMLALFLFLAVGISQVMPRKLHQTALRERHENWMAEYGKMYKDAAEKEKRFQIFKDN 65

Query: 67  VEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENA 126
           VE+I SFN  A NKPYKLG+N  AD T EEF+  RNG KR      ++   +  F+YEN 
Sbjct: 66  VEFIESFN-AAGNKPYKLGVNHLADLTLEEFKDSRNGLKRTYEFSTTTFKLN-GFKYENV 123

Query: 127 S-VPASIDWRKKGAVTGVKDQG-QCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCD 184
           + +P +IDWR KGAVT +KDQG QCG CWAFS +AA EGI+ I+T  L SLSEQELVDCD
Sbjct: 124 TDIPEAIDWRVKGAVTPIKDQGDQCGSCWAFSTIAATEGIHQISTGNLVSLSEQELVDCD 183

Query: 185 TSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVP 244
           +   D GCEGG M+D FEFII N G+ +E  YPYK  DG+CN   A    A+I GYE VP
Sbjct: 184 SV--DDGCEGGFMEDGFEFIIKNGGITSETNYPYKGVDGTCNTTIAASPVAQIKGYEIVP 241

Query: 245 SNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTK 304
           S +E AL KAVANQPVSV+I A+ + F FYSSG++ G+CGT+LDHGVTAVGYGT ++GT 
Sbjct: 242 SYSEEALQKAVANQPVSVSIHATNATFMFYSSGIYNGECGTDLDHGVTAVGYGT-ENGTD 300

Query: 305 YWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPTA 346
           YW+VKNSWGT WGE GYIRM R I AK G+CGIA+ +SYPTA
Sbjct: 301 YWIVKNSWGTQWGEKGYIRMHRGIAAKHGICGIALDSSYPTA 342


>gi|224076968|ref|XP_002305072.1| predicted protein [Populus trichocarpa]
 gi|222848036|gb|EEE85583.1| predicted protein [Populus trichocarpa]
          Length = 305

 Score =  411 bits (1057), Expect = e-112,   Method: Compositional matrix adjust.
 Identities = 200/313 (63%), Positives = 244/313 (77%), Gaps = 9/313 (2%)

Query: 35  MNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTN 94
           M +RHE WMAQ+GRVY D  EKE R+ IFKEN+E I +FNN + ++ YKLG+N+FAD TN
Sbjct: 1   MLKRHEEWMAQHGRVYGDMKEKEKRYLIFKENIERIEAFNNGS-DRGYKLGVNKFADLTN 59

Query: 95  EEFRAPRNGYKRRLPSVRSSETTDVSFRYENAS-VPASIDWRKKGAVTGVKDQGQCGCCW 153
           EEFRA  +GYKR+   + SS     SFRYEN S +P S+DWR  GAVT VKDQG CGCCW
Sbjct: 60  EEFRAMYHGYKRQSSKLMSS-----SFRYENLSDIPTSMDWRNDGAVTPVKDQGTCGCCW 114

Query: 154 AFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATE 213
           AFS VAA+EGI  + T  L SLSEQ+LVDC T+G ++GC+GGLMD AF++II N GL +E
Sbjct: 115 AFSTVAAIEGIIKLQTGNLISLSEQQLVDC-TAG-NKGCQGGLMDTAFQYIIRNGGLTSE 172

Query: 214 AKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQF 273
             YPY+  DG+C+ ++A  + A+I+GYEDVP NNE AL++AVA QPVSVA+D  G+DF+F
Sbjct: 173 DNYPYQGVDGTCSSEKAASTEAQITGYEDVPQNNENALLQAVAKQPVSVAVDGGGNDFRF 232

Query: 274 YSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEG 333
           Y SGVF G CGT L+HGVTA+GYGT  DGT YWLVKNSWGT+WGE+GY RMQR I A EG
Sbjct: 233 YKSGVFEGDCGTNLNHGVTAIGYGTDSDGTDYWLVKNSWGTSWGESGYTRMQRGIGASEG 292

Query: 334 LCGIAMQASYPTA 346
           LCG+AM ASYPT+
Sbjct: 293 LCGVAMDASYPTS 305


>gi|297794671|ref|XP_002865220.1| senescence-associated gene 12 [Arabidopsis lyrata subsp. lyrata]
 gi|297311055|gb|EFH41479.1| senescence-associated gene 12 [Arabidopsis lyrata subsp. lyrata]
          Length = 346

 Score =  409 bits (1050), Expect = e-111,   Method: Compositional matrix adjust.
 Identities = 195/347 (56%), Positives = 244/347 (70%), Gaps = 6/347 (1%)

Query: 3   MILLENKLVLAAILVLGVWAPQSWSRTL-NDATMNERHEMWMAQYGRVYRDNAEKEMRFK 61
           M     ++ L   +    +   S SR L N+  M +RH  WM ++GRVY D  EK  R+ 
Sbjct: 1   MAFKHMQIFLFVAIFSSFYFSISLSRPLDNELIMQKRHIEWMTKHGRVYADVKEKSNRYV 60

Query: 62  IFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSF 121
           +FK NVE I   NN    + +KL +N+FAD TN+EFR+   G+K        S+T   SF
Sbjct: 61  VFKSNVERIEHLNNIPAGRTFKLAVNQFADLTNDEFRSMYTGFKGVSSLSSQSQTKTTSF 120

Query: 122 RYENAS---VPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQ 178
           RY+N S   +P S+DWR KGAVT +K+QG CGCCWAFSAVAA+EG   I   KL SLSEQ
Sbjct: 121 RYQNVSSGALPISVDWRTKGAVTPIKNQGSCGCCWAFSAVAAIEGATQIKKGKLISLSEQ 180

Query: 179 ELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKIS 238
           +LVDCDT+  D GCEGGLMD AFE I++  GL TE+ YPYK  D +CN K+ NP A  I+
Sbjct: 181 QLVDCDTN--DFGCEGGLMDTAFEHIMATGGLTTESNYPYKGEDATCNSKKTNPKATSIT 238

Query: 239 GYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGT 298
           GYEDVP N+E ALMKAVA+QPVSV I+  G DFQFYSSGVFTG+C T LDH VTA+GYG 
Sbjct: 239 GYEDVPVNDEQALMKAVAHQPVSVGIEGGGFDFQFYSSGVFTGECTTYLDHAVTAIGYGQ 298

Query: 299 ADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPT 345
           + +G+KYW++KNSWGT WGE+GY+R+Q+DI  K+GLCG+AM+ASYPT
Sbjct: 299 STNGSKYWIIKNSWGTKWGESGYMRIQKDIKDKQGLCGLAMKASYPT 345


>gi|356515044|ref|XP_003526211.1| PREDICTED: LOW QUALITY PROTEIN: thiol protease SEN102-like [Glycine
           max]
          Length = 337

 Score =  408 bits (1049), Expect = e-111,   Method: Compositional matrix adjust.
 Identities = 199/335 (59%), Positives = 246/335 (73%), Gaps = 10/335 (2%)

Query: 12  LAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIA 71
           LA  L+L +   Q  SR L++ ++ E HE W+A+YG+VY+  AEKE  F+IFKENVE+I 
Sbjct: 11  LALFLLLSIEISQVMSRKLHETSLREEHENWIARYGQVYKVAAEKET-FQIFKENVEFIE 69

Query: 72  SFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENAS-VPA 130
           SFN  A NKPYKLG+N FAD T EEF+  R G K+      + E +   F+YEN + +P 
Sbjct: 70  SFN-AAANKPYKLGVNLFADLTLEEFKDFRFGLKK------THEFSITPFKYENVTDIPE 122

Query: 131 SIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQ 190
           ++DWR+KGAVT +KDQGQCG CWAFS VAA EGI+ ITT  L SL EQELV CDT G DQ
Sbjct: 123 ALDWREKGAVTPIKDQGQCGSCWAFSTVAATEGIHQITTGNLVSLXEQELVSCDTKGVDQ 182

Query: 191 GCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAA 250
           GCEGG M+D FEFII N G+ T+A YPYK  +G+CN   A  + A+I GYE VPS +E A
Sbjct: 183 GCEGGYMEDGFEFIIKNGGITTKANYPYKGVNGTCNTTIAASTVAQIKGYETVPSYSEEA 242

Query: 251 LMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKN 310
           L KAVANQPVSV+IDA+   F FY+ G++TG+CGT+LDHGVTAVGYGT ++ T YW+VKN
Sbjct: 243 LQKAVANQPVSVSIDANNGHFMFYAGGIYTGECGTDLDHGVTAVGYGTTNE-TDYWIVKN 301

Query: 311 SWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPT 345
           SWGT W E G+IRMQR I  K GLCG+A+ +SYPT
Sbjct: 302 SWGTGWDEKGFIRMQRGITVKHGLCGVALDSSYPT 336


>gi|356517308|ref|XP_003527330.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like [Glycine
           max]
          Length = 342

 Score =  407 bits (1047), Expect = e-111,   Method: Compositional matrix adjust.
 Identities = 202/337 (59%), Positives = 245/337 (72%), Gaps = 6/337 (1%)

Query: 12  LAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIA 71
           L   LVL VW     SR L++A  +ERHE WMAQYGRVY+D AEKE RF++FK NV +I 
Sbjct: 10  LILFLVLAVWTSHVMSRRLSEACTSERHEKWMAQYGRVYKDAAEKEKRFQVFKNNVHFIE 69

Query: 72  SFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENAS-VPA 130
           SFN  A +KP+ L IN+FAD  +EEF+A     +++   V +S  T+ SFRYE+ + +PA
Sbjct: 70  SFN-AAGDKPFNLSINQFADLNDEEFKALLINVQKKASWVETS--TETSFRYESVTKIPA 126

Query: 131 SIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQ 190
           +IDWRK+GAVT +KDQG+CG CWAFSAVAA EGI+ ITT KL  LSEQELVDC   GE +
Sbjct: 127 TIDWRKRGAVTPIKDQGRCGSCWAFSAVAATEGIHQITTGKLVPLSEQELVDC-VKGESE 185

Query: 191 GCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAA 250
           GC GG +DDAFEFI    G+A+E  YPYK  + +C  K+     A+I GYE VPSNNE A
Sbjct: 186 GCIGGYVDDAFEFIAKKGGIASETHYPYKGVNKTCKVKKETHGVAEIKGYEKVPSNNEKA 245

Query: 251 LMKAVANQPVSVAIDASGSDFQFYSSGVFTGQ-CGTELDHGVTAVGYGTADDGTKYWLVK 309
           L+KAVANQPVSV IDA    F++YSSG+F  + CGT+ +H V  VGYG A DG+KYWLVK
Sbjct: 246 LLKAVANQPVSVYIDAGTHAFKYYSSGIFNARNCGTDPNHAVAVVGYGKALDGSKYWLVK 305

Query: 310 NSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPTA 346
           NSWGT WGE GYIR++RDI AKEGLCGIA    YPTA
Sbjct: 306 NSWGTEWGERGYIRIKRDIRAKEGLCGIAKYPYYPTA 342


>gi|356515046|ref|XP_003526212.1| PREDICTED: thiol protease SEN102-like [Glycine max]
          Length = 342

 Score =  407 bits (1045), Expect = e-111,   Method: Compositional matrix adjust.
 Identities = 202/342 (59%), Positives = 247/342 (72%), Gaps = 7/342 (2%)

Query: 7   ENKLVLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKEN 66
           + + +LA  L L V   Q   R L+   + ERHE WMA+YG++Y+D AEKE RF+IFK+N
Sbjct: 6   QKQHMLALFLFLAVGISQVMPRKLHQTALRERHENWMAEYGKMYKDAAEKEKRFQIFKDN 65

Query: 67  VEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENA 126
           VE+I SFN  A NKPYKLG+N  AD T EEF+  RNG KR      ++   +  F+YEN 
Sbjct: 66  VEFIESFN-AAGNKPYKLGVNHLADLTLEEFKDSRNGLKRTYEFSTTTFKLN-GFKYENV 123

Query: 127 S-VPASIDWRKKGAVTGVKDQG-QCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCD 184
           + +P +IDWR KGAVT +KDQG QCG  WAFS +AA EGI+ I+T  L SLSEQELVDCD
Sbjct: 124 TDIPEAIDWRVKGAVTPIKDQGDQCGRFWAFSTIAATEGIHQISTGNLVSLSEQELVDCD 183

Query: 185 TSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVP 244
           +   D GCEGG M+D FEFII N G+ +E  YPYK  DG+CN   A    A+I GYE VP
Sbjct: 184 SV--DDGCEGGFMEDGFEFIIKNGGITSETNYPYKGVDGTCNTTIAASPVAQIKGYEIVP 241

Query: 245 SNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTK 304
           S +E AL KAVANQPVSV+I A+ + F FYSSG++ G+CGT+LDHGVTAVGYGT ++GT 
Sbjct: 242 SYSEEALKKAVANQPVSVSIHATNATFMFYSSGIYNGECGTDLDHGVTAVGYGT-ENGTD 300

Query: 305 YWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPTA 346
           YW+VKNSWGT WGE GYIRM R I AK G+CGIA+ +SYPTA
Sbjct: 301 YWIVKNSWGTQWGEKGYIRMHRGIAAKHGICGIALDSSYPTA 342


>gi|357167190|ref|XP_003581045.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like
           [Brachypodium distachyon]
          Length = 415

 Score =  405 bits (1042), Expect = e-110,   Method: Compositional matrix adjust.
 Identities = 199/339 (58%), Positives = 240/339 (70%), Gaps = 10/339 (2%)

Query: 12  LAAILVLGVWAPQSWSRTL-NDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYI 70
           L AIL          +R L +D +M  RHE WMA+YGRVY D AEK  R ++FK NV +I
Sbjct: 83  LIAILACTCAVSALAARDLTDDLSMVARHEQWMAKYGRVYNDVAEKAQRLEVFKANVAFI 142

Query: 71  ASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENASV-- 128
              N  A N  + L  N+FAD T +EFRA   GYK  +P+ +   T    F+Y N S+  
Sbjct: 143 ELVN--AGNDKFSLEANQFADMTVDEFRAAHTGYKP-VPANKGRTT---QFKYANVSLDA 196

Query: 129 -PASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSG 187
            PAS+DWR KGAVT +KDQGQCGCCWAFS VA++EGI  ++T KL SLSEQELVDCD  G
Sbjct: 197 LPASMDWRAKGAVTPIKDQGQCGCCWAFSTVASVEGIVKLSTGKLISLSEQELVDCDVDG 256

Query: 188 EDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNN 247
            DQGCEGGLMD+AFEFII N GL TE  YPY  +D SCN  + +   A I GYEDVPSN+
Sbjct: 257 MDQGCEGGLMDNAFEFIIDNGGLTTEGNYPYTGTDDSCNSNKESNDVASIKGYEDVPSND 316

Query: 248 EAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWL 307
           E +L+KAVA QPVS+A+D   + F+FY  GV +G CGTELDHG+ AVGYG   DGTK+WL
Sbjct: 317 ETSLLKAVAAQPVSIAVDGGDNLFRFYKGGVLSGACGTELDHGIAAVGYGITSDGTKFWL 376

Query: 308 VKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPTA 346
           +KNSWGT+WGE G+IRM+RDI  +EGLCG+AMQ SYPTA
Sbjct: 377 MKNSWGTSWGEKGFIRMERDIADEEGLCGLAMQPSYPTA 415


>gi|356543112|ref|XP_003540007.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like [Glycine
           max]
          Length = 345

 Score =  405 bits (1042), Expect = e-110,   Method: Compositional matrix adjust.
 Identities = 197/328 (60%), Positives = 242/328 (73%), Gaps = 4/328 (1%)

Query: 21  WAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNK 80
           W     SR L +A  +ERHE WMAQYG+VY+D AEK+ RF+IFK NV +I SFN  A +K
Sbjct: 20  WTSHIMSRRLFEACTSERHENWMAQYGKVYKDAAEKKKRFQIFKNNVHFIESFNT-AGDK 78

Query: 81  PYKLGINEFADQTNEEFRAP-RNGYKRRLPSVRSSETTDVSFRYENAS-VPASIDWRKKG 138
           P+ L IN+FAD  +EEF+A   NG K+    V ++  T+ SF+Y   + + A++DWRK+G
Sbjct: 79  PFNLSINQFADLHDEEFKALLTNGNKKVRSVVGTATETETSFKYNRVTKLLATMDWRKRG 138

Query: 139 AVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMD 198
           AVT +KDQ +CG CWAFSAVAA+EGI+ ITT KL SLSEQELVDC   GE +GC GG M+
Sbjct: 139 AVTPIKDQRRCGSCWAFSAVAAIEGIHQITTSKLVSLSEQELVDC-VKGESEGCNGGYME 197

Query: 199 DAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQ 258
           DAFEF+    G+A+E+ YPYK  D SC  K+     ++I GYE VPSN+E AL KAVA+Q
Sbjct: 198 DAFEFVAKKGGIASESYYPYKGKDKSCKVKKETHGVSQIKGYEKVPSNSEKALQKAVAHQ 257

Query: 259 PVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGE 318
           PVSV ++A G+ FQFYSSG+FTG+CGT  DH +T VGYG +  GTKYWLVKNSWG  WGE
Sbjct: 258 PVSVYVEAGGNAFQFYSSGIFTGKCGTNTDHAITVVGYGKSRGGTKYWLVKNSWGAGWGE 317

Query: 319 NGYIRMQRDIDAKEGLCGIAMQASYPTA 346
            GYIRM+RDI AKEGLCGIAM A YPTA
Sbjct: 318 KGYIRMKRDIRAKEGLCGIAMNAFYPTA 345


>gi|356515056|ref|XP_003526217.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like [Glycine
           max]
          Length = 342

 Score =  405 bits (1042), Expect = e-110,   Method: Compositional matrix adjust.
 Identities = 202/337 (59%), Positives = 244/337 (72%), Gaps = 6/337 (1%)

Query: 12  LAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIA 71
           L   LVL VW     SR L++A  +ERHE WMAQYGRVY+D AEKE RF++FK NV +I 
Sbjct: 10  LILFLVLSVWTSHVMSRRLSEACTSERHEKWMAQYGRVYKDAAEKEKRFQVFKNNVHFIE 69

Query: 72  SFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENAS-VPA 130
           SFN  A +KP+ L IN+FAD  +EEF+A     +++   V +S  T  SFRYE+ + +PA
Sbjct: 70  SFN-AAGDKPFNLSINQFADLNDEEFKALLINVQKKASWVETS--TQTSFRYESVTKIPA 126

Query: 131 SIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQ 190
           +IDWRK+GAVT +KDQG+CG CWAFSAVAA EGI+ ITT KL  LSEQELVDC   GE +
Sbjct: 127 TIDWRKRGAVTPIKDQGRCGSCWAFSAVAATEGIHQITTGKLVPLSEQELVDC-VKGESE 185

Query: 191 GCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAA 250
           GC GG +DDAFEFI    G+A+E  YPYK  + +C  K+     A+I GYE VPSNNE A
Sbjct: 186 GCIGGYVDDAFEFIAKKGGIASETHYPYKGVNKTCKVKKETHGVAEIKGYEKVPSNNEKA 245

Query: 251 LMKAVANQPVSVAIDASGSDFQFYSSGVFTGQ-CGTELDHGVTAVGYGTADDGTKYWLVK 309
           L+KAVANQPVSV IDA    F++YSSG+F  + CGT+ +H V  VGYG A DG+KYWLVK
Sbjct: 246 LLKAVANQPVSVYIDAGTHAFKYYSSGIFNVRNCGTDPNHAVAVVGYGKALDGSKYWLVK 305

Query: 310 NSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPTA 346
           NSWGT WGE GYIR++RDI AKEGLCGIA    YPTA
Sbjct: 306 NSWGTEWGERGYIRIKRDIRAKEGLCGIAKYPYYPTA 342


>gi|356515080|ref|XP_003526229.1| PREDICTED: vignain-like [Glycine max]
          Length = 284

 Score =  405 bits (1041), Expect = e-110,   Method: Compositional matrix adjust.
 Identities = 197/284 (69%), Positives = 221/284 (77%), Gaps = 5/284 (1%)

Query: 64  KENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRY 123
           KENV YI +FNN A NKPYKLGIN+FAD T+EEF  PRN +   +   R S T   +F+Y
Sbjct: 5   KENVNYIEAFNNAA-NKPYKLGINQFADLTSEEFIVPRNRFNGHM---RFSNTRTTTFKY 60

Query: 124 ENASV-PASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVD 182
           EN +V P SIDWR+KGAVT +K+QG CGCCWAFSA+AA EGI+ I+T KL SLSEQE+VD
Sbjct: 61  ENVTVLPDSIDWRQKGAVTPIKNQGSCGCCWAFSAIAATEGIHKISTGKLVSLSEQEVVD 120

Query: 183 CDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYED 242
           CDT G D GCEGG MD AF+FII N G+ TEA YPYK  DG CN KE    A  I+GYED
Sbjct: 121 CDTKGTDHGCEGGYMDGAFKFIIQNHGINTEASYPYKGVDGKCNIKEEAVHATTITGYED 180

Query: 243 VPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDG 302
           VP NNE AL KAVANQPVSVAIDA G+DFQFY SG+FTG CGTELDHGVTAVGYG  ++G
Sbjct: 181 VPINNEKALQKAVANQPVSVAIDARGADFQFYKSGIFTGSCGTELDHGVTAVGYGENNEG 240

Query: 303 TKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPTA 346
           TKYWLVKNSWGT WGE GY  MQR + A EG+CGIAM ASYPTA
Sbjct: 241 TKYWLVKNSWGTEWGEEGYTMMQRGVKAVEGICGIAMLASYPTA 284


>gi|356515052|ref|XP_003526215.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like [Glycine
           max]
          Length = 339

 Score =  404 bits (1038), Expect = e-110,   Method: Compositional matrix adjust.
 Identities = 197/336 (58%), Positives = 245/336 (72%), Gaps = 7/336 (2%)

Query: 12  LAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIA 71
           L   L+L VW     SR L++   +ERHE WMAQYG++Y D AEKE RF+IFK NV++I 
Sbjct: 10  LILFLILTVWTFHVMSRRLSEVCTSERHEKWMAQYGKLYTDAAEKEKRFQIFKNNVQFIE 69

Query: 72  SFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENAS-VPA 130
           SFN  A +KP+ L IN+FAD  NEEF+A     +++   V ++  T+ SFRYE+ + +P 
Sbjct: 70  SFN-AAGDKPFNLSINQFADLHNEEFKASLINVQKKESGVETA--TETSFRYESITKIPV 126

Query: 131 SIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQ 190
           ++DWRK+GAVT +KDQG CG CWAFS VAA+EGI+ ITT KL SLSEQELVDC   G+ +
Sbjct: 127 TMDWRKRGAVTPIKDQGNCGSCWAFSTVAAIEGIHQITTGKLVSLSEQELVDC-VKGKSE 185

Query: 191 GCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAA 250
           GC  G  ++AFEF+  N GLA+E  YPYKA++ +C  K+     A+I GYE+VPSN+E A
Sbjct: 186 GCNFGYKEEAFEFVAKNGGLASEISYPYKANNKTCMVKKETQGVAQIKGYENVPSNSEKA 245

Query: 251 LMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKN 310
           L+KAVANQPVSV IDA     QFYSSG+FTG+CGT  +H VT +GYG A  G KYWLVKN
Sbjct: 246 LLKAVANQPVSVYIDAGA--LQFYSSGIFTGKCGTAPNHAVTVIGYGKARGGAKYWLVKN 303

Query: 311 SWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPTA 346
           SWGT WGE GYI+M+RDI AKEGLCGIA  ASYPT 
Sbjct: 304 SWGTKWGEKGYIKMKRDIRAKEGLCGIATNASYPTV 339


>gi|147772785|emb|CAN62838.1| hypothetical protein VITISV_003391 [Vitis vinifera]
          Length = 298

 Score =  404 bits (1037), Expect = e-110,   Method: Compositional matrix adjust.
 Identities = 207/337 (61%), Positives = 237/337 (70%), Gaps = 52/337 (15%)

Query: 12  LAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIA 71
           +A + +L  WA Q+ SR+L++A+M ERHE WMA+YGR+Y+D  EKE RFKI         
Sbjct: 12  MALLFILAAWASQATSRSLHEASMYERHEDWMARYGRMYKDANEKEKRFKI--------- 62

Query: 72  SFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENAS-VPA 130
                            F D   +                        +F+YEN + VP+
Sbjct: 63  -----------------FKDNVAQA----------------------TTFKYENVTAVPS 83

Query: 131 SIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQ 190
           +IDWRKKGAVT +KDQ QCG CWAFSAVAA EGI  ITT KL SLSEQELVDCDT GE+Q
Sbjct: 84  TIDWRKKGAVTPIKDQQQCGSCWAFSAVAATEGITQITTGKLISLSEQELVDCDTGGENQ 143

Query: 191 GCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCN-KKEANPSAAKISGYEDVPSNNEA 249
           GC GGL DDAF FI  + GLA+EA YPY+  DG+CN KKEA+P AAKI GYEDVP+NNE 
Sbjct: 144 GCSGGLXDDAFRFIXIH-GLASEATYPYEGDDGTCNSKKEAHP-AAKIKGYEDVPANNEK 201

Query: 250 ALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVK 309
           AL KAVA+QPV+VAIDA G +FQFY+SGVFTGQCGTELDHGV AVGYG  DDG  YWLVK
Sbjct: 202 ALQKAVAHQPVAVAIDAGGFEFQFYTSGVFTGQCGTELDHGVAAVGYGIGDDGMXYWLVK 261

Query: 310 NSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPTA 346
           NSWGT WGE GYIRMQRD+ AKEGLCGIAMQASYPTA
Sbjct: 262 NSWGTGWGEEGYIRMQRDVTAKEGLCGIAMQASYPTA 298


>gi|5823018|gb|AAD53011.1|AF089848_1 senescence-specific cysteine protease [Brassica napus]
          Length = 346

 Score =  403 bits (1036), Expect = e-110,   Method: Compositional matrix adjust.
 Identities = 196/347 (56%), Positives = 241/347 (69%), Gaps = 6/347 (1%)

Query: 3   MILLENKLVLAAILVLGVWAPQSWSRTLND-ATMNERHEMWMAQYGRVYRDNAEKEMRFK 61
           M L   K+ L   LV       + SR L+D   M ++H+ WMA++GR Y D  EK  R+ 
Sbjct: 1   MALEHIKIFLIVSLVSSFCFSTTLSRLLDDELIMQKKHDEWMAEHGRTYADMNEKNNRYV 60

Query: 62  IFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSF 121
           +FK NVE I   NN    + +KL +N+FAD TN+EFR    GYK        S+T   SF
Sbjct: 61  VFKRNVERIERLNNVPAGRTFKLAVNQFADLTNDEFRFMYTGYKGDFVLFSQSQTKSTSF 120

Query: 122 RYENA---SVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQ 178
           RY+N    ++P ++DWRKKGAVT +K+QG CGCCWAFSAVAA+EG   I   KL SLSEQ
Sbjct: 121 RYQNVFFGALPIAVDWRKKGAVTPIKNQGSCGCCWAFSAVAAIEGATQIKKGKLISLSEQ 180

Query: 179 ELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKIS 238
           +LVDCDT+  D GC GGLMD AFE I++  GL TE+ YPYK  D +C  K   PSAA I+
Sbjct: 181 QLVDCDTN--DFGCSGGLMDTAFEHIMATGGLTTESNYPYKGEDANCKIKSTKPSAASIT 238

Query: 239 GYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGT 298
           GYEDVP N+E ALMKAVA+QPVSV I+  G DFQFYSSGVFTG+C T LDH VTAVGY  
Sbjct: 239 GYEDVPVNDENALMKAVAHQPVSVGIEGGGFDFQFYSSGVFTGECTTYLDHAVTAVGYSQ 298

Query: 299 ADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPT 345
           +  G+KYW++KNSWGT WGE GY+R+++DI  KEGLCG+AM+ASYPT
Sbjct: 299 SSAGSKYWIIKNSWGTKWGEGGYMRIKKDIKDKEGLCGLAMKASYPT 345


>gi|356515038|ref|XP_003526208.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like [Glycine
           max]
          Length = 339

 Score =  403 bits (1036), Expect = e-110,   Method: Compositional matrix adjust.
 Identities = 197/336 (58%), Positives = 244/336 (72%), Gaps = 7/336 (2%)

Query: 12  LAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIA 71
           L   L+L VW     SR L++   +ERHE WMAQYG++Y D AEKE RF+IFK NV++I 
Sbjct: 10  LILFLILTVWTFHVMSRRLSEVCTSERHEKWMAQYGKLYTDAAEKEKRFQIFKNNVQFIE 69

Query: 72  SFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENAS-VPA 130
           SFN  A +KP+ L IN+FAD  NEEF+A     +++   V ++  T+ SFRYE+ + +P 
Sbjct: 70  SFN-AAGDKPFNLSINQFADLHNEEFKASLINVQKKESGVETA--TETSFRYESITKIPV 126

Query: 131 SIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQ 190
           ++DWRK+GAVT +KDQG CG CWAFS VAA+EGI+ ITT KL SLSEQELVDC   G+ +
Sbjct: 127 TMDWRKRGAVTPIKDQGNCGSCWAFSIVAAIEGIHQITTGKLVSLSEQELVDC-VKGKSE 185

Query: 191 GCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAA 250
           GC  G  ++AFEF+  N GLA+E  YPYKA++ +C  K+     A+I GYE+VPSN+E A
Sbjct: 186 GCNFGYKEEAFEFVAKNGGLASEISYPYKANNKTCMVKKETQGVAQIKGYENVPSNSEKA 245

Query: 251 LMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKN 310
           L+KAVANQPVSV IDA     QFYSSG+FTG+CGT  +H  T +GYG A  G KYWLVKN
Sbjct: 246 LLKAVANQPVSVYIDAGA--LQFYSSGIFTGKCGTAPNHAATVIGYGKARGGAKYWLVKN 303

Query: 311 SWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPTA 346
           SWGT WGE GYIRM+RDI AKEGLCGIA  ASYPT 
Sbjct: 304 SWGTKWGEKGYIRMKRDIRAKEGLCGIATNASYPTV 339


>gi|60100207|gb|AAX13273.1| putative cysteine protease [Oryza sativa Japonica Group]
          Length = 349

 Score =  403 bits (1035), Expect = e-110,   Method: Compositional matrix adjust.
 Identities = 201/325 (61%), Positives = 233/325 (71%), Gaps = 9/325 (2%)

Query: 28  RTLNDAT-MNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGI 86
           R L DA  M +RHE WMA++GR Y D+AEK  R ++F++NV +I S N  A    + L  
Sbjct: 28  RDLVDAAAMAQRHERWMAKHGRAYADDAEKARRLEVFRDNVAFIESVNAAASQHKFWLEE 87

Query: 87  NEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENAS---VPASIDWRKKGAVTGV 143
           N+FAD TN EFRA R G +   PS         SFRY N S   +PAS+DWR KGAV  V
Sbjct: 88  NQFADLTNAEFRATRTGLR---PSSSRGNRAPTSFRYANVSTGDLPASVDWRGKGAVNPV 144

Query: 144 KDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEF 203
           KDQG CGCCWAFSAVAAMEG   + T KL SLSEQ+LV CD  GEDQGCEGGLMDDAF+F
Sbjct: 145 KDQGDCGCCWAFSAVAAMEGAVKLATGKLVSLSEQQLVSCDVKGEDQGCEGGLMDDAFDF 204

Query: 204 IISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVA 263
           II N GLA E+ YPY ASD  C    A  +AA I GYEDVP+N+EAAL+KAVANQPVSVA
Sbjct: 205 IIKNGGLAAESDYPYTASDDKCATAGAGAAAATIKGYEDVPANDEAALLKAVANQPVSVA 264

Query: 264 IDASGSDFQFYSSGVFTGQ--CGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGY 321
           ID     FQFY  GV +G   C TELDH +TAVGYG A DGTKYWL+KNSWGT+WGE+GY
Sbjct: 265 IDGGDRHFQFYKGGVLSGAAGCATELDHAITAVGYGVASDGTKYWLMKNSWGTSWGEDGY 324

Query: 322 IRMQRDIDAKEGLCGIAMQASYPTA 346
           +RM+R +  KEG+CG+AM ASYPTA
Sbjct: 325 VRMERGVADKEGVCGLAMMASYPTA 349


>gi|38346007|emb|CAD40110.2| OSJNBa0035O13.9 [Oryza sativa Japonica Group]
 gi|125589429|gb|EAZ29779.1| hypothetical protein OsJ_13837 [Oryza sativa Japonica Group]
          Length = 314

 Score =  400 bits (1028), Expect = e-109,   Method: Compositional matrix adjust.
 Identities = 197/317 (62%), Positives = 229/317 (72%), Gaps = 8/317 (2%)

Query: 35  MNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTN 94
           M +RHE WMA++GR Y D+AEK  R ++F++NV +I S N  A    + L  N+FAD TN
Sbjct: 1   MAQRHERWMAKHGRAYADDAEKARRLEVFRDNVAFIESVNAAASQHKFWLEENQFADLTN 60

Query: 95  EEFRAPRNGYKRRLPSVRSSETTDVSFRYENAS---VPASIDWRKKGAVTGVKDQGQCGC 151
            EFRA R G +   PS         SFRY N S   +PAS+DWR KGAV  VKDQG CGC
Sbjct: 61  AEFRATRTGLR---PSSSRGNRAPTSFRYANVSTGDLPASVDWRGKGAVNPVKDQGDCGC 117

Query: 152 CWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLA 211
           CWAFSAVAAMEG   + T KL SLSEQ+LV CD  GEDQGCEGGLMDDAF+FII N GLA
Sbjct: 118 CWAFSAVAAMEGAVKLATGKLVSLSEQQLVSCDVKGEDQGCEGGLMDDAFDFIIKNGGLA 177

Query: 212 TEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDF 271
            E+ YPY ASD  C    A  +AA I GYEDVP+N+EAAL+KAVANQPVSVAID     F
Sbjct: 178 AESDYPYTASDDKCATAGAGAAAATIKGYEDVPANDEAALLKAVANQPVSVAIDGGDRHF 237

Query: 272 QFYSSGVFTGQ--CGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDID 329
           QFY  GV +G   C TELDH +TAVGYG A DGTKYWL+KNSWGT+WGE+GY+RM+R + 
Sbjct: 238 QFYKGGVLSGAAGCATELDHAITAVGYGVASDGTKYWLMKNSWGTSWGEDGYVRMERGVA 297

Query: 330 AKEGLCGIAMQASYPTA 346
            KEG+CG+AM ASYPTA
Sbjct: 298 DKEGVCGLAMMASYPTA 314


>gi|18422605|ref|NP_568651.1| senescence-associated protein 12 [Arabidopsis thaliana]
 gi|13877737|gb|AAK43946.1|AF370131_1 putative senescence-specific cysteine protease SAG12 [Arabidopsis
           thaliana]
 gi|9758936|dbj|BAB09317.1| senescence-specific cysteine protease [Arabidopsis thaliana]
 gi|14532898|gb|AAK64131.1| putative senescence-specific cysteine protease SAG12 [Arabidopsis
           thaliana]
 gi|332007929|gb|AED95312.1| senescence-associated protein 12 [Arabidopsis thaliana]
          Length = 346

 Score =  400 bits (1028), Expect = e-109,   Method: Compositional matrix adjust.
 Identities = 190/342 (55%), Positives = 243/342 (71%), Gaps = 10/342 (2%)

Query: 12  LAAILVLGVWAPQSWSRTL-----NDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKEN 66
           +   L + +++   +S TL     N+  M +RH  WM ++GRVY D  E+  R+ +FK N
Sbjct: 6   MQIFLFVAIFSSFCFSITLSRPLDNELIMQKRHIEWMTKHGRVYADVKEENNRYVVFKNN 65

Query: 67  VEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENA 126
           VE I   N+    + +KL +N+FAD TN+EFR+   G+K        S+T    FRY+N 
Sbjct: 66  VERIEHLNSIPAGRTFKLAVNQFADLTNDEFRSMYTGFKGVSALSSQSQTKMSPFRYQNV 125

Query: 127 S---VPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDC 183
           S   +P S+DWRKKGAVT +K+QG CGCCWAFSAVAA+EG   I   KL SLSEQ+LVDC
Sbjct: 126 SSGALPVSVDWRKKGAVTPIKNQGSCGCCWAFSAVAAIEGATQIKKGKLISLSEQQLVDC 185

Query: 184 DTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDV 243
           DT+  D GCEGGLMD AFE I +  GL TE+ YPYK  D +CN K+ NP A  I+GYEDV
Sbjct: 186 DTN--DFGCEGGLMDTAFEHIKATGGLTTESNYPYKGEDATCNSKKTNPKATSITGYEDV 243

Query: 244 PSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGT 303
           P N+E ALMKAVA+QPVSV I+  G DFQFYSSGVFTG+C T LDH VTA+GYG + +G+
Sbjct: 244 PVNDEQALMKAVAHQPVSVGIEGGGFDFQFYSSGVFTGECTTYLDHAVTAIGYGESTNGS 303

Query: 304 KYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPT 345
           KYW++KNSWGT WGE+GY+R+Q+D+  K+GLCG+AM+ASYPT
Sbjct: 304 KYWIIKNSWGTKWGESGYMRIQKDVKDKQGLCGLAMKASYPT 345


>gi|125547258|gb|EAY93080.1| hypothetical protein OsI_14881 [Oryza sativa Indica Group]
          Length = 314

 Score =  400 bits (1027), Expect = e-109,   Method: Compositional matrix adjust.
 Identities = 197/317 (62%), Positives = 229/317 (72%), Gaps = 8/317 (2%)

Query: 35  MNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTN 94
           M +RHE WMA++GR Y D+AEK  R ++F++NV +I S N  A    + L  N+FAD TN
Sbjct: 1   MAQRHERWMAKHGRAYADDAEKVRRLEVFRDNVAFIESVNAAASQHKFWLEENQFADLTN 60

Query: 95  EEFRAPRNGYKRRLPSVRSSETTDVSFRYENAS---VPASIDWRKKGAVTGVKDQGQCGC 151
            EFRA R G +   PS         SFRY N S   +PAS+DWR KGAV  VKDQG CGC
Sbjct: 61  AEFRATRTGLR---PSSSRGNRAPTSFRYANVSTGDLPASVDWRGKGAVNPVKDQGDCGC 117

Query: 152 CWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLA 211
           CWAFSAVAAMEG   + T KL SLSEQ+LV CD  GEDQGCEGGLMDDAF+FII N GLA
Sbjct: 118 CWAFSAVAAMEGAVKLATGKLVSLSEQQLVSCDVKGEDQGCEGGLMDDAFDFIIKNGGLA 177

Query: 212 TEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDF 271
            E+ YPY ASD  C    A  +AA I GYEDVP+N+EAAL+KAVANQPVSVAID     F
Sbjct: 178 AESDYPYTASDDKCATAGAGAAAATIKGYEDVPANDEAALLKAVANQPVSVAIDGGDRHF 237

Query: 272 QFYSSGVFTGQ--CGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDID 329
           QFY  GV +G   C TELDH +TAVGYG A DGTKYWL+KNSWGT+WGE+GY+RM+R + 
Sbjct: 238 QFYKGGVLSGAAGCATELDHAITAVGYGVASDGTKYWLMKNSWGTSWGEDGYVRMERGVA 297

Query: 330 AKEGLCGIAMQASYPTA 346
            KEG+CG+AM ASYPTA
Sbjct: 298 DKEGVCGLAMMASYPTA 314


>gi|312281697|dbj|BAJ33714.1| unnamed protein product [Thellungiella halophila]
          Length = 347

 Score =  399 bits (1026), Expect = e-109,   Method: Compositional matrix adjust.
 Identities = 191/342 (55%), Positives = 241/342 (70%), Gaps = 7/342 (2%)

Query: 9   KLVLAAILVLGVWAPQSWSRTLND--ATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKEN 66
           ++ L   L+       + SR L+D    M +RH+ WMA++GRVY D  EK  R+ +FK N
Sbjct: 7   QIFLIVSLISSFCLSITLSRPLDDNELIMQKRHDEWMAKHGRVYADMKEKNNRYVVFKRN 66

Query: 67  VEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENA 126
           VE I   NN    + +KL +N+FAD TN+EFR+   GYK        S T   SFRY+N 
Sbjct: 67  VERIERLNNVPAGRTFKLAVNQFADLTNDEFRSMYTGYKGGSVLSSQSGTKTSSFRYQNV 126

Query: 127 S---VPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDC 183
           S   +P S+DWRKKGAVT +K+QG CGCCWAFSAVAA+EG   I   KL SLSEQ+LVDC
Sbjct: 127 SSGALPVSVDWRKKGAVTPIKNQGTCGCCWAFSAVAAIEGATKIKKGKLISLSEQQLVDC 186

Query: 184 DTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDV 243
           DT+  D GC GGLMD AFE I++  GL TE+ YPYK  D +C  K   P+A  I+GYEDV
Sbjct: 187 DTN--DFGCSGGLMDTAFEHIMATGGLTTESNYPYKGKDATCKIKNTKPTATSITGYEDV 244

Query: 244 PSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGT 303
           P N+E ALMKAVA+QPVS+ I+  G DFQFY SGVFTG+C T LDH VTAVGYG + +G+
Sbjct: 245 PVNDEKALMKAVAHQPVSIGIEGGGFDFQFYGSGVFTGECTTYLDHAVTAVGYGQSSNGS 304

Query: 304 KYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPT 345
           KYW++KNSWGT WGE+GY+R+++D+  K+GLCG+AM+ASYPT
Sbjct: 305 KYWIIKNSWGTKWGESGYMRIKKDVKDKKGLCGLAMKASYPT 346


>gi|356545116|ref|XP_003540991.1| PREDICTED: vignain-like [Glycine max]
          Length = 342

 Score =  399 bits (1024), Expect = e-108,   Method: Compositional matrix adjust.
 Identities = 191/341 (56%), Positives = 247/341 (72%), Gaps = 5/341 (1%)

Query: 7   ENKLVLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKEN 66
           + K +L   LVL VW  Q  SR L++A  + +HE WMAQYG+VY+D AEKE RF+IFK N
Sbjct: 6   QKKNILVVFLVLTVWTSQVMSRRLSEAYSSVKHEKWMAQYGKVYKDAAEKEKRFQIFKNN 65

Query: 67  VEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENA 126
           V +I SF+  A +KP+ L IN+FAD    +F+A     +++  +VR++  T+ SF+Y++ 
Sbjct: 66  VHFIESFH-AAGDKPFNLSINQFADL--HKFKALLINGQKKEHNVRTATATEASFKYDSV 122

Query: 127 S-VPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDT 185
           + +P+S+DWRK+GAVT +KDQG C  CWAFS VA +EG++ IT  +L SLSEQELVDC  
Sbjct: 123 TRIPSSLDWRKRGAVTPIKDQGTCRSCWAFSTVATIEGLHQITKGELVSLSEQELVDC-V 181

Query: 186 SGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPS 245
            G+ +GC GG ++DAFEFI    G+A+E  YPYK  + +C  K+      +I GYE VPS
Sbjct: 182 KGDSEGCYGGYVEDAFEFIAKKGGVASETHYPYKGVNKTCKVKKETHGVVQIKGYEQVPS 241

Query: 246 NNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKY 305
           N+E AL+KAVA+QPVS  ++A G  FQFYSSG+FTG+CGT++DH VT VGYG A  G KY
Sbjct: 242 NSEKALLKAVAHQPVSAYVEAGGYAFQFYSSGIFTGKCGTDIDHSVTVVGYGKARGGNKY 301

Query: 306 WLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPTA 346
           WLVKNSWGT WGE GYIRM+RDI AKEGLCGIA  A YPTA
Sbjct: 302 WLVKNSWGTEWGEKGYIRMKRDIRAKEGLCGIATGALYPTA 342


>gi|356517310|ref|XP_003527331.1| PREDICTED: vignain-like [Glycine max]
          Length = 342

 Score =  398 bits (1023), Expect = e-108,   Method: Compositional matrix adjust.
 Identities = 199/337 (59%), Positives = 242/337 (71%), Gaps = 6/337 (1%)

Query: 12  LAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIA 71
           L   LVL VW     SR L++A  +ERHE WMAQYGRVY+D AEKE RF++FK NV +I 
Sbjct: 10  LILFLVLAVWTSHVMSRRLSEACTSERHEKWMAQYGRVYKDAAEKEKRFQVFKNNVHFIE 69

Query: 72  SFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENAS-VPA 130
           SFN  A +KP+ L IN+FAD  +EEF+A     +++   V +S  T+ SFRYE+ + +PA
Sbjct: 70  SFN-AAGDKPFNLSINQFADLNDEEFKALLINVQKKASWVETS--TETSFRYESVTKIPA 126

Query: 131 SIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQ 190
           +ID RK+GAVT +KDQG+CG CWAFSAVAA EGI+ ITT KL  LSEQELVDC   GE +
Sbjct: 127 TIDRRKRGAVTPIKDQGRCGSCWAFSAVAATEGIHQITTGKLVPLSEQELVDC-VKGESE 185

Query: 191 GCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAA 250
           GC GG +DDAFEFI    G+A+E  YPYK  + +C  K+     A+I GYE VPSNNE A
Sbjct: 186 GCIGGYVDDAFEFIAKKGGIASETHYPYKGVNKTCKVKKETHGVAEIKGYEKVPSNNEKA 245

Query: 251 LMKAVANQPVSVAIDASGSDFQFYSSGVFTGQ-CGTELDHGVTAVGYGTADDGTKYWLVK 309
           L+KAVANQPVSV IDA    F++YSSG+F  + CGT+ +H V  VGYG A D +KYWLVK
Sbjct: 246 LLKAVANQPVSVYIDAGTHAFKYYSSGIFNARNCGTDPNHAVAVVGYGKALDDSKYWLVK 305

Query: 310 NSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPTA 346
           NSWGT WGE GYIR++RDI AKEGLCGIA    YP A
Sbjct: 306 NSWGTEWGERGYIRIKRDIRAKEGLCGIAKYPYYPIA 342


>gi|242092702|ref|XP_002436841.1| hypothetical protein SORBIDRAFT_10g009840 [Sorghum bicolor]
 gi|241915064|gb|EER88208.1| hypothetical protein SORBIDRAFT_10g009840 [Sorghum bicolor]
          Length = 328

 Score =  398 bits (1022), Expect = e-108,   Method: Compositional matrix adjust.
 Identities = 201/342 (58%), Positives = 241/342 (70%), Gaps = 24/342 (7%)

Query: 9   KLVLAAILVLGVWAPQSWS-RTLND-ATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKEN 66
           K  + AIL L  +   + + R LND + M  RHE WM QY RVY+D  EK  RF++FK N
Sbjct: 5   KASILAILGLAFFCGAALAARDLNDDSAMVARHEQWMVQYSRVYKDTTEKARRFEVFKAN 64

Query: 67  VEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENA 126
           V++I SFN    N+ + LG+N+FAD TN+EFRA +     +   V+ S      FRYEN 
Sbjct: 65  VKFIESFN-AGGNRKFWLGVNQFADLTNDEFRATKTNKGFKPSPVKVS----TGFRYENV 119

Query: 127 SV---PASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDC 183
           SV   PA+IDWR KGAVT +KDQGQC            EGI  I+T KL SLSEQELVDC
Sbjct: 120 SVDALPATIDWRTKGAVTPIKDQGQC------------EGIVKISTGKLISLSEQELVDC 167

Query: 184 DTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDV 243
           D  GEDQGCEGGLMDDAF+FII N GL TE+ YPY A+DG C  K  + SAA + G+EDV
Sbjct: 168 DVHGEDQGCEGGLMDDAFKFIIKNGGLTTESSYPYTAADGKC--KSGSNSAATVKGFEDV 225

Query: 244 PSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGT 303
           P+N+EAALMKAVANQPVSVA+D     FQFYS GV TG CGT+LDHG+ A+GYG   DGT
Sbjct: 226 PANDEAALMKAVANQPVSVAVDGGDMTFQFYSGGVMTGSCGTDLDHGIAAIGYGQTSDGT 285

Query: 304 KYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPT 345
           KYWL+KNSWGTTWGENGY+RM++DI  K G+CG+AM+ SYPT
Sbjct: 286 KYWLLKNSWGTTWGENGYLRMEKDISDKRGMCGLAMEPSYPT 327


>gi|356517368|ref|XP_003527359.1| PREDICTED: LOW QUALITY PROTEIN: vignain-like [Glycine max]
          Length = 332

 Score =  397 bits (1021), Expect = e-108,   Method: Compositional matrix adjust.
 Identities = 211/341 (61%), Positives = 248/341 (72%), Gaps = 22/341 (6%)

Query: 10  LVLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEY 69
           +  A +L +   A Q   RTL DA+M ERHE  M +YG+VY+D  ++      FKENV Y
Sbjct: 10  IAFAMLLCMAFLAFQVTCRTLQDASMXERHEQRMTRYGKVYKDPPKRX-----FKENVNY 64

Query: 70  IASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENAS-V 128
           I + NN A NKPYK GIN+FA         PRN +K  + S     TT   F++EN +  
Sbjct: 65  IEACNNAA-NKPYKRGINQFA---------PRNRFKGHMCSSIIRITT---FKFENVTAT 111

Query: 129 PASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGE 188
           P+++D R+KGAVT +KDQGQCGCCWAFSAVAA EGI+ ++  KL SLSEQELVDCDT G 
Sbjct: 112 PSTVDCRQKGAVTPIKDQGQCGCCWAFSAVAATEGIHALSAGKLISLSEQELVDCDTKGV 171

Query: 189 DQGCEGGLMDDAFEFIISNKGLATEAKYP-YKASDGSCNKKEANPSAAKI-SGYEDVPSN 246
           D GCEGGLMDDAF+FII N GL   ++ P Y   DG CN  EA  +AA I +GYEDVP+N
Sbjct: 172 DXGCEGGLMDDAFKFIIQNHGLKHXSQLPLYMGVDGKCNANEAAKNAATIITGYEDVPAN 231

Query: 247 NEAA-LMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKY 305
           NE A L KAVAN PVS AIDASGSDFQFY SGVFTG CGTELDHGVTAVGYG +DDGT+Y
Sbjct: 232 NEKAHLQKAVANNPVSEAIDASGSDFQFYKSGVFTGSCGTELDHGVTAVGYGVSDDGTEY 291

Query: 306 WLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPTA 346
           WLVKNSWGT WGE GYIRMQR +D++E LCGIA+QASYP+A
Sbjct: 292 WLVKNSWGTEWGEEGYIRMQRGVDSEEALCGIAVQASYPSA 332


>gi|357167196|ref|XP_003581047.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like
           [Brachypodium distachyon]
          Length = 338

 Score =  397 bits (1021), Expect = e-108,   Method: Compositional matrix adjust.
 Identities = 191/319 (59%), Positives = 235/319 (73%), Gaps = 8/319 (2%)

Query: 31  NDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFA 90
           +D  +  RHE WMA+YGRVY D AEK  R ++FK NV +I S N  A N  + L  N+FA
Sbjct: 25  DDWLIAARHEQWMARYGRVYSDVAEKARRLEVFKANVGFIESVN--AGNHKFWLEANQFA 82

Query: 91  DQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENASV---PASIDWRKKGAVTGVKDQG 147
           D T +EFRA   GYK ++   ++  T    FRY N S+   PAS+DWR  GAVT VKDQG
Sbjct: 83  DITKDEFRAMHKGYKMQVIGSKARAT---GFRYANVSIDDLPASVDWRANGAVTPVKDQG 139

Query: 148 QCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISN 207
           QCGCCWAFS VA+MEGI  ++T KL SLSEQELVDCD   +++GC GGLMD+AFEFI++N
Sbjct: 140 QCGCCWAFSTVASMEGIVKVSTGKLISLSEQELVDCDVGMQNKGCGGGLMDNAFEFIVNN 199

Query: 208 KGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDAS 267
            GL TEA YPY  +DG+CN  + +  AA I GYEDVP+N+EA+L KAVA QPVS+A+D  
Sbjct: 200 GGLDTEADYPYTGADGTCNSNKESNIAASIKGYEDVPANDEASLQKAVAAQPVSIAVDGG 259

Query: 268 GSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRD 327
              F+FY  GV TG CGTELDHGV AVGYG A DGTKYWLVKNSWGT+WGE+G+IR++RD
Sbjct: 260 DDLFRFYKGGVLTGACGTELDHGVAAVGYGVAGDGTKYWLVKNSWGTSWGEDGFIRLERD 319

Query: 328 IDAKEGLCGIAMQASYPTA 346
           +  + G+CG+AM+ SYPTA
Sbjct: 320 VADEAGMCGLAMKPSYPTA 338


>gi|1046373|gb|AAC49135.1| SAG12 protein [Arabidopsis thaliana]
          Length = 346

 Score =  396 bits (1018), Expect = e-108,   Method: Compositional matrix adjust.
 Identities = 189/342 (55%), Positives = 242/342 (70%), Gaps = 10/342 (2%)

Query: 12  LAAILVLGVWAPQSWSRTL-----NDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKEN 66
           +   L + +++   +S TL     N+  M +RH  WM ++GRVY D  E+  R+ +FK N
Sbjct: 6   MQIFLFVAIFSSFCFSITLSRPLDNELIMQKRHIEWMTKHGRVYADVKEENNRYVVFKNN 65

Query: 67  VEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENA 126
           VE I   N+    + +KL +N+FAD TN+EF +   G+K        S+T    FRY+N 
Sbjct: 66  VERIEHLNSIPAGRTFKLAVNQFADLTNDEFCSMYTGFKGVSALSSQSQTKMSPFRYQNV 125

Query: 127 S---VPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDC 183
           S   +P S+DWRKKGAVT +K+QG CGCCWAFSAVAA+EG   I   KL SLSEQ+LVDC
Sbjct: 126 SSGALPVSVDWRKKGAVTPIKNQGSCGCCWAFSAVAAIEGATQIKKGKLISLSEQQLVDC 185

Query: 184 DTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDV 243
           DT+  D GCEGGLMD AFE I +  GL TE+ YPYK  D +CN K+ NP A  I+GYEDV
Sbjct: 186 DTN--DFGCEGGLMDTAFEHIKATGGLTTESDYPYKGEDATCNSKKTNPKATSITGYEDV 243

Query: 244 PSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGT 303
           P N+E ALMKAVA+QPVSV I+  G DFQFYSSGVFTG+C T LDH VTA+GYG + +G+
Sbjct: 244 PVNDEQALMKAVAHQPVSVGIEGGGFDFQFYSSGVFTGECTTYLDHAVTAIGYGESTNGS 303

Query: 304 KYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPT 345
           KYW++KNSWGT WGE+GY+R+Q+D+  K+GLCG+AM+ASYPT
Sbjct: 304 KYWIIKNSWGTKWGESGYMRIQKDVKDKQGLCGLAMKASYPT 345


>gi|414588007|tpg|DAA38578.1| TPA: hypothetical protein ZEAMMB73_159244 [Zea mays]
          Length = 307

 Score =  396 bits (1017), Expect = e-108,   Method: Compositional matrix adjust.
 Identities = 189/314 (60%), Positives = 233/314 (74%), Gaps = 11/314 (3%)

Query: 35  MNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTN 94
           M ERHE WMA+Y RVY+D AEK  RF++FK+N  ++ SFN   +NK + LG+N+FAD T 
Sbjct: 1   MAERHERWMAEYDRVYKDAAEKARRFEVFKDNFAFVESFNADKKNK-FWLGVNQFADLTT 59

Query: 95  EEFRAPRNGYKRRLPSVRSSETTDVSFRYENASV---PASIDWRKKGAVTGVKDQGQCGC 151
           EEF+A + G+K     + + E     F+YEN SV   P ++DWR KGAVT +K+QGQCGC
Sbjct: 60  EEFKANK-GFK----PISAEEVPTTGFKYENLSVSALPTAVDWRTKGAVTPIKNQGQCGC 114

Query: 152 CWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLA 211
           CWAFSA+AAMEGI  ++T  L SLSEQE VDCDT   D+GCEGG MD+AFEF+I N GLA
Sbjct: 115 CWAFSAIAAMEGIVKLSTGNLVSLSEQEPVDCDTHNMDEGCEGGWMDNAFEFVIKNGGLA 174

Query: 212 TEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDF 271
           TE+ YPYK  DG C  K  + SAA I G+EDVP NNEAALMK VA+QPVSVA+DAS   F
Sbjct: 175 TESSYPYKVVDGKC--KGGSKSAATIKGHEDVPPNNEAALMKVVASQPVSVAVDASDRTF 232

Query: 272 QFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAK 331
             YS GV TG CGT+LDHG+ A+GYG   D TKYW++KNSWGTTWGE G++RM++DI  K
Sbjct: 233 MLYSGGVMTGSCGTQLDHGIAAIGYGVESDDTKYWILKNSWGTTWGEKGFLRMEKDISDK 292

Query: 332 EGLCGIAMQASYPT 345
            G+C +AM+ SYPT
Sbjct: 293 RGMCDLAMKPSYPT 306


>gi|357113934|ref|XP_003558756.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like
           [Brachypodium distachyon]
          Length = 346

 Score =  395 bits (1015), Expect = e-107,   Method: Compositional matrix adjust.
 Identities = 194/350 (55%), Positives = 238/350 (68%), Gaps = 17/350 (4%)

Query: 1   MAMILLENKLVLAAILVLGVWAPQSWSRTLNDA--TMNERHEMWMAQYGRVYRDNAEKEM 58
           + ++ +   L L +  VL        +R L DA   M  RHE WMAQ+GRVY+D AEK  
Sbjct: 8   LLLVAIVGCLCLCSTAVLA-------ARELGDADNAMAARHEQWMAQFGRVYKDPAEKAH 60

Query: 59  RFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTD 118
           R ++FK NV +I SFN  A N  + LG N+FAD TN+EFRA +     +   VR + T  
Sbjct: 61  RLEVFKANVAFIESFN--AENHEFWLGANQFADLTNDEFRASKTNKGIKQGGVRDAPT-- 116

Query: 119 VSFRYENASV---PASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSL 175
             F+Y + S+   PAS+DWR KGAVT +K+QGQCG CWAFSAVAA EG+  ++T KL SL
Sbjct: 117 -GFKYSDVSIDALPASVDWRTKGAVTPIKNQGQCGSCWAFSAVAATEGVVKLSTGKLVSL 175

Query: 176 SEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAA 235
           SEQELVDCD  G DQGC GG MDDAF+FII N GL TEA YPY   D  C   E    AA
Sbjct: 176 SEQELVDCDVHGVDQGCMGGWMDDAFKFIIKNGGLTTEANYPYTGEDDKCKSNETVNVAA 235

Query: 236 KISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVG 295
            I GYEDVP+N+E+ALMKAVA+QPVSV +D     FQ Y+ GV TG CG E+DHG+ A+G
Sbjct: 236 TIKGYEDVPANDESALMKAVAHQPVSVVVDGGDMTFQLYAGGVMTGSCGVEMDHGIAAIG 295

Query: 296 YGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPT 345
           YG   +GTKYWL+KNSWGTTWGE G++RM +DI  K G+CG+AM+ SYPT
Sbjct: 296 YGATSNGTKYWLMKNSWGTTWGEKGFLRMAKDIPDKRGMCGLAMKPSYPT 345


>gi|242092700|ref|XP_002436840.1| hypothetical protein SORBIDRAFT_10g009830 [Sorghum bicolor]
 gi|241915063|gb|EER88207.1| hypothetical protein SORBIDRAFT_10g009830 [Sorghum bicolor]
          Length = 328

 Score =  395 bits (1015), Expect = e-107,   Method: Compositional matrix adjust.
 Identities = 203/343 (59%), Positives = 242/343 (70%), Gaps = 28/343 (8%)

Query: 9   KLVLAAILVLGVWAPQSWS-RTLND-ATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKEN 66
           K  + AIL L  +   + + R LND + M  RHE WM QY RVY+D  EK  RF++FK N
Sbjct: 5   KASILAILGLAFFCGAALAARDLNDDSAMVARHEQWMVQYSRVYKDTTEKARRFEVFKAN 64

Query: 67  VEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRN--GYKRRLPSVRSSETTDVSFRYE 124
           V++I SFN    N+ + LG+N+FAD TN+EFRA +   G+K   PS     T    FRYE
Sbjct: 65  VKFIESFN-AGGNRKFWLGVNQFADLTNDEFRATKTNKGFK---PSPVKVPT---GFRYE 117

Query: 125 NASV---PASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELV 181
           N SV   PA+IDWR KGAVT +KDQGQC            EGI  I+T KL SLSEQELV
Sbjct: 118 NVSVDALPATIDWRTKGAVTPIKDQGQC------------EGIVKISTGKLISLSEQELV 165

Query: 182 DCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYE 241
           DCD  GEDQGCEGGLMDDAF+FII N GL TE+ YPY A+DG C  K  + SAA + G+E
Sbjct: 166 DCDVHGEDQGCEGGLMDDAFQFIIKNGGLTTESSYPYTAADGKC--KSGSNSAATVKGFE 223

Query: 242 DVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADD 301
           DVP+N+EAALMKAVANQPVSVA+D     FQFYS GV TG CGT+LDHG+ A+GYG   D
Sbjct: 224 DVPANDEAALMKAVANQPVSVAVDGGDMTFQFYSGGVMTGSCGTDLDHGIAAIGYGQTSD 283

Query: 302 GTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
           GTKYWL+KNSWGTTWGENGY+RM++DI  K G+CG+AM+ SYP
Sbjct: 284 GTKYWLLKNSWGTTWGENGYLRMEKDISDKRGMCGLAMEPSYP 326


>gi|157093728|gb|ABV22590.1| KDEL-tailed cysteine endopeptidase [Solanum lycopersicum]
          Length = 360

 Score =  394 bits (1012), Expect = e-107,   Method: Compositional matrix adjust.
 Identities = 199/337 (59%), Positives = 235/337 (69%), Gaps = 6/337 (1%)

Query: 10  LVLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEY 69
             LA +L LG            +    E +E W + +  V R   EK  RF +FK NV Y
Sbjct: 9   FTLALVLRLGESFDFHEKELETEEKFWELYERWRSHH-TVSRSLDEKHKRFNVFKANVHY 67

Query: 70  IASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYK-RRLPSVRSSETTDVSFRYENA-S 127
           + +FN K  +KPYKL +N+FAD TN EFR    G K +   ++  +   + +F Y N  +
Sbjct: 68  VHNFNKK--DKPYKLKLNKFADMTNHEFRQHYAGSKIKHHRTLLGASRANGTFMYANEDN 125

Query: 128 VPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSG 187
           VP SIDWRKKGAVT VKDQGQCG CWAFS V A+EGIN I T+KL SLSEQELVDCDT+ 
Sbjct: 126 VPPSIDWRKKGAVTPVKDQGQCGSCWAFSTVVAVEGINQIKTKKLVSLSEQELVDCDTT- 184

Query: 188 EDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNN 247
           E+QGC GGLMD AF+FI    G+ TE +YPYKA D  C+ ++ N     I G+EDVP N+
Sbjct: 185 ENQGCNGGLMDPAFDFIKKRGGITTEERYPYKAEDDKCDIQKRNTPVVSIDGHEDVPPND 244

Query: 248 EAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWL 307
           E AL+KAVANQP+SVAIDASGS FQFYS GVFTG+CGTELDHGV  VGYGT  DGTKYW+
Sbjct: 245 EDALLKAVANQPISVAIDASGSQFQFYSEGVFTGECGTELDHGVAIVGYGTTVDGTKYWI 304

Query: 308 VKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
           VKNSWG  WGE GYIRMQR +DA+EGLCGIAMQ SYP
Sbjct: 305 VKNSWGAGWGEKGYIRMQRKVDAEEGLCGIAMQPSYP 341


>gi|242072390|ref|XP_002446131.1| hypothetical protein SORBIDRAFT_06g002140 [Sorghum bicolor]
 gi|241937314|gb|EES10459.1| hypothetical protein SORBIDRAFT_06g002140 [Sorghum bicolor]
          Length = 328

 Score =  392 bits (1008), Expect = e-106,   Method: Compositional matrix adjust.
 Identities = 195/322 (60%), Positives = 235/322 (72%), Gaps = 21/322 (6%)

Query: 27  SRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGI 86
           +R L+DA M ERHE WM +YGRVY+D AEK  RF++FK+NV ++ SFN    NK + LG+
Sbjct: 24  ARELSDAAMVERHENWMVEYGRVYKDAAEKARRFQVFKDNVAFVESFNTNKNNK-FWLGV 82

Query: 87  NEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENASV---PASIDWRKKGAVTGV 143
           N+FAD T EEF+A + G+K   P+     TT   F+YEN SV   P ++DWR KGAVT +
Sbjct: 83  NQFADLTTEEFKANK-GFK---PTAEKVPTT--GFKYENLSVSALPTAVDWRTKGAVTPI 136

Query: 144 KDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEF 203
           K+QGQC         AAMEGI  ++T  L SLSEQELVDCDT   D+GCEGG MD AFEF
Sbjct: 137 KNQGQC---------AAMEGIVKLSTGNLISLSEQELVDCDTHSMDEGCEGGWMDSAFEF 187

Query: 204 IISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVA 263
           +I N GLATE+ YPYKA DG C  K  + SAA I G+EDVP NNEAALMKAVANQPVSVA
Sbjct: 188 VIKNGGLATESNYPYKAVDGKC--KGGSKSAATIKGHEDVPVNNEAALMKAVANQPVSVA 245

Query: 264 IDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIR 323
           +DAS   F  YS GV TG CGTELDHG+ A+GYG   DGTKYW++KNSWGTTWGE G++R
Sbjct: 246 VDASDRTFMLYSGGVMTGSCGTELDHGIAAIGYGMESDGTKYWILKNSWGTTWGEKGFLR 305

Query: 324 MQRDIDAKEGLCGIAMQASYPT 345
           M++DI  K G+CG+AM+ SYPT
Sbjct: 306 MEKDITDKRGMCGLAMKPSYPT 327


>gi|413917937|gb|AFW57869.1| hypothetical protein ZEAMMB73_830006 [Zea mays]
          Length = 443

 Score =  391 bits (1005), Expect = e-106,   Method: Compositional matrix adjust.
 Identities = 190/309 (61%), Positives = 228/309 (73%), Gaps = 8/309 (2%)

Query: 32  DATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFAD 91
           D  M  RHE WMA+Y RVY D AEK  RF++FK N+  I S N  A N  + L  N FAD
Sbjct: 34  DQAMVARHEEWMAKYDRVYSDAAEKARRFEVFKANMALIESVN--AGNHKFWLEANRFAD 91

Query: 92  QTNEEFRAPRNGYKRRLPSVRS---SETTDVSFRYENAS---VPASIDWRKKGAVTGVKD 145
            T++EFRA   GY+ +  +  S   S T    F+Y N S   VPAS+DWR KGAVT +K+
Sbjct: 92  LTDDEFRATWTGYRPKTAAASSKGRSRTATTGFKYANVSLDDVPASVDWRTKGAVTPIKN 151

Query: 146 QGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFII 205
           QG+CGCCWAFSAVA+MEG+  ++T KL SLSEQELVDCD +G DQGCEGG MDDAF+FI+
Sbjct: 152 QGECGCCWAFSAVASMEGVVKLSTGKLVSLSEQELVDCDVNGMDQGCEGGEMDDAFDFIV 211

Query: 206 SNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAID 265
            N GL TE++YPY ASDG+CN  EA+  AA I GYEDVP+N+EA+L KAVANQPVSVA+D
Sbjct: 212 GNGGLTTESRYPYTASDGTCNSNEASGDAASIKGYEDVPANDEASLRKAVANQPVSVAVD 271

Query: 266 ASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQ 325
              S F+FY  GV +G CGTELDHG+ AVGYG A DGTKYW++KNSWGT+WGE GYIRM+
Sbjct: 272 GGDSHFRFYKGGVLSGACGTELDHGIAAVGYGVASDGTKYWVMKNSWGTSWGEAGYIRME 331

Query: 326 RDIDAKEGL 334
           RDI  +E L
Sbjct: 332 RDIADEEVL 340


>gi|38345188|emb|CAE03344.2| OSJNBb0005B05.11 [Oryza sativa Japonica Group]
 gi|125589403|gb|EAZ29753.1| hypothetical protein OsJ_13812 [Oryza sativa Japonica Group]
          Length = 323

 Score =  391 bits (1005), Expect = e-106,   Method: Compositional matrix adjust.
 Identities = 198/347 (57%), Positives = 240/347 (69%), Gaps = 28/347 (8%)

Query: 4   ILLENKLVLAAILVLGVWAPQSWSRTL-NDATMNERHEMWMAQYGRVYRDNAEKEMRFKI 62
           + +   L+ A +  L + +    +R L +DA M  RHE WMAQYGR+Y+D+AEK  RF++
Sbjct: 1   MAMAKALLFAILGCLCLCSAVLAARELSDDAAMAARHERWMAQYGRMYKDDAEKARRFEV 60

Query: 63  FKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFR 122
           FK NV +I SFN  A N  + LG+N+FAD TN+EFR+ +   K  +PS     T    FR
Sbjct: 61  FKANVAFIESFN--AGNHKFWLGVNQFADLTNDEFRSTKTN-KGFIPSTTRVPT---GFR 114

Query: 123 YENASV---PASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQE 179
            EN ++   PA++DWR KG VT +KDQGQCGCCWAFSAVAAME                E
Sbjct: 115 NENVNIDALPATMDWRTKGVVTPIKDQGQCGCCWAFSAVAAME----------------E 158

Query: 180 LVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISG 239
           LVDCD  GEDQGCEGGLMDDAF+FII N GL TE+ YPY A D     K  + S A I G
Sbjct: 159 LVDCDVHGEDQGCEGGLMDDAFKFIIKNGGLTTESNYPYAAVDDKF--KSVSNSVASIKG 216

Query: 240 YEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTA 299
           YEDVP+NNEAALMKAVANQPVSVA+D     FQFY  GV TG CGT+LDHG+ A+GYG A
Sbjct: 217 YEDVPANNEAALMKAVANQPVSVAVDGGDMTFQFYKGGVMTGSCGTDLDHGIVAIGYGKA 276

Query: 300 DDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPTA 346
            DGTKYWL+KNSWG TWGENG++RM++DI  K G+CG+AM+ SYPTA
Sbjct: 277 SDGTKYWLLKNSWGMTWGENGFLRMEKDISDKRGMCGLAMEPSYPTA 323


>gi|359483514|ref|XP_003632971.1| PREDICTED: LOW QUALITY PROTEIN: oryzain beta chain-like [Vitis
           vinifera]
          Length = 340

 Score =  389 bits (998), Expect = e-105,   Method: Compositional matrix adjust.
 Identities = 189/333 (56%), Positives = 243/333 (72%), Gaps = 4/333 (1%)

Query: 15  ILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFN 74
           I  L   A ++ SR L++A+M ERHE WMA+Y R Y+D+AE+E RF +FK+NV++I +F+
Sbjct: 11  IYYLEHRASEATSRPLHEASMYERHEQWMARYSRNYKDDAEEERRFXMFKDNVDFIQTFD 70

Query: 75  NKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENAS-VPASID 133
             A N P KLG+N  AD T+EEFRA  N +K  +P      +   SFR++N + +P+++D
Sbjct: 71  T-AGNMPNKLGVNALADMTHEEFRASGNTFK--IPPNLGLRSETTSFRHQNVTRIPSTMD 127

Query: 134 WRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCE 193
           WRKK  VT +K+Q QCG CWAFSAVAAMEGI  + T K  SLSEQELVDCD  G + GCE
Sbjct: 128 WRKKRTVTHIKNQLQCGGCWAFSAVAAMEGIAKLQTSKSISLSEQELVDCDIFGSNIGCE 187

Query: 194 GGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMK 253
           GG MDDAF+FII N+GL +EA+Y YK  +G CNKK+ +  AA+I+ YE++P  +E AL+K
Sbjct: 188 GGCMDDAFKFIIQNRGLNSEARYLYKGVEGHCNKKKESSRAARINDYENMPEFSEKALLK 247

Query: 254 AVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWG 313
            VA+QP+SVAIDA GS FQFY  G+ T + G +LD+GVT  GYG + DG K+WLVKNSWG
Sbjct: 248 VVAHQPISVAIDAGGSAFQFYEIGIITXESGNDLDYGVTTDGYGRSADGKKHWLVKNSWG 307

Query: 314 TTWGENGYIRMQRDIDAKEGLCGIAMQASYPTA 346
           T WGENGY RM+R + A  GLCG  MQASYPTA
Sbjct: 308 TDWGENGYTRMERGVKATTGLCGFTMQASYPTA 340


>gi|225456820|ref|XP_002278323.1| PREDICTED: vignain [Vitis vinifera]
          Length = 360

 Score =  388 bits (996), Expect = e-105,   Method: Compositional matrix adjust.
 Identities = 193/308 (62%), Positives = 222/308 (72%), Gaps = 6/308 (1%)

Query: 39  HEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFR 98
           +E W + +  V R   EK  RF +FKENV ++  FN K  ++PYKL +N+FAD TN EFR
Sbjct: 38  YERWRSHH-TVSRSLDEKHKRFNVFKENVNFVHEFNKK--DEPYKLKLNKFADMTNHEFR 94

Query: 99  APRNGYK-RRLPSVRSSETTDVSFRYENA-SVPASIDWRKKGAVTGVKDQGQCGCCWAFS 156
           +   G K       R S+    SF YE   SVP S+DWRKKGAVT +KDQGQCG CWAFS
Sbjct: 95  STYAGSKVNHHRMFRGSQHAAGSFMYEKVKSVPPSVDWRKKGAVTPIKDQGQCGSCWAFS 154

Query: 157 AVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKY 216
            V A+EGINHI T KL SLSEQELVDCDTS E+QGC GGLM  AFEFI    G+ TE  Y
Sbjct: 155 TVVAVEGINHIKTNKLVSLSEQELVDCDTS-ENQGCNGGLMGYAFEFIKEKGGITTEQSY 213

Query: 217 PYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSS 276
           PY A DG+C+  + N     I G+E VP NNE AL+KA ANQP+SVAIDA GS FQFYS 
Sbjct: 214 PYTAEDGTCDVSKVNSPVVSIDGHETVPPNNEDALLKAAANQPISVAIDAGGSAFQFYSE 273

Query: 277 GVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCG 336
           GVF G+CGT+LDHGV  VGYGT  DGTKYW+VKNSWGT WGENGYIRM+R I AKEGLCG
Sbjct: 274 GVFAGRCGTDLDHGVAIVGYGTTLDGTKYWIVKNSWGTDWGENGYIRMKRGISAKEGLCG 333

Query: 337 IAMQASYP 344
           IA++ASYP
Sbjct: 334 IAVEASYP 341


>gi|310656790|gb|ADP02219.1| Peptidase_C1 domain-containing protein [Triticum aestivum]
          Length = 419

 Score =  387 bits (995), Expect = e-105,   Method: Compositional matrix adjust.
 Identities = 188/315 (59%), Positives = 225/315 (71%), Gaps = 19/315 (6%)

Query: 27  SRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGI 86
           +R L DA M E+HE WMA++ RVY+D+ EK  RFK FK NV +I SFN    N  + LG+
Sbjct: 25  ARELGDAAMVEKHEQWMAKFNRVYKDSTEKAQRFKAFKANVAFIESFNTG--NHKFWLGV 82

Query: 87  NEFADQTNEEFRAPRN--GYKR---RLPSVRSSETTDVSFRYENAS---VPASIDWRKKG 138
           N+F D TN+EFRA +   G KR   R P+          F+Y N S   +PA++DWR KG
Sbjct: 83  NQFTDLTNDEFRATKTNKGLKRNGARAPT---------RFKYNNVSTDALPAAVDWRTKG 133

Query: 139 AVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMD 198
            VT +KDQGQCGCCWAFSAVAA EGI  ++T KL SLSEQELVDCD  G DQGCEGG MD
Sbjct: 134 VVTPIKDQGQCGCCWAFSAVAATEGIVKLSTGKLVSLSEQELVDCDVHGVDQGCEGGEMD 193

Query: 199 DAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQ 258
           +AF+FII N GL TEA YPY A DG C     + S A I GYEDVP+N+E++LMKAVANQ
Sbjct: 194 NAFKFIIKNGGLTTEANYPYTAQDGQCKTSTTSNSVATIKGYEDVPANDESSLMKAVANQ 253

Query: 259 PVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGE 318
           PVSVA+D     FQ YS GV TG CGT+LDHG+ A+GYG   DGTK+WL+KNSWGTTWGE
Sbjct: 254 PVSVAVDGGDVIFQHYSGGVMTGSCGTDLDHGIVAIGYGMTSDGTKFWLLKNSWGTTWGE 313

Query: 319 NGYIRMQRDIDAKEG 333
           +GY+RM++DI  K G
Sbjct: 314 SGYLRMEKDISDKSG 328


>gi|58531896|gb|AAW78660.1| cysteine protease [Nicotiana tabacum]
          Length = 361

 Score =  386 bits (991), Expect = e-105,   Method: Compositional matrix adjust.
 Identities = 193/310 (62%), Positives = 224/310 (72%), Gaps = 6/310 (1%)

Query: 37  ERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEE 96
           E +E W + +  V R   EK+ RF +FK NV Y+ +FN K  +KPYKL +N+FAD TN E
Sbjct: 36  ELYERWRSHH-TVSRSLDEKDKRFNVFKANVHYVHNFNKK--DKPYKLKLNKFADMTNHE 92

Query: 97  FRAPRNGYK-RRLPSVRSSETTDVSFRYENAS-VPASIDWRKKGAVTGVKDQGQCGCCWA 154
           FR    G K +   S   +   + +F Y N   VP S+DWRKKGAVT VKDQG+CG CWA
Sbjct: 93  FRHHYAGSKIKHHRSFLGASRANGTFMYANVEDVPPSVDWRKKGAVTPVKDQGKCGSCWA 152

Query: 155 FSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEA 214
           FS V A+EGIN I T +L SLSEQELVDCDTS ++QGC GGLMD AFEFI    G+ TE 
Sbjct: 153 FSTVVAVEGINQIKTNELVSLSEQELVDCDTS-QNQGCNGGLMDMAFEFIKKKGGINTEE 211

Query: 215 KYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFY 274
            YPY A  G C+ ++ N     I GYEDVP N+E +L+KAVANQPVSVAI ASGSDFQFY
Sbjct: 212 NYPYMAEGGECDIQKRNSPVVSIDGYEDVPPNDEDSLLKAVANQPVSVAIQASGSDFQFY 271

Query: 275 SSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGL 334
           S GVFTG CGTELDHGV  VGYGT  DGTKYW+V+NSWG  WGE GYIRMQR+IDA+EGL
Sbjct: 272 SEGVFTGDCGTELDHGVAIVGYGTTLDGTKYWIVRNSWGPEWGEKGYIRMQREIDAEEGL 331

Query: 335 CGIAMQASYP 344
           CGIAMQ SYP
Sbjct: 332 CGIAMQPSYP 341


>gi|356517398|ref|XP_003527374.1| PREDICTED: LOW QUALITY PROTEIN: vignain-like [Glycine max]
          Length = 333

 Score =  385 bits (990), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 207/343 (60%), Positives = 241/343 (70%), Gaps = 25/343 (7%)

Query: 10  LVLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEY 69
           +  A +L +   A Q   RTL DA+M ERHE  M +Y +VY+D  E       F  NV Y
Sbjct: 10  IAFAMLLCMAFLAFQVTCRTLQDASMYERHEQRMTRYSKVYKDPPES------FXGNVNY 63

Query: 70  IASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENAS-V 128
           I + NN A +KPYK GIN+F          PRN +K  + S     TT   F++EN +  
Sbjct: 64  IEACNNAA-DKPYKXGINQFP---------PRNRFKGHMCSSIIRITT---FKFENVTAT 110

Query: 129 PASIDWRKKGAVT--GVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLS-EQELVDCDT 185
           P+++D R+KGAVT   VKDQGQCGC WA SAVAA EGI+ +   KL  LS E ELVDCDT
Sbjct: 111 PSTVDCRQKGAVTPYTVKDQGQCGCFWALSAVAATEGIHALXAGKLILLSXEPELVDCDT 170

Query: 186 SGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKI-SGYEDVP 244
            G DQGCEGGL DDAF+FII N GL TEA YPYK  DG CN  EA+ +AA I +GY+DVP
Sbjct: 171 KGVDQGCEGGLTDDAFKFIIQNHGLNTEANYPYKGVDGKCNANEADKNAATIITGYDDVP 230

Query: 245 SNNEAA-LMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGT 303
           +NNE A L KAVAN PVSVAIDASGSDFQFY SGVFTG CGTELDHGVTAVGYG +DDGT
Sbjct: 231 ANNEKAHLQKAVANNPVSVAIDASGSDFQFYKSGVFTGSCGTELDHGVTAVGYGVSDDGT 290

Query: 304 KYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPTA 346
           +YWLVKNS G  WGE GYIRMQR +D++E LCGIA+QASYP+A
Sbjct: 291 EYWLVKNSRGPEWGEEGYIRMQRGVDSEEALCGIAVQASYPSA 333


>gi|242092704|ref|XP_002436842.1| hypothetical protein SORBIDRAFT_10g009850 [Sorghum bicolor]
 gi|241915065|gb|EER88209.1| hypothetical protein SORBIDRAFT_10g009850 [Sorghum bicolor]
          Length = 296

 Score =  385 bits (988), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 192/316 (60%), Positives = 227/316 (71%), Gaps = 26/316 (8%)

Query: 35  MNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTN 94
           M  RHE WM QY RVY+D  EK  RF++FK NV++I SFN    N+ + LG+N+FAD TN
Sbjct: 1   MVARHEQWMVQYSRVYKDATEKAQRFEVFKSNVKFIESFN-AGGNRKFWLGVNQFADLTN 59

Query: 95  EEFRAPRN--GYKRRLPSVRSSETTDVSFRYENASV---PASIDWRKKGAVTGVKDQGQC 149
           +EFRA +   G+K   PS     T    FRYEN SV   PA+IDWR KGAVT +KDQGQC
Sbjct: 60  DEFRATKTNKGFK---PSPVKVPT---GFRYENISVDALPATIDWRTKGAVTPIKDQGQC 113

Query: 150 GCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKG 209
                       EGI  I+T KL SLSEQELVDCD  GEDQGCEGGLMDDAF+FII   G
Sbjct: 114 ------------EGIVKISTGKLISLSEQELVDCDVHGEDQGCEGGLMDDAFKFIIKKGG 161

Query: 210 LATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGS 269
           L TE+ YPY A+DG C  K  + S A + G+EDVP+N+EA+LMKAVANQPVSVA+D    
Sbjct: 162 LTTESSYPYTAADGKC--KSGSNSVATVKGFEDVPANDEASLMKAVANQPVSVAVDGGDM 219

Query: 270 DFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDID 329
            FQFYS GV TG CGT+LDHG+ A+GYG   DGTKYWL+KNSWGTTWGENGY+RM++DI 
Sbjct: 220 TFQFYSGGVMTGSCGTDLDHGIAAIGYGQTSDGTKYWLLKNSWGTTWGENGYLRMEKDIS 279

Query: 330 AKEGLCGIAMQASYPT 345
            K G+CG+AM+ SYPT
Sbjct: 280 DKRGMCGLAMEPSYPT 295


>gi|297602242|ref|NP_001052232.2| Os04g0203500 [Oryza sativa Japonica Group]
 gi|255675217|dbj|BAF14146.2| Os04g0203500 [Oryza sativa Japonica Group]
          Length = 336

 Score =  384 bits (987), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 196/345 (56%), Positives = 241/345 (69%), Gaps = 15/345 (4%)

Query: 6   LENKLVLAAILVLGVWAPQSWSRTL-NDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFK 64
           +   L+ A +  L + +    +R L +DA M  RHE WMAQYGR+Y+D+AEK  RF++FK
Sbjct: 3   MAKALLFAILGCLCLCSAVLAARELSDDAAMAARHERWMAQYGRMYKDDAEKARRFEVFK 62

Query: 65  ENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYE 124
            NV +I SFN  A N  + LG+N+FAD TN+EFR+ +   K  +PS     T    FR E
Sbjct: 63  ANVAFIESFN--AGNHKFWLGVNQFADLTNDEFRSTKTN-KGFIPSTTRVPT---GFRNE 116

Query: 125 NASV---PASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELV 181
           N ++   PA++DWR KG VT +KDQGQCGCCWAFSAVAAMEGI  ++T KL S S  + +
Sbjct: 117 NVNIDALPATMDWRTKGVVTPIKDQGQCGCCWAFSAVAAMEGIVKLSTGKLISHSLNKSL 176

Query: 182 DCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYE 241
               S    GCEGGLMDDAF+FII N GL TE+ YPY A D     K  + S A I GYE
Sbjct: 177 LTVMS---MGCEGGLMDDAFKFIIKNGGLTTESNYPYAAVDDKF--KSVSNSVASIKGYE 231

Query: 242 DVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADD 301
           DVP+NNEAALMKAVANQPVSVA+D     FQFY  GV TG CGT+LDHG+ A+GYG A D
Sbjct: 232 DVPANNEAALMKAVANQPVSVAVDGGDMTFQFYKGGVMTGSCGTDLDHGIVAIGYGKASD 291

Query: 302 GTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPTA 346
           GTKYWL+KNSWG TWGENG++RM++DI  K G+CG+AM+ SYPTA
Sbjct: 292 GTKYWLLKNSWGMTWGENGFLRMEKDISDKRGMCGLAMEPSYPTA 336


>gi|4731374|gb|AAD28477.1|AF133839_1 papain-like cysteine protease [Sandersonia aurantiaca]
          Length = 357

 Score =  384 bits (987), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 193/344 (56%), Positives = 237/344 (68%), Gaps = 16/344 (4%)

Query: 11  VLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGR------VYRDNAEKEMRFKIFK 64
           +   +LVL +    + S  + +  +     +W + Y R      V RD  +K+ RF +FK
Sbjct: 4   LFPVLLVLALAFGSTLSIPIKEKDLESEDSLW-SLYERWRSHHAVSRDLDQKQKRFNVFK 62

Query: 65  ENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYK----RRLPSVRSSETTDVS 120
           ENV++I  FN K ++  +KL +N+F D TN+EFRA   G K    R +   R    +   
Sbjct: 63  ENVKFIHEFN-KNKDVTFKLALNKFGDMTNQEFRAKYAGSKVHHHRTMKGSRHGSGSGAK 121

Query: 121 FRYENASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQEL 180
           F YENA  P SIDWR++GAV  VK+QGQCG CWAFSA+AA+EGIN I T++L  LSEQEL
Sbjct: 122 FMYENAVAPPSIDWRERGAVAAVKNQGQCGSCWAFSAIAAVEGINQIVTKELVPLSEQEL 181

Query: 181 VDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGY 240
           +DCDT  ++QGC GGLMD AFEFI +N G+ TE  YPY+A D +C K   N  A  I GY
Sbjct: 182 IDCDTD-QNQGCSGGLMDYAFEFIKNNGGITTEDVYPYQAEDATCKK---NSPAVVIDGY 237

Query: 241 EDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTAD 300
           EDVP+N+E ALMKAVANQPV+VAI+ASG  FQFYS GVFTG+CGTELDHGV  VGYGT  
Sbjct: 238 EDVPTNDEDALMKAVANQPVAVAIEASGYVFQFYSEGVFTGRCGTELDHGVAVVGYGTTQ 297

Query: 301 DGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
           DGTKYW V+NSWG  WGE+GY+RMQR I A  GLCGIAMQASYP
Sbjct: 298 DGTKYWTVRNSWGADWGESGYVRMQRGIKATHGLCGIAMQASYP 341


>gi|357458909|ref|XP_003599735.1| Cysteine proteinase [Medicago truncatula]
 gi|357474677|ref|XP_003607623.1| Cysteine proteinase [Medicago truncatula]
 gi|355488783|gb|AES69986.1| Cysteine proteinase [Medicago truncatula]
 gi|355508678|gb|AES89820.1| Cysteine proteinase [Medicago truncatula]
          Length = 342

 Score =  384 bits (987), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 188/342 (54%), Positives = 239/342 (69%), Gaps = 7/342 (2%)

Query: 8   NKLVLAAILVLGVWA-PQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKEN 66
           N  ++   L+   W  P   S  + +  ++ +HE WM Q+G+ Y+D AEKE RF+IFK N
Sbjct: 5   NNFIIPMFLIFTTWMLPYVMSSRVLEPYLSNKHEKWMTQFGKSYKDAAEKEKRFQIFKNN 64

Query: 67  VEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRS-SETTDVSFRYEN 125
           VE+I  FN    NKP+ L IN FAD TNEEF+A  NG K+        +ETT  SFRY N
Sbjct: 65  VEFIELFN-AVGNKPFNLSINHFADLTNEEFKASLNGNKKLHDKFDILNETT--SFRYHN 121

Query: 126 A-SVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCD 184
             SVPAS+DWRK+GAVT +K+QG CG CWAFS VA++EGI+ ITT +L SLSEQEL+DC 
Sbjct: 122 VTSVPASMDWRKRGAVTPIKNQGSCGSCWAFSTVASIEGIHQITTGELVSLSEQELIDC- 180

Query: 185 TSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVP 244
             G   GC GG ++DAF+FI    G+A+E  YPYK +D  C  K+ +   A+I GYE VP
Sbjct: 181 VRGNSSGCSGGYLEDAFKFIAKKGGMASETNYPYKETDEKCKFKKESKHVAEIKGYEKVP 240

Query: 245 SNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTK 304
           SN+E  L+KAVANQPVSV +DA    FQFYS G+FTG+CGT+ DH VT VGYG + D T+
Sbjct: 241 SNSENDLLKAVANQPVSVYVDAGDYVFQFYSGGIFTGKCGTDTDHVVTIVGYGVSLDYTE 300

Query: 305 YWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPTA 346
           YWLVKNSWGT WGE GY++++R++D+K+GLCGIA   SYP A
Sbjct: 301 YWLVKNSWGTGWGEKGYMKLKRNVDSKKGLCGIATNPSYPVA 342


>gi|356517384|ref|XP_003527367.1| PREDICTED: LOW QUALITY PROTEIN: vignain-like [Glycine max]
          Length = 332

 Score =  382 bits (982), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 203/341 (59%), Positives = 241/341 (70%), Gaps = 22/341 (6%)

Query: 10  LVLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEY 69
           +  A +L +   A Q   RTL DA+M E H   M +Y +V +D  +      +FKENV Y
Sbjct: 10  IAFAMLLSMAFLAFQVTCRTLQDASMYESHGQRMTRYSKVDKDPPDX-----VFKENVNY 64

Query: 70  IASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENAS-V 128
           I + NN A +KPYK  IN+FA         P+  +K  + S     TT   F++EN +  
Sbjct: 65  IEACNNAA-DKPYKRDINQFA---------PKKRFKGHMCSSIIRITT---FKFENVTAT 111

Query: 129 PASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLS-EQELVDCDTSG 187
           P+++D R+K AVT +KDQGQCGC WA SAVAA EGI+ +   KL  LS EQELVDCDT G
Sbjct: 112 PSTVDCRQKVAVTPIKDQGQCGCFWALSAVAATEGIHALXAGKLILLSSEQELVDCDTKG 171

Query: 188 EDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKI-SGYEDVPSN 246
            DQ C+GGLMDDAF+FII N GL TEA YPYK  DG CN  EA+ +AA I +GYEDVP+N
Sbjct: 172 VDQDCQGGLMDDAFKFIIQNHGLNTEANYPYKGVDGKCNAYEADKNAATIITGYEDVPAN 231

Query: 247 NEAA-LMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKY 305
           NE A L KAVAN PVSVAIDASGSDFQFY SGVFTG CGTELDHGVTAVGYG +DDGT+Y
Sbjct: 232 NEKAHLQKAVANNPVSVAIDASGSDFQFYKSGVFTGSCGTELDHGVTAVGYGVSDDGTEY 291

Query: 306 WLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPTA 346
           WLVKNS GT WGE GYIRMQR +D++E LCGIA+QASYP+A
Sbjct: 292 WLVKNSRGTEWGEEGYIRMQRGVDSEEALCGIAVQASYPSA 332


>gi|21666724|gb|AAM73806.1|AF448505_1 cysteine proteinase [Brassica napus]
 gi|21666726|gb|AAM73807.1|AF448506_1 cysteine proteinase [Brassica napus]
          Length = 343

 Score =  382 bits (981), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 180/321 (56%), Positives = 227/321 (70%), Gaps = 7/321 (2%)

Query: 28  RTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGIN 87
           R L++ TM +RH  WM ++GRVY D  EK  R+ +FK NVE I   N       +KL +N
Sbjct: 26  RPLDEVTMQKRHAAWMTEHGRVYADANEKNNRYVVFKRNVESIERLNEVQYGLTFKLAVN 85

Query: 88  EFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENAS---VPASIDWRKKGAVTGVK 144
           +FAD TNEEFR+   GYK    SV SS T   SFRY++ S   +P S+DWRKKGAVT +K
Sbjct: 86  QFADLTNEEFRSMYTGYKGN--SVLSSRTKPTSFRYQHVSSDALPISVDWRKKGAVTPIK 143

Query: 145 DQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFI 204
           DQG CG CWAFSAVAA+EG+  I   KL SLSEQELVDCDT+  D GC GG M+ AF + 
Sbjct: 144 DQGSCGSCWAFSAVAAIEGVAQIKKGKLISLSEQELVDCDTN--DDGCMGGYMNSAFNYT 201

Query: 205 ISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAI 264
           ++  GL +E+ YPYK++DG+CN  +    A  I G+EDVP+N+E ALMKAVA+ PVS+ I
Sbjct: 202 MTTGGLTSESNYPYKSTDGTCNINKTKQIATSIKGFEDVPANDEKALMKAVAHHPVSIGI 261

Query: 265 DASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRM 324
              G+ FQFYSSGVF+G+C T LDHGV  VGYG + +G+KYW++KNSWG  WGE GY+R+
Sbjct: 262 AGGGTGFQFYSSGVFSGECSTHLDHGVAVVGYGKSSNGSKYWILKNSWGPKWGERGYMRI 321

Query: 325 QRDIDAKEGLCGIAMQASYPT 345
           ++D  AK G CG+AM ASYPT
Sbjct: 322 KKDTKAKHGQCGLAMNASYPT 342


>gi|358248896|ref|NP_001239703.1| uncharacterized protein LOC100799247 precursor [Glycine max]
 gi|255636729|gb|ACU18700.1| unknown [Glycine max]
          Length = 341

 Score =  380 bits (977), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 188/329 (57%), Positives = 238/329 (72%), Gaps = 10/329 (3%)

Query: 21  WAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNK 80
           W  +  SR L     +ERHE WMAQYG+VY+D AEKE RF++FK NV++I SFN  A +K
Sbjct: 20  WISRVMSRGL---ITSERHEKWMAQYGKVYKDAAEKEKRFQVFKNNVQFIESFN-AAGDK 75

Query: 81  PYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENAS-VPASIDWRKKGA 139
           P+ L IN+FAD  +EEF+A  N  +++   V ++  T+ SFRYEN + +P+++DWRK+GA
Sbjct: 76  PFNLSINQFADLHDEEFKALLNNVQKKASRVETA--TETSFRYENVTKIPSTMDWRKRGA 133

Query: 140 VTGVKDQG-QCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMD 198
           VT +KDQG  CG CWAF+ VA +E ++ ITT +L SLSEQELVDC   G+ +GC GG ++
Sbjct: 134 VTPIKDQGYTCGSCWAFATVATVESLHQITTGELVSLSEQELVDC-VRGDSEGCRGGYVE 192

Query: 199 DAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQ 258
           +AFEFI +  G+ +EA YPYK  D SC  K+     A+I GYE VPSN+E AL+KAVANQ
Sbjct: 193 NAFEFIANKGGITSEAYYPYKGKDRSCKVKKETHGVARIIGYESVPSNSEKALLKAVANQ 252

Query: 259 PVSVAIDASGSDFQFYSSGVFTGQ-CGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWG 317
           PVSV IDA    F+FYSSG+F  + CGT LDH V  VGYG   DGTKYWLVKNSW T WG
Sbjct: 253 PVSVYIDAGAIAFKFYSSGIFEARNCGTHLDHAVAVVGYGKLRDGTKYWLVKNSWSTAWG 312

Query: 318 ENGYIRMQRDIDAKEGLCGIAMQASYPTA 346
           E GY+R++RDI AK+GLCGIA  ASYP A
Sbjct: 313 EKGYMRIKRDIRAKKGLCGIASNASYPIA 341


>gi|224133760|ref|XP_002321654.1| predicted protein [Populus trichocarpa]
 gi|222868650|gb|EEF05781.1| predicted protein [Populus trichocarpa]
          Length = 362

 Score =  380 bits (975), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 193/339 (56%), Positives = 233/339 (68%), Gaps = 8/339 (2%)

Query: 10  LVLAAILVLGVWAPQSWSRT--LNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENV 67
           + L+  LVLG+     +      ++ ++ + +E W + +  V     EK  RF +FKENV
Sbjct: 9   VALSLALVLGITESLDFHEKDLESEESLWDLYERWRSHH-TVSTSLDEKHKRFNVFKENV 67

Query: 68  EYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYK-RRLPSVRSSETTDVSFRYENA 126
            ++   N     KPYKL +N+FAD TN EFR+   G K +     R +   + SF Y   
Sbjct: 68  MHVHKTNKMG--KPYKLKLNKFADMTNHEFRSVYAGSKVKHHRMFRGTTRGNGSFMYGKV 125

Query: 127 -SVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDT 185
             VP S+DWRKKGAVT VKDQGQCG CWAFS + A+EGIN+I T +L SLSEQELVDCDT
Sbjct: 126 EKVPTSVDWRKKGAVTAVKDQGQCGSCWAFSTIVAVEGINYIKTNELVSLSEQELVDCDT 185

Query: 186 SGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPS 245
           + E+QGC GGLM+ AFEFI   +G+ TE+ YPYKA DG C+  + N  A  I GYE VP 
Sbjct: 186 T-ENQGCNGGLMEYAFEFIKKKRGITTESTYPYKAEDGHCDAAKENNPAVSIDGYEKVPE 244

Query: 246 NNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKY 305
           N+E AL+KA ANQPVSVAIDA GSDFQFYS GVF G+CGTELDHGV  VGYGT  DGTKY
Sbjct: 245 NDEDALLKAAANQPVSVAIDAGGSDFQFYSEGVFIGECGTELDHGVAVVGYGTTLDGTKY 304

Query: 306 WLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
           W+V+NSWG  WGE GYIRMQR I  KEGLCGIAM+ASYP
Sbjct: 305 WIVRNSWGPEWGEKGYIRMQRGISDKEGLCGIAMEASYP 343


>gi|255540425|ref|XP_002511277.1| cysteine protease, putative [Ricinus communis]
 gi|46395620|sp|O65039.1|CYSEP_RICCO RecName: Full=Vignain; AltName: Full=Cysteine endopeptidase; Flags:
           Precursor
 gi|2944446|gb|AAC62396.1| cysteine endopeptidase precursor [Ricinus communis]
 gi|223550392|gb|EEF51879.1| cysteine protease, putative [Ricinus communis]
          Length = 360

 Score =  380 bits (975), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 190/308 (61%), Positives = 222/308 (72%), Gaps = 6/308 (1%)

Query: 39  HEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFR 98
           +E W + +  V R   EK+ RF +FK N  ++ + N    +KPYKL +N+FAD TN EFR
Sbjct: 38  YERWRSHH-TVSRSLHEKQKRFNVFKHNAMHVHNANK--MDKPYKLKLNKFADMTNHEFR 94

Query: 99  APRNGYK-RRLPSVRSSETTDVSFRYENA-SVPASIDWRKKGAVTGVKDQGQCGCCWAFS 156
              +G K +     R     + +F YE   +VPAS+DWRKKGAVT VKDQGQCG CWAFS
Sbjct: 95  NTYSGSKVKHHRMFRGGPRGNGTFMYEKVDTVPASVDWRKKGAVTSVKDQGQCGSCWAFS 154

Query: 157 AVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKY 216
            + A+EGIN I T KL SLSEQELVDCDT  ++QGC GGLMD AFEFI    G+ TEA Y
Sbjct: 155 TIVAVEGINQIKTNKLVSLSEQELVDCDTD-QNQGCNGGLMDYAFEFIKQRGGITTEANY 213

Query: 217 PYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSS 276
           PY+A DG+C+  + N  A  I G+E+VP N+E AL+KAVANQPVSVAIDA GSDFQFYS 
Sbjct: 214 PYEAYDGTCDVSKENAPAVSIDGHENVPENDENALLKAVANQPVSVAIDAGGSDFQFYSE 273

Query: 277 GVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCG 336
           GVFTG CGTELDHGV  VGYGT  DGTKYW VKNSWG  WGE GYIRM+R I  KEGLCG
Sbjct: 274 GVFTGSCGTELDHGVAIVGYGTTIDGTKYWTVKNSWGPEWGEKGYIRMERGISDKEGLCG 333

Query: 337 IAMQASYP 344
           IAM+ASYP
Sbjct: 334 IAMEASYP 341


>gi|172052260|gb|ACB70409.1| cysteine protease [Nicotiana tabacum]
          Length = 361

 Score =  380 bits (975), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 190/310 (61%), Positives = 224/310 (72%), Gaps = 6/310 (1%)

Query: 37  ERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEE 96
           E +E W + +  V R   EK+ RF +FK NV Y+ +FN K  +KPYKL +N+FAD TN E
Sbjct: 36  ELYERWRSHH-TVSRSLDEKDKRFNVFKANVHYVHNFNKK--DKPYKLKLNKFADMTNHE 92

Query: 97  FRAPRNGYK-RRLPSVRSSETTDVSFRYENA-SVPASIDWRKKGAVTGVKDQGQCGCCWA 154
           FR    G K +   +   +   + +F Y +  SVP ++DWRKKGAVT VKDQG+CG CWA
Sbjct: 93  FRHHYAGSKIKHHRTFLGASRANGTFMYAHEDSVPPTVDWRKKGAVTPVKDQGKCGSCWA 152

Query: 155 FSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEA 214
           FS V A+EGIN I T +L SLSEQELVDCDTS ++QGC GGLMD AFEFI    G+ TE 
Sbjct: 153 FSTVVAVEGINQIKTNELVSLSEQELVDCDTS-QNQGCNGGLMDMAFEFIKKKGGINTEE 211

Query: 215 KYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFY 274
            YPY A  G C+ ++ N     I G+EDVP N+E +L+KAVANQPVSVAI ASGSDFQFY
Sbjct: 212 NYPYMAEGGECDIQKRNSPVVSIDGHEDVPPNDEGSLLKAVANQPVSVAIQASGSDFQFY 271

Query: 275 SSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGL 334
           S GVFTG CGTELDHGV  VGYGT  D TKYW+VKNSWG  WGE GYIRMQR+IDA+EGL
Sbjct: 272 SEGVFTGDCGTELDHGVAIVGYGTTLDRTKYWIVKNSWGPEWGEKGYIRMQREIDAEEGL 331

Query: 335 CGIAMQASYP 344
           CGIAMQ SYP
Sbjct: 332 CGIAMQPSYP 341


>gi|445927|prf||1910332A Cys endopeptidase
          Length = 362

 Score =  379 bits (972), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 187/308 (60%), Positives = 220/308 (71%), Gaps = 6/308 (1%)

Query: 39  HEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFR 98
           +E W + +  V R   EK  RF +FK NV ++   N    +KPYKL +N+FAD TN EFR
Sbjct: 40  YERWRSHH-TVSRSLGEKHKRFNVFKANVMHV--HNTNKMDKPYKLKLNKFADMTNHEFR 96

Query: 99  APRNGYK-RRLPSVRSSETTDVSFRYENA-SVPASIDWRKKGAVTGVKDQGQCGCCWAFS 156
           +   G K       R S+    +F YE   SVPAS+DWRKKGAVT VKDQGQCG CWAFS
Sbjct: 97  STYAGSKVNHHKMFRGSQHGSGTFMYEKVGSVPASVDWRKKGAVTDVKDQGQCGSCWAFS 156

Query: 157 AVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKY 216
            + A+EGIN I T KL SLSEQELVDCD   E+QGC GGLM+ AFEFI    G+ TE+ Y
Sbjct: 157 TIVAVEGINQIKTNKLVSLSEQELVDCDKE-ENQGCNGGLMESAFEFIKQKGGITTESNY 215

Query: 217 PYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSS 276
           PYKA +G+C++ + N  A  I G+E+VP N+E AL+KAVANQPVSVAIDA GSDFQFYS 
Sbjct: 216 PYKAQEGTCDESKVNDLAVSIDGHENVPVNDENALLKAVANQPVSVAIDAGGSDFQFYSE 275

Query: 277 GVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCG 336
           GVFTG C T+L+HGV  VGYGT  DGT YW+V+NSWG  WGE GYIRMQR+I  KEGLCG
Sbjct: 276 GVFTGDCNTDLNHGVAIVGYGTTVDGTNYWIVRNSWGPEWGEQGYIRMQRNISKKEGLCG 335

Query: 337 IAMQASYP 344
           IAM ASYP
Sbjct: 336 IAMMASYP 343


>gi|357452869|ref|XP_003596711.1| Cysteine proteinase [Medicago truncatula]
 gi|355485759|gb|AES66962.1| Cysteine proteinase [Medicago truncatula]
          Length = 344

 Score =  379 bits (972), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 185/340 (54%), Positives = 238/340 (70%), Gaps = 11/340 (3%)

Query: 8   NKLVLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENV 67
           N L L  IL L       W+  +  + + E+HE WM ++G+ Y+D AEKE RF+IFKEN+
Sbjct: 11  NILTLFFILTL-------WTSLVISSRLLEKHEQWMEEHGKFYKDAAEKEQRFQIFKENL 63

Query: 68  EYIASFNNKARNKPYKLGINEFADQTNEEFRAPR-NGYKRRLPSVRSSETTDVS-FRYEN 125
           E+I SFN    N  + L IN+F DQTN+EF+A   NG K+ L  V  +   + S FRYEN
Sbjct: 64  EFIESFNAAGDN-GFNLSINQFGDQTNDEFKANYLNGKKKPLIGVGIAAIEEESVFRYEN 122

Query: 126 AS-VPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCD 184
            + VPA++DWR++GAVT +K Q  CG CWAF+ VAA+EGI+ ITT +L SLSEQELVDC 
Sbjct: 123 VTEVPATMDWRERGAVTPIKHQHLCGSCWAFATVAAIEGIHQITTGRLVSLSEQELVDCV 182

Query: 185 TSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVP 244
            +    GC GG ++DA +FI+   G+ +E  YPY   DG CN ++   + AKI GYE VP
Sbjct: 183 KTNTTDGCNGGYVEDACDFIVKKGGITSETNYPYTRVDGKCNVRKGTYNVAKIKGYEHVP 242

Query: 245 SNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTK 304
           +NNE AL+KAVANQP++V I A+   FQFYSSG+  G+CG +LDH VT VGYGT+DDG K
Sbjct: 243 ANNEKALLKAVANQPIAVYIAATKRAFQFYSSGILKGKCGIDLDHTVTIVGYGTSDDGVK 302

Query: 305 YWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
           YWLVKNSWGT WGE GYI+++RD+ AKEG CGIAM  +YP
Sbjct: 303 YWLVKNSWGTKWGEKGYIKIKRDVHAKEGSCGIAMVPTYP 342


>gi|5823020|gb|AAD53012.1|AF089849_1 senescence-specific cysteine protease [Brassica napus]
          Length = 344

 Score =  378 bits (971), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 181/319 (56%), Positives = 226/319 (70%), Gaps = 7/319 (2%)

Query: 30  LNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEF 89
           L++  M +RH  WM ++GRVY D  EK  R+ +FK NVE I   N+      +KL +N+F
Sbjct: 29  LDEVAMQKRHAEWMTEHGRVYADANEKNNRYAVFKRNVERIERLNDVQSGLTFKLAVNQF 88

Query: 90  ADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENAS---VPASIDWRKKGAVTGVKDQ 146
           AD TNEEFR+   G+K    SV SS T   SFRY+N S   +P S+DWRKKGAVT +KDQ
Sbjct: 89  ADLTNEEFRSMYTGFKGN--SVLSSRTKPTSFRYQNVSSDALPVSVDWRKKGAVTPIKDQ 146

Query: 147 GQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIIS 206
           G CG CWAFSAVAA+EG+  I   KL SLSEQELVDCDT+  D GC GGLMD AF + I+
Sbjct: 147 GLCGSCWAFSAVAAIEGVAQIKKGKLISLSEQELVDCDTN--DGGCMGGLMDTAFNYTIT 204

Query: 207 NKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDA 266
             GL +E+ YPYK+++G+CN  +    A  I G+EDVP+N+E ALMKAVA+ PVS+ I  
Sbjct: 205 IGGLTSESNYPYKSTNGTCNFNKTKQIATSIKGFEDVPANDEKALMKAVAHHPVSIGIAG 264

Query: 267 SGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQR 326
               FQFYSSGVF+G+C T LDHGVTAVGYG + +G KYW++KNSWG  WGE GY+R+++
Sbjct: 265 GDIGFQFYSSGVFSGECTTHLDHGVTAVGYGRSKNGLKYWILKNSWGPKWGERGYMRIKK 324

Query: 327 DIDAKEGLCGIAMQASYPT 345
           DI  K G CG+AM ASYPT
Sbjct: 325 DIKPKHGQCGLAMNASYPT 343


>gi|413953666|gb|AFW86315.1| hypothetical protein ZEAMMB73_539008 [Zea mays]
          Length = 314

 Score =  378 bits (971), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 188/320 (58%), Positives = 221/320 (69%), Gaps = 40/320 (12%)

Query: 31  NDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFA 90
           +D+ M  RHE WMAQY RVY+D +EK  RFK                           FA
Sbjct: 29  DDSAMVARHEQWMAQYSRVYKDASEKARRFK---------------------------FA 61

Query: 91  DQTNEEFRAPRN--GYKRRLPSVRSSETTDVSFRYENAS---VPASIDWRKKGAVTGVKD 145
           D TN EFR+ +   G+K       S+      FRYEN S   +P +IDWR KG VT +KD
Sbjct: 62  DLTNHEFRSVKTNKGFKS------SNMKILTGFRYENVSADALPTTIDWRTKGVVTPIKD 115

Query: 146 QGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFII 205
           QGQCGCC AFSAVAA EGI  I+T KL SL++QELVDCD  GEDQGCEGGLMDDAF+FII
Sbjct: 116 QGQCGCCSAFSAVAATEGIVKISTGKLVSLADQELVDCDVHGEDQGCEGGLMDDAFKFII 175

Query: 206 SNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAID 265
            N GL TE+ YPY A+DG CN    + SAA I GYEDVP+N+EAALMKA+ANQPVSVA+D
Sbjct: 176 KNGGLTTESSYPYTAADGKCN--SGSNSAATIKGYEDVPANDEAALMKAMANQPVSVAVD 233

Query: 266 ASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQ 325
                F+FYS GV TG CGT+LDHG+ A+GYG   DGTKYWL+KNSWGTTWGENGY+RM+
Sbjct: 234 GGDMTFRFYSGGVMTGSCGTDLDHGIAAIGYGKTSDGTKYWLMKNSWGTTWGENGYLRME 293

Query: 326 RDIDAKEGLCGIAMQASYPT 345
           +DI  K G+CG+AM+ SYPT
Sbjct: 294 KDISDKRGMCGLAMEPSYPT 313


>gi|544129|sp|P25803.2|CYSEP_PHAVU RecName: Full=Vignain; AltName: Full=Bean endopeptidase; AltName:
           Full=Cysteine proteinase EP-C1; Flags: Precursor
 gi|20994|emb|CAA44816.1| endopeptidase [Phaseolus vulgaris]
          Length = 362

 Score =  377 bits (967), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 192/342 (56%), Positives = 233/342 (68%), Gaps = 14/342 (4%)

Query: 10  LVLAAILVLGVWAPQSWSRTLNDATMNERHEMW-----MAQYGRVYRDNAEKEMRFKIFK 64
           +VL+  LVLGV    + S   +D  +     +W        +  V R   EK  RF +FK
Sbjct: 9   VVLSFSLVLGV----ANSFDFHDKDLASEESLWDLYERWRSHHTVSRSLGEKHKRFNVFK 64

Query: 65  ENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSV-RSSETTDVSFRY 123
            N+ ++   N    +KPYKL +N+FAD TN EFR+   G K   P + R +   + +F Y
Sbjct: 65  ANLMHV--HNTNKMDKPYKLKLNKFADMTNHEFRSTYAGSKVNHPRMFRGTPHENGAFMY 122

Query: 124 ENA-SVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVD 182
           E   SVP S+DWRKKGAVT VKDQGQCG CWAFS V A+EGIN I T KL +LSEQELVD
Sbjct: 123 EKVVSVPPSVDWRKKGAVTDVKDQGQCGSCWAFSTVVAVEGINQIKTNKLVALSEQELVD 182

Query: 183 CDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYED 242
           CD   E+QGC GGLM+ AFEFI    G+ TE+ YPYKA +G+C+  + N  A  I G+E+
Sbjct: 183 CDKE-ENQGCNGGLMESAFEFIKQKGGITTESNYPYKAQEGTCDASKVNDLAVSIDGHEN 241

Query: 243 VPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDG 302
           VP+N+E AL+KAVANQPVSVAIDA GSDFQFYS GVFTG C T+L+HGV  VGYGT  DG
Sbjct: 242 VPANDEDALLKAVANQPVSVAIDAGGSDFQFYSEGVFTGDCSTDLNHGVAIVGYGTTVDG 301

Query: 303 TKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
           T YW+V+NSWG  WGE+GYIRMQR+I  KEGLCGIAM  SYP
Sbjct: 302 TNYWIVRNSWGPEWGEHGYIRMQRNISKKEGLCGIAMLPSYP 343


>gi|118158|sp|P12412.1|CYSEP_VIGMU RecName: Full=Vignain; AltName: Full=Bean endopeptidase; AltName:
           Full=Cysteine proteinase; AltName:
           Full=Sulfhydryl-endopeptidase; Short=SH-EP; Contains:
           RecName: Full=Vignain-1; Contains: RecName:
           Full=Vignain-2; Flags: Precursor
 gi|22062|emb|CAA33753.1| sulfhydryl-pre-endopeptidase (AA -20 to 342) [Vigna mungo]
 gi|22066|emb|CAA36181.1| sulfhydryl-endopeptidase [Vigna mungo]
          Length = 362

 Score =  377 bits (967), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 186/308 (60%), Positives = 219/308 (71%), Gaps = 6/308 (1%)

Query: 39  HEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFR 98
           +E W + +  V R   EK  RF +FK NV ++   N    +KPYKL +N+FAD TN EFR
Sbjct: 40  YERWRSHH-TVSRSLGEKHKRFNVFKANVMHV--HNTNKMDKPYKLKLNKFADMTNHEFR 96

Query: 99  APRNGYK-RRLPSVRSSETTDVSFRYENA-SVPASIDWRKKGAVTGVKDQGQCGCCWAFS 156
           +   G K       R S+    +F YE   SVPAS+DWRKKGAVT VKDQGQCG CWAFS
Sbjct: 97  STYAGSKVNHHKMFRGSQHGSGTFMYEKVGSVPASVDWRKKGAVTDVKDQGQCGSCWAFS 156

Query: 157 AVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKY 216
            + A+EGIN I T KL SLSEQELVDCD   E+QGC GGLM+ AFEFI    G+ TE+ Y
Sbjct: 157 TIVAVEGINQIKTNKLVSLSEQELVDCDKE-ENQGCNGGLMESAFEFIKQKGGITTESNY 215

Query: 217 PYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSS 276
           PY A +G+C++ + N  A  I G+E+VP N+E AL+KAVANQPVSVAIDA GSDFQFYS 
Sbjct: 216 PYTAQEGTCDESKVNDLAVSIDGHENVPVNDENALLKAVANQPVSVAIDAGGSDFQFYSE 275

Query: 277 GVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCG 336
           GVFTG C T+L+HGV  VGYGT  DGT YW+V+NSWG  WGE GYIRMQR+I  KEGLCG
Sbjct: 276 GVFTGDCNTDLNHGVAIVGYGTTVDGTNYWIVRNSWGPEWGEQGYIRMQRNISKKEGLCG 335

Query: 337 IAMQASYP 344
           IAM ASYP
Sbjct: 336 IAMMASYP 343


>gi|224133764|ref|XP_002321655.1| predicted protein [Populus trichocarpa]
 gi|222868651|gb|EEF05782.1| predicted protein [Populus trichocarpa]
          Length = 360

 Score =  376 bits (966), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 189/308 (61%), Positives = 220/308 (71%), Gaps = 6/308 (1%)

Query: 39  HEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFR 98
           +E W + +  V     EK  RF +F+ NV ++   N    +KPYKL +N+FAD TN EFR
Sbjct: 38  YEKWRSHH-TVSTSLDEKRKRFNVFRANVLHV--HNTNKMDKPYKLKLNKFADMTNHEFR 94

Query: 99  APRNGYKRRLPSV-RSSETTDVSFRYENA-SVPASIDWRKKGAVTGVKDQGQCGCCWAFS 156
                 K +  ++ R +   + SF Y N   VPASIDWRKKGAVT VKDQG+CG CWAFS
Sbjct: 95  TAYASSKVKHHTMFRGAPLGNGSFMYGNIDKVPASIDWRKKGAVTPVKDQGKCGSCWAFS 154

Query: 157 AVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKY 216
            + A+EGIN I T KL SLSEQELVDC+T GE+ GC GGLMD AFEFI   KG+ TEA Y
Sbjct: 155 TIVAVEGINFIKTNKLISLSEQELVDCNT-GENHGCNGGLMDYAFEFITKQKGITTEANY 213

Query: 217 PYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSS 276
           PY+A DG C+  +AN  A  I G+EDV  NNE AL+KAVANQPVSVAIDA GSDFQFYS 
Sbjct: 214 PYRAQDGHCDANKANQPAVSIDGHEDVLHNNENALLKAVANQPVSVAIDAGGSDFQFYSE 273

Query: 277 GVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCG 336
           GVFTG+CG ELDHGV  VGYGT  DGTKYW+V+NSWG  WGE GYIRMQR I  + GLCG
Sbjct: 274 GVFTGECGKELDHGVAIVGYGTTVDGTKYWIVRNSWGPEWGERGYIRMQRGISDRRGLCG 333

Query: 337 IAMQASYP 344
           IAM+ASYP
Sbjct: 334 IAMEASYP 341


>gi|18423124|ref|NP_568722.1| putative cysteine proteinase [Arabidopsis thaliana]
 gi|75309064|sp|Q9FGR9.1|CEP1_ARATH RecName: Full=KDEL-tailed cysteine endopeptidase CEP1; AltName:
           Full=Cysteine proteinase CP56; Short=AtCP56; Flags:
           Precursor
 gi|9759028|dbj|BAB09397.1| cysteine endopeptidase [Arabidopsis thaliana]
 gi|20258850|gb|AAM13907.1| putative cysteine proteinase [Arabidopsis thaliana]
 gi|308097832|gb|ADO14465.1| papain [Arabidopsis thaliana]
 gi|332008536|gb|AED95919.1| putative cysteine proteinase [Arabidopsis thaliana]
          Length = 361

 Score =  376 bits (965), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 190/310 (61%), Positives = 221/310 (71%), Gaps = 6/310 (1%)

Query: 37  ERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEE 96
           E +E W + +  V R   EK  RF +FK NV++I   N K  +K YKL +N+F D T+EE
Sbjct: 36  ELYERWRSHH-TVARSLEEKAKRFNVFKHNVKHIHETNKK--DKSYKLKLNKFGDMTSEE 92

Query: 97  FRAPRNGYKRRLPSVRSSETTDV-SFRYENA-SVPASIDWRKKGAVTGVKDQGQCGCCWA 154
           FR    G   +   +   E     SF Y N  ++P S+DWRK GAVT VK+QGQCG CWA
Sbjct: 93  FRRTYAGSNIKHHRMFQGEKKATKSFMYANVNTLPTSVDWRKNGAVTPVKNQGQCGSCWA 152

Query: 155 FSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEA 214
           FS V A+EGIN I T+KLTSLSEQELVDCDT+ ++QGC GGLMD AFEFI    GL +E 
Sbjct: 153 FSTVVAVEGINQIRTKKLTSLSEQELVDCDTN-QNQGCNGGLMDLAFEFIKEKGGLTSEL 211

Query: 215 KYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFY 274
            YPYKASD +C+  + N     I G+EDVP N+E  LMKAVANQPVSVAIDA GSDFQFY
Sbjct: 212 VYPYKASDETCDTNKENAPVVSIDGHEDVPKNSEDDLMKAVANQPVSVAIDAGGSDFQFY 271

Query: 275 SSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGL 334
           S GVFTG+CGTEL+HGV  VGYGT  DGTKYW+VKNSWG  WGE GYIRMQR I  KEGL
Sbjct: 272 SEGVFTGRCGTELNHGVAVVGYGTTIDGTKYWIVKNSWGEEWGEKGYIRMQRGIRHKEGL 331

Query: 335 CGIAMQASYP 344
           CGIAM+ASYP
Sbjct: 332 CGIAMEASYP 341


>gi|1223922|gb|AAA92063.1| cysteinyl endopeptidase [Vigna radiata]
          Length = 362

 Score =  375 bits (964), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 186/308 (60%), Positives = 219/308 (71%), Gaps = 6/308 (1%)

Query: 39  HEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFR 98
           +E W + +  V R   EK  RF +FKENV ++   N    +KPYKL +N+FAD TN EFR
Sbjct: 40  YERWRSHH-TVSRSLTEKHKRFNVFKENVMHV--HNTNKMDKPYKLKLNKFADMTNHEFR 96

Query: 99  APRNGYK-RRLPSVRSSETTDVSFRYENA-SVPASIDWRKKGAVTGVKDQGQCGCCWAFS 156
           +   G K       R ++  + +F YE   SVPAS+DWRKKGAVT VKDQGQCG CWAFS
Sbjct: 97  STYAGSKVNHHKMFRGTQHGNGTFMYEKVGSVPASVDWRKKGAVTDVKDQGQCGSCWAFS 156

Query: 157 AVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKY 216
            V A+EGIN I T KL SLSEQELVDCD   E+QGC GGLM+ AFEFI    G+ TE+ Y
Sbjct: 157 TVVAVEGINQIKTDKLVSLSEQELVDCDKE-ENQGCNGGLMESAFEFIKQKGGITTESNY 215

Query: 217 PYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSS 276
           PY A +G+C+  + N  A  I G+E+VP N+E AL+KAVANQPVSVAIDA GSDFQFYS 
Sbjct: 216 PYTAQEGTCDASKVNDLAVSIDGHENVPVNDENALLKAVANQPVSVAIDAGGSDFQFYSE 275

Query: 277 GVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCG 336
           GV TG C T+L+HGV  VGYGT  DGT YW+V+NSWG  WGE GYIRMQR+I  KEGLCG
Sbjct: 276 GVLTGDCNTDLNHGVAIVGYGTTVDGTNYWIVRNSWGPEWGEQGYIRMQRNISKKEGLCG 335

Query: 337 IAMQASYP 344
           IAM ASYP
Sbjct: 336 IAMMASYP 343


>gi|40806500|gb|AAR92155.1| putative cysteine protease 2 [Iris x hollandica]
          Length = 359

 Score =  375 bits (963), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 186/308 (60%), Positives = 222/308 (72%), Gaps = 7/308 (2%)

Query: 39  HEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFR 98
           +E W + +  V RD +EK  RF +FKEN ++I  FN K  + PYKLG+N+FAD TN+EFR
Sbjct: 40  YERWRSHH-TVSRDLSEKNKRFNVFKENAKFIHEFNKK--DAPYKLGLNKFADMTNQEFR 96

Query: 99  APRNGYK-RRLPSVRSSETTDVSFRYENA-SVPASIDWRKKGAVTGVKDQGQCGCCWAFS 156
           +   G K     + R +     SF YEN  S+PAS+DWR +GAV  VKDQGQCG CWAFS
Sbjct: 97  STYAGSKIHHHRTQRGTPRATGSFMYENVHSIPASVDWRTQGAVAPVKDQGQCGSCWAFS 156

Query: 157 AVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKY 216
            +A++EGIN I T +L  LS Q+LVDCDT  +++GC GGLMD AFEFI SN G+ +E+ Y
Sbjct: 157 TIASVEGINKIKTNQLVPLSGQQLVDCDTD-QNEGCNGGLMDYAFEFIKSNGGITSESAY 215

Query: 217 PYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSS 276
           PY A  GSC  + + P    I GYEDVP+NNEAALMKAVANQ VSVAI+ASG  FQFYS 
Sbjct: 216 PYTAEQGSCASESSAP-VVTIDGYEDVPANNEAALMKAVANQVVSVAIEASGMAFQFYSE 274

Query: 277 GVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCG 336
           GVFTG CG ELDHGV  VGYG   DGTKYW+V+NSWG  WGE GYIRMQR I A+ GLCG
Sbjct: 275 GVFTGSCGNELDHGVAVVGYGATRDGTKYWIVRNSWGAEWGEKGYIRMQRGIRARHGLCG 334

Query: 337 IAMQASYP 344
           IAM+ SYP
Sbjct: 335 IAMEPSYP 342


>gi|1169186|sp|P43156.1|CYSP_HEMSP RecName: Full=Thiol protease SEN102; Flags: Precursor
 gi|396568|emb|CAA52425.1| thiol-protease [Hemerocallis hybrid cultivar]
          Length = 360

 Score =  374 bits (961), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 194/338 (57%), Positives = 234/338 (69%), Gaps = 7/338 (2%)

Query: 10  LVLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEY 69
           LV  + L +    P +     ++ ++   +E W   +  V RD  EK  RF +FKENV++
Sbjct: 11  LVALSFLSIAQSIPFTEKDLASEDSLWNLYEKWRTHH-TVARDLDEKNRRFNVFKENVKF 69

Query: 70  IASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYK-RRLPSVRSSETTDVSFRYENA-S 127
           I  FN K ++ PYKL +N+F D TN+EFR+   G K +   S R  +    SF YEN  S
Sbjct: 70  IHEFNQK-KDAPYKLALNKFGDMTNQEFRSKYAGSKIQHHRSQRGIQKNTGSFMYENVGS 128

Query: 128 VPA-SIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTS 186
           +PA SIDWR KGAVTGVKDQGQCG CWAFS +A++EGIN I T +L SLSEQELVDCDTS
Sbjct: 129 LPAASIDWRAKGAVTGVKDQGQCGSCWAFSTIASVEGINQIKTGELVSLSEQELVDCDTS 188

Query: 187 GEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSN 246
             ++GC GGLMD AFEFI  N G+ TE  YPY   DG+C     N     I G++DVP+N
Sbjct: 189 -YNEGCNGGLMDYAFEFIQKN-GITTEDSYPYAEQDGTCASNLLNSPVVSIDGHQDVPAN 246

Query: 247 NEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYW 306
           NE ALM+AVANQP+SV+I+ASG  FQFYS GVFTG+CGTELDHGV  VGYG   DGTKYW
Sbjct: 247 NENALMQAVANQPISVSIEASGYGFQFYSEGVFTGRCGTELDHGVAIVGYGATRDGTKYW 306

Query: 307 LVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
           +VKNSWG  WGE+GYIRMQR I  K G CGIAM+ASYP
Sbjct: 307 IVKNSWGEEWGESGYIRMQRGISDKRGKCGIAMEASYP 344


>gi|297792329|ref|XP_002864049.1| hypothetical protein ARALYDRAFT_495086 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297309884|gb|EFH40308.1| hypothetical protein ARALYDRAFT_495086 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 361

 Score =  374 bits (960), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 193/341 (56%), Positives = 234/341 (68%), Gaps = 10/341 (2%)

Query: 10  LVLAAILVLGVWAPQSWSRTLNDA----TMNERHEMWMAQYGRVYRDNAEKEMRFKIFKE 65
           +VLA  +++ +   +S      D     ++ E +E W + +  + R   EK  RF +FK 
Sbjct: 5   IVLALCMLMVLETTKSLDFHEKDVESEDSLWELYERWKSHH-TIARSLEEKAKRFNVFKH 63

Query: 66  NVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSE-TTDVSFRYE 124
           NV++I   N K  +  YKL +N+F D T+EEFR    G   +   +   E  T  SF Y 
Sbjct: 64  NVKHIHETNKKENS--YKLKLNKFGDMTSEEFRRTYAGSNIKHHRMFQGERQTTKSFMYA 121

Query: 125 NA-SVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDC 183
           N  ++P S+DWRK GAVT VK+QGQCG CWAFS V A+EGIN I T+KLTSLSEQELVDC
Sbjct: 122 NVDTLPTSVDWRKNGAVTPVKNQGQCGSCWAFSTVVAVEGINQIRTKKLTSLSEQELVDC 181

Query: 184 DTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDV 243
           DT+ ++QGC GGLMD AFEFI    GL +E  YPYKASD +C+  + N     I G+EDV
Sbjct: 182 DTN-KNQGCNGGLMDLAFEFIKEKGGLTSELVYPYKASDETCDTNKENAPVVSIDGHEDV 240

Query: 244 PSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGT 303
           P N+E  LMKAVA+QPVSVAIDA GSDFQFYS GVFTG+CGTEL+HGV  VGYGT  DGT
Sbjct: 241 PKNSEVDLMKAVAHQPVSVAIDAGGSDFQFYSEGVFTGRCGTELNHGVAVVGYGTTIDGT 300

Query: 304 KYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
           KYW+VKNSWG  WGE GYIRMQR I  KEGLCGIAM+ASYP
Sbjct: 301 KYWIVKNSWGEEWGEKGYIRMQRGIRHKEGLCGIAMEASYP 341


>gi|1345573|emb|CAA40073.1| endopeptidase (EP-C1) [Phaseolus vulgaris]
          Length = 361

 Score =  373 bits (958), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 191/342 (55%), Positives = 231/342 (67%), Gaps = 14/342 (4%)

Query: 10  LVLAAILVLGVWAPQSWSRTLNDATMNERHEMW-----MAQYGRVYRDNAEKEMRFKIFK 64
           +VL+  LVLGV    + S   +D  +     +W        +  V R   EK  RF +FK
Sbjct: 8   VVLSFSLVLGV----ANSFDFHDKDLASEESLWDLYERWRSHHTVSRSLGEKHKRFNVFK 63

Query: 65  ENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYK-RRLPSVRSSETTDVSFRY 123
            N+ ++   N    +KPYKL +N+FAD TN EFR+   G K       R +   + +F Y
Sbjct: 64  ANLMHV--HNTNKMDKPYKLKLNKFADMTNHEFRSTYAGSKVNHHRMFRGTPHENGAFMY 121

Query: 124 ENA-SVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVD 182
           E   SVP S+DWRKKGAVT VKDQGQCG CWAFS V A+EGIN I T KL +LSEQELVD
Sbjct: 122 EKVVSVPPSVDWRKKGAVTDVKDQGQCGSCWAFSTVVAVEGINQIKTNKLVALSEQELVD 181

Query: 183 CDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYED 242
           CD   E+QGC GGLM+ AFEFI    G+ TE+ YPYKA +G+C+  + N  A  I G+E+
Sbjct: 182 CDKE-ENQGCNGGLMESAFEFIKQKGGITTESNYPYKAQEGTCDASKVNDLAVSIDGHEN 240

Query: 243 VPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDG 302
           VP+N+E AL+KAVANQPVSVAIDA GSDFQFYS GVFTG C T+L+HGV  VGYGT  DG
Sbjct: 241 VPANDEDALLKAVANQPVSVAIDAGGSDFQFYSEGVFTGDCSTDLNHGVAIVGYGTTVDG 300

Query: 303 TKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
           T YW+V+NSWG  WGE+GYIRMQR+I  KEGLCGIAM  SYP
Sbjct: 301 TNYWIVRNSWGPEWGEHGYIRMQRNISKKEGLCGIAMLPSYP 342


>gi|147769019|emb|CAN62459.1| hypothetical protein VITISV_015168 [Vitis vinifera]
          Length = 246

 Score =  373 bits (957), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 184/269 (68%), Positives = 208/269 (77%), Gaps = 25/269 (9%)

Query: 79  NKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENAS-VPASIDWRKK 137
           +K YKL INEFAD TNEEF   RN +K  + S  ++     SF+YEN + VP++ DWRKK
Sbjct: 2   DKSYKLSINEFADLTNEEFGTSRNRFKAHICSTEAT-----SFKYENVTAVPSTXDWRKK 56

Query: 138 GAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLM 197
           GAVT +KDQGQCG CWAFSAVAAMEGI  ++T KL SLSEQELVDCDTSGEDQGC G   
Sbjct: 57  GAVTPIKDQGQCGSCWAFSAVAAMEGITQLSTGKLISLSEQELVDCDTSGEDQGCXG--- 113

Query: 198 DDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVAN 257
                           A YPY  +DG+CN+K+A   AAKI+GYEDVP+NNE AL KAVA+
Sbjct: 114 ----------------ANYPYAGTDGTCNRKKAAHPAAKINGYEDVPANNEKALQKAVAH 157

Query: 258 QPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWG 317
           QP++VAIDA G +FQFYSSGVFTGQCGTELDHGV AVGYGT+DDG KYWLVKNSWGT WG
Sbjct: 158 QPIAVAIDAGGXEFQFYSSGVFTGQCGTELDHGVXAVGYGTSDDGMKYWLVKNSWGTGWG 217

Query: 318 ENGYIRMQRDIDAKEGLCGIAMQASYPTA 346
           E GYIRMQRD+ AKEGLCGIAMQASYPTA
Sbjct: 218 EEGYIRMQRDVTAKEGLCGIAMQASYPTA 246


>gi|171702829|dbj|BAG16370.1| cysteine protease [Brassica oleracea var. italica]
          Length = 332

 Score =  372 bits (956), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 176/317 (55%), Positives = 223/317 (70%), Gaps = 7/317 (2%)

Query: 28  RTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGIN 87
           R L++ TM +RH  WM ++GRVY D  EK  R+ +FK NVE I   N       +KL +N
Sbjct: 20  RPLDEVTMQKRHAAWMTEHGRVYADANEKNNRYVVFKRNVESIERLNEVQYGLTFKLAVN 79

Query: 88  EFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENAS---VPASIDWRKKGAVTGVK 144
           +FAD TNEEFR+   GYK    SV SS T   SFRY++ S   +P S+DWRKKGAVT +K
Sbjct: 80  QFADLTNEEFRSMYTGYKGN--SVLSSRTKPTSFRYQHVSSDALPISVDWRKKGAVTPIK 137

Query: 145 DQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFI 204
           DQG CG CWAFSAVAA+EG+  I   KL SLSEQELVDCDT+  D GC GG M+ AF + 
Sbjct: 138 DQGSCGSCWAFSAVAAIEGVAQIKKGKLISLSEQELVDCDTN--DDGCMGGYMNSAFNYT 195

Query: 205 ISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAI 264
           ++  GL +E+ YPYK++DG+CN  +    A  I G+EDVP+N+E ALMKAVA+ PVS+ I
Sbjct: 196 MTTGGLTSESNYPYKSTDGTCNINKTKQIATSIKGFEDVPANDEKALMKAVAHHPVSIGI 255

Query: 265 DASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRM 324
              G+ FQFYSSGVF+G+C T LDHGV  VGYG + +G+KYW++KNSWG  WGE GY+R+
Sbjct: 256 AGGGTGFQFYSSGVFSGECSTHLDHGVAVVGYGKSSNGSKYWILKNSWGPKWGERGYMRI 315

Query: 325 QRDIDAKEGLCGIAMQA 341
           ++D  AK G CG+AM A
Sbjct: 316 KKDTKAKHGQCGLAMNA 332


>gi|351629615|gb|AEQ54771.1| KDDL-tailed cysteine proteinase CP4 [Coffea canephora]
          Length = 359

 Score =  372 bits (955), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 189/337 (56%), Positives = 235/337 (69%), Gaps = 7/337 (2%)

Query: 10  LVLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEY 69
           +VLA ILV  +    +     ++ ++ + +E W + +  V RD +EK  RF +FK NV +
Sbjct: 11  VVLAVILVAAMSMEITERDLASEESLWDLYERWRSHH-TVSRDLSEKRKRFNVFKANVHH 69

Query: 70  IASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENASVP 129
           I   N K  +KPYKL +N FAD TN EFR   +   +    +  S         +  S+P
Sbjct: 70  IHKVNQK--DKPYKLKLNSFADMTNHEFREFYSSKVKHYRMLHGSRANTGFMHGKTESLP 127

Query: 130 ASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGED 189
           AS+DWRK+GAVTGVK+QG+CG CWAFS V  +EGIN I T +L SLSEQELVDC+T  ++
Sbjct: 128 ASVDWRKQGAVTGVKNQGKCGSCWAFSTVVGVEGINKIKTGQLVSLSEQELVDCET--DN 185

Query: 190 QGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEA 249
           +GC GGLM++A+EFI  + G+ TE  YPYKA DGSC+  + N  A  I G+E VP+N+E 
Sbjct: 186 EGCNGGLMENAYEFIKKSGGITTERLYPYKARDGSCDSSKMNAPAVTIDGHEMVPANDEN 245

Query: 250 ALMKAVANQPVSVAIDASGSDFQFYSSGVFTG-QCGTELDHGVTAVGYGTADDGTKYWLV 308
           ALMKAVANQPVSVAIDASGSD QFYS GV+ G  CG ELDHGV  VGYGTA DGTKYW+V
Sbjct: 246 ALMKAVANQPVSVAIDASGSDMQFYSEGVYAGDSCGNELDHGVAVVGYGTALDGTKYWIV 305

Query: 309 KNSWGTTWGENGYIRMQRDIDAKE-GLCGIAMQASYP 344
           KNSWGT WGE GYIRMQR +DA E G+CGIAM+ASYP
Sbjct: 306 KNSWGTGWGEQGYIRMQRGVDAAEGGVCGIAMEASYP 342


>gi|537437|gb|AAC35211.1| cysteine proteinase [Hemerocallis hybrid cultivar]
          Length = 359

 Score =  372 bits (954), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 192/340 (56%), Positives = 232/340 (68%), Gaps = 11/340 (3%)

Query: 11  VLAAILVLGVWA-----PQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKE 65
           +L+ +LVLG  A     P       ++ ++   +E W A +  V RD  + + RF +FKE
Sbjct: 8   LLSVVLVLGSVALAQSIPFDEKDLASEESLWSLYEKWRAHHA-VSRDLDDTDKRFNVFKE 66

Query: 66  NVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYEN 125
           NV++I  FN K ++  YKL +N+F D TN+EFR+   G K               F YE 
Sbjct: 67  NVKFIHEFNQK-KDATYKLALNKFGDMTNQEFRSTYAGSKIDHHMTLRGVKDAGEFSYEK 125

Query: 126 -ASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCD 184
              +P S+DWR+KGAVTGVKDQGQCG CWAFS V A+EGIN I T +L SLSEQ+LVDCD
Sbjct: 126 FHDLPTSVDWREKGAVTGVKDQGQCGSCWAFSTVVAVEGINQIKTNELVSLSEQQLVDCD 185

Query: 185 TSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVP 244
           T  ++ GC GGLMD AF+FI +N GL++E  YPY A   SC   EAN +   I GY+DVP
Sbjct: 186 T--KNSGCNGGLMDYAFDFIKNNGGLSSEDSYPYLAEQKSCGS-EANSAVVTIDGYQDVP 242

Query: 245 SNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTK 304
            NNEAALMKAVANQPVSVAI+ASG  FQFYS GVF+G CGTELDHGV AVGYG  DDG K
Sbjct: 243 RNNEAALMKAVANQPVSVAIEASGYAFQFYSQGVFSGHCGTELDHGVAAVGYGVDDDGKK 302

Query: 305 YWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
           YW+VKNSWG  WGE+GYIRM+R I  K G CGIAM+ASYP
Sbjct: 303 YWIVKNSWGEGWGESGYIRMERGIKDKRGKCGIAMEASYP 342


>gi|242081867|ref|XP_002445702.1| hypothetical protein SORBIDRAFT_07g024430 [Sorghum bicolor]
 gi|241942052|gb|EES15197.1| hypothetical protein SORBIDRAFT_07g024430 [Sorghum bicolor]
          Length = 372

 Score =  371 bits (953), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 189/312 (60%), Positives = 222/312 (71%), Gaps = 13/312 (4%)

Query: 39  HEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFR 98
           +E W  ++  V RD  +K  RF +FKENV  I  FN   R++PYKL +N F D T +EFR
Sbjct: 47  YERWRGRHA-VARDLGDKARRFNVFKENVRLIHDFNQ--RDEPYKLRLNRFGDMTADEFR 103

Query: 99  ----APRNGYKRRLPSVRSSETTDVSFRYENA-SVPASIDWRKKGAVTGVKDQGQCGCCW 153
                 R  + R     R    +  SF Y  A  +P S+DWR+KGAVT VKDQGQCG CW
Sbjct: 104 RHYAGSRVAHHRMFRGDRQGSAS--SFMYAGARDLPTSVDWRQKGAVTDVKDQGQCGSCW 161

Query: 154 AFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATE 213
           AFS +AA+EGIN I T+ LTSLSEQ+LVDCDT G + GC+GGLMD AF++I  + G+A E
Sbjct: 162 AFSTIAAVEGINAIKTKNLTSLSEQQLVDCDTKG-NAGCDGGLMDYAFQYIAKHGGVAAE 220

Query: 214 AKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQF 273
             YPYKA   SC K  A   A  I GYEDVP+N+E+AL KAVA+QPVSVAI+ASGS FQF
Sbjct: 221 DAYPYKARQASCKKSPA--PAVTIDGYEDVPANDESALKKAVAHQPVSVAIEASGSHFQF 278

Query: 274 YSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEG 333
           YS GVF G+CGTELDHGVTAVGYG A DGTKYW+VKNSWG  WGE GYIRM RD+ AKEG
Sbjct: 279 YSEGVFAGRCGTELDHGVTAVGYGVAADGTKYWVVKNSWGPEWGEKGYIRMARDVAAKEG 338

Query: 334 LCGIAMQASYPT 345
            CGIAM+ASYP 
Sbjct: 339 HCGIAMEASYPV 350


>gi|351726339|ref|NP_001237379.1| cysteine proteinase precursor [Glycine max]
 gi|31559526|dbj|BAC77521.1| cysteine proteinase [Glycine max]
 gi|31559528|dbj|BAC77522.1| cysteine proteinase [Glycine max]
          Length = 362

 Score =  371 bits (953), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 185/308 (60%), Positives = 217/308 (70%), Gaps = 6/308 (1%)

Query: 39  HEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFR 98
           +E W + +  V R   +K  RF +FK NV ++   N    +KPYKL +N+FAD TN EFR
Sbjct: 40  YERWRSHH-TVSRSLGDKHKRFNVFKANVMHV--HNTNKMDKPYKLKLNKFADMTNHEFR 96

Query: 99  APRNGYK-RRLPSVRSSETTDVSFRYEN-ASVPASIDWRKKGAVTGVKDQGQCGCCWAFS 156
           +   G K       + +   + +F YE   SVP S+DWRK GAVTGVKDQGQCG CWAFS
Sbjct: 97  STYAGSKVNHHRMFQGTPRGNGTFMYEKVGSVPPSVDWRKNGAVTGVKDQGQCGSCWAFS 156

Query: 157 AVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKY 216
            V A+EGIN I T KL SLSEQELVDCDT  ++ GC GGLM+ AFEFI    G+ TE+ Y
Sbjct: 157 TVVAVEGINQIKTNKLVSLSEQELVDCDTK-KNAGCNGGLMESAFEFIKQKGGITTESNY 215

Query: 217 PYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSS 276
           PY A DG+C+  +AN  A  I G+E+VP+N+E AL+KAVANQPVSVAIDA GSDFQFYS 
Sbjct: 216 PYTAQDGTCDASKANDLAVSIDGHENVPANDENALLKAVANQPVSVAIDAGGSDFQFYSE 275

Query: 277 GVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCG 336
           GVFTG C TEL+HGV  VGYGT  DGT YW V+NSWG  WGE GYIRMQR I  KEGLCG
Sbjct: 276 GVFTGDCSTELNHGVAIVGYGTTVDGTNYWTVRNSWGPEWGEQGYIRMQRSISKKEGLCG 335

Query: 337 IAMQASYP 344
           IAM ASYP
Sbjct: 336 IAMMASYP 343


>gi|358348957|ref|XP_003638507.1| Cysteine proteinase [Medicago truncatula]
 gi|355504442|gb|AES85645.1| Cysteine proteinase [Medicago truncatula]
          Length = 362

 Score =  371 bits (953), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 190/342 (55%), Positives = 233/342 (68%), Gaps = 12/342 (3%)

Query: 10  LVLAAILVLGVWAPQSWSRTLNDATMNER----HEMWMAQYGRVYRDNAEKEMRFKIFKE 65
           +VL+  LVL V   +S+     D + +E     +E W + +  V R+  EK+ RF +FK 
Sbjct: 9   IVLSIALVLVV--SESFDFHDKDVSSDESLWDLYERWRSHH-TVSRNLNEKQKRFNVFKS 65

Query: 66  NVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYK-RRLPSVRSSETTDVSFRYE 124
           NV ++   N    +KPYKL +N+FAD TN EF+    G K       R +     +F YE
Sbjct: 66  NVMHV--HNTNKMDKPYKLKLNKFADMTNHEFKTTYAGSKVNHHRMFRGTPRVSGTFMYE 123

Query: 125 N-ASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDC 183
           N    PAS+DWRKKGAVT VKDQGQCG CWAFS V A+EGIN I T +L  LSEQEL+DC
Sbjct: 124 NFTKAPASVDWRKKGAVTDVKDQGQCGSCWAFSTVVAVEGINQIKTNRLVPLSEQELIDC 183

Query: 184 DTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDV 243
           D   E+QGC GGLM+ AFE+I    G+ TE+ YPY A+DGSC+  + N  A  I G+E V
Sbjct: 184 DNQ-ENQGCNGGLMEYAFEYIKQKGGITTESYYPYTANDGSCDATKENVPAVSIDGHETV 242

Query: 244 PSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGT 303
           P+N+E AL+KAVANQPVSVAIDA GSDFQFYS GVFTG CG EL+HGV  VGYGT  DGT
Sbjct: 243 PANDEDALLKAVANQPVSVAIDAGGSDFQFYSEGVFTGDCGKELNHGVAIVGYGTTVDGT 302

Query: 304 KYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPT 345
            YW+V+NSWG  WGE GYIRM+R++  KEGLCGIAM+ASYP 
Sbjct: 303 NYWIVRNSWGAEWGEQGYIRMKRNVSNKEGLCGIAMEASYPV 344


>gi|226507950|ref|NP_001151278.1| LOC100284911 precursor [Zea mays]
 gi|195645488|gb|ACG42212.1| vignain precursor [Zea mays]
          Length = 376

 Score =  371 bits (952), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 186/312 (59%), Positives = 220/312 (70%), Gaps = 11/312 (3%)

Query: 39  HEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFR 98
           +E W  ++  + RD  +K  RF +FK NV  I  FN   R++PYKL +N F D T +EFR
Sbjct: 49  YERWRGRHA-LARDLGDKARRFNVFKANVRLIHEFNR--RDEPYKLRLNRFGDMTADEFR 105

Query: 99  ----APRNGYKRRLPSVRSSETTDVSFRYENA-SVPASIDWRKKGAVTGVKDQGQCGCCW 153
                 R  + R     R   +   SF Y +A  VPAS+DWR+KGAVT VKDQGQCG CW
Sbjct: 106 RHYAGSRVAHHRMFRGDRQGSSASASFMYADARDVPASVDWRQKGAVTDVKDQGQCGSCW 165

Query: 154 AFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATE 213
           AFS +AA+EGIN I T+ LTSLSEQ+LVDCDT   + GC GGLMD AF++I  + G+A E
Sbjct: 166 AFSTIAAVEGINAIKTKNLTSLSEQQLVDCDTKA-NAGCNGGLMDYAFQYIAKHGGVAAE 224

Query: 214 AKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQF 273
             YPY+A   SC K  A      I GYEDVP+N+E+AL KAVA+QPVSVAI+ASGS FQF
Sbjct: 225 DAYPYRARQASCKKSPA--PVVTIDGYEDVPANDESALKKAVAHQPVSVAIEASGSHFQF 282

Query: 274 YSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEG 333
           YS GVF+G+CGTELDHGVTAVGYG   DGTKYWLVKNSWG  WGE GYIRM RD+ AKEG
Sbjct: 283 YSEGVFSGRCGTELDHGVTAVGYGVTADGTKYWLVKNSWGPEWGEKGYIRMARDVAAKEG 342

Query: 334 LCGIAMQASYPT 345
            CGIAM+ASYP 
Sbjct: 343 HCGIAMEASYPV 354


>gi|30141021|dbj|BAC75924.1| cysteine protease-2 [Helianthus annuus]
          Length = 362

 Score =  371 bits (952), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 192/329 (58%), Positives = 225/329 (68%), Gaps = 15/329 (4%)

Query: 25  SWSRTLNDATMNERHEMW-MAQYGR--VYRDNAEKEMRFKIFKENVEYIASFNNKARNKP 81
           +WS   ++  +     +W M +  R  V  ++ EK  RF +FK NV ++   N    +KP
Sbjct: 20  AWSFDFHEKELETEDNLWDMYERWRHKVATNHGEKLRRFNVFKSNVLHVHETNK--MDKP 77

Query: 82  YKLGINEFADQTNEEFRAPRNGYK-----RRLPSVRSSETTDVSFRYENA-SVPASIDWR 135
           YKL +N+FAD TN EFR+   G K     R L   RS   T   F Y N  SVP S+DWR
Sbjct: 78  YKLKLNKFADMTNHEFRSVYAGSKIHHHDRSLQGDRSGSKT---FMYANVESVPTSVDWR 134

Query: 136 KKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGG 195
           KKGAV  VKDQGQCG CWAFS VAA+EGIN I T +L SLSEQELVDCDT  E+QGC GG
Sbjct: 135 KKGAVAPVKDQGQCGSCWAFSTVAAVEGINKIKTNELVSLSEQELVDCDTL-ENQGCNGG 193

Query: 196 LMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAV 255
           LMD AF+FI    GL  E  YPY A DG C+  + N     I G+EDVP N+E +LMKAV
Sbjct: 194 LMDLAFDFIKKTGGLTREDAYPYAAEDGKCDSNKMNSPVVSIDGHEDVPKNDEQSLMKAV 253

Query: 256 ANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTT 315
           ANQPV+VAIDA  SDFQFYS GVFTG+CGT+LDHGV AVGYGT  DGTKYW+V+NSWG+ 
Sbjct: 254 ANQPVAVAIDAGSSDFQFYSEGVFTGKCGTQLDHGVAAVGYGTTLDGTKYWIVRNSWGSE 313

Query: 316 WGENGYIRMQRDIDAKEGLCGIAMQASYP 344
           WGE GYIRM+R I  K GLCGIAM+ASYP
Sbjct: 314 WGEKGYIRMERGISDKRGLCGIAMEASYP 342


>gi|351721126|ref|NP_001237199.1| cysteine proteinase precursor [Glycine max]
 gi|31559530|dbj|BAC77523.1| cysteine proteinase [Glycine max]
 gi|31559532|dbj|BAC77524.1| cysteine proteinase [Glycine max]
          Length = 362

 Score =  370 bits (951), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 192/342 (56%), Positives = 228/342 (66%), Gaps = 14/342 (4%)

Query: 10  LVLAAILVLGVWAPQSWSRTLNDATMNERHEMW-----MAQYGRVYRDNAEKEMRFKIFK 64
           +VL+  LVLGV    + S   +D  +     +W        +  V R   +K  RF +FK
Sbjct: 9   VVLSLSLVLGV----ANSFDFHDKDLESEESLWDLYERWRSHHTVSRSLGDKHKRFNVFK 64

Query: 65  ENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYK-RRLPSVRSSETTDVSFRY 123
            N+ ++   N    +KPYKL +N+FAD TN EFR+   G K       R     + +F Y
Sbjct: 65  ANMMHV--HNTNKMDKPYKLKLNKFADMTNHEFRSTYAGSKVNHHRMFRDMPRGNGTFMY 122

Query: 124 ENA-SVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVD 182
           E   SVPAS+DWRKKGAVT VKDQG CG CWAFS V A+EGIN I T KL SLSEQELVD
Sbjct: 123 EKVGSVPASVDWRKKGAVTDVKDQGHCGSCWAFSTVVAVEGINQIKTNKLVSLSEQELVD 182

Query: 183 CDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYED 242
           CDT  E+ GC GGLM+ AF+FI    G+ TE+ YPY A DG+C+  +AN  A  I G+E+
Sbjct: 183 CDTE-ENAGCNGGLMESAFQFIKQKGGITTESYYPYTAQDGTCDASKANDLAVSIDGHEN 241

Query: 243 VPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDG 302
           VP N+E AL+KAVANQPVSVAIDA GSDFQFYS GVFTG C TEL+HGV  VGYG   DG
Sbjct: 242 VPGNDENALLKAVANQPVSVAIDAGGSDFQFYSEGVFTGDCSTELNHGVAIVGYGATVDG 301

Query: 303 TKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
           T YW+V+NSWG  WGE GYIRMQR+I  KEGLCGIAM ASYP
Sbjct: 302 TSYWIVRNSWGPEWGELGYIRMQRNISKKEGLCGIAMLASYP 343


>gi|296081395|emb|CBI16828.3| unnamed protein product [Vitis vinifera]
          Length = 359

 Score =  370 bits (951), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 189/346 (54%), Positives = 229/346 (66%), Gaps = 10/346 (2%)

Query: 6   LENKLVLAAILVLGVWAPQSWSRTLNDATMNER----HEMWMAQYGRVYRDNAEKEMRFK 61
           +E  +++A  LVL     +S+     D    E     +E W + Y  V RD  EK  RF 
Sbjct: 3   MEKVILVALSLVLVFGLAESFDFDEKDLASEESLWDLYERWRS-YHTVSRDLEEKNKRFN 61

Query: 62  IFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYK-RRLPSVRSSETTDVS 120
           +FKEN +++   N    +KPYKL +N+FAD TN EFR+   G K +    +R        
Sbjct: 62  VFKENTKHVHKVNQ--MDKPYKLKLNKFADMTNHEFRSSYGGSKVKHYRMLRGDRRGTGG 119

Query: 121 FRYENAS-VPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQE 179
           F +E  + +P S+DWRKKGAVTG+KDQG+CG CWAFS V  +EGIN I T++L SLSEQ+
Sbjct: 120 FMHEKTTYLPPSVDWRKKGAVTGIKDQGKCGSCWAFSTVVGVEGINQIKTKELLSLSEQQ 179

Query: 180 LVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISG 239
           L+DCD S +D GC GGLM+ AFEFI  N G+ TE  YPYKA D  C+  + N     I G
Sbjct: 180 LIDCDRS-DDHGCNGGLMESAFEFIKKNGGITTENNYPYKAKDERCDMLKMNAPVVTIDG 238

Query: 240 YEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTA 299
           +E VP N+E ALMKAVA+QPVSVAIDA GSD QFYS GVF G+CGTELDHGV  VGYGT 
Sbjct: 239 HESVPVNDERALMKAVAHQPVSVAIDAGGSDLQFYSEGVFDGECGTELDHGVAIVGYGTT 298

Query: 300 DDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPT 345
            DGTKYW+VKNSWG  WGE GYIRM R I A EG CGIAM+ASYP 
Sbjct: 299 LDGTKYWIVKNSWGAEWGEKGYIRMARGIQAAEGQCGIAMEASYPV 344


>gi|359473128|ref|XP_002285397.2| PREDICTED: vignain-like [Vitis vinifera]
          Length = 357

 Score =  370 bits (951), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 189/346 (54%), Positives = 229/346 (66%), Gaps = 10/346 (2%)

Query: 6   LENKLVLAAILVLGVWAPQSWSRTLNDATMNER----HEMWMAQYGRVYRDNAEKEMRFK 61
           +E  +++A  LVL     +S+     D    E     +E W + Y  V RD  EK  RF 
Sbjct: 1   MEKVILVALSLVLVFGLAESFDFDEKDLASEESLWDLYERWRS-YHTVSRDLEEKNKRFN 59

Query: 62  IFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYK-RRLPSVRSSETTDVS 120
           +FKEN +++   N    +KPYKL +N+FAD TN EFR+   G K +    +R        
Sbjct: 60  VFKENTKHVHKVNQ--MDKPYKLKLNKFADMTNHEFRSSYGGSKVKHYRMLRGDRRGTGG 117

Query: 121 FRYENAS-VPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQE 179
           F +E  + +P S+DWRKKGAVTG+KDQG+CG CWAFS V  +EGIN I T++L SLSEQ+
Sbjct: 118 FMHEKTTYLPPSVDWRKKGAVTGIKDQGKCGSCWAFSTVVGVEGINQIKTKELLSLSEQQ 177

Query: 180 LVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISG 239
           L+DCD S +D GC GGLM+ AFEFI  N G+ TE  YPYKA D  C+  + N     I G
Sbjct: 178 LIDCDRS-DDHGCNGGLMESAFEFIKKNGGITTENNYPYKAKDERCDMLKMNAPVVTIDG 236

Query: 240 YEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTA 299
           +E VP N+E ALMKAVA+QPVSVAIDA GSD QFYS GVF G+CGTELDHGV  VGYGT 
Sbjct: 237 HESVPVNDERALMKAVAHQPVSVAIDAGGSDLQFYSEGVFDGECGTELDHGVAIVGYGTT 296

Query: 300 DDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPT 345
            DGTKYW+VKNSWG  WGE GYIRM R I A EG CGIAM+ASYP 
Sbjct: 297 LDGTKYWIVKNSWGAEWGEKGYIRMARGIQAAEGQCGIAMEASYPV 342


>gi|148907299|gb|ABR16787.1| unknown [Picea sitchensis]
          Length = 372

 Score =  370 bits (950), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 189/322 (58%), Positives = 237/322 (73%), Gaps = 12/322 (3%)

Query: 31  NDATMNERHEMWMAQYGRVYR--DNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINE 88
           +D ++   ++ W  Q+ R  R  D+ E   RF+IFKENV++I S N K  + PYKLG+N+
Sbjct: 37  SDESLRGLYDKWALQH-RSTRSLDSDEHARRFEIFKENVKHIDSVNKK--DGPYKLGLNK 93

Query: 89  FADQTNEEFRAPRNGYK-RRLPSVRSSETTDV-SFRYENAS-VPASIDWRKKGAVTGVKD 145
           FAD +NEEF+A     K  +  S+R     +  SF Y+N+  +PASIDWRKKGAVT VK+
Sbjct: 94  FADLSNEEFKAMHMTTKMEKHKSLRGDRGVESGSFMYQNSKRLPASIDWRKKGAVTPVKN 153

Query: 146 QGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFII 205
           QGQCG CWAFS +A++EGIN+I T KL SLSEQ+LVDC  S E+ GC GGLMD+AF++II
Sbjct: 154 QGQCGSCWAFSTIASVEGINYIKTGKLVSLSEQQLVDC--SKENAGCNGGLMDNAFQYII 211

Query: 206 SNKGLATEAKYPYKASDGSCN--KKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVA 263
            N G+ TE +YPY A  G C+  K E+   A  I G+EDVP+NNE AL KAVA+QPVS+A
Sbjct: 212 DNGGIVTEDEYPYTAEAGECSTTKIESKSIATIIDGFEDVPANNEGALKKAVAHQPVSIA 271

Query: 264 IDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIR 323
           I+ASG DFQFYS+GVFTG+CGTELDHGV  VGYG + +G  YW+V+NSWG  WGE GYIR
Sbjct: 272 IEASGHDFQFYSTGVFTGKCGTELDHGVVVVGYGKSPEGINYWIVRNSWGPEWGEQGYIR 331

Query: 324 MQRDIDAKEGLCGIAMQASYPT 345
           MQR I+A EG CGI+MQASYPT
Sbjct: 332 MQRGIEATEGKCGISMQASYPT 353


>gi|356563155|ref|XP_003549830.1| PREDICTED: vignain-like [Glycine max]
          Length = 361

 Score =  369 bits (948), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 184/308 (59%), Positives = 220/308 (71%), Gaps = 7/308 (2%)

Query: 39  HEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFR 98
           +E W + +  V R   EK  RF +FK NV ++ S N    +KPYKL +N FAD TN EFR
Sbjct: 40  YERWRSHH-TVSRSLDEKHNRFNVFKGNVMHVHSSNK--MDKPYKLKLNRFADMTNHEFR 96

Query: 99  APRNGYK-RRLPSVRSSETTDVSFRYENAS-VPASIDWRKKGAVTGVKDQGQCGCCWAFS 156
           +   G K       R +   + +F Y+N   VP+S+DWRKKGAVT VKDQGQCG CWAFS
Sbjct: 97  SIYAGSKVNHHRMFRGTPRGNGTFMYQNVDRVPSSVDWRKKGAVTDVKDQGQCGSCWAFS 156

Query: 157 AVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKY 216
            + A+EGIN I T KL  LSEQELVDCDT+ ++QGC GGLM+ AFEFI    G+ T + Y
Sbjct: 157 TIVAVEGINQIKTHKLVPLSEQELVDCDTT-QNQGCNGGLMESAFEFI-KQYGITTASNY 214

Query: 217 PYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSS 276
           PY+A DG+C+  + N  A  I G+E+VP NNEAAL+KAVA+QPVSVAI+A G DFQFYS 
Sbjct: 215 PYEAKDGTCDASKVNEPAVSIDGHENVPVNNEAALLKAVAHQPVSVAIEAGGIDFQFYSE 274

Query: 277 GVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCG 336
           GVFTG CGT LDHGV  VGYGT  DGTKYW VKNSWG+ WGE GYIRM+R I  K+GLCG
Sbjct: 275 GVFTGNCGTALDHGVAIVGYGTTQDGTKYWTVKNSWGSEWGEKGYIRMKRSISVKKGLCG 334

Query: 337 IAMQASYP 344
           IAM+ASYP
Sbjct: 335 IAMEASYP 342


>gi|224102377|ref|XP_002312656.1| predicted protein [Populus trichocarpa]
 gi|222852476|gb|EEE90023.1| predicted protein [Populus trichocarpa]
          Length = 358

 Score =  369 bits (947), Expect = 1e-99,   Method: Compositional matrix adjust.
 Identities = 187/343 (54%), Positives = 230/343 (67%), Gaps = 11/343 (3%)

Query: 9   KLVLAAILVLGVW-APQSWSRTLNDATMNER----HEMWMAQYGRVYRDNAEKEMRFKIF 63
           K++LA   V+ V+    S+  T  D    ER    +E W + +  V R  AEK+ RF +F
Sbjct: 5   KVILAVFSVVLVFRLADSFDYTEEDLASEERLRDLYERWRSHH-TVSRSLAEKQERFNVF 63

Query: 64  KENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYK-RRLPSVRSSETTDVSFR 122
           KEN+++I   N+K R  PYKL +N FAD TN EF     G K      +R       S  
Sbjct: 64  KENLKHIHKVNHKDR--PYKLKLNSFADMTNHEFLQHYGGSKVSHYRVLRGQRQGTGSMH 121

Query: 123 YENASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVD 182
            + + +P+S+DWRK GAVTG+KDQG+CG CWAFS VAA+EGIN I T +L SLSEQELVD
Sbjct: 122 EDTSKLPSSVDWRKNGAVTGIKDQGKCGSCWAFSTVAAVEGINKIKTGELISLSEQELVD 181

Query: 183 CDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYED 242
           CD+  ++ GC GGLM+DAF FI    GL +E  YPY+A +  C+  + N     I GYE 
Sbjct: 182 CDS--DNHGCNGGLMEDAFNFIKQIGGLTSENTYPYRAKEEPCDSNKMNSPVVNIDGYEM 239

Query: 243 VPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDG 302
           VP N+E ALMKAVANQPV++A+DA G D QFYS  +FTG CGTEL+HGV  VGYGT  DG
Sbjct: 240 VPENDENALMKAVANQPVAIAMDAGGKDLQFYSEAIFTGDCGTELNHGVALVGYGTTQDG 299

Query: 303 TKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPT 345
           TKYW+VKNSWGT WGE GYIRMQR IDA+EGLCGI M+ASYP 
Sbjct: 300 TKYWIVKNSWGTDWGEKGYIRMQRGIDAEEGLCGITMEASYPV 342


>gi|125533982|gb|EAY80530.1| hypothetical protein OsI_35710 [Oryza sativa Indica Group]
          Length = 378

 Score =  369 bits (947), Expect = 1e-99,   Method: Compositional matrix adjust.
 Identities = 182/337 (54%), Positives = 233/337 (69%), Gaps = 16/337 (4%)

Query: 23  PQSWSRTLNDATMNERHEMWMAQY--------GRVYRDNAEKEMRFKIFKENVEYIASFN 74
           P + S   ++ ++   +E W ++Y        G V  D+ E   RF +F EN  YI   N
Sbjct: 26  PFTESDLSSEESLRALYERWRSRYTVSRPAASGGVGNDDGEARRRFNVFVENARYIHEAN 85

Query: 75  NKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPS--VRSSETTDVSFRY---ENASVP 129
            +   +P++L +N+FAD T +EFR    G + R              SFRY   +  ++P
Sbjct: 86  RRG-GRPFRLALNKFADMTTDEFRRTYAGSRARHHRSLRGGRGGEGGSFRYGGDDEDNLP 144

Query: 130 ASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGED 189
            ++DWR++GAVTG+KDQGQCG CWAFSAVAA+EG+N I T +L +LSEQELVDCDT G++
Sbjct: 145 PAVDWRERGAVTGIKDQGQCGSCWAFSAVAAVEGVNKIKTGRLVTLSEQELVDCDT-GDN 203

Query: 190 QGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEA 249
           QGC+GGLMD AF+FI  N G+ TE+ YPY+A  G CNK +A+     I GYEDVP+N+E+
Sbjct: 204 QGCDGGLMDYAFQFIKRNGGITTESNYPYRAEQGRCNKAKASSHDVTIDGYEDVPANDES 263

Query: 250 ALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVK 309
           AL KAVANQPV+VA++ASG DFQFYS GVFTG+CGT+LDHGV AVGYG   DGTKYW+VK
Sbjct: 264 ALQKAVANQPVAVAVEASGQDFQFYSEGVFTGECGTDLDHGVAAVGYGITRDGTKYWIVK 323

Query: 310 NSWGTTWGENGYIRMQRDIDA-KEGLCGIAMQASYPT 345
           NSWG  WGE GYIRMQR + +   GLCGIAM+ASYP 
Sbjct: 324 NSWGEDWGERGYIRMQRGVSSDSNGLCGIAMEASYPV 360


>gi|171702841|dbj|BAG16376.1| cysteine protease [Brassica rapa var. perviridis]
          Length = 333

 Score =  369 bits (946), Expect = 1e-99,   Method: Compositional matrix adjust.
 Identities = 177/315 (56%), Positives = 222/315 (70%), Gaps = 7/315 (2%)

Query: 30  LNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEF 89
           L++  M +RH  WM ++GRVY D  EK  R+ +FK NVE I   N+      +KL +N+F
Sbjct: 23  LDEVAMQKRHAEWMTEHGRVYADANEKNNRYAVFKRNVERIERLNDVQSGLTFKLAVNQF 82

Query: 90  ADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENAS---VPASIDWRKKGAVTGVKDQ 146
           AD TNEEFR+   G+K    SV SS T   SFRY+N S   +P S+DWRKKGAVT +KDQ
Sbjct: 83  ADLTNEEFRSMYTGFKGN--SVLSSRTKPTSFRYQNVSSDALPVSVDWRKKGAVTPIKDQ 140

Query: 147 GQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIIS 206
           G CG CWAFSAVAA+EG+  I   KL SLSEQELVDCDT+  D GC GGLMD AF + I+
Sbjct: 141 GLCGSCWAFSAVAAIEGVAQIKKGKLISLSEQELVDCDTN--DGGCMGGLMDTAFNYTIT 198

Query: 207 NKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDA 266
             GL +E+ YPYK+++G+CN  +    A  I G+EDVP+N+E ALMKAVA+ PVS+ I  
Sbjct: 199 IGGLTSESNYPYKSTNGTCNFNKTKQIATSIKGFEDVPANDEKALMKAVAHHPVSIGIAG 258

Query: 267 SGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQR 326
               FQFYSSGVF+G+C T LDHGVTAVGYG + +G KYW++KNSWG  WGE GY+R+++
Sbjct: 259 GDIGFQFYSSGVFSGECTTHLDHGVTAVGYGRSKNGLKYWILKNSWGPKWGERGYMRIKK 318

Query: 327 DIDAKEGLCGIAMQA 341
           DI  K G CG+AM A
Sbjct: 319 DIKPKHGQCGLAMNA 333


>gi|116781957|gb|ABK22314.1| unknown [Picea sitchensis]
          Length = 369

 Score =  369 bits (946), Expect = 2e-99,   Method: Compositional matrix adjust.
 Identities = 187/320 (58%), Positives = 232/320 (72%), Gaps = 12/320 (3%)

Query: 31  NDATMNERHEMWMAQYGRVYR--DNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINE 88
           ++ ++   ++ W  Q+ R  R  D+ E   RF+IFKENV+YI S N K  + PYKLG+N+
Sbjct: 38  SEKSLRSLYDNWALQH-RSSRSLDSEEHAERFEIFKENVKYIDSVNKK--DSPYKLGLNK 94

Query: 89  FADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENAS-VPASIDWRKKGAVTGVKDQG 147
           FAD +NEEF+A   G K  L   R  E    SF Y+N+  +PASIDWR+KGAV  VK+QG
Sbjct: 95  FADLSNEEFKAIYMGTKMDLRGDR--EVQSGSFMYQNSEPLPASIDWRQKGAVAAVKNQG 152

Query: 148 QCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISN 207
            CG CWAFS VA++EGIN+ITT  L SLSEQ+LVDC T  E+ GC GGLMD AF++II+N
Sbjct: 153 HCGSCWAFSTVASVEGINYITTGNLVSLSEQQLVDCST--ENSGCNGGLMDTAFQYIINN 210

Query: 208 KGLATEAKYPYKASDGSCNKKEANPSAAK--ISGYEDVPSNNEAALMKAVANQPVSVAID 265
            G+ TE  YPY A    C+  + N    +  I G+EDVP+NNE AL +AVA+QPVSVAI+
Sbjct: 211 GGIVTEDNYPYTAEATECSSTKINSQTTRVVIDGFEDVPANNEQALKEAVAHQPVSVAIE 270

Query: 266 ASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQ 325
           ASG DFQFYS+GVFTG+CGT LDHGV AVGYGT+ +G  YW+V+NSWG  WGE GYIRMQ
Sbjct: 271 ASGQDFQFYSTGVFTGKCGTALDHGVVAVGYGTSPEGINYWIVRNSWGPKWGEEGYIRMQ 330

Query: 326 RDIDAKEGLCGIAMQASYPT 345
           + I+A EG CGIAMQASYPT
Sbjct: 331 QGIEAAEGKCGIAMQASYPT 350


>gi|115484973|ref|NP_001067630.1| Os11g0255300 [Oryza sativa Japonica Group]
 gi|530335|emb|CAA56844.1| cysteine protease [Oryza sativa Japonica Group]
 gi|5761322|dbj|BAA83472.1| cysteine endopeptidase [Oryza sativa Japonica Group]
 gi|62732672|gb|AAX94791.1| cysteine proteinase (EC 3.4.22.-) - rice [Oryza sativa Japonica
           Group]
 gi|62732673|gb|AAX94792.1| cysteine proteinase (EC 3.4.22.-) - rice [Oryza sativa Japonica
           Group]
 gi|62732674|gb|AAX94793.1| cysteine proteinase (EC 3.4.22.-) - rice [Oryza sativa Japonica
           Group]
 gi|77549615|gb|ABA92412.1| Thiol protease SEN102 precursor, putative, expressed [Oryza sativa
           Japonica Group]
 gi|77549616|gb|ABA92413.1| Thiol protease SEN102 precursor, putative, expressed [Oryza sativa
           Japonica Group]
 gi|77549617|gb|ABA92414.1| Thiol protease SEN102 precursor, putative, expressed [Oryza sativa
           Japonica Group]
 gi|113644852|dbj|BAF27993.1| Os11g0255300 [Oryza sativa Japonica Group]
 gi|125576789|gb|EAZ18011.1| hypothetical protein OsJ_33558 [Oryza sativa Japonica Group]
 gi|215701098|dbj|BAG92522.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 378

 Score =  368 bits (945), Expect = 2e-99,   Method: Compositional matrix adjust.
 Identities = 182/337 (54%), Positives = 233/337 (69%), Gaps = 16/337 (4%)

Query: 23  PQSWSRTLNDATMNERHEMWMAQY--------GRVYRDNAEKEMRFKIFKENVEYIASFN 74
           P + S   ++ ++   +E W ++Y        G V  D+ E   RF +F EN  YI   N
Sbjct: 26  PFTESDLSSEESLRALYERWRSRYTVSRPAASGGVGNDDGEARRRFNVFVENARYIHEAN 85

Query: 75  NKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDV--SFRY---ENASVP 129
            +   +P++L +N+FAD T +EFR    G + R     S        SFRY   +  ++P
Sbjct: 86  RRG-GRPFRLALNKFADMTTDEFRRTYAGSRARHHRSLSGGRGGEGGSFRYGGDDEDNLP 144

Query: 130 ASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGED 189
            ++DWR++GAVTG+KDQGQCG CWAFS VAA+EG+N I T +L +LSEQELVDCDT G++
Sbjct: 145 PAVDWRERGAVTGIKDQGQCGSCWAFSTVAAVEGVNKIKTGRLVTLSEQELVDCDT-GDN 203

Query: 190 QGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEA 249
           QGC+GGLMD AF+FI  N G+ TE+ YPY+A  G CNK +A+     I GYEDVP+N+E+
Sbjct: 204 QGCDGGLMDYAFQFIKRNGGITTESNYPYRAEQGRCNKAKASSHDVTIDGYEDVPANDES 263

Query: 250 ALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVK 309
           AL KAVANQPV+VA++ASG DFQFYS GVFTG+CGT+LDHGV AVGYG   DGTKYW+VK
Sbjct: 264 ALQKAVANQPVAVAVEASGQDFQFYSEGVFTGECGTDLDHGVAAVGYGITRDGTKYWIVK 323

Query: 310 NSWGTTWGENGYIRMQRDIDA-KEGLCGIAMQASYPT 345
           NSWG  WGE GYIRMQR + +   GLCGIAM+ASYP 
Sbjct: 324 NSWGEDWGERGYIRMQRGVSSDSNGLCGIAMEASYPV 360


>gi|242038089|ref|XP_002466439.1| hypothetical protein SORBIDRAFT_01g007820 [Sorghum bicolor]
 gi|241920293|gb|EER93437.1| hypothetical protein SORBIDRAFT_01g007820 [Sorghum bicolor]
          Length = 353

 Score =  368 bits (944), Expect = 3e-99,   Method: Compositional matrix adjust.
 Identities = 187/314 (59%), Positives = 230/314 (73%), Gaps = 6/314 (1%)

Query: 35  MNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTN 94
           M  RHE WMA++GR Y D AEK  R +IF+ N E+I SFN+  ++  ++L  N FAD T+
Sbjct: 43  MVSRHEKWMAEHGRTYTDEAEKARRLEIFRANAEFIDSFNDAGKHS-HRLATNRFADLTD 101

Query: 95  EEFRAPRNGYKRRLPSVRSSETTDVSFRYENASVP---ASIDWRKKGAVTGVKDQGQCGC 151
           EEFRA R G++ R     ++ +    FRYEN S+     S+DWR  GAVTGVKDQG+CGC
Sbjct: 102 EEFRAARTGFRPRPAPAAAAGSG-GRFRYENFSLADAAQSVDWRAMGAVTGVKDQGECGC 160

Query: 152 CWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLA 211
           CWAFSAVAA+EG+N I T +L SLSEQELVDCD +GEDQGCEGGLMDDAF+FI    GLA
Sbjct: 161 CWAFSAVAAVEGLNKIRTGRLVSLSEQELVDCDVNGEDQGCEGGLMDDAFQFIERRGGLA 220

Query: 212 TEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDF 271
           +E+ YPY+  DGSC    A   AA I G+EDVP NNEAAL  AVANQPVSVAI+     F
Sbjct: 221 SESGYPYQGDDGSCRSSAAAARAASIRGHEDVPRNNEAALAAAVANQPVSVAINGEDYAF 280

Query: 272 QFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAK 331
           +FY SGV  G+CGT+L+H +TAVGYGTA DG+KYWL+KNSWGT+WGE GY+R++R +   
Sbjct: 281 RFYDSGVLGGECGTDLNHAITAVGYGTAADGSKYWLMKNSWGTSWGEGGYVRIRRGVRG- 339

Query: 332 EGLCGIAMQASYPT 345
           EG+CG+A   SYP 
Sbjct: 340 EGVCGLAKLPSYPV 353


>gi|414870137|tpg|DAA48694.1| TPA: vignain [Zea mays]
          Length = 484

 Score =  368 bits (944), Expect = 3e-99,   Method: Compositional matrix adjust.
 Identities = 186/313 (59%), Positives = 220/313 (70%), Gaps = 12/313 (3%)

Query: 39  HEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFR 98
           +E W  ++  + RD  +K  RF +FK NV  I  FN   R++PYKL +N F D T +EFR
Sbjct: 156 YERWRGRHA-LARDLGDKARRFNVFKANVRLIHEFNR--RDEPYKLRLNRFGDMTADEFR 212

Query: 99  ----APRNGYKRRLPSVRS-SETTDVSFRYENA-SVPASIDWRKKGAVTGVKDQGQCGCC 152
                 R  + R     R  S  +  SF Y +A  VPAS+DWR+KGAVT VKDQGQCG C
Sbjct: 213 RHYAGSRVAHHRMFRGDRQGSSASASSFMYADARDVPASVDWRQKGAVTDVKDQGQCGSC 272

Query: 153 WAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLAT 212
           WAFS +AA+EGIN I T+ LTSLSEQ+LVDCDT   + GC GGLMD AF++I  + G+A 
Sbjct: 273 WAFSTIAAVEGINAIKTKNLTSLSEQQLVDCDTKA-NAGCNGGLMDYAFQYIAKHGGVAA 331

Query: 213 EAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQ 272
           E  YPY+A   SC K  A      I GYEDVP+N+E+AL KAVA+QPVSVAI+ASGS FQ
Sbjct: 332 EDAYPYRARQASCKKSPA--PVVTIDGYEDVPANDESALKKAVAHQPVSVAIEASGSHFQ 389

Query: 273 FYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKE 332
           FYS GVF+G+CGTELDHGV AVGYG   DGTKYWLVKNSWG  WGE GYIRM RD+ AKE
Sbjct: 390 FYSEGVFSGRCGTELDHGVAAVGYGVTADGTKYWLVKNSWGPEWGEKGYIRMARDVAAKE 449

Query: 333 GLCGIAMQASYPT 345
           G CGIAM+ASYP 
Sbjct: 450 GHCGIAMEASYPV 462


>gi|255547982|ref|XP_002515048.1| cysteine protease, putative [Ricinus communis]
 gi|223546099|gb|EEF47602.1| cysteine protease, putative [Ricinus communis]
          Length = 359

 Score =  366 bits (940), Expect = 7e-99,   Method: Compositional matrix adjust.
 Identities = 179/307 (58%), Positives = 217/307 (70%), Gaps = 6/307 (1%)

Query: 39  HEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFR 98
           +E W + +  V R   EK  RF +FKEN+++I   N K R  PYKL +N+FAD TN EF 
Sbjct: 40  YERWRSHH-TVSRSLTEKNQRFNVFKENLKHIHKVNQKDR--PYKLRLNKFADMTNHEFL 96

Query: 99  APRNGYKRRLPSVRSSETTDVSFRYENAS-VPASIDWRKKGAVTGVKDQGQCGCCWAFSA 157
               G K     +         F +EN S +P+SIDWRK+GAVTGVKDQG+CG CWAFS+
Sbjct: 97  QHYGGSKVSHYRMFHGSRRQTGFAHENTSNLPSSIDWRKQGAVTGVKDQGKCGSCWAFSS 156

Query: 158 VAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYP 217
           VAA+EGIN I T +L SLSEQELVDC++   + GC+GGLM+ AF FI    GL TE  YP
Sbjct: 157 VAAVEGINKIKTGELISLSEQELVDCNSV--NHGCDGGLMEQAFSFIEKTGGLTTENNYP 214

Query: 218 YKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSG 277
           Y+A DG C+  + N     I GYE VP N+E ALM+AVANQPVS+AIDA G DFQFYS G
Sbjct: 215 YRAKDGYCDSAKMNTPMVTIDGYEMVPENDEHALMQAVANQPVSIAIDAGGQDFQFYSEG 274

Query: 278 VFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGI 337
           V+TG CGTEL+HGV  VGYG   DGTKYW+VKNSWG+ WGENG+IRMQR+ D +EGLCGI
Sbjct: 275 VYTGDCGTELNHGVALVGYGATQDGTKYWIVKNSWGSEWGENGFIRMQRENDVEEGLCGI 334

Query: 338 AMQASYP 344
            ++ASYP
Sbjct: 335 TLEASYP 341


>gi|302816909|ref|XP_002990132.1| hypothetical protein SELMODRAFT_428615 [Selaginella moellendorffii]
 gi|300142145|gb|EFJ08849.1| hypothetical protein SELMODRAFT_428615 [Selaginella moellendorffii]
          Length = 358

 Score =  366 bits (939), Expect = 1e-98,   Method: Compositional matrix adjust.
 Identities = 184/319 (57%), Positives = 224/319 (70%), Gaps = 12/319 (3%)

Query: 32  DATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFAD 91
           D +    +E WM  +GRVY    EKE RF+IF++N EYI   +N+  N+ Y LG+N FAD
Sbjct: 27  DGSFRALYEKWMVDHGRVYNGIGEKERRFQIFRDNAEYIEE-HNRQVNQTYWLGLNNFAD 85

Query: 92  QTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENAS-VPASIDWRKKGAVTGVKDQGQCG 150
            T++EF+A   G K  L     S T    FRYE+A+ +P   DWR KGAV  VK+QG CG
Sbjct: 86  MTHDEFKALYFGTKVPL-----SNTIKSGFRYEDATNLPLDTDWRSKGAVATVKNQGACG 140

Query: 151 CCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGL 210
            CWAFS VAA+EG+N I T +L SLSEQELVDCD   ++QGC GGLMD AFEFII N GL
Sbjct: 141 SCWAFSTVAAVEGVNQIVTGELVSLSEQELVDCDKQ-KNQGCNGGLMDSAFEFIIQNGGL 199

Query: 211 ATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSD 270
            +EA YPYKA  GSC++   N     I G+EDVP+ +EA L+KAVANQPVSVAI+ASG +
Sbjct: 200 DSEADYPYKAVSGSCDESRRNSHVVTIDGFEDVPAESEADLLKAVANQPVSVAIEASGRN 259

Query: 271 FQFYSSGVFTGQCGTELDHGVTAVGYGTAD--DG--TKYWLVKNSWGTTWGENGYIRMQR 326
           FQ YS GV+TG CG ELDHGV AVGYGT+   DG  T YW+V+NSWG  WGE+GYIR+QR
Sbjct: 260 FQLYSGGVYTGHCGYELDHGVVAVGYGTSKTPDGVATDYWIVRNSWGDAWGESGYIRLQR 319

Query: 327 DIDAKEGLCGIAMQASYPT 345
           ++ +  G CGIAM ASYP 
Sbjct: 320 NVASSRGKCGIAMMASYPV 338


>gi|217073894|gb|ACJ85307.1| unknown [Medicago truncatula]
 gi|388507498|gb|AFK41815.1| unknown [Medicago truncatula]
          Length = 362

 Score =  365 bits (938), Expect = 1e-98,   Method: Compositional matrix adjust.
 Identities = 188/342 (54%), Positives = 231/342 (67%), Gaps = 12/342 (3%)

Query: 10  LVLAAILVLGVWAPQSWSRTLNDATMNER----HEMWMAQYGRVYRDNAEKEMRFKIFKE 65
           +VL+  LVL V   +S+     D + +E     +E W + +  V R+  EK+ RF +FK 
Sbjct: 9   IVLSIALVLVV--SESFDFHDKDVSSDESLWDLYERWRSHH-TVSRNLNEKQKRFNVFKS 65

Query: 66  NVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYK-RRLPSVRSSETTDVSFRYE 124
           NV ++   N    +KPYKL +N+FAD TN EF+    G K       R +     +F YE
Sbjct: 66  NVMHV--HNTNKMDKPYKLKLNKFADMTNHEFKTTYAGSKVNHHRMFRGTPRVSGTFMYE 123

Query: 125 N-ASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDC 183
           N    PAS+DWRKKGAVT VKDQGQCG CWAFS V A+EGIN I T +L  LSEQEL+DC
Sbjct: 124 NFTKAPASVDWRKKGAVTDVKDQGQCGSCWAFSTVVAVEGINQIKTNRLVPLSEQELIDC 183

Query: 184 DTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDV 243
           D   E+QGC GGLM+ AFE+I    G+ TE+ YPY A+DGSC+  + N     I G+E V
Sbjct: 184 DNQ-ENQGCNGGLMEYAFEYIKQKGGVTTESYYPYTANDGSCDATKENVPTVSIDGHETV 242

Query: 244 PSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGT 303
           P+N+E AL+KAVANQPVSVAIDA GSDFQFYS GVFTG CG EL+HGV  VGYGT  DGT
Sbjct: 243 PANDEDALLKAVANQPVSVAIDAGGSDFQFYSEGVFTGDCGKELNHGVAIVGYGTTVDGT 302

Query: 304 KYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPT 345
            YW+V+NSWG  WGE G IRM+R++  KEGLCGIAM+ASYP 
Sbjct: 303 NYWIVRNSWGAEWGEQGCIRMKRNVSNKEGLCGIAMEASYPV 344


>gi|388517427|gb|AFK46775.1| unknown [Medicago truncatula]
          Length = 362

 Score =  365 bits (938), Expect = 1e-98,   Method: Compositional matrix adjust.
 Identities = 188/342 (54%), Positives = 231/342 (67%), Gaps = 12/342 (3%)

Query: 10  LVLAAILVLGVWAPQSWSRTLNDATMNER----HEMWMAQYGRVYRDNAEKEMRFKIFKE 65
           +VL+  LVL V   +S+     D + +E     +E W + +  V R+  EK+ RF +FK 
Sbjct: 9   IVLSIALVLVV--SESFDFHDKDVSSDESLWDLYERWRSHH-TVSRNLNEKQKRFNVFKS 65

Query: 66  NVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYK-RRLPSVRSSETTDVSFRYE 124
           NV ++   N    +KPYKL +N+FAD TN EF+    G K       R +     +F YE
Sbjct: 66  NVMHV--HNTNKMDKPYKLKLNKFADMTNHEFKTTYAGTKVNHHRMFRGTPRVSGTFMYE 123

Query: 125 N-ASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDC 183
           N    PAS+DWRKKGAVT VKDQGQCG CWAFS V A+EGIN I T +L  LSEQEL+DC
Sbjct: 124 NFTKAPASVDWRKKGAVTDVKDQGQCGSCWAFSTVVAVEGINQIKTNRLVPLSEQELIDC 183

Query: 184 DTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDV 243
           D   E+QGC GGLM+ AFE+I    G+ TE+ YPY A+DGSC+  + N     I G+E V
Sbjct: 184 DNQ-ENQGCNGGLMEYAFEYIKQKGGVTTESYYPYTANDGSCDATKENVPTVSIDGHETV 242

Query: 244 PSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGT 303
           P+N+E AL+KAVANQPVSVAIDA GSDFQFYS GVFTG CG EL+HGV  VGYGT  DGT
Sbjct: 243 PANDEDALLKAVANQPVSVAIDAGGSDFQFYSEGVFTGDCGKELNHGVAIVGYGTTVDGT 302

Query: 304 KYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPT 345
            YW+V+NSWG  WGE G IRM+R++  KEGLCGIAM+ASYP 
Sbjct: 303 NYWIVRNSWGAEWGEQGCIRMKRNVSNKEGLCGIAMEASYPV 344


>gi|242071345|ref|XP_002450949.1| hypothetical protein SORBIDRAFT_05g021550 [Sorghum bicolor]
 gi|241936792|gb|EES09937.1| hypothetical protein SORBIDRAFT_05g021550 [Sorghum bicolor]
          Length = 371

 Score =  365 bits (937), Expect = 2e-98,   Method: Compositional matrix adjust.
 Identities = 196/356 (55%), Positives = 238/356 (66%), Gaps = 26/356 (7%)

Query: 10  LVLAAI-LVLGVWAPQSWS-------RTLNDATMNERHEMWMAQYGRVYR-----DNAEK 56
           LVLAA+ L L V AP + +          ++ ++   +E W + Y  V R     +  +K
Sbjct: 5   LVLAAVSLALLVLAPPARAGIPFTEKDLASEESLRALYEQWRSHY-MVSRPAGLQEQDDK 63

Query: 57  EMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFR-----APRNGYKRRLPS- 110
              F +FKENV YI   N K R+  ++L +N+FAD T +EFR       R  + R L S 
Sbjct: 64  ARWFNVFKENVRYIHEANKKGRS--FRLALNKFADMTTDEFRRAYAAGSRTRHHRALSSG 121

Query: 111 VRSSETTDVSFRYENA-SVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITT 169
           +R     D SF Y  A ++P ++DWR++GAVTG+KDQGQCG CWAFS +AA+EGIN I T
Sbjct: 122 IR--RHGDGSFMYAQAGNLPLAVDWRQRGAVTGIKDQGQCGSCWAFSTIAAVEGINKIRT 179

Query: 170 RKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKE 229
            KL SLSEQELVDCD   ++QGC GGLMD AF++I  N G+ TE+ YPY A   SCNK +
Sbjct: 180 GKLVSLSEQELVDCDDV-DNQGCNGGLMDYAFQYIKRNGGITTESNYPYLAEQRSCNKAK 238

Query: 230 ANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDH 289
                  I GYEDVP+NNE AL KAVANQPVS+AI+ASG DFQFYS GVFTG CGTELDH
Sbjct: 239 ERSHDVTIDGYEDVPANNEDALQKAVANQPVSIAIEASGQDFQFYSEGVFTGSCGTELDH 298

Query: 290 GVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPT 345
           GV AVGYG   DGTKYW+VKNSWG  WGE GYIRMQR I   +GLCGIAM+ SYPT
Sbjct: 299 GVAAVGYGITRDGTKYWIVKNSWGEDWGERGYIRMQRGISDSQGLCGIAMEPSYPT 354


>gi|374530932|gb|AEP83812.2| cysteine endopeptidase EP8 [Secale cereale x Triticum durum]
          Length = 364

 Score =  365 bits (937), Expect = 2e-98,   Method: Compositional matrix adjust.
 Identities = 185/312 (59%), Positives = 222/312 (71%), Gaps = 10/312 (3%)

Query: 39  HEMWMAQYGRVYRD---NAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNE 95
           +E W + Y    R    +AE E RF +FKEN  YI   N K R  P++L +N+FAD T +
Sbjct: 40  YERWRSHYTVSRRGLGADAE-ERRFNVFKENARYIHEGNKKDR--PFRLALNKFADMTTD 96

Query: 96  EFRAPRNGYK-RRLPSVRSSETTDVSFRYENA-SVPASIDWRKKGAVTGVKDQGQCGCCW 153
           EFR    G + R   S+      D SFRY +A ++P ++DWR+KGAVT +KDQGQCG CW
Sbjct: 97  EFRRTYAGSRVRHHLSLSGGRRGDGSFRYGDADNLPPAVDWRQKGAVTAIKDQGQCGSCW 156

Query: 154 AFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATE 213
           AFS + A+EGIN I T KL SLSEQEL+DCD    +QGC+GGLMD AF+FI  N G+ TE
Sbjct: 157 AFSTIVAVEGINKIRTGKLVSLSEQELMDCDNV-NNQGCDGGLMDYAFQFIHKN-GITTE 214

Query: 214 AKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQF 273
           + YPY+   GSC+  +    A  I GYEDVP+N+E+AL KAVA QPVSVAIDASG+DFQF
Sbjct: 215 SNYPYQGEQGSCDLAKEKAHAVTIDGYEDVPANDESALQKAVAGQPVSVAIDASGNDFQF 274

Query: 274 YSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEG 333
           YS GVFTG+C T+LDHGV AVGYGT  DGTKYW+VKNSWG  WGE GYIRMQR +   EG
Sbjct: 275 YSEGVFTGECSTDLDHGVAAVGYGTTRDGTKYWIVKNSWGEDWGEKGYIRMQRGVSQAEG 334

Query: 334 LCGIAMQASYPT 345
            CGIAMQASYPT
Sbjct: 335 QCGIAMQASYPT 346


>gi|356533293|ref|XP_003535200.1| PREDICTED: LOW QUALITY PROTEIN: cysteine proteinase RD21a-like
           [Glycine max]
          Length = 466

 Score =  364 bits (935), Expect = 3e-98,   Method: Compositional matrix adjust.
 Identities = 183/328 (55%), Positives = 231/328 (70%), Gaps = 12/328 (3%)

Query: 26  WSRTLNDATMNERH-----EMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNK 80
           W+  ++    +E H     E W+ ++G+ Y    EKE RFKIFK+N+ +I   +N A +K
Sbjct: 30  WAMDMSIIDYDESHTRHVYEAWLVKHGKAYNALGEKERRFKIFKDNLRFIEE-HNGAGDK 88

Query: 81  PYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYE---NASVPASIDWRKK 137
            YKLG+N+FAD TNEE+RA   G + R P  +++     + RY       +PA +DWR+K
Sbjct: 89  SYKLGLNKFADLTNEEYRAMFLGTRTRGPKNKAAVVAKKTDRYAYRAGEELPAMVDWREK 148

Query: 138 GAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLM 197
           GAVT +KDQGQCG CWAFS V A+EGIN I T  LTSLSEQELVDCD  G + GC GGLM
Sbjct: 149 GAVTPIKDQGQCGSCWAFSTVGAVEGINQIVTGNLTSLSEQELVDCD-RGYNMGCNGGLM 207

Query: 198 DDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVAN 257
           D AFEFI+ N G+ TE  YPY A D +C+    N     I GYEDVP+N+E +LMKAVAN
Sbjct: 208 DYAFEFIVQNGGIDTEEDYPYHAKDNTCDPNRKNARVVTIDGYEDVPTNDEKSLMKAVAN 267

Query: 258 QPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWG 317
           QPVSVAI+A G +FQ Y SGVFTG+CGT LDHGV AVGYGT ++GT YWLV+NSWG+ WG
Sbjct: 268 QPVSVAIEAGGMEFQLYQSGVFTGRCGTNLDHGVVAVGYGT-ENGTDYWLVRNSWGSAWG 326

Query: 318 ENGYIRMQRDIDAKE-GLCGIAMQASYP 344
           ENGYI+++R++   E G CGIA++ASYP
Sbjct: 327 ENGYIKLERNVQNTETGKCGIAIEASYP 354


>gi|116786779|gb|ABK24233.1| unknown [Picea sitchensis]
          Length = 463

 Score =  364 bits (935), Expect = 3e-98,   Method: Compositional matrix adjust.
 Identities = 192/352 (54%), Positives = 240/352 (68%), Gaps = 13/352 (3%)

Query: 1   MAMILLENKLVLAAIL------VLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNA 54
           M ++LL   L L+A+          +    S     +DA M E +E+W+AQ+ + Y    
Sbjct: 1   MGILLLFAVLALSAMAGSASRADFSIIGYDSKDLREDDAIM-ELYELWLAQHKKAYNGLG 59

Query: 55  EKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSS 114
           EK+ RF +FK+N  YI   NN+  N  YKLG+N+FAD ++EEF+A   G K      R S
Sbjct: 60  EKQNRFSVFKDNFLYIHQHNNQG-NPSYKLGLNQFADLSHEEFKATYLGAKLDTKK-RLS 117

Query: 115 ETTDVSFRYENA-SVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLT 173
            +    ++Y +   +P SIDWR+KGAVT VKDQG CG CWAFS VAA+EGIN I T  LT
Sbjct: 118 NSPSPRYQYSDGEDLPESIDWREKGAVTAVKDQGSCGSCWAFSTVAAVEGINQIVTGNLT 177

Query: 174 SLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPS 233
           SLSEQELVDCDTS  +QGC GGLMD AF+FII+N GL +E  YPYKA+DGSC+    N  
Sbjct: 178 SLSEQELVDCDTS-YNQGCNGGLMDYAFQFIINNGGLDSEDDYPYKANDGSCDAYRKNAH 236

Query: 234 AAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTA 293
              I  YEDVP N+E +L KA ANQP+SVAI+ASG  FQFY SGVFT  CGT+LDHGVT 
Sbjct: 237 VVTIDDYEDVPENDEKSLKKAAANQPISVAIEASGRAFQFYESGVFTSTCGTQLDHGVTL 296

Query: 294 VGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDID-AKEGLCGIAMQASYP 344
           VGYG+ + GT YW+VKNSWG +WGE G+IR+QR+I+    G+CGIAM+ASYP
Sbjct: 297 VGYGS-ESGTDYWIVKNSWGKSWGEKGFIRLQRNIEGVSTGMCGIAMEASYP 347


>gi|147790682|emb|CAN61026.1| hypothetical protein VITISV_001146 [Vitis vinifera]
          Length = 469

 Score =  363 bits (932), Expect = 7e-98,   Method: Compositional matrix adjust.
 Identities = 182/308 (59%), Positives = 227/308 (73%), Gaps = 7/308 (2%)

Query: 39  HEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFR 98
           +E W+A++G+ Y    EKE RF+IFK+N+ +I   N  A N+ YK+G+N FAD TNEE+R
Sbjct: 53  YEAWLAKHGKSYNALGEKERRFQIFKDNLRFIDEHN--AENRTYKVGLNRFADLTNEEYR 110

Query: 99  APRNGYKRRLPSVRSSETTD-VSFRYENASVPASIDWRKKGAVTGVKDQGQCGCCWAFSA 157
           +   G +       S++ +D  +FR  + S+P S+DWRKKGAV  VKDQG CG CWAFS 
Sbjct: 111 SMYLGTRTAAKRRSSNKISDRYAFRVGD-SLPESVDWRKKGAVVEVKDQGSCGSCWAFST 169

Query: 158 VAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYP 217
           +AA+EGIN I T  L SLSEQELVDCDTS  ++GC GGLMD AFEFII+N G+ +E  YP
Sbjct: 170 IAAVEGINKIVTGGLISLSEQELVDCDTS-YNEGCNGGLMDYAFEFIINNGGIDSEEDYP 228

Query: 218 YKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSG 277
           YKASDG C++   N     I GYEDVP N+E +L KAVANQPVSVAI+A G +FQ Y SG
Sbjct: 229 YKASDGRCDQYRKNAXVVTIDGYEDVPENDEKSLEKAVANQPVSVAIEAGGREFQLYQSG 288

Query: 278 VFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDI-DAKEGLCG 336
           +FTG+CGT LDHGVTAVGYGT ++G  YW+VKNSWG +WGE GYIRM+RD+  +  G CG
Sbjct: 289 IFTGRCGTALDHGVTAVGYGT-ENGVDYWIVKNSWGASWGEEGYIRMERDLATSATGKCG 347

Query: 337 IAMQASYP 344
           IAM+ASYP
Sbjct: 348 IAMEASYP 355


>gi|225458701|ref|XP_002284973.1| PREDICTED: cysteine proteinase RD21a-like [Vitis vinifera]
          Length = 467

 Score =  363 bits (932), Expect = 7e-98,   Method: Compositional matrix adjust.
 Identities = 189/358 (52%), Positives = 243/358 (67%), Gaps = 19/358 (5%)

Query: 1   MAMILLENKLVLAAILVLGVWAPQSWSRTLNDATMNER------------HEMWMAQYGR 48
           M +    + + +   L+LG+ +    S    D T  ++            +E W+A++G+
Sbjct: 1   MGLCRSSSSMAVFLFLLLGLASALDMSIIGYDETHGDKSSWRTDEDVMAVYEAWLAKHGK 60

Query: 49  VYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRL 108
            Y    EKE RF+IFK+N+ +I   N  A N+ YK+G+N FAD TNEE+R+   G +   
Sbjct: 61  SYNALGEKERRFQIFKDNLRFIDEHN--AENRTYKVGLNRFADLTNEEYRSMYLGTRTAA 118

Query: 109 PSVRSSETTD-VSFRYENASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHI 167
               S++ +D  +FR  + S+P S+DWRKKGAV  VKDQG CG CWAFS +AA+EGIN I
Sbjct: 119 KRRSSNKISDRYAFRVGD-SLPESVDWRKKGAVVEVKDQGSCGSCWAFSTIAAVEGINKI 177

Query: 168 TTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNK 227
            T  L SLSEQELVDCDTS  ++GC GGLMD AFEFII+N G+ +E  YPYKASDG C++
Sbjct: 178 VTGGLISLSEQELVDCDTS-YNEGCNGGLMDYAFEFIINNGGIDSEEDYPYKASDGRCDQ 236

Query: 228 KEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTEL 287
              N     I GYEDVP N+E +L KAVANQPVSVAI+A G +FQ Y SG+FTG+CGT L
Sbjct: 237 YRKNAKVVTIDGYEDVPENDEKSLEKAVANQPVSVAIEAGGREFQLYQSGIFTGRCGTAL 296

Query: 288 DHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDI-DAKEGLCGIAMQASYP 344
           DHGVTAVGYGT ++G  YW+VKNSWG +WGE GYIRM+RD+  +  G CGIAM+ASYP
Sbjct: 297 DHGVTAVGYGT-ENGVDYWIVKNSWGASWGEEGYIRMERDLATSATGKCGIAMEASYP 353


>gi|255646088|gb|ACU23531.1| unknown [Glycine max]
          Length = 362

 Score =  363 bits (932), Expect = 7e-98,   Method: Compositional matrix adjust.
 Identities = 184/308 (59%), Positives = 214/308 (69%), Gaps = 6/308 (1%)

Query: 39  HEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFR 98
           +E W + Y  V R   +K  RF +FK NV ++   N    +KPYKL +N+FAD TN EFR
Sbjct: 40  YERWRS-YRTVSRSLGDKHKRFNVFKANVMHV--HNTNKMDKPYKLKLNKFADMTNHEFR 96

Query: 99  APRNGYK-RRLPSVRSSETTDVSFRYEN-ASVPASIDWRKKGAVTGVKDQGQCGCCWAFS 156
           +   G K       + +   + +F YE   SVP S DWRK GAVTGVKDQGQCG CWAFS
Sbjct: 97  STYAGSKVNHHRMFQGTPRGNGTFMYEKVGSVPPSADWRKNGAVTGVKDQGQCGSCWAFS 156

Query: 157 AVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKY 216
            V A+EGIN I T KL SLSEQELVDCDT  ++ GC GGLM+ AFEFI    G+ TE+ Y
Sbjct: 157 TVVAVEGINQIKTNKLVSLSEQELVDCDTK-KNAGCNGGLMESAFEFIKQKGGITTESNY 215

Query: 217 PYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSS 276
           PY A DG+C+  +AN  A  I G+E+VP+N+E AL+KAVANQPVSVAIDA G DFQFY  
Sbjct: 216 PYTAQDGTCDASKANDLAVSIDGHENVPANDENALLKAVANQPVSVAIDAGGFDFQFYFE 275

Query: 277 GVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCG 336
           GVFTG C TEL+HGV  VGYGT  DGT YW V+NSWG  WGE GYIRMQR I  KEGLCG
Sbjct: 276 GVFTGDCSTELNHGVAIVGYGTTVDGTNYWTVRNSWGPEWGEQGYIRMQRSIFKKEGLCG 335

Query: 337 IAMQASYP 344
           IAM ASYP
Sbjct: 336 IAMMASYP 343


>gi|302816222|ref|XP_002989790.1| hypothetical protein SELMODRAFT_184826 [Selaginella moellendorffii]
 gi|300142356|gb|EFJ09057.1| hypothetical protein SELMODRAFT_184826 [Selaginella moellendorffii]
          Length = 358

 Score =  363 bits (931), Expect = 8e-98,   Method: Compositional matrix adjust.
 Identities = 183/319 (57%), Positives = 224/319 (70%), Gaps = 12/319 (3%)

Query: 32  DATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFAD 91
           D +    +E WM  +GRVY    EKE RF+IF++N EYI   +N+  N+ Y LG+N FAD
Sbjct: 27  DRSFRALYEKWMVDHGRVYNGIGEKERRFQIFRDNAEYIEE-HNRQVNQTYWLGLNNFAD 85

Query: 92  QTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENAS-VPASIDWRKKGAVTGVKDQGQCG 150
            T++EF+A   G K  L     S T    FRY++A+ +P   DWR KGAV  VK+QG CG
Sbjct: 86  MTHDEFKALYFGTKVPL-----SNTIKSGFRYKDATNLPLDTDWRSKGAVATVKNQGACG 140

Query: 151 CCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGL 210
            CWAFS VAA+EG+N I T +L SLSEQELVDCD   ++QGC GGLMD AFEFII N GL
Sbjct: 141 SCWAFSTVAAVEGVNQIVTGELVSLSEQELVDCDKQ-KNQGCNGGLMDSAFEFIIQNGGL 199

Query: 211 ATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSD 270
            +EA YPYKA  GSC++   N     I G+EDVP+ +EA L+KAVANQPVSVAI+ASG +
Sbjct: 200 DSEADYPYKAVSGSCDESRRNSHVVTIDGFEDVPAESEADLLKAVANQPVSVAIEASGRN 259

Query: 271 FQFYSSGVFTGQCGTELDHGVTAVGYGTAD--DG--TKYWLVKNSWGTTWGENGYIRMQR 326
           FQ YS GV+TG CG ELDHGV AVGYGT+   DG  T YW+V+NSWG  WGE+GYIR+QR
Sbjct: 260 FQLYSGGVYTGHCGYELDHGVVAVGYGTSKTPDGVATDYWIVRNSWGDAWGESGYIRLQR 319

Query: 327 DIDAKEGLCGIAMQASYPT 345
           ++ +  G CGIAM ASYP 
Sbjct: 320 NVASPRGKCGIAMMASYPV 338


>gi|358343350|ref|XP_003635767.1| Cysteine proteinase [Medicago truncatula]
 gi|355501702|gb|AES82905.1| Cysteine proteinase [Medicago truncatula]
          Length = 338

 Score =  363 bits (931), Expect = 8e-98,   Method: Compositional matrix adjust.
 Identities = 181/342 (52%), Positives = 234/342 (68%), Gaps = 17/342 (4%)

Query: 11  VLAAILVLGVW-----APQSWSR-TLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFK 64
           +  +I++L +W      P+  ++ + N A M +R+E W+ +YGR YRD  E E+RF I++
Sbjct: 5   ITLSIVILNLWIIASACPEIHTKNSTNPAVMKKRYETWLKRYGRHYRDREEWEVRFDIYQ 64

Query: 65  ENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRY- 123
            NV+YI  +N  ++N  YKL  N FAD TNEEF++   GY   LP  R        FRY 
Sbjct: 65  SNVQYIEFYN--SQNYSYKLIDNRFADITNEEFKSTYLGY---LPRFR----VQTEFRYH 115

Query: 124 ENASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDC 183
           ++  +P SIDWRKKGAVT VKDQG+CG CWAFSAVAA+EGIN I T  L SLSEQ+L+DC
Sbjct: 116 KHGELPKSIDWRKKGAVTHVKDQGRCGSCWAFSAVAAVEGINKIKTENLVSLSEQQLIDC 175

Query: 184 DTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDV 243
           D    ++GCEGG M  AF +I  + G+AT  +YPYK  DG+CNK +A  +A  ISGYE V
Sbjct: 176 DIKSGNEGCEGGDMYIAFNYIKKHGGIATAKEYPYKGRDGNCNKSKAKNNAVTISGYESV 235

Query: 244 PSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGT 303
           P+ NE  L  AVA+QPVS+A DA G  FQFYS G+F+G CG  L+HG+T VGYG  ++G 
Sbjct: 236 PARNEKMLKAAVAHQPVSIATDAGGYAFQFYSKGIFSGSCGKNLNHGMTIVGYG-EENGD 294

Query: 304 KYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPT 345
           KYW+VKNSW   WGE+GY+RM+RD   K+G CGIAM A+YP 
Sbjct: 295 KYWIVKNSWANDWGESGYVRMKRDTKDKDGTCGIAMDATYPV 336


>gi|115477767|ref|NP_001062479.1| Os08g0556900 [Oryza sativa Japonica Group]
 gi|42407937|dbj|BAD09076.1| putative cysteine proteinase [Oryza sativa Japonica Group]
 gi|113624448|dbj|BAF24393.1| Os08g0556900 [Oryza sativa Japonica Group]
 gi|125562525|gb|EAZ07973.1| hypothetical protein OsI_30231 [Oryza sativa Indica Group]
 gi|215701458|dbj|BAG92882.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 385

 Score =  362 bits (930), Expect = 1e-97,   Method: Compositional matrix adjust.
 Identities = 184/313 (58%), Positives = 222/313 (70%), Gaps = 11/313 (3%)

Query: 37  ERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEE 96
           E +E W  Q+ RV RD  EK  RF +FK+NV  I  FN   R++PYKL +N F D T +E
Sbjct: 46  ELYERWRGQH-RVARDLGEKARRFNVFKDNVRLIHEFNR--RDEPYKLRLNRFGDMTADE 102

Query: 97  FR----APRNGYKRRLPSVRSSETTDVSFRYENA-SVPASIDWRKKGAVTGVKDQGQCGC 151
           FR    + R  + R     R        F Y  A  +PA++DWR+KGAV  VKDQGQCG 
Sbjct: 103 FRRAYASSRVSHHRMF---RGRGERRSGFMYAGARDLPAAVDWREKGAVGAVKDQGQCGS 159

Query: 152 CWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLA 211
           CWAFS +AA+EGIN I T  LT+LSEQ+LVDCDT   + GC+GGLMD+AF++I  + G+A
Sbjct: 160 CWAFSTIAAVEGINAIRTSNLTALSEQQLVDCDTKTGNAGCDGGLMDNAFQYIAKHGGVA 219

Query: 212 TEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDF 271
             + YPY+A   SC    A+  A  I GYEDVP+N+E+AL KAVANQPVSVAI+A GS F
Sbjct: 220 ASSAYPYRARQSSCKSSAASSPAVTIDGYEDVPANSESALKKAVANQPVSVAIEAGGSHF 279

Query: 272 QFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAK 331
           QFYS GVF G+CGTELDHGV AVGYGT  DGTKYW+V+NSWG  WGE GYIRM+RD+ AK
Sbjct: 280 QFYSEGVFAGKCGTELDHGVAAVGYGTTVDGTKYWIVRNSWGADWGEKGYIRMKRDVSAK 339

Query: 332 EGLCGIAMQASYP 344
           EGLCGIAM+ASYP
Sbjct: 340 EGLCGIAMEASYP 352


>gi|302796898|ref|XP_002980210.1| hypothetical protein SELMODRAFT_153766 [Selaginella moellendorffii]
 gi|300151826|gb|EFJ18470.1| hypothetical protein SELMODRAFT_153766 [Selaginella moellendorffii]
          Length = 479

 Score =  362 bits (930), Expect = 1e-97,   Method: Compositional matrix adjust.
 Identities = 190/343 (55%), Positives = 233/343 (67%), Gaps = 22/343 (6%)

Query: 14  AILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNA--------EKEMRFKIFKE 65
           +IL LG + PQ  S   ++  +    + WM Q+G+ Y DNA        EK  R+ IFK+
Sbjct: 36  SILDLG-YDPQDLS---SEERLQALFDSWMLQHGKSYADNALSGDSQAGEKATRYGIFKD 91

Query: 66  NVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYEN 125
           N+ +I   N K  N+ Y LG+N FAD TNEEFRA R+G +      R+S      FRY +
Sbjct: 92  NLRFIHGENEK--NQGYFLGLNAFADLTNEEFRAQRHGGRFDRSRERTSHE---EFRYGS 146

Query: 126 ASV---PASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVD 182
             +   P SIDWR+KGAV GVKDQG CG CWAFSAVAA+EG+N + T +L SLSEQELVD
Sbjct: 147 VQLKDLPDSIDWREKGAVVGVKDQGSCGSCWAFSAVAAIEGVNKLATGELVSLSEQELVD 206

Query: 183 CDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYED 242
           CD  GED+GC GGLMD AF F+I N GL TEA YPYK     C++ + N     I GYED
Sbjct: 207 CD-KGEDEGCNGGLMDYAFGFVIKNGGLDTEADYPYKGYGTRCDRSKMNAKVVTIDGYED 265

Query: 243 VPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDG 302
           VP N+E AL+KAVA+QPVSVAIDA GS  QFY SG+FTG+CGT+LDHGVT VGYG  +DG
Sbjct: 266 VPVNDETALLKAVAHQPVSVAIDAGGSSMQFYRSGIFTGRCGTDLDHGVTNVGYGK-EDG 324

Query: 303 TKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPT 345
             YW++KNSWG+ WGE GY++M R+     GLCGI M+ASYPT
Sbjct: 325 KAYWIIKNSWGSNWGEKGYVKMARNTGLAAGLCGINMEASYPT 367


>gi|2224808|emb|CAB09697.1| cysteine endopeptidase EP-A [Hordeum vulgare subsp. vulgare]
 gi|326502180|dbj|BAK06781.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 365

 Score =  362 bits (928), Expect = 2e-97,   Method: Compositional matrix adjust.
 Identities = 184/312 (58%), Positives = 223/312 (71%), Gaps = 10/312 (3%)

Query: 39  HEMWMAQYGRVYRD---NAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNE 95
           +E W + Y    R    +AE E RF +FKEN  Y+   N   R++P++L +N+FAD T +
Sbjct: 41  YERWRSHYTVSRRGLGADAE-ERRFNVFKENARYVHEGNK--RDRPFRLALNKFADMTTD 97

Query: 96  EFRAPRNGYK-RRLPSVRSSETTDVSFRYENA-SVPASIDWRKKGAVTGVKDQGQCGCCW 153
           EFR    G + R   S+      D  FRY +A ++P ++DWR+KGAVT +KDQGQCG CW
Sbjct: 98  EFRRTYAGSRVRHHLSLSGGRRGDGGFRYADADNLPPAVDWRQKGAVTAIKDQGQCGSCW 157

Query: 154 AFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATE 213
           AFS + A+EGIN I T KL SLSEQEL+DCD    +QGCEGGLMD AF+FI  N G+ TE
Sbjct: 158 AFSTIVAVEGINKIRTGKLVSLSEQELMDCDNV-NNQGCEGGLMDYAFQFIQKN-GITTE 215

Query: 214 AKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQF 273
           + YPY+   GSC++ + N  A  I GYEDVP+N+E+AL KAVA QPVSVAIDASG DFQF
Sbjct: 216 SNYPYQGEQGSCDQAKENAQAVTIDGYEDVPANDESALQKAVAGQPVSVAIDASGQDFQF 275

Query: 274 YSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEG 333
           YS GVFTG+C T+LDHGV AVGYG   DGTKYW+VKNSWG  WGE GYIRMQR +   EG
Sbjct: 276 YSEGVFTGECSTDLDHGVAAVGYGATRDGTKYWIVKNSWGEDWGEKGYIRMQRGVSQTEG 335

Query: 334 LCGIAMQASYPT 345
           LCGIAMQASYPT
Sbjct: 336 LCGIAMQASYPT 347


>gi|357156854|ref|XP_003577598.1| PREDICTED: thiol protease SEN102-like [Brachypodium distachyon]
          Length = 368

 Score =  361 bits (927), Expect = 2e-97,   Method: Compositional matrix adjust.
 Identities = 175/288 (60%), Positives = 212/288 (73%), Gaps = 5/288 (1%)

Query: 59  RFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYK-RRLPSVRSSETT 117
           RF +FKENV+YI   N K R  P++L +N+FAD T +E R    G + R   ++      
Sbjct: 68  RFNVFKENVKYIHEANKKDR--PFRLALNKFADMTTDELRHSYAGSRVRHHRALSGGRRA 125

Query: 118 DVSFRYENA-SVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLS 176
             +F Y +A ++P ++DWR+KGAVTG+KDQGQCG CWAFS +AA+E IN I T KL SLS
Sbjct: 126 QGNFTYSDAENLPPAVDWREKGAVTGIKDQGQCGSCWAFSTIAAVESINKIRTGKLVSLS 185

Query: 177 EQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAK 236
           EQEL+DCD    DQGC+GGLMD AF+FI  N G+ +EA YPY+    +C++ + N     
Sbjct: 186 EQELMDCDNV-NDQGCDGGLMDYAFQFIQKNGGVTSEANYPYQGQQNTCDQAKENTHDVA 244

Query: 237 ISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGY 296
           I GYEDVP+N+E+AL KAVA QPVSVAI+ASG DFQFYS GVFTGQC T+LDHGV AVGY
Sbjct: 245 IDGYEDVPANDESALQKAVAYQPVSVAIEASGQDFQFYSEGVFTGQCTTDLDHGVAAVGY 304

Query: 297 GTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
           GTA DGTKYW+VKNSWG  WGE GYIRMQR +   EGLCGIAMQASYP
Sbjct: 305 GTARDGTKYWIVKNSWGLDWGEKGYIRMQRGVSQAEGLCGIAMQASYP 352


>gi|62526575|gb|AAX84673.1| cysteine protease CP1 [Manihot esculenta]
          Length = 467

 Score =  361 bits (927), Expect = 3e-97,   Method: Compositional matrix adjust.
 Identities = 176/306 (57%), Positives = 223/306 (72%), Gaps = 4/306 (1%)

Query: 39  HEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFR 98
           +E W+ + G+VY    E+E RF++FK+N+ +I   N++  N+ YKLG+N FAD TNEE+R
Sbjct: 52  YEEWLVKQGKVYNALGEREKRFQVFKDNLRFIDEHNSE--NRTYKLGLNGFADLTNEEYR 109

Query: 99  APRNGYKRRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAV 158
           +   G +  +   R  +T+D        S+P S+DWRK+GAV  VKDQG CG CWAFS +
Sbjct: 110 STYLGARGGMKRNRLRKTSDRYAPRVGESLPDSVDWRKEGAVAEVKDQGSCGSCWAFSTI 169

Query: 159 AAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPY 218
           AA+EGIN I T  L SLSEQELVDCDTS  ++GC GGLMD AFEFII+N G+ TE  YPY
Sbjct: 170 AAVEGINKIVTGDLISLSEQELVDCDTS-YNEGCNGGLMDYAFEFIINNGGIDTEEDYPY 228

Query: 219 KASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGV 278
            A DG C+    N     I  YEDVP N+E AL KAVANQPVSVAI+A G DFQFY+SG+
Sbjct: 229 LARDGRCDTYRKNAKVVTIDDYEDVPVNSETALQKAVANQPVSVAIEAGGRDFQFYASGI 288

Query: 279 FTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIA 338
           F+G+CGT+LDHGV AVGYGT ++G  YW+V+NSWG +WGENGY+RM R I++  G+CGIA
Sbjct: 289 FSGRCGTQLDHGVAAVGYGT-ENGKDYWIVRNSWGKSWGENGYLRMARSINSPTGICGIA 347

Query: 339 MQASYP 344
           M+ASYP
Sbjct: 348 MEASYP 353


>gi|357474523|ref|XP_003607546.1| Cysteine proteinase [Medicago truncatula]
 gi|358347207|ref|XP_003637651.1| Cysteine proteinase [Medicago truncatula]
 gi|355503586|gb|AES84789.1| Cysteine proteinase [Medicago truncatula]
 gi|355508601|gb|AES89743.1| Cysteine proteinase [Medicago truncatula]
          Length = 345

 Score =  361 bits (926), Expect = 3e-97,   Method: Compositional matrix adjust.
 Identities = 185/343 (53%), Positives = 237/343 (69%), Gaps = 13/343 (3%)

Query: 3   MILLENKLVLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKI 62
           +++L N  V+A+        P    ++ +   M +R + W+ ++GR Y+ N E+E+RF I
Sbjct: 13  LLMLCNTCVIAS---ESECPPTHKQKSSDVEAMKKRFDGWVKRHGRKYKHNDEREVRFGI 69

Query: 63  FKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFR 122
           ++ NV+YI   N  A+   Y L  N+FAD TNEEF++   G   RL   RS  T    FR
Sbjct: 70  YQANVQYIQCKN--AQKNSYNLTDNKFADLTNEEFQSTYMGLSTRL---RSHNT---GFR 121

Query: 123 Y-ENASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELV 181
           Y E+  +P S DWRK+GAVT + DQGQCG CWAF+AVAA+EGIN I + KL SLSEQEL+
Sbjct: 122 YDEHGDLPESKDWRKEGAVTEIMDQGQCGGCWAFAAVAAVEGINKIKSGKLISLSEQELI 181

Query: 182 DCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYE 241
           DCD    +QGC+GGLM+ A+ FII N GL TE  YPY+  DG+C  ++A   AA ISGYE
Sbjct: 182 DCDVKSGNQGCQGGLMETAYTFIIENGGLTTEQDYPYEGVDGTCKMEKAAHYAASISGYE 241

Query: 242 DVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADD 301
           +VP++NEA L  A A+QPVSVAIDA G  FQFYS GVF+G CG +L+HGVT VGYG  + 
Sbjct: 242 EVPADNEAKLKAAAAHQPVSVAIDAGGYSFQFYSEGVFSGICGKQLNHGVTVVGYG-KET 300

Query: 302 GTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
             KYW+VKNSWG  WGE+GYIRM+RD  +KEG+CGIAMQASYP
Sbjct: 301 INKYWIVKNSWGADWGESGYIRMKRDTLSKEGMCGIAMQASYP 343


>gi|34223513|gb|AAQ62999.1| oil palm polygalacturonase allergen PEST472 [Elaeis guineensis]
          Length = 525

 Score =  361 bits (926), Expect = 4e-97,   Method: Compositional matrix adjust.
 Identities = 180/314 (57%), Positives = 226/314 (71%), Gaps = 6/314 (1%)

Query: 35  MNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKAR--NKPYKLGINEFADQ 92
           M   +E W+A++GR      EKE RF+IFK+NV +I + N  A   ++ ++LG+N FAD 
Sbjct: 46  MRLLYEGWLAKHGRADNALGEKERRFEIFKDNVRFIDAHNAAADSGHRSFRLGLNRFADM 105

Query: 93  TNEEFRAPRNGYKRRLPSVRSSETTDVSFRYE-NASVPASIDWRKKGAVTGVKDQGQCGC 151
           TNEE+R    G +      R+   +D  +RY     +P S+DWR KGAVT VKDQG CG 
Sbjct: 106 TNEEYRTVYLGTRPASHRRRARLGSD-RYRYNAGEELPESVDWRDKGAVTTVKDQGSCGS 164

Query: 152 CWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLA 211
           CWAFS +AA+EGIN I T  L SLSEQELVDCD +G++QGC GGLMD AFEFII+N G+ 
Sbjct: 165 CWAFSTIAAVEGINKIVTGDLISLSEQELVDCD-NGQNQGCNGGLMDYAFEFIINNGGID 223

Query: 212 TEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDF 271
           TE  YPYKA DG C++   N     I GYEDVP N+E AL KAVANQPVSVAI+A G +F
Sbjct: 224 TEEDYPYKARDGKCDQYRKNAKVVSIDGYEDVPVNDEKALQKAVANQPVSVAIEAGGREF 283

Query: 272 QFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAK 331
           Q Y SG+FTG+CGT+LDHGV AVGYGT ++G  YW+V+NSWG  WGE+GYIRM+R+++A 
Sbjct: 284 QLYHSGIFTGRCGTDLDHGVVAVGYGT-ENGKDYWIVRNSWGGDWGESGYIRMERNVNAS 342

Query: 332 EGLCGIAMQASYPT 345
            G CGIAM++SYPT
Sbjct: 343 TGKCGIAMESSYPT 356


>gi|255563134|ref|XP_002522571.1| cysteine protease, putative [Ricinus communis]
 gi|223538262|gb|EEF39871.1| cysteine protease, putative [Ricinus communis]
          Length = 343

 Score =  360 bits (925), Expect = 4e-97,   Method: Compositional matrix adjust.
 Identities = 188/351 (53%), Positives = 243/351 (69%), Gaps = 16/351 (4%)

Query: 1   MAMILLENKLVLAAILVLGVWAPQSWSR-TLNDATMNERHEMWMAQYGRVYRDNAEKEMR 59
           M + L   KLV+  +++LG W  Q+  R  LN   + E+HE WMA++GR Y DNAEKE R
Sbjct: 1   MPLSLQITKLVITLLMILGTWVSQAMPRPLLNAEAIAEKHEQWMARHGRTYHDNAEKERR 60

Query: 60  FKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYK--RRLPSVRSS-ET 116
           F+IFK N++YI +FN KA NK YKLG+N+F+D + EEF    NGY+    LP+  ++ + 
Sbjct: 61  FQIFKNNLDYIENFN-KAFNKTYKLGLNKFSDLSEEEFVTTYNGYEMPTTLPTANTTVKP 119

Query: 117 TDVSFRYENASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLS 176
           T  S  Y    VP SIDWR+ G VT VK+QG+CGCCWAFSAVAA+EGI         SLS
Sbjct: 120 TFFSNYYNQDEVPESIDWRENGVVTSVKNQGECGCCWAFSAVAAVEGI----AGNGASLS 175

Query: 177 EQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAK 236
            Q+L+DC   G++ GC GG M  AFE+I+ N+G+ ++  YPY+ +   C  +  +  AA+
Sbjct: 176 AQQLLDC--VGDNSGCGGGTMIKAFEYIVQNQGIVSDTDYPYEQTQEMC--RSGSNVAAR 231

Query: 237 ISGYEDVPSNNEAALMKAVANQPVSVAIDAS-GSDFQFYSSGVFTGQ-CGTELDHGVTAV 294
           I+GYE V  + EA L +AVA QP+SVAIDAS G +F+ Y SGVF+ + CGT L H VT V
Sbjct: 232 ITGYESVIQSEEA-LKRAVAKQPISVAIDASSGPNFKSYISGVFSAEDCGTHLTHAVTLV 290

Query: 295 GYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPT 345
           GYGT +DGTKYWLVKNSWG  WGE+GY+R+QRD+ A EG CGIAMQASYPT
Sbjct: 291 GYGTTEDGTKYWLVKNSWGEEWGESGYMRLQRDVGAMEGPCGIAMQASYPT 341


>gi|109390302|gb|ABG33750.1| cysteine protease [Hevea brasiliensis]
          Length = 457

 Score =  360 bits (925), Expect = 5e-97,   Method: Compositional matrix adjust.
 Identities = 178/309 (57%), Positives = 223/309 (72%), Gaps = 10/309 (3%)

Query: 39  HEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFR 98
           +E W+ ++G+ Y    EKE RF++FK+N+ +I   N++  N+ Y++G+N FAD TNEE+R
Sbjct: 42  YEDWLVKHGKAYNSLGEKERRFEVFKDNLRFIDEHNSE--NRTYRVGLNRFADLTNEEYR 99

Query: 99  APRNGYKRRLPSVRSSETTDVSFRYE---NASVPASIDWRKKGAVTGVKDQGQCGCCWAF 155
           +    Y   L  +R ++   +S RY      S+P S+DWRK+GAV GVKDQG CG CWAF
Sbjct: 100 SM---YLGALSGIRRNKLRKISDRYTPRVGDSLPDSVDWRKEGAVVGVKDQGSCGSCWAF 156

Query: 156 SAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAK 215
           SAVAA+EGIN I T  L SLSEQELVDCD S  ++GC GGLMD  FEFII+N G+ +E  
Sbjct: 157 SAVAAVEGINKIVTGDLISLSEQELVDCDNS-YNEGCNGGLMDYGFEFIINNGGIDSEED 215

Query: 216 YPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYS 275
           YPY A DG C+    N     I  YEDVP NNEAAL KAVANQPVSVAI+A G DFQ YS
Sbjct: 216 YPYLARDGRCDTYRKNARVVSIDSYEDVPVNNEAALQKAVANQPVSVAIEAGGRDFQLYS 275

Query: 276 SGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLC 335
           SGVF+G+CGT LDHGV AVGYGT ++G  YW+V+NSWG +WGE+GY+RM R+I    G+C
Sbjct: 276 SGVFSGRCGTALDHGVVAVGYGT-ENGQDYWIVRNSWGKSWGESGYLRMARNIRKPTGIC 334

Query: 336 GIAMQASYP 344
           GIAM+ASYP
Sbjct: 335 GIAMEASYP 343


>gi|302759380|ref|XP_002963113.1| hypothetical protein SELMODRAFT_270344 [Selaginella moellendorffii]
 gi|300169974|gb|EFJ36576.1| hypothetical protein SELMODRAFT_270344 [Selaginella moellendorffii]
          Length = 479

 Score =  360 bits (924), Expect = 5e-97,   Method: Compositional matrix adjust.
 Identities = 192/344 (55%), Positives = 234/344 (68%), Gaps = 24/344 (6%)

Query: 14  AILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNA--------EKEMRFKIFKE 65
           +IL LG + PQ  S   ++  +    + WM Q+G+ Y +NA        EK  R+ IFK+
Sbjct: 36  SILDLG-YDPQDLS---SEERLQALFDSWMLQHGKSYAENALSGDSQAGEKATRYGIFKD 91

Query: 66  NVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVS-FRYE 124
           N+ +I   N K  N+ Y LG+N FAD TNEEFRA R+G +      RS E T    FRY 
Sbjct: 92  NLRFIHGENEK--NQGYFLGLNAFADLTNEEFRAQRHGGRFD----RSRERTSYEEFRYG 145

Query: 125 NASV---PASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELV 181
           +  +   P SIDWR+KGAV GVKDQG CG CWAFSAVAA+EG+N + T +L SLSEQELV
Sbjct: 146 SVQLKDLPDSIDWREKGAVVGVKDQGSCGSCWAFSAVAAIEGVNKLATGELVSLSEQELV 205

Query: 182 DCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYE 241
           DCD  GED+GC GGLMD AF F+I N GL TEA YPYK     C++ + N     I GYE
Sbjct: 206 DCD-KGEDEGCNGGLMDYAFGFVIKNGGLDTEADYPYKGYGTRCDRSKMNAKVVTIDGYE 264

Query: 242 DVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADD 301
           DVP N+E AL+KAVA+QPVSVAIDA GS  QFY SG+FTG+CGT+LDHGVT VGYG  +D
Sbjct: 265 DVPVNDETALLKAVAHQPVSVAIDAGGSSMQFYRSGIFTGRCGTDLDHGVTNVGYGK-ED 323

Query: 302 GTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPT 345
           G  YW++KNSWG+ WGE GYI+M R+     GLCGI M+ASYPT
Sbjct: 324 GKAYWIIKNSWGSNWGEKGYIKMARNTGLAAGLCGINMEASYPT 367


>gi|414591545|tpg|DAA42116.1| TPA: hypothetical protein ZEAMMB73_388689 [Zea mays]
          Length = 384

 Score =  360 bits (924), Expect = 5e-97,   Method: Compositional matrix adjust.
 Identities = 181/333 (54%), Positives = 223/333 (66%), Gaps = 12/333 (3%)

Query: 23  PQSWSRTLNDATMNERHEMWMAQYGRVY-RDNAEKEM---RFKIFKENVEYIASFNNKAR 78
           P S     ++ ++   +E W + Y RV  RD  +K+    RF +FKEN  Y+   N K  
Sbjct: 25  PFSERDLASEESLRALYERWRSHYHRVSPRDGDDKQQQARRFNVFKENARYVHEANRKD- 83

Query: 79  NKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYE------NASVPASI 132
            +P++L +N+FAD T +EFR    G + R    +  E    +            ++P ++
Sbjct: 84  GRPFRLALNKFADMTTDEFRRTYAGSRTRHHRAQLGEARSFAHAQHGRGGSGTTNLPPAV 143

Query: 133 DWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGC 192
           DWR +GAVTGVKDQGQCG CWAFSA+AA+EG+N I T KL SLSEQELVDCD   ++QGC
Sbjct: 144 DWRLRGAVTGVKDQGQCGSCWAFSAIAAVEGVNKIMTGKLVSLSEQELVDCDDV-DNQGC 202

Query: 193 EGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALM 252
           +GGLMD AF++I  N G+ TE+ YPY A   SCNK +       I GYEDVP+NNE AL 
Sbjct: 203 DGGLMDYAFQYIQRNGGVTTESNYPYLAEQRSCNKAKERSHDVTIDGYEDVPANNEDALQ 262

Query: 253 KAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSW 312
           KAVA+QPV+VAI+ASG DFQFYS GVFTG CGT+LDHGV AVGYGT  DGTKYW VKNSW
Sbjct: 263 KAVASQPVAVAIEASGQDFQFYSEGVFTGSCGTDLDHGVAAVGYGTTGDGTKYWTVKNSW 322

Query: 313 GTTWGENGYIRMQRDIDAKEGLCGIAMQASYPT 345
           G  WGE GYIRMQR +    GLCGIAM+ SYPT
Sbjct: 323 GEDWGERGYIRMQRGVPDSRGLCGIAMEPSYPT 355


>gi|262360187|gb|ACY38051.2| cysteine proteinase C1A [Dactylis glomerata]
          Length = 365

 Score =  360 bits (924), Expect = 6e-97,   Method: Compositional matrix adjust.
 Identities = 187/335 (55%), Positives = 229/335 (68%), Gaps = 10/335 (2%)

Query: 16  LVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRD-NAEKEMR-FKIFKENVEYIASF 73
           L LGV  P +     ++ ++   +E W + +    R   AE E R F +FKENV YI   
Sbjct: 19  LALGV--PFTEKDLASEESLRGLYETWRSHHTVSRRGLGAEAEARRFNVFKENVRYIHEA 76

Query: 74  NNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPS--VRSSETTDVSFRYENA-SVPA 130
           N K R  P++L +N+FAD T +EFR    G + R              SF Y +A ++PA
Sbjct: 77  NKKDR--PFRLALNKFADMTTDEFRRTYAGSRVRHHRSLSGGRRQGGGSFMYADAENLPA 134

Query: 131 SIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQ 190
           ++DWR+KGAVT +KDQGQCG CWAFS + A+EGIN I T +L SLSEQEL+DC+  GE+ 
Sbjct: 135 AVDWRQKGAVTPIKDQGQCGSCWAFSTIVAVEGINKIRTGRLVSLSEQELMDCNI-GEND 193

Query: 191 GCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAA 250
           GC GGLMD AF+FI  N G+ TEA YPY+    SC++ + N     I GYEDVP+N+E+A
Sbjct: 194 GCNGGLMDVAFQFIQQNGGITTEASYPYQGEQNSCDQSKENSHDVSIDGYEDVPANDESA 253

Query: 251 LMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKN 310
           L KAVANQPVSVAIDASG+DFQFYS GVFT   GT+LDHGV AVGYGT  DGTKYW+VKN
Sbjct: 254 LQKAVANQPVSVAIDASGNDFQFYSEGVFTTDGGTDLDHGVAAVGYGTTRDGTKYWIVKN 313

Query: 311 SWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPT 345
           SWG  WGE GYIRMQR +   EGLCGIAM+ASYPT
Sbjct: 314 SWGEDWGEKGYIRMQRGVKQAEGLCGIAMEASYPT 348


>gi|195637152|gb|ACG38044.1| vignain precursor [Zea mays]
          Length = 377

 Score =  360 bits (924), Expect = 6e-97,   Method: Compositional matrix adjust.
 Identities = 180/297 (60%), Positives = 210/297 (70%), Gaps = 10/297 (3%)

Query: 54  AEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFR----APRNGYKRRLP 109
           A +   F +FK NV  I  FN   R++PYKL +N F D T +EFR      R  + R   
Sbjct: 64  ATRRAVFNVFKANVRLIHEFNR--RDEPYKLRLNRFGDMTADEFRRHYAGSRVAHHRMFR 121

Query: 110 SVRSSETTDVSFRYENA-SVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHIT 168
             R   +   SF Y +A  VPAS+DWR+KGAVT VKDQGQCG CWAFS +AA+EGIN I 
Sbjct: 122 GDRQGSSASASFMYADARDVPASVDWRQKGAVTDVKDQGQCGSCWAFSTIAAVEGINAIK 181

Query: 169 TRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKK 228
           T+ LTSLSEQ+LVDCDT   + GC GGLMD AF++I  + G+A E  YPY+A   SC K 
Sbjct: 182 TKNLTSLSEQQLVDCDTKA-NAGCNGGLMDYAFQYIAKHGGVAAEDAYPYRARQASCKKS 240

Query: 229 EANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELD 288
            A      I GYEDVP+N+E+AL KAVA+QPVSVAI+ASGS FQFYS GVF+G+CGTELD
Sbjct: 241 PA--PVVTIDGYEDVPANDESALKKAVAHQPVSVAIEASGSHFQFYSEGVFSGRCGTELD 298

Query: 289 HGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPT 345
           HGV AVGYG   DGTKYWLVKNSWG  WGE GYIRM RD+ AKEG CGIAM+ASYP 
Sbjct: 299 HGVAAVGYGVTADGTKYWLVKNSWGPEWGEKGYIRMARDVAAKEGHCGIAMEASYPV 355


>gi|1173630|gb|AAB37233.1| cysteine proteinase [Phalaenopsis sp. SM9108]
          Length = 359

 Score =  360 bits (923), Expect = 6e-97,   Method: Compositional matrix adjust.
 Identities = 184/313 (58%), Positives = 223/313 (71%), Gaps = 13/313 (4%)

Query: 39  HEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFR 98
           +E W + +  V RD  EK+ RF +FKEN  YI  FN K ++ PYKL +N+FAD TN EFR
Sbjct: 38  YERWRSHH-TVSRDLDEKQKRFNVFKENPRYIHDFN-KRKDIPYKLRLNKFADLTNHEFR 95

Query: 99  A----PRNGYKRRLPSVRSSETTDVSFRYENA---SVPASIDWRKKGAVTGVKDQGQCGC 151
           +     R  + R L   R    T+ SF Y++    S+PASIDWR+KGAVT VKDQGQCG 
Sbjct: 96  STYAGSRINHHRSLRGSRRGGATN-SFMYQSLDSRSLPASIDWRQKGAVTAVKDQGQCGS 154

Query: 152 CWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLA 211
           CWAFS VAA+EGIN I T+KL SLSEQEL+DCDT  E+ GC GGLMD AF+FI  N G++
Sbjct: 155 CWAFSTVAAVEGINQIKTKKLLSLSEQELIDCDTD-ENNGCNGGLMDYAFDFIKKNGGIS 213

Query: 212 TEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDF 271
           +EA+YPY A D  C   E       I G+EDVP+N+E +L+KAVANQPVS+AI+ASG DF
Sbjct: 214 SEAEYPYAAEDSYC-ATEKKSHVVSIDGHEDVPANDEDSLLKAVANQPVSIAIEASGYDF 272

Query: 272 QFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAK 331
           QFYS GVFTG+ GTELDHGV  VGYG    GTKYW+V+NSWG  WGE GYIR+    D+K
Sbjct: 273 QFYSEGVFTGRSGTELDHGVAIVGYGKTQQGTKYWIVRNSWGAEWGEKGYIRISAASDSK 332

Query: 332 EGLCGIAMQASYP 344
             LCG+AM+ASYP
Sbjct: 333 R-LCGLAMEASYP 344


>gi|1208549|gb|AAC49455.1| Pseudotzain [Pseudotsuga menziesii]
          Length = 454

 Score =  358 bits (920), Expect = 2e-96,   Method: Compositional matrix adjust.
 Identities = 189/353 (53%), Positives = 240/353 (67%), Gaps = 13/353 (3%)

Query: 1   MAMILLENKLVLAAIL------VLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNA 54
           M ++LL   L L+A+          + +  S     +DA M E +E+W+AQ+ + Y    
Sbjct: 1   MGILLLFAVLALSAMAGSASRADFSIISYDSQDLIGDDAIM-ELYELWLAQHKKAYNGLD 59

Query: 55  EKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSS 114
           EK+ +F +FK+N  YI   NN+  N  YKLG+N+FAD ++EEF+A   G K      R S
Sbjct: 60  EKQKKFSVFKDNFLYIHQHNNQG-NPSYKLGLNQFADLSHEEFKAAYLGTKLDAKK-RLS 117

Query: 115 ETTDVSFRYE-NASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLT 173
            +    ++Y     +P SIDWR+KGAVT VK+QG CG CWAFS VAA+EGIN I T  LT
Sbjct: 118 RSPSPRYQYSVGEDLPESIDWREKGAVTAVKNQGSCGSCWAFSTVAAVEGINQIVTGNLT 177

Query: 174 SLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPS 233
           SLSEQELVDCDTS  +QGC GGLMD AF+FIISN GL +E  YPYKA++GSC+    N  
Sbjct: 178 SLSEQELVDCDTS-YNQGCNGGLMDYAFQFIISNGGLDSEDDYPYKANNGSCDAYRKNAH 236

Query: 234 AAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTA 293
              I  YEDVP N+E +L KA ANQP+SVAI+ASG  FQFY SGVFT  CGT+LDHGVT 
Sbjct: 237 VVTIDDYEDVPENDEKSLKKAAANQPISVAIEASGRAFQFYESGVFTSNCGTQLDHGVTL 296

Query: 294 VGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDID-AKEGLCGIAMQASYPT 345
           VGYG+ + G  YWLVKNSWG +WGE G+I++QR+++ A  G+CGIAM+ASYP 
Sbjct: 297 VGYGS-ESGIDYWLVKNSWGNSWGEKGFIKLQRNLEGASTGMCGIAMEASYPV 348


>gi|297826875|ref|XP_002881320.1| hypothetical protein ARALYDRAFT_321132 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297327159|gb|EFH57579.1| hypothetical protein ARALYDRAFT_321132 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 341

 Score =  358 bits (919), Expect = 2e-96,   Method: Compositional matrix adjust.
 Identities = 187/341 (54%), Positives = 238/341 (69%), Gaps = 11/341 (3%)

Query: 10  LVLAAILVLGVWAPQSWSRTL--NDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENV 67
           + +  IL       Q+ SRT+  ++ +  E+HE WMA++ RVYRD  EK+MR  +FK+N+
Sbjct: 8   VTIFTILFTTFSISQATSRTVTFHEPSSLEKHEQWMARFSRVYRDELEKQMRRDVFKKNL 67

Query: 68  EYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENAS 127
           ++I +FN K  NK YKLG+NEFAD TNEEF A   G K  L S    ET  +S R  N S
Sbjct: 68  KFIENFNKKG-NKSYKLGVNEFADWTNEEFLAIHTGLKG-LSSKVVDET--ISSRSWNIS 123

Query: 128 --VPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDT 185
             V  S DWR +GAVT VK QGQCGCCWAFSAVAA+EG+  I    L SLSEQ+L+DCD 
Sbjct: 124 DMVGVSKDWRAEGAVTPVKYQGQCGCCWAFSAVAAVEGVTKIAGGNLVSLSEQQLLDCDR 183

Query: 186 SGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPS 245
              D+GC+GG+M DAF +II N+G+A+E  Y Y+ SDG C +  A P AA+ISG++ VPS
Sbjct: 184 E-YDRGCDGGIMSDAFNYIIQNRGIASENDYSYQGSDGRC-RSSARP-AARISGFQTVPS 240

Query: 246 NNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKY 305
           NNE AL++AV+ QPVSV++DA+G  F  YS GV+ G CGT  +H VT VGYGT+ DGTKY
Sbjct: 241 NNEQALLEAVSRQPVSVSMDANGDGFMHYSGGVYDGPCGTSSNHAVTFVGYGTSQDGTKY 300

Query: 306 WLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPTA 346
           WL KNSWG TWGE GYIR++RD+   +G+CG+A  A YP A
Sbjct: 301 WLAKNSWGETWGEKGYIRIRRDVAWPQGMCGVAQYAFYPVA 341


>gi|124484387|dbj|BAF46304.1| cysteine proteinase precursor [Ipomoea nil]
          Length = 474

 Score =  358 bits (919), Expect = 2e-96,   Method: Compositional matrix adjust.
 Identities = 179/312 (57%), Positives = 218/312 (69%), Gaps = 4/312 (1%)

Query: 35  MNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTN 94
           + E  E W+ ++G+ Y    EK+ RFKIF++N++YI    N   N+ YKLG+N FAD TN
Sbjct: 46  VKEMFESWLVKHGKSYNAVDEKDKRFKIFRDNLKYIDE-KNSLENRSYKLGLNRFADITN 104

Query: 95  EEFRAPRNGYKRRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQCGCCWA 154
           EE+R    G KR          +D        S+P SIDWR+KGAVTGVKDQG CG CWA
Sbjct: 105 EEYRTGYLGAKRDASRNMVKSKSDRYAPVAGDSLPDSIDWREKGAVTGVKDQGSCGSCWA 164

Query: 155 FSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEA 214
           FS +AA+EG+N + T  L SLSEQELVDCD    +QGC GG M  AF+FII N G+ +E 
Sbjct: 165 FSTIAAVEGVNQLATGNLISLSEQELVDCDRK-INQGCNGGDMGYAFQFIIKNGGIDSEE 223

Query: 215 KYPYKASDGSCNK-KEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQF 273
            YPY   DG C+  ++ N   A I GYE+VP NNE +L KAVANQPVSVAI+A G DFQ 
Sbjct: 224 DYPYTGKDGKCDSYRQNNAKVASIDGYEEVPVNNEKSLQKAVANQPVSVAIEAGGYDFQL 283

Query: 274 YSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEG 333
           YSSG+FTG CGT+LDHGV AVGYGT ++G  YW+VKNSWG  WGE GY+RMQR++ AK G
Sbjct: 284 YSSGIFTGSCGTDLDHGVAAVGYGT-ENGVDYWIVKNSWGDYWGEKGYVRMQRNVKAKTG 342

Query: 334 LCGIAMQASYPT 345
           LCGIAM+ASYPT
Sbjct: 343 LCGIAMEASYPT 354


>gi|413933049|gb|AFW67600.1| cysteine protease 1 [Zea mays]
          Length = 341

 Score =  358 bits (918), Expect = 3e-96,   Method: Compositional matrix adjust.
 Identities = 183/312 (58%), Positives = 221/312 (70%), Gaps = 11/312 (3%)

Query: 38  RHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEF 97
           RHE WMA++GR Y+D AEK  R ++F+ N E I SFN  A    ++L  N FAD T EEF
Sbjct: 37  RHEKWMAEHGRAYKDEAEKARRLEVFRANAELIDSFN-AAGTHSHRLATNRFADLTVEEF 95

Query: 98  RAPRNGYKRR-LPSVRSSETTDVSFRYENASVP---ASIDWRKKGAVTGVKDQGQCGCCW 153
           RA R G + R  PS  +       FRYEN S+     S+DWR  GAVTGVKDQG CGCCW
Sbjct: 96  RAARTGLRPRPAPSAGAGR-----FRYENFSLADAAQSVDWRAMGAVTGVKDQGACGCCW 150

Query: 154 AFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATE 213
           AFSAVAA+EG+N I T +L SLSEQELVDCD SG DQGC+GGLMD+AF+F+    GLA+E
Sbjct: 151 AFSAVAAVEGLNKIRTGRLVSLSEQELVDCDVSGVDQGCDGGLMDNAFQFVARRGGLASE 210

Query: 214 AKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQF 273
           + YPY+  DG C    A   AA I G+EDVP NNEAAL  AVANQPVSVAI+     F+F
Sbjct: 211 SGYPYQGRDGPCRSSAAAARAASIRGHEDVPRNNEAALAAAVANQPVSVAINGEDMAFRF 270

Query: 274 YSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEG 333
           Y SGV  G CGT+L+H +TAVGYGTA+DGT+YWL+KNSWG +WGE GY+R++R +   EG
Sbjct: 271 YDSGVLGGACGTDLNHAITAVGYGTANDGTRYWLMKNSWGASWGEGGYVRIRRGVRG-EG 329

Query: 334 LCGIAMQASYPT 345
           +CG+A   SYP 
Sbjct: 330 VCGLAKLPSYPV 341


>gi|116787404|gb|ABK24495.1| unknown [Picea sitchensis]
 gi|224286306|gb|ACN40861.1| unknown [Picea sitchensis]
          Length = 452

 Score =  357 bits (917), Expect = 4e-96,   Method: Compositional matrix adjust.
 Identities = 184/317 (58%), Positives = 224/317 (70%), Gaps = 8/317 (2%)

Query: 31  NDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFA 90
           +DA M E +E+W+A++ R Y    EK+ RF +FK+N  YI   N    N+ YKLG+N+FA
Sbjct: 35  DDAIM-ELYELWLAEHKRAYNGLDEKQKRFSVFKDNFLYIHEHNQG--NRSYKLGLNQFA 91

Query: 91  DQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENA-SVPASIDWRKKGAVTGVKDQGQC 149
           D ++EEF+A   G K      R S      ++Y +   +P SIDWR+KGAVT VKDQG C
Sbjct: 92  DLSHEEFKATYLGAKLDTKK-RLSRPPSRRYQYSDGEDLPESIDWREKGAVTSVKDQGSC 150

Query: 150 GCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKG 209
           G CWAFS VAA+EGIN I T  L SLSEQELVDCDTS  +QGC GGLMD AFEFII+N G
Sbjct: 151 GSCWAFSTVAAVEGINQIVTGDLISLSEQELVDCDTS-YNQGCNGGLMDYAFEFIINNGG 209

Query: 210 LATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGS 269
           L +E  YPY A DGSC+    N     I  YEDVP N+E +L KA ANQP+SVAI+ASG 
Sbjct: 210 LDSEEDYPYTAYDGSCDSYRKNAHVVTIDDYEDVPENDEKSLKKAAANQPISVAIEASGR 269

Query: 270 DFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDID 329
           +FQFY SGVFT  CGT+LDHGVT VGYG+ + GT YW VKNSWG +WGE G+IR+QR+I+
Sbjct: 270 EFQFYDSGVFTSTCGTQLDHGVTLVGYGS-ESGTDYWTVKNSWGKSWGEEGFIRLQRNIE 328

Query: 330 -AKEGLCGIAMQASYPT 345
            A  G+CGIAM+ASYP 
Sbjct: 329 VASTGMCGIAMEASYPV 345


>gi|326502440|dbj|BAJ95283.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 349

 Score =  357 bits (916), Expect = 4e-96,   Method: Compositional matrix adjust.
 Identities = 182/318 (57%), Positives = 231/318 (72%), Gaps = 9/318 (2%)

Query: 32  DATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFAD 91
           D+ M  RHE WMA++GR Y +  EK  R ++F+ N + I SFN+ A +  ++L  N FAD
Sbjct: 37  DSAMVSRHEKWMAEHGRTYANEEEKARRLEVFRANAKLIDSFNS-AEDSTHRLATNRFAD 95

Query: 92  QTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENASVP---ASIDWRKKGAVTGVKDQGQ 148
            T+EEFRA R G +R   +   + +    FRYEN S+     S+DWR  GAVTGVKDQG 
Sbjct: 96  LTDEEFRAARTGLRRPPAAAAGAGSGAGGFRYENFSLADAAGSMDWRAMGAVTGVKDQGS 155

Query: 149 CGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNK 208
           CGCCWAFSAVAA+EG+  I T +L SLSEQ+LVDCD  G+D+GC GGLMD+AFE++I+  
Sbjct: 156 CGCCWAFSAVAAVEGLTKIRTGRLVSLSEQQLVDCDVYGDDEGCAGGLMDNAFEYMINRG 215

Query: 209 GLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASG 268
           GL TE+ YPY+ +DGSC +   + SAA I GYEDVP+NNEAALM AVA+QPVSVAI+   
Sbjct: 216 GLTTESSYPYRGTDGSCRR---SASAASIRGYEDVPANNEAALMAAVAHQPVSVAINGGD 272

Query: 269 SDFQFYSSGVFTGQ-CGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRD 327
           S F+FY SGV  G  CGTEL+H +TAVGYGTA DGTKYW++KNSWG +WGE GY+R++R 
Sbjct: 273 SVFRFYDSGVLGGSGCGTELNHAITAVGYGTASDGTKYWIMKNSWGGSWGEGGYVRIRRG 332

Query: 328 IDAKEGLCGIAMQASYPT 345
           +   EG+CG+A  ASYP 
Sbjct: 333 VRG-EGVCGLAQLASYPV 349


>gi|357507505|ref|XP_003624041.1| Cysteine proteinase [Medicago truncatula]
 gi|355499056|gb|AES80259.1| Cysteine proteinase [Medicago truncatula]
          Length = 342

 Score =  357 bits (916), Expect = 5e-96,   Method: Compositional matrix adjust.
 Identities = 179/313 (57%), Positives = 229/313 (73%), Gaps = 15/313 (4%)

Query: 34  TMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQT 93
           +++ER E W  +YG VY+D AE++  F+IFK NV YI  FN  A NKPYKL IN F D+ 
Sbjct: 37  SLSERFEYWKTKYGVVYKDVAEQKKHFQIFKHNVAYIDYFN-AAGNKPYKLAINRFVDKP 95

Query: 94  NEEFRAPRNGYKRRLPSVRSSETTDVSFRYENAS-VPASIDWRKKGAVTGVKDQGQCGCC 152
            E+     +G++R      ++ T   +F+YEN + +PA++DWRK+GAVT +K+QG+CG C
Sbjct: 96  IED---SDDGFERT-----TTTTPTTTFKYENVTDIPATVDWRKRGAVTPIKNQGKCGSC 147

Query: 153 WAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLAT 212
           WAFSAVAA+EGI  IT+  L SLSEQ+LVDCD SG  +GC+ G M +AF+FI+ N G+AT
Sbjct: 148 WAFSAVAAIEGIQKITSGNLVSLSEQQLVDCDRSGRTKGCDNGNMINAFKFILENGGIAT 207

Query: 213 EAKYPYK-ASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDF 271
           EA YPYK    G+C K        +I  YE+VPSN+E +L+KAVANQPVSV ID  G  F
Sbjct: 208 EANYPYKRVVKGTCKKVS---HKVQIKSYEEVPSNSEDSLLKAVANQPVSVGIDMRGM-F 263

Query: 272 QFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAK 331
           +FYSSG+FTG+CGT+ +H +T VGYGT+ DG KYWLVKNSW   WGE GYIR++RDIDAK
Sbjct: 264 KFYSSGIFTGECGTKPNHALTIVGYGTSKDGIKYWLVKNSWSKRWGEKGYIRIKRDIDAK 323

Query: 332 EGLCGIAMQASYP 344
           EGLCGIAM+ SYP
Sbjct: 324 EGLCGIAMKPSYP 336


>gi|146216004|gb|ABQ10204.1| cysteine protease Cp6 [Actinidia deliciosa]
          Length = 461

 Score =  357 bits (916), Expect = 5e-96,   Method: Compositional matrix adjust.
 Identities = 183/322 (56%), Positives = 225/322 (69%), Gaps = 12/322 (3%)

Query: 27  SRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGI 86
           S +  D  +   +E W+ ++G+ Y    EKE RF+IFK+N+ +I   N ++R   YK+G+
Sbjct: 34  SSSRTDDEVMAMYESWLVKHGKSYNAIGEKEKRFQIFKDNLRFIDEHNAESRT--YKVGL 91

Query: 87  NEFADQTNEEFRA----PRNGYKRRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTG 142
           N FAD TN+E+R+     R G +RRL + + S   D        S+P S+DWR+KGAV G
Sbjct: 92  NRFADLTNDEYRSMYLGARTGSRRRLSTQKRS---DRYVPVAGESLPDSVDWREKGAVVG 148

Query: 143 VKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFE 202
           VKDQG CG CWAFS +AA+EGIN I T  L SLSEQELVDCDTS  ++GC GGLMD AFE
Sbjct: 149 VKDQGSCGSCWAFSTIAAVEGINQIVTGDLISLSEQELVDCDTS-YNEGCNGGLMDYAFE 207

Query: 203 FIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSV 262
           FII N G+ TE  YPY A DG C++   N     I  YEDVP NNE AL KAVANQPVSV
Sbjct: 208 FIIKNGGIDTEEDYPYNARDGRCDQYRKNAKVVTIDDYEDVPVNNEQALQKAVANQPVSV 267

Query: 263 AIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYI 322
           AI+ASG  FQFY SGVFTG CGT LDHGVTAVGYGT ++   YW+VKNSWG++WGE+GYI
Sbjct: 268 AIEASGMAFQFYESGVFTGNCGTALDHGVTAVGYGT-ENSVDYWIVKNSWGSSWGESGYI 326

Query: 323 RMQRDIDAKEGLCGIAMQASYP 344
           RM+R+  A  G CGIA++ SYP
Sbjct: 327 RMERNTGAT-GKCGIAVEPSYP 347


>gi|2224810|emb|CAB09698.1| cysteine proteinase [Hordeum vulgare subsp. vulgare]
          Length = 349

 Score =  357 bits (915), Expect = 7e-96,   Method: Compositional matrix adjust.
 Identities = 182/318 (57%), Positives = 230/318 (72%), Gaps = 9/318 (2%)

Query: 32  DATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFAD 91
           DA M  RHE WMA++GR Y +  EK  R ++F+ N + I SFN+ A +  ++L  N FAD
Sbjct: 37  DAAMVSRHEKWMAEHGRTYANEEEKARRLEVFRANAKLIDSFNS-AEDSTHRLATNRFAD 95

Query: 92  QTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENASVP---ASIDWRKKGAVTGVKDQGQ 148
            T+EEFRA R G +R   +   + +    FRYEN S+     S+DWR  GAVTGVKDQG 
Sbjct: 96  LTDEEFRAARTGLRRPPAAAAGAGSGAGGFRYENFSLADAAGSMDWRAMGAVTGVKDQGS 155

Query: 149 CGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNK 208
           CGCCWAFSAVAA+EG+  I T +L SLSEQ+LVDCD  G+D+GC GGLMD+AFE++I+  
Sbjct: 156 CGCCWAFSAVAAVEGLTKIRTGRLVSLSEQQLVDCDVYGDDEGCAGGLMDNAFEYMINRG 215

Query: 209 GLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASG 268
           GL TE+ YPY+ +DGSC +   + SAA I GYEDVP+NNEAALM AVA+QPVSVAI+   
Sbjct: 216 GLTTESSYPYRGTDGSCRR---SASAASIRGYEDVPANNEAALMAAVAHQPVSVAINGGD 272

Query: 269 SDFQFYSSGVFTGQ-CGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRD 327
           S F+FY SGV  G  CGTEL+H +TA GYGTA DGTKYW++KNSWG +WGE GY+R++R 
Sbjct: 273 SVFRFYDSGVLGGSGCGTELNHAITAAGYGTASDGTKYWIMKNSWGGSWGEGGYVRIRRG 332

Query: 328 IDAKEGLCGIAMQASYPT 345
           +   EG+CG+A  ASYP 
Sbjct: 333 VRG-EGVCGLAQLASYPV 349


>gi|18403438|ref|NP_565780.1| cysteine proteinase-like protein [Arabidopsis thaliana]
 gi|2342728|gb|AAB67626.1| cysteine proteinase [Arabidopsis thaliana]
 gi|330253821|gb|AEC08915.1| cysteine proteinase-like protein [Arabidopsis thaliana]
          Length = 345

 Score =  356 bits (914), Expect = 8e-96,   Method: Compositional matrix adjust.
 Identities = 185/356 (51%), Positives = 243/356 (68%), Gaps = 21/356 (5%)

Query: 1   MAMILLENKLVLAAILVLGVWAPQSWSRTL--NDATMNERHEMWMAQYGRVYRDNAEKEM 58
           MA I++   +++  IL  G    Q+ SRT+   + +M ++HE WMA++ R YRD  EK M
Sbjct: 1   MASIMVLVTVLI--ILFTGFRISQATSRTVIFREQSMVDKHEQWMARFSREYRDELEKNM 58

Query: 59  RFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYK--------RRLPS 110
           R  +FK+N+++I +FN K  NK YKLG+NEFAD TNEEF A   G K        + +  
Sbjct: 59  RRDVFKKNLKFIENFNKKG-NKSYKLGVNEFADWTNEEFLAIHTGLKGLTEVSPSKVVAK 117

Query: 111 VRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTR 170
             SS+T +VS       V  S DWR +GAVT VK QGQCGCCWAFSAVAA+EG+  I   
Sbjct: 118 TISSQTWNVS-----DMVVESKDWRAEGAVTPVKYQGQCGCCWAFSAVAAVEGVAKIAGG 172

Query: 171 KLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEA 230
            L SLSEQ+L+DCD    D+GC+GG+M DAF +++ N+G+A+E  Y Y+ SDG C +  A
Sbjct: 173 NLVSLSEQQLLDCDRE-YDRGCDGGIMSDAFNYVVQNRGIASENDYSYQGSDGGC-RSNA 230

Query: 231 NPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHG 290
            P AA+ISG++ VPSNNE AL++AV+ QPVSV++DA+G  F  YS GV+ G CGT  +H 
Sbjct: 231 RP-AARISGFQTVPSNNERALLEAVSRQPVSVSMDATGDGFMHYSGGVYDGPCGTSSNHA 289

Query: 291 VTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPTA 346
           VT VGYGT+ DGTKYWL KNSWG TWGE GYIR++RD+   +G+CG+A  A YP A
Sbjct: 290 VTFVGYGTSQDGTKYWLAKNSWGETWGEKGYIRIRRDVAWPQGMCGVAQYAFYPVA 345


>gi|224103643|ref|XP_002313136.1| predicted protein [Populus trichocarpa]
 gi|222849544|gb|EEE87091.1| predicted protein [Populus trichocarpa]
          Length = 477

 Score =  355 bits (912), Expect = 1e-95,   Method: Compositional matrix adjust.
 Identities = 181/312 (58%), Positives = 222/312 (71%), Gaps = 11/312 (3%)

Query: 39  HEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFR 98
           +EMW+ +YG+ Y    EKE RF+IFK+N++++   +N   N  YKLG+N+FAD +NEE+R
Sbjct: 49  YEMWLVKYGKAYNALGEKERRFEIFKDNLKFVDQ-HNSVGNPSYKLGLNKFADLSNEEYR 107

Query: 99  APRNGY----KRRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQCGCCWA 154
           A   G     KRRL  +   ++    F+ +   +P S+DWR+KGAV  VKDQGQCG CWA
Sbjct: 108 AAYLGTRMDGKRRL--LGGPKSARYLFK-DGDDLPESVDWREKGAVAPVKDQGQCGSCWA 164

Query: 155 FSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEA 214
           FS V A+EGIN I T  LTSLSEQELVDCD    +QGC GGLMD AFEFI+ N G+ TE 
Sbjct: 165 FSTVGAVEGINQIVTGNLTSLSEQELVDCDKV-YNQGCNGGLMDYAFEFIMKNGGIDTEE 223

Query: 215 KYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFY 274
            YPYKA D  C+    N     I GYEDVP N+E +L KAVANQPVSVAI+A G  FQ Y
Sbjct: 224 DYPYKAVDSMCDPNRKNARVVTIDGYEDVPQNDEKSLRKAVANQPVSVAIEAGGRAFQLY 283

Query: 275 SSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKE-G 333
            SGVFTG CGT+LDHGV AVGYGT ++G  YW+V+NSWG  WGENGYIRM+R++ + E G
Sbjct: 284 QSGVFTGSCGTQLDHGVVAVGYGT-ENGVDYWVVRNSWGPAWGENGYIRMERNVASTETG 342

Query: 334 LCGIAMQASYPT 345
            CGIAM+ASYPT
Sbjct: 343 KCGIAMEASYPT 354


>gi|2224812|emb|CAB09699.1| cysteine endopeptidase EP-A [Hordeum vulgare subsp. vulgare]
          Length = 365

 Score =  355 bits (911), Expect = 2e-95,   Method: Compositional matrix adjust.
 Identities = 177/312 (56%), Positives = 216/312 (69%), Gaps = 10/312 (3%)

Query: 39  HEMWMAQYGRVYRD---NAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNE 95
           +E W + Y    R    +AE E RF +FK+N  Y+   N   R+ P++L +N+FAD T +
Sbjct: 41  YERWRSHYTVSRRGLGADAE-ERRFNVFKQNARYVHEGNK--RDMPFRLALNKFADMTTD 97

Query: 96  EFRAPRNGYKRR--LPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQCGCCW 153
           EFR    G + R  L               +  ++P ++DWR+KGAVT +KDQGQCG CW
Sbjct: 98  EFRRTYAGSRVRHHLSLSGGRRGDGGFRYGDADNLPPAVDWRQKGAVTAIKDQGQCGSCW 157

Query: 154 AFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATE 213
           AFS + A+EGIN I T KL SLSEQEL+DCD    +QGC+GGLMD AF+FI  N G+ TE
Sbjct: 158 AFSTIVAVEGINKIRTGKLVSLSEQELMDCDNV-NNQGCDGGLMDYAFQFIQKN-GITTE 215

Query: 214 AKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQF 273
           + YPY+   GSC++ + N  A  I GYEDVP+N+E+AL KAVA QPVSVAIDASG DFQF
Sbjct: 216 SNYPYQGEQGSCDQAKENAQAVTIDGYEDVPANDESALQKAVAGQPVSVAIDASGQDFQF 275

Query: 274 YSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEG 333
           YS GVFTG+C T+LDHGV AVGYG   DGTKYW+VKNSWG  WGE GYIRMQR +   EG
Sbjct: 276 YSEGVFTGECSTDLDHGVAAVGYGATRDGTKYWIVKNSWGEDWGEKGYIRMQRGVSQTEG 335

Query: 334 LCGIAMQASYPT 345
           LCGIAMQASYPT
Sbjct: 336 LCGIAMQASYPT 347


>gi|148927382|gb|ABR19827.1| cysteine proteinase [Elaeis guineensis]
          Length = 470

 Score =  355 bits (911), Expect = 2e-95,   Method: Compositional matrix adjust.
 Identities = 178/314 (56%), Positives = 220/314 (70%), Gaps = 6/314 (1%)

Query: 35  MNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKAR--NKPYKLGINEFADQ 92
           M   +E W+A++GR Y    EKE RF+IFK+NV +I + N  A   ++ ++LG+N FAD 
Sbjct: 46  MRILYEGWLAKHGRAYNALGEKERRFEIFKDNVLFIDAHNAAADAGHRSFRLGLNRFADM 105

Query: 93  TNEEFRAPRNGYKRRLPSVRSSETTDVSFRYE-NASVPASIDWRKKGAVTGVKDQGQCGC 151
           TNEE+RA   G  R     R +      +RY     +P S+DWR KGAV  VKDQG CG 
Sbjct: 106 TNEEYRAVYLG-TRPAGHRRRARVGSDRYRYNAGEDLPESVDWRAKGAVAAVKDQGSCGS 164

Query: 152 CWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLA 211
           CWAFS VAA+EGIN I T  L SLSEQELVDCD +G +QGC GGLMD  FEFII+N G+ 
Sbjct: 165 CWAFSTVAAVEGINKIVTGDLISLSEQELVDCD-NGYNQGCNGGLMDYGFEFIINNGGID 223

Query: 212 TEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDF 271
           TE  YPY A DG C++   N     I GYEDVP N+E AL KAVANQPVSVAI+A G +F
Sbjct: 224 TEEDYPYTARDGKCDQYRKNAKVVSIDGYEDVPVNDEKALQKAVANQPVSVAIEAGGREF 283

Query: 272 QFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAK 331
           Q Y SG+FTG+CGT+LDHGV AVGYGT ++G  YW+V+NSWG  WGE+GYIRM+R+++  
Sbjct: 284 QLYHSGIFTGRCGTDLDHGVVAVGYGT-ENGKDYWIVRNSWGGDWGESGYIRMERNVNTS 342

Query: 332 EGLCGIAMQASYPT 345
            G CGIA++ SYPT
Sbjct: 343 TGKCGIAIEPSYPT 356


>gi|302143414|emb|CBI21975.3| unnamed protein product [Vitis vinifera]
          Length = 286

 Score =  354 bits (909), Expect = 3e-95,   Method: Compositional matrix adjust.
 Identities = 192/347 (55%), Positives = 221/347 (63%), Gaps = 62/347 (17%)

Query: 1   MAMILLENKLVLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRF 60
           MA +     + LA +  L  WA Q+ +R L +A+M ERHE WMAQYGRVY+D  EK  R+
Sbjct: 1   MASVNQYQYICLALLFFLAAWASQATARNLLEASMYERHEDWMAQYGRVYKDADEKSKRY 60

Query: 61  KIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVS 120
           KIFK+NV  I SFN KA +K YKL INEFAD TNEEFRA RN +K  + S  ++     S
Sbjct: 61  KIFKDNVARIESFN-KAMDKSYKLSINEFADLTNEEFRASRNRFKAHICSTEAT-----S 114

Query: 121 FRYEN-ASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQE 179
           F+YE+ A+VP+++DWRKKGAVT +KDQGQCG CWAFSAVAAMEGI  ++T KL SLSEQE
Sbjct: 115 FKYEHVAAVPSTVDWRKKGAVTPIKDQGQCGSCWAFSAVAAMEGITQLSTGKLISLSEQE 174

Query: 180 LVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISG 239
           LVDCDT       E  L           K +A +                  P A  I  
Sbjct: 175 LVDCDTKQNHANNEKAL----------QKAVAHQ------------------PIAVAI-- 204

Query: 240 YEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTA 299
                                    DA G +FQFYSSGVFTGQCGTELDHGV AVGYGT+
Sbjct: 205 -------------------------DAGGFEFQFYSSGVFTGQCGTELDHGVAAVGYGTS 239

Query: 300 DDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPTA 346
           DDG KYWLVKNSWGT WGE GYIRMQRD+ AKEGLCGIAMQASYPTA
Sbjct: 240 DDGMKYWLVKNSWGTGWGEVGYIRMQRDVTAKEGLCGIAMQASYPTA 286


>gi|4100157|gb|AAD10337.1| cysteine proteinase precursor [Hordeum vulgare]
          Length = 365

 Score =  354 bits (909), Expect = 3e-95,   Method: Compositional matrix adjust.
 Identities = 175/311 (56%), Positives = 213/311 (68%), Gaps = 8/311 (2%)

Query: 39  HEMWMAQYGRVYRDNAEK--EMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEE 96
           +E W + Y    R       E RF +FK+N  Y+   N   R+ P++L +N+FAD T +E
Sbjct: 41  YERWRSHYTVSRRGLGADAGERRFNVFKQNARYVHEGNK--RDMPFRLALNKFADMTTDE 98

Query: 97  FRAPRNGYKRR--LPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQCGCCWA 154
           FR    G + R  L               +  ++P ++DWR+KGAVT +KDQGQCG CWA
Sbjct: 99  FRRTYAGSRVRHHLSLSGGRRGDGGFRYGDADNLPPAVDWRQKGAVTAIKDQGQCGSCWA 158

Query: 155 FSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEA 214
           FS + A+EGIN I T KL SLSEQEL+DCD    +QGC+GGLMD AF+FI  N G+ TE+
Sbjct: 159 FSTIVAVEGINKIRTGKLVSLSEQELMDCDNV-NNQGCDGGLMDYAFQFIQKN-GITTES 216

Query: 215 KYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFY 274
            YPY+   GSC++ + N  A  I GYEDVP+N+E+AL KAVA QPVSVAIDASG DFQFY
Sbjct: 217 NYPYQGEQGSCDQAKENAQAVTIDGYEDVPANDESALQKAVAGQPVSVAIDASGQDFQFY 276

Query: 275 SSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGL 334
           S GVFTG+C T+LDHGV AVGYG   DGTKYW+VKNSWG  WGE GYIRMQR +   EGL
Sbjct: 277 SEGVFTGECSTDLDHGVAAVGYGATRDGTKYWIVKNSWGEDWGEKGYIRMQRGVSQTEGL 336

Query: 335 CGIAMQASYPT 345
           CGIAMQASYPT
Sbjct: 337 CGIAMQASYPT 347


>gi|414584879|tpg|DAA35450.1| TPA: cysteine protease 1 [Zea mays]
          Length = 522

 Score =  354 bits (909), Expect = 3e-95,   Method: Compositional matrix adjust.
 Identities = 167/311 (53%), Positives = 222/311 (71%), Gaps = 8/311 (2%)

Query: 39  HEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFR 98
           +E+W+A++GR Y    E++ RF++F +N+ ++ + N +A    ++LG+N+FAD TN+EFR
Sbjct: 109 YELWLAEHGRAYNALGERDRRFRVFWDNLRFVDAHNERAAEHGFRLGMNQFADLTNDEFR 168

Query: 99  APRNGYKRRLPSVRSSETTDVSFRYENAS----VPASIDWRKKGAVTGVKDQGQCGCCWA 154
           A   G   R+P+ R   T  V  RY +      +P S+DWR+KGAV  VK+QGQCG CWA
Sbjct: 169 AAYLGA--RIPASRRRGTA-VGERYRHGGGAEELPESVDWREKGAVAPVKNQGQCGSCWA 225

Query: 155 FSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEA 214
           FSAV+++E +N I T ++ +LSEQELV+C T G + GC GGLMD AF+FII N G+ TE 
Sbjct: 226 FSAVSSVESVNQIVTGEMVTLSEQELVECSTDGGNSGCNGGLMDAAFDFIIKNGGIDTEG 285

Query: 215 KYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFY 274
            YPYKA DG C+    N     I G+EDVP N+E +L KAVA+QPVSVAI+A G +FQ Y
Sbjct: 286 DYPYKAVDGKCDINRENAKVVSIDGFEDVPENDEKSLQKAVAHQPVSVAIEAGGREFQLY 345

Query: 275 SSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGL 334
            +GVFTG C T LDHGV AVGYGT ++G  YW+V+NSWG  WGE+GYIRM+R+++A  G 
Sbjct: 346 KAGVFTGTCTTNLDHGVVAVGYGT-ENGKDYWIVRNSWGAKWGEDGYIRMERNVNATTGK 404

Query: 335 CGIAMQASYPT 345
           CGIAM ASYPT
Sbjct: 405 CGIAMMASYPT 415


>gi|255539310|ref|XP_002510720.1| cysteine protease, putative [Ricinus communis]
 gi|223551421|gb|EEF52907.1| cysteine protease, putative [Ricinus communis]
          Length = 349

 Score =  354 bits (908), Expect = 4e-95,   Method: Compositional matrix adjust.
 Identities = 178/325 (54%), Positives = 224/325 (68%), Gaps = 10/325 (3%)

Query: 21  WAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNK 80
           +AP+    T ND  + +  E W++++GRVY    EK  RF+IFK+N+ +I   N K RN 
Sbjct: 32  YAPEDL--TSNDKLI-DLFESWISRFGRVYESAEEKLERFEIFKDNLFHIDDTNKKVRN- 87

Query: 81  PYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENASVPASIDWRKKGAV 140
            Y LG+NEFAD ++EEF+   N Y    P +         F Y++ ++P S+DWRKKGAV
Sbjct: 88  -YWLGLNEFADLSHEEFK---NKYLGLKPDLSKRAQCPEEFTYKDVAIPKSVDWRKKGAV 143

Query: 141 TGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDA 200
           T VK+QG CG CWAFS VAA+EGIN I T  LTSLSEQEL+DCDT+  + GC GGLMD A
Sbjct: 144 TPVKNQGSCGSCWAFSTVAAVEGINQIVTGNLTSLSEQELIDCDTT-YNNGCNGGLMDYA 202

Query: 201 FEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPV 260
           F +I++N GL  E  YPY   +G+C+ ++    A  ISGY DVP N+E +L+KA+ANQP+
Sbjct: 203 FAYIVANGGLHKEEDYPYIMEEGTCDMRKEESDAVTISGYHDVPQNSEESLLKALANQPL 262

Query: 261 SVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENG 320
           S+AI+ASG DFQFYS GVF G CGTELDHGV AVGYGT+  G  Y +VKNSWG  WGE G
Sbjct: 263 SIAIEASGRDFQFYSGGVFDGHCGTELDHGVAAVGYGTS-KGLDYIIVKNSWGPKWGEKG 321

Query: 321 YIRMQRDIDAKEGLCGIAMQASYPT 345
           YIRM+R     EG+CGI   ASYPT
Sbjct: 322 YIRMKRKTSKPEGICGIYKMASYPT 346


>gi|374713649|gb|AEZ65082.1| cysteine protease [Carica papaya]
          Length = 471

 Score =  353 bits (907), Expect = 6e-95,   Method: Compositional matrix adjust.
 Identities = 185/355 (52%), Positives = 241/355 (67%), Gaps = 18/355 (5%)

Query: 1   MAMILLENKLVLAAILVLGVWAP----QSWSRTLNDATMNERHEMWMAQYGRVYRDNAEK 56
           ++ + +   L LA++ ++    P    QS  RT  +A M + +E W+ ++G+ Y    EK
Sbjct: 12  ISFLFMVFSLSLASMSIIDYDLPADPLQSTERT--EAHMMKMYEHWLVKHGKNYNAIGEK 69

Query: 57  EMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSET 116
           E RF+IFK+N+ ++    N    + YKLG+ +FAD TNEE+RA   G K      +    
Sbjct: 70  ERRFEIFKDNLRFVDE-QNSVPGRTYKLGLTKFADLTNEEYRAMYLGAKMEK---KEKLR 125

Query: 117 TDVSFRYENAS-----VPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRK 171
           T+ S RY + +     +P+ +DWR+KGAVT VKDQGQCG CWAFS V ++EGIN I T  
Sbjct: 126 TERSQRYLHKAGNDDDLPSHVDWREKGAVTEVKDQGQCGSCWAFSTVGSVEGINQIVTGD 185

Query: 172 LTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEAN 231
           L SLSEQELVDCD +  +QGC GGLMD AFEFII N G+ +EA YPY+ASD  C+    N
Sbjct: 186 LISLSEQELVDCDKA-YNQGCNGGLMDYAFEFIIKNGGIDSEADYPYRASDNMCDSNRKN 244

Query: 232 PSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGV 291
                I GYEDVP N+E +L KAVANQPVSVAI+A G +FQ Y SGVFTG+CGT LDHGV
Sbjct: 245 AHVVTIDGYEDVPENDEESLKKAVANQPVSVAIEAGGREFQLYQSGVFTGRCGTNLDHGV 304

Query: 292 TAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKE-GLCGIAMQASYPT 345
            AVGYGT ++G  YW+V+NSWG  WGE+GYIRM+R++ + + G CGIAM+ASYPT
Sbjct: 305 VAVGYGT-ENGIDYWIVRNSWGPKWGESGYIRMERNVASTDTGKCGIAMEASYPT 358


>gi|350535639|ref|NP_001233949.1| phytophthora-inhibited protease 1 [Solanum lycopersicum]
 gi|108937128|gb|ABG23376.1| phytophthora-inhibited protease 1 [Solanum lycopersicum]
          Length = 345

 Score =  353 bits (906), Expect = 6e-95,   Method: Compositional matrix adjust.
 Identities = 175/341 (51%), Positives = 233/341 (68%), Gaps = 11/341 (3%)

Query: 10  LVLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEY 69
           +VL    +L ++     SR L + +M ERHE WM  +GRVY+D+ EKE RFK FKENVE+
Sbjct: 12  VVLLLFSILSLYPFIVTSRNLKELSMLERHENWMVHHGRVYKDDIEKEHRFKTFKENVEF 71

Query: 70  IASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENAS-V 128
           I SFN     + YKL +N++AD T EEF     G    L S + S  T  SF+Y++ + V
Sbjct: 72  IESFNKNGTQR-YKLAVNKYADLTTEEFTTSFMGLDTSLLSQQESTATTTSFKYDSVTEV 130

Query: 129 PASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGE 188
           P S+DWRK+G+VTGVKDQG CGCCWAFSA AA+EG   I   +L SLSEQ+L+DC T  +
Sbjct: 131 PNSMDWRKRGSVTGVKDQGVCGCCWAFSAAAAIEGAYQIANNELISLSEQQLLDCST--Q 188

Query: 189 DQGCEGGLMDDAFEFIISNK--GLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSN 246
           ++GCEGGLM  A++F++ N   G+ TE  YPY+ +   C  K   P+A  I+GYE VPS 
Sbjct: 189 NKGCEGGLMTVAYDFLLQNNGGGITTETNYPYEEAQNVC--KTEQPAAVTINGYEVVPS- 245

Query: 247 NEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTA-DDGTKY 305
           +E++L+KAV NQP+SV I A+  +F  Y SG++ G C + L+H VT +GYGT+ +DGTKY
Sbjct: 246 DESSLLKAVVNQPISVGI-AANDEFHMYGSGIYDGSCNSRLNHAVTVIGYGTSEEDGTKY 304

Query: 306 WLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPTA 346
           W+VKNSWG+ WGE GY+R+ RD+    G CGIA  AS+PTA
Sbjct: 305 WIVKNSWGSDWGEEGYMRIARDVGVDGGHCGIAKVASFPTA 345


>gi|238006338|gb|ACR34204.1| unknown [Zea mays]
          Length = 465

 Score =  353 bits (906), Expect = 6e-95,   Method: Compositional matrix adjust.
 Identities = 167/311 (53%), Positives = 222/311 (71%), Gaps = 8/311 (2%)

Query: 39  HEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFR 98
           +E+W+A++GR Y    E++ RF++F +N+ ++ + N +A    ++LG+N+FAD TN+EFR
Sbjct: 52  YELWLAEHGRAYNALGERDRRFRVFWDNLRFVDAHNERAAEHGFRLGMNQFADLTNDEFR 111

Query: 99  APRNGYKRRLPSVRSSETTDVSFRYENAS----VPASIDWRKKGAVTGVKDQGQCGCCWA 154
           A   G   R+P+ R   T  V  RY +      +P S+DWR+KGAV  VK+QGQCG CWA
Sbjct: 112 AAYLGA--RIPASRRRGTA-VGERYRHGGGAEELPESVDWREKGAVAPVKNQGQCGSCWA 168

Query: 155 FSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEA 214
           FSAV+++E +N I T ++ +LSEQELV+C T G + GC GGLMD AF+FII N G+ TE 
Sbjct: 169 FSAVSSVESVNQIVTGEMVTLSEQELVECSTDGGNSGCNGGLMDAAFDFIIKNGGIDTEG 228

Query: 215 KYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFY 274
            YPYKA DG C+    N     I G+EDVP N+E +L KAVA+QPVSVAI+A G +FQ Y
Sbjct: 229 DYPYKAVDGKCDINRENAKVVSIDGFEDVPENDEKSLQKAVAHQPVSVAIEAGGREFQLY 288

Query: 275 SSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGL 334
            +GVFTG C T LDHGV AVGYGT ++G  YW+V+NSWG  WGE+GYIRM+R+++A  G 
Sbjct: 289 KAGVFTGTCTTNLDHGVVAVGYGT-ENGKDYWIVRNSWGAKWGEDGYIRMERNVNATTGK 347

Query: 335 CGIAMQASYPT 345
           CGIAM ASYPT
Sbjct: 348 CGIAMMASYPT 358


>gi|297819568|ref|XP_002877667.1| hypothetical protein ARALYDRAFT_348033 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297323505|gb|EFH53926.1| hypothetical protein ARALYDRAFT_348033 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 341

 Score =  353 bits (906), Expect = 6e-95,   Method: Compositional matrix adjust.
 Identities = 170/323 (52%), Positives = 230/323 (71%), Gaps = 13/323 (4%)

Query: 30  LNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEF 89
           L +A+  E+HE WM+++ RVY D++EK  RF+IFK+N++++ SFN    NK Y L +NEF
Sbjct: 26  LFEASAIEKHEQWMSRFHRVYSDDSEKTSRFEIFKKNLKFVESFNMNT-NKTYTLDVNEF 84

Query: 90  ADQTNEEFRAPRNGY-----KRRLPSVRSSETTDVSFRYENA-SVPASIDWRKKGAVTGV 143
           +D T+EEF+A   G        R+ +  S ET  VSFRYEN      S+DWR++GAVT V
Sbjct: 85  SDLTDEEFKARYTGLVVPEGMTRMSTTDSHET--VSFRYENVGETGESMDWREEGAVTSV 142

Query: 144 KDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEF 203
           K Q QCGCCWAFSAVAA+EG+  I   +L SLSEQ+L+DC T  E+ GC+GG+M  AF++
Sbjct: 143 KHQQQCGCCWAFSAVAAVEGMTKIAKGELVSLSEQQLLDCST--ENDGCDGGIMWKAFDY 200

Query: 204 IISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVA 263
           I+ N+G+  E  YPY+ +  +C       +AA ISGYE VP N+E AL+KAV+ QPVSVA
Sbjct: 201 IVENQGITAEDNYPYQGAQQTCESNHV--AAATISGYETVPQNDEEALLKAVSQQPVSVA 258

Query: 264 IDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIR 323
           I+ SG +F  YS G+F G+CGT L+H VT VGYG +++G KYWL+KNSWG +WGE+GY+R
Sbjct: 259 IEGSGYEFIHYSGGIFNGECGTHLNHAVTIVGYGVSEEGIKYWLLKNSWGESWGEDGYMR 318

Query: 324 MQRDIDAKEGLCGIAMQASYPTA 346
           + RD+DA +G+CG+A  A YP A
Sbjct: 319 IMRDVDAPQGMCGLASLAYYPVA 341


>gi|357467173|ref|XP_003603871.1| Cysteine proteinase [Medicago truncatula]
 gi|355492919|gb|AES74122.1| Cysteine proteinase [Medicago truncatula]
 gi|388499154|gb|AFK37643.1| unknown [Medicago truncatula]
          Length = 350

 Score =  353 bits (906), Expect = 6e-95,   Method: Compositional matrix adjust.
 Identities = 171/309 (55%), Positives = 218/309 (70%), Gaps = 6/309 (1%)

Query: 37  ERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEE 96
           E  E WM+++G++Y    EK +RF++FK+N+++I   N    N  Y LG+NEFAD +++E
Sbjct: 45  ELFESWMSRHGKIYETIEEKLLRFEVFKDNLKHIDDRNKVVSN--YWLGLNEFADLSHQE 102

Query: 97  FRAPRNGYKRRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQCGCCWAFS 156
           F+    G K  L   R  E+++  F Y +  +P S+DWRKKGAVT VK+QGQCG CWAFS
Sbjct: 103 FKNKYLGLKVDLSQRR--ESSEEEFTYRDVDLPKSVDWRKKGAVTPVKNQGQCGSCWAFS 160

Query: 157 AVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKY 216
            VAA+EGIN I T  LTSLSEQEL+DCDT+  + GC GGLMD AF FI+ N GL  E  Y
Sbjct: 161 TVAAVEGINQIVTGNLTSLSEQELIDCDTT-YNNGCNGGLMDYAFSFIVKNGGLHKEEDY 219

Query: 217 PYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSS 276
           PY   + +C  K+       I+GY DVP NNE +L+KA+ANQP+SVAI+ASG DFQFYS 
Sbjct: 220 PYIMEESTCEMKKEVSEVVTINGYHDVPQNNEQSLLKALANQPLSVAIEASGRDFQFYSG 279

Query: 277 GVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCG 336
           GVF G CG+ELDHGV+AVGYGT+  G  Y +VKNSWG  WGE G+IRM+R+I   EG+CG
Sbjct: 280 GVFDGHCGSELDHGVSAVGYGTS-KGLDYIIVKNSWGAKWGEKGFIRMKRNIGKSEGICG 338

Query: 337 IAMQASYPT 345
           +   ASYPT
Sbjct: 339 LYKMASYPT 347


>gi|255555337|ref|XP_002518705.1| cysteine protease, putative [Ricinus communis]
 gi|223542086|gb|EEF43630.1| cysteine protease, putative [Ricinus communis]
          Length = 471

 Score =  353 bits (906), Expect = 6e-95,   Method: Compositional matrix adjust.
 Identities = 178/316 (56%), Positives = 225/316 (71%), Gaps = 7/316 (2%)

Query: 32  DATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFAD 91
           D+ +   +EMW+ ++G+ Y    EKE RF+IFK+N+ +I   N+  R+  YK+G+N FAD
Sbjct: 44  DSQVRRMYEMWLVEHGKAYNALGEKEKRFEIFKDNLRFIDEHNSVDRS--YKVGLNRFAD 101

Query: 92  QTNEEFRAPRNGYK-RRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQCG 150
            TNEE++A   G K  R      + +    F+ +   +P ++DWR+KGAV  VKDQGQCG
Sbjct: 102 LTNEEYKAMFLGTKMERKNRFLGTRSQRYLFK-DGDDLPENVDWREKGAVVPVKDQGQCG 160

Query: 151 CCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGL 210
            CWAFS V A+EGIN I T +L SLSEQELVDCD S  +QGC GGLMD AFEFII+N G+
Sbjct: 161 SCWAFSTVGAVEGINQIVTGELISLSEQELVDCDKS-YNQGCNGGLMDYAFEFIINNGGI 219

Query: 211 ATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSD 270
            TE  YPYKASD  C+    N     I GYEDVP N+E +L KAVA+QPVSVAI+A G  
Sbjct: 220 DTEEDYPYKASDNICDPNRKNAKVVTIDGYEDVPENDENSLKKAVAHQPVSVAIEAGGRA 279

Query: 271 FQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDI-D 329
           FQ Y SGVFTG+CGTELDHGV AVGYGT ++G  YW+V+NSWG+ WGE+GYIRM+R++ +
Sbjct: 280 FQLYKSGVFTGRCGTELDHGVVAVGYGT-ENGVNYWIVRNSWGSAWGESGYIRMERNVAN 338

Query: 330 AKEGLCGIAMQASYPT 345
            K G CGIA+Q SYPT
Sbjct: 339 TKTGKCGIAIQPSYPT 354


>gi|226495425|ref|NP_001148706.1| cysteine protease 1 precursor [Zea mays]
 gi|195621544|gb|ACG32602.1| cysteine protease 1 precursor [Zea mays]
          Length = 463

 Score =  353 bits (906), Expect = 7e-95,   Method: Compositional matrix adjust.
 Identities = 175/308 (56%), Positives = 219/308 (71%), Gaps = 12/308 (3%)

Query: 42  WMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARN--KPYKLGINEFADQTNEEFRA 99
           WMA +GR Y    E+E R+++F++N+ YI + N  A      ++LG+N FAD TN+E+RA
Sbjct: 44  WMAAHGRTYNAVGEEERRYQVFRDNLRYIDAHNAAADAGVHSFRLGLNRFADLTNDEYRA 103

Query: 100 PRNGYKRRLPSVRSSETTDVSFRY---ENASVPASIDWRKKGAVTGVKDQGQCGCCWAFS 156
              G + R    R      +  RY   +N  +P S+DWR KGAV  VKDQG CG CWAFS
Sbjct: 104 TYLGARTRPQRERK-----LGARYHAADNEDLPESVDWRAKGAVAEVKDQGSCGSCWAFS 158

Query: 157 AVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKY 216
            +AA+EGIN I T  L SLSEQELVDCDTS  +QGC GGLMD AFEFII+N G+ TE  Y
Sbjct: 159 TIAAVEGINQIVTGDLISLSEQELVDCDTS-YNQGCNGGLMDYAFEFIINNGGIDTEKDY 217

Query: 217 PYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSS 276
           PYK +DG C+    N     I  YEDVP+N+E +L KAVANQPVSVAI+A+G+ FQ YSS
Sbjct: 218 PYKGTDGRCDVNRKNAKVVTIDSYEDVPANDEKSLQKAVANQPVSVAIEAAGTAFQLYSS 277

Query: 277 GVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCG 336
           G+FTG CGT LDHGVTAVGYGT ++G  YW+VKNSWG++WGE+GY+RM+R+I A  G CG
Sbjct: 278 GIFTGSCGTALDHGVTAVGYGT-ENGKDYWIVKNSWGSSWGESGYVRMERNIKASSGKCG 336

Query: 337 IAMQASYP 344
           IA++ SYP
Sbjct: 337 IAVEPSYP 344


>gi|18141283|gb|AAL60579.1|AF454957_1 senescence-associated cysteine protease [Brassica oleracea]
          Length = 460

 Score =  353 bits (906), Expect = 7e-95,   Method: Compositional matrix adjust.
 Identities = 180/321 (56%), Positives = 224/321 (69%), Gaps = 17/321 (5%)

Query: 31  NDATMNERHEMWMAQYGRVYRDNA----EKEMRFKIFKENVEYIASFNNKARNKPYKLGI 86
           +DA +   +E WM ++G+  + N     EK+ RF+IFK+N+ +I   NNK  N  YKLG+
Sbjct: 41  SDAEVARIYEAWMEKHGKKAQSNGLVGEEKDQRFEIFKDNLRFIDEHNNK--NLSYKLGL 98

Query: 87  NEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYE---NASVPASIDWRKKGAVTGV 143
             FAD TNEE+R+   G K +   +++S+      RY+     ++P S+DWRK+GAV  V
Sbjct: 99  TRFADLTNEEYRSIYLGAKSKKRVLKTSD------RYQPRVGDAIPDSVDWRKEGAVAAV 152

Query: 144 KDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEF 203
           KDQG CG CWAFS + A+EGIN I T  L SLSEQELVDCDTS  +QGC GGLMD AFEF
Sbjct: 153 KDQGSCGSCWAFSTIGAVEGINKIVTGDLISLSEQELVDCDTS-YNQGCNGGLMDYAFEF 211

Query: 204 IISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVA 263
           II N G+ TE  YPYKA+DG C++   N     I  YEDVP NNEAAL K +ANQP+SVA
Sbjct: 212 IIKNGGIDTEEDYPYKAADGRCDQTRKNAKVVTIDAYEDVPENNEAALKKTLANQPISVA 271

Query: 264 IDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIR 323
           I+A G  FQ YSSGVF G CGTELDHGV AVGYGT ++G  YW+V+NSWG +WGE+GYI+
Sbjct: 272 IEAGGRAFQLYSSGVFDGICGTELDHGVVAVGYGT-ENGKDYWIVRNSWGGSWGESGYIK 330

Query: 324 MQRDIDAKEGLCGIAMQASYP 344
           M R+I    G CGIAM+ASYP
Sbjct: 331 MARNIAEPTGKCGIAMEASYP 351


>gi|297826061|ref|XP_002880913.1| hypothetical protein ARALYDRAFT_481640 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297326752|gb|EFH57172.1| hypothetical protein ARALYDRAFT_481640 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 347

 Score =  353 bits (905), Expect = 8e-95,   Method: Compositional matrix adjust.
 Identities = 180/327 (55%), Positives = 235/327 (71%), Gaps = 15/327 (4%)

Query: 30  LNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEF 89
           L +A+  E+HE WMA++ RVY D +EK  RF IFK+N+E++ SFN   +N  YKL +NEF
Sbjct: 26  LFEASPIEKHEQWMARFNRVYSDESEKRNRFNIFKKNLEFVQSFNMN-KNITYKLDVNEF 84

Query: 90  ADQTNEEFRAPRNGYKRRLP------SVRSSETTDVSFRYENAS-VPASIDWRKKGAVTG 142
           +D T+EEFRA   G    +P      S  SS+ T V FRY N S    S+DWR++GAVT 
Sbjct: 85  SDLTDEEFRATHTGLV--VPEEITGISTLSSDKT-VPFRYGNVSDTGESMDWRQEGAVTP 141

Query: 143 VKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFE 202
           VK QG+CG CWAFSAVAA+EGI  IT  +L SLSEQ+L+DCDT   +QGC GG+M  AFE
Sbjct: 142 VKYQGRCGGCWAFSAVAAVEGITKITKGELVSLSEQQLLDCDTD-YNQGCHGGIMSKAFE 200

Query: 203 FIISNKGLATEAKYPYKASDGSCNKKEANPS---AAKISGYEDVPSNNEAALMKAVANQP 259
           +II N+G+ TE  YPY+ S  +C+      S   AA ISGYE VP NNE AL++AV+ QP
Sbjct: 201 YIIKNQGITTEDNYPYQESQQTCSSSTTLSSSFRAATISGYETVPMNNEEALLQAVSQQP 260

Query: 260 VSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGEN 319
           VSV I+ +G+ F+ YS G+F G+CGT+L H VT VGYG +++GTKYW+VKNSWG TWGE+
Sbjct: 261 VSVGIEGTGAGFRHYSGGIFNGECGTDLHHAVTIVGYGMSEEGTKYWVVKNSWGETWGED 320

Query: 320 GYIRMQRDIDAKEGLCGIAMQASYPTA 346
           G++R++RD+DA +G+CG+AM A YP A
Sbjct: 321 GFMRIKRDVDAPQGMCGLAMLAFYPLA 347


>gi|414585111|tpg|DAA35682.1| TPA: cysteine proteinase Mir3 [Zea mays]
          Length = 468

 Score =  353 bits (905), Expect = 8e-95,   Method: Compositional matrix adjust.
 Identities = 175/308 (56%), Positives = 219/308 (71%), Gaps = 12/308 (3%)

Query: 42  WMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARN--KPYKLGINEFADQTNEEFRA 99
           WMA +GR Y    E+E R+++F++N+ YI + N  A      ++LG+N FAD TN+E+RA
Sbjct: 49  WMAAHGRTYNAVGEEERRYQVFRDNLRYIDAHNAAADAGVHSFRLGLNRFADLTNDEYRA 108

Query: 100 PRNGYKRRLPSVRSSETTDVSFRY---ENASVPASIDWRKKGAVTGVKDQGQCGCCWAFS 156
              G + R    R      +  RY   +N  +P S+DWR KGAV  VKDQG CG CWAFS
Sbjct: 109 TYLGARTRPQRERK-----LGARYHAADNEDLPESVDWRAKGAVAEVKDQGSCGSCWAFS 163

Query: 157 AVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKY 216
            +AA+EGIN I T  L SLSEQELVDCDTS  +QGC GGLMD AFEFII+N G+ TE  Y
Sbjct: 164 TIAAVEGINQIVTGDLISLSEQELVDCDTS-YNQGCNGGLMDYAFEFIINNGGIDTEKDY 222

Query: 217 PYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSS 276
           PYK +DG C+    N     I  YEDVP+N+E +L KAVANQPVSVAI+A+G+ FQ YSS
Sbjct: 223 PYKGTDGRCDVNRKNAKVVTIDSYEDVPANDEKSLQKAVANQPVSVAIEAAGTAFQLYSS 282

Query: 277 GVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCG 336
           G+FTG CGT LDHGVTAVGYGT ++G  YW+VKNSWG++WGE+GY+RM+R+I A  G CG
Sbjct: 283 GIFTGSCGTALDHGVTAVGYGT-ENGKDYWIVKNSWGSSWGESGYVRMERNIKASSGKCG 341

Query: 337 IAMQASYP 344
           IA++ SYP
Sbjct: 342 IAVEPSYP 349


>gi|226501480|ref|NP_001150266.1| cysteine protease 1 precursor [Zea mays]
 gi|195637948|gb|ACG38442.1| cysteine protease 1 precursor [Zea mays]
          Length = 462

 Score =  353 bits (905), Expect = 9e-95,   Method: Compositional matrix adjust.
 Identities = 166/311 (53%), Positives = 222/311 (71%), Gaps = 8/311 (2%)

Query: 39  HEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFR 98
           +E+W+A++GR Y    E++ RF++F +N+ ++ + N +A    ++LG+N+FAD TN+EFR
Sbjct: 49  YELWLAEHGRAYNALGERDRRFRVFWDNLRFVDAHNERAAEHGFRLGMNQFADLTNDEFR 108

Query: 99  APRNGYKRRLPSVRSSETTDVSFRYENAS----VPASIDWRKKGAVTGVKDQGQCGCCWA 154
           A   G   R+P+ R   T  V  RY +      +P S+DWR+KGAV  VK+QGQCG CWA
Sbjct: 109 AAYLGA--RIPAARRRGTA-VGERYRHGGGAEELPESVDWREKGAVAPVKNQGQCGSCWA 165

Query: 155 FSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEA 214
           FSAV+++E +N I T ++ +LSEQELV+C T G + GC GGLMD AF+FII N G+ TE 
Sbjct: 166 FSAVSSVESVNQIVTGEMVTLSEQELVECSTDGGNSGCNGGLMDAAFDFIIKNGGIDTEG 225

Query: 215 KYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFY 274
            YPYKA DG C+    N     I G+EDVP N+E +L KAVA+QPVSVAI+A G +FQ Y
Sbjct: 226 DYPYKAVDGKCDINRENAKVVSIDGFEDVPENDEKSLQKAVAHQPVSVAIEAGGREFQLY 285

Query: 275 SSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGL 334
            +GVF+G C T LDHGV AVGYGT ++G  YW+V+NSWG  WGE+GYIRM+R+++A  G 
Sbjct: 286 KAGVFSGTCTTNLDHGVVAVGYGT-ENGKDYWIVRNSWGAKWGEDGYIRMERNVNATTGK 344

Query: 335 CGIAMQASYPT 345
           CGIAM ASYPT
Sbjct: 345 CGIAMMASYPT 355


>gi|2414570|emb|CAB16317.1| cysteine proteinase precursor [Nicotiana tabacum]
          Length = 374

 Score =  353 bits (905), Expect = 9e-95,   Method: Compositional matrix adjust.
 Identities = 174/325 (53%), Positives = 223/325 (68%), Gaps = 7/325 (2%)

Query: 24  QSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYK 83
           + W+   ++  +  R+EMW+A++GR Y    EKE RF+IFK+N+ +I   NN   N+ YK
Sbjct: 35  RKWTLQSDEDQVKNRYEMWLAEHGRAYNALGEKEKRFEIFKDNLRFIEGHNNSG-NRTYK 93

Query: 84  LGINEFADQTNEEFRAPRNGYKR--RLPSVRSSETTDVSFRYENASVPASIDWRKKGAVT 141
           +G+N+FAD TNEE+R    G K   R   V+S   +       N  +P S+DWRK+GAV 
Sbjct: 94  VGLNQFADLTNEEYRTMYLGTKSDARRRFVKSKNPSQRYASRPNELMPHSVDWRKRGAVA 153

Query: 142 GVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAF 201
            +K+QG CG CWAFS VAA+EGIN I T ++ +LSEQELVDCD   ++ GC GGLMD AF
Sbjct: 154 PIKNQGSCGSCWAFSTVAAVEGINQIVTGEMITLSEQELVDCDRV-QNSGCNGGLMDYAF 212

Query: 202 EFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVS 261
           EFIISN G+ TE  YPY+  +G C+    N     I GYEDVP  NE AL KAVA+QPV 
Sbjct: 213 EFIISNGGMDTEKHYPYRGVEGRCDPVRKNYKVVSIDGYEDVP-RNERALQKAVAHQPVC 271

Query: 262 VAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGY 321
           VAI+ASG  FQ YSSGVFTG+CG E+DHGV  VGYG+ +DG  YW+V+NSWGT WGENGY
Sbjct: 272 VAIEASGRAFQLYSSGVFTGECGEEVDHGVVVVGYGS-EDGVDYWIVRNSWGTKWGENGY 330

Query: 322 IRMQRDIDAKE-GLCGIAMQASYPT 345
           ++M+R++     G CGI  +ASYPT
Sbjct: 331 VKMERNVKKSHLGKCGIMTEASYPT 355


>gi|359359213|gb|AEV41117.1| putative oryzain beta chain precursor [Oryza officinalis]
          Length = 465

 Score =  352 bits (904), Expect = 1e-94,   Method: Compositional matrix adjust.
 Identities = 167/308 (54%), Positives = 219/308 (71%), Gaps = 6/308 (1%)

Query: 39  HEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFR 98
           +++W+A+ GR Y    E E RF++F +N+ +  + N +A +  ++LG+N FAD TNEEFR
Sbjct: 54  YDLWLAENGRSYNALGEHERRFRVFWDNLRFADAHNARADDHGFRLGMNRFADLTNEEFR 113

Query: 99  APRNGYKRRLPSVRSSETTDVSFRYENAS-VPASIDWRKKGAVTGVKDQGQCGCCWAFSA 157
           A   G K     V  S      +R++    +P S+DWR+KGAV  VK+QGQCG CWAFSA
Sbjct: 114 ATFLGAK----VVERSRAAGERYRHDGVEELPESVDWREKGAVAPVKNQGQCGSCWAFSA 169

Query: 158 VAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYP 217
           V+ +E IN + T ++ +LSEQELV+C T+G++ GC GGLMDDAF+FII N G+ TE  YP
Sbjct: 170 VSTVESINQLVTGEMITLSEQELVECSTNGQNSGCNGGLMDDAFDFIIKNGGIDTEDDYP 229

Query: 218 YKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSG 277
           YKA DG C+    N     I G+EDVP N+E +L KAVA+QPVSVAI+A G +FQ Y SG
Sbjct: 230 YKAVDGKCDINRENAKVVSIDGFEDVPQNDEKSLQKAVAHQPVSVAIEAGGREFQLYHSG 289

Query: 278 VFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGI 337
           VF+G+CGT LDHGV AVGYGT D+G  YW+V+NSWG  WGE+GY+RM+R+I+   G CGI
Sbjct: 290 VFSGRCGTSLDHGVVAVGYGT-DNGKDYWIVRNSWGPKWGESGYVRMERNINVTTGKCGI 348

Query: 338 AMQASYPT 345
           AM ASYPT
Sbjct: 349 AMMASYPT 356


>gi|595986|gb|AAA79915.1| cysteine proteinase, partial [Dianthus caryophyllus]
          Length = 427

 Score =  352 bits (904), Expect = 1e-94,   Method: Compositional matrix adjust.
 Identities = 175/309 (56%), Positives = 217/309 (70%), Gaps = 6/309 (1%)

Query: 40  EMWMAQYGRVYRDNAEKEMRFKIFKENVEYI---ASFNNKARNKPYKLGINEFADQTNEE 96
           + W+ ++ + Y    EKE RF IF++N+E+I    + NN      ++LG+N+FAD TN+E
Sbjct: 6   QSWLVKHRKNYNALGEKEKRFAIFRDNLEFIDQHNNNNNGGGGGEFELGLNKFADLTNDE 65

Query: 97  FRAPRNGYKRRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQCGCCWAFS 156
           FR    G KR  P    S  +D     E   +P S+DWRKKGAV+ VKDQGQCG CWAFS
Sbjct: 66  FRRIYFGVKR--PEKAESVKSDRYAVKEGDELPESVDWRKKGAVSHVKDQGQCGSCWAFS 123

Query: 157 AVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKY 216
           A+ A+EGIN I T  L +LSEQELVDCDTS  + GC+GGLMD AF FII+N G+ T+  Y
Sbjct: 124 AIGAVEGINKIVTGDLITLSEQELVDCDTS-YNSGCDGGLMDYAFRFIINNGGIDTDKDY 182

Query: 217 PYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSS 276
           PYKA+DGSC+    N     I G EDVP+NNE AL KAVA+QPV +AI+A G DFQ Y S
Sbjct: 183 PYKATDGSCDSNRKNAKVVTIDGLEDVPANNEKALQKAVAHQPVRLAIEAGGRDFQLYKS 242

Query: 277 GVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCG 336
           GVFTG CGT LDHGV AVGYGT DDG  YW+V+NSWG  WGE+GYIRM+R+ ++K G CG
Sbjct: 243 GVFTGSCGTSLDHGVVAVGYGTTDDGKDYWIVRNSWGDDWGEDGYIRMERNTESKSGKCG 302

Query: 337 IAMQASYPT 345
           IA++ SYP 
Sbjct: 303 IAIEPSYPV 311


>gi|3688528|emb|CAA06243.1| pre-pro-TPE4A protein [Pisum sativum]
          Length = 360

 Score =  352 bits (904), Expect = 1e-94,   Method: Compositional matrix adjust.
 Identities = 185/310 (59%), Positives = 221/310 (71%), Gaps = 11/310 (3%)

Query: 39  HEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFR 98
           +E W + +  V R   EK  RF +FK NV ++   N    +KPYKL +N+FAD TN EFR
Sbjct: 40  YERWRSHH-TVTRSLDEKHNRFNVFKANVMHV--HNTNKLDKPYKLKLNKFADMTNYEFR 96

Query: 99  ---APRNGYKRRLPSVRSSETTDVSFRYENA-SVPASIDWRKKGAVTGVKDQGQCGCCWA 154
              A       R+   R     + +F YEN  +VP+SIDWRKKGAVT VKDQGQCG CWA
Sbjct: 97  RIYADSKVSHHRM--FRGMSNENGTFMYENVKNVPSSIDWRKKGAVTDVKDQGQCGSCWA 154

Query: 155 FSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEA 214
           FS + A+EGIN I T+KL SLSEQELVDCDT G ++GC GGLM+ AFEFI  N G+ TE+
Sbjct: 155 FSTIVAVEGINQIKTQKLVSLSEQELVDCDTGG-NEGCNGGLMEYAFEFIKQN-GITTES 212

Query: 215 KYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFY 274
            YPY A DG+C+ K+ + +   I GYE+VP NNEAAL+KA A QPVSVAIDA G +FQFY
Sbjct: 213 NYPYAAKDGTCDLKKEDKAEVSIDGYENVPINNEAALLKAAAKQPVSVAIDAGGYNFQFY 272

Query: 275 SSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGL 334
           S GVF+G CGT+L+HGV  VGYG   D TKYW+VKNSWG+ WGE GYIRMQR I  KEGL
Sbjct: 273 SEGVFSGHCGTDLNHGVAVVGYGVTQDRTKYWIVKNSWGSEWGEQGYIRMQRGISHKEGL 332

Query: 335 CGIAMQASYP 344
           CGIAM+ASYP
Sbjct: 333 CGIAMEASYP 342


>gi|18422289|ref|NP_568620.1| Granulin repeat cysteine protease family protein [Arabidopsis
           thaliana]
 gi|9757832|dbj|BAB08269.1| cysteine protease component of protease-inhibitor complex
           [Arabidopsis thaliana]
 gi|17065064|gb|AAL32686.1| cysteine protease component of protease-inhibitor complex
           [Arabidopsis thaliana]
 gi|21387153|gb|AAM47980.1| cysteine protease component of protease-inhibitor complex
           [Arabidopsis thaliana]
 gi|332007522|gb|AED94905.1| Granulin repeat cysteine protease family protein [Arabidopsis
           thaliana]
          Length = 463

 Score =  352 bits (904), Expect = 1e-94,   Method: Compositional matrix adjust.
 Identities = 179/318 (56%), Positives = 224/318 (70%), Gaps = 11/318 (3%)

Query: 31  NDATMNERHEMWMAQYGRVYRDN----AEKEMRFKIFKENVEYIASFNNKARNKPYKLGI 86
           +D+ +   +E WM ++G+   +     AEK+ RF+IFK+N+ +I   N K  N  YKLG+
Sbjct: 42  SDSEVERIYEAWMVEHGKKKMNQNGLGAEKDQRFEIFKDNLRFIDEHNTK--NLSYKLGL 99

Query: 87  NEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQ 146
             FAD TNEE+R+   G K   P+ R  +T+D        ++P S+DWRK+GAV  VKDQ
Sbjct: 100 TRFADLTNEEYRSMYLGAK---PTKRVLKTSDRYQARVGDALPDSVDWRKEGAVADVKDQ 156

Query: 147 GQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIIS 206
           G CG CWAFS + A+EGIN I T  L SLSEQELVDCDTS  +QGC GGLMD AFEFII 
Sbjct: 157 GSCGSCWAFSTIGAVEGINKIVTGDLISLSEQELVDCDTS-YNQGCNGGLMDYAFEFIIK 215

Query: 207 NKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDA 266
           N G+ TEA YPYKA+DG C++   N     I  YEDVP N+EA+L KA+A+QP+SVAI+A
Sbjct: 216 NGGIDTEADYPYKAADGRCDQNRKNAKVVTIDSYEDVPENSEASLKKALAHQPISVAIEA 275

Query: 267 SGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQR 326
            G  FQ YSSGVF G CGTELDHGV AVGYGT ++G  YW+V+NSWG  WGE+GYI+M R
Sbjct: 276 GGRAFQLYSSGVFDGLCGTELDHGVVAVGYGT-ENGKDYWIVRNSWGNRWGESGYIKMAR 334

Query: 327 DIDAKEGLCGIAMQASYP 344
           +I+A  G CGIAM+ASYP
Sbjct: 335 NIEAPTGKCGIAMEASYP 352


>gi|118124|sp|P25250.1|CYSP2_HORVU RecName: Full=Cysteine proteinase EP-B 2; Flags: Precursor
 gi|1146118|gb|AAA85036.1| cysteine proteinase EPB2 precursor [Hordeum vulgare subsp. vulgare]
          Length = 373

 Score =  352 bits (904), Expect = 1e-94,   Method: Compositional matrix adjust.
 Identities = 174/311 (55%), Positives = 215/311 (69%), Gaps = 7/311 (2%)

Query: 39  HEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFR 98
           +E W + + RV R +AEK  RF  FK N  +I S N +  + PY+L +N F D    EFR
Sbjct: 46  YERWQSAH-RVRRHHAEKHRRFGTFKSNAHFIHSHNKRG-DHPYRLHLNRFGDMDQAEFR 103

Query: 99  APRNGYKRRLPSVRSSETTDVSFRYENAS-VPASIDWRKKGAVTGVKDQGQCGCCWAFSA 157
           A   G  RR    +        +   N S +P S+DWR+KGAVTGVKDQG+CG CWAFS 
Sbjct: 104 ATFVGDLRRDTPSKPPSVPGFMYAALNVSDLPPSVDWRQKGAVTGVKDQGKCGSCWAFST 163

Query: 158 VAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYP 217
           V ++EGIN I T  L SLSEQEL+DCDT+  D GC+GGLMD+AFE+I +N GL TEA YP
Sbjct: 164 VVSVEGINAIRTGSLVSLSEQELIDCDTADND-GCQGGLMDNAFEYIKNNGGLITEAAYP 222

Query: 218 YKASDGSCNKKEA---NPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFY 274
           Y+A+ G+CN   A   +P    I G++DVP+N+E  L +AVANQPVSVA++ASG  F FY
Sbjct: 223 YRAARGTCNVARAAQNSPVVVHIDGHQDVPANSEEDLARAVANQPVSVAVEASGKAFMFY 282

Query: 275 SSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGL 334
           S GVFTG+CGTELDHGV  VGYG A+DG  YW VKNSWG +WGE GYIR+++D  A  GL
Sbjct: 283 SEGVFTGECGTELDHGVAVVGYGVAEDGKAYWTVKNSWGPSWGEQGYIRVEKDSGASGGL 342

Query: 335 CGIAMQASYPT 345
           CGIAM+ASYP 
Sbjct: 343 CGIAMEASYPV 353


>gi|21070926|gb|AAM34401.1|AF377947_7 putative cysteine proteinase [Oryza sativa Japonica Group]
 gi|31712050|gb|AAP68356.1| putative cysteine protease [Oryza sativa Japonica Group]
 gi|40538988|gb|AAR87245.1| putative cysteine protease [Oryza sativa Japonica Group]
 gi|108711126|gb|ABF98921.1| Papain family cysteine protease containing protein, expressed
           [Oryza sativa Japonica Group]
 gi|125545747|gb|EAY91886.1| hypothetical protein OsI_13535 [Oryza sativa Indica Group]
          Length = 350

 Score =  352 bits (903), Expect = 2e-94,   Method: Compositional matrix adjust.
 Identities = 179/315 (56%), Positives = 218/315 (69%), Gaps = 12/315 (3%)

Query: 38  RHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKP---YKLGINEFADQTN 94
           RHE WMA++G+ Y+D  EK  R ++F+ N + I SFN  A       ++L  N FAD T+
Sbjct: 41  RHEKWMAKHGKTYKDEEEKARRLEVFRANAKLIDSFNAAAEKDGGGGHRLATNRFADLTD 100

Query: 95  EEFRAPRNGYKRRLPSVRSSETTDVSFRYEN---ASVPASIDWRKKGAVTGVKDQGQCGC 151
           +EFRA R GY+R  P   +       F YEN   A+ P S+DWR  GAVTGVKDQG CGC
Sbjct: 101 DEFRAARTGYQR--PPA-AVAGAGGGFLYENFSLAAAPQSMDWRAMGAVTGVKDQGSCGC 157

Query: 152 CWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLA 211
           CWAFSAVAA+EG+  I T +L SLSEQELVDCD  GEDQGCEGGLMD AF++I    GLA
Sbjct: 158 CWAFSAVAAVEGLAKIRTGQLVSLSEQELVDCDVRGEDQGCEGGLMDTAFQYIARRGGLA 217

Query: 212 TEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDF 271
            E+ YPY+  D    +  A  +AA I G++DVPSN+E ALM AVA QPVSVAI+ +G  F
Sbjct: 218 AESSYPYRGVD-GACRAAAGRAAASIRGFQDVPSNDEGALMAAVARQPVSVAINGAGYVF 276

Query: 272 QFYSSGVFTGQ-CGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDA 330
           +FY  GV  G  CGTEL+H VTAVGYGTA DGT YWL+KNSWG +WGE GY+R++R +  
Sbjct: 277 RFYDRGVLGGAGCGTELNHAVTAVGYGTASDGTGYWLMKNSWGASWGEGGYVRIRRGV-G 335

Query: 331 KEGLCGIAMQASYPT 345
           +EG CGIA  ASYP 
Sbjct: 336 REGACGIAQMASYPV 350


>gi|118120|sp|P25249.1|CYSP1_HORVU RecName: Full=Cysteine proteinase EP-B 1; Flags: Precursor
 gi|1146116|gb|AAA85035.1| cysteine proteinase EPB1 precursor [Hordeum vulgare subsp. vulgare]
          Length = 371

 Score =  352 bits (902), Expect = 2e-94,   Method: Compositional matrix adjust.
 Identities = 175/311 (56%), Positives = 215/311 (69%), Gaps = 7/311 (2%)

Query: 39  HEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFR 98
           +E W + + RV R +AEK  RF  FK N  +I S +NK  + PY+L +N F D    EFR
Sbjct: 46  YERWQSAH-RVRRHHAEKHRRFGTFKSNAHFIHS-HNKRGDHPYRLHLNRFGDMDQAEFR 103

Query: 99  APRNGYKRRLPSVRSSETTDVSFRYENAS-VPASIDWRKKGAVTGVKDQGQCGCCWAFSA 157
           A   G  RR    +        +   N S +P S+DWR+KGAVTGVKDQG+CG CWAFS 
Sbjct: 104 ATFVGDLRRDTPAKPPSVPGFMYAALNVSDLPPSVDWRQKGAVTGVKDQGKCGSCWAFST 163

Query: 158 VAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYP 217
           V ++EGIN I T  L SLSEQEL+DCDT+  D GC+GGLMD+AFE+I +N GL TEA YP
Sbjct: 164 VVSVEGINAIRTGSLVSLSEQELIDCDTADND-GCQGGLMDNAFEYIKNNGGLITEAAYP 222

Query: 218 YKASDGSCNKKEA---NPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFY 274
           Y+A+ G+CN   A   +P    I G++DVP+N+E  L +AVANQPVSVA++ASG  F FY
Sbjct: 223 YRAARGTCNVARAAQNSPVVVHIDGHQDVPANSEEDLARAVANQPVSVAVEASGKAFMFY 282

Query: 275 SSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGL 334
           S GVFTG CGTELDHGV  VGYG A+DG  YW VKNSWG +WGE GYIR+++D  A  GL
Sbjct: 283 SEGVFTGDCGTELDHGVAVVGYGVAEDGKAYWTVKNSWGPSWGEQGYIRVEKDSGASGGL 342

Query: 335 CGIAMQASYPT 345
           CGIAM+ASYP 
Sbjct: 343 CGIAMEASYPV 353


>gi|50355623|dbj|BAD29960.1| cysteine protease [Daucus carota]
          Length = 460

 Score =  352 bits (902), Expect = 2e-94,   Method: Compositional matrix adjust.
 Identities = 167/314 (53%), Positives = 217/314 (69%), Gaps = 3/314 (0%)

Query: 32  DATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFAD 91
           D  +   +E W+ ++G+ Y    EKE RF+IFK+N  YI    N A+++ +KLG+N FAD
Sbjct: 37  DDVIMAAYESWLVKHGKSYNALGEKEQRFQIFKDNFLYIDE-QNAAKDRSFKLGLNRFAD 95

Query: 92  QTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQCGC 151
            TNEE+R+   G + +    + S  +         S+P S+DWR+ GAV  VKDQGQCG 
Sbjct: 96  LTNEEYRSKYTGIRTKDSRKKVSGKSQRYASLAGESLPESVDWREHGAVASVKDQGQCGS 155

Query: 152 CWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLA 211
           CWAFS ++A+EGIN I T KL +LSEQELVDCD S  ++GC GGLMDDAF+FII+N G+ 
Sbjct: 156 CWAFSTISAVEGINQIATGKLITLSEQELVDCDRS-YNEGCNGGLMDDAFQFIINNGGID 214

Query: 212 TEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDF 271
           ++A YPY   DG C++   N     I  YEDVP  +E AL KA ANQP+SVAI+ASG DF
Sbjct: 215 SDADYPYTGRDGQCDQYRKNAKVVTIDSYEDVPEYDEKALQKAAANQPISVAIEASGRDF 274

Query: 272 QFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAK 331
           QFY SG+FTG+CGT+LDHGV  VGYGT ++G  YW+V+NSWG  WGE GY+RM+R I +K
Sbjct: 275 QFYDSGIFTGKCGTDLDHGVVVVGYGT-ENGKDYWIVRNSWGADWGEKGYLRMERGISSK 333

Query: 332 EGLCGIAMQASYPT 345
            G+CGI  + SYP 
Sbjct: 334 AGICGITSEPSYPV 347


>gi|255568345|ref|XP_002525147.1| cysteine protease, putative [Ricinus communis]
 gi|223535606|gb|EEF37274.1| cysteine protease, putative [Ricinus communis]
          Length = 347

 Score =  351 bits (901), Expect = 3e-94,   Method: Compositional matrix adjust.
 Identities = 178/341 (52%), Positives = 233/341 (68%), Gaps = 16/341 (4%)

Query: 12  LAAILVLGVWAP-----QSWSRTLNDA--TMNERHEMWMAQYGRVYRDNAEKEMRFKIFK 64
           L  I +  +W P     +  S  ++ A   M  R++ W+ QYGR Y    E  +RF I+ 
Sbjct: 12  LMLITLCTLWIPSIARSEIHSLPIDSAPTAMKVRYDKWLEQYGRKYDTKDEYLLRFGIYH 71

Query: 65  ENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYE 124
            N+++I   N  ++N  +KL  N+FAD TN+EF +   GY+     +RS +  ++S  +E
Sbjct: 72  SNIQFIEYIN--SQNLSFKLTDNKFADLTNDEFNSIYLGYQ-----IRSYKRRNLSHMHE 124

Query: 125 NAS-VPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDC 183
           N++ +P ++DWR+ GAVT +KDQGQCG CWAFSAVAA+EGIN I T  L SLSEQELVDC
Sbjct: 125 NSTDLPDAVDWRENGAVTPIKDQGQCGSCWAFSAVAAVEGINKIKTGNLVSLSEQELVDC 184

Query: 184 DTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDV 243
           D +G+++GC GG M+ AF FI S  GL TE  YPYK +DGSC K + +  A  I GYE V
Sbjct: 185 DVNGDNKGCNGGFMEKAFTFIKSIGGLTTENDYPYKGTDGSCEKAKTDNHAVIIGGYETV 244

Query: 244 PSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGT 303
           P+NNE +L  AV+ QPVSVAIDASG +FQ YS GVF+G CG +L+HGVT VGYG  ++G 
Sbjct: 245 PANNENSLKVAVSKQPVSVAIDASGYEFQLYSEGVFSGYCGIQLNHGVTIVGYGD-NNGQ 303

Query: 304 KYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
           KYWLVKNSWG  WGE+GYIRM+RD    +G+CGIAM+ SYP
Sbjct: 304 KYWLVKNSWGKGWGESGYIRMKRDSSDTKGMCGIAMEPSYP 344


>gi|109119897|dbj|BAE96008.1| cysteine proteinase [Triticum aestivum]
          Length = 377

 Score =  351 bits (900), Expect = 3e-94,   Method: Compositional matrix adjust.
 Identities = 175/313 (55%), Positives = 219/313 (69%), Gaps = 11/313 (3%)

Query: 39  HEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFR 98
           +E W   + RV R +AEK  RF  FK NV +I S N +  ++PY+L +N F D +  EFR
Sbjct: 46  YERWQTAH-RVPRHHAEKHRRFGTFKSNVHFIHSHNKRG-DRPYRLRLNRFGDMSQAEFR 103

Query: 99  APRNG-----YKRRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQCGCCW 153
           A   G      +R  P+   S    +      + +P S+DWR+KGAVTGVK+QG+CG CW
Sbjct: 104 ATFAGSRVSDRRRDGPATPPSVPGFMYAAVNVSDLPRSVDWRQKGAVTGVKNQGKCGSCW 163

Query: 154 AFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATE 213
           AFS V ++EGIN I T KL SLSEQEL+DCDT+  D GCEGGLMD+AFE+I  N GL TE
Sbjct: 164 AFSTVVSVEGINAIRTGKLVSLSEQELIDCDTADND-GCEGGLMDNAFEYIKKNGGLTTE 222

Query: 214 AKYPYKASDGSCNKKE---ANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSD 270
           A YPY+A++G+C   +   ++P    I G++DVP+N+E AL KAVANQPVSV IDASG  
Sbjct: 223 AAYPYRAANGTCKAAKVAKSSPMVVHIDGHQDVPANSEEALAKAVANQPVSVGIDASGKA 282

Query: 271 FQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDA 330
           F FYS GVFTG+CGTELDHGV  VGYG A+DG  YW VKNSWG +WGE GYIR+++D  A
Sbjct: 283 FMFYSEGVFTGECGTELDHGVAVVGYGVAEDGKAYWTVKNSWGPSWGEKGYIRVEKDSGA 342

Query: 331 KEGLCGIAMQASY 343
           + GLCGIAM+ASY
Sbjct: 343 EGGLCGIAMEASY 355


>gi|225438807|ref|XP_002283263.1| PREDICTED: germination-specific cysteine protease 1-like isoform 1
           [Vitis vinifera]
          Length = 374

 Score =  351 bits (900), Expect = 3e-94,   Method: Compositional matrix adjust.
 Identities = 177/310 (57%), Positives = 219/310 (70%), Gaps = 9/310 (2%)

Query: 39  HEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFR 98
           ++ WMA++G+ Y    EKE RF+IFK+N+++I   N  A+N+ YK+G+N FAD TNEE+R
Sbjct: 46  YQWWMAKHGKAYNGLGEKEKRFEIFKDNLKFIDEHN--AQNRTYKVGLNRFADLTNEEYR 103

Query: 99  APRNGYKRRLPSVRSSETTDVSFRY---ENASVPASIDWRKKGAVTGVKDQGQCGCCWAF 155
           A   G  R  P  R ++  + S RY       +P S+DWR+ GAV  VKDQ  CG CWAF
Sbjct: 104 AIYLG-TRSDPKRRFAKLKNASPRYAVMPGEVLPESVDWRETGAVNPVKDQRSCGSCWAF 162

Query: 156 SAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAK 215
           S VAA+EGIN I T +L SLSEQELVDCDT   D GC GGLMD AF+FII N GL TE  
Sbjct: 163 STVAAVEGINQIVTGELISLSEQELVDCDTE-YDMGCNGGLMDYAFDFIIKNGGLDTEKD 221

Query: 216 YPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYS 275
           YPY   DG CN    +     I GYEDVP  +E AL KAVA+QPVSVA++A G   Q Y 
Sbjct: 222 YPYTGFDGECNLSGKSSKVVSIDGYEDVPPFDEKALQKAVAHQPVSVAVEAGGRALQLYV 281

Query: 276 SGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDI-DAKEGL 334
           SG+FTG+CGT LDHG+ AVGYGT ++GT YW+V+NSWG++WGENGYIRM+R++ DA  G 
Sbjct: 282 SGIFTGECGTALDHGIVAVGYGT-ENGTDYWIVRNSWGSSWGENGYIRMERNMADAFSGK 340

Query: 335 CGIAMQASYP 344
           CGIAM+ASYP
Sbjct: 341 CGIAMEASYP 350


>gi|297791625|ref|XP_002863697.1| hypothetical protein ARALYDRAFT_917391 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297309532|gb|EFH39956.1| hypothetical protein ARALYDRAFT_917391 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 463

 Score =  350 bits (899), Expect = 4e-94,   Method: Compositional matrix adjust.
 Identities = 182/321 (56%), Positives = 224/321 (69%), Gaps = 17/321 (5%)

Query: 31  NDATMNERHEMWMAQYGRVYRDN----AEKEMRFKIFKENVEYIASFNNKARNKPYKLGI 86
           +DA +   +E WM ++G+   +     AEK+ RF+IFK+N+ YI   N K  N  YKLG+
Sbjct: 42  SDAEVERIYEAWMVEHGKKKMNQNGLGAEKDQRFEIFKDNLRYIDEHNTK--NLSYKLGL 99

Query: 87  NEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYE---NASVPASIDWRKKGAVTGV 143
             FAD TN+E+R+   G K   P  R  +T+D   RYE     ++P S+DWRK+GAV  V
Sbjct: 100 TRFADLTNDEYRSMYLGAK---PVKRVLKTSD---RYEARVGDALPDSVDWRKEGAVADV 153

Query: 144 KDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEF 203
           KDQG CG CWAFS + A+EGIN I T  L SLSEQELVDCDTS  +QGC GGLMD AFEF
Sbjct: 154 KDQGSCGSCWAFSTIGAVEGINKIVTGDLISLSEQELVDCDTS-YNQGCNGGLMDYAFEF 212

Query: 204 IISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVA 263
           II N G+ TEA YPYKA+DG C++   N     I  YEDVP N+EA+L KA+A+QP+SVA
Sbjct: 213 IIKNGGIDTEADYPYKAADGRCDQNRKNAKVVTIDSYEDVPENSEASLKKALAHQPISVA 272

Query: 264 IDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIR 323
           I+A G  FQ YSSGVF G CGTELDHGV AVGYGT ++G  YW+V+NSWG  WGE+GYI+
Sbjct: 273 IEAGGRAFQLYSSGVFDGICGTELDHGVVAVGYGT-ENGKDYWIVRNSWGNRWGESGYIK 331

Query: 324 MQRDIDAKEGLCGIAMQASYP 344
           M R+I    G CGIAM+ASYP
Sbjct: 332 MARNIAEPTGKCGIAMEASYP 352


>gi|413919736|gb|AFW59668.1| cysteine protease 1 [Zea mays]
          Length = 469

 Score =  350 bits (899), Expect = 5e-94,   Method: Compositional matrix adjust.
 Identities = 171/305 (56%), Positives = 215/305 (70%), Gaps = 6/305 (1%)

Query: 42  WMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARN--KPYKLGINEFADQTNEEFRA 99
           WMA +GR Y    E+E RF++F++N+ Y+ + N  A      ++LG+N FAD TN+E+RA
Sbjct: 49  WMAAHGRTYNAVGEEERRFEVFRDNLRYVDAHNAAADAGVHSFRLGLNRFADLTNDEYRA 108

Query: 100 PRNGYKRRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVA 159
              G + R    R     D     +N  +P S+DWR KGAV  VKDQG CG CWAFS +A
Sbjct: 109 TYLGVRSR--PQRERRLGDRYLAGDNEDLPESVDWRAKGAVAEVKDQGSCGSCWAFSTIA 166

Query: 160 AMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYK 219
           A+EGIN I T  + SLSEQELVDCDTS  +QGC GGLMD AFEFII+N G+ TE  YPYK
Sbjct: 167 AVEGINQIVTGDMISLSEQELVDCDTS-YNQGCNGGLMDYAFEFIINNGGIDTEEDYPYK 225

Query: 220 ASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVF 279
            +DG C+    N     I  YEDVP+N+E +L KAVANQP+SVAI+A G  FQ Y+SG+F
Sbjct: 226 GTDGRCDVNRKNAKVVTIDSYEDVPANSEKSLQKAVANQPISVAIEAGGRAFQLYNSGIF 285

Query: 280 TGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAM 339
           TG CGT LDHGVTAVGYGT ++G  YW+VKNSWG++WGE+GY+RM+R+I A  G CGIA+
Sbjct: 286 TGTCGTALDHGVTAVGYGT-ENGKDYWIVKNSWGSSWGESGYVRMERNIKASSGKCGIAV 344

Query: 340 QASYP 344
           + SYP
Sbjct: 345 EPSYP 349


>gi|110737404|dbj|BAF00646.1| putative cysteine proteinase [Arabidopsis thaliana]
          Length = 345

 Score =  350 bits (898), Expect = 5e-94,   Method: Compositional matrix adjust.
 Identities = 183/356 (51%), Positives = 241/356 (67%), Gaps = 21/356 (5%)

Query: 1   MAMILLENKLVLAAILVLGVWAPQSWSRTL--NDATMNERHEMWMAQYGRVYRDNAEKEM 58
           MA I++   +++  IL  G    Q+ SRT+   + +M ++HE WMA++ R YRD  EK M
Sbjct: 1   MASIMVLVTVLI--ILFTGFRISQATSRTVIFREQSMVDKHEQWMARFSREYRDELEKNM 58

Query: 59  RFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYK--------RRLPS 110
           R  +FK+N+++I +FN K  NK YKLG+NEFAD TNEEF A   G K        + +  
Sbjct: 59  RRDVFKKNLKFIENFNKKG-NKSYKLGVNEFADWTNEEFLAIHTGLKGLTEVSPSKVVAK 117

Query: 111 VRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTR 170
             SS+T +VS       V  S DWR +GAVT VK QGQCGCCWAFSAVAA+EG+  I   
Sbjct: 118 TISSQTWNVS-----DMVVESKDWRAEGAVTPVKYQGQCGCCWAFSAVAAVEGVAKIAGG 172

Query: 171 KLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEA 230
            L SLSEQ+L+DCD    D+ C+GG+M DAF +++ N+G+A+E  Y Y+ SDG C +  A
Sbjct: 173 NLVSLSEQQLLDCDRE-YDRDCDGGIMSDAFNYVVQNRGIASENDYSYQGSDGGC-RSNA 230

Query: 231 NPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHG 290
            P AA+ISG++ VPSNNE AL++AV+ QPVSV++DA+G  F  YS GV+ G CGT  +H 
Sbjct: 231 RP-AARISGFQTVPSNNERALLEAVSRQPVSVSMDATGDGFMHYSGGVYDGPCGTSSNHA 289

Query: 291 VTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPTA 346
           VT VGYGT+ DGTKYWL KNSWG TW E GYIR++RD+   +G+CG+A  A YP A
Sbjct: 290 VTFVGYGTSQDGTKYWLAKNSWGETWEEKGYIRIRRDVAWPQGMCGVAQYAFYPVA 345


>gi|326493368|dbj|BAJ85145.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 436

 Score =  350 bits (898), Expect = 6e-94,   Method: Compositional matrix adjust.
 Identities = 173/309 (55%), Positives = 214/309 (69%), Gaps = 12/309 (3%)

Query: 42  WMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARN--KPYKLGINEFADQTNEEFRA 99
           WMA++G  Y    E+E RF+ F++N+ YI   N  A      ++LG+N FAD TNEE+R+
Sbjct: 46  WMAEHGSTYNAIGEEERRFEAFRDNLRYIDQHNAAADAGVHSFRLGLNRFADLTNEEYRS 105

Query: 100 PRNGYKRRLPSVRSSETTDVSFRYE---NASVPASIDWRKKGAVTGVKDQGQCGCCWAFS 156
              G + +    R      +S RY+   N  +P S+DWRKKGAV  VKDQG CG CWAFS
Sbjct: 106 TYLGARTKPDRERK-----LSARYQAADNDELPESVDWRKKGAVGAVKDQGGCGSCWAFS 160

Query: 157 AVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKY 216
           A+AA+EGIN I T  +  LSEQELVDCDTS  +QGC GGLMD AFEFII+N G+ +E  Y
Sbjct: 161 AIAAVEGINQIVTGDMIPLSEQELVDCDTS-YNQGCNGGLMDYAFEFIINNGGIDSEEDY 219

Query: 217 PYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSS 276
           PYK  D  C+  + N     I GYEDVP N+E +L KAVANQP+SVAI+A G  FQ Y S
Sbjct: 220 PYKERDNRCDANKKNAKVVTIDGYEDVPVNSEKSLQKAVANQPISVAIEAGGRAFQLYKS 279

Query: 277 GVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCG 336
           G+FTG CGT LDHGV AVGYGT ++G  YWLV+NSWG+ WGE+GYIRM+R+I A  G CG
Sbjct: 280 GIFTGTCGTALDHGVAAVGYGT-ENGKDYWLVRNSWGSVWGEDGYIRMERNIKASSGKCG 338

Query: 337 IAMQASYPT 345
           IA++ SYPT
Sbjct: 339 IAVEPSYPT 347


>gi|30141019|dbj|BAC75923.1| cysteine protease-1 [Helianthus annuus]
          Length = 461

 Score =  350 bits (898), Expect = 6e-94,   Method: Compositional matrix adjust.
 Identities = 171/314 (54%), Positives = 218/314 (69%), Gaps = 5/314 (1%)

Query: 32  DATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFAD 91
           D  +N  +E W+ ++G+ Y    EK+ RF+IFK+N+ +I   N  + +  YKLG+N+FAD
Sbjct: 45  DDEVNALYESWLVKHGKTYNALGEKDRRFQIFKDNLRFIDEHN--SGDHTYKLGLNKFAD 102

Query: 92  QTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENA-SVPASIDWRKKGAVTGVKDQGQCG 150
            TNEE+R    G K      + S+     + Y +  S+P  +DWR++GAVT VKDQG CG
Sbjct: 103 LTNEEYRMTYTGIKTIDDKKKLSKMKSDRYAYRSGDSLPEYVDWREQGAVTDVKDQGSCG 162

Query: 151 CCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGL 210
            CWAFS   ++EG+N I T  L S+SEQELV+CDTS  +QGC GGLMD AFEFII N G+
Sbjct: 163 SCWAFSTTGSVEGVNKIVTGDLISVSEQELVNCDTS-YNQGCNGGLMDYAFEFIIKNGGI 221

Query: 211 ATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSD 270
            TE  YPY   DG C+K + N     I  YEDVP N+E++L KAV+NQPV+VAI+A G D
Sbjct: 222 DTEEDYPYTGKDGKCDKNKKNAKVVTIDSYEDVPVNDESSLKKAVSNQPVAVAIEAGGRD 281

Query: 271 FQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDA 330
           FQFY+SG+FTG CGT LDHGV A GYGT +DG  YWLVKNSWG  WGE GY++M+R+I  
Sbjct: 282 FQFYTSGIFTGSCGTALDHGVLAAGYGT-EDGKDYWLVKNSWGAEWGEGGYLKMERNIAD 340

Query: 331 KEGLCGIAMQASYP 344
           K G CGIAM+ASYP
Sbjct: 341 KSGKCGIAMEASYP 354


>gi|13897890|gb|AAK48495.1|AF259983_1 putative cysteine protease [Ipomoea batatas]
          Length = 462

 Score =  350 bits (898), Expect = 6e-94,   Method: Compositional matrix adjust.
 Identities = 180/323 (55%), Positives = 224/323 (69%), Gaps = 17/323 (5%)

Query: 31  NDATMNERHEMWMAQYGRVYRD-NAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEF 89
           +D  +   +E W+ ++G+ Y     EK+ RF+IFK+N+ YI   N++  ++ YKLG+N F
Sbjct: 41  SDEEVMALYESWLVEHGKSYNGLGGEKDKRFEIFKDNLRYIDEQNSRG-DRSYKLGLNRF 99

Query: 90  ADQTNEEFRAPRNGYK----RRLPSVRSSETTDVSFRYE---NASVPASIDWRKKGAVTG 142
           AD TNEE+R+   G K    RR+   +S        RY      S+P SIDWR+KGAV  
Sbjct: 100 ADLTNEEYRSTYLGAKTDARRRIAKTKSDR------RYAPKAGGSLPDSIDWREKGAVAE 153

Query: 143 VKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFE 202
           VKDQG CG CWAFS +AA+EGIN I T +L SLSEQELVDCDTS  ++GC GGLMD AFE
Sbjct: 154 VKDQGSCGSCWAFSTIAAVEGINQIVTGELISLSEQELVDCDTS-YNEGCNGGLMDYAFE 212

Query: 203 FIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSV 262
           FII N G+ TEA YPY    G C++   N     I GYEDV   +EAAL +AVA QPVSV
Sbjct: 213 FIIKNGGIDTEADYPYTGRYGRCDQTRKNAKVVSIDGYEDVTPYDEAALKEAVAGQPVSV 272

Query: 263 AIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYI 322
           AI+A G DFQ YSSG+FTG CGT+LDHGVTAVGYGT ++G  YW+VKNSW  +WGE GY+
Sbjct: 273 AIEAGGRDFQLYSSGIFTGSCGTDLDHGVTAVGYGT-ENGVDYWIVKNSWAASWGEKGYL 331

Query: 323 RMQRDIDAKEGLCGIAMQASYPT 345
           RMQR++  K GLCGIA++ SYPT
Sbjct: 332 RMQRNVKDKNGLCGIAIEPSYPT 354


>gi|226496089|ref|NP_001149658.1| cysteine protease 1 precursor [Zea mays]
 gi|195629242|gb|ACG36262.1| cysteine protease 1 precursor [Zea mays]
          Length = 469

 Score =  350 bits (897), Expect = 7e-94,   Method: Compositional matrix adjust.
 Identities = 170/305 (55%), Positives = 215/305 (70%), Gaps = 6/305 (1%)

Query: 42  WMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARN--KPYKLGINEFADQTNEEFRA 99
           WMA +GR Y    E+E RF++F++N+ Y+ + N  A      ++LG+N FAD TN+E+RA
Sbjct: 49  WMAAHGRTYNAVGEEERRFEVFRDNLRYVDAHNAAADAGVHSFRLGLNRFADLTNDEYRA 108

Query: 100 PRNGYKRRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVA 159
              G + R    R     D     +N  +P S+DWR KGAV  +KDQG CG CWAFS +A
Sbjct: 109 TYLGVRSR--PQRERRLGDRYLAGDNEDLPESVDWRAKGAVAEIKDQGSCGSCWAFSTIA 166

Query: 160 AMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYK 219
           A+EGIN I T  + SLSEQELVDCDTS  +QGC GGLMD AFEFII+N G+ TE  YPYK
Sbjct: 167 AVEGINQIVTGDMISLSEQELVDCDTS-YNQGCNGGLMDYAFEFIINNGGIDTEEDYPYK 225

Query: 220 ASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVF 279
            +DG C+    N     I  YEDVP+N+E +L KAVANQP+SVAI+A G  FQ Y+SG+F
Sbjct: 226 GTDGRCDVNRKNAKVVTIDSYEDVPANSEKSLQKAVANQPISVAIEAGGRAFQLYNSGIF 285

Query: 280 TGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAM 339
           TG CGT LDHGVTAVGYGT ++G  YW+VKNSWG++WGE+GY+RM+R+I A  G CGIA+
Sbjct: 286 TGTCGTALDHGVTAVGYGT-ENGKDYWIVKNSWGSSWGESGYVRMERNIKASSGKCGIAV 344

Query: 340 QASYP 344
           + SYP
Sbjct: 345 EPSYP 349


>gi|28192373|gb|AAK07730.1| CPR1-like cysteine proteinase [Nicotiana tabacum]
          Length = 374

 Score =  350 bits (897), Expect = 8e-94,   Method: Compositional matrix adjust.
 Identities = 173/325 (53%), Positives = 222/325 (68%), Gaps = 7/325 (2%)

Query: 24  QSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYK 83
           + W+   ++  +  R+EMW+A++GR Y    EKE RF+IFK+N+ +I   NN   N+ YK
Sbjct: 35  RKWTLQSDEDQVKNRYEMWLAEHGRAYNALGEKEKRFEIFKDNLRFIEEHNNSG-NRTYK 93

Query: 84  LGINEFADQTNEEFRAPRNGYKR--RLPSVRSSETTDVSFRYENASVPASIDWRKKGAVT 141
           +G+N+FAD TNEE+R    G K   R   V+S   +       N  +P S+DWRK+GAV 
Sbjct: 94  VGLNQFADLTNEEYRTMYLGTKSDARRRFVKSKNPSQRYASRPNELMPHSVDWRKRGAVA 153

Query: 142 GVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAF 201
            +K+QG CG CWAFS VAA+ GIN I T ++ +LSEQELVDCD   ++ GC GGLMD AF
Sbjct: 154 PIKNQGSCGSCWAFSTVAAVGGINQIVTGEMITLSEQELVDCDRV-QNSGCNGGLMDYAF 212

Query: 202 EFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVS 261
           EFIISN G+ TE  YPY+  +G C+    N     I GYEDVP  NE AL KAVA+QPV 
Sbjct: 213 EFIISNGGMDTEKHYPYRGVEGRCDPVRKNYKVVSIDGYEDVP-RNERALQKAVAHQPVC 271

Query: 262 VAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGY 321
           VAI+ASG  FQ YSSGVFTG+CG E+DHGV  VGYG+ +DG  YW+V+NSWGT WGENGY
Sbjct: 272 VAIEASGRAFQLYSSGVFTGECGEEVDHGVVVVGYGS-EDGVDYWIVRNSWGTKWGENGY 330

Query: 322 IRMQRDIDAKE-GLCGIAMQASYPT 345
           ++M+R++     G CGI  +ASYPT
Sbjct: 331 VKMERNVKKSHLGKCGIMTEASYPT 355


>gi|359359066|gb|AEV40973.1| putative oryzain beta chain precursor [Oryza punctata]
          Length = 461

 Score =  350 bits (897), Expect = 8e-94,   Method: Compositional matrix adjust.
 Identities = 166/309 (53%), Positives = 223/309 (72%), Gaps = 7/309 (2%)

Query: 39  HEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKA-RNKPYKLGINEFADQTNEEF 97
           +++W+A+ GR Y    E+E RF++F +N++++ + N +A  +  ++LG+N FAD TN+EF
Sbjct: 49  YDLWLAENGRSYNALGERERRFRVFWDNLKFVDAHNARADEHGGFRLGMNRFADLTNDEF 108

Query: 98  RAPRNGYKRRLPSVRSSETTDVSFRYENAS-VPASIDWRKKGAVTGVKDQGQCGCCWAFS 156
           R+   G K     V  S      +R++    +P S+DWR+KGAV  VK+QGQCG CWAFS
Sbjct: 109 RSTFLGAK----VVERSRAAGERYRHDGVEELPESVDWREKGAVAPVKNQGQCGSCWAFS 164

Query: 157 AVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKY 216
           AV+ +E IN + T ++ +LSEQELV+C T+G++ GC GGLMDDAF+FII N G+ TE  Y
Sbjct: 165 AVSTVESINQLVTGEMITLSEQELVECSTNGQNSGCNGGLMDDAFDFIIKNGGIDTEDDY 224

Query: 217 PYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSS 276
           PYKA DG C+    N     I G+EDVP N+E +L KAVA+QPVSVAI+A G +FQ Y S
Sbjct: 225 PYKAVDGKCDINRENAKVVSIDGFEDVPQNDEKSLQKAVAHQPVSVAIEAGGREFQLYHS 284

Query: 277 GVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCG 336
           GVF+G+CGT LDHGV AVGYGT D+G  YW+V+NSWG  WGE+GY+RM+R+I+A  G CG
Sbjct: 285 GVFSGRCGTSLDHGVVAVGYGT-DNGKDYWIVRNSWGPKWGESGYVRMERNINATTGKCG 343

Query: 337 IAMQASYPT 345
           IAM ASYPT
Sbjct: 344 IAMMASYPT 352


>gi|18401420|ref|NP_565649.1| cysteine proteinase-like protein [Arabidopsis thaliana]
 gi|4314384|gb|AAD15594.1| cysteine proteinase [Arabidopsis thaliana]
 gi|17381154|gb|AAL36389.1| putative cysteine proteinase [Arabidopsis thaliana]
 gi|20465849|gb|AAM20029.1| putative cysteine proteinase [Arabidopsis thaliana]
 gi|330252901|gb|AEC07995.1| cysteine proteinase-like protein [Arabidopsis thaliana]
          Length = 348

 Score =  349 bits (896), Expect = 9e-94,   Method: Compositional matrix adjust.
 Identities = 182/352 (51%), Positives = 243/352 (69%), Gaps = 15/352 (4%)

Query: 6   LENKLVLAAILVLGVWAPQSWSR-TLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFK 64
           + + ++    + L      + SR +L +A+  E+HE WMA++ RVY D  EK  RF IFK
Sbjct: 1   MASTIIFILTIFLSYRTSLATSRGSLFEASAIEKHEQWMARFNRVYSDETEKRNRFNIFK 60

Query: 65  ENVEYIASFNNKARNK-PYKLGINEFADQTNEEFRAPRNGYK-----RRLPSVRSSETTD 118
           +N+E++ +FN    NK  YK+ INEF+D T+EEFRA   G        R+ ++ S + T 
Sbjct: 61  KNLEFVQNFN--MNNKITYKVDINEFSDLTDEEFRATHTGLVVPEAITRISTLSSGKNT- 117

Query: 119 VSFRYENASVPA-SIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSE 177
           V FRY N S    S+DWR++GAVT VK QG+CG CWAFSAVAA+EGI  IT  +L SLSE
Sbjct: 118 VPFRYGNVSDNGESMDWRQEGAVTPVKYQGRCGGCWAFSAVAAVEGITKITKGELVSLSE 177

Query: 178 QELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPS---A 234
           Q+L+DCD    +QGC GG+M  AFE+II N+G+ TE  YPY+ S  +C+      S   A
Sbjct: 178 QQLLDCDRD-YNQGCRGGIMSKAFEYIIKNQGITTEDNYPYQESQQTCSSSTTLSSSFRA 236

Query: 235 AKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAV 294
           A ISGYE VP NNE AL++AV+ QPVSV I+ +G+ F+ YS GVF G+CGT+L H VT V
Sbjct: 237 ATISGYETVPMNNEEALLQAVSQQPVSVGIEGTGAAFRHYSGGVFNGECGTDLHHAVTIV 296

Query: 295 GYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPTA 346
           GYG +++GTKYW+VKNSWG TWGENGY+R++RD+DA +G+CG+A+ A YP A
Sbjct: 297 GYGMSEEGTKYWVVKNSWGETWGENGYMRIKRDVDAPQGMCGLAILAFYPLA 348


>gi|224056176|ref|XP_002298740.1| predicted protein [Populus trichocarpa]
 gi|222845998|gb|EEE83545.1| predicted protein [Populus trichocarpa]
          Length = 455

 Score =  349 bits (896), Expect = 9e-94,   Method: Compositional matrix adjust.
 Identities = 182/322 (56%), Positives = 220/322 (68%), Gaps = 17/322 (5%)

Query: 32  DATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFAD 91
           +A     +EMW+ ++GR Y    EKE RF+IFK+N+++I   +N   N  YKLG+N+FAD
Sbjct: 18  EAETRRIYEMWLVKHGRAYNALGEKERRFEIFKDNLKFIDE-HNSVGNPSYKLGLNKFAD 76

Query: 92  QTNEEFRA----PRNGYKRRLPSVRSSETTDVSFRY---ENASVPASIDWRKKGAVTGVK 144
            +N+E+R+     R   K RL     SE      RY   E   +P ++DWR+KGAV  VK
Sbjct: 77  LSNDEYRSVYLGTRMDGKGRLLGGPKSE------RYLFKEGDDLPETVDWREKGAVAPVK 130

Query: 145 DQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFI 204
           DQGQCG CWAFS V A+EGIN I T  LTSLSEQELVDCD +  + GC GGLMD AF+FI
Sbjct: 131 DQGQCGSCWAFSTVGAVEGINQIVTGNLTSLSEQELVDCDKT-YNLGCNGGLMDYAFDFI 189

Query: 205 ISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAI 264
           I N G+ TE  YPYKA D  C+    N     I GYEDVP N+E +L KAVANQPVSVAI
Sbjct: 190 IENGGIDTEEDYPYKAIDSMCDPNRKNARVVTIDGYEDVPQNDEKSLKKAVANQPVSVAI 249

Query: 265 DASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRM 324
           +A G  FQ Y SGVFTG CGT+LDHGV  VGYGT + G  YW+V+NSWG  WGENGYIRM
Sbjct: 250 EAGGRGFQLYQSGVFTGSCGTQLDHGVVTVGYGT-EHGVDYWIVRNSWGPAWGENGYIRM 308

Query: 325 QRDIDAKE-GLCGIAMQASYPT 345
           +RD+ + E G CGIAM+ASYPT
Sbjct: 309 ERDVASTETGKCGIAMEASYPT 330


>gi|224136808|ref|XP_002326950.1| predicted protein [Populus trichocarpa]
 gi|222835265|gb|EEE73700.1| predicted protein [Populus trichocarpa]
          Length = 456

 Score =  349 bits (896), Expect = 1e-93,   Method: Compositional matrix adjust.
 Identities = 185/353 (52%), Positives = 235/353 (66%), Gaps = 20/353 (5%)

Query: 1   MAMIL-----LENKLVLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAE 55
           M M+L     L +   ++ I      A +S  RT  D  +   +E W+ ++G+ Y    E
Sbjct: 1   MLMLLFLVFALSSAFDMSIISYHQTHATKSSWRT--DDEVMAMYEEWLVKHGKNYNALGE 58

Query: 56  KEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRA----PRNGYKRRLPSV 111
           KE RF+IFK+N+ +I   N  + N+ Y +G+N FAD TNEEFR+     R G+K+RLP  
Sbjct: 59  KEKRFEIFKDNLMFIDQHN--SENRTYTVGLNRFADLTNEEFRSMYLGTRTGHKKRLP-- 114

Query: 112 RSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRK 171
              +T+D        S+P S+DWRK+GAV  VKDQG CG CWAFS +AA+EGIN I T  
Sbjct: 115 ---KTSDRYAPRVGDSLPDSVDWRKEGAVAEVKDQGGCGSCWAFSTIAAVEGINKIVTGD 171

Query: 172 LTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEAN 231
           L +LSEQELVDCDTS  ++GC GGLMD AFEFII+N G+ TE  YPY   DG C+    N
Sbjct: 172 LIALSEQELVDCDTS-YNEGCNGGLMDYAFEFIINNGGIDTEDDYPYLGRDGRCDTYRKN 230

Query: 232 PSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGV 291
                I  YEDVP N+E AL KAVANQPVSVAI+  G +FQ Y+SGVFTG+CGT LDHGV
Sbjct: 231 AKVVSIDSYEDVPENDETALKKAVANQPVSVAIEGGGRNFQLYNSGVFTGECGTSLDHGV 290

Query: 292 TAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
            AVGYGT + G  YW+V+NSWG +WGE+GYIRM+R+I +  G CGIA++ SYP
Sbjct: 291 AAVGYGT-EKGKDYWIVRNSWGKSWGESGYIRMERNIASPTGKCGIAIEPSYP 342


>gi|118486542|gb|ABK95110.1| unknown [Populus trichocarpa]
          Length = 465

 Score =  349 bits (896), Expect = 1e-93,   Method: Compositional matrix adjust.
 Identities = 175/310 (56%), Positives = 219/310 (70%), Gaps = 13/310 (4%)

Query: 39  HEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFR 98
           +E W+ ++G+ Y    EKE RF+IFK+N+ +I   N  + N+ Y +G+N FAD TNEEFR
Sbjct: 51  YEEWLVKHGKNYNALGEKEKRFEIFKDNLMFIDQHN--SENRTYTVGLNRFADLTNEEFR 108

Query: 99  A----PRNGYKRRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQCGCCWA 154
           +     R G+K+RLP     +T+D        S+P S+DWRK+GAV  VKDQG CG CWA
Sbjct: 109 SMYLGTRTGHKKRLP-----KTSDRYAPRVGDSLPDSVDWRKEGAVAEVKDQGGCGSCWA 163

Query: 155 FSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEA 214
           FS +AA+EGIN I T  L +LSEQELVDCDTS  ++GC GGLMD AFEFII+N G+ TE 
Sbjct: 164 FSTIAAVEGINKIVTGDLIALSEQELVDCDTS-YNEGCNGGLMDYAFEFIINNGGIDTED 222

Query: 215 KYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFY 274
            YPY   DG C+    N     I  YEDVP N+E AL KAVANQPVSVAI+  G +FQ Y
Sbjct: 223 DYPYLGRDGRCDTYRKNAKVVSIDSYEDVPENDETALKKAVANQPVSVAIEGGGRNFQLY 282

Query: 275 SSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGL 334
           +SGVFTG+CGT LDHGV AVGYGT + G  YW+V+NSWG +WGE+GYIRM+R+I +  G 
Sbjct: 283 NSGVFTGECGTSLDHGVAAVGYGT-EKGKDYWIVRNSWGKSWGESGYIRMERNIASPTGK 341

Query: 335 CGIAMQASYP 344
           CGIA++ SYP
Sbjct: 342 CGIAIEPSYP 351


>gi|162459393|ref|NP_001105993.1| cysteine protease component of protease-inhibitor complex precursor
           [Zea mays]
 gi|6682829|dbj|BAA88898.1| cysteine protease component of protease-inhibitor complex [Zea
           mays]
          Length = 465

 Score =  349 bits (895), Expect = 1e-93,   Method: Compositional matrix adjust.
 Identities = 174/308 (56%), Positives = 218/308 (70%), Gaps = 12/308 (3%)

Query: 42  WMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARN--KPYKLGINEFADQTNEEFRA 99
           WMA +GR Y    E+E R+++F++N+ YI + N  A      ++LG+N FAD TN+E+RA
Sbjct: 47  WMAAHGRTYNAVGEEERRYQVFRDNLRYIDAHNAAADAGVHSFRLGLNRFADLTNDEYRA 106

Query: 100 PRNGYKRRLPSVRSSETTDVSFRY---ENASVPASIDWRKKGAVTGVKDQGQCGCCWAFS 156
              G + R    R      +  RY   +N  +P S+DWR KGAV  VKDQG  G CWAFS
Sbjct: 107 TYLGARTRPQRERK-----LGARYHAADNEDLPESVDWRAKGAVAEVKDQGSYGSCWAFS 161

Query: 157 AVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKY 216
            +AA+EGIN I T  L SLSEQELVDCDTS  +QGC GGLMD AFEFII+N G+ TE  Y
Sbjct: 162 TIAAVEGINQIVTGDLISLSEQELVDCDTS-YNQGCNGGLMDYAFEFIINNGGIDTEKDY 220

Query: 217 PYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSS 276
           PYK +DG C+    N     I  YEDVP+N+E +L KAVANQPVSVAI+A+G+ FQ YSS
Sbjct: 221 PYKGTDGRCDVNRKNAKVVTIDSYEDVPANDEKSLQKAVANQPVSVAIEAAGTQFQLYSS 280

Query: 277 GVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCG 336
           G+FTG CGT LDHGVTAVGYGT ++G  YW+VKNSWG++WGE+GY+RM+R+I A  G CG
Sbjct: 281 GIFTGSCGTALDHGVTAVGYGT-ENGKDYWIVKNSWGSSWGESGYVRMERNIKASSGKCG 339

Query: 337 IAMQASYP 344
           IA++ SYP
Sbjct: 340 IAVEPSYP 347


>gi|162463464|ref|NP_001104879.1| cysteine proteinase Mir3 precursor [Zea mays]
 gi|2425066|gb|AAB88263.1| cysteine proteinase Mir3 [Zea mays]
          Length = 480

 Score =  349 bits (895), Expect = 1e-93,   Method: Compositional matrix adjust.
 Identities = 173/308 (56%), Positives = 217/308 (70%), Gaps = 12/308 (3%)

Query: 42  WMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARN--KPYKLGINEFADQTNEEFRA 99
           WMA +GR Y     +E R+++F++N+ YI + N  A      ++LG+N FAD TN+E+ A
Sbjct: 47  WMAAHGRTYNAVGAEERRYQVFRDNLRYIDAHNAAADAGVHSFRLGLNRFADLTNDEYPA 106

Query: 100 PRNGYKRRLPSVRSSETTDVSFRY---ENASVPASIDWRKKGAVTGVKDQGQCGCCWAFS 156
              G + R    R      +  RY   +N  +P S+DWR KGAV  VKDQG CG CWAFS
Sbjct: 107 TYLGARTRPQRDRK-----LGARYHAADNEDLPESVDWRAKGAVAEVKDQGSCGTCWAFS 161

Query: 157 AVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKY 216
            +AA+EGIN I T  L SLSEQELVDCDTS  +QGC GGLMD AFEFII+N G+ TE  Y
Sbjct: 162 TIAAVEGINQIVTGDLISLSEQELVDCDTS-YNQGCNGGLMDYAFEFIINNGGIDTEKDY 220

Query: 217 PYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSS 276
           PYK +DG C+    N     I  YEDVP+N+E +L KAVANQPVSVAI+A+G+ FQ YSS
Sbjct: 221 PYKGTDGRCDVNRKNAKVVTIDSYEDVPANDEKSLQKAVANQPVSVAIEAAGTAFQLYSS 280

Query: 277 GVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCG 336
           G+FTG CGT LDHGVTAVGYGT ++G  YW+VKNSWG++WGE+GY+RM+R+I A  G CG
Sbjct: 281 GIFTGSCGTRLDHGVTAVGYGT-ENGKDYWIVKNSWGSSWGESGYVRMERNIKASSGKCG 339

Query: 337 IAMQASYP 344
           IA++ SYP
Sbjct: 340 IAVEPSYP 347


>gi|449438381|ref|XP_004136967.1| PREDICTED: cysteine proteinase RD21a-like [Cucumis sativus]
          Length = 479

 Score =  348 bits (894), Expect = 2e-93,   Method: Compositional matrix adjust.
 Identities = 180/331 (54%), Positives = 223/331 (67%), Gaps = 19/331 (5%)

Query: 23  PQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPY 82
           P S +    D  +   +E W+  +G+ Y    EKE RF+IFK+N+ +I   N ++R   Y
Sbjct: 46  PHSDAHQRPDEEVAALYESWLVHHGKAYNAIGEKERRFEIFKDNLRFIDEHNRESRT--Y 103

Query: 83  KLGINEFADQTNEEFRAP----RNGYKRRLPSVRSSETTDVSFRYENA---SVPASIDWR 135
           K+G+  FAD TNEE+RA     R   K RL + +S        RY  A    +P  +DWR
Sbjct: 104 KVGLTRFADLTNEEYRARFLGGRFSRKPRLSAAKSG-------RYAAALGDDLPDDVDWR 156

Query: 136 KKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGG 195
           KKGAV  VKDQGQCG CWAFS+VAA+EGIN I T +L  LSEQELVDCD S  + GC GG
Sbjct: 157 KKGAVATVKDQGQCGSCWAFSSVAAVEGINQIVTGELIPLSEQELVDCDKS-FNMGCNGG 215

Query: 196 LMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAV 255
           LMD AF+FII N G+ TE  YPYK  D +C+    N     I GYEDVP N+E++L KAV
Sbjct: 216 LMDYAFQFIIGNGGIDTEEDYPYKGRDAACDPNRKNAKVVTIDGYEDVPENDESSLKKAV 275

Query: 256 ANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTT 315
           ANQPVSVAI+A G  FQ Y SGVFTG+CGT+LDHGV AVGYGT D+GT YW+V+NSWG  
Sbjct: 276 ANQPVSVAIEAGGRAFQLYQSGVFTGRCGTDLDHGVVAVGYGT-DNGTDYWIVRNSWGKD 334

Query: 316 WGENGYIRMQRDI-DAKEGLCGIAMQASYPT 345
           WGE+GYIR++R++ +   G CGIA+Q SYPT
Sbjct: 335 WGESGYIRLERNVANITTGKCGIAVQPSYPT 365


>gi|357507617|ref|XP_003624097.1| Cysteine protease [Medicago truncatula]
 gi|355499112|gb|AES80315.1| Cysteine protease [Medicago truncatula]
          Length = 340

 Score =  348 bits (893), Expect = 2e-93,   Method: Compositional matrix adjust.
 Identities = 179/336 (53%), Positives = 234/336 (69%), Gaps = 13/336 (3%)

Query: 11  VLAAILVLGVWAPQSWSRTLNDA-TMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEY 69
           ++  ++V+ V  P + ++  + + T++ER++ W  +Y  +Y+D+AE+E   +IFK NV Y
Sbjct: 10  LINILIVIWVMFPSNQNQENDQSLTLSERYKHWKIKYRVIYKDDAEEEKHIQIFKHNVAY 69

Query: 70  IASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENAS-V 128
           I SFN  A NK YKL IN FAD   E      +G+K+R    +   TT   F+Y+N + +
Sbjct: 70  IDSFN-AAGNKSYKLTINRFADLPTE---PSDDGFKKR----KLEPTTSSLFKYKNITDI 121

Query: 129 PASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGE 188
           PA++DWRK+GAVT VK+Q +CG CWAFSAV A+EGI  IT+  L SLSEQELVD   S  
Sbjct: 122 PAAVDWRKRGAVTPVKNQRECGSCWAFSAVGALEGIQQITSGNLVSLSEQELVDRVRSNW 181

Query: 189 DQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNE 248
             GC GG + DAFEF++ N G+ATEA YPY+   G+ +KK +     +I  YE VP N+E
Sbjct: 182 TNGCNGGYLIDAFEFVLENGGIATEASYPYRGVKGNNSKKVSR--QVQIKSYEQVPRNSE 239

Query: 249 AALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLV 308
            +L+K VANQPVSV ID SG   +FYSSG+FTG+CGT+ +H V  VGYGT++DGTKYWLV
Sbjct: 240 DSLLKVVANQPVSVGIDISGM-IRFYSSGIFTGECGTKPNHAVIIVGYGTSNDGTKYWLV 298

Query: 309 KNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
           KNSWG  WGE  YIRM+RDIDAKEGLCGI M ASYP
Sbjct: 299 KNSWGIRWGEKRYIRMKRDIDAKEGLCGIPMDASYP 334


>gi|225428328|ref|XP_002279940.1| PREDICTED: cysteine proteinase-like [Vitis vinifera]
          Length = 707

 Score =  348 bits (892), Expect = 2e-93,   Method: Compositional matrix adjust.
 Identities = 171/309 (55%), Positives = 217/309 (70%), Gaps = 8/309 (2%)

Query: 38  RHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEF 97
           R E W++++G+VY+   EK  RF++F+EN+ +I   N +  +  Y LG+NEFAD ++EEF
Sbjct: 403 RFESWVSKHGKVYKSMEEKLHRFEVFRENLNHIDERNKEVSS--YWLGLNEFADLSHEEF 460

Query: 98  RAPRNGYKRRLPSVRSSETTDVSFRYEN-ASVPASIDWRKKGAVTGVKDQGQCGCCWAFS 156
           ++   G +   P  R        FRY + A +P S+DWRKKGAVT VK+QG CG CWAFS
Sbjct: 461 KSKYLGLRAEFPRSRDYSG---EFRYRDVADLPESVDWRKKGAVTHVKNQGACGSCWAFS 517

Query: 157 AVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKY 216
            VAA+EGIN I T  LT+LSEQEL+DCDT+  + GC GGLMD AF FI SN GL  E  Y
Sbjct: 518 TVAAVEGINQIVTGNLTTLSEQELIDCDTTF-NSGCNGGLMDYAFAFIASNGGLHKEDDY 576

Query: 217 PYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSS 276
           PY   +G+C +++ +     ISGYEDVP  +E +L+KA+A+QP+SVAI+ASG DFQFYS 
Sbjct: 577 PYLMEEGTCEEQKEDVDIVTISGYEDVPEKDEESLLKALAHQPLSVAIEASGRDFQFYSG 636

Query: 277 GVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCG 336
           GVF G CGTELDHGV AVGYG++  G  Y +VKNSWG  WGE GYIRM+R+    EGLCG
Sbjct: 637 GVFNGPCGTELDHGVAAVGYGSS-KGLDYIIVKNSWGPKWGEKGYIRMKRNTGKTEGLCG 695

Query: 337 IAMQASYPT 345
           I   ASYPT
Sbjct: 696 INKMASYPT 704


>gi|413944252|gb|AFW76901.1| hypothetical protein ZEAMMB73_101481 [Zea mays]
          Length = 232

 Score =  348 bits (892), Expect = 2e-93,   Method: Compositional matrix adjust.
 Identities = 164/228 (71%), Positives = 186/228 (81%), Gaps = 5/228 (2%)

Query: 121 FRYENASV---PASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSE 177
           FRYEN SV   PA+IDWR  GAVT +KDQGQCGCCWAFSAVAA EGI  I+T KL SLSE
Sbjct: 6   FRYENVSVDAIPATIDWRTNGAVTPIKDQGQCGCCWAFSAVAATEGIVKISTGKLISLSE 65

Query: 178 QELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKI 237
           QELVDCD  GEDQGCEGGLMDDAF+FII N GL TE+ YPY A+DG C  K  + SAA I
Sbjct: 66  QELVDCDVYGEDQGCEGGLMDDAFKFIIKNGGLTTESNYPYTAADGKC--KSGSNSAANI 123

Query: 238 SGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYG 297
            GYEDVP+N+EAALMKAVANQPVSVA+D     FQFYS GV TG CGT+LDHG+ A+GYG
Sbjct: 124 KGYEDVPTNDEAALMKAVANQPVSVAVDGGDMTFQFYSGGVMTGSCGTDLDHGIAAIGYG 183

Query: 298 TADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPT 345
              DGTKYWL+KNSWGTTWGENGY+RM++DI  K+G+CG+A++ SYPT
Sbjct: 184 KTSDGTKYWLMKNSWGTTWGENGYLRMEKDISDKKGMCGLAIEPSYPT 231


>gi|357129125|ref|XP_003566217.1| PREDICTED: thiol protease SEN102-like [Brachypodium distachyon]
          Length = 380

 Score =  348 bits (892), Expect = 3e-93,   Method: Compositional matrix adjust.
 Identities = 182/316 (57%), Positives = 220/316 (69%), Gaps = 12/316 (3%)

Query: 39  HEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFR 98
           +E W A++  V RD AEK  RF +F+EN   +  FN + R+ PYKL +N FAD T++EFR
Sbjct: 49  YERWRARH-TVSRDLAEKSRRFNVFRENARLVHEFNLR-RDAPYKLRLNRFADLTSDEFR 106

Query: 99  --------APRNGYKRRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQCG 150
                   +    +K R  +    +    S      ++P S+DWR+KGAVTGVKDQGQCG
Sbjct: 107 RSYASSRVSHHRMFKPRAANNNDDDDDKGSSFTHGGALPTSVDWREKGAVTGVKDQGQCG 166

Query: 151 CCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGL 210
            CWAFS +AA+EGIN I T  LTSLSEQ+LVDCDT   + GC+GGLMDDAF +I  + G+
Sbjct: 167 SCWAFSTIAAVEGINAIRTNNLTSLSEQQLVDCDTK-TNAGCDGGLMDDAFSYIAKHGGV 225

Query: 211 ATEAKYPYKA-SDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGS 269
           A E  YPY+A    SCN K+A  +   I GYEDVP N+E AL KAVA QPV+VAI+A GS
Sbjct: 226 AAEKSYPYRARQSSSCNSKKAAAAVVSIDGYEDVPRNDETALKKAVAAQPVAVAIEAGGS 285

Query: 270 DFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDID 329
            FQFYS GVF G+CGTELDHGV AVGYG   DGTKYW+VKNSWG  WGE GYIRM+RD+ 
Sbjct: 286 HFQFYSEGVFAGKCGTELDHGVAAVGYGVTVDGTKYWIVKNSWGEEWGEKGYIRMKRDVA 345

Query: 330 AKEGLCGIAMQASYPT 345
            KEGLCGIAM+ASYP 
Sbjct: 346 DKEGLCGIAMEASYPV 361


>gi|37780041|gb|AAP32193.1| cysteine protease 14 [Trifolium repens]
          Length = 351

 Score =  347 bits (891), Expect = 3e-93,   Method: Compositional matrix adjust.
 Identities = 168/309 (54%), Positives = 216/309 (69%), Gaps = 5/309 (1%)

Query: 37  ERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEE 96
           E  E WM+++G++Y    EK +RF++FK+N+++I   N    N  Y LG+NEFAD +++E
Sbjct: 45  ELFESWMSRHGKIYETIEEKLLRFEVFKDNLKHIDERNKIVSN--YWLGLNEFADLSHQE 102

Query: 97  FRAPRNGYKRRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQCGCCWAFS 156
           F+    G K  L   R S + +  F Y +  +P S+DWRKKGAVT VK+QGQCG CWAFS
Sbjct: 103 FKNKYLGLKVNLSQRRES-SNEEEFTYRDVDLPKSVDWRKKGAVTPVKNQGQCGSCWAFS 161

Query: 157 AVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKY 216
            VAA+EGIN I T  LTSLSEQEL+DCDT+  + GC GGLMD AF FI+ N GL  E  Y
Sbjct: 162 TVAAVEGINQIVTGNLTSLSEQELIDCDTT-YNNGCNGGLMDYAFSFIVQNGGLHKEDDY 220

Query: 217 PYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSS 276
           PY   + +C  K+       I+GY DVP NNE +L+KA+ANQP+SVAI+AS  DFQFYS 
Sbjct: 221 PYIMEESTCEMKKEETQVVTINGYHDVPQNNEQSLLKALANQPLSVAIEASSRDFQFYSG 280

Query: 277 GVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCG 336
           GVF G CG++LDHGV+AVGYGT+ +   Y +VKNSWG  WGE G+IRM+R+I   EG+CG
Sbjct: 281 GVFDGHCGSDLDHGVSAVGYGTSKN-LDYIIVKNSWGAKWGEKGFIRMKRNIGKPEGICG 339

Query: 337 IAMQASYPT 345
           +   ASYPT
Sbjct: 340 LYKMASYPT 348


>gi|449460678|ref|XP_004148072.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like [Cucumis
           sativus]
          Length = 317

 Score =  347 bits (890), Expect = 4e-93,   Method: Compositional matrix adjust.
 Identities = 172/312 (55%), Positives = 218/312 (69%), Gaps = 12/312 (3%)

Query: 35  MNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTN 94
           + +R++ WM +YGR Y+   E E RF I++ NV+YI +FN  + N  + L  N FAD TN
Sbjct: 15  IQDRYQKWMDKYGRQYKSREEWERRFTIYQANVQYIDNFN--SMNHSHTLAENNFADLTN 72

Query: 95  EEFRAPRNGYKRRLPSVRSSETTDVSFRYEN-ASVPASIDWRKKGAVTGVKDQGQCGCCW 153
           EEF+A   GYK       +    D  FRY N  ++P ++DWR++GAVT +K+QGQCG CW
Sbjct: 73  EEFKATYLGYK-------TVSIPDTCFRYGNMVNLPTNVDWRQEGAVTPIKNQGQCGSCW 125

Query: 154 AFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATE 213
           AFSAVAA+EGIN I   KL SLSEQELVDCD +  +QGC GG M  AFEFI    GL TE
Sbjct: 126 AFSAVAAVEGINKIKAGKLISLSEQELVDCDVTSGNQGCNGGYMYKAFEFI-KRTGLTTE 184

Query: 214 AKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQF 273
            +YPY+ ++ +CN+++       ISGYE VP N+E +L  AVANQPVSVAIDA G++FQF
Sbjct: 185 IEYPYQGAESACNEQKEKYQFVSISGYEKVPVNDEKSLKAAVANQPVSVAIDAEGNNFQF 244

Query: 274 YSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEG 333
           YS G+F+G CG +L+HGV  VGYG   +   YWLVKNSWGT WGE+GYIRM+RD   ++G
Sbjct: 245 YSGGIFSGNCGNQLNHGVAIVGYGETSN-QAYWLVKNSWGTDWGESGYIRMKRDSTDRQG 303

Query: 334 LCGIAMQASYPT 345
            CGIAM ASYPT
Sbjct: 304 TCGIAMMASYPT 315


>gi|357115272|ref|XP_003559414.1| PREDICTED: thiol protease SEN102-like [Brachypodium distachyon]
          Length = 360

 Score =  347 bits (890), Expect = 4e-93,   Method: Compositional matrix adjust.
 Identities = 186/338 (55%), Positives = 222/338 (65%), Gaps = 25/338 (7%)

Query: 28  RTLND----ATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKP-- 81
           R L+D    A M  RHE WMA++GR Y D  EK  R +IF+ N E I SFN+KA      
Sbjct: 28  RELDDVAVGAAMASRHESWMAEHGRTYADAEEKARRLEIFRANAERIDSFNSKADAAAGE 87

Query: 82  ----YKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENASVPA----SID 133
               ++L  N FAD T+EEFRA R G +R       +      FRYEN S+ A    S+D
Sbjct: 88  SVDSHRLATNRFADLTDEEFRAARTGLRR---PAAVAGAVGGGFRYENFSLQADAAGSMD 144

Query: 134 WRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCE 193
           WR  GAVTGVKDQG CGCCWAFSAVAAMEG+  I T +L SLSEQ+LVDCD  G+DQGCE
Sbjct: 145 WRAMGAVTGVKDQGSCGCCWAFSAVAAMEGLTKIRTGRLVSLSEQQLVDCDVYGDDQGCE 204

Query: 194 GGLMDDAFEFIISNKGLATEAKYPYKASD-GSCNKKEANPSAAKISGYEDVPSNNEAALM 252
           GGLMD+AF++I    GLA+E+ YPY   D GSC    A P AA I G+EDVP+NNE ALM
Sbjct: 205 GGLMDNAFQYISRQGGLASESAYPYSGEDGGSCRSGRAQP-AASIRGHEDVPANNEGALM 263

Query: 253 KAVANQPVSVAIDASGSDFQFY----SSGVFTGQC-GTELDHGVTAVGYGTADDGTKYWL 307
            AVA+QPVSVAI+     F+FY          G C  TELDH +TAVGYG A DGT YWL
Sbjct: 264 AAVAHQPVSVAINGGDYVFRFYDRGVLGAGGNGGCESTELDHAITAVGYGMAGDGTGYWL 323

Query: 308 VKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPT 345
           +KNSWG+ WGE+GY+R++R     EG+CG+A  ASYP 
Sbjct: 324 MKNSWGSGWGESGYVRIRRG-SRGEGVCGLAKLASYPV 360


>gi|356508490|ref|XP_003522989.1| PREDICTED: xylem cysteine proteinase 2-like [Glycine max]
          Length = 349

 Score =  347 bits (890), Expect = 5e-93,   Method: Compositional matrix adjust.
 Identities = 171/309 (55%), Positives = 213/309 (68%), Gaps = 7/309 (2%)

Query: 37  ERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEE 96
           E  E WM+++G++Y+   EK +RF+IFK+N+++I   N    N  Y LG+NEFAD +++E
Sbjct: 45  ELFESWMSKHGKIYQSIEEKLLRFEIFKDNLKHIDERNKVVSN--YWLGLNEFADLSHQE 102

Query: 97  FRAPRNGYKRRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQCGCCWAFS 156
           F+    G K      R S      F Y++  +P S+DWRKKGAV  VK+QG CG CWAFS
Sbjct: 103 FKNKYLGLKVDYSRRRESPE---EFTYKDVELPKSVDWRKKGAVAPVKNQGSCGSCWAFS 159

Query: 157 AVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKY 216
            VAA+EGIN I T  LTSLSEQEL+DCD +  + GC GGLMD AF FI+ N GL  E  Y
Sbjct: 160 TVAAVEGINQIVTGNLTSLSEQELIDCDRT-YNNGCNGGLMDYAFSFIVENGGLHKEEDY 218

Query: 217 PYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSS 276
           PY   +G+C   +       ISGY DVP NNE +L+KA+ANQP+SVAI+ASG DFQFYS 
Sbjct: 219 PYIMEEGTCEMTKEETEVVTISGYHDVPQNNEQSLLKALANQPLSVAIEASGRDFQFYSG 278

Query: 277 GVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCG 336
           GVF G CG++LDHGV AVGYGTA  G  Y +VKNSWG+ WGE GYIRM+R+I   EG+CG
Sbjct: 279 GVFDGHCGSDLDHGVAAVGYGTA-KGVDYIIVKNSWGSKWGEKGYIRMRRNIGKPEGICG 337

Query: 337 IAMQASYPT 345
           I   ASYPT
Sbjct: 338 IYKMASYPT 346


>gi|224131910|ref|XP_002328138.1| predicted protein [Populus trichocarpa]
 gi|222837653|gb|EEE76018.1| predicted protein [Populus trichocarpa]
          Length = 349

 Score =  347 bits (890), Expect = 5e-93,   Method: Compositional matrix adjust.
 Identities = 173/311 (55%), Positives = 219/311 (70%), Gaps = 7/311 (2%)

Query: 35  MNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTN 94
           + E  E W++ +G+ Y    EK  RF++FKEN+++I   N +  +  Y LG+NEFAD ++
Sbjct: 43  LVELFESWISGHGKAYNSLEEKLHRFEVFKENLKHIDQRNKEVTS--YWLGLNEFADLSH 100

Query: 95  EEFRAPRNGYKRRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQCGCCWA 154
           EEF++   G     P  +SSE  D S+R +   +P SIDWRKKGAVT VK+QG CG CWA
Sbjct: 101 EEFKSKFLGLYPEFPRKKSSE--DFSYR-DVVDLPKSIDWRKKGAVTPVKNQGSCGSCWA 157

Query: 155 FSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEA 214
           FS VAA+EGIN I    LTSLSEQ+L+DCDTS  + GC GGLMD AFEFI++N GL  E 
Sbjct: 158 FSTVAAVEGINQIVAGNLTSLSEQQLIDCDTSF-NNGCNGGLMDYAFEFIVNNGGLHKEE 216

Query: 215 KYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFY 274
            YPY   +G+C++K        ISGY DVP N+E +L+KA+A+QP+SVAIDASG DFQFY
Sbjct: 217 DYPYLMEEGTCDEKREEMEVVTISGYHDVPRNDEQSLLKALAHQPLSVAIDASGRDFQFY 276

Query: 275 SSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGL 334
           S GVF+G CGT+LDHGV AVGYG++  G  Y +VKNSWG  WGE GY+RM+R+    EGL
Sbjct: 277 SGGVFSGPCGTDLDHGVAAVGYGSS-SGIDYIIVKNSWGPKWGERGYLRMKRNTGKPEGL 335

Query: 335 CGIAMQASYPT 345
           CGI   ASYPT
Sbjct: 336 CGINKMASYPT 346


>gi|449448298|ref|XP_004141903.1| PREDICTED: germination-specific cysteine protease 1-like [Cucumis
           sativus]
 gi|449531757|ref|XP_004172852.1| PREDICTED: germination-specific cysteine protease 1-like [Cucumis
           sativus]
          Length = 365

 Score =  347 bits (889), Expect = 5e-93,   Method: Compositional matrix adjust.
 Identities = 177/340 (52%), Positives = 230/340 (67%), Gaps = 11/340 (3%)

Query: 10  LVLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEY 69
           L L +   L + A     R+  D  + E +++W+A++G+ Y    E+E RF+IFKEN+++
Sbjct: 8   LALLSFFFLSISASALSRRS--DGEVREIYDLWLAKHGKAYNGIDEREKRFQIFKENLKF 65

Query: 70  IASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENASV- 128
           I   N++  N+ YK+G+N FAD TNEE+RA   G  R  P+ R  +    S RY   ++ 
Sbjct: 66  IDDHNSE--NRTYKVGLNMFADLTNEEYRALYLG-TRSPPARRVMKAKTASRRYAVNNLD 122

Query: 129 --PASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTS 186
             P S+DWR +GAV  VK+QG CG CWAFS +AA+EGIN I T +L SLSEQELV CD  
Sbjct: 123 RLPESMDWRTRGAVAPVKNQGSCGSCWAFSTIAAVEGINQIVTGELISLSEQELVSCDKK 182

Query: 187 GEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSN 246
             + GC GGLMD AF+FII N GL TE  YPY+A DG C+    N     I  YEDVP+N
Sbjct: 183 -YNSGCNGGLMDYAFQFIIDNGGLDTEEDYPYEAFDGQCDPTRKNAKVVSIDAYEDVPAN 241

Query: 247 NEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYW 306
           +E +L KAVA+QPVSVAI+ASG   Q Y SGVFTG+CG+ LDHGV AVGYG  ++G  YW
Sbjct: 242 DEESLKKAVAHQPVSVAIEASGLALQLYQSGVFTGKCGSALDHGVVAVGYGK-ENGVDYW 300

Query: 307 LVKNSWGTTWGENGYIRMQRDI-DAKEGLCGIAMQASYPT 345
           LV+NSWGT+WGE+GY +++R++    EG CGIAMQASYP 
Sbjct: 301 LVRNSWGTSWGEDGYFKLERNVKHITEGKCGIAMQASYPV 340


>gi|449524070|ref|XP_004169046.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like, partial
           [Cucumis sativus]
          Length = 314

 Score =  347 bits (889), Expect = 7e-93,   Method: Compositional matrix adjust.
 Identities = 172/311 (55%), Positives = 217/311 (69%), Gaps = 12/311 (3%)

Query: 35  MNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTN 94
           + +R++ WM +YGR Y+   E E RF I++ NV+YI +FN  + N  + L  N FAD TN
Sbjct: 15  IQDRYQKWMDKYGRQYKSREEWERRFTIYQANVQYIDNFN--SMNHSHTLAENNFADLTN 72

Query: 95  EEFRAPRNGYKRRLPSVRSSETTDVSFRYEN-ASVPASIDWRKKGAVTGVKDQGQCGCCW 153
           EEF+A   GYK       +    D  FRY N  ++P ++DWR++GAVT +K+QGQCG CW
Sbjct: 73  EEFKATYLGYK-------TVSIPDTCFRYGNMVNLPTNVDWRQEGAVTPIKNQGQCGSCW 125

Query: 154 AFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATE 213
           AFSAVAA+EGIN I   KL SLSEQELVDCD +  +QGC GG M  AFEFI    GL TE
Sbjct: 126 AFSAVAAVEGINKIKAGKLISLSEQELVDCDVTSGNQGCNGGYMYKAFEFI-KRTGLTTE 184

Query: 214 AKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQF 273
            +YPY+ ++ +CN+++       ISGYE VP N+E +L  AVANQPVSVAIDA G++FQF
Sbjct: 185 IEYPYQGAESACNEQKEKYQFVSISGYEKVPVNDEKSLKAAVANQPVSVAIDAEGNNFQF 244

Query: 274 YSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEG 333
           YS G+F+G CG +L+HGV  VGYG   +   YWLVKNSWGT WGE+GYIRM+RD   K+G
Sbjct: 245 YSGGIFSGNCGNQLNHGVAIVGYGETSN-QAYWLVKNSWGTDWGESGYIRMKRDSTDKQG 303

Query: 334 LCGIAMQASYP 344
            CGIAM ASYP
Sbjct: 304 TCGIAMMASYP 314


>gi|226504984|ref|NP_001151293.1| cysteine protease 1 precursor [Zea mays]
 gi|195645596|gb|ACG42266.1| cysteine protease 1 precursor [Zea mays]
          Length = 340

 Score =  347 bits (889), Expect = 7e-93,   Method: Compositional matrix adjust.
 Identities = 180/312 (57%), Positives = 221/312 (70%), Gaps = 12/312 (3%)

Query: 38  RHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEF 97
           RHE WMA++GR Y+D AEK  R ++F+ N E I SFN  A    ++L  N FAD T +EF
Sbjct: 37  RHEKWMAEHGRAYKDEAEKARRLEVFRANAELIDSFN-AAGTHSHRLATNRFADLTVQEF 95

Query: 98  RAPRNGYKRR-LPSVRSSETTDVSFRYENASVP---ASIDWRKKGAVTGVKDQGQCGCCW 153
           RA R G + R  PS  +       FRYEN S+     S+DWR  GAVTGVKDQG  GCCW
Sbjct: 96  RAARTGLRPRPAPSAGAGR-----FRYENFSLADAAQSVDWRAMGAVTGVKDQGASGCCW 150

Query: 154 AFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATE 213
           AFSAVAA+EG+N I T +L SLSEQELVDCD SG DQGC+GGLMD+AF+F+    GLA+E
Sbjct: 151 AFSAVAAVEGLNKIRTGRLVSLSEQELVDCDVSGVDQGCDGGLMDNAFQFVARRGGLASE 210

Query: 214 AKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQF 273
           + YPY+  DG C +  A  +AA I G+EDVP NNEAAL  AVA+QPVSVAI+     F+F
Sbjct: 211 SGYPYQCRDGPC-RSSAAAAAASIRGHEDVPRNNEAALAAAVAHQPVSVAINGEDMAFRF 269

Query: 274 YSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEG 333
           Y SGV  G CGT+L+H +TAVGYGTA DGT+YWL+KNSWG +WGE GY+R++R +   EG
Sbjct: 270 YDSGVLGGACGTDLNHAITAVGYGTAADGTRYWLMKNSWGASWGEGGYVRIRRGVRG-EG 328

Query: 334 LCGIAMQASYPT 345
           +CG+A   SYP 
Sbjct: 329 VCGLAKLPSYPV 340


>gi|242077600|ref|XP_002448736.1| hypothetical protein SORBIDRAFT_06g032320 [Sorghum bicolor]
 gi|241939919|gb|EES13064.1| hypothetical protein SORBIDRAFT_06g032320 [Sorghum bicolor]
          Length = 467

 Score =  347 bits (889), Expect = 7e-93,   Method: Compositional matrix adjust.
 Identities = 167/316 (52%), Positives = 220/316 (69%), Gaps = 5/316 (1%)

Query: 32  DATMNERHEMWMAQYGR-VYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFA 90
           +A +   +E+W+ ++GR V     E + RF++F +N+ ++ + N +A    ++LG+N+FA
Sbjct: 49  EAEVRAMYELWLVEHGRRVSNVLGEHDSRFRVFWDNLRFVDAHNERAGEHGFRLGMNQFA 108

Query: 91  DQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENAS-VPASIDWRKKGAVTGVKDQGQC 149
           D TN+EFRA   G   R+P+ RS       +R++ A  +P S+DWR+KGAV  VK+QGQC
Sbjct: 109 DLTNDEFRAAYLG--ARIPAARSGNAVGEMYRHDGAEELPESVDWREKGAVAPVKNQGQC 166

Query: 150 GCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKG 209
           G CWAFSAV+++E IN I T ++ +LSEQELV+C T G + GC GGLMD AF FII N G
Sbjct: 167 GSCWAFSAVSSVESINQIVTGEMVTLSEQELVECSTDGGNSGCNGGLMDAAFNFIIKNGG 226

Query: 210 LATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGS 269
           + TE  YPYKA DG C+    N     I  +EDVP N+E +L KAVA+QPVSVAI+A G 
Sbjct: 227 IDTEDDYPYKAVDGKCDINRRNAKVVSIDAFEDVPENDEKSLQKAVAHQPVSVAIEAGGR 286

Query: 270 DFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDID 329
            FQ Y SGVF+G C T LDHGV AVGYGT ++G  YW+V+NSWG  WGE GYIRM+R+I+
Sbjct: 287 QFQLYKSGVFSGSCTTNLDHGVVAVGYGT-ENGKDYWIVRNSWGPKWGEAGYIRMERNIN 345

Query: 330 AKEGLCGIAMQASYPT 345
           A  G CGIAM ASYPT
Sbjct: 346 ATTGKCGIAMMASYPT 361


>gi|148927394|gb|ABR19828.1| cysteine proteinase [Elaeis guineensis]
          Length = 469

 Score =  347 bits (889), Expect = 7e-93,   Method: Compositional matrix adjust.
 Identities = 179/319 (56%), Positives = 219/319 (68%), Gaps = 8/319 (2%)

Query: 31  NDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPY--KLGINE 88
           +D  ++  ++ W AQ+ R Y    E E R +IF++N+ +I   N  A    Y  +LG+  
Sbjct: 39  SDDEVHRLYQAWKAQHARSYNALDEDEQRLEIFRDNLRFIDQHNAAANAGKYSFRLGLTR 98

Query: 89  FADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENAS---VPASIDWRKKGAVTGVKD 145
           FAD TNEE+R+   G  R   S R   +T  S RY   S   +P SIDWR KGAV  VKD
Sbjct: 99  FADLTNEEYRSTYLGV-RTAGSRRRRNSTVGSNRYRFRSSDDLPDSIDWRDKGAVVDVKD 157

Query: 146 QGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFII 205
           QG CG CWAFS +AA+EGINHI T  L SLSEQELVDCDT   +QGC GGLMD AFEFII
Sbjct: 158 QGSCGSCWAFSTIAAVEGINHIVTGDLISLSEQELVDCDTY-YNQGCNGGLMDYAFEFII 216

Query: 206 SNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAID 265
           SN G+ T+  YPY   DGSC++   N     I  YEDVP N+E +L KAVANQPVSVAI+
Sbjct: 217 SNGGIDTDEDYPYTGRDGSCDQYRKNAHVVTIDSYEDVPINDEKSLQKAVANQPVSVAIE 276

Query: 266 ASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQ 325
           A G  FQ Y SG+FTG CGTELDHGVTA+GYG+ ++G  YW+VKNSWG+ WGE+GYIRM+
Sbjct: 277 AGGRAFQLYESGIFTGYCGTELDHGVTAIGYGS-ENGKYYWIVKNSWGSDWGESGYIRME 335

Query: 326 RDIDAKEGLCGIAMQASYP 344
           R+I++  G CGIAM+ASYP
Sbjct: 336 RNINSATGKCGIAMEASYP 354


>gi|37780039|gb|AAP32192.1| cysteine protease 14 [Trifolium repens]
          Length = 351

 Score =  346 bits (888), Expect = 7e-93,   Method: Compositional matrix adjust.
 Identities = 169/309 (54%), Positives = 215/309 (69%), Gaps = 5/309 (1%)

Query: 37  ERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEE 96
           E  E WM+++G++Y    EK +RF++FK+N+++I   N    N  Y LG+NEFAD +++E
Sbjct: 45  ELFESWMSRHGKIYETIEEKLLRFEVFKDNLKHIDDRNKIVSN--YWLGLNEFADLSHQE 102

Query: 97  FRAPRNGYKRRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQCGCCWAFS 156
           F+    G K  L   R S + +  F Y +  +P S+DWRKKGAVT VK+QGQCG CWAFS
Sbjct: 103 FKNKYLGLKVDLSQRRES-SNEEEFTYRDVDLPKSVDWRKKGAVTPVKNQGQCGSCWAFS 161

Query: 157 AVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKY 216
            VAA+EGIN I T  LTSLSEQEL+DCDT+  + GC GGLMD AF FI  N GL  E  Y
Sbjct: 162 TVAAVEGINQIVTGNLTSLSEQELIDCDTT-YNNGCNGGLMDYAFSFIGQNGGLHKEEDY 220

Query: 217 PYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSS 276
           PY   + +C  K+       I+GY DVP NNE +L+KA+ANQP+SVAI+AS  DFQFYS 
Sbjct: 221 PYIMEESTCEMKKEETQVVTINGYHDVPQNNEQSLLKALANQPLSVAIEASSRDFQFYSG 280

Query: 277 GVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCG 336
           GVF G CG++LDHGV+AVGYGT+ +   Y +VKNSWG  WGE G+IRM+RDI   EG+CG
Sbjct: 281 GVFDGHCGSDLDHGVSAVGYGTSKN-LDYIIVKNSWGAKWGEKGFIRMKRDIGKPEGICG 339

Query: 337 IAMQASYPT 345
           +   ASYPT
Sbjct: 340 LYKMASYPT 348


>gi|297598407|ref|NP_001045533.2| Os01g0971400 [Oryza sativa Japonica Group]
 gi|15289977|dbj|BAB63672.1| putative cysteine protease CP1 [Oryza sativa Japonica Group]
 gi|125529282|gb|EAY77396.1| hypothetical protein OsI_05384 [Oryza sativa Indica Group]
 gi|125573472|gb|EAZ14987.1| hypothetical protein OsJ_04922 [Oryza sativa Japonica Group]
 gi|215740756|dbj|BAG97412.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|215741010|dbj|BAG97505.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|215765325|dbj|BAG87022.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|215767338|dbj|BAG99566.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|255674119|dbj|BAF07447.2| Os01g0971400 [Oryza sativa Japonica Group]
          Length = 365

 Score =  346 bits (888), Expect = 8e-93,   Method: Compositional matrix adjust.
 Identities = 182/321 (56%), Positives = 218/321 (67%), Gaps = 16/321 (4%)

Query: 35  MNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTN 94
           + E  E +MA+Y + Y    EK  RF++FK+N+ +I   N K     Y LG+NEFAD T+
Sbjct: 48  LMELFEKFMAKYRKAYSSLEEKLRRFEVFKDNLNHIDEENKKITG--YWLGLNEFADLTH 105

Query: 95  EEFRAPRNGYKRRLPSVRSSETTDVSFRYEN---ASVPASIDWRKKGAVTGVKDQGQCGC 151
           +EF+A   G     P+ R+S   D  FRYE    AS+P  +DWRKKGAVT VK+QGQCG 
Sbjct: 106 DEFKAAYLGLTL-TPARRNS--NDQLFRYEEVEAASLPKEVDWRKKGAVTEVKNQGQCGS 162

Query: 152 CWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLA 211
           CWAFS VAA+EGIN I T  LT LSEQEL+DCDT G + GC GGLMD AF +I +N GL 
Sbjct: 163 CWAFSTVAAVEGINAIVTGNLTRLSEQELIDCDTDG-NNGCSGGLMDYAFSYIAANGGLH 221

Query: 212 TEAKYPYKASDGSCNKKEAN-------PSAAKISGYEDVPSNNEAALMKAVANQPVSVAI 264
           TE  YPY   +G+C +            +A  ISGYEDVP NNE AL+KA+A+QPVSVAI
Sbjct: 222 TEESYPYLMEEGTCRRGSTEGDDDGEAAAAVTISGYEDVPRNNEQALLKALAHQPVSVAI 281

Query: 265 DASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRM 324
           +ASG +FQFYS GVF G CGT LDHGVTAVGYGTA  G  Y +VKNSWG+ WGE GYIRM
Sbjct: 282 EASGRNFQFYSGGVFDGPCGTRLDHGVTAVGYGTASKGHDYIIVKNSWGSHWGEKGYIRM 341

Query: 325 QRDIDAKEGLCGIAMQASYPT 345
           +R     +GLCGI   ASYPT
Sbjct: 342 RRGTGKHDGLCGINKMASYPT 362


>gi|350538043|ref|NP_001234324.1| cysteine protease TDI-65 precursor [Solanum lycopersicum]
 gi|5726641|gb|AAD48496.1|AF172856_1 cysteine protease TDI-65 [Solanum lycopersicum]
 gi|2828252|emb|CAA05894.1| CYP1 [Solanum lycopersicum]
          Length = 466

 Score =  346 bits (888), Expect = 8e-93,   Method: Compositional matrix adjust.
 Identities = 169/315 (53%), Positives = 218/315 (69%), Gaps = 4/315 (1%)

Query: 32  DATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFAD 91
           D  ++  +E W+ ++G+ Y    EK+ RF+IFK+N+ YI    N   N+ YKLG+ +FAD
Sbjct: 42  DDEVSALYESWLIEHGKSYNALGEKDKRFQIFKDNLRYIDE-QNSVPNQSYKLGLTKFAD 100

Query: 92  QTNEEFRAPRNGYKRRLPSVRSSET-TDVSFRYENASVPASIDWRKKGAVTGVKDQGQCG 150
            TNEE+R+   G K      + S+  +D        S+P SIDWR+KG + GVKDQG CG
Sbjct: 101 LTNEEYRSIYLGTKSSGDRKKLSKNKSDRYLPKVGDSLPESIDWREKGVLVGVKDQGSCG 160

Query: 151 CCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGL 210
            CWAFSAVAAME IN I T  L SLSEQELVDCD S  ++GC+GGLMD AFEF+I N G+
Sbjct: 161 SCWAFSAVAAMESINAIVTGNLISLSEQELVDCDRS-YNEGCDGGLMDYAFEFVIKNGGI 219

Query: 211 ATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSD 270
            TE  YPYK  +G C++   N    KI  YEDVP NNE AL KAVA+QPVS+A++A G D
Sbjct: 220 DTEEDYPYKERNGVCDQYRKNAKVVKIDSYEDVPVNNEKALQKAVAHQPVSIALEAGGRD 279

Query: 271 FQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDA 330
           FQ Y SG+FTG+CGT +DHGV   GYGT ++G  YW+V+NSWG  WGENGY+R+QR++ +
Sbjct: 280 FQHYKSGIFTGKCGTAVDHGVVIAGYGT-ENGMDYWIVRNSWGANWGENGYLRVQRNVAS 338

Query: 331 KEGLCGIAMQASYPT 345
             GLCG+A++ SYP 
Sbjct: 339 SSGLCGLAIEPSYPV 353


>gi|297802418|ref|XP_002869093.1| hypothetical protein ARALYDRAFT_491113 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297314929|gb|EFH45352.1| hypothetical protein ARALYDRAFT_491113 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 355

 Score =  346 bits (888), Expect = 9e-93,   Method: Compositional matrix adjust.
 Identities = 171/310 (55%), Positives = 219/310 (70%), Gaps = 7/310 (2%)

Query: 37  ERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEE 96
           E  E WM+++ +VY+   EK  RF++F+EN+ +I   NN+  +  Y LG+NEFAD T+EE
Sbjct: 49  ELFESWMSEHSKVYKSVEEKVHRFEVFRENLMHIDQRNNEINS--YWLGLNEFADLTHEE 106

Query: 97  FRAPRNGYKRRLPSVRSSETTDVSFRYENAS-VPASIDWRKKGAVTGVKDQGQCGCCWAF 155
           F+    G  +  P          +FRY + + +P S+DWRKKGAV  VKDQGQCG CWAF
Sbjct: 107 FKGRYLGLAK--PQFSRKRQPSANFRYRDITDLPKSVDWRKKGAVAPVKDQGQCGSCWAF 164

Query: 156 SAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAK 215
           S VAA+EGIN ITT  L+SLSEQEL+DCDT+  + GC GGLMD AF++IIS  GL  E  
Sbjct: 165 STVAAVEGINQITTGNLSSLSEQELIDCDTTF-NSGCNGGLMDYAFQYIISTGGLHKEDD 223

Query: 216 YPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYS 275
           YPY   +G C +++ +     ISGYEDVP N++ +L+KA+A+QPVSVAI+ASG DFQFY 
Sbjct: 224 YPYLMEEGICQEQKEDVERVTISGYEDVPENDDESLVKALAHQPVSVAIEASGRDFQFYK 283

Query: 276 SGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLC 335
            GVF GQCGT+LDHGV AVGYG++  G+ Y +VKNSWG  WGE G+IRM+R+    EGLC
Sbjct: 284 GGVFNGQCGTDLDHGVAAVGYGSS-KGSDYVIVKNSWGPRWGEKGFIRMKRNTGKPEGLC 342

Query: 336 GIAMQASYPT 345
           GI   ASYPT
Sbjct: 343 GINKMASYPT 352


>gi|225428879|ref|XP_002285299.1| PREDICTED: cysteine proteinase RD21a-like [Vitis vinifera]
          Length = 469

 Score =  346 bits (887), Expect = 1e-92,   Method: Compositional matrix adjust.
 Identities = 175/317 (55%), Positives = 222/317 (70%), Gaps = 10/317 (3%)

Query: 32  DATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFAD 91
           DA +   +E W+ ++G+ Y    E+E RF+IFK+N+ +I   N  A N+ YK+G+N FAD
Sbjct: 47  DAEVMAVYEAWLVKHGKSYNALGERERRFEIFKDNLRFIEEHN--AVNRTYKVGLNRFAD 104

Query: 92  QTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYE---NASVPASIDWRKKGAVTGVKDQGQ 148
            TNEE+R+   G  RR  + R    + VS RY       +P S+DWR+KGAV  VKDQG 
Sbjct: 105 LTNEEYRSRYLG--RRDETRRGLRASRVSDRYSFRAGEDLPESVDWREKGAVVPVKDQGN 162

Query: 149 CGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNK 208
           CG CWAFS +AA+EGIN I T  L SLSEQELVDCD S  +QGC GGLMD AFEFII+N 
Sbjct: 163 CGSCWAFSTIAAVEGINQIATGDLISLSEQELVDCDKS-YNQGCNGGLMDYAFEFIINNG 221

Query: 209 GLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASG 268
           G+ +E  YPY+A+D +C+    N     I GYEDVP N+E +L KAVANQPVSVAI+A G
Sbjct: 222 GIDSEEDYPYRAADTTCDPNRKNARVVSIDGYEDVPQNDERSLKKAVANQPVSVAIEAGG 281

Query: 269 SDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDI 328
             FQ Y SGVFTGQCGT+LDHGV AVGYGT ++   YW+V+NSWG  WGE+GYI+++R++
Sbjct: 282 RAFQLYQSGVFTGQCGTQLDHGVVAVGYGT-ENSVDYWIVRNSWGPNWGESGYIKLERNL 340

Query: 329 DAKE-GLCGIAMQASYP 344
              E G CGIA++ SYP
Sbjct: 341 AGTETGKCGIAIEPSYP 357


>gi|2511689|emb|CAB17074.1| cysteine proteinase precursor [Phaseolus vulgaris]
          Length = 364

 Score =  346 bits (887), Expect = 1e-92,   Method: Compositional matrix adjust.
 Identities = 176/345 (51%), Positives = 232/345 (67%), Gaps = 16/345 (4%)

Query: 10  LVLAAILVLGVWAPQSWSRTLNDATMNERHEM---WMAQYGRVYRDNAEKEMRFKIFKEN 66
           +++  +L+L      + + ++ + + NE  +M   W+ ++ +VY    EKE RF++FK+N
Sbjct: 4   MLIPTLLLLSFTFSHATAMSIINYSENEVMDMYEEWLVKHRKVYNGLDEKEKRFQVFKDN 63

Query: 67  VEYIASFNNKARNKPYKLGINEFADQTNEEFRA----PRNGYKRRLPSVRSSETTDVSFR 122
           + +I   N  A+N  Y LG+N+FAD TNEE+RA     R   KRR   V  ++ T   + 
Sbjct: 64  LGFIQDHN--AQNNTYTLGLNKFADITNEEYRAMYLGTRTDAKRR---VMKTQNTGHRYA 118

Query: 123 YENA-SVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELV 181
           Y +   +P  +DWR KGAV  +KDQG CG CWAFS VAA+EGIN+I T +  SLSEQELV
Sbjct: 119 YNSGDQLPVHVDWRLKGAVGPIKDQGNCGSCWAFSTVAAVEGINNIVTGEFVSLSEQELV 178

Query: 182 DCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYE 241
           DCD    D+GC GGLMD AF+FII N G+ TE  YPY+  DG+C++ +      +I GYE
Sbjct: 179 DCDRE-YDEGCNGGLMDYAFQFIIQNGGIDTEEDYPYQGIDGTCDQTKKKTKVVQIDGYE 237

Query: 242 DVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADD 301
           DVPSNNE AL KAV++QPVSVAI+ASG   Q Y SGVFTG+CGT LDHGV  VGYGT ++
Sbjct: 238 DVPSNNENALKKAVSHQPVSVAIEASGRALQLYQSGVFTGKCGTALDHGVVVVGYGT-EN 296

Query: 302 GTKYWLVKNSWGTTWGENGYIRMQRDI-DAKEGLCGIAMQASYPT 345
           G  YWLV+NSWGT WGE+GY +M+R++    EG CGIAM  SYP 
Sbjct: 297 GVDYWLVRNSWGTGWGEDGYFKMERNVRSTSEGKCGIAMDCSYPV 341


>gi|46401612|dbj|BAD16614.1| cysteine proteinase [Dianthus caryophyllus]
          Length = 459

 Score =  345 bits (886), Expect = 1e-92,   Method: Compositional matrix adjust.
 Identities = 172/316 (54%), Positives = 221/316 (69%), Gaps = 8/316 (2%)

Query: 32  DATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFAD 91
           D  +   +E W+ ++G+ Y    EK++RF IFK+N+ ++   N++  N  +KLG+N FAD
Sbjct: 36  DDEIASLYETWLVKHGKNYNGLGEKQLRFNIFKDNLRFVDERNSE--NLSFKLGLNRFAD 93

Query: 92  QTNEEFRAPRNGYKRRLPSV-RS--SETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQ 148
            TNEE+R+   G + R  +V RS  S++   +FR  + ++P S+DWRKKGAV G+KDQG 
Sbjct: 94  LTNEEYRSVYLGTRPRSVAVARSGRSKSDRYAFRAGD-TLPESVDWRKKGAVAGIKDQGS 152

Query: 149 CGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNK 208
           CG CWAFSA+AA+EG+N I T  L SLSEQELV+CDTS  D GC+GGLMD AFEFII N+
Sbjct: 153 CGSCWAFSAIAAVEGVNQIVTGDLISLSEQELVECDTSYND-GCDGGLMDYAFEFIIKNE 211

Query: 209 GLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASG 268
           G+ ++  YPY   DG C+    N     I  YED P  +E +L KAVANQPVSVAI+  G
Sbjct: 212 GIDSDEDYPYTGRDGRCDTNRKNAKVVTIDDYEDSPVYDEKSLQKAVANQPVSVAIEGGG 271

Query: 269 SDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDI 328
            DFQ Y SGVFTG+CGT LDHGV  VGYGT +DG  YW+V+NSWG TWGE GYIRMQR+ 
Sbjct: 272 RDFQLYDSGVFTGKCGTALDHGVAVVGYGT-EDGLDYWIVRNSWGDTWGEGGYIRMQRNT 330

Query: 329 DAKEGLCGIAMQASYP 344
               G+CGIA++ SYP
Sbjct: 331 KLPSGICGIAIEPSYP 346


>gi|355344587|gb|AER60490.1| cysteine proteases [Gossypium hirsutum]
          Length = 371

 Score =  345 bits (886), Expect = 1e-92,   Method: Compositional matrix adjust.
 Identities = 178/335 (53%), Positives = 224/335 (66%), Gaps = 8/335 (2%)

Query: 14  AILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASF 73
           + L L    P S S   +D  M   ++ W+ Q+G+ Y    E+E RF+IFK+N+ +I   
Sbjct: 21  STLTLNQNHPSSSSWRSDDEVMG-LYKSWVIQHGKAYNGIGEEEKRFEIFKDNLRFIDEH 79

Query: 74  NNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENAS---VPA 130
           N+   N  YKLG+N+FAD TN+E+RA   G  R  P  R  ++   S RY + +   +P 
Sbjct: 80  NSN-NNTTYKLGLNKFADLTNQEYRAKFLG-TRTDPRRRLMKSKIPSSRYAHRAGDNLPD 137

Query: 131 SIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQ 190
           S+DWR  GAV+ VKDQG CG CWAFS +A +EGIN I + +L SLSEQELVDCD S  D 
Sbjct: 138 SVDWRDHGAVSPVKDQGSCGSCWAFSTIATVEGINKIVSGELVSLSEQELVDCDRS-YDA 196

Query: 191 GCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAA 250
           GC GGLMD AF+FI+ N G+ TE  YPY   +  C+  + N     I GYEDVP NNE A
Sbjct: 197 GCNGGLMDYAFQFIMDNGGIDTEKDYPYLGFNNQCDPTKKNAKVVSIDGYEDVP-NNENA 255

Query: 251 LMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKN 310
           L KAVA+QPVS+AI+A G  FQ Y SGVF G+CG  LDHGV AVGYGT D+G  YW+V+N
Sbjct: 256 LKKAVAHQPVSIAIEAGGRAFQLYESGVFNGECGLALDHGVVAVGYGTDDNGQDYWIVRN 315

Query: 311 SWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPT 345
           SWG+ WGENGYIRM+R+I+A  G CGIAM+ASYP 
Sbjct: 316 SWGSNWGENGYIRMERNINANTGKCGIAMEASYPV 350


>gi|356545079|ref|XP_003540973.1| PREDICTED: LOW QUALITY PROTEIN: vignain-like [Glycine max]
          Length = 330

 Score =  345 bits (886), Expect = 1e-92,   Method: Compositional matrix adjust.
 Identities = 184/306 (60%), Positives = 209/306 (68%), Gaps = 11/306 (3%)

Query: 18  LGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKA 77
           +   A Q   RTL DA+M ERHE WM++YG+VY+D  E+E RF+IFKEN+ YI + NN A
Sbjct: 1   MAFLASQVTCRTLQDASMYERHEEWMSRYGKVYKDPREREKRFRIFKENMNYIETSNNVA 60

Query: 78  RNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENASVPASIDWRKK 137
             KP KL IN+FAD  NEEF APRN +K  +     S      F       P      KK
Sbjct: 61  I-KPXKLVINQFADLNNEEFIAPRNIFKGMILCRFLSRKHTFPF-------PYVFLGHKK 112

Query: 138 GAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLM 197
           GAVT VKDQG CG CWAF  VA+ EGI  +T  KL SLSEQELVDCDT G DQGCE GLM
Sbjct: 113 GAVTPVKDQGHCGFCWAFYDVASTEGILALTAGKLISLSEQELVDCDTKGVDQGCECGLM 172

Query: 198 DDAFEFIISNKGLATEAKYPYKASDGSCN-KKEANPSAAKISGYEDVPSNNEAALMKAVA 256
           DDAF+FII N G+  +A YPYK  DG CN  +EANP AA I+G EDVP+NNE AL K VA
Sbjct: 173 DDAFKFIIQNHGVX-DANYPYKGVDGKCNANEEANP-AATITGXEDVPANNEKALQKVVA 230

Query: 257 NQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTW 316
           NQPV VAIDA  SDFQFY SGVFTG C TEL+HGVT +GYG + DGT+YWLVKNS  T W
Sbjct: 231 NQPVFVAIDACDSDFQFYKSGVFTGSCETELNHGVTTMGYGVSHDGTQYWLVKNSXETEW 290

Query: 317 GENGYI 322
             N  I
Sbjct: 291 NPNRAI 296


>gi|600111|emb|CAA84378.1| cysteine proteinase [Vicia sativa]
          Length = 359

 Score =  345 bits (885), Expect = 2e-92,   Method: Compositional matrix adjust.
 Identities = 183/310 (59%), Positives = 220/310 (70%), Gaps = 12/310 (3%)

Query: 39  HEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFR 98
           +E W + +  V R+  EK  RF +FK NV ++   N    +KPYKL +N+F D TN EFR
Sbjct: 40  YERWRSHH-TVTRNLDEKHNRFNVFKANVMHV--HNTNKLDKPYKLKLNKFGDMTNYEFR 96

Query: 99  ---APRNGYKRRLPSVRSSETTDVSFRYENA-SVPASIDWRKKGAVTGVKDQGQCGCCWA 154
              A       R+   R     + +F YENA  VP+SIDWR KGAVTGVKDQGQCG CWA
Sbjct: 97  RIYADSKISHHRM--FRGMSHENGTFMYENAVDVPSSIDWRNKGAVTGVKDQGQCGSCWA 154

Query: 155 FSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEA 214
           FS +AA+EGIN I T+KL SLSEQ+LVDCDT  E++GC GGLM+ AFEFI  N G+ TE+
Sbjct: 155 FSTIAAVEGINQIKTQKLVSLSEQQLVDCDTE-ENEGCNGGLMEYAFEFIKQN-GITTES 212

Query: 215 KYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFY 274
            YPY A DG+C+  E    A  I G+E+VP NNEAAL+KA A QPVSVAIDA G +FQFY
Sbjct: 213 NYPYAAKDGTCDV-EKEDKAVSIDGHENVPINNEAALLKAAAKQPVSVAIDAGGYNFQFY 271

Query: 275 SSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGL 334
           S GVFTG C T+L+HGV  VGYG   D TKYW++KNSWG+ WGE GYIRMQR I ++EGL
Sbjct: 272 SEGVFTGHCDTDLNHGVAIVGYGVTQDRTKYWIMKNSWGSEWGEQGYIRMQRGISSREGL 331

Query: 335 CGIAMQASYP 344
           CGIAM+ASYP
Sbjct: 332 CGIAMEASYP 341


>gi|413951605|gb|AFW84254.1| hypothetical protein ZEAMMB73_933931 [Zea mays]
          Length = 423

 Score =  345 bits (885), Expect = 2e-92,   Method: Compositional matrix adjust.
 Identities = 176/322 (54%), Positives = 221/322 (68%), Gaps = 13/322 (4%)

Query: 31  NDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFA 90
           +D  + + +E W   + RV+R + EK  RF  FKENV +I + N +  ++PY+L +N F 
Sbjct: 80  SDEALWDLYERWQTHH-RVHRHHGEKGRRFGTFKENVRFIHAHNKRG-DRPYRLRLNRFG 137

Query: 91  DQTNEEFRA----PRNGYKRRLPSVRSSETTDVSFRYENAS-VPASIDWRKKGAVTGVKD 145
           D   EEFR+     R    RR  S  +       F Y++A+  P S+DWR++GAVTGVKD
Sbjct: 138 DMGREEFRSTFADSRINDLRRQDSPAARAGAVPGFMYDSAADPPRSVDWRQEGAVTGVKD 197

Query: 146 QGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFII 205
           QG CG CWAFS V A+EGIN I T  L SLSEQEL+DCDT  ++ GC+GGLM++AFEFI 
Sbjct: 198 QGHCGSCWAFSTVVAVEGINAIRTGSLASLSEQELIDCDT--DENGCQGGLMENAFEFIK 255

Query: 206 SNKGLATEAKYPYKASDGSCNKKEAN---PSAAKISGYEDVPSNNEAALMKAVANQPVSV 262
           S  G+ TEA YPY+AS+G+C+   A         I G++ VP+ +E AL KAVA+QPVSV
Sbjct: 256 SFGGITTEAAYPYRASNGTCDGDRARRGGGVVVVIDGHQMVPAGSEDALAKAVAHQPVSV 315

Query: 263 AIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYI 322
           A+DA G  FQFYS GVFTG CGT+LDHGV AVGYG  DDGT YW+VKNSWGT+WGE GYI
Sbjct: 316 AVDAGGQAFQFYSEGVFTGDCGTDLDHGVAAVGYGVGDDGTPYWIVKNSWGTSWGEGGYI 375

Query: 323 RMQRDIDAKEGLCGIAMQASYP 344
           RMQR      GLCGIAM+AS+P
Sbjct: 376 RMQRGA-GNGGLCGIAMEASFP 396


>gi|413953665|gb|AFW86314.1| hypothetical protein ZEAMMB73_546353 [Zea mays]
          Length = 233

 Score =  345 bits (885), Expect = 2e-92,   Method: Compositional matrix adjust.
 Identities = 162/228 (71%), Positives = 184/228 (80%), Gaps = 5/228 (2%)

Query: 121 FRYENAS---VPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSE 177
           FRYEN S   +P +IDWR KGAVT +KDQGQCGCCWAFSAVAA EGI  I+T KL SL+E
Sbjct: 7   FRYENVSADALPTTIDWRTKGAVTPIKDQGQCGCCWAFSAVAATEGIVKISTGKLVSLAE 66

Query: 178 QELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKI 237
           QELVDCD   EDQGCEGGLMDDAF+FII N GL TE+ YPY A+DG C  K  + SAA I
Sbjct: 67  QELVDCDVHDEDQGCEGGLMDDAFKFIIKNGGLTTESSYPYTAADGKC--KSGSNSAATI 124

Query: 238 SGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYG 297
            GYEDVP+N+EAALMKAVANQPVSVA+D     FQFYS GV TG CGT+LDHG+ A+GYG
Sbjct: 125 KGYEDVPANDEAALMKAVANQPVSVAVDGGDMTFQFYSGGVMTGSCGTDLDHGIAAIGYG 184

Query: 298 TADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPT 345
              DGTKYWL+KNSWGTTWGENGY+RM++DI  K G+CG+AM+ SYPT
Sbjct: 185 KTSDGTKYWLMKNSWGTTWGENGYLRMEKDISDKRGMCGLAMEPSYPT 232


>gi|50355611|dbj|BAD29954.1| cysteine protease [Daucus carota]
          Length = 474

 Score =  345 bits (885), Expect = 2e-92,   Method: Compositional matrix adjust.
 Identities = 175/311 (56%), Positives = 220/311 (70%), Gaps = 8/311 (2%)

Query: 39  HEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFR 98
           +E W+ ++ + Y    EKE RF IFK+NV ++   +N  RN+ YKLG+N+FAD TN+E+R
Sbjct: 60  YESWLVKHHKNYNALGEKETRFGIFKDNVGFVDR-HNSMRNQSYKLGLNKFADLTNDEYR 118

Query: 99  APRNGYKRRLPSVRSSETTDVSFRY---ENASVPASIDWRKKGAVTGVKDQGQCGCCWAF 155
           +     K  +   R +E    S R+   +   +P S+DWR +GAV  VKDQGQCG CWAF
Sbjct: 119 SLYLSGKM-MKRERKNEDGFRSDRFVFEDGDHLPESVDWRDRGAVAPVKDQGQCGSCWAF 177

Query: 156 SAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAK 215
           S V A+EGIN I T +L SLSEQELVDCD +G +QGC GGLMD AFEFI+ N G+ TE  
Sbjct: 178 STVGAVEGINKIVTGELISLSEQELVDCD-NGYNQGCNGGLMDYAFEFIVKNGGIDTEDD 236

Query: 216 YPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYS 275
           YPYK  DG C++   N     I+GYEDVP N+E +L KAVA+QPVSVAI+A G  FQ Y 
Sbjct: 237 YPYKGVDGLCDQNRKNAKVVTINGYEDVPHNDEKSLKKAVAHQPVSVAIEAGGRAFQLYE 296

Query: 276 SGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDI-DAKEGL 334
           SGVFTGQCGTELDHGV AVGYG+ ++G  YW+V+NSWG  WGE+GYIR++R++     G 
Sbjct: 297 SGVFTGQCGTELDHGVVAVGYGS-ENGKDYWIVRNSWGPDWGESGYIRLERNVASTSTGK 355

Query: 335 CGIAMQASYPT 345
           CGIAMQASYPT
Sbjct: 356 CGIAMQASYPT 366


>gi|194352754|emb|CAQ00105.1| papain-like cysteine proteinase [Hordeum vulgare subsp. vulgare]
 gi|326513690|dbj|BAJ87864.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326514532|dbj|BAJ96253.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 463

 Score =  345 bits (885), Expect = 2e-92,   Method: Compositional matrix adjust.
 Identities = 173/309 (55%), Positives = 214/309 (69%), Gaps = 12/309 (3%)

Query: 42  WMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARN--KPYKLGINEFADQTNEEFRA 99
           WMA++G  Y    E+E RF+ F++N+ YI   N  A      ++LG+N FAD TNEE+R+
Sbjct: 46  WMAEHGSTYNAIGEEERRFEAFRDNLRYIDQHNAAADAGVHSFRLGLNRFADLTNEEYRS 105

Query: 100 PRNGYKRRLPSVRSSETTDVSFRYE---NASVPASIDWRKKGAVTGVKDQGQCGCCWAFS 156
              G + +    R      +S RY+   N  +P S+DWRKKGAV  VKDQG CG CWAFS
Sbjct: 106 TYLGARTKPDRERK-----LSARYQAADNDELPESVDWRKKGAVGAVKDQGGCGSCWAFS 160

Query: 157 AVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKY 216
           A+AA+EGIN I T  +  LSEQELVDCDTS  +QGC GGLMD AFEFII+N G+ +E  Y
Sbjct: 161 AIAAVEGINQIVTGDMIPLSEQELVDCDTS-YNQGCNGGLMDYAFEFIINNGGIDSEEDY 219

Query: 217 PYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSS 276
           PYK  D  C+  + N     I GYEDVP N+E +L KAVANQP+SVAI+A G  FQ Y S
Sbjct: 220 PYKERDNRCDANKKNAKVVTIDGYEDVPVNSEKSLQKAVANQPISVAIEAGGRAFQLYKS 279

Query: 277 GVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCG 336
           G+FTG CGT LDHGV AVGYGT ++G  YWLV+NSWG+ WGE+GYIRM+R+I A  G CG
Sbjct: 280 GIFTGTCGTALDHGVAAVGYGT-ENGKDYWLVRNSWGSVWGEDGYIRMERNIKASSGKCG 338

Query: 337 IAMQASYPT 345
           IA++ SYPT
Sbjct: 339 IAVEPSYPT 347


>gi|168057475|ref|XP_001780740.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162667829|gb|EDQ54449.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 463

 Score =  345 bits (885), Expect = 2e-92,   Method: Compositional matrix adjust.
 Identities = 175/315 (55%), Positives = 218/315 (69%), Gaps = 10/315 (3%)

Query: 31  NDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFA 90
           +DA ++  H+ W+  + RVYR  +EK  RF+IFKEN  YI + N +   K Y LG+N+F+
Sbjct: 42  DDAILDVFHQ-WLETHSRVYRSLSEKHHRFQIFKENFLYIHAHNKQ--QKSYWLGLNKFS 98

Query: 91  DQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQCG 150
           D T++EFRA   G K   P  R  +  + +F YE+      +DWR KGAVT VKDQG CG
Sbjct: 99  DLTHQEFRAQYLGTK---PVNR--QRKEANFMYEDVEAEPKVDWRLKGAVTDVKDQGACG 153

Query: 151 CCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGL 210
            CWAFSAV ++EG+N I T +L SLSEQELVDCD   ++QGC GGLMD AFEFII N G+
Sbjct: 154 SCWAFSAVGSVEGVNAIKTGELVSLSEQELVDCDRK-QNQGCNGGLMDYAFEFIIKNGGI 212

Query: 211 ATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSD 270
            TE  YPYKA DG C++   N     I  Y+DVP+ +E+ALMKA+   PVSVAI+A G D
Sbjct: 213 DTEKDYPYKARDGRCDEGRRNSKVVVIDDYQDVPTQSESALMKALTKNPVSVAIEAGGRD 272

Query: 271 FQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQR-DID 329
           FQ Y  GVFTG CG+ELDHGV AVGYGT DDG  YW+VKNSWG  WGE GYIRM+R   D
Sbjct: 273 FQHYQGGVFTGPCGSELDHGVLAVGYGTDDDGVNYWIVKNSWGPGWGEKGYIRMERFGSD 332

Query: 330 AKEGLCGIAMQASYP 344
           + +G CGI ++AS+P
Sbjct: 333 STDGKCGINIEASFP 347


>gi|255544115|ref|XP_002513120.1| cysteine protease, putative [Ricinus communis]
 gi|223548131|gb|EEF49623.1| cysteine protease, putative [Ricinus communis]
          Length = 362

 Score =  345 bits (884), Expect = 2e-92,   Method: Compositional matrix adjust.
 Identities = 170/270 (62%), Positives = 207/270 (76%), Gaps = 8/270 (2%)

Query: 10  LVLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEY 69
           +  A    +G W  Q  +RTL +A+M ERHE WMA Y RVY+D  EK+MR+KIFKENV+ 
Sbjct: 10  ITFALFFSIGAWTSQCMARTLQEASMYERHEQWMASYARVYKDANEKQMRYKIFKENVQR 69

Query: 70  IASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENAS-V 128
           I SFN+++ +K YKL +N+FAD TNEEF++ RNG+K  + S ++       FRYEN + V
Sbjct: 70  IDSFNSES-DKSYKLAVNQFADLTNEEFKSLRNGFKGHMCSAQAGH-----FRYENVTAV 123

Query: 129 PASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGE 188
           PASIDWRKKGAVT +K+QGQCG CWAFSAVAA+EGI  I T KL SLSEQELVDCDT+ E
Sbjct: 124 PASIDWRKKGAVTQIKEQGQCGSCWAFSAVAAVEGITEIKTGKLISLSEQELVDCDTNSE 183

Query: 189 DQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNE 248
           DQGC+GGLMDDAF+F I   GLA+EA YPY A+D +C  KE    +AKI+GYEDVP+N+E
Sbjct: 184 DQGCQGGLMDDAFKF-IEQHGLASEATYPYDAADSTCKTKEEAKPSAKITGYEDVPANDE 242

Query: 249 AALMKAVANQPVSVAIDASGSDFQFYSSGV 278
           AAL  AVANQPVSVAIDA G +FQFYSSG+
Sbjct: 243 AALKNAVANQPVSVAIDAGGFEFQFYSSGI 272


>gi|413951606|gb|AFW84255.1| hypothetical protein ZEAMMB73_933931 [Zea mays]
          Length = 379

 Score =  345 bits (884), Expect = 2e-92,   Method: Compositional matrix adjust.
 Identities = 176/322 (54%), Positives = 221/322 (68%), Gaps = 13/322 (4%)

Query: 31  NDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFA 90
           +D  + + +E W   + RV+R + EK  RF  FKENV +I + N +  ++PY+L +N F 
Sbjct: 36  SDEALWDLYERWQTHH-RVHRHHGEKGRRFGTFKENVRFIHAHNKRG-DRPYRLRLNRFG 93

Query: 91  DQTNEEFRA----PRNGYKRRLPSVRSSETTDVSFRYENAS-VPASIDWRKKGAVTGVKD 145
           D   EEFR+     R    RR  S  +       F Y++A+  P S+DWR++GAVTGVKD
Sbjct: 94  DMGREEFRSTFADSRINDLRRQDSPAARAGAVPGFMYDSAADPPRSVDWRQEGAVTGVKD 153

Query: 146 QGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFII 205
           QG CG CWAFS V A+EGIN I T  L SLSEQEL+DCDT  ++ GC+GGLM++AFEFI 
Sbjct: 154 QGHCGSCWAFSTVVAVEGINAIRTGSLASLSEQELIDCDT--DENGCQGGLMENAFEFIK 211

Query: 206 SNKGLATEAKYPYKASDGSCNKKEAN---PSAAKISGYEDVPSNNEAALMKAVANQPVSV 262
           S  G+ TEA YPY+AS+G+C+   A         I G++ VP+ +E AL KAVA+QPVSV
Sbjct: 212 SFGGITTEAAYPYRASNGTCDGDRARRGGGVVVVIDGHQMVPAGSEDALAKAVAHQPVSV 271

Query: 263 AIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYI 322
           A+DA G  FQFYS GVFTG CGT+LDHGV AVGYG  DDGT YW+VKNSWGT+WGE GYI
Sbjct: 272 AVDAGGQAFQFYSEGVFTGDCGTDLDHGVAAVGYGVGDDGTPYWIVKNSWGTSWGEGGYI 331

Query: 323 RMQRDIDAKEGLCGIAMQASYP 344
           RMQR      GLCGIAM+AS+P
Sbjct: 332 RMQRGA-GNGGLCGIAMEASFP 352


>gi|356517184|ref|XP_003527269.1| PREDICTED: xylem cysteine proteinase 2-like [Glycine max]
          Length = 350

 Score =  345 bits (884), Expect = 2e-92,   Method: Compositional matrix adjust.
 Identities = 171/309 (55%), Positives = 212/309 (68%), Gaps = 7/309 (2%)

Query: 37  ERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEE 96
           E  E W++++G++Y+   EK  RF+IFK+N+++I   N    N  Y LG+NEFAD +++E
Sbjct: 46  ELFESWISRHGKIYQSIEEKLHRFEIFKDNLKHIDERNKVVSN--YWLGLNEFADLSHQE 103

Query: 97  FRAPRNGYKRRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQCGCCWAFS 156
           F+    G K      R S      F Y++  +P S+DWRKKGAVT VK+QG CG CWAFS
Sbjct: 104 FKNKYLGLKVDYSRRRESPE---EFTYKDVELPKSVDWRKKGAVTQVKNQGSCGSCWAFS 160

Query: 157 AVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKY 216
            VAA+EGIN I T  LTSLSEQEL+DCD +  + GC GGLMD AF FI+ N GL  E  Y
Sbjct: 161 TVAAVEGINQIVTGNLTSLSEQELIDCDRT-YNNGCNGGLMDYAFSFIVENDGLHKEEDY 219

Query: 217 PYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSS 276
           PY   +G+C   +       ISGY DVP NNE +L+KA+ANQP+SVAI+ASG DFQFYS 
Sbjct: 220 PYIMEEGTCEMAKEETEVVTISGYHDVPQNNEQSLLKALANQPLSVAIEASGRDFQFYSG 279

Query: 277 GVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCG 336
           GVF G CG++LDHGV AVGYGTA  G  Y  VKNSWG+ WGE GYIRM+R+I   EG+CG
Sbjct: 280 GVFDGHCGSDLDHGVAAVGYGTA-KGVDYITVKNSWGSKWGEKGYIRMRRNIGKPEGICG 338

Query: 337 IAMQASYPT 345
           I   ASYPT
Sbjct: 339 IYKMASYPT 347


>gi|111073715|dbj|BAF02546.1| triticain alpha [Triticum aestivum]
 gi|388890585|gb|AFK80346.1| cysteine endopeptidase EP alpha [Secale cereale x Triticum durum]
          Length = 461

 Score =  345 bits (884), Expect = 2e-92,   Method: Compositional matrix adjust.
 Identities = 170/309 (55%), Positives = 215/309 (69%), Gaps = 12/309 (3%)

Query: 42  WMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARN--KPYKLGINEFADQTNEEFRA 99
           WM+++ R Y    E+E RF++F++N+ YI   N  A      ++LG+N FAD TNEE+R+
Sbjct: 44  WMSEHRRTYNAIGEEERRFEVFRDNLRYIDQHNAAADAGLHSFRLGLNRFADLTNEEYRS 103

Query: 100 PRNGYKRRLPSVRSSETTDVSFRYE---NASVPASIDWRKKGAVTGVKDQGQCGCCWAFS 156
              G + +    R      +S RY+   N  +P ++DWRKKGAV  +KDQG CG CWAFS
Sbjct: 104 TYLGARTKPDRERK-----LSARYQADDNEELPETVDWRKKGAVAAIKDQGGCGSCWAFS 158

Query: 157 AVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKY 216
           A+AA+EGIN I T  +  LSEQELVDCDTS  ++GC GGLMD AFEFII+N G+ +E  Y
Sbjct: 159 AIAAVEGINQIVTGDMIPLSEQELVDCDTS-YNEGCNGGLMDYAFEFIINNGGIDSEEDY 217

Query: 217 PYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSS 276
           PYK  D  C+  + N     I GYEDVP N+E +L KAVANQP+SVAI+A G  FQ Y S
Sbjct: 218 PYKERDNRCDANKKNAKVVTIDGYEDVPVNSEKSLQKAVANQPISVAIEAGGRAFQLYKS 277

Query: 277 GVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCG 336
           G+FTG CGT LDHGV AVGYGT ++G  YWLV+NSWGT WGE+GYIRM+R+I A  G CG
Sbjct: 278 GIFTGTCGTALDHGVAAVGYGT-ENGKDYWLVRNSWGTVWGEDGYIRMERNIKASSGKCG 336

Query: 337 IAMQASYPT 345
           IA++ SYPT
Sbjct: 337 IAVEPSYPT 345


>gi|18408828|ref|NP_566920.1| putative cysteine proteinase [Arabidopsis thaliana]
 gi|12324451|gb|AAG52191.1|AC012329_18 putative cysteine proteinase; 15366-14136 [Arabidopsis thaliana]
 gi|6723404|emb|CAB66413.1| cysteine protease-like protein [Arabidopsis thaliana]
 gi|332645009|gb|AEE78530.1| putative cysteine proteinase [Arabidopsis thaliana]
          Length = 341

 Score =  345 bits (884), Expect = 3e-92,   Method: Compositional matrix adjust.
 Identities = 168/323 (52%), Positives = 226/323 (69%), Gaps = 13/323 (4%)

Query: 30  LNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEF 89
           L +A+  E+HE WM+++ RVY D++EK  RF+IF  N++++ S N    NK Y L +NEF
Sbjct: 26  LFEASAVEKHEQWMSRFNRVYSDDSEKTSRFEIFTNNLKFVESINMNT-NKTYTLDVNEF 84

Query: 90  ADQTNEEFRAPRNGY-----KRRLPSVRSSETTDVSFRYENA-SVPASIDWRKKGAVTGV 143
           +D T+EEF+A   G        R+ +  S ET  VSFRYEN      S+DW ++GAVT V
Sbjct: 85  SDLTDEEFKARYTGLVVPEGMTRISTTDSHET--VSFRYENVGETGESMDWIQEGAVTSV 142

Query: 144 KDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEF 203
           K Q QCGCCWAFSAVAA+EG+  I   +L SLSEQ+L+DC T  E+ GC GG+M  AF++
Sbjct: 143 KHQQQCGCCWAFSAVAAVEGMTKIANGELVSLSEQQLLDCST--ENNGCGGGIMWKAFDY 200

Query: 204 IISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVA 263
           I  N+G+ TE  YPY+ +  +C     + +AA ISGYE VP N+E AL+KAV+ QPVSVA
Sbjct: 201 IKENQGITTEDNYPYQGAQQTCESN--HLAAATISGYETVPQNDEEALLKAVSQQPVSVA 258

Query: 264 IDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIR 323
           I+ SG +F  YS G+F G+CGT+L H VT VGYG +++G KYWL+KNSWG +WGENGY+R
Sbjct: 259 IEGSGYEFIHYSGGIFNGECGTQLTHAVTIVGYGVSEEGIKYWLLKNSWGESWGENGYMR 318

Query: 324 MQRDIDAKEGLCGIAMQASYPTA 346
           + RD+D+ +G+CG+A  A YP A
Sbjct: 319 IMRDVDSPQGMCGLASLAYYPVA 341


>gi|226529105|ref|NP_001150196.1| cysteine protease 1 precursor [Zea mays]
 gi|194701798|gb|ACF84983.1| unknown [Zea mays]
 gi|194704800|gb|ACF86484.1| unknown [Zea mays]
 gi|195637480|gb|ACG38208.1| cysteine protease 1 precursor [Zea mays]
 gi|413919895|gb|AFW59827.1| cysteine protease 1 [Zea mays]
          Length = 470

 Score =  345 bits (884), Expect = 3e-92,   Method: Compositional matrix adjust.
 Identities = 162/313 (51%), Positives = 221/313 (70%), Gaps = 9/313 (2%)

Query: 39  HEMWMAQYGRVY----RDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTN 94
           +++W+A++GR Y        E++ RF +F +N+ ++ + N +A  + ++LG+N+FAD TN
Sbjct: 57  YDLWLAEHGRAYNALGEGEGERDRRFLVFWDNLRFVDAHNERAGARGFRLGMNQFADLTN 116

Query: 95  EEFRAPRNGYKRRLPSVRSSETTDVSFRYENAS--VPASIDWRKKGAVTGVKDQGQCGCC 152
           +EFRA   G    +P+ R        +R++ A+  +P S+DWR+KGAV  VK+QGQCG C
Sbjct: 117 DEFRAAYLGAM--VPAARRGAVVGERYRHDGAAEELPESVDWREKGAVAPVKNQGQCGSC 174

Query: 153 WAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLAT 212
           WAFSAV+++E +N I T ++ +LSEQELV+C T G + GC GGLMD AF+FII N G+ T
Sbjct: 175 WAFSAVSSVESVNQIVTGEMVTLSEQELVECSTDGGNSGCNGGLMDAAFDFIIKNGGIDT 234

Query: 213 EAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQ 272
           E  YPY+A DG C+    N     I G+EDVP N+E +L KAVA+QPVSVAI+A G +FQ
Sbjct: 235 EDDYPYRAVDGKCDMNRKNARVVSIDGFEDVPENDEKSLQKAVAHQPVSVAIEAGGREFQ 294

Query: 273 FYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKE 332
            Y SGVF+G C T LDHGV AVGYG A++G  YW+V+NSWG  WGE GYIRM+R+++A  
Sbjct: 295 LYKSGVFSGSCTTNLDHGVVAVGYG-AENGKDYWIVRNSWGPKWGEAGYIRMERNVNAST 353

Query: 333 GLCGIAMQASYPT 345
           G CGIAM ASYPT
Sbjct: 354 GKCGIAMMASYPT 366


>gi|297851334|ref|XP_002893548.1| peptidase C1A papain family protein [Arabidopsis lyrata subsp.
           lyrata]
 gi|297339390|gb|EFH69807.1| peptidase C1A papain family protein [Arabidopsis lyrata subsp.
           lyrata]
          Length = 346

 Score =  344 bits (883), Expect = 3e-92,   Method: Compositional matrix adjust.
 Identities = 180/353 (50%), Positives = 234/353 (66%), Gaps = 14/353 (3%)

Query: 1   MAMILLENKLVLAAILVLGVWAPQSWSR-TLNDATMNERHEMWMAQYGRVYRDNAEKEMR 59
           M  IL     V   IL + +   Q+ SR T ++  + E H+ WM ++ RVY D  EK+MR
Sbjct: 1   MTSILF--MFVSLTILSMSLKVSQATSRVTFHEPIVAEHHQQWMTRFSRVYSDELEKQMR 58

Query: 60  FKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDV 119
           F +FK+N+++I  FN K  ++ YKLG+NEFAD T EEF A   G K     + SSE  D 
Sbjct: 59  FDVFKKNLKFIEKFNKKG-DRTYKLGVNEFADWTKEEFIATHTGLKG-FNGIPSSEFVDE 116

Query: 120 SFRYENASV-----PASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTS 174
                N +V     P   DWR +GAVT VK QGQCGCCWAFS+VAA+EG+  I    L S
Sbjct: 117 MIPSWNWNVSDVAGPEIKDWRYEGAVTPVKYQGQCGCCWAFSSVAAVEGLTKIVGGNLVS 176

Query: 175 LSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSA 234
           LSEQ+L+DCD    D GC GG+M DAF +II N+G+A+EA YPY+ ++G+C +  A PSA
Sbjct: 177 LSEQQLLDCDRE-RDNGCNGGIMSDAFSYIIKNRGIASEASYPYQETEGTC-RYNAKPSA 234

Query: 235 AKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVF-TGQCGTELDHGVTA 293
             I G++ VPSNNE AL++AV+ QPVSV+IDA G  F  YS GV+    CGT+++H VT 
Sbjct: 235 W-IRGFQTVPSNNERALLEAVSRQPVSVSIDADGPGFMHYSGGVYDEPYCGTDVNHAVTF 293

Query: 294 VGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPTA 346
           VGYGT+ +G KYWL KNSWG TWGENGYIR++RD+   +G+CG+A  A YP A
Sbjct: 294 VGYGTSPEGIKYWLAKNSWGETWGENGYIRIRRDVAWPQGMCGVAQYAFYPVA 346


>gi|356564154|ref|XP_003550321.1| PREDICTED: cysteine proteinase RD21a-like [Glycine max]
          Length = 476

 Score =  344 bits (883), Expect = 3e-92,   Method: Compositional matrix adjust.
 Identities = 175/310 (56%), Positives = 223/310 (71%), Gaps = 10/310 (3%)

Query: 39  HEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFR 98
           +E W+ ++G+VY    EKE RF+IFK+N+ +I   +N A ++ YKLG+N FAD TNEE+R
Sbjct: 59  YEQWLVKHGKVYNALGEKEKRFQIFKDNLRFIDD-HNSAEDRTYKLGLNRFADLTNEEYR 117

Query: 99  APRNGYKRRLPSVRSSETTDVSFRYE---NASVPASIDWRKKGAVTGVKDQGQCGCCWAF 155
           A   G K   P+ R  +T   S RY       +P S+DWRK+GAV  VKDQG CG CWAF
Sbjct: 118 AKYLGTKID-PNRRLGKTP--SNRYAPRVGDKLPDSVDWRKEGAVPPVKDQGGCGSCWAF 174

Query: 156 SAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAK 215
           SA+ A+EGIN I T +L SLSEQELVDCDT G +QGC GGLMD AFEFII+N G+ ++  
Sbjct: 175 SAIGAVEGINKIVTGELISLSEQELVDCDT-GYNQGCNGGLMDYAFEFIINNGGIDSDED 233

Query: 216 YPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYS 275
           YPY+  DG C+    N     I  YEDVP+ +E AL KAVANQPVSVAI+  G +FQ Y 
Sbjct: 234 YPYRGVDGRCDTYRKNAKVVSIDDYEDVPAYDELALKKAVANQPVSVAIEGGGREFQLYV 293

Query: 276 SGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDI-DAKEGL 334
           SGVFTG+CGT LDHGV AVGYGTA  G  YW+V+NSWG++WGE+GYIR++R++ +++ G 
Sbjct: 294 SGVFTGRCGTALDHGVVAVGYGTA-KGHDYWIVRNSWGSSWGEDGYIRLERNLANSRSGK 352

Query: 335 CGIAMQASYP 344
           CGIA++ SYP
Sbjct: 353 CGIAIEPSYP 362


>gi|334185815|ref|NP_680113.3| putative cysteine proteinase [Arabidopsis thaliana]
 gi|75313879|sp|Q9STL4.1|CEP2_ARATH RecName: Full=KDEL-tailed cysteine endopeptidase CEP2; Flags:
           Precursor
 gi|4678354|emb|CAB41164.1| cysteine endopeptidase-like protein [Arabidopsis thaliana]
 gi|332644882|gb|AEE78403.1| putative cysteine proteinase [Arabidopsis thaliana]
          Length = 361

 Score =  344 bits (883), Expect = 3e-92,   Method: Compositional matrix adjust.
 Identities = 173/311 (55%), Positives = 220/311 (70%), Gaps = 11/311 (3%)

Query: 39  HEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFR 98
           ++ W + +  V R   E+E RF +F+ NV ++ + N K  N+ YKL +N+FAD T  EF+
Sbjct: 38  YDRWRSHHS-VPRSLNEREKRFNVFRHNVMHVHNTNKK--NRSYKLKLNKFADLTINEFK 94

Query: 99  APRNG----YKRRLPSVRSSETTDVSFRYENAS-VPASIDWRKKGAVTGVKDQGQCGCCW 153
               G    + R L   +   +    + +EN S +P+S+DWRKKGAVT +K+QG+CG CW
Sbjct: 95  NAYTGSNIKHHRMLQGPKRG-SKQFMYDHENLSKLPSSVDWRKKGAVTEIKNQGKCGSCW 153

Query: 154 AFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATE 213
           AFS VAA+EGIN I T KL SLSEQELVDCDT  +++GC GGLM+ AFEFI  N G+ TE
Sbjct: 154 AFSTVAAVEGINKIKTNKLVSLSEQELVDCDTK-QNEGCNGGLMEIAFEFIKKNGGITTE 212

Query: 214 AKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQF 273
             YPY+  DG C+  + N     I G+EDVP N+E AL+KAVANQPVSVAIDA  SDFQF
Sbjct: 213 DSYPYEGIDGKCDASKDNGVLVTIDGHEDVPENDENALLKAVANQPVSVAIDAGSSDFQF 272

Query: 274 YSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEG 333
           YS GVFTG CGTEL+HGV AVGYG+ + G KYW+V+NSWG  WGE GYI+++R+ID  EG
Sbjct: 273 YSEGVFTGSCGTELNHGVAAVGYGS-ERGKKYWIVRNSWGAEWGEGGYIKIEREIDEPEG 331

Query: 334 LCGIAMQASYP 344
            CGIAM+ASYP
Sbjct: 332 RCGIAMEASYP 342


>gi|449500145|ref|XP_004161017.1| PREDICTED: xylem cysteine proteinase 1-like [Cucumis sativus]
          Length = 349

 Score =  344 bits (882), Expect = 4e-92,   Method: Compositional matrix adjust.
 Identities = 170/310 (54%), Positives = 219/310 (70%), Gaps = 9/310 (2%)

Query: 37  ERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEE 96
           E  E WM+++ + YR   EK  RF+IF +N+++I   N K  +  Y LG+NEFAD ++EE
Sbjct: 45  ELFESWMSKHSKTYRSIEEKLHRFEIFLDNLKHIDETNKKVSS--YWLGLNEFADLSHEE 102

Query: 97  FRAPRNGYKRRLPSVRSSETTDVSFRYENAS-VPASIDWRKKGAVTGVKDQGQCGCCWAF 155
           F++   G +   P  RSS      F Y +   +P S+DWR KGAVT VK+QG CG CWAF
Sbjct: 103 FKSKYLGLRVEFPRKRSSR----GFSYGDVEDLPESVDWRTKGAVTPVKNQGSCGSCWAF 158

Query: 156 SAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAK 215
           S VAA+EGIN I T  LTSLSEQEL+DCD S  + GC GGLMD AF++I+SN GL  E  
Sbjct: 159 STVAAVEGINQIVTGNLTSLSEQELIDCDRSF-NNGCYGGLMDYAFQYIMSNSGLRKEED 217

Query: 216 YPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYS 275
           YPY   +G C +++       ISGYEDVP+N+E +L+KA+++QPVSVAI+AS  +FQFY 
Sbjct: 218 YPYLMEEGRCIREKEQFEVVTISGYEDVPANDEQSLLKALSHQPVSVAIEASSRNFQFYK 277

Query: 276 SGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLC 335
            G+FTG+CGT++DHGVTAVGYG++ +GT Y +VKNSWG  WGENGYIRM+R+    EGLC
Sbjct: 278 GGIFTGRCGTQMDHGVTAVGYGSS-EGTDYIIVKNSWGPKWGENGYIRMKRNTGKPEGLC 336

Query: 336 GIAMQASYPT 345
           GI   ASYPT
Sbjct: 337 GINQMASYPT 346


>gi|297816028|ref|XP_002875897.1| hypothetical protein ARALYDRAFT_347926 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297321735|gb|EFH52156.1| hypothetical protein ARALYDRAFT_347926 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 361

 Score =  344 bits (882), Expect = 4e-92,   Method: Compositional matrix adjust.
 Identities = 172/314 (54%), Positives = 221/314 (70%), Gaps = 9/314 (2%)

Query: 35  MNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTN 94
           +++ ++ W + +  V R   E+E RF +F+ NV ++ + N K  N+ YKL +N+FAD T 
Sbjct: 34  LSKLYDRWRSHHS-VPRSLHEREKRFNVFRHNVMHVHNSNKK--NRSYKLKLNKFADLTI 90

Query: 95  EEFRAPRNGYK---RRLPSVRSSETTDVSFRYENAS-VPASIDWRKKGAVTGVKDQGQCG 150
            EF+    G K    R+       +    + +EN S +P+S+DWRKKGAVT +K+QG+CG
Sbjct: 91  HEFKNAYTGSKIKHHRMLQGPKRGSKQFMYDHENVSKLPSSVDWRKKGAVTEIKNQGKCG 150

Query: 151 CCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGL 210
            CWAFS VAA+EGIN I T KL SLSEQELVDCDT+ +++GC GGLM+ AFEFI  N G+
Sbjct: 151 SCWAFSTVAAVEGINKIKTNKLVSLSEQELVDCDTN-QNEGCNGGLMEIAFEFIKKNGGI 209

Query: 211 ATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSD 270
            TE  YPY+  DG C+  + N     I G+E+VP N+E AL+KAVANQPVSVAIDA  SD
Sbjct: 210 TTEDSYPYEGIDGKCDASKDNGVLVTIDGHENVPENDENALLKAVANQPVSVAIDAGSSD 269

Query: 271 FQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDA 330
           FQFYS GVFTG CGTEL+HGV  VGYG+   G KYW+V+NSWGT WGE GYI+++R ID 
Sbjct: 270 FQFYSEGVFTGDCGTELNHGVATVGYGS-QGGKKYWIVRNSWGTEWGEGGYIKIERGIDE 328

Query: 331 KEGLCGIAMQASYP 344
            EG CGIAM+ASYP
Sbjct: 329 PEGRCGIAMEASYP 342


>gi|449454309|ref|XP_004144898.1| PREDICTED: xylem cysteine proteinase 1-like [Cucumis sativus]
 gi|449471311|ref|XP_004153272.1| PREDICTED: xylem cysteine proteinase 1-like [Cucumis sativus]
          Length = 349

 Score =  344 bits (882), Expect = 4e-92,   Method: Compositional matrix adjust.
 Identities = 170/310 (54%), Positives = 219/310 (70%), Gaps = 9/310 (2%)

Query: 37  ERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEE 96
           E  E WM+++ + YR   EK  RF+IF +N+++I   N K  +  Y LG+NEFAD ++EE
Sbjct: 45  ELFESWMSKHSKAYRSIEEKLHRFEIFLDNLKHIDETNKKVSS--YWLGLNEFADLSHEE 102

Query: 97  FRAPRNGYKRRLPSVRSSETTDVSFRYENAS-VPASIDWRKKGAVTGVKDQGQCGCCWAF 155
           F++   G +   P  RSS      F Y +   +P S+DWR KGAVT VK+QG CG CWAF
Sbjct: 103 FKSKYLGLRVEFPRKRSSR----GFSYGDVEDLPESVDWRTKGAVTPVKNQGSCGSCWAF 158

Query: 156 SAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAK 215
           S VAA+EGIN I T  LTSLSEQEL+DCD S  + GC GGLMD AF++I+SN GL  E  
Sbjct: 159 STVAAVEGINQIVTGNLTSLSEQELIDCDRSF-NNGCYGGLMDYAFQYIMSNSGLRKEED 217

Query: 216 YPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYS 275
           YPY   +G C +++       ISGYEDVP+N+E +L+KA+++QPVSVAI+AS  +FQFY 
Sbjct: 218 YPYLMEEGRCIREKEQFEVVTISGYEDVPANDEQSLLKALSHQPVSVAIEASSRNFQFYK 277

Query: 276 SGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLC 335
            G+FTG+CGT++DHGVTAVGYG++ +GT Y +VKNSWG  WGENGYIRM+R+    EGLC
Sbjct: 278 GGIFTGRCGTQMDHGVTAVGYGSS-EGTDYIIVKNSWGPKWGENGYIRMKRNTGKPEGLC 336

Query: 336 GIAMQASYPT 345
           GI   ASYPT
Sbjct: 337 GINQMASYPT 346


>gi|1256830|gb|AAB68374.1| cysteine endopeptidase 1 [Phaseolus vulgaris]
 gi|2959418|emb|CAA12118.1| cysteine protease [Phaseolus vulgaris]
          Length = 364

 Score =  344 bits (882), Expect = 4e-92,   Method: Compositional matrix adjust.
 Identities = 175/345 (50%), Positives = 232/345 (67%), Gaps = 16/345 (4%)

Query: 10  LVLAAILVLGVWAPQSWSRTLNDATMNERHEM---WMAQYGRVYRDNAEKEMRFKIFKEN 66
           +++  +L+L      + + ++ + + NE  +M   W+ ++ +VY    EKE RF++FK+N
Sbjct: 4   MLIPTLLLLSFTFSHATAMSIINYSENEVMDMYEEWLVKHRKVYNGLDEKEKRFQVFKDN 63

Query: 67  VEYIASFNNKARNKPYKLGINEFADQTNEEFRA----PRNGYKRRLPSVRSSETTDVSFR 122
           + +I   N  A+N  Y LG+N+FAD TN+E+RA     R   KRR   V  ++ T   + 
Sbjct: 64  LGFIQDHN--AQNNTYTLGLNKFADITNKEYRAMYLGTRTDAKRR---VMKTQNTGHRYA 118

Query: 123 YENA-SVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELV 181
           Y +   +P  +DWR KGAV  +KDQG CG CWAFS VAA+EGIN+I T +  SLSEQELV
Sbjct: 119 YNSGDQLPVHVDWRLKGAVGPIKDQGNCGSCWAFSTVAAVEGINNIVTGEFVSLSEQELV 178

Query: 182 DCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYE 241
           DCD    D+GC GGLMD AF+FII N G+ TE  YPY+  DG+C++ +      +I GYE
Sbjct: 179 DCDRE-YDEGCNGGLMDYAFQFIIQNGGIDTEEDYPYQGIDGTCDETKKKTKVVQIDGYE 237

Query: 242 DVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADD 301
           DVPSNNE AL KAV++QPVSVAI+ASG   Q Y SGVFTG+CGT LDHGV  VGYGT ++
Sbjct: 238 DVPSNNENALKKAVSHQPVSVAIEASGRALQLYQSGVFTGKCGTALDHGVVVVGYGT-EN 296

Query: 302 GTKYWLVKNSWGTTWGENGYIRMQRDI-DAKEGLCGIAMQASYPT 345
           G  YWLV+NSWGT WGE+GY +M+R++    EG CGIAM  SYP 
Sbjct: 297 GVDYWLVRNSWGTGWGEDGYFKMERNVRSTSEGKCGIAMDCSYPV 341


>gi|356508487|ref|XP_003522988.1| PREDICTED: xylem cysteine proteinase 2-like [Glycine max]
          Length = 349

 Score =  344 bits (882), Expect = 4e-92,   Method: Compositional matrix adjust.
 Identities = 170/309 (55%), Positives = 211/309 (68%), Gaps = 7/309 (2%)

Query: 37  ERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEE 96
           E  E WM+++G++Y+   EK  RF IFK+N+++I   N    N  Y LG+NEFAD +++E
Sbjct: 45  ELFESWMSRHGKIYQSIEEKLHRFDIFKDNLKHIDERNKVVSN--YWLGLNEFADLSHQE 102

Query: 97  FRAPRNGYKRRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQCGCCWAFS 156
           F+    G K      R S      F Y++  +P S+DWRKKGAVT VK+QG CG CWAFS
Sbjct: 103 FKNKYLGLKVDYSRRRESPE---EFTYKDFELPKSVDWRKKGAVTQVKNQGSCGSCWAFS 159

Query: 157 AVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKY 216
            VAA+EGIN I T  LTSLSEQEL+DCD +  + GC GGLMD AF FI+ N GL  E  Y
Sbjct: 160 TVAAVEGINQIVTGNLTSLSEQELIDCDRT-YNNGCNGGLMDYAFSFIVENGGLHKEEDY 218

Query: 217 PYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSS 276
           PY   +G+C   +       ISGY DVP NNE +L+KA+ NQP+SVAI+ASG DFQFYS 
Sbjct: 219 PYIMEEGTCEMTKEETEVVTISGYHDVPQNNEQSLLKALVNQPLSVAIEASGRDFQFYSG 278

Query: 277 GVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCG 336
           GVF G CG++LDHGV AVGYGT+  G  Y +VKNSWG+ WGE GYIRM+R+I   EG+CG
Sbjct: 279 GVFDGHCGSDLDHGVAAVGYGTS-KGVNYIIVKNSWGSKWGEKGYIRMRRNIGKPEGICG 337

Query: 337 IAMQASYPT 345
           I   ASYPT
Sbjct: 338 IYKMASYPT 346


>gi|146216000|gb|ABQ10202.1| cysteine protease Cp4 [Actinidia deliciosa]
          Length = 463

 Score =  344 bits (882), Expect = 4e-92,   Method: Compositional matrix adjust.
 Identities = 172/315 (54%), Positives = 221/315 (70%), Gaps = 6/315 (1%)

Query: 32  DATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFAD 91
           DA     +E W+  +G+ Y    EKE RF+IFK+N+ ++   N  A +  Y++G+N FAD
Sbjct: 40  DAEAMAIYEKWLTTHGKAYNAIGEKERRFEIFKDNLRFVDEHNAVAGS--YRVGLNRFAD 97

Query: 92  QTNEEFRAPRNGYKRRLPSVRSSETTD-VSFRYENASVPASIDWRKKGAVTGVKDQGQCG 150
            TNEE+R+   G    +    +S  +D  +FR  +  +P S+DWR+KGAV+ VKDQGQCG
Sbjct: 98  LTNEEYRSMFLGGNMEMKERSASTKSDRYAFRAGD-KLPGSVDWREKGAVSPVKDQGQCG 156

Query: 151 CCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGL 210
            CWAFS ++A+EGIN I T +L SLSEQELVDCD S  + GC GGLMD  F+FII+N G+
Sbjct: 157 SCWAFSTISAVEGINQIVTGELISLSEQELVDCDKS-YNMGCNGGLMDYGFQFIINNGGI 215

Query: 211 ATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSD 270
            TE  YPY+A DG+C++   N     I+GYEDVP ++E +L KAVANQPVSVAI+A G  
Sbjct: 216 DTEEDYPYRAVDGTCDQFRKNARVVSINGYEDVPEDDENSLKKAVANQPVSVAIEAGGRA 275

Query: 271 FQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDA 330
           FQ Y SGVFTG CGT LDHGV AVGYGT ++G  YW V+NSWG  WGENGYI+++R+I+A
Sbjct: 276 FQLYESGVFTGHCGTNLDHGVVAVGYGT-ENGVDYWTVRNSWGPKWGENGYIKLERNINA 334

Query: 331 KEGLCGIAMQASYPT 345
             G CGIA  ASYPT
Sbjct: 335 TSGKCGIASMASYPT 349


>gi|50355619|dbj|BAD29958.1| cysteine protease [Daucus carota]
          Length = 496

 Score =  344 bits (882), Expect = 4e-92,   Method: Compositional matrix adjust.
 Identities = 165/306 (53%), Positives = 211/306 (68%), Gaps = 3/306 (0%)

Query: 40  EMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRA 99
           E W+  +G+ Y    E+E RF+IFK N+ YI    N   ++ +KLG+N+FAD TNEE+R+
Sbjct: 46  ESWLVTHGKSYNALGEEEKRFQIFKNNLRYIDE-QNLVEDRGFKLGLNKFADLTNEEYRS 104

Query: 100 PRNGYKRRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVA 159
              G K +    + S  +         S+P S+DWR+ GAV  VKDQG CG CWAFS ++
Sbjct: 105 KYTGIKSKDLRKKVSAKSGRYATLSGESLPESVDWRESGAVATVKDQGSCGSCWAFSTIS 164

Query: 160 AMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYK 219
           A+EGIN I T KL +LSEQELVDCD S  ++GC GGLMD AFEFII+N G+ T+  YPY 
Sbjct: 165 AVEGINQIATGKLITLSEQELVDCDRS-YNEGCNGGLMDYAFEFIINNGGIDTDVDYPYT 223

Query: 220 ASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVF 279
             DG C++   N     I  YEDVP+ +E AL KA ANQP+SVAI+ASG DFQFY SG+F
Sbjct: 224 GRDGKCDQYRKNAKVVTIDSYEDVPAYDELALKKAAANQPISVAIEASGRDFQFYDSGIF 283

Query: 280 TGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAM 339
           TG+CG  LDHGV  VGYGT ++G  YW+V+NSWG  WGENGY+RM+R I +K G+CGIA+
Sbjct: 284 TGKCGIALDHGVVVVGYGT-ENGKDYWIVRNSWGADWGENGYLRMERGISSKTGICGIAI 342

Query: 340 QASYPT 345
           + SYP 
Sbjct: 343 EPSYPV 348


>gi|356517188|ref|XP_003527271.1| PREDICTED: xylem cysteine proteinase 2-like [Glycine max]
          Length = 350

 Score =  343 bits (881), Expect = 5e-92,   Method: Compositional matrix adjust.
 Identities = 171/309 (55%), Positives = 210/309 (67%), Gaps = 7/309 (2%)

Query: 37  ERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEE 96
           E  E WM+++G++Y +  EK +RF+IFK+N+++I   N    N  Y LG+NEFAD ++ E
Sbjct: 46  ELFESWMSRHGKIYENIEEKLLRFEIFKDNLKHIDERNKVVSN--YWLGLNEFADLSHRE 103

Query: 97  FRAPRNGYKRRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQCGCCWAFS 156
           F     G K      R S      F Y++  +P S+DWRKKGAV  VK+QG CG CWAFS
Sbjct: 104 FNNKYLGLKVDYSRRRESPE---EFTYKDVELPKSVDWRKKGAVAPVKNQGSCGSCWAFS 160

Query: 157 AVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKY 216
            VAA+EGIN I T  LTSLSEQEL+DCD +  + GC GGLMD AF FI+ N GL  E  Y
Sbjct: 161 TVAAVEGINQIVTGNLTSLSEQELIDCDRT-YNNGCNGGLMDYAFSFIVENGGLHKEEDY 219

Query: 217 PYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSS 276
           PY   +G+C   +       ISGY DVP NNE +L+KA+ANQP+SVAI+ASG DFQFYS 
Sbjct: 220 PYIMEEGTCEMTKEETQVVTISGYHDVPQNNEQSLLKALANQPLSVAIEASGRDFQFYSG 279

Query: 277 GVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCG 336
           GVF G CG++LDHGV AVGYGTA  G  Y  VKNSWG+ WGE GYIRM+R+I   EG+CG
Sbjct: 280 GVFDGHCGSDLDHGVAAVGYGTA-KGVDYITVKNSWGSKWGEKGYIRMRRNIGKPEGICG 338

Query: 337 IAMQASYPT 345
           I   ASYPT
Sbjct: 339 IYKMASYPT 347


>gi|57282619|emb|CAE54307.1| cysteine proteinase [Gossypium hirsutum]
          Length = 372

 Score =  343 bits (881), Expect = 5e-92,   Method: Compositional matrix adjust.
 Identities = 172/310 (55%), Positives = 216/310 (69%), Gaps = 7/310 (2%)

Query: 39  HEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFR 98
           ++ W+ Q+G+ Y    E+E RF+IFK+N+ +I   N+   N  YKLG+N+FAD TN+E+R
Sbjct: 46  YKSWVIQHGKAYNGIGEEEKRFEIFKDNLRFIDEHNSN-NNTTYKLGLNKFADLTNQEYR 104

Query: 99  APRNGYKRRLPSVRSSETTDVSFRYENAS---VPASIDWRKKGAVTGVKDQGQCGCCWAF 155
           A   G  R  P  R  ++   S RY + +   +P S++WR  GAV+ VKDQG CG CWAF
Sbjct: 105 AKFLG-TRTDPRRRLMKSKIPSSRYAHRAGDNLPDSVNWRDHGAVSRVKDQGSCGSCWAF 163

Query: 156 SAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAK 215
           SA+AA+EGIN I + +L SLSEQELVDCD S  D GC GGLMD AF+FII N G+ TE  
Sbjct: 164 SAIAAVEGINKIVSGELISLSEQELVDCDRS-YDAGCNGGLMDYAFQFIIDNGGIDTEKD 222

Query: 216 YPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYS 275
           YPY   +  C+  + N     I GYEDVP NNE AL KAVA+QPVS+AI+A G  FQ Y 
Sbjct: 223 YPYLGFNNQCDPTKKNAKVVSIDGYEDVP-NNENALKKAVAHQPVSIAIEAGGRAFQLYE 281

Query: 276 SGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLC 335
           SGVF G+CG  LDHGV AVGYG+ D+G  YW+V+NSWG  WGENGYIRM+R+I+A  G C
Sbjct: 282 SGVFNGECGLALDHGVVAVGYGSDDNGQDYWIVRNSWGGNWGENGYIRMERNINANTGKC 341

Query: 336 GIAMQASYPT 345
           GIAM+ASYP 
Sbjct: 342 GIAMEASYPV 351


>gi|356553978|ref|XP_003545327.1| PREDICTED: cysteine proteinase RD21a-like [Glycine max]
          Length = 496

 Score =  343 bits (881), Expect = 5e-92,   Method: Compositional matrix adjust.
 Identities = 175/318 (55%), Positives = 226/318 (71%), Gaps = 10/318 (3%)

Query: 31  NDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFA 90
           +D  +   +E W+ ++G+VY    EKE RF+IFK+N+ +I   N++  ++ YKLG+N FA
Sbjct: 71  SDEELMSMYEQWLVKHGKVYNALGEKEKRFQIFKDNLRFIDDHNSQ-EDRTYKLGLNRFA 129

Query: 91  DQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYE---NASVPASIDWRKKGAVTGVKDQG 147
           D TNEE+RA   G K   P+ R  +T   S RY       +P S+DWRK+GAV  VKDQG
Sbjct: 130 DLTNEEYRAKYLGTKID-PNRRLGKTP--SNRYAPRVGDKLPESVDWRKEGAVPPVKDQG 186

Query: 148 QCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISN 207
            CG CWAFSA+ A+EGIN I T +L SLSEQELVDCDT G ++GC GGLMD AFEFII+N
Sbjct: 187 GCGSCWAFSAIGAVEGINKIVTGELISLSEQELVDCDT-GYNEGCNGGLMDYAFEFIINN 245

Query: 208 KGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDAS 267
            G+ +E  YPY+  DG C+    N     I  YEDVP+ +E AL KAVANQPVSVAI+  
Sbjct: 246 GGIDSEEDYPYRGVDGRCDTYRKNAKVVSIDDYEDVPAYDELALKKAVANQPVSVAIEGG 305

Query: 268 GSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRD 327
           G +FQ Y SGVFTG+CGT LDHGV AVGYGTA +G  YW+V+NSWG +WGE+GYIR++R+
Sbjct: 306 GREFQLYVSGVFTGRCGTALDHGVVAVGYGTA-NGHDYWIVRNSWGPSWGEDGYIRLERN 364

Query: 328 I-DAKEGLCGIAMQASYP 344
           + +++ G CGIA++ SYP
Sbjct: 365 LANSRSGKCGIAIEPSYP 382


>gi|357166359|ref|XP_003580684.1| PREDICTED: oryzain alpha chain-like [Brachypodium distachyon]
          Length = 456

 Score =  343 bits (881), Expect = 5e-92,   Method: Compositional matrix adjust.
 Identities = 169/305 (55%), Positives = 213/305 (69%), Gaps = 6/305 (1%)

Query: 42  WMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARN--KPYKLGINEFADQTNEEFRA 99
           WMA+ GR Y    E+E RF++F++N+ Y+   N  A      ++LG+N FAD TNEE+R 
Sbjct: 45  WMAENGRTYNAIGEEERRFEVFRDNLRYVDQHNAAADAGLHSFRLGLNRFADLTNEEYRD 104

Query: 100 PRNGYKRRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVA 159
              G   R   VR    +      +N  +P S+DWR+KGAV  VKDQG CG CWAFSA+A
Sbjct: 105 TYLGV--RTKPVRERRLSGRYQAADNEELPESVDWREKGAVAKVKDQGGCGSCWAFSAIA 162

Query: 160 AMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYK 219
           A+EGIN I T  + +LSEQELVDCDTS  +QGC GGLMD AFEFII+N G+ +E  YPYK
Sbjct: 163 AVEGINQIVTGDMIALSEQELVDCDTS-YNQGCNGGLMDYAFEFIINNGGIDSEEDYPYK 221

Query: 220 ASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVF 279
             D  C+  + N     I GYEDVP N+E +L KAVANQP+SVAI+A G  FQ Y SG+F
Sbjct: 222 ERDNRCDANKKNAKVVTIDGYEDVPVNSELSLKKAVANQPISVAIEAGGRAFQLYKSGIF 281

Query: 280 TGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAM 339
           TG+CGT LDHGVTAVGYG+ ++G  YW+VKNSWGT WGE+GY+R++R+I A  G CGIA+
Sbjct: 282 TGRCGTALDHGVTAVGYGS-ENGKDYWIVKNSWGTVWGEDGYVRLERNIKATSGKCGIAI 340

Query: 340 QASYP 344
           + SYP
Sbjct: 341 EPSYP 345


>gi|357126406|ref|XP_003564878.1| PREDICTED: cysteine proteinase EP-B 1-like [Brachypodium
           distachyon]
          Length = 377

 Score =  343 bits (881), Expect = 6e-92,   Method: Compositional matrix adjust.
 Identities = 174/319 (54%), Positives = 217/319 (68%), Gaps = 13/319 (4%)

Query: 37  ERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKAR------NKP-YKLGINEF 89
           E +  W + +    + +AEK  RF  FK NV +I + N +        N P Y+L +N F
Sbjct: 40  ELYTRWQSAHRLPPQHHAEKHRRFGTFKSNVLFIHAHNTRLNDTSTNNNGPSYRLRLNRF 99

Query: 90  ADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENAS-VPASIDWRKKGAVTGVKDQGQ 148
            D    EFR+   G   R    R +++    F Y+    +P ++DWR+KGAVTGVKDQG+
Sbjct: 100 GDMDQAEFRSTFAGPLHR--HTRPAQSIP-GFIYDTVKDIPQAVDWRQKGAVTGVKDQGK 156

Query: 149 CGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFII-SN 207
           CG CWAFSAVA++EG+N I T  L SLSEQEL+DCDT G+D GC+GGLM+ AFEFI  S 
Sbjct: 157 CGSCWAFSAVASVEGLNAIRTGSLVSLSEQELIDCDTGGDDNGCQGGLMESAFEFIAHSA 216

Query: 208 KGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDAS 267
            GLATEA YPY AS+G+CN    +  + +I G++ VP+ NE AL KAVA+QPVSVAIDA 
Sbjct: 217 GGLATEAAYPYHASNGTCNANRGSSVSVRIDGHQSVPAGNEEALAKAVAHQPVSVAIDAG 276

Query: 268 GSDFQFYSSGVFTGQCGTELDHGVTAVGYGTA-DDGTKYWLVKNSWGTTWGENGYIRMQR 326
           G  FQFYS GVFTG CG+ELDHGV  VGYG A +DG +YW+VKNSWG  WGE+GY+RMQR
Sbjct: 277 GQAFQFYSEGVFTGDCGSELDHGVAVVGYGVAEEDGKEYWIVKNSWGPGWGEHGYVRMQR 336

Query: 327 DIDAKEGLCGIAMQASYPT 345
           D     GLCGIAM+ASYP 
Sbjct: 337 DSGVDGGLCGIAMEASYPV 355


>gi|224065647|ref|XP_002301901.1| predicted protein [Populus trichocarpa]
 gi|222843627|gb|EEE81174.1| predicted protein [Populus trichocarpa]
          Length = 336

 Score =  343 bits (881), Expect = 6e-92,   Method: Compositional matrix adjust.
 Identities = 171/307 (55%), Positives = 214/307 (69%), Gaps = 8/307 (2%)

Query: 40  EMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRA 99
           E W++++G++Y    EK +RF+IFK+N+ +I   N K  N  Y LG+NEF+D ++EEF+ 
Sbjct: 34  ESWISKHGKIYESIEEKWLRFEIFKDNLFHIDETNKKVVN--YWLGLNEFSDLSHEEFKN 91

Query: 100 PRNGYKRRLPSVRSSETTDVSFRYENA-SVPASIDWRKKGAVTGVKDQGQCGCCWAFSAV 158
              G K  +   R        F Y++  S+P S+DWRKKGAVT VK+QG CG CWAFS V
Sbjct: 92  KYLGLKVDMSERRECSQ---EFNYKDVMSIPKSVDWRKKGAVTDVKNQGSCGSCWAFSTV 148

Query: 159 AAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPY 218
           AA+EGIN I T  LTSLSEQELVDCDT+  + GC GGLMD AF +IISN GL  E  YPY
Sbjct: 149 AAVEGINQIVTGNLTSLSEQELVDCDTT-NNYGCNGGLMDYAFSYIISNGGLHKEVDYPY 207

Query: 219 KASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGV 278
              +G+C  ++       ISGY DVP N+E +L+KA+ANQP+SVAI+ASG DFQFYS GV
Sbjct: 208 IMEEGTCEMRKEESEVVTISGYHDVPQNSEESLLKALANQPLSVAIEASGRDFQFYSGGV 267

Query: 279 FTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIA 338
           F G CGT+LDHGV AVGYG+  +G  Y +VKNSWG+ WGE GYIRM+R+     GLCGI 
Sbjct: 268 FDGHCGTQLDHGVAAVGYGST-NGLDYIIVKNSWGSKWGEKGYIRMKRNTGKPAGLCGIN 326

Query: 339 MQASYPT 345
             ASYPT
Sbjct: 327 KMASYPT 333


>gi|30690594|ref|NP_564321.2| cysteine proteinase-like protein [Arabidopsis thaliana]
 gi|28393492|gb|AAO42167.1| putative cysteine proteinase [Arabidopsis thaliana]
 gi|332192920|gb|AEE31041.1| cysteine proteinase-like protein [Arabidopsis thaliana]
          Length = 355

 Score =  343 bits (880), Expect = 6e-92,   Method: Compositional matrix adjust.
 Identities = 180/353 (50%), Positives = 235/353 (66%), Gaps = 14/353 (3%)

Query: 1   MAMILLENKLVLAAILVLGVWAPQSWSR-TLNDATMNERHEMWMAQYGRVYRDNAEKEMR 59
           M  IL    LV   IL + +   Q+ SR T ++  + E H+ WM ++ RVY D  EK+MR
Sbjct: 10  MTSILF--MLVSLTILSMNLKVSQATSRVTFHEPIVAEHHQQWMTRFSRVYSDELEKQMR 67

Query: 60  FKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDV 119
           F +FK+N+++I  FN K  ++ YKLG+NEFAD T EEF A   G K  +  + SSE  D 
Sbjct: 68  FDVFKKNLKFIEKFNKKG-DRTYKLGVNEFADWTREEFIATHTGLKG-VNGIPSSEFVDE 125

Query: 120 SFRYENASVP-----ASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTS 174
                N +V       + DWR +GAVT VK QGQCGCCWAFS+VAA+EG+  I    L S
Sbjct: 126 MIPSWNWNVSDVAGRETKDWRYEGAVTPVKYQGQCGCCWAFSSVAAVEGLTKIVGNNLVS 185

Query: 175 LSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSA 234
           LSEQ+L+DCD    D GC GG+M DAF +II N+G+A+EA YPY+A++G+C +    PSA
Sbjct: 186 LSEQQLLDCDRE-RDNGCNGGIMSDAFSYIIKNRGIASEASYPYQAAEGTC-RYNGKPSA 243

Query: 235 AKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVF-TGQCGTELDHGVTA 293
             I G++ VPSNNE AL++AV+ QPVSV+IDA G  F  YS GV+    CGT ++H VT 
Sbjct: 244 W-IRGFQTVPSNNERALLEAVSKQPVSVSIDADGPGFMHYSGGVYDEPYCGTNVNHAVTF 302

Query: 294 VGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPTA 346
           VGYGT+ +G KYWL KNSWG TWGENGYIR++RD+   +G+CG+A  A YP A
Sbjct: 303 VGYGTSPEGIKYWLAKNSWGETWGENGYIRIRRDVAWPQGMCGVAQYAFYPVA 355


>gi|255567869|ref|XP_002524912.1| cysteine protease, putative [Ricinus communis]
 gi|223535747|gb|EEF37409.1| cysteine protease, putative [Ricinus communis]
          Length = 366

 Score =  343 bits (880), Expect = 6e-92,   Method: Compositional matrix adjust.
 Identities = 170/307 (55%), Positives = 219/307 (71%), Gaps = 8/307 (2%)

Query: 42  WMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPR 101
           W+A++ + Y    E+E RF+IFK N+ +I   NN ++N+ YK+G+  FAD TNEE+RA  
Sbjct: 51  WLAKHSKTYNKLGEREKRFEIFKNNLRFIDEHNN-SKNRTYKVGLTRFADLTNEEYRAKF 109

Query: 102 NGYKRRLPSVRSSETTDVSFRYENAS---VPASIDWRKKGAVTGVKDQGQCGCCWAFSAV 158
            G K   P  R  ++ + S RY   +   +P SIDWR+ GAV+ +KDQG CG CWAFS +
Sbjct: 110 LGTKSD-PKRRLMKSKNPSQRYAFKAGDVLPESIDWRQSGAVSAIKDQGSCGSCWAFSTI 168

Query: 159 AAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPY 218
           AA+EG+N I T +L SLSEQELVDCD S  + GC GGLMD+AF+FII+N G+ T+  YPY
Sbjct: 169 AAVEGVNKIVTGELISLSEQELVDCDRS-YNAGCNGGLMDNAFQFIINNGGIDTDKDYPY 227

Query: 219 KASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGV 278
           +A DG C+  +    A  I G+EDV + +E AL KAVA+QPVSVAI+ASG   QFY SGV
Sbjct: 228 QAVDGKCDTTKVKNKAVTIDGFEDVMAFDEMALQKAVAHQPVSVAIEASGMALQFYQSGV 287

Query: 279 FTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRD-IDAKEGLCGI 337
           FTG+CG+ LDHGV  VGYGT +DG  YWLV+NSWG  WGENGYI+MQR+ +D   G CGI
Sbjct: 288 FTGECGSALDHGVVIVGYGT-EDGIDYWLVRNSWGRDWGENGYIKMQRNVVDTFTGKCGI 346

Query: 338 AMQASYP 344
           AM++SYP
Sbjct: 347 AMESSYP 353


>gi|220983358|dbj|BAH11164.1| cysteine protease [Hordeum vulgare]
          Length = 462

 Score =  343 bits (880), Expect = 8e-92,   Method: Compositional matrix adjust.
 Identities = 173/309 (55%), Positives = 212/309 (68%), Gaps = 12/309 (3%)

Query: 42  WMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARN--KPYKLGINEFADQTNEEFRA 99
           WMA++   Y    E+E RF+ F+ N+ YI   N  A      ++LG+N FAD TNEE+R+
Sbjct: 45  WMAEHHSTYNPIGEEERRFEAFRNNLRYIDQHNAAADAGVHSFRLGLNRFADLTNEEYRS 104

Query: 100 PRNGYKRRLPSVRSSETTDVSFRYE---NASVPASIDWRKKGAVTGVKDQGQCGCCWAFS 156
              G + +    R      +S RY+   N  +P S+DWRKKGAV  VKDQG CG CWAFS
Sbjct: 105 TYLGARTKPDRERK-----LSARYQAADNDELPESVDWRKKGAVGAVKDQGGCGSCWAFS 159

Query: 157 AVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKY 216
           A+AA+EGIN I T  +  LSEQELVDCDTS  +QGC GGLMD AFEFII+N G+ +E  Y
Sbjct: 160 AIAAVEGINQIVTGDMIPLSEQELVDCDTS-YNQGCNGGLMDYAFEFIINNGGIDSEEDY 218

Query: 217 PYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSS 276
           PYK  D  C+  + N     I GYEDVP N+E +L KAVANQP+SVAI+A G  FQ Y S
Sbjct: 219 PYKERDNRCDANKKNAKVVTIDGYEDVPVNSEKSLQKAVANQPISVAIEAGGRAFQLYKS 278

Query: 277 GVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCG 336
           G+FTG CGT LDHGV AVGYGT ++G  YWLV+NSWG+ WGENGYIRM+R+I A  G CG
Sbjct: 279 GIFTGTCGTALDHGVAAVGYGT-ENGKDYWLVRNSWGSVWGENGYIRMERNIKASSGKCG 337

Query: 337 IAMQASYPT 345
           IA++ SYPT
Sbjct: 338 IAVEPSYPT 346


>gi|242074728|ref|XP_002447300.1| hypothetical protein SORBIDRAFT_06g032360 [Sorghum bicolor]
 gi|241938483|gb|EES11628.1| hypothetical protein SORBIDRAFT_06g032360 [Sorghum bicolor]
          Length = 471

 Score =  343 bits (880), Expect = 8e-92,   Method: Compositional matrix adjust.
 Identities = 167/324 (51%), Positives = 222/324 (68%), Gaps = 17/324 (5%)

Query: 31  NDATMNERHEMWMAQYGRVYRDN-AEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEF 89
            +A +   +E WMA++G+   +   E + RF+ F +N+ ++ + N +A  + Y+LGIN F
Sbjct: 44  TEAQVRAMYEQWMARHGKAASNALGEHDRRFRAFWDNLRFVDAHNARAGARGYRLGINRF 103

Query: 90  ADQTNEEFRAP------RNGYKRRLPSVRSSETTDVSFRYENA-SVPASIDWRKKGAVTG 142
           AD TN EFRA       RNG         ++  T   +R++   ++P  +DWR+KGAV  
Sbjct: 104 ADLTNAEFRAAYLSAGARNGT--------ATAATGERYRHDGVEALPEFVDWRQKGAVAP 155

Query: 143 VKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFE 202
           VK+QGQCG CWAFSAV A+EGIN I T +L +LSEQELVDC  +G++ GC+GG+MDDAF 
Sbjct: 156 VKNQGQCGSCWAFSAVGAVEGINQIVTGELVTLSEQELVDCSKNGQNGGCDGGMMDDAFA 215

Query: 203 FIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSV 262
           FI+ N G+ T+  YPY A DG C+  + +     I G+E VP N+E +L KAVA+QPV+V
Sbjct: 216 FIVGNGGIDTDKDYPYTARDGKCDVAKRSRHVVSIDGFEGVPRNDEKSLQKAVAHQPVAV 275

Query: 263 AIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGT-ADDGTKYWLVKNSWGTTWGENGY 321
           AI+A G +FQ Y SGVFTG+CGT LDHGV AVGYGT AD G  YWLV+NSWG  WGE GY
Sbjct: 276 AIEAGGREFQLYQSGVFTGRCGTSLDHGVVAVGYGTEADGGRDYWLVRNSWGADWGEGGY 335

Query: 322 IRMQRDIDAKEGLCGIAMQASYPT 345
           IRM+R++ A+ G CGIAM+ASYP 
Sbjct: 336 IRMERNVGARAGKCGIAMEASYPV 359


>gi|359491865|ref|XP_002273243.2| PREDICTED: xylem cysteine proteinase 1-like [Vitis vinifera]
          Length = 351

 Score =  343 bits (879), Expect = 8e-92,   Method: Compositional matrix adjust.
 Identities = 179/357 (50%), Positives = 232/357 (64%), Gaps = 21/357 (5%)

Query: 1   MAMILLENKLVLAAILVLGVWAPQSWSRTL-------NDAT----MNERHEMWMAQYGRV 49
           MA+    N  +L   + + V+A  +++R         +D T    + +  E WM+++G+ 
Sbjct: 1   MALSPFSNFFLL--FISMAVFAYSAFARDFSIVGYSPDDLTSMDKLTDLFESWMSKHGKS 58

Query: 50  YRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLP 109
           YR   EK  RF++F++N+++I   N K  +  Y LG+NEFAD ++EEF+    G K  LP
Sbjct: 59  YRSFEEKLHRFEVFQDNLKHIDETNKKVSS--YWLGLNEFADLSHEEFKRKYLGLKIELP 116

Query: 110 SVRSSETTDVSFRYEN-ASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHIT 168
             R S      F Y++ A +P S+DWRKKGAV  VK+QG CG CWAFS VAA+EGIN I 
Sbjct: 117 KRRDSPE---EFSYKDVADLPKSVDWRKKGAVAHVKNQGACGSCWAFSTVAAVEGINQIV 173

Query: 169 TRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKK 228
           T  LT+LSEQEL+DCD    + GC GGLMD AF FIISN GL  E  YPY   +G+C +K
Sbjct: 174 TGNLTALSEQELIDCDKPF-NNGCNGGLMDYAFAFIISNGGLRKEEDYPYVMEEGTCGEK 232

Query: 229 EANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELD 288
           +       ISGY DVP +NE + +KA+ANQP+SVAI+AS   FQFYS G+F G CGTELD
Sbjct: 233 KEELEVVTISGYHDVPEDNEQSFLKALANQPLSVAIEASSRGFQFYSGGIFNGHCGTELD 292

Query: 289 HGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPT 345
           HGV AVGYGT+  G  Y  VKNSWG+ WGE GYIRM+R++   EG+CGI   ASYPT
Sbjct: 293 HGVAAVGYGTS-KGVDYITVKNSWGSKWGEKGYIRMKRNVGKPEGICGIYKMASYPT 348


>gi|356563584|ref|XP_003550041.1| PREDICTED: cysteine proteinase RD21a-like [Glycine max]
          Length = 366

 Score =  343 bits (879), Expect = 9e-92,   Method: Compositional matrix adjust.
 Identities = 178/356 (50%), Positives = 232/356 (65%), Gaps = 22/356 (6%)

Query: 1   MAMILLENKLVLAAILVLGVWAPQSW-SRTLNDATMNE---RHEMWMAQYGRVYRDNAEK 56
           MA I+    L+++ +L L      +  + T+ + T NE    +E W+ ++ +VY    EK
Sbjct: 1   MASIM---TLMISTLLFLSFTLSCAIDTSTITNYTDNEVMTMYEEWLVKHQKVYNGLGEK 57

Query: 57  EMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRA----PRNGYKRRLPSVR 112
           + RF++FK+N+ +I   NN  +N  YKLG+N+FAD TNEE+R      ++  KRRL   +
Sbjct: 58  DKRFQVFKDNLGFIQEHNNN-QNNTYKLGLNKFADMTNEEYRVMYFGTKSDAKRRLMKTK 116

Query: 113 SSETTDVSFRYENAS---VPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITT 169
           S+       RY  ++   +P  +DWR KGAV  +KDQG CG CWAFS VA +E IN I T
Sbjct: 117 ST-----GHRYAYSAGDQLPVHVDWRVKGAVAPIKDQGSCGSCWAFSTVATVEAINKIVT 171

Query: 170 RKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKE 229
            K  SLSEQELVDCD +  +QGC GGLMD AFEFII N G+ T+  YPY+  DG C+  +
Sbjct: 172 GKFVSLSEQELVDCDRA-YNQGCNGGLMDYAFEFIIQNGGIDTDKDYPYRGFDGICDPTK 230

Query: 230 ANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDH 289
            N  A  I GYEDVP  +E AL KAVA QPVS+AI+ASG   Q Y SGVFTG+CGT LDH
Sbjct: 231 KNAKAVNIDGYEDVPPYDENALKKAVARQPVSIAIEASGRALQLYQSGVFTGECGTSLDH 290

Query: 290 GVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPT 345
           GV  VGYG +++G  YWLV+NSWGT WGE+GY +MQR++    G CGI M+ASYP 
Sbjct: 291 GVVVVGYG-SENGVDYWLVRNSWGTGWGEDGYFKMQRNVRTPTGKCGITMEASYPV 345


>gi|111073717|dbj|BAF02547.1| triticain beta [Triticum aestivum]
          Length = 472

 Score =  343 bits (879), Expect = 9e-92,   Method: Compositional matrix adjust.
 Identities = 167/314 (53%), Positives = 222/314 (70%), Gaps = 10/314 (3%)

Query: 39  HEMWMAQYGRVYRDNA----EKEMRFKIFKENVEYIASFNNKAR--NKPYKLGINEFADQ 92
           +++W+A+ G     NA    E+E RF+ F +N+ ++ + N +A    + Y+LG+N FAD 
Sbjct: 53  YDLWLAENGGGSSPNANSIPERERRFRAFWDNLNFVDAHNARAAAGEEGYRLGMNRFADL 112

Query: 93  TNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENAS-VPASIDWRKKGAVTGVKDQGQCGC 151
           TN+EFRA   G K +    R        +R++ A  +P ++DWR+KGAV  VK+QGQCG 
Sbjct: 113 TNDEFRAAYLGVKAQ--RARPGRMVGERYRHDGAEELPEAVDWREKGAVAPVKNQGQCGS 170

Query: 152 CWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLA 211
           CWAFSAV+ +E IN I T ++ +LSEQELV+CDT+G+  GC GGLMDDAFEFII N G+ 
Sbjct: 171 CWAFSAVSTVESINQIVTGEMVTLSEQELVECDTNGQSSGCNGGLMDDAFEFIIKNGGID 230

Query: 212 TEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDF 271
           TE  YPYKA DG C+    N     I G+EDVP N+E +L KAVA+QPVSVAI+A G +F
Sbjct: 231 TEDDYPYKAIDGRCDVLRKNAKVVSIDGFEDVPENDEKSLQKAVAHQPVSVAIEAGGREF 290

Query: 272 QFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAK 331
           Q Y SGVF+G+CGT+LDHGV AVGYGT ++G  YW+V+NSWG  WGE+GY+RM+R+I+  
Sbjct: 291 QLYHSGVFSGRCGTQLDHGVVAVGYGT-ENGKDYWIVRNSWGPNWGESGYLRMERNINVT 349

Query: 332 EGLCGIAMQASYPT 345
            G CGIAM +SYPT
Sbjct: 350 SGKCGIAMMSSYPT 363


>gi|255538210|ref|XP_002510170.1| cysteine protease, putative [Ricinus communis]
 gi|223550871|gb|EEF52357.1| cysteine protease, putative [Ricinus communis]
          Length = 469

 Score =  342 bits (878), Expect = 1e-91,   Method: Compositional matrix adjust.
 Identities = 169/309 (54%), Positives = 219/309 (70%), Gaps = 7/309 (2%)

Query: 39  HEMWMAQYGRVYRDN---AEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNE 95
           +E W+ + G+ + +N    EKE RF++FK+N+ +I   N++  N+ YK+G+N FAD TNE
Sbjct: 51  YEEWLVKNGKAHSNNNALGEKERRFQVFKDNLRFIDEHNSE--NRSYKVGLNRFADLTNE 108

Query: 96  EFRAPRNGYKRRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQCGCCWAF 155
           E+R+   G +      R S +++        S+P S+DWRK+GAV  VKDQG CG CWAF
Sbjct: 109 EYRSMYLGARSGAKRNRLSRSSNRYLPRVGDSLPDSVDWRKEGAVAEVKDQGSCGSCWAF 168

Query: 156 SAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAK 215
           S +AA+EGIN I T  L SLSEQELVDCD S  ++GC GGLMD AF+FII+N G+ +E  
Sbjct: 169 STIAAVEGINKIVTGDLISLSEQELVDCDRS-YNEGCNGGLMDYAFQFIINNGGIDSEED 227

Query: 216 YPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYS 275
           YPY A DG+C+    N     I  YEDVP N+E AL KAVANQPVSVAI+A G +FQFY 
Sbjct: 228 YPYLARDGTCDTYRKNAKVVTIDNYEDVPVNDEKALQKAVANQPVSVAIEAGGREFQFYQ 287

Query: 276 SGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLC 335
           SG+FTG+CGT LDHGV AVGYGT ++G  YW+V+NSWG +WGE+GYIRM+R+I    G C
Sbjct: 288 SGIFTGRCGTALDHGVAAVGYGT-ENGKDYWIVRNSWGKSWGESGYIRMERNIATATGKC 346

Query: 336 GIAMQASYP 344
           GIA++ SYP
Sbjct: 347 GIAIEPSYP 355


>gi|388497270|gb|AFK36701.1| unknown [Lotus japonicus]
          Length = 343

 Score =  342 bits (878), Expect = 1e-91,   Method: Compositional matrix adjust.
 Identities = 176/339 (51%), Positives = 226/339 (66%), Gaps = 7/339 (2%)

Query: 11  VLAAILVLGVWAPQSWSRTLNDAT---MNERHEMWMAQYGRVYRDNAEKEMRFKIFKENV 67
           ++A   +L   A  + SRTL D T   + + H+ WM QYGR Y ++AE E RFKIF EN+
Sbjct: 7   IIALCTMLWACAYTAMSRTLYDETSSVVAKTHQQWMLQYGRSYTNDAEMEKRFKIFMENL 66

Query: 68  EYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENAS 127
           EYI  FNN   NK YKL +N+F+D TNEEF A   G         SS         + + 
Sbjct: 67  EYIEKFNNAPGNKSYKLDLNQFSDLTNEEFIASHTGLMIDPSKPSSSSKRASPASLDLSD 126

Query: 128 VPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSG 187
            P S+DWR++GAVT VK+QG CG CWAFSAVAA+EGI  I    L SLSEQ+LVDC ++ 
Sbjct: 127 TPTSLDWREQGAVTDVKNQGNCGSCWAFSAVAAVEGIVKIKNGNLISLSEQQLVDCASNE 186

Query: 188 EDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNN 247
           ++QGC GG MD+AF +I  N G+A+E  Y Y+   G+C   E    AA+ISGYEDVP+  
Sbjct: 187 QNQGCGGGFMDNAFSYITEN-GIASENDYQYRGGAGTCQNNEMITPAARISGYEDVPA-G 244

Query: 248 EAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTA-DDGTKYW 306
           E  L+ AV+ QPVSVAI A G  F  Y  G+++G CG+ L+HGVT VGYGT+ +DGTKYW
Sbjct: 245 EDQLLLAVSQQPVSVAI-AVGQSFHLYKEGIYSGPCGSSLNHGVTLVGYGTSEEDGTKYW 303

Query: 307 LVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPT 345
           L+KNSWG +WGENGY+R+ R+    EG CGIA++AS+PT
Sbjct: 304 LIKNSWGESWGENGYMRLLRESGQSEGHCGIAVKASHPT 342


>gi|297733654|emb|CBI14901.3| unnamed protein product [Vitis vinifera]
          Length = 273

 Score =  342 bits (878), Expect = 1e-91,   Method: Compositional matrix adjust.
 Identities = 170/256 (66%), Positives = 190/256 (74%), Gaps = 7/256 (2%)

Query: 93  TNEEFRAPRNGYK---RRLPSVRSSETTDVSFRYENA-SVPASIDWRKKGAVTGVKDQGQ 148
           TN EFR+   G K    R+   R S+    SF YE   SVP S+DWRKKGAVT +KDQGQ
Sbjct: 2   TNHEFRSTYAGSKVNHHRM--FRGSQHAAGSFMYEKVKSVPPSVDWRKKGAVTPIKDQGQ 59

Query: 149 CGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNK 208
           CG CWAFS V A+EGINHI T KL SLSEQELVDCDTS E+QGC GGLM  AFEFI    
Sbjct: 60  CGSCWAFSTVVAVEGINHIKTNKLVSLSEQELVDCDTS-ENQGCNGGLMGYAFEFIKEKG 118

Query: 209 GLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASG 268
           G+ TE  YPY A DG+C+  + N     I G+E VP NNE AL+KA ANQP+SVAIDA G
Sbjct: 119 GITTEQSYPYTAEDGTCDVSKVNSPVVSIDGHETVPPNNEDALLKAAANQPISVAIDAGG 178

Query: 269 SDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDI 328
           S FQFYS GVF G+CGT+LDHGV  VGYGT  DGTKYW+VKNSWGT WGENGYIRM+R I
Sbjct: 179 SAFQFYSEGVFAGRCGTDLDHGVAIVGYGTTLDGTKYWIVKNSWGTDWGENGYIRMKRGI 238

Query: 329 DAKEGLCGIAMQASYP 344
            AKEGLCGIA++ASYP
Sbjct: 239 SAKEGLCGIAVEASYP 254


>gi|356559055|ref|XP_003547817.1| PREDICTED: cysteine proteinase RD21a [Glycine max]
          Length = 366

 Score =  342 bits (877), Expect = 1e-91,   Method: Compositional matrix adjust.
 Identities = 172/351 (49%), Positives = 224/351 (63%), Gaps = 15/351 (4%)

Query: 2   AMILLENKLVLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFK 61
           ++I L    +L     L      S      D  +   +E W+ ++ +VY    EK+ RF+
Sbjct: 3   SIITLVTSTLLFLSFTLSCAIDTSTITNYTDNEVMTMYEEWLVKHQKVYNGLREKDKRFQ 62

Query: 62  IFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRA----PRNGYKRRLPSVRSSETT 117
           +FK+N+ +I   NN  +N  YKLG+N+FAD TNEE+R      ++  KRRL   +S+   
Sbjct: 63  VFKDNLGFIQEHNNN-QNNTYKLGLNQFADMTNEEYRVMYFGTKSDAKRRLMKTKST--- 118

Query: 118 DVSFRYENAS---VPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTS 174
               RY  ++   +P  +DWR KGAV  +KDQG CG CWAFS VA +E IN I T K  S
Sbjct: 119 --GHRYAYSAGDRLPVHVDWRVKGAVAPIKDQGSCGSCWAFSTVATVEAINKIVTGKFVS 176

Query: 175 LSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSA 234
           LSEQELVDCD +  ++GC GGLMD AFEFII N G+ T+  YPY+  DG C+  + N   
Sbjct: 177 LSEQELVDCDRA-YNEGCNGGLMDYAFEFIIQNGGIDTDKDYPYRGFDGICDPTKKNAKV 235

Query: 235 AKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAV 294
             I G+EDVP  +E AL KAVA+QPVS+AI+ASG D Q Y SGVFTG+CGT LDHGV  V
Sbjct: 236 VNIDGFEDVPPYDENALKKAVAHQPVSIAIEASGRDLQLYQSGVFTGKCGTSLDHGVVVV 295

Query: 295 GYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPT 345
           GYG +++G  YWLV+NSWGT WGE+GY +MQR++    G CGI M+ASYP 
Sbjct: 296 GYG-SENGVDYWLVRNSWGTGWGEDGYFKMQRNVRTPTGKCGITMEASYPV 345


>gi|171702831|dbj|BAG16371.1| cysteine protease [Brassica oleracea var. italica]
          Length = 441

 Score =  342 bits (877), Expect = 2e-91,   Method: Compositional matrix adjust.
 Identities = 170/348 (48%), Positives = 236/348 (67%), Gaps = 16/348 (4%)

Query: 3   MILLENKLVLAAILVLGVWAPQSWSRTLN---DATMNERHEMWMAQYGRVYRDNAEKEMR 59
           +IL    +V+++ + + + +      T++   DA ++  +E W+ ++G+      EK+ R
Sbjct: 3   VILFLTMIVVSSAMDMSIISYDKNHHTVSSRSDAEVSRLYEEWLVKHGKAQNSLTEKDRR 62

Query: 60  FKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDV 119
           F+IFK+N+ +I   N K  N  Y+LG+ +FAD TN+E+R+   G + +  + +SS     
Sbjct: 63  FEIFKDNLRFIDEHNGK--NLSYRLGLTKFADLTNDEYRSMYLGSRLKRKATKSS----- 115

Query: 120 SFRYE---NASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLS 176
             RYE     ++P S+DWRK+GAV  VKDQG CG CWAFS + A+EGIN I T  L +LS
Sbjct: 116 -LRYEVRVGDAIPESVDWRKEGAVAEVKDQGSCGSCWAFSTIGAVEGINKIVTGDLITLS 174

Query: 177 EQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAK 236
           EQELVDCDTS  ++GC GGLMD AFEFII+N G+ TE  YPYK  DG C++   N     
Sbjct: 175 EQELVDCDTS-YNEGCNGGLMDYAFEFIINNGGIDTEEDYPYKGVDGRCDQTRKNAKVVT 233

Query: 237 ISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGY 296
           I  YEDVP+N+E +L KA+++QP+SVAI+  G  FQ Y SG+F G CGT+LDHGV AVGY
Sbjct: 234 IDLYEDVPANSEESLKKALSHQPISVAIEGGGRAFQLYDSGIFDGICGTDLDHGVVAVGY 293

Query: 297 GTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
           GT ++G  YW+VKNSWGT+WGE+GYIRM+R+I +  G CGIA++ SYP
Sbjct: 294 GT-ENGKDYWIVKNSWGTSWGESGYIRMERNIASSAGKCGIAVEPSYP 340


>gi|255557851|ref|XP_002519955.1| cysteine protease, putative [Ricinus communis]
 gi|223541001|gb|EEF42559.1| cysteine protease, putative [Ricinus communis]
          Length = 321

 Score =  342 bits (877), Expect = 2e-91,   Method: Compositional matrix adjust.
 Identities = 178/343 (51%), Positives = 231/343 (67%), Gaps = 31/343 (9%)

Query: 6   LENKLVLAAILVLGVWAPQSWSRTL-NDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFK 64
           LE KL +A ++V   WA Q+ +R L N+  + E+HE WMA++GR Y+D+ EKE RF+IFK
Sbjct: 5   LEKKLAIALLVVFSTWASQAMARQLINEDALVEKHEQWMARHGRTYQDSEEKERRFQIFK 64

Query: 65  ENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYE 124
            N+EYI +FN KA N+ Y+LG+N FAD ++EE+ A      R++P               
Sbjct: 65  SNLEYIDNFN-KASNQTYQLGLNNFADLSHEEYVATYTA--RKMP--------------- 106

Query: 125 NASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCD 184
              VP SIDWR  GAVT +K+Q QCGCCWAFSA AA+EGI         SLS Q+L+DC 
Sbjct: 107 -VEVPESIDWRDHGAVTPIKNQYQCGCCWAFSAAAAVEGI----VANGVSLSAQQLLDCV 161

Query: 185 TSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVP 244
           +  ++QGC+GG M++AF +II N+G+A E  YPY+     C+ + A   AA+ISG+EDV 
Sbjct: 162 S--DNQGCKGGWMNNAFNYIIQNQGIALETDYPYQQMQQMCSSRMA---AAQISGFEDVT 216

Query: 245 SNNEAALMKAVANQPVSVAIDA-SGSDFQFYSSGVFTGQ-CGTELDHGVTAVGYGTADDG 302
             +E ALM+AVA QPVSV IDA S  +F+ Y  GVFT   CG    H VT VGYGT++DG
Sbjct: 217 PKDEEALMRAVAKQPVSVTIDATSNPNFKLYKEGVFTAAGCGNGHSHAVTLVGYGTSEDG 276

Query: 303 TKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPT 345
           TKYWL KNSWG TWGE+GY+R+QRDI  + G CGIA+ ASYPT
Sbjct: 277 TKYWLAKNSWGETWGESGYMRLQRDIGLEGGPCGIALYASYPT 319


>gi|18418684|ref|NP_567983.1| Xylem cysteine proteinase 1 [Arabidopsis thaliana]
 gi|71153408|sp|O65493.1|XCP1_ARATH RecName: Full=Xylem cysteine proteinase 1; Short=AtXCP1; Flags:
           Precursor
 gi|6708181|gb|AAF25831.1|AF191027_1 papain-type cysteine endopeptidase XCP1 [Arabidopsis thaliana]
 gi|3080415|emb|CAA18734.1| cysteine proteinase-like protein [Arabidopsis thaliana]
 gi|7270487|emb|CAB80252.1| cysteine proteinase-like protein [Arabidopsis thaliana]
 gi|26449881|dbj|BAC42063.1| putative cysteine proteinase [Arabidopsis thaliana]
 gi|28827736|gb|AAO50712.1| unknown protein [Arabidopsis thaliana]
 gi|332661101|gb|AEE86501.1| Xylem cysteine proteinase 1 [Arabidopsis thaliana]
          Length = 355

 Score =  342 bits (877), Expect = 2e-91,   Method: Compositional matrix adjust.
 Identities = 170/316 (53%), Positives = 220/316 (69%), Gaps = 7/316 (2%)

Query: 31  NDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFA 90
           N   + E  E WM+++ + Y+   EK  RF++F+EN+ +I   NN+  +  Y LG+NEFA
Sbjct: 43  NTDKLLELFESWMSEHSKAYKSVEEKVHRFEVFRENLMHIDQRNNEINS--YWLGLNEFA 100

Query: 91  DQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENAS-VPASIDWRKKGAVTGVKDQGQC 149
           D T+EEF+    G  +  P          +FRY + + +P S+DWRKKGAV  VKDQGQC
Sbjct: 101 DLTHEEFKGRYLGLAK--PQFSRKRQPSANFRYRDITDLPKSVDWRKKGAVAPVKDQGQC 158

Query: 150 GCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKG 209
           G CWAFS VAA+EGIN ITT  L+SLSEQEL+DCDT+  + GC GGLMD AF++IIS  G
Sbjct: 159 GSCWAFSTVAAVEGINQITTGNLSSLSEQELIDCDTTF-NSGCNGGLMDYAFQYIISTGG 217

Query: 210 LATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGS 269
           L  E  YPY   +G C +++ +     ISGYEDVP N++ +L+KA+A+QPVSVAI+ASG 
Sbjct: 218 LHKEDDYPYLMEEGICQEQKEDVERVTISGYEDVPENDDESLVKALAHQPVSVAIEASGR 277

Query: 270 DFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDID 329
           DFQFY  GVF G+CGT+LDHGV AVGYG++  G+ Y +VKNSWG  WGE G+IRM+R+  
Sbjct: 278 DFQFYKGGVFNGKCGTDLDHGVAAVGYGSS-KGSDYVIVKNSWGPRWGEKGFIRMKRNTG 336

Query: 330 AKEGLCGIAMQASYPT 345
             EGLCGI   ASYPT
Sbjct: 337 KPEGLCGINKMASYPT 352


>gi|45738078|gb|AAS75836.1| fastuosain precursor [Bromelia fastuosa]
          Length = 324

 Score =  342 bits (876), Expect = 2e-91,   Method: Compositional matrix adjust.
 Identities = 173/315 (54%), Positives = 212/315 (67%), Gaps = 16/315 (5%)

Query: 35  MNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTN 94
           M ER E WMA+YGRVY DNAEK  RF+IFK NV +I +FNN++ N  Y LG+N+F D TN
Sbjct: 6   MMERFEEWMAEYGRVYNDNAEKMRRFQIFKNNVNHIETFNNRSGNS-YTLGVNQFTDMTN 64

Query: 95  EEFRAPRNGYKRRL----PSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQCG 150
            EF A   G    L      V S +  D+S      +VP SIDWR  GAVT VK+QG CG
Sbjct: 65  NEFLARYTGASLPLNIERDPVVSFDDVDIS------AVPQSIDWRDYGAVTSVKNQGSCG 118

Query: 151 CCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGL 210
            CWAFSA+A +EGI  I    L SLSEQE++DC  S    GC+GG ++ A++FIISN G+
Sbjct: 119 SCWAFSAIATVEGIYKIKAGNLISLSEQEVLDCALS---YGCDGGWVNKAYDFIISNNGV 175

Query: 211 ATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSD 270
            + A  PYK   G CN  +  P+ A I+GY  V SNNE ++M AVANQP++  IDA G D
Sbjct: 176 TSFANLPYKGYKGPCNHNDL-PNKAYITGYTYVQSNNERSMMIAVANQPIAALIDAGG-D 233

Query: 271 FQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDA 330
           FQ+Y SGVFTG CGT L+H +T +GYG    GTKYW+VKNSWGT+WGE GYIRM RD+ +
Sbjct: 234 FQYYKSGVFTGSCGTSLNHAITVIGYGQTSSGTKYWIVKNSWGTSWGERGYIRMARDVSS 293

Query: 331 KEGLCGIAMQASYPT 345
             GLCGIAM   +PT
Sbjct: 294 PYGLCGIAMAPLFPT 308


>gi|146215996|gb|ABQ10200.1| cysteine protease Cp2 [Actinidia deliciosa]
          Length = 376

 Score =  342 bits (876), Expect = 2e-91,   Method: Compositional matrix adjust.
 Identities = 173/330 (52%), Positives = 227/330 (68%), Gaps = 17/330 (5%)

Query: 23  PQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPY 82
           P   S +  D  +   +  W+A++G+ Y    E+E RF+IFK+N++++   N++  N+ Y
Sbjct: 31  PNHKSSSRTDEEVMGIYAEWLAKHGKAYNGIGERERRFEIFKDNLKFVDEHNSE--NRSY 88

Query: 83  KLGINEFADQTNEEFRA----PRNGYKRRLPSVRSSETTDVSFRY---ENASVPASIDWR 135
           K+G+N FAD TNEE+R+     +   KRR    +S+     S RY   ++  +P S+DWR
Sbjct: 89  KVGLNRFADLTNEEYRSMFLGTKTDSKRRFMKSKSA-----SRRYAVQDSDMLPESVDWR 143

Query: 136 KKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGG 195
           + GAV  +KDQG CG CWAFS VAA+EG+N I T ++  LSEQELVDCD +  D GC GG
Sbjct: 144 ESGAVAPIKDQGSCGSCWAFSTVAAVEGVNQIATGEMIQLSEQELVDCDRT-YDAGCNGG 202

Query: 196 LMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAV 255
           LMD AFEFII+N G+ TE  YPY+  DG+C+ +  N     I+ YEDVP  +E AL KAV
Sbjct: 203 LMDYAFEFIINNGGIDTEEDYPYRGVDGTCDPERKNTKVVSINDYEDVPPYDEMALKKAV 262

Query: 256 ANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTT 315
           A+QPVSVAI+ASG  FQ Y SGVFTG+CG  LDHGV  VGYGT D+G  +W+V+NSWGT+
Sbjct: 263 AHQPVSVAIEASGRAFQLYLSGVFTGECGRALDHGVVVVGYGT-DNGADHWIVRNSWGTS 321

Query: 316 WGENGYIRMQRD-IDAKEGLCGIAMQASYP 344
           WGENGYIRM+R+ +D   G CGIAMQASYP
Sbjct: 322 WGENGYIRMERNVVDNFGGKCGIAMQASYP 351


>gi|30141025|dbj|BAC75926.1| cysteine protease-4 [Helianthus annuus]
          Length = 352

 Score =  342 bits (876), Expect = 2e-91,   Method: Compositional matrix adjust.
 Identities = 168/306 (54%), Positives = 212/306 (69%), Gaps = 6/306 (1%)

Query: 40  EMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRA 99
           E W+A++ ++Y    EK  RF+IF +N+++I   N K  N  Y LG+NEFAD T+EEF+ 
Sbjct: 50  ESWLAKHSKIYESLDEKLHRFEIFMDNLKHIDDTNKKVSN--YWLGLNEFADLTHEEFKN 107

Query: 100 PRNGYKRRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVA 159
              G K  LP  +     + S+R +   +P S+DWRKKGAV  VK+QGQCG CWAFS VA
Sbjct: 108 KFLGLKGELPERKDESIEEFSYR-DFVDLPKSVDWRKKGAVAPVKNQGQCGSCWAFSTVA 166

Query: 160 AMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYK 219
           A+EGIN I T  LT LSEQEL+DCDT+  + GC GGLMD AF +++ + GL  E +YPY 
Sbjct: 167 AVEGINQIVTGNLTMLSEQELIDCDTTF-NNGCNGGLMDYAFAYVMRS-GLHKEEEYPYI 224

Query: 220 ASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVF 279
            S+G+C++K+       ISGY DVP NNE + +KA+ANQP+SVAI+ASG DFQFYS GVF
Sbjct: 225 MSEGTCDEKKDVSETVTISGYHDVPRNNEDSFLKALANQPISVAIEASGRDFQFYSGGVF 284

Query: 280 TGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAM 339
            G CGTELDHGV AVGYGT   G  Y +V+NSWG  WGE GYIRM+R      G+CG+ M
Sbjct: 285 DGHCGTELDHGVAAVGYGTT-KGLDYVIVRNSWGPKWGEKGYIRMKRKTGKPHGMCGLYM 343

Query: 340 QASYPT 345
            ASYPT
Sbjct: 344 MASYPT 349


>gi|255646767|gb|ACU23856.1| unknown [Glycine max]
          Length = 350

 Score =  342 bits (876), Expect = 2e-91,   Method: Compositional matrix adjust.
 Identities = 170/309 (55%), Positives = 210/309 (67%), Gaps = 7/309 (2%)

Query: 37  ERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEE 96
           E  E WM+++G++Y +  EK +RF+IFK+N+++I   N    N  Y LG++EFAD ++ E
Sbjct: 46  ELFESWMSRHGKIYENIEEKLLRFEIFKDNLKHIDERNKVVSN--YWLGLSEFADLSHRE 103

Query: 97  FRAPRNGYKRRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQCGCCWAFS 156
           F     G K      R S      F Y++  +P S+DWRKKGAV  VK+QG CG CWAFS
Sbjct: 104 FNNKYLGLKVDYSRRRESPE---EFTYKDVELPKSVDWRKKGAVAPVKNQGSCGSCWAFS 160

Query: 157 AVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKY 216
            VAA+EGIN I T  LTSLSEQEL+DCD +  + GC GGLMD AF FI+ N GL  E  Y
Sbjct: 161 TVAAVEGINQIVTGNLTSLSEQELIDCDRT-YNNGCNGGLMDYAFSFIVENGGLHKEEDY 219

Query: 217 PYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSS 276
           PY   +G+C   +       ISGY DVP NNE +L+KA+ANQP+SVAI+ASG DFQFYS 
Sbjct: 220 PYIMEEGACEMTKEETQVVTISGYHDVPQNNEQSLLKALANQPLSVAIEASGRDFQFYSG 279

Query: 277 GVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCG 336
           GVF G CG++LDHGV AVGYGTA  G  Y  VKNSWG+ WGE GYIRM+R+I   EG+CG
Sbjct: 280 GVFDGHCGSDLDHGVAAVGYGTA-KGVDYITVKNSWGSKWGEKGYIRMRRNIGKPEGICG 338

Query: 337 IAMQASYPT 345
           I   ASYPT
Sbjct: 339 IYKMASYPT 347


>gi|222425026|dbj|BAH20463.1| cysteine protease [Spinacia oleracea]
          Length = 473

 Score =  342 bits (876), Expect = 2e-91,   Method: Compositional matrix adjust.
 Identities = 177/325 (54%), Positives = 227/325 (69%), Gaps = 10/325 (3%)

Query: 27  SRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGI 86
           S + +D  +   +E W+ Q+ + Y    EKE RF IFK+N+E+I   N+   ++ +K+G+
Sbjct: 41  SSSRSDDEVMRIYESWLVQHRKNYNALGEKEKRFAIFKDNLEFIDQHNSD-DSQTFKVGL 99

Query: 87  NEFADQTNEEFRAPRNGYKRRLPSVRSSETTDV---SFRY---ENASVPASIDWRKKGAV 140
           N+FAD TNEEFR+   G K+   S     +      S RY   E   +P ++DWRK GAV
Sbjct: 100 NKFADLTNEEFRSVYLGRKKSSSSSPLLSSAKSKVKSDRYLFKEGDELPEAVDWRKNGAV 159

Query: 141 TGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDA 200
             VKDQGQCG CWAFS +AA+EGIN I T +L SLSEQELVDCDTS  + GC+GGLMD A
Sbjct: 160 AKVKDQGQCGSCWAFSTIAAVEGINQIVTGELLSLSEQELVDCDTS-YNSGCDGGLMDYA 218

Query: 201 FEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPV 260
           +EFII+N G+ T+A YPY A DG C++   N     I  +EDVP N+E AL KAVA+QPV
Sbjct: 219 YEFIINNGGIDTDADYPYTAKDGKCDQYRKNAKVVTIDDFEDVPENDEKALQKAVAHQPV 278

Query: 261 SVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENG 320
           SVAI+A GS FQFY SGVFTG+CG +LDHGV AVGYG+ DDG  YW+V+NSWG  WGE+G
Sbjct: 279 SVAIEAGGSTFQFYQSGVFTGKCGADLDHGVVAVGYGS-DDGKDYWIVRNSWGADWGESG 337

Query: 321 YIRMQRDID-AKEGLCGIAMQASYP 344
           YIRM+R+++  K G CGIA++ SYP
Sbjct: 338 YIRMERNLETVKTGKCGIAIEPSYP 362


>gi|226506492|ref|NP_001140873.1| uncharacterized protein LOC100272949 precursor [Zea mays]
 gi|194701540|gb|ACF84854.1| unknown [Zea mays]
          Length = 379

 Score =  342 bits (876), Expect = 2e-91,   Method: Compositional matrix adjust.
 Identities = 175/322 (54%), Positives = 220/322 (68%), Gaps = 13/322 (4%)

Query: 31  NDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFA 90
           +D  + + +E W   + RV+R + EK  RF  FKENV +I + N +  ++PY+L +N F 
Sbjct: 36  SDEALWDLYERWQTHH-RVHRHHGEKGRRFGTFKENVRFIHAHNKRG-DRPYRLRLNRFG 93

Query: 91  DQTNEEFRA----PRNGYKRRLPSVRSSETTDVSFRYENAS-VPASIDWRKKGAVTGVKD 145
           D   EEFR+     R    RR  S  +       F Y++A+  P S+DWR++GAVTGVK 
Sbjct: 94  DMGREEFRSTFADSRINDLRRQDSPAARAGAVPGFMYDSAADPPRSVDWRQEGAVTGVKV 153

Query: 146 QGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFII 205
           QG CG CWAFS V A+EGIN I T  L SLSEQEL+DCDT  ++ GC+GGLM++AFEFI 
Sbjct: 154 QGHCGSCWAFSTVVAVEGINAIRTGSLASLSEQELIDCDT--DENGCQGGLMENAFEFIK 211

Query: 206 SNKGLATEAKYPYKASDGSCNKKEAN---PSAAKISGYEDVPSNNEAALMKAVANQPVSV 262
           S  G+ TEA YPY+AS+G+C+   A         I G++ VP+ +E AL KAVA+QPVSV
Sbjct: 212 SFGGITTEAAYPYRASNGTCDGDRARRGGGVVVVIDGHQMVPAGSEDALAKAVAHQPVSV 271

Query: 263 AIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYI 322
           A+DA G  FQFYS GVFTG CGT+LDHGV AVGYG  DDGT YW+VKNSWGT+WGE GYI
Sbjct: 272 AVDAGGQAFQFYSEGVFTGDCGTDLDHGVAAVGYGVGDDGTPYWIVKNSWGTSWGEGGYI 331

Query: 323 RMQRDIDAKEGLCGIAMQASYP 344
           RMQR      GLCGIAM+AS+P
Sbjct: 332 RMQRGA-GNGGLCGIAMEASFP 352


>gi|18141285|gb|AAL60580.1|AF454958_1 senescence-associated cysteine protease [Brassica oleracea]
          Length = 485

 Score =  341 bits (875), Expect = 3e-91,   Method: Compositional matrix adjust.
 Identities = 170/348 (48%), Positives = 236/348 (67%), Gaps = 16/348 (4%)

Query: 3   MILLENKLVLAAILVLGVWAPQSWSRTLN---DATMNERHEMWMAQYGRVYRDNAEKEMR 59
           +IL    +V+++ + + + +      T++   DA ++  +E W+ ++G+      EK+ R
Sbjct: 9   VILFLTMIVVSSAMDMSIISYDKNHHTVSSRSDAEVSRLYEEWLVKHGKAQNSLTEKDRR 68

Query: 60  FKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDV 119
           F+IFK+N+ +I   N K  N  Y+LG+ +FAD TN+E+R+   G + +  + +SS     
Sbjct: 69  FEIFKDNLRFIDEHNGK--NLSYRLGLTKFADLTNDEYRSMYLGSRLKRKATKSS----- 121

Query: 120 SFRYE---NASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLS 176
             RYE     ++P S+DWRK+GAV  VKDQG CG CWAFS + A+EGIN I T  L +LS
Sbjct: 122 -LRYEVRVGDAIPESVDWRKEGAVAEVKDQGSCGSCWAFSTIGAVEGINKIVTGDLITLS 180

Query: 177 EQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAK 236
           EQELVDCDTS  ++GC GGLMD AFEFII+N G+ TE  YPYK  DG C++   N     
Sbjct: 181 EQELVDCDTS-YNEGCNGGLMDYAFEFIINNGGIDTEEDYPYKGVDGRCDQTRKNAKVVT 239

Query: 237 ISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGY 296
           I  YEDVP+N+E +L KA+++QP+SVAI+  G  FQ Y SG+F G CGT+LDHGV AVGY
Sbjct: 240 IDLYEDVPANSEESLKKALSHQPISVAIEGGGRAFQLYDSGIFDGICGTDLDHGVVAVGY 299

Query: 297 GTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
           GT ++G  YW+VKNSWGT+WGE+GYIRM+R+I +  G CGIA++ SYP
Sbjct: 300 GT-ENGKDYWIVKNSWGTSWGESGYIRMERNIASSAGKCGIAVEPSYP 346


>gi|2511693|emb|CAB17076.1| cysteine proteinase precursor [Phaseolus vulgaris]
          Length = 455

 Score =  341 bits (874), Expect = 3e-91,   Method: Compositional matrix adjust.
 Identities = 181/352 (51%), Positives = 233/352 (66%), Gaps = 21/352 (5%)

Query: 1   MAMILLENKLVLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRF 60
            A+  L + L ++ I        ++  RT  D  +N  +E W+ ++G++Y    EK+ RF
Sbjct: 4   FALFALSSALDMSIISYDNAHQDKATWRT--DEEVNSLYEEWLVKHGKLYNALGEKDKRF 61

Query: 61  KIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYK----RRL---PSVRS 113
           +IFK+N+ +I   N  A N+ YKLG+N FAD TNEE+RA   G K    RRL   PS R 
Sbjct: 62  QIFKDNLRFIDQQN--AENRTYKLGLNRFADLTNEEYRARYLGTKIDPNRRLGRTPSNRY 119

Query: 114 SETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLT 173
           +            ++P S+DWRK+GAV  VKDQ  CG CWAFSA+ A+EGIN I T  L 
Sbjct: 120 APRV-------GETLPDSVDWRKEGAVVPVKDQASCGSCWAFSAIGAVEGINKIVTGDLI 172

Query: 174 SLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPS 233
           SLSEQELVDCDT G + GC GGLMD AFEFII N G+ +E  YPYK  DG C++   N  
Sbjct: 173 SLSEQELVDCDT-GYNMGCNGGLMDYAFEFIIKNGGIDSEEDYPYKGVDGRCDEYRKNAK 231

Query: 234 AAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTA 293
              I GYEDV + +E AL KAVANQPVSVA++  G +FQ YSSGVFTG+CGT LDHGV A
Sbjct: 232 VVSIDGYEDVNTYDELALKKAVANQPVSVAVEGGGREFQLYSSGVFTGRCGTALDHGVVA 291

Query: 294 VGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDI-DAKEGLCGIAMQASYP 344
           VGYGT D+G  +W+V+NSWG  WGE GYIR++R++ +++ G CGIA++ SYP
Sbjct: 292 VGYGT-DNGHDFWIVRNSWGADWGEEGYIRLERNLGNSRSGKCGIAIEPSYP 342


>gi|224083362|ref|XP_002306996.1| predicted protein [Populus trichocarpa]
 gi|222856445|gb|EEE93992.1| predicted protein [Populus trichocarpa]
          Length = 336

 Score =  340 bits (873), Expect = 4e-91,   Method: Compositional matrix adjust.
 Identities = 173/307 (56%), Positives = 214/307 (69%), Gaps = 8/307 (2%)

Query: 40  EMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRA 99
           E W++++ ++Y    EK  RF+IFK+N+ +I   N K  N  Y LG+NEFAD ++EEF+ 
Sbjct: 34  ESWISKHQKIYESIEEKWHRFEIFKDNLFHIDETNKKVVN--YWLGLNEFADLSHEEFKN 91

Query: 100 PRNGYKRRLPSVRSSETTDVSFRYEN-ASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAV 158
              G    L + R        F Y++ +S+P S+DWRKKGAVT VK+QG CG CWAFS V
Sbjct: 92  KYLGLNVDLSNRRECSE---EFTYKDVSSIPKSVDWRKKGAVTDVKNQGSCGSCWAFSTV 148

Query: 159 AAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPY 218
           AA+EGIN I T  LTSLSEQELVDCDT+  + GC GGLMD AF +IISN GL  E  YPY
Sbjct: 149 AAVEGINQIVTGNLTSLSEQELVDCDTT-YNNGCNGGLMDYAFAYIISNGGLHKEEDYPY 207

Query: 219 KASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGV 278
              +G+C  ++A      ISGY DVP N+E +L+KA+ANQP+SVAIDASG DFQFYS GV
Sbjct: 208 IMEEGTCEMRKAESEVVTISGYHDVPQNSEESLLKALANQPLSVAIDASGRDFQFYSGGV 267

Query: 279 FTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIA 338
           F G CGTELDHGV AVGYG+A  G  + +VKNSWG+ WGE G+IRM+R+     GLCGI 
Sbjct: 268 FDGHCGTELDHGVAAVGYGSA-KGLDFIVVKNSWGSKWGEKGFIRMKRNTGKPAGLCGIN 326

Query: 339 MQASYPT 345
             ASYPT
Sbjct: 327 KMASYPT 333


>gi|5777889|emb|CAB53515.1| cysteine protease [Solanum tuberosum]
          Length = 466

 Score =  340 bits (873), Expect = 4e-91,   Method: Compositional matrix adjust.
 Identities = 167/320 (52%), Positives = 219/320 (68%), Gaps = 12/320 (3%)

Query: 31  NDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFA 90
           +D  ++  +E W+ ++G+ Y    EK+ RF+IFK+N++YI    N   N+ YKLG+ +FA
Sbjct: 41  SDDEVSALYESWLIEHGKSYNALGEKDKRFQIFKDNLKYIDE-QNSVPNQSYKLGLTKFA 99

Query: 91  DQTNEEFRA-----PRNGYKRRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKD 145
           D TNEE+R+       +G +R+L    S   +D        S+P S+DWR KG + GVKD
Sbjct: 100 DLTNEEYRSIYLGTKSSGDRRKL----SKNKSDRYLPKVGDSLPESVDWRDKGVLVGVKD 155

Query: 146 QGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFII 205
           QG CG CWAFSAVAAME IN I T  L SLSEQELVDCD S  ++GC+GGLMD AFEF+I
Sbjct: 156 QGSCGSCWAFSAVAAMESINAIVTGNLISLSEQELVDCDKS-YNEGCDGGLMDYAFEFVI 214

Query: 206 SNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAID 265
           +N G+ TE  YPYK  +  C++   N    KI  YEDVP NNE AL KAVA+QPVS+AI+
Sbjct: 215 NNGGIDTEEDYPYKERNDVCDQYRKNAKVVKIDSYEDVPVNNEKALQKAVAHQPVSIAIE 274

Query: 266 ASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQ 325
           A G D Q Y SG+FTG+CGT +DHGV A GYG+ ++G  YW+V+NSWG  WGE GY+R+Q
Sbjct: 275 AGGRDLQHYKSGIFTGKCGTAVDHGVVAAGYGS-ENGMDYWIVRNSWGAKWGEKGYLRVQ 333

Query: 326 RDIDAKEGLCGIAMQASYPT 345
           R++ +  GLCG+A + SYP 
Sbjct: 334 RNVASSSGLCGLATEPSYPV 353


>gi|374713651|gb|AEZ65083.1| cysteine protease [Carica papaya]
          Length = 467

 Score =  340 bits (873), Expect = 5e-91,   Method: Compositional matrix adjust.
 Identities = 174/310 (56%), Positives = 219/310 (70%), Gaps = 9/310 (2%)

Query: 39  HEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFR 98
           +E W+ ++G+ Y    EKE RF IFK+N+ +I   N  ++N  Y+LG+N FAD TNEE+R
Sbjct: 49  YEAWLVKHGKAYNALGEKEKRFGIFKDNLRFIDEHN--SQNLTYRLGLNRFADLTNEEYR 106

Query: 99  APRNGYK---RRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQCGCCWAF 155
           +   G K    R+    S ++   + R  +A +P  IDWRK+GAV GVKDQG CG CWAF
Sbjct: 107 SMYLGVKPGATRVTRKVSRKSDRFAARVGDA-LPDFIDWRKEGAVVGVKDQGSCGSCWAF 165

Query: 156 SAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAK 215
           S +AA+EGIN I T  L SLSEQELVDCDTS  ++GC GGLMD AFEFII+N G+ +E  
Sbjct: 166 STIAAVEGINQIVTGDLISLSEQELVDCDTS-YNEGCNGGLMDYAFEFIINNGGIDSEED 224

Query: 216 YPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYS 275
           YPY+A+D  C++   N +   I GYEDVP N+EAAL KAVA QPVSVAI+A G  FQ Y 
Sbjct: 225 YPYRAADQKCDQYRKNANVVSIDGYEDVPENDEAALKKAVAKQPVSVAIEAGGRAFQLYQ 284

Query: 276 SGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDI-DAKEGL 334
           SGVFTG+CGT LDHGV AVGYGT ++G  YW+V NSWG  WGE+GYIRM+R++  +  G 
Sbjct: 285 SGVFTGKCGTSLDHGVAAVGYGT-ENGQDYWIVGNSWGKNWGEDGYIRMERNLAGSSSGK 343

Query: 335 CGIAMQASYP 344
           CGIA+  SYP
Sbjct: 344 CGIAIGPSYP 353


>gi|171702843|dbj|BAG16377.1| cysteine protease [Brassica rapa var. perviridis]
          Length = 431

 Score =  340 bits (873), Expect = 5e-91,   Method: Compositional matrix adjust.
 Identities = 170/348 (48%), Positives = 233/348 (66%), Gaps = 16/348 (4%)

Query: 3   MILLENKLVLAAILVLGVWAPQSWSRTLN---DATMNERHEMWMAQYGRVYRDNAEKEMR 59
           +IL    +V+++ + + + +      T++   D  ++  +E W+ ++G+      EK+ R
Sbjct: 3   VILFLAMIVVSSAMDMSIISYDKNHHTVSSRSDVEVSRLYEEWVVKHGKAQNSLTEKDRR 62

Query: 60  FKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDV 119
           F+IFK+N+ +I   N K  N  Y+LG+ +FAD TN+E+R+   G + +       + T  
Sbjct: 63  FEIFKDNLRFIDEHNGK--NLSYRLGLTKFADLTNDEYRSMYLGSRLK------RKATKT 114

Query: 120 SFRYE---NASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLS 176
           S RYE     ++P S+DWRK+GAV  VKDQG CG CWAFS + A+EGIN I T  L SLS
Sbjct: 115 SLRYEARVGDAIPESVDWRKEGAVAEVKDQGSCGSCWAFSTIGAVEGINKIVTGDLISLS 174

Query: 177 EQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAK 236
           EQELVDCDTS  ++GC GGLMD AFEFII N G+ TE  YPYK  DG C++   N     
Sbjct: 175 EQELVDCDTS-YNEGCNGGLMDYAFEFIIKNGGIDTEEDYPYKGVDGRCDQTRKNAKVVT 233

Query: 237 ISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGY 296
           I  YEDVP+N+E +L KA+++QP+SVAI+  G  FQ Y SG+F G CGT+LDHGV AVGY
Sbjct: 234 IDSYEDVPANSEESLKKALSHQPISVAIEGGGRAFQLYDSGIFDGICGTDLDHGVVAVGY 293

Query: 297 GTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
           GT ++G  YW+VKNSWGT+WGE+GYIRM+R+I +  G CGIA++ SYP
Sbjct: 294 GT-ENGKDYWIVKNSWGTSWGESGYIRMERNIASSAGKCGIAVEPSYP 340


>gi|357162587|ref|XP_003579458.1| PREDICTED: oryzain beta chain-like [Brachypodium distachyon]
          Length = 470

 Score =  340 bits (873), Expect = 5e-91,   Method: Compositional matrix adjust.
 Identities = 169/311 (54%), Positives = 220/311 (70%), Gaps = 9/311 (2%)

Query: 41  MWMAQYGRVYRDN-AEKEMRFKIFKENVEYIASFNNKAR--NKPYKLGINEFADQTNEEF 97
           +W A++G    ++  E+E RF+ F +N+ ++ + N +A    + ++LG+N FAD TN+EF
Sbjct: 54  LWRAEHGSGNSNSLGEEERRFRAFWDNLRFVDAHNARAAAGEEGFRLGMNRFADLTNDEF 113

Query: 98  RAPRNGYKRRLPSVRSSETTDVSFRYENASV---PASIDWRKKGAVTGVKDQGQCGCCWA 154
           RA   G K      R S    V  RY +  V   P ++DWR+KGAV  VK+QGQCG CWA
Sbjct: 114 RAAYLGVKG--AGQRRSARAGVGERYRHDGVEELPEAVDWREKGAVAPVKNQGQCGSCWA 171

Query: 155 FSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEA 214
           FSAV+A+E IN + T +L +LSEQELV+CD +G+  GC GGLMDDAF+FII+N G+ TE 
Sbjct: 172 FSAVSAVESINQLVTGELVTLSEQELVECDINGQSNGCNGGLMDDAFDFIINNGGIDTED 231

Query: 215 KYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFY 274
            YPYKA DG C+    N     I G+EDVP N+E +L KAVA+QPVSVAI+A G +FQ Y
Sbjct: 232 DYPYKALDGKCDINRRNAKVVSIDGFEDVPENDEKSLQKAVAHQPVSVAIEAGGREFQLY 291

Query: 275 SSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGL 334
            SGVFTG+CGTELDHGV AVGYGT ++G  YW+V+NSWG  WGE GY+RM+R+I+A  G 
Sbjct: 292 HSGVFTGRCGTELDHGVVAVGYGT-ENGKDYWIVRNSWGPKWGEAGYLRMERNINATTGK 350

Query: 335 CGIAMQASYPT 345
           CGIAM +SYPT
Sbjct: 351 CGIAMMSSYPT 361


>gi|50355617|dbj|BAD29957.1| cysteine protease [Daucus carota]
          Length = 437

 Score =  340 bits (873), Expect = 5e-91,   Method: Compositional matrix adjust.
 Identities = 167/307 (54%), Positives = 213/307 (69%), Gaps = 8/307 (2%)

Query: 42  WMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPR 101
           W+ ++G+ Y    EKE RF+IFK+N+ YI + +N   ++ Y+LG+N FAD TNEE+RA  
Sbjct: 52  WLVKHGKSYNALGEKETRFQIFKDNLRYIDN-HNADPDRSYELGLNRFADLTNEEYRAKY 110

Query: 102 NGYKRRLPSVRSSETTDVSFRY---ENASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAV 158
            G K R    R   +   S RY   E   +P SIDWR+KGAV  VKDQG CG CWAFSA+
Sbjct: 111 LGTKSR--ESRPKLSKGPSDRYAPVEGEELPDSIDWREKGAVAAVKDQGSCGSCWAFSAI 168

Query: 159 AAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPY 218
            A+EGIN ITT +L +LSEQELVDCD S  ++GCEGGLMD AF FII N G+ ++  YPY
Sbjct: 169 GAVEGINQITTGELITLSEQELVDCDRS-YNEGCEGGLMDYAFNFIIKNGGIDSDLDYPY 227

Query: 219 KASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGV 278
              DG+CN+ + N     I  YEDVP  +E AL KA ANQP+SVAI+A G DFQ Y SG+
Sbjct: 228 TGRDGTCNQNKENAKVVTIDSYEDVPVYDEKALQKAAANQPISVAIEAGGMDFQLYVSGI 287

Query: 279 FTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIA 338
           FTG+CGT +DHGV  VGYG+ ++G  YW+V+NSWG  WGE GY++MQR++    GLCGI 
Sbjct: 288 FTGKCGTAVDHGVVVVGYGS-EEGMDYWIVRNSWGAAWGEAGYLKMQRNVGKSSGLCGIT 346

Query: 339 MQASYPT 345
           ++ SYP 
Sbjct: 347 IEPSYPV 353


>gi|297816030|ref|XP_002875898.1| hypothetical protein ARALYDRAFT_485194 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297321736|gb|EFH52157.1| hypothetical protein ARALYDRAFT_485194 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 363

 Score =  340 bits (872), Expect = 6e-91,   Method: Compositional matrix adjust.
 Identities = 172/310 (55%), Positives = 212/310 (68%), Gaps = 7/310 (2%)

Query: 39  HEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFR 98
           +E W   +  V R + E   RF +F+ NV ++   N K  NKPYKL +N FAD T+ EFR
Sbjct: 37  YERWRDHHS-VTRASHEALKRFNVFRHNVLHVHRTNKK--NKPYKLKVNRFADITHHEFR 93

Query: 99  APRNGYK-RRLPSVRSSETTDVSFRYENAS-VPASIDWRKKGAVTGVKDQGQCGCCWAFS 156
           +   G   +    +R  +     F YEN + VP+S+DWR+KGAVT VK+Q  CG CWAFS
Sbjct: 94  SSYAGSNVKHHRMLRGPKRGSGGFMYENVTRVPSSVDWREKGAVTEVKNQQDCGSCWAFS 153

Query: 157 AVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKY 216
            VAA+EGIN I T KL SLSEQELVDCDT  E+QGC GGLM+ AFEFI +N G+ TE  Y
Sbjct: 154 TVAAVEGINKIRTNKLVSLSEQELVDCDTE-ENQGCAGGLMEPAFEFIKNNGGIKTEETY 212

Query: 217 PYKASDGS-CNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYS 275
           PY ++D   C  K  +     I G+E VP N+E AL+KAVA+QPVSVAIDA  SDFQ YS
Sbjct: 213 PYDSNDVQFCRAKSIDGETVTIDGHEHVPENDEEALLKAVAHQPVSVAIDAGSSDFQLYS 272

Query: 276 SGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLC 335
            GVF G+CGT+L+HGV  VGYG   +GTKYW+V+NSWG  WGE GY+R++R I   EG C
Sbjct: 273 EGVFIGECGTQLNHGVVIVGYGETKNGTKYWIVRNSWGPEWGEGGYVRIERGISENEGRC 332

Query: 336 GIAMQASYPT 345
           GIAM+ASYPT
Sbjct: 333 GIAMEASYPT 342


>gi|226533314|ref|NP_001150119.1| xylem cysteine proteinase 2 [Zea mays]
 gi|195636886|gb|ACG37911.1| xylem cysteine proteinase 2 precursor [Zea mays]
 gi|223946183|gb|ACN27175.1| unknown [Zea mays]
 gi|413951209|gb|AFW83858.1| Xylem cysteine proteinase 2 [Zea mays]
          Length = 385

 Score =  340 bits (871), Expect = 7e-91,   Method: Compositional matrix adjust.
 Identities = 179/333 (53%), Positives = 220/333 (66%), Gaps = 24/333 (7%)

Query: 34  TMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQT 93
           ++ E  E W++++ R Y    EK  RF++FK+N+ +I   N K  +  Y LG+NEFAD T
Sbjct: 54  SLAELFERWLSRHRRAYASLEEKLRRFQVFKDNLHHIDETNRKVSS--YWLGLNEFADLT 111

Query: 94  NEEFRAPRNGYKRRLPSVRSSETTDVSFRYEN-------ASVPASIDWRKKGAVTGVKDQ 146
           ++EF+A   G +  +    S    D     E        AS+P S+DWR KGAVTGVK+Q
Sbjct: 112 HDEFKATYLGLRSSVGDGGSGIDDDDEPEEEEGYEGVDGASLPKSVDWRSKGAVTGVKNQ 171

Query: 147 GQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIIS 206
           GQCG CWAFS VAA+EGIN I T  LT+LSEQEL+DCDT G + GC GGLMD AF +I  
Sbjct: 172 GQCGSCWAFSTVAAVEGINQIVTGNLTALSEQELIDCDTDG-NNGCNGGLMDYAFSYIAH 230

Query: 207 NKGLATEAKYPYKASDGSCNK------------KEANPSAA--KISGYEDVPSNNEAALM 252
           N GL TE  YPY   +G+C +            ++AN  AA   ISGYEDVP NNE AL+
Sbjct: 231 NGGLHTEEAYPYLMEEGTCQRSSSSEKKWPGSSEDANDDAAVVTISGYEDVPRNNEQALL 290

Query: 253 KAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSW 312
           KA+A QPVSVAI+ASG +FQFYS GVF G CGT+LDHGV AVGYGTA  G  Y +VKNSW
Sbjct: 291 KALAQQPVSVAIEASGRNFQFYSGGVFDGPCGTQLDHGVAAVGYGTAAKGHDYIIVKNSW 350

Query: 313 GTTWGENGYIRMQRDIDAKEGLCGIAMQASYPT 345
           G +WGE GYIRM+R    ++GLCGI   ASYPT
Sbjct: 351 GPSWGEKGYIRMRRGTGKRQGLCGINKMASYPT 383


>gi|2342494|dbj|BAA21848.1| bromelain [Ananas comosus]
 gi|2463582|dbj|BAA22543.1| FB31 precursor [Ananas comosus]
          Length = 352

 Score =  339 bits (870), Expect = 9e-91,   Method: Compositional matrix adjust.
 Identities = 177/339 (52%), Positives = 227/339 (66%), Gaps = 10/339 (2%)

Query: 9   KLVLAAILVLGVWA-PQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENV 67
           +LV   + +  +WA P + SR      M +R E WMA+YGRVY+DN EK  RF+IFK NV
Sbjct: 6   QLVFLFLFLCVMWASPSAASRDEPSDPMMKRFEEWMAEYGRVYKDNDEKMRRFQIFKNNV 65

Query: 68  EYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENAS 127
            +I +FNN+  N  Y LGIN+F D TN EF A   G   R  ++       VSF   N S
Sbjct: 66  NHIETFNNRNGNS-YTLGINKFTDMTNNEFVAQYTGGISRPLNIEKEPV--VSFDDVNIS 122

Query: 128 -VPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTS 186
            V  SIDWR  GAVT VKDQ  CG CWAFSA+A +EGI  I T  L SLSEQE++DC  S
Sbjct: 123 AVGQSIDWRDYGAVTEVKDQNPCGSCWAFSAIATVEGIYKIVTGYLVSLSEQEVLDCAVS 182

Query: 187 GEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSN 246
               GC+GG +D+A++FIISN G+A+EA YPY+A  G C    + P++A I+GY  V SN
Sbjct: 183 ---NGCDGGFVDNAYDFIISNNGVASEADYPYQAYQGDC-AANSWPNSAYITGYSYVRSN 238

Query: 247 NEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYW 306
           +E+++  AV NQP++ AIDASG +FQ+Y+ GVF+G CGT L+H +T +GYG    GT+YW
Sbjct: 239 DESSMKYAVWNQPIAAAIDASGDNFQYYNGGVFSGPCGTSLNHAITIIGYGQDSSGTQYW 298

Query: 307 LVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPT 345
           +VKNSWG++WGE GYIRM R + +  GLCGIAM   YPT
Sbjct: 299 IVKNSWGSSWGERGYIRMARGV-SSSGLCGIAMDPLYPT 336


>gi|242086591|ref|XP_002439128.1| hypothetical protein SORBIDRAFT_09g000960 [Sorghum bicolor]
 gi|241944413|gb|EES17558.1| hypothetical protein SORBIDRAFT_09g000960 [Sorghum bicolor]
          Length = 371

 Score =  339 bits (870), Expect = 9e-91,   Method: Compositional matrix adjust.
 Identities = 177/310 (57%), Positives = 216/310 (69%), Gaps = 12/310 (3%)

Query: 40  EMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRA 99
           E W+A+Y + Y    EK  RF++FK+N+ +I   N K     Y LG+N FAD T++EF+A
Sbjct: 67  EEWVAKYRKAYASFEEKLHRFEVFKDNLHHIDEANKKVTT--YWLGLNAFADLTHDEFKA 124

Query: 100 PRNGYKRRLPSVRSSETTDVSFRYENAS---VPASIDWRKKGAVTGVKDQGQCGCCWAFS 156
              G   R P  +  +TTD  FRY   +   VPAS+DWRKKGAVT VK+QGQCG CWAFS
Sbjct: 125 TYLGL--RQPETK--KTTDSRFRYGGVADDDVPASVDWRKKGAVTDVKNQGQCGSCWAFS 180

Query: 157 AVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKY 216
            VAA+EGIN I T  LTSLSEQELVDC T G + GC GG+MD+AF +I S+ GL TE  Y
Sbjct: 181 TVAAVEGINQIVTGNLTSLSEQELVDCSTDG-NNGCNGGVMDNAFSYIASSGGLRTEEAY 239

Query: 217 PYKASDGSCNKKEAN-PSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYS 275
           PY   +G C+ K  +      ISGYEDVP+N+E AL+KA+A+QP+SVAI+ASG  FQFYS
Sbjct: 240 PYLMEEGDCDDKARDGEQVVTISGYEDVPANDEQALVKALAHQPLSVAIEASGRHFQFYS 299

Query: 276 SGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLC 335
            GVF G CG+ELDHGV AVGYG++  G  Y +VKNSWG+ WGE GYIRM+R     EGLC
Sbjct: 300 GGVFNGPCGSELDHGVAAVGYGSS-KGQDYIIVKNSWGSHWGEKGYIRMKRGTGKPEGLC 358

Query: 336 GIAMQASYPT 345
           GI   ASYPT
Sbjct: 359 GINKMASYPT 368


>gi|414879123|tpg|DAA56254.1| TPA: hypothetical protein ZEAMMB73_708930 [Zea mays]
          Length = 368

 Score =  339 bits (870), Expect = 1e-90,   Method: Compositional matrix adjust.
 Identities = 172/320 (53%), Positives = 223/320 (69%), Gaps = 14/320 (4%)

Query: 31  NDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFA 90
           +D  + + +E W   + RV+R + EK  RF  FKEN  +I + N +  ++PY+L +N F 
Sbjct: 34  SDEALWDLYERWQTHH-RVHRHHGEKGRRFGTFKENARFIHAHNKRG-DRPYRLRLNRFG 91

Query: 91  DQTNEEFRAPRNGY-KRRLPSVRSSETTDVS---FRYENAS-VPASIDWRKKGAVTGVKD 145
           D   EEFR+   G+   R+  +R   T   +   F Y++A+ +P S+DWR+KGAVT VK+
Sbjct: 92  DMGREEFRS---GFADSRINDLRREPTAAPAVPGFMYDDATDLPRSVDWRQKGAVTAVKN 148

Query: 146 QGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFII 205
           QG+CG CWAFS V A+EGIN I T  L SLSEQEL+DCDT  ++ GC+GGLM++AFEFI 
Sbjct: 149 QGRCGSCWAFSTVVAVEGINAIRTGSLVSLSEQELIDCDT--DENGCQGGLMENAFEFIK 206

Query: 206 SNKGLATEAKYPYKASDGSCNKKEANPS-AAKISGYEDVPSNNEAALMKAVANQPVSVAI 264
           S+ G+ TE+ YPY AS+G+C+   A       I G++ VP+ +E AL KAVA+QPVSVAI
Sbjct: 207 SHGGITTESAYPYHASNGTCDGARARRGRVVAIDGHQAVPAGSEDALAKAVAHQPVSVAI 266

Query: 265 DASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRM 324
           DA G   QFYS GVFTG CGT+LDHGV AVGYG +DDGT YW+VKNSWG +WGE GYIRM
Sbjct: 267 DAGGQALQFYSEGVFTGDCGTDLDHGVAAVGYGVSDDGTPYWIVKNSWGPSWGEGGYIRM 326

Query: 325 QRDIDAKEGLCGIAMQASYP 344
           QR      GLCGIAM+AS+P
Sbjct: 327 QRGT-GNGGLCGIAMEASFP 345


>gi|357437715|ref|XP_003589133.1| Cysteine proteinase [Medicago truncatula]
 gi|87240770|gb|ABD32628.1| Granulin; Peptidase C1A, papain [Medicago truncatula]
 gi|355478181|gb|AES59384.1| Cysteine proteinase [Medicago truncatula]
          Length = 474

 Score =  339 bits (870), Expect = 1e-90,   Method: Compositional matrix adjust.
 Identities = 173/357 (48%), Positives = 240/357 (67%), Gaps = 21/357 (5%)

Query: 1   MAMILLENKLVLAAILVLGVWAPQSWSRTLNDATMNER--------HEMWMAQYGRVYRD 52
           + ++L+ +   ++  L + +    S+ +T  D + ++R        +E W+ ++G+ Y  
Sbjct: 12  LMIVLIISSFTVSLALDMSII---SYDKTHPDKSTSKRTNKEVLTMYEEWLVKHGKSYNG 68

Query: 53  NAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYK----RRL 108
             EK+ RF+IFK+N+++I   N    N  Y+LG+  FAD TNEE+R+   G K    RR+
Sbjct: 69  LGEKDKRFEIFKDNLKFIDEHN--GLNSTYRLGLTRFADLTNEEYRSKFLGTKIDPNRRM 126

Query: 109 PSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHIT 168
             +  S++   + R  +  +P S+DWRK+GAV GVKDQ  CG CWAFSA+AA+EGIN I 
Sbjct: 127 KKLGGSKSNRYAPRVGD-KLPESVDWRKEGAVVGVKDQASCGSCWAFSAIAAVEGINKIV 185

Query: 169 TRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKK 228
           T  L SLSEQELVDCDTS  ++GC GGLMD AFEFIISN G+ +E  YPYKA DG C++ 
Sbjct: 186 TGDLISLSEQELVDCDTS-YNEGCNGGLMDYAFEFIISNGGIDSEDDYPYKAVDGRCDQN 244

Query: 229 EANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELD 288
             N     I  YEDVP+ +E AL KAVANQP++VA++  G +FQ Y  GVFTG+CGT LD
Sbjct: 245 RKNAKVVTIDDYEDVPAYDELALQKAVANQPIAVAVEGGGREFQLYEYGVFTGRCGTALD 304

Query: 289 HGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDI-DAKEGLCGIAMQASYP 344
           HGV AVGYGT ++G  YW+V+NSWG +WGE GYIR++R++  ++ G CGIA++ SYP
Sbjct: 305 HGVAAVGYGT-ENGKDYWIVRNSWGGSWGEQGYIRLERNLASSRAGKCGIAIEPSYP 360


>gi|160858205|dbj|BAF93840.1| triticain beta 2 [Triticum aestivum]
          Length = 469

 Score =  339 bits (869), Expect = 1e-90,   Method: Compositional matrix adjust.
 Identities = 164/314 (52%), Positives = 221/314 (70%), Gaps = 10/314 (3%)

Query: 39  HEMWMAQYGRVYRDNA----EKEMRFKIFKENVEYIASFNNKAR--NKPYKLGINEFADQ 92
           +++W+A++G     NA    E+E RF+ F +N+ ++ + N +A    + ++L +N FAD 
Sbjct: 50  YDLWLAEHGGGSYPNANSIPERERRFRAFWDNLRFVDAHNARAAAGEEGFRLAMNRFADL 109

Query: 93  TNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENAS-VPASIDWRKKGAVTGVKDQGQCGC 151
           TN+EFRA   G K +    R        +R++ A  +P ++DWR+KGAV  VK+QGQCG 
Sbjct: 110 TNDEFRAAYLGVKGQ--RARPGRVVGERYRHDGAEELPEAVDWREKGAVAPVKNQGQCGS 167

Query: 152 CWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLA 211
           CWAFSA++ +E IN I T ++ +LSEQELV+CDT+G+  GC GGLMDDAFEFII N G+ 
Sbjct: 168 CWAFSAISTVESINQIVTGEMVTLSEQELVECDTNGQSSGCNGGLMDDAFEFIIKNGGID 227

Query: 212 TEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDF 271
           TE  YPYKA DG C+    N     I G+EDVP N+E +L KAVA+QPVSVAI+A G +F
Sbjct: 228 TEDDYPYKAIDGRCDVLRKNAKVVSIDGFEDVPENDEKSLQKAVAHQPVSVAIEAGGREF 287

Query: 272 QFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAK 331
           Q Y SGVF+G+CGT+LDHGV AVGYGT ++G  YW+V+NSWG  WGE GY+RM+R+I+  
Sbjct: 288 QLYHSGVFSGRCGTQLDHGVVAVGYGT-ENGKDYWIVRNSWGPNWGEAGYLRMERNINVT 346

Query: 332 EGLCGIAMQASYPT 345
            G CGIAM +SYPT
Sbjct: 347 SGKCGIAMMSSYPT 360


>gi|18408616|ref|NP_566901.1| putative cysteine proteinase [Arabidopsis thaliana]
 gi|75313880|sp|Q9STL5.1|CEP3_ARATH RecName: Full=KDEL-tailed cysteine endopeptidase CEP3; Flags:
           Precursor
 gi|4678353|emb|CAB41163.1| cysteine endopeptidase precursor-like protein [Arabidopsis
           thaliana]
 gi|26453052|dbj|BAC43602.1| putative cysteine endopeptidase precursor [Arabidopsis thaliana]
 gi|332644885|gb|AEE78406.1| putative cysteine proteinase [Arabidopsis thaliana]
          Length = 364

 Score =  339 bits (869), Expect = 1e-90,   Method: Compositional matrix adjust.
 Identities = 172/310 (55%), Positives = 209/310 (67%), Gaps = 7/310 (2%)

Query: 39  HEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFR 98
           +E W   +  V R + E   RF +F+ NV ++   N K  NKPYKL IN FAD T+ EFR
Sbjct: 38  YERWRGHHS-VSRASHEAIKRFNVFRHNVLHVHRTNKK--NKPYKLKINRFADITHHEFR 94

Query: 99  APRNGYK-RRLPSVRSSETTDVSFRYENAS-VPASIDWRKKGAVTGVKDQGQCGCCWAFS 156
           +   G   +    +R  +     F YEN + VP+S+DWR+KGAVT VK+Q  CG CWAFS
Sbjct: 95  SSYAGSNVKHHRMLRGPKRGSGGFMYENVTRVPSSVDWREKGAVTEVKNQQDCGSCWAFS 154

Query: 157 AVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKY 216
            VAA+EGIN I T KL SLSEQELVDCDT  E+QGC GGLM+ AFEFI +N G+ TE  Y
Sbjct: 155 TVAAVEGINKIRTNKLVSLSEQELVDCDTE-ENQGCAGGLMEPAFEFIKNNGGIKTEETY 213

Query: 217 PYKASDGS-CNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYS 275
           PY +SD   C           I G+E VP N+E  L+KAVA+QPVSVAIDA  SDFQ YS
Sbjct: 214 PYDSSDVQFCRANSIGGETVTIDGHEHVPENDEEELLKAVAHQPVSVAIDAGSSDFQLYS 273

Query: 276 SGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLC 335
            GVF G+CGT+L+HGV  VGYG   +GTKYW+V+NSWG  WGE GY+R++R I   EG C
Sbjct: 274 EGVFIGECGTQLNHGVVIVGYGETKNGTKYWIVRNSWGPEWGEGGYVRIERGISENEGRC 333

Query: 336 GIAMQASYPT 345
           GIAM+ASYPT
Sbjct: 334 GIAMEASYPT 343


>gi|357437719|ref|XP_003589135.1| Cysteine proteinase [Medicago truncatula]
 gi|355478183|gb|AES59386.1| Cysteine proteinase [Medicago truncatula]
          Length = 457

 Score =  339 bits (869), Expect = 1e-90,   Method: Compositional matrix adjust.
 Identities = 173/357 (48%), Positives = 240/357 (67%), Gaps = 21/357 (5%)

Query: 1   MAMILLENKLVLAAILVLGVWAPQSWSRTLNDATMNER--------HEMWMAQYGRVYRD 52
           + ++L+ +   ++  L + +    S+ +T  D + ++R        +E W+ ++G+ Y  
Sbjct: 12  LMIVLIISSFTVSLALDMSII---SYDKTHPDKSTSKRTNKEVLTMYEEWLVKHGKSYNG 68

Query: 53  NAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYK----RRL 108
             EK+ RF+IFK+N+++I   N    N  Y+LG+  FAD TNEE+R+   G K    RR+
Sbjct: 69  LGEKDKRFEIFKDNLKFIDEHN--GLNSTYRLGLTRFADLTNEEYRSKFLGTKIDPNRRM 126

Query: 109 PSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHIT 168
             +  S++   + R  +  +P S+DWRK+GAV GVKDQ  CG CWAFSA+AA+EGIN I 
Sbjct: 127 KKLGGSKSNRYAPRVGD-KLPESVDWRKEGAVVGVKDQASCGSCWAFSAIAAVEGINKIV 185

Query: 169 TRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKK 228
           T  L SLSEQELVDCDTS  ++GC GGLMD AFEFIISN G+ +E  YPYKA DG C++ 
Sbjct: 186 TGDLISLSEQELVDCDTS-YNEGCNGGLMDYAFEFIISNGGIDSEDDYPYKAVDGRCDQN 244

Query: 229 EANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELD 288
             N     I  YEDVP+ +E AL KAVANQP++VA++  G +FQ Y  GVFTG+CGT LD
Sbjct: 245 RKNAKVVTIDDYEDVPAYDELALQKAVANQPIAVAVEGGGREFQLYEYGVFTGRCGTALD 304

Query: 289 HGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDI-DAKEGLCGIAMQASYP 344
           HGV AVGYGT ++G  YW+V+NSWG +WGE GYIR++R++  ++ G CGIA++ SYP
Sbjct: 305 HGVAAVGYGT-ENGKDYWIVRNSWGGSWGEQGYIRLERNLASSRAGKCGIAIEPSYP 360


>gi|30141023|dbj|BAC75925.1| cysteine protease-3 [Helianthus annuus]
          Length = 348

 Score =  338 bits (868), Expect = 2e-90,   Method: Compositional matrix adjust.
 Identities = 166/316 (52%), Positives = 214/316 (67%), Gaps = 11/316 (3%)

Query: 32  DATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFAD 91
           D ++ + +E W +Q+  V R   EK+ RF +FK NV +I   N     KPYKL +NEFAD
Sbjct: 33  DKSLWDLYERWGSQH-MVSRAPDEKKKRFNVFKYNVNHINRVNQLG--KPYKLKLNEFAD 89

Query: 92  QTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENASV---PASIDWRKKGAVTGVKDQGQ 148
            TN EF+A   G+  ++   R  +       + +A     P SIDWR  GAV  +K+QG+
Sbjct: 90  MTNHEFKA---GFDSKILHFRMLKGKRRQTPFTHAKTTDPPPSIDWRTNGAVNPIKNQGR 146

Query: 149 CGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNK 208
           CG CWAFS +  +EGIN I T +L SLSEQELVDC+T  E  GC GGLM++ +EFI    
Sbjct: 147 CGSCWAFSTIVGVEGINKIKTNQLVSLSEQELVDCETDCE--GCNGGLMENGYEFIKETG 204

Query: 209 GLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASG 268
           G+ TE  YPY A +G C+  + N    KI G+E+VP+N+E+A+++AVANQPVS+AIDA G
Sbjct: 205 GVTTEQIYPYFARNGRCDISKRNSPVVKIDGFENVPANDESAMLRAVANQPVSIAIDAGG 264

Query: 269 SDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDI 328
            +FQFYS GVF G CGTEL+HGV  VGYGT  DGT YW+V+NSWGT WGE GY+RMQR +
Sbjct: 265 LNFQFYSQGVFNGACGTELNHGVAIVGYGTTQDGTNYWIVRNSWGTGWGEQGYVRMQRGV 324

Query: 329 DAKEGLCGIAMQASYP 344
           +  EGLCG+AM ASYP
Sbjct: 325 NVPEGLCGLAMDASYP 340


>gi|4426617|gb|AAD20453.1| cysteine endopeptidase precursor [Oryza sativa]
          Length = 368

 Score =  338 bits (868), Expect = 2e-90,   Method: Compositional matrix adjust.
 Identities = 170/317 (53%), Positives = 215/317 (67%), Gaps = 7/317 (2%)

Query: 31  NDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFA 90
           +D  + + +E W   +  V R + EK  RF  FK+NV YI   N +A   P    +N F 
Sbjct: 38  SDEALWDLYERWQEHH-HVPRHHGEKHRRFGAFKDNVRYIHEHNKRAPGYP---PLNRFG 93

Query: 91  DQTNEEFRAPRNG-YKRRLPSVRSSETTDVSFRYENA-SVPASIDWRKKGAVTGVKDQGQ 148
           D   EEFRA   G +   L     +      F YE    +P ++DWR+KGAVTGVKDQG+
Sbjct: 94  DMGREEFRATFAGSHANDLRRDGLAAPPLPGFMYEGVRDLPRAVDWRRKGAVTGVKDQGK 153

Query: 149 CGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNK 208
           CG CWAFS V ++EGIN I T +L SLSEQEL+DCDT+ ++ GC+GGLM++AFE+I  + 
Sbjct: 154 CGSCWAFSTVVSVEGINAIRTGRLVSLSEQELIDCDTA-DNSGCQGGLMENAFEYIKHSG 212

Query: 209 GLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASG 268
           G+ TE+ YPY+A++G+C+   A      I G+++VP+N+EAAL KAVANQPVSVAIDA  
Sbjct: 213 GITTESAYPYRAANGTCDAVRARGGLVVIDGHQNVPANSEAALAKAVANQPVSVAIDAGD 272

Query: 269 SDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDI 328
             FQFYS GVF G CGT+LDHGV  VGYG  +DGT+YW+VKNSWGT WGE GYIRMQRD 
Sbjct: 273 QSFQFYSDGVFAGDCGTDLDHGVAVVGYGETNDGTEYWIVKNSWGTAWGEGGYIRMQRDS 332

Query: 329 DAKEGLCGIAMQASYPT 345
               GLCGIAM+ASYP 
Sbjct: 333 GYDGGLCGIAMEASYPV 349


>gi|125604306|gb|EAZ43631.1| hypothetical protein OsJ_28254 [Oryza sativa Japonica Group]
          Length = 369

 Score =  338 bits (868), Expect = 2e-90,   Method: Compositional matrix adjust.
 Identities = 177/308 (57%), Positives = 209/308 (67%), Gaps = 17/308 (5%)

Query: 37  ERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEE 96
           E +E W  Q+ RV RD  EK  RF +FK+NV  I  FN   R++PYKL +N F D T +E
Sbjct: 46  ELYERWRGQH-RVARDLGEKARRFNVFKDNVRLIHEFNR--RDEPYKLRLNRFGDMTADE 102

Query: 97  FRAPRNGYKRRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQCGCCWAFS 156
                        S R S       R E A        R  GAV  VKDQGQCG CWAFS
Sbjct: 103 SAGA-------YASSRVSHHRMFRGRGEKAQ-------RLHGAVGAVKDQGQCGSCWAFS 148

Query: 157 AVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKY 216
            +AA+EGIN I T  LT+LSEQ+LVDCDT   + GC+GGLMD+AF++I  + G+A  + Y
Sbjct: 149 TIAAVEGINAIRTSNLTALSEQQLVDCDTKTGNAGCDGGLMDNAFQYIAKHGGVAASSAY 208

Query: 217 PYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSS 276
           PY+A   SC    A+  A  I GYEDVP+N+E+AL KAVANQPVSVAI+A GS FQFYS 
Sbjct: 209 PYRARQSSCKSSAASSPAVTIDGYEDVPANSESALKKAVANQPVSVAIEAGGSHFQFYSE 268

Query: 277 GVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCG 336
           GVF G+CGTELDHGV AVGYGT  DGTKYW+V+NSWG  WGE GYIRM+RD+ AKEGLCG
Sbjct: 269 GVFAGKCGTELDHGVAAVGYGTTVDGTKYWIVRNSWGADWGEKGYIRMKRDVSAKEGLCG 328

Query: 337 IAMQASYP 344
           IAM+ASYP
Sbjct: 329 IAMEASYP 336


>gi|115441717|ref|NP_001045138.1| Os01g0907600 [Oryza sativa Japonica Group]
 gi|5761329|dbj|BAA83473.1| cysteine endopeptidase [Oryza sativa]
 gi|20804884|dbj|BAB92565.1| cysteine endopeptidase [Oryza sativa Japonica Group]
 gi|56785107|dbj|BAD82745.1| putative cysteine proteinase [Oryza sativa Japonica Group]
 gi|113534669|dbj|BAF07052.1| Os01g0907600 [Oryza sativa Japonica Group]
 gi|119395242|gb|ABL74582.1| cysteine endopeptidase [Oryza sativa Japonica Group]
 gi|125528777|gb|EAY76891.1| hypothetical protein OsI_04850 [Oryza sativa Indica Group]
 gi|125573036|gb|EAZ14551.1| hypothetical protein OsJ_04473 [Oryza sativa Japonica Group]
          Length = 371

 Score =  338 bits (868), Expect = 2e-90,   Method: Compositional matrix adjust.
 Identities = 170/318 (53%), Positives = 218/318 (68%), Gaps = 6/318 (1%)

Query: 31  NDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFA 90
           +D  + + +E W   +  V R + EK  RF  FK+NV YI   N +   + Y+L +N F 
Sbjct: 38  SDEALWDLYERWQEHH-HVPRHHGEKHRRFGAFKDNVRYIHEHNKRG-GRGYRLRLNRFG 95

Query: 91  DQTNEEFRAPRNG-YKRRLPSVRSSETTDVSFRYENA-SVPASIDWRKKGAVTGVKDQGQ 148
           D   EEFRA   G +   L     +      F YE    +P ++DWR+KGAVTGVKDQG+
Sbjct: 96  DMGREEFRATFAGSHANDLRRDGLAAPPLPGFMYEGVRDLPRAVDWRRKGAVTGVKDQGK 155

Query: 149 CGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNK 208
           CG CWAFS V ++EGIN I T +L SLSEQEL+DCDT+ ++ GC+GGLM++AFE+I  + 
Sbjct: 156 CGSCWAFSTVVSVEGINAIRTGRLVSLSEQELIDCDTA-DNSGCQGGLMENAFEYIKHSG 214

Query: 209 GLATEAKYPYKASDGSCNKKEANPSA-AKISGYEDVPSNNEAALMKAVANQPVSVAIDAS 267
           G+ TE+ YPY+A++G+C+   A  +    I G+++VP+N+EAAL KAVANQPVSVAIDA 
Sbjct: 215 GITTESAYPYRAANGTCDAVRARRAPLVVIDGHQNVPANSEAALAKAVANQPVSVAIDAG 274

Query: 268 GSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRD 327
              FQFYS GVF G CGT+LDHGV  VGYG  +DGT+YW+VKNSWGT WGE GYIRMQRD
Sbjct: 275 DQSFQFYSDGVFAGDCGTDLDHGVAVVGYGETNDGTEYWIVKNSWGTAWGEGGYIRMQRD 334

Query: 328 IDAKEGLCGIAMQASYPT 345
                GLCGIAM+ASYP 
Sbjct: 335 SGYDGGLCGIAMEASYPV 352


>gi|30141027|dbj|BAC75927.1| cysteine protease-5 [Helianthus annuus]
          Length = 365

 Score =  338 bits (868), Expect = 2e-90,   Method: Compositional matrix adjust.
 Identities = 170/326 (52%), Positives = 222/326 (68%), Gaps = 14/326 (4%)

Query: 27  SRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGI 86
           S T  D  +   +E+W+A++G+ Y    EKE RF+IF +N+++I   +N + N+ YK+G+
Sbjct: 24  SNTRTDEEVRNTYELWLARHGKTYNALGEKESRFRIFADNLKFIDE-HNLSGNRSYKVGL 82

Query: 87  NEFADQTNEEFRAPRNGYK----RRLPSVRSSETTDVSFRY---ENASVPASIDWRKKGA 139
           N+FAD TNEE+R+   G K    RR+  ++  E   +S RY   EN   PA +DWR++GA
Sbjct: 83  NQFADLTNEEYRSMYLGTKVDPYRRIAKMQRGE---ISRRYAVQENEMFPAKVDWRERGA 139

Query: 140 VTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDD 199
           V+ VK+QG CG CWAFS VA++EGIN I T  L SLSEQELVDCD    + GC GG MD 
Sbjct: 140 VSPVKNQGGCGSCWAFSTVASVEGINKIVTGDLISLSEQELVDCDNK-YNSGCNGGSMDY 198

Query: 200 AFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQP 259
           AF+FI+SN G+ +E+ YPYK     C+          I GYEDVP  NE ALMKAVA+QP
Sbjct: 199 AFQFIVSNGGIDSESDYPYKGVGAVCDPVRNKAKIVSIDGYEDVPPMNEKALMKAVAHQP 258

Query: 260 VSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGEN 319
           VSV I+ASG  FQ Y+SGV TG CGT LDHGV  VGYG+ ++G  YW+V+NSWG  WGE+
Sbjct: 259 VSVGIEASGRAFQLYTSGVLTGSCGTNLDHGVVVVGYGS-ENGKDYWIVRNSWGPEWGED 317

Query: 320 GYIRMQRD-IDAKEGLCGIAMQASYP 344
           GYIRM+R+ +D   G+CGI + ASYP
Sbjct: 318 GYIRMERNMVDTPVGMCGITLMASYP 343


>gi|363807062|ref|NP_001242584.1| uncharacterized protein LOC100804015 precursor [Glycine max]
 gi|255640677|gb|ACU20623.1| unknown [Glycine max]
          Length = 366

 Score =  338 bits (868), Expect = 2e-90,   Method: Compositional matrix adjust.
 Identities = 172/351 (49%), Positives = 226/351 (64%), Gaps = 17/351 (4%)

Query: 2   AMILLENKLVLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFK 61
           +M ++   L L+  L   +      + T N+      +E W+ ++ + Y +  +K+ RF+
Sbjct: 3   SMTMIYTLLFLSFTLSYAIKTSTIINYTDNEVM--AMYEEWLVRHQKGYNELGKKDKRFQ 60

Query: 62  IFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRA----PRNGYKRRLPSVRSSETT 117
           +FK+N+ +I   NN   N  YKLG+N+FAD TNEE+RA     ++  KRRL   +S+   
Sbjct: 61  VFKDNLGFIQEHNNNL-NNTYKLGLNKFADMTNEEYRAMYLGTKSNAKRRLMKTKST--- 116

Query: 118 DVSFRYENAS---VPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTS 174
               RY  ++   +P  +DWR KGAV  +KDQG CG CWAFS VA +E IN I T K  S
Sbjct: 117 --GHRYAFSARDRLPVHVDWRMKGAVAPIKDQGSCGSCWAFSTVATVEAINKIVTGKFVS 174

Query: 175 LSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSA 234
           LSEQELVDCD +  ++GC GGLMD AFEFII N G+ T+  YPY+  DG C+  + N   
Sbjct: 175 LSEQELVDCDRA-YNEGCNGGLMDYAFEFIIQNGGIDTDKDYPYRGFDGICDPTKKNAKV 233

Query: 235 AKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAV 294
             I GYEDVP  +E AL KAVA+QPVSVAI+ASG   Q Y SGVFTG+CGT LDHGV  V
Sbjct: 234 VNIDGYEDVPPYDENALKKAVAHQPVSVAIEASGRALQLYQSGVFTGKCGTSLDHGVVVV 293

Query: 295 GYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPT 345
           GYG+ ++G  YWLV+NSWGT WGE+GY +MQR++    G CGI M+ASYP 
Sbjct: 294 GYGS-ENGVDYWLVRNSWGTGWGEDGYFKMQRNVRTSTGKCGITMEASYPV 343


>gi|242068363|ref|XP_002449458.1| hypothetical protein SORBIDRAFT_05g013840 [Sorghum bicolor]
 gi|241935301|gb|EES08446.1| hypothetical protein SORBIDRAFT_05g013840 [Sorghum bicolor]
          Length = 350

 Score =  338 bits (867), Expect = 2e-90,   Method: Compositional matrix adjust.
 Identities = 184/347 (53%), Positives = 238/347 (68%), Gaps = 26/347 (7%)

Query: 13  AAILVLGVWAPQSWSRTLNDAT-------MNERHEMWMAQYGRVYRDNAEKEMRFKIFKE 65
           AA+++L V      +R L+ +T       M  RH+ WMA++GR Y+D AEK  RF++FK 
Sbjct: 16  AALMILAVMTMVVEARDLSTSTGGYGEEAMKVRHQQWMAEHGRTYKDEAEKARRFQVFKA 75

Query: 66  NVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYEN 125
           N +++   +N A  K Y+L INEFAD TN+EF A   G    L  V +       F+YEN
Sbjct: 76  NADFV-DRSNAAGGKSYELAINEFADMTNDEFVAMYTG----LKPVPAGPKKMAGFKYEN 130

Query: 126 ASVP----ASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELV 181
            ++      ++DWR+KGAVTG+K+QGQCGCCWAF+AVAA+E I+ ITT  L SLSEQ+++
Sbjct: 131 LTLSDVDQQAVDWRQKGAVTGIKNQGQCGCCWAFAAVAAVESIHQITTGNLVSLSEQQVL 190

Query: 182 DCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYE 241
           DCDT G + GC GG +D+AF++IISN GLATE  YPY A+ G+C +    P A  IS Y+
Sbjct: 191 DCDTDGNN-GCNGGYIDNAFQYIISNGGLATEDAYPYAAAQGTC-QSSVQP-AVTISSYQ 247

Query: 242 DVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQ-CGT-ELDHGVTAVGYGTA 299
           DVPS +EAAL  AVANQPV+VAIDA  ++FQFYSSGV T   CGT  L+H VTAVGY TA
Sbjct: 248 DVPSGDEAALAAAVANQPVAVAIDAH-NNFQFYSSGVLTADTCGTPSLNHAVTAVGYSTA 306

Query: 300 DDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPTA 346
           +DGT YWL+KN WG  WGE GY+R++R  +A    CG+A QASYP A
Sbjct: 307 EDGTPYWLLKNQWGQNWGEGGYLRVERGTNA----CGVAQQASYPVA 349


>gi|1514953|dbj|BAA11170.1| cysteine proteinase [Oryza sativa (japonica cultivar-group)]
          Length = 368

 Score =  338 bits (867), Expect = 2e-90,   Method: Compositional matrix adjust.
 Identities = 170/317 (53%), Positives = 215/317 (67%), Gaps = 7/317 (2%)

Query: 31  NDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFA 90
           +D  + + +E W   +  V R + EK  RF  FK+NV YI   N +A   P    +N F 
Sbjct: 38  SDEALWDLYERWQEHH-HVPRHHGEKHRRFGAFKDNVRYIHEHNKRA---PGYAPLNRFG 93

Query: 91  DQTNEEFRAPRNG-YKRRLPSVRSSETTDVSFRYENA-SVPASIDWRKKGAVTGVKDQGQ 148
           D   EEFRA   G +   L     +      F YE    +P ++DWR+KGAVTGVKDQG+
Sbjct: 94  DMGREEFRATFAGSHANDLRRDGLAAPPLPGFMYEGVRDLPRAVDWRRKGAVTGVKDQGK 153

Query: 149 CGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNK 208
           CG CWAFS V ++EGIN I T +L SLSEQEL+DCDT+ ++ GC+GGLM++AFE+I  + 
Sbjct: 154 CGSCWAFSTVVSVEGINAIRTGRLVSLSEQELIDCDTA-DNSGCQGGLMENAFEYIKHSG 212

Query: 209 GLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASG 268
           G+ TE+ YPY+A++G+C+   A      I G+++VP+N+EAAL KAVANQPVSVAIDA  
Sbjct: 213 GITTESAYPYRAANGTCDAVRARGGLVVIDGHQNVPANSEAALAKAVANQPVSVAIDAGD 272

Query: 269 SDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDI 328
             FQFYS GVF G CGT+LDHGV  VGYG  +DGT+YW+VKNSWGT WGE GYIRMQRD 
Sbjct: 273 QSFQFYSDGVFAGDCGTDLDHGVAVVGYGETNDGTEYWIVKNSWGTAWGEGGYIRMQRDS 332

Query: 329 DAKEGLCGIAMQASYPT 345
               GLCGIAM+ASYP 
Sbjct: 333 GYDGGLCGIAMEASYPV 349


>gi|182375363|gb|ACB87490.1| mucunain [Mucuna pruriens]
          Length = 422

 Score =  338 bits (866), Expect = 3e-90,   Method: Compositional matrix adjust.
 Identities = 170/310 (54%), Positives = 217/310 (70%), Gaps = 9/310 (2%)

Query: 39  HEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFR 98
           +E W+ ++G+ Y    EK+ RF IFK+N+ +I   N  A N+ YKLG+N FAD TNEE+R
Sbjct: 4   YEQWLVKHGKAYNALGEKDKRFDIFKDNLRFIDDHN--ADNRTYKLGLNRFADLTNEEYR 61

Query: 99  APRNGYKRRLPSVRSSETTDVSFRYE---NASVPASIDWRKKGAVTGVKDQGQCGCCWAF 155
           A   G  R  P+ R  +T   S RY      ++P S+DWR + AV  VKDQG CG CWAF
Sbjct: 62  ARYLG-TRIDPNRRFVKTKTQSNRYAPRVGDNLPESVDWRNESAVLPVKDQGNCGSCWAF 120

Query: 156 SAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAK 215
           S + A+EGIN I T  L SLSEQELVDCDTS  +QGC GGLMD A+EFII+N G+ +E  
Sbjct: 121 STIGAVEGINKIVTGDLISLSEQELVDCDTS-YNQGCNGGLMDYAYEFIINNGGIDSEED 179

Query: 216 YPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYS 275
           YPY+A DG+C++   N     I  YEDVP+N+E AL KAVANQPVSVAI+  G +FQ Y 
Sbjct: 180 YPYRAVDGTCDQYRKNAKVVTIDSYEDVPANDELALKKAVANQPVSVAIEGGGREFQLYV 239

Query: 276 SGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDI-DAKEGL 334
           SGVFTG+CGT LDHGV AVGYG+   G  YW+V+NSWG +WGE GY+R++R++  ++ G 
Sbjct: 240 SGVFTGRCGTALDHGVVAVGYGSV-KGHDYWIVRNSWGASWGEEGYVRLERNLAKSRSGK 298

Query: 335 CGIAMQASYP 344
           CGIA++ SYP
Sbjct: 299 CGIAIEPSYP 308


>gi|9502421|gb|AAF88120.1|AC021043_13 Putative cysteine proteinase [Arabidopsis thaliana]
          Length = 331

 Score =  338 bits (866), Expect = 3e-90,   Method: Compositional matrix adjust.
 Identities = 170/324 (52%), Positives = 222/324 (68%), Gaps = 11/324 (3%)

Query: 29  TLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINE 88
           T ++  + E H+ WM ++ RVY D  EK+MRF +FK+N+++I  FN K  ++ YKLG+NE
Sbjct: 13  TFHEPIVAEHHQQWMTRFSRVYSDELEKQMRFDVFKKNLKFIEKFNKKG-DRTYKLGVNE 71

Query: 89  FADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENASVP-----ASIDWRKKGAVTGV 143
           FAD T EEF A   G K  +  + SSE  D      N +V       + DWR +GAVT V
Sbjct: 72  FADWTREEFIATHTGLKG-VNGIPSSEFVDEMIPSWNWNVSDVAGRETKDWRYEGAVTPV 130

Query: 144 KDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEF 203
           K QGQCGCCWAFS+VAA+EG+  I    L SLSEQ+L+DCD    D GC GG+M DAF +
Sbjct: 131 KYQGQCGCCWAFSSVAAVEGLTKIVGNNLVSLSEQQLLDCDRE-RDNGCNGGIMSDAFSY 189

Query: 204 IISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVA 263
           II N+G+A+EA YPY+A++G+C +    PSA  I G++ VPSNNE AL++AV+ QPVSV+
Sbjct: 190 IIKNRGIASEASYPYQAAEGTC-RYNGKPSAW-IRGFQTVPSNNERALLEAVSKQPVSVS 247

Query: 264 IDASGSDFQFYSSGVF-TGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYI 322
           IDA G  F  YS GV+    CGT ++H VT VGYGT+ +G KYWL KNSWG TWGENGYI
Sbjct: 248 IDADGPGFMHYSGGVYDEPYCGTNVNHAVTFVGYGTSPEGIKYWLAKNSWGETWGENGYI 307

Query: 323 RMQRDIDAKEGLCGIAMQASYPTA 346
           R++RD+   +G+CG+A  A YP A
Sbjct: 308 RIRRDVAWPQGMCGVAQYAFYPVA 331


>gi|194352762|emb|CAQ00109.1| papain-like cysteine proteinase [Hordeum vulgare subsp. vulgare]
 gi|326517250|dbj|BAJ99991.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 367

 Score =  338 bits (866), Expect = 3e-90,   Method: Compositional matrix adjust.
 Identities = 185/313 (59%), Positives = 219/313 (69%), Gaps = 13/313 (4%)

Query: 39  HEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFR 98
           +E W  Q+  V RD  EK  RF +F+ENV  I  FN    + PYKL +N F D T +EFR
Sbjct: 47  YERWREQH-TVARDLGEKARRFNVFRENVRLIHEFNRG--DAPYKLRLNRFGDMTADEFR 103

Query: 99  APRNGYKRRLPSVRSSETTDVSFRYENAS----VPASIDWRKKGAVTGVKDQGQCGCCWA 154
                 +     + S +     F + +A+    VP S+DWR+KGAVT VKDQGQCG CWA
Sbjct: 104 RAYASSRVSHHRMFSLKEGGGGFMHGSAASVRDVPPSVDWRQKGAVTAVKDQGQCGSCWA 163

Query: 155 FSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEA 214
           FS +AA+EGIN I ++ LTSLSEQ+LVDCDT   + GC GGLMD AF++I  + G+A E 
Sbjct: 164 FSTIAAVEGINAIRSKNLTSLSEQQLVDCDTK-SNAGCNGGLMDYAFQYIAKHGGVAAED 222

Query: 215 KYPYKASDGS-CNKKEANPSAA-KISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQ 272
            YPYKA   S CNKK   PSA   I GYEDVP+N+E AL KAVA QPV+VAI+ASGS FQ
Sbjct: 223 AYPYKARQASSCNKK---PSAVVTIDGYEDVPANDETALKKAVAAQPVAVAIEASGSHFQ 279

Query: 273 FYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKE 332
           FYS GVF G+CGTELDHGV AVGYGT  DGTKYW+VKNSWG  WGE GYIRM+RD+  KE
Sbjct: 280 FYSEGVFAGKCGTELDHGVAAVGYGTTVDGTKYWIVKNSWGPEWGEKGYIRMKRDVKDKE 339

Query: 333 GLCGIAMQASYPT 345
           GLCGIAM+ASYP 
Sbjct: 340 GLCGIAMEASYPV 352


>gi|2463586|dbj|BAA22545.1| FB22 precursor [Ananas comosus]
          Length = 340

 Score =  338 bits (866), Expect = 3e-90,   Method: Compositional matrix adjust.
 Identities = 176/339 (51%), Positives = 225/339 (66%), Gaps = 11/339 (3%)

Query: 9   KLVLAAILVLGVWA-PQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENV 67
           +LV   + +  +WA P + SR      M +R E WMA+YGRVY+DN EK  RF+IFK NV
Sbjct: 6   QLVFLFLFLCVMWASPSAASRDEPSDPMMKRFEEWMAEYGRVYKDNDEKMRRFQIFKNNV 65

Query: 68  EYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENAS 127
            +I +FNN+  N  Y LGIN+F D TN EF     G    L   R      VSF   N S
Sbjct: 66  NHIETFNNRNGNS-YTLGINKFTDMTNNEFVTQYTGVSLPLNFKREPV---VSFDDVNIS 121

Query: 128 -VPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTS 186
            V  SIDWR  GAVT VKDQ  CG CWAFSA+A +EGI  I T  L SLSEQE++DC  S
Sbjct: 122 AVGQSIDWRDYGAVTEVKDQNPCGSCWAFSAIATVEGIYKIVTGYLVSLSEQEVLDCAVS 181

Query: 187 GEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSN 246
               GC+GG +D+A++FIISN G+A+EA YPY+A +G C      P++A I+GY  V SN
Sbjct: 182 ---NGCDGGFVDNAYDFIISNNGVASEADYPYQAYEGDCTANSW-PNSAYITGYSYVRSN 237

Query: 247 NEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYW 306
           +E+++  AV NQP++ AIDASG +FQ+Y+ GVF+G CGT L+H +T +GYG    GT+YW
Sbjct: 238 DESSMKYAVWNQPIAAAIDASGDNFQYYNGGVFSGPCGTSLNHAITIIGYGQDSSGTQYW 297

Query: 307 LVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPT 345
           +VKNSWG++WGE GY+RM R + +  GLCGIAM   YPT
Sbjct: 298 IVKNSWGSSWGERGYVRMARGV-SSSGLCGIAMDPLYPT 335


>gi|326507362|dbj|BAK03074.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 473

 Score =  337 bits (865), Expect = 4e-90,   Method: Compositional matrix adjust.
 Identities = 162/314 (51%), Positives = 218/314 (69%), Gaps = 8/314 (2%)

Query: 39  HEMWMAQYGRVYRDNA----EKEMRFKIFKENVEYIASFNNKAR--NKPYKLGINEFADQ 92
           +++W+A++G     NA    ++E RF  F +N+ ++ + N +A    + ++L +N FAD 
Sbjct: 52  YDLWLAEHGGGSSPNANSIADRERRFSAFWDNLRFVDAHNARAAAGEEGFRLAMNRFADL 111

Query: 93  TNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENAS-VPASIDWRKKGAVTGVKDQGQCGC 151
           TN+EFRA   G K      R+       +R++ A  +P ++DWR+KGAV  VK+QGQCG 
Sbjct: 112 TNDEFRAAYLGVKGAAERNRAGRVVGERYRHDGAEELPEAVDWREKGAVAPVKNQGQCGS 171

Query: 152 CWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLA 211
           CWAFSAV+ +E IN I T ++ +LSEQELV+CD +G+  GC GGLMDDAFEFII N G+ 
Sbjct: 172 CWAFSAVSTVESINQIVTGEMVTLSEQELVECDINGQSSGCNGGLMDDAFEFIIKNGGID 231

Query: 212 TEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDF 271
           TE  YPYKA DG C+    N     I G+EDVP N+E +L KAVA+ PVSVAI+A G +F
Sbjct: 232 TEDDYPYKAVDGRCDVLRKNAKVVSIDGFEDVPENDEKSLQKAVAHHPVSVAIEAGGREF 291

Query: 272 QFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAK 331
           Q Y SGVF+G+CGT+LDHGV AVGYGT ++G  YW+V+NSWG  WGE GY+RM+R+I+  
Sbjct: 292 QLYHSGVFSGRCGTQLDHGVVAVGYGT-ENGKDYWIVRNSWGPNWGEAGYLRMERNINVT 350

Query: 332 EGLCGIAMQASYPT 345
            G CGIAM +SYPT
Sbjct: 351 SGKCGIAMMSSYPT 364


>gi|204307508|gb|ACI00280.1| triticain beta 2 [Hordeum vulgare]
          Length = 473

 Score =  337 bits (865), Expect = 4e-90,   Method: Compositional matrix adjust.
 Identities = 162/314 (51%), Positives = 218/314 (69%), Gaps = 8/314 (2%)

Query: 39  HEMWMAQYGRVYRDNA----EKEMRFKIFKENVEYIASFNNKAR--NKPYKLGINEFADQ 92
           +++W+A++G     NA    ++E RF  F +N+ ++ + N +A    + ++L +N FAD 
Sbjct: 52  YDLWLAEHGGGSSPNANSIADRERRFSAFWDNLRFVDAHNARAAAGEEGFRLAMNRFADL 111

Query: 93  TNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENAS-VPASIDWRKKGAVTGVKDQGQCGC 151
           TN+EFRA   G K      R+       +R++ A  +P ++DWR+KGAV  VK+QGQCG 
Sbjct: 112 TNDEFRAAYLGVKGAAERNRAGRVVGERYRHDGAEELPEAVDWREKGAVAPVKNQGQCGS 171

Query: 152 CWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLA 211
           CWAFSAV+ +E IN I T ++ +LSEQELV+CD +G+  GC GGLMDDAFEFII N G+ 
Sbjct: 172 CWAFSAVSTVESINQIVTGEMVTLSEQELVECDINGQSSGCNGGLMDDAFEFIIKNGGID 231

Query: 212 TEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDF 271
           TE  YPYKA DG C+    N     I G+EDVP N+E +L KAVA+ PVSVAI+A G +F
Sbjct: 232 TEDDYPYKAVDGRCDVLRKNAKVVSIDGFEDVPENDEKSLQKAVAHHPVSVAIEAGGREF 291

Query: 272 QFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAK 331
           Q Y SGVF+G+CGT+LDHGV AVGYGT ++G  YW+V+NSWG  WGE GY+RM+R+I+  
Sbjct: 292 QLYHSGVFSGRCGTQLDHGVVAVGYGT-ENGKDYWIVRNSWGPNWGEAGYLRMERNINVT 350

Query: 332 EGLCGIAMQASYPT 345
            G CGIAM +SYPT
Sbjct: 351 SGKCGIAMMSSYPT 364


>gi|194352750|emb|CAQ00103.1| papain-like cysteine proteinase [Hordeum vulgare subsp. vulgare]
 gi|326514262|dbj|BAJ92281.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326519402|dbj|BAJ96700.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326524351|dbj|BAK00559.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326531998|dbj|BAK01375.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 356

 Score =  337 bits (864), Expect = 5e-90,   Method: Compositional matrix adjust.
 Identities = 179/323 (55%), Positives = 221/323 (68%), Gaps = 16/323 (4%)

Query: 32  DATMNER----HEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGIN 87
           D + NER     E W+A++ + Y    EK  RF++FK+N+++I   N +  +  Y LG+N
Sbjct: 38  DLSSNERLVELFEKWLAKHQKAYASFEEKLHRFEVFKDNLKHIDKINREVTS--YWLGLN 95

Query: 88  EFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENAS---VPASIDWRKKGAVTGVK 144
           EFAD T++EF+A   G     P+ R S     SFRYE+ S   +P S+DWRKKGAVT VK
Sbjct: 96  EFADLTHDEFKAAYLGLDAA-PARRGSSR---SFRYEDVSASDLPKSVDWRKKGAVTEVK 151

Query: 145 DQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFI 204
           +QGQCG CWAFS VAA+EGIN I T  LT+LSEQEL+DC   G + GC GGLMD AF +I
Sbjct: 152 NQGQCGSCWAFSTVAAVEGINAIVTGNLTALSEQELIDCSVDG-NSGCNGGLMDYAFSYI 210

Query: 205 ISNKGLATEAKYPYKASDGSC-NKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVA 263
            S+ GL TE  YPY   +GSC + K+A   A  ISGYEDVP+N+E AL+KA+A+QPVSVA
Sbjct: 211 ASSGGLHTEEAYPYLMEEGSCGDGKKAESEAVTISGYEDVPANDEQALIKALAHQPVSVA 270

Query: 264 IDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGT-ADDGTKYWLVKNSWGTTWGENGYI 322
           I+ASG  FQFYS GVF G CG +LDHGV AVGYG+    G  Y +V+NSWG  WGE GYI
Sbjct: 271 IEASGRHFQFYSGGVFDGPCGAQLDHGVAAVGYGSDKGKGHDYIIVRNSWGAQWGEKGYI 330

Query: 323 RMQRDIDAKEGLCGIAMQASYPT 345
           RM+R     EGLCGI   ASYPT
Sbjct: 331 RMKRGTSNGEGLCGINKMASYPT 353


>gi|194352756|emb|CAQ00106.1| papain-like cysteine proteinase [Hordeum vulgare subsp. vulgare]
          Length = 476

 Score =  337 bits (864), Expect = 5e-90,   Method: Compositional matrix adjust.
 Identities = 162/314 (51%), Positives = 218/314 (69%), Gaps = 8/314 (2%)

Query: 39  HEMWMAQYGRVYRDNA----EKEMRFKIFKENVEYIASFNNKAR--NKPYKLGINEFADQ 92
           +++W+A++G     NA    ++E RF  F +N+ ++ + N +A    + ++L +N FAD 
Sbjct: 52  YDLWLAEHGGGSSPNANSIADRERRFSAFWDNLRFVDAHNARAAAGEEGFRLAMNRFADL 111

Query: 93  TNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENAS-VPASIDWRKKGAVTGVKDQGQCGC 151
           TN+EFRA   G K      R+       +R++ A  +P ++DWR+KGAV  VK+QGQCG 
Sbjct: 112 TNDEFRAAYLGVKGAAERNRAGRVVGDRYRHDGAEELPEAVDWREKGAVAPVKNQGQCGS 171

Query: 152 CWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLA 211
           CWAFSAV+ +E IN I T ++ +LSEQELV+CD +G+  GC GGLMDDAFEFII N G+ 
Sbjct: 172 CWAFSAVSTVESINQIVTGEMVTLSEQELVECDINGQSSGCNGGLMDDAFEFIIKNGGID 231

Query: 212 TEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDF 271
           TE  YPYKA DG C+    N     I G+EDVP N+E +L KAVA+ PVSVAI+A G +F
Sbjct: 232 TEDDYPYKAVDGRCDVLRKNAKVVSIDGFEDVPENDEKSLQKAVAHHPVSVAIEAGGREF 291

Query: 272 QFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAK 331
           Q Y SGVF+G+CGT+LDHGV AVGYGT ++G  YW+V+NSWG  WGE GY+RM+R+I+  
Sbjct: 292 QLYHSGVFSGRCGTQLDHGVVAVGYGT-ENGKDYWIVRNSWGPNWGEAGYLRMERNINVT 350

Query: 332 EGLCGIAMQASYPT 345
            G CGIAM +SYPT
Sbjct: 351 SGKCGIAMMSSYPT 364


>gi|302764466|ref|XP_002965654.1| hypothetical protein SELMODRAFT_230713 [Selaginella moellendorffii]
 gi|300166468|gb|EFJ33074.1| hypothetical protein SELMODRAFT_230713 [Selaginella moellendorffii]
          Length = 345

 Score =  337 bits (863), Expect = 7e-90,   Method: Compositional matrix adjust.
 Identities = 163/307 (53%), Positives = 218/307 (71%), Gaps = 8/307 (2%)

Query: 39  HEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFR 98
           ++ W+ ++G+ Y    E + RF+IFKENV YI S N + RN  + LG+N+FAD TN EFR
Sbjct: 38  YQKWIQEHGKAYNSAHEYKKRFQIFKENVNYINSHNAR-RNNSHSLGLNKFADLTNSEFR 96

Query: 99  APRNGYKRRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAV 158
               G  +R       E  D++   + A+   S+DWRKKG VT +KDQG CG CWAFSAV
Sbjct: 97  GLYVGRLQRPAPFH--EVGDIALVADTAT---SVDWRKKGGVTEIKDQGDCGSCWAFSAV 151

Query: 159 AAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPY 218
           AA+EG+  ++T  L SLSEQELVDCDT+  +QGC+GG+MD AF+++I N G+ +++ YPY
Sbjct: 152 AAVEGLTFLSTGTLVSLSEQELVDCDTT-VNQGCDGGIMDYAFQYMIRNGGITSQSNYPY 210

Query: 219 KASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGV 278
           +A  G+C+K +    AA I+G++ +P  +E  L++AVANQPVSVAI+A G DFQ YSSGV
Sbjct: 211 RALRGACDKDKVKYHAATINGFQAIPPQSEELLLRAVANQPVSVAIEAGGQDFQLYSSGV 270

Query: 279 FTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIA 338
           FTG+CG+ LDHGV  VGYGT   G +YWLVKNSWG+ WGE+GY+RM+R      G+CGI 
Sbjct: 271 FTGECGSNLDHGVAIVGYGTDAGGRQYWLVKNSWGSGWGESGYVRMERQ-GPGAGVCGIN 329

Query: 339 MQASYPT 345
           + ASYPT
Sbjct: 330 LDASYPT 336


>gi|297603535|ref|NP_001054211.2| Os04g0670200 [Oryza sativa Japonica Group]
 gi|109939735|sp|P25777.2|ORYB_ORYSJ RecName: Full=Oryzain beta chain; Flags: Precursor
 gi|32488398|emb|CAE02823.1| OSJNBa0043A12.28 [Oryza sativa Japonica Group]
 gi|90399163|emb|CAJ86092.1| H0818H01.14 [Oryza sativa Indica Group]
 gi|125550169|gb|EAY95991.1| hypothetical protein OsI_17862 [Oryza sativa Indica Group]
 gi|215766596|dbj|BAG98700.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|255675868|dbj|BAF16125.2| Os04g0670200 [Oryza sativa Japonica Group]
          Length = 466

 Score =  337 bits (863), Expect = 7e-90,   Method: Compositional matrix adjust.
 Identities = 165/311 (53%), Positives = 220/311 (70%), Gaps = 9/311 (2%)

Query: 39  HEMWMAQYGRVYRD--NAEKEMRFKIFKENVEYIASFNNKARNKP-YKLGINEFADQTNE 95
           +++W+A+ G    +    E E RF +F +N++++ + N +A  +  ++LG+N FAD TNE
Sbjct: 52  YDLWLAENGGGSPNALGGEHERRFLVFWDNLKFVDAHNARADERGGFRLGMNRFADLTNE 111

Query: 96  EFRAPRNGYKRRLPSVRSSETTDVSFRYENAS-VPASIDWRKKGAVTGVKDQGQCGCCWA 154
           EFRA   G K    S  + E     +R++    +P S+DWR+KGAV  VK+QGQCG CWA
Sbjct: 112 EFRATFLGAKVAERSRAAGE----RYRHDGVEELPESVDWREKGAVAPVKNQGQCGSCWA 167

Query: 155 FSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEA 214
           FSAV+ +E IN + T ++ +LSEQELV+C T+G++ GC GGLMDDAF+FII N G+ TE 
Sbjct: 168 FSAVSTVESINQLVTGEMITLSEQELVECSTNGQNSGCNGGLMDDAFDFIIKNGGIDTED 227

Query: 215 KYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFY 274
            YPYKA DG C+    N     I G+EDVP N+E +L KAVA+QPVSVAI+A G +FQ Y
Sbjct: 228 DYPYKAVDGKCDINRENAKVVSIDGFEDVPQNDEKSLQKAVAHQPVSVAIEAGGREFQLY 287

Query: 275 SSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGL 334
            SGVF+G+CGT LDHGV AVGYGT D+G  YW+V+NSWG  WGE+GY+RM+R+I+   G 
Sbjct: 288 HSGVFSGRCGTSLDHGVVAVGYGT-DNGKDYWIVRNSWGPKWGESGYVRMERNINVTTGK 346

Query: 335 CGIAMQASYPT 345
           CGIAM ASYPT
Sbjct: 347 CGIAMMASYPT 357


>gi|535473|emb|CAA53377.1| cysteine protease [Vicia sativa]
          Length = 368

 Score =  336 bits (862), Expect = 8e-90,   Method: Compositional matrix adjust.
 Identities = 176/353 (49%), Positives = 228/353 (64%), Gaps = 20/353 (5%)

Query: 2   AMILLENKLVLAAI---LVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEM 58
           +M +L   L  + I   L L +  P   S   ND  M   +E W+ ++ +VY    EK+ 
Sbjct: 3   SMTILPFFLFFSLITFSLALDIQLPTGRS---NDEVMT-MYEEWLVKHQKVYNGLREKDQ 58

Query: 59  RFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFR----APRNGYKRRLPSVRSS 114
           RF+IFK+N+ +I   N  A+N  Y +G+N+FAD TNEE+R      R+  KRR   +  +
Sbjct: 59  RFQIFKDNLNFIDEHN--AQNYTYIVGLNKFADMTNEEYRDMYLGTRSDIKRR---IMKN 113

Query: 115 ETTDVSFRYENAS-VPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLT 173
           + T   + Y +   +P  +DWR KGA+T +KDQG CG CWAFS +A +E IN I T KL 
Sbjct: 114 KITGHRYAYNSGDRLPVHVDWRLKGAITHIKDQGSCGSCWAFSTIATVEAINKIVTGKLV 173

Query: 174 SLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPS 233
           SLSEQELVDCD +  ++GC GGLMD AFEFII N G+ T+  YPYK  +G C+       
Sbjct: 174 SLSEQELVDCDRA-FNEGCNGGLMDYAFEFIIGNGGIDTDQHYPYKGFEGRCDPTRKKAK 232

Query: 234 AAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTA 293
              I GYEDVPSNNE AL KAVA+QPVSVAI+ASG   Q Y SGVFTG+CGT LDH V  
Sbjct: 233 IVSIDGYEDVPSNNENALKKAVAHQPVSVAIEASGRALQLYQSGVFTGKCGTSLDHAVVI 292

Query: 294 VGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKE-GLCGIAMQASYPT 345
           VGYG+ ++G  YWLV+NSWGT WGE+GY +M+R++     G CGIA++ASYP 
Sbjct: 293 VGYGS-ENGLDYWLVRNSWGTNWGEDGYFKMERNVKGTHTGKCGIAVEASYPV 344


>gi|22759715|dbj|BAC10906.1| cysteine proteinase [Zinnia elegans]
          Length = 352

 Score =  336 bits (861), Expect = 1e-89,   Method: Compositional matrix adjust.
 Identities = 165/306 (53%), Positives = 211/306 (68%), Gaps = 6/306 (1%)

Query: 40  EMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRA 99
           E W+ ++ + Y    EK  RF+IF +N+++I   N K  N  Y LG+NEFAD T+EEF+ 
Sbjct: 50  ESWLVKHSKFYESLDEKLHRFEIFMDNLKHIDETNKKVSN--YWLGLNEFADLTHEEFKH 107

Query: 100 PRNGYKRRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVA 159
              G+K  L   +   + +  +R +   +P S+DWRKKGAV  VK+QGQCG CWAFS VA
Sbjct: 108 KFLGFKGELAERKDESSKEFGYR-DFVDLPKSVDWRKKGAVAPVKNQGQCGSCWAFSTVA 166

Query: 160 AMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYK 219
           A+EGIN I T  LT LSEQEL+DCDT+  + GC GGLMD AF +++ + GL  E +YPY 
Sbjct: 167 AVEGINQIVTGNLTMLSEQELIDCDTTF-NNGCNGGLMDYAFAYVMRS-GLHKEEEYPYI 224

Query: 220 ASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVF 279
            S+G+C++K+       ISGY DVP N+EA+ +KA+ANQP+SVAI+ASG DFQFYS GVF
Sbjct: 225 MSEGTCDEKKDVSEKVTISGYHDVPRNDEASFLKALANQPISVAIEASGRDFQFYSGGVF 284

Query: 280 TGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAM 339
            G CGTELDHGV AVGYGT   G  Y +V+NSWG  WGE GYIRM+R      G+CG+ M
Sbjct: 285 DGHCGTELDHGVAAVGYGTT-KGLDYVIVRNSWGPKWGEKGYIRMKRGSGKPHGMCGLYM 343

Query: 340 QASYPT 345
            ASYPT
Sbjct: 344 MASYPT 349


>gi|449525012|ref|XP_004169515.1| PREDICTED: cysteine proteinase RD21a-like [Cucumis sativus]
          Length = 459

 Score =  336 bits (861), Expect = 1e-89,   Method: Compositional matrix adjust.
 Identities = 174/337 (51%), Positives = 222/337 (65%), Gaps = 6/337 (1%)

Query: 10  LVLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRD-NAEKEMRFKIFKENVE 68
           L+    + L   +P S      D  +   ++ W A++G+++ +  AE E RF IFK+N++
Sbjct: 12  LLFFLFIALSAASPSSIIPQRTDDEVMALYDQWRAKHGKLHNNLGAEPENRFHIFKDNLK 71

Query: 69  YIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENASV 128
           +I   N  A+N PY+LG+N FAD TNEE+R+   G K    S R + T++         +
Sbjct: 72  FIDEIN--AQNLPYRLGLNVFADLTNEEYRSRYLGGKFASGS-RRNRTSNRYLPRLGDDL 128

Query: 129 PASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGE 188
           P SIDWR KGAV  VKDQG CG CWAFS VA++E IN I T  L +LSEQELVDCD S  
Sbjct: 129 PDSIDWRAKGAVAPVKDQGSCGSCWAFSTVASVEAINQIVTGDLIALSEQELVDCDRS-Y 187

Query: 189 DQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNE 248
           ++GC GGLMD AFEFII N GL TE  YPY   D SC + + N     I  YEDVP NNE
Sbjct: 188 NEGCNGGLMDYAFEFIIENGGLDTEEDYPYYGFDSSCIQYKKNAKVVAIDSYEDVPVNNE 247

Query: 249 AALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLV 308
            AL KAV+ Q VSVAI+  G  FQ Y SG+FTG+CGT+LDHGV  VGYG+ + G  YW+V
Sbjct: 248 KALQKAVSKQVVSVAIEGGGRSFQLYQSGIFTGRCGTDLDHGVNVVGYGS-EGGVDYWIV 306

Query: 309 KNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPT 345
           +NSWG +WGE+GY++MQR+I +  GLCGIAM+ SYPT
Sbjct: 307 RNSWGGSWGESGYVKMQRNIASPTGLCGIAMEPSYPT 343


>gi|357130141|ref|XP_003566711.1| PREDICTED: xylem cysteine proteinase 1-like [Brachypodium
           distachyon]
          Length = 457

 Score =  335 bits (860), Expect = 1e-89,   Method: Compositional matrix adjust.
 Identities = 174/314 (55%), Positives = 216/314 (68%), Gaps = 12/314 (3%)

Query: 37  ERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEE 96
           E  E W+A++ + Y    EK  RF++FK+N+++I   N +  +  Y LG+NEFAD T+EE
Sbjct: 148 ELFEKWLAKHQKAYASFEEKLHRFEVFKDNLKHIDKVNREVTS--YWLGLNEFADLTHEE 205

Query: 97  FRAPRNGYKRRLPSVRSSETTDVSFRYENAS---VPASIDWRKKGAVTGVKDQGQCGCCW 153
           F+A   G     P+  S      SF+YE+ S   +P S+DWR KGAVT VK+QGQCG CW
Sbjct: 206 FKATYLGLAPPAPARESRG----SFKYEDVSADDLPKSVDWRTKGAVTEVKNQGQCGSCW 261

Query: 154 AFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATE 213
           AFS VAA+EGIN I T  LT+LSEQEL+DC   G + GC GGLMD AF +I S+ GL TE
Sbjct: 262 AFSTVAAVEGINAIVTGNLTALSEQELIDCSVDG-NNGCNGGLMDYAFSYIASSGGLHTE 320

Query: 214 AKYPYKASDGSC-NKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQ 272
             YPY   +GSC + K++   A  ISGYEDVP++NE AL+KA+A+QPVSVAI+ASG  FQ
Sbjct: 321 EAYPYLMEEGSCGDGKKSESEAVTISGYEDVPAHNEQALIKALAHQPVSVAIEASGRHFQ 380

Query: 273 FYSSGVFTGQCGTELDHGVTAVGYGT-ADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAK 331
           FYS GVF G CGT+LDHGV AVGYG+    G  Y +V+NSWG  WGE GYIRM+R     
Sbjct: 381 FYSGGVFDGPCGTQLDHGVAAVGYGSDKGKGHDYIIVRNSWGAKWGEKGYIRMKRGTGKG 440

Query: 332 EGLCGIAMQASYPT 345
           EGLCGI   ASYPT
Sbjct: 441 EGLCGINKMASYPT 454


>gi|225446523|ref|XP_002275891.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP2 [Vitis vinifera]
          Length = 358

 Score =  335 bits (859), Expect = 2e-89,   Method: Compositional matrix adjust.
 Identities = 169/311 (54%), Positives = 208/311 (66%), Gaps = 8/311 (2%)

Query: 35  MNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTN 94
           M +R+E W+ Q+GR Y++  E +  F I++ NV +I   N  A+N  + L  N+FAD TN
Sbjct: 41  MEKRYERWLVQHGRRYKNRDEWQRHFGIYQSNVRFINYIN--AQNFSFTLTDNQFADMTN 98

Query: 95  EEFRAPRNGYKRRLPSVRSSETTDVSFRYENASV-PASIDWRKKGAVTGVKDQGQCGCCW 153
           EE++A   G    L +  +S     SF+ E + V P S+DWRK GAVT V++QG+CG CW
Sbjct: 99  EEYKALYMG----LGTSETSRKNQSSFKRERSKVLPISVDWRKMGAVTPVRNQGECGSCW 154

Query: 154 AFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATE 213
           AFS VAA+EGIN I T KL SLSEQEL+DCD    ++GC GG M +AF+FI  N G+ T 
Sbjct: 155 AFSTVAAVEGINKIRTGKLVSLSEQELLDCDIDSGNEGCNGGYMVNAFKFIKQNGGITTA 214

Query: 214 AKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQF 273
             YPY    G CNK +A     KISGYE VP NNE  L  AVA QPVSVAIDA G +FQ 
Sbjct: 215 RNYPYIGEQGICNKDKAANHVVKISGYETVPPNNEKILQAAVAKQPVSVAIDAGGYEFQL 274

Query: 274 YSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEG 333
           YS G+F G CG +L+H VT +GYG  D+G KYWLVKNSWGT WGE GY RM RD    EG
Sbjct: 275 YSKGIFNGFCGKQLNHAVTVIGYGE-DNGKKYWLVKNSWGTGWGEAGYARMIRDSRDDEG 333

Query: 334 LCGIAMQASYP 344
           +CGIAM+ASYP
Sbjct: 334 ICGIAMEASYP 344


>gi|297745594|emb|CBI40759.3| unnamed protein product [Vitis vinifera]
          Length = 300

 Score =  335 bits (859), Expect = 2e-89,   Method: Compositional matrix adjust.
 Identities = 168/304 (55%), Positives = 210/304 (69%), Gaps = 8/304 (2%)

Query: 43  MAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRN 102
           M+++G+ YR   EK  RF++F++N+++I   N K  +  Y LG+NEFAD ++EEF+    
Sbjct: 1   MSKHGKSYRSFEEKLHRFEVFQDNLKHIDETNKKVSS--YWLGLNEFADLSHEEFKRKYL 58

Query: 103 GYKRRLPSVRSSETTDVSFRYEN-ASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAM 161
           G K  LP  R S      F Y++ A +P S+DWRKKGAV  VK+QG CG CWAFS VAA+
Sbjct: 59  GLKIELPKRRDSPE---EFSYKDVADLPKSVDWRKKGAVAHVKNQGACGSCWAFSTVAAV 115

Query: 162 EGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKAS 221
           EGIN I T  LT+LSEQEL+DCD    + GC GGLMD AF FIISN GL  E  YPY   
Sbjct: 116 EGINQIVTGNLTALSEQELIDCDKPF-NNGCNGGLMDYAFAFIISNGGLRKEEDYPYVME 174

Query: 222 DGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTG 281
           +G+C +K+       ISGY DVP +NE + +KA+ANQP+SVAI+AS   FQFYS G+F G
Sbjct: 175 EGTCGEKKEELEVVTISGYHDVPEDNEQSFLKALANQPLSVAIEASSRGFQFYSGGIFNG 234

Query: 282 QCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQA 341
            CGTELDHGV AVGYGT+  G  Y  VKNSWG+ WGE GYIRM+R++   EG+CGI   A
Sbjct: 235 HCGTELDHGVAAVGYGTS-KGVDYITVKNSWGSKWGEKGYIRMKRNVGKPEGICGIYKMA 293

Query: 342 SYPT 345
           SYPT
Sbjct: 294 SYPT 297


>gi|359359166|gb|AEV41071.1| putative oryzain beta chain precursor [Oryza minuta]
          Length = 464

 Score =  335 bits (859), Expect = 2e-89,   Method: Compositional matrix adjust.
 Identities = 167/308 (54%), Positives = 219/308 (71%), Gaps = 6/308 (1%)

Query: 39  HEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFR 98
           +++W+A+ GR Y    E E RF++F +N+ +  + N +A +  ++LG+N FAD TNEEFR
Sbjct: 53  YDLWLAENGRSYNALGEHERRFRVFWDNLRFADAHNARADDHGFRLGMNRFADLTNEEFR 112

Query: 99  APRNGYKRRLPSVRSSETTDVSFRYENAS-VPASIDWRKKGAVTGVKDQGQCGCCWAFSA 157
           A   G K     V  S      +R++    +P S+DWR+KGAV  VK+QGQCG CWAFSA
Sbjct: 113 ATFLGAK----VVERSRAAGERYRHDGVEELPESVDWREKGAVAPVKNQGQCGSCWAFSA 168

Query: 158 VAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYP 217
           V+ +E IN + T ++ +LSEQELV+C T+G++ GC GGLMDDAF+FII N G+ TE  YP
Sbjct: 169 VSTVESINQLVTGEMITLSEQELVECSTNGQNGGCNGGLMDDAFDFIIKNGGIDTEDDYP 228

Query: 218 YKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSG 277
           YKA DG C+    N     I G+EDVP N+E +L KAVA+QPVSVAI+A G +FQ Y SG
Sbjct: 229 YKAVDGKCDINRENAKVVSIDGFEDVPQNDEKSLQKAVAHQPVSVAIEAGGREFQLYHSG 288

Query: 278 VFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGI 337
           VF+G+CGT LDHGV AVGYGT D+G  YW+V+NSWG  WGE+GY+RM+R+I+   G CGI
Sbjct: 289 VFSGRCGTSLDHGVVAVGYGT-DNGKDYWIVRNSWGPKWGESGYVRMERNINVTTGKCGI 347

Query: 338 AMQASYPT 345
           AM ASYPT
Sbjct: 348 AMMASYPT 355


>gi|302143380|emb|CBI21941.3| unnamed protein product [Vitis vinifera]
          Length = 354

 Score =  335 bits (859), Expect = 2e-89,   Method: Compositional matrix adjust.
 Identities = 169/311 (54%), Positives = 208/311 (66%), Gaps = 8/311 (2%)

Query: 35  MNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTN 94
           M +R+E W+ Q+GR Y++  E +  F I++ NV +I   N  A+N  + L  N+FAD TN
Sbjct: 37  MEKRYERWLVQHGRRYKNRDEWQRHFGIYQSNVRFINYIN--AQNFSFTLTDNQFADMTN 94

Query: 95  EEFRAPRNGYKRRLPSVRSSETTDVSFRYENASV-PASIDWRKKGAVTGVKDQGQCGCCW 153
           EE++A   G    L +  +S     SF+ E + V P S+DWRK GAVT V++QG+CG CW
Sbjct: 95  EEYKALYMG----LGTSETSRKNQSSFKRERSKVLPISVDWRKMGAVTPVRNQGECGSCW 150

Query: 154 AFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATE 213
           AFS VAA+EGIN I T KL SLSEQEL+DCD    ++GC GG M +AF+FI  N G+ T 
Sbjct: 151 AFSTVAAVEGINKIRTGKLVSLSEQELLDCDIDSGNEGCNGGYMVNAFKFIKQNGGITTA 210

Query: 214 AKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQF 273
             YPY    G CNK +A     KISGYE VP NNE  L  AVA QPVSVAIDA G +FQ 
Sbjct: 211 RNYPYIGEQGICNKDKAANHVVKISGYETVPPNNEKILQAAVAKQPVSVAIDAGGYEFQL 270

Query: 274 YSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEG 333
           YS G+F G CG +L+H VT +GYG  D+G KYWLVKNSWGT WGE GY RM RD    EG
Sbjct: 271 YSKGIFNGFCGKQLNHAVTVIGYGE-DNGKKYWLVKNSWGTGWGEAGYARMIRDSRDDEG 329

Query: 334 LCGIAMQASYP 344
           +CGIAM+ASYP
Sbjct: 330 ICGIAMEASYP 340


>gi|146215980|gb|ABQ10192.1| actinidin Act2a [Actinidia deliciosa]
          Length = 378

 Score =  335 bits (858), Expect = 2e-89,   Method: Compositional matrix adjust.
 Identities = 179/347 (51%), Positives = 223/347 (64%), Gaps = 17/347 (4%)

Query: 4   ILLENKLVLAAILVL--GVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFK 61
           I+ ++ L  + +L+L   +    S  RT ND  M   +E W+ ++G+ Y    EKEMRF+
Sbjct: 7   IISKSLLFFSTLLILSSAIDIENSVQRT-NDQVM-AMYESWLVEHGKSYNSLDEKEMRFE 64

Query: 62  IFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSF 121
           IFKEN+  I   N  A N+ Y LG+N FAD T+EE+R+   G KR          TDVS 
Sbjct: 65  IFKENLRIIDDHNADA-NRSYSLGLNRFADLTDEEYRSTYLGLKR-------GPKTDVSN 116

Query: 122 RYE---NASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQ 178
           +Y      ++P  +DWR  GAV GVK+QG C  CWAFSAVAA+EGIN I T  L SLSEQ
Sbjct: 117 QYMPKVGDALPDYVDWRTVGAVVGVKNQGLCSSCWAFSAVAAVEGINKIVTGNLISLSEQ 176

Query: 179 ELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKIS 238
           ELVDC  +   +GC  GLM DAF+FII+N G+ TE  YPY A DG CN    N     I 
Sbjct: 177 ELVDCGRTQITKGCNRGLMTDAFKFIINNGGINTENNYPYTAKDGQCNLSLKNQKYVTID 236

Query: 239 GYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGT 298
            Y++VPSNNE AL KAVA QPVSV +++ G  F+ Y+SG+FTG CGT +DHGVT VGYGT
Sbjct: 237 SYKNVPSNNEMALKKAVAYQPVSVGVESEGGKFKLYTSGIFTGSCGTAVDHGVTIVGYGT 296

Query: 299 ADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPT 345
            + G  YW+VKNSWGT WGE+GYIR+QR+I    G CGIA   SYP 
Sbjct: 297 -ERGMDYWIVKNSWGTNWGESGYIRIQRNIGG-AGKCGIAKMPSYPV 341


>gi|121308860|dbj|BAF43527.1| cysteine proteinase [Zinnia elegans]
          Length = 352

 Score =  335 bits (858), Expect = 3e-89,   Method: Compositional matrix adjust.
 Identities = 165/306 (53%), Positives = 211/306 (68%), Gaps = 6/306 (1%)

Query: 40  EMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRA 99
           E W+ ++ + Y    EK  RF+IF +N+++I   N K  N  Y LG+NEFAD T+EEF+ 
Sbjct: 50  ESWLVKHSKFYESLDEKLHRFEIFMDNLKHIDETNKKVSN--YWLGLNEFADLTHEEFKH 107

Query: 100 PRNGYKRRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVA 159
              G+K  L   +   + +  +R +   +P S+DWRKKGAV  VK+QGQCG CWAFS VA
Sbjct: 108 KFLGFKGELAERKDESSKEFGYR-DFVDLPKSVDWRKKGAVAPVKNQGQCGNCWAFSTVA 166

Query: 160 AMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYK 219
           A+EGIN I T  LT LSEQEL+DCDT+  + GC GGLMD AF +++ + GL  E +YPY 
Sbjct: 167 AVEGINQIVTGNLTMLSEQELIDCDTTF-NNGCNGGLMDYAFAYVMRS-GLHKEEEYPYI 224

Query: 220 ASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVF 279
            S+G+C++K+       ISGY DVP N+EA+ +KA+ANQP+SVAI+ASG DFQFYS GVF
Sbjct: 225 MSEGTCDEKKDVSEKVTISGYHDVPRNDEASFLKALANQPISVAIEASGRDFQFYSGGVF 284

Query: 280 TGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAM 339
            G CGTELDHGV AVGYGT   G  Y +V+NSWG  WGE GYIRM+R      G+CG+ M
Sbjct: 285 DGHCGTELDHGVAAVGYGTT-KGLDYVIVRNSWGPKWGEKGYIRMKRGSGKPHGMCGLYM 343

Query: 340 QASYPT 345
            ASYPT
Sbjct: 344 MASYPT 349


>gi|359359118|gb|AEV41024.1| putative oryzain beta chain precursor [Oryza minuta]
          Length = 493

 Score =  334 bits (857), Expect = 3e-89,   Method: Compositional matrix adjust.
 Identities = 167/341 (48%), Positives = 223/341 (65%), Gaps = 39/341 (11%)

Query: 39  HEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKA-RNKPYKLGINEFADQTNEEF 97
           +++W+A+ GR Y    E+E RF++F +N++++ + N +A  +  ++LG+N FAD TN+EF
Sbjct: 49  YDLWLAENGRSYNALGERERRFRVFWDNLKFVDAHNARADEHGGFRLGMNRFADLTNDEF 108

Query: 98  RAPRNGYKRRLPSVRSSETTDVSFRYENAS-VPASIDWRKKGAVTGVKDQGQC------- 149
           RA   G K     V  S      +R++    +P S+DWR+KGAV  VK+QGQC       
Sbjct: 109 RATFLGAK----FVERSRAAGERYRHDGVEELPESVDWREKGAVAPVKNQGQCVDRIIVW 164

Query: 150 -------------------------GCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCD 184
                                    G CWAFSAV+ +E IN + T ++ +LSEQELV+C 
Sbjct: 165 NSMVRIYVVDAGCMLENPLMGLTVQGSCWAFSAVSTVESINQLVTGEMITLSEQELVECS 224

Query: 185 TSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVP 244
           T+G++ GC GGLMDDAF+FII N G+ TE  YPYKA DG C+    N     I G+EDVP
Sbjct: 225 TNGQNSGCNGGLMDDAFDFIIKNGGIDTEDDYPYKAVDGKCDINRENAKVVSIDGFEDVP 284

Query: 245 SNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTK 304
            N+E +L KAVA+QPVSVAI+A G +FQ Y SGVF+G+CGT LDHGV AVGYGT D+G  
Sbjct: 285 QNDEKSLQKAVAHQPVSVAIEAGGREFQLYHSGVFSGRCGTSLDHGVVAVGYGT-DNGKD 343

Query: 305 YWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPT 345
           YW+V+NSWG  WGE+GY+RM+R+I+A  G CGIAM ASYPT
Sbjct: 344 YWIVRNSWGPKWGESGYVRMERNINATTGKCGIAMMASYPT 384


>gi|168006315|ref|XP_001755855.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162693174|gb|EDQ79528.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 454

 Score =  334 bits (856), Expect = 4e-89,   Method: Compositional matrix adjust.
 Identities = 172/316 (54%), Positives = 216/316 (68%), Gaps = 10/316 (3%)

Query: 31  NDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFA 90
           N+  ++E+   W  ++G+VY    E   R+ ++K+N+EYI   + K  N+ Y LG+ +FA
Sbjct: 38  NERLLSEQFGAWAHKHGKVYSSLEEHAHRYMVWKDNLEYIQRHSEK--NRSYWLGLTKFA 95

Query: 91  DQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQCG 150
           D TN+EFR    G  R   S RS   T   FRY ++  P S+DWRKKGAVT VKDQG CG
Sbjct: 96  DITNDEFRRQYTG-TRIDRSKRSKRKT--GFRYADSEAPESVDWRKKGAVTTVKDQGSCG 152

Query: 151 CCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGL 210
            CWAFSA+ ++EGIN I T +  SLSEQELVDCD    +QGC GGLMD AF+FI+ N G+
Sbjct: 153 SCWAFSAIGSVEGINAIRTGEAVSLSEQELVDCDLE-YNQGCNGGLMDYAFDFILENGGI 211

Query: 211 ATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSD 270
            TE  YPYK  DG C+  + N     I GYEDVP N+E AL KAVA QPVSVAI+A G D
Sbjct: 212 DTENDYPYKGLDGRCDNNKKNAHVVTIDGYEDVPENDEEALKKAVAGQPVSVAIEAGGRD 271

Query: 271 FQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDI-- 328
           FQ YS GVFTG+CGT+LDHGV AVGYG+ +    YW+VKNSWG  WGE+GY+RMQR+I  
Sbjct: 272 FQLYSGGVFTGECGTDLDHGVLAVGYGS-EGSLDYWIVKNSWGEYWGESGYLRMQRNIKD 330

Query: 329 -DAKEGLCGIAMQASY 343
            + + GLCGI ++ SY
Sbjct: 331 SNHQFGLCGINIEPSY 346


>gi|255032|gb|AAB23155.1| COT44=cysteine proteinase homolog [Brassica napus, seedling, rapid
           cycling base population CrGC5, Peptide, 328 aa]
          Length = 328

 Score =  334 bits (856), Expect = 4e-89,   Method: Compositional matrix adjust.
 Identities = 161/295 (54%), Positives = 207/295 (70%), Gaps = 7/295 (2%)

Query: 55  EKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSS 114
           +++ RF IFK+N+ +I   N   +N  YKLG+  FA+ TN+E+R+   G  R  P  R +
Sbjct: 24  QQDERFNIFKDNLRFIDLHNENNKNATYKLGLTIFANLTNDEYRSLYLG-ARTEPVRRIT 82

Query: 115 ETTDVSFRYENA----SVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTR 170
           +  +V+ +Y  A     VP ++DWR+KGAV  +KDQG CG CWAFS  AA+EGIN I T 
Sbjct: 83  KAKNVNMKYSAAVNDVEVPVTVDWRQKGAVNAIKDQGTCGSCWAFSTAAAVEGINKIVTG 142

Query: 171 KLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEA 230
           +L SLSEQELVDCD S  +QGC GGLMD AF+FI+ N GL TE  YPY  ++G CN    
Sbjct: 143 ELVSLSEQELVDCDKS-YNQGCNGGLMDYAFQFIMKNGGLNTEKDYPYHGTNGKCNSLLK 201

Query: 231 NPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHG 290
           N     I GYEDVPS +E AL +AV+ QPVSVAIDA G  FQ Y SG+FTG+CGT +DH 
Sbjct: 202 NSRVVTIDGYEDVPSKDETALKRAVSYQPVSVAIDAGGRAFQHYQSGIFTGKCGTNMDHA 261

Query: 291 VTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPT 345
           V AVGYG+ ++G  YW+V+NSWGT WGE+GYIRM+R++ +K G CGIA++ASYP 
Sbjct: 262 VVAVGYGS-ENGVDYWIVRNSWGTRWGEDGYIRMERNVASKSGKCGIAIEASYPV 315


>gi|118127|sp|P25251.1|CYSP4_BRANA RecName: Full=Cysteine proteinase COT44; Flags: Precursor
          Length = 328

 Score =  334 bits (856), Expect = 4e-89,   Method: Compositional matrix adjust.
 Identities = 161/295 (54%), Positives = 207/295 (70%), Gaps = 7/295 (2%)

Query: 55  EKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSS 114
           +++ RF IFK+N+ +I   N   +N  YKLG+  FA+ TN+E+R+   G  R  P  R +
Sbjct: 24  QQDERFNIFKDNLRFIDLHNENNKNATYKLGLTIFANLTNDEYRSLYLG-ARTEPVRRIT 82

Query: 115 ETTDVSFRYENA----SVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTR 170
           +  +V+ +Y  A     VP ++DWR+KGAV  +KDQG CG CWAFS  AA+EGIN I T 
Sbjct: 83  KAKNVNMKYSAAVNVDEVPVTVDWRQKGAVNAIKDQGTCGSCWAFSTAAAVEGINKIVTG 142

Query: 171 KLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEA 230
           +L SLSEQELVDCD S  +QGC GGLMD AF+FI+ N GL TE  YPY  ++G CN    
Sbjct: 143 ELVSLSEQELVDCDKS-YNQGCNGGLMDYAFQFIMKNGGLNTEKDYPYHGTNGKCNSLLK 201

Query: 231 NPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHG 290
           N     I GYEDVPS +E AL +AV+ QPVSVAIDA G  FQ Y SG+FTG+CGT +DH 
Sbjct: 202 NSRVVTIDGYEDVPSKDETALKRAVSYQPVSVAIDAGGRAFQHYQSGIFTGKCGTNMDHA 261

Query: 291 VTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPT 345
           V AVGYG+ ++G  YW+V+NSWGT WGE+GYIRM+R++ +K G CGIA++ASYP 
Sbjct: 262 VVAVGYGS-ENGVDYWIVRNSWGTRWGEDGYIRMERNVASKSGKCGIAIEASYPV 315


>gi|359359215|gb|AEV41119.1| putative cysteine protease [Oryza officinalis]
          Length = 499

 Score =  334 bits (856), Expect = 4e-89,   Method: Compositional matrix adjust.
 Identities = 172/315 (54%), Positives = 220/315 (69%), Gaps = 14/315 (4%)

Query: 39  HEMWMAQYGRVYRDN-----AEKEMRFKIFKENVEYIASFNNKA-RNKPYKLGINEFADQ 92
           +++W+A++ R   D+      E E RF++F +N++++ + N +A  +  ++LG+N FAD 
Sbjct: 65  YDLWVARH-RHGGDSHNGLVGEYERRFRVFWDNLKFVDAHNARADEHGGFRLGMNRFADL 123

Query: 93  TNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENASV-PASIDWRKKGAVTG-VKDQGQCG 150
           TN+EFRA    Y    P+ R     + ++R++   V P S+DWR KGAV   VK+QGQCG
Sbjct: 124 TNDEFRA---AYLGTTPAGRGRHVGE-AYRHDGVEVLPDSVDWRDKGAVVAPVKNQGQCG 179

Query: 151 CCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGL 210
            CWAFSAVAA+EGIN I T +L SLSEQELV+C  +G + GC GG+MDDAF FI  N GL
Sbjct: 180 SCWAFSAVAAVEGINKIVTGELVSLSEQELVECARNGANSGCNGGMMDDAFAFIARNGGL 239

Query: 211 ATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSD 270
            TE  YPY A DG CN  + +     I G+EDVP N+E +L KAVA+QPVSVAIDA G +
Sbjct: 240 DTEEDYPYTAMDGKCNLAKKSRKVVSIDGFEDVPENDELSLQKAVAHQPVSVAIDAGGRE 299

Query: 271 FQFYSSGVFTGQCGTELDHGVTAVGYGT-ADDGTKYWLVKNSWGTTWGENGYIRMQRDID 329
           FQ Y SGVFTG+CGT LDHGV AVGYGT A  GT YW V+NSWG  WGENGYIRM+R++ 
Sbjct: 300 FQLYDSGVFTGRCGTSLDHGVVAVGYGTDAATGTDYWTVRNSWGPDWGENGYIRMERNVT 359

Query: 330 AKEGLCGIAMQASYP 344
           A+ G CGIAM ASYP
Sbjct: 360 ARTGKCGIAMMASYP 374


>gi|242032709|ref|XP_002463749.1| hypothetical protein SORBIDRAFT_01g005350 [Sorghum bicolor]
 gi|241917603|gb|EER90747.1| hypothetical protein SORBIDRAFT_01g005350 [Sorghum bicolor]
          Length = 381

 Score =  333 bits (855), Expect = 6e-89,   Method: Compositional matrix adjust.
 Identities = 172/322 (53%), Positives = 217/322 (67%), Gaps = 12/322 (3%)

Query: 31  NDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFA 90
           +D  + + +E W   + RV+R + EK  RF  FKENV +I + N +     Y+L +N F 
Sbjct: 38  SDEALWDLYERWQTHH-RVHRHHGEKGRRFGTFKENVRFIHAHNKRGDRPSYRLRLNRFG 96

Query: 91  DQTNEEFRA----PRNGYKRRLPSVRSSETTDVSFRYENAS-VPASIDWRKKGAVTGVKD 145
           D   EEFR+     R    RR      + T    F Y++A+ VP S+DWR+ GAVT VK+
Sbjct: 97  DMGPEEFRSTFADSRINDLRRYRESSPAATAVPGFMYDDATDVPRSVDWRQHGAVTAVKN 156

Query: 146 QGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFII 205
           QG+CG CWAFS V A+EGIN I T  L SLSEQELVDCDT+  + GC+GGLM++AF+FI 
Sbjct: 157 QGRCGSCWAFSTVVAVEGINAIRTGSLVSLSEQELVDCDTA--ENGCQGGLMENAFDFIK 214

Query: 206 SNKGLATEAKYPYKASDGSCNKKEANPSAAKIS--GYEDVPSNNEAALMKAVANQPVSVA 263
           S  G+ TE+ YPY+AS+G+C+   A      +S  G++ VP+ +E AL KAVA QPVSVA
Sbjct: 215 SYGGITTESAYPYRASNGTCDGMRARRGRVHVSIDGHQMVPTGSEDALAKAVARQPVSVA 274

Query: 264 IDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTAD-DGTKYWLVKNSWGTTWGENGYI 322
           IDA G  FQFYS GVFTG CGT+LDHGV  VGYG +D DGT YW+VKNSWG +WGE GYI
Sbjct: 275 IDAGGQAFQFYSEGVFTGDCGTDLDHGVAVVGYGVSDVDGTPYWIVKNSWGPSWGEGGYI 334

Query: 323 RMQRDIDAKEGLCGIAMQASYP 344
           RMQR      GLCGIAM+AS+P
Sbjct: 335 RMQRGA-GNGGLCGIAMEASFP 355


>gi|449522968|ref|XP_004168497.1| PREDICTED: xylem cysteine proteinase 1-like [Cucumis sativus]
          Length = 348

 Score =  333 bits (854), Expect = 6e-89,   Method: Compositional matrix adjust.
 Identities = 169/310 (54%), Positives = 217/310 (70%), Gaps = 8/310 (2%)

Query: 37  ERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEE 96
           E  E W++ +G++Y    EK  RF++FK+N+++I   N K  +  Y LG+NEFAD T++E
Sbjct: 43  ELFEEWISNHGKIYETIEEKWHRFEVFKDNLKHIDETNKKVTS--YWLGVNEFADLTHQE 100

Query: 97  FRAPRNGYKRRLPSVRSSETTDVSFRYENA-SVPASIDWRKKGAVTGVKDQGQCGCCWAF 155
           F+    G K  + S R+ ++ +  F Y++   +P S+DWRKKGAVT VK+QG CG CWAF
Sbjct: 101 FKNMYLGLK--VESSRTRQSPE-EFTYKDVVDLPKSVDWRKKGAVTRVKNQGSCGSCWAF 157

Query: 156 SAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAK 215
           S VAA+EGIN I    LTSLSEQEL+DCD    + GC GGLMD AF FI+S+ GL  E  
Sbjct: 158 STVAAVEGINKIVGGNLTSLSEQELIDCDRP-YNNGCHGGLMDYAFSFIVSSGGLHKEED 216

Query: 216 YPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYS 275
           YPY   + +C+ K+       ISGY+DVP NNEA+L+KA+A+QP+SVAI+ASG DFQFYS
Sbjct: 217 YPYLEVESTCDNKKGELEVVTISGYKDVPENNEASLIKALAHQPLSVAIEASGRDFQFYS 276

Query: 276 SGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLC 335
            GVF G CGT+LDHGVTAVGYG++  G  Y +VKNSWG  WGE GYIRM+R+     GLC
Sbjct: 277 GGVFDGPCGTQLDHGVTAVGYGSS-KGVDYIIVKNSWGPKWGEKGYIRMKRNTGKPAGLC 335

Query: 336 GIAMQASYPT 345
           GI   ASYPT
Sbjct: 336 GINKMASYPT 345


>gi|222629675|gb|EEE61807.1| hypothetical protein OsJ_16426 [Oryza sativa Japonica Group]
          Length = 459

 Score =  333 bits (854), Expect = 6e-89,   Method: Compositional matrix adjust.
 Identities = 165/305 (54%), Positives = 210/305 (68%), Gaps = 6/305 (1%)

Query: 42  WMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARN--KPYKLGINEFADQTNEEFRA 99
           W A++G+ Y    E+E R+  F++N+ YI   N  A      ++LG+N FAD TNEE+R 
Sbjct: 44  WKAEHGKSYNAVGEEERRYAAFRDNLRYIDEHNAAADAGVHSFRLGLNRFADLTNEEYRD 103

Query: 100 PRNGYKRRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVA 159
              G + +    R  + +D     +N ++P S+DWR KGAV  +KDQG CG CWAFSA+A
Sbjct: 104 TYLGLRNK--PRRERKVSDRYLAADNEALPESVDWRTKGAVAEIKDQGGCGSCWAFSAIA 161

Query: 160 AMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYK 219
           A+EGIN I T  L SLSEQELVDCDTS  ++GC GGLMD AF+FII+N G+ TE  YPYK
Sbjct: 162 AVEGINQIVTGDLISLSEQELVDCDTS-YNEGCNGGLMDYAFDFIINNGGIDTEDDYPYK 220

Query: 220 ASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVF 279
             D  C+    N     I  YEDV  N+E +L KAVANQPVSVAI+A G  FQ YSSG+F
Sbjct: 221 GKDERCDVNRKNAKVVTIDSYEDVTPNSETSLQKAVANQPVSVAIEAGGRAFQLYSSGIF 280

Query: 280 TGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAM 339
           TG+CGT LDHGV AVGYGT ++G  YW+V+NSWG +WGE+GY+RM+R+I A  G CGIA+
Sbjct: 281 TGKCGTALDHGVAAVGYGT-ENGKDYWIVRNSWGKSWGESGYVRMERNIKASSGKCGIAV 339

Query: 340 QASYP 344
           + SYP
Sbjct: 340 EPSYP 344


>gi|195628596|gb|ACG36128.1| vignain precursor [Zea mays]
          Length = 362

 Score =  333 bits (854), Expect = 7e-89,   Method: Compositional matrix adjust.
 Identities = 160/319 (50%), Positives = 218/319 (68%), Gaps = 12/319 (3%)

Query: 31  NDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFA 90
           ++A M  R++ WMAQY R Y+D+AEK  RF++FK N E+I   N   + K Y LG N+FA
Sbjct: 51  DEAMMMARYKKWMAQYRRKYKDDAEKAHRFQVFKANAEFIDRSNAGGKKK-YVLGTNQFA 109

Query: 91  DQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENASV---PASIDWRKKGAVTGVKDQG 147
           D T++EF A   G ++       ++     F+Y+N +       +DWR++GAVT VK+QG
Sbjct: 110 DLTSKEFAAMYTGLRKPAAVPSGAKQIPAGFKYQNFTRLDDDVQVDWRQQGAVTPVKNQG 169

Query: 148 QCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISN 207
           QCGCCWAFSAV AMEG+  ITT  L SLSEQ+++DCD S  +QGC GG MD+AF+++++N
Sbjct: 170 QCGCCWAFSAVGAMEGLIMITTGNLVSLSEQQILDCDESDGNQGCNGGYMDNAFQYVVNN 229

Query: 208 KGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDAS 267
            G+ TE  YPY A  G+C   +    AA ISG++D+PS +E AL  AVANQPVSV +D  
Sbjct: 230 GGVTTEDAYPYSAVQGTCQNVQ---PAATISGFQDLPSGDENALANAVANQPVSVGVDGG 286

Query: 268 GSDFQFYSSGVFTGQ-CGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQR 326
            S FQFY  G++ G  CGT+++H VTA+GYG  D GT+YW++KNSWGT WGENG++++Q 
Sbjct: 287 SSPFQFYQGGIYDGDGCGTDMNHAVTAIGYGADDQGTQYWILKNSWGTGWGENGFMQLQM 346

Query: 327 DIDAKEGLCGIAMQASYPT 345
            +    G CGI+  ASYPT
Sbjct: 347 GV----GACGISTMASYPT 361


>gi|359359168|gb|AEV41073.1| putative cysteine protease [Oryza minuta]
          Length = 499

 Score =  333 bits (854), Expect = 7e-89,   Method: Compositional matrix adjust.
 Identities = 170/314 (54%), Positives = 219/314 (69%), Gaps = 12/314 (3%)

Query: 39  HEMWMAQY----GRVYRDNAEKEMRFKIFKENVEYIASFNNKA-RNKPYKLGINEFADQT 93
           +++W+A++    G       E E RF++F +N++++ + N +A  +  ++LG+N FAD T
Sbjct: 65  YDLWVARHRHGGGSHNGLVGEYERRFRVFWDNLKFVDAHNARADEHGGFRLGMNRFADLT 124

Query: 94  NEEFRAPRNGYKRRLPSVRSSETTDVSFRYENA-SVPASIDWRKKGAVTG-VKDQGQCGC 151
           N+EFRA    Y    P+ R     + ++R++   ++P S+DWR KGAV   VK+QGQCG 
Sbjct: 125 NDEFRA---AYLGTTPAGRGRHVGE-AYRHDGVEALPDSVDWRDKGAVVAPVKNQGQCGS 180

Query: 152 CWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLA 211
           CWAFSAVAA+EGIN I T +L SLSEQELV+C  +G + GC GG+MDDAF FI  N GL 
Sbjct: 181 CWAFSAVAAVEGINKIVTGELVSLSEQELVECARNGANSGCNGGMMDDAFAFIARNGGLD 240

Query: 212 TEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDF 271
           TE  YPY A DG CN  + +     I G+EDVP N+E +L KAVA+QPVSVAIDA G +F
Sbjct: 241 TEEDYPYTAMDGKCNLAKKSRKVVSIDGFEDVPENDELSLQKAVAHQPVSVAIDAGGREF 300

Query: 272 QFYSSGVFTGQCGTELDHGVTAVGYGT-ADDGTKYWLVKNSWGTTWGENGYIRMQRDIDA 330
           Q Y SGVFTG+CGT LDHGV AVGYGT A  GT YW V+NSWG  WGENGYIRM+R++ A
Sbjct: 301 QLYDSGVFTGRCGTSLDHGVVAVGYGTDAATGTDYWTVRNSWGPDWGENGYIRMERNVTA 360

Query: 331 KEGLCGIAMQASYP 344
           + G CGIAM ASYP
Sbjct: 361 RTGKCGIAMMASYP 374


>gi|109939734|sp|P25776.2|ORYA_ORYSJ RecName: Full=Oryzain alpha chain; Flags: Precursor
 gi|78192122|gb|ABB30151.1| oryzain alpha [Oryza sativa Japonica Group]
          Length = 458

 Score =  333 bits (854), Expect = 7e-89,   Method: Compositional matrix adjust.
 Identities = 165/305 (54%), Positives = 210/305 (68%), Gaps = 6/305 (1%)

Query: 42  WMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARN--KPYKLGINEFADQTNEEFRA 99
           W A++G+ Y    E+E R+  F++N+ YI   N  A      ++LG+N FAD TNEE+R 
Sbjct: 43  WKAEHGKSYNAVGEEERRYAAFRDNLRYIDEHNAAADAGVHSFRLGLNRFADLTNEEYRD 102

Query: 100 PRNGYKRRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVA 159
              G + +    R  + +D     +N ++P S+DWR KGAV  +KDQG CG CWAFSA+A
Sbjct: 103 TYLGLRNK--PRRERKVSDRYLAADNEALPESVDWRTKGAVAEIKDQGGCGSCWAFSAIA 160

Query: 160 AMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYK 219
           A+EGIN I T  L SLSEQELVDCDTS  ++GC GGLMD AF+FII+N G+ TE  YPYK
Sbjct: 161 AVEGINQIVTGDLISLSEQELVDCDTS-YNEGCNGGLMDYAFDFIINNGGIDTEDDYPYK 219

Query: 220 ASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVF 279
             D  C+    N     I  YEDV  N+E +L KAVANQPVSVAI+A G  FQ YSSG+F
Sbjct: 220 GKDERCDVNRKNAKVVTIDSYEDVTPNSETSLQKAVANQPVSVAIEAGGRAFQLYSSGIF 279

Query: 280 TGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAM 339
           TG+CGT LDHGV AVGYGT ++G  YW+V+NSWG +WGE+GY+RM+R+I A  G CGIA+
Sbjct: 280 TGKCGTALDHGVAAVGYGT-ENGKDYWIVRNSWGKSWGESGYVRMERNIKASSGKCGIAV 338

Query: 340 QASYP 344
           + SYP
Sbjct: 339 EPSYP 343


>gi|168017893|ref|XP_001761481.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162687165|gb|EDQ73549.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 471

 Score =  333 bits (854), Expect = 7e-89,   Method: Compositional matrix adjust.
 Identities = 166/318 (52%), Positives = 223/318 (70%), Gaps = 12/318 (3%)

Query: 31  NDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFA 90
           +D  ++  H+ W+ ++ RVY   +EK+ RF+IFK+N+ YI   N+  + K Y LG+N+F+
Sbjct: 45  DDGMLDVFHQ-WLERHSRVYHSLSEKQRRFQIFKDNLHYI--HNHNKQEKSYWLGLNKFS 101

Query: 91  DQTNEEFRAPRNGYKR--RLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQ 148
           D T++EFRA   G +   R   +R+ +     F YE+      +DWRKKGAV+ VKDQG 
Sbjct: 102 DLTHDEFRALYLGIRPAGRAHGLRNGDR----FIYEDVVAEEMVDWRKKGAVSDVKDQGS 157

Query: 149 CGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNK 208
           CG CWAFSA+ ++EG+N I T +L SLSEQELVDCD  G++QGC GGLMD AF+FII N 
Sbjct: 158 CGSCWAFSAIGSVEGVNAIVTGELISLSEQELVDCDR-GQNQGCNGGLMDYAFDFIIKNG 216

Query: 209 GLATEAKYPYKASDGSCNK-KEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDAS 267
           G+ TE  YPYKA+DG C++ ++       I  Y+DVP+ +E++L+KAV+  PVSVAI+A 
Sbjct: 217 GIDTEEDYPYKATDGQCDEARKETSKVVVIDDYQDVPTKSESSLLKAVSKNPVSVAIEAG 276

Query: 268 GSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQR- 326
           G DFQ Y  GVFTG CGT+LDHGV AVGYGT DDG  YW+VKNSWG +WGE GYIRM+R 
Sbjct: 277 GRDFQHYQGGVFTGPCGTDLDHGVLAVGYGTDDDGVNYWIVKNSWGPSWGEKGYIRMERM 336

Query: 327 DIDAKEGLCGIAMQASYP 344
             ++  G CGI ++ S+P
Sbjct: 337 GSNSTSGKCGINIEPSFP 354


>gi|218183|dbj|BAA14403.1| oryzain beta precursor [Oryza sativa Japonica Group]
          Length = 471

 Score =  333 bits (854), Expect = 8e-89,   Method: Compositional matrix adjust.
 Identities = 164/311 (52%), Positives = 218/311 (70%), Gaps = 9/311 (2%)

Query: 39  HEMWMAQYGRVYRD--NAEKEMRFKIFKENVEYIASFNNKA-RNKPYKLGINEFADQTNE 95
           +++W+A+ G    +    E E RF +F +N++++ + N +A     ++LG+N FAD TNE
Sbjct: 51  YDLWLAENGGGSPNALGGEHERRFLVFWDNLKFVDAHNARADEGGGFRLGMNRFADLTNE 110

Query: 96  EFRAPRNGYKRRLPSVRSSETTDVSFRYENAS-VPASIDWRKKGAVTGVKDQGQCGCCWA 154
           EFRA   G K    S  + E     +R++    +P S+DWR+KGAV  VK+QGQCG CWA
Sbjct: 111 EFRATFLGAKVAERSRAAGE----RYRHDGVEELPESVDWREKGAVAPVKNQGQCGSCWA 166

Query: 155 FSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEA 214
           FSAV+ +E IN + T ++ +LSEQELV+C T+G++ GC GGLM DAF+FII N G+ TE 
Sbjct: 167 FSAVSTVESINQLVTGEMITLSEQELVECSTNGQNSGCNGGLMADAFDFIIKNGGIDTED 226

Query: 215 KYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFY 274
            YPYKA DG C+    N     I G+EDVP N+E +L KAVA+QPVSVAI+A G +FQ Y
Sbjct: 227 DYPYKAVDGKCDINRENAKVVSIDGFEDVPQNDEKSLQKAVAHQPVSVAIEAGGREFQLY 286

Query: 275 SSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGL 334
            SGVF+G+CGT LDHGV AVGYGT D+G  YW+V+NSWG  WGE+GY+RM+R+I+   G 
Sbjct: 287 HSGVFSGRCGTSLDHGVVAVGYGT-DNGKDYWIVRNSWGPKWGESGYVRMERNINVTTGK 345

Query: 335 CGIAMQASYPT 345
           CGIAM ASYPT
Sbjct: 346 CGIAMMASYPT 356


>gi|218195711|gb|EEC78138.1| hypothetical protein OsI_17694 [Oryza sativa Indica Group]
          Length = 458

 Score =  333 bits (853), Expect = 8e-89,   Method: Compositional matrix adjust.
 Identities = 165/305 (54%), Positives = 210/305 (68%), Gaps = 6/305 (1%)

Query: 42  WMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARN--KPYKLGINEFADQTNEEFRA 99
           W A++G+ Y    E+E R+  F++N+ YI   N  A      ++LG+N FAD TNEE+R 
Sbjct: 43  WKAEHGKNYNAVGEEERRYAAFRDNLRYIDEHNAAADAGVHSFRLGLNRFADLTNEEYRD 102

Query: 100 PRNGYKRRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVA 159
              G + +    R  + +D     +N ++P S+DWR KGAV  +KDQG CG CWAFSA+A
Sbjct: 103 TYLGLRNK--PRRERKVSDRYLAADNEALPESVDWRTKGAVAEIKDQGGCGSCWAFSAIA 160

Query: 160 AMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYK 219
           A+EGIN I T  L SLSEQELVDCDTS  ++GC GGLMD AF+FII+N G+ TE  YPYK
Sbjct: 161 AVEGINQIVTGDLISLSEQELVDCDTS-YNEGCNGGLMDYAFDFIINNGGIDTEDDYPYK 219

Query: 220 ASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVF 279
             D  C+    N     I  YEDV  N+E +L KAVANQPVSVAI+A G  FQ YSSG+F
Sbjct: 220 GKDERCDVNRKNAKVVTIDSYEDVTPNSETSLQKAVANQPVSVAIEAGGRAFQLYSSGIF 279

Query: 280 TGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAM 339
           TG+CGT LDHGV AVGYGT ++G  YW+V+NSWG +WGE+GY+RM+R+I A  G CGIA+
Sbjct: 280 TGKCGTALDHGVAAVGYGT-ENGKDYWIVRNSWGKSWGESGYVRMERNIKASSGKCGIAV 338

Query: 340 QASYP 344
           + SYP
Sbjct: 339 EPSYP 343


>gi|388519351|gb|AFK47737.1| unknown [Medicago truncatula]
          Length = 359

 Score =  333 bits (853), Expect = 9e-89,   Method: Compositional matrix adjust.
 Identities = 171/350 (48%), Positives = 223/350 (63%), Gaps = 14/350 (4%)

Query: 1   MAMILLENKLVLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRF 60
           MA I +   L+  +++ L +    S        TM   +E W+ ++ +VY    EK+ RF
Sbjct: 1   MASITI-TSLLFFSLITLSLAMDTSMRSNEEVMTM---YEEWLVKHHKVYNGLGEKDQRF 56

Query: 61  KIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFR----APRNGYKRRLPSVRSSET 116
           +IFK+N+ +I   N  A+N  YK+G+N+FAD TNEE+R      +N  KR +  ++ +  
Sbjct: 57  EIFKDNLGFIDEHN--AQNYTYKVGLNKFADTTNEEYRNMYLGTKNDAKRNVMKIKITTG 114

Query: 117 TDVSFRYENASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLS 176
              +F      +P  +DWR KGAV  +KDQG CG CWAFS +A +E IN I T KL SLS
Sbjct: 115 HRYAFN-SGDRLPVHVDWRSKGAVAHIKDQGSCGSCWAFSTIATVEAINKIVTGKLVSLS 173

Query: 177 EQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAK 236
           EQELVDCD +  ++GC GGLMD AFEFI+ N G+ TE  YPYK  +G C+    N     
Sbjct: 174 EQELVDCDRA-FNEGCNGGLMDYAFEFIVENGGIDTEQDYPYKGFEGRCDPTRKNAKVVS 232

Query: 237 ISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGY 296
           I GYEDVP+ NE AL KAV +QPVSVAI+A G   Q Y SGVFTG+CGT LDHGV  VGY
Sbjct: 233 IDGYEDVPAYNENALKKAVFHQPVSVAIEAGGRALQLYQSGVFTGRCGTNLDHGVVVVGY 292

Query: 297 GTADDGTKYWLVKNSWGTTWGENGYIRMQRDI-DAKEGLCGIAMQASYPT 345
           G  ++G  YWLV+NSWGT WGE+GY +++R++     G CGIAMQASYP 
Sbjct: 293 G-FENGVDYWLVRNSWGTNWGEDGYFKLERNVKKINTGKCGIAMQASYPV 341


>gi|115461667|ref|NP_001054433.1| Os05g0108600 [Oryza sativa Japonica Group]
 gi|14719319|gb|AAK73137.1|AC079022_10 putative cysteine proteinase [Oryza sativa]
 gi|33151125|gb|AAP97431.1| cysteine protease CP1 [Oryza sativa]
 gi|52353572|gb|AAU44138.1| cysteine proteinase CP1 [Oryza sativa Japonica Group]
 gi|113577984|dbj|BAF16347.1| Os05g0108600 [Oryza sativa Japonica Group]
 gi|125550541|gb|EAY96250.1| hypothetical protein OsI_18148 [Oryza sativa Indica Group]
          Length = 358

 Score =  333 bits (853), Expect = 1e-88,   Method: Compositional matrix adjust.
 Identities = 174/312 (55%), Positives = 209/312 (66%), Gaps = 8/312 (2%)

Query: 37  ERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEE 96
           E  E W+A+Y + Y    EK  RF++FK+N+ +I   N K  +  Y LG+NEFAD T++E
Sbjct: 49  ELFEKWVAKYRKAYASFEEKVRRFEVFKDNLNHIDDINKKVTS--YWLGLNEFADLTHDE 106

Query: 97  FRAPRNGYKRRLPSVRSSETTDVSFRY---ENASVPASIDWRKKGAVTGVKDQGQCGCCW 153
           F+A   G         S   +   FRY    N  VP  +DWRKK AVT VK+QGQCG CW
Sbjct: 107 FKATYLGLTPPPTRSNSKHYSSEEFRYGKMSNGEVPKEMDWRKKNAVTEVKNQGQCGSCW 166

Query: 154 AFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATE 213
           AFS VAA+EGIN I T  LTSLSEQEL+DC T G + GC GGLMD AF +I S  GL TE
Sbjct: 167 AFSTVAAVEGINAIVTGNLTSLSEQELIDCSTDG-NNGCNGGLMDYAFSYIASTGGLRTE 225

Query: 214 AKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQF 273
             YPY   +G C++ +   +   ISGYEDVP+N+E AL+KA+A+QPVSVAI+ASG  FQF
Sbjct: 226 EAYPYAMEEGDCDEGKG-AAVVTISGYEDVPANDEQALVKALAHQPVSVAIEASGRHFQF 284

Query: 274 YSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEG 333
           YS GVF G CG +LDHGVTAVGYGT+  G  Y +VKNSWG  WGE GYIRM+R     EG
Sbjct: 285 YSGGVFDGPCGEQLDHGVTAVGYGTS-KGQDYIIVKNSWGPHWGEKGYIRMKRGTGKGEG 343

Query: 334 LCGIAMQASYPT 345
           LCGI   ASYPT
Sbjct: 344 LCGINKMASYPT 355


>gi|18402225|ref|NP_566633.1| Granulin repeat cysteine protease family protein [Arabidopsis
           thaliana]
 gi|11994461|dbj|BAB02463.1| cysteine proteinase [Arabidopsis thaliana]
 gi|17065298|gb|AAL32803.1| cysteine proteinase [Arabidopsis thaliana]
 gi|20260004|gb|AAM13349.1| cysteine proteinase [Arabidopsis thaliana]
 gi|332642713|gb|AEE76234.1| Granulin repeat cysteine protease family protein [Arabidopsis
           thaliana]
          Length = 452

 Score =  333 bits (853), Expect = 1e-88,   Method: Compositional matrix adjust.
 Identities = 169/338 (50%), Positives = 224/338 (66%), Gaps = 7/338 (2%)

Query: 10  LVLAAILV-LGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVE 68
           L+ + +L+ L + +  +   T N+A     +E W+ +  + Y    EKE RF+IFK+N++
Sbjct: 13  LIFSVLLISLSLGSVTATETTRNEAEARRMYERWLVENRKNYNGLGEKERRFEIFKDNLK 72

Query: 69  YIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENASV 128
           ++   ++   N+ Y++G+  FAD TN+EFRA     + ++   R     +        S+
Sbjct: 73  FVEE-HSSIPNRTYEVGLTRFADLTNDEFRAIY--LRSKMERTRVPVKGEKYLYKVGDSL 129

Query: 129 PASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGE 188
           P +IDWR KGAV  VKDQG CG CWAFSA+ A+EGIN I T +L SLSEQELVDCDTS  
Sbjct: 130 PDAIDWRAKGAVNPVKDQGSCGSCWAFSAIGAVEGINQIKTGELISLSEQELVDCDTSYN 189

Query: 189 DQGCEGGLMDDAFEFIISNKGLATEAKYPYKASD-GSCNKKEANPSAAKISGYEDVPSNN 247
           D GC GGLMD AF+FII N G+ TE  YPY A+D   CN  + N     I GYEDVP N+
Sbjct: 190 D-GCGGGLMDYAFKFIIENGGIDTEEDYPYIATDVNVCNSDKKNTRVVTIDGYEDVPQND 248

Query: 248 EAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWL 307
           E +L KA+ANQP+SVAI+A G  FQ Y+SGVFTG CGT LDHGV AVGYG+ + G  YW+
Sbjct: 249 EKSLKKALANQPISVAIEAGGRAFQLYTSGVFTGTCGTSLDHGVVAVGYGS-EGGQDYWI 307

Query: 308 VKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPT 345
           V+NSWG+ WGE+GY +++R+I    G CG+AM ASYPT
Sbjct: 308 VRNSWGSNWGESGYFKLERNIKESSGKCGVAMMASYPT 345


>gi|449455625|ref|XP_004145553.1| PREDICTED: xylem cysteine proteinase 1-like [Cucumis sativus]
          Length = 351

 Score =  332 bits (852), Expect = 1e-88,   Method: Compositional matrix adjust.
 Identities = 169/310 (54%), Positives = 217/310 (70%), Gaps = 8/310 (2%)

Query: 37  ERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEE 96
           E  E W++ +G++Y    EK  RF++FK+N+++I   N K  +  Y LG+NEFAD T++E
Sbjct: 46  ELFEEWISNHGKIYETIEEKWHRFEVFKDNLKHIDETNKKVTS--YWLGVNEFADLTHQE 103

Query: 97  FRAPRNGYKRRLPSVRSSETTDVSFRYENA-SVPASIDWRKKGAVTGVKDQGQCGCCWAF 155
           F+    G K  + S R+ ++ +  F Y++   +P S+DWRKKGAVT VK+QG CG CWAF
Sbjct: 104 FKNMYLGLK--VESSRTRQSPE-EFTYKDVVDLPKSVDWRKKGAVTRVKNQGSCGSCWAF 160

Query: 156 SAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAK 215
           S VAA+EGIN I    LTSLSEQEL+DCD    + GC GGLMD AF FI+S+ GL  E  
Sbjct: 161 STVAAVEGINKIVGGNLTSLSEQELIDCDRP-YNNGCHGGLMDYAFSFIVSSGGLHKEED 219

Query: 216 YPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYS 275
           YPY   + +C+ K+       ISGY+DVP NNEA+L+KA+A+QP+SVAI+ASG DFQFYS
Sbjct: 220 YPYLEVESTCDNKKGELEVVTISGYKDVPENNEASLIKALAHQPLSVAIEASGRDFQFYS 279

Query: 276 SGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLC 335
            GVF G CGT+LDHGVTAVGYG++  G  Y +VKNSWG  WGE GYIRM+R+     GLC
Sbjct: 280 GGVFDGPCGTQLDHGVTAVGYGSS-KGVDYIIVKNSWGPKWGEKGYIRMKRNTGKPAGLC 338

Query: 336 GIAMQASYPT 345
           GI   ASYPT
Sbjct: 339 GINKMASYPT 348


>gi|297852302|ref|XP_002894032.1| F2G19.31/F2G19.31 [Arabidopsis lyrata subsp. lyrata]
 gi|297339874|gb|EFH70291.1| F2G19.31/F2G19.31 [Arabidopsis lyrata subsp. lyrata]
          Length = 455

 Score =  332 bits (851), Expect = 1e-88,   Method: Compositional matrix adjust.
 Identities = 168/319 (52%), Positives = 217/319 (68%), Gaps = 14/319 (4%)

Query: 31  NDATMNERHEMWMAQYGRVYRDNA--EKEMRFKIFKENVEYIASFNNKARNKPYKLGINE 88
           +DA +   +E W+ ++G+    N+  EK+ RF+IFK+N+ +I   N K  N  Y+LG+  
Sbjct: 35  SDAEVMSIYEAWLVKHGKAQNQNSLVEKDRRFEIFKDNLRFIDDHNKK--NLSYRLGLTR 92

Query: 89  FADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYE---NASVPASIDWRKKGAVTGVKD 145
           FAD TN+E+R+   G K      R +     S RYE      +P SIDWRKKGAV  VKD
Sbjct: 93  FADLTNDEYRSKYLGAKMEKKGERRT-----SQRYEARVGDELPESIDWRKKGAVAEVKD 147

Query: 146 QGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFII 205
           QG CG CWAFS + A+EGIN I T  L +LSEQELVDCDTS  ++GC GGLMD AFEFII
Sbjct: 148 QGSCGSCWAFSTIGAVEGINQIVTGDLITLSEQELVDCDTS-YNEGCNGGLMDYAFEFII 206

Query: 206 SNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAID 265
            N G+ T+  YPYK  DG+C++   N     I  YEDVP+ +E +L KAVA+QPVSVAI+
Sbjct: 207 KNGGIDTDKDYPYKGVDGTCDQIRKNAKVVTIDSYEDVPTYSEESLKKAVAHQPVSVAIE 266

Query: 266 ASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQ 325
           A G  FQ Y SG+F G CGT+LDHGV AVGYGT ++G  YW+V+NSWG +WGE+GY++M 
Sbjct: 267 AGGRAFQLYDSGIFDGTCGTQLDHGVVAVGYGT-ENGKDYWIVRNSWGKSWGESGYLKMA 325

Query: 326 RDIDAKEGLCGIAMQASYP 344
           R+I +  G CGIA++ SYP
Sbjct: 326 RNIASSSGKCGIAIEPSYP 344


>gi|242055323|ref|XP_002456807.1| hypothetical protein SORBIDRAFT_03g043220 [Sorghum bicolor]
 gi|241928782|gb|EES01927.1| hypothetical protein SORBIDRAFT_03g043220 [Sorghum bicolor]
          Length = 369

 Score =  332 bits (851), Expect = 2e-88,   Method: Compositional matrix adjust.
 Identities = 167/296 (56%), Positives = 209/296 (70%), Gaps = 12/296 (4%)

Query: 55  EKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSS 114
           EK  RF  FKENV +I + +NK  ++PY+L +N F D   EEFR+       R+  +R +
Sbjct: 57  EKGRRFGTFKENVRFIHA-HNKRGDRPYRLSLNRFGDMGREEFRS--TFADSRINDLRRA 113

Query: 115 ETTDV----SFRYENAS-VPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITT 169
           E+        F Y+  + +P S+DWRK+GAVT VKDQG CG CWAFS V ++EGIN I T
Sbjct: 114 ESPAAPAVPGFMYDGVTDLPPSVDWRKEGAVTAVKDQGHCGSCWAFSTVVSVEGINAIRT 173

Query: 170 RKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNK-K 228
             L SLSEQEL+DCDT  ++ GC+GGLM++AFEFI S  G+ TE+ YPY+AS+G+C+  +
Sbjct: 174 GSLVSLSEQELIDCDT--DENGCQGGLMENAFEFIKSYGGVTTESAYPYRASNGTCDSVR 231

Query: 229 EANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELD 288
                   I G++ VP+ +E AL KAVANQPVSVAIDA G  FQFYS GVFTG CGT+LD
Sbjct: 232 SRRGQIVSIDGHQMVPTGSEDALAKAVANQPVSVAIDAGGQAFQFYSEGVFTGDCGTDLD 291

Query: 289 HGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
           HGV AVGYG +DDGT YW+VKNSWG +WGE GYIRMQR      GLCGIAM+AS+P
Sbjct: 292 HGVAAVGYGVSDDGTAYWIVKNSWGPSWGEGGYIRMQRGA-GNGGLCGIAMEASFP 346


>gi|224096714|ref|XP_002310708.1| predicted protein [Populus trichocarpa]
 gi|222853611|gb|EEE91158.1| predicted protein [Populus trichocarpa]
          Length = 356

 Score =  332 bits (851), Expect = 2e-88,   Method: Compositional matrix adjust.
 Identities = 169/311 (54%), Positives = 216/311 (69%), Gaps = 9/311 (2%)

Query: 39  HEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFR 98
           ++ W+ ++G+ Y    EK  RF+IFK N+ +I   N  ++N+ YK+G+ +FAD TN+E+R
Sbjct: 28  YKWWLQKHGKAYNRLGEKAKRFEIFKNNLRFIDEHN--SQNRTYKVGLTKFADLTNQEYR 85

Query: 99  APRNGYKRRLPSVRSSETTDVSFRYENAS---VPASIDWRKKGAVTGVKDQGQCGCCWAF 155
           A   G  R  P  R  ++ + S RY   +   +P S+DWR KGAV  +KDQG CG CWAF
Sbjct: 86  AMFLG-TRSDPKRRLMKSKNPSERYAYKAGDKLPESVDWRGKGAVNPIKDQGSCGSCWAF 144

Query: 156 SAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAK 215
           S VAA+EGIN I T +L SLSEQELVDCD    + GC GGLMD AF+FII+N GL TE  
Sbjct: 145 STVAAVEGINQIVTGELISLSEQELVDCDRF-YNAGCNGGLMDYAFQFIINNGGLDTEKD 203

Query: 216 YPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYS 275
           YPY  +D +C++ +    A  I G+EDV   +E AL KAVA+QPVSVAI+ASG   QFY 
Sbjct: 204 YPYLGNDDTCDRDKMKTKAVSIDGFEDVLPFDEKALQKAVAHQPVSVAIEASGMALQFYQ 263

Query: 276 SGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDI-DAKEGL 334
           SGVFTG+CGT LDHGV  VGYGT + G  YWLV+NSWGT WGE+GYI+MQR++ D   G 
Sbjct: 264 SGVFTGECGTALDHGVVVVGYGT-EKGLDYWLVRNSWGTEWGEHGYIKMQRNVRDTYTGR 322

Query: 335 CGIAMQASYPT 345
           CGIAM++SYP 
Sbjct: 323 CGIAMESSYPV 333


>gi|168058022|ref|XP_001781010.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162667567|gb|EDQ54194.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 457

 Score =  332 bits (850), Expect = 2e-88,   Method: Compositional matrix adjust.
 Identities = 173/318 (54%), Positives = 213/318 (66%), Gaps = 13/318 (4%)

Query: 32  DATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFAD 91
           D  +  +   W  ++G+VY    E+  RF ++K+N+EYI   + K  N  Y LG+ +FAD
Sbjct: 38  DQLLAGQFAAWAHKHGKVYSAAEERAHRFLVWKDNLEYIQRHSEK--NLSYWLGLTKFAD 95

Query: 92  QTNEEFRAPRNGYK----RRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQG 147
            TNEEFR    G +    RRL   R++     SFRY N+  P SIDWR+KGAVT VKDQG
Sbjct: 96  LTNEEFRRQYTGTRIDRSRRLKKGRNATG---SFRYANSEAPKSIDWREKGAVTSVKDQG 152

Query: 148 QCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISN 207
            CG CWAFSAV ++EGIN I T    SLS QELVDCD    +QGC GGLMD AF+F+I N
Sbjct: 153 SCGSCWAFSAVGSVEGINAIRTGDAISLSVQELVDCDKK-YNQGCNGGLMDYAFDFVIQN 211

Query: 208 KGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDAS 267
            G+ TE  YPY+  DG C+  + N     I  YEDVP N+E AL KAVA QPVSVAI+A 
Sbjct: 212 GGIDTEKDYPYQGYDGRCDVNKMNARVVTIDSYEDVPENDEEALKKAVAGQPVSVAIEAG 271

Query: 268 GSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRD 327
           G DFQ YS GVFTG+CGT+LDHGV AVGYG+ + G  YW+VKNSWG  WGE+GY+RMQR+
Sbjct: 272 GRDFQLYSGGVFTGRCGTDLDHGVLAVGYGS-EKGLDYWIVKNSWGEYWGESGYLRMQRN 330

Query: 328 I--DAKEGLCGIAMQASY 343
           +  D   GLCGI ++ SY
Sbjct: 331 LKDDNGYGLCGINIEPSY 348


>gi|62320725|dbj|BAD95392.1| cysteine proteinase RD21A [Arabidopsis thaliana]
          Length = 433

 Score =  332 bits (850), Expect = 2e-88,   Method: Compositional matrix adjust.
 Identities = 165/319 (51%), Positives = 217/319 (68%), Gaps = 14/319 (4%)

Query: 31  NDATMNERHEMWMAQYGRVYRDNA--EKEMRFKIFKENVEYIASFNNKARNKPYKLGINE 88
           ++A +   +E W+ ++G+    N+  EK+ RF+IFK+N+ ++   N K  N  Y+LG+  
Sbjct: 42  SEAEVMSIYEAWLVKHGKAQSQNSLVEKDRRFEIFKDNLRFVDEHNEK--NLSYRLGLTR 99

Query: 89  FADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYE---NASVPASIDWRKKGAVTGVKD 145
           FAD TN+E+R+   G K      R +     S RYE      +P SIDWRKKGAV  VKD
Sbjct: 100 FADLTNDEYRSKYLGAKMEKKGERRT-----SLRYEARVGDELPESIDWRKKGAVAEVKD 154

Query: 146 QGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFII 205
           QG CG CWAFS + A+EGIN I T  L +LSEQELVDCDTS  ++GC GGLMD AFEFII
Sbjct: 155 QGGCGSCWAFSTIGAVEGINQIVTGDLITLSEQELVDCDTS-YNEGCNGGLMDYAFEFII 213

Query: 206 SNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAID 265
            N G+ T+  YPYK  DG+C++   N     I  YEDVP+ +E +L KAVA+QP+S+AI+
Sbjct: 214 KNGGIDTDKDYPYKGVDGTCDQIRKNAKVVTIDSYEDVPTYSEESLKKAVAHQPISIAIE 273

Query: 266 ASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQ 325
           A G  FQ Y SG+F G CGT+LDHGV AVGYGT ++G  YW+V+NSWG +WGE+GY+RM 
Sbjct: 274 AGGRAFQLYDSGIFDGSCGTQLDHGVVAVGYGT-ENGKDYWIVRNSWGKSWGESGYLRMA 332

Query: 326 RDIDAKEGLCGIAMQASYP 344
           R+I +  G CGIA++ SYP
Sbjct: 333 RNIASSSGKCGIAIEPSYP 351


>gi|357154164|ref|XP_003576692.1| PREDICTED: vignain-like [Brachypodium distachyon]
          Length = 427

 Score =  332 bits (850), Expect = 2e-88,   Method: Compositional matrix adjust.
 Identities = 165/316 (52%), Positives = 203/316 (64%), Gaps = 9/316 (2%)

Query: 35  MNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTN 94
           M  R E WM ++GR Y +  EK+ RF+++KEN+  I  FN+      Y L  N+FAD TN
Sbjct: 115 MRMRFEQWMGKHGRAYANGGEKQRRFEVYKENLALIEEFNSGGHG--YTLTDNKFADLTN 172

Query: 95  EEFRAPRNGYKRRLP-----SVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQC 149
           EEFRA   G     P     +  +S   ++     +  +P  +DWRKKGAV  VK+QG C
Sbjct: 173 EEFRAKMLGGLGADPDRRRRARHASNALELPGNDNSTDLPKDVDWRKKGAVVEVKNQGSC 232

Query: 150 GCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKG 209
           G CWAFSAVAAMEG+N I   KL SLSEQELVDCD   E  GC GG M  AFEF+++N G
Sbjct: 233 GSCWAFSAVAAMEGLNQIKNGKLVSLSEQELVDCDA--EAVGCAGGFMSWAFEFVMANHG 290

Query: 210 LATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGS 269
           L TEA YPYK  +G+C   + N S+  I+GY +V  N+EA L+K  A QPVSVA+DA G 
Sbjct: 291 LTTEASYPYKGINGACQTAKLNESSVSITGYVNVTVNSEAELLKVAAVQPVSVAVDAGGF 350

Query: 270 DFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDID 329
            FQ Y+ GVF+G C  +++HGVT VGYG  D   KYW+VKNSWG  WGE GY+ MQRD  
Sbjct: 351 LFQLYAGGVFSGPCTAQINHGVTVVGYGETDKAEKYWIVKNSWGPEWGEAGYMLMQRDAG 410

Query: 330 AKEGLCGIAMQASYPT 345
              GLCGIAM ASYP 
Sbjct: 411 VPTGLCGIAMLASYPV 426


>gi|47169030|pdb|1S4V|A Chain A, The 2.0 A Crystal Structure Of The Kdel-Tailed Cysteine
           Endopeptidase Functioning In Programmed Cell Death Of
           Ricinus Communis Endosperm
 gi|47169031|pdb|1S4V|B Chain B, The 2.0 A Crystal Structure Of The Kdel-Tailed Cysteine
           Endopeptidase Functioning In Programmed Cell Death Of
           Ricinus Communis Endosperm
          Length = 229

 Score =  332 bits (850), Expect = 2e-88,   Method: Compositional matrix adjust.
 Identities = 158/218 (72%), Positives = 175/218 (80%), Gaps = 1/218 (0%)

Query: 127 SVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTS 186
           +VPAS+DWRKKGAVT VKDQGQCG CWAFS + A+EGIN I T KL SLSEQELVDCDT 
Sbjct: 1   TVPASVDWRKKGAVTSVKDQGQCGSCWAFSTIVAVEGINQIKTNKLVSLSEQELVDCDTD 60

Query: 187 GEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSN 246
            ++QGC GGLMD AFEFI    G+ TEA YPY+A DG+C+  + N  A  I G+E+VP N
Sbjct: 61  -QNQGCNGGLMDYAFEFIKQRGGITTEANYPYEAYDGTCDVSKENAPAVSIDGHENVPEN 119

Query: 247 NEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYW 306
           +E AL+KAVANQPVSVAIDA GSDFQFYS GVFTG CGTELDHGV  VGYGT  DGTKYW
Sbjct: 120 DENALLKAVANQPVSVAIDAGGSDFQFYSEGVFTGSCGTELDHGVAIVGYGTTIDGTKYW 179

Query: 307 LVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
            VKNSWG  WGE GYIRM+R I  KEGLCGIAM+ASYP
Sbjct: 180 TVKNSWGPEWGEKGYIRMERGISDKEGLCGIAMEASYP 217


>gi|3980198|emb|CAA46863.1| thiolprotease [Pisum sativum]
          Length = 464

 Score =  332 bits (850), Expect = 2e-88,   Method: Compositional matrix adjust.
 Identities = 175/351 (49%), Positives = 230/351 (65%), Gaps = 13/351 (3%)

Query: 1   MAMILLENKLVLA---AILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKE 57
           +  I L   L LA    I+      P   +   ND  +   +E W+ ++G+ Y    EKE
Sbjct: 7   ILFITLTFTLSLALDMCIISYDKTHPDKSTPRTNDQVLT-MYEEWLVKHGKNYNALGEKE 65

Query: 58  MRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETT 117
            RF+IFK+N+ +I   N+K  N  ++LG+N FAD TNEE+R    G  R  P+ R+ +  
Sbjct: 66  KRFEIFKDNLGFIDEHNSK--NLSFRLGLNRFADLTNEEYRTRFLG-TRINPNRRNRKVN 122

Query: 118 DVSFRYENA---SVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTS 174
             + RY       +P S+DWRK+GAV GVKDQG CG CWAFSA+AA+EG+N + T  L S
Sbjct: 123 SQTNRYATRVGDKLPESVDWRKEGAVVGVKDQGSCGSCWAFSAIAAVEGVNKLATGDLIS 182

Query: 175 LSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSA 234
           LSEQELVDCDTS  ++GC GGLMD AFEFII+   L  E  YPY+A DG C++   N   
Sbjct: 183 LSEQELVDCDTS-YNEGCNGGLMDYAFEFIINMVALTPEEDYPYRAIDGRCDQNRKNAKV 241

Query: 235 AKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAV 294
             I  YEDVP+ +E AL KAVANQ ++VA++  G +FQ Y SGVFTG+CGT LDHGV AV
Sbjct: 242 VSIDQYEDVPAYDEGALKKAVANQVIAVAVEGGGREFQLYDSGVFTGRCGTALDHGVAAV 301

Query: 295 GYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDI-DAKEGLCGIAMQASYP 344
           GYGT ++G  YW+V+NSWG +WGE GYIR++R++  +K G CGIA++ SYP
Sbjct: 302 GYGT-ENGKDYWIVRNSWGGSWGEAGYIRLERNLATSKSGKCGIAIEPSYP 351


>gi|146215982|gb|ABQ10193.1| actinidin Act2b [Actinidia eriantha]
          Length = 378

 Score =  332 bits (850), Expect = 2e-88,   Method: Compositional matrix adjust.
 Identities = 178/348 (51%), Positives = 218/348 (62%), Gaps = 19/348 (5%)

Query: 1   MAMILLENKLVLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRF 60
           M+++     L+L+  L +      S  RT ND  M   +E W+ + G+ Y    EKEMRF
Sbjct: 10  MSLLFFSTLLILSLALDI----ENSVQRT-NDQVM-AMYESWLVEQGKSYNSLDEKEMRF 63

Query: 61  KIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVS 120
           +IFKEN+  I   N  A N+ Y LG+N FAD T+EE+R+   G K           TDVS
Sbjct: 64  EIFKENLRIIDDHNADA-NRSYSLGLNRFADLTDEEYRSTYLGLKM-------GPKTDVS 115

Query: 121 FRYE---NASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSE 177
             Y      ++P  +DWR  GAV GVK+QG C  CWAFSAV A+EGIN I T  L SLSE
Sbjct: 116 NEYMPKVGEALPDYVDWRTVGAVVGVKNQGLCSSCWAFSAVTAVEGINKIVTGNLISLSE 175

Query: 178 QELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKI 237
           QELVDC  +   +GC  GLM DAF+FII+N G+ TE  YPY A DG CN    N     I
Sbjct: 176 QELVDCGRTQRTKGCNRGLMTDAFQFIINNGGINTEDNYPYTAKDGQCNLSLKNQKYVTI 235

Query: 238 SGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYG 297
             Y++VPSNNE AL KAVA QPVSV +++ G  F+ Y+SG+FTG CGT +DHGVT VGYG
Sbjct: 236 DNYKNVPSNNEMALKKAVAYQPVSVGVESEGGKFKLYTSGIFTGFCGTAVDHGVTIVGYG 295

Query: 298 TADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPT 345
           T + G  YW+VKNSWGT WGENGYIR+QR+I    G CGIA   SYP 
Sbjct: 296 T-ERGMDYWIVKNSWGTNWGENGYIRIQRNIGGA-GKCGIARMPSYPV 341


>gi|14517542|gb|AAK62661.1| F2G19.31/F2G19.31 [Arabidopsis thaliana]
 gi|19548039|gb|AAL87383.1| F2G19.31/F2G19.31 [Arabidopsis thaliana]
          Length = 462

 Score =  331 bits (849), Expect = 3e-88,   Method: Compositional matrix adjust.
 Identities = 165/319 (51%), Positives = 217/319 (68%), Gaps = 14/319 (4%)

Query: 31  NDATMNERHEMWMAQYGRVYRDNA--EKEMRFKIFKENVEYIASFNNKARNKPYKLGINE 88
           ++A +   +E W+ ++G+    N+  EK+ RF+IFK+N+ ++   N K  N  Y+LG+  
Sbjct: 42  SEAEVMSIYEAWLVKHGKAQSQNSLVEKDRRFEIFKDNLRFVDEHNEK--NLSYRLGLTR 99

Query: 89  FADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYE---NASVPASIDWRKKGAVTGVKD 145
           FAD TN+E+R+   G K      R +     S RYE      +P SIDWRKKGAV  VKD
Sbjct: 100 FADLTNDEYRSKYLGAKMEKKGERRT-----SLRYEARVGDELPESIDWRKKGAVAEVKD 154

Query: 146 QGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFII 205
           QG CG CWAFS + A+EGIN I T  L +LSEQELVDCDTS  ++GC GGLMD AFEFII
Sbjct: 155 QGGCGSCWAFSTIGAVEGINQIVTGDLITLSEQELVDCDTS-YNEGCNGGLMDYAFEFII 213

Query: 206 SNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAID 265
            N G+ T+  YPYK  DG+C++   N     I  YEDVP+ +E +L KAVA+QP+S+AI+
Sbjct: 214 KNGGIDTDKDYPYKGVDGTCDQIRKNAKVVTIDSYEDVPTYSEESLKKAVAHQPISIAIE 273

Query: 266 ASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQ 325
           A G  FQ Y SG+F G CGT+LDHGV AVGYGT ++G  YW+V+NSWG +WGE+GY+RM 
Sbjct: 274 AGGRAFQLYDSGIFDGSCGTQLDHGVVAVGYGT-ENGKDYWIVRNSWGKSWGESGYLRMA 332

Query: 326 RDIDAKEGLCGIAMQASYP 344
           R+I +  G CGIA++ SYP
Sbjct: 333 RNIASSSGKCGIAIEPSYP 351


>gi|18401614|ref|NP_564497.1| cysteine proteinase RD21a [Arabidopsis thaliana]
 gi|1172873|sp|P43297.1|RD21A_ARATH RecName: Full=Cysteine proteinase RD21a; Short=RD21; Flags:
           Precursor
 gi|12321010|gb|AAG50628.1|AC083835_13 cysteine protease, putative [Arabidopsis thaliana]
 gi|435619|dbj|BAA02374.1| thiol protease [Arabidopsis thaliana]
 gi|18175926|gb|AAL59952.1| putative cysteine proteinase RD21A [Arabidopsis thaliana]
 gi|22136972|gb|AAM91715.1| putative cysteine proteinase RD21A [Arabidopsis thaliana]
 gi|332194014|gb|AEE32135.1| cysteine proteinase RD21a [Arabidopsis thaliana]
          Length = 462

 Score =  331 bits (849), Expect = 3e-88,   Method: Compositional matrix adjust.
 Identities = 165/319 (51%), Positives = 217/319 (68%), Gaps = 14/319 (4%)

Query: 31  NDATMNERHEMWMAQYGRVYRDNA--EKEMRFKIFKENVEYIASFNNKARNKPYKLGINE 88
           ++A +   +E W+ ++G+    N+  EK+ RF+IFK+N+ ++   N K  N  Y+LG+  
Sbjct: 42  SEAEVMSIYEAWLVKHGKAQSQNSLVEKDRRFEIFKDNLRFVDEHNEK--NLSYRLGLTR 99

Query: 89  FADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYE---NASVPASIDWRKKGAVTGVKD 145
           FAD TN+E+R+   G K      R +     S RYE      +P SIDWRKKGAV  VKD
Sbjct: 100 FADLTNDEYRSKYLGAKMEKKGERRT-----SLRYEARVGDELPESIDWRKKGAVAEVKD 154

Query: 146 QGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFII 205
           QG CG CWAFS + A+EGIN I T  L +LSEQELVDCDTS  ++GC GGLMD AFEFII
Sbjct: 155 QGGCGSCWAFSTIGAVEGINQIVTGDLITLSEQELVDCDTS-YNEGCNGGLMDYAFEFII 213

Query: 206 SNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAID 265
            N G+ T+  YPYK  DG+C++   N     I  YEDVP+ +E +L KAVA+QP+S+AI+
Sbjct: 214 KNGGIDTDKDYPYKGVDGTCDQIRKNAKVVTIDSYEDVPTYSEESLKKAVAHQPISIAIE 273

Query: 266 ASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQ 325
           A G  FQ Y SG+F G CGT+LDHGV AVGYGT ++G  YW+V+NSWG +WGE+GY+RM 
Sbjct: 274 AGGRAFQLYDSGIFDGSCGTQLDHGVVAVGYGT-ENGKDYWIVRNSWGKSWGESGYLRMA 332

Query: 326 RDIDAKEGLCGIAMQASYP 344
           R+I +  G CGIA++ SYP
Sbjct: 333 RNIASSSGKCGIAIEPSYP 351


>gi|50355615|dbj|BAD29956.1| cysteine protease [Daucus carota]
          Length = 423

 Score =  331 bits (848), Expect = 3e-88,   Method: Compositional matrix adjust.
 Identities = 169/305 (55%), Positives = 213/305 (69%), Gaps = 8/305 (2%)

Query: 43  MAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRN 102
           + ++ + Y     KE RF+IFK+N+ +I   +NK  N+ +KLG+N+FAD +NEE+++   
Sbjct: 11  LVKHHKNYNALGAKEKRFEIFKDNLRFIDE-HNKGVNQSFKLGLNKFADLSNEEYKSMFL 69

Query: 103 GYKRRLPSVRSSETTDVSFRYE-NASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAM 161
           G   R+   R    +D  F+Y     +P S+DWR+KGAV  VKDQGQCG CWAFS VAA+
Sbjct: 70  G--GRMVRDRKGFESD-RFKYGVGDELPQSVDWREKGAVAPVKDQGQCGSCWAFSTVAAV 126

Query: 162 EGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKAS 221
           EGIN I T  L SLSEQELVDCD  G +QGC GG MD AFEFI+ N G+ TE  YPYK  
Sbjct: 127 EGINQIATGDLISLSEQELVDCD-KGFNQGCNGGFMDYAFEFIVKNGGIDTEDDYPYKGV 185

Query: 222 DGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTG 281
           DG C++   N     I+G+EDVP N+E +L KAVA+QPVSVAI+A G  FQ Y SG+F G
Sbjct: 186 DGQCDQNRKNAKVVTINGFEDVPQNDEKSLKKAVAHQPVSVAIEAGGRAFQLYESGIFNG 245

Query: 282 QCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDI-DAKEGLCGIAMQ 340
            CGT+LDHGV AVGYGT +DG  YW+V+NSWG  WGENGYIR++R++     G CGIAMQ
Sbjct: 246 LCGTDLDHGVVAVGYGT-EDGKDYWIVRNSWGPNWGENGYIRLERNVASTNTGKCGIAMQ 304

Query: 341 ASYPT 345
            SYPT
Sbjct: 305 PSYPT 309


>gi|90265242|emb|CAH67695.1| H0624F09.3 [Oryza sativa Indica Group]
          Length = 494

 Score =  331 bits (848), Expect = 3e-88,   Method: Compositional matrix adjust.
 Identities = 166/294 (56%), Positives = 209/294 (71%), Gaps = 8/294 (2%)

Query: 55  EKEMRFKIFKENVEYIASFNNKARNKP-YKLGINEFADQTNEEFRAPRNGYKRRLPSVRS 113
           E E RF++F +N++++ + N +A  +  ++LG+N FAD TN EFRA    Y    P+ R 
Sbjct: 84  EHERRFRVFWDNLKFVDAHNARADERGGFRLGMNRFADLTNGEFRAT---YLGTTPAGRG 140

Query: 114 SETTDVSFRYENA-SVPASIDWRKKGAVTG-VKDQGQCGCCWAFSAVAAMEGINHITTRK 171
               + ++R++   ++P S+DWR KGAV   VK+QGQCG CWAFSAVAA+EGIN I T +
Sbjct: 141 RRVGE-AYRHDGVEALPDSVDWRDKGAVVAPVKNQGQCGSCWAFSAVAAVEGINKIVTGE 199

Query: 172 LTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEAN 231
           L SLSEQELV+C  +G++ GC GG+MDDAF FI  N GL TE  YPY A DG CN  + +
Sbjct: 200 LVSLSEQELVECARNGQNSGCNGGIMDDAFAFIARNGGLDTEEDYPYTAMDGKCNLAKRS 259

Query: 232 PSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGV 291
                I G+EDVP N+E +L KAVA+QPVSVAIDA G +FQ Y SGVFTG+CGT LDHGV
Sbjct: 260 RKVVSIDGFEDVPENDELSLQKAVAHQPVSVAIDAGGREFQLYDSGVFTGRCGTNLDHGV 319

Query: 292 TAVGYGT-ADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
            AVGYGT A  G  YW V+NSWG  WGENGYIRM+R++ A+ G CGIAM ASYP
Sbjct: 320 VAVGYGTDAATGAAYWTVRNSWGPDWGENGYIRMERNVTARTGKCGIAMMASYP 373


>gi|18394919|ref|NP_564126.1| Xylem cysteine proteinase 2 [Arabidopsis thaliana]
 gi|71153409|sp|Q9LM66.2|XCP2_ARATH RecName: Full=Xylem cysteine proteinase 2; Short=AtXCP2; Flags:
           Precursor
 gi|4836904|gb|AAD30607.1|AC007369_17 Putative cysteine proteinase [Arabidopsis thaliana]
 gi|6708183|gb|AAF25832.1|AF191028_1 papain-type cysteine endopeptidase XCP2 [Arabidopsis thaliana]
 gi|28466959|gb|AAO44088.1| At1g20850 [Arabidopsis thaliana]
 gi|110743795|dbj|BAE99733.1| putative cysteine proteinase [Arabidopsis thaliana]
 gi|332191910|gb|AEE30031.1| Xylem cysteine proteinase 2 [Arabidopsis thaliana]
          Length = 356

 Score =  331 bits (848), Expect = 4e-88,   Method: Compositional matrix adjust.
 Identities = 163/310 (52%), Positives = 216/310 (69%), Gaps = 6/310 (1%)

Query: 37  ERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEE 96
           E  E W++ + + Y    EK +RF++FK+N+++I   N K   K Y LG+NEFAD ++EE
Sbjct: 49  ELFENWISNFEKAYETVEEKFLRFEVFKDNLKHIDETNKKG--KSYWLGLNEFADLSHEE 106

Query: 97  FRAPRNGYKRRLPSVRSSETTDVSFRYENA-SVPASIDWRKKGAVTGVKDQGQCGCCWAF 155
           F+    G K  +   R  E +   F Y +  +VP S+DWRKKGAV  VK+QG CG CWAF
Sbjct: 107 FKKMYLGLKTDIVR-RDEERSYAEFAYRDVEAVPKSVDWRKKGAVAEVKNQGSCGSCWAF 165

Query: 156 SAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAK 215
           S VAA+EGIN I T  LT+LSEQEL+DCDT+  + GC GGLMD AFE+I+ N GL  E  
Sbjct: 166 STVAAVEGINKIVTGNLTTLSEQELIDCDTT-YNNGCNGGLMDYAFEYIVKNGGLRKEED 224

Query: 216 YPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYS 275
           YPY   +G+C  ++       I+G++DVP+N+E +L+KA+A+QP+SVAIDASG +FQFYS
Sbjct: 225 YPYSMEEGTCEMQKDESETVTINGHQDVPTNDEKSLLKALAHQPLSVAIDASGREFQFYS 284

Query: 276 SGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLC 335
            GVF G+CG +LDHGV AVGYG++  G+ Y +VKNSWG  WGE GYIR++R+    EGLC
Sbjct: 285 GGVFDGRCGVDLDHGVAAVGYGSS-KGSDYIIVKNSWGPKWGEKGYIRLKRNTGKPEGLC 343

Query: 336 GIAMQASYPT 345
           GI   AS+PT
Sbjct: 344 GINKMASFPT 353


>gi|115461226|ref|NP_001054213.1| Os04g0670500 [Oryza sativa Japonica Group]
 gi|62510688|sp|Q7XR52.2|CYSP1_ORYSJ RecName: Full=Cysteine protease 1; AltName: Full=OsCP1; Flags:
           Precursor
 gi|38345300|emb|CAE02828.2| OSJNBa0043A12.33 [Oryza sativa Japonica Group]
 gi|113565784|dbj|BAF16127.1| Os04g0670500 [Oryza sativa Japonica Group]
 gi|215741575|dbj|BAG98070.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 490

 Score =  330 bits (847), Expect = 4e-88,   Method: Compositional matrix adjust.
 Identities = 166/294 (56%), Positives = 209/294 (71%), Gaps = 8/294 (2%)

Query: 55  EKEMRFKIFKENVEYIASFNNKARNKP-YKLGINEFADQTNEEFRAPRNGYKRRLPSVRS 113
           E E RF++F +N++++ + N +A  +  ++LG+N FAD TN EFRA    Y    P+ R 
Sbjct: 84  EHERRFRVFWDNLKFVDAHNARADERGGFRLGMNRFADLTNGEFRAT---YLGTTPAGRG 140

Query: 114 SETTDVSFRYENA-SVPASIDWRKKGAVTG-VKDQGQCGCCWAFSAVAAMEGINHITTRK 171
               + ++R++   ++P S+DWR KGAV   VK+QGQCG CWAFSAVAA+EGIN I T +
Sbjct: 141 RRVGE-AYRHDGVEALPDSVDWRDKGAVVAPVKNQGQCGSCWAFSAVAAVEGINKIVTGE 199

Query: 172 LTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEAN 231
           L SLSEQELV+C  +G++ GC GG+MDDAF FI  N GL TE  YPY A DG CN  + +
Sbjct: 200 LVSLSEQELVECARNGQNSGCNGGIMDDAFAFIARNGGLDTEEDYPYTAMDGKCNLAKRS 259

Query: 232 PSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGV 291
                I G+EDVP N+E +L KAVA+QPVSVAIDA G +FQ Y SGVFTG+CGT LDHGV
Sbjct: 260 RKVVSIDGFEDVPENDELSLQKAVAHQPVSVAIDAGGREFQLYDSGVFTGRCGTNLDHGV 319

Query: 292 TAVGYGT-ADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
            AVGYGT A  G  YW V+NSWG  WGENGYIRM+R++ A+ G CGIAM ASYP
Sbjct: 320 VAVGYGTDAATGAAYWTVRNSWGPDWGENGYIRMERNVTARTGKCGIAMMASYP 373


>gi|89274062|dbj|BAE80740.1| cysteine proteinase [Platycodon grandiflorus]
          Length = 462

 Score =  330 bits (847), Expect = 5e-88,   Method: Compositional matrix adjust.
 Identities = 175/351 (49%), Positives = 229/351 (65%), Gaps = 14/351 (3%)

Query: 1   MAMILLENKLVLAAILVLGV------WAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNA 54
           MA+ LL    V ++ L + +       A +S  RT  D  +   +E W+ ++G+ Y    
Sbjct: 8   MAIALLFALFVASSALDMSIINYDATHASKSSWRT--DDEVMAMYESWLVKHGKSYNALG 65

Query: 55  EKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSS 114
           EKE RF+IFK+N+ +I   +N   N  YK+G+N FAD TNEE+R+   G K + P +   
Sbjct: 66  EKEKRFQIFKDNLRFIDE-HNAEENLSYKVGLNRFADLTNEEYRSTYLGAKSK-PKLSKV 123

Query: 115 ETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTS 174
           ++   + R  + S+P S+DWR KGAV  +KDQG CG CWAFS V A+EGIN I T +L +
Sbjct: 124 KSDRYAPRVGD-SLPESVDWRAKGAVAPIKDQGSCGSCWAFSTVNAVEGINQIVTGELIT 182

Query: 175 LSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSA 234
           LSEQELVDCD S  ++GC+GGLMD  FEFII+N G+ T+  YPY   D  C++   N   
Sbjct: 183 LSEQELVDCDKS-YNEGCDGGLMDYGFEFIINNGGIDTDKDYPYLGRDARCDQYRKNAKV 241

Query: 235 AKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAV 294
             I  YEDVP NNE AL KAVA+QPVSV I+  G  FQFY SG+FTG+CGT LDHGV  V
Sbjct: 242 VTIDSYEDVPVNNEEALKKAVASQPVSVGIEGGGRAFQFYDSGIFTGKCGTALDHGVNVV 301

Query: 295 GYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKE-GLCGIAMQASYP 344
           GYGT + G  YW+V+NSWG++WGE GYIRM+R++     G CGIAM+ SYP
Sbjct: 302 GYGT-EKGKDYWIVRNSWGSSWGEAGYIRMERNLAGTSVGKCGIAMEPSYP 351


>gi|341850671|gb|AEK97329.1| chromoplast senescence-associated protein 12 [Brassica rapa var.
           parachinensis]
          Length = 260

 Score =  330 bits (847), Expect = 5e-88,   Method: Compositional matrix adjust.
 Identities = 161/260 (61%), Positives = 195/260 (75%), Gaps = 5/260 (1%)

Query: 88  EFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENAS---VPASIDWRKKGAVTGVK 144
           +FA+ TN+EFR+   GYK        S+T   SFRY+N S   +P ++DWRKKGAVT +K
Sbjct: 1   QFAEITNDEFRSMYTGYKGDSVLSSQSQTKSTSFRYQNVSSGALPIAVDWRKKGAVTPIK 60

Query: 145 DQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFI 204
           +QG CGCCWAFSAVAA+EG   I   KL SLSEQ+LVDCDT+  D GC GGL+D AFE I
Sbjct: 61  NQGSCGCCWAFSAVAAIEGATQIKKGKLISLSEQQLVDCDTN--DFGCSGGLIDTAFEHI 118

Query: 205 ISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAI 264
           ++  GL TE+ YPYK  D +C  K   PSAA I+GYEDVP N+E ALMKAVA+QPVSV I
Sbjct: 119 MATGGLTTESNYPYKGEDATCKIKSTXPSAASITGYEDVPVNDENALMKAVAHQPVSVGI 178

Query: 265 DASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRM 324
           +  G DFQFYSSGVFTG+C T LDH VTAVGY  +  G+KYW++KNSWGT WGE GY+R+
Sbjct: 179 EGGGFDFQFYSSGVFTGECTTYLDHAVTAVGYSQSSAGSKYWIIKNSWGTKWGEGGYMRI 238

Query: 325 QRDIDAKEGLCGIAMQASYP 344
           ++DI  KEGLCG+AM+ASYP
Sbjct: 239 KKDIKDKEGLCGLAMKASYP 258


>gi|297744465|emb|CBI37727.3| unnamed protein product [Vitis vinifera]
          Length = 331

 Score =  330 bits (846), Expect = 5e-88,   Method: Compositional matrix adjust.
 Identities = 167/308 (54%), Positives = 211/308 (68%), Gaps = 27/308 (8%)

Query: 38  RHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEF 97
           R E W++++G+VY+   EK  RF++F+EN+ +I   N +  +  Y LG+NEFAD ++EEF
Sbjct: 48  RFESWVSKHGKVYKSMEEKLHRFEVFRENLNHIDERNKEVSS--YWLGLNEFADLSHEEF 105

Query: 98  RAPRNGYKRRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQCGCCWAFSA 157
           +                 + DV      A +P S+DWRKKGAVT VK+QG CG CWAFS 
Sbjct: 106 K-----------------SKDV------ADLPESVDWRKKGAVTHVKNQGACGSCWAFST 142

Query: 158 VAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYP 217
           VAA+EGIN I T  LT+LSEQEL+DCDT+  + GC GGLMD AF FI SN GL  E  YP
Sbjct: 143 VAAVEGINQIVTGNLTTLSEQELIDCDTTF-NSGCNGGLMDYAFAFIASNGGLHKEDDYP 201

Query: 218 YKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSG 277
           Y   +G+C +++ +     ISGYEDVP  +E +L+KA+A+QP+SVAI+ASG DFQFYS G
Sbjct: 202 YLMEEGTCEEQKEDVDIVTISGYEDVPEKDEESLLKALAHQPLSVAIEASGRDFQFYSGG 261

Query: 278 VFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGI 337
           VF G CGTELDHGV AVGYG++  G  Y +VKNSWG  WGE GYIRM+R+    EGLCGI
Sbjct: 262 VFNGPCGTELDHGVAAVGYGSS-KGLDYIIVKNSWGPKWGEKGYIRMKRNTGKTEGLCGI 320

Query: 338 AMQASYPT 345
              ASYPT
Sbjct: 321 NKMASYPT 328


>gi|357465603|ref|XP_003603086.1| Cysteine proteinase [Medicago truncatula]
 gi|355492134|gb|AES73337.1| Cysteine proteinase [Medicago truncatula]
          Length = 474

 Score =  330 bits (846), Expect = 5e-88,   Method: Compositional matrix adjust.
 Identities = 165/320 (51%), Positives = 224/320 (70%), Gaps = 10/320 (3%)

Query: 31  NDATMNERHEMWMAQYGRVYR--DNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINE 88
           +D  +   +E W  ++G++    D +EK+ RF+IFK+N+++I   N  A N+ YK+G+N 
Sbjct: 45  SDKEVKNIYEEWRVKHGKLNNNIDGSEKDKRFEIFKDNLKFIDEHN--AENRTYKVGLNR 102

Query: 89  FADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENA---SVPASIDWRKKGAVTGVKD 145
           FAD +NEE+R+   G K     +  + T   S RY  +    +P S+DWR +GAV  VKD
Sbjct: 103 FADLSNEEYRSRYLGTKIDPIGMMMARTKTRSNRYAPSVGDKLPKSVDWRSQGAVVQVKD 162

Query: 146 QGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFII 205
           QG CG CWAFS +AA+EGIN I T +L SLSEQELVDCD +  + GC+GGLM+ AFEFII
Sbjct: 163 QGSCGSCWAFSTIAAVEGINKIVTGELVSLSEQELVDCDRT-VNAGCDGGLMEYAFEFII 221

Query: 206 SNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAID 265
           +N G+ ++  YPY+  DG C++ + N     I  YE VP+ +E AL KAVANQP+SVAI+
Sbjct: 222 NNGGIDSDEDYPYRGVDGKCDQYKKNARVVSIDDYEQVPAYDELALKKAVANQPISVAIE 281

Query: 266 ASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQ 325
           A G +FQ Y SG+FTG+CGT LDHGVTAVGYGT ++G  YW+V+NSWG +WGE+GY+RM+
Sbjct: 282 AGGREFQLYVSGIFTGKCGTALDHGVTAVGYGT-ENGVDYWIVRNSWGKSWGESGYVRME 340

Query: 326 RDIDAK-EGLCGIAMQASYP 344
           R++ A   G CGI MQ+SYP
Sbjct: 341 RNLAASVAGKCGIVMQSSYP 360


>gi|297830592|ref|XP_002883178.1| hypothetical protein ARALYDRAFT_479457 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297329018|gb|EFH59437.1| hypothetical protein ARALYDRAFT_479457 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 452

 Score =  330 bits (845), Expect = 7e-88,   Method: Compositional matrix adjust.
 Identities = 168/338 (49%), Positives = 222/338 (65%), Gaps = 7/338 (2%)

Query: 10  LVLAAILV-LGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVE 68
           L+ + +L+ L + +  +   T N+A     +E W+ +  + Y    EKE RF+IF +N++
Sbjct: 13  LIFSMLLISLSLGSVTAADTTRNEAEARRMYEQWLVENRKNYNGLGEKETRFEIFTDNLK 72

Query: 69  YIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENASV 128
           YI   +N   N+ +++G+  FAD TN+EFRA     + ++   R     +        ++
Sbjct: 73  YIEE-HNSVPNQTFEVGLTRFADLTNDEFRAIY--LRSKMERTRVPVKGERYLYKVGDTL 129

Query: 129 PASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGE 188
           P  IDWR KGAV  VKDQG CG CWAFSA+ A+EGIN I T +L SLSEQELVDCDTS  
Sbjct: 130 PDQIDWRAKGAVNPVKDQGNCGSCWAFSAIGAVEGINQIKTGELISLSEQELVDCDTS-Y 188

Query: 189 DQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGS-CNKKEANPSAAKISGYEDVPSNN 247
           + GC GGLMD AF+FII N G+ TE  YPY A+D + CN  + N     I GYEDVP N+
Sbjct: 189 NGGCGGGLMDYAFKFIIENGGIDTEEDYPYTATDDNICNSDKKNSRVVTIDGYEDVPQND 248

Query: 248 EAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWL 307
           E +L KA+ANQP+SVAI+A G  FQ Y SGVFTG CGT LDHGV AVGYG+ + G  YW+
Sbjct: 249 EKSLKKALANQPISVAIEAGGRAFQLYKSGVFTGTCGTSLDHGVVAVGYGS-EGGQDYWI 307

Query: 308 VKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPT 345
           V+NSWG+ WGE+GY +++R+I    G CG+AM ASYPT
Sbjct: 308 VRNSWGSNWGESGYFKLERNIKESSGKCGVAMMASYPT 345


>gi|218181|dbj|BAA14402.1| oryzain alpha precursor [Oryza sativa Japonica Group]
          Length = 458

 Score =  328 bits (842), Expect = 2e-87,   Method: Compositional matrix adjust.
 Identities = 163/305 (53%), Positives = 208/305 (68%), Gaps = 6/305 (1%)

Query: 42  WMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARN--KPYKLGINEFADQTNEEFRA 99
           W A++G+ Y    E+E R+  F++N+ YI   N  A      ++LG+N FAD TNEE+R 
Sbjct: 43  WKAEHGKSYNAVGEEERRYAAFRDNLRYIDEHNAAADAGVHSFRLGLNRFADLTNEEYRD 102

Query: 100 PRNGYKRRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVA 159
              G + +    R  + +D     +N ++P S+DWR KGAV  +KDQG CG CWAFSA+A
Sbjct: 103 TYLGLRNK--PRRERKVSDRYLAADNEALPESVDWRTKGAVAEIKDQGGCGSCWAFSAIA 160

Query: 160 AMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYK 219
           A+E IN I T  L SLSEQELVDCDTS  ++GC GGLMD AF+FII+N G+ TE  YPYK
Sbjct: 161 AVEDINQIVTGDLISLSEQELVDCDTS-YNEGCNGGLMDYAFDFIINNGGIDTEDDYPYK 219

Query: 220 ASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVF 279
             D  C+    N     I  YEDV  N+E +L KAV NQPVSVAI+A G  FQ YSSG+F
Sbjct: 220 GKDERCDVNRKNAKVVTIDSYEDVTPNSETSLQKAVRNQPVSVAIEAGGRAFQLYSSGIF 279

Query: 280 TGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAM 339
           TG+CGT LDHGV AVGYGT ++G  YW+V+NSWG +WGE+GY+RM+R+I A  G CGIA+
Sbjct: 280 TGKCGTALDHGVAAVGYGT-ENGKDYWIVRNSWGKSWGESGYVRMERNIKASSGKCGIAV 338

Query: 340 QASYP 344
           + SYP
Sbjct: 339 EPSYP 343


>gi|168063167|ref|XP_001783545.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162664932|gb|EDQ51634.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 461

 Score =  328 bits (842), Expect = 2e-87,   Method: Compositional matrix adjust.
 Identities = 169/310 (54%), Positives = 208/310 (67%), Gaps = 11/310 (3%)

Query: 37  ERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEE 96
           E+   W  ++G+ Y D  +   RF ++K+N+ YI    +   N+ Y LG+ +FAD TNEE
Sbjct: 52  EQFAAWAHKHGKAYHDAEQCLHRFAVWKDNLAYI---RHSETNRTYSLGLTKFADLTNEE 108

Query: 97  FRAPRNGYKRRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQCGCCWAFS 156
           FR    G  R   S R+   T   FRY ++  P S+DWRK GAVT VKDQG CG CWAFS
Sbjct: 109 FRRMYTG-TRIDRSRRAKRRT--GFRYADSEAPESVDWRKNGAVTSVKDQGSCGSCWAFS 165

Query: 157 AVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKY 216
           AV ++EGIN I   +  SLSEQELVDCD    +QGC GGLMD AF+FII N G+ TE  Y
Sbjct: 166 AVGSVEGINAIRNGEAVSLSEQELVDCDLE-YNQGCNGGLMDYAFDFIIQNGGIDTEKDY 224

Query: 217 PYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSS 276
           PYK  DG C+  + N     I GYEDVP N+E AL KAVA QPVSVAI+A G DFQ Y+ 
Sbjct: 225 PYKGFDGRCDNSKKNAHVVTIDGYEDVPENDEEALKKAVAGQPVSVAIEAGGRDFQLYAQ 284

Query: 277 GVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQR---DIDAKEG 333
           GVF+G+CGT+LDHGV AVGYGT +DG  YW+VKNSWG  WGE+GY+RM+R   D +   G
Sbjct: 285 GVFSGECGTDLDHGVLAVGYGT-EDGVDYWIVKNSWGEYWGESGYLRMKRNMKDSNDGPG 343

Query: 334 LCGIAMQASY 343
           LCGI ++ SY
Sbjct: 344 LCGINIEPSY 353


>gi|18141281|gb|AAL60578.1|AF454956_1 senescence-associated cysteine protease [Brassica oleracea]
          Length = 445

 Score =  328 bits (841), Expect = 2e-87,   Method: Compositional matrix adjust.
 Identities = 165/307 (53%), Positives = 210/307 (68%), Gaps = 7/307 (2%)

Query: 40  EMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRA 99
           E W+ +  + Y    EK+ RF+IF +N++++   +N   N+ Y+LG+  FAD TNEEFRA
Sbjct: 38  ERWLVENHKNYNGLGEKDKRFEIFMDNLKFVQE-HNSVPNQSYELGLTRFADLTNEEFRA 96

Query: 100 PRNGYKRRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVA 159
                + ++   R S  ++         +P  +DWR KGAV  VKDQG CG CWAFSA+ 
Sbjct: 97  IY--LRSKMERTRDSVKSERYLHNVGDKLPDEVDWRAKGAVVPVKDQGSCGSCWAFSAIG 154

Query: 160 AMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYK 219
           A+EGIN I T +L SLSEQELVDCDTS  + GC GGLMD AF+FIISN G+ TE  YPY 
Sbjct: 155 AVEGINQIKTGELVSLSEQELVDCDTS-YNNGCGGGLMDYAFQFIISNGGIDTEEDYPYT 213

Query: 220 ASDGS-CNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGV 278
           A+D + CN  + N     I GYEDVP  NE +L KA+ANQP+SVAI+A G  FQ Y SGV
Sbjct: 214 ATDDNICNTDKKNTRVVTIDGYEDVPE-NENSLKKALANQPISVAIEAGGRGFQLYKSGV 272

Query: 279 FTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIA 338
           FTG CGT LDHGV AVGYGT+ +G  YW+++NSWG+ WGE+GYI++QR+I    G CG+A
Sbjct: 273 FTGTCGTALDHGVVAVGYGTS-EGQDYWIIRNSWGSNWGESGYIKLQRNIKDSSGKCGVA 331

Query: 339 MQASYPT 345
           M ASYPT
Sbjct: 332 MMASYPT 338


>gi|356515062|ref|XP_003526220.1| PREDICTED: LOW QUALITY PROTEIN: vignain-like [Glycine max]
          Length = 337

 Score =  328 bits (841), Expect = 2e-87,   Method: Compositional matrix adjust.
 Identities = 165/311 (53%), Positives = 215/311 (69%), Gaps = 8/311 (2%)

Query: 39  HEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFR 98
           HE WMAQ+G+VY+D AEKE   +IF+ N+E+I SF+    +K + L  N+FAD  +EEF+
Sbjct: 32  HEKWMAQHGKVYKDAAEKERCLQIFENNMEFIESFD-VCGDKSFNLSTNQFADLHDEEFK 90

Query: 99  AP-RNGYKRRLPSVRSSETTDVSFRYENAS-VPASIDWRKKGAVTGVKDQGQCGCCWAFS 156
           A   NG+K+      ++ET    FRY+N + +PAS+DWRK+G VT +KDQG+C  CWAFS
Sbjct: 91  ALLTNGHKKEHSLWTTTETL---FRYDNVTKIPASMDWRKRGVVTPIKDQGKCLSCWAFS 147

Query: 157 -AVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAK 215
             VA +EG++ I T +L  LSEQELVD    GE +GC G  ++DAF+FI     + +E  
Sbjct: 148 LCVATIEGLHQIITSELVPLSEQELVDF-VKGESEGCYGDYVEDAFKFITKKGRIESETH 206

Query: 216 YPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYS 275
           YPYK  + +C  K+     A+I GY+ VPS +E AL+KAVANQ VSV+++A  S FQFYS
Sbjct: 207 YPYKGVNNTCKVKKETHGVAQIKGYKKVPSKSENALLKAVANQLVSVSVEARDSAFQFYS 266

Query: 276 SGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLC 335
           SG+FTG+CGT+ DH V    YG + DGTKYWL KNSWGT WGE GYIR++ DI AKEGLC
Sbjct: 267 SGIFTGKCGTDTDHRVALASYGESGDGTKYWLAKNSWGTEWGEKGYIRIKXDIPAKEGLC 326

Query: 336 GIAMQASYPTA 346
           GIA    YP A
Sbjct: 327 GIAKYPYYPIA 337


>gi|2463584|dbj|BAA22544.1| FBSB precursor [Ananas comosus]
          Length = 356

 Score =  328 bits (841), Expect = 2e-87,   Method: Compositional matrix adjust.
 Identities = 169/339 (49%), Positives = 225/339 (66%), Gaps = 10/339 (2%)

Query: 9   KLVLAAILVLGVWA-PQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENV 67
           ++V   + +  +WA P + S       M +R E WM +YGRVY+DN EK  RF+IFK NV
Sbjct: 6   QVVFLFLFLCVMWASPSAASADEPSDPMMKRFEEWMVEYGRVYKDNDEKMRRFQIFKNNV 65

Query: 68  EYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFR-YENA 126
            +I +FN++  N  Y LGIN+F D TN EF A   G   R  ++       VSF   + +
Sbjct: 66  NHIETFNSRNENS-YTLGINQFTDMTNNEFIAQYTGGISRPLNIEREPV--VSFDDVDIS 122

Query: 127 SVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTS 186
           +VP SIDWR  GAVT VK+Q  CG CWAF+A+A +E I  I    L  LSEQ+++DC   
Sbjct: 123 AVPQSIDWRDYGAVTSVKNQNPCGACWAFAAIATVESIYKIKKGILEPLSEQQVLDC--- 179

Query: 187 GEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSN 246
            +  GC+GG    AFEFIISNKG+A+ A YPYKA+ G+C K    P++A I+GY  VP N
Sbjct: 180 AKGYGCKGGWEFRAFEFIISNKGVASGAIYPYKAAKGTC-KTNGVPNSAYITGYARVPRN 238

Query: 247 NEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYW 306
           NE+++M AV+ QP++VA+DA+ ++FQ+Y SGVF G CGT L+H VTA+GYG   +G KYW
Sbjct: 239 NESSMMYAVSKQPITVAVDAN-ANFQYYKSGVFNGPCGTSLNHAVTAIGYGQDSNGKKYW 297

Query: 307 LVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPT 345
           +VKNSWG  WGE GYIRM RD+ +  G+CGIA+ + YPT
Sbjct: 298 IVKNSWGARWGEAGYIRMARDVSSSSGICGIAIDSLYPT 336


>gi|146215976|gb|ABQ10190.1| actinidin Act1b [Actinidia arguta]
          Length = 380

 Score =  328 bits (840), Expect = 3e-87,   Method: Compositional matrix adjust.
 Identities = 170/339 (50%), Positives = 219/339 (64%), Gaps = 14/339 (4%)

Query: 10  LVLAAILVLGV-WAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVE 68
           L  + +LVL + +  ++ ++  ND  +   +E W+ +YG+ Y    E E RF+IFKE + 
Sbjct: 13  LFFSTLLVLSLAFNAKNLTKRTNDE-LKAMYESWLTKYGKSYNSLGEWERRFEIFKETLR 71

Query: 69  YIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYE---N 125
           +I   +N   N+ Y++G+N+FADQTNEEF++   G+         S    VS RYE    
Sbjct: 72  FIDE-HNADTNRSYRVGLNQFADQTNEEFQSTYLGF------TSGSNKMKVSNRYEPRVG 124

Query: 126 ASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDT 185
             +P  +DWR  GAV  +K QGQCG CWAFSA+A +EGIN I T  L SLSEQELVDC  
Sbjct: 125 QVLPDYVDWRSAGAVVDIKSQGQCGSCWAFSAIATVEGINKIVTGDLISLSEQELVDCGR 184

Query: 186 SGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPS 245
           +   +GC+GG + D F+FII+N G+ TEA YPY A DG CN    N   A I  YE+VP 
Sbjct: 185 TQNTRGCDGGSITDGFQFIINNGGINTEANYPYTAEDGQCNLDLQNEKYASIDTYENVPY 244

Query: 246 NNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKY 305
           NNE AL  AVA QPVSVA++A+G  FQ YSSG+FTG CGT +DH VT VGYGT + G  Y
Sbjct: 245 NNEWALQTAVAYQPVSVALEAAGDAFQHYSSGIFTGPCGTAVDHAVTIVGYGT-EGGIDY 303

Query: 306 WLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
           W+VKNSW TTWGE GYIR+ R++    G CGIA + SYP
Sbjct: 304 WIVKNSWDTTWGEEGYIRILRNVGGA-GTCGIATKPSYP 341


>gi|194352752|emb|CAQ00104.1| papain-like cysteine proteinase [Hordeum vulgare subsp. vulgare]
          Length = 351

 Score =  327 bits (839), Expect = 4e-87,   Method: Compositional matrix adjust.
 Identities = 170/316 (53%), Positives = 215/316 (68%), Gaps = 12/316 (3%)

Query: 35  MNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTN 94
           + E  E W+A++ + Y    EK  RF++FK+N++ I   N +  +  Y LG+NEFAD T+
Sbjct: 40  LVELFEKWLAKHQKAYASFEEKLHRFEVFKDNLKLIDEINREVTS--YWLGLNEFADLTH 97

Query: 95  EEFRAPRNGYKRRLPSVRSSETTDVSFRYENAS---VPASIDWRKKGAVTGVKDQGQCGC 151
           +EF+    G    L    +  ++  SFRYEN +   +P ++DWRKKGAVT VK+QGQCG 
Sbjct: 98  DEFKTTYLG----LSPPPARRSSSRSFRYENVAAHDLPKAVDWRKKGAVTDVKNQGQCGS 153

Query: 152 CWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLA 211
           CWAFS VAA+EGIN I T  LT+LSEQEL+DC   G + GC GG+MD AF +I S+ GL 
Sbjct: 154 CWAFSTVAAVEGINAIVTGNLTALSEQELIDCSVDG-NSGCNGGMMDYAFSYIASSGGLH 212

Query: 212 TEAKYPYKASDGSC-NKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSD 270
           TE  YPY   +GSC + K++   A  ISGYEDVP+ +E AL+KA+A+QPVSVAI+ASG  
Sbjct: 213 TEEAYPYLMEEGSCGDGKKSESEAVSISGYEDVPTKDEQALIKALAHQPVSVAIEASGRH 272

Query: 271 FQFYSSGVFTGQCGTELDHGVTAVGYGT-ADDGTKYWLVKNSWGTTWGENGYIRMQRDID 329
           FQFYS GVF G CG +LDHGV AVGYG+    G  Y +VKNSWG  WGE GYIRM+R   
Sbjct: 273 FQFYSGGVFDGPCGAQLDHGVAAVGYGSDKGKGHDYIIVKNSWGGKWGEKGYIRMKRGTG 332

Query: 330 AKEGLCGIAMQASYPT 345
             EGLCGI   ASYPT
Sbjct: 333 KSEGLCGINKMASYPT 348


>gi|242070333|ref|XP_002450443.1| hypothetical protein SORBIDRAFT_05g005530 [Sorghum bicolor]
 gi|241936286|gb|EES09431.1| hypothetical protein SORBIDRAFT_05g005530 [Sorghum bicolor]
          Length = 351

 Score =  327 bits (839), Expect = 4e-87,   Method: Compositional matrix adjust.
 Identities = 181/364 (49%), Positives = 238/364 (65%), Gaps = 31/364 (8%)

Query: 1   MAMILLENK-LVLAAILVLGVWAPQSW------SRTLNDAT------MNERHEMWMAQYG 47
           MA  +  NK L+ AA+ +L V A  +       +R L+ +T      M  RHE WM ++G
Sbjct: 1   MASYIANNKPLITAAVALLTVLAIANCIGCAVAARDLSSSTGYGEEAMTARHEKWMVEHG 60

Query: 48  RVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRR 107
           R Y+D AEK  RF++FK N  ++ + N  A  K Y L IN FAD T++EF A   G+K  
Sbjct: 61  RTYKDEAEKARRFQVFKANAAFVDTSNAAAGGKKYHLAINRFADMTHDEFMARYTGFKP- 119

Query: 108 LPSVRSSETTDVSFRYENASVPA----SIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEG 163
           LP+          F+Y N ++ +    ++DWRKKGAVT VK+Q +CGCCWAFSAVAA+EG
Sbjct: 120 LPATGKKMP---GFKYANVTLSSEDQQAVDWRKKGAVTDVKNQQKCGCCWAFSAVAAIEG 176

Query: 164 INHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDG 223
           ++ I T +L SLSEQ+LVDC T+G + GC GG M+DAF+++I N G+ATEA YPY A  G
Sbjct: 177 MHQINTGELVSLSEQQLVDCSTNGNNNGCGGGTMEDAFQYVIGNNGIATEAAYPYTAMQG 236

Query: 224 SCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTG-Q 282
            C  +   P+ A +  Y+ VP ++E AL  AVA QPVSVA+DA  ++FQFY  GV T   
Sbjct: 237 MC--QNVQPAVA-VRSYQQVPRDDEDALAAAVAGQPVSVAVDA--NNFQFYKGGVMTADS 291

Query: 283 CGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQAS 342
           CGT L+H VTAVGYGTA+DGT YWL+KN WG+TWGE GY+R+QR +    G CG+A  AS
Sbjct: 292 CGTNLNHAVTAVGYGTAEDGTPYWLLKNQWGSTWGEEGYLRLQRGV----GACGVAKDAS 347

Query: 343 YPTA 346
           YP A
Sbjct: 348 YPVA 351


>gi|226503129|ref|NP_001149806.1| LOC100283433 precursor [Zea mays]
 gi|195634783|gb|ACG36860.1| xylem cysteine proteinase 2 precursor [Zea mays]
 gi|219884977|gb|ACL52863.1| unknown [Zea mays]
          Length = 377

 Score =  327 bits (839), Expect = 4e-87,   Method: Compositional matrix adjust.
 Identities = 176/324 (54%), Positives = 215/324 (66%), Gaps = 20/324 (6%)

Query: 32  DATMNER----HEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGIN 87
           D T ++R     E W+A+Y + Y    EK  RF++FK+N+ +I   N K     Y LG+N
Sbjct: 61  DLTQHDRLVRLFEEWVAKYRKAYGSFEEKLRRFEVFKDNLHHIDEANRKEVTS-YWLGLN 119

Query: 88  EFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENASV-----PASIDWRKKGAVTG 142
            FAD T++EF+A    Y   LP      T+   FRY          PAS+DWRKKGAVT 
Sbjct: 120 AFADLTHDEFKAT---YLGLLPK----RTSGGRFRYGGVGDGGDEVPASVDWRKKGAVTE 172

Query: 143 VKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFE 202
           VK+QGQCG CWAFS VAA+EGIN I T  LTSLSEQ+LVDC T G + GC GG+MD+AF 
Sbjct: 173 VKNQGQCGSCWAFSTVAAVEGINQIVTGNLTSLSEQQLVDCSTDG-NNGCSGGVMDNAFS 231

Query: 203 FIISNKGLATEAKYPYKASDGSCNKKEANPSA-AKISGYEDVPSNNEAALMKAVANQPVS 261
           FI +  GL +E  YPY   +G C+ +  +      ISGYEDVP+N+E AL+KA+A+QPVS
Sbjct: 232 FIATGAGLRSEEAYPYLMEEGDCDDRARDGEVLVTISGYEDVPANDEQALVKALAHQPVS 291

Query: 262 VAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGY 321
           VAI+ASG  FQFYS GVF G CG+ELDHGV AVGYG++  G  Y +VKNSWGT WGE GY
Sbjct: 292 VAIEASGRHFQFYSGGVFDGPCGSELDHGVAAVGYGSS-KGQDYIIVKNSWGTHWGEKGY 350

Query: 322 IRMQRDIDAKEGLCGIAMQASYPT 345
           IRM+R     EGLCGI   ASYPT
Sbjct: 351 IRMKRGTGKPEGLCGINKMASYPT 374


>gi|302781881|ref|XP_002972714.1| hypothetical protein SELMODRAFT_98707 [Selaginella moellendorffii]
 gi|300159315|gb|EFJ25935.1| hypothetical protein SELMODRAFT_98707 [Selaginella moellendorffii]
          Length = 446

 Score =  327 bits (839), Expect = 4e-87,   Method: Compositional matrix adjust.
 Identities = 157/319 (49%), Positives = 214/319 (67%), Gaps = 7/319 (2%)

Query: 31  NDATMNERHEMWMAQYGR-VYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEF 89
           +D+ ++  +  W A++G+     N+  + RF+ FKEN  YI   +N+A    Y+LG+N+F
Sbjct: 5   SDSDLSGEYASWCAKFGKECASSNSLGDRRFETFKENFRYIEE-HNRAGKHSYRLGLNQF 63

Query: 90  ADQTNEEFRAPRNGYKRRL---PSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQ 146
           +D T+EEFR    G +  L   P ++    +D+   ++N  +PAS+DWRK GAVT  KDQ
Sbjct: 64  SDLTSEEFRQRFLGLRPDLIDSPVLKMPRDSDIEEGFQNVDLPASVDWRKHGAVTAPKDQ 123

Query: 147 GQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIIS 206
           G CG CWAF+   A+EGIN I T +L SLSEQEL+DCD    D+GC+GGLM++A++FI+ 
Sbjct: 124 GSCGGCWAFATTGAIEGINQIVTGQLMSLSEQELIDCDKKA-DKGCDGGLMENAYQFIVE 182

Query: 207 NKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDA 266
           N GL TE  YPY AS+  CN K+ N     I GYE +P  +E AL++AVA QPVSVAI+ 
Sbjct: 183 NGGLDTETDYPYHASESHCNMKKLNSRVVAIDGYEAIPDGDEQALLRAVAKQPVSVAIEG 242

Query: 267 SGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQR 326
           +  DFQ Y+SGVFTG CG E++HGV  VGYGT +DG  YW+VKNSW  TWG+ G+++MQR
Sbjct: 243 ASKDFQHYASGVFTGHCGEEINHGVLIVGYGT-EDGLDYWIVKNSWAATWGDGGFVKMQR 301

Query: 327 DIDAKEGLCGIAMQASYPT 345
           +   + GLC I   ASYP 
Sbjct: 302 NTGKRGGLCSINTLASYPV 320


>gi|226508570|ref|NP_001141984.1| uncharacterized protein LOC100274134 precursor [Zea mays]
 gi|194706676|gb|ACF87422.1| unknown [Zea mays]
 gi|413920745|gb|AFW60677.1| vignain [Zea mays]
          Length = 363

 Score =  327 bits (839), Expect = 4e-87,   Method: Compositional matrix adjust.
 Identities = 162/321 (50%), Positives = 220/321 (68%), Gaps = 15/321 (4%)

Query: 31  NDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFA 90
           ++A M  R++ WMAQY R Y+D+AEK  RF++FK N E+I   N   + K Y LG N+FA
Sbjct: 51  DEAMMMARYKKWMAQYRRKYKDDAEKAHRFQVFKANAEFIDRSNAGGKKK-YVLGTNQFA 109

Query: 91  DQTNEEFRAPRNGYKR--RLPSVRSSETTDVSFRYENASV---PASIDWRKKGAVTGVKD 145
           D T++EF A   G ++   +PS  + +      +Y+N +       +DWR++GAVT VK+
Sbjct: 110 DLTSKEFAAMYTGLRKPAAVPS-GAKQIPAAGSKYQNFTRLDDDVQVDWRQQGAVTPVKN 168

Query: 146 QGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFII 205
           QGQCGCCWAFSAV AMEG+  ITT  L SLSEQ+++DCD S  +QGC GG MD+AF+++I
Sbjct: 169 QGQCGCCWAFSAVGAMEGLIMITTGNLVSLSEQQILDCDESDGNQGCNGGYMDNAFQYVI 228

Query: 206 SNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAID 265
           +N G+ TE  YPY A  G+C   +    AA ISG++D+PS +E AL  AVANQPVSV +D
Sbjct: 229 NNGGVTTEDAYPYSAVQGTCQNVQ---PAATISGFQDLPSGDENALANAVANQPVSVGVD 285

Query: 266 ASGSDFQFYSSGVFTGQ-CGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRM 324
              S FQFY  G++ G  CGT+++H VTA+GYG  D GT+YW++KNSWGT WGENG++++
Sbjct: 286 GGSSPFQFYQGGIYDGDGCGTDMNHAVTAIGYGADDQGTQYWILKNSWGTGWGENGFMQL 345

Query: 325 QRDIDAKEGLCGIAMQASYPT 345
           Q  +    G CGI+  ASYPT
Sbjct: 346 QMGV----GACGISTMASYPT 362


>gi|13432122|sp|P80884.2|ANAN_ANACO RecName: Full=Ananain; Flags: Precursor
 gi|2623956|emb|CAA05487.1| Ananain precursor [Ananas comosus]
          Length = 345

 Score =  327 bits (838), Expect = 5e-87,   Method: Compositional matrix adjust.
 Identities = 170/343 (49%), Positives = 223/343 (65%), Gaps = 19/343 (5%)

Query: 9   KLVLAAILVLGVWA-PQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENV 67
           +LV   + +  +WA P + S       M ++ E WMA+YGRVY+DN EK +RF+IFK NV
Sbjct: 6   QLVFLFLFLCVMWASPSAASCDEPSDPMMKQFEEWMAEYGRVYKDNDEKMLRFQIFKNNV 65

Query: 68  EYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYK-----RRLPSVRSSETTDVSFR 122
            +I +FNN+  N  Y LGIN+F D TN EF A   G       +R P V S +  D+S  
Sbjct: 66  NHIETFNNRNGNS-YTLGINQFTDMTNNEFVAQYTGLSLPLNIKREPVV-SFDDVDIS-- 121

Query: 123 YENASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVD 182
               SVP SIDWR  GAVT VK+QG+CG CWAF+++A +E I  I    L SLSEQ+++D
Sbjct: 122 ----SVPQSIDWRDSGAVTSVKNQGRCGSCWAFASIATVESIYKIKRGNLVSLSEQQVLD 177

Query: 183 CDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYED 242
           C  S    GC+GG ++ A+ FIISNKG+A+ A YPYKA+ G+C K    P++A I+ Y  
Sbjct: 178 CAVS---YGCKGGWINKAYSFIISNKGVASAAIYPYKAAKGTC-KTNGVPNSAYITRYTY 233

Query: 243 VPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDG 302
           V  NNE  +M AV+NQP++ A+DASG +FQ Y  GVFTG CGT L+H +  +GYG    G
Sbjct: 234 VQRNNERNMMYAVSNQPIAAALDASG-NFQHYKRGVFTGPCGTRLNHAIVIIGYGQDSSG 292

Query: 303 TKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPT 345
            K+W+V+NSWG  WGE GYIR+ RD+ +  GLCGIAM   YPT
Sbjct: 293 KKFWIVRNSWGAGWGEGGYIRLARDVSSSFGLCGIAMDPLYPT 335


>gi|413942348|gb|AFW74997.1| Xylem cysteine proteinase 2 [Zea mays]
          Length = 391

 Score =  327 bits (838), Expect = 6e-87,   Method: Compositional matrix adjust.
 Identities = 176/324 (54%), Positives = 215/324 (66%), Gaps = 20/324 (6%)

Query: 32  DATMNER----HEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGIN 87
           D T ++R     E W+A+Y + Y    EK  RF++FK+N+ +I   N K     Y LG+N
Sbjct: 75  DLTQHDRLVRLFEEWVAKYRKAYGSFEEKLRRFEVFKDNLHHIDEANRKEVTS-YWLGLN 133

Query: 88  EFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENASV-----PASIDWRKKGAVTG 142
            FAD T++EF+A    Y   LP      T+   FRY          PAS+DWRKKGAVT 
Sbjct: 134 AFADLTHDEFKAT---YLGLLPK----RTSGGRFRYGGVGDGGDEVPASVDWRKKGAVTE 186

Query: 143 VKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFE 202
           VK+QGQCG CWAFS VAA+EGIN I T  LTSLSEQ+LVDC T G + GC GG+MD+AF 
Sbjct: 187 VKNQGQCGSCWAFSTVAAVEGINQIVTGNLTSLSEQQLVDCSTDG-NNGCSGGVMDNAFS 245

Query: 203 FIISNKGLATEAKYPYKASDGSCNKKEANPSA-AKISGYEDVPSNNEAALMKAVANQPVS 261
           FI +  GL +E  YPY   +G C+ +  +      ISGYEDVP+N+E AL+KA+A+QPVS
Sbjct: 246 FIATGAGLRSEEAYPYLMEEGDCDDRARDGEVLVTISGYEDVPANDEQALVKALAHQPVS 305

Query: 262 VAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGY 321
           VAI+ASG  FQFYS GVF G CG+ELDHGV AVGYG++  G  Y +VKNSWGT WGE GY
Sbjct: 306 VAIEASGRHFQFYSGGVFDGPCGSELDHGVAAVGYGSS-KGQDYIIVKNSWGTHWGEKGY 364

Query: 322 IRMQRDIDAKEGLCGIAMQASYPT 345
           IRM+R     EGLCGI   ASYPT
Sbjct: 365 IRMKRGTGKPEGLCGINKMASYPT 388


>gi|3377950|emb|CAA08861.1| cysteine proteinase precursor, AN11 [Ananas comosus]
          Length = 357

 Score =  327 bits (837), Expect = 6e-87,   Method: Compositional matrix adjust.
 Identities = 170/344 (49%), Positives = 225/344 (65%), Gaps = 19/344 (5%)

Query: 9   KLVLAAILVLGVWA-PQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENV 67
           +LV   + +  +WA P + SR      M +R E WMA+YGRVY+DN EK  RF+IFK NV
Sbjct: 6   QLVFLFLFLCVMWASPSAASRDEPSDPMMKRFEEWMAEYGRVYKDNDEKMRRFQIFKNNV 65

Query: 68  EYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRL----PSVRSSETTDVSFRY 123
            +I +FN++  N  Y LGIN+F D TN EF A   G    L      V S +  D+S   
Sbjct: 66  NHIETFNSRNGNS-YTLGINQFTDMTNNEFVAQYTGVSLPLNIEREPVVSFDDVDIS--- 121

Query: 124 ENASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDC 183
              +VP SIDWR  GAVT VK+   CG CWAF+A+A +E I  I    L SLSEQ+++DC
Sbjct: 122 ---AVPQSIDWRNYGAVTSVKNHIPCGSCWAFAAIATVESIYKIKRGYLISLSEQQVLDC 178

Query: 184 DTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDG--SCNKKEANPSAAKISGYE 241
             S    GC+GG ++ A++FIISNKG+A+ A YPYKAS G  +C +    P++A I+GY 
Sbjct: 179 AVS---YGCDGGWVNKAYDFIISNKGVASAAIYPYKASQGQGTC-RINGVPNSAYITGYT 234

Query: 242 DVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADD 301
            V SNNE ++M AV+NQP++ +I+ASG DFQ Y  GVF+G CGT L+H +T +GYG    
Sbjct: 235 RVQSNNERSMMYAVSNQPIAASIEASG-DFQHYKRGVFSGPCGTSLNHAITIIGYGQDSS 293

Query: 302 GTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPT 345
           G K+W+V+NSWG +WGE GYIRM RD+ +  GLCGIA++  YPT
Sbjct: 294 GKKFWIVRNSWGASWGERGYIRMARDVSSSSGLCGIAIRPLYPT 337


>gi|356521444|ref|XP_003529366.1| PREDICTED: thiol protease SEN102-like [Glycine max]
          Length = 340

 Score =  327 bits (837), Expect = 6e-87,   Method: Compositional matrix adjust.
 Identities = 169/344 (49%), Positives = 230/344 (66%), Gaps = 11/344 (3%)

Query: 4   ILLENKLVLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIF 63
           + L+ K V    ++      ++ SRTL+++++  +HE WMA + RVY D+AEK+ R +IF
Sbjct: 3   LTLDKKSVGTFFMLFLTCICRASSRTLSESSIATQHEEWMAMHDRVYADSAEKDRRQQIF 62

Query: 64  KENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRY 123
           KEN+E+I   NN+ + K Y L +N FAD TNEEF A   G   + P+   S   + S  +
Sbjct: 63  KENLEFIEKHNNEGK-KRYNLSLNSFADLTNEEFVASHTGALYKPPTQLGSFKINHSLGF 121

Query: 124 ENASV---PASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQEL 180
              SV    AS+DWRK+GAV  +K+QG+CG CWAFSAVAA+EGIN I   +L SLSEQ L
Sbjct: 122 HKMSVGDIEASLDWRKRGAVNDIKNQGRCGSCWAFSAVAAVEGINQIKNGQLVSLSEQNL 181

Query: 181 VDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGY 240
           VDC +   + GC G  ++ AF++I  + GLA E +YPY  + G+C+   +NP A +I GY
Sbjct: 182 VDCAS---NDGCHGQYVEKAFDYI-RDYGLANEEEYPYVETVGTCSGN-SNP-AIQIRGY 235

Query: 241 EDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTAD 300
           + V   NE  L+ AVA+QPVSV ++A G  FQFYS GVF+G+CGTEL+H VT VGYG   
Sbjct: 236 QSVTPQNEEQLLTAVASQPVSVLLEAKGQGFQFYSGGVFSGECGTELNHAVTIVGYGEEA 295

Query: 301 DGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
           +G KYWL++NSWG +WGE GY+++ RD    +GLCGI MQASYP
Sbjct: 296 EG-KYWLIRNSWGKSWGEGGYMKLMRDTGNPQGLCGINMQASYP 338


>gi|146215986|gb|ABQ10195.1| actinidin Act2d [Actinidia eriantha]
          Length = 381

 Score =  327 bits (837), Expect = 6e-87,   Method: Compositional matrix adjust.
 Identities = 173/348 (49%), Positives = 217/348 (62%), Gaps = 19/348 (5%)

Query: 1   MAMILLENKLVLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRF 60
           M+++     L+L++ L +      S  RT ND  M   +E W+ + G+ Y    EKEMRF
Sbjct: 12  MSLLFFSTLLILSSALDI----KNSVQRT-NDQVM-AMYESWLVEQGKSYNSLDEKEMRF 65

Query: 61  KIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVS 120
           +IFKEN+  I   N  A N+ Y LG+N FAD T+EE+R+   G+K       S     VS
Sbjct: 66  EIFKENLRIIDDHNADA-NRSYSLGLNRFADLTDEEYRSTYLGFK-------SGPKAKVS 117

Query: 121 FRYE---NASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSE 177
            RY       +P  +DWR  GAV GVKDQG C  CWAFSAVAA+EGIN I T  L SLSE
Sbjct: 118 NRYVPKVGVVLPNYVDWRTVGAVVGVKDQGLCSSCWAFSAVAAVEGINKIVTGNLISLSE 177

Query: 178 QELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKI 237
           QELVDC  +   +GC  G M+DAF+FII N G+ TE  YPY A DG C+    N     I
Sbjct: 178 QELVDCGRTQRTRGCNRGYMNDAFQFIIDNGGINTEDNYPYTAQDGQCDWYRKNQRYVTI 237

Query: 238 SGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYG 297
             YE +P+NNE  L  AVA QP++V +++ G  F+ Y+SG++TG CGT +DHGVT VGYG
Sbjct: 238 DNYEQLPANNEWVLQNAVAYQPITVGLESEGGKFKLYTSGIYTGYCGTAIDHGVTIVGYG 297

Query: 298 TADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPT 345
           T + G  YW+VKNSWGT WGENGYIR+QR+I    G CGIAM  SYP 
Sbjct: 298 T-ERGLDYWIVKNSWGTNWGENGYIRIQRNIGGA-GKCGIAMVPSYPV 343


>gi|226502454|ref|NP_001140922.1| hypothetical protein [Zea mays]
 gi|223948637|gb|ACN28402.1| unknown [Zea mays]
 gi|413920877|gb|AFW60809.1| hypothetical protein ZEAMMB73_830238 [Zea mays]
          Length = 354

 Score =  327 bits (837), Expect = 7e-87,   Method: Compositional matrix adjust.
 Identities = 183/368 (49%), Positives = 239/368 (64%), Gaps = 36/368 (9%)

Query: 1   MAMILLENKLVLA----AILVLGVWAPQSWSRTLNDAT--------MNERHEMWMAQYGR 48
           MA  ++ NK V+A    A+ +L V    + +R L+  +        M  RH+ WMA++GR
Sbjct: 1   MAPYIVVNKTVIAFTAVALTILAVKTMMAEARDLSSTSTGGYGEEAMKVRHQQWMAEHGR 60

Query: 49  VYRDNAEKEMRFKIFKENVEYIASFNNKARNK-PYKLGINEFADQTNEEFRAPRNGYKRR 107
            YRD AEK  RF++FK N +++ + N    +K  Y++ +NEFAD TN+EF A   G    
Sbjct: 61  TYRDEAEKAHRFQVFKANADFVDASNAAGDDKKSYRMELNEFADMTNDEFMAMYTG---- 116

Query: 108 LPSVRSSETTDVSFRYENASVP------ASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAM 161
           L  V +       F+Y N ++        ++DWR+KGAVTG+K+QGQCGCCWAF+AVAA+
Sbjct: 117 LRPVPAGAKKMAGFKYGNVTLSDADDNQQTVDWRQKGAVTGIKNQGQCGCCWAFAAVAAV 176

Query: 162 EGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKAS 221
           EGI+ ITT  L SLSEQ+++DCDT G + GC GG +D+AF++I  N GLATE  YPY A+
Sbjct: 177 EGIHQITTGNLVSLSEQQVLDCDTEGNN-GCNGGYIDNAFQYIAGNGGLATEDAYPYTAA 235

Query: 222 DGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFT- 280
              C  +   P AA ISGY+DVPS +EAAL  AVANQPVSVAIDA   +FQ Y  GV T 
Sbjct: 236 QAMC--QSVQPVAA-ISGYQDVPSGDEAALAAAVANQPVSVAIDA--HNFQLYGGGVMTA 290

Query: 281 GQCGT--ELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIA 338
             C T   L+H VTAVGYGTA+DGT YWL+KN WG  WGE GY+R++R  +A    CG+A
Sbjct: 291 ASCSTPPNLNHAVTAVGYGTAEDGTPYWLLKNQWGQNWGEGGYLRLERGANA----CGVA 346

Query: 339 MQASYPTA 346
            QASYP A
Sbjct: 347 QQASYPVA 354


>gi|38345906|emb|CAE04498.2| OSJNBb0059K02.8 [Oryza sativa Japonica Group]
          Length = 458

 Score =  327 bits (837), Expect = 7e-87,   Method: Compositional matrix adjust.
 Identities = 163/305 (53%), Positives = 208/305 (68%), Gaps = 6/305 (1%)

Query: 42  WMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARN--KPYKLGINEFADQTNEEFRA 99
           W A++G+ Y    E+E R+  F++N+ YI   N  A      ++LG+N FAD TNEE+R 
Sbjct: 43  WKAEHGKSYNAVGEEERRYAAFRDNLRYIDEHNAAADAGVHSFRLGLNRFADLTNEEYRD 102

Query: 100 PRNGYKRRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVA 159
              G + +    R  + +D     +N ++P S+DWR KGAV  +KDQ   G CWAFSA+A
Sbjct: 103 TYLGLRNK--PRRERKVSDRYLAADNEALPESVDWRTKGAVAEIKDQEVAGSCWAFSAIA 160

Query: 160 AMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYK 219
           A+EGIN I T  L SLSEQELVDCDTS  ++GC GGLMD AF+FII+N G+ TE  YPYK
Sbjct: 161 AVEGINQIVTGDLISLSEQELVDCDTS-YNEGCNGGLMDYAFDFIINNGGIDTEDDYPYK 219

Query: 220 ASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVF 279
             D  C+    N     I  YEDV  N+E +L KAVANQPVSVAI+A G  FQ YSSG+F
Sbjct: 220 GKDERCDVNRKNAKVVTIDSYEDVTPNSETSLQKAVANQPVSVAIEAGGRAFQLYSSGIF 279

Query: 280 TGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAM 339
           TG+CGT LDHGV AVGYGT ++G  YW+V+NSWG +WGE+GY+RM+R+I A  G CGIA+
Sbjct: 280 TGKCGTALDHGVAAVGYGT-ENGKDYWIVRNSWGKSWGESGYVRMERNIKASSGKCGIAV 338

Query: 340 QASYP 344
           + SYP
Sbjct: 339 EPSYP 343


>gi|302142276|emb|CBI19479.3| unnamed protein product [Vitis vinifera]
          Length = 388

 Score =  326 bits (836), Expect = 8e-87,   Method: Compositional matrix adjust.
 Identities = 169/307 (55%), Positives = 208/307 (67%), Gaps = 37/307 (12%)

Query: 39  HEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFR 98
           +E W+A++G+ Y    EKE RF+IFK+N+ +I   N  A N+ YK+              
Sbjct: 4   YEAWLAKHGKSYNALGEKERRFQIFKDNLRFIDEHN--AENRTYKI-------------- 47

Query: 99  APRNGYKRRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAV 158
                            +   +FR  + S+P S+DWRKKGAV  VKDQG CG CWAFS +
Sbjct: 48  -----------------SDRYAFRVGD-SLPESVDWRKKGAVVEVKDQGSCGSCWAFSTI 89

Query: 159 AAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPY 218
           AA+EGIN I T  L SLSEQELVDCDTS  ++GC GGLMD AFEFII+N G+ +E  YPY
Sbjct: 90  AAVEGINKIVTGGLISLSEQELVDCDTS-YNEGCNGGLMDYAFEFIINNGGIDSEEDYPY 148

Query: 219 KASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGV 278
           KASDG C++   N     I GYEDVP N+E +L KAVANQPVSVAI+A G +FQ Y SG+
Sbjct: 149 KASDGRCDQYRKNAKVVTIDGYEDVPENDEKSLEKAVANQPVSVAIEAGGREFQLYQSGI 208

Query: 279 FTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDI-DAKEGLCGI 337
           FTG+CGT LDHGVTAVGYGT ++G  YW+VKNSWG +WGE GYIRM+RD+  +  G CGI
Sbjct: 209 FTGRCGTALDHGVTAVGYGT-ENGVDYWIVKNSWGASWGEEGYIRMERDLATSATGKCGI 267

Query: 338 AMQASYP 344
           AM+ASYP
Sbjct: 268 AMEASYP 274


>gi|146215984|gb|ABQ10194.1| actinidin Act2c [Actinidia arguta]
          Length = 378

 Score =  326 bits (836), Expect = 9e-87,   Method: Compositional matrix adjust.
 Identities = 175/348 (50%), Positives = 221/348 (63%), Gaps = 19/348 (5%)

Query: 1   MAMILLENKLVLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRF 60
           M+++     L+L++ L +      S  RT ND  + + +E W+ + G+ Y    EKEMRF
Sbjct: 10  MSLLFFSTLLILSSALDI----VNSAQRT-NDQ-VRDMYESWLVEQGKSYNSLDEKEMRF 63

Query: 61  KIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVS 120
           +IFK+N+  I   N  A N+ + LG+N FAD T+EE+R+   G+K       S     VS
Sbjct: 64  EIFKDNLRIIDDHNADA-NRSFSLGLNRFADLTDEEYRSTYLGFK-------SGPKAKVS 115

Query: 121 FRYE---NASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSE 177
            RY       +P  +DWR  GAV GVK+QG C  CWAFSAVAA+EGIN I T  L SLSE
Sbjct: 116 NRYVPKVGDVLPNYVDWRTVGAVVGVKNQGLCSSCWAFSAVAAVEGINKIMTGNLLSLSE 175

Query: 178 QELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKI 237
           QELVDC  +   +GC  G M DAF+FII+N G+ TE  YPY A DG CN+   N     I
Sbjct: 176 QELVDCGRTQSTRGCNRGYMTDAFQFIINNGGINTEDNYPYTAQDGQCNRYLQNQKYVTI 235

Query: 238 SGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYG 297
             YE+VPSNNE AL  AVA+QPVSV +++ G  F+ Y+SG+FT  CGT +DHGVT VGYG
Sbjct: 236 DDYENVPSNNEWALQNAVAHQPVSVGLESEGGKFKLYTSGIFTQYCGTAIDHGVTIVGYG 295

Query: 298 TADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPT 345
           T + G  YW+VKNSWGT WGENGYIR+QR+I    G CGIA  ASYP 
Sbjct: 296 T-ERGLDYWIVKNSWGTNWGENGYIRIQRNIGGA-GKCGIARMASYPV 341


>gi|212275830|ref|NP_001130503.1| cysteine protease 1 [Zea mays]
 gi|194689328|gb|ACF78748.1| unknown [Zea mays]
 gi|219886279|gb|ACL53514.1| unknown [Zea mays]
 gi|238010470|gb|ACR36270.1| unknown [Zea mays]
 gi|413920875|gb|AFW60807.1| cysteine protease 1 [Zea mays]
          Length = 354

 Score =  326 bits (836), Expect = 9e-87,   Method: Compositional matrix adjust.
 Identities = 182/368 (49%), Positives = 238/368 (64%), Gaps = 36/368 (9%)

Query: 1   MAMILLENKLVL----AAILVLGVWAPQSWSRTLNDAT--------MNERHEMWMAQYGR 48
           MA  ++ NK V+     A+ +L V    + +R L+  +        M  RH+ WMA++GR
Sbjct: 1   MAPHIVVNKTVITFTAVALTILAVTTMMAEARDLSSTSTGGYGEEAMKVRHQQWMAEHGR 60

Query: 49  VYRDNAEKEMRFKIFKENVEYIASFNNKARNK-PYKLGINEFADQTNEEFRAPRNGYKRR 107
            YRD AEK  RF++FK N +++ + N    +K  Y+L +NEFAD TN+EF A   G    
Sbjct: 61  TYRDEAEKAHRFQVFKANADFVDASNAAGDDKKSYRLELNEFADMTNDEFMAMYTG---- 116

Query: 108 LPSVRSSETTDVSFRYENASVP------ASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAM 161
           L  V +       F+Y N ++        ++DWR+KGAVTG+K+QGQCGCCWAF+AVAA+
Sbjct: 117 LRPVPAGAKKMAGFKYGNVTLSDADDDQQTVDWRQKGAVTGIKNQGQCGCCWAFAAVAAV 176

Query: 162 EGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKAS 221
           EGI+ ITT  L SLSEQ+++DCDT G + GC GG +D+AF++I+ N GL TE  YPY A+
Sbjct: 177 EGIHQITTGNLVSLSEQQVLDCDTDGNN-GCNGGYIDNAFQYIVGNGGLGTEDAYPYTAA 235

Query: 222 DGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFT- 280
              C  +   P AA ISGY+DVPS +EAAL  AVANQPVSVAIDA   +FQ Y  GV T 
Sbjct: 236 QAMC--QSVQPVAA-ISGYQDVPSGDEAALAAAVANQPVSVAIDA--HNFQLYGGGVMTA 290

Query: 281 GQCGT--ELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIA 338
             C T   L+H VTAVGYGTA+DGT YWL+KN WG  WGE GY+R++R  +A    CG+A
Sbjct: 291 ASCSTPPNLNHAVTAVGYGTAEDGTPYWLLKNQWGQNWGEGGYLRLERGANA----CGVA 346

Query: 339 MQASYPTA 346
            QASYP A
Sbjct: 347 QQASYPVA 354


>gi|3377948|emb|CAA08860.1| cysteine proteinase precursor, AN8 [Ananas comosus]
          Length = 356

 Score =  326 bits (835), Expect = 1e-86,   Method: Compositional matrix adjust.
 Identities = 168/339 (49%), Positives = 225/339 (66%), Gaps = 10/339 (2%)

Query: 9   KLVLAAILVLGVWA-PQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENV 67
           +LV   + +  +WA P + S       M +R E WM +YGRVY+DN EK  RF+IFK NV
Sbjct: 6   QLVFLFLFLCVMWASPSAASADEPSDPMMKRFEEWMVEYGRVYKDNDEKMRRFQIFKNNV 65

Query: 68  EYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFR-YENA 126
            +I +FN++ ++  Y LGIN+F D TN EF A   G   R  ++       VSF   + +
Sbjct: 66  NHIETFNSRNKDS-YTLGINQFTDMTNNEFVAQYTGGISRPLNIEREPV--VSFDDVDIS 122

Query: 127 SVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTS 186
           +VP SIDWR  GAVT VK+Q  CG CWAF+A+A +E I  I    L  LSEQ+++DC   
Sbjct: 123 AVPQSIDWRDYGAVTSVKNQNPCGACWAFAAIATVESIYKIKKGILEPLSEQQVLDC--- 179

Query: 187 GEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSN 246
            +  GC+GG    AFEFIISNKG+A+ A YPYKA+ G+C K    P++A I+GY  VP N
Sbjct: 180 AKGYGCKGGWEFRAFEFIISNKGVASVAIYPYKAAKGTC-KTNGVPNSAYITGYARVPRN 238

Query: 247 NEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYW 306
           NE+++M AV+ QP++VA+DA+ +  Q+Y+SGVF G CGT L+H VTA+GYG   +G KYW
Sbjct: 239 NESSMMYAVSKQPITVAVDANANS-QYYNSGVFNGPCGTSLNHAVTAIGYGQDSNGKKYW 297

Query: 307 LVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPT 345
           +VKNSWG  WGE GYIRM RD+ +  G+CGIA+ + YPT
Sbjct: 298 IVKNSWGARWGEAGYIRMARDVSSSSGICGIAIDSLYPT 336


>gi|297845064|ref|XP_002890413.1| hypothetical protein ARALYDRAFT_472321 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297336255|gb|EFH66672.1| hypothetical protein ARALYDRAFT_472321 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 357

 Score =  325 bits (834), Expect = 1e-86,   Method: Compositional matrix adjust.
 Identities = 170/342 (49%), Positives = 223/342 (65%), Gaps = 14/342 (4%)

Query: 13  AAILVLGVWAPQSWSRTLNDATMNERH-------EMWMAQYGRVYRDNAEKEMRFKIFKE 65
           AA L L V A   +S         E H       E W++ + + Y    EK +RF++FK+
Sbjct: 18  AATLSLSVAASHDYSIVGYSPEDLESHDKLIELFENWISNFEKAYETVEEKLLRFEVFKD 77

Query: 66  NVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYEN 125
           N+++I   N K   K Y LG+NEFAD ++EEF+    G K  +   R  E +   F Y +
Sbjct: 78  NLKHIDETNKKV--KSYWLGLNEFADLSHEEFKKMYLGLKTDIVR-RDEERSYAEFAYRD 134

Query: 126 A-SVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCD 184
             +VP S+DWRKKGAV  VK+QG CG CWAFS VAA+EGIN I T  LT+LSEQEL+DCD
Sbjct: 135 VEAVPKSVDWRKKGAVAEVKNQGSCGSCWAFSTVAAVEGINKIVTGNLTTLSEQELIDCD 194

Query: 185 TSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVP 244
           T+  + GC GGLMD AFE+I+ N GL  E  YPY   +G+C  ++       I G++DVP
Sbjct: 195 TT-YNNGCNGGLMDYAFEYIVKNGGLRKEEDYPYSMEEGTCEMQKDESETVTIDGHQDVP 253

Query: 245 SNNEAALMKAVANQPVSVAIDASGSDFQFYSS-GVFTGQCGTELDHGVTAVGYGTADDGT 303
           +N+E +L+KA+A+QP+SVAIDASG +FQFYS   VF G+CG +LDHGV AVGYG++  G+
Sbjct: 254 TNDEKSLLKALAHQPLSVAIDASGREFQFYSGVSVFDGRCGVDLDHGVAAVGYGSS-KGS 312

Query: 304 KYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPT 345
            Y +VKNSWG  WGE GYIR++R+    EGLCGI   AS+PT
Sbjct: 313 DYIIVKNSWGPKWGEKGYIRLKRNTGKPEGLCGINKMASFPT 354


>gi|359359120|gb|AEV41026.1| putative cysteine protease [Oryza minuta]
          Length = 464

 Score =  325 bits (834), Expect = 1e-86,   Method: Compositional matrix adjust.
 Identities = 165/295 (55%), Positives = 208/295 (70%), Gaps = 8/295 (2%)

Query: 54  AEKEMRFKIFKENVEYIASFNNKA-RNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVR 112
            E E RF++F +N++++ + N  A  +  ++LG+N FAD TN+EFRA    Y    P+ R
Sbjct: 85  GEYERRFRVFWDNLKFVDAHNAHADEHGGFRLGMNRFADLTNDEFRA---AYLGTTPAGR 141

Query: 113 SSETTDVSFRYENA-SVPASIDWRKKGAVTG-VKDQGQCGCCWAFSAVAAMEGINHITTR 170
                ++ +R++   ++P S+DWR KGAV   VK+QGQCG CWAFSAVAA+EGIN I T 
Sbjct: 142 GRHVGEM-YRHDGVEALPDSVDWRDKGAVVSPVKNQGQCGSCWAFSAVAAVEGINKIVTG 200

Query: 171 KLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEA 230
           +L SLSEQELV+C  +  + GC GG+MDDAF FI  N GL TE  YPY A DG C+  + 
Sbjct: 201 ELVSLSEQELVECARNRGNSGCNGGIMDDAFAFITRNGGLDTEEDYPYTAMDGKCDLAKK 260

Query: 231 NPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHG 290
           +     I G+EDVP N+E +L KAVA+QPVSVAIDA G +FQ Y SGVFTG+CGT LDHG
Sbjct: 261 SRKVVSIDGFEDVPENDELSLQKAVAHQPVSVAIDAGGREFQLYDSGVFTGRCGTSLDHG 320

Query: 291 VTAVGYGT-ADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
           V AVGYGT A  GT YW V+NSWG  WGENGYIRM+R++ A+ G CGIAM ASYP
Sbjct: 321 VVAVGYGTDAATGTDYWTVRNSWGPDWGENGYIRMERNVTARTGKCGIAMMASYP 375


>gi|363814535|ref|NP_001242660.1| uncharacterized protein LOC100807362 precursor [Glycine max]
 gi|255636658|gb|ACU18666.1| unknown [Glycine max]
          Length = 367

 Score =  325 bits (834), Expect = 2e-86,   Method: Compositional matrix adjust.
 Identities = 162/307 (52%), Positives = 217/307 (70%), Gaps = 6/307 (1%)

Query: 39  HEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFR 98
           +E W+ ++G+VY    EKE RF+IFK+N+ +I   N  A N+ YK+G+N F+D +NEE+R
Sbjct: 52  YEEWLVKHGKVYNAVEEKEKRFQIFKDNLNFIEEHN--AVNRTYKVGLNRFSDLSNEEYR 109

Query: 99  APRNGYKRRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAV 158
           +   G K     + +  +   S R  + ++P S+DWRK+GAV  VK+Q +C  CWAFSA+
Sbjct: 110 SKYLGTKIDPSRMMARPSRRYSPRVAD-NLPESVDWRKEGAVVRVKNQSECEGCWAFSAI 168

Query: 159 AAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPY 218
           AA+EGIN I T  LT+LSEQEL+DCD +  + GC GGL+D AFEFII+N G+ TE  YP+
Sbjct: 169 AAVEGINKIVTGNLTALSEQELLDCDRT-VNAGCSGGLVDYAFEFIINNGGIDTEEDYPF 227

Query: 219 KASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGV 278
           + +DG C++ + N  A  I GYE VP+ +E AL KAVANQPVSVAI+A G +FQ Y SG+
Sbjct: 228 QGADGICDQYKINARAVTIDGYERVPAYDELALKKAVANQPVSVAIEAYGKEFQLYESGI 287

Query: 279 FTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDI-DAKEGLCGI 337
           FTG CGT +DHGVTAVGYGT ++G  YW+VKNSWG  WGE GY+ M+R+I +   G CGI
Sbjct: 288 FTGTCGTSIDHGVTAVGYGT-ENGIDYWIVKNSWGENWGEAGYVGMERNIAEDTAGKCGI 346

Query: 338 AMQASYP 344
           A+   YP
Sbjct: 347 AILTLYP 353


>gi|223946391|gb|ACN27279.1| unknown [Zea mays]
          Length = 279

 Score =  325 bits (833), Expect = 2e-86,   Method: Compositional matrix adjust.
 Identities = 162/258 (62%), Positives = 188/258 (72%), Gaps = 5/258 (1%)

Query: 90  ADQTNEEFRAPRNGYKRRLPSVRS-SETTDVSFRYENA-SVPASIDWRKKGAVTGVKDQG 147
           AD+    +   R  + R     R  S  +  SF Y +A  VPAS+DWR+KGAVT VKDQG
Sbjct: 3   ADEFRRHYAGSRVAHHRMFRGDRQGSSASASSFMYADARDVPASVDWRQKGAVTDVKDQG 62

Query: 148 QCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISN 207
           QCG CWAFS +AA+EGIN I T+ LTSLSEQ+LVDCDT   + GC GGLMD AF++I  +
Sbjct: 63  QCGSCWAFSTIAAVEGINAIKTKNLTSLSEQQLVDCDTKA-NAGCNGGLMDYAFQYIAKH 121

Query: 208 KGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDAS 267
            G+A E  YPY+A   SC K  A      I GYEDVP+N+E+AL KAVA+QPVSVAI+AS
Sbjct: 122 GGVAAEDAYPYRARQASCKKSPA--PVVTIDGYEDVPANDESALKKAVAHQPVSVAIEAS 179

Query: 268 GSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRD 327
           GS FQFYS GVF+G+CGTELDHGV AVGYG   DGTKYWLVKNSWG  WGE GYIRM RD
Sbjct: 180 GSHFQFYSEGVFSGRCGTELDHGVAAVGYGVTADGTKYWLVKNSWGPEWGEKGYIRMARD 239

Query: 328 IDAKEGLCGIAMQASYPT 345
           + AKEG CGIAM+ASYP 
Sbjct: 240 VAAKEGHCGIAMEASYPV 257


>gi|222629922|gb|EEE62054.1| hypothetical protein OsJ_16838 [Oryza sativa Japonica Group]
          Length = 336

 Score =  325 bits (833), Expect = 2e-86,   Method: Compositional matrix adjust.
 Identities = 171/309 (55%), Positives = 205/309 (66%), Gaps = 8/309 (2%)

Query: 40  EMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRA 99
           E  +  Y + Y    EK  RF++FK+N+ +I   N K  +  Y LG+NEFAD T++EF+A
Sbjct: 30  EFSIVGYRKAYASFEEKVRRFEVFKDNLNHIDDINKKVTS--YWLGLNEFADLTHDEFKA 87

Query: 100 PRNGYKRRLPSVRSSETTDVSFRY---ENASVPASIDWRKKGAVTGVKDQGQCGCCWAFS 156
              G         S   +   FRY    N  VP  +DWRKK AVT VK+QGQCG CWAFS
Sbjct: 88  TYLGLTPPPTRSNSKHYSSEEFRYGKMSNGEVPKEMDWRKKNAVTEVKNQGQCGSCWAFS 147

Query: 157 AVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKY 216
            VAA+EGIN I T  LTSLSEQEL+DC T G + GC GGLMD AF +I S  GL TE  Y
Sbjct: 148 TVAAVEGINAIVTGNLTSLSEQELIDCSTDG-NNGCNGGLMDYAFSYIASTGGLRTEEAY 206

Query: 217 PYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSS 276
           PY   +G C++ +   +   ISGYEDVP+N+E AL+KA+A+QPVSVAI+ASG  FQFYS 
Sbjct: 207 PYAMEEGDCDEGKG-AAVVTISGYEDVPANDEQALVKALAHQPVSVAIEASGRHFQFYSG 265

Query: 277 GVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCG 336
           GVF G CG +LDHGVTAVGYGT+  G  Y +VKNSWG  WGE GYIRM+R     EGLCG
Sbjct: 266 GVFDGPCGEQLDHGVTAVGYGTS-KGQDYIIVKNSWGPHWGEKGYIRMKRGTGKGEGLCG 324

Query: 337 IAMQASYPT 345
           I   ASYPT
Sbjct: 325 INKMASYPT 333


>gi|255635584|gb|ACU18142.1| unknown [Glycine max]
          Length = 345

 Score =  325 bits (832), Expect = 2e-86,   Method: Compositional matrix adjust.
 Identities = 164/308 (53%), Positives = 208/308 (67%), Gaps = 8/308 (2%)

Query: 37  ERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEE 96
           E  E WM+++G++Y+   EK +RF+IFK+N+++I   N    N  Y LG+NEFAD +++E
Sbjct: 45  ELFESWMSKHGKIYQSIEEKLLRFEIFKDNLKHIDERNKVVSN--YWLGLNEFADLSHQE 102

Query: 97  FRAPRNGYKRRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQCGCCWAFS 156
           F+    G K      R S      F Y++  +P S+DWRKKGAV  VK+QG CG CWAFS
Sbjct: 103 FKNKYLGLKVDYSRRRESPE---EFTYKDVELPKSVDWRKKGAVAPVKNQGSCGSCWAFS 159

Query: 157 AVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKY 216
            VAA+EGIN I T  LTSLSEQEL+DCD +  + GC GGLMD AF FI+ N GL  E  Y
Sbjct: 160 TVAAVEGINQIVTGNLTSLSEQELIDCDRTYSN-GCNGGLMDYAFSFIVENGGLHKEEDY 218

Query: 217 PYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSS 276
           PY   +G+C   +       ISGY DVP NNE +L+KA+ANQ +SVAI+ASG DFQFYS 
Sbjct: 219 PYIMEEGTCEMTKEETEVVTISGYHDVPQNNEQSLLKALANQSLSVAIEASGRDFQFYSG 278

Query: 277 GVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCG 336
           GVF G CG++LDHGV AVGYGTA  G  Y +VKNSWG+ WGE GYIRM+  ++ +  L  
Sbjct: 279 GVFDGHCGSDLDHGVAAVGYGTA-KGVDYIIVKNSWGSKWGEKGYIRMRGTLETRGNLRY 337

Query: 337 IAMQASYP 344
           + M ASYP
Sbjct: 338 LQM-ASYP 344


>gi|317106666|dbj|BAJ53169.1| JHL18I08.3 [Jatropha curcas]
          Length = 441

 Score =  325 bits (832), Expect = 2e-86,   Method: Compositional matrix adjust.
 Identities = 159/305 (52%), Positives = 199/305 (65%), Gaps = 3/305 (0%)

Query: 40  EMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRA 99
           E W  Q+G+ Y    EK  R K+F++N +++   N++  N  Y L +N FAD T+ EF+A
Sbjct: 31  ETWCQQHGKTYASQEEKLFRLKVFQDNYDFVTEHNSQG-NSSYTLSLNAFADLTHHEFKA 89

Query: 100 PRNGYKRRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVA 159
            R G      +  + + ++       A VPAS+DWRK GAVT VKDQG CG CW+FSA  
Sbjct: 90  SRLGLSSAASASLNVDRSNRQIPDFVADVPASVDWRKNGAVTQVKDQGNCGACWSFSATG 149

Query: 160 AMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYK 219
           A+EGIN I T  L SLSEQELVDCD S  + GCEGG+MD AF+F+I N G+ TE  YPY+
Sbjct: 150 AIEGINKIVTGSLVSLSEQELVDCDKS-YNNGCEGGIMDYAFQFVIDNHGIDTEEDYPYQ 208

Query: 220 ASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVF 279
             D SCNK++       I GY DVP NNE  L+KAVANQPVSV I  S   FQ YS G+F
Sbjct: 209 GRDRSCNKEKLKRHVVTIDGYVDVPQNNEKELLKAVANQPVSVGICGSERAFQLYSKGIF 268

Query: 280 TGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAM 339
           TG C T LDH V  VGYG+ ++G  YW+VKNSWG+ WG +GY+ MQR+  +  GLCGI M
Sbjct: 269 TGPCSTSLDHAVLIVGYGS-ENGVDYWIVKNSWGSYWGMDGYMHMQRNSGSSRGLCGINM 327

Query: 340 QASYP 344
            ASYP
Sbjct: 328 LASYP 332


>gi|357143305|ref|XP_003572875.1| PREDICTED: xylem cysteine proteinase 1-like [Brachypodium
           distachyon]
          Length = 473

 Score =  325 bits (832), Expect = 3e-86,   Method: Compositional matrix adjust.
 Identities = 159/305 (52%), Positives = 209/305 (68%), Gaps = 8/305 (2%)

Query: 42  WMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPR 101
           W  ++ ++Y    EK  R+++FK+N+++I   N   RN  Y LG+N+FAD  +EEF++  
Sbjct: 51  WSVKHSKIYVSPEEKVKRYEVFKQNLKHIVETNR--RNGSYWLGLNQFADVAHEEFKSTY 108

Query: 102 NGYKRRLPSVRSSETTDVSFRYENA-SVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAA 160
            G K  +     + T   +FRYEN+ ++P S+DWRKKGAVT VK+QG+CG CWAFS VAA
Sbjct: 109 LGLKTGMDGPARAPT---AFRYENSVNLPWSVDWRKKGAVTPVKNQGECGSCWAFSTVAA 165

Query: 161 MEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKA 220
           +EGIN I T KL SLSEQEL+DCDT+  D GC GG MD AF +I+ N G+ T+  YPY  
Sbjct: 166 VEGINQIATGKLESLSEQELMDCDTT-FDHGCGGGFMDFAFAYIMGNLGIHTDDDYPYLM 224

Query: 221 SDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFT 280
            +G C +K+       ISGYEDVP N+E +L+KA+A+QP+SV I A   DFQFY  GVF 
Sbjct: 225 EEGYCKEKQPQSKVVTISGYEDVPENSEVSLLKALAHQPISVGIAAGSKDFQFYKRGVFE 284

Query: 281 GQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQ 340
           G CGTELDH +TAVGYG++ DG  Y ++KNSWG +WGE GY R++R     EG+C I   
Sbjct: 285 GSCGTELDHALTAVGYGSS-DGQDYIIMKNSWGKSWGEQGYFRIKRGTGKPEGVCSIYSM 343

Query: 341 ASYPT 345
           ASYPT
Sbjct: 344 ASYPT 348


>gi|90399361|emb|CAJ86180.1| H0212B02.7 [Oryza sativa Indica Group]
          Length = 470

 Score =  325 bits (832), Expect = 3e-86,   Method: Compositional matrix adjust.
 Identities = 165/317 (52%), Positives = 211/317 (66%), Gaps = 18/317 (5%)

Query: 42  WMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARN--KPYKLGINEFADQTNEEFRA 99
           W A++G+ Y    E+E R+  F++N+ YI   N  A      ++LG+N FAD TNEE+R 
Sbjct: 43  WKAEHGKNYNAVGEEERRYAAFRDNLRYIDEHNAAADAGVHSFRLGLNRFADLTNEEYRD 102

Query: 100 PRNGYKRRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVA 159
              G + +    R  + +D     +N ++P S+DWR KGAV  +KDQG CG CWAFSA+A
Sbjct: 103 TYLGLRNK--PRRERKVSDRYLAADNEALPESVDWRTKGAVAEIKDQGGCGSCWAFSAIA 160

Query: 160 AMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYK 219
           A+EGIN I T  L SLSEQELVDCDTS  ++GC GGLMD AF+FII+N G+ TE  YPYK
Sbjct: 161 AVEGINQIVTGDLISLSEQELVDCDTS-YNEGCNGGLMDYAFDFIINNGGIDTEDDYPYK 219

Query: 220 ASDGSCNKK------------EANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDAS 267
             D  C+              + N     I  YEDV  N+E +L KAVANQPVSVAI+A 
Sbjct: 220 GKDERCDVNRVSFVFFAPLVFQKNAKVVTIDSYEDVTPNSETSLQKAVANQPVSVAIEAG 279

Query: 268 GSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRD 327
           G  FQ YSSG+FTG+CGT LDHGV AVGYGT ++G  YW+V+NSWG +WGE+GY+RM+R+
Sbjct: 280 GRAFQLYSSGIFTGKCGTALDHGVAAVGYGT-ENGKDYWIVRNSWGKSWGESGYVRMERN 338

Query: 328 IDAKEGLCGIAMQASYP 344
           I A  G CGIA++ SYP
Sbjct: 339 IKASSGKCGIAVEPSYP 355


>gi|75277440|sp|O23791.1|BROM1_ANACO RecName: Full=Fruit bromelain; AltName: Allergen=Ana c 2; Flags:
           Precursor
 gi|2342496|dbj|BAA21849.1| bromelain [Ananas comosus]
          Length = 351

 Score =  324 bits (831), Expect = 3e-86,   Method: Compositional matrix adjust.
 Identities = 166/339 (48%), Positives = 223/339 (65%), Gaps = 11/339 (3%)

Query: 9   KLVLAAILVLGVWA-PQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENV 67
           +LV   + +  +WA P + SR   +  M +R E WMA+YGRVY+D+ EK  RF+IFK NV
Sbjct: 6   QLVFLFLFLCAMWASPSAASRDEPNDPMMKRFEEWMAEYGRVYKDDDEKMRRFQIFKNNV 65

Query: 68  EYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENAS 127
           ++I +FN++  N  Y LGIN+F D T  EF A   G    L   R      VSF   N S
Sbjct: 66  KHIETFNSRNENS-YTLGINQFTDMTKSEFVAQYTGVSLPLNIEREPV---VSFDDVNIS 121

Query: 128 -VPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTS 186
            VP SIDWR  GAV  VK+Q  CG CW+F+A+A +EGI  I T  L SLSEQE++DC  S
Sbjct: 122 AVPQSIDWRDYGAVNEVKNQNPCGSCWSFAAIATVEGIYKIKTGYLVSLSEQEVLDCAVS 181

Query: 187 GEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSN 246
               GC+GG ++ A++FIISN G+ TE  YPY A  G+CN   + P++A I+GY  V  N
Sbjct: 182 ---YGCKGGWVNKAYDFIISNNGVTTEENYPYLAYQGTCNAN-SFPNSAYITGYSYVRRN 237

Query: 247 NEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYW 306
           +E ++M AV+NQP++  IDAS  +FQ+Y+ GVF+G CGT L+H +T +GYG    GTKYW
Sbjct: 238 DERSMMYAVSNQPIAALIDAS-ENFQYYNGGVFSGPCGTSLNHAITIIGYGQDSSGTKYW 296

Query: 307 LVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPT 345
           +V+NSWG++WGE GY+RM R + +  G+CGIAM   +PT
Sbjct: 297 IVRNSWGSSWGEGGYVRMARGVSSSSGVCGIAMAPLFPT 335


>gi|2351107|dbj|BAA21929.1| bromelain [Ananas comosus]
          Length = 312

 Score =  324 bits (831), Expect = 3e-86,   Method: Compositional matrix adjust.
 Identities = 166/304 (54%), Positives = 210/304 (69%), Gaps = 9/304 (2%)

Query: 43  MAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRN 102
           MA+YGRVY+DN EK  RF+IFK NV +I +FNN+  N  Y LGIN+F D TN EF A   
Sbjct: 1   MAEYGRVYKDNDEKMRRFQIFKNNVNHIETFNNRNGNS-YTLGINKFTDMTNNEFVAQYT 59

Query: 103 GYKRRLPSVRSSETTDVSFRYENAS-VPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAM 161
           G   R  ++       VSF   N S V  SIDWR  GAVT VKDQ  CG CWAFSA+A +
Sbjct: 60  GGISRPLNIEKEPV--VSFDDVNISAVGQSIDWRDYGAVTEVKDQNPCGSCWAFSAIATV 117

Query: 162 EGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKAS 221
           EGI  I T  L SLSEQE++DC  S    GC+GG +D+A++FIISN G+A+EA YPY+A 
Sbjct: 118 EGIYKIVTGYLVSLSEQEVLDCAVS---NGCDGGFVDNAYDFIISNNGVASEADYPYQAY 174

Query: 222 DGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTG 281
            G C    + P++A I+GY  V SN+E+++  AV NQP++ AIDASG +FQ+Y+ GVF+G
Sbjct: 175 QGDC-AANSWPNSAYITGYSYVRSNDESSMKYAVWNQPIAAAIDASGDNFQYYNGGVFSG 233

Query: 282 QCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQA 341
            CGT L+H +T +GYG    GT+YW+VKNSWG++WGE GYIRM R + +  GLCGIAM  
Sbjct: 234 PCGTSLNHAITIIGYGQDSSGTQYWIVKNSWGSSWGERGYIRMARGV-SSSGLCGIAMDP 292

Query: 342 SYPT 345
            YPT
Sbjct: 293 LYPT 296


>gi|297802228|ref|XP_002868998.1| cysteine proteinase [Arabidopsis lyrata subsp. lyrata]
 gi|297314834|gb|EFH45257.1| cysteine proteinase [Arabidopsis lyrata subsp. lyrata]
          Length = 375

 Score =  324 bits (830), Expect = 4e-86,   Method: Compositional matrix adjust.
 Identities = 168/330 (50%), Positives = 217/330 (65%), Gaps = 14/330 (4%)

Query: 25  SWSRTLNDATMNERHEMWMAQYGRVYRDNA----EKEMRFKIFKENVEYIASFNNKARNK 80
           SW RT  D  +   +  W A +G+   +N     +++ RF IFK+N+ +I   N K +N 
Sbjct: 37  SWWRT--DEEVRSIYLQWSADHGKTNNNNNGIINDQDKRFNIFKDNLRFIDLHNEKNKNA 94

Query: 81  PYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENA----SVPASIDWRK 136
            YKLG+ +F D TNEE+R+   G  R  P  R ++  +V+ +Y  A     VP ++DWR 
Sbjct: 95  TYKLGLTKFTDLTNEEYRSLYLG-ARTEPVRRIAKAKNVNQKYSAAVDGKEVPETVDWRL 153

Query: 137 KGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGL 196
           KGAV  +KDQG CG CWAFS  AA+EGIN I T +L SLSEQELVDCD S  +QGC GGL
Sbjct: 154 KGAVNPIKDQGTCGSCWAFSTAAAVEGINKIVTGELISLSEQELVDCDNS-YNQGCNGGL 212

Query: 197 MDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVA 256
           MD AF+FI+ N GL TE  YPY+   G CN    N     I GYEDVP+ +E AL +A++
Sbjct: 213 MDYAFQFIMKNGGLKTEKDYPYRGFGGKCNSFLKNAKVVSIDGYEDVPTKDETALKRAIS 272

Query: 257 NQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTW 316
            QPVSVAI+A G  FQ Y +G+FTG CGT LDH V AVGYG+ ++G  YW+V+NSWG  W
Sbjct: 273 LQPVSVAIEAGGRIFQHYQTGIFTGNCGTNLDHAVVAVGYGS-ENGVDYWIVRNSWGPRW 331

Query: 317 GENGYIRMQRDI-DAKEGLCGIAMQASYPT 345
           GE GYIRM+R++  +K G CGIA++ASYP 
Sbjct: 332 GEEGYIRMERNLASSKSGKCGIAVEASYPV 361


>gi|334904467|gb|AEH26024.1| cysteine peptidase [Ananas comosus]
          Length = 352

 Score =  324 bits (830), Expect = 4e-86,   Method: Compositional matrix adjust.
 Identities = 167/339 (49%), Positives = 220/339 (64%), Gaps = 10/339 (2%)

Query: 9   KLVLAAILVLGVWA-PQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENV 67
           +LV   + +  +WA P + SR      M +R E WMA+YGRVY+DN EK  RF+IFK NV
Sbjct: 6   QLVFLFLFLCVMWASPSAASRDEPSDPMMKRFEEWMAEYGRVYKDNDEKMRRFQIFKNNV 65

Query: 68  EYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENAS 127
            +I +FN+   N  Y LGIN+F D T  EF A   G   R  ++       VSF   N S
Sbjct: 66  NHIETFNSHNGNS-YTLGINQFTDMTKSEFVAQYTGGISRPLNIEREPV--VSFDDVNIS 122

Query: 128 -VPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTS 186
            VP SIDWR  GAV  VK+Q  CG CWAF+A+A +EGI  I T  L SLSEQE++DC  S
Sbjct: 123 AVPQSIDWRDYGAVNEVKNQNPCGSCWAFAAIATVEGIYKIKTGYLVSLSEQEVLDCAVS 182

Query: 187 GEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSN 246
               GC+GG ++ A++FIISN G+ TE  YPY+A  G+CN     P++A I+GY  V  N
Sbjct: 183 ---YGCKGGWVNKAYDFIISNNGVTTEENYPYQAYQGTCNANSF-PNSAYITGYSYVRRN 238

Query: 247 NEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYW 306
           +E ++M AV+NQP++  IDAS  +FQ+Y+ GVF+G CGT L+H +T +GYG    GTKYW
Sbjct: 239 DERSMMYAVSNQPIAALIDAS-ENFQYYNGGVFSGPCGTSLNHAITIIGYGQDSSGTKYW 297

Query: 307 LVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPT 345
           +V+NSWG++WGE GY+RM R + +  G CGIAM   +PT
Sbjct: 298 IVRNSWGSSWGEGGYVRMARGVSSSSGACGIAMSPLFPT 336


>gi|242055753|ref|XP_002457022.1| hypothetical protein SORBIDRAFT_03g047290 [Sorghum bicolor]
 gi|241928997|gb|EES02142.1| hypothetical protein SORBIDRAFT_03g047290 [Sorghum bicolor]
          Length = 378

 Score =  324 bits (830), Expect = 4e-86,   Method: Compositional matrix adjust.
 Identities = 176/341 (51%), Positives = 221/341 (64%), Gaps = 36/341 (10%)

Query: 34  TMNERHEMWMAQYGR-VYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQ 92
           ++ E  E W++++ +  Y    EK  RF++FK+N+ +I   N K  +  Y LG+NEFAD 
Sbjct: 43  SLAELFERWLSRHRKGAYASLEEKLRRFEVFKDNLHHIDETNRKVSS--YWLGLNEFADL 100

Query: 93  TNEEFRAPRNGYKRRLPSVRSSETTDVS--------------------FRYEN---ASVP 129
           T++EF+A    Y    PS    +   +                     FRYE    A +P
Sbjct: 101 THDEFKAT---YLGLSPSGGGGDVVHMHHDDDDEEPEEEGSSSSSSFRFRYEGVDAARLP 157

Query: 130 ASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGED 189
            S+DWR KGAVTGVK+QGQCG CWAFS VAA+EGIN I T  LT+LSEQELVDCDT G +
Sbjct: 158 KSVDWRSKGAVTGVKNQGQCGSCWAFSTVAAVEGINQIVTGNLTALSEQELVDCDTDG-N 216

Query: 190 QGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEA 249
            GC GGLMD AF +I  N GL TE  YPY   +G+C++  ++ +   ISGYEDVP NNE 
Sbjct: 217 NGCNGGLMDYAFSYIAHNGGLHTEEAYPYLMEEGTCSRG-SSAAVVTISGYEDVPRNNEQ 275

Query: 250 ALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTA--DDG---TK 304
           AL+KA+A+QPVSVAI+ASG + QFYS GVF G CGT+LDHGV AVGYGTA  D+G     
Sbjct: 276 ALLKALAHQPVSVAIEASGRNLQFYSGGVFDGPCGTQLDHGVAAVGYGTAGKDNGHVVAD 335

Query: 305 YWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPT 345
           Y +VKNSWG +WGE GYIRM+R    ++GLCGI    SYPT
Sbjct: 336 YIIVKNSWGPSWGEKGYIRMRRGTGKRQGLCGINKMPSYPT 376


>gi|302812789|ref|XP_002988081.1| hypothetical protein SELMODRAFT_183539 [Selaginella moellendorffii]
 gi|300144187|gb|EFJ10873.1| hypothetical protein SELMODRAFT_183539 [Selaginella moellendorffii]
          Length = 425

 Score =  323 bits (828), Expect = 8e-86,   Method: Compositional matrix adjust.
 Identities = 155/319 (48%), Positives = 213/319 (66%), Gaps = 7/319 (2%)

Query: 31  NDATMNERHEMWMAQYGR-VYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEF 89
           +D+ ++  +  W A++G+     N+  + RF+ FKEN  YI   +N+A    Y+LG+N+F
Sbjct: 5   SDSDLSGEYASWCAKFGKECASSNSLGDHRFETFKENFRYIEE-HNRAGKHSYRLGLNQF 63

Query: 90  ADQTNEEFRAPRNGYKRRL---PSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQ 146
           +D T+EEFR    G +  L   P ++    +D+   ++N  +PAS+DWR+ GAVT  KDQ
Sbjct: 64  SDLTSEEFRQRFLGLRPDLIDSPVLKMPRDSDIEEGFQNVDLPASVDWRQHGAVTAPKDQ 123

Query: 147 GQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIIS 206
           G CG CWAF+   A+EGIN I T +L SLSEQEL+DCD    D+GC+GGLM++A++FI+ 
Sbjct: 124 GSCGGCWAFATTGAIEGINQIVTGQLVSLSEQELIDCDKKA-DKGCDGGLMENAYQFIVE 182

Query: 207 NKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDA 266
           N GL TE  YPY AS+  CN K+ N     I GY+ +P  +E AL+ AVA QPVSVAI+ 
Sbjct: 183 NGGLDTETDYPYHASESHCNMKKLNSRVVAIDGYKAIPEGDEQALLLAVAKQPVSVAIEG 242

Query: 267 SGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQR 326
           +  DFQ Y+SGVFTG CG E++HGV  VGYGT +DG  YW+VKNSW  TWG+ G+++MQR
Sbjct: 243 ASKDFQHYASGVFTGHCGEEINHGVLIVGYGT-EDGLDYWIVKNSWAATWGDGGFVKMQR 301

Query: 327 DIDAKEGLCGIAMQASYPT 345
           +   + GLC I   ASYP 
Sbjct: 302 NTGKRGGLCSINTLASYPV 320


>gi|30685308|ref|NP_566634.2| putative cysteine proteinase [Arabidopsis thaliana]
 gi|30315949|sp|Q9LT77.1|CPR1_ARATH RecName: Full=Probable cysteine proteinase At3g19400; Flags:
           Precursor
 gi|11994462|dbj|BAB02464.1| cysteine proteinase [Arabidopsis thaliana]
 gi|332642715|gb|AEE76236.1| putative cysteine proteinase [Arabidopsis thaliana]
          Length = 362

 Score =  323 bits (827), Expect = 1e-85,   Method: Compositional matrix adjust.
 Identities = 159/317 (50%), Positives = 210/317 (66%), Gaps = 6/317 (1%)

Query: 31  NDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFA 90
           N+  +   +E W+ +  + Y    EKE RFKIFK+N++++   +N   ++ +++G+  FA
Sbjct: 36  NETEVRLMYEQWLVENRKNYNGLGEKERRFKIFKDNLKFVDE-HNSVPDRTFEVGLTRFA 94

Query: 91  DQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQCG 150
           D TNEEFRA     ++++   + S  T+     E   +P  +DWR  GAV  VKDQG CG
Sbjct: 95  DLTNEEFRAIY--LRKKMERTKDSVKTERYLYKEGDVLPDEVDWRANGAVVSVKDQGNCG 152

Query: 151 CCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGL 210
            CWAFSAV A+EGIN ITT +L SLSEQELVDCD    + GC+GG+M+ AFEFI+ N G+
Sbjct: 153 SCWAFSAVGAVEGINQITTGELISLSEQELVDCDRGFVNAGCDGGIMNYAFEFIMKNGGI 212

Query: 211 ATEAKYPYKASD-GSCN-KKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASG 268
            T+  YPY A+D G CN  K  N     I GYEDVP ++E +L KAVA+QPVSVAI+AS 
Sbjct: 213 ETDQDYPYNANDLGLCNADKNNNTRVVTIDGYEDVPRDDEKSLKKAVAHQPVSVAIEASS 272

Query: 269 SDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDI 328
             FQ Y SGV TG CG  LDHGV  VGYG+   G  YW+++NSWG  WG++GY+++QR+I
Sbjct: 273 QAFQLYKSGVMTGTCGISLDHGVVVVGYGST-SGEDYWIIRNSWGLNWGDSGYVKLQRNI 331

Query: 329 DAKEGLCGIAMQASYPT 345
           D   G CGIAM  SYPT
Sbjct: 332 DDPFGKCGIAMMPSYPT 348


>gi|186516984|ref|NP_195406.2| cysteine proteinase1 [Arabidopsis thaliana]
 gi|15290508|gb|AAK92229.1| cysteine proteinase [Arabidopsis thaliana]
 gi|332661313|gb|AEE86713.1| cysteine proteinase1 [Arabidopsis thaliana]
          Length = 376

 Score =  322 bits (826), Expect = 1e-85,   Method: Compositional matrix adjust.
 Identities = 165/323 (51%), Positives = 214/323 (66%), Gaps = 12/323 (3%)

Query: 32  DATMNERHEMWMAQYGRVYRDNA----EKEMRFKIFKENVEYIASFNNKARNKPYKLGIN 87
           D  +   +  W A++G+   +N     +++ RF IFK+N+ +I   N   +N  YKLG+ 
Sbjct: 42  DEEVRSIYLQWSAEHGKTNNNNNGIINDQDKRFNIFKDNLRFIDLHNENNKNATYKLGLT 101

Query: 88  EFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENA----SVPASIDWRKKGAVTGV 143
           +F D TN+E+R    G  R  P+ R ++  +V+ +Y  A     VP ++DWR+KGAV  +
Sbjct: 102 KFTDLTNDEYRKLYLG-ARTEPARRIAKAKNVNQKYSAAVNGKEVPETVDWRQKGAVNPI 160

Query: 144 KDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEF 203
           KDQG CG CWAFS  AA+EGIN I T +L SLSEQELVDCD S  +QGC GGLMD AF+F
Sbjct: 161 KDQGTCGSCWAFSTTAAVEGINKIVTGELISLSEQELVDCDKS-YNQGCNGGLMDYAFQF 219

Query: 204 IISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVA 263
           I+ N GL TE  YPY+   G CN    N     I GYEDVP+ +E AL KA++ QPVSVA
Sbjct: 220 IMKNGGLNTEKDYPYRGFGGKCNSFLKNSRVVSIDGYEDVPTKDETALKKAISYQPVSVA 279

Query: 264 IDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIR 323
           I+A G  FQ Y SG+FTG CGT LDH V AVGYG+ ++G  YW+V+NSWG  WGE GYIR
Sbjct: 280 IEAGGRIFQHYQSGIFTGSCGTNLDHAVVAVGYGS-ENGVDYWIVRNSWGPRWGEEGYIR 338

Query: 324 MQRDIDA-KEGLCGIAMQASYPT 345
           M+R++ A K G CGIA++ASYP 
Sbjct: 339 MERNLAASKSGKCGIAVEASYPV 361


>gi|46395939|sp|Q94B08.2|GCP1_ARATH RecName: Full=Germination-specific cysteine protease 1; Flags:
           Precursor
 gi|4006883|emb|CAB16767.1| cysteine proteinase [Arabidopsis thaliana]
 gi|7270637|emb|CAB80354.1| cysteine proteinase [Arabidopsis thaliana]
          Length = 376

 Score =  322 bits (825), Expect = 2e-85,   Method: Compositional matrix adjust.
 Identities = 165/323 (51%), Positives = 214/323 (66%), Gaps = 12/323 (3%)

Query: 32  DATMNERHEMWMAQYGRVYRDNA----EKEMRFKIFKENVEYIASFNNKARNKPYKLGIN 87
           D  +   +  W A++G+   +N     +++ RF IFK+N+ +I   N   +N  YKLG+ 
Sbjct: 42  DEEVRSIYLQWSAEHGKTNNNNNGIINDQDKRFNIFKDNLRFIDLHNEDNKNATYKLGLT 101

Query: 88  EFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENA----SVPASIDWRKKGAVTGV 143
           +F D TN+E+R    G  R  P+ R ++  +V+ +Y  A     VP ++DWR+KGAV  +
Sbjct: 102 KFTDLTNDEYRKLYLG-ARTEPARRIAKAKNVNQKYSAAVNGKEVPETVDWRQKGAVNPI 160

Query: 144 KDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEF 203
           KDQG CG CWAFS  AA+EGIN I T +L SLSEQELVDCD S  +QGC GGLMD AF+F
Sbjct: 161 KDQGTCGSCWAFSTTAAVEGINKIVTGELISLSEQELVDCDKS-YNQGCNGGLMDYAFQF 219

Query: 204 IISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVA 263
           I+ N GL TE  YPY+   G CN    N     I GYEDVP+ +E AL KA++ QPVSVA
Sbjct: 220 IMKNGGLNTEKDYPYRGFGGKCNSFLKNSRVVSIDGYEDVPTKDETALKKAISYQPVSVA 279

Query: 264 IDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIR 323
           I+A G  FQ Y SG+FTG CGT LDH V AVGYG+ ++G  YW+V+NSWG  WGE GYIR
Sbjct: 280 IEAGGRIFQHYQSGIFTGSCGTNLDHAVVAVGYGS-ENGVDYWIVRNSWGPRWGEEGYIR 338

Query: 324 MQRDIDA-KEGLCGIAMQASYPT 345
           M+R++ A K G CGIA++ASYP 
Sbjct: 339 MERNLAASKSGKCGIAVEASYPV 361


>gi|356543010|ref|XP_003539956.1| PREDICTED: LOW QUALITY PROTEIN: vignain-like [Glycine max]
          Length = 306

 Score =  322 bits (824), Expect = 2e-85,   Method: Compositional matrix adjust.
 Identities = 176/316 (55%), Positives = 209/316 (66%), Gaps = 19/316 (6%)

Query: 35  MNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTN 94
           M  R E W+ Q  R Y+D  E E+RF I++ N+EYI   N  ++   Y L  N+FAD TN
Sbjct: 1   MRVRFERWLKQNDRXYKDKEEWEVRFGIYQANLEYIECKN--SQEXSYNLTDNKFADLTN 58

Query: 95  EEFRAPRNGYKRR-LPSVRSSETTDVSFRY-ENASVPASIDWRKKGAVTGVKDQGQCGCC 152
           EEF +P  G+  R LP           F Y E+  +P S DWRK+GAV+ +KDQG CG C
Sbjct: 59  EEFVSPYLGFGTRFLPHT--------GFMYHEHEDLPESKDWRKEGAVSDIKDQGNCGSC 110

Query: 153 WAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLAT 212
           WAFSAVAA+EGIN I + KL SLSEQE  DCD    +QGCEGGLMD AF FI  N GL T
Sbjct: 111 WAFSAVAAVEGINKIKSGKLVSLSEQEFRDCDVEDGNQGCEGGLMDTAFAFIKKNGGLTT 170

Query: 213 EAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAAL--MKAVANQPVSVAIDASGSD 270
              YPY+  DG+CNK++A   AA ISG+  VP+N+EA L    A ANQ  SVAIDA G  
Sbjct: 171 SKDYPYEGVDGTCNKEKALHHAANISGHVKVPANDEAMLKAKAAAANQXESVAIDAGGHA 230

Query: 271 FQFYSSGVFTGQCGTELDHGVTAVGY--GTADDGTKYWLVKNSWGTTWGENGYIRMQRDI 328
           FQ Y  GVF+G CG +L+HGVT VGY  GT+D   KYW+VKNSWG  WGE+GYIRM+RD 
Sbjct: 231 FQLYLKGVFSGICGKQLNHGVTIVGYGKGTSD---KYWIVKNSWGADWGESGYIRMKRDA 287

Query: 329 DAKEGLCGIAMQASYP 344
             K G CGIAMQASYP
Sbjct: 288 FDKAGTCGIAMQASYP 303


>gi|218202389|gb|EEC84816.1| hypothetical protein OsI_31898 [Oryza sativa Indica Group]
          Length = 350

 Score =  322 bits (824), Expect = 2e-85,   Method: Compositional matrix adjust.
 Identities = 163/326 (50%), Positives = 215/326 (65%), Gaps = 19/326 (5%)

Query: 35  MNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTN 94
           M +R E WM ++GR Y D+ EK+ RF++++ NVE + +FN+ +    YKL  N+FAD TN
Sbjct: 28  MLDRFEQWMIRHGRAYTDSGEKQRRFEVYRRNVELVETFNSMSNG--YKLADNKFADLTN 85

Query: 95  EEFRAPRNGYKRR--LPSVRSSETTDVSFRYENAS--VPASIDWRKKGAVTGVKDQGQCG 150
           EEFRA   G++    +P + ++ + D++   E++   +P S+DWRKKGAV  VK+QG CG
Sbjct: 86  EEFRAKMLGFRPHVTIPQISNTCSADIAMPGESSDDILPKSVDWRKKGAVVEVKNQGDCG 145

Query: 151 CCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGL 210
            CWAFSAVAA+EGIN I   +L SLSEQELVDCD   E  GC GG M  AFEF++ N GL
Sbjct: 146 SCWAFSAVAAIEGINQIKNGELVSLSEQELVDCDD--EAVGCGGGYMSWAFEFVVGNHGL 203

Query: 211 ATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSD 270
            TEA YPY A++G+C   + N SA  I+GY +V  ++E  L +A A QPVSVA+D     
Sbjct: 204 TTEASYPYHAANGACQAAKLNQSAVAIAGYRNVTPSSEPDLARAAAAQPVSVAVDGGSFM 263

Query: 271 FQFYSSGVFTGQCGTELDHGVTAVGYGTADDGT----------KYWLVKNSWGTTWGENG 320
           FQ Y SGV+TG C  +++HGVT VGYG ++  T          KYW+VKNSWG  WG+ G
Sbjct: 264 FQLYGSGVYTGPCTADVNHGVTVVGYGESEPKTDGGGAAKGGEKYWIVKNSWGAEWGDAG 323

Query: 321 YIRMQRDIDA-KEGLCGIAMQASYPT 345
           YI MQRD+     GLCGIA+  SYP 
Sbjct: 324 YILMQRDVAGLASGLCGIALLPSYPV 349


>gi|26452046|dbj|BAC43113.1| putative cysteine proteinase RD21A precursor [Arabidopsis thaliana]
          Length = 362

 Score =  322 bits (824), Expect = 2e-85,   Method: Compositional matrix adjust.
 Identities = 159/317 (50%), Positives = 210/317 (66%), Gaps = 6/317 (1%)

Query: 31  NDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFA 90
           N+  +   +E W+ +  + Y    EKE RFKIFK+N++++   +N   ++ +++G+  FA
Sbjct: 36  NETEVRLMYEQWLVENRKNYNGLGEKERRFKIFKDNLKFVDE-HNSVPDRTFEVGLTRFA 94

Query: 91  DQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQCG 150
           D TNEEFRA     ++++   + S  T+     E   +P  +DWR  GAV  VKDQG CG
Sbjct: 95  DLTNEEFRAIY--LRKKMERNKDSVKTERYLYKEGDVLPDEVDWRANGAVVSVKDQGNCG 152

Query: 151 CCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGL 210
            CWAFSAV A+EGIN ITT +L SLSEQELVDCD    + GC+GG+M+ AFEFI+ N G+
Sbjct: 153 SCWAFSAVGAVEGINQITTGELISLSEQELVDCDRGFVNAGCDGGIMNYAFEFIMKNGGI 212

Query: 211 ATEAKYPYKASD-GSCN-KKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASG 268
            T+  YPY A+D G CN  K  N     I GYEDVP ++E +L KAVA+QPVSVAI+AS 
Sbjct: 213 ETDQDYPYNANDLGLCNADKNNNTRVVTIDGYEDVPRDDEKSLKKAVAHQPVSVAIEASS 272

Query: 269 SDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDI 328
             FQ Y SGV TG CG  LDHGV  VGYG+   G  YW+++NSWG  WG++GY+++QR+I
Sbjct: 273 QAFQLYKSGVMTGTCGISLDHGVVVVGYGST-SGEDYWIIRNSWGLNWGDSGYVKLQRNI 331

Query: 329 DAKEGLCGIAMQASYPT 345
           D   G CGIAM  SYPT
Sbjct: 332 DDPFGKCGIAMMPSYPT 348


>gi|400180422|gb|AFP73349.1| cysteine protease [Solanum chmielewskii]
          Length = 344

 Score =  322 bits (824), Expect = 2e-85,   Method: Compositional matrix adjust.
 Identities = 161/347 (46%), Positives = 225/347 (64%), Gaps = 9/347 (2%)

Query: 1   MAMILLENKLVLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRF 60
           MAM +    +++    V+ ++  Q+  R+    +++ERHE+WM+++GRVY+D  EK  RF
Sbjct: 1   MAMKVDLMNILITLFFVISIFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERF 60

Query: 61  KIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVS 120
            IFKEN+++I S N KA N  YKLG+NEFAD T++EF A   G       +  S  +   
Sbjct: 61  MIFKENMKFIESVN-KAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTE 119

Query: 121 FRYENAS---VPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSE 177
           F+  + S   +P+++DWR+ GAVT VK QGQCGCCWAFSAV ++EG   I T  L   SE
Sbjct: 120 FKTNDLSDDDMPSNLDWRESGAVTQVKHQGQCGCCWAFSAVGSLEGAYKIATGNLMEFSE 179

Query: 178 QELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKI 237
           QEL+DC T+  + GC GG M +AF+FII N G++ E+ Y Y     +C  +E   +A +I
Sbjct: 180 QELLDCTTN--NYGCNGGFMTNAFDFIIENGGISRESDYEYLGQQYTCRSQEKT-AAVQI 236

Query: 238 SGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYG 297
           S Y+ VP   E +L++AV  QPVS+ I AS  D QFYS G + G C   ++H VTA+GYG
Sbjct: 237 SSYQVVPEG-ETSLLQAVTKQPVSIGIAAS-QDLQFYSGGTYDGSCADRINHAVTAIGYG 294

Query: 298 TADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
           T ++G KYWL+KNSWGT+WGENG++++ RD     GLC IA  +SYP
Sbjct: 295 TDEEGQKYWLLKNSWGTSWGENGFMKIIRDSGDPSGLCDIAKMSSYP 341


>gi|146215978|gb|ABQ10191.1| actinidin Act1c [Actinidia eriantha]
          Length = 368

 Score =  321 bits (822), Expect = 3e-85,   Method: Compositional matrix adjust.
 Identities = 167/347 (48%), Positives = 218/347 (62%), Gaps = 22/347 (6%)

Query: 1   MAMILLENKLVLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRF 60
           M+++     L+L+  L          ++  ND  +   +E W+ ++G+ Y    E+E RF
Sbjct: 10  MSLLFFSTLLILSLALD---------AKRTNDE-VKAMYESWLIKHGKSYNSLGERERRF 59

Query: 61  KIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVS 120
           +IFKE + +I   +N   ++ YK+G+N+FAD TNEEFR+   G+       R S  T VS
Sbjct: 60  EIFKETLRFIDE-HNADTSRSYKVGLNQFADLTNEEFRSTYLGF------TRGSNKTKVS 112

Query: 121 FRYE---NASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSE 177
            RYE      +P  +DWR +GAV  +K+QGQCG CWAFSA+AA+EGIN I T  L SLSE
Sbjct: 113 NRYEPRVGQVLPDYVDWRSEGAVVDIKNQGQCGSCWAFSAIAAVEGINKIVTGNLISLSE 172

Query: 178 QELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKI 237
           QELVDC  +   +GC+GG M D FEFII+N G+ TE  YPY A +G C+    N     I
Sbjct: 173 QELVDCGRTQSTKGCDGGYMTDGFEFIINNGGINTEENYPYTAQEGQCDLNLQNEKYVTI 232

Query: 238 SGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYG 297
             YE+VP  NE AL  AVA QPVSVA++++G  FQ YSSG+FTG CGT  DH VT VGYG
Sbjct: 233 DNYENVPYYNEWALQTAVAYQPVSVALESAGDAFQHYSSGIFTGPCGTATDHAVTIVGYG 292

Query: 298 TADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
           T + G  YW+VKNSW TTWGE GY+R+ R++    G CGIA   SYP
Sbjct: 293 T-EGGIDYWIVKNSWDTTWGEEGYMRILRNVGGA-GTCGIATMPSYP 337


>gi|115479933|ref|NP_001063560.1| Os09g0497500 [Oryza sativa Japonica Group]
 gi|113631793|dbj|BAF25474.1| Os09g0497500 [Oryza sativa Japonica Group]
 gi|215704298|dbj|BAG93138.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 349

 Score =  321 bits (822), Expect = 4e-85,   Method: Compositional matrix adjust.
 Identities = 163/326 (50%), Positives = 214/326 (65%), Gaps = 19/326 (5%)

Query: 35  MNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTN 94
           M +R E WM ++GR Y D  EK+ RF++++ NVE + +FN+ +    YKL  N+FAD TN
Sbjct: 27  MLDRFEQWMIRHGRAYTDAGEKQRRFEVYRRNVELVETFNSMSNG--YKLADNKFADLTN 84

Query: 95  EEFRAPRNGYKRR--LPSVRSSETTDVSFRYENAS--VPASIDWRKKGAVTGVKDQGQCG 150
           EEFRA   G++    +P + ++ + D++   E++   +P S+DWRKKGAV  VK+QG CG
Sbjct: 85  EEFRAKMLGFRPHVTIPQISNTCSADIAMPGESSDDILPKSVDWRKKGAVVEVKNQGDCG 144

Query: 151 CCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGL 210
            CWAFSAVAA+EGIN I   +L SLSEQELVDCD   E  GC GG M  AFEF++ N GL
Sbjct: 145 SCWAFSAVAAIEGINQIKNGELVSLSEQELVDCDD--EAVGCGGGYMSWAFEFVVGNHGL 202

Query: 211 ATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSD 270
            TEA YPY A++G+C   + N SA  I+GY +V  ++E  L +A A QPVSVA+D     
Sbjct: 203 TTEASYPYHAANGACQAAKLNQSAVAIAGYRNVTPSSEPDLARAAAAQPVSVAVDGGSFM 262

Query: 271 FQFYSSGVFTGQCGTELDHGVTAVGYGTADDGT----------KYWLVKNSWGTTWGENG 320
           FQ Y SGV+TG C  +++HGVT VGYG ++  T          KYW+VKNSWG  WG+ G
Sbjct: 263 FQLYGSGVYTGPCTADVNHGVTVVGYGESEPKTDGGGAAKGGEKYWIVKNSWGAEWGDAG 322

Query: 321 YIRMQRDIDA-KEGLCGIAMQASYPT 345
           YI MQRD+     GLCGIA+  SYP 
Sbjct: 323 YILMQRDVAGLASGLCGIALLPSYPV 348


>gi|242094000|ref|XP_002437490.1| hypothetical protein SORBIDRAFT_10g028000 [Sorghum bicolor]
 gi|241915713|gb|EER88857.1| hypothetical protein SORBIDRAFT_10g028000 [Sorghum bicolor]
          Length = 372

 Score =  321 bits (822), Expect = 4e-85,   Method: Compositional matrix adjust.
 Identities = 158/317 (49%), Positives = 214/317 (67%), Gaps = 8/317 (2%)

Query: 32  DATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARN--KPYKLGINEF 89
           D  +   +E W +++G  +   ++  +R ++F++N+ YI + N +A      ++LG+  F
Sbjct: 45  DDEVRRMYEAWKSEHGHGH--GSDDRLRLEVFRDNLRYIDAHNAEADAGLHTFRLGLTPF 102

Query: 90  ADQTNEEFRAPRNGYK-RRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQ 148
           AD T EE+R    G++ RR  + R    +    R     +P +IDWR+ GAVTGVK+Q Q
Sbjct: 103 ADLTLEEYRGRALGFRARRGGASRVGSGSSYRPRPRGGDLPDAIDWRELGAVTGVKNQEQ 162

Query: 149 CGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNK 208
           CG CWAFSAVAA+EGIN I T  L SLSEQE++DCDT  +D GC GG M +AF+F+I+N 
Sbjct: 163 CGGCWAFSAVAAIEGINEIVTGNLVSLSEQEIIDCDT--QDGGCNGGEMQNAFQFVINNG 220

Query: 209 GLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASG 268
           G+ TEA YPY  +D +C+    N     I G+  V + NE AL +AVANQPVSVAIDASG
Sbjct: 221 GIDTEADYPYLGTDAACDANRVNERVVTIDGFVSVATENETALQEAVANQPVSVAIDASG 280

Query: 269 SDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDI 328
             FQ Y+SG+F G CGT+LDHGVTAVGYG+ ++G  YW+VKNSW ++WGE GYIR++R++
Sbjct: 281 RKFQHYTSGIFNGPCGTQLDHGVTAVGYGS-ENGKDYWIVKNSWSSSWGEAGYIRIRRNV 339

Query: 329 DAKEGLCGIAMQASYPT 345
            A  G CGIAM ASYP 
Sbjct: 340 AAATGKCGIAMDASYPV 356


>gi|224081756|ref|XP_002306486.1| predicted protein [Populus trichocarpa]
 gi|222855935|gb|EEE93482.1| predicted protein [Populus trichocarpa]
          Length = 352

 Score =  321 bits (822), Expect = 4e-85,   Method: Compositional matrix adjust.
 Identities = 169/312 (54%), Positives = 217/312 (69%), Gaps = 11/312 (3%)

Query: 39  HEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFR 98
           ++ W+A++G+ Y    E+  RF+IFK N+ +I   N  ++N  YK+G+ +FAD TNEE+R
Sbjct: 4   YKWWLAKHGKAYNGLGEEAERFEIFKNNLRFIDEHN--SQNHTYKVGLTKFADLTNEEYR 61

Query: 99  A----PRNGYKRRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQCGCCWA 154
           A     R+  KRRL   +S  +   +F+  +  +P S+DWR KGAV  +KDQG CG CWA
Sbjct: 62  AMFLGTRSDAKRRLMKSKSP-SERYAFKAGD-KLPESVDWRAKGAVNPIKDQGSCGSCWA 119

Query: 155 FSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEA 214
           FS VAA+EGIN I T +L SLSEQELVDCD +  + GC GGLMD AF+FII+N GL TE 
Sbjct: 120 FSTVAAVEGINQIVTGELISLSEQELVDCDRT-YNAGCNGGLMDYAFQFIINNGGLDTEK 178

Query: 215 KYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFY 274
            YPY   D  C+K +    A  I G+EDV   +E AL KAVA+QPVSVAI+ASG   QFY
Sbjct: 179 DYPYVGDDDKCDKDKMKTKAVSIDGFEDVLPYDEKALQKAVAHQPVSVAIEASGMALQFY 238

Query: 275 SSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDI-DAKEG 333
            SGVFTG+CGT LDHGV  VGY + ++G  YWLV+NSWGT WGE+GYI+MQR++ D   G
Sbjct: 239 QSGVFTGECGTALDHGVVVVGYAS-ENGLDYWLVRNSWGTEWGEHGYIKMQRNVGDTYTG 297

Query: 334 LCGIAMQASYPT 345
            CGIAM++SYP 
Sbjct: 298 RCGIAMESSYPV 309


>gi|110737959|dbj|BAF00916.1| cysteine proteinase [Arabidopsis thaliana]
          Length = 376

 Score =  320 bits (821), Expect = 4e-85,   Method: Compositional matrix adjust.
 Identities = 164/323 (50%), Positives = 213/323 (65%), Gaps = 12/323 (3%)

Query: 32  DATMNERHEMWMAQYGRVYRDNA----EKEMRFKIFKENVEYIASFNNKARNKPYKLGIN 87
           D  +   +  W A++G+   +N     +++ RF IFK+N+ +I   N   +N  YKLG+ 
Sbjct: 42  DEEVRSIYLQWSAEHGKTNNNNNGIINDQDKRFNIFKDNLRFIDLHNENNKNATYKLGLT 101

Query: 88  EFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENA----SVPASIDWRKKGAVTGV 143
           +F D TN+E+R    G  R  P+ R ++  +V+ +Y  A     VP ++DWR+KGAV  +
Sbjct: 102 KFTDLTNDEYRKLYLG-ARTEPARRIAKAKNVNQKYSAAVNGKEVPETVDWRQKGAVNPI 160

Query: 144 KDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEF 203
           KDQG CG CWAFS  AA+EGIN I T +L SLSEQELVDCD S  +QGC GGLMD AF+F
Sbjct: 161 KDQGTCGSCWAFSTTAAVEGINKIVTGELISLSEQELVDCDKS-YNQGCNGGLMDYAFQF 219

Query: 204 IISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVA 263
           I+ N GL TE  YPY+   G CN    N     I GYEDVP+ +E AL KA++ QPV VA
Sbjct: 220 IMKNGGLNTEKDYPYRGFGGKCNSFLKNSRVVSIDGYEDVPTKDETALKKAISYQPVRVA 279

Query: 264 IDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIR 323
           I+A G  FQ Y SG+FTG CGT LDH V AVGYG+ ++G  YW+V+NSWG  WGE GYIR
Sbjct: 280 IEAGGRIFQHYQSGIFTGSCGTNLDHAVVAVGYGS-ENGVDYWIVRNSWGPRWGEEGYIR 338

Query: 324 MQRDIDA-KEGLCGIAMQASYPT 345
           M+R++ A K G CGIA++ASYP 
Sbjct: 339 MERNLAASKSGKCGIAVEASYPV 361


>gi|357446975|ref|XP_003593763.1| Cysteine proteinase [Medicago truncatula]
 gi|355482811|gb|AES64014.1| Cysteine proteinase [Medicago truncatula]
          Length = 350

 Score =  320 bits (821), Expect = 5e-85,   Method: Compositional matrix adjust.
 Identities = 165/337 (48%), Positives = 221/337 (65%), Gaps = 6/337 (1%)

Query: 10  LVLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEY 69
           L+   I++L   A  + SRTL ++++ E H+ WM +Y R Y +++E E R KIFKEN+EY
Sbjct: 4   LIGFCIILLWACAYPTMSRTLTESSVVEAHQQWMMKYERTYTNSSEMEKRKKIFKENLEY 63

Query: 70  IASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYE-NASV 128
           I +FNN   NK YKLG+N ++D T+EEF A   G+K     +  S+   V+  +  N  V
Sbjct: 64  IENFNNVG-NKSYKLGLNRYSDLTSEEFIASHTGFKVS-DQLSDSKMRSVAIPFNLNDDV 121

Query: 129 PASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGE 188
           P + DWR+KG VT VK+Q QCGCCWAF+AVAA+EGI  I    L SLSEQ+LVDCD   +
Sbjct: 122 PTNFDWREKGVVTDVKNQRQCGCCWAFTAVAAVEGIVKIKNGNLISLSEQQLVDCDR--Q 179

Query: 189 DQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNE 248
             GC GG    AF+ II ++G+  E  YPYKA+D    +    P AA+I+GY  VP+N+E
Sbjct: 180 SSGCGGGDFVLAFDSIIKSRGIVKEDDYPYKANDVQTCQLGQIPGAAQINGYFKVPANDE 239

Query: 249 AALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLV 308
             L++AV  QPVSVAI  S  DF  Y  GV+ G CG +L+H VT +GYG ++ G KYWL+
Sbjct: 240 QQLLRAVLQQPVSVAISTS-YDFHHYMGGVYEGSCGPKLNHAVTIIGYGVSEAGKKYWLI 298

Query: 309 KNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPT 345
           KNSWG TWGE GY+++ R+  A  G C IA+ A+YPT
Sbjct: 299 KNSWGETWGEKGYMKVLRESSATGGQCSIAVHAAYPT 335


>gi|384253406|gb|EIE26881.1| hypothetical protein COCSUDRAFT_21961 [Coccomyxa subellipsoidea
           C-169]
          Length = 481

 Score =  320 bits (821), Expect = 5e-85,   Method: Compositional matrix adjust.
 Identities = 159/304 (52%), Positives = 200/304 (65%), Gaps = 5/304 (1%)

Query: 42  WMAQYGRVYRDNAEK-EMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAP 100
           W+    + Y+DN E+ E +F ++ +N+E++ S N K  +  +KLG+  FAD T++E+R  
Sbjct: 51  WVEHLQKAYKDNVEEYERKFSVWLDNLEFVHSHNEK--DSTFKLGLTNFADLTHDEYRQH 108

Query: 101 RNGYKRRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAA 160
             GY+  L            F+Y +   P SIDWRKKGAVT VK+Q QCG CWAFS   +
Sbjct: 109 ALGYRPELKGTGLGTGKSTGFQYADYEAPPSIDWRKKGAVTDVKNQQQCGSCWAFSTTGS 168

Query: 161 MEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKA 220
           +EG N I + +L SLSEQELVDCD + +D GC GGLMD AF FII N G+ TE  Y YKA
Sbjct: 169 VEGANAIYSGELVSLSEQELVDCDVT-QDHGCHGGLMDFAFSFIIRNGGIDTEKDYKYKA 227

Query: 221 SDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFT 280
            DG CN  +       I  YEDVP N+E+AL KA ANQP+SVAI+A   +FQ Y+ GVF 
Sbjct: 228 QDGVCNIAKEKRHVVTIDSYEDVPPNDESALKKAAANQPISVAIEADQREFQLYAGGVFD 287

Query: 281 GQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQ 340
             CGT LDHGV  VGYG+ D+GT YW+VKNSWG  WG++GYIR+ R I    G CGIAMQ
Sbjct: 288 APCGTALDHGVLVVGYGS-DNGTDYWIVKNSWGDFWGDSGYIRLARGISNSAGQCGIAMQ 346

Query: 341 ASYP 344
           ASYP
Sbjct: 347 ASYP 350


>gi|20334373|gb|AAM19207.1|AF493232_1 cysteine protease [Solanum pimpinellifolium]
 gi|400180424|gb|AFP73350.1| cysteine protease [Solanum pimpinellifolium]
 gi|400180433|gb|AFP73354.1| cysteine protease [Solanum lycopersicum]
          Length = 344

 Score =  320 bits (819), Expect = 7e-85,   Method: Compositional matrix adjust.
 Identities = 160/347 (46%), Positives = 227/347 (65%), Gaps = 9/347 (2%)

Query: 1   MAMILLENKLVLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRF 60
           MAM +    +++    V+ ++  Q+  R+    +++ERHE+WM+++GRVY+D  EK  RF
Sbjct: 1   MAMKVDLMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERF 60

Query: 61  KIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVS 120
            IFKEN+++I S N KA N  YKLG+NEFAD T++EF A   G       +  S  +   
Sbjct: 61  MIFKENMKFIESVN-KAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTE 119

Query: 121 FRYENAS---VPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSE 177
           F+  + S   +P+++DWR+ GAVT VK QG+CGCCWAFSAV ++EG   I T  L   SE
Sbjct: 120 FKINDLSDDYMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSE 179

Query: 178 QELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKI 237
           QEL+DC T+  + GC GGLM +AF+FII N G++ E+ Y Y     +C  +E   +A +I
Sbjct: 180 QELLDCTTN--NYGCNGGLMTNAFDFIIENGGISRESDYEYLGEQYTCRSREKT-AAVQI 236

Query: 238 SGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYG 297
           S Y+ VP   E +L++AV  QPVS+ I AS  D QFY+ G + G C  +++H VTA+GYG
Sbjct: 237 SSYKVVPEG-ETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGNCADQINHAVTAIGYG 294

Query: 298 TADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
           T ++G KYWL+KNSWGT+WGENG++++ RD     GLC IA  +SYP
Sbjct: 295 TDEEGQKYWLLKNSWGTSWGENGFMKIIRDSGDPSGLCDIAKMSSYP 341


>gi|400180443|gb|AFP73358.1| cysteine protease, partial [Solanum habrochaites]
          Length = 345

 Score =  320 bits (819), Expect = 8e-85,   Method: Compositional matrix adjust.
 Identities = 160/347 (46%), Positives = 225/347 (64%), Gaps = 9/347 (2%)

Query: 1   MAMILLENKLVLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRF 60
           MAM +    +++    V+ ++  Q+ +R+    +++ERHE+WM+++GRVY+D  EK  RF
Sbjct: 1   MAMKVDLMSILITLFFVISMFNSQTRARSQPKLSVSERHELWMSRHGRVYKDEVEKGERF 60

Query: 61  KIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVS 120
            IFKEN+++I S N KA N  YKLG+NEFAD T+EEF A   G       +  S      
Sbjct: 61  MIFKENMKFIESVN-KAGNLSYKLGMNEFADITSEEFLAKFTGLNIPNSYLSPSPMPSTE 119

Query: 121 FRYENAS---VPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSE 177
           F+  + S   +P+++DWR+ GAVT VK+QGQCGCCWAFSAV ++EG   I T  L   SE
Sbjct: 120 FKINDLSDDDMPSNLDWRESGAVTQVKNQGQCGCCWAFSAVGSLEGAYKIATGNLMEFSE 179

Query: 178 QELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKI 237
           QEL+DC T+  + GC GG M +AF+FII N G++ E+ Y Y     +C + +   +A +I
Sbjct: 180 QELLDCTTN--NYGCNGGFMTNAFDFIIENGGISRESDYEYLGQQYTC-RSQGKTAAVQI 236

Query: 238 SGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYG 297
           S Y+ VP   E +L++AV  QPVS+ I AS  D QFY+ G + G C   ++H VTA+GYG
Sbjct: 237 SNYQVVPEG-ETSLLQAVTKQPVSIGIAAS-HDLQFYAGGTYDGSCANRINHAVTAIGYG 294

Query: 298 TADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
           T + G KYWL+KNSWGT+WGENG++++ RD     GLC IA  +SYP
Sbjct: 295 TDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGNPAGLCDIAKMSSYP 341


>gi|242072388|ref|XP_002446130.1| hypothetical protein SORBIDRAFT_06g002130 [Sorghum bicolor]
 gi|241937313|gb|EES10458.1| hypothetical protein SORBIDRAFT_06g002130 [Sorghum bicolor]
          Length = 276

 Score =  320 bits (819), Expect = 9e-85,   Method: Compositional matrix adjust.
 Identities = 160/285 (56%), Positives = 196/285 (68%), Gaps = 31/285 (10%)

Query: 64  KENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRY 123
           ++NV ++ SFN    NK + LG+N+FAD T EEF+A + G+K       + +     F+Y
Sbjct: 19  RDNVAFVESFNANKNNK-FWLGVNQFADLTTEEFKANK-GFK----PTSAEKVPTTGFKY 72

Query: 124 ENASV---PASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQEL 180
           EN SV   P ++DWR KGAVT +K+QGQCGCCWAFSAVAAMEGI  ++T  L SLS+QEL
Sbjct: 73  ENLSVSALPTAVDWRTKGAVTPIKNQGQCGCCWAFSAVAAMEGIVKLSTGNLISLSKQEL 132

Query: 181 VDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGY 240
           VDCDT   D+GCE                     + PYKA DG C  K  + SAA I G+
Sbjct: 133 VDCDTHSMDEGCE--------------------VQLPYKAVDGKC--KGGSKSAATIKGH 170

Query: 241 EDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTAD 300
           EDVP NNEAALMKAVANQPVSVA+DAS   F  YS GV TG CGTELDHG+ A+GYG   
Sbjct: 171 EDVPVNNEAALMKAVANQPVSVAVDASDRTFMLYSGGVMTGSCGTELDHGIAAIGYGMES 230

Query: 301 DGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPT 345
           DGTKYW++KNSWGTTWGE G++RM++DI  K G+CG+AM+ SYPT
Sbjct: 231 DGTKYWILKNSWGTTWGEKGFLRMEKDITDKRGMCGLAMKPSYPT 275


>gi|400180449|gb|AFP73361.1| cysteine protease [Solanum chilense]
          Length = 344

 Score =  319 bits (818), Expect = 1e-84,   Method: Compositional matrix adjust.
 Identities = 160/347 (46%), Positives = 225/347 (64%), Gaps = 9/347 (2%)

Query: 1   MAMILLENKLVLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRF 60
           MAM +    +++    V+ ++  Q+  R+    +++ERHE+WM+++GRVY+D  EK  RF
Sbjct: 1   MAMKVDLMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERF 60

Query: 61  KIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVS 120
            IFK+N+++I S N KA N  YKLG+NEFAD T++EF A   G       +  S  +   
Sbjct: 61  MIFKKNMKFIESVN-KAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTE 119

Query: 121 FRYENAS---VPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSE 177
           F+  + S   +P+++DWR+ GAVT VK QGQCGCCWAFSAV ++EG   I T KL   SE
Sbjct: 120 FKINDLSDDDMPSNLDWRESGAVTQVKHQGQCGCCWAFSAVGSLEGAYKIATGKLMEFSE 179

Query: 178 QELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKI 237
           QEL+DC T+  + GC GG M +AF+FII N G++ E+ Y Y     +C  +E   +A +I
Sbjct: 180 QELLDCTTN--NYGCNGGFMTNAFDFIIENGGISRESDYEYLGEQYTCRSQEKT-AAVQI 236

Query: 238 SGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYG 297
           S Y+ VP   E +L++AV  QPVS+ I AS  D QFY+ G + G C   ++H VTA+GYG
Sbjct: 237 SSYQVVPEG-ETSLLQAVTKQPVSIGIAAS-QDLQFYAEGTYDGSCADRINHAVTAIGYG 294

Query: 298 TADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
           T + G KYWL+KNSWGT+WGENG++++ RD     GLC IA  +SYP
Sbjct: 295 TDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGNPSGLCDIAKMSSYP 341


>gi|400180377|gb|AFP73327.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score =  319 bits (818), Expect = 1e-84,   Method: Compositional matrix adjust.
 Identities = 159/347 (45%), Positives = 226/347 (65%), Gaps = 9/347 (2%)

Query: 1   MAMILLENKLVLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRF 60
           MAM +    +++    V+ ++  Q+  R+  + +++ERHE+WM+++GRVY+D  EK  RF
Sbjct: 1   MAMKVDLMNILITLFFVISMFNTQTRGRSQPELSVSERHELWMSRHGRVYKDEVEKGERF 60

Query: 61  KIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVS 120
            IFKEN+++I S N KA N  YKLG+NEFAD T++EF A   G       +  S  +   
Sbjct: 61  MIFKENMKFIESVN-KAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTE 119

Query: 121 FRYENAS---VPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSE 177
           F+  + S   +P+++DWR+ GAVT VK QG+CGCCWAFSAV ++EG   I T  L   SE
Sbjct: 120 FKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSE 179

Query: 178 QELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKI 237
           QEL+DC T+  + GC GG M +AF+FII N G++ E+ Y Y+    +C  +E   +A +I
Sbjct: 180 QELLDCTTN--NYGCNGGFMTNAFDFIIENGGISRESDYEYQGEQYTCRSQEKT-AAVQI 236

Query: 238 SGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYG 297
           S Y+ VP   E +L++AV  QPVS+ I AS  D QFY+ G + G C   ++H VTA+GYG
Sbjct: 237 SSYQVVPEG-ETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHAVTAIGYG 294

Query: 298 TADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
           T + G KYWL+KNSWGT+WGENG++++ RD     GLC IA  +SYP
Sbjct: 295 TDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGNPSGLCDIAKMSSYP 341


>gi|242066206|ref|XP_002454392.1| hypothetical protein SORBIDRAFT_04g029960 [Sorghum bicolor]
 gi|241934223|gb|EES07368.1| hypothetical protein SORBIDRAFT_04g029960 [Sorghum bicolor]
          Length = 356

 Score =  319 bits (817), Expect = 1e-84,   Method: Compositional matrix adjust.
 Identities = 159/305 (52%), Positives = 206/305 (67%), Gaps = 5/305 (1%)

Query: 42  WMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPR 101
           W  ++ ++Y    EK  R+ IFK+N+ +IA  N K  N  Y LG+N+FAD T+EEF+A  
Sbjct: 48  WSVKHRKIYVSPKEKLKRYGIFKQNLMHIAETNRK--NGSYWLGLNQFADITHEEFKANH 105

Query: 102 NGYKRRLPSVRSSETTDVSFRYENAS-VPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAA 160
            G K+ L  + +   T  +FRY  A+ +P S+DWR KGAVT VK+QG+CG CWAFS+VAA
Sbjct: 106 LGLKQGLSRMGAQTRTPTTFRYAAAANLPWSVDWRYKGAVTPVKNQGKCGSCWAFSSVAA 165

Query: 161 MEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKA 220
           +EGIN I T KL SLSEQEL+DCDT   D GCEGGLMD AF +I+ ++G+  E  YPY  
Sbjct: 166 VEGINQIVTGKLVSLSEQELMDCDTM-LDHGCEGGLMDFAFAYIMGSQGIHAEDDYPYLM 224

Query: 221 SDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFT 280
            +G C +K+   +   I+GYEDVP N+E +L+KA+A+QPVSV I A   DFQFY  GVF 
Sbjct: 225 EEGYCKEKQPYANVVTITGYEDVPENSEISLLKALAHQPVSVGIAAGSRDFQFYKGGVFD 284

Query: 281 GQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQ 340
           G C  ELDH +TAVGYG++  G  Y  +KNSWG  WGE GY+R++      EG+CGI   
Sbjct: 285 GSCSDELDHALTAVGYGSS-YGQNYITMKNSWGKNWGEQGYVRIKMGTGKPEGVCGIYTM 343

Query: 341 ASYPT 345
           ASYP 
Sbjct: 344 ASYPV 348


>gi|449450419|ref|XP_004142960.1| PREDICTED: vignain-like [Cucumis sativus]
          Length = 345

 Score =  319 bits (817), Expect = 2e-84,   Method: Compositional matrix adjust.
 Identities = 169/312 (54%), Positives = 218/312 (69%), Gaps = 16/312 (5%)

Query: 39  HEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEF- 97
           +E W  ++  + R+  EK  RF +FKENV ++ + N    +KPYKL +N+FAD +N EF 
Sbjct: 41  YERW-GKHHTISRNLKEKHKRFSVFKENVNHVFTVNQM--DKPYKLKLNKFADMSNYEFV 97

Query: 98  ----RAPRNGYKRRLPSVRSSETTDVSFRYE-NASVPASIDWRKKGAVTGVKDQGQCGCC 152
               R+  + Y++     R +      F YE +  +P+S+DWR++GAV  VK+QG+CG C
Sbjct: 98  NFYARSNISHYRKLHERRRGAG----GFMYEQDTDLPSSVDWRERGAVNAVKEQGRCGSC 153

Query: 153 WAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLAT 212
           WAFS+VAA+EGIN I T +L SLSEQEL+DC+    ++GC GG M+ AF+FI  N G+AT
Sbjct: 154 WAFSSVAAVEGINKIKTNQLLSLSEQELLDCNY--RNKGCNGGFMEIAFDFIKRNGGIAT 211

Query: 213 EAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQ 272
           E  YPY  S G C     +    KI GYE VP N E ALM+AVANQPVSVAIDA+G DFQ
Sbjct: 212 ENSYPYHGSRGLCRSSRISSPIVKIDGYESVPEN-EDALMQAVANQPVSVAIDAAGRDFQ 270

Query: 273 FYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKE 332
           FYS GVF G CGTEL+HGV A+GYGT +DGT YWLV+NSWG  WGE+GY+RM+R ++  E
Sbjct: 271 FYSQGVFDGYCGTELNHGVVAIGYGTTEDGTDYWLVRNSWGVGWGEDGYVRMKRGVEQAE 330

Query: 333 GLCGIAMQASYP 344
           GLCGIAM+ASYP
Sbjct: 331 GLCGIAMEASYP 342


>gi|400180447|gb|AFP73360.1| cysteine protease [Solanum chilense]
          Length = 345

 Score =  319 bits (817), Expect = 2e-84,   Method: Compositional matrix adjust.
 Identities = 161/348 (46%), Positives = 224/348 (64%), Gaps = 10/348 (2%)

Query: 1   MAMILLENKLVLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRF 60
           MAM +    +++    V+ ++  Q+  R+    +++ERHE+WM+++GRVY+D  EK  RF
Sbjct: 1   MAMKVDLMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERF 60

Query: 61  KIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVS 120
            IFKEN+++I S N KA N  YKLG+NEFAD T++EF A   G       +  S  +   
Sbjct: 61  MIFKENMKFIESVN-KAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTE 119

Query: 121 FRYENA----SVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLS 176
           F+  N      +P+++DWR+ GAVT VK QGQCGCCWAFSAV ++EG   I T KL   S
Sbjct: 120 FKKINDLSDDDMPSNLDWRESGAVTQVKHQGQCGCCWAFSAVGSLEGAYKIATGKLMEFS 179

Query: 177 EQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAK 236
           EQEL+DC T+  + GC GG M +AF+FII N G++ E+ Y Y     +C  +E   +A +
Sbjct: 180 EQELLDCTTN--NYGCNGGFMTNAFDFIIENGGISRESDYEYLGEQYTCRSQEKT-AAVQ 236

Query: 237 ISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGY 296
           IS Y+ VP   E +L++AV  QPVS+ I AS  D QFY+ G + G C   ++H VTA+GY
Sbjct: 237 ISSYQVVPEG-ETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHAVTAIGY 294

Query: 297 GTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
           GT + G KYWL+KNSWGT+WGENG++++ RD     GLC IA  +SYP
Sbjct: 295 GTDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGNPSGLCDIAKMSSYP 342


>gi|1174171|gb|AAB41816.1| NTH1 [Pisum sativum]
          Length = 367

 Score =  318 bits (816), Expect = 2e-84,   Method: Compositional matrix adjust.
 Identities = 167/352 (47%), Positives = 226/352 (64%), Gaps = 22/352 (6%)

Query: 1   MAMILLENKLVLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRF 60
           MA IL    L+L  ++ L +    S  R+ N   M   +E W+ ++ +VY    EK  RF
Sbjct: 1   MASILYS--LILFGLITLSLSLDMSSGRS-NKEVMT-MYEKWLVKHQKVYYGLGEKNQRF 56

Query: 61  KIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRA------PRNGYKRRLPSVRSS 114
           +IFK+N+ +I   N  A N  Y++G+NEF+D TN+E+R         N  K ++ SVR +
Sbjct: 57  QIFKDNLIFIDEHN--APNHSYRVGLNEFSDITNKEYRDTYLSRWSNNNIKNKITSVRYA 114

Query: 115 ETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTS 174
                     N  +P S+DWR  GA+T +K+QG CG CWAFSAVAA+E IN I T  L S
Sbjct: 115 YKAG-----HNNKLPVSVDWR--GALTPIKNQGSCGACWAFSAVAAVEAINKIVTGSLVS 167

Query: 175 LSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSA 234
           LSEQELVDCD + +++GC GG   +A+ FI+ N GL ++  YPY     +CN+ + N   
Sbjct: 168 LSEQELVDCDRT-KNKGCNGGNQVNAYRFIVENGGLDSQIDYPYLGRQSTCNQAKKNTKV 226

Query: 235 AKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAV 294
             I+GY++V  N+E+ALM+AVANQPVSV I+A G DFQ Y SGVFTG CGT LDH V  V
Sbjct: 227 VSINGYKNVQRNSESALMEAVANQPVSVGIEAYGKDFQLYQSGVFTGSCGTSLDHAVVVV 286

Query: 295 GYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDI-DAKEGLCGIAMQASYPT 345
           GYG +++G  YWLVKNSWGT WGE GY++++R++ +   G CGIAM A+YPT
Sbjct: 287 GYG-SENGKDYWLVKNSWGTNWGERGYLKIERNLKNTNTGKCGIAMDATYPT 337


>gi|400180345|gb|AFP73311.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score =  318 bits (816), Expect = 2e-84,   Method: Compositional matrix adjust.
 Identities = 159/347 (45%), Positives = 226/347 (65%), Gaps = 9/347 (2%)

Query: 1   MAMILLENKLVLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRF 60
           MAM +    +++    V+ ++  Q+  R+    +++ERHE+WM+++GRVY+D  EK  RF
Sbjct: 1   MAMKVDLMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERF 60

Query: 61  KIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVS 120
            IFKEN+++I S N KA N  YKLG+NEFAD T++EF A   G       +  S  +   
Sbjct: 61  MIFKENMKFIESVN-KAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTE 119

Query: 121 FRYENAS---VPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSE 177
           F+  + S   +P+++DWR+ GAVT VK QG+CGCCWAFSAV ++EG   I T  L   SE
Sbjct: 120 FKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSE 179

Query: 178 QELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKI 237
           QEL+DC T+  + GC+GG M +AF+FII N G++ E+ Y Y     +C  +E   +A +I
Sbjct: 180 QELLDCTTN--NYGCDGGFMTNAFDFIIENGGISRESDYEYLGQQYTCRSQEKT-AAVQI 236

Query: 238 SGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYG 297
           S Y+ VP   E +L++AV  QPVS+ I AS  D QFY+ G + G C   ++H VTA+GYG
Sbjct: 237 SSYQVVPEG-ETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHAVTAIGYG 294

Query: 298 TADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
           T ++G KYWL+KNSWGT+WGENG++++ RD     GLC IA  +SYP
Sbjct: 295 TDENGQKYWLLKNSWGTSWGENGFMKIIRDYGNPAGLCDIAKMSSYP 341


>gi|400180419|gb|AFP73348.1| cysteine protease [Solanum lycopersicoides]
          Length = 343

 Score =  318 bits (815), Expect = 3e-84,   Method: Compositional matrix adjust.
 Identities = 162/348 (46%), Positives = 228/348 (65%), Gaps = 12/348 (3%)

Query: 1   MAMILLENKLVLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRF 60
           MAM +    +++    V+ ++  Q+ +R+    +++ERHE+WM+++GRVY+D  EK  RF
Sbjct: 1   MAMKIDLMSILITLFFVISMFNSQTTARSQPKLSVSERHELWMSRHGRVYKDEVEKGERF 60

Query: 61  KIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRS-SETTDV 119
            IFKEN+++I S N KA N  YKLGINEFAD T+EEF     G    +PS  S S  +  
Sbjct: 61  MIFKENMKFIESVN-KAGNLSYKLGINEFADITSEEFLTKFTGI--NIPSYLSPSPMSST 117

Query: 120 SFRYENAS---VPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLS 176
            F+  + S   +P+++DWR+ GAVT VK+QGQCGCCWAFSAV ++EG   I T  L   S
Sbjct: 118 EFKINDLSDDDMPSNLDWRESGAVTQVKNQGQCGCCWAFSAVGSLEGAYKIATGNLMEFS 177

Query: 177 EQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAK 236
           EQEL+DC T+  + GC GG M +AF+FI  N G+++E+ Y Y+    +C  +E   +A +
Sbjct: 178 EQELLDCTTN--NYGCNGGFMTNAFDFIKENGGISSESDYEYQGQQYTCRSQEKT-AAVQ 234

Query: 237 ISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGY 296
           IS Y+ VP   E +L++AV  QPVS+ I AS  D QFY+ G + G C   ++H VTA+GY
Sbjct: 235 ISSYQVVPEG-ETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHAVTAIGY 292

Query: 297 GTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
           GT + G KYWL+KNSWGT+WGENG++++ RD     G C IA  +SYP
Sbjct: 293 GTDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGNPGGHCDIAKMSSYP 340


>gi|400180426|gb|AFP73351.1| cysteine protease [Solanum corneliomuelleri]
          Length = 344

 Score =  318 bits (815), Expect = 3e-84,   Method: Compositional matrix adjust.
 Identities = 159/347 (45%), Positives = 225/347 (64%), Gaps = 9/347 (2%)

Query: 1   MAMILLENKLVLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRF 60
           MAM +    +++    V+ ++  Q+  R+    +++ERHE+WM+++GRVY+D  EK  RF
Sbjct: 1   MAMKVDLMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERF 60

Query: 61  KIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVS 120
            IFKEN+++I S N KA N  YKLG+NEFAD T++EF A   G       +  S  +   
Sbjct: 61  MIFKENMKFIESVN-KAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTE 119

Query: 121 FRYENAS---VPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSE 177
           F+  + S   +P+++DWR+ GAVT VK QG+CGCCWAFSAV ++EG   I T  L   SE
Sbjct: 120 FKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSE 179

Query: 178 QELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKI 237
           QEL+DC T+  + GC GG M +AF+FII N G++ E+ Y Y     +C  +E   +A +I
Sbjct: 180 QELLDCTTN--NYGCNGGFMTNAFDFIIENGGISRESDYEYLGQQYTCRSQEKT-AAVQI 236

Query: 238 SGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYG 297
           S Y+ VP   E +L++AV  QPVS+ I AS  D QFY+ G + G C   ++H VTA+GYG
Sbjct: 237 SSYQVVPEG-ETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHAVTAIGYG 294

Query: 298 TADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
           T ++G KYWL+KNSWGT+WGENG++++ RD     GLC IA  +SYP
Sbjct: 295 TDENGQKYWLLKNSWGTSWGENGFMKIIRDSGNPSGLCDIAKMSSYP 341


>gi|115468686|ref|NP_001057942.1| Os06g0582600 [Oryza sativa Japonica Group]
 gi|55296512|dbj|BAD68726.1| putative cysteine proteinase [Oryza sativa Japonica Group]
 gi|113595982|dbj|BAF19856.1| Os06g0582600 [Oryza sativa Japonica Group]
 gi|215695236|dbj|BAG90427.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 357

 Score =  318 bits (814), Expect = 3e-84,   Method: Compositional matrix adjust.
 Identities = 163/324 (50%), Positives = 213/324 (65%), Gaps = 15/324 (4%)

Query: 31  NDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFA 90
           +D+ M ER+E W A +GR Y+D+ EK  RF++F+ N  +I SFN     K  +L  N+FA
Sbjct: 41  DDSAMRERYEKWAADHGRTYKDSLEKARRFEVFRTNALFIDSFNAAGGKKSPRLTTNKFA 100

Query: 91  DQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYEN---ASVPASIDWRKKGAVTGVKDQG 147
           D TNEEF A   G     P +  S      F Y N   + VPA+I+WR +GAVT VK+Q 
Sbjct: 101 DLTNEEF-AEYYGRPFSTPVIGGS-----GFMYGNVRTSDVPANINWRDRGAVTQVKNQK 154

Query: 148 QCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISN 207
            C  CWAFSAVAA+EGI+ I +  L +LS Q+L+DC T   + GC  G MD+AF +I SN
Sbjct: 155 DCASCWAFSAVAAVEGIHQIRSHNLVALSTQQLLDCSTGRNNHGCNRGDMDEAFRYITSN 214

Query: 208 KGLATEAKYPYK-ASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDA 266
            G+A E+ YPY+  + G+C +    P AA I G++ VP NNE AL+ AVA+QPVSVA+D 
Sbjct: 215 GGIAAESDYPYEDRALGTC-RASGKPVAASIRGFQYVPPNNETALLLAVAHQPVSVALDG 273

Query: 267 SGSDFQFYSSGVFTGQ----CGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYI 322
            G   QF+SSGVF       C T+L+H +TAVGYGT + GTKYWL+KNSWGT WGE GY+
Sbjct: 274 VGKVSQFFSSGVFGAMQNETCTTDLNHAMTAVGYGTDEHGTKYWLMKNSWGTDWGEGGYM 333

Query: 323 RMQRDIDAKEGLCGIAMQASYPTA 346
           ++ RD+ +  GLCG+AMQ SYP A
Sbjct: 334 KIARDVASNTGLCGLAMQPSYPVA 357


>gi|400180357|gb|AFP73317.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score =  318 bits (814), Expect = 3e-84,   Method: Compositional matrix adjust.
 Identities = 159/347 (45%), Positives = 225/347 (64%), Gaps = 9/347 (2%)

Query: 1   MAMILLENKLVLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRF 60
           MAM +    +++    V+ ++  Q+ +R+    +++ERHE+WM+++GRVY+D  EK  RF
Sbjct: 1   MAMKVDLMSILITLFFVISMFNSQTRARSQPKLSVSERHELWMSRHGRVYKDEVEKGERF 60

Query: 61  KIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVS 120
            IFKEN+++I S N KA N  YKLG+NEFAD T++EF A   G       +  S  +   
Sbjct: 61  MIFKENMKFIESVN-KAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTE 119

Query: 121 FRYENAS---VPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSE 177
           F+  + S   +P+++DWR+ GAVT VK QG+CGCCWAFSAV ++EG   I T  L   SE
Sbjct: 120 FKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSE 179

Query: 178 QELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKI 237
           QEL+DC T+  + GC GG M +AF+FII N G++ E+ Y Y     +C  +E   +A +I
Sbjct: 180 QELLDCTTN--NYGCNGGFMTNAFDFIIENGGISRESDYEYLGQQYTCRSQEKT-AAVQI 236

Query: 238 SGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYG 297
           S Y+ VP   E +L++AV  QPVS+ I AS  D QFY+ G + G C   ++H VTA+GYG
Sbjct: 237 SSYKVVPEG-ETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHAVTAIGYG 294

Query: 298 TADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
           T + G KYWL+KNSWGT+WGENG++++ RD     GLC IA  +SYP
Sbjct: 295 TDEKGQKYWLLKNSWGTSWGENGFMKIIRDYGNPSGLCDIAKMSSYP 341


>gi|400180347|gb|AFP73312.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score =  318 bits (814), Expect = 3e-84,   Method: Compositional matrix adjust.
 Identities = 159/347 (45%), Positives = 225/347 (64%), Gaps = 9/347 (2%)

Query: 1   MAMILLENKLVLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRF 60
           MAM +    +++    V+ ++  Q+  R+    +++ERHE+WM+++GRVY+D  EK  RF
Sbjct: 1   MAMKVDLMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERF 60

Query: 61  KIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVS 120
            IFKEN+++I S N KA N  YKLG+NEFAD T++EF A   G       +  S  +   
Sbjct: 61  MIFKENMKFIESVN-KAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTE 119

Query: 121 FRYENAS---VPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSE 177
           F+  + S   +P+++DWR+ GAVT VK QG+CGCCWAFSAV ++EG   I T  L   SE
Sbjct: 120 FKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSE 179

Query: 178 QELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKI 237
           QEL+DC T+  + GC+GG M +AF+FII N G++ E+ Y Y     +C  +E   +A +I
Sbjct: 180 QELLDCTTN--NYGCDGGFMTNAFDFIIENGGISRESDYEYLGQQYTCRSQEKT-AAVQI 236

Query: 238 SGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYG 297
           S Y+ VP   E +L++AV  QPVS+ I AS  D QFY+ G + G C   ++H VTA+GYG
Sbjct: 237 SSYQVVPEG-ETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHAVTAIGYG 294

Query: 298 TADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
           T + G KYWL+KNSWGT+WGENG++++ RD     GLC IA  +SYP
Sbjct: 295 TDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGNPSGLCDIAKMSSYP 341


>gi|2463588|dbj|BAA22546.1| FB1035 precursor [Ananas comosus]
          Length = 324

 Score =  318 bits (814), Expect = 3e-84,   Method: Compositional matrix adjust.
 Identities = 162/316 (51%), Positives = 211/316 (66%), Gaps = 11/316 (3%)

Query: 31  NDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFA 90
           ND  M +R E WMA+YGR+Y+DN EK  RF+IFK NV++I +FN++  N  Y LGIN+F 
Sbjct: 3   NDPMM-KRFEEWMAEYGRIYKDNDEKMRRFQIFKNNVKHIETFNSRNGNS-YTLGINQFT 60

Query: 91  DQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENAS-VPASIDWRKKGAVTGVKDQGQC 149
           D T  EF A   G    L   R      VSF   N S VP SIDWR  GAV  VK+Q  C
Sbjct: 61  DMTKSEFVAQYTGVSLPLNIEREPV---VSFDDVNISAVPQSIDWRDYGAVNEVKNQNPC 117

Query: 150 GCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKG 209
           G CWAF+A+A +EGI  I T  L SLSEQE++DC  S    GC+GG ++ A++FIISN G
Sbjct: 118 GSCWAFAAIATVEGIYKIKTGYLVSLSEQEVLDCAVS---YGCKGGWVNKAYDFIISNNG 174

Query: 210 LATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGS 269
           + TE  YPY+A  G+CN     P++A I+GY  V  N+E ++M AV+NQP++  IDAS  
Sbjct: 175 VTTEENYPYQAYQGTCNANSF-PNSAYITGYSYVRRNDERSMMYAVSNQPIAALIDAS-E 232

Query: 270 DFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDID 329
           +FQ+Y+ GVF+G CGT L+H +T +GYG    GTKYW+V+NSWG++WGE GY+RM R + 
Sbjct: 233 NFQYYNGGVFSGPCGTSLNHAITIIGYGQDSSGTKYWIVRNSWGSSWGEGGYVRMARGVS 292

Query: 330 AKEGLCGIAMQASYPT 345
           +  G CGIAM   +PT
Sbjct: 293 SSSGACGIAMSPLFPT 308


>gi|400180453|gb|AFP73363.1| cysteine protease [Solanum chilense]
          Length = 344

 Score =  318 bits (814), Expect = 3e-84,   Method: Compositional matrix adjust.
 Identities = 159/347 (45%), Positives = 225/347 (64%), Gaps = 9/347 (2%)

Query: 1   MAMILLENKLVLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRF 60
           MAM +    +++    V+ ++  Q+ +R+    +++ERHE+WM+++GRVY+D  EK  RF
Sbjct: 1   MAMKVDLMNILITLFFVISMFNTQTRARSQPKLSVSERHELWMSRHGRVYKDEVEKGERF 60

Query: 61  KIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVS 120
            IFKEN+++I S N KA N  YKLG+NEFAD T++EF A   G       +  S  +   
Sbjct: 61  MIFKENMKFIESVN-KAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPVSSTE 119

Query: 121 FRYENAS---VPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSE 177
           F+  + S   +P+++DWR+ GAVT VK QG+CGCCWAFSAV ++EG   I T  L   SE
Sbjct: 120 FKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSE 179

Query: 178 QELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKI 237
           QEL+DC T+  + GC GG M +AF+FII N G++ E+ Y Y     +C  +E   +A +I
Sbjct: 180 QELLDCTTN--NYGCNGGFMTNAFDFIIENGGISRESDYEYLGEQYTCRSQEKT-AAVQI 236

Query: 238 SGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYG 297
           S Y+ VP   E +L++AV  QPVS+ I AS  D QFY+ G + G C   ++H VTA+GYG
Sbjct: 237 SSYQVVPEG-ETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHAVTAIGYG 294

Query: 298 TADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
           T + G KYWL+KNSWGT+WGENG++++ RD     GLC IA  +SYP
Sbjct: 295 TDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGNPSGLCDIAKMSSYP 341


>gi|193806686|sp|A5HII1.1|ACTN_ACTDE RecName: Full=Actinidain; Short=Actinidin; AltName: Full=Allergen
           Act d 1; AltName: Allergen=Act d 1; Flags: Precursor
 gi|146215974|gb|ABQ10189.1| actinidin Act1a [Actinidia deliciosa]
          Length = 380

 Score =  317 bits (813), Expect = 4e-84,   Method: Compositional matrix adjust.
 Identities = 164/340 (48%), Positives = 215/340 (63%), Gaps = 14/340 (4%)

Query: 10  LVLAAILVLGV-WAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVE 68
           L  + +L+L + +  ++ ++  ND  +   +E W+ +YG+ Y    E E RF+IFKE + 
Sbjct: 13  LFFSTLLILSLAFNAKNLTQRTNDE-VKAMYESWLIKYGKSYNSLGEWERRFEIFKETLR 71

Query: 69  YIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYE---N 125
           +I   +N   N+ YK+G+N+FAD T+EEFR+   G+         S  T VS RYE    
Sbjct: 72  FIDE-HNADTNRSYKVGLNQFADLTDEEFRSTYLGF------TSGSNKTKVSNRYEPRVG 124

Query: 126 ASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDT 185
             +P+ +DWR  GAV  +K QG+CG CWAFSA+A +EGIN I T  L SLSEQEL+DC  
Sbjct: 125 QVLPSYVDWRSAGAVVDIKSQGECGGCWAFSAIATVEGINKIVTGVLISLSEQELIDCGR 184

Query: 186 SGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPS 245
           +   +GC GG + D F+FII+N G+ TE  YPY A DG CN    N     I  YE+VP 
Sbjct: 185 TQNTRGCNGGYITDGFQFIINNGGINTEENYPYTAQDGECNLDLQNEKYVTIDTYENVPY 244

Query: 246 NNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKY 305
           NNE AL  AV  QPVSVA+DA+G  F+ YSSG+FTG CGT +DH VT VGYGT + G  Y
Sbjct: 245 NNEWALQTAVTYQPVSVALDAAGDAFKHYSSGIFTGPCGTAIDHAVTIVGYGT-EGGIDY 303

Query: 306 WLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPT 345
           W+VKNSW TTWGE GY+R+ R++    G CGIA   SYP 
Sbjct: 304 WIVKNSWDTTWGEEGYMRILRNVGGA-GTCGIATMPSYPV 342


>gi|15984|emb|CAA34486.1| unnamed protein product [Actinidia deliciosa]
          Length = 380

 Score =  317 bits (813), Expect = 4e-84,   Method: Compositional matrix adjust.
 Identities = 164/340 (48%), Positives = 215/340 (63%), Gaps = 14/340 (4%)

Query: 10  LVLAAILVLGV-WAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVE 68
           L  + +L+L + +  ++ ++  ND  +   +E W+ +YG+ Y    E E RF+IFKE + 
Sbjct: 13  LFFSTLLILSLAFNAKNLTQRTNDE-VKAMYESWLIKYGKSYNSLGEWERRFEIFKETLR 71

Query: 69  YIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYE---N 125
           +I   +N   N+ YK+G+N+FAD T+EEFR+   G+         S  T VS RYE    
Sbjct: 72  FIDE-HNADTNRSYKVGLNQFADLTDEEFRSTYLGF------TSGSNKTKVSNRYEPRFG 124

Query: 126 ASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDT 185
             +P+ +DWR  GAV  +K QG+CG CWAFSA+A +EGIN I T  L SLSEQEL+DC  
Sbjct: 125 QVLPSYVDWRSAGAVVDIKSQGECGGCWAFSAIATVEGINKIVTGVLISLSEQELIDCGR 184

Query: 186 SGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPS 245
           +   +GC GG + D F+FII+N G+ TE  YPY A DG CN    N     I  YE+VP 
Sbjct: 185 TQNTRGCNGGYITDGFQFIINNGGINTEENYPYTAQDGECNLDLQNEKYVTIDTYENVPY 244

Query: 246 NNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKY 305
           NNE AL  AV  QPVSVA+DA+G  F+ YSSG+FTG CGT +DH VT VGYGT + G  Y
Sbjct: 245 NNEWALQTAVTYQPVSVALDAAGDAFKHYSSGIFTGPCGTAIDHAVTIVGYGT-EGGIDY 303

Query: 306 WLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPT 345
           W+VKNSW TTWGE GY+R+ R++    G CGIA   SYP 
Sbjct: 304 WIVKNSWDTTWGEEGYMRILRNVGGA-GTCGIATMPSYPV 342


>gi|400180428|gb|AFP73352.1| cysteine protease [Solanum corneliomuelleri]
          Length = 344

 Score =  317 bits (813), Expect = 4e-84,   Method: Compositional matrix adjust.
 Identities = 159/347 (45%), Positives = 225/347 (64%), Gaps = 9/347 (2%)

Query: 1   MAMILLENKLVLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRF 60
           MAM +    +++    V+ ++  Q+ +R+    +++ERHE+WM+++GRVY+D  EK  RF
Sbjct: 1   MAMKVDLMNILITLFFVISMFNTQTRARSQPKLSVSERHELWMSRHGRVYKDEVEKGERF 60

Query: 61  KIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVS 120
            IFKEN+++I S N KA N  YKLG+NEFAD T++EF A   G       +  S  +   
Sbjct: 61  MIFKENMKFIESVN-KAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTE 119

Query: 121 FRYENAS---VPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSE 177
           F+  + S   +P+++DWR+ GAVT VK QG+CGCCWAFSAV ++EG   I T  L   SE
Sbjct: 120 FKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSE 179

Query: 178 QELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKI 237
           QEL+DC T+  + GC GG M +AF+FII N G++ E+ Y Y     +C  +E   +A +I
Sbjct: 180 QELLDCTTN--NYGCNGGFMTNAFDFIIENGGISRESDYEYLGEQYTCRSQEKT-AAVQI 236

Query: 238 SGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYG 297
           S Y+ VP   E +L++AV  QPVS+ I AS  D QFY+ G + G C   ++H VTA+GYG
Sbjct: 237 SSYQVVPEG-ETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHAVTAIGYG 294

Query: 298 TADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
           T + G KYWL+KNSWGT+WGENG++++ RD     GLC IA  +SYP
Sbjct: 295 TDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGNPSGLCDIAKMSSYP 341


>gi|2144501|pir||TAGB actinidain (EC 3.4.22.14) precursor - kiwi fruit
 gi|166317|gb|AAA32629.1| actinidin [Actinidia deliciosa]
          Length = 380

 Score =  317 bits (813), Expect = 4e-84,   Method: Compositional matrix adjust.
 Identities = 164/340 (48%), Positives = 216/340 (63%), Gaps = 14/340 (4%)

Query: 10  LVLAAILVLGV-WAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVE 68
           L  + +L+L + +  ++ ++  ND  +   +E W+ +YG+ Y    E E RF+IFKE + 
Sbjct: 13  LFFSTLLILSLAFNAKNLTQRTNDE-VKAMYESWLIKYGKSYNSLGEWERRFEIFKETLR 71

Query: 69  YIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYE---N 125
           +I   +N   N+ YK+G+N+FAD T+EEFR+   G+         S  T VS RYE    
Sbjct: 72  FIDE-HNADTNRSYKVGLNQFADLTDEEFRSTYLGF------TSGSNKTKVSNRYEPRVG 124

Query: 126 ASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDT 185
             +P+ +DWR  GAV  +K QG+CG CWAFSA+A +EGIN I T  L SLSEQEL+DC  
Sbjct: 125 QVLPSYVDWRSAGAVVDIKSQGECGGCWAFSAIATVEGINKIVTGVLISLSEQELIDCGR 184

Query: 186 SGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPS 245
           +   +GC GG + D F+FII+N G+ TE  YPY A DG CN +  N     I  YE+VP 
Sbjct: 185 TQNTRGCNGGYITDGFQFIINNGGINTEENYPYTAQDGECNVELQNEKYVTIDTYENVPY 244

Query: 246 NNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKY 305
           NNE AL  AV  QPVSVA+DA+G  F+ YSSG+FTG CGT +DH VT VGYGT + G  Y
Sbjct: 245 NNEWALQTAVTYQPVSVALDAAGDAFKQYSSGIFTGPCGTAIDHAVTIVGYGT-EGGIDY 303

Query: 306 WLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPT 345
           W+VKNSW TTWGE GY+R+ R++    G CGIA   SYP 
Sbjct: 304 WIVKNSWDTTWGEEGYMRILRNVGGA-GTCGIATMPSYPV 342


>gi|400180441|gb|AFP73357.1| cysteine protease [Solanum habrochaites]
          Length = 344

 Score =  317 bits (812), Expect = 5e-84,   Method: Compositional matrix adjust.
 Identities = 159/347 (45%), Positives = 225/347 (64%), Gaps = 9/347 (2%)

Query: 1   MAMILLENKLVLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRF 60
           MAM +    +++    V+ ++  Q+ +R+    +++ERHE+WM+++GRVY+D  EK  RF
Sbjct: 1   MAMKVDLMSILITLFFVISMFNSQTRARSQPKLSVSERHELWMSRHGRVYKDEVEKGERF 60

Query: 61  KIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVS 120
            IFKEN+++I S N KA N  YKLG+NEFAD T+EEF A   G       +  S  +   
Sbjct: 61  MIFKENMKFIESVN-KAGNLSYKLGMNEFADITSEEFLAKFTGLNIPNSYLSPSPMSSTE 119

Query: 121 FRYENAS---VPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSE 177
           F+  + S   +P+++DWR+ GAVT VK+QGQCGCCWAFSAV ++EG   I T  L   SE
Sbjct: 120 FKINDISDDDMPSNLDWRESGAVTQVKNQGQCGCCWAFSAVGSLEGAYKIATGNLMEFSE 179

Query: 178 QELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKI 237
           QEL+DC T+  + GC GG M +AF+FI  N G++ E+ Y Y     +C  +E   +A +I
Sbjct: 180 QELLDCTTN--NYGCNGGFMTNAFDFIRENGGISRESDYEYLGQQYTCRSQEKT-AAVQI 236

Query: 238 SGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYG 297
           S Y+ VP   E +L++AV  QPVS+ I AS  D QFY+ G + G C   ++H VTA+GYG
Sbjct: 237 SSYQVVPEG-ETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCANRINHAVTAIGYG 294

Query: 298 TADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
           T ++G KYWL+KNSWGT+WGE G++++ RD     GLC IA  +SYP
Sbjct: 295 TDENGQKYWLLKNSWGTSWGEKGFMKIIRDYGNPSGLCDIAKLSSYP 341


>gi|312451836|gb|ADQ85985.1| actinidin [Actinidia chinensis]
          Length = 380

 Score =  317 bits (812), Expect = 5e-84,   Method: Compositional matrix adjust.
 Identities = 164/339 (48%), Positives = 215/339 (63%), Gaps = 14/339 (4%)

Query: 10  LVLAAILVLGV-WAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVE 68
           L  + +L+L + +  ++ ++  ND  +   +E W+ +YG+ Y    E E RF+IFKE + 
Sbjct: 13  LFFSTLLILSLAFNTKNLTQRTNDE-VKAMYESWLIKYGKSYNSLGEWERRFEIFKETLR 71

Query: 69  YIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYE---N 125
           +I   +N   N+ YK+G+N+FAD T+EEFR+   G+         S  T VS RYE    
Sbjct: 72  FIDE-HNADTNRSYKVGLNQFADLTDEEFRSTYLGF------TSGSNKTKVSNRYEPRVG 124

Query: 126 ASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDT 185
             +P+ +DWR  GAV  +K QG+CG CWAFSA+A +EGIN I T  L SLSEQEL+DC  
Sbjct: 125 QVLPSYVDWRSAGAVVDIKSQGECGGCWAFSAIATVEGINKIVTGVLISLSEQELIDCGR 184

Query: 186 SGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPS 245
           +   +GC GG + D F+FII+N G+ TE  YPY A DG CN    N     I  YE+VP 
Sbjct: 185 TQNTRGCNGGYITDGFQFIINNGGINTEENYPYTAQDGECNVDLQNEKYVTIDTYENVPY 244

Query: 246 NNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKY 305
           NNE AL  AV  QPVSVA+DA+G  F+ YSSG+FTG CGT +DH VT VGYGT + G  Y
Sbjct: 245 NNEWALQTAVTYQPVSVALDAAGDAFKQYSSGIFTGPCGTAIDHAVTIVGYGT-EGGIDY 303

Query: 306 WLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
           W+VKNSW TTWGE GY+R+ R++    G CGIA   SYP
Sbjct: 304 WIVKNSWDTTWGEEGYMRILRNVGGA-GTCGIATMPSYP 341


>gi|400180375|gb|AFP73326.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score =  317 bits (812), Expect = 5e-84,   Method: Compositional matrix adjust.
 Identities = 158/347 (45%), Positives = 226/347 (65%), Gaps = 9/347 (2%)

Query: 1   MAMILLENKLVLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRF 60
           MAM +    +++    V+ ++  Q+ +R+    +++ERHE+WM+++GRVY+D  EK  RF
Sbjct: 1   MAMKVDLMSILITLFFVISMFNSQTRARSQPKLSVSERHELWMSRHGRVYKDEVEKGERF 60

Query: 61  KIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVS 120
            IFKEN+++I S N KA N  YKLG+NEFAD T++EF A   G       +  S  +   
Sbjct: 61  MIFKENIKFIESVN-KAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTE 119

Query: 121 FRYENAS---VPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSE 177
           F+  + S   +P+++DWR+ GAVT VK QG+CGCCWAFSAV ++EG   I T  L   SE
Sbjct: 120 FKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSE 179

Query: 178 QELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKI 237
           QEL+DC T+  + GC+GG M +AF+FI  N G+++E+ Y Y     +C  +E   +A +I
Sbjct: 180 QELLDCTTN--NYGCDGGFMTNAFDFIKENGGISSESDYEYLGEQYTCRSQEKT-AAVQI 236

Query: 238 SGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYG 297
           S Y+ VP   E +L++AV  QPVS+ I AS  D QFY+ G + G C   ++H VTA+GYG
Sbjct: 237 SSYQVVP-EGETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHAVTAIGYG 294

Query: 298 TADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
           T + G KYWL+KNSWGT+WGENG++++ RD     GLC IA  +SYP
Sbjct: 295 TDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGNPAGLCDIAKMSSYP 341


>gi|400180355|gb|AFP73316.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score =  317 bits (812), Expect = 6e-84,   Method: Compositional matrix adjust.
 Identities = 159/347 (45%), Positives = 224/347 (64%), Gaps = 9/347 (2%)

Query: 1   MAMILLENKLVLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRF 60
           MAM +    +++    V+ ++  Q+  R+    +++ERHE+WM+++GRVY+D  EK  RF
Sbjct: 1   MAMKVDLMSILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERF 60

Query: 61  KIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVS 120
            IFKEN+++I S N KA N  YKLG+NEFAD T++EF A   G       +  S  +   
Sbjct: 61  MIFKENMKFIESVN-KAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTE 119

Query: 121 FRYENAS---VPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSE 177
           F+  + S   +P+++DWR+ GAVT VK QG+CGCCWAFSAV ++EG   I T  L   SE
Sbjct: 120 FKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSE 179

Query: 178 QELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKI 237
           QEL+DC T+  + GC GG M +AF+FII N G++ E+ Y Y     +C  +E   +A +I
Sbjct: 180 QELLDCTTN--NYGCNGGFMTNAFDFIIENGGISRESDYEYLGEQYTCRSQEKT-AAVQI 236

Query: 238 SGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYG 297
           S Y+ VP   E +L++AV  QPVS+ I AS  D QFY+ G + G C   ++H VTA+GYG
Sbjct: 237 SSYQVVPEG-ETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHAVTAIGYG 294

Query: 298 TADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
           T + G KYWL+KNSWGT+WGENG++++ RD     GLC IA  +SYP
Sbjct: 295 TDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGNPSGLCDIAKMSSYP 341


>gi|400180367|gb|AFP73322.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score =  317 bits (812), Expect = 6e-84,   Method: Compositional matrix adjust.
 Identities = 159/347 (45%), Positives = 224/347 (64%), Gaps = 9/347 (2%)

Query: 1   MAMILLENKLVLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRF 60
           MAM +    +++    V+ ++  Q+  R+    +++ERHE+WM+++GRVY+D  EK  RF
Sbjct: 1   MAMKVDLMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKVERF 60

Query: 61  KIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVS 120
            IFKEN+++I S N KA N  YKLG+NEFAD T++EF A   G       +  S  +   
Sbjct: 61  MIFKENMKFIESVN-KAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTE 119

Query: 121 FRYENAS---VPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSE 177
           F+  + S   +P+++DWR+ GAVT VK QG+CGCCWAFSAV ++EG   I T  L   SE
Sbjct: 120 FKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSE 179

Query: 178 QELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKI 237
           QEL+DC T+  + GC GG M +AF+FII N G++ E+ Y Y     +C  +E   +A +I
Sbjct: 180 QELLDCTTN--NYGCNGGFMTNAFDFIIENGGISRESDYEYLGEQYTCRSQEKT-AAVQI 236

Query: 238 SGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYG 297
           S Y+ VP   E +L++AV  QPVS+ I AS  D QFY+ G + G C   ++H VTA+GYG
Sbjct: 237 SSYKVVPEG-ETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHAVTAIGYG 294

Query: 298 TADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
           T + G KYWL+KNSWGT+WGENG++++ RD     GLC IA  +SYP
Sbjct: 295 TDEKGQKYWLLKNSWGTSWGENGFMKIIRDYGNPAGLCDIAKMSSYP 341


>gi|414589857|tpg|DAA40428.1| TPA: Vignain [Zea mays]
          Length = 377

 Score =  317 bits (812), Expect = 6e-84,   Method: Compositional matrix adjust.
 Identities = 165/331 (49%), Positives = 207/331 (62%), Gaps = 24/331 (7%)

Query: 35  MNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTN 94
           M ER E WM ++GR+Y D  EK+ R ++++ NVE + +FN+      Y+L  N+FAD TN
Sbjct: 50  MLERFEQWMGRHGRLYADAGEKQRRLEVYRRNVELVETFNSMGNG--YRLADNKFADLTN 107

Query: 95  EEFRAPRNGYKR----------RLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVK 144
           EEFRA   G+ R            PS  +   + +  R   + +P S+DWR+KGAV  VK
Sbjct: 108 EEFRAKMLGFGRPRSGGGAGHSTAPSTVACIGSGLMGRQGYSDLPKSVDWREKGAVAPVK 167

Query: 145 DQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFI 204
            QG CG CWAFSAVAA+EGIN I   KL SLSEQELVDCDT  +  GC GG M  AFEF+
Sbjct: 168 SQGDCGSCWAFSAVAAIEGINQIKNGKLVSLSEQELVDCDT--KAIGCAGGYMSWAFEFV 225

Query: 205 ISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAI 264
           + N+GL TE  YPY+  +G+C   +   SA  ISGY +V  ++E  L++A A QPVSVA+
Sbjct: 226 MKNRGLTTERNYPYQGLNGACQTPKLKESAVSISGYMNVTPSSEPDLLRAAAAQPVSVAV 285

Query: 265 DASGSDFQFYSSGVFTGQCGTELDHGVTAVGYG-----TADDGT-----KYWLVKNSWGT 314
           DA    +Q Y  GVFTG C  EL+HGVT VGYG     T  DG+     KYW+VKNSWG 
Sbjct: 286 DAGSFVWQLYGGGVFTGPCTAELNHGVTVVGYGETQGDTDGDGSGVPGKKYWIVKNSWGP 345

Query: 315 TWGENGYIRMQRDIDAKEGLCGIAMQASYPT 345
            WG+ GYI MQR+     GLCGIAM  SYP 
Sbjct: 346 EWGDAGYILMQREASVASGLCGIAMLPSYPV 376


>gi|226507844|ref|NP_001148894.1| LOC100282514 precursor [Zea mays]
 gi|194703250|gb|ACF85709.1| unknown [Zea mays]
 gi|195622994|gb|ACG33327.1| vignain precursor [Zea mays]
          Length = 356

 Score =  317 bits (812), Expect = 6e-84,   Method: Compositional matrix adjust.
 Identities = 165/331 (49%), Positives = 207/331 (62%), Gaps = 24/331 (7%)

Query: 35  MNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTN 94
           M ER E WM ++GR+Y D  EK+ R ++++ NVE + +FN+      Y+L  N+FAD TN
Sbjct: 29  MLERFEQWMGRHGRLYADAGEKQRRLEVYRRNVELVETFNSMGNG--YRLADNKFADLTN 86

Query: 95  EEFRAPRNGYKR----------RLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVK 144
           EEFRA   G+ R            PS  +   + +  R   + +P S+DWR+KGAV  VK
Sbjct: 87  EEFRAKMLGFGRPRSGGGAGHSTAPSTVACIGSGLMGRQGYSDLPKSVDWREKGAVAPVK 146

Query: 145 DQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFI 204
            QG CG CWAFSAVAA+EGIN I   KL SLSEQELVDCDT  +  GC GG M  AFEF+
Sbjct: 147 SQGDCGSCWAFSAVAAIEGINQIKNGKLVSLSEQELVDCDT--KAIGCAGGYMSWAFEFV 204

Query: 205 ISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAI 264
           + N+GL TE  YPY+  +G+C   +   SA  ISGY +V  ++E  L++A A QPVSVA+
Sbjct: 205 MKNRGLTTERNYPYQGLNGACQTPKLKESAVSISGYMNVTPSSEPDLLRAAAAQPVSVAV 264

Query: 265 DASGSDFQFYSSGVFTGQCGTELDHGVTAVGYG-----TADDGT-----KYWLVKNSWGT 314
           DA    +Q Y  GVFTG C  EL+HGVT VGYG     T  DG+     KYW+VKNSWG 
Sbjct: 265 DAGSFVWQLYGGGVFTGPCTAELNHGVTVVGYGETQGDTDGDGSGVPGKKYWIVKNSWGP 324

Query: 315 TWGENGYIRMQRDIDAKEGLCGIAMQASYPT 345
            WG+ GYI MQR+     GLCGIAM  SYP 
Sbjct: 325 EWGDAGYILMQREASVASGLCGIAMLPSYPV 355


>gi|20334377|gb|AAM19209.1|AF493234_1 cysteine protease [Solanum lycopersicum]
 gi|400180431|gb|AFP73353.1| cysteine protease [Solanum lycopersicum]
          Length = 345

 Score =  317 bits (811), Expect = 6e-84,   Method: Compositional matrix adjust.
 Identities = 160/348 (45%), Positives = 224/348 (64%), Gaps = 10/348 (2%)

Query: 1   MAMILLENKLVLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRF 60
           MAM +    +++    V+ ++  Q+  R+    +++ERHE+WM+++GRVY+D  EK  RF
Sbjct: 1   MAMKVDLMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERF 60

Query: 61  KIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVS 120
            IFKEN+++I S N KA N  YKLG+NEFAD T++EF A   G       +  S  +   
Sbjct: 61  MIFKENMKFIESVN-KAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTE 119

Query: 121 FRYENA----SVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLS 176
           F+  N      +P+++DWR+ GAVT VK QG+CGCCWAFSAV ++EG   I T  L   S
Sbjct: 120 FKKINDLSDDYMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFS 179

Query: 177 EQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAK 236
           EQEL+DC T+  + GC GG M +AF+FII N G++ E+ Y Y     +C  +E   +A +
Sbjct: 180 EQELLDCTTN--NYGCNGGFMTNAFDFIIENGGISRESDYEYLGQQYTCRSQEKT-AAVQ 236

Query: 237 ISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGY 296
           IS Y+ VP   E +L++AV  QPVS+ I AS  D QFY+ G + G C   ++H VTA+GY
Sbjct: 237 ISSYQVVPEG-ETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGNCADRINHAVTAIGY 294

Query: 297 GTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
           GT ++G KYWL+KNSWGT+WGENGY+++ RD     GLC IA  +SYP
Sbjct: 295 GTDEEGQKYWLLKNSWGTSWGENGYMKIIRDSGDPSGLCDIAKMSSYP 342


>gi|400180365|gb|AFP73321.1| cysteine protease [Solanum peruvianum]
 gi|400180395|gb|AFP73336.1| cysteine protease [Solanum peruvianum]
 gi|400180405|gb|AFP73341.1| cysteine protease [Solanum peruvianum]
 gi|400180409|gb|AFP73343.1| cysteine protease [Solanum peruvianum]
 gi|400180411|gb|AFP73344.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score =  317 bits (811), Expect = 7e-84,   Method: Compositional matrix adjust.
 Identities = 159/347 (45%), Positives = 224/347 (64%), Gaps = 9/347 (2%)

Query: 1   MAMILLENKLVLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRF 60
           MAM +    +++    V+ ++  Q+  R+    +++ERHE+WM+++GRVY+D  EK  RF
Sbjct: 1   MAMKVDLMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERF 60

Query: 61  KIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVS 120
            IFKEN+++I S N KA N  YKLG+NEFAD T++EF A   G       +  S  +   
Sbjct: 61  MIFKENMKFIESVN-KAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTE 119

Query: 121 FRYENAS---VPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSE 177
           F+  + S   +P+++DWR+ GAVT VK QG+CGCCWAFSAV ++EG   I T  L   SE
Sbjct: 120 FKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSE 179

Query: 178 QELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKI 237
           QEL+DC T+  + GC GG M +AF+FII N G++ E+ Y Y     +C  +E   +A +I
Sbjct: 180 QELLDCTTN--NYGCNGGFMTNAFDFIIENGGISRESDYEYLGQQYTCRSQEKT-AAVQI 236

Query: 238 SGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYG 297
           S Y+ VP   E +L++AV  QPVS+ I AS  D QFY+ G + G C   ++H VTA+GYG
Sbjct: 237 SSYKVVPEG-ETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHAVTAIGYG 294

Query: 298 TADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
           T + G KYWL+KNSWGT+WGENG++++ RD     GLC IA  +SYP
Sbjct: 295 TDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGNPSGLCDIAKMSSYP 341


>gi|400180451|gb|AFP73362.1| cysteine protease [Solanum chilense]
          Length = 344

 Score =  317 bits (811), Expect = 7e-84,   Method: Compositional matrix adjust.
 Identities = 158/347 (45%), Positives = 226/347 (65%), Gaps = 9/347 (2%)

Query: 1   MAMILLENKLVLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRF 60
           MAM +    +++    V+ ++  Q+  R+  + +++ERHE+WM+++GRVY+D  EK  RF
Sbjct: 1   MAMKVDLMNILITLFFVISMFNTQTRGRSQPELSVSERHELWMSRHGRVYKDEVEKGERF 60

Query: 61  KIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVS 120
            IFKEN+++I S N KA N  YKLG+NEFAD T++EF A   G       +  S  +   
Sbjct: 61  MIFKENMKFIESVN-KAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTE 119

Query: 121 FRYENAS---VPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSE 177
           F+  + S   +P+++DWR+ GAVT VK QG+CGCCWAFSAV ++EG   I T  L   SE
Sbjct: 120 FKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSE 179

Query: 178 QELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKI 237
           QEL+DC T+  + GC+GG M +AF+FI  N G+++E+ Y Y     +C  +E   +A +I
Sbjct: 180 QELLDCTTN--NYGCDGGFMTNAFDFIKENGGISSESDYEYLGQQYTCRSQEKT-AAVQI 236

Query: 238 SGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYG 297
           S Y+ VP   E +L++AV  QPVS+ I AS  D QFY+ G + G C   ++H VTA+GYG
Sbjct: 237 SSYQVVP-EGETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHAVTAIGYG 294

Query: 298 TADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
           T + G KYWL+KNSWGT+WGENG++++ RD     GLC IA  +SYP
Sbjct: 295 TDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGDPSGLCDIAKMSSYP 341


>gi|400180445|gb|AFP73359.1| cysteine protease, partial [Solanum chilense]
          Length = 345

 Score =  317 bits (811), Expect = 7e-84,   Method: Compositional matrix adjust.
 Identities = 158/347 (45%), Positives = 226/347 (65%), Gaps = 9/347 (2%)

Query: 1   MAMILLENKLVLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRF 60
           MAM +    +++    V+ ++  Q+  R+  + +++ERHE+WM+++GRVY+D  EK  RF
Sbjct: 1   MAMKVDLMNILITLFFVISMFNTQTRGRSQPELSVSERHELWMSRHGRVYKDEVEKGERF 60

Query: 61  KIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVS 120
            IFKEN+++I S N KA N  YKLG+NEFAD T++EF A   G       +  S  +   
Sbjct: 61  MIFKENMKFIESVN-KAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTE 119

Query: 121 FRYENAS---VPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSE 177
           F+  + S   +P+++DWR+ GAVT VK QG+CGCCWAFSAV ++EG   I T  L   SE
Sbjct: 120 FKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSE 179

Query: 178 QELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKI 237
           QEL+DC T+  + GC+GG M +AF+FI  N G+++E+ Y Y     +C  +E   +A +I
Sbjct: 180 QELLDCTTN--NYGCDGGFMTNAFDFIKENGGISSESDYEYLGQQYTCRSQEKT-AAVQI 236

Query: 238 SGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYG 297
           S Y+ VP   E +L++AV  QPVS+ I AS  D QFY+ G + G C   ++H VTA+GYG
Sbjct: 237 SSYQVVPEG-ETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHAVTAIGYG 294

Query: 298 TADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
           T + G KYWL+KNSWGT+WGENG++++ RD     GLC IA  +SYP
Sbjct: 295 TDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGDPSGLCDIAKMSSYP 341


>gi|255538788|ref|XP_002510459.1| cysteine protease, putative [Ricinus communis]
 gi|223551160|gb|EEF52646.1| cysteine protease, putative [Ricinus communis]
          Length = 422

 Score =  317 bits (811), Expect = 8e-84,   Method: Compositional matrix adjust.
 Identities = 160/306 (52%), Positives = 197/306 (64%), Gaps = 4/306 (1%)

Query: 40  EMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRA 99
           E W  ++G+ Y    +K  RFKIF+EN E++   N++  N  Y L +N FAD T+ EF+A
Sbjct: 33  ESWTKEHGKTYTSKEDKLYRFKIFEENYEFVKKHNSQG-NSSYTLSLNAFADLTHHEFKA 91

Query: 100 PRNGYKRRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVA 159
            R G      S + S   +         VP SIDWRKKGAV+ VKDQG CG CW+FSA  
Sbjct: 92  SRLGLSAFSTSGKLSRR-NFPLHDFVGDVPISIDWRKKGAVSQVKDQGNCGACWSFSATG 150

Query: 160 AMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYK 219
           A+EGIN I T  L SLSEQELVDCD S  + GCEGGLMD A++F+I N G+ TE  YPY+
Sbjct: 151 AIEGINKIVTGSLVSLSEQELVDCDRS-YNNGCEGGLMDYAYQFVIENNGIDTEEDYPYQ 209

Query: 220 ASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVF 279
           A + +CNK++       I GY DVP NNE  L+KAVA QPVSV I  S   FQ YS G+F
Sbjct: 210 AREKTCNKEKLKRHVVTIDGYTDVPQNNEKELLKAVAAQPVSVGICGSERAFQLYSKGIF 269

Query: 280 TGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAM 339
           TG C T LDH V  VGYG+ ++G  YW+VKNSWGT WG NGY+ M R+    +GLCGI M
Sbjct: 270 TGPCSTSLDHAVLIVGYGS-ENGVDYWIVKNSWGTHWGINGYMYMLRNSGNSQGLCGINM 328

Query: 340 QASYPT 345
            AS+P 
Sbjct: 329 LASFPV 334


>gi|400180389|gb|AFP73333.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score =  316 bits (810), Expect = 8e-84,   Method: Compositional matrix adjust.
 Identities = 159/347 (45%), Positives = 224/347 (64%), Gaps = 9/347 (2%)

Query: 1   MAMILLENKLVLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRF 60
           MAM +    +++    V+ ++  Q+  R+    +++ERHE+WM+++GRVY+D  EK  RF
Sbjct: 1   MAMKVDLMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERF 60

Query: 61  KIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVS 120
            IFKEN+++I S N KA N  YKLG+NEFAD T++EF A   G       +  S  +   
Sbjct: 61  MIFKENMKFIESVN-KAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTE 119

Query: 121 FRYENAS---VPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSE 177
           F+  + S   +P+++DWR+ GAVT VK QG+CGCCWAFSAV ++EG   I T  L   SE
Sbjct: 120 FKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSE 179

Query: 178 QELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKI 237
           QEL+DC T+  + GC GG M +AF+FII N G++ E+ Y Y     +C  +E   +A +I
Sbjct: 180 QELLDCTTN--NYGCNGGFMTNAFDFIIENGGISRESDYEYLGEQYTCRSQEKT-AAVQI 236

Query: 238 SGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYG 297
           S Y+ VP   E +L++AV  QPVS+ I AS  D QFY+ G + G C   ++H VTA+GYG
Sbjct: 237 SSYKVVPEG-ETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHAVTAIGYG 294

Query: 298 TADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
           T + G KYWL+KNSWGT+WGENG++++ RD     GLC IA  +SYP
Sbjct: 295 TDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGNPSGLCDIAKMSSYP 341


>gi|18396939|ref|NP_564320.1| Papain family cysteine protease [Arabidopsis thaliana]
 gi|9502427|gb|AAF88126.1|AC021043_19 Putative cysteine proteinase [Arabidopsis thaliana]
 gi|67633400|gb|AAY78625.1| peptidase C1A papain family protein [Arabidopsis thaliana]
 gi|332192919|gb|AEE31040.1| Papain family cysteine protease [Arabidopsis thaliana]
          Length = 346

 Score =  316 bits (810), Expect = 8e-84,   Method: Compositional matrix adjust.
 Identities = 165/358 (46%), Positives = 230/358 (64%), Gaps = 26/358 (7%)

Query: 3   MILLENKLVLAAILVLGVWAPQSWSRT--LNDATMNERHEMWMAQYGRVYRDNAEKEMRF 60
           M  +E   V+  I  + +   ++ SR      +++ + H+ WM Q+ RVY D  EK++R 
Sbjct: 1   MDFVEFVCVVLTIFFMDLKISEATSRVALYKPSSIVDYHQQWMIQFSRVYDDEFEKQLRL 60

Query: 61  KIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVS 120
           ++  EN+++I SFNN   N+ YKLG+NEF D T EEF A   G       +R    T   
Sbjct: 61  QVLTENLKFIESFNNMG-NQSYKLGVNEFTDWTKEEFLATYTG-------LRGVNVTS-P 111

Query: 121 FRYENASVPA-----------SIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITT 169
           F   N + PA           + DWR +GAVT VK QG+CG CWAFSA+AA+EG+  I  
Sbjct: 112 FEVVNETKPAWNWTVSDVLGTNKDWRNEGAVTPVKSQGECGGCWAFSAIAAVEGLTKIAR 171

Query: 170 RKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKE 229
             L SLSEQ+L+DC T  ++ GC+GG   +AF +II ++G+++E +YPY+  +G C +  
Sbjct: 172 GNLISLSEQQLLDC-TREQNNGCKGGTFVNAFNYIIKHRGISSENEYPYQVKEGPC-RSN 229

Query: 230 ANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQ-CGTELD 288
           A P A  I G+E+VPSNNE AL++AV+ QPV+VAIDAS + F  YS GV+  + CGT ++
Sbjct: 230 ARP-AILIRGFENVPSNNERALLEAVSRQPVAVAIDASEAGFVHYSGGVYNARNCGTSVN 288

Query: 289 HGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPTA 346
           H VT VGYGT+ +G KYWL KNSWG TWGENGYIR++RD++  +G+CG+A  ASYP A
Sbjct: 289 HAVTLVGYGTSPEGMKYWLAKNSWGKTWGENGYIRIRRDVEWPQGMCGVAQYASYPVA 346


>gi|449447027|ref|XP_004141271.1| PREDICTED: cysteine proteinase RD21a-like [Cucumis sativus]
          Length = 458

 Score =  316 bits (810), Expect = 8e-84,   Method: Compositional matrix adjust.
 Identities = 173/340 (50%), Positives = 220/340 (64%), Gaps = 13/340 (3%)

Query: 10  LVLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRD-NAEKEMRFKIFKENVE 68
           L+    + L   +P S      D  +   ++ W A++G+++ +  AE E RF IFK+N++
Sbjct: 12  LLFFLFIALSAASPSSIIPQRTDDEVMALYDQWRAKHGKLHNNLGAEPENRFHIFKDNLK 71

Query: 69  YIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENASV 128
           +I   N  A+N PY+LG+N FAD TNEE+R+   G K    S R + T++         +
Sbjct: 72  FIDEIN--AQNLPYRLGLNVFADLTNEEYRSRYLGGKFASGS-RRNRTSNRYLPRLGDDL 128

Query: 129 PASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGE 188
           P SIDWR KGAV  VKDQG CG CWAFS VA++E IN I T  L +LSEQELVDCD S  
Sbjct: 129 PDSIDWRAKGAVAPVKDQGSCGSCWAFSTVASVEAINQIVTGDLIALSEQELVDCDRS-Y 187

Query: 189 DQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNE 248
           ++GC GGLMD AFEFII N GL TE  YPY   D SC + + N     I GYEDVP NNE
Sbjct: 188 NEGCNGGLMDYAFEFIIENGGLDTEEDYPYYGFDSSCIQYKKN----AIDGYEDVPVNNE 243

Query: 249 AALMKA---VANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKY 305
            AL KA        VSVAI+  G  FQ Y SG+FTG+CGT+LDHGV  VGYG+ + G  Y
Sbjct: 244 KALQKAVSKQVVSVVSVAIEGGGRSFQLYQSGIFTGRCGTDLDHGVNVVGYGS-EGGVDY 302

Query: 306 WLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPT 345
           W+V+NSWG +WGE+GY++MQR+I +  GLCGIAM+ SYPT
Sbjct: 303 WIVRNSWGGSWGESGYVKMQRNIASPTGLCGIAMEPSYPT 342


>gi|297809385|ref|XP_002872576.1| hypothetical protein ARALYDRAFT_489965 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297318413|gb|EFH48835.1| hypothetical protein ARALYDRAFT_489965 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 371

 Score =  316 bits (810), Expect = 8e-84,   Method: Compositional matrix adjust.
 Identities = 156/315 (49%), Positives = 212/315 (67%), Gaps = 7/315 (2%)

Query: 32  DATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFAD 91
           DA      E WM ++G+VY   AEKE R  IF++N+ +I   N  A N  Y+LG+N FAD
Sbjct: 49  DAEATLMFESWMVKHGKVYESVAEKERRLTIFEDNLRFIT--NRNAENLSYRLGLNRFAD 106

Query: 92  QTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENASV-PASIDWRKKGAVTGVKDQGQCG 150
            +  E+    +G   R P      T+   ++  +  V P S+DWR +GAVT VKDQGQC 
Sbjct: 107 LSLHEYAQICHGADPRPPRNHVFMTSSNRYKTSDGDVLPKSVDWRNEGAVTEVKDQGQCR 166

Query: 151 CCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGL 210
            CWAFS V A+EG+N I T +L +LSEQ+L++C+   E+ GC GG ++ A+EFI++N GL
Sbjct: 167 SCWAFSTVGAVEGLNKIVTGELVTLSEQDLINCNK--ENNGCGGGKVETAYEFIMNNGGL 224

Query: 211 ATEAKYPYKASDGSCNKK-EANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGS 269
            T+  YPYKA +G CN + + N     I GYE++P+N+E+ALMKAVA+QPV+  +D+S  
Sbjct: 225 GTDNDYPYKALNGVCNDRLKENNKNVMIDGYENLPANDESALMKAVAHQPVTAVVDSSSR 284

Query: 270 DFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDID 329
           +FQ Y+SGVF G CGT L+HGV  VGYGT ++G  YW+V+NS G TWGE GY++M R+I 
Sbjct: 285 EFQLYASGVFDGTCGTNLNHGVVVVGYGT-ENGRDYWIVRNSRGNTWGEAGYMKMARNIA 343

Query: 330 AKEGLCGIAMQASYP 344
              GLCGIAM+ASYP
Sbjct: 344 NPRGLCGIAMRASYP 358


>gi|255563136|ref|XP_002522572.1| cysteine protease, putative [Ricinus communis]
 gi|223538263|gb|EEF39872.1| cysteine protease, putative [Ricinus communis]
          Length = 340

 Score =  316 bits (810), Expect = 9e-84,   Method: Compositional matrix adjust.
 Identities = 174/346 (50%), Positives = 232/346 (67%), Gaps = 17/346 (4%)

Query: 6   LENKLVLAAILVLGVWAPQSWSRTLNDA-TMNERHEMWMAQYGRVYRDNAEKEMRFKIFK 64
           L+ KL +  +++L  W  Q+  R L D   + E+HE WMA++GR Y+D+ EKE RF IFK
Sbjct: 5   LQTKLAIV-LMILVTWVSQAMPRPLIDEDAVAEKHEQWMARHGRTYQDDEEKERRFHIFK 63

Query: 65  ENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYK--RRLPSVR-SSETTDVSF 121
           +N+++I +FNN A N+ YKLG+N FAD T+EEF A   GYK  + LP+   +++TT  S 
Sbjct: 64  KNLKHIENFNN-AFNRTYKLGLNHFADLTDEEFLATYTGYKMPKVLPTANITTKTTQSSD 122

Query: 122 RYENASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELV 181
               A+VP SIDWR +G VT VK+QG+CGCCWAFSA AA+EGI         SLS Q+L+
Sbjct: 123 VLYEANVPESIDWRTRGVVTPVKNQGRCGCCWAFSAAAAVEGI----IGNGVSLSAQQLL 178

Query: 182 DCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYE 241
           DC    +  GC GG MD+AF +II N+GLA+   YPY+     C       +AA+ISGY 
Sbjct: 179 DC--VPDSNGCNGGFMDNAFRYIIQNQGLASATYYPYQLMREMCRPSN---NAARISGYV 233

Query: 242 DVPSNNEAALMKAVANQPVSVAIDASGS-DFQFYSSGVFTGQ-CGTELDHGVTAVGYGTA 299
           DV   +E  L  AVA QPVS A+DA+   +F++Y  G+F  Q CG+ L H +T VGYGT+
Sbjct: 234 DVTPADEETLKSAVARQPVSAAVDATSELNFKYYGGGIFPPQDCGSTLTHAITIVGYGTS 293

Query: 300 DDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPT 345
            +GTKYWL+KNSWG  WGE GY+R+QRD+ +  G CGIA++ASYPT
Sbjct: 294 AEGTKYWLIKNSWGEGWGEGGYMRLQRDVGSYGGACGIALRASYPT 339


>gi|400180435|gb|AFP73355.1| cysteine protease [Solanum pennellii]
          Length = 344

 Score =  316 bits (809), Expect = 1e-83,   Method: Compositional matrix adjust.
 Identities = 159/347 (45%), Positives = 225/347 (64%), Gaps = 9/347 (2%)

Query: 1   MAMILLENKLVLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRF 60
           MAM +    +++    V+ ++  Q+ +R+    +++ERHE+WM+++GRVY+D  EK  RF
Sbjct: 1   MAMKVDLMSILITLFFVISMFNTQTRARSQPKLSVSERHELWMSRHGRVYKDEVEKGERF 60

Query: 61  KIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVS 120
            IFKEN+++I S N KA N  YKLG+NEFAD T++EF A   G       V  S  +   
Sbjct: 61  MIFKENMKFIESVN-KAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYVSPSPMSSTE 119

Query: 121 FRYENAS---VPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSE 177
           F+  + S   +P+++DWR+ GAVT VK+QGQCGCCWAFSAV ++EG   I T  L   SE
Sbjct: 120 FKINDLSDDDMPSNLDWRESGAVTQVKNQGQCGCCWAFSAVGSLEGAYKIATGNLMEFSE 179

Query: 178 QELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKI 237
           QEL+DC T+  + GC GG M +AF+FI  N G++ E+ Y Y     +C  +E   +A +I
Sbjct: 180 QELLDCTTN--NYGCNGGFMTNAFDFIKENGGISRESDYEYLGQQYTCRSQEKT-AAVQI 236

Query: 238 SGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYG 297
           S Y+ VP   E +L++AV  QPVS+ I AS  D QFY+ G + G C   ++H VTA+GYG
Sbjct: 237 SSYQVVPEG-ETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCANRINHAVTAIGYG 294

Query: 298 TADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
           T + G KYWL+KNSWGT+WGE+G++++ RD     GLC IA  +SYP
Sbjct: 295 TDEKGQKYWLLKNSWGTSWGEDGFMKIIRDSGNPAGLCDIAKVSSYP 341


>gi|32396020|gb|AAP41847.1| senescence-associated cysteine protease [Anthurium andraeanum]
          Length = 460

 Score =  316 bits (809), Expect = 1e-83,   Method: Compositional matrix adjust.
 Identities = 163/312 (52%), Positives = 200/312 (64%), Gaps = 10/312 (3%)

Query: 39  HEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFR 98
           +E W+   G+ Y    EKE RF+IF +N+ YI   N    N  Y LG+  FAD TNEE+R
Sbjct: 38  YEGWLVGNGKAYNLLGEKERRFEIFWDNLRYIDDHNRAENNHSYTLGLTRFADLTNEEYR 97

Query: 99  APRNGYKRRLPSVRSSETTDVSFRYENAS-----VPASIDWRKKGAVTGVKDQGQCGCCW 153
           +   G K     VR         R  + S     +P  +DWR+KGAV  +KDQG CG CW
Sbjct: 98  STYLGVKPG--QVRPRRANRAPGRGRDLSANGDDLPQKVDWREKGAVAPIKDQGGCGSCW 155

Query: 154 AFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATE 213
           AFS VAA+EGIN I T  L  LSEQELVDCDT+  ++GC GGLMD AF+FIISN G+ TE
Sbjct: 156 AFSTVAAVEGINQIVTGDLIVLSEQELVDCDTA-YNEGCNGGLMDYAFQFIISNGGIDTE 214

Query: 214 AKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQF 273
             YPYK  DG C+    N     I  YEDV  N+E AL  AVA+QPVSVAI+  G  FQ 
Sbjct: 215 EDYPYKERDGLCDPNRKNAKVVSIDSYEDVLENDEHALKTAVAHQPVSVAIEGGGRSFQL 274

Query: 274 YSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDI-DAKE 332
           Y SG+F G+CG +LDHGV AVGYGT + G  YW+V+NSWG +WGE GYIRM+R++  +  
Sbjct: 275 YKSGIFDGRCGIDLDHGVVAVGYGT-ESGKDYWIVRNSWGKSWGEAGYIRMERNLPSSSS 333

Query: 333 GLCGIAMQASYP 344
           G CGIA++ SYP
Sbjct: 334 GKCGIAIEPSYP 345


>gi|312282059|dbj|BAJ33895.1| unnamed protein product [Thellungiella halophila]
          Length = 379

 Score =  316 bits (809), Expect = 1e-83,   Method: Compositional matrix adjust.
 Identities = 154/309 (49%), Positives = 213/309 (68%), Gaps = 11/309 (3%)

Query: 40  EMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRA 99
           E W+ ++G+VY   AEKE R  IFK+N+ +I   N  + N  Y+LG+N FAD +  E++ 
Sbjct: 65  ESWIVKHGKVYDSVAEKERRLTIFKDNLRFIT--NRNSENLGYRLGLNRFADLSLHEYKE 122

Query: 100 PRNGYKRRLPSVRSSETTDVSFRYENAS---VPASIDWRKKGAVTGVKDQGQCGCCWAFS 156
             +G   + P  R+      S RY+ ++   +P S+DWR +GAVT VKDQG C  CWAFS
Sbjct: 123 ICHGADPKPP--RNHVFMSSSDRYKTSAGDVLPKSVDWRNEGAVTEVKDQGHCRSCWAFS 180

Query: 157 AVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKY 216
            V A+EG+N I T +L +LSEQ+L++C+   E+ GC GG ++ A+EFI+SN GL T+  Y
Sbjct: 181 TVGAVEGLNKIVTGELVTLSEQDLINCNK--ENNGCGGGKVETAYEFIVSNGGLGTDNDY 238

Query: 217 PYKASDGSCNKK-EANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYS 275
           PYKA +G+C+ + + N     I GYE++P+N+E ALMKAVA+QPV+  ID+S  +FQ Y 
Sbjct: 239 PYKAVNGACDGRLKENIKNVMIDGYENLPANDELALMKAVAHQPVTAVIDSSSREFQLYE 298

Query: 276 SGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLC 335
           SGVF G+CGT L+HGV  VGYGT ++G  YW+V+NSWG TWGE GY++M R+I    GLC
Sbjct: 299 SGVFDGRCGTNLNHGVVVVGYGT-ENGRNYWIVRNSWGNTWGEAGYMKMARNIANPRGLC 357

Query: 336 GIAMQASYP 344
           GIAM+ SYP
Sbjct: 358 GIAMRVSYP 366


>gi|356545071|ref|XP_003540969.1| PREDICTED: LOW QUALITY PROTEIN: vignain-like [Glycine max]
          Length = 317

 Score =  316 bits (809), Expect = 1e-83,   Method: Compositional matrix adjust.
 Identities = 169/307 (55%), Positives = 191/307 (62%), Gaps = 26/307 (8%)

Query: 18  LGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKA 77
           +   A Q   RTL DA+M ERHE WM++YG+VY+D  E+E RF+IFKEN+ YI +  N A
Sbjct: 1   MAFLASQVTCRTLQDASMYERHEEWMSRYGKVYKDPWEREKRFRIFKENMNYIETSKNAA 60

Query: 78  RNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENASVPASIDWRKK 137
             KPYKL IN+FAD  NEEF AP+N +K  +     S                       
Sbjct: 61  I-KPYKLVINQFADLNNEEFIAPQNIFKGMIICRLLSR---------------------- 97

Query: 138 GAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLM 197
            AVT VKDQG CG CWAF  VA+ EGI  +T  KL SLSEQELVDCDT G DQGCEG LM
Sbjct: 98  -AVTPVKDQGHCGFCWAFYDVASTEGILALTAGKLISLSEQELVDCDTKGVDQGCEGDLM 156

Query: 198 DDAF--EFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAV 255
           DDAF     +SN              DG CN  E    A  I+G EDVP+NNE AL K V
Sbjct: 157 DDAFFMAVTLSNSSFKILESRCQLGVDGKCNANEEVNPATTITGXEDVPANNEKALQKVV 216

Query: 256 ANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTT 315
           ANQPVS+AIDA  SDFQFY  GVFTG CGTELDHGVT VGYG + DGT+YWLVKNSW T 
Sbjct: 217 ANQPVSIAIDACDSDFQFYKRGVFTGSCGTELDHGVTIVGYGVSHDGTQYWLVKNSWETE 276

Query: 316 WGENGYI 322
           W  N  I
Sbjct: 277 WNSNRAI 283


>gi|225458143|ref|XP_002280937.1| PREDICTED: cysteine proteinase RD21a [Vitis vinifera]
 gi|302142569|emb|CBI19772.3| unnamed protein product [Vitis vinifera]
          Length = 436

 Score =  316 bits (809), Expect = 1e-83,   Method: Compositional matrix adjust.
 Identities = 165/336 (49%), Positives = 206/336 (61%), Gaps = 14/336 (4%)

Query: 10  LVLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEY 69
           L   +IL+L V +  S + +  D       E W  QYG+ Y    EK  R K+F+EN  +
Sbjct: 5   LWAVSILILAVHSSVSEASSTADL-----FEAWCEQYGKTYSSEEEKASRLKVFEENHAF 59

Query: 70  IASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKR-RLPSVRSSETTDVSFRYENASV 128
           +   N+ A N  Y L +N FAD T+ EF+A R G+   R  S+RS     V    +   V
Sbjct: 60  VTQHNSMA-NASYTLALNAFADLTHHEFKASRLGFSPGRAQSIRS-----VGTPVQELHV 113

Query: 129 PASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGE 188
           P ++DWRK GAVTGVKDQG CG CW+FS   A+EGIN I T  L SLSEQELVDCD S  
Sbjct: 114 PPAVDWRKSGAVTGVKDQGNCGGCWSFSTTGAIEGINKIVTGSLVSLSEQELVDCDRS-Y 172

Query: 189 DQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNE 248
           + GCEGGLMD A++F+I N+G+ +EA YPY   D  CNK++       I GY D+P N+E
Sbjct: 173 NSGCEGGLMDYAYQFVIKNQGIDSEADYPYVGMDKPCNKEKLKKHIVTIDGYTDIPPNDE 232

Query: 249 AALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLV 308
             L++ VA QPVSV I  S   FQ YS GV+TG C + LDH V  VGYGT +DG  +W+V
Sbjct: 233 KQLLQVVAKQPVSVGICGSEKTFQLYSKGVYTGPCSSTLDHAVLIVGYGT-EDGVDFWIV 291

Query: 309 KNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
           KNSWG  WG  GYI M R+    EG+CGI M ASYP
Sbjct: 292 KNSWGEHWGMRGYIHMLRNNGTAEGICGINMLASYP 327


>gi|400180373|gb|AFP73325.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score =  315 bits (807), Expect = 2e-83,   Method: Compositional matrix adjust.
 Identities = 158/347 (45%), Positives = 224/347 (64%), Gaps = 9/347 (2%)

Query: 1   MAMILLENKLVLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRF 60
           MAM +    +++    V+ ++  Q+  R+    +++ERHE+WM+++G VY+D  EK  RF
Sbjct: 1   MAMKVDLMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGHVYKDEVEKGERF 60

Query: 61  KIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVS 120
            IFKEN+++I S N KA N  YKLG+NEFAD T++EF A   G       +  S  +   
Sbjct: 61  MIFKENMKFIESVN-KAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTE 119

Query: 121 FRYENAS---VPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSE 177
           F+  + S   +P+++DWR+ GAVT VK QGQCGCCWAFSAV ++EG   I T  L   SE
Sbjct: 120 FKINDLSDDDMPSNLDWRESGAVTQVKHQGQCGCCWAFSAVGSLEGAYKIATGNLMEFSE 179

Query: 178 QELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKI 237
           QEL+DC T+  + GC+GG M +AF+FI  N G+++E+ Y Y     +C  +E   +A +I
Sbjct: 180 QELLDCTTN--NYGCDGGFMTNAFDFIKENGGISSESDYEYLGEQYTCRSQEKT-AAVQI 236

Query: 238 SGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYG 297
           S Y+ VP   E +L++AV  QPVS+ I AS  D QFY+ G + G C   ++H VTA+GYG
Sbjct: 237 SSYQVVPEG-ETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHAVTAIGYG 294

Query: 298 TADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
           T + G KYWL+KNSWGT+WGENG++++ RD     GLC IA  +SYP
Sbjct: 295 TDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGNPAGLCDIAKMSSYP 341


>gi|400180353|gb|AFP73315.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score =  315 bits (807), Expect = 2e-83,   Method: Compositional matrix adjust.
 Identities = 159/347 (45%), Positives = 223/347 (64%), Gaps = 9/347 (2%)

Query: 1   MAMILLENKLVLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRF 60
           MAM +    +++    V+ ++  Q+  R+    +++ERHE+WM+++GRVY+D  EK  RF
Sbjct: 1   MAMKVDLMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERF 60

Query: 61  KIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVS 120
            IFKEN+++I S N KA N  YKLG+NEFAD T++EF A   G       +  S  +   
Sbjct: 61  MIFKENMKFIESVN-KAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTE 119

Query: 121 FRYENAS---VPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSE 177
           F   + S   +P+++DWR+ GAVT VK QG+CGCCWAFSAV ++EG   I T  L   SE
Sbjct: 120 FIINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSE 179

Query: 178 QELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKI 237
           QEL+DC T+  + GC GG M +AF+FII N G++ E+ Y Y     +C  +E   +A +I
Sbjct: 180 QELLDCTTN--NYGCNGGFMTNAFDFIIENGGISRESDYEYLGEQYTCRSQEKT-AAVQI 236

Query: 238 SGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYG 297
           S Y+ VP   E +L++AV  QPVS+ I AS  D QFY+ G + G C   ++H VTA+GYG
Sbjct: 237 SSYQVVPEG-ETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHAVTAIGYG 294

Query: 298 TADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
           T + G KYWL+KNSWGT+WGENG++++ RD     GLC IA  +SYP
Sbjct: 295 TDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGNPAGLCDIAKMSSYP 341


>gi|400180381|gb|AFP73329.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score =  315 bits (807), Expect = 2e-83,   Method: Compositional matrix adjust.
 Identities = 158/347 (45%), Positives = 223/347 (64%), Gaps = 9/347 (2%)

Query: 1   MAMILLENKLVLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRF 60
           MAM +    +++    V+ ++  Q+  R+    +++ERHE+WM+++GRVY+D  EK  RF
Sbjct: 1   MAMKVDLMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERF 60

Query: 61  KIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVS 120
            IFKEN+++I S N KA N  YKLG+NEFAD T++EF A   G       +  S  +   
Sbjct: 61  MIFKENMKFIESVN-KAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTE 119

Query: 121 FRYENAS---VPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSE 177
           F+  + S   +P+++DWR+ GAVT VK QG+CGCCWAFSAV ++EG   I T  L   SE
Sbjct: 120 FKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSE 179

Query: 178 QELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKI 237
           QEL+DC T+  + GC GG M +AF+FII N G++ E+ Y Y     +C  +E   +A +I
Sbjct: 180 QELLDCTTN--NYGCNGGFMTNAFDFIIENGGISRESDYEYLGQQYTCRSQEKT-AAVQI 236

Query: 238 SGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYG 297
           S Y+ VP   E +L++AV  QPVS+ I AS  D QFY+ G + G C   ++H VTA+GYG
Sbjct: 237 SSYKVVPEG-ETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHAVTAIGYG 294

Query: 298 TADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
           T + G KYWL+KNSWGT+WGENG++++ RD     GLC I   +SYP
Sbjct: 295 TDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGDPSGLCDITKMSSYP 341


>gi|400180359|gb|AFP73318.1| cysteine protease [Solanum peruvianum]
 gi|400180477|gb|AFP73375.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score =  315 bits (807), Expect = 2e-83,   Method: Compositional matrix adjust.
 Identities = 158/347 (45%), Positives = 224/347 (64%), Gaps = 9/347 (2%)

Query: 1   MAMILLENKLVLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRF 60
           MAM +    +++    V+ ++  Q+ +R+    +++ERHE+WM+++GRVY+D  EK  RF
Sbjct: 1   MAMKVDLMSILITLFFVISMFNSQTRARSQPKLSVSERHELWMSRHGRVYKDEVEKGERF 60

Query: 61  KIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVS 120
            IFKEN+++I S N KA N  YKLG+NEFAD T++EF A   G       +  S  +   
Sbjct: 61  MIFKENMKFIESVN-KAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTE 119

Query: 121 FRYENAS---VPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSE 177
           F+  + S   +P+++DWR+ GAVT VK QG+CGCCWAFSAV ++EG   I T  L   SE
Sbjct: 120 FKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSE 179

Query: 178 QELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKI 237
           QEL+DC T+  + GC GG M +AF+FI  N G++ E+ Y Y     +C  +E   +A +I
Sbjct: 180 QELLDCTTN--NYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQYTCRSQEKT-AAVQI 236

Query: 238 SGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYG 297
           S Y+ VP   E +L++AV  QPVS+ I AS  D QFY+ G + G C   ++H VTA+GYG
Sbjct: 237 SSYQVVPEG-ETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHAVTAIGYG 294

Query: 298 TADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
           T + G KYWL+KNSWGT+WGENG++++ RD     GLC IA  +SYP
Sbjct: 295 TDEKGQKYWLLKNSWGTSWGENGFMKIIRDYGNPAGLCDIAKMSSYP 341


>gi|115448287|ref|NP_001047923.1| Os02g0715000 [Oryza sativa Japonica Group]
 gi|42408029|dbj|BAD09165.1| putative cysteine proteinase [Oryza sativa Japonica Group]
 gi|113537454|dbj|BAF09837.1| Os02g0715000 [Oryza sativa Japonica Group]
 gi|215737450|dbj|BAG96580.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|215765786|dbj|BAG87483.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|222623551|gb|EEE57683.1| hypothetical protein OsJ_08138 [Oryza sativa Japonica Group]
          Length = 366

 Score =  315 bits (807), Expect = 2e-83,   Method: Compositional matrix adjust.
 Identities = 155/305 (50%), Positives = 203/305 (66%), Gaps = 5/305 (1%)

Query: 42  WMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPR 101
           W  ++ ++Y    EK  R++IFK N+ +I   N   RN  Y LG+N FAD  +EEF+A  
Sbjct: 58  WSVKHSKIYASPKEKVKRYEIFKRNLRHIVETNR--RNGSYWLGLNHFADIAHEEFKASY 115

Query: 102 NGYKRRLPSVRSSETTDVSFRYENA-SVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAA 160
            G K  L    +      +FRY NA ++P ++DWRKKGAVT VK+QG+CG CWAFS VAA
Sbjct: 116 LGLKPGLARRDAQPHGSTTFRYANAVNLPWAVDWRKKGAVTPVKNQGECGSCWAFSTVAA 175

Query: 161 MEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKA 220
           +EGIN I T KL SLSEQEL+DCD +  + GC GGLMD AF +I+ N+G+ TE  YPY  
Sbjct: 176 VEGINQIVTGKLVSLSEQELMDCDNT-FNHGCRGGLMDFAFAYIMGNQGIYTEEDYPYLM 234

Query: 221 SDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFT 280
            +G C +K+ +     I+GYEDVP+N+E +L+KA+A+QPVSV I A   DFQFY  G+F 
Sbjct: 235 EEGYCREKQPHSKVITITGYEDVPANSETSLLKALAHQPVSVGIAAGSRDFQFYKGGIFD 294

Query: 281 GQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQ 340
           G+CG + DH +TAVGYG+   G  Y ++KNSWG  WGE GY R++R     EG+C I   
Sbjct: 295 GECGIQPDHALTAVGYGSY-YGQDYIIMKNSWGKNWGEQGYFRIRRGTGKPEGVCDIYKI 353

Query: 341 ASYPT 345
           ASYPT
Sbjct: 354 ASYPT 358


>gi|400180383|gb|AFP73330.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score =  315 bits (806), Expect = 2e-83,   Method: Compositional matrix adjust.
 Identities = 159/347 (45%), Positives = 223/347 (64%), Gaps = 9/347 (2%)

Query: 1   MAMILLENKLVLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRF 60
           MAM +    +++    V+ ++  Q+  R+    +++ERHE+WM+++GRVY+D  EK  RF
Sbjct: 1   MAMKVDLMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERF 60

Query: 61  KIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVS 120
            IFKEN+++I S N KA N  YKLG+NEFAD T++EF A   G       +  S  +   
Sbjct: 61  MIFKENMKFIESVN-KAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTE 119

Query: 121 FRYENAS---VPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSE 177
           F   + S   +P+++DWR+ GAVT VK QG+CGCCWAFSAV ++EG   I T  L   SE
Sbjct: 120 FIINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSE 179

Query: 178 QELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKI 237
           QEL+DC T+  + GC GG M +AF+FII N G++ E+ Y Y     +C  +E   +A +I
Sbjct: 180 QELLDCTTN--NYGCNGGFMTNAFDFIIENGGISRESDYEYLGQQYTCRSQEKT-AAVQI 236

Query: 238 SGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYG 297
           S Y+ VP   E +L++AV  QPVS+ I AS  D QFY+ G + G C   ++H VTA+GYG
Sbjct: 237 SSYKVVPEG-ETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHAVTAIGYG 294

Query: 298 TADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
           T + G KYWL+KNSWGT+WGENG++++ RD     GLC IA  +SYP
Sbjct: 295 TDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGNPSGLCDIAKMSSYP 341


>gi|400180457|gb|AFP73365.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score =  315 bits (806), Expect = 2e-83,   Method: Compositional matrix adjust.
 Identities = 159/347 (45%), Positives = 223/347 (64%), Gaps = 9/347 (2%)

Query: 1   MAMILLENKLVLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRF 60
           MAM +    +++    V+ ++  Q+  R+    +++ERHE+WM+++GRVY+D  EK  RF
Sbjct: 1   MAMKVDLMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERF 60

Query: 61  KIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVS 120
            IFKEN+++I S N KA N  YKLG+NEFAD T++EF A   G       +  S  +   
Sbjct: 61  MIFKENMKFIESVN-KAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTE 119

Query: 121 FRYENAS---VPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSE 177
           F   + S   +P+++DWR+ GAVT VK QG+CGCCWAFSAV ++EG   I T  L   SE
Sbjct: 120 FIINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSE 179

Query: 178 QELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKI 237
           QEL+DC T+  + GC GG M +AF+FII N G++ E+ Y Y     +C  +E   +A +I
Sbjct: 180 QELLDCTTN--NYGCNGGFMTNAFDFIIENGGISRESDYEYLGEQYTCRSQEKT-AAVQI 236

Query: 238 SGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYG 297
           S Y+ VP   E +L++AV  QPVS+ I AS  D QFY+ G + G C   ++H VTA+GYG
Sbjct: 237 SSYKVVPEG-ETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHAVTAIGYG 294

Query: 298 TADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
           T + G KYWL+KNSWGT+WGENG++++ RD     GLC IA  +SYP
Sbjct: 295 TDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGNPSGLCDIAKMSSYP 341


>gi|400180467|gb|AFP73370.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score =  315 bits (806), Expect = 3e-83,   Method: Compositional matrix adjust.
 Identities = 158/347 (45%), Positives = 224/347 (64%), Gaps = 9/347 (2%)

Query: 1   MAMILLENKLVLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRF 60
           MAM +    +++    V+ ++  Q+ +R+    +++ERHE+WM+++GRVY+D  EK  RF
Sbjct: 1   MAMKVDLMNILITLFFVISMFNTQTRARSQPKLSVSERHELWMSRHGRVYKDEVEKGERF 60

Query: 61  KIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVS 120
            IFKEN+++I S N KA N  YKLG+NEFAD T++EF A   G       +  S  +   
Sbjct: 61  MIFKENMKFIESVN-KAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTE 119

Query: 121 FRYENAS---VPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSE 177
           F+  + S   +P+++DWR+ GAVT VK QG+CGCCWAFSAV ++EG   I T  L   SE
Sbjct: 120 FKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSE 179

Query: 178 QELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKI 237
           QEL+DC T+  + GC GG M +AF+FI  N G++ E+ Y Y     +C  +E   +A +I
Sbjct: 180 QELLDCTTN--NYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQYTCRSQEKT-AAVQI 236

Query: 238 SGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYG 297
           S Y+ VP   E +L++AV  QPVS+ I AS  D QFY+ G + G C   ++H VTA+GYG
Sbjct: 237 SSYQVVPEG-ETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHAVTAIGYG 294

Query: 298 TADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
           T + G KYWL+KNSWGT+WGENG++++ RD     GLC IA  +SYP
Sbjct: 295 TDEKGQKYWLLKNSWGTSWGENGFMKIIRDYGNPAGLCDIAKMSSYP 341


>gi|190358935|sp|P00785.4|ACTN_ACTCH RecName: Full=Actinidain; Short=Actinidin; AltName: Allergen=Act c
           1; Flags: Precursor
 gi|12744965|gb|AAK06862.1|AF343446_1 actinidin protease [Actinidia chinensis]
          Length = 380

 Score =  315 bits (806), Expect = 3e-83,   Method: Compositional matrix adjust.
 Identities = 164/340 (48%), Positives = 214/340 (62%), Gaps = 14/340 (4%)

Query: 10  LVLAAILVLGV-WAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVE 68
           L  + +L+L + +  ++ ++  ND  +   +E W+ +YG+ Y    E E RF+IFKE + 
Sbjct: 13  LFFSTLLILSLAFNAKNLTQRTNDE-VKAMYESWLIKYGKSYNSLGEWERRFEIFKETLR 71

Query: 69  YIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYE---N 125
           +I   +N   N+ YK+G+N+FAD T+EEFR+        L     S  T VS RYE    
Sbjct: 72  FIDE-HNADTNRSYKVGLNQFADLTDEEFRS------TYLRFTSGSNKTKVSNRYEPRVG 124

Query: 126 ASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDT 185
             +P+ +DWR  GAV  +K QG+CG CWAFSA+A +EGIN I T  L SLSEQEL+DC  
Sbjct: 125 QVLPSYVDWRSAGAVVDIKSQGECGGCWAFSAIATVEGINKIVTGVLISLSEQELIDCGR 184

Query: 186 SGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPS 245
           +   +GC GG + D F+FII+N G+ TE  YPY A DG CN    N     I  YE+VP 
Sbjct: 185 TQNTRGCNGGYITDGFQFIINNGGINTEENYPYTAQDGECNVDLQNEKYVTIDTYENVPY 244

Query: 246 NNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKY 305
           NNE AL  AV  QPVSVA+DA+G  F+ YSSG+FTG CGT +DH VT VGYGT + G  Y
Sbjct: 245 NNEWALQTAVTYQPVSVALDAAGDAFKQYSSGIFTGPCGTAVDHAVTIVGYGT-EGGIDY 303

Query: 306 WLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPT 345
           W+VKNSW TTWGE GY+R+ R++    G CGIA   SYP 
Sbjct: 304 WIVKNSWDTTWGEEGYMRILRNVGGA-GTCGIATMPSYPV 342


>gi|400180393|gb|AFP73335.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score =  314 bits (805), Expect = 3e-83,   Method: Compositional matrix adjust.
 Identities = 158/347 (45%), Positives = 223/347 (64%), Gaps = 9/347 (2%)

Query: 1   MAMILLENKLVLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRF 60
           MAM +    +++    V+ ++  Q+  R+    +++ERHE+WM+++GRVY+D  EK  RF
Sbjct: 1   MAMKVDLMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKVERF 60

Query: 61  KIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVS 120
            IFKEN+++I S N KA N  YKLG+NEFAD T++EF A   G       +  S  +   
Sbjct: 61  MIFKENMKFIESVN-KAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTE 119

Query: 121 FRYENAS---VPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSE 177
            +  + S   +P+++DWR+ GAVT VK QG+CGCCWAFSAV ++EG   I T  L   SE
Sbjct: 120 LKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSE 179

Query: 178 QELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKI 237
           QEL+DC T+  + GC GG M +AF+FII N G++ E+ Y Y     +C  +E   +A +I
Sbjct: 180 QELLDCTTN--NYGCNGGFMTNAFDFIIENGGISRESDYEYLGEQYTCRSQEKT-AAVQI 236

Query: 238 SGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYG 297
           S Y+ VP   E +L++AV  QPVS+ I AS  D QFY+ G + G C   ++H VTA+GYG
Sbjct: 237 SSYKVVPEG-ETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHAVTAIGYG 294

Query: 298 TADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
           T + G KYWL+KNSWGT+WGENG++++ RD     GLC IA  +SYP
Sbjct: 295 TDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGNPSGLCDIAKMSSYP 341


>gi|400180379|gb|AFP73328.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score =  314 bits (805), Expect = 3e-83,   Method: Compositional matrix adjust.
 Identities = 158/347 (45%), Positives = 224/347 (64%), Gaps = 9/347 (2%)

Query: 1   MAMILLENKLVLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRF 60
           MAM +    +++    V+ ++  Q+  R+    +++ERHE+WM+++GRVY+D  EK  RF
Sbjct: 1   MAMKVDLMNILITLFFVITMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERF 60

Query: 61  KIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVS 120
            IFKEN+++I S N KA N  YKLG+NEFAD T++EF A   G       +  S  +   
Sbjct: 61  MIFKENMKFIESVN-KAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTE 119

Query: 121 FRYENAS---VPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSE 177
           F+  + S   +P+++DWR+ GAVT VK QG+CGCCWAFSAV ++EG   I T  L   SE
Sbjct: 120 FKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSE 179

Query: 178 QELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKI 237
           QEL+DC T+  + GC+GG M +AF+FI  N G++ E+ Y Y     +C  +E   +A +I
Sbjct: 180 QELLDCTTN--NYGCDGGFMTNAFDFIKENGGISRESDYEYLGEQYTCRSQEKT-AAVQI 236

Query: 238 SGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYG 297
           S Y+ VP   E +L++AV  QPVS+ I AS  D QFY+ G + G C   ++H VTA+GYG
Sbjct: 237 SSYQVVPEG-ETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHAVTAIGYG 294

Query: 298 TADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
           T + G KYWL+KNSWGT+WGENG++++ RD     GLC IA  +SYP
Sbjct: 295 TDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGNPSGLCDIAKMSSYP 341


>gi|400180463|gb|AFP73368.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score =  314 bits (805), Expect = 3e-83,   Method: Compositional matrix adjust.
 Identities = 158/347 (45%), Positives = 224/347 (64%), Gaps = 9/347 (2%)

Query: 1   MAMILLENKLVLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRF 60
           MAM +    +++    V+ ++  Q+ +R+    +++ERHE+WM+++GRVY+D  EK  RF
Sbjct: 1   MAMKVDLMNILITLFFVISMFNTQTRARSQPKLSVSERHELWMSRHGRVYKDEVEKGERF 60

Query: 61  KIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVS 120
            IFKEN+++I S N KA N  YKLG+NEFAD T++EF A   G       +  S  +   
Sbjct: 61  MIFKENMKFIESVN-KAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTE 119

Query: 121 FRYENAS---VPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSE 177
           F+  + S   +P+++DWR+ GAVT VK QG+CGCCWAFSAV ++EG   I T  L   SE
Sbjct: 120 FKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSE 179

Query: 178 QELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKI 237
           QEL+DC T+  + GC GG M +AF+FI  N G++ E+ Y Y     +C  +E   +A +I
Sbjct: 180 QELLDCTTN--NYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQYTCRSQEKT-AAVQI 236

Query: 238 SGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYG 297
           S Y+ VP   E +L++AV  QPVS+ I AS  D QFY+ G + G C   ++H VTA+GYG
Sbjct: 237 SSYQVVPEG-ETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHAVTAIGYG 294

Query: 298 TADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
           T + G KYWL+KNSWGT+WGENG++++ RD     GLC IA  +SYP
Sbjct: 295 TDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGDPSGLCDIAKMSSYP 341


>gi|400180407|gb|AFP73342.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score =  314 bits (805), Expect = 4e-83,   Method: Compositional matrix adjust.
 Identities = 158/347 (45%), Positives = 223/347 (64%), Gaps = 9/347 (2%)

Query: 1   MAMILLENKLVLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRF 60
           MAM +    +++    V+ ++  Q+  R+    +++ERHE+WM+++GRVY+D  EK  RF
Sbjct: 1   MAMKVDLMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKVERF 60

Query: 61  KIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVS 120
            IFKEN+++I S N KA N  YKLG+NEFAD T++EF A   G       +  S  +   
Sbjct: 61  MIFKENMKFIESVN-KAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTE 119

Query: 121 FRYENAS---VPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSE 177
           F+  + S   +P+++DWR+ GAVT VK QG+CGCCWAFSAV ++EG   I T  L   SE
Sbjct: 120 FKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSE 179

Query: 178 QELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKI 237
           QEL+DC T+  + GC GG M +AF+FI  N G++ E+ Y Y     +C  +E   +A +I
Sbjct: 180 QELLDCTTN--NYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQYTCRSQEKT-AAVQI 236

Query: 238 SGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYG 297
           S Y+ VP   E +L++AV  QPVS+ I AS  D QFY+ G + G C   ++H VTA+GYG
Sbjct: 237 SSYQVVPEG-ETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHAVTAIGYG 294

Query: 298 TADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
           T + G KYWL+KNSWGT+WGENG++++ RD     GLC IA  +SYP
Sbjct: 295 TDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGNPSGLCDIAKMSSYP 341


>gi|413938554|gb|AFW73105.1| hypothetical protein ZEAMMB73_931917 [Zea mays]
          Length = 361

 Score =  314 bits (804), Expect = 4e-83,   Method: Compositional matrix adjust.
 Identities = 166/357 (46%), Positives = 224/357 (62%), Gaps = 21/357 (5%)

Query: 6   LENKLVLAAILVLGVWAPQSWSRTLNDATMNERHE----------MWMAQYGRVYRDNAE 55
           +E KL +A  ++   +A  S +   + + +    E           W  ++G++Y    E
Sbjct: 1   MEPKLAVAVFVLFLAFAACSANHHRDPSVVGYSQEDLALPSSLFRSWSVKHGKLYASPTE 60

Query: 56  KEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSE 115
           K  R++IFK+N+ +IA  N K  N  Y LG+N+FAD  +EEF+A   G KR LP   + +
Sbjct: 61  KLERYEIFKQNLMHIAETNRK--NGSYWLGLNQFADVAHEEFKASYLGLKRALPRAGAPQ 118

Query: 116 T-TDVSFRYEN---ASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRK 171
           T T  +FRY      S+P S+DWR KGAVT VK+QG+CG CWAFS+VAA+EGIN I T K
Sbjct: 119 TRTPTAFRYAAAAAGSLPWSVDWRYKGAVTPVKNQGKCGSCWAFSSVAAVEGINQIVTGK 178

Query: 172 LTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEA- 230
           L SLSEQELVDCDT+  D GCEGG MD AF +++ ++G+  E  YPY   +G C +K+  
Sbjct: 179 LVSLSEQELVDCDTT-LDHGCEGGTMDLAFAYMMGSQGIHAEDDYPYLMEEGYCKEKQPC 237

Query: 231 --NPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELD 288
               +   ++G+EDVP N+E +L+KA+A+QPVSV I A   DFQFY  GVF G C  ELD
Sbjct: 238 VLGITEQDLTGFEDVPENSEISLLKALAHQPVSVGIAAGSRDFQFYRGGVFDGACSVELD 297

Query: 289 HGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPT 345
           H +TAVGYG++  G  Y  +KNSWG  WGE GY+R++      EG+CGI   ASYP 
Sbjct: 298 HALTAVGYGSS-YGQNYITMKNSWGKNWGEQGYVRIKMGTGKPEGVCGIYTMASYPV 353


>gi|400180455|gb|AFP73364.1| cysteine protease [Solanum peruvianum]
 gi|400180459|gb|AFP73366.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score =  314 bits (804), Expect = 4e-83,   Method: Compositional matrix adjust.
 Identities = 158/347 (45%), Positives = 223/347 (64%), Gaps = 9/347 (2%)

Query: 1   MAMILLENKLVLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRF 60
           MAM +    +++    V+ ++  Q+  R+    +++ERHE+WM+++GRVY+D  EK  RF
Sbjct: 1   MAMKVDLMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERF 60

Query: 61  KIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVS 120
            IFKEN+++I S N KA N  YKLG+NEFAD T++EF A   G       +  S  +   
Sbjct: 61  MIFKENMKFIESVN-KAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTE 119

Query: 121 FRYENAS---VPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSE 177
           F+  + S   +P+++DWR+ GAVT VK QG+CGCCWAFSAV ++EG   I T  L   SE
Sbjct: 120 FKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSE 179

Query: 178 QELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKI 237
           QEL+DC T+  + GC GG M +AF+FI  N G++ E+ Y Y     +C  +E   +A +I
Sbjct: 180 QELLDCTTN--NYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQYTCRSQEKT-AAVQI 236

Query: 238 SGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYG 297
           S Y+ VP   E +L++AV  QPVS+ I AS  D QFY+ G + G C   ++H VTA+GYG
Sbjct: 237 SSYQVVPEG-ETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHAVTAIGYG 294

Query: 298 TADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
           T + G KYWL+KNSWGT+WGENG++++ RD     GLC IA  +SYP
Sbjct: 295 TDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGDPSGLCDIAKMSSYP 341


>gi|357166364|ref|XP_003580686.1| PREDICTED: oryzain alpha chain-like [Brachypodium distachyon]
          Length = 360

 Score =  314 bits (804), Expect = 4e-83,   Method: Compositional matrix adjust.
 Identities = 167/309 (54%), Positives = 213/309 (68%), Gaps = 13/309 (4%)

Query: 42  WMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARN--KPYKLGINEFADQTNEEFRA 99
           W AQ+G    +  E+E R++ F++N+ YI   N  A      ++LG+N FA  TNEE+RA
Sbjct: 46  WTAQHGSPITN--EEEGRYEAFRDNLRYIDEHNAAADAGIHSFRLGLNRFAGLTNEEYRA 103

Query: 100 PRNGYKRRLPSVRSSETTDVSFRYENA---SVPASIDWRKKGAVTGVKDQGQ-CGCCWAF 155
              G + R  +V   +    S RYE A   ++P S+DWR+KGAV  VKDQG+ CG  WAF
Sbjct: 104 AYLGLRLRSGAV--GDLRKPSARYEAADGEALPESVDWREKGAVGKVKDQGRSCGSAWAF 161

Query: 156 SAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAK 215
           SA+AA+E IN I T +L SLSEQEL+DCDTS  + GC+GGLMDDAFEFIISN G+ T+  
Sbjct: 162 SAIAAVESINQIVTGELISLSEQELMDCDTS-YNAGCDGGLMDDAFEFIISNGGIDTDED 220

Query: 216 YPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYS 275
           YPYKA + SC+  + N  A  I  YED+   NE +L KAV+NQPVSVAI+A G DFQ Y 
Sbjct: 221 YPYKARNDSCDANKRNRKAVTIDDYEDL-RMNEKSLQKAVSNQPVSVAIEAGGRDFQLYK 279

Query: 276 SGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLC 335
           SG+FTG CGT+LDH  T VGYG+ ++GT YW+VK S+GT+WGE+GY RM+R+I    G C
Sbjct: 280 SGIFTGTCGTDLDHATTIVGYGS-ENGTDYWIVKESYGTSWGESGYARMERNIKETSGKC 338

Query: 336 GIAMQASYP 344
           GIAM  SYP
Sbjct: 339 GIAMLPSYP 347


>gi|297819566|ref|XP_002877666.1| hypothetical protein ARALYDRAFT_906213 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297323504|gb|EFH53925.1| hypothetical protein ARALYDRAFT_906213 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 304

 Score =  314 bits (804), Expect = 4e-83,   Method: Compositional matrix adjust.
 Identities = 157/319 (49%), Positives = 218/319 (68%), Gaps = 25/319 (7%)

Query: 30  LNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEF 89
           L +A+  E+HE WM+++ RVY D++EK  RF+IFK+N++++ SFN    N  YKL +N+F
Sbjct: 9   LFEASAIEKHEQWMSRFNRVYSDDSEKTSRFEIFKKNLKFVESFNMNTNNT-YKLDVNKF 67

Query: 90  ADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENAS-VPASIDWRKKGAVTGVKDQGQ 148
           +D T+EEF+A   G      +  S +T  VSFRYEN S    S+DWR +GAVT VKDQGQ
Sbjct: 68  SDLTDEEFQARYMGLVPEGMTGDSQKT--VSFRYENVSETGESMDWRLEGAVTPVKDQGQ 125

Query: 149 CGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNK 208
           CGCCWAF+AVAA+EG+  I   +L SLSEQ+LVDC T+  + GC+GGL   A+++I  N+
Sbjct: 126 CGCCWAFAAVAAVEGVTKIANGELVSLSEQQLVDCSTANNNMGCDGGLALTAYDYIKENQ 185

Query: 209 GLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASG 268
           G+ +E  YPY+A   +C  K  +P+AA ISGYE VP ++E AL+KAV+            
Sbjct: 186 GITSEENYPYQAVQQTC--KSTDPAAATISGYEAVPKDDEEALLKAVSQH---------- 233

Query: 269 SDFQFYSSGVFTGQ-CGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRD 327
                   G+F  + CGT+  H VT VGYGT+++G KYWL+KNSWG +WGENGY+R++RD
Sbjct: 234 --------GIFEDEYCGTDSHHAVTIVGYGTSEEGIKYWLLKNSWGESWGENGYMRIKRD 285

Query: 328 IDAKEGLCGIAMQASYPTA 346
           +D  +G+CG+A +A YP A
Sbjct: 286 VDEPQGMCGLAHRAYYPVA 304


>gi|400180417|gb|AFP73347.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score =  314 bits (804), Expect = 5e-83,   Method: Compositional matrix adjust.
 Identities = 159/347 (45%), Positives = 224/347 (64%), Gaps = 9/347 (2%)

Query: 1   MAMILLENKLVLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRF 60
           MAM +    +++    V+ ++  Q+  R+    +++ERHE+WM+++GRVY+D  EK  RF
Sbjct: 1   MAMKVDLMNILITVFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERF 60

Query: 61  KIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVS 120
            IFKEN+++I S N KA N  YKLG+NEFAD T++EF A   G       +  S  +   
Sbjct: 61  MIFKENMKFIESVN-KAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPLSSTE 119

Query: 121 FRYENAS---VPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSE 177
           F+  + S   +P+++DWR+ GAVT VK QG+CGCCWAFSAV ++EG   I T  L   SE
Sbjct: 120 FKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSE 179

Query: 178 QELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKI 237
           QEL+DC T+  + GC GG M +AF+FII N G++ E+ Y Y     +C  +E   +A +I
Sbjct: 180 QELLDCTTN--NYGCNGGFMTNAFDFIIENGGISRESDYEYLGQQYTCRSQE-KTAAVQI 236

Query: 238 SGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYG 297
           S Y+ VP   E +L++AV  QPVS+ I AS  D QFY+ G + G C   ++H VTA+GYG
Sbjct: 237 SSYKVVPEG-ETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHAVTAIGYG 294

Query: 298 TADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
           T + G KYWL+KNSWGT+WGENG++++ RD     GLC IA  +SYP
Sbjct: 295 TDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGNPSGLCDIAKMSSYP 341


>gi|400180369|gb|AFP73323.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score =  314 bits (804), Expect = 5e-83,   Method: Compositional matrix adjust.
 Identities = 158/347 (45%), Positives = 223/347 (64%), Gaps = 9/347 (2%)

Query: 1   MAMILLENKLVLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRF 60
           MAM +    +++    V+ ++  Q+  R+    +++ERHE+WM+++GRVY+D  EK  RF
Sbjct: 1   MAMKVDLMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERF 60

Query: 61  KIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVS 120
            IFKEN+++I S N KA N  YKLG+NEFAD T++EF A   G       +  S  +   
Sbjct: 61  MIFKENMKFIESVN-KAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTE 119

Query: 121 FRYENAS---VPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSE 177
           F+  + S   +P+++DWR+ GAVT VK QG+CGCCWAFSAV ++EG   I T  L   SE
Sbjct: 120 FKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSE 179

Query: 178 QELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKI 237
           QEL+DC T+  + GC GG M +AF+FI  N G++ E+ Y Y     +C  +E   +A +I
Sbjct: 180 QELLDCTTN--NYGCNGGFMTNAFDFIKENGGISRESDYEYLGQQYTCRSQEKT-AAVQI 236

Query: 238 SGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYG 297
           S Y+ VP   E +L++AV  QPVS+ I AS  D QFY+ G + G C   ++H VTA+GYG
Sbjct: 237 SSYKVVPEG-ETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHAVTAIGYG 294

Query: 298 TADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
           T + G KYWL+KNSWGT+WGENG++++ RD     GLC IA  +SYP
Sbjct: 295 TDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGNPSGLCDIAKMSSYP 341


>gi|400180351|gb|AFP73314.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score =  314 bits (804), Expect = 5e-83,   Method: Compositional matrix adjust.
 Identities = 158/347 (45%), Positives = 223/347 (64%), Gaps = 9/347 (2%)

Query: 1   MAMILLENKLVLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRF 60
           MAM +    +++    V+ ++  Q+  R+    +++ERHE+WM+++GRVY+D  EK  RF
Sbjct: 1   MAMKVDLMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERF 60

Query: 61  KIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVS 120
            IFKEN+++I S N KA N  YKLG+NEFAD T++EF A   G       +  S  +   
Sbjct: 61  MIFKENMKFIESVN-KAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTE 119

Query: 121 FRYENAS---VPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSE 177
           F+  + S   +P+++DWR+ GAVT VK QG+CGCCWAFSAV ++EG   I T  L   SE
Sbjct: 120 FKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSE 179

Query: 178 QELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKI 237
           QEL+DC T+  + GC GG M +AF+FI  N G++ E+ Y Y     +C  +E   +A +I
Sbjct: 180 QELLDCTTN--NYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQYTCRSQEKT-AAVQI 236

Query: 238 SGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYG 297
           S Y+ VP   E +L++AV  QPVS+ I AS  D QFY+ G + G C   ++H VTA+GYG
Sbjct: 237 SSYQVVPEG-ETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHAVTAIGYG 294

Query: 298 TADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
           T + G KYWL+KNSWGT+WGENG++++ RD     GLC IA  +SYP
Sbjct: 295 TDEKGQKYWLLKNSWGTSWGENGFMKIIRDYGNPAGLCDIAKMSSYP 341


>gi|449500383|ref|XP_004161083.1| PREDICTED: vignain-like [Cucumis sativus]
          Length = 345

 Score =  314 bits (804), Expect = 5e-83,   Method: Compositional matrix adjust.
 Identities = 168/312 (53%), Positives = 217/312 (69%), Gaps = 16/312 (5%)

Query: 39  HEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEF- 97
           +E W  ++  + R+  EK  RF +FKENV ++ + N    +KPYKL +N+FAD +N EF 
Sbjct: 41  YERW-GKHHTISRNLKEKHKRFSVFKENVNHVFTVNQM--DKPYKLKLNKFADMSNYEFV 97

Query: 98  ----RAPRNGYKRRLPSVRSSETTDVSFRYE-NASVPASIDWRKKGAVTGVKDQGQCGCC 152
               R+  + Y++     R +      F YE +  +P+S+D R++GAV  VK+QG+CG C
Sbjct: 98  NFYARSNISHYRKLHERRRGAG----GFMYEQDTDLPSSVDGRERGAVNAVKEQGRCGSC 153

Query: 153 WAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLAT 212
           WAFS+VAA+EGIN I T +L SLSEQEL+DC+    ++GC GG M+ AF+FI  N G+AT
Sbjct: 154 WAFSSVAAVEGINKIKTNQLLSLSEQELLDCNY--RNKGCNGGFMEIAFDFIKRNGGIAT 211

Query: 213 EAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQ 272
           E  YPY  S G C     +    KI GYE VP N E ALM+AVANQPVSVAIDA+G DFQ
Sbjct: 212 ENSYPYHGSRGLCRSSRISSPIVKIDGYESVPEN-EDALMQAVANQPVSVAIDAAGRDFQ 270

Query: 273 FYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKE 332
           FYS GVF G CGTEL+HGV A+GYGT +DGT YWLV+NSWG  WGE+GY+RM+R ++  E
Sbjct: 271 FYSQGVFDGYCGTELNHGVVAIGYGTTEDGTDYWLVRNSWGVGWGEDGYVRMKRGVEQAE 330

Query: 333 GLCGIAMQASYP 344
           GLCGIAM+ASYP
Sbjct: 331 GLCGIAMEASYP 342


>gi|641905|gb|AAC49406.1| cysteine proteinase [Zinnia violacea]
          Length = 342

 Score =  313 bits (803), Expect = 5e-83,   Method: Compositional matrix adjust.
 Identities = 159/286 (55%), Positives = 201/286 (70%), Gaps = 8/286 (2%)

Query: 43  MAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRN 102
           + ++ ++Y    EK  RF+IF +N+++I   N K  N  Y LG+NEFAD T+EEF+    
Sbjct: 53  LVKHSKIYESFDEKLHRFEIFMDNLKHIDETNKKVSN--YWLGLNEFADLTHEEFKNKFL 110

Query: 103 GYKRRLPSVRSSETTDVSFRYEN-ASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAM 161
           G+K  L   R  E+ +  FRY +   +P S+DWRKKGAV+ VK+QGQCG CWAFS VAA+
Sbjct: 111 GFKGELAE-RKDESIE-QFRYRDFVDLPKSVDWRKKGAVSPVKNQGQCGSCWAFSTVAAV 168

Query: 162 EGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKAS 221
           EGIN I T  LT LSEQEL+DCDT+  + GC GGLMD AF ++  N GL  E +YPY  S
Sbjct: 169 EGINQIVTGNLTVLSEQELIDCDTTF-NNGCNGGLMDYAFAYVTRN-GLHKEEEYPYIMS 226

Query: 222 DGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTG 281
           +G+C++K        ISGY DVP NNE + +KA+ANQP+SVAI+ASG DFQFYS GVF G
Sbjct: 227 EGTCDEKRDASEKVTISGYHDVPRNNEDSFLKALANQPISVAIEASGRDFQFYSGGVFDG 286

Query: 282 QCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRD 327
            CGTELDHGV AVGYGT+  G  Y +V+NSWG  WGE GYIRM+R+
Sbjct: 287 HCGTELDHGVAAVGYGTS-KGLDYVIVRNSWGPKWGEKGYIRMKRN 331


>gi|400180403|gb|AFP73340.1| cysteine protease [Solanum peruvianum]
 gi|400180413|gb|AFP73345.1| cysteine protease [Solanum peruvianum]
 gi|400180415|gb|AFP73346.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score =  313 bits (803), Expect = 5e-83,   Method: Compositional matrix adjust.
 Identities = 158/347 (45%), Positives = 223/347 (64%), Gaps = 9/347 (2%)

Query: 1   MAMILLENKLVLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRF 60
           MAM +    +++    V+ ++  Q+  R+    +++ERHE+WM+++GRVY+D  EK  RF
Sbjct: 1   MAMKVDLMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERF 60

Query: 61  KIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVS 120
            IFKEN+++I S N KA N  YKLG+NEFAD T++EF A   G       +  S  +   
Sbjct: 61  MIFKENMKFIESVN-KAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTE 119

Query: 121 FRYENAS---VPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSE 177
           F+  + S   +P+++DWR+ GAVT VK QG+CGCCWAFSAV ++EG   I T  L   SE
Sbjct: 120 FKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSE 179

Query: 178 QELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKI 237
           QEL+DC T+  + GC GG M +AF+FI  N G++ E+ Y Y     +C  +E   +A +I
Sbjct: 180 QELLDCTTN--NYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQYTCRSQEKT-AAVQI 236

Query: 238 SGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYG 297
           S Y+ VP   E +L++AV  QPVS+ I AS  D QFY+ G + G C   ++H VTA+GYG
Sbjct: 237 SSYQVVPEG-ETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHAVTAIGYG 294

Query: 298 TADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
           T + G KYWL+KNSWGT+WGENG++++ RD     GLC IA  +SYP
Sbjct: 295 TDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGNPAGLCDIAKMSSYP 341


>gi|125540888|gb|EAY87283.1| hypothetical protein OsI_08685 [Oryza sativa Indica Group]
          Length = 357

 Score =  313 bits (803), Expect = 5e-83,   Method: Compositional matrix adjust.
 Identities = 155/305 (50%), Positives = 202/305 (66%), Gaps = 5/305 (1%)

Query: 42  WMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPR 101
           W  ++ ++Y    EK  R++IFK N+ +I   N   RN  Y LG+N FAD  +EEF+A  
Sbjct: 49  WSVKHSKIYASPKEKVKRYEIFKRNLRHIVETNR--RNGSYWLGLNHFADIAHEEFKASY 106

Query: 102 NGYKRRLPSVRSSETTDVSFRYENA-SVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAA 160
            G K  L    +      +FRY NA ++P ++DWRKKGAVT VK+QG+CG CWAFS VAA
Sbjct: 107 LGLKPGLARRDAQPHGSTTFRYANAVNLPWAVDWRKKGAVTPVKNQGECGSCWAFSTVAA 166

Query: 161 MEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKA 220
           +EGIN I T KL SLSEQEL+DCD +  + GC GGLMD AF +I+ N+G+ TE  YPY  
Sbjct: 167 VEGINQIVTGKLVSLSEQELMDCDNT-FNHGCRGGLMDFAFAYIMGNQGIYTEEDYPYLM 225

Query: 221 SDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFT 280
            +G C +K+ +     I+GYEDVP N+E +L+KA+A+QPVSV I A   DFQFY  G+F 
Sbjct: 226 EEGYCREKQPHSKVITITGYEDVPENSETSLLKALAHQPVSVGIAAGSRDFQFYKGGIFD 285

Query: 281 GQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQ 340
           G+CG + DH +TAVGYG+   G  Y ++KNSWG  WGE GY R++R     EG+C I   
Sbjct: 286 GECGIQPDHALTAVGYGSY-YGQDYIIMKNSWGKNWGEQGYFRIRRGTGKPEGVCDIYKI 344

Query: 341 ASYPT 345
           ASYPT
Sbjct: 345 ASYPT 349


>gi|297843430|ref|XP_002889596.1| hypothetical protein ARALYDRAFT_887827 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297335438|gb|EFH65855.1| hypothetical protein ARALYDRAFT_887827 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 343

 Score =  313 bits (803), Expect = 6e-83,   Method: Compositional matrix adjust.
 Identities = 167/356 (46%), Positives = 220/356 (61%), Gaps = 27/356 (7%)

Query: 1   MAMILLENKLVLAAILVLGVWAPQ--SWSRTLNDA--TMNERHEMWMAQYGRVYRDNAEK 56
           M  +L  + L L  ++   + A +  S + ++ D   T+ +R E W+  + ++Y    E 
Sbjct: 1   MLNVLRNSNLTLVVLICFVLIASKLCSVNSSVYDPHKTLKQRFEKWLKTHSKLYGGRDEW 60

Query: 57  EMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNG--------YKRRL 108
            +RF I++ NV+ I   N  + + P+KL  N FAD TN EF+A   G        +K++ 
Sbjct: 61  MLRFGIYQSNVQLIDYIN--SLHLPFKLTDNRFADMTNSEFKAHFLGLNTSSLRLHKKQR 118

Query: 109 PSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHIT 168
           P    +            +VP ++DWR +GAVT +++QG+CG CWAFSAVAA+EGIN I 
Sbjct: 119 PVCDPA-----------GNVPDAVDWRTQGAVTPIRNQGKCGGCWAFSAVAAIEGINKIK 167

Query: 169 TRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKK 228
           T  L SLSEQ+L+DCD    ++GC GGLM+ AFEFI SN GL TE  YPY   +G+C+++
Sbjct: 168 TGNLVSLSEQQLIDCDVGTYNKGCSGGLMETAFEFIKSNGGLTTETDYPYTGIEGTCDQE 227

Query: 229 EANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELD 288
           +A      I GY+ V + NEA+L  A A QPVSV IDA G  FQ YSSGVFT  CGT L+
Sbjct: 228 KAKNKVVTIQGYQKV-AQNEASLQIAAAQQPVSVGIDAGGFIFQLYSSGVFTSYCGTNLN 286

Query: 289 HGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
           HGVT VGYG   D  KYW+VKNSWGT WGE GYIRM+R I    G CGIAM ASYP
Sbjct: 287 HGVTVVGYGVEGD-QKYWIVKNSWGTGWGEEGYIRMERGISEDTGKCGIAMLASYP 341


>gi|18390634|ref|NP_563764.1| cysteine proteinase-like protein [Arabidopsis thaliana]
 gi|8844131|gb|AAF80223.1|AC025290_12 Contains similarity to a cysteine endopeptidase 1 from Phaseolus
           vulgaris gb|U52970 and is a member of the papain
           cysteine protease family PF|00112 [Arabidopsis thaliana]
 gi|332189848|gb|AEE27969.1| cysteine proteinase-like protein [Arabidopsis thaliana]
          Length = 343

 Score =  313 bits (803), Expect = 7e-83,   Method: Compositional matrix adjust.
 Identities = 166/356 (46%), Positives = 221/356 (62%), Gaps = 27/356 (7%)

Query: 1   MAMILLENKLVLAAILVLGVWAPQ--SWSRTLNDA--TMNERHEMWMAQYGRVYRDNAEK 56
           M  +L  + L LA ++   + A +  S   ++ D   T+ +R E W+  + ++Y    E 
Sbjct: 1   MLNVLRNSNLTLAVLICFVLIASKLCSVDSSVYDPHKTLKQRFEKWLKTHSKLYGGRDEW 60

Query: 57  EMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNG--------YKRRL 108
            +RF I++ NV+ I   N  + + P+KL  N FAD TN EF+A   G        +K++ 
Sbjct: 61  MLRFGIYQSNVQLIDYIN--SLHLPFKLTDNRFADMTNSEFKAHFLGLNTSSLRLHKKQR 118

Query: 109 PSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHIT 168
           P    +            +VP ++DWR +GAVT +++QG+CG CWAFSAVAA+EGIN I 
Sbjct: 119 PVCDPA-----------GNVPDAVDWRTQGAVTPIRNQGKCGGCWAFSAVAAIEGINKIK 167

Query: 169 TRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKK 228
           T  L SLSEQ+L+DCD    ++GC GGLM+ AFEFI +N GLATE  YPY   +G+C+++
Sbjct: 168 TGNLVSLSEQQLIDCDVGTYNKGCSGGLMETAFEFIKTNGGLATETDYPYTGIEGTCDQE 227

Query: 229 EANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELD 288
           ++      I GY+ V + NEA+L  A A QPVSV IDA G  FQ YSSGVFT  CGT L+
Sbjct: 228 KSKNKVVTIQGYQKV-AQNEASLQIAAAQQPVSVGIDAGGFIFQLYSSGVFTNYCGTNLN 286

Query: 289 HGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
           HGVT VGYG   D  KYW+VKNSWGT WGE GYIRM+R +    G CGIAM ASYP
Sbjct: 287 HGVTVVGYGVEGD-QKYWIVKNSWGTGWGEEGYIRMERGVSEDTGKCGIAMMASYP 341


>gi|312451845|gb|ADQ85986.1| actinidin [Actinidia chinensis]
          Length = 380

 Score =  313 bits (802), Expect = 7e-83,   Method: Compositional matrix adjust.
 Identities = 163/340 (47%), Positives = 213/340 (62%), Gaps = 14/340 (4%)

Query: 10  LVLAAILVLGV-WAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVE 68
           L  + +L+L + +  ++ ++  ND  +   +E W+ +YG+ Y    E E RF+IFKE + 
Sbjct: 13  LFFSTLLILSLAFNAKNLTQRTNDE-VKAMYESWLIKYGKSYNSLGEWERRFEIFKETLR 71

Query: 69  YIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYE---N 125
           +I   +N   N+ YK+G+N+FAD T+EEFR+   G+         S  T VS RYE    
Sbjct: 72  FIDE-HNADTNRSYKVGLNQFADLTDEEFRSTYLGF------TSGSNKTKVSNRYEPRVG 124

Query: 126 ASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDT 185
             +P+ +DWR  GAV  +K QG+CG CWAFSA+A +EGIN I T  L SLSEQEL+DC  
Sbjct: 125 QVLPSYVDWRSAGAVVDIKSQGECGGCWAFSAIATVEGINKIVTGVLISLSEQELIDCGR 184

Query: 186 SGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPS 245
           +   +GC G  + D F FII+N G+ TE  YPY A DG CN    N     I  YE+VP 
Sbjct: 185 TQNTRGCNGSYITDGFPFIINNGGINTEENYPYTAQDGECNVDLQNEKYVTIDTYENVPY 244

Query: 246 NNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKY 305
           NNE AL  AV  QPVSVA+DA+G  F+ YSSG+FTG CGT +DH VT VGYGT + G  Y
Sbjct: 245 NNEWALQTAVTYQPVSVALDAAGDAFKQYSSGIFTGPCGTAIDHAVTIVGYGT-EGGIDY 303

Query: 306 WLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPT 345
           W+VKNSW TTWGE GY+R+ R++    G CGIA   SYP 
Sbjct: 304 WIVKNSWDTTWGEEGYMRILRNVGGA-GTCGIATMPSYPV 342


>gi|20334375|gb|AAM19208.1|AF493233_1 cysteine protease [Solanum pennellii]
          Length = 337

 Score =  313 bits (802), Expect = 7e-83,   Method: Compositional matrix adjust.
 Identities = 158/344 (45%), Positives = 223/344 (64%), Gaps = 10/344 (2%)

Query: 1   MAMILLENKLVLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRF 60
           MAM +    +++    V+ ++  Q+  R+    +++ERHE+WM+++GRVY+D  EK  RF
Sbjct: 1   MAMKIDLMSILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERF 60

Query: 61  KIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVS 120
            IFKEN+++I S N KA N  YKLG+NEFAD T++EF A   G       +  S   D+S
Sbjct: 61  MIFKENMKFIESVN-KAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPINDLS 119

Query: 121 FRYENASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQEL 180
               +  +P+++DWR+ GAVT VK+QGQCGCCWAFSAV ++EG   I T  L   SEQEL
Sbjct: 120 ----DDDMPSNLDWRESGAVTQVKNQGQCGCCWAFSAVGSLEGAYKIATGNLMEFSEQEL 175

Query: 181 VDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGY 240
           +DC T+  + GC GG M +AF+FI  N G++ E+ Y Y     +C  +E   +A +IS Y
Sbjct: 176 LDCTTN--NYGCNGGFMTNAFDFIKENGGISRESDYEYLGQQYTCRSQEKT-AAVQISSY 232

Query: 241 EDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTAD 300
           + VP   E +L++AV  QPVS+ I AS  D QFY+ G + G C   ++H VTA+GYGT +
Sbjct: 233 QVVPEG-ETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCANRINHAVTAIGYGTDE 290

Query: 301 DGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
            G KYWL+KNSWGT+WGE+G++++ RD     GLC IA  +SYP
Sbjct: 291 KGQKYWLLKNSWGTSWGEDGFMKIIRDSGNPAGLCDIAKVSSYP 334


>gi|449469929|ref|XP_004152671.1| PREDICTED: cysteine proteinase RD21a-like [Cucumis sativus]
 gi|449529596|ref|XP_004171784.1| PREDICTED: cysteine proteinase RD21a-like [Cucumis sativus]
          Length = 431

 Score =  313 bits (802), Expect = 8e-83,   Method: Compositional matrix adjust.
 Identities = 155/311 (49%), Positives = 200/311 (64%), Gaps = 6/311 (1%)

Query: 33  ATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQ 92
           + ++E  E+W  ++G+ Y    EK  R  +F +N E++   NN   N  Y L +N +AD 
Sbjct: 23  SNVSELFEIWCTEHGKSYSSAEEKLYRLGVFADNYEFVTHHNN-LDNSSYTLSLNSYADL 81

Query: 93  TNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQCGCC 152
           T+ EF+  R G+   L + R     + S   +   VP S+DWRKKGAVT VKDQG CG C
Sbjct: 82  THHEFKVSRLGFSPALRNFRPVLPQEPSLPRD---VPDSLDWRKKGAVTAVKDQGSCGAC 138

Query: 153 WAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLAT 212
           W+FSA  AMEGIN I T  L SLSEQEL+DCD S  + GC GGLMD A++F+ISN G+ T
Sbjct: 139 WSFSATGAMEGINQIMTGSLISLSEQELIDCDRS-YNSGCGGGLMDYAYQFVISNHGIDT 197

Query: 213 EAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQ 272
           E  YPY+A DGSC K +   +   I GY D+PSN+E  L++AVA QPVSV I  S   FQ
Sbjct: 198 ENDYPYQARDGSCRKDKLQRNVVTIDGYADIPSNDEGKLLQAVAAQPVSVGICGSERAFQ 257

Query: 273 FYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKE 332
            YS G+F+G C T LDH V  VGYG+ ++G  YW+VKNSWG +WG +GY+ MQR+    E
Sbjct: 258 LYSKGIFSGPCSTSLDHAVLIVGYGS-ENGVDYWIVKNSWGKSWGMDGYMHMQRNSGNSE 316

Query: 333 GLCGIAMQASY 343
           G+CGI   ASY
Sbjct: 317 GVCGINKLASY 327


>gi|400180437|gb|AFP73356.1| cysteine protease [Solanum pennellii]
          Length = 337

 Score =  313 bits (801), Expect = 1e-82,   Method: Compositional matrix adjust.
 Identities = 158/344 (45%), Positives = 223/344 (64%), Gaps = 10/344 (2%)

Query: 1   MAMILLENKLVLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRF 60
           MAM +    +++    V+ ++  Q+  R+    +++ERHE+WM+++GRVY+D  EK  RF
Sbjct: 1   MAMKVDLMSILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERF 60

Query: 61  KIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVS 120
            IFKEN+++I S N KA N  YKLG+NEFAD T++EF A   G       +  S   D+S
Sbjct: 61  MIFKENMKFIESVN-KAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPINDLS 119

Query: 121 FRYENASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQEL 180
               +  +P+++DWR+ GAVT VK+QGQCGCCWAFSAV ++EG   I T  L   SEQEL
Sbjct: 120 ----DDDMPSNLDWRESGAVTQVKNQGQCGCCWAFSAVGSLEGAYKIATGNLMEFSEQEL 175

Query: 181 VDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGY 240
           +DC T+  + GC GG M +AF+FI  N G++ E+ Y Y     +C  +E   +A +IS Y
Sbjct: 176 LDCTTN--NYGCNGGFMTNAFDFIKENGGISRESDYEYLGQQYTCRSQEKT-AAVQISSY 232

Query: 241 EDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTAD 300
           + VP   E +L++AV  QPVS+ I AS  D QFY+ G + G C   ++H VTA+GYGT +
Sbjct: 233 QVVPEG-ETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCANRINHAVTAIGYGTDE 290

Query: 301 DGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
            G KYWL+KNSWGT+WGE+G++++ RD     GLC IA  +SYP
Sbjct: 291 KGQKYWLLKNSWGTSWGEDGFMKIIRDSGNPAGLCDIAKVSSYP 334


>gi|359359068|gb|AEV40975.1| putative cysteine protease [Oryza punctata]
          Length = 464

 Score =  313 bits (801), Expect = 1e-82,   Method: Compositional matrix adjust.
 Identities = 166/295 (56%), Positives = 209/295 (70%), Gaps = 8/295 (2%)

Query: 54  AEKEMRFKIFKENVEYIASFNNKAR-NKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVR 112
            E E RF++F +N++++ + N  A  +  ++LG+N FAD TN+EFRA    Y    P+ R
Sbjct: 85  GEYERRFRVFWDNLKFVDAHNAHADGHGGFRLGMNRFADLTNDEFRA---AYLGTTPAGR 141

Query: 113 SSETTDVSFRYENA-SVPASIDWRKKGAVTG-VKDQGQCGCCWAFSAVAAMEGINHITTR 170
                ++ +R++   ++P S+DWR KGAV   VK+QGQCG CWAFSAVAA+EGIN I T 
Sbjct: 142 GRHVGEM-YRHDGVEALPDSVDWRDKGAVVSPVKNQGQCGSCWAFSAVAAVEGINKIVTG 200

Query: 171 KLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEA 230
           +L SLSEQELV+C  +G + GC GG+MDDAF FI  N GL TE  YPY A DG C+  + 
Sbjct: 201 ELVSLSEQELVECARNGGNSGCNGGIMDDAFAFITRNGGLDTEEDYPYTAMDGKCDLAKK 260

Query: 231 NPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHG 290
           +     I G+EDVP N+E +L KAVA+QPVSVAIDA G +FQ Y SGVFTG+CGT LDHG
Sbjct: 261 SRKVVSIDGFEDVPENDELSLQKAVAHQPVSVAIDAGGREFQLYDSGVFTGRCGTSLDHG 320

Query: 291 VTAVGYGT-ADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
           V AVGYGT A  GT YW V+NSWG  WGENGYIRM+R++ A+ G CGIAM ASYP
Sbjct: 321 VVAVGYGTDAATGTDYWTVRNSWGPDWGENGYIRMERNVTARTGKCGIAMMASYP 375


>gi|400180385|gb|AFP73331.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score =  312 bits (800), Expect = 1e-82,   Method: Compositional matrix adjust.
 Identities = 157/347 (45%), Positives = 222/347 (63%), Gaps = 9/347 (2%)

Query: 1   MAMILLENKLVLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRF 60
           MAM +    +++    V+ ++  Q+  R+    +++ERHE+WM+++GRVY+D  EK  RF
Sbjct: 1   MAMKVDLMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERF 60

Query: 61  KIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVS 120
            IFKEN+++I S N KA N  YKLG+NEFAD T++EF A   G       +  S  +   
Sbjct: 61  MIFKENMKFIESVN-KAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTE 119

Query: 121 FRYENAS---VPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSE 177
           F+  + S   +P+++DWR+ GAVT VK QG+CGCCWAFSAV ++EG   I T  L   SE
Sbjct: 120 FKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSE 179

Query: 178 QELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKI 237
           QEL+DC T+  + GC GG M +AF+FI  N G++ E+ Y Y     +C  +E   +A +I
Sbjct: 180 QELLDCTTN--NYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQYTCRSQEKT-AAVQI 236

Query: 238 SGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYG 297
           S Y+ VP   E +L++AV  QPVS+ I AS  D QFY+ G + G C   ++H VTA+GYG
Sbjct: 237 SSYQVVPEG-ETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHAVTAIGYG 294

Query: 298 TADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
           T + G KYWL+KNSWGT+WGENG++++ RD     GLC I   +SYP
Sbjct: 295 TDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGDPSGLCDITKMSSYP 341


>gi|400180461|gb|AFP73367.1| cysteine protease [Solanum peruvianum]
 gi|400180473|gb|AFP73373.1| cysteine protease [Solanum peruvianum]
 gi|400180475|gb|AFP73374.1| cysteine protease [Solanum peruvianum]
 gi|400180479|gb|AFP73376.1| cysteine protease [Solanum peruvianum]
 gi|400180481|gb|AFP73377.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score =  312 bits (800), Expect = 1e-82,   Method: Compositional matrix adjust.
 Identities = 158/347 (45%), Positives = 222/347 (63%), Gaps = 9/347 (2%)

Query: 1   MAMILLENKLVLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRF 60
           MAM +    +++    V+ ++  Q+  R+    +++ERHE+WM+++GRVY+D  EK  RF
Sbjct: 1   MAMKVDLMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERF 60

Query: 61  KIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVS 120
            IFKEN+++I S N KA N  YKLG+NEFAD T++EF A   G       +  S  +   
Sbjct: 61  MIFKENMKFIESVN-KAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTE 119

Query: 121 FRYENAS---VPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSE 177
           F   + S   +P+++DWR+ GAVT VK QG+CGCCWAFSAV ++EG   I T  L   SE
Sbjct: 120 FIINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSE 179

Query: 178 QELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKI 237
           QEL+DC T+  + GC GG M +AF+FI  N G++ E+ Y Y     +C  +E   +A +I
Sbjct: 180 QELLDCTTN--NYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQYTCRSQEKT-AAVQI 236

Query: 238 SGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYG 297
           S Y+ VP   E +L++AV  QPVS+ I AS  D QFY+ G + G C   ++H VTA+GYG
Sbjct: 237 SSYQVVPEG-ETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHAVTAIGYG 294

Query: 298 TADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
           T + G KYWL+KNSWGT+WGENG++++ RD     GLC IA  +SYP
Sbjct: 295 TDEKGQKYWLLKNSWGTSWGENGFMKIIRDYGNPAGLCDIAKMSSYP 341


>gi|400180399|gb|AFP73338.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score =  312 bits (799), Expect = 2e-82,   Method: Compositional matrix adjust.
 Identities = 158/347 (45%), Positives = 222/347 (63%), Gaps = 9/347 (2%)

Query: 1   MAMILLENKLVLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRF 60
           MAM +    +++    V+ ++  Q+  R+    +++ERHE+WM+++GRVY+D  EK  RF
Sbjct: 1   MAMKVDLMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERF 60

Query: 61  KIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVS 120
            IFKEN+++I S N KA N  YKLG+NEFAD T++EF A   G       +  S  +   
Sbjct: 61  MIFKENMKFIESVN-KAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTE 119

Query: 121 FRYENAS---VPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSE 177
           F   + S   +P+++DWR+ GAVT VK QG+CGCCWAFSAV ++EG   I T  L   SE
Sbjct: 120 FIINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSE 179

Query: 178 QELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKI 237
           QEL+DC T+  + GC GG M +AF+FI  N G++ E+ Y Y     +C  +E   +A +I
Sbjct: 180 QELLDCTTN--NYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQYTCRSQEKT-AAVQI 236

Query: 238 SGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYG 297
           S Y+ VP   E +L++AV  QPVS+ I AS  D QFY+ G + G C   ++H VTA+GYG
Sbjct: 237 SSYQVVPEG-ETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHAVTAIGYG 294

Query: 298 TADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
           T + G KYWL+KNSWGT+WGENG++++ RD     GLC IA  +SYP
Sbjct: 295 TDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGDPSGLCDIAKMSSYP 341


>gi|400180371|gb|AFP73324.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score =  312 bits (799), Expect = 2e-82,   Method: Compositional matrix adjust.
 Identities = 158/347 (45%), Positives = 222/347 (63%), Gaps = 9/347 (2%)

Query: 1   MAMILLENKLVLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRF 60
           MAM +    +++    V+ ++  Q+  R+    +++ERHE+WM+++GRVY+D  EK  RF
Sbjct: 1   MAMKVDLMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERF 60

Query: 61  KIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVS 120
            IFKEN+++I S N KA N  YKLG+NEFAD T++EF A   G       +  S  +   
Sbjct: 61  MIFKENMKFIESVN-KAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTE 119

Query: 121 FRYENAS---VPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSE 177
           F   + S   +P+++DWR+ GAVT VK QG+CGCCWAFSAV ++EG   I T  L   SE
Sbjct: 120 FIINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSE 179

Query: 178 QELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKI 237
           QEL+DC T+  + GC GG M +AF+FI  N G++ E+ Y Y     +C  +E   +A +I
Sbjct: 180 QELLDCTTN--NYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQYTCRSQEKT-AAVQI 236

Query: 238 SGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYG 297
           S Y+ VP   E +L++AV  QPVS+ I AS  D QFY+ G + G C   ++H VTA+GYG
Sbjct: 237 SSYQVVPEG-ETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHAVTAIGYG 294

Query: 298 TADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
           T + G KYWL+KNSWGT+WGENG++++ RD     GLC IA  +SYP
Sbjct: 295 TDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGNPSGLCDIAKMSSYP 341


>gi|400180349|gb|AFP73313.1| cysteine protease [Solanum peruvianum]
 gi|400180469|gb|AFP73371.1| cysteine protease [Solanum peruvianum]
 gi|400180471|gb|AFP73372.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score =  312 bits (799), Expect = 2e-82,   Method: Compositional matrix adjust.
 Identities = 157/347 (45%), Positives = 222/347 (63%), Gaps = 9/347 (2%)

Query: 1   MAMILLENKLVLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRF 60
           MAM +    +++    V+ ++  Q+  R+    +++ERHE+WM+++GRVY+D  EK  RF
Sbjct: 1   MAMKVDLMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKVERF 60

Query: 61  KIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVS 120
            IFKEN+++I S N KA N  YKLG+NEFAD T++EF A   G       +  S  +   
Sbjct: 61  MIFKENMKFIESVN-KAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTE 119

Query: 121 FRYENAS---VPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSE 177
            +  + S   +P+++DWR+ GAVT VK QG+CGCCWAFSAV ++EG   I T  L   SE
Sbjct: 120 LKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSE 179

Query: 178 QELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKI 237
           QEL+DC T+  + GC GG M +AF+FI  N G++ E+ Y Y     +C  +E   +A +I
Sbjct: 180 QELLDCTTN--NYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQYTCRSQEKT-AAVQI 236

Query: 238 SGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYG 297
           S Y+ VP   E +L++AV  QPVS+ I AS  D QFY+ G + G C   ++H VTA+GYG
Sbjct: 237 SSYQVVPEG-ETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHAVTAIGYG 294

Query: 298 TADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
           T + G KYWL+KNSWGT+WGENG++++ RD     GLC IA  +SYP
Sbjct: 295 TDEKGQKYWLLKNSWGTSWGENGFMKIIRDYGNPAGLCDIAKMSSYP 341


>gi|400180363|gb|AFP73320.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score =  312 bits (799), Expect = 2e-82,   Method: Compositional matrix adjust.
 Identities = 157/347 (45%), Positives = 222/347 (63%), Gaps = 9/347 (2%)

Query: 1   MAMILLENKLVLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRF 60
           MAM +    +++    V+ ++  Q+  R+    +++ERHE+WM+++GRVY+D  EK  RF
Sbjct: 1   MAMKVDLMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERF 60

Query: 61  KIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVS 120
            IFKEN+++I S N KA N  YKLG+NEFAD T++EF A   G       +  S  +   
Sbjct: 61  MIFKENMKFIESVN-KAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTE 119

Query: 121 FRYENAS---VPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSE 177
            +  + S   +P+++DWR+ GAVT VK QG+CGCCWAFSAV ++EG   I T  L   SE
Sbjct: 120 LKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSE 179

Query: 178 QELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKI 237
           QEL+DC T+  + GC GG M +AF+FI  N G++ E+ Y Y     +C  +E   +A +I
Sbjct: 180 QELLDCTTN--NYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQYTCRSQEKT-AAVQI 236

Query: 238 SGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYG 297
           S Y+ VP   E +L++AV  QPVS+ I AS  D QFY+ G + G C   ++H VTA+GYG
Sbjct: 237 SSYQVVPEG-ETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHAVTAIGYG 294

Query: 298 TADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
           T + G KYWL+KNSWGT+WGENG++++ RD     GLC IA  +SYP
Sbjct: 295 TDEKGQKYWLLKNSWGTSWGENGFMKIIRDYGNPAGLCDIAKMSSYP 341


>gi|944916|gb|AAA74430.1| cysteine proteinase [Mesembryanthemum crystallinum]
          Length = 367

 Score =  311 bits (798), Expect = 2e-82,   Method: Compositional matrix adjust.
 Identities = 164/317 (51%), Positives = 206/317 (64%), Gaps = 14/317 (4%)

Query: 31  NDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFA 90
           +D T+ + +E W + Y    R   EK+ RF +FKENV+YI   N    +KPYKL +N+F 
Sbjct: 36  SDETLWDLYERWRSVYTSA-RSFGEKQNRFHVFKENVKYINEVNKM--DKPYKLRLNQFG 92

Query: 91  DQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQCG 150
           D T  EF       K     +  +      F YEN  VP SIDWR KGAVT VK+QG+CG
Sbjct: 93  DLTPSEFARTYANSK----IIEGTRNESGGFMYENVEVPRSIDWRVKGAVTPVKNQGRCG 148

Query: 151 CCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGL 210
            CWAFSA AA+EGIN ITT +L SLSEQ+L+DCDT  ++ GC GG M  AFE+I    G+
Sbjct: 149 GCWAFSAAAAVEGINQITTGQLISLSEQQLIDCDT--QNSGCRGGTMGRAFEYIKQRGGI 206

Query: 211 ATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDA---S 267
            +EA YPYKA  G C           I GY ++   +E A++K +A+QPVSVA+DA   S
Sbjct: 207 TSEANYPYKAQAGMCKNNLIQRPTVSIDGYYNI-RRSEDAVLKILAHQPVSVAVDATTWS 265

Query: 268 GSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRD 327
             D+ FY  GVFTG CGT+L+HGVTAVGYGT +DG  YW++KNSWG TWGE GY+RM R 
Sbjct: 266 SLDWMFYFQGVFTGPCGTKLNHGVTAVGYGTTNDGYDYWIIKNSWGETWGERGYMRMLRG 325

Query: 328 IDAKEGLCGIAMQASYP 344
           + +  GLCGIAMQAS+P
Sbjct: 326 V-SPYGLCGIAMQASFP 341


>gi|356542171|ref|XP_003539543.1| PREDICTED: LOW QUALITY PROTEIN: KDEL-tailed cysteine endopeptidase
           CEP2-like [Glycine max]
          Length = 342

 Score =  311 bits (798), Expect = 2e-82,   Method: Compositional matrix adjust.
 Identities = 161/346 (46%), Positives = 215/346 (62%), Gaps = 13/346 (3%)

Query: 1   MAMILLENKLVLAAILVLGVWAPQSWSRTLNDA-TMNERHEMWMAQYGRVYRDNAEKEMR 59
           + ++ + N LVL  + +     P   +   +D+  M  R+E W+ +YG+ YR+  E E R
Sbjct: 5   ITLVAIINLLVLCNLWITASACPAKHNDNSSDSEVMRMRYESWLKKYGQKYRNKDEWEFR 64

Query: 60  FKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDV 119
           F+I++ NV++I  +N  ++N  YKL  N+F D TNEEFR     Y+ R            
Sbjct: 65  FEIYRANVQFIEVYN--SQNYSYKLMDNKFVDLTNEEFRRMYLVYQPR-------SHLQT 115

Query: 120 SFRYE-NASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQ 178
            F Y+ +  +P  IDWR +GAVT +KDQG CG CW+FSAVA +E IN I T KL SLSEQ
Sbjct: 116 RFMYQKHGDLPKRIDWRTRGAVTXIKDQGHCGSCWSFSAVATVEDINKIKTGKLVSLSEQ 175

Query: 179 ELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKIS 238
           +L+DCD    ++GC GG M+  F FI    GL T+  YPY+ SDG  NK +    A  I 
Sbjct: 176 QLIDCDNRNGNEGCNGGHME-TFTFITKRGGLTTDKNYPYQGSDGDXNKAKVRNHAVAIC 234

Query: 239 GYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGT 298
           GYE++P++NE  L  AVA+QP SVA DA G  FQ YS G F+G CG +L+H +T VGYG 
Sbjct: 235 GYENLPAHNENMLKAAVAHQPASVATDAGGYAFQLYSKGTFSGSCGKDLNHRMTIVGYG- 293

Query: 299 ADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
            ++G KYWLVKNSW    G +GYIRM+RD   K+G CG AM+ASYP
Sbjct: 294 EENGEKYWLVKNSWANDXGVSGYIRMKRDPKDKDGTCGTAMEASYP 339


>gi|18413505|ref|NP_567376.1| putative cysteine proteinase [Arabidopsis thaliana]
 gi|30315954|sp|Q9SUT0.1|CPR3_ARATH RecName: Full=Probable cysteine proteinase At4g11310; Flags:
           Precursor
 gi|5596477|emb|CAB51415.1| drought-inducible cysteine proteinase RD21A precursor-like protein
           [Arabidopsis thaliana]
 gi|7267830|emb|CAB81232.1| drought-inducible cysteine proteinase RD21A precursor-like protein
           [Arabidopsis thaliana]
 gi|332657595|gb|AEE82995.1| putative cysteine proteinase [Arabidopsis thaliana]
          Length = 364

 Score =  311 bits (798), Expect = 2e-82,   Method: Compositional matrix adjust.
 Identities = 163/352 (46%), Positives = 229/352 (65%), Gaps = 16/352 (4%)

Query: 2   AMILLENKLVLAAI-----LVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEK 56
           AM++L   +V+A+      + +  +   +   ++ DA  +   E WM ++G+VY   AEK
Sbjct: 7   AMLILLVAMVIASCATAIDMSVVSYDDNNRLHSVFDAEASLIFESWMVKHGKVYGSVAEK 66

Query: 57  EMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSET 116
           E R  IF++N+ +I   N  A N  Y+LG+  FAD +  E++   +G   R P  R+   
Sbjct: 67  ERRLTIFEDNLRFIN--NRNAENLSYRLGLTGFADLSLHEYKEVCHGADPRPP--RNHVF 122

Query: 117 TDVSFRYENAS---VPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLT 173
              S RY+ ++   +P S+DWR +GAVT VKDQG C  CWAFS V A+EG+N I T +L 
Sbjct: 123 MTSSDRYKTSADDVLPKSVDWRNEGAVTEVKDQGHCRSCWAFSTVGAVEGLNKIVTGELV 182

Query: 174 SLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKK-EANP 232
           +LSEQ+L++C+   E+ GC GG ++ A+EFI+ N GL T+  YPYKA +G C+ + + N 
Sbjct: 183 TLSEQDLINCNK--ENNGCGGGKLETAYEFIMKNGGLGTDNDYPYKAVNGVCDGRLKENN 240

Query: 233 SAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVT 292
               I GYE++P+N+E+ALMKAVA+QPV+  ID+S  +FQ Y SGVF G CGT L+HGV 
Sbjct: 241 KNVMIDGYENLPANDESALMKAVAHQPVTAVIDSSSREFQLYESGVFDGSCGTNLNHGVV 300

Query: 293 AVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
            VGYGT ++G  YWLVKNS G TWGE GY++M R+I    GLCGIAM+ASYP
Sbjct: 301 VVGYGT-ENGRDYWLVKNSRGITWGEAGYMKMARNIANPRGLCGIAMRASYP 351


>gi|224085750|ref|XP_002307688.1| predicted protein [Populus trichocarpa]
 gi|222857137|gb|EEE94684.1| predicted protein [Populus trichocarpa]
          Length = 436

 Score =  311 bits (797), Expect = 3e-82,   Method: Compositional matrix adjust.
 Identities = 156/308 (50%), Positives = 195/308 (63%), Gaps = 10/308 (3%)

Query: 40  EMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRA 99
           E W  ++G+ Y    E+  R K+F++N +++   N+K  N  Y L +N FAD T+ EF+ 
Sbjct: 30  ETWCKEHGKSYTSQEERSHRLKVFEDNYDFVTKHNSKG-NSSYSLALNAFADLTHHEFKT 88

Query: 100 PRNGYKRRLPSV--RSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQCGCCWAFSA 157
            R G      ++  R+ E T V        +PASIDWR KG VT VKDQG CG CW+FSA
Sbjct: 89  SRLGLSAAPLNLAHRNLEITGVV-----GDIPASIDWRNKGVVTNVKDQGSCGACWSFSA 143

Query: 158 VAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYP 217
             A+EGIN I T  L SLSEQEL++CD S  D GC GGLMD AF+F+I+N G+ TE  YP
Sbjct: 144 TGAIEGINKIVTGSLVSLSEQELIECDKSYND-GCGGGLMDYAFQFVINNHGIDTEEDYP 202

Query: 218 YKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSG 277
           Y+A DG+CNK         I  Y DVP NNE  L++AVA QPVSV I  S   FQ YS G
Sbjct: 203 YRARDGTCNKDRMKRRVVTIDKYVDVPENNEKQLLQAVAAQPVSVGICGSERAFQMYSKG 262

Query: 278 VFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGI 337
           +FTG C T LDH V  VGYG+ ++G  YW+VKNSWGT WG  GY+ MQR+    +G+CGI
Sbjct: 263 IFTGPCSTSLDHAVLIVGYGS-ENGVDYWIVKNSWGTGWGMRGYMHMQRNSGNSQGVCGI 321

Query: 338 AMQASYPT 345
            M ASYP 
Sbjct: 322 NMLASYPV 329


>gi|22093636|dbj|BAC06931.1| putative cysteine proteinase [Oryza sativa Japonica Group]
 gi|50510021|dbj|BAD30633.1| putative cysteine proteinase [Oryza sativa Japonica Group]
          Length = 352

 Score =  311 bits (797), Expect = 3e-82,   Method: Compositional matrix adjust.
 Identities = 168/354 (47%), Positives = 229/354 (64%), Gaps = 18/354 (5%)

Query: 3   MILLENKL-VLAAILVLGVWAPQSWSRTLNDA----TMNERHEMWMAQYGRVYRDNAEKE 57
           M    +KL V+AA L+L V    S    +  A    TM  RH+ WMA++GR Y+D AEK 
Sbjct: 1   MARTSSKLQVMAASLLLVVAGGLSTMAKVTMASRAGTMEARHDKWMAEHGRTYKDAAEKA 60

Query: 58  MRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETT 117
            RF++FK NV+ I   +N A NK Y+L  N F D T+ EF A   GY     ++ ++   
Sbjct: 61  RRFRVFKANVDLI-DRSNAAGNKRYRLATNRFTDLTDAEFAAMYTGYNP-ANTMYAAANA 118

Query: 118 DVSFRYENASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSE 177
                 E+   PA +DWR++GAVTGVK+Q  CGCCWAFS VAA+EGI+ ITT +L SLSE
Sbjct: 119 TTRLSSEDDQQPAEVDWRQQGAVTGVKNQRSCGCCWAFSTVAAVEGIHQITTGELVSLSE 178

Query: 178 QELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCN---KKEANPSA 234
           Q+L+DC  +G   GC GG +D+AF+++ ++ G+ TEA Y Y+ + G+C       A+  A
Sbjct: 179 QQLLDCADNG---GCTGGSLDNAFQYMANSGGVTTEAAYAYQGAQGACQFDASSSASGVA 235

Query: 235 AKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTG-QCGTELDHGVTA 293
           A ISGY+ V  N+E +L  AVA+QPVSVAI+ SG+ F+ Y SGVFT   CGT+LDH V  
Sbjct: 236 ATISGYQRVNPNDEGSLAAAVASQPVSVAIEGSGAMFRHYGSGVFTADSCGTKLDHAVAV 295

Query: 294 VGYGTADDGT---KYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
           VGYG   DG+    YW++KNSWGTTWG+ GY+++++D+   +G CG+AM  SYP
Sbjct: 296 VGYGAEADGSGGGGYWIIKNSWGTTWGDGGYMKLEKDV-GSQGACGVAMAPSYP 348


>gi|20260334|gb|AAM13065.1| drought-inducible cysteine proteinase RD21A precursor-like protein
           [Arabidopsis thaliana]
 gi|23197782|gb|AAN15418.1| drought-inducible cysteine proteinase RD21A precursor-like protein
           [Arabidopsis thaliana]
          Length = 357

 Score =  311 bits (796), Expect = 4e-82,   Method: Compositional matrix adjust.
 Identities = 158/317 (49%), Positives = 213/317 (67%), Gaps = 11/317 (3%)

Query: 32  DATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFAD 91
           DA  +   E WM ++G+VY   AEKE R  IF++N+ +I   N  A N  Y+LG+  FAD
Sbjct: 35  DAEASLIFESWMVKHGKVYGSVAEKERRLTIFEDNLRFIN--NRNAENLSYRLGLTGFAD 92

Query: 92  QTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENAS---VPASIDWRKKGAVTGVKDQGQ 148
            +  E++   +G   R P  R+      S RY+ ++   +P S+DWR +GAVT VKDQG 
Sbjct: 93  LSLHEYKEVCHGADPRPP--RNHVFMTSSDRYKTSADDVLPKSVDWRNEGAVTEVKDQGH 150

Query: 149 CGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNK 208
           C  CWAFS V A+EG+N I T +L +LSEQ+L++C+   E+ GC GG ++ A+EFI+ N 
Sbjct: 151 CRSCWAFSTVGAVEGLNKIVTGELVTLSEQDLINCNK--ENNGCGGGKLETAYEFIMKNG 208

Query: 209 GLATEAKYPYKASDGSCNKK-EANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDAS 267
           GL T+  YPYKA +G C+ + + N     I GYE++P+N+E+ALMKAVA+QPV+  ID+S
Sbjct: 209 GLGTDNDYPYKAVNGVCDGRLKENNKNVMIDGYENLPANDESALMKAVAHQPVTAVIDSS 268

Query: 268 GSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRD 327
             +FQ Y SGVF G CGT L+HGV  VGYGT ++G  YWLVKNS G TWGE GY++M R+
Sbjct: 269 SREFQLYESGVFDGSCGTNLNHGVVVVGYGT-ENGRDYWLVKNSRGITWGEAGYMKMARN 327

Query: 328 IDAKEGLCGIAMQASYP 344
           I    GLCGIAM+ASYP
Sbjct: 328 IANPRGLCGIAMRASYP 344


>gi|18391078|ref|NP_563855.1| xylem bark cysteine peptidase 3 [Arabidopsis thaliana]
 gi|110741821|dbj|BAE98853.1| papain-like cysteine peptidase XBCP3 [Arabidopsis thaliana]
 gi|111074448|gb|ABH04597.1| At1g09850 [Arabidopsis thaliana]
 gi|332190386|gb|AEE28507.1| xylem bark cysteine peptidase 3 [Arabidopsis thaliana]
          Length = 437

 Score =  310 bits (795), Expect = 5e-82,   Method: Compositional matrix adjust.
 Identities = 156/311 (50%), Positives = 201/311 (64%), Gaps = 7/311 (2%)

Query: 35  MNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTN 94
           ++E  + W  ++G+ Y    E++ R +IFK+N +++   +N   N  Y L +N FAD T+
Sbjct: 28  ISELFDDWCQKHGKTYGSEEERQQRIQIFKDNHDFVTQ-HNLITNATYSLSLNAFADLTH 86

Query: 95  EEFRAPRNGYKRRLPSV-RSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQCGCCW 153
            EF+A R G     PSV  +S+   +     +  VP S+DWRKKGAVT VKDQG CG CW
Sbjct: 87  HEFKASRLGLSVSAPSVIMASKGQSLG---GSVKVPDSVDWRKKGAVTNVKDQGSCGACW 143

Query: 154 AFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATE 213
           +FSA  AMEGIN I T  L SLSEQEL+DCD S  + GC GGLMD AFEF+I N G+ TE
Sbjct: 144 SFSATGAMEGINQIVTGDLISLSEQELIDCDKS-YNAGCNGGLMDYAFEFVIKNHGIDTE 202

Query: 214 AKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQF 273
             YPY+  DG+C K +       I  Y  V SN+E ALM+AVA QPVSV I  S   FQ 
Sbjct: 203 KDYPYQERDGTCKKDKLKQKVVTIDSYAGVKSNDEKALMEAVAAQPVSVGICGSERAFQL 262

Query: 274 YSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEG 333
           YSSG+F+G C T LDH V  VGYG+  +G  YW+VKNSWG +WG +G++ MQR+ +  +G
Sbjct: 263 YSSGIFSGPCSTSLDHAVLIVGYGS-QNGVDYWIVKNSWGKSWGMDGFMHMQRNTENSDG 321

Query: 334 LCGIAMQASYP 344
           +CGI M ASYP
Sbjct: 322 VCGINMLASYP 332


>gi|400180391|gb|AFP73334.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score =  310 bits (795), Expect = 5e-82,   Method: Compositional matrix adjust.
 Identities = 157/347 (45%), Positives = 223/347 (64%), Gaps = 9/347 (2%)

Query: 1   MAMILLENKLVLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRF 60
           MAM +    +++    V+ ++  Q+ +R+    +++ERHE+WM+++GRVY+D  EK  RF
Sbjct: 1   MAMKVDLMSILITLFFVISMFNSQTRARSQPKLSVSERHELWMSRHGRVYKDEVEKGERF 60

Query: 61  KIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVS 120
            IFKEN+++I S N KA N  YKLG+NEFAD T++EF A   G       +  S  +   
Sbjct: 61  MIFKENMKFIESVN-KAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTE 119

Query: 121 FRYENAS---VPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSE 177
           F+  + S   +P+++DWR+ GAVT VK QG+CGCCWAFSAV ++EG   I T  L   SE
Sbjct: 120 FKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSE 179

Query: 178 QELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKI 237
           QEL+DC T+  + GC GG M +AF+FI  N G++ E+ Y Y     +C  +E   +A +I
Sbjct: 180 QELLDCTTN--NYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQYTCRSQEKT-AAVQI 236

Query: 238 SGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYG 297
           S Y+ VP   E +L++AV  QPVS+ I AS  D QF + G + G C   ++H VTA+GYG
Sbjct: 237 SSYQVVPEG-ETSLLQAVTKQPVSIGIAAS-QDLQFCAGGTYDGSCADRINHAVTAIGYG 294

Query: 298 TADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
           T + G KYWL+KNSWGT+WGENG++++ RD     GLC IA  +SYP
Sbjct: 295 TDEKGQKYWLLKNSWGTSWGENGFMKIIRDYGNPAGLCDIAKMSSYP 341


>gi|400180361|gb|AFP73319.1| cysteine protease [Solanum peruvianum]
 gi|400180397|gb|AFP73337.1| cysteine protease [Solanum peruvianum]
 gi|400180401|gb|AFP73339.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score =  310 bits (795), Expect = 5e-82,   Method: Compositional matrix adjust.
 Identities = 157/347 (45%), Positives = 221/347 (63%), Gaps = 9/347 (2%)

Query: 1   MAMILLENKLVLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRF 60
           MAM +    +++    V+ ++  Q+  R+    +++ERHE+WM+++GRVY+D  EK  RF
Sbjct: 1   MAMKVDLMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERF 60

Query: 61  KIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVS 120
            IFKEN+++I S N KA N  YKLG+NEFAD T++EF A   G       +  S  +   
Sbjct: 61  MIFKENMKFIESVN-KAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTE 119

Query: 121 FRYENAS---VPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSE 177
           F   + S   +P+++DWR+ GAVT VK QG+CGCCWAFSAV ++EG   I T  L   SE
Sbjct: 120 FIINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSE 179

Query: 178 QELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKI 237
           QEL+DC T+  + GC GG M +AF+FI  N G++ E+ Y Y     +C  +E   +A +I
Sbjct: 180 QELLDCTTN--NYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQYTCRSQEKT-AAVQI 236

Query: 238 SGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYG 297
           S Y+ VP   E +L++AV  QPVS+ I AS  D QFY+ G + G C   ++H VTA+GYG
Sbjct: 237 SSYQVVPEG-ETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHAVTAIGYG 294

Query: 298 TADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
           T + G KYWL+KNSWGT+WGENG++++ RD     GLC I   +SYP
Sbjct: 295 TDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGDPSGLCDITKMSSYP 341


>gi|558563|emb|CAA57538.1| cysteine proteinase [Cicer arietinum]
          Length = 325

 Score =  310 bits (795), Expect = 5e-82,   Method: Compositional matrix adjust.
 Identities = 158/312 (50%), Positives = 205/312 (65%), Gaps = 13/312 (4%)

Query: 39  HEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFR 98
           +E W+ ++ ++Y    EK+ RF+IFK+N+ +I   N  A+N  YK+G+N+FAD  NEE+R
Sbjct: 4   YEKWLVKHQKMYNGLGEKDTRFQIFKDNLRFIDEHN--AQNYSYKVGLNKFADINNEEYR 61

Query: 99  ----APRNGYKRRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQCGCCWA 154
                 ++  KRR   V  ++ T     Y +  V   +DWR KGAVT +KDQG CG CWA
Sbjct: 62  DMYLGTKSDAKRR---VMKTKITGHRITYNSVIVTVKVDWRLKGAVTHIKDQGSCGSCWA 118

Query: 155 FSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEA 214
           FS +A +E IN I T K  SLSEQELVDCD +  ++GC GGLMD AFEFII N G+ T+ 
Sbjct: 119 FSTIATVEAINKIVTGKFVSLSEQELVDCDRA-FNEGCNGGLMDYAFEFIIRNGGIDTDQ 177

Query: 215 KYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFY 274
            YPY   +  C+  + N     I GYEDVPS    AL KAVA+QPVSVAI   G   Q Y
Sbjct: 178 DYPYNGFERKCDPTKKNAKVVSIDGYEDVPSYMN-ALKKAVAHQPVSVAIAGLGRALQLY 236

Query: 275 SSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRM-QRDIDAKEG 333
            SGVFTG+CGT+LDHGV  VGYG+ ++G  YWLV+NSWGT WGE+GY ++  R++ +   
Sbjct: 237 QSGVFTGKCGTDLDHGVVVVGYGS-ENGVDYWLVRNSWGTNWGEDGYFKIASRNVKSLYR 295

Query: 334 LCGIAMQASYPT 345
            CGIAM+ASYP 
Sbjct: 296 KCGIAMEASYPV 307


>gi|400180465|gb|AFP73369.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score =  310 bits (794), Expect = 6e-82,   Method: Compositional matrix adjust.
 Identities = 157/347 (45%), Positives = 222/347 (63%), Gaps = 9/347 (2%)

Query: 1   MAMILLENKLVLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRF 60
           MAM +    +++    V+ ++  Q+  R+    +++ERHE+WM+++GRVY+D  EK  RF
Sbjct: 1   MAMKVDLMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERF 60

Query: 61  KIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVS 120
            IFKEN+++I S N KA N  YKLG+NEFAD T++EF A   G       +  S  +   
Sbjct: 61  MIFKENMKFIESVN-KAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTE 119

Query: 121 FRYENAS---VPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSE 177
           F+  + S   +P+++DWR+ GAVT VK QG+CGCCWAFSAV ++E    I T  L   SE
Sbjct: 120 FKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEVAYKIATGNLMEFSE 179

Query: 178 QELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKI 237
           QEL+DC T+  + GC GG M +AF+FI  N G++ E+ Y Y     +C  +E   +A +I
Sbjct: 180 QELLDCTTN--NYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQYTCRSQEKT-AAVQI 236

Query: 238 SGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYG 297
           S Y+ VP   E +L++AV  QPVS+ I AS  D QFY+ G + G C   ++H VTA+GYG
Sbjct: 237 SSYQVVPEG-ETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHAVTAIGYG 294

Query: 298 TADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
           T + G KYWL+KNSWGT+WGENG++++ RD     GLC IA  +SYP
Sbjct: 295 TDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGNPAGLCDIAKMSSYP 341


>gi|218198967|gb|EEC81394.1| hypothetical protein OsI_24614 [Oryza sativa Indica Group]
          Length = 342

 Score =  310 bits (793), Expect = 8e-82,   Method: Compositional matrix adjust.
 Identities = 157/318 (49%), Positives = 214/318 (67%), Gaps = 13/318 (4%)

Query: 34  TMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQT 93
           TM  RH+ WMA++GR Y+D AEK  RF++FK NV+ I   +N A NK Y+L  N F D T
Sbjct: 27  TMEARHDKWMAEHGRTYKDAAEKARRFRVFKANVDLI-DRSNAAGNKRYRLATNRFTDLT 85

Query: 94  NEEFRAPRNGYKRRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQCGCCW 153
           + EF A   GY     ++ ++         E+   PA +DWR++GAVTGVK+Q  CGCCW
Sbjct: 86  DAEFAAMYTGYNP-ANTMYAAANATTRLSSEDDQQPAEVDWRQQGAVTGVKNQRSCGCCW 144

Query: 154 AFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATE 213
           AFS VAA+EGI+ ITT +L SLSEQ+L+DC  +G   GC GG +D+AF+++ ++ G+ TE
Sbjct: 145 AFSTVAAVEGIHQITTGELVSLSEQQLLDCADNG---GCTGGSLDNAFQYMANSGGVTTE 201

Query: 214 AKYPYKASDGSCN---KKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSD 270
           A Y Y+ + G+C       A+  AA ISGY+ V  N+E +L  AVA+QPVSVAI+ SG+ 
Sbjct: 202 AAYAYQGAQGACQFDASSSASGVAATISGYQRVNPNDEGSLAAAVASQPVSVAIEGSGAM 261

Query: 271 FQFYSSGVFTG-QCGTELDHGVTAVGYGTADDGT---KYWLVKNSWGTTWGENGYIRMQR 326
           F+ Y SGVFT   CGT+LDH V  VGYG   DG+    YW++KNSWGTTWG+ GY+++++
Sbjct: 262 FRHYGSGVFTADSCGTKLDHAVAVVGYGAEADGSGGGGYWIIKNSWGTTWGDGGYMKLEK 321

Query: 327 DIDAKEGLCGIAMQASYP 344
           D+   +G CG+AM  SYP
Sbjct: 322 DV-GSQGACGVAMAPSYP 338


>gi|400180387|gb|AFP73332.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score =  309 bits (791), Expect = 1e-81,   Method: Compositional matrix adjust.
 Identities = 156/347 (44%), Positives = 221/347 (63%), Gaps = 9/347 (2%)

Query: 1   MAMILLENKLVLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRF 60
           MAM +    +++    V+ ++  Q+  R+    +++ERHE+WM+++GRVY+D  EK  RF
Sbjct: 1   MAMKVDLMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKVERF 60

Query: 61  KIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVS 120
            IFKEN+++I S N KA N  YKLG+NEFAD T++EF A   G       +  S  +   
Sbjct: 61  MIFKENMKFIESVN-KAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTE 119

Query: 121 FRYENAS---VPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSE 177
            +  + S   +P+++DW + GAVT VK QG+CGCCWAFSAV ++EG   I T  L   SE
Sbjct: 120 LKINDLSDDDMPSNLDWIESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSE 179

Query: 178 QELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKI 237
           QEL+DC T+  + GC GG M +AF+FI  N G++ E+ Y Y     +C  +E   +A +I
Sbjct: 180 QELLDCTTN--NYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQYTCRSQEKT-AAVQI 236

Query: 238 SGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYG 297
           S Y+ VP   E +L++AV  QPVS+ I AS  D QFY+ G + G C   ++H VTA+GYG
Sbjct: 237 SSYQVVPEG-ETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHAVTAIGYG 294

Query: 298 TADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
           T + G KYWL+KNSWGT+WGENG++++ RD     GLC IA  +SYP
Sbjct: 295 TDEKGQKYWLLKNSWGTSWGENGFMKIIRDYGNPAGLCDIAKMSSYP 341


>gi|3451077|emb|CAA20473.1| cysteine proteinase-like protein [Arabidopsis thaliana]
 gi|7269200|emb|CAB79307.1| cysteine proteinase-like protein [Arabidopsis thaliana]
          Length = 355

 Score =  309 bits (791), Expect = 1e-81,   Method: Compositional matrix adjust.
 Identities = 150/307 (48%), Positives = 209/307 (68%), Gaps = 9/307 (2%)

Query: 40  EMWMAQYGRVYRDN-AEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFR 98
           +MWM+++G+ Y +   EKE RF+ FK+N+ +I   N  A+N  Y+LG+  FAD T +E+R
Sbjct: 48  QMWMSKHGKTYTNALGEKERRFQNFKDNLRFIDQHN--AKNLSYQLGLTRFADLTVQEYR 105

Query: 99  APRNGYKRRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAV 158
               G  +  P  R+ +T+          +P S+DWR++GAV+ +KDQG C  CWAFS V
Sbjct: 106 DLFPGSPK--PKQRNLKTSRRYVPLAGDQLPESVDWRQEGAVSEIKDQGTCNSCWAFSTV 163

Query: 159 AAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEG-GLMDDAFEFIISNKGLATEAKYP 217
           AA+EG+N I T +L SLSEQELVDC+    + GC G GLMD AF+F+I+N GL +E  YP
Sbjct: 164 AAVEGLNKIVTGELISLSEQELVDCNLV--NNGCYGSGLMDTAFQFLINNNGLDSEKDYP 221

Query: 218 YKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSG 277
           Y+ + GSCN+K+ +     I  YEDVP+N+E +L KAVA+QPVSV +D    +F  Y S 
Sbjct: 222 YQGTQGSCNRKQVHLLVITIDSYEDVPANDEISLQKAVAHQPVSVGVDKKSQEFMLYRSC 281

Query: 278 VFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGI 337
           ++ G CGT LDH +  VGYG+ ++G  YW+V+NSWGTTWG+ GYI++ R+ +  +GLCGI
Sbjct: 282 IYNGPCGTNLDHALVIVGYGS-ENGQDYWIVRNSWGTTWGDAGYIKIARNFEDPKGLCGI 340

Query: 338 AMQASYP 344
           AM ASYP
Sbjct: 341 AMLASYP 347


>gi|18413507|ref|NP_567377.1| putative cysteine proteinase [Arabidopsis thaliana]
 gi|30315953|sp|Q9SUS9.1|CPR4_ARATH RecName: Full=Probable cysteine proteinase At4g11320; Flags:
           Precursor
 gi|5596478|emb|CAB51416.1| drought-inducible cysteine proteinase RD21A precursor-like protein
           [Arabidopsis thaliana]
 gi|7267831|emb|CAB81233.1| drought-inducible cysteine proteinase RD21A precursor-like protein
           [Arabidopsis thaliana]
 gi|14334764|gb|AAK59560.1| putative cysteine proteinase [Arabidopsis thaliana]
 gi|15293257|gb|AAK93739.1| putative cysteine proteinase [Arabidopsis thaliana]
 gi|332657596|gb|AEE82996.1| putative cysteine proteinase [Arabidopsis thaliana]
          Length = 371

 Score =  309 bits (791), Expect = 1e-81,   Method: Compositional matrix adjust.
 Identities = 158/316 (50%), Positives = 210/316 (66%), Gaps = 9/316 (2%)

Query: 32  DATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFAD 91
           DA      E WM ++G+VY   AEKE R  IF++N+ +I   N  A N  Y+LG+N FAD
Sbjct: 49  DAEATLMFESWMVKHGKVYDSVAEKERRLTIFEDNLRFIT--NRNAENLSYRLGLNRFAD 106

Query: 92  QTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENASV-PASIDWRKKGAVTGVKDQGQCG 150
            +  E+    +G   R P      T+   ++  +  V P S+DWR +GAVT VKDQG C 
Sbjct: 107 LSLHEYGEICHGADPRPPRNHVFMTSSNRYKTSDGDVLPKSVDWRNEGAVTEVKDQGLCR 166

Query: 151 CCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGL 210
            CWAFS V A+EG+N I T +L +LSEQ+L++C+   E+ GC GG ++ A+EFI++N GL
Sbjct: 167 SCWAFSTVGAVEGLNKIVTGELVTLSEQDLINCNK--ENNGCGGGKVETAYEFIMNNGGL 224

Query: 211 ATEAKYPYKASDGSCNK--KEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASG 268
            T+  YPYKA +G C    KE N +   I GYE++P+N+EAALMKAVA+QPV+  +D+S 
Sbjct: 225 GTDNDYPYKALNGVCEGRLKEDNKNVM-IDGYENLPANDEAALMKAVAHQPVTAVVDSSS 283

Query: 269 SDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDI 328
            +FQ Y SGVF G CGT L+HGV  VGYGT ++G  YW+VKNS G TWGE GY++M R+I
Sbjct: 284 REFQLYESGVFDGTCGTNLNHGVVVVGYGT-ENGRDYWIVKNSRGDTWGEAGYMKMARNI 342

Query: 329 DAKEGLCGIAMQASYP 344
               GLCGIAM+ASYP
Sbjct: 343 ANPRGLCGIAMRASYP 358


>gi|296090463|emb|CBI40282.3| unnamed protein product [Vitis vinifera]
          Length = 386

 Score =  308 bits (790), Expect = 2e-81,   Method: Compositional matrix adjust.
 Identities = 159/307 (51%), Positives = 201/307 (65%), Gaps = 37/307 (12%)

Query: 39  HEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFR 98
           +E W+ ++G+ Y    E+E RF+IFK+N+ +I   N  A N+ YK+G           FR
Sbjct: 4   YEAWLVKHGKSYNALGERERRFEIFKDNLRFIEEHN--AVNRTYKVG-------DRYSFR 54

Query: 99  APRNGYKRRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAV 158
           A  +                         +P S+DWR+KGAV  VKDQG CG CWAFS +
Sbjct: 55  AGED-------------------------LPESVDWREKGAVVPVKDQGNCGSCWAFSTI 89

Query: 159 AAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPY 218
           AA+EGIN I T  L SLSEQELVDCD S  +QGC GGLMD AFEFII+N G+ +E  YPY
Sbjct: 90  AAVEGINQIATGDLISLSEQELVDCDKS-YNQGCNGGLMDYAFEFIINNGGIDSEEDYPY 148

Query: 219 KASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGV 278
           +A+D +C+    N     I GYEDVP N+E +L KAVANQPVSVAI+A G  FQ Y SGV
Sbjct: 149 RAADTTCDPNRKNARVVSIDGYEDVPQNDERSLKKAVANQPVSVAIEAGGRAFQLYQSGV 208

Query: 279 FTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKE-GLCGI 337
           FTGQCGT+LDHGV AVGYGT ++   YW+V+NSWG  WGE+GYI+++R++   E G CGI
Sbjct: 209 FTGQCGTQLDHGVVAVGYGT-ENSVDYWIVRNSWGPNWGESGYIKLERNLAGTETGKCGI 267

Query: 338 AMQASYP 344
           A++ SYP
Sbjct: 268 AIEPSYP 274


>gi|14600257|gb|AAK71314.1|AF388175_1 papain-like cysteine peptidase XBCP3 [Arabidopsis thaliana]
          Length = 437

 Score =  308 bits (790), Expect = 2e-81,   Method: Compositional matrix adjust.
 Identities = 155/311 (49%), Positives = 200/311 (64%), Gaps = 7/311 (2%)

Query: 35  MNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTN 94
           ++E  + W  ++G+ Y    E++ R +IFK+N +++   +N   N  Y L +N FAD T+
Sbjct: 28  ISELFDDWCQKHGKTYGSEEERQQRIQIFKDNHDFVTQ-HNLITNATYSLSLNAFADLTH 86

Query: 95  EEFRAPRNGYKRRLPSV-RSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQCGCCW 153
            EF+A R G     PSV  +S+   +     +  VP S+DWRKKGAVT VKDQG CG CW
Sbjct: 87  HEFKASRLGLSVSAPSVIMASKGQSLG---GSVKVPDSVDWRKKGAVTNVKDQGSCGACW 143

Query: 154 AFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATE 213
           +FSA  AMEGIN I T  L SLSEQEL+DCD S  + GC GGLMD AFEF+I N G+ TE
Sbjct: 144 SFSATGAMEGINQIVTGDLISLSEQELIDCDKS-YNAGCNGGLMDYAFEFVIKNHGIDTE 202

Query: 214 AKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQF 273
             YPY+  DG+C K +       I  Y  V SN+E ALM+AVA QPVSV I  S   FQ 
Sbjct: 203 KDYPYQERDGTCKKDKLKQKVVTIDSYAGVKSNDEKALMEAVAAQPVSVGICGSERAFQL 262

Query: 274 YSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEG 333
           YS G+F+G C T LDH V  VGYG+  +G  YW+VKNSWG +WG +G++ MQR+ +  +G
Sbjct: 263 YSRGIFSGPCSTSLDHAVLIVGYGS-QNGVDYWIVKNSWGKSWGMDGFMHMQRNTENSDG 321

Query: 334 LCGIAMQASYP 344
           +CGI M ASYP
Sbjct: 322 VCGINMLASYP 332


>gi|162459488|ref|NP_001105571.1| maize insect resistance1 precursor [Zea mays]
 gi|5731354|gb|AAB70820.2| cysteine protease Mir1 [Zea mays]
          Length = 398

 Score =  308 bits (790), Expect = 2e-81,   Method: Compositional matrix adjust.
 Identities = 162/331 (48%), Positives = 212/331 (64%), Gaps = 22/331 (6%)

Query: 32  DATMNERHEMWMAQYGRVYRDN--------------AEKEMRFKIFKENVEYIASFNNKA 77
           D  +   +E W +++GR    N               ++ +R ++F++N+ YI + N +A
Sbjct: 47  DEEVRRMYEAWKSKHGRGGSSNDDCDMAPGDDEQEEEDRRLRLEVFRDNLRYIDAHNAEA 106

Query: 78  RN--KPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENASVPASIDWR 135
                 ++LG+  FAD T EE+R    G++ R     +   +  S R     +P +IDWR
Sbjct: 107 DAGLHTFRLGLTPFADLTLEEYRGRVLGFRARGRRSGARYGSGYSVR--GGDLPDAIDWR 164

Query: 136 KKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGG 195
           + GAVT VKDQ QCG CWAFSAVAA+EG+N I T  L SLSEQE++DCD   +D GC+GG
Sbjct: 165 QLGAVTEVKDQQQCGGCWAFSAVAAIEGVNAIATGNLVSLSEQEIIDCDA--QDSGCDGG 222

Query: 196 LMDDAFEFIISNKGLATEAKYPYKASDGSCN-KKEANPSAAKISGYEDVPSNNEAALMKA 254
            M++AF F+I N G+ TEA YP+  +DG+C+  KE N   A I G  +V SNNE AL +A
Sbjct: 223 QMENAFRFVIGNGGIDTEADYPFIGTDGTCDASKEKNEKVATIDGLVEVASNNETALQEA 282

Query: 255 VANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGT 314
           VA QPVSVAIDASG  FQ YSSG+F G CGT LDHGVTAVGYG+ + G  YW+VKNSW  
Sbjct: 283 VAIQPVSVAIDASGRAFQHYSSGIFNGPCGTSLDHGVTAVGYGS-ESGKDYWIVKNSWSA 341

Query: 315 TWGENGYIRMQRDIDAKEGLCGIAMQASYPT 345
           +WGE GYIRM+R++    G CGIAM ASYP 
Sbjct: 342 SWGEAGYIRMRRNVPRPTGKCGIAMDASYPV 372


>gi|297851332|ref|XP_002893547.1| peptidase C1A papain family protein [Arabidopsis lyrata subsp.
           lyrata]
 gi|297339389|gb|EFH69806.1| peptidase C1A papain family protein [Arabidopsis lyrata subsp.
           lyrata]
          Length = 345

 Score =  308 bits (790), Expect = 2e-81,   Method: Compositional matrix adjust.
 Identities = 155/329 (47%), Positives = 212/329 (64%), Gaps = 24/329 (7%)

Query: 30  LNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEF 89
           L++ T+   H+ WM  + RVY D  EK+MR ++F EN+++I +FNN   ++ YKLG+N+F
Sbjct: 29  LHEPTIFYYHQKWMINFSRVYDDEFEKQMRLEVFTENLKFIENFNNMG-SQSYKLGVNKF 87

Query: 90  ADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENASVPA-----------SIDWRKKG 138
            D T EEF A   G         S       F   N + PA           + DWR +G
Sbjct: 88  TDWTKEEFLATHTGL--------SGINVTSPFEVVNETTPAWNWTVSDVLGTTKDWRNEG 139

Query: 139 AVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMD 198
           AVT VK QG+CG CWAFSA+AA+EG+  I    L SLSEQ+L+DC    ++ GC+GG M 
Sbjct: 140 AVTPVKYQGECGGCWAFSAIAAVEGLTKIARGNLISLSEQQLLDCARE-QNNGCKGGTMI 198

Query: 199 DAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQ 258
           +AF +I+ N G+++E  YPY+  +G C   +    A  I G+E+VPSNNE AL++AV+ Q
Sbjct: 199 EAFNYIVKNGGVSSENAYPYQVKEGPCRSNDI--PAIVIRGFENVPSNNERALLEAVSRQ 256

Query: 259 PVSVAIDASGSDFQFYSSGVFTGQ-CGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWG 317
           PV+V IDAS + F  YS GV+  + CGT ++H VT VGYGT+ +G KYWL KNSWG TWG
Sbjct: 257 PVAVDIDASETGFIHYSGGVYNARDCGTSVNHAVTLVGYGTSQEGIKYWLAKNSWGKTWG 316

Query: 318 ENGYIRMQRDIDAKEGLCGIAMQASYPTA 346
           ENGYIR++RD++  +G+CG+A  ASYP A
Sbjct: 317 ENGYIRIRRDVEWPQGMCGVAQYASYPVA 345


>gi|146215990|gb|ABQ10197.1| actinidin Act4a [Actinidia eriantha]
          Length = 385

 Score =  308 bits (789), Expect = 2e-81,   Method: Compositional matrix adjust.
 Identities = 155/325 (47%), Positives = 207/325 (63%), Gaps = 13/325 (4%)

Query: 24  QSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYK 83
           + W +  ND  +    E W+ +YG+ Y    EKE RF+IFK+N+ ++   N    N+ YK
Sbjct: 34  EKWEQRTNDEVI-AMFESWLVEYGKSYNALGEKERRFEIFKDNLRFVDEHNADV-NRSYK 91

Query: 84  LGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYE---NASVPASIDWRKKGAV 140
           +G+N+F+D T+ E+ +   G K  +        T+VS RYE      +P S+DWRKKGAV
Sbjct: 92  VGLNQFSDLTDAEYSSIYLGTKFNI------RMTNVSDRYEPRVGDQLPDSVDWRKKGAV 145

Query: 141 TGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDA 200
            GVK+QG CG CW F+++AA+EGIN I T  L SLSEQE+VDC     + GC GG +  A
Sbjct: 146 LGVKNQGNCGSCWTFASIAAVEGINKIVTGNLISLSEQEIVDCQRKYPNNGCNGGTLSGA 205

Query: 201 FEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPV 260
           ++FII+N G+ TEA YPY   DG C++ + N     I  YE+VPSNNE AL KAVA QPV
Sbjct: 206 YQFIINNGGINTEANYPYTGRDGVCDQNKKNKKYVTIDRYENVPSNNEKALQKAVAFQPV 265

Query: 261 SVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENG 320
           SV I ++ + F+ Y SG+F G CG  +DHGVT VGYGT + G  YW+V+NSWG  WGE+G
Sbjct: 266 SVVIASNSTAFKSYKSGIFNGPCGPRIDHGVTIVGYGT-EGGKDYWIVRNSWGPNWGESG 324

Query: 321 YIRMQRDIDAKEGLCGIAMQASYPT 345
           Y+RMQR++    G C IA    YP 
Sbjct: 325 YVRMQRNVGG-SGKCFIARAPVYPV 348


>gi|297809383|ref|XP_002872575.1| hypothetical protein ARALYDRAFT_911472 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297318412|gb|EFH48834.1| hypothetical protein ARALYDRAFT_911472 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 371

 Score =  308 bits (788), Expect = 3e-81,   Method: Compositional matrix adjust.
 Identities = 156/317 (49%), Positives = 213/317 (67%), Gaps = 11/317 (3%)

Query: 32  DATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFAD 91
           DA  +   + WM ++G+VY   AEKE R  IF++N+ +I+  N  A N  Y+LG+ +FAD
Sbjct: 49  DAEASLIFDSWMVKHGKVYGSVAEKERRLTIFEDNLRFIS--NRNAENLSYRLGLTQFAD 106

Query: 92  QTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENAS---VPASIDWRKKGAVTGVKDQGQ 148
            +  E+    +G   R P  R+      S RY+ ++   +P S+DWR +GAVT VKDQG 
Sbjct: 107 LSLHEYGEVCHGADPRPP--RNHVFMTSSDRYKTSAGDVLPKSVDWRNEGAVTEVKDQGH 164

Query: 149 CGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNK 208
           C  CWAFS V A+EG+N I T +L +LSEQ+L++C+   E+ GC GG ++ A+EFI+ N 
Sbjct: 165 CRSCWAFSTVGAVEGLNKIVTGELVTLSEQDLINCNK--ENNGCGGGKVETAYEFIMKNG 222

Query: 209 GLATEAKYPYKASDGSCNKK-EANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDAS 267
           GL T+  YPYKA +G C+ + + N     I G+E++P+N+E ALMKAVA+QPV+  ID+S
Sbjct: 223 GLGTDNDYPYKAVNGVCDGRLKENNKNVMIDGFENLPANDEFALMKAVAHQPVTAVIDSS 282

Query: 268 GSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRD 327
             +FQ Y SGVF G CGT L+HGV  VGYGT ++G  YWLVKNS G TWGE GY++M R+
Sbjct: 283 SREFQLYESGVFDGSCGTNLNHGVVVVGYGT-ENGRDYWLVKNSRGNTWGEAGYMKMARN 341

Query: 328 IDAKEGLCGIAMQASYP 344
           I    GLCGIAM+ASYP
Sbjct: 342 IANPRGLCGIAMRASYP 358


>gi|146215992|gb|ABQ10198.1| actinidin Act4b [Actinidia eriantha]
          Length = 379

 Score =  308 bits (788), Expect = 3e-81,   Method: Compositional matrix adjust.
 Identities = 157/325 (48%), Positives = 209/325 (64%), Gaps = 14/325 (4%)

Query: 24  QSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYK 83
           + W +  ND  M    E W+ +YG+ Y    EKE RF+IFK+N+ ++   N    N+ YK
Sbjct: 34  KKWEQRTNDEVM-AMFESWLVEYGKSYNALGEKERRFEIFKDNLRFVDEHNADV-NRSYK 91

Query: 84  LGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYE---NASVPASIDWRKKGAV 140
           +G+N+F+D T EE+ +   G K  +        T+VS RYE      +P SIDWRKKGAV
Sbjct: 92  VGLNQFSDLTLEEYSSIYLGTKFDM------RMTNVSDRYEPRVGDQLPNSIDWRKKGAV 145

Query: 141 TGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDA 200
            GVK+QG CG CW F+ +AA+E IN I T  L SLSEQ++VDC     + GC+GG    A
Sbjct: 146 LGVKNQGNCGSCWTFAPIAAVEAINQIVTGNLISLSEQQIVDCQRKSPNNGCKGGSRAGA 205

Query: 201 FEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPV 260
           ++FII N G+ TEA YPYKA DG C++++ N     I  YE+VP  NE AL KAV+NQ V
Sbjct: 206 YQFIIDNGGINTEANYPYKAQDGECDEQK-NQKYVTIDRYENVPRKNEKALQKAVSNQLV 264

Query: 261 SVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENG 320
           SV I ++ S+F+ Y SG+FTG CG ++DH VT VGYGT + G  YW+V+NSWG+ WGENG
Sbjct: 265 SVGIASNSSEFKAYKSGIFTGPCGAKIDHAVTIVGYGT-EGGMDYWIVRNSWGSNWGENG 323

Query: 321 YIRMQRDIDAKEGLCGIAMQASYPT 345
           Y+RMQR++    G C IA   +YP 
Sbjct: 324 YVRMQRNV-GNAGTCFIATSPNYPV 347


>gi|255546708|ref|XP_002514413.1| cysteine protease, putative [Ricinus communis]
 gi|223546510|gb|EEF48009.1| cysteine protease, putative [Ricinus communis]
          Length = 324

 Score =  308 bits (788), Expect = 4e-81,   Method: Compositional matrix adjust.
 Identities = 158/311 (50%), Positives = 198/311 (63%), Gaps = 32/311 (10%)

Query: 35  MNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTN 94
           + E  E WM+++G+ Y    EK  R ++FK+N+ +I   N       Y L +NEFAD ++
Sbjct: 43  LTELFESWMSKHGKTYESIEEKLHRLEVFKDNLMHIDRRNRDVTT--YWLALNEFADLSH 100

Query: 95  EEFRAPRNGYKRRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQCGCCWA 154
           EEF       K +L  +R  E                     KGAV  VK+QG CG CWA
Sbjct: 101 EEF-------KSKLAQIRRLE---------------------KGAVAPVKNQGSCGSCWA 132

Query: 155 FSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEA 214
           FS VAA+EGIN I T  LTSLSEQEL+DCDTS  + GC GGLMD AF++I++N GL  E 
Sbjct: 133 FSTVAAVEGINQIVTGNLTSLSEQELIDCDTSF-NSGCNGGLMDYAFDYIVNNGGLHKEE 191

Query: 215 KYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFY 274
            YPY   +G+C++K        ISGY DVP NNE +L+KA+A+QP+S+AI+ASG DFQFY
Sbjct: 192 DYPYLMEEGTCDEKREEMEVVTISGYHDVPENNEESLLKALAHQPLSIAIEASGRDFQFY 251

Query: 275 SSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGL 334
             GVF G CGT+LDHGV AVGYG++  G  Y +VKNSWG  WGE GYIRM+R+    EGL
Sbjct: 252 GRGVFNGPCGTDLDHGVAAVGYGSS-KGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPEGL 310

Query: 335 CGIAMQASYPT 345
           CGI   ASYPT
Sbjct: 311 CGINKMASYPT 321


>gi|357477225|ref|XP_003608898.1| Cysteine proteinase, partial [Medicago truncatula]
 gi|355509953|gb|AES91095.1| Cysteine proteinase, partial [Medicago truncatula]
          Length = 260

 Score =  307 bits (787), Expect = 4e-81,   Method: Compositional matrix adjust.
 Identities = 158/264 (59%), Positives = 183/264 (69%), Gaps = 28/264 (10%)

Query: 86  INEFADQTNEEFRA----PRNGYKRRLPSVRSSETTDVSFRYENAS-VPASIDWRKKGAV 140
           +N+FAD TN EFR+     +  + R     R     +  F YEN   VP+SIDWRK GAV
Sbjct: 2   LNKFADMTNYEFRSIYADSKVNHHRMF---RGMSHDNGPFMYENVEGVPSSIDWRKIGAV 58

Query: 141 TGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDA 200
           TGVKDQGQCG CWAFS + A+EGIN I T+KL SLSEQELVDCDT   +QGC GGLM+ A
Sbjct: 59  TGVKDQGQCGSCWAFSTIVAVEGINQIKTQKLVSLSEQELVDCDTE-VNQGCNGGLMEYA 117

Query: 201 FEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPV 260
           FEFI  N G+ TE  YPY A DG+CN ++ N  A  I G+E+VP+NNE AL+KA ANQP+
Sbjct: 118 FEFIKQN-GITTETNYPYAAKDGTCNIQKENKPAVSIDGHENVPANNEKALLKAAANQPI 176

Query: 261 SVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENG 320
           SVAIDA GSDFQFYS GVFTG CGTEL+HGV                  NSWG+ WGE G
Sbjct: 177 SVAIDAGGSDFQFYSEGVFTGHCGTELNHGV------------------NSWGSEWGEQG 218

Query: 321 YIRMQRDIDAKEGLCGIAMQASYP 344
           YIRMQR I  K+GLCGIAM+ASYP
Sbjct: 219 YIRMQRAISHKQGLCGIAMEASYP 242


>gi|112490572|pdb|2FO5|A Chain A, Crystal Structure Of Recombinant Barley Cysteine
           Endoprotease B Isoform 2 (Ep-B2) In Complex With
           Leupeptin
 gi|112490573|pdb|2FO5|B Chain B, Crystal Structure Of Recombinant Barley Cysteine
           Endoprotease B Isoform 2 (Ep-B2) In Complex With
           Leupeptin
 gi|112490574|pdb|2FO5|C Chain C, Crystal Structure Of Recombinant Barley Cysteine
           Endoprotease B Isoform 2 (Ep-B2) In Complex With
           Leupeptin
 gi|112490575|pdb|2FO5|D Chain D, Crystal Structure Of Recombinant Barley Cysteine
           Endoprotease B Isoform 2 (Ep-B2) In Complex With
           Leupeptin
          Length = 262

 Score =  306 bits (785), Expect = 7e-81,   Method: Compositional matrix adjust.
 Identities = 143/221 (64%), Positives = 173/221 (78%), Gaps = 4/221 (1%)

Query: 128 VPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSG 187
           +P S+DWR+KGAVTGVKDQG+CG CWAFS V ++EGIN I T  L SLSEQEL+DCDT+ 
Sbjct: 4   LPPSVDWRQKGAVTGVKDQGKCGSCWAFSTVVSVEGINAIRTGSLVSLSEQELIDCDTAD 63

Query: 188 EDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEA---NPSAAKISGYEDVP 244
            D GC+GGLMD+AFE+I +N GL TEA YPY+A+ G+CN   A   +P    I G++DVP
Sbjct: 64  ND-GCQGGLMDNAFEYIKNNGGLITEAAYPYRAARGTCNVARAAQNSPVVVHIDGHQDVP 122

Query: 245 SNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTK 304
           +N+E  L +AVANQPVSVA++ASG  F FYS GVFTG+CGTELDHGV  VGYG A+DG  
Sbjct: 123 ANSEEDLARAVANQPVSVAVEASGKAFMFYSEGVFTGECGTELDHGVAVVGYGVAEDGKA 182

Query: 305 YWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPT 345
           YW VKNSWG +WGE GYIR+++D  A  GLCGIAM+ASYP 
Sbjct: 183 YWTVKNSWGPSWGEQGYIRVEKDSGASGGLCGIAMEASYPV 223


>gi|242094002|ref|XP_002437491.1| hypothetical protein SORBIDRAFT_10g028010 [Sorghum bicolor]
 gi|241915714|gb|EER88858.1| hypothetical protein SORBIDRAFT_10g028010 [Sorghum bicolor]
          Length = 397

 Score =  306 bits (783), Expect = 1e-80,   Method: Compositional matrix adjust.
 Identities = 164/336 (48%), Positives = 216/336 (64%), Gaps = 26/336 (7%)

Query: 32  DATMNERHEMWMAQYGRVYRDNA-----EKEMRFKIFKENVEYIASFNNKARN--KPYKL 84
           D  +   +E W +++GR  R N      E  +R ++F++N+ YI + N +A      ++L
Sbjct: 47  DEEVRRMYEAWKSKHGRP-RGNCDMAGDEDRLRLEVFRDNLRYIDAHNAEADAGLHTFRL 105

Query: 85  GINEFADQTNEEFRAPRNGYKRRL---PSVRS-----------SETTDVSFRYENASVPA 130
           G+  FAD T EE+R    G++ R    PS R+           S       R     +P 
Sbjct: 106 GLTPFADLTLEEYRGRALGFRARHRGGPSARAAASRVGSGGTRSHHRRPRPRPRCGDLPD 165

Query: 131 SIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQ 190
           +IDWR+ GAVT VK+Q QCG CWAFSAVAA+EGIN I T  L SLSEQE++DCDT  +D 
Sbjct: 166 AIDWRQLGAVTDVKNQEQCGGCWAFSAVAAIEGINAIVTGNLVSLSEQEIIDCDT--QDS 223

Query: 191 GCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEAN-PSAAKISGYEDVPSNNEA 249
           GC GG M++AF+F+I N G+ +EA YP+ A+DG+C+  +AN    A I G+ +V SNNE 
Sbjct: 224 GCNGGQMENAFQFVIDNGGIDSEADYPFIATDGTCDANKANDEKVAAIDGFVEVASNNET 283

Query: 250 ALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVK 309
           AL +AVA QPVSVAIDA G  FQ YSSG+F G CGT LDHGVT VGYG+ ++G  YW+VK
Sbjct: 284 ALQEAVAIQPVSVAIDAGGRAFQHYSSGIFNGPCGTNLDHGVTVVGYGS-ENGKAYWIVK 342

Query: 310 NSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPT 345
           NSW  +WGE GYIR++R++    G CGIAM ASYP 
Sbjct: 343 NSWSDSWGEAGYIRIRRNVFLPVGKCGIAMDASYPV 378


>gi|42567068|ref|NP_567686.2| putative cysteine proteinase [Arabidopsis thaliana]
 gi|332659371|gb|AEE84771.1| putative cysteine proteinase [Arabidopsis thaliana]
          Length = 356

 Score =  305 bits (782), Expect = 2e-80,   Method: Compositional matrix adjust.
 Identities = 150/308 (48%), Positives = 210/308 (68%), Gaps = 10/308 (3%)

Query: 40  EMWMAQYGRVYRDN-AEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFR 98
           +MWM+++G+ Y +   EKE RF+ FK+N+ +I   N  A+N  Y+LG+  FAD T +E+R
Sbjct: 48  QMWMSKHGKTYTNALGEKERRFQNFKDNLRFIDQHN--AKNLSYQLGLTRFADLTVQEYR 105

Query: 99  APRNGYKRRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAV 158
               G  +  P  R+ +T+          +P S+DWR++GAV+ +KDQG C  CWAFS V
Sbjct: 106 DLFPGSPK--PKQRNLKTSRRYVPLAGDQLPESVDWRQEGAVSEIKDQGTCNSCWAFSTV 163

Query: 159 AAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEG-GLMDDAFEFIISNKGLATEAKYP 217
           AA+EG+N I T +L SLSEQELVDC+    + GC G GLMD AF+F+I+N GL +E  YP
Sbjct: 164 AAVEGLNKIVTGELISLSEQELVDCNLV--NNGCYGSGLMDTAFQFLINNNGLDSEKDYP 221

Query: 218 YKASDGSCNKKEANPS-AAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSS 276
           Y+ + GSCN+K++  +    I  YEDVP+N+E +L KAVA+QPVSV +D    +F  Y S
Sbjct: 222 YQGTQGSCNRKQSTSNKVITIDSYEDVPANDEISLQKAVAHQPVSVGVDKKSQEFMLYRS 281

Query: 277 GVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCG 336
            ++ G CGT LDH +  VGYG+ ++G  YW+V+NSWGTTWG+ GYI++ R+ +  +GLCG
Sbjct: 282 CIYNGPCGTNLDHALVIVGYGS-ENGQDYWIVRNSWGTTWGDAGYIKIARNFEDPKGLCG 340

Query: 337 IAMQASYP 344
           IAM ASYP
Sbjct: 341 IAMLASYP 348


>gi|297830594|ref|XP_002883179.1| hypothetical protein ARALYDRAFT_318695 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297329019|gb|EFH59438.1| hypothetical protein ARALYDRAFT_318695 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 308

 Score =  305 bits (781), Expect = 2e-80,   Method: Compositional matrix adjust.
 Identities = 158/309 (51%), Positives = 202/309 (65%), Gaps = 18/309 (5%)

Query: 39  HEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFR 98
           +E W+ +  + Y    EKE R KIFKEN+++I   +N   N+ +++G+  FAD TN+E  
Sbjct: 2   YERWLVENRKNYNGLGEKERRCKIFKENLKFIDE-HNSLPNQTFEVGLTRFADLTNDE-- 58

Query: 99  APRNGYKRRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAV 158
            P++  K            D     E   +P  IDWR KGAV  VKDQG CG CWAFSAV
Sbjct: 59  -PKDFMK-----------ADRYLYKEGDILPDEIDWRAKGAVVPVKDQGNCGSCWAFSAV 106

Query: 159 AAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPY 218
            A+EGIN I T +L SLS+QEL+DCD    + GCEGG+M+ AFEFII+N G+ ++  YPY
Sbjct: 107 GAVEGINQIKTGELISLSDQELIDCDRGFVNAGCEGGVMNYAFEFIINNGGIESDQDYPY 166

Query: 219 KASD-GSCN-KKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSS 276
            A+D G CN  K+ N    KI GYE V  N+E +L KAVA+QPV VAI+AS   F+ Y S
Sbjct: 167 TATDLGVCNADKKNNTRVVKIDGYEYVAQNDEKSLKKAVAHQPVGVAIEASSQAFKLYKS 226

Query: 277 GVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCG 336
           GVFTG CG  LDHGV  VGYGT+  G  YW+++NSWG  WGENGY+++QR+ID   G CG
Sbjct: 227 GVFTGTCGIYLDHGVVVVGYGTS-SGEDYWIIRNSWGLNWGENGYVKLQRNIDDSFGKCG 285

Query: 337 IAMQASYPT 345
           +AM  SYPT
Sbjct: 286 VAMMPSYPT 294


>gi|146215988|gb|ABQ10196.1| actinidin Act3a [Actinidia eriantha]
          Length = 380

 Score =  305 bits (780), Expect = 2e-80,   Method: Compositional matrix adjust.
 Identities = 162/349 (46%), Positives = 219/349 (62%), Gaps = 20/349 (5%)

Query: 1   MAMILLENKLVLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRF 60
           M+++     L+ +      + A  S  RT ND  M   +E W+ +YG+ Y    E+EMR 
Sbjct: 10  MSLLFFSTFLIFS----FAIDAKISPLRT-NDEVM-ALYESWLVKYGKSYNSLGEREMRI 63

Query: 61  KIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVS 120
           +IFKEN+ +I   +N   N+ Y +G+N+FAD T+EE+R+   G+K  L S        VS
Sbjct: 64  EIFKENLRFIDE-HNADPNRSYTVGLNQFADLTDEEYRSTYLGFKSSLKS-------KVS 115

Query: 121 FRYE---NASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSE 177
            RY       +P  +DWR  GAV  VK+QG C  CWAF+ +A +E IN I T  L SLSE
Sbjct: 116 NRYMPQVGEVLPDYVDWRTTGAVVDVKNQGLCSSCWAFATIATVESINQIITGDLISLSE 175

Query: 178 QELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKI 237
           QELVDC+ +  ++GC+GG MDDA+EFII+N G+ TE  YPY   D  C++ + N +   I
Sbjct: 176 QELVDCNRTPINEGCKGGFMDDAYEFIINNGGINTEENYPYIGQDDQCDEPKKNQNYVTI 235

Query: 238 SGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFT-GQCGTELDHGVTAVGY 296
             YE VP N+E A+ +AVA QPVSVAIDA    F+FY SG+FT G CGT L+H VT +GY
Sbjct: 236 DSYEQVPPNDELAMKRAVAYQPVSVAIDAYCLGFRFYQSGIFTGGSCGTTLNHAVTIIGY 295

Query: 297 GTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPT 345
           GT ++G  YW+VKNS+GT WGE+GY ++QR++   EG CGIA    YP 
Sbjct: 296 GT-ENGIDYWIVKNSYGTQWGESGYGKVQRNV-GGEGRCGIASYPFYPV 342


>gi|2160175|gb|AAB60738.1| Strong similarity to Dianthus cysteine proteinase (gb|U17135)
           [Arabidopsis thaliana]
          Length = 416

 Score =  305 bits (780), Expect = 2e-80,   Method: Compositional matrix adjust.
 Identities = 156/318 (49%), Positives = 201/318 (63%), Gaps = 14/318 (4%)

Query: 35  MNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTN 94
           ++E  + W  ++G+ Y    E++ R +IFK+N +++   +N   N  Y L +N FAD T+
Sbjct: 26  ISELFDDWCQKHGKTYGSEEERQQRIQIFKDNHDFVTQ-HNLITNATYSLSLNAFADLTH 84

Query: 95  EEFRAPRNGYKRRLPSV-RSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQCGCCW 153
            EF+A R G     PSV  +S+   +     +  VP S+DWRKKGAVT VKDQG CG CW
Sbjct: 85  HEFKASRLGLSVSAPSVIMASKGQSLG---GSVKVPDSVDWRKKGAVTNVKDQGSCGACW 141

Query: 154 AFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATE 213
           +FSA  AMEGIN I T  L SLSEQEL+DCD S  + GC GGLMD AFEF+I N G+ TE
Sbjct: 142 SFSATGAMEGINQIVTGDLISLSEQELIDCDKS-YNAGCNGGLMDYAFEFVIKNHGIDTE 200

Query: 214 AKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQF 273
             YPY+  DG+C K +       I  Y  V SN+E ALM+AVA QPVSV I  S   FQ 
Sbjct: 201 KDYPYQERDGTCKKDKLKQKVVTIDSYAGVKSNDEKALMEAVAAQPVSVGICGSERAFQL 260

Query: 274 YSS-------GVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQR 326
           YSS       G+F+G C T LDH V  VGYG+  +G  YW+VKNSWG +WG +G++ MQR
Sbjct: 261 YSSKFYLLMQGIFSGPCSTSLDHAVLIVGYGS-QNGVDYWIVKNSWGKSWGMDGFMHMQR 319

Query: 327 DIDAKEGLCGIAMQASYP 344
           + +  +G+CGI M ASYP
Sbjct: 320 NTENSDGVCGINMLASYP 337


>gi|242049716|ref|XP_002462602.1| hypothetical protein SORBIDRAFT_02g028840 [Sorghum bicolor]
 gi|241925979|gb|EER99123.1| hypothetical protein SORBIDRAFT_02g028840 [Sorghum bicolor]
          Length = 384

 Score =  305 bits (780), Expect = 3e-80,   Method: Compositional matrix adjust.
 Identities = 166/360 (46%), Positives = 214/360 (59%), Gaps = 53/360 (14%)

Query: 35  MNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTN 94
           M ER E WM ++GR+Y D  EK+ R ++++ NV  + +FN+ + N  Y+L  N+FAD TN
Sbjct: 28  MLERFEQWMGRHGRLYADAGEKQRRLEVYRRNVALVETFNSMS-NGGYRLADNKFADLTN 86

Query: 95  EEFRAPRNGYKRRLPSVRSSETTD-----------VSFRYENASVPASIDWRKKGAVTGV 143
           EEFRA   G+ R  P  R++  T            +  RY +  +P S+DWR+KGAV  V
Sbjct: 87  EEFRAKMLGFGRPPPHGRATGHTTTPGTVACIGSGLGRRYSD-ELPKSVDWREKGAVAPV 145

Query: 144 KDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEF 203
           K+QG+CG CWAFSAVAA+EGIN I   KL SLSEQELVDCDT  +  GC GG M  AFEF
Sbjct: 146 KNQGECGSCWAFSAVAAIEGINQIKNGKLVSLSEQELVDCDT--KAIGCAGGYMSWAFEF 203

Query: 204 IISNKGLATEAKYPYKAS----------------------------DGSCNKKEANPSAA 235
           +++N GL TE  YPY+ +                            +G+C   +   SA 
Sbjct: 204 VMNNSGLTTERNYPYQGTYAHGNRKTHALPFDCTKGSSTCDSRAGMNGACQTPKLKESAV 263

Query: 236 KISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVG 295
            ISGY +V +++E  L++A A QPVSVA+DA    +Q Y  GVFTG C  +L+HGVT VG
Sbjct: 264 SISGYVNVTASSEPDLLRAAAAQPVSVAVDAGSFVWQLYGGGVFTGPCTADLNHGVTVVG 323

Query: 296 YG-----TADDGT-----KYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPT 345
           YG     T  DGT     KYW+VKNSWG  WG+ GYI MQR+     GLCGIA+  SYP 
Sbjct: 324 YGETQRDTDGDGTGVPGQKYWIVKNSWGPEWGDAGYILMQREASVASGLCGIALLPSYPV 383


>gi|326430490|gb|EGD76060.1| cysteine proteinase [Salpingoeca sp. ATCC 50818]
          Length = 448

 Score =  304 bits (778), Expect = 4e-80,   Method: Compositional matrix adjust.
 Identities = 165/341 (48%), Positives = 215/341 (63%), Gaps = 19/341 (5%)

Query: 9   KLVLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVE 68
           KLVL   LV G    +  S T+N   +    + +  ++ +VY    E+  RF +F +N++
Sbjct: 4   KLVLVCALV-GAAMAEPLSLTVNKGRL---FDAFKTKFNKVYESAEEEARRFSVFSQNID 59

Query: 69  YIASFNNKARN--KPYKLGINEFADQTNEEFRAPRNGYKRRLPS-VRSSETTDVSFRYEN 125
           +I   N +A      + + +N+FAD TNEE+R     Y R  P+ +   E  +V     N
Sbjct: 60  FINRHNAEAARGVHTHTVDVNQFADLTNEEYR---QLYLRPYPTELLGRERQEVWLDGPN 116

Query: 126 ASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDT 185
           A    S+DWR+KGAVT +K+QGQCG CW+FS   ++EG + I T  L SLSEQ+LVDC  
Sbjct: 117 A---GSVDWRQKGAVTPIKNQGQCGSCWSFSTTGSVEGAHAIATGNLVSLSEQQLVDCSG 173

Query: 186 SGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPS 245
           S  +QGC GGLMD+AF++IISN GL TE  YPY A DG C+K + +  A  ISGY+DVP 
Sbjct: 174 SFGNQGCNGGLMDNAFKYIISNGGLDTEQDYPYTARDGVCDKSKESKHAVSISGYKDVPQ 233

Query: 246 NNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKY 305
           NNE  L  AV   PVSVAI+A    FQ YSSGVF+G CGT LDHGV  VGY T+D    Y
Sbjct: 234 NNEDQLAAAVEKGPVSVAIEADQQSFQMYSSGVFSGPCGTNLDHGVLVVGY-TSD----Y 288

Query: 306 WLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPTA 346
           W+VKNSWG +WG+ GYI M+R + +  G+CGIAMQ SYP A
Sbjct: 289 WIVKNSWGASWGDQGYIMMKRGVSSA-GICGIAMQPSYPIA 328


>gi|413943290|gb|AFW75939.1| maize insect resistance1 [Zea mays]
          Length = 435

 Score =  303 bits (777), Expect = 6e-80,   Method: Compositional matrix adjust.
 Identities = 161/333 (48%), Positives = 210/333 (63%), Gaps = 22/333 (6%)

Query: 32  DATMNERHEMWMAQYGRVYRDN-------------AEKEMRFKIFKENVEYIASFNNKAR 78
           D  +   +E W +++GR    N              ++ +R ++F++N+ YI   N +A 
Sbjct: 77  DEEVRRMYEAWKSKHGRGGSSNDDCDMAPGDDEQEEDRRLRLEVFRDNLRYIDKHNAEAD 136

Query: 79  N--KPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENAS---VPASID 133
                ++LG+  FAD T +E+R    G++ R     +       +R        +P +ID
Sbjct: 137 AGLHTFRLGLTPFADLTLDEYRGRVLGFRARARRSGARYGHGHGYRARPRGGDLLPDAID 196

Query: 134 WRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCE 193
           WR+ GAVT VKDQ QCG CWAFSAVAA+EGIN I T  L SLSEQE++DCD   +D GC+
Sbjct: 197 WRQLGAVTEVKDQQQCGGCWAFSAVAAIEGINAIATGNLVSLSEQEIIDCDA--QDSGCD 254

Query: 194 GGLMDDAFEFIISNKGLATEAKYPYKASDGSCN-KKEANPSAAKISGYEDVPSNNEAALM 252
           GG M++AF F+I N G+ TEA YP+  +DG+C+  KE N   A I G  +V SNNE AL 
Sbjct: 255 GGQMENAFRFVIGNGGIDTEADYPFIGTDGTCDASKENNEKVATIDGLVEVASNNETALQ 314

Query: 253 KAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSW 312
           +AVA QPVSVAIDASG  FQ YSSG+F G CGT LDHGVTAVGYG+ + G  YW+VKNSW
Sbjct: 315 EAVAIQPVSVAIDASGRAFQHYSSGIFNGPCGTSLDHGVTAVGYGS-ESGKDYWIVKNSW 373

Query: 313 GTTWGENGYIRMQRDIDAKEGLCGIAMQASYPT 345
             +WGE GYIRM+R++    G CGIAM ASYP 
Sbjct: 374 SASWGEAGYIRMRRNVPRPTGKCGIAMDASYPV 406


>gi|356517306|ref|XP_003527329.1| PREDICTED: LOW QUALITY PROTEIN: thiol protease SEN102-like [Glycine
           max]
          Length = 333

 Score =  303 bits (776), Expect = 9e-80,   Method: Compositional matrix adjust.
 Identities = 167/344 (48%), Positives = 214/344 (62%), Gaps = 20/344 (5%)

Query: 7   ENKLVLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKEN 66
           +N  VL   L+L VW  +  SR L     +ERHE W+AQYG+VY+D  E E RF++FK N
Sbjct: 6   QNHYVLVLFLILTVWISRVMSRGL---IRSERHEKWIAQYGKVYKDAVE-EKRFQVFKNN 61

Query: 67  VEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSE--TTDVSFRYE 124
           V++I SFN  A +KP+ L IN+F D  +EEF+A     +++   V + +    D+    E
Sbjct: 62  VQFIESFN-AAGDKPFNLSINQFVDLHDEEFKALLINVQKKASGVETVKEPAMDIQKLTE 120

Query: 125 NASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCD 184
            A        +KK     + D G       F  +A +E ++ IT  +L  LSEQELVDC 
Sbjct: 121 EACRENX---KKKNEKKPMWDLG-------FFLIATIESLHQITIGELVFLSEQELVDC- 169

Query: 185 TSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVP 244
             G+ + C GG +++AFEFI +  G+ +EA YPYK  D SC  K+     A+  GYE VP
Sbjct: 170 VRGDSEACHGGFVENAFEFIANKGGITSEAYYPYKGKDRSCKVKKETHGVARNIGYEKVP 229

Query: 245 SNN-EAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQ-CGTELDHGVTAVGYGTADDG 302
           SNN E AL+KAVANQPVSV IDA    ++FYSSG+F  + CGT LDH  T VGYG   DG
Sbjct: 230 SNNSEKALLKAVANQPVSVYIDAGAPAYKFYSSGIFNARNCGTHLDHAATVVGYGKLHDG 289

Query: 303 TKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPTA 346
           TKYWLVKNSW T WGE GYIRM+RDI +K+GLCGIA  ASYP A
Sbjct: 290 TKYWLVKNSWSTAWGEKGYIRMKRDIHSKKGLCGIASNASYPIA 333


>gi|356515116|ref|XP_003526247.1| PREDICTED: LOW QUALITY PROTEIN: vignain-like [Glycine max]
          Length = 333

 Score =  303 bits (775), Expect = 9e-80,   Method: Compositional matrix adjust.
 Identities = 173/343 (50%), Positives = 205/343 (59%), Gaps = 46/343 (13%)

Query: 35  MNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTN 94
           M  R + W+   G  Y D  E E+RF I++ NVEYI     K++   Y L  N+FAD TN
Sbjct: 1   MKVRFDRWLKXNGXNYEDKEEWEIRFVIYQANVEYIGC--KKSQKNSYNLTDNKFADLTN 58

Query: 95  EEFRAPRNGYKRRL-PSVRSSETTDVSFRY-ENASVPASIDWRKKGAVTGVKDQGQCG-- 150
           EEF +   G+  RL P  R        F+Y E+ ++P S DWRK+GAVT +KDQG CG  
Sbjct: 59  EEFVSTYLGFATRLIPHTR--------FKYHEHGNLPXSKDWRKEGAVTDIKDQGNCGKH 110

Query: 151 ---------------------------CCWAFSAVAAMEGINHITTRKLTSLSEQELVDC 183
                                        WAFS VAA+E IN I + KL SLSEQELVD 
Sbjct: 111 STWFSPEISHNLRNILTNYNTINFRDISFWAFSVVAAVERINKIKSGKLVSLSEQELVDY 170

Query: 184 DTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDV 243
           D + ++QGCEGGLMD  F FI  N GL T   YPY+  DGSCNK++A   A  ISGYE  
Sbjct: 171 DVANKNQGCEGGLMDTTFAFIKKNGGLTTSKDYPYEGVDGSCNKEKALHHAVNISGYERA 230

Query: 244 PSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGT 303
           PS +EA L  A ANQP+SVAIDA G  FQ YS GVF+G CG +L+HGVT VGY   D GT
Sbjct: 231 PSKDEAMLKVAAANQPISVAIDAGGYAFQLYSQGVFSGVCGKKLNHGVTIVGY---DKGT 287

Query: 304 --KYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
             KY  VKNS G  WGE+GYIRM+RD   K G CGIAM+ASYP
Sbjct: 288 FDKYRTVKNSXGADWGESGYIRMKRDAFDKAGTCGIAMKASYP 330


>gi|297843784|ref|XP_002889773.1| hypothetical protein ARALYDRAFT_471096 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297335615|gb|EFH66032.1| hypothetical protein ARALYDRAFT_471096 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 439

 Score =  302 bits (774), Expect = 2e-79,   Method: Compositional matrix adjust.
 Identities = 156/311 (50%), Positives = 197/311 (63%), Gaps = 9/311 (2%)

Query: 37  ERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEE 96
           E  + W  ++G+ Y    E++ R +IFK+N +++   +N   N  Y L +N FAD T+ E
Sbjct: 30  ELFDDWCQRHGKTYGSEEERQQRIQIFKDNHDFVTQ-HNLITNATYSLSLNAFADLTHHE 88

Query: 97  FRAPRNGYKRRLPS-VRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQCGCCWAF 155
           F+A R G      S + +S+   +     NA VP S+DWRKKGAVT VKDQG CG CW+F
Sbjct: 89  FKASRLGLSVSASSLIMASKGQSLG---GNAKVPDSVDWRKKGAVTNVKDQGSCGACWSF 145

Query: 156 SAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAK 215
           SA  AMEGIN I T  L SLSEQEL+DCD S  + GC GGLMD AFEF+I N G+ TE  
Sbjct: 146 SATGAMEGINQIVTGDLISLSEQELIDCDKS-YNAGCNGGLMDYAFEFVIKNHGIDTEKD 204

Query: 216 YPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYS 275
           YPY+  DG+C K +       I  Y  V SN+E AL +AVA QPVSV I  S   FQ YS
Sbjct: 205 YPYQERDGTCKKDKLKQKVVTIDSYAGVKSNDEKALREAVAAQPVSVGICGSERAFQLYS 264

Query: 276 --SGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEG 333
             SG+F+G C T LDH V  VGYG+  +G  YW+VKNSWG +WG +G++ MQR+    EG
Sbjct: 265 RVSGIFSGPCSTSLDHAVLIVGYGS-QNGVDYWIVKNSWGKSWGMDGFMHMQRNTGNSEG 323

Query: 334 LCGIAMQASYP 344
           +CGI M ASYP
Sbjct: 324 ICGINMLASYP 334


>gi|238007404|gb|ACR34737.1| unknown [Zea mays]
 gi|413943289|gb|AFW75938.1| cysteine proteinase Mir2 [Zea mays]
          Length = 484

 Score =  302 bits (773), Expect = 2e-79,   Method: Compositional matrix adjust.
 Identities = 157/296 (53%), Positives = 196/296 (66%), Gaps = 17/296 (5%)

Query: 59  RFKIFKENVEYIASFNNKARN--KPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSET 116
           R ++F+ N+ YI + N +A      ++LG+  FAD T EE+RA      R L   R    
Sbjct: 83  RLEVFRYNLRYIDAHNAEADAGLHGFRLGLTRFADLTLEEYRA------RLLLGSRGRNG 136

Query: 117 TDV----SFRY---ENASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITT 169
           T V    S RY       +P ++DWR++GAV  VKDQGQCG CWAFSAVAA+EGIN I T
Sbjct: 137 TAVGVVGSRRYLPLAGEQLPDAVDWRERGAVAEVKDQGQCGACWAFSAVAAVEGINKIVT 196

Query: 170 RKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKE 229
             L SLSEQEL+DCD   +DQGC+GGLMD+AF F+I N G+ TEA YP+   DG+C+ K 
Sbjct: 197 GSLISLSEQELIDCDKF-QDQGCDGGLMDNAFVFMIKNGGIDTEADYPFTGHDGTCDLKL 255

Query: 230 ANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDH 289
            N     I  +E VP N E AL KAVA+QPVS +I+AS   FQ YSSG+F G+CGT LDH
Sbjct: 256 KNTRVVSIDSFERVPINYERALQKAVAHQPVSASIEASRRAFQLYSSGIFDGRCGTYLDH 315

Query: 290 GVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPT 345
           GVT VGYG+ + G  YW+VKNSWGT WGE GY+RM R++  + G CGIAM+  YP 
Sbjct: 316 GVTVVGYGS-EGGKDYWIVKNSWGTQWGEAGYVRMARNVRVRAGKCGIAMEPLYPV 370


>gi|297799636|ref|XP_002867702.1| hypothetical protein ARALYDRAFT_329301 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297313538|gb|EFH43961.1| hypothetical protein ARALYDRAFT_329301 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 357

 Score =  301 bits (771), Expect = 3e-79,   Method: Compositional matrix adjust.
 Identities = 149/309 (48%), Positives = 206/309 (66%), Gaps = 10/309 (3%)

Query: 40  EMWMAQYGRVYRDN-AEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFR 98
           +MWM+++G+ Y +   EKE RF+ FK+N+ +I   N  A+N  Y+LG+  FAD T +E+R
Sbjct: 49  QMWMSKHGKTYTNALGEKERRFQNFKDNLRFIDQHN--AKNLSYQLGLTRFADLTVQEYR 106

Query: 99  APRNGYKRRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAV 158
               G  +  P  R+   +      +   +P S+DWR +GAV+ +KDQG C  CWAFS V
Sbjct: 107 DLFPGSPK--PKQRNLRISRRYVPLDGDQLPESVDWRNEGAVSAIKDQGTCNSCWAFSTV 164

Query: 159 AAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEG-GLMDDAFEFIISNKGLATEAKYP 217
           AA+EGIN I T +L SLSEQELVDC+    + GC G G MD AF+F+I+N GL ++  YP
Sbjct: 165 AAVEGINKIVTGELVSLSEQELVDCNLV--NNGCYGSGTMDAAFQFLINNGGLDSDTDYP 222

Query: 218 YKASDGSCNKKEANPS-AAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSS 276
           Y+ S G CN+KE+  +    I  YEDVP+N+E +L KAVA+QPVSV +D    +F  Y S
Sbjct: 223 YQGSQGYCNRKESTSNKIITIDSYEDVPANDEISLQKAVAHQPVSVGVDKKSQEFMLYRS 282

Query: 277 GVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCG 336
           G++ G CGT+LDH +  VGYG+ ++G  YW+V+NSWGTTWG+ GY +M R+ +   G+CG
Sbjct: 283 GIYNGPCGTDLDHALVIVGYGS-ENGQDYWIVRNSWGTTWGDAGYAKMARNFEYPSGVCG 341

Query: 337 IAMQASYPT 345
           IAM ASYP 
Sbjct: 342 IAMLASYPV 350


>gi|302845628|ref|XP_002954352.1| hypothetical protein VOLCADRAFT_76255 [Volvox carteri f.
           nagariensis]
 gi|300260282|gb|EFJ44502.1| hypothetical protein VOLCADRAFT_76255 [Volvox carteri f.
           nagariensis]
          Length = 489

 Score =  301 bits (771), Expect = 3e-79,   Method: Compositional matrix adjust.
 Identities = 174/359 (48%), Positives = 218/359 (60%), Gaps = 22/359 (6%)

Query: 1   MAMILLENKLVLAAILVLGVWAPQSWSR-----TLNDATMNER--HEMWMAQYGRVY-RD 52
           MA+  L   L++AA   +G  AP+   R      L DA  N     + WM QY + Y  D
Sbjct: 1   MAVRFLIAALLVAASGGVGA-APELQLREQHEKLLLDAKANPMAAFQQWMMQYTKAYAND 59

Query: 53  NAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGY--KRRLPS 110
             E E RF ++ EN+ YI ++N  AR   + L +N FAD T +EFR  R GY  K R  S
Sbjct: 60  IKELETRFSVWLENLNYILAYN--ARTTSHWLHLNAFADLTTDEFRN-RLGYDFKARQAS 116

Query: 111 VRSSETTDVSFRYENA---SVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHI 167
            R   +    F Y+N     +P  IDWRKKGAVT VK+QGQCG CWAF+   ++EGIN I
Sbjct: 117 NRLQSS---PFIYDNVDANQLPTEIDWRKKGAVTEVKNQGQCGSCWAFATTGSVEGINAI 173

Query: 168 TTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNK 227
            T +L SLSEQELVDCDT  ED+GC GGLMD A+++II N GL TE  YPY A DG C  
Sbjct: 174 VTGELASLSEQELVDCDTD-EDRGCSGGLMDYAYQWIIKNGGLDTEDDYPYTAEDGVCVA 232

Query: 228 KEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQ-CGTE 286
            + N     I GY D+P N+E AL KA A+QP++VAI+A    FQ Y  GV+    CGT 
Sbjct: 233 AKKNRRVVTIDGYVDIPENDEVALKKAAAHQPIAVAIEADAKSFQLYGGGVYDDPTCGTS 292

Query: 287 LDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPT 345
           L+HGV  VGYG       YW+VKNSWG  WG+NGYIR++   +  +G+CGIAM  S+PT
Sbjct: 293 LNHGVLVVGYGKDPHFGNYWIVKNSWGPEWGDNGYIRLRMGAEDVQGMCGIAMAPSFPT 351


>gi|125606204|gb|EAZ45240.1| hypothetical protein OsJ_29883 [Oryza sativa Japonica Group]
          Length = 350

 Score =  300 bits (768), Expect = 6e-79,   Method: Compositional matrix adjust.
 Identities = 158/327 (48%), Positives = 208/327 (63%), Gaps = 20/327 (6%)

Query: 35  MNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTN 94
           M +R E WM ++GR Y D  EK+ RF++++ NVE + +FN+ +    YKL  N+FAD TN
Sbjct: 27  MLDRFEQWMIRHGRAYTDAGEKQRRFEVYRRNVELVETFNSMSNG--YKLADNKFADLTN 84

Query: 95  EEFRAPRNGYKRR--LPSVRSSETTDVSFRYENAS--VPASIDWRKKGAVTGV-KDQGQC 149
           EEFRA   G++    +P + ++ + D++   E++   +P S+DWR KGAV    K     
Sbjct: 85  EEFRAKMLGFRPHVTIPQISNTCSADIAMPGESSDDILPKSVDWRNKGAVINRWKICVDA 144

Query: 150 GCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKG 209
           G CWAFSAVAA+EGIN I   +L SLSEQELVDCD   E  GC GG M  AFEF++ N G
Sbjct: 145 GSCWAFSAVAAIEGINQIKNGELVSLSEQELVDCDD--EAVGCGGGYMSWAFEFVVGNHG 202

Query: 210 LATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGS 269
           L TEA YPY A++G+C   + N SA  I+GY +V  ++E  L +A A QPVSVA+D    
Sbjct: 203 LTTEASYPYHAANGACQAAKLNQSAVAIAGYRNVTPSSEPDLARAAAAQPVSVAVDGGSF 262

Query: 270 DFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGT----------KYWLVKNSWGTTWGEN 319
            FQ Y SGV+TG C  +++HGVT VGYG ++  T          KYW+VKNSWG  WG+ 
Sbjct: 263 MFQLYGSGVYTGPCTADVNHGVTVVGYGESEPKTDGGGAAKGGEKYWIVKNSWGAEWGDA 322

Query: 320 GYIRMQRDIDA-KEGLCGIAMQASYPT 345
           GYI MQRD+     GLCGIA+  SYP 
Sbjct: 323 GYILMQRDVAGLASGLCGIALLPSYPV 349


>gi|356545112|ref|XP_003540989.1| PREDICTED: LOW QUALITY PROTEIN: KDEL-tailed cysteine endopeptidase
           CEP1-like [Glycine max]
          Length = 400

 Score =  300 bits (768), Expect = 7e-79,   Method: Compositional matrix adjust.
 Identities = 156/302 (51%), Positives = 205/302 (67%), Gaps = 7/302 (2%)

Query: 24  QSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYK 83
           Q  S++  +A  +ERHE WMAQYG+VY D AE E RF+IFK NV++I SFN  A +KP+ 
Sbjct: 100 QCRSKSRLEACTSERHEKWMAQYGKVYEDAAEMEKRFQIFKNNVQFIESFN-VAGDKPFN 158

Query: 84  LGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENA--SVPASIDWRKKGAVT 141
           + IN+F D  +EEF+A     +R++  V ++ T + SFRY +   ++PA++D RKKG VT
Sbjct: 159 IRINQFPDLHDEEFKALLINGQRKVSGVETA-TEETSFRYGSVVTNIPATMDGRKKGVVT 217

Query: 142 GVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAF 201
            +KDQG  G CWA SAVAA+EGI+ ITT KL  LS+Q+LVD    GE +GC GG ++DAF
Sbjct: 218 PIKDQGIIGSCWALSAVAAIEGIHQITTSKLMFLSKQKLVD-SVKGESEGCIGGYVEDAF 276

Query: 202 EFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVS 261
           EFI+   G+ +E  YPYK  +  C  ++   S A I GYE VPSNN+ AL+K VANQPVS
Sbjct: 277 EFIVKKGGILSETHYPYKGVN-XCKVEKETHSVAHIKGYEKVPSNNKKALLKVVANQPVS 335

Query: 262 VAIDASGSDFQFYSSGVFTGQ-CGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENG 320
           V ID     F++YSS +F  + CG++ +H V  VGYG A DG KYW VKNSWGT WG   
Sbjct: 336 VYIDVGAHAFKYYSSEIFNARNCGSDPNHVVAVVGYGKALDGAKYWPVKNSWGTEWGGKW 395

Query: 321 YI 322
           Y+
Sbjct: 396 YM 397


>gi|167521499|ref|XP_001745088.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163776702|gb|EDQ90321.1| predicted protein [Monosiga brevicollis MX1]
          Length = 294

 Score =  300 bits (767), Expect = 9e-79,   Method: Compositional matrix adjust.
 Identities = 160/305 (52%), Positives = 194/305 (63%), Gaps = 15/305 (4%)

Query: 42  WMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARN--KPYKLGINEFADQTNEEFRA 99
           + + Y + Y   A +  R   F+ N+E+I   N +       Y +G+NEFAD T +EF A
Sbjct: 1   FKSDYSKSYESEAVEAKRLAAFEANLEFINKHNAEHAQGLHSYTVGVNEFADLTIDEFMA 60

Query: 100 PRNGYKRRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVA 159
                   +PS + + T   +  Y  A+   S+DWR KGAVT +K+QGQCG CW+FS   
Sbjct: 61  ------LYVPS-KFNRTMPYNTVYLPATSEDSVDWRTKGAVTPIKNQGQCGSCWSFSTTG 113

Query: 160 AMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYK 219
           + EG + I T  L SLSEQ+LVDC  S  +QGC GGLMDDAF++IISNKGL TE  YPY 
Sbjct: 114 STEGAHAIATGNLVSLSEQQLVDCSGSFGNQGCNGGLMDDAFKYIISNKGLDTEEDYPYT 173

Query: 220 ASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVF 279
           A DG+CNK++    AA IS Y DVP NNE  L  AVA  PVSVAI+A  S FQ Y SGVF
Sbjct: 174 AQDGTCNKEKEAKHAATISSYSDVPKNNEDQLAAAVAKGPVSVAIEADQSGFQLYKSGVF 233

Query: 280 TGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAM 339
            G CGT LDHGV  VGY   DD   YW+VKNSWGTTWG  GYI M+R + A  G+CGIAM
Sbjct: 234 DGNCGTNLDHGVLVVGY--TDD---YWIVKNSWGTTWGVEGYINMKRGVSAS-GICGIAM 287

Query: 340 QASYP 344
           Q SYP
Sbjct: 288 QPSYP 292


>gi|326514800|dbj|BAJ99761.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 291

 Score =  300 bits (767), Expect = 1e-78,   Method: Compositional matrix adjust.
 Identities = 155/220 (70%), Positives = 177/220 (80%), Gaps = 6/220 (2%)

Query: 128 VPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSG 187
           VP+S+DWR+KGAVT VKDQGQCG CWAFS +AA+EGIN I T+ LTSLSEQ+LVDCDT  
Sbjct: 61  VPSSVDWRQKGAVTAVKDQGQCGSCWAFSTIAAVEGINAIRTKNLTSLSEQQLVDCDTK- 119

Query: 188 EDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGS-CNKKEANPSAA-KISGYEDVPS 245
            + GC GGLMD AF++I  + G+A E  YPYKA   S CNKK   PSA   I GYEDVP+
Sbjct: 120 SNAGCNGGLMDYAFQYIAKHGGVAAEDAYPYKARQASSCNKK---PSAVVTIDGYEDVPA 176

Query: 246 NNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKY 305
           N+E AL KAVA QPV+VAI+ASGS FQFYS GVF G+CGTELDHGV AVGYGT  DGTKY
Sbjct: 177 NDETALKKAVAAQPVAVAIEASGSHFQFYSEGVFAGKCGTELDHGVAAVGYGTTVDGTKY 236

Query: 306 WLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPT 345
           W+VKNSWG  WGE GYIRM+RD++ KEGLCGIAM+ASYP 
Sbjct: 237 WIVKNSWGPEWGEKGYIRMKRDVEDKEGLCGIAMEASYPV 276


>gi|326490904|dbj|BAJ90119.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 457

 Score =  299 bits (765), Expect = 2e-78,   Method: Compositional matrix adjust.
 Identities = 150/314 (47%), Positives = 197/314 (62%), Gaps = 8/314 (2%)

Query: 38  RHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNK-ARNKP----YKLGINEFADQ 92
           + E W A++G+ Y    E+  R   F EN  ++A+ N+  A + P    Y L +N FAD 
Sbjct: 38  QFEAWCAEHGKAYATPGERAARLAAFAENAAFVAAHNDAVASSGPGGPSYTLALNAFADL 97

Query: 93  TNEEFRAPRNGYKRRLPS-VRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQCGC 151
           T++EFRA R G     P  + +   +D  F     +VP ++DWR+ GAVT VKDQG CG 
Sbjct: 98  THDEFRAARLGRLAVGPGPLGAPSPSDGGFEGRVGAVPDALDWRQSGAVTKVKDQGSCGA 157

Query: 152 CWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLA 211
           CW+FSA  AMEGIN ITT  L SLSEQEL+DCD S  + GC GGLM  A++F+I N G+ 
Sbjct: 158 CWSFSATGAMEGINKITTGSLLSLSEQELIDCDRS-YNTGCGGGLMTYAYKFVIKNGGID 216

Query: 212 TEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDF 271
           TE  YP++ +DG+CNK +       I GY++VPS+ E  L++AVA QP+SV I  S   F
Sbjct: 217 TEDDYPFREADGTCNKNKLKKHVVTIDGYKEVPSSKEDLLLQAVAQQPISVGICGSARAF 276

Query: 272 QFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAK 331
           Q YS G+F G C T LDH V  VGYG+ + G  YW+VKNSWG  WG  GY+ M R+  + 
Sbjct: 277 QLYSQGIFDGPCPTSLDHAVLIVGYGS-EGGKDYWIVKNSWGERWGMKGYMHMHRNTGSS 335

Query: 332 EGLCGIAMQASYPT 345
            G+CGI M AS+PT
Sbjct: 336 SGICGINMMASFPT 349


>gi|57118005|gb|AAW34134.1| cysteine protease gp2a [Zingiber officinale]
          Length = 381

 Score =  299 bits (765), Expect = 2e-78,   Method: Compositional matrix adjust.
 Identities = 154/308 (50%), Positives = 197/308 (63%), Gaps = 17/308 (5%)

Query: 48  RVYRDNAEK-----EMRFKIFKENVEYIASFNNKARN--KPYKLGINEFADQTNEEFRAP 100
           RV    AEK     E R ++FKEN++++   N  A      + LG+N FAD TNEE+R  
Sbjct: 57  RVKNHPAEKYLDLNEYRLEVFKENLQFVDEHNAAADRGEHTFLLGMNRFADLTNEEYRTR 116

Query: 101 RNGYKRRLPSVRSSETTDVSFRY---ENASVPASIDWRKKGAVTGVKDQGQCGCCWAFSA 157
              + R    +R S +  +S RY   E   +P SIDWR+ GAV  VK+QG CG CWAFS 
Sbjct: 117 ---FLRDFSRLRRSASGKISSRYRLREGDDLPDSIDWRENGAVVPVKNQGGCGSCWAFST 173

Query: 158 VAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYP 217
           VAA+EGIN I T  L SLSEQ+LVDC T+  + GC GG M+ AF+FI++N G+ +E  YP
Sbjct: 174 VAAVEGINQIVTGDLISLSEQQLVDCTTA--NHGCRGGWMNPAFQFIVNNGGINSEETYP 231

Query: 218 YKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSG 277
           Y+  +G CN    N     I  YE+VPS+NE +L KAVANQPVSV +DA+G DFQ Y SG
Sbjct: 232 YRGQNGICNST-VNAPVVSIDSYENVPSHNEQSLQKAVANQPVSVTMDAAGRDFQLYRSG 290

Query: 278 VFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGI 337
           +FTG C    +H +T VGYGT +D   +W+VKNSWG  WGE+GYIR +R+I+   G CGI
Sbjct: 291 IFTGSCNISANHALTVVGYGTEND-KDFWIVKNSWGKNWGESGYIRAERNIENPNGKCGI 349

Query: 338 AMQASYPT 345
              ASYP 
Sbjct: 350 TRFASYPV 357


>gi|219687002|dbj|BAH08632.1| daikon cysteine protease RD21 [Raphanus sativus]
          Length = 289

 Score =  298 bits (763), Expect = 3e-78,   Method: Compositional matrix adjust.
 Identities = 144/218 (66%), Positives = 167/218 (76%), Gaps = 2/218 (0%)

Query: 127 SVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTS 186
           ++P S+DWRK+GAV  VKDQG CG CWAFS + A+EGIN I T  L SLSEQELVDCDTS
Sbjct: 2   AIPESVDWRKEGAVAAVKDQGSCGSCWAFSTIGAVEGINKIVTGDLISLSEQELVDCDTS 61

Query: 187 GEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSN 246
             +QGC GGLMD AFEFII N G+ TE  YPYKA+DG C++   N     I  YEDVP N
Sbjct: 62  -YNQGCNGGLMDYAFEFIIKNGGIDTEEDYPYKAADGRCDQNRKNAKVVTIDAYEDVPEN 120

Query: 247 NEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYW 306
           NEAAL KA+ANQP+SVAI+A G  FQ YSSGVF G CGTELDHGV AVGYGT ++G  YW
Sbjct: 121 NEAALKKALANQPISVAIEAGGRAFQLYSSGVFDGTCGTELDHGVVAVGYGT-ENGKDYW 179

Query: 307 LVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
           +V+NSWG +WGE+GYI+M R+I    G CGIAM+ASYP
Sbjct: 180 IVRNSWGGSWGESGYIKMARNIAEATGKCGIAMEASYP 217


>gi|57118007|gb|AAW34135.1| cysteine protease gp2b [Zingiber officinale]
          Length = 379

 Score =  296 bits (759), Expect = 8e-78,   Method: Compositional matrix adjust.
 Identities = 150/294 (51%), Positives = 192/294 (65%), Gaps = 12/294 (4%)

Query: 57  EMRFKIFKENVEYIASFNNKARN--KPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSS 114
           E R ++FKEN++++   N  A      ++LG+N FAD TNEE+R     + R    +R S
Sbjct: 69  EYRLEVFKENLQFVDKHNAAADRGEHTFRLGMNRFADLTNEEYRTR---FLRDFSRLRRS 125

Query: 115 ETTDVSFRY---ENASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRK 171
            +  +S RY   E   +P SIDWR+KGAV  VK+QG CG CWAFS VAA+EGIN I T  
Sbjct: 126 ASGKISSRYRLREGDDLPDSIDWREKGAVVPVKNQGGCGSCWAFSTVAAVEGINQIVTGD 185

Query: 172 LTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEAN 231
           L SLSEQ+LVDC T+  + GC GG M+ AF+FI++N G+ +E  YPY+  +G CN    N
Sbjct: 186 LISLSEQQLVDCTTA--NHGCRGGWMNPAFQFIVNNGGINSEETYPYRGQNGICNST-VN 242

Query: 232 PSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGV 291
                I  YE+VPS+NE +L KAVANQPVSV +DA+G DFQ Y SG+FTG C    +H +
Sbjct: 243 APVVSIDSYENVPSHNEQSLQKAVANQPVSVTMDAAGRDFQLYRSGIFTGSCNISANHAL 302

Query: 292 TAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPT 345
           T VGYGT +D   Y  VKNSWG  WGE+GYIR++R+I    G CGI   ASYP 
Sbjct: 303 TVVGYGTEND-KDYRTVKNSWGKNWGESGYIRVERNIGNPNGKCGITRFASYPV 355


>gi|310656788|gb|ADP02217.1| Peptidase_C1 domain-containing protein [Triticum aestivum]
          Length = 294

 Score =  296 bits (758), Expect = 1e-77,   Method: Compositional matrix adjust.
 Identities = 153/323 (47%), Positives = 195/323 (60%), Gaps = 58/323 (17%)

Query: 27  SRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGI 86
           +R L DA M ERHE WM ++ RVY+DNAEK   F++FK NV +I SFN  ARN  + LG+
Sbjct: 25  ARELADAAMVERHEQWMVKFNRVYKDNAEKVRWFEVFKANVAFIESFN--ARNHKFWLGV 82

Query: 87  NEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENAS---VPASIDWRKKGAVTGV 143
           N+F D TN+EF+A +     +    R+S      F+Y N S   +P ++DWR KGA+T +
Sbjct: 83  NQFTDLTNDEFKATKTNKGLK----RTSSRAPTRFKYNNVSTDALPTAVDWRTKGAITPI 138

Query: 144 KDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQG-CEGGLMDDAFE 202
           K                                            DQG C+G     AF+
Sbjct: 139 K--------------------------------------------DQGQCDG----QAFK 150

Query: 203 FIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSV 262
           FII    L +EA YPY A DG C    A+ + A I GYEDVP+N+E++LMKAVANQPVSV
Sbjct: 151 FIIKIGSLTSEANYPYTAQDGQCKTSIASNNVATIKGYEDVPANDESSLMKAVANQPVSV 210

Query: 263 AIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYI 322
           A+D   + FQ YS G  TG CGT+LDHG+ A+GYG   DGTKYWL+KNSWGTTWGE+GY+
Sbjct: 211 AVDGGDAIFQHYSGGAMTGSCGTDLDHGIAAIGYGMTSDGTKYWLLKNSWGTTWGESGYL 270

Query: 323 RMQRDIDAKEGLCGIAMQASYPT 345
           RM++DI  K G+CG+AMQ SYPT
Sbjct: 271 RMEKDISDKSGMCGLAMQPSYPT 293


>gi|57118011|gb|AAW34137.1| cysteine protease gp3b [Zingiber officinale]
          Length = 466

 Score =  296 bits (757), Expect = 1e-77,   Method: Compositional matrix adjust.
 Identities = 153/311 (49%), Positives = 205/311 (65%), Gaps = 11/311 (3%)

Query: 39  HEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARN--KPYKLGINEFADQTNEE 96
           ++ W A++     D    + R ++FKEN+ ++   N  A      Y+LG+N FAD TNEE
Sbjct: 43  YQEWRAKHRPAENDQYVGDYRLEVFKENLRFVDEHNAAADRGEHAYRLGMNRFADLTNEE 102

Query: 97  FRAPRNGYKRRLPSVRSSETTDVSFRY---ENASVPASIDWRKKGAVTGVKDQGQCGCCW 153
           +RA    + R L  +  S + ++S +Y   E   +P SIDWR+KGAV  VK QG+CG CW
Sbjct: 103 YRAR---FLRDLSRLGRSTSGEISNQYRLREGDVLPDSIDWREKGAVVAVKSQGRCGSCW 159

Query: 154 AFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATE 213
           AF+A+A +EGIN I T  L SLSEQ+LVDC T   + GCEGG    AF++II+N G+ +E
Sbjct: 160 AFAAIATVEGINQIVTGDLISLSEQQLVDCST--RNHGCEGGWPYRAFQYIINNGGVNSE 217

Query: 214 AKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQF 273
             YPY  ++G+CN  + N     I  Y +VPSN+E +L KAVANQP+SV I+ASG +FQ 
Sbjct: 218 EHYPYTGTNGTCNTTKGNAHVVSIDSYRNVPSNDEKSLQKAVANQPISVGINASGRNFQL 277

Query: 274 YSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEG 333
           Y SG+FTG C T L+HGVT VGYGT  +G  YW+VKNSWG +WG++GYI M+R+I    G
Sbjct: 278 YHSGIFTGSCNTSLNHGVTVVGYGTV-NGNDYWIVKNSWGESWGDSGYILMERNIAESSG 336

Query: 334 LCGIAMQASYP 344
            CGIA+  SYP
Sbjct: 337 KCGIAISPSYP 347


>gi|162463334|ref|NP_001104878.1| maize insect resistance2 precursor [Zea mays]
 gi|2425064|gb|AAB88262.1| cysteine proteinase Mir2 [Zea mays]
          Length = 493

 Score =  296 bits (757), Expect = 1e-77,   Method: Compositional matrix adjust.
 Identities = 154/296 (52%), Positives = 194/296 (65%), Gaps = 17/296 (5%)

Query: 59  RFKIFKENVEYIASFNNKARN--KPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSET 116
           R ++F++N+ YI + N +A      ++LG+  FAD T EE+RA      R L   R    
Sbjct: 92  RLEVFRDNLRYIDAHNAEADAGLHGFRLGLTRFADLTLEEYRA------RLLLGSRGRNG 145

Query: 117 TDVSF----RY---ENASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITT 169
           T V      RY       +P ++DWR++GAV  VKDQGQCG CWAFSAVAA+EGIN I T
Sbjct: 146 TAVGVVGRRRYLPLAGEQLPDAVDWRERGAVAEVKDQGQCGGCWAFSAVAAVEGINKIVT 205

Query: 170 RKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKE 229
             L SLSEQEL+DCD   +DQGC+GGLMD+AF F+I N G+ TEA YP+   DG+C+ K 
Sbjct: 206 GSLISLSEQELIDCDKF-QDQGCDGGLMDNAFVFMIKNGGIDTEADYPFTGHDGTCDLKL 264

Query: 230 ANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDH 289
            N     I  +E VP N E AL KAVA+QPVS +I+AS   FQ YSSG+F G+CGT LDH
Sbjct: 265 KNTRVVSIDSFERVPINYERALQKAVAHQPVSASIEASRRAFQLYSSGIFDGRCGTYLDH 324

Query: 290 GVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPT 345
           GVT VGYG+ + G  YW+VKNSWGT WGE GY+RM R++  +    GIAM+  YP 
Sbjct: 325 GVTVVGYGS-EGGKDYWIVKNSWGTQWGEAGYVRMARNVRVRPPSAGIAMEPLYPV 379


>gi|302763831|ref|XP_002965337.1| hypothetical protein SELMODRAFT_230602 [Selaginella moellendorffii]
 gi|300167570|gb|EFJ34175.1| hypothetical protein SELMODRAFT_230602 [Selaginella moellendorffii]
          Length = 343

 Score =  295 bits (755), Expect = 2e-77,   Method: Compositional matrix adjust.
 Identities = 160/358 (44%), Positives = 228/358 (63%), Gaps = 30/358 (8%)

Query: 1   MAMILLENKLVLAAILVLGVWAPQSWSR--TLNDATMNERHEM---WMAQYGRVYRDNAE 55
           MA  ++ + L+L  ++V+G   P + +R   L D    E   M   W A++G+ Y  + E
Sbjct: 1   MASNMIASTLIL--LVVVGA-TPFAIARPAALEDGRALEIKNMFEDWAAKHGKSYSSDLE 57

Query: 56  KEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNG------YKRRLP 109
           K  R  IF + + YI   N +  N  + LG+N+F+D TN EFRA   G      Y+ RLP
Sbjct: 58  KARRLMIFSDTLAYIEKHNAQP-NTTFTLGLNKFSDLTNAEFRAMHVGKFKRPRYQDRLP 116

Query: 110 SVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITT 169
           +    E  DVS      S+P S+DWR+KGAVT +KDQG CG CWAFSA+A++E  + + T
Sbjct: 117 A--EDEDVDVS------SLPTSLDWRQKGAVTPIKDQGDCGSCWAFSAIASIESAHFLAT 168

Query: 170 RKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSC--NK 227
           ++L SLSEQ+L+DCDT   D GC+GGLM+ AF+F++ N G+ TEA YPY  S GSC  NK
Sbjct: 169 KELVSLSEQQLMDCDTV--DAGCDGGLMETAFKFVVKNGGVTTEASYPYTGSVGSCNANK 226

Query: 228 KEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTEL 287
                  A+I+G++ V  ++  ALMKAV+  PV+V+I  S  +FQ Y SG+ +GQCG  L
Sbjct: 227 VAIINKVAEITGFKVVTEDSADALMKAVSKTPVTVSICGSDENFQNYKSGILSGQCGDSL 286

Query: 288 DHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPT 345
           DHGV  +GYGT + G  YW++KNSWGT+WGE+G+++++R     +G+CG+   +SYPT
Sbjct: 287 DHGVLLIGYGT-EGGMPYWIIKNSWGTSWGEDGFMKIER--KDGDGICGMNGDSSYPT 341


>gi|57118009|gb|AAW34136.1| cysteine protease gp3a [Zingiber officinale]
          Length = 475

 Score =  294 bits (753), Expect = 4e-77,   Method: Compositional matrix adjust.
 Identities = 153/311 (49%), Positives = 204/311 (65%), Gaps = 11/311 (3%)

Query: 39  HEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARN--KPYKLGINEFADQTNEE 96
           ++ W  ++     D    + R ++FKEN+ ++   N  A      Y+LG+N FAD TNEE
Sbjct: 52  YQEWRVKHRPAENDQYVGDYRLEVFKENLRFVDEHNAAADRGEHAYRLGMNRFADLTNEE 111

Query: 97  FRAPRNGYKRRLPSVRSSETTDVSFRY---ENASVPASIDWRKKGAVTGVKDQGQCGCCW 153
           +RA    + R L  +  S + ++S +Y   E   +P SIDWR+KGAV  VK+QG+CG CW
Sbjct: 112 YRAR---FLRDLSRLGRSTSGEISNQYRLREGDVLPDSIDWREKGAVVAVKNQGRCGSCW 168

Query: 154 AFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATE 213
           AF+A+AA+EGIN I T  L SLSEQ+LVDC T   + GCEGG    AF++II+N G+ +E
Sbjct: 169 AFAAIAAVEGINQIVTGDLISLSEQQLVDCST--RNYGCEGGWPYRAFQYIINNGGVNSE 226

Query: 214 AKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQF 273
             YPY  ++G+CN  + N     I  Y +VPSN+E +L KA ANQP+SV IDASG +FQ 
Sbjct: 227 EHYPYTGTNGTCNTTKENAHVVSIDSYRNVPSNDEKSLQKAAANQPISVGIDASGRNFQL 286

Query: 274 YSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEG 333
           Y SG+FTG C T L+HGVT VGYGT ++G  YW+VKNSWG  WG +GYI M+R+I    G
Sbjct: 287 YHSGIFTGSCNTSLNHGVTVVGYGT-ENGNDYWIVKNSWGENWGNSGYILMERNIAESSG 345

Query: 334 LCGIAMQASYP 344
            CGIA+  SYP
Sbjct: 346 KCGIAISPSYP 356


>gi|302790828|ref|XP_002977181.1| hypothetical protein SELMODRAFT_106402 [Selaginella moellendorffii]
 gi|300155157|gb|EFJ21790.1| hypothetical protein SELMODRAFT_106402 [Selaginella moellendorffii]
          Length = 337

 Score =  294 bits (752), Expect = 5e-77,   Method: Compositional matrix adjust.
 Identities = 147/312 (47%), Positives = 208/312 (66%), Gaps = 20/312 (6%)

Query: 40  EMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRA 99
           E W A++G+ Y  + EK  R  IF + + YI   N +  N  + LG+N+F+D TN EFRA
Sbjct: 38  EDWAAKHGKSYSSDWEKARRLMIFSDTLAYIEKHNAQP-NTTFTLGLNKFSDLTNAEFRA 96

Query: 100 PRNG------YKRRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQCGCCW 153
              G      Y+ RLP+    E  DVS      S+P S+DWR+KGAVT +KDQG CG CW
Sbjct: 97  MHVGKFKRPRYQDRLPA--EDEDVDVS------SLPTSLDWRQKGAVTPIKDQGDCGSCW 148

Query: 154 AFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATE 213
           AFSA+A++E  + + T++L SLSEQ+L+DCDT   D GC+GGLM+ AF+F++ N G+ TE
Sbjct: 149 AFSAIASIESAHFLATKELVSLSEQQLMDCDTV--DAGCDGGLMETAFKFVVKNGGVTTE 206

Query: 214 AKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQF 273
           A YPY  S GSCN  +A    A+I+G++ V  ++  ALMKAV+  PV+V+I  S  +FQ 
Sbjct: 207 AAYPYTGSVGSCNANKAKNKVAEITGFKVVTEDSADALMKAVSKTPVTVSICGSDENFQN 266

Query: 274 YSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEG 333
           Y SG+ +G+C   LDHGV  +GYGT + G  YW++KNSWGT+WGE+G+++++R     +G
Sbjct: 267 YKSGILSGKCDDSLDHGVLLIGYGT-EGGMPYWIIKNSWGTSWGEDGFMKIER--KDGDG 323

Query: 334 LCGIAMQASYPT 345
           +CG+   +SYPT
Sbjct: 324 MCGMNGDSSYPT 335


>gi|194352758|emb|CAQ00107.1| papain-like cysteine proteinase [Hordeum vulgare subsp. vulgare]
          Length = 457

 Score =  294 bits (752), Expect = 5e-77,   Method: Compositional matrix adjust.
 Identities = 148/312 (47%), Positives = 195/312 (62%), Gaps = 8/312 (2%)

Query: 38  RHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNK-ARNKP----YKLGINEFADQ 92
           + E W A++G+ Y    E+  R   F EN  ++A+ N+  A + P    Y L +N FAD 
Sbjct: 38  QFEAWCAEHGKAYATPGERAARLAAFAENAAFVAAHNDAVASSGPGGPSYTLALNAFADL 97

Query: 93  TNEEFRAPRNGYKRRLPS-VRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQCGC 151
           T++EFRA R G     P  + +   +D  F     +VP ++DWR+ GAVT VKDQG CG 
Sbjct: 98  THDEFRAARLGRLAVGPGPLGAPSPSDGGFEGRVGAVPDALDWRQSGAVTKVKDQGSCGA 157

Query: 152 CWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLA 211
           CW+FSA  AMEGIN ITT  L SLSEQEL+DCD S  + GC GGLM  A++F+I N G+ 
Sbjct: 158 CWSFSATGAMEGINKITTGSLLSLSEQELIDCDRS-YNTGCGGGLMTYAYKFVIKNGGID 216

Query: 212 TEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDF 271
           TE  YP++ +DG+CNK +       I GY++VPS+ E  L++AVA QP+SV I  S   F
Sbjct: 217 TEDDYPFREADGTCNKNKLKKHVVTIDGYKEVPSSKEDLLLQAVAQQPISVGICGSARAF 276

Query: 272 QFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAK 331
           Q YS G+F G C T LDH V  VGYG+ + G  YW+VKNSWG  WG  GY+ M R+  + 
Sbjct: 277 QLYSQGIFDGPCPTSLDHAVLIVGYGS-EGGKDYWIVKNSWGERWGMKGYMHMHRNTGSS 335

Query: 332 EGLCGIAMQASY 343
            G+CGI M AS+
Sbjct: 336 SGICGINMMASF 347


>gi|260516678|gb|ACX43965.1| cysteine protease 1 [Brachiaria hybrid cultivar]
          Length = 338

 Score =  294 bits (752), Expect = 5e-77,   Method: Compositional matrix adjust.
 Identities = 161/317 (50%), Positives = 199/317 (62%), Gaps = 15/317 (4%)

Query: 31  NDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFA 90
           ++  + +    +M QY + Y  +AE   RF  FK NVE I   +N   N  Y +G+NEFA
Sbjct: 34  SEVMLQDMFTAFMKQYSKAY-SHAEFSSRFNQFKANVETI-RLHNTLANASYTMGLNEFA 91

Query: 91  DQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQCG 150
           D + EEF+    GYK     V        +   E  + P SIDWR   AVT +KDQGQCG
Sbjct: 92  DLSFEEFKGKYFGYKH----VEREFARSNNLHQEVEAAPTSIDWRTSNAVTPIKDQGQCG 147

Query: 151 CCWAFSAVAAMEGINHITTRK-LTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKG 209
            CWAFSA  ++EG   +  +  LTSLSEQ+LVDC TS  D GC GGLMD AFE+II+NKG
Sbjct: 148 SCWAFSATGSIEGAWVLQGKHTLTSLSEQQLVDCSTSYGDAGCNGGLMDYAFEYIIANKG 207

Query: 210 LATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVAN-QPVSVAIDASG 268
           +  E+ YPYK   G C K  +      ISGY+DV S +EA+L+ AV    PVSVAI+A  
Sbjct: 208 ICAESAYPYKGVGGLCQK--SCTKVVTISGYKDVASGDEASLLNAVGTVGPVSVAIEADQ 265

Query: 269 SDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDI 328
           + FQFYSSGVF+G CG  LDHGV AVGYGT      YW+VKNSWGT+WGE+GYIRM R+ 
Sbjct: 266 AGFQFYSSGVFSGTCGHNLDHGVLAVGYGTTGS-QDYWIVKNSWGTSWGESGYIRMIRN- 323

Query: 329 DAKEGLCGIAMQASYPT 345
              +  CGIA+Q SYPT
Sbjct: 324 ---KNQCGIAIQPSYPT 337


>gi|260516654|gb|ACX43954.1| cysteine protease 1 [Brachiaria hybrid cultivar]
 gi|260516656|gb|ACX43955.1| cysteine protease 1 [Brachiaria hybrid cultivar]
 gi|260516658|gb|ACX43956.1| cysteine protease 1 [Brachiaria hybrid cultivar]
 gi|260516660|gb|ACX43957.1| cysteine protease 1 [Brachiaria hybrid cultivar]
 gi|260516662|gb|ACX43958.1| cysteine protease 2 [Brachiaria hybrid cultivar]
 gi|260516664|gb|ACX43959.1| cysteine protease 2 [Brachiaria hybrid cultivar]
 gi|260516666|gb|ACX43960.1| cysteine protease 2 [Brachiaria hybrid cultivar]
 gi|260516668|gb|ACX43961.1| cysteine protease 2 [Brachiaria hybrid cultivar]
 gi|260516670|gb|ACX43962.1| cysteine protease 2 [Brachiaria hybrid cultivar]
          Length = 338

 Score =  292 bits (747), Expect = 2e-76,   Method: Compositional matrix adjust.
 Identities = 160/317 (50%), Positives = 199/317 (62%), Gaps = 15/317 (4%)

Query: 31  NDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFA 90
           ++  + +    +M QY + Y  +AE   RF  FK NVE I   +N   N  Y +G+NEFA
Sbjct: 34  SEVMLQDMFTAFMKQYSKAY-SHAEFSSRFNQFKANVETI-RLHNTLANASYTMGLNEFA 91

Query: 91  DQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQCG 150
           D + EEF+    GYK     V        +   E  + P SIDWR   AVT +KDQGQCG
Sbjct: 92  DLSFEEFKGKYFGYKH----VEREFARSNNLHQEVEAAPTSIDWRTSNAVTPIKDQGQCG 147

Query: 151 CCWAFSAVAAMEGINHITTRK-LTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKG 209
            CWAFSA  ++EG   +  +  LTSLSEQ+LVDC TS  + GC GGLMD AFE+II+NKG
Sbjct: 148 SCWAFSATGSIEGAWVLQGKHTLTSLSEQQLVDCSTSYGNAGCNGGLMDYAFEYIIANKG 207

Query: 210 LATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVAN-QPVSVAIDASG 268
           +  E+ YPYK   G C K  +      ISGY+DV S +EA+L+ AV    PVSVAI+A  
Sbjct: 208 ICAESAYPYKGVGGLCQK--SCTKVVTISGYKDVASGDEASLLNAVGTVGPVSVAIEADQ 265

Query: 269 SDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDI 328
           + FQFYSSGVF+G CG  LDHGV AVGYGT      YW+VKNSWGT+WGE+GYIRM R+ 
Sbjct: 266 AGFQFYSSGVFSGTCGHNLDHGVLAVGYGTTGS-QDYWIVKNSWGTSWGESGYIRMIRNK 324

Query: 329 DAKEGLCGIAMQASYPT 345
           +     CGIA+Q SYPT
Sbjct: 325 NQ----CGIAIQPSYPT 337


>gi|226505708|ref|NP_001141813.1| uncharacterized protein LOC100273952 precursor [Zea mays]
 gi|194706024|gb|ACF87096.1| unknown [Zea mays]
 gi|413945958|gb|AFW78607.1| hypothetical protein ZEAMMB73_489507 [Zea mays]
          Length = 460

 Score =  291 bits (746), Expect = 2e-76,   Method: Compositional matrix adjust.
 Identities = 152/323 (47%), Positives = 195/323 (60%), Gaps = 14/323 (4%)

Query: 35  MNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARN------------KPY 82
           +  + + W A++G+ Y    E+  R  +F +N  ++A+ N +A                Y
Sbjct: 32  IEAQFDAWCAEHGKAYATPEERAARLAVFADNAAFVAAHNARAGANAAGGGGGGAAPPSY 91

Query: 83  KLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTG 142
            L +N FAD T+EEFRA R G      ++RS            A+VP ++DWRK GAVT 
Sbjct: 92  TLALNAFADLTHEEFRAARLGRIAPGAALRSRAAPVYWGLGGGAAVPDALDWRKSGAVTK 151

Query: 143 VKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFE 202
           VKDQG CG CW+FSA  AMEGIN I T  L SLSEQEL+DCD S  + GC GGLMD A++
Sbjct: 152 VKDQGSCGACWSFSATGAMEGINKIKTGSLVSLSEQELIDCDRS-YNSGCGGGLMDYAYK 210

Query: 203 FIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSV 262
           F+I N G+ TE  YPY+ +DG+CNK +       I GY DVPSN E  L++AVA QPVSV
Sbjct: 211 FVIKNGGIDTEEDYPYREADGTCNKNKLKKRVVTIDGYTDVPSNKEDLLLQAVAQQPVSV 270

Query: 263 AIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYI 322
            I  S   FQ Y  G+F G C T LDH V  VGYG+ + G  YW+VKNSWG +WG  GY+
Sbjct: 271 GICGSARAFQLYYQGIFDGPCPTSLDHAVLIVGYGS-EGGKDYWIVKNSWGESWGMKGYM 329

Query: 323 RMQRDIDAKEGLCGIAMQASYPT 345
            M R+    +G+CGI M AS+PT
Sbjct: 330 HMHRNTGDSKGVCGINMMASFPT 352


>gi|159479072|ref|XP_001697622.1| cysteine endopeptidase [Chlamydomonas reinhardtii]
 gi|158274232|gb|EDP00016.1| cysteine endopeptidase [Chlamydomonas reinhardtii]
          Length = 469

 Score =  291 bits (746), Expect = 3e-76,   Method: Compositional matrix adjust.
 Identities = 153/311 (49%), Positives = 195/311 (62%), Gaps = 12/311 (3%)

Query: 42  WMAQYGRVY-RDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAP 100
           W   + R Y  D AE E RFK++ EN+EY+ ++N  AR   + L +N  AD +  E+++ 
Sbjct: 16  WAQTHSRSYVNDVAEFENRFKVWLENLEYVLAYN--ARTTSHWLTLNHLADLSTPEYKSK 73

Query: 101 RNGYKRRLPSVRSSETTDVSFRYENA---SVPASIDWRKKGAVTGVKDQGQCGCCWAFSA 157
             G+  +    R+   T   FRYE+    ++P +IDWRKK AV  VK+QGQCG CWAF+ 
Sbjct: 74  LLGFDNQARVARNKLKT--GFRYEDVDAEALPPAIDWRKKNAVAEVKNQGQCGSCWAFAT 131

Query: 158 VAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYP 217
             ++EGIN I T  L SLSEQELVDCDT  +D+GC GGLMD A+ +II NKG+ TE  YP
Sbjct: 132 TGSVEGINAIVTGSLVSLSEQELVDCDTE-QDKGCSGGLMDYAYAWIIKNKGINTEEDYP 190

Query: 218 YKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSG 277
           Y A DG C+  +       I  YEDVP N+E AL KA A+QPV+VAI+A    FQ Y  G
Sbjct: 191 YTAMDGQCDVAKMKRRVVTIDSYEDVPENDEVALKKAAAHQPVAVAIEADAKSFQLYGGG 250

Query: 278 VFTG-QCGTELDHGVTAVGYG--TADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGL 334
           V+    CGT L+HGV  VGYG      G+ YW+VKNSWG  WG+ GYIR++      EGL
Sbjct: 251 VYDDPTCGTSLNHGVLVVGYGKDVTGSGSNYWIVKNSWGAEWGDAGYIRLKMGSTDAEGL 310

Query: 335 CGIAMQASYPT 345
           CGIAM  SYP 
Sbjct: 311 CGIAMAPSYPV 321


>gi|156142226|gb|ABU51882.1| ervatamin-C precursor [Tabernaemontana divaricata]
          Length = 365

 Score =  291 bits (744), Expect = 4e-76,   Method: Compositional matrix adjust.
 Identities = 156/317 (49%), Positives = 201/317 (63%), Gaps = 18/317 (5%)

Query: 32  DATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFAD 91
           D  + E +E+W+A++ +VY    E E RF+IFK+N+++I   N++  N  YK+G+  + D
Sbjct: 38  DEEVKEIYELWLAKHDKVYSGLVEYEKRFEIFKDNLKFIDEHNSE--NHTYKMGLTPYTD 95

Query: 92  QTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENAS---VPASIDWRKKGAVTGVKDQGQ 148
            TNEEF+A   G  R     R   T ++S RY   +   +P  IDWRKKGAVT VK+QG+
Sbjct: 96  LTNEEFQAIYLG-TRSDTIHRLKRTINISERYAYEAGDNLPEQIDWRKKGAVTPVKNQGK 154

Query: 149 CGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNK 208
           CG CWAFS V+ +E IN I T  L SLSEQ+LVDC+   ++ GC+GG    A+++II N 
Sbjct: 155 CGSCWAFSTVSTVESINQIRTGNLISLSEQQLVDCNK--KNHGCKGGAFVYAYQYIIDNG 212

Query: 209 GLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASG 268
           G+ TEA YPYKA  G C    A     +I GY+ VP  NE AL KAVA+QP  VAIDAS 
Sbjct: 213 GIDTEANYPYKAVQGPC---RAAKKVVRIDGYKGVPHCNENALKKAVASQPSVVAIDASS 269

Query: 269 SDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDI 328
             FQ Y SG+F+G CGT+L+HGV  VGY        YW+V+NSWG  WGE GYIRM+R  
Sbjct: 270 KQFQHYKSGIFSGPCGTKLNHGVVIVGY-----WKDYWIVRNSWGRYWGEQGYIRMKRVG 324

Query: 329 DAKEGLCGIAMQASYPT 345
               GLCGIA    YPT
Sbjct: 325 GC--GLCGIARLPYYPT 339


>gi|449530091|ref|XP_004172030.1| PREDICTED: vignain-like [Cucumis sativus]
          Length = 351

 Score =  291 bits (744), Expect = 4e-76,   Method: Compositional matrix adjust.
 Identities = 144/325 (44%), Positives = 207/325 (63%), Gaps = 20/325 (6%)

Query: 32  DATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFAD 91
           + ++ + ++ W + + R+ R+  E   RFK+FK N +++   N     K  KL +N+FAD
Sbjct: 34  EKSLMQLYKRWSSHH-RISRNANEMHNRFKVFKNNAKHVFKVN--LMGKSLKLKLNQFAD 90

Query: 92  QTNEEFRAPRNGYKRRLPSVRSSETTDVS--------FRYENAS-VPASIDWRKKGAVTG 142
            +++EFR   N Y   +   +      +         F YE+A+ +P+SIDWRKKGAV  
Sbjct: 91  MSDDEFR---NMYSSNITYYKDLHAKKIEATGGRIGGFMYEHANNIPSSIDWRKKGAVNA 147

Query: 143 VKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFE 202
           +K+QG+CG CWAF+AVAA+E I+ I T +L SLSE+E++DCD    D GC GG  + AFE
Sbjct: 148 IKNQGRCGSCWAFAAVAAVESIHQIKTNELVSLSEEEVLDCDY--RDGGCRGGFYNSAFE 205

Query: 203 FIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSV 262
           F++ N G+  E  YPY   +G C ++       +I GYE+VP NNE ALMKAVA+QPV+V
Sbjct: 206 FMMDNDGVTIEDNYPYYEGNGYCRRRGGRNKRVRIDGYENVPRNNEYALMKAVAHQPVAV 265

Query: 263 AIDASGSDFQFYSSGVFTGQ--CGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENG 320
           AI + GSDF+FY  G+FT    CG  +DH V  VGYGT +DG  YW+++N +G  WG NG
Sbjct: 266 AIASGGSDFKFYGGGMFTENDFCGFNIDHTVVVVGYGTDEDGD-YWIIRNQYGHRWGMNG 324

Query: 321 YIRMQRDIDAKEGLCGIAMQASYPT 345
           Y++MQR   + +G+CG+AMQ +YP 
Sbjct: 325 YMKMQRGAHSPQGVCGMAMQPAYPV 349


>gi|226499884|ref|NP_001148278.1| thiol protease SEN102 precursor [Zea mays]
 gi|195617112|gb|ACG30386.1| thiol protease SEN102 precursor [Zea mays]
          Length = 374

 Score =  290 bits (743), Expect = 6e-76,   Method: Compositional matrix adjust.
 Identities = 157/333 (47%), Positives = 209/333 (62%), Gaps = 21/333 (6%)

Query: 31  NDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNK--PYKLGINE 88
           +D++M ER + W A Y + Y   AE+  RF+++  N+ YI + N +A      Y+LG   
Sbjct: 42  DDSSMIERFQRWKAAYNKSYATVAEERRRFRVYARNMAYIEATNAEAEAAGLTYELGETA 101

Query: 89  FADQTNEEFRAPRNGYK-RRLPSVRSSETTDVS--------------FRYENASVPASID 133
           + D TN+EF A        +LP+  S  TT                 +   +AS PAS+D
Sbjct: 102 YTDLTNQEFMAMYTAPALAQLPADESVITTRAGPVDAVGGAPGQLPVYVNLSASAPASVD 161

Query: 134 WRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCE 193
           WR  GAVT VK+QG+CG CWAFS VA +EGI  I T KL SLSEQELVDCDT   D GC+
Sbjct: 162 WRASGAVTPVKNQGRCGSCWAFSTVAVVEGIYQIRTGKLVSLSEQELVDCDT--LDDGCD 219

Query: 194 GGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMK 253
           GG+   A  +I SN G+ TEA YPY  +  +CN+ + + +A  I+G   V + +EA+L  
Sbjct: 220 GGISYRALRWIASNGGITTEADYPYTGTTDACNRAKLSHNAVSIAGLRRVATRSEASLAN 279

Query: 254 AVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGT-ADDGTKYWLVKNSW 312
           AVA QPV+V+I+A G +FQ Y  GV+ G CGT L+HGVT VGYG  A  G +YW+VKNSW
Sbjct: 280 AVAGQPVAVSIEAGGDNFQHYKKGVYNGPCGTNLNHGVTVVGYGQEAAAGDRYWIVKNSW 339

Query: 313 GTTWGENGYIRMQRDIDAK-EGLCGIAMQASYP 344
           G  WG++GYIRM++D+  K EGLCGIA++ SYP
Sbjct: 340 GQGWGDDGYIRMKKDVAGKPEGLCGIAIRPSYP 372


>gi|307110445|gb|EFN58681.1| hypothetical protein CHLNCDRAFT_56822 [Chlorella variabilis]
          Length = 466

 Score =  290 bits (742), Expect = 7e-76,   Method: Compositional matrix adjust.
 Identities = 152/311 (48%), Positives = 189/311 (60%), Gaps = 9/311 (2%)

Query: 37  ERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEE 96
           E  + W+    R Y    E E RF ++ +N+ ++  +N  A +  + L +  +AD + +E
Sbjct: 38  EAFDFWVQTLKRAYASAEEYERRFDVWLDNLRFVHEYN--AGHTSHWLSMGVYADLSQDE 95

Query: 97  FRAPRNGYKRRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQCGCCWAFS 156
           +R+   GY   L   R        F YE    P  +DW  KGAVT VK+Q  CG CWAFS
Sbjct: 96  YRSKALGYNADLHEERPLRAA--PFLYEGTVPPKEVDWVAKGAVTPVKNQLLCGSCWAFS 153

Query: 157 AVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKY 216
              A+EG + I T KL SLSEQ LVDCD    D GC GGLMD AFEFI+ N G+ TE  Y
Sbjct: 154 TTGAVEGASAIATGKLASLSEQMLVDCDRE-RDNGCHGGLMDFAFEFIMKNGGIDTEDDY 212

Query: 217 PYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSS 276
           PY A +G C   +       I  Y+DVP N+E ALMKAVANQPVSVAI+A    FQ Y  
Sbjct: 213 PYTAEEGMCQDNKMRRHVVTIDDYQDVPPNDEHALMKAVANQPVSVAIEADQRAFQLYGG 272

Query: 277 GVFTGQCGTELDHGVTAVGYGTADDGTK---YWLVKNSWGTTWGENGYIRMQRDIDAKEG 333
           GVF  +CGT LDHGV  VGYGTA +GT    YWLVKNSWG  WG+ GYIR+ R++  +EG
Sbjct: 273 GVFDAECGTALDHGVLVVGYGTASNGTHHLPYWLVKNSWGAEWGDKGYIRLLRNL-GEEG 331

Query: 334 LCGIAMQASYP 344
            CG+AMQAS+P
Sbjct: 332 QCGVAMQASFP 342


>gi|357124027|ref|XP_003563708.1| PREDICTED: germination-specific cysteine protease 1-like
           [Brachypodium distachyon]
          Length = 334

 Score =  290 bits (742), Expect = 7e-76,   Method: Compositional matrix adjust.
 Identities = 150/329 (45%), Positives = 210/329 (63%), Gaps = 19/329 (5%)

Query: 31  NDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNK----ARNKPYKLGI 86
           +D  M ER+E WMA+ GR Y+D+ EK  RF++FK N  +I S N       +++P KL  
Sbjct: 12  DDKAMRERYEKWMAEQGRTYKDSTEKARRFEVFKSNAHFIDSHNAATGPGGKSRP-KLTT 70

Query: 87  NEFADQTNEEFRAPRNGY--KRRLPSVRSSETTDVSFRYENAS---VPASIDWRKKGAVT 141
           N+FAD T +EFR   N Y    R+    +S  TD  F++   S   VP SIDWR +GAVT
Sbjct: 71  NKFADLTEDEFR---NIYVTGHRVNYRPTSLVTDTVFKFGAVSLSDVPPSIDWRARGAVT 127

Query: 142 GVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAF 201
            VKDQ  C CCWAFS+ AA+EGI+ ITT    SLS Q+LVDC  +  ++ C+ G +D A+
Sbjct: 128 SVKDQHLCACCWAFSSAAAVEGIHQITTGNQVSLSVQQLVDCSNAANEK-CKAGEIDKAY 186

Query: 202 EFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVS 261
           E+I  + GL  +  YPY+   G+C +     + A+ISG++ VP+ NE AL+ AVA+QPVS
Sbjct: 187 EYIARSGGLVADQDYPYEGHSGTC-RVYGKQAVARISGFQYVPARNETALLLAVAHQPVS 245

Query: 262 VAIDASGSDFQFYSSGVFTGQ---CGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGE 318
           VA+D      Q   +G+F      C T L+H +T VGYGT + GT+YWL+KNSWG+ WG+
Sbjct: 246 VALDGLSRALQHIGTGIFGSAGEPCTTNLNHAMTIVGYGTDEHGTRYWLMKNSWGSDWGD 305

Query: 319 NGYIRMQRDIDAK-EGLCGIAMQASYPTA 346
            GY++  RD+ ++  G+CG+A++ASYP A
Sbjct: 306 KGYVKFARDVASEINGVCGLALEASYPVA 334


>gi|307111936|gb|EFN60170.1| hypothetical protein CHLNCDRAFT_59551 [Chlorella variabilis]
          Length = 364

 Score =  289 bits (740), Expect = 1e-75,   Method: Compositional matrix adjust.
 Identities = 153/303 (50%), Positives = 186/303 (61%), Gaps = 11/303 (3%)

Query: 48  RVYRDNAE-KEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKR 106
           R Y  +AE  E RF I+ +N+ +   +N  AR+  + L +  +AD + +E+R+   GY  
Sbjct: 59  RAYASSAEVYERRFNIWLDNLRFAHEYN--ARHTSHWLSMGVYADLSQDEYRSKALGYNA 116

Query: 107 RLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINH 166
            L   R        F Y+    P  +DW   GAVT VKDQ  CG CWAFS   A+EG N 
Sbjct: 117 HLHKKRPLRAA--PFLYKGTVPPEEVDWVAGGAVTPVKDQLLCGSCWAFSTTGAVEGANA 174

Query: 167 ITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCN 226
           I T KL SLSEQ LVDCD    D GC GG MD AF+FI++N G+ TE  YPY+A DG C 
Sbjct: 175 IATGKLVSLSEQMLVDCDRE-YDTGCRGGFMDSAFDFIVNNGGIDTEDDYPYRAEDGICQ 233

Query: 227 KKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTE 286
                     I GY+DVP N+E ALMKAVA+QPVSVAI+A    FQ Y  GVF  +CGT 
Sbjct: 234 DNRTRRHVVTIDGYQDVPPNDENALMKAVAHQPVSVAIEADQLAFQLYGGGVFDAECGTA 293

Query: 287 LDHGVTAVGYGTADDGTK---YWLVKNSWGTTWGENGYIRMQRDI--DAKEGLCGIAMQA 341
           LDH V  VGYGTA +GT    YWLVKNSWG  WGE GYIR+ R++  DA EG CG+AM A
Sbjct: 294 LDHAVLVVGYGTASNGTHNLPYWLVKNSWGAEWGEKGYIRLLRNLGKDAPEGQCGLAMYA 353

Query: 342 SYP 344
           S+P
Sbjct: 354 SFP 356


>gi|53791858|dbj|BAD53944.1| putative cysteine protease [Oryza sativa Japonica Group]
          Length = 335

 Score =  289 bits (740), Expect = 1e-75,   Method: Compositional matrix adjust.
 Identities = 158/347 (45%), Positives = 200/347 (57%), Gaps = 15/347 (4%)

Query: 1   MAMILLENKLVLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRF 60
           M  I+L   LV   + +  + A   ++   +D    +  E WMA++G+ Y+ + EKE RF
Sbjct: 1   MTSIVL---LVCTLMALQAMAASAYYNNGSDDGVTMQMFEEWMAKFGKTYKCHGEKEHRF 57

Query: 61  KIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVS 120
            IF++NV +I  +  +       +GIN+FAD TN+EF A   G K   P         + 
Sbjct: 58  GIFRDNVHFIRGYKPQVTYDS-AVGINQFADLTNDEFVATYTGAKPPHPKEAPRPVDPIW 116

Query: 121 FRYENASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQEL 180
                   P  IDWR +GAVTGVKDQG CG CWAF+AVAA+EG+  I T +LT LSEQEL
Sbjct: 117 -------TPCCIDWRFRGAVTGVKDQGACGSCWAFAAVAAIEGLTKIRTGQLTPLSEQEL 169

Query: 181 VDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEA-NPSAAKISG 239
           VDCDT+    GC GG  D AFE + S  G+  E+ Y Y+   G C   +     AA I G
Sbjct: 170 VDCDTN--SNGCGGGHTDRAFELVASKGGITAESDYRYEGFQGKCRVDDMLFNHAASIGG 227

Query: 240 YEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGT- 298
           Y  VP N+E  L  AVA QPV+V IDASG  FQFY SGVF G CG   +H VT VGY   
Sbjct: 228 YRAVPPNDERQLATAVARQPVTVYIDASGPAFQFYKSGVFPGPCGASSNHAVTLVGYCQD 287

Query: 299 ADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPT 345
              G KYWL KNSWG TWG+ GYI +++DI    G CG+A+   YPT
Sbjct: 288 GASGKKYWLAKNSWGKTWGQQGYILLEKDIVQPHGTCGLAVSPFYPT 334


>gi|15290195|dbj|BAB63884.1| putative cysteine protease [Oryza sativa Japonica Group]
 gi|125525813|gb|EAY73927.1| hypothetical protein OsI_01811 [Oryza sativa Indica Group]
          Length = 342

 Score =  289 bits (739), Expect = 1e-75,   Method: Compositional matrix adjust.
 Identities = 157/347 (45%), Positives = 201/347 (57%), Gaps = 14/347 (4%)

Query: 1   MAMILLENKLVLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRF 60
           MA  +L     L A+  +G  A   ++   +D    +  E WMA++G+ Y+ + EKE RF
Sbjct: 7   MASAVLLVVCTLMALQAMG--ADAYYNNGSDDGVTMQMFEEWMAKFGKTYKCHGEKEHRF 64

Query: 61  KIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVS 120
            IF++NV +I  +  +       +GIN+FAD TN+EF A   G K   P         + 
Sbjct: 65  GIFRDNVHFIRGYKPQVTYDS-AVGINQFADLTNDEFVATYTGAKPPHPKEAPRPVDPIW 123

Query: 121 FRYENASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQEL 180
                   P  IDWR +GAVTGVKDQG CG CWAF+AVAA+EG+  I T +LT LSEQEL
Sbjct: 124 -------TPCCIDWRFRGAVTGVKDQGACGSCWAFAAVAAIEGLTKIRTGQLTPLSEQEL 176

Query: 181 VDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEA-NPSAAKISG 239
           VDCDT+    GC GG  D AFE + S  G+  E+ Y Y+   G C   +     AA+I G
Sbjct: 177 VDCDTN--SNGCGGGHTDRAFELVASKGGITAESDYRYEGFQGKCRVDDMLFNHAARIGG 234

Query: 240 YEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGT- 298
           Y  VP N+E  L  AVA QPV+V IDASG  FQFY SGVF G CG   +H VT VGY   
Sbjct: 235 YRAVPPNDERQLATAVARQPVTVYIDASGPAFQFYKSGVFPGPCGASSNHAVTLVGYCQD 294

Query: 299 ADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPT 345
              G KYW+ KNSWG TWG+ GYI +++D+    G CG+A+   YPT
Sbjct: 295 GASGKKYWVAKNSWGKTWGQQGYILLEKDVLQPHGTCGLAVSPFYPT 341


>gi|313118772|gb|ADR32298.1| C14 cysteine protease [Solanum demissum]
          Length = 217

 Score =  289 bits (739), Expect = 2e-75,   Method: Compositional matrix adjust.
 Identities = 135/217 (62%), Positives = 162/217 (74%), Gaps = 2/217 (0%)

Query: 129 PASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGE 188
           P S+DWR KG + GVKDQG CG CWAFSAVAAME IN I T  L SLSEQELVDCD S  
Sbjct: 2   PVSVDWRDKGVLVGVKDQGSCGSCWAFSAVAAMESINAIVTGDLISLSEQELVDCDKS-Y 60

Query: 189 DQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNE 248
           +QGC+GGLMD AFEF+I+N G+ TE  YPYK  +  C++   N    KI  YEDVP NNE
Sbjct: 61  NQGCDGGLMDYAFEFVINNGGIDTEEDYPYKERNDVCDQYRKNAKVVKIDSYEDVPVNNE 120

Query: 249 AALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLV 308
            AL KAVA+QPVS+A++A G DFQ Y SG+FTG+CGT +DHGV A GYGT ++G  YW+V
Sbjct: 121 KALQKAVAHQPVSIALEAGGRDFQHYKSGIFTGKCGTAVDHGVVAAGYGT-ENGMDYWIV 179

Query: 309 KNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPT 345
           +NSWG  WGE GY+R+QR+I +  GLCG+A + SYP 
Sbjct: 180 RNSWGAKWGEKGYLRVQRNIASSSGLCGLATEPSYPV 216


>gi|424513619|emb|CCO66241.1| predicted protein [Bathycoccus prasinos]
          Length = 396

 Score =  288 bits (738), Expect = 2e-75,   Method: Compositional matrix adjust.
 Identities = 151/332 (45%), Positives = 201/332 (60%), Gaps = 16/332 (4%)

Query: 28  RTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNK--ARNKPYKLG 85
           R L ++ + +  + W+ +Y +   +  E+  R KIF EN  ++   N K  A    + + 
Sbjct: 61  RVLRESKIEDAFDAWLVKYDKEIANAEERLKRLKIFGENYLFVLEHNAKYVAGKVSHYVE 120

Query: 86  INEFADQTNEEFRAPRNGYKRRLPSVRSS--ETTDVSF-RYENASVPASIDWRKKGAVTG 142
           +N+FA  T EE+R    G+K+ L   + S     DVS   YE    P SIDW  +G +T 
Sbjct: 121 MNKFAAHTREEYRKML-GFKKSLRRKKDSGEAAKDVSLWEYEGVEAPESIDWVDEGVITT 179

Query: 143 VKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFE 202
            K+QG CG CWAFSA+ A+EGIN I T KL SLSEQELV C   G +QGC GGLMD+AFE
Sbjct: 180 PKNQGSCGSCWAFSAIGAVEGINAIRTGKLVSLSEQELVSCAREGGNQGCNGGLMDNAFE 239

Query: 203 FIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSV 262
           +I+ N G+ +E +Y YKAS   C  ++     A I G+ DVPSN+E AL KAV+ QPVSV
Sbjct: 240 WIVENGGVDSEKQYQYKASFDDCKTRKTLLHIASIDGFNDVPSNDETALKKAVSQQPVSV 299

Query: 263 AIDASGSDFQFYSSGVFTGQ-CGTELDHGVTAVGYGTADDGT---------KYWLVKNSW 312
           AI+A    FQ Y  GV+  + CGT+LDHGV  VGYG   + +         KYW +KNSW
Sbjct: 300 AIEADQRSFQLYGGGVYHAEDCGTQLDHGVLVVGYGIDHNSSNVIIPGATKKYWKIKNSW 359

Query: 313 GTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
              WGE GYIR+ RD+++  G+CG+A  ASYP
Sbjct: 360 SEQWGEGGYIRIARDVESPSGMCGVAEMASYP 391


>gi|313118764|gb|ADR32294.1| C14 cysteine protease [Solanum stoloniferum]
          Length = 217

 Score =  288 bits (738), Expect = 2e-75,   Method: Compositional matrix adjust.
 Identities = 133/217 (61%), Positives = 163/217 (75%), Gaps = 2/217 (0%)

Query: 129 PASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGE 188
           P S+DWR KG + GVKDQG CG CWAFSAVAAME IN I T  L SLSEQELVDCD S  
Sbjct: 2   PVSVDWRDKGVLVGVKDQGSCGSCWAFSAVAAMESINAIVTGNLISLSEQELVDCDKS-Y 60

Query: 189 DQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNE 248
           +QGC+GGLMD AFEF+I+N G+ +E  YPYK  +G C++   N     I  YEDVP NNE
Sbjct: 61  NQGCDGGLMDYAFEFVINNGGIDSEEDYPYKERNGVCDQYRKNAKVVVIDSYEDVPVNNE 120

Query: 249 AALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLV 308
            AL KAVA+QPVS+A++A G DFQ Y SG+FTG+CGT +DHGV A GYGT ++G  YW+V
Sbjct: 121 KALQKAVAHQPVSIALEAGGRDFQHYKSGIFTGKCGTAVDHGVVAAGYGT-ENGLDYWIV 179

Query: 309 KNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPT 345
           +NSWG  WGE GY+R+QR++ +  GLCG+A++ SYP 
Sbjct: 180 RNSWGADWGEKGYLRVQRNVASSSGLCGLAIEPSYPV 216


>gi|4731372|gb|AAD28476.1|AF133838_1 papain-like cysteine protease [Sandersonia aurantiaca]
          Length = 370

 Score =  288 bits (736), Expect = 3e-75,   Method: Compositional matrix adjust.
 Identities = 140/243 (57%), Positives = 171/243 (70%), Gaps = 8/243 (3%)

Query: 103 GYKRRLPSVRSSETTDVSFRYENA-SVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAM 161
           G  RR P + S       +RY    ++P S+DWR+KGAV  +KDQG CG CWAFS +A++
Sbjct: 20  GAGRRTPGLASDR-----YRYRAGDALPDSVDWREKGAVVPIKDQGGCGSCWAFSTIASV 74

Query: 162 EGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKAS 221
           EGIN I T  L SLSEQELVDCD +  D GC GGLMD AF+FII N G+ TE  YPY   
Sbjct: 75  EGINKIVTGDLISLSEQELVDCDKTYND-GCNGGLMDYAFQFIIDNGGIDTEKDYPYTEQ 133

Query: 222 DGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTG 281
           DG C+    N     I+ YEDVP N+E AL KA A+QP++VAID  G  FQ Y+SG+FTG
Sbjct: 134 DGRCDSYRKNAKVVSINSYEDVPVNDEQALKKAAASQPIAVAIDGGGRSFQLYNSGIFTG 193

Query: 282 QCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQA 341
           +CGT LDHGVT VGYG+ + G  YW+V+NSWG +WGE GYIRM R+ID+  G+CGIAM+A
Sbjct: 194 KCGTSLDHGVTVVGYGS-ESGKDYWIVRNSWGESWGEKGYIRMARNIDSPSGICGIAMEA 252

Query: 342 SYP 344
           SYP
Sbjct: 253 SYP 255


>gi|125525815|gb|EAY73929.1| hypothetical protein OsI_01813 [Oryza sativa Indica Group]
          Length = 336

 Score =  288 bits (736), Expect = 4e-75,   Method: Compositional matrix adjust.
 Identities = 154/339 (45%), Positives = 198/339 (58%), Gaps = 13/339 (3%)

Query: 10  LVLAAILVLGVWAPQSW-SRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVE 68
           LV+  ++ L   A  ++ +   +D    +  E WMA++G+ Y+ + EKE RF IF++NV 
Sbjct: 7   LVVCTLMALQAMAASAYYNNGSDDGVTMQMFEEWMAKFGKTYKCHGEKEHRFGIFRDNVH 66

Query: 69  YIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENASV 128
           +I  +  +       +GIN+FAD TN+EF A   G K   P         +         
Sbjct: 67  FIRGYKPQVTYDS-AVGINQFADLTNDEFVATYTGAKPPHPKEAPRPVDPIW-------T 118

Query: 129 PASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGE 188
           P  IDWR +GAVTGVKDQG CG CWAF+AVAA+EG+  I T +LT LSEQELVDCDT+  
Sbjct: 119 PCCIDWRFRGAVTGVKDQGACGSCWAFAAVAAIEGLTKIRTGQLTPLSEQELVDCDTN-- 176

Query: 189 DQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEA-NPSAAKISGYEDVPSNN 247
             GC GG  D AFE + S  G+  E+ Y Y+   G C   +     AA I GY  VP N+
Sbjct: 177 SNGCGGGHTDRAFELVASKGGITAESDYRYEGFQGKCRVDDMLFNHAASIGGYRAVPPND 236

Query: 248 EAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGT-ADDGTKYW 306
           E  L  AVA QPV+V IDASG  FQFY SGVF G CG   +H VT VGY      G KYW
Sbjct: 237 ERQLATAVARQPVTVYIDASGPAFQFYKSGVFPGPCGASSNHAVTLVGYCQDGASGKKYW 296

Query: 307 LVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPT 345
           + KNSWG TWG+ GYI +++D+    G CG+A+   YPT
Sbjct: 297 VAKNSWGKTWGQQGYILLEKDVLQPHGTCGLAVSPFYPT 335


>gi|386648114|gb|AFJ15104.1| mexicain-like cystein protease, partial [Jacaratia mexicana]
          Length = 323

 Score =  288 bits (736), Expect = 4e-75,   Method: Compositional matrix adjust.
 Identities = 150/320 (46%), Positives = 202/320 (63%), Gaps = 16/320 (5%)

Query: 31  NDATMNER----HEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGI 86
           +D T  ER     E W  +  ++Y++  EK  RF+IFK+N+ YI   N K  N  Y LG+
Sbjct: 10  DDLTSIERLVRLFESWTLENDKIYKNIDEKIYRFEIFKDNLMYIDETNKK--NSSYWLGL 67

Query: 87  NEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENA-SVPASIDWRKKGAVTGVKD 145
           NEFAD T++EF+A   G      ++   ++ D  F Y++    P SIDWR+KGAVT VK+
Sbjct: 68  NEFADLTHDEFKAKYVGSLGEDSTI-IEQSDDEEFPYKHVVDYPESIDWRQKGAVTPVKN 126

Query: 146 QGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFII 205
           Q  CG CWAFS VA +EGIN I T KL SLSEQEL+DCD      GC+GG    + +++ 
Sbjct: 127 QNPCGSCWAFSTVATVEGINKIVTGKLISLSEQELLDCDR--RSHGCKGGYQTTSLQYVA 184

Query: 206 SNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAID 265
            N G+ TE +YPY+   G C  K+   S  KI+GY+ VP+NNE +L++A+ANQPVSV ++
Sbjct: 185 DN-GVHTEKEYPYEKKQGKCRAKDKKGSKVKITGYKRVPANNEVSLIQAIANQPVSVVVE 243

Query: 266 ASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQ 325
           + G  FQFY  G+F G CGT++DH VTAVGY     G  Y L+KNSWG  WGE GYIR++
Sbjct: 244 SKGRAFQFYKGGIFEGPCGTKVDHAVTAVGY-----GKNYILIKNSWGPKWGEKGYIRIK 298

Query: 326 RDIDAKEGLCGIAMQASYPT 345
           R     +G CG+   + +PT
Sbjct: 299 RASGKSKGTCGVYSSSYFPT 318


>gi|313118768|gb|ADR32296.1| C14 cysteine protease [Solanum demissum]
 gi|313118770|gb|ADR32297.1| C14 cysteine protease [Solanum demissum]
          Length = 217

 Score =  287 bits (735), Expect = 4e-75,   Method: Compositional matrix adjust.
 Identities = 132/217 (60%), Positives = 162/217 (74%), Gaps = 2/217 (0%)

Query: 129 PASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGE 188
           P S+DWR KG + GVKDQG CG CWAFSAVAAME IN I T  L SLSEQELVDCD S  
Sbjct: 2   PVSVDWRDKGVLVGVKDQGSCGSCWAFSAVAAMESINAIVTGNLISLSEQELVDCDKS-Y 60

Query: 189 DQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNE 248
           ++GC+GGLMD AFEF+I+N G+ TE  YPYK  +G C++   N     I  YEDVP NNE
Sbjct: 61  NEGCDGGLMDYAFEFVINNGGIDTEEDYPYKERNGVCDQYRKNAKVVTIDSYEDVPVNNE 120

Query: 249 AALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLV 308
            AL KAVA+QPVS+A++A G DFQ Y SG+FTG+CGT +DHGV   GYGT ++G  YW+V
Sbjct: 121 KALQKAVAHQPVSIALEAGGRDFQHYKSGIFTGKCGTAVDHGVVVAGYGT-ENGMDYWIV 179

Query: 309 KNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPT 345
           +NSWG  WGE GY+R+QR++ +  GLCG+A++ SYP 
Sbjct: 180 RNSWGAKWGEKGYLRVQRNVASSSGLCGLAIEPSYPV 216


>gi|125570286|gb|EAZ11801.1| hypothetical protein OsJ_01675 [Oryza sativa Japonica Group]
          Length = 319

 Score =  287 bits (735), Expect = 4e-75,   Method: Compositional matrix adjust.
 Identities = 152/317 (47%), Positives = 188/317 (59%), Gaps = 12/317 (3%)

Query: 31  NDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFA 90
           +D    +  E WMA++G+ Y+ + EKE RF IF++NV +I  +  +       +GIN+FA
Sbjct: 12  DDGVTMQMFEEWMAKFGKTYKCHGEKEHRFGIFRDNVHFIRGYKPQVTYDS-AVGINQFA 70

Query: 91  DQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQCG 150
           D TN+EF A   G K   P         +         P  IDWR +GAVTGVKDQG CG
Sbjct: 71  DLTNDEFVATYTGAKPPHPKEAPRPVDPIW-------TPCCIDWRFRGAVTGVKDQGACG 123

Query: 151 CCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGL 210
            CWAF+AVAA+EG+  I T +LT LSEQELVDCDT+    GC GG  D AFE + S  G+
Sbjct: 124 SCWAFAAVAAIEGLTKIRTGQLTPLSEQELVDCDTN--SNGCGGGHTDRAFELVASKGGI 181

Query: 211 ATEAKYPYKASDGSCNKKEA-NPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGS 269
             E+ Y Y+   G C   +     AA I GY  VP N+E  L  AVA QPV+V IDASG 
Sbjct: 182 TAESDYRYEGFQGKCRVDDMLFNHAASIGGYRAVPPNDERQLATAVARQPVTVYIDASGP 241

Query: 270 DFQFYSSGVFTGQCGTELDHGVTAVGYGT-ADDGTKYWLVKNSWGTTWGENGYIRMQRDI 328
            FQFY SGVF G CG   +H VT VGY      G KYWL KNSWG TWG+ GYI +++DI
Sbjct: 242 AFQFYKSGVFPGPCGASSNHAVTLVGYCQDGASGKKYWLAKNSWGKTWGQQGYILLEKDI 301

Query: 329 DAKEGLCGIAMQASYPT 345
               G CG+A+   YPT
Sbjct: 302 VQPHGTCGLAVSPFYPT 318


>gi|156124996|gb|ABU50816.1| Ale o 1 allergen [Aleuroglyphus ovatus]
          Length = 337

 Score =  287 bits (734), Expect = 5e-75,   Method: Compositional matrix adjust.
 Identities = 159/325 (48%), Positives = 206/325 (63%), Gaps = 13/325 (4%)

Query: 27  SRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARN--KPYKL 84
           S  L +  +  + E + + +GRVY     +  R  IF+ N+++I   N    N    + +
Sbjct: 21  SMLLTEGELEAQFEQFKSTFGRVYPSPEIELHRKSIFRANLQFILRHNIDYFNGDSTFSV 80

Query: 85  GINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVK 144
            +N F D +NEEFRA  NGY RRL +V  +++       E  ++PA++DW  KG VT +K
Sbjct: 81  SVNNFTDLSNEEFRATFNGY-RRLAAVSLADSVHADNDVE--ALPATVDWTTKGVVTPIK 137

Query: 145 DQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFI 204
           +Q QCG CWAFSAVA+MEG + + T KL SLSEQ LVDC  +  D GC GG MD AF+++
Sbjct: 138 NQQQCGSCWAFSAVASMEGQHALKTGKLVSLSEQNLVDCSAAEGDMGCSGGWMDYAFKYV 197

Query: 205 ISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVAN-QPVSVA 263
           I N+G+ TEA YPYKA D SC  K  N   A I  + DV + +E+AL  AVA+  P+SVA
Sbjct: 198 IQNRGIDTEASYPYKAIDESCEFKR-NSIGATIHSFVDVKTGDESALQNAVASIGPISVA 256

Query: 264 IDASGSDFQFYSSGVFT-GQCGTE-LDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGY 321
           IDAS   FQFYSSGV+    C TE LDHGVTAVGYGT  +G  YW VKNSWGT+WG+ GY
Sbjct: 257 IDASQPSFQFYSSGVYNEPDCSTEILDHGVTAVGYGTL-NGVPYWKVKNSWGTSWGQKGY 315

Query: 322 IRMQRDIDAKEGLCGIAMQASYPTA 346
           I M R+   K+  CGIA +ASYP  
Sbjct: 316 IFMSRN---KQNQCGIATKASYPVV 337


>gi|242088413|ref|XP_002440039.1| hypothetical protein SORBIDRAFT_09g024940 [Sorghum bicolor]
 gi|241945324|gb|EES18469.1| hypothetical protein SORBIDRAFT_09g024940 [Sorghum bicolor]
          Length = 463

 Score =  287 bits (734), Expect = 6e-75,   Method: Compositional matrix adjust.
 Identities = 147/317 (46%), Positives = 194/317 (61%), Gaps = 18/317 (5%)

Query: 40  EMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKAR-------NKPYKLGINEFADQ 92
           + W A++G+ Y    E+  R  +F +N  ++A+ N +            Y L +N FAD 
Sbjct: 42  DAWCAEHGKAYATPEERAARLAVFADNAAFVAAHNARVNAAGGGGAPPSYTLALNAFADL 101

Query: 93  TNEEFRAPRNGYKRRLPSVRSSETTDVSFRYEN-----ASVPASIDWRKKGAVTGVKDQG 147
           T+EEFRA R G   R+ +  ++  +  +  Y        +VP ++DWR+ GAVT VKDQG
Sbjct: 102 THEEFRAARLG---RIAAGAAALRSPAAPVYRGLDGGLGAVPDALDWRENGAVTKVKDQG 158

Query: 148 QCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISN 207
            CG CW+FSA  AMEGIN I T  L SLSEQEL+DCD S  + GC GGLMD A++F++ N
Sbjct: 159 SCGACWSFSATGAMEGINKIKTGSLVSLSEQELIDCDRS-YNSGCGGGLMDYAYKFVVKN 217

Query: 208 KGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDAS 267
            G+ TE  YPY+ +DG+CNK +       I GY DVPSN E  L++AVA QPVSV I  S
Sbjct: 218 GGIDTEEDYPYREADGTCNKNKLKKRIVTIDGYSDVPSNKEDLLLQAVAQQPVSVGICGS 277

Query: 268 GSDFQFYS-SGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQR 326
              FQ YS  G+F G C T LDH V  VGYG+ + G  YW+VKNSWG +WG  GY+ M R
Sbjct: 278 ARAFQLYSQQGIFDGPCPTSLDHAVLIVGYGS-EGGKDYWIVKNSWGESWGMKGYMHMHR 336

Query: 327 DIDAKEGLCGIAMQASY 343
           +    +G+CGI M AS+
Sbjct: 337 NTGDSKGVCGINMMASF 353


>gi|27728675|gb|AAO18731.1| cysteine protease [Gossypium hirsutum]
          Length = 389

 Score =  287 bits (734), Expect = 7e-75,   Method: Compositional matrix adjust.
 Identities = 145/321 (45%), Positives = 204/321 (63%), Gaps = 9/321 (2%)

Query: 30  LNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFN--NKARNKPYKLGIN 87
           L++  + E  + W  ++ +VYR   E E RF+ FK N++YI   N   KA    + +G+N
Sbjct: 40  LSEERVLEIFQQWKEKHRKVYRHAEEAEKRFENFKGNLKYILERNAKRKANKWEHHVGLN 99

Query: 88  EFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQG 147
           +FAD +NEEFR       ++  +   + + ++  + ++   P+S+DWR  G VT VKDQG
Sbjct: 100 KFADMSNEEFRKAYLSKVKKPINKGITLSRNMRRKVQSCDAPSSLDWRNYGVVTAVKDQG 159

Query: 148 QCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISN 207
            CG CWAFS+  AMEGIN + T  L SLSEQELV+CDTS  + GCEGG MD AFE++I+N
Sbjct: 160 SCGSCWAFSSTGAMEGINALVTGDLISLSEQELVECDTS--NYGCEGGYMDYAFEWVINN 217

Query: 208 KGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDAS 267
            G+ +E+ YPY   DG+CN  +       I GY+DV   +++AL+ AVA QPVSV ID S
Sbjct: 218 GGIDSESDYPYTGVDGTCNTTKEETKVVSIDGYQDV-EQSDSALLCAVAQQPVSVGIDGS 276

Query: 268 GSDFQFYSSGVFTGQCG---TELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRM 324
             DFQ Y+ G++ G C     ++DH V  VGYG+ +D  +YW+VKNSWGT+WG +GY  +
Sbjct: 277 AIDFQLYTGGIYDGSCSDDPDDIDHAVLIVGYGS-EDSEEYWIVKNSWGTSWGIDGYFYL 335

Query: 325 QRDIDAKEGLCGIAMQASYPT 345
           +RD D   G+C +   ASYPT
Sbjct: 336 KRDTDLPYGVCAVNAMASYPT 356


>gi|313118760|gb|ADR32292.1| C14 cysteine protease [Solanum stoloniferum]
          Length = 217

 Score =  286 bits (733), Expect = 7e-75,   Method: Compositional matrix adjust.
 Identities = 133/217 (61%), Positives = 162/217 (74%), Gaps = 2/217 (0%)

Query: 129 PASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGE 188
           P S+DWR KG + GVKDQG CG CWAFSAVAAME IN I T  L SLSEQELVDCD S  
Sbjct: 2   PVSVDWRDKGVLVGVKDQGSCGSCWAFSAVAAMESINAIVTGNLISLSEQELVDCDKS-Y 60

Query: 189 DQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNE 248
           ++GC+GGLMD AFEF+I+N G+ +E  YPYK  +  C++   N    KI  YEDVP NNE
Sbjct: 61  NEGCDGGLMDYAFEFVINNGGIDSEEDYPYKERNDVCDQYRKNAKVVKIDSYEDVPVNNE 120

Query: 249 AALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLV 308
            AL KAVA+QPVS+A++A G DFQ Y SG+FTG+CGT +DHGV A GYGT ++G  YW+V
Sbjct: 121 KALQKAVAHQPVSIALEAGGRDFQHYKSGIFTGKCGTAVDHGVVAAGYGT-ENGMDYWIV 179

Query: 309 KNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPT 345
           +NSWG  WGE GY+R+QR+I +  GLCG+A + SYP 
Sbjct: 180 RNSWGANWGEKGYLRVQRNIASSSGLCGLATEPSYPV 216


>gi|194701748|gb|ACF84958.1| unknown [Zea mays]
 gi|414589103|tpg|DAA39674.1| TPA: thiol protease SEN102 [Zea mays]
          Length = 374

 Score =  286 bits (733), Expect = 7e-75,   Method: Compositional matrix adjust.
 Identities = 156/337 (46%), Positives = 208/337 (61%), Gaps = 21/337 (6%)

Query: 27  SRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNK--PYKL 84
           S + +D++M ER + W A Y + Y   AE+  RF++   N+ YI + N +A      Y+L
Sbjct: 38  SMSTDDSSMIERFQRWKAAYNKSYATVAEERRRFRVCARNMAYIEATNAEAEAAGLTYEL 97

Query: 85  GINEFADQTNEEFRAPRNG-YKRRLPSVRSSETTDVS--------------FRYENASVP 129
           G   + D TN+EF A        +LP+  S  TT                 +   + S P
Sbjct: 98  GETAYTDLTNQEFMAMYTAPAPAQLPADESVITTRAGPVDAVGGAPGQLPVYVNLSTSAP 157

Query: 130 ASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGED 189
           AS+DWR  GAVT VK+QG+CG CWAFS VA +EGI  I T KL SLSEQELVDCDT   D
Sbjct: 158 ASVDWRASGAVTPVKNQGRCGSCWAFSTVAVVEGIYQIRTGKLVSLSEQELVDCDT--LD 215

Query: 190 QGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEA 249
            GC+GG+   A  +I SN G+ TE  YPY  +  +CN+ + + +A  I+G   V + +EA
Sbjct: 216 DGCDGGISYRALRWIASNGGITTETDYPYTGTTDACNRAKLSHNAVSIAGLRRVATRSEA 275

Query: 250 ALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGT-ADDGTKYWLV 308
           +L  AVA QPV+V+I+A G +FQ Y  GV+ G CGT L+HGVT VGYG  A  G +YW+V
Sbjct: 276 SLANAVAGQPVAVSIEAGGDNFQHYKKGVYNGPCGTNLNHGVTVVGYGQEAAGGDRYWIV 335

Query: 309 KNSWGTTWGENGYIRMQRDIDAK-EGLCGIAMQASYP 344
           KNSWG  WG++GYIRM++D+  K EGLCGIA++ SYP
Sbjct: 336 KNSWGQGWGDDGYIRMKKDVAGKPEGLCGIAIRPSYP 372


>gi|357133074|ref|XP_003568153.1| PREDICTED: cysteine proteinase RD21a-like [Brachypodium distachyon]
          Length = 565

 Score =  286 bits (733), Expect = 8e-75,   Method: Compositional matrix adjust.
 Identities = 148/317 (46%), Positives = 187/317 (58%), Gaps = 17/317 (5%)

Query: 40  EMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARN-------KPYKLGINEFADQ 92
           E W A++G+ Y    E+  R   F +N  ++A+ N              Y L +N FAD 
Sbjct: 43  EAWCAEHGKAYASPGERAARLAAFADNAAFVAAHNAGGGGAGGSNAAPSYTLALNAFADL 102

Query: 93  TNEEFRAPRNGY----KRRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQ 148
           T+ EFRA R G       R P         V       +VP ++DWR+ GAVT VKDQG 
Sbjct: 103 THAEFRAARLGRLAVGGARAPPSEGGFAGSVGV----GAVPEALDWRQSGAVTKVKDQGS 158

Query: 149 CGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNK 208
           CG CW+FSA  A+EGIN I T  L SLSEQEL+DCD S  + GC GGLMD A+ F+I N 
Sbjct: 159 CGACWSFSATGAIEGINKIKTGSLISLSEQELIDCDRS-YNAGCGGGLMDYAYRFVIKNG 217

Query: 209 GLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASG 268
           G+ TE  YPY+ +DG+CNK +       I GY DVP+N E +L++AVA QP+SV I  S 
Sbjct: 218 GIDTEDDYPYREADGTCNKNKLKRHVVTIDGYSDVPANKEDSLLQAVAQQPISVGICGSA 277

Query: 269 SDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDI 328
             FQ YS G+F G C T LDH V  VGYG+ + G  YW+VKNSWG  WG  GY+ M R+ 
Sbjct: 278 RAFQLYSQGIFDGPCPTSLDHAVLIVGYGS-EGGKDYWIVKNSWGERWGMKGYMHMHRNT 336

Query: 329 DAKEGLCGIAMQASYPT 345
            +  G+CGI M AS+PT
Sbjct: 337 GSSSGICGINMMASFPT 353


>gi|297602258|ref|NP_001052246.2| Os04g0208200 [Oryza sativa Japonica Group]
 gi|255675225|dbj|BAF14160.2| Os04g0208200, partial [Oryza sativa Japonica Group]
          Length = 219

 Score =  286 bits (733), Expect = 8e-75,   Method: Compositional matrix adjust.
 Identities = 138/199 (69%), Positives = 156/199 (78%), Gaps = 2/199 (1%)

Query: 150 GCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKG 209
           GCCWAFSAVAAMEG   + T KL SLSEQ+LV CD  GEDQGCEGGLMDDAF+FII N G
Sbjct: 21  GCCWAFSAVAAMEGAVKLATGKLVSLSEQQLVSCDVKGEDQGCEGGLMDDAFDFIIKNGG 80

Query: 210 LATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGS 269
           LA E+ YPY ASD  C    A  +AA I GYEDVP+N+EAAL+KAVANQPVSVAID    
Sbjct: 81  LAAESDYPYTASDDKCATAGAGAAAATIKGYEDVPANDEAALLKAVANQPVSVAIDGGDR 140

Query: 270 DFQFYSSGVFTGQ--CGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRD 327
            FQFY  GV +G   C TELDH +TAVGYG A DGTKYWL+KNSWGT+WGE+GY+RM+R 
Sbjct: 141 HFQFYKGGVLSGAAGCATELDHAITAVGYGVASDGTKYWLMKNSWGTSWGEDGYVRMERG 200

Query: 328 IDAKEGLCGIAMQASYPTA 346
           +  KEG+CG+AM ASYPTA
Sbjct: 201 VADKEGVCGLAMMASYPTA 219


>gi|313118766|gb|ADR32295.1| C14 cysteine protease [Solanum demissum]
 gi|313118774|gb|ADR32299.1| C14 cysteine protease [Solanum verrucosum]
 gi|313118776|gb|ADR32300.1| C14 cysteine protease [Solanum verrucosum]
 gi|313118778|gb|ADR32301.1| C14 cysteine protease [Solanum verrucosum]
          Length = 217

 Score =  286 bits (732), Expect = 9e-75,   Method: Compositional matrix adjust.
 Identities = 133/217 (61%), Positives = 162/217 (74%), Gaps = 2/217 (0%)

Query: 129 PASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGE 188
           P S+DWR KG + GVKDQG CG CWAFSAVAAME IN I T  L SLSEQELVDCD S  
Sbjct: 2   PVSVDWRDKGVLVGVKDQGSCGSCWAFSAVAAMESINAIVTGNLISLSEQELVDCDKS-Y 60

Query: 189 DQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNE 248
           ++GC+GGLMD AFEF+I+N G+ +E  YPYK  +  C++   N    KI  YEDVP NNE
Sbjct: 61  NEGCDGGLMDYAFEFVINNGGIDSEEDYPYKERNDVCDQYRKNAKVVKIDSYEDVPVNNE 120

Query: 249 AALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLV 308
            AL KAVA+QPVS+A++A G DFQ Y SG+FTG+CGT +DHGV A GYGT ++G  YW+V
Sbjct: 121 KALQKAVAHQPVSIALEAGGRDFQHYKSGIFTGKCGTAVDHGVVAAGYGT-ENGMDYWIV 179

Query: 309 KNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPT 345
           +NSWG  WGE GY+R+QR+I +  GLCG+A + SYP 
Sbjct: 180 RNSWGAKWGEKGYLRVQRNIASSSGLCGLATEPSYPV 216


>gi|302790836|ref|XP_002977185.1| hypothetical protein SELMODRAFT_106228 [Selaginella moellendorffii]
 gi|300155161|gb|EFJ21794.1| hypothetical protein SELMODRAFT_106228 [Selaginella moellendorffii]
          Length = 299

 Score =  286 bits (731), Expect = 1e-74,   Method: Compositional matrix adjust.
 Identities = 142/308 (46%), Positives = 203/308 (65%), Gaps = 14/308 (4%)

Query: 40  EMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRA 99
           E W A++G+ Y  ++EK  R  IF + + YI   N +  N  + LG+N+F+D TN EFRA
Sbjct: 3   EDWAAKHGKSYSSDSEKARRLMIFSDTLAYIEKHNAQP-NTTFTLGLNKFSDLTNAEFRA 61

Query: 100 PRNGYKR--RLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQCGCCWAFSA 157
              G  +  R    R ++  DV      +S+P S+DWR++GAVT +KDQGQCG CWAFSA
Sbjct: 62  NYVGKFKSPRYQDRRPAKDVDVDV----SSLPTSLDWRQEGAVTPIKDQGQCGSCWAFSA 117

Query: 158 VAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYP 217
           +A++E  + + T++L SLSEQ+L+DCDT   DQGC+GG  +DAF+F++ N G+ TE  YP
Sbjct: 118 IASIESAHFLATKELVSLSEQQLIDCDTV--DQGCQGGFPEDAFKFVVENGGVTTEEAYP 175

Query: 218 YKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSG 277
           Y    GSCN  +      +I+GY+DV  ++  ALMKAV+  PV+V I  S  +FQ Y SG
Sbjct: 176 YTGFAGSCNANKNK--VVEITGYKDVTKDSADALMKAVSKTPVTVGICGSDQNFQNYRSG 233

Query: 278 VFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGI 337
           + +GQC    DH V  +GYGT + G  YW++KNSWGT+WGENG++++++     EG+CG+
Sbjct: 234 ILSGQCSNSRDHAVLVIGYGT-EGGMPYWIIKNSWGTSWGENGFMKIKK--KDGEGMCGM 290

Query: 338 AMQASYPT 345
             Q+SYPT
Sbjct: 291 NGQSSYPT 298


>gi|442754503|gb|JAA69411.1| Putative cathepsin l-like cysteine proteinase b [Ixodes ricinus]
          Length = 335

 Score =  286 bits (731), Expect = 1e-74,   Method: Compositional matrix adjust.
 Identities = 156/309 (50%), Positives = 200/309 (64%), Gaps = 11/309 (3%)

Query: 42  WMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNK-ARNK-PYKLGINEFADQTNEEFRA 99
           + A++G+ Y    E+  R KI+ EN   IA  N K AR + PY + +NEF D  + EF +
Sbjct: 30  FKAKHGKSYVSETEEVFRLKIYMENRHKIAKHNEKYARGEVPYSMAMNEFGDMLHHEFVS 89

Query: 100 PRNGYKRRLP-SVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAV 158
            RNG+KR      R   T       E+ S+P ++DWR KGAVT VK+QGQCG CWAFSA 
Sbjct: 90  TRNGFKRNYKDQPREGSTYLEPENIEDFSLPKTVDWRTKGAVTPVKNQGQCGSCWAFSAT 149

Query: 159 AAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPY 218
            ++EG +   +  + SLSEQ LVDC T   + GCEGGLMD+AF++I +NKG+ TE  YPY
Sbjct: 150 GSLEGQHFRKSGSMVSLSEQNLVDCSTDFGNNGCEGGLMDNAFKYIRANKGIDTEKSYPY 209

Query: 219 KASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVAN-QPVSVAIDASGSDFQFYSSG 277
             +DG+C+ K++   A   SG+ D+   +E  L KAVA   P+SVAIDAS   FQFYS G
Sbjct: 210 NGTDGTCHFKKSTVGATD-SGFVDIKEGSETQLKKAVATVGPISVAIDASHESFQFYSDG 268

Query: 278 VF-TGQCGTE-LDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLC 335
           V+   +C +E LDHGV  VGYGT  +GT YWLVKNSWGTTWG+ GYIRM R+   K+  C
Sbjct: 269 VYDEPECDSESLDHGVLVVGYGTL-NGTDYWLVKNSWGTTWGDEGYIRMSRN---KKNQC 324

Query: 336 GIAMQASYP 344
           GIA  ASYP
Sbjct: 325 GIASSASYP 333


>gi|296082368|emb|CBI21373.3| unnamed protein product [Vitis vinifera]
          Length = 245

 Score =  286 bits (731), Expect = 1e-74,   Method: Compositional matrix adjust.
 Identities = 139/218 (63%), Positives = 163/218 (74%), Gaps = 3/218 (1%)

Query: 128 VPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSG 187
           +P S+DWR+ GAV  VKDQ  CG CWAFS VAA+EGIN I T +L SLSEQELVDCDT  
Sbjct: 6   LPESVDWRETGAVNPVKDQRSCGSCWAFSTVAAVEGINQIVTGELISLSEQELVDCDTE- 64

Query: 188 EDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNN 247
            D GC GGLMD AF+FII N GL TE  YPY   DG CN    +     I GYEDVP  +
Sbjct: 65  YDMGCNGGLMDYAFDFIIKNGGLDTEKDYPYTGFDGECNLSGKSSKVVSIDGYEDVPPFD 124

Query: 248 EAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWL 307
           E AL KAVA+QPVSVA++A G   Q Y SG+FTG+CGT LDHG+ AVGYGT ++GT YW+
Sbjct: 125 EKALQKAVAHQPVSVAVEAGGRALQLYVSGIFTGECGTALDHGIVAVGYGT-ENGTDYWI 183

Query: 308 VKNSWGTTWGENGYIRMQRDI-DAKEGLCGIAMQASYP 344
           V+NSWG++WGENGYIRM+R++ DA  G CGIAM+ASYP
Sbjct: 184 VRNSWGSSWGENGYIRMERNMADAFSGKCGIAMEASYP 221


>gi|313118762|gb|ADR32293.1| C14 cysteine protease [Solanum stoloniferum]
          Length = 217

 Score =  285 bits (730), Expect = 2e-74,   Method: Compositional matrix adjust.
 Identities = 133/217 (61%), Positives = 161/217 (74%), Gaps = 2/217 (0%)

Query: 129 PASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGE 188
           P S+DWR KG + GVKDQG CG CWAFSAVAAME IN I T  L SLSEQELVDCD S  
Sbjct: 2   PVSVDWRDKGVLVGVKDQGSCGSCWAFSAVAAMESINAIVTGNLISLSEQELVDCDKS-Y 60

Query: 189 DQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNE 248
           ++GC+GGLMD AFEF+I+N G+ +E  YPYK  +  C++   N    KI  YEDVP NNE
Sbjct: 61  NEGCDGGLMDYAFEFVINNGGIDSEEDYPYKERNDVCDQYRKNAKVVKIDSYEDVPVNNE 120

Query: 249 AALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLV 308
            AL KAVA+QPVS+A++A G DFQ Y SG+FTG+CGT +DHGV A GYGT ++G  YW+V
Sbjct: 121 KALQKAVAHQPVSIALEAGGRDFQHYKSGIFTGKCGTAVDHGVVAAGYGT-ENGMDYWIV 179

Query: 309 KNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPT 345
           +NSWG  WGE GY+R+QR+I    GLCG+A + SYP 
Sbjct: 180 RNSWGAKWGEKGYLRVQRNIARSSGLCGLATEPSYPV 216


>gi|218202077|gb|EEC84504.1| hypothetical protein OsI_31195 [Oryza sativa Indica Group]
          Length = 362

 Score =  285 bits (730), Expect = 2e-74,   Method: Compositional matrix adjust.
 Identities = 160/353 (45%), Positives = 201/353 (56%), Gaps = 14/353 (3%)

Query: 1   MAMILLENKLVLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRF 60
           + + LL +   L A  +L   A       + D  M +R   W   + R Y    E   RF
Sbjct: 13  LTLALLASCGALLATSMLPARATAGSCLDVGDMVMMDRFRAWQGAHNRSYPSAEEALQRF 72

Query: 61  KIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETT--- 117
            +++ N E+I + N +  +  Y+L  NEFAD T EEF A   GY      V  S  T   
Sbjct: 73  DVYRRNAEFIDAVNLRG-DLTYRLAENEFADLTEEEFLATYTGYYAGDGPVDDSVITTGA 131

Query: 118 ---DVSFRYENASVPASIDWRKKGAVTGVKDQ-GQCGCCWAFSAVAAMEGINHITTRKLT 173
              D SF Y    VPAS+DWR +GAV   K Q   C  CWAF   A +E +N I T KL 
Sbjct: 132 GDVDASFSYR-VDVPASVDWRAQGAVVPPKSQTSTCSSCWAFVTAATIESLNMIKTGKLV 190

Query: 174 SLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPS 233
           SLSEQ+LVDCD+   D GC  G    A+++++ N GL TEA YPY A  G CN+ ++   
Sbjct: 191 SLSEQQLVDCDS--YDGGCNLGSYGRAYKWVVENGGLTTEADYPYTARRGPCNRAKSAHH 248

Query: 234 AAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTA 293
           AAKI+G+  VP  NEAAL  AVA QPV+VAI+  GS  QFY  GV+TG CGT L H VT 
Sbjct: 249 AAKITGFGKVPPRNEAALQAAVARQPVAVAIEV-GSGMQFYKGGVYTGPCGTRLAHAVTV 307

Query: 294 VGYGT-ADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPT 345
           VGYGT A  G KYW +KNSWG +WGE GYIR+ RD+    GLCG+ +  +YPT
Sbjct: 308 VGYGTDASSGAKYWTIKNSWGQSWGERGYIRILRDVGGP-GLCGVTLDIAYPT 359


>gi|125525812|gb|EAY73926.1| hypothetical protein OsI_01810 [Oryza sativa Indica Group]
          Length = 319

 Score =  285 bits (730), Expect = 2e-74,   Method: Compositional matrix adjust.
 Identities = 150/317 (47%), Positives = 188/317 (59%), Gaps = 12/317 (3%)

Query: 31  NDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFA 90
           +D    +  E WMA++G+ Y+ + EKE RF IF++NV +I  +  +       +GIN+FA
Sbjct: 12  DDGVTMQMFEEWMAKFGKTYKCHGEKEHRFGIFRDNVHFIRGYKPQVTYDS-AVGINQFA 70

Query: 91  DQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQCG 150
           D TN+EF A   G K   P         +         P  IDWR +GAVTGVKDQG CG
Sbjct: 71  DLTNDEFVATYTGAKPPHPKEAPRPVDPIW-------TPCCIDWRFRGAVTGVKDQGACG 123

Query: 151 CCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGL 210
            CWAF+AVAA+EG+  I T +LT LSEQELVDCDT+    GC GG  D AFE + S  G+
Sbjct: 124 SCWAFAAVAAIEGLTKIRTGQLTPLSEQELVDCDTN--SNGCGGGHTDRAFELVASKGGI 181

Query: 211 ATEAKYPYKASDGSCNKKEA-NPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGS 269
             E+ Y Y+   G C   +     AA I GY  VP N+E  L  AVA QPV+V IDASG 
Sbjct: 182 TAESDYRYEGFQGKCRVDDMLFNHAASIGGYRAVPPNDERQLATAVARQPVTVYIDASGP 241

Query: 270 DFQFYSSGVFTGQCGTELDHGVTAVGYGT-ADDGTKYWLVKNSWGTTWGENGYIRMQRDI 328
            FQFY SGVF G CG   +H VT VGY      G KYW+ KNSWG TWG+ GYI +++D+
Sbjct: 242 AFQFYKSGVFPGPCGASSNHAVTLVGYCQDGASGKKYWVAKNSWGKTWGQQGYILLEKDV 301

Query: 329 DAKEGLCGIAMQASYPT 345
               G CG+A+   YPT
Sbjct: 302 LQPHGTCGLAVSPFYPT 318


>gi|118145|sp|P20721.1|CYSPL_SOLLC RecName: Full=Low-temperature-induced cysteine proteinase; Flags:
           Precursor
 gi|806314|gb|AAA66308.1| thiol protease, partial [Solanum lycopersicum]
          Length = 346

 Score =  285 bits (730), Expect = 2e-74,   Method: Compositional matrix adjust.
 Identities = 134/219 (61%), Positives = 164/219 (74%), Gaps = 2/219 (0%)

Query: 127 SVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTS 186
           S+P SIDWR+KG + GVKDQG CG CWAFSAVAAME IN I T  L SLSEQELVDCD S
Sbjct: 17  SLPESIDWREKGVLVGVKDQGSCGSCWAFSAVAAMESINAIVTGNLISLSEQELVDCDRS 76

Query: 187 GEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSN 246
             ++GC+GGLMD AFEF+I N G+ TE  YPYK  +G C++   N    KI  YEDVP N
Sbjct: 77  -YNEGCDGGLMDYAFEFVIKNGGIDTEEDYPYKERNGVCDQYRKNAKVVKIDSYEDVPVN 135

Query: 247 NEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYW 306
           NE AL KAVA+QPVS+A++A G DFQ Y SG+FTG+CGT +DHGV   GYGT ++G  YW
Sbjct: 136 NEKALQKAVAHQPVSIALEAGGRDFQHYKSGIFTGKCGTAVDHGVVIAGYGT-ENGMDYW 194

Query: 307 LVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPT 345
           +V+NSWG    ENGY+R+QR++ +  GLCG+A++ SYP 
Sbjct: 195 IVRNSWGANCRENGYLRVQRNVSSSSGLCGLAIEPSYPV 233


>gi|255078398|ref|XP_002502779.1| cysteine endopeptidase [Micromonas sp. RCC299]
 gi|226518045|gb|ACO64037.1| cysteine endopeptidase [Micromonas sp. RCC299]
          Length = 414

 Score =  285 bits (729), Expect = 2e-74,   Method: Compositional matrix adjust.
 Identities = 149/314 (47%), Positives = 199/314 (63%), Gaps = 12/314 (3%)

Query: 39  HEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYK--LGINEFADQTNEE 96
           HE W  ++G+ Y    EKE+R KIF +N E++   N +  N  +   +G+N  AD T +E
Sbjct: 69  HE-WTQKHGKTYDSEEEKELRLKIFADNHEFVQKHNAEYENGEHTHFVGLNHLADLTKDE 127

Query: 97  FRAPRNGYKRRLPSVRSSETTDVS-FRYENASVPASIDWRKKGAVTGVKDQGQCGCCWAF 155
           F+    GY   L + R+    D S + Y + + P  IDW   GAVT VK+Q QCG CWAF
Sbjct: 128 FKKML-GYNAALRASRAP--VDASTWEYADVTPPEEIDWVASGAVTPVKNQKQCGSCWAF 184

Query: 156 SAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAK 215
           S   A+EG+N I T KL SLSE+EL+ C T+G + GC GGLMD+ FE+I++N+G+ TE  
Sbjct: 185 STTGAVEGVNAIKTGKLISLSEEELISCSTNG-NMGCNGGLMDNGFEWIVNNRGIDTEDG 243

Query: 216 YPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYS 275
           + Y A +  C     +  A  I G++DVPSN+E +LMKAV+ QPVSVAI+A    FQ Y+
Sbjct: 244 WEYVAKEEKCGFFRRHHRAVAIDGFKDVPSNDEDSLMKAVSQQPVSVAIEADHQSFQLYA 303

Query: 276 SGVFTGQ-CGTELDHGVTAVGYGTADDGTK---YWLVKNSWGTTWGENGYIRMQRDIDAK 331
            GV++ + CGTELDHGV  VGYG     TK   +W +KNSWG  WGE+GYIR+ +     
Sbjct: 304 GGVYSAKDCGTELDHGVLLVGYGVDPKSTKHKHFWKIKNSWGPAWGEDGYIRIAKGGSGV 363

Query: 332 EGLCGIAMQASYPT 345
           EG CG+AMQ SYPT
Sbjct: 364 EGQCGVAMQPSYPT 377


>gi|194352760|emb|CAQ00108.1| papain-like cysteine proteinase [Hordeum vulgare subsp. vulgare]
 gi|326510977|dbj|BAJ91836.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326523875|dbj|BAJ96948.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326528631|dbj|BAJ97337.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 368

 Score =  285 bits (729), Expect = 2e-74,   Method: Compositional matrix adjust.
 Identities = 149/334 (44%), Positives = 206/334 (61%), Gaps = 29/334 (8%)

Query: 35  MNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKA-RNKPYKLGINEFADQT 93
           M +R   W A++ R Y    E+  R +++  N+ YI + N  A     Y+LG   + D T
Sbjct: 38  MAQRFRRWKAEHSRTYATPEEERHRLRVYARNMRYIEATNGDAGAGLTYELGETAYTDLT 97

Query: 94  NEEFRAPRNGYKRRLPSVRSSET----TDVSFRY-----------------ENASVPASI 132
           ++EF A    Y  R P +   +     T ++ R                  E+A  PAS+
Sbjct: 98  SDEFTAM---YTSRAPPLSDDDDDLPMTMITTRAGPVAAAGGGGWLQVYVNESAGAPASV 154

Query: 133 DWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGC 192
           DWR++GAVT VK+QGQCG CWAFS VA +EGI+ I T KL SLSEQELVDCD    D GC
Sbjct: 155 DWRERGAVTAVKNQGQCGSCWAFSTVAVIEGIHQIKTGKLASLSEQELVDCDKL--DHGC 212

Query: 193 EGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALM 252
            GG+   A ++I SN G+ ++  YPY A D +C+ K+ +  AA ISG++ V + +E +L 
Sbjct: 213 NGGVSYRALQWITSNGGITSQDDYPYTAKDDTCDTKKLSHHAASISGFQRVATRSELSLT 272

Query: 253 KAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTAD-DGTKYWLVKNS 311
            AVA QPV+V+I+A G++FQ Y +GV+ G CGT L+HGVT VGYG  +  G  YW+VKNS
Sbjct: 273 NAVAMQPVAVSIEAGGANFQHYRNGVYNGPCGTRLNHGVTVVGYGEDEVTGESYWIVKNS 332

Query: 312 WGTTWGENGYIRMQRD-IDAKEGLCGIAMQASYP 344
           WG  WG+NGY+RM++  ID  EG+CGIA++ S+P
Sbjct: 333 WGEKWGDNGYLRMKKGIIDKPEGICGIAIRPSFP 366


>gi|115478933|ref|NP_001063060.1| Os09g0381400 [Oryza sativa Japonica Group]
 gi|113631293|dbj|BAF24974.1| Os09g0381400 [Oryza sativa Japonica Group]
 gi|215678649|dbj|BAG92304.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|218202075|gb|EEC84502.1| hypothetical protein OsI_31193 [Oryza sativa Indica Group]
          Length = 362

 Score =  285 bits (729), Expect = 2e-74,   Method: Compositional matrix adjust.
 Identities = 160/353 (45%), Positives = 201/353 (56%), Gaps = 14/353 (3%)

Query: 1   MAMILLENKLVLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRF 60
           + + LL +   L A  +L   A       + D  M +R   W   + R Y    E   RF
Sbjct: 13  LTLALLASCGALLATSMLPARATAGSCLDVGDMVMMDRFRAWQGAHNRSYPSAEEALQRF 72

Query: 61  KIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETT--- 117
            +++ N E+I + N +  +  Y+L  NEFAD T EEF A   GY      V  S  T   
Sbjct: 73  DVYRRNAEFIDAVNLRG-DLTYQLAENEFADLTEEEFLATYTGYYAGDGPVDDSVITTGA 131

Query: 118 ---DVSFRYENASVPASIDWRKKGAVTGVKDQ-GQCGCCWAFSAVAAMEGINHITTRKLT 173
              D SF Y    VPAS+DWR +GAV   K Q   C  CWAF   A +E +N I T KL 
Sbjct: 132 GDVDASFSYR-VDVPASVDWRAQGAVVPPKSQTSTCSSCWAFVTAATIESLNMIKTGKLV 190

Query: 174 SLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPS 233
           SLSEQ+LVDCD+   D GC  G    A+++++ N GL TEA YPY A  G CN+ ++   
Sbjct: 191 SLSEQQLVDCDS--YDGGCNLGSYGRAYKWVVENGGLTTEADYPYTARRGPCNRAKSAHH 248

Query: 234 AAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTA 293
           AAKI+G+  VP  NEAAL  AVA QPV+VAI+  GS  QFY  GV+TG CGT L H VT 
Sbjct: 249 AAKITGFGKVPPRNEAALQAAVARQPVAVAIEV-GSGMQFYKGGVYTGPCGTRLAHAVTV 307

Query: 294 VGYGT-ADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPT 345
           VGYGT A  G KYW +KNSWG +WGE GYIR+ RD+    GLCG+ +  +YPT
Sbjct: 308 VGYGTDASSGAKYWTIKNSWGQSWGERGYIRILRDVGGP-GLCGVTLDIAYPT 359


>gi|156124998|gb|ABU50817.1| Ale o 1 allergen [Aleuroglyphus ovatus]
          Length = 337

 Score =  285 bits (729), Expect = 2e-74,   Method: Compositional matrix adjust.
 Identities = 158/324 (48%), Positives = 205/324 (63%), Gaps = 13/324 (4%)

Query: 27  SRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARN--KPYKL 84
           S  L +  +  + E + + +GRVY     +  R  IF+ N+++I   N    N    + +
Sbjct: 21  SMLLTEGELEAQFEQFKSTFGRVYPSPEIELHRKSIFRANLQFILRHNIDYFNGDSTFSV 80

Query: 85  GINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVK 144
            +N F D +NEEFRA  NGY RRL +V  +++       E  ++PA++DW  KG VT +K
Sbjct: 81  SVNNFTDLSNEEFRATFNGY-RRLAAVSLADSVHADNDVE--ALPATVDWTTKGVVTPIK 137

Query: 145 DQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFI 204
           +Q QCG CWAFSAVA+MEG + + T KL SLSEQ LVDC  +  D GC GG MD AF+++
Sbjct: 138 NQQQCGSCWAFSAVASMEGQHALKTGKLVSLSEQNLVDCSAAEGDMGCSGGWMDYAFKYV 197

Query: 205 ISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVAN-QPVSVA 263
           I N+G+ TEA YPYKA D SC  K  N   A I  + DV + +E+AL  AVA+  P+SVA
Sbjct: 198 IQNRGIDTEASYPYKAIDESCEFKR-NSVGATIHSFVDVKTGDESALQNAVASIGPISVA 256

Query: 264 IDASGSDFQFYSSGVFT-GQCGTE-LDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGY 321
           IDA+   FQFYSSGV+    C TE LDHGVTAVGYGT  +G  YW VKNSWGT+WG  GY
Sbjct: 257 IDAAQPSFQFYSSGVYNEPDCSTEILDHGVTAVGYGTL-NGAPYWKVKNSWGTSWGRKGY 315

Query: 322 IRMQRDIDAKEGLCGIAMQASYPT 345
           I M R+   K+  CGIA +ASYP 
Sbjct: 316 IFMSRN---KQNQCGIATKASYPV 336


>gi|49387634|dbj|BAD25828.1| putative cysteine proteinase [Oryza sativa Japonica Group]
 gi|49388888|dbj|BAD26098.1| putative cysteine proteinase [Oryza sativa Japonica Group]
          Length = 358

 Score =  285 bits (729), Expect = 2e-74,   Method: Compositional matrix adjust.
 Identities = 160/353 (45%), Positives = 201/353 (56%), Gaps = 14/353 (3%)

Query: 1   MAMILLENKLVLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRF 60
           + + LL +   L A  +L   A       + D  M +R   W   + R Y    E   RF
Sbjct: 9   LTLALLASCGALLATSMLPARATAGSCLDVGDMVMMDRFRAWQGAHNRSYPSAEEALQRF 68

Query: 61  KIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETT--- 117
            +++ N E+I + N +  +  Y+L  NEFAD T EEF A   GY      V  S  T   
Sbjct: 69  DVYRRNAEFIDAVNLRG-DLTYQLAENEFADLTEEEFLATYTGYYAGDGPVDDSVITTGA 127

Query: 118 ---DVSFRYENASVPASIDWRKKGAVTGVKDQ-GQCGCCWAFSAVAAMEGINHITTRKLT 173
              D SF Y    VPAS+DWR +GAV   K Q   C  CWAF   A +E +N I T KL 
Sbjct: 128 GDVDASFSYR-VDVPASVDWRAQGAVVPPKSQTSTCSSCWAFVTAATIESLNMIKTGKLV 186

Query: 174 SLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPS 233
           SLSEQ+LVDCD+   D GC  G    A+++++ N GL TEA YPY A  G CN+ ++   
Sbjct: 187 SLSEQQLVDCDS--YDGGCNLGSYGRAYKWVVENGGLTTEADYPYTARRGPCNRAKSAHH 244

Query: 234 AAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTA 293
           AAKI+G+  VP  NEAAL  AVA QPV+VAI+  GS  QFY  GV+TG CGT L H VT 
Sbjct: 245 AAKITGFGKVPPRNEAALQAAVARQPVAVAIEV-GSGMQFYKGGVYTGPCGTRLAHAVTV 303

Query: 294 VGYGT-ADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPT 345
           VGYGT A  G KYW +KNSWG +WGE GYIR+ RD+    GLCG+ +  +YPT
Sbjct: 304 VGYGTDASSGAKYWTIKNSWGQSWGERGYIRILRDVGGP-GLCGVTLDIAYPT 355


>gi|33348836|gb|AAQ16118.1| cathepsin L-like cysteine proteinase B [Rhipicephalus
           haemaphysaloides haemaphysaloides]
          Length = 335

 Score =  285 bits (728), Expect = 3e-74,   Method: Compositional matrix adjust.
 Identities = 160/307 (52%), Positives = 200/307 (65%), Gaps = 11/307 (3%)

Query: 44  AQYGRVYRDNAEKEMRFKIFKENVEYIASFNNK-ARNK-PYKLGINEFADQTNEEFRAPR 101
           A +G+ Y  + E+  R KI+ EN   IA  N K A+++  YKL +NEF D  + EF + R
Sbjct: 32  ALHGKDYASDTEEYYRLKIYMENRLKIARHNEKYAKSQVSYKLAMNEFGDLLHHEFVSTR 91

Query: 102 NGYKRRL-PSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAA 160
           NG+KR    S R          +E+  +P ++DWRKKGAVT VK+QGQCG CWAFS   +
Sbjct: 92  NGFKRNYRDSPREGSFFVEPEGFEDLQLPKTVDWRKKGAVTPVKNQGQCGSCWAFSTTGS 151

Query: 161 MEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKA 220
           +EG +   TRKL SLSEQ LVDC  S  + GCEGGLMD+AF++I SNKG+ TE  YPY A
Sbjct: 152 LEGPHFRKTRKLVSLSEQNLVDCSRSFGNNGCEGGLMDNAFKYIKSNKGIDTEWSYPYNA 211

Query: 221 SDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVAN-QPVSVAIDASGSDFQFYSSGVF 279
           +DG C+   ++  A   +G+ D+P  +E  L KAVA   PVSVAIDAS   FQFYS GV+
Sbjct: 212 TDGVCHFNRSDVGATD-TGFVDIPEGDENKLKKAVAAVGPVSVAIDASHESFQFYSEGVY 270

Query: 280 -TGQCGTE-LDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGI 337
              +C +E LDHGV  VGYGT  DG  YWLVKNSWGTTWG+ GYI M R+   K+  CGI
Sbjct: 271 DEPECSSEQLDHGVLVVGYGTK-DGQDYWLVKNSWGTTWGDEGYIYMTRN---KDNQCGI 326

Query: 338 AMQASYP 344
           A  ASYP
Sbjct: 327 ASSASYP 333


>gi|242093994|ref|XP_002437487.1| hypothetical protein SORBIDRAFT_10g027980 [Sorghum bicolor]
 gi|241915710|gb|EER88854.1| hypothetical protein SORBIDRAFT_10g027980 [Sorghum bicolor]
          Length = 341

 Score =  284 bits (727), Expect = 4e-74,   Method: Compositional matrix adjust.
 Identities = 152/319 (47%), Positives = 199/319 (62%), Gaps = 30/319 (9%)

Query: 32  DATMNERHEMWMAQYGRVYRD--NAEKEMRFKIFKENVEYIASFNNKARN--KPYKLGIN 87
           D  + + ++ W +++GR  RD  +    +R K+F++N+ YI + N +A      ++LG+ 
Sbjct: 44  DEEVRQLYKTWKSEHGRP-RDGISVADGLRLKVFRDNLRYIDAHNAEADAGLHTFRLGLT 102

Query: 88  EFADQTNEEFRAPRNGY-KRRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQ 146
            F D T EEFRA   G+    LP V S    D         +P ++DWR++GAVTGVK+Q
Sbjct: 103 PFTDLTLEEFRAHALGFLNSTLPRVAS----DRYLPRAGDDLPDAVDWRQQGAVTGVKNQ 158

Query: 147 GQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIIS 206
             CG CWAFSAVAAMEGIN I T  L SLSEQEL+DCDT  ED GC+GG M  AF+F+I 
Sbjct: 159 LDCGGCWAFSAVAAMEGINKIVTNNLISLSEQELIDCDT--EDYGCQGGEMQKAFQFVID 216

Query: 207 NKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDA 266
           N G+ TEA YP+  ++G+C+          I  YE+VP+N+E AL KAVANQP       
Sbjct: 217 NGGIDTEADYPFIGTNGTCDAIREKRKVVSIDSYENVPTNDEEALQKAVANQP------- 269

Query: 267 SGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQR 326
                     G+F G CG  LDHGVTAVGYG+ D+G  +W+VKNSWG  WGE+GYIRM+R
Sbjct: 270 ----------GIFNGPCGFILDHGVTAVGYGS-DNGEDFWIVKNSWGAEWGESGYIRMKR 318

Query: 327 DIDAKEGLCGIAMQASYPT 345
           ++    G CGIAM ASYP 
Sbjct: 319 NVLLPMGKCGIAMYASYPV 337


>gi|357446979|ref|XP_003593765.1| Cysteine proteinase [Medicago truncatula]
 gi|355482813|gb|AES64016.1| Cysteine proteinase [Medicago truncatula]
          Length = 364

 Score =  284 bits (726), Expect = 5e-74,   Method: Compositional matrix adjust.
 Identities = 151/293 (51%), Positives = 199/293 (67%), Gaps = 9/293 (3%)

Query: 54  AEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYK--RRLPSV 111
           +E E R +IFK N+EYI +FNN A NK YKLG+N+++D T++EF A   G K  ++L S 
Sbjct: 77  SELEKRKRIFKNNLEYIENFNN-AGNKSYKLGLNQYSDLTSDEFLASHTGLKVSKQLSSS 135

Query: 112 RSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRK 171
           +   +  V F   N  VP + DWR++GAVT VKDQG CGCCWAFS VAA+EG   I T +
Sbjct: 136 KM-RSAAVPFNL-NDDVPTNFDWRQQGAVTDVKDQGSCGCCWAFSVVAAVEGAVKINTGE 193

Query: 172 LTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEAN 231
           L SLSEQ+LVDCD    + GC GG MD AF++II  KG+ +EA YPY+    +C   +  
Sbjct: 194 LISLSEQQLVDCDE--RNSGCHGGNMDSAFKYIIQ-KGIVSEADYPYQEGSQTCQLNDQM 250

Query: 232 PSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGV 291
              A+I+ + DVP+N+E  L++AVA QPVSV I+  G +FQ Y   V++G CG  ++H V
Sbjct: 251 KFEAQITNFIDVPANDEQQLLQAVAQQPVSVGIEV-GDEFQHYMGDVYSGTCGQSMNHAV 309

Query: 292 TAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
           TAVGYG ++DGTKYWL+KNSWG  WGE GY+++ R+     G CGIA  ASYP
Sbjct: 310 TAVGYGVSEDGTKYWLIKNSWGKGWGEEGYMKLLRESGEPGGQCGIAAHASYP 362


>gi|125552927|gb|EAY98636.1| hypothetical protein OsI_20560 [Oryza sativa Indica Group]
          Length = 449

 Score =  284 bits (726), Expect = 6e-74,   Method: Compositional matrix adjust.
 Identities = 145/306 (47%), Positives = 188/306 (61%), Gaps = 4/306 (1%)

Query: 38  RHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEF 97
           + E W A++GR Y    E+  R   F +N  ++A+ N    +  Y L +N FAD T++EF
Sbjct: 37  QFEAWCAEHGRSYATPGERAARLAAFADNAAFVAAHNGAPAS--YALALNAFADLTHDEF 94

Query: 98  RAPRNGYKRRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQCGCCWAFSA 157
           RA R G        R      +       +VP ++DWR+ GAVT VKDQG CG CW+FSA
Sbjct: 95  RAARLGRLAAAGPGRDGGAPYLGVDGGVGAVPDAVDWRQSGAVTKVKDQGSCGACWSFSA 154

Query: 158 VAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYP 217
             AMEGIN I T  L SLSEQEL+DCD S  + GC GGLMD A++F++ N G+ TEA YP
Sbjct: 155 TGAMEGINKIKTGSLISLSEQELIDCDRS-YNSGCGGGLMDYAYKFVVKNGGIDTEADYP 213

Query: 218 YKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSG 277
           Y+ +DG+CNK +       I GY+DVP+NNE  L++AVA QPVSV I  S   FQ YS G
Sbjct: 214 YRETDGTCNKNKLKRRVVTIDGYKDVPANNEDMLLQAVAQQPVSVGICGSARAFQLYSKG 273

Query: 278 VFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGI 337
           +F G C T LDH +  VGYG+ + G  YW+VKNSWG +WG  GY+ M R+     G+CGI
Sbjct: 274 IFDGPCPTSLDHAILIVGYGS-EGGKDYWIVKNSWGESWGMKGYMYMHRNTGNSNGVCGI 332

Query: 338 AMQASY 343
               S+
Sbjct: 333 NQMPSF 338


>gi|157093563|gb|ABV22436.1| cysteine proteinase [Oxyrrhis marina]
          Length = 329

 Score =  283 bits (725), Expect = 7e-74,   Method: Compositional matrix adjust.
 Identities = 149/315 (47%), Positives = 198/315 (62%), Gaps = 10/315 (3%)

Query: 33  ATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQ 92
           A ++ + E + A++G  Y    E+  R  +F +NV+ I   N+K     Y LG+N+FAD 
Sbjct: 13  ADIDAQWEEFKAKFGESYNGEEEEAERKGVFAQNVQLINEENSKGHT--YTLGVNQFADL 70

Query: 93  TNEEFRAPRNGYKRRLPSVRSSETTDVSFR-YENASVPASIDWRKKGAVTGVKDQGQCGC 151
           T EEF     G+K+  P+ +  +   +    Y   ++P S+DW  +GAVT VK+QGQCG 
Sbjct: 71  TVEEFSKTYMGFKK--PAQKYGDAAYLGRHVYNGEALPTSVDWSSQGAVTPVKNQGQCGS 128

Query: 152 CWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLA 211
           CW+FS   ++EG N I+T KL SLSEQ+ VDC  +  +QGC GGLMD AF++  +N  L 
Sbjct: 129 CWSFSTTGSLEGANEISTGKLVSLSEQQFVDCAGTYGNQGCNGGLMDSAFKYAEANA-LC 187

Query: 212 TEAKYPYKASDGSCNKKEANPSAAK--ISGYEDVPSNNEAALMKAVANQPVSVAIDASGS 269
           TE  YPYK +DGSC     +   AK  +SGY+DV S++E  +M AVA QPVS+AI+A  S
Sbjct: 188 TEQSYPYKGTDGSCQASSCSTGLAKGSVSGYKDVSSDSEQDMMSAVAQQPVSIAIEADKS 247

Query: 270 DFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDID 329
            FQ YS GV TG CG  LDHGV AVGYGT   GT YW VKNSWG+TWG +GY+ +QR   
Sbjct: 248 VFQLYSGGVLTGACGASLDHGVLAVGYGTL-SGTDYWKVKNSWGSTWGMSGYVLLQRG-K 305

Query: 330 AKEGLCGIAMQASYP 344
              G CG+  + SYP
Sbjct: 306 GGSGECGLLSEPSYP 320


>gi|422001787|dbj|BAM66994.1| germination-specific cysteine protease 1, partial [Raphanus
           sativus]
          Length = 235

 Score =  283 bits (724), Expect = 9e-74,   Method: Compositional matrix adjust.
 Identities = 132/220 (60%), Positives = 166/220 (75%), Gaps = 3/220 (1%)

Query: 127 SVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTS 186
           ++P ++DWR+KGAV  +K+QG CG CWAFS  A +EGIN I T +L SLSEQELVDCD S
Sbjct: 3   ALPETVDWRQKGAVNAIKNQGTCGSCWAFSTAAVVEGINKIVTGELISLSEQELVDCDKS 62

Query: 187 GEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSN 246
             +QGC GGLMD AF+FI+ N GL TE  YPY+ SDG CN    N     I GYEDVP+N
Sbjct: 63  -YNQGCNGGLMDYAFQFIMKNGGLNTEQDYPYRGSDGKCNSLLKNSKVVTIDGYEDVPTN 121

Query: 247 NEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYW 306
           +E AL +AV+ QPVSVAIDA G  FQ Y SG+FTG+CGT++DH V AVGYG+ ++G  YW
Sbjct: 122 DETALKRAVSYQPVSVAIDAGGRVFQHYQSGIFTGECGTKMDHAVVAVGYGS-ENGVDYW 180

Query: 307 LVKNSWGTTWGENGYIRMQRDI-DAKEGLCGIAMQASYPT 345
           +V+NSWG  WGE+GYIR++R++  +K G CGIA++ASYP 
Sbjct: 181 IVRNSWGQKWGEDGYIRIERNLASSKSGKCGIAIEASYPV 220


>gi|302763837|ref|XP_002965340.1| hypothetical protein SELMODRAFT_143126 [Selaginella moellendorffii]
 gi|302790566|ref|XP_002977050.1| hypothetical protein SELMODRAFT_232903 [Selaginella moellendorffii]
 gi|300155026|gb|EFJ21659.1| hypothetical protein SELMODRAFT_232903 [Selaginella moellendorffii]
 gi|300167573|gb|EFJ34178.1| hypothetical protein SELMODRAFT_143126 [Selaginella moellendorffii]
          Length = 300

 Score =  283 bits (723), Expect = 1e-73,   Method: Compositional matrix adjust.
 Identities = 141/308 (45%), Positives = 202/308 (65%), Gaps = 14/308 (4%)

Query: 40  EMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRA 99
           E W A++G+ Y  + EK  R  IF + + YI   +N   N  + LG+N+F+D TN EFRA
Sbjct: 3   EGWAAKHGKSYSSDWEKARRLMIFSDTLAYIEK-HNALPNTTFTLGLNKFSDLTNAEFRA 61

Query: 100 PRNGYKR--RLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQCGCCWAFSA 157
              G  +  R    R ++  DV      +S+P S+DWR++GAVT +KDQGQCG CWAFSA
Sbjct: 62  NYVGKFKPPRYQDRRPAKDVDVDV----SSLPTSLDWRQEGAVTPIKDQGQCGSCWAFSA 117

Query: 158 VAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYP 217
           +A++E  + + T++L SLSEQ+L+DCDT   DQGC+GG  +DAF+F++ N G+ TE  YP
Sbjct: 118 IASIESAHFLATKELVSLSEQQLIDCDTV--DQGCQGGFPEDAFKFVVENGGVTTEEAYP 175

Query: 218 YKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSG 277
           Y    GSCN  +      +I+GY+DV  ++  ALMKAV+  PV+V I  S  +FQ Y SG
Sbjct: 176 YTGFAGSCNANKNK--VVEITGYKDVTKDSADALMKAVSKTPVTVGICGSDQNFQNYRSG 233

Query: 278 VFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGI 337
           + +G C    DH V  +GYGT + G  YW++KNSWGT+WGE+G++R+++  +  EG+CG+
Sbjct: 234 ILSGHCSNSRDHAVLVIGYGT-EGGMPYWIIKNSWGTSWGEDGFMRIKK--EDGEGMCGM 290

Query: 338 AMQASYPT 345
             Q+SYPT
Sbjct: 291 NGQSSYPT 298


>gi|346469447|gb|AEO34568.1| hypothetical protein [Amblyomma maculatum]
          Length = 333

 Score =  283 bits (723), Expect = 1e-73,   Method: Compositional matrix adjust.
 Identities = 156/310 (50%), Positives = 198/310 (63%), Gaps = 11/310 (3%)

Query: 40  EMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNK--PYKLGINEFADQTNEEF 97
           E + + + + Y+ N E+ +RFKIF EN  +IA  N K       YKLGIN+FAD    EF
Sbjct: 28  EAFKSTHKKTYKSNVEELLRFKIFTENSLFIAKHNVKYAKGLVSYKLGINQFADLLPHEF 87

Query: 98  RAPRNGYKRRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQCGCCWAFSA 157
               NGY+ +  + R S T        ++S+P ++DWRKKGAVT VKDQGQCG CWAFS+
Sbjct: 88  VKMMNGYQGKRLAGRGS-TYLPPANLNDSSLPKTVDWRKKGAVTPVKDQGQCGSCWAFSS 146

Query: 158 VAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYP 217
             ++EG + + T KL SLSEQ LVDC ++  +QGC GGLMD++F +I +N G+ TE  YP
Sbjct: 147 TGSLEGQHFLKTGKLVSLSEQNLVDCSSAYGNQGCNGGLMDNSFNYIKANGGIDTEDSYP 206

Query: 218 YKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVAN-QPVSVAIDASGSDFQFYSS 276
           Y+A DG C  K+ +  A   +G+ D+   +E  L KAVA   PVSVAIDAS   FQ YS 
Sbjct: 207 YEAEDGDCRYKKEDVGATD-TGFVDIKEGSEKDLQKAVATVGPVSVAIDASQQSFQLYSE 265

Query: 277 GVF-TGQCGTE-LDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGL 334
           GV+    C +E LDHGV AVGYG   +G KYWLVKNSW  TWG++GYI M RD   K   
Sbjct: 266 GVYDEPNCSSESLDHGVLAVGYGVK-NGKKYWLVKNSWAETWGQDGYILMSRD---KNNQ 321

Query: 335 CGIAMQASYP 344
           CGIA  ASYP
Sbjct: 322 CGIASSASYP 331


>gi|33112581|gb|AAP94046.1| cathepsin-L-like cysteine peptidase 02 [Tenebrio molitor]
          Length = 337

 Score =  282 bits (722), Expect = 1e-73,   Method: Compositional matrix adjust.
 Identities = 161/342 (47%), Positives = 202/342 (59%), Gaps = 17/342 (4%)

Query: 10  LVLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEY 69
           L+  AI V G  A   +        + E+   +   + + Y+ + E+  R KIF EN   
Sbjct: 4   LIFLAICVAGSQAVSFFD------LVQEQWGAFKMTHNKQYQSDTEERFRMKIFMENSHT 57

Query: 70  IASFNNKARNK--PYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTD-VSFRYE-N 125
           +A  N         +KLGIN++AD  + EF    NG+ R    +RS E+ D V+F    N
Sbjct: 58  VAKHNKLYAQGLVSFKLGINKYADMLHHEFVQVLNGFNRTKSGLRSGESDDSVTFLPPAN 117

Query: 126 ASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDT 185
             +P  IDWR KGAVT VKDQGQCG CW+FSA  ++EG +   + KL SLSEQ LVDC  
Sbjct: 118 VQLPGQIDWRDKGAVTPVKDQGQCGSCWSFSATGSLEGQHFRKSGKLVSLSEQNLVDCSE 177

Query: 186 SGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPS 245
              + GC GGLMD+AF +I +N G+ TE  YPYKA D  C+ K  N  A    GY D+ S
Sbjct: 178 KFGNNGCNGGLMDNAFRYIKANGGIDTEQAYPYKAEDEKCHYKPKNKGATD-RGYVDIES 236

Query: 246 NNEAALMKAVAN-QPVSVAIDASGSDFQFYSSGV-FTGQCG-TELDHGVTAVGYGTADDG 302
            NE  L  AVA   PVSVAIDAS   FQ YS GV +  +C  ++LDHGV  VGYGT DDG
Sbjct: 237 GNEDKLQSAVATVGPVSVAIDASHQSFQLYSGGVYYEPECSPSQLDHGVLVVGYGTEDDG 296

Query: 303 TKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
           T YWLVKNSWG +WG+ GYI+M R+ D     CGIA +ASYP
Sbjct: 297 TDYWLVKNSWGKSWGDQGYIKMARNRDNN---CGIATEASYP 335


>gi|302790570|ref|XP_002977052.1| hypothetical protein SELMODRAFT_268054 [Selaginella moellendorffii]
 gi|300155028|gb|EFJ21661.1| hypothetical protein SELMODRAFT_268054 [Selaginella moellendorffii]
          Length = 300

 Score =  282 bits (721), Expect = 2e-73,   Method: Compositional matrix adjust.
 Identities = 141/308 (45%), Positives = 201/308 (65%), Gaps = 14/308 (4%)

Query: 40  EMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRA 99
           E W A++G+ Y  + EK  R  IF + + YI   +N   N  + LG+N+F+D TN EFRA
Sbjct: 3   EGWAAKHGKSYSSDWEKARRLMIFSDTLAYIEK-HNALPNTTFTLGLNKFSDLTNAEFRA 61

Query: 100 PRNGYKR--RLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQCGCCWAFSA 157
              G  +  R    R ++  DV      +S+P S+DWR++GAVT +KDQGQCG CWAFSA
Sbjct: 62  NYVGKFKPPRYQDRRPAKDVDVDV----SSLPTSLDWRQEGAVTPIKDQGQCGSCWAFSA 117

Query: 158 VAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYP 217
           +A++E  + + T++L SLSEQ+L+DCDT   DQGC+GG  +DAF+F++ N G+ TE  YP
Sbjct: 118 IASIESAHFLATKELVSLSEQQLIDCDTV--DQGCQGGFPEDAFKFVVENGGVTTEEAYP 175

Query: 218 YKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSG 277
           Y    GSCN  +      +I+GY+DV  ++  ALMKAV+  PV+V I  S  +FQ Y SG
Sbjct: 176 YTGFAGSCNANKNK--VVEITGYKDVTKDSADALMKAVSKTPVTVGICGSDQNFQNYRSG 233

Query: 278 VFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGI 337
           + +G C    DH V  +GYGT + G  YW++KNSWGT+WGE+G++R+++     EG+CG+
Sbjct: 234 ILSGHCSNSRDHAVLVIGYGT-EGGMPYWIIKNSWGTSWGEDGFMRIKK--KDGEGMCGM 290

Query: 338 AMQASYPT 345
             Q+SYPT
Sbjct: 291 NGQSSYPT 298


>gi|33112583|gb|AAP94047.1| cathepsin-L-like cysteine peptidase 03 [Tenebrio molitor]
          Length = 337

 Score =  282 bits (721), Expect = 2e-73,   Method: Compositional matrix adjust.
 Identities = 161/342 (47%), Positives = 201/342 (58%), Gaps = 17/342 (4%)

Query: 10  LVLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEY 69
           L+  AI V G  A   +        + E+   +   + + Y+ + E+  R KIF EN   
Sbjct: 4   LIFLAICVAGSQAVSFFD------LVQEQWGAFKMTHNKQYQSDTEERFRMKIFMENSHT 57

Query: 70  IASFNNKARNK--PYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTD-VSFRYE-N 125
           +A  N         +KLGIN++AD  + EF    NG+ R    +RS E+ D V+F    N
Sbjct: 58  VAKHNKLYAQGLVSFKLGINKYADMLHHEFVQVLNGFNRTKSGLRSGESDDSVTFLPPAN 117

Query: 126 ASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDT 185
             +P  IDWR KGAVT VKDQGQCG CW+FSA  ++EG +   + KL SLSEQ LVDC  
Sbjct: 118 VQLPGQIDWRDKGAVTPVKDQGQCGSCWSFSATGSLEGQHFRKSGKLVSLSEQNLVDCSE 177

Query: 186 SGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPS 245
              + GC GGLMD+AF +I +N G+ TE  YPYKA D  C+ K  N  A    GY D+ S
Sbjct: 178 KFGNNGCNGGLMDNAFRYIKANGGIDTEQAYPYKAEDEKCHYKPKNKGATD-RGYVDIES 236

Query: 246 NNEAALMKAVAN-QPVSVAIDASGSDFQFYSSGV-FTGQC-GTELDHGVTAVGYGTADDG 302
            NE  L  AVA   PVSVAIDAS   FQ YS GV +   C  ++LDHGV  VGYGT DDG
Sbjct: 237 GNEDKLQSAVATVGPVSVAIDASHQSFQLYSGGVYYEPDCSASQLDHGVLVVGYGTEDDG 296

Query: 303 TKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
           T YWLVKNSWG +WG+ GYI+M R+ D     CGIA +ASYP
Sbjct: 297 TDYWLVKNSWGKSWGDQGYIKMARNRDNN---CGIATEASYP 335


>gi|217072410|gb|ACJ84565.1| unknown [Medicago truncatula]
          Length = 328

 Score =  282 bits (721), Expect = 2e-73,   Method: Compositional matrix adjust.
 Identities = 138/240 (57%), Positives = 175/240 (72%), Gaps = 4/240 (1%)

Query: 106 RRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGIN 165
           RR+     S++   + R  +  +P S+DWRK+GAV GVKDQ  CG CWAFSA+AA+EGIN
Sbjct: 3   RRMKKFGGSKSNRYAPRVGD-KLPESVDWRKEGAVVGVKDQASCGSCWAFSAIAAVEGIN 61

Query: 166 HITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSC 225
            I T  L SLSEQELVDCDTS  ++GC GGLMD AFEFIISN G+ +E  YPYKA DG C
Sbjct: 62  KIVTGDLISLSEQELVDCDTS-YNEGCNGGLMDYAFEFIISNGGIDSEDDYPYKAVDGRC 120

Query: 226 NKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGT 285
           ++   N     I  YEDVP+ +E AL KAVANQP++VA++  G +FQ Y  GV TG+CGT
Sbjct: 121 DQNRKNAKVVTIDDYEDVPAYDELALQKAVANQPIAVAVEGGGREFQLYEYGVLTGRCGT 180

Query: 286 ELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDI-DAKEGLCGIAMQASYP 344
            LDHGV AVGYGT ++G  YW+V+NSWG +WGE GYIR++R++  ++ G CGIA++ SYP
Sbjct: 181 ALDHGVAAVGYGT-ENGKDYWIVRNSWGGSWGEQGYIRLERNLASSRAGKCGIAIEPSYP 239


>gi|346466067|gb|AEO32878.1| hypothetical protein [Amblyomma maculatum]
          Length = 358

 Score =  281 bits (720), Expect = 2e-73,   Method: Compositional matrix adjust.
 Identities = 156/307 (50%), Positives = 197/307 (64%), Gaps = 11/307 (3%)

Query: 44  AQYGRVYRDNAEKEMRFKIFKENVEYIASFNNK-ARNK-PYKLGINEFADQTNEEFRAPR 101
           A +G+ Y    E+  R KI+ EN   IA  N K A NK  YKL +NEF D  + EF + R
Sbjct: 55  ALHGKEYHSETEEYYRLKIYMENRLKIARHNEKYANNKASYKLAMNEFGDLLHHEFVSTR 114

Query: 102 NGYKRRLPSVRSSETTDVSFR-YENASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAA 160
           NG+KR   S     +  +     E+  +P ++DWRKKGAVT VK+QGQCG CWAFS   +
Sbjct: 115 NGFKRNYRSTPREGSFYIEPEGIEDKHLPKTVDWRKKGAVTPVKNQGQCGSCWAFSTTGS 174

Query: 161 MEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKA 220
           +EG +   T ++ SLSEQ LVDC     + GCEGGLMD+AF++I +N G+ TE  YPY  
Sbjct: 175 LEGQHFRKTGRMVSLSEQNLVDCSGKFGNNGCEGGLMDNAFKYIKANGGIDTELSYPYNG 234

Query: 221 SDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVAN-QPVSVAIDASGSDFQFYSSGVF 279
           +DG C+ ++++  A   +G+ D+P  NE  L KAVA   PVSVAIDAS   FQFYS GV+
Sbjct: 235 TDGICHFEKSDVGATD-TGFVDIPEGNEQLLKKAVATVGPVSVAIDASHESFQFYSQGVY 293

Query: 280 -TGQCGTE-LDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGI 337
              +C +E LDHGV  VGYGT  DG  YWLVKNSWGTTWG++GYI M R+   KE  CGI
Sbjct: 294 DEPECSSESLDHGVLVVGYGTK-DGQDYWLVKNSWGTTWGDDGYIYMTRN---KENQCGI 349

Query: 338 AMQASYP 344
           A  ASYP
Sbjct: 350 ASSASYP 356


>gi|312306194|gb|ADQ73946.1| cathepsin L [Paralithodes camtschaticus]
          Length = 324

 Score =  281 bits (719), Expect = 3e-73,   Method: Compositional matrix adjust.
 Identities = 159/316 (50%), Positives = 195/316 (61%), Gaps = 16/316 (5%)

Query: 34  TMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNK--PYKLGINEFAD 91
           T    H+ +  QYGR Y    E+  R  ++ +N+E+I + N +  N    Y L IN+F D
Sbjct: 18  TFTSFHQ-FKVQYGRQYATAQEERYRSSVYDQNMEFIEAHNEQYTNGEVTYMLAINQFGD 76

Query: 92  QTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQCGC 151
            TNEE  A  NG    LP+  S     +  R  + ++PA +DWR KGAVT VKDQ  CG 
Sbjct: 77  MTNEEINAVMNGL---LPASESRGVAVLGGR--DDTLPAEVDWRTKGAVTPVKDQKACGS 131

Query: 152 CWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLA 211
           CWAFSA  ++EG + +   KL SLSEQ LVDC T   D GC GGLMD AF +I  N G+ 
Sbjct: 132 CWAFSATGSLEGQHFLKDGKLVSLSEQNLVDCSTKQGDHGCGGGLMDFAFTYIKDNGGID 191

Query: 212 TEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVAN-QPVSVAIDASGSD 270
           TEA YPY+A+DG C    AN S A ++GY DV  ++E AL KAVA   P+SVAIDAS S 
Sbjct: 192 TEASYPYEATDGKCQYNPAN-SGATVTGYVDVEHDSEDALQKAVATIGPISVAIDASRST 250

Query: 271 FQFYSSGV-FTGQC-GTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDI 328
           F FY  GV +  +C  T LDHGV AVGYGT  DGT YWLVKNSW  TWG +G+I M R+ 
Sbjct: 251 FHFYHKGVYYDKECSSTSLDHGVLAVGYGT-QDGTDYWLVKNSWNITWGNHGFIEMSRN- 308

Query: 329 DAKEGLCGIAMQASYP 344
             +   CGIA QASYP
Sbjct: 309 --RNNNCGIATQASYP 322


>gi|356509992|ref|XP_003523725.1| PREDICTED: oryzain alpha chain-like [Glycine max]
          Length = 439

 Score =  281 bits (719), Expect = 3e-73,   Method: Compositional matrix adjust.
 Identities = 150/313 (47%), Positives = 191/313 (61%), Gaps = 10/313 (3%)

Query: 36  NERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARN----KPYKLGINEFAD 91
           +E  E W  ++ + Y    EK  R K+F++N  ++A  N  A N      Y L +N FAD
Sbjct: 30  SELFEKWCKEHSKTYSSEEEKLYRLKVFEDNYAFVAQHNQNANNNNNNSSYTLSLNAFAD 89

Query: 92  QTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQCGC 151
            T+ EF+  R G    L   +  +        +   +P+ IDWR+ GAVT VKDQ  CG 
Sbjct: 90  LTHHEFKTTRLGLPLTLLRFKRPQNQQSR---DLLHIPSQIDWRQSGAVTPVKDQASCGA 146

Query: 152 CWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLA 211
           CWAFSA  A+EGIN I T  L SLSEQEL+DCDTS  + GC GGLMD A++F+I NKG+ 
Sbjct: 147 CWAFSATGAIEGINKIVTGSLVSLSEQELIDCDTS-YNSGCGGGLMDFAYQFVIDNKGID 205

Query: 212 TEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDF 271
           TE  YPY+A   SC+K +    A  I  Y DVP + E  ++KAVA+QPVSV I  S  +F
Sbjct: 206 TEDDYPYQARQRSCSKDKLKRRAVTIEDYVDVPPSEE-EILKAVASQPVSVGICGSEREF 264

Query: 272 QFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAK 331
           Q YS G+FTG C T LDH V  VGYG +++G  YW+VKNSWG  WG NGYI M R+    
Sbjct: 265 QLYSKGIFTGPCSTFLDHAVLIVGYG-SENGVDYWIVKNSWGKYWGMNGYIHMIRNSGNS 323

Query: 332 EGLCGIAMQASYP 344
           +G+CGI   ASYP
Sbjct: 324 KGICGINTLASYP 336


>gi|351629617|gb|AEQ54772.1| KDEL-tailed cysteine proteinase CP4, partial [Coffea canephora]
          Length = 215

 Score =  281 bits (719), Expect = 3e-73,   Method: Compositional matrix adjust.
 Identities = 137/200 (68%), Positives = 159/200 (79%), Gaps = 4/200 (2%)

Query: 147 GQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIIS 206
           G+CG CWAFS V  +EGIN I T +L SLSEQELVDC+T  +++GC GGLM++A+EFI  
Sbjct: 1   GKCGSCWAFSTVVGVEGINKIKTGQLVSLSEQELVDCET--DNEGCNGGLMENAYEFIKK 58

Query: 207 NKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDA 266
           + G+ TE  YPYKA DGSC+  + N  A  I G+E VP+N+E ALMKAVANQPVSVAIDA
Sbjct: 59  SGGITTERLYPYKARDGSCDSSKMNAPAVTIDGHEMVPANDENALMKAVANQPVSVAIDA 118

Query: 267 SGSDFQFYSSGVFTG-QCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQ 325
           SGSD QFYS GV+TG  CG ELDHGV  VGYGTA DGTKYW+VKNSWGT WGE GYIRMQ
Sbjct: 119 SGSDMQFYSEGVYTGDSCGNELDHGVAVVGYGTALDGTKYWIVKNSWGTGWGEQGYIRMQ 178

Query: 326 RDIDAKE-GLCGIAMQASYP 344
           R +DA E G+CGIAM+ASYP
Sbjct: 179 RGVDAAEGGVCGIAMEASYP 198


>gi|72008176|ref|XP_780713.1| PREDICTED: cathepsin L-like [Strongylocentrotus purpuratus]
          Length = 335

 Score =  281 bits (719), Expect = 4e-73,   Method: Compositional matrix adjust.
 Identities = 159/338 (47%), Positives = 208/338 (61%), Gaps = 16/338 (4%)

Query: 14  AILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASF 73
           ++L++      S S +  D   +E    W  ++G+ Y  + E+  R  I+++N++ +   
Sbjct: 5   SVLLVAACVVSSLSMSFTD--FDEDWNQWKNEHGKRYLSDEEEASRKLIWEKNLDIVIKH 62

Query: 74  NNK--ARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYEN--ASVP 129
           N K    +  Y LG+N+FAD  NEEF A   G++    S  +  +T   F   N    +P
Sbjct: 63  NLKYDLGHFTYALGMNQFADLKNEEFVAMMTGFRVNGTSKAAKGST---FLPSNNIGELP 119

Query: 130 ASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGED 189
            ++DWR KG VT VKDQGQCG CWAFS   ++EG +   T KL SLSEQ LVDC     +
Sbjct: 120 KTVDWRTKGYVTPVKDQGQCGSCWAFSTTGSLEGQHFKATGKLVSLSEQNLVDCSGKEGN 179

Query: 190 QGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEA 249
           +GC+GGLMD AF++II   G+ TE  YPYKA DG C+ K+AN   A ++GY DV S++E 
Sbjct: 180 EGCDGGLMDQAFQYIIKAGGIDTEESYPYKAVDGECHFKKANI-GATVTGYTDVTSDSET 238

Query: 250 ALMKAVAN-QPVSVAIDASGSDFQFYSSGVFT-GQC-GTELDHGVTAVGYGTADDGTKYW 306
           AL KAVA+  P+SVAIDAS   FQ Y SGV+    C  T LDHGV AVGYGT  DGT YW
Sbjct: 239 ALQKAVAHIGPISVAIDASHMSFQLYKSGVYNEPDCSSTLLDHGVLAVGYGTTSDGTDYW 298

Query: 307 LVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
           +VKNSW  TWG NGY+ M R+   K+  CGIA QASYP
Sbjct: 299 IVKNSWAETWGMNGYLWMSRN---KDNQCGIATQASYP 333


>gi|30023547|gb|AAO48766.2| cathepsin L-like cysteine proteinase [Tenebrio molitor]
          Length = 337

 Score =  281 bits (718), Expect = 4e-73,   Method: Compositional matrix adjust.
 Identities = 161/344 (46%), Positives = 201/344 (58%), Gaps = 17/344 (4%)

Query: 8   NKLVLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENV 67
           N L+  AI V G  A   +        + E+   +   + + Y+   E+  R KIF EN 
Sbjct: 2   NFLIFLAICVAGSQAVSFFD------LVQEQWGAFKMTHNKQYQSETEERFRMKIFMENS 55

Query: 68  EYIASFNNKARNK--PYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTD-VSFRYE 124
             +A  N         +KLGIN++AD  + EF    NG+ R    +RS E+ D V+F   
Sbjct: 56  HTVAKHNKLYAQGLVSFKLGINKYADMLHHEFVQVLNGFNRTKSGLRSGESDDSVTFLPP 115

Query: 125 -NASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDC 183
            N  +P  IDWR KGAVT VKDQGQCG CW+FSA  ++EG +   + KL SLSEQ LVDC
Sbjct: 116 ANVQLPGQIDWRDKGAVTPVKDQGQCGSCWSFSATGSLEGQHFRQSGKLVSLSEQNLVDC 175

Query: 184 DTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDV 243
                + GC GGLMD+AF +I +N G+ TE  YPYKA D  C+ K  N  A    GY D+
Sbjct: 176 SEKFGNNGCNGGLMDNAFRYIKANGGIDTEQAYPYKAEDEKCHYKPKNKGATD-RGYVDI 234

Query: 244 PSNNEAALMKAVAN-QPVSVAIDASGSDFQFYSSGV-FTGQC-GTELDHGVTAVGYGTAD 300
            S NE  L  AVA   PVSVAIDAS   FQ YS GV +   C  ++LDHGV  VGYGT D
Sbjct: 235 ESGNEDKLQSAVATVGPVSVAIDASHQSFQLYSGGVYYEPDCSASQLDHGVLVVGYGTED 294

Query: 301 DGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
           DGT YWLVKNSWG +WG+ GYI+M R+   +   CGIA +ASYP
Sbjct: 295 DGTDYWLVKNSWGKSWGDQGYIKMARN---RNNNCGIATEASYP 335


>gi|242048430|ref|XP_002461961.1| hypothetical protein SORBIDRAFT_02g011230 [Sorghum bicolor]
 gi|241925338|gb|EER98482.1| hypothetical protein SORBIDRAFT_02g011230 [Sorghum bicolor]
          Length = 380

 Score =  280 bits (717), Expect = 5e-73,   Method: Compositional matrix adjust.
 Identities = 157/341 (46%), Positives = 207/341 (60%), Gaps = 25/341 (7%)

Query: 27  SRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNK--PYKL 84
           S T +++ M ER + W A Y + Y   AE   RF ++  N+ YI + N +A      Y+L
Sbjct: 40  SSTDDNSPMIERFQRWKAAYNKSYATVAEDRRRFLVYARNMAYIEATNAEAEAAGLTYEL 99

Query: 85  GINEFADQTNEEFRAPRNGYKR--RLP-------------SVRSSETTDVSFR--YENAS 127
           G   + D TN+EF A         +LP             + R+     V     Y N S
Sbjct: 100 GETAYTDLTNQEFMAMYTAAPSPAQLPADEDEDDAAEAVITTRAGPVDAVGQLPVYVNLS 159

Query: 128 V--PASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDT 185
              PAS+DWR  GAVT VK+QG+CG CWAFS VA +EGI  I T KL SLSEQELVDCDT
Sbjct: 160 TAAPASVDWRASGAVTPVKNQGRCGSCWAFSTVAVVEGIYQIRTGKLVSLSEQELVDCDT 219

Query: 186 SGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPS 245
              D GC+GG+   A  +I SN GL TE  YPY  +  +CN+ +   +AA I+G   V +
Sbjct: 220 --LDAGCDGGISYRALRWITSNGGLTTEEDYPYTGTTDACNRAKLAHNAASIAGLRRVAT 277

Query: 246 NNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGT-ADDGTK 304
            +EA+L  AVA QPV+V+I+A G +FQ Y  GV+ G CGT L+HGVT VGYG   +DG K
Sbjct: 278 RSEASLANAVAGQPVAVSIEAGGDNFQHYKRGVYNGPCGTSLNHGVTVVGYGQEEEDGDK 337

Query: 305 YWLVKNSWGTTWGENGYIRMQRDIDAK-EGLCGIAMQASYP 344
           YW++KNSWG +WG+ GYI+M++D+  K EGLCGIA++ S+P
Sbjct: 338 YWIIKNSWGASWGDGGYIKMRKDVAGKPEGLCGIAIRPSFP 378


>gi|357122137|ref|XP_003562772.1| PREDICTED: fruit bromelain-like [Brachypodium distachyon]
          Length = 358

 Score =  280 bits (717), Expect = 5e-73,   Method: Compositional matrix adjust.
 Identities = 156/331 (47%), Positives = 207/331 (62%), Gaps = 19/331 (5%)

Query: 30  LNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEF 89
           + D  M +R   + A Y R Y    E+  RF++++ NV+YI + N +  +  Y+LG N+F
Sbjct: 31  VGDMLMMDRFRAFQATYNRTYASPEERLRRFEVYRRNVDYIEAMNRRG-DLTYELGENQF 89

Query: 90  ADQTNEEFRA----------PRNGYKRR--LPSVRSSETTDVSFRYENA---SVPASIDW 134
           AD T +EFRA            + ++RR  + ++    T D    Y +A   + P S+DW
Sbjct: 90  ADLTVQEFRAMYTMPARVDSRPDAWRRRQMITTLAGPVTEDGGSYYSDAWEEAGPTSVDW 149

Query: 135 RKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEG 194
           R KGAVT VKDQG CGCCWAF+ VA +EG++ I T +L SLSEQELVDCD + +  G   
Sbjct: 150 RSKGAVTPVKDQGGCGCCWAFATVATIEGLHKIKTGQLVSLSEQELVDCDDADDGCGGG- 208

Query: 195 GLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKA 254
            L + A E++  N GL TEA YPY    G C++ +A+  AAKI+  + V +N+EA L +A
Sbjct: 209 -LPEIAMEWVAHNGGLTTEANYPYTGKAGKCDRGKASNHAAKIAAAQMVRANSEAELERA 267

Query: 255 VANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGT 314
           VA QPV+VAI+A  S   FY SGV++G C  E DH VT VGYG  + G KYW++KNSW  
Sbjct: 268 VARQPVAVAINAPDS-LMFYKSGVYSGPCTAEFDHAVTVVGYGADNKGHKYWIIKNSWAE 326

Query: 315 TWGENGYIRMQRDIDAKEGLCGIAMQASYPT 345
           TWGE GY RMQR + AKEGLCGIA  ASYP 
Sbjct: 327 TWGEKGYGRMQRGVAAKEGLCGIATHASYPV 357


>gi|325303202|tpg|DAA34687.1| TPA_inf: cathepsin L-like cysteine proteinase B [Amblyomma
           variegatum]
          Length = 337

 Score =  280 bits (717), Expect = 5e-73,   Method: Compositional matrix adjust.
 Identities = 153/307 (49%), Positives = 199/307 (64%), Gaps = 11/307 (3%)

Query: 44  AQYGRVYRDNAEKEMRFKIFKENVEYIASFNNK-ARNK-PYKLGINEFADQTNEEFRAPR 101
           A +G+ Y+   E+  R KI+ EN   IA  N K A NK  YKL +NE+ D  + EF + R
Sbjct: 34  ALHGKEYQSETEEYYRLKIYMENRMMIARHNEKYANNKVSYKLAMNEYGDMLHHEFVSTR 93

Query: 102 NGYKRRLPSVRSSETTDVSFR-YENASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAA 160
           NG++R   S     +  +     E+  +P ++DWRKKGAVT VK+QGQCG CWAFS   +
Sbjct: 94  NGFRRDYRSKPRQGSFYIEPEGIEDKHLPKTVDWRKKGAVTPVKNQGQCGSCWAFSTTGS 153

Query: 161 MEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKA 220
           +EG +   +  + SLSEQ LVDC T+  + GCEGGLMD+AF++I +N G+ TE  YPY  
Sbjct: 154 LEGQHFRKSGDMVSLSEQNLVDCSTAFGNNGCEGGLMDNAFKYIKANGGIDTEKSYPYNG 213

Query: 221 SDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVAN-QPVSVAIDASGSDFQFYSSGVF 279
           +DG+C+ K+++  A   +G+ D+P  NE  L KAVA   P+SVAIDAS   FQFYS GV+
Sbjct: 214 TDGTCHFKKSDVGATD-TGFVDIPEGNEHLLKKAVATVGPISVAIDASHQSFQFYSQGVY 272

Query: 280 -TGQCGTE-LDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGI 337
              +C +E LDHGV  VGYGT DD   YWLVKNSWGTTWG+ GYI M R+   K+  CGI
Sbjct: 273 DEPECSSENLDHGVLVVGYGTKDD-QDYWLVKNSWGTTWGDGGYIYMTRN---KDNQCGI 328

Query: 338 AMQASYP 344
           A  ASYP
Sbjct: 329 ASSASYP 335


>gi|262410743|gb|ACY66807.1| cathepsin L [Aphis gossypii]
          Length = 341

 Score =  280 bits (716), Expect = 7e-73,   Method: Compositional matrix adjust.
 Identities = 160/343 (46%), Positives = 206/343 (60%), Gaps = 14/343 (4%)

Query: 12  LAAILVLG--VWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEY 69
           +  ++VLG  V+A  S S    +  + E   ++ AQ+ ++Y D  E+  R K++ +N   
Sbjct: 1   MKVVIVLGLVVFAISSVSSINLNEVIEEEWSLFKAQFKKIYEDVKEEAFRKKVYLDNKLK 60

Query: 70  IASFNN--KARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTD--VSF-RYE 124
           IA  N   +   + Y L +N F D    E++   NG+K  L     + T D  V+F + E
Sbjct: 61  IARHNKLYETGEETYALEMNHFGDLMQHEYKKMMNGFKPSLAGGDKNFTDDDAVTFLKSE 120

Query: 125 NASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCD 184
           N  VP +IDWRKKG VT VK+QGQCG CW+FSA  ++EG +   T  L SLSEQ L+DC 
Sbjct: 121 NVVVPKAIDWRKKGYVTPVKNQGQCGSCWSFSATGSLEGQHFRKTGVLVSLSEQNLIDCS 180

Query: 185 TSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVP 244
               + GCEGGLMD AF++I SNKGL TE  YPY+A D  C     N S A   G+ D+P
Sbjct: 181 RKYGNNGCEGGLMDLAFKYIKSNKGLDTEKSYPYEAEDDKCRYNPEN-SGATDKGFVDIP 239

Query: 245 SNNEAALMKAVANQ-PVSVAIDASGSDFQFYSSGVFTG-QC-GTELDHGVTAVGYGTADD 301
             +E ALM A+A   PVS+AIDAS   FQFY  GVF   +C  TELDHGV AVGYGT   
Sbjct: 240 EGDEDALMHALATVGPVSIAIDASSEKFQFYKKGVFYNPRCSSTELDHGVLAVGYGTDHK 299

Query: 302 GTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
           G  YW+VKNSWG TWG+ GYI M R+   K+  CG+A  ASYP
Sbjct: 300 GGDYWIVKNSWGKTWGDQGYIMMARN---KKNNCGVASSASYP 339


>gi|126681066|gb|ABO26562.1| cathepsin L-like cysteine protease [Ixodes ricinus]
          Length = 335

 Score =  280 bits (716), Expect = 7e-73,   Method: Compositional matrix adjust.
 Identities = 154/309 (49%), Positives = 197/309 (63%), Gaps = 11/309 (3%)

Query: 42  WMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNK-ARNK-PYKLGINEFADQTNEEFRA 99
           + A++G+ Y    E+  R KI+ EN   IA  N K AR + PY + +NEF D  + EF +
Sbjct: 30  FKAKHGKSYVSETEEVFRLKIYMENRHKIAKHNEKYARGEVPYSMAMNEFGDMLHHEFVS 89

Query: 100 PRNGYKRRLP-SVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAV 158
            RNG+KR      R   T       E+ S+P ++DWR KGAVT VK+QGQCG CWAFSA 
Sbjct: 90  TRNGFKRNYKDQPREGSTYLEPENIEDFSLPKTVDWRTKGAVTPVKNQGQCGSCWAFSAT 149

Query: 159 AAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPY 218
            ++EG +   +  + SLSEQ LV C T   + GCEGGLMDDAF++I +NKG+ TE  YPY
Sbjct: 150 GSLEGQHFRKSGSMVSLSEQNLVGCSTDFGNNGCEGGLMDDAFKYIRANKGIDTEKSYPY 209

Query: 219 KASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVAN-QPVSVAIDASGSDFQFYSSG 277
             +DG+C+ K++   A   SG+ D+   +E  L KAVA   P+SVAIDAS   FQFYS G
Sbjct: 210 NGTDGTCHFKKSTVGATD-SGFVDIKEGSETQLKKAVATVGPISVAIDASHESFQFYSDG 268

Query: 278 VF-TGQCGTE-LDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLC 335
           V+   +C +E LDHGV  VGYGT  +GT YW VKNSWGTTWG+ GYIRM R+   K+  C
Sbjct: 269 VYDEPECDSESLDHGVLVVGYGTL-NGTDYWFVKNSWGTTWGDEGYIRMSRN---KKNQC 324

Query: 336 GIAMQASYP 344
           GIA  AS P
Sbjct: 325 GIASSASIP 333


>gi|359483753|ref|XP_002266308.2| PREDICTED: oryzain alpha chain-like [Vitis vinifera]
          Length = 501

 Score =  280 bits (716), Expect = 7e-73,   Method: Compositional matrix adjust.
 Identities = 160/365 (43%), Positives = 217/365 (59%), Gaps = 43/365 (11%)

Query: 8   NKLVLAAILVLGVWA---------PQSWSRTLNDATMNER----HEMWMAQYGRVYRDNA 54
            K+ LA  LVL +WA         P  +  T  +    ER      +W  ++ RVY+   
Sbjct: 4   QKIQLA--LVLFIWASLACLSSSLPTEFYITGEEFASEERVRELFHLWKERHKRVYKHAE 61

Query: 55  EKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFR-----------APRNG 103
           E   RF+IFKEN++Y+   N+K     + LG+N+FAD +NEEF+             +N 
Sbjct: 62  ETAKRFEIFKENLKYVIERNSKGHR--HTLGMNKFADMSNEEFKEKYLSKIKKPINKKNN 119

Query: 104 YKRRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEG 163
           Y RR  S++  + T       +   P+S+DWRKKG VTG+KDQG CG CWAFS+  AMEG
Sbjct: 120 YLRR--SMQQKKGT------ASCEAPSSLDWRKKGVVTGIKDQGDCGSCWAFSSTGAMEG 171

Query: 164 INHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDG 223
           IN I T  L SLSEQELVDCDT+  + GCEGG MD AFE++ISN G+ +E+ YPY  +DG
Sbjct: 172 INAIVTGDLISLSEQELVDCDTT--NYGCEGGYMDYAFEWVISNGGIDSESDYPYTGTDG 229

Query: 224 SCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTG-- 281
           +CN  + +     I GY+DV   +++AL+ A  NQP+SV +D S  DFQ Y+SG++ G  
Sbjct: 230 TCNTTKEDTKVVSIDGYKDV-DESDSALLCAAVNQPISVGMDGSALDFQLYTSGIYAGDC 288

Query: 282 -QCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQ 340
                ++DH V  VGYG+ +D   YW+ KNSWGT+WG  GY  ++R+ D   G C I   
Sbjct: 289 SDDPDDIDHAVLIVGYGS-EDSEDYWICKNSWGTSWGMEGYFYIKRNTDLPYGECAINAM 347

Query: 341 ASYPT 345
           ASYPT
Sbjct: 348 ASYPT 352


>gi|386648112|gb|AFJ15103.1| mexicain-like cystein protease, partial [Jacaratia mexicana]
          Length = 348

 Score =  280 bits (716), Expect = 7e-73,   Method: Compositional matrix adjust.
 Identities = 147/320 (45%), Positives = 197/320 (61%), Gaps = 17/320 (5%)

Query: 31  NDATMNER----HEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGI 86
           +D T  ER     E WM ++ RVY +  EK  RF+IFK+N+ YI   N K  N  Y LG+
Sbjct: 36  DDLTSTERLIRLFESWMLKHDRVYNNIEEKIHRFEIFKDNLMYIDETNKK--NNSYWLGL 93

Query: 87  NEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENA-SVPASIDWRKKGAVTGVKD 145
           NEF D T++EF+    G       V   ++ D  F Y++    P SIDWR KGAVT VK 
Sbjct: 94  NEFVDLTHDEFKEKYVGSIGE-DFVTIEQSNDEEFPYKHVVDYPESIDWRDKGAVTPVKP 152

Query: 146 QGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFII 205
              CG CWAFS VA +EGIN I T KL SLSEQEL+DCD      GC+GG    + ++++
Sbjct: 153 N-PCGSCWAFSTVATVEGINKIVTGKLISLSEQELLDCDR--RSHGCKGGYQTTSLQYVV 209

Query: 206 SNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAID 265
            N G+ TE +YPY+   G C  KE   +  +I+GY+ VP+N+E +L++A+ANQPVSV ++
Sbjct: 210 DN-GVHTEKEYPYEKKQGKCRAKEKKGTKVQITGYKRVPANDEISLIQAIANQPVSVLLE 268

Query: 266 ASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQ 325
           + G  FQ Y  G+F G CGT+LDH VTA+GYG       Y L+KNSWG  WGE GY++++
Sbjct: 269 SKGRAFQLYKGGIFNGPCGTKLDHAVTAIGYGKT-----YILIKNSWGPNWGEKGYLKIK 323

Query: 326 RDIDAKEGLCGIAMQASYPT 345
           R     EG CG+   + +PT
Sbjct: 324 RASGKSEGTCGVYKSSYFPT 343


>gi|390337642|ref|XP_780653.3| PREDICTED: cathepsin L-like [Strongylocentrotus purpuratus]
          Length = 333

 Score =  280 bits (715), Expect = 8e-73,   Method: Compositional matrix adjust.
 Identities = 161/338 (47%), Positives = 207/338 (61%), Gaps = 18/338 (5%)

Query: 14  AILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASF 73
           ++L++      S S +  D   +E    W  ++G+ Y  + E+  R  I+++N++ +   
Sbjct: 5   SVLLVAACVVSSLSMSFTD--FDEDWNEWKNEHGKRYLSDEEEASRRLIWQKNLDIVIKH 62

Query: 74  NNK--ARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYEN--ASVP 129
           N K    +  Y LGIN+F D  NEEF A   G++    S  +  +T   F   N    +P
Sbjct: 63  NLKYDLGHFTYDLGINQFTDLQNEEFVAMMTGFRVSGTSKAAKGST---FLPPNNVGELP 119

Query: 130 ASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGED 189
            ++DWR KG VT VKDQGQCG CWAFS   ++EG +   T KL SLSEQ LVDC  SG D
Sbjct: 120 KTVDWRTKGYVTPVKDQGQCGSCWAFSTTGSVEGQHFKATGKLVSLSEQNLVDC--SGRD 177

Query: 190 QGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEA 249
            GC+GG MD AF++II   G+ TEA YPYKA DG C+ K+AN   A ++GY DV S +E 
Sbjct: 178 AGCDGGFMDRAFQYIIDAGGIDTEASYPYKAVDGKCHFKKANV-GATVTGYTDVTSGSEK 236

Query: 250 ALMKAVAN-QPVSVAIDASGSDFQFYSSGVFT--GQCGTELDHGVTAVGYGTADDGTKYW 306
           AL KAVA+  P+SVAIDAS   FQ Y SGV+   G   T LDHGV AVGYGT+ DGT YW
Sbjct: 237 ALQKAVAHVGPISVAIDASHMSFQHYKSGVYNEPGCDSTVLDHGVLAVGYGTSSDGTDYW 296

Query: 307 LVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
           +VKNSW  TWG NGY+ M R+   K+  CGIA  ASYP
Sbjct: 297 IVKNSWAETWGMNGYVWMSRN---KDNQCGIATNASYP 331


>gi|302763109|ref|XP_002964976.1| hypothetical protein SELMODRAFT_83176 [Selaginella moellendorffii]
 gi|302763113|ref|XP_002964978.1| hypothetical protein SELMODRAFT_83554 [Selaginella moellendorffii]
 gi|300167209|gb|EFJ33814.1| hypothetical protein SELMODRAFT_83176 [Selaginella moellendorffii]
 gi|300167211|gb|EFJ33816.1| hypothetical protein SELMODRAFT_83554 [Selaginella moellendorffii]
          Length = 300

 Score =  280 bits (715), Expect = 9e-73,   Method: Compositional matrix adjust.
 Identities = 140/308 (45%), Positives = 201/308 (65%), Gaps = 14/308 (4%)

Query: 40  EMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRA 99
           E W A++ + Y  + EK  R  +F + + YI   N +  N  + LG+N+F+D TN EFRA
Sbjct: 3   EDWAAKHDKSYSSDWEKARRLMVFSDTLAYIEKHNAQP-NTTFTLGLNKFSDLTNAEFRA 61

Query: 100 PRNGYKR--RLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQCGCCWAFSA 157
              G  +  R    R ++  DV      +S+P S+DWR++GAVT +KDQGQCG CWAFSA
Sbjct: 62  NYVGKFKPPRYQDRRPAKDVDVDV----SSLPTSLDWRQEGAVTPIKDQGQCGSCWAFSA 117

Query: 158 VAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYP 217
           +A++E  + + T++L SLSEQ+L+DCDT   DQGC+GG  DDAF+F++ N G+ TE  YP
Sbjct: 118 IASIESAHFLATKELVSLSEQQLIDCDTV--DQGCQGGFPDDAFKFVVENGGVTTEEAYP 175

Query: 218 YKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSG 277
           Y    GSCN  +      +I+GY+DV  ++  ALMKAV+  PV+V I  S  +FQ Y SG
Sbjct: 176 YTGFAGSCNTNKNK--VVEITGYKDVTKDSADALMKAVSKTPVTVGICGSDQNFQNYRSG 233

Query: 278 VFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGI 337
           + +GQC    DH V  +GYGT + G  YW++KNSWGT+WGE+G++++++     EG+CG+
Sbjct: 234 ILSGQCCNSRDHAVLVIGYGT-EGGMPYWIIKNSWGTSWGEDGFMKIKK--KDGEGMCGM 290

Query: 338 AMQASYPT 345
             Q+SYPT
Sbjct: 291 NGQSSYPT 298


>gi|115464789|ref|NP_001055994.1| Os05g0508300 [Oryza sativa Japonica Group]
 gi|48475189|gb|AAT44258.1| hypothetical protein [Oryza sativa Japonica Group]
 gi|113579545|dbj|BAF17908.1| Os05g0508300 [Oryza sativa Japonica Group]
          Length = 450

 Score =  280 bits (715), Expect = 1e-72,   Method: Compositional matrix adjust.
 Identities = 145/307 (47%), Positives = 188/307 (61%), Gaps = 5/307 (1%)

Query: 38  RHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEF 97
           + E W A++GR Y    E+  R   F +N  ++A+ N    +  Y L +N FAD T++EF
Sbjct: 37  QFEAWCAEHGRSYATPGERAARLAAFADNAAFVAAHNGAPAS--YALALNAFADLTHDEF 94

Query: 98  RAPRNGYKRRLPSV-RSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQCGCCWAFS 156
           RA R G         R      +       +VP ++DWR+ GAVT VKDQG CG CW+FS
Sbjct: 95  RAARLGRLAAAGGPGRDGGAPYLGVDGGVGAVPDAVDWRQSGAVTKVKDQGSCGACWSFS 154

Query: 157 AVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKY 216
           A  AMEGIN I T  L SLSEQEL+DCD S  + GC GGLMD A++F++ N G+ TEA Y
Sbjct: 155 ATGAMEGINKIKTGSLISLSEQELIDCDRS-YNSGCGGGLMDYAYKFVVKNGGIDTEADY 213

Query: 217 PYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSS 276
           PY+ +DG+CNK +       I GY+DVP+NNE  L++AVA QPVSV I  S   FQ YS 
Sbjct: 214 PYRETDGTCNKNKLKRRVVTIDGYKDVPANNEDMLLQAVAQQPVSVGICGSARAFQLYSK 273

Query: 277 GVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCG 336
           G+F G C T LDH +  VGYG+ + G  YW+VKNSWG +WG  GY+ M R+     G+CG
Sbjct: 274 GIFDGPCPTSLDHAILIVGYGS-EGGKDYWIVKNSWGESWGMKGYMYMHRNTGNSNGVCG 332

Query: 337 IAMQASY 343
           I    S+
Sbjct: 333 INQMPSF 339


>gi|222625810|gb|EEE59942.1| hypothetical protein OsJ_12596 [Oryza sativa Japonica Group]
          Length = 213

 Score =  280 bits (715), Expect = 1e-72,   Method: Compositional matrix adjust.
 Identities = 139/215 (64%), Positives = 164/215 (76%), Gaps = 3/215 (1%)

Query: 132 IDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQG 191
           +DWR  GAVTGVKDQG CGCCWAFSAVAA+EG+  I T +L SLSEQELVDCD  GEDQG
Sbjct: 1   MDWRAMGAVTGVKDQGSCGCCWAFSAVAAVEGLAKIRTGQLVSLSEQELVDCDVRGEDQG 60

Query: 192 CEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAAL 251
           CEGGLMD AF++I    GLA E+ YPY+  DG+  +  A  +AA I G++DVPSN+E AL
Sbjct: 61  CEGGLMDTAFQYIARRGGLAAESSYPYRGVDGA-CRAAAGRAAASIRGFQDVPSNDEGAL 119

Query: 252 MKAVANQPVSVAIDASGSDFQFYSSGVFTGQ-CGTELDHGVTAVGYGTADDGTKYWLVKN 310
           M AVA QPVSVAI+ +G  F+FY  GV  G  CGTEL+H VTAVGYGTA DGT YWL+KN
Sbjct: 120 MAAVARQPVSVAINGAGYVFRFYDRGVLGGAGCGTELNHAVTAVGYGTASDGTGYWLMKN 179

Query: 311 SWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPT 345
           SWG +WGE GY+R++R +  +EG CGIA  ASYP 
Sbjct: 180 SWGASWGEGGYVRIRRGV-GREGACGIAQMASYPV 213


>gi|357160095|ref|XP_003578656.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP2-like
           [Brachypodium distachyon]
          Length = 377

 Score =  279 bits (714), Expect = 1e-72,   Method: Compositional matrix adjust.
 Identities = 155/333 (46%), Positives = 200/333 (60%), Gaps = 27/333 (8%)

Query: 34  TMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFN-NKARNKPYKLGINEFADQ 92
           TM  R + W A++GR Y    E+  R +++  NV YI + N + A    Y+LG   + D 
Sbjct: 48  TMAPRFQRWKAEHGRAYATRDEELRRLRVYARNVRYIEAANGDPAAGLTYQLGETAYTDL 107

Query: 93  TNEEFRAPRNGYKRRLPSVRSSE---------TT----------DVSFRYENASVPASID 133
           T +EF A    Y    P + + +         TT           V F    A  PAS+D
Sbjct: 108 TADEFTAM---YTSPSPVLSAHDDEAAGAMMITTRAGAVDAGGQQVYFNVSTAGAPASVD 164

Query: 134 WRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCE 193
           WR KGAVT VK+QG+CG CWAFS VA +EGI+ I T  L SLSEQELVDCDT   D GC+
Sbjct: 165 WRAKGAVTEVKNQGRCGSCWAFSTVAVVEGIHQIRTGNLISLSEQELVDCDTL--DYGCD 222

Query: 194 GGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMK 253
           GG+   A E+I SN G+ATEA YPY   DG+C   +    AA ISG+  V + +E +L  
Sbjct: 223 GGVSYHALEWIASNGGIATEADYPYTGKDGACVANKLPLHAAAISGFARVATRSEPSLAN 282

Query: 254 AVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAV-GYGTADDGTKYWLVKNSW 312
           AVA QPV+V+I+A G++FQ Y  GV+ G CGT L+HGVT V       DG KYW+VKNSW
Sbjct: 283 AVAAQPVAVSIEAGGANFQHYVKGVYNGPCGTRLNHGVTVVGYGEEEGDGEKYWIVKNSW 342

Query: 313 GTTWGENGYIRMQRDIDAK-EGLCGIAMQASYP 344
           G  WG+ GY RM++D+  K EGLCGIA++ S+P
Sbjct: 343 GKKWGDGGYFRMKKDVAGKPEGLCGIAIRPSFP 375


>gi|154183745|gb|ABS70713.1| cathepsin L-like cysteine proteinase [Dermacentor variabilis]
          Length = 333

 Score =  279 bits (714), Expect = 1e-72,   Method: Compositional matrix adjust.
 Identities = 157/313 (50%), Positives = 201/313 (64%), Gaps = 17/313 (5%)

Query: 40  EMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNK-ARNK-PYKLGINEFADQTNEEF 97
           E + A + + Y+ N E+ +RFKIF EN   +A  N K AR    YKLG+N+F D    EF
Sbjct: 28  EAFKATHKKSYQSNMEELLRFKIFSENSLLVARHNEKYARGLVSYKLGMNQFGDLLPHEF 87

Query: 98  RAPRNGYKRRLPSVRSSE---TTDVSFRYENASVPASIDWRKKGAVTGVKDQGQCGCCWA 154
               NGY+    + R S      +V++    +S+P S+DWR+KGAVT VK+QGQCG CWA
Sbjct: 88  ARMFNGYRGARTAGRGSTFLPPANVNY----SSLPQSMDWREKGAVTPVKNQGQCGSCWA 143

Query: 155 FSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEA 214
           FS   ++EG + + T  L SLSEQ LVDC  +  + GCEGGLMD+AF++I +N G+ TE 
Sbjct: 144 FSTTGSLEGQHFLKTGVLVSLSEQNLVDCSETFGNHGCEGGLMDNAFQYIKANGGIDTEK 203

Query: 215 KYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVAN-QPVSVAIDASGSDFQF 273
            YPY+A DG C  K+ N  A   +G+ D+   +E  L KAVA   PVSVAIDAS S FQ 
Sbjct: 204 SYPYEAEDGECRFKKQNVGATD-TGFVDIEQGSEDDLKKAVATVGPVSVAIDASHSSFQL 262

Query: 274 YSSGVF-TGQCGTE-LDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAK 331
           YS GV+   +C +E LDHGV  VGYG  +DG KYWLVKNSW  +WG+NGYI+M RD D +
Sbjct: 263 YSEGVYDETECSSEQLDHGVLVVGYGV-EDGKKYWLVKNSWAESWGDNGYIKMSRDKDNQ 321

Query: 332 EGLCGIAMQASYP 344
              CGIA  ASYP
Sbjct: 322 ---CGIASAASYP 331


>gi|66735056|gb|AAY53767.1| cysteine protease [Saprolegnia parasitica]
          Length = 523

 Score =  279 bits (713), Expect = 1e-72,   Method: Compositional matrix adjust.
 Identities = 143/316 (45%), Positives = 200/316 (63%), Gaps = 6/316 (1%)

Query: 32  DATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFAD 91
           DA+   +   WM ++  V  +  E   RF++F  N + I + N  A +  + +G NE++ 
Sbjct: 21  DASYEAKFLSWMKKFA-VKLNPLEWVHRFEVFILNDQRIEAHNKDASSS-FTMGHNEYSH 78

Query: 92  QTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENAS-VPASIDWRKKGAVTGVKDQGQCG 150
            T +EF+  R G +     ++S     +     N + VP  +DW ++G VT VK+QG CG
Sbjct: 79  LTFDEFKKLRTGLRVSPSYIQSRAKYALMAPAVNMTDVPNEMDWVEQGGVTPVKNQGMCG 138

Query: 151 CCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGL 210
            CWAFS   A+EG   +++++L S+SEQELVDCD +G D GC GGLMD+AF+++ ++KGL
Sbjct: 139 SCWAFSTTGAIEGAAFVSSKQLVSVSEQELVDCDHNG-DMGCNGGLMDNAFKWVKTHKGL 197

Query: 211 ATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSD 270
             E  YPY A +G+C  K+  P   K++ + DVP+N+E AL  AVA QPVSVAI+A   +
Sbjct: 198 CKEEDYPYHAKEGTCALKKCKP-VTKVTAFHDVPANDEQALKAAVAKQPVSVAIEADQPE 256

Query: 271 FQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDA 330
           FQFY SGVF   CGT+LDHGV  VGYG  + G KYW VKNSWG  WG+ GYI++ R+   
Sbjct: 257 FQFYKSGVFDKSCGTKLDHGVLVVGYGE-EGGKKYWKVKNSWGADWGDKGYIKLAREFGP 315

Query: 331 KEGLCGIAMQASYPTA 346
           + G CG+AM  SYPTA
Sbjct: 316 ETGQCGVAMVPSYPTA 331


>gi|312381833|gb|EFR27483.1| hypothetical protein AND_05794 [Anopheles darlingi]
          Length = 344

 Score =  279 bits (713), Expect = 1e-72,   Method: Compositional matrix adjust.
 Identities = 156/331 (47%), Positives = 201/331 (60%), Gaps = 25/331 (7%)

Query: 33  ATMNERHEMWMA---QYGRVYRDNAEKEMRFKIFKENVEYIASFNNKAR--NKPYKLGIN 87
           +  N   E W A   Q+ + Y   +E+ +R KI+ +N   IA  N +     + ++L +N
Sbjct: 18  SIFNLVKEEWNAFKLQHRKKYDSESEERIRMKIYVQNKHKIAKHNQRYDLGQEKFRLRVN 77

Query: 88  EFADQTNEEFRAPRNGYKRRLPS---------VRSSETTDVSFRYENASVPASIDWRKKG 138
           ++AD  +EEF    NG+ R   +         + + E         N  VP +IDWR+KG
Sbjct: 78  KYADLLHEEFVHTLNGFNRSAAAGSKLLGREQLMTIEEPITWIEPANVDVPTTIDWREKG 137

Query: 139 AVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMD 198
           AVT VKDQG CG CW+FSA  A+EG +   T KL SLSEQ LVDC T   + GC GGLMD
Sbjct: 138 AVTPVKDQGHCGSCWSFSATGALEGQHFRKTGKLVSLSEQNLVDCSTKYGNNGCNGGLMD 197

Query: 199 DAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSA--AKISGYEDVPSNNEAALMKAVA 256
           +AF+++  NKG+ TE  YPY+A D  C+    NP A  A   G+ D+P  +E AL KA+A
Sbjct: 198 NAFQYVKDNKGIDTEKAYPYEAIDDECH---YNPKAIGATDKGFVDIPQGDEKALKKALA 254

Query: 257 NQ-PVSVAIDASGSDFQFYSSGV-FTGQCGTE-LDHGVTAVGYGTADDGTKYWLVKNSWG 313
              PVSVAIDAS   FQFYS GV +  QC +E LDHGV AVGYGT +DG  YWLVKNSWG
Sbjct: 255 TVGPVSVAIDASHESFQFYSEGVYYEPQCDSEQLDHGVLAVGYGTTEDGEDYWLVKNSWG 314

Query: 314 TTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
           TTWG+ GY++M R+   +E  CGIA  ASYP
Sbjct: 315 TTWGDQGYVKMARN---RENHCGIATTASYP 342


>gi|242093944|ref|XP_002437462.1| hypothetical protein SORBIDRAFT_10g027570 [Sorghum bicolor]
 gi|241915685|gb|EER88829.1| hypothetical protein SORBIDRAFT_10g027570 [Sorghum bicolor]
          Length = 366

 Score =  279 bits (713), Expect = 1e-72,   Method: Compositional matrix adjust.
 Identities = 153/325 (47%), Positives = 191/325 (58%), Gaps = 26/325 (8%)

Query: 39  HEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEF- 97
           +E W A Y  + RD+ EK  RF +FKEN   I   N++  N  Y LG+N F+D T+EEF 
Sbjct: 48  YERWCAHY-NMARDHGEKTRRFDLFKENARRIYEHNHQG-NATYTLGLNRFSDMTDEEFN 105

Query: 98  RAPRNGYK----------RRLPSVRSSETTDVSFRYENAS------VPASIDWRKKGAVT 141
           R+P  G              L      +  D SF   + S       P ++DWR + AVT
Sbjct: 106 RSPYGGCLTAPRMSDDEIEELHHHHHQQEDDGSFNLTHGSGGGKLGAPPAVDWRGR-AVT 164

Query: 142 GVKDQGQ-CGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDA 200
            VKDQG  CG CWAFSA+AA+EGIN I TR L  LSEQ+LVDCD    + GC GGLM  A
Sbjct: 165 RVKDQGPTCGSCWAFSAIAAVEGINAIRTRNLVPLSEQQLVDCDKL--NHGCNGGLMTTA 222

Query: 201 FEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPV 260
           F F++ N+G+  E  YPY   +G C    A P    I GY+ VP  +  ALM AVA QPV
Sbjct: 223 FSFVVRNRGVVPEGAYPYMGREGRCKHVMAPP--VTIYGYQRVPRFDANALMNAVAAQPV 280

Query: 261 SVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENG 320
           SVAI+AS  +F+ Y  GVF G CG  L H  TAVGYG AD G  +W+VKNSWG  WGE G
Sbjct: 281 SVAIEASSFEFRHYQGGVFNGNCGGRLGHAATAVGYG-ADAGGPFWIVKNSWGPGWGEGG 339

Query: 321 YIRMQRDIDAKEGLCGIAMQASYPT 345
           Y+R+ R+   ++G+CGI  + SYP 
Sbjct: 340 YVRISRNTPVRQGVCGILTENSYPV 364


>gi|8886940|gb|AAF80626.1|AC069251_19 F2D10.37 [Arabidopsis thaliana]
          Length = 315

 Score =  279 bits (713), Expect = 2e-72,   Method: Compositional matrix adjust.
 Identities = 137/264 (51%), Positives = 184/264 (69%), Gaps = 5/264 (1%)

Query: 37  ERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEE 96
           E  E W++ + + Y    EK +RF++FK+N+++I   N K   K Y LG+NEFAD ++EE
Sbjct: 49  ELFENWISNFEKAYETVEEKFLRFEVFKDNLKHIDETNKKG--KSYWLGLNEFADLSHEE 106

Query: 97  FRAPRNGYKRRLPSVRSSETTDVSFRYENA-SVPASIDWRKKGAVTGVKDQGQCGCCWAF 155
           F+    G K  +   R  E +   F Y +  +VP S+DWRKKGAV  VK+QG CG CWAF
Sbjct: 107 FKKMYLGLKTDIVR-RDEERSYAEFAYRDVEAVPKSVDWRKKGAVAEVKNQGSCGSCWAF 165

Query: 156 SAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAK 215
           S VAA+EGIN I T  LT+LSEQEL+DCDT+  + GC GGLMD AFE+I+ N GL  E  
Sbjct: 166 STVAAVEGINKIVTGNLTTLSEQELIDCDTT-YNNGCNGGLMDYAFEYIVKNGGLRKEED 224

Query: 216 YPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYS 275
           YPY   +G+C  ++       I+G++DVP+N+E +L+KA+A+QP+SVAIDASG +FQFYS
Sbjct: 225 YPYSMEEGTCEMQKDESETVTINGHQDVPTNDEKSLLKALAHQPLSVAIDASGREFQFYS 284

Query: 276 SGVFTGQCGTELDHGVTAVGYGTA 299
            GVF G+CG +LDHGV AVGYG++
Sbjct: 285 GGVFDGRCGVDLDHGVAAVGYGSS 308


>gi|195056367|ref|XP_001995082.1| GH22826 [Drosophila grimshawi]
 gi|193899288|gb|EDV98154.1| GH22826 [Drosophila grimshawi]
          Length = 340

 Score =  279 bits (713), Expect = 2e-72,   Method: Compositional matrix adjust.
 Identities = 149/320 (46%), Positives = 196/320 (61%), Gaps = 13/320 (4%)

Query: 34  TMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNN--KARNKPYKLGINEFAD 91
            + E  + +  ++ + Y+D  E+  R KIF EN   IA  N    A    +K+G+N++AD
Sbjct: 23  VIKEEWQTFKLEHRKQYQDETEERFRLKIFNENKHKIAKHNQLYAAGEVSFKMGLNKYAD 82

Query: 92  QTNEEFRAPRNGYKRRL-PSVRSSETTDVSFRY---ENASVPASIDWRKKGAVTGVKDQG 147
             + EF    NG+   L   +R+S+ T     +   E+  +P S+DWR KGAVTGVKDQG
Sbjct: 83  MLHHEFHETMNGFNYTLHKQLRASDATFTGVTFISPEHVKLPQSVDWRNKGAVTGVKDQG 142

Query: 148 QCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISN 207
            CG CWAFS+  A+EG +   T  L SLSEQ LVDC T   + GC GGLMD+AF +I  N
Sbjct: 143 HCGSCWAFSSTGALEGQHFRKTGTLISLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKDN 202

Query: 208 KGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVAN-QPVSVAIDA 266
            G+ TE  YPY+  D SC+  +    A    G+ D+P  +E  L +AVA   PVSVAIDA
Sbjct: 203 GGIDTEKSYPYEGIDDSCHFNKGTIGATD-RGFTDIPQGDEKKLAQAVATIGPVSVAIDA 261

Query: 267 SGSDFQFYSSGVF-TGQCGTE-LDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRM 324
           S   FQFYS+GV+   QC  + LDHGV  VGYGT ++G  YWLVKNSWGTTWG+ G+I+M
Sbjct: 262 SHESFQFYSTGVYDEPQCDPQNLDHGVLVVGYGTDENGKDYWLVKNSWGTTWGDKGFIKM 321

Query: 325 QRDIDAKEGLCGIAMQASYP 344
            R+ D +   CGIA  +SYP
Sbjct: 322 ARNDDNQ---CGIATASSYP 338


>gi|224116884|ref|XP_002317418.1| predicted protein [Populus trichocarpa]
 gi|222860483|gb|EEE98030.1| predicted protein [Populus trichocarpa]
          Length = 503

 Score =  278 bits (712), Expect = 2e-72,   Method: Compositional matrix adjust.
 Identities = 143/325 (44%), Positives = 200/325 (61%), Gaps = 9/325 (2%)

Query: 26  WSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIAS-FNNKARNKPYKL 84
           +S  +++ ++ E  + W  ++ +VY   AE E R++ FK N++YI      K     + +
Sbjct: 37  FSELVSEESIIEIFQQWRDRHQKVYEHAAESEKRYRNFKRNLKYIIEKAGKKTAALGHSV 96

Query: 85  GINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFR-YENASVPASIDWRKKGAVTGV 143
           G+N+FAD +NEEF+       ++  +++ S   D   R  +    P+S+DWRKKG VT V
Sbjct: 97  GLNKFADLSNEEFKELYLSKVKKPINIKRSTARDWRQRNLQTCDAPSSLDWRKKGVVTAV 156

Query: 144 KDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEF 203
           KDQG CG CW+FS   A+EGIN I T  L SLSEQELVDCDT+  + GCEGG MD AFE+
Sbjct: 157 KDQGDCGSCWSFSTTGAIEGINAIVTGDLISLSEQELVDCDTT--NYGCEGGYMDYAFEW 214

Query: 204 IISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVA 263
           +I+N G+ TEA YPY   DG+CN  +       I GY DV    ++AL+ A   QP+SV 
Sbjct: 215 VINNGGIDTEANYPYTGVDGTCNTTKEEIKVVSIDGYTDV-DETDSALLCATVQQPISVG 273

Query: 264 IDASGSDFQFYSSGVFTGQCG---TELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENG 320
           +D S  DFQ Y+ G++ G C     ++DH V  VGYG+ ++G  YW+VKNSWGT WG  G
Sbjct: 274 MDGSALDFQLYTGGIYDGDCSDDPNDIDHAVLIVGYGS-ENGEDYWIVKNSWGTEWGMEG 332

Query: 321 YIRMQRDIDAKEGLCGIAMQASYPT 345
           Y  ++R+ D   G+C I  +ASYPT
Sbjct: 333 YFYIKRNTDLPYGVCAINAEASYPT 357


>gi|2507252|sp|P14080.2|PAPA2_CARPA RecName: Full=Chymopapain; AltName: Full=Papaya proteinase II;
           Short=PPII; Flags: Precursor
 gi|1332461|emb|CAA66378.1| chymopapain [Carica papaya]
          Length = 352

 Score =  278 bits (712), Expect = 2e-72,   Method: Compositional matrix adjust.
 Identities = 148/347 (42%), Positives = 206/347 (59%), Gaps = 14/347 (4%)

Query: 3   MILLENKLVLAAILVLGVWAPQSWSRTLNDATMNER----HEMWMAQYGRVYRDNAEKEM 58
           +I L   L++   L    +    +S+  +D T  ER     + WM ++ ++Y    EK  
Sbjct: 10  IIFLATCLIIHMGLSSADFYTVGYSQ--DDLTSIERLIQLFDSWMLKHNKIYESIDEKIY 67

Query: 59  RFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGY-KRRLPSVRSSETT 117
           RF+IF++N+ YI   N K  N  Y LG+N FAD +N+EF+    G+       +   +  
Sbjct: 68  RFEIFRDNLMYIDETNKK--NNSYWLGLNGFADLSNDEFKKKYVGFVAEDFTGLEHFDNE 125

Query: 118 DVSFRYENASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSE 177
           D ++++   + P SIDWR KGAVT VK+QG CG CWAFS +A +EGIN I T  L  LSE
Sbjct: 126 DFTYKHV-TNYPQSIDWRAKGAVTPVKNQGACGSCWAFSTIATVEGINKIVTGNLLELSE 184

Query: 178 QELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKI 237
           QELVDCD      GC+GG    + +++ +N G+ T   YPY+A    C   +      KI
Sbjct: 185 QELVDCDK--HSYGCKGGYQTTSLQYV-ANNGVHTSKVYPYQAKQYKCRATDKPGPKVKI 241

Query: 238 SGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYG 297
           +GY+ VPSN E + + A+ANQP+SV ++A G  FQ Y SGVF G CGT+LDH VTAVGYG
Sbjct: 242 TGYKRVPSNCETSFLGALANQPLSVLVEAGGKPFQLYKSGVFDGPCGTKLDHAVTAVGYG 301

Query: 298 TADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
           T+ DG  Y ++KNSWG  WGE GY+R++R     +G CG+   + YP
Sbjct: 302 TS-DGKNYIIIKNSWGPNWGEKGYMRLKRQSGNSQGTCGVYKSSYYP 347


>gi|66378053|gb|AAY45871.1| cathepsin L-like cysteine proteinase [Longidorus elongatus]
          Length = 358

 Score =  278 bits (712), Expect = 2e-72,   Method: Compositional matrix adjust.
 Identities = 155/358 (43%), Positives = 215/358 (60%), Gaps = 22/358 (6%)

Query: 5   LLENKLVLAAILVLGVWAPQSWSRT----LNDATMNE-RHEMWM---AQYGRVYRDNAEK 56
           ++   L+L +I +LG    +  S+      N+  +N   + +W     ++ + Y+   E+
Sbjct: 1   MIRITLLLHSIFLLGFVNSEQISQIQEHPRNNLLINHPYYPVWTNFKLKHAKSYKTKDEE 60

Query: 57  EMRFKIFKENVEYIASFN--NKARNKPYKLGINEFADQTNEEFRAPRNGYK----RRLPS 110
            +RF++F  N + I   N   +A    + L +N+FAD TN EFR   NG+K    R+L  
Sbjct: 61  LLRFQVFASNHKVIEQHNIEYEAGQHSFALSLNKFADMTNAEFRQRMNGFKLPAKRKLAK 120

Query: 111 VRSSETTDVSFRY-ENASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITT 169
            +  +   + F   +N ++P S+DWRK+G VT VKDQG CG CWAFSA  ++EG ++  T
Sbjct: 121 SQPLKEDGMIFEMPDNVTIPDSVDWRKEGYVTKVKDQGSCGSCWAFSATGSLEGQHYKQT 180

Query: 170 RKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKE 229
            KL SLSEQ LVDCD +G+D+GC GG MD AF+++ +NKG+ TEA YPYK  DG C  K 
Sbjct: 181 GKLVSLSEQNLVDCDVNGDDEGCNGGYMDGAFQYVETNKGIDTEASYPYKGRDGRCRFKS 240

Query: 230 ANPSAAKISGYEDVPSNNEAALMKAVANQ-PVSVAIDASGSDFQFYSSGVFTGQ-CGTE- 286
            +  A   +G+ D+P  NE  L  A+A   PVSVAIDA+   FQFYS GV+  + C  E 
Sbjct: 241 EDVGATD-TGFVDIPEGNETLLEAAIATVGPVSVAIDAASFKFQFYSHGVYYDRSCSPEY 299

Query: 287 LDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
           LDHGV AVGY +  DG +Y++VKNSW   WG++GYI M R    K   CGIA  ASYP
Sbjct: 300 LDHGVLAVGYNSTKDGKQYYIVKNSWSEDWGDDGYILMSR---RKNNNCGIATMASYP 354


>gi|194320502|gb|ACF48469.1| cathepsin L [Triatoma brasiliensis]
          Length = 330

 Score =  278 bits (712), Expect = 2e-72,   Method: Compositional matrix adjust.
 Identities = 156/340 (45%), Positives = 211/340 (62%), Gaps = 21/340 (6%)

Query: 10  LVLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEY 69
           LV  AI+ L      S++    D    E H ++ A +G+ Y++  E+  R KIF +N + 
Sbjct: 5   LVAVAIIAL------SYAHPSFDIYPEEWH-VFKAMHGKTYKNQFEEMFRMKIFMDNKKK 57

Query: 70  IASFNNKARNK--PYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENAS 127
           I + N K       YK+ +N F D    EF+A  NG+K    + R+ E     +   N++
Sbjct: 58  IEAHNAKYEQGEVSYKMMMNHFGDLMVHEFKALMNGFKMSPDTKRNGEL----YFPSNSN 113

Query: 128 VPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSG 187
           +P ++DWR+KGAVT VKDQGQCG CW+FSA  ++EG   + T KL SLSEQ LVDC TS 
Sbjct: 114 LPKTVDWRQKGAVTPVKDQGQCGSCWSFSATGSLEGQVFLKTGKLVSLSEQNLVDCSTSY 173

Query: 188 EDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNN 247
            + GCEGGLMD AF+++  NKG+ TEA YPY+A + +C  K+ N       G+ D+P+ +
Sbjct: 174 GNNGCEGGLMDQAFQYVSDNKGIDTEASYPYEARENTCRFKK-NKVGGTDKGHVDIPAGD 232

Query: 248 EAALMKAVAN-QPVSVAIDASGSDFQFYSSGVFT-GQCGT-ELDHGVTAVGYGTADDGTK 304
           E AL  A+A   P+SVAIDA+   FQFYS GV+    C + +LDHGV AVGYGT ++G  
Sbjct: 233 EKALQNALATVGPISVAIDANHGSFQFYSKGVYNEPNCSSYDLDHGVLAVGYGT-ENGQD 291

Query: 305 YWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
           YWLVKNSWG +WGENGYI++ R+       CGIA  ASYP
Sbjct: 292 YWLVKNSWGPSWGENGYIKIARN---HSNHCGIASMASYP 328


>gi|302779822|ref|XP_002971686.1| hypothetical protein SELMODRAFT_16221 [Selaginella moellendorffii]
 gi|300160818|gb|EFJ27435.1| hypothetical protein SELMODRAFT_16221 [Selaginella moellendorffii]
          Length = 214

 Score =  278 bits (712), Expect = 2e-72,   Method: Compositional matrix adjust.
 Identities = 128/215 (59%), Positives = 167/215 (77%), Gaps = 2/215 (0%)

Query: 131 SIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQ 190
           S+DWRKKG VT +KDQG CG CWAFSA+AA+EG+  ++T  L SLSEQELVDCDT+  +Q
Sbjct: 1   SVDWRKKGGVTEIKDQGDCGNCWAFSAIAAVEGLTFLSTGTLVSLSEQELVDCDTT-VNQ 59

Query: 191 GCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAA 250
           GC+GG+MD AF+++I N G+ +++ YPY+A  G+C+K +    AA I+G++ +P  +E  
Sbjct: 60  GCDGGMMDYAFQYMIRNGGITSQSNYPYRAQRGACDKDKVKYHAATINGFQAIPPQSEEL 119

Query: 251 LMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKN 310
           L++AVANQPVSVAI+A G DFQ YSSGVFTG+CG+ LDHGV  VGYGT   G +YWLVKN
Sbjct: 120 LLRAVANQPVSVAIEAGGQDFQLYSSGVFTGECGSNLDHGVAIVGYGTDAGGRQYWLVKN 179

Query: 311 SWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPT 345
           SWG+ WGE+GY+RM+R      G+CGI + ASYPT
Sbjct: 180 SWGSGWGESGYVRMERQ-GPGAGVCGINLDASYPT 213


>gi|242072384|ref|XP_002446128.1| hypothetical protein SORBIDRAFT_06g002110 [Sorghum bicolor]
 gi|241937311|gb|EES10456.1| hypothetical protein SORBIDRAFT_06g002110 [Sorghum bicolor]
          Length = 186

 Score =  278 bits (710), Expect = 3e-72,   Method: Compositional matrix adjust.
 Identities = 129/186 (69%), Positives = 149/186 (80%)

Query: 161 MEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKA 220
           MEG   I+T KL SLSEQELVDCD +G DQGCEGG MDDAFEF++ N GL TE+KYPY  
Sbjct: 1   MEGAVKISTGKLVSLSEQELVDCDVNGMDQGCEGGEMDDAFEFVVDNGGLTTESKYPYTG 60

Query: 221 SDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFT 280
           SDG+CN  EA   AA I+GYEDVP+N+E +L KAVANQPVSVA+D   + F+FY  GV +
Sbjct: 61  SDGNCNSDEAKNDAASITGYEDVPANDETSLRKAVANQPVSVAVDGGDNLFRFYKGGVLS 120

Query: 281 GQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQ 340
           G CGTELDHG+ AVGYG A DGTK+WL+KNSWGT+WGE GYIRM+RDI   EGLCG+AMQ
Sbjct: 121 GACGTELDHGIAAVGYGVAGDGTKFWLMKNSWGTSWGEAGYIRMERDIADDEGLCGLAMQ 180

Query: 341 ASYPTA 346
            SYPTA
Sbjct: 181 PSYPTA 186


>gi|158300877|ref|XP_001689282.1| AGAP011828-PA [Anopheles gambiae str. PEST]
 gi|157013372|gb|EDO63348.1| AGAP011828-PA [Anopheles gambiae str. PEST]
          Length = 344

 Score =  278 bits (710), Expect = 3e-72,   Method: Compositional matrix adjust.
 Identities = 155/323 (47%), Positives = 195/323 (60%), Gaps = 24/323 (7%)

Query: 40  EMWMA---QYGRVYRDNAEKEMRFKIFKENVEYIASFNNKAR--NKPYKLGINEFADQTN 94
           E W A   Q+ + Y    E+ +R KI+ +N   IA  N +     + ++L +N++AD  +
Sbjct: 26  EEWTAFKLQHRKKYDSETEERIRMKIYVQNKHKIAKHNQRYDLGQEKFRLRVNKYADLLH 85

Query: 95  EEFRAPRNGYKRRLP--------SVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQ 146
           EEF    NG+ R +          ++  E         N  VP ++DWR KGAVT VKDQ
Sbjct: 86  EEFVHTLNGFNRSVSGKGQLLRGELKPIEEPVTWIEPANVDVPTAMDWRTKGAVTQVKDQ 145

Query: 147 GQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIIS 206
           G CG CW+FSA  A+EG +   T KL SLSEQ LVDC     + GC GG+MD AF++I  
Sbjct: 146 GHCGSCWSFSATGALEGQHFRKTGKLVSLSEQNLVDCSQKYGNNGCNGGMMDFAFQYIKD 205

Query: 207 NKGLATEAKYPYKASDGSCNKKEANPSA--AKISGYEDVPSNNEAALMKAVANQ-PVSVA 263
           NKG+ TE  YPY+A D  C+    NP A  A   G+ D+P  NE ALMKA+A   PVSVA
Sbjct: 206 NKGIDTEKSYPYEAIDDECH---YNPKAVGATDKGFVDIPQGNEKALMKALATVGPVSVA 262

Query: 264 IDASGSDFQFYSSGV-FTGQCGTE-LDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGY 321
           IDAS   FQFYS GV +  QC +E LDHGV AVGYGT +DG  YWLVKNSWGTTWG+ GY
Sbjct: 263 IDASHESFQFYSEGVYYEPQCDSEQLDHGVLAVGYGTTEDGEDYWLVKNSWGTTWGDQGY 322

Query: 322 IRMQRDIDAKEGLCGIAMQASYP 344
           ++M R+ D     CGIA  ASYP
Sbjct: 323 VKMARNRDNH---CGIATTASYP 342


>gi|94421564|gb|ABF18889.1| cathepsin-L [Lygus lineolaris]
          Length = 314

 Score =  278 bits (710), Expect = 3e-72,   Method: Compositional matrix adjust.
 Identities = 155/326 (47%), Positives = 200/326 (61%), Gaps = 19/326 (5%)

Query: 6   LENKLVLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKE 65
           ++  L  A ++ +G+  P      L+D    E  E + A+YG+ Y  N  +  R  I+  
Sbjct: 1   MKTVLAFACLVAVGLALP------LSDDNQAEW-ESYKAKYGKTYESNENEAARRTIYFM 53

Query: 66  NVEYIASFNNKARNK--PYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRY 123
             E +   N +       YKLG+N FAD  N EFR   NGY+R  P  R+S    V    
Sbjct: 54  AKEKVMEHNARFEQGLVSYKLGLNSFADMHNGEFRKMMNGYRRGTP--RNSVVVHVE--- 108

Query: 124 ENASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDC 183
            N ++PAS+DWR KGAVT +K+QGQCG CWAFS   ++EG + +   KL SLSEQELVDC
Sbjct: 109 SNITLPASVDWRTKGAVTPIKNQGQCGSCWAFSTTGSLEGQHALKKGKLVSLSEQELVDC 168

Query: 184 DTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDV 243
             +  + GC+GGLMDDAF +I  N G+ TE  YPY   DG+C+ K+++  AA ++G+ DV
Sbjct: 169 SAAEGNDGCDGGLMDDAFTYIKKNNGIDTEQSYPYTGEDGTCSFKKSDV-AATVTGFVDV 227

Query: 244 PSNNEAALMKAVAN-QPVSVAIDASGSDFQFYSSGVF-TGQCG-TELDHGVTAVGYGTAD 300
            S +E+ L  A A   P+SVAIDAS  DFQ Y SGV+    C  TELDHGV  VGYGT D
Sbjct: 228 TSGSESGLQDASATIGPISVAIDASSWDFQLYESGVYDVSDCSTTELDHGVLVVGYGT-D 286

Query: 301 DGTKYWLVKNSWGTTWGENGYIRMQR 326
           DGT YWLVKNSWGT WG +GYI+M R
Sbjct: 287 DGTAYWLVKNSWGTDWGHHGYIQMSR 312


>gi|22653679|sp|Q26636.1|CATL_SARPE RecName: Full=Cathepsin L; Contains: RecName: Full=Cathepsin L
           heavy chain; Contains: RecName: Full=Cathepsin L light
           chain; Flags: Precursor
 gi|505140|dbj|BAA03970.1| cathepsin L precursor [Sarcophaga peregrina]
          Length = 339

 Score =  278 bits (710), Expect = 4e-72,   Method: Compositional matrix adjust.
 Identities = 152/320 (47%), Positives = 193/320 (60%), Gaps = 12/320 (3%)

Query: 35  MNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNK-ARNK-PYKLGINEFADQ 92
           + E    +  Q+ + Y +  E+  R KIF EN   IA  N   A+ K  YKLG+N++AD 
Sbjct: 24  IKEEWHTYKLQHRKNYANEVEERFRMKIFNENRHKIAKHNQLFAQGKVSYKLGLNKYADM 83

Query: 93  TNEEFRAPRNGYKRRLPSVRSSETTDVSFRY---ENASVPASIDWRKKGAVTGVKDQGQC 149
            + EF+   NGY   L  +    T  V   Y    + +VP S+DWR+ GAVTGVKDQG C
Sbjct: 84  LHHEFKETMNGYNHTLRQLMRERTGLVGATYIPPAHVTVPKSVDWREHGAVTGVKDQGHC 143

Query: 150 GCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKG 209
           G CWAFS+  A+EG +      L SLSEQ LVDC T   + GC GGLMD+AF +I  N G
Sbjct: 144 GSCWAFSSTGALEGQHFRKAGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKDNGG 203

Query: 210 LATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQ-PVSVAIDASG 268
           + TE  YPY+  D SC+  +A   A   +G+ D+P  +E  + KAVA   PVSVAIDAS 
Sbjct: 204 IDTEKSYPYEGIDDSCHFNKATIGATD-TGFVDIPEGDEEKMKKAVATMGPVSVAIDASH 262

Query: 269 SDFQFYSSGVFT-GQCGTE-LDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQR 326
             FQ YS GV+   +C  + LDHGV  VGYGT + G  YWLVKNSWGTTWGE GYI+M R
Sbjct: 263 ESFQLYSEGVYNEPECDEQNLDHGVLVVGYGTDESGMDYWLVKNSWGTTWGEQGYIKMAR 322

Query: 327 DIDAKEGLCGIAMQASYPTA 346
           + + +   CGIA  +SYPT 
Sbjct: 323 NQNNQ---CGIATASSYPTV 339


>gi|170041165|ref|XP_001848344.1| cathepsin l [Culex quinquefasciatus]
 gi|167864709|gb|EDS28092.1| cathepsin l [Culex quinquefasciatus]
          Length = 340

 Score =  277 bits (709), Expect = 4e-72,   Method: Compositional matrix adjust.
 Identities = 153/320 (47%), Positives = 200/320 (62%), Gaps = 21/320 (6%)

Query: 40  EMWMA---QYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARN--KPYKLGINEFADQTN 94
           E W A   Q+ + Y    E+ +R KI+ +N   IA  N +     + ++L +N++ D  +
Sbjct: 25  EEWNAYKLQHRKKYDSETEERLRLKIYVQNKHKIAKHNQRFEQGQEKFRLRVNKYTDLLH 84

Query: 95  EEFRAPRNGYKR---RLPSVRSSETTDVSFRYE--NASVPASIDWRKKGAVTGVKDQGQC 149
           EEF    NG+ R   + P ++  +  +     E  N  VP ++DWR+KGAVT VKDQG C
Sbjct: 85  EEFVQTLNGFNRTNAKKPMLKGVKIDEPVTYIEPANVEVPKTVDWREKGAVTPVKDQGHC 144

Query: 150 GCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKG 209
           G CW+FSA  A+EG +   T KL SLSEQ LVDC T   + GC GG+MD AF++I  N G
Sbjct: 145 GSCWSFSATGALEGQHFRKTGKLVSLSEQNLVDCSTKYGNNGCNGGMMDFAFQYIKDNGG 204

Query: 210 LATEAKYPYKASDGSCNKKEANPSA--AKISGYEDVPSNNEAALMKAVANQ-PVSVAIDA 266
           + TE  YPY+A D +C+    NP A  A   G+ D+P  +E ALMKA+A   PVSVAIDA
Sbjct: 205 IDTEKAYPYEAIDDTCH---YNPKAVGATDKGFVDIPQGDEKALMKAIATAGPVSVAIDA 261

Query: 267 SGSDFQFYSSGV-FTGQCGTE-LDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRM 324
           S   FQFYS GV +  QC +E LDHGV AVGYGT+++G  YWLVKNSWGTTWG+ GY++M
Sbjct: 262 SHESFQFYSEGVYYEPQCDSENLDHGVLAVGYGTSEEGEDYWLVKNSWGTTWGDQGYVKM 321

Query: 325 QRDIDAKEGLCGIAMQASYP 344
            R+ D     CGIA  ASYP
Sbjct: 322 ARNRDNH---CGIATAASYP 338


>gi|390337645|ref|XP_001199228.2| PREDICTED: cathepsin L-like [Strongylocentrotus purpuratus]
          Length = 333

 Score =  277 bits (709), Expect = 5e-72,   Method: Compositional matrix adjust.
 Identities = 161/338 (47%), Positives = 209/338 (61%), Gaps = 18/338 (5%)

Query: 14  AILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASF 73
           ++L++ V    S S +  D   +E  + W  ++G+ Y  + E+  R  I+++N++ +   
Sbjct: 5   SVLLVAVCVVSSLSMSFTD--FDEDWKEWKNEHGKRYLSDEEEASRRLIWQKNLDIVIRH 62

Query: 74  NNK--ARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYEN--ASVP 129
           N K    +  Y LG+N+FAD  N+EF A   G++    S  +  +T   F   N    +P
Sbjct: 63  NLKYDLGHFTYDLGMNQFADLQNKEFVAMMTGFRVNGTSKAAKGST---FLPPNNVGKLP 119

Query: 130 ASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGED 189
            ++DWR KG VT VKDQGQCG CWAFSA  ++EG +   T KL SLSEQ LVDC  S ++
Sbjct: 120 KTVDWRTKGYVTPVKDQGQCGSCWAFSATGSLEGQHFKKTGKLVSLSEQNLVDC--SDKN 177

Query: 190 QGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEA 249
            GC GGLMD AF++II   G+ TE  YPY A DG+C+ K AN   A ++GY DV S +E 
Sbjct: 178 YGCNGGLMDRAFQYIIDAGGIDTEESYPYIAMDGNCHFKTAN-VGATVTGYTDVTSGSEK 236

Query: 250 ALMKAVAN-QPVSVAIDASGSDFQFYSSGVFT--GQCGTELDHGVTAVGYGTADDGTKYW 306
           AL KAVA+  P+SVAIDAS   FQ Y SGV+   G   T LDHGV AVGYGT  DGT YW
Sbjct: 237 ALQKAVAHIGPISVAIDASHFSFQLYQSGVYNEPGCSSTLLDHGVLAVGYGTTIDGTDYW 296

Query: 307 LVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
           +VKNSW  TWG NGYI M R+   K+  CGIA QASYP
Sbjct: 297 IVKNSWAETWGMNGYIWMSRN---KDNQCGIATQASYP 331


>gi|414591548|tpg|DAA42119.1| TPA: hypothetical protein ZEAMMB73_388689, partial [Zea mays]
          Length = 229

 Score =  277 bits (708), Expect = 5e-72,   Method: Compositional matrix adjust.
 Identities = 133/196 (67%), Positives = 151/196 (77%), Gaps = 1/196 (0%)

Query: 150 GCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKG 209
           G CWAFSA+AA+EG+N I T KL SLSEQELVDCD   ++QGC+GGLMD AF++I  N G
Sbjct: 13  GSCWAFSAIAAVEGVNKIMTGKLVSLSEQELVDCDDV-DNQGCDGGLMDYAFQYIQRNGG 71

Query: 210 LATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGS 269
           + TE+ YPY A   SCNK +       I GYEDVP+NNE AL KAVA+QPV+VAI+ASG 
Sbjct: 72  VTTESNYPYLAEQRSCNKAKERSHDVTIDGYEDVPANNEDALQKAVASQPVAVAIEASGQ 131

Query: 270 DFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDID 329
           DFQFYS GVFTG CGT+LDHGV AVGYGT  DGTKYW VKNSWG  WGE GYIRMQR + 
Sbjct: 132 DFQFYSEGVFTGSCGTDLDHGVAAVGYGTTGDGTKYWTVKNSWGEDWGERGYIRMQRGVP 191

Query: 330 AKEGLCGIAMQASYPT 345
              GLCGIAM+ SYPT
Sbjct: 192 DSRGLCGIAMEPSYPT 207


>gi|380014284|ref|XP_003691169.1| PREDICTED: cathepsin L-like [Apis florea]
          Length = 345

 Score =  277 bits (708), Expect = 6e-72,   Method: Compositional matrix adjust.
 Identities = 149/314 (47%), Positives = 191/314 (60%), Gaps = 15/314 (4%)

Query: 42  WMA---QYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNK--PYKLGINEFADQTNEE 96
           WM    ++ + Y+ + E+  R KIF +N   IA  N+    K   YKL +N++ D  + E
Sbjct: 28  WMTFKMEHKKAYKSDVEERFRMKIFMDNKHKIAKHNSNYEMKKVSYKLKMNKYGDMLHHE 87

Query: 97  FRAPRNGYKRRLPSVRSSETTDVSFRY---ENASVPASIDWRKKGAVTGVKDQGQCGCCW 153
           F    NG+ + + +   SE   +   +    N ++P  +DWRK+GAVT VKDQG CG CW
Sbjct: 88  FVNILNGFNKSINTQLRSERMPIGASFIEPANVALPKKVDWRKEGAVTPVKDQGHCGSCW 147

Query: 154 AFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATE 213
           +FSA  A+EG +   T  L SLSEQ L+DC     + GC GGLMD AF++I  NKGL TE
Sbjct: 148 SFSATGALEGQHFRRTGVLVSLSEQNLIDCSGKYGNNGCNGGLMDQAFQYIKDNKGLDTE 207

Query: 214 AKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVAN-QPVSVAIDASGSDFQ 272
           A YPY+A +  C    AN  A  + GY D+P+ NE  L  AVA   PVSVAIDAS   FQ
Sbjct: 208 ASYPYEAENDKCRYNPANSGAIDV-GYIDIPTGNEKLLKAAVATIGPVSVAIDASHQSFQ 266

Query: 273 FYSSGV-FTGQCGT-ELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDA 330
           FYS GV +  +C + ELDHGV  +GYGT ++G  YWLVKNSWG TWG NGYI+M R+   
Sbjct: 267 FYSEGVYYEPECSSEELDHGVLVIGYGTNENGEDYWLVKNSWGETWGNNGYIKMARN--- 323

Query: 331 KEGLCGIAMQASYP 344
           K   CGIA  ASYP
Sbjct: 324 KLNHCGIASSASYP 337


>gi|328776427|ref|XP_625135.3| PREDICTED: cathepsin L-like [Apis mellifera]
          Length = 351

 Score =  277 bits (708), Expect = 6e-72,   Method: Compositional matrix adjust.
 Identities = 153/329 (46%), Positives = 197/329 (59%), Gaps = 15/329 (4%)

Query: 27  SRTLNDATMNERHEMWMA---QYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNK--P 81
           SRT   +     ++ WM    ++ +VY+ + E+  R KIF +N   IA  N+    K   
Sbjct: 19  SRTHAVSFFELVNQEWMTFKMEHKKVYKSDVEERFRMKIFMDNKHKIAKHNSNYEMKKVS 78

Query: 82  YKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRY---ENASVPASIDWRKKG 138
           YKL +N++ D  + EF    NG+ + + +   SE   V   +    N  +P  +DWRK+G
Sbjct: 79  YKLKMNKYGDMLHHEFVNILNGFNKSINTQLRSERLPVGASFIEPANVVLPKKVDWRKEG 138

Query: 139 AVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMD 198
           AVT VKDQG CG CW+FSA  A+EG +   T  L SLSEQ L+DC     + GC GGLMD
Sbjct: 139 AVTPVKDQGHCGSCWSFSATGALEGQHFRRTGVLVSLSEQNLIDCSGKYGNNGCNGGLMD 198

Query: 199 DAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVAN- 257
            AF++I  NKGL TEA YPY+A +  C    AN  A  + GY D+P+ +E  L  AVA  
Sbjct: 199 QAFQYIKDNKGLDTEASYPYEAENDKCRYNPANSGAIDV-GYIDIPTGDEKLLKAAVATI 257

Query: 258 QPVSVAIDASGSDFQFYSSGV-FTGQCGT-ELDHGVTAVGYGTADDGTKYWLVKNSWGTT 315
            PVSVAIDAS   FQFYS GV +  +C + ELDHGV  +GYGT ++G  YWLVKNSWG T
Sbjct: 258 GPVSVAIDASHQSFQFYSEGVYYEPECSSEELDHGVLVIGYGTNENGQDYWLVKNSWGET 317

Query: 316 WGENGYIRMQRDIDAKEGLCGIAMQASYP 344
           WG NGYI+M R+   K   CGIA  ASYP
Sbjct: 318 WGNNGYIKMARN---KLNHCGIASSASYP 343


>gi|4469153|emb|CAB38314.1| chymopapain isoform II [Carica papaya]
          Length = 352

 Score =  277 bits (708), Expect = 6e-72,   Method: Compositional matrix adjust.
 Identities = 147/347 (42%), Positives = 205/347 (59%), Gaps = 14/347 (4%)

Query: 3   MILLENKLVLAAILVLGVWAPQSWSRTLNDATMNER----HEMWMAQYGRVYRDNAEKEM 58
           +I L   L++   L    +    +S+  +D T  ER     + WM ++ ++Y    EK  
Sbjct: 10  IIFLATCLIIHMGLSSADFYTVGYSQ--DDLTSIERLIQLFDSWMLKHNKIYESIDEKIY 67

Query: 59  RFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGY-KRRLPSVRSSETT 117
           RF+IF++N+ YI   N K  N  Y LG+N FAD +N+EF+    G+       +   +  
Sbjct: 68  RFEIFRDNLMYIDETNKK--NNSYWLGLNGFADLSNDEFKKKYVGFVAEDFTGLEHFDNE 125

Query: 118 DVSFRYENASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSE 177
           D ++++   + P SIDWR KGAVT VK+QG CG CWAFS +A +EGIN I T  L  LSE
Sbjct: 126 DFTYKHV-TNYPQSIDWRAKGAVTPVKNQGACGSCWAFSTIATVEGINKIVTGNLLELSE 184

Query: 178 QELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKI 237
           QELVDCD      GC+GG    + +++ +N G+ T   YPY+A    C   +      KI
Sbjct: 185 QELVDCDK--HSYGCKGGYQTTSLQYV-ANNGVHTSKVYPYQAKQYKCRATDKPGPKVKI 241

Query: 238 SGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYG 297
           +GY+ VPSN E + + A+ANQP+S  ++A G  FQ Y SGVF G CGT+LDH VTAVGYG
Sbjct: 242 TGYKRVPSNCETSFLGALANQPLSFLVEAGGKPFQLYKSGVFDGPCGTKLDHAVTAVGYG 301

Query: 298 TADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
           T+ DG  Y ++KNSWG  WGE GY+R++R     +G CG+   + YP
Sbjct: 302 TS-DGKNYIIIKNSWGPNWGEKGYMRLKRQSGNSQGTCGVYKSSYYP 347


>gi|330803818|ref|XP_003289899.1| hypothetical protein DICPUDRAFT_154350 [Dictyostelium purpureum]
 gi|325080010|gb|EGC33584.1| hypothetical protein DICPUDRAFT_154350 [Dictyostelium purpureum]
          Length = 326

 Score =  277 bits (708), Expect = 6e-72,   Method: Compositional matrix adjust.
 Identities = 155/340 (45%), Positives = 211/340 (62%), Gaps = 17/340 (5%)

Query: 9   KLVLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVE 68
           +LVLA I    +    S +R  +        + WM ++ + Y  N E   R+ +F++N++
Sbjct: 2   RLVLALIFCFLIINCCSAARIFSQKQYQTAFQNWMVKHQKSYT-NDEFGSRYSVFQDNMD 60

Query: 69  YIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENASV 128
            +A +N K  N    LG+N  AD TNEEF+    G K  + + +      VS       +
Sbjct: 61  IVAKWNQKGSNTI--LGLNVMADLTNEEFKKLYLGTKANV-TYKKKTLVGVS------GL 111

Query: 129 PASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGE 188
           PAS+DWR  GAVT VK+QGQCG C+AFS   ++EGI+ IT+++L  LSEQ+++DC  S  
Sbjct: 112 PASVDWRANGAVTAVKNQGQCGGCYAFSTTGSVEGIHEITSQQLVPLSEQQILDCSGSEG 171

Query: 189 DQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNE 248
           + GC+GGLM ++FE+II+  GL TEA YPY    G C   + N   A I+GY++V S +E
Sbjct: 172 NNGCDGGLMTNSFEYIIAVGGLDTEASYPYTGEVGKCKFNKKNI-GATITGYKNVESGSE 230

Query: 249 AALMKAVANQPVSVAIDASGSDFQFYSSGV-FTGQC-GTELDHGVTAVGYGTADDGTKYW 306
           + L  AVA QPVSVAIDAS S FQ Y+SGV +  +C  T+LDHGV AVGYG+   G  YW
Sbjct: 231 SDLQTAVAAQPVSVAIDASQSSFQLYASGVYYEPECSSTQLDHGVLAVGYGS-QSGQDYW 289

Query: 307 LVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPTA 346
           +VKNSWG  WGENG+I M R+   K+  CGIA  AS+PTA
Sbjct: 290 IVKNSWGADWGENGFILMARN---KDNNCGIATMASFPTA 326


>gi|81542|pir||S02728 actinidain (EC 3.4.22.14) precursor (clone pAC.1) - kiwi fruit
           (fragment)
 gi|15957|emb|CAA31435.1| actinidin precursor [Actinidia chinensis]
 gi|166319|gb|AAA32630.1| actinidin precursor [Actinidia deliciosa]
 gi|226542|prf||1601514A actinidin
          Length = 302

 Score =  277 bits (708), Expect = 7e-72,   Method: Compositional matrix adjust.
 Identities = 141/274 (51%), Positives = 176/274 (64%), Gaps = 11/274 (4%)

Query: 74  NNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYE---NASVPA 130
           +N   N+ YK+G+N+FAD T EEFR+   G+         S  T VS RYE   +  +P+
Sbjct: 7   HNADTNRSYKVGLNQFADLTGEEFRSTYLGF------TGGSNKTKVSNRYEPRVSQVLPS 60

Query: 131 SIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQ 190
            +DWR  GAV  +K QG+CG CWAFSA+A +EGIN I T  L SLSEQEL+ C  +   +
Sbjct: 61  YVDWRSAGAVVDIKSQGECGGCWAFSAIATVEGINKIVTGVLISLSEQELIGCGGTQNTR 120

Query: 191 GCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAA 250
           GC GG + D F+FII+N G+ T   YPY A DG CN    N     I  Y +VP NNE A
Sbjct: 121 GCNGGYITDGFQFIINNGGINTGENYPYTAQDGECNLDLQNEKYVTIDTYGNVPYNNEWA 180

Query: 251 LMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKN 310
           L  AV  QPVSVA+DA+G  F+ YSSG+FTG CGT +DH VT VGYGT + G  YW+V+N
Sbjct: 181 LQTAVTYQPVSVALDAAGDAFKHYSSGIFTGPCGTAIDHAVTIVGYGT-EGGIDYWIVEN 239

Query: 311 SWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
           SW TTWGE GY+R+ R++    G CGIA   SYP
Sbjct: 240 SWDTTWGEEGYMRILRNVGGA-GTCGIATMPSYP 272


>gi|157093355|gb|ABV22332.1| cysteine protease 1 [Noctiluca scintillans]
          Length = 338

 Score =  277 bits (708), Expect = 7e-72,   Method: Compositional matrix adjust.
 Identities = 150/308 (48%), Positives = 189/308 (61%), Gaps = 22/308 (7%)

Query: 45  QYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGY 104
           +YG+VY    E  +RF IFK NV+ I + N  ARN  + LG+NEF D T EE  A   G 
Sbjct: 33  KYGKVYNGINEDAVRFGIFKANVDIIYATN--ARNLTFALGVNEFTDLTQEELAASYTGL 90

Query: 105 K-----RRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVA 159
           K       LP + + E       Y  A + +S+DW  +G VT VK+QGQCG CW+FS   
Sbjct: 91  KPASLWSGLPRLSTHE-------YNGAPLASSVDWTTQGVVTPVKNQGQCGSCWSFSTTG 143

Query: 160 AMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYK 219
           A+EG   ++T  L SLSEQ+ VDCDT+  D GC GG MD+AF F   N  + TE  YPY 
Sbjct: 144 ALEGAWALSTGNLVSLSEQQFVDCDTT--DSGCNGGWMDNAFSFAKKNS-ICTEGSYPYT 200

Query: 220 ASDGSCNKK--EANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSG 277
           A+DG+CN    +       + GY DV +++E A+M AVA QPVS+AI+A    FQ YSSG
Sbjct: 201 ATDGTCNLSGCQVGIPQGGVVGYTDVSTDSEQAMMSAVAQQPVSIAIEADQYSFQLYSSG 260

Query: 278 VFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCG- 336
           V T  CGT LDHGV AVGYG+ + GT YW VKNSWG++WGE GY+R+QR      G CG 
Sbjct: 261 VLTASCGTRLDHGVLAVGYGS-EAGTDYWKVKNSWGSSWGEQGYVRLQRG-KGGAGECGL 318

Query: 337 IAMQASYP 344
           +A   SYP
Sbjct: 319 LAGPPSYP 326


>gi|340727787|ref|XP_003402217.1| PREDICTED: cathepsin L-like [Bombus terrestris]
          Length = 343

 Score =  277 bits (708), Expect = 7e-72,   Method: Compositional matrix adjust.
 Identities = 149/318 (46%), Positives = 194/318 (61%), Gaps = 12/318 (3%)

Query: 35  MNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNK--PYKLGINEFADQ 92
           +N+    +  ++ +VY+++ E+  R KIF +N   IA  N     K   YKL +N++ D 
Sbjct: 24  VNQEWTTFKMEHNKVYKNDVEERFRMKIFMDNKHKIAKHNGNYEMKKVSYKLKMNKYGDM 83

Query: 93  TNEEFRAPRNGYKRRLPSVRSSETTDVSFRY---ENASVPASIDWRKKGAVTGVKDQGQC 149
            + EF    NG+ + + +   SE   ++  +    N  +P ++DWR+ GAVT VKDQG C
Sbjct: 84  LHHEFVNTLNGFNKSINTQLRSERLPIAASFIEPANVVLPKTVDWREHGAVTPVKDQGHC 143

Query: 150 GCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKG 209
           G CW+FSA  A+EG +   T  L  LSEQ L+DC     + GC GGLMD AF++I  NKG
Sbjct: 144 GSCWSFSATGALEGQHFRRTGILIPLSEQNLIDCSGKYGNNGCNGGLMDQAFQYIKDNKG 203

Query: 210 LATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVAN-QPVSVAIDASG 268
           L TE  YPY+A +  C    AN S A+  GY D+P  NE  L  AVA   PVSVAIDAS 
Sbjct: 204 LDTEVTYPYEAENDKCRYNAAN-SGARDVGYVDIPQGNEKKLKAAVATIGPVSVAIDASH 262

Query: 269 SDFQFYSSGV-FTGQCGTE-LDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQR 326
             FQFYS GV +  +C +E LDHGV AVGYGT ++G  YWLVKNSWG TWG+NGYI+M R
Sbjct: 263 QSFQFYSEGVYYEPECSSENLDHGVLAVGYGTDENGQDYWLVKNSWGETWGDNGYIKMAR 322

Query: 327 DIDAKEGLCGIAMQASYP 344
           +   K   CGIA  ASYP
Sbjct: 323 N---KLNHCGIASTASYP 337


>gi|324983200|gb|ADY68475.1| stem bromelain [Ananas comosus]
          Length = 291

 Score =  276 bits (707), Expect = 7e-72,   Method: Compositional matrix adjust.
 Identities = 148/291 (50%), Positives = 192/291 (65%), Gaps = 9/291 (3%)

Query: 9   KLVLAAILVLGVWA-PQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENV 67
           +LV   + +  +WA P + SR      M +R E WMA+YGRVY+DN EK  RF+IFK NV
Sbjct: 6   QLVFLFLFLCVMWASPSAASRDEPSDPMMKRFEEWMAEYGRVYKDNDEKMRRFQIFKNNV 65

Query: 68  EYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENAS 127
            +I +FNN+  N  Y LGIN+F D TN EF A   G   R  ++       VSF   N S
Sbjct: 66  NHIETFNNRNGNS-YTLGINKFTDMTNNEFVAQYTGGISRPLNIEKEPV--VSFDDVNIS 122

Query: 128 -VPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTS 186
            V  SIDWR  GAVT VKDQ  CG CWAFSA+A +EGI  I T  L SLSEQE++DC  S
Sbjct: 123 AVGQSIDWRDYGAVTEVKDQNPCGSCWAFSAIATVEGIYKIVTGYLVSLSEQEVLDCAVS 182

Query: 187 GEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSN 246
               GC+GG +D+A++FIISN G+A+EA YPY+A  G C    + P++A I+GY  V SN
Sbjct: 183 ---NGCDGGFVDNAYDFIISNNGVASEADYPYQAYQGDC-AANSWPNSAYITGYSYVRSN 238

Query: 247 NEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYG 297
           +E+++  AV NQP++ AIDASG +FQ+Y+ GVF+G CGT L+H +T +GYG
Sbjct: 239 DESSMKYAVWNQPIAAAIDASGDNFQYYNGGVFSGPCGTSLNHAITIIGYG 289


>gi|333069454|gb|AEF13978.1| chymopapain [Carica papaya]
          Length = 352

 Score =  276 bits (707), Expect = 9e-72,   Method: Compositional matrix adjust.
 Identities = 147/347 (42%), Positives = 205/347 (59%), Gaps = 14/347 (4%)

Query: 3   MILLENKLVLAAILVLGVWAPQSWSRTLNDATMNER----HEMWMAQYGRVYRDNAEKEM 58
           +I L   L++   L    +    +S+  +D T  ER     + WM ++ ++Y    EK  
Sbjct: 10  IIFLATCLIIHMSLSSADFYTVGYSQ--DDLTSIERLIQLFDSWMLKHNKIYESIDEKIY 67

Query: 59  RFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNG-YKRRLPSVRSSETT 117
           RF+IF++N+ YI   N K  N  Y LG+N FAD +N+EF+    G        +   +  
Sbjct: 68  RFEIFRDNLMYIDETNKK--NNSYWLGLNGFADLSNDEFKKKYVGSVAEDFTGLEHFDNE 125

Query: 118 DVSFRYENASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSE 177
           D ++++   + P SIDWR KGAVT VK+QG CG CWAFS +A +EG+N I T  L  LSE
Sbjct: 126 DFTYKHV-TNYPQSIDWRAKGAVTPVKNQGSCGSCWAFSTIATVEGVNKIVTGNLLELSE 184

Query: 178 QELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKI 237
           QELVDCD +    GC+GG    + +++  N G+ T   YPY+A    C   +      KI
Sbjct: 185 QELVDCDKN--SHGCKGGYQTTSLQYVADN-GVHTSKVYPYQAKAMQCRATDKPGPKVKI 241

Query: 238 SGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYG 297
           +GY+ VPSN E + + A+ANQP+SV ++A G  FQ Y SGVF G CGT+LDH VTAVGYG
Sbjct: 242 TGYKRVPSNCETSFLGALANQPLSVLVEAGGKPFQLYKSGVFDGPCGTKLDHAVTAVGYG 301

Query: 298 TADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
           T+ DG  Y ++KNSWG  WGE GY+R++R     +G CG+   + YP
Sbjct: 302 TS-DGKNYIIIKNSWGPNWGEKGYMRLKRQSGNSQGTCGVYKSSYYP 347


>gi|427797099|gb|JAA64001.1| Putative cathepsin l cathepsin l, partial [Rhipicephalus
           pulchellus]
          Length = 331

 Score =  276 bits (707), Expect = 9e-72,   Method: Compositional matrix adjust.
 Identities = 154/309 (49%), Positives = 197/309 (63%), Gaps = 11/309 (3%)

Query: 42  WMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNK-ARNK-PYKLGINEFADQTNEEFRA 99
           + A +G+ Y  + E+  R KI+ EN   IA  N K A+++  YKL +NEF D  + EF +
Sbjct: 26  FKALHGKEYESDTEEYYRLKIYMENRLKIARHNEKYAKSQVSYKLAMNEFGDMLHHEFVS 85

Query: 100 PRNGYKRRLPSVRSSETTDVSFR-YENASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAV 158
            RNG+KR         +  V     E+  +P ++DWRKKGAVT VK+QGQCG CW+FS  
Sbjct: 86  TRNGFKRNYRDTPREGSFFVEPEGLEDFHLPKTVDWRKKGAVTPVKNQGQCGSCWSFSTT 145

Query: 159 AAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPY 218
            ++EG +     KL SLSEQ L+DC  S  + GCEGGLMD AF++I +NKG+ TE  YPY
Sbjct: 146 GSLEGQHFRKLHKLVSLSEQNLIDCSRSFGNNGCEGGLMDYAFKYIKANKGIDTEQSYPY 205

Query: 219 KASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVAN-QPVSVAIDASGSDFQFYSSG 277
            A+DG C+  ++   A   +G+ D+P  +E  L KAVA   PVSVAIDAS   FQFYS G
Sbjct: 206 NATDGVCHFNKSAVGATD-TGFVDIPEGDENKLKKAVATVGPVSVAIDASHESFQFYSEG 264

Query: 278 VF-TGQCGTE-LDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLC 335
           V+   +C +E LDHGV  VGYGT  DG  YWLVKNSWGTTWG+ GYI M R+   K+  C
Sbjct: 265 VYDEPECDSEQLDHGVLVVGYGTK-DGQDYWLVKNSWGTTWGDGGYIYMSRN---KDNQC 320

Query: 336 GIAMQASYP 344
           GIA  ASYP
Sbjct: 321 GIASAASYP 329


>gi|52630917|gb|AAU84922.1| putative cathepsin L [Toxoptera citricida]
          Length = 341

 Score =  276 bits (707), Expect = 9e-72,   Method: Compositional matrix adjust.
 Identities = 158/343 (46%), Positives = 205/343 (59%), Gaps = 14/343 (4%)

Query: 12  LAAILVLG--VWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEY 69
           +  ++VLG  V+A  S S    +  + E  +++  Q+ ++Y D  E+  R K++ +N   
Sbjct: 1   MKVVIVLGLVVFAISSVSSINLNEIIEEEWDLFKVQFKKIYEDVKEEAFRKKVYLDNKLK 60

Query: 70  IASFNN--KARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTD--VSF-RYE 124
           IA  N   +   + Y L +N F D    E+    NG+K  L     + T D  V+F + E
Sbjct: 61  IARHNKLYETGEETYALEMNHFGDLMQHEYTKMMNGFKPSLAGGDKNFTDDDAVTFLKSE 120

Query: 125 NASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCD 184
           N  +P SIDWRKKG VT VK+QGQCG CW+FSA  ++EG +   T  L SLSEQ L+DC 
Sbjct: 121 NVVIPKSIDWRKKGYVTPVKNQGQCGSCWSFSATGSLEGQHFRKTGVLVSLSEQNLIDCS 180

Query: 185 TSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVP 244
               + GCEGGLMD AF++I SNKGL TE  YPY+A D  C     N S A   G+ D+P
Sbjct: 181 RKYGNNGCEGGLMDLAFKYIKSNKGLDTEKSYPYEAEDDKCRYNPEN-SGATDKGFVDIP 239

Query: 245 SNNEAALMKAVANQ-PVSVAIDASGSDFQFYSSGVFTG-QC-GTELDHGVTAVGYGTADD 301
             +E AL+ A+A   PVS+AIDAS   FQFY  GVF   +C  TELDHGV AVGYGT   
Sbjct: 240 EGDEDALVHALATVGPVSIAIDASSEKFQFYKKGVFYNPRCSSTELDHGVLAVGYGTDHK 299

Query: 302 GTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
           G  YW+VKNSWG TWG+ GYI M R+   K+  CG+A  ASYP
Sbjct: 300 GGDYWIVKNSWGKTWGDQGYIMMARN---KKNNCGVASSASYP 339


>gi|350412176|ref|XP_003489564.1| PREDICTED: cathepsin L-like [Bombus impatiens]
          Length = 343

 Score =  276 bits (706), Expect = 1e-71,   Method: Compositional matrix adjust.
 Identities = 149/318 (46%), Positives = 193/318 (60%), Gaps = 12/318 (3%)

Query: 35  MNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNK--PYKLGINEFADQ 92
           +N+    +  ++ +VY+++ E+  R KIF +N   IA  N     K   YKL +N++ D 
Sbjct: 24  VNQEWTTFKMEHNKVYKNDIEERFRMKIFMDNKHKIAKHNGNYEMKKVSYKLKMNKYGDM 83

Query: 93  TNEEFRAPRNGYKRRLPSVRSSETTDVSFRY---ENASVPASIDWRKKGAVTGVKDQGQC 149
            + EF    NG+ + + +   SE   +   +    N  +P ++DWR+ GAVT VKDQG C
Sbjct: 84  LHHEFVNTLNGFNKSINTQLRSERLPIGASFIEPANVVLPKTVDWREHGAVTPVKDQGHC 143

Query: 150 GCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKG 209
           G CW+FSA  A+EG +   T  L  LSEQ L+DC     + GC GGLMD AF++I  NKG
Sbjct: 144 GSCWSFSATGALEGQHFRRTGILIPLSEQNLIDCSGKYGNNGCNGGLMDQAFQYIKDNKG 203

Query: 210 LATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVAN-QPVSVAIDASG 268
           L TE  YPY+A +  C    AN S A+  GY D+P  NE  L  AVA   PVSVAIDAS 
Sbjct: 204 LDTEVTYPYEAENDKCRYNAAN-SGARDVGYVDIPQGNEKKLKAAVATIGPVSVAIDASH 262

Query: 269 SDFQFYSSGV-FTGQCGTE-LDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQR 326
             FQFYS GV +  +C +E LDHGV AVGYGT ++G  YWLVKNSWG TWG+NGYI+M R
Sbjct: 263 QSFQFYSEGVYYEPECSSENLDHGVLAVGYGTDENGQDYWLVKNSWGETWGDNGYIKMAR 322

Query: 327 DIDAKEGLCGIAMQASYP 344
           +   K   CGIA  ASYP
Sbjct: 323 N---KLNHCGIASTASYP 337


>gi|94448674|emb|CAI91575.1| cathepsin L2 [Lubomirskia baicalensis]
          Length = 324

 Score =  276 bits (706), Expect = 1e-71,   Method: Compositional matrix adjust.
 Identities = 155/314 (49%), Positives = 196/314 (62%), Gaps = 25/314 (7%)

Query: 42  WMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPR 101
           W A++G+ YR++ E+ +R   ++ N +YI   N  A    Y L +N+F D  N EF++  
Sbjct: 25  WKAEHGKSYRNHKEEMLRHVTWQANKKYIDEHNQHAGVFGYTLKMNQFGDLENSEFKSLY 84

Query: 102 NGYK------RRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQCGCCWAF 155
           NGY+      +  P V ++   D+         PAS+DW KKG VT VK+QGQCG CW+F
Sbjct: 85  NGYRMSNAPRKGKPFVPAARVQDL---------PASVDWSKKGWVTPVKNQGQCGSCWSF 135

Query: 156 SAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAK 215
           SA  +MEG +   T  L SLSEQ LVDC  +  + GC GGLMDDAFE++I N G+ TEA 
Sbjct: 136 SATGSMEGQHFNATGTLMSLSEQNLVDCSAAEGNHGCNGGLMDDAFEYVIKNNGIDTEAS 195

Query: 216 YPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVAN-QPVSVAIDASGSDFQFY 274
           YPY+A D +C    A+   A ISGY DV  ++E+ L  AVA   PVSVAIDAS   FQFY
Sbjct: 196 YPYRAVDSTCKFNTAD-VGATISGYVDVTKDSESDLQVAVATIGPVSVAIDASHISFQFY 254

Query: 275 SSGVFTGQC--GTELDHGVTAVGYGTADDGTK-YWLVKNSWGTTWGENGYIRMQRDIDAK 331
           SSGV+       T LDHGV AVGYGT  DG+K YWLVKNSWG +WG +GYI M R+ + K
Sbjct: 255 SSGVYDPLICSSTNLDHGVLAVGYGT--DGSKDYWLVKNSWGASWGMSGYIEMVRNHNNK 312

Query: 332 EGLCGIAMQASYPT 345
              CGIA  ASYP 
Sbjct: 313 ---CGIATSASYPV 323


>gi|195429415|ref|XP_002062758.1| GK19626 [Drosophila willistoni]
 gi|194158843|gb|EDW73744.1| GK19626 [Drosophila willistoni]
          Length = 341

 Score =  276 bits (706), Expect = 1e-71,   Method: Compositional matrix adjust.
 Identities = 156/345 (45%), Positives = 205/345 (59%), Gaps = 16/345 (4%)

Query: 9   KLVLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVE 68
           +  L  +L+  V   Q+ S +     + E    +  ++ + Y D+ E+  R KIF EN  
Sbjct: 2   RFALITLLIALVAMTQAVSYS---ELVREEWNTFKLEHRKNYADSTEETFRMKIFNENKH 58

Query: 69  YIASFNNK--ARNKPYKLGINEFADQTNEEFRAPRNGYKRRL-PSVRSSET--TDVSF-R 122
           +IA  N +       YKL +N++AD  + EFR   NG+   L   +RS++   T V+F  
Sbjct: 59  HIAKHNQRYATGEVSYKLALNKYADMLHHEFRETMNGFNYTLHKQLRSTDESFTGVTFIS 118

Query: 123 YENASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVD 182
            E+  +P ++DWR KGAVT VKDQG CG CWAFS+  A+EG +   +  L SLSEQ LVD
Sbjct: 119 PEHVKLPTAVDWRTKGAVTEVKDQGHCGSCWAFSSTGAIEGQHFRKSGTLVSLSEQNLVD 178

Query: 183 CDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYED 242
           C T   + GC GGLMD+AF ++  N G+ TE  Y Y+  D SC+  + N   A   G+ D
Sbjct: 179 CSTKYGNNGCNGGLMDNAFRYVKDNGGIDTEKSYAYEGIDDSCH-FDKNSIGATDRGFAD 237

Query: 243 VPSNNEAALMKAVAN-QPVSVAIDASGSDFQFYSSGVF-TGQCGTE-LDHGVTAVGYGTA 299
           +P  NE  L +AVA   PVSVAIDAS   FQFYS GV+    C  E LDHGV  VGYGT 
Sbjct: 238 IPQGNEKKLAQAVATIGPVSVAIDASQQSFQFYSEGVYDEPNCSAENLDHGVLVVGYGTE 297

Query: 300 DDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
            DG+ YWLVKNSWGTTWG+ G+I+M R+   KE  CGIA  +SYP
Sbjct: 298 KDGSDYWLVKNSWGTTWGDKGFIKMSRN---KENQCGIASASSYP 339


>gi|326497561|dbj|BAK05870.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 340

 Score =  276 bits (706), Expect = 1e-71,   Method: Compositional matrix adjust.
 Identities = 154/316 (48%), Positives = 193/316 (61%), Gaps = 14/316 (4%)

Query: 31  NDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFA 90
            D  M +R   W A + R Y    E+  RF++++ NVEYI + N +     Y+LG N+FA
Sbjct: 37  GDMLMMDRFRQWQATHNRSYLSAEERLRRFEVYRTNVEYIDATNRRG-GLTYELGENQFA 95

Query: 91  DQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQG-QC 149
           D T EEF A   G      ++ ++   D S     A  PAS+DWR KGAVT VK+QG QC
Sbjct: 96  DLTGEEFLARYAG-GHTGSAITTAAEADGSL---EADPPASVDWRAKGAVTPVKNQGSQC 151

Query: 150 GCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKG 209
             CWAFSAVA ME +  I T KL +LSEQ+LVDCD    D GC  G    AF++I+ N G
Sbjct: 152 YSCWAFSAVATMESLYFIKTGKLVALSEQQLVDCDK--YDGGCNKGYYHRAFQWIMENGG 209

Query: 210 LATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGS 269
           + T A+YPYKA  G+C+   A   A  I+G+  V + NE AL  AVA QP+ VAI+   S
Sbjct: 210 ITTAAQYPYKAVRGACS---AAKPAVTITGHLAV-AKNELALQSAVARQPIGVAIEVPIS 265

Query: 270 DFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDID 329
             QFY SGVF+  CG ++ H V  VGYG    G KYWLVKNSWG TWGE GYIRM+RD+ 
Sbjct: 266 -MQFYKSGVFSAACGIQMSHAVVTVGYGADASGLKYWLVKNSWGQTWGEAGYIRMRRDVG 324

Query: 330 AKEGLCGIAMQASYPT 345
              GLCGIA+  +YPT
Sbjct: 325 GG-GLCGIALDTAYPT 339


>gi|157132324|ref|XP_001655999.1| cathepsin l [Aedes aegypti]
 gi|108881694|gb|EAT45919.1| AAEL002833-PA [Aedes aegypti]
          Length = 339

 Score =  276 bits (706), Expect = 1e-71,   Method: Compositional matrix adjust.
 Identities = 157/321 (48%), Positives = 200/321 (62%), Gaps = 24/321 (7%)

Query: 40  EMWMA---QYGRVYRDNAEKEMRFKIFKENVEYIASFNNK--ARNKPYKLGINEFADQTN 94
           E W A   Q+ + Y    E+ +R KI+ +N   IA  N +     + Y+L +N++AD  +
Sbjct: 25  EEWNAFKLQHRKNYDSETEERIRLKIYVQNKHKIAKHNQRFDLGQEKYRLRVNKYADLLH 84

Query: 95  EEFRAPRNGY-----KRRLPSVRSSETTDVSF-RYENASVPASIDWRKKGAVTGVKDQGQ 148
           EEF    NG+     K+ L  VR  E   V+F    N  VP ++DWRKKGAVT VKDQG 
Sbjct: 85  EEFVQTVNGFNRTDSKKSLKGVRIEEP--VTFIEPANVEVPTTVDWRKKGAVTPVKDQGH 142

Query: 149 CGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNK 208
           CG CW+FSA  A+EG +   T KL SLSEQ LVDC     + GC GG+MD AF++I  N 
Sbjct: 143 CGSCWSFSATGALEGQHFRKTGKLVSLSEQNLVDCSGKYGNNGCNGGMMDYAFQYIKDNG 202

Query: 209 GLATEAKYPYKASDGSCNKKEANPSA--AKISGYEDVPSNNEAALMKAVANQ-PVSVAID 265
           G+ TE  YPY+A D +C+    NP A  A   GY D+P  +E AL KA+A   PVS+AID
Sbjct: 203 GIDTEKSYPYEAIDDTCH---FNPKAVGATDKGYVDIPQGDEEALKKALATVGPVSIAID 259

Query: 266 ASGSDFQFYSSGV-FTGQCGTE-LDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIR 323
           AS   FQFYS GV +  QC +E LDHGV AVGYGT+++G  YWLVKNSWGTTWG+ GY++
Sbjct: 260 ASHESFQFYSEGVYYEPQCDSENLDHGVLAVGYGTSEEGEDYWLVKNSWGTTWGDQGYVK 319

Query: 324 MQRDIDAKEGLCGIAMQASYP 344
           M R+ D     CG+A  ASYP
Sbjct: 320 MARNRDNH---CGVATCASYP 337


>gi|157093357|gb|ABV22333.1| cysteine protease 1 [Noctiluca scintillans]
          Length = 338

 Score =  276 bits (706), Expect = 1e-71,   Method: Compositional matrix adjust.
 Identities = 150/308 (48%), Positives = 189/308 (61%), Gaps = 22/308 (7%)

Query: 45  QYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGY 104
           +YG+VY    E  +RF IFK NV+ I + N  ARN  + LG+NEF D T EEF A   G 
Sbjct: 33  KYGKVYNGINEDAVRFGIFKANVDIIYATN--ARNLTFALGVNEFTDLTQEEFAASYTGL 90

Query: 105 K-----RRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVA 159
           K       LP + + E       Y  A + +S+DW  +G VT VK+QGQCG CW+FS   
Sbjct: 91  KPASLWSGLPRLSTHE-------YNGAPLASSVDWTTQGVVTPVKNQGQCGSCWSFSTTG 143

Query: 160 AMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYK 219
           A+EG   ++T  L SLSEQ+  DCDT+  D GC GG MD+AF F   N  + TE  YPY 
Sbjct: 144 ALEGAWALSTGNLVSLSEQQFEDCDTT--DSGCNGGWMDNAFSFAKKNS-ICTEGSYPYT 200

Query: 220 ASDGSCNKK--EANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSG 277
           A+DG+CN    +       + GY DV +++E A+M AVA QPVS+AI+A    FQ YSSG
Sbjct: 201 ATDGTCNLSGCQVGIPQGGVVGYTDVSTDSEQAMMSAVAQQPVSIAIEADQYSFQLYSSG 260

Query: 278 VFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCG- 336
           V T  CGT LDHGV AVGYG+ + GT YW VKNSWG++WGE GY+R+QR      G CG 
Sbjct: 261 VLTASCGTRLDHGVLAVGYGS-EAGTDYWKVKNSWGSSWGEQGYVRLQRG-KGGAGECGL 318

Query: 337 IAMQASYP 344
           +A   SYP
Sbjct: 319 LAGPPSYP 326


>gi|303283194|ref|XP_003060888.1| predicted protein [Micromonas pusilla CCMP1545]
 gi|226457239|gb|EEH54538.1| predicted protein [Micromonas pusilla CCMP1545]
          Length = 422

 Score =  276 bits (705), Expect = 1e-71,   Method: Compositional matrix adjust.
 Identities = 155/323 (47%), Positives = 201/323 (62%), Gaps = 13/323 (4%)

Query: 33  ATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFN--NKARNKPYKLGINEFA 90
           AT+  R + W+A +G+ Y    E+  R  IF +N E++   N  + A  K + L +N  A
Sbjct: 64  ATIEARFDRWLATHGKAYACPKERAKRLAIFADNAEFVRVHNEAHAAGKKSHWLRLNHLA 123

Query: 91  DQTNEEFRAPRNGY---KRRLPSVRSSETTDVS-FRYENASVPASIDWRKKGAVTGVKDQ 146
           D T EEF+    GY   K+R+ S  SS   D + + Y + + P ++DW  +GAVT VK+Q
Sbjct: 124 DLTREEFKH-MLGYDASKKRVES--SSPPVDAANWEYADVTPPETMDWVSRGAVTPVKNQ 180

Query: 147 GQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIIS 206
           GQCG CWAFS V A+EG+  + T  L SLSEQELV C   G + GC+GGLMD+ FE+I+ 
Sbjct: 181 GQCGSCWAFSTVGAVEGVVAVKTGDLISLSEQELVSCAKIGGNNGCKGGLMDNGFEWIVE 240

Query: 207 NKGLATEAKYPYKASDGSCN-KKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAID 265
           N+G+  E  + Y A D  CN  K+    AA I G++DVP N+E AL KAV+ QPV+VAI+
Sbjct: 241 NRGVDDEEDWGYLAKDRRCNWFKKRRAKAASIDGFKDVPRNDEDALKKAVSQQPVAVAIE 300

Query: 266 ASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGT---KYWLVKNSWGTTWGENGYI 322
           A   +FQ YS GVF G+CGT LDHGV  VGYG   +      YW VKNSWG  WGE GYI
Sbjct: 301 ADHREFQLYSGGVFDGECGTNLDHGVLVVGYGYDGESAGHKHYWTVKNSWGAKWGEEGYI 360

Query: 323 RMQRDIDAKEGLCGIAMQASYPT 345
           R+ R      G CG+AMQASYPT
Sbjct: 361 RIARGGMGPAGQCGVAMQASYPT 383


>gi|91992508|gb|ABE72970.1| cathepsin L [Aedes aegypti]
          Length = 339

 Score =  276 bits (705), Expect = 1e-71,   Method: Compositional matrix adjust.
 Identities = 157/321 (48%), Positives = 200/321 (62%), Gaps = 24/321 (7%)

Query: 40  EMWMA---QYGRVYRDNAEKEMRFKIFKENVEYIASFNNK--ARNKPYKLGINEFADQTN 94
           E W A   Q+ + Y    E+ +R KI+ +N   IA  N +     + Y+L +N++AD  +
Sbjct: 25  EEWNAFKLQHRKNYDSETEERIRLKIYVQNKHKIAKHNQRFDLGQEKYRLRVNKYADLLH 84

Query: 95  EEFRAPRNGY-----KRRLPSVRSSETTDVSF-RYENASVPASIDWRKKGAVTGVKDQGQ 148
           EEF    NG+     K+ L  VR  E   V+F    N  VP ++DWRKKGAVT VKDQG 
Sbjct: 85  EEFVQTVNGFNRTDSKKSLKGVRIEEP--VTFIEPANVEVPTTVDWRKKGAVTPVKDQGH 142

Query: 149 CGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNK 208
           CG CW+FSA  A+EG +   T KL SLSEQ LVDC     + GC GG+MD AF++I  N 
Sbjct: 143 CGSCWSFSATGALEGQHFRKTGKLVSLSEQNLVDCSGKYGNNGCNGGMMDYAFQYIKDNG 202

Query: 209 GLATEAKYPYKASDGSCNKKEANPSA--AKISGYEDVPSNNEAALMKAVANQ-PVSVAID 265
           G+ TE  YPY+A D +C+    NP A  A   GY D+P  +E AL KA+A   PVS+AID
Sbjct: 203 GIDTEKSYPYEAIDDTCH---FNPKAVGATDKGYVDIPQGDEEALKKALATVGPVSIAID 259

Query: 266 ASGSDFQFYSSGV-FTGQCGTE-LDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIR 323
           AS   FQFYS GV +  QC +E LDHGV AVGYGT+++G  YWLVKNSWGTTWG+ GY++
Sbjct: 260 ASHESFQFYSEGVYYEPQCDSENLDHGVLAVGYGTSEEGEDYWLVKNSWGTTWGDQGYVK 319

Query: 324 MQRDIDAKEGLCGIAMQASYP 344
           M R+ D     CG+A  ASYP
Sbjct: 320 MARNHDNH---CGVATCASYP 337


>gi|357114837|ref|XP_003559200.1| PREDICTED: fruit bromelain-like [Brachypodium distachyon]
          Length = 371

 Score =  275 bits (704), Expect = 2e-71,   Method: Compositional matrix adjust.
 Identities = 153/326 (46%), Positives = 194/326 (59%), Gaps = 16/326 (4%)

Query: 30  LNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEF 89
           L+D  M +R   W A + R Y D  E+  RF++++ N+EYI + N +     Y+LG N+F
Sbjct: 50  LDDMLMLDRFVRWQAAHNRTYGDAEERLRRFQVYRANIEYIEATNRRG-GLTYELGENQF 108

Query: 90  ADQTNEEFR---APRNGYKRRLPSVRSSETTDVSFRYE------NASVPASIDWRKKGAV 140
           AD T+EEF    A       R     +  TTDV+           A  P S DWR KGAV
Sbjct: 109 ADLTSEEFLSMYASSYDAGDRADDEAALITTDVAGDGAWSDGDLEALPPPSWDWRAKGAV 168

Query: 141 TGVKDQGQ-CGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDD 199
           T  K+QG  C  CWAF  VA +EG+  I T KL SLSEQ+LVDCD    D GC  G    
Sbjct: 169 TPPKNQGPTCSSCWAFVTVATIEGLTFIKTGKLISLSEQQLVDCDMY--DGGCNTGSYSR 226

Query: 200 AFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQP 259
            F +++ N GL TEA+YPY A+ G CN+ ++   AAKI+G   +P  NE  + KAVA QP
Sbjct: 227 GFRWVLENGGLTTEAEYPYTAARGPCNRAKSAHHAAKITGQGRIPPQNELVMQKAVAGQP 286

Query: 260 VSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGT-ADDGTKYWLVKNSWGTTWGE 318
           V VAI+  GS  QFY +GV++G CGT L H VT VGYG     G KYW+VKNSWG  WGE
Sbjct: 287 VGVAIEV-GSGMQFYKTGVYSGPCGTNLAHAVTVVGYGVDPASGAKYWIVKNSWGQAWGE 345

Query: 319 NGYIRMQRDIDAKEGLCGIAMQASYP 344
            G+IRM+RD+    GLCGIA+  +YP
Sbjct: 346 RGFIRMRRDVGGP-GLCGIALDVAYP 370


>gi|159485468|ref|XP_001700766.1| cysteine endopeptidase [Chlamydomonas reinhardtii]
 gi|158281265|gb|EDP07020.1| cysteine endopeptidase [Chlamydomonas reinhardtii]
          Length = 498

 Score =  275 bits (704), Expect = 2e-71,   Method: Compositional matrix adjust.
 Identities = 154/313 (49%), Positives = 201/313 (64%), Gaps = 17/313 (5%)

Query: 41  MWMAQYGRVYRDNA-EKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRA 99
           +W  Q+ R Y + + E   R  +F +NV  IA  N   RN    L +NE+AD+T EEF A
Sbjct: 42  LWATQHARTYSEGSPEYTRRLGVFADNVRAIAEQNR--RNTGITLALNEYADETWEEFAA 99

Query: 100 PRNGYKRRLPSVRSSET-----TDVSFRYENASVPASIDWRKKGAVTGVKDQGQCGCCWA 154
            R G K     +++ E      +  S+RY     PA++DWR K AVT VK+QGQCG CWA
Sbjct: 100 KRLGLKISQEQLKAREARSSSSSSSSWRYAQVQTPAAVDWRAKNAVTQVKNQGQCGSCWA 159

Query: 155 FSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEA 214
           FSAV ++EG N + T +L +LSEQ+LVDCDT+  + GC GGLMDDAF++++ N G+ TE 
Sbjct: 160 FSAVGSIEGANALATGQLVALSEQQLVDCDTA-SNMGCSGGLMDDAFKYVLDNGGIDTEE 218

Query: 215 KYPYKASDGS---CNK-KEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSD 270
            Y Y +  G    CNK K+ +  A  I GYEDVP+ +E AL+KAVA QPV+VAI AS ++
Sbjct: 219 DYSYWSGYGFGFWCNKRKQTDRPAVSIDGYEDVPT-SEPALLKAVAGQPVAVAICAS-AN 276

Query: 271 FQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDA 330
            QFYSSGV    C   L+HGV AVGY T+D    YW+VKNSWG +WGE GY R++   + 
Sbjct: 277 MQFYSSGVIN-SCCEGLNHGVLAVGYDTSDKAQPYWIVKNSWGGSWGEQGYFRLKMG-EG 334

Query: 331 KEGLCGIAMQASY 343
            +GLCGIA  ASY
Sbjct: 335 PKGLCGIASAASY 347


>gi|440793751|gb|ELR14926.1| Cysteine proteinase 5, putative [Acanthamoeba castellanii str.
           Neff]
          Length = 326

 Score =  275 bits (703), Expect = 2e-71,   Method: Compositional matrix adjust.
 Identities = 148/307 (48%), Positives = 192/307 (62%), Gaps = 15/307 (4%)

Query: 42  WMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPR 101
           WM ++ + Y  N E   R+ +++EN  YI + N++  NK + L +N+F D TN EF    
Sbjct: 33  WMQEHQKSYA-NEEFVYRWNVWRENYLYIEAHNHQ--NKSFHLAMNKFGDLTNAEFNKLF 89

Query: 102 NGYKRRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAM 161
            G    + + ++ + +D++       +PA  DWR+KGAVT VK+QGQCG CW+FS   + 
Sbjct: 90  KGLS--ITADQAKQESDIA---PAPGLPADFDWRQKGAVTHVKNQGQCGSCWSFSTTGST 144

Query: 162 EGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKAS 221
           EG N +   +LTSLSEQ LVDC TS  + GC GGLMD AFE+II NKG+ TE  YPY AS
Sbjct: 145 EGANFLKHGRLTSLSEQNLVDCSTSYGNHGCNGGLMDYAFEYIIRNKGIDTEESYPYHAS 204

Query: 222 DGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVF-- 279
            G+C   + + S  ++  Y +VPS NE AL+ AVA QP SVAIDAS S FQFY  GV+  
Sbjct: 205 QGTCRYNKQH-SGGELVSYTNVPSGNEGALLNAVATQPTSVAIDASHSSFQFYKGGVYDE 263

Query: 280 TGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAM 339
                + LDHGV AVG+G   DG  YWLVKNSWG  WG +GYI M R+   K   CGIA 
Sbjct: 264 PACSSSRLDHGVLAVGWGV-RDGKDYWLVKNSWGADWGLSGYIEMSRN---KHNQCGIAT 319

Query: 340 QASYPTA 346
            AS+P A
Sbjct: 320 AASHPHA 326


>gi|18396952|ref|NP_564322.1| cysteine proteinase-like protein [Arabidopsis thaliana]
 gi|332192922|gb|AEE31043.1| cysteine proteinase-like protein [Arabidopsis thaliana]
          Length = 334

 Score =  275 bits (703), Expect = 2e-71,   Method: Compositional matrix adjust.
 Identities = 145/355 (40%), Positives = 211/355 (59%), Gaps = 34/355 (9%)

Query: 3   MILLENKLVLAAILVLGVWAPQSWSR-TLNDATMNERHEMWMAQYGRVYRDNAEKEMRFK 61
           M+ + +  V   IL + +   Q+    TLN+ ++ + H+ WM Q+ RVY+D +EKEMR K
Sbjct: 1   MVSVRSVFVALTILSMDLRISQARPHVTLNEQSIVDYHQQWMTQFSRVYKDESEKEMRLK 60

Query: 62  IFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSV---------- 111
           +FK+N+++I +FNN   N+ Y LG+NEF D   EEF A   G +  + S+          
Sbjct: 61  VFKKNLKFIENFNNMG-NQSYTLGVNEFTDWKTEEFLATHTGLRVNVTSLSELFNKTKPS 119

Query: 112 RSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRK 171
           R+   +D+    E      S DWR +GAVT VK QG C              +  I+ + 
Sbjct: 120 RNWNMSDIDMEDE------SKDWRDEGAVTPVKYQGACR-------------LTKISGKN 160

Query: 172 LTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEAN 231
           L +LSEQ+L+DCD   ++ GC GG  ++AF++II N G++ E +YPY+    SC      
Sbjct: 161 LLTLSEQQLIDCDIE-KNGGCNGGEFEEAFKYIIKNGGVSLETEYPYQVKKESCRANARR 219

Query: 232 PSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTG-QCGTELDHG 290
               +I G++ VPS+NE AL++AV  QPVSV IDA    F  Y  GV+ G  CGT+++H 
Sbjct: 220 APHTQIRGFQMVPSHNERALLEAVRRQPVSVLIDARADSFGHYKGGVYAGLDCGTDVNHA 279

Query: 291 VTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPT 345
           VT VGYGT   G  YW++KNSWG +WGENGY+R++RD++  +G+CGIA  A+YP 
Sbjct: 280 VTIVGYGTM-SGLNYWVLKNSWGESWGENGYMRIRRDVEWPQGMCGIAQVAAYPV 333


>gi|125592009|gb|EAZ32359.1| hypothetical protein OsJ_16569 [Oryza sativa Japonica Group]
          Length = 480

 Score =  275 bits (702), Expect = 3e-71,   Method: Compositional matrix adjust.
 Identities = 147/337 (43%), Positives = 198/337 (58%), Gaps = 47/337 (13%)

Query: 39  HEMWMAQYGRVYRD--NAEKEMRFKIFKENVEYIASFNNKARNKP-YKLGINEFADQTNE 95
           +++W+A+ G    +    E E RF +F +N++++ + N +A  +  ++LG+N        
Sbjct: 52  YDLWLAENGGGSPNALGGEHERRFLVFWDNLKFVDAHNARADERGGFRLGMNRL------ 105

Query: 96  EFRAPRNGYKRRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGV------------ 143
                R  ++R +P                  VP     R+ G   GV            
Sbjct: 106 -----RRSHQRGVPRDLPRRQGRREEPRRRGEVPP----RRGGGAAGVRRLEGEGRRRPR 156

Query: 144 ---------------KDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGE 188
                          K  GQ G CWAFSAV+ +E IN + T ++ +LSEQELV+C T+G+
Sbjct: 157 QEPGPMRSFSVHLSVKYFGQ-GSCWAFSAVSTVESINQLVTGEMITLSEQELVECSTNGQ 215

Query: 189 DQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNE 248
           + GC GGLMDDAF+FII N G+ TE  YPYKA DG C+    N     I G+EDVP N+E
Sbjct: 216 NSGCNGGLMDDAFDFIIKNGGIDTEDDYPYKAVDGKCDINRENAKVVSIDGFEDVPQNDE 275

Query: 249 AALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLV 308
            +L KAVA+QPVSVAI+A G +FQ Y SGVF+G+CGT LDHGV AVGYGT D+G  YW+V
Sbjct: 276 KSLQKAVAHQPVSVAIEAGGREFQLYHSGVFSGRCGTSLDHGVVAVGYGT-DNGKDYWIV 334

Query: 309 KNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPT 345
           +NSWG  WGE+GY+RM+R+I+   G CGIAM ASYPT
Sbjct: 335 RNSWGPKWGESGYVRMERNINVTTGKCGIAMMASYPT 371


>gi|238481789|gb|ACR43934.1| cathepsin L-like cysteine proteinase [Haliotis diversicolor
           supertexta]
          Length = 347

 Score =  275 bits (702), Expect = 3e-71,   Method: Compositional matrix adjust.
 Identities = 157/305 (51%), Positives = 197/305 (64%), Gaps = 12/305 (3%)

Query: 45  QYGRVYRDNAEKEMRFKIFKENVEYIASFNNKAR--NKPYKLGINEFADQTNEEFRAPRN 102
           Q+GR+Y  + E+E RF+IFK+N++YI   N K     K Y LGIN+FAD  NEEFR   N
Sbjct: 48  QHGRLYEKHEEEEERFEIFKQNLQYIEEHNKKFSLGQKSYYLGINQFADMKNEEFRM-YN 106

Query: 103 GYKRRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAME 162
           G +R     R  + ++     E    P  +DWRKKG VT VK+QGQCG CW+FS   ++E
Sbjct: 107 GLRRDYNYSREVQCSN-HLTPEYLVAPDEVDWRKKGYVTAVKNQGQCGSCWSFSTTGSLE 165

Query: 163 GINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASD 222
           G +   + KL SLSEQ+LVDC     ++GC GGLMD AFE+II+N G+ TE +YPY A  
Sbjct: 166 GQHFHKSGKLVSLSEQQLVDCSGKFGNEGCNGGLMDQAFEYIITNGGIETEEEYPYDARQ 225

Query: 223 GSCNKKEANPSAAKISGYEDVPSNNEAALMKAVAN-QPVSVAIDASGSDFQFYSSGVF-T 280
             C+ K++   AA  SG  DV S +E  L  +VA   PVS+AIDAS   FQ YS GV+  
Sbjct: 226 ERCHFKKSE-VAATASGCVDVKSGDETDLKNSVAEVGPVSIAIDASHQSFQLYSGGVYDE 284

Query: 281 GQC-GTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAM 339
            +C  TELDHGV  VGYGT DDG  YWLVKNSWGTTWG  GY++M R+ D +   CG+A 
Sbjct: 285 PKCSSTELDHGVLVVGYGT-DDGQDYWLVKNSWGTTWGLEGYVKMSRNQDNQ---CGVAT 340

Query: 340 QASYP 344
           QASYP
Sbjct: 341 QASYP 345


>gi|307192137|gb|EFN75465.1| Cathepsin L [Harpegnathos saltator]
          Length = 339

 Score =  275 bits (702), Expect = 3e-71,   Method: Compositional matrix adjust.
 Identities = 153/340 (45%), Positives = 194/340 (57%), Gaps = 14/340 (4%)

Query: 13  AAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIAS 72
           A  L+LG+ A        N  T  E    +   + + Y    E+  R KIF EN   IA 
Sbjct: 4   AIFLLLGILAAAQAISFFNLVT--EEWNTFKVTHRKAYDSKIEESFRMKIFMENWHKIAL 61

Query: 73  FNNK--ARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRY---ENAS 127
            N K       YKLG+N++ D  + EF    NG+ + + +   ++   +  R+    N  
Sbjct: 62  HNQKYELNEVSYKLGMNKYGDMLHHEFINTLNGFNKSVSAQLRAQRRPIGSRFIEPANVE 121

Query: 128 VPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSG 187
           +P+S+DWR  GAVT +KDQG CG CW+FSA  A+EG ++  T KL SLSEQ L+DC    
Sbjct: 122 IPSSVDWRTHGAVTPIKDQGHCGSCWSFSATGALEGQHYRITGKLVSLSEQNLIDCSGRY 181

Query: 188 EDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNN 247
            + GC GGLMD AF++I  N GL TE  YPY+A +  C     N  A   SGY D+P  N
Sbjct: 182 GNNGCNGGLMDQAFQYIKDNHGLDTEISYPYEAENDKCRYNPRNNGATD-SGYVDIPEGN 240

Query: 248 EAALMKAVAN-QPVSVAIDASGSDFQFYSSGV-FTGQCGTE-LDHGVTAVGYGTADDGTK 304
           E  L  AVA   PVSVAIDAS   FQFY  GV +  +C +E LDHGV  VGYGT D+   
Sbjct: 241 EKKLKAAVATIGPVSVAIDASAESFQFYREGVYYEPRCSSENLDHGVLVVGYGTDDNDQD 300

Query: 305 YWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
           YWLVKNSWG TWG+ GYI+M R+   K+  CGIA  ASYP
Sbjct: 301 YWLVKNSWGVTWGDEGYIKMARN---KDNHCGIASSASYP 337


>gi|32396018|gb|AAP41846.1| cysteine protease [Anthurium andraeanum]
          Length = 502

 Score =  274 bits (701), Expect = 4e-71,   Method: Compositional matrix adjust.
 Identities = 151/323 (46%), Positives = 194/323 (60%), Gaps = 20/323 (6%)

Query: 36  NERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYK---LGINEFADQ 92
            E  E WM ++ +VY    EK  R+  F  N+ ++   N + R  P     +G+N FAD 
Sbjct: 48  QELFERWMEKHRKVYAHPGEKARRYANFLSNLAFVRKRNAEGRRAPSSGQGVGMNVFADL 107

Query: 93  TNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENASV------PASIDWRKKGAVTGVKDQ 146
           +NEEFR     Y  R+   +++E      R     V      PAS+DWRK+GAVT VK+Q
Sbjct: 108 SNEEFREV---YSSRVLRKKAAEGRGARRRAGEGRVVAGCDAPASLDWRKRGAVTAVKNQ 164

Query: 147 GQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIIS 206
           G CG CWAFS+  AMEGIN ITT +L SLSEQELVDCDT+ E  GC+GG MD AFE++I+
Sbjct: 165 GDCGSCWAFSSTGAMEGINAITTGELISLSEQELVDCDTTNE--GCDGGYMDYAFEWVIN 222

Query: 207 NKGLATEAKYPYKA-SDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAID 265
           N G+ +EA YPY   +D  CN  +       I GYEDV + +E+AL+ A   QPVSV ID
Sbjct: 223 NGGIDSEANYPYTGQADSVCNTTKEEIKVVSIDGYEDV-ATSESALLCAAVQQPVSVGID 281

Query: 266 ASGSDFQFYSSGVFTGQCG---TELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYI 322
            S  DFQ Y+ G++ G C     ++DH V  VGYG    GT YW+VKNSWGT WG  GYI
Sbjct: 282 GSSLDFQLYAGGIYDGDCSGNPDDIDHAVLVVGYGQ-QGGTDYWIVKNSWGTDWGMQGYI 340

Query: 323 RMQRDIDAKEGLCGIAMQASYPT 345
            ++R+     G+C I   ASYPT
Sbjct: 341 YIRRNTGLPYGVCAIDAMASYPT 363


>gi|151176971|gb|ABR88030.1| digestive cysteine protease [Dermestes frischii]
          Length = 339

 Score =  274 bits (701), Expect = 4e-71,   Method: Compositional matrix adjust.
 Identities = 156/346 (45%), Positives = 202/346 (58%), Gaps = 23/346 (6%)

Query: 10  LVLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEY 69
            VLA + ++G  A   +        + E+   +  Q+ + Y+ + E++ R KIF EN   
Sbjct: 4   FVLALVFIVGAQAVSFFD------LVQEQWGTFKLQHKKQYKSDTEEKFRMKIFMENSHK 57

Query: 70  IASFNNKARNK--PYKLGINEFADQTNEEFRAPRNGYKR--RLPSVRSSETTD--VSFRY 123
           +A  N         YKL IN++AD  + EF    NG+ R    P + +SE          
Sbjct: 58  VAKXNKLYEMGLVSYKLKINKYADMLHHEFVHTVNGFNRTKNTPLLGTSEDEQGATFIAP 117

Query: 124 ENASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDC 183
            N   P ++DWR+ GAVT VKDQG CG CW+FSA  A+EG +   T KL SLSEQ LVDC
Sbjct: 118 ANVKFPENVDWREHGAVTXVKDQGHCGSCWSFSATGALEGQHFRKTNKLVSLSEQNLVDC 177

Query: 184 DTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANP--SAAKISGYE 241
            T   + GC GGLMD+AF+++  N G+ TEA YPY A D  C+    NP  S A   G+ 
Sbjct: 178 STKFGNDGCNGGLMDNAFKYVKYNHGIDTEASYPYHADDEKCH---YNPKTSGATDRGFV 234

Query: 242 DVPSNNEAALMKAVANQ-PVSVAIDASGSDFQFYSSGVFTG-QCGT-ELDHGVTAVGYGT 298
           D+P+ +E  LM AVA   PVSVAIDAS   FQ YS GV+   +C + ELDHGV  VGYGT
Sbjct: 235 DIPTGDEEKLMAAVATVGPVSVAIDASHESFQLYSEGVYYDPECSSEELDHGVLVVGYGT 294

Query: 299 ADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
            ++G  YW+VKNSWG +WGE GYI+M R+ D     CGIA QASYP
Sbjct: 295 DENGQDYWIVKNSWGESWGEQGYIKMARNRDNN---CGIATQASYP 337


>gi|194757786|ref|XP_001961143.1| GF13722 [Drosophila ananassae]
 gi|190622441|gb|EDV37965.1| GF13722 [Drosophila ananassae]
          Length = 417

 Score =  274 bits (700), Expect = 4e-71,   Method: Compositional matrix adjust.
 Identities = 158/360 (43%), Positives = 206/360 (57%), Gaps = 27/360 (7%)

Query: 1   MAMILLENKLVLAAIL-----VLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAE 55
           + ++L  N  +L  IL        + A +   RT   AT        + ++ + Y D  E
Sbjct: 67  VVVMLFVNAFILVFILKKRKAYQNLKATEEQPRTSYAATSTH-----VLEHRKNYLDETE 121

Query: 56  KEMRFKIFKENVEYIASFNN--KARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRS 113
           +  R KIF EN   IA  N    +    YKL +N++AD  + EFR   NG+   L   + 
Sbjct: 122 ERFRLKIFNENKHKIAKHNQLWASGKVSYKLAVNKYADMLHHEFRQLMNGFNYTLH--KE 179

Query: 114 SETTDVSFR------YENASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHI 167
               D SF+       E+ ++P S+DWR KGAVTGVKDQG CG CWAFS+  A+EG ++ 
Sbjct: 180 LRAADESFKGVTFISPEHVTLPKSVDWRDKGAVTGVKDQGHCGSCWAFSSTGALEGQHYR 239

Query: 168 TTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNK 227
            +  L SLSEQ LVDC T   + GC GGLMD+AF +I  N G+ TE  YPY+A D SC+ 
Sbjct: 240 KSGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKDNGGIDTEKSYPYEALDDSCHF 299

Query: 228 KEANPSAAKISGYEDVPSNNEAALMKAVAN-QPVSVAIDASGSDFQFYSSGVFT-GQCGT 285
            +    A    G+ D+P  NE  L +AVA   PVSVAIDAS   FQFYS GV+    C  
Sbjct: 300 NKGTIGATD-RGFVDIPQGNEKKLAEAVATIGPVSVAIDASHESFQFYSEGVYVEPACDA 358

Query: 286 E-LDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
           + LDHGV  VG+GT + G  YWLVKNSWGTTWG+ G+I+M R+   K+  CGIA  +SYP
Sbjct: 359 QNLDHGVLVVGFGTDESGQDYWLVKNSWGTTWGDKGFIKMLRN---KDNQCGIASASSYP 415


>gi|326531188|dbj|BAK04945.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 360

 Score =  274 bits (700), Expect = 5e-71,   Method: Compositional matrix adjust.
 Identities = 154/337 (45%), Positives = 196/337 (58%), Gaps = 32/337 (9%)

Query: 30  LNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEF 89
           + D  M +R  MW A + + YR   E+  RF+++++NVEYI + N +  +  Y+LG N+F
Sbjct: 33  VGDMLMMDRFLMWQATHNQSYRSAEERLRRFQVYRDNVEYIETTNRRG-DLTYQLGENQF 91

Query: 90  ADQTNEEFRAPRNGYKRRL-------------------PSVRSSETTDVSFRYENASVPA 130
           AD T EEF A    Y                       P + SS   DVS        P 
Sbjct: 92  ADLTREEFIARFTSYNGDDDRTGDDDSVITTAAVGGGDPDLWSSGGDDVSLD------PP 145

Query: 131 SIDWRKKGAVTGVKDQGQCGCC-WAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGED 189
           S+DWR KGAV   K Q       WAF AVA +E ++ I T KL +LSEQ+LVDCD    D
Sbjct: 146 SVDWRAKGAVVPPKSQSSSCSSSWAFVAVATIESLHAIKTGKLVALSEQQLVDCDQY--D 203

Query: 190 QGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEA 249
            GC  G    AF ++I N GL TEA+YPY A+ G+CN  +++   A ISG+  VP +NE 
Sbjct: 204 GGCNRGTFRRAFHWVIQNGGLTTEAEYPYTAAQGTCNSAKSDHHVAAISGHASVPGSNEL 263

Query: 250 ALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADD-GTKYWLV 308
           A+  AVA QPV+ AI+  GSD QFY SGV++G CG  L+H VT VGYG  +  G KYW+V
Sbjct: 264 AMKHAVATQPVAAAIEL-GSDMQFYKSGVYSGPCGARLEHAVTVVGYGADESTGDKYWIV 322

Query: 309 KNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPT 345
           KNSWG TWGE GYIRMQR I    GLCGI +  +YPT
Sbjct: 323 KNSWGQTWGERGYIRMQRKI-LGPGLCGIMLDVAYPT 358


>gi|357130488|ref|XP_003566880.1| PREDICTED: fruit bromelain-like [Brachypodium distachyon]
          Length = 356

 Score =  274 bits (700), Expect = 5e-71,   Method: Compositional matrix adjust.
 Identities = 145/320 (45%), Positives = 193/320 (60%), Gaps = 16/320 (5%)

Query: 38  RHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEF 97
           RHE WMA++GRVY D  EK  R ++F  N  Y+ + N +A N+ Y LG+N+F+D T++EF
Sbjct: 38  RHEEWMAKFGRVYTDAQEKARRQEVFGANARYVDAVN-RAGNRTYTLGLNKFSDLTDDEF 96

Query: 98  RAPRNGYKRRLPSVRSSETTDVS----FRYENASVPASIDWRKKGAVTGVKDQGQCGCCW 153
                GY+         E  +VS      Y  A +P S+DWR +GAVTGVK+QG CGCCW
Sbjct: 97  VQTHLGYRGHQQGGLRPEEENVSKVAALGYGQADMPESVDWRAQGAVTGVKNQGSCGCCW 156

Query: 154 AFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQG----CEGGLMDDAFEFIISNKG 209
           AF+AVAA EG+  I T  L S+SEQ+++DC       G    C+GG +DDA  ++ +++G
Sbjct: 157 AFAAVAATEGLVKIATGNLISMSEQQVLDCTGQSPGMGNTNTCDGGHIDDALRYVAASRG 216

Query: 210 LATEAKYPYKASDGSCNKKEANPSAAKISGYEDVP-SNNEAALMKAVANQPVSVAIDASG 268
           L  EA Y Y    G+C       SAA     + V    +E  L   VA QP++V+++AS 
Sbjct: 217 LQPEAAYAYTGLQGACQSGFTPNSAASFGEPQTVTLQGDEGRLQGLVAGQPIAVSVEAS- 275

Query: 269 SDFQFYSSGVFTG---QCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQ 325
            DF+ Y SGVFT     CG  L+H VT VGYG+AD G +YWLVKN WGT+WGE GY+R+ 
Sbjct: 276 DDFRHYMSGVFTAGTSSCGQRLNHAVTVVGYGSADGGQEYWLVKNQWGTSWGEGGYMRIA 335

Query: 326 RDIDAKEGLCGIAMQASYPT 345
           R   A    CGI+  A YPT
Sbjct: 336 RGNGAPN--CGISAYAYYPT 353


>gi|21953244|emb|CAD42716.1| putative cathepsin L [Myzus persicae]
          Length = 341

 Score =  274 bits (700), Expect = 5e-71,   Method: Compositional matrix adjust.
 Identities = 157/343 (45%), Positives = 206/343 (60%), Gaps = 14/343 (4%)

Query: 12  LAAILVLGV--WAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEY 69
           +  ++VLG+  +A  S S    +  + E   ++  Q+ ++Y D  E+  R K++ +N   
Sbjct: 1   MKVVIVLGLVAFAISSVSSINLNEVIEEEWSLFKMQFKKLYEDIKEETFRKKVYLDNKLK 60

Query: 70  IASFNN--KARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTD--VSF-RYE 124
           IA  N   ++  + Y L +N F D    E+    NG+K  L    S+ T D  V+F + E
Sbjct: 61  IARHNKLYESGEETYALEMNHFGDLMQHEYSKMMNGFKPSLAGGDSNFTNDEGVTFLKSE 120

Query: 125 NASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCD 184
           N  +P SIDWRKKG VT VK+QGQCG CW+FSA  ++EG +   T  L SLSEQ L+DC 
Sbjct: 121 NVVIPKSIDWRKKGYVTPVKNQGQCGSCWSFSATGSLEGQHFRKTGVLVSLSEQNLIDCS 180

Query: 185 TSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVP 244
               + GCEGGLMD AF++I SNKGL TE  YPY+A D  C     N S A  +G+ D+P
Sbjct: 181 RKYGNNGCEGGLMDLAFKYIKSNKGLDTEKSYPYEAEDDKCRYNPDN-SGATDNGFVDIP 239

Query: 245 SNNEAALMKAVANQ-PVSVAIDASGSDFQFYSSGVFTG-QC-GTELDHGVTAVGYGTADD 301
             +E ALM A+A   PVS+AIDAS   FQFY  GVF   +C  TELDHGV AVG+ T   
Sbjct: 240 EGDEEALMHALATVGPVSIAIDASSEKFQFYKKGVFYNPRCSSTELDHGVLAVGFRTDKK 299

Query: 302 GTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
           G  YW+VKNSWG TWG+ GYI M R+   K+  CG+A  ASYP
Sbjct: 300 GGDYWIVKNSWGKTWGDEGYIMMARN---KKNNCGVASSASYP 339


>gi|330803820|ref|XP_003289900.1| hypothetical protein DICPUDRAFT_80649 [Dictyostelium purpureum]
 gi|325080011|gb|EGC33585.1| hypothetical protein DICPUDRAFT_80649 [Dictyostelium purpureum]
          Length = 328

 Score =  273 bits (699), Expect = 6e-71,   Method: Compositional matrix adjust.
 Identities = 152/342 (44%), Positives = 212/342 (61%), Gaps = 19/342 (5%)

Query: 9   KLVLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVE 68
           +++LA +    +    S +R  +        + WM ++ + Y  N E   R+ IF++N++
Sbjct: 2   RIILALVFCFLIVNCISAARVFSQKQYQTAFQNWMVKHQKSYT-NDEFGSRYTIFQDNMD 60

Query: 69  YIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRL--PSVRSSETTDVSFRYENA 126
           ++  +N K  +    LG+N  AD TN+E++    G K  +  P++     TDVS      
Sbjct: 61  FVTKWNQKGSDTI--LGLNSMADLTNQEYQRIYLGTKTTVKKPNLIIG-VTDVS------ 111

Query: 127 SVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTS 186
             PAS+DWR  GAVT VK+QGQCG C++FS   ++EGI+ IT+++L SLSEQ+++DC  S
Sbjct: 112 KAPASVDWRANGAVTAVKNQGQCGGCYSFSTTGSVEGIHEITSKQLVSLSEQQILDCSGS 171

Query: 187 GEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSN 246
             + GC+GGLM ++FE+II+  GL TEA YPY+   G C   +AN   A I+GY++V S 
Sbjct: 172 EGNNGCDGGLMTNSFEYIIAVGGLDTEASYPYEGVVGKCKFNKANI-GATITGYKNVKSG 230

Query: 247 NEAALMKAVANQPVSVAIDASGSDFQFYSSGVF--TGQCGTELDHGVTAVGYGTADDGTK 304
           +E+ L  AVA QPVSVAIDAS + FQ YSSGV+       T+LDHGV AVGYG+   G  
Sbjct: 231 SESDLQTAVAAQPVSVAIDASQNSFQLYSSGVYYEPACSSTQLDHGVLAVGYGS-QSGQD 289

Query: 305 YWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPTA 346
           YW+VKNSWG  WGE G+I M R+   K   CGIA  ASYPTA
Sbjct: 290 YWIVKNSWGADWGEKGFILMARN---KHNNCGIATMASYPTA 328


>gi|29165304|gb|AAO65603.1| cathepsin L precursor [Hydra vulgaris]
          Length = 324

 Score =  273 bits (699), Expect = 7e-71,   Method: Compositional matrix adjust.
 Identities = 158/337 (46%), Positives = 208/337 (61%), Gaps = 20/337 (5%)

Query: 11  VLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYI 70
           V  A+L+LGV    +  R + D    E    W   + +VY  + E+ +R+ I+K+N   I
Sbjct: 3   VFCALLLLGVTLAYTIERPVKD----ESWIQWKMYHNKVYSHDGEETVRYTIWKDNERRI 58

Query: 71  ASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENASVPA 130
              N K  +  + L +N+F D TN EF+A  NGY        S+  T  +F       P 
Sbjct: 59  REHNLKGGD--FILKMNQFGDMTNSEFKA-FNGYLSHKHVNGSTFLTPNNF-----VAPD 110

Query: 131 SIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQ 190
           ++DWR +G VT VKDQGQCG CWAFS   ++EG +   T KL SLSEQ LVDC T+  + 
Sbjct: 111 TVDWRNEGYVTPVKDQGQCGSCWAFSTTGSLEGQHFKKTGKLVSLSEQNLVDCSTAYGNN 170

Query: 191 GCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAA 250
           GC+GGLMD+AF +I  NKG+ +EA YPY A DG C  K+++  AA  +G+ D+P  NE  
Sbjct: 171 GCDGGLMDNAFTYIKENKGIDSEASYPYTAEDGKCVFKKSS-VAATDTGFVDIPEGNENK 229

Query: 251 LMKAVAN-QPVSVAIDASGSDFQFYSSGVFT-GQC-GTELDHGVTAVGYGTADDGTKYWL 307
           L +AVA+  P+SVAIDAS   FQFYSSGV+    C  TELDHGV  VGYGT + G  YWL
Sbjct: 230 LKEAVASVGPISVAIDASHESFQFYSSGVYNEPSCSSTELDHGVLVVGYGT-ESGKDYWL 288

Query: 308 VKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
           VKNSW T+WG+ GYI+M+R+   +   CGIA +ASYP
Sbjct: 289 VKNSWNTSWGDKGYIKMRRNAKNQ---CGIATKASYP 322


>gi|167526493|ref|XP_001747580.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163774026|gb|EDQ87660.1| predicted protein [Monosiga brevicollis MX1]
          Length = 330

 Score =  273 bits (698), Expect = 9e-71,   Method: Compositional matrix adjust.
 Identities = 142/288 (49%), Positives = 186/288 (64%), Gaps = 15/288 (5%)

Query: 57  EMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSET 116
           E  F+    N+  I + N  A N  + +GI +FAD T  EF A    Y +R P   +   
Sbjct: 45  EPAFRCHLANLRVIEAHN--AGNSSFTMGITQFADLTAAEFSA----YVKRFPMNVTRPR 98

Query: 117 TDVSFRYENASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLS 176
            +V   +   +    +DWR+K AVT +K+QGQCG CW+FS   ++EG + I T KL SLS
Sbjct: 99  NEV---WITEAPLQEVDWRQKNAVTEIKNQGQCGSCWSFSTTGSVEGAHAIATGKLVSLS 155

Query: 177 EQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAK 236
           EQ+L+DC T   + GC GGLMD AFE++I+N GL TE  YPY A DG CN ++    AA+
Sbjct: 156 EQQLMDCSTRYGNHGCNGGLMDYAFEYVIANGGLDTEEDYPYTAEDGKCNTEKEKKHAAE 215

Query: 237 ISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGY 296
           I G+ +VP  +E  L  AV+  PVSVAI+A  + FQ Y+SGVF G+CGT LDHGV  VGY
Sbjct: 216 IHGFRNVPKEHEDQLAAAVSIGPVSVAIEADQAGFQHYTSGVFDGKCGTSLDHGVLVVGY 275

Query: 297 GTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
             +DD   YW+VKNSWG +WGE GYIR++R +D K+G+CGI MQASYP
Sbjct: 276 --SDD---YWIVKNSWGKSWGEEGYIRLKRGVD-KKGMCGITMQASYP 317


>gi|326520387|dbj|BAK07452.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 349

 Score =  273 bits (698), Expect = 9e-71,   Method: Compositional matrix adjust.
 Identities = 156/324 (48%), Positives = 193/324 (59%), Gaps = 21/324 (6%)

Query: 31  NDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFA 90
            D  M +R   W A + R Y    E+  RF++++ NVEYI + N +     Y+LG N+FA
Sbjct: 37  GDMLMMDRFRQWQATHNRSYLSAEERLRRFEVYRTNVEYIDATNRRG-GLTYELGENQFA 95

Query: 91  DQTNEEFRAPRNG--------YKRRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTG 142
           D T EEF A   G               + SS  +D S     A  PAS+DWR KGAVT 
Sbjct: 96  DLTGEEFLARYAGGHTGSAITTAAEADGLWSSGGSDGSL---EADPPASVDWRAKGAVTP 152

Query: 143 VKDQG-QCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAF 201
           VK+QG QC  CWAFSAVA ME +  I T KL +LSEQ+LVDCD    D GC  G    AF
Sbjct: 153 VKNQGSQCYSCWAFSAVATMESLYFIKTGKLVALSEQQLVDCDK--YDGGCNKGYYHRAF 210

Query: 202 EFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVS 261
           ++I+ N G+ T A+YPYKA  G+C+   A   A  I+G+  V + NE AL  AVA QP+ 
Sbjct: 211 QWIMENGGITTAAQYPYKAVRGACS---AAKPAVTITGHLAV-AKNELALQSAVARQPIG 266

Query: 262 VAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGY 321
           VAI+   S  QFY SGVF+  CG ++ H V  VGYG    G KYWLVKNSWG TWGE GY
Sbjct: 267 VAIEVPIS-MQFYKSGVFSAACGIQMSHAVVTVGYGADASGLKYWLVKNSWGQTWGEAGY 325

Query: 322 IRMQRDIDAKEGLCGIAMQASYPT 345
           IRM+RD+    GLCGIA+  +YPT
Sbjct: 326 IRMRRDVGGG-GLCGIALDTAYPT 348


>gi|4469155|emb|CAB38315.1| chymopapain isoform III [Carica papaya]
          Length = 361

 Score =  273 bits (698), Expect = 1e-70,   Method: Compositional matrix adjust.
 Identities = 146/347 (42%), Positives = 204/347 (58%), Gaps = 14/347 (4%)

Query: 3   MILLENKLVLAAILVLGVWAPQSWSRTLNDATMNER----HEMWMAQYGRVYRDNAEKEM 58
           +I L   L++   L    +    +S+  +D T  ER     + WM ++ ++Y    EK  
Sbjct: 10  IIFLATCLIIHMGLSSADFYTVGYSQ--DDLTSIERLIQLFDSWMLKHNKIYESIDEKIY 67

Query: 59  RFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGY-KRRLPSVRSSETT 117
           RF+IF++N+ YI   N K  N  Y LG+N FAD +N+EF+    G+       +   +  
Sbjct: 68  RFEIFRDNLMYIDETNKK--NNSYWLGLNGFADLSNDEFKKKYVGFVAEDFTGLEHFDNE 125

Query: 118 DVSFRYENASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSE 177
           D ++++   + P SIDWR KGAVT VK+QG CG CWAFS +A +EGIN I T  L  LSE
Sbjct: 126 DFTYKHV-TNYPQSIDWRAKGAVTPVKNQGACGSCWAFSTIATVEGINKIVTGNLLELSE 184

Query: 178 QELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKI 237
           QELVDCD      GC+GG    + +++ +N G+ T   YP +A    C   +      KI
Sbjct: 185 QELVDCDK--HSYGCKGGYQTTSLQYV-ANNGVHTSKVYPCQAKQYKCRATDKPGPKVKI 241

Query: 238 SGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYG 297
           +GY+ VPSN E + + A+ANQP+S  ++A G  FQ Y SGVF G CGT+LDH VTAVGYG
Sbjct: 242 TGYKRVPSNCETSFLGALANQPLSFLVEAGGKPFQLYKSGVFDGPCGTKLDHAVTAVGYG 301

Query: 298 TADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
           T+ DG  Y ++KNSWG  WGE GY+R++R     +G CG+   + YP
Sbjct: 302 TS-DGKNYIIIKNSWGPNWGEKGYMRLKRQSGNSQGTCGVYKSSYYP 347


>gi|195484843|ref|XP_002090843.1| GE12574 [Drosophila yakuba]
 gi|194176944|gb|EDW90555.1| GE12574 [Drosophila yakuba]
          Length = 341

 Score =  273 bits (697), Expect = 1e-70,   Method: Compositional matrix adjust.
 Identities = 151/327 (46%), Positives = 198/327 (60%), Gaps = 18/327 (5%)

Query: 29  TLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNK-ARNK-PYKLGI 86
           +  D  M E H   + ++ + Y+D+ E+  R KIF EN   IA  N + A  K  +KL +
Sbjct: 20  SFADVVMEEWHTFKL-EHRKNYQDDTEERFRLKIFNENKHKIAKHNQRFAEGKVSFKLAV 78

Query: 87  NEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFR------YENASVPASIDWRKKGAV 140
           N++AD  + EFR   NG+   L   +    TD SF+        + ++P S+DWR KGAV
Sbjct: 79  NKYADLLHHEFRQLMNGFNYTLH--KQLRATDDSFKGVTFISPAHVTLPKSVDWRSKGAV 136

Query: 141 TGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDA 200
           T VKDQG CG CWAFS+  A+EG +   +  L SLSEQ LVDC T   + GC GGLMD+A
Sbjct: 137 TAVKDQGHCGSCWAFSSTGALEGQHFRKSGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNA 196

Query: 201 FEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVAN-QP 259
           F +I  N G+ TE  YPY+A D SC+  +    A    G+ D+P  +E  + +AVA   P
Sbjct: 197 FRYIKDNGGIDTEKSYPYEAIDDSCHFNKGTIGATD-RGFTDIPQGDEKKMAEAVATVGP 255

Query: 260 VSVAIDASGSDFQFYSSGVFT-GQCGTE-LDHGVTAVGYGTADDGTKYWLVKNSWGTTWG 317
           VSVAIDAS   FQFYS GV+   QC  + LDHGV  VG+GT + G  YWLVKNSWGTTWG
Sbjct: 256 VSVAIDASHESFQFYSEGVYNEPQCDAQNLDHGVLVVGFGTDESGDDYWLVKNSWGTTWG 315

Query: 318 ENGYIRMQRDIDAKEGLCGIAMQASYP 344
           + G+I+M R+   K+  CGIA  +SYP
Sbjct: 316 DKGFIKMLRN---KDNQCGIASASSYP 339


>gi|194883222|ref|XP_001975702.1| GG20414 [Drosophila erecta]
 gi|190658889|gb|EDV56102.1| GG20414 [Drosophila erecta]
          Length = 341

 Score =  273 bits (697), Expect = 1e-70,   Method: Compositional matrix adjust.
 Identities = 151/327 (46%), Positives = 199/327 (60%), Gaps = 18/327 (5%)

Query: 29  TLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNK-ARNK-PYKLGI 86
           +  D  M E H   + ++ + Y+D+ E+  R KIF EN   IA  N + A  K  +KL +
Sbjct: 20  SFADVVMEEWHTFKL-EHRKNYQDDTEERFRLKIFNENKHKIAKHNQRYAEGKVSFKLAV 78

Query: 87  NEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFR------YENASVPASIDWRKKGAV 140
           N++AD  + EFR   NG+   L   +   +TD SF+        + ++P S+DWR KGAV
Sbjct: 79  NKYADLLHHEFRQLMNGFNYTLH--KQLRSTDDSFKGVTFISPAHVTLPKSVDWRTKGAV 136

Query: 141 TGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDA 200
           T VKDQG CG CWAFS+  A+EG +   +  L SLSEQ LVDC T   + GC GGLMD+A
Sbjct: 137 TAVKDQGHCGSCWAFSSTGALEGQHFRKSGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNA 196

Query: 201 FEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVAN-QP 259
           F +I  N G+ TE  YPY+A D SC+  +    A    G+ D+P  +E  + +AVA   P
Sbjct: 197 FRYIKDNGGIDTEKSYPYEAIDDSCHFNKGAIGATD-RGFTDIPQGDEKKMAEAVATVGP 255

Query: 260 VSVAIDASGSDFQFYSSGVFT-GQCGTE-LDHGVTAVGYGTADDGTKYWLVKNSWGTTWG 317
           V+VAIDAS   FQFYS GV+   QC  + LDHGV  VGYGT + G  YWLVKNSWGTTWG
Sbjct: 256 VAVAIDASHESFQFYSEGVYNEPQCDAQNLDHGVLVVGYGTDESGDDYWLVKNSWGTTWG 315

Query: 318 ENGYIRMQRDIDAKEGLCGIAMQASYP 344
           + G+I+M R+   K+  CGIA  +SYP
Sbjct: 316 DKGFIKMLRN---KDNQCGIASASSYP 339


>gi|225719058|gb|ACO15375.1| Cathepsin L1 precursor [Caligus clemensi]
          Length = 326

 Score =  272 bits (696), Expect = 1e-70,   Method: Compositional matrix adjust.
 Identities = 153/309 (49%), Positives = 191/309 (61%), Gaps = 16/309 (5%)

Query: 42  WMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARN--KPYKLGINEFADQTNEEFRA 99
           W A +G+VY    E+ +RFKIF+EN   I   N + R     Y LG+N F D  + EF  
Sbjct: 26  WKATHGKVYNSADEESLRFKIFQENSLMITQHNEEYRQGFHTYILGMNHFGDLLHSEFLE 85

Query: 100 PRNGYKRRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVA 159
             NG++  +         DV     NA VP+  +W  KGAVT VKDQG+CG CWAFSA  
Sbjct: 86  RSNGFQGGVSG------GDVFTFDTNAPVPSYANWTAKGAVTPVKDQGKCGSCWAFSATG 139

Query: 160 AMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYK 219
           ++EG   +  +KL SLSEQ+LVDC     + GC GGLMD+AF++ I+NKG+A E  YPY 
Sbjct: 140 SVEGQIFLKKKKLMSLSEQQLVDCSGDEGNLGCGGGLMDNAFKYFIANKGIANEKSYPYT 199

Query: 220 ASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVAN-QPVSVAIDASGSDFQFYSSGV 278
           A D  C  K++  S A IS ++DV   +E  L  AVAN  PVSVAIDAS S FQFY SGV
Sbjct: 200 AKDNDCKYKKS-MSVATISSFKDVKHKDEDQLKMAVANVGPVSVAIDASSSKFQFYESGV 258

Query: 279 FTGQ-CGTE-LDHGVTAVGYGT-ADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLC 335
           +  + C +E LDHGV AVGYGT    G  +WLVKNSW  +WG NGYI+M R+   K+  C
Sbjct: 259 YYDENCSSEVLDHGVLAVGYGTDKKSGMDFWLVKNSWAASWGLNGYIKMARN---KDNNC 315

Query: 336 GIAMQASYP 344
           GIA  ASYP
Sbjct: 316 GIATMASYP 324


>gi|9502426|gb|AAF88125.1|AC021043_18 Putative cysteine proteinase [Arabidopsis thaliana]
          Length = 365

 Score =  272 bits (696), Expect = 1e-70,   Method: Compositional matrix adjust.
 Identities = 147/373 (39%), Positives = 217/373 (58%), Gaps = 39/373 (10%)

Query: 3   MILLENKLVLAAILVLGVWAPQSWSR-TLNDATMNERHEMWMAQYGRVYRDNAEKEMRFK 61
           M+ + +  V   IL + +   Q+    TLN+ ++ + H+ WM Q+ RVY+D +EKEMR K
Sbjct: 1   MVSVRSVFVALTILSMDLRISQARPHVTLNEQSIVDYHQQWMTQFSRVYKDESEKEMRLK 60

Query: 62  IFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSV---------- 111
           +FK+N+++I +FNN   N+ Y LG+NEF D   EEF A   G +  + S+          
Sbjct: 61  VFKKNLKFIENFNNMG-NQSYTLGVNEFTDWKTEEFLATHTGLRVNVTSLSELFNKTKPS 119

Query: 112 RSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQCG------------CCWAFSAVA 159
           R+   +D+    E      S DWR +GAVT VK QG C                 ++ + 
Sbjct: 120 RNWNMSDIDMEDE------SKDWRDEGAVTPVKYQGACPEFPTKQIRRNSLVGKQYTKLL 173

Query: 160 AM------EGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATE 213
            +      EG+  I+ + L +LSEQ+L+DCD   ++ GC GG  ++AF++II N G++ E
Sbjct: 174 GVLSDWGDEGLTKISGKNLLTLSEQQLIDCDIE-KNGGCNGGEFEEAFKYIIKNGGVSLE 232

Query: 214 AKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQF 273
            +YPY+    SC          +I G++ VPS+NE AL++AV  QPVSV IDA    F  
Sbjct: 233 TEYPYQVKKESCRANARRAPHTQIRGFQMVPSHNERALLEAVRRQPVSVLIDARADSFGH 292

Query: 274 YSSGVFTG-QCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKE 332
           Y  GV+ G  CGT+++H VT VGYGT   G  YW++KNSWG +WGENGY+R++RD++  +
Sbjct: 293 YKGGVYAGLDCGTDVNHAVTIVGYGTM-SGLNYWVLKNSWGESWGENGYMRIRRDVEWPQ 351

Query: 333 GLCGIAMQASYPT 345
           G+CGIA  A+YP 
Sbjct: 352 GMCGIAQVAAYPV 364


>gi|219884655|gb|ACL52702.1| unknown [Zea mays]
 gi|413916718|gb|AFW56650.1| thiol protease SEN102 [Zea mays]
          Length = 349

 Score =  272 bits (696), Expect = 2e-70,   Method: Compositional matrix adjust.
 Identities = 149/318 (46%), Positives = 194/318 (61%), Gaps = 17/318 (5%)

Query: 37  ERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEE 96
           +R + W A+Y R Y    E + RF ++ ENV++I + N    +  Y+LG N+FAD T EE
Sbjct: 35  DRFQAWQAEYNRTYATPEEFQQRFMVYSENVKFIETMNQPGSS--YELGENQFADLTEEE 92

Query: 97  FRAPRNGYKRRLPSVRSSE-----TTDVSFRYENAS------VPASIDWRKKGAVTGVKD 145
           F+   + Y  +L +V SS      T D   R   +        P S+DWR KGAVT VK 
Sbjct: 93  FK---DTYLMKLDNVASSPEAMALTVDTMNRAGTSGGSNTNEAPNSVDWRTKGAVTPVKS 149

Query: 146 QGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFII 205
           Q  CG CWAF+AVA++EG++ I T +L SLSEQE+VDCD  G + GC GG    A E++ 
Sbjct: 150 QQHCGSCWAFAAVASIEGVHKIKTGRLVSLSEQEIVDCDRGGNNHGCHGGHSSSAMEWVT 209

Query: 206 SNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAID 265
            N GL TE+ YPY    G C   +    AAKI G + V   NE AL  AVA +PV+V+I+
Sbjct: 210 RNGGLTTESDYPYVGRQGQCMSDKLGHHAAKIRGRQAVQGKNEGALQHAVAGRPVAVSIN 269

Query: 266 ASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQ 325
           AS + FQFY  G+F+G C T  +H VT VGYG    G KYW+VKNSWG  WGE GY+RMQ
Sbjct: 270 ASRA-FQFYKRGIFSGPCNTTRNHAVTVVGYGANASGHKYWIVKNSWGERWGEKGYVRMQ 328

Query: 326 RDIDAKEGLCGIAMQASY 343
           R + A+EG+CGIA+   Y
Sbjct: 329 RGVRAREGVCGIAIAPFY 346


>gi|91092014|ref|XP_970644.1| PREDICTED: similar to cathepsin-L-like cysteine peptidase 02
           [Tribolium castaneum]
 gi|270001249|gb|EEZ97696.1| cathepsin L precursor [Tribolium castaneum]
          Length = 337

 Score =  272 bits (696), Expect = 2e-70,   Method: Compositional matrix adjust.
 Identities = 155/343 (45%), Positives = 197/343 (57%), Gaps = 19/343 (5%)

Query: 10  LVLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEY 69
           LV  A+ V+G  A   +        + E+   +   + + Y    E+  R KIF EN   
Sbjct: 4   LVFVALCVVGSQAVSFFD------LVQEQWGAFKVTHKKQYESETEERFRMKIFMENAHK 57

Query: 70  IASFNNKARNK--PYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRY---E 124
           +A  N         +KLG+N+++D  N EF    NGY R    +RS E  D S  +    
Sbjct: 58  VAKHNKLYAQGLVSFKLGVNKYSDMLNHEFVHTLNGYNRSKTPLRSGEL-DESITFIPPA 116

Query: 125 NASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCD 184
           N  +P  IDWRK GAVT VKDQGQCG CW+FS   ++EG +   ++KL SLSEQ L+DC 
Sbjct: 117 NVELPKQIDWRKLGAVTPVKDQGQCGSCWSFSTTGSLEGQHFRKSKKLVSLSEQNLIDCS 176

Query: 185 TSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVP 244
               + GC GGLMD+AF +I  N G+ TE  YPYKA D  C+ K  N  A    G+ D+ 
Sbjct: 177 EKYGNNGCNGGLMDNAFRYIKDNGGIDTEQSYPYKAEDEKCHYKPRNKGATD-RGFVDIE 235

Query: 245 SNNEAALMKAVAN-QPVSVAIDASGSDFQFYSSGV-FTGQCGTE-LDHGVTAVGYGTADD 301
           S +E  L  AVA   P+SVAIDAS   FQ YS GV +  +C +E LDHGV  VGYGT +D
Sbjct: 236 SGDEEKLKAAVATVGPISVAIDASHPTFQQYSEGVYYEPECSSEQLDHGVLVVGYGTDED 295

Query: 302 GTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
           G  YWLVKNSWG +WG+ GYI+M R+ D     CGIA QASYP
Sbjct: 296 GNDYWLVKNSWGDSWGDQGYIKMARNRDNN---CGIATQASYP 335


>gi|244539471|dbj|BAH82657.1| cysteine protease [Lotus japonicus]
          Length = 286

 Score =  272 bits (696), Expect = 2e-70,   Method: Compositional matrix adjust.
 Identities = 133/251 (52%), Positives = 168/251 (66%), Gaps = 6/251 (2%)

Query: 37  ERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEE 96
           E  E WM+++G++Y    EK +RF+IFK+N+++I   N    N  Y LG+NEFAD ++ E
Sbjct: 6   ELFESWMSRHGKIYESIEEKLLRFEIFKDNLKHIDETNKVVSN--YWLGLNEFADLSHHE 63

Query: 97  FRAPRNGYKRRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQCGCCWAFS 156
           F+    G K    + R S      F Y +  +P S+DWRKKGAVT +K+QG CG CWAFS
Sbjct: 64  FKKQYLGLKVDFSTRRESSE---EFTYRDVDLPKSVDWRKKGAVTNIKNQGSCGSCWAFS 120

Query: 157 AVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKY 216
            VAA+EGIN I T  LTSLSEQEL+DCD +  + GC GGLMD AF FI+ N GL  E  Y
Sbjct: 121 TVAAVEGINQIVTGNLTSLSEQELIDCDRT-YNSGCNGGLMDYAFSFIVENGGLHKEDDY 179

Query: 217 PYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSS 276
           PY   +G+C   +       ISGY DVP NNE +L+KA+ANQP+SVAI+ASG DFQFYS 
Sbjct: 180 PYIMEEGTCEMSKEESQVVTISGYHDVPQNNEQSLLKALANQPLSVAIEASGRDFQFYSG 239

Query: 277 GVFTGQCGTEL 287
           GVF G CGT+L
Sbjct: 240 GVFDGHCGTQL 250


>gi|221090861|ref|XP_002167224.1| PREDICTED: cathepsin L-like [Hydra magnipapillata]
          Length = 324

 Score =  272 bits (696), Expect = 2e-70,   Method: Compositional matrix adjust.
 Identities = 160/338 (47%), Positives = 207/338 (61%), Gaps = 22/338 (6%)

Query: 11  VLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYI 70
           V  A+L+LGV    +  R + D    E    W   + +VY  + E+ +R+ I+K+N   I
Sbjct: 3   VFCALLLLGVTLAYTIERPVKD----ESWIQWKMYHNKVYSHDGEETVRYTIWKDNERRI 58

Query: 71  ASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENASVPA 130
              N K  +  + L +N+F D TN EF+A  NGY        S+  T  +F       P 
Sbjct: 59  REHNLKGGD--FLLKMNQFGDMTNSEFKA-FNGYLSHKHVNGSTFLTPNNF-----VAPD 110

Query: 131 SIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQ 190
           ++DWR +G VT VKDQGQCG CWAFS   ++EG +   T KL SLSEQ LVDC T+  + 
Sbjct: 111 TVDWRNEGYVTPVKDQGQCGSCWAFSTTGSLEGQHFKKTGKLVSLSEQNLVDCSTAYGNN 170

Query: 191 GCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPS-AAKISGYEDVPSNNEA 249
           GC GGLMD+AF +I  NKG+ +EA YPY A DG C  K+  PS AA  +G+ D+P  NE 
Sbjct: 171 GCNGGLMDNAFTYIKENKGIDSEASYPYTAEDGKCVFKK--PSVAATDTGFVDLPEGNEN 228

Query: 250 ALMKAVAN-QPVSVAIDASGSDFQFYSSGVFT-GQC-GTELDHGVTAVGYGTADDGTKYW 306
            L +AVA+  P+SVAIDAS   FQFYSSGV+    C  TELDHGV  VGYGT + G  YW
Sbjct: 229 KLKEAVASVGPISVAIDASHESFQFYSSGVYNEPSCSSTELDHGVLVVGYGT-ESGKDYW 287

Query: 307 LVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
           LVKNSW T+WG+ GYI+M+R+   +   CGIA +ASYP
Sbjct: 288 LVKNSWNTSWGDKGYIKMRRNAKNQ---CGIATKASYP 322


>gi|121543825|gb|ABM55577.1| putative cathepsin L-like protease [Maconellicoccus hirsutus]
          Length = 341

 Score =  272 bits (695), Expect = 2e-70,   Method: Compositional matrix adjust.
 Identities = 151/324 (46%), Positives = 194/324 (59%), Gaps = 14/324 (4%)

Query: 29  TLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNK--PYKLGI 86
           + ND  + E  E++  Q+ + Y    E++ R K+F +N   IA  N   +N    Y+L +
Sbjct: 22  SFND-LIAEEWELFKTQFSKAYNTEIEEKFRMKVFMDNKHKIARHNKLFQNGEVSYELEM 80

Query: 87  NEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSF-RYENASVPASIDWRKKGAVTGVKD 145
           N F D  + EF    NGY+  L  V   E   V+F    N +VP S+DWR +GAVT VK+
Sbjct: 81  NHFGDLLHHEFVKTVNGYRHSLRRVTGDEIDSVTFIPAYNVTVPDSVDWRTEGAVTEVKN 140

Query: 146 QGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFII 205
           QGQCG CWAFS   ++EG +   T++LTSLSEQ L+DC     + GC GGLMD+AF +I 
Sbjct: 141 QGQCGSCWAFSTTGSLEGQHFRNTKQLTSLSEQNLIDCSGKYGNNGCSGGLMDNAFAYIK 200

Query: 206 SNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVAN-QPVSVAI 264
           SNKG+ TE  YPY+  D  C  K    S A   G+ D+P  +E  L  AVA   P+SVAI
Sbjct: 201 SNKGIDTEQSYPYEGIDDKCRYK-PQESGATDKGFVDIPQGDEEKLKLAVATVGPISVAI 259

Query: 265 DASGSDFQFYSSGVFTGQ-CGT---ELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENG 320
           DAS   FQFY  GV+  + CG    +LDHGV AVGYGT ++G  YWLVKNSWG  WG +G
Sbjct: 260 DASHQSFQFYKKGVYYDKGCGNGEEDLDHGVLAVGYGT-ENGKDYWLVKNSWGKRWGLDG 318

Query: 321 YIRMQRDIDAKEGLCGIAMQASYP 344
           YI+M R+   K   CGIA  ASYP
Sbjct: 319 YIKMARN---KHNHCGIATSASYP 339


>gi|195583187|ref|XP_002081405.1| GD10995 [Drosophila simulans]
 gi|194193414|gb|EDX06990.1| GD10995 [Drosophila simulans]
          Length = 341

 Score =  272 bits (695), Expect = 2e-70,   Method: Compositional matrix adjust.
 Identities = 151/327 (46%), Positives = 197/327 (60%), Gaps = 18/327 (5%)

Query: 29  TLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNK-ARNK-PYKLGI 86
           +  D  M E H   + ++ + Y+D+ E+  R KIF EN   IA  N + A  K  +KL +
Sbjct: 20  SFADVVMEEWHTFKL-EHRKNYQDDTEERFRLKIFNENKHKIAKHNQRFAEGKVSFKLAV 78

Query: 87  NEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFR------YENASVPASIDWRKKGAV 140
           N++AD  + EFR   NG+   L   +     D SF+        + ++P S+DWR KGAV
Sbjct: 79  NKYADLLHHEFRQLMNGFNYTLH--KQLRAADESFKGVTFISPAHVTLPKSVDWRTKGAV 136

Query: 141 TGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDA 200
           T VKDQG CG CWAFS+  A+EG +   +  L SLSEQ LVDC T   + GC GGLMD+A
Sbjct: 137 TAVKDQGHCGSCWAFSSTGALEGQHFRKSGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNA 196

Query: 201 FEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVAN-QP 259
           F +I  N G+ TE  YPY+A D SC+  +    A    G+ D+P  +E  + +AVA   P
Sbjct: 197 FRYIKDNGGIDTEKSYPYEAIDDSCHFNKGTIGATD-RGFTDIPQGDEKKMAEAVATVGP 255

Query: 260 VSVAIDASGSDFQFYSSGVFT-GQCGTE-LDHGVTAVGYGTADDGTKYWLVKNSWGTTWG 317
           VSVAIDAS   FQFYS GV+   QC  + LDHGV  VG+GT + G  YWLVKNSWGTTWG
Sbjct: 256 VSVAIDASHESFQFYSEGVYNEPQCDAQNLDHGVLVVGFGTDESGDDYWLVKNSWGTTWG 315

Query: 318 ENGYIRMQRDIDAKEGLCGIAMQASYP 344
           + G+I+M R+   KE  CGIA  +SYP
Sbjct: 316 DKGFIKMLRN---KENQCGIASASSYP 339


>gi|21425246|emb|CAD33266.1| cathepsin L [Aphis gossypii]
          Length = 341

 Score =  272 bits (695), Expect = 2e-70,   Method: Compositional matrix adjust.
 Identities = 154/343 (44%), Positives = 205/343 (59%), Gaps = 14/343 (4%)

Query: 12  LAAILVLGV--WAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEY 69
           +  ++VLG+  +A  + S    +  + E   ++  Q+ ++Y D  E+  R K++ +N   
Sbjct: 1   MKVVIVLGLVAFAISTVSSINLNEVIEEEWSLFKIQFKKLYEDIKEETFRKKVYLDNKLK 60

Query: 70  IASFNN--KARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTD--VSF-RYE 124
           IA  N   ++  + Y L +N F D    E+    NG+K  L     + T D  V+F + E
Sbjct: 61  IAGHNKLYESGEETYALEMNHFGDLMQHEYTKMMNGFKPSLAGGDRNFTNDEAVTFLKSE 120

Query: 125 NASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCD 184
           N  +P S+DWRKKG VT VK+QGQCG CW+FSA  ++EG +   T  L SLSEQ L+DC 
Sbjct: 121 NVVIPKSVDWRKKGYVTPVKNQGQCGSCWSFSATGSLEGQHFRKTGVLVSLSEQNLIDCS 180

Query: 185 TSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVP 244
               + GCEGGLMD AF++I SNKGL TE  YPY+A D  C     N S A   G+ D+P
Sbjct: 181 RKYGNNGCEGGLMDLAFKYIKSNKGLDTEKSYPYEAEDDKCRYNPEN-SGATDKGFVDIP 239

Query: 245 SNNEAALMKAVAN-QPVSVAIDASGSDFQFYSSGVFTG-QC-GTELDHGVTAVGYGTADD 301
             +E ALM A+A   PVS+AIDAS   FQFY  GVF   +C  TELDHGV AVG+G+   
Sbjct: 240 EGDEDALMHALATVGPVSIAIDASSEKFQFYKKGVFYNPRCSSTELDHGVLAVGFGSDKK 299

Query: 302 GTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
           G  YW+VKNSWG TWG+ GYI M R+   K+  CG+A  ASYP
Sbjct: 300 GGDYWIVKNSWGKTWGDEGYIMMARN---KKNNCGVASSASYP 339


>gi|116794072|gb|ABK26996.1| unknown [Picea sitchensis]
          Length = 367

 Score =  272 bits (695), Expect = 2e-70,   Method: Compositional matrix adjust.
 Identities = 150/356 (42%), Positives = 212/356 (59%), Gaps = 25/356 (7%)

Query: 10  LVLAAILVLGVWAPQSWSRTLNDATMNERHEM------WMAQYGRVYRDNAEKEMRFKIF 63
           L+++A ++  V A ++   +     +N  + +      W+ ++G++Y  + EK  R +IF
Sbjct: 8   LLISATIICLVSAAKAVQHSYEVGDINSGNGLVRLFDRWLGRHGKLYGSHEEKARRLQIF 67

Query: 64  KENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYK------RRLPSVRSSET- 116
           + N++YI + +NK  N  ++LG+N+FAD TNEEF+    G        RR   +  +E  
Sbjct: 68  RTNLQYIHA-HNKNSNSSFRLGLNKFADLTNEEFKTRYFGKNSKQWRDRRRTELEGAELR 126

Query: 117 ----TDVSFRYENASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKL 172
                 V  +  + S+ +S+DWRKKGAVTGVKDQ QCG CWAFS   A+EG+N I+T KL
Sbjct: 127 PVLKQTVGSQSSSCSIASSLDWRKKGAVTGVKDQAQCGSCWAFSTTGAIEGVNFISTGKL 186

Query: 173 TSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANP 232
            SLSEQELV CD +  + GCEGG MD AF ++I N G+ TE  Y Y   D +CN  +   
Sbjct: 187 VSLSEQELVACDAT--NYGCEGGDMDYAFTWVIQNGGIDTEKDYSYTGVDSTCNTNKEAK 244

Query: 233 SAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCG---TELDH 289
               I GY DV S +++AL+ A  +QPVSV ID S  DFQ Y+ G++ G C     ++DH
Sbjct: 245 KIVSIDGYTDV-SPDDSALLCAAGSQPVSVGIDGSAIDFQLYTGGIYDGDCSGNPDDIDH 303

Query: 290 GVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPT 345
            V  VGY +A +G  YW+VKNSWGT WG  GY  + R+ +   G+C I   ASYPT
Sbjct: 304 AVLVVGY-SAKNGKDYWIVKNSWGTDWGLEGYFYILRNTELPYGVCAINAMASYPT 358


>gi|16304178|gb|AAL16954.1|AF426414_1 cathepsin L-like cysteine protease precursor [Delia radicum]
          Length = 337

 Score =  272 bits (695), Expect = 2e-70,   Method: Compositional matrix adjust.
 Identities = 147/320 (45%), Positives = 193/320 (60%), Gaps = 11/320 (3%)

Query: 34  TMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNN-KARNK-PYKLGINEFAD 91
            + E  + +  ++ + +    E+  R KIF EN   IA  N   A+ K  +KLG+N+++D
Sbjct: 22  VIKEEWQTFKMEHRKNFLSEVEERFRMKIFNENRHKIAKHNQLYAQGKVSFKLGLNKYSD 81

Query: 92  QTNEEFRAPRNGYKRRLPSVRSSE--TTDVSFRYENASVPASIDWRKKGAVTGVKDQGQC 149
               EF+   NGY   +  V  ++  +  +     N  +P S+DWR+ GAVT VKDQG C
Sbjct: 82  MLYHEFKETMNGYNHTMRKVLRAQGFSGIIYIPPANVQIPKSVDWRQHGAVTAVKDQGHC 141

Query: 150 GCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKG 209
           G CWAFS+ AA+EG +      L SLSEQ LVDC T   + GC GGLMD+AF +I  N G
Sbjct: 142 GSCWAFSSTAALEGQHFRKAGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKDNGG 201

Query: 210 LATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQ-PVSVAIDASG 268
           + TE  YPY+  D SC+  ++   A   +G+ D+P  +E ALMKAVA   PVSVAIDAS 
Sbjct: 202 IDTEKSYPYEGIDDSCHFTKSGVGATD-TGFVDIPQGDEEALMKAVATMGPVSVAIDASH 260

Query: 269 SDFQFYSSGVFT-GQCGTE-LDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQR 326
             FQ YS GV+   +C  + LDHGV  VGYGT   G  YWLVKNSWGTTWG+ GYI+M R
Sbjct: 261 ESFQLYSEGVYNEPECDAQNLDHGVLVVGYGTDKTGLDYWLVKNSWGTTWGDQGYIKMAR 320

Query: 327 DIDAKEGLCGIAMQASYPTA 346
           + D +   CGIA  +SYPT 
Sbjct: 321 NQDNQ---CGIATASSYPTV 337


>gi|2804262|dbj|BAA24442.1| cysteine proteinase [Sitophilus zeamais]
          Length = 338

 Score =  272 bits (695), Expect = 2e-70,   Method: Compositional matrix adjust.
 Identities = 151/343 (44%), Positives = 202/343 (58%), Gaps = 19/343 (5%)

Query: 10  LVLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEY 69
           L+LAA+++         + +  D  + E+   +  Q+ + Y    E+  R KIF EN   
Sbjct: 5   LILAAVVI------SCQAVSFYD-LVQEQWSSFKMQHSKNYDSETEERFRMKIFMENAHK 57

Query: 70  IASFNNKARNK--PYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRY---E 124
           +A  N         +KLG+N++AD  + EF +  NG+ +   ++      + + R+    
Sbjct: 58  VAKHNKLFSQGFVKFKLGLNKYADMLHHEFVSTLNGFNKTKNNILKGSDLNDAVRFISPA 117

Query: 125 NASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCD 184
           N  +P ++DWR KGAVT VKDQG CG CW+FSA  ++EG +   T KL SLSEQ LVDC 
Sbjct: 118 NVKLPDTVDWRDKGAVTEVKDQGHCGSCWSFSATGSLEGQHFRKTGKLVSLSEQNLVDCS 177

Query: 185 TSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVP 244
               + GC GGLMD+AF +I  N G+ TE  YPY A D  C+ K  N S A   G+ D+ 
Sbjct: 178 GRYGNNGCNGGLMDNAFRYIKDNGGIDTEKSYPYLAEDEKCHYKAQN-SGATDKGFVDIE 236

Query: 245 SNNEAALMKAVANQ-PVSVAIDASGSDFQFYSSGVFTG-QCGT-ELDHGVTAVGYGTADD 301
             NE  L  AVA   PVS+AIDAS   FQ YS GV++  +C + ELDHGV  VGYGT+DD
Sbjct: 237 EANEDDLKAAVATVGPVSIAIDASHETFQLYSDGVYSDPECSSQELDHGVLVVGYGTSDD 296

Query: 302 GTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
           G  YWLVKNSWG +WG NGYI+M R+ D    +CG+A QASYP
Sbjct: 297 GQDYWLVKNSWGPSWGLNGYIKMARNQD---NMCGVASQASYP 336


>gi|24653514|ref|NP_523735.2| cysteine proteinase-1, isoform C [Drosophila melanogaster]
 gi|118572624|sp|Q95029.2|CATL_DROME RecName: Full=Cathepsin L; AltName: Full=Cysteine proteinase 1;
           Contains: RecName: Full=Cathepsin L heavy chain;
           Contains: RecName: Full=Cathepsin L light chain; Flags:
           Precursor
 gi|21627209|gb|AAM68565.1| cysteine proteinase-1, isoform C [Drosophila melanogaster]
          Length = 371

 Score =  272 bits (695), Expect = 2e-70,   Method: Compositional matrix adjust.
 Identities = 151/327 (46%), Positives = 196/327 (59%), Gaps = 18/327 (5%)

Query: 29  TLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNK-ARNK-PYKLGI 86
           +  D  M E H   + ++ + Y+D  E+  R KIF EN   IA  N + A  K  +KL +
Sbjct: 50  SFADVVMEEWHTFKL-EHRKNYQDETEERFRLKIFNENKHKIAKHNQRFAEGKVSFKLAV 108

Query: 87  NEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFR------YENASVPASIDWRKKGAV 140
           N++AD  + EFR   NG+   L   +     D SF+        + ++P S+DWR KGAV
Sbjct: 109 NKYADLLHHEFRQLMNGFNYTLH--KQLRAADESFKGVTFISPAHVTLPKSVDWRTKGAV 166

Query: 141 TGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDA 200
           T VKDQG CG CWAFS+  A+EG +   +  L SLSEQ LVDC T   + GC GGLMD+A
Sbjct: 167 TAVKDQGHCGSCWAFSSTGALEGQHFRKSGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNA 226

Query: 201 FEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVAN-QP 259
           F +I  N G+ TE  YPY+A D SC+  +    A    G+ D+P  +E  + +AVA   P
Sbjct: 227 FRYIKDNGGIDTEKSYPYEAIDDSCHFNKGTVGATD-RGFTDIPQGDEKKMAEAVATVGP 285

Query: 260 VSVAIDASGSDFQFYSSGVFT-GQCGTE-LDHGVTAVGYGTADDGTKYWLVKNSWGTTWG 317
           VSVAIDAS   FQFYS GV+   QC  + LDHGV  VG+GT + G  YWLVKNSWGTTWG
Sbjct: 286 VSVAIDASHESFQFYSEGVYNEPQCDAQNLDHGVLVVGFGTDESGEDYWLVKNSWGTTWG 345

Query: 318 ENGYIRMQRDIDAKEGLCGIAMQASYP 344
           + G+I+M R+   KE  CGIA  +SYP
Sbjct: 346 DKGFIKMLRN---KENQCGIASASSYP 369


>gi|255522980|gb|ACU12382.1| RE21773p [Drosophila melanogaster]
          Length = 375

 Score =  271 bits (694), Expect = 2e-70,   Method: Compositional matrix adjust.
 Identities = 151/327 (46%), Positives = 196/327 (59%), Gaps = 18/327 (5%)

Query: 29  TLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNK-ARNK-PYKLGI 86
           +  D  M E H   + ++ + Y+D  E+  R KIF EN   IA  N + A  K  +KL +
Sbjct: 54  SFADVVMEEWHTFKL-EHRKNYQDETEERFRLKIFNENKHKIAKHNQRFAEGKVSFKLAV 112

Query: 87  NEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFR------YENASVPASIDWRKKGAV 140
           N++AD  + EFR   NG+   L   +     D SF+        + ++P S+DWR KGAV
Sbjct: 113 NKYADLLHHEFRQLMNGFNYTLH--KQLRAADESFKGVTFISPAHVTLPKSVDWRTKGAV 170

Query: 141 TGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDA 200
           T VKDQG CG CWAFS+  A+EG +   +  L SLSEQ LVDC T   + GC GGLMD+A
Sbjct: 171 TAVKDQGHCGSCWAFSSTGALEGQHFRKSGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNA 230

Query: 201 FEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVAN-QP 259
           F +I  N G+ TE  YPY+A D SC+  +    A    G+ D+P  +E  + +AVA   P
Sbjct: 231 FRYIKDNGGIDTEKSYPYEAIDDSCHFNKGTVGATD-RGFTDIPQGDEKKMAEAVATVGP 289

Query: 260 VSVAIDASGSDFQFYSSGVFT-GQCGTE-LDHGVTAVGYGTADDGTKYWLVKNSWGTTWG 317
           VSVAIDAS   FQFYS GV+   QC  + LDHGV  VG+GT + G  YWLVKNSWGTTWG
Sbjct: 290 VSVAIDASHESFQFYSEGVYNEPQCDAQNLDHGVLVVGFGTDESGEDYWLVKNSWGTTWG 349

Query: 318 ENGYIRMQRDIDAKEGLCGIAMQASYP 344
           + G+I+M R+   KE  CGIA  +SYP
Sbjct: 350 DKGFIKMLRN---KENQCGIASASSYP 373


>gi|222641485|gb|EEE69617.1| hypothetical protein OsJ_29194 [Oryza sativa Japonica Group]
          Length = 360

 Score =  271 bits (694), Expect = 2e-70,   Method: Compositional matrix adjust.
 Identities = 153/336 (45%), Positives = 191/336 (56%), Gaps = 13/336 (3%)

Query: 1   MAMILLENKLVLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRF 60
           + + LL +   L A  +L   A       + D  M +R   W   + R Y    E   RF
Sbjct: 13  LTLALLASCGALLATSMLPARATAGSCLDVGDMVMMDRFRAWQGAHNRSYPSAEEALQRF 72

Query: 61  KIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETT--- 117
            +++ N E+I + N +  +  Y+L  NEFAD T EEF A   GY      V  S  T   
Sbjct: 73  DVYRRNAEFIDAVNLRG-DLTYQLAENEFADLTEEEFLATYTGYYAGDGPVDDSVITTGA 131

Query: 118 ---DVSFRYENASVPASIDWRKKGAVTGVKDQ-GQCGCCWAFSAVAAMEGINHITTRKLT 173
              D SF Y    VPAS+DWR +GAV   K Q   C  CWAF   A +E +N I T KL 
Sbjct: 132 GDVDASFSYR-VDVPASVDWRAQGAVVPPKSQTSTCSSCWAFVTAATIESLNMIKTGKLV 190

Query: 174 SLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPS 233
           SLSEQ+LVDCD+   D GC  G    A+++++ N GL TEA YPY A  G CN+ ++   
Sbjct: 191 SLSEQQLVDCDS--YDGGCNLGSYGRAYKWVVENGGLTTEADYPYTARRGPCNRAKSAHH 248

Query: 234 AAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTA 293
           AAKI+G+  VP  NEAAL  AVA QPV+VAI+  GS  QFY  GV+TG CGT L H VT 
Sbjct: 249 AAKITGFGKVPPRNEAALQAAVARQPVAVAIEV-GSGMQFYKGGVYTGPCGTRLAHAVTV 307

Query: 294 VGYGT-ADDGTKYWLVKNSWGTTWGENGYIRMQRDI 328
           VGYGT A  G KYW +KNSWG +WGE GYIR+ RD+
Sbjct: 308 VGYGTDASSGAKYWTIKNSWGQSWGERGYIRILRDV 343


>gi|209693435|ref|NP_001129410.1| cathepsin L precursor [Acyrthosiphon pisum]
 gi|251823771|ref|NP_001156569.1| cathepsin L precursor [Acyrthosiphon pisum]
          Length = 341

 Score =  271 bits (694), Expect = 2e-70,   Method: Compositional matrix adjust.
 Identities = 154/343 (44%), Positives = 205/343 (59%), Gaps = 14/343 (4%)

Query: 12  LAAILVLGV--WAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEY 69
           +  ++VLG+  +A  + S    +  + E   ++  Q+ ++Y D  E+  R K++ +N   
Sbjct: 1   MKVVIVLGLVAFAISTVSSINLNEVIEEEWSLFKIQFKKLYEDIKEETFRKKVYLDNKLK 60

Query: 70  IASFNN--KARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTD--VSF-RYE 124
           IA  N   ++  + Y L +N F D    E+    NG+K  L     + T D  V+F + E
Sbjct: 61  IARHNKLYESGEETYALEMNHFGDLMQHEYTKMMNGFKPSLAGGDRNFTNDEAVTFLKSE 120

Query: 125 NASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCD 184
           N  +P S+DWRKKG VT VK+QGQCG CW+FSA  ++EG +   T  L SLSEQ L+DC 
Sbjct: 121 NVVIPKSVDWRKKGYVTPVKNQGQCGSCWSFSATGSLEGQHFRKTGVLVSLSEQNLIDCS 180

Query: 185 TSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVP 244
               + GCEGGLMD AF++I SNKGL TE  YPY+A D  C     N S A   G+ D+P
Sbjct: 181 RKYGNNGCEGGLMDLAFKYIKSNKGLDTEKSYPYEAEDDKCRYNPEN-SGATDKGFVDIP 239

Query: 245 SNNEAALMKAVANQ-PVSVAIDASGSDFQFYSSGVFTG-QC-GTELDHGVTAVGYGTADD 301
             +E ALM A+A   PVS+AIDAS   FQFY  GVF   +C  TELDHGV AVG+G+   
Sbjct: 240 EGDEDALMHALATVGPVSIAIDASSEKFQFYKKGVFYNPRCSSTELDHGVLAVGFGSDKK 299

Query: 302 GTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
           G  YW+VKNSWG TWG+ GYI M R+   K+  CG+A  ASYP
Sbjct: 300 GGDYWIVKNSWGKTWGDEGYIMMARN---KKNNCGVASSASYP 339


>gi|24653516|ref|NP_725347.1| cysteine proteinase-1, isoform A [Drosophila melanogaster]
 gi|24653518|ref|NP_725348.1| cysteine proteinase-1, isoform B [Drosophila melanogaster]
 gi|1658527|gb|AAB18345.1| cysteine proteinase 1 [Drosophila melanogaster]
 gi|2305221|gb|AAB65749.1| cysteine proteinase-1 [Drosophila melanogaster]
 gi|7303249|gb|AAF58311.1| cysteine proteinase-1, isoform A [Drosophila melanogaster]
 gi|21627210|gb|AAM68566.1| cysteine proteinase-1, isoform B [Drosophila melanogaster]
 gi|54650754|gb|AAV36956.1| LP06554p [Drosophila melanogaster]
 gi|220951982|gb|ACL88534.1| Cp1-PA [synthetic construct]
          Length = 341

 Score =  271 bits (694), Expect = 2e-70,   Method: Compositional matrix adjust.
 Identities = 151/327 (46%), Positives = 196/327 (59%), Gaps = 18/327 (5%)

Query: 29  TLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNK-ARNK-PYKLGI 86
           +  D  M E H   + ++ + Y+D  E+  R KIF EN   IA  N + A  K  +KL +
Sbjct: 20  SFADVVMEEWHTFKL-EHRKNYQDETEERFRLKIFNENKHKIAKHNQRFAEGKVSFKLAV 78

Query: 87  NEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFR------YENASVPASIDWRKKGAV 140
           N++AD  + EFR   NG+   L   +     D SF+        + ++P S+DWR KGAV
Sbjct: 79  NKYADLLHHEFRQLMNGFNYTLH--KQLRAADESFKGVTFISPAHVTLPKSVDWRTKGAV 136

Query: 141 TGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDA 200
           T VKDQG CG CWAFS+  A+EG +   +  L SLSEQ LVDC T   + GC GGLMD+A
Sbjct: 137 TAVKDQGHCGSCWAFSSTGALEGQHFRKSGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNA 196

Query: 201 FEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVAN-QP 259
           F +I  N G+ TE  YPY+A D SC+  +    A    G+ D+P  +E  + +AVA   P
Sbjct: 197 FRYIKDNGGIDTEKSYPYEAIDDSCHFNKGTVGATD-RGFTDIPQGDEKKMAEAVATVGP 255

Query: 260 VSVAIDASGSDFQFYSSGVFT-GQCGTE-LDHGVTAVGYGTADDGTKYWLVKNSWGTTWG 317
           VSVAIDAS   FQFYS GV+   QC  + LDHGV  VG+GT + G  YWLVKNSWGTTWG
Sbjct: 256 VSVAIDASHESFQFYSEGVYNEPQCDAQNLDHGVLVVGFGTDESGEDYWLVKNSWGTTWG 315

Query: 318 ENGYIRMQRDIDAKEGLCGIAMQASYP 344
           + G+I+M R+   KE  CGIA  +SYP
Sbjct: 316 DKGFIKMLRN---KENQCGIASASSYP 339


>gi|7523482|dbj|BAA94210.1| putative cysteine protease [Oryza sativa Japonica Group]
 gi|10800060|dbj|BAB16480.1| putative cysteine protease [Oryza sativa Japonica Group]
          Length = 349

 Score =  271 bits (694), Expect = 2e-70,   Method: Compositional matrix adjust.
 Identities = 147/321 (45%), Positives = 185/321 (57%), Gaps = 21/321 (6%)

Query: 36  NERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNE 95
            +  E WMA++G+ Y  + EKE RF +F++NV +I S+   A      L +N+FAD TN+
Sbjct: 38  TQMFEEWMAKFGKKYPCHGEKEYRFGVFRDNVRFIRSYRPPAGYN-SALRVNQFADLTND 96

Query: 96  EFRAPRNGYKRRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQCGCCWAF 155
           EF +   G K   P        D     +   +P  IDWR KGAVT VKDQG CG CWAF
Sbjct: 97  EFVSTHTGAKPPCPK-------DAPRGVDPIWLPCCIDWRYKGAVTDVKDQGACGSCWAF 149

Query: 156 SAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAK 215
           +AVAA+EG+  I T KLT LSEQELVDCDT     GC GG  D AFE + +  G+  E+ 
Sbjct: 150 AAVAAIEGLTQIRTGKLTPLSEQELVDCDTG--SSGCAGGHTDRAFELVAAKGGITAESG 207

Query: 216 YPYKASDGSCNKKEA-NPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFY 274
           Y Y+   G C   +A    AA+I G+  VP  +E  L  AVA QPV+  IDASG  FQFY
Sbjct: 208 YRYEGYRGKCRADDALFNHAARIGGHRAVPPGDERQLATAVARQPVTAYIDASGPAFQFY 267

Query: 275 SSGVFTGQC---------GTELDHGVTAVGYGT-ADDGTKYWLVKNSWGTTWGENGYIRM 324
            SGVF G C             +H VT VGY      G KYW+ KNSWG TWGE GYI +
Sbjct: 268 GSGVFPGPCGSGSGAAAAAPTTNHAVTLVGYCQDGASGKKYWVAKNSWGKTWGEKGYILL 327

Query: 325 QRDIDAKEGLCGIAMQASYPT 345
           ++D+ +  G CG+A+   YPT
Sbjct: 328 EKDVASPHGTCGVAVSPFYPT 348


>gi|42563538|gb|AAS20467.1| cysteine protease-like protein [Pelargonium x hortorum]
          Length = 234

 Score =  271 bits (694), Expect = 3e-70,   Method: Compositional matrix adjust.
 Identities = 130/198 (65%), Positives = 154/198 (77%), Gaps = 3/198 (1%)

Query: 149 CGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNK 208
           CG CWAFS +AA+EGINHI T +L SLSEQELVDCD S  +QGC GGLMD AFEFII N 
Sbjct: 1   CGRCWAFSTIAAVEGINHIVTGELISLSEQELVDCDRS-YNQGCNGGLMDYAFEFIIKNG 59

Query: 209 GLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASG 268
           G+ +E  YPYKA DG+C+    N     I GYEDVP N+E +L KAVA QPVSVAI+A G
Sbjct: 60  GIDSEEDYPYKAVDGTCDPIRKNAKVVTIDGYEDVPENDENSLKKAVAYQPVSVAIEAGG 119

Query: 269 SDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDI 328
            +FQ Y SG+FTG+CGT LDHGV AVGYGT ++G  YW+V+NSWG++WGENGYIRM+R++
Sbjct: 120 REFQLYQSGIFTGRCGTALDHGVAAVGYGT-ENGIDYWIVRNSWGSSWGENGYIRMERNV 178

Query: 329 D-AKEGLCGIAMQASYPT 345
              K G CGIAM+ASYPT
Sbjct: 179 KTTKTGKCGIAMEASYPT 196


>gi|125564712|gb|EAZ10092.1| hypothetical protein OsI_32402 [Oryza sativa Indica Group]
          Length = 382

 Score =  271 bits (694), Expect = 3e-70,   Method: Compositional matrix adjust.
 Identities = 149/338 (44%), Positives = 196/338 (57%), Gaps = 31/338 (9%)

Query: 34  TMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQT 93
           TM E  + W A+Y R Y    E+  R +++  NV YI +  N A    Y+LG   + D T
Sbjct: 47  TMMEMFQRWKAEYNRSYATPEEERRRLRVYARNVRYIEA-TNAAAGLAYELGETAYTDLT 105

Query: 94  NEEFRAPRNGYKRR------------------LPSVRSSETTDVSFRYENASVPASIDWR 135
           N+EF A       R                     V   +  +V F  E+A  PAS+DWR
Sbjct: 106 NDEFMAMYTAPPLRSAADDDDDAATTTIITTRAGPVDEHQQPEVYFN-ESAGAPASVDWR 164

Query: 136 KKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGG 195
             GAVT VKDQG+CG CWAFS VA +EGI  I   KL SLSEQELVDCDT   D GC+GG
Sbjct: 165 ASGAVTEVKDQGRCGSCWAFSTVAVVEGIQKIKKGKLVSLSEQELVDCDTL--DSGCDGG 222

Query: 196 LMDDAFEFIISNKGLATEAKYPYKA-SDGSCNKKEANPSAAKISGYEDVPSNNEAALMKA 254
           +   A E+I +N G+ T   YPY   +  +C++ +    AA I+G   V + +EA+L  A
Sbjct: 223 VSYRALEWITANGGITTRDDYPYTGAAAAACDRAKLGHHAATIAGLRRVATRSEASLQNA 282

Query: 255 VANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTAD-------DGTKYWL 307
            A QPV+V+I+A G +FQ Y  GV+ G CGT L+HGVT VGYG  +        G KYW+
Sbjct: 283 AAAQPVAVSIEAGGDNFQHYRKGVYDGPCGTRLNHGVTVVGYGQEEAPVDGSAAGDKYWI 342

Query: 308 VKNSWGTTWGENGYIRMQRDIDAK-EGLCGIAMQASYP 344
           +KNSWG  WG+ GYI+M++D+  K EGLCGIA++ S+P
Sbjct: 343 IKNSWGKNWGDQGYIKMKKDVAGKPEGLCGIAIRPSFP 380


>gi|22661|emb|CAA49504.1| papaya proteinase omega [Carica papaya]
          Length = 367

 Score =  271 bits (694), Expect = 3e-70,   Method: Compositional matrix adjust.
 Identities = 157/356 (44%), Positives = 207/356 (58%), Gaps = 23/356 (6%)

Query: 1   MAMILLENKLVLAAILVL-------GVWAPQSWSRTLNDATMNER----HEMWMAQYGRV 49
           MAMI   +KL+  AI +        G ++   +S+  +D T  ER       WM  + + 
Sbjct: 1   MAMIPSISKLLFVAICLFVHMSVSFGDFSIVGYSQ--DDLTSTERLIQLFNSWMLNHNKF 58

Query: 50  YRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLP 109
           Y +  EK  RF+IFK+N+ YI   N K  N  Y+LG+NEFAD +N+EF      Y   L 
Sbjct: 59  YENVDEKLYRFEIFKDNLNYIDETNKK--NNSYRLGLNEFADLSNDEFNEK---YVGSLI 113

Query: 110 SVRSSETTDVSFRYEN-ASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHIT 168
                ++ D  F  E+  ++P ++DWRKKGAVT V+ QG CG CWAFSAVA +EGIN I 
Sbjct: 114 DATIEQSYDEEFINEDIVNLPENVDWRKKGAVTPVRHQGSCGSCWAFSAVATVEGINKIR 173

Query: 169 TRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKK 228
           T KL  LSEQELVDC+      GC+GG    A E++  N G+   +KYPYKA  G+C  K
Sbjct: 174 TGKLVELSEQELVDCER--RSHGCKGGYPPYALEYVAKN-GIHLRSKYPYKAKQGTCRAK 230

Query: 229 EANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELD 288
           +      K SG   V  NNE  L+ A+A QPVSV +++ G  FQ Y  G+F G CGT++D
Sbjct: 231 QVGGPIVKTSGVGRVQPNNEGNLLNAIAKQPVSVVVESKGRPFQLYKGGIFEGPCGTKVD 290

Query: 289 HGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
           H VTAVGYG +       L+KNSWGT WGE GYIR++R      G+CG+   + YP
Sbjct: 291 HAVTAVGYGKSGGKGYI-LIKNSWGTAWGEKGYIRIKRAPGNSPGVCGLYKSSYYP 345


>gi|33348834|gb|AAQ16117.1| cathepsin L-like cysteine proteinase A [Rhipicephalus
           haemaphysaloides haemaphysaloides]
          Length = 332

 Score =  271 bits (694), Expect = 3e-70,   Method: Compositional matrix adjust.
 Identities = 152/310 (49%), Positives = 194/310 (62%), Gaps = 12/310 (3%)

Query: 40  EMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNK--PYKLGINEFADQTNEEF 97
           E +   + + Y  + E+ +RFKIF EN   IA  N K       YKLG+N+F D    EF
Sbjct: 28  EAFKTTHKKSYESHMEELLRFKIFTENSLIIAKHNAKYAKGLVSYKLGMNQFGDLLAHEF 87

Query: 98  RAPRNGYKRRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQCGCCWAFSA 157
               NGY+ +  S  S+     +    ++S+P+++DWRKKGAVT VKDQGQCG CWAFSA
Sbjct: 88  AKIFNGYRGQRTSRGSTFMPPANVN--DSSLPSTVDWRKKGAVTPVKDQGQCGSCWAFSA 145

Query: 158 VAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYP 217
             ++EG + +   +L SLSEQ LVDC  S  + GCEGGLMD+AF++I +N G+  E  YP
Sbjct: 146 TGSLEGQHFLKDGELVSLSEQNLVDCSQSFGNNGCEGGLMDNAFKYIKANDGIDAEESYP 205

Query: 218 YKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVAN-QPVSVAIDASGSDFQFYSS 276
           Y+A D  C  K+ +  A   +G+ D+   +E  L KAVA   P+SVAIDA  S FQ YS 
Sbjct: 206 YEAMDDKCRFKKEDVGATD-TGFVDIEGGSEDDLKKAVATVGPISVAIDAGHSSFQLYSE 264

Query: 277 GVF-TGQCGT-ELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGL 334
           GV+   +C + ELDHGV AVGYG   DG KYWLVKNSWG +WG+NGYI M RD   K   
Sbjct: 265 GVYDEPECSSEELDHGVLAVGYGVK-DGKKYWLVKNSWGGSWGDNGYILMSRD---KNNQ 320

Query: 335 CGIAMQASYP 344
           CGIA  ASYP
Sbjct: 321 CGIASAASYP 330


>gi|384247445|gb|EIE20932.1| hypothetical protein COCSUDRAFT_18161 [Coccomyxa subellipsoidea
           C-169]
          Length = 387

 Score =  271 bits (693), Expect = 3e-70,   Method: Compositional matrix adjust.
 Identities = 157/331 (47%), Positives = 194/331 (58%), Gaps = 36/331 (10%)

Query: 46  YGRVYRDNAEKEMRFKIFKENVEYIASFNNKARN----KPY------------------- 82
           + + Y +  E  +R  IFK NV+YI S N+  ++    K +                   
Sbjct: 7   FNKKYSNEEEAALRLNIFKTNVDYITSVNSAQQSYQASKHFSENTQQTALSSLFLSQLAH 66

Query: 83  -----KLGINEFADQTNEEFRAPRNGYKR-RLPSVRSSETTDVSFRYENASVPASIDWRK 136
                +LG+NEFADQT EEF +   G       S RSS  T   FR+ + +   SI+W +
Sbjct: 67  TDLLPQLGLNEFADQTWEEFSSTHLGLNAGEDGSFRSSANT--GFRHADVTPANSINWVE 124

Query: 137 KGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGL 196
            GAVT VK+Q  CG CWAFS   ++EG N + T  L SLSEQ+LVDCDT  +DQGC GGL
Sbjct: 125 AGAVTPVKNQAFCGSCWAFSTTGSVEGANFLATGDLVSLSEQQLVDCDTK-KDQGCGGGL 183

Query: 197 MDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVA 256
           MD AF++II N GL TE  Y Y +  G CNK     +   I GYEDVP N+E AL KAV+
Sbjct: 184 MDYAFDYIIKNGGLDTEEDYSYWSVGGFCNKLREERTVVSIDGYEDVPVNDEVALAKAVS 243

Query: 257 NQPVSVAIDASGSDFQFYSSGVFT--GQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGT 314
            QPVSVAI AS +  QFYSSGV    G C   L+HGV A GY   + G  YWLVKNSWG 
Sbjct: 244 KQPVSVAICASEA-MQFYSSGVIAAKGSC-IGLNHGVLAAGYDVDESGKPYWLVKNSWGG 301

Query: 315 TWGENGYIRMQRDIDAKEGLCGIAMQASYPT 345
           TWG  GY+++++D   KEG CGIAM ASYP 
Sbjct: 302 TWGMQGYMKLEKDSSVKEGACGIAMAASYPV 332


>gi|413947586|gb|AFW80235.1| hypothetical protein ZEAMMB73_542371 [Zea mays]
          Length = 264

 Score =  271 bits (693), Expect = 4e-70,   Method: Compositional matrix adjust.
 Identities = 142/259 (54%), Positives = 182/259 (70%), Gaps = 15/259 (5%)

Query: 3   MILLENKLVLAAILVLGVWA---PQSWSRTL-NDATMNERHEMWMAQYGRVYRDNAEKEM 58
           M+  +  L+L A+L+  V +   P   +R L +DA M ERHE WMA+YGRVY+D A+K  
Sbjct: 1   MVSSKAFLLLLAVLIGCVCSFPSPVLAARELSDDAAMAERHERWMAEYGRVYKDAADKAR 60

Query: 59  RFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTD 118
           RF++FK+N  ++ SFN   +NK + LG+N+FAD T E F+A + G+K     + + +   
Sbjct: 61  RFEVFKDNFAFVESFNADKKNK-FWLGVNQFADLTTEAFKANK-GFK----PISAEKAPT 114

Query: 119 VSFRYENASV---PASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSL 175
             F+YEN S+   P ++DWR KGAVT +K+QGQCGCCWAFSAVAA+EGI  ++T  L SL
Sbjct: 115 TGFKYENLSISALPTAVDWRTKGAVTPIKNQGQCGCCWAFSAVAAVEGIVKLSTGNLVSL 174

Query: 176 SEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAA 235
           SEQELVDCDT   D+GCEGG MD AFEF+I N GLATE+ YPYKA DG C  K  + SAA
Sbjct: 175 SEQELVDCDTHSMDEGCEGGWMDSAFEFVIKNGGLATESSYPYKAVDGKC--KGGSKSAA 232

Query: 236 KISGYEDVPSNNEAALMKA 254
            I G+EDVP NNEAALMKA
Sbjct: 233 TIKGHEDVPPNNEAALMKA 251


>gi|224079085|ref|XP_002305743.1| predicted protein [Populus trichocarpa]
 gi|222848707|gb|EEE86254.1| predicted protein [Populus trichocarpa]
          Length = 494

 Score =  271 bits (692), Expect = 4e-70,   Method: Compositional matrix adjust.
 Identities = 142/318 (44%), Positives = 195/318 (61%), Gaps = 8/318 (2%)

Query: 32  DATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFAD 91
           D ++ E  + W  ++ + Y+   E E RF  FK N++YI     K     +++G+N+FAD
Sbjct: 36  DESIIEIFQQWRDRHQKAYKHAEEAEKRFGNFKRNLKYIIEKTGKETTLRHRVGLNKFAD 95

Query: 92  QTNEEFRAPRNGYKRRLPSVRSSETTDVSFR-YENASVPASIDWRKKGAVTGVKDQGQCG 150
            +NEEF+       ++  +    +  D S R  ++   P+S+DWRKKG VT VKDQG CG
Sbjct: 96  LSNEEFKQLYLSKVKKPINKTRIDAEDRSRRNLQSCDAPSSLDWRKKGVVTAVKDQGDCG 155

Query: 151 CCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGL 210
            CW+FS   A+EGIN I T  L SLSEQELVDCDT+  + GCEGG MD AFE++I+N G+
Sbjct: 156 SCWSFSTTGAIEGINAIVTSDLISLSEQELVDCDTT--NYGCEGGYMDYAFEWVINNGGI 213

Query: 211 ATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSD 270
            TEA YPY   DG+CN  +       I GY+DV    ++AL+ A A QP+SV ID S  D
Sbjct: 214 DTEANYPYTGVDGTCNTAKEEIKVVSIDGYKDV-DETDSALLCAAAQQPISVGIDGSAID 272

Query: 271 FQFYSSGVF---TGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRD 327
           FQ Y+ G++         ++DH V  VGYG+ ++G  YW+VKNSWGT+WG  GY  ++R+
Sbjct: 273 FQLYTGGIYDGDCSDDPDDIDHAVLIVGYGS-ENGEDYWIVKNSWGTSWGIEGYFYIKRN 331

Query: 328 IDAKEGLCGIAMQASYPT 345
            D   G+C I   ASYPT
Sbjct: 332 TDLPYGVCAINAMASYPT 349


>gi|320164780|gb|EFW41679.1| cathepsin L2 [Capsaspora owczarzaki ATCC 30864]
          Length = 334

 Score =  271 bits (692), Expect = 4e-70,   Method: Compositional matrix adjust.
 Identities = 149/313 (47%), Positives = 198/313 (63%), Gaps = 9/313 (2%)

Query: 35  MNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTN 94
           +N   E W   +G+ Y D  E+  R  +++ N + +   +N A    Y LG+N FAD T+
Sbjct: 26  LNMEFEAWKRTFGKSYSDAVEEINRRAVWEAN-KMLVDAHNGAGIHSYTLGMNIFADLTH 84

Query: 95  EEFRAPRNGYKRRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQCGCCWA 154
           EEF+    G K  L   RS+ ++         ++P S+DWR  G VT VKDQGQCG CW+
Sbjct: 85  EEFKRFYLGTKVDLNRPRSNFSSTFIPTANVGALPDSVDWRTAGIVTPVKDQGQCGSCWS 144

Query: 155 FSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEA 214
           FS   ++EG +   T +L SLSEQ LVDC  +  +QGC GGLMDDAF++II+NKG+ TEA
Sbjct: 145 FSTTGSVEGQHARKTGQLVSLSEQNLVDCSKAQGNQGCNGGLMDDAFQYIITNKGIDTEA 204

Query: 215 KYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVAN-QPVSVAIDASGSDFQF 273
            YPY A DG+C    AN   A +S ++D+   +E+ L  AVA   PVSVAIDAS + FQ 
Sbjct: 205 SYPYTAKDGTCKFNAAN-VGATLSSFQDITRGSESDLQNAVATVGPVSVAIDASKNSFQL 263

Query: 274 YSSGVFT-GQC-GTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAK 331
           Y+SGV+   +C  T LDHGV A GYGT+ +GT YWLVKNSWG++WG+ GYI M R+ + +
Sbjct: 264 YTSGVYNEKKCSSTSLDHGVLAAGYGTS-NGTPYWLVKNSWGSSWGQAGYIWMSRNANNQ 322

Query: 332 EGLCGIAMQASYP 344
              CGIA  ASYP
Sbjct: 323 ---CGIATSASYP 332


>gi|55740406|gb|AAV63979.1| cathepsin L1 precursor [Artemia parthenogenetica]
          Length = 338

 Score =  271 bits (692), Expect = 4e-70,   Method: Compositional matrix adjust.
 Identities = 151/340 (44%), Positives = 204/340 (60%), Gaps = 16/340 (4%)

Query: 10  LVLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEY 69
            +LAA+LV       S + +L +   +E H ++ A + + Y    E+++R KI+ EN   
Sbjct: 8   FLLAAVLV-----QLSAALSLTNLLADEWH-LFKATHKKEYPSQLEEKLRMKIYLENKHK 61

Query: 70  IASFN--NKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENAS 127
           +A  N   +   K Y++ +N+F D  + EFR+  NGY+ +  +   +E+T       N  
Sbjct: 62  VAKHNILYEKGEKSYQVAMNKFGDLLHHEFRSIMNGYQHKKQNSSRAESTFTFMEPANVE 121

Query: 128 VPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSG 187
           VP S+DWR+KGA+T VKDQGQCG CWAFS+  A+EG     T KL SLSEQ L+DC    
Sbjct: 122 VPESVDWREKGAITPVKDQGQCGSCWAFSSTGALEGQTFRKTGKLVSLSEQNLIDCSGKY 181

Query: 188 EDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNN 247
            ++GC GGLMD AF++I  NKG+ TE  YPY+A DG C     N  A    G+ D+PS  
Sbjct: 182 GNEGCNGGLMDQAFQYIKDNKGIDTENTYPYEAEDGVCRYNPRNRGAVD-RGFVDIPSGE 240

Query: 248 EAALMKAVANQ-PVSVAIDASGSDFQFYSSG-VFTGQCGT-ELDHGVTAVGYGTADDGTK 304
           E  L  AVA   PVSVAIDAS   FQFYS G  +   C + +LDHGV  VGYG+ D+G  
Sbjct: 241 EDKLKAAVATVGPVSVAIDASHESFQFYSKGXYYEPSCDSDDLDHGVLVVGYGS-DNGED 299

Query: 305 YWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
           YWLVKNSW   WG+ GYI++ R+   ++  CG+A  ASYP
Sbjct: 300 YWLVKNSWSEHWGDEGYIKIARN---RKNHCGVATAASYP 336


>gi|195334204|ref|XP_002033774.1| GM21500 [Drosophila sechellia]
 gi|194125744|gb|EDW47787.1| GM21500 [Drosophila sechellia]
          Length = 341

 Score =  271 bits (692), Expect = 4e-70,   Method: Compositional matrix adjust.
 Identities = 150/327 (45%), Positives = 197/327 (60%), Gaps = 18/327 (5%)

Query: 29  TLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNK-ARNK-PYKLGI 86
           +  D  M E H   + ++ + Y+D+ E+  R KIF EN   IA  N + A  K  +KL +
Sbjct: 20  SFADVVMEEWHTFKL-EHRKNYQDDTEERFRLKIFNENKHKIAKHNQRFAEGKVSFKLAV 78

Query: 87  NEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFR------YENASVPASIDWRKKGAV 140
           N++AD  + EFR   NG+   L   +     D SF+        + ++P S+DWR KGAV
Sbjct: 79  NKYADLLHHEFRQLMNGFNYTLH--KQLRAADESFKGVTFISPAHVTLPKSVDWRTKGAV 136

Query: 141 TGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDA 200
           T VKDQG CG CWAFS+  A+EG +   +  L SLSEQ LVDC T   + GC GGLMD+A
Sbjct: 137 TAVKDQGHCGSCWAFSSTGALEGQHFRKSGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNA 196

Query: 201 FEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVAN-QP 259
           F +I  N G+ TE  YPY+A D SC+  +    A    G+ D+P  +E  + +AVA   P
Sbjct: 197 FRYIKDNGGIDTEKSYPYEAIDDSCHFNKGTIGATD-RGFTDIPQGDEKKMAEAVATVGP 255

Query: 260 VSVAIDASGSDFQFYSSGVFT-GQCGTE-LDHGVTAVGYGTADDGTKYWLVKNSWGTTWG 317
           V+VAIDAS   FQFYS GV+   QC  + LDHGV  VG+GT + G  YWLVKNSWGTTWG
Sbjct: 256 VAVAIDASHESFQFYSEGVYNEPQCDAQNLDHGVLVVGFGTDESGEDYWLVKNSWGTTWG 315

Query: 318 ENGYIRMQRDIDAKEGLCGIAMQASYP 344
           + G+I+M R+   KE  CGIA  +SYP
Sbjct: 316 DKGFIKMLRN---KENQCGIASASSYP 339


>gi|129614|sp|P00784.1|PAPA1_CARPA RecName: Full=Papain; AltName: Full=Papaya proteinase I; Short=PPI;
           AltName: Allergen=Car p 1; Flags: Precursor
 gi|167391|gb|AAB02650.1| papain precursor [Carica papaya]
 gi|387885|gb|AAA72774.1| papain [synthetic construct]
 gi|225437|prf||1303270A papain
          Length = 345

 Score =  271 bits (692), Expect = 5e-70,   Method: Compositional matrix adjust.
 Identities = 154/362 (42%), Positives = 206/362 (56%), Gaps = 36/362 (9%)

Query: 1   MAMILLENKLVLAAI-------LVLGVWAPQSWSRTLNDATMNER----HEMWMAQYGRV 49
           MAMI   +KL+  AI       L  G ++   +S+  ND T  ER     E WM ++ ++
Sbjct: 1   MAMIPSISKLLFVAICLFVYMGLSFGDFSIVGYSQ--NDLTSTERLIQLFESWMLKHNKI 58

Query: 50  YRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLP 109
           Y++  EK  RF+IFK+N++YI   N K  N  Y LG+N FAD +N+EF+    G      
Sbjct: 59  YKNIDEKIYRFEIFKDNLKYIDETNKK--NNSYWLGLNVFADMSNDEFKEKYTG------ 110

Query: 110 SVRSSETTDVSFRYE------NASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEG 163
           S+  + TT     YE      + ++P  +DWR+KGAVT VK+QG CG CWAFSAV  +EG
Sbjct: 111 SIAGNYTT-TELSYEEVLNDGDVNIPEYVDWRQKGAVTPVKNQGSCGSCWAFSAVVTIEG 169

Query: 164 INHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDG 223
           I  I T  L   SEQEL+DCD      GC GG    A + +++  G+     YPY+    
Sbjct: 170 IIKIRTGNLNEYSEQELLDCDR--RSYGCNGGYPWSALQ-LVAQYGIHYRNTYPYEGVQR 226

Query: 224 SCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQC 283
            C  +E  P AAK  G   V   NE AL+ ++ANQPVSV ++A+G DFQ Y  G+F G C
Sbjct: 227 YCRSREKGPYAAKTDGVRQVQPYNEGALLYSIANQPVSVVLEAAGKDFQLYRGGIFVGPC 286

Query: 284 GTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASY 343
           G ++DH V AVGY     G  Y L+KNSWGT WGENGYIR++R      G+CG+   + Y
Sbjct: 287 GNKVDHAVAAVGY-----GPNYILIKNSWGTGWGENGYIRIKRGTGNSYGVCGLYTSSFY 341

Query: 344 PT 345
           P 
Sbjct: 342 PV 343


>gi|413953046|gb|AFW85695.1| thiol protease SEN102 [Zea mays]
          Length = 382

 Score =  271 bits (692), Expect = 5e-70,   Method: Compositional matrix adjust.
 Identities = 156/323 (48%), Positives = 199/323 (61%), Gaps = 19/323 (5%)

Query: 37  ERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEE 96
           ER + W A+Y R Y    E + RF I+ ENV +I + N  +    Y+LG N+F D T EE
Sbjct: 62  ERFKAWQAEYNRTYATPEEFQQRFMIYSENVRFIKTMNQLSTGSSYELGENQFTDLTEEE 121

Query: 97  FRAPRNGYKRRL-----------PSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKD 145
           F+   + Y  +L           P+V +  T  +S        P S+DWR KGAVT VKD
Sbjct: 122 FK---DTYLMKLDEQPPAAEAMPPTVGTMSTAGMSNGNNTGEAPNSVDWRTKGAVTRVKD 178

Query: 146 QGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFII 205
           Q QCG CWAF+ VA++EG++ I T +L SLSEQE+VDCD  G D GC GG    A E++ 
Sbjct: 179 QQQCGSCWAFATVASIEGVHQIKTGRLVSLSEQEIVDCDRGGNDNGCRGGSPRSAMEWVT 238

Query: 206 SNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAID 265
            N GL TE+ YPY  S   C   +    AA+I GY+ V  NNEA L +AVA QPV+V +D
Sbjct: 239 RNGGLTTESDYPYVGSQRQCMSGKLGHHAARIRGYQAVQRNNEAELERAVAGQPVAVFVD 298

Query: 266 ASGSDFQFYSSGVFTGQC-GTELDHGVTAVGYG-TADD--GTKYWLVKNSWGTTWGENGY 321
           AS + FQFY SGVF+G C  T ++H VT VGYG T  D  G KYW+VKNSWG  WGENGY
Sbjct: 299 ASRA-FQFYKSGVFSGPCDTTTVNHVVTVVGYGSTGSDSGGRKYWIVKNSWGQGWGENGY 357

Query: 322 IRMQRDIDAKEGLCGIAMQASYP 344
           +RM R + A+EG+C IA++  YP
Sbjct: 358 VRMARRVRAREGMCAIAIEPYYP 380


>gi|224116880|ref|XP_002317417.1| predicted protein [Populus trichocarpa]
 gi|118488173|gb|ABK95906.1| unknown [Populus trichocarpa]
 gi|222860482|gb|EEE98029.1| predicted protein [Populus trichocarpa]
          Length = 498

 Score =  271 bits (692), Expect = 5e-70,   Method: Compositional matrix adjust.
 Identities = 141/321 (43%), Positives = 192/321 (59%), Gaps = 12/321 (3%)

Query: 30  LNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNK-PYKLGINE 88
           L +  + E  ++W  ++ +VY+   E E R   FK N++YI   N K ++   +K+G+N+
Sbjct: 41  LTEEGITEVFKLWKEKHQKVYKHAEEAERRIGNFKRNLKYIIEKNGKRKSGLEHKVGLNK 100

Query: 89  FADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQ 148
           FAD +NEEFR     Y  ++    + E        +    P+S+DWR KG VT VKDQG 
Sbjct: 101 FADLSNEEFR---EMYLSKVKKPITIEEKRKHRHLQTCDAPSSLDWRNKGVVTAVKDQGD 157

Query: 149 CGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNK 208
           CG CW+FS   A+E IN I T  L SLSEQELVDCDT+  + GCEGG MD AF+++I N 
Sbjct: 158 CGSCWSFSTTGAIEAINAIVTGDLISLSEQELVDCDTTN-NYGCEGGDMDSAFQWVIGNG 216

Query: 209 GLATEAKYPYKASDGSCNKKEANPSAAKISGYEDV-PSNNEAALMKAVANQPVSVAIDAS 267
           G+ TEA YPY   DG+CN  +       I GY DV PS  ++AL+ A   QP+SV +D S
Sbjct: 217 GIDTEADYPYTGVDGTCNTAKEEKKVVSIEGYVDVDPS--DSALLCATVQQPISVGMDGS 274

Query: 268 GSDFQFYSSGVFTGQCG---TELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRM 324
             DFQ Y+ G++ G C     ++DH +  VGYG+ +D   YW+VKNSWGT WG  GY  +
Sbjct: 275 ALDFQLYTGGIYDGDCSGDPNDIDHAILIVGYGSEND-EDYWIVKNSWGTEWGMEGYFYI 333

Query: 325 QRDIDAKEGLCGIAMQASYPT 345
           +R+     G+C I   ASYPT
Sbjct: 334 RRNTSKPYGVCAINADASYPT 354


>gi|218187750|gb|EEC70177.1| hypothetical protein OsI_00904 [Oryza sativa Indica Group]
 gi|222617983|gb|EEE54115.1| hypothetical protein OsJ_00884 [Oryza sativa Japonica Group]
          Length = 327

 Score =  271 bits (692), Expect = 5e-70,   Method: Compositional matrix adjust.
 Identities = 147/321 (45%), Positives = 185/321 (57%), Gaps = 21/321 (6%)

Query: 36  NERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNE 95
            +  E WMA++G+ Y  + EKE RF +F++NV +I S+   A      L +N+FAD TN+
Sbjct: 16  TQMFEEWMAKFGKKYPCHGEKEYRFGVFRDNVRFIRSYRPPAGYNS-ALRVNQFADLTND 74

Query: 96  EFRAPRNGYKRRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQCGCCWAF 155
           EF +   G K   P        D     +   +P  IDWR KGAVT VKDQG CG CWAF
Sbjct: 75  EFVSTHTGAKPPCPK-------DAPRGVDPIWLPCCIDWRYKGAVTDVKDQGACGSCWAF 127

Query: 156 SAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAK 215
           +AVAA+EG+  I T KLT LSEQELVDCDT     GC GG  D AFE + +  G+  E+ 
Sbjct: 128 AAVAAIEGLTQIRTGKLTPLSEQELVDCDTG--SSGCAGGHTDRAFELVAAKGGITAESG 185

Query: 216 YPYKASDGSCNKKEA-NPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFY 274
           Y Y+   G C   +A    AA+I G+  VP  +E  L  AVA QPV+  IDASG  FQFY
Sbjct: 186 YRYEGYRGKCRADDALFNHAARIGGHRAVPPGDERQLATAVARQPVTAYIDASGPAFQFY 245

Query: 275 SSGVFTGQC---------GTELDHGVTAVGYGT-ADDGTKYWLVKNSWGTTWGENGYIRM 324
            SGVF G C             +H VT VGY      G KYW+ KNSWG TWGE GYI +
Sbjct: 246 GSGVFPGPCGSGSGAAAAAPTTNHAVTLVGYCQDGASGKKYWVAKNSWGKTWGEKGYILL 305

Query: 325 QRDIDAKEGLCGIAMQASYPT 345
           ++D+ +  G CG+A+   YPT
Sbjct: 306 EKDVASPHGTCGVAVSPFYPT 326


>gi|238816977|gb|ACR56863.1| cathepsin L-like cysteine proteinase [Delia coarctata]
          Length = 338

 Score =  271 bits (692), Expect = 5e-70,   Method: Compositional matrix adjust.
 Identities = 146/321 (45%), Positives = 194/321 (60%), Gaps = 12/321 (3%)

Query: 34  TMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNN-KARNK-PYKLGINEFAD 91
            + E  + +  ++ + Y    E+  R KIF EN   IA  N   A+ K  +KLG+N++AD
Sbjct: 22  VIKEEWQTFKMEHRKNYLSEVEERFRMKIFNENRHKIAKHNQLYAQGKVSFKLGLNKYAD 81

Query: 92  QTNEEFRAPRNGYKRRL-PSVRSSETTD--VSFRYENASVPASIDWRKKGAVTGVKDQGQ 148
             + EF+   NGY   +   +R+ E  +        N  VP ++DWR+ GAVT VKDQG 
Sbjct: 82  MLHHEFKETMNGYNHTMRKELRAQEGFNGITYISPANVQVPKAVDWRQHGAVTSVKDQGH 141

Query: 149 CGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNK 208
           CG CW+FS+  ++EG +      L SLSEQ LVDC T   + GC GGLMD+AF +I  N 
Sbjct: 142 CGSCWSFSSTGSLEGQHFRKAGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKDNG 201

Query: 209 GLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQ-PVSVAIDAS 267
           G+ TE  YPY+  D SC+  +A   A   +G+ D+P  +E A+MKAVA   PV+VAIDAS
Sbjct: 202 GVDTEKSYPYEGIDDSCHFNKATVGATD-TGFVDIPQGDEEAMMKAVATMGPVAVAIDAS 260

Query: 268 GSDFQFYSSGVFTG-QCGTE-LDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQ 325
              FQ YS GV+    C ++ LDHGV  VGYGT  DG  YWLVKNSWGTTWG+ GYI+M 
Sbjct: 261 NESFQLYSEGVYNDPNCSSDNLDHGVLVVGYGTDKDGQDYWLVKNSWGTTWGDQGYIKMA 320

Query: 326 RDIDAKEGLCGIAMQASYPTA 346
           R+ D +   CGIA  +S+PT 
Sbjct: 321 RNQDNQ---CGIATASSFPTV 338


>gi|1483570|emb|CAA68066.1| cathepsin l [Litopenaeus vannamei]
          Length = 328

 Score =  270 bits (691), Expect = 5e-70,   Method: Compositional matrix adjust.
 Identities = 148/306 (48%), Positives = 189/306 (61%), Gaps = 13/306 (4%)

Query: 44  AQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNK--PYKLGINEFADQTNEEFRAPR 101
           A++GR Y    E+  R  +F++N ++I   N +  N    + L +N+F D T+EEF A  
Sbjct: 29  AEHGRRYASVQEERYRLSVFEQNQQFIDDHNARFENGEVTFTLQMNQFGDMTSEEFTATM 88

Query: 102 NGYKRRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAM 161
           NG+   +PS R +          + ++P  +DWR KGAVT VKDQ QCG CWAFS   ++
Sbjct: 89  NGF-LNVPSRRPTAILRAD---PDETLPKEVDWRTKGAVTPVKDQKQCGSCWAFSTTGSL 144

Query: 162 EGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKAS 221
           EG + +   KL SLSEQ LVDC     + GC GGLMD AF +I +NKG+ TE  YPY+A 
Sbjct: 145 EGQHFLKDGKLVSLSEQNLVDCSDKFGNMGCMGGLMDQAFRYIKANKGIDTEDSYPYEAQ 204

Query: 222 DGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVAN-QPVSVAIDASGSDFQFYSSGVF- 279
           DG C + +A+   A  +GY DV   +E+AL KAVA   P+SVAIDAS   FQFY  GV+ 
Sbjct: 205 DGKC-RFDASNVGATDTGYVDVEHGSESALKKAVATIGPISVAIDASQPSFQFYHDGVYY 263

Query: 280 -TGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIA 338
             G   T LDHGV AVGYG  + G  YWLVKNSW T+WG  GYI+M RD   K+  CGIA
Sbjct: 264 EEGCSSTMLDHGVLAVGYGETEKGEAYWLVKNSWNTSWGNKGYIQMSRD---KKNNCGIA 320

Query: 339 MQASYP 344
            QASYP
Sbjct: 321 SQASYP 326


>gi|383849553|ref|XP_003700409.1| PREDICTED: cathepsin L-like [Megachile rotundata]
          Length = 343

 Score =  270 bits (691), Expect = 5e-70,   Method: Compositional matrix adjust.
 Identities = 156/354 (44%), Positives = 201/354 (56%), Gaps = 23/354 (6%)

Query: 1   MAMILLENKLVLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRF 60
           M  ILL   +  AA+  +      S+   +N   +N + E     + + Y+  AE+ +R 
Sbjct: 1   MKTILLLIVITCAAVQAI------SFFELVNQEWINFKME-----HKKCYKHEAEERLRM 49

Query: 61  KIFKENVEYIASFN--NKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTD 118
           KI+ +N   IA  N   + +   Y+L IN++ D  N EF+   NGY R +     +E   
Sbjct: 50  KIYMKNKLQIAQHNCDYELKKVTYRLKINKYGDMLNHEFKNMLNGYNRTINHTLRNERLP 109

Query: 119 VSFRYE---NASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSL 175
           V   +    N  +P  +DWRK GAVT VKDQG CG CWAFSA  ++EG +   T  L SL
Sbjct: 110 VGAAFIEPCNVELPKMVDWRKCGAVTEVKDQGHCGSCWAFSATGSLEGQHFRRTGVLVSL 169

Query: 176 SEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAA 235
           SEQ L+DC  S  + GC GGLMD AF +I  NKGL TE  YPY+  D  C + +   S A
Sbjct: 170 SEQNLIDCSGSYGNNGCNGGLMDQAFSYIKDNKGLDTEKTYPYEGEDDKC-RYDKRSSGA 228

Query: 236 KISGYEDVPSNNEAALMKAVANQ-PVSVAIDASGSDFQFYSSGV-FTGQC-GTELDHGVT 292
              G+ D+P  +E  L  AVA   PVSVAIDAS   FQFYS G+ F  +C  T LDHGV 
Sbjct: 229 SDVGFVDIPVGDEQKLKAAVATVGPVSVAIDASHQSFQFYSDGIYFEPECSSTNLDHGVL 288

Query: 293 AVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPTA 346
            VGYGT ++G  YW+VKNSWG +WGE GYI+M R+ID     CGIA  ASYP  
Sbjct: 289 VVGYGTDEEGRDYWIVKNSWGESWGEKGYIKMARNIDNH---CGIASSASYPIV 339


>gi|320169652|gb|EFW46551.1| cathepsin L2 [Capsaspora owczarzaki ATCC 30864]
          Length = 325

 Score =  270 bits (691), Expect = 5e-70,   Method: Compositional matrix adjust.
 Identities = 147/307 (47%), Positives = 193/307 (62%), Gaps = 11/307 (3%)

Query: 42  WMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPR 101
           W A + R Y    E+ +R +I+  N+E I   N   R+  Y LG+NEF D  + EF A  
Sbjct: 24  WKALHNRQYASAQEEALRQEIYLSNLELINEHNAAGRHS-YTLGMNEFGDLAHHEFAAKY 82

Query: 102 NGYKRRLPSVRSSET-TDVSFRYENASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAA 160
            G   R   V ++++    ++     S+P S+DWR  G VT VK+QGQCG CW+FS   +
Sbjct: 83  LGV--RFNGVNATKSFASSTYLPRMVSLPDSVDWRTAGIVTPVKNQGQCGSCWSFSTTGS 140

Query: 161 MEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKA 220
           +EG +   T  L SLSEQ LVDC +   ++GC GGLMDDAFE+II N G+ TEA YPY A
Sbjct: 141 VEGQHARKTGTLVSLSEQNLVDCSSQEGNEGCNGGLMDDAFEYIIKNGGIDTEASYPYTA 200

Query: 221 SDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVAN-QPVSVAIDASGSDFQFYSSGVF 279
           + G+C    AN   A ++ Y+D+ + +E+ L  AVA   PVSVAIDAS  +FQFY +GV+
Sbjct: 201 TTGTCKFNAANI-GATVASYQDIITGSESDLQNAVATVGPVSVAIDASHINFQFYFTGVY 259

Query: 280 T-GQCG-TELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGI 337
              +C  T+LDHGV AVGYGT+ +G  YWLVKNSWG TWG+ GYI M R+ D +   CGI
Sbjct: 260 NEKKCSTTQLDHGVLAVGYGTSTEGKDYWLVKNSWGATWGKAGYIWMSRNADNQ---CGI 316

Query: 338 AMQASYP 344
           A  ASYP
Sbjct: 317 ATSASYP 323


>gi|226499806|ref|NP_001151335.1| cysteine protease 1 [Zea mays]
 gi|195645896|gb|ACG42416.1| cysteine protease 1 precursor [Zea mays]
          Length = 258

 Score =  270 bits (691), Expect = 5e-70,   Method: Compositional matrix adjust.
 Identities = 150/270 (55%), Positives = 185/270 (68%), Gaps = 23/270 (8%)

Query: 86  INEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENASVP------ASIDWRKKGA 139
           +NEFAD TN+EF A   G    L  V +       F+Y N ++        ++DWR+KGA
Sbjct: 3   LNEFADMTNDEFMAMYTG----LRPVPAGAKKMAGFKYGNVTLSDADDDQQTVDWRQKGA 58

Query: 140 VTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDD 199
           VTG+KDQ QCGCCWAF+AVAA+EGI+ ITT  L SLSEQ+++DCDT G + GC GG +D+
Sbjct: 59  VTGIKDQRQCGCCWAFAAVAAVEGIHQITTGNLVSLSEQQVLDCDTDGNN-GCNGGYIDN 117

Query: 200 AFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQP 259
           AF++I+ N GLATE  YPY A+   C  +   P AA ISGY+DVPS +EAAL  AVANQP
Sbjct: 118 AFQYIVGNGGLATEDAYPYTAAQAMC--QSVQPVAA-ISGYQDVPSGDEAALAAAVANQP 174

Query: 260 VSVAIDASGSDFQFYSSGVFT-GQCGT--ELDHGVTAVGYGTADDGTKYWLVKNSWGTTW 316
           VSVAIDA   +FQ Y  GV T   C T   L+H VTAVGYGTA+DGT YWL+KN WG  W
Sbjct: 175 VSVAIDA--HNFQLYGGGVMTAASCSTPPNLNHAVTAVGYGTAEDGTPYWLLKNQWGQNW 232

Query: 317 GENGYIRMQRDIDAKEGLCGIAMQASYPTA 346
           GE GY+R++R  +A    CG+A QASYP A
Sbjct: 233 GEGGYLRLERGANA----CGVAQQASYPVA 258


>gi|414875906|tpg|DAA53037.1| TPA: hypothetical protein ZEAMMB73_586844 [Zea mays]
          Length = 1039

 Score =  270 bits (691), Expect = 5e-70,   Method: Compositional matrix adjust.
 Identities = 126/195 (64%), Positives = 152/195 (77%), Gaps = 2/195 (1%)

Query: 150 GCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKG 209
           G CWAFS +AA+EGIN I T  L SLSEQELVDCDTS  +QGC GGLMD AFEFII+N G
Sbjct: 713 GSCWAFSTIAAVEGINQIVTGDLISLSEQELVDCDTS-YNQGCNGGLMDYAFEFIINNGG 771

Query: 210 LATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGS 269
           + TE  YPYK +DG C+    N     I  YEDVP+N+E +L KAVANQPVSVAI+A+G+
Sbjct: 772 IDTEKDYPYKGTDGRCDVNRKNAKVVTIDSYEDVPANDEKSLQKAVANQPVSVAIEAAGT 831

Query: 270 DFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDID 329
            FQ YSSG+FTG CGT LDHGVT VGYGT ++G  YW++KNSWG++WGE+GY+RM+R+I 
Sbjct: 832 TFQLYSSGIFTGSCGTALDHGVTVVGYGT-ENGKDYWIMKNSWGSSWGESGYVRMERNIK 890

Query: 330 AKEGLCGIAMQASYP 344
           A  G CGIA++ SYP
Sbjct: 891 ASSGKCGIAVEPSYP 905


>gi|357130490|ref|XP_003566881.1| PREDICTED: actinidain-like [Brachypodium distachyon]
          Length = 350

 Score =  270 bits (691), Expect = 5e-70,   Method: Compositional matrix adjust.
 Identities = 153/339 (45%), Positives = 202/339 (59%), Gaps = 17/339 (5%)

Query: 5   LLENKLVLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFK 64
           L    LVL A  V    A Q  +      T+  RHE WMA++GRVY D  EK  R  +F 
Sbjct: 10  LCAGLLVLVATAVFHAVAAQGEA----GLTVAARHEQWMAKFGRVYTDANEKARRQAVFG 65

Query: 65  ENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLP-SVRSSETTDVSFRY 123
            N  Y+ + N +A N+ Y LG+NEF+D T+ EF     GY+   P +   S+  D  +  
Sbjct: 66  ANARYVDAVN-RAGNRTYTLGLNEFSDLTDNEFAKTHLGYREFRPETANISKGVDPGYGL 124

Query: 124 ENASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDC 183
              ++P S DWR KGAVT VK QG CGCCWAF+AVAA EG+  I    L S+SEQ+++DC
Sbjct: 125 A-GNIPKSFDWRTKGAVTEVKSQGGCGCCWAFAAVAATEGLVKIAKGTLISMSEQQVLDC 183

Query: 184 DTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGY-ED 242
            T   +  C+GG M+DA  ++ ++ GL TE  Y Y A  G+C +++  P+ A   G+ E 
Sbjct: 184 TTG--NNTCKGGYMNDALSYVFASGGLQTEEDYEYNAEKGAC-RRDVTPNPATSVGHAEY 240

Query: 243 VP-SNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTG--QCGTELDHGVTAVGYGTA 299
           +P   NE  L K VA QPV VA++A G+DF+ Y  GVFTG   CG  LDH  T VGYG A
Sbjct: 241 MPLDGNEFLLQKLVARQPVVVAVEAYGTDFKNYGGGVFTGSPSCGQNLDHFFTVVGYGFA 300

Query: 300 DDGTK-YWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGI 337
           D G + YWLVKN WGT+WGE+GY+R+ R   A+   CG+
Sbjct: 301 DGGKQMYWLVKNQWGTSWGESGYMRIARGSSARN--CGM 337


>gi|1709574|sp|P10056.2|PAPA3_CARPA RecName: Full=Caricain; AltName: Full=Papaya peptidase A; AltName:
           Full=Papaya proteinase III; Short=PPIII; AltName:
           Full=Papaya proteinase omega; Flags: Precursor
 gi|18098|emb|CAA46862.1| proteinase omega [Carica papaya]
          Length = 348

 Score =  270 bits (691), Expect = 6e-70,   Method: Compositional matrix adjust.
 Identities = 158/357 (44%), Positives = 207/357 (57%), Gaps = 23/357 (6%)

Query: 1   MAMILLENKLVLAAILVL-------GVWAPQSWSRTLNDATMNER----HEMWMAQYGRV 49
           MAMI   +KL+  AI +        G ++   +S+  +D T  ER       WM  + + 
Sbjct: 1   MAMIPSISKLLFVAICLFVHMSVSFGDFSIVGYSQ--DDLTSTERLIQLFNSWMLNHNKF 58

Query: 50  YRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLP 109
           Y +  EK  RF+IFK+N+ YI   N K  N  Y LG+NEFAD +N+EF      Y   L 
Sbjct: 59  YENVDEKLYRFEIFKDNLNYIDETNKK--NNSYWLGLNEFADLSNDEFNEK---YVGSLI 113

Query: 110 SVRSSETTDVSFRYEN-ASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHIT 168
                ++ D  F  E+  ++P ++DWRKKGAVT V+ QG CG CWAFSAVA +EGIN I 
Sbjct: 114 DATIEQSYDEEFINEDTVNLPENVDWRKKGAVTPVRHQGSCGSCWAFSAVATVEGINKIR 173

Query: 169 TRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKK 228
           T KL  LSEQELVDC+      GC+GG    A E++  N G+   +KYPYKA  G+C  K
Sbjct: 174 TGKLVELSEQELVDCER--RSHGCKGGYPPYALEYVAKN-GIHLRSKYPYKAKQGTCRAK 230

Query: 229 EANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELD 288
           +      K SG   V  NNE  L+ A+A QPVSV +++ G  FQ Y  G+F G CGT++D
Sbjct: 231 QVGGPIVKTSGVGRVQPNNEGNLLNAIAKQPVSVVVESKGRPFQLYKGGIFEGPCGTKVD 290

Query: 289 HGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPT 345
           H VTAVGYG +       L+KNSWGT WGE GYIR++R      G+CG+   + YPT
Sbjct: 291 HAVTAVGYGKSGGKGYI-LIKNSWGTAWGEKGYIRIKRAPGNSPGVCGLYKSSYYPT 346


>gi|310942960|pdb|3P5W|A Chain A, Actinidin From Actinidia Arguta Planch (Sarusashi)
          Length = 220

 Score =  270 bits (691), Expect = 6e-70,   Method: Compositional matrix adjust.
 Identities = 130/218 (59%), Positives = 156/218 (71%), Gaps = 2/218 (0%)

Query: 128 VPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSG 187
           +P  +DWR  GAV  +KDQGQCG CWAFS +AA+EGIN I T  L SLSEQELVDC  + 
Sbjct: 1   LPDYVDWRSSGAVVDIKDQGQCGSCWAFSTIAAVEGINKIATGDLISLSEQELVDCGRTQ 60

Query: 188 EDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNN 247
             +GC+GG M D F+FII+N G+ TEA YPY A +G CN          I  YE+VP NN
Sbjct: 61  NTRGCDGGFMTDGFQFIINNGGINTEANYPYTAEEGQCNLDLQQEKYVSIDTYENVPYNN 120

Query: 248 EAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWL 307
           E AL  AVA QPVSVA++A+G +FQ YSSG+FTG CGT +DH VT VGYGT + G  YW+
Sbjct: 121 EWALQTAVAYQPVSVALEAAGYNFQHYSSGIFTGPCGTAVDHAVTIVGYGT-EGGIDYWI 179

Query: 308 VKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPT 345
           VKNSWGTTWGE GY+R+QR++    G CGIA +ASYP 
Sbjct: 180 VKNSWGTTWGEEGYMRIQRNVGGV-GQCGIAKKASYPV 216


>gi|358345461|ref|XP_003636796.1| Cysteine proteinase [Medicago truncatula]
 gi|355502731|gb|AES83934.1| Cysteine proteinase [Medicago truncatula]
          Length = 475

 Score =  270 bits (690), Expect = 7e-70,   Method: Compositional matrix adjust.
 Identities = 141/314 (44%), Positives = 190/314 (60%), Gaps = 27/314 (8%)

Query: 37  ERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKP--YKLGINEFADQTN 94
           E  + W  ++ + Y    E  +R + FK N++YI    N  RN P  + LG+N FAD +N
Sbjct: 49  ELFQQWKKEHQKFYIHPEEAALRLENFKRNLKYIVE-RNAMRNSPVGHHLGLNRFADMSN 107

Query: 95  EEFRAPRNGYKRRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQCGCCWA 154
           EEF+       + +  V S +             P S+DWRKKG VTGVKDQG CG CW+
Sbjct: 108 EEFK------NKFISKVESCD-----------DAPYSLDWRKKGVVTGVKDQGNCGSCWS 150

Query: 155 FSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEA 214
           FS+  A+EG+N I T  L SLSEQELVDCDT+ +  GCEGG MD AFE++I+N G+ TEA
Sbjct: 151 FSSTGAIEGVNAIVTGDLISLSEQELVDCDTTND--GCEGGYMDYAFEWVINNGGIDTEA 208

Query: 215 KYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFY 274
            YPY    G+CN  +       I GY DV + +++AL  A   QP+SV ID S  DFQ Y
Sbjct: 209 DYPYIGVGGTCNVTKEETKVVTIDGYTDV-TQSDSALFCATVKQPISVGIDGSTLDFQLY 267

Query: 275 SSGVFTGQCGT---ELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAK 331
           + G++ G C +   ++DH V  VGYG +D    YW+VKNSWGT+WG  G+I ++R+ + K
Sbjct: 268 TGGIYDGDCSSNPDDIDHAVLIVGYG-SDGNQDYWIVKNSWGTSWGIEGFIYIRRNTNLK 326

Query: 332 EGLCGIAMQASYPT 345
            G+C I   AS+PT
Sbjct: 327 YGVCAINYMASFPT 340


>gi|410519429|gb|AFV73398.1| cathepsin L [Haliotis discus hannai]
          Length = 326

 Score =  270 bits (690), Expect = 7e-70,   Method: Compositional matrix adjust.
 Identities = 153/324 (47%), Positives = 193/324 (59%), Gaps = 17/324 (5%)

Query: 30  LNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARN--KPYKLGIN 87
           L   +++    M+  ++ + Y+DN E+  R  +F + VEYI   N +A      +++GIN
Sbjct: 13  LASCSLDREWGMFKVRHNKQYKDNQEEAYRKGVFMKAVEYIQQHNLEADRGVHSFRVGIN 72

Query: 88  EFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQG 147
           E+AD  NEEF    NGYK +    ++      S       +PA++DWR KG VT VK+QG
Sbjct: 73  EYADMPNEEFVRVMNGYKMQEQRPKAPTYMPPS---NVGDLPATVDWRTKGYVTEVKNQG 129

Query: 148 QCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISN 207
           QCG CWAFS+  ++EG       KL SLSEQ LVDC T   + GC GGLMD AF +I  N
Sbjct: 130 QCGSCWAFSSTGSLEGQTFKKYNKLISLSEQNLVDCSTEQGNMGCGGGLMDQAFTYIKVN 189

Query: 208 KGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVAN-QPVSVAIDA 266
            G+ TE  YPY+A+ G C   +AN   A  +GY D+ S +E+ L  AVA   P++VAIDA
Sbjct: 190 DGIDTETSYPYEAASGKCRFNKAN-VGANDTGYTDIKSKSESDLQSAVATVGPIAVAIDA 248

Query: 267 SGSDFQFYSSGV----FTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYI 322
           S   FQ Y SGV    F  Q  T LDHGV AVGYGT D G  YWLVKNSWG TWG+ GYI
Sbjct: 249 SHMSFQLYKSGVYHYIFCSQ--TRLDHGVLAVGYGT-DSGKDYWLVKNSWGATWGQQGYI 305

Query: 323 RMQRDIDAKEGLCGIAMQASYPTA 346
            M R+ D     CGIA QASYPT 
Sbjct: 306 MMSRNRDNN---CGIATQASYPTV 326


>gi|151573014|gb|ABS17682.1| cathepsin L-1 [Artemia salina]
          Length = 334

 Score =  270 bits (690), Expect = 7e-70,   Method: Compositional matrix adjust.
 Identities = 151/336 (44%), Positives = 201/336 (59%), Gaps = 12/336 (3%)

Query: 15  ILVLG-VWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASF 73
           I +LG V    S + +L +   +E H ++ A + + Y    E++ R KI+ EN   +A  
Sbjct: 3   IFLLGAVLVQLSAALSLTNLLADEWH-LFKATHKKEYPSQLEEKFRMKIYLENKHKVAKH 61

Query: 74  N--NKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENASVPAS 131
           N   +   K Y + +N+F D  + EFR+  NGY+ +  +   +E+T       N +VP S
Sbjct: 62  NILYEKGEKSYHVAMNKFGDLLHHEFRSIMNGYQHKKQNSSRAESTFTFMEPANVTVPES 121

Query: 132 IDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQG 191
           +DWR+KGA+T VKDQGQCG CWAFS+  A+EG     T KL SLSEQ L+DC     ++G
Sbjct: 122 VDWREKGAITPVKDQGQCGSCWAFSSTGALEGQTFRKTGKLVSLSEQNLIDCSGKYGNEG 181

Query: 192 CEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAAL 251
           C GGLMD AF++I  NKG+ TE  YPY+A D  C     N  A    G+ D+PS  E  L
Sbjct: 182 CNGGLMDQAFQYIKDNKGIDTENTYPYEAEDDVCRYNPRNRGAVD-RGFVDIPSGEEDKL 240

Query: 252 MKAVANQ-PVSVAIDASGSDFQFYSSGV-FTGQCGT-ELDHGVTAVGYGTADDGTKYWLV 308
             AVA   PVSVAIDAS   FQFYS GV +   C + +LDHGV  VGYG+ D+G  YWLV
Sbjct: 241 KAAVATVGPVSVAIDASHESFQFYSKGVYYEPSCDSDDLDHGVLVVGYGS-DNGKDYWLV 299

Query: 309 KNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
           KNSW   WG+ GYI+M R+   ++  CG+A  ASYP
Sbjct: 300 KNSWSEHWGDEGYIKMARN---RKNHCGVASAASYP 332


>gi|356549192|ref|XP_003542981.1| PREDICTED: cysteine proteinase RD21a-like [Glycine max]
          Length = 517

 Score =  270 bits (690), Expect = 7e-70,   Method: Compositional matrix adjust.
 Identities = 141/314 (44%), Positives = 196/314 (62%), Gaps = 13/314 (4%)

Query: 37  ERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPY--KLGINEFADQTN 94
           E  + W  +  ++YR   ++++RF+ FK N++YIA  N+K R  PY   LG+N FAD +N
Sbjct: 48  ELFQRWKEENKKIYRSPDQEKLRFENFKRNLKYIAEKNSK-RISPYGQSLGLNRFADMSN 106

Query: 95  EEFRAPRNGYKRRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQCGCCWA 154
           EEF++     K + P  + +  +      E+A  P S+DWRKKG VT VKDQG CGCCWA
Sbjct: 107 EEFKSKFTS-KVKKPFSKRNGLSGKDHSCEDA--PYSLDWRKKGVVTAVKDQGYCGCCWA 163

Query: 155 FSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEA 214
           FS+  A+EGIN I +  L SLSE ELVDCD +  + GC+GG MD AFE+++ N G+ TE 
Sbjct: 164 FSSTGAIEGINAIVSGDLISLSEPELVDCDRT--NDGCDGGHMDYAFEWVMHNGGIDTET 221

Query: 215 KYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFY 274
            YPY  +DG+CN  +       I GY +V   ++ +L+ A   QP+S  ID S  DFQ Y
Sbjct: 222 NYPYSGADGTCNVAKEETKVIGIDGYYNV-EQSDRSLLCATVKQPISAGIDGSSWDFQLY 280

Query: 275 SSGVFTGQCGT---ELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAK 331
             G++ G C +   ++DH +  VGYG+  D   YW+VKNSWGT+WG  GYI ++R+ + K
Sbjct: 281 IGGIYDGDCSSDPDDIDHAILVVGYGSEGD-EDYWIVKNSWGTSWGMEGYIYIRRNTNLK 339

Query: 332 EGLCGIAMQASYPT 345
            G+C I   ASYPT
Sbjct: 340 YGVCAINYMASYPT 353


>gi|225706370|gb|ACO09031.1| Cathepsin L precursor [Osmerus mordax]
          Length = 337

 Score =  270 bits (690), Expect = 8e-70,   Method: Compositional matrix adjust.
 Identities = 151/322 (46%), Positives = 194/322 (60%), Gaps = 17/322 (5%)

Query: 32  DATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFN--NKARNKPYKLGINEF 89
           DA  +E  ++W + + + Y+   E+  R  ++++N++ I   N  +      Y LG+N F
Sbjct: 22  DAQFDEHWDLWKSWHSKNYQHEKEEGWRRMVWEKNLKKIEMHNLEHSLGKHSYSLGMNHF 81

Query: 90  ADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQC 149
            D TNEEFR   NGYK +    + S    +     N   P  +DWR++G VT VKDQGQC
Sbjct: 82  GDMTNEEFRQVMNGYKLQQRKFKGS----LFLEPNNMEAPKQVDWREEGYVTPVKDQGQC 137

Query: 150 GCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKG 209
           G CWAFS   AMEG     T+KL SLSEQ LVDC     ++GC GGLMD AF++I  N G
Sbjct: 138 GSCWAFSTTGAMEGQMFRKTQKLVSLSEQNLVDCSRPEGNEGCNGGLMDQAFQYIQDNSG 197

Query: 210 LATEAKYPYKASDGS-CNKKEANPSAAKISGYEDVPSNNEAALMKAVAN-QPVSVAIDAS 267
           L +E  YPY  +D   CN K A  SAA  +G+ D+PS  E ALMKA+A+  PVSVAIDA 
Sbjct: 198 LDSEEAYPYLGTDDQPCNYK-AEFSAANDTGFMDIPSGKEHALMKAIASVGPVSVAIDAG 256

Query: 268 GSDFQFYSSGV-FTGQCGT-ELDHGVTAVGYGTAD---DGTKYWLVKNSWGTTWGENGYI 322
              FQFY SG+ +  +C + ELDHGV AVGYG      DG KYW+VKNSW   WG+ GYI
Sbjct: 257 HESFQFYQSGIYYEKECSSEELDHGVLAVGYGFEGEDVDGKKYWIVKNSWSEKWGDKGYI 316

Query: 323 RMQRDIDAKEGLCGIAMQASYP 344
            M +D   ++  CGIA  ASYP
Sbjct: 317 LMAKD---RKNHCGIATAASYP 335


>gi|226503205|ref|NP_001150062.1| thiol protease SEN102 precursor [Zea mays]
 gi|195636390|gb|ACG37663.1| thiol protease SEN102 precursor [Zea mays]
          Length = 349

 Score =  270 bits (690), Expect = 8e-70,   Method: Compositional matrix adjust.
 Identities = 149/318 (46%), Positives = 192/318 (60%), Gaps = 17/318 (5%)

Query: 37  ERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEE 96
           +R + W A+Y R Y    E + RF ++ ENV++I + N    +  Y+LG N FAD T EE
Sbjct: 35  DRFQAWQAEYNRTYATPEEFQQRFMVYSENVKFIETMNQPGSS--YELGENRFADLTEEE 92

Query: 97  FRAPRNGYKRRLPSVRSSE-----TTDVSFRYENAS------VPASIDWRKKGAVTGVKD 145
           F+   + Y  +L +V SS      T D   R   +        P S+DWR KGAVT VK 
Sbjct: 93  FK---DTYLMKLDNVASSPEAMALTVDTMNRAGTSGGSNTNEAPNSVDWRTKGAVTPVKS 149

Query: 146 QGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFII 205
           Q  CG CWAF+AVA++EG++ I T  L SLSEQE+VDCD  G + GC GG    A E++ 
Sbjct: 150 QQHCGSCWAFAAVASIEGVHKIKTGLLVSLSEQEIVDCDRGGNNHGCHGGHSSSAMEWVT 209

Query: 206 SNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAID 265
            N GL TE+ YPY    G C   +    AAKI G + V   NE AL  AVA +PV+V+I+
Sbjct: 210 RNGGLTTESDYPYVGRQGQCMSDKLGHHAAKIRGRQAVQGKNEGALQHAVAGRPVAVSIN 269

Query: 266 ASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQ 325
           AS + FQFY  G+F+G C T  +H VT VGYG    G KYW+VKNSWG  WGE GY+RMQ
Sbjct: 270 ASRA-FQFYKRGIFSGPCNTTRNHAVTVVGYGANASGHKYWIVKNSWGERWGEKGYVRMQ 328

Query: 326 RDIDAKEGLCGIAMQASY 343
           R + A+EG+CGIA+   Y
Sbjct: 329 RGVRAREGVCGIAIAPFY 346


>gi|530736|emb|CAA56915.1| cathepsin l [Nephrops norvegicus]
 gi|1582621|prf||2119193B cathepsin L-related Cys protease
          Length = 313

 Score =  270 bits (689), Expect = 8e-70,   Method: Compositional matrix adjust.
 Identities = 155/318 (48%), Positives = 201/318 (63%), Gaps = 16/318 (5%)

Query: 33  ATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNK--PYKLGINEFA 90
           AT +   E +  QYGR Y D  E+  R ++F++N + + +FN K  N    +K+ +N+F 
Sbjct: 6   ATASPSWEHFKTQYGRKYGDAKEELYRQRVFQQNEQLVEAFNKKFENGEVTFKVAMNQFG 65

Query: 91  DQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQCG 150
           D TNEEF A   GYK+     R   TT   F  E   + A +DWR KGAVT VKDQGQCG
Sbjct: 66  DMTNEEFNAVMKGYKK---GSRGEPTT--VFTAEGRPMAADVDWRTKGAVTPVKDQGQCG 120

Query: 151 CCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGL 210
            CWAFSA  ++EG + +   +L SLSEQELVDC T   + GC GG M  AF++I  N G+
Sbjct: 121 SCWAFSATGSLEGQHFLKNNELVSLSEQELVDCSTEYGNDGCGGGWMTSAFDYIKDNGGI 180

Query: 211 ATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVAN-QPVSVAIDASGS 269
            TE+ YPY+A D SC + +AN   A  +G+ +V  + E AL +AV++  P+SVAIDAS  
Sbjct: 181 DTESSYPYEAQDRSC-RFDANSIGATCTGFVEV-QHTEEALHEAVSDIGPISVAIDASHF 238

Query: 270 DFQFYSSGV-FTGQCG-TELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRD 327
            FQFYSSGV +  +C  T LDHGV AVGYGT +    YWLVKNSWG+ WG+ GYI+M R+
Sbjct: 239 SFQFYSSGVYYEKKCSPTNLDHGVLAVGYGT-ESTEDYWLVKNSWGSGWGDAGYIKMSRN 297

Query: 328 IDAKEGLCGIAMQASYPT 345
            D     CGIA + SYPT
Sbjct: 298 RDNN---CGIASEPSYPT 312


>gi|195124431|ref|XP_002006696.1| GI21205 [Drosophila mojavensis]
 gi|193911764|gb|EDW10631.1| GI21205 [Drosophila mojavensis]
          Length = 339

 Score =  270 bits (689), Expect = 9e-70,   Method: Compositional matrix adjust.
 Identities = 151/343 (44%), Positives = 203/343 (59%), Gaps = 17/343 (4%)

Query: 11  VLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYI 70
            L A+L L V   Q+ S    D    E H   + ++ + Y+D  E+  R KIF EN   I
Sbjct: 3   TLYALLAL-VAVAQAVS--FADVIKEEWHTFKL-EHRKTYQDETEERFRLKIFNENKHKI 58

Query: 71  ASFNNK--ARNKPYKLGINEFADQTNEEFRAPRNGYKRRL-PSVRSSETTDVSFRY---E 124
           A  N +       +K+ +N++AD  + EFR   NG+   L   +R+S+ +     +    
Sbjct: 59  AKHNQRYATGEVTFKMAVNKYADMLHHEFRETMNGFNYTLHKELRASDPSFTGITFISPA 118

Query: 125 NASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCD 184
           +  +P S+DWR+KGAVT VKDQG CG CWAFS+  A+EG +   T  L SLSEQ LVDC 
Sbjct: 119 HVKLPKSVDWREKGAVTAVKDQGHCGSCWAFSSTGALEGQHFRKTGTLVSLSEQNLVDCS 178

Query: 185 TSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVP 244
               + GC GGLMD+AF +I  N G+ TE  YPY+  D SC+  + +   A   G+ D+P
Sbjct: 179 AKYGNNGCNGGLMDNAFRYIKDNGGIDTEKSYPYEGIDDSCHFNK-DSVGATDRGFADIP 237

Query: 245 SNNEAALMKAVAN-QPVSVAIDASGSDFQFYSSGVFT-GQCGTE-LDHGVTAVGYGTADD 301
             NE  + +AVA   PVSVAIDAS   FQFYS G++   +C ++ LDHGV  VGYGT + 
Sbjct: 238 QGNEKKMAEAVATIGPVSVAIDASHESFQFYSEGIYNEPECNSQNLDHGVLVVGYGTDES 297

Query: 302 GTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
           G  YWLVKNSWGTTWG+ G+I+M R+ D +   CGIA  +SYP
Sbjct: 298 GKDYWLVKNSWGTTWGDKGFIKMARNEDNQ---CGIASASSYP 337


>gi|32394728|gb|AAM96000.1| cathepsin L precursor [Metapenaeus ensis]
          Length = 322

 Score =  270 bits (689), Expect = 9e-70,   Method: Compositional matrix adjust.
 Identities = 146/306 (47%), Positives = 189/306 (61%), Gaps = 14/306 (4%)

Query: 44  AQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNK--PYKLGINEFADQTNEEFRAPR 101
            QYGR Y    E   R  +F++N ++I   N K  N    + L +N+F D T+EEF A  
Sbjct: 24  VQYGRHYGTAREDLYRQSVFEQNQQFIEDHNAKFENGEVTFTLKMNQFGDMTSEEFAATM 83

Query: 102 NGYKRRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAM 161
           NG+   +P+       +     ++ ++P  +DWR KGAVT VKDQ QCG CWAFS   ++
Sbjct: 84  NGF-LNVPTRHPVAILEA----DDETLPKHVDWRTKGAVTPVKDQKQCGSCWAFSTTGSL 138

Query: 162 EGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKAS 221
           EG + +   KL SLSEQ LVDC     + GC GGLMD AF++I  NKG+ TE  YPY+A 
Sbjct: 139 EGQHFLKDGKLVSLSEQNLVDCSGKFGNMGCCGGLMDQAFKYIKENKGIDTEESYPYEAQ 198

Query: 222 DGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVAN-QPVSVAIDASGSDFQFYSSGV-F 279
           DG C    +N  A   +G+ D+    E +LMKAVAN  P+SVAIDAS   FQFY  GV +
Sbjct: 199 DGKCRFDSSNVGATD-TGFVDIAHGEENSLMKAVANIGPISVAIDASHPSFQFYHQGVYY 257

Query: 280 TGQC-GTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIA 338
             +C  T LDHGV A+GYG  DDG +YWLVKNSW T+WG+ G+I+M R+   K+  CGIA
Sbjct: 258 EKECSSTMLDHGVLAIGYGETDDGKEYWLVKNSWNTSWGDKGFIQMSRN---KKNNCGIA 314

Query: 339 MQASYP 344
            QASYP
Sbjct: 315 SQASYP 320


>gi|402770517|gb|AFQ98393.1| cathepsin L [Rhipicephalus microplus]
          Length = 332

 Score =  270 bits (689), Expect = 1e-69,   Method: Compositional matrix adjust.
 Identities = 151/310 (48%), Positives = 193/310 (62%), Gaps = 12/310 (3%)

Query: 40  EMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNK--PYKLGINEFADQTNEEF 97
           E +   + + Y+ + E+ +RFKIF EN   IA  N K       YKLG+N+F D    EF
Sbjct: 28  EAFKTTHKKTYQSHMEELLRFKIFTENSLIIAKHNAKYAKGLVSYKLGMNQFGDLLAHEF 87

Query: 98  RAPRNGYKRRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQCGCCWAFSA 157
               NG+     +  SS     +    ++S+P  +DWRKKGAVT VKDQGQCG CWAFSA
Sbjct: 88  ARIFNGHHGTRKTGGSSFLPPANVN--DSSLPKVVDWRKKGAVTPVKDQGQCGSCWAFSA 145

Query: 158 VAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYP 217
             ++EG + +   +L SLSEQ LVDC  S  + GCEGGLM+DAF++I +N G+ TE  YP
Sbjct: 146 TGSLEGQHFLKNGELVSLSEQNLVDCSQSFGNNGCEGGLMEDAFKYIKANDGIDTEKSYP 205

Query: 218 YKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVAN-QPVSVAIDASGSDFQFYSS 276
           YKA DG C  K+ +  A   +GY ++ + +E  L KAVA   P+SVAIDAS S FQ YS 
Sbjct: 206 YKAVDGECRFKKEDVGATD-TGYVEIKAGSEVDLKKAVATVGPISVAIDASHSSFQLYSE 264

Query: 277 GVF-TGQCGTE-LDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGL 334
           GV+   +C +E LDHGV  VGYG    G KYWLVKNSW  +WG+ GYI M RD + +   
Sbjct: 265 GVYDEPECSSEDLDHGVLVVGYGV-KGGKKYWLVKNSWAESWGDQGYILMSRDNNNQ--- 320

Query: 335 CGIAMQASYP 344
           CGIA QASYP
Sbjct: 321 CGIASQASYP 330


>gi|402770505|gb|AFQ98387.1| cathepsin L [Rhipicephalus microplus]
          Length = 332

 Score =  270 bits (689), Expect = 1e-69,   Method: Compositional matrix adjust.
 Identities = 155/339 (45%), Positives = 206/339 (60%), Gaps = 19/339 (5%)

Query: 11  VLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYI 70
           VL AI+ + V A        +   +  + E +   + + Y+ + E+ +RFKIF EN   I
Sbjct: 6   VLCAIVAVTVAAS-------SQEILRTQWEAFKTTHKKTYQSHMEELLRFKIFTENSLII 58

Query: 71  ASFNNKARNK--PYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENASV 128
           A  N K       YKLG+N+F D    EF    NG++    +  S+     +    ++S+
Sbjct: 59  AKHNAKYAKGLVSYKLGMNQFGDLLAHEFARIFNGHRGTRKTGGSTFLPPANVN--DSSL 116

Query: 129 PASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGE 188
           P ++DWRKKGAVT VKDQGQCG CWAFSA  ++EG + +   +L SLSEQ LVDC  S  
Sbjct: 117 PKAVDWRKKGAVTPVKDQGQCGSCWAFSATGSLEGQHFLKNGELVSLSEQNLVDCSQSFG 176

Query: 189 DQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNE 248
           + GCEGGLM+DAF++I +N G+ TE  YPY+A DG C  K+ +  A   +GY ++ + +E
Sbjct: 177 NNGCEGGLMEDAFKYIKANDGIDTEKSYPYEAVDGECRFKKEDVGATD-TGYVEIKAGSE 235

Query: 249 AALMKAVAN-QPVSVAIDASGSDFQFYSSGVF-TGQCGTE-LDHGVTAVGYGTADDGTKY 305
             L KAVA   P+SVAIDAS S FQ YS GV+   +C +E LDHGV  VGYG    G KY
Sbjct: 236 VDLKKAVATVGPISVAIDASHSSFQLYSEGVYDEPECSSEDLDHGVLVVGYGV-KGGKKY 294

Query: 306 WLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
           WLVKNSW  +WG+ GYI M RD + +   CGIA QASYP
Sbjct: 295 WLVKNSWAESWGDQGYILMSRDNNNQ---CGIASQASYP 330


>gi|225718114|gb|ACO14903.1| Cathepsin L precursor [Caligus clemensi]
          Length = 336

 Score =  270 bits (689), Expect = 1e-69,   Method: Compositional matrix adjust.
 Identities = 152/345 (44%), Positives = 209/345 (60%), Gaps = 23/345 (6%)

Query: 11  VLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYI 70
           +L ++LV+   A  + + +  D  +++  E W   +G+ Y  + E+++R KI+ EN   I
Sbjct: 6   LLLSVLVI---ASTANAVSFFDVVLSDW-ESWKLMHGKTYSSSIEEKLRLKIYMENSLKI 61

Query: 71  ASFNNKARN--KPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRY---EN 125
           +  N++A N   PY + +N + D  + EF A  NGY+       +++T  +   Y   +N
Sbjct: 62  SRHNSEALNGIHPYYMKMNHYGDLLHHEFVAMVNGYQY------ANKTASLGGTYIPNKN 115

Query: 126 ASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDT 185
             +P  +DWR++GAVT VK+QGQCG CW+FSA  A+EG +   T KL SLSEQ LVDC  
Sbjct: 116 IQLPTHVDWREEGAVTPVKNQGQCGSCWSFSATGALEGQDFRKTGKLISLSEQNLVDCSR 175

Query: 186 SGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPS 245
              + GCEGGLMD AF +I  NKG+ TEA YPY+  DG C+    N   + I G+ D+  
Sbjct: 176 KFGNNGCEGGLMDFAFTYIRDNKGIDTEASYPYEGIDGHCHYNPKNKGGSDI-GFVDIKK 234

Query: 246 NNEAALMKAVAN-QPVSVAIDASGSDFQFYSSGVFT-GQCGT-ELDHGVTAVGYGT-ADD 301
            +E  L KAVA   P+SVAIDAS   FQFYS GV+   +C + ELDHGV  VG+GT +  
Sbjct: 235 GSEKDLKKAVAGVGPISVAIDASHMSFQFYSHGVYVESKCSSEELDHGVLVVGFGTDSVS 294

Query: 302 GTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPTA 346
           G  YWLVKNSW   WG+ GYI+M R+   KE +CGIA  ASYP  
Sbjct: 295 GEDYWLVKNSWSEKWGDQGYIKMARN---KENMCGIASSASYPVV 336


>gi|226531284|ref|NP_001147086.1| thiol protease SEN102 precursor [Zea mays]
 gi|195607128|gb|ACG25394.1| thiol protease SEN102 precursor [Zea mays]
          Length = 356

 Score =  270 bits (689), Expect = 1e-69,   Method: Compositional matrix adjust.
 Identities = 162/350 (46%), Positives = 210/350 (60%), Gaps = 22/350 (6%)

Query: 10  LVLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEY 69
           L+ A  L+L   A  ++S       + ER + W A+Y R Y    E + RF I+ ENV +
Sbjct: 12  LMFACSLLL---AGTAFSDDTIAIPLLERFKAWQAEYNRTYATPEEFQQRFMIYSENVRF 68

Query: 70  IASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRL-----------PSVRSSETTD 118
           I + N  +    Y+LG N+F D T EEF+   + Y  +L           P+V +  T  
Sbjct: 69  IKTMNQLSTGSSYELGENQFTDLTEEEFK---DTYLMKLDEQPPAAEAMGPTVGTMSTAG 125

Query: 119 VSFRYENASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQ 178
           +S        P S+DWR KGAVT VKDQ QCG CWAF+ VA++EG++ I T +L SLSEQ
Sbjct: 126 MSNGNNTGEAPNSVDWRTKGAVTRVKDQQQCGSCWAFATVASIEGVHQIKTGRLVSLSEQ 185

Query: 179 ELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKIS 238
           E+VDCD  G D GC GG    A E++  N GL TE+ YPY  S   C   +    AA+I 
Sbjct: 186 EIVDCDRGGNDNGCRGGSPRSAMEWVTRNGGLTTESDYPYVGSQRQCMSGKLGHHAARIR 245

Query: 239 GYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQC-GTELDHGVTAVGYG 297
           GY+ V  NNEA L +AVA +PV+V IDAS + FQFY SGVF+G C  T ++H VT VGYG
Sbjct: 246 GYQAVQRNNEAELERAVAERPVAVFIDASRA-FQFYKSGVFSGPCDTTTVNHVVTVVGYG 304

Query: 298 -TADD--GTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
            T  D  G KYW+VKNSWG  WGENGY+RM R + A+EG+C IA++  YP
Sbjct: 305 STGSDSGGRKYWIVKNSWGQGWGENGYVRMARRVRAREGMCAIAIEPYYP 354


>gi|402770499|gb|AFQ98384.1| cathepsin L, partial [Hyalomma anatolicum anatolicum]
          Length = 312

 Score =  270 bits (689), Expect = 1e-69,   Method: Compositional matrix adjust.
 Identities = 151/310 (48%), Positives = 193/310 (62%), Gaps = 12/310 (3%)

Query: 40  EMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNK--PYKLGINEFADQTNEEF 97
           E +   + + Y+   E+ +R+KIF EN   IA  N K       YKLG+N+F D    EF
Sbjct: 8   EAFKTTHKKSYQSKMEELLRYKIFTENSLLIAKHNAKYAKGLVSYKLGMNQFGDLLPHEF 67

Query: 98  RAPRNGYKRRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQCGCCWAFSA 157
               NGY        S+     +    ++S+P ++DWRKKGAVT VKDQGQCG CWAFSA
Sbjct: 68  AKMFNGYHGERKGRGSTFLPPANVN--DSSLPKTVDWRKKGAVTPVKDQGQCGSCWAFSA 125

Query: 158 VAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYP 217
             ++EG + + + KL SLSEQ L+DC  S  ++GC GGLMD+AF++I +N G+ TE  YP
Sbjct: 126 TGSLEGQHFLKSGKLVSLSEQNLIDCSGSFGNEGCGGGLMDNAFKYIKANDGIDTEESYP 185

Query: 218 YKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVAN-QPVSVAIDASGSDFQFYSS 276
           Y+A DG C  K+ +  A   +G+ D+   +E  L KAVA   P+SVAIDAS S FQ YS 
Sbjct: 186 YEAMDGDCRFKKEDVGATD-TGFVDIQQGSEDDLQKAVATVGPISVAIDASHSSFQLYSE 244

Query: 277 GVF-TGQCGT-ELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGL 334
           GV+    C + ELDHGV AVGYG   +G KYWLVKNSW  TWG+NGYI M RD   K+  
Sbjct: 245 GVYDEPNCSSEELDHGVLAVGYGVK-NGKKYWLVKNSWAETWGDNGYILMSRD---KDNQ 300

Query: 335 CGIAMQASYP 344
           CGIA  ASYP
Sbjct: 301 CGIASSASYP 310


>gi|156938919|gb|ABU97481.1| cathepsin L-like cysteine protease [Tyrophagus putrescentiae]
          Length = 333

 Score =  270 bits (689), Expect = 1e-69,   Method: Compositional matrix adjust.
 Identities = 158/348 (45%), Positives = 215/348 (61%), Gaps = 28/348 (8%)

Query: 9   KLVLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVE 68
           K V A + +  V+AP + S  +++  +     ++  ++GR Y +  E+  R ++F  N+E
Sbjct: 2   KFVFAVLAL--VFAPTA-SELISEGELEAHFNLFKTRFGRSYANFEEEIFRKRVFASNLE 58

Query: 69  YIASFNNK--ARNKPYKLGINEFADQTNEEFRAPRNGYK----RRLPSVRSSETTDVSFR 122
           +I + N +  A NK + + +N F D +N EFRA  NG +    +  P++ S+        
Sbjct: 59  FIFNHNREFFAGNKNFNVAVNNFTDMSNTEFRARFNGLRHSGVQSAPAIHSASAE----- 113

Query: 123 YENASVPASIDWRK-KGAVTGVKDQGQCGCCWAF-SAVAAMEGINHITTRKLTSLSEQEL 180
                +PA++DW K K  VT +K+Q QCG CWAF SAVA+MEG + + T KL SLSEQ L
Sbjct: 114 ----GLPATVDWTKVKNVVTPIKNQEQCGSCWAFFSAVASMEGQHGLKTGKLVSLSEQNL 169

Query: 181 VDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGY 240
           VDC  +  + GCEGGLMD AF+++I+NKG+ TE  YPYKA D S   K+ N   A I  Y
Sbjct: 170 VDCSAAEGNMGCEGGLMDQAFQYVIANKGIDTEMSYPYKAIDESWEFKK-NSVGATIKSY 228

Query: 241 EDVPSNNEAALMKAVANQ-PVSVAIDASGSDFQFYSSGVF-TGQCGTE-LDHGVTAVGYG 297
            DV + +E++L  AVA   P+SV IDAS   FQFYSSGV+    C T  LDHGVTAVGYG
Sbjct: 229 VDVKTGSESSLQSAVATVGPISVGIDASQLSFQFYSSGVYEEPACSTTILDHGVTAVGYG 288

Query: 298 TADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPT 345
            A +GT YW VKNSWGT+WG +GYI M R+   K+  CGIA  AS+P 
Sbjct: 289 -ALNGTPYWKVKNSWGTSWGMSGYIFMSRN---KQNQCGIATAASWPV 332


>gi|32394730|gb|AAM96001.1| cathepsin L precursor [Metapenaeus ensis]
          Length = 306

 Score =  269 bits (688), Expect = 1e-69,   Method: Compositional matrix adjust.
 Identities = 146/306 (47%), Positives = 189/306 (61%), Gaps = 14/306 (4%)

Query: 44  AQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNK--PYKLGINEFADQTNEEFRAPR 101
            QYGR Y    E   R  +F++N ++I   N K  N    + L +N+F D T+EEF A  
Sbjct: 8   VQYGRHYGTAREDLYRQSVFEQNQQFIEDHNAKFENGEVTFTLKMNQFGDMTSEEFAATM 67

Query: 102 NGYKRRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAM 161
           NG+   +P+       +     ++ ++P  +DWR KGAVT VKDQ QCG CWAFS   ++
Sbjct: 68  NGF-LNVPTRHPVAILEA----DDETLPKHVDWRTKGAVTPVKDQKQCGSCWAFSTTGSL 122

Query: 162 EGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKAS 221
           EG + +   KL SLSEQ LVDC     + GC GGLMD AF++I  NKG+ TE  YPY+A 
Sbjct: 123 EGQHFLKDGKLVSLSEQNLVDCSGKFGNMGCCGGLMDQAFKYIKENKGIDTEESYPYEAQ 182

Query: 222 DGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVAN-QPVSVAIDASGSDFQFYSSGV-F 279
           DG C    +N  A   +G+ D+    E +LMKAVAN  P+SVAIDAS   FQFY  GV +
Sbjct: 183 DGKCRFDSSNVGATD-TGFVDIAHGEENSLMKAVANIGPISVAIDASHPSFQFYHQGVYY 241

Query: 280 TGQC-GTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIA 338
             +C  T LDHGV A+GYG  DDG +YWLVKNSW T+WG+ G+I+M R+   K+  CGIA
Sbjct: 242 EKECSSTMLDHGVLAIGYGETDDGKEYWLVKNSWNTSWGDKGFIQMSRN---KKNNCGIA 298

Query: 339 MQASYP 344
            QASYP
Sbjct: 299 SQASYP 304


>gi|55740402|gb|AAV63977.1| cathepsin L precursor [Artemia franciscana]
          Length = 338

 Score =  269 bits (687), Expect = 2e-69,   Method: Compositional matrix adjust.
 Identities = 150/339 (44%), Positives = 202/339 (59%), Gaps = 12/339 (3%)

Query: 12  LAAILVLG-VWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYI 70
           +  I +LG V    S + +L +   +E H ++ A + + Y    E++ R KI+ EN   +
Sbjct: 4   ITLIFLLGAVLVQLSAALSLTNLLADEWH-LFKATHKKEYPSQLEEKFRMKIYLENKHKV 62

Query: 71  ASFN--NKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENASV 128
           A  N   +   K Y++ +N+F D  + EFR+  NGY+ +  +   +E+T       N  V
Sbjct: 63  AKHNILYEKGEKSYQVAMNKFGDLLHHEFRSIMNGYQHKKQNSSRAESTFTFMEPANVEV 122

Query: 129 PASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGE 188
           P S+DWR+KGA+T VKDQGQCG CWAFS+  A+EG     T KL SLSEQ L+DC     
Sbjct: 123 PESVDWREKGAITPVKDQGQCGSCWAFSSTGALEGQTFRKTGKLISLSEQNLIDCSGKYG 182

Query: 189 DQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNE 248
           ++GC GGLMD AF++I  NKG+ TE  YPY+A D  C     N  A    G+ D+PS  E
Sbjct: 183 NEGCNGGLMDQAFQYIKDNKGIDTENTYPYEAEDDVCRYNPRNRGAVD-RGFVDIPSGEE 241

Query: 249 AALMKAVANQ-PVSVAIDASGSDFQFYSSGV-FTGQCGT-ELDHGVTAVGYGTADDGTKY 305
             L  AVA   PVSVAIDAS   FQFYS GV +   C + +LDHGV  VGYG+ D+G  Y
Sbjct: 242 DKLKAAVATVGPVSVAIDASHESFQFYSKGVYYEPSCDSDDLDHGVLVVGYGS-DNGKDY 300

Query: 306 WLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
           WLVKNSW   WG+ GYI++ R+   ++  CG+A  ASYP
Sbjct: 301 WLVKNSWSEHWGDEGYIKIARN---RKNHCGVATAASYP 336


>gi|390368662|ref|XP_780781.2| PREDICTED: cathepsin L-like [Strongylocentrotus purpuratus]
          Length = 333

 Score =  269 bits (687), Expect = 2e-69,   Method: Compositional matrix adjust.
 Identities = 156/338 (46%), Positives = 205/338 (60%), Gaps = 18/338 (5%)

Query: 14  AILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASF 73
           ++L++ V    S S +  D   +E    W  ++G+ Y  + E+  R  I+++N++ +   
Sbjct: 5   SVLLVAVCVVSSLSMSFTD--FDEDWNQWKNEHGKRYLSDEEEASRKLIWEKNLDIVIKH 62

Query: 74  NNK--ARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENA--SVP 129
           N K    +  Y LG+N+FAD  NEEF A   G++    S  +  +T   F   N    +P
Sbjct: 63  NLKYDLGHFTYALGMNQFADLQNEEFVAMMTGFRVNGTSKAAKGST---FLPSNNVDKLP 119

Query: 130 ASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGED 189
            ++DWR KG VT VKDQGQCG CWAFSA  ++EG     T KL SLSEQ LVDC  S  +
Sbjct: 120 KTVDWRTKGYVTPVKDQGQCGSCWAFSATGSLEGQQFKKTGKLVSLSEQNLVDC--SYRN 177

Query: 190 QGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEA 249
            GC GG MD AF++II   G+ TEA Y Y+A DG+C+ K+AN  A  ++GY DV S +E 
Sbjct: 178 YGCHGGFMDRAFQYIIDAGGIDTEATYSYRAVDGNCHFKKANVGAT-VTGYTDVTSGSEK 236

Query: 250 ALMKAVAN-QPVSVAIDASGSDFQFYSSGVFT--GQCGTELDHGVTAVGYGTADDGTKYW 306
           AL KAVA+  P+SVAIDAS   F+FY SGV+   G   T L H V  VGYGT  DGT YW
Sbjct: 237 ALQKAVAHIGPISVAIDASHKFFKFYKSGVYNEPGCSTTRLGHAVLVVGYGTTSDGTDYW 296

Query: 307 LVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
           +VKNSW  TWG NGY+ M R+   K+  CGIA +ASYP
Sbjct: 297 IVKNSWAKTWGMNGYLWMSRN---KDNQCGIASEASYP 331


>gi|402770511|gb|AFQ98390.1| cathepsin L [Rhipicephalus microplus]
 gi|402770513|gb|AFQ98391.1| cathepsin L [Rhipicephalus microplus]
          Length = 332

 Score =  269 bits (687), Expect = 2e-69,   Method: Compositional matrix adjust.
 Identities = 156/339 (46%), Positives = 204/339 (60%), Gaps = 19/339 (5%)

Query: 11  VLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYI 70
           VL AI+ + V A        +   +  + E +   + + Y+ + E+ +RFKIF EN   I
Sbjct: 6   VLCAIVAVTVAAS-------SQEILRTQWEAFKTTHKKTYQSHMEELLRFKIFTENSLII 58

Query: 71  ASFNNKARNK--PYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENASV 128
           A  N K       YKLG+N+F D    EF    NG+     +  SS     +    ++S+
Sbjct: 59  AKHNAKYAKGLVSYKLGMNQFGDLLAHEFARIFNGHHGTRKTGGSSFLPPANVN--DSSL 116

Query: 129 PASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGE 188
           P  +DWRKKGAVT VKDQGQCG CWAFSA  ++EG + +   +L SLSEQ LVDC  S  
Sbjct: 117 PKVVDWRKKGAVTPVKDQGQCGSCWAFSATGSLEGQHFLKNGELVSLSEQNLVDCSQSFG 176

Query: 189 DQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNE 248
           + GCEGGLM+DAF++I +N G+ TE  YPY+A DG C  K+ +  A   +GY ++ + +E
Sbjct: 177 NNGCEGGLMEDAFKYIKANDGIDTEKSYPYEAVDGECRFKKEDVGATD-TGYVEIKAGSE 235

Query: 249 AALMKAVAN-QPVSVAIDASGSDFQFYSSGVF-TGQCGTE-LDHGVTAVGYGTADDGTKY 305
             L KAVA   P+SVAIDAS S FQ YS GV+   +C +E LDHGV  VGYG    G KY
Sbjct: 236 VDLKKAVATVGPISVAIDASHSSFQLYSEGVYDEPECSSEDLDHGVLVVGYGV-KGGKKY 294

Query: 306 WLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
           WLVKNSW  +WG+ GYI M RD + +   CGIA QASYP
Sbjct: 295 WLVKNSWAESWGDQGYILMSRDNNNQ---CGIASQASYP 330


>gi|330805273|ref|XP_003290609.1| hypothetical protein DICPUDRAFT_92519 [Dictyostelium purpureum]
 gi|325079248|gb|EGC32857.1| hypothetical protein DICPUDRAFT_92519 [Dictyostelium purpureum]
          Length = 333

 Score =  268 bits (686), Expect = 2e-69,   Method: Compositional matrix adjust.
 Identities = 144/307 (46%), Positives = 198/307 (64%), Gaps = 12/307 (3%)

Query: 42  WMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPR 101
           WM ++ R Y  + E   R++ FKEN+++I  +N++  +    LG+ +FAD TNEE++   
Sbjct: 36  WMRKHDRAY-SHEEFTDRYQAFKENMDFIHKWNSQESDTV--LGLTKFADLTNEEYKKHY 92

Query: 102 NGYKRRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAM 161
            G K  +   ++        ++   + P SIDWR+KGAV+ VKDQGQCG CW+FS   A+
Sbjct: 93  LGIKVNVK--KNLNAAQKGLKFFKFTGPDSIDWREKGAVSQVKDQGQCGSCWSFSTTGAV 150

Query: 162 EGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKAS 221
           EG + I +  + SLSEQ LVDC     +QGCEGGLM +AFE+II N G+ATE+ YPY A+
Sbjct: 151 EGAHQIKSGNMVSLSEQNLVDCSGQYGNQGCEGGLMVNAFEYIIDNGGIATESSYPYTAA 210

Query: 222 DGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVF-T 280
            G C K   + + A I GY+++P   E +L  A+A QPVSVAIDAS   FQ YSSGV+  
Sbjct: 211 QGRC-KFTKSMNGANIIGYKEIPQGEEDSLTAALAKQPVSVAIDASHMSFQLYSSGVYDE 269

Query: 281 GQCGTE-LDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAM 339
             C +E LDHGV AVGYGT  +G  Y+++KNSWG TWG++GYI M R+    +  CG+A 
Sbjct: 270 PACSSEALDHGVLAVGYGTL-EGKDYYIIKNSWGPTWGQDGYIFMSRN---AQNQCGVAT 325

Query: 340 QASYPTA 346
            ASYP +
Sbjct: 326 MASYPIS 332


>gi|156399477|ref|XP_001638528.1| predicted protein [Nematostella vectensis]
 gi|156225649|gb|EDO46465.1| predicted protein [Nematostella vectensis]
          Length = 325

 Score =  268 bits (686), Expect = 2e-69,   Method: Compositional matrix adjust.
 Identities = 156/339 (46%), Positives = 203/339 (59%), Gaps = 20/339 (5%)

Query: 9   KLVLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVE 68
           K  LA +LV  + A     +  ++ + + +   W   +G+ Y    E+++R  I+ +N+E
Sbjct: 2   KAFLACLLVAVLIA-----QCFSELSQDRQWHAWKDFHGKTYT-GEEEDLRRAIWNDNLE 55

Query: 69  YIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENASV 128
            +   N  A N  YKL +N FAD T  EF+    GY+    S   S    +S    N  +
Sbjct: 56  IVKKHN--AENHSYKLDMNHFADLTVTEFKQRFMGYRAASNSTGGSTFLPLS----NVQL 109

Query: 129 PASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGE 188
           PA +DWR KG VT VK+QGQCG CWAFS+  ++EG +   T KL SLSEQ LVDC     
Sbjct: 110 PAEVDWRDKGFVTAVKNQGQCGSCWAFSSTGSLEGQHFRKTGKLVSLSEQNLVDCSKKYG 169

Query: 189 DQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNE 248
           + GCEGGLMD AF++I +N G+ TE  YPY A DG C+ K  +   A ++GY DV   +E
Sbjct: 170 NNGCEGGLMDYAFKYIKNNDGIDTEQSYPYTARDGQCHFKPGSV-GATVTGYTDVQRGSE 228

Query: 249 AALMKAVAN-QPVSVAIDASGSDFQFYSSGVFT-GQC-GTELDHGVTAVGYGTADDGTKY 305
             L  AVA   P+SVAIDA  S FQ Y +GV++   C  T+LDHGV AVGYG A+DG  Y
Sbjct: 229 GDLQSAVATVGPISVAIDAGHSSFQLYKTGVYSEPDCSSTQLDHGVLAVGYG-AEDGKDY 287

Query: 306 WLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
           WLVKNSWG  WG NGYI+M R+   K+  CGIA QASYP
Sbjct: 288 WLVKNSWGEGWGMNGYIKMSRN---KDNQCGIATQASYP 323


>gi|195381187|ref|XP_002049336.1| GJ20806 [Drosophila virilis]
 gi|194144133|gb|EDW60529.1| GJ20806 [Drosophila virilis]
          Length = 339

 Score =  268 bits (686), Expect = 2e-69,   Method: Compositional matrix adjust.
 Identities = 147/345 (42%), Positives = 203/345 (58%), Gaps = 18/345 (5%)

Query: 9   KLVLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVE 68
           +++ A + ++ V    S++  +      E  + +  ++ + Y D  E+  R KIF EN  
Sbjct: 2   RILFALLALVAVAQAVSYADVIK-----EEWQTFKLEHRKNYVDETEERFRLKIFNENKH 56

Query: 69  YIASFNNK--ARNKPYKLGINEFADQTNEEFRAPRNGYKRRL-PSVRSSETTDVSFRY-- 123
            IA  N +  +    +K+ +N++AD  + EF    NG+   L   +R+S+ + V   +  
Sbjct: 57  KIAKHNQRYASGEVSFKMAVNKYADMLHHEFHTTMNGFNYTLHKQLRASDPSFVGVTFIS 116

Query: 124 -ENASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVD 182
            E+  +P S+DWR KGAVT VKDQG CG CWAFS+  A+EG +      L SLSEQ LVD
Sbjct: 117 PEHVKIPKSVDWRSKGAVTEVKDQGHCGSCWAFSSTGALEGQHFRKAGTLISLSEQNLVD 176

Query: 183 CDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYED 242
           C T   + GC GGLMD+AF +I  N G+ TE  YPY+  D SC+  +A   A    G  D
Sbjct: 177 CSTKYGNNGCNGGLMDNAFRYIKDNGGIDTEKSYPYEGIDDSCHFNKATIGATD-RGSVD 235

Query: 243 VPSNNEAALMKAVAN-QPVSVAIDASGSDFQFYSSGVFT-GQCGTE-LDHGVTAVGYGTA 299
           +P  +E  + +AVA   PVSVAIDAS   FQFYS G++   QC  + LDHGV  VGYGT 
Sbjct: 236 IPQGDEKKMAEAVATIGPVSVAIDASHESFQFYSEGIYNEPQCDPQNLDHGVLVVGYGTD 295

Query: 300 DDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
           + G  YWLVKNSWGTTWG+ G+I+M R+ D +   CGIA  +SYP
Sbjct: 296 ESGQDYWLVKNSWGTTWGDKGFIKMARNADNQ---CGIASASSYP 337


>gi|84660246|emb|CAI43320.1| cathepsin L [Lubomirskia baicalensis]
 gi|85677150|emb|CAI46307.1| cathepsin L [Lubomirskia baicalensis]
          Length = 327

 Score =  268 bits (685), Expect = 3e-69,   Method: Compositional matrix adjust.
 Identities = 152/312 (48%), Positives = 199/312 (63%), Gaps = 12/312 (3%)

Query: 37  ERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEE 96
           E  E W  ++G+VY  + E+  R  I++ N +Y+   N  A    + +G+N+FAD  + E
Sbjct: 20  EEWESWKKEHGKVYNSDREELTRHIIWQANRKYVDEHNAHAEKFGFTVGMNQFADLESSE 79

Query: 97  FRAPRNGYKRRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQCGCCWAFS 156
           F    NGY  + PS++ +++    F  +   +P S+DWR KG VT +K+QGQCG CWAFS
Sbjct: 80  FGRLYNGYNNK-PSMKKAQSK--VFSTKVGDLPTSVDWRTKGFVTAIKNQGQCGSCWAFS 136

Query: 157 AVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKY 216
           AVA +EG +   T  L SLSEQ LVDC T+  +QGC GGLMD+AF+++I N G+ TEA Y
Sbjct: 137 AVAGLEGQHFNATGTLVSLSEQNLVDCSTAEGNQGCNGGLMDNAFQYVIKNGGIDTEASY 196

Query: 217 PYKASDGSCNKKEANPSAAKISGYEDV-PSNNEAALMKAVANQ-PVSVAIDASGSDFQFY 274
           PYKA D  C    AN   +  SG+ D+ P  +EAAL  AVA   P+SVAIDAS + FQ Y
Sbjct: 197 PYKAVDQKCKFNAANV-GSTCSGFSDILPHKSEAALQVAVAVVGPISVAIDASHTSFQLY 255

Query: 275 SSGVFT-GQCG-TELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKE 332
            SGV++   C  T LDHGVTAVGY ++  G  YW+VKNSWGTTWG+ GYI M R+   K 
Sbjct: 256 KSGVYSESACSQTSLDHGVTAVGYDSS-SGVAYWIVKNSWGTTWGQAGYIWMSRN---KN 311

Query: 333 GLCGIAMQASYP 344
             CGIA  ASYP
Sbjct: 312 NQCGIATAASYP 323


>gi|402770515|gb|AFQ98392.1| cathepsin L [Rhipicephalus microplus]
          Length = 332

 Score =  268 bits (685), Expect = 3e-69,   Method: Compositional matrix adjust.
 Identities = 150/310 (48%), Positives = 193/310 (62%), Gaps = 12/310 (3%)

Query: 40  EMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNK--PYKLGINEFADQTNEEF 97
           E +   + + Y+ + E+ +RFKIF EN   IA  N K       YKLG+N+F D    EF
Sbjct: 28  EAFKTTHKKTYQSHMEELLRFKIFTENSLIIAKHNAKYAKGLVSYKLGMNQFGDLLAHEF 87

Query: 98  RAPRNGYKRRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQCGCCWAFSA 157
               NG+     +  SS     +    ++S+P  +DWRKKGAVT VKDQGQCG CWAFSA
Sbjct: 88  ARIFNGHHGTRKTGGSSFLPPANVN--DSSLPKVVDWRKKGAVTPVKDQGQCGSCWAFSA 145

Query: 158 VAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYP 217
             ++EG + +   +L SLSEQ LVDC  S  + GCEGGLM+DAF++I +N G+ TE  YP
Sbjct: 146 TGSLEGQHFLKNGELVSLSEQNLVDCSQSFGNNGCEGGLMEDAFKYIKANDGIDTEKSYP 205

Query: 218 YKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVAN-QPVSVAIDASGSDFQFYSS 276
           Y+A DG C  K+ +  A   +GY ++ + +E  L KAVA   P+SVAIDAS S FQ YS 
Sbjct: 206 YEAVDGECRFKKEDVGATD-TGYVEIKAGSEVDLKKAVATVGPISVAIDASHSSFQLYSE 264

Query: 277 GVF-TGQCGTE-LDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGL 334
           GV+   +C +E LDHGV  VGYG    G KYWLVKNSW  +WG+ GYI M RD + +   
Sbjct: 265 GVYDEPECSSEDLDHGVLVVGYGV-KGGKKYWLVKNSWAESWGDQGYILMSRDNNNQ--- 320

Query: 335 CGIAMQASYP 344
           CGIA QASYP
Sbjct: 321 CGIASQASYP 330


>gi|402770507|gb|AFQ98388.1| cathepsin L [Rhipicephalus microplus]
          Length = 332

 Score =  267 bits (683), Expect = 5e-69,   Method: Compositional matrix adjust.
 Identities = 156/339 (46%), Positives = 203/339 (59%), Gaps = 19/339 (5%)

Query: 11  VLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYI 70
           VL AI+ + V A        +   +  + E +   + + Y+ + E+ +RFKIF EN   I
Sbjct: 6   VLCAIVAVTVAAS-------SQEILRTQWEAFKTTHKKTYQSHMEELLRFKIFTENSLII 58

Query: 71  ASFNNKARNK--PYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENASV 128
           A  N K       YKLG+N+F D    EF    NGY     S  S+     +    ++S+
Sbjct: 59  AKHNAKYAKGLVSYKLGMNQFGDLLAHEFARIFNGYHGSRKSGGSTFLPPANVN--DSSL 116

Query: 129 PASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGE 188
           P ++DWRKKGAVT VKDQGQCG CWAFS   ++EG + +   +L SLSEQ LVDC  S  
Sbjct: 117 PKAVDWRKKGAVTPVKDQGQCGSCWAFSTTGSLEGQHFLKNGELVSLSEQNLVDCSQSFG 176

Query: 189 DQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNE 248
           + GCEGGLM+DAF++I +N G+ TE  YPY+A DG C  K+ +  A   +GY ++ +  E
Sbjct: 177 NNGCEGGLMEDAFKYIKANDGIDTEKSYPYEAVDGECRFKKEDVGATD-TGYVEIKAGCE 235

Query: 249 AALMKAVAN-QPVSVAIDASGSDFQFYSSGVF-TGQCGTE-LDHGVTAVGYGTADDGTKY 305
             L KAVA   P+SVAIDAS S FQ YS GV+   +C +E LDHGV  VGYG    G KY
Sbjct: 236 DDLKKAVATVGPISVAIDASHSSFQLYSEGVYDEPECSSEDLDHGVLVVGYGV-KGGKKY 294

Query: 306 WLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
           WLVKNSW  +WG+ GYI M RD + +   CGIA QASYP
Sbjct: 295 WLVKNSWAESWGDQGYILMSRDNNNQ---CGIASQASYP 330


>gi|269784818|ref|NP_001161481.1| cathepsin L1 precursor [Gallus gallus]
          Length = 353

 Score =  267 bits (683), Expect = 5e-69,   Method: Compositional matrix adjust.
 Identities = 154/344 (44%), Positives = 203/344 (59%), Gaps = 24/344 (6%)

Query: 11  VLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYI 70
           +L+  L L   AP+       D  ++   ++W + + + Y +  E   R  ++++N++ I
Sbjct: 22  ILSLCLGLAFAAPRV------DPDLDSHWQLWKSWHSKDYHEREESWRRV-VWEKNLKMI 74

Query: 71  ASFN--NKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLP--SVRSSETTDVSFRYENA 126
              N  +      YKLG+N+F D T EEFR   NGYK +      R S+  + SF     
Sbjct: 75  ELHNLDHSLGKHSYKLGMNQFGDMTAEEFRQLMNGYKHKKSERKYRGSQFLEPSF----L 130

Query: 127 SVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTS 186
             P S+DWR+KG VT VKDQGQCG CWAFS   A+EG +   T KL SLSEQ LVDC   
Sbjct: 131 EAPRSVDWREKGYVTPVKDQGQCGSCWAFSTTGALEGQHFRKTGKLVSLSEQNLVDCSRP 190

Query: 187 GEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSN 246
             +QGC GGLMD AF+++  N G+ +E  YPY A D    + +A  +AA  +G+ D+P  
Sbjct: 191 EGNQGCNGGLMDQAFQYVQDNGGIDSEESYPYTAKDDEDCRYKAEYNAANDTGFVDIPQG 250

Query: 247 NEAALMKAVAN-QPVSVAIDASGSDFQFYSSGV-FTGQCGTE-LDHGVTAVGYGTAD--- 300
           +E ALMKAVA+  PVSVAIDA  S FQFY SG+ +   C +E LDHGV  VGYG      
Sbjct: 251 HERALMKAVASVGPVSVAIDAGHSSFQFYQSGIYYEPDCSSEDLDHGVLVVGYGFEGEDV 310

Query: 301 DGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
           DG KYW+VKNSWG  WG+ GYI M +D   ++  CGIA  ASYP
Sbjct: 311 DGKKYWIVKNSWGEKWGDKGYIYMAKD---RKNHCGIATAASYP 351


>gi|297729067|ref|NP_001176897.1| Os12g0273900 [Oryza sativa Japonica Group]
 gi|255670225|dbj|BAH95625.1| Os12g0273900 [Oryza sativa Japonica Group]
          Length = 184

 Score =  267 bits (683), Expect = 5e-69,   Method: Compositional matrix adjust.
 Identities = 126/185 (68%), Positives = 142/185 (76%), Gaps = 2/185 (1%)

Query: 161 MEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKA 220
           MEG   ++T KL SLSEQELVDCD  G DQGCEGG +D AF+FI+SN GL  EA YPY A
Sbjct: 1   MEGFVKLSTGKLISLSEQELVDCDVDGNDQGCEGGEIDGAFQFILSNGGLTAEANYPYTA 60

Query: 221 SDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFT 280
            DG C    A   AA I GYEDVP+N+E +LMKAVA QPVSVA+DA  S FQFY  GV  
Sbjct: 61  EDGRCKTTAAADVAASIRGYEDVPANDEPSLMKAVAGQPVSVAVDA--SKFQFYGGGVMA 118

Query: 281 GQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQ 340
           G+CGT LDHGVT +GYG A DGTKYWLVKNSWGTTWGE GY+RM++DID K G+CG+AMQ
Sbjct: 119 GECGTSLDHGVTVIGYGAASDGTKYWLVKNSWGTTWGEAGYLRMEKDIDDKRGMCGLAMQ 178

Query: 341 ASYPT 345
            SYPT
Sbjct: 179 PSYPT 183


>gi|242020003|ref|XP_002430447.1| Cathepsin L precursor, putative [Pediculus humanus corporis]
 gi|212515585|gb|EEB17709.1| Cathepsin L precursor, putative [Pediculus humanus corporis]
          Length = 345

 Score =  267 bits (682), Expect = 6e-69,   Method: Compositional matrix adjust.
 Identities = 153/350 (43%), Positives = 208/350 (59%), Gaps = 25/350 (7%)

Query: 10  LVLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEY 69
           L   A+ VL + A   +   +      E  +++ A++ + Y ++ E++ R KIF +N + 
Sbjct: 4   LFFIALTVLSINAVSFYDLVM------EEWQLFKAEHKKNYNNDVEEKFRMKIFMDNKQK 57

Query: 70  IASFNNKARNKP--YKLGINEFADQTNEEFRAPRNGYKRRL--PSVRSSE-TTDVSFRY- 123
           I   N K +     YKLG+N+++D  + EF    NG+ + +  P +RS+   T +   + 
Sbjct: 58  ITKHNTKYQRGEVGYKLGLNKYSDMLHHEFINTFNGFNKSIIPPHLRSNNGKTHLKGSFF 117

Query: 124 ---ENASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQEL 180
               N  +P  +DW K GAVT VKDQG CG CWAFSA  A+EG++   T+ L SLSEQ L
Sbjct: 118 IPPANVKLPKHVDWVKLGAVTPVKDQGHCGSCWAFSATGALEGLHFRKTKVLVSLSEQNL 177

Query: 181 VDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGY 240
           +DC T   + GC GGLMD AF+++  N G+ TE  YPY+ ++  C + E   S A  +GY
Sbjct: 178 IDCSTEEGNNGCNGGLMDQAFQYVRINGGIDTERSYPYEGNNDVC-RYEPENSGAIDTGY 236

Query: 241 EDVPSNNEAALMKAVANQ-PVSVAIDASGSDFQFYSSGV-FTGQCGTE---LDHGVTAVG 295
            DVP  +E AL  AVA   PVSVAIDAS   FQ YSSGV F   C  E   LDHGV  VG
Sbjct: 237 TDVPLGDEDALKSAVATVGPVSVAIDASQESFQLYSSGVYFEPNCKNEPESLDHGVLVVG 296

Query: 296 YGTADDGTK-YWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
           YGT ++  + YWLVKNSWG +WGENGYI+M R+ D +   CGIA Q S+P
Sbjct: 297 YGTDEETQQDYWLVKNSWGDSWGENGYIKMARNADNQ---CGIATQPSFP 343


>gi|7381610|gb|AAF61565.1|AF227957_1 cathepsin L-like proteinase precursor [Rhipicephalus microplus]
          Length = 332

 Score =  267 bits (682), Expect = 6e-69,   Method: Compositional matrix adjust.
 Identities = 155/339 (45%), Positives = 204/339 (60%), Gaps = 19/339 (5%)

Query: 11  VLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYI 70
           VL AI+ + V A        +   +  + E +   + + Y+ + E+ +RFKIF EN   I
Sbjct: 6   VLCAIVAVTVAAS-------SQEILRTQWEAFKTTHKKSYQSHMEELLRFKIFTENSLII 58

Query: 71  ASFNNKARNK--PYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENASV 128
           A  N K       YKLG+N+F D    EF    NG+     +  S+     +    ++S+
Sbjct: 59  AKHNAKYAKGLVSYKLGMNQFGDLLAHEFARIFNGHHGTRKTGGSTFLPPANVN--DSSL 116

Query: 129 PASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGE 188
           P  +DWRKKGAVT VKDQGQCG CWAFSA  ++EG + +   +L SLSEQ LVDC  S  
Sbjct: 117 PKVVDWRKKGAVTPVKDQGQCGSCWAFSATGSLEGQHFLKNGELVSLSEQNLVDCSQSFG 176

Query: 189 DQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNE 248
           + GCEGGLM+DAF++I +N G+ TE  YPY+A DG C  K+ +  A   +GY ++ + +E
Sbjct: 177 NNGCEGGLMEDAFKYIKANDGIDTEKSYPYEAVDGECRFKKEDVGATD-TGYVEIKAGSE 235

Query: 249 AALMKAVAN-QPVSVAIDASGSDFQFYSSGVF-TGQCGTE-LDHGVTAVGYGTADDGTKY 305
             L KAVA   P+SVAIDAS S FQ YS GV+   +C +E LDHGV  VGYG    G KY
Sbjct: 236 VDLKKAVATVGPISVAIDASHSSFQLYSEGVYDEPECSSEDLDHGVLVVGYGV-KGGKKY 294

Query: 306 WLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
           WLVKNSW  +WG+ GYI M RD + +   CGIA QASYP
Sbjct: 295 WLVKNSWAESWGDQGYILMSRDNNNQ---CGIASQASYP 330


>gi|402770501|gb|AFQ98385.1| cathepsin L [Rhipicephalus microplus]
          Length = 332

 Score =  267 bits (682), Expect = 6e-69,   Method: Compositional matrix adjust.
 Identities = 149/310 (48%), Positives = 194/310 (62%), Gaps = 12/310 (3%)

Query: 40  EMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNK--PYKLGINEFADQTNEEF 97
           E +   + + Y+ + E+ +RFKIF EN   IA  N K       YKLG+N+F D    EF
Sbjct: 28  EAFKTTHKKTYQSHMEELLRFKIFTENSLIIAKHNAKYAKGLVSYKLGMNQFGDLLAHEF 87

Query: 98  RAPRNGYKRRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQCGCCWAFSA 157
               NG+     +  S+     +    ++S+P ++DWRKKGAVT VKDQGQCG CWAFSA
Sbjct: 88  ARIFNGHHGTRKTGGSTFLPPANVN--DSSLPKAVDWRKKGAVTPVKDQGQCGSCWAFSA 145

Query: 158 VAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYP 217
             ++EG + +   +L SLSEQ LVDC  S  + GCEGGLM+DAF++I +N G+ TE  YP
Sbjct: 146 TGSLEGQHFLKNGELVSLSEQNLVDCSQSFGNNGCEGGLMEDAFKYIKANDGIDTEKSYP 205

Query: 218 YKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVAN-QPVSVAIDASGSDFQFYSS 276
           Y+A DG C  K+ +  A   +GY ++ + +E  L KAVA   P+SVAIDAS S FQ YS 
Sbjct: 206 YEAVDGECRFKKEDVGATD-TGYVEIKAGSEDDLKKAVATVGPISVAIDASHSSFQLYSE 264

Query: 277 GVF-TGQCGTE-LDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGL 334
           GV+   +C +E LDHGV  VGYG    G KYWLVKNSW  +WG+ GYI M RD + +   
Sbjct: 265 GVYDEPECSSEDLDHGVLVVGYGV-KGGKKYWLVKNSWAESWGDQGYILMSRDNNNQ--- 320

Query: 335 CGIAMQASYP 344
           CGIA QASYP
Sbjct: 321 CGIASQASYP 330


>gi|410898132|ref|XP_003962552.1| PREDICTED: cathepsin L-like [Takifugu rubripes]
          Length = 335

 Score =  266 bits (681), Expect = 8e-69,   Method: Compositional matrix adjust.
 Identities = 152/309 (49%), Positives = 190/309 (61%), Gaps = 11/309 (3%)

Query: 42  WMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARN--KPYKLGINEFADQTNEEFRA 99
           W  ++GR YR  +E+  R +I+  N + +   N  A    K Y+LG+ +FAD  NEE+++
Sbjct: 30  WKLKFGRSYRTPSEEVQRMQIWLNNRKLVLVHNILADQGIKSYRLGMTQFADMDNEEYKS 89

Query: 100 PRNGYKRRLPSVRSSETTDVSFRY-ENASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAV 158
             +    R  +  +       FR  E   +P ++DWR KG VTGVKDQ QCG CWAFSA 
Sbjct: 90  LISLGCLRAFNTSAPRRGSAFFRLAEGTHLPTTVDWRDKGYVTGVKDQKQCGSCWAFSAT 149

Query: 159 AAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPY 218
            ++EG N   T KL SLSEQ+LVDC     + GC GGLMD AF++I  N G+ TE  YPY
Sbjct: 150 GSLEGQNFRKTGKLVSLSEQQLVDCSGDYGNMGCNGGLMDYAFKYIQENGGIDTEKSYPY 209

Query: 219 KASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVAN-QPVSVAIDASGSDFQFYSSG 277
           +A DG C  K  N   AK +GY DV   +E AL +AVA   PVSV IDAS S FQ Y SG
Sbjct: 210 EAEDGQCRFKPENV-GAKCTGYVDVTVGDEDALKEAVATIGPVSVGIDASHSSFQLYDSG 268

Query: 278 VFTGQ-CGTE-LDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLC 335
           V+  Q C ++ LDHGV AVGYGT D+G  YWLVKNSWG  WG+ GYI M R+   K+  C
Sbjct: 269 VYDEQDCSSQDLDHGVLAVGYGT-DNGQDYWLVKNSWGLGWGQEGYIMMSRN---KDNQC 324

Query: 336 GIAMQASYP 344
           GIA  ASYP
Sbjct: 325 GIATAASYP 333


>gi|391336140|ref|XP_003742440.1| PREDICTED: cathepsin L-like [Metaseiulus occidentalis]
          Length = 330

 Score =  266 bits (681), Expect = 8e-69,   Method: Compositional matrix adjust.
 Identities = 153/305 (50%), Positives = 188/305 (61%), Gaps = 17/305 (5%)

Query: 48  RVYRDNAEKEMRFKIFKENVEYIASFNN-KARNKPYKLGINEFADQTNEEFRAPRNGY-- 104
           + YRD  E+ +R  IF++N+  I  FN   A    + LG+NEFAD TN EF     G   
Sbjct: 37  KSYRDGQEELIRRFIFEDNLHTIEEFNRVNASLAGFTLGVNEFADMTNTEFSNMLLGLGG 96

Query: 105 KRRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGI 164
           + ++      E++ V        +PA +DW +KG VT VK+QGQCG CWAFS   ++EG 
Sbjct: 97  RNKIAGDSVFESSHVQ------DLPAEVDWTQKGYVTEVKNQGQCGSCWAFSTTGSLEGQ 150

Query: 165 NHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGS 224
               T KL SLSEQ LVDC TS  +QGC GGLMD AF +I  N G+ TEA YPY  SDG+
Sbjct: 151 VFKKTGKLVSLSEQNLVDCSTSEGNQGCNGGLMDQAFTYIKKNGGIDTEAAYPYTGSDGT 210

Query: 225 CNKKEANPSAAKISGYEDVPSNNEAALMKAVAN-QPVSVAIDASGSDFQFYSSGVFTGQ- 282
           C   E N   A +SG+ DV S +E AL +AVA   P+SVAIDAS   FQFY  GV+    
Sbjct: 211 CRFLE-NKVGATVSGFVDVKSGDENALKEAVATVGPISVAIDASSIFFQFYRGGVYNPWF 269

Query: 283 -CGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQA 341
              TELDHGV  VGYGT + G  YWLVKNSWG++WG  GYI+M R+   K+  CGIA QA
Sbjct: 270 CSSTELDHGVLVVGYGT-EGGKDYWLVKNSWGSSWGLKGYIKMVRN---KKNRCGIATQA 325

Query: 342 SYPTA 346
           SYPT 
Sbjct: 326 SYPTV 330


>gi|151573016|gb|ABS17683.1| cathepsin L-1 [Artemia persimilis]
          Length = 334

 Score =  266 bits (681), Expect = 8e-69,   Method: Compositional matrix adjust.
 Identities = 149/336 (44%), Positives = 201/336 (59%), Gaps = 12/336 (3%)

Query: 15  ILVLG-VWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASF 73
           I +LG V+   S + +L +   +E H ++ A + + Y    E++ R KI+ EN   +A  
Sbjct: 3   IFLLGAVFVQLSAALSLTNLLADEWH-LFKATHKKEYPSQLEEKFRMKIYLENKHKVAKH 61

Query: 74  N--NKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENASVPAS 131
           N   +   K Y++ +N+F D  + EFR+  NGY+ +  +   +E+T       N  VP S
Sbjct: 62  NILFEKGEKSYQVAMNKFGDLLHHEFRSIMNGYQHKKQNSSRAESTFTFMEPANVEVPES 121

Query: 132 IDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQG 191
           +DWR+KGA+T VKDQGQCG CWAFS+  A+EG     T KL SL EQ L+DC     ++G
Sbjct: 122 VDWREKGAITPVKDQGQCGPCWAFSSTGALEGQTFRKTGKLVSLREQNLIDCSGKYGNEG 181

Query: 192 CEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAAL 251
           C GGLMD AF++I  NKG+ TE  YPY+A D  C     N  A    G+ D+PS  E  L
Sbjct: 182 CNGGLMDQAFQYIKDNKGIDTENTYPYEAEDDVCRYNPRNRGAVD-RGFVDIPSGEEDKL 240

Query: 252 MKAVANQ-PVSVAIDASGSDFQFYSSGV-FTGQCGT-ELDHGVTAVGYGTADDGTKYWLV 308
             AVA   PVSVAIDAS   FQFYS GV +   C + +LDHGV  VGYG+ D+G  YWLV
Sbjct: 241 KAAVATVGPVSVAIDASHESFQFYSKGVYYEPSCDSDDLDHGVLVVGYGS-DNGKDYWLV 299

Query: 309 KNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
           KNSW   WG+ GYI++ R+   ++  CG+A  ASYP
Sbjct: 300 KNSWSEHWGDQGYIKIARN---RKNHCGVATAASYP 332


>gi|310942958|pdb|3P5U|A Chain A, Actinidin From Actinidia Arguta Planch (Sarusashi)
 gi|310942959|pdb|3P5V|A Chain A, Actinidin From Actinidia Arguta Planch (Sarusashi)
 gi|310942961|pdb|3P5X|A Chain A, Actinidin From Actinidia Arguta Planch (Sarusashi)
          Length = 220

 Score =  266 bits (681), Expect = 8e-69,   Method: Compositional matrix adjust.
 Identities = 129/218 (59%), Positives = 155/218 (71%), Gaps = 2/218 (0%)

Query: 128 VPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSG 187
           +P  +DWR  GAV  +KDQGQCG  WAFS +AA+EGIN I T  L SLSEQELVDC  + 
Sbjct: 1   LPDYVDWRSSGAVVDIKDQGQCGSXWAFSTIAAVEGINKIATGDLISLSEQELVDCGRTQ 60

Query: 188 EDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNN 247
             +GC+GG M D F+FII+N G+ TEA YPY A +G CN          I  YE+VP NN
Sbjct: 61  NTRGCDGGFMTDGFQFIINNGGINTEANYPYTAEEGQCNLDLQQEKYVSIDTYENVPYNN 120

Query: 248 EAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWL 307
           E AL  AVA QPVSVA++A+G +FQ YSSG+FTG CGT +DH VT VGYGT + G  YW+
Sbjct: 121 EWALQTAVAYQPVSVALEAAGYNFQHYSSGIFTGPCGTAVDHAVTIVGYGT-EGGIDYWI 179

Query: 308 VKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPT 345
           VKNSWGTTWGE GY+R+QR++    G CGIA +ASYP 
Sbjct: 180 VKNSWGTTWGEEGYMRIQRNVGGV-GQCGIAKKASYPV 216


>gi|330434686|gb|AEC22811.1| cathepsin L [Macrobrachium nipponense]
          Length = 342

 Score =  266 bits (681), Expect = 9e-69,   Method: Compositional matrix adjust.
 Identities = 148/318 (46%), Positives = 191/318 (60%), Gaps = 14/318 (4%)

Query: 37  ERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNN--KARNKPYKLGINEFADQTN 94
           E  E +  ++ + Y  + E+  R KIF EN + IA+ N      +K YKLG+N++ D  +
Sbjct: 27  EEWESFKFEHSKKYESDTEETFRMKIFAENKQKIAAHNKLYHTGSKTYKLGMNKYGDMLH 86

Query: 95  EEFRAPRNGYKRRLPSV-----RSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQC 149
            EF    NG++           R  +        E+  +P S+DWR+KGAVT VKDQG C
Sbjct: 87  HEFVNMMNGFRANTSGAGYKANRGFQGAHFVEPPEDVVMPKSVDWREKGAVTEVKDQGSC 146

Query: 150 GCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKG 209
           G CWAFSA  A+EG ++  T  L SLSEQ LVDC +   + GC GGLMD+AF++I  N G
Sbjct: 147 GSCWAFSATGALEGQHYRQTGDLVSLSEQNLVDCSSKFGNNGCNGGLMDNAFQYIKVNGG 206

Query: 210 LATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVAN-QPVSVAIDASG 268
           + TE  YPY+A D  C    AN + A   G+ DV   NE AL KA+A   PVSVAIDAS 
Sbjct: 207 IDTEKSYPYEAEDEPCRYNPAN-AGADDRGFVDVREGNENALKKAIATIGPVSVAIDASQ 265

Query: 269 SDFQFYSSGVFTG-QCGTE-LDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQR 326
             FQFY  GV++   C  E LDHGV AVGYGT +DG  YWLVKNSW  +WG+ GYI++ R
Sbjct: 266 DSFQFYQHGVYSDPDCSAENLDHGVLAVGYGTTEDGQDYWLVKNSWSKSWGDQGYIKIAR 325

Query: 327 DIDAKEGLCGIAMQASYP 344
           +   +  +CGIA  ASYP
Sbjct: 326 N---QNNMCGIASAASYP 340


>gi|326501772|dbj|BAK02675.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 333

 Score =  266 bits (680), Expect = 1e-68,   Method: Compositional matrix adjust.
 Identities = 159/339 (46%), Positives = 203/339 (59%), Gaps = 14/339 (4%)

Query: 12  LAAILVLGVWAPQSWSRTLN-DATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYI 70
           + AI VL V A  ++S TL  DA +N+  ++W     + Y D AE+ +R   ++ N++ +
Sbjct: 1   MHAISVLAVLAL-AFSCTLAFDAKLNQHWKLWKEANNKRYSD-AEEHVRRATWEGNLQKV 58

Query: 71  ASFNNKAR--NKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENASV 128
              N +A      Y LG+N++AD T  EF    NGY   +   R+ +    SF  + A +
Sbjct: 59  QEHNLQADLGVHTYWLGMNKYADMTVTEFVKVMNGYNATMRGQRTQDRHTFSFNSKIA-L 117

Query: 129 PASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGE 188
           P ++DWR KG VT VKDQGQCG CWAFS   A+EG +   T KL SLSEQ LVDC     
Sbjct: 118 PDTVDWRDKGYVTDVKDQGQCGSCWAFSTTGALEGQHFKQTGKLVSLSEQNLVDCSGKQG 177

Query: 189 DQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNE 248
           + GC GGLMD AFE+I  N G+ TE  YPY+A D  C  K AN  A   +G+ D+ S +E
Sbjct: 178 NMGCNGGLMDQAFEYIKENNGIDTEDSYPYEAVDNQCRFKAANVGATD-TGFTDITSKDE 236

Query: 249 AALMKAVAN-QPVSVAIDASGSDFQFYSSGVFTGQ-CG-TELDHGVTAVGYGTADDGTKY 305
           +AL +AVA   P+SVAIDA  + FQ Y  GV+    C  T LDHGV AVGYGT D G  Y
Sbjct: 237 SALQQAVATVGPISVAIDAGHTSFQLYKHGVYNEPFCSQTRLDHGVLAVGYGT-DSGKDY 295

Query: 306 WLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
           WLVKNSWG  WG+ GYI+M R+   K   CGIA  ASYP
Sbjct: 296 WLVKNSWGEGWGDKGYIKMTRN---KRNQCGIATAASYP 331


>gi|405966498|gb|EKC31776.1| Cathepsin L [Crassostrea gigas]
          Length = 330

 Score =  266 bits (680), Expect = 1e-68,   Method: Compositional matrix adjust.
 Identities = 153/323 (47%), Positives = 197/323 (60%), Gaps = 16/323 (4%)

Query: 30  LNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKE-NVEYIASFNNKARNKPYK--LGI 86
           L  + ++   ++++  +G+ Y   AE+E R ++  E N++YI   N  A    Y   LG+
Sbjct: 18  LPKSELDSEWQLYLKAHGKQY--GAEEEARRRVIWEGNLDYIEKHNLAADRGDYSFWLGM 75

Query: 87  NEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQ 146
           NE+ D TNEEFR+  NGYK R  + R S     S       +P ++DWR KG VT +K+Q
Sbjct: 76  NEYGDMTNEEFRSTMNGYKMRNGTSRGSLYLPPS---NIGDLPDTVDWRPKGYVTPIKNQ 132

Query: 147 GQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIIS 206
           GQCG CW+FSA  ++EG     T KL SLSEQ LVDC     + GC+GGLMDDAF++I  
Sbjct: 133 GQCGSCWSFSATGSLEGQTFKKTGKLPSLSEQNLVDCSQKQGNHGCQGGLMDDAFQYIKD 192

Query: 207 NKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVAN-QPVSVAID 265
           N G+ TE+ YPY+A +G C    AN  A   SG+ D+ S +E+ L  AVA   P+SVAID
Sbjct: 193 NSGIDTESSYPYEAKNGKCRFNAANVGATD-SGFTDIKSKSESDLQSAVATVGPISVAID 251

Query: 266 ASGSDFQFYSSGVFTG-QCG-TELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIR 323
           AS   FQ Y SGV+    C  T LDHGV AVGYGT + G  YWLVKNSWG +WG+ GYI 
Sbjct: 252 ASHMSFQLYRSGVYHEFFCSETRLDHGVLAVGYGT-ESGKDYWLVKNSWGESWGQKGYIM 310

Query: 324 MQRDIDAKEGLCGIAMQASYPTA 346
           M R+   K   CGIA  ASYPT 
Sbjct: 311 MSRN---KRNNCGIATSASYPTV 330


>gi|402770503|gb|AFQ98386.1| cathepsin L [Rhipicephalus microplus]
          Length = 332

 Score =  266 bits (680), Expect = 1e-68,   Method: Compositional matrix adjust.
 Identities = 155/339 (45%), Positives = 203/339 (59%), Gaps = 19/339 (5%)

Query: 11  VLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYI 70
           VL AI+ + V A        +   +  + E +   + + Y+ + E+ +RFKIF EN   I
Sbjct: 6   VLCAIVAVTVAAS-------SQEILRTQWEAFKTTHKKTYQSHMEELLRFKIFTENSLII 58

Query: 71  ASFNNKARNK--PYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENASV 128
           A  N K       YKLG+N+F D    EF    NG+     +  S+     +    ++S+
Sbjct: 59  AKHNAKYAKGLVSYKLGMNQFGDLLAHEFARIFNGHHGTRKTGGSTFLPPANVN--DSSL 116

Query: 129 PASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGE 188
           P  +DWRKKGAVT VKDQGQCG CWAFSA  ++EG + +   +L SLSEQ LVDC  S  
Sbjct: 117 PKVVDWRKKGAVTPVKDQGQCGSCWAFSATGSLEGRHFLKNGELVSLSEQNLVDCSQSFG 176

Query: 189 DQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNE 248
           + GCEGGLM+DAF++I  N G+ TE  YPY+A DG C  K+ +  A   +GY ++ + +E
Sbjct: 177 NNGCEGGLMEDAFKYIKENDGIDTEKSYPYEAVDGECRFKKEDVGATD-TGYVEIKAGSE 235

Query: 249 AALMKAVAN-QPVSVAIDASGSDFQFYSSGVF-TGQCGTE-LDHGVTAVGYGTADDGTKY 305
             L KAVA   P+SVAIDAS S FQ YS GV+   +C +E LDHGV  VGYG    G KY
Sbjct: 236 DDLKKAVATVGPISVAIDASHSSFQLYSEGVYDEPECSSEDLDHGVLVVGYGV-KGGKKY 294

Query: 306 WLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
           WLVKNSW  +WG+ GYI M RD + +   CGIA QASYP
Sbjct: 295 WLVKNSWAESWGDQGYILMSRDNNNQ---CGIASQASYP 330


>gi|413919735|gb|AFW59667.1| hypothetical protein ZEAMMB73_680472 [Zea mays]
          Length = 344

 Score =  266 bits (679), Expect = 1e-68,   Method: Compositional matrix adjust.
 Identities = 132/248 (53%), Positives = 166/248 (66%), Gaps = 5/248 (2%)

Query: 42  WMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARN--KPYKLGINEFADQTNEEFRA 99
           WMA +GR Y    E+E RF++F++N+ Y+ + N  A      ++LG+N FAD TN+E+RA
Sbjct: 49  WMAAHGRTYNAVGEEERRFEVFRDNLRYVDAHNAAADAGVHSFRLGLNRFADLTNDEYRA 108

Query: 100 PRNGYKRRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVA 159
              G + R P  R     D     +N  +P S+DWR KGAV  VKDQG CG CWAFS +A
Sbjct: 109 TYLGVRSR-PQ-RERRLGDRYLAGDNEDLPESVDWRAKGAVAEVKDQGSCGSCWAFSTIA 166

Query: 160 AMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYK 219
           A+EGIN I T  + SLSEQELVDCDTS  +QGC GGLMD AFEFII+N G+ TE  YPYK
Sbjct: 167 AVEGINQIVTGDMISLSEQELVDCDTS-YNQGCNGGLMDYAFEFIINNGGIDTEEDYPYK 225

Query: 220 ASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVF 279
            +DG C+    N     I  YEDVP+N+E +L KAVANQP+SVAI+A G  FQ Y+SG+F
Sbjct: 226 GTDGRCDVNRKNAKVVTIDSYEDVPANSEKSLQKAVANQPISVAIEAGGRAFQLYNSGIF 285

Query: 280 TGQCGTEL 287
           TG CG  +
Sbjct: 286 TGTCGNSV 293


>gi|5081735|gb|AAD39513.1|AF147207_1 cathepsin L-like protease precursor [Artemia franciscana]
          Length = 338

 Score =  266 bits (679), Expect = 1e-68,   Method: Compositional matrix adjust.
 Identities = 150/339 (44%), Positives = 200/339 (58%), Gaps = 12/339 (3%)

Query: 12  LAAILVLG-VWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYI 70
           +  I +LG V    S + +L +   +E H ++ A + + Y    E++ R KI+ EN   +
Sbjct: 4   ITLIFLLGAVLVQLSAALSLTNLLADEWH-LFKATHKKEYPSQLEEKFRMKIYLENKHKV 62

Query: 71  ASFN--NKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENASV 128
           A  N   +   K Y++ +N+F D  + EFR+  NGY+ +  +   +E+T       N  V
Sbjct: 63  AKHNILYEKGEKSYQVAMNKFGDLLHHEFRSIMNGYQHKKQNSSRAESTFTFMEPANVEV 122

Query: 129 PASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGE 188
           P S+DWR KGA+T VKDQGQCG CWAFS+  A+EG     T KL SLSEQ L+DC     
Sbjct: 123 PESVDWRVKGAITPVKDQGQCGSCWAFSSTGALEGQTFRKTGKLISLSEQNLIDCSGKYG 182

Query: 189 DQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNE 248
           ++GC GGLMD AF++I  NKG+ TE  YPY+A D  C     N  A    G+  +PS  E
Sbjct: 183 NEGCNGGLMDQAFQYIKDNKGIDTENTYPYEAEDNVCRYNPRNRGAID-RGFVHIPSGEE 241

Query: 249 AALMKAVANQ-PVSVAIDASGSDFQFYSSGV-FTGQCGT-ELDHGVTAVGYGTADDGTKY 305
             L  AVA   PVSVAIDAS   FQFYS GV +   C + +LDHGV  VGYG+ D+G  Y
Sbjct: 242 DKLKAAVATVGPVSVAIDASHESFQFYSKGVYYEPSCDSDDLDHGVLVVGYGS-DNGKDY 300

Query: 306 WLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
           WLVKNSW   WG+ GYI++ R+   ++  CGIA  ASYP
Sbjct: 301 WLVKNSWSEHWGDEGYIKIARN---RKNHCGIATAASYP 336


>gi|254674508|dbj|BAH86062.1| cysteine protease [Haemaphysalis longicornis]
          Length = 333

 Score =  266 bits (679), Expect = 1e-68,   Method: Compositional matrix adjust.
 Identities = 148/310 (47%), Positives = 189/310 (60%), Gaps = 11/310 (3%)

Query: 40  EMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNK--PYKLGINEFADQTNEEF 97
           E + +Q+ + Y  + E+ +RFKIF EN   +A  N K       YKL +N+F D    EF
Sbjct: 28  EAFKSQHNKAYSSHVEELLRFKIFTENTLLVAKHNAKYAKGLVSYKLAMNKFGDLLPHEF 87

Query: 98  RAPRNGYKRRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQCGCCWAFSA 157
               NGY+ +  +     T        ++S+P ++DWRKKGAVT VK+QGQCG CWAFS 
Sbjct: 88  AKMVNGYRGK-QNKEQRPTFIPPANLNDSSLPTTVDWRKKGAVTPVKNQGQCGSCWAFST 146

Query: 158 VAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYP 217
             ++EG +   T KL SLSEQ LVDC     +QGC GGLMD+ F++I +N G+ TE  +P
Sbjct: 147 TGSLEGQHFRKTGKLVSLSEQNLVDCSDDFGNQGCNGGLMDNGFQYIKANGGIDTEESHP 206

Query: 218 YKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVAN-QPVSVAIDASGSDFQFYSS 276
           Y A DG C  K+A+  A   +G+ D+   +E  L KAVA   PVSVAIDAS   FQ YS 
Sbjct: 207 YTAQDGDCKFKKADVGATD-AGFVDIQQGSEDDLKKAVATVGPVSVAIDASHGSFQLYSQ 265

Query: 277 GVF-TGQC-GTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGL 334
           GV+    C  ++LDHGV  VGYG   +G KYWLVKNSWG  WG+NGYI M RD   K+  
Sbjct: 266 GVYDEPDCSSSQLDHGVLTVGYGVK-NGKKYWLVKNSWGGDWGDNGYILMSRD---KDNQ 321

Query: 335 CGIAMQASYP 344
           CGIA  ASYP
Sbjct: 322 CGIASSASYP 331


>gi|390347681|ref|XP_801784.2| PREDICTED: cathepsin L1-like isoform 2 [Strongylocentrotus
           purpuratus]
          Length = 336

 Score =  266 bits (679), Expect = 2e-68,   Method: Compositional matrix adjust.
 Identities = 149/319 (46%), Positives = 195/319 (61%), Gaps = 21/319 (6%)

Query: 35  MNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFN--NKARNKPYKLGINEFADQ 92
           ++ R   W   + + Y ++  +  R  +++ENV+ I   N  +    K ++LG+NE+ D 
Sbjct: 28  LDSRWLEWKIAHTKSYTNDMHELERRLVWEENVKMINMHNLDHSLHKKGFRLGMNEYGDM 87

Query: 93  TNEEFRAPRNGYKRRLPSVRSSETTDVS----FRYENASVPASIDWRKKGAVTGVKDQGQ 148
              E R+  NGYK       SS  T V         N  VP ++DWR KG VT VK+QGQ
Sbjct: 88  RLHEVRSTMNGYK-------SSNVTKVQGSTFLTPSNIQVPDTVDWRTKGYVTPVKNQGQ 140

Query: 149 CGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNK 208
           CG CWAFS   ++EG     T KL SLSEQ LVDC  +  + GCEGGLMD  F+++I N 
Sbjct: 141 CGSCWAFSTTGSLEGQTFKKTSKLVSLSEQNLVDCSRTEGNMGCEGGLMDQGFQYVIDNH 200

Query: 209 GLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQ-PVSVAIDAS 267
           G+ +E  YPY A D +C+ K A+  +A+++G+ DV S +E ALM+AVA+  PVSVAIDAS
Sbjct: 201 GIDSEDCYPYDAEDETCHYK-ASCDSAEVTGFTDVTSGDEQALMEAVASVGPVSVAIDAS 259

Query: 268 GSDFQFYSSGVF-TGQC-GTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQ 325
              FQ Y SGV+   +C  +ELDHGV  VGYGT D G  YWLVKNSWG TWG +GYI+M 
Sbjct: 260 HQSFQLYESGVYDEPECSSSELDHGVLVVGYGT-DGGKDYWLVKNSWGETWGLSGYIKMS 318

Query: 326 RDIDAKEGLCGIAMQASYP 344
           R+   K   CGIA  ASYP
Sbjct: 319 RN---KSNQCGIATSASYP 334


>gi|413953051|gb|AFW85700.1| hypothetical protein ZEAMMB73_033873 [Zea mays]
          Length = 359

 Score =  265 bits (678), Expect = 2e-68,   Method: Compositional matrix adjust.
 Identities = 149/325 (45%), Positives = 196/325 (60%), Gaps = 20/325 (6%)

Query: 37  ERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEE 96
           ER + W A+Y R Y    E + RF ++ EN+ +I + N  +    Y+LG N+F D T EE
Sbjct: 38  ERFKAWQAEYNRTYATPEEFQQRFMVYSENLRFIKTMNQLSTGSSYELGENQFTDLTEEE 97

Query: 97  FRAPRNGYKRRL-----------PSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKD 145
           F+   + Y  +L           P V +  T  +S        P S+DWR KGAVT VK+
Sbjct: 98  FK---DTYLMKLDEQPPAAEAMPPIVGTMSTAGMSNGDNTGEAPNSVDWRTKGAVTPVKN 154

Query: 146 QGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFII 205
           Q QCG CWAF+ VA++EG++ I T +L SLSEQE+VDCD  G D GC GG    A E++ 
Sbjct: 155 QQQCGSCWAFATVASIEGVHQIKTGRLVSLSEQEIVDCDRGGNDHGCRGGYPRSAMEWVT 214

Query: 206 SNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAID 265
            N GL TE+ YPY  S   C   +    AA+I GY+ V   NEA L +AVA +PV+V ID
Sbjct: 215 RNGGLTTESDYPYVGSQRQCMSGKLGHHAARIRGYQAVQRKNEAELERAVAGRPVAVVID 274

Query: 266 ASGSDFQFYSSGVFTGQCG-TELDHGVTAVGYGTADDGT----KYWLVKNSWGTTWGENG 320
           AS + FQFY  GVF+G C  T ++H VT VGYG+A   +    KYW+VKNSWG  WGENG
Sbjct: 275 ASRA-FQFYKRGVFSGPCNTTTVNHAVTVVGYGSAGSDSGGGRKYWIVKNSWGQRWGENG 333

Query: 321 YIRMQRDIDAKEGLCGIAMQASYPT 345
           Y+RM R + A+EG+C IA++  YP 
Sbjct: 334 YVRMARRVRAREGMCAIAIEPYYPV 358


>gi|334332720|ref|XP_001367595.2| PREDICTED: cathepsin L1-like [Monodelphis domestica]
          Length = 333

 Score =  265 bits (678), Expect = 2e-68,   Method: Compositional matrix adjust.
 Identities = 153/320 (47%), Positives = 197/320 (61%), Gaps = 17/320 (5%)

Query: 32  DATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNK--ARNKPYKLGINEF 89
           D T++ +   W AQ+ R Y  N E   R   +++N++ I   N +  A    ++LG+N+F
Sbjct: 22  DQTLDSQWHQWKAQHRRTYAAN-EDGWRRATWEKNLKMIEMHNLEYSAGKHSFQLGMNKF 80

Query: 90  ADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYEN--ASVPASIDWRKKGAVTGVKDQG 147
            D T EEF+   NGY     S  S + T  S   E   A +P S+DWR+KG VT VK+QG
Sbjct: 81  GDMTTEEFKQVMNGYN----SNGSQKRTKGSLYREPLLAQLPKSVDWREKGYVTPVKNQG 136

Query: 148 QCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISN 207
           QCG CWAFSA  ++EG     T+KL SLSEQ LVDC TS  + GC GGLMD+AFE++ +N
Sbjct: 137 QCGSCWAFSATGSLEGQWFHKTKKLVSLSEQNLVDCSTSEGNNGCSGGLMDNAFEYVKNN 196

Query: 208 KGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVAN-QPVSVAIDA 266
            G+ TE  YPY   D  C K  A  S A ++G+ D+PS NE ALMKAVAN  P+SVAIDA
Sbjct: 197 GGIDTEQAYPYLGQDNEC-KYRAECSGANVTGFVDIPSMNERALMKAVANVGPISVAIDA 255

Query: 267 SGSDFQFYSSGV-FTGQC-GTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRM 324
               FQFY SGV +  QC  ++LDHGV  VGYG+     +YW+VKNSWG  WG+ GY+ M
Sbjct: 256 GNPSFQFYESGVYYEPQCSSSQLDHGVLVVGYGSIGK-DEYWIVKNSWGEEWGKKGYVLM 314

Query: 325 QRDIDAKEGLCGIAMQASYP 344
            +    +   CGIA  ASYP
Sbjct: 315 AK---FRNNHCGIATAASYP 331


>gi|118412468|gb|ABK81670.1| fastuosain precursor [Bromelia fastuosa]
          Length = 220

 Score =  265 bits (678), Expect = 2e-68,   Method: Compositional matrix adjust.
 Identities = 127/220 (57%), Positives = 157/220 (71%), Gaps = 5/220 (2%)

Query: 126 ASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDT 185
           ++VP SIDWR  GAVT VK+QG CG CWAFSA+A +EGI  I    L SLSEQE++DC  
Sbjct: 3   SAVPQSIDWRDYGAVTSVKNQGSCGSCWAFSAIATVEGIYKIKAGNLISLSEQEVLDCAL 62

Query: 186 SGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPS 245
           S    GC+GG ++ A++FIISN G+ + A  PYK   G CN  +  P+ A I+GY  V S
Sbjct: 63  S---YGCDGGWVNKAYDFIISNNGVTSFANLPYKGYKGPCNHNDL-PNKAYITGYTYVQS 118

Query: 246 NNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKY 305
           NNE ++M AVANQP++  IDA G DFQ+Y SGVFTG CGT L+H +T +GYG    GTKY
Sbjct: 119 NNERSMMIAVANQPIAALIDAGG-DFQYYKSGVFTGSCGTSLNHAITVIGYGQTSSGTKY 177

Query: 306 WLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPT 345
           W+VKNSWGT+WGE GYIRM RD+ +  GLCGIAM   +PT
Sbjct: 178 WIVKNSWGTSWGERGYIRMARDVSSPYGLCGIAMAPLFPT 217


>gi|125811033|ref|XP_001361727.1| GA25021 [Drosophila pseudoobscura pseudoobscura]
 gi|54636904|gb|EAL26307.1| GA25021 [Drosophila pseudoobscura pseudoobscura]
          Length = 341

 Score =  265 bits (678), Expect = 2e-68,   Method: Compositional matrix adjust.
 Identities = 147/350 (42%), Positives = 201/350 (57%), Gaps = 22/350 (6%)

Query: 6   LENKLVLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKE 65
           +   L+L  + ++ V    S++  +      E    +  ++ + Y+D  E+  R KIF E
Sbjct: 1   MRTALILPLLALVAVAQAVSYAEVIQ-----EEWHTFKLEHRKNYQDETEERFRLKIFNE 55

Query: 66  NVEYIASFNN--KARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFR- 122
           N   IA  N         +K+ +N++AD  + EF +  NG+   L   +     D SF+ 
Sbjct: 56  NKHKIAKHNQLWATGAVSFKMAVNKYADMLHHEFYSTMNGFNYTLH--KQLRNADESFKG 113

Query: 123 -----YENASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSE 177
                 E+ ++P  +DWR KGAVT VKDQG CG CWAFS+  A+EG ++  +  L SLSE
Sbjct: 114 VTFISPEHVTLPKQVDWRTKGAVTDVKDQGHCGSCWAFSSTGALEGQHYRKSGVLVSLSE 173

Query: 178 QELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKI 237
           Q LVDC T   + GC GGLMD+AF +I  N G+ TE  YPY+A D SC+  +    A   
Sbjct: 174 QNLVDCSTKYGNNGCNGGLMDNAFRYIKDNGGIDTEKSYPYEAIDDSCHFNKGTIGATD- 232

Query: 238 SGYEDVPSNNEAALMKAVAN-QPVSVAIDASGSDFQFYSSGVFT-GQCGTE-LDHGVTAV 294
            G+ D+P  NE  + +AVA   PV+VAIDAS   FQFYS GV+    C  + LDHGV  V
Sbjct: 233 RGFVDIPQGNEKKMAEAVATIGPVAVAIDASHESFQFYSEGVYNEPACDAQNLDHGVLVV 292

Query: 295 GYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
           G+GT + G  YWLVKNSWGTTWG+ G+I+M R+   KE  CGIA  +SYP
Sbjct: 293 GFGTDESGQDYWLVKNSWGTTWGDKGFIKMLRN---KENQCGIASASSYP 339


>gi|82796372|gb|ABB91778.1| cathepsin L [Hymeniacidon perlevis]
          Length = 323

 Score =  265 bits (678), Expect = 2e-68,   Method: Compositional matrix adjust.
 Identities = 150/310 (48%), Positives = 192/310 (61%), Gaps = 12/310 (3%)

Query: 40  EMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRA 99
           E W  ++ + Y D+ E+  R+KI++ N + I   N  +    + LG+N+F D  + EF  
Sbjct: 23  EDWKNEHNKKYSDDLEELTRYKIWQGNQKIIEVHNANSDKFGFTLGMNKFGDLESHEFAE 82

Query: 100 PRNGYKRRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVA 159
             NGY   +   RS+ +T V     N     ++DWR KGAVTGVK+QGQCG CWAFS   
Sbjct: 83  MFNGY---MMQARSN-STKVFVADPNYKADPTVDWRTKGAVTGVKNQGQCGSCWAFSTTG 138

Query: 160 AMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYK 219
           ++EG + + T KL SLSEQ LVDC     ++GC GGLMD AFE+I  N G+ TEA YPY+
Sbjct: 139 SLEGQHFLKTGKLVSLSEQNLVDCSGKEGNEGCNGGLMDQAFEYIKKNGGIDTEASYPYQ 198

Query: 220 ASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVAN-QPVSVAIDASGSDFQFYSSGV 278
           A D  C  K A+   A  +GY D+   +E ALM+AV    PVSVAIDAS S FQ Y SGV
Sbjct: 199 AHDERCRFK-ASDVGATCTGYVDIKREDENALMQAVEKIGPVSVAIDASHSSFQLYRSGV 257

Query: 279 -FTGQCG-TELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCG 336
            +  +C  T LDHGV A+GYGT + G+ YWLVKNSWGT WG  GYI M R+   +   CG
Sbjct: 258 YYERECSQTALDHGVLAIGYGT-EGGSDYWLVKNSWGTDWGMEGYIMMSRN---RNNNCG 313

Query: 337 IAMQASYPTA 346
           IA +ASYPT 
Sbjct: 314 IATEASYPTV 323


>gi|113120269|gb|ABI30274.1| VS-B, partial [Vasconcellea stipulata]
          Length = 341

 Score =  265 bits (678), Expect = 2e-68,   Method: Compositional matrix adjust.
 Identities = 138/290 (47%), Positives = 185/290 (63%), Gaps = 16/290 (5%)

Query: 40  EMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRA 99
           E WM ++ +VY+   EK  RF+ FK+N+ YI   N K  N  Y LG+NEFAD T++EF+ 
Sbjct: 49  ESWMLKHDKVYKTIDEKIYRFETFKDNLMYIDETNKK--NNSYWLGLNEFADLTHDEFKE 106

Query: 100 PRNGYKRRLP--SVRSSETTDVSFRYENA-SVPASIDWRKKGAVTGVKDQGQCGCCWAFS 156
               Y   +P  S+   ++ DV F  ++    P SIDWR+KGAVT VK+Q  CG CWAFS
Sbjct: 107 K---YVGSIPEDSMIIEQSDDVEFPNKHVVDYPESIDWRQKGAVTPVKNQNPCGSCWAFS 163

Query: 157 AVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKY 216
            VA +EGIN I T  L SLSEQEL+DCD      GC+GG    + ++++ N G+ TE +Y
Sbjct: 164 TVATVEGINKIVTGNLISLSEQELLDCDR--RSHGCKGGYQTTSLKYVVDN-GVHTEKEY 220

Query: 217 PYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSS 276
           PY+   G+C  K        I+GY+ VPSN+E +L+K ++ QPVSV +++ G  FQFY  
Sbjct: 221 PYEKKQGNCRAKNKKGLKVYINGYKRVPSNDEISLIKTISIQPVSVLVESKGRPFQFYKG 280

Query: 277 GVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQR 326
           GVF G CGT+LDH VTAVGY     G  Y L+KNSWG  WG+ GYI+++R
Sbjct: 281 GVFGGPCGTKLDHAVTAVGY-----GKDYILIKNSWGPKWGDKGYIKIKR 325


>gi|52546918|gb|AAU81592.1| cysteine proteinase, partial [Petunia x hybrida]
          Length = 196

 Score =  265 bits (677), Expect = 2e-68,   Method: Compositional matrix adjust.
 Identities = 126/174 (72%), Positives = 140/174 (80%), Gaps = 1/174 (0%)

Query: 171 KLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEA 230
           KL SLSEQELVDCD +GE+QGC GGLMD AF+FI    G+ TE  YPY A+DG C+ K+ 
Sbjct: 4   KLVSLSEQELVDCD-NGENQGCNGGLMDLAFDFIKKKGGITTEENYPYMAADGKCDLKKR 62

Query: 231 NPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHG 290
           N     I G+EDVP N+E +L+KAVANQPVSVAI+ASGSDFQFYS GVFTG CGTELDHG
Sbjct: 63  NTPVVSIDGHEDVPPNDEESLLKAVANQPVSVAIEASGSDFQFYSEGVFTGDCGTELDHG 122

Query: 291 VTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
           V  VGYGT  DGTKYW V+NSWG  WGE GYIRMQRDIDA+EGLCGIAMQ SYP
Sbjct: 123 VAIVGYGTTLDGTKYWTVRNSWGPEWGEKGYIRMQRDIDAEEGLCGIAMQPSYP 176


>gi|195153545|ref|XP_002017686.1| GL17172 [Drosophila persimilis]
 gi|194113482|gb|EDW35525.1| GL17172 [Drosophila persimilis]
          Length = 341

 Score =  265 bits (677), Expect = 2e-68,   Method: Compositional matrix adjust.
 Identities = 147/350 (42%), Positives = 202/350 (57%), Gaps = 22/350 (6%)

Query: 6   LENKLVLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKE 65
           +   L+L  + ++ V    S++  +      E    +  ++ + Y+D  E+  R KIF E
Sbjct: 1   MRTALILPLLALVAVAQAVSYAEVIQ-----EEWHTFKLEHRKNYQDETEERFRLKIFNE 55

Query: 66  NVEYIASFNN--KARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFR- 122
           N   IA  N         +K+ +N++AD  + EF +  NG+   L   +     D SF+ 
Sbjct: 56  NKHKIAKHNQLWATGAVSFKMAVNKYADMLHHEFYSTMNGFNYTLH--KQLRNADESFKG 113

Query: 123 -----YENASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSE 177
                 E+ ++P  +DWR KGAVT VKDQG CG CWAFS+  A+EG ++  +  L SLSE
Sbjct: 114 VTFISPEHVTLPKQVDWRTKGAVTDVKDQGHCGSCWAFSSTGALEGQHYRKSGVLVSLSE 173

Query: 178 QELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKI 237
           Q LVDC T   + GC GGLMD+AF +I  N G+ TE  YPY+A D SC+  + +  A   
Sbjct: 174 QNLVDCSTKYGNNGCNGGLMDNAFRYIKDNGGIDTEKSYPYEAIDDSCHFNKGSIGATD- 232

Query: 238 SGYEDVPSNNEAALMKAVAN-QPVSVAIDASGSDFQFYSSGVFT-GQCGTE-LDHGVTAV 294
            G+ D+P  NE  + +AVA   PV+VAIDAS   FQFYS GV+    C  + LDHGV  V
Sbjct: 233 RGFVDIPQGNEKKMAEAVATIGPVAVAIDASHESFQFYSEGVYNEPACDAQNLDHGVLVV 292

Query: 295 GYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
           G+GT + G  YWLVKNSWGTTWG+ G+I+M R+   KE  CGIA  +SYP
Sbjct: 293 GFGTDESGEDYWLVKNSWGTTWGDKGFIKMLRN---KENQCGIASASSYP 339


>gi|224106333|ref|XP_002333699.1| predicted protein [Populus trichocarpa]
 gi|222837985|gb|EEE76350.1| predicted protein [Populus trichocarpa]
          Length = 197

 Score =  265 bits (677), Expect = 2e-68,   Method: Compositional matrix adjust.
 Identities = 124/197 (62%), Positives = 153/197 (77%), Gaps = 2/197 (1%)

Query: 150 GCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKG 209
           GCCWAFSAVAA+EGI  + T  L SLS+Q+LV+ D    ++GC GGLMD AF++II N+G
Sbjct: 3   GCCWAFSAVAAIEGIIKLKTGNLISLSKQQLVNRDVG--NKGCHGGLMDTAFQYIIRNEG 60

Query: 210 LATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGS 269
           L +E  YPY+  DG+C+ ++A   AA+I+G E+ P NNE AL++AVA QPVSV +D  G+
Sbjct: 61  LTSEDNYPYQGVDGTCSSEKAASIAAEITGDENAPKNNENALLQAVAKQPVSVGVDGGGN 120

Query: 270 DFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDID 329
           DFQFY SGVF G CGT+ +H VTA+GYGT  DGT YWLVKNSWGT+WGE+GY RMQR I 
Sbjct: 121 DFQFYKSGVFNGDCGTQQNHAVTAIGYGTDSDGTDYWLVKNSWGTSWGESGYTRMQRGIG 180

Query: 330 AKEGLCGIAMQASYPTA 346
           A EGLCG+AM ASYPTA
Sbjct: 181 ASEGLCGVAMDASYPTA 197


>gi|405958751|gb|EKC24845.1| Cathepsin L [Crassostrea gigas]
          Length = 330

 Score =  265 bits (677), Expect = 2e-68,   Method: Compositional matrix adjust.
 Identities = 152/323 (47%), Positives = 197/323 (60%), Gaps = 16/323 (4%)

Query: 30  LNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKE-NVEYIASFNNKARNKPYK--LGI 86
           L  + ++   ++++  +G+ Y   AE+E R ++  E N++YI   N  A    Y   LG+
Sbjct: 18  LPKSELDSEWQLYLKAHGKQY--GAEEEARRRVIWEGNLDYIEKHNLAADRGDYSFWLGM 75

Query: 87  NEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQ 146
           NE+ D TNEEFR+  NGYK R  + R S     S       +P ++DWR KG VT +K+Q
Sbjct: 76  NEYGDMTNEEFRSTMNGYKMRNGTSRGSLYLPPS---NIGDLPDTVDWRPKGYVTPIKNQ 132

Query: 147 GQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIIS 206
           GQCG CW+FSA  ++EG     T KL SLSEQ LVDC     + GC+GGLMDDAF++I  
Sbjct: 133 GQCGSCWSFSATGSLEGQTFKKTGKLPSLSEQNLVDCSQKQGNHGCQGGLMDDAFQYIKD 192

Query: 207 NKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVAN-QPVSVAID 265
           N G+ TE+ YPY+A +G C    AN  A   SG+ D+ S +E+ L  AVA   P++VAID
Sbjct: 193 NNGIDTESSYPYEAKNGKCRFNAANVGATD-SGFTDIKSKSESDLQSAVATVGPIAVAID 251

Query: 266 ASGSDFQFYSSGVFTG-QCG-TELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIR 323
           AS   FQ Y SGV+    C  T LDHGV AVGYGT + G  YWLVKNSWG +WG+ GYI 
Sbjct: 252 ASHMSFQLYKSGVYHEFFCSETRLDHGVLAVGYGT-ESGKDYWLVKNSWGESWGQKGYIM 310

Query: 324 MQRDIDAKEGLCGIAMQASYPTA 346
           M R+   K   CGIA  ASYPT 
Sbjct: 311 MSRN---KRNNCGIATSASYPTV 330


>gi|320169658|gb|EFW46557.1| cathepsin L2 [Capsaspora owczarzaki ATCC 30864]
          Length = 324

 Score =  265 bits (677), Expect = 2e-68,   Method: Compositional matrix adjust.
 Identities = 144/313 (46%), Positives = 197/313 (62%), Gaps = 13/313 (4%)

Query: 36  NERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNE 95
           N   + W A +G  Y    E+  R  I++ N+++I   N++  +  YKL +N+FAD T  
Sbjct: 19  NPCFDSWKATHGVSYATVGEETARRGIYRANLDFIEKHNSEGHS--YKLAVNKFADLTYP 76

Query: 96  EFRAPRNGYKRRLPSVRSSETTDVS-FRYENASVPASIDWRKKGAVTGVKDQGQCGCCWA 154
           EF A   G   R  +  ++++   S +     S+P S+DWR  G VT +KDQGQCG CW+
Sbjct: 77  EFAAKYLGL--RFDATNATKSFAASTYLPRMVSLPDSVDWRTAGIVTPIKDQGQCGSCWS 134

Query: 155 FSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEA 214
           FS   ++EG +   T +L SLSEQ LVDC ++  + GC GGLMD AF++IISN G+ TE+
Sbjct: 135 FSTTGSVEGQHARKTGQLVSLSEQNLVDCSSAQGNAGCNGGLMDQAFQYIISNNGIDTES 194

Query: 215 KYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVAN-QPVSVAIDASGSDFQF 273
            YPY A DG+C    AN   A ++ Y+D+ S +E+ L  AVA   P+SVAIDAS   FQF
Sbjct: 195 SYPYTAQDGTCQFNSAN-VGATVASYQDIASGSESDLQNAVATVGPISVAIDASQPSFQF 253

Query: 274 YSSGVFT--GQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAK 331
           YSSGV+       ++LDHGV AVGYGT+   + YWLVKNSWGT+WG++GYI M R+ + +
Sbjct: 254 YSSGVYNEPACSSSQLDHGVLAVGYGTSGS-SDYWLVKNSWGTSWGQSGYIWMTRNSNNQ 312

Query: 332 EGLCGIAMQASYP 344
              CGIA  ASYP
Sbjct: 313 ---CGIATAASYP 322


>gi|33242870|gb|AAQ01139.1| cathepsin [Branchiostoma lanceolatum]
          Length = 334

 Score =  265 bits (677), Expect = 3e-68,   Method: Compositional matrix adjust.
 Identities = 153/337 (45%), Positives = 199/337 (59%), Gaps = 14/337 (4%)

Query: 15  ILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFN 74
           +L+LG     + +  L     N+  EMW  Q+G+ Y   AE+  R  IF++N   IA  N
Sbjct: 3   LLILGAVISMATAGVL---PHNKEWEMWKLQHGKQYETEAEEYSRRFIFEKNTVKIAEHN 59

Query: 75  NKAR--NKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENASVPASI 132
            +A      Y L +N+F D  +EEF     G   ++   +    +DV    +N ++P S+
Sbjct: 60  IRASLGMHSYTLAMNKFGDMHHEEFHQRIMGGCLKIVK-KPLLGSDVGDNDDNGTLPKSV 118

Query: 133 DWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGC 192
           DWR    V+ VKDQG+CG CWAFS   ++EG +   T KL  LSEQ+LVDC     +QGC
Sbjct: 119 DWRNSHMVSEVKDQGECGSCWAFSTTGSLEGQHSNKTGKLVDLSEQQLVDCSKDFGNQGC 178

Query: 193 EGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALM 252
            GGLMD AF++I +N GL TE  YPY A+D    K + +   A + GY+DV S NE AL 
Sbjct: 179 GGGLMDQAFQYITANGGLDTEESYPYTATDDEPCKFDNSSVGATLVGYKDVKSGNEHALK 238

Query: 253 KAVAN-QPVSVAIDASGSDFQFYSSGVF-TGQCGTE-LDHGVTAVGYGTADDGTK--YWL 307
           +AVA   PVSVAIDA    FQFYSSGV+   QC TE LDHGV AVGYG  +D +   +W+
Sbjct: 239 RAVATVGPVSVAIDAGHESFQFYSSGVYDEPQCSTEQLDHGVLAVGYGAMNDNSHQAFWI 298

Query: 308 VKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
           VKNSWG +WG+ GYI M R+   K   CGIA  ASYP
Sbjct: 299 VKNSWGPSWGDQGYIMMSRN---KNNQCGIATSASYP 332


>gi|402770509|gb|AFQ98389.1| cathepsin L [Rhipicephalus microplus]
          Length = 332

 Score =  265 bits (676), Expect = 3e-68,   Method: Compositional matrix adjust.
 Identities = 148/310 (47%), Positives = 194/310 (62%), Gaps = 12/310 (3%)

Query: 40  EMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNK--PYKLGINEFADQTNEEF 97
           E +   + + Y+ + E+ +RFKIF E+   IA  N K       YKLG+N+F D    EF
Sbjct: 28  EAFKTTHKKTYQSHMEELLRFKIFTESSLIIARHNAKYAKGLVSYKLGMNQFGDLLAHEF 87

Query: 98  RAPRNGYKRRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQCGCCWAFSA 157
               NG+     +  S+     +    ++S+P ++DWRKKGAVT VKDQGQCG CWAFSA
Sbjct: 88  ARIFNGHHGTRKTGGSTFLPPANVN--DSSLPKAVDWRKKGAVTPVKDQGQCGSCWAFSA 145

Query: 158 VAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYP 217
             ++EG + +   +L SLSEQ LVDC  S  + GCEGGLM+DAF++I +N G+ TE  YP
Sbjct: 146 TGSLEGQHFLKNGELVSLSEQNLVDCSQSFGNNGCEGGLMEDAFKYIKANDGIDTEKSYP 205

Query: 218 YKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVAN-QPVSVAIDASGSDFQFYSS 276
           Y+A DG C  K+ +  A   +GY ++ + +E  L KAVA   P+SVAIDAS S FQ YS 
Sbjct: 206 YEAVDGECRFKKEDVGATD-TGYVEIKAGSEDDLKKAVATVGPISVAIDASHSSFQLYSE 264

Query: 277 GVF-TGQCGTE-LDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGL 334
           GV+   +C +E LDHGV  VGYG    G KYWLVKNSW  +WG+ GYI M RD + +   
Sbjct: 265 GVYDEPECSSEDLDHGVLVVGYGV-KGGKKYWLVKNSWAESWGDQGYILMSRDNNNQ--- 320

Query: 335 CGIAMQASYP 344
           CGIA QASYP
Sbjct: 321 CGIASQASYP 330


>gi|146217394|gb|ABQ10739.1| cathepsin L [Penaeus monodon]
          Length = 341

 Score =  265 bits (676), Expect = 3e-68,   Method: Compositional matrix adjust.
 Identities = 148/319 (46%), Positives = 189/319 (59%), Gaps = 17/319 (5%)

Query: 37  ERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNN--KARNKPYKLGINEFADQTN 94
           E  E +  ++ + Y    E+  R KIF EN   IA+ N      +  YKL +N++ D  +
Sbjct: 27  EEWEAFKLEHSKKYDSEVEESFRMKIFTENKHKIANHNKGFAQGHHTYKLSMNKYGDMLH 86

Query: 95  EEFRAPRNGYKRRLPSVRSSETTDVSFRY----ENASVPASIDWRKKGAVTGVKDQGQCG 150
            EF +  NG++        +        +    ++  +P ++DWR KGAVT +KDQGQCG
Sbjct: 87  HEFVSTMNGFRGNHTGGYKNNRAYTGATFIEPDDDVQLPKNVDWRTKGAVTPIKDQGQCG 146

Query: 151 CCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGL 210
            CWAFSA  A+EG     T +L SLSEQ LVDC     + GC GGLMD+AFE++  N G+
Sbjct: 147 SCWAFSATGALEGQTFRKTGQLVSLSEQNLVDCSRKFGNNGCNGGLMDNAFEYVKENGGI 206

Query: 211 ATEAKYPYKASDGSCNKKEANPSA--AKISGYEDVPSNNEAALMKAVAN-QPVSVAIDAS 267
            TE  YPY A D  C+    NP A  A+  G+ DV   +E AL KAVA   PVSVAIDAS
Sbjct: 207 DTEESYPYDAEDEKCH---YNPRAAGAEDKGFVDVREGSEHALKKAVATVGPVSVAIDAS 263

Query: 268 GSDFQFYSSGVFT-GQCGTE-LDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQ 325
              FQFYS GV+   +C  E LDHGV  VGYG  DDGT YWLVKNSWGTTWG+ GY++M 
Sbjct: 264 HESFQFYSHGVYIEPECSPEMLDHGVLVVGYGIDDDGTDYWLVKNSWGTTWGDQGYVKMA 323

Query: 326 RDIDAKEGLCGIAMQASYP 344
           R+ D +   CGIA  AS+P
Sbjct: 324 RNRDNQ---CGIASSASFP 339


>gi|33242872|gb|AAQ01140.1| cathepsin [Branchiostoma lanceolatum]
          Length = 334

 Score =  264 bits (675), Expect = 4e-68,   Method: Compositional matrix adjust.
 Identities = 153/337 (45%), Positives = 199/337 (59%), Gaps = 14/337 (4%)

Query: 15  ILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFN 74
           +L+LG     + +  L     N+  EMW  Q+G+ Y   AE+  R  IF++N   IA  N
Sbjct: 3   LLILGAVISMATAGVL---PHNKEWEMWKLQHGKQYETEAEEYSRRFIFEKNTVKIAEHN 59

Query: 75  NKAR--NKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENASVPASI 132
            +A      Y L +N+F D  +EEF     G   ++   +    +DV    +N ++P S+
Sbjct: 60  IRASLGMHSYTLAMNKFGDMHHEEFHQRIMGGCLKIVK-KPLLGSDVGDNDDNGTLPKSV 118

Query: 133 DWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGC 192
           DWR    V+ VKDQG+CG CWAFS   ++EG +   T KL  LSEQ+LVDC     +QGC
Sbjct: 119 DWRNSHMVSEVKDQGECGSCWAFSTTGSLEGQHSSKTGKLVDLSEQQLVDCSKDFGNQGC 178

Query: 193 EGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALM 252
            GGLMD AF++I +N GL TE  YPY A+D    K + +   A + GY+DV S NE AL 
Sbjct: 179 GGGLMDQAFQYIKANGGLDTEESYPYTATDDKPCKFDNSSVGATLVGYKDVKSGNEHALK 238

Query: 253 KAVAN-QPVSVAIDASGSDFQFYSSGVF-TGQCGTE-LDHGVTAVGYGTADDGTK--YWL 307
           +AVA   PVSVAIDA    FQFYSSGV+   QC TE LDHGV AVGYG  +D +   +W+
Sbjct: 239 RAVATVGPVSVAIDAGHESFQFYSSGVYDEPQCSTEQLDHGVLAVGYGAMNDNSHQAFWI 298

Query: 308 VKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
           VKNSWG +WG+ GYI M R+   K   CGIA  ASYP
Sbjct: 299 VKNSWGPSWGDQGYIMMSRN---KNNQCGIATSASYP 332


>gi|348687948|gb|EGZ27762.1| papain-like cysteine protease C1 [Phytophthora sojae]
          Length = 533

 Score =  264 bits (675), Expect = 4e-68,   Method: Compositional matrix adjust.
 Identities = 143/316 (45%), Positives = 186/316 (58%), Gaps = 7/316 (2%)

Query: 33  ATMNERHEM--WMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFA 90
           + +   HE   WM+ +G  + D  E   R + +  N  YI   N +      KLG N F+
Sbjct: 20  SPLEYEHEFSAWMSAHGVTFSDALEFARRLENYIANDMYILEHNAENAWTGVKLGHNAFS 79

Query: 91  DQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQCG 150
             + +EF+    G       +     + V   + +  VP+++DW  KG VT VK+QG CG
Sbjct: 80  HMSFDEFKFKMTGLVLPEGYLEQRLASRVDGLWSDVEVPSAVDWVDKGGVTPVKNQGMCG 139

Query: 151 CCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGL 210
            CWAFS   A+EG   +++ KL SLSEQELVDCD +G D GC GGLMD AF++I  + G+
Sbjct: 140 SCWAFSTTGAVEGATFVSSGKLLSLSEQELVDCDHNG-DMGCNGGLMDHAFQWIEDHGGI 198

Query: 211 ATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSD 270
            +E  Y YKA    C K +   S  K++G++DV   +E AL  AVA QPVSVAI+A    
Sbjct: 199 CSEDDYEYKAKAQVCRKCD---SVVKVTGFQDVNPQDEHALKVAVAQQPVSVAIEADQKA 255

Query: 271 FQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDA 330
           FQFY SGVF   CGT LDHGV AVGYG  D+G K+W VKNSWG +WGE GYIR+ R+ + 
Sbjct: 256 FQFYKSGVFNLTCGTRLDHGVLAVGYGN-DNGQKFWKVKNSWGASWGEQGYIRLAREENG 314

Query: 331 KEGLCGIAMQASYPTA 346
             G CGIA   SYP A
Sbjct: 315 PAGQCGIASVPSYPFA 330


>gi|311265493|ref|XP_003130681.1| PREDICTED: cathepsin L1-like [Sus scrofa]
          Length = 332

 Score =  264 bits (675), Expect = 4e-68,   Method: Compositional matrix adjust.
 Identities = 158/347 (45%), Positives = 203/347 (58%), Gaps = 26/347 (7%)

Query: 8   NKLVLAAILVLGVW--APQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKE 65
           N  +L A   LG+   AP+      +D +++     W A + ++Y  N E   R  I+++
Sbjct: 2   NPSLLLAAFCLGIASAAPR------HDHSLDADWYKWKATHRKLYGLNEEGRRR-AIWEK 54

Query: 66  NVEYIASFNNKARN--KPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRY 123
           N++ I   N + R     + + +N F D TNEEFR   NG++ +       +   V    
Sbjct: 55  NMKMIERHNWEHRQGKHSFTMAMNAFGDMTNEEFRKTMNGFQNQ-----KHKKGKVFLDA 109

Query: 124 ENASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDC 183
            +A  P S+DWR+KG VT VK+QG CG CWAFSA  A+EG     T KL SLSEQ LVDC
Sbjct: 110 GSALTPHSVDWREKGYVTAVKNQGHCGSCWAFSATGALEGQMFRKTSKLISLSEQNLVDC 169

Query: 184 DTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDV 243
                ++GC GGLMD+AF++I  N GL +E  YPY   DGSC K +   SAA  +GY D+
Sbjct: 170 SWPEGNEGCNGGLMDNAFQYIKDNGGLDSEESYPYFGKDGSC-KYKPQSSAANDTGYVDI 228

Query: 244 PSNNEAALMKAVAN-QPVSVAIDASGSDFQFYSSGV-FTGQCGTE-LDHGVTAVGYGT-- 298
           P   E ALMKAVA   P+SV IDAS   FQFYS+G+ F  QC +E LDHGV  VGYG   
Sbjct: 229 PK-QEKALMKAVATVGPISVGIDASHESFQFYSTGIYFEPQCSSEDLDHGVLVVGYGVEG 287

Query: 299 ADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPT 345
           A    KYWLVKNSWG TWG +GYI+M +D   +   CGIA  ASYP 
Sbjct: 288 AHSNNKYWLVKNSWGNTWGMDGYIKMTKD---QNNHCGIATMASYPV 331


>gi|2765358|emb|CAA74241.1| cathepsin L [Litopenaeus vannamei]
          Length = 325

 Score =  264 bits (675), Expect = 4e-68,   Method: Compositional matrix adjust.
 Identities = 142/316 (44%), Positives = 197/316 (62%), Gaps = 14/316 (4%)

Query: 34  TMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNK--PYKLGINEFAD 91
           ++ ++ + + A++GR Y    E+  R  +F++N ++I   N +  N    + L +N+F D
Sbjct: 17  SLRQQWQNFKAEHGRRYASVQEERYRLSVFEQNQQFIDDHNARFENGEVTFTLQMNQFGD 76

Query: 92  QTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQCGC 151
            T+EE  A  NG+    P+ R +       + ++ ++P  +DWR KGAVT VKDQ QCG 
Sbjct: 77  MTSEEIVATMNGF-LGAPTRRPAAV----LKADDETLPEKVDWRTKGAVTPVKDQKQCGS 131

Query: 152 CWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLA 211
           CWAFS   ++EG + +   KL SLSEQ LVDC     + GC GGLMD AF +I +NKG+ 
Sbjct: 132 CWAFSTTGSLEGQHFLKDGKLVSLSEQNLVDCSDKFRNMGCMGGLMDQAFRYIKANKGID 191

Query: 212 TEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVAN-QPVSVAIDASGSD 270
           TE  YPY+A DG C + +A+   A  +GY DV   +E+AL KAVA   P+SV IDAS S 
Sbjct: 192 TEDSYPYEAQDGKC-RFDASNVGATDTGYVDVEHGSESALKKAVATIGPISVGIDASQST 250

Query: 271 FQFYSSGVFT-GQC-GTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDI 328
           F FY +GV+    C  T LDHGV AVGYG+ ++G  +WLVKNSW T+WG+ GYI+M R+ 
Sbjct: 251 FHFYHTGVYHDDHCSSTMLDHGVLAVGYGSDENGGDFWLVKNSWNTSWGDKGYIKMSRN- 309

Query: 329 DAKEGLCGIAMQASYP 344
             +   CGIA QASYP
Sbjct: 310 --RNNNCGIASQASYP 323


>gi|260516672|gb|ACX43963.1| cysteine protease 3, partial [Brachiaria hybrid cultivar]
          Length = 319

 Score =  264 bits (675), Expect = 4e-68,   Method: Compositional matrix adjust.
 Identities = 145/295 (49%), Positives = 183/295 (62%), Gaps = 11/295 (3%)

Query: 31  NDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFA 90
           ++  + +    +M QY + Y  +AE   RF  FK +VE I   +N   N  Y +G+NEFA
Sbjct: 34  SEVMLQDMFTAFMKQYSKAY-SHAEFSSRFNQFKASVETI-RLHNTLANASYTMGLNEFA 91

Query: 91  DQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQCG 150
           D + EEF+    G K     V        +   E  + P SIDWR   AVT +KDQGQCG
Sbjct: 92  DLSFEEFKGKYFGCKH----VEREFARSNNLHQEVEAAPTSIDWRTSNAVTPIKDQGQCG 147

Query: 151 CCWAFSAVAAMEGINHITTRK-LTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKG 209
            CWAFSA  ++EG   +  +  LTSLSEQ+LVDC TS  + GC GGLMD AFE+II+NKG
Sbjct: 148 SCWAFSATGSIEGAWVLQGKHTLTSLSEQQLVDCSTSYGNAGCNGGLMDYAFEYIIANKG 207

Query: 210 LATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVAN-QPVSVAIDASG 268
           +  E+ YPYK   G C K  +      ISG++DV S +EA+ + AV    PVSVAI+A  
Sbjct: 208 ICAESAYPYKGVGGLCQK--SCTKVVTISGHKDVASGDEASSLNAVGTVGPVSVAIEADQ 265

Query: 269 SDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIR 323
           + FQFYSSGVF+G CG  LDHGV AVGYGT      YW+VKNSWGT+WGE+GYIR
Sbjct: 266 AGFQFYSSGVFSGTCGHNLDHGVLAVGYGTTGS-QDYWIVKNSWGTSWGESGYIR 319


>gi|66270077|gb|AAY43368.1| cysteine protease [Phytophthora infestans]
          Length = 510

 Score =  264 bits (675), Expect = 4e-68,   Method: Compositional matrix adjust.
 Identities = 146/311 (46%), Positives = 182/311 (58%), Gaps = 7/311 (2%)

Query: 38  RHEM--WMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNE 95
            HE   WM  +   + D  E   R + +  N  YI   N +      KL  NEF+  + E
Sbjct: 26  EHEFSAWMKTHSVSFSDALEFAKRLENYIANDMYIMEHNLENAWTGVKLDHNEFSSMSFE 85

Query: 96  EFRAPRNGYKRRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQCGCCWAF 155
           EF+    GY      +     + V   + +  VP S+DW+ KG VT VK+QG CG CWAF
Sbjct: 86  EFKFKMTGYVMPEGYLEQRLASRVDNLWSDVQVPDSVDWQDKGGVTPVKNQGMCGSCWAF 145

Query: 156 SAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAK 215
           S   A+EG   +++ KL SLSEQELVDCD +G D GC GGLMD AF +I  N G+ +E  
Sbjct: 146 STTGAVEGAAFVSSGKLVSLSEQELVDCDHNG-DMGCNGGLMDHAFAWIEDNGGICSEDD 204

Query: 216 YPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYS 275
           Y YKA    C   E      KISG++DV   +E AL  AVA QPVSVAI+A    FQFY 
Sbjct: 205 YEYKAKAQVCRDCE---KVVKISGFQDVNPQDEHALKVAVAQQPVSVAIEADQKAFQFYK 261

Query: 276 SGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLC 335
           SGVF   CGT LDHGV AVGYG+ ++G K+W VKNSWG++WGE GYIR+ R+ +   G C
Sbjct: 262 SGVFNLTCGTRLDHGVLAVGYGS-ENGQKFWKVKNSWGSSWGEKGYIRLAREENGPAGQC 320

Query: 336 GIAMQASYPTA 346
           GIA   SYP A
Sbjct: 321 GIASVPSYPFA 331


>gi|301116794|ref|XP_002906125.1| cysteine protease family C01A, putative [Phytophthora infestans
           T30-4]
 gi|262107474|gb|EEY65526.1| cysteine protease family C01A, putative [Phytophthora infestans
           T30-4]
          Length = 535

 Score =  264 bits (675), Expect = 5e-68,   Method: Compositional matrix adjust.
 Identities = 146/311 (46%), Positives = 182/311 (58%), Gaps = 7/311 (2%)

Query: 38  RHEM--WMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNE 95
            HE   WM  +   + D  E   R + +  N  YI   N +      KL  NEF+  + E
Sbjct: 26  EHEFSAWMKTHSVSFSDALEFAKRLENYIANDMYIMEHNLENAWTGVKLDHNEFSSMSFE 85

Query: 96  EFRAPRNGYKRRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQCGCCWAF 155
           EF+    GY      +     + V   + +  VP S+DW+ KG VT VK+QG CG CWAF
Sbjct: 86  EFKFKMTGYVMPEGYLEQRLASRVDNLWSDVQVPDSVDWQDKGGVTPVKNQGMCGSCWAF 145

Query: 156 SAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAK 215
           S   A+EG   +++ KL SLSEQELVDCD +G D GC GGLMD AF +I  N G+ +E  
Sbjct: 146 STTGAVEGAAFVSSGKLVSLSEQELVDCDHNG-DMGCNGGLMDHAFAWIEDNGGICSEDD 204

Query: 216 YPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYS 275
           Y YKA    C   E      KISG++DV   +E AL  AVA QPVSVAI+A    FQFY 
Sbjct: 205 YEYKAKAQVCRDCE---KVVKISGFQDVNPQDEHALKVAVAQQPVSVAIEADQKAFQFYK 261

Query: 276 SGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLC 335
           SGVF   CGT LDHGV AVGYG+ ++G K+W VKNSWG++WGE GYIR+ R+ +   G C
Sbjct: 262 SGVFNLTCGTRLDHGVLAVGYGS-ENGQKFWKVKNSWGSSWGEKGYIRLAREENGPAGQC 320

Query: 336 GIAMQASYPTA 346
           GIA   SYP A
Sbjct: 321 GIASVPSYPFA 331


>gi|33242874|gb|AAQ01141.1| cathepsin [Branchiostoma lanceolatum]
          Length = 334

 Score =  264 bits (674), Expect = 5e-68,   Method: Compositional matrix adjust.
 Identities = 153/337 (45%), Positives = 199/337 (59%), Gaps = 14/337 (4%)

Query: 15  ILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFN 74
           +L+LG     + +  L     N+  EMW  Q+G+ Y   AE+  R  IF++N   IA  N
Sbjct: 3   LLILGAVISMATAGVL---PHNKEWEMWKLQHGKQYETEAEEYSRRFIFEKNTVKIAEHN 59

Query: 75  NKAR--NKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENASVPASI 132
            +A      Y L +N+F D  +EEF     G   ++   +    +DV    +N ++P S+
Sbjct: 60  IRASLGMHSYTLAMNKFGDMHHEEFHQRIMGGCLKIVK-KPLLGSDVGDNDDNGTLPKSV 118

Query: 133 DWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGC 192
           DWR    V+ VKDQG+CG CWAFS   ++EG +   T KL  LSEQ+LVDC     +QGC
Sbjct: 119 DWRNSHMVSEVKDQGECGSCWAFSTTGSLEGQHSNKTGKLVDLSEQQLVDCSKDFGNQGC 178

Query: 193 EGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALM 252
            GGLMD AF++I +N GL TE  YPY A+D    K + +   A + GY+DV S NE AL 
Sbjct: 179 GGGLMDQAFQYIKANGGLDTEESYPYTATDDKPCKFDNSSVGATLVGYKDVKSGNEHALK 238

Query: 253 KAVAN-QPVSVAIDASGSDFQFYSSGVF-TGQCGTE-LDHGVTAVGYGTADDGTK--YWL 307
           +AVA   PVSVAIDA    FQFYSSGV+   QC TE LDHGV AVGYG  +D +   +W+
Sbjct: 239 RAVATVGPVSVAIDAGHESFQFYSSGVYDEPQCSTEQLDHGVLAVGYGAMNDNSHQAFWI 298

Query: 308 VKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
           VKNSWG +WG+ GYI M R+   K   CGIA  ASYP
Sbjct: 299 VKNSWGPSWGDQGYIMMSRN---KNNQCGIATSASYP 332


>gi|116563690|gb|ABJ99858.1| cathepsin L [Hippoglossus hippoglossus]
          Length = 336

 Score =  264 bits (674), Expect = 5e-68,   Method: Compositional matrix adjust.
 Identities = 147/341 (43%), Positives = 202/341 (59%), Gaps = 15/341 (4%)

Query: 12  LAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIA 71
           +  +LVL        S  + DA +NE  ++W + + + Y +  E   R  ++++N++ I 
Sbjct: 1   MLPLLVLTACLSSVLSAPVLDAQLNEHWDLWKSWHSKKYHEKEEGWRRM-VWEKNLQKIE 59

Query: 72  SFN--NKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENASVP 129
             N  +      ++LG+N F D T+EEFR   NGYK +    +   T  +       + P
Sbjct: 60  LHNLEHSMGTHSFRLGMNHFGDMTHEEFRQIMNGYKLK---TQRKFTGSLFMEPNFMTAP 116

Query: 130 ASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGED 189
           +++DWR+KG VT VKDQGQCG CWAFS   A+EG     T KL SLSEQ LVDC     +
Sbjct: 117 SAVDWREKGYVTPVKDQGQCGSCWAFSTTGALEGQQFRKTGKLVSLSEQNLVDCSRPEGN 176

Query: 190 QGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEA 249
           +GC GGLMD AF+++  N+GL +E  YPY  +D      +   ++A  +G+ DVPS  E 
Sbjct: 177 EGCGGGLMDQAFQYVTDNQGLDSEDSYPYTGTDDQPCHYDPLYNSANDTGFVDVPSGKEH 236

Query: 250 ALMKAVAN-QPVSVAIDASGSDFQFYSSGV-FTGQCGT-ELDHGVTAVGYGTADD---GT 303
           ALMKAVA+  PVSVAIDA    FQFY SG+ +  +C + ELDHGV AVGYG   +   G 
Sbjct: 237 ALMKAVASVGPVSVAIDAGHESFQFYQSGIYYEKECSSEELDHGVLAVGYGFEGEDKMGK 296

Query: 304 KYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
           K+W+VKNSWG  WG+ GYI M +D   ++  CGIA  ASYP
Sbjct: 297 KFWIVKNSWGEKWGDKGYIYMAKD---RKNHCGIATAASYP 334


>gi|326494040|dbj|BAJ85482.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 355

 Score =  264 bits (674), Expect = 5e-68,   Method: Compositional matrix adjust.
 Identities = 144/319 (45%), Positives = 188/319 (58%), Gaps = 16/319 (5%)

Query: 37  ERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEE 96
            RHE WMA+YGRVY D AEK  R ++F  N  +I + N +A N+ Y LG+N F+D TNEE
Sbjct: 39  HRHERWMAKYGRVYADAAEKLRRQEVFAANARHIDAVN-RAGNRTYTLGLNHFSDLTNEE 97

Query: 97  FRAPRNGYKRR-----LPSVRSSETTDVSFR-YENASVPASIDWRKKGAVTGVKDQGQCG 150
           F     GY+ +     L    SS    V+    +  S P S+DWR +GAVT VK QG CG
Sbjct: 98  FAQTHLGYRHQPGPGGLRPEDSSPAAAVNVTDAQLQSTPDSVDWRARGAVTPVKHQGHCG 157

Query: 151 CCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGL 210
            CWAF+AVAA EG+  I T  L S+SEQ+++DC  +G    C+ G ++ A  +I ++ GL
Sbjct: 158 SCWAFAAVAATEGLVQIATGNLISMSEQQVLDC--TGGTSSCKSGYVNAALTYITASGGL 215

Query: 211 ATEAKYPYKASDGSCNKKEANPSAAKISGYED--VPSNNEAALMKAVANQPVSVAIDASG 268
            TEA Y Y A  G+C    A+P++A   G     + + +E AL   VA QPV+VA++A  
Sbjct: 216 QTEAAYAYSAEQGACRSGGASPNSAAAVGVHRSAMLNGDEGALQVLVAGQPVAVAVEAE- 274

Query: 269 SDFQFYSSGVFTG--QCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQR 326
            DF  Y SGV+ G   CG +L H VT VGYG   DG  YW+VKN WG  WGE GY+R+ R
Sbjct: 275 PDFHHYKSGVYVGSPSCGQKLHHAVTVVGYGADGDGQGYWVVKNQWGAGWGEVGYMRLTR 334

Query: 327 DIDAKEGLCGIAMQASYPT 345
                   CG+A  A YPT
Sbjct: 335 GNGGNN--CGMATHAYYPT 351


>gi|728637|emb|CAA59441.1| cathepsin l [Litopenaeus vannamei]
          Length = 326

 Score =  263 bits (673), Expect = 8e-68,   Method: Compositional matrix adjust.
 Identities = 142/316 (44%), Positives = 197/316 (62%), Gaps = 14/316 (4%)

Query: 34  TMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNK--PYKLGINEFAD 91
           ++ ++ + + A++GR Y    E+  R  +F++N ++I   N +  N    + L +N+F D
Sbjct: 18  SLRQQWQNFKAEHGRRYASVQEERYRLSVFEQNQQFIDDHNARFENGEVTFTLQMNQFGD 77

Query: 92  QTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQCGC 151
            T+EE  A  NG+    P+ R +       + ++ ++P  +DWR KGAVT VKDQ QCG 
Sbjct: 78  MTSEEIVATMNGF-LGAPTRRPAAV----LKADDETLPEKVDWRTKGAVTPVKDQKQCGS 132

Query: 152 CWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLA 211
           CWAFS   ++EG + +   KL SLSEQ LVDC     + GC GGLMD AF +I +NKG+ 
Sbjct: 133 CWAFSTTGSLEGQHFLKDGKLVSLSEQNLVDCSDKFGNMGCMGGLMDQAFRYIKANKGID 192

Query: 212 TEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVAN-QPVSVAIDASGSD 270
           TE  YPY+A DG C + +A+   A  +GY DV   +E+AL KAVA   P+SV IDAS S 
Sbjct: 193 TEDSYPYEAQDGKC-RFDASNVGATDTGYVDVEHGSESALKKAVATIGPISVGIDASQST 251

Query: 271 FQFYSSGVFT-GQC-GTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDI 328
           F FY +GV+    C  T LDHGV AVGYG+ ++G  +WLVKNSW T+WG+ GYI+M R+ 
Sbjct: 252 FHFYHTGVYHDDHCSSTMLDHGVLAVGYGSDENGGDFWLVKNSWNTSWGDKGYIKMSRN- 310

Query: 329 DAKEGLCGIAMQASYP 344
             +   CGIA QASYP
Sbjct: 311 --RNNNCGIASQASYP 324


>gi|388890776|gb|AFK80364.1| cysteine proteinase 3, partial [Acanthamoeba castellanii]
          Length = 329

 Score =  263 bits (672), Expect = 8e-68,   Method: Compositional matrix adjust.
 Identities = 160/348 (45%), Positives = 200/348 (57%), Gaps = 37/348 (10%)

Query: 10  LVLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEY 69
           ++LAAI V    A      T +D       E WM    + Y  N E   R+ +++EN + 
Sbjct: 8   VLLAAICVASTLA------TTHDPLTGVFAE-WMRDNSKSY-SNEEFVFRWNVWRENQQL 59

Query: 70  IASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENAS-- 127
           I   N    NK   L +N+F D TN EF     G              D SF    A+  
Sbjct: 60  IEEHNRS--NKTSFLAMNKFGDLTNAEFNKLFKGLAF-----------DYSFHANKAAAE 106

Query: 128 --VPA-----SIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQEL 180
             VPA       DWR+KGAVT VK+QGQCG CW+FS   + EG N + T +LTSLSEQ L
Sbjct: 107 KAVPAPGLSADFDWRQKGAVTHVKNQGQCGSCWSFSTTGSTEGANFLKTGRLTSLSEQNL 166

Query: 181 VDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGY 240
           +DC  S  + GC GGLMD AFE+II+NKG+ TEA YPY+ +  +C    AN S   ++ Y
Sbjct: 167 IDCSGSYGNNGCNGGLMDYAFEYIINNKGIDTEASYPYQTAQYTCQYNPAN-SGGSLTSY 225

Query: 241 EDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVF--TGQCGTELDHGVTAVGYGT 298
            DV S +E AL+ AVA +P SVAIDAS + FQFYS GV+  +    T+LDHGV AVG+GT
Sbjct: 226 TDVSSGDENALLNAVATEPTSVAIDASHNSFQFYSGGVYYESACSSTQLDHGVLAVGWGT 285

Query: 299 ADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPTA 346
            +DG  YWLVKNSWG  WG  GYI+M R+   +   CGIA  ASYPTA
Sbjct: 286 -EDGQDYWLVKNSWGADWGLAGYIKMARN---RSNNCGIATSASYPTA 329


>gi|326430491|gb|EGD76061.1| cathepsin [Salpingoeca sp. ATCC 50818]
          Length = 381

 Score =  263 bits (672), Expect = 8e-68,   Method: Compositional matrix adjust.
 Identities = 140/274 (51%), Positives = 177/274 (64%), Gaps = 14/274 (5%)

Query: 46  YGRVYRDNAEKEMRFKIFKENVEYIASFNNKARN--KPYKLGINEFADQTNEEFRAPRNG 103
           + + Y    E+  RF IF +N+ +IA  N +A      + +G+N+FAD TNEE+R     
Sbjct: 27  FEKQYESPEEEARRFAIFADNLAFIARHNAEAARGLHTHTVGVNQFADLTNEEYRQL--- 83

Query: 104 YKRRLPS-VRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAME 162
           Y R  P+ +   E  +V     NA    S+DWR+KGAVT +K+QGQCG CW+FS   ++E
Sbjct: 84  YLRPYPTELLGRERQEVWLDGPNA---GSVDWRQKGAVTPIKNQGQCGSCWSFSTTGSVE 140

Query: 163 GINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASD 222
           G + I T  L SLSEQ+LVDC  S  +QGC GGLMD+AF++IISN GL TE  YPY A D
Sbjct: 141 GAHAIATGNLVSLSEQQLVDCSGSFGNQGCNGGLMDNAFKYIISNGGLDTEQDYPYTARD 200

Query: 223 GSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQ 282
           G C+K + +  A  ISGY+DVP NNE  L  AV   PVSVAI+A    FQ YSSGVF+G 
Sbjct: 201 GVCDKSKESKHAVSISGYKDVPQNNEDQLAAAVEKGPVSVAIEADQQSFQMYSSGVFSGP 260

Query: 283 CGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTW 316
           CGT LDHGV  VGY T+D    YW+VKNSWG +W
Sbjct: 261 CGTNLDHGVLVVGY-TSD----YWIVKNSWGASW 289


>gi|413953050|gb|AFW85699.1| hypothetical protein ZEAMMB73_033873 [Zea mays]
          Length = 361

 Score =  263 bits (672), Expect = 9e-68,   Method: Compositional matrix adjust.
 Identities = 148/325 (45%), Positives = 196/325 (60%), Gaps = 20/325 (6%)

Query: 37  ERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEE 96
           ER + W A+Y R Y    E + RF ++ EN+ +I + N  +    Y+LG N+F D T EE
Sbjct: 38  ERFKAWQAEYNRTYATPEEFQQRFMVYSENLRFIKTMNQLSTGSSYELGENQFTDLTEEE 97

Query: 97  FRAPRNGYKRRL-----------PSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKD 145
           F+   + Y  +L           P V +  T  +S        P S+DWR KGAVT VK+
Sbjct: 98  FK---DTYLMKLDEQPPAAEAMPPIVGTMSTAGMSNGDNTGEAPNSVDWRTKGAVTPVKN 154

Query: 146 QGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFII 205
           Q QCG CWAF+ VA++EG++ I T +L SLSEQE+VDCD  G D GC GG    A E++ 
Sbjct: 155 QQQCGSCWAFATVASIEGVHQIKTGRLVSLSEQEIVDCDRGGNDHGCRGGYPRSAMEWVT 214

Query: 206 SNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAID 265
            N GL TE+ YPY  S   C   +    AA+I GY+ V   NEA L +AVA +PV+V ID
Sbjct: 215 RNGGLTTESDYPYVGSQRQCMSGKLGHHAARIRGYQAVQRKNEAELERAVAGRPVAVVID 274

Query: 266 ASGSDFQFYSSGVFTGQCG-TELDHGVTAVGYGTADDGT----KYWLVKNSWGTTWGENG 320
           AS + FQFY  GVF+G C  T ++H VT VGYG+A   +    KYW+VKNSWG  WGENG
Sbjct: 275 ASRA-FQFYKRGVFSGPCNTTTVNHAVTVVGYGSAGSDSGGGRKYWIVKNSWGQRWGENG 333

Query: 321 YIRMQRDIDAKEGLCGIAMQASYPT 345
           Y+RM R + A+EG+C IA++   P+
Sbjct: 334 YVRMARRVRAREGMCAIAIEPLLPS 358


>gi|356514419|ref|XP_003525903.1| PREDICTED: LOW QUALITY PROTEIN: cysteine proteinase RD21a-like
           [Glycine max]
          Length = 343

 Score =  263 bits (672), Expect = 9e-68,   Method: Compositional matrix adjust.
 Identities = 144/308 (46%), Positives = 196/308 (63%), Gaps = 32/308 (10%)

Query: 39  HEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFR 98
           +E  +A++G+VY    E E RF+I KEN++++   N  A N+ YK+G+N FAD++    R
Sbjct: 52  YEEXLAKHGKVYNAIDEMEERFQISKENLKFVEQHN--AGNRTYKVGLNRFADRSRMMTR 109

Query: 99  APRNGYKRRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAV 158
            P + Y  R+                  ++  S+DWRK+GAV  VK Q +C  C  F+ +
Sbjct: 110 -PSSRYAPRVSD----------------NLSESVDWRKEGAVVRVKTQSECESCRTFTVI 152

Query: 159 AAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPY 218
           AA+EGIN I T  LT+LS     DCD +  + GC GGL D A EFII+N G+ TE  YP+
Sbjct: 153 AAVEGINKIVTGNLTALS-----DCDRT-VNAGCSGGLADYALEFIINNGGIDTEEDYPF 206

Query: 219 KASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVA-IDASGSDFQFYSSG 277
           + + G C++ + N     + GYE VP+ +E AL KAVANQPVSVA I+A G +FQ Y SG
Sbjct: 207 QGAVGICDQYKINA----VDGYERVPAYDELALKKAVANQPVSVAYIEAYGKEFQLYESG 262

Query: 278 VFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDI-DAKEGLCG 336
           +FTG+CGT +DHGVTAVGYGT ++G  YW+VKNSWG  WGE GY+RM+R+  +   G CG
Sbjct: 263 IFTGKCGTSIDHGVTAVGYGT-ENGIDYWIVKNSWGENWGEAGYVRMERNTAEDTAGKCG 321

Query: 337 IAMQASYP 344
           IA+   YP
Sbjct: 322 IAILTLYP 329


>gi|449275508|gb|EMC84350.1| Cathepsin L1, partial [Columba livia]
          Length = 319

 Score =  263 bits (672), Expect = 9e-68,   Method: Compositional matrix adjust.
 Identities = 149/323 (46%), Positives = 194/323 (60%), Gaps = 18/323 (5%)

Query: 32  DATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFN--NKARNKPYKLGINEF 89
           D  ++   ++W + + + Y +  E   R  ++++N++ I   N  +      YKLG+N+F
Sbjct: 3   DPELDGHWQLWKSWHNKDYHEREESWRRV-VWEKNLKMIELHNLDHTLGKHSYKLGMNQF 61

Query: 90  ADQTNEEFRAPRNGY--KRRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQG 147
            D T EEFR   NGY  K+     R S+  + SF       P S+DWR+KG VT VKDQG
Sbjct: 62  GDMTTEEFRQLMNGYAHKKSERKYRGSQFLEPSF----LEAPRSVDWREKGYVTPVKDQG 117

Query: 148 QCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISN 207
           QCG CWAFS   A+EG +   T KL SLSEQ LVDC     +QGC GGLMD AF+++  N
Sbjct: 118 QCGSCWAFSTTGALEGQHFRKTGKLVSLSEQNLVDCSRPEGNQGCNGGLMDQAFQYVQDN 177

Query: 208 KGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVAN-QPVSVAIDA 266
            G+ +E  YPY A D    + +A  +AA  +G+ D+P  +E ALMKAVA   PVSVAIDA
Sbjct: 178 GGIDSEESYPYTAKDDEDCRYKAEYNAANDTGFVDIPQGHERALMKAVAAVGPVSVAIDA 237

Query: 267 SGSDFQFYSSGV-FTGQCGTE-LDHGVTAVGYGTAD---DGTKYWLVKNSWGTTWGENGY 321
             S FQFY SG+ +   C +E LDHGV  VGYG      DG KYW+VKNSWG  WG+ GY
Sbjct: 238 GHSSFQFYQSGIYYEPDCSSEDLDHGVLVVGYGFEGEDVDGKKYWIVKNSWGEKWGDKGY 297

Query: 322 IRMQRDIDAKEGLCGIAMQASYP 344
           I M +D   ++  CGIA  ASYP
Sbjct: 298 IYMAKD---RKNHCGIATAASYP 317


>gi|391343119|ref|XP_003745860.1| PREDICTED: cathepsin L-like [Metaseiulus occidentalis]
          Length = 385

 Score =  263 bits (672), Expect = 1e-67,   Method: Compositional matrix adjust.
 Identities = 155/340 (45%), Positives = 201/340 (59%), Gaps = 16/340 (4%)

Query: 11  VLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYI 70
           +LA + V+G+ +  S      +  +N+  E + A++ + Y    E+ MR  IF+EN ++I
Sbjct: 58  LLAVLAVIGLASALS-----PNPNLNQHWENFKAEHNKKYESFPEELMRRLIFEENHQFI 112

Query: 71  ASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENAS-VP 129
              N+K     Y LG+N F D TN+E+R    GY+R  P    S+ + +  R E    VP
Sbjct: 113 EDHNSKKEFDFY-LGMNHFGDLTNKEYRERYLGYRR--PENTPSKASYIFSRAEKIEDVP 169

Query: 130 ASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGED 189
             IDWR +G VT VK+QGQCG CWAFSAV ++EG +  +T KL SLSEQ LVDC T   +
Sbjct: 170 DQIDWRDQGFVTPVKNQGQCGSCWAFSAVGSLEGQHFKSTGKLVSLSEQNLVDCSTPEGN 229

Query: 190 QGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEA 249
            GC GG MD AFE++  N G+ TE  YPY  +DGSC+ K  +   A + G+ DV   +E 
Sbjct: 230 SGCNGGWMDQAFEYVKDNHGIDTEDSYPYVGTDGSCHFKNKS-IGATLKGFMDVKEGDEE 288

Query: 250 ALMKAV-ANQPVSVAIDASGSDFQFYSSGVF-TGQCGT-ELDHGVTAVGYGTADDGTKYW 306
           AL +AV    PVSVAIDAS   FQFY  GV+    C T ELDHGV  VGYG    G  +W
Sbjct: 289 ALRQAVGVAGPVSVAIDASSMLFQFYRGGVYNVPWCSTSELDHGVLVVGYGKQFQGKDFW 348

Query: 307 LVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPTA 346
           +VKNSWG  WG  GYI M R+   K   CGIA +AS PT 
Sbjct: 349 MVKNSWGVGWGIYGYIEMSRN---KGNQCGIASKASIPTV 385


>gi|443708542|gb|ELU03619.1| hypothetical protein CAPTEDRAFT_17807 [Capitella teleta]
          Length = 350

 Score =  263 bits (671), Expect = 1e-67,   Method: Compositional matrix adjust.
 Identities = 151/338 (44%), Positives = 204/338 (60%), Gaps = 12/338 (3%)

Query: 13  AAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIAS 72
            A+ +  +  P +  R  N     +  + +   + R Y +  E + R ++F+ N++ I +
Sbjct: 17  GAMPMTNILRPDTTLRFPNLVPFEKLWQDFKTVHERTYGETEESQ-RKEVFRNNLKKIQA 75

Query: 73  FNN--KARNKPYKLGINEFADQTNEEFRAPRNGYK-RRLPSVRSSETTDVSFRYENASVP 129
            N+  +    PY++GIN+FAD    EF +  NG++      VR     +        SVP
Sbjct: 76  HNHLHEQGKSPYRMGINQFADMEANEFASIMNGFRMNNRTEVRDHLHANYISPAIPVSVP 135

Query: 130 ASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGED 189
           A +DWRK+G VT VK+QGQCG CWAFS   ++EG +   T KL SLSEQ LVDC TS  +
Sbjct: 136 AEVDWRKEGYVTPVKNQGQCGSCWAFSTTGSLEGQHFRKTGKLVSLSEQNLVDCSTSYGN 195

Query: 190 QGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEA 249
           +GC GG++D AF++I  N G  TEA YPY+A DG+C  K      A  +GY D+P  +EA
Sbjct: 196 EGCNGGIVDYAFQYIKDNDGDDTEACYPYEAVDGTCRFKSVC-VGATCTGYTDLPKGDEA 254

Query: 250 ALMKAVA-NQPVSVAIDASGSDFQFYSSGVFTGQ-CGT-ELDHGVTAVGYGTADDGTKYW 306
            + +AVA   PVSVAIDAS S FQ Y SG++  Q C   +LDH V  VGYGT + G  YW
Sbjct: 255 KMKEAVALVGPVSVAIDASHSSFQMYQSGIYVEQECSPKQLDHAVLVVGYGT-EQGQDYW 313

Query: 307 LVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
           LVKNSWGTTWG+ GYI+M R++D +   CGIA QASYP
Sbjct: 314 LVKNSWGTTWGDEGYIKMARNMDNQ---CGIASQASYP 348


>gi|449673497|ref|XP_002169904.2| PREDICTED: cathepsin L-like [Hydra magnipapillata]
          Length = 325

 Score =  263 bits (671), Expect = 1e-67,   Method: Compositional matrix adjust.
 Identities = 142/307 (46%), Positives = 190/307 (61%), Gaps = 15/307 (4%)

Query: 41  MWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAP 100
           +W   + + Y   +E+ +R+ I+K+N+  I  +N+K++N    L +N F D TN EFRA 
Sbjct: 29  VWKMAHNKAYSHESEENVRYAIWKDNMNRITEYNSKSKN--VILRMNHFGDMTNTEFRAK 86

Query: 101 RNGYKRRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAA 160
            NG       +   +         + + P ++DWR +G VT VK+QGQCG CWAFS+  A
Sbjct: 87  MNGLL-----LHKHQNGSTFLVPSHTAAPDAVDWRSEGYVTPVKNQGQCGSCWAFSSTGA 141

Query: 161 MEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKA 220
           +EG +   T +L SLSEQ LVDC T   + GC GGLMD+AF +I +N G+ TE  YPY+ 
Sbjct: 142 LEGQHFKKTGRLVSLSEQNLVDCSTDYGNNGCNGGLMDNAFSYIKANGGIDTETGYPYEG 201

Query: 221 SDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVAN-QPVSVAIDASGSDFQFYSSGVF 279
            DG+C   +++  A   +G+ D+P  +E AL +AVA   PVSVAIDAS   FQFY SGV+
Sbjct: 202 QDGTCRYSKSSIGADD-TGFVDIPEGDEDALKQAVATVGPVSVAIDASHMSFQFYHSGVY 260

Query: 280 -TGQCG-TELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGI 337
              QC  + LDHGV  VGYGT D+G  YWLVKNSWGT WG  GYI M R+    +  CGI
Sbjct: 261 DEPQCSPSALDHGVLVVGYGT-DNGKDYWLVKNSWGTGWGTEGYIYMSRN---NQNQCGI 316

Query: 338 AMQASYP 344
           A +ASYP
Sbjct: 317 ASKASYP 323


>gi|196002275|ref|XP_002111005.1| expressed hypothetical protein [Trichoplax adhaerens]
 gi|190586956|gb|EDV27009.1| expressed hypothetical protein [Trichoplax adhaerens]
          Length = 325

 Score =  263 bits (671), Expect = 1e-67,   Method: Compositional matrix adjust.
 Identities = 142/308 (46%), Positives = 188/308 (61%), Gaps = 13/308 (4%)

Query: 40  EMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRA 99
           E W + +G+ Y +  E + R  +F +N++ IA+ N K+    +K+ INEF+D T +EF  
Sbjct: 26  EAWKSFHGKKYHNQGEDDFRHYVFLQNIKTIAAHNAKST---FKMAINEFSDLTRKEFVK 82

Query: 100 PRNGYKRRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVA 159
             NGY  RL   +S+          N ++P  +DWRK+G VT +K+QG+CG CWAFS   
Sbjct: 83  TYNGY--RLSMKKSTNKPSTFMAPLNTNMPTEVDWRKEGYVTPIKNQGRCGSCWAFSTTG 140

Query: 160 AMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYK 219
           ++EG +   T KL SLSEQ L+DC  +  + GC GG MDDAFE+I  N G+ TEA YPY+
Sbjct: 141 SLEGQHFRKTGKLVSLSEQNLIDCSAAEGNDGCGGGFMDDAFEYIKLNNGIDTEASYPYE 200

Query: 220 ASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQ-PVSVAIDASGSDFQFYSSGV 278
             D  C  K+ N  A   +GY D+   +E  L  AVA   P+SVAIDAS   F  Y +GV
Sbjct: 201 GRDDICRYKKTNKGAID-TGYMDIKQYSEDDLKAAVATVGPISVAIDASHKSFHMYHTGV 259

Query: 279 FT-GQCG-TELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCG 336
           +   +C  T LDHGV  VGYGT ++G  YWLVKNSWGT WG NGYI+M R+   +   CG
Sbjct: 260 YHEPECSQTVLDHGVLVVGYGT-ENGEDYWLVKNSWGTDWGMNGYIKMSRN---RSNNCG 315

Query: 337 IAMQASYP 344
           IA  ASYP
Sbjct: 316 IATNASYP 323


>gi|189525870|ref|XP_001923796.1| PREDICTED: cathepsin L1 [Danio rerio]
          Length = 335

 Score =  262 bits (670), Expect = 1e-67,   Method: Compositional matrix adjust.
 Identities = 150/341 (43%), Positives = 201/341 (58%), Gaps = 19/341 (5%)

Query: 11  VLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYI 70
           +L  + +  V+A  S      D  +++    W +Q+G+ Y ++ E   R  I++EN+  I
Sbjct: 5   LLVTLYISAVFAAPSI-----DIQLDDHWNSWKSQHGKSYHEDVEVGRRM-IWEENLRKI 58

Query: 71  --ASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENASV 128
              +F     N  +K+G+N+F D TNEEFR   NGYK   P+  S     +  ++  A  
Sbjct: 59  EQHNFEYSLGNHTFKMGMNQFGDMTNEEFRQAMNGYKHD-PNRTSQGPLFMEPKFFAA-- 115

Query: 129 PASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGE 188
           P  +DWR++G VT VKDQ QCG CW+FS+  A+EG     T KL S+SEQ LVDC     
Sbjct: 116 PQQVDWRQRGYVTPVKDQKQCGSCWSFSSTGALEGQLFRKTGKLISMSEQNLVDCSRPHG 175

Query: 189 DQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNE 248
           +QGC GGLMD AF+++  NKGL +E  YPY A D    + +   + AKI+G+ D+P  NE
Sbjct: 176 NQGCNGGLMDQAFQYVKENKGLDSEQSYPYLARDDLPCRYDPRFNVAKITGFVDIPKGNE 235

Query: 249 AALMKAVANQ-PVSVAIDASGSDFQFYSSGVFTGQ-CGTELDHGVTAVGYGT--AD-DGT 303
            ALM AVA   PVSVAIDAS    QFY SG++  + C ++LDH V  VGYG   AD  G 
Sbjct: 236 LALMNAVAAVGPVSVAIDASHQSLQFYQSGIYYERACTSQLDHAVLVVGYGYQGADVAGN 295

Query: 304 KYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
           +YW+VKNSW   WG+ GYI M +D   K   CGIA  ASYP
Sbjct: 296 RYWIVKNSWSDKWGDKGYIYMAKD---KNNHCGIATMASYP 333


>gi|226509942|ref|NP_001146834.1| cysteine protease precursor [Zea mays]
 gi|159506725|gb|ABW97700.1| cysteine protease [Zea mays]
 gi|414867308|tpg|DAA45865.1| TPA: cysteine protease [Zea mays]
          Length = 352

 Score =  262 bits (670), Expect = 1e-67,   Method: Compositional matrix adjust.
 Identities = 149/316 (47%), Positives = 195/316 (61%), Gaps = 14/316 (4%)

Query: 35  MNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTN 94
           M +R   W A Y R Y    E++ RF++++ N+E+I +  N+A N  Y LG N+FAD T 
Sbjct: 45  MMDRFLSWQATYNRSYPTAEERQRRFQVYRRNIEHIEA-TNRAGNLTYTLGENQFADLTE 103

Query: 95  EEFRAPRNGYKRRLPSVR---SSETTDVSFRYENASVPASIDWRKKGAVTGVKDQG-QCG 150
           EEF    + Y  +   VR     +  +VS        P S+DWR KGAVT +K+QG  C 
Sbjct: 104 EEFL---DLYTMKGMPVRRDAGKKRANVSSSAAAVDAPTSVDWRSKGAVTPIKNQGPSCS 160

Query: 151 CCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGL 210
            CWAF   A +E I  ITT KL SLSEQEL+DCD    D GC  G   + + ++I N GL
Sbjct: 161 SCWAFVTAATIESITKITTGKLVSLSEQELIDCDP--YDGGCNLGYFVNGYRWVIQNGGL 218

Query: 211 ATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSD 270
            TEA YPY+A   +C++  A   AA IS Y  +P+  E  L +AVA QPV+ AI+  GS 
Sbjct: 219 TTEANYPYQARRYACSRSRAAQHAATISDYVQLPAG-EGQLQQAVAQQPVAAAIEMGGS- 276

Query: 271 FQFYSSGVFTGQCGTELDHGVTAVGYGT-ADDGTKYWLVKNSWGTTWGENGYIRMQRDID 329
            QFYS GVF+GQCGT ++H +T VGYG  +  G KYWLVKNSWG +WGE GY+RM+RD+ 
Sbjct: 277 LQFYSGGVFSGQCGTRMNHAITVVGYGADSSSGLKYWLVKNSWGQSWGERGYLRMRRDV- 335

Query: 330 AKEGLCGIAMQASYPT 345
            + GLCGIA+  +YP 
Sbjct: 336 GRGGLCGIALDLAYPV 351


>gi|389610697|dbj|BAM18960.1| cathepsin L [Papilio polytes]
          Length = 341

 Score =  262 bits (670), Expect = 2e-67,   Method: Compositional matrix adjust.
 Identities = 148/319 (46%), Positives = 190/319 (59%), Gaps = 18/319 (5%)

Query: 40  EMWMA---QYGRVYRDNAEKEMRFKIFKENVEYIASFNNK-ARNK-PYKLGINEFADQTN 94
           E W A   ++ + Y    E + R KI+ EN   IA  N K AR + P+++  N++ D  +
Sbjct: 25  EEWNAFKMEHQKQYDSEVEDKFRMKIYAENKHKIAKHNQKFARGQVPFRVKQNKYGDMLH 84

Query: 95  EEFRAPRNGYKRR------LPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQ 148
            EF    NG+ +       L    + E         N  VP  +DWRK GAVT VKDQG+
Sbjct: 85  HEFVHTMNGFNKTTKNGKGLFGKSAGERGATFIPPANVRVPDHVDWRKHGAVTEVKDQGK 144

Query: 149 CGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNK 208
           CG CW+FSA  A+EG ++  T  L SLSEQ L+DC T+  + GC GGLMD+AF++I  NK
Sbjct: 145 CGSCWSFSATGALEGQHYRQTNILVSLSEQNLIDCSTAYGNNGCNGGLMDNAFKYIKDNK 204

Query: 209 GLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVAN-QPVSVAIDAS 267
           G+ TE  YPY+A D  C     N  A  + G+ D+PS +E  LM AVA   PVSVAIDAS
Sbjct: 205 GIDTEKSYPYEAVDDKCRYNPRNSGADDV-GFIDIPSGDEGKLMAAVATVGPVSVAIDAS 263

Query: 268 GSDFQFYSSGV-FTGQC-GTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQ 325
              FQFYS GV F   C  T LDHGV  VGYGT ++G  YWLVKNSWG +WG+ GYI+M 
Sbjct: 264 QETFQFYSDGVYFDENCSSTSLDHGVLVVGYGTDENGGDYWLVKNSWGRSWGDLGYIKMA 323

Query: 326 RDIDAKEGLCGIAMQASYP 344
           R+   ++  CGIA  AS+P
Sbjct: 324 RN---RDNHCGIATAASFP 339


>gi|340371596|ref|XP_003384331.1| PREDICTED: digestive cysteine proteinase 2-like [Amphimedon
           queenslandica]
          Length = 327

 Score =  262 bits (670), Expect = 2e-67,   Method: Compositional matrix adjust.
 Identities = 149/338 (44%), Positives = 203/338 (60%), Gaps = 23/338 (6%)

Query: 12  LAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIA 71
           +A +L++G+      S  +NDA   E   +W  +YG+ YR   E  MR KI+ +N +Y+ 
Sbjct: 10  VAVLLLIGLV-----SAAVNDA---EEWRLWKGKYGKTYRSIYEDNMRQKIWLQNRDYVN 61

Query: 72  SFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENASVPAS 131
             N  + +  ++L +NEFAD T EEF +  NGY +     R +      +RY   ++P S
Sbjct: 62  EHN--SMDSSFQLEVNEFADLTAEEFSSIYNGYGK--GRNRENHENTTIYRYTGGAIPDS 117

Query: 132 IDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQG 191
           +DWR KG VT VK+Q QCG CWAFS   ++EG +   T KL SLSEQ LVDCD   +D G
Sbjct: 118 VDWRTKGLVTPVKNQKQCGSCWAFSTTGSLEGAHAKKTGKLVSLSEQNLVDCDK--KDHG 175

Query: 192 CEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAAL 251
           C+GGLM  AF++I  NKG+ TE  YPYKA +G C  K+ +   A +  +  + + +  AL
Sbjct: 176 CQGGLMTTAFKYIEENKGIDTEESYPYKAKNGRCEFKK-DDIGATVERHVSILTTDCEAL 234

Query: 252 MKAVAN-QPVSVAIDASGSDFQFYSSGVFTGQ-CGT-ELDHGVTAVGYGTADDGTKYWLV 308
            KAVA   P+SVA+DAS S FQ Y SG++  + C + +LDHGV  VGYG  +DG +YWLV
Sbjct: 235 KKAVAEIGPISVAMDASHSSFQLYKSGIYDPKICSSRKLDHGVLVVGYG-KEDGEEYWLV 293

Query: 309 KNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPTA 346
           KNSWG  WG  GY +    I +K+ LCGI   A YP  
Sbjct: 294 KNSWGKNWGMEGYFK----IASKKNLCGICTSACYPVV 327


>gi|157644745|gb|ABV59078.1| cathepsin L [Lates calcarifer]
          Length = 337

 Score =  262 bits (670), Expect = 2e-67,   Method: Compositional matrix adjust.
 Identities = 150/342 (43%), Positives = 200/342 (58%), Gaps = 20/342 (5%)

Query: 11  VLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYI 70
           VLA  L   + AP        D  +++  ++W + + + Y +  E   R  ++++N++ I
Sbjct: 6   VLAVCLSAALSAPSL------DPQLDDHWDLWKSWHSKKYHEKEEGWRRM-VWEKNLKKI 58

Query: 71  ASFN--NKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENASV 128
              N  +     PY+LG+N F D T+EEFR   NGYK+R    +   +  +   +  A  
Sbjct: 59  ELHNLEHSMGKHPYRLGMNHFGDMTHEEFRQIMNGYKQRKTERKFKGSLFMEPNFLEA-- 116

Query: 129 PASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGE 188
           P ++DWR KG VT VKDQGQCG CWAFS   A+EG     T KL SLSEQ LVDC     
Sbjct: 117 PRALDWRDKGYVTPVKDQGQCGSCWAFSTTGALEGQQFRKTGKLVSLSEQNLVDCSRPEG 176

Query: 189 DQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNE 248
           ++GC GGLMD AF+++  N+GL +E  YPY  +D      + N ++A  +G+ DVPS  E
Sbjct: 177 NEGCNGGLMDQAFQYVKDNQGLDSEDSYPYLGTDDQPCHYDPNYNSANDTGFVDVPSGKE 236

Query: 249 AALMKAVAN-QPVSVAIDASGSDFQFYSSGVFTGQ-CGT-ELDHGVTAVGYGTAD---DG 302
            ALMKAVA   PVSVAIDA    FQFY SG++  + C + ELDHGV  VGYG      DG
Sbjct: 237 RALMKAVAAVGPVSVAIDAGHESFQFYQSGIYYEKDCSSEELDHGVLVVGYGYEGEDVDG 296

Query: 303 TKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
            KYW+VKNSW   WG+ GYI M +D   ++  CGIA  ASYP
Sbjct: 297 KKYWIVKNSWSEKWGDKGYIYMAKD---RKNHCGIATAASYP 335


>gi|332375975|gb|AEE63128.1| unknown [Dendroctonus ponderosae]
          Length = 338

 Score =  262 bits (669), Expect = 2e-67,   Method: Compositional matrix adjust.
 Identities = 144/318 (45%), Positives = 187/318 (58%), Gaps = 12/318 (3%)

Query: 35  MNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNK--PYKLGINEFADQ 92
           + E+   +  Q+ + Y    E+  R KIF +N   +A  N        PYKL +N++ D 
Sbjct: 23  VQEQWNSFKVQHKKQYESETEERFRMKIFMDNSHKVAKHNKLFEQGLYPYKLAMNKYGDL 82

Query: 93  TNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENASV--PASIDWRKKGAVTGVKDQGQCG 150
            + EF    NG+ R    ++  E  D     E A V  P ++DWR++GAVT VKDQG CG
Sbjct: 83  LHHEFVGLLNGFNRTKTYLKRGELQDSITFIEPAHVDIPDTVDWRQEGAVTPVKDQGHCG 142

Query: 151 CCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGL 210
            CW+FSA  A+EG +   T+KL SLSEQ LVDC +   + GC GGLMD+AF +I +N G+
Sbjct: 143 SCWSFSATGALEGQHFRQTKKLVSLSEQNLVDCSSRFGNNGCNGGLMDNAFRYIKNNGGI 202

Query: 211 ATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQ-PVSVAIDASGS 269
            TEA YPY   D        N  A    G+ D+PS +E  L  AVA   P+S+AIDAS  
Sbjct: 203 DTEAAYPYMGEDEKFRYSAKNRGATD-KGFVDIPSGDEDKLKAAVATVGPISIAIDASHE 261

Query: 270 DFQFYSSGVFTGQC--GTELDHGVTAVGYGTADD-GTKYWLVKNSWGTTWGENGYIRMQR 326
            FQ YS+GV++      TELDHGV  VGYGT +  G  YWLVKNSWG TWG +GYI+M R
Sbjct: 262 SFQLYSNGVYSDPTCSSTELDHGVLVVGYGTDEKTGMDYWLVKNSWGDTWGLDGYIKMAR 321

Query: 327 DIDAKEGLCGIAMQASYP 344
           + D +   CG+A QASYP
Sbjct: 322 NQDNQ---CGVATQASYP 336


>gi|33242882|gb|AAQ01145.1| cathepsin [Branchiostoma lanceolatum]
          Length = 334

 Score =  262 bits (669), Expect = 2e-67,   Method: Compositional matrix adjust.
 Identities = 155/339 (45%), Positives = 198/339 (58%), Gaps = 18/339 (5%)

Query: 15  ILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFN 74
           +L+LG     + +  L     N+  EMW  Q+G+ Y   AE+  R  IF++N   IA  N
Sbjct: 3   LLILGAVISMATAGVL---PHNKEWEMWKLQHGKQYETEAEEYSRRFIFEKNTVKIAEHN 59

Query: 75  NKAR--NKPYKLGINEFADQTNEEFRAPRNG--YKRRLPSVRSSETTDVSFRYENASVPA 130
            +A      Y L +N+F D  +EEF     G   K     +  SE  D     +N ++P 
Sbjct: 60  IRASLGMHSYTLAMNKFGDMHHEEFHQRIMGGCLKIVKKPLLGSEVGDSD---DNGTLPK 116

Query: 131 SIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQ 190
           S+DWR    V+ VKDQG+CG CWAFS   ++EG +   T KL  LSEQ+LVDC     +Q
Sbjct: 117 SVDWRNSHMVSEVKDQGECGPCWAFSTTGSLEGQHSNKTGKLVDLSEQQLVDCSKDFGNQ 176

Query: 191 GCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAA 250
           GC GGLMD AF++I +N GL TE  YPY A+D    K + +   A + GY+DV S NE A
Sbjct: 177 GCGGGLMDQAFQYIPANGGLDTEESYPYTATDDKPCKFDNSSVGATLVGYKDVKSGNEHA 236

Query: 251 LMKAVAN-QPVSVAIDASGSDFQFYSSGVF-TGQCGTE-LDHGVTAVGYGTADDGTK--Y 305
           L +AVA   PVSVAIDA    FQFYSSGV+   QC TE LDHGV AVGYG  +D +   +
Sbjct: 237 LKRAVATVGPVSVAIDAGHESFQFYSSGVYDEPQCSTEQLDHGVLAVGYGAMNDNSHQAF 296

Query: 306 WLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
           W+VKNSWG +WG+ GYI M R+   K   CGIA  ASYP
Sbjct: 297 WIVKNSWGPSWGDQGYIMMSRN---KNNQCGIATSASYP 332


>gi|33242878|gb|AAQ01143.1| cathepsin [Branchiostoma lanceolatum]
          Length = 334

 Score =  262 bits (669), Expect = 2e-67,   Method: Compositional matrix adjust.
 Identities = 151/318 (47%), Positives = 190/318 (59%), Gaps = 15/318 (4%)

Query: 36  NERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKAR--NKPYKLGINEFADQT 93
           N+  EMW  Q+G+ Y   AE+  R  IF++N   IA  N +A      Y L +N+F D  
Sbjct: 21  NKEWEMWKLQHGKQYETEAEEYSRRFIFEKNTVKIAEHNIRASLGMHSYTLAMNKFGDMH 80

Query: 94  NEEFRAPRNG--YKRRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQCGC 151
           +EEF     G   K     +  SE  D     +N ++P S+DWR    V+ VKDQG+CG 
Sbjct: 81  HEEFHQRIMGGCLKIVKKPLLGSEVGD---NDDNGTLPKSVDWRNSHMVSEVKDQGECGS 137

Query: 152 CWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLA 211
           CWAFS   ++EG +   T KL  LSEQ+LVDC     +QGC GGLMD AF++I +N GL 
Sbjct: 138 CWAFSTTGSLEGQHSNKTGKLVDLSEQQLVDCSKDFGNQGCGGGLMDQAFQYIKANGGLD 197

Query: 212 TEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVAN-QPVSVAIDASGSD 270
           TE  YPY A+D    K + +   A + GY+DV S NE AL +AVA   PVSVAIDA    
Sbjct: 198 TEESYPYTATDDKPCKFDNSSVGATLVGYKDVKSGNEHALKRAVATVGPVSVAIDAGHES 257

Query: 271 FQFYSSGVF-TGQCGTE-LDHGVTAVGYGTADDGTK--YWLVKNSWGTTWGENGYIRMQR 326
           FQFYSSGV+   QC TE LDHGV AVGYG  +D +   +W+VKNSWG +WG+ GYI M R
Sbjct: 258 FQFYSSGVYDEPQCSTEQLDHGVLAVGYGAMNDNSHQAFWIVKNSWGPSWGDQGYIMMSR 317

Query: 327 DIDAKEGLCGIAMQASYP 344
           +   K   CGIA  ASYP
Sbjct: 318 N---KNNQCGIATSASYP 332


>gi|449683741|ref|XP_002155462.2| PREDICTED: cathepsin L-like [Hydra magnipapillata]
          Length = 324

 Score =  262 bits (669), Expect = 2e-67,   Method: Compositional matrix adjust.
 Identities = 152/337 (45%), Positives = 200/337 (59%), Gaps = 20/337 (5%)

Query: 11  VLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYI 70
           V  A+L+LGV    + +  +   T ++    W   + + Y  + E+ +R+ I+K+N   I
Sbjct: 3   VFCALLLLGV----TLAYIIERPTEDDSWIRWKMAHNKAYSHDGEETVRYTIWKDNERRI 58

Query: 71  ASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENASVPA 130
              N +  +  + L +N+F D TN EF+   NGY        S+  T  SF       P 
Sbjct: 59  REHNLQGGD--FLLEMNQFGDMTNNEFK-DFNGYLSHKHVSGSTFLTPNSF-----VAPD 110

Query: 131 SIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQ 190
           S+DWR +G VT VKDQGQCG CWAFS   ++EG N   T KL SLSEQ LVDC T+  + 
Sbjct: 111 SVDWRNEGYVTPVKDQGQCGSCWAFSTTGSLEGQNFKKTGKLVSLSEQNLVDCSTAYGNN 170

Query: 191 GCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAA 250
           GC GGLMD+AF +I  N G+ +EA YPY A DG C   + N  AA  +G+ D+PS +E  
Sbjct: 171 GCNGGLMDNAFTYIKENNGIDSEASYPYTAKDGKCAFTKPN-VAATDTGFVDIPSGDENK 229

Query: 251 LMKAVAN-QPVSVAIDASGSDFQFYSSGVFTGQ--CGTELDHGVTAVGYGTADDGTKYWL 307
           L +AVA+  P+SVAIDAS   FQFY  GV+  +    TELDHGV  VGYGT + G  YWL
Sbjct: 230 LKEAVASVGPISVAIDASHFSFQFYRKGVYNERKCSSTELDHGVLVVGYGT-ESGKDYWL 288

Query: 308 VKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
           VKNSW T+WG+ GYI+M R+   +   CGIA  ASYP
Sbjct: 289 VKNSWNTSWGDKGYIKMSRNAKNQ---CGIATNASYP 322


>gi|33242876|gb|AAQ01142.1| cathepsin [Branchiostoma lanceolatum]
          Length = 334

 Score =  262 bits (669), Expect = 2e-67,   Method: Compositional matrix adjust.
 Identities = 152/337 (45%), Positives = 198/337 (58%), Gaps = 14/337 (4%)

Query: 15  ILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFN 74
           +L+LG     + +  L     N+  EMW  Q+G+ Y   AE+  R  I ++N   IA  N
Sbjct: 3   LLILGAVISMATAGVL---PHNKEWEMWKLQHGKQYETEAEEYSRRFILEKNTVKIAEHN 59

Query: 75  NKAR--NKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENASVPASI 132
            +A      Y L +N+F D  +EEF     G   ++   +    +DV    +N ++P S+
Sbjct: 60  IRASLGMHSYTLAMNKFGDMHHEEFHQRIMGGCLKIVK-KPLLGSDVGDNDDNGTLPKSV 118

Query: 133 DWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGC 192
           DWR    V+ VKDQG+CG CWAFS   ++EG +   T KL  LSEQ+LVDC     +QGC
Sbjct: 119 DWRNSHMVSEVKDQGECGSCWAFSTTGSLEGQHSNKTGKLVDLSEQQLVDCSKDFGNQGC 178

Query: 193 EGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALM 252
            GGLMD AF++I +N GL TE  YPY A+D    K + +   A + GY+DV S NE AL 
Sbjct: 179 GGGLMDQAFQYIKANGGLDTEESYPYTATDDKPCKFDNSSVGATLVGYKDVKSGNEHALK 238

Query: 253 KAVAN-QPVSVAIDASGSDFQFYSSGVF-TGQCGTE-LDHGVTAVGYGTADDGTK--YWL 307
           +AVA   PVSVAIDA    FQFYSSGV+   QC TE LDHGV AVGYG  +D +   +W+
Sbjct: 239 RAVATVGPVSVAIDAGHESFQFYSSGVYDEPQCSTEQLDHGVLAVGYGAMNDNSHQAFWI 298

Query: 308 VKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
           VKNSWG +WG+ GYI M R+   K   CGIA  ASYP
Sbjct: 299 VKNSWGPSWGDQGYIMMSRN---KNNQCGIATSASYP 332


>gi|237844793|ref|XP_002371694.1| cathepsin L-like thiolproteinase, putative [Toxoplasma gondii ME49]
 gi|50313163|gb|AAT74529.1| toxopain-2 [Toxoplasma gondii]
 gi|89242977|gb|ABD64744.1| cathepsin L [Toxoplasma gondii]
 gi|95007485|emb|CAJ20707.1| toxopain-2 [Toxoplasma gondii RH]
 gi|211969358|gb|EEB04554.1| cathepsin L-like thiolproteinase, putative [Toxoplasma gondii ME49]
 gi|221480879|gb|EEE19300.1| cysteine protease, putative [Toxoplasma gondii GT1]
 gi|221501596|gb|EEE27366.1| cysteine protease, putative [Toxoplasma gondii VEG]
          Length = 422

 Score =  261 bits (668), Expect = 2e-67,   Method: Compositional matrix adjust.
 Identities = 135/317 (42%), Positives = 189/317 (59%), Gaps = 8/317 (2%)

Query: 32  DATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFAD 91
           +A   +    + A Y + Y    EK+ R+ IFK N+ YI + N +  +  Y L +N F D
Sbjct: 110 EAHFQDAFSSFQAMYAKSYATEEEKQRRYAIFKNNLVYIHTHNQQGYS--YSLKMNHFGD 167

Query: 92  QTNEEFRAPRNGYK--RRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQC 149
            + +EFR    G+K  R L S      T++      + +PA +DWR +G VT VKDQ  C
Sbjct: 168 LSRDEFRRKYLGFKKSRNLKSHHLGVATEL-LNVLPSELPAGVDWRSRGCVTPVKDQRDC 226

Query: 150 GCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKG 209
           G CWAFS   A+EG +   T KL SLSEQEL+DC  +  +Q C GG M+DAF++++ + G
Sbjct: 227 GSCWAFSTTGALEGAHCAKTGKLVSLSEQELMDCSRAEGNQSCSGGEMNDAFQYVLDSGG 286

Query: 210 LATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGS 269
           + +E  YPY A D  C + ++     KI G++DVP  +EAA+  A+A  PVS+AI+A   
Sbjct: 287 ICSEDAYPYLARDEEC-RAQSCEKVVKILGFKDVPRRSEAAMKAALAKSPVSIAIEADQM 345

Query: 270 DFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTK-YWLVKNSWGTTWGENGYIRMQRDI 328
            FQFY  GVF   CGT+LDHGV  VGYGT  +  K +W++KNSWGT WG +GY+ M    
Sbjct: 346 PFQFYHEGVFDASCGTDLDHGVLLVGYGTDKESKKDFWIMKNSWGTGWGRDGYMYMAMH- 404

Query: 329 DAKEGLCGIAMQASYPT 345
             +EG CG+ + AS+P 
Sbjct: 405 KGEEGQCGLLLDASFPV 421


>gi|146152090|gb|ABQ08058.1| cathepsin L [Misgurnus mizolepis]
          Length = 337

 Score =  261 bits (668), Expect = 3e-67,   Method: Compositional matrix adjust.
 Identities = 154/345 (44%), Positives = 200/345 (57%), Gaps = 24/345 (6%)

Query: 10  LVLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEY 69
           L L  + + GV+A  S  + L+D       E W   +G+ Y +  E   R  I+++N+  
Sbjct: 5   LALFTLCLSGVFAAPSLDKQLDD-----HWEQWKTWHGKNYHEKEEGWRRM-IWEKNLRK 58

Query: 70  IASFNNKARN---KPYKLGINEFADQTNEEFRAPRNGYKRRLP-SVRSSETTDVSFRYEN 125
           I  F+N   +     Y+LG+N F D  +EEFR   NGYK +     + S   + +F    
Sbjct: 59  I-QFHNLEHSMGIHTYRLGMNHFGDMNHEEFRQVMNGYKHKTERKFKGSLFMEPNF---- 113

Query: 126 ASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDT 185
             VP+ +DWR+KG VT VKDQG+CG CWAFS   AMEG       KL SLSEQ LVDC  
Sbjct: 114 LEVPSKLDWREKGYVTPVKDQGECGSCWAFSTTGAMEGQMFRKQGKLVSLSEQNLVDCSR 173

Query: 186 SGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPS 245
              ++GC GGLMD AF++I  N GL +E  YPY  +D      +   +AA  +G+ D+PS
Sbjct: 174 PEGNEGCNGGLMDQAFQYIKDNNGLDSEEAYPYLGTDDQPCHYDPKYNAANDTGFVDIPS 233

Query: 246 NNEAALMKAVAN-QPVSVAIDASGSDFQFYSSGV-FTGQCGT-ELDHGVTAVGYGTAD-- 300
             E ALMKAVA+  PVSVAIDA    FQFY SG+ F  +C + ELDHGV  VGYG     
Sbjct: 234 GKEHALMKAVASVGPVSVAIDAGHESFQFYQSGIYFEKECSSEELDHGVLVVGYGFEGED 293

Query: 301 -DGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
            DG KYW+VKNSW  +WG+ GYI M +D   ++  CGIA  ASYP
Sbjct: 294 VDGKKYWIVKNSWSESWGDKGYIYMAKD---RKNHCGIATAASYP 335


>gi|2098464|pdb|1PCI|A Chain A, Procaricain
 gi|2098465|pdb|1PCI|B Chain B, Procaricain
 gi|2098466|pdb|1PCI|C Chain C, Procaricain
          Length = 322

 Score =  261 bits (668), Expect = 3e-67,   Method: Compositional matrix adjust.
 Identities = 147/320 (45%), Positives = 189/320 (59%), Gaps = 14/320 (4%)

Query: 31  NDATMNER----HEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGI 86
           +D T  ER       WM  + + Y +  EK  RF+IFK+N+ YI   N K  N  Y LG+
Sbjct: 10  DDLTSTERLIQLFNSWMLNHNKFYENVDEKLYRFEIFKDNLNYIDETNKK--NNSYWLGL 67

Query: 87  NEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYEN-ASVPASIDWRKKGAVTGVKD 145
           NEFAD +N+EF      Y   L      ++ D  F  E+  ++P ++DWRKKGAVT V+ 
Sbjct: 68  NEFADLSNDEFNEK---YVGSLIDATIEQSYDEEFINEDIVNLPENVDWRKKGAVTPVRH 124

Query: 146 QGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFII 205
           QG CG CWAFSAVA +EGIN I T KL  LSEQELVDC+      GC+GG    A E++ 
Sbjct: 125 QGSCGSCWAFSAVATVEGINKIRTGKLVELSEQELVDCER--RSHGCKGGYPPYALEYVA 182

Query: 206 SNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAID 265
            N G+   +KYPYKA  G+C  K+      K SG   V  NNE  L+ A+A QPVSV ++
Sbjct: 183 KN-GIHLRSKYPYKAKQGTCRAKQVGGPIVKTSGVGRVQPNNEGNLLNAIAKQPVSVVVE 241

Query: 266 ASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQ 325
           + G  FQ Y  G+F G CGT++D  VTAVGYG +       L+KNSWGT WGE GYIR++
Sbjct: 242 SKGRPFQLYKGGIFEGPCGTKVDGAVTAVGYGKSGGKGYI-LIKNSWGTAWGEKGYIRIK 300

Query: 326 RDIDAKEGLCGIAMQASYPT 345
           R      G+CG+   + YPT
Sbjct: 301 RAPGNSPGVCGLYKSSYYPT 320


>gi|288548566|gb|ADC52431.1| cathepsin L2 cysteine protease [Pinctada fucata]
          Length = 330

 Score =  261 bits (668), Expect = 3e-67,   Method: Compositional matrix adjust.
 Identities = 145/309 (46%), Positives = 192/309 (62%), Gaps = 18/309 (5%)

Query: 45  QYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARN--KPYKLGINEFADQTNEEFRAPRN 102
           QY ++Y++  E   R  +++ N+++I   N  A      + +G+NE+ D TNEEF    N
Sbjct: 33  QYNKLYQNEEEARRRL-VWESNLDFITLHNLAADRGEHTFWVGMNEYGDMTNEEFTKTMN 91

Query: 103 GYKRRLPSVRSSETTDVSFRYEN--ASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAA 160
           GY+     +R+  +    F   N    +P ++DWR KG VT +K+QGQCG CW+FSA  +
Sbjct: 92  GYR-----MRNKTSNAPVFMPPNNMGDLPDTVDWRPKGYVTPIKNQGQCGSCWSFSATGS 146

Query: 161 MEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKA 220
           +EG     T KL SLSEQ LVDC     + GCEGGLMDDAF +I +N G+ TEA YPYKA
Sbjct: 147 LEGQTFKKTGKLVSLSEQNLVDCSKKQGNHGCEGGLMDDAFTYIKANNGIDTEASYPYKA 206

Query: 221 SDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVAN-QPVSVAIDASGSDFQFYSSGVF 279
            DG C  K A+  A   +G+ D+ + +E AL +AVA   P+SVAIDAS   FQ Y +GV+
Sbjct: 207 RDGKCEFKSADVGATD-TGFVDIKTKDEEALKQAVATVGPISVAIDASHMSFQLYRTGVY 265

Query: 280 TGQ-CG-TELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGI 337
               C  T+LDHGV AVGYGT +D   YWLVKNSWG +WG+ GYI+M R+   +   CGI
Sbjct: 266 HDWFCSQTKLDHGVLAVGYGT-EDSKDYWLVKNSWGESWGQKGYIQMSRN---RRNNCGI 321

Query: 338 AMQASYPTA 346
           A  ASYPT 
Sbjct: 322 ATSASYPTV 330


>gi|283898066|emb|CBI99501.1| cysteine peptidase precursor [Bromelia hieronymi]
          Length = 230

 Score =  261 bits (667), Expect = 3e-67,   Method: Compositional matrix adjust.
 Identities = 121/219 (55%), Positives = 157/219 (71%), Gaps = 5/219 (2%)

Query: 127 SVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTS 186
           +VP SIDWR  GAVT VK+QG+CG CW+FSA+A +EGI  I T  L SLSEQE++DC  S
Sbjct: 1   AVPQSIDWRDYGAVTSVKNQGRCGSCWSFSAIATVEGIYKIKTGNLVSLSEQEVLDCAVS 60

Query: 187 GEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSN 246
               GC+GG +D A+ FIISN G+ + A YPYK   G+C      P+AA I+GY+ V  N
Sbjct: 61  ---HGCKGGWVDKAYNFIISNNGVTSAAYYPYKGYQGTCGANSV-PNAAYITGYKYVQRN 116

Query: 247 NEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYW 306
           NE ++M A++NQP++  IDASG +FQ+Y  GV++G CGT L+H +T +GYG    G KYW
Sbjct: 117 NERSMMYALSNQPIAALIDASGKNFQYYKGGVYSGPCGTSLNHAITVIGYGQDSSGIKYW 176

Query: 307 LVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPT 345
           +VKNSWGT+WGE GYIRM RD+ +  G+CGIAM   +PT
Sbjct: 177 IVKNSWGTSWGERGYIRMARDVSS-SGICGIAMAPLFPT 214


>gi|327263389|ref|XP_003216502.1| PREDICTED: cathepsin L1-like [Anolis carolinensis]
          Length = 339

 Score =  261 bits (667), Expect = 3e-67,   Method: Compositional matrix adjust.
 Identities = 153/324 (47%), Positives = 193/324 (59%), Gaps = 19/324 (5%)

Query: 32  DATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFN--NKARNKPYKLGINEF 89
           D+ +++  + W   + + Y    E   R  I+++N++ I   N  +      Y+LG+N F
Sbjct: 22  DSALDDHWQAWKTWHSKKYHQQEEGWRRM-IWEKNLKMIQLHNLDHSLGKHSYRLGMNHF 80

Query: 90  ADQTNEEFRAPRNGYK--RRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQG 147
            D TNEEFR   NGYK  +     R SE  + +F      VP S+DWR+KG VT VKDQG
Sbjct: 81  GDMTNEEFRQVMNGYKHSKTEKKYRGSEFLEPNF----LVVPKSVDWREKGYVTPVKDQG 136

Query: 148 QCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISN 207
           QCG CWAFS   ++EG +   T KL SLSEQ LVDC     +QGC GGLMD AFE+I  N
Sbjct: 137 QCGSCWAFSTTGSLEGQHFRKTGKLVSLSEQNLVDCSRPEGNQGCNGGLMDQAFEYIADN 196

Query: 208 KGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVAN-QPVSVAIDA 266
            G+ +E  YPY A D      ++  +AA  +G+ DVP  +E ALMKAVA   PVSVAIDA
Sbjct: 197 GGIDSEESYPYIAKDDEDCLYKSEFNAANDTGFVDVPEGHERALMKAVAAVGPVSVAIDA 256

Query: 267 SGSDFQFYSSGVFTG-QCGT-ELDHGVTAVGY---GTADDG-TKYWLVKNSWGTTWGENG 320
           S S FQFY SG++    C + ELDHGV  VGY   GT DD   KYW+VKNSW   WG+ G
Sbjct: 257 SHSTFQFYESGIYYDPDCSSEELDHGVLVVGYGFEGTDDDNKKKYWIVKNSWSDKWGDKG 316

Query: 321 YIRMQRDIDAKEGLCGIAMQASYP 344
           YI M +D   +   CGIA  ASYP
Sbjct: 317 YILMAKD---RNNHCGIATAASYP 337


>gi|186701255|gb|ACC91281.1| putative cysteine proteinase [Capsella rubella]
          Length = 324

 Score =  261 bits (667), Expect = 3e-67,   Method: Compositional matrix adjust.
 Identities = 143/344 (41%), Positives = 205/344 (59%), Gaps = 43/344 (12%)

Query: 10  LVLAAILVLGVWAPQS---WSRTLNDATMNER----HEMWMAQYGRVYRDN-AEKEMRFK 61
           ++  ++L++ +  P S    S T      NE      + WM+++G+ Y +   +KE RF+
Sbjct: 9   MITLSLLIIFLLPPSSAMDLSVTSGGLRSNEEVGFIFQTWMSKHGKTYTNALGDKEQRFQ 68

Query: 62  IFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSF 121
            FK+N+ +I   N  A+N  Y+LG+ +FAD T +E++   +G  R +   ++   T    
Sbjct: 69  NFKDNLRFIDQHN--AKNLSYRLGLTQFADLTVQEYQDLFSG--RPIQKQKALRVTHRYV 124

Query: 122 RYENASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELV 181
                 +P S+DWR+KGAV+ +KDQG+C           +E IN I T +L SLSEQELV
Sbjct: 125 PLAEDQLPQSVDWRQKGAVSEIKDQGRC----------TVESINKIVTGELISLSEQELV 174

Query: 182 DCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKE-ANPSAAKISGY 240
           DC  S ++ GC GGLMD AF+F+I+N GL  ++ YPY+A  G CN  +  +    KI GY
Sbjct: 175 DC--SIDNHGCNGGLMDSAFQFLINNNGLEYQSDYPYQAVQGYCNHNQNTSKKVIKIDGY 232

Query: 241 EDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTAD 300
           EDVP+NNE +L KAVA+QP                 G++TG CGT+LDH V  VGYGT +
Sbjct: 233 EDVPANNENSLQKAVAHQP-----------------GIYTGPCGTDLDHAVVIVGYGT-E 274

Query: 301 DGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
           +G  YW+V+NSWGT WGE GY ++ R+ +   G+CGIAM ASYP
Sbjct: 275 NGQDYWIVRNSWGTVWGEAGYAKIARNFENPTGVCGIAMVASYP 318


>gi|164472556|gb|ABY58967.1| cathepsin L [Toxoplasma gondii]
          Length = 421

 Score =  261 bits (667), Expect = 4e-67,   Method: Compositional matrix adjust.
 Identities = 135/317 (42%), Positives = 189/317 (59%), Gaps = 8/317 (2%)

Query: 32  DATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFAD 91
           +A   +    + A Y + Y    EK+ R+ IFK N+ YI + N +  +  Y L +N F D
Sbjct: 109 EAHFQDAFSSFQAMYAKSYATEEEKQRRYAIFKNNLVYIHTHNQQGYS--YSLKMNHFGD 166

Query: 92  QTNEEFRAPRNGYK--RRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQC 149
            + +EFR    G+K  R L S      T++      + +PA +DWR +G VT VKDQ  C
Sbjct: 167 LSRDEFRRKYLGFKKSRNLKSHHLGVATEL-LNVLPSELPAGVDWRSRGCVTPVKDQRDC 225

Query: 150 GCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKG 209
           G CWAFS   A+EG +   T KL SLSEQEL+DC  +  +Q C GG M+DAF++++ + G
Sbjct: 226 GSCWAFSTTGALEGAHCAKTGKLVSLSEQELMDCSRAEGNQSCSGGEMNDAFQYVLDSGG 285

Query: 210 LATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGS 269
           + +E  YPY A D  C + ++     KI G++DVP  +EAA+  A+A  PVS+AI+A   
Sbjct: 286 ICSEDAYPYLARDEEC-RAQSCEKVVKILGFKDVPRRSEAAMKAALAKSPVSIAIEADQM 344

Query: 270 DFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTK-YWLVKNSWGTTWGENGYIRMQRDI 328
            FQFY  GVF   CGT+LDHGV  VGYGT  +  K +W++KNSWGT WG +GY+ M    
Sbjct: 345 PFQFYHEGVFDASCGTDLDHGVLLVGYGTDKESKKDFWIMKNSWGTGWGRDGYMYMAMH- 403

Query: 329 DAKEGLCGIAMQASYPT 345
             +EG CG+ + AS+P 
Sbjct: 404 KGEEGQCGLLLDASFPV 420


>gi|391338876|ref|XP_003743781.1| PREDICTED: cathepsin L-like isoform 4 [Metaseiulus occidentalis]
          Length = 336

 Score =  261 bits (666), Expect = 4e-67,   Method: Compositional matrix adjust.
 Identities = 151/349 (43%), Positives = 199/349 (57%), Gaps = 22/349 (6%)

Query: 3   MILLENKLVLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKI 62
           M  L  K ++ A+LV    A  +  +  +    N     +   + + Y  +  +  R KI
Sbjct: 1   MEQLSMKFLILAVLVGAASAALTLEQLFDAEWQN-----FKVHHNKKYEGSTVEAFRKKI 55

Query: 63  FKENVEYIASFNNKARN--KPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVS 120
           F +N   IA  N K       YKL +N+F D  + EF +  NG       +RS+ T   S
Sbjct: 56  FLQNTHLIARHNIKHAKGETTYKLKMNQFGDMLHHEFVSTMNGL------LRSNRTYFGS 109

Query: 121 --FRYENASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQ 178
                E+ S+P S+DWR+KGAVT VK+QG CG CW+FS   A+EG     T +L SLSEQ
Sbjct: 110 TWIEPESVSLPKSVDWREKGAVTPVKNQGHCGSCWSFSTTGALEGQLFRKTGELVSLSEQ 169

Query: 179 ELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKIS 238
            L+DC TS  + GC GGLMD+AF +I  N G+ TE  YPY+   G C +     SA + +
Sbjct: 170 NLIDCSTSYGNNGCGGGLMDNAFTYIKENHGIDTEESYPYEGKQGKC-RYHKEDSAGRDT 228

Query: 239 GYEDVPSNNEAALMKAVAN-QPVSVAIDASGSDFQFYSSGVFT-GQCGTE-LDHGVTAVG 295
           G+ D+PS NE AL KA+A   PVSVAIDAS   FQFY  GV+    C +  LDHGV AVG
Sbjct: 229 GFVDIPSGNERALAKALATIGPVSVAIDASHESFQFYHEGVYNPPDCDSHSLDHGVLAVG 288

Query: 296 YGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
           YGT DDG  Y+++KNSWG  WG+ GY+ M R+    +  CG+A QASYP
Sbjct: 289 YGTTDDGQDYYIIKNSWGERWGQEGYVLMARN---SKNECGVATQASYP 334


>gi|222632170|gb|EEE64302.1| hypothetical protein OsJ_19139 [Oryza sativa Japonica Group]
          Length = 1105

 Score =  261 bits (666), Expect = 4e-67,   Method: Compositional matrix adjust.
 Identities = 130/258 (50%), Positives = 162/258 (62%), Gaps = 15/258 (5%)

Query: 101 RNGYKRRLPSVRSS--------ETTDVSFRYEN-----ASVPASIDWRKKGAVTGVKDQG 147
           R  Y RR+P+ R S           D    Y        +VP ++DWR+ GAVT VKDQG
Sbjct: 89  RGPYARRVPAPRRSGRLAAAGGPGRDGGAPYLGVDGGVGAVPDAVDWRQSGAVTKVKDQG 148

Query: 148 QCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISN 207
            CG CW+FSA  AMEGIN I T  L SLSEQEL+DCD S  + GC GGLMD A++F++ N
Sbjct: 149 SCGACWSFSATGAMEGINKIKTGSLISLSEQELIDCDRS-YNSGCGGGLMDYAYKFVVKN 207

Query: 208 KGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDAS 267
            G+ TEA YPY+ +DG+CNK +       I GY+DVP+NNE  L++AVA QPVSV I  S
Sbjct: 208 GGIDTEADYPYRETDGTCNKNKLKRRVVTIDGYKDVPANNEDMLLQAVAQQPVSVGICGS 267

Query: 268 GSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRD 327
              FQ YS G+F G C T LDH +  VGYG+ + G  YW+VKNSWG +WG  GY+ M R+
Sbjct: 268 ARAFQLYSKGIFDGPCPTSLDHAILIVGYGS-EGGKDYWIVKNSWGESWGMKGYMYMHRN 326

Query: 328 IDAKEGLCGIAMQASYPT 345
                G+CGI    S+PT
Sbjct: 327 TGNSNGVCGINQMPSFPT 344


>gi|449513868|ref|XP_002191976.2| PREDICTED: cathepsin L1-like [Taeniopygia guttata]
          Length = 443

 Score =  261 bits (666), Expect = 5e-67,   Method: Compositional matrix adjust.
 Identities = 148/323 (45%), Positives = 194/323 (60%), Gaps = 18/323 (5%)

Query: 32  DATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFN--NKARNKPYKLGINEF 89
           D  ++   ++W + + + Y +  E   R  ++++N++ I   N  +      YKLG+N+F
Sbjct: 127 DPELDGHWQLWKSWHRKDYHEREEGWRRV-VWEKNLKMIEIHNLDHALGKHSYKLGMNQF 185

Query: 90  ADQTNEEFRAPRNGY--KRRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQG 147
            D T EEFR   NGY  K+     R S+  + +F       P S+DWR+KG VT VKDQG
Sbjct: 186 GDMTTEEFRQLMNGYVHKKSERKYRGSQFLEPNF----LEAPRSVDWREKGYVTPVKDQG 241

Query: 148 QCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISN 207
           QCG CWAFS   A+EG +   T KL SLSEQ LVDC     +QGC GGLMD AF+++  N
Sbjct: 242 QCGSCWAFSTTGALEGQHFRKTGKLVSLSEQNLVDCSRPEGNQGCNGGLMDQAFQYVQDN 301

Query: 208 KGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVAN-QPVSVAIDA 266
            G+ +E  YPY A D    + +A  +AA  +G+ D+P  +E ALMKAVA   PVSVAIDA
Sbjct: 302 GGIDSEESYPYTAKDDEDCRYKAEYNAANDTGFVDIPQGHERALMKAVAAVGPVSVAIDA 361

Query: 267 SGSDFQFYSSGV-FTGQCGTE-LDHGVTAVGYGTAD---DGTKYWLVKNSWGTTWGENGY 321
             S FQFY SG+ +   C +E LDHGV  VGYG      DG KYW+VKNSWG  WG+ GY
Sbjct: 362 GHSSFQFYQSGIYYEPDCSSEDLDHGVLVVGYGFEGEDVDGKKYWIVKNSWGEKWGDKGY 421

Query: 322 IRMQRDIDAKEGLCGIAMQASYP 344
           I M +D   ++  CGIA  ASYP
Sbjct: 422 IYMAKD---RKNHCGIATAASYP 441


>gi|342675481|gb|AEL31666.1| cathepsin L [Cynoglossus semilaevis]
          Length = 336

 Score =  261 bits (666), Expect = 5e-67,   Method: Compositional matrix adjust.
 Identities = 152/344 (44%), Positives = 205/344 (59%), Gaps = 19/344 (5%)

Query: 10  LVLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEY 69
           ++  A+L LGV A  S + +L DA +++  E+W   + + Y +  E   R  I+++N+  
Sbjct: 1   MLPLALLALGVSAVLS-APSL-DARLSDHWELWKNWHSKKYHEKEEGWRRM-IWEKNLNK 57

Query: 70  IASFN--NKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENAS 127
           I   N  +      Y+LG+N F D T+EEFR   NGY+R+       +     F   N  
Sbjct: 58  IELHNLEHSMGKHSYRLGMNHFGDMTHEEFRQIMNGYQRK----TERKAIGSLFMEPNFM 113

Query: 128 V-PASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTS 186
           V P+++DWR+KG VT VKDQGQCG CWAFS   A+ZG N     KL SLSEQ LVDC   
Sbjct: 114 VAPSAVDWREKGYVTPVKDQGQCGSCWAFSTTGALZGQNFRKMGKLVSLSEQNLVDCSRP 173

Query: 187 GEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSN 246
             ++GC GGLMD AF+++  N+GL +E  YPY  +D      +   ++   +G+ D+PS 
Sbjct: 174 EGNEGCGGGLMDQAFQYVKDNQGLDSEDSYPYLGTDDQPCHYDPKYNSVNDTGFVDIPSG 233

Query: 247 NEAALMKAVAN-QPVSVAIDASGSDFQFYSSGV-FTGQCGT-ELDHGVTAVGYGTAD--- 300
            E ALMKAVA+  PVSVAIDA    FQFY SG+ +  +C + ELDHGV AVGYG      
Sbjct: 234 KEHALMKAVASVGPVSVAIDAGHESFQFYQSGIYYEKECSSEELDHGVLAVGYGFEGEDV 293

Query: 301 DGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
           DG KYW+VKNSW   WG+ GYI M +D   ++  CGIA  ASYP
Sbjct: 294 DGKKYWIVKNSWSEKWGDKGYIYMAKD---RKNHCGIATAASYP 334


>gi|326503122|dbj|BAJ99186.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326512552|dbj|BAJ99631.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 389

 Score =  261 bits (666), Expect = 5e-67,   Method: Compositional matrix adjust.
 Identities = 140/330 (42%), Positives = 191/330 (57%), Gaps = 25/330 (7%)

Query: 35  MNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNK--PYKLGINEFADQ 92
           M  R  +WM    R Y  ++EK  RFK+++ N+ YI + N +A      Y+LG   F D 
Sbjct: 56  MMARFHVWMTVQNRSYPTSSEKAHRFKVYRSNMRYIEALNAEATTSGFTYELGEGPFTDL 115

Query: 93  TNEEF------RAPRNGYKR-----------RLPSVRSSETTDVSFRYENASVPASIDWR 135
           T+EEF      + P + ++               SV  +E   V   + +A  P  +DWR
Sbjct: 116 TDEEFISLYTGKIPDDDHREDGVHDEQIITTHAGSVNGAEGVTVYANF-SAGAPIRMDWR 174

Query: 136 KKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGG 195
           K+GAVT VKDQG+CG CWAF  VA +EGI+ I   +L SLSEQ+LVDCD    D GC GG
Sbjct: 175 KRGAVTPVKDQGKCGSCWAFPTVATIEGIHKIKRGRLVSLSEQQLVDCDF--LDGGCNGG 232

Query: 196 LMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAV 255
              +AF++II N G+ T + Y YKA++G C  K     AAKI+GY  V SN+E +++  V
Sbjct: 233 WPRNAFQWIIQNGGITTTSSYTYKAAEGQC--KGNRKPAAKITGYRKVKSNSEVSMVNIV 290

Query: 256 ANQPVSVAIDASGSDFQFYSSGVFTGQCGT-ELDHGVTAVGYGTADDGTKYWLVKNSWGT 314
           ANQP++ +I   G  FQ Y  G++ G C T +L+H +T VGYG    G KYW+VKNSWG 
Sbjct: 291 ANQPIAASIVVHGGQFQHYKGGIYNGPCATSKLNHVITIVGYGQQAYGAKYWIVKNSWGA 350

Query: 315 TWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
            WG  GY+ M+R      G CGIA++  +P
Sbjct: 351 AWGNKGYMLMKRGTKNPLGQCGIAVRPIFP 380


>gi|33242880|gb|AAQ01144.1| cathepsin [Branchiostoma lanceolatum]
          Length = 334

 Score =  260 bits (665), Expect = 5e-67,   Method: Compositional matrix adjust.
 Identities = 154/339 (45%), Positives = 197/339 (58%), Gaps = 18/339 (5%)

Query: 15  ILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFN 74
           +L+LG     + +  L     N+  EMW  Q+G+ Y   AE+  R  IF++N   IA  N
Sbjct: 3   LLILGAVISMATAGVL---PHNKEWEMWKLQHGKQYETEAEEYSRRFIFEKNTVKIAEHN 59

Query: 75  NKAR--NKPYKLGINEFADQTNEEFRAPRNG--YKRRLPSVRSSETTDVSFRYENASVPA 130
            +A      Y L +N+F D  +EEF     G   K     +  SE  D     +N ++P 
Sbjct: 60  IRASLGMHSYTLAMNKFGDMHHEEFHQRIMGGCLKIVKKPLLGSEVGDND---DNGTLPK 116

Query: 131 SIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQ 190
           S+DWR    V+ VKDQG+CG CWAFS   ++EG +   T KL  LSEQ+LVDC     +Q
Sbjct: 117 SVDWRNSHMVSEVKDQGECGSCWAFSTTGSLEGQHSNKTGKLVDLSEQQLVDCSKDFGNQ 176

Query: 191 GCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAA 250
           GC GGLMD AF++I +N GL TE  YPY A+D    K + +   A + GY+DV S+NE A
Sbjct: 177 GCGGGLMDQAFQYIKANGGLDTEESYPYTATDDKPCKFDNSSVGATLIGYKDVKSSNEHA 236

Query: 251 LMKAVAN-QPVSVAIDASGSDFQFYSSGVF-TGQCGTE-LDHGVTAVGYGTADDGTK--Y 305
           L +AVA   PVSVAIDA    FQFYSSGV+   QC TE LDHGV  VGYG  +D +   +
Sbjct: 237 LKRAVATVGPVSVAIDAGHESFQFYSSGVYDEPQCSTEQLDHGVLVVGYGAMNDNSHQAF 296

Query: 306 WLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
           W+VKNSWG  WG+ GYI M R+   K   CGIA  ASYP
Sbjct: 297 WIVKNSWGPNWGDQGYIMMSRN---KNNQCGIATSASYP 332


>gi|228244|prf||1801240B Cys protease 2
          Length = 323

 Score =  260 bits (665), Expect = 5e-67,   Method: Compositional matrix adjust.
 Identities = 146/310 (47%), Positives = 191/310 (61%), Gaps = 14/310 (4%)

Query: 40  EMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNK--PYKLGINEFADQTNEEF 97
           E +  +YGR Y D  E   R  IF++N +YI  FN K  N    + L +N+F D T EEF
Sbjct: 21  EHFKGKYGRQYVDAEEDSYRRVIFEQNQKYIEEFNKKYENGEVTFNLAMNKFGDMTLEEF 80

Query: 98  RAPRNGYKRRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQCGCCWAFSA 157
            A   G   R    RS+  +    + E       +DWR KGAVT VKDQGQCG CWAFS 
Sbjct: 81  NAVMKGNIPR----RSAPVSVFYPKKETGPQATEVDWRTKGAVTPVKDQGQCGSCWAFST 136

Query: 158 VAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYP 217
             ++EG + + T  L SL+EQ+LVDC      QGC GG M+DAF++I +N G+ TEA YP
Sbjct: 137 TGSLEGQHFLKTGSLISLAEQQLVDCSRPYGPQGCNGGWMNDAFDYIKANNGIDTEASYP 196

Query: 218 YKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVAN-QPVSVAIDASGSDFQFYSS 276
           Y+A DGSC + ++N  AA  SG+ ++ S +E  L +AV +  P+SV IDA+ S FQFYSS
Sbjct: 197 YEARDGSC-RFDSNSVAATCSGHTNIASGSETGLQQAVRDIGPISVTIDAAHSSFQFYSS 255

Query: 277 GV-FTGQCG-TELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGL 334
           GV +   C  + LDH V AVGYG+ + G  +WLVKNSW T+WG+ GYI+M R+   +   
Sbjct: 256 GVYYEPSCSPSYLDHAVLAVGYGS-EGGQDFWLVKNSWATSWGDAGYIKMSRN---RNNN 311

Query: 335 CGIAMQASYP 344
           CGIA  ASYP
Sbjct: 312 CGIATVASYP 321


>gi|18202415|sp|P82474.1|CPGP2_ZINOF RecName: Full=Zingipain-2; AltName: Full=Cysteine proteinase GP-II
 gi|6137410|pdb|1CQD|A Chain A, The 2.1 Angstrom Structure Of A Cysteine Protease With
           Proline Specificity From Ginger Rhizome, Zingiber
           Officinale
 gi|6137411|pdb|1CQD|B Chain B, The 2.1 Angstrom Structure Of A Cysteine Protease With
           Proline Specificity From Ginger Rhizome, Zingiber
           Officinale
 gi|6137412|pdb|1CQD|C Chain C, The 2.1 Angstrom Structure Of A Cysteine Protease With
           Proline Specificity From Ginger Rhizome, Zingiber
           Officinale
 gi|6137413|pdb|1CQD|D Chain D, The 2.1 Angstrom Structure Of A Cysteine Protease With
           Proline Specificity From Ginger Rhizome, Zingiber
           Officinale
          Length = 221

 Score =  260 bits (665), Expect = 5e-67,   Method: Compositional matrix adjust.
 Identities = 124/218 (56%), Positives = 154/218 (70%), Gaps = 4/218 (1%)

Query: 128 VPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSG 187
           +P SIDWR+ GAV  VK+QG CG CWAFS VAA+EGIN I T  L SLSEQ+LVDC T+ 
Sbjct: 3   LPDSIDWRENGAVVPVKNQGGCGSCWAFSTVAAVEGINQIVTGDLISLSEQQLVDCTTA- 61

Query: 188 EDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNN 247
            + GC GG M+ AF+FI++N G+ +E  YPY+  DG CN    N     I  YE+VPS+N
Sbjct: 62  -NHGCRGGWMNPAFQFIVNNGGINSEETYPYRGQDGICNST-VNAPVVSIDSYENVPSHN 119

Query: 248 EAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWL 307
           E +L KAVANQPVSV +DA+G DFQ Y SG+FTG C    +H +T VGYGT +D   +W+
Sbjct: 120 EQSLQKAVANQPVSVTMDAAGRDFQLYRSGIFTGSCNISANHALTVVGYGTEND-KDFWI 178

Query: 308 VKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPT 345
           VKNSWG  WGE+GYIR +R+I+  +G CGI   ASYP 
Sbjct: 179 VKNSWGKNWGESGYIRAERNIENPDGKCGITRFASYPV 216


>gi|242040563|ref|XP_002467676.1| hypothetical protein SORBIDRAFT_01g032090 [Sorghum bicolor]
 gi|241921530|gb|EER94674.1| hypothetical protein SORBIDRAFT_01g032090 [Sorghum bicolor]
          Length = 358

 Score =  260 bits (665), Expect = 5e-67,   Method: Compositional matrix adjust.
 Identities = 146/312 (46%), Positives = 193/312 (61%), Gaps = 10/312 (3%)

Query: 35  MNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTN 94
           M +R   W A Y R Y    E++ RF++++ N+E+I +  N+A N  Y LG N+FAD T 
Sbjct: 53  MMDRFLRWQATYNRSYPTAEERQRRFQVYRRNMEHIEA-TNRAGNLTYTLGENQFADLTE 111

Query: 95  EEFRAPRNGYKRRLPSVRSSETTDVSFRYENA-SVPASIDWRKKGAVTGVKDQG-QCGCC 152
           EEF        + +P VR          + +    P S+DWR +GAVT +K+QG  C  C
Sbjct: 112 EEFLDLYT--MKGMPPVRRDAGKKQQANFSSVVDAPTSVDWRSRGAVTPIKNQGPSCSSC 169

Query: 153 WAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLAT 212
           WAF   A +E I  I T KL SLSEQEL+DCD    D GC  G   + ++++I N GL T
Sbjct: 170 WAFVTAATIESITQIRTGKLVSLSEQELIDCDPY--DGGCNLGYFVNGYKWVIQNGGLTT 227

Query: 213 EAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQ 272
           EA YPY+A    CN+ +A   AA+IS Y  +P   EA L +AVA QPV+ AI+  GS  Q
Sbjct: 228 EANYPYQARRYQCNRSKAGQRAARISNYRQLP-QGEAQLQQAVAQQPVAAAIEMGGS-LQ 285

Query: 273 FYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKE 332
           FYS GV++GQCGT ++H +T VGYG    G KYWLVKNSWG TWGE GY+RM++D+  + 
Sbjct: 286 FYSGGVWSGQCGTRMNHAITVVGYGADSSGVKYWLVKNSWGQTWGERGYLRMRKDVR-QG 344

Query: 333 GLCGIAMQASYP 344
           GLCGIA+  +YP
Sbjct: 345 GLCGIALDLAYP 356


>gi|300122868|emb|CBK23875.2| unnamed protein product [Blastocystis hominis]
          Length = 316

 Score =  260 bits (665), Expect = 6e-67,   Method: Compositional matrix adjust.
 Identities = 147/316 (46%), Positives = 190/316 (60%), Gaps = 20/316 (6%)

Query: 31  NDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFA 90
           +DA   +  + + A+YG+ Y  ++E+E R K+   N+++I  FN+   +  + LG+  FA
Sbjct: 19  SDAYYEKLFQTFEAKYGKNYL-SSEREYRKKVLAYNMDWIEKFNSDEHS--FTLGMTPFA 75

Query: 91  DQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQCG 150
           D TN EF         +L             R  N     SIDWR+KGAVT VK+QG CG
Sbjct: 76  DMTNTEFAT------SKLCGCMKKPLNHKQARVLNNMAVESIDWREKGAVTPVKNQGSCG 129

Query: 151 CCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGL 210
            CWAFSA  A+EG N + T KL SLSEQ+LVDCDT  ED GC GG MD AFE+++  KGL
Sbjct: 130 SCWAFSATGALEGGNFVATGKLVSLSEQQLVDCDT--EDAGCGGGFMDTAFEYVM-KKGL 186

Query: 211 ATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSD 270
            TE  YPY A D  C K +   S   I+GYEDVP+N+  AL +A+   PVSVAI A    
Sbjct: 187 CTEEDYPYHAKDEDC-KDDQCTSVISITGYEDVPANDGVALKQALTKAPVSVAIQADSFV 245

Query: 271 FQFYSSGVF-TGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDID 329
           FQ Y+ GV  +  CGT L+HGV AVGY       +Y +VKNSWG +WG+ GY+++    D
Sbjct: 246 FQMYTGGVLDSDMCGTSLNHGVLAVGY-----AKEYIIVKNSWGASWGDKGYVKIAHR-D 299

Query: 330 AKEGLCGIAMQASYPT 345
             EG+CGI M ASYPT
Sbjct: 300 QGEGICGINMAASYPT 315


>gi|440799058|gb|ELR20119.1| cysteine proteinase [Acanthamoeba castellanii str. Neff]
          Length = 401

 Score =  260 bits (665), Expect = 6e-67,   Method: Compositional matrix adjust.
 Identities = 144/309 (46%), Positives = 186/309 (60%), Gaps = 11/309 (3%)

Query: 42  WMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARN-KPYKLGINEFADQTNEEFRAP 100
           WM  + + Y  +     RF+I+K N  +I  +N K  N   + + IN+F D T++EF   
Sbjct: 98  WMRTHRKSYHHD-HFLPRFEIWKTNNRWITHWNKKHANASSFTVAINQFGDLTSDEFNRL 156

Query: 101 RNGYKRRLPSVRSSETTDVSFRYEN-ASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVA 159
            NG      + ++SE  +   ++ N A +P S DWR+KG V+ VKDQG CG CWAFS   
Sbjct: 157 YNGL-HVFSAPKASEKVERPRQWANTAGIPESGDWRQKGVVSRVKDQGMCGSCWAFSTTG 215

Query: 160 AMEGINHITTRKLTSLSEQELVDCDTSGEDQ-GCEGGLMDDAFEFIISNKGLATEAKYPY 218
           + EGIN ITT +L  LSEQ LVDC T+  D  GC GG MD+AF +II NKG+ +EA YPY
Sbjct: 216 STEGINAITTSRLVPLSEQNLVDCATAAYDNYGCNGGFMDNAFRYIIDNKGIDSEASYPY 275

Query: 219 KASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGV 278
            A+DG C          K    + +P  +E AL+ A A QP+SV IDA    FQFYS GV
Sbjct: 276 VAADGQCRFNPKTVYGGKGGTLKSLPKGDEKALLVAAARQPISVGIDAGRPSFQFYSKGV 335

Query: 279 FT-GQC-GTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCG 336
           +   +C  TEL+HGV  VG+G  + G  YWLVKNSWG TWG +GYI+M RD   K   CG
Sbjct: 336 YNEPECSSTELNHGVLIVGWGV-ERGQAYWLVKNSWGQTWGMDGYIKMSRD---KNNQCG 391

Query: 337 IAMQASYPT 345
           IA  ASYP+
Sbjct: 392 IATLASYPS 400


>gi|28932704|gb|AAO60046.1| midgut cysteine proteinase 3 [Rhipicephalus appendiculatus]
          Length = 334

 Score =  260 bits (665), Expect = 6e-67,   Method: Compositional matrix adjust.
 Identities = 149/304 (49%), Positives = 194/304 (63%), Gaps = 10/304 (3%)

Query: 46  YGRVYRDNAEKEMRFKIFKENVEYIASFNNK-ARNK-PYKLGINEFADQTNEEFRAPRNG 103
           +G+ Y  + E+  R KI+ EN   IA  N K A+++  YKL +NEF D  + EF + RNG
Sbjct: 34  HGKEYDSDTEEYYRLKIYMENRLKIARHNEKYAKSQVSYKLAMNEFGDLLHHEFVSTRNG 93

Query: 104 YKRRLPSVRSSETTDVSFR-YENASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAME 162
           +KR         +  +    +E+  +P ++DWRKKGAVT VK+QGQCG CWAFS   ++E
Sbjct: 94  FKRNYRDTPREGSFFIEPEGFEDLHLPKTVDWRKKGAVTPVKNQGQCGSCWAFSTTGSLE 153

Query: 163 GINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASD 222
           G +    RKL SLSEQ LVDC     + GC GGLMD+AF++I +NKG+ TE  YPY A+D
Sbjct: 154 GQHFRKMRKLVSLSEQNLVDCMQKLGNNGCGGGLMDNAFKYIKANKGIDTELSYPYNATD 213

Query: 223 GSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVF-TG 281
           G C+ K++    A  +G+ED+P+ +E +        PVSVAIDAS   FQFYS GV    
Sbjct: 214 GVCHFKKSG-VGATATGFEDIPARDENSWDAVAPVGPVSVAIDASHESFQFYSEGVLDEP 272

Query: 282 QCGT-ELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQ 340
           +C + +LDHGV  VGYGT  DG  YWLVKNSWGTTWG+ GYI M R+   K+  CGIA  
Sbjct: 273 ECSSDQLDHGVLVVGYGTK-DGQDYWLVKNSWGTTWGDEGYIYMTRN---KDNQCGIASS 328

Query: 341 ASYP 344
           ASYP
Sbjct: 329 ASYP 332


>gi|157278115|ref|NP_001098156.1| cathepsin L precursor [Oryzias latipes]
 gi|50251128|dbj|BAD27581.1| cathepsin L [Oryzias latipes]
          Length = 336

 Score =  260 bits (665), Expect = 7e-67,   Method: Compositional matrix adjust.
 Identities = 147/322 (45%), Positives = 190/322 (59%), Gaps = 17/322 (5%)

Query: 32  DATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFN--NKARNKPYKLGINEF 89
           D  +++  ++W   + + Y +  E   R  ++++N+  I   N  +      Y+LG+N F
Sbjct: 21  DPQLDQHWQLWKGWHSKNYHEKEEGWRRL-VWEKNLRKIELHNLEHSMGKHSYRLGMNHF 79

Query: 90  ADQTNEEFRAPRNGYKRRLPSVRS-SETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQ 148
            D T+EEFR   NGYKRR     S S   + +F       P ++DWR KG VT VKDQGQ
Sbjct: 80  GDMTHEEFRQIMNGYKRREQRKYSGSLFMEPNF----LEAPRAVDWRDKGYVTPVKDQGQ 135

Query: 149 CGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNK 208
           CG CWAFS   A+EG     T KL SLSEQ LVDC     ++GC GGLMD AF+++  N+
Sbjct: 136 CGSCWAFSTTGALEGQQFRKTGKLVSLSEQNLVDCSRPEGNEGCNGGLMDQAFQYVKDNQ 195

Query: 209 GLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVAN-QPVSVAIDAS 267
           GL +E  YPYK +D    +  A  SA   +G+ D+PS  E ALMKAVA+  PVSVAIDA 
Sbjct: 196 GLDSEDFYPYKGTDDQPCQYNAQYSAVNDTGFVDIPSGKERALMKAVASVGPVSVAIDAG 255

Query: 268 GSDFQFYSSGV-FTGQCGT-ELDHGVTAVGYGTAD---DGTKYWLVKNSWGTTWGENGYI 322
              FQFY SG+ F  +C + ELDHGV  VGYG      DG KYW+VKNSW   WG+ G+I
Sbjct: 256 HESFQFYQSGIYFEKECSSDELDHGVLVVGYGFEGEDVDGKKYWIVKNSWSEKWGDKGFI 315

Query: 323 RMQRDIDAKEGLCGIAMQASYP 344
            M +D   +   CGIA  ASYP
Sbjct: 316 YMAKD---RHNHCGIATAASYP 334


>gi|146216002|gb|ABQ10203.1| cysteine protease Cp5 [Actinidia deliciosa]
          Length = 509

 Score =  260 bits (665), Expect = 7e-67,   Method: Compositional matrix adjust.
 Identities = 139/319 (43%), Positives = 188/319 (58%), Gaps = 14/319 (4%)

Query: 37  ERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNN-KARNKPYKLGINEFADQTNE 95
           E  + W  ++G+VY+   E E +F+ F++N+ Y+   N  +  +  + +G+N+FAD +NE
Sbjct: 49  ELFKKWTEKHGKVYKHGQEVEKKFQNFRDNLRYVMEKNGERGASGGHLVGLNKFADMSNE 108

Query: 96  EFR------APRNGYKRRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQC 149
           EFR        +   KR     R       +        P S+DWRK G VTGVKDQG C
Sbjct: 109 EFREVYVSKVKKPTSKRMAIERRRQGKAAAAKAVAACDGPTSLDWRKYGIVTGVKDQGDC 168

Query: 150 GCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKG 209
           G CWAFS+  A+EGIN +    L SLSEQELVDCD++  + GCEGG MD AFE+++SN G
Sbjct: 169 GSCWAFSSTGAIEGINALANGDLISLSEQELVDCDST--NDGCEGGYMDYAFEWVMSNGG 226

Query: 210 LATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGS 269
           + TE  YPY   DG+CN  +    A  I GYEDV +  E+AL  AV  QP+SV ID    
Sbjct: 227 IDTETDYPYTGEDGTCNTTKEETKAVSIDGYEDV-AEEESALFCAVLKQPISVGIDGGAI 285

Query: 270 DFQFYSSGVF---TGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQR 326
           DFQ Y+ G++         ++DH V  VGYG A+ G +YW++KNSWGT WG  GY  ++R
Sbjct: 286 DFQLYTGGIYDGDCSDDPDDIDHAVLVVGYG-AESGEEYWIIKNSWGTDWGMKGYAYIKR 344

Query: 327 DIDAKEGLCGIAMQASYPT 345
           +     G+C I   ASYPT
Sbjct: 345 NTSKDYGVCAINAMASYPT 363


>gi|405966499|gb|EKC31777.1| Cathepsin L [Crassostrea gigas]
          Length = 331

 Score =  260 bits (664), Expect = 7e-67,   Method: Compositional matrix adjust.
 Identities = 150/343 (43%), Positives = 202/343 (58%), Gaps = 21/343 (6%)

Query: 8   NKLVLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENV 67
           N L++ A L +  +A    ++ L+   +     ++   + + Y  + E++MR  I+++NV
Sbjct: 2   NTLIVVASLCVTAFASPILNKDLDGDWV-----LYKQTHKKTYSQD-EEQMRRLIWEDNV 55

Query: 68  EYIASFNNKARN--KPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYEN 125
            YI   N  A      Y LG NE+AD T  EFRA  NGYK      +     D+     N
Sbjct: 56  NYIQKHNLAADRGEHTYWLGQNEYADMTIFEFRAIMNGYKMSANRTKG----DLYMSPSN 111

Query: 126 -ASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCD 184
              +P S+DWRK+G VT +K+QG CG CW+FSA  ++EG +   ++KL SLSEQ LVDC 
Sbjct: 112 IGDLPDSVDWRKEGYVTDIKNQGHCGSCWSFSATGSLEGQHFKASKKLVSLSEQNLVDCS 171

Query: 185 TSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVP 244
               + GC+GGLMD+AF +I SNKG+ TE  YPY A +G C+ K  N  A   +GY D+P
Sbjct: 172 KKEGNHGCQGGLMDNAFRYIESNKGIDTEESYPYTAKNGFCHFKAENVGATD-TGYVDIP 230

Query: 245 SNNEAALMKAVAN-QPVSVAIDASGSDFQFYSSGVFT--GQCGTELDHGVTAVGYGTADD 301
              E  L +AVA   P+SV IDA    FQ Y  GV++      ++LDHGV AVGYGT + 
Sbjct: 231 HMQEDKLQEAVATVGPISVGIDAGHKSFQLYREGVYSEPACSSSKLDHGVLAVGYGT-ES 289

Query: 302 GTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
           G  YWLVKNSWGT+WG  GY+ M R+   K  +CGIA QASYP
Sbjct: 290 GDDYWLVKNSWGTSWGMQGYVMMARN---KHNMCGIATQASYP 329


>gi|118123|sp|P25782.1|CYSP2_HOMAM RecName: Full=Digestive cysteine proteinase 2; Flags: Precursor
 gi|11053|emb|CAA45128.1| cysteine proteinase preproenzyme [Homarus americanus]
          Length = 323

 Score =  260 bits (664), Expect = 7e-67,   Method: Compositional matrix adjust.
 Identities = 146/310 (47%), Positives = 191/310 (61%), Gaps = 14/310 (4%)

Query: 40  EMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNK--PYKLGINEFADQTNEEF 97
           E +  +YGR Y D  E   R  IF++N +YI  FN K  N    + L +N+F D T EEF
Sbjct: 21  EHFKGKYGRQYVDAEEDSYRRVIFEQNQKYIEEFNKKYENGEVTFNLAMNKFGDMTLEEF 80

Query: 98  RAPRNGYKRRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQCGCCWAFSA 157
            A   G   R    RS+  +    + E       +DWR KGAVT VKDQGQCG CWAFS 
Sbjct: 81  NAVMKGNIPR----RSAPVSVFYPKKETGPQATEVDWRTKGAVTPVKDQGQCGSCWAFST 136

Query: 158 VAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYP 217
             ++EG + + T  L SL+EQ+LVDC      QGC GG M+DAF++I +N G+ TEA YP
Sbjct: 137 TGSLEGQHFLKTGSLISLAEQQLVDCSRPYGPQGCNGGWMNDAFDYIKANNGIDTEAAYP 196

Query: 218 YKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVAN-QPVSVAIDASGSDFQFYSS 276
           Y+A DGSC + ++N  AA  SG+ ++ S +E  L +AV +  P+SV IDA+ S FQFYSS
Sbjct: 197 YEARDGSC-RFDSNSVAATCSGHTNIASGSETGLQQAVRDIGPISVTIDAAHSSFQFYSS 255

Query: 277 GV-FTGQCG-TELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGL 334
           GV +   C  + LDH V AVGYG+ + G  +WLVKNSW T+WG+ GYI+M R+   +   
Sbjct: 256 GVYYEPSCSPSYLDHAVLAVGYGS-EGGQDFWLVKNSWATSWGDAGYIKMSRN---RNNN 311

Query: 335 CGIAMQASYP 344
           CGIA  ASYP
Sbjct: 312 CGIATVASYP 321


>gi|354507493|ref|XP_003515790.1| PREDICTED: cathepsin L1-like [Cricetulus griseus]
 gi|344259154|gb|EGW15258.1| Cathepsin L1 [Cricetulus griseus]
          Length = 333

 Score =  260 bits (664), Expect = 7e-67,   Method: Compositional matrix adjust.
 Identities = 148/323 (45%), Positives = 199/323 (61%), Gaps = 19/323 (5%)

Query: 32  DATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKAR--NKPYKLGINEF 89
           + T N +   W + Y R+Y  N E+E R  ++++N++ I   N +       Y + +N F
Sbjct: 22  NQTFNAQWHKWKSTYRRLYGTN-EEEWRRAVWEKNMKMIELHNGEYSEGKHGYTMEMNAF 80

Query: 90  ADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQC 149
            D TNEEFR   NGYK +    R  +        +   +P S+DWR+KG VT VK+QGQC
Sbjct: 81  GDMTNEEFRQLVNGYKHQ--KHRKGKVFQEPLMLQ---LPKSVDWREKGCVTPVKNQGQC 135

Query: 150 GCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKG 209
           G CWAFSA  A+EG   + T  L SLSEQ LVDC  +  +QGC GGLMD AF+++++NKG
Sbjct: 136 GSCWAFSACGALEGQMCLKTGVLVSLSEQNLVDCSQAEGNQGCNGGLMDFAFQYVLNNKG 195

Query: 210 LATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVAN-QPVSVAIDASG 268
           L +E  YPY+A DG+C K +   +AA  +GY D+P   E ALMKAVA   P+++AIDAS 
Sbjct: 196 LDSEESYPYEAKDGTC-KYKPEFAAANDTGYVDIPQ-LEKALMKAVATVGPIAIAIDASH 253

Query: 269 SDFQFYSSGV-FTGQCGT-ELDHGVTAVGY---GTADDGTKYWLVKNSWGTTWGENGYIR 323
             FQFYSSG+ +   C + ELDHGV  VGY   GT  +  KYW+VKNSWG++WG  G+  
Sbjct: 254 PSFQFYSSGIYYEPNCSSKELDHGVLVVGYGFEGTDSNKKKYWIVKNSWGSSWGMGGFFH 313

Query: 324 MQRDIDAKEGLCGIAMQASYPTA 346
           + +D   K   CG+A  ASYPT 
Sbjct: 314 IAKD---KNNHCGVATAASYPTV 333


>gi|309380130|gb|ADO65978.1| cathepsin L [Eriocheir sinensis]
 gi|309380134|gb|ADO65980.1| cathepsin L [Eriocheir sinensis]
          Length = 325

 Score =  260 bits (664), Expect = 7e-67,   Method: Compositional matrix adjust.
 Identities = 147/321 (45%), Positives = 195/321 (60%), Gaps = 21/321 (6%)

Query: 34  TMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNK--PYKLGINEFAD 91
           T+NE  + + A+YG+ YR   E   R  ++++N E+I S N +  N    + L +N+F D
Sbjct: 18  TLNEWQQ-FKARYGKQYRSTKEDSYRQSVYEQNQEFINSHNEQYENGLVSFTLAMNQFGD 76

Query: 92  QTNEEFRAPRNGY---KRRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQ 148
            T EE  A  NG+    +++P           ++     +P ++DWR KGAVT VKDQ  
Sbjct: 77  MTTEEINAAMNGFLSAGKKVPR-------GTMYQPLVDELPDTVDWRDKGAVTPVKDQKA 129

Query: 149 CGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNK 208
           CG CWAFSA  ++EG + ++T KL SLSEQ LVDC     + GC GGLMD+AF +I  N 
Sbjct: 130 CGSCWAFSATGSLEGQHFLSTGKLVSLSEQNLVDCSDKYGNFGCGGGLMDNAFRYIKDNN 189

Query: 209 GLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQ-PVSVAIDAS 267
           G+ TE  YPY+A +G C     N   A +S Y D+   +E  L KAVA + PVSVAIDAS
Sbjct: 190 GIDTEESYPYEAKNGPCRFNSDN-VGATLSSYVDIQHGSEDDLQKAVAEKGPVSVAIDAS 248

Query: 268 GSDFQFYSSGVFTGQ-CGTE-LDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQ 325
            S F FYS G++  + C +  LDHGV AVGYGT DD + YWLVKNSW  TWG++GYI+M 
Sbjct: 249 TSTFHFYSRGIYYDEKCSSSFLDHGVLAVGYGT-DDSSDYWLVKNSWNETWGDSGYIKMS 307

Query: 326 RDIDAKEGLCGIAMQASYPTA 346
           R+   +   CGIA QASYP  
Sbjct: 308 RN---RNNNCGIASQASYPVV 325


>gi|403300975|ref|XP_003941187.1| PREDICTED: cathepsin L1-like isoform 1 [Saimiri boliviensis
           boliviensis]
 gi|403300977|ref|XP_003941188.1| PREDICTED: cathepsin L1-like isoform 2 [Saimiri boliviensis
           boliviensis]
 gi|403300979|ref|XP_003941189.1| PREDICTED: cathepsin L1-like isoform 3 [Saimiri boliviensis
           boliviensis]
          Length = 333

 Score =  260 bits (664), Expect = 7e-67,   Method: Compositional matrix adjust.
 Identities = 155/347 (44%), Positives = 207/347 (59%), Gaps = 23/347 (6%)

Query: 8   NKLVLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENV 67
           N  ++ A   LG+    S + T N  ++  +   W A + R+Y  N E+E R  ++++N+
Sbjct: 2   NPTLILAAFCLGL---ASAALTFNH-SLEAQWIKWKAMHNRLYGKN-EEEWRRAVWEKNM 56

Query: 68  EYIASFNNKARN--KPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYEN 125
           + I   N++       + + +N F D TNEEFR   NG++ R P  R+ +       +E 
Sbjct: 57  KTIELHNHEYNQGKHSFTMAMNTFGDMTNEEFRQVMNGFQNRKP--RNGKVFQEPLLHE- 113

Query: 126 ASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDT 185
              P S+DWR+KG VT VK+QGQCG CWAFSA  A+EG     T KL SLSEQ LVDC  
Sbjct: 114 --APRSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSG 171

Query: 186 SGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPS 245
              +QGC GGLMD AF+++  N GL +E  YPY+A++ SC K     S A  +G+ D+P 
Sbjct: 172 PQGNQGCNGGLMDYAFQYVQENGGLDSEESYPYEATEESC-KYNPKYSVANDTGFVDIPK 230

Query: 246 NNEAALMKAVAN-QPVSVAIDASGSDFQFYSSGV-FTGQCGTE-LDHGVTAVGYG---TA 299
             E ALMKAVA   P+SVAIDA    FQFY  G+ F  +C +E +DHGV  VGYG   T 
Sbjct: 231 -LEKALMKAVATVGPISVAIDAGHESFQFYKEGIYFEPECSSEDMDHGVLVVGYGFERTG 289

Query: 300 DDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPTA 346
            D +KYWLVKNSWG  WG +GYI+M +D   ++  CGIA  ASYPT 
Sbjct: 290 SDNSKYWLVKNSWGEEWGMDGYIKMAKD---RKNHCGIASAASYPTV 333


>gi|357167707|ref|XP_003581294.1| PREDICTED: actinidain-like [Brachypodium distachyon]
          Length = 358

 Score =  260 bits (664), Expect = 8e-67,   Method: Compositional matrix adjust.
 Identities = 143/325 (44%), Positives = 195/325 (60%), Gaps = 20/325 (6%)

Query: 34  TMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQT 93
           TM  RHE WMA++GR Y D  EK  R ++F  N  ++ + N +A N+ Y LG+N+F+D T
Sbjct: 37  TMASRHERWMARFGRSYTDAGEKARRQEVFGANARHVDAVN-RAGNRTYTLGLNQFSDLT 95

Query: 94  NEEFRAPRNGYKRR-------LPSVR-SSETTDVSFRYENASVPASIDWRKKGAVTGVKD 145
           + EF     GY R        LP      + T + +      +P S+DWR KGAVT +K+
Sbjct: 96  DHEFLQQHLGYGRHHGQRGLLLPEEEVMPKATALGY---GQDMPYSVDWRAKGAVTEIKN 152

Query: 146 QGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFII 205
           Q  CG CWAF+AVAA EG+  I T  L S+SEQ+++DC  +G+   C+ G + DA  +++
Sbjct: 153 QRSCGSCWAFAAVAATEGLVKIATGNLISMSEQQVLDC--TGDRSSCDSGYISDALRYVV 210

Query: 206 SNKGLATEAKYPYKASDGSC-NKKEANP-SAAKISGYEDVPSN-NEAALMKAVANQPVSV 262
           ++ GL  EA Y Y    G+C +++ A P SAA + G      N +E AL    A QPV+V
Sbjct: 211 TSGGLQREAAYAYTGQKGACGSRRFARPNSAASVGGVHMATLNGDEGALQGLAARQPVAV 270

Query: 263 AIDASGSDFQFYSSGVFTG--QCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENG 320
            ++AS  DF+ YSSGV+ G   CG EL+H +T VGYGT +   +YWLVKN WGT WGENG
Sbjct: 271 IVEASEPDFRHYSSGVYAGSASCGRELNHALTVVGYGTENGAGEYWLVKNQWGTWWGENG 330

Query: 321 YIRMQRDIDAKEGLCGIAMQASYPT 345
           Y+R+ R   A    CGIA  A YPT
Sbjct: 331 YMRVARRNGAGAN-CGIASVAFYPT 354


>gi|391338870|ref|XP_003743778.1| PREDICTED: cathepsin L-like isoform 1 [Metaseiulus occidentalis]
 gi|391338872|ref|XP_003743779.1| PREDICTED: cathepsin L-like isoform 2 [Metaseiulus occidentalis]
 gi|391338874|ref|XP_003743780.1| PREDICTED: cathepsin L-like isoform 3 [Metaseiulus occidentalis]
          Length = 331

 Score =  259 bits (663), Expect = 1e-66,   Method: Compositional matrix adjust.
 Identities = 149/343 (43%), Positives = 197/343 (57%), Gaps = 22/343 (6%)

Query: 9   KLVLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVE 68
           K ++ A+LV    A  +  +  +    N     +   + + Y  +  +  R KIF +N  
Sbjct: 2   KFLILAVLVGAASAALTLEQLFDAEWQN-----FKVHHNKKYEGSTVEAFRKKIFLQNTH 56

Query: 69  YIASFNNKARN--KPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVS--FRYE 124
            IA  N K       YKL +N+F D  + EF +  NG       +RS+ T   S     E
Sbjct: 57  LIARHNIKHAKGETTYKLKMNQFGDMLHHEFVSTMNGL------LRSNRTYFGSTWIEPE 110

Query: 125 NASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCD 184
           + S+P S+DWR+KGAVT VK+QG CG CW+FS   A+EG     T +L SLSEQ L+DC 
Sbjct: 111 SVSLPKSVDWREKGAVTPVKNQGHCGSCWSFSTTGALEGQLFRKTGELVSLSEQNLIDCS 170

Query: 185 TSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVP 244
           TS  + GC GGLMD+AF +I  N G+ TE  YPY+   G C +     SA + +G+ D+P
Sbjct: 171 TSYGNNGCGGGLMDNAFTYIKENHGIDTEESYPYEGKQGKC-RYHKEDSAGRDTGFVDIP 229

Query: 245 SNNEAALMKAVAN-QPVSVAIDASGSDFQFYSSGVFT-GQCGTE-LDHGVTAVGYGTADD 301
           S NE AL KA+A   PVSVAIDAS   FQFY  GV+    C +  LDHGV AVGYGT DD
Sbjct: 230 SGNERALAKALATIGPVSVAIDASHESFQFYHEGVYNPPDCDSHSLDHGVLAVGYGTTDD 289

Query: 302 GTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
           G  Y+++KNSWG  WG+ GY+ M R+    +  CG+A QASYP
Sbjct: 290 GQDYYIIKNSWGERWGQEGYVLMARN---SKNECGVATQASYP 329


>gi|41688064|dbj|BAD08618.1| cathepsin L preproprotein [Cyprinus carpio]
          Length = 337

 Score =  259 bits (663), Expect = 1e-66,   Method: Compositional matrix adjust.
 Identities = 156/353 (44%), Positives = 201/353 (56%), Gaps = 27/353 (7%)

Query: 1   MAMILLENKLVLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRF 60
           M + L    L L+A+      AP     TL D  ++   E W   +G+ Y +  E   R 
Sbjct: 1   MRVFLAAFALCLSAVFA----AP-----TL-DKQLDNHWEQWKNWHGKKYHEKEEGWRRM 50

Query: 61  KIFKENVEYIASFN--NKARNKPYKLGINEFADQTNEEFRAPRNGYK-RRLPSVRSSETT 117
            ++++N++ I   N  +      Y+LG+N F D T+EEFR   NGYK ++    R S   
Sbjct: 51  -VWEKNLQKIELHNLEHSMGTHTYRLGMNRFGDMTHEEFRQVMNGYKHKKERRFRGSLFM 109

Query: 118 DVSFRYENASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSE 177
           + +F      VP S+DWR+KG VT VKDQG+CG CWAFS   AMEG     T KL SLSE
Sbjct: 110 EPNF----LEVPNSLDWREKGYVTPVKDQGECGSCWAFSTTGAMEGQMFRKTGKLVSLSE 165

Query: 178 QELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKI 237
           Q LVDC     ++GC GGLMD AF++I    GL +E  YPY  +D      +   SAA  
Sbjct: 166 QNLVDCSRPEGNEGCNGGLMDQAFQYIKDQNGLDSEESYPYVGTDDQPCHYDPKYSAAND 225

Query: 238 SGYEDVPSNNEAALMKAVAN-QPVSVAIDASGSDFQFYSSGV-FTGQCGT-ELDHGVTAV 294
           +G+ D+PS  E ALMKA+A   PVSVAIDA    FQFY SG+ +  +C + ELDHGV AV
Sbjct: 226 TGFVDIPSGKEHALMKAIAAVGPVSVAIDAGHESFQFYQSGIYYEKECSSEELDHGVLAV 285

Query: 295 GYGTAD---DGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
           GYG      DG KYW+VKNSW   WG+ GY+ M +D   +   CGIA  ASYP
Sbjct: 286 GYGFEGEDVDGKKYWIVKNSWSENWGDKGYVYMAKD---RHNHCGIATAASYP 335


>gi|449524450|ref|XP_004169236.1| PREDICTED: vignain-like [Cucumis sativus]
          Length = 283

 Score =  259 bits (663), Expect = 1e-66,   Method: Compositional matrix adjust.
 Identities = 134/291 (46%), Positives = 186/291 (63%), Gaps = 17/291 (5%)

Query: 59  RFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTD 118
           RFK+FK+N +++   N+    K  KL +N+FAD +++EF            ++ +     
Sbjct: 4   RFKVFKDNAKHVFKVNHMG--KSLKLKLNQFADMSDDEFSKTYGSNITYYKNLHAKVGGR 61

Query: 119 VS-FRYENAS-VPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLS 176
           V  F YE A+ +P+SIDWRKKGA        +  CCWAF+AVAA+E I+ I T +L SLS
Sbjct: 62  VGGFMYERATNIPSSIDWRKKGA--------RRMCCWAFAAVAAVESIHQIRTNELVSLS 113

Query: 177 EQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAK 236
           EQE+VDCD   +  GC GG    AFEFI+ N G+  E  YPY A DG C ++  N     
Sbjct: 114 EQEVVDCDY--KVGGCRGGDYISAFEFIMENGGITVENNYPYYAGDGYCRRRGPNNERVT 171

Query: 237 ISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQ--CGTELDHGVTAV 294
           I GYE+VP NNE ALMKAVA+QPV+V+I + GSDF+FY  G+FT +  CG  +DH V  V
Sbjct: 172 IDGYENVPRNNEYALMKAVAHQPVAVSIASRGSDFKFYGEGMFTEENFCGIRIDHTVVVV 231

Query: 295 GYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPT 345
           GYG+ ++G  YW+++N +GT WG NGY++MQR   + +G+CG+AM  ++P 
Sbjct: 232 GYGSDEEG-DYWIIRNQYGTQWGMNGYMKMQRGTRSPQGVCGMAMYPAFPV 281


>gi|296189340|ref|XP_002742739.1| PREDICTED: cathepsin L1 [Callithrix jacchus]
          Length = 333

 Score =  259 bits (663), Expect = 1e-66,   Method: Compositional matrix adjust.
 Identities = 148/313 (47%), Positives = 194/313 (61%), Gaps = 19/313 (6%)

Query: 42  WMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARN--KPYKLGINEFADQTNEEFRA 99
           W A + R+Y  N E+E R  ++++N++ I   N++       + + +N F D TNEEFR 
Sbjct: 32  WKAMHNRLYGMN-EEEWRRAVWEKNMKMIELHNHEYNQGKHSFTMAMNAFGDMTNEEFRQ 90

Query: 100 PRNGYKRRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVA 159
             NG++ R P  R+ +       +E    P S+DWR+KG VT VK+QGQCG CWAFSA  
Sbjct: 91  VMNGFQNRKP--RNGKVFQEPLFHE---APRSVDWREKGYVTPVKNQGQCGSCWAFSATG 145

Query: 160 AMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYK 219
           A+EG     T KL SLSEQ LVDC     +QGC+GGLMD AF+++  N GL +E  YPY+
Sbjct: 146 ALEGQMFRKTGKLVSLSEQNLVDCSGPQGNQGCDGGLMDYAFQYVQENGGLDSEESYPYE 205

Query: 220 ASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVAN-QPVSVAIDASGSDFQFYSSGV 278
           A++ SC K     S A  +G+ D+P   E ALMKAVA   P+SVAIDA    FQFY  G+
Sbjct: 206 ATEESC-KYNPEYSVANDTGFVDIPK-LEKALMKAVATVGPISVAIDAGHESFQFYKEGI 263

Query: 279 -FTGQCGTE-LDHGVTAVGYG---TADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEG 333
            F  +C +E +DHGV  VGYG   T  D +KYWLVKNSWG  WG +GYI+M +D   ++ 
Sbjct: 264 YFEPECSSEDMDHGVLVVGYGFERTGSDNSKYWLVKNSWGEKWGMDGYIKMAKD---RKN 320

Query: 334 LCGIAMQASYPTA 346
            CGIA  ASYPT 
Sbjct: 321 HCGIASAASYPTV 333


>gi|354549232|gb|AER27707.1| putative cysteine protease [Phytophthora sp. SH-2011]
          Length = 533

 Score =  259 bits (663), Expect = 1e-66,   Method: Compositional matrix adjust.
 Identities = 141/316 (44%), Positives = 184/316 (58%), Gaps = 7/316 (2%)

Query: 33  ATMNERHEM--WMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFA 90
           + +   HE   WM  +G  + D  E   R + +  N  YI   N +       LG N F+
Sbjct: 20  SPLEYEHEFSAWMGAHGVTFSDALEFARRLENYIVNDMYIMEHNAENAWTGVTLGHNAFS 79

Query: 91  DQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQCG 150
             + +EF+    G       +     + V   + +  VP+++DW  KG VT VK+QG CG
Sbjct: 80  HMSFDEFKFKMTGLVLPEGYLEQRLASRVDGLWSDVEVPSAVDWVDKGGVTPVKNQGMCG 139

Query: 151 CCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGL 210
            CWAFS   A+EG   +++ KL SLSEQELVDCD +G D GC GGLMD AF++I  + G+
Sbjct: 140 SCWAFSTTGAVEGATFVSSGKLPSLSEQELVDCDHNG-DMGCNGGLMDHAFQWIEDHGGI 198

Query: 211 ATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSD 270
            +E  Y YKA    C + +   S  K++G++DV   +E AL  AVA QPVSVAI+A    
Sbjct: 199 CSEDDYEYKAKAQVCRECD---SVVKVTGFQDVNPQDEHALKVAVAQQPVSVAIEADQKA 255

Query: 271 FQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDA 330
           FQFY SGVF   CGT LDHGV AVGYG  D+G K+W VKNSWG +WGE GYIR+ R+ + 
Sbjct: 256 FQFYKSGVFNLTCGTRLDHGVLAVGYGN-DNGHKFWKVKNSWGASWGEQGYIRLAREENG 314

Query: 331 KEGLCGIAMQASYPTA 346
             G CGIA   SYP A
Sbjct: 315 PAGQCGIASVPSYPFA 330


>gi|290462225|gb|ADD24160.1| Cathepsin L [Lepeophtheirus salmonis]
          Length = 334

 Score =  259 bits (662), Expect = 1e-66,   Method: Compositional matrix adjust.
 Identities = 149/341 (43%), Positives = 205/341 (60%), Gaps = 19/341 (5%)

Query: 14  AILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASF 73
           +IL+L V    + + +  D  +++  E W   + + Y  + E+++R KIF EN   I+  
Sbjct: 5   SILLLSVIISTASAVSFFDVVLSDW-ESWKLTHQKGYDSSVEEKLRLKIFMENSLRISRH 63

Query: 74  NNKARN--KPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETT--DVSFRYENASVP 129
           N +A      Y + +N + D  + EF A  NGY      + +++TT        +N ++P
Sbjct: 64  NAEAIQGRHTYFMKMNHYGDLLHHEFVAMVNGY------IYNNKTTLGGTFIPSKNINLP 117

Query: 130 ASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGED 189
             +DWR++GAVT VK+QGQCG CW+FSA  ++EG +   T KL SLSEQ LVDC     +
Sbjct: 118 EHVDWREEGAVTPVKNQGQCGSCWSFSATGSLEGQDFRKTGKLISLSEQNLVDCSRKYGN 177

Query: 190 QGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEA 249
            GCEGGLMD AF++I  N G+ TEA YPY+  DG C+    N   + I G+ D+   +E 
Sbjct: 178 NGCEGGLMDYAFKYIQDNNGIDTEASYPYEGIDGHCHYDPKNKGGSDI-GFVDIKKGSEK 236

Query: 250 ALMKAVAN-QPVSVAIDASGSDFQFYSSGVFT-GQCGTE-LDHGVTAVGYGTAD-DGTKY 305
            L KA+A   P+SVAIDAS   FQFYS GV++  +C  E LDHGV AVGYGT +  G  Y
Sbjct: 237 DLQKALATVGPISVAIDASHMSFQFYSHGVYSEKKCSPENLDHGVLAVGYGTDEVTGEDY 296

Query: 306 WLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPTA 346
           WLVKNSW   WGE+GYI+M R+   K+ +CGIA  ASYP  
Sbjct: 297 WLVKNSWSEKWGEDGYIKMARN---KDNMCGIASSASYPVV 334


>gi|299507656|gb|ADJ21807.1| cathepsin L [Oplegnathus fasciatus]
          Length = 336

 Score =  259 bits (661), Expect = 1e-66,   Method: Compositional matrix adjust.
 Identities = 149/342 (43%), Positives = 198/342 (57%), Gaps = 17/342 (4%)

Query: 12  LAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIA 71
           +  + VL V    + S    D  ++E  ++W + + + Y +  E   R  ++++N++ I 
Sbjct: 1   MLPVAVLAVCLSAALSAPSLDPQLDEHWDLWKSWHTKKYHEKEEGWRRM-VWEKNLKKIE 59

Query: 72  SFN--NKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLP-SVRSSETTDVSFRYENASV 128
             N  +      Y+LG+N F D T+EEFR   NGYKR+     + S   + +F       
Sbjct: 60  LHNLEHSMGEHTYRLGMNHFGDMTHEEFRQIMNGYKRKSERKFKGSLFMEPNF----LEA 115

Query: 129 PASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGE 188
           P S+DWR  G VT VKDQGQCG CWAFS   AMEG +   T KL SLSEQ LVDC     
Sbjct: 116 PRSVDWRDNGYVTPVKDQGQCGSCWAFSTTGAMEGQHFRKTGKLVSLSEQNLVDCSRPEG 175

Query: 189 DQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNE 248
           ++GC GGLMD AF++I  N+GL +E  YPY  +D      +   ++A  +G+ D+PS  E
Sbjct: 176 NEGCNGGLMDQAFQYIKDNQGLDSEDSYPYLGTDDQPCHYDPKYNSANDTGFIDIPSGKE 235

Query: 249 AALMKAVAN-QPVSVAIDASGSDFQFYSSGV-FTGQCGT-ELDHGVTAVGYGTAD---DG 302
            ALMKAVA   PVSVAIDA    FQFY SG+ +  +C + ELDHGV  VGYG      DG
Sbjct: 236 RALMKAVAAVGPVSVAIDAGHESFQFYQSGIYYEKECSSEELDHGVLVVGYGFEGEDVDG 295

Query: 303 TKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
            KYW+VKNSW   WG+ GYI M +D   ++  CGIA  ASYP
Sbjct: 296 KKYWIVKNSWSEKWGDKGYIYMAKD---RKNHCGIATAASYP 334


>gi|401397136|ref|XP_003879989.1| cathepsin L, related [Neospora caninum Liverpool]
 gi|325114397|emb|CBZ49954.1| cathepsin L, related [Neospora caninum Liverpool]
          Length = 415

 Score =  259 bits (661), Expect = 2e-66,   Method: Compositional matrix adjust.
 Identities = 133/284 (46%), Positives = 178/284 (62%), Gaps = 8/284 (2%)

Query: 44  AQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNG 103
           A YG+ Y    E + R+ IFK N+ YI + N +  +  Y L +N F D + EEFR    G
Sbjct: 124 ATYGKSYATEEETQKRYAIFKNNLAYIHTHNQQGYS--YSLKMNHFGDLSREEFRRKYLG 181

Query: 104 Y--KRRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAM 161
           Y   R L S      T++  +   + VP+++DWR+KG VT VKDQ  CG CWAFSA  A+
Sbjct: 182 YNKSRNLKSNNLGVATEL-LKVSPSDVPSAVDWREKGCVTPVKDQRDCGSCWAFSATGAL 240

Query: 162 EGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKAS 221
           EG +   T +L SLSEQELVDC  +  +QGC GG M+DAF++++ + GL +E  YPY A 
Sbjct: 241 EGAHCAKTGELLSLSEQELVDCSLAEGNQGCSGGEMNDAFQYVVDSGGLCSEEGYPYLAR 300

Query: 222 DGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTG 281
           DG C  K A      ISG++DVP  +E A+  A+A+ PVS+AI+A    FQFY  GVF  
Sbjct: 301 DGEC--KRACKKVVTISGFKDVPRKSETAMKAALAHSPVSIAIEADQLPFQFYHEGVFDA 358

Query: 282 QCGTELDHGVTAVGYGTADDGTK-YWLVKNSWGTTWGENGYIRM 324
            CGT+LDHGV  VGYGT  +  K +W++KNSWG+ WG +GY+ M
Sbjct: 359 SCGTDLDHGVLLVGYGTDKETKKDFWIMKNSWGSGWGRDGYMYM 402


>gi|307175095|gb|EFN65237.1| Cathepsin L [Camponotus floridanus]
          Length = 372

 Score =  259 bits (661), Expect = 2e-66,   Method: Compositional matrix adjust.
 Identities = 144/307 (46%), Positives = 182/307 (59%), Gaps = 12/307 (3%)

Query: 45  QYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKP--YKLGINEFADQTNEEFRAPRN 102
            + +VY+   E+  R KIF +N   I   N K   K   YKLG+N++ D  + E     N
Sbjct: 69  HHKKVYKSPIEEGYRMKIFLDNKRKIVEHNRKYEMKEVNYKLGMNKYGDMLHHELINTLN 128

Query: 103 GYKRRLPSVRSSETTDVSF-RYENASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAM 161
           G+ + + +V   +    +F    N  +P S+DWRKKGAVT +KDQGQCG CWAFS+  A+
Sbjct: 129 GFNKSV-TVSEEQLIGATFIEPANVELPKSVDWRKKGAVTAIKDQGQCGSCWAFSSTGAL 187

Query: 162 EGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKAS 221
           EG +   +  L SLSEQ L+DC     + GC GGLMD AF +I  NKGL TE  YPY+A 
Sbjct: 188 EGQHFRQSGVLVSLSEQNLIDCSGKYGNNGCNGGLMDYAFRYIKENKGLDTEKSYPYEAE 247

Query: 222 DGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVAN-QPVSVAIDASGSDFQFYSSGV-F 279
           +  C     N  A+ + G+ D+P  +E  L  AVA   P+SVAIDAS   F FYS GV +
Sbjct: 248 NDQCRYNPKNSGASDV-GFVDIPEGDEDKLKAAVATIGPISVAIDASHESFHFYSEGVYY 306

Query: 280 TGQCG-TELDHGVTAVGYGT-ADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGI 337
             +C    LDHGV  VGYGT +  G  YWLVKNSWG TWGE GYI+M R+   KE  CGI
Sbjct: 307 EPECSPANLDHGVLIVGYGTDSGTGEDYWLVKNSWGETWGEKGYIKMARN---KENHCGI 363

Query: 338 AMQASYP 344
           A  ASYP
Sbjct: 364 ASSASYP 370


>gi|392884266|gb|AFM90965.1| cathepsin L [Callorhinchus milii]
          Length = 338

 Score =  259 bits (661), Expect = 2e-66,   Method: Compositional matrix adjust.
 Identities = 148/345 (42%), Positives = 200/345 (57%), Gaps = 24/345 (6%)

Query: 10  LVLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEY 69
           +VL+  L  G+ AP        D  ++   E W + +G+ Y +  E+  R  +++E++  
Sbjct: 6   VVLSLCLAGGLAAPSL------DPGLDTHWEQWKSWHGKSY-EQKEETWRRMVWEEHLRV 58

Query: 70  IASFN--NKARNKPYKLGINEFADQTNEEFRAPRNGYKRR--LPSVRSSETTDVSFRYEN 125
           I   N  +      ++LG+N F D  NEEFR   NGYK +     ++ S   + +F    
Sbjct: 59  IEIHNLEHSLGKHSFRLGMNHFGDMPNEEFRQLMNGYKYKQTHKKLQGSHFLEPNF---- 114

Query: 126 ASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDT 185
             VP  +DWR +G VT VKDQGQCG CWAFS   A+EG +   T +L SLSEQ LV+C  
Sbjct: 115 LEVPKHVDWRDEGYVTPVKDQGQCGSCWAFSTTGALEGQHFRRTGQLVSLSEQNLVECSK 174

Query: 186 SGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPS 245
              ++GC GGLMD AF+++  N G+ +E  YPY  +D +        +AA  +G+ D+PS
Sbjct: 175 PEGNEGCNGGLMDQAFQYVKDNGGIDSEDSYPYVGTDDTPCHYNPQYNAANDTGFVDIPS 234

Query: 246 NNEAALMKAVAN-QPVSVAIDASGSDFQFYSSGV-FTGQC-GTELDHGVTAVGYGTAD-- 300
             E ALMKA+A   PVSVAIDA  + FQFY SG+ F  +C  T+LDHGV  VGYG     
Sbjct: 235 GKERALMKAIAAVGPVSVAIDAGHTSFQFYQSGIYFEAECSSTDLDHGVLVVGYGVEKRD 294

Query: 301 -DGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
            DG KYW+VKNSW   WG+NGYI M +D   K+  CGIA  ASYP
Sbjct: 295 TDGKKYWIVKNSWSEKWGQNGYILMAKD---KDNHCGIATAASYP 336


>gi|449469176|ref|XP_004152297.1| PREDICTED: vignain-like [Cucumis sativus]
          Length = 340

 Score =  259 bits (661), Expect = 2e-66,   Method: Compositional matrix adjust.
 Identities = 137/319 (42%), Positives = 197/319 (61%), Gaps = 17/319 (5%)

Query: 31  NDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFA 90
           ++ ++ + ++ W + + R+ R+  E   RFKIF++N + +   N+    K  KL +N+FA
Sbjct: 33  SEKSLMQLYKRWSSHH-RISRNAHEMHKRFKIFQDNAKRVFKVNHMG--KSLKLRLNQFA 89

Query: 91  DQTNEEFRAPRNGYKRRLPSVRSSETTDVS-FRYENA-SVPASIDWRKKGAVTGVKDQGQ 148
           D +++EF            ++ +     V  F YE A ++P SIDWR+KGAV  +K+QG 
Sbjct: 90  DLSDDEFSMMYGSNITHYNNLHAKAGGRVGGFMYERAMNIPFSIDWREKGAVNAIKNQGL 149

Query: 149 CGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNK 208
           C       AVAA+E I+ I T +L SLSEQE+VDCD   +  GC GG  D AFEFI+ N 
Sbjct: 150 C-------AVAAVESIHQIKTNELVSLSEQEVVDCDY--KVGGCRGGNYDSAFEFIMQNG 200

Query: 209 GLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASG 268
           G+  E  YPY A +G C ++  N     I GYE VP NNE ALMKAVA+QPV+V++ +SG
Sbjct: 201 GITIEENYPYFAGNGYCRRRGPNSERVTIDGYECVPQNNEYALMKAVAHQPVAVSVASSG 260

Query: 269 SDFQFYSSGVFT--GQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQR 326
           SDF+FY  G+      CG  +DH V  VGYG+ ++G  YW+++N +GT WG NGY++MQR
Sbjct: 261 SDFRFYGEGMLREGSFCGYRIDHTVVVVGYGSDEEG-DYWIIRNQYGTQWGMNGYMKMQR 319

Query: 327 DIDAKEGLCGIAMQASYPT 345
                +G+CG+AMQ S+P 
Sbjct: 320 GTRNPQGVCGMAMQPSFPV 338


>gi|156739289|ref|NP_001096592.1| uncharacterized protein LOC569326 precursor [Danio rerio]
 gi|156230119|gb|AAI52283.1| Im:6910535 protein [Danio rerio]
          Length = 335

 Score =  259 bits (661), Expect = 2e-66,   Method: Compositional matrix adjust.
 Identities = 144/322 (44%), Positives = 193/322 (59%), Gaps = 18/322 (5%)

Query: 32  DATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYI--ASFNNKARNKPYKLGINEF 89
           D  +++    W +Q+G+ Y ++ E   R  I++EN+  I   +F     N  +K+G+N+F
Sbjct: 21  DIQLDDHWNSWKSQHGKSYHEDVEVGRRM-IWEENLRKIEQHNFEYSLGNHTFKMGMNQF 79

Query: 90  ADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENA--SVPASIDWRKKGAVTGVKDQG 147
            D TNEEFR   NGYK+       + T+  +   E +  + P  +DWR++G VT VKDQ 
Sbjct: 80  GDMTNEEFRQAMNGYKQD-----PNRTSKGALFMEPSFFAAPQQVDWRQRGYVTPVKDQK 134

Query: 148 QCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISN 207
           QCG CW+FS+  A+EG     T KL S+SEQ LVDC     +QGC GG+MD AF+++  N
Sbjct: 135 QCGSCWSFSSTGALEGQLFRKTGKLISMSEQNLVDCSRPQGNQGCNGGIMDQAFQYVKEN 194

Query: 208 KGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQ-PVSVAIDA 266
           KGL +E  YPY A D    + +   + AKI+G+ D+P  NE ALM AVA   PVSVAIDA
Sbjct: 195 KGLDSEQSYPYLARDDLPCRYDPRFNVAKITGFVDIPRGNELALMNAVAAVGPVSVAIDA 254

Query: 267 SGSDFQFYSSGVFTGQ-CGTELDHGVTAVGYGT--AD-DGTKYWLVKNSWGTTWGENGYI 322
           S    QFY SG++  + C + LDH V  VGYG   AD  G +YW+VKNSW   WG+ GYI
Sbjct: 255 SHQSLQFYQSGIYYERACTSRLDHAVLVVGYGYQGADVAGNRYWIVKNSWSDKWGDKGYI 314

Query: 323 RMQRDIDAKEGLCGIAMQASYP 344
            M +D   K   CGIA  ASYP
Sbjct: 315 YMAKD---KNNHCGIATMASYP 333


>gi|297845822|ref|XP_002890792.1| hypothetical protein ARALYDRAFT_473117 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297336634|gb|EFH67051.1| hypothetical protein ARALYDRAFT_473117 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 322

 Score =  259 bits (661), Expect = 2e-66,   Method: Compositional matrix adjust.
 Identities = 139/344 (40%), Positives = 209/344 (60%), Gaps = 40/344 (11%)

Query: 11  VLAAILVLGVWAPQSWSR---TLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENV 67
           VL A+ +L +    S +R   TLN+ ++ + H+ WM Q+ RVY+D +EKEMR ++FK+N+
Sbjct: 7   VLVALTILSMDLRISQARPHVTLNEQSIVDYHQQWMTQFSRVYQDESEKEMRLQVFKKNL 66

Query: 68  EYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENAS 127
           ++I +FNN   N+ Y +G+NEF D T EEF A   G +  + ++  SE  + +    N +
Sbjct: 67  KFIENFNNMG-NQSYTVGVNEFTDWTIEEFLATHTGLRVNVTTL--SELFNETMPSRNWN 123

Query: 128 VP------ASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELV 181
           +        S DWR +GAV  VK QG CG             +  I+ + L +LSEQ+L+
Sbjct: 124 ISDIDIDDESKDWRDEGAVIPVKVQGACG-------------LTKISGKNLLTLSEQQLI 170

Query: 182 DCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYE 241
           DCDT  ++ GC+GG +++AF++II N G++ E +YPY+   GSC     + +  +I G+E
Sbjct: 171 DCDTE-KNTGCDGGGIEEAFKYIIKNGGVSLETEYPYQVKKGSCRANARSATQTQIRGFE 229

Query: 242 DVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTG-QCGTELDHGVTAVGYGTAD 300
            VPS+NE AL++AV  QPVSV IDA    F+ Y  GV+ G  CGT+++H VT VGYGT  
Sbjct: 230 MVPSHNERALLEAVRRQPVSVLIDARADSFKTYKGGVYAGLDCGTDVNHAVTFVGYGTMI 289

Query: 301 DGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
                         +WGENGY+R++RD++  +G+CGIA  A+YP
Sbjct: 290 Q-------------SWGENGYMRIRRDVEWPQGMCGIAQVAAYP 320


>gi|326672302|ref|XP_003199633.1| PREDICTED: cathepsin L1-like [Danio rerio]
 gi|157423549|gb|AAI53506.1| Im:6910535 [Danio rerio]
          Length = 335

 Score =  258 bits (660), Expect = 2e-66,   Method: Compositional matrix adjust.
 Identities = 144/322 (44%), Positives = 193/322 (59%), Gaps = 18/322 (5%)

Query: 32  DATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYI--ASFNNKARNKPYKLGINEF 89
           D  +++    W +Q+G+ Y ++ E   R  I++EN+  I   +F     N  +K+G+N+F
Sbjct: 21  DIQLDDHWNSWKSQHGKSYHEDVEVGRRM-IWEENLRKIEQHNFEYSYGNHTFKMGMNQF 79

Query: 90  ADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENA--SVPASIDWRKKGAVTGVKDQG 147
            D TNEEFR   NGYK+       + T+  +   E +  + P  +DWR++G VT VKDQ 
Sbjct: 80  GDMTNEEFRQAMNGYKQD-----PNRTSKGALFMEPSFFAAPQQVDWRQRGYVTPVKDQK 134

Query: 148 QCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISN 207
           QCG CW+FS+  A+EG     T KL S+SEQ LVDC     +QGC GG+MD AF+++  N
Sbjct: 135 QCGSCWSFSSTGALEGQLFRKTGKLISMSEQNLVDCSRPQGNQGCNGGIMDQAFQYVKEN 194

Query: 208 KGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQ-PVSVAIDA 266
           KGL +E  YPY A D    + +   + AKI+G+ D+P  NE ALM AVA   PVSVAIDA
Sbjct: 195 KGLDSEQSYPYLARDDLPCRYDPRFNVAKITGFVDIPKGNELALMNAVAAVGPVSVAIDA 254

Query: 267 SGSDFQFYSSGVFTGQ-CGTELDHGVTAVGYGT--AD-DGTKYWLVKNSWGTTWGENGYI 322
           S    QFY SG++  + C + LDH V  VGYG   AD  G +YW+VKNSW   WG+ GYI
Sbjct: 255 SHQSLQFYQSGIYYERACTSRLDHAVLVVGYGYQGADVAGNRYWIVKNSWSDKWGDKGYI 314

Query: 323 RMQRDIDAKEGLCGIAMQASYP 344
            M +D   K   CGIA  ASYP
Sbjct: 315 YMAKD---KNNHCGIATMASYP 333


>gi|156739281|ref|NP_001096588.1| cathepsin L1-like precursor [Danio rerio]
 gi|166158351|ref|NP_001107526.1| uncharacterized protein LOC100135391 precursor [Xenopus (Silurana)
           tropicalis]
 gi|326672305|ref|XP_003199634.1| PREDICTED: cathepsin L1-like [Danio rerio]
 gi|156230096|gb|AAI52237.1| MGC174155 protein [Danio rerio]
 gi|163916362|gb|AAI57707.1| LOC100135391 protein [Xenopus (Silurana) tropicalis]
          Length = 335

 Score =  258 bits (660), Expect = 2e-66,   Method: Compositional matrix adjust.
 Identities = 151/344 (43%), Positives = 203/344 (59%), Gaps = 25/344 (7%)

Query: 11  VLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYI 70
           +L  + +  V+A  S      D  +++    W +Q+G+ Y ++ E   R  I++EN+  I
Sbjct: 5   LLVTLCISAVFAASSI-----DIQLDDHWNSWKSQHGKSYHEDVEVGRRM-IWEENLRKI 58

Query: 71  ASFNNKAR--NKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSE---TTDVSFRYEN 125
              N +    N  +K+G+N+F D TNEEFR   NGYK   P+ R+S+     + SF    
Sbjct: 59  EQHNFEYSYGNHTFKMGMNQFGDMTNEEFRQAMNGYKHD-PN-RTSQGPLFMEPSF---- 112

Query: 126 ASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDT 185
            + P  +DWR++G VT VKDQ QCG CW+FS+  A+EG     T KL S+SEQ LVDC  
Sbjct: 113 FAAPQQVDWRQRGYVTPVKDQKQCGSCWSFSSTGALEGQLFRKTGKLISMSEQNLVDCSR 172

Query: 186 SGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPS 245
              +QGC GG+MD AF+++  NKGL +E  YPY A D    + +   + AKI+G+ D+P 
Sbjct: 173 PQGNQGCNGGIMDQAFQYVKENKGLDSEQSYPYLARDDLPCRYDPRFNVAKITGFVDIPR 232

Query: 246 NNEAALMKAVANQ-PVSVAIDASGSDFQFYSSGVFTGQ-CGTELDHGVTAVGYGT--AD- 300
            NE ALM AVA   PVSVAIDAS    QFY SG++  + C + LDH V  VGYG   AD 
Sbjct: 233 GNELALMNAVAAVGPVSVAIDASHQSLQFYQSGIYYERACTSRLDHAVLVVGYGYQGADV 292

Query: 301 DGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
            G +YW+VKNSW   WG+ GYI M +D   K   CGIA  ASYP
Sbjct: 293 AGNRYWIVKNSWSDKWGDKGYIYMAKD---KNNHCGIATMASYP 333


>gi|228245|prf||1801240C Cys protease 3
          Length = 321

 Score =  258 bits (660), Expect = 2e-66,   Method: Compositional matrix adjust.
 Identities = 149/318 (46%), Positives = 193/318 (60%), Gaps = 15/318 (4%)

Query: 33  ATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNK--PYKLGINEFA 90
           AT +   + +  QYGR Y D  E+  R ++F++N + I  FN K  N    +K+ +N+F 
Sbjct: 13  ATASPSWDHFKTQYGRKYGDAKEELYRQRVFQQNEQLIEDFNKKFENGEVTFKVAMNQFG 72

Query: 91  DQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQCG 150
           D TNEEF A   GYK+      S       F  E   +   +DWR K  VT VKDQ QCG
Sbjct: 73  DMTNEEFNAVMKGYKKG-----SRGEPKAVFTAEGRPMARDVDWRTKALVTPVKDQEQCG 127

Query: 151 CCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGL 210
            CWAFSA  A+EG + +   +L SLSEQ+LVDC T   + GC GG M  AF++I  N G+
Sbjct: 128 SCWAFSATGALEGQHFLKNDELVSLSEQQLVDCSTDYGNDGCGGGWMTSAFDYIKDNGGI 187

Query: 211 ATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVAN-QPVSVAIDASGS 269
            TE+ YPY+A D SC + +AN   A  +G  ++  + E AL +AV+   P+SVAIDAS  
Sbjct: 188 DTESSYPYEAEDRSC-RFDANSIGAICTGSVEIVQHTEEALQEAVSGVGPISVAIDASHF 246

Query: 270 DFQFYSSGVFTGQ-CG-TELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRD 327
            FQFYSSGV+  Q C  T LDHGV AVGYGT +    YWLVKNSWG++WG+ GYI+M R+
Sbjct: 247 SFQFYSSGVYYEQNCSPTFLDHGVLAVGYGT-ESTKDYWLVKNSWGSSWGDAGYIKMSRN 305

Query: 328 IDAKEGLCGIAMQASYPT 345
            D     CGIA + SYPT
Sbjct: 306 RDNN---CGIASEPSYPT 320


>gi|306992173|gb|ADN19567.1| cathepsin L-like proteinase [Spodoptera frugiperda]
          Length = 344

 Score =  258 bits (660), Expect = 2e-66,   Method: Compositional matrix adjust.
 Identities = 146/322 (45%), Positives = 187/322 (58%), Gaps = 21/322 (6%)

Query: 40  EMWMA---QYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNK--PYKLGINEFADQTN 94
           E W A   ++ + Y    E + R KI+ EN   IA  N +   +   YKL  N++AD  +
Sbjct: 25  EEWNAFKMEHSKQYDSEVEDKFRMKIYVENKHRIAKHNQRFEQRLVSYKLKPNKYADMLH 84

Query: 95  EEFRAPRNGYKR------RLPSVRSSETTDVSFRY---ENASVPASIDWRKKGAVTGVKD 145
            EF    NG+ +      R  +V S      +  +    + S P  +DWRKKGAVT VKD
Sbjct: 85  HEFVHTMNGFNKTAKHGGRNKAVHSKGRDGRAATFIAPAHVSYPDHVDWRKKGAVTDVKD 144

Query: 146 QGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFII 205
           QG+CG CWAFS   A+EG +   T  L SLSEQ LVDC  +  + GC GGLMD+AF++I 
Sbjct: 145 QGKCGSCWAFSTTGALEGQHFRKTGYLVSLSEQNLVDCSAAYGNNGCNGGLMDNAFKYIK 204

Query: 206 SNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVAN-QPVSVAI 264
            N G+ TE  YPY+A D  C     N  A  + G+ D+P  +E  LM+AVA   P+SVAI
Sbjct: 205 DNGGIDTEKSYPYEAVDDKCRYNPKNSGADDV-GFVDIPQGDEEKLMQAVATVGPISVAI 263

Query: 265 DASGSDFQFYSSGVFTGQ--CGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYI 322
           DAS   FQFYS GV+  +    T+LDHGV  VGYGT ++G  YWLVKNSWG +WGE GYI
Sbjct: 264 DASQETFQFYSKGVYYDENCSSTDLDHGVMVVGYGTEEEGGDYWLVKNSWGRSWGELGYI 323

Query: 323 RMQRDIDAKEGLCGIAMQASYP 344
           +M  +   K   CGIA  ASYP
Sbjct: 324 KMAHN---KNNHCGIASSASYP 342


>gi|156398078|ref|XP_001638016.1| predicted protein [Nematostella vectensis]
 gi|156225133|gb|EDO45953.1| predicted protein [Nematostella vectensis]
          Length = 326

 Score =  258 bits (660), Expect = 2e-66,   Method: Compositional matrix adjust.
 Identities = 140/306 (45%), Positives = 187/306 (61%), Gaps = 14/306 (4%)

Query: 42  WMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPR 101
           W + +G+ Y D  E+  R  I+++N+E I   N  A +  YK+ +N   D T +EFR   
Sbjct: 30  WKSYHGKSYSDVHEERTRMAIWQQNLEKIKRHN--AEDHSYKMAMNHLGDLTEDEFRYFY 87

Query: 102 NGYKRRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAM 161
            G +    S +    T +     N  +P+S+DW +KG VTGVK+QGQCG CWAFS   ++
Sbjct: 88  LGVRAHHNSTKRGWATYMPP--SNVKIPSSVDWSQKGYVTGVKNQGQCGSCWAFSTTGSV 145

Query: 162 EGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKAS 221
           EG +   T  L SLSEQ L+DC  S  + GC+GGLMD+AF +I SN G+ TE+ YPY   
Sbjct: 146 EGQHFRKTGSLVSLSEQNLIDCSGSYGNNGCQGGLMDNAFRYIESNGGIDTESSYPYLGQ 205

Query: 222 DGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVAN-QPVSVAIDASGSDFQFYSSGVFT 280
            GSC+   ++   A+++GY+D+P  +E AL  AVA   PVSVA+DA  S +QFYSSGV+ 
Sbjct: 206 QGSCHFSSSHV-GARVTGYQDIPQGSEQALQSAVATVGPVSVAVDA--SQWQFYSSGVYD 262

Query: 281 GQ--CGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIA 338
                 T+LDHGV  +GYG   +G  YWLVKNSWG +WG  GYI M R+   K   CGIA
Sbjct: 263 NPYCSSTQLDHGVLVIGYGNY-NGQDYWLVKNSWGYSWGVEGYIMMSRN---KNNQCGIA 318

Query: 339 MQASYP 344
             ASYP
Sbjct: 319 SSASYP 324


>gi|343978787|gb|AEM76722.1| cathepsin L-like proteinase [Triatoma brasiliensis]
          Length = 330

 Score =  258 bits (660), Expect = 2e-66,   Method: Compositional matrix adjust.
 Identities = 149/343 (43%), Positives = 202/343 (58%), Gaps = 19/343 (5%)

Query: 9   KLVLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVE 68
           K++L A+ V+ V     +   +N     E  E +   +G+ Y++  E+  R KIF  N +
Sbjct: 2   KVLLVAVAVIAVSCANRF-YNINP----EEWETFKVVHGKNYKNQFEEMFRRKIFMNNKK 56

Query: 69  YIASFNNKARNK--PYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENA 126
            I + N K       YK+ +N F D  + E +A  NG+K    + R  +    S    N 
Sbjct: 57  RIEAHNAKYEQGEVSYKMKMNHFGDLMSHEIKALMNGFKMTPNTKREGKIYFPS----ND 112

Query: 127 SVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTS 186
            +P S+DWR+KGAVT VKDQGQCG CW+FSA  ++EG   +   KL SLSEQ L+DC   
Sbjct: 113 KLPKSVDWRQKGAVTPVKDQGQCGSCWSFSATGSLEGQIFLKKGKLVSLSEQNLMDCSKE 172

Query: 187 GEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSN 246
             + GCEGGLMD AF+++  NKG+ TE+ YPY+A D +C  K+ +       GY D+P  
Sbjct: 173 YGNNGCEGGLMDKAFQYVSDNKGIDTESSYPYEARDYACRFKK-DKVGGTDKGYVDIPEG 231

Query: 247 NEAALMKAVAN-QPVSVAIDASGSDFQFYSSGVFT-GQCGT-ELDHGVTAVGYGTADDGT 303
           +E AL  A+A   P+SVAIDAS   F FYS GV+    C + +LDHGV AVGYGT ++G 
Sbjct: 232 DEKALQNALATVGPISVAIDASHESFHFYSEGVYNEPYCSSYDLDHGVLAVGYGT-ENGQ 290

Query: 304 KYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPTA 346
            YWLVKNSWG +WGE+GYI++ R+       CGIA  ASYP  
Sbjct: 291 DYWLVKNSWGPSWGESGYIKIARN---HSNHCGIASMASYPIV 330


>gi|426219875|ref|XP_004004143.1| PREDICTED: cathepsin L1 [Ovis aries]
          Length = 333

 Score =  258 bits (660), Expect = 2e-66,   Method: Compositional matrix adjust.
 Identities = 153/349 (43%), Positives = 207/349 (59%), Gaps = 27/349 (7%)

Query: 8   NKLVLAAILVLGVW--APQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKE 65
           N  +L  +L LG+   AP+       D ++N + E+W A + + Y D  E+  R  ++K+
Sbjct: 2   NPSLLLTVLCLGIASAAPKF------DHSLNTQWELWKAVHRKPY-DLNEEGWRKAVWKK 54

Query: 66  NVEYIASFNNKARN--KPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRY 123
           N++ I   N +       + + +N F D T+EEFR   NG++R+      ++   V    
Sbjct: 55  NMKMIELHNQEYSQGKHSFSMAMNAFGDLTSEEFRQMMNGFQRQ-----ENKKGKVFHET 109

Query: 124 ENASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDC 183
             AS+P S+DWR+KG VT VK+QG+CG CWAFS   A+EG     T KL SLSEQ LVDC
Sbjct: 110 IFASIPPSVDWREKGYVTPVKNQGKCGSCWAFSTTGALEGQMFRKTGKLVSLSEQNLVDC 169

Query: 184 DTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDV 243
                ++GC GGLMD+AF++++   GL +E  YPY    G+CN    N SAA  +G+ D+
Sbjct: 170 SQPEGNRGCHGGLMDNAFQYVLDVGGLDSEESYPYTGLVGTCNYNPKN-SAANETGFVDL 228

Query: 244 PSNNEAALMKAVANQ-PVSVAIDASGSDFQFYSSGV-FTGQCGTE-LDHGVTAVGY---G 297
           P   E ALMKAVA   P+SVA+DAS   FQFY SG+ +  +C +E +DHGV  VGY   G
Sbjct: 229 PK-QENALMKAVATLGPISVAVDASNPSFQFYKSGIYYEPKCKSESVDHGVLVVGYGFEG 287

Query: 298 TADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPTA 346
              D  KYWLVKNSWG  WG NGYI+M +D   +   CGIA  ASYPT 
Sbjct: 288 ADSDDNKYWLVKNSWGKHWGINGYIKMAKD---QNNHCGIATMASYPTV 333


>gi|404312774|pdb|3TNX|A Chain A, Structure Of The Precursor Of A Thermostable Variant Of
           Papain At 2.6 Angstroem Resolution
 gi|404312775|pdb|3TNX|C Chain C, Structure Of The Precursor Of A Thermostable Variant Of
           Papain At 2.6 Angstroem Resolution
 gi|428698029|pdb|3USV|A Chain A, Structure Of The Precursor Of A Thermostable Variant Of
           Papain At 3.8 A Resolution From A Crystal Soaked At Ph 4
 gi|428698030|pdb|3USV|C Chain C, Structure Of The Precursor Of A Thermostable Variant Of
           Papain At 3.8 A Resolution From A Crystal Soaked At Ph 4
          Length = 363

 Score =  258 bits (659), Expect = 3e-66,   Method: Compositional matrix adjust.
 Identities = 140/325 (43%), Positives = 188/325 (57%), Gaps = 27/325 (8%)

Query: 31  NDATMNER----HEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGI 86
           ND T  ER     E WM ++ ++Y++  EK  RF+IFK+N++YI   N K  N  Y LG+
Sbjct: 54  NDLTSTERLIQLFESWMLKHNKIYKNIDEKIYRFEIFKDNLKYIDETNKK--NNSYWLGL 111

Query: 87  NEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYE------NASVPASIDWRKKGAV 140
           N FAD +N+EF+    G      S+  + TT     YE      + ++P  +DWR+KGAV
Sbjct: 112 NVFADMSNDEFKEKYTG------SIAGNYTT-TELSYEEVLNDGDVNIPEYVDWRQKGAV 164

Query: 141 TGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDA 200
           T VK+QG CG  WAFSAV+ +E I  I T  L   SEQEL+DCD      GC GG    A
Sbjct: 165 TPVKNQGSCGSAWAFSAVSTIESIIKIRTGNLNEYSEQELLDCDR--RSYGCNGGYPWSA 222

Query: 201 FEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPV 260
            + +++  G+     YPY+     C  +E  P AAK  G   V   NE AL+ ++ANQPV
Sbjct: 223 LQ-LVAQYGIHYRNTYPYEGVQRYCRSREKGPYAAKTDGVRQVQPYNEGALLYSIANQPV 281

Query: 261 SVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENG 320
           SV ++A+G DFQ Y  G+F G CG ++DH V AVGY     G  Y L++NSWGT WGENG
Sbjct: 282 SVVLEAAGKDFQLYRGGIFVGPCGNKVDHAVAAVGY-----GPNYILIRNSWGTGWGENG 336

Query: 321 YIRMQRDIDAKEGLCGIAMQASYPT 345
           YIR++R      G+CG+   + YP 
Sbjct: 337 YIRIKRGTGNSYGVCGLYTSSFYPV 361


>gi|261289811|ref|XP_002611767.1| hypothetical protein BRAFLDRAFT_284308 [Branchiostoma floridae]
 gi|229297139|gb|EEN67777.1| hypothetical protein BRAFLDRAFT_284308 [Branchiostoma floridae]
          Length = 336

 Score =  258 bits (659), Expect = 3e-66,   Method: Compositional matrix adjust.
 Identities = 149/343 (43%), Positives = 198/343 (57%), Gaps = 20/343 (5%)

Query: 10  LVLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEY 69
           L+L A++ +          T      N+  EMW  Q+G+ Y   AE+  R   F++N   
Sbjct: 4   LILGAVITMA---------TAGVLPHNKEWEMWKLQHGKQYETEAEEYSRRFTFEKNTIK 54

Query: 70  IASFNNKAR--NKPYKLGINEFADQTNEEFRAPRNGYKRRLPSV-RSSETTDVSFRYENA 126
           IA  N +A      Y L +N+F D  +EEF     G   ++  V +    ++V    +N 
Sbjct: 55  IAEHNIRASLGMHSYTLAMNKFGDMHHEEFHQRIMGGCLKIVKVNKPLLGSEVGDNDDNG 114

Query: 127 SVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTS 186
           ++P S+DWR    V+ VKDQG+CG CWAFS   ++EG +   T KL  LSEQ+LVDC   
Sbjct: 115 TLPKSVDWRNSAMVSEVKDQGECGSCWAFSTTGSLEGQHANKTGKLVDLSEQQLVDCSKD 174

Query: 187 GEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSN 246
             +QGC GGLMD AF++I +N GL TE  YPY A+D    K + +   A + GY+DV S 
Sbjct: 175 FGNQGCGGGLMDQAFQYIKANGGLDTEESYPYTATDDKPCKFDNSSVGATLIGYKDVKSG 234

Query: 247 NEAALMKAVAN-QPVSVAIDASGSDFQFYSSGVF-TGQCGTE-LDHGVTAVGYGTADDGT 303
           NE AL +AVA   P+SVAIDA    FQFYSSGV+   QC +E LDHGV  VGYG  +D +
Sbjct: 235 NEHALKRAVATVGPISVAIDAGHESFQFYSSGVYDEPQCSSEQLDHGVLVVGYGAMNDNS 294

Query: 304 K--YWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
              +W+VKNSWG  WG+ GYI M R+   K+  CGIA  ASYP
Sbjct: 295 HQAFWIVKNSWGPNWGDQGYIMMSRN---KDNQCGIATSASYP 334


>gi|405971603|gb|EKC36430.1| Cathepsin L [Crassostrea gigas]
          Length = 360

 Score =  258 bits (659), Expect = 3e-66,   Method: Compositional matrix adjust.
 Identities = 147/304 (48%), Positives = 191/304 (62%), Gaps = 13/304 (4%)

Query: 46  YGRVYRDNAEKEMRFKIFKENVEYIASFNNKAR--NKPYKLGINEFADQTNEEFRAPRNG 103
           + + Y    E+  RF+IF+ENV+ I   N       K Y LG+N+F+D  +EEF    NG
Sbjct: 63  HDKTYDALEEESRRFEIFRENVQKIEEHNKLYHLGKKSYYLGVNQFSDLKHEEF-VKYNG 121

Query: 104 YKRRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEG 163
            K+   S++    +       N   P S+DWRKKG VT VK+QGQCG CW+FS   ++EG
Sbjct: 122 LKK--TSLKDGGCSSY-LAANNLVEPDSVDWRKKGYVTDVKNQGQCGSCWSFSTTGSLEG 178

Query: 164 INHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDG 223
            +   + KL SLSE +LVDC  S  ++GC GGLMD+AF++I S  GL +E  YPYK   G
Sbjct: 179 QHFRKSGKLVSLSESQLVDCSQSFGNEGCNGGLMDNAFKYIKSVGGLESEEDYPYKPKQG 238

Query: 224 SCNKKEANPSAAKISGYEDVPSNNEAALMKAVAN-QPVSVAIDASGSDFQFYSSGVF-TG 281
           +C K +    AA  +G  DV S +E+AL KAV+   PVSVAIDAS S FQ Y+ GV+   
Sbjct: 239 TC-KFDDTKVAATDTGCVDVESGSESALKKAVSEVGPVSVAIDASHSSFQSYAGGVYDEP 297

Query: 282 QCGTE-LDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQ 340
           +C +E LDHGV  VGYGT D G  YW+VKNSWG  WGE+GY++M R+   K+  CGIA Q
Sbjct: 298 ECSSEQLDHGVLCVGYGTDDQGQDYWIVKNSWGAEWGEDGYVKMSRN---KKNQCGIATQ 354

Query: 341 ASYP 344
           ASYP
Sbjct: 355 ASYP 358


>gi|189525868|ref|XP_001341714.2| PREDICTED: cathepsin L1-like isoform 1 [Danio rerio]
          Length = 336

 Score =  258 bits (659), Expect = 3e-66,   Method: Compositional matrix adjust.
 Identities = 148/345 (42%), Positives = 203/345 (58%), Gaps = 21/345 (6%)

Query: 10  LVLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEY 69
           ++ A ++ L + A  + S    D  +++    W +Q+G+ Y ++ E   R  I++EN+  
Sbjct: 1   MMFALLVTLSISAVFAASSI--DIQLDDHWNSWKSQHGKSYHEDVEVGRRM-IWEENLRK 57

Query: 70  I--ASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENA- 126
           I   +F     N  +K+G+N+F D TNEEFR   NGYK        ++T+      E + 
Sbjct: 58  IEQHNFEYSYGNHTFKMGMNQFGDMTNEEFRQAMNGYKHD-----PNQTSQGPLFMEPSF 112

Query: 127 -SVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDT 185
            + P  +DWR++G VT VKDQ QCG CW+FS+  A+EG     T KL S+SEQ LVDC  
Sbjct: 113 FAAPQQVDWRQRGYVTPVKDQKQCGSCWSFSSTGALEGQLFRKTGKLISMSEQNLVDCSR 172

Query: 186 SGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPS 245
              +QGC GGLMD AF+++  NKGL +E  YPY A D    + +   + AKI+G+ D+PS
Sbjct: 173 PQGNQGCNGGLMDQAFQYVKENKGLDSEQSYPYLARDDLPCRYDPRFNVAKITGFVDIPS 232

Query: 246 NNEAALMKAVANQ-PVSVAIDASGSDFQFYSSGVFTGQC--GTELDHGVTAVGYGT--AD 300
            NE ALM AVA   PVSVAIDAS    QFY SG++  +    + LDH V  VGYG   AD
Sbjct: 233 GNELALMNAVAAVGPVSVAIDASHQSLQFYQSGIYYERACSSSRLDHAVLVVGYGYQGAD 292

Query: 301 -DGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
             G +YW+VKNSW   WG+ GYI M +D   K   CG+A +ASYP
Sbjct: 293 VAGNRYWIVKNSWSDKWGDKGYIYMAKD---KNNHCGVATKASYP 334


>gi|392881548|gb|AFM89606.1| cathepsin L [Callorhinchus milii]
          Length = 338

 Score =  258 bits (659), Expect = 3e-66,   Method: Compositional matrix adjust.
 Identities = 147/345 (42%), Positives = 201/345 (58%), Gaps = 24/345 (6%)

Query: 10  LVLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEY 69
           +VL+  L  G+ AP        D  ++   E W + +G+ Y +  E+  R  ++++++  
Sbjct: 6   VVLSLCLAGGLAAPSL------DPGLDTHWEQWKSWHGKSY-EQKEETWRRMVWEKHLRV 58

Query: 70  IASFN--NKARNKPYKLGINEFADQTNEEFRAPRNGYKRR--LPSVRSSETTDVSFRYEN 125
           I   N  +      ++LG+N F D  NEEFR   NGYK +     ++ S   + +F+   
Sbjct: 59  IEIHNLEHSLGKHSFRLGMNHFGDMPNEEFRQLMNGYKYKQTHKKLQGSHFLEPNFQ--- 115

Query: 126 ASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDT 185
             VP  +DWR +G VT VKDQGQCG CWAFS   A+EG +   T +L SLSEQ LV+C  
Sbjct: 116 -EVPKHVDWRDEGYVTPVKDQGQCGSCWAFSTTGALEGQHFRRTGQLVSLSEQNLVECSK 174

Query: 186 SGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPS 245
              ++GC GGLMD AF+++  N G+ +E  YPY  +D +        +AA  +G+ D+PS
Sbjct: 175 PEGNEGCNGGLMDQAFQYVKDNGGIDSEDSYPYVGTDDTPCHYNPQYNAANDTGFVDIPS 234

Query: 246 NNEAALMKAVAN-QPVSVAIDASGSDFQFYSSGV-FTGQC-GTELDHGVTAVGYGTAD-- 300
             E ALMKA+A   PVSVAIDA  + FQFY SG+ F  +C  T+LDHGV  VGYG     
Sbjct: 235 GKERALMKAIAAVGPVSVAIDAGHTSFQFYQSGIYFEAECSSTDLDHGVLVVGYGVEKRD 294

Query: 301 -DGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
            DG KYW+VKNSW   WG+NGYI M +D   K+  CGIA  ASYP
Sbjct: 295 TDGKKYWIVKNSWSEKWGQNGYILMAKD---KDNHCGIATAASYP 336


>gi|410923307|ref|XP_003975123.1| PREDICTED: cathepsin L1-like [Takifugu rubripes]
          Length = 336

 Score =  258 bits (659), Expect = 3e-66,   Method: Compositional matrix adjust.
 Identities = 150/342 (43%), Positives = 198/342 (57%), Gaps = 17/342 (4%)

Query: 12  LAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIA 71
           +  ++VL +    + S    D  ++E   +W   + + Y +  E   R  ++++N++ I 
Sbjct: 1   MFPVVVLALCVTAALSAPSLDPQLDEHWNLWKDWHSKKYHEKEEGWRRM-VWEKNLKKIE 59

Query: 72  SFN--NKARNKPYKLGINEFADQTNEEFRAPRNGYK-RRLPSVRSSETTDVSFRYENASV 128
             N  +      Y LG+N F D T+EEFR   NGYK +    +R S   + +F       
Sbjct: 60  LHNLEHSMGKHTYSLGMNHFGDMTHEEFRQIMNGYKLKSQRKLRGSLFMEPNF----LEA 115

Query: 129 PASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGE 188
           P S+DWR KG VT VKDQGQCG CWAFS   AMEG +   T  L SLSEQ LVDC     
Sbjct: 116 PRSVDWRDKGYVTPVKDQGQCGSCWAFSTTGAMEGQHFRKTGTLVSLSEQNLVDCSRPEG 175

Query: 189 DQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNE 248
           ++GC GGLMD AF++I  N GL +E  YPY  +D      + + ++A  +G+ DVPS +E
Sbjct: 176 NEGCNGGLMDQAFQYIKDNGGLDSEESYPYLGTDEGPCHYDPSYNSANDTGFVDVPSGSE 235

Query: 249 AALMKAVAN-QPVSVAIDASGSDFQFYSSGV-FTGQCGT-ELDHGVTAVGYGTAD---DG 302
            ALMKAVA+  PVSVAIDA    FQFY SG+ +  +C + ELDHGV  VGYG      DG
Sbjct: 236 RALMKAVASVGPVSVAIDAGHESFQFYHSGIYYDKECSSEELDHGVLVVGYGFEGKDVDG 295

Query: 303 TKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
            KYW+VKNSW   WG+ GYI M +D   K+  CGIA  ASYP
Sbjct: 296 KKYWIVKNSWSENWGDKGYIYMAKD---KKNHCGIATAASYP 334


>gi|46576360|sp|P60994.1|ERVB_TABDI RecName: Full=Ervatamin-B; Short=ERV-B
 gi|30749291|pdb|1IWD|A Chain A, Proposed Amino Acid Sequence And The 1.63 Angstrom X-ray
           Crystal Structure Of A Plant Cysteine Protease Ervatamin
           B: Insight Into The Structural Basis Of Its Stability
           And Substrate Specificity
          Length = 215

 Score =  258 bits (659), Expect = 3e-66,   Method: Compositional matrix adjust.
 Identities = 123/218 (56%), Positives = 155/218 (71%), Gaps = 5/218 (2%)

Query: 128 VPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSG 187
           +P+ +DWR KGAV  +K+Q QCG CWAFSAVAA+E IN I T +L SLSEQELVDCDT+ 
Sbjct: 1   LPSFVDWRSKGAVNSIKNQKQCGSCWAFSAVAAVESINKIRTGQLISLSEQELVDCDTA- 59

Query: 188 EDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNN 247
              GC GG M++AF++II+N G+ T+  YPY A  GSC  K        I+G++ V  NN
Sbjct: 60  -SHGCNGGWMNNAFQYIITNGGIDTQQNYPYSAVQGSC--KPYRLRVVSINGFQRVTRNN 116

Query: 248 EAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWL 307
           E+AL  AVA+QPVSV ++A+G+ FQ YSSG+FTG CGT  +HGV  VGYGT   G  YW+
Sbjct: 117 ESALQSAVASQPVSVTVEAAGAPFQHYSSGIFTGPCGTAQNHGVVIVGYGT-QSGKNYWI 175

Query: 308 VKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPT 345
           V+NSWG  WG  GYI M+R++ +  GLCGIA   SYPT
Sbjct: 176 VRNSWGQNWGNQGYIWMERNVASSAGLCGIAQLPSYPT 213


>gi|297818854|ref|XP_002877310.1| hypothetical protein ARALYDRAFT_484828 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297323148|gb|EFH53569.1| hypothetical protein ARALYDRAFT_484828 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 376

 Score =  258 bits (659), Expect = 3e-66,   Method: Compositional matrix adjust.
 Identities = 141/321 (43%), Positives = 194/321 (60%), Gaps = 15/321 (4%)

Query: 31  NDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFA 90
           N+A +   +E W+ ++G+ Y    EKE RFKIFK+N+++I   N+   N+ Y  G+N+F+
Sbjct: 33  NEAEVRTIYERWLVEHGKNYNGLGEKERRFKIFKDNLKHIEEHNSDP-NRSYDRGLNQFS 91

Query: 91  DQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRY---ENASVPASIDWRKKGAVTG-VKDQ 146
           D T +EF+A   G K     +     +DV+ RY   E   +P  +DWR++GAV   VK Q
Sbjct: 92  DLTVDEFQASYLGGK-----IEKKSLSDVAERYQYKEGDILPDEVDWRERGAVVPRVKRQ 146

Query: 147 GQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIIS 206
           G CG CWAF+A  A+EGIN ITT +L SLSEQEL+DCD   ++ GC GG    AFEFI  
Sbjct: 147 GDCGSCWAFAATGAVEGINQITTGELLSLSEQELIDCDRGKDNFGCAGGGAVWAFEFIKE 206

Query: 207 NKGLATEAKYPYKASD-GSCNKKEANPS-AAKISGYEDVPSNNEAALMKAVANQPVSVAI 264
           N G+ T+  Y Y   D  +C   E   +    I+G+E VP N+E +L KAV+ QP+SV I
Sbjct: 207 NGGIVTDEDYGYTGDDTAACKAIEMKTTRVVTINGHEVVPVNDEMSLKKAVSYQPISVMI 266

Query: 265 DASGSDFQFYSSGVFTGQCGTEL-DHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIR 323
             S ++   Y SGV+ G C     DH V  VGYGT+ D   YWL++NSWG  WGE GY+R
Sbjct: 267 --SAANMSDYKSGVYKGPCSNLWGDHNVLIVGYGTSSDEGDYWLIRNSWGPGWGEGGYLR 324

Query: 324 MQRDIDAKEGLCGIAMQASYP 344
           +QR+ +   G C +A+   YP
Sbjct: 325 LQRNFNEPTGKCAVAVAPVYP 345


>gi|156739275|ref|NP_001096585.1| cathepsin L1-like precursor [Danio rerio]
 gi|156230123|gb|AAI52285.1| MGC174857 protein [Danio rerio]
          Length = 335

 Score =  258 bits (658), Expect = 3e-66,   Method: Compositional matrix adjust.
 Identities = 147/323 (45%), Positives = 195/323 (60%), Gaps = 20/323 (6%)

Query: 32  DATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKAR--NKPYKLGINEF 89
           D  +++    W +Q+G+ Y ++ E   R  I++EN+  I   N +    N  +K+G+N+F
Sbjct: 21  DIQLDDHWNSWKSQHGKSYHEDLEVGRRM-IWEENLRKIEQHNFEYSYGNHTFKMGMNQF 79

Query: 90  ADQTNEEFRAPRNGYKRRLPSVRSSE---TTDVSFRYENASVPASIDWRKKGAVTGVKDQ 146
            D TNEEFR   NGYK   P+ R+S+     + SF     + P  +DWR++G VT VKDQ
Sbjct: 80  GDMTNEEFRQAMNGYKHD-PN-RTSQGPLFMEPSF----FAAPQQVDWRQRGYVTPVKDQ 133

Query: 147 GQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIIS 206
            QCG CW+FS+  A+EG     T KL S+SEQ LVDC     +QGC GG+MD AF+++  
Sbjct: 134 KQCGSCWSFSSTGALEGQLFRKTGKLISMSEQNLVDCSRPQGNQGCNGGIMDQAFQYVKE 193

Query: 207 NKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQ-PVSVAID 265
           NKGL +E  YPY A D    + +   + AKI+G+ D+P  NE ALM AVA   PVSVAID
Sbjct: 194 NKGLDSEQSYPYLARDDLPCRYDPRFNVAKITGFVDIPRGNELALMNAVAAVGPVSVAID 253

Query: 266 ASGSDFQFYSSGVFTGQ-CGTELDHGVTAVGYGT--AD-DGTKYWLVKNSWGTTWGENGY 321
           AS    QFY SG++  + C + LDH V  VGYG   AD  G +YW+VKNSW   WG+ GY
Sbjct: 254 ASHQSLQFYQSGIYYERACTSRLDHAVLVVGYGYQGADVAGNRYWIVKNSWSDKWGDKGY 313

Query: 322 IRMQRDIDAKEGLCGIAMQASYP 344
           I M +D   K   CGIA  ASYP
Sbjct: 314 IYMAKD---KNNHCGIATMASYP 333


>gi|344953542|gb|AEN28617.1| cathepsin L-like cysteine protease [Epinephelus coioides]
          Length = 336

 Score =  258 bits (658), Expect = 4e-66,   Method: Compositional matrix adjust.
 Identities = 143/322 (44%), Positives = 191/322 (59%), Gaps = 17/322 (5%)

Query: 32  DATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFN--NKARNKPYKLGINEF 89
           D  +++  E+W + + + Y +  E   R  ++++N++ I   N  +      Y+LG+N F
Sbjct: 21  DPQLDDHWELWKSWHSKKYHEKEEGWRRM-VWEKNLKKIELHNLEHSMGTHSYRLGMNHF 79

Query: 90  ADQTNEEFRAPRNGYKRRLPS-VRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQ 148
            D T+EEFR   NGYKR+  +  R S   + +F       P S+DWR  G VT VKDQGQ
Sbjct: 80  GDMTHEEFRQLMNGYKRKAETKARGSLFLEPNF----LEAPKSVDWRDNGYVTPVKDQGQ 135

Query: 149 CGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNK 208
           CG CWAFS   A+EG +   T KL SLSEQ LVDC     ++GC GGLMD AF+++  N+
Sbjct: 136 CGSCWAFSTTGALEGQHFRKTGKLVSLSEQNLVDCSRPEGNEGCNGGLMDQAFQYVKDNQ 195

Query: 209 GLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVAN-QPVSVAIDAS 267
           GL +E  YPY  +D      +   ++   +G+ D+PS  E ALMKAVA   PVSVAIDA 
Sbjct: 196 GLDSEDSYPYLGTDDQPCHYDPTYNSVNDTGFVDIPSGKERALMKAVAAVGPVSVAIDAG 255

Query: 268 GSDFQFYSSGV-FTGQCGT-ELDHGVTAVGYGTAD---DGTKYWLVKNSWGTTWGENGYI 322
              FQFY SG+ +  +C + ELDHGV  VGYG      DG KYW+VKNSW   WG+ GYI
Sbjct: 256 HESFQFYQSGIYYEKECSSEELDHGVLVVGYGFQGEDVDGKKYWIVKNSWSEKWGDKGYI 315

Query: 323 RMQRDIDAKEGLCGIAMQASYP 344
            M +D   ++  CGIA  ASYP
Sbjct: 316 YMAKD---RKNHCGIATAASYP 334


>gi|34559455|gb|AAQ75437.1| cathepsin L-like protease [Helicoverpa armigera]
 gi|338855117|gb|AEJ31938.1| cathepsin L-like protease [Helicoverpa assulta]
          Length = 341

 Score =  258 bits (658), Expect = 4e-66,   Method: Compositional matrix adjust.
 Identities = 145/319 (45%), Positives = 186/319 (58%), Gaps = 18/319 (5%)

Query: 40  EMWMA---QYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNK--PYKLGINEFADQTN 94
           E W A   ++ + Y    E + R KI+ EN   IA  N +       YKL  N++AD  +
Sbjct: 25  EEWSAFKLEHSKRYDSEVEDKFRMKIYLENKHRIAKHNQRFEQGAVSYKLRPNKYADMLS 84

Query: 95  EEFRAPRNGYKRRLPSVRS-----SETTDVSFRYE-NASVPASIDWRKKGAVTGVKDQGQ 148
            EF    NG+ + L   ++      E+   +F    + + P  +DWRKKGAVT VKDQG+
Sbjct: 85  HEFVHVMNGFNKTLKHPKAVHGKGRESRPATFIAPAHVTYPDHVDWRKKGAVTEVKDQGK 144

Query: 149 CGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNK 208
           CG CWAFS   A+EG +   T  L SLSEQ L+DC  +  + GC GGLMD+AF++I  N 
Sbjct: 145 CGSCWAFSTTGALEGQHFRKTGYLVSLSEQNLIDCSAAYGNNGCNGGLMDNAFKYIKDNG 204

Query: 209 GLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVAN-QPVSVAIDAS 267
           G+ TE  YPY+  D  C     N  A  + G+ D+P  +E  LM+AVA   PVSVAIDAS
Sbjct: 205 GIDTEKAYPYEGVDDKCRYNAKNSGADDV-GFVDIPQGDEEKLMQAVATVGPVSVAIDAS 263

Query: 268 GSDFQFYSSGVFTGQ--CGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQ 325
              FQFYS GV+  +    T+LDHGV  VGYGT + G  YWLVKNSWG TWG+ GYI+M 
Sbjct: 264 QESFQFYSDGVYYDENCSSTDLDHGVMVVGYGTDEQGGDYWLVKNSWGRTWGDLGYIKMA 323

Query: 326 RDIDAKEGLCGIAMQASYP 344
           R+   K   CGIA  ASYP
Sbjct: 324 RN---KNNHCGIASSASYP 339


>gi|50539796|ref|NP_001002368.1| cathepsin L.1 precursor [Danio rerio]
 gi|49900360|gb|AAH75887.1| Cathepsin L.1 [Danio rerio]
          Length = 334

 Score =  258 bits (658), Expect = 4e-66,   Method: Compositional matrix adjust.
 Identities = 151/313 (48%), Positives = 194/313 (61%), Gaps = 19/313 (6%)

Query: 42  WMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARN--KPYKLGINEFADQTNEEFRA 99
           W  ++G+ YR   E+  R   +  N + +   N  A    K Y+LG+  FAD +NEE+R 
Sbjct: 29  WKLKFGKSYRSAEEESHRQLTWLTNRKLVLVHNMMADQGLKSYRLGMTYFADMSNEEYRQ 88

Query: 100 PRNGYKRRLPSVRSSETTDVS--FRYENASV-PASIDWRKKGAVTGVKDQGQCGCCWAFS 156
               ++  L S+ +++    S  FR   A+V P ++DWR KG VT +KDQ QCG CWAFS
Sbjct: 89  LV--FRGCLGSMNNTKARGGSTFFRLRKAAVVPDTVDWRDKGYVTDIKDQKQCGSCWAFS 146

Query: 157 AVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKY 216
           A  ++EG     T KL SLSEQ+LVDC  S  + GC+GGLMD AF++I +NKGL TE  Y
Sbjct: 147 ATGSLEGQTFRKTGKLVSLSEQQLVDCSGSYGNYGCDGGLMDQAFQYIEANKGLDTEDSY 206

Query: 217 PYKASDGSCNKKEANPS--AAKISGYEDVPSNNEAALMKAVAN-QPVSVAIDASGSDFQF 273
           PY+A DG C     NPS   A  +GY D+ S +E+AL +AVA   P+SVAIDA  S FQ 
Sbjct: 207 PYEAQDGEC---RFNPSTVGASCTGYVDIASGDESALQEAVATIGPISVAIDAGHSSFQL 263

Query: 274 YSSGVFT-GQC-GTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAK 331
           YSSGV+    C  +ELDHGV AVGYG++ +G  YW+VKNSWG  WG  GYI M R+   K
Sbjct: 264 YSSGVYNEPDCSSSELDHGVLAVGYGSS-NGDDYWIVKNSWGLDWGVQGYILMSRN---K 319

Query: 332 EGLCGIAMQASYP 344
              CGIA  ASYP
Sbjct: 320 SNQCGIATAASYP 332


>gi|118125|sp|P25784.1|CYSP3_HOMAM RecName: Full=Digestive cysteine proteinase 3; Flags: Precursor
          Length = 321

 Score =  258 bits (658), Expect = 4e-66,   Method: Compositional matrix adjust.
 Identities = 151/318 (47%), Positives = 194/318 (61%), Gaps = 16/318 (5%)

Query: 33  ATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNK--PYKLGINEFA 90
           AT +   + +  QYGR Y D  E+  R ++F++N + I  FN K  N    +K+ +N+F 
Sbjct: 14  ATASPSWDHFKTQYGRKYGDAKEELYRQRVFQQNEQLIEDFNKKFENGEVTFKVAMNQFG 73

Query: 91  DQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQCG 150
           D TNEEF A   GYK+      S       F  E   + A +DWR K  VT VKDQ QCG
Sbjct: 74  DMTNEEFNAVMKGYKKG-----SRGEPKAVFTAEAGPMAADVDWRTKALVTPVKDQEQCG 128

Query: 151 CCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGL 210
            CWAFSA  A+EG + +   +L SLSEQ+LVDC T   + GC GG M  AF++I  N G+
Sbjct: 129 SCWAFSATGALEGQHFLKNDELVSLSEQQLVDCSTDYGNDGCGGGWMTSAFDYIKDNGGI 188

Query: 211 ATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVAN-QPVSVAIDASGS 269
            TE+ YPY+A D SC + +AN   A  +G  +V  + E AL +AV+   P+SVAIDAS  
Sbjct: 189 DTESSYPYEAEDRSC-RFDANSIGAICTGSVEV-QHTEEALQEAVSGVGPISVAIDASHF 246

Query: 270 DFQFYSSGVFTGQ-CG-TELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRD 327
            FQFYSSGV+  Q C  T LDHGV AVGYGT +    YWLVKNSWG++WG+ GYI+M R+
Sbjct: 247 SFQFYSSGVYYEQNCSPTFLDHGVLAVGYGT-ESTKDYWLVKNSWGSSWGDAGYIKMSRN 305

Query: 328 IDAKEGLCGIAMQASYPT 345
            D     CGIA + SYPT
Sbjct: 306 RDNN---CGIASEPSYPT 320


>gi|395514298|ref|XP_003761356.1| PREDICTED: cathepsin L1-like [Sarcophilus harrisii]
          Length = 365

 Score =  258 bits (658), Expect = 4e-66,   Method: Compositional matrix adjust.
 Identities = 156/348 (44%), Positives = 199/348 (57%), Gaps = 41/348 (11%)

Query: 32  DATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNK--ARNKPYKLGINEF 89
           D T++ +   W AQ+ R Y +N  ++ R  I+++N+  I   N +  A    +++ +N+F
Sbjct: 22  DRTLDAQWYQWKAQHRRDYGEN--EDWRRAIWEKNLRSIEMHNLEYSAGKHSFQMEMNKF 79

Query: 90  ADQTNEEFRAPRNGY-----KRRLPSVRSSETTDVS------------------------ 120
            D TNEEFR   NG+     +RR       E   V                         
Sbjct: 80  GDMTNEEFRQVMNGFSTHRVQRRTKGRLFREPLLVQIPKSVDWRDKGYVTPVKNQLVRRL 139

Query: 121 FRYEN-ASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQE 179
           FR      +P S+DWR KG VT VK+QGQCG CWAFSA  ++EG     T KL SLSEQ 
Sbjct: 140 FREPLLVQIPKSVDWRDKGYVTPVKNQGQCGSCWAFSATGSLEGQWFRKTGKLVSLSEQN 199

Query: 180 LVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISG 239
           LVDC T+  + GC+GGLMD+AFE++  N G+ TE  YPY A+D +C  K    S A I+G
Sbjct: 200 LVDCSTAQGNSGCQGGLMDNAFEYVKENGGIDTEESYPYIAADDTCQYK-PQYSGANITG 258

Query: 240 YEDVPSNNEAALMKAVANQ-PVSVAIDASGSDFQFYSSGV-FTGQCGTE-LDHGVTAVGY 296
           Y D+PS  E AL KAVA   P+SVAIDA  S FQFY SGV +  +C +E LDHGV AVGY
Sbjct: 259 YVDIPSRMEKALEKAVATVGPISVAIDAGHSSFQFYRSGVYYEPECSSEDLDHGVLAVGY 318

Query: 297 GTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
           G      KYW+VKNSWG  WG++GYI M RD   +   CGIA  ASYP
Sbjct: 319 GVQGKNGKYWIVKNSWGEEWGDSGYILMARD---RNNHCGIATAASYP 363


>gi|322799749|gb|EFZ20954.1| hypothetical protein SINV_06041 [Solenopsis invicta]
          Length = 337

 Score =  258 bits (658), Expect = 4e-66,   Method: Compositional matrix adjust.
 Identities = 151/342 (44%), Positives = 196/342 (57%), Gaps = 20/342 (5%)

Query: 10  LVLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEY 69
           L+  A+L +G     S+++ L+         ++   + +VY+   E+  R KI+ +N   
Sbjct: 7   LLFLAVLAMG--QTVSFNKILDAEWF-----IFKLHHNKVYKSPVEEGYRMKIYMDNKRK 59

Query: 70  IASFNNK--ARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSF-RYENA 126
           IA  N K       YKLG+N++ D  + EF    NG+ + + +    ET  V+F    N 
Sbjct: 60  IAEHNRKYELNEVTYKLGMNKYGDMLHHEFVNTLNGFNKSVTA--GIETEGVTFISPANV 117

Query: 127 SVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTS 186
            +P  +DW K+GAVT VKDQG CG CWAFS+  A+EG +  +T  L SLSEQ L+DC   
Sbjct: 118 KLPDEVDWTKQGAVTAVKDQGHCGSCWAFSSTGALEGQHFRSTGYLVSLSEQNLIDCSGK 177

Query: 187 GEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSN 246
             + GC GGLMD AF++I  NKGL TE  YPY+A +  C     N S A   GY D+P  
Sbjct: 178 YGNNGCNGGLMDYAFQYIKDNKGLDTEKTYPYEAENDRCRYNPRN-SGATDKGYVDIPQG 236

Query: 247 NEAALMKAVAN-QPVSVAIDASGSDFQFYSSGVFTG-QCGTE-LDHGVTAVGYGTAD-DG 302
           +E  L  AVA   P+SVAIDAS   FQ YS GV+    C  E LDHGV  VGYGT +  G
Sbjct: 237 DEEKLKAAVATIGPISVAIDASHESFQLYSEGVYYDPDCSAENLDHGVLIVGYGTDETSG 296

Query: 303 TKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
             YWLVKNSWG TWG+ GYI+M R+   K   CGIA  ASYP
Sbjct: 297 HDYWLVKNSWGKTWGQKGYIKMARN---KNNHCGIASSASYP 335


>gi|261289783|ref|XP_002611753.1| hypothetical protein BRAFLDRAFT_236364 [Branchiostoma floridae]
 gi|229297125|gb|EEN67763.1| hypothetical protein BRAFLDRAFT_236364 [Branchiostoma floridae]
          Length = 307

 Score =  258 bits (658), Expect = 4e-66,   Method: Compositional matrix adjust.
 Identities = 150/297 (50%), Positives = 183/297 (61%), Gaps = 13/297 (4%)

Query: 55  EKEMRFKIFKENVEYIASFNNKAR--NKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVR 112
           E+  R +IF+ N + I   NN+A      Y LG N+FA  TN+EF A   G    L    
Sbjct: 15  EESRRMEIFENNTKLINLHNNEADLGMHTYWLGHNQFAHMTNDEFVANVIG-GCLLDRNA 73

Query: 113 SSETTDVSFRYEN--ASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTR 170
           S  T D   +Y++    +P ++DWR KG VT VK+Q QCG CWAFS   ++EG     T 
Sbjct: 74  SKSTADRVHQYDSNLVELPDTVDWRTKGYVTPVKNQEQCGSCWAFSTTGSLEGQTFKKTG 133

Query: 171 KLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEA 230
           KL SLSEQ LVDC     +QGC GGLMDDAF++I +N G+ TE  YPY+A DG C  K A
Sbjct: 134 KLVSLSEQNLVDCSGEFGNQGCNGGLMDDAFKYIKANGGIDTEDSYPYEARDGKCRFKPA 193

Query: 231 NPSAAKISGYEDVPSNNEAALMKAVAN-QPVSVAIDASGSDFQFYSSGV-FTGQC-GTEL 287
           +   A ++GY D+   +E AL +AVA   P+SVAIDAS   FQ YS GV +  QC  TEL
Sbjct: 194 DV-GATVTGYTDISEGDEGALTQAVATVGPISVAIDASHHTFQMYSHGVYYEPQCSSTEL 252

Query: 288 DHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
           DHGV AVGYGT + G  YWLVKNSWG  WG+NGYI M R+   K   CGIA  ASYP
Sbjct: 253 DHGVLAVGYGT-EGGKDYWLVKNSWGEVWGQNGYIMMSRN---KNNQCGIATSASYP 305


>gi|15593252|gb|AAL02222.1|AF410882_1 cysteine protease CP14 precursor [Frankliniella occidentalis]
          Length = 333

 Score =  258 bits (658), Expect = 4e-66,   Method: Compositional matrix adjust.
 Identities = 146/321 (45%), Positives = 196/321 (61%), Gaps = 16/321 (4%)

Query: 31  NDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNK--ARNKPYKLGINE 88
           +D  +    E + A + + Y +  E+  R K+FKEN   IA  N++  +    +K+G N+
Sbjct: 20  SDMEIQAHWESFKATHAKTYANAVEEAYRAKVFKENAIRIAKHNDRFASGEVTFKVGYNQ 79

Query: 89  FADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENASVPAS--IDWRKKGAVTGVKDQ 146
           +AD    E     NGY+  L    +   T       N S P S  +DWR KGAVT +KDQ
Sbjct: 80  YADMHTHEVTEKLNGYRSGLKQASAFVHTA-----SNDSWPWSKKVDWRSKGAVTPIKDQ 134

Query: 147 GQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIIS 206
           GQCG CW+FSA  ++EG   +  + L SLSEQ LVDC     ++GC GGLMD AFE++ S
Sbjct: 135 GQCGSCWSFSATGSLEGQLFLKNKNLVSLSEQNLVDCSWDFGNEGCNGGLMDSAFEYVKS 194

Query: 207 NKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVAN-QPVSVAID 265
           N G+ TE  YPY A DG+C  K AN +A   +GY+DV + +E+AL  AV    PVSVAID
Sbjct: 195 NGGIDTEESYPYTAEDGTCLYKAAN-NAGVNTGYKDVQAKSESALRDAVEKVGPVSVAID 253

Query: 266 ASGSDFQFYSSGV-FTGQCGTE-LDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIR 323
           AS   FQ Y+SG+ +   C ++ LDHGV AVGYG+     ++W+VKNSWGT+WGE GYI+
Sbjct: 254 ASNWSFQMYTSGIYYEPACSSDSLDHGVLAVGYGSEWPNKEFWIVKNSWGTSWGEEGYIK 313

Query: 324 MQRDIDAKEGLCGIAMQASYP 344
           M R+   K+  CGIA +ASYP
Sbjct: 314 MARN---KKNNCGIATEASYP 331


>gi|159792912|gb|ABW98676.1| cathepsin L [Apostichopus japonicus]
          Length = 332

 Score =  257 bits (657), Expect = 4e-66,   Method: Compositional matrix adjust.
 Identities = 151/324 (46%), Positives = 193/324 (59%), Gaps = 15/324 (4%)

Query: 29  TLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARN--KPYKLGI 86
           T+ D   ++  E W   + + Y    E++ R KI+++N++ ++  N +       Y LG+
Sbjct: 18  TIIDKGFDDTWEAWKQTHSKQYT-KEEEDNRRKIWEDNLQKVSKHNTEHSLGLHSYTLGM 76

Query: 87  NEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSF-RYENASVPASIDWRKKGAVTGVKD 145
           N++AD   EEF    NG K       S E   + F  Y     P S+DWR +G VT VKD
Sbjct: 77  NKYADLRGEEFVQMMNGLKFD----ASRERQGIKFLSYAKFQAPDSVDWRDEGYVTPVKD 132

Query: 146 QGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFII 205
           QGQCG CWAFS   ++EG +  +T  LTSLSEQ LVDC  S  + GCEGGLMD AF++I 
Sbjct: 133 QGQCGSCWAFSTTGSLEGQHFRSTGVLTSLSEQNLVDCSISYGNNGCEGGLMDYAFQYIK 192

Query: 206 SNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKA-VANQPVSVAI 264
            N G+ TE KYPY+A D +C     N  A   SGY DV S +E AL +A  AN P+SVAI
Sbjct: 193 DNLGIDTEDKYPYEAEDDTCRFSPDNVGATD-SGYVDVDSGDEDALKEACAANGPISVAI 251

Query: 265 DASGSDFQFYSSGVFTGQ-CGT-ELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYI 322
           DAS   FQ Y SGV+  + C + ELDHGV  VGYGT   G  YW+VKNSWG +WG+ GYI
Sbjct: 252 DASHESFQLYESGVYDEESCSSIELDHGVLVVGYGTDSVGGDYWIVKNSWGLSWGQEGYI 311

Query: 323 RMQRDIDAKEGLCGIAMQASYPTA 346
            M R+   K+  CGIA  ASYPT 
Sbjct: 312 WMSRN---KDNQCGIATSASYPTV 332


>gi|194719810|emb|CAR31335.1| pro-asclepain f [Gomphocarpus fruticosus subsp. fruticosus]
          Length = 340

 Score =  257 bits (657), Expect = 4e-66,   Method: Compositional matrix adjust.
 Identities = 141/314 (44%), Positives = 197/314 (62%), Gaps = 14/314 (4%)

Query: 39  HEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFN--NKARNKPYKLGINEFADQTNEE 96
           +E W+ ++ ++Y    EK  RF+IFK+N+ YI   N  NK  +  + LG+N+FAD T +E
Sbjct: 34  YEEWLVKHQKLYSSLGEKIKRFEIFKDNLRYIDQQNHYNKVNHMNFTLGLNQFADLTLDE 93

Query: 97  FRAPRNG----YKRRLPSVRSSETTDVSFRYENA-SVPASIDWRKKGAVTGVKDQGQCGC 151
           F +   G    Y++ + S  + +  +     E+   +P S+DWR+KG V  +++QG+CG 
Sbjct: 94  FSSIYLGTSVDYEQIISSNPNHDDVEEDILKEDVVELPDSVDWREKGVVFPIRNQGKCGS 153

Query: 152 CWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLA 211
           CW FSAVA++E +N I    + +LSEQEL+DC+T    QGC+GG  ++AF ++  N G+ 
Sbjct: 154 CWTFSAVASIETLNGIKKGHMIALSEQELLDCETI--SQGCKGGHYNNAFAYVAKN-GIT 210

Query: 212 TEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDF 271
           +E KYPY    G C +KE      KISGY+ VP NN   L  AVA Q VSVA+     DF
Sbjct: 211 SEEKYPYIFRQGQCYQKE---KVVKISGYKRVPRNNGGQLQSAVAQQVVSVAVKCESKDF 267

Query: 272 QFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAK 331
           QFY  G+F+G CG  LDH V  VGYG+   G  YW+++NSWGT WGENGY+R+Q++    
Sbjct: 268 QFYDRGIFSGACGPILDHAVNIVGYGSK-GGANYWIMRNSWGTNWGENGYMRIQKNSKHY 326

Query: 332 EGLCGIAMQASYPT 345
           EG CGIAMQ SYP 
Sbjct: 327 EGHCGIAMQPSYPV 340


>gi|325185016|emb|CCA19507.1| cysteine protease family C01A putative [Albugo laibachii Nc14]
          Length = 492

 Score =  257 bits (657), Expect = 4e-66,   Method: Compositional matrix adjust.
 Identities = 141/312 (45%), Positives = 180/312 (57%), Gaps = 40/312 (12%)

Query: 42  WMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPR 101
           W+  +   + D  E   R + +  N  YI + N   +   +KLG N F+  TNEEFR   
Sbjct: 36  WLKTHHLTFSDAFEYAKRLETYIANDIYILTHN--LQESSFKLGHNAFSHLTNEEFRQRF 93

Query: 102 NGYK-------RRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQCGCCWA 154
           NG+K       +RL   +S+  +  +F+Y +  +P S+DW +KGAVTGVK+QG CG CWA
Sbjct: 94  NGFKASDDYLTKRL--AQSNVASSTNFQYID--LPESVDWVEKGAVTGVKNQGMCGSCWA 149

Query: 155 FSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEA 214
           FS   A+EG   I++ KL SLSEQELVDCD +G D GC GGLMD AF +I  + G+ +E 
Sbjct: 150 FSTTGAIEGATFISSGKLVSLSEQELVDCDHNG-DHGCNGGLMDHAFSWISEHDGICSEE 208

Query: 215 KYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFY 274
            Y Y  S   C  +   P  +                       PV+VAIDA    FQFY
Sbjct: 209 DYAYIHSQSLC--RSCKPVVS-----------------------PVAVAIDAGDRSFQFY 243

Query: 275 SSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGL 334
            SGV+   CGT+LDHGV  VGYG  +DG KYW VKNSWG +WGE GYIR+ RD + + G 
Sbjct: 244 QSGVYNKTCGTQLDHGVLTVGYGV-EDGQKYWKVKNSWGNSWGEKGYIRLSRDQNGRSGQ 302

Query: 335 CGIAMQASYPTA 346
           CGIAM  SYPTA
Sbjct: 303 CGIAMVPSYPTA 314


>gi|11055|emb|CAA45129.1| cysteine proteinase preproenzyme [Homarus americanus]
          Length = 320

 Score =  257 bits (657), Expect = 4e-66,   Method: Compositional matrix adjust.
 Identities = 151/318 (47%), Positives = 194/318 (61%), Gaps = 16/318 (5%)

Query: 33  ATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNK--PYKLGINEFA 90
           AT +   + +  QYGR Y D  E+  R ++F++N + I  FN K  N    +K+ +N+F 
Sbjct: 13  ATASPSWDHFKTQYGRKYGDAKEELYRQRVFQQNEQLIEDFNKKFENGEVTFKVAMNQFG 72

Query: 91  DQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQCG 150
           D TNEEF A   GYK+      S       F  E   + A +DWR K  VT VKDQ QCG
Sbjct: 73  DMTNEEFNAVMKGYKKG-----SRGEPKAVFTAEAGPMAADVDWRTKALVTPVKDQEQCG 127

Query: 151 CCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGL 210
            CWAFSA  A+EG + +   +L SLSEQ+LVDC T   + GC GG M  AF++I  N G+
Sbjct: 128 SCWAFSATGALEGQHFLKNDELVSLSEQQLVDCSTDYGNDGCGGGWMTSAFDYIKDNGGI 187

Query: 211 ATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVAN-QPVSVAIDASGS 269
            TE+ YPY+A D SC + +AN   A  +G  +V  + E AL +AV+   P+SVAIDAS  
Sbjct: 188 DTESSYPYEAEDRSC-RFDANSIGAICTGSVEV-QHTEEALQEAVSGVGPISVAIDASHF 245

Query: 270 DFQFYSSGVFTGQ-CG-TELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRD 327
            FQFYSSGV+  Q C  T LDHGV AVGYGT +    YWLVKNSWG++WG+ GYI+M R+
Sbjct: 246 SFQFYSSGVYYEQNCSPTFLDHGVLAVGYGT-ESTKDYWLVKNSWGSSWGDAGYIKMSRN 304

Query: 328 IDAKEGLCGIAMQASYPT 345
            D     CGIA + SYPT
Sbjct: 305 RDNN---CGIASEPSYPT 319


>gi|387914010|gb|AFK10614.1| cathepsin L [Callorhinchus milii]
 gi|392873762|gb|AFM85713.1| cathepsin L [Callorhinchus milii]
 gi|392877488|gb|AFM87576.1| cathepsin L [Callorhinchus milii]
          Length = 338

 Score =  257 bits (657), Expect = 5e-66,   Method: Compositional matrix adjust.
 Identities = 147/345 (42%), Positives = 200/345 (57%), Gaps = 24/345 (6%)

Query: 10  LVLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEY 69
           +VL+  L  G+ AP        D  ++   E W + +G+ Y +  E+  R  ++++++  
Sbjct: 6   VVLSLCLAGGLAAPSL------DPGLDTHWEQWKSWHGKSY-EQKEETWRRMVWEKHLRV 58

Query: 70  IASFN--NKARNKPYKLGINEFADQTNEEFRAPRNGYKRR--LPSVRSSETTDVSFRYEN 125
           I   N  +      ++LG+N F D  NEEFR   NGYK +     ++ S   + +F    
Sbjct: 59  IEIHNLEHSLGKHSFRLGMNHFGDMPNEEFRQLMNGYKYKQTHKKLQGSHFLEPNF---- 114

Query: 126 ASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDT 185
             VP  +DWR +G VT VKDQGQCG CWAFS   A+EG +   T +L SLSEQ LV+C  
Sbjct: 115 LEVPKHVDWRDEGYVTPVKDQGQCGSCWAFSTTGALEGQHFRRTGQLVSLSEQNLVECSK 174

Query: 186 SGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPS 245
              ++GC GGLMD AF+++  N G+ +E  YPY  +D +        +AA  +G+ D+PS
Sbjct: 175 PEGNEGCNGGLMDQAFQYVKDNGGIDSEDSYPYVGTDDTPCHYNPQYNAANDTGFVDIPS 234

Query: 246 NNEAALMKAVAN-QPVSVAIDASGSDFQFYSSGV-FTGQC-GTELDHGVTAVGYGTAD-- 300
             E ALMKA+A   PVSVAIDA  + FQFY SG+ F  +C  T+LDHGV  VGYG     
Sbjct: 235 GKERALMKAIAAVGPVSVAIDAGHTSFQFYQSGIYFEAECSSTDLDHGVLVVGYGVEKRD 294

Query: 301 -DGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
            DG KYW+VKNSW   WG+NGYI M +D   K+  CGIA  ASYP
Sbjct: 295 TDGKKYWIVKNSWSEKWGQNGYILMAKD---KDNHCGIATAASYP 336


>gi|318037269|ref|NP_001187182.1| cathepsin L precursor [Ictalurus punctatus]
 gi|196475596|gb|ACG76367.1| cathepsin L [Ictalurus punctatus]
          Length = 336

 Score =  257 bits (657), Expect = 5e-66,   Method: Compositional matrix adjust.
 Identities = 145/321 (45%), Positives = 185/321 (57%), Gaps = 16/321 (4%)

Query: 32  DATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFN--NKARNKPYKLGINEF 89
           D  ++   + W   + + Y +  E   R  ++++N++ I   N  +      Y+L +N F
Sbjct: 22  DRELDGHWQQWKEWHNKDYHEKEEGWRRM-VWEKNLKKIELHNLEHSLGKHSYRLAMNHF 80

Query: 90  ADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQC 149
            D  +EEFR   NGYK ++  +R S   + +F       P+ +DWR+KG VT VKDQGQC
Sbjct: 81  GDMPHEEFRQVMNGYKHKVRKIRGSLFMEPNF----LEAPSKLDWREKGYVTPVKDQGQC 136

Query: 150 GCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKG 209
           G CWAFS   AMEG     T KL SLSEQ LVDC     ++GC GGLMD AF++I  N G
Sbjct: 137 GSCWAFSTTGAMEGQQFRKTGKLVSLSEQNLVDCSRPEGNEGCNGGLMDQAFQYIKDNGG 196

Query: 210 LATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAV-ANQPVSVAIDASG 268
           L TE  YPY  +D      + + SAA  +G+ D+PS  E ALMKAV A  PVSVAIDA  
Sbjct: 197 LDTEKFYPYLGTDDQPCHYDPSYSAANDTGFVDIPSGKEHALMKAVTAVGPVSVAIDAGH 256

Query: 269 SDFQFYSSGV-FTGQCGTE-LDHGVTAVGYGTAD---DGTKYWLVKNSWGTTWGENGYIR 323
             FQFY SG+ +   C +E LDHGV  VGYG      DG KYW+VKNSW   WG  GYI 
Sbjct: 257 ESFQFYQSGIYYEADCSSEDLDHGVLVVGYGYEGENVDGKKYWIVKNSWSEQWGNKGYIY 316

Query: 324 MQRDIDAKEGLCGIAMQASYP 344
           M +D   +   CGIA  ASYP
Sbjct: 317 MAKD---RHNHCGIATAASYP 334


>gi|198427748|ref|XP_002130282.1| PREDICTED: similar to predicted protein [Ciona intestinalis]
          Length = 340

 Score =  257 bits (657), Expect = 5e-66,   Method: Compositional matrix adjust.
 Identities = 145/324 (44%), Positives = 198/324 (61%), Gaps = 20/324 (6%)

Query: 35  MNERH----EMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNK--ARNKPYKLGINE 88
           +N++H    + W   + +VY+   E+E +   +  N   I+  N +   + K Y+L +NE
Sbjct: 21  LNQQHVSLFQTWKNLWKKVYQTVEEEEQKMATWFNNWNKISEHNMQYSLKQKSYRLEMNE 80

Query: 89  FADQTNEEFRAPRNGYK-----RRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGV 143
           + D T+EEF +  NGY+     +R  +  S+    +SF  +   +P  +DWRK G VT V
Sbjct: 81  YGDLTSEEFSSMMNGYRNDIRLKRKSTGGSTYLNLLSFGSQ-IQLPTLVDWRKHGLVTPV 139

Query: 144 KDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEF 203
           K+QGQCG CW+FSA  ++EG +   T KL SLSEQ L+DC T   + GC GGLMD AF++
Sbjct: 140 KNQGQCGSCWSFSATGSLEGQHKKKTGKLVSLSEQNLIDCSTPEGNDGCNGGLMDQAFKY 199

Query: 204 IISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVAN-QPVSV 262
           I    G+ TEA YPY+A D +C +     S A  +G+ D+ S +E  L +A A   P+SV
Sbjct: 200 IKIQGGIDTEAYYPYEAKDDTC-RFNITDSGATDTGFVDIKSGDEEMLKEAAATVGPISV 258

Query: 263 AIDASGSDFQFYSSGVF--TGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENG 320
           AIDAS + FQFYS+GV+  T    T LDHGV  VGYGT ++G  YWLVKNSWG  WGE G
Sbjct: 259 AIDASHTSFQFYSNGVYSETACSSTMLDHGVLVVGYGT-ENGKDYWLVKNSWGEGWGEAG 317

Query: 321 YIRMQRDIDAKEGLCGIAMQASYP 344
           YI+M R+ D +   CGIA QASYP
Sbjct: 318 YIKMSRNADNQ---CGIATQASYP 338


>gi|326493706|dbj|BAJ85314.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 365

 Score =  257 bits (657), Expect = 6e-66,   Method: Compositional matrix adjust.
 Identities = 140/334 (41%), Positives = 184/334 (55%), Gaps = 24/334 (7%)

Query: 30  LNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPY------- 82
           L ++ + ER   WM +Y + Y    E+EMRF++FK N   I   + +  N          
Sbjct: 39  LPESEVRERFSKWMIKYSKHYSCKQEEEMRFQVFKNNTNSIGQLDRQNPNPGVGGALGPS 98

Query: 83  --------KLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENASVPASIDW 134
                   K+ +N F D +  E      G      S R++  T + +   ++  P  +DW
Sbjct: 99  GSQVHTFQKVSMNRFGDLSPREVIQQYTGLNTT--SFRTASPTYLPY---HSFKPCCVDW 153

Query: 135 RKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEG 194
           R  GAVTGVK QG CG CWAF+AVAA+EG+N I T +L SLSEQ LVDCDT     GC G
Sbjct: 154 RSSGAVTGVKHQGTCGSCWAFAAVAAIEGMNKIRTGELVSLSEQVLVDCDTV--STGCGG 211

Query: 195 GLMDDAFEFIISNKGLATEAKYPYKASDGSCN-KKEANPSAAKISGYEDVPSNNEAALMK 253
           G  D A   + +  G+ +E +YPY    G C+  K      A I G++ VPSNNEA L  
Sbjct: 212 GHSDSAMALVAARGGITSEERYPYAGFQGKCDVDKLMFDHQASIKGFKAVPSNNEAQLAI 271

Query: 254 AVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTAD-DGTKYWLVKNSW 312
           AVA QPV+V IDASGS FQFYS G++ G C   ++H VT VGY     +G KYW+ KNSW
Sbjct: 272 AVAMQPVTVYIDASGSAFQFYSGGIYRGPCSANVNHAVTIVGYCEGPGEGNKYWIAKNSW 331

Query: 313 GTTWGENGYIRMQRDIDAKEGLCGIAMQASYPTA 346
              WGE GY+ + +D+    G CG+A    YPTA
Sbjct: 332 SNDWGEQGYVYLAKDVAWSTGTCGLATSPFYPTA 365


>gi|308810026|ref|XP_003082322.1| cysteine protease-1 (ISS) [Ostreococcus tauri]
 gi|116060790|emb|CAL57268.1| cysteine protease-1 (ISS) [Ostreococcus tauri]
          Length = 430

 Score =  257 bits (656), Expect = 6e-66,   Method: Compositional matrix adjust.
 Identities = 148/338 (43%), Positives = 191/338 (56%), Gaps = 36/338 (10%)

Query: 38  RH-EMWMAQYG--RVYRDNAEKEMRFKIFKENVEYIASFNN--KARNKPYKLGINEFADQ 92
           RH E W +++G  R  RD  E   R   F EN  Y+   N         + +G+N  A  
Sbjct: 96  RHFERWCSEHGLERYLRDTEEYAKRLATFAENAAYVVEHNALYAIGEVSHWVGLNSLAAT 155

Query: 93  TNEEFRAPRNGYKRRLPSVRSS--------------ETTDVSFRYENASVPASIDWRKKG 138
           T EE+RA   GYK   P +RSS              E    S+ Y +   P +IDW + G
Sbjct: 156 TREEYRALL-GYK---PELRSSGDAEMLEATSTDKVEQYKASWEYASVDPPEAIDWVELG 211

Query: 139 AVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMD 198
           AVT  K+QGQCG CWAFS   A+EGI  I T +L SLSEQE+V C  S ++ GC GGLMD
Sbjct: 212 AVTPPKNQGQCGSCWAFSTTGAVEGITKIRTGRLVSLSEQEMVSC--SKQNMGCNGGLMD 269

Query: 199 DAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQ 258
            AF +I+ N G+ +E +YPY A   +CN+ +     A I G++DVP  +E  L KAV+ Q
Sbjct: 270 YAFRWIVKNGGIDSEFQYPYSAEALACNRWKLQLHVATIDGFKDVPPGDEKELEKAVSQQ 329

Query: 259 PVSVAIDASGSDFQFYSSGVF-TGQCGTELDHGVTAVGYG---TADDGTK-------YWL 307
           PVS+AI+A    FQ Y  GV+ + +CG+++DHGV  VGYG   T  + TK       +W 
Sbjct: 330 PVSIAIEADTKSFQLYDGGVYDSKECGSQVDHGVLVVGYGFDDTHHNATKHHKRHRHFWK 389

Query: 308 VKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPT 345
           VKNSWG TWGE G+IRM R I  + G CGI    SYPT
Sbjct: 390 VKNSWGGTWGEGGFIRMARRISDETGQCGITTAPSYPT 427


>gi|281203744|gb|EFA77940.1| hypothetical protein PPL_08585 [Polysphondylium pallidum PN500]
          Length = 505

 Score =  257 bits (656), Expect = 6e-66,   Method: Compositional matrix adjust.
 Identities = 148/372 (39%), Positives = 217/372 (58%), Gaps = 38/372 (10%)

Query: 1   MAMILLE--NKLVLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEM 58
            ++I L+  N+ +   +L+ G+ A  + +   ++       E W+ ++ + Y D +E + 
Sbjct: 142 FSIIFLKIMNRYINILLLIFGLIAISN-ALLFSEEQYKNEFENWIDRFEKKY-DVSEFKK 199

Query: 59  RFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRR--LPSVRSSET 116
           RF IFK N++++ S+N+K  N    LG+N  AD TN E+R    G  ++  L +  + E 
Sbjct: 200 RFSIFKSNMDFVHSWNSK--NSQTVLGLNHLADLTNLEYRQFYLGTHKKAVLGTPGNHEV 257

Query: 117 TDVSFRYENASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLS 176
           +++   + ++   A++DWR+KGAV+ +KDQGQCG CW+FS   ++EG + I +  +  LS
Sbjct: 258 SNLQSVFGDS---ATVDWRQKGAVSPIKDQGQCGSCWSFSTTGSVEGAHQIKSGNMVELS 314

Query: 177 EQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAK 236
           EQ LVDC TS  + GC GGLMD AFE+II+N G+ TE+ YPY AS G+  K     S A 
Sbjct: 315 EQNLVDCSTSEGNMGCNGGLMDYAFEYIITNNGIDTESSYPYTASSGTTCKYNKANSGAT 374

Query: 237 ISGYEDVPSNNEAALMKAVANQ-PVSVAIDASGSDFQFYSSGV-FTGQCGT-ELDHGVTA 293
           IS Y+++ + +E+ L  AV N  PVSVAIDAS + FQ YS G+ +   C +  LDHGV  
Sbjct: 375 ISSYKNITAGSESDLADAVKNAGPVSVAIDASHNSFQLYSHGIYYDASCSSVNLDHGVLV 434

Query: 294 VGYGT---------------------ADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKE 332
           VGYG+                      DD   YW+VKNSWGT+WG+ G+I M +D D   
Sbjct: 435 VGYGSGTPDSDSRVHKGSQVRVKVPKTDDTKNYWIVKNSWGTSWGDKGFIYMSKDRDNN- 493

Query: 333 GLCGIAMQASYP 344
             CGIA  ASYP
Sbjct: 494 --CGIASCASYP 503


>gi|28194647|gb|AAO33585.1|AF479267_1 cathepsin L [Mesocricetus auratus]
          Length = 333

 Score =  257 bits (656), Expect = 6e-66,   Method: Compositional matrix adjust.
 Identities = 148/323 (45%), Positives = 197/323 (60%), Gaps = 19/323 (5%)

Query: 32  DATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKAR--NKPYKLGINEF 89
           + T N +   W + + R+Y D  E+E R  ++++N++ I   N +       + + +N F
Sbjct: 22  NQTFNAQWHKWKSTHRRLY-DTNEEEWRRAVWEKNMKMIELHNGEYSEGKHGFTMEMNAF 80

Query: 90  ADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQC 149
            D TNEEFR   NGYK +    R  +        +   +P S+DWR+KG VT VK+QGQC
Sbjct: 81  GDMTNEEFRQLVNGYKHQ--KHRKGKLFQEPLMLQ---LPKSVDWREKGCVTPVKNQGQC 135

Query: 150 GCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKG 209
           G CWAFSA  A+EG   + T  L SLSEQ LVDC     +QGC GGLMD AF+++++NKG
Sbjct: 136 GSCWAFSACGALEGQMCLKTGVLVSLSEQNLVDCSRGEGNQGCNGGLMDFAFQYVLNNKG 195

Query: 210 LATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVAN-QPVSVAIDASG 268
           L +E  YPY+A DG+C K +   +AA  +GY D+P   E ALMKAVA   P++VAIDAS 
Sbjct: 196 LDSEESYPYEAKDGTC-KYKPEFAAANDTGYVDIPQ-LEKALMKAVATVGPIAVAIDASH 253

Query: 269 SDFQFYSSGV-FTGQCGT-ELDHGVTAVGY---GTADDGTKYWLVKNSWGTTWGENGYIR 323
             FQFYSSG+ F   C + +LDHGV  +GY   GT  +  KYW+VKNSWGT WG  G+  
Sbjct: 254 PSFQFYSSGIYFEPNCSSKDLDHGVLVIGYGFEGTDSNKKKYWIVKNSWGTGWGMGGFFH 313

Query: 324 MQRDIDAKEGLCGIAMQASYPTA 346
           + +D   K   CGIA  ASYPT 
Sbjct: 314 IAKD---KNNHCGIATAASYPTV 333


>gi|326672297|ref|XP_003199631.1| PREDICTED: cathepsin L1-like [Danio rerio]
          Length = 336

 Score =  257 bits (656), Expect = 7e-66,   Method: Compositional matrix adjust.
 Identities = 148/344 (43%), Positives = 199/344 (57%), Gaps = 24/344 (6%)

Query: 11  VLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYI 70
           +L  + +  V+A  S      D  +++    W +Q+G+ Y ++ E   R  I++EN+  I
Sbjct: 5   LLVTLCISAVFAASSI-----DIQLDDHWNSWKSQHGKSYHEDVEVGRRM-IWEENLRKI 58

Query: 71  --ASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENA-- 126
              +F     N  +K+G+N+F D TNEEFR   NGYK        ++T+      E +  
Sbjct: 59  EQHNFEYSYGNHTFKMGMNQFGDMTNEEFRHAMNGYKHD-----PNQTSQGPLFMEPSFF 113

Query: 127 SVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTS 186
           + P  +DWR++G VT VKDQ QCG CW+FS+  A+EG     T KL S+SEQ LVDC   
Sbjct: 114 AAPQQVDWRQRGYVTPVKDQKQCGSCWSFSSTGALEGQLFRKTGKLISMSEQNLVDCSRP 173

Query: 187 GEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSN 246
             +QGC GGLMD AF+++  NKGL +E  YPY A D    + +   + AKI+G+ D+P  
Sbjct: 174 HGNQGCNGGLMDQAFQYVKENKGLDSEQSYPYLARDDLPCRYDPRFNVAKITGFVDIPKG 233

Query: 247 NEAALMKAVANQ-PVSVAIDASGSDFQFYSSGVFTGQC--GTELDHGVTAVGYGT--AD- 300
           NE ALM AVA   PVSVAIDAS    QFY SG++  +    + LDH V  VGYG   AD 
Sbjct: 234 NELALMNAVAAVGPVSVAIDASHQSLQFYQSGIYYERACSSSRLDHAVLVVGYGYQGADV 293

Query: 301 DGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
            G +YW+VKNSW   WG+ GYI M +D   K   CGIA  ASYP
Sbjct: 294 AGNRYWIVKNSWSDKWGDKGYIYMAKD---KNNHCGIATMASYP 334


>gi|357130486|ref|XP_003566879.1| PREDICTED: fruit bromelain-like [Brachypodium distachyon]
          Length = 354

 Score =  257 bits (656), Expect = 7e-66,   Method: Compositional matrix adjust.
 Identities = 141/323 (43%), Positives = 189/323 (58%), Gaps = 15/323 (4%)

Query: 34  TMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQT 93
           T+  RHE WMA++GR Y+D  EK  R ++F  N  ++ + N ++ N+ Y LG+N F+D T
Sbjct: 33  TVASRHERWMARFGRAYKDADEKARRQEVFGANARHVDAVN-RSGNRTYTLGLNHFSDLT 91

Query: 94  NEEFRAPRNGYKRRLPS---VRSSETTDVSFRYENAS----VPASIDWRKKGAVTGVKDQ 146
           + EF     GY+   P    +   E  D+S     A     VP S+DWR +GAVT +K+Q
Sbjct: 92  DHEFLQQHLGYRHHQPGPGGLLRPEDQDMSKATALADYGQDVPDSVDWRAQGAVTEIKNQ 151

Query: 147 GQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIIS 206
             CG CWAF+AVAA EG+  I T  L S+SEQ+++DC   G    C+GG ++ A  ++ +
Sbjct: 152 RSCGSCWAFAAVAATEGLVKIATGNLISMSEQQVLDCTGGGNT--CDGGDINAALRYVAA 209

Query: 207 NKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVP-SNNEAALMKAVANQPVSVAID 265
           + GL  EA Y Y A  G+C       SAA + G        +E AL    A QPV+VA++
Sbjct: 210 SGGLQPEAAYAYAAQKGACRGASPANSAASVGGARFARLGGDEGALRGLAAGQPVAVALE 269

Query: 266 ASGSDFQFYSSGVFTG--QCGTELDHGVTAVGYGTADD-GTKYWLVKNSWGTTWGENGYI 322
           AS  DF+ Y SGV+ G   CG  L+HGVT VGYG  DD G +YW+VKN WGT WGE GY+
Sbjct: 270 ASEPDFRHYKSGVYAGSASCGRRLNHGVTVVGYGAEDDSGDEYWVVKNQWGTLWGEKGYM 329

Query: 323 RMQRDIDAKEGLCGIAMQASYPT 345
           R+ R  D     CGIA  A YPT
Sbjct: 330 RVARG-DVAGANCGIASYAYYPT 351


>gi|1709576|sp|P05994.3|PAPA4_CARPA RecName: Full=Papaya proteinase 4; AltName: Full=Glycyl
           endopeptidase; AltName: Full=Papaya peptidase B;
           AltName: Full=Papaya proteinase IV; Short=PPIV; Flags:
           Precursor
 gi|953176|emb|CAA54974.1| proteinase IV [Carica papaya]
          Length = 348

 Score =  256 bits (655), Expect = 7e-66,   Method: Compositional matrix adjust.
 Identities = 150/354 (42%), Positives = 206/354 (58%), Gaps = 19/354 (5%)

Query: 1   MAMILLENKLVLAAILVLGVWAPQSWSRTL-----NDATMNER----HEMWMAQYGRVYR 51
           MA+I   +KL+  AI + G  +      ++     +D T  ER       WM ++ + Y+
Sbjct: 1   MAIICSFSKLLFVAICLFGHMSLSYCDFSIVGYSQDDLTSTERLIQLFNSWMLKHNKNYK 60

Query: 52  DNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSV 111
           +  EK  RF+IFK+N++YI    NK  N  Y LG+NEF+D +N+EF+     Y   LP  
Sbjct: 61  NVDEKLYRFEIFKDNLKYIDE-RNKMING-YWLGLNEFSDLSNDEFKEK---YVGSLPED 115

Query: 112 RSSETTDVSFRYEN-ASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTR 170
            +++  D  F  E+   +P S+DWR KGAVT VK QG C  CWAFS VA +EGIN I T 
Sbjct: 116 YTNQPYDEEFVNEDIVDLPESVDWRAKGAVTPVKHQGYCESCWAFSTVATVEGINKIKTG 175

Query: 171 KLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEA 230
            L  LSEQELVDCD   +  GC  G    + +++  N G+   AKYPY A   +C   + 
Sbjct: 176 NLVELSEQELVDCDK--QSYGCNRGYQSTSLQYVAQN-GIHLRAKYPYIAKQQTCRANQV 232

Query: 231 NPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHG 290
                K +G   V SNNE +L+ A+A+QPVSV ++++G DFQ Y  G+F G CGT++DH 
Sbjct: 233 GGPKVKTNGVGRVQSNNEGSLLNAIAHQPVSVVVESAGRDFQNYKGGIFEGSCGTKVDHA 292

Query: 291 VTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
           VTAVGYG +       L+KNSWG  WGENGYIR++R      G+CG+   + YP
Sbjct: 293 VTAVGYGKSGGKGYI-LIKNSWGPGWGENGYIRIRRASGNSPGVCGVYRSSYYP 345


>gi|443698586|gb|ELT98517.1| hypothetical protein CAPTEDRAFT_128252 [Capitella teleta]
          Length = 324

 Score =  256 bits (655), Expect = 8e-66,   Method: Compositional matrix adjust.
 Identities = 146/318 (45%), Positives = 195/318 (61%), Gaps = 17/318 (5%)

Query: 32  DATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKAR--NKPYKLGINEF 89
           D  ++E   ++   + + Y   AE   RF I++ ++  I   N +A      + LG+NE+
Sbjct: 17  DEALDEMWTLFKTTHSKTYATEAEDMRRF-IWERHLNMINQHNIEADLGKHTFSLGMNEY 75

Query: 90  ADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQC 149
            D T  E+ A  +GYK    SV SS         EN  VP ++DWR+KG VT VK+QGQC
Sbjct: 76  GDLTQHEY-AAMSGYKMAKSSVGSS-----FLEPENLQVPKTVDWREKGYVTPVKNQGQC 129

Query: 150 GCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKG 209
           G CWAFS+  ++EG     T +L S+SEQ LVDC     + GC GGLMD+AF +I  N G
Sbjct: 130 GSCWAFSSTGSLEGQVFRKTGRLPSISEQNLVDCSRDEGNMGCSGGLMDNAFTYIKKNMG 189

Query: 210 LATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVAN-QPVSVAIDASG 268
           + +E  YPY+A DG C  K+++ S    SG+ D+P  +E AL  AVA+  PVSVAIDAS 
Sbjct: 190 IDSEKSYPYEAVDGECRYKKSD-SVTTDSGFVDIPHGDETALRTAVASVGPVSVAIDASH 248

Query: 269 SDFQFYSSGVFT-GQC-GTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQR 326
           + FQFY +GV+T   C  T+LDHGV  VGYG  ++G  YWLVKNSWG +WGE GYI++ R
Sbjct: 249 TSFQFYKTGVYTEANCSSTQLDHGVLVVGYGV-ENGQDYWLVKNSWGASWGEAGYIKLAR 307

Query: 327 DIDAKEGLCGIAMQASYP 344
           +   +   CGIA QASYP
Sbjct: 308 NHGNQ---CGIASQASYP 322


>gi|332260024|ref|XP_003279085.1| PREDICTED: cathepsin L1 isoform 3 [Nomascus leucogenys]
 gi|441593306|ref|XP_004087072.1| PREDICTED: cathepsin L1 [Nomascus leucogenys]
 gi|441593309|ref|XP_004087073.1| PREDICTED: cathepsin L1 [Nomascus leucogenys]
          Length = 333

 Score =  256 bits (655), Expect = 8e-66,   Method: Compositional matrix adjust.
 Identities = 154/348 (44%), Positives = 201/348 (57%), Gaps = 25/348 (7%)

Query: 8   NKLVLAAILVLGVWAPQSWSRTLN-DATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKEN 66
           N  ++ A   LG+      S TL  D ++  +   W A + R+Y  N E+  R  ++++N
Sbjct: 2   NPTLILAAFCLGIA-----SATLTFDHSLEAQWTKWKAMHNRLYGMN-EEGWRRAVWEKN 55

Query: 67  VEYIASFNNKAR--NKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYE 124
           ++ I   N + R     + + +N F D T+EEFR   NG++ R P  R  +       YE
Sbjct: 56  MKMIEQHNQEYREGKHSFTMAMNAFGDMTSEEFRQVMNGFQNRKP--RKGKVFQEPLFYE 113

Query: 125 NASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCD 184
               P S+DWR+KG VT VK+QGQCG CWAFSA  A+EG     T KL SLSEQ LVDC 
Sbjct: 114 ---APRSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCS 170

Query: 185 TSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVP 244
               ++GC GGLMD AF+++  N GL +E  YPY+A++ SC K     S A  +G+ D+P
Sbjct: 171 GPQGNEGCNGGLMDYAFQYVQDNGGLDSEESYPYEATEESC-KYNPKYSVANDTGFVDIP 229

Query: 245 SNNEAALMKAVAN-QPVSVAIDASGSDFQFYSSGV-FTGQCGTE-LDHGVTAVGYG---T 298
              E ALMKAVA   P+SVA+DA    FQFY  G+ F   C +E +DHGV  VGYG   T
Sbjct: 230 K-QEKALMKAVATVGPISVAVDAGHQSFQFYKEGIYFEPDCSSEDMDHGVLVVGYGFEST 288

Query: 299 ADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPTA 346
             D  KYWLVKNSWG  WG  GYI+M +D   +   CGIA  ASYPT 
Sbjct: 289 ESDNNKYWLVKNSWGEEWGMGGYIKMAKD---RRNHCGIASAASYPTV 333


>gi|38147395|gb|AAR12010.1| cathepsin L-like proteinase [Triatoma infestans]
          Length = 328

 Score =  256 bits (655), Expect = 9e-66,   Method: Compositional matrix adjust.
 Identities = 144/315 (45%), Positives = 186/315 (59%), Gaps = 18/315 (5%)

Query: 40  EMWMA---QYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNK--PYKLGINEFADQTN 94
           E W+A   Q+G+ Y+++ E+  R  ++KEN   I   N +  N    YKL +N F D   
Sbjct: 24  EEWLAFKAQFGKSYKNSFEELFRMNVYKENQRKIDEHNKRYENGEVSYKLKMNHFGDLMQ 83

Query: 95  EEFRAPRNGYKRRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQCGCCWA 154
            EF+A  N  KR      S E     FR     +PA +DWR+KGAVT VKD GQCG CWA
Sbjct: 84  HEFKA-LNKLKRSAKQQNSGEV----FRATGGKLPAKVDWRQKGAVTPVKDPGQCGSCWA 138

Query: 155 FSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEA 214
           FS+  ++ G   +  +KL SLSEQ+LVDC  +  + GC+GG+M  AF++I  N G+ TE 
Sbjct: 139 FSSTGSLGGQLFLKNKKLVSLSEQQLVDCSGNYGNDGCDGGIMVQAFQYIKGNGGIDTEG 198

Query: 215 KYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVAN-QPVSVAIDASGSDFQF 273
            YPY+A D  C  K     A    GY D+   +E AL +AVA   P+SVAIDA    FQF
Sbjct: 199 SYPYEAEDDKCRYK-TKSVAGTDKGYVDIAQGDENALKEAVAEIGPISVAIDAGNLSFQF 257

Query: 274 YSSGVFTGQ--CGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAK 331
           YS G++       TELDHGV  VGYGT ++G  YWLVKNSWG +WGENGYI++ R+ +  
Sbjct: 258 YSEGIYDEPFCSNTELDHGVLVVGYGT-ENGQDYWLVKNSWGPSWGENGYIKIARNHNNH 316

Query: 332 EGLCGIAMQASYPTA 346
              CGIA  ASYP  
Sbjct: 317 ---CGIASMASYPIV 328


>gi|148224022|ref|NP_001087489.1| cathepsin L2 precursor [Xenopus laevis]
 gi|51258284|gb|AAH80004.1| MGC81823 protein [Xenopus laevis]
          Length = 335

 Score =  256 bits (655), Expect = 9e-66,   Method: Compositional matrix adjust.
 Identities = 148/352 (42%), Positives = 201/352 (57%), Gaps = 27/352 (7%)

Query: 1   MAMILLENKLVLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRF 60
           MA+ L+   L L  +      AP +      D  +++   +W   + + Y    E   R 
Sbjct: 1   MALYLVAAALCLTTVFA----APTT------DPALDDHWHLWKNWHKKSYLPKEEGWRRV 50

Query: 61  KIFKENVEYIASFN--NKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTD 118
            ++++N+  I   N  +      Y+LG+N+F D TNEEFR   NGYK +     S+    
Sbjct: 51  -LWEKNLRTIEFHNLDHSLGKHSYRLGMNQFGDMTNEEFRQLMNGYKNQKMIKGSTFLAP 109

Query: 119 VSFRYENASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQ 178
            +F       P ++DWR+KG VT VKDQGQCG CWAFS   A+EG ++    KL SLSEQ
Sbjct: 110 NNFE-----APKTVDWREKGYVTPVKDQGQCGSCWAFSTTGALEGQHYRKAGKLISLSEQ 164

Query: 179 ELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKIS 238
            LVDC  +  +QGC GGLMD AF+++  N G+ +E  YPY A D      + N ++A  +
Sbjct: 165 NLVDCSRAQGNQGCNGGLMDQAFQYVKDNGGIDSEDSYPYTAKDDQECHYDPNYNSANDT 224

Query: 239 GYEDVPSNNEAALMKAVAN-QPVSVAIDASGSDFQFYSSGVFTG-QCGTE-LDHGVTAVG 295
           G+ DVPS +E  LMKAVA+  PVSVA+DA    FQFY SG++   +C +E LDHGV  VG
Sbjct: 225 GFVDVPSGSEKDLMKAVASVGPVSVAVDAGHKSFQFYQSGIYYDPECSSEDLDHGVLVVG 284

Query: 296 YGTAD---DGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
           YG      DG +YW+VKNSW   WG NGYI++ +D   +   CGIA  ASYP
Sbjct: 285 YGFEGEDVDGKRYWIVKNSWSEKWGNNGYIKIAKD---RHNHCGIATAASYP 333


>gi|119433808|gb|ABL74967.1| cysteine protease [Acanthamoeba castellanii]
          Length = 330

 Score =  256 bits (654), Expect = 1e-65,   Method: Compositional matrix adjust.
 Identities = 145/307 (47%), Positives = 186/307 (60%), Gaps = 11/307 (3%)

Query: 42  WMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPR 101
           WM  + + Y  N E   R+ +++EN  +I   N K  N  Y L +N+F D TN EF    
Sbjct: 33  WMRTHTKSY-SNEEFVFRWNVWRENYNFIQEENRK--NNSYYLTMNKFGDLTNAEFNKVY 89

Query: 102 NGYKRRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAM 161
            G      S    +    +       +PA+ DWR+KGAVT VK+QGQCG CW+FS   + 
Sbjct: 90  KGLAFDY-SAHILKAKAATPAAPAPGLPANFDWRQKGAVTHVKNQGQCGSCWSFSTTGST 148

Query: 162 EGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKAS 221
           EG N +    L SLSEQ L+DC  S  + GC GGLMD AFE+II+NKG+ TEA YPY+ +
Sbjct: 149 EGANFLKRGTLVSLSEQNLIDCSGSYGNNGCNGGLMDYAFEYIINNKGIDTEASYPYETA 208

Query: 222 DGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVF-- 279
             +C    AN S   ++ Y DV S +E AL+ AVA +P SVAIDAS + FQFYS GV+  
Sbjct: 209 QYNCRYNPAN-SGGSLTSYTDVSSGDENALLNAVAIEPTSVAIDASHNSFQFYSGGVYYE 267

Query: 280 TGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAM 339
           +    T+LDHGV AVG+GT ++G  YWLVKNSWG  WG  GYI+M R+   +   CGIA 
Sbjct: 268 SSCSSTQLDHGVLAVGWGT-ENGQDYWLVKNSWGADWGLQGYIKMARN---RHNNCGIAT 323

Query: 340 QASYPTA 346
            ASYPTA
Sbjct: 324 AASYPTA 330


>gi|342305188|dbj|BAK55648.1| cathepsin L [Oplegnathus fasciatus]
          Length = 336

 Score =  256 bits (654), Expect = 1e-65,   Method: Compositional matrix adjust.
 Identities = 148/342 (43%), Positives = 197/342 (57%), Gaps = 17/342 (4%)

Query: 12  LAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIA 71
           +  + VL V    + S    D  ++E  ++W + + + Y +  E   R  ++++N++ I 
Sbjct: 1   MLPVAVLAVCLSAALSAPSLDPQLDEHWDLWKSWHTKKYHEKEEGWRRM-VWEKNLKKIE 59

Query: 72  SFN--NKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLP-SVRSSETTDVSFRYENASV 128
             N  +      Y+LG+N F D T+EEFR    GYKR+     + S   + +F       
Sbjct: 60  LHNLEHSMGEHTYRLGMNHFGDMTHEEFRQIMYGYKRKSERKFKGSLFMEPNF----LEA 115

Query: 129 PASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGE 188
           P S+DWR  G VT VKDQGQCG CWAFS   AMEG +   T KL SLSEQ LVDC     
Sbjct: 116 PRSVDWRDNGYVTPVKDQGQCGSCWAFSTTGAMEGQHFRKTGKLVSLSEQNLVDCSRPEG 175

Query: 189 DQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNE 248
           ++GC GGLMD AF++I  N+GL +E  YPY  +D      +   ++A  +G+ D+PS  E
Sbjct: 176 NEGCNGGLMDQAFQYIKDNQGLDSEDSYPYLGTDDQPCHYDPKYNSANDTGFIDIPSGKE 235

Query: 249 AALMKAVAN-QPVSVAIDASGSDFQFYSSGV-FTGQCGT-ELDHGVTAVGYGTAD---DG 302
            ALMKAVA   PVSVAIDA    FQFY SG+ +  +C + ELDHGV  VGYG      DG
Sbjct: 236 RALMKAVAAVGPVSVAIDAGHESFQFYQSGIYYEKECSSEELDHGVLVVGYGFEGEDVDG 295

Query: 303 TKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
            KYW+VKNSW   WG+ GYI M +D   ++  CGIA  ASYP
Sbjct: 296 KKYWIVKNSWSEKWGDKGYIYMAKD---RKNHCGIATAASYP 334


>gi|157787177|ref|NP_001099150.1| cathepsin L1-like precursor [Danio rerio]
 gi|157422879|gb|AAI53505.1| MGC174152 protein [Danio rerio]
          Length = 336

 Score =  256 bits (654), Expect = 1e-65,   Method: Compositional matrix adjust.
 Identities = 148/344 (43%), Positives = 199/344 (57%), Gaps = 24/344 (6%)

Query: 11  VLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYI 70
           +L  + +  V+A  S      D  +++    W +Q+G+ Y ++ E   R  I++EN+  I
Sbjct: 5   LLVTLCISAVFAASSI-----DIQLDDHWNSWKSQHGKSYHEDVEVGRRM-IWEENLRKI 58

Query: 71  --ASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENA-- 126
              +F     N  +K+G+N+F D TNEEFR   NGYK        ++T+      E +  
Sbjct: 59  EQHNFEYSYGNHTFKMGMNQFGDMTNEEFRHAMNGYKHD-----PNQTSQGPLFMEPSFF 113

Query: 127 SVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTS 186
           + P  +DWR++G VT VKDQ QCG CW+FS+  A+EG     T KL S+SEQ LVDC   
Sbjct: 114 AAPQQVDWRQRGYVTPVKDQKQCGSCWSFSSTGALEGQLFRKTGKLISMSEQNLVDCSRP 173

Query: 187 GEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSN 246
             +QGC GGLMD AF+++  NKGL +E  YPY A D    + +   + AKI+G+ D+P  
Sbjct: 174 QGNQGCNGGLMDQAFQYVKENKGLDSEQSYPYLARDDLPCRYDPRFNVAKITGFVDIPRG 233

Query: 247 NEAALMKAVANQ-PVSVAIDASGSDFQFYSSGVFTGQC--GTELDHGVTAVGYGT--AD- 300
           NE ALM AVA   PVSVAIDAS    QFY SG++  +    + LDH V  VGYG   AD 
Sbjct: 234 NELALMNAVAAVGPVSVAIDASHQSLQFYQSGIYYERACSSSRLDHAVLVVGYGYQGADV 293

Query: 301 DGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
            G +YW+VKNSW   WG+ GYI M +D   K   CGIA  ASYP
Sbjct: 294 AGNRYWIVKNSWSDKWGDKGYIYMAKD---KNNHCGIATMASYP 334


>gi|428170119|gb|EKX39047.1| hypothetical protein GUITHDRAFT_154556 [Guillardia theta CCMP2712]
          Length = 352

 Score =  256 bits (654), Expect = 1e-65,   Method: Compositional matrix adjust.
 Identities = 154/354 (43%), Positives = 203/354 (57%), Gaps = 25/354 (7%)

Query: 10  LVLAAILVLGVWAPQSWSRTLN-DATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVE 68
           L+LAAI           S+T + D  ++     W  ++ +VY D AE   RF +FK N+E
Sbjct: 5   LLLAAIAATCAIPTSPASKTSSVDDEIHLAFISWKNKFEKVY-DGAEHLARFAVFKANME 63

Query: 69  YIASFNNKARNKPYKLG-------INEFADQTNEEFRAPRNGYKRRLPSVRSSETTD--- 118
            I     +A N  Y+LG        N+FAD T EEF+    GYK  L   R  +  +   
Sbjct: 64  II-----RAHNALYELGEETFSMAANQFADMTAEEFKRTVLGYKPELKGKRLLQGLNSGK 118

Query: 119 -VSFRYENASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSE 177
             + R  N++ P +IDWR K AVT VK+QGQCG CW+FS   A+EG   +    L SLSE
Sbjct: 119 NCTHRSNNSTRPKAIDWRTKSAVTPVKNQGQCGSCWSFSTTGAVEGAWVVAGHPLISLSE 178

Query: 178 QELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGS---CNKKEANPSA 234
           +ELV CDT   DQGC GGLMD+A+ +II N G+A E  YPY + +G+   C+    +   
Sbjct: 179 EELVQCDTK-SDQGCNGGLMDNAYAWIIQNGGIAAEDVYPYISGNGTTGVCHVAFLSKKV 237

Query: 235 AKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTG-QCGTELDHGVTA 293
           A IS + D+   +E+ L  A+  QPV+VAI+A  S FQFY+ GV    +CGT+LDHGV A
Sbjct: 238 ASISDWCDLKPEDESDLELALVQQPVAVAIEADQSSFQFYNGGVLPAKKCGTKLDHGVLA 297

Query: 294 VGYG-TADDGTKYWLVKNSWGTTWGENGYIRMQR-DIDAKEGLCGIAMQASYPT 345
           VGYG        YW+VKNSWG  WG+ GYIR+++     K   CGIA  ASYPT
Sbjct: 298 VGYGYDKKHKMHYWIVKNSWGAEWGDEGYIRLEKMPKKTKHSACGIAKAASYPT 351


>gi|417399134|gb|JAA46597.1| Putative cathepsin l1 [Desmodus rotundus]
          Length = 335

 Score =  256 bits (654), Expect = 1e-65,   Method: Compositional matrix adjust.
 Identities = 153/350 (43%), Positives = 207/350 (59%), Gaps = 26/350 (7%)

Query: 6   LENKLVLAAILVLGVWA--PQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIF 63
           ++  L+LAA L LG+ +  P+       D ++N     W A Y R+Y  + E+  R  ++
Sbjct: 1   MKTSLLLAA-LCLGIASAIPKF------DHSLNAEWYQWKATYRRLYGAD-EEGWRRAVW 52

Query: 64  KENVEYIASFNNK--ARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSF 121
           ++N + I   N +   R   + + +N F D TNEEFR   NG+ ++          +  F
Sbjct: 53  EKNRKMIELHNREYSQRKHGFTMAMNAFGDMTNEEFRQVMNGFLKQKQHRNGRLFREPLF 112

Query: 122 RYENASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELV 181
               A +P+S+DWR+KG VT VK+QGQCG CWAFSA  A+EG     T KL SLSEQ LV
Sbjct: 113 ----AEIPSSVDWRQKGYVTPVKNQGQCGSCWAFSANGALEGQMFRKTGKLVSLSEQNLV 168

Query: 182 DCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYE 241
           DC  S  +QGC GGLMD+AF+++  NKGL +E  YPY   + +        SAA  +G+ 
Sbjct: 169 DCSHSQGNQGCNGGLMDNAFQYVKDNKGLDSEESYPYLGRESNTCNYRPEYSAANDTGFV 228

Query: 242 DVPSNNEAALMKAVAN-QPVSVAIDASGSDFQFYSSGV-FTGQCGT-ELDHGVTAVGYGT 298
           D+P  +E  LMKAVA   P+SVAIDA  S FQFYS G+ +   C + +LDHGV  VGYG+
Sbjct: 229 DIP-QHERGLMKAVATVGPISVAIDAGHSSFQFYSEGIYYEPNCSSKDLDHGVLVVGYGS 287

Query: 299 ---ADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPT 345
                D  K+W+VKNSWGT WG +GY++M RD   +   CGIA  ASYPT
Sbjct: 288 EGAQSDSNKFWIVKNSWGTGWGMSGYVKMARD---QSNHCGIATAASYPT 334


>gi|164420679|ref|NP_001037464.2| fibroinase precursor [Bombyx mori]
 gi|40556818|gb|AAR87763.1| fibroinase precursor [Bombyx mori]
          Length = 341

 Score =  256 bits (654), Expect = 1e-65,   Method: Compositional matrix adjust.
 Identities = 147/346 (42%), Positives = 189/346 (54%), Gaps = 21/346 (6%)

Query: 10  LVLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEY 69
           LVL    V  V A Q +        + E    +  Q+   Y    E   R KI+ E+   
Sbjct: 4   LVLLLCAVAAVSAVQFFD------LVKEEWSAFKLQHRLNYESEVEDNFRMKIYAEHKHI 57

Query: 70  IASFNNKARNK--PYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFR----- 122
           IA  N K       YKLG+N++ D  + EF    NG+ +     ++      S R     
Sbjct: 58  IAKHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFI 117

Query: 123 -YENASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELV 181
              N  +P  +DWRK GAVT +KDQG+CG CW+FS   A+EG +   +  L SLSEQ L+
Sbjct: 118 SPANVKLPEQVDWRKHGAVTDIKDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLI 177

Query: 182 DCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYE 241
           DC     + GC GGLMD+AF++I  N G+ TE  YPY+  D  C     N  A  + G+ 
Sbjct: 178 DCSEQYGNNGCNGGLMDNAFKYIKDNGGIDTEQTYPYEGVDDKCRYNPKNTGAEDV-GFV 236

Query: 242 DVPSNNEAALMKAVAN-QPVSVAIDASGSDFQFYSSGVFTGQ--CGTELDHGVTAVGYGT 298
           D+P  +E  LM+AVA   PVSVAIDAS + FQ YSSGV+  +    T+LDHGV  VGYGT
Sbjct: 237 DIPEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEEECSSTDLDHGVLVVGYGT 296

Query: 299 ADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
            + G  YWLVKNSWG +WGE GYI+M R+   K   CGIA  ASYP
Sbjct: 297 DEQGVDYWLVKNSWGRSWGELGYIKMIRN---KNNRCGIASSASYP 339


>gi|15593246|gb|AAL02220.1|AF410880_1 cysteine protease CP7 precursor [Frankliniella occidentalis]
          Length = 333

 Score =  256 bits (654), Expect = 1e-65,   Method: Compositional matrix adjust.
 Identities = 146/321 (45%), Positives = 196/321 (61%), Gaps = 16/321 (4%)

Query: 31  NDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNK--ARNKPYKLGINE 88
           +D  +    E + A + + Y + AE+  R K+FKEN   IA  N++  +    +K+G N+
Sbjct: 20  SDMEIQAHWESFKATHAKTYANAAEEAYRAKVFKENAIRIAKHNDRFASGEVTFKVGYNQ 79

Query: 89  FADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENASVPAS--IDWRKKGAVTGVKDQ 146
           +AD    E     NGY+  L    +   T       N S P S  +DWR KGAVT +KDQ
Sbjct: 80  YADMHTHEVTEKLNGYRSGLKQASAFVHTA-----SNDSWPWSKKVDWRSKGAVTPIKDQ 134

Query: 147 GQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIIS 206
           GQCG CW+FSA  ++EG   +  + L SLSEQ LVDC     ++GC GGLMD AFE++ S
Sbjct: 135 GQCGSCWSFSATGSLEGQLFLKNKNLVSLSEQNLVDCSWDFGNEGCNGGLMDSAFEYVKS 194

Query: 207 NKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVAN-QPVSVAID 265
             G+ TE  YPY A DG+C  K AN +A   +GY+DV + +E+AL  AV    PVSVAID
Sbjct: 195 YGGIDTEESYPYTAEDGTCLYKAAN-NAGVNTGYKDVQAKSESALRDAVEKVGPVSVAID 253

Query: 266 ASGSDFQFYSSGV-FTGQCGTE-LDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIR 323
           AS   FQ Y+SG+ +   C ++ LDHGV AVGYG+     ++W+VKNSWGT+WGE GYI+
Sbjct: 254 ASNWSFQMYTSGIYYEPACSSDSLDHGVLAVGYGSEWPNKEFWIVKNSWGTSWGEEGYIK 313

Query: 324 MQRDIDAKEGLCGIAMQASYP 344
           M R+   K+  CGIA +ASYP
Sbjct: 314 MARN---KKNNCGIATEASYP 331


>gi|110739710|dbj|BAF01762.1| cysteine protease component of protease-inhibitor complex
           [Arabidopsis thaliana]
          Length = 300

 Score =  256 bits (654), Expect = 1e-65,   Method: Compositional matrix adjust.
 Identities = 125/191 (65%), Positives = 147/191 (76%), Gaps = 2/191 (1%)

Query: 154 AFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATE 213
           AFS + A+EGIN I T  L SLSEQELVDCDTS  +QGC GGLMD AFEFII N G+ TE
Sbjct: 1   AFSTIGAVEGINKIVTGDLISLSEQELVDCDTS-YNQGCNGGLMDYAFEFIIKNGGIDTE 59

Query: 214 AKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQF 273
           A YPYKA+DG C++   N     I  YEDVP N+EA+L KA+A+QP+SVAI+A G  FQ 
Sbjct: 60  ADYPYKAADGRCDQNRKNAKVVTIDSYEDVPENSEASLKKALAHQPISVAIEAGGRAFQL 119

Query: 274 YSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEG 333
           YSSGVF G CGTELDHGV AVGYGT ++G  YW+V+NSWG  WGE+GYI+M R+I+A  G
Sbjct: 120 YSSGVFDGLCGTELDHGVVAVGYGT-ENGKGYWIVRNSWGNRWGESGYIKMARNIEAPTG 178

Query: 334 LCGIAMQASYP 344
            CGIAM+ASYP
Sbjct: 179 KCGIAMEASYP 189


>gi|387015022|gb|AFJ49630.1| Cathepsin L1-like [Crotalus adamanteus]
          Length = 338

 Score =  256 bits (653), Expect = 1e-65,   Method: Compositional matrix adjust.
 Identities = 153/346 (44%), Positives = 200/346 (57%), Gaps = 21/346 (6%)

Query: 10  LVLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEY 69
           ++   IL L   A  S++    D  +N+    W + + + Y +  E   R  I+++N++ 
Sbjct: 1   MIYLCILALSFGA--SFAAPGLDPALNDHWLSWKSWHSKKYHEKEEGWRRM-IWEKNLKM 57

Query: 70  IASFN--NKARNKPYKLGINEFADQTNEEFRAPRNGYK--RRLPSVRSSETTDVSFRYEN 125
           I   N  +      Y+LG+N F D TNEEFR   NG+K  R     + S+  + +F    
Sbjct: 58  IELHNLDHSLGKHSYRLGMNHFGDMTNEEFRQVMNGFKQSRSQRKYKGSQFLEPNF---- 113

Query: 126 ASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDT 185
              P S+DWR+KG VT VKDQGQCG CWAFSA  A+EG +   T KL SLSEQ L+DC  
Sbjct: 114 LQAPKSVDWREKGYVTPVKDQGQCGSCWAFSATGALEGQHFRKTGKLVSLSEQNLIDCSG 173

Query: 186 SGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPS 245
              +QGC GGLMD AF++I  N G+ +E  YPY   D      +   ++A  +G+ D+P 
Sbjct: 174 PEGNQGCNGGLMDQAFQYIKDNNGIDSEESYPYIGKDDEDCLYKPEYNSANDTGFVDIPE 233

Query: 246 NNEAALMKAVAN-QPVSVAIDASGSDFQFYSSGV-FTGQCGT-ELDHGVTAVGY---GTA 299
             E ALMKAVA   P+SVAIDAS + FQFY SGV +  QC + ELDHGV  VGY   GT 
Sbjct: 234 GRERALMKAVAAVGPISVAIDASHTSFQFYESGVYYEPQCNSEELDHGVLVVGYGYEGTD 293

Query: 300 DDGTK-YWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
           DD  K YW+VKNSW   WG+ GYI M +D   +   CGIA  ASYP
Sbjct: 294 DDNKKRYWIVKNSWSEKWGDQGYIHMAKD---RSNNCGIASAASYP 336


>gi|389608655|dbj|BAM17937.1| cathepsin L [Papilio xuthus]
          Length = 341

 Score =  256 bits (653), Expect = 1e-65,   Method: Compositional matrix adjust.
 Identities = 144/319 (45%), Positives = 189/319 (59%), Gaps = 18/319 (5%)

Query: 40  EMWMA---QYGRVYRDNAEKEMRFKIFKENVEYIASFNNK-ARNK-PYKLGINEFADQTN 94
           E W A   ++ + Y    E + R KI+ EN   IA  N K AR +  ++L  N++ D  +
Sbjct: 25  EEWNAFKMEHQKQYDSEVEDKFRMKIYAENKHNIAKHNQKYARGEVSFRLKQNKYGDMLH 84

Query: 95  EEFRAPRNGYKRRLPSVR------SSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQ 148
            EF    NG+ +   + +      + E         N  +P  +DWRK GAVT VKDQG+
Sbjct: 85  HEFVHTMNGFNKTTKNSKGLFGKSAGERGATFITPANVHLPDHVDWRKHGAVTEVKDQGK 144

Query: 149 CGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNK 208
           CG CW+FS+  A+EG ++  T  L SLSEQ L+DC  +  + GC GGLMD+AF++I  N+
Sbjct: 145 CGSCWSFSSTGALEGQHYRRTNILVSLSEQNLIDCSAAYGNNGCNGGLMDNAFKYIKDNR 204

Query: 209 GLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVAN-QPVSVAIDAS 267
           G+ TE  YPY+  D  C     N + A  +G+ D+PS +E  LM AVA   PVSVAIDAS
Sbjct: 205 GIDTEKSYPYEGIDDKCRYNPKN-TGADDNGFVDIPSGDEGKLMAAVATVGPVSVAIDAS 263

Query: 268 GSDFQFYSSGV-FTGQC-GTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQ 325
            S FQFYS GV F   C  + LDHGV  VGYGT ++G  YWLVKNSWG +WG+ GYI+M 
Sbjct: 264 QSSFQFYSDGVYFDENCSSSSLDHGVLVVGYGTDENGGDYWLVKNSWGRSWGDLGYIKMA 323

Query: 326 RDIDAKEGLCGIAMQASYP 344
           R+ D     CGIA  ASYP
Sbjct: 324 RNRDNH---CGIATAASYP 339


>gi|18858809|ref|NP_571273.1| cathepsin L, 1 b precursor [Danio rerio]
 gi|1752664|emb|CAA69623.1| cathepsin L [Danio rerio]
          Length = 336

 Score =  256 bits (653), Expect = 1e-65,   Method: Compositional matrix adjust.
 Identities = 147/344 (42%), Positives = 200/344 (58%), Gaps = 24/344 (6%)

Query: 11  VLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYI 70
           +L  + +  V+A  S      D  +++    W +Q+G+ Y ++ E   R  I++EN+  I
Sbjct: 5   LLVTLYISAVFAAPSI-----DIQLDDHWNSWKSQHGKSYHEDVEVGRRM-IWEENLRKI 58

Query: 71  --ASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENA-- 126
              +F     N  +K+G+N+F D TNEEFR   NGY         ++T+      E +  
Sbjct: 59  EQHNFEYSYGNHTFKMGMNQFGDMTNEEFRQAMNGYTHD-----PNQTSQGPLFMEPSFF 113

Query: 127 SVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTS 186
           + P  +DWR++G VT VKDQ QCG CW+FS+  A+EG     T KL S+SEQ LVDC   
Sbjct: 114 AAPQQVDWRQRGYVTPVKDQKQCGSCWSFSSTGALEGQLFRKTGKLISMSEQNLVDCSRP 173

Query: 187 GEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSN 246
             +QGC GGLMD AF+++  NKGL +E  YPY A D    + +   + AKI+G+ D+PS 
Sbjct: 174 QGNQGCNGGLMDQAFQYVKENKGLDSEQSYPYLARDDLPCRYDPRFNVAKITGFVDIPSG 233

Query: 247 NEAALMKAVANQ-PVSVAIDASGSDFQFYSSGVFTGQC--GTELDHGVTAVGYGT--AD- 300
           NE ALM AVA   PVSVAIDAS    QFY SG++  +    + LDH V  VGYG   AD 
Sbjct: 234 NELALMNAVAAVGPVSVAIDASHQSLQFYQSGIYYERACSSSRLDHAVLVVGYGYQGADV 293

Query: 301 DGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
            G +YW+VKNSW   WG+ GYI M +D   K   CG+A +ASYP
Sbjct: 294 AGNRYWIVKNSWSDKWGDKGYIYMAKD---KNNHCGVATKASYP 334


>gi|302831223|ref|XP_002947177.1| hypothetical protein VOLCADRAFT_103269 [Volvox carteri f.
           nagariensis]
 gi|300267584|gb|EFJ51767.1| hypothetical protein VOLCADRAFT_103269 [Volvox carteri f.
           nagariensis]
          Length = 514

 Score =  256 bits (653), Expect = 1e-65,   Method: Compositional matrix adjust.
 Identities = 155/340 (45%), Positives = 193/340 (56%), Gaps = 41/340 (12%)

Query: 41  MWMAQYGRVYRDNA-EKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRA 99
           +W  QYGR Y + + E   R  IF +NV  I   + K  +    L +NE+AD T EEF +
Sbjct: 40  LWSRQYGRTYVEQSPEYTRRLSIFSDNVRAIQESHEK--DPGVTLALNEYADLTWEEFSS 97

Query: 100 PRNGYKRRLPSVRSSETTDV----SFRYENA-SVPASIDWRKKGAVTGVKDQGQCGCCWA 154
            R G +     +            ++RY  A   P +IDWR+KGAV  VK+QGQCG CWA
Sbjct: 98  TRLGLRIDQDQLDRRSRRSASRRNAWRYAAAVDNPKAIDWREKGAVAEVKNQGQCGSCWA 157

Query: 155 FSAVAAMEGINHITTRKLTSLSEQELVDCDTSGE-------------------------D 189
           FS   A+EGIN I T +L SLSEQ+LVDCDT                            +
Sbjct: 158 FSTTGAIEGINAIVTGQLQSLSEQQLVDCDTGKRTVTRSKRSCTVILPSYSSNSCRNESN 217

Query: 190 QGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGS---CNK-KEANPSAAKISGYEDVPS 245
            GC GGLMDDAF+++I N GL TE  Y Y +  G    CNK K+ +  A  I GYEDVP 
Sbjct: 218 MGCSGGLMDDAFKYVIQNGGLDTEQDYAYWSGYGLGFWCNKRKQTDRPAVSIDGYEDVP- 276

Query: 246 NNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKY 305
             E  L+KAVA+QPV+VAI A G+  QFYS GV +  C   L+HGV  VGY  + DG KY
Sbjct: 277 QGEDNLLKAVAHQPVAVAICA-GASMQFYSRGVIS-TCCEGLNHGVLTVGYNVSQDGEKY 334

Query: 306 WLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPT 345
           W+VKNSWG  WGE GY R++  +  + GLCGIA  ASYPT
Sbjct: 335 WIVKNSWGAGWGEQGYFRLKMGV-GETGLCGIASAASYPT 373


>gi|449532567|ref|XP_004173252.1| PREDICTED: oryzain alpha chain-like [Cucumis sativus]
          Length = 321

 Score =  256 bits (653), Expect = 1e-65,   Method: Compositional matrix adjust.
 Identities = 125/197 (63%), Positives = 149/197 (75%), Gaps = 3/197 (1%)

Query: 150 GCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKG 209
           G CWAFS+VAA+EGIN I T +L  LSEQELVDCD S  + GC GGLMD AF+FII N G
Sbjct: 13  GSCWAFSSVAAVEGINQIVTGELIPLSEQELVDCDKS-FNMGCNGGLMDYAFQFIIGNGG 71

Query: 210 LATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGS 269
           + TE  YPYK  D +C+    N     I GYEDVP N+E++L KAVANQPVSVAI+A G 
Sbjct: 72  IDTEEDYPYKGRDAACDPNRKNAKVVTIDGYEDVPENDESSLKKAVANQPVSVAIEAGGR 131

Query: 270 DFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDI- 328
            FQ Y SGVFTG+CGT+LDHGV AVGYGT D+GT YW+V+NSWG  WGE+GYIR++R++ 
Sbjct: 132 AFQLYQSGVFTGRCGTDLDHGVVAVGYGT-DNGTDYWIVRNSWGKDWGESGYIRLERNVA 190

Query: 329 DAKEGLCGIAMQASYPT 345
           +   G CGIA+Q SYPT
Sbjct: 191 NITTGKCGIAVQPSYPT 207


>gi|115715524|ref|XP_780580.2| PREDICTED: cathepsin L-like [Strongylocentrotus purpuratus]
          Length = 334

 Score =  256 bits (653), Expect = 1e-65,   Method: Compositional matrix adjust.
 Identities = 147/337 (43%), Positives = 200/337 (59%), Gaps = 15/337 (4%)

Query: 14  AILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASF 73
           ++L++      S S +  D   +E    W  ++G+ Y  + E+  R  I+++N++ +   
Sbjct: 5   SVLLVAACVVSSLSMSFID--FDEDWNQWKNEHGKRYLSDEEEASRRLIWQKNLDIVIKH 62

Query: 74  NNK--ARNKPYKLGINEFADQTNEEFRAPRNGYK-RRLPSVRSSETTDVSFRYENASVPA 130
           N K    +  Y LG+N+FAD  NEEF +  NG++     + R S     S  ++   +P 
Sbjct: 63  NLKYDLGHFTYDLGMNQFADLKNEEFVSLMNGFRGNSSKATRGSTFLPPSNVFD---MPT 119

Query: 131 SIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQ 190
            +DWR KG VT VK+Q QCG CWAFSA  ++EG +   T KL SLSEQ LVDC     + 
Sbjct: 120 MVDWRTKGYVTPVKNQLQCGSCWAFSATGSLEGQHFKKTGKLVSLSEQNLVDCSGKEGNM 179

Query: 191 GCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAA 250
           GCEGGLMD AF++I+   G+ TE  YPY A DG C+  +AN  A   +GY DV + +E+A
Sbjct: 180 GCEGGLMDQAFQYILDVGGIDTEMSYPYTAMDGQCHFNKANIGATD-TGYTDVTTGSESA 238

Query: 251 LMKAVAN-QPVSVAIDASGSDFQFYSSGVFT--GQCGTELDHGVTAVGYGTADDGTKYWL 307
           L  AVA+  P+SVAIDAS   FQ Y SGV+       T LDHGV AVGYGT+ DGT Y+ 
Sbjct: 239 LQMAVASVGPISVAIDASHQSFQLYKSGVYNEPACSSTLLDHGVLAVGYGTSSDGTDYFF 298

Query: 308 VKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
             +SWG  WG NGY+ M R+   K+  CGIA +ASYP
Sbjct: 299 FFHSWGAAWGMNGYLWMSRN---KDNQCGIATKASYP 332


>gi|254746340|emb|CAX16635.1| putative C1A cysteine protease precursor [Manduca sexta]
          Length = 342

 Score =  256 bits (653), Expect = 2e-65,   Method: Compositional matrix adjust.
 Identities = 147/319 (46%), Positives = 182/319 (57%), Gaps = 18/319 (5%)

Query: 40  EMWMA---QYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNK--PYKLGINEFADQTN 94
           E W+A   Q+ + Y    E   R KI+ EN   IA  N         YKLG N++ D  +
Sbjct: 26  EEWVAFKMQHDKKYDSEVEDRFRMKIYAENKHKIAKHNQLYEQGLVSYKLGPNKYTDMLH 85

Query: 95  EEFRAPRNGYKRRLPSVRS--SETTDVS----FRYENASVPASIDWRKKGAVTGVKDQGQ 148
            EF    NGY R     +    +  DV         +   P  +DW KKGAVT VKDQG+
Sbjct: 86  HEFIQAMNGYNRTAKHNKGLYGKKHDVRGATFIPPAHVKYPDHVDWTKKGAVTEVKDQGK 145

Query: 149 CGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNK 208
           CG CWAFS   A+EG +   +  L SLSEQ L+DC ++  + GC GGLMD+AF++I  N 
Sbjct: 146 CGSCWAFSTTGALEGQHFRKSGYLVSLSEQNLIDCSSTYGNNGCNGGLMDNAFKYIKDNG 205

Query: 209 GLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVAN-QPVSVAIDAS 267
           G+ TE  YPY+  D  C     N  A  + G+ D+PS +E  LM+AVA   PVSVAIDAS
Sbjct: 206 GIDTEKTYPYEGVDDKCRYNPKNSGAEDV-GFVDIPSGDEEKLMQAVATVGPVSVAIDAS 264

Query: 268 GSDFQFYSSGVF--TGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQ 325
            + FQFYS GV+  T    T+LDHGV  VGYGT + G  YWLVKNSW  TWGE GYI+M 
Sbjct: 265 QNSFQFYSGGVYYDTECSSTDLDHGVLVVGYGTDEAGGDYWLVKNSWSRTWGELGYIKMA 324

Query: 326 RDIDAKEGLCGIAMQASYP 344
           R+ D     CGIA  ASYP
Sbjct: 325 RNRDNH---CGIATDASYP 340


>gi|185135439|ref|NP_001117777.1| procathepsin L precursor [Oncorhynchus mykiss]
 gi|14582899|gb|AAK69706.1|AF358668_1 procathepsin L [Oncorhynchus mykiss]
          Length = 338

 Score =  256 bits (653), Expect = 2e-65,   Method: Compositional matrix adjust.
 Identities = 154/350 (44%), Positives = 201/350 (57%), Gaps = 33/350 (9%)

Query: 11  VLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYI 70
           +  A+LVL V A  +  R   D+ + +   +W   + + Y + +E+  R  ++++N++ I
Sbjct: 4   LYLAVLVLCVSAVCAAPRF--DSQLEDHWHLWKNWHSKHYHE-SEEGWRRMVWEKNLKKI 60

Query: 71  ASFN--NKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFR------ 122
              N  +      Y+LG+N F D TNEEFR   NGYK         +TT+  F+      
Sbjct: 61  EIHNLEHTMGKHSYRLGMNHFGDMTNEEFRQTMNGYK---------QTTERKFKGSLFME 111

Query: 123 --YENASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQEL 180
             Y  A  P ++DWR+KG VT VKDQG CG CWAFS   AMEG     T KL SLSEQ L
Sbjct: 112 PNYLQA--PKAVDWREKGYVTPVKDQGSCGSCWAFSTTGAMEGQQFRKTGKLVSLSEQNL 169

Query: 181 VDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGY 240
           VDC     ++GC GGLMD AF++I  N GL TE  YPY  +D      +   SAA  +G+
Sbjct: 170 VDCSRPEGNEGCNGGLMDQAFQYIQDNAGLDTEESYPYVGTDEDPCHYKPEFSAANETGF 229

Query: 241 EDVPSNNEAALMKAVAN-QPVSVAIDASGSDFQFYSSGV-FTGQCGT-ELDHGVTAVGYG 297
            D+PS  E A+MKAVA   PVSVAIDA    FQFY SG+ +  +C + ELDHGV  VGYG
Sbjct: 230 VDIPSGKEHAMMKAVAAVGPVSVAIDAGHESFQFYESGIYYEKECSSEELDHGVLVVGYG 289

Query: 298 TAD---DGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
                 DG KYW+VKNSW   WG+ GYI M +D   ++  CGIA  +SYP
Sbjct: 290 FEGEDVDGKKYWIVKNSWSEKWGDKGYIYMAKD---RKNHCGIATASSYP 336


>gi|225709022|gb|ACO10357.1| Cathepsin L precursor [Caligus rogercresseyi]
          Length = 332

 Score =  255 bits (652), Expect = 2e-65,   Method: Compositional matrix adjust.
 Identities = 143/313 (45%), Positives = 185/313 (59%), Gaps = 14/313 (4%)

Query: 40  EMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARN--KPYKLGINEFADQTNEEF 97
           E W   +G+ Y  + E+++R KI  EN   I+  N +A N    Y + +N + D  + EF
Sbjct: 28  ESWKLTHGKSYESSIEEKLRLKIHMENSLKISRHNAEAINGKHSYYMKMNHYGDLLHHEF 87

Query: 98  RAPRNGYKRRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQCGCCWAFSA 157
            A  NGY+     V  +         +N  +P  +DWR+ GAVT VK+QGQCG CWAFS+
Sbjct: 88  VAMVNGYEY----VNKTSLGGSFIPSKNVKLPTHVDWREDGAVTPVKNQGQCGSCWAFSS 143

Query: 158 VAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYP 217
             ++EG     T KL  LSEQ LVDC     + GCEGGLMD AF +I  NKG+ TE  YP
Sbjct: 144 TGSLEGQTFRKTGKLIPLSEQNLVDCSRKYGNNGCEGGLMDFAFTYIRDNKGIDTEGSYP 203

Query: 218 YKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVAN-QPVSVAIDASGSDFQFYSS 276
           Y+   G C+   +   ++ I G+ DV   +E  L+KAVA+  PVSVAIDAS   FQFYS 
Sbjct: 204 YEGVGGRCHYDPSKKGSSDI-GFVDVKKGSEEELLKAVASVGPVSVAIDASHMSFQFYSH 262

Query: 277 GV-FTGQCGTE-LDHGVTAVGYGTADD-GTKYWLVKNSWGTTWGENGYIRMQRDIDAKEG 333
           GV F  +C  E LDHGV  VGYGT ++ G  YWLVKNSW   WG+ GYI+M R+   K+ 
Sbjct: 263 GVYFESKCSPENLDHGVLVVGYGTDENSGEDYWLVKNSWSENWGDQGYIKMARN---KKN 319

Query: 334 LCGIAMQASYPTA 346
           +CGIA  ASYP  
Sbjct: 320 MCGIASSASYPVV 332


>gi|157311713|ref|NP_001098585.1| uncharacterized protein LOC564979 precursor [Danio rerio]
 gi|156230121|gb|AAI52284.1| Wu:fa26c03 protein [Danio rerio]
          Length = 336

 Score =  255 bits (652), Expect = 2e-65,   Method: Compositional matrix adjust.
 Identities = 150/345 (43%), Positives = 202/345 (58%), Gaps = 26/345 (7%)

Query: 11  VLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYI 70
           +L  + +  V+A  S      D  +++    W +Q+G+ Y ++ E   R  I++EN+  I
Sbjct: 5   LLVTLCISAVFAASSI-----DIQLDDHWNSWKSQHGKSYHEDVEVGRRM-IWEENLRKI 58

Query: 71  ASFNNKAR--NKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSE---TTDVSFRYEN 125
              N +    N  +K+G+N+F D TNEEFR   NGYK   P+ R+S+     + SF    
Sbjct: 59  EQHNFEYSYGNHTFKMGMNQFGDMTNEEFRQAMNGYKHD-PN-RTSQGPLFMEPSF---- 112

Query: 126 ASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDT 185
            + P  +DWR++G VT VKDQ QCG CW+FS+  A+EG     T KL S+SEQ LVDC  
Sbjct: 113 FAAPQQVDWRQRGFVTPVKDQKQCGSCWSFSSTGALEGQLFRKTGKLISMSEQNLVDCSR 172

Query: 186 SGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPS 245
              +QGC GGLMD AF+++  NKGL +E  YPY A D    + +   + AKI+G+ D+P 
Sbjct: 173 PQGNQGCNGGLMDQAFQYVKENKGLDSEQSYPYLARDDLPCRYDPRFNVAKITGFVDIPR 232

Query: 246 NNEAALMKAVANQ-PVSVAIDASGSDFQFYSSGVFTGQC--GTELDHGVTAVGYGT--AD 300
            NE ALM AVA   PVSVAIDAS    QFY SG++  +    + LDH V  VGYG   AD
Sbjct: 233 GNELALMNAVAAVGPVSVAIDASHQSLQFYQSGIYYERACSSSRLDHAVLVVGYGYQGAD 292

Query: 301 -DGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
             G +YW+VKNSW   WG+ GYI M +D   K   CG+A  ASYP
Sbjct: 293 VAGNRYWIVKNSWSDKWGDKGYIYMAKD---KNNHCGVATSASYP 334


>gi|157829826|pdb|1AEC|A Chain A, Crystal Structure Of Actinidin-E-64 Complex+
          Length = 218

 Score =  255 bits (652), Expect = 2e-65,   Method: Compositional matrix adjust.
 Identities = 123/217 (56%), Positives = 149/217 (68%), Gaps = 2/217 (0%)

Query: 128 VPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSG 187
           +P+ +DWR  GAV  +K QG+CG CWAFSA+A +EGIN I T  L SLSEQEL+DC  + 
Sbjct: 1   LPSYVDWRSAGAVVDIKSQGECGGCWAFSAIATVEGINKIVTGVLISLSEQELIDCGRTQ 60

Query: 188 EDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNN 247
             +GC GG + D F+FII+N G+ TE  YPY A DG CN    N     I  YE+VP NN
Sbjct: 61  NTRGCNGGYITDGFQFIINNGGINTEENYPYTAQDGECNVDLQNEKYVTIDTYENVPYNN 120

Query: 248 EAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGTELDHGVTAVGYGTADDGTKYWL 307
           E AL  AV  QPVSVA+DA+G  F+ YSSG+FTG CGT +DH VT VGYGT + G  YW+
Sbjct: 121 EWALQTAVTYQPVSVALDAAGDAFKQYSSGIFTGPCGTAIDHAVTIVGYGT-EGGIDYWI 179

Query: 308 VKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
           VKNSW TTWGE GY+R+ R++    G CGIA   SYP
Sbjct: 180 VKNSWDTTWGEEGYMRILRNVGGA-GTCGIATMPSYP 215


>gi|5853329|gb|AAD54424.1|AF182079_1 thiol protease [Matricaria chamomilla]
          Length = 501

 Score =  255 bits (652), Expect = 2e-65,   Method: Compositional matrix adjust.
 Identities = 139/328 (42%), Positives = 200/328 (60%), Gaps = 20/328 (6%)

Query: 30  LNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNK-PYKLGINE 88
           L+ A +++    W   +G+ Y+   E+ +R + FK++V+++   N++ +++  + +G+N+
Sbjct: 41  LSSAKVSDLFGKWKELHGKTYQHEEEENLRLENFKKSVKFVMEKNSERKSELDHTVGLNK 100

Query: 89  FADQTNEEFRAPRNGYKRRLPSVRSSETT------DVSFRYENASVPASIDWRKKGAVTG 142
           FAD +NEEF+     Y  ++   RS+E        ++S        P S+DWR KG VT 
Sbjct: 101 FADLSNEEFK---EMYMSKVKGSRSNELKMGGVKRNMSVSSRTCDAPTSLDWRDKGVVTP 157

Query: 143 VKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFE 202
           +KDQGQCG CWAFS   ++E  N I T  L  LSEQELVDCDT   D GC+GG MD A+ 
Sbjct: 158 MKDQGQCGSCWAFSVSGSIESANAIATGDLIRLSEQELVDCDT--YDYGCDGGNMDTAYR 215

Query: 203 FIISNKGLATEAKYPYKAS---DGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQP 259
           +II N GL +E  YPY +S   DG C+K ++  S   +  Y +V SN +A L  AVA  P
Sbjct: 216 WIIKNGGLDSEDDYPYTSSNGRDGKCDKTKSAKSVVSLDSYVEVESNEDAVLC-AVATTP 274

Query: 260 VSVAIDASGSDFQFYSSGVFTGQCGT---ELDHGVTAVGYGTADDGTKYWLVKNSWGTTW 316
           V++ I  S  DFQ Y+ GV+ GQC +   ++DH V  VGYG+  DG  YW+VKNSWGT W
Sbjct: 275 VTIGIVGSAYDFQLYTGGVYNGQCSSKPYDIDHAVLIVGYGS-QDGKDYWIVKNSWGTYW 333

Query: 317 GENGYIRMQRDIDAKEGLCGIAMQASYP 344
           G  GYI M+R+ D K G+CG+ ++  YP
Sbjct: 334 GLEGYILMERNTDIKNGVCGMYLEPVYP 361


>gi|81294188|gb|AAI08032.1| Cathepsin L, 1 b [Danio rerio]
          Length = 336

 Score =  255 bits (651), Expect = 3e-65,   Method: Compositional matrix adjust.
 Identities = 147/344 (42%), Positives = 200/344 (58%), Gaps = 24/344 (6%)

Query: 11  VLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYI 70
           +L  + +  V+A  S      D  +++    W +Q+G+ Y ++ E   R  I++EN+  I
Sbjct: 5   LLVTLYISAVFAAPSI-----DIQLDDHWNSWKSQHGKSYHEDVEVGRRM-IWEENLRKI 58

Query: 71  --ASFNNKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENA-- 126
              +F     N  +K+G+N+F D TNEEFR   NGY         ++T+      E +  
Sbjct: 59  EQHNFEYSYGNHTFKMGMNQFGDMTNEEFRQAMNGYTHD-----PNQTSQGPLFMEPSFF 113

Query: 127 SVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTS 186
           + P  +DWR++G VT VKDQ QCG CW+FS+  A+EG     T KL S+SEQ LVDC   
Sbjct: 114 AAPQQVDWRQRGYVTPVKDQKQCGSCWSFSSTGALEGQLFRKTGKLISMSEQNLVDCSRP 173

Query: 187 GEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSN 246
             +QGC GGLMD AF+++  NKGL +E  YPY A D    + +   + AKI+G+ D+PS 
Sbjct: 174 QGNQGCNGGLMDLAFQYVKENKGLDSEQSYPYLARDDLPCRYDPRFNVAKITGFVDIPSG 233

Query: 247 NEAALMKAVANQ-PVSVAIDASGSDFQFYSSGVFTGQC--GTELDHGVTAVGYGT--AD- 300
           NE ALM AVA   PVSVAIDAS    QFY SG++  +    + LDH V  VGYG   AD 
Sbjct: 234 NELALMNAVAAVGPVSVAIDASHQSLQFYQSGIYYERACSSSRLDHAVLVVGYGYQGADV 293

Query: 301 DGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
            G +YW+VKNSW   WG+ GYI M +D   K   CG+A +ASYP
Sbjct: 294 AGNRYWIVKNSWSDKWGDKGYIYMAKD---KNNHCGVATKASYP 334


>gi|356557743|ref|XP_003547170.1| PREDICTED: LOW QUALITY PROTEIN: xylem cysteine proteinase 1-like
           [Glycine max]
          Length = 400

 Score =  255 bits (651), Expect = 3e-65,   Method: Compositional matrix adjust.
 Identities = 139/316 (43%), Positives = 196/316 (62%), Gaps = 16/316 (5%)

Query: 37  ERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPY--KLGINEFADQTN 94
           E  + W  +  ++YR+  E+++RF+ FK N++YI   N+K R  PY   LG+N+FAD +N
Sbjct: 48  ELFQRWKEENKKIYRNPEEEKLRFENFKRNLKYIVEKNSK-RISPYGQSLGLNQFADMSN 106

Query: 95  EEFRAPRNGYKRRLPSVRSSETT-DVSFRYENASVPASIDWRKKGAVT-GVKDQGQCGCC 152
           EEF++      ++  S R+  ++ D S   E    P S+DWRKKG VT  VKDQG CG  
Sbjct: 107 EEFKSKFMSKVKKPFSKRNGVSSKDHSCEDE----PYSLDWRKKGVVTLAVKDQGYCGSY 162

Query: 153 WAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLAT 212
           WAFS+  A+EGIN I T  L SLSEQELVDCD++ +  GC+GG MD AFE+++ N G+ T
Sbjct: 163 WAFSSTDAIEGINAIVTADLISLSEQELVDCDSTND--GCDGGXMDYAFEWVMYNGGIDT 220

Query: 213 EAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQ 272
           E  YPY  +DG+CN  +       I GY DV   ++++L+ A   QP+S  ID +  DFQ
Sbjct: 221 ETNYPYIGADGTCNVTKEKTKVIGIDGYYDV-GQSDSSLLCATVKQPISAGIDGTSWDFQ 279

Query: 273 FYSSGVFTGQCGT---ELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDID 329
            Y  G++ G C +   ++DH +  VGYG+  D   YW+VKNSW T+WG  G I ++++ +
Sbjct: 280 LYIGGIYDGDCSSDPDDIDHAILVVGYGSEGD-DDYWIVKNSWRTSWGMEGCIYLRKNTN 338

Query: 330 AKEGLCGIAMQASYPT 345
            K G C I   ASYPT
Sbjct: 339 LKYGXCAINYMASYPT 354


>gi|6753558|ref|NP_034114.1| cathepsin L1 preproprotein [Mus musculus]
 gi|115742|sp|P06797.2|CATL1_MOUSE RecName: Full=Cathepsin L1; AltName: Full=Major excreted protein;
           Short=MEP; AltName: Full=p39 cysteine proteinase;
           Contains: RecName: Full=Cathepsin L1 heavy chain;
           Contains: RecName: Full=Cathepsin L1 light chain; Flags:
           Precursor
 gi|53047|emb|CAA29470.1| unnamed protein product [Mus musculus]
 gi|309186|gb|AAA37445.1| preprocysteine proteinase [Mus musculus]
 gi|12832050|dbj|BAB21945.1| unnamed protein product [Mus musculus]
 gi|26340196|dbj|BAC33761.1| unnamed protein product [Mus musculus]
 gi|45768760|gb|AAH68163.1| Cathepsin L [Mus musculus]
 gi|74139700|dbj|BAE31701.1| unnamed protein product [Mus musculus]
 gi|74146632|dbj|BAE41323.1| unnamed protein product [Mus musculus]
 gi|74151584|dbj|BAE41141.1| unnamed protein product [Mus musculus]
 gi|74185397|dbj|BAE30172.1| unnamed protein product [Mus musculus]
 gi|74197196|dbj|BAE35143.1| unnamed protein product [Mus musculus]
 gi|74203006|dbj|BAE26206.1| unnamed protein product [Mus musculus]
 gi|74219606|dbj|BAE29572.1| unnamed protein product [Mus musculus]
 gi|148684295|gb|EDL16242.1| cathepsin L [Mus musculus]
          Length = 334

 Score =  254 bits (650), Expect = 3e-65,   Method: Compositional matrix adjust.
 Identities = 147/323 (45%), Positives = 192/323 (59%), Gaps = 19/323 (5%)

Query: 32  DATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARN--KPYKLGINEF 89
           D T +     W + + R+Y  N E+E R  I+++N+  I   N +  N    + + +N F
Sbjct: 22  DQTFSAEWHQWKSTHRRLYGTN-EEEWRRAIWEKNMRMIQLHNGEYSNGQHGFSMEMNAF 80

Query: 90  ADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQC 149
            D TNEEFR   NGY+ +           +  +     +P S+DWR+KG VT VK+QGQC
Sbjct: 81  GDMTNEEFRQVVNGYRHQKHKKGRLFQEPLMLK-----IPKSVDWREKGCVTPVKNQGQC 135

Query: 150 GCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKG 209
           G CWAFSA   +EG   + T KL SLSEQ LVDC  +  +QGC GGLMD AF++I  N G
Sbjct: 136 GSCWAFSASGCLEGQMFLKTGKLISLSEQNLVDCSHAQGNQGCNGGLMDFAFQYIKENGG 195

Query: 210 LATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVAN-QPVSVAIDASG 268
           L +E  YPY+A DGSC K  A  + A  +G+ D+P   E ALMKAVA   P+SVA+DAS 
Sbjct: 196 LDSEESYPYEAKDGSC-KYRAEFAVANDTGFVDIPQ-QEKALMKAVATVGPISVAMDASH 253

Query: 269 SDFQFYSSGV-FTGQCGTE-LDHGVTAVGY---GTADDGTKYWLVKNSWGTTWGENGYIR 323
              QFYSSG+ +   C ++ LDHGV  VGY   GT  +  KYWLVKNSWG+ WG  GYI+
Sbjct: 254 PSLQFYSSGIYYEPNCSSKNLDHGVLLVGYGYEGTDSNKNKYWLVKNSWGSEWGMEGYIK 313

Query: 324 MQRDIDAKEGLCGIAMQASYPTA 346
           + +D D     CG+A  ASYP  
Sbjct: 314 IAKDRDNH---CGLATAASYPVV 333


>gi|17062058|gb|AAL34984.1|AF320565_1 cathepsine L-like cysteine protease [Rhodnius prolixus]
          Length = 316

 Score =  254 bits (650), Expect = 3e-65,   Method: Compositional matrix adjust.
 Identities = 140/308 (45%), Positives = 190/308 (61%), Gaps = 14/308 (4%)

Query: 44  AQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKAR--NKPYKLGINEFADQTNEEFRAPR 101
           A +G+ YR+  E+  R K+F +N + I   N K       YK+ +N   D    EF+A  
Sbjct: 18  AMHGKNYRNQFEEIFRMKVFIDNKKKIDEHNAKYELGEASYKMKMNHLGDLMVHEFKALM 77

Query: 102 NGYKRRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAM 161
           NG+K+   + R+ +    S    N ++P S+DWR++GAVT VKDQG CG CW+FSA  ++
Sbjct: 78  NGFKKTPNAERNGKIYVPS----NENLPKSVDWRQRGAVTPVKDQGHCGSCWSFSATGSL 133

Query: 162 EGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKAS 221
           EG   + T +L SLSEQ LVDC  +  + GCEGGLM+ AF+++  NKG+ TEA YPY+A 
Sbjct: 134 EGQLFLKTGRLVSLSEQNLVDCSKTYGNSGCEGGLMNQAFQYVRDNKGIDTEASYPYEAR 193

Query: 222 DGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVAN-QPVSVAIDASGSDFQFYSSGVFT 280
           + +C  KE +       GY D+   +E  L  AVA   P+SV IDAS   FQFYS GV+ 
Sbjct: 194 ENNCRFKE-DKVGGTDKGYVDILEASEKDLQSAVATVGPISVRIDASHESFQFYSEGVYK 252

Query: 281 GQ-CG-TELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIA 338
            Q C  ++LDHGV  VGYGT ++G  YWLVKNSWG +WGE+GYI++ R+    +  CGIA
Sbjct: 253 EQYCSPSQLDHGVLTVGYGT-ENGQDYWLVKNSWGPSWGESGYIKIARN---HKNHCGIA 308

Query: 339 MQASYPTA 346
             ASYP  
Sbjct: 309 SMASYPVV 316


>gi|12847813|dbj|BAB27719.1| unnamed protein product [Mus musculus]
          Length = 334

 Score =  254 bits (650), Expect = 3e-65,   Method: Compositional matrix adjust.
 Identities = 147/323 (45%), Positives = 192/323 (59%), Gaps = 19/323 (5%)

Query: 32  DATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARN--KPYKLGINEF 89
           D T +     W + + R+Y  N E+E R  I+++N+  I   N +  N    + + +N F
Sbjct: 22  DQTFSAEWHQWKSTHRRLYGTN-EEEWRRAIWEKNMRMIQLHNGEYSNGQHGFSMEMNAF 80

Query: 90  ADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQC 149
            D TNEEFR   NGY+ +           +  +     +P S+DWR+KG VT VK+QGQC
Sbjct: 81  GDMTNEEFRQVVNGYRHQKHKKGRLFQEPLMLK-----IPKSVDWREKGCVTPVKNQGQC 135

Query: 150 GCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKG 209
           G CWAFSA   +EG   + T KL SLSEQ LVDC  +  +QGC GGLMD AF++I  N G
Sbjct: 136 GSCWAFSASGCLEGQMFLKTGKLISLSEQNLVDCSHAQGNQGCNGGLMDYAFQYIKENGG 195

Query: 210 LATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVAN-QPVSVAIDASG 268
           L +E  YPY+A DGSC K  A  + A  +G+ D+P   E ALMKAVA   P+SVA+DAS 
Sbjct: 196 LDSEESYPYEAKDGSC-KYRAEFAVANDTGFVDIPQ-QEKALMKAVATVGPISVAMDASH 253

Query: 269 SDFQFYSSGV-FTGQCGTE-LDHGVTAVGY---GTADDGTKYWLVKNSWGTTWGENGYIR 323
              QFYSSG+ +   C ++ LDHGV  VGY   GT  +  KYWLVKNSWG+ WG  GYI+
Sbjct: 254 PSLQFYSSGIYYEPNCSSKNLDHGVLLVGYGYEGTDSNKNKYWLVKNSWGSEWGMEGYIK 313

Query: 324 MQRDIDAKEGLCGIAMQASYPTA 346
           + +D D     CG+A  ASYP  
Sbjct: 314 IAKDRDNH---CGLATAASYPVV 333


>gi|18407678|ref|NP_566867.1| putative cysteine proteinase [Arabidopsis thaliana]
 gi|30315950|sp|Q9LXW3.1|CPR2_ARATH RecName: Full=Probable cysteine proteinase At3g43960; Flags:
           Precursor
 gi|7594557|emb|CAB88124.1| cysteine proteinase-like protein [Arabidopsis thaliana]
 gi|26452289|dbj|BAC43231.1| putative cysteine proteinase [Arabidopsis thaliana]
 gi|332644328|gb|AEE77849.1| putative cysteine proteinase [Arabidopsis thaliana]
          Length = 376

 Score =  254 bits (650), Expect = 3e-65,   Method: Compositional matrix adjust.
 Identities = 141/321 (43%), Positives = 193/321 (60%), Gaps = 15/321 (4%)

Query: 31  NDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFA 90
           N+  +   +E W+ + G+ Y    EKE RFKIFK+N++ I   N+   N+ Y+ G+N+F+
Sbjct: 33  NEGEVLTMYEQWLVENGKNYNGLGEKERRFKIFKDNLKRIEEHNSDP-NRSYERGLNKFS 91

Query: 91  DQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRY---ENASVPASIDWRKKGAVTG-VKDQ 146
           D T +EF+A   G K    S+     +DV+ RY   E   +P  +DWR++GAV   VK Q
Sbjct: 92  DLTADEFQASYLGGKMEKKSL-----SDVAERYQYKEGDVLPDEVDWRERGAVVPRVKRQ 146

Query: 147 GQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIIS 206
           G+CG CWAF+A  A+EGIN ITT +L SLSEQEL+DCD   ++ GC GG    AFEFI  
Sbjct: 147 GECGSCWAFAATGAVEGINQITTGELVSLSEQELIDCDRGNDNFGCAGGGAVWAFEFIKE 206

Query: 207 NKGLATEAKYPYKASD-GSCNKKEANPS-AAKISGYEDVPSNNEAALMKAVANQPVSVAI 264
           N G+ ++  Y Y   D  +C   E   +    I+G+E VP N+E +L KAVA QP+SV I
Sbjct: 207 NGGIVSDEVYGYTGEDTAACKAIEMKTTRVVTINGHEVVPVNDEMSLKKAVAYQPISVMI 266

Query: 265 DASGSDFQFYSSGVFTGQCGTEL-DHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIR 323
            A  ++   Y SGV+ G C     DH V  VGYGT+ D   YWL++NSWG  WGE GY+R
Sbjct: 267 SA--ANMSDYKSGVYKGACSNLWGDHNVLIVGYGTSSDEGDYWLIRNSWGPEWGEGGYLR 324

Query: 324 MQRDIDAKEGLCGIAMQASYP 344
           +QR+     G C +A+   YP
Sbjct: 325 LQRNFHEPTGKCAVAVAPVYP 345


>gi|74151179|dbj|BAE27712.1| unnamed protein product [Mus musculus]
          Length = 334

 Score =  254 bits (650), Expect = 3e-65,   Method: Compositional matrix adjust.
 Identities = 147/323 (45%), Positives = 192/323 (59%), Gaps = 19/323 (5%)

Query: 32  DATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARN--KPYKLGINEF 89
           D T +     W + + R+Y  N E+E R  I+++N+  I   N +  N    + + +N F
Sbjct: 22  DQTFSAEWHQWKSTHRRLYGTN-EEEWRRAIWEKNMRMIQLHNGEYSNGQHGFSMEMNAF 80

Query: 90  ADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQC 149
            D TNEEFR   NGY+ +           +  +     +P S+DWR+KG VT VK+QGQC
Sbjct: 81  GDMTNEEFRQVVNGYRHQKHKKGRLFQEPLMLK-----IPKSVDWREKGCVTPVKNQGQC 135

Query: 150 GCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKG 209
           G CWAFSA   +EG   + T KL SLSEQ LVDC  +  +QGC GGLMD AF++I  N G
Sbjct: 136 GSCWAFSASGCLEGQMFLKTGKLISLSEQNLVDCSHAQGNQGCNGGLMDFAFQYIKENGG 195

Query: 210 LATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVAN-QPVSVAIDASG 268
           L +E  YPY+A DGSC K  A  + A  +G+ D+P   E ALMKAVA   P+SVA+DAS 
Sbjct: 196 LDSEESYPYEAKDGSC-KYRAEFAVANDTGFVDIPQQEE-ALMKAVATVGPISVAMDASH 253

Query: 269 SDFQFYSSGV-FTGQCGTE-LDHGVTAVGY---GTADDGTKYWLVKNSWGTTWGENGYIR 323
              QFYSSG+ +   C ++ LDHGV  VGY   GT  +  KYWLVKNSWG+ WG  GYI+
Sbjct: 254 PSLQFYSSGIYYEPNCSSKNLDHGVLLVGYGYEGTDSNKNKYWLVKNSWGSEWGMEGYIK 313

Query: 324 MQRDIDAKEGLCGIAMQASYPTA 346
           + +D D     CG+A  ASYP  
Sbjct: 314 IAKDRDNH---CGLATAASYPVV 333


>gi|305434756|gb|ADM53740.1| cathepsin L1 precursor [Lepeophtheirus salmonis]
          Length = 325

 Score =  254 bits (650), Expect = 3e-65,   Method: Compositional matrix adjust.
 Identities = 145/314 (46%), Positives = 191/314 (60%), Gaps = 21/314 (6%)

Query: 41  MWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARN--KPYKLGINEFADQTNEEFR 98
           +W   +G+ Y     +E+R KIF+EN   I   N +A+N    Y L +N++ D    EF 
Sbjct: 23  LWTKLHGKTYTSFEIEELRVKIFEENRIKIQKHNAEAQNGLHTYSLEMNQYGDLLQSEFL 82

Query: 99  APRNGYKRRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAV 158
               G  +   S  ++   D S     A VP+ ++W K GAVT VKDQ  CG CWAFS  
Sbjct: 83  QGYTGLAKGSYSGDNTVILDNS-----APVPSYVNWTKNGAVTAVKDQKDCGSCWAFSTT 137

Query: 159 AAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPY 218
            ++EG   I  +KL S SEQ+LVDC +   ++GC GG MD+AF+++I+NKG+ATE  YPY
Sbjct: 138 GSVEGQYFIKNKKLLSFSEQQLVDCSSDFRNEGCNGGWMDNAFKYLIANKGIATEDTYPY 197

Query: 219 KASDGSC--NKKEANPSAAKISGYEDVPSNNEAALMKAVAN-QPVSVAIDASGSDFQFYS 275
            A+DG C  NK  A   A +IS ++DV   +E  L  AVA   P+SVAIDAS  DFQFY 
Sbjct: 198 TATDGVCVYNKTMA---AGRISSFKDVKHGSEDQLKLAVAQIGPISVAIDASSGDFQFYK 254

Query: 276 SGVFTG-QCGTE-LDHGVTAVGYGTADDGT--KYWLVKNSWGTTWGENGYIRMQRDIDAK 331
            GV+   +C ++ LDHGV AVGYGT D GT   YWLVKNSW  +WG+ GYI+M R+    
Sbjct: 255 KGVYVDEECSSKYLDHGVLAVGYGT-DKGTGLDYWLVKNSWSASWGDQGYIKMARN---H 310

Query: 332 EGLCGIAMQASYPT 345
           + +CGIA  ASYP 
Sbjct: 311 KNMCGIASLASYPV 324


>gi|74149661|dbj|BAE36450.1| unnamed protein product [Mus musculus]
          Length = 334

 Score =  254 bits (650), Expect = 3e-65,   Method: Compositional matrix adjust.
 Identities = 147/323 (45%), Positives = 192/323 (59%), Gaps = 19/323 (5%)

Query: 32  DATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARN--KPYKLGINEF 89
           D T +     W + + R+Y  N E+E R  I+++N+  I   N +  N    + + +N F
Sbjct: 22  DQTFSAEWHQWKSTHRRLYGTN-EEEWRRAIWEKNMRMIQLHNGEYSNGQHGFSMEMNAF 80

Query: 90  ADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQC 149
            D TNEEFR   NGY+ +           +  +     +P S+DWR+KG VT VK+QGQC
Sbjct: 81  GDMTNEEFRQVVNGYRHQKHKKGRLFQEPLMLK-----IPKSVDWREKGCVTPVKNQGQC 135

Query: 150 GCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKG 209
           G CWAFSA   +EG   + T KL SLSEQ LVDC  +  +QGC GGLMD AF++I  N G
Sbjct: 136 GSCWAFSASGCLEGQMFLKTGKLISLSEQNLVDCSHAQGNQGCNGGLMDFAFQYIKENGG 195

Query: 210 LATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVAN-QPVSVAIDASG 268
           L +E  YPY+A DGSC K  A  + A  +G+ D+P   E ALMKAVA   P+SVA+DAS 
Sbjct: 196 LDSEESYPYEAKDGSC-KYRAEFAVANGTGFVDIPQ-QEKALMKAVATVGPISVAMDASH 253

Query: 269 SDFQFYSSGV-FTGQCGTE-LDHGVTAVGY---GTADDGTKYWLVKNSWGTTWGENGYIR 323
              QFYSSG+ +   C ++ LDHGV  VGY   GT  +  KYWLVKNSWG+ WG  GYI+
Sbjct: 254 PSLQFYSSGIYYEPNCSSKNLDHGVLLVGYGYEGTDSNKNKYWLVKNSWGSEWGMEGYIK 313

Query: 324 MQRDIDAKEGLCGIAMQASYPTA 346
           + +D D     CG+A  ASYP  
Sbjct: 314 IAKDRDNH---CGLATAASYPVV 333


>gi|4886998|gb|AAD32136.1|AF121837_1 cathepsin L [Mus musculus]
 gi|4887000|gb|AAD32137.1|AF121838_1 cathepsin L [Mus musculus]
 gi|4887002|gb|AAD32138.1|AF121839_1 cathepsin L [Mus musculus]
 gi|200501|gb|AAA39984.1| preprocathepsin L precursor [Mus musculus]
          Length = 334

 Score =  254 bits (650), Expect = 3e-65,   Method: Compositional matrix adjust.
 Identities = 147/323 (45%), Positives = 192/323 (59%), Gaps = 19/323 (5%)

Query: 32  DATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARN--KPYKLGINEF 89
           D T +     W + + R+Y  N E+E R  I+++N+  I   N +  N    + + +N F
Sbjct: 22  DQTFSAEWHQWKSTHRRLYGTN-EEEWRRAIWEKNMRIIQLHNGEYSNGQHGFSMEMNAF 80

Query: 90  ADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQC 149
            D TNEEFR   NGY+ +           +  +     +P S+DWR+KG VT VK+QGQC
Sbjct: 81  GDMTNEEFRQVVNGYRHQKHKKGRLFQEPLMLK-----IPKSVDWREKGCVTPVKNQGQC 135

Query: 150 GCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKG 209
           G CWAFSA   +EG   + T KL SLSEQ LVDC  +  +QGC GGLMD AF++I  N G
Sbjct: 136 GSCWAFSASGCLEGQMFLKTGKLISLSEQNLVDCSHAQGNQGCNGGLMDFAFQYIKENGG 195

Query: 210 LATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVAN-QPVSVAIDASG 268
           L +E  YPY+A DGSC K  A  + A  +G+ D+P   E ALMKAVA   P+SVA+DAS 
Sbjct: 196 LDSEESYPYEAKDGSC-KYRAEFAVANDTGFVDIPQ-QEKALMKAVATVGPISVAMDASH 253

Query: 269 SDFQFYSSGV-FTGQCGTE-LDHGVTAVGY---GTADDGTKYWLVKNSWGTTWGENGYIR 323
              QFYSSG+ +   C ++ LDHGV  VGY   GT  +  KYWLVKNSWG+ WG  GYI+
Sbjct: 254 PSLQFYSSGIYYEPNCSSKNLDHGVLLVGYGYEGTDSNKNKYWLVKNSWGSEWGMEGYIK 313

Query: 324 MQRDIDAKEGLCGIAMQASYPTA 346
           + +D D     CG+A  ASYP  
Sbjct: 314 IAKDRDNH---CGLATAASYPVV 333


>gi|161408097|dbj|BAF94152.1| cathepsin L-like cysteine protease 2 [Plautia stali]
          Length = 334

 Score =  254 bits (650), Expect = 3e-65,   Method: Compositional matrix adjust.
 Identities = 147/344 (42%), Positives = 201/344 (58%), Gaps = 17/344 (4%)

Query: 6   LENKLVLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKE 65
           ++  LVL+ ++ LG       + +  D + +E   ++   + + Y +  E+  R KIF E
Sbjct: 1   MQGLLVLSCLIALG------QAVSFFDLSADE-FTLFKKFHRKEYDNELEESYRKKIFLE 53

Query: 66  NVEYIASFNNKARNK--PYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRY 123
           N + I   N++ +     +KL +N  AD    E+     G+ +   +  +   +      
Sbjct: 54  NKKRIEKHNSRYKQGKVSFKLKLNHLADMLIHEYSDVYLGFNKSSKANNNKLQSYTFIPP 113

Query: 124 ENASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDC 183
            + ++   +DWR KGAVT VK+QG CG CWAFS   A+EG N   T KL SLSEQ LVDC
Sbjct: 114 AHVTLNKEVDWRTKGAVTPVKNQGHCGSCWAFSTTGALEGQNFRKTGKLVSLSEQNLVDC 173

Query: 184 DTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDV 243
             S  + GCEGGLMD+AF++I  N G+ TE  YPY+  D +C  ++ +  A   SG+ D+
Sbjct: 174 SGSYGNNGCEGGLMDNAFQYIKENHGIDTEKSYPYEGEDETCRFRKTSIGATD-SGFVDI 232

Query: 244 PSNNEAALMKAVAN-QPVSVAIDASGSDFQFYSSGV-FTGQCGTE-LDHGVTAVGYGTAD 300
              +E ALM+AVA   P+SVAIDAS   FQFYS GV +  +C +E LDHGV  VGYG  +
Sbjct: 233 TQGDEEALMQAVATIGPISVAIDASHQSFQFYSEGVYYEPECSSENLDHGVLVVGYG-VE 291

Query: 301 DGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
           D  KYWLVKNSWGT WG+ GYI+M RD D     CGIA QASYP
Sbjct: 292 DNQKYWLVKNSWGTQWGDGGYIKMARDQDNN---CGIATQASYP 332


>gi|226443040|ref|NP_001140018.1| Cathepsin L1 precursor [Salmo salar]
 gi|221221188|gb|ACM09255.1| Cathepsin L1 precursor [Salmo salar]
          Length = 338

 Score =  254 bits (650), Expect = 4e-65,   Method: Compositional matrix adjust.
 Identities = 153/350 (43%), Positives = 200/350 (57%), Gaps = 33/350 (9%)

Query: 11  VLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYI 70
           +  A+LVL V A  +  R   D+ + +   +W   + + Y + +E+  R  ++++N++ I
Sbjct: 4   LYLAVLVLCVSAVCAAPRF--DSQLEDHWHLWKNWHSKSYHE-SEEGWRRMVWEKNLKKI 60

Query: 71  ASFN--NKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFR------ 122
              N  +      Y+LG+N F D TNEEFR   NGYK         +TT+  F+      
Sbjct: 61  EMHNLEHTMGKHSYRLGMNHFGDMTNEEFRQTMNGYK---------QTTERKFKGSLFME 111

Query: 123 --YENASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQEL 180
             Y  A  P ++DWR+KG VT VKDQG CG CWAFS   AMEG     T KL SLSEQ L
Sbjct: 112 PNYLQA--PKAVDWREKGYVTPVKDQGSCGSCWAFSTTGAMEGQQFRKTGKLVSLSEQNL 169

Query: 181 VDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGY 240
           VDC     ++GC GGLMD AF++I  N GL TE  YPY  +D      +   S A  +G+
Sbjct: 170 VDCSRPEGNEGCNGGLMDQAFQYIQDNAGLDTEESYPYVGTDEDPCHYKPEFSGANETGF 229

Query: 241 EDVPSNNEAALMKAVAN-QPVSVAIDASGSDFQFYSSGV-FTGQCGT-ELDHGVTAVGYG 297
            D+PS  E A+MKAVA   PVSVAIDA    FQFY SG+ +  +C + ELDHGV  VGYG
Sbjct: 230 VDIPSGKEHAMMKAVAAVGPVSVAIDAGHESFQFYESGIYYEKECSSEELDHGVLVVGYG 289

Query: 298 TAD---DGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
                 DG KYW+VKNSW   WG+ GYI M +D   ++  CGIA  +SYP
Sbjct: 290 FEGEDVDGKKYWIVKNSWSEKWGDKGYIYMAKD---RKNHCGIATASSYP 336


>gi|340370276|ref|XP_003383672.1| PREDICTED: digestive cysteine proteinase 2-like [Amphimedon
           queenslandica]
          Length = 327

 Score =  254 bits (650), Expect = 4e-65,   Method: Compositional matrix adjust.
 Identities = 136/308 (44%), Positives = 191/308 (62%), Gaps = 12/308 (3%)

Query: 42  WMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPR 101
           W  +Y +VY     +  R  I++ N +++ + N  +    + + +NEFAD    EF    
Sbjct: 27  WKVKYNKVYETKETELERQIIWESNKKFVENHNANSDKFGFTVAMNEFADLDAGEFGRIF 86

Query: 102 NGYKRRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAM 161
           NG    LP   S  +T++ ++     VP ++DW++KGAVT +K+QGQCG CW+FS+  ++
Sbjct: 87  NGL---LPRPSSYNSTNI-YKPSGVKVPDTVDWKEKGAVTPIKNQGQCGSCWSFSSTGSL 142

Query: 162 EGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKAS 221
           EG + I T  L SLSEQ+L+DC T   + GC GGLMD++F ++ S  G  TE  YPY A 
Sbjct: 143 EGQHFINTGTLVSLSEQQLMDCSTKYGNHGCNGGLMDNSFRYLKSVAGDETEDNYPYTAE 202

Query: 222 DGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVAN-QPVSVAIDASGSDFQFYSSGVFT 280
           +G C + +++ +      Y D+P  +E +L  AVAN  P+SVAIDAS S FQ Y+SGV+ 
Sbjct: 203 NGVC-RYDSSLAVVTDKSYVDIPQGDEDSLKDAVANVGPISVAIDASHSSFQLYNSGVYY 261

Query: 281 GQC--GTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIA 338
                 T+LDHGV A+GYGT +DG  YWLVKNSWGT+WG  GYI+M R+   +   CGIA
Sbjct: 262 ASTCSSTQLDHGVLAIGYGT-EDGKDYWLVKNSWGTSWGMEGYIKMSRN---RNNNCGIA 317

Query: 339 MQASYPTA 346
            QASYPT 
Sbjct: 318 TQASYPTG 325


>gi|229367042|gb|ACQ58501.1| Cathepsin L precursor [Anoplopoma fimbria]
          Length = 334

 Score =  254 bits (650), Expect = 4e-65,   Method: Compositional matrix adjust.
 Identities = 153/323 (47%), Positives = 184/323 (56%), Gaps = 39/323 (12%)

Query: 42  WMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARN--KPYKLGINEFADQTNEEFR- 98
           W  Q+GR Y   AE+  R +I+  N   +   N  A    K Y+LG+  FAD  NEE++ 
Sbjct: 29  WKLQFGRSYNSPAEEAQRKEIWLSNRRLVLVHNIMADQGIKSYRLGMTYFADMENEEYKR 88

Query: 99  -------------APRNGYKR-RLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVK 144
                         PR G    RLP              E A +P S+DWR+KG VT VK
Sbjct: 89  QISQGCLGSFNASLPRRGSAYLRLP--------------EGADLPNSVDWREKGYVTEVK 134

Query: 145 DQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFI 204
           DQ QCG CWAFS   ++EG     T KL SLSEQ+LVDC     ++GC GGLMD AF +I
Sbjct: 135 DQKQCGSCWAFSTTGSLEGQTFRKTGKLVSLSEQQLVDCSGDYGNEGCMGGLMDSAFRYI 194

Query: 205 ISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVAN-QPVSVA 263
            +N G+ TE  YPY+A DG C    AN   A  +GY DV   +E AL +AVA   PVSVA
Sbjct: 195 QANGGIDTEDSYPYEAEDGQCRYNSANI-GATCTGYVDVKQGDEDALKEAVATIGPVSVA 253

Query: 264 IDASGSDFQFYSSGVF-TGQC-GTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGY 321
           IDAS S FQ Y SGV+   +C  +ELDHGV AVGYG+ D+G  YWLVKNSWG  WG  GY
Sbjct: 254 IDASHSSFQLYESGVYDEPECSSSELDHGVLAVGYGS-DNGHDYWLVKNSWGLGWGNKGY 312

Query: 322 IRMQRDIDAKEGLCGIAMQASYP 344
           I M R+   K   CGIA  +SYP
Sbjct: 313 IMMTRN---KHNQCGIATASSYP 332


>gi|21593501|gb|AAM65468.1| cysteine proteinase [Arabidopsis thaliana]
          Length = 376

 Score =  254 bits (649), Expect = 4e-65,   Method: Compositional matrix adjust.
 Identities = 141/321 (43%), Positives = 193/321 (60%), Gaps = 15/321 (4%)

Query: 31  NDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFA 90
           N+  +   +E W+ + G+ Y    EKE RFKIFK+N++ I   N+   N+ Y+ G+N+F+
Sbjct: 33  NEGGVLTMYEQWLVENGKNYNGLGEKERRFKIFKDNLKRIEEHNSDP-NRSYERGLNKFS 91

Query: 91  DQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRY---ENASVPASIDWRKKGAVTG-VKDQ 146
           D T +EF+A   G K    S+     +DV+ RY   E   +P  +DWR++GAV   VK Q
Sbjct: 92  DLTADEFQASYLGGKMEKKSL-----SDVAERYQYKEGDVLPDEVDWRERGAVVPRVKRQ 146

Query: 147 GQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIIS 206
           G+CG CWAF+A  A+EGIN ITT +L SLSEQEL+DCD   ++ GC GG    AFEFI  
Sbjct: 147 GECGSCWAFAATGAVEGINQITTGELVSLSEQELIDCDRGNDNFGCAGGGAVWAFEFIKE 206

Query: 207 NKGLATEAKYPYKASD-GSCNKKEANPS-AAKISGYEDVPSNNEAALMKAVANQPVSVAI 264
           N G+ ++  Y Y   D  +C   E   +    I+G+E VP N+E +L KAVA QP+SV I
Sbjct: 207 NGGIVSDEVYGYTGEDTAACKAIEMKTTRVVTINGHEVVPVNDEMSLKKAVAYQPISVMI 266

Query: 265 DASGSDFQFYSSGVFTGQCGTEL-DHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIR 323
            A  ++   Y SGV+ G C     DH V  VGYGT+ D   YWL++NSWG  WGE GY+R
Sbjct: 267 SA--ANMSDYKSGVYKGACSNLWGDHNVLIVGYGTSSDEGDYWLIRNSWGPEWGEGGYLR 324

Query: 324 MQRDIDAKEGLCGIAMQASYP 344
           +QR+     G C +A+   YP
Sbjct: 325 LQRNFHEPTGKCAVAVAPVYP 345


>gi|118424553|gb|ABK90824.1| cathepsin L-like cysteine proteinase [Spodoptera exigua]
          Length = 344

 Score =  254 bits (649), Expect = 4e-65,   Method: Compositional matrix adjust.
 Identities = 144/316 (45%), Positives = 185/316 (58%), Gaps = 22/316 (6%)

Query: 45  QYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNK--PYKLGINEFADQTNEEFRAPRN 102
           ++ + Y    E + R KI+ EN   I   N +   +   YKL  N++AD  + EF    N
Sbjct: 33  EHSKQYDSEVEDKFRMKIYVENKHRITKHNQRFEQRLVSYKLKPNKYADMLHHEFVHTMN 92

Query: 103 GYKR------RLPSVRSSETTDVSFRY---ENASVPASIDWRKKGAVTGVKDQGQCGCCW 153
           G+ +      R  +V        +  +    + S P  +DWRKKGAVT VKDQG+CG CW
Sbjct: 93  GFNKTAKHGGRNKNVHGKGHDGRAATFIAPAHVSYPDHVDWRKKGAVTDVKDQGKCGSCW 152

Query: 154 AFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATE 213
           AFS   A+EG +   T  L SLSEQ L+DC  +  + GC GGLMD+AF++I  N G+ TE
Sbjct: 153 AFSTTGALEGQHFRKTGYLVSLSEQNLIDCSAAYGNNGCNGGLMDNAFKYIKDNGGIDTE 212

Query: 214 AKYPYKASDGSC--NKKEANPSAAKISGYEDVPSNNEAALMKAVAN-QPVSVAIDASGSD 270
             YPY+A D  C  N KE   S A   G+ D+P  +E  LM+AVA   P+SVAIDAS   
Sbjct: 213 KSYPYEAVDDKCRYNPKE---SGADDVGFVDIPQGDEEKLMQAVATVGPISVAIDASQET 269

Query: 271 FQFYSSGVFTGQ--CGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDI 328
           FQFYS GV+  +    T+LDHGV  VGYGT +DG+  WLVKNSWG +WGE GYI+M R+ 
Sbjct: 270 FQFYSKGVYYDENCSSTDLDHGVMVVGYGTEEDGSDDWLVKNSWGRSWGELGYIKMARN- 328

Query: 329 DAKEGLCGIAMQASYP 344
             K   CGIA  ASYP
Sbjct: 329 --KNNHCGIASSASYP 342


>gi|391328505|ref|XP_003738729.1| PREDICTED: cathepsin L-like [Metaseiulus occidentalis]
          Length = 323

 Score =  254 bits (649), Expect = 4e-65,   Method: Compositional matrix adjust.
 Identities = 149/349 (42%), Positives = 202/349 (57%), Gaps = 33/349 (9%)

Query: 5   LLENKLVLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFK 64
           +L    +L A+L L   A  +W             +++   +G+ Y  + E+  R ++F 
Sbjct: 1   MLRTTAILVALLGL---ASANW-------------DLYKKVHGKSYGHD-EEHFRRQLFY 43

Query: 65  ENVEYIASFN--NKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFR 122
           ++V  I + N  +      Y++G+N+F D T+EEFR     +K        ++     F+
Sbjct: 44  KSVAKINAHNLRHDLGLTTYRMGLNKFTDMTSEEFR----NFKGLKFDATKTKRNGTRFQ 99

Query: 123 YE--NASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQEL 180
            E    ++P  +DWR+KG VT VK+QGQCG CWAFS   ++EG +   T KL SLSEQ L
Sbjct: 100 KELLGEALPTQVDWREKGYVTPVKNQGQCGSCWAFSTTGSLEGQHFKATGKLVSLSEQNL 159

Query: 181 VDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGY 240
           VDC     + GC GGLMD+ F +I  N G+ TE  YPY   DG C   E N   A++ G+
Sbjct: 160 VDCSRVEGNNGCNGGLMDNGFTYIQQNGGIDTEESYPYTGKDGDCAFNE-NSVGARVKGF 218

Query: 241 EDVPSNNEAALMKAVAN-QPVSVAIDASGSDFQFYSSGVF-TGQCG-TELDHGVTAVGYG 297
            DVP  +EAAL  AVA+  PVSVAIDAS   FQ+Y  GV+    C  ++LDHGV  VGYG
Sbjct: 219 VDVPQRDEAALQAAVASVGPVSVAIDASNDSFQYYKEGVYDEPSCSFSQLDHGVLVVGYG 278

Query: 298 TADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPTA 346
           T ++G  YWLVKNSWG TWG++GYI+M R+   KE  CGIA  ASYPT 
Sbjct: 279 T-ENGVDYWLVKNSWGPTWGQDGYIKMMRN---KENQCGIASMASYPTV 323


>gi|147836416|emb|CAN75313.1| hypothetical protein VITISV_033592 [Vitis vinifera]
          Length = 201

 Score =  254 bits (649), Expect = 4e-65,   Method: Compositional matrix adjust.
 Identities = 123/186 (66%), Positives = 146/186 (78%), Gaps = 6/186 (3%)

Query: 79  NKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENAS-VPASIDWRKK 137
           +K YKL INEFAD TNEEFRA RN +K  + S  ++     SF+YE+ + VP+++DWRKK
Sbjct: 2   DKSYKLSINEFADLTNEEFRASRNRFKAHICSTEAT-----SFKYEHVTAVPSTVDWRKK 56

Query: 138 GAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLM 197
           GAVT +KDQGQCG CWAFSAVAAMEGI  ++T KL SLSEQELVDCDTSGEDQGC GGLM
Sbjct: 57  GAVTPIKDQGQCGSCWAFSAVAAMEGITQLSTGKLISLSEQELVDCDTSGEDQGCNGGLM 116

Query: 198 DDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVAN 257
           DDAF+FI  N GL TEA YPY  +DG+CN K+A   AAKI+GYEDVP+NNE AL KAVA+
Sbjct: 117 DDAFKFIEQNHGLTTEANYPYAGTDGTCNNKKAAHPAAKINGYEDVPANNEKALQKAVAH 176

Query: 258 QPVSVA 263
             +S +
Sbjct: 177 LAISTS 182


>gi|37994576|gb|AAH60335.1| Unknown (protein for MGC:68554) [Xenopus laevis]
          Length = 335

 Score =  254 bits (649), Expect = 4e-65,   Method: Compositional matrix adjust.
 Identities = 148/343 (43%), Positives = 198/343 (57%), Gaps = 22/343 (6%)

Query: 10  LVLAAILVLGVWAPQSWSRTLNDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEY 69
           L++ AI +  ++A  +      D  ++     W   + + Y    E   R  ++++N++ 
Sbjct: 5   LIIGAICLTTLYAAPA-----TDPALDNHWYSWKDWHKKTYAPKEEGWRRV-LWEKNLKM 58

Query: 70  IASFN--NKARNKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENAS 127
           I   N  +      Y+LG+N+F D TNEEF+   NGYK +   +R S          N  
Sbjct: 59  IEFHNLDHSLGKHSYRLGMNQFGDMTNEEFKQLMNGYKNQ-KMIRGS----TFLAPNNFE 113

Query: 128 VPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSG 187
            P S+DWRKKG VT VKDQGQCG CWAFS   A+EG ++  T KL SLSEQ LVDC  + 
Sbjct: 114 APKSVDWRKKGYVTPVKDQGQCGSCWAFSTTGALEGQHYRKTSKLISLSEQNLVDCSRAQ 173

Query: 188 EDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNN 247
            ++GC GGLMD AF+++  N G+ +E  YPY A D      + N ++A  +G+ DV S  
Sbjct: 174 GNEGCNGGLMDQAFQYVKDNGGIDSEDSYPYTAKDDQECHYDPNNNSANDTGFVDVQSGC 233

Query: 248 EAALMKAVAN-QPVSVAIDASGSDFQFYSSGV-FTGQCGTE-LDHGVTAVGYGTAD---D 301
           E  LMKAVA+  PVSVAIDA    FQFY SG+ +  +C +E LDHGV  VGYG      D
Sbjct: 234 EKDLMKAVASVGPVSVAIDAGHQSFQFYQSGIYYEPECSSEDLDHGVLVVGYGFESEDVD 293

Query: 302 GTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
           G KYW+VKNSW   WG+NGYI + +D   +   CGIA  ASYP
Sbjct: 294 GKKYWIVKNSWSEKWGDNGYINIAKD---RHNHCGIATAASYP 333


>gi|229893789|gb|ACQ90252.1| cathepsin L [Pinctada fucata]
          Length = 362

 Score =  254 bits (649), Expect = 5e-65,   Method: Compositional matrix adjust.
 Identities = 144/314 (45%), Positives = 191/314 (60%), Gaps = 14/314 (4%)

Query: 39  HEMWM---AQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKAR--NKPYKLGINEFADQT 93
           HE W      +G+VY    E+  RF IF++ +E I   N K     K Y +G+N+F+D +
Sbjct: 51  HETWKEFKTLFGKVYDTVEEEIKRFDIFRDTLERIEEHNRKYHMGQKSYYMGVNQFSDMS 110

Query: 94  NEEFRAPRNGYKRRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQCGCCW 153
           ++E+    NG +R        E  D S+      +   +DWR KG VT VK+QGQCG CW
Sbjct: 111 HDEY-LRHNGLRRGNRKYSKGEGCD-SYTKSGKQLDDKVDWRDKGYVTPVKNQGQCGSCW 168

Query: 154 AFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATE 213
           +FS   ++EG +   T KL SLSEQ+LVDC  +  ++GC GGLMD+AFE+I S  GL  E
Sbjct: 169 SFSTTGSLEGQHFRQTGKLISLSEQQLVDCSGTFGNEGCNGGLMDNAFEYIKSIGGLEGE 228

Query: 214 AKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVAN-QPVSVAIDASGSDFQ 272
             YPY A  G C+ K++    A  +G  DV S +E AL  A+A+  P+SVAIDAS + FQ
Sbjct: 229 DDYPYTAKQGKCHLKKS-LFKANDTGCTDVESGDEDALKDALASVGPISVAIDASHASFQ 287

Query: 273 FYSSGVF-TGQCGTE-LDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDA 330
            Y  GV+   +C ++ LDHGV  VGYGT ++G  YWLVKNSWG  WGE GYI+M R+ D 
Sbjct: 288 SYDGGVYDEEECSSQNLDHGVLTVGYGTEENGGDYWLVKNSWGEMWGEEGYIKMSRNKDN 347

Query: 331 KEGLCGIAMQASYP 344
           +   CGIA QASYP
Sbjct: 348 Q---CGIATQASYP 358


>gi|395740610|ref|XP_002819972.2| PREDICTED: cathepsin L1 [Pongo abelii]
          Length = 333

 Score =  254 bits (648), Expect = 5e-65,   Method: Compositional matrix adjust.
 Identities = 154/348 (44%), Positives = 199/348 (57%), Gaps = 25/348 (7%)

Query: 8   NKLVLAAILVLGVWAPQSWSRTLN-DATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKEN 66
           N  +  A   LG+      S TL  D ++  R   W A + R+Y  N E+  R  ++++N
Sbjct: 2   NPTLFLAAFCLGIA-----SATLTFDHSLEARWTKWKAMHNRLYGMN-EEGWRRAVWEKN 55

Query: 67  VEYIASFNNKAR--NKPYKLGINEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYE 124
           ++ I   N + R     + + +N F D T+EEFR   NG++ R P  R  +       YE
Sbjct: 56  MKMIELHNQEYREGKHSFTMAMNAFGDMTSEEFRQVMNGFQNRKP--RKGKVFQEPLFYE 113

Query: 125 NASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCD 184
               P S+DWR+KG VT VK+QGQCG CWAFSA  A+EG     T KL SLSEQ LVDC 
Sbjct: 114 ---APRSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLISLSEQNLVDCS 170

Query: 185 TSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDVP 244
               ++GC GGLMD AF+++  N GL +E  YPY+A++ SC K     S A  +G+ D+P
Sbjct: 171 GPQGNEGCNGGLMDYAFQYVQDNGGLDSEESYPYEATEESC-KYNPKYSVANDTGFVDIP 229

Query: 245 SNNEAALMKAVAN-QPVSVAIDASGSDFQFYSSGV-FTGQCGTE-LDHGVTAVGYG---T 298
              E ALMKAVA   P+SVAIDA    F FY  G+ F   C +E +DHGV  VGYG   T
Sbjct: 230 K-QEKALMKAVATVGPISVAIDAGHESFLFYKEGIYFEPDCSSEDMDHGVLVVGYGFEST 288

Query: 299 ADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYPTA 346
             D  KYWLVKNSWG  WG  GY++M +D   +   CGIA  ASYPT 
Sbjct: 289 ESDNNKYWLVKNSWGEEWGMGGYVKMAKD---RRNHCGIASAASYPTV 333


>gi|47230018|emb|CAG10432.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 294

 Score =  254 bits (648), Expect = 5e-65,   Method: Compositional matrix adjust.
 Identities = 151/305 (49%), Positives = 186/305 (60%), Gaps = 27/305 (8%)

Query: 54  AEKEMRFKIFKENVEYIASFNNKARN--KPYKLGINEFADQTNEEFRAPRNGYKRRLPSV 111
           +E+  R +I+  N + +   N  A    K Y+LG+ +FAD  NEE++        RL S+
Sbjct: 1   SEEAARRQIWLSNRKLVLVHNILADQGIKSYRLGMTQFADMDNEEYK--------RLISL 52

Query: 112 RSSETTDVS--------FRY-ENASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAME 162
                 + S        FR  E   +P ++DWR KG VTGVKDQ QCG CWAFSA  ++E
Sbjct: 53  GCLGAFNASAPRKGSAFFRLAEGTPLPTTVDWRDKGYVTGVKDQKQCGSCWAFSATGSLE 112

Query: 163 GINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASD 222
           G N+  T KL SLSEQ+LVDC     + GC GGLMD AF++I  N G+ TE  YPY+A D
Sbjct: 113 GQNYRKTGKLVSLSEQQLVDCSGDYGNMGCGGGLMDSAFKYIQENGGIDTEESYPYEAED 172

Query: 223 GSCNKKEANPSAAKISGYEDVPSNNEAALMKAVAN-QPVSVAIDASGSDFQFYSSGVFTG 281
           G C  K  N   AK +GY DV + +E AL +AVA   PVSVAIDAS S FQ Y SGV+  
Sbjct: 173 GKCRFKPQNI-GAKCTGYVDVTAGDEDALKEAVATIGPVSVAIDASHSSFQLYESGVYDE 231

Query: 282 -QCGTE-LDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAM 339
            +C +E LDHGV AVGYGT D+G  YWLVKNSWG  WG+ GYI M R+   K   CGIA 
Sbjct: 232 LECSSEDLDHGVLAVGYGT-DNGQDYWLVKNSWGLGWGQKGYIMMSRN---KHNQCGIAS 287

Query: 340 QASYP 344
            ASYP
Sbjct: 288 MASYP 292


>gi|94480716|emb|CAI91577.1| cathepsin L [Aphrocallistes vastus]
          Length = 329

 Score =  254 bits (648), Expect = 5e-65,   Method: Compositional matrix adjust.
 Identities = 143/308 (46%), Positives = 186/308 (60%), Gaps = 14/308 (4%)

Query: 40  EMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRA 99
           E W  +Y R Y    ++E+R KI+  N+ Y+  FN  A    YKL  N+FAD TN E+R 
Sbjct: 31  EGWKLKYNRSY--GLDEELRKKIWANNMLYVKEFN--AEGHSYKLAANQFADLTNLEYRQ 86

Query: 100 PRNGYKRRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVA 159
              GY       R  E      + ++  +P ++DWR KG VT VK+QGQCG CW+FSA  
Sbjct: 87  IYLGYDNEARLSRKREGKVFQRKMKDEDLPTTVDWRSKGVVTPVKNQGQCGSCWSFSATG 146

Query: 160 AMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYK 219
           ++EG   I + KL S SEQELVDC TS  + GC+GGLMD AF++  +N     E+ Y Y 
Sbjct: 147 SLEGQYAIKSGKLVSFSEQELVDCSTSLGNHGCQGGLMDYAFKYWETNLA-EKESDYTYT 205

Query: 220 ASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQ-PVSVAIDASGSDFQFYSSGV 278
           A +G C K  A     K S + D+PS N  AL +AVAN+ P++VA+DAS + FQ Y SG+
Sbjct: 206 AKNGKC-KYNAQLGVTKDSSFTDIPSENCDALKEAVANKGPIAVAMDASHTSFQMYHSGI 264

Query: 279 FTG-QCG-TELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCG 336
           +T   C  T+LDHGV  VGYGT D+G  YWL+KNSWG  WG +GY +    I+ K   CG
Sbjct: 265 YTPFLCSKTKLDHGVLVVGYGT-DNGVDYWLIKNSWGMAWGMDGYFK----IEMKSDKCG 319

Query: 337 IAMQASYP 344
           I  QASYP
Sbjct: 320 ICTQASYP 327


>gi|18308182|gb|AAL67857.1|AF462309_1 cysteine proteinase [Acanthamoeba healyi]
          Length = 330

 Score =  254 bits (648), Expect = 5e-65,   Method: Compositional matrix adjust.
 Identities = 140/323 (43%), Positives = 192/323 (59%), Gaps = 18/323 (5%)

Query: 33  ATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFN------NKARNKPYKLGI 86
           +T+   H+     + +  R+N +   RF     N E+I  +N      +  +NK Y L +
Sbjct: 17  STLAATHDPLTGVFAKWMRENTKSNYRF--VYSNEEFIYRWNVWRDEEHNRQNKSYFLAM 74

Query: 87  NEFADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQ 146
           N+F D TN EF     G        + ++    +       +P+  DWR+KGAVT VK+Q
Sbjct: 75  NQFGDLTNAEFNRLFKGLA--FDYSKHAKIHTAAPEAPATGIPSEFDWRQKGAVTHVKNQ 132

Query: 147 GQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIIS 206
           GQCG CW+FS   + EG N + T +L SLSEQ L+DC  S  + GC GGLMD AFE+II+
Sbjct: 133 GQCGSCWSFSTTGSTEGANFLKTGRLVSLSEQNLIDCSVSYGNNGCNGGLMDYAFEYIIN 192

Query: 207 NKGLATEAKYPYK-ASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAID 265
           N+G+ TEA YPY+ A   +C    AN   + ++GY DV S +E AL+ A   +PVSVAID
Sbjct: 193 NRGIDTEASYPYQTAGPLTCQYNAANKGGS-LTGYTDVTSGDENALLNAAVKEPVSVAID 251

Query: 266 ASGSDFQFYSSGVF--TGQCGTELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIR 323
           AS + FQFYS GV+  +    T+LDHGV  VG+G+ ++G  +W VKNSWG +WG NGYI+
Sbjct: 252 ASHNSFQFYSGGVYYESACSSTQLDHGVLVVGWGS-ENGQDFWWVKNSWGASWGLNGYIK 310

Query: 324 MQRDIDAKEGLCGIAMQASYPTA 346
           M R+   +   CGIA  ASYPTA
Sbjct: 311 MSRN---QNNNCGIATAASYPTA 330


>gi|330805277|ref|XP_003290611.1| hypothetical protein DICPUDRAFT_81345 [Dictyostelium purpureum]
 gi|325079250|gb|EGC32859.1| hypothetical protein DICPUDRAFT_81345 [Dictyostelium purpureum]
          Length = 330

 Score =  254 bits (648), Expect = 5e-65,   Method: Compositional matrix adjust.
 Identities = 136/307 (44%), Positives = 192/307 (62%), Gaps = 15/307 (4%)

Query: 42  WMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPR 101
           WM ++ R Y  + E   +++ FK+N+++I ++N   +N    LG+ +FAD TNEE+R   
Sbjct: 36  WMKKHDRSYHHH-EFNNKYQAFKDNMDFIHNWNTN-KNSKTVLGLTQFADLTNEEYRKIY 93

Query: 102 NGYKRRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAM 161
            G K  +   + +      F   + + P SIDWR KGAV+ VKDQGQCG CW+FS   ++
Sbjct: 94  LGTKVNVAPEKHN------FNMIHFTGPDSIDWRTKGAVSHVKDQGQCGSCWSFSTTGSV 147

Query: 162 EGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKAS 221
           EG + I T  + +LSEQ LVDC     + GC+GGLM +AF+FI+S  G+ATE  YPY A 
Sbjct: 148 EGAHQIKTGNMVTLSEQNLVDCSGKFGNNGCDGGLMVNAFKFIMSQGGVATEDSYPYNAV 207

Query: 222 DGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVF-T 280
            G C K   +   A ISGY+++   +E  L  A+  QPVS+AIDAS   FQ Y SGV+  
Sbjct: 208 QGKC-KFTKSMVGANISGYKEITQGSELELQAALTKQPVSIAIDASQQSFQLYKSGVYDE 266

Query: 281 GQCGT-ELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAM 339
            +C + +LDHGV AVGYGT ++G  Y++VKNSW  +WG++GYI M R+   +   CG+A 
Sbjct: 267 PECSSYQLDHGVLAVGYGT-ENGKDYYIVKNSWADSWGQDGYIFMSRNAKNQ---CGVAT 322

Query: 340 QASYPTA 346
            ASYP +
Sbjct: 323 MASYPIS 329


>gi|330805275|ref|XP_003290610.1| hypothetical protein DICPUDRAFT_98747 [Dictyostelium purpureum]
 gi|325079249|gb|EGC32858.1| hypothetical protein DICPUDRAFT_98747 [Dictyostelium purpureum]
          Length = 334

 Score =  254 bits (648), Expect = 6e-65,   Method: Compositional matrix adjust.
 Identities = 135/307 (43%), Positives = 189/307 (61%), Gaps = 11/307 (3%)

Query: 42  WMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNKPYKLGINEFADQTNEEFRAPR 101
           WM ++ + Y  + E   +++ FK+N+++I ++N+K  +    LG+N FAD TNEE++   
Sbjct: 37  WMKKHNKAYHHH-EFNDKYQTFKDNMDFIHNWNSKESDTV--LGLNRFADLTNEEYKKTY 93

Query: 102 NGYKRRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAM 161
            G    + ++R+++       +E  + P+SIDWR+ GAV  VKDQG CG CWAF+   A+
Sbjct: 94  LGMSINV-NLRANQVPMNGLNFERFTGPSSIDWRQNGAVAYVKDQGHCGSCWAFATTGAV 152

Query: 162 EGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKAS 221
           EG + I T  + + SEQ LVDC     + GC+GGLM  AF++II N G+ATE  YPY A+
Sbjct: 153 EGAHQIKTGNMVTFSEQHLVDCSGRYGNNGCDGGLMTSAFKYIIDNDGIATEEAYPYTAT 212

Query: 222 DGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFT- 280
              C         A ISGY+DVP  +E+AL  A++ QPV+VAIDAS   FQ Y SGV+  
Sbjct: 213 QNRCVYNTTMLGTA-ISGYKDVPRGSESALTAAISKQPVAVAIDASPITFQLYKSGVYQE 271

Query: 281 GQCGT-ELDHGVTAVGYGTADDGTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAM 339
             C +  L+HGV AVGYGT  +G  Y++VKNSW  TWG  GYI M R+ +     CGIA 
Sbjct: 272 ATCSSYRLNHGVLAVGYGTL-EGKDYYIVKNSWAETWGNQGYILMARNANNH---CGIAT 327

Query: 340 QASYPTA 346
            ASY + 
Sbjct: 328 MASYASV 334


>gi|74213650|dbj|BAE35627.1| unnamed protein product [Mus musculus]
          Length = 334

 Score =  254 bits (648), Expect = 6e-65,   Method: Compositional matrix adjust.
 Identities = 147/323 (45%), Positives = 191/323 (59%), Gaps = 19/323 (5%)

Query: 32  DATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARN--KPYKLGINEF 89
           D T +     W + + R+Y  N E+E R  I+++N+  I   N +  N    + + +N F
Sbjct: 22  DQTFSAEWHQWKSTHRRLYGTN-EEEWRRAIWEKNMRMIQLHNGEYSNGQHGFSMEMNAF 80

Query: 90  ADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYENASVPASIDWRKKGAVTGVKDQGQC 149
            D TNEEFR   NGY+ +           +  +     +P S+DWR+KG VT VK+QGQC
Sbjct: 81  GDMTNEEFRQVVNGYRHQKHKKGRLFQEPLMLK-----IPKSVDWREKGCVTPVKNQGQC 135

Query: 150 GCCWAFSAVAAMEGINHITTRKLTSLSEQELVDCDTSGEDQGCEGGLMDDAFEFIISNKG 209
           G CWAFSA   +EG   + T KL SLSEQ LVDC  +  +QGC GGLMD AF++I  N G
Sbjct: 136 GSCWAFSASGCLEGQMFLKTGKLISLSEQNLVDCSHAQGNQGCNGGLMDFAFQYIKENGG 195

Query: 210 LATEAKYPYKASDGSCNKKEANPSAAKISGYEDVPSNNEAALMKAVAN-QPVSVAIDASG 268
           L +E  YPY+A DGSC K  A  + A  +G+ D+P   E ALMKAVA   P+SVA+DAS 
Sbjct: 196 LDSEESYPYEAKDGSC-KYRAEFAVANDTGFVDIPQ-QEKALMKAVATVGPISVAMDASH 253

Query: 269 SDFQFYSSGV-FTGQCGTE-LDHGVTAVGY---GTADDGTKYWLVKNSWGTTWGENGYIR 323
              QFYSSG+ +   C ++ LDHGV  VGY   GT  +  KYWLVKNSWG+ WG  GYI 
Sbjct: 254 PSLQFYSSGIYYEPNCSSKNLDHGVLLVGYGYEGTDSNKNKYWLVKNSWGSEWGMEGYIE 313

Query: 324 MQRDIDAKEGLCGIAMQASYPTA 346
           + +D D     CG+A  ASYP  
Sbjct: 314 IAKDRDNH---CGLATAASYPVV 333


>gi|357153071|ref|XP_003576329.1| PREDICTED: fruit bromelain-like [Brachypodium distachyon]
          Length = 398

 Score =  254 bits (648), Expect = 6e-65,   Method: Compositional matrix adjust.
 Identities = 150/343 (43%), Positives = 191/343 (55%), Gaps = 36/343 (10%)

Query: 31  NDATMNERHEMWMAQYGRVYRDNAEKEMRFKIFKENVEYIASFNNKARNK--PYKLGINE 88
           ND  M  R + WMA  GR Y    E   RF+++K NV YI + N +A      ++LG   
Sbjct: 54  NDLLMMGRFQGWMAAQGRSYWTAEETARRFEVYKSNVRYIEAVNAEAATTGLTFELGEGP 113

Query: 89  FADQTNEEFRAPRNGYKRRLPSVRSSETTDVSFRYE------------NASV-------- 128
           F D T+EEF A  NG    +P     E  D+    E            N +V        
Sbjct: 114 FTDLTHEEFSALYNG---SMPPPEEEEGDDIQEEDEQVIATVVDGVDVNVAVHTNLSAGG 170

Query: 129 -----PASIDWRKKGAVTGVKDQGQCGCCWAFSAVAAMEGINHITTRKLTSLSEQELVDC 183
                P S DWRK GAVT +KDQG+CG CWAF  VA +EG + I    L SLSEQ+L+DC
Sbjct: 171 PRPWPPRSRDWRKHGAVTPIKDQGRCGSCWAFPTVATIEGKHKIVRGNLVSLSEQQLIDC 230

Query: 184 DTSGEDQGCEGGLMDDAFEFIISNKGLATEAKYPYKASDGSCNKKEANPSAAKISGYEDV 243
           D +  + GC+GG +  A+ +I    GL T + YPYK + G C K+    +AA+I+G+  V
Sbjct: 231 DYT--NSGCKGGFVIRAYRWIRKIGGLTTSSAYPYKGARGKCMKRRR--AAARIAGWRSV 286

Query: 244 PSNNEAALMKAVANQPVSVAIDASGSDFQFYSSGVFTGQCGT-ELDHGVTAVGYG-TADD 301
            S +E AL+ AVA QPV+V I ASG +FQ Y  G+  G C T  L+H VT VGYG  AD 
Sbjct: 287 RSRSEVALVNAVAGQPVAVYISASGKNFQHYKKGILNGPCDTARLNHAVTVVGYGRQADT 346

Query: 302 GTKYWLVKNSWGTTWGENGYIRMQRDIDAKEGLCGIAMQASYP 344
           G KYW+VKNSWGTTWG+ GYI M+R      G CGIA    +P
Sbjct: 347 GAKYWIVKNSWGTTWGQEGYILMKRGTRNPRGQCGIATSPVFP 389


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.315    0.130    0.392 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 5,534,060,382
Number of Sequences: 23463169
Number of extensions: 232501302
Number of successful extensions: 575018
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 6476
Number of HSP's successfully gapped in prelim test: 1234
Number of HSP's that attempted gapping in prelim test: 545528
Number of HSP's gapped (non-prelim): 9020
length of query: 346
length of database: 8,064,228,071
effective HSP length: 143
effective length of query: 203
effective length of database: 9,003,962,200
effective search space: 1827804326600
effective search space used: 1827804326600
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 77 (34.3 bits)
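
Note on the statistics footer: the figures above can be cross-checked with the standard Karlin-Altschul relations. The sketch below is only an illustrative check, not part of the BLAST output. It assumes that the first "Lambda K H" block holds the ungapped parameters and the second the gapped ones, that S1 and X1 are converted with the ungapped lambda, and that S2, X2 and X3 are converted with the gapped lambda. The function and variable names are hypothetical.

```python
import math

# Values copied from the statistics footer of this report.
LAMBDA_UNGAPPED, K_UNGAPPED = 0.315, 0.130  # first "Lambda K H" block (assumed ungapped)
LAMBDA_GAPPED, K_GAPPED = 0.267, 0.0410     # second "Lambda K H" block (assumed gapped)
QUERY_LEN = 346
EFFECTIVE_HSP_LEN = 143
EFFECTIVE_DB_LEN = 9_003_962_200            # taken as reported, not recomputed


def bit_score(raw_score, lam, k):
    """Normalized score in bits: S' = (lambda * S - ln K) / ln 2."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def dropoff_bits(raw_dropoff, lam):
    """X-dropoff in bits: a score difference, so no K term is involved."""
    return lam * raw_dropoff / math.log(2)


def expect_value(bits, search_space):
    """Expected number of chance HSPs: E = search_space * 2^(-S')."""
    return search_space * 2.0 ** (-bits)


effective_query_len = QUERY_LEN - EFFECTIVE_HSP_LEN
search_space = effective_query_len * EFFECTIVE_DB_LEN

s1_bits = bit_score(41, LAMBDA_UNGAPPED, K_UNGAPPED)  # footer reports 21.6 bits
s2_bits = bit_score(77, LAMBDA_GAPPED, K_GAPPED)      # footer reports 34.3 bits
x1_bits = dropoff_bits(16, LAMBDA_UNGAPPED)           # footer reports 7.3 bits
x2_bits = dropoff_bits(38, LAMBDA_GAPPED)             # footer reports 14.6 bits
x3_bits = dropoff_bits(64, LAMBDA_GAPPED)             # footer reports 24.7 bits
e_254 = expect_value(254, search_space)               # hits at 254 bits report 5e-65 to 6e-65

if __name__ == "__main__":
    print("effective query length:", effective_query_len)
    print("effective search space:", search_space)
    print("S1/S2 bits: %.1f / %.1f" % (s1_bits, s2_bits))
    print("X1/X2/X3 bits: %.1f / %.1f / %.1f" % (x1_bits, x2_bits, x3_bits))
    print("E at 254 bits: %.1e" % e_254)
```

Because the reported bit scores are rounded to whole bits, an E-value recomputed this way should agree with the per-hit "Expect" lines only to about one significant digit.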