BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 019063
         (346 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|255564910|ref|XP_002523448.1| cysteine protease, putative [Ricinus communis]
 gi|223537276|gb|EEF38907.1| cysteine protease, putative [Ricinus communis]
          Length = 341

 Score =  401 bits (1031), Expect = e-109,   Method: Compositional matrix adjust.
 Identities = 197/339 (58%), Positives = 243/339 (71%), Gaps = 13/339 (3%)

Query: 13  MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEY 72
           MFV +++V    SQ  S RS+H+ ++ E+HE WM ++GR YKD  EK  R  IF+ N+E+
Sbjct: 10  MFVALLVVGLWVSQAWS-RSLHDAAMNERHEMWMVKYGRVYKDNSEKERRFEIFRNNVEF 68

Query: 73  IEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVP 132
           IE  NK GNR YKL  NEF+DLTNEEF+A   GY R   S +   S  S+F+Y NVT VP
Sbjct: 69  IESFNKPGNRPYKLDINEFADLTNEEFKASRNGYKR---SSNVGLSEKSSFRYGNVTAVP 125

Query: 133 TSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST--DN 190
           TS+DWR+KGAVT IKDQGQCG CWAFSAVAA+EGIT+++ GKLI LSEQ+LVDC T  ++
Sbjct: 126 TSMDWRQKGAVTPIKDQGQCGCCWAFSAVAAMEGITKLSTGKLISLSEQELVDCDTSGED 185

Query: 191 HGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQ 250
            GC GGLMD AFE+I +N GL TEA+YPY+  +GTC+  K    AA I+ YED+P   E 
Sbjct: 186 QGCEGGLMDDAFEFIKQNGGLTTEANYPYQGTDGTCNTNKAGNDAAKITGYEDVPANSED 245

Query: 251 ALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWL 310
           ALL+AV++QPVSV +DASG AF FY  GV   DCG   DHGV  VG+GT++   G KYWL
Sbjct: 246 ALLKAVASQPVSVAIDASGSAFQFYSGGVFTGDCGTELDHGVTAVGYGTSD---GTKYWL 302

Query: 311 IKNSWGETWGESGYIRILRD----AGLCGIATAASYPVA 345
           +KNSWG +WGE GYIR+ RD     GLCGIA  +SYP A
Sbjct: 303 VKNSWGTSWGEDGYIRMERDIEAKEGLCGIAMQSSYPTA 341


>gi|255564908|ref|XP_002523447.1| cysteine protease, putative [Ricinus communis]
 gi|223537275|gb|EEF38906.1| cysteine protease, putative [Ricinus communis]
          Length = 342

 Score =  399 bits (1026), Expect = e-109,   Method: Compositional matrix adjust.
 Identities = 196/339 (57%), Positives = 242/339 (71%), Gaps = 12/339 (3%)

Query: 13  MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEY 72
           MFV +++V   ASQ  S RS+H+ ++ E+HE WMA++GR YKD  EK  R  IF+ N+E+
Sbjct: 10  MFVALLVVGLWASQAWS-RSLHDAAMNERHEMWMAKYGRVYKDNSEKERRFEIFRNNVEF 68

Query: 73  IEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVP 132
           IE  NK GNR YKL  NEF+DLTNEEF+    GY R   S     +  S+F+Y NVT VP
Sbjct: 69  IESFNKLGNRPYKLDINEFADLTNEEFKVSKNGYKR---SSGVGLTEKSSFRYANVTAVP 125

Query: 133 TSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST--DN 190
           TS+DWR+ GAVT IKDQGQCG CWAFSAVAA+EGIT+++ GKLI LSEQ+LVDC T  ++
Sbjct: 126 TSMDWRQNGAVTPIKDQGQCGCCWAFSAVAAMEGITKLSTGKLISLSEQELVDCDTSGED 185

Query: 191 HGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQ 250
            GC GGLMD AFE+I +N GL TEA+YPY+  +GTC+  K    AA I+ YED+P   E 
Sbjct: 186 QGCEGGLMDDAFEFIKQNGGLTTEANYPYQGTDGTCNTNKAGNDAAKITGYEDVPANSED 245

Query: 251 ALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWL 310
           ALL+AV++QPVSV +DASG AF FY  GV   DCG   DHGV  VG+GT+++  G KYWL
Sbjct: 246 ALLKAVASQPVSVAIDASGSAFQFYSGGVFTGDCGTELDHGVTAVGYGTSDD--GTKYWL 303

Query: 311 IKNSWGETWGESGYIRILRD----AGLCGIATAASYPVA 345
           +KNSWG +WGE GYIR+ RD     GLCGIA   SYP A
Sbjct: 304 VKNSWGTSWGEDGYIRMERDIEAKEGLCGIAMQPSYPTA 342


>gi|255563110|ref|XP_002522559.1| cysteine protease, putative [Ricinus communis]
 gi|223538250|gb|EEF39859.1| cysteine protease, putative [Ricinus communis]
          Length = 344

 Score =  396 bits (1018), Expect = e-108,   Method: Compositional matrix adjust.
 Identities = 193/339 (56%), Positives = 242/339 (71%), Gaps = 10/339 (2%)

Query: 13  MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEY 72
           + + ++LV   ASQ  S RS+HE S+  +H+ WM Q+GR YK  +EK  R  IFK+N+E+
Sbjct: 10  VLMAMLLVTLWASQSWS-RSLHEASMELRHKTWMTQYGRVYKGNVEKEKRFKIFKENVEF 68

Query: 73  IEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVP 132
           IE  N  GN+ YKLG N F+DLTNEEFRA + GY   + S  + S R  +F+Y+NVT VP
Sbjct: 69  IESFNNNGNKPYKLGINAFTDLTNEEFRASHNGYTMSMSS-HQSSYRTKSFRYENVTAVP 127

Query: 133 TSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD--N 190
            S+DWR KGAVTHIKDQGQCG CWAFSAVAA+EGIT+++ G LI LSEQ+LVDC T   +
Sbjct: 128 PSLDWRTKGAVTHIKDQGQCGCCWAFSAVAAMEGITKLSTGTLISLSEQELVDCDTSGMD 187

Query: 191 HGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQ 250
            GC GGLMD AFE+IIEN GL TEA+YPY   +G+C+ +K    AA I+ YE++P  DE+
Sbjct: 188 QGCEGGLMDDAFEFIIENNGLTTEANYPYEGVDGSCNTRKAANHAAKITGYENVPAYDEE 247

Query: 251 ALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWL 310
           AL +AV+NQPVSV +DA   AF  Y SG+   DCG   DHGV VVG+GT+++  G KYWL
Sbjct: 248 ALRKAVANQPVSVAIDAGESAFQHYSSGIFTGDCGTELDHGVTVVGYGTSDD--GTKYWL 305

Query: 311 IKNSWGETWGESGYIRILRD----AGLCGIATAASYPVA 345
           +KNSWG +WGE GYIR+ RD     GLCGIA   SYP A
Sbjct: 306 VKNSWGTSWGEDGYIRMERDIDAKEGLCGIAMEPSYPTA 344


>gi|144905116|dbj|BAF56430.1| cysteine proteinase [Lotus japonicus]
          Length = 341

 Score =  390 bits (1001), Expect = e-106,   Method: Compositional matrix adjust.
 Identities = 187/337 (55%), Positives = 237/337 (70%), Gaps = 13/337 (3%)

Query: 15  VIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIE 74
           + ++LV    S   + R++ + S+ E+HEQWMAQ+G+ YKD  EK +R  IFK+N++ IE
Sbjct: 12  LTLLLVFGFLSFEANARTLEDASMHERHEQWMAQYGKVYKDSYEKELRSKIFKENVQRIE 71

Query: 75  KANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTS 134
             N  GN++YKLG N+F+DLTNEEF+A     NR    +   S+R  TFKY++VT VP S
Sbjct: 72  AFNNAGNKSYKLGINQFADLTNEEFKA----RNRFKGHMCSNSTRTPTFKYEHVTSVPAS 127

Query: 135 IDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD--NHG 192
           +DWR+KGAVT IKDQGQCG CWAFSAVAA EGIT+++ GKLI LSEQ+LVDC T   + G
Sbjct: 128 LDWRQKGAVTPIKDQGQCGCCWAFSAVAATEGITKLSTGKLISLSEQELVDCDTKGVDQG 187

Query: 193 CSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQAL 252
           C GGLMD AF++I++NKGL TEA YPY+  + TC+   E   AA+I  +ED+P   E AL
Sbjct: 188 CEGGLMDDAFKFIMQNKGLNTEAKYPYQGVDATCNANAEAKDAASIKGFEDVPANSESAL 247

Query: 253 LQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIK 312
           L+AV+NQP+SV +DASG  F FY SGV    CG   DHGV  VG+G+   + G KYWL+K
Sbjct: 248 LKAVANQPISVAIDASGSEFQFYSSGVFTGSCGTELDHGVTAVGYGS---DGGTKYWLVK 304

Query: 313 NSWGETWGESGYIRILRDA----GLCGIATAASYPVA 345
           NSWGE WGE GYIR+ RD     GLCG A  ASYP A
Sbjct: 305 NSWGEQWGEQGYIRMQRDVAAEEGLCGFAMQASYPTA 341


>gi|50355621|dbj|BAD29959.1| cysteine protease [Daucus carota]
          Length = 361

 Score =  389 bits (998), Expect = e-105,   Method: Compositional matrix adjust.
 Identities = 186/339 (54%), Positives = 245/339 (72%), Gaps = 12/339 (3%)

Query: 13  MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEY 72
           M   +IL+   A Q  S R++ E S+ E+HEQWM Q+GR YKDE EK++R  IF  N+++
Sbjct: 29  MIAALILLGAWACQATS-RTLPEASMFERHEQWMIQYGRVYKDEAEKSVRFQIFMDNVKF 87

Query: 73  IEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVP 132
           IE+ NK+G ++YKL  NEF+D TNEEF+A   GY     +VS + S+ + F+Y+NVT VP
Sbjct: 88  IEEFNKDGRQSYKLAVNEFADQTNEEFQASRNGYKM---AVSSRPSQTTLFRYENVTAVP 144

Query: 133 TSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDC--STDN 190
           +S+DWR+KGAVT +KDQGQCGSCWAFS +AA EGIT++  GKLI LSEQ+LVDC  + ++
Sbjct: 145 SSMDWRKKGAVTPVKDQGQCGSCWAFSTIAATEGITKLKTGKLISLSEQELVDCDKTGED 204

Query: 191 HGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQ 250
            GC GG M+  FE+I++NKG+A EA YPY   +GTC++++E + AA IS YE +P   E 
Sbjct: 205 QGCEGGYMEDGFEFIVKNKGIALEASYPYTAADGTCNSKEEASRAAKISGYEKVPANSET 264

Query: 251 ALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWL 310
           ALL+AV+NQPVSV +DASG AF FY SGV   +CG + DHGV  VG+G  +  +G KYWL
Sbjct: 265 ALLKAVANQPVSVSIDASGVAFQFYSSGVFTGECGTDLDHGVTAVGYG--KTSDGTKYWL 322

Query: 311 IKNSWGETWGESGYIRILR----DAGLCGIATAASYPVA 345
           +KNSWG +WG+SGYI + R      GLCGIA  ASYP A
Sbjct: 323 VKNSWGASWGDSGYIMMQRGVAAKGGLCGIAMDASYPTA 361


>gi|224114698|ref|XP_002316833.1| predicted protein [Populus trichocarpa]
 gi|222859898|gb|EEE97445.1| predicted protein [Populus trichocarpa]
          Length = 305

 Score =  387 bits (995), Expect = e-105,   Method: Compositional matrix adjust.
 Identities = 187/313 (59%), Positives = 235/313 (75%), Gaps = 14/313 (4%)

Query: 39  VEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEE 98
           +E+HE WMAQ+GR YK  +EK  RLNIFK N+E+IE  NK G + YKL  NEF+DLTNEE
Sbjct: 1   MERHETWMAQYGRAYKGHVEKERRLNIFKNNVEFIESFNKVGKKPYKLSVNEFADLTNEE 60

Query: 99  FRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAF 158
           F+A   GY +    +S  S++P  F+Y+NV+ VP+++DWR+KGAVT IKDQGQCG CWAF
Sbjct: 61  FQASRNGY-KMSAHLSSSSTKP--FRYENVSAVPSTMDWRKKGAVTPIKDQGQCGCCWAF 117

Query: 159 SAVAAVEGITQITRGKLIELSEQQLVDCST--DNHGCSGGLMDKAFEYIIENKGLATEAD 216
           SAVAA EGITQ++ GKLI LSEQ+LVDC T  ++ GC+GGLMD AF++II+NKGL TEA+
Sbjct: 118 SAVAATEGITQLSTGKLISLSEQELVDCDTSGEDQGCNGGLMDDAFDFIIQNKGLTTEAN 177

Query: 217 YPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYK 276
           YPY+  +G C++ K    AA I+ YED+P   E ALL+AV+NQPVSV +DA G AF FY 
Sbjct: 178 YPYQGADGACNSGK---AAAKITGYEDVPANSEAALLKAVANQPVSVAIDAGGSAFQFYS 234

Query: 277 SGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD----AG 332
           SGV   DCG + DHGV  VG+G +++  G KYWL+KNSWG +WGE+GYIR+ RD     G
Sbjct: 235 SGVFTGDCGTDLDHGVTAVGYGMSDD--GTKYWLVKNSWGTSWGENGYIRMERDIDAQEG 292

Query: 333 LCGIATAASYPVA 345
           LCGIA  ASYP A
Sbjct: 293 LCGIAMEASYPTA 305


>gi|144905108|dbj|BAF56428.1| cysteine proteinase [Lotus japonicus]
          Length = 342

 Score =  387 bits (993), Expect = e-105,   Method: Compositional matrix adjust.
 Identities = 184/337 (54%), Positives = 237/337 (70%), Gaps = 12/337 (3%)

Query: 15  VIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIE 74
           + ++LV    +   + R++ + S+ E+HEQWM Q+G+ Y D  EK +R NIFK+N++ IE
Sbjct: 12  LALLLVFGFLAFEANARTLEDVSLKERHEQWMTQYGKVYTDSYEKELRSNIFKENVQRIE 71

Query: 75  KANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTS 134
             N  GN+ YKLG N+F+DLTNEEF+A     NR    +   S+R  TFKY++V+ VP S
Sbjct: 72  AFNNAGNKPYKLGINQFADLTNEEFKA----RNRFKGHMCSNSTRTPTFKYEDVSSVPAS 127

Query: 135 IDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD--NHG 192
           +DWR+KGAVT IKDQGQCG CWAFSAVAA EGIT+++ GKLI LSEQ+LVDC T   + G
Sbjct: 128 LDWRQKGAVTPIKDQGQCGCCWAFSAVAATEGITKLSTGKLISLSEQELVDCDTKGVDQG 187

Query: 193 CSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQAL 252
           C GGLMD AF++I++NKGL TEA YPY+  + TC+   E   AA+I  +ED+P   E AL
Sbjct: 188 CEGGLMDDAFKFIMQNKGLNTEAKYPYQGVDATCNANAEAKDAASIKGFEDVPANSESAL 247

Query: 253 LQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIK 312
           L+AV+NQP+SV +DASG  F FY SG+    CG   DHGV  VG+G +++  G KYWL+K
Sbjct: 248 LKAVANQPISVAIDASGSEFQFYSSGLFTGSCGTELDHGVTAVGYGVSDD--GTKYWLVK 305

Query: 313 NSWGETWGESGYIRILRDA----GLCGIATAASYPVA 345
           NSWGE WGE GYIR+ RD     GLCGIA  ASYP A
Sbjct: 306 NSWGEQWGEEGYIRMQRDVAAEEGLCGIAMQASYPTA 342


>gi|40806498|gb|AAR92154.1| putative cysteine protease 1 [Iris x hollandica]
          Length = 340

 Score =  386 bits (992), Expect = e-105,   Method: Compositional matrix adjust.
 Identities = 192/348 (55%), Positives = 247/348 (70%), Gaps = 18/348 (5%)

Query: 5   FEKSFIIPMFVIIILVITCASQVVSGRSMHE-PSIVEKHEQWMAQHGRTYKDELEKAMRL 63
           F+   ++P   ++I+ I  ASQ  +GRS+ E  S++E+HEQWMAQHGR YK+  EKA R 
Sbjct: 4   FKTVKLLPALALLIVAI-WASQGEAGRSLGENKSMLERHEQWMAQHGRVYKNAAEKAHRF 62

Query: 64  NIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTF 123
            IF+ N+E IE  N E N  +KLG N+F+DLTNEEF+       R     S+ +S  S F
Sbjct: 63  EIFRANVERIESFNAE-NHKFKLGVNQFADLTNEEFK------TRNTLKPSKMASTKS-F 114

Query: 124 KYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQL 183
           KY+NVT VP ++DWR KGAVT IKDQGQCGSCWAFSAVAA EGIT+++ GKLI LSEQ++
Sbjct: 115 KYENVTAVPATMDWRTKGAVTPIKDQGQCGSCWAFSAVAATEGITKLSTGKLISLSEQEV 174

Query: 184 VDC--STDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKY 241
           VDC  ++D+ GC+GG MD AFEYII+NKG+ TEA+YPY+  +GTC+ +K  + AA+I+ Y
Sbjct: 175 VDCDVTSDDQGCNGGEMDDAFEYIIKNKGITTEANYPYKAADGTCNTKKAASHAASITGY 234

Query: 242 EDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAE 301
           ED+    E ALL+A +NQP++V +DA   AF  Y SGV   DCG + DHGV +VG+G   
Sbjct: 235 EDVTVNSEAALLKAAANQPIAVAIDAGDFAFQMYSSGVFTGDCGTDLDHGVTLVGYGATS 294

Query: 302 EENGAKYWLIKNSWGETWGESGYIRILRDA----GLCGIATAASYPVA 345
           +  G KYWL+KNSWG +WGE GYIR+ RD     GLCGIA  ASYP A
Sbjct: 295 D--GTKYWLVKNSWGTSWGEDGYIRMERDVDAKEGLCGIAMDASYPTA 340


>gi|225446581|ref|XP_002280246.1| PREDICTED: vignain [Vitis vinifera]
          Length = 341

 Score =  385 bits (990), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 187/337 (55%), Positives = 241/337 (71%), Gaps = 13/337 (3%)

Query: 15  VIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIE 74
           + ++ V+   +   + RS+HE S+ E+HE WM Q+GR YKD  EK+ R  IFK N+  IE
Sbjct: 12  LALLFVLAAWASQATARSLHEASMYERHEDWMVQYGREYKDADEKSKRYKIFKDNVARIE 71

Query: 75  KANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTS 134
             NK  +++YKL  NEF+DLTNEEFRA     NR    +   S+  ++FKY+NVT VP++
Sbjct: 72  SFNKAMDKSYKLSINEFADLTNEEFRA---SRNRFKAHIC--STEATSFKYENVTAVPST 126

Query: 135 IDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST--DNHG 192
           +DWR+KGAVT IKDQGQCGSCWAFSAVAA+EGITQ++ GKLI LSEQ+LVDC T  ++ G
Sbjct: 127 VDWRKKGAVTPIKDQGQCGSCWAFSAVAAMEGITQLSTGKLISLSEQELVDCDTSGEDQG 186

Query: 193 CSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQAL 252
           CSGGLMD AF++I +N GL TEA+YPY   +GTC+ +K    AA I+ YED+P  +E+AL
Sbjct: 187 CSGGLMDDAFKFIEQNHGLTTEANYPYAGTDGTCNRKKAAHPAAKINGYEDVPANNEKAL 246

Query: 253 LQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIK 312
            +AV++QP++V +DASG  F FY SGV    CG   DHGVA VG+GT+++  G KYWL+K
Sbjct: 247 QKAVAHQPIAVAIDASGSEFQFYSSGVFTGQCGTELDHGVAAVGYGTSDD--GMKYWLVK 304

Query: 313 NSWGETWGESGYIRILRDA----GLCGIATAASYPVA 345
           NSW   WGE GYIR+ RD     GLCGIA  ASYP A
Sbjct: 305 NSWSTGWGEEGYIRMQRDVTAKEGLCGIAMQASYPTA 341


>gi|47524507|gb|AAT34987.1| putative cysteine protease [Gossypium hirsutum]
          Length = 344

 Score =  385 bits (988), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 189/349 (54%), Positives = 253/349 (72%), Gaps = 20/349 (5%)

Query: 10  IIPMFVIIILVIT-CASQVVSGRS---MHEPSIVEKHEQWMAQHGRTYKDELE--KAMRL 63
           ++ +F+ + LV++ C S  ++G S   + E S+  +HE+WM+QHGR Y DE E  K  R 
Sbjct: 3   LLQIFLFVALVLSFCFSIQLAGLSRPLLDEDSM--RHEEWMSQHGRVYADEQEDHKNKRF 60

Query: 64  NIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTF 123
           N+FK+N+E IE+ N    +T+KL  N+F+DLTNEEFRA Y G+  P+  +S Q ++P+ F
Sbjct: 61  NVFKENVERIEEFND--GKTFKLAINQFADLTNEEFRASYNGFKGPM-VLSSQITKPTPF 117

Query: 124 KYQNVTD-VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQ 182
           +Y+NV+  +P S+DWR+KGAVT +K+QGQCG CWAFSAVAA+EGITQI+ GKLI LSEQ+
Sbjct: 118 RYENVSSALPVSVDWRKKGAVTPVKNQGQCGCCWAFSAVAAIEGITQISTGKLISLSEQE 177

Query: 183 LVDCSTD--NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISK 240
           LVDC T   +HGC GGLMD AFE+II N GL TE++YPY+ E+GTC+  K   +A +I+ 
Sbjct: 178 LVDCDTKGIDHGCEGGLMDTAFEFIINNGGLTTESNYPYKGEDGTCNFNKTNPIAVSITG 237

Query: 241 YEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTA 300
           YED+P  DEQAL++AV++QPVSV ++A G  F FY SGV   +CG   DH V  VG+G  
Sbjct: 238 YEDVPANDEQALMKAVAHQPVSVAIEAGGSDFQFYSSGVFTGECGTELDHAVTAVGYG-- 295

Query: 301 EEENGAKYWLIKNSWGETWGESGYIRILRD----AGLCGIATAASYPVA 345
           E E+G+KYW++KNSWG  WGESGYI + +D     GLCGIA  ASYP A
Sbjct: 296 ESEDGSKYWIVKNSWGTKWGESGYIEMQKDIKVKQGLCGIAMQASYPTA 344


>gi|224135841|ref|XP_002327317.1| predicted protein [Populus trichocarpa]
 gi|222835687|gb|EEE74122.1| predicted protein [Populus trichocarpa]
          Length = 342

 Score =  385 bits (988), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 181/338 (53%), Positives = 233/338 (68%), Gaps = 11/338 (3%)

Query: 14  FVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYI 73
           F   IL++   +  V+ R + EPS+  +HEQWM   G+ Y D  EK  R  IFK N+EYI
Sbjct: 10  FFAFILILGMWAYEVASRELQEPSMSARHEQWMETFGKVYADAAEKERRFEIFKDNVEYI 69

Query: 74  EKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPT 133
           E  N  GN+ YKL  N+F+DLTNEE +    GY RP+ +   +  + ++FKY+NVT VP 
Sbjct: 70  ESFNTAGNKPYKLSVNKFADLTNEELKVARNGYRRPLQT---RPMKVTSFKYENVTAVPA 126

Query: 134 SIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST--DNH 191
           ++DWR+KGAVT IKDQGQCGSCWAFS VAA EGI Q+T GKL+ LSEQ+LVDC T  ++ 
Sbjct: 127 TMDWRKKGAVTPIKDQGQCGSCWAFSTVAATEGINQLTTGKLVSLSEQELVDCDTQGEDQ 186

Query: 192 GCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQA 251
           GC GGLM+  FE+II+N G+ TEA+YPY+  +GTC+++KE +  A I+ YE +P   E A
Sbjct: 187 GCEGGLMEDGFEFIIKNHGITTEANYPYQAADGTCNSKKEASRIAKITGYESVPANSEAA 246

Query: 252 LLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLI 311
           LL+AV++QP+SV +DA G  F FY SGV    CG   DHGV  VG+G  E  +G KYWL+
Sbjct: 247 LLKAVASQPISVSIDAGGSDFQFYSSGVFTGQCGTELDHGVTAVGYG--ETSDGTKYWLV 304

Query: 312 KNSWGETWGESGYIRILRDA----GLCGIATAASYPVA 345
           KNSWG +WGE GYIR+ RD     GLCGIA  +SYP A
Sbjct: 305 KNSWGTSWGEEGYIRMQRDTEAEEGLCGIAMDSSYPTA 342


>gi|356515048|ref|XP_003526213.1| PREDICTED: vignain-like [Glycine max]
          Length = 350

 Score =  384 bits (986), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 184/337 (54%), Positives = 235/337 (69%), Gaps = 12/337 (3%)

Query: 13  MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEY 72
           +  +++L+  C SQV+S R++HE S+ E+HEQWM ++G+ YKD  EK  RL IFK N+E+
Sbjct: 10  ILALVLLLSICTSQVMS-RNLHEASMSERHEQWMKKYGKVYKDAAEKQKRLLIFKDNVEF 68

Query: 73  IEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVP 132
           IE  N  GN+ YKL  N  +D TNEEF A + GY        + S   + FKY NVTD+P
Sbjct: 69  IESFNAAGNKPYKLSINHLADQTNEEFVASHNGYKY------KGSHSQTPFKYGNVTDIP 122

Query: 133 TSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTDNHG 192
           T++DWR+ GAVT +KDQGQCGSCWAFS VAA EGI QI+ G L+ LSEQ+LVDC + +HG
Sbjct: 123 TAVDWRQNGAVTAVKDQGQCGSCWAFSTVAATEGIYQISTGMLMSLSEQELVDCDSVDHG 182

Query: 193 CSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQAL 252
           C GGLM+  FE+II+N G+++EA+YPY   +GTCD  KE + AA I  YE +P   E+AL
Sbjct: 183 CDGGLMEDGFEFIIKNGGISSEANYPYTAVDGTCDASKEASPAAQIKGYETVPANSEEAL 242

Query: 253 LQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIK 312
            QAV+NQPVSV +DA G  F FY SGV    CG   DHGV VVG+GT ++    +YW++K
Sbjct: 243 QQAVANQPVSVSIDAGGSGFQFYSSGVFTGQCGTQLDHGVTVVGYGTTDDGT-HEYWIVK 301

Query: 313 NSWGETWGESGYIRILR--DA--GLCGIATAASYPVA 345
           NSWG  WGE GYIR+ R  DA  GLCGIA  ASYP+ 
Sbjct: 302 NSWGTQWGEEGYIRMQRGIDAQEGLCGIAMDASYPMG 338


>gi|147839728|emb|CAN70559.1| hypothetical protein VITISV_032465 [Vitis vinifera]
          Length = 341

 Score =  384 bits (985), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 186/337 (55%), Positives = 240/337 (71%), Gaps = 13/337 (3%)

Query: 15  VIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIE 74
           + ++ V+   +   + R +HE S+ E+HE WM Q+GR YKD  EK+ R  IFK N+  IE
Sbjct: 12  LALLFVLAAWASQATARXLHEASMYERHEDWMVQYGREYKDADEKSKRYKIFKDNVARIE 71

Query: 75  KANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTS 134
             NK  +++YKL  NEF+DLTNEEFRA     NR    +   S+  ++FKY+NVT VP++
Sbjct: 72  SFNKAMDKSYKLSINEFADLTNEEFRA---SRNRFKAHIC--STEATSFKYENVTAVPST 126

Query: 135 IDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST--DNHG 192
           +DWR+KGAVT IKDQGQCGSCWAFSAVAA+EGITQ++ GKLI LSEQ+LVDC T  ++ G
Sbjct: 127 VDWRKKGAVTPIKDQGQCGSCWAFSAVAAMEGITQLSTGKLISLSEQELVDCDTSGEDQG 186

Query: 193 CSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQAL 252
           CSGGLMD AF++I +N GL TEA+YPY   +GTC+ +K    AA I+ YED+P  +E+AL
Sbjct: 187 CSGGLMDDAFKFIEQNHGLTTEANYPYAGTDGTCNRKKAAHPAAKINGYEDVPANNEKAL 246

Query: 253 LQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIK 312
            +AV++QP++V +DASG  F FY SGV    CG   DHGVA VG+GT+++  G KYWL+K
Sbjct: 247 QKAVAHQPIAVAIDASGSEFQFYSSGVFTGQCGTELDHGVAAVGYGTSDD--GMKYWLVK 304

Query: 313 NSWGETWGESGYIRILRDA----GLCGIATAASYPVA 345
           NSW   WGE GYIR+ RD     GLCGIA  ASYP A
Sbjct: 305 NSWSTGWGEEGYIRMQRDVTVKEGLCGIAMQASYPTA 341


>gi|225446583|ref|XP_002280204.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1 [Vitis vinifera]
          Length = 341

 Score =  383 bits (984), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 185/337 (54%), Positives = 241/337 (71%), Gaps = 13/337 (3%)

Query: 15  VIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIE 74
           + ++ V+   +   + R++HE S+ E+HE WM Q+GR YKD  EK+ R  IFK N+  IE
Sbjct: 12  LALLFVLAAWASQATARNLHEASMYERHEDWMVQYGREYKDADEKSKRYKIFKDNVARIE 71

Query: 75  KANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTS 134
             NK  +++YKL  NEF+DLTNEEFRA     NR    +   S+  ++FKY+NVT VP++
Sbjct: 72  SFNKAMDKSYKLSINEFADLTNEEFRA---SRNRFKAHIC--STEATSFKYENVTAVPST 126

Query: 135 IDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST--DNHG 192
           +DWR+KGAVT IKDQGQCGSCWAFSAVAA+EGITQ++ GKLI LSEQ+LVDC T  ++ G
Sbjct: 127 VDWRKKGAVTPIKDQGQCGSCWAFSAVAAMEGITQLSTGKLISLSEQELVDCDTSGEDQG 186

Query: 193 CSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQAL 252
           CSGGLMD AF++I +N GL TEA+YPY   +GTC+ +K    AA I+ YED+P  +E+AL
Sbjct: 187 CSGGLMDDAFKFIEQNHGLTTEANYPYAGTDGTCNRKKAAHPAAKINGYEDVPANNEKAL 246

Query: 253 LQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIK 312
            +AV++QP++V +DA G  F FY SGV    CG   DHGV+ VG+GT+++  G KYWL+K
Sbjct: 247 QKAVAHQPIAVAIDAGGSEFQFYSSGVFTGQCGTELDHGVSAVGYGTSDD--GMKYWLVK 304

Query: 313 NSWGETWGESGYIRILRDA----GLCGIATAASYPVA 345
           NSWG  WGE GYIR+ RD     GLCGIA  ASYP A
Sbjct: 305 NSWGTGWGEEGYIRMQRDVTAKEGLCGIAMQASYPTA 341


>gi|356543116|ref|XP_003540009.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like [Glycine
           max]
          Length = 337

 Score =  382 bits (981), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 184/337 (54%), Positives = 232/337 (68%), Gaps = 13/337 (3%)

Query: 13  MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEY 72
           +  +++L+  C SQV+S R +HE S+ E+HEQWM ++G+ YKD  EK  RL IFK N+E+
Sbjct: 10  ILALVLLLSICTSQVMS-RYLHEASMSERHEQWMKKYGKVYKDAAEKQKRLLIFKDNVEF 68

Query: 73  IEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVP 132
           IE  N  GN+ YKLG N  +D TNEEF A + GY        + S   + FKY+NVT VP
Sbjct: 69  IESFNAAGNKPYKLGINHLADQTNEEFVASHNGYKH------KASHSQTPFKYENVTGVP 122

Query: 133 TSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTDNHG 192
            ++DWRE GAVT +KDQGQCGSCWAFS VAA EGI QIT   L+ LSEQ+LVDC + +HG
Sbjct: 123 NAVDWRENGAVTAVKDQGQCGSCWAFSTVAATEGIYQITTSMLMSLSEQELVDCDSVDHG 182

Query: 193 CSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQAL 252
           C GG M+  FE+II+N G+++EA+YPY   +GTCD  KE + AA I  YE +P   E AL
Sbjct: 183 CDGGYMEGGFEFIIKNGGISSEANYPYTAVDGTCDANKEASPAAQIKGYETVPANSEDAL 242

Query: 253 LQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIK 312
            +AV+NQPVSV +DA G AF FY SGV    CG   DHGV  VG+G+ ++  G +YW++K
Sbjct: 243 QKAVANQPVSVTIDAGGSAFQFYSSGVFTGQCGTQLDHGVTAVGYGSTDD--GTQYWIVK 300

Query: 313 NSWGETWGESGYIRILR--DA--GLCGIATAASYPVA 345
           NSWG  WGE GYIR+ R  DA  GLCGIA  ASYP A
Sbjct: 301 NSWGTQWGEEGYIRMQRGTDAQEGLCGIAMDASYPTA 337


>gi|356543124|ref|XP_003540013.1| PREDICTED: vignain-like [Glycine max]
 gi|356543126|ref|XP_003540014.1| PREDICTED: vignain-like [Glycine max]
          Length = 337

 Score =  382 bits (980), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 184/337 (54%), Positives = 232/337 (68%), Gaps = 13/337 (3%)

Query: 13  MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEY 72
           +  +++L+  C SQV+S R++HE S+ E+HEQWM ++G+ YKD  EK  RL IFK N+E+
Sbjct: 10  ILALVLLLSICTSQVMS-RNLHEASMSERHEQWMKKYGKVYKDAAEKQKRLLIFKDNVEF 68

Query: 73  IEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVP 132
           IE  N  GNR YKL  N  +D TNEEF A + GY        + S   + FKY+NVT VP
Sbjct: 69  IESFNAAGNRPYKLSINHLADQTNEEFVASHNGYKH------KGSHSQTPFKYENVTGVP 122

Query: 133 TSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTDNHG 192
            ++DWRE GAVT +KDQGQCGSCWAFS VAA EGI QIT   L+ LSEQ+LVDC + +HG
Sbjct: 123 NAVDWRENGAVTAVKDQGQCGSCWAFSTVAATEGIYQITTSMLMSLSEQELVDCDSVDHG 182

Query: 193 CSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQAL 252
           C GG M+  FE+II+N G+++EA+YPY   +GTCD  KE + AA I  YE +P   E AL
Sbjct: 183 CDGGYMEGGFEFIIKNGGISSEANYPYTAVDGTCDANKEASPAAQIKGYETVPANSEDAL 242

Query: 253 LQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIK 312
            +AV+NQPVSV +DA G AF FY SGV    CG   DHGV  VG+G+ ++  G +YW++K
Sbjct: 243 QKAVANQPVSVTIDAGGSAFQFYSSGVFTGQCGTQLDHGVTAVGYGSTDD--GTQYWIVK 300

Query: 313 NSWGETWGESGYIRILR--DA--GLCGIATAASYPVA 345
           NSWG  WGE GYIR+ R  DA  GLCGIA  ASYP A
Sbjct: 301 NSWGTQWGEEGYIRMQRGTDAQEGLCGIAMDASYPTA 337


>gi|359485281|ref|XP_002280230.2| PREDICTED: LOW QUALITY PROTEIN: KDEL-tailed cysteine endopeptidase
           CEP1 [Vitis vinifera]
          Length = 341

 Score =  381 bits (978), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 186/337 (55%), Positives = 240/337 (71%), Gaps = 13/337 (3%)

Query: 15  VIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIE 74
           + ++ V+   +   + R++HE S+ E+HE WMAQ+GR YKD  EK+ R  IFK N+  IE
Sbjct: 12  LALLFVLAAWASQATARNLHEASMYERHEDWMAQYGRVYKDADEKSKRYKIFKDNVARIE 71

Query: 75  KANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTS 134
             NK  +++YKL  NEF+DLTNEEF    T  NR    +   S+  ++FKY+NVT VP++
Sbjct: 72  SFNKAMDKSYKLSINEFADLTNEEFG---TSRNRFKAHIC--STEATSFKYENVTAVPST 126

Query: 135 IDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST--DNHG 192
           IDWR+KGAVT IKDQGQCGSCWAFSAVAA+EGITQ++ GKLI LSEQ+LVDC T  ++ G
Sbjct: 127 IDWRKKGAVTPIKDQGQCGSCWAFSAVAAMEGITQLSTGKLISLSEQELVDCDTSGEDQG 186

Query: 193 CSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQAL 252
           C+GGLMD AF++I +N GL TEA+YPY   +GTC+ +K    AA I+ YED+P  +E+AL
Sbjct: 187 CNGGLMDDAFKFIKQNHGLTTEANYPYAGTDGTCNRKKAAHPAAKINGYEDVPANNEKAL 246

Query: 253 LQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIK 312
            +AV +QP++V +DA G  F FY SGV    CG   DHGVA VG+GT+++  G KYWL+K
Sbjct: 247 QKAVVHQPIAVAIDAGGFEFQFYSSGVFTGQCGTELDHGVAAVGYGTSDD--GMKYWLVK 304

Query: 313 NSWGETWGESGYIRILRDA----GLCGIATAASYPVA 345
           NSWG  WGE GYIR+ RD     GLCGIA  ASYP A
Sbjct: 305 NSWGTGWGEEGYIRMQRDVTAKEGLCGIAMQASYPTA 341


>gi|147788834|emb|CAN64655.1| hypothetical protein VITISV_005140 [Vitis vinifera]
          Length = 341

 Score =  381 bits (978), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 185/337 (54%), Positives = 240/337 (71%), Gaps = 13/337 (3%)

Query: 15  VIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIE 74
           + ++ V+   +     R++HE S+ E+HE WMAQ+GR YKD  EK+ R  IFK N+  IE
Sbjct: 12  LALLFVLAAWASHAKARNLHEASMYERHEDWMAQYGRVYKDAGEKSKRYKIFKDNVARIE 71

Query: 75  KANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTS 134
             NK  N++YKL  NEF+DLTNEEFRA     NR    +   S+  ++FKY++V  VP++
Sbjct: 72  SFNKAMNKSYKLSINEFADLTNEEFRA---SRNRFKAHIC--STEATSFKYEHVXAVPST 126

Query: 135 IDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST--DNHG 192
           +DWR+KGAVT IKDQGQCGSCWAFSAVAA+EGITQ++ GKLI LSEQ+LVDC T  ++ G
Sbjct: 127 VDWRKKGAVTPIKDQGQCGSCWAFSAVAAMEGITQLSTGKLISLSEQELVDCDTSGEDQG 186

Query: 193 CSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQAL 252
           CSGGLMD AF++I +N GL TEA+YPY   +GTC+ +K    AA I+ YED+P  +E+AL
Sbjct: 187 CSGGLMDDAFKFIEQNHGLTTEANYPYAGTDGTCNRKKAAHPAAKINGYEDVPANNEKAL 246

Query: 253 LQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIK 312
            +AV++QP++V +DA G  F FY SGV    CG   DHGV+ VG+GT+++  G KYWL+K
Sbjct: 247 QKAVAHQPIAVAIDAGGFEFQFYSSGVFTGQCGTELDHGVSAVGYGTSDD--GMKYWLVK 304

Query: 313 NSWGETWGESGYIRILRDA----GLCGIATAASYPVA 345
           NSWG  WGE GYIR+ RD     GLCGIA  ASYP A
Sbjct: 305 NSWGTGWGEEGYIRMQRDVTEKEGLCGIAMQASYPTA 341


>gi|124484401|dbj|BAF46311.1| cysteine proteinase precursor [Ipomoea nil]
          Length = 339

 Score =  380 bits (977), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 183/341 (53%), Positives = 238/341 (69%), Gaps = 13/341 (3%)

Query: 11  IPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNL 70
           + + + + LV   ++ + + R++ +  +  +HEQWMAQ+GR YK+E+EK  R NIFK+N+
Sbjct: 6   LKLLIALALVFATSAYLATSRTLLDSLMAVRHEQWMAQYGRVYKNEVEKTKRYNIFKENV 65

Query: 71  EYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTD 130
           EYIE  NK G + YKLG N F+DLTN+EF A   GY  P      + S  + F+Y+NV+ 
Sbjct: 66  EYIESFNKAGTKPYKLGINAFADLTNKEFIASRNGYILP-----HECSSNTPFRYENVSA 120

Query: 131 VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD- 189
           VPT++DWR+KGAVT +KDQGQCG CWAFSAVAA+EGIT+++ G LI LSEQ+LVDC    
Sbjct: 121 VPTTVDWRKKGAVTPVKDQGQCGCCWAFSAVAAMEGITKLSTGNLISLSEQELVDCDVKG 180

Query: 190 -NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGD 248
            + GC GGLMD AF +II NKGL TE++YPY+  +G+C   K    AA IS YED+P   
Sbjct: 181 IDQGCEGGLMDDAFTFIINNKGLTTESNYPYQGTDGSCKKSKSSNSAAKISGYEDVPANS 240

Query: 249 EQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKY 308
           E AL +AV+NQPVSV +DA G  F FY SGV   +CG   DHGV  VG+G AE+  G+KY
Sbjct: 241 ESALEKAVANQPVSVAIDAGGSDFQFYSSGVFTGECGTELDHGVTAVGYGIAED--GSKY 298

Query: 309 WLIKNSWGETWGESGYIRILRDA----GLCGIATAASYPVA 345
           WL+KNSWG +WGE GYIR+ +D     GLCGIA  +SYP A
Sbjct: 299 WLVKNSWGTSWGEKGYIRMQKDIEAKEGLCGIAMQSSYPSA 339


>gi|255568297|ref|XP_002525123.1| cysteine protease, putative [Ricinus communis]
 gi|223535582|gb|EEF37250.1| cysteine protease, putative [Ricinus communis]
          Length = 349

 Score =  379 bits (974), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 181/345 (52%), Positives = 240/345 (69%), Gaps = 14/345 (4%)

Query: 7   KSFIIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIF 66
           K  I+P+ +  +L + CA Q  S R +HE  +  +HE+WMA+HG+ YKD+ EK  R  IF
Sbjct: 6   KGKILPIALFFVLAM-CADQAAS-RELHELEMTGRHEKWMAKHGKVYKDDKEKLRRFQIF 63

Query: 67  KQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQ 126
           K N+ +IE  N  GN++Y LG N+F+DLTNEEFRA + GY RP+ +    S + + FKY+
Sbjct: 64  KSNVVFIESFNTAGNKSYMLGINKFADLTNEEFRAFWNGYKRPLGA----SRKITPFKYE 119

Query: 127 NVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDC 186
           NVT +P+SIDWR KGAVT IKDQG CGSCWAFSAVAA EGI ++  GKL+ LSEQ+LVDC
Sbjct: 120 NVTALPSSIDWRSKGAVTPIKDQGVCGSCWAFSAVAATEGIHKLRTGKLVSLSEQELVDC 179

Query: 187 ST--DNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDL 244
                + GC GGLM  AF++I  + G+ +EA+YPY+  +G CD +KE + A  I+ Y+ +
Sbjct: 180 DVKGQDKGCQGGLMVDAFKFIKRHGGMTSEANYPYQGRDGKCDTKKEASRAVKITGYQAV 239

Query: 245 PKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEEN 304
           PK  E ALL+AV+NQPVSV +DA   +F FY+SG+    CG + +HGVA VG+G +   +
Sbjct: 240 PKNSEAALLKAVANQPVSVAIDAGSLSFQFYRSGIFTGICGKDINHGVAAVGYGRS--NS 297

Query: 305 GAKYWLIKNSWGETWGESGYIRILRDA----GLCGIATAASYPVA 345
           G+KYW++KNSWG  WGE GYIR+ RD     GLCGIA   SYP A
Sbjct: 298 GSKYWIVKNSWGTEWGEKGYIRMKRDVRSKEGLCGIAMECSYPTA 342


>gi|225446589|ref|XP_002280263.1| PREDICTED: vignain [Vitis vinifera]
          Length = 339

 Score =  379 bits (974), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 194/341 (56%), Positives = 237/341 (69%), Gaps = 17/341 (4%)

Query: 11  IPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNL 70
           + M ++ IL    ASQ  S RS+HE S+ E+HE WMA++GR YKD  EK  R  IFK N+
Sbjct: 10  VSMALLFILA-AWASQATS-RSLHEASMYERHEDWMARYGRMYKDANEKEKRFKIFKDNV 67

Query: 71  EYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTD 130
             IE  NK  ++TYKL  NEF+DLTNEEFR+L   +   +       S  +TFKY+NVT 
Sbjct: 68  ARIESFNKAMDKTYKLSINEFADLTNEEFRSLRNRFKAHI------CSEATTFKYENVTA 121

Query: 131 VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST-- 188
           VP++IDWR+KGAVT IKDQ QCG CWAFSAVAA EGITQIT GKLI LSEQ+LVDC T  
Sbjct: 122 VPSTIDWRKKGAVTPIKDQQQCGCCWAFSAVAATEGITQITTGKLISLSEQELVDCDTGG 181

Query: 189 DNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGD 248
           +N GCSGGLMD AF + I+  GLA+EA YPY  ++GTC+++KE   AA I  YED+P  +
Sbjct: 182 ENQGCSGGLMDDAFRF-IKIHGLASEATYPYEGDDGTCNSKKEAHPAAKIKGYEDVPANN 240

Query: 249 EQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKY 308
           E+AL +AV++QPV+V +DA G  F FY SGV    CG   DHGVA VG+G    ++G  Y
Sbjct: 241 EKALQKAVAHQPVAVAIDAGGFEFQFYTSGVFTGQCGTELDHGVAAVGYGIG--DDGMMY 298

Query: 309 WLIKNSWGETWGESGYIRILRDA----GLCGIATAASYPVA 345
           WL+KNSWG  WGE GYIR+ RD     GLCGIA  ASYP A
Sbjct: 299 WLVKNSWGTGWGEEGYIRMQRDVTAKEGLCGIAMQASYPTA 339


>gi|225443827|ref|XP_002274223.1| PREDICTED: vignain-like [Vitis vinifera]
          Length = 340

 Score =  378 bits (971), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 189/337 (56%), Positives = 236/337 (70%), Gaps = 12/337 (3%)

Query: 15  VIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIE 74
           + ++++   ASQ +S R++HE S+ E+HE WM  +GRTYKD  EK  R  IFK+N+EYIE
Sbjct: 10  ITLLIMGVWASQALS-RTLHEVSMSERHEDWMGLYGRTYKDIAEKERRFKIFKENVEYIE 68

Query: 75  KANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTS 134
             N  GNR YKL  NEF+D TNEEF+A   GYN    S   +SS  ++F+Y+NV  VP+S
Sbjct: 69  SVNSAGNRRYKLSINEFADQTNEEFKASRNGYNM---SSRPRSSEITSFRYENVAAVPSS 125

Query: 135 IDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST--DNHG 192
           +DWR+KGAVT IKDQGQCG CWAFSAVAA+EG+TQ+  G+LI LSEQ+LVDC T  ++ G
Sbjct: 126 MDWRKKGAVTPIKDQGQCGCCWAFSAVAAMEGVTQLKTGELISLSEQELVDCDTSGEDQG 185

Query: 193 CSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQAL 252
           C GGLMD AFE+II N GL TEA+YPY+  + TC+ +K  + AA I  YED+P   E AL
Sbjct: 186 CGGGLMDSAFEFIIGNGGLTTEANYPYKGVDATCNKKKAASSAAKIKNYEDVPANSEAAL 245

Query: 253 LQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIK 312
           L+AV+  PVSV +DA G  F FY SGV    CG   DHGV  VG+G  + ++G KYWL+K
Sbjct: 246 LKAVAQHPVSVAIDAGGSDFQFYSSGVFTGQCGTELDHGVTAVGYG--KTDDGTKYWLVK 303

Query: 313 NSWGETWGESGYIRILR----DAGLCGIATAASYPVA 345
           NSWG  WGE GYI + R    D GLCGIA  ASYP A
Sbjct: 304 NSWGTGWGEDGYIWMERDIGADEGLCGIAMEASYPTA 340


>gi|24285904|gb|AAL14199.1| cysteine proteinase precursor [Ipomoea batatas]
 gi|56961686|gb|AAK15148.2| cysteine proteinase-like protein [Ipomoea batatas]
          Length = 341

 Score =  378 bits (971), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 181/328 (55%), Positives = 232/328 (70%), Gaps = 13/328 (3%)

Query: 24  ASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRT 83
           ++ + + R++ +  +V +HEQWMAQ+GR Y++E+EK  R NIFK+N+EYIE  NK G + 
Sbjct: 21  SAYLATSRTLSDSLMVVRHEQWMAQYGRVYENEVEKTKRFNIFKENVEYIESFNKAGTKP 80

Query: 84  YKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAV 143
           YKLG N F+DLTN+EF+A   GY  P        S  + F+Y+NV+ VPT++DWR KGAV
Sbjct: 81  YKLGINAFADLTNQEFKASRNGYKLP-----HDCSSNTPFRYENVSSVPTTVDWRTKGAV 135

Query: 144 THIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD--NHGCSGGLMDKA 201
           T +KDQGQCG CWAFSAVAA+EGIT+++ G LI LSEQ+LVDC     + GC GGLMD A
Sbjct: 136 TPVKDQGQCGCCWAFSAVAAMEGITKLSTGNLISLSEQELVDCDVKGIDQGCEGGLMDDA 195

Query: 202 FEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPV 261
           F +II NKGL TE++YPY+  +G+C   K    AA IS YED+P   E AL +AV+NQPV
Sbjct: 196 FSFIINNKGLTTESNYPYQGTDGSCKKSKSSNSAAKISGYEDVPANSESALEKAVANQPV 255

Query: 262 SVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGE 321
           SV +DA G  F FY SGV   +CG   DHGV  VG+G AE+  G+KYWL+KNSWG +WGE
Sbjct: 256 SVAIDAGGSDFQFYSSGVFTGECGTELDHGVTAVGYGIAED--GSKYWLVKNSWGTSWGE 313

Query: 322 SGYIRILRD----AGLCGIATAASYPVA 345
            GYIR+ +D     GLCGIA  +SYP A
Sbjct: 314 KGYIRMQKDIEAKEGLCGIAMQSSYPSA 341


>gi|13491750|gb|AAK27968.1|AF242372_1 cysteine protease [Ipomoea batatas]
          Length = 339

 Score =  378 bits (970), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 182/325 (56%), Positives = 228/325 (70%), Gaps = 13/325 (4%)

Query: 27  VVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKL 86
           + + R++ +  +V +HEQWMAQ+GR YK E EK  R NIFK+N+EYIE  NK G + YKL
Sbjct: 22  LATSRTLSDSLMVVRHEQWMAQYGRVYKTEAEKTKRFNIFKENVEYIESFNKAGTKPYKL 81

Query: 87  GTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHI 146
           G N F+DLTN+EF+A   GY  P        S  + F+Y+NV+ VPT++DWR KGAVT +
Sbjct: 82  GINAFADLTNQEFKASRNGYKLP-----HDCSSNTPFRYENVSSVPTTVDWRTKGAVTPV 136

Query: 147 KDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD--NHGCSGGLMDKAFEY 204
           KDQGQCG CWAFSAVAA+EGIT+++ G LI LSEQ+LVDC     + GC GGLMD AF +
Sbjct: 137 KDQGQCGCCWAFSAVAAMEGITKLSTGNLISLSEQELVDCDVKGTDQGCEGGLMDDAFSF 196

Query: 205 IIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVC 264
           II NKGL TE++YPY+  +G+C   K    AA IS YED+P   E AL +AV+NQPVSV 
Sbjct: 197 IINNKGLTTESNYPYQGTDGSCKKSKSSNSAAKISGYEDVPANSESALEKAVANQPVSVA 256

Query: 265 VDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGY 324
           +DA G  F FY SGV   +CG   DHGV  VG+G AE+  G+KYWL+KNSWG +WGE GY
Sbjct: 257 IDAGGSDFQFYSSGVFTGECGTELDHGVTAVGYGIAED--GSKYWLVKNSWGTSWGEKGY 314

Query: 325 IRILRD----AGLCGIATAASYPVA 345
           IR+ +D     GLCGIA  +SYP A
Sbjct: 315 IRMQKDIEAKEGLCGIAMQSSYPSA 339


>gi|10336513|dbj|BAB13759.1| cysteine proteinase [Astragalus sinicus]
          Length = 343

 Score =  377 bits (969), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 188/343 (54%), Positives = 240/343 (69%), Gaps = 16/343 (4%)

Query: 13  MFVIIILVITC----ASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQ 68
           ++ I + ++ C    A QV S R++ + S+ E+H+QWM Q+ + Y D  E   R  IFK+
Sbjct: 7   LYYISLALLMCLGLWAVQVTS-RTLQDASMYERHQQWMGQYAKIYNDHQEWEKRFQIFKE 65

Query: 69  NLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNV 128
           N+ YIE +NKEG R YKLG N+F DLTNEEF A     NR    +     R +T+KY+NV
Sbjct: 66  NVNYIETSNKEGGRFYKLGVNQFVDLTNEEFIAPR---NRFKGHMCSSIIRTNTYKYENV 122

Query: 129 TDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST 188
           T VP+++DWR+KGAVT +KDQGQCG CWAFSAVAA EGI Q++ GKLI LSEQ+LVDC T
Sbjct: 123 TTVPSNVDWRQKGAVTPVKDQGQCGCCWAFSAVAATEGIHQLSTGKLISLSEQELVDCDT 182

Query: 189 D--NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPK 246
              + GC GGLMD AF++II+N GL TEA YPY+  +GTC+  +    AATI+ YED+P 
Sbjct: 183 KGVDQGCEGGLMDDAFKFIIQNHGLDTEAKYPYQGVDGTCNANEASINAATITSYEDVPT 242

Query: 247 GDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGA 306
            +EQAL +AV+NQP+SV +DASG  F FY SGV    CG   DHGV  VG+G +++  G 
Sbjct: 243 NNEQALQKAVANQPISVAIDASGSDFQFYTSGVFTGSCGTELDHGVTAVGYGVSDD--GT 300

Query: 307 KYWLIKNSWGETWGESGYIRILR--DA--GLCGIATAASYPVA 345
           KYWL+KNSWG +WGE GYIR+ R  DA  GLCGIA  ASYP+A
Sbjct: 301 KYWLVKNSWGTSWGEEGYIRMQRGVDAVEGLCGIAMQASYPIA 343


>gi|318136892|gb|ADV41672.1| cysteine protease [Nicotiana tabacum]
          Length = 349

 Score =  377 bits (968), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 182/340 (53%), Positives = 234/340 (68%), Gaps = 9/340 (2%)

Query: 13  MFVIIILVITCASQVVSGRSM-HEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLE 71
           + +  I +    SQV S R + +E S+  +H+QW+A H + YKD  EK MR  IFK+N+E
Sbjct: 12  LALFFIFLGVWRSQVASSRPINYEASMRARHDQWIAHHDKVYKDLNEKEMRFKIFKENVE 71

Query: 72  YIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDV 131
            IE  N   ++ YKLG N+FSDLTNE+FR L+TGY R  P V   S   + F+Y NVTD+
Sbjct: 72  RIEAFNAGEDKGYKLGVNKFSDLTNEKFRVLHTGYKRSHPKVMSSSKPKTHFRYANVTDI 131

Query: 132 PTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST--D 189
           P ++DWR+KGAVT IKDQ +CG CWAFSAVAA EG+ Q+  GKLI LSEQ+LVDC    +
Sbjct: 132 PPTMDWRKKGAVTPIKDQKECGCCWAFSAVAATEGLHQLKTGKLIPLSEQELVDCDVEGE 191

Query: 190 NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDE 249
           + GCSGGL+D AF++I++NKGL TEA+YPY+ E+G C+ +K    AA I+ YED+P   E
Sbjct: 192 DEGCSGGLLDTAFDFILKNKGLTTEANYPYKGEDGVCNKKKSALSAAKIAGYEDVPANSE 251

Query: 250 QALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYW 309
           +ALLQAV+NQPVSV +D S   F FY SGV +  C    +H V  VG+G   +  G KYW
Sbjct: 252 KALLQAVANQPVSVAIDGSSFDFQFYSSGVFSGSCSTWLNHAVTAVGYGATTD--GTKYW 309

Query: 310 LIKNSWGETWGESGYIRILRDA----GLCGIATAASYPVA 345
           +IKNSWG  WG+SGY+RI RD     GLCG+A  ASYP A
Sbjct: 310 IIKNSWGSKWGDSGYMRIKRDVHEKEGLCGLAMDASYPTA 349


>gi|225446585|ref|XP_002280215.1| PREDICTED: vignain [Vitis vinifera]
          Length = 341

 Score =  377 bits (968), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 186/338 (55%), Positives = 241/338 (71%), Gaps = 14/338 (4%)

Query: 14  FVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYI 73
             ++  +   ASQ  + R++ E S+ E+HE WMAQ+GR YKD  EK+ R  IFK N+  I
Sbjct: 12  LALLFFLAAWASQATA-RNLLEASMYERHEDWMAQYGRVYKDADEKSKRYKIFKDNVARI 70

Query: 74  EKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPT 133
           E  NK  +++YKL  NEF+DLTNEEFRA     NR    +   S+  ++FKY++V  VP+
Sbjct: 71  ESFNKAMDKSYKLSINEFADLTNEEFRA---SRNRFKAHIC--STEATSFKYEHVAAVPS 125

Query: 134 SIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST--DNH 191
           ++DWR+KGAVT IKDQGQCGSCWAFSAVAA+EGITQ++ GKLI LSEQ+LVDC T  ++ 
Sbjct: 126 TVDWRKKGAVTPIKDQGQCGSCWAFSAVAAMEGITQLSTGKLISLSEQELVDCDTSGEDQ 185

Query: 192 GCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQA 251
           GC+GGLMD AF++I +N GLATEA+YPY   +GTC+ +K    AA I+ YED+P  +E+A
Sbjct: 186 GCNGGLMDDAFKFIEQNHGLATEANYPYAGTDGTCNRKKAAHPAAKINGYEDVPANNEKA 245

Query: 252 LLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLI 311
           L +AV++QP++V +DA G  F FY SGV    CG   DHGVA VG+GT+++  G KYWL+
Sbjct: 246 LQKAVAHQPIAVAIDAGGFEFQFYSSGVFTGQCGTELDHGVAAVGYGTSDD--GMKYWLV 303

Query: 312 KNSWGETWGESGYIRILRDA----GLCGIATAASYPVA 345
           KNSWG  WGE GYIR+ RD     GLCGIA  ASYP A
Sbjct: 304 KNSWGTGWGEVGYIRMQRDVTAKEGLCGIAMQASYPTA 341


>gi|356542633|ref|XP_003539771.1| PREDICTED: LOW QUALITY PROTEIN: vignain-like [Glycine max]
          Length = 341

 Score =  377 bits (967), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 186/349 (53%), Positives = 237/349 (67%), Gaps = 14/349 (4%)

Query: 3   LKFEKSFIIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMR 62
           + F+K F   + + +I    CA +  + R++ +  + E+HEQWMA HG+ YK   EK  +
Sbjct: 1   MAFKKLFHCTLALFLIFAF-CAFEA-NARTLEDAPMRERHEQWMATHGKVYKHSYEKEQK 58

Query: 63  LNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPST 122
             IF +N++ IE  N  G + YKLG N F+DLTNEEF+A+    NR    V  + +R +T
Sbjct: 59  YQIFMENVQRIEAFNNAGXKPYKLGINHFADLTNEEFKAI----NRFKGHVCSKRTRTTT 114

Query: 123 FKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQ 182
           F+Y+NVT VP S+DWR+KGAVT IKDQGQCG CWAFSAVAA EGIT++  GKLI LSEQ+
Sbjct: 115 FRYENVTAVPASLDWRQKGAVTPIKDQGQCGCCWAFSAVAATEGITKLRTGKLISLSEQE 174

Query: 183 LVDCSTD--NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISK 240
           LVDC T   + GC GGLMD AF++I++NKGLATEA YPY   +GTC+ + +   A +I  
Sbjct: 175 LVDCDTKGVDQGCEGGLMDDAFKFILQNKGLATEAIYPYEGFDGTCNAKADGNHAGSIKG 234

Query: 241 YEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTA 300
           YED+P   E ALL+AV+NQPVSV ++ASG  F FY  GV    CG N DHGV  VG+G  
Sbjct: 235 YEDVPANSESALLKAVANQPVSVAIEASGFKFQFYSGGVFTGSCGTNLDHGVTSVGYGVG 294

Query: 301 EEENGAKYWLIKNSWGETWGESGYIRILRDA----GLCGIATAASYPVA 345
           ++  G KYWL+KNSWG  WGE GYIR+ RD     GLCGIA  ASYP A
Sbjct: 295 DD--GTKYWLVKNSWGVKWGEKGYIRMQRDVAAKEGLCGIAMLASYPSA 341


>gi|224076970|ref|XP_002305073.1| predicted protein [Populus trichocarpa]
 gi|222848037|gb|EEE85584.1| predicted protein [Populus trichocarpa]
          Length = 340

 Score =  377 bits (967), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 180/340 (52%), Positives = 237/340 (69%), Gaps = 16/340 (4%)

Query: 13  MFVIIILVITCASQVVSGRSMHEPS-IVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLE 71
           +F+  +L++   +  ++ R + E   ++++HE+WMAQHGR Y D  EK  R  IFK+N+E
Sbjct: 10  IFLPFLLILAAWATKIACRPLDEQEYMLKRHEEWMAQHGRVYGDMKEKEKRYLIFKENIE 69

Query: 72  YIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSR--PSTFKYQNVT 129
            IE  N   +R YKLG N+F+DLTNEEFRA+Y GY        RQSS+   S+F+Y+N++
Sbjct: 70  RIEAFNNGSDRGYKLGVNKFADLTNEEFRAMYHGY-------KRQSSKLMSSSFRYENLS 122

Query: 130 DVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD 189
           D+PTS+DWR  GAVT +KDQG CG CWAFS VAA+EGI ++  G LI LSEQQLVDC+  
Sbjct: 123 DIPTSMDWRNDGAVTPVKDQGTCGCCWAFSTVAAIEGIIKLQTGNLISLSEQQLVDCTAG 182

Query: 190 NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDE 249
           N GC GGLMD AF+YII N GL +E +YPY+  +GTC ++K  +  A I+ YED+P+ +E
Sbjct: 183 NKGCQGGLMDTAFQYIIRNGGLTSEDNYPYQGVDGTCSSEKAASTEAQITGYEDVPQNNE 242

Query: 250 QALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYW 309
            ALLQAV+ QPVSV VD  G  F FYKSGV N DCG   +H V  +G+GT  + +G  YW
Sbjct: 243 NALLQAVAKQPVSVGVDGGGNDFQFYKSGVFNGDCGTQQNHAVTAIGYGT--DIDGTDYW 300

Query: 310 LIKNSWGETWGESGYIRILR----DAGLCGIATAASYPVA 345
           L+KNSWG +WGE+GY+R+ R      GLCG+A  ASYP A
Sbjct: 301 LVKNSWGTSWGENGYMRMRRGIGSSEGLCGVAMDASYPTA 340


>gi|144905104|dbj|BAF56427.1| cysteine proteinase [Lotus japonicus]
          Length = 342

 Score =  377 bits (967), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 183/338 (54%), Positives = 242/338 (71%), Gaps = 13/338 (3%)

Query: 14  FVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYI 73
           F +++ +   A QV S R++ + S+ E+HEQWMA++G+ YKD  EK  R NIF++N++YI
Sbjct: 12  FALVLCLGLWAFQV-SSRTLQDASMHERHEQWMARYGKVYKDLQEKEKRFNIFQENVKYI 70

Query: 74  EKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPT 133
           E +N  GN+ YKLG N+F+DLTN+EF A     N+    +S   +R +TFKY+NVT  P+
Sbjct: 71  EASNNAGNKPYKLGVNQFTDLTNKEFIATR---NKFKGHMSSSITRTTTFKYENVT-APS 126

Query: 134 SIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD--NH 191
           ++DWR++GAVT +K+QG CG CWAFSAVAA EGI +++ G L+ LSEQ+LVDC T   + 
Sbjct: 127 TVDWRQEGAVTPVKNQGTCGCCWAFSAVAATEGIHKLSTGNLVSLSEQELVDCDTSGADQ 186

Query: 192 GCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQA 251
           GC GGLMD AF++II+N GL TEA YPY+  +GTC+  +E    ATI+ YED+P  +EQA
Sbjct: 187 GCQGGLMDDAFKFIIQNGGLNTEAQYPYQGVDGTCNTNEEVTHVATITGYEDVPSNNEQA 246

Query: 252 LLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLI 311
           L QAV+NQP+SV +DASG  F  Y+SGV    CG   DHGVAVVG+G +++  G KYWL+
Sbjct: 247 LQQAVANQPISVAIDASGSDFQNYQSGVFTGSCGTQLDHGVAVVGYGVSDD--GTKYWLV 304

Query: 312 KNSWGETWGESGYIRILRDA----GLCGIATAASYPVA 345
           KNSWGE WGE GYIR+ RD     GLCGIA   SYP A
Sbjct: 305 KNSWGEDWGEEGYIRMQRDVEAPEGLCGIAMQPSYPTA 342


>gi|535454|gb|AAA50755.1| cysteine proteinase [Alnus glutinosa]
          Length = 340

 Score =  376 bits (965), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 181/345 (52%), Positives = 239/345 (69%), Gaps = 15/345 (4%)

Query: 9   FIIPMFVIIILVI--TCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIF 66
           F+   F ++++V     ASQ+ + RS+ + S+ E+HE+WMA +GR YKD  EK  R  IF
Sbjct: 3   FVSQCFCLVVMVTLGALASQLAAARSLQDASMRERHEEWMASYGRVYKDINEKQKRYKIF 62

Query: 67  KQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQ 126
           ++N+  IE +NK+ N+ YKL  N+F+DLTNEEF+A    +   + S     ++ ++FKY 
Sbjct: 63  EENVALIESSNKDANKPYKLSVNQFADLTNEEFKASRNRFKGHICS-----TKSTSFKYG 117

Query: 127 NVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDC 186
           NV+ VP+++DWR KGAVT +KDQGQCG CWAFSAVAA EGIT++T G+LI LSEQ+LVDC
Sbjct: 118 NVSAVPSAMDWRMKGAVTPVKDQGQCGCCWAFSAVAATEGITKLTTGELISLSEQELVDC 177

Query: 187 STD--NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDL 244
            T   + GC GGLMD AF +I  N GLA+EA+YPY+  +GTC+  K+   AA I+ +ED+
Sbjct: 178 DTSGVDQGCEGGLMDNAFTFIQHNHGLASEANYPYKGVDGTCNTNKQAIHAAEINGFEDV 237

Query: 245 PKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEEN 304
           P   E+ALL AV++QPVSV +DA G  F FY  GV    CG   DHGV  VG+GT+++  
Sbjct: 238 PANSEEALLNAVAHQPVSVAIDAGGSGFQFYSKGVFIGACGTQLDHGVTAVGYGTSDD-- 295

Query: 305 GAKYWLIKNSWGETWGESGYIRILRDA----GLCGIATAASYPVA 345
           G KYWL+KNSWG  WGE GYIR+ RD     GLCGIA  ASYP A
Sbjct: 296 GTKYWLVKNSWGTQWGEEGYIRMQRDVDAKEGLCGIAMKASYPTA 340


>gi|255568299|ref|XP_002525124.1| cysteine protease, putative [Ricinus communis]
 gi|223535583|gb|EEF37251.1| cysteine protease, putative [Ricinus communis]
          Length = 342

 Score =  376 bits (965), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 175/339 (51%), Positives = 238/339 (70%), Gaps = 12/339 (3%)

Query: 13  MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEY 72
           + + +  V+   +   S R +HE ++VE+HE+WMA+HG+ YKD+ EK  R  IFK N+E+
Sbjct: 10  LLIALFFVLAMWADQASTRELHESTMVERHEKWMAKHGKVYKDDEEKLRRFQIFKNNVEF 69

Query: 73  IEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVP 132
           IE +N  GN +Y LG N F+DLTNEEFRA + GY RP+ +    S   + FKY+NVT +P
Sbjct: 70  IESSNAAGNNSYMLGINRFADLTNEEFRASWNGYKRPLDA----SRIVTPFKYENVTALP 125

Query: 133 TSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST--DN 190
            S+DWR KGAVT IKDQ +CGSCWAFSAVAA EG+ ++  GKL+ LSEQ+LVDC    ++
Sbjct: 126 YSMDWRRKGAVTSIKDQRECGSCWAFSAVAATEGVHKLRTGKLVSLSEQELVDCDVKGED 185

Query: 191 HGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQ 250
            GC GGLM+ AF++I  N G+ TEA+Y YR  +G CD +KE +  A I+ Y+ +P+  E 
Sbjct: 186 KGCQGGLMEDAFKFIKRNGGITTEANYAYRGRDGKCDTKKEASHVAKITGYQVVPENSEA 245

Query: 251 ALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWL 310
           ALL+AV++QPVSV +DA   +F FY+SG+    CG++ +HGVA VG+GT+   +G+KYW+
Sbjct: 246 ALLKAVAHQPVSVSIDAGSMSFQFYQSGIYAGSCGSDLNHGVAAVGYGTS--SSGSKYWI 303

Query: 311 IKNSWGETWGESGYIRILRD----AGLCGIATAASYPVA 345
           +KNSWG  WGE GY+R+ RD     GLCGIA   SYP A
Sbjct: 304 VKNSWGPEWGERGYVRMKRDITSRKGLCGIAMDCSYPTA 342


>gi|224076968|ref|XP_002305072.1| predicted protein [Populus trichocarpa]
 gi|222848036|gb|EEE85583.1| predicted protein [Populus trichocarpa]
          Length = 305

 Score =  375 bits (964), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 177/314 (56%), Positives = 224/314 (71%), Gaps = 15/314 (4%)

Query: 38  IVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNE 97
           ++++HE+WMAQHGR Y D  EK  R  IFK+N+E IE  N   +R YKLG N+F+DLTNE
Sbjct: 1   MLKRHEEWMAQHGRVYGDMKEKEKRYLIFKENIERIEAFNNGSDRGYKLGVNKFADLTNE 60

Query: 98  EFRALYTGYNRPVPSVSRQSSR--PSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSC 155
           EFRA+Y GY        RQSS+   S+F+Y+N++D+PTS+DWR  GAVT +KDQG CG C
Sbjct: 61  EFRAMYHGY-------KRQSSKLMSSSFRYENLSDIPTSMDWRNDGAVTPVKDQGTCGCC 113

Query: 156 WAFSAVAAVEGITQITRGKLIELSEQQLVDCSTDNHGCSGGLMDKAFEYIIENKGLATEA 215
           WAFS VAA+EGI ++  G LI LSEQQLVDC+  N GC GGLMD AF+YII N GL +E 
Sbjct: 114 WAFSTVAAIEGIIKLQTGNLISLSEQQLVDCTAGNKGCQGGLMDTAFQYIIRNGGLTSED 173

Query: 216 DYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFY 275
           +YPY+  +GTC ++K  +  A I+ YED+P+ +E ALLQAV+ QPVSV VD  G  F FY
Sbjct: 174 NYPYQGVDGTCSSEKAASTEAQITGYEDVPQNNENALLQAVAKQPVSVAVDGGGNDFRFY 233

Query: 276 KSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILR----DA 331
           KSGV   DCG N +HGV  +G+GT  + +G  YWL+KNSWG +WGESGY R+ R      
Sbjct: 234 KSGVFEGDCGTNLNHGVTAIGYGT--DSDGTDYWLVKNSWGTSWGESGYTRMQRGIGASE 291

Query: 332 GLCGIATAASYPVA 345
           GLCG+A  ASYP +
Sbjct: 292 GLCGVAMDASYPTS 305


>gi|50355613|dbj|BAD29955.1| cysteine protease [Daucus carota]
          Length = 365

 Score =  375 bits (963), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 184/352 (52%), Positives = 242/352 (68%), Gaps = 18/352 (5%)

Query: 1   MVLKFEKSFIIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKA 60
           + +K + + +  +F I +L     + + + RS++E S+ E H+QWMA++GR YK   EK 
Sbjct: 3   LTIKHQCTPLALLFTIGVL-----ASLAAARSLNEASMTETHDQWMARYGRVYKTANEKN 57

Query: 61  MRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRP 120
            R  IF++NL+YI+  NK  N+ YKLG NEF+DLTNEEF      +   V +     +  
Sbjct: 58  RRSTIFQENLKYIQTFNKANNKPYKLGVNEFADLTNEEFTTSRNKFKSHVCA-----TVT 112

Query: 121 STFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSE 180
           + F+Y+NVT VP ++DWR+KGAVT IK+QGQCG CWAFSAVAA+EGITQ+  GKLI LSE
Sbjct: 113 NVFRYENVTAVPATMDWRKKGAVTPIKNQGQCGCCWAFSAVAAMEGITQLKTGKLISLSE 172

Query: 181 QQLVDCSTD--NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATI 238
           Q+LVDC T+  + GC GGLMD AF++I +N GL+TE +YPY   +GTC+  KE   AATI
Sbjct: 173 QELVDCDTNGEDQGCEGGLMDYAFDFIQQNHGLSTETNYPYSGTDGTCNANKEANHAATI 232

Query: 239 SKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFG 298
           + +ED+P   E ALL+AV+NQP+SV +DASG  F FY SGV   +CG   DHGV  VG+G
Sbjct: 233 TGHEDVPANSESALLKAVANQPISVAIDASGSDFQFYSSGVFTGECGTELDHGVTAVGYG 292

Query: 299 TAEEENGAKYWLIKNSWGETWGESGYIRILRDA----GLCGIATAASYPVAI 346
           TA +  G KYWL+KNSWG +WGE GYI++ R      GLCGIA  ASYP A 
Sbjct: 293 TAAD--GTKYWLVKNSWGTSWGEEGYIQMQRGVAAAEGLCGIAMQASYPTAF 342


>gi|255580657|ref|XP_002531151.1| cysteine protease, putative [Ricinus communis]
 gi|223529264|gb|EEF31236.1| cysteine protease, putative [Ricinus communis]
          Length = 340

 Score =  374 bits (960), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 183/338 (54%), Positives = 237/338 (70%), Gaps = 15/338 (4%)

Query: 14  FVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYI 73
             +I L+    SQ ++ R++ + S+ EKHE+WM++ GR Y D  EK +R  IFK+N++ I
Sbjct: 12  LALIFLLGALVSQAMA-RTLQDASMHEKHEEWMSRFGRVYNDGNEKEIRYKIFKENVQRI 70

Query: 74  EKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPT 133
           E  NK   ++YKLG N+F+DLTNEEF+   T  NR    +   SS+   F+Y+N+T  P+
Sbjct: 71  ESFNKASGKSYKLGINQFADLTNEEFK---TSRNRFKGHMC--SSQAGPFRYENLTAAPS 125

Query: 134 SIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST--DNH 191
           S+DWR+KGAVT IKDQGQCGSCWAFSAVAAVEGITQ+   KLI LSEQ+LVDC T  ++ 
Sbjct: 126 SMDWRKKGAVTAIKDQGQCGSCWAFSAVAAVEGITQLATSKLISLSEQELVDCDTKGEDQ 185

Query: 192 GCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQA 251
           GC GGLMD AF++I +N+GL TEA+YPY   +GTC+ ++E   AA I+ +ED+P  +E A
Sbjct: 186 GCQGGLMDDAFKFIEQNQGLTTEANYPYEGSDGTCNTKQEANHAAKINGFEDVPANNEGA 245

Query: 252 LLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLI 311
           L++AV+ QPVSV +DA G  F FY SG+   DCG   DHGVA VG+G   E NG  YWL+
Sbjct: 246 LMKAVAKQPVSVAIDAGGFGFQFYSSGIFTGDCGTELDHGVAAVGYG---ESNGMNYWLV 302

Query: 312 KNSWGETWGESGYIRILRD----AGLCGIATAASYPVA 345
           KNSWG  WGE GYIR+ +D     GLCGIA  ASYP A
Sbjct: 303 KNSWGTQWGEEGYIRMQKDIDAKEGLCGIAMQASYPTA 340


>gi|409190991|gb|AFV30165.1| cysteine proteinase [Lotus japonicus]
          Length = 342

 Score =  374 bits (960), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 182/349 (52%), Positives = 243/349 (69%), Gaps = 13/349 (3%)

Query: 3   LKFEKSFIIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMR 62
           +  +  F    F +++ +   A QV S R++ + S+ E+HEQWMA++GR YKD  EK  R
Sbjct: 1   MATKNQFYQVSFALVLCLGLWAFQV-SSRTLQDASMQERHEQWMARYGRVYKDLQEKEKR 59

Query: 63  LNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPST 122
            +IFK+N+ YIE +N  G++ YKLG N+F+DLTNEEF A     N+    +S   +R +T
Sbjct: 60  FSIFKENVNYIEASNNAGDKPYKLGVNQFADLTNEEFIATR---NKFKGHMSSSITRTTT 116

Query: 123 FKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQ 182
           FKY+NVT  P+++DWR++GAVT +K+QG CG CWAFSAVAA EGI +++ G L+ LSEQ+
Sbjct: 117 FKYENVT-APSTVDWRQEGAVTPVKNQGTCGCCWAFSAVAATEGIHKLSTGNLVSLSEQE 175

Query: 183 LVDCSTD--NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISK 240
           LVDC T   + GC GGLMD AF++II+N GL TEA YPY+  +GTC+  +E    ATI+ 
Sbjct: 176 LVDCDTSGADQGCQGGLMDDAFKFIIQNGGLNTEAQYPYQGVDGTCNTNEEATHVATITG 235

Query: 241 YEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTA 300
           YED+P  +EQAL QAV+NQP+S+ +DASG  F  Y+SGV    CG   DHGVAVVG+G +
Sbjct: 236 YEDVPSNNEQALQQAVANQPISIAIDASGSDFQNYQSGVFTGSCGTQLDHGVAVVGYGVS 295

Query: 301 EEENGAKYWLIKNSWGETWGESGYIRILRDA----GLCGIATAASYPVA 345
           ++  G KYWL+KNSWG  WGE GYIR+ RD     GLCG+A   SYP A
Sbjct: 296 DD--GTKYWLVKNSWGADWGEEGYIRMQRDVDAPEGLCGLAMQPSYPTA 342


>gi|255580659|ref|XP_002531152.1| cysteine protease, putative [Ricinus communis]
 gi|223529265|gb|EEF31237.1| cysteine protease, putative [Ricinus communis]
          Length = 340

 Score =  373 bits (958), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 183/338 (54%), Positives = 236/338 (69%), Gaps = 15/338 (4%)

Query: 14  FVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYI 73
             +I  +   ASQ ++ R++ + SI EKHE+WM +  R Y D  EK +R  IFK+N++ I
Sbjct: 12  LALIFFLGALASQAIA-RTLQDASIHEKHEEWMTRFKRVYSDAKEKEIRYKIFKENVQRI 70

Query: 74  EKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPT 133
           E  NK   ++YKLG N+F+DLTNEEF+   T  NR    +   SS+   F+Y+N+T VP+
Sbjct: 71  ESFNKASEKSYKLGINQFADLTNEEFK---TSRNRFKGHMC--SSQAGPFRYENITAVPS 125

Query: 134 SIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST--DNH 191
           S+DWR++GAVT IKDQGQCGSCWAFSAVAAVEGITQ+   KLI LSEQ+LVDC T  ++ 
Sbjct: 126 SMDWRKEGAVTAIKDQGQCGSCWAFSAVAAVEGITQLATSKLISLSEQELVDCDTKGEDQ 185

Query: 192 GCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQA 251
           GC GGLMD AF++I +N+GL TEA+YPY   +GTC+ ++E   AA I+ +ED+P  +E A
Sbjct: 186 GCQGGLMDDAFKFIEQNQGLTTEANYPYEGSDGTCNTKQEANHAAKINGFEDVPANNEGA 245

Query: 252 LLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLI 311
           L++AV+ QPVSV +DA G  F FY SG+   DCG   DHGVA VG+G   E NG  YWL+
Sbjct: 246 LMKAVAKQPVSVAIDAGGFEFQFYSSGIFTGDCGTELDHGVAAVGYG---ESNGMNYWLV 302

Query: 312 KNSWGETWGESGYIRILRD----AGLCGIATAASYPVA 345
           KNSWG  WGE GYIR+ +D     GLCGIA  ASYP A
Sbjct: 303 KNSWGTQWGEEGYIRMQKDIDAKEGLCGIAMQASYPTA 340


>gi|224121800|ref|XP_002330656.1| predicted protein [Populus trichocarpa]
 gi|222872260|gb|EEF09391.1| predicted protein [Populus trichocarpa]
          Length = 342

 Score =  373 bits (957), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 176/338 (52%), Positives = 229/338 (67%), Gaps = 11/338 (3%)

Query: 14  FVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYI 73
           F   IL++   +  V+ R + E  +  +HEQWMA +G+ Y D  EK  R  IFK N+EYI
Sbjct: 10  FFAFILILGMWAFEVASRELQESYMSARHEQWMATYGKVYVDAAEKERRFKIFKNNVEYI 69

Query: 74  EKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPT 133
           E  N  GN+ YKL  N+F+D TNE+F+    GY RP  +   +  + ++FKY+NVT VP 
Sbjct: 70  ESFNTAGNKPYKLSVNKFADQTNEKFKGARNGYRRPFQT---RPMKVTSFKYENVTAVPA 126

Query: 134 SIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST--DNH 191
           ++DWR+KGAVT IKDQGQCGSCWAFS VAA EGI Q+T GKL+ LSEQ+LVDC    ++ 
Sbjct: 127 TMDWRKKGAVTLIKDQGQCGSCWAFSTVAATEGINQLTTGKLVSLSEQELVDCDIQGEDQ 186

Query: 192 GCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQA 251
           GC GGLM+  FE+II+N G+ TEA+YPY+  +GTC+++K+ +  A I+ YE +P   E  
Sbjct: 187 GCEGGLMEDGFEFIIKNHGITTEANYPYQAADGTCNSKKQASHIAKITGYESVPANSEAE 246

Query: 252 LLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLI 311
           LL+ V+NQP+SV +DA G  F FY SGV    CG   DHGV  VG+G  E  +G KYWL+
Sbjct: 247 LLKVVANQPISVSIDAGGSDFQFYSSGVFTGKCGTELDHGVTAVGYG--ETSDGTKYWLV 304

Query: 312 KNSWGETWGESGYIRILRD----AGLCGIATAASYPVA 345
           KNSWG +WGE GYIR+ RD     GLCGIA  +SYP A
Sbjct: 305 KNSWGTSWGEEGYIRMQRDIDTEEGLCGIAMDSSYPTA 342


>gi|356515050|ref|XP_003526214.1| PREDICTED: vignain-like [Glycine max]
          Length = 344

 Score =  372 bits (954), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 185/350 (52%), Positives = 237/350 (67%), Gaps = 11/350 (3%)

Query: 1   MVLKFEKSFIIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKA 60
           M    +K  ++ +F+ + + I   SQV+  R +H+ ++ E+HE WMA++G+ YKD  EK 
Sbjct: 1   MAFTGQKQHMLALFLFLAVGI---SQVMP-RKLHQTALRERHENWMAEYGKIYKDAAEKE 56

Query: 61  MRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRP 120
            R  IFK N+E+IE  N  GN+ YKLG N  +DLT EEF+    G  R     S  + + 
Sbjct: 57  KRFQIFKDNVEFIESFNAAGNKPYKLGVNHLADLTLEEFKDSRNGLKRTY-EFSTTTFKL 115

Query: 121 STFKYQNVTDVPTSIDWREKGAVTHIKDQG-QCGSCWAFSAVAAVEGITQITRGKLIELS 179
           + FKY+NVTD+P +IDWR KGAVT IKDQG QCGSCWAFS VAA EGI QI+ G L+ LS
Sbjct: 116 NGFKYENVTDIPEAIDWRVKGAVTPIKDQGDQCGSCWAFSTVAATEGIYQISTGMLMSLS 175

Query: 180 EQQLVDCSTDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATIS 239
           EQ+LVDC + +HGC GGLM+  FE+II+N G+++EA+YPY   +GTCD  KE + AA I 
Sbjct: 176 EQELVDCDSVDHGCDGGLMEDGFEFIIKNGGISSEANYPYTAVDGTCDASKEASPAAQIK 235

Query: 240 KYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGT 299
            YE +P   E+AL QAV+NQPVSV +DA G  F FY SGV    CG   DHGV VVG+GT
Sbjct: 236 GYETVPANSEEALQQAVANQPVSVSIDAGGSGFQFYSSGVFTGQCGTQLDHGVTVVGYGT 295

Query: 300 AEEENGAKYWLIKNSWGETWGESGYIRILR--DA--GLCGIATAASYPVA 345
            ++    +YW++KNSWG  WGE GYIR+ R  DA  GLCGIA  ASYP A
Sbjct: 296 TDDGT-HEYWIVKNSWGTQWGEEGYIRMQRGIDALEGLCGIAMDASYPTA 344


>gi|356542631|ref|XP_003539770.1| PREDICTED: vignain-like [Glycine max]
          Length = 343

 Score =  372 bits (954), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 181/322 (56%), Positives = 236/322 (73%), Gaps = 13/322 (4%)

Query: 31  RSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNE 90
           R++ + S+ E+HEQWMAQHG+ YKD  EK +R  IF+QN++ IE  N  GN+++KLG N+
Sbjct: 28  RTLEDASMHERHEQWMAQHGKVYKDHHEKELRYKIFQQNVKGIEGFNNAGNKSHKLGVNQ 87

Query: 91  FSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQG 150
           F+DLT EEF+A+    N+    +  + SR STFKY++VT VP ++DWR+KGAVT IK QG
Sbjct: 88  FADLTEEEFKAI----NKLKGYMWSKISRTSTFKYEHVTKVPATLDWRQKGAVTPIKSQG 143

Query: 151 -QCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST--DNHGCSGGLMDKAFEYIIE 207
            +CGSCWAF+AVAA EGIT++T G+LI LSEQ+L+DC T  DN GC  G++ +AF++I++
Sbjct: 144 LKCGSCWAFAAVAATEGITKLTTGELISLSEQELIDCDTNGDNGGCKWGIIQEAFKFIVQ 203

Query: 208 NKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDA 267
           NKGLATEA YPY+  +GTC+ + E    A+I  YED+P  +E ALL AV+NQPVSV VD+
Sbjct: 204 NKGLATEASYPYQAVDGTCNAKVESKHVASIKGYEDVPANNETALLNAVANQPVSVLVDS 263

Query: 268 SGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRI 327
           S   F FY SGVL+  CG   DH V VVG+G +++  G KYWLIKNSWG  WGE GYIRI
Sbjct: 264 SDYDFRFYSSGVLSGSCGTTFDHAVTVVGYGVSDD--GTKYWLIKNSWGVYWGEQGYIRI 321

Query: 328 LRDA----GLCGIATAASYPVA 345
            RD     G+CGIA  ASYP+A
Sbjct: 322 KRDVAAKEGMCGIAMQASYPIA 343


>gi|224081320|ref|XP_002306369.1| predicted protein [Populus trichocarpa]
 gi|222855818|gb|EEE93365.1| predicted protein [Populus trichocarpa]
          Length = 340

 Score =  371 bits (953), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 181/351 (51%), Positives = 240/351 (68%), Gaps = 17/351 (4%)

Query: 1   MVLKFEKSFIIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKA 60
           M L  +  FI    + ++ V+       + R++ + S+ E+HEQWMAQ+GR YKD+ EK 
Sbjct: 1   MRLTKQSQFIC---LALLFVLGAWPSKSAARTLQDVSMYERHEQWMAQYGRVYKDDAEKE 57

Query: 61  MRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRP 120
            R NIFK+N+  I+  N +  ++YKLG N+F+DL+NEEF+A     NR    +    + P
Sbjct: 58  TRYNIFKENVARIDAFNSQTGKSYKLGVNQFADLSNEEFKA---SRNRFKGHMCSPQAGP 114

Query: 121 STFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSE 180
             F+Y+NV+ VP ++DWR+KGAVT +KDQGQCG CWAFSAVAA+EGI Q+T GKLI LSE
Sbjct: 115 --FRYENVSAVPATMDWRKKGAVTPVKDQGQCGCCWAFSAVAAMEGINQLTTGKLISLSE 172

Query: 181 QQLVDCST--DNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATI 238
           Q++VDC T  ++ GC+GGLMD AF++I +NKGL TEA+YPY   +GTC+ QKE   AA I
Sbjct: 173 QEVVDCDTKGEDQGCNGGLMDDAFKFIEQNKGLTTEANYPYTGTDGTCNTQKEATHAAKI 232

Query: 239 SKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFG 298
           + +ED+P   E AL++AV+ QPVSV +DA G  F FY SG+    CG   DHGV  VG+G
Sbjct: 233 TGFEDVPANSEAALMKAVAKQPVSVAIDAGGFEFQFYSSGIFTGSCGTQLDHGVTAVGYG 292

Query: 299 TAEEENGAKYWLIKNSWGETWGESGYIRILRD----AGLCGIATAASYPVA 345
            ++   G KYWL+KNSWG  WGE GYIR+ +D     GLCGIA  ASYP A
Sbjct: 293 ISD---GTKYWLVKNSWGAQWGEEGYIRMQKDISAKEGLCGIAMQASYPSA 340


>gi|297819568|ref|XP_002877667.1| hypothetical protein ARALYDRAFT_348033 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297323505|gb|EFH53926.1| hypothetical protein ARALYDRAFT_348033 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 341

 Score =  371 bits (952), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 178/343 (51%), Positives = 242/343 (70%), Gaps = 13/343 (3%)

Query: 11  IPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNL 70
           I  F++ I++ +  S   S   + E S +EKHEQWM++  R Y D+ EK  R  IFK+NL
Sbjct: 4   IIFFLLAIILSSRTSGATSRGGLFEASAIEKHEQWMSRFHRVYSDDSEKTSRFEIFKKNL 63

Query: 71  EYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPS----TFKYQ 126
           +++E  N   N+TY L  NEFSDLT+EEF+A YTG   P   ++R S+  S    +F+Y+
Sbjct: 64  KFVESFNMNTNKTYTLDVNEFSDLTDEEFKARYTGLVVP-EGMTRMSTTDSHETVSFRYE 122

Query: 127 NVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDC 186
           NV +   S+DWRE+GAVT +K Q QCG CWAFSAVAAVEG+T+I +G+L+ LSEQQL+DC
Sbjct: 123 NVGETGESMDWREEGAVTSVKHQQQCGCCWAFSAVAAVEGMTKIAKGELVSLSEQQLLDC 182

Query: 187 STDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPK 246
           ST+N GC GG+M KAF+YI+EN+G+  E +YPY+  + TC++      AATIS YE +P+
Sbjct: 183 STENDGCDGGIMWKAFDYIVENQGITAEDNYPYQGAQQTCESN--HVAAATISGYETVPQ 240

Query: 247 GDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGA 306
            DE+ALL+AVS QPVSV ++ SG  F  Y  G+ N +CG + +H V +VG+G +EE  G 
Sbjct: 241 NDEEALLKAVSQQPVSVAIEGSGYEFIHYSGGIFNGECGTHLNHAVTIVGYGVSEE--GI 298

Query: 307 KYWLIKNSWGETWGESGYIRILRDA----GLCGIATAASYPVA 345
           KYWL+KNSWGE+WGE GY+RI+RD     G+CG+A+ A YPVA
Sbjct: 299 KYWLLKNSWGESWGEDGYMRIMRDVDAPQGMCGLASLAYYPVA 341


>gi|356577811|ref|XP_003557016.1| PREDICTED: vignain-like [Glycine max]
          Length = 343

 Score =  370 bits (951), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 182/345 (52%), Positives = 236/345 (68%), Gaps = 11/345 (3%)

Query: 7   KSFIIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIF 66
           K+    + + ++L +   +  V+ RS+ + S+ E+HEQWM ++G+ YKD  E+  R  IF
Sbjct: 4   KNHFCHISLAMLLCMAFLAFQVTCRSLQDASMYERHEQWMTRYGKVYKDPQEREKRFRIF 63

Query: 67  KQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQ 126
           K+N+ YIE  N   N+ YKL  N+F+DLTNEEF A     NR    +     R +TFKY+
Sbjct: 64  KENVNYIEAFNNAANKRYKLAINQFADLTNEEFIA---PRNRFKGHMCSSIIRTTTFKYE 120

Query: 127 NVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDC 186
           NVT VP+++DWR+KGAVT IKDQGQCG CWAFSAVAA EGI  +T GKLI LSEQ+LVDC
Sbjct: 121 NVTAVPSTVDWRQKGAVTPIKDQGQCGCCWAFSAVAATEGIHALTSGKLISLSEQELVDC 180

Query: 187 STD--NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDL 244
            T   + GC GGLMD AF+++I+N GL TEA+YPY+  +G C+  +    AATI+ YED+
Sbjct: 181 DTKGVDQGCEGGLMDDAFKFVIQNHGLNTEANYPYKGVDGKCNVNEAANDAATITGYEDV 240

Query: 245 PKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEEN 304
           P  +E+AL +AV+NQPVSV +DASG  F FYKSGV    CG   DHGV  VG+G + +  
Sbjct: 241 PANNEKALQKAVANQPVSVAIDASGSDFQFYKSGVFTGSCGTELDHGVTAVGYGVSND-- 298

Query: 305 GAKYWLIKNSWGETWGESGYIRILR----DAGLCGIATAASYPVA 345
           G +YWL+KNSWG  WGE GYIR+ R    + GLCGIA  ASYP A
Sbjct: 299 GTEYWLVKNSWGTEWGEEGYIRMQRGVNSEEGLCGIAMQASYPTA 343


>gi|224099295|ref|XP_002334495.1| predicted protein [Populus trichocarpa]
 gi|222872550|gb|EEF09681.1| predicted protein [Populus trichocarpa]
          Length = 342

 Score =  370 bits (950), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 175/338 (51%), Positives = 228/338 (67%), Gaps = 11/338 (3%)

Query: 14  FVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYI 73
           F   IL++   +  V+ R + E  +  +HEQWMA +G+ Y D  EK  R  IFK N+EYI
Sbjct: 10  FFAFILILGMWAFEVASRELQESYMSARHEQWMATYGKVYVDAAEKERRFKIFKNNVEYI 69

Query: 74  EKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPT 133
           E  N  GN+ YKL  N+F+D TNE+F+    GY RP  +   +  + ++FKY+NVT VP 
Sbjct: 70  ESFNTAGNKPYKLSVNKFADQTNEKFKGARNGYRRPFQT---RPMKVTSFKYENVTAVPA 126

Query: 134 SIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST--DNH 191
           ++DWR+KGAVT IKDQGQCGSCWAFS VAA EGI Q+T GKL+ LSEQ+LVDC    ++ 
Sbjct: 127 TMDWRKKGAVTPIKDQGQCGSCWAFSTVAATEGINQLTTGKLVSLSEQELVDCDNQGEDQ 186

Query: 192 GCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQA 251
           GC GGLM+  FE+II+N G+ TEA+YPY+  +GTC+++K+ +  A I+ YE +P   E  
Sbjct: 187 GCEGGLMEDGFEFIIKNHGITTEANYPYQAADGTCNSKKQASHIAKITGYESVPANSEAE 246

Query: 252 LLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLI 311
           LL+ V+NQP+SV +DA G  F FY SGV    CG   DHGV  VG+G  E  +G KYWL+
Sbjct: 247 LLKVVANQPISVSIDAGGSDFQFYSSGVFTGKCGTELDHGVTAVGYG--ETSDGTKYWLV 304

Query: 312 KNSWGETWGESGYIRILRD----AGLCGIATAASYPVA 345
           KNSW  +WGE GYIR+ RD     GLCGIA  +SYP A
Sbjct: 305 KNSWXTSWGEEGYIRMQRDIDAEEGLCGIAMDSSYPTA 342


>gi|357477459|ref|XP_003609015.1| Cysteine proteinase [Medicago truncatula]
 gi|355510070|gb|AES91212.1| Cysteine proteinase [Medicago truncatula]
          Length = 345

 Score =  370 bits (950), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 188/352 (53%), Positives = 243/352 (69%), Gaps = 14/352 (3%)

Query: 1   MVLKFEKSFIIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKA 60
           M  K +  + I +  I  L + CA QV S RS+   S+ E+HEQWM+Q+ + YKD  E+ 
Sbjct: 1   MASKNQLYYSIALTFIFCLGL-CAIQVTS-RSLQVDSMYERHEQWMSQYSKVYKDPQERE 58

Query: 61  MRLNIFKQNLEYIEKANKEGN-RTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSR 119
            R  IF  N+ YIE  N + N + YKLG N+F+DLTNEEF A     N+    +    ++
Sbjct: 59  ERHKIFTANVNYIEVFNNDANNKLYKLGINQFADLTNEEFIA---SRNKFKGHMCSSIAK 115

Query: 120 PSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELS 179
            +TFKY+NV+ +P+++DWR+KGAVT +K+QGQCG CWAFSAVAA EGIT+++ GKL+ LS
Sbjct: 116 TTTFKYENVSAIPSTVDWRKKGAVTPVKNQGQCGCCWAFSAVAATEGITKLSTGKLVSLS 175

Query: 180 EQQLVDCSTD--NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAAT 237
           EQ+LVDC T   + GC GGLMD AF++II+N GL+TEA YPY+  +GTC+  K    AAT
Sbjct: 176 EQELVDCDTKGVDQGCEGGLMDDAFKFIIQNHGLSTEAAYPYQGVDGTCNANKASIHAAT 235

Query: 238 ISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGF 297
           I+ YED+P  +EQAL +AV+NQP+SV +DASG  F FYKSGV +  CG   DHGV  VG+
Sbjct: 236 ITGYEDVPANNEQALQKAVANQPISVAIDASGSDFQFYKSGVFSGSCGTELDHGVTAVGY 295

Query: 298 GTAEEENGAKYWLIKNSWGETWGESGYIRILR--DA--GLCGIATAASYPVA 345
           G   +  G KYWL+KNSWG  WGE GYIR+ R  DA  GLCGIA  ASYP A
Sbjct: 296 GVGND--GTKYWLVKNSWGTDWGEEGYIRMQRGVDAAEGLCGIAMQASYPTA 345


>gi|356577813|ref|XP_003557017.1| PREDICTED: uncharacterized protein LOC100801364 [Glycine max]
          Length = 890

 Score =  369 bits (947), Expect = 1e-99,   Method: Compositional matrix adjust.
 Identities = 179/335 (53%), Positives = 230/335 (68%), Gaps = 11/335 (3%)

Query: 17  IILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKA 76
           ++L +   +  V+ RS+ + S+ E+HEQWM ++G+ YKD  E+  R  IFK+N+ YIE  
Sbjct: 561 MLLCMAFLAFQVTCRSLQDASMYERHEQWMTRYGKVYKDPQEREKRFRIFKENVNYIEAF 620

Query: 77  NKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSID 136
           N   N+ YKL  N+F+DLTNEEF A     NR    +     R +TFKY+NVT VP+++D
Sbjct: 621 NNAANKRYKLAINQFADLTNEEFIA---PRNRFKGHMCSSIIRTTTFKYENVTAVPSTVD 677

Query: 137 WREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD--NHGCS 194
           WR+KGAVT IKDQGQCG CWAFSAVAA EGI  +T GKLI LSEQ+LVDC T   + GC 
Sbjct: 678 WRQKGAVTPIKDQGQCGCCWAFSAVAATEGIHALTSGKLISLSEQELVDCDTKGVDQGCE 737

Query: 195 GGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQ 254
           GGLMD AF+++I+N GL TEA+YPY+  +G C+  +      TI+ YED+P  +E+AL +
Sbjct: 738 GGLMDDAFKFVIQNHGLNTEANYPYKGVDGKCNANEAANDVVTITGYEDVPANNEKALQK 797

Query: 255 AVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNS 314
           AV+NQPVSV +DASG  F FYKSGV    CG   DHGV  VG+G + +  G +YWL+KNS
Sbjct: 798 AVANQPVSVAIDASGSDFQFYKSGVFTGSCGTELDHGVTAVGYGVSND--GTEYWLVKNS 855

Query: 315 WGETWGESGYIRILR----DAGLCGIATAASYPVA 345
           WG  WGE GYIR+ R    + GLCGIA  ASYP A
Sbjct: 856 WGTEWGEEGYIRMQRGVDSEEGLCGIAMQASYPTA 890


>gi|37780047|gb|AAP32196.1| cysteine protease 8 [Trifolium repens]
          Length = 343

 Score =  368 bits (945), Expect = 2e-99,   Method: Compositional matrix adjust.
 Identities = 173/324 (53%), Positives = 227/324 (70%), Gaps = 11/324 (3%)

Query: 28  VSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLG 87
           V+ R++ + S+ E+H QWM+Q+G+ YKD  E+  R  IF +N+ Y+E +N +  ++YKLG
Sbjct: 25  VTSRTLQDDSMYERHGQWMSQYGKIYKDHQERETRFKIFTENVNYVEASNADDTKSYKLG 84

Query: 88  TNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIK 147
            N+F+DLTNEEF A     N+    +    +R +TFKY+NV+ +P+++DWR+KGAVT +K
Sbjct: 85  INQFADLTNEEFVA---SRNKFKGHMCSSITRTTTFKYENVSAIPSTVDWRKKGAVTPVK 141

Query: 148 DQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD--NHGCSGGLMDKAFEYI 205
           +QGQCG CWAFSAVAA EGI +++ GKLI LSEQ+LVDC T   + GC GGLMD AF++I
Sbjct: 142 NQGQCGCCWAFSAVAATEGIHKLSTGKLISLSEQELVDCDTKGVDQGCEGGLMDDAFKFI 201

Query: 206 IENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCV 265
           I+N GL+TEA YPY   +GTC+  K    A TI+ YED+P   EQAL +AV+NQP+SV +
Sbjct: 202 IQNHGLSTEAQYPYEGVDGTCNANKASVQAVTITGYEDVPANSEQALQKAVANQPISVAI 261

Query: 266 DASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYI 325
           DASG  F FYKSGV    CG   DHGV  VG+G + +  G KYWL+KNSWG  WGE GYI
Sbjct: 262 DASGSDFQFYKSGVFTGSCGTELDHGVTAVGYGVSND--GTKYWLVKNSWGTDWGEEGYI 319

Query: 326 RILRDA----GLCGIATAASYPVA 345
            + R      GLCGIA  ASYP A
Sbjct: 320 MMQRGVEAAEGLCGIAMQASYPTA 343


>gi|356539398|ref|XP_003538185.1| PREDICTED: vignain-like [Glycine max]
          Length = 343

 Score =  368 bits (945), Expect = 2e-99,   Method: Compositional matrix adjust.
 Identities = 180/349 (51%), Positives = 234/349 (67%), Gaps = 12/349 (3%)

Query: 3   LKFEKSFIIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMR 62
           + F+K       + + LV    +   + R++ +  + E+HEQWMA HG+ Y    EK  +
Sbjct: 1   MAFKKVLFQYFTLALCLVFAFCAFEGNARTLEDAPMRERHEQWMAIHGKVYTHSYEKEQK 60

Query: 63  LNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPST 122
              FK+N++ IE  N  GN+ YKLG N F+DLTNEEF+A+    NR    V  + +R  T
Sbjct: 61  YQTFKENVQRIEAFNHAGNKPYKLGINHFADLTNEEFKAI----NRFKGHVCSKITRTPT 116

Query: 123 FKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQ 182
           F+Y+N+T VP ++DWR++GAVT IKDQGQCG CWAFSAVAA EGIT+++ GKLI LSEQ+
Sbjct: 117 FRYENMTAVPATLDWRQEGAVTPIKDQGQCGCCWAFSAVAATEGITKLSTGKLISLSEQE 176

Query: 183 LVDCSTD--NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISK 240
           LVDC T   + GC GGLMD AF++I++NKGLA EA YPY   +GTC+ + E   A +I  
Sbjct: 177 LVDCDTKGVDQGCEGGLMDDAFKFILQNKGLAAEAIYPYEGVDGTCNAKAEGNHATSIKG 236

Query: 241 YEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTA 300
           YED+P   E ALL+AV+NQPVSV ++ASG  F FY  GV    CG N DHGV  VG+G +
Sbjct: 237 YEDVPANSESALLKAVANQPVSVAIEASGFEFQFYSGGVFTGSCGTNLDHGVTAVGYGVS 296

Query: 301 EEENGAKYWLIKNSWGETWGESGYIRILRDA----GLCGIATAASYPVA 345
           ++  G KYWL+KNSWG  WG+ GYIR+ RD     GLCGIA  ASYP A
Sbjct: 297 DD--GTKYWLVKNSWGVKWGDKGYIRMQRDVAAKEGLCGIAMLASYPNA 343


>gi|84181681|gb|AAW78661.2| senescence-specific cysteine protease [Nicotiana tabacum]
          Length = 349

 Score =  368 bits (944), Expect = 3e-99,   Method: Compositional matrix adjust.
 Identities = 178/346 (51%), Positives = 235/346 (67%), Gaps = 13/346 (3%)

Query: 11  IPMFVIIILVITC----ASQVVSGRSM-HEPSIVEKHEQWMAQHGRTYKDELEKAMRLNI 65
           +  ++ + L   C    +SQV   R + +E ++  +H+QW+  H + YKD  EK +R  I
Sbjct: 6   LSQYLCLALFFICLGLWSSQVALSRPINYEATMRARHDQWIVHHEKVYKDLNEKEVRFQI 65

Query: 66  FKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKY 125
           FK+N+E IE  N   ++ YKLG N+FSDLTNEEFR L+TGY R  P V   S   + F+Y
Sbjct: 66  FKENVERIEAFNAGEDKGYKLGFNKFSDLTNEEFRVLHTGYKRSHPKVMTSSKGKTHFRY 125

Query: 126 QNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVD 185
            NVTD+P ++DWR+KGAVT IKDQ +CG CWAFSAVAA+EG+ Q+  G+LI LSEQ+LVD
Sbjct: 126 TNVTDIPPTMDWRKKGAVTPIKDQKECGCCWAFSAVAAMEGLHQLKTGELIPLSEQELVD 185

Query: 186 CST--DNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYED 243
           C    ++ GCSGGL+D AF++I++NKGL TE +YPY+ E+G C+ +K    AA I+ YED
Sbjct: 186 CDVEGEDEGCSGGLLDTAFDFILKNKGLTTEVNYPYKGEDGVCNKKKSALSAAKITGYED 245

Query: 244 LPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEE 303
           +P   E+ALLQAV+NQPVSV +D S   F FY SGV +  C    +H V  VG+G   + 
Sbjct: 246 VPANSEKALLQAVANQPVSVAIDGSSFDFQFYSSGVFSGSCSTWLNHAVTAVGYGATTD- 304

Query: 304 NGAKYWLIKNSWGETWGESGYIRILRDA----GLCGIATAASYPVA 345
            G KYW+IKNSWG  WG+SGY+RI RD     GLCG+A  ASYP A
Sbjct: 305 -GTKYWIIKNSWGSKWGDSGYMRIKRDVHEKEGLCGLAMDASYPTA 349


>gi|356545063|ref|XP_003540965.1| PREDICTED: thiol protease SEN102-like [Glycine max]
          Length = 361

 Score =  367 bits (943), Expect = 3e-99,   Method: Compositional matrix adjust.
 Identities = 180/345 (52%), Positives = 234/345 (67%), Gaps = 11/345 (3%)

Query: 7   KSFIIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIF 66
           K+    + + ++L +   +  V+ RS+ + S+ E+HEQWM ++G+ YKD  E+  R  IF
Sbjct: 22  KNHFCHISLAMLLCMAFLAFQVTCRSLQDASMYERHEQWMTRYGKVYKDPQEREKRFRIF 81

Query: 67  KQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQ 126
           K+N+ YIE  N   N+ YKL  N+F+DLTNEEF A     NR    +     R +TFKY+
Sbjct: 82  KENVNYIEAFNNAANKRYKLAINQFADLTNEEFIA---PRNRFKGHMCSSIIRTTTFKYE 138

Query: 127 NVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDC 186
           NVT VP+++DWR+KGAVT IKDQGQCG CWAFSAVAA EGI  +T GKLI LSEQ+LVDC
Sbjct: 139 NVTAVPSTVDWRQKGAVTPIKDQGQCGCCWAFSAVAATEGIHALTSGKLISLSEQELVDC 198

Query: 187 STD--NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDL 244
            T   + GC GGLMD AF+++I+N GL TEA+YPY+  +G C+  +      TI+ YED+
Sbjct: 199 DTKGVDQGCEGGLMDDAFKFVIQNHGLNTEANYPYKGVDGKCNANEAANDVVTITGYEDV 258

Query: 245 PKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEEN 304
           P  +E+AL +AV+NQPVSV +DASG  F FYKSGV    CG   DHGV  VG+G + +  
Sbjct: 259 PANNEKALQKAVANQPVSVAIDASGSDFQFYKSGVFTGSCGTELDHGVTAVGYGVSND-- 316

Query: 305 GAKYWLIKNSWGETWGESGYIRILR----DAGLCGIATAASYPVA 345
           G +YWL+KNSWG  WGE GYIR+ R    + GLCGIA  ASYP A
Sbjct: 317 GTEYWLVKNSWGTEWGEEGYIRMQRGVDSEEGLCGIAMQASYPTA 361


>gi|356515086|ref|XP_003526232.1| PREDICTED: vignain-like [Glycine max]
          Length = 343

 Score =  367 bits (941), Expect = 5e-99,   Method: Compositional matrix adjust.
 Identities = 178/337 (52%), Positives = 236/337 (70%), Gaps = 11/337 (3%)

Query: 15  VIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIE 74
           + ++L +T  +  V+ R++ + S+ E+HEQWM ++G+ YKD  E+  R  +FK+N+ YIE
Sbjct: 12  LAMLLCMTFLAFQVTCRTLQDASMYERHEQWMTRYGKVYKDPQEREKRFRVFKENVNYIE 71

Query: 75  KANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTS 134
             N   N++YKLG N+F+DLTN+EF A   G+   + S      R +TFK++NVT  P++
Sbjct: 72  AFNNAANKSYKLGINQFADLTNKEFIAPRNGFKGHMCS---SIIRTTTFKFENVTATPST 128

Query: 135 IDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD--NHG 192
           +DWR+KGAVT IKDQGQCG CWAFSAVAA EGI  ++ GKLI LSEQ+LVDC T   + G
Sbjct: 129 VDWRQKGAVTPIKDQGQCGCCWAFSAVAATEGIHALSAGKLISLSEQELVDCDTKGVDQG 188

Query: 193 CSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQAL 252
           C GGLMD AF++II+N GL TEA+YPY+  +G C+  +    AATI+ YED+P  +E AL
Sbjct: 189 CEGGLMDDAFKFIIQNHGLNTEANYPYKGVDGKCNANEAAKNAATITGYEDVPANNEMAL 248

Query: 253 LQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIK 312
            +AV+NQPVSV +DASG  F FYKSGV    CG   DHGV  VG+G +++  G +YWL+K
Sbjct: 249 QKAVANQPVSVAIDASGSDFQFYKSGVFTGSCGTELDHGVTAVGYGVSDD--GTEYWLVK 306

Query: 313 NSWGETWGESGYIRILR----DAGLCGIATAASYPVA 345
           NSWG  WGE GYIR+ R    + GLCGIA  ASYP A
Sbjct: 307 NSWGTEWGEEGYIRMQRGVDSEEGLCGIAMQASYPTA 343


>gi|357167190|ref|XP_003581045.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like
           [Brachypodium distachyon]
          Length = 415

 Score =  367 bits (941), Expect = 6e-99,   Method: Compositional matrix adjust.
 Identities = 179/341 (52%), Positives = 232/341 (68%), Gaps = 16/341 (4%)

Query: 14  FVIIILVITCASQVVSGRSMHEP-SIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEY 72
           F+I IL  TCA   ++ R + +  S+V +HEQWMA++GR Y D  EKA RL +FK N+ +
Sbjct: 82  FLIAILACTCAVSALAARDLTDDLSMVARHEQWMAKYGRVYNDVAEKAQRLEVFKANVAF 141

Query: 73  IEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT--D 130
           IE  N  GN  + L  N+F+D+T +EFRA +TGY +PVP+      R + FKY NV+   
Sbjct: 142 IELVNA-GNDKFSLEANQFADMTVDEFRAAHTGY-KPVPA---NKGRTTQFKYANVSLDA 196

Query: 131 VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD- 189
           +P S+DWR KGAVT IKDQGQCG CWAFS VA+VEGI +++ GKLI LSEQ+LVDC  D 
Sbjct: 197 LPASMDWRAKGAVTPIKDQGQCGCCWAFSTVASVEGIVKLSTGKLISLSEQELVDCDVDG 256

Query: 190 -NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGD 248
            + GC GGLMD AFE+II+N GL TE +YPY   + +C++ KE    A+I  YED+P  D
Sbjct: 257 MDQGCEGGLMDNAFEFIIDNGGLTTEGNYPYTGTDDSCNSNKESNDVASIKGYEDVPSND 316

Query: 249 EQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKY 308
           E +LL+AV+ QPVS+ VD     F FYK GVL+  CG   DHG+A VG+G   +  G K+
Sbjct: 317 ETSLLKAVAAQPVSIAVDGGDNLFRFYKGGVLSGACGTELDHGIAAVGYGITSD--GTKF 374

Query: 309 WLIKNSWGETWGESGYIRILRDA----GLCGIATAASYPVA 345
           WL+KNSWG +WGE G+IR+ RD     GLCG+A   SYP A
Sbjct: 375 WLMKNSWGTSWGEKGFIRMERDIADEEGLCGLAMQPSYPTA 415


>gi|312281697|dbj|BAJ33714.1| unnamed protein product [Thellungiella halophila]
          Length = 347

 Score =  366 bits (940), Expect = 8e-99,   Method: Compositional matrix adjust.
 Identities = 172/343 (50%), Positives = 239/343 (69%), Gaps = 11/343 (3%)

Query: 11  IPMFVIIILVITCASQVVSGRSMHEPSIV--EKHEQWMAQHGRTYKDELEKAMRLNIFKQ 68
           I +F+I+ L+ +    +   R + +  ++  ++H++WMA+HGR Y D  EK  R  +FK+
Sbjct: 6   IQIFLIVSLISSFCLSITLSRPLDDNELIMQKRHDEWMAKHGRVYADMKEKNNRYVVFKR 65

Query: 69  NLEYIEKANK-EGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQN 127
           N+E IE+ N     RT+KL  N+F+DLTN+EFR++YTGY       S+  ++ S+F+YQN
Sbjct: 66  NVERIERLNNVPAGRTFKLAVNQFADLTNDEFRSMYTGYKGGSVLSSQSGTKTSSFRYQN 125

Query: 128 VTD--VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVD 185
           V+   +P S+DWR+KGAVT IK+QG CG CWAFSAVAA+EG T+I +GKLI LSEQQLVD
Sbjct: 126 VSSGALPVSVDWRKKGAVTPIKNQGTCGCCWAFSAVAAIEGATKIKKGKLISLSEQQLVD 185

Query: 186 CSTDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLP 245
           C T++ GCSGGLMD AFE+I+   GL TE++YPY+ ++ TC  +  K  A +I+ YED+P
Sbjct: 186 CDTNDFGCSGGLMDTAFEHIMATGGLTTESNYPYKGKDATCKIKNTKPTATSITGYEDVP 245

Query: 246 KGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENG 305
             DE+AL++AV++QPVS+ ++  G  F FY SGV   +C    DH V  VG+G  +  NG
Sbjct: 246 VNDEKALMKAVAHQPVSIGIEGGGFDFQFYGSGVFTGECTTYLDHAVTAVGYG--QSSNG 303

Query: 306 AKYWLIKNSWGETWGESGYIRILRDA----GLCGIATAASYPV 344
           +KYW+IKNSWG  WGESGY+RI +D     GLCG+A  ASYP 
Sbjct: 304 SKYWIIKNSWGTKWGESGYMRIKKDVKDKKGLCGLAMKASYPT 346


>gi|18408828|ref|NP_566920.1| putative cysteine proteinase [Arabidopsis thaliana]
 gi|12324451|gb|AAG52191.1|AC012329_18 putative cysteine proteinase; 15366-14136 [Arabidopsis thaliana]
 gi|6723404|emb|CAB66413.1| cysteine protease-like protein [Arabidopsis thaliana]
 gi|332645009|gb|AEE78530.1| putative cysteine proteinase [Arabidopsis thaliana]
          Length = 341

 Score =  365 bits (938), Expect = 1e-98,   Method: Compositional matrix adjust.
 Identities = 179/343 (52%), Positives = 239/343 (69%), Gaps = 13/343 (3%)

Query: 11  IPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNL 70
           I  F++ IL+ +  S V S   + E S VEKHEQWM++  R Y D+ EK  R  IF  NL
Sbjct: 4   IVFFLLAILLSSRTSGVTSRGGLFEASAVEKHEQWMSRFNRVYSDDSEKTSRFEIFTNNL 63

Query: 71  EYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPS----TFKYQ 126
           +++E  N   N+TY L  NEFSDLT+EEF+A YTG   P   ++R S+  S    +F+Y+
Sbjct: 64  KFVESINMNTNKTYTLDVNEFSDLTDEEFKARYTGLVVP-EGMTRISTTDSHETVSFRYE 122

Query: 127 NVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDC 186
           NV +   S+DW ++GAVT +K Q QCG CWAFSAVAAVEG+T+I  G+L+ LSEQQL+DC
Sbjct: 123 NVGETGESMDWIQEGAVTSVKHQQQCGCCWAFSAVAAVEGMTKIANGELVSLSEQQLLDC 182

Query: 187 STDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPK 246
           ST+N+GC GG+M KAF+YI EN+G+ TE +YPY+  + TC++      AATIS YE +P+
Sbjct: 183 STENNGCGGGIMWKAFDYIKENQGITTEDNYPYQGAQQTCESN--HLAAATISGYETVPQ 240

Query: 247 GDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGA 306
            DE+ALL+AVS QPVSV ++ SG  F  Y  G+ N +CG    H V +VG+G +EE  G 
Sbjct: 241 NDEEALLKAVSQQPVSVAIEGSGYEFIHYSGGIFNGECGTQLTHAVTIVGYGVSEE--GI 298

Query: 307 KYWLIKNSWGETWGESGYIRILRDA----GLCGIATAASYPVA 345
           KYWL+KNSWGE+WGE+GY+RI+RD     G+CG+A+ A YPVA
Sbjct: 299 KYWLLKNSWGESWGENGYMRIMRDVDSPQGMCGLASLAYYPVA 341


>gi|37780051|gb|AAP32198.1| cysteine protease 12 [Trifolium repens]
          Length = 343

 Score =  365 bits (937), Expect = 2e-98,   Method: Compositional matrix adjust.
 Identities = 182/340 (53%), Positives = 236/340 (69%), Gaps = 18/340 (5%)

Query: 13  MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEY 72
            F + +L I      V+ R++ + SI E+HEQWM  +G+ YK+  E+  RL IF +NL+Y
Sbjct: 15  FFCLGLLAIQ-----VTSRTLQDDSIFERHEQWMTHYGKVYKNPQEREKRLRIFTENLKY 69

Query: 73  IEKANKEGNRT-YKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDV 131
           IE +N  GN+  YKLG N+F+DLTNEEF A     N+    +     R +TFKY+N T V
Sbjct: 70  IEASNNAGNKKPYKLGINQFADLTNEEFIA---SRNKFKGHMCSSIIRTTTFKYEN-TSV 125

Query: 132 PTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD-- 189
           P+++DWR+KGAVT +K+QGQCG CWAFSA+AA EGI +I+ GKL+ LSEQ+LVDC T+  
Sbjct: 126 PSTVDWRKKGAVTPVKNQGQCGCCWAFSAIAATEGIHKISTGKLVSLSEQELVDCDTNGV 185

Query: 190 NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDE 249
           + GC GGLMD AF++II+N G++TEA YPY+  +GTC   +    AATI+ YED+P  +E
Sbjct: 186 DQGCEGGLMDDAFKFIIQNNGISTEAGYPYQGVDGTCKANEASTSAATITGYEDVPANNE 245

Query: 250 QALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYW 309
            AL +AV+NQP+SV +DASG  F FYKSGV    CG   DHGV  VG+G + +  G KYW
Sbjct: 246 NALQKAVANQPISVAIDASGSDFQFYKSGVFTGSCGTELDHGVTAVGYGISND--GTKYW 303

Query: 310 LIKNSWGETWGESGYIRILR--DA--GLCGIATAASYPVA 345
           L+KNSWG  WGE GYIR+ R  DA  GLCGIA  ASYP A
Sbjct: 304 LVKNSWGTDWGEEGYIRMQRSIDAAEGLCGIAMQASYPTA 343


>gi|37780045|gb|AAP32195.1| cysteine protease 5 [Trifolium repens]
          Length = 343

 Score =  365 bits (937), Expect = 2e-98,   Method: Compositional matrix adjust.
 Identities = 182/340 (53%), Positives = 236/340 (69%), Gaps = 18/340 (5%)

Query: 13  MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEY 72
            F + +L I      V+ R++ + SI E+HEQWM  +G+ YK+  E+  RL IF +NL+Y
Sbjct: 15  FFCLGLLAIQ-----VTSRTLQDDSIFERHEQWMTHYGKVYKNPQEREKRLRIFTENLKY 69

Query: 73  IEKANKEGN-RTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDV 131
           IE +N  GN + YKLG N+F+DLTNEEF A     N+    +     R +TFKY+N T V
Sbjct: 70  IEASNNAGNNKPYKLGINQFADLTNEEFIA---SRNKFKGHMCSSIIRTTTFKYEN-TSV 125

Query: 132 PTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD-- 189
           P+++DWR+KGAVT +K+QGQCG CWAFSA+AA EGI +I+ GKL+ LSEQ+LVDC T+  
Sbjct: 126 PSTVDWRKKGAVTPVKNQGQCGCCWAFSAIAATEGIHKISTGKLVSLSEQELVDCDTNGV 185

Query: 190 NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDE 249
           + GC GGLMD AF++II+N G++TEA YPY+  +GTC   +    AATI+ YED+P  +E
Sbjct: 186 DQGCEGGLMDDAFKFIIQNNGISTEAGYPYQGVDGTCKANEASTSAATITGYEDVPANNE 245

Query: 250 QALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYW 309
            AL +AV+NQP+SV +DASG  F FYKSGV    CG   DHGV  VG+G + +  G KYW
Sbjct: 246 NALQKAVANQPISVAIDASGSDFQFYKSGVFTGSCGTELDHGVTAVGYGISND--GTKYW 303

Query: 310 LIKNSWGETWGESGYIRILR--DA--GLCGIATAASYPVA 345
           L+KNSWG  WGE GYIR+ R  DA  GLCGIA  ASYP A
Sbjct: 304 LVKNSWGTDWGEEGYIRMQRSIDAAEGLCGIAMQASYPTA 343


>gi|356543122|ref|XP_003540012.1| PREDICTED: LOW QUALITY PROTEIN: vignain-like [Glycine max]
          Length = 342

 Score =  364 bits (935), Expect = 3e-98,   Method: Compositional matrix adjust.
 Identities = 178/327 (54%), Positives = 225/327 (68%), Gaps = 8/327 (2%)

Query: 23  CASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNR 82
           C SQV S R +H+ S+ E+HEQWM ++G+ YKD  E   R  IF+ N+E+IE  N  GN+
Sbjct: 20  CTSQVKS-RKLHDASMYERHEQWMEKYGKVYKDSAEXEKRFLIFENNVEFIESFNAAGNK 78

Query: 83  TYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGA 142
            YKL  N  +D TNEEF A + GY        R +++ + FKY+NVTD+P ++DWR+KG 
Sbjct: 79  PYKLSINHLADQTNEEFMASHKGYKGSHWQGLRITTQ-TPFKYENVTDIPWAVDWRQKGD 137

Query: 143 VTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTDNHGCSGGLMDKAF 202
            T IKDQGQCG CWAFSAVAA EGI QIT G L+ LSEQ+LVDC + +HGC GGLM+  F
Sbjct: 138 ATSIKDQGQCGICWAFSAVAATEGIYQITTGNLVSLSEQELVDCDSVDHGCDGGLMEHGF 197

Query: 203 EYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVS 262
           E+II+N G+++EA+YPY    GTCD  KE +  A I  YE +P   E+ L +AV+NQPVS
Sbjct: 198 EFIIKNGGISSEANYPYTAVNGTCDTNKEASPGAQIKGYETVPVNCEEELQKAVANQPVS 257

Query: 263 VCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGES 322
           V +DA G AF FY SGV    CG   DHGV  VG+G+ ++  G +YW++KNSWG  WGE 
Sbjct: 258 VSIDAGGSAFQFYSSGVFTGQCGTQLDHGVTAVGYGSTDD--GIQYWIVKNSWGTQWGEE 315

Query: 323 GYIRILR--DA--GLCGIATAASYPVA 345
           GYIR+LR  DA  GLCGIA  ASYP A
Sbjct: 316 GYIRMLRGIDAQEGLCGIAMDASYPTA 342


>gi|18401420|ref|NP_565649.1| cysteine proteinase-like protein [Arabidopsis thaliana]
 gi|4314384|gb|AAD15594.1| cysteine proteinase [Arabidopsis thaliana]
 gi|17381154|gb|AAL36389.1| putative cysteine proteinase [Arabidopsis thaliana]
 gi|20465849|gb|AAM20029.1| putative cysteine proteinase [Arabidopsis thaliana]
 gi|330252901|gb|AEC07995.1| cysteine proteinase-like protein [Arabidopsis thaliana]
          Length = 348

 Score =  363 bits (931), Expect = 7e-98,   Method: Compositional matrix adjust.
 Identities = 184/345 (53%), Positives = 240/345 (69%), Gaps = 14/345 (4%)

Query: 13  MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEY 72
           +F++ I +    S   S  S+ E S +EKHEQWMA+  R Y DE EK  R NIFK+NLE+
Sbjct: 6   IFILTIFLSYRTSLATSRGSLFEASAIEKHEQWMARFNRVYSDETEKRNRFNIFKKNLEF 65

Query: 73  IEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRP--VPSVSRQSSRPST--FKYQNV 128
           ++  N     TYK+  NEFSDLT+EEFRA +TG   P  +  +S  SS  +T  F+Y NV
Sbjct: 66  VQNFNMNNKITYKVDINEFSDLTDEEFRATHTGLVVPEAITRISTLSSGKNTVPFRYGNV 125

Query: 129 TDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST 188
           +D   S+DWR++GAVT +K QG+CG CWAFSAVAAVEGIT+IT+G+L+ LSEQQL+DC  
Sbjct: 126 SDNGESMDWRQEGAVTPVKYQGRCGGCWAFSAVAAVEGITKITKGELVSLSEQQLLDCDR 185

Query: 189 D-NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAV---AATISKYEDL 244
           D N GC GG+M KAFEYII+N+G+ TE +YPY+  + TC +    +    AATIS YE +
Sbjct: 186 DYNQGCRGGIMSKAFEYIIKNQGITTEDNYPYQESQQTCSSSTTLSSSFRAATISGYETV 245

Query: 245 PKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEEN 304
           P  +E+ALLQAVS QPVSV ++ +G AF  Y  GV N +CG +  H V +VG+G +EE  
Sbjct: 246 PMNNEEALLQAVSQQPVSVGIEGTGAAFRHYSGGVFNGECGTDLHHAVTIVGYGMSEE-- 303

Query: 305 GAKYWLIKNSWGETWGESGYIRILRDA----GLCGIATAASYPVA 345
           G KYW++KNSWGETWGE+GY+RI RD     G+CG+A  A YP+A
Sbjct: 304 GTKYWVVKNSWGETWGENGYMRIKRDVDAPQGMCGLAILAFYPLA 348


>gi|297826061|ref|XP_002880913.1| hypothetical protein ARALYDRAFT_481640 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297326752|gb|EFH57172.1| hypothetical protein ARALYDRAFT_481640 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 347

 Score =  362 bits (930), Expect = 1e-97,   Method: Compositional matrix adjust.
 Identities = 182/344 (52%), Positives = 238/344 (69%), Gaps = 13/344 (3%)

Query: 13  MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEY 72
           +F++ I +    S   S   + E S +EKHEQWMA+  R Y DE EK  R NIFK+NLE+
Sbjct: 6   IFILTIFLSYRTSLATSRGGLFEASPIEKHEQWMARFNRVYSDESEKRNRFNIFKKNLEF 65

Query: 73  IEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRP--VPSVSRQSSRPST-FKYQNVT 129
           ++  N   N TYKL  NEFSDLT+EEFRA +TG   P  +  +S  SS  +  F+Y NV+
Sbjct: 66  VQSFNMNKNITYKLDVNEFSDLTDEEFRATHTGLVVPEEITGISTLSSDKTVPFRYGNVS 125

Query: 130 DVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD 189
           D   S+DWR++GAVT +K QG+CG CWAFSAVAAVEGIT+IT+G+L+ LSEQQL+DC TD
Sbjct: 126 DTGESMDWRQEGAVTPVKYQGRCGGCWAFSAVAAVEGITKITKGELVSLSEQQLLDCDTD 185

Query: 190 -NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAV---AATISKYEDLP 245
            N GC GG+M KAFEYII+N+G+ TE +YPY+  + TC +    +    AATIS YE +P
Sbjct: 186 YNQGCHGGIMSKAFEYIIKNQGITTEDNYPYQESQQTCSSSTTLSSSFRAATISGYETVP 245

Query: 246 KGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENG 305
             +E+ALLQAVS QPVSV ++ +G  F  Y  G+ N +CG +  H V +VG+G +EE  G
Sbjct: 246 MNNEEALLQAVSQQPVSVGIEGTGAGFRHYSGGIFNGECGTDLHHAVTIVGYGMSEE--G 303

Query: 306 AKYWLIKNSWGETWGESGYIRILRDA----GLCGIATAASYPVA 345
            KYW++KNSWGETWGE G++RI RD     G+CG+A  A YP+A
Sbjct: 304 TKYWVVKNSWGETWGEDGFMRIKRDVDAPQGMCGLAMLAFYPLA 347


>gi|357474573|ref|XP_003607571.1| Cysteine proteinase EP-B [Medicago truncatula]
 gi|34329348|gb|AAQ63885.1| putative cysteine proteinase [Medicago truncatula]
 gi|355508626|gb|AES89768.1| Cysteine proteinase EP-B [Medicago truncatula]
          Length = 345

 Score =  362 bits (930), Expect = 1e-97,   Method: Compositional matrix adjust.
 Identities = 181/326 (55%), Positives = 230/326 (70%), Gaps = 14/326 (4%)

Query: 28  VSGRSMHEPSIV-EKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGN-RTYK 85
           V+ R++ + SI+ EKHEQWM  +G+ YKD  E+  RL IFK+N+ YIE +N  GN + YK
Sbjct: 26  VTSRTLQDDSIIYEKHEQWMVHYGKVYKDLQERENRLKIFKENVNYIEASNNAGNNKLYK 85

Query: 86  LGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTH 145
           LG N+F+DLTNEEF A     N+    +    ++ STFKY+N + VP+++DWR+KGAVT 
Sbjct: 86  LGINQFADLTNEEFIA---SRNKFKGHMCSSITKTSTFKYENAS-VPSTVDWRKKGAVTP 141

Query: 146 IKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD--NHGCSGGLMDKAFE 203
           +K+QGQCG CWAFSAVAA EGI +++ GKL+ LSEQ+LVDC T   + GC GGLMD AF+
Sbjct: 142 VKNQGQCGCCWAFSAVAATEGIHKLSTGKLVSLSEQELVDCDTKGVDQGCEGGLMDDAFK 201

Query: 204 YIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSV 263
           +II+N GL TEA YPY+  +GTC   K    A TI+ YED+P  +EQAL +AV+NQP+SV
Sbjct: 202 FIIQNHGLNTEAQYPYQGVDGTCSANKASIHAVTITGYEDVPANNEQALQKAVANQPISV 261

Query: 264 CVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESG 323
            +DASG  F FYKSGV    CG   DHGV  VG+G   +  G KYWL+KNSWG  WGE G
Sbjct: 262 AIDASGSDFQFYKSGVFTGSCGTELDHGVTAVGYGVGND--GTKYWLVKNSWGTDWGEEG 319

Query: 324 YIRILR--DA--GLCGIATAASYPVA 345
           YI++ R  DA  GLCGIA  ASYP A
Sbjct: 320 YIKMQRGVDAAEGLCGIAMEASYPTA 345


>gi|357474579|ref|XP_003607574.1| Cysteine protease [Medicago truncatula]
 gi|355508629|gb|AES89771.1| Cysteine protease [Medicago truncatula]
          Length = 345

 Score =  362 bits (930), Expect = 1e-97,   Method: Compositional matrix adjust.
 Identities = 178/326 (54%), Positives = 229/326 (70%), Gaps = 14/326 (4%)

Query: 28  VSGRSMHEPSIV-EKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGN-RTYK 85
           V+ R++ + SI+ EKHEQWM  +G+ YKD  E+  RL IFK+N+ YIE +N  GN + YK
Sbjct: 26  VTSRTLQDDSIIYEKHEQWMVHYGKVYKDLQERENRLKIFKENVNYIEASNNAGNNKLYK 85

Query: 86  LGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTH 145
           LG N+F+D+TNEEF A     N+    +    ++ STFKY+N + VP+++DWR+KGAVT 
Sbjct: 86  LGINQFADITNEEFIA---SRNKFKGHMCSSITKTSTFKYENAS-VPSTVDWRKKGAVTP 141

Query: 146 IKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD--NHGCSGGLMDKAFE 203
           +K+QGQCG CWAFSAVAA EGI +++ GKL+ LSEQ+LVDC T   + GC GGLMD AF+
Sbjct: 142 VKNQGQCGCCWAFSAVAATEGIHKLSTGKLVSLSEQELVDCDTKGVDQGCEGGLMDDAFK 201

Query: 204 YIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSV 263
           +II+N GL TEA YPY+  +GTC   +    AATI+ YED+P  +E AL +AV+NQP+SV
Sbjct: 202 FIIQNHGLHTEAQYPYQGVDGTCSANETSTPAATIAGYEDVPANNENALQKAVANQPISV 261

Query: 264 CVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESG 323
            +DASG  F FYKSGV    CG   DHGV  VG+G + +  G KYWL+KNSWG  WGE G
Sbjct: 262 AIDASGSDFQFYKSGVFTGSCGTQLDHGVTAVGYGISND--GTKYWLVKNSWGNDWGEEG 319

Query: 324 YIRILRDA----GLCGIATAASYPVA 345
           YIR+ R      GLCGIA  ASYP A
Sbjct: 320 YIRMQRSVDAAQGLCGIAMMASYPTA 345


>gi|224093956|ref|XP_002310053.1| predicted protein [Populus trichocarpa]
 gi|224147016|ref|XP_002336386.1| predicted protein [Populus trichocarpa]
 gi|222834869|gb|EEE73318.1| predicted protein [Populus trichocarpa]
 gi|222852956|gb|EEE90503.1| predicted protein [Populus trichocarpa]
          Length = 340

 Score =  362 bits (929), Expect = 1e-97,   Method: Compositional matrix adjust.
 Identities = 173/349 (49%), Positives = 240/349 (68%), Gaps = 15/349 (4%)

Query: 3   LKFEKSFIIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMR 62
           ++F K F      ++ ++    S+  + R++ +  + E+HEQWM Q+GR YKD+ E+A R
Sbjct: 1   MRFTKQFQFVCLALLFILGAWPSKSTA-RTLLDAPMYERHEQWMTQYGRVYKDDNERATR 59

Query: 63  LNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPST 122
            +IFK+N+  I+  N +  ++YKLG N+F+DLTNEEF+A     NR    +    + P  
Sbjct: 60  YSIFKENVARIDAFNSQTGKSYKLGVNQFADLTNEEFKA---SRNRFKGHMCSPQAGP-- 114

Query: 123 FKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQ 182
           F+Y+NV+ VP+++DWR++GAVT +KDQGQCG CWAFSAVAA+EGI ++T GKLI LSEQ+
Sbjct: 115 FRYENVSAVPSTVDWRKEGAVTPVKDQGQCGCCWAFSAVAAMEGINKLTTGKLISLSEQE 174

Query: 183 LVDCST--DNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISK 240
           +VDC T  ++ GC+GGLMD AF++I +NKGL TEA+YPY+  +GTC+  K    AA I+ 
Sbjct: 175 VVDCDTKGEDQGCNGGLMDDAFKFIEQNKGLTTEANYPYKGTDGTCNTNKAAIHAAKITG 234

Query: 241 YEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTA 300
           +ED+P   E AL++AV+ QPVSV +DA G  F FY SG+    C    DHGV  VG+G +
Sbjct: 235 FEDVPANSEAALMKAVAKQPVSVAIDAGGSDFQFYSSGIFTGSCDTQLDHGVTAVGYGVS 294

Query: 301 EEENGAKYWLIKNSWGETWGESGYIRILRD----AGLCGIATAASYPVA 345
           +   G+KYWL+KNSWG  WGE GYIR+ +D     GLCGIA  ASYP A
Sbjct: 295 D---GSKYWLVKNSWGAQWGEEGYIRMQKDISAKEGLCGIAMQASYPTA 340


>gi|356545118|ref|XP_003540992.1| PREDICTED: thiol protease SEN102-like [Glycine max]
          Length = 337

 Score =  362 bits (928), Expect = 2e-97,   Method: Compositional matrix adjust.
 Identities = 183/352 (51%), Positives = 232/352 (65%), Gaps = 22/352 (6%)

Query: 1   MVLKFEKSFIIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKA 60
           M    +K + I +F+++ L I    Q++S R +HE S+ E+HEQWMA++G+ YKD  EK 
Sbjct: 1   MAFTSQKQYTIALFLLLALGI---PQMMS-RKLHETSMRERHEQWMAEYGKVYKDAAEKE 56

Query: 61  MRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRP 120
            R  IFK N+E+IE  N   N+ YKLG N  +DLT EEF+A   G  RP       S+ P
Sbjct: 57  KRFLIFKHNVEFIESFNAAANKPYKLGVNHLADLTVEEFKASRNGLKRPY----ELSTTP 112

Query: 121 STFKYQNVTDVPTSIDWREKGAVTHIKDQGQC-GSCWAFSAVAAVEGITQITRGKLIELS 179
             FKY+NVT +P +IDWR KGAVT IKDQGQC GSCWAFS VAA EGI QIT GKL+ LS
Sbjct: 113 --FKYENVTAIPAAIDWRTKGAVTSIKDQGQCAGSCWAFSTVAATEGIHQITTGKLVSLS 170

Query: 180 EQQLVDCSTD--NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAAT 237
           EQ+LVDC T   + GC GG M+  FE+II+N G+ +EA+YPY+  +G C+  K  +  A 
Sbjct: 171 EQELVDCDTKGVDQGCEGGYMEDGFEFIIKNGGITSEANYPYKAVDGKCN--KATSPVAQ 228

Query: 238 ISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGF 297
           I  YE +P   E+ L +AV+NQPVSV +DA+G  F FY SG+ N +CG   DHGV  VG+
Sbjct: 229 IKGYEKVPPNSEKTLQKAVANQPVSVSIDANGEGFMFYSSGIYNGECGTELDHGVTAVGY 288

Query: 298 GTAEEENGAKYWLIKNSWGETWGESGYIRILRDA----GLCGIATAASYPVA 345
           G A   NG  YWL+KNSWG  WGE GY+R+ R      GLCGIA  +SYP A
Sbjct: 289 GIA---NGTDYWLVKNSWGTQWGEKGYVRMQRGVAAKHGLCGIALDSSYPTA 337


>gi|297794671|ref|XP_002865220.1| senescence-associated gene 12 [Arabidopsis lyrata subsp. lyrata]
 gi|297311055|gb|EFH41479.1| senescence-associated gene 12 [Arabidopsis lyrata subsp. lyrata]
          Length = 346

 Score =  361 bits (926), Expect = 3e-97,   Method: Compositional matrix adjust.
 Identities = 174/343 (50%), Positives = 238/343 (69%), Gaps = 12/343 (3%)

Query: 11  IPMFVIIILVITCASQVVSGRSMHEPSIVEK-HEQWMAQHGRTYKDELEKAMRLNIFKQN 69
           + +F+ + +  +    +   R +    I++K H +WM +HGR Y D  EK+ R  +FK N
Sbjct: 6   MQIFLFVAIFSSFYFSISLSRPLDNELIMQKRHIEWMTKHGRVYADVKEKSNRYVVFKSN 65

Query: 70  LEYIEKANK-EGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQS-SRPSTFKYQN 127
           +E IE  N     RT+KL  N+F+DLTN+EFR++YTG+ + V S+S QS ++ ++F+YQN
Sbjct: 66  VERIEHLNNIPAGRTFKLAVNQFADLTNDEFRSMYTGF-KGVSSLSSQSQTKTTSFRYQN 124

Query: 128 VTD--VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVD 185
           V+   +P S+DWR KGAVT IK+QG CG CWAFSAVAA+EG TQI +GKLI LSEQQLVD
Sbjct: 125 VSSGALPISVDWRTKGAVTPIKNQGSCGCCWAFSAVAAIEGATQIKKGKLISLSEQQLVD 184

Query: 186 CSTDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLP 245
           C T++ GC GGLMD AFE+I+   GL TE++YPY+ E+ TC+++K    A +I+ YED+P
Sbjct: 185 CDTNDFGCEGGLMDTAFEHIMATGGLTTESNYPYKGEDATCNSKKTNPKATSITGYEDVP 244

Query: 246 KGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENG 305
             DEQAL++AV++QPVSV ++  G  F FY SGV   +C    DH V  +G+G  +  NG
Sbjct: 245 VNDEQALMKAVAHQPVSVGIEGGGFDFQFYSSGVFTGECTTYLDHAVTAIGYG--QSTNG 302

Query: 306 AKYWLIKNSWGETWGESGYIRILRD----AGLCGIATAASYPV 344
           +KYW+IKNSWG  WGESGY+RI +D     GLCG+A  ASYP 
Sbjct: 303 SKYWIIKNSWGTKWGESGYMRIQKDIKDKQGLCGLAMKASYPT 345


>gi|118627554|emb|CAL64936.1| putative cysteine protease 8 [Trifolium pratense]
          Length = 344

 Score =  361 bits (926), Expect = 4e-97,   Method: Compositional matrix adjust.
 Identities = 173/325 (53%), Positives = 226/325 (69%), Gaps = 12/325 (3%)

Query: 28  VSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANK-EGNRTYKL 86
           V+ R++ + S+ E+H QWM+Q+G+ YKD  E+  R  IFK+N+ YIE  N  +  ++YKL
Sbjct: 25  VTSRTLQDDSMYERHGQWMSQYGKIYKDHQERETRFKIFKENVNYIETFNNADDTKSYKL 84

Query: 87  GTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHI 146
           G N+F+DLTNEEF A     N+    +     R ++FKY+NV+ +P+++DWR+KGAVT +
Sbjct: 85  GINQFADLTNEEFIA---SRNKFKGHMCSSIMRTTSFKYENVSGIPSTVDWRKKGAVTPV 141

Query: 147 KDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD--NHGCSGGLMDKAFEY 204
           K+QGQCG CWAFSAVAA EGI +++ GKLI LSEQ+LVDC T   + GC GGLMD AF++
Sbjct: 142 KNQGQCGCCWAFSAVAATEGIHKLSTGKLISLSEQELVDCDTKGVDQGCEGGLMDDAFKF 201

Query: 205 IIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVC 264
           II+N GL+TEA YPY   +GTC+  K    A TI+ YED+P   EQAL +AV+NQP+SV 
Sbjct: 202 IIQNHGLSTEAQYPYEGVDGTCNANKASVQAVTITGYEDVPANSEQALQKAVANQPISVA 261

Query: 265 VDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGY 324
           +DASG  F FYKSGV    CG   DHGV  VG+G + +  G KYWL+KNSWG  WGE GY
Sbjct: 262 IDASGSDFQFYKSGVFTGACGTELDHGVTAVGYGVSND--GTKYWLVKNSWGTDWGEEGY 319

Query: 325 IRILRDA----GLCGIATAASYPVA 345
           I + R      G+CGIA  ASYP A
Sbjct: 320 IMMQRGIEAAEGICGIAMQASYPTA 344


>gi|357474527|ref|XP_003607548.1| Cysteine protease [Medicago truncatula]
 gi|358347211|ref|XP_003637653.1| Cysteine protease [Medicago truncatula]
 gi|355503588|gb|AES84791.1| Cysteine protease [Medicago truncatula]
 gi|355508603|gb|AES89745.1| Cysteine protease [Medicago truncatula]
          Length = 345

 Score =  360 bits (925), Expect = 4e-97,   Method: Compositional matrix adjust.
 Identities = 182/329 (55%), Positives = 228/329 (69%), Gaps = 13/329 (3%)

Query: 24  ASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGN-R 82
           A QV S     + +I EKHEQWM  +G+ YKD  E+  RL IFK+N+ YIE +N  GN +
Sbjct: 23  AIQVTSRTLQDDSNIYEKHEQWMVHYGKVYKDLQERENRLKIFKENVNYIEASNNAGNNK 82

Query: 83  TYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGA 142
            YKLG N+F+DLTNEEF A     N+    +    ++ STFKY+N + VP+++DWR+KGA
Sbjct: 83  LYKLGINQFADLTNEEFIA---SRNKFKGHMCSSITKTSTFKYENAS-VPSTVDWRKKGA 138

Query: 143 VTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD--NHGCSGGLMDK 200
           VT +K+QGQCG CWAFSAVAA EGI +++ GKL+ LSEQ+LVDC T   + GC GGLMD 
Sbjct: 139 VTPVKNQGQCGCCWAFSAVAATEGIHKLSTGKLVSLSEQELVDCDTKGVDQGCEGGLMDD 198

Query: 201 AFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQP 260
           AF++II+N GL TEA YPY+  +GTC   K    A TI+ YED+P  +EQAL +AV+NQP
Sbjct: 199 AFKFIIQNHGLNTEAQYPYQGVDGTCSANKASIHAVTITGYEDVPANNEQALQKAVANQP 258

Query: 261 VSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWG 320
           +SV +DASG  F FYKSGV    CG   DHGV  VG+G   +  G KYWL+KNSWG  WG
Sbjct: 259 ISVAIDASGSDFQFYKSGVFTGSCGTELDHGVTAVGYGVGND--GTKYWLVKNSWGTDWG 316

Query: 321 ESGYIRILR--DA--GLCGIATAASYPVA 345
           E GYI++ R  DA  GLCGIA  ASYP A
Sbjct: 317 EEGYIKMQRGVDAAEGLCGIAMEASYPTA 345


>gi|5823018|gb|AAD53011.1|AF089848_1 senescence-specific cysteine protease [Brassica napus]
          Length = 346

 Score =  360 bits (925), Expect = 4e-97,   Method: Compositional matrix adjust.
 Identities = 176/342 (51%), Positives = 235/342 (68%), Gaps = 10/342 (2%)

Query: 11  IPMFVIIILVIT-CASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQN 69
           I +F+I+ LV + C S  +S     E  + +KH++WMA+HGRTY D  EK  R  +FK+N
Sbjct: 6   IKIFLIVSLVSSFCFSTTLSRLLDDELIMQKKHDEWMAEHGRTYADMNEKNNRYVVFKRN 65

Query: 70  LEYIEKANK-EGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNV 128
           +E IE+ N     RT+KL  N+F+DLTN+EFR +YTGY       S+  ++ ++F+YQNV
Sbjct: 66  VERIERLNNVPAGRTFKLAVNQFADLTNDEFRFMYTGYKGDFVLFSQSQTKSTSFRYQNV 125

Query: 129 --TDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDC 186
               +P ++DWR+KGAVT IK+QG CG CWAFSAVAA+EG TQI +GKLI LSEQQLVDC
Sbjct: 126 FFGALPIAVDWRKKGAVTPIKNQGSCGCCWAFSAVAAIEGATQIKKGKLISLSEQQLVDC 185

Query: 187 STDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPK 246
            T++ GCSGGLMD AFE+I+   GL TE++YPY+ E+  C  +  K  AA+I+ YED+P 
Sbjct: 186 DTNDFGCSGGLMDTAFEHIMATGGLTTESNYPYKGEDANCKIKSTKPSAASITGYEDVPV 245

Query: 247 GDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGA 306
            DE AL++AV++QPVSV ++  G  F FY SGV   +C    DH V  VG+  ++   G+
Sbjct: 246 NDENALMKAVAHQPVSVGIEGGGFDFQFYSSGVFTGECTTYLDHAVTAVGY--SQSSAGS 303

Query: 307 KYWLIKNSWGETWGESGYIRILRDA----GLCGIATAASYPV 344
           KYW+IKNSWG  WGE GY+RI +D     GLCG+A  ASYP 
Sbjct: 304 KYWIIKNSWGTKWGEGGYMRIKKDIKDKEGLCGLAMKASYPT 345


>gi|224076972|ref|XP_002305074.1| predicted protein [Populus trichocarpa]
 gi|224106329|ref|XP_002333698.1| predicted protein [Populus trichocarpa]
 gi|222837984|gb|EEE76349.1| predicted protein [Populus trichocarpa]
 gi|222848038|gb|EEE85585.1| predicted protein [Populus trichocarpa]
          Length = 307

 Score =  360 bits (925), Expect = 5e-97,   Method: Compositional matrix adjust.
 Identities = 173/316 (54%), Positives = 223/316 (70%), Gaps = 17/316 (5%)

Query: 38  IVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNE 97
           ++++HE+WMAQHGR Y D  EK  R  IFK+N+E IE  N   +R YKLG N+F+DLTNE
Sbjct: 1   MLKRHEEWMAQHGRVYGDMKEKEKRYLIFKENIERIEAFNNGSDRGYKLGVNKFADLTNE 60

Query: 98  EFRALYTGYNRPVPSVSRQSSR--PSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSC 155
           EFRA++ GY R       QSS+   S+F+++N++ +PTS+DWR+ GAVT +KDQG CG C
Sbjct: 61  EFRAMHHGYKR-------QSSKLMSSSFRHENLSAIPTSMDWRKAGAVTPVKDQGTCGCC 113

Query: 156 WAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD--NHGCSGGLMDKAFEYIIENKGLAT 213
           WAFSAVAA+EGI ++  GKLI LSEQQLVDC     + GC GGLMD AF++I+ N GL +
Sbjct: 114 WAFSAVAAIEGIIKLKTGKLISLSEQQLVDCDVKGVDQGCGGGLMDNAFQFILRNGGLTS 173

Query: 214 EADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFH 273
           EA YPY+  +GTC ++K  ++ A I+ YED+P  +E ALLQAV+ QPVSV V+  G  F 
Sbjct: 174 EATYPYQGVDGTCKSKKTASIEAKITGYEDVPVNNENALLQAVAKQPVSVAVEGGGYDFQ 233

Query: 274 FYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD--- 330
           FYKSGV   DCG   DH V  +G+GT    +G  YWL+KNSWG +WGESGY+R+ R    
Sbjct: 234 FYKSGVFKGDCGTYLDHAVTAIGYGT--NSDGTNYWLVKNSWGTSWGESGYMRMQRGIGA 291

Query: 331 -AGLCGIATAASYPVA 345
             GLCG+A  ASYP A
Sbjct: 292 REGLCGVAMDASYPTA 307


>gi|357483847|ref|XP_003612210.1| Cysteine proteinase [Medicago truncatula]
 gi|355513545|gb|AES95168.1| Cysteine proteinase [Medicago truncatula]
          Length = 344

 Score =  360 bits (923), Expect = 6e-97,   Method: Compositional matrix adjust.
 Identities = 178/344 (51%), Positives = 235/344 (68%), Gaps = 17/344 (4%)

Query: 13  MFVIIILVITC----ASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQ 68
           ++ I + ++ C    A QV S R++ + S+ E+HE+WM  +G+ YKD  E+  R  IF +
Sbjct: 7   LYHISLALVFCLGLWAIQVTS-RTLQDGSMHERHERWMNHYGKVYKDHQEREKRFKIFTE 65

Query: 69  NLEYIEKANK-EGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQN 127
           N++YIE  N  + N +YKLG N+F+DLTNEEF A     N+    +     R +TFKY+N
Sbjct: 66  NMKYIEAFNNGDNNESYKLGINQFADLTNEEFVA---SRNKFKGHMCSSIIRTTTFKYEN 122

Query: 128 VTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCS 187
           V+ +P+++DWR+KGAVT +K+QGQCG CWAFSAVAA EGI +++ GKL+ LSEQ+LVDC 
Sbjct: 123 VSAIPSTVDWRKKGAVTPVKNQGQCGCCWAFSAVAATEGIHKLSTGKLVSLSEQELVDCD 182

Query: 188 TD--NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLP 245
           T   + GC GGLMD AF++II+N GL TEA YPY+  +GTC+  K    A TI+ YED+P
Sbjct: 183 TKGVDQGCEGGLMDDAFKFIIQNHGLNTEAQYPYQGVDGTCNANKASIQATTITGYEDVP 242

Query: 246 KGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENG 305
             +EQAL +AV+NQP+SV +DASG  F FYKSGV    CG   DHGV  VG+G + +  G
Sbjct: 243 ANNEQALQKAVANQPISVAIDASGSDFQFYKSGVFTGSCGTELDHGVTAVGYGVSND--G 300

Query: 306 AKYWLIKNSWGETWGESGYIRILRDA----GLCGIATAASYPVA 345
            KYWL+KNSWG  WGE GYI + R      GLCGIA  ASYP A
Sbjct: 301 TKYWLVKNSWGTDWGEEGYIMMQRGVEAAEGLCGIAMQASYPTA 344


>gi|356543038|ref|XP_003539970.1| PREDICTED: vignain-like [Glycine max]
          Length = 343

 Score =  360 bits (923), Expect = 7e-97,   Method: Compositional matrix adjust.
 Identities = 174/324 (53%), Positives = 227/324 (70%), Gaps = 11/324 (3%)

Query: 28  VSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLG 87
           V+ R++ + S+ E+HE+WMA++ + YKD  E+  R  IFK+N+ YIE  N   N+ YKLG
Sbjct: 25  VTSRTLQDASMYERHEEWMARYAKVYKDPEEREKRFKIFKENVNYIEAFNNAANKPYKLG 84

Query: 88  TNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIK 147
            N+F+DLTNEEF A     NR    +    +R +TFKY+NVT +P+++DWR+KGAVT IK
Sbjct: 85  INQFADLTNEEFIAPR---NRFKGHMCSSITRTTTFKYENVTALPSTVDWRQKGAVTPIK 141

Query: 148 DQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST--DNHGCSGGLMDKAFEYI 205
           DQGQCG CWAFSAVAA EGI  +  GKLI LSEQ++VDC T  ++ GC+GG MD AF++I
Sbjct: 142 DQGQCGCCWAFSAVAATEGIHALNSGKLISLSEQEVVDCDTKGEDQGCAGGFMDGAFKFI 201

Query: 206 IENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCV 265
           I+N GL TEA+YPY+  +G C+  +    AATI+ YED+P  +E+AL +AV+NQPVSV +
Sbjct: 202 IQNHGLNTEANYPYKAVDGKCNANEAANHAATITGYEDVPVNNEKALQKAVANQPVSVAI 261

Query: 266 DASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYI 325
           DASG  F FYK+GV    CG   DHGV  VG+G + +  G +YWL+KNSWG  WGE GYI
Sbjct: 262 DASGSDFQFYKTGVFTGSCGTQLDHGVTAVGYGVSAD--GTQYWLVKNSWGTEWGEEGYI 319

Query: 326 RILRDA----GLCGIATAASYPVA 345
            + R      GLCGIA  ASYP A
Sbjct: 320 MMQRGVKAQEGLCGIAMMASYPTA 343


>gi|356515036|ref|XP_003526207.1| PREDICTED: thiol protease SEN102-like [Glycine max]
          Length = 336

 Score =  359 bits (921), Expect = 1e-96,   Method: Compositional matrix adjust.
 Identities = 175/327 (53%), Positives = 220/327 (67%), Gaps = 18/327 (5%)

Query: 25  SQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTY 84
           SQV+  R +HE S+ E+HEQWM ++G+ YKD  EK  R  IFK N+E+IE  N +GN+ Y
Sbjct: 22  SQVMC-RKLHETSMRERHEQWMTEYGKVYKDAAEKDKRFQIFKDNVEFIESFNADGNKPY 80

Query: 85  KLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVT 144
           KLG N  +DLT EEF+A   G+ RP           +TFKY+NVT +P +IDWR KGAVT
Sbjct: 81  KLGVNHLADLTVEEFKASRNGFKRP------HEFSTTTFKYENVTAIPAAIDWRTKGAVT 134

Query: 145 HIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD--NHGCSGGLMDKAF 202
            IKDQGQCGSCWAFS +AA EGI QIT GKL+ LSEQ+LVDC T   + GC GG M+  F
Sbjct: 135 PIKDQGQCGSCWAFSTIAATEGIHQITTGKLVSLSEQELVDCDTKGVDQGCEGGYMEDGF 194

Query: 203 EYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVS 262
           E+II+N G+ +E +YPY+  +G C+  K  +  A I  YE +P   E AL +AV+NQPVS
Sbjct: 195 EFIIKNGGITSETNYPYKAVDGKCN--KATSPVAQIKGYEKVPPNSETALQKAVANQPVS 252

Query: 263 VCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGES 322
           V +DA G  F FY SG+ N +CG   DHGV  VG+GTA   NG  YW++KNSWG  WGE 
Sbjct: 253 VSIDADGAGFMFYSSGIYNGECGTELDHGVTAVGYGTA---NGTDYWIVKNSWGTQWGEK 309

Query: 323 GYIRILR----DAGLCGIATAASYPVA 345
           GY+R+ R      GLCGIA  +SYP +
Sbjct: 310 GYVRMQRGIAAKHGLCGIALDSSYPTS 336


>gi|18422605|ref|NP_568651.1| senescence-associated protein 12 [Arabidopsis thaliana]
 gi|13877737|gb|AAK43946.1|AF370131_1 putative senescence-specific cysteine protease SAG12 [Arabidopsis
           thaliana]
 gi|9758936|dbj|BAB09317.1| senescence-specific cysteine protease [Arabidopsis thaliana]
 gi|14532898|gb|AAK64131.1| putative senescence-specific cysteine protease SAG12 [Arabidopsis
           thaliana]
 gi|332007929|gb|AED95312.1| senescence-associated protein 12 [Arabidopsis thaliana]
          Length = 346

 Score =  358 bits (919), Expect = 2e-96,   Method: Compositional matrix adjust.
 Identities = 179/352 (50%), Positives = 239/352 (67%), Gaps = 15/352 (4%)

Query: 1   MVLKFEKSFIIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKA 60
           M LK  + F+     + I    C S  +S    +E  + ++H +WM +HGR Y D  E+ 
Sbjct: 1   MALKHMQIFLF----VAIFSSFCFSITLSRPLDNELIMQKRHIEWMTKHGRVYADVKEEN 56

Query: 61  MRLNIFKQNLEYIEKANK-EGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQS-S 118
            R  +FK N+E IE  N     RT+KL  N+F+DLTN+EFR++YTG+ + V ++S QS +
Sbjct: 57  NRYVVFKNNVERIEHLNSIPAGRTFKLAVNQFADLTNDEFRSMYTGF-KGVSALSSQSQT 115

Query: 119 RPSTFKYQNVTD--VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLI 176
           + S F+YQNV+   +P S+DWR+KGAVT IK+QG CG CWAFSAVAA+EG TQI +GKLI
Sbjct: 116 KMSPFRYQNVSSGALPVSVDWRKKGAVTPIKNQGSCGCCWAFSAVAAIEGATQIKKGKLI 175

Query: 177 ELSEQQLVDCSTDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAA 236
            LSEQQLVDC T++ GC GGLMD AFE+I    GL TE++YPY+ E+ TC+++K    A 
Sbjct: 176 SLSEQQLVDCDTNDFGCEGGLMDTAFEHIKATGGLTTESNYPYKGEDATCNSKKTNPKAT 235

Query: 237 TISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVG 296
           +I+ YED+P  DEQAL++AV++QPVSV ++  G  F FY SGV   +C    DH V  +G
Sbjct: 236 SITGYEDVPVNDEQALMKAVAHQPVSVGIEGGGFDFQFYSSGVFTGECTTYLDHAVTAIG 295

Query: 297 FGTAEEENGAKYWLIKNSWGETWGESGYIRILRDA----GLCGIATAASYPV 344
           +G  E  NG+KYW+IKNSWG  WGESGY+RI +D     GLCG+A  ASYP 
Sbjct: 296 YG--ESTNGSKYWIIKNSWGTKWGESGYMRIQKDVKDKQGLCGLAMKASYPT 345


>gi|357167196|ref|XP_003581047.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like
           [Brachypodium distachyon]
          Length = 338

 Score =  358 bits (918), Expect = 2e-96,   Method: Compositional matrix adjust.
 Identities = 175/342 (51%), Positives = 227/342 (66%), Gaps = 16/342 (4%)

Query: 14  FVIIILVITCASQVVSGRSMHEPS--IVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLE 71
           F+  ++V T A   +  R + +    I  +HEQWMA++GR Y D  EKA RL +FK N+ 
Sbjct: 3   FLFALVVCTFALGALGARDLADDDWLIAARHEQWMARYGRVYSDVAEKARRLEVFKANVG 62

Query: 72  YIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT-- 129
           +IE  N  GN  + L  N+F+D+T +EFRA++ GY   V       +R + F+Y NV+  
Sbjct: 63  FIESVNA-GNHKFWLEANQFADITKDEFRAMHKGYKMQVIG---SKARATGFRYANVSID 118

Query: 130 DVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST- 188
           D+P S+DWR  GAVT +KDQGQCG CWAFS VA++EGI +++ GKLI LSEQ+LVDC   
Sbjct: 119 DLPASVDWRANGAVTPVKDQGQCGCCWAFSTVASMEGIVKVSTGKLISLSEQELVDCDVG 178

Query: 189 -DNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKG 247
             N GC GGLMD AFE+I+ N GL TEADYPY   +GTC++ KE  +AA+I  YED+P  
Sbjct: 179 MQNKGCGGGLMDNAFEFIVNNGGLDTEADYPYTGADGTCNSNKESNIAASIKGYEDVPAN 238

Query: 248 DEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAK 307
           DE +L +AV+ QPVS+ VD     F FYK GVL   CG   DHGVA VG+G A +  G K
Sbjct: 239 DEASLQKAVAAQPVSIAVDGGDDLFRFYKGGVLTGACGTELDHGVAAVGYGVAGD--GTK 296

Query: 308 YWLIKNSWGETWGESGYIRILRD----AGLCGIATAASYPVA 345
           YWL+KNSWG +WGE G+IR+ RD    AG+CG+A   SYP A
Sbjct: 297 YWLVKNSWGTSWGEDGFIRLERDVADEAGMCGLAMKPSYPTA 338


>gi|357471211|ref|XP_003605890.1| Cysteine proteinase [Medicago truncatula]
 gi|355506945|gb|AES88087.1| Cysteine proteinase [Medicago truncatula]
          Length = 343

 Score =  358 bits (918), Expect = 3e-96,   Method: Compositional matrix adjust.
 Identities = 175/325 (53%), Positives = 228/325 (70%), Gaps = 13/325 (4%)

Query: 28  VSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANK-EGNRTYKL 86
           V+ R++ +  + E+H QWM+Q+G+ YKD  E+  R  IF +N+ YIE  NK + N+ Y L
Sbjct: 25  VTSRTLQD-DMYERHRQWMSQYGKVYKDSQEREKRFKIFTENVNYIEAFNKGDNNKLYTL 83

Query: 87  GTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHI 146
           G N+F+DLTN+EF    +  N+    +    +R STFKY+N + +P+S+DWR+KGAVT +
Sbjct: 84  GVNQFADLTNDEFT---SSRNKFKGHMCSSITRTSTFKYENASAIPSSVDWRKKGAVTPV 140

Query: 147 KDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD--NHGCSGGLMDKAFEY 204
           K+QGQCG CWAFSAVAA EGI +++ GKLI LSEQ+LVDC T   + GC GGLMD AF++
Sbjct: 141 KNQGQCGCCWAFSAVAATEGIHKLSTGKLISLSEQELVDCDTKGVDQGCEGGLMDDAFKF 200

Query: 205 IIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVC 264
           II+N GL TEA+YPY+  +GTC+  K    A TI+ YED+P  +EQAL +AV+NQP+SV 
Sbjct: 201 IIQNHGLNTEANYPYQGVDGTCNANKGSINAVTITGYEDVPTNNEQALQKAVANQPISVA 260

Query: 265 VDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGY 324
           +DASG  F FYKSGV    CG   DHGV  VG+G + +  G KYWL+KNSWG  WGE GY
Sbjct: 261 IDASGSDFQFYKSGVFTGSCGTELDHGVTAVGYGVSND--GTKYWLVKNSWGTEWGEEGY 318

Query: 325 IRILR--DA--GLCGIATAASYPVA 345
           I + R  DA  GLCGIA  ASYP A
Sbjct: 319 IMMQRGVDAAEGLCGIAMQASYPTA 343


>gi|319826926|gb|ADV74756.1| cysteine protease [Lactuca sativa]
          Length = 363

 Score =  358 bits (918), Expect = 3e-96,   Method: Compositional matrix adjust.
 Identities = 169/323 (52%), Positives = 222/323 (68%), Gaps = 10/323 (3%)

Query: 28  VSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLG 87
            + R++++P+++ +HEQWMA HGR Y DE EK +R  IFK N+ YI+  N   +++Y L 
Sbjct: 41  ATSRTLNDPTMIARHEQWMAHHGRIYTDENEKQLRFQIFKNNVAYIDAHNARSDQSYTLE 100

Query: 88  TNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIK 147
            N+F+DLTN+EFRA   GY +   S S   S    F+Y NV+ VP  +DWR++GAVT +K
Sbjct: 101 VNKFADLTNDEFRASRNGYKKQPDSDSHVVS--GLFRYANVSAVPDEVDWRKEGAVTPVK 158

Query: 148 DQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD--NHGCSGGLMDKAFEYI 205
           DQG CG CWAFSAVAA+EGI ++  GKL+ LSEQ+LVDC  D  + GC GGLM+ AF++I
Sbjct: 159 DQGDCGCCWAFSAVAAMEGINKLENGKLVSLSEQELVDCDIDGIDQGCEGGLMENAFQFI 218

Query: 206 IENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCV 265
            + KGLA E+ YPY  E+G C+ +K    AA IS +E +P  +E+ALLQAV+NQPVS+ +
Sbjct: 219 EKRKGLAAESVYPYTGEDGICNTKKAAIPAAKISGHEKVPANNEKALLQAVANQPVSIAI 278

Query: 266 DASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYI 325
           DASG  F FY  GV    CG   DH +  VG+G   +  G KYWL+KNSWG +WGE+GYI
Sbjct: 279 DASGYEFQFYSGGVFTGSCGTELDHAITAVGYGATMD--GTKYWLMKNSWGASWGENGYI 336

Query: 326 RILRDA----GLCGIATAASYPV 344
           RI RD+    GLCGIA   SYPV
Sbjct: 337 RIKRDSLAKEGLCGIAMDPSYPV 359


>gi|144905112|dbj|BAF56429.1| cysteine proteinase [Lotus japonicus]
          Length = 341

 Score =  358 bits (918), Expect = 3e-96,   Method: Compositional matrix adjust.
 Identities = 179/348 (51%), Positives = 234/348 (67%), Gaps = 21/348 (6%)

Query: 6   EKSFIIPMFVIIILVITCASQVVSGRSMHEP--SIVEKHEQWMAQHGRTYKDELEKAMRL 63
           +K +I+ +F+++ + I   S+V+S R +HE   S++E+HEQWMA++ + YKD  EK  R 
Sbjct: 7   QKQYILALFLLLAVGI---SRVIS-RELHETETSLIERHEQWMAKYDKVYKDAAEKEKRF 62

Query: 64  NIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTF 123
            IFK N+E+IE  N  GN+ YKLG N  +DLT EEF+A   G  R            ++F
Sbjct: 63  LIFKDNVEFIESFNAAGNKPYKLGVNHLADLTIEEFKASRNGLKRSYD----YEVGTTSF 118

Query: 124 KYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQL 183
           KY+NVT +P S+DWR+KGAVT IKDQGQCGSCWAFS VAA EGI +I+ GKL+ LSEQ+L
Sbjct: 119 KYENVTAIPASVDWRKKGAVTPIKDQGQCGSCWAFSTVAATEGIHKISTGKLVSLSEQEL 178

Query: 184 VDCSTD--NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKY 241
           VDC     + GC GG M+  FE+II+N G+ TEA+YPY+  +G+C N    A AA I  Y
Sbjct: 179 VDCDRKGTDQGCEGGYMEDGFEFIIKNGGITTEANYPYKAVDGSCKNA--TAPAAQIKGY 236

Query: 242 EDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAE 301
           E +P   E+ALL+AV+NQPVSV +DA+  +F FY SG+   +CG   DHGV  VG+G A 
Sbjct: 237 EKVPVNSEKALLKAVANQPVSVSIDAADGSFMFYSSGIFTGECGTELDHGVTAVGYGRA- 295

Query: 302 EENGAKYWLIKNSWGETWGESGYIRILR----DAGLCGIATAASYPVA 345
             NG  YW++KNSWG  WGE GYIR+ R      GLCGIA  +SYP A
Sbjct: 296 --NGTDYWIVKNSWGTVWGEQGYIRMQRGIAAKEGLCGIAMDSSYPTA 341


>gi|357160300|ref|XP_003578721.1| PREDICTED: oryzain beta chain-like [Brachypodium distachyon]
          Length = 349

 Score =  357 bits (917), Expect = 4e-96,   Method: Compositional matrix adjust.
 Identities = 174/347 (50%), Positives = 232/347 (66%), Gaps = 16/347 (4%)

Query: 11  IPMFVIIILVITC---ASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFK 67
           IP   ++ +V+ C    S V+S R + + ++VE+HEQWMAQHGR YKD  EKA R   F+
Sbjct: 3   IPKVFLLAVVLGCICLCSTVLSARELGDAAMVERHEQWMAQHGRVYKDGAEKARRFEAFR 62

Query: 68  QNLEYIEKANKEGNR-TYKLGTNEFSDLTNEEFRALYT--GYNRPVPSVSRQSSRPSTFK 124
            N+ +IE  N  GNR  + LG N+F+DLTN+EFRA  T  G+ +   +   ++S   TF+
Sbjct: 63  NNVVFIESFNAAGNRRKFWLGVNQFTDLTNDEFRATKTNKGFIKRNAAAVNKASPTGTFR 122

Query: 125 YQNVTD--VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQ 182
           Y NV+   +P ++DWR KGAVT IK+QGQCG CWAFSAVAA EGI Q++ GKL+ LSEQ+
Sbjct: 123 YSNVSADALPAAVDWRAKGAVTPIKNQGQCGCCWAFSAVAATEGIVQLSTGKLVPLSEQE 182

Query: 183 LVDCSTD--NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISK 240
           LVDC  +  +HGC GG MD AFE+II+N GL +E +YPY  ++G C  +      ATI  
Sbjct: 183 LVDCDANGADHGCEGGEMDDAFEFIIKNGGLTSETNYPYTAQDGQCKAKNTINSVATIKG 242

Query: 241 YEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTA 300
           YED+P  DE +L++AV+ QPVSV VD     F  Y  GVL+  CG + DHG+  VG+G A
Sbjct: 243 YEDVPANDEASLMKAVAAQPVSVAVDGGDMVFQHYAGGVLSGSCGTSLDHGIVAVGYGAA 302

Query: 301 EEENGAKYWLIKNSWGETWGESGYIRILRD----AGLCGIATAASYP 343
           ++  G K+WL+KNSWG TWGE GYIR+ +D     G+CG+A   SYP
Sbjct: 303 DD--GTKFWLMKNSWGTTWGEDGYIRMEKDVADAGGMCGLAMQPSYP 347


>gi|356517426|ref|XP_003527388.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like [Glycine
           max]
          Length = 343

 Score =  357 bits (916), Expect = 4e-96,   Method: Compositional matrix adjust.
 Identities = 174/324 (53%), Positives = 226/324 (69%), Gaps = 11/324 (3%)

Query: 28  VSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLG 87
           V+ R++ + S+ E+H QWMA++ + YKD  E+  R  IFK+N+ YIE  N   N++YKL 
Sbjct: 25  VTCRTLQDASMYERHAQWMARYAKVYKDPQEREKRFRIFKENVNYIETFNSADNKSYKLD 84

Query: 88  TNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIK 147
            N+F+DLTNEEF A     NR    +    +R +TFKY+NVT +P+++DWR+KGAVT IK
Sbjct: 85  INQFADLTNEEFIA---PRNRFKGHMCSSITRTTTFKYENVTVIPSTVDWRQKGAVTPIK 141

Query: 148 DQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST--DNHGCSGGLMDKAFEYI 205
           DQGQCG CWAFSAVAA EGI  +  GKLI LSEQ++VDC T   + GC+GG MD AF++I
Sbjct: 142 DQGQCGCCWAFSAVAATEGIHALNAGKLISLSEQEVVDCDTKGQDQGCAGGFMDGAFKFI 201

Query: 206 IENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCV 265
           I+N GL TE +YPY+  +G C+ +     AATI+ YED+P  +E+AL +AV+NQPVSV +
Sbjct: 202 IQNHGLNTEPNYPYKAADGKCNAKAAANHAATITGYEDVPVNNEKALQKAVANQPVSVAI 261

Query: 266 DASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYI 325
           DASG  F FYKSGV    CG   DHGV  VG+G + +  G +YWL+KNSWG  WGE GYI
Sbjct: 262 DASGSDFQFYKSGVFTGSCGTELDHGVTAVGYGVSAD--GTEYWLVKNSWGTEWGEEGYI 319

Query: 326 RILR----DAGLCGIATAASYPVA 345
           R+ R    + GLCGIA  ASYP A
Sbjct: 320 RMQRGVKAEEGLCGIAMMASYPTA 343


>gi|356577763|ref|XP_003556992.1| PREDICTED: vignain-like [Glycine max]
          Length = 343

 Score =  357 bits (916), Expect = 4e-96,   Method: Compositional matrix adjust.
 Identities = 172/342 (50%), Positives = 233/342 (68%), Gaps = 14/342 (4%)

Query: 13  MFVIIILVITCASQV---VSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQN 69
            + I + ++ C+  +   V+ R++ + S+ E+HE+WM ++ + YKD  E+  R  IFK+N
Sbjct: 7   FYQISLALLFCSGFLTFQVTCRTLQDASMYERHEEWMGRYAKVYKDPQERERRFKIFKEN 66

Query: 70  LEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
           + YIE  N   N+ Y LG N+F+DLTNEEF A     NR    +    +R +TFKY+NVT
Sbjct: 67  VNYIEAFNNAANKPYTLGINQFADLTNEEFIAPR---NRFKGHMCSSITRTTTFKYENVT 123

Query: 130 DVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST- 188
            +P+++DWR+KGAVT IKDQGQCG CWAFSAVAA EGI  ++ GKLI LSEQ++VDC T 
Sbjct: 124 AIPSTVDWRQKGAVTPIKDQGQCGCCWAFSAVAATEGIHALSAGKLISLSEQEVVDCDTK 183

Query: 189 -DNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKG 247
            ++ GC+GG MD AF++II+N GL  E +YPY+  +G C+ +      ATI+ YED+P  
Sbjct: 184 GEDQGCAGGFMDGAFKFIIQNHGLNNEPNYPYKAVDGKCNAKAAANHVATITGYEDVPVN 243

Query: 248 DEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAK 307
           +E+AL +AV+NQPVSV +DASG  F FY+SGV    CG   DHGV  VG+G + +  G +
Sbjct: 244 NEKALQKAVANQPVSVAIDASGSDFQFYQSGVFTGSCGTELDHGVTAVGYGVSAD--GTE 301

Query: 308 YWLIKNSWGETWGESGYIRILR----DAGLCGIATAASYPVA 345
           YWL+KNSWG  WGE GYIR+ R    + GLCGIA  ASYP A
Sbjct: 302 YWLVKNSWGTEWGEEGYIRMQRGVKAEEGLCGIAMMASYPTA 343


>gi|356517348|ref|XP_003527349.1| PREDICTED: vignain-like [Glycine max]
          Length = 343

 Score =  357 bits (916), Expect = 4e-96,   Method: Compositional matrix adjust.
 Identities = 172/342 (50%), Positives = 233/342 (68%), Gaps = 14/342 (4%)

Query: 13  MFVIIILVITCASQV---VSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQN 69
            + I + ++ C+  +   V+ R++ + S+ E+HE+WM ++ + YKD  E+  R  IFK+N
Sbjct: 7   FYQISLALLFCSGFLAFQVTCRTLQDASMYERHEEWMGRYAKVYKDPQERERRFKIFKEN 66

Query: 70  LEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
           + YIE  N   N+ Y LG N+F+DLTNEEF A     NR    +    +R +TFKY+NVT
Sbjct: 67  VNYIEAFNNAANKPYTLGINQFADLTNEEFIAPR---NRFKGHMCSSITRTTTFKYENVT 123

Query: 130 DVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST- 188
            +P+++DWR+KGAVT IKDQGQCG CWAFSAVAA EGI  ++ GKLI LSEQ++VDC T 
Sbjct: 124 AIPSTVDWRQKGAVTPIKDQGQCGCCWAFSAVAATEGIHALSAGKLISLSEQEVVDCDTK 183

Query: 189 -DNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKG 247
            ++ GC+GG MD AF++II+N GL  E +YPY+  +G C+ +      ATI+ YED+P  
Sbjct: 184 GEDQGCAGGFMDGAFKFIIQNHGLNNEPNYPYKAVDGKCNAKAAANHVATITGYEDVPVN 243

Query: 248 DEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAK 307
           +E+AL +AV+NQPVSV +DASG  F FY+SGV    CG   DHGV  VG+G + +  G +
Sbjct: 244 NEKALQKAVANQPVSVAIDASGSDFQFYQSGVFTGSCGTELDHGVTAVGYGVSAD--GTE 301

Query: 308 YWLIKNSWGETWGESGYIRILR----DAGLCGIATAASYPVA 345
           YWL+KNSWG  WGE GYIR+ R    + GLCGIA  ASYP A
Sbjct: 302 YWLVKNSWGTEWGEEGYIRMQRGVKAEEGLCGIAMMASYPTA 343


>gi|356543076|ref|XP_003539989.1| PREDICTED: vignain-like [Glycine max]
          Length = 343

 Score =  357 bits (915), Expect = 6e-96,   Method: Compositional matrix adjust.
 Identities = 172/324 (53%), Positives = 227/324 (70%), Gaps = 11/324 (3%)

Query: 28  VSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLG 87
           V+ R++ + S+ E+HE+WMA++ + YKD  E+  R  IFK+N+ YIE  N   ++ YKLG
Sbjct: 25  VTSRTLQDASMYERHEEWMARYAKVYKDPEEREKRFKIFKENVNYIEAFNNAADKPYKLG 84

Query: 88  TNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIK 147
            N+F+DLTNEEF A     N+    +    +R +TFKY+NVT +P+++DWR+KGAVT IK
Sbjct: 85  INQFADLTNEEFIAPR---NKFKGHMCSSITRTTTFKYENVTALPSTVDWRQKGAVTPIK 141

Query: 148 DQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST--DNHGCSGGLMDKAFEYI 205
           DQGQCG CWAFSAVAA EGI  +  GKLI LSEQ++VDC T  ++ GC+GG MD AF++I
Sbjct: 142 DQGQCGCCWAFSAVAATEGIHALNSGKLISLSEQEVVDCDTKGEDQGCAGGFMDGAFKFI 201

Query: 206 IENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCV 265
           I+N GL TEA+YPY+  +G C+  +    AATI+ YED+P  +E+AL +AV+NQPVSV +
Sbjct: 202 IQNHGLNTEANYPYKAVDGKCNANEAANHAATITGYEDVPVNNEKALQKAVANQPVSVAI 261

Query: 266 DASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYI 325
           DASG  F FYK+GV    CG   DHGV  VG+G + +  G +YWL+KNSWG  WGE GYI
Sbjct: 262 DASGSDFQFYKTGVFTGSCGTQLDHGVTAVGYGVSAD--GTQYWLVKNSWGTEWGEEGYI 319

Query: 326 RILRDA----GLCGIATAASYPVA 345
            + R      GLCGIA  ASYP A
Sbjct: 320 MMQRGVKAQEGLCGIAMMASYPTA 343


>gi|357452075|ref|XP_003596314.1| Cysteine proteinase [Medicago truncatula]
 gi|355485362|gb|AES66565.1| Cysteine proteinase [Medicago truncatula]
          Length = 341

 Score =  356 bits (914), Expect = 7e-96,   Method: Compositional matrix adjust.
 Identities = 184/335 (54%), Positives = 234/335 (69%), Gaps = 13/335 (3%)

Query: 17  IILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKA 76
           + L +   S   + R++    + E HEQWM QHG+ YK   EK  R  IFK+N+ YIE  
Sbjct: 14  LFLCLGLLSFQATSRTLQNDPMYEMHEQWMVQHGKVYKAAHEKQKRFGIFKENVNYIEAF 73

Query: 77  NKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSID 136
           N  GN++YKLG N F+DLTN EF A    +N  +       S  +TFKY+NV+DVP+++D
Sbjct: 74  NNVGNKSYKLGLNHFADLTNHEFIAARNKFNGYL-----HGSIITTFKYKNVSDVPSAVD 128

Query: 137 WREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD--NHGCS 194
           WR++GAVT +K+QGQCG CWAFSAVA+ EGI ++T G L+ LSEQ+LVDC T+  + GC 
Sbjct: 129 WRQEGAVTPVKNQGQCGCCWAFSAVASTEGIHKLTTGNLVSLSEQELVDCDTNGEDQGCE 188

Query: 195 GGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQ 254
           GGLMD AFE+II+N GL+TEA+YPY+  +GTC+  +  + AATIS YE++P  DEQAL +
Sbjct: 189 GGLMDDAFEFIIQNNGLSTEAEYPYQGVDGTCNKTEVGSSAATISGYENVPVNDEQALQK 248

Query: 255 AVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNS 314
           AV+NQPVSV +DASG  F FYKSGV    CG   DHGVAVVG+G  E+E   +YWL+KNS
Sbjct: 249 AVANQPVSVAIDASGSDFQFYKSGVFTGSCGTELDHGVAVVGYGVGEDE--TEYWLVKNS 306

Query: 315 WGETWGESGYIRILR--DA--GLCGIATAASYPVA 345
           WG  WGE GYIR+ R  DA  GLCGIA   SYP A
Sbjct: 307 WGTQWGEEGYIRMQRGVDASEGLCGIAMQPSYPTA 341


>gi|356517350|ref|XP_003527350.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like [Glycine
           max]
 gi|356577765|ref|XP_003556993.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like [Glycine
           max]
          Length = 343

 Score =  356 bits (914), Expect = 8e-96,   Method: Compositional matrix adjust.
 Identities = 175/324 (54%), Positives = 226/324 (69%), Gaps = 11/324 (3%)

Query: 28  VSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLG 87
           V+ R++ + S+ E+HEQWMA++G+ YKD  EK  R  +FK+N+ YIE  N   N+ YKLG
Sbjct: 25  VASRTLQDASMYERHEQWMARYGKVYKDPEEKEKRFRVFKENVNYIEAFNNAANKPYKLG 84

Query: 88  TNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIK 147
            N+F+DLT+EEF       NR        ++R +TFKY+NVT +P SIDWR+KGAVT IK
Sbjct: 85  INQFADLTSEEF---IVPRNRFNGHTRSSNTRTTTFKYENVTVLPDSIDWRQKGAVTPIK 141

Query: 148 DQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD--NHGCSGGLMDKAFEYI 205
           +QG CG CWAFSA+AA EGI +I+ GKL+ LSEQ++VDC T   +HGC GG MD AF++I
Sbjct: 142 NQGSCGCCWAFSAIAATEGIHKISTGKLVSLSEQEVVDCDTKGTDHGCEGGYMDGAFKFI 201

Query: 206 IENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCV 265
           I+N G+ TEA YPY+  +G C+ ++E   AATI+ YED+P  +E+AL +AV+NQPVSV +
Sbjct: 202 IQNHGINTEASYPYKGVDGKCNIKEEAVHAATITGYEDVPINNEKALQKAVANQPVSVAI 261

Query: 266 DASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYI 325
           DASG  F FYKSG+    CG   DHGV  VG+G  E   G KYWL+KNSWG  WGE GYI
Sbjct: 262 DASGADFQFYKSGIFTGSCGTELDHGVTAVGYG--ENNEGTKYWLVKNSWGTEWGEEGYI 319

Query: 326 RILRDA----GLCGIATAASYPVA 345
            + R      G+CGIA  ASYP A
Sbjct: 320 MMQRGVKAVEGICGIAMMASYPTA 343


>gi|1046373|gb|AAC49135.1| SAG12 protein [Arabidopsis thaliana]
          Length = 346

 Score =  356 bits (913), Expect = 1e-95,   Method: Compositional matrix adjust.
 Identities = 179/352 (50%), Positives = 238/352 (67%), Gaps = 15/352 (4%)

Query: 1   MVLKFEKSFIIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKA 60
           M LK  + F+     + I    C S  +S    +E  + ++H +WM +HGR Y D  E+ 
Sbjct: 1   MALKHMQIFLF----VAIFSSFCFSITLSRPLDNELIMQKRHIEWMTKHGRVYADVKEEN 56

Query: 61  MRLNIFKQNLEYIEKANK-EGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQS-S 118
            R  +FK N+E IE  N     RT+KL  N+F+DLTN+EF ++YTG+ + V ++S QS +
Sbjct: 57  NRYVVFKNNVERIEHLNSIPAGRTFKLAVNQFADLTNDEFCSMYTGF-KGVSALSSQSQT 115

Query: 119 RPSTFKYQNVTD--VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLI 176
           + S F+YQNV+   +P S+DWR+KGAVT IK+QG CG CWAFSAVAA+EG TQI +GKLI
Sbjct: 116 KMSPFRYQNVSSGALPVSVDWRKKGAVTPIKNQGSCGCCWAFSAVAAIEGATQIKKGKLI 175

Query: 177 ELSEQQLVDCSTDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAA 236
            LSEQQLVDC T++ GC GGLMD AFE+I    GL TE+DYPY+ E+ TC+++K    A 
Sbjct: 176 SLSEQQLVDCDTNDFGCEGGLMDTAFEHIKATGGLTTESDYPYKGEDATCNSKKTNPKAT 235

Query: 237 TISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVG 296
           +I+ YED+P  DEQAL++AV++QPVSV ++  G  F FY SGV   +C    DH V  +G
Sbjct: 236 SITGYEDVPVNDEQALMKAVAHQPVSVGIEGGGFDFQFYSSGVFTGECTTYLDHAVTAIG 295

Query: 297 FGTAEEENGAKYWLIKNSWGETWGESGYIRILRDA----GLCGIATAASYPV 344
           +G  E  NG+KYW+IKNSWG  WGESGY+RI +D     GLCG+A  ASYP 
Sbjct: 296 YG--ESTNGSKYWIIKNSWGTKWGESGYMRIQKDVKDKQGLCGLAMKASYPT 345


>gi|21666724|gb|AAM73806.1|AF448505_1 cysteine proteinase [Brassica napus]
 gi|21666726|gb|AAM73807.1|AF448506_1 cysteine proteinase [Brassica napus]
          Length = 343

 Score =  355 bits (912), Expect = 1e-95,   Method: Compositional matrix adjust.
 Identities = 169/341 (49%), Positives = 237/341 (69%), Gaps = 11/341 (3%)

Query: 11  IPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNL 70
           I +F+I+ LV + +      R + E ++ ++H  WM +HGR Y D  EK  R  +FK+N+
Sbjct: 6   IQIFLIVSLVSSFSLSTTLSRPLDEVTMQKRHAAWMTEHGRVYADANEKNNRYVVFKRNV 65

Query: 71  EYIEKANK-EGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
           E IE+ N+ +   T+KL  N+F+DLTNEEFR++YTGY     SV    ++P++F+YQ+V+
Sbjct: 66  ESIERLNEVQYGLTFKLAVNQFADLTNEEFRSMYTGYKGN--SVLSSRTKPTSFRYQHVS 123

Query: 130 D--VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCS 187
              +P S+DWR+KGAVT IKDQG CGSCWAFSAVAA+EG+ QI +GKLI LSEQ+LVDC 
Sbjct: 124 SDALPISVDWRKKGAVTPIKDQGSCGSCWAFSAVAAIEGVAQIKKGKLISLSEQELVDCD 183

Query: 188 TDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKG 247
           T++ GC GG M+ AF Y +   GL +E++YPY+  +GTC+  K K +A +I  +ED+P  
Sbjct: 184 TNDDGCMGGYMNSAFNYTMTTGGLTSESNYPYKSTDGTCNINKTKQIATSIKGFEDVPAN 243

Query: 248 DEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAK 307
           DE+AL++AV++ PVS+ +   G  F FY SGV + +C  + DHGVAVVG+G  +  NG+K
Sbjct: 244 DEKALMKAVAHHPVSIGIAGGGTGFQFYSSGVFSGECSTHLDHGVAVVGYG--KSSNGSK 301

Query: 308 YWLIKNSWGETWGESGYIRILRDA----GLCGIATAASYPV 344
           YW++KNSWG  WGE GY+RI +D     G CG+A  ASYP 
Sbjct: 302 YWILKNSWGPKWGERGYMRIKKDTKAKHGQCGLAMNASYPT 342


>gi|224162986|ref|XP_002338508.1| predicted protein [Populus trichocarpa]
 gi|222872535|gb|EEF09666.1| predicted protein [Populus trichocarpa]
          Length = 306

 Score =  355 bits (911), Expect = 2e-95,   Method: Compositional matrix adjust.
 Identities = 168/314 (53%), Positives = 225/314 (71%), Gaps = 14/314 (4%)

Query: 38  IVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNE 97
           + E+HEQWM Q+GR YKD+ E+A R +IFK+N+  I+  N +  ++YKLG N+F+DLTNE
Sbjct: 1   MYERHEQWMTQYGRVYKDDNERATRYSIFKENVARIDAFNSQTGKSYKLGVNQFADLTNE 60

Query: 98  EFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWA 157
           EF+A     NR    +    + P  F+Y+NV+ VP+++DWR++GAVT +KDQGQCG CWA
Sbjct: 61  EFKA---SRNRFKGHMCSPQAGP--FRYENVSAVPSTVDWRKEGAVTPVKDQGQCGCCWA 115

Query: 158 FSAVAAVEGITQITRGKLIELSEQQLVDCST--DNHGCSGGLMDKAFEYIIENKGLATEA 215
           FSAVAA+EGI ++T GKLI LSEQ++VDC T  ++ GC+GGLMD AF++I +NKGL TEA
Sbjct: 116 FSAVAAMEGINKLTTGKLISLSEQEVVDCDTKGEDQGCNGGLMDDAFKFIEQNKGLTTEA 175

Query: 216 DYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFY 275
           +YPY+  +GTC+ +K    AA I+ +ED+P   E AL++AV+ QPVSV +DA G  F FY
Sbjct: 176 NYPYKGTDGTCNTKKSAIHAAKITGFEDVPANSEAALMKAVAKQPVSVAIDAGGSDFQFY 235

Query: 276 KSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD----A 331
            SG+    C    DHGV  VG+G ++   G+KYWL+KNSWG  WGE GYIR+ +D     
Sbjct: 236 SSGIFTGSCDTQLDHGVTAVGYGVSD---GSKYWLVKNSWGAQWGEEGYIRMQKDISAKE 292

Query: 332 GLCGIATAASYPVA 345
           GLCGIA  ASYP A
Sbjct: 293 GLCGIAMQASYPTA 306


>gi|356554921|ref|XP_003545789.1| PREDICTED: LOW QUALITY PROTEIN: thiol protease SEN102-like [Glycine
           max]
          Length = 439

 Score =  354 bits (909), Expect = 3e-95,   Method: Compositional matrix adjust.
 Identities = 171/320 (53%), Positives = 221/320 (69%), Gaps = 11/320 (3%)

Query: 32  SMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEF 91
           ++ + S+ E+HEQWM +HG+ YKD  E+  R  IF +N+ Y+E  N   N+ YKLG N+F
Sbjct: 125 TLQDASMYERHEQWMTRHGKVYKDPREREKRFRIFNENVNYVEAFNNAANKPYKLGINQF 184

Query: 92  SDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQ 151
            DLTN+EF A     NR    +     R +TFKY+NVT VP+++DWR+ GAVT +KDQGQ
Sbjct: 185 XDLTNQEFIA---PRNRFKGHMCSSIIRTTTFKYENVTTVPSTVDWRQNGAVTPVKDQGQ 241

Query: 152 CGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD--NHGCSGGLMDKAFEYIIENK 209
           CG CWAFSAVAA EGI  ++ GKLI LSEQ+LVDC T   + GC GGLMD A+++II+N 
Sbjct: 242 CGCCWAFSAVAATEGIHALSGGKLISLSEQELVDCDTKGVDQGCEGGLMDDAYKFIIQNH 301

Query: 210 GLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASG 269
           GL TEA+YPY+  +G C+  +    AATI+ YED+P  +E+AL +AV+NQPVSV +DAS 
Sbjct: 302 GLNTEANYPYKGVDGKCNANEAANHAATITGYEDVPANNEKALQKAVANQPVSVAIDASS 361

Query: 270 RAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILR 329
             F FYKSG     CG   DHGV  VG+G +  ++G KYWL+KNSWG  WGE GYIR+ R
Sbjct: 362 SDFQFYKSGAFTGSCGTELDHGVTAVGYGVS--DHGTKYWLVKNSWGTEWGEEGYIRMQR 419

Query: 330 ----DAGLCGIATAASYPVA 345
               + G+CGIA  ASYP A
Sbjct: 420 GVDSEEGVCGIAMQASYPTA 439


>gi|357474725|ref|XP_003607647.1| Cysteine proteinase [Medicago truncatula]
 gi|355508702|gb|AES89844.1| Cysteine proteinase [Medicago truncatula]
          Length = 340

 Score =  354 bits (908), Expect = 4e-95,   Method: Compositional matrix adjust.
 Identities = 171/327 (52%), Positives = 224/327 (68%), Gaps = 15/327 (4%)

Query: 25  SQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTY 84
           + V+S +    PS+ E+HEQWM+++G+ YKD +EK  R  IFK N+E+IE  N   N+ Y
Sbjct: 23  TNVMSRKLYESPSLQERHEQWMSEYGKLYKDAIEKEKRFMIFKDNVEFIESFNAADNKPY 82

Query: 85  KLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVT 144
           KL  N  +DLT +EF+A   GY +    + R+ +  S FKY+NVT +P ++DWR KGAVT
Sbjct: 83  KLSVNHLADLTLDEFKASRNGYKK----IDREFATTS-FKYENVTAIPEAVDWRVKGAVT 137

Query: 145 HIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST--DNHGCSGGLMDKAF 202
            IKDQGQCGSCWAFS VAA+EGI QIT GKLI LSEQ+LVDC T  ++ GC GGLM+  F
Sbjct: 138 PIKDQGQCGSCWAFSTVAAIEGINQITTGKLISLSEQELVDCDTKGEDQGCEGGLMEDGF 197

Query: 203 EYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVS 262
           E+II+N G+ +E +YPY+  +G+C N    A  A I+ YE +P   E +LL+AV+NQP+S
Sbjct: 198 EFIIKNGGITSETNYPYKAADGSC-NTATTAPVAKITGYEKVPVNSEISLLKAVANQPIS 256

Query: 263 VCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGES 322
           V +DAS  +F FY SG+   +CG   DHGV  VG+G+A   NG  YW++KNSWG  WGE 
Sbjct: 257 VSIDASDSSFMFYSSGIYTGECGTELDHGVTAVGYGSA---NGTDYWIVKNSWGTVWGEK 313

Query: 323 GYIRILR----DAGLCGIATAASYPVA 345
           GYIR+ R      GLCGIA  +SYP A
Sbjct: 314 GYIRMQRGIADKEGLCGIAMDSSYPTA 340


>gi|102140014|gb|ABF70145.1| cysteine protease, putative [Musa acuminata]
          Length = 373

 Score =  354 bits (908), Expect = 4e-95,   Method: Compositional matrix adjust.
 Identities = 174/320 (54%), Positives = 221/320 (69%), Gaps = 13/320 (4%)

Query: 33  MHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFS 92
           + + S+ E+H +WMA+HGRTYKD  EK  RL IFK N+EYIE  N  G R Y+L  N+F+
Sbjct: 26  LGDASMAERHVEWMARHGRTYKDAAEKEQRLGIFKSNVEYIESFNA-GKRKYQLAANQFA 84

Query: 93  DLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQC 152
           DLT+EEF+A++TG+    PS +      + F++ +++ VP S+DWR KGAVT +KDQG C
Sbjct: 85  DLTHEEFKAMHTGFK---PSGTGAKKAGNGFRHGSLSSVPDSVDWRSKGAVTPVKDQGLC 141

Query: 153 GSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST--DNHGCSGGLMDKAFEYIIENKG 210
           GSCWAF+ VAAVEGIT+I  GKLI LSEQQLVDC     + GC GG MD AFE+I+ N G
Sbjct: 142 GSCWAFTVVAAVEGITKIVTGKLISLSEQQLVDCDVHGKDQGCQGGDMDAAFEFIVNNGG 201

Query: 211 LATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDA-SG 269
           + +EA+YPY   +  C+      V ATI  +ED+P  DE+AL +AV+NQPVSV +DA S 
Sbjct: 202 ITSEANYPYEEVQRLCNAHNASFVVATIESHEDVPTNDEKALRKAVANQPVSVGIDAGSS 261

Query: 270 RAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILR 329
             F  Y  GV + +CG + DH V VVG+GT  +  G KYWL KNSWGETWGE+GYIR+ R
Sbjct: 262 LDFQLYSGGVFSGECGTDLDHAVTVVGYGTTSD--GTKYWLAKNSWGETWGENGYIRMER 319

Query: 330 DA----GLCGIATAASYPVA 345
           D     GLCGIA  ASYP A
Sbjct: 320 DVAAKEGLCGIAMQASYPTA 339


>gi|356543114|ref|XP_003540008.1| PREDICTED: LOW QUALITY PROTEIN: KDEL-tailed cysteine endopeptidase
           CEP1-like [Glycine max]
          Length = 343

 Score =  354 bits (908), Expect = 4e-95,   Method: Compositional matrix adjust.
 Identities = 175/328 (53%), Positives = 225/328 (68%), Gaps = 9/328 (2%)

Query: 23  CASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNR 82
           C SQV S R +H+ S+ E+HEQWM ++G+ YKD  E   R  IF+ N+E+IE  N  GN+
Sbjct: 20  CTSQVKS-RKLHDASMYERHEQWMEKYGKVYKDSAEMQKRFLIFENNVEFIESFNAAGNK 78

Query: 83  TYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGA 142
            YKL  N  +D TNEEF A + GY        R +++ + FKY+NVTD+P ++DWR+KG 
Sbjct: 79  PYKLSINHLADQTNEEFMASHKGYKGSHWQGLRITTQ-TPFKYENVTDIPWAVDWRQKGD 137

Query: 143 VTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTDNHGCSGGLMDKAF 202
           VT IKDQ QCG+CWAFSAVAA EGI QIT G L+ LSE++LVDC + +HGC GGLM+  F
Sbjct: 138 VTSIKDQAQCGNCWAFSAVAATEGIYQITTGNLVSLSEKELVDCDSVDHGCDGGLMEHGF 197

Query: 203 EYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQ-PV 261
           E+II+N G+++EA+YPY    GTCD  KE +  A I+ YE +P   E+ L +AV+NQ  +
Sbjct: 198 EFIIKNGGISSEANYPYTAVNGTCDTNKEASPVAQITGYETVPVNCEEELQKAVANQLTM 257

Query: 262 SVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGE 321
           SV +DA G AF FY SGV    CG   DHGV  VG+G+ +   G +YW++KNSWG  WGE
Sbjct: 258 SVSIDAGGSAFQFYPSGVFTGQCGTQLDHGVTAVGYGSTDY--GTQYWIVKNSWGTQWGE 315

Query: 322 SGYIRILR--DA--GLCGIATAASYPVA 345
            GYIR+LR  DA  GLCGIA  ASYP A
Sbjct: 316 EGYIRMLRGIDAQEGLCGIAMDASYPTA 343


>gi|356543118|ref|XP_003540010.1| PREDICTED: LOW QUALITY PROTEIN: vignain-like [Glycine max]
          Length = 339

 Score =  353 bits (907), Expect = 6e-95,   Method: Compositional matrix adjust.
 Identities = 174/339 (51%), Positives = 225/339 (66%), Gaps = 15/339 (4%)

Query: 13  MFVIIILVITCASQVVSGRSMHEPS--IVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNL 70
           +  +++L+  C SQV+S R++HE S  + E+HEQW  ++G+ YKD  EK  RL IFK N+
Sbjct: 10  ILALVLLLPICISQVMS-RNLHEASXCMSERHEQWTKKYGKVYKDAAEKQKRLLIFKDNV 68

Query: 71  EYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTD 130
           E+IE  N  GN+ YKL  N  +D TNEEF A + GY        + S   + FKY+N+T 
Sbjct: 69  EFIESFNAAGNKPYKLSINHLTDQTNEEFVASHNGYKH------KGSHSQTPFKYENITG 122

Query: 131 VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTDN 190
           VP ++DWRE GAV  +KDQGQCG+CWAFS VA  EGI QIT   L+ LSEQ+LVDC + +
Sbjct: 123 VPNAVDWRENGAVXAMKDQGQCGNCWAFSTVATTEGIYQITTSMLMSLSEQELVDCDSVD 182

Query: 191 HGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQ 250
           HGC GG M+  FE+I +N G+++EA+YPY   +GT D  KE + AA I  YE +P   E 
Sbjct: 183 HGCDGGYMEGGFEFIXKNGGISSEANYPYTAVDGTYDANKEASPAAQIKGYETVPANSED 242

Query: 251 ALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWL 310
           AL +AV+NQPVSV +D  G AF F  SGV    CG   DHGV  VG+G+ ++  G +YW+
Sbjct: 243 ALQKAVANQPVSVTIDVGGSAFQFNSSGVFTGQCGTQLDHGVTAVGYGSTDD--GTQYWI 300

Query: 311 IKNSWGETWGESGYIRILR--DA--GLCGIATAASYPVA 345
           +KNSWG  WGE GYIR+ R  DA  GLCGIA  ASYP A
Sbjct: 301 VKNSWGTQWGEEGYIRMQRGTDAQEGLCGIAMDASYPTA 339


>gi|125551397|gb|EAY97106.1| hypothetical protein OsI_19029 [Oryza sativa Indica Group]
          Length = 350

 Score =  353 bits (907), Expect = 6e-95,   Method: Compositional matrix adjust.
 Identities = 179/353 (50%), Positives = 234/353 (66%), Gaps = 28/353 (7%)

Query: 12  PMFVIIILVITC------ASQVVSGRSMH-EPSIVEKHEQWMAQHGRTYKDELEKAMRLN 64
           P+ + I+  I C       + V + R +  + ++  +HE+WMAQHGR YKD  EKA RL 
Sbjct: 7   PLLLAILCCIVCLYSSSGGAIVAAARELGGDAAMAARHERWMAQHGRVYKDAAEKARRLE 66

Query: 65  IFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYT---GYNRPVPSVSRQSSRPS 121
           +FK N+ +IE  N  G   Y LG N+F+DLT+EEF+A  T   G++ P   V     R S
Sbjct: 67  VFKANVAFIESFNAGGKNRYWLGVNQFADLTSEEFKATMTNSKGFSTPNNGV-----RVS 121

Query: 122 T-FKYQNVTD--VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIEL 178
           T FKY+NV+   +P S+DWR KGAVT IKDQGQCG CWAFSAVAA+EGI +++ GKLI L
Sbjct: 122 TGFKYENVSADALPASVDWRTKGAVTRIKDQGQCGCCWAFSAVAAMEGIVKLSTGKLISL 181

Query: 179 SEQQLVDCSTD--NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAA 236
           SEQ+LVDC  D  + GC GG +D AF++I+ N GL  EA+YPY  E+G C       VAA
Sbjct: 182 SEQELVDCDVDGNDQGCEGGEIDGAFQFILSNGGLTAEANYPYTAEDGRCKTTAAADVAA 241

Query: 237 TISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVG 296
           +I  YED+P  DE +L++AV+ QPVSV VDAS   F FY  GV+  +CG + DHGV V+G
Sbjct: 242 SIRGYEDVPANDEPSLMKAVAGQPVSVAVDAS--KFQFYGGGVMAGECGTSLDHGVTVIG 299

Query: 297 FGTAEEENGAKYWLIKNSWGETWGESGYIRILRDA----GLCGIATAASYPVA 345
           +G A +  G KYWL+KNSWG TWGE+GY+R+ +D     G+CG+A   SYP A
Sbjct: 300 YGAASD--GTKYWLVKNSWGTTWGEAGYLRMEKDIDDKRGMCGLAMQPSYPTA 350


>gi|356517358|ref|XP_003527354.1| PREDICTED: LOW QUALITY PROTEIN: vignain-like [Glycine max]
 gi|356577767|ref|XP_003556994.1| PREDICTED: LOW QUALITY PROTEIN: vignain-like [Glycine max]
          Length = 343

 Score =  353 bits (906), Expect = 7e-95,   Method: Compositional matrix adjust.
 Identities = 171/342 (50%), Positives = 232/342 (67%), Gaps = 14/342 (4%)

Query: 13  MFVIIILVITCASQV---VSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQN 69
            + I + ++ C+  +   V+ R++ + S+ E+HE+WM ++ + YKD  E+  R  IFK+N
Sbjct: 7   FYQISLALLFCSGFLAFQVTCRTLQDASMYERHEEWMGRYAKVYKDPQERERRFKIFKEN 66

Query: 70  LEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
           + YIE  N   N+ Y LG N+F+DLTNEEF A     NR    +    +R +TFKY+NVT
Sbjct: 67  VNYIEAFNNAANKPYTLGINQFADLTNEEFIAPR---NRFKGHMCSSITRTTTFKYENVT 123

Query: 130 DVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST- 188
            +P+++DWR+KGAVT IKDQGQCG CWAFSAVAA EGI  ++ GKLI LSEQ++VDC T 
Sbjct: 124 AIPSTVDWRQKGAVTPIKDQGQCGCCWAFSAVAATEGIHALSAGKLISLSEQEVVDCDTK 183

Query: 189 -DNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKG 247
            ++ GC+GG MD AF++II+N GL  E +YPY+  +G C+ +      ATI+ YED+P  
Sbjct: 184 GEDQGCAGGFMDGAFKFIIQNHGLNNEPNYPYKAVDGKCNAKAAANHVATITGYEDVPVN 243

Query: 248 DEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAK 307
           +E+AL +AV+NQPVSV +DASG  F FY+SGV    CG   DHGV  VG+G + +  G +
Sbjct: 244 NEKALQKAVANQPVSVAIDASGSDFQFYQSGVFTGSCGTELDHGVTAVGYGVSAD--GTE 301

Query: 308 YWLIKNSWGETWGESGYIRILR----DAGLCGIATAASYPVA 345
           YWL+KNSWG  WGE GYIR+ R    + GL GIA  ASYP A
Sbjct: 302 YWLVKNSWGTEWGEEGYIRMQRGVKAEEGLXGIAMMASYPTA 343


>gi|297826875|ref|XP_002881320.1| hypothetical protein ARALYDRAFT_321132 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297327159|gb|EFH57579.1| hypothetical protein ARALYDRAFT_321132 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 341

 Score =  353 bits (905), Expect = 8e-95,   Method: Compositional matrix adjust.
 Identities = 174/342 (50%), Positives = 233/342 (68%), Gaps = 13/342 (3%)

Query: 10  IIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQN 69
           ++ +F I+    + +       + HEPS +EKHEQWMA+  R Y+DELEK MR ++FK+N
Sbjct: 7   LVTIFTILFTTFSISQATSRTVTFHEPSSLEKHEQWMARFSRVYRDELEKQMRRDVFKKN 66

Query: 70  LEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
           L++IE  NK+GN++YKLG NEF+D TNEEF A++TG       V  ++    ++   N++
Sbjct: 67  LKFIENFNKKGNKSYKLGVNEFADWTNEEFLAIHTGLKGLSSKVVDETISSRSW---NIS 123

Query: 130 D-VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST 188
           D V  S DWR +GAVT +K QGQCG CWAFSAVAAVEG+T+I  G L+ LSEQQL+DC  
Sbjct: 124 DMVGVSKDWRAEGAVTPVKYQGQCGCCWAFSAVAAVEGVTKIAGGNLVSLSEQQLLDCDR 183

Query: 189 D-NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKG 247
           + + GC GG+M  AF YII+N+G+A+E DY Y+  +G C +      AA IS ++ +P  
Sbjct: 184 EYDRGCDGGIMSDAFNYIIQNRGIASENDYSYQGSDGRCRSSARP--AARISGFQTVPSN 241

Query: 248 DEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAK 307
           +EQALL+AVS QPVSV +DA+G  F  Y  GV +  CG + +H V  VG+GT+++  G K
Sbjct: 242 NEQALLEAVSRQPVSVSMDANGDGFMHYSGGVYDGPCGTSSNHAVTFVGYGTSQD--GTK 299

Query: 308 YWLIKNSWGETWGESGYIRILRDA----GLCGIATAASYPVA 345
           YWL KNSWGETWGE GYIRI RD     G+CG+A  A YPVA
Sbjct: 300 YWLAKNSWGETWGEKGYIRIRRDVAWPQGMCGVAQYAFYPVA 341


>gi|388512155|gb|AFK44139.1| unknown [Medicago truncatula]
          Length = 340

 Score =  352 bits (903), Expect = 2e-94,   Method: Compositional matrix adjust.
 Identities = 170/327 (51%), Positives = 224/327 (68%), Gaps = 15/327 (4%)

Query: 25  SQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTY 84
           + V+S +    PS+ E+HEQWM+++G+ YKD +EK  R  IFK N+E+IE  N   N+ Y
Sbjct: 23  TNVMSRKLYESPSLQERHEQWMSEYGKLYKDAIEKEKRFMIFKDNVEFIESFNAADNKPY 82

Query: 85  KLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVT 144
           KL  N  +DLT +EF+A   GY +    + R+ +  S FKY+NVT +P ++DWR KGAVT
Sbjct: 83  KLSVNHLADLTLDEFKASRNGYKK----IDREFATTS-FKYENVTAIPEAVDWRVKGAVT 137

Query: 145 HIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST--DNHGCSGGLMDKAF 202
            IKDQGQCGSCWAFS VAA+EGI QIT GKLI LSEQ+LVDC T  ++ GC GGLM+  F
Sbjct: 138 PIKDQGQCGSCWAFSTVAAIEGINQITTGKLISLSEQELVDCDTKGEDQGCEGGLMEDGF 197

Query: 203 EYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVS 262
           E+II+N G+ +E +YPY+  +G+C +    A  A I+ YE +P   E +LL+AV+NQP+S
Sbjct: 198 EFIIKNGGITSETNYPYKAADGSC-SAATTAPVAKITGYEKVPVNSEISLLKAVANQPIS 256

Query: 263 VCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGES 322
           V +DAS  +F FY SG+   +CG   DHGV  VG+G+A   NG  YW++KNSWG  WGE 
Sbjct: 257 VSIDASDSSFMFYSSGIYTGECGTELDHGVTAVGYGSA---NGTDYWIVKNSWGTVWGEK 313

Query: 323 GYIRILR----DAGLCGIATAASYPVA 345
           GYIR+ R      GLCGIA  +SYP A
Sbjct: 314 GYIRMQRGIADKEGLCGIAMDSSYPTA 340


>gi|413953667|gb|AFW86316.1| hypothetical protein ZEAMMB73_635707 [Zea mays]
          Length = 340

 Score =  352 bits (902), Expect = 2e-94,   Method: Compositional matrix adjust.
 Identities = 169/340 (49%), Positives = 228/340 (67%), Gaps = 16/340 (4%)

Query: 13  MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEY 72
           +  I+     C + + +     + ++V +HEQWMAQ+ R YKD  EKA R  +FK N+++
Sbjct: 8   ILAILGFAFFCGAALAARDLSDDSAMVARHEQWMAQYSRVYKDASEKARRFEVFKANVKF 67

Query: 73  IEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT--D 130
           IE  N  GN  + LG N+F+DLTN+EFR++ T  N+   S + +   P+ F+Y+NV+   
Sbjct: 68  IESFNAGGNNKFWLGVNQFADLTNDEFRSIKT--NKGFKSSNMK--IPTGFRYENVSVDA 123

Query: 131 VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST-- 188
           +PT+IDWR KGAVT IKDQGQCG CWAFSAVAA EGI +I+ GKL+ L+EQ+LVDC    
Sbjct: 124 LPTTIDWRTKGAVTPIKDQGQCGCCWAFSAVAATEGIVKISTGKLVSLAEQELVDCDVHG 183

Query: 189 DNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGD 248
           ++ GC GGLMD AF++II N GL TE+ YPY   +G C +      AATI  YED+P  D
Sbjct: 184 EDQGCEGGLMDDAFKFIINNGGLTTESSYPYTAADGKCKSGSNS--AATIKGYEDVPAND 241

Query: 249 EQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKY 308
           E AL++AV+NQPVSV VD     F FY SGV+   CG + DHG+A +G+G  +  +G KY
Sbjct: 242 EAALMKAVANQPVSVAVDGGDMTFQFYSSGVMTGSCGTDLDHGIAAIGYG--KTSDGTKY 299

Query: 309 WLIKNSWGETWGESGYIRILRDA----GLCGIATAASYPV 344
           WL+KNSWG TWGE+GY+R+ +D     G+CG+A   SYP 
Sbjct: 300 WLMKNSWGTTWGENGYLRMEKDISDKRGMCGLAMEPSYPT 339


>gi|356515040|ref|XP_003526209.1| PREDICTED: thiol protease SEN102-like [Glycine max]
          Length = 342

 Score =  351 bits (901), Expect = 2e-94,   Method: Compositional matrix adjust.
 Identities = 172/350 (49%), Positives = 230/350 (65%), Gaps = 13/350 (3%)

Query: 1   MVLKFEKSFIIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKA 60
           M    +K  ++ +F+ + + I   SQV+  R +H+ ++ E+HE WMA++G+ YKD  EK 
Sbjct: 1   MAFTGQKQHMLALFLFLAVGI---SQVMP-RKLHQTALRERHENWMAEYGKMYKDAAEKE 56

Query: 61  MRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRP 120
            R  IFK N+E+IE  N  GN+ YKLG N  +DLT EEF+    G  R     S  + + 
Sbjct: 57  KRFQIFKDNVEFIESFNAAGNKPYKLGVNHLADLTLEEFKDSRNGLKRTY-EFSTTTFKL 115

Query: 121 STFKYQNVTDVPTSIDWREKGAVTHIKDQG-QCGSCWAFSAVAAVEGITQITRGKLIELS 179
           + FKY+NVTD+P +IDWR KGAVT IKDQG QCGSCWAFS +AA EGI QI+ G L+ LS
Sbjct: 116 NGFKYENVTDIPEAIDWRVKGAVTPIKDQGDQCGSCWAFSTIAATEGIHQISTGNLVSLS 175

Query: 180 EQQLVDCSTDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATIS 239
           EQ+LVDC + + GC GG M+  FE+II+N G+ +E +YPY+  +GTC+     +  A I 
Sbjct: 176 EQELVDCDSVDDGCEGGFMEDGFEFIIKNGGITSETNYPYKGVDGTCNTTIAASPVAQIK 235

Query: 240 KYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGT 299
            YE +P   E+AL +AV+NQPVSV + A+   F FY SG+ N +CG + DHGV  VG+GT
Sbjct: 236 GYEIVPSYSEEALQKAVANQPVSVSIHATNATFMFYSSGIYNGECGTDLDHGVTAVGYGT 295

Query: 300 AEEENGAKYWLIKNSWGETWGESGYIRILR----DAGLCGIATAASYPVA 345
              ENG  YW++KNSWG  WGE GYIR+ R      G+CGIA  +SYP A
Sbjct: 296 ---ENGTDYWIVKNSWGTQWGEKGYIRMHRGIAAKHGICGIALDSSYPTA 342


>gi|310656789|gb|ADP02218.1| Peptidase_C1 domain-containing protein [Triticum aestivum]
          Length = 341

 Score =  351 bits (900), Expect = 3e-94,   Method: Compositional matrix adjust.
 Identities = 170/339 (50%), Positives = 226/339 (66%), Gaps = 15/339 (4%)

Query: 13  MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEY 72
           +  I+  +  C+S V+S R + + ++VE+HEQWMA+  R YKD  EKA R  +FK N+ +
Sbjct: 8   LLAIVGCICLCSSAVLSARELGDTAMVERHEQWMAKFNRVYKDGTEKAQRFEVFKANVAF 67

Query: 73  IEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT--D 130
           IE  N E NR + LG N+F+DLTN+EFRA  T  N+ +     ++  P+ FKY NV+   
Sbjct: 68  IESFNAE-NRKFWLGVNQFTDLTNDEFRATKT--NKGLKMSGGRA--PTGFKYSNVSIDA 122

Query: 131 VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD- 189
           +PT++DWR KG VT IKDQGQCG CWAFSAV A EGI +++ GKLI LSEQ+LVDC    
Sbjct: 123 LPTAVDWRTKGVVTPIKDQGQCGCCWAFSAVVATEGIVKLSTGKLISLSEQELVDCDVHG 182

Query: 190 -NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGD 248
            + GC GG MD AF++II+N GL TEA+YPY  ++G C         ATI  YED+P  D
Sbjct: 183 VDQGCEGGEMDDAFKFIIKNGGLTTEANYPYTAQDGQCKTSIASNSVATIKGYEDVPAND 242

Query: 249 EQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKY 308
           E +L++AV+NQPVSV VD     F  Y  GV+   CG + DHG+A +G+G   +  G KY
Sbjct: 243 ESSLMKAVANQPVSVAVDGGDVIFQHYSGGVMTGSCGTDLDHGIAAIGYGMTSD--GTKY 300

Query: 309 WLIKNSWGETWGESGYIRILRD----AGLCGIATAASYP 343
           WL+KNSWG TWGESGY+R+ +D    +G+CG+A   SYP
Sbjct: 301 WLLKNSWGTTWGESGYLRMEKDISDKSGMCGLAMQPSYP 339


>gi|302143416|emb|CBI21977.3| unnamed protein product [Vitis vinifera]
          Length = 297

 Score =  350 bits (899), Expect = 4e-94,   Method: Compositional matrix adjust.
 Identities = 178/306 (58%), Positives = 215/306 (70%), Gaps = 15/306 (4%)

Query: 46  MAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTG 105
           MA++GR YKD  EK  R  IFK N+  IE  NK  ++TYKL  NEF+DLTNEEFR+L   
Sbjct: 1   MARYGRMYKDANEKEKRFKIFKDNVARIESFNKAMDKTYKLSINEFADLTNEEFRSLRNR 60

Query: 106 YNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVE 165
           +   +       S  +TFKY+NVT VP++IDWR+KGAVT IKDQ QCG CWAFSAVAA E
Sbjct: 61  FKAHI------CSEATTFKYENVTAVPSTIDWRKKGAVTPIKDQQQCGCCWAFSAVAATE 114

Query: 166 GITQITRGKLIELSEQQLVDCST--DNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEE 223
           GITQIT GKLI LSEQ+LVDC T  +N GCSGGLMD AF + I+  GLA+EA YPY  ++
Sbjct: 115 GITQITTGKLISLSEQELVDCDTGGENQGCSGGLMDDAFRF-IKIHGLASEATYPYEGDD 173

Query: 224 GTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNAD 283
           GTC+++KE   AA I  YED+P  +E+AL +AV++QPV+V +DA G  F FY SGV    
Sbjct: 174 GTCNSKKEAHPAAKIKGYEDVPANNEKALQKAVAHQPVAVAIDAGGFEFQFYTSGVFTGQ 233

Query: 284 CGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRDA----GLCGIATA 339
           CG   DHGVA VG+G    ++G  YWL+KNSWG  WGE GYIR+ RD     GLCGIA  
Sbjct: 234 CGTELDHGVAAVGYGIG--DDGMMYWLVKNSWGTGWGEEGYIRMQRDVTAKEGLCGIAMQ 291

Query: 340 ASYPVA 345
           ASYP A
Sbjct: 292 ASYPTA 297


>gi|116786779|gb|ABK24233.1| unknown [Picea sitchensis]
          Length = 463

 Score =  350 bits (899), Expect = 4e-94,   Method: Compositional matrix adjust.
 Identities = 172/351 (49%), Positives = 236/351 (67%), Gaps = 21/351 (5%)

Query: 10  IIPMFVIIILVITCASQVVSGRSM---------HEPSIVEKHEQWMAQHGRTYKDELEKA 60
           I+ +F ++ L     S   +  S+          + +I+E +E W+AQH + Y    EK 
Sbjct: 3   ILLLFAVLALSAMAGSASRADFSIIGYDSKDLREDDAIMELYELWLAQHKKAYNGLGEKQ 62

Query: 61  MRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRP 120
            R ++FK N  YI + N +GN +YKLG N+F+DL++EEF+A Y G    + +  R S+ P
Sbjct: 63  NRFSVFKDNFLYIHQHNNQGNPSYKLGLNQFADLSHEEFKATYLG--AKLDTKKRLSNSP 120

Query: 121 ST-FKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELS 179
           S  ++Y +  D+P SIDWREKGAVT +KDQG CGSCWAFS VAAVEGI QI  G L  LS
Sbjct: 121 SPRYQYSDGEDLPESIDWREKGAVTAVKDQGSCGSCWAFSTVAAVEGINQIVTGNLTSLS 180

Query: 180 EQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATI 238
           EQ+LVDC T  N GC+GGLMD AF++II N GL +E DYPY+  +G+CD  ++ A   TI
Sbjct: 181 EQELVDCDTSYNQGCNGGLMDYAFQFIINNGGLDSEDDYPYKANDGSCDAYRKNAHVVTI 240

Query: 239 SKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFG 298
             YED+P+ DE++L +A +NQP+SV ++ASGRAF FY+SGV  + CG   DHGV +VG+G
Sbjct: 241 DDYEDVPENDEKSLKKAAANQPISVAIEASGRAFQFYESGVFTSTCGTQLDHGVTLVGYG 300

Query: 299 TAEEENGAKYWLIKNSWGETWGESGYIRILRD-----AGLCGIATAASYPV 344
           +   E+G  YW++KNSWG++WGE G+IR+ R+      G+CGIA  ASYP+
Sbjct: 301 S---ESGTDYWIVKNSWGKSWGEKGFIRLQRNIEGVSTGMCGIAMEASYPL 348


>gi|357458911|ref|XP_003599736.1| Cysteine proteinase [Medicago truncatula]
 gi|357474719|ref|XP_003607644.1| Cysteine proteinase [Medicago truncatula]
 gi|355488784|gb|AES69987.1| Cysteine proteinase [Medicago truncatula]
 gi|355508699|gb|AES89841.1| Cysteine proteinase [Medicago truncatula]
          Length = 340

 Score =  350 bits (898), Expect = 5e-94,   Method: Compositional matrix adjust.
 Identities = 169/325 (52%), Positives = 222/325 (68%), Gaps = 16/325 (4%)

Query: 28  VSGRSMHEP-SIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKL 86
           V  R ++E  S+ E+HEQWM +HG+ Y+D +EK  R  IFK N+E+IE  N   N+ YKL
Sbjct: 25  VMSRKLYESLSLQERHEQWMTEHGKVYEDAIEKEKRFMIFKDNVEFIESFNAADNQPYKL 84

Query: 87  GTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHI 146
             N  +DLT +EF+A   GY +    + R+ +  S FKY+NVT +P ++DWR KGAVT I
Sbjct: 85  SVNHLADLTLDEFKASRNGYKK----IDREFTTTS-FKYENVTAIPAAVDWRVKGAVTPI 139

Query: 147 KDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST--DNHGCSGGLMDKAFEY 204
           KDQGQCGSCWAFS VAA EGI QIT GKL+ LSEQ+LVDC T  ++ GC GGLM+  FE+
Sbjct: 140 KDQGQCGSCWAFSTVAATEGINQITTGKLVSLSEQELVDCDTKGEDQGCEGGLMEDGFEF 199

Query: 205 IIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVC 264
           II+N G+ +E +YPY+  +G+C+      VA  I+ YE +P   E++LL+AV+NQP+SV 
Sbjct: 200 IIKNGGITSETNYPYKAADGSCNTATTTPVAK-ITGYEKVPVNSEKSLLKAVANQPISVS 258

Query: 265 VDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGY 324
           +DAS  +F FY SG+   +CG   DHGV  VG+G+A   NG  YW++KNSWG  WGE GY
Sbjct: 259 IDASDSSFMFYSSGIYTGECGTELDHGVTAVGYGSA---NGTDYWIVKNSWGTVWGEKGY 315

Query: 325 IRILR----DAGLCGIATAASYPVA 345
           IR+ R      GLCGIA  +SYP A
Sbjct: 316 IRMQRGIAAKEGLCGIAMDSSYPTA 340


>gi|77554625|gb|ABA97421.1| Vignain precursor, putative [Oryza sativa Japonica Group]
 gi|222630746|gb|EEE62878.1| hypothetical protein OsJ_17681 [Oryza sativa Japonica Group]
          Length = 350

 Score =  350 bits (898), Expect = 6e-94,   Method: Compositional matrix adjust.
 Identities = 177/352 (50%), Positives = 232/352 (65%), Gaps = 28/352 (7%)

Query: 12  PMFVIIILVITC------ASQVVSGRSMH-EPSIVEKHEQWMAQHGRTYKDELEKAMRLN 64
           P+ + I+  I C       + V + R +  + ++  +HE+WMAQHGR YKD  EKA RL 
Sbjct: 7   PLLLAILCCIVCLYSSSGGAIVAAARELGGDAAMAARHERWMAQHGRVYKDAAEKARRLE 66

Query: 65  IFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYT---GYNRPVPSVSRQSSRPS 121
           +FK N+ +IE  N  G   Y LG N+F+DLT+EEF+A  T   G++ P   V     R S
Sbjct: 67  VFKANVAFIESFNAGGKNRYWLGVNQFADLTSEEFKATMTNSKGFSTPNNGV-----RVS 121

Query: 122 T-FKYQNVTD--VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIEL 178
           T FKY+NV+   +P S+DWR KGAVT IKDQGQCG CWAFSAVAA+EG  +++ GKLI L
Sbjct: 122 TGFKYENVSADALPASVDWRTKGAVTRIKDQGQCGCCWAFSAVAAMEGFVKLSTGKLISL 181

Query: 179 SEQQLVDCSTD--NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAA 236
           SEQ+LVDC  D  + GC GG +D AF++I+ N GL  EA+YPY  E+G C       VAA
Sbjct: 182 SEQELVDCDVDGNDQGCEGGEIDGAFQFILSNGGLTAEANYPYTAEDGRCKTTAAADVAA 241

Query: 237 TISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVG 296
           +I  YED+P  DE +L++AV+ QPVSV VDAS   F FY  GV+  +CG + DHGV V+G
Sbjct: 242 SIRGYEDVPANDEPSLMKAVAGQPVSVAVDAS--KFQFYGGGVMAGECGTSLDHGVTVIG 299

Query: 297 FGTAEEENGAKYWLIKNSWGETWGESGYIRILRDA----GLCGIATAASYPV 344
           +G A +  G KYWL+KNSWG TWGE+GY+R+ +D     G+CG+A   SYP 
Sbjct: 300 YGAASD--GTKYWLVKNSWGTTWGEAGYLRMEKDIDDKRGMCGLAMQPSYPT 349


>gi|414588010|tpg|DAA38581.1| TPA: hypothetical protein ZEAMMB73_156486 [Zea mays]
          Length = 347

 Score =  349 bits (896), Expect = 1e-93,   Method: Compositional matrix adjust.
 Identities = 175/355 (49%), Positives = 238/355 (67%), Gaps = 30/355 (8%)

Query: 11  IPMFVIIIL----VITCASQVVSGRSM---HEPSIVEKHEQWMAQHGRTYKDELEKAMRL 63
           IP  +++ +    V  C++ V++ R +    E ++V +HEQWM QHGR YKDE +KA R 
Sbjct: 3   IPKALLLAILGCGVCLCSAAVLAARELGGDDELAMVARHEQWMVQHGRVYKDETDKAHRF 62

Query: 64  NIFKQNLEYIEKANKE---GNRTYKLGTNEFSDLTNEEFRALYT--GYNRPVPSVSRQSS 118
            +FK N+++IE  N     GNR + LG N+F+DLTN+EFRA  T  G+N  V  V     
Sbjct: 63  LVFKANVKFIESFNAAAAAGNRKFWLGVNQFADLTNDEFRATKTNKGFNPNVVKV----- 117

Query: 119 RPSTFKYQNVT--DVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLI 176
            P+ F+YQN++   +P ++DWR KGAVT IKDQGQCG CWAFSAVAA EGI +I+ GKL 
Sbjct: 118 -PTGFRYQNLSIDALPQTVDWRTKGAVTPIKDQGQCGCCWAFSAVAATEGIVKISTGKLT 176

Query: 177 ELSEQQLVDCST--DNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAV 234
            LSEQ+LVDC    ++ GC+GG MD AF++II+N GL TE++YPY  ++G C +      
Sbjct: 177 SLSEQELVDCDVHGEDQGCNGGEMDDAFKFIIKNGGLTTESNYPYTAQDGQCKSGSNG-- 234

Query: 235 AATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAV 294
           AATI  YED+P  DE AL++AV++QPVSV VD     F FY  GV+   CG + DHG+A 
Sbjct: 235 AATIKGYEDVPANDEAALMKAVASQPVSVAVDGGDMTFQFYSGGVMTGSCGTDLDHGIAA 294

Query: 295 VGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD----AGLCGIATAASYPVA 345
           +G+G  +  +G KYWL+KNSWG TWGE+G++R+ +D     G+CG+A   SYP A
Sbjct: 295 IGYG--KTSDGTKYWLMKNSWGTTWGENGFLRMEKDIADKKGMCGLAMQPSYPTA 347


>gi|116787404|gb|ABK24495.1| unknown [Picea sitchensis]
 gi|224286306|gb|ACN40861.1| unknown [Picea sitchensis]
          Length = 452

 Score =  349 bits (896), Expect = 1e-93,   Method: Compositional matrix adjust.
 Identities = 172/327 (52%), Positives = 228/327 (69%), Gaps = 16/327 (4%)

Query: 27  VVSGRSMHEP-SIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYK 85
           ++S + + E  +I+E +E W+A+H R Y    EK  R ++FK N  YI + N +GNR+YK
Sbjct: 26  IISSKDLREDDAIMELYELWLAEHKRAYNGLDEKQKRFSVFKDNFLYIHEHN-QGNRSYK 84

Query: 86  LGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQ--NVTDVPTSIDWREKGAV 143
           LG N+F+DL++EEF+A Y G         ++ SRP + +YQ  +  D+P SIDWREKGAV
Sbjct: 85  LGLNQFADLSHEEFKATYLGAKL---DTKKRLSRPPSRRYQYSDGEDLPESIDWREKGAV 141

Query: 144 THIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAF 202
           T +KDQG CGSCWAFS VAAVEGI QI  G LI LSEQ+LVDC T  N GC+GGLMD AF
Sbjct: 142 TSVKDQGSCGSCWAFSTVAAVEGINQIVTGDLISLSEQELVDCDTSYNQGCNGGLMDYAF 201

Query: 203 EYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVS 262
           E+II N GL +E DYPY   +G+CD+ ++ A   TI  YED+P+ DE++L +A +NQP+S
Sbjct: 202 EFIINNGGLDSEEDYPYTAYDGSCDSYRKNAHVVTIDDYEDVPENDEKSLKKAAANQPIS 261

Query: 263 VCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGES 322
           V ++ASGR F FY SGV  + CG   DHGV +VG+G+   E+G  YW +KNSWG++WGE 
Sbjct: 262 VAIEASGREFQFYDSGVFTSTCGTQLDHGVTLVGYGS---ESGTDYWTVKNSWGKSWGEE 318

Query: 323 GYIRILRD-----AGLCGIATAASYPV 344
           G+IR+ R+      G+CGIA  ASYPV
Sbjct: 319 GFIRLQRNIEVASTGMCGIAMEASYPV 345


>gi|356517308|ref|XP_003527330.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like [Glycine
           max]
          Length = 342

 Score =  349 bits (896), Expect = 1e-93,   Method: Compositional matrix adjust.
 Identities = 173/344 (50%), Positives = 231/344 (67%), Gaps = 10/344 (2%)

Query: 8   SFIIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFK 67
           SF    ++I+ LV+   +  V  R + E    E+HE+WMAQ+GR YKD  EK  R  +FK
Sbjct: 3   SFSQNHYLILFLVLAVWTSHVMSRRLSEACTSERHEKWMAQYGRVYKDAAEKEKRFQVFK 62

Query: 68  QNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQN 127
            N+ +IE  N  G++ + L  N+F+DL +EEF+AL     +    V  ++S  ++F+Y++
Sbjct: 63  NNVHFIESFNAAGDKPFNLSINQFADLNDEEFKALLINVQKKASWV--ETSTETSFRYES 120

Query: 128 VTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDC- 186
           VT +P +IDWR++GAVT IKDQG+CGSCWAFSAVAA EGI QIT GKL+ LSEQ+LVDC 
Sbjct: 121 VTKIPATIDWRKRGAVTPIKDQGRCGSCWAFSAVAATEGIHQITTGKLVPLSEQELVDCV 180

Query: 187 STDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPK 246
             ++ GC GG +D AFE+I +  G+A+E  YPY+    TC  +KE    A I  YE +P 
Sbjct: 181 KGESEGCIGGYVDDAFEFIAKKGGIASETHYPYKGVNKTCKVKKETHGVAEIKGYEKVPS 240

Query: 247 GDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNA-DCGNNCDHGVAVVGFGTAEEENG 305
            +E+ALL+AV+NQPVSV +DA   AF +Y SG+ NA +CG + +H VAVVG+G A +  G
Sbjct: 241 NNEKALLKAVANQPVSVYIDAGTHAFKYYSSGIFNARNCGTDPNHAVAVVGYGKALD--G 298

Query: 306 AKYWLIKNSWGETWGESGYIRILRDA----GLCGIATAASYPVA 345
           +KYWL+KNSWG  WGE GYIRI RD     GLCGIA    YP A
Sbjct: 299 SKYWLVKNSWGTEWGERGYIRIKRDIRAKEGLCGIAKYPYYPTA 342


>gi|413953668|gb|AFW86317.1| hypothetical protein ZEAMMB73_339067 [Zea mays]
          Length = 433

 Score =  349 bits (895), Expect = 1e-93,   Method: Compositional matrix adjust.
 Identities = 169/338 (50%), Positives = 226/338 (66%), Gaps = 16/338 (4%)

Query: 15  VIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIE 74
            II     C + + +     +  +V +HEQWMAQ+ R YKD  EKA R  +FK N+++IE
Sbjct: 103 AIIGFAFFCGAAMAARDLSDDSVMVARHEQWMAQYSRVYKDASEKARRFEVFKANVQFIE 162

Query: 75  KANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTD--VP 132
             N  GN  + LG N+F+DLTN+EFR+  T  N+ + S + +   P+ F+Y+NV+   +P
Sbjct: 163 SFNAGGNNKFWLGVNQFADLTNDEFRSTKT--NKGLKSSNMKI--PTGFRYENVSADALP 218

Query: 133 TSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST--DN 190
           T+IDWR KGAVT IKDQGQCG CWAFSAVAA EGI +I+ GKL+ L+EQ+LVDC    ++
Sbjct: 219 TTIDWRTKGAVTPIKDQGQCGCCWAFSAVAATEGIVKISTGKLVSLAEQELVDCDVHGED 278

Query: 191 HGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQ 250
            GC GGLMD AF++II+N GL TE+ YPY   +G C +      AATI  YED+P  DE 
Sbjct: 279 QGCEGGLMDDAFKFIIKNGGLTTESSYPYTAADGKCKSGSNS--AATIKGYEDVPANDEA 336

Query: 251 ALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWL 310
           AL++AV+NQPVSV VD     F FY  GV+   CG + DHG+A +G+G  +  +G KYWL
Sbjct: 337 ALMKAVANQPVSVAVDGGDMTFQFYSGGVMTGSCGTDLDHGIAAIGYG--KTSDGTKYWL 394

Query: 311 IKNSWGETWGESGYIRILRDA----GLCGIATAASYPV 344
           +KNSWG TWGE+GY+R+ +D     G+CG+A   SYP 
Sbjct: 395 MKNSWGTTWGENGYLRMEKDISDKRGMCGLAMEPSYPT 432


>gi|297802418|ref|XP_002869093.1| hypothetical protein ARALYDRAFT_491113 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297314929|gb|EFH45352.1| hypothetical protein ARALYDRAFT_491113 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 355

 Score =  349 bits (895), Expect = 1e-93,   Method: Compositional matrix adjust.
 Identities = 168/312 (53%), Positives = 220/312 (70%), Gaps = 11/312 (3%)

Query: 38  IVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNE 97
           ++E  E WM++H + YK   EK  R  +F++NL +I++ N E N +Y LG NEF+DLT+E
Sbjct: 47  LLELFESWMSEHSKVYKSVEEKVHRFEVFRENLMHIDQRNNEIN-SYWLGLNEFADLTHE 105

Query: 98  EFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWA 157
           EF+  Y G  +P  S  RQ S  + F+Y+++TD+P S+DWR+KGAV  +KDQGQCGSCWA
Sbjct: 106 EFKGRYLGLAKPQFSRKRQPS--ANFRYRDITDLPKSVDWRKKGAVAPVKDQGQCGSCWA 163

Query: 158 FSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGLATEAD 216
           FS VAAVEGI QIT G L  LSEQ+L+DC T  N GC+GGLMD AF+YII   GL  E D
Sbjct: 164 FSTVAAVEGINQITTGNLSSLSEQELIDCDTTFNSGCNGGLMDYAFQYIISTGGLHKEDD 223

Query: 217 YPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYK 276
           YPY  EEG C  QKE     TIS YED+P+ D+++L++A+++QPVSV ++ASGR F FYK
Sbjct: 224 YPYLMEEGICQEQKEDVERVTISGYEDVPENDDESLVKALAHQPVSVAIEASGRDFQFYK 283

Query: 277 SGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRDA----G 332
            GV N  CG + DHGVA VG+G+++   G+ Y ++KNSWG  WGE G+IR+ R+     G
Sbjct: 284 GGVFNGQCGTDLDHGVAAVGYGSSK---GSDYVIVKNSWGPRWGEKGFIRMKRNTGKPEG 340

Query: 333 LCGIATAASYPV 344
           LCGI   ASYP 
Sbjct: 341 LCGINKMASYPT 352


>gi|5823020|gb|AAD53012.1|AF089849_1 senescence-specific cysteine protease [Brassica napus]
          Length = 344

 Score =  348 bits (894), Expect = 2e-93,   Method: Compositional matrix adjust.
 Identities = 169/342 (49%), Positives = 234/342 (68%), Gaps = 12/342 (3%)

Query: 11  IPMFVIIILVITCASQVVSGRSM-HEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQN 69
           I +F+I+ LV + +  +   R +  E ++ ++H +WM +HGR Y D  EK  R  +FK+N
Sbjct: 6   IQIFLIVSLVSSFSLSITLSRPLLDEVAMQKRHAEWMTEHGRVYADANEKNNRYAVFKRN 65

Query: 70  LEYIEKANK-EGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNV 128
           +E IE+ N  +   T+KL  N+F+DLTNEEFR++YTG+     SV    ++P++F+YQNV
Sbjct: 66  VERIERLNDVQSGLTFKLAVNQFADLTNEEFRSMYTGFKGN--SVLSSRTKPTSFRYQNV 123

Query: 129 TD--VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDC 186
           +   +P S+DWR+KGAVT IKDQG CGSCWAFSAVAA+EG+ QI +GKLI LSEQ+LVDC
Sbjct: 124 SSDALPVSVDWRKKGAVTPIKDQGLCGSCWAFSAVAAIEGVAQIKKGKLISLSEQELVDC 183

Query: 187 STDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPK 246
            T++ GC GGLMD AF Y I   GL +E++YPY+   GTC+  K K +A +I  +ED+P 
Sbjct: 184 DTNDGGCMGGLMDTAFNYTITIGGLTSESNYPYKSTNGTCNFNKTKQIATSIKGFEDVPA 243

Query: 247 GDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGA 306
            DE+AL++AV++ PVS+ +      F FY SGV + +C  + DHGV  VG+G    +NG 
Sbjct: 244 NDEKALMKAVAHHPVSIGIAGGDIGFQFYSSGVFSGECTTHLDHGVTAVGYG--RSKNGL 301

Query: 307 KYWLIKNSWGETWGESGYIRILRDA----GLCGIATAASYPV 344
           KYW++KNSWG  WGE GY+RI +D     G CG+A  ASYP 
Sbjct: 302 KYWILKNSWGPKWGERGYMRIKKDIKPKHGQCGLAMNASYPT 343


>gi|242072572|ref|XP_002446222.1| hypothetical protein SORBIDRAFT_06g005410 [Sorghum bicolor]
 gi|241937405|gb|EES10550.1| hypothetical protein SORBIDRAFT_06g005410 [Sorghum bicolor]
          Length = 340

 Score =  348 bits (893), Expect = 2e-93,   Method: Compositional matrix adjust.
 Identities = 169/341 (49%), Positives = 224/341 (65%), Gaps = 18/341 (5%)

Query: 13  MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEY 72
           +  I+ L + C + + +     + ++V +HEQWMAQ+ R YKD  EKA R  +FK N+++
Sbjct: 8   ILAILGLALFCGAALAARDLNDDSAMVARHEQWMAQYNRVYKDATEKAQRFEVFKANVKF 67

Query: 73  IEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYN-RPVPSVSRQSSRPSTFKYQNVT-- 129
           IE  N  GNR + LG N+F+DLTN+EFRA  T    +P P        P+ F+Y+NV+  
Sbjct: 68  IESFNAGGNRKFWLGVNQFADLTNDEFRATKTNKGFKPSPV-----KVPTGFRYENVSVD 122

Query: 130 DVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST- 188
            +P SIDWR KGAVT IKDQGQCG CWAFSAVAA EGI +I+  KLI LSEQ+LVDC   
Sbjct: 123 ALPASIDWRTKGAVTPIKDQGQCGCCWAFSAVAATEGIVKISTDKLISLSEQELVDCDVH 182

Query: 189 -DNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKG 247
            ++ GC GGLMD AF++II+N GL TE+ YPY   +G C +      AA I  +ED+P  
Sbjct: 183 GEDQGCEGGLMDDAFKFIIKNGGLTTESSYPYTATDGKCKSGTNS--AANIKGFEDVPAN 240

Query: 248 DEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAK 307
           DE AL++AV+NQPVSV VD     F  Y  GV+   CG + DHG+A +G+G  +  +G K
Sbjct: 241 DEAALMKAVANQPVSVAVDGGDMTFQLYSGGVMTGSCGTDLDHGIAAIGYG--QTSDGTK 298

Query: 308 YWLIKNSWGETWGESGYIRILRDA----GLCGIATAASYPV 344
           YWL+KNSWG TWGE+GY+R+ +D     G+CG+A   SYP 
Sbjct: 299 YWLLKNSWGTTWGENGYLRMEKDISDKRGMCGLAMEPSYPT 339


>gi|18418684|ref|NP_567983.1| Xylem cysteine proteinase 1 [Arabidopsis thaliana]
 gi|71153408|sp|O65493.1|XCP1_ARATH RecName: Full=Xylem cysteine proteinase 1; Short=AtXCP1; Flags:
           Precursor
 gi|6708181|gb|AAF25831.1|AF191027_1 papain-type cysteine endopeptidase XCP1 [Arabidopsis thaliana]
 gi|3080415|emb|CAA18734.1| cysteine proteinase-like protein [Arabidopsis thaliana]
 gi|7270487|emb|CAB80252.1| cysteine proteinase-like protein [Arabidopsis thaliana]
 gi|26449881|dbj|BAC42063.1| putative cysteine proteinase [Arabidopsis thaliana]
 gi|28827736|gb|AAO50712.1| unknown protein [Arabidopsis thaliana]
 gi|332661101|gb|AEE86501.1| Xylem cysteine proteinase 1 [Arabidopsis thaliana]
          Length = 355

 Score =  348 bits (893), Expect = 2e-93,   Method: Compositional matrix adjust.
 Identities = 168/312 (53%), Positives = 220/312 (70%), Gaps = 11/312 (3%)

Query: 38  IVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNE 97
           ++E  E WM++H + YK   EK  R  +F++NL +I++ N E N +Y LG NEF+DLT+E
Sbjct: 47  LLELFESWMSEHSKAYKSVEEKVHRFEVFRENLMHIDQRNNEIN-SYWLGLNEFADLTHE 105

Query: 98  EFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWA 157
           EF+  Y G  +P  S  RQ S  + F+Y+++TD+P S+DWR+KGAV  +KDQGQCGSCWA
Sbjct: 106 EFKGRYLGLAKPQFSRKRQPS--ANFRYRDITDLPKSVDWRKKGAVAPVKDQGQCGSCWA 163

Query: 158 FSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGLATEAD 216
           FS VAAVEGI QIT G L  LSEQ+L+DC T  N GC+GGLMD AF+YII   GL  E D
Sbjct: 164 FSTVAAVEGINQITTGNLSSLSEQELIDCDTTFNSGCNGGLMDYAFQYIISTGGLHKEDD 223

Query: 217 YPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYK 276
           YPY  EEG C  QKE     TIS YED+P+ D+++L++A+++QPVSV ++ASGR F FYK
Sbjct: 224 YPYLMEEGICQEQKEDVERVTISGYEDVPENDDESLVKALAHQPVSVAIEASGRDFQFYK 283

Query: 277 SGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRDA----G 332
            GV N  CG + DHGVA VG+G+++   G+ Y ++KNSWG  WGE G+IR+ R+     G
Sbjct: 284 GGVFNGKCGTDLDHGVAAVGYGSSK---GSDYVIVKNSWGPRWGEKGFIRMKRNTGKPEG 340

Query: 333 LCGIATAASYPV 344
           LCGI   ASYP 
Sbjct: 341 LCGINKMASYPT 352


>gi|356515056|ref|XP_003526217.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like [Glycine
           max]
          Length = 342

 Score =  348 bits (893), Expect = 2e-93,   Method: Compositional matrix adjust.
 Identities = 172/344 (50%), Positives = 231/344 (67%), Gaps = 10/344 (2%)

Query: 8   SFIIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFK 67
           SF    ++I+ LV++  +  V  R + E    E+HE+WMAQ+GR YKD  EK  R  +FK
Sbjct: 3   SFSQNHYLILFLVLSVWTSHVMSRRLSEACTSERHEKWMAQYGRVYKDAAEKEKRFQVFK 62

Query: 68  QNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQN 127
            N+ +IE  N  G++ + L  N+F+DL +EEF+AL     +    V  ++S  ++F+Y++
Sbjct: 63  NNVHFIESFNAAGDKPFNLSINQFADLNDEEFKALLINVQKKASWV--ETSTQTSFRYES 120

Query: 128 VTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDC- 186
           VT +P +IDWR++GAVT IKDQG+CGSCWAFSAVAA EGI QIT GKL+ LSEQ+LVDC 
Sbjct: 121 VTKIPATIDWRKRGAVTPIKDQGRCGSCWAFSAVAATEGIHQITTGKLVPLSEQELVDCV 180

Query: 187 STDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPK 246
             ++ GC GG +D AFE+I +  G+A+E  YPY+    TC  +KE    A I  YE +P 
Sbjct: 181 KGESEGCIGGYVDDAFEFIAKKGGIASETHYPYKGVNKTCKVKKETHGVAEIKGYEKVPS 240

Query: 247 GDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNA-DCGNNCDHGVAVVGFGTAEEENG 305
            +E+ALL+AV+NQPVSV +DA   AF +Y SG+ N  +CG + +H VAVVG+G A +  G
Sbjct: 241 NNEKALLKAVANQPVSVYIDAGTHAFKYYSSGIFNVRNCGTDPNHAVAVVGYGKALD--G 298

Query: 306 AKYWLIKNSWGETWGESGYIRILRDA----GLCGIATAASYPVA 345
           +KYWL+KNSWG  WGE GYIRI RD     GLCGIA    YP A
Sbjct: 299 SKYWLVKNSWGTEWGERGYIRIKRDIRAKEGLCGIAKYPYYPTA 342


>gi|242068363|ref|XP_002449458.1| hypothetical protein SORBIDRAFT_05g013840 [Sorghum bicolor]
 gi|241935301|gb|EES08446.1| hypothetical protein SORBIDRAFT_05g013840 [Sorghum bicolor]
          Length = 350

 Score =  347 bits (891), Expect = 4e-93,   Method: Compositional matrix adjust.
 Identities = 178/349 (51%), Positives = 240/349 (68%), Gaps = 22/349 (6%)

Query: 10  IIPMFVIIILVITCASQVVSGRSMH-------EPSIVEKHEQWMAQHGRTYKDELEKAMR 62
           +I      ++++   + VV  R +        E ++  +H+QWMA+HGRTYKDE EKA R
Sbjct: 10  MITFTAAALMILAVMTMVVEARDLSTSTGGYGEEAMKVRHQQWMAEHGRTYKDEAEKARR 69

Query: 63  LNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPST 122
             +FK N ++++++N  G ++Y+L  NEF+D+TN+EF A+YTG  +PVP+  +   + + 
Sbjct: 70  FQVFKANADFVDRSNAAGGKSYELAINEFADMTNDEFVAMYTGL-KPVPAGPK---KMAG 125

Query: 123 FKYQNVT--DVP-TSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELS 179
           FKY+N+T  DV   ++DWR+KGAVT IK+QGQCG CWAF+AVAAVE I QIT G L+ LS
Sbjct: 126 FKYENLTLSDVDQQAVDWRQKGAVTGIKNQGQCGCCWAFAAVAAVESIHQITTGNLVSLS 185

Query: 180 EQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATI 238
           EQQ++DC TD N+GC+GG +D AF+YII N GLATE  YPY   +GTC +  + AV  TI
Sbjct: 186 EQQVLDCDTDGNNGCNGGYIDNAFQYIISNGGLATEDAYPYAAAQGTCQSSVQPAV--TI 243

Query: 239 SKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNAD-CGN-NCDHGVAVVG 296
           S Y+D+P GDE AL  AV+NQPV+V +DA    F FY SGVL AD CG  + +H V  VG
Sbjct: 244 SSYQDVPSGDEAALAAAVANQPVAVAIDAHNN-FQFYSSGVLTADTCGTPSLNHAVTAVG 302

Query: 297 FGTAEEENGAKYWLIKNSWGETWGESGYIRILRDAGLCGIATAASYPVA 345
           + TAE+  G  YWL+KN WG+ WGE GY+R+ R    CG+A  ASYPVA
Sbjct: 303 YSTAED--GTPYWLLKNQWGQNWGEGGYLRVERGTNACGVAQQASYPVA 349


>gi|413944253|gb|AFW76902.1| hypothetical protein ZEAMMB73_056195 [Zea mays]
          Length = 340

 Score =  347 bits (890), Expect = 5e-93,   Method: Compositional matrix adjust.
 Identities = 171/343 (49%), Positives = 229/343 (66%), Gaps = 22/343 (6%)

Query: 13  MFVIIILVITCASQVVSGRSMHEPS-IVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLE 71
           +  ++     C +  ++ R ++E S +V +HEQWMAQ+ R YKD  EKA R  +FK N++
Sbjct: 8   ILAVLSFAFFCGA-ALAARDLNEDSAMVARHEQWMAQYSRVYKDAAEKARRFEVFKANVK 66

Query: 72  YIEKANKEGNRTYKLGTNEFSDLTNEEFRALYT--GYNRPVPSVSRQSSRPSTFKYQNVT 129
           +IE  N  GNR + LG N+F+DLTN+EFR   T  G+    PS+ + S+    F+Y+NV+
Sbjct: 67  FIESFNTGGNRKFWLGINQFADLTNDEFRTTKTNKGFK---PSLDKVST---GFRYENVS 120

Query: 130 --DVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCS 187
              +P +IDWR  GAVT IKDQGQCG CWAFSAVAA EGI +I+ GKLI LSEQ+LVDC 
Sbjct: 121 VDAIPATIDWRTNGAVTPIKDQGQCGCCWAFSAVAATEGIVKISTGKLISLSEQELVDCD 180

Query: 188 T--DNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLP 245
              ++ GC GGLMD AF++II+N GL TE++YPY   +G C +      AA I  YED+P
Sbjct: 181 VHGEDQGCEGGLMDDAFKFIIKNGGLTTESNYPYTAADGKCKSGSNS--AANIKGYEDVP 238

Query: 246 KGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENG 305
             DE AL++AV+NQPVSV VD     F FY  GV+   CG + DHG+A +G+G  +  +G
Sbjct: 239 TNDEAALMKAVANQPVSVAVDGGDMTFQFYSGGVMTGSCGTDLDHGIAAIGYG--KTSDG 296

Query: 306 AKYWLIKNSWGETWGESGYIRILRD----AGLCGIATAASYPV 344
            KYWL+KNSWG TWGE+GY+R+ +D     G+CG+A   SYP 
Sbjct: 297 TKYWLMKNSWGTTWGENGYLRMEKDISDKKGMCGLAMEPSYPT 339


>gi|400180449|gb|AFP73361.1| cysteine protease [Solanum chilense]
          Length = 344

 Score =  347 bits (889), Expect = 7e-93,   Method: Compositional matrix adjust.
 Identities = 163/340 (47%), Positives = 236/340 (69%), Gaps = 11/340 (3%)

Query: 10  IIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQN 69
           ++ + + +  VI+  +    GRS  + S+ E+HE WM++HGR YKDE+EK  R  IFK+N
Sbjct: 7   LMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFMIFKKN 66

Query: 70  LEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
           +++IE  NK GN +YKLG NEF+D+T++EF A +TG N P   +S      + FK  +++
Sbjct: 67  MKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEFKINDLS 126

Query: 130 D--VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCS 187
           D  +P+++DWRE GAVT +K QGQCG CWAFSAV ++EG  +I  GKL+E SEQ+L+DC+
Sbjct: 127 DDDMPSNLDWRESGAVTQVKHQGQCGCCWAFSAVGSLEGAYKIATGKLMEFSEQELLDCT 186

Query: 188 TDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKG 247
           T+N+GC+GG M  AF++IIEN G++ E+DY Y  E+ TC +Q EK  A  IS Y+ +P+G
Sbjct: 187 TNNYGCNGGFMTNAFDFIIENGGISRESDYEYLGEQYTCRSQ-EKTAAVQISSYQVVPEG 245

Query: 248 DEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAK 307
            E +LLQAV+ QPVS+ + AS +   FY  G  +  C +  +H V  +G+GT  +E G K
Sbjct: 246 -ETSLLQAVTKQPVSIGIAAS-QDLQFYAEGTYDGSCADRINHAVTAIGYGT--DEKGQK 301

Query: 308 YWLIKNSWGETWGESGYIRILRD----AGLCGIATAASYP 343
           YWL+KNSWG +WGE+G+++I+RD    +GLC IA  +SYP
Sbjct: 302 YWLLKNSWGTSWGENGFMKIIRDSGNPSGLCDIAKMSSYP 341


>gi|171702829|dbj|BAG16370.1| cysteine protease [Brassica oleracea var. italica]
          Length = 332

 Score =  346 bits (887), Expect = 1e-92,   Method: Compositional matrix adjust.
 Identities = 165/335 (49%), Positives = 233/335 (69%), Gaps = 11/335 (3%)

Query: 13  MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEY 72
           +F+I+ LV + +      R + E ++ ++H  WM +HGR Y D  EK  R  +FK+N+E 
Sbjct: 2   IFLIVSLVSSFSLSTTLSRPLDEVTMQKRHAAWMTEHGRVYADANEKNNRYVVFKRNVES 61

Query: 73  IEKANK-EGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTD- 130
           IE+ N+ +   T+KL  N+F+DLTNEEFR++YTGY     SV    ++P++F+YQ+V+  
Sbjct: 62  IERLNEVQYGLTFKLAVNQFADLTNEEFRSMYTGYKGN--SVLSSRTKPTSFRYQHVSSD 119

Query: 131 -VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD 189
            +P S+DWR+KGAVT IKDQG CGSCWAFSAVAA+EG+ QI +GKLI LSEQ+LVDC T+
Sbjct: 120 ALPISVDWRKKGAVTPIKDQGSCGSCWAFSAVAAIEGVAQIKKGKLISLSEQELVDCDTN 179

Query: 190 NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDE 249
           + GC GG M+ AF Y +   GL +E++YPY+  +GTC+  K K +A +I  +ED+P  DE
Sbjct: 180 DDGCMGGYMNSAFNYTMTTGGLTSESNYPYKSTDGTCNINKTKQIATSIKGFEDVPANDE 239

Query: 250 QALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYW 309
           +AL++AV++ PVS+ +   G  F FY SGV + +C  + DHGVAVVG+G  +  NG+KYW
Sbjct: 240 KALMKAVAHHPVSIGIAGGGTGFQFYSSGVFSGECSTHLDHGVAVVGYG--KSSNGSKYW 297

Query: 310 LIKNSWGETWGESGYIRILRDA----GLCGIATAA 340
           ++KNSWG  WGE GY+RI +D     G CG+A  A
Sbjct: 298 ILKNSWGPKWGERGYMRIKKDTKAKHGQCGLAMNA 332


>gi|356515046|ref|XP_003526212.1| PREDICTED: thiol protease SEN102-like [Glycine max]
          Length = 342

 Score =  346 bits (887), Expect = 1e-92,   Method: Compositional matrix adjust.
 Identities = 170/350 (48%), Positives = 228/350 (65%), Gaps = 13/350 (3%)

Query: 1   MVLKFEKSFIIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKA 60
           M    +K  ++ +F+ + + I   SQV+  R +H+ ++ E+HE WMA++G+ YKD  EK 
Sbjct: 1   MAFTGQKQHMLALFLFLAVGI---SQVMP-RKLHQTALRERHENWMAEYGKMYKDAAEKE 56

Query: 61  MRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRP 120
            R  IFK N+E+IE  N  GN+ YKLG N  +DLT EEF+    G  R     S  + + 
Sbjct: 57  KRFQIFKDNVEFIESFNAAGNKPYKLGVNHLADLTLEEFKDSRNGLKRTY-EFSTTTFKL 115

Query: 121 STFKYQNVTDVPTSIDWREKGAVTHIKDQG-QCGSCWAFSAVAAVEGITQITRGKLIELS 179
           + FKY+NVTD+P +IDWR KGAVT IKDQG QCG  WAFS +AA EGI QI+ G L+ LS
Sbjct: 116 NGFKYENVTDIPEAIDWRVKGAVTPIKDQGDQCGRFWAFSTIAATEGIHQISTGNLVSLS 175

Query: 180 EQQLVDCSTDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATIS 239
           EQ+LVDC + + GC GG M+  FE+II+N G+ +E +YPY+  +GTC+     +  A I 
Sbjct: 176 EQELVDCDSVDDGCEGGFMEDGFEFIIKNGGITSETNYPYKGVDGTCNTTIAASPVAQIK 235

Query: 240 KYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGT 299
            YE +P   E+AL +AV+NQPVSV + A+   F FY SG+ N +CG + DHGV  VG+GT
Sbjct: 236 GYEIVPSYSEEALKKAVANQPVSVSIHATNATFMFYSSGIYNGECGTDLDHGVTAVGYGT 295

Query: 300 AEEENGAKYWLIKNSWGETWGESGYIRILR----DAGLCGIATAASYPVA 345
              ENG  YW++KNSWG  WGE GYIR+ R      G+CGIA  +SYP A
Sbjct: 296 ---ENGTDYWIVKNSWGTQWGEKGYIRMHRGIAAKHGICGIALDSSYPTA 342


>gi|1208549|gb|AAC49455.1| Pseudotzain [Pseudotsuga menziesii]
          Length = 454

 Score =  345 bits (886), Expect = 1e-92,   Method: Compositional matrix adjust.
 Identities = 172/351 (49%), Positives = 235/351 (66%), Gaps = 21/351 (5%)

Query: 10  IIPMFVIIILVITCAS------QVVSGRS---MHEPSIVEKHEQWMAQHGRTYKDELEKA 60
           I+ +F ++ L     S       ++S  S   + + +I+E +E W+AQH + Y    EK 
Sbjct: 3   ILLLFAVLALSAMAGSASRADFSIISYDSQDLIGDDAIMELYELWLAQHKKAYNGLDEKQ 62

Query: 61  MRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRP 120
            + ++FK N  YI + N +GN +YKLG N+F+DL++EEF+A Y G    + +  R S  P
Sbjct: 63  KKFSVFKDNFLYIHQHNNQGNPSYKLGLNQFADLSHEEFKAAYLG--TKLDAKKRLSRSP 120

Query: 121 ST-FKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELS 179
           S  ++Y    D+P SIDWREKGAVT +K+QG CGSCWAFS VAAVEGI QI  G L  LS
Sbjct: 121 SPRYQYSVGEDLPESIDWREKGAVTAVKNQGSCGSCWAFSTVAAVEGINQIVTGNLTSLS 180

Query: 180 EQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATI 238
           EQ+LVDC T  N GC+GGLMD AF++II N GL +E DYPY+   G+CD  ++ A   TI
Sbjct: 181 EQELVDCDTSYNQGCNGGLMDYAFQFIISNGGLDSEDDYPYKANNGSCDAYRKNAHVVTI 240

Query: 239 SKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFG 298
             YED+P+ DE++L +A +NQP+SV ++ASGRAF FY+SGV  ++CG   DHGV +VG+G
Sbjct: 241 DDYEDVPENDEKSLKKAAANQPISVAIEASGRAFQFYESGVFTSNCGTQLDHGVTLVGYG 300

Query: 299 TAEEENGAKYWLIKNSWGETWGESGYIRILRD-----AGLCGIATAASYPV 344
           +   E+G  YWL+KNSWG +WGE G+I++ R+      G+CGIA  ASYPV
Sbjct: 301 S---ESGIDYWLVKNSWGNSWGEKGFIKLQRNLEGASTGMCGIAMEASYPV 348


>gi|400180441|gb|AFP73357.1| cysteine protease [Solanum habrochaites]
          Length = 344

 Score =  345 bits (886), Expect = 1e-92,   Method: Compositional matrix adjust.
 Identities = 162/340 (47%), Positives = 234/340 (68%), Gaps = 11/340 (3%)

Query: 10  IIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQN 69
           ++ + + +  VI+  +     RS  + S+ E+HE WM++HGR YKDE+EK  R  IFK+N
Sbjct: 7   LMSILITLFFVISMFNSQTRARSQPKLSVSERHELWMSRHGRVYKDEVEKGERFMIFKEN 66

Query: 70  LEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
           +++IE  NK GN +YKLG NEF+D+T+EEF A +TG N P   +S      + FK  +++
Sbjct: 67  MKFIESVNKAGNLSYKLGMNEFADITSEEFLAKFTGLNIPNSYLSPSPMSSTEFKINDIS 126

Query: 130 D--VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCS 187
           D  +P+++DWRE GAVT +K+QGQCG CWAFSAV ++EG  +I  G L+E SEQ+L+DC+
Sbjct: 127 DDDMPSNLDWRESGAVTQVKNQGQCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCT 186

Query: 188 TDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKG 247
           T+N+GC+GG M  AF++I EN G++ E+DY Y  ++ TC +Q EK  A  IS Y+ +P+G
Sbjct: 187 TNNYGCNGGFMTNAFDFIRENGGISRESDYEYLGQQYTCRSQ-EKTAAVQISSYQVVPEG 245

Query: 248 DEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAK 307
            E +LLQAV+ QPVS+ + AS +   FY  G  +  C N  +H V  +G+GT  +ENG K
Sbjct: 246 -ETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCANRINHAVTAIGYGT--DENGQK 301

Query: 308 YWLIKNSWGETWGESGYIRILRD----AGLCGIATAASYP 343
           YWL+KNSWG +WGE G+++I+RD    +GLC IA  +SYP
Sbjct: 302 YWLLKNSWGTSWGEKGFMKIIRDYGNPSGLCDIAKLSSYP 341


>gi|400180377|gb|AFP73327.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score =  345 bits (886), Expect = 2e-92,   Method: Compositional matrix adjust.
 Identities = 162/340 (47%), Positives = 236/340 (69%), Gaps = 11/340 (3%)

Query: 10  IIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQN 69
           ++ + + +  VI+  +    GRS  E S+ E+HE WM++HGR YKDE+EK  R  IFK+N
Sbjct: 7   LMNILITLFFVISMFNTQTRGRSQPELSVSERHELWMSRHGRVYKDEVEKGERFMIFKEN 66

Query: 70  LEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
           +++IE  NK GN +YKLG NEF+D+T++EF A +TG N P   +S      + FK  +++
Sbjct: 67  MKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEFKINDLS 126

Query: 130 D--VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCS 187
           D  +P+++DWRE GAVT +K QG+CG CWAFSAV ++EG  +I  G L+E SEQ+L+DC+
Sbjct: 127 DDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCT 186

Query: 188 TDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKG 247
           T+N+GC+GG M  AF++IIEN G++ E+DY Y+ E+ TC +Q EK  A  IS Y+ +P+G
Sbjct: 187 TNNYGCNGGFMTNAFDFIIENGGISRESDYEYQGEQYTCRSQ-EKTAAVQISSYQVVPEG 245

Query: 248 DEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAK 307
            E +LLQAV+ QPVS+ + AS +   FY  G  +  C +  +H V  +G+GT  +E G K
Sbjct: 246 -ETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHAVTAIGYGT--DEKGQK 301

Query: 308 YWLIKNSWGETWGESGYIRILRD----AGLCGIATAASYP 343
           YWL+KNSWG +WGE+G+++I+RD    +GLC IA  +SYP
Sbjct: 302 YWLLKNSWGTSWGENGFMKIIRDSGNPSGLCDIAKMSSYP 341


>gi|242072394|ref|XP_002446133.1| hypothetical protein SORBIDRAFT_06g002160 [Sorghum bicolor]
 gi|241937316|gb|EES10461.1| hypothetical protein SORBIDRAFT_06g002160 [Sorghum bicolor]
          Length = 338

 Score =  345 bits (885), Expect = 2e-92,   Method: Compositional matrix adjust.
 Identities = 165/340 (48%), Positives = 226/340 (66%), Gaps = 20/340 (5%)

Query: 16  IIILVITCAS---QVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEY 72
            ++ ++ CAS    V++ R + + ++VE+HE WM ++GR YKD  EKA R   FK N+ +
Sbjct: 7   FLLAILGCASLCSSVLAARELSDAAMVERHENWMVEYGRVYKDAAEKARRFEAFKHNVAF 66

Query: 73  IEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQN--VTD 130
           +E  N      + LG N+F+DLT EEF+A     N+    +S +    + FKY+N  V+ 
Sbjct: 67  VESFNTNKKNKFWLGVNQFADLTTEEFKA-----NKGFKPISAEMVPTTGFKYENLSVSA 121

Query: 131 VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD- 189
           +PT++DWR KGAVT IK+QGQCG CWAFSAVAA+EGI +++ G LI LSEQ+LVDC T  
Sbjct: 122 LPTAVDWRTKGAVTPIKNQGQCGCCWAFSAVAAMEGIVKLSTGNLISLSEQELVDCDTHS 181

Query: 190 -NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGD 248
            + GC GG MD AFE++I+N GLATE+ YPY+  +G C    +   AATI  +ED+P  D
Sbjct: 182 MDEGCEGGWMDSAFEFVIKNGGLATESSYPYKAVDGKCKGGSKS--AATIKGHEDVPVND 239

Query: 249 EQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKY 308
           E AL++AV+NQPVSV VDAS R F  Y  GV+   CG   DHG+A +G+G   E +G KY
Sbjct: 240 EAALMKAVANQPVSVAVDASDRTFMLYSGGVMTGSCGTELDHGIAAIGYGV--ESDGTKY 297

Query: 309 WLIKNSWGETWGESGYIRILRD----AGLCGIATAASYPV 344
           W++KNSWG TWGE G++R+ +D     G+CG+A   SYP 
Sbjct: 298 WILKNSWGTTWGEKGFLRMEKDISDKQGMCGLAMKPSYPT 337


>gi|350535639|ref|NP_001233949.1| phytophthora-inhibited protease 1 [Solanum lycopersicum]
 gi|108937128|gb|ABG23376.1| phytophthora-inhibited protease 1 [Solanum lycopersicum]
          Length = 345

 Score =  345 bits (885), Expect = 2e-92,   Method: Compositional matrix adjust.
 Identities = 162/347 (46%), Positives = 233/347 (67%), Gaps = 14/347 (4%)

Query: 8   SFIIPMFVIIILVITCASQ---VVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLN 64
           +F +    +++L+ +  S    +V+ R++ E S++E+HE WM  HGR YKD++EK  R  
Sbjct: 4   NFFLKNITVVLLLFSILSLYPFIVTSRNLKELSMLERHENWMVHHGRVYKDDIEKEHRFK 63

Query: 65  IFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFK 124
            FK+N+E+IE  NK G + YKL  N+++DLT EEF   + G +  + S    ++  ++FK
Sbjct: 64  TFKENVEFIESFNKNGTQRYKLAVNKYADLTTEEFTTSFMGLDTSLLSQQESTATTTSFK 123

Query: 125 YQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLV 184
           Y +VT+VP S+DWR++G+VT +KDQG CG CWAFSA AA+EG  QI   +LI LSEQQL+
Sbjct: 124 YDSVTEVPNSMDWRKRGSVTGVKDQGVCGCCWAFSAAAAIEGAYQIANNELISLSEQQLL 183

Query: 185 DCSTDNHGCSGGLMDKAFEYIIENK--GLATEADYPYRHEEGTCDNQKEKAVAATISKYE 242
           DCST N GC GGLM  A++++++N   G+ TE +YPY   +  C  + E+  A TI+ YE
Sbjct: 184 DCSTQNKGCEGGLMTVAYDFLLQNNGGGITTETNYPYEEAQNVC--KTEQPAAVTINGYE 241

Query: 243 DLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEE 302
            +P  DE +LL+AV NQP+SV + A+   FH Y SG+ +  C +  +H V V+G+GT+EE
Sbjct: 242 VVPS-DESSLLKAVVNQPISVGI-AANDEFHMYGSGIYDGSCNSRLNHAVTVIGYGTSEE 299

Query: 303 ENGAKYWLIKNSWGETWGESGYIRILRDAGL----CGIATAASYPVA 345
           + G KYW++KNSWG  WGE GY+RI RD G+    CGIA  AS+P A
Sbjct: 300 D-GTKYWIVKNSWGSDWGEEGYMRIARDVGVDGGHCGIAKVASFPTA 345


>gi|38346003|emb|CAD40112.2| OSJNBa0035O13.5 [Oryza sativa Japonica Group]
 gi|125589427|gb|EAZ29777.1| hypothetical protein OsJ_13835 [Oryza sativa Japonica Group]
          Length = 339

 Score =  345 bits (885), Expect = 2e-92,   Method: Compositional matrix adjust.
 Identities = 168/341 (49%), Positives = 228/341 (66%), Gaps = 17/341 (4%)

Query: 13  MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEY 72
           +F I+  +  C++ + +       ++V +HE+WM Q+GR YKD  EKA R  IFK N+ +
Sbjct: 8   LFAILSCLCLCSAVLAAREQSDHAAMVARHERWMEQYGRVYKDATEKARRFEIFKANVAF 67

Query: 73  IEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT--D 130
           IE  N  GN  + LG N+F+DLTN EFRA  T     +PS  R    P+TF+Y+NV+   
Sbjct: 68  IESFNA-GNHKFWLGVNQFADLTNYEFRATKTNKGF-IPSTVRV---PTTFRYENVSIDT 122

Query: 131 VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST-- 188
           +P ++DWR KGAVT IKDQGQCG CWAFSAVAA+EGI +++ GKLI LSEQ+LVDC    
Sbjct: 123 LPATVDWRTKGAVTPIKDQGQCGCCWAFSAVAAMEGIVKLSTGKLISLSEQELVDCDVHG 182

Query: 189 DNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGD 248
           ++ GC GGLMD AF++II+N GL TE+ YPY   +G C+       AATI  YED+P  +
Sbjct: 183 EDQGCEGGLMDDAFKFIIKNGGLTTESKYPYTAADGKCNGGSNS--AATIKGYEDVPANN 240

Query: 249 EQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKY 308
           E AL++AV+NQPVSV VD     F FY  GV+   CG + DHG+  +G+G  ++ +G +Y
Sbjct: 241 EAALMKAVANQPVSVAVDGGDMTFQFYSGGVMTGSCGTDLDHGIVAIGYG--KDGDGTQY 298

Query: 309 WLIKNSWGETWGESGYIRILRDA----GLCGIATAASYPVA 345
           WL+KNSWG TWGE+G++R+ +D     G+CG+A   SYP A
Sbjct: 299 WLLKNSWGTTWGENGFLRMEKDISDKRGMCGLAMEPSYPTA 339


>gi|242072398|ref|XP_002446135.1| hypothetical protein SORBIDRAFT_06g002170 [Sorghum bicolor]
 gi|241937318|gb|EES10463.1| hypothetical protein SORBIDRAFT_06g002170 [Sorghum bicolor]
          Length = 338

 Score =  345 bits (884), Expect = 2e-92,   Method: Compositional matrix adjust.
 Identities = 165/339 (48%), Positives = 227/339 (66%), Gaps = 20/339 (5%)

Query: 16  IIILVITCAS---QVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEY 72
            ++ ++ CAS    V++ R + + ++VE+HE WM ++GR YKD  EKA R  +FK N+ +
Sbjct: 7   FLLAILGCASLCSSVLAARELSDAAMVERHENWMVEYGRVYKDAAEKARRFEVFKDNVAF 66

Query: 73  IEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQN--VTD 130
           +E  N   N  + LG N+F+DLT EEF+A     N+    +S +    + FKY+N  V+ 
Sbjct: 67  VESFNTNKNNKFWLGINQFADLTIEEFKA-----NKGFKPISAEKVPTTGFKYENLSVSA 121

Query: 131 VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD- 189
           +PT++DWR KGAVT IK+QGQCG CWAFSAVAA+EGI +++ G LI LSEQ+LVDC T  
Sbjct: 122 LPTAVDWRTKGAVTPIKNQGQCGCCWAFSAVAAMEGIVKLSTGNLISLSEQELVDCDTHS 181

Query: 190 -NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGD 248
            + GC GG MD AFE++I+N GLAT + YPY+  +G C    +   AATI  +ED+P  D
Sbjct: 182 MDEGCEGGWMDSAFEFVIKNGGLATVSSYPYKAVDGKCKGGSKS--AATIKGHEDVPVND 239

Query: 249 EQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKY 308
           E AL++AV+NQPVSV VDAS R F  Y  GV+   CG   DHG+A +G+G   E +G KY
Sbjct: 240 EAALMKAVANQPVSVAVDASDRTFMLYSGGVMTGSCGTELDHGIAAIGYGV--ESDGTKY 297

Query: 309 WLIKNSWGETWGESGYIRILRD----AGLCGIATAASYP 343
           W++KNSWG TWGE G++R+ +D     G+CG+A   SYP
Sbjct: 298 WILKNSWGTTWGEKGFLRMEKDISDKQGMCGLAMKPSYP 336


>gi|388497270|gb|AFK36701.1| unknown [Lotus japonicus]
          Length = 343

 Score =  345 bits (884), Expect = 3e-92,   Method: Compositional matrix adjust.
 Identities = 184/343 (53%), Positives = 240/343 (69%), Gaps = 16/343 (4%)

Query: 12  PMFVIIILVITCASQVVSGRSMHE--PSIVEK-HEQWMAQHGRTYKDELEKAMRLNIFKQ 68
           P+  +  ++  CA   +S R++++   S+V K H+QWM Q+GR+Y ++ E   R  IF +
Sbjct: 6   PIIALCTMLWACAYTAMS-RTLYDETSSVVAKTHQQWMLQYGRSYTNDAEMEKRFKIFME 64

Query: 69  NLEYIEKANKE-GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQN 127
           NLEYIEK N   GN++YKL  N+FSDLTNEEF A +TG        S  S R S     +
Sbjct: 65  NLEYIEKFNNAPGNKSYKLDLNQFSDLTNEEFIASHTGLMIDPSKPSSSSKRASPASL-D 123

Query: 128 VTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCS 187
           ++D PTS+DWRE+GAVT +K+QG CGSCWAFSAVAAVEGI +I  G LI LSEQQLVDC+
Sbjct: 124 LSDTPTSLDWREQGAVTDVKNQGNCGSCWAFSAVAAVEGIVKIKNGNLISLSEQQLVDCA 183

Query: 188 TD--NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLP 245
           ++  N GC GG MD AF YI EN G+A+E DY YR   GTC N +    AA IS YED+P
Sbjct: 184 SNEQNQGCGGGFMDNAFSYITEN-GIASENDYQYRGGAGTCQNNEMITPAARISGYEDVP 242

Query: 246 KGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENG 305
            G++Q LL AVS QPVSV + A G++FH YK G+ +  CG++ +HGV +VG+GT+EE+ G
Sbjct: 243 AGEDQLLL-AVSQQPVSVAI-AVGQSFHLYKEGIYSGPCGSSLNHGVTLVGYGTSEED-G 299

Query: 306 AKYWLIKNSWGETWGESGYIRILRDAGL----CGIATAASYPV 344
            KYWLIKNSWGE+WGE+GY+R+LR++G     CGIA  AS+P 
Sbjct: 300 TKYWLIKNSWGESWGENGYMRLLRESGQSEGHCGIAVKASHPT 342


>gi|20334373|gb|AAM19207.1|AF493232_1 cysteine protease [Solanum pimpinellifolium]
 gi|400180424|gb|AFP73350.1| cysteine protease [Solanum pimpinellifolium]
 gi|400180433|gb|AFP73354.1| cysteine protease [Solanum lycopersicum]
          Length = 344

 Score =  344 bits (883), Expect = 3e-92,   Method: Compositional matrix adjust.
 Identities = 162/340 (47%), Positives = 236/340 (69%), Gaps = 11/340 (3%)

Query: 10  IIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQN 69
           ++ + + +  VI+  +    GRS  + S+ E+HE WM++HGR YKDE+EK  R  IFK+N
Sbjct: 7   LMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFMIFKEN 66

Query: 70  LEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
           +++IE  NK GN +YKLG NEF+D+T++EF A +TG N P   +S      + FK  +++
Sbjct: 67  MKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEFKINDLS 126

Query: 130 D--VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCS 187
           D  +P+++DWRE GAVT +K QG+CG CWAFSAV ++EG  +I  G L+E SEQ+L+DC+
Sbjct: 127 DDYMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCT 186

Query: 188 TDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKG 247
           T+N+GC+GGLM  AF++IIEN G++ E+DY Y  E+ TC   +EK  A  IS Y+ +P+G
Sbjct: 187 TNNYGCNGGLMTNAFDFIIENGGISRESDYEYLGEQYTC-RSREKTAAVQISSYKVVPEG 245

Query: 248 DEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAK 307
            E +LLQAV+ QPVS+ + AS +   FY  G  + +C +  +H V  +G+GT EE  G K
Sbjct: 246 -ETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGNCADQINHAVTAIGYGTDEE--GQK 301

Query: 308 YWLIKNSWGETWGESGYIRILRD----AGLCGIATAASYP 343
           YWL+KNSWG +WGE+G+++I+RD    +GLC IA  +SYP
Sbjct: 302 YWLLKNSWGTSWGENGFMKIIRDSGDPSGLCDIAKMSSYP 341


>gi|302143415|emb|CBI21976.3| unnamed protein product [Vitis vinifera]
          Length = 322

 Score =  344 bits (883), Expect = 3e-92,   Method: Compositional matrix adjust.
 Identities = 174/337 (51%), Positives = 225/337 (66%), Gaps = 32/337 (9%)

Query: 15  VIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIE 74
           + ++ V+   +   + R++HE S+ E+HE WMAQ+GR YKD  EK+ R  IFK N+  IE
Sbjct: 12  LALLFVLAAWASQATARNLHEASMYERHEDWMAQYGRVYKDADEKSKRYKIFKDNVARIE 71

Query: 75  KANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTS 134
             NK  +++YKL  NEF+DLTNEEF    T  NR    +   S+  ++FKY+NVT VP++
Sbjct: 72  SFNKAMDKSYKLSINEFADLTNEEFG---TSRNRFKAHIC--STEATSFKYENVTAVPST 126

Query: 135 IDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST--DNHG 192
           IDWR+KGAVT IKDQGQCGSCWAFSAVAA+EGITQ++ GKLI LSEQ+LVDC T  ++ G
Sbjct: 127 IDWRKKGAVTPIKDQGQCGSCWAFSAVAAMEGITQLSTGKLISLSEQELVDCDTSGEDQG 186

Query: 193 CSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQAL 252
           C+G                   A+YPY   +GTC+ +K    AA I+ YED+P  +E+AL
Sbjct: 187 CNG-------------------ANYPYAGTDGTCNRKKAAHPAAKINGYEDVPANNEKAL 227

Query: 253 LQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIK 312
            +AV +QP++V +DA G  F FY SGV    CG   DHGVA VG+GT+++  G KYWL+K
Sbjct: 228 QKAVVHQPIAVAIDAGGFEFQFYSSGVFTGQCGTELDHGVAAVGYGTSDD--GMKYWLVK 285

Query: 313 NSWGETWGESGYIRILRDA----GLCGIATAASYPVA 345
           NSWG  WGE GYIR+ RD     GLCGIA  ASYP A
Sbjct: 286 NSWGTGWGEEGYIRMQRDVTAKEGLCGIAMQASYPTA 322


>gi|400180345|gb|AFP73311.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score =  344 bits (883), Expect = 3e-92,   Method: Compositional matrix adjust.
 Identities = 162/340 (47%), Positives = 235/340 (69%), Gaps = 11/340 (3%)

Query: 10  IIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQN 69
           ++ + + +  VI+  +    GRS  + S+ E+HE WM++HGR YKDE+EK  R  IFK+N
Sbjct: 7   LMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFMIFKEN 66

Query: 70  LEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
           +++IE  NK GN +YKLG NEF+D+T++EF A +TG N P   +S      + FK  +++
Sbjct: 67  MKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEFKINDLS 126

Query: 130 D--VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCS 187
           D  +P+++DWRE GAVT +K QG+CG CWAFSAV ++EG  +I  G L+E SEQ+L+DC+
Sbjct: 127 DDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCT 186

Query: 188 TDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKG 247
           T+N+GC GG M  AF++IIEN G++ E+DY Y  ++ TC +Q EK  A  IS Y+ +P+G
Sbjct: 187 TNNYGCDGGFMTNAFDFIIENGGISRESDYEYLGQQYTCRSQ-EKTAAVQISSYQVVPEG 245

Query: 248 DEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAK 307
            E +LLQAV+ QPVS+ + AS +   FY  G  +  C +  +H V  +G+GT  +ENG K
Sbjct: 246 -ETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHAVTAIGYGT--DENGQK 301

Query: 308 YWLIKNSWGETWGESGYIRILRD----AGLCGIATAASYP 343
           YWL+KNSWG +WGE+G+++I+RD    AGLC IA  +SYP
Sbjct: 302 YWLLKNSWGTSWGENGFMKIIRDYGNPAGLCDIAKMSSYP 341


>gi|400180447|gb|AFP73360.1| cysteine protease [Solanum chilense]
          Length = 345

 Score =  344 bits (882), Expect = 4e-92,   Method: Compositional matrix adjust.
 Identities = 164/341 (48%), Positives = 234/341 (68%), Gaps = 12/341 (3%)

Query: 10  IIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQN 69
           ++ + + +  VI+  +    GRS  + S+ E+HE WM++HGR YKDE+EK  R  IFK+N
Sbjct: 7   LMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFMIFKEN 66

Query: 70  LEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
           +++IE  NK GN +YKLG NEF+D+T++EF A +TG N P   +S      + FK  N  
Sbjct: 67  MKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEFKKINDL 126

Query: 130 ---DVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDC 186
              D+P+++DWRE GAVT +K QGQCG CWAFSAV ++EG  +I  GKL+E SEQ+L+DC
Sbjct: 127 SDDDMPSNLDWRESGAVTQVKHQGQCGCCWAFSAVGSLEGAYKIATGKLMEFSEQELLDC 186

Query: 187 STDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPK 246
           +T+N+GC+GG M  AF++IIEN G++ E+DY Y  E+ TC +Q EK  A  IS Y+ +P+
Sbjct: 187 TTNNYGCNGGFMTNAFDFIIENGGISRESDYEYLGEQYTCRSQ-EKTAAVQISSYQVVPE 245

Query: 247 GDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGA 306
           G E +LLQAV+ QPVS+ + AS +   FY  G  +  C +  +H V  +G+GT  +E G 
Sbjct: 246 G-ETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHAVTAIGYGT--DEKGQ 301

Query: 307 KYWLIKNSWGETWGESGYIRILRD----AGLCGIATAASYP 343
           KYWL+KNSWG +WGE+G+++I+RD    +GLC IA  +SYP
Sbjct: 302 KYWLLKNSWGTSWGENGFMKIIRDSGNPSGLCDIAKMSSYP 342


>gi|400180422|gb|AFP73349.1| cysteine protease [Solanum chmielewskii]
          Length = 344

 Score =  344 bits (882), Expect = 4e-92,   Method: Compositional matrix adjust.
 Identities = 162/340 (47%), Positives = 235/340 (69%), Gaps = 11/340 (3%)

Query: 10  IIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQN 69
           ++ + + +  VI+  +    GRS  + S+ E+HE WM++HGR YKDE+EK  R  IFK+N
Sbjct: 7   LMNILITLFFVISIFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFMIFKEN 66

Query: 70  LEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
           +++IE  NK GN +YKLG NEF+D+T++EF A +TG N P   +S      + FK  +++
Sbjct: 67  MKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEFKTNDLS 126

Query: 130 D--VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCS 187
           D  +P+++DWRE GAVT +K QGQCG CWAFSAV ++EG  +I  G L+E SEQ+L+DC+
Sbjct: 127 DDDMPSNLDWRESGAVTQVKHQGQCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCT 186

Query: 188 TDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKG 247
           T+N+GC+GG M  AF++IIEN G++ E+DY Y  ++ TC +Q EK  A  IS Y+ +P+G
Sbjct: 187 TNNYGCNGGFMTNAFDFIIENGGISRESDYEYLGQQYTCRSQ-EKTAAVQISSYQVVPEG 245

Query: 248 DEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAK 307
            E +LLQAV+ QPVS+ + AS +   FY  G  +  C +  +H V  +G+GT EE  G K
Sbjct: 246 -ETSLLQAVTKQPVSIGIAAS-QDLQFYSGGTYDGSCADRINHAVTAIGYGTDEE--GQK 301

Query: 308 YWLIKNSWGETWGESGYIRILRD----AGLCGIATAASYP 343
           YWL+KNSWG +WGE+G+++I+RD    +GLC IA  +SYP
Sbjct: 302 YWLLKNSWGTSWGENGFMKIIRDSGDPSGLCDIAKMSSYP 341


>gi|400180426|gb|AFP73351.1| cysteine protease [Solanum corneliomuelleri]
          Length = 344

 Score =  344 bits (882), Expect = 4e-92,   Method: Compositional matrix adjust.
 Identities = 161/340 (47%), Positives = 236/340 (69%), Gaps = 11/340 (3%)

Query: 10  IIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQN 69
           ++ + + +  VI+  +    GRS  + S+ E+HE WM++HGR YKDE+EK  R  IFK+N
Sbjct: 7   LMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFMIFKEN 66

Query: 70  LEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
           +++IE  NK GN +YKLG NEF+D+T++EF A +TG N P   +S      + FK  +++
Sbjct: 67  MKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEFKINDLS 126

Query: 130 D--VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCS 187
           D  +P+++DWRE GAVT +K QG+CG CWAFSAV ++EG  +I  G L+E SEQ+L+DC+
Sbjct: 127 DDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCT 186

Query: 188 TDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKG 247
           T+N+GC+GG M  AF++IIEN G++ E+DY Y  ++ TC +Q EK  A  IS Y+ +P+G
Sbjct: 187 TNNYGCNGGFMTNAFDFIIENGGISRESDYEYLGQQYTCRSQ-EKTAAVQISSYQVVPEG 245

Query: 248 DEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAK 307
            E +LLQAV+ QPVS+ + AS +   FY  G  +  C +  +H V  +G+GT  +ENG K
Sbjct: 246 -ETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHAVTAIGYGT--DENGQK 301

Query: 308 YWLIKNSWGETWGESGYIRILRD----AGLCGIATAASYP 343
           YWL+KNSWG +WGE+G+++I+RD    +GLC IA  +SYP
Sbjct: 302 YWLLKNSWGTSWGENGFMKIIRDSGNPSGLCDIAKMSSYP 341


>gi|400180435|gb|AFP73355.1| cysteine protease [Solanum pennellii]
          Length = 344

 Score =  344 bits (882), Expect = 4e-92,   Method: Compositional matrix adjust.
 Identities = 162/340 (47%), Positives = 233/340 (68%), Gaps = 11/340 (3%)

Query: 10  IIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQN 69
           ++ + + +  VI+  +     RS  + S+ E+HE WM++HGR YKDE+EK  R  IFK+N
Sbjct: 7   LMSILITLFFVISMFNTQTRARSQPKLSVSERHELWMSRHGRVYKDEVEKGERFMIFKEN 66

Query: 70  LEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
           +++IE  NK GN +YKLG NEF+D+T++EF A +TG N P   VS      + FK  +++
Sbjct: 67  MKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYVSPSPMSSTEFKINDLS 126

Query: 130 D--VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCS 187
           D  +P+++DWRE GAVT +K+QGQCG CWAFSAV ++EG  +I  G L+E SEQ+L+DC+
Sbjct: 127 DDDMPSNLDWRESGAVTQVKNQGQCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCT 186

Query: 188 TDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKG 247
           T+N+GC+GG M  AF++I EN G++ E+DY Y  ++ TC +Q EK  A  IS Y+ +P+G
Sbjct: 187 TNNYGCNGGFMTNAFDFIKENGGISRESDYEYLGQQYTCRSQ-EKTAAVQISSYQVVPEG 245

Query: 248 DEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAK 307
            E +LLQAV+ QPVS+ + AS +   FY  G  +  C N  +H V  +G+GT  +E G K
Sbjct: 246 -ETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCANRINHAVTAIGYGT--DEKGQK 301

Query: 308 YWLIKNSWGETWGESGYIRILRD----AGLCGIATAASYP 343
           YWL+KNSWG +WGE G+++I+RD    AGLC IA  +SYP
Sbjct: 302 YWLLKNSWGTSWGEDGFMKIIRDSGNPAGLCDIAKVSSYP 341


>gi|46401612|dbj|BAD16614.1| cysteine proteinase [Dianthus caryophyllus]
          Length = 459

 Score =  343 bits (881), Expect = 5e-92,   Method: Compositional matrix adjust.
 Identities = 170/351 (48%), Positives = 232/351 (66%), Gaps = 13/351 (3%)

Query: 3   LKFEKSFIIPMFVIIILVITCASQV-VSGRSMHEPS--IVEKHEQWMAQHGRTYKDELEK 59
           +   KS I  +F II +V + A  + +  R+ + P   I   +E W+ +HG+ Y    EK
Sbjct: 1   MSTSKSTIFLLFSIIFIVSSSALDLSIIDRAFNRPDDEIASLYETWLVKHGKNYNGLGEK 60

Query: 60  AMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQS-S 118
            +R NIFK NL ++++ N E N ++KLG N F+DLTNEE+R++Y G      +V+R   S
Sbjct: 61  QLRFNIFKDNLRFVDERNSE-NLSFKLGLNRFADLTNEEYRSVYLGTRPRSVAVARSGRS 119

Query: 119 RPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIEL 178
           +   + ++    +P S+DWR+KGAV  IKDQG CGSCWAFSA+AAVEG+ QI  G LI L
Sbjct: 120 KSDRYAFRAGDTLPESVDWRKKGAVAGIKDQGSCGSCWAFSAIAAVEGVNQIVTGDLISL 179

Query: 179 SEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAAT 237
           SEQ+LV+C T  N GC GGLMD AFE+II+N+G+ ++ DYPY   +G CD  ++ A   T
Sbjct: 180 SEQELVECDTSYNDGCDGGLMDYAFEFIIKNEGIDSDEDYPYTGRDGRCDTNRKNAKVVT 239

Query: 238 ISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGF 297
           I  YED P  DE++L +AV+NQPVSV ++  GR F  Y SGV    CG   DHGVAVVG+
Sbjct: 240 IDDYEDSPVYDEKSLQKAVANQPVSVAIEGGGRDFQLYDSGVFTGKCGTALDHGVAVVGY 299

Query: 298 GTAEEENGAKYWLIKNSWGETWGESGYIRILRD----AGLCGIATAASYPV 344
           GT   E+G  YW+++NSWG+TWGE GYIR+ R+    +G+CGIA   SYP+
Sbjct: 300 GT---EDGLDYWIVRNSWGDTWGEGGYIRMQRNTKLPSGICGIAIEPSYPI 347


>gi|400180367|gb|AFP73322.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score =  343 bits (881), Expect = 5e-92,   Method: Compositional matrix adjust.
 Identities = 162/340 (47%), Positives = 235/340 (69%), Gaps = 11/340 (3%)

Query: 10  IIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQN 69
           ++ + + +  VI+  +    GRS  + S+ E+HE WM++HGR YKDE+EK  R  IFK+N
Sbjct: 7   LMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKVERFMIFKEN 66

Query: 70  LEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
           +++IE  NK GN +YKLG NEF+D+T++EF A +TG N P   +S      + FK  +++
Sbjct: 67  MKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEFKINDLS 126

Query: 130 D--VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCS 187
           D  +P+++DWRE GAVT +K QG+CG CWAFSAV ++EG  +I  G L+E SEQ+L+DC+
Sbjct: 127 DDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCT 186

Query: 188 TDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKG 247
           T+N+GC+GG M  AF++IIEN G++ E+DY Y  E+ TC +Q EK  A  IS Y+ +P+G
Sbjct: 187 TNNYGCNGGFMTNAFDFIIENGGISRESDYEYLGEQYTCRSQ-EKTAAVQISSYKVVPEG 245

Query: 248 DEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAK 307
            E +LLQAV+ QPVS+ + AS +   FY  G  +  C +  +H V  +G+GT  +E G K
Sbjct: 246 -ETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHAVTAIGYGT--DEKGQK 301

Query: 308 YWLIKNSWGETWGESGYIRILRD----AGLCGIATAASYP 343
           YWL+KNSWG +WGE+G+++I+RD    AGLC IA  +SYP
Sbjct: 302 YWLLKNSWGTSWGENGFMKIIRDYGNPAGLCDIAKMSSYP 341


>gi|225428328|ref|XP_002279940.1| PREDICTED: cysteine proteinase-like [Vitis vinifera]
          Length = 707

 Score =  343 bits (881), Expect = 6e-92,   Method: Compositional matrix adjust.
 Identities = 163/311 (52%), Positives = 215/311 (69%), Gaps = 12/311 (3%)

Query: 38  IVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNE 97
           ++ + E W+++HG+ YK   EK  R  +F++NL +I++ NKE + +Y LG NEF+DL++E
Sbjct: 400 LIARFESWVSKHGKVYKSMEEKLHRFEVFRENLNHIDERNKEVS-SYWLGLNEFADLSHE 458

Query: 98  EFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWA 157
           EF++ Y G     P   R       F+Y++V D+P S+DWR+KGAVTH+K+QG CGSCWA
Sbjct: 459 EFKSKYLGLRAEFP---RSRDYSGEFRYRDVADLPESVDWRKKGAVTHVKNQGACGSCWA 515

Query: 158 FSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGLATEAD 216
           FS VAAVEGI QI  G L  LSEQ+L+DC T  N GC+GGLMD AF +I  N GL  E D
Sbjct: 516 FSTVAAVEGINQIVTGNLTTLSEQELIDCDTTFNSGCNGGLMDYAFAFIASNGGLHKEDD 575

Query: 217 YPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYK 276
           YPY  EEGTC+ QKE     TIS YED+P+ DE++LL+A+++QP+SV ++ASGR F FY 
Sbjct: 576 YPYLMEEGTCEEQKEDVDIVTISGYEDVPEKDEESLLKALAHQPLSVAIEASGRDFQFYS 635

Query: 277 SGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRDA----G 332
            GV N  CG   DHGVA VG+G+++   G  Y ++KNSWG  WGE GYIR+ R+     G
Sbjct: 636 GGVFNGPCGTELDHGVAAVGYGSSK---GLDYIIVKNSWGPKWGEKGYIRMKRNTGKTEG 692

Query: 333 LCGIATAASYP 343
           LCGI   ASYP
Sbjct: 693 LCGINKMASYP 703


>gi|400180355|gb|AFP73316.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score =  343 bits (881), Expect = 6e-92,   Method: Compositional matrix adjust.
 Identities = 161/340 (47%), Positives = 235/340 (69%), Gaps = 11/340 (3%)

Query: 10  IIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQN 69
           ++ + + +  VI+  +    GRS  + S+ E+HE WM++HGR YKDE+EK  R  IFK+N
Sbjct: 7   LMSILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFMIFKEN 66

Query: 70  LEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
           +++IE  NK GN +YKLG NEF+D+T++EF A +TG N P   +S      + FK  +++
Sbjct: 67  MKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEFKINDLS 126

Query: 130 D--VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCS 187
           D  +P+++DWRE GAVT +K QG+CG CWAFSAV ++EG  +I  G L+E SEQ+L+DC+
Sbjct: 127 DDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCT 186

Query: 188 TDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKG 247
           T+N+GC+GG M  AF++IIEN G++ E+DY Y  E+ TC +Q EK  A  IS Y+ +P+G
Sbjct: 187 TNNYGCNGGFMTNAFDFIIENGGISRESDYEYLGEQYTCRSQ-EKTAAVQISSYQVVPEG 245

Query: 248 DEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAK 307
            E +LLQAV+ QPVS+ + AS +   FY  G  +  C +  +H V  +G+GT  +E G K
Sbjct: 246 -ETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHAVTAIGYGT--DEKGQK 301

Query: 308 YWLIKNSWGETWGESGYIRILRD----AGLCGIATAASYP 343
           YWL+KNSWG +WGE+G+++I+RD    +GLC IA  +SYP
Sbjct: 302 YWLLKNSWGTSWGENGFMKIIRDSGNPSGLCDIAKMSSYP 341


>gi|125547256|gb|EAY93078.1| hypothetical protein OsI_14879 [Oryza sativa Indica Group]
          Length = 339

 Score =  343 bits (880), Expect = 6e-92,   Method: Compositional matrix adjust.
 Identities = 167/341 (48%), Positives = 228/341 (66%), Gaps = 17/341 (4%)

Query: 13  MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEY 72
           +F I+  +  C++ + +       ++V +HE+WM Q+GR YKD  EKA R  IFK N+ +
Sbjct: 8   LFAILSCLCLCSAVLAAREQSDHAAMVARHERWMEQYGRVYKDATEKARRFEIFKANVAF 67

Query: 73  IEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT--D 130
           IE  N  GN  + LG N+F+DLTN EFRA  T     +PS  R    P+TF+Y+NV+   
Sbjct: 68  IESFNA-GNHKFWLGVNQFADLTNYEFRATKTNKGF-IPSTVRV---PTTFRYENVSIDT 122

Query: 131 VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST-- 188
           +P ++DWR KGAVT IKDQGQCG CWAFSAVAA+EGI +++ GKLI LSEQ+LVDC    
Sbjct: 123 LPATVDWRTKGAVTPIKDQGQCGCCWAFSAVAAMEGIVKLSTGKLISLSEQELVDCDVHG 182

Query: 189 DNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGD 248
           ++ GC GGLMD AF++II+N GL TE+ YPY   +G C+       AATI  YE++P  +
Sbjct: 183 EDQGCEGGLMDDAFKFIIKNGGLTTESKYPYTAADGKCNGGSNS--AATIKGYEEVPANN 240

Query: 249 EQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKY 308
           E AL++AV+NQPVSV VD     F FY  GV+   CG + DHG+  +G+G  ++ +G +Y
Sbjct: 241 EAALMKAVANQPVSVAVDGGDMTFQFYSGGVMTGSCGTDLDHGIVAIGYG--KDGDGTQY 298

Query: 309 WLIKNSWGETWGESGYIRILRDA----GLCGIATAASYPVA 345
           WL+KNSWG TWGE+G++R+ +D     G+CG+A   SYP A
Sbjct: 299 WLLKNSWGTTWGENGFLRMEKDISDKRGMCGLAMEPSYPTA 339


>gi|356517310|ref|XP_003527331.1| PREDICTED: vignain-like [Glycine max]
          Length = 342

 Score =  343 bits (880), Expect = 6e-92,   Method: Compositional matrix adjust.
 Identities = 171/344 (49%), Positives = 231/344 (67%), Gaps = 10/344 (2%)

Query: 8   SFIIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFK 67
           SF    ++I+ LV+   +  V  R + E    E+HE+WMAQ+GR YKD  EK  R  +FK
Sbjct: 3   SFSQNHYLILFLVLAVWTSHVMSRRLSEACTSERHEKWMAQYGRVYKDAAEKEKRFQVFK 62

Query: 68  QNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQN 127
            N+ +IE  N  G++ + L  N+F+DL +EEF+AL     +    V  ++S  ++F+Y++
Sbjct: 63  NNVHFIESFNAAGDKPFNLSINQFADLNDEEFKALLINVQKKASWV--ETSTETSFRYES 120

Query: 128 VTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDC- 186
           VT +P +ID R++GAVT IKDQG+CGSCWAFSAVAA EGI QIT GKL+ LSEQ+LVDC 
Sbjct: 121 VTKIPATIDRRKRGAVTPIKDQGRCGSCWAFSAVAATEGIHQITTGKLVPLSEQELVDCV 180

Query: 187 STDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPK 246
             ++ GC GG +D AFE+I +  G+A+E  YPY+    TC  +KE    A I  YE +P 
Sbjct: 181 KGESEGCIGGYVDDAFEFIAKKGGIASETHYPYKGVNKTCKVKKETHGVAEIKGYEKVPS 240

Query: 247 GDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNA-DCGNNCDHGVAVVGFGTAEEENG 305
            +E+ALL+AV+NQPVSV +DA   AF +Y SG+ NA +CG + +H VAVVG+G A ++  
Sbjct: 241 NNEKALLKAVANQPVSVYIDAGTHAFKYYSSGIFNARNCGTDPNHAVAVVGYGKALDD-- 298

Query: 306 AKYWLIKNSWGETWGESGYIRILRDA----GLCGIATAASYPVA 345
           +KYWL+KNSWG  WGE GYIRI RD     GLCGIA    YP+A
Sbjct: 299 SKYWLVKNSWGTEWGERGYIRIKRDIRAKEGLCGIAKYPYYPIA 342


>gi|400180443|gb|AFP73358.1| cysteine protease, partial [Solanum habrochaites]
          Length = 345

 Score =  343 bits (880), Expect = 7e-92,   Method: Compositional matrix adjust.
 Identities = 166/341 (48%), Positives = 236/341 (69%), Gaps = 13/341 (3%)

Query: 10  IIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQN 69
           ++ + + +  VI+  +     RS  + S+ E+HE WM++HGR YKDE+EK  R  IFK+N
Sbjct: 7   LMSILITLFFVISMFNSQTRARSQPKLSVSERHELWMSRHGRVYKDEVEKGERFMIFKEN 66

Query: 70  LEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPST-FKYQNV 128
           +++IE  NK GN +YKLG NEF+D+T+EEF A +TG N P   +S  S  PST FK  ++
Sbjct: 67  MKFIESVNKAGNLSYKLGMNEFADITSEEFLAKFTGLNIPNSYLS-PSPMPSTEFKINDL 125

Query: 129 TD--VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDC 186
           +D  +P+++DWRE GAVT +K+QGQCG CWAFSAV ++EG  +I  G L+E SEQ+L+DC
Sbjct: 126 SDDDMPSNLDWRESGAVTQVKNQGQCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDC 185

Query: 187 STDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPK 246
           +T+N+GC+GG M  AF++IIEN G++ E+DY Y  ++ TC +Q  K  A  IS Y+ +P+
Sbjct: 186 TTNNYGCNGGFMTNAFDFIIENGGISRESDYEYLGQQYTCRSQG-KTAAVQISNYQVVPE 244

Query: 247 GDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGA 306
           G E +LLQAV+ QPVS+ + AS     FY  G  +  C N  +H V  +G+GT  +E G 
Sbjct: 245 G-ETSLLQAVTKQPVSIGIAAS-HDLQFYAGGTYDGSCANRINHAVTAIGYGT--DEKGQ 300

Query: 307 KYWLIKNSWGETWGESGYIRILRD----AGLCGIATAASYP 343
           KYWL+KNSWG +WGE+G+++I+RD    AGLC IA  +SYP
Sbjct: 301 KYWLLKNSWGTSWGENGFMKIIRDSGNPAGLCDIAKMSSYP 341


>gi|302143411|emb|CBI21972.3| unnamed protein product [Vitis vinifera]
          Length = 320

 Score =  343 bits (880), Expect = 7e-92,   Method: Compositional matrix adjust.
 Identities = 172/337 (51%), Positives = 224/337 (66%), Gaps = 34/337 (10%)

Query: 15  VIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIE 74
           + ++ V+   +   + RS+HE S+ E+HE WM Q+GR YKD  EK+ R  IFK N+  IE
Sbjct: 12  LALLFVLAAWASQATARSLHEASMYERHEDWMVQYGREYKDADEKSKRYKIFKDNVARIE 71

Query: 75  KANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTS 134
             NK  +++YKL  NEF+DLTNEEFRA     NR    +   S+  ++FKY+NVT VP++
Sbjct: 72  SFNKAMDKSYKLSINEFADLTNEEFRA---SRNRFKAHIC--STEATSFKYENVTAVPST 126

Query: 135 IDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST--DNHG 192
           +DWR+KGAVT IKDQGQCGSCWAFSAVAA+EGITQ++ GKLI LSEQ+LVDC T  ++ G
Sbjct: 127 VDWRKKGAVTPIKDQGQCGSCWAFSAVAAMEGITQLSTGKLISLSEQELVDCDTSGEDQG 186

Query: 193 CSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQAL 252
           C+                     +YPY   +GTC+ +K    AA I+ YED+P  +E+AL
Sbjct: 187 CT---------------------NYPYAGTDGTCNRKKAAHPAAKINGYEDVPANNEKAL 225

Query: 253 LQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIK 312
            +AV++QP++V +DASG  F FY SGV    CG   DHGVA VG+GT+++  G KYWL+K
Sbjct: 226 QKAVAHQPIAVAIDASGSEFQFYSSGVFTGQCGTELDHGVAAVGYGTSDD--GMKYWLVK 283

Query: 313 NSWGETWGESGYIRILRDA----GLCGIATAASYPVA 345
           NSW   WGE GYIR+ RD     GLCGIA  ASYP A
Sbjct: 284 NSWSTGWGEEGYIRMQRDVTAKEGLCGIAMQASYPTA 320


>gi|374713649|gb|AEZ65082.1| cysteine protease [Carica papaya]
          Length = 471

 Score =  343 bits (880), Expect = 8e-92,   Method: Compositional matrix adjust.
 Identities = 167/317 (52%), Positives = 213/317 (67%), Gaps = 10/317 (3%)

Query: 35  EPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDL 94
           E  +++ +E W+ +HG+ Y    EK  R  IFK NL ++++ N    RTYKLG  +F+DL
Sbjct: 45  EAHMMKMYEHWLVKHGKNYNAIGEKERRFEIFKDNLRFVDEQNSVPGRTYKLGLTKFADL 104

Query: 95  TNEEFRALYTGYNRPVPSVSR-QSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCG 153
           TNEE+RA+Y G         R + S+    K  N  D+P+ +DWREKGAVT +KDQGQCG
Sbjct: 105 TNEEYRAMYLGAKMEKKEKLRTERSQRYLHKAGNDDDLPSHVDWREKGAVTEVKDQGQCG 164

Query: 154 SCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGLA 212
           SCWAFS V +VEGI QI  G LI LSEQ+LVDC    N GC+GGLMD AFE+II+N G+ 
Sbjct: 165 SCWAFSTVGSVEGINQIVTGDLISLSEQELVDCDKAYNQGCNGGLMDYAFEFIIKNGGID 224

Query: 213 TEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAF 272
           +EADYPYR  +  CD+ ++ A   TI  YED+P+ DE++L +AV+NQPVSV ++A GR F
Sbjct: 225 SEADYPYRASDNMCDSNRKNAHVVTIDGYEDVPENDEESLKKAVANQPVSVAIEAGGREF 284

Query: 273 HFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILR--- 329
             Y+SGV    CG N DHGV  VG+GT   ENG  YW+++NSWG  WGESGYIR+ R   
Sbjct: 285 QLYQSGVFTGRCGTNLDHGVVAVGYGT---ENGIDYWIVRNSWGPKWGESGYIRMERNVA 341

Query: 330 --DAGLCGIATAASYPV 344
             D G CGIA  ASYP 
Sbjct: 342 STDTGKCGIAMEASYPT 358


>gi|297851334|ref|XP_002893548.1| peptidase C1A papain family protein [Arabidopsis lyrata subsp.
           lyrata]
 gi|297339390|gb|EFH69807.1| peptidase C1A papain family protein [Arabidopsis lyrata subsp.
           lyrata]
          Length = 346

 Score =  343 bits (879), Expect = 9e-92,   Method: Compositional matrix adjust.
 Identities = 180/348 (51%), Positives = 236/348 (67%), Gaps = 17/348 (4%)

Query: 10  IIPMFV-IIILVITC-ASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFK 67
           I+ MFV + IL ++   SQ  S  + HEP + E H+QWM +  R Y DELEK MR ++FK
Sbjct: 4   ILFMFVSLTILSMSLKVSQATSRVTFHEPIVAEHHQQWMTRFSRVYSDELEKQMRFDVFK 63

Query: 68  QNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYN--RPVPSVSRQSSRPSTFKY 125
           +NL++IEK NK+G+RTYKLG NEF+D T EEF A +TG      +PS         ++ +
Sbjct: 64  KNLKFIEKFNKKGDRTYKLGVNEFADWTKEEFIATHTGLKGFNGIPSSEFVDEMIPSWNW 123

Query: 126 QNVTDV--PTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQL 183
            NV+DV  P   DWR +GAVT +K QGQCG CWAFS+VAAVEG+T+I  G L+ LSEQQL
Sbjct: 124 -NVSDVAGPEIKDWRYEGAVTPVKYQGQCGCCWAFSSVAAVEGLTKIVGGNLVSLSEQQL 182

Query: 184 VDCSTD-NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYE 242
           +DC  + ++GC+GG+M  AF YII+N+G+A+EA YPY+  EGTC    +   +A I  ++
Sbjct: 183 LDCDRERDNGCNGGIMSDAFSYIIKNRGIASEASYPYQETEGTCRYNAKP--SAWIRGFQ 240

Query: 243 DLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNAD-CGNNCDHGVAVVGFGTAE 301
            +P  +E+ALL+AVS QPVSV +DA G  F  Y  GV +   CG + +H V  VG+GT+ 
Sbjct: 241 TVPSNNERALLEAVSRQPVSVSIDADGPGFMHYSGGVYDEPYCGTDVNHAVTFVGYGTSP 300

Query: 302 EENGAKYWLIKNSWGETWGESGYIRILRDA----GLCGIATAASYPVA 345
           E  G KYWL KNSWGETWGE+GYIRI RD     G+CG+A  A YPVA
Sbjct: 301 E--GIKYWLAKNSWGETWGENGYIRIRRDVAWPQGMCGVAQYAFYPVA 346


>gi|116309130|emb|CAH66233.1| H0825G02.10 [Oryza sativa Indica Group]
          Length = 339

 Score =  343 bits (879), Expect = 1e-91,   Method: Compositional matrix adjust.
 Identities = 167/341 (48%), Positives = 227/341 (66%), Gaps = 17/341 (4%)

Query: 13  MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEY 72
           +F I+  +  C++ + +       ++V +HE+WM Q+GR YKD  EKA R  IFK N+ +
Sbjct: 8   LFAILSCLCLCSAVLAAREQSDHAAMVARHERWMEQYGRVYKDATEKARRFEIFKANVAF 67

Query: 73  IEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT--D 130
           IE  N  GN  + L  N+F+DLTN EFRA  T     +PS  R    P+TF+Y+NV+   
Sbjct: 68  IESFNA-GNHKFWLSVNQFADLTNYEFRATKTNKGF-IPSTVRV---PTTFRYENVSIDT 122

Query: 131 VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST-- 188
           +P ++DWR KGAVT IKDQGQCG CWAFSAVAA+EGI +++ GKLI LSEQ+LVDC    
Sbjct: 123 LPATVDWRTKGAVTPIKDQGQCGCCWAFSAVAAMEGIVKLSTGKLISLSEQELVDCDVHG 182

Query: 189 DNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGD 248
           ++ GC GGLMD AF++II+N GL TE+ YPY   +G C+       AATI  YED+P  +
Sbjct: 183 EDQGCEGGLMDDAFKFIIKNGGLTTESKYPYTAADGKCNGGSNS--AATIKGYEDVPANN 240

Query: 249 EQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKY 308
           E AL++AV+NQPVSV VD     F FY  GV+   CG + DHG+  +G+G  ++ +G +Y
Sbjct: 241 EAALMKAVANQPVSVAVDGGDMTFQFYSGGVMTGSCGTDLDHGIVAIGYG--KDGDGTQY 298

Query: 309 WLIKNSWGETWGESGYIRILRDA----GLCGIATAASYPVA 345
           WL+KNSWG TWGE+G++R+ +D     G+CG+A   SYP A
Sbjct: 299 WLLKNSWGTTWGENGFLRMEKDISDKRGMCGLAMEPSYPTA 339


>gi|400180389|gb|AFP73333.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score =  343 bits (879), Expect = 1e-91,   Method: Compositional matrix adjust.
 Identities = 161/340 (47%), Positives = 235/340 (69%), Gaps = 11/340 (3%)

Query: 10  IIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQN 69
           ++ + + +  VI+  +    GRS  + S+ E+HE WM++HGR YKDE+EK  R  IFK+N
Sbjct: 7   LMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFMIFKEN 66

Query: 70  LEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
           +++IE  NK GN +YKLG NEF+D+T++EF A +TG N P   +S      + FK  +++
Sbjct: 67  MKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEFKINDLS 126

Query: 130 D--VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCS 187
           D  +P+++DWRE GAVT +K QG+CG CWAFSAV ++EG  +I  G L+E SEQ+L+DC+
Sbjct: 127 DDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCT 186

Query: 188 TDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKG 247
           T+N+GC+GG M  AF++IIEN G++ E+DY Y  E+ TC +Q EK  A  IS Y+ +P+G
Sbjct: 187 TNNYGCNGGFMTNAFDFIIENGGISRESDYEYLGEQYTCRSQ-EKTAAVQISSYKVVPEG 245

Query: 248 DEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAK 307
            E +LLQAV+ QPVS+ + AS +   FY  G  +  C +  +H V  +G+GT  +E G K
Sbjct: 246 -ETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHAVTAIGYGT--DEKGQK 301

Query: 308 YWLIKNSWGETWGESGYIRILRD----AGLCGIATAASYP 343
           YWL+KNSWG +WGE+G+++I+RD    +GLC IA  +SYP
Sbjct: 302 YWLLKNSWGTSWGENGFMKIIRDSGNPSGLCDIAKMSSYP 341


>gi|18403438|ref|NP_565780.1| cysteine proteinase-like protein [Arabidopsis thaliana]
 gi|2342728|gb|AAB67626.1| cysteine proteinase [Arabidopsis thaliana]
 gi|330253821|gb|AEC08915.1| cysteine proteinase-like protein [Arabidopsis thaliana]
          Length = 345

 Score =  342 bits (877), Expect = 2e-91,   Method: Compositional matrix adjust.
 Identities = 176/348 (50%), Positives = 236/348 (67%), Gaps = 15/348 (4%)

Query: 8   SFIIPMFVIIILVITCASQVVSGRSM--HEPSIVEKHEQWMAQHGRTYKDELEKAMRLNI 65
           S ++ + V+IIL         + R++   E S+V+KHEQWMA+  R Y+DELEK MR ++
Sbjct: 3   SIMVLVTVLIILFTGFRISQATSRTVIFREQSMVDKHEQWMARFSREYRDELEKNMRRDV 62

Query: 66  FKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKY 125
           FK+NL++IE  NK+GN++YKLG NEF+D TNEEF A++TG  + +  VS       T   
Sbjct: 63  FKKNLKFIENFNKKGNKSYKLGVNEFADWTNEEFLAIHTGL-KGLTEVSPSKVVAKTISS 121

Query: 126 Q--NVTD-VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQ 182
           Q  NV+D V  S DWR +GAVT +K QGQCG CWAFSAVAAVEG+ +I  G L+ LSEQQ
Sbjct: 122 QTWNVSDMVVESKDWRAEGAVTPVKYQGQCGCCWAFSAVAAVEGVAKIAGGNLVSLSEQQ 181

Query: 183 LVDCSTD-NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKY 241
           L+DC  + + GC GG+M  AF Y+++N+G+A+E DY Y+  +G C +      AA IS +
Sbjct: 182 LLDCDREYDRGCDGGIMSDAFNYVVQNRGIASENDYSYQGSDGGCRSNARP--AARISGF 239

Query: 242 EDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAE 301
           + +P  +E+ALL+AVS QPVSV +DA+G  F  Y  GV +  CG + +H V  VG+GT++
Sbjct: 240 QTVPSNNERALLEAVSRQPVSVSMDATGDGFMHYSGGVYDGPCGTSSNHAVTFVGYGTSQ 299

Query: 302 EENGAKYWLIKNSWGETWGESGYIRILRDA----GLCGIATAASYPVA 345
           +  G KYWL KNSWGETWGE GYIRI RD     G+CG+A  A YPVA
Sbjct: 300 D--GTKYWLAKNSWGETWGEKGYIRIRRDVAWPQGMCGVAQYAFYPVA 345


>gi|38345008|emb|CAD40026.2| OSJNBa0052O21.11 [Oryza sativa Japonica Group]
 gi|125589414|gb|EAZ29764.1| hypothetical protein OsJ_13822 [Oryza sativa Japonica Group]
          Length = 339

 Score =  342 bits (876), Expect = 2e-91,   Method: Compositional matrix adjust.
 Identities = 164/341 (48%), Positives = 229/341 (67%), Gaps = 17/341 (4%)

Query: 13  MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEY 72
           +F I+  +  C++ + +     + ++  +HE+WMAQ+GR Y+D+ EKA R  +FK N+ +
Sbjct: 8   LFAILGCLCLCSAVLAARELSDDAAMAARHERWMAQYGRVYRDDAEKARRFEVFKANVAF 67

Query: 73  IEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT--D 130
           IE  N  GN  + LG N+F+DLTN+EFR + T     +PS +R    P+ F+Y+NV    
Sbjct: 68  IESFNA-GNHNFWLGVNQFADLTNDEFRWMKTNKGF-IPSTTRV---PTGFRYENVNIDA 122

Query: 131 VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST-- 188
           +P ++DWR KGAVT IKDQGQCG CWAFSAVAA+EGI +++ GKLI LSEQ+LVDC    
Sbjct: 123 LPATVDWRTKGAVTPIKDQGQCGCCWAFSAVAAMEGIVKLSTGKLISLSEQELVDCDVHG 182

Query: 189 DNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGD 248
           ++ GC GGLMD AF++II+N GL TE++YPY   +  C +       A+I  YED+P  +
Sbjct: 183 EDQGCEGGLMDDAFKFIIKNGGLTTESNYPYAAADDKCKSVSNS--VASIKGYEDVPANN 240

Query: 249 EQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKY 308
           E AL++AV+NQPVSV VD     F FYK GV+   CG + DHG+  +G+G A +  G KY
Sbjct: 241 EAALMKAVANQPVSVAVDGGDMTFQFYKGGVMTGSCGTDLDHGIVAIGYGKASD--GTKY 298

Query: 309 WLIKNSWGETWGESGYIRILRDA----GLCGIATAASYPVA 345
           WL+KNSWG TWGE+G++R+ +D     G+CG+A   SYP A
Sbjct: 299 WLLKNSWGTTWGENGFLRMEKDISDKRGMCGLAMEPSYPTA 339


>gi|357113934|ref|XP_003558756.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like
           [Brachypodium distachyon]
          Length = 346

 Score =  342 bits (876), Expect = 2e-91,   Method: Compositional matrix adjust.
 Identities = 168/341 (49%), Positives = 227/341 (66%), Gaps = 16/341 (4%)

Query: 13  MFVIIILVITCASQVVSGRSMHEP--SIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNL 70
           +  I+  +  C++ V++ R + +   ++  +HEQWMAQ GR YKD  EKA RL +FK N+
Sbjct: 10  LVAIVGCLCLCSTAVLAARELGDADNAMAARHEQWMAQFGRVYKDPAEKAHRLEVFKANV 69

Query: 71  EYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT- 129
            +IE  N E N  + LG N+F+DLTN+EFRA  T  N+ +     + + P+ FKY +V+ 
Sbjct: 70  AFIESFNAE-NHEFWLGANQFADLTNDEFRASKT--NKGIKQGGVRDA-PTGFKYSDVSI 125

Query: 130 -DVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST 188
             +P S+DWR KGAVT IK+QGQCGSCWAFSAVAA EG+ +++ GKL+ LSEQ+LVDC  
Sbjct: 126 DALPASVDWRTKGAVTPIKNQGQCGSCWAFSAVAATEGVVKLSTGKLVSLSEQELVDCDV 185

Query: 189 D--NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPK 246
              + GC GG MD AF++II+N GL TEA+YPY  E+  C + +   VAATI  YED+P 
Sbjct: 186 HGVDQGCMGGWMDDAFKFIIKNGGLTTEANYPYTGEDDKCKSNETVNVAATIKGYEDVPA 245

Query: 247 GDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGA 306
            DE AL++AV++QPVSV VD     F  Y  GV+   CG   DHG+A +G+G     NG 
Sbjct: 246 NDESALMKAVAHQPVSVVVDGGDMTFQLYAGGVMTGSCGVEMDHGIAAIGYGAT--SNGT 303

Query: 307 KYWLIKNSWGETWGESGYIRILRDA----GLCGIATAASYP 343
           KYWL+KNSWG TWGE G++R+ +D     G+CG+A   SYP
Sbjct: 304 KYWLMKNSWGTTWGEKGFLRMAKDIPDKRGMCGLAMKPSYP 344


>gi|125547236|gb|EAY93058.1| hypothetical protein OsI_14861 [Oryza sativa Indica Group]
          Length = 339

 Score =  342 bits (876), Expect = 2e-91,   Method: Compositional matrix adjust.
 Identities = 165/342 (48%), Positives = 231/342 (67%), Gaps = 19/342 (5%)

Query: 13  MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEY 72
           +F I+  +  C++ + +     + ++  +HE+WMAQ+GR Y+D+ EKA R  +FK N+ +
Sbjct: 8   LFAILGCLCLCSAVLAARELSDDAAMAARHERWMAQYGRVYRDDAEKARRFEVFKANVAF 67

Query: 73  IEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRP-VPSVSRQSSRPSTFKYQNVT-- 129
           IE  N  GN  + LG N+F+DLTN+EFR  +T  N+  +PS +R    P+ F+Y+NV   
Sbjct: 68  IESFNA-GNHNFWLGVNQFADLTNDEFR--WTKTNKGFIPSTTRV---PTGFRYENVNID 121

Query: 130 DVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST- 188
            +P ++DWR KGAVT IKDQGQCG CWAFSAVAA+EGI +++ GKLI LSEQ+LVDC   
Sbjct: 122 ALPATVDWRTKGAVTPIKDQGQCGCCWAFSAVAAMEGIVKLSTGKLISLSEQELVDCDVH 181

Query: 189 -DNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKG 247
            ++ GC GGLMD AF++II+N GL TE++YPY   +  C +       A+I  YED+P  
Sbjct: 182 GEDQGCEGGLMDDAFKFIIKNGGLTTESNYPYAAADDKCKSVSNS--VASIKGYEDVPAN 239

Query: 248 DEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAK 307
           +E AL++AV+NQPVSV VD     F FYK GV+   CG + DHG+  +G+G A +  G K
Sbjct: 240 NEAALMKAVANQPVSVAVDGGDMTFQFYKGGVMTGSCGTDLDHGIVAIGYGKASD--GTK 297

Query: 308 YWLIKNSWGETWGESGYIRILRDA----GLCGIATAASYPVA 345
           YWL+KNSWG TWGE+G++R+ +D     G+CG+A   SYP A
Sbjct: 298 YWLLKNSWGTTWGENGFLRMEKDISDKRGMCGLAMEPSYPTA 339


>gi|400180347|gb|AFP73312.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score =  342 bits (876), Expect = 2e-91,   Method: Compositional matrix adjust.
 Identities = 160/340 (47%), Positives = 234/340 (68%), Gaps = 11/340 (3%)

Query: 10  IIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQN 69
           ++ + + +  VI+  +    GRS  + S+ E+HE WM++HGR YKDE+EK  R  IFK+N
Sbjct: 7   LMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFMIFKEN 66

Query: 70  LEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
           +++IE  NK GN +YKLG NEF+D+T++EF A +TG N P   +S      + FK  +++
Sbjct: 67  MKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEFKINDLS 126

Query: 130 D--VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCS 187
           D  +P+++DWRE GAVT +K QG+CG CWAFSAV ++EG  +I  G L+E SEQ+L+DC+
Sbjct: 127 DDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCT 186

Query: 188 TDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKG 247
           T+N+GC GG M  AF++IIEN G++ E+DY Y  ++ TC +Q EK  A  IS Y+ +P+G
Sbjct: 187 TNNYGCDGGFMTNAFDFIIENGGISRESDYEYLGQQYTCRSQ-EKTAAVQISSYQVVPEG 245

Query: 248 DEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAK 307
            E +LLQAV+ QPVS+ + AS +   FY  G  +  C +  +H V  +G+GT  +E G K
Sbjct: 246 -ETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHAVTAIGYGT--DEKGQK 301

Query: 308 YWLIKNSWGETWGESGYIRILRD----AGLCGIATAASYP 343
           YWL+KNSWG +WGE+G+++I+RD    +GLC IA  +SYP
Sbjct: 302 YWLLKNSWGTSWGENGFMKIIRDSGNPSGLCDIAKMSSYP 341


>gi|400180353|gb|AFP73315.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score =  341 bits (875), Expect = 2e-91,   Method: Compositional matrix adjust.
 Identities = 161/340 (47%), Positives = 234/340 (68%), Gaps = 11/340 (3%)

Query: 10  IIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQN 69
           ++ + + +  VI+  +    GRS  + S+ E+HE WM++HGR YKDE+EK  R  IFK+N
Sbjct: 7   LMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFMIFKEN 66

Query: 70  LEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
           +++IE  NK GN +YKLG NEF+D+T++EF A +TG N P   +S      + F   +++
Sbjct: 67  MKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEFIINDLS 126

Query: 130 D--VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCS 187
           D  +P+++DWRE GAVT +K QG+CG CWAFSAV ++EG  +I  G L+E SEQ+L+DC+
Sbjct: 127 DDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCT 186

Query: 188 TDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKG 247
           T+N+GC+GG M  AF++IIEN G++ E+DY Y  E+ TC +Q EK  A  IS Y+ +P+G
Sbjct: 187 TNNYGCNGGFMTNAFDFIIENGGISRESDYEYLGEQYTCRSQ-EKTAAVQISSYQVVPEG 245

Query: 248 DEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAK 307
            E +LLQAV+ QPVS+ + AS +   FY  G  +  C +  +H V  +G+GT  +E G K
Sbjct: 246 -ETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHAVTAIGYGT--DEKGQK 301

Query: 308 YWLIKNSWGETWGESGYIRILRD----AGLCGIATAASYP 343
           YWL+KNSWG +WGE+G+++I+RD    AGLC IA  +SYP
Sbjct: 302 YWLLKNSWGTSWGENGFMKIIRDSGNPAGLCDIAKMSSYP 341


>gi|400180365|gb|AFP73321.1| cysteine protease [Solanum peruvianum]
 gi|400180395|gb|AFP73336.1| cysteine protease [Solanum peruvianum]
 gi|400180405|gb|AFP73341.1| cysteine protease [Solanum peruvianum]
 gi|400180409|gb|AFP73343.1| cysteine protease [Solanum peruvianum]
 gi|400180411|gb|AFP73344.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score =  341 bits (875), Expect = 2e-91,   Method: Compositional matrix adjust.
 Identities = 160/340 (47%), Positives = 235/340 (69%), Gaps = 11/340 (3%)

Query: 10  IIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQN 69
           ++ + + +  VI+  +    GRS  + S+ E+HE WM++HGR YKDE+EK  R  IFK+N
Sbjct: 7   LMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFMIFKEN 66

Query: 70  LEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
           +++IE  NK GN +YKLG NEF+D+T++EF A +TG N P   +S      + FK  +++
Sbjct: 67  MKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEFKINDLS 126

Query: 130 D--VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCS 187
           D  +P+++DWRE GAVT +K QG+CG CWAFSAV ++EG  +I  G L+E SEQ+L+DC+
Sbjct: 127 DDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCT 186

Query: 188 TDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKG 247
           T+N+GC+GG M  AF++IIEN G++ E+DY Y  ++ TC +Q EK  A  IS Y+ +P+G
Sbjct: 187 TNNYGCNGGFMTNAFDFIIENGGISRESDYEYLGQQYTCRSQ-EKTAAVQISSYKVVPEG 245

Query: 248 DEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAK 307
            E +LLQAV+ QPVS+ + AS +   FY  G  +  C +  +H V  +G+GT  +E G K
Sbjct: 246 -ETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHAVTAIGYGT--DEKGQK 301

Query: 308 YWLIKNSWGETWGESGYIRILRD----AGLCGIATAASYP 343
           YWL+KNSWG +WGE+G+++I+RD    +GLC IA  +SYP
Sbjct: 302 YWLLKNSWGTSWGENGFMKIIRDSGNPSGLCDIAKMSSYP 341


>gi|400180403|gb|AFP73340.1| cysteine protease [Solanum peruvianum]
 gi|400180413|gb|AFP73345.1| cysteine protease [Solanum peruvianum]
 gi|400180415|gb|AFP73346.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score =  341 bits (875), Expect = 3e-91,   Method: Compositional matrix adjust.
 Identities = 161/340 (47%), Positives = 234/340 (68%), Gaps = 11/340 (3%)

Query: 10  IIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQN 69
           ++ + + +  VI+  +    GRS  + S+ E+HE WM++HGR YKDE+EK  R  IFK+N
Sbjct: 7   LMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFMIFKEN 66

Query: 70  LEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
           +++IE  NK GN +YKLG NEF+D+T++EF A +TG N P   +S      + FK  +++
Sbjct: 67  MKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEFKINDLS 126

Query: 130 D--VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCS 187
           D  +P+++DWRE GAVT +K QG+CG CWAFSAV ++EG  +I  G L+E SEQ+L+DC+
Sbjct: 127 DDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCT 186

Query: 188 TDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKG 247
           T+N+GC+GG M  AF++I EN G++ E+DY Y  E+ TC +Q EK  A  IS Y+ +P+G
Sbjct: 187 TNNYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQYTCRSQ-EKTAAVQISSYQVVPEG 245

Query: 248 DEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAK 307
            E +LLQAV+ QPVS+ + AS +   FY  G  +  C +  +H V  +G+GT  +E G K
Sbjct: 246 -ETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHAVTAIGYGT--DEKGQK 301

Query: 308 YWLIKNSWGETWGESGYIRILRD----AGLCGIATAASYP 343
           YWL+KNSWG +WGE+G+++I+RD    AGLC IA  +SYP
Sbjct: 302 YWLLKNSWGTSWGENGFMKIIRDSGNPAGLCDIAKMSSYP 341


>gi|20334375|gb|AAM19208.1|AF493233_1 cysteine protease [Solanum pennellii]
          Length = 337

 Score =  341 bits (875), Expect = 3e-91,   Method: Compositional matrix adjust.
 Identities = 164/347 (47%), Positives = 235/347 (67%), Gaps = 17/347 (4%)

Query: 1   MVLKFEKSFIIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKA 60
           M +K +   ++ + + +  VI+  +    GRS  + S+ E+HE WM++HGR YKDE+EK 
Sbjct: 1   MAMKID---LMSILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKG 57

Query: 61  MRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRP 120
            R  IFK+N+++IE  NK GN +YKLG NEF+D+T++EF A +TG N P   +S     P
Sbjct: 58  ERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLS-----P 112

Query: 121 STFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSE 180
           S     +  D+P+++DWRE GAVT +K+QGQCG CWAFSAV ++EG  +I  G L+E SE
Sbjct: 113 SPINDLSDDDMPSNLDWRESGAVTQVKNQGQCGCCWAFSAVGSLEGAYKIATGNLMEFSE 172

Query: 181 QQLVDCSTDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISK 240
           Q+L+DC+T+N+GC+GG M  AF++I EN G++ E+DY Y  ++ TC +Q EK  A  IS 
Sbjct: 173 QELLDCTTNNYGCNGGFMTNAFDFIKENGGISRESDYEYLGQQYTCRSQ-EKTAAVQISS 231

Query: 241 YEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTA 300
           Y+ +P+G E +LLQAV+ QPVS+ + AS +   FY  G  +  C N  +H V  +G+GT 
Sbjct: 232 YQVVPEG-ETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCANRINHAVTAIGYGT- 288

Query: 301 EEENGAKYWLIKNSWGETWGESGYIRILRD----AGLCGIATAASYP 343
            +E G KYWL+KNSWG +WGE G+++I+RD    AGLC IA  +SYP
Sbjct: 289 -DEKGQKYWLLKNSWGTSWGEDGFMKIIRDSGNPAGLCDIAKVSSYP 334


>gi|400180373|gb|AFP73325.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score =  341 bits (875), Expect = 3e-91,   Method: Compositional matrix adjust.
 Identities = 161/340 (47%), Positives = 233/340 (68%), Gaps = 11/340 (3%)

Query: 10  IIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQN 69
           ++ + + +  VI+  +    GRS  + S+ E+HE WM++HG  YKDE+EK  R  IFK+N
Sbjct: 7   LMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGHVYKDEVEKGERFMIFKEN 66

Query: 70  LEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
           +++IE  NK GN +YKLG NEF+D+T++EF A +TG N P   +S      + FK  +++
Sbjct: 67  MKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEFKINDLS 126

Query: 130 D--VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCS 187
           D  +P+++DWRE GAVT +K QGQCG CWAFSAV ++EG  +I  G L+E SEQ+L+DC+
Sbjct: 127 DDDMPSNLDWRESGAVTQVKHQGQCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCT 186

Query: 188 TDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKG 247
           T+N+GC GG M  AF++I EN G+++E+DY Y  E+ TC +Q EK  A  IS Y+ +P+G
Sbjct: 187 TNNYGCDGGFMTNAFDFIKENGGISSESDYEYLGEQYTCRSQ-EKTAAVQISSYQVVPEG 245

Query: 248 DEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAK 307
            E +LLQAV+ QPVS+ + AS +   FY  G  +  C +  +H V  +G+GT  +E G K
Sbjct: 246 -ETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHAVTAIGYGT--DEKGQK 301

Query: 308 YWLIKNSWGETWGESGYIRILRD----AGLCGIATAASYP 343
           YWL+KNSWG +WGE+G+++I+RD    AGLC IA  +SYP
Sbjct: 302 YWLLKNSWGTSWGENGFMKIIRDSGNPAGLCDIAKMSSYP 341


>gi|400180451|gb|AFP73362.1| cysteine protease [Solanum chilense]
          Length = 344

 Score =  341 bits (875), Expect = 3e-91,   Method: Compositional matrix adjust.
 Identities = 160/340 (47%), Positives = 234/340 (68%), Gaps = 11/340 (3%)

Query: 10  IIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQN 69
           ++ + + +  VI+  +    GRS  E S+ E+HE WM++HGR YKDE+EK  R  IFK+N
Sbjct: 7   LMNILITLFFVISMFNTQTRGRSQPELSVSERHELWMSRHGRVYKDEVEKGERFMIFKEN 66

Query: 70  LEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
           +++IE  NK GN +YKLG NEF+D+T++EF A +TG N P   +S      + FK  +++
Sbjct: 67  MKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEFKINDLS 126

Query: 130 D--VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCS 187
           D  +P+++DWRE GAVT +K QG+CG CWAFSAV ++EG  +I  G L+E SEQ+L+DC+
Sbjct: 127 DDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCT 186

Query: 188 TDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKG 247
           T+N+GC GG M  AF++I EN G+++E+DY Y  ++ TC +Q EK  A  IS Y+ +P+G
Sbjct: 187 TNNYGCDGGFMTNAFDFIKENGGISSESDYEYLGQQYTCRSQ-EKTAAVQISSYQVVPEG 245

Query: 248 DEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAK 307
            E +LLQAV+ QPVS+ + AS +   FY  G  +  C +  +H V  +G+GT  +E G K
Sbjct: 246 -ETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHAVTAIGYGT--DEKGQK 301

Query: 308 YWLIKNSWGETWGESGYIRILRD----AGLCGIATAASYP 343
           YWL+KNSWG +WGE+G+++I+RD    +GLC IA  +SYP
Sbjct: 302 YWLLKNSWGTSWGENGFMKIIRDSGDPSGLCDIAKMSSYP 341


>gi|400180437|gb|AFP73356.1| cysteine protease [Solanum pennellii]
          Length = 337

 Score =  341 bits (874), Expect = 3e-91,   Method: Compositional matrix adjust.
 Identities = 162/338 (47%), Positives = 231/338 (68%), Gaps = 14/338 (4%)

Query: 10  IIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQN 69
           ++ + + +  VI+  +    GRS  + S+ E+HE WM++HGR YKDE+EK  R  IFK+N
Sbjct: 7   LMSILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFMIFKEN 66

Query: 70  LEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
           +++IE  NK GN +YKLG NEF+D+T++EF A +TG N P   +S     PS     +  
Sbjct: 67  MKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLS-----PSPINDLSDD 121

Query: 130 DVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD 189
           D+P+++DWRE GAVT +K+QGQCG CWAFSAV ++EG  +I  G L+E SEQ+L+DC+T+
Sbjct: 122 DMPSNLDWRESGAVTQVKNQGQCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCTTN 181

Query: 190 NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDE 249
           N+GC+GG M  AF++I EN G++ E+DY Y  ++ TC +Q EK  A  IS Y+ +P+G E
Sbjct: 182 NYGCNGGFMTNAFDFIKENGGISRESDYEYLGQQYTCRSQ-EKTAAVQISSYQVVPEG-E 239

Query: 250 QALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYW 309
            +LLQAV+ QPVS+ + AS +   FY  G  +  C N  +H V  +G+GT  +E G KYW
Sbjct: 240 TSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCANRINHAVTAIGYGT--DEKGQKYW 296

Query: 310 LIKNSWGETWGESGYIRILRD----AGLCGIATAASYP 343
           L+KNSWG +WGE G+++I+RD    AGLC IA  +SYP
Sbjct: 297 LLKNSWGTSWGEDGFMKIIRDSGNPAGLCDIAKVSSYP 334


>gi|116309178|emb|CAH66275.1| OSIGBa0147O06.5 [Oryza sativa Indica Group]
          Length = 339

 Score =  341 bits (874), Expect = 3e-91,   Method: Compositional matrix adjust.
 Identities = 164/341 (48%), Positives = 228/341 (66%), Gaps = 17/341 (4%)

Query: 13  MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEY 72
           +F I+  +  C++ + +     + ++  +HE+WMAQ+GR YKD+ EKA R  +FK N+ +
Sbjct: 8   LFAILGCLCLCSAVLAARELSDDAAMAARHERWMAQYGRMYKDDAEKARRFEVFKANVAF 67

Query: 73  IEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT--D 130
           IE  N  GN  + LG N+F+DLTN+EFR+  T     +PS +R    P+ F+Y+NV    
Sbjct: 68  IESFNA-GNHKFWLGVNQFADLTNDEFRSTKTNKGF-IPSTTRV---PTGFRYENVNIDA 122

Query: 131 VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST-- 188
           +P ++DWR KG VT IKDQGQCG CWAFSAVAA+EGI +++ GKLI LSEQ+LVDC    
Sbjct: 123 LPATMDWRTKGVVTPIKDQGQCGCCWAFSAVAAMEGIVKLSTGKLISLSEQELVDCDVHG 182

Query: 189 DNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGD 248
           ++ GC GGLMD AF++II+N GL TE++YPY   +  C +       A+I  YED+P  +
Sbjct: 183 EDQGCEGGLMDDAFKFIIKNGGLTTESNYPYAAADDKCKSVSNS--VASIKGYEDVPANN 240

Query: 249 EQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKY 308
           E AL++AV+NQPVSV VD     F FYK GV+   CG + DHG+  +G+G A +  G KY
Sbjct: 241 EAALMKAVANQPVSVAVDGGDMTFQFYKGGVMTGSCGTDLDHGIVAIGYGKASD--GTKY 298

Query: 309 WLIKNSWGETWGESGYIRILRDA----GLCGIATAASYPVA 345
           WL+KNSWG TWGE+G++R+ +D     G+CG+A   SYP A
Sbjct: 299 WLLKNSWGTTWGENGFLRMEKDISDKRGMCGLAMEPSYPTA 339


>gi|302143412|emb|CBI21973.3| unnamed protein product [Vitis vinifera]
          Length = 320

 Score =  341 bits (874), Expect = 3e-91,   Method: Compositional matrix adjust.
 Identities = 170/337 (50%), Positives = 224/337 (66%), Gaps = 34/337 (10%)

Query: 15  VIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIE 74
           + ++ V+   +   + R++HE S+ E+HE WM Q+GR YKD  EK+ R  IFK N+  IE
Sbjct: 12  LALLFVLAAWASQATARNLHEASMYERHEDWMVQYGREYKDADEKSKRYKIFKDNVARIE 71

Query: 75  KANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTS 134
             NK  +++YKL  NEF+DLTNEEFRA     NR    +   S+  ++FKY+NVT VP++
Sbjct: 72  SFNKAMDKSYKLSINEFADLTNEEFRA---SRNRFKAHIC--STEATSFKYENVTAVPST 126

Query: 135 IDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST--DNHG 192
           +DWR+KGAVT IKDQGQCGSCWAFSAVAA+EGITQ++ GKLI LSEQ+LVDC T  ++ G
Sbjct: 127 VDWRKKGAVTPIKDQGQCGSCWAFSAVAAMEGITQLSTGKLISLSEQELVDCDTSGEDQG 186

Query: 193 CSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQAL 252
           C+                     +YPY   +GTC+ +K    AA I+ YED+P  +E+AL
Sbjct: 187 CT---------------------NYPYAGTDGTCNRKKAAHPAAKINGYEDVPANNEKAL 225

Query: 253 LQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIK 312
            +AV++QP++V +DA G  F FY SGV    CG   DHGV+ VG+GT+++  G KYWL+K
Sbjct: 226 QKAVAHQPIAVAIDAGGSEFQFYSSGVFTGQCGTELDHGVSAVGYGTSDD--GMKYWLVK 283

Query: 313 NSWGETWGESGYIRILRDA----GLCGIATAASYPVA 345
           NSWG  WGE GYIR+ RD     GLCGIA  ASYP A
Sbjct: 284 NSWGTGWGEEGYIRMQRDVTAKEGLCGIAMQASYPTA 320


>gi|400180351|gb|AFP73314.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score =  341 bits (874), Expect = 3e-91,   Method: Compositional matrix adjust.
 Identities = 161/340 (47%), Positives = 234/340 (68%), Gaps = 11/340 (3%)

Query: 10  IIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQN 69
           ++ + + +  VI+  +    GRS  + S+ E+HE WM++HGR YKDE+EK  R  IFK+N
Sbjct: 7   LMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFMIFKEN 66

Query: 70  LEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
           +++IE  NK GN +YKLG NEF+D+T++EF A +TG N P   +S      + FK  +++
Sbjct: 67  MKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEFKINDLS 126

Query: 130 D--VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCS 187
           D  +P+++DWRE GAVT +K QG+CG CWAFSAV ++EG  +I  G L+E SEQ+L+DC+
Sbjct: 127 DDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCT 186

Query: 188 TDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKG 247
           T+N+GC+GG M  AF++I EN G++ E+DY Y  E+ TC +Q EK  A  IS Y+ +P+G
Sbjct: 187 TNNYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQYTCRSQ-EKTAAVQISSYQVVPEG 245

Query: 248 DEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAK 307
            E +LLQAV+ QPVS+ + AS +   FY  G  +  C +  +H V  +G+GT  +E G K
Sbjct: 246 -ETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHAVTAIGYGT--DEKGQK 301

Query: 308 YWLIKNSWGETWGESGYIRILRD----AGLCGIATAASYP 343
           YWL+KNSWG +WGE+G+++I+RD    AGLC IA  +SYP
Sbjct: 302 YWLLKNSWGTSWGENGFMKIIRDYGNPAGLCDIAKMSSYP 341


>gi|400180428|gb|AFP73352.1| cysteine protease [Solanum corneliomuelleri]
          Length = 344

 Score =  341 bits (874), Expect = 3e-91,   Method: Compositional matrix adjust.
 Identities = 160/340 (47%), Positives = 234/340 (68%), Gaps = 11/340 (3%)

Query: 10  IIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQN 69
           ++ + + +  VI+  +     RS  + S+ E+HE WM++HGR YKDE+EK  R  IFK+N
Sbjct: 7   LMNILITLFFVISMFNTQTRARSQPKLSVSERHELWMSRHGRVYKDEVEKGERFMIFKEN 66

Query: 70  LEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
           +++IE  NK GN +YKLG NEF+D+T++EF A +TG N P   +S      + FK  +++
Sbjct: 67  MKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEFKINDLS 126

Query: 130 D--VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCS 187
           D  +P+++DWRE GAVT +K QG+CG CWAFSAV ++EG  +I  G L+E SEQ+L+DC+
Sbjct: 127 DDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCT 186

Query: 188 TDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKG 247
           T+N+GC+GG M  AF++IIEN G++ E+DY Y  E+ TC +Q EK  A  IS Y+ +P+G
Sbjct: 187 TNNYGCNGGFMTNAFDFIIENGGISRESDYEYLGEQYTCRSQ-EKTAAVQISSYQVVPEG 245

Query: 248 DEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAK 307
            E +LLQAV+ QPVS+ + AS +   FY  G  +  C +  +H V  +G+GT  +E G K
Sbjct: 246 -ETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHAVTAIGYGT--DEKGQK 301

Query: 308 YWLIKNSWGETWGESGYIRILRD----AGLCGIATAASYP 343
           YWL+KNSWG +WGE+G+++I+RD    +GLC IA  +SYP
Sbjct: 302 YWLLKNSWGTSWGENGFMKIIRDSGNPSGLCDIAKMSSYP 341


>gi|400180379|gb|AFP73328.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score =  341 bits (874), Expect = 4e-91,   Method: Compositional matrix adjust.
 Identities = 161/340 (47%), Positives = 233/340 (68%), Gaps = 11/340 (3%)

Query: 10  IIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQN 69
           ++ + + +  VIT  +    GRS  + S+ E+HE WM++HGR YKDE+EK  R  IFK+N
Sbjct: 7   LMNILITLFFVITMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFMIFKEN 66

Query: 70  LEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
           +++IE  NK GN +YKLG NEF+D+T++EF A +TG N P   +S      + FK  +++
Sbjct: 67  MKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEFKINDLS 126

Query: 130 D--VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCS 187
           D  +P+++DWRE GAVT +K QG+CG CWAFSAV ++EG  +I  G L+E SEQ+L+DC+
Sbjct: 127 DDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCT 186

Query: 188 TDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKG 247
           T+N+GC GG M  AF++I EN G++ E+DY Y  E+ TC +Q EK  A  IS Y+ +P+G
Sbjct: 187 TNNYGCDGGFMTNAFDFIKENGGISRESDYEYLGEQYTCRSQ-EKTAAVQISSYQVVPEG 245

Query: 248 DEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAK 307
            E +LLQAV+ QPVS+ + AS +   FY  G  +  C +  +H V  +G+GT  +E G K
Sbjct: 246 -ETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHAVTAIGYGT--DEKGQK 301

Query: 308 YWLIKNSWGETWGESGYIRILRD----AGLCGIATAASYP 343
           YWL+KNSWG +WGE+G+++I+RD    +GLC IA  +SYP
Sbjct: 302 YWLLKNSWGTSWGENGFMKIIRDSGNPSGLCDIAKMSSYP 341


>gi|400180445|gb|AFP73359.1| cysteine protease, partial [Solanum chilense]
          Length = 345

 Score =  340 bits (873), Expect = 4e-91,   Method: Compositional matrix adjust.
 Identities = 160/340 (47%), Positives = 234/340 (68%), Gaps = 11/340 (3%)

Query: 10  IIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQN 69
           ++ + + +  VI+  +    GRS  E S+ E+HE WM++HGR YKDE+EK  R  IFK+N
Sbjct: 7   LMNILITLFFVISMFNTQTRGRSQPELSVSERHELWMSRHGRVYKDEVEKGERFMIFKEN 66

Query: 70  LEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
           +++IE  NK GN +YKLG NEF+D+T++EF A +TG N P   +S      + FK  +++
Sbjct: 67  MKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEFKINDLS 126

Query: 130 D--VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCS 187
           D  +P+++DWRE GAVT +K QG+CG CWAFSAV ++EG  +I  G L+E SEQ+L+DC+
Sbjct: 127 DDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCT 186

Query: 188 TDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKG 247
           T+N+GC GG M  AF++I EN G+++E+DY Y  ++ TC +Q EK  A  IS Y+ +P+G
Sbjct: 187 TNNYGCDGGFMTNAFDFIKENGGISSESDYEYLGQQYTCRSQ-EKTAAVQISSYQVVPEG 245

Query: 248 DEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAK 307
            E +LLQAV+ QPVS+ + AS +   FY  G  +  C +  +H V  +G+GT  +E G K
Sbjct: 246 -ETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHAVTAIGYGT--DEKGQK 301

Query: 308 YWLIKNSWGETWGESGYIRILRD----AGLCGIATAASYP 343
           YWL+KNSWG +WGE+G+++I+RD    +GLC IA  +SYP
Sbjct: 302 YWLLKNSWGTSWGENGFMKIIRDSGDPSGLCDIAKMSSYP 341


>gi|171702841|dbj|BAG16376.1| cysteine protease [Brassica rapa var. perviridis]
          Length = 333

 Score =  340 bits (873), Expect = 4e-91,   Method: Compositional matrix adjust.
 Identities = 160/315 (50%), Positives = 219/315 (69%), Gaps = 11/315 (3%)

Query: 33  MHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANK-EGNRTYKLGTNEF 91
           + E ++ ++H +WM +HGR Y D  EK  R  +FK+N+E IE+ N  +   T+KL  N+F
Sbjct: 23  LDEVAMQKRHAEWMTEHGRVYADANEKNNRYAVFKRNVERIERLNDVQSGLTFKLAVNQF 82

Query: 92  SDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTD--VPTSIDWREKGAVTHIKDQ 149
           +DLTNEEFR++YTG+     SV    ++P++F+YQNV+   +P S+DWR+KGAVT IKDQ
Sbjct: 83  ADLTNEEFRSMYTGFKGN--SVLSSRTKPTSFRYQNVSSDALPVSVDWRKKGAVTPIKDQ 140

Query: 150 GQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTDNHGCSGGLMDKAFEYIIENK 209
           G CGSCWAFSAVAA+EG+ QI +GKLI LSEQ+LVDC T++ GC GGLMD AF Y I   
Sbjct: 141 GLCGSCWAFSAVAAIEGVAQIKKGKLISLSEQELVDCDTNDGGCMGGLMDTAFNYTITIG 200

Query: 210 GLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASG 269
           GL +E++YPY+   GTC+  K K +A +I  +ED+P  DE+AL++AV++ PVS+ +    
Sbjct: 201 GLTSESNYPYKSTNGTCNFNKTKQIATSIKGFEDVPANDEKALMKAVAHHPVSIGIAGGD 260

Query: 270 RAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILR 329
             F FY SGV + +C  + DHGV  VG+G    +NG KYW++KNSWG  WGE GY+RI +
Sbjct: 261 IGFQFYSSGVFSGECTTHLDHGVTAVGYG--RSKNGLKYWILKNSWGPKWGERGYMRIKK 318

Query: 330 DA----GLCGIATAA 340
           D     G CG+A  A
Sbjct: 319 DIKPKHGQCGLAMNA 333


>gi|400180453|gb|AFP73363.1| cysteine protease [Solanum chilense]
          Length = 344

 Score =  340 bits (873), Expect = 5e-91,   Method: Compositional matrix adjust.
 Identities = 160/340 (47%), Positives = 234/340 (68%), Gaps = 11/340 (3%)

Query: 10  IIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQN 69
           ++ + + +  VI+  +     RS  + S+ E+HE WM++HGR YKDE+EK  R  IFK+N
Sbjct: 7   LMNILITLFFVISMFNTQTRARSQPKLSVSERHELWMSRHGRVYKDEVEKGERFMIFKEN 66

Query: 70  LEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
           +++IE  NK GN +YKLG NEF+D+T++EF A +TG N P   +S      + FK  +++
Sbjct: 67  MKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPVSSTEFKINDLS 126

Query: 130 D--VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCS 187
           D  +P+++DWRE GAVT +K QG+CG CWAFSAV ++EG  +I  G L+E SEQ+L+DC+
Sbjct: 127 DDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCT 186

Query: 188 TDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKG 247
           T+N+GC+GG M  AF++IIEN G++ E+DY Y  E+ TC +Q EK  A  IS Y+ +P+G
Sbjct: 187 TNNYGCNGGFMTNAFDFIIENGGISRESDYEYLGEQYTCRSQ-EKTAAVQISSYQVVPEG 245

Query: 248 DEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAK 307
            E +LLQAV+ QPVS+ + AS +   FY  G  +  C +  +H V  +G+GT  +E G K
Sbjct: 246 -ETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHAVTAIGYGT--DEKGQK 301

Query: 308 YWLIKNSWGETWGESGYIRILRD----AGLCGIATAASYP 343
           YWL+KNSWG +WGE+G+++I+RD    +GLC IA  +SYP
Sbjct: 302 YWLLKNSWGTSWGENGFMKIIRDSGNPSGLCDIAKMSSYP 341


>gi|400180393|gb|AFP73335.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score =  340 bits (873), Expect = 5e-91,   Method: Compositional matrix adjust.
 Identities = 160/340 (47%), Positives = 234/340 (68%), Gaps = 11/340 (3%)

Query: 10  IIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQN 69
           ++ + + +  VI+  +    GRS  + S+ E+HE WM++HGR YKDE+EK  R  IFK+N
Sbjct: 7   LMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKVERFMIFKEN 66

Query: 70  LEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
           +++IE  NK GN +YKLG NEF+D+T++EF A +TG N P   +S      +  K  +++
Sbjct: 67  MKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTELKINDLS 126

Query: 130 D--VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCS 187
           D  +P+++DWRE GAVT +K QG+CG CWAFSAV ++EG  +I  G L+E SEQ+L+DC+
Sbjct: 127 DDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCT 186

Query: 188 TDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKG 247
           T+N+GC+GG M  AF++IIEN G++ E+DY Y  E+ TC +Q EK  A  IS Y+ +P+G
Sbjct: 187 TNNYGCNGGFMTNAFDFIIENGGISRESDYEYLGEQYTCRSQ-EKTAAVQISSYKVVPEG 245

Query: 248 DEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAK 307
            E +LLQAV+ QPVS+ + AS +   FY  G  +  C +  +H V  +G+GT  +E G K
Sbjct: 246 -ETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHAVTAIGYGT--DEKGQK 301

Query: 308 YWLIKNSWGETWGESGYIRILRD----AGLCGIATAASYP 343
           YWL+KNSWG +WGE+G+++I+RD    +GLC IA  +SYP
Sbjct: 302 YWLLKNSWGTSWGENGFMKIIRDSGNPSGLCDIAKMSSYP 341


>gi|400180407|gb|AFP73342.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score =  340 bits (873), Expect = 5e-91,   Method: Compositional matrix adjust.
 Identities = 160/340 (47%), Positives = 234/340 (68%), Gaps = 11/340 (3%)

Query: 10  IIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQN 69
           ++ + + +  VI+  +    GRS  + S+ E+HE WM++HGR YKDE+EK  R  IFK+N
Sbjct: 7   LMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKVERFMIFKEN 66

Query: 70  LEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
           +++IE  NK GN +YKLG NEF+D+T++EF A +TG N P   +S      + FK  +++
Sbjct: 67  MKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEFKINDLS 126

Query: 130 D--VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCS 187
           D  +P+++DWRE GAVT +K QG+CG CWAFSAV ++EG  +I  G L+E SEQ+L+DC+
Sbjct: 127 DDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCT 186

Query: 188 TDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKG 247
           T+N+GC+GG M  AF++I EN G++ E+DY Y  E+ TC +Q EK  A  IS Y+ +P+G
Sbjct: 187 TNNYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQYTCRSQ-EKTAAVQISSYQVVPEG 245

Query: 248 DEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAK 307
            E +LLQAV+ QPVS+ + AS +   FY  G  +  C +  +H V  +G+GT  +E G K
Sbjct: 246 -ETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHAVTAIGYGT--DEKGQK 301

Query: 308 YWLIKNSWGETWGESGYIRILRD----AGLCGIATAASYP 343
           YWL+KNSWG +WGE+G+++I+RD    +GLC IA  +SYP
Sbjct: 302 YWLLKNSWGTSWGENGFMKIIRDSGNPSGLCDIAKMSSYP 341


>gi|400180375|gb|AFP73326.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score =  340 bits (872), Expect = 5e-91,   Method: Compositional matrix adjust.
 Identities = 160/340 (47%), Positives = 233/340 (68%), Gaps = 11/340 (3%)

Query: 10  IIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQN 69
           ++ + + +  VI+  +     RS  + S+ E+HE WM++HGR YKDE+EK  R  IFK+N
Sbjct: 7   LMSILITLFFVISMFNSQTRARSQPKLSVSERHELWMSRHGRVYKDEVEKGERFMIFKEN 66

Query: 70  LEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
           +++IE  NK GN +YKLG NEF+D+T++EF A +TG N P   +S      + FK  +++
Sbjct: 67  IKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEFKINDLS 126

Query: 130 D--VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCS 187
           D  +P+++DWRE GAVT +K QG+CG CWAFSAV ++EG  +I  G L+E SEQ+L+DC+
Sbjct: 127 DDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCT 186

Query: 188 TDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKG 247
           T+N+GC GG M  AF++I EN G+++E+DY Y  E+ TC +Q EK  A  IS Y+ +P+G
Sbjct: 187 TNNYGCDGGFMTNAFDFIKENGGISSESDYEYLGEQYTCRSQ-EKTAAVQISSYQVVPEG 245

Query: 248 DEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAK 307
            E +LLQAV+ QPVS+ + AS +   FY  G  +  C +  +H V  +G+GT  +E G K
Sbjct: 246 -ETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHAVTAIGYGT--DEKGQK 301

Query: 308 YWLIKNSWGETWGESGYIRILRD----AGLCGIATAASYP 343
           YWL+KNSWG +WGE+G+++I+RD    AGLC IA  +SYP
Sbjct: 302 YWLLKNSWGTSWGENGFMKIIRDSGNPAGLCDIAKMSSYP 341


>gi|297740489|emb|CBI30671.3| unnamed protein product [Vitis vinifera]
          Length = 320

 Score =  340 bits (872), Expect = 5e-91,   Method: Compositional matrix adjust.
 Identities = 176/337 (52%), Positives = 222/337 (65%), Gaps = 32/337 (9%)

Query: 15  VIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIE 74
           + ++++   ASQ +S R++HE S+ E+HE WM  +GRTYKD  EK  R  IFK+N+EYIE
Sbjct: 10  ITLLIMGVWASQALS-RTLHEVSMSERHEDWMGLYGRTYKDIAEKERRFKIFKENVEYIE 68

Query: 75  KANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTS 134
             NK                    F+A   GYN    S   +SS  ++F+Y+NV  VP+S
Sbjct: 69  SVNK--------------------FKASRNGYNM---SSRPRSSEITSFRYENVAAVPSS 105

Query: 135 IDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST--DNHG 192
           +DWR+KGAVT IKDQGQCG CWAFSAVAA+EG+TQ+  G+LI LSEQ+LVDC T  ++ G
Sbjct: 106 MDWRKKGAVTPIKDQGQCGCCWAFSAVAAMEGVTQLKTGELISLSEQELVDCDTSGEDQG 165

Query: 193 CSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQAL 252
           C GGLMD AFE+II N GL TEA+YPY+  + TC+ +K  + AA I  YED+P   E AL
Sbjct: 166 CGGGLMDSAFEFIIGNGGLTTEANYPYKGVDATCNKKKAASSAAKIKNYEDVPANSEAAL 225

Query: 253 LQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIK 312
           L+AV+  PVSV +DA G  F FY SGV    CG   DHGV  VG+G  + ++G KYWL+K
Sbjct: 226 LKAVAQHPVSVAIDAGGSDFQFYSSGVFTGQCGTELDHGVTAVGYG--KTDDGTKYWLVK 283

Query: 313 NSWGETWGESGYIRILR----DAGLCGIATAASYPVA 345
           NSWG  WGE GYI + R    D GLCGIA  ASYP A
Sbjct: 284 NSWGTGWGEDGYIWMERDIGADEGLCGIAMEASYPTA 320


>gi|255557851|ref|XP_002519955.1| cysteine protease, putative [Ricinus communis]
 gi|223541001|gb|EEF42559.1| cysteine protease, putative [Ricinus communis]
          Length = 321

 Score =  340 bits (872), Expect = 5e-91,   Method: Compositional matrix adjust.
 Identities = 180/350 (51%), Positives = 226/350 (64%), Gaps = 37/350 (10%)

Query: 1   MVLKFEKSFIIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKA 60
           M L  EK   I + V+     T ASQ ++ + ++E ++VEKHEQWMA+HGRTY+D  EK 
Sbjct: 1   MALSLEKKLAIALLVVFS---TWASQAMARQLINEDALVEKHEQWMARHGRTYQDSEEKE 57

Query: 61  MRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRP 120
            R  IFK NLEYI+  NK  N+TY+LG N F+DL++EE+ A YT    PV          
Sbjct: 58  RRFQIFKSNLEYIDNFNKASNQTYQLGLNNFADLSHEEYVATYTARKMPV---------- 107

Query: 121 STFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSE 180
                    +VP SIDWR+ GAVT IK+Q QCG CWAFSA AAVEGI  +  G  + LS 
Sbjct: 108 ---------EVPESIDWRDHGAVTPIKNQYQCGCCWAFSAAAAVEGI--VANG--VSLSA 154

Query: 181 QQLVDCSTDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISK 240
           QQL+DC +DN GC GG M+ AF YII+N+G+A E DYPY+  +  C +   +  AA IS 
Sbjct: 155 QQLLDCVSDNQGCKGGWMNNAFNYIIQNQGIALETDYPYQQMQQMCSS---RMAAAQISG 211

Query: 241 YEDLPKGDEQALLQAVSNQPVSVCVDA-SGRAFHFYKSGVLN-ADCGNNCDHGVAVVGFG 298
           +ED+   DE+AL++AV+ QPVSV +DA S   F  YK GV   A CGN   H V +VG+G
Sbjct: 212 FEDVTPKDEEALMRAVAKQPVSVTIDATSNPNFKLYKEGVFTAAGCGNGHSHAVTLVGYG 271

Query: 299 TAEEENGAKYWLIKNSWGETWGESGYIRILRDAGL----CGIATAASYPV 344
           T+E+  G KYWL KNSWGETWGESGY+R+ RD GL    CGIA  ASYP 
Sbjct: 272 TSED--GTKYWLAKNSWGETWGESGYMRLQRDIGLEGGPCGIALYASYPT 319


>gi|20334377|gb|AAM19209.1|AF493234_1 cysteine protease [Solanum lycopersicum]
 gi|400180431|gb|AFP73353.1| cysteine protease [Solanum lycopersicum]
          Length = 345

 Score =  340 bits (872), Expect = 6e-91,   Method: Compositional matrix adjust.
 Identities = 163/341 (47%), Positives = 236/341 (69%), Gaps = 12/341 (3%)

Query: 10  IIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQN 69
           ++ + + +  VI+  +    GRS  + S+ E+HE WM++HGR YKDE+EK  R  IFK+N
Sbjct: 7   LMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFMIFKEN 66

Query: 70  LEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQN-V 128
           +++IE  NK GN +YKLG NEF+D+T++EF A +TG N P   +S      + FK  N +
Sbjct: 67  MKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEFKKINDL 126

Query: 129 TD--VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDC 186
           +D  +P+++DWRE GAVT +K QG+CG CWAFSAV ++EG  +I  G L+E SEQ+L+DC
Sbjct: 127 SDDYMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDC 186

Query: 187 STDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPK 246
           +T+N+GC+GG M  AF++IIEN G++ E+DY Y  ++ TC +Q EK  A  IS Y+ +P+
Sbjct: 187 TTNNYGCNGGFMTNAFDFIIENGGISRESDYEYLGQQYTCRSQ-EKTAAVQISSYQVVPE 245

Query: 247 GDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGA 306
           G E +LLQAV+ QPVS+ + AS +   FY  G  + +C +  +H V  +G+GT EE  G 
Sbjct: 246 G-ETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGNCADRINHAVTAIGYGTDEE--GQ 301

Query: 307 KYWLIKNSWGETWGESGYIRILRD----AGLCGIATAASYP 343
           KYWL+KNSWG +WGE+GY++I+RD    +GLC IA  +SYP
Sbjct: 302 KYWLLKNSWGTSWGENGYMKIIRDSGDPSGLCDIAKMSSYP 342


>gi|400180357|gb|AFP73317.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score =  340 bits (872), Expect = 6e-91,   Method: Compositional matrix adjust.
 Identities = 159/340 (46%), Positives = 234/340 (68%), Gaps = 11/340 (3%)

Query: 10  IIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQN 69
           ++ + + +  VI+  +     RS  + S+ E+HE WM++HGR YKDE+EK  R  IFK+N
Sbjct: 7   LMSILITLFFVISMFNSQTRARSQPKLSVSERHELWMSRHGRVYKDEVEKGERFMIFKEN 66

Query: 70  LEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
           +++IE  NK GN +YKLG NEF+D+T++EF A +TG N P   +S      + FK  +++
Sbjct: 67  MKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEFKINDLS 126

Query: 130 D--VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCS 187
           D  +P+++DWRE GAVT +K QG+CG CWAFSAV ++EG  +I  G L+E SEQ+L+DC+
Sbjct: 127 DDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCT 186

Query: 188 TDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKG 247
           T+N+GC+GG M  AF++IIEN G++ E+DY Y  ++ TC +Q EK  A  IS Y+ +P+G
Sbjct: 187 TNNYGCNGGFMTNAFDFIIENGGISRESDYEYLGQQYTCRSQ-EKTAAVQISSYKVVPEG 245

Query: 248 DEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAK 307
            E +LLQAV+ QPVS+ + AS +   FY  G  +  C +  +H V  +G+GT  +E G K
Sbjct: 246 -ETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHAVTAIGYGT--DEKGQK 301

Query: 308 YWLIKNSWGETWGESGYIRILRD----AGLCGIATAASYP 343
           YWL+KNSWG +WGE+G+++I+RD    +GLC IA  +SYP
Sbjct: 302 YWLLKNSWGTSWGENGFMKIIRDYGNPSGLCDIAKMSSYP 341


>gi|400180455|gb|AFP73364.1| cysteine protease [Solanum peruvianum]
 gi|400180459|gb|AFP73366.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score =  340 bits (871), Expect = 7e-91,   Method: Compositional matrix adjust.
 Identities = 160/340 (47%), Positives = 234/340 (68%), Gaps = 11/340 (3%)

Query: 10  IIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQN 69
           ++ + + +  VI+  +    GRS  + S+ E+HE WM++HGR YKDE+EK  R  IFK+N
Sbjct: 7   LMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFMIFKEN 66

Query: 70  LEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
           +++IE  NK GN +YKLG NEF+D+T++EF A +TG N P   +S      + FK  +++
Sbjct: 67  MKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEFKINDLS 126

Query: 130 D--VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCS 187
           D  +P+++DWRE GAVT +K QG+CG CWAFSAV ++EG  +I  G L+E SEQ+L+DC+
Sbjct: 127 DDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCT 186

Query: 188 TDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKG 247
           T+N+GC+GG M  AF++I EN G++ E+DY Y  E+ TC +Q EK  A  IS Y+ +P+G
Sbjct: 187 TNNYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQYTCRSQ-EKTAAVQISSYQVVPEG 245

Query: 248 DEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAK 307
            E +LLQAV+ QPVS+ + AS +   FY  G  +  C +  +H V  +G+GT  +E G K
Sbjct: 246 -ETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHAVTAIGYGT--DEKGQK 301

Query: 308 YWLIKNSWGETWGESGYIRILRD----AGLCGIATAASYP 343
           YWL+KNSWG +WGE+G+++I+RD    +GLC IA  +SYP
Sbjct: 302 YWLLKNSWGTSWGENGFMKIIRDSGDPSGLCDIAKMSSYP 341


>gi|400180359|gb|AFP73318.1| cysteine protease [Solanum peruvianum]
 gi|400180477|gb|AFP73375.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score =  340 bits (871), Expect = 7e-91,   Method: Compositional matrix adjust.
 Identities = 160/340 (47%), Positives = 233/340 (68%), Gaps = 11/340 (3%)

Query: 10  IIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQN 69
           ++ + + +  VI+  +     RS  + S+ E+HE WM++HGR YKDE+EK  R  IFK+N
Sbjct: 7   LMSILITLFFVISMFNSQTRARSQPKLSVSERHELWMSRHGRVYKDEVEKGERFMIFKEN 66

Query: 70  LEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
           +++IE  NK GN +YKLG NEF+D+T++EF A +TG N P   +S      + FK  +++
Sbjct: 67  MKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEFKINDLS 126

Query: 130 D--VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCS 187
           D  +P+++DWRE GAVT +K QG+CG CWAFSAV ++EG  +I  G L+E SEQ+L+DC+
Sbjct: 127 DDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCT 186

Query: 188 TDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKG 247
           T+N+GC+GG M  AF++I EN G++ E+DY Y  E+ TC +Q EK  A  IS Y+ +P+G
Sbjct: 187 TNNYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQYTCRSQ-EKTAAVQISSYQVVPEG 245

Query: 248 DEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAK 307
            E +LLQAV+ QPVS+ + AS +   FY  G  +  C +  +H V  +G+GT  +E G K
Sbjct: 246 -ETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHAVTAIGYGT--DEKGQK 301

Query: 308 YWLIKNSWGETWGESGYIRILRD----AGLCGIATAASYP 343
           YWL+KNSWG +WGE+G+++I+RD    AGLC IA  +SYP
Sbjct: 302 YWLLKNSWGTSWGENGFMKIIRDYGNPAGLCDIAKMSSYP 341


>gi|400180457|gb|AFP73365.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score =  340 bits (871), Expect = 8e-91,   Method: Compositional matrix adjust.
 Identities = 160/340 (47%), Positives = 234/340 (68%), Gaps = 11/340 (3%)

Query: 10  IIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQN 69
           ++ + + +  VI+  +    GRS  + S+ E+HE WM++HGR YKDE+EK  R  IFK+N
Sbjct: 7   LMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFMIFKEN 66

Query: 70  LEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
           +++IE  NK GN +YKLG NEF+D+T++EF A +TG N P   +S      + F   +++
Sbjct: 67  MKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEFIINDLS 126

Query: 130 D--VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCS 187
           D  +P+++DWRE GAVT +K QG+CG CWAFSAV ++EG  +I  G L+E SEQ+L+DC+
Sbjct: 127 DDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCT 186

Query: 188 TDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKG 247
           T+N+GC+GG M  AF++IIEN G++ E+DY Y  E+ TC +Q EK  A  IS Y+ +P+G
Sbjct: 187 TNNYGCNGGFMTNAFDFIIENGGISRESDYEYLGEQYTCRSQ-EKTAAVQISSYKVVPEG 245

Query: 248 DEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAK 307
            E +LLQAV+ QPVS+ + AS +   FY  G  +  C +  +H V  +G+GT  +E G K
Sbjct: 246 -ETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHAVTAIGYGT--DEKGQK 301

Query: 308 YWLIKNSWGETWGESGYIRILRD----AGLCGIATAASYP 343
           YWL+KNSWG +WGE+G+++I+RD    +GLC IA  +SYP
Sbjct: 302 YWLLKNSWGTSWGENGFMKIIRDSGNPSGLCDIAKMSSYP 341


>gi|400180381|gb|AFP73329.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score =  340 bits (871), Expect = 8e-91,   Method: Compositional matrix adjust.
 Identities = 159/340 (46%), Positives = 234/340 (68%), Gaps = 11/340 (3%)

Query: 10  IIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQN 69
           ++ + + +  VI+  +    GRS  + S+ E+HE WM++HGR YKDE+EK  R  IFK+N
Sbjct: 7   LMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFMIFKEN 66

Query: 70  LEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
           +++IE  NK GN +YKLG NEF+D+T++EF A +TG N P   +S      + FK  +++
Sbjct: 67  MKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEFKINDLS 126

Query: 130 D--VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCS 187
           D  +P+++DWRE GAVT +K QG+CG CWAFSAV ++EG  +I  G L+E SEQ+L+DC+
Sbjct: 127 DDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCT 186

Query: 188 TDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKG 247
           T+N+GC+GG M  AF++IIEN G++ E+DY Y  ++ TC +Q EK  A  IS Y+ +P+G
Sbjct: 187 TNNYGCNGGFMTNAFDFIIENGGISRESDYEYLGQQYTCRSQ-EKTAAVQISSYKVVPEG 245

Query: 248 DEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAK 307
            E +LLQAV+ QPVS+ + AS +   FY  G  +  C +  +H V  +G+GT  +E G K
Sbjct: 246 -ETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHAVTAIGYGT--DEKGQK 301

Query: 308 YWLIKNSWGETWGESGYIRILRD----AGLCGIATAASYP 343
           YWL+KNSWG +WGE+G+++I+RD    +GLC I   +SYP
Sbjct: 302 YWLLKNSWGTSWGENGFMKIIRDSGDPSGLCDITKMSSYP 341


>gi|400180417|gb|AFP73347.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score =  340 bits (871), Expect = 8e-91,   Method: Compositional matrix adjust.
 Identities = 160/340 (47%), Positives = 235/340 (69%), Gaps = 11/340 (3%)

Query: 10  IIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQN 69
           ++ + + +  VI+  +    GRS  + S+ E+HE WM++HGR YKDE+EK  R  IFK+N
Sbjct: 7   LMNILITVFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFMIFKEN 66

Query: 70  LEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
           +++IE  NK GN +YKLG NEF+D+T++EF A +TG N P   +S      + FK  +++
Sbjct: 67  MKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPLSSTEFKINDLS 126

Query: 130 D--VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCS 187
           D  +P+++DWRE GAVT +K QG+CG CWAFSAV ++EG  +I  G L+E SEQ+L+DC+
Sbjct: 127 DDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCT 186

Query: 188 TDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKG 247
           T+N+GC+GG M  AF++IIEN G++ E+DY Y  ++ TC +Q EK  A  IS Y+ +P+G
Sbjct: 187 TNNYGCNGGFMTNAFDFIIENGGISRESDYEYLGQQYTCRSQ-EKTAAVQISSYKVVPEG 245

Query: 248 DEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAK 307
            E +LLQAV+ QPVS+ + AS +   FY  G  +  C +  +H V  +G+GT  +E G K
Sbjct: 246 -ETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHAVTAIGYGT--DEKGQK 301

Query: 308 YWLIKNSWGETWGESGYIRILRD----AGLCGIATAASYP 343
           YWL+KNSWG +WGE+G+++I+RD    +GLC IA  +SYP
Sbjct: 302 YWLLKNSWGTSWGENGFMKIIRDSGNPSGLCDIAKMSSYP 341


>gi|224131910|ref|XP_002328138.1| predicted protein [Populus trichocarpa]
 gi|222837653|gb|EEE76018.1| predicted protein [Populus trichocarpa]
          Length = 349

 Score =  340 bits (871), Expect = 8e-91,   Method: Compositional matrix adjust.
 Identities = 173/354 (48%), Positives = 231/354 (65%), Gaps = 18/354 (5%)

Query: 1   MVLKFEKSFIIPMFVIIIL--VITCASQVVSGRSMHEPSI---VEKHEQWMAQHGRTYKD 55
           M L   K+  +  F  + +  V+     +V     H  S+   VE  E W++ HG+ Y  
Sbjct: 1   MALSVLKTSFLTFFASLFVCSVLAHDFSIVGYSPEHLTSVDKLVELFESWISGHGKAYNS 60

Query: 56  ELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSR 115
             EK  R  +FK+NL++I++ NKE   +Y LG NEF+DL++EEF++ + G     P   R
Sbjct: 61  LEEKLHRFEVFKENLKHIDQRNKEVT-SYWLGLNEFADLSHEEFKSKFLGL---YPEFPR 116

Query: 116 QSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKL 175
           + S    F Y++V D+P SIDWR+KGAVT +K+QG CGSCWAFS VAAVEGI QI  G L
Sbjct: 117 KKSS-EDFSYRDVVDLPKSIDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVAGNL 175

Query: 176 IELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAV 234
             LSEQQL+DC T  N+GC+GGLMD AFE+I+ N GL  E DYPY  EEGTCD ++E+  
Sbjct: 176 TSLSEQQLIDCDTSFNNGCNGGLMDYAFEFIVNNGGLHKEEDYPYLMEEGTCDEKREEME 235

Query: 235 AATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAV 294
             TIS Y D+P+ DEQ+LL+A+++QP+SV +DASGR F FY  GV +  CG + DHGVA 
Sbjct: 236 VVTISGYHDVPRNDEQSLLKALAHQPLSVAIDASGRDFQFYSGGVFSGPCGTDLDHGVAA 295

Query: 295 VGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRDA----GLCGIATAASYPV 344
           VG+G++   +G  Y ++KNSWG  WGE GY+R+ R+     GLCGI   ASYP 
Sbjct: 296 VGYGSS---SGIDYIIVKNSWGPKWGERGYLRMKRNTGKPEGLCGINKMASYPT 346


>gi|146216004|gb|ABQ10204.1| cysteine protease Cp6 [Actinidia deliciosa]
          Length = 461

 Score =  339 bits (870), Expect = 1e-90,   Method: Compositional matrix adjust.
 Identities = 168/319 (52%), Positives = 217/319 (68%), Gaps = 12/319 (3%)

Query: 32  SMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEF 91
           S  +  ++  +E W+ +HG++Y    EK  R  IFK NL +I++ N E +RTYK+G N F
Sbjct: 36  SRTDDEVMAMYESWLVKHGKSYNAIGEKEKRFQIFKDNLRFIDEHNAE-SRTYKVGLNRF 94

Query: 92  SDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT--DVPTSIDWREKGAVTHIKDQ 149
           +DLTN+E+R++Y G      S  R S++  + +Y  V    +P S+DWREKGAV  +KDQ
Sbjct: 95  ADLTNDEYRSMYLGAR--TGSRRRLSTQKRSDRYVPVAGESLPDSVDWREKGAVVGVKDQ 152

Query: 150 GQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIEN 208
           G CGSCWAFS +AAVEGI QI  G LI LSEQ+LVDC T  N GC+GGLMD AFE+II+N
Sbjct: 153 GSCGSCWAFSTIAAVEGINQIVTGDLISLSEQELVDCDTSYNEGCNGGLMDYAFEFIIKN 212

Query: 209 KGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDAS 268
            G+ TE DYPY   +G CD  ++ A   TI  YED+P  +EQAL +AV+NQPVSV ++AS
Sbjct: 213 GGIDTEEDYPYNARDGRCDQYRKNAKVVTIDDYEDVPVNNEQALQKAVANQPVSVAIEAS 272

Query: 269 GRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRIL 328
           G AF FY+SGV   +CG   DHGV  VG+GT   EN   YW++KNSWG +WGESGYIR+ 
Sbjct: 273 GMAFQFYESGVFTGNCGTALDHGVTAVGYGT---ENSVDYWIVKNSWGSSWGESGYIRME 329

Query: 329 RDAGL---CGIATAASYPV 344
           R+ G    CGIA   SYP+
Sbjct: 330 RNTGATGKCGIAVEPSYPI 348


>gi|242072392|ref|XP_002446132.1| hypothetical protein SORBIDRAFT_06g002150 [Sorghum bicolor]
 gi|241937315|gb|EES10460.1| hypothetical protein SORBIDRAFT_06g002150 [Sorghum bicolor]
          Length = 337

 Score =  339 bits (870), Expect = 1e-90,   Method: Compositional matrix adjust.
 Identities = 166/341 (48%), Positives = 227/341 (66%), Gaps = 23/341 (6%)

Query: 16  IIILVITCAS---QVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEY 72
            ++ ++ CAS    V++ R + + ++VE+HE WM ++GR YKD  EKA R   FK N+ +
Sbjct: 7   FLLAILGCASLCSSVLAARELSDAAMVERHENWMVEYGRVYKDAAEKARRFEAFKHNVAF 66

Query: 73  IEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPST-FKYQN--VT 129
           +E  N      + LG N+F+DLT EEF+A   G+      V      P+T FKY+N  V+
Sbjct: 67  VESFNTNKKNKFWLGVNQFADLTTEEFKA-NKGFKPTAEKV------PTTGFKYENLSVS 119

Query: 130 DVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD 189
            +PT++DWR KGAVT IK+QGQCG CWAFSAVAA+EGI +++ G LI LSEQ+LVDC T 
Sbjct: 120 ALPTAVDWRTKGAVTPIKNQGQCGCCWAFSAVAAMEGIVKLSTGNLISLSEQELVDCDTH 179

Query: 190 --NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKG 247
             + GC GG MD AFE++I+N GLATE++YPY+  +G C    +   AATI  +ED+P  
Sbjct: 180 SMDEGCEGGWMDSAFEFVIKNGGLATESNYPYKAVDGKCKGGSKS--AATIKGHEDVPVN 237

Query: 248 DEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAK 307
           +E AL++AV+NQPVSV VDAS R F  Y  GV+   CG   DHG+A +G+G   E +G K
Sbjct: 238 NEAALMKAVANQPVSVAVDASDRTFMLYSGGVMTGSCGTELDHGIAAIGYGM--ESDGTK 295

Query: 308 YWLIKNSWGETWGESGYIRILRDA----GLCGIATAASYPV 344
           YW++KNSWG TWGE G++R+ +D     G+CG+A   SYP 
Sbjct: 296 YWILKNSWGTTWGEKGFLRMEKDITDKRGMCGLAMKPSYPT 336


>gi|357458909|ref|XP_003599735.1| Cysteine proteinase [Medicago truncatula]
 gi|357474677|ref|XP_003607623.1| Cysteine proteinase [Medicago truncatula]
 gi|355488783|gb|AES69986.1| Cysteine proteinase [Medicago truncatula]
 gi|355508678|gb|AES89820.1| Cysteine proteinase [Medicago truncatula]
          Length = 342

 Score =  339 bits (870), Expect = 1e-90,   Method: Compositional matrix adjust.
 Identities = 167/343 (48%), Positives = 223/343 (65%), Gaps = 11/343 (3%)

Query: 8   SFIIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFK 67
           +FIIPMF  +I        V+S R + EP +  KHE+WM Q G++YKD  EK  R  IFK
Sbjct: 6   NFIIPMF--LIFTTWMLPYVMSSRVL-EPYLSNKHEKWMTQFGKSYKDAAEKEKRFQIFK 62

Query: 68  QNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQN 127
            N+E+IE  N  GN+ + L  N F+DLTNEEF+A   G N+ +       +  ++F+Y N
Sbjct: 63  NNVEFIELFNAVGNKPFNLSINHFADLTNEEFKASLNG-NKKLHDKFDILNETTSFRYHN 121

Query: 128 VTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCS 187
           VT VP S+DWR++GAVT IK+QG CGSCWAFS VA++EGI QIT G+L+ LSEQ+L+DC 
Sbjct: 122 VTSVPASMDWRKRGAVTPIKNQGSCGSCWAFSTVASIEGIHQITTGELVSLSEQELIDCV 181

Query: 188 TDNH-GCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPK 246
             N  GCSGG ++ AF++I +  G+A+E +YPY+  +  C  +KE    A I  YE +P 
Sbjct: 182 RGNSSGCSGGYLEDAFKFIAKKGGMASETNYPYKETDEKCKFKKESKHVAEIKGYEKVPS 241

Query: 247 GDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGA 306
             E  LL+AV+NQPVSV VDA    F FY  G+    CG + DH V +VG+G + +    
Sbjct: 242 NSENDLLKAVANQPVSVYVDAGDYVFQFYSGGIFTGKCGTDTDHVVTIVGYGVSLDY--T 299

Query: 307 KYWLIKNSWGETWGESGYIRILRDA----GLCGIATAASYPVA 345
           +YWL+KNSWG  WGE GY+++ R+     GLCGIAT  SYPVA
Sbjct: 300 EYWLVKNSWGTGWGEKGYMKLKRNVDSKKGLCGIATNPSYPVA 342


>gi|400180467|gb|AFP73370.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score =  339 bits (869), Expect = 1e-90,   Method: Compositional matrix adjust.
 Identities = 160/340 (47%), Positives = 233/340 (68%), Gaps = 11/340 (3%)

Query: 10  IIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQN 69
           ++ + + +  VI+  +     RS  + S+ E+HE WM++HGR YKDE+EK  R  IFK+N
Sbjct: 7   LMNILITLFFVISMFNTQTRARSQPKLSVSERHELWMSRHGRVYKDEVEKGERFMIFKEN 66

Query: 70  LEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
           +++IE  NK GN +YKLG NEF+D+T++EF A +TG N P   +S      + FK  +++
Sbjct: 67  MKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEFKINDLS 126

Query: 130 D--VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCS 187
           D  +P+++DWRE GAVT +K QG+CG CWAFSAV ++EG  +I  G L+E SEQ+L+DC+
Sbjct: 127 DDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCT 186

Query: 188 TDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKG 247
           T+N+GC+GG M  AF++I EN G++ E+DY Y  E+ TC +Q EK  A  IS Y+ +P+G
Sbjct: 187 TNNYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQYTCRSQ-EKTAAVQISSYQVVPEG 245

Query: 248 DEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAK 307
            E +LLQAV+ QPVS+ + AS +   FY  G  +  C +  +H V  +G+GT  +E G K
Sbjct: 246 -ETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHAVTAIGYGT--DEKGQK 301

Query: 308 YWLIKNSWGETWGESGYIRILRD----AGLCGIATAASYP 343
           YWL+KNSWG +WGE+G+++I+RD    AGLC IA  +SYP
Sbjct: 302 YWLLKNSWGTSWGENGFMKIIRDYGNPAGLCDIAKMSSYP 341


>gi|400180363|gb|AFP73320.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score =  339 bits (869), Expect = 1e-90,   Method: Compositional matrix adjust.
 Identities = 160/340 (47%), Positives = 233/340 (68%), Gaps = 11/340 (3%)

Query: 10  IIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQN 69
           ++ + + +  VI+  +    GRS  + S+ E+HE WM++HGR YKDE+EK  R  IFK+N
Sbjct: 7   LMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFMIFKEN 66

Query: 70  LEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
           +++IE  NK GN +YKLG NEF+D+T++EF A +TG N P   +S      +  K  +++
Sbjct: 67  MKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTELKINDLS 126

Query: 130 D--VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCS 187
           D  +P+++DWRE GAVT +K QG+CG CWAFSAV ++EG  +I  G L+E SEQ+L+DC+
Sbjct: 127 DDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCT 186

Query: 188 TDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKG 247
           T+N+GC+GG M  AF++I EN G++ E+DY Y  E+ TC +Q EK  A  IS Y+ +P+G
Sbjct: 187 TNNYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQYTCRSQ-EKTAAVQISSYQVVPEG 245

Query: 248 DEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAK 307
            E +LLQAV+ QPVS+ + AS +   FY  G  +  C +  +H V  +G+GT  +E G K
Sbjct: 246 -ETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHAVTAIGYGT--DEKGQK 301

Query: 308 YWLIKNSWGETWGESGYIRILRD----AGLCGIATAASYP 343
           YWL+KNSWG +WGE+G+++I+RD    AGLC IA  +SYP
Sbjct: 302 YWLLKNSWGTSWGENGFMKIIRDYGNPAGLCDIAKMSSYP 341


>gi|400180349|gb|AFP73313.1| cysteine protease [Solanum peruvianum]
 gi|400180469|gb|AFP73371.1| cysteine protease [Solanum peruvianum]
 gi|400180471|gb|AFP73372.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score =  338 bits (868), Expect = 2e-90,   Method: Compositional matrix adjust.
 Identities = 160/340 (47%), Positives = 233/340 (68%), Gaps = 11/340 (3%)

Query: 10  IIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQN 69
           ++ + + +  VI+  +    GRS  + S+ E+HE WM++HGR YKDE+EK  R  IFK+N
Sbjct: 7   LMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKVERFMIFKEN 66

Query: 70  LEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
           +++IE  NK GN +YKLG NEF+D+T++EF A +TG N P   +S      +  K  +++
Sbjct: 67  MKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTELKINDLS 126

Query: 130 D--VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCS 187
           D  +P+++DWRE GAVT +K QG+CG CWAFSAV ++EG  +I  G L+E SEQ+L+DC+
Sbjct: 127 DDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCT 186

Query: 188 TDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKG 247
           T+N+GC+GG M  AF++I EN G++ E+DY Y  E+ TC +Q EK  A  IS Y+ +P+G
Sbjct: 187 TNNYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQYTCRSQ-EKTAAVQISSYQVVPEG 245

Query: 248 DEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAK 307
            E +LLQAV+ QPVS+ + AS +   FY  G  +  C +  +H V  +G+GT  +E G K
Sbjct: 246 -ETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHAVTAIGYGT--DEKGQK 301

Query: 308 YWLIKNSWGETWGESGYIRILRD----AGLCGIATAASYP 343
           YWL+KNSWG +WGE+G+++I+RD    AGLC IA  +SYP
Sbjct: 302 YWLLKNSWGTSWGENGFMKIIRDYGNPAGLCDIAKMSSYP 341


>gi|400180383|gb|AFP73330.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score =  338 bits (868), Expect = 2e-90,   Method: Compositional matrix adjust.
 Identities = 159/340 (46%), Positives = 234/340 (68%), Gaps = 11/340 (3%)

Query: 10  IIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQN 69
           ++ + + +  VI+  +    GRS  + S+ E+HE WM++HGR YKDE+EK  R  IFK+N
Sbjct: 7   LMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFMIFKEN 66

Query: 70  LEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
           +++IE  NK GN +YKLG NEF+D+T++EF A +TG N P   +S      + F   +++
Sbjct: 67  MKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEFIINDLS 126

Query: 130 D--VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCS 187
           D  +P+++DWRE GAVT +K QG+CG CWAFSAV ++EG  +I  G L+E SEQ+L+DC+
Sbjct: 127 DDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCT 186

Query: 188 TDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKG 247
           T+N+GC+GG M  AF++IIEN G++ E+DY Y  ++ TC +Q EK  A  IS Y+ +P+G
Sbjct: 187 TNNYGCNGGFMTNAFDFIIENGGISRESDYEYLGQQYTCRSQ-EKTAAVQISSYKVVPEG 245

Query: 248 DEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAK 307
            E +LLQAV+ QPVS+ + AS +   FY  G  +  C +  +H V  +G+GT  +E G K
Sbjct: 246 -ETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHAVTAIGYGT--DEKGQK 301

Query: 308 YWLIKNSWGETWGESGYIRILRD----AGLCGIATAASYP 343
           YWL+KNSWG +WGE+G+++I+RD    +GLC IA  +SYP
Sbjct: 302 YWLLKNSWGTSWGENGFMKIIRDSGNPSGLCDIAKMSSYP 341


>gi|400180369|gb|AFP73323.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score =  338 bits (868), Expect = 2e-90,   Method: Compositional matrix adjust.
 Identities = 159/340 (46%), Positives = 234/340 (68%), Gaps = 11/340 (3%)

Query: 10  IIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQN 69
           ++ + + +  VI+  +    GRS  + S+ E+HE WM++HGR YKDE+EK  R  IFK+N
Sbjct: 7   LMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFMIFKEN 66

Query: 70  LEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
           +++IE  NK GN +YKLG NEF+D+T++EF A +TG N P   +S      + FK  +++
Sbjct: 67  MKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEFKINDLS 126

Query: 130 D--VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCS 187
           D  +P+++DWRE GAVT +K QG+CG CWAFSAV ++EG  +I  G L+E SEQ+L+DC+
Sbjct: 127 DDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCT 186

Query: 188 TDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKG 247
           T+N+GC+GG M  AF++I EN G++ E+DY Y  ++ TC +Q EK  A  IS Y+ +P+G
Sbjct: 187 TNNYGCNGGFMTNAFDFIKENGGISRESDYEYLGQQYTCRSQ-EKTAAVQISSYKVVPEG 245

Query: 248 DEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAK 307
            E +LLQAV+ QPVS+ + AS +   FY  G  +  C +  +H V  +G+GT  +E G K
Sbjct: 246 -ETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHAVTAIGYGT--DEKGQK 301

Query: 308 YWLIKNSWGETWGESGYIRILRD----AGLCGIATAASYP 343
           YWL+KNSWG +WGE+G+++I+RD    +GLC IA  +SYP
Sbjct: 302 YWLLKNSWGTSWGENGFMKIIRDSGNPSGLCDIAKMSSYP 341


>gi|195628596|gb|ACG36128.1| vignain precursor [Zea mays]
          Length = 362

 Score =  338 bits (868), Expect = 2e-90,   Method: Compositional matrix adjust.
 Identities = 166/354 (46%), Positives = 234/354 (66%), Gaps = 24/354 (6%)

Query: 8   SFIIPMFVIIILVITCASQVVS--------GRSM--HEPSIVEKHEQWMAQHGRTYKDEL 57
           + ++ + VI I    C + V +        GR+    E  ++ ++++WMAQ+ R YKD+ 
Sbjct: 15  TLMLLLCVIAIADCICQAAVAARVEPSTTVGRTTGGDEAMMMARYKKWMAQYRRKYKDDA 74

Query: 58  EKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRP--VPSVSR 115
           EKA R  +FK N E+I+++N  G + Y LGTN+F+DLT++EF A+YTG  +P  VPS ++
Sbjct: 75  EKAHRFQVFKANAEFIDRSNAGGKKKYVLGTNQFADLTSKEFAAMYTGLRKPAAVPSGAK 134

Query: 116 QSSRPSTFKYQNVT--DVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRG 173
           Q   P+ FKYQN T  D    +DWR++GAVT +K+QGQCG CWAFSAV A+EG+  IT G
Sbjct: 135 QI--PAGFKYQNFTRLDDDVQVDWRQQGAVTPVKNQGQCGCCWAFSAVGAMEGLIMITTG 192

Query: 174 KLIELSEQQLVDC--STDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKE 231
            L+ LSEQQ++DC  S  N GC+GG MD AF+Y++ N G+ TE  YPY   +GTC N + 
Sbjct: 193 NLVSLSEQQILDCDESDGNQGCNGGYMDNAFQYVVNNGGVTTEDAYPYSAVQGTCQNVQP 252

Query: 232 KAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNAD-CGNNCDH 290
              AATIS ++DLP GDE AL  AV+NQPVSV VD     F FY+ G+ + D CG + +H
Sbjct: 253 ---AATISGFQDLPSGDENALANAVANQPVSVGVDGGSSPFQFYQGGIYDGDGCGTDMNH 309

Query: 291 GVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRDAGLCGIATAASYPV 344
            V  +G+G   ++ G +YW++KNSWG  WGE+G++++    G CGI+T ASYP 
Sbjct: 310 AVTAIGYGA--DDQGTQYWILKNSWGTGWGENGFMQLQMGVGACGISTMASYPT 361


>gi|302816909|ref|XP_002990132.1| hypothetical protein SELMODRAFT_428615 [Selaginella moellendorffii]
 gi|300142145|gb|EFJ08849.1| hypothetical protein SELMODRAFT_428615 [Selaginella moellendorffii]
          Length = 358

 Score =  338 bits (868), Expect = 2e-90,   Method: Compositional matrix adjust.
 Identities = 160/310 (51%), Positives = 210/310 (67%), Gaps = 12/310 (3%)

Query: 42  HEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRA 101
           +E+WM  HGR Y    EK  R  IF+ N EYIE+ N++ N+TY LG N F+D+T++EF+A
Sbjct: 34  YEKWMVDHGRVYNGIGEKERRFQIFRDNAEYIEEHNRQVNQTYWLGLNNFADMTHDEFKA 93

Query: 102 LYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAV 161
           LY G   P+ +  +     S F+Y++ T++P   DWR KGAV  +K+QG CGSCWAFS V
Sbjct: 94  LYFGTKVPLSNTIK-----SGFRYEDATNLPLDTDWRSKGAVATVKNQGACGSCWAFSTV 148

Query: 162 AAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGLATEADYPYR 220
           AAVEG+ QI  G+L+ LSEQ+LVDC    N GC+GGLMD AFE+II+N GL +EADYPY+
Sbjct: 149 AAVEGVNQIVTGELVSLSEQELVDCDKQKNQGCNGGLMDSAFEFIIQNGGLDSEADYPYK 208

Query: 221 HEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVL 280
              G+CD  +  +   TI  +ED+P   E  LL+AV+NQPVSV ++ASGR F  Y  GV 
Sbjct: 209 AVSGSCDESRRNSHVVTIDGFEDVPAESEADLLKAVANQPVSVAIEASGRNFQLYSGGVY 268

Query: 281 NADCGNNCDHGVAVVGFGTAEEENG--AKYWLIKNSWGETWGESGYIRILRDA----GLC 334
              CG   DHGV  VG+GT++  +G    YW+++NSWG+ WGESGYIR+ R+     G C
Sbjct: 269 TGHCGYELDHGVVAVGYGTSKTPDGVATDYWIVRNSWGDAWGESGYIRLQRNVASSRGKC 328

Query: 335 GIATAASYPV 344
           GIA  ASYPV
Sbjct: 329 GIAMMASYPV 338


>gi|400180385|gb|AFP73331.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score =  338 bits (867), Expect = 2e-90,   Method: Compositional matrix adjust.
 Identities = 159/340 (46%), Positives = 233/340 (68%), Gaps = 11/340 (3%)

Query: 10  IIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQN 69
           ++ + + +  VI+  +    GRS  + S+ E+HE WM++HGR YKDE+EK  R  IFK+N
Sbjct: 7   LMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFMIFKEN 66

Query: 70  LEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
           +++IE  NK GN +YKLG NEF+D+T++EF A +TG N P   +S      + FK  +++
Sbjct: 67  MKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEFKINDLS 126

Query: 130 D--VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCS 187
           D  +P+++DWRE GAVT +K QG+CG CWAFSAV ++EG  +I  G L+E SEQ+L+DC+
Sbjct: 127 DDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCT 186

Query: 188 TDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKG 247
           T+N+GC+GG M  AF++I EN G++ E+DY Y  E+ TC +Q EK  A  IS Y+ +P+G
Sbjct: 187 TNNYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQYTCRSQ-EKTAAVQISSYQVVPEG 245

Query: 248 DEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAK 307
            E +LLQAV+ QPVS+ + AS +   FY  G  +  C +  +H V  +G+GT  +E G K
Sbjct: 246 -ETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHAVTAIGYGT--DEKGQK 301

Query: 308 YWLIKNSWGETWGESGYIRILRD----AGLCGIATAASYP 343
           YWL+KNSWG +WGE+G+++I+RD    +GLC I   +SYP
Sbjct: 302 YWLLKNSWGTSWGENGFMKIIRDSGDPSGLCDITKMSSYP 341


>gi|400180463|gb|AFP73368.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score =  338 bits (867), Expect = 2e-90,   Method: Compositional matrix adjust.
 Identities = 159/340 (46%), Positives = 233/340 (68%), Gaps = 11/340 (3%)

Query: 10  IIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQN 69
           ++ + + +  VI+  +     RS  + S+ E+HE WM++HGR YKDE+EK  R  IFK+N
Sbjct: 7   LMNILITLFFVISMFNTQTRARSQPKLSVSERHELWMSRHGRVYKDEVEKGERFMIFKEN 66

Query: 70  LEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
           +++IE  NK GN +YKLG NEF+D+T++EF A +TG N P   +S      + FK  +++
Sbjct: 67  MKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEFKINDLS 126

Query: 130 D--VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCS 187
           D  +P+++DWRE GAVT +K QG+CG CWAFSAV ++EG  +I  G L+E SEQ+L+DC+
Sbjct: 127 DDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCT 186

Query: 188 TDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKG 247
           T+N+GC+GG M  AF++I EN G++ E+DY Y  E+ TC +Q EK  A  IS Y+ +P+G
Sbjct: 187 TNNYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQYTCRSQ-EKTAAVQISSYQVVPEG 245

Query: 248 DEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAK 307
            E +LLQAV+ QPVS+ + AS +   FY  G  +  C +  +H V  +G+GT  +E G K
Sbjct: 246 -ETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHAVTAIGYGT--DEKGQK 301

Query: 308 YWLIKNSWGETWGESGYIRILRD----AGLCGIATAASYP 343
           YWL+KNSWG +WGE+G+++I+RD    +GLC IA  +SYP
Sbjct: 302 YWLLKNSWGTSWGENGFMKIIRDSGDPSGLCDIAKMSSYP 341


>gi|30690594|ref|NP_564321.2| cysteine proteinase-like protein [Arabidopsis thaliana]
 gi|28393492|gb|AAO42167.1| putative cysteine proteinase [Arabidopsis thaliana]
 gi|332192920|gb|AEE31041.1| cysteine proteinase-like protein [Arabidopsis thaliana]
          Length = 355

 Score =  338 bits (867), Expect = 2e-90,   Method: Compositional matrix adjust.
 Identities = 176/347 (50%), Positives = 232/347 (66%), Gaps = 19/347 (5%)

Query: 13  MFVIIILVITC----ASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQ 68
           +F+++ L I       SQ  S  + HEP + E H+QWM +  R Y DELEK MR ++FK+
Sbjct: 14  LFMLVSLTILSMNLKVSQATSRVTFHEPIVAEHHQQWMTRFSRVYSDELEKQMRFDVFKK 73

Query: 69  NLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYN--RPVPSVSRQSSRPSTFKYQ 126
           NL++IEK NK+G+RTYKLG NEF+D T EEF A +TG      +PS         ++ + 
Sbjct: 74  NLKFIEKFNKKGDRTYKLGVNEFADWTREEFIATHTGLKGVNGIPSSEFVDEMIPSWNW- 132

Query: 127 NVTDVP--TSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLV 184
           NV+DV    + DWR +GAVT +K QGQCG CWAFS+VAAVEG+T+I    L+ LSEQQL+
Sbjct: 133 NVSDVAGRETKDWRYEGAVTPVKYQGQCGCCWAFSSVAAVEGLTKIVGNNLVSLSEQQLL 192

Query: 185 DCSTD-NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYED 243
           DC  + ++GC+GG+M  AF YII+N+G+A+EA YPY+  EGTC    +   +A I  ++ 
Sbjct: 193 DCDRERDNGCNGGIMSDAFSYIIKNRGIASEASYPYQAAEGTCRYNGKP--SAWIRGFQT 250

Query: 244 LPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNAD-CGNNCDHGVAVVGFGTAEE 302
           +P  +E+ALL+AVS QPVSV +DA G  F  Y  GV +   CG N +H V  VG+GT+ E
Sbjct: 251 VPSNNERALLEAVSKQPVSVSIDADGPGFMHYSGGVYDEPYCGTNVNHAVTFVGYGTSPE 310

Query: 303 ENGAKYWLIKNSWGETWGESGYIRILRDA----GLCGIATAASYPVA 345
             G KYWL KNSWGETWGE+GYIRI RD     G+CG+A  A YPVA
Sbjct: 311 --GIKYWLAKNSWGETWGENGYIRIRRDVAWPQGMCGVAQYAFYPVA 355


>gi|400180465|gb|AFP73369.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score =  338 bits (866), Expect = 3e-90,   Method: Compositional matrix adjust.
 Identities = 160/340 (47%), Positives = 233/340 (68%), Gaps = 11/340 (3%)

Query: 10  IIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQN 69
           ++ + + +  VI+  +    GRS  + S+ E+HE WM++HGR YKDE+EK  R  IFK+N
Sbjct: 7   LMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFMIFKEN 66

Query: 70  LEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
           +++IE  NK GN +YKLG NEF+D+T++EF A +TG N P   +S      + FK  +++
Sbjct: 67  MKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEFKINDLS 126

Query: 130 D--VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCS 187
           D  +P+++DWRE GAVT +K QG+CG CWAFSAV ++E   +I  G L+E SEQ+L+DC+
Sbjct: 127 DDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEVAYKIATGNLMEFSEQELLDCT 186

Query: 188 TDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKG 247
           T+N+GC+GG M  AF++I EN G++ E+DY Y  E+ TC +Q EK  A  IS Y+ +P+G
Sbjct: 187 TNNYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQYTCRSQ-EKTAAVQISSYQVVPEG 245

Query: 248 DEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAK 307
            E +LLQAV+ QPVS+ + AS +   FY  G  +  C +  +H V  +G+GT  +E G K
Sbjct: 246 -ETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHAVTAIGYGT--DEKGQK 301

Query: 308 YWLIKNSWGETWGESGYIRILRD----AGLCGIATAASYP 343
           YWL+KNSWG +WGE+G+++I+RD    AGLC IA  +SYP
Sbjct: 302 YWLLKNSWGTSWGENGFMKIIRDSGNPAGLCDIAKMSSYP 341


>gi|400180461|gb|AFP73367.1| cysteine protease [Solanum peruvianum]
 gi|400180473|gb|AFP73373.1| cysteine protease [Solanum peruvianum]
 gi|400180475|gb|AFP73374.1| cysteine protease [Solanum peruvianum]
 gi|400180479|gb|AFP73376.1| cysteine protease [Solanum peruvianum]
 gi|400180481|gb|AFP73377.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score =  338 bits (866), Expect = 3e-90,   Method: Compositional matrix adjust.
 Identities = 160/340 (47%), Positives = 233/340 (68%), Gaps = 11/340 (3%)

Query: 10  IIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQN 69
           ++ + + +  VI+  +    GRS  + S+ E+HE WM++HGR YKDE+EK  R  IFK+N
Sbjct: 7   LMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFMIFKEN 66

Query: 70  LEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
           +++IE  NK GN +YKLG NEF+D+T++EF A +TG N P   +S      + F   +++
Sbjct: 67  MKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEFIINDLS 126

Query: 130 D--VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCS 187
           D  +P+++DWRE GAVT +K QG+CG CWAFSAV ++EG  +I  G L+E SEQ+L+DC+
Sbjct: 127 DDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCT 186

Query: 188 TDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKG 247
           T+N+GC+GG M  AF++I EN G++ E+DY Y  E+ TC +Q EK  A  IS Y+ +P+G
Sbjct: 187 TNNYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQYTCRSQ-EKTAAVQISSYQVVPEG 245

Query: 248 DEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAK 307
            E +LLQAV+ QPVS+ + AS +   FY  G  +  C +  +H V  +G+GT  +E G K
Sbjct: 246 -ETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHAVTAIGYGT--DEKGQK 301

Query: 308 YWLIKNSWGETWGESGYIRILRD----AGLCGIATAASYP 343
           YWL+KNSWG +WGE+G+++I+RD    AGLC IA  +SYP
Sbjct: 302 YWLLKNSWGTSWGENGFMKIIRDYGNPAGLCDIAKMSSYP 341


>gi|302816222|ref|XP_002989790.1| hypothetical protein SELMODRAFT_184826 [Selaginella moellendorffii]
 gi|300142356|gb|EFJ09057.1| hypothetical protein SELMODRAFT_184826 [Selaginella moellendorffii]
          Length = 358

 Score =  338 bits (866), Expect = 3e-90,   Method: Compositional matrix adjust.
 Identities = 160/310 (51%), Positives = 210/310 (67%), Gaps = 12/310 (3%)

Query: 42  HEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRA 101
           +E+WM  HGR Y    EK  R  IF+ N EYIE+ N++ N+TY LG N F+D+T++EF+A
Sbjct: 34  YEKWMVDHGRVYNGIGEKERRFQIFRDNAEYIEEHNRQVNQTYWLGLNNFADMTHDEFKA 93

Query: 102 LYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAV 161
           LY G   P+ +  +     S F+Y++ T++P   DWR KGAV  +K+QG CGSCWAFS V
Sbjct: 94  LYFGTKVPLSNTIK-----SGFRYKDATNLPLDTDWRSKGAVATVKNQGACGSCWAFSTV 148

Query: 162 AAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGLATEADYPYR 220
           AAVEG+ QI  G+L+ LSEQ+LVDC    N GC+GGLMD AFE+II+N GL +EADYPY+
Sbjct: 149 AAVEGVNQIVTGELVSLSEQELVDCDKQKNQGCNGGLMDSAFEFIIQNGGLDSEADYPYK 208

Query: 221 HEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVL 280
              G+CD  +  +   TI  +ED+P   E  LL+AV+NQPVSV ++ASGR F  Y  GV 
Sbjct: 209 AVSGSCDESRRNSHVVTIDGFEDVPAESEADLLKAVANQPVSVAIEASGRNFQLYSGGVY 268

Query: 281 NADCGNNCDHGVAVVGFGTAEEENG--AKYWLIKNSWGETWGESGYIRILRDA----GLC 334
              CG   DHGV  VG+GT++  +G    YW+++NSWG+ WGESGYIR+ R+     G C
Sbjct: 269 TGHCGYELDHGVVAVGYGTSKTPDGVATDYWIVRNSWGDAWGESGYIRLQRNVASPRGKC 328

Query: 335 GIATAASYPV 344
           GIA  ASYPV
Sbjct: 329 GIAMMASYPV 338


>gi|356533293|ref|XP_003535200.1| PREDICTED: LOW QUALITY PROTEIN: cysteine proteinase RD21a-like
           [Glycine max]
          Length = 466

 Score =  338 bits (866), Expect = 3e-90,   Method: Compositional matrix adjust.
 Identities = 163/314 (51%), Positives = 213/314 (67%), Gaps = 17/314 (5%)

Query: 42  HEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRA 101
           +E W+ +HG+ Y    EK  R  IFK NL +IE+ N  G+++YKLG N+F+DLTNEE+RA
Sbjct: 48  YEAWLVKHGKAYNALGEKERRFKIFKDNLRFIEEHNGAGDKSYKLGLNKFADLTNEEYRA 107

Query: 102 LYTGYNRPVPS-----VSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCW 156
           ++ G     P      V++++ R   + Y+   ++P  +DWREKGAVT IKDQGQCGSCW
Sbjct: 108 MFLGTRTRGPKNKAAVVAKKTDR---YAYRAGEELPAMVDWREKGAVTPIKDQGQCGSCW 164

Query: 157 AFSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGLATEA 215
           AFS V AVEGI QI  G L  LSEQ+LVDC    N GC+GGLMD AFE+I++N G+ TE 
Sbjct: 165 AFSTVGAVEGINQIVTGNLTSLSEQELVDCDRGYNMGCNGGLMDYAFEFIVQNGGIDTEE 224

Query: 216 DYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFY 275
           DYPY  ++ TCD  ++ A   TI  YED+P  DE++L++AV+NQPVSV ++A G  F  Y
Sbjct: 225 DYPYHAKDNTCDPNRKNARVVTIDGYEDVPTNDEKSLMKAVANQPVSVAIEAGGMEFQLY 284

Query: 276 KSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD----- 330
           +SGV    CG N DHGV  VG+GT   ENG  YWL++NSWG  WGE+GYI++ R+     
Sbjct: 285 QSGVFTGRCGTNLDHGVVAVGYGT---ENGTDYWLVRNSWGSAWGENGYIKLERNVQNTE 341

Query: 331 AGLCGIATAASYPV 344
            G CGIA  ASYP+
Sbjct: 342 TGKCGIAIEASYPI 355


>gi|225458701|ref|XP_002284973.1| PREDICTED: cysteine proteinase RD21a-like [Vitis vinifera]
          Length = 467

 Score =  338 bits (866), Expect = 3e-90,   Method: Compositional matrix adjust.
 Identities = 170/359 (47%), Positives = 230/359 (64%), Gaps = 23/359 (6%)

Query: 2   VLKFEKSFIIPMFVIIILVITCASQVVSGRSMH--------EPSIVEKHEQWMAQHGRTY 53
           + +   S  + +F+++ L       ++     H        +  ++  +E W+A+HG++Y
Sbjct: 3   LCRSSSSMAVFLFLLLGLASALDMSIIGYDETHGDKSSWRTDEDVMAVYEAWLAKHGKSY 62

Query: 54  KDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSV 113
               EK  R  IFK NL +I++ N E NRTYK+G N F+DLTNEE+R++Y G      + 
Sbjct: 63  NALGEKERRFQIFKDNLRFIDEHNAE-NRTYKVGLNRFADLTNEEYRSMYLGTR---TAA 118

Query: 114 SRQSSRPSTFKYQ-NVTD-VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQIT 171
            R+SS   + +Y   V D +P S+DWR+KGAV  +KDQG CGSCWAFS +AAVEGI +I 
Sbjct: 119 KRRSSNKISDRYAFRVGDSLPESVDWRKKGAVVEVKDQGSCGSCWAFSTIAAVEGINKIV 178

Query: 172 RGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQK 230
            G LI LSEQ+LVDC T  N GC+GGLMD AFE+II N G+ +E DYPY+  +G CD  +
Sbjct: 179 TGGLISLSEQELVDCDTSYNEGCNGGLMDYAFEFIINNGGIDSEEDYPYKASDGRCDQYR 238

Query: 231 EKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDH 290
           + A   TI  YED+P+ DE++L +AV+NQPVSV ++A GR F  Y+SG+    CG   DH
Sbjct: 239 KNAKVVTIDGYEDVPENDEKSLEKAVANQPVSVAIEAGGREFQLYQSGIFTGRCGTALDH 298

Query: 291 GVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD-----AGLCGIATAASYPV 344
           GV  VG+GT   ENG  YW++KNSWG +WGE GYIR+ RD      G CGIA  ASYP+
Sbjct: 299 GVTAVGYGT---ENGVDYWIVKNSWGASWGEEGYIRMERDLATSATGKCGIAMEASYPI 354


>gi|400180419|gb|AFP73348.1| cysteine protease [Solanum lycopersicoides]
          Length = 343

 Score =  337 bits (865), Expect = 4e-90,   Method: Compositional matrix adjust.
 Identities = 163/350 (46%), Positives = 241/350 (68%), Gaps = 17/350 (4%)

Query: 1   MVLKFEKSFIIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKA 60
           M +K +   ++ + + +  VI+  +   + RS  + S+ E+HE WM++HGR YKDE+EK 
Sbjct: 1   MAMKID---LMSILITLFFVISMFNSQTTARSQPKLSVSERHELWMSRHGRVYKDEVEKG 57

Query: 61  MRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRP 120
            R  IFK+N+++IE  NK GN +YKLG NEF+D+T+EEF   +TG N  +PS    S   
Sbjct: 58  ERFMIFKENMKFIESVNKAGNLSYKLGINEFADITSEEFLTKFTGIN--IPSYLSPSPMS 115

Query: 121 ST-FKYQNVTD--VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIE 177
           ST FK  +++D  +P+++DWRE GAVT +K+QGQCG CWAFSAV ++EG  +I  G L+E
Sbjct: 116 STEFKINDLSDDDMPSNLDWRESGAVTQVKNQGQCGCCWAFSAVGSLEGAYKIATGNLME 175

Query: 178 LSEQQLVDCSTDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAAT 237
            SEQ+L+DC+T+N+GC+GG M  AF++I EN G+++E+DY Y+ ++ TC +Q EK  A  
Sbjct: 176 FSEQELLDCTTNNYGCNGGFMTNAFDFIKENGGISSESDYEYQGQQYTCRSQ-EKTAAVQ 234

Query: 238 ISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGF 297
           IS Y+ +P+G E +LLQAV+ QPVS+ + AS +   FY  G  +  C +  +H V  +G+
Sbjct: 235 ISSYQVVPEG-ETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHAVTAIGY 292

Query: 298 GTAEEENGAKYWLIKNSWGETWGESGYIRILRDA----GLCGIATAASYP 343
           GT  +E G KYWL+KNSWG +WGE+G+++I+RD+    G C IA  +SYP
Sbjct: 293 GT--DEKGQKYWLLKNSWGTSWGENGFMKIIRDSGNPGGHCDIAKMSSYP 340


>gi|356515052|ref|XP_003526215.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like [Glycine
           max]
          Length = 339

 Score =  337 bits (865), Expect = 4e-90,   Method: Compositional matrix adjust.
 Identities = 164/343 (47%), Positives = 226/343 (65%), Gaps = 11/343 (3%)

Query: 7   KSFIIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIF 66
           +SF    ++I+ L++T  +  V  R + E    E+HE+WMAQ+G+ Y D  EK  R  IF
Sbjct: 2   RSFSQNHYLILFLILTVWTFHVMSRRLSEVCTSERHEKWMAQYGKLYTDAAEKEKRFQIF 61

Query: 67  KQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQ 126
           K N+++IE  N  G++ + L  N+F+DL NEEF+A      +    V  +++  ++F+Y+
Sbjct: 62  KNNVQFIESFNAAGDKPFNLSINQFADLHNEEFKASLINVQKKESGV--ETATETSFRYE 119

Query: 127 NVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDC 186
           ++T +P ++DWR++GAVT IKDQG CGSCWAFS VAA+EGI QIT GKL+ LSEQ+LVDC
Sbjct: 120 SITKIPVTMDWRKRGAVTPIKDQGNCGSCWAFSTVAAIEGIHQITTGKLVSLSEQELVDC 179

Query: 187 -STDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLP 245
               + GC+ G  ++AFE++ +N GLA+E  YPY+    TC  +KE    A I  YE++P
Sbjct: 180 VKGKSEGCNFGYKEEAFEFVAKNGGLASEISYPYKANNKTCMVKKETQGVAQIKGYENVP 239

Query: 246 KGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENG 305
              E+ALL+AV+NQPVSV +DA   A  FY SG+    CG   +H V V+G+G A    G
Sbjct: 240 SNSEKALLKAVANQPVSVYIDAG--ALQFYSSGIFTGKCGTAPNHAVTVIGYGKA--RGG 295

Query: 306 AKYWLIKNSWGETWGESGYIRILRDA----GLCGIATAASYPV 344
           AKYWL+KNSWG  WGE GYI++ RD     GLCGIAT ASYP 
Sbjct: 296 AKYWLVKNSWGTKWGEKGYIKMKRDIRAKEGLCGIATNASYPT 338


>gi|310656790|gb|ADP02219.1| Peptidase_C1 domain-containing protein [Triticum aestivum]
          Length = 419

 Score =  337 bits (865), Expect = 4e-90,   Method: Compositional matrix adjust.
 Identities = 162/323 (50%), Positives = 212/323 (65%), Gaps = 13/323 (4%)

Query: 13  MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEY 72
           +  II  +  C+S V+S R + + ++VEKHEQWMA+  R YKD  EKA R   FK N+ +
Sbjct: 8   LLAIIGSICLCSSTVLSARELGDAAMVEKHEQWMAKFNRVYKDSTEKAQRFKAFKANVAF 67

Query: 73  IEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSR-PSTFKYQNVTD- 130
           IE  N  GN  + LG N+F+DLTN+EFRA  T        + R  +R P+ FKY NV+  
Sbjct: 68  IESFN-TGNHKFWLGVNQFTDLTNDEFRATKTN-----KGLKRNGARAPTRFKYNNVSTD 121

Query: 131 -VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD 189
            +P ++DWR KG VT IKDQGQCG CWAFSAVAA EGI +++ GKL+ LSEQ+LVDC   
Sbjct: 122 ALPAAVDWRTKGVVTPIKDQGQCGCCWAFSAVAATEGIVKLSTGKLVSLSEQELVDCDVH 181

Query: 190 --NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKG 247
             + GC GG MD AF++II+N GL TEA+YPY  ++G C         ATI  YED+P  
Sbjct: 182 GVDQGCEGGEMDNAFKFIIKNGGLTTEANYPYTAQDGQCKTSTTSNSVATIKGYEDVPAN 241

Query: 248 DEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAK 307
           DE +L++AV+NQPVSV VD     F  Y  GV+   CG + DHG+  +G+G   +  G K
Sbjct: 242 DESSLMKAVANQPVSVAVDGGDVIFQHYSGGVMTGSCGTDLDHGIVAIGYGMTSD--GTK 299

Query: 308 YWLIKNSWGETWGESGYIRILRD 330
           +WL+KNSWG TWGESGY+R+ +D
Sbjct: 300 FWLLKNSWGTTWGESGYLRMEKD 322


>gi|400180371|gb|AFP73324.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score =  337 bits (865), Expect = 4e-90,   Method: Compositional matrix adjust.
 Identities = 159/340 (46%), Positives = 233/340 (68%), Gaps = 11/340 (3%)

Query: 10  IIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQN 69
           ++ + + +  VI+  +    GRS  + S+ E+HE WM++HGR YKDE+EK  R  IFK+N
Sbjct: 7   LMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFMIFKEN 66

Query: 70  LEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
           +++IE  NK GN +YKLG NEF+D+T++EF A +TG N P   +S      + F   +++
Sbjct: 67  MKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEFIINDLS 126

Query: 130 D--VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCS 187
           D  +P+++DWRE GAVT +K QG+CG CWAFSAV ++EG  +I  G L+E SEQ+L+DC+
Sbjct: 127 DDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCT 186

Query: 188 TDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKG 247
           T+N+GC+GG M  AF++I EN G++ E+DY Y  E+ TC +Q EK  A  IS Y+ +P+G
Sbjct: 187 TNNYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQYTCRSQ-EKTAAVQISSYQVVPEG 245

Query: 248 DEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAK 307
            E +LLQAV+ QPVS+ + AS +   FY  G  +  C +  +H V  +G+GT  +E G K
Sbjct: 246 -ETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHAVTAIGYGT--DEKGQK 301

Query: 308 YWLIKNSWGETWGESGYIRILRD----AGLCGIATAASYP 343
           YWL+KNSWG +WGE+G+++I+RD    +GLC IA  +SYP
Sbjct: 302 YWLLKNSWGTSWGENGFMKIIRDSGNPSGLCDIAKMSSYP 341


>gi|147790682|emb|CAN61026.1| hypothetical protein VITISV_001146 [Vitis vinifera]
          Length = 469

 Score =  337 bits (864), Expect = 4e-90,   Method: Compositional matrix adjust.
 Identities = 166/318 (52%), Positives = 217/318 (68%), Gaps = 15/318 (4%)

Query: 35  EPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDL 94
           +  ++  +E W+A+HG++Y    EK  R  IFK NL +I++ N E NRTYK+G N F+DL
Sbjct: 46  DEDVMAVYEAWLAKHGKSYNALGEKERRFQIFKDNLRFIDEHNAE-NRTYKVGLNRFADL 104

Query: 95  TNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQ-NVTD-VPTSIDWREKGAVTHIKDQGQC 152
           TNEE+R++Y G      +  R+SS   + +Y   V D +P S+DWR+KGAV  +KDQG C
Sbjct: 105 TNEEYRSMYLGTR---TAAKRRSSNKISDRYAFRVGDSLPESVDWRKKGAVVEVKDQGSC 161

Query: 153 GSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGL 211
           GSCWAFS +AAVEGI +I  G LI LSEQ+LVDC T  N GC+GGLMD AFE+II N G+
Sbjct: 162 GSCWAFSTIAAVEGINKIVTGGLISLSEQELVDCDTSYNEGCNGGLMDYAFEFIINNGGI 221

Query: 212 ATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRA 271
            +E DYPY+  +G CD  ++ A   TI  YED+P+ DE++L +AV+NQPVSV ++A GR 
Sbjct: 222 DSEEDYPYKASDGRCDQYRKNAXVVTIDGYEDVPENDEKSLEKAVANQPVSVAIEAGGRE 281

Query: 272 FHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD- 330
           F  Y+SG+    CG   DHGV  VG+GT   ENG  YW++KNSWG +WGE GYIR+ RD 
Sbjct: 282 FQLYQSGIFTGRCGTALDHGVTAVGYGT---ENGVDYWIVKNSWGASWGEEGYIRMERDL 338

Query: 331 ----AGLCGIATAASYPV 344
                G CGIA  ASYP+
Sbjct: 339 ATSATGKCGIAMEASYPI 356


>gi|400180399|gb|AFP73338.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score =  337 bits (864), Expect = 5e-90,   Method: Compositional matrix adjust.
 Identities = 159/340 (46%), Positives = 233/340 (68%), Gaps = 11/340 (3%)

Query: 10  IIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQN 69
           ++ + + +  VI+  +    GRS  + S+ E+HE WM++HGR YKDE+EK  R  IFK+N
Sbjct: 7   LMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFMIFKEN 66

Query: 70  LEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
           +++IE  NK GN +YKLG NEF+D+T++EF A +TG N P   +S      + F   +++
Sbjct: 67  MKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEFIINDLS 126

Query: 130 D--VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCS 187
           D  +P+++DWRE GAVT +K QG+CG CWAFSAV ++EG  +I  G L+E SEQ+L+DC+
Sbjct: 127 DDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCT 186

Query: 188 TDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKG 247
           T+N+GC+GG M  AF++I EN G++ E+DY Y  E+ TC +Q EK  A  IS Y+ +P+G
Sbjct: 187 TNNYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQYTCRSQ-EKTAAVQISSYQVVPEG 245

Query: 248 DEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAK 307
            E +LLQAV+ QPVS+ + AS +   FY  G  +  C +  +H V  +G+GT  +E G K
Sbjct: 246 -ETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHAVTAIGYGT--DEKGQK 301

Query: 308 YWLIKNSWGETWGESGYIRILRD----AGLCGIATAASYP 343
           YWL+KNSWG +WGE+G+++I+RD    +GLC IA  +SYP
Sbjct: 302 YWLLKNSWGTSWGENGFMKIIRDSGDPSGLCDIAKMSSYP 341


>gi|224083868|ref|XP_002307151.1| predicted protein [Populus trichocarpa]
 gi|222856600|gb|EEE94147.1| predicted protein [Populus trichocarpa]
          Length = 298

 Score =  337 bits (864), Expect = 5e-90,   Method: Compositional matrix adjust.
 Identities = 165/314 (52%), Positives = 217/314 (69%), Gaps = 22/314 (7%)

Query: 38  IVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNE 97
           + E+HEQWMAQ+GR YKD+ EK  R NIFK+N+  I+  N +  ++Y LG N+F+DL+NE
Sbjct: 1   MYERHEQWMAQYGRVYKDDAEKETRYNIFKENVARIDAFNSQTGKSYNLGVNQFADLSNE 60

Query: 98  EFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWA 157
           EF+A     NR    +    + P  F+Y+NV+ VP ++DWR+KGAVT +KDQGQC     
Sbjct: 61  EFKA---SRNRFKGHMCSPQAGP--FRYENVSAVPATMDWRKKGAVTPVKDQGQC----- 110

Query: 158 FSAVAAVEGITQITRGKLIELSEQQLVDCST--DNHGCSGGLMDKAFEYIIENKGLATEA 215
              VAA+EGI Q+T GKLI LSEQ++VDC T  ++ GC+GGLMD AF++I +NKGL TEA
Sbjct: 111 ---VAAMEGINQLTTGKLISLSEQEVVDCDTKGEDQGCNGGLMDDAFKFIEQNKGLTTEA 167

Query: 216 DYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFY 275
           +YPY   +GTC+ QKE + AA I+ ++D+P   E AL++AV+ QPVSV +DA G  F FY
Sbjct: 168 NYPYTGTDGTCNTQKEVSHAAKITGFQDVPANSEAALMKAVAKQPVSVAIDAGGFEFQFY 227

Query: 276 KSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD----A 331
            SG+    CG   DHGV  VG+G ++   G KYWL+KNSWG  WGE GYIR+ +D     
Sbjct: 228 SSGIFTGSCGTELDHGVTAVGYGGSD---GTKYWLVKNSWGAQWGEEGYIRMQKDISAKE 284

Query: 332 GLCGIATAASYPVA 345
           GLCGIA  ASYP A
Sbjct: 285 GLCGIAMQASYPTA 298


>gi|357160599|ref|XP_003578815.1| PREDICTED: vignain-like [Brachypodium distachyon]
          Length = 339

 Score =  337 bits (863), Expect = 6e-90,   Method: Compositional matrix adjust.
 Identities = 167/341 (48%), Positives = 227/341 (66%), Gaps = 19/341 (5%)

Query: 15  VIIILVITC-ASQVVSGRSMHEP-SIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEY 72
           ++ IL   C  S V++ R +++  S+  +HE WMAQ+GR YKD  EKA +  +FK N  +
Sbjct: 8   ILAILGCLCFCSSVLAARELNDDLSMAARHETWMAQYGRVYKDAAEKAQKFEVFKANARF 67

Query: 73  IEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQN--VTD 130
           I+  N E N  + LG N+F+DLTNEEF+A  T  N+    +S ++   + FKY+N  +  
Sbjct: 68  IDSFNAE-NHKFWLGINQFADLTNEEFKATKT--NKGF--ISNKARVSTGFKYENLKIEA 122

Query: 131 VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST-- 188
           +PTSIDWR KGAVT +KDQGQCG CWAFSAVAA EGI +++ GKL+ LSEQ+LVDC    
Sbjct: 123 LPTSIDWRTKGAVTPVKDQGQCGCCWAFSAVAATEGIVKLSTGKLVSLSEQELVDCDVHG 182

Query: 189 DNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGD 248
           ++ GC GGLMD AF++II N GL  E+ YPY  E+G C +  +   A TI  YED+P  +
Sbjct: 183 EDQGCEGGLMDDAFKFIITNGGLTQESSYPYDAEDGKCKSGSKS--AGTIKSYEDVPANN 240

Query: 249 EQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKY 308
           E AL++AV+NQPVSV VD     F FY  GV+   CG + DHG+A +G+G   +  G K+
Sbjct: 241 EGALMKAVANQPVSVAVDGGDMTFQFYSGGVMTGSCGTDLDHGIAAIGYGVTSD--GTKF 298

Query: 309 WLIKNSWGETWGESGYIRILRD----AGLCGIATAASYPVA 345
           WL+KNSWG TWGE+G++R+ +D     G+CG+A   SYP A
Sbjct: 299 WLMKNSWGTTWGENGFLRMEKDIADKKGMCGLAMEPSYPTA 339


>gi|356515038|ref|XP_003526208.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like [Glycine
           max]
          Length = 339

 Score =  337 bits (863), Expect = 6e-90,   Method: Compositional matrix adjust.
 Identities = 164/343 (47%), Positives = 225/343 (65%), Gaps = 11/343 (3%)

Query: 7   KSFIIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIF 66
           +SF    ++I+ L++T  +  V  R + E    E+HE+WMAQ+G+ Y D  EK  R  IF
Sbjct: 2   RSFSQNHYLILFLILTVWTFHVMSRRLSEVCTSERHEKWMAQYGKLYTDAAEKEKRFQIF 61

Query: 67  KQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQ 126
           K N+++IE  N  G++ + L  N+F+DL NEEF+A      +    V  +++  ++F+Y+
Sbjct: 62  KNNVQFIESFNAAGDKPFNLSINQFADLHNEEFKASLINVQKKESGV--ETATETSFRYE 119

Query: 127 NVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDC 186
           ++T +P ++DWR++GAVT IKDQG CGSCWAFS VAA+EGI QIT GKL+ LSEQ+LVDC
Sbjct: 120 SITKIPVTMDWRKRGAVTPIKDQGNCGSCWAFSIVAAIEGIHQITTGKLVSLSEQELVDC 179

Query: 187 -STDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLP 245
               + GC+ G  ++AFE++ +N GLA+E  YPY+    TC  +KE    A I  YE++P
Sbjct: 180 VKGKSEGCNFGYKEEAFEFVAKNGGLASEISYPYKANNKTCMVKKETQGVAQIKGYENVP 239

Query: 246 KGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENG 305
              E+ALL+AV+NQPVSV +DA   A  FY SG+    CG   +H   V+G+G A    G
Sbjct: 240 SNSEKALLKAVANQPVSVYIDAG--ALQFYSSGIFTGKCGTAPNHAATVIGYGKA--RGG 295

Query: 306 AKYWLIKNSWGETWGESGYIRILRDA----GLCGIATAASYPV 344
           AKYWL+KNSWG  WGE GYIR+ RD     GLCGIAT ASYP 
Sbjct: 296 AKYWLVKNSWGTKWGEKGYIRMKRDIRAKEGLCGIATNASYPT 338


>gi|50355623|dbj|BAD29960.1| cysteine protease [Daucus carota]
          Length = 460

 Score =  336 bits (862), Expect = 1e-89,   Method: Compositional matrix adjust.
 Identities = 160/312 (51%), Positives = 210/312 (67%), Gaps = 9/312 (2%)

Query: 38  IVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNE 97
           I+  +E W+ +HG++Y    EK  R  IFK N  YI++ N   +R++KLG N F+DLTNE
Sbjct: 40  IMAAYESWLVKHGKSYNALGEKEQRFQIFKDNFLYIDEQNAAKDRSFKLGLNRFADLTNE 99

Query: 98  EFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWA 157
           E+R+ YTG  R   S  + S +   +       +P S+DWRE GAV  +KDQGQCGSCWA
Sbjct: 100 EYRSKYTGI-RTKDSRKKVSGKSQRYASLAGESLPESVDWREHGAVASVKDQGQCGSCWA 158

Query: 158 FSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGLATEAD 216
           FS ++AVEGI QI  GKLI LSEQ+LVDC    N GC+GGLMD AF++II N G+ ++AD
Sbjct: 159 FSTISAVEGINQIATGKLITLSEQELVDCDRSYNEGCNGGLMDDAFQFIINNGGIDSDAD 218

Query: 217 YPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYK 276
           YPY   +G CD  ++ A   TI  YED+P+ DE+AL +A +NQP+SV ++ASGR F FY 
Sbjct: 219 YPYTGRDGQCDQYRKNAKVVTIDSYEDVPEYDEKALQKAAANQPISVAIEASGRDFQFYD 278

Query: 277 SGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILR----DAG 332
           SG+    CG + DHGV VVG+GT   ENG  YW+++NSWG  WGE GY+R+ R     AG
Sbjct: 279 SGIFTGKCGTDLDHGVVVVGYGT---ENGKDYWIVRNSWGADWGEKGYLRMERGISSKAG 335

Query: 333 LCGIATAASYPV 344
           +CGI +  SYPV
Sbjct: 336 ICGITSEPSYPV 347


>gi|110737404|dbj|BAF00646.1| putative cysteine proteinase [Arabidopsis thaliana]
          Length = 345

 Score =  336 bits (861), Expect = 1e-89,   Method: Compositional matrix adjust.
 Identities = 174/348 (50%), Positives = 234/348 (67%), Gaps = 15/348 (4%)

Query: 8   SFIIPMFVIIILVITCASQVVSGRSM--HEPSIVEKHEQWMAQHGRTYKDELEKAMRLNI 65
           S ++ + V+IIL         + R++   E S+V+KHEQWMA+  R Y+DELEK MR ++
Sbjct: 3   SIMVLVTVLIILFTGFRISQATSRTVIFREQSMVDKHEQWMARFSREYRDELEKNMRRDV 62

Query: 66  FKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKY 125
           FK+NL++IE  NK+GN++YKLG NEF+D TNEEF A++TG  + +  VS       T   
Sbjct: 63  FKKNLKFIENFNKKGNKSYKLGVNEFADWTNEEFLAIHTGL-KGLTEVSPSKVVAKTISS 121

Query: 126 Q--NVTD-VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQ 182
           Q  NV+D V  S DWR +GAVT +K QGQCG CWAFSAVAAVEG+ +I  G L+ LSEQQ
Sbjct: 122 QTWNVSDMVVESKDWRAEGAVTPVKYQGQCGCCWAFSAVAAVEGVAKIAGGNLVSLSEQQ 181

Query: 183 LVDCSTD-NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKY 241
           L+DC  + +  C GG+M  AF Y+++N+G+A+E DY Y+  +G C +      AA IS +
Sbjct: 182 LLDCDREYDRDCDGGIMSDAFNYVVQNRGIASENDYSYQGSDGGCRSNARP--AARISGF 239

Query: 242 EDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAE 301
           + +P  +E+ALL+AVS QPVSV +DA+G  F  Y  GV +  CG + +H V  VG+GT++
Sbjct: 240 QTVPSNNERALLEAVSRQPVSVSMDATGDGFMHYSGGVYDGPCGTSSNHAVTFVGYGTSQ 299

Query: 302 EENGAKYWLIKNSWGETWGESGYIRILRDA----GLCGIATAASYPVA 345
           +  G KYWL KNSWGETW E GYIRI RD     G+CG+A  A YPVA
Sbjct: 300 D--GTKYWLAKNSWGETWEEKGYIRIRRDVAWPQGMCGVAQYAFYPVA 345


>gi|400180361|gb|AFP73319.1| cysteine protease [Solanum peruvianum]
 gi|400180397|gb|AFP73337.1| cysteine protease [Solanum peruvianum]
 gi|400180401|gb|AFP73339.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score =  335 bits (860), Expect = 1e-89,   Method: Compositional matrix adjust.
 Identities = 158/340 (46%), Positives = 232/340 (68%), Gaps = 11/340 (3%)

Query: 10  IIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQN 69
           ++ + + +  VI+  +    GRS  + S+ E+HE WM++HGR YKDE+EK  R  IFK+N
Sbjct: 7   LMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFMIFKEN 66

Query: 70  LEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
           +++IE  NK GN +YKLG NEF+D+T++EF A +TG N P   +S      + F   +++
Sbjct: 67  MKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEFIINDLS 126

Query: 130 D--VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCS 187
           D  +P+++DWRE GAVT +K QG+CG CWAFSAV ++EG  +I  G L+E SEQ+L+DC+
Sbjct: 127 DDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCT 186

Query: 188 TDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKG 247
           T+N+GC+GG M  AF++I EN G++ E+DY Y  E+ TC +Q EK  A  IS Y+ +P+G
Sbjct: 187 TNNYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQYTCRSQ-EKTAAVQISSYQVVPEG 245

Query: 248 DEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAK 307
            E +LLQAV+ QPVS+ + AS +   FY  G  +  C +  +H V  +G+GT  +E G K
Sbjct: 246 -ETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHAVTAIGYGT--DEKGQK 301

Query: 308 YWLIKNSWGETWGESGYIRILRD----AGLCGIATAASYP 343
           YWL+KNSWG +WGE+G+++I+RD    +GLC I   +SYP
Sbjct: 302 YWLLKNSWGTSWGENGFMKIIRDSGDPSGLCDITKMSSYP 341


>gi|255563134|ref|XP_002522571.1| cysteine protease, putative [Ricinus communis]
 gi|223538262|gb|EEF39871.1| cysteine protease, putative [Ricinus communis]
          Length = 343

 Score =  335 bits (860), Expect = 1e-89,   Method: Compositional matrix adjust.
 Identities = 180/347 (51%), Positives = 223/347 (64%), Gaps = 19/347 (5%)

Query: 8   SFIIPMFVIIILVI--TCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNI 65
           S  I   VI +L+I  T  SQ +    ++  +I EKHEQWMA+HGRTY D  EK  R  I
Sbjct: 4   SLQITKLVITLLMILGTWVSQAMPRPLLNAEAIAEKHEQWMARHGRTYHDNAEKERRFQI 63

Query: 66  FKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTF-- 123
           FK NL+YIE  NK  N+TYKLG N+FSDL+ EEF   Y GY  P    +  ++   TF  
Sbjct: 64  FKNNLDYIENFNKAFNKTYKLGLNKFSDLSEEEFVTTYNGYEMPTTLPTANTTVKPTFFS 123

Query: 124 KYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQL 183
            Y N  +VP SIDWRE G VT +K+QG+CG CWAFSAVAAVEGI     G    LS QQL
Sbjct: 124 NYYNQDEVPESIDWRENGVVTSVKNQGECGCCWAFSAVAAVEGIA----GNGASLSAQQL 179

Query: 184 VDCSTDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYED 243
           +DC  DN GC GG M KAFEYI++N+G+ ++ DYPY   +  C  +    VAA I+ YE 
Sbjct: 180 LDCVGDNSGCGGGTMIKAFEYIVQNQGIVSDTDYPYEQTQEMC--RSGSNVAARITGYES 237

Query: 244 LPKGDEQALLQAVSNQPVSVCVDA-SGRAFHFYKSGVLNA-DCGNNCDHGVAVVGFGTAE 301
           + +  E+AL +AV+ QP+SV +DA SG  F  Y SGV +A DCG +  H V +VG+GT E
Sbjct: 238 VIQ-SEEALKRAVAKQPISVAIDASSGPNFKSYISGVFSAEDCGTHLTHAVTLVGYGTTE 296

Query: 302 EENGAKYWLIKNSWGETWGESGYIRILRDAGL----CGIATAASYPV 344
           +  G KYWL+KNSWGE WGESGY+R+ RD G     CGIA  ASYP 
Sbjct: 297 D--GTKYWLVKNSWGEEWGESGYMRLQRDVGAMEGPCGIAMQASYPT 341


>gi|400180391|gb|AFP73334.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score =  335 bits (860), Expect = 1e-89,   Method: Compositional matrix adjust.
 Identities = 159/340 (46%), Positives = 232/340 (68%), Gaps = 11/340 (3%)

Query: 10  IIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQN 69
           ++ + + +  VI+  +     RS  + S+ E+HE WM++HGR YKDE+EK  R  IFK+N
Sbjct: 7   LMSILITLFFVISMFNSQTRARSQPKLSVSERHELWMSRHGRVYKDEVEKGERFMIFKEN 66

Query: 70  LEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
           +++IE  NK GN +YKLG NEF+D+T++EF A +TG N P   +S      + FK  +++
Sbjct: 67  MKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEFKINDLS 126

Query: 130 D--VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCS 187
           D  +P+++DWRE GAVT +K QG+CG CWAFSAV ++EG  +I  G L+E SEQ+L+DC+
Sbjct: 127 DDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCT 186

Query: 188 TDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKG 247
           T+N+GC+GG M  AF++I EN G++ E+DY Y  E+ TC +Q EK  A  IS Y+ +P+G
Sbjct: 187 TNNYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQYTCRSQ-EKTAAVQISSYQVVPEG 245

Query: 248 DEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAK 307
            E +LLQAV+ QPVS+ + AS +   F   G  +  C +  +H V  +G+GT  +E G K
Sbjct: 246 -ETSLLQAVTKQPVSIGIAAS-QDLQFCAGGTYDGSCADRINHAVTAIGYGT--DEKGQK 301

Query: 308 YWLIKNSWGETWGESGYIRILRD----AGLCGIATAASYP 343
           YWL+KNSWG +WGE+G+++I+RD    AGLC IA  +SYP
Sbjct: 302 YWLLKNSWGTSWGENGFMKIIRDYGNPAGLCDIAKMSSYP 341


>gi|400180387|gb|AFP73332.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score =  335 bits (859), Expect = 2e-89,   Method: Compositional matrix adjust.
 Identities = 159/340 (46%), Positives = 232/340 (68%), Gaps = 11/340 (3%)

Query: 10  IIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQN 69
           ++ + + +  VI+  +    GRS  + S+ E+HE WM++HGR YKDE+EK  R  IFK+N
Sbjct: 7   LMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKVERFMIFKEN 66

Query: 70  LEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
           +++IE  NK GN +YKLG NEF+D+T++EF A +TG N P   +S      +  K  +++
Sbjct: 67  MKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTELKINDLS 126

Query: 130 D--VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCS 187
           D  +P+++DW E GAVT +K QG+CG CWAFSAV ++EG  +I  G L+E SEQ+L+DC+
Sbjct: 127 DDDMPSNLDWIESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCT 186

Query: 188 TDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKG 247
           T+N+GC+GG M  AF++I EN G++ E+DY Y  E+ TC +Q EK  A  IS Y+ +P+G
Sbjct: 187 TNNYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQYTCRSQ-EKTAAVQISSYQVVPEG 245

Query: 248 DEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAK 307
            E +LLQAV+ QPVS+ + AS +   FY  G  +  C +  +H V  +G+GT  +E G K
Sbjct: 246 -ETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHAVTAIGYGT--DEKGQK 301

Query: 308 YWLIKNSWGETWGESGYIRILRD----AGLCGIATAASYP 343
           YWL+KNSWG +WGE+G+++I+RD    AGLC IA  +SYP
Sbjct: 302 YWLLKNSWGTSWGENGFMKIIRDYGNPAGLCDIAKMSSYP 341


>gi|357160572|ref|XP_003578808.1| PREDICTED: vignain-like [Brachypodium distachyon]
          Length = 339

 Score =  335 bits (859), Expect = 2e-89,   Method: Compositional matrix adjust.
 Identities = 164/342 (47%), Positives = 230/342 (67%), Gaps = 19/342 (5%)

Query: 13  MFVIIILVITCASQVVSGRSMHEP-SIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLE 71
           +  I+  +  C S V++ R +++  S+V +HE WM Q+GR YKD  EKA +  +FK N E
Sbjct: 8   LLAILGCLCLCGS-VLAARELNDDLSMVARHENWMLQYGRVYKDAAEKAQKFEVFKANAE 66

Query: 72  YIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT-- 129
           +I   N  GN  + LG N+F+D+TNEEF+A  T  N+    +S +   P+ F Y+N++  
Sbjct: 67  FINSFNA-GNHKFWLGINQFADITNEEFKATKT--NKGF--ISNKVRVPTGFMYENMSFD 121

Query: 130 DVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST- 188
            +P +IDWR KGAVT IKDQGQCG CWAFSAVAA+EGI +++ GKL+ LSEQ+LVDC   
Sbjct: 122 ALPATIDWRTKGAVTPIKDQGQCGCCWAFSAVAAMEGIVKLSTGKLVSLSEQELVDCDVH 181

Query: 189 -DNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKG 247
            ++ GC GGLMD AF++II+N GL  E++YPY   +G C  +   + AATI  YED+P  
Sbjct: 182 GEDQGCEGGLMDDAFKFIIKNGGLTQESNYPYDAADGKC--KSGSSSAATIKSYEDVPAN 239

Query: 248 DEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAK 307
           +E AL++AV+NQPVSV VD     F FY  GV+   CG + DHG+A +G+GT  +  G K
Sbjct: 240 NEGALMKAVANQPVSVAVDGGDMTFQFYSGGVMTGSCGTDLDHGIAAIGYGTTSD--GTK 297

Query: 308 YWLIKNSWGETWGESGYIRILRD----AGLCGIATAASYPVA 345
           +W++KNSWG +WGE+G++R+ +D     G+CG+A   SYP A
Sbjct: 298 FWIMKNSWGTSWGENGFLRMEKDIADKKGMCGLAMEPSYPTA 339


>gi|9502421|gb|AAF88120.1|AC021043_13 Putative cysteine proteinase [Arabidopsis thaliana]
          Length = 331

 Score =  335 bits (858), Expect = 2e-89,   Method: Compositional matrix adjust.
 Identities = 173/331 (52%), Positives = 225/331 (67%), Gaps = 15/331 (4%)

Query: 25  SQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTY 84
           SQ  S  + HEP + E H+QWM +  R Y DELEK MR ++FK+NL++IEK NK+G+RTY
Sbjct: 6   SQATSRVTFHEPIVAEHHQQWMTRFSRVYSDELEKQMRFDVFKKNLKFIEKFNKKGDRTY 65

Query: 85  KLGTNEFSDLTNEEFRALYTGYN--RPVPSVSRQSSRPSTFKYQNVTDVP--TSIDWREK 140
           KLG NEF+D T EEF A +TG      +PS         ++ + NV+DV    + DWR +
Sbjct: 66  KLGVNEFADWTREEFIATHTGLKGVNGIPSSEFVDEMIPSWNW-NVSDVAGRETKDWRYE 124

Query: 141 GAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMD 199
           GAVT +K QGQCG CWAFS+VAAVEG+T+I    L+ LSEQQL+DC  + ++GC+GG+M 
Sbjct: 125 GAVTPVKYQGQCGCCWAFSSVAAVEGLTKIVGNNLVSLSEQQLLDCDRERDNGCNGGIMS 184

Query: 200 KAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQ 259
            AF YII+N+G+A+EA YPY+  EGTC  +     +A I  ++ +P  +E+ALL+AVS Q
Sbjct: 185 DAFSYIIKNRGIASEASYPYQAAEGTC--RYNGKPSAWIRGFQTVPSNNERALLEAVSKQ 242

Query: 260 PVSVCVDASGRAFHFYKSGVLNAD-CGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGET 318
           PVSV +DA G  F  Y  GV +   CG N +H V  VG+GT+ E  G KYWL KNSWGET
Sbjct: 243 PVSVSIDADGPGFMHYSGGVYDEPYCGTNVNHAVTFVGYGTSPE--GIKYWLAKNSWGET 300

Query: 319 WGESGYIRILRDA----GLCGIATAASYPVA 345
           WGE+GYIRI RD     G+CG+A  A YPVA
Sbjct: 301 WGENGYIRIRRDVAWPQGMCGVAQYAFYPVA 331


>gi|60100207|gb|AAX13273.1| putative cysteine protease [Oryza sativa Japonica Group]
          Length = 349

 Score =  335 bits (858), Expect = 2e-89,   Method: Compositional matrix adjust.
 Identities = 166/320 (51%), Positives = 215/320 (67%), Gaps = 16/320 (5%)

Query: 37  SIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNR-TYKLGTNEFSDLT 95
           ++ ++HE+WMA+HGR Y D+ EKA RL +F+ N+ +IE  N   ++  + L  N+F+DLT
Sbjct: 35  AMAQRHERWMAKHGRAYADDAEKARRLEVFRDNVAFIESVNAAASQHKFWLEENQFADLT 94

Query: 96  NEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT--DVPTSIDWREKGAVTHIKDQGQCG 153
           N EFRA  TG     PS SR +  P++F+Y NV+  D+P S+DWR KGAV  +KDQG CG
Sbjct: 95  NAEFRATRTGLR---PSSSRGNRAPTSFRYANVSTGDLPASVDWRGKGAVNPVKDQGDCG 151

Query: 154 SCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST--DNHGCSGGLMDKAFEYIIENKGL 211
            CWAFSAVAA+EG  ++  GKL+ LSEQQLV C    ++ GC GGLMD AF++II+N GL
Sbjct: 152 CCWAFSAVAAMEGAVKLATGKLVSLSEQQLVSCDVKGEDQGCEGGLMDDAFDFIIKNGGL 211

Query: 212 ATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRA 271
           A E+DYPY   +  C      A AATI  YED+P  DE ALL+AV+NQPVSV +D   R 
Sbjct: 212 AAESDYPYTASDDKCATAGAGAAAATIKGYEDVPANDEAALLKAVANQPVSVAIDGGDRH 271

Query: 272 FHFYKSGVLN--ADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILR 329
           F FYK GVL+  A C    DH +  VG+G A +  G KYWL+KNSWG +WGE GY+R+ R
Sbjct: 272 FQFYKGGVLSGAAGCATELDHAITAVGYGVASD--GTKYWLMKNSWGTSWGEDGYVRMER 329

Query: 330 DA----GLCGIATAASYPVA 345
                 G+CG+A  ASYP A
Sbjct: 330 GVADKEGVCGLAMMASYPTA 349


>gi|356543112|ref|XP_003540007.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like [Glycine
           max]
          Length = 345

 Score =  335 bits (858), Expect = 3e-89,   Method: Compositional matrix adjust.
 Identities = 162/324 (50%), Positives = 216/324 (66%), Gaps = 8/324 (2%)

Query: 28  VSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLG 87
           +  R + E    E+HE WMAQ+G+ YKD  EK  R  IFK N+ +IE  N  G++ + L 
Sbjct: 24  IMSRRLFEACTSERHENWMAQYGKVYKDAAEKKKRFQIFKNNVHFIESFNTAGDKPFNLS 83

Query: 88  TNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPST-FKYQNVTDVPTSIDWREKGAVTHI 146
            N+F+DL +EEF+AL T  N+ V SV   ++   T FKY  VT +  ++DWR++GAVT I
Sbjct: 84  INQFADLHDEEFKALLTNGNKKVRSVVGTATETETSFKYNRVTKLLATMDWRKRGAVTPI 143

Query: 147 KDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDC-STDNHGCSGGLMDKAFEYI 205
           KDQ +CGSCWAFSAVAA+EGI QIT  KL+ LSEQ+LVDC   ++ GC+GG M+ AFE++
Sbjct: 144 KDQRRCGSCWAFSAVAAIEGIHQITTSKLVSLSEQELVDCVKGESEGCNGGYMEDAFEFV 203

Query: 206 IENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCV 265
            +  G+A+E+ YPY+ ++ +C  +KE    + I  YE +P   E+AL +AV++QPVSV V
Sbjct: 204 AKKGGIASESYYPYKGKDKSCKVKKETHGVSQIKGYEKVPSNSEKALQKAVAHQPVSVYV 263

Query: 266 DASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYI 325
           +A G AF FY SG+    CG N DH + VVG+G  +   G KYWL+KNSWG  WGE GYI
Sbjct: 264 EAGGNAFQFYSSGIFTGKCGTNTDHAITVVGYG--KSRGGTKYWLVKNSWGAGWGEKGYI 321

Query: 326 RILRD----AGLCGIATAASYPVA 345
           R+ RD     GLCGIA  A YP A
Sbjct: 322 RMKRDIRAKEGLCGIAMNAFYPTA 345


>gi|537437|gb|AAC35211.1| cysteine proteinase [Hemerocallis hybrid cultivar]
          Length = 359

 Score =  334 bits (857), Expect = 3e-89,   Method: Compositional matrix adjust.
 Identities = 161/314 (51%), Positives = 213/314 (67%), Gaps = 8/314 (2%)

Query: 35  EPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDL 94
           E S+   +E+W A H  + +D  +   R N+FK+N+++I + N++ + TYKL  N+F D+
Sbjct: 34  EESLWSLYEKWRAHHAVS-RDLDDTDKRFNVFKENVKFIHEFNQKKDATYKLALNKFGDM 92

Query: 95  TNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGS 154
           TN+EFR+ Y G         R       F Y+   D+PTS+DWREKGAVT +KDQGQCGS
Sbjct: 93  TNQEFRSTYAGSKIDHHMTLRGVKDAGEFSYEKFHDLPTSVDWREKGAVTGVKDQGQCGS 152

Query: 155 CWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTDNHGCSGGLMDKAFEYIIENKGLATE 214
           CWAFS V AVEGI QI   +L+ LSEQQLVDC T N GC+GGLMD AF++I  N GL++E
Sbjct: 153 CWAFSTVVAVEGINQIKTNELVSLSEQQLVDCDTKNSGCNGGLMDYAFDFIKNNGGLSSE 212

Query: 215 ADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHF 274
             YPY  E+ +C ++   AV  TI  Y+D+P+ +E AL++AV+NQPVSV ++ASG AF F
Sbjct: 213 DSYPYLAEQKSCGSEANSAV-VTIDGYQDVPRNNEAALMKAVANQPVSVAIEASGYAFQF 271

Query: 275 YKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILR----D 330
           Y  GV +  CG   DHGVA VG+G   +++G KYW++KNSWGE WGESGYIR+ R     
Sbjct: 272 YSQGVFSGHCGTELDHGVAAVGYGV--DDDGKKYWIVKNSWGEGWGESGYIRMERGIKDK 329

Query: 331 AGLCGIATAASYPV 344
            G CGIA  ASYP+
Sbjct: 330 RGKCGIAMEASYPI 343


>gi|414587996|tpg|DAA38567.1| TPA: hypothetical protein ZEAMMB73_390779 [Zea mays]
          Length = 343

 Score =  334 bits (857), Expect = 4e-89,   Method: Compositional matrix adjust.
 Identities = 163/345 (47%), Positives = 230/345 (66%), Gaps = 23/345 (6%)

Query: 14  FVIIILVIT-CA----SQVVSGRSMHE-PSIVEKHEQWMAQHGRTYKDELEKAMRLNIFK 67
           F++++ ++T CA    S V++ R + +  ++ E+HE+WMA +GR YKD  EKA R  +FK
Sbjct: 7   FLLLLAILTGCACSFPSPVLAARELSDDAAMAERHERWMAVYGRVYKDAAEKARRFEVFK 66

Query: 68  QNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQN 127
            NL ++E  N +    + LG N+F+DLT EEF+A     N+    +S +    + FKY+N
Sbjct: 67  DNLAFVESFNADKKNKFWLGVNQFADLTTEEFKA-----NKGFKPISAEEVPTTGFKYEN 121

Query: 128 --VTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVD 185
             V+ +PT++DWR KGAVT IK+QGQCG CWAFSAVAA+EGI +++   L+ LSEQ+LVD
Sbjct: 122 LSVSALPTAVDWRTKGAVTPIKNQGQCGCCWAFSAVAAMEGIVKLSTDNLVSLSEQELVD 181

Query: 186 CSTD--NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYED 243
           C T   + GC GG MD AFE++I+N GLATE+ YPY+  +G C    +   AATI  +ED
Sbjct: 182 CDTHSMDEGCEGGWMDSAFEFVIKNGGLATESSYPYKAVDGKCKGGSKS--AATIKGHED 239

Query: 244 LPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEE 303
           +P  +E AL++AV++QPVSV VDAS R F  Y  GV+   CG   DHG+A +G+G   E 
Sbjct: 240 VPPNNEAALMKAVASQPVSVAVDASDRTFMLYSGGVMTGSCGTQLDHGIAAIGYGV--ES 297

Query: 304 NGAKYWLIKNSWGETWGESGYIRILRD----AGLCGIATAASYPV 344
           +G KYW++KNSWG TWGE  ++R+ +D     G+CG+A   SYP 
Sbjct: 298 DGTKYWILKNSWGTTWGEKRFLRMEKDISDKQGMCGLAMKPSYPT 342


>gi|357130141|ref|XP_003566711.1| PREDICTED: xylem cysteine proteinase 1-like [Brachypodium
           distachyon]
          Length = 457

 Score =  333 bits (855), Expect = 5e-89,   Method: Compositional matrix adjust.
 Identities = 169/315 (53%), Positives = 219/315 (69%), Gaps = 14/315 (4%)

Query: 38  IVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNE 97
           I+E  E+W+A+H + Y    EK  R  +FK NL++I+K N+E   +Y LG NEF+DLT+E
Sbjct: 146 IIELFEKWLAKHQKAYASFEEKLHRFEVFKDNLKHIDKVNREVT-SYWLGLNEFADLTHE 204

Query: 98  EFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT--DVPTSIDWREKGAVTHIKDQGQCGSC 155
           EF+A Y G   P P+   + SR S FKY++V+  D+P S+DWR KGAVT +K+QGQCGSC
Sbjct: 205 EFKATYLGLAPPAPA---RESRGS-FKYEDVSADDLPKSVDWRTKGAVTEVKNQGQCGSC 260

Query: 156 WAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGLATE 214
           WAFS VAAVEGI  I  G L  LSEQ+L+DCS D N+GC+GGLMD AF YI  + GL TE
Sbjct: 261 WAFSTVAAVEGINAIVTGNLTALSEQELIDCSVDGNNGCNGGLMDYAFSYIASSGGLHTE 320

Query: 215 ADYPYRHEEGTC-DNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFH 273
             YPY  EEG+C D +K ++ A TIS YED+P  +EQAL++A+++QPVSV ++ASGR F 
Sbjct: 321 EAYPYLMEEGSCGDGKKSESEAVTISGYEDVPAHNEQALIKALAHQPVSVAIEASGRHFQ 380

Query: 274 FYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRDA-- 331
           FY  GV +  CG   DHGVA VG+G+ ++  G  Y +++NSWG  WGE GYIR+ R    
Sbjct: 381 FYSGGVFDGPCGTQLDHGVAAVGYGS-DKGKGHDYIIVRNSWGAKWGEKGYIRMKRGTGK 439

Query: 332 --GLCGIATAASYPV 344
             GLCGI   ASYP 
Sbjct: 440 GEGLCGINKMASYPT 454


>gi|357452869|ref|XP_003596711.1| Cysteine proteinase [Medicago truncatula]
 gi|355485759|gb|AES66962.1| Cysteine proteinase [Medicago truncatula]
          Length = 344

 Score =  333 bits (855), Expect = 5e-89,   Method: Compositional matrix adjust.
 Identities = 165/344 (47%), Positives = 227/344 (65%), Gaps = 19/344 (5%)

Query: 10  IIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQN 69
           I+ +F I+ L     S V+S R      ++EKHEQWM +HG+ YKD  EK  R  IFK+N
Sbjct: 12  ILTLFFILTL---WTSLVISSR------LLEKHEQWMEEHGKFYKDAAEKEQRFQIFKEN 62

Query: 70  LEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALY-TGYNRPVPSVSRQS-SRPSTFKYQN 127
           LE+IE  N  G+  + L  N+F D TN+EF+A Y  G  +P+  V   +    S F+Y+N
Sbjct: 63  LEFIESFNAAGDNGFNLSINQFGDQTNDEFKANYLNGKKKPLIGVGIAAIEEESVFRYEN 122

Query: 128 VTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCS 187
           VT+VP ++DWRE+GAVT IK Q  CGSCWAF+ VAA+EGI QIT G+L+ LSEQ+LVDC 
Sbjct: 123 VTEVPATMDWRERGAVTPIKHQHLCGSCWAFATVAAIEGIHQITTGRLVSLSEQELVDCV 182

Query: 188 TDN--HGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLP 245
             N   GC+GG ++ A ++I++  G+ +E +YPY   +G C+ +K     A I  YE +P
Sbjct: 183 KTNTTDGCNGGYVEDACDFIVKKGGITSETNYPYTRVDGKCNVRKGTYNVAKIKGYEHVP 242

Query: 246 KGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENG 305
             +E+ALL+AV+NQP++V + A+ RAF FY SG+L   CG + DH V +VG+GT+++  G
Sbjct: 243 ANNEKALLKAVANQPIAVYIAATKRAFQFYSSGILKGKCGIDLDHTVTIVGYGTSDD--G 300

Query: 306 AKYWLIKNSWGETWGESGYIRILRDA----GLCGIATAASYPVA 345
            KYWL+KNSWG  WGE GYI+I RD     G CGIA   +YP+ 
Sbjct: 301 VKYWLVKNSWGTKWGEKGYIKIKRDVHAKEGSCGIAMVPTYPIV 344


>gi|357160569|ref|XP_003578807.1| PREDICTED: vignain-like [Brachypodium distachyon]
          Length = 339

 Score =  333 bits (855), Expect = 6e-89,   Method: Compositional matrix adjust.
 Identities = 165/341 (48%), Positives = 226/341 (66%), Gaps = 19/341 (5%)

Query: 15  VIIILVITC-ASQVVSGRSMHEP-SIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEY 72
           ++ IL   C  S V++ R +++  S+V +HE WM Q+GR YKD  EKA +  +FK N  +
Sbjct: 8   LLAILGCLCFCSSVLAARELNDDLSMVARHESWMLQYGRVYKDAAEKASKFEVFKANAGF 67

Query: 73  IEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT--D 130
           I+  N  GN  + LG N+F+D+TN+EF+A  T  N+    +S +   P+ F Y+NV+   
Sbjct: 68  IDSFNA-GNHKFWLGINQFADITNKEFKATKT--NKGF--ISNKVRAPTGFSYENVSFDA 122

Query: 131 VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST-- 188
           +P SIDWR KGAVT +KDQGQCG CWAFSAVAA EGI +++ GKL+ LSEQ+LVDC    
Sbjct: 123 LPASIDWRTKGAVTPVKDQGQCGCCWAFSAVAATEGIVKLSTGKLVSLSEQELVDCDVHG 182

Query: 189 DNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGD 248
           ++ GC GGLMD AF++II N GL  E+ YPY  E+G C +  +   A TI  YED+P  +
Sbjct: 183 EDQGCEGGLMDDAFKFIISNGGLTQESSYPYDAEDGKCKSGSKS--AGTIKSYEDVPANN 240

Query: 249 EQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKY 308
           E AL++AV+NQPVSV VD     F FY  GV+   CG + DHG+A +G+G   +  G KY
Sbjct: 241 EGALMKAVANQPVSVAVDGGDMTFQFYSGGVMTGSCGTDLDHGIAAIGYGVTSD--GTKY 298

Query: 309 WLIKNSWGETWGESGYIRILRD----AGLCGIATAASYPVA 345
           WL+KNSWG +WGE+G++R+ +D     G+CG+A   SYP A
Sbjct: 299 WLMKNSWGTSWGENGFLRMEKDIADKKGMCGLAMEPSYPTA 339


>gi|224065647|ref|XP_002301901.1| predicted protein [Populus trichocarpa]
 gi|222843627|gb|EEE81174.1| predicted protein [Populus trichocarpa]
          Length = 336

 Score =  333 bits (854), Expect = 7e-89,   Method: Compositional matrix adjust.
 Identities = 166/312 (53%), Positives = 217/312 (69%), Gaps = 12/312 (3%)

Query: 38  IVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNE 97
           I++  E W+++HG+ Y+   EK +R  IFK NL +I++ NK+    Y LG NEFSDL++E
Sbjct: 29  IIDLFESWISKHGKIYESIEEKWLRFEIFKDNLFHIDETNKK-VVNYWLGLNEFSDLSHE 87

Query: 98  EFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWA 157
           EF+  Y G    + S  R+ S+   F Y++V  +P S+DWR+KGAVT +K+QG CGSCWA
Sbjct: 88  EFKNKYLGLKVDM-SERRECSQE--FNYKDVMSIPKSVDWRKKGAVTDVKNQGSCGSCWA 144

Query: 158 FSAVAAVEGITQITRGKLIELSEQQLVDC-STDNHGCSGGLMDKAFEYIIENKGLATEAD 216
           FS VAAVEGI QI  G L  LSEQ+LVDC +T+N+GC+GGLMD AF YII N GL  E D
Sbjct: 145 FSTVAAVEGINQIVTGNLTSLSEQELVDCDTTNNYGCNGGLMDYAFSYIISNGGLHKEVD 204

Query: 217 YPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYK 276
           YPY  EEGTC+ +KE++   TIS Y D+P+  E++LL+A++NQP+SV ++ASGR F FY 
Sbjct: 205 YPYIMEEGTCEMRKEESEVVTISGYHDVPQNSEESLLKALANQPLSVAIEASGRDFQFYS 264

Query: 277 SGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD----AG 332
            GV +  CG   DHGVA VG+G+    NG  Y ++KNSWG  WGE GYIR+ R+    AG
Sbjct: 265 GGVFDGHCGTQLDHGVAAVGYGST---NGLDYIIVKNSWGSKWGEKGYIRMKRNTGKPAG 321

Query: 333 LCGIATAASYPV 344
           LCGI   ASYP 
Sbjct: 322 LCGINKMASYPT 333


>gi|38346007|emb|CAD40110.2| OSJNBa0035O13.9 [Oryza sativa Japonica Group]
 gi|125589429|gb|EAZ29779.1| hypothetical protein OsJ_13837 [Oryza sativa Japonica Group]
          Length = 314

 Score =  333 bits (853), Expect = 9e-89,   Method: Compositional matrix adjust.
 Identities = 166/319 (52%), Positives = 214/319 (67%), Gaps = 16/319 (5%)

Query: 38  IVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNR-TYKLGTNEFSDLTN 96
           + ++HE+WMA+HGR Y D+ EKA RL +F+ N+ +IE  N   ++  + L  N+F+DLTN
Sbjct: 1   MAQRHERWMAKHGRAYADDAEKARRLEVFRDNVAFIESVNAAASQHKFWLEENQFADLTN 60

Query: 97  EEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT--DVPTSIDWREKGAVTHIKDQGQCGS 154
            EFRA  TG     PS SR +  P++F+Y NV+  D+P S+DWR KGAV  +KDQG CG 
Sbjct: 61  AEFRATRTGLR---PSSSRGNRAPTSFRYANVSTGDLPASVDWRGKGAVNPVKDQGDCGC 117

Query: 155 CWAFSAVAAVEGITQITRGKLIELSEQQLVDCST--DNHGCSGGLMDKAFEYIIENKGLA 212
           CWAFSAVAA+EG  ++  GKL+ LSEQQLV C    ++ GC GGLMD AF++II+N GLA
Sbjct: 118 CWAFSAVAAMEGAVKLATGKLVSLSEQQLVSCDVKGEDQGCEGGLMDDAFDFIIKNGGLA 177

Query: 213 TEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAF 272
            E+DYPY   +  C      A AATI  YED+P  DE ALL+AV+NQPVSV +D   R F
Sbjct: 178 AESDYPYTASDDKCATAGAGAAAATIKGYEDVPANDEAALLKAVANQPVSVAIDGGDRHF 237

Query: 273 HFYKSGVLN--ADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD 330
            FYK GVL+  A C    DH +  VG+G A +  G KYWL+KNSWG +WGE GY+R+ R 
Sbjct: 238 QFYKGGVLSGAAGCATELDHAITAVGYGVASD--GTKYWLMKNSWGTSWGEDGYVRMERG 295

Query: 331 A----GLCGIATAASYPVA 345
                G+CG+A  ASYP A
Sbjct: 296 VADKEGVCGLAMMASYPTA 314


>gi|374713651|gb|AEZ65083.1| cysteine protease [Carica papaya]
          Length = 467

 Score =  332 bits (852), Expect = 1e-88,   Method: Compositional matrix adjust.
 Identities = 169/351 (48%), Positives = 224/351 (63%), Gaps = 21/351 (5%)

Query: 11  IPMFVIIILVITCA--SQVVSGRSMH--------EPSIVEKHEQWMAQHGRTYKDELEKA 60
           + +F+++I   + A    +VS    H        +  ++  +E W+ +HG+ Y    EK 
Sbjct: 8   LSLFLLMIFTASSAVDMSIVSYDQRHADKSSWRTDDEVMAMYEAWLVKHGKAYNALGEKE 67

Query: 61  MRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRP 120
            R  IFK NL +I++ N + N TY+LG N F+DLTNEE+R++Y G       V+R+ SR 
Sbjct: 68  KRFGIFKDNLRFIDEHNSQ-NLTYRLGLNRFADLTNEEYRSMYLGVKPGATRVTRKVSRK 126

Query: 121 STFKYQNVTD-VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELS 179
           S      V D +P  IDWR++GAV  +KDQG CGSCWAFS +AAVEGI QI  G LI LS
Sbjct: 127 SDRFAARVGDALPDFIDWRKEGAVVGVKDQGSCGSCWAFSTIAAVEGINQIVTGDLISLS 186

Query: 180 EQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATI 238
           EQ+LVDC T  N GC+GGLMD AFE+II N G+ +E DYPYR  +  CD  ++ A   +I
Sbjct: 187 EQELVDCDTSYNEGCNGGLMDYAFEFIINNGGIDSEEDYPYRAADQKCDQYRKNANVVSI 246

Query: 239 SKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFG 298
             YED+P+ DE AL +AV+ QPVSV ++A GRAF  Y+SGV    CG + DHGVA VG+G
Sbjct: 247 DGYEDVPENDEAALKKAVAKQPVSVAIEAGGRAFQLYQSGVFTGKCGTSLDHGVAAVGYG 306

Query: 299 TAEEENGAKYWLIKNSWGETWGESGYIRILRD-----AGLCGIATAASYPV 344
           T   ENG  YW++ NSWG+ WGE GYIR+ R+     +G CGIA   SYP+
Sbjct: 307 T---ENGQDYWIVGNSWGKNWGEDGYIRMERNLAGSSSGKCGIAIGPSYPI 354


>gi|146215996|gb|ABQ10200.1| cysteine protease Cp2 [Actinidia deliciosa]
          Length = 376

 Score =  332 bits (851), Expect = 2e-88,   Method: Compositional matrix adjust.
 Identities = 160/310 (51%), Positives = 216/310 (69%), Gaps = 11/310 (3%)

Query: 42  HEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRA 101
           + +W+A+HG+ Y    E+  R  IFK NL+++++ N E NR+YK+G N F+DLTNEE+R+
Sbjct: 47  YAEWLAKHGKAYNGIGERERRFEIFKDNLKFVDEHNSE-NRSYKVGLNRFADLTNEEYRS 105

Query: 102 LYTGYNRPVPSVSRQSSRPST-FKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSA 160
           ++ G          +S   S  +  Q+   +P S+DWRE GAV  IKDQG CGSCWAFS 
Sbjct: 106 MFLGTKTDSKRRFMKSKSASRRYAVQDSDMLPESVDWRESGAVAPIKDQGSCGSCWAFST 165

Query: 161 VAAVEGITQITRGKLIELSEQQLVDCS-TDNHGCSGGLMDKAFEYIIENKGLATEADYPY 219
           VAAVEG+ QI  G++I+LSEQ+LVDC  T + GC+GGLMD AFE+II N G+ TE DYPY
Sbjct: 166 VAAVEGVNQIATGEMIQLSEQELVDCDRTYDAGCNGGLMDYAFEFIINNGGIDTEEDYPY 225

Query: 220 RHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGV 279
           R  +GTCD +++     +I+ YED+P  DE AL +AV++QPVSV ++ASGRAF  Y SGV
Sbjct: 226 RGVDGTCDPERKNTKVVSINDYEDVPPYDEMALKKAVAHQPVSVAIEASGRAFQLYLSGV 285

Query: 280 LNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD-----AGLC 334
              +CG   DHGV VVG+GT   +NGA +W+++NSWG +WGE+GYIR+ R+      G C
Sbjct: 286 FTGECGRALDHGVVVVGYGT---DNGADHWIVRNSWGTSWGENGYIRMERNVVDNFGGKC 342

Query: 335 GIATAASYPV 344
           GIA  ASYP+
Sbjct: 343 GIAMQASYPI 352


>gi|449448298|ref|XP_004141903.1| PREDICTED: germination-specific cysteine protease 1-like [Cucumis
           sativus]
 gi|449531757|ref|XP_004172852.1| PREDICTED: germination-specific cysteine protease 1-like [Cucumis
           sativus]
          Length = 365

 Score =  332 bits (851), Expect = 2e-88,   Method: Compositional matrix adjust.
 Identities = 163/340 (47%), Positives = 226/340 (66%), Gaps = 15/340 (4%)

Query: 13  MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEY 72
           + ++    ++ ++  +S RS  E  + E ++ W+A+HG+ Y    E+  R  IFK+NL++
Sbjct: 8   LALLSFFFLSISASALSRRSDGE--VREIYDLWLAKHGKAYNGIDEREKRFQIFKENLKF 65

Query: 73  IEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKY--QNVTD 130
           I+  N E NRTYK+G N F+DLTNEE+RALY G   P P+     ++ ++ +Y   N+  
Sbjct: 66  IDDHNSE-NRTYKVGLNMFADLTNEEYRALYLGTRSP-PARRVMKAKTASRRYAVNNLDR 123

Query: 131 VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD- 189
           +P S+DWR +GAV  +K+QG CGSCWAFS +AAVEGI QI  G+LI LSEQ+LV C    
Sbjct: 124 LPESMDWRTRGAVAPVKNQGSCGSCWAFSTIAAVEGINQIVTGELISLSEQELVSCDKKY 183

Query: 190 NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDE 249
           N GC+GGLMD AF++II+N GL TE DYPY   +G CD  ++ A   +I  YED+P  DE
Sbjct: 184 NSGCNGGLMDYAFQFIIDNGGLDTEEDYPYEAFDGQCDPTRKNAKVVSIDAYEDVPANDE 243

Query: 250 QALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYW 309
           ++L +AV++QPVSV ++ASG A   Y+SGV    CG+  DHGV  VG+G   +ENG  YW
Sbjct: 244 ESLKKAVAHQPVSVAIEASGLALQLYQSGVFTGKCGSALDHGVVAVGYG---KENGVDYW 300

Query: 310 LIKNSWGETWGESGYIRILRDA-----GLCGIATAASYPV 344
           L++NSWG +WGE GY ++ R+      G CGIA  ASYPV
Sbjct: 301 LVRNSWGTSWGEDGYFKLERNVKHITEGKCGIAMQASYPV 340


>gi|224133760|ref|XP_002321654.1| predicted protein [Populus trichocarpa]
 gi|222868650|gb|EEF05781.1| predicted protein [Populus trichocarpa]
          Length = 362

 Score =  332 bits (851), Expect = 2e-88,   Method: Compositional matrix adjust.
 Identities = 167/349 (47%), Positives = 229/349 (65%), Gaps = 12/349 (3%)

Query: 3   LKFEKSFIIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDEL-EKAM 61
           ++ +K   + + + ++L IT +          E S+ + +E+W + H  T    L EK  
Sbjct: 1   MEMKKFLFVALSLALVLGITESLDFHEKDLESEESLWDLYERWRSHH--TVSTSLDEKHK 58

Query: 62  RLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPS 121
           R N+FK+N+ ++ K NK G + YKL  N+F+D+TN EFR++Y G       + R ++R +
Sbjct: 59  RFNVFKENVMHVHKTNKMG-KPYKLKLNKFADMTNHEFRSVYAGSKVKHHRMFRGTTRGN 117

Query: 122 -TFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSE 180
            +F Y  V  VPTS+DWR+KGAVT +KDQGQCGSCWAFS + AVEGI  I   +L+ LSE
Sbjct: 118 GSFMYGKVEKVPTSVDWRKKGAVTAVKDQGQCGSCWAFSTIVAVEGINYIKTNELVSLSE 177

Query: 181 QQLVDC-STDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATIS 239
           Q+LVDC +T+N GC+GGLM+ AFE+I + +G+ TE+ YPY+ E+G CD  KE   A +I 
Sbjct: 178 QELVDCDTTENQGCNGGLMEYAFEFIKKKRGITTESTYPYKAEDGHCDAAKENNPAVSID 237

Query: 240 KYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGT 299
            YE +P+ DE ALL+A +NQPVSV +DA G  F FY  GV   +CG   DHGVAVVG+GT
Sbjct: 238 GYEKVPENDEDALLKAAANQPVSVAIDAGGSDFQFYSEGVFIGECGTELDHGVAVVGYGT 297

Query: 300 AEEENGAKYWLIKNSWGETWGESGYIRILR----DAGLCGIATAASYPV 344
             +  G KYW+++NSWG  WGE GYIR+ R      GLCGIA  ASYP+
Sbjct: 298 TLD--GTKYWIVRNSWGPEWGEKGYIRMQRGISDKEGLCGIAMEASYPI 344


>gi|225428879|ref|XP_002285299.1| PREDICTED: cysteine proteinase RD21a-like [Vitis vinifera]
          Length = 469

 Score =  332 bits (851), Expect = 2e-88,   Method: Compositional matrix adjust.
 Identities = 160/313 (51%), Positives = 207/313 (66%), Gaps = 10/313 (3%)

Query: 38  IVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNE 97
           ++  +E W+ +HG++Y    E+  R  IFK NL +IE+ N   NRTYK+G N F+DLTNE
Sbjct: 50  VMAVYEAWLVKHGKSYNALGERERRFEIFKDNLRFIEEHNAV-NRTYKVGLNRFADLTNE 108

Query: 98  EFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWA 157
           E+R+ Y G         R S     + ++   D+P S+DWREKGAV  +KDQG CGSCWA
Sbjct: 109 EYRSRYLGRRDETRRGLRASRVSDRYSFRAGEDLPESVDWREKGAVVPVKDQGNCGSCWA 168

Query: 158 FSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGLATEAD 216
           FS +AAVEGI QI  G LI LSEQ+LVDC    N GC+GGLMD AFE+II N G+ +E D
Sbjct: 169 FSTIAAVEGINQIATGDLISLSEQELVDCDKSYNQGCNGGLMDYAFEFIINNGGIDSEED 228

Query: 217 YPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYK 276
           YPYR  + TCD  ++ A   +I  YED+P+ DE++L +AV+NQPVSV ++A GRAF  Y+
Sbjct: 229 YPYRAADTTCDPNRKNARVVSIDGYEDVPQNDERSLKKAVANQPVSVAIEAGGRAFQLYQ 288

Query: 277 SGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILR-----DA 331
           SGV    CG   DHGV  VG+GT   EN   YW+++NSWG  WGESGYI++ R     + 
Sbjct: 289 SGVFTGQCGTQLDHGVVAVGYGT---ENSVDYWIVRNSWGPNWGESGYIKLERNLAGTET 345

Query: 332 GLCGIATAASYPV 344
           G CGIA   SYP+
Sbjct: 346 GKCGIAIEPSYPI 358


>gi|62526575|gb|AAX84673.1| cysteine protease CP1 [Manihot esculenta]
          Length = 467

 Score =  332 bits (850), Expect = 2e-88,   Method: Compositional matrix adjust.
 Identities = 167/351 (47%), Positives = 224/351 (63%), Gaps = 26/351 (7%)

Query: 13  MFVIIILVITCAS----QVVSGRSMH--------EPSIVEKHEQWMAQHGRTYKDELEKA 60
           MFV++ L  T +S     ++S    H        +  ++  +E+W+ + G+ Y    E+ 
Sbjct: 11  MFVLLFLSFTLSSASDMSIISYDQTHATKSSWRTDDEVMAIYEEWLVKQGKVYNALGERE 70

Query: 61  MRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRP 120
            R  +FK NL +I++ N E NRTYKLG N F+DLTNEE+R+ Y G       + R   R 
Sbjct: 71  KRFQVFKDNLRFIDEHNSE-NRTYKLGLNGFADLTNEEYRSTYLG---ARGGMKRNRLRK 126

Query: 121 STFKYQNVT--DVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIEL 178
           ++ +Y       +P S+DWR++GAV  +KDQG CGSCWAFS +AAVEGI +I  G LI L
Sbjct: 127 TSDRYAPRVGESLPDSVDWRKEGAVAEVKDQGSCGSCWAFSTIAAVEGINKIVTGDLISL 186

Query: 179 SEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAAT 237
           SEQ+LVDC T  N GC+GGLMD AFE+II N G+ TE DYPY   +G CD  ++ A   T
Sbjct: 187 SEQELVDCDTSYNEGCNGGLMDYAFEFIINNGGIDTEEDYPYLARDGRCDTYRKNAKVVT 246

Query: 238 ISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGF 297
           I  YED+P   E AL +AV+NQPVSV ++A GR F FY SG+ +  CG   DHGVA VG+
Sbjct: 247 IDDYEDVPVNSETALQKAVANQPVSVAIEAGGRDFQFYASGIFSGRCGTQLDHGVAAVGY 306

Query: 298 GTAEEENGAKYWLIKNSWGETWGESGYIRILRD----AGLCGIATAASYPV 344
           GT   ENG  YW+++NSWG++WGE+GY+R+ R      G+CGIA  ASYP+
Sbjct: 307 GT---ENGKDYWIVRNSWGKSWGENGYLRMARSINSPTGICGIAMEASYPI 354


>gi|359491865|ref|XP_002273243.2| PREDICTED: xylem cysteine proteinase 1-like [Vitis vinifera]
          Length = 351

 Score =  332 bits (850), Expect = 2e-88,   Method: Compositional matrix adjust.
 Identities = 157/312 (50%), Positives = 212/312 (67%), Gaps = 12/312 (3%)

Query: 38  IVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNE 97
           + +  E WM++HG++Y+   EK  R  +F+ NL++I++ NK+ + +Y LG NEF+DL++E
Sbjct: 44  LTDLFESWMSKHGKSYRSFEEKLHRFEVFQDNLKHIDETNKKVS-SYWLGLNEFADLSHE 102

Query: 98  EFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWA 157
           EF+  Y G    +P   ++   P  F Y++V D+P S+DWR+KGAV H+K+QG CGSCWA
Sbjct: 103 EFKRKYLGLKIELP---KRRDSPEEFSYKDVADLPKSVDWRKKGAVAHVKNQGACGSCWA 159

Query: 158 FSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGLATEAD 216
           FS VAAVEGI QI  G L  LSEQ+L+DC    N+GC+GGLMD AF +II N GL  E D
Sbjct: 160 FSTVAAVEGINQIVTGNLTALSEQELIDCDKPFNNGCNGGLMDYAFAFIISNGGLRKEED 219

Query: 217 YPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYK 276
           YPY  EEGTC  +KE+    TIS Y D+P+ +EQ+ L+A++NQP+SV ++AS R F FY 
Sbjct: 220 YPYVMEEGTCGEKKEELEVVTISGYHDVPEDNEQSFLKALANQPLSVAIEASSRGFQFYS 279

Query: 277 SGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRDA----G 332
            G+ N  CG   DHGVA VG+GT++   G  Y  +KNSWG  WGE GYIR+ R+     G
Sbjct: 280 GGIFNGHCGTELDHGVAAVGYGTSK---GVDYITVKNSWGSKWGEKGYIRMKRNVGKPEG 336

Query: 333 LCGIATAASYPV 344
           +CGI   ASYP 
Sbjct: 337 ICGIYKMASYPT 348


>gi|224103643|ref|XP_002313136.1| predicted protein [Populus trichocarpa]
 gi|222849544|gb|EEE87091.1| predicted protein [Populus trichocarpa]
          Length = 477

 Score =  332 bits (850), Expect = 2e-88,   Method: Compositional matrix adjust.
 Identities = 158/316 (50%), Positives = 209/316 (66%), Gaps = 9/316 (2%)

Query: 35  EPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDL 94
           E   +  +E W+ ++G+ Y    EK  R  IFK NL+++++ N  GN +YKLG N+F+DL
Sbjct: 42  EAETLRLYEMWLVKYGKAYNALGEKERRFEIFKDNLKFVDQHNSVGNPSYKLGLNKFADL 101

Query: 95  TNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGS 154
           +NEE+RA Y G             + + + +++  D+P S+DWREKGAV  +KDQGQCGS
Sbjct: 102 SNEEYRAAYLGTRMDGKRRLLGGPKSARYLFKDGDDLPESVDWREKGAVAPVKDQGQCGS 161

Query: 155 CWAFSAVAAVEGITQITRGKLIELSEQQLVDCS-TDNHGCSGGLMDKAFEYIIENKGLAT 213
           CWAFS V AVEGI QI  G L  LSEQ+LVDC    N GC+GGLMD AFE+I++N G+ T
Sbjct: 162 CWAFSTVGAVEGINQIVTGNLTSLSEQELVDCDKVYNQGCNGGLMDYAFEFIMKNGGIDT 221

Query: 214 EADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFH 273
           E DYPY+  +  CD  ++ A   TI  YED+P+ DE++L +AV+NQPVSV ++A GRAF 
Sbjct: 222 EEDYPYKAVDSMCDPNRKNARVVTIDGYEDVPQNDEKSLRKAVANQPVSVAIEAGGRAFQ 281

Query: 274 FYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD--- 330
            Y+SGV    CG   DHGV  VG+GT   ENG  YW+++NSWG  WGE+GYIR+ R+   
Sbjct: 282 LYQSGVFTGSCGTQLDHGVVAVGYGT---ENGVDYWVVRNSWGPAWGENGYIRMERNVAS 338

Query: 331 --AGLCGIATAASYPV 344
              G CGIA  ASYP 
Sbjct: 339 TETGKCGIAMEASYPT 354


>gi|226508570|ref|NP_001141984.1| uncharacterized protein LOC100274134 precursor [Zea mays]
 gi|194706676|gb|ACF87422.1| unknown [Zea mays]
 gi|413920745|gb|AFW60677.1| vignain [Zea mays]
          Length = 363

 Score =  331 bits (849), Expect = 2e-88,   Method: Compositional matrix adjust.
 Identities = 158/317 (49%), Positives = 218/317 (68%), Gaps = 13/317 (4%)

Query: 35  EPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDL 94
           E  ++ ++++WMAQ+ R YKD+ EKA R  +FK N E+I+++N  G + Y LGTN+F+DL
Sbjct: 52  EAMMMARYKKWMAQYRRKYKDDAEKAHRFQVFKANAEFIDRSNAGGKKKYVLGTNQFADL 111

Query: 95  TNEEFRALYTGYNRP--VPSVSRQSSRPSTFKYQNVT--DVPTSIDWREKGAVTHIKDQG 150
           T++EF A+YTG  +P  VPS ++Q     + KYQN T  D    +DWR++GAVT +K+QG
Sbjct: 112 TSKEFAAMYTGLRKPAAVPSGAKQIPAAGS-KYQNFTRLDDDVQVDWRQQGAVTPVKNQG 170

Query: 151 QCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDC--STDNHGCSGGLMDKAFEYIIEN 208
           QCG CWAFSAV A+EG+  IT G L+ LSEQQ++DC  S  N GC+GG MD AF+Y+I N
Sbjct: 171 QCGCCWAFSAVGAMEGLIMITTGNLVSLSEQQILDCDESDGNQGCNGGYMDNAFQYVINN 230

Query: 209 KGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDAS 268
            G+ TE  YPY   +GTC N +    AATIS ++DLP GDE AL  AV+NQPVSV VD  
Sbjct: 231 GGVTTEDAYPYSAVQGTCQNVQP---AATISGFQDLPSGDENALANAVANQPVSVGVDGG 287

Query: 269 GRAFHFYKSGVLNAD-CGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRI 327
              F FY+ G+ + D CG + +H V  +G+G   ++ G +YW++KNSWG  WGE+G++++
Sbjct: 288 SSPFQFYQGGIYDGDGCGTDMNHAVTAIGYGA--DDQGTQYWILKNSWGTGWGENGFMQL 345

Query: 328 LRDAGLCGIATAASYPV 344
               G CGI+T ASYP 
Sbjct: 346 QMGVGACGISTMASYPT 362


>gi|449500145|ref|XP_004161017.1| PREDICTED: xylem cysteine proteinase 1-like [Cucumis sativus]
          Length = 349

 Score =  331 bits (849), Expect = 3e-88,   Method: Compositional matrix adjust.
 Identities = 170/345 (49%), Positives = 222/345 (64%), Gaps = 16/345 (4%)

Query: 8   SFIIPMFVIIILVITCASQVVSGRSMHEPSI---VEKHEQWMAQHGRTYKDELEKAMRLN 64
           + I+   + I   I     +V     H  S+   +E  E WM++H +TY+   EK  R  
Sbjct: 10  TLILSATLFITYAIAHDFSIVGYSPEHLASMDKTIELFESWMSKHSKTYRSIEEKLHRFE 69

Query: 65  IFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFK 124
           IF  NL++I++ NK+ + +Y LG NEF+DL++EEF++ Y G     P   ++SSR   F 
Sbjct: 70  IFLDNLKHIDETNKKVS-SYWLGLNEFADLSHEEFKSKYLGLRVEFPR--KRSSR--GFS 124

Query: 125 YQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLV 184
           Y +V D+P S+DWR KGAVT +K+QG CGSCWAFS VAAVEGI QI  G L  LSEQ+L+
Sbjct: 125 YGDVEDLPESVDWRTKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVTGNLTSLSEQELI 184

Query: 185 DCSTD-NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYED 243
           DC    N+GC GGLMD AF+YI+ N GL  E DYPY  EEG C  +KE+    TIS YED
Sbjct: 185 DCDRSFNNGCYGGLMDYAFQYIMSNSGLRKEEDYPYLMEEGRCIREKEQFEVVTISGYED 244

Query: 244 LPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEE 303
           +P  DEQ+LL+A+S+QPVSV ++AS R F FYK G+    CG   DHGV  VG+G++E  
Sbjct: 245 VPANDEQSLLKALSHQPVSVAIEASSRNFQFYKGGIFTGRCGTQMDHGVTAVGYGSSE-- 302

Query: 304 NGAKYWLIKNSWGETWGESGYIRILRDA----GLCGIATAASYPV 344
            G  Y ++KNSWG  WGE+GYIR+ R+     GLCGI   ASYP 
Sbjct: 303 -GTDYIIVKNSWGPKWGENGYIRMKRNTGKPEGLCGINQMASYPT 346


>gi|125547258|gb|EAY93080.1| hypothetical protein OsI_14881 [Oryza sativa Indica Group]
          Length = 314

 Score =  331 bits (849), Expect = 3e-88,   Method: Compositional matrix adjust.
 Identities = 165/319 (51%), Positives = 213/319 (66%), Gaps = 16/319 (5%)

Query: 38  IVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNR-TYKLGTNEFSDLTN 96
           + ++HE+WMA+HGR Y D+ EK  RL +F+ N+ +IE  N   ++  + L  N+F+DLTN
Sbjct: 1   MAQRHERWMAKHGRAYADDAEKVRRLEVFRDNVAFIESVNAAASQHKFWLEENQFADLTN 60

Query: 97  EEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT--DVPTSIDWREKGAVTHIKDQGQCGS 154
            EFRA  TG     PS SR +  P++F+Y NV+  D+P S+DWR KGAV  +KDQG CG 
Sbjct: 61  AEFRATRTGLR---PSSSRGNRAPTSFRYANVSTGDLPASVDWRGKGAVNPVKDQGDCGC 117

Query: 155 CWAFSAVAAVEGITQITRGKLIELSEQQLVDCST--DNHGCSGGLMDKAFEYIIENKGLA 212
           CWAFSAVAA+EG  ++  GKL+ LSEQQLV C    ++ GC GGLMD AF++II+N GLA
Sbjct: 118 CWAFSAVAAMEGAVKLATGKLVSLSEQQLVSCDVKGEDQGCEGGLMDDAFDFIIKNGGLA 177

Query: 213 TEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAF 272
            E+DYPY   +  C      A AATI  YED+P  DE ALL+AV+NQPVSV +D   R F
Sbjct: 178 AESDYPYTASDDKCATAGAGAAAATIKGYEDVPANDEAALLKAVANQPVSVAIDGGDRHF 237

Query: 273 HFYKSGVLN--ADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD 330
            FYK GVL+  A C    DH +  VG+G A +  G KYWL+KNSWG +WGE GY+R+ R 
Sbjct: 238 QFYKGGVLSGAAGCATELDHAITAVGYGVASD--GTKYWLMKNSWGTSWGEDGYVRMERG 295

Query: 331 A----GLCGIATAASYPVA 345
                G+CG+A  ASYP A
Sbjct: 296 VADKEGVCGLAMMASYPTA 314


>gi|224056176|ref|XP_002298740.1| predicted protein [Populus trichocarpa]
 gi|222845998|gb|EEE83545.1| predicted protein [Populus trichocarpa]
          Length = 455

 Score =  331 bits (849), Expect = 3e-88,   Method: Compositional matrix adjust.
 Identities = 158/309 (51%), Positives = 206/309 (66%), Gaps = 9/309 (2%)

Query: 42  HEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRA 101
           +E W+ +HGR Y    EK  R  IFK NL++I++ N  GN +YKLG N+F+DL+N+E+R+
Sbjct: 25  YEMWLVKHGRAYNALGEKERRFEIFKDNLKFIDEHNSVGNPSYKLGLNKFADLSNDEYRS 84

Query: 102 LYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAV 161
           +Y G             +   + ++   D+P ++DWREKGAV  +KDQGQCGSCWAFS V
Sbjct: 85  VYLGTRMDGKGRLLGGPKSERYLFKEGDDLPETVDWREKGAVAPVKDQGQCGSCWAFSTV 144

Query: 162 AAVEGITQITRGKLIELSEQQLVDCS-TDNHGCSGGLMDKAFEYIIENKGLATEADYPYR 220
            AVEGI QI  G L  LSEQ+LVDC  T N GC+GGLMD AF++IIEN G+ TE DYPY+
Sbjct: 145 GAVEGINQIVTGNLTSLSEQELVDCDKTYNLGCNGGLMDYAFDFIIENGGIDTEEDYPYK 204

Query: 221 HEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVL 280
             +  CD  ++ A   TI  YED+P+ DE++L +AV+NQPVSV ++A GR F  Y+SGV 
Sbjct: 205 AIDSMCDPNRKNARVVTIDGYEDVPQNDEKSLKKAVANQPVSVAIEAGGRGFQLYQSGVF 264

Query: 281 NADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD-----AGLCG 335
              CG   DHGV  VG+GT   E+G  YW+++NSWG  WGE+GYIR+ RD      G CG
Sbjct: 265 TGSCGTQLDHGVVTVGYGT---EHGVDYWIVRNSWGPAWGENGYIRMERDVASTETGKCG 321

Query: 336 IATAASYPV 344
           IA  ASYP 
Sbjct: 322 IAMEASYPT 330


>gi|413917937|gb|AFW57869.1| hypothetical protein ZEAMMB73_830006 [Zea mays]
          Length = 443

 Score =  331 bits (848), Expect = 3e-88,   Method: Compositional matrix adjust.
 Identities = 163/341 (47%), Positives = 225/341 (65%), Gaps = 19/341 (5%)

Query: 1   MVLKFEKSFIIPMFVIIILVITCA-SQVVSGRSM--HEPSIVEKHEQWMAQHGRTYKDEL 57
           M   +  +F++    + ++   CA S  ++ R +   + ++V +HE+WMA++ R Y D  
Sbjct: 1   MATHYSSAFVL----LSVVAWACALSGSLAARDLADQDQAMVARHEEWMAKYDRVYSDAA 56

Query: 58  EKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQS 117
           EKA R  +FK N+  IE  N  GN  + L  N F+DLT++EFRA +TGY RP  + +   
Sbjct: 57  EKARRFEVFKANMALIESVNA-GNHKFWLEANRFADLTDDEFRATWTGY-RPKTAAASSK 114

Query: 118 SRPST----FKYQNVT--DVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQIT 171
            R  T    FKY NV+  DVP S+DWR KGAVT IK+QG+CG CWAFSAVA++EG+ +++
Sbjct: 115 GRSRTATTGFKYANVSLDDVPASVDWRTKGAVTPIKNQGECGCCWAFSAVASMEGVVKLS 174

Query: 172 RGKLIELSEQQLVDCSTD--NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQ 229
            GKL+ LSEQ+LVDC  +  + GC GG MD AF++I+ N GL TE+ YPY   +GTC++ 
Sbjct: 175 TGKLVSLSEQELVDCDVNGMDQGCEGGEMDDAFDFIVGNGGLTTESRYPYTASDGTCNSN 234

Query: 230 KEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCD 289
           +    AA+I  YED+P  DE +L +AV+NQPVSV VD     F FYK GVL+  CG   D
Sbjct: 235 EASGDAASIKGYEDVPANDEASLRKAVANQPVSVAVDGGDSHFRFYKGGVLSGACGTELD 294

Query: 290 HGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD 330
           HG+A VG+G A +  G KYW++KNSWG +WGE+GYIR+ RD
Sbjct: 295 HGIAAVGYGVASD--GTKYWVMKNSWGTSWGEAGYIRMERD 333


>gi|18423124|ref|NP_568722.1| putative cysteine proteinase [Arabidopsis thaliana]
 gi|75309064|sp|Q9FGR9.1|CEP1_ARATH RecName: Full=KDEL-tailed cysteine endopeptidase CEP1; AltName:
           Full=Cysteine proteinase CP56; Short=AtCP56; Flags:
           Precursor
 gi|9759028|dbj|BAB09397.1| cysteine endopeptidase [Arabidopsis thaliana]
 gi|20258850|gb|AAM13907.1| putative cysteine proteinase [Arabidopsis thaliana]
 gi|308097832|gb|ADO14465.1| papain [Arabidopsis thaliana]
 gi|332008536|gb|AED95919.1| putative cysteine proteinase [Arabidopsis thaliana]
          Length = 361

 Score =  331 bits (848), Expect = 4e-88,   Method: Compositional matrix adjust.
 Identities = 168/346 (48%), Positives = 221/346 (63%), Gaps = 16/346 (4%)

Query: 11  IPMFVIIILVITCASQVVSGRSMH------EPSIVEKHEQWMAQHGRTYKDELEKAMRLN 64
           +  F+++ L +    +   G   H      E S+ E +E+W + H      E EKA R N
Sbjct: 1   MKRFIVLALCMLMVLETTKGLDFHNKDVESENSLWELYERWRSHHTVARSLE-EKAKRFN 59

Query: 65  IFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPS-TF 123
           +FK N+++I + NK+ +++YKL  N+F D+T+EEFR  Y G N     + +   + + +F
Sbjct: 60  VFKHNVKHIHETNKK-DKSYKLKLNKFGDMTSEEFRRTYAGSNIKHHRMFQGEKKATKSF 118

Query: 124 KYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQL 183
            Y NV  +PTS+DWR+ GAVT +K+QGQCGSCWAFS V AVEGI QI   KL  LSEQ+L
Sbjct: 119 MYANVNTLPTSVDWRKNGAVTPVKNQGQCGSCWAFSTVVAVEGINQIRTKKLTSLSEQEL 178

Query: 184 VDCSTD-NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYE 242
           VDC T+ N GC+GGLMD AFE+I E  GL +E  YPY+  + TCD  KE A   +I  +E
Sbjct: 179 VDCDTNQNQGCNGGLMDLAFEFIKEKGGLTSELVYPYKASDETCDTNKENAPVVSIDGHE 238

Query: 243 DLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEE 302
           D+PK  E  L++AV+NQPVSV +DA G  F FY  GV    CG   +HGVAVVG+GT  +
Sbjct: 239 DVPKNSEDDLMKAVANQPVSVAIDAGGSDFQFYSEGVFTGRCGTELNHGVAVVGYGTTID 298

Query: 303 ENGAKYWLIKNSWGETWGESGYIRILR----DAGLCGIATAASYPV 344
             G KYW++KNSWGE WGE GYIR+ R      GLCGIA  ASYP+
Sbjct: 299 --GTKYWIVKNSWGEEWGEKGYIRMQRGIRHKEGLCGIAMEASYPL 342


>gi|356545116|ref|XP_003540991.1| PREDICTED: vignain-like [Glycine max]
          Length = 342

 Score =  330 bits (847), Expect = 4e-88,   Method: Compositional matrix adjust.
 Identities = 161/336 (47%), Positives = 218/336 (64%), Gaps = 9/336 (2%)

Query: 15  VIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIE 74
           +++ LV+T  +  V  R + E     KHE+WMAQ+G+ YKD  EK  R  IFK N+ +IE
Sbjct: 11  LVVFLVLTVWTSQVMSRRLSEAYSSVKHEKWMAQYGKVYKDAAEKEKRFQIFKNNVHFIE 70

Query: 75  KANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTS 134
             +  G++ + L  N+F+DL   +F+AL     +   +V   ++  ++FKY +VT +P+S
Sbjct: 71  SFHAAGDKPFNLSINQFADL--HKFKALLINGQKKEHNVRTATATEASFKYDSVTRIPSS 128

Query: 135 IDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDC-STDNHGC 193
           +DWR++GAVT IKDQG C SCWAFS VA +EG+ QIT+G+L+ LSEQ+LVDC   D+ GC
Sbjct: 129 LDWRKRGAVTPIKDQGTCRSCWAFSTVATIEGLHQITKGELVSLSEQELVDCVKGDSEGC 188

Query: 194 SGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALL 253
            GG ++ AFE+I +  G+A+E  YPY+    TC  +KE      I  YE +P   E+ALL
Sbjct: 189 YGGYVEDAFEFIAKKGGVASETHYPYKGVNKTCKVKKETHGVVQIKGYEQVPSNSEKALL 248

Query: 254 QAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKN 313
           +AV++QPVS  V+A G AF FY SG+    CG + DH V VVG+G A    G KYWL+KN
Sbjct: 249 KAVAHQPVSAYVEAGGYAFQFYSSGIFTGKCGTDIDHSVTVVGYGKA--RGGNKYWLVKN 306

Query: 314 SWGETWGESGYIRILRDA----GLCGIATAASYPVA 345
           SWG  WGE GYIR+ RD     GLCGIAT A YP A
Sbjct: 307 SWGTEWGEKGYIRMKRDIRAKEGLCGIATGALYPTA 342


>gi|255539310|ref|XP_002510720.1| cysteine protease, putative [Ricinus communis]
 gi|223551421|gb|EEF52907.1| cysteine protease, putative [Ricinus communis]
          Length = 349

 Score =  330 bits (847), Expect = 5e-88,   Method: Compositional matrix adjust.
 Identities = 162/312 (51%), Positives = 214/312 (68%), Gaps = 13/312 (4%)

Query: 38  IVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNE 97
           +++  E W+++ GR Y+   EK  R  IFK NL +I+  NK+  R Y LG NEF+DL++E
Sbjct: 43  LIDLFESWISRFGRVYESAEEKLERFEIFKDNLFHIDDTNKK-VRNYWLGLNEFADLSHE 101

Query: 98  EFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWA 157
           EF+  Y G     P +S+++  P  F Y++V  +P S+DWR+KGAVT +K+QG CGSCWA
Sbjct: 102 EFKNKYLGLK---PDLSKRAQCPEEFTYKDVA-IPKSVDWRKKGAVTPVKNQGSCGSCWA 157

Query: 158 FSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGLATEAD 216
           FS VAAVEGI QI  G L  LSEQ+L+DC T  N+GC+GGLMD AF YI+ N GL  E D
Sbjct: 158 FSTVAAVEGINQIVTGNLTSLSEQELIDCDTTYNNGCNGGLMDYAFAYIVANGGLHKEED 217

Query: 217 YPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYK 276
           YPY  EEGTCD +KE++ A TIS Y D+P+  E++LL+A++NQP+S+ ++ASGR F FY 
Sbjct: 218 YPYIMEEGTCDMRKEESDAVTISGYHDVPQNSEESLLKALANQPLSIAIEASGRDFQFYS 277

Query: 277 SGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRDA----G 332
            GV +  CG   DHGVA VG+GT++   G  Y ++KNSWG  WGE GYIR+ R      G
Sbjct: 278 GGVFDGHCGTELDHGVAAVGYGTSK---GLDYIIVKNSWGPKWGEKGYIRMKRKTSKPEG 334

Query: 333 LCGIATAASYPV 344
           +CGI   ASYP 
Sbjct: 335 ICGIYKMASYPT 346


>gi|255547982|ref|XP_002515048.1| cysteine protease, putative [Ricinus communis]
 gi|223546099|gb|EEF47602.1| cysteine protease, putative [Ricinus communis]
          Length = 359

 Score =  330 bits (847), Expect = 5e-88,   Method: Compositional matrix adjust.
 Identities = 157/314 (50%), Positives = 217/314 (69%), Gaps = 8/314 (2%)

Query: 35  EPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDL 94
           E S+   +E+W + H    +   EK  R N+FK+NL++I K N++ +R YKL  N+F+D+
Sbjct: 33  EESLWNLYERWRSHH-TVSRSLTEKNQRFNVFKENLKHIHKVNQK-DRPYKLRLNKFADM 90

Query: 95  TNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGS 154
           TN EF   Y G       +   S R + F ++N +++P+SIDWR++GAVT +KDQG+CGS
Sbjct: 91  TNHEFLQHYGGSKVSHYRMFHGSRRQTGFAHENTSNLPSSIDWRKQGAVTGVKDQGKCGS 150

Query: 155 CWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTDNHGCSGGLMDKAFEYIIENKGLATE 214
           CWAFS+VAAVEGI +I  G+LI LSEQ+LVDC++ NHGC GGLM++AF +I +  GL TE
Sbjct: 151 CWAFSSVAAVEGINKIKTGELISLSEQELVDCNSVNHGCDGGLMEQAFSFIEKTGGLTTE 210

Query: 215 ADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHF 274
            +YPYR ++G CD+ K      TI  YE +P+ DE AL+QAV+NQPVS+ +DA G+ F F
Sbjct: 211 NNYPYRAKDGYCDSAKMNTPMVTIDGYEMVPENDEHALMQAVANQPVSIAIDAGGQDFQF 270

Query: 275 YKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILR----D 330
           Y  GV   DCG   +HGVA+VG+G  ++  G KYW++KNSWG  WGE+G+IR+ R    +
Sbjct: 271 YSEGVYTGDCGTELNHGVALVGYGATQD--GTKYWIVKNSWGSEWGENGFIRMQRENDVE 328

Query: 331 AGLCGIATAASYPV 344
            GLCGI   ASYP+
Sbjct: 329 EGLCGITLEASYPI 342


>gi|356508490|ref|XP_003522989.1| PREDICTED: xylem cysteine proteinase 2-like [Glycine max]
          Length = 349

 Score =  330 bits (846), Expect = 6e-88,   Method: Compositional matrix adjust.
 Identities = 162/312 (51%), Positives = 216/312 (69%), Gaps = 13/312 (4%)

Query: 38  IVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNE 97
           ++E  E WM++HG+ Y+   EK +R  IFK NL++I++ NK  +  Y LG NEF+DL+++
Sbjct: 43  LIELFESWMSKHGKIYQSIEEKLLRFEIFKDNLKHIDERNKVVS-NYWLGLNEFADLSHQ 101

Query: 98  EFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWA 157
           EF+  Y G        SR+   P  F Y++V ++P S+DWR+KGAV  +K+QG CGSCWA
Sbjct: 102 EFKNKYLGLK---VDYSRRRESPEEFTYKDV-ELPKSVDWRKKGAVAPVKNQGSCGSCWA 157

Query: 158 FSAVAAVEGITQITRGKLIELSEQQLVDCS-TDNHGCSGGLMDKAFEYIIENKGLATEAD 216
           FS VAAVEGI QI  G L  LSEQ+L+DC  T N+GC+GGLMD AF +I+EN GL  E D
Sbjct: 158 FSTVAAVEGINQIVTGNLTSLSEQELIDCDRTYNNGCNGGLMDYAFSFIVENGGLHKEED 217

Query: 217 YPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYK 276
           YPY  EEGTC+  KE+    TIS Y D+P+ +EQ+LL+A++NQP+SV ++ASGR F FY 
Sbjct: 218 YPYIMEEGTCEMTKEETEVVTISGYHDVPQNNEQSLLKALANQPLSVAIEASGRDFQFYS 277

Query: 277 SGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRDA----G 332
            GV +  CG++ DHGVA VG+GTA+   G  Y ++KNSWG  WGE GYIR+ R+     G
Sbjct: 278 GGVFDGHCGSDLDHGVAAVGYGTAK---GVDYIIVKNSWGSKWGEKGYIRMRRNIGKPEG 334

Query: 333 LCGIATAASYPV 344
           +CGI   ASYP 
Sbjct: 335 ICGIYKMASYPT 346


>gi|2414570|emb|CAB16317.1| cysteine proteinase precursor [Nicotiana tabacum]
          Length = 374

 Score =  330 bits (845), Expect = 7e-88,   Method: Compositional matrix adjust.
 Identities = 162/318 (50%), Positives = 212/318 (66%), Gaps = 13/318 (4%)

Query: 35  EPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDL 94
           E  +  ++E W+A+HGR Y    EK  R  IFK NL +IE  N  GNRTYK+G N+F+DL
Sbjct: 43  EDQVKNRYEMWLAEHGRAYNALGEKEKRFEIFKDNLRFIEGHNNSGNRTYKVGLNQFADL 102

Query: 95  TNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTD--VPTSIDWREKGAVTHIKDQGQC 152
           TNEE+R +Y G          +S  PS  +Y +  +  +P S+DWR++GAV  IK+QG C
Sbjct: 103 TNEEYRTMYLGTKSDARRRFVKSKNPSQ-RYASRPNELMPHSVDWRKRGAVAPIKNQGSC 161

Query: 153 GSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCS-TDNHGCSGGLMDKAFEYIIENKGL 211
           GSCWAFS VAAVEGI QI  G++I LSEQ+LVDC    N GC+GGLMD AFE+II N G+
Sbjct: 162 GSCWAFSTVAAVEGINQIVTGEMITLSEQELVDCDRVQNSGCNGGLMDYAFEFIISNGGM 221

Query: 212 ATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRA 271
            TE  YPYR  EG CD  ++     +I  YED+P+ +E+AL +AV++QPV V ++ASGRA
Sbjct: 222 DTEKHYPYRGVEGRCDPVRKNYKVVSIDGYEDVPR-NERALQKAVAHQPVCVAIEASGRA 280

Query: 272 FHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRDA 331
           F  Y SGV   +CG   DHGV VVG+G+   E+G  YW+++NSWG  WGE+GY+++ R+ 
Sbjct: 281 FQLYSSGVFTGECGEEVDHGVVVVGYGS---EDGVDYWIVRNSWGTKWGENGYVKMERNV 337

Query: 332 -----GLCGIATAASYPV 344
                G CGI T ASYP 
Sbjct: 338 KKSHLGKCGIMTEASYPT 355


>gi|326493368|dbj|BAJ85145.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 436

 Score =  330 bits (845), Expect = 8e-88,   Method: Compositional matrix adjust.
 Identities = 163/336 (48%), Positives = 214/336 (63%), Gaps = 14/336 (4%)

Query: 17  IILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKA 76
           + L       +VS     E  +   + +WMA+HG TY    E+  R   F+ NL YI++ 
Sbjct: 18  VSLAAAADMSIVSYGERSEEEVRRMYAEWMAEHGSTYNAIGEEERRFEAFRDNLRYIDQH 77

Query: 77  N---KEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPT 133
           N     G  +++LG N F+DLTNEE+R+ Y G  R  P   R+ S  + ++  +  ++P 
Sbjct: 78  NAAADAGVHSFRLGLNRFADLTNEEYRSTYLGA-RTKPDRERKLS--ARYQAADNDELPE 134

Query: 134 SIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHG 192
           S+DWR+KGAV  +KDQG CGSCWAFSA+AAVEGI QI  G +I LSEQ+LVDC T  N G
Sbjct: 135 SVDWRKKGAVGAVKDQGGCGSCWAFSAIAAVEGINQIVTGDMIPLSEQELVDCDTSYNQG 194

Query: 193 CSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQAL 252
           C+GGLMD AFE+II N G+ +E DYPY+  +  CD  K+ A   TI  YED+P   E++L
Sbjct: 195 CNGGLMDYAFEFIINNGGIDSEEDYPYKERDNRCDANKKNAKVVTIDGYEDVPVNSEKSL 254

Query: 253 LQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIK 312
            +AV+NQP+SV ++A GRAF  YKSG+    CG   DHGVA VG+GT   ENG  YWL++
Sbjct: 255 QKAVANQPISVAIEAGGRAFQLYKSGIFTGTCGTALDHGVAAVGYGT---ENGKDYWLVR 311

Query: 313 NSWGETWGESGYIRILRD----AGLCGIATAASYPV 344
           NSWG  WGE GYIR+ R+    +G CGIA   SYP 
Sbjct: 312 NSWGSVWGEDGYIRMERNIKASSGKCGIAVEPSYPT 347


>gi|359483514|ref|XP_003632971.1| PREDICTED: LOW QUALITY PROTEIN: oryzain beta chain-like [Vitis
           vinifera]
          Length = 340

 Score =  330 bits (845), Expect = 8e-88,   Method: Compositional matrix adjust.
 Identities = 163/343 (47%), Positives = 230/343 (67%), Gaps = 15/343 (4%)

Query: 13  MFVIIILVITC----ASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQ 68
           +FV + L I      AS+  S R +HE S+ E+HEQWMA++ R YKD+ E+  R  +FK 
Sbjct: 3   LFVCMTLHIYYLEHRASEATS-RPLHEASMYERHEQWMARYSRNYKDDAEEERRFXMFKD 61

Query: 69  NLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNV 128
           N+++I+  +  GN   KLG N  +D+T+EEFRA  +G    +P      S  ++F++QNV
Sbjct: 62  NVDFIQTFDTAGNMPNKLGVNALADMTHEEFRA--SGNTFKIPPNLGLRSETTSFRHQNV 119

Query: 129 TDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST 188
           T +P+++DWR+K  VTHIK+Q QCG CWAFSAVAA+EGI ++   K I LSEQ+LVDC  
Sbjct: 120 TRIPSTMDWRKKRTVTHIKNQLQCGGCWAFSAVAAMEGIAKLQTSKSISLSEQELVDCDI 179

Query: 189 --DNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPK 246
              N GC GG MD AF++II+N+GL +EA Y Y+  EG C+ +KE + AA I+ YE++P+
Sbjct: 180 FGSNIGCEGGCMDDAFKFIIQNRGLNSEARYLYKGVEGHCNKKKESSRAARINDYENMPE 239

Query: 247 GDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGA 306
             E+ALL+ V++QP+SV +DA G AF FY+ G++  + GN+ D+GV   G+G + +  G 
Sbjct: 240 FSEKALLKVVAHQPISVAIDAGGSAFQFYEIGIITXESGNDLDYGVTTDGYGRSAD--GK 297

Query: 307 KYWLIKNSWGETWGESGYIRILR----DAGLCGIATAASYPVA 345
           K+WL+KNSWG  WGE+GY R+ R      GLCG    ASYP A
Sbjct: 298 KHWLVKNSWGTDWGENGYTRMERGVKATTGLCGFTMQASYPTA 340


>gi|224136808|ref|XP_002326950.1| predicted protein [Populus trichocarpa]
 gi|222835265|gb|EEE73700.1| predicted protein [Populus trichocarpa]
          Length = 456

 Score =  329 bits (844), Expect = 1e-87,   Method: Compositional matrix adjust.
 Identities = 170/355 (47%), Positives = 227/355 (63%), Gaps = 35/355 (9%)

Query: 13  MFVIIILVITCAS----QVVSGRSMH--------EPSIVEKHEQWMAQHGRTYKDELEKA 60
           M +++ LV   +S     ++S    H        +  ++  +E+W+ +HG+ Y    EK 
Sbjct: 1   MLMLLFLVFALSSAFDMSIISYHQTHATKSSWRTDDEVMAMYEEWLVKHGKNYNALGEKE 60

Query: 61  MRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALY----TGYNRPVPSVS-R 115
            R  IFK NL +I++ N E NRTY +G N F+DLTNEEFR++Y    TG+ + +P  S R
Sbjct: 61  KRFEIFKDNLMFIDQHNSE-NRTYTVGLNRFADLTNEEFRSMYLGTRTGHKKRLPKTSDR 119

Query: 116 QSSRPSTFKYQNVTD-VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGK 174
            + R        V D +P S+DWR++GAV  +KDQG CGSCWAFS +AAVEGI +I  G 
Sbjct: 120 YAPR--------VGDSLPDSVDWRKEGAVAEVKDQGGCGSCWAFSTIAAVEGINKIVTGD 171

Query: 175 LIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKA 233
           LI LSEQ+LVDC T  N GC+GGLMD AFE+II N G+ TE DYPY   +G CD  ++ A
Sbjct: 172 LIALSEQELVDCDTSYNEGCNGGLMDYAFEFIINNGGIDTEDDYPYLGRDGRCDTYRKNA 231

Query: 234 VAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVA 293
              +I  YED+P+ DE AL +AV+NQPVSV ++  GR F  Y SGV   +CG + DHGVA
Sbjct: 232 KVVSIDSYEDVPENDETALKKAVANQPVSVAIEGGGRNFQLYNSGVFTGECGTSLDHGVA 291

Query: 294 VVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD----AGLCGIATAASYPV 344
            VG+GT   E G  YW+++NSWG++WGESGYIR+ R+     G CGIA   SYP+
Sbjct: 292 AVGYGT---EKGKDYWIVRNSWGKSWGESGYIRMERNIASPTGKCGIAIEPSYPI 343


>gi|449454309|ref|XP_004144898.1| PREDICTED: xylem cysteine proteinase 1-like [Cucumis sativus]
 gi|449471311|ref|XP_004153272.1| PREDICTED: xylem cysteine proteinase 1-like [Cucumis sativus]
          Length = 349

 Score =  329 bits (844), Expect = 1e-87,   Method: Compositional matrix adjust.
 Identities = 172/349 (49%), Positives = 224/349 (64%), Gaps = 17/349 (4%)

Query: 5   FEKSFIIPMFVIIILVITCASQVVSGRSM-HEPSI---VEKHEQWMAQHGRTYKDELEKA 60
           F K+ +I    + I   T     + G S  H  S+   +E  E WM++H + Y+   EK 
Sbjct: 6   FSKATLILSATLFITYATAHDFSIVGYSPEHLASMDKTIELFESWMSKHSKAYRSIEEKL 65

Query: 61  MRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRP 120
            R  IF  NL++I++ NK+ + +Y LG NEF+DL++EEF++ Y G     P   ++SSR 
Sbjct: 66  HRFEIFLDNLKHIDETNKKVS-SYWLGLNEFADLSHEEFKSKYLGLRVEFPR--KRSSR- 121

Query: 121 STFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSE 180
             F Y +V D+P S+DWR KGAVT +K+QG CGSCWAFS VAAVEGI QI  G L  LSE
Sbjct: 122 -GFSYGDVEDLPESVDWRTKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVTGNLTSLSE 180

Query: 181 QQLVDCSTD-NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATIS 239
           Q+L+DC    N+GC GGLMD AF+YI+ N GL  E DYPY  EEG C  +KE+    TIS
Sbjct: 181 QELIDCDRSFNNGCYGGLMDYAFQYIMSNSGLRKEEDYPYLMEEGRCIREKEQFEVVTIS 240

Query: 240 KYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGT 299
            YED+P  DEQ+LL+A+S+QPVSV ++AS R F FYK G+    CG   DHGV  VG+G+
Sbjct: 241 GYEDVPANDEQSLLKALSHQPVSVAIEASSRNFQFYKGGIFTGRCGTQMDHGVTAVGYGS 300

Query: 300 AEEENGAKYWLIKNSWGETWGESGYIRILRDA----GLCGIATAASYPV 344
           +E   G  Y ++KNSWG  WGE+GYIR+ R+     GLCGI   ASYP 
Sbjct: 301 SE---GTDYIIVKNSWGPKWGENGYIRMKRNTGKPEGLCGINQMASYPT 346


>gi|4731374|gb|AAD28477.1|AF133839_1 papain-like cysteine protease [Sandersonia aurantiaca]
          Length = 357

 Score =  329 bits (844), Expect = 1e-87,   Method: Compositional matrix adjust.
 Identities = 165/346 (47%), Positives = 224/346 (64%), Gaps = 18/346 (5%)

Query: 10  IIPMFVIIILVI--TCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFK 67
           + P+ +++ L    T +  +       E S+   +E+W + H  + +D  +K  R N+FK
Sbjct: 4   LFPVLLVLALAFGSTLSIPIKEKDLESEDSLWSLYERWRSHHAVS-RDLDQKQKRFNVFK 62

Query: 68  QNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTG----YNRPVPSVSRQSSRPSTF 123
           +N+++I + NK  + T+KL  N+F D+TN+EFRA Y G    ++R +      S   + F
Sbjct: 63  ENVKFIHEFNKNKDVTFKLALNKFGDMTNQEFRAKYAGSKVHHHRTMKGSRHGSGSGAKF 122

Query: 124 KYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQL 183
            Y+N    P SIDWRE+GAV  +K+QGQCGSCWAFSA+AAVEGI QI   +L+ LSEQ+L
Sbjct: 123 MYENAV-APPSIDWRERGAVAAVKNQGQCGSCWAFSAIAAVEGINQIVTKELVPLSEQEL 181

Query: 184 VDCSTD-NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYE 242
           +DC TD N GCSGGLMD AFE+I  N G+ TE  YPY+ E+ TC   K+ + A  I  YE
Sbjct: 182 IDCDTDQNQGCSGGLMDYAFEFIKNNGGITTEDVYPYQAEDATC---KKNSPAVVIDGYE 238

Query: 243 DLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEE 302
           D+P  DE AL++AV+NQPV+V ++ASG  F FY  GV    CG   DHGVAVVG+GT ++
Sbjct: 239 DVPTNDEDALMKAVANQPVAVAIEASGYVFQFYSEGVFTGRCGTELDHGVAVVGYGTTQD 298

Query: 303 ENGAKYWLIKNSWGETWGESGYIRILRDA----GLCGIATAASYPV 344
             G KYW ++NSWG  WGESGY+R+ R      GLCGIA  ASYP+
Sbjct: 299 --GTKYWTVRNSWGADWGESGYVRMQRGIKATHGLCGIAMQASYPI 342


>gi|255540425|ref|XP_002511277.1| cysteine protease, putative [Ricinus communis]
 gi|46395620|sp|O65039.1|CYSEP_RICCO RecName: Full=Vignain; AltName: Full=Cysteine endopeptidase; Flags:
           Precursor
 gi|2944446|gb|AAC62396.1| cysteine endopeptidase precursor [Ricinus communis]
 gi|223550392|gb|EEF51879.1| cysteine protease, putative [Ricinus communis]
          Length = 360

 Score =  329 bits (843), Expect = 1e-87,   Method: Compositional matrix adjust.
 Identities = 164/309 (53%), Positives = 206/309 (66%), Gaps = 10/309 (3%)

Query: 42  HEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRA 101
           +E+W + H    +   EK  R N+FK N  ++  ANK  ++ YKL  N+F+D+TN EFR 
Sbjct: 38  YERWRSHH-TVSRSLHEKQKRFNVFKHNAMHVHNANKM-DKPYKLKLNKFADMTNHEFRN 95

Query: 102 LYTGYNRPVPSVSRQSSRPS-TFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSA 160
            Y+G       + R   R + TF Y+ V  VP S+DWR+KGAVT +KDQGQCGSCWAFS 
Sbjct: 96  TYSGSKVKHHRMFRGGPRGNGTFMYEKVDTVPASVDWRKKGAVTSVKDQGQCGSCWAFST 155

Query: 161 VAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGLATEADYPY 219
           + AVEGI QI   KL+ LSEQ+LVDC TD N GC+GGLMD AFE+I +  G+ TEA+YPY
Sbjct: 156 IVAVEGINQIKTNKLVSLSEQELVDCDTDQNQGCNGGLMDYAFEFIKQRGGITTEANYPY 215

Query: 220 RHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGV 279
              +GTCD  KE A A +I  +E++P+ DE ALL+AV+NQPVSV +DA G  F FY  GV
Sbjct: 216 EAYDGTCDVSKENAPAVSIDGHENVPENDENALLKAVANQPVSVAIDAGGSDFQFYSEGV 275

Query: 280 LNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILR----DAGLCG 335
               CG   DHGVA+VG+GT  +  G KYW +KNSWG  WGE GYIR+ R      GLCG
Sbjct: 276 FTGSCGTELDHGVAIVGYGTTID--GTKYWTVKNSWGPEWGEKGYIRMERGISDKEGLCG 333

Query: 336 IATAASYPV 344
           IA  ASYP+
Sbjct: 334 IAMEASYPI 342


>gi|356563584|ref|XP_003550041.1| PREDICTED: cysteine proteinase RD21a-like [Glycine max]
          Length = 366

 Score =  329 bits (843), Expect = 1e-87,   Method: Compositional matrix adjust.
 Identities = 157/343 (45%), Positives = 213/343 (62%), Gaps = 9/343 (2%)

Query: 8   SFIIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFK 67
           + +I   + +   ++CA    +  +  +  ++  +E+W+ +H + Y    EK  R  +FK
Sbjct: 6   TLMISTLLFLSFTLSCAIDTSTITNYTDNEVMTMYEEWLVKHQKVYNGLGEKDKRFQVFK 65

Query: 68  QNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPS-VSRQSSRPSTFKYQ 126
            NL +I++ N   N TYKLG N+F+D+TNEE+R +Y G        + +  S    + Y 
Sbjct: 66  DNLGFIQEHNNNQNNTYKLGLNKFADMTNEEYRVMYFGTKSDAKRRLMKTKSTGHRYAYS 125

Query: 127 NVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDC 186
               +P  +DWR KGAV  IKDQG CGSCWAFS VA VE I +I  GK + LSEQ+LVDC
Sbjct: 126 AGDQLPVHVDWRVKGAVAPIKDQGSCGSCWAFSTVATVEAINKIVTGKFVSLSEQELVDC 185

Query: 187 STD-NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLP 245
               N GC+GGLMD AFE+II+N G+ T+ DYPYR  +G CD  K+ A A  I  YED+P
Sbjct: 186 DRAYNQGCNGGLMDYAFEFIIQNGGIDTDKDYPYRGFDGICDPTKKNAKAVNIDGYEDVP 245

Query: 246 KGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENG 305
             DE AL +AV+ QPVS+ ++ASGRA   Y+SGV   +CG + DHGV VVG+G+   ENG
Sbjct: 246 PYDENALKKAVARQPVSIAIEASGRALQLYQSGVFTGECGTSLDHGVVVVGYGS---ENG 302

Query: 306 AKYWLIKNSWGETWGESGYIRILRD----AGLCGIATAASYPV 344
             YWL++NSWG  WGE GY ++ R+     G CGI   ASYPV
Sbjct: 303 VDYWLVRNSWGTGWGEDGYFKMQRNVRTPTGKCGITMEASYPV 345


>gi|218202087|gb|EEC84514.1| hypothetical protein OsI_31214 [Oryza sativa Indica Group]
          Length = 348

 Score =  329 bits (843), Expect = 2e-87,   Method: Compositional matrix adjust.
 Identities = 160/333 (48%), Positives = 222/333 (66%), Gaps = 17/333 (5%)

Query: 13  MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEY 72
           +F I+  +  C++ + +     + ++  +HE+WMAQ+GR YKD+ EKA R  +FK N  +
Sbjct: 8   LFAILGCLCLCSAVLAARELSDDAAMAARHERWMAQYGRMYKDDAEKARRFEVFKANAAF 67

Query: 73  IEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT--D 130
           IE  N  GN  + LG N+F+DLTN+EFR   T     +PS +R    P+ F+Y+NV    
Sbjct: 68  IESFNA-GNHKFWLGVNQFADLTNDEFRLTKTNKGF-IPSTTRV---PTGFRYENVNIDA 122

Query: 131 VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST-- 188
           +P ++DWR KG VT IKDQGQCG CWAFSAVAA+EGI +++ GKLI LSEQ+LVDC    
Sbjct: 123 LPATMDWRTKGVVTPIKDQGQCGCCWAFSAVAAMEGIVKLSTGKLISLSEQELVDCDVHG 182

Query: 189 DNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGD 248
           ++ GC GGLMD AF++II+N GL TE++YPY   +  C +       A+I  YED+P  +
Sbjct: 183 EDQGCEGGLMDDAFKFIIKNGGLTTESNYPYAAADDKCKSVSNS--VASIKGYEDVPANN 240

Query: 249 EQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKY 308
           E AL++AV+NQPVSV VD     F FYK GV+   CG + DHG+  +G+G A +  G KY
Sbjct: 241 EAALMKAVANQPVSVAVDGDDMTFQFYKGGVMIGSCGTDLDHGIVAIGYGKASD--GTKY 298

Query: 309 WLIKNSWGETWGESGYIRILRDA----GLCGIA 337
           WL+KNSWG TWGE+G++R+ +D     G+CG+A
Sbjct: 299 WLLKNSWGMTWGENGFLRMEKDISDKRGMCGLA 331


>gi|351629615|gb|AEQ54771.1| KDDL-tailed cysteine proteinase CP4 [Coffea canephora]
          Length = 359

 Score =  328 bits (842), Expect = 2e-87,   Method: Compositional matrix adjust.
 Identities = 163/351 (46%), Positives = 229/351 (65%), Gaps = 13/351 (3%)

Query: 3   LKFEKSFIIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMR 62
           +K  K+F+  + + +ILV   + ++       E S+ + +E+W + H    +D  EK  R
Sbjct: 1   MKMGKAFLFAVVLAVILVAAMSMEITERDLASEESLWDLYERWRSHH-TVSRDLSEKRKR 59

Query: 63  LNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPST 122
            N+FK N+ +I K N++ ++ YKL  N F+D+TN EFR  Y+   +    +    SR +T
Sbjct: 60  FNVFKANVHHIHKVNQK-DKPYKLKLNSFADMTNHEFREFYSSKVKHYRML--HGSRANT 116

Query: 123 -FKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQ 181
            F +     +P S+DWR++GAVT +K+QG+CGSCWAFS V  VEGI +I  G+L+ LSEQ
Sbjct: 117 GFMHGKTESLPASVDWRKQGAVTGVKNQGKCGSCWAFSTVVGVEGINKIKTGQLVSLSEQ 176

Query: 182 QLVDCSTDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKY 241
           +LVDC TDN GC+GGLM+ A+E+I ++ G+ TE  YPY+  +G+CD+ K  A A TI  +
Sbjct: 177 ELVDCETDNEGCNGGLMENAYEFIKKSGGITTERLYPYKARDGSCDSSKMNAPAVTIDGH 236

Query: 242 EDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNAD-CGNNCDHGVAVVGFGTA 300
           E +P  DE AL++AV+NQPVSV +DASG    FY  GV   D CGN  DHGVAVVG+GTA
Sbjct: 237 EMVPANDENALMKAVANQPVSVAIDASGSDMQFYSEGVYAGDSCGNELDHGVAVVGYGTA 296

Query: 301 EEENGAKYWLIKNSWGETWGESGYIRILR-----DAGLCGIATAASYPVAI 346
            +  G KYW++KNSWG  WGE GYIR+ R     + G+CGIA  ASYP+ +
Sbjct: 297 LD--GTKYWIVKNSWGTGWGEQGYIRMQRGVDAAEGGVCGIAMEASYPLKL 345


>gi|212275830|ref|NP_001130503.1| cysteine protease 1 [Zea mays]
 gi|194689328|gb|ACF78748.1| unknown [Zea mays]
 gi|219886279|gb|ACL53514.1| unknown [Zea mays]
 gi|238010470|gb|ACR36270.1| unknown [Zea mays]
 gi|413920875|gb|AFW60807.1| cysteine protease 1 [Zea mays]
          Length = 354

 Score =  328 bits (842), Expect = 2e-87,   Method: Compositional matrix adjust.
 Identities = 173/355 (48%), Positives = 234/355 (65%), Gaps = 30/355 (8%)

Query: 10  IIPMFVIIILVITCASQVVSGRSMH--------EPSIVEKHEQWMAQHGRTYKDELEKAM 61
           +I    + + ++   + +   R +         E ++  +H+QWMA+HGRTY+DE EKA 
Sbjct: 11  VITFTAVALTILAVTTMMAEARDLSSTSTGGYGEEAMKVRHQQWMAEHGRTYRDEAEKAH 70

Query: 62  RLNIFKQNLEYIEKANKEGN--RTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSR 119
           R  +FK N ++++ +N  G+  ++Y+L  NEF+D+TN+EF A+YTG  RPVP+ ++   +
Sbjct: 71  RFQVFKANADFVDASNAAGDDKKSYRLELNEFADMTNDEFMAMYTGL-RPVPAGAK---K 126

Query: 120 PSTFKYQNVT-----DVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGK 174
            + FKY NVT     D   ++DWR+KGAVT IK+QGQCG CWAF+AVAAVEGI QIT G 
Sbjct: 127 MAGFKYGNVTLSDADDDQQTVDWRQKGAVTGIKNQGQCGCCWAFAAVAAVEGIHQITTGN 186

Query: 175 LIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKA 233
           L+ LSEQQ++DC TD N+GC+GG +D AF+YI+ N GL TE  YPY   +  C  Q  + 
Sbjct: 187 LVSLSEQQVLDCDTDGNNGCNGGYIDNAFQYIVGNGGLGTEDAYPYTAAQAMC--QSVQP 244

Query: 234 VAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLN-ADCGN--NCDH 290
           VAA IS Y+D+P GDE AL  AV+NQPVSV +DA    F  Y  GV+  A C    N +H
Sbjct: 245 VAA-ISGYQDVPSGDEAALAAAVANQPVSVAIDA--HNFQLYGGGVMTAASCSTPPNLNH 301

Query: 291 GVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRDAGLCGIATAASYPVA 345
            V  VG+GTAE+  G  YWL+KN WG+ WGE GY+R+ R A  CG+A  ASYPVA
Sbjct: 302 AVTAVGYGTAED--GTPYWLLKNQWGQNWGEGGYLRLERGANACGVAQQASYPVA 354


>gi|356517184|ref|XP_003527269.1| PREDICTED: xylem cysteine proteinase 2-like [Glycine max]
          Length = 350

 Score =  328 bits (842), Expect = 2e-87,   Method: Compositional matrix adjust.
 Identities = 162/312 (51%), Positives = 215/312 (68%), Gaps = 13/312 (4%)

Query: 38  IVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNE 97
           ++E  E W+++HG+ Y+   EK  R  IFK NL++I++ NK  +  Y LG NEF+DL+++
Sbjct: 44  LIELFESWISRHGKIYQSIEEKLHRFEIFKDNLKHIDERNKVVS-NYWLGLNEFADLSHQ 102

Query: 98  EFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWA 157
           EF+  Y G        SR+   P  F Y++V ++P S+DWR+KGAVT +K+QG CGSCWA
Sbjct: 103 EFKNKYLGLK---VDYSRRRESPEEFTYKDV-ELPKSVDWRKKGAVTQVKNQGSCGSCWA 158

Query: 158 FSAVAAVEGITQITRGKLIELSEQQLVDCS-TDNHGCSGGLMDKAFEYIIENKGLATEAD 216
           FS VAAVEGI QI  G L  LSEQ+L+DC  T N+GC+GGLMD AF +I+EN GL  E D
Sbjct: 159 FSTVAAVEGINQIVTGNLTSLSEQELIDCDRTYNNGCNGGLMDYAFSFIVENDGLHKEED 218

Query: 217 YPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYK 276
           YPY  EEGTC+  KE+    TIS Y D+P+ +EQ+LL+A++NQP+SV ++ASGR F FY 
Sbjct: 219 YPYIMEEGTCEMAKEETEVVTISGYHDVPQNNEQSLLKALANQPLSVAIEASGRDFQFYS 278

Query: 277 SGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD----AG 332
            GV +  CG++ DHGVA VG+GTA+   G  Y  +KNSWG  WGE GYIR+ R+     G
Sbjct: 279 GGVFDGHCGSDLDHGVAAVGYGTAK---GVDYITVKNSWGSKWGEKGYIRMRRNIGKPEG 335

Query: 333 LCGIATAASYPV 344
           +CGI   ASYP 
Sbjct: 336 ICGIYKMASYPT 347


>gi|356508487|ref|XP_003522988.1| PREDICTED: xylem cysteine proteinase 2-like [Glycine max]
          Length = 349

 Score =  328 bits (841), Expect = 2e-87,   Method: Compositional matrix adjust.
 Identities = 161/312 (51%), Positives = 215/312 (68%), Gaps = 13/312 (4%)

Query: 38  IVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNE 97
           ++E  E WM++HG+ Y+   EK  R +IFK NL++I++ NK  +  Y LG NEF+DL+++
Sbjct: 43  LIELFESWMSRHGKIYQSIEEKLHRFDIFKDNLKHIDERNKVVS-NYWLGLNEFADLSHQ 101

Query: 98  EFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWA 157
           EF+  Y G        SR+   P  F Y++  ++P S+DWR+KGAVT +K+QG CGSCWA
Sbjct: 102 EFKNKYLGLK---VDYSRRRESPEEFTYKDF-ELPKSVDWRKKGAVTQVKNQGSCGSCWA 157

Query: 158 FSAVAAVEGITQITRGKLIELSEQQLVDCS-TDNHGCSGGLMDKAFEYIIENKGLATEAD 216
           FS VAAVEGI QI  G L  LSEQ+L+DC  T N+GC+GGLMD AF +I+EN GL  E D
Sbjct: 158 FSTVAAVEGINQIVTGNLTSLSEQELIDCDRTYNNGCNGGLMDYAFSFIVENGGLHKEED 217

Query: 217 YPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYK 276
           YPY  EEGTC+  KE+    TIS Y D+P+ +EQ+LL+A+ NQP+SV ++ASGR F FY 
Sbjct: 218 YPYIMEEGTCEMTKEETEVVTISGYHDVPQNNEQSLLKALVNQPLSVAIEASGRDFQFYS 277

Query: 277 SGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRDA----G 332
            GV +  CG++ DHGVA VG+GT++   G  Y ++KNSWG  WGE GYIR+ R+     G
Sbjct: 278 GGVFDGHCGSDLDHGVAAVGYGTSK---GVNYIIVKNSWGSKWGEKGYIRMRRNIGKPEG 334

Query: 333 LCGIATAASYPV 344
           +CGI   ASYP 
Sbjct: 335 ICGIYKMASYPT 346


>gi|297792329|ref|XP_002864049.1| hypothetical protein ARALYDRAFT_495086 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297309884|gb|EFH40308.1| hypothetical protein ARALYDRAFT_495086 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 361

 Score =  328 bits (840), Expect = 3e-87,   Method: Compositional matrix adjust.
 Identities = 171/347 (49%), Positives = 225/347 (64%), Gaps = 15/347 (4%)

Query: 7   KSFIIPMFVIIILVITCASQVVSGRSMH-EPSIVEKHEQWMAQHGRTYKDELEKAMRLNI 65
           K FI+    +++++ T  S     + +  E S+ E +E+W + H      E EKA R N+
Sbjct: 2   KRFIVLALCMLMVLETTKSLDFHEKDVESEDSLWELYERWKSHHTIARSLE-EKAKRFNV 60

Query: 66  FKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYN---RPVPSVSRQSSRPST 122
           FK N+++I + NK+ N +YKL  N+F D+T+EEFR  Y G N     +    RQ+++  +
Sbjct: 61  FKHNVKHIHETNKKEN-SYKLKLNKFGDMTSEEFRRTYAGSNIKHHRMFQGERQTTK--S 117

Query: 123 FKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQ 182
           F Y NV  +PTS+DWR+ GAVT +K+QGQCGSCWAFS V AVEGI QI   KL  LSEQ+
Sbjct: 118 FMYANVDTLPTSVDWRKNGAVTPVKNQGQCGSCWAFSTVVAVEGINQIRTKKLTSLSEQE 177

Query: 183 LVDCSTD-NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKY 241
           LVDC T+ N GC+GGLMD AFE+I E  GL +E  YPY+  + TCD  KE A   +I  +
Sbjct: 178 LVDCDTNKNQGCNGGLMDLAFEFIKEKGGLTSELVYPYKASDETCDTNKENAPVVSIDGH 237

Query: 242 EDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAE 301
           ED+PK  E  L++AV++QPVSV +DA G  F FY  GV    CG   +HGVAVVG+GT  
Sbjct: 238 EDVPKNSEVDLMKAVAHQPVSVAIDAGGSDFQFYSEGVFTGRCGTELNHGVAVVGYGTTI 297

Query: 302 EENGAKYWLIKNSWGETWGESGYIRILR----DAGLCGIATAASYPV 344
           +  G KYW++KNSWGE WGE GYIR+ R      GLCGIA  ASYP+
Sbjct: 298 D--GTKYWIVKNSWGEEWGEKGYIRMQRGIRHKEGLCGIAMEASYPL 342


>gi|146216000|gb|ABQ10202.1| cysteine protease Cp4 [Actinidia deliciosa]
          Length = 463

 Score =  328 bits (840), Expect = 3e-87,   Method: Compositional matrix adjust.
 Identities = 160/342 (46%), Positives = 221/342 (64%), Gaps = 15/342 (4%)

Query: 13  MFVIIILVITCASQVVSGRSMHEPSIVEK-----HEQWMAQHGRTYKDELEKAMRLNIFK 67
           +F+           ++S    H P   +      +E+W+  HG+ Y    EK  R  IFK
Sbjct: 13  LFLCFAFSSALDMSIISYDQTHPPQRTDAEAMAIYEKWLTTHGKAYNAIGEKERRFEIFK 72

Query: 68  QNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQN 127
            NL ++++ N     +Y++G N F+DLTNEE+R+++ G N  +   S  S++   + ++ 
Sbjct: 73  DNLRFVDEHNAVAG-SYRVGLNRFADLTNEEYRSMFLGGNMEMKERS-ASTKSDRYAFRA 130

Query: 128 VTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCS 187
              +P S+DWREKGAV+ +KDQGQCGSCWAFS ++AVEGI QI  G+LI LSEQ+LVDC 
Sbjct: 131 GDKLPGSVDWREKGAVSPVKDQGQCGSCWAFSTISAVEGINQIVTGELISLSEQELVDCD 190

Query: 188 TD-NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPK 246
              N GC+GGLMD  F++II N G+ TE DYPYR  +GTCD  ++ A   +I+ YED+P+
Sbjct: 191 KSYNMGCNGGLMDYGFQFIINNGGIDTEEDYPYRAVDGTCDQFRKNARVVSINGYEDVPE 250

Query: 247 GDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGA 306
            DE +L +AV+NQPVSV ++A GRAF  Y+SGV    CG N DHGV  VG+GT   ENG 
Sbjct: 251 DDENSLKKAVANQPVSVAIEAGGRAFQLYESGVFTGHCGTNLDHGVVAVGYGT---ENGV 307

Query: 307 KYWLIKNSWGETWGESGYIRILRD----AGLCGIATAASYPV 344
            YW ++NSWG  WGE+GYI++ R+    +G CGIA+ ASYP 
Sbjct: 308 DYWTVRNSWGPKWGENGYIKLERNINATSGKCGIASMASYPT 349


>gi|194352750|emb|CAQ00103.1| papain-like cysteine proteinase [Hordeum vulgare subsp. vulgare]
 gi|326514262|dbj|BAJ92281.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326519402|dbj|BAJ96700.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326524351|dbj|BAK00559.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326531998|dbj|BAK01375.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 356

 Score =  328 bits (840), Expect = 3e-87,   Method: Compositional matrix adjust.
 Identities = 167/315 (53%), Positives = 218/315 (69%), Gaps = 14/315 (4%)

Query: 38  IVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNE 97
           +VE  E+W+A+H + Y    EK  R  +FK NL++I+K N+E   +Y LG NEF+DLT++
Sbjct: 45  LVELFEKWLAKHQKAYASFEEKLHRFEVFKDNLKHIDKINRE-VTSYWLGLNEFADLTHD 103

Query: 98  EFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT--DVPTSIDWREKGAVTHIKDQGQCGSC 155
           EF+A Y G +       R SSR  +F+Y++V+  D+P S+DWR+KGAVT +K+QGQCGSC
Sbjct: 104 EFKAAYLGLD--AAPARRGSSR--SFRYEDVSASDLPKSVDWRKKGAVTEVKNQGQCGSC 159

Query: 156 WAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGLATE 214
           WAFS VAAVEGI  I  G L  LSEQ+L+DCS D N GC+GGLMD AF YI  + GL TE
Sbjct: 160 WAFSTVAAVEGINAIVTGNLTALSEQELIDCSVDGNSGCNGGLMDYAFSYIASSGGLHTE 219

Query: 215 ADYPYRHEEGTC-DNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFH 273
             YPY  EEG+C D +K ++ A TIS YED+P  DEQAL++A+++QPVSV ++ASGR F 
Sbjct: 220 EAYPYLMEEGSCGDGKKAESEAVTISGYEDVPANDEQALIKALAHQPVSVAIEASGRHFQ 279

Query: 274 FYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRDA-- 331
           FY  GV +  CG   DHGVA VG+G+ ++  G  Y +++NSWG  WGE GYIR+ R    
Sbjct: 280 FYSGGVFDGPCGAQLDHGVAAVGYGS-DKGKGHDYIIVRNSWGAQWGEKGYIRMKRGTSN 338

Query: 332 --GLCGIATAASYPV 344
             GLCGI   ASYP 
Sbjct: 339 GEGLCGINKMASYPT 353


>gi|30141027|dbj|BAC75927.1| cysteine protease-5 [Helianthus annuus]
          Length = 365

 Score =  328 bits (840), Expect = 3e-87,   Method: Compositional matrix adjust.
 Identities = 157/326 (48%), Positives = 217/326 (66%), Gaps = 12/326 (3%)

Query: 28  VSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLG 87
           V+  +  +  +   +E W+A+HG+TY    EK  R  IF  NL++I++ N  GNR+YK+G
Sbjct: 22  VTSNTRTDEEVRNTYELWLARHGKTYNALGEKESRFRIFADNLKFIDEHNLSGNRSYKVG 81

Query: 88  TNEFSDLTNEEFRALYTGYN-RPVPSVSRQSSRPSTFKY--QNVTDVPTSIDWREKGAVT 144
            N+F+DLTNEE+R++Y G    P   +++      + +Y  Q     P  +DWRE+GAV+
Sbjct: 82  LNQFADLTNEEYRSMYLGTKVDPYRRIAKMQRGEISRRYAVQENEMFPAKVDWRERGAVS 141

Query: 145 HIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFE 203
            +K+QG CGSCWAFS VA+VEGI +I  G LI LSEQ+LVDC    N GC+GG MD AF+
Sbjct: 142 PVKNQGGCGSCWAFSTVASVEGINKIVTGDLISLSEQELVDCDNKYNSGCNGGSMDYAFQ 201

Query: 204 YIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSV 263
           +I+ N G+ +E+DYPY+     CD  + KA   +I  YED+P  +E+AL++AV++QPVSV
Sbjct: 202 FIVSNGGIDSESDYPYKGVGAVCDPVRNKAKIVSIDGYEDVPPMNEKALMKAVAHQPVSV 261

Query: 264 CVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESG 323
            ++ASGRAF  Y SGVL   CG N DHGV VVG+G+   ENG  YW+++NSWG  WGE G
Sbjct: 262 GIEASGRAFQLYTSGVLTGSCGTNLDHGVVVVGYGS---ENGKDYWIVRNSWGPEWGEDG 318

Query: 324 YIRILRD-----AGLCGIATAASYPV 344
           YIR+ R+      G+CGI   ASYP+
Sbjct: 319 YIRMERNMVDTPVGMCGITLMASYPI 344


>gi|13897890|gb|AAK48495.1|AF259983_1 putative cysteine protease [Ipomoea batatas]
          Length = 462

 Score =  328 bits (840), Expect = 3e-87,   Method: Compositional matrix adjust.
 Identities = 161/321 (50%), Positives = 206/321 (64%), Gaps = 9/321 (2%)

Query: 30  GRSMHEPSIVEKHEQWMAQHGRTYKD-ELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGT 88
           G S  +  ++  +E W+ +HG++Y     EK  R  IFK NL YI++ N  G+R+YKLG 
Sbjct: 37  GLSRSDEEVMALYESWLVEHGKSYNGLGGEKDKRFEIFKDNLRYIDEQNSRGDRSYKLGL 96

Query: 89  NEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKD 148
           N F+DLTNEE+R+ Y G          ++     +  +    +P SIDWREKGAV  +KD
Sbjct: 97  NRFADLTNEEYRSTYLGAKTDARRRIAKTKSDRRYAPKAGGSLPDSIDWREKGAVAEVKD 156

Query: 149 QGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIE 207
           QG CGSCWAFS +AAVEGI QI  G+LI LSEQ+LVDC T  N GC+GGLMD AFE+II+
Sbjct: 157 QGSCGSCWAFSTIAAVEGINQIVTGELISLSEQELVDCDTSYNEGCNGGLMDYAFEFIIK 216

Query: 208 NKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDA 267
           N G+ TEADYPY    G CD  ++ A   +I  YED+   DE AL +AV+ QPVSV ++A
Sbjct: 217 NGGIDTEADYPYTGRYGRCDQTRKNAKVVSIDGYEDVTPYDEAALKEAVAGQPVSVAIEA 276

Query: 268 SGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRI 327
            GR F  Y SG+    CG + DHGV  VG+GT   ENG  YW++KNSW  +WGE GY+R+
Sbjct: 277 GGRDFQLYSSGIFTGSCGTDLDHGVTAVGYGT---ENGVDYWIVKNSWAASWGEKGYLRM 333

Query: 328 LRDA----GLCGIATAASYPV 344
            R+     GLCGIA   SYP 
Sbjct: 334 QRNVKDKNGLCGIAIEPSYPT 354


>gi|356553978|ref|XP_003545327.1| PREDICTED: cysteine proteinase RD21a-like [Glycine max]
          Length = 496

 Score =  328 bits (840), Expect = 3e-87,   Method: Compositional matrix adjust.
 Identities = 166/348 (47%), Positives = 218/348 (62%), Gaps = 18/348 (5%)

Query: 10  IIPMFVIIILVITCASQVVSGRSMH------EPSIVEKHEQWMAQHGRTYKDELEKAMRL 63
           I+ +F +  +       ++S  + H      +  ++  +EQW+ +HG+ Y    EK  R 
Sbjct: 41  ILLLFTVFAVSSALDMSIISYDNAHAATSRSDEELMSMYEQWLVKHGKVYNALGEKEKRF 100

Query: 64  NIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTF 123
            IFK NL +I+  N + +RTYKLG N F+DLTNEE+RA Y G    +    R    PS  
Sbjct: 101 QIFKDNLRFIDDHNSQEDRTYKLGLNRFADLTNEEYRAKYLGTK--IDPNRRLGKTPSNR 158

Query: 124 KYQNVTD-VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQ 182
               V D +P S+DWR++GAV  +KDQG CGSCWAFSA+ AVEGI +I  G+LI LSEQ+
Sbjct: 159 YAPRVGDKLPESVDWRKEGAVPPVKDQGGCGSCWAFSAIGAVEGINKIVTGELISLSEQE 218

Query: 183 LVDCSTD-NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKY 241
           LVDC T  N GC+GGLMD AFE+II N G+ +E DYPYR  +G CD  ++ A   +I  Y
Sbjct: 219 LVDCDTGYNEGCNGGLMDYAFEFIINNGGIDSEEDYPYRGVDGRCDTYRKNAKVVSIDDY 278

Query: 242 EDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAE 301
           ED+P  DE AL +AV+NQPVSV ++  GR F  Y SGV    CG   DHGV  VG+GTA 
Sbjct: 279 EDVPAYDELALKKAVANQPVSVAIEGGGREFQLYVSGVFTGRCGTALDHGVVAVGYGTA- 337

Query: 302 EENGAKYWLIKNSWGETWGESGYIRILRD-----AGLCGIATAASYPV 344
             NG  YW+++NSWG +WGE GYIR+ R+     +G CGIA   SYP+
Sbjct: 338 --NGHDYWIVRNSWGPSWGEDGYIRLERNLANSRSGKCGIAIEPSYPL 383


>gi|34223513|gb|AAQ62999.1| oil palm polygalacturonase allergen PEST472 [Elaeis guineensis]
          Length = 525

 Score =  328 bits (840), Expect = 3e-87,   Method: Compositional matrix adjust.
 Identities = 161/325 (49%), Positives = 209/325 (64%), Gaps = 12/325 (3%)

Query: 28  VSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKAN---KEGNRTY 84
           V G    E  +   +E W+A+HGR      EK  R  IFK N+ +I+  N     G+R++
Sbjct: 36  VQGLERSEEEMRLLYEGWLAKHGRADNALGEKERRFEIFKDNVRFIDAHNAAADSGHRSF 95

Query: 85  KLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVT 144
           +LG N F+D+TNEE+R +Y G  RP     R       ++Y    ++P S+DWR+KGAVT
Sbjct: 96  RLGLNRFADMTNEEYRTVYLG-TRPASHRRRARLGSDRYRYNAGEELPESVDWRDKGAVT 154

Query: 145 HIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST-DNHGCSGGLMDKAFE 203
            +KDQG CGSCWAFS +AAVEGI +I  G LI LSEQ+LVDC    N GC+GGLMD AFE
Sbjct: 155 TVKDQGSCGSCWAFSTIAAVEGINKIVTGDLISLSEQELVDCDNGQNQGCNGGLMDYAFE 214

Query: 204 YIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSV 263
           +II N G+ TE DYPY+  +G CD  ++ A   +I  YED+P  DE+AL +AV+NQPVSV
Sbjct: 215 FIINNGGIDTEEDYPYKARDGKCDQYRKNAKVVSIDGYEDVPVNDEKALQKAVANQPVSV 274

Query: 264 CVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESG 323
            ++A GR F  Y SG+    CG + DHGV  VG+GT   ENG  YW+++NSWG  WGESG
Sbjct: 275 AIEAGGREFQLYHSGIFTGRCGTDLDHGVVAVGYGT---ENGKDYWIVRNSWGGDWGESG 331

Query: 324 YIRILRD----AGLCGIATAASYPV 344
           YIR+ R+     G CGIA  +SYP 
Sbjct: 332 YIRMERNVNASTGKCGIAMESSYPT 356


>gi|356517188|ref|XP_003527271.1| PREDICTED: xylem cysteine proteinase 2-like [Glycine max]
          Length = 350

 Score =  327 bits (839), Expect = 3e-87,   Method: Compositional matrix adjust.
 Identities = 162/312 (51%), Positives = 214/312 (68%), Gaps = 13/312 (4%)

Query: 38  IVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNE 97
           ++E  E WM++HG+ Y++  EK +R  IFK NL++I++ NK  +  Y LG NEF+DL++ 
Sbjct: 44  LIELFESWMSRHGKIYENIEEKLLRFEIFKDNLKHIDERNKVVS-NYWLGLNEFADLSHR 102

Query: 98  EFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWA 157
           EF   Y G        SR+   P  F Y++V ++P S+DWR+KGAV  +K+QG CGSCWA
Sbjct: 103 EFNNKYLGLK---VDYSRRRESPEEFTYKDV-ELPKSVDWRKKGAVAPVKNQGSCGSCWA 158

Query: 158 FSAVAAVEGITQITRGKLIELSEQQLVDCS-TDNHGCSGGLMDKAFEYIIENKGLATEAD 216
           FS VAAVEGI QI  G L  LSEQ+L+DC  T N+GC+GGLMD AF +I+EN GL  E D
Sbjct: 159 FSTVAAVEGINQIVTGNLTSLSEQELIDCDRTYNNGCNGGLMDYAFSFIVENGGLHKEED 218

Query: 217 YPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYK 276
           YPY  EEGTC+  KE+    TIS Y D+P+ +EQ+LL+A++NQP+SV ++ASGR F FY 
Sbjct: 219 YPYIMEEGTCEMTKEETQVVTISGYHDVPQNNEQSLLKALANQPLSVAIEASGRDFQFYS 278

Query: 277 SGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRDA----G 332
            GV +  CG++ DHGVA VG+GTA+   G  Y  +KNSWG  WGE GYIR+ R+     G
Sbjct: 279 GGVFDGHCGSDLDHGVAAVGYGTAK---GVDYITVKNSWGSKWGEKGYIRMRRNIGKPEG 335

Query: 333 LCGIATAASYPV 344
           +CGI   ASYP 
Sbjct: 336 ICGIYKMASYPT 347


>gi|18396939|ref|NP_564320.1| Papain family cysteine protease [Arabidopsis thaliana]
 gi|9502427|gb|AAF88126.1|AC021043_19 Putative cysteine proteinase [Arabidopsis thaliana]
 gi|67633400|gb|AAY78625.1| peptidase C1A papain family protein [Arabidopsis thaliana]
 gi|332192919|gb|AEE31040.1| Papain family cysteine protease [Arabidopsis thaliana]
          Length = 346

 Score =  327 bits (839), Expect = 4e-87,   Method: Compositional matrix adjust.
 Identities = 172/347 (49%), Positives = 233/347 (67%), Gaps = 21/347 (6%)

Query: 14  FVIIILVITCA----SQVVSGRSMHEPS-IVEKHEQWMAQHGRTYKDELEKAMRLNIFKQ 68
           FV ++L I       S+  S  ++++PS IV+ H+QWM Q  R Y DE EK +RL +  +
Sbjct: 6   FVCVVLTIFFMDLKISEATSRVALYKPSSIVDYHQQWMIQFSRVYDDEFEKQLRLQVLTE 65

Query: 69  NLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGY---NRPVPSVSRQSSRPSTFKY 125
           NL++IE  N  GN++YKLG NEF+D T EEF A YTG    N   P      ++P+    
Sbjct: 66  NLKFIESFNNMGNQSYKLGVNEFTDWTKEEFLATYTGLRGVNVTSPFEVVNETKPAW--N 123

Query: 126 QNVTDV-PTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLV 184
             V+DV  T+ DWR +GAVT +K QG+CG CWAFSA+AAVEG+T+I RG LI LSEQQL+
Sbjct: 124 WTVSDVLGTNKDWRNEGAVTPVKSQGECGGCWAFSAIAAVEGLTKIARGNLISLSEQQLL 183

Query: 185 DCSTD-NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYED 243
           DC+ + N+GC GG    AF YII+++G+++E +YPY+ +EG C +    A+   I  +E+
Sbjct: 184 DCTREQNNGCKGGTFVNAFNYIIKHRGISSENEYPYQVKEGPCRSNARPAI--LIRGFEN 241

Query: 244 LPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNA-DCGNNCDHGVAVVGFGTAEE 302
           +P  +E+ALL+AVS QPV+V +DAS   F  Y  GV NA +CG + +H V +VG+GT+ E
Sbjct: 242 VPSNNERALLEAVSRQPVAVAIDASEAGFVHYSGGVYNARNCGTSVNHAVTLVGYGTSPE 301

Query: 303 ENGAKYWLIKNSWGETWGESGYIRILRDA----GLCGIATAASYPVA 345
             G KYWL KNSWG+TWGE+GYIRI RD     G+CG+A  ASYPVA
Sbjct: 302 --GMKYWLAKNSWGKTWGENGYIRIRRDVEWPQGMCGVAQYASYPVA 346


>gi|358248896|ref|NP_001239703.1| uncharacterized protein LOC100799247 precursor [Glycine max]
 gi|255636729|gb|ACU18700.1| unknown [Glycine max]
          Length = 341

 Score =  327 bits (839), Expect = 4e-87,   Method: Compositional matrix adjust.
 Identities = 157/313 (50%), Positives = 216/313 (69%), Gaps = 11/313 (3%)

Query: 40  EKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEF 99
           E+HE+WMAQ+G+ YKD  EK  R  +FK N+++IE  N  G++ + L  N+F+DL +EEF
Sbjct: 33  ERHEKWMAQYGKVYKDAAEKEKRFQVFKNNVQFIESFNAAGDKPFNLSINQFADLHDEEF 92

Query: 100 RALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQG-QCGSCWAF 158
           +AL     +    V  +++  ++F+Y+NVT +P+++DWR++GAVT IKDQG  CGSCWAF
Sbjct: 93  KALLNNVQKKASRV--ETATETSFRYENVTKIPSTMDWRKRGAVTPIKDQGYTCGSCWAF 150

Query: 159 SAVAAVEGITQITRGKLIELSEQQLVDC-STDNHGCSGGLMDKAFEYIIENKGLATEADY 217
           + VA VE + QIT G+L+ LSEQ+LVDC   D+ GC GG ++ AFE+I    G+ +EA Y
Sbjct: 151 ATVATVESLHQITTGELVSLSEQELVDCVRGDSEGCRGGYVENAFEFIANKGGITSEAYY 210

Query: 218 PYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKS 277
           PY+ ++ +C  +KE    A I  YE +P   E+ALL+AV+NQPVSV +DA   AF FY S
Sbjct: 211 PYKGKDRSCKVKKETHGVARIIGYESVPSNSEKALLKAVANQPVSVYIDAGAIAFKFYSS 270

Query: 278 GVLNA-DCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD----AG 332
           G+  A +CG + DH VAVVG+G   +  G KYWL+KNSW   WGE GY+RI RD     G
Sbjct: 271 GIFEARNCGTHLDHAVAVVGYGKLRD--GTKYWLVKNSWSTAWGEKGYMRIKRDIRAKKG 328

Query: 333 LCGIATAASYPVA 345
           LCGIA+ ASYP+A
Sbjct: 329 LCGIASNASYPIA 341


>gi|109390302|gb|ABG33750.1| cysteine protease [Hevea brasiliensis]
          Length = 457

 Score =  327 bits (839), Expect = 4e-87,   Method: Compositional matrix adjust.
 Identities = 165/351 (47%), Positives = 225/351 (64%), Gaps = 26/351 (7%)

Query: 13  MFVIIILVITCAS----QVVSGRSMH--------EPSIVEKHEQWMAQHGRTYKDELEKA 60
           MF+++    T +S     ++S    H        +  ++  +E W+ +HG+ Y    EK 
Sbjct: 1   MFMLLFFASTLSSASDLSIISYDQSHGTKSSWRTDDEVMAIYEDWLVKHGKAYNSLGEKE 60

Query: 61  MRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRP 120
            R  +FK NL +I++ N E NRTY++G N F+DLTNEE+R++Y G    +  + R   R 
Sbjct: 61  RRFEVFKDNLRFIDEHNSE-NRTYRVGLNRFADLTNEEYRSMYLG---ALSGIRRNKLRK 116

Query: 121 STFKYQ-NVTD-VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIEL 178
            + +Y   V D +P S+DWR++GAV  +KDQG CGSCWAFSAVAAVEGI +I  G LI L
Sbjct: 117 ISDRYTPRVGDSLPDSVDWRKEGAVVGVKDQGSCGSCWAFSAVAAVEGINKIVTGDLISL 176

Query: 179 SEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAAT 237
           SEQ+LVDC    N GC+GGLMD  FE+II N G+ +E DYPY   +G CD  ++ A   +
Sbjct: 177 SEQELVDCDNSYNEGCNGGLMDYGFEFIINNGGIDSEEDYPYLARDGRCDTYRKNARVVS 236

Query: 238 ISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGF 297
           I  YED+P  +E AL +AV+NQPVSV ++A GR F  Y SGV +  CG   DHGV  VG+
Sbjct: 237 IDSYEDVPVNNEAALQKAVANQPVSVAIEAGGRDFQLYSSGVFSGRCGTALDHGVVAVGY 296

Query: 298 GTAEEENGAKYWLIKNSWGETWGESGYIRILRD----AGLCGIATAASYPV 344
           GT   ENG  YW+++NSWG++WGESGY+R+ R+     G+CGIA  ASYP+
Sbjct: 297 GT---ENGQDYWIVRNSWGKSWGESGYLRMARNIRKPTGICGIAMEASYPI 344


>gi|28192373|gb|AAK07730.1| CPR1-like cysteine proteinase [Nicotiana tabacum]
          Length = 374

 Score =  327 bits (839), Expect = 4e-87,   Method: Compositional matrix adjust.
 Identities = 161/318 (50%), Positives = 212/318 (66%), Gaps = 13/318 (4%)

Query: 35  EPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDL 94
           E  +  ++E W+A+HGR Y    EK  R  IFK NL +IE+ N  GNRTYK+G N+F+DL
Sbjct: 43  EDQVKNRYEMWLAEHGRAYNALGEKEKRFEIFKDNLRFIEEHNNSGNRTYKVGLNQFADL 102

Query: 95  TNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTD--VPTSIDWREKGAVTHIKDQGQC 152
           TNEE+R +Y G          +S  PS  +Y +  +  +P S+DWR++GAV  IK+QG C
Sbjct: 103 TNEEYRTMYLGTKSDARRRFVKSKNPSQ-RYASRPNELMPHSVDWRKRGAVAPIKNQGSC 161

Query: 153 GSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCS-TDNHGCSGGLMDKAFEYIIENKGL 211
           GSCWAFS VAAV GI QI  G++I LSEQ+LVDC    N GC+GGLMD AFE+II N G+
Sbjct: 162 GSCWAFSTVAAVGGINQIVTGEMITLSEQELVDCDRVQNSGCNGGLMDYAFEFIISNGGM 221

Query: 212 ATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRA 271
            TE  YPYR  EG CD  ++     +I  YED+P+ +E+AL +AV++QPV V ++ASGRA
Sbjct: 222 DTEKHYPYRGVEGRCDPVRKNYKVVSIDGYEDVPR-NERALQKAVAHQPVCVAIEASGRA 280

Query: 272 FHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRDA 331
           F  Y SGV   +CG   DHGV VVG+G+   E+G  YW+++NSWG  WGE+GY+++ R+ 
Sbjct: 281 FQLYSSGVFTGECGEEVDHGVVVVGYGS---EDGVDYWIVRNSWGTKWGENGYVKMERNV 337

Query: 332 -----GLCGIATAASYPV 344
                G CGI T ASYP 
Sbjct: 338 KKSHLGKCGIMTEASYPT 355


>gi|148927382|gb|ABR19827.1| cysteine proteinase [Elaeis guineensis]
          Length = 470

 Score =  327 bits (839), Expect = 4e-87,   Method: Compositional matrix adjust.
 Identities = 164/322 (50%), Positives = 207/322 (64%), Gaps = 14/322 (4%)

Query: 31  RSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKAN---KEGNRTYKLG 87
           RS  E  I+  +E W+A+HGR Y    EK  R  IFK N+ +I+  N     G+R+++LG
Sbjct: 41  RSEEEMRIL--YEGWLAKHGRAYNALGEKERRFEIFKDNVLFIDAHNAAADAGHRSFRLG 98

Query: 88  TNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIK 147
            N F+D+TNEE+RA+Y G  RP     R       ++Y    D+P S+DWR KGAV  +K
Sbjct: 99  LNRFADMTNEEYRAVYLG-TRPAGHRRRARVGSDRYRYNAGEDLPESVDWRAKGAVAAVK 157

Query: 148 DQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYII 206
           DQG CGSCWAFS VAAVEGI +I  G LI LSEQ+LVDC    N GC+GGLMD  FE+II
Sbjct: 158 DQGSCGSCWAFSTVAAVEGINKIVTGDLISLSEQELVDCDNGYNQGCNGGLMDYGFEFII 217

Query: 207 ENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVD 266
            N G+ TE DYPY   +G CD  ++ A   +I  YED+P  DE+AL +AV+NQPVSV ++
Sbjct: 218 NNGGIDTEEDYPYTARDGKCDQYRKNAKVVSIDGYEDVPVNDEKALQKAVANQPVSVAIE 277

Query: 267 ASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIR 326
           A GR F  Y SG+    CG + DHGV  VG+GT   ENG  YW+++NSWG  WGESGYIR
Sbjct: 278 AGGREFQLYHSGIFTGRCGTDLDHGVVAVGYGT---ENGKDYWIVRNSWGGDWGESGYIR 334

Query: 327 ILRD----AGLCGIATAASYPV 344
           + R+     G CGIA   SYP 
Sbjct: 335 MERNVNTSTGKCGIAIEPSYPT 356


>gi|157093728|gb|ABV22590.1| KDEL-tailed cysteine endopeptidase [Solanum lycopersicum]
          Length = 360

 Score =  327 bits (839), Expect = 4e-87,   Method: Compositional matrix adjust.
 Identities = 166/349 (47%), Positives = 222/349 (63%), Gaps = 16/349 (4%)

Query: 5   FEKSFIIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLN 64
            +K F++   + ++L +  +          E    E +E+W + H  +   + EK  R N
Sbjct: 1   MKKLFLVLFTLALVLRLGESFDFHEKELETEEKFWELYERWRSHHTVSRSLD-EKHKRFN 59

Query: 65  IFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTG----YNRPVPSVSRQSSRP 120
           +FK N+ Y+   NK+ ++ YKL  N+F+D+TN EFR  Y G    ++R +   SR +   
Sbjct: 60  VFKANVHYVHNFNKK-DKPYKLKLNKFADMTNHEFRQHYAGSKIKHHRTLLGASRANG-- 116

Query: 121 STFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSE 180
            TF Y N  +VP SIDWR+KGAVT +KDQGQCGSCWAFS V AVEGI QI   KL+ LSE
Sbjct: 117 -TFMYANEDNVPPSIDWRKKGAVTPVKDQGQCGSCWAFSTVVAVEGINQIKTKKLVSLSE 175

Query: 181 QQLVDC-STDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATIS 239
           Q+LVDC +T+N GC+GGLMD AF++I +  G+ TE  YPY+ E+  CD QK      +I 
Sbjct: 176 QELVDCDTTENQGCNGGLMDPAFDFIKKRGGITTEERYPYKAEDDKCDIQKRNTPVVSID 235

Query: 240 KYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGT 299
            +ED+P  DE ALL+AV+NQP+SV +DASG  F FY  GV   +CG   DHGVA+VG+GT
Sbjct: 236 GHEDVPPNDEDALLKAVANQPISVAIDASGSQFQFYSEGVFTGECGTELDHGVAIVGYGT 295

Query: 300 AEEENGAKYWLIKNSWGETWGESGYIRILR----DAGLCGIATAASYPV 344
             +  G KYW++KNSWG  WGE GYIR+ R    + GLCGIA   SYP+
Sbjct: 296 TVD--GTKYWIVKNSWGAGWGEKGYIRMQRKVDAEEGLCGIAMQPSYPI 342


>gi|18394919|ref|NP_564126.1| Xylem cysteine proteinase 2 [Arabidopsis thaliana]
 gi|71153409|sp|Q9LM66.2|XCP2_ARATH RecName: Full=Xylem cysteine proteinase 2; Short=AtXCP2; Flags:
           Precursor
 gi|4836904|gb|AAD30607.1|AC007369_17 Putative cysteine proteinase [Arabidopsis thaliana]
 gi|6708183|gb|AAF25832.1|AF191028_1 papain-type cysteine endopeptidase XCP2 [Arabidopsis thaliana]
 gi|28466959|gb|AAO44088.1| At1g20850 [Arabidopsis thaliana]
 gi|110743795|dbj|BAE99733.1| putative cysteine proteinase [Arabidopsis thaliana]
 gi|332191910|gb|AEE30031.1| Xylem cysteine proteinase 2 [Arabidopsis thaliana]
          Length = 356

 Score =  327 bits (838), Expect = 5e-87,   Method: Compositional matrix adjust.
 Identities = 157/313 (50%), Positives = 220/313 (70%), Gaps = 12/313 (3%)

Query: 38  IVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNE 97
           ++E  E W++   + Y+   EK +R  +FK NL++I++ NK+G ++Y LG NEF+DL++E
Sbjct: 47  LIELFENWISNFEKAYETVEEKFLRFEVFKDNLKHIDETNKKG-KSYWLGLNEFADLSHE 105

Query: 98  EFRALYTGYNRPVPSVSRQSSRP-STFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCW 156
           EF+ +Y G    +  V R   R  + F Y++V  VP S+DWR+KGAV  +K+QG CGSCW
Sbjct: 106 EFKKMYLGLKTDI--VRRDEERSYAEFAYRDVEAVPKSVDWRKKGAVAEVKNQGSCGSCW 163

Query: 157 AFSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGLATEA 215
           AFS VAAVEGI +I  G L  LSEQ+L+DC T  N+GC+GGLMD AFEYI++N GL  E 
Sbjct: 164 AFSTVAAVEGINKIVTGNLTTLSEQELIDCDTTYNNGCNGGLMDYAFEYIVKNGGLRKEE 223

Query: 216 DYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFY 275
           DYPY  EEGTC+ QK+++   TI+ ++D+P  DE++LL+A+++QP+SV +DASGR F FY
Sbjct: 224 DYPYSMEEGTCEMQKDESETVTINGHQDVPTNDEKSLLKALAHQPLSVAIDASGREFQFY 283

Query: 276 KSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRDA---- 331
             GV +  CG + DHGVA VG+G+++   G+ Y ++KNSWG  WGE GYIR+ R+     
Sbjct: 284 SGGVFDGRCGVDLDHGVAAVGYGSSK---GSDYIIVKNSWGPKWGEKGYIRLKRNTGKPE 340

Query: 332 GLCGIATAASYPV 344
           GLCGI   AS+P 
Sbjct: 341 GLCGINKMASFPT 353


>gi|118486542|gb|ABK95110.1| unknown [Populus trichocarpa]
          Length = 465

 Score =  327 bits (838), Expect = 5e-87,   Method: Compositional matrix adjust.
 Identities = 164/318 (51%), Positives = 214/318 (67%), Gaps = 23/318 (7%)

Query: 38  IVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNE 97
           ++  +E+W+ +HG+ Y    EK  R  IFK NL +I++ N E NRTY +G N F+DLTNE
Sbjct: 47  VMAMYEEWLVKHGKNYNALGEKEKRFEIFKDNLMFIDQHNSE-NRTYTVGLNRFADLTNE 105

Query: 98  EFRALY----TGYNRPVPSVS-RQSSRPSTFKYQNVTD-VPTSIDWREKGAVTHIKDQGQ 151
           EFR++Y    TG+ + +P  S R + R        V D +P S+DWR++GAV  +KDQG 
Sbjct: 106 EFRSMYLGTRTGHKKRLPKTSDRYAPR--------VGDSLPDSVDWRKEGAVAEVKDQGG 157

Query: 152 CGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKG 210
           CGSCWAFS +AAVEGI +I  G LI LSEQ+LVDC T  N GC+GGLMD AFE+II N G
Sbjct: 158 CGSCWAFSTIAAVEGINKIVTGDLIALSEQELVDCDTSYNEGCNGGLMDYAFEFIINNGG 217

Query: 211 LATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGR 270
           + TE DYPY   +G CD  ++ A   +I  YED+P+ DE AL +AV+NQPVSV ++  GR
Sbjct: 218 IDTEDDYPYLGRDGRCDTYRKNAKVVSIDSYEDVPENDETALKKAVANQPVSVAIEGGGR 277

Query: 271 AFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD 330
            F  Y SGV   +CG + DHGVA VG+GT   E G  YW+++NSWG++WGESGYIR+ R+
Sbjct: 278 NFQLYNSGVFTGECGTSLDHGVAAVGYGT---EKGKDYWIVRNSWGKSWGESGYIRMERN 334

Query: 331 ----AGLCGIATAASYPV 344
                G CGIA   SYP+
Sbjct: 335 IASPTGKCGIAIEPSYPI 352


>gi|356515044|ref|XP_003526211.1| PREDICTED: LOW QUALITY PROTEIN: thiol protease SEN102-like [Glycine
           max]
          Length = 337

 Score =  327 bits (838), Expect = 6e-87,   Method: Compositional matrix adjust.
 Identities = 166/340 (48%), Positives = 220/340 (64%), Gaps = 20/340 (5%)

Query: 11  IPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNL 70
           + +F+++ + I   SQV+S R +HE S+ E+HE W+A++G+ YK   EK     IFK+N+
Sbjct: 11  LALFLLLSIEI---SQVMS-RKLHETSLREEHENWIARYGQVYKVAAEKET-FQIFKENV 65

Query: 71  EYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTD 130
           E+IE  N   N+ YKLG N F+DLT EEF+    G  +        S  P  FKY+NVTD
Sbjct: 66  EFIESFNAAANKPYKLGVNLFADLTLEEFKDFRFGLKK----THEFSITP--FKYENVTD 119

Query: 131 VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD- 189
           +P ++DWREKGAVT IKDQGQCGSCWAFS VAA EGI QIT G L+ L EQ+LV C T  
Sbjct: 120 IPEALDWREKGAVTPIKDQGQCGSCWAFSTVAATEGIHQITTGNLVSLXEQELVSCDTKG 179

Query: 190 -NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGD 248
            + GC GG M+  FE+II+N G+ T+A+YPY+   GTC+     +  A I  YE +P   
Sbjct: 180 VDQGCEGGYMEDGFEFIIKNGGITTKANYPYKGVNGTCNTTIAASTVAQIKGYETVPSYS 239

Query: 249 EQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKY 308
           E+AL +AV+NQPVSV +DA+   F FY  G+   +CG + DHGV  VG+GT  E +   Y
Sbjct: 240 EEALQKAVANQPVSVSIDANNGHFMFYAGGIYTGECGTDLDHGVTAVGYGTTNETD---Y 296

Query: 309 WLIKNSWGETWGESGYIRILRDA----GLCGIATAASYPV 344
           W++KNSWG  W E G+IR+ R      GLCG+A  +SYP 
Sbjct: 297 WIVKNSWGTGWDEKGFIRMQRGITVKHGLCGVALDSSYPT 336


>gi|226495425|ref|NP_001148706.1| cysteine protease 1 precursor [Zea mays]
 gi|195621544|gb|ACG32602.1| cysteine protease 1 precursor [Zea mays]
          Length = 463

 Score =  327 bits (837), Expect = 6e-87,   Method: Compositional matrix adjust.
 Identities = 160/312 (51%), Positives = 206/312 (66%), Gaps = 16/312 (5%)

Query: 42  HEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKAN---KEGNRTYKLGTNEFSDLTNEE 98
           + +WMA HGRTY    E+  R  +F+ NL YI+  N     G  +++LG N F+DLTN+E
Sbjct: 41  YAEWMAAHGRTYNAVGEEERRYQVFRDNLRYIDAHNAAADAGVHSFRLGLNRFADLTNDE 100

Query: 99  FRALYTG-YNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWA 157
           +RA Y G   RP     R+    + +   +  D+P S+DWR KGAV  +KDQG CGSCWA
Sbjct: 101 YRATYLGARTRP----QRERKLGARYHAADNEDLPESVDWRAKGAVAEVKDQGSCGSCWA 156

Query: 158 FSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGLATEAD 216
           FS +AAVEGI QI  G LI LSEQ+LVDC T  N GC+GGLMD AFE+II N G+ TE D
Sbjct: 157 FSTIAAVEGINQIVTGDLISLSEQELVDCDTSYNQGCNGGLMDYAFEFIINNGGIDTEKD 216

Query: 217 YPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYK 276
           YPY+  +G CD  ++ A   TI  YED+P  DE++L +AV+NQPVSV ++A+G AF  Y 
Sbjct: 217 YPYKGTDGRCDVNRKNAKVVTIDSYEDVPANDEKSLQKAVANQPVSVAIEAAGTAFQLYS 276

Query: 277 SGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD----AG 332
           SG+    CG   DHGV  VG+GT   ENG  YW++KNSWG +WGESGY+R+ R+    +G
Sbjct: 277 SGIFTGSCGTALDHGVTAVGYGT---ENGKDYWIVKNSWGSSWGESGYVRMERNIKASSG 333

Query: 333 LCGIATAASYPV 344
            CGIA   SYP+
Sbjct: 334 KCGIAVEPSYPL 345


>gi|358348957|ref|XP_003638507.1| Cysteine proteinase [Medicago truncatula]
 gi|355504442|gb|AES85645.1| Cysteine proteinase [Medicago truncatula]
          Length = 362

 Score =  327 bits (837), Expect = 6e-87,   Method: Compositional matrix adjust.
 Identities = 161/345 (46%), Positives = 222/345 (64%), Gaps = 10/345 (2%)

Query: 6   EKSFIIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNI 65
           +K  +I + + ++LV++ +          + S+ + +E+W + H  + ++  EK  R N+
Sbjct: 4   KKLLLIVLSIALVLVVSESFDFHDKDVSSDESLWDLYERWRSHHTVS-RNLNEKQKRFNV 62

Query: 66  FKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPS-TFK 124
           FK N+ ++   NK  ++ YKL  N+F+D+TN EF+  Y G       + R + R S TF 
Sbjct: 63  FKSNVMHVHNTNKM-DKPYKLKLNKFADMTNHEFKTTYAGSKVNHHRMFRGTPRVSGTFM 121

Query: 125 YQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLV 184
           Y+N T  P S+DWR+KGAVT +KDQGQCGSCWAFS V AVEGI QI   +L+ LSEQ+L+
Sbjct: 122 YENFTKAPASVDWRKKGAVTDVKDQGQCGSCWAFSTVVAVEGINQIKTNRLVPLSEQELI 181

Query: 185 DCST-DNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYED 243
           DC   +N GC+GGLM+ AFEYI +  G+ TE+ YPY   +G+CD  KE   A +I  +E 
Sbjct: 182 DCDNQENQGCNGGLMEYAFEYIKQKGGITTESYYPYTANDGSCDATKENVPAVSIDGHET 241

Query: 244 LPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEE 303
           +P  DE ALL+AV+NQPVSV +DA G  F FY  GV   DCG   +HGVA+VG+GT  + 
Sbjct: 242 VPANDEDALLKAVANQPVSVAIDAGGSDFQFYSEGVFTGDCGKELNHGVAIVGYGTTVD- 300

Query: 304 NGAKYWLIKNSWGETWGESGYIRILRDA----GLCGIATAASYPV 344
            G  YW+++NSWG  WGE GYIR+ R+     GLCGIA  ASYPV
Sbjct: 301 -GTNYWIVRNSWGAEWGEQGYIRMKRNVSNKEGLCGIAMEASYPV 344


>gi|414585111|tpg|DAA35682.1| TPA: cysteine proteinase Mir3 [Zea mays]
          Length = 468

 Score =  327 bits (837), Expect = 6e-87,   Method: Compositional matrix adjust.
 Identities = 160/312 (51%), Positives = 206/312 (66%), Gaps = 16/312 (5%)

Query: 42  HEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKAN---KEGNRTYKLGTNEFSDLTNEE 98
           + +WMA HGRTY    E+  R  +F+ NL YI+  N     G  +++LG N F+DLTN+E
Sbjct: 46  YAEWMAAHGRTYNAVGEEERRYQVFRDNLRYIDAHNAAADAGVHSFRLGLNRFADLTNDE 105

Query: 99  FRALYTG-YNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWA 157
           +RA Y G   RP     R+    + +   +  D+P S+DWR KGAV  +KDQG CGSCWA
Sbjct: 106 YRATYLGARTRP----QRERKLGARYHAADNEDLPESVDWRAKGAVAEVKDQGSCGSCWA 161

Query: 158 FSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGLATEAD 216
           FS +AAVEGI QI  G LI LSEQ+LVDC T  N GC+GGLMD AFE+II N G+ TE D
Sbjct: 162 FSTIAAVEGINQIVTGDLISLSEQELVDCDTSYNQGCNGGLMDYAFEFIINNGGIDTEKD 221

Query: 217 YPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYK 276
           YPY+  +G CD  ++ A   TI  YED+P  DE++L +AV+NQPVSV ++A+G AF  Y 
Sbjct: 222 YPYKGTDGRCDVNRKNAKVVTIDSYEDVPANDEKSLQKAVANQPVSVAIEAAGTAFQLYS 281

Query: 277 SGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD----AG 332
           SG+    CG   DHGV  VG+GT   ENG  YW++KNSWG +WGESGY+R+ R+    +G
Sbjct: 282 SGIFTGSCGTALDHGVTAVGYGT---ENGKDYWIVKNSWGSSWGESGYVRMERNIKASSG 338

Query: 333 LCGIATAASYPV 344
            CGIA   SYP+
Sbjct: 339 KCGIAVEPSYPL 350


>gi|356564154|ref|XP_003550321.1| PREDICTED: cysteine proteinase RD21a-like [Glycine max]
          Length = 476

 Score =  326 bits (836), Expect = 8e-87,   Method: Compositional matrix adjust.
 Identities = 166/351 (47%), Positives = 217/351 (61%), Gaps = 21/351 (5%)

Query: 10  IIPMFVIIILVITCASQVVSGRSMH---------EPSIVEKHEQWMAQHGRTYKDELEKA 60
           I+ +F +  +       ++S  S H         E  ++  +EQW+ +HG+ Y    EK 
Sbjct: 18  IVLLFTVFAVSSALDMSIISYDSAHADKAATLRTEEELMSMYEQWLVKHGKVYNALGEKE 77

Query: 61  MRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRP 120
            R  IFK NL +I+  N   +RTYKLG N F+DLTNEE+RA Y G    +    R    P
Sbjct: 78  KRFQIFKDNLRFIDDHNSAEDRTYKLGLNRFADLTNEEYRAKYLGTK--IDPNRRLGKTP 135

Query: 121 STFKYQNVTD-VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELS 179
           S      V D +P S+DWR++GAV  +KDQG CGSCWAFSA+ AVEGI +I  G+LI LS
Sbjct: 136 SNRYAPRVGDKLPDSVDWRKEGAVPPVKDQGGCGSCWAFSAIGAVEGINKIVTGELISLS 195

Query: 180 EQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATI 238
           EQ+LVDC T  N GC+GGLMD AFE+II N G+ ++ DYPYR  +G CD  ++ A   +I
Sbjct: 196 EQELVDCDTGYNQGCNGGLMDYAFEFIINNGGIDSDEDYPYRGVDGRCDTYRKNAKVVSI 255

Query: 239 SKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFG 298
             YED+P  DE AL +AV+NQPVSV ++  GR F  Y SGV    CG   DHGV  VG+G
Sbjct: 256 DDYEDVPAYDELALKKAVANQPVSVAIEGGGREFQLYVSGVFTGRCGTALDHGVVAVGYG 315

Query: 299 TAEEENGAKYWLIKNSWGETWGESGYIRILRD-----AGLCGIATAASYPV 344
           TA+   G  YW+++NSWG +WGE GYIR+ R+     +G CGIA   SYP+
Sbjct: 316 TAK---GHDYWIVRNSWGSSWGEDGYIRLERNLANSRSGKCGIAIEPSYPL 363


>gi|255555337|ref|XP_002518705.1| cysteine protease, putative [Ricinus communis]
 gi|223542086|gb|EEF43630.1| cysteine protease, putative [Ricinus communis]
          Length = 471

 Score =  326 bits (836), Expect = 8e-87,   Method: Compositional matrix adjust.
 Identities = 159/320 (49%), Positives = 211/320 (65%), Gaps = 19/320 (5%)

Query: 35  EPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDL 94
           +  +   +E W+ +HG+ Y    EK  R  IFK NL +I++ N   +R+YK+G N F+DL
Sbjct: 44  DSQVRRMYEMWLVEHGKAYNALGEKEKRFEIFKDNLRFIDEHNSV-DRSYKVGLNRFADL 102

Query: 95  TNEEFRALYTGYNRPVPSVSRQS----SRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQG 150
           TNEE++A++ G       + R++    +R   + +++  D+P ++DWREKGAV  +KDQG
Sbjct: 103 TNEEYKAMFLG-----TKMERKNRFLGTRSQRYLFKDGDDLPENVDWREKGAVVPVKDQG 157

Query: 151 QCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENK 209
           QCGSCWAFS V AVEGI QI  G+LI LSEQ+LVDC    N GC+GGLMD AFE+II N 
Sbjct: 158 QCGSCWAFSTVGAVEGINQIVTGELISLSEQELVDCDKSYNQGCNGGLMDYAFEFIINNG 217

Query: 210 GLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASG 269
           G+ TE DYPY+  +  CD  ++ A   TI  YED+P+ DE +L +AV++QPVSV ++A G
Sbjct: 218 GIDTEEDYPYKASDNICDPNRKNAKVVTIDGYEDVPENDENSLKKAVAHQPVSVAIEAGG 277

Query: 270 RAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILR 329
           RAF  YKSGV    CG   DHGV  VG+GT   ENG  YW+++NSWG  WGESGYIR+ R
Sbjct: 278 RAFQLYKSGVFTGRCGTELDHGVVAVGYGT---ENGVNYWIVRNSWGSAWGESGYIRMER 334

Query: 330 D-----AGLCGIATAASYPV 344
           +      G CGIA   SYP 
Sbjct: 335 NVANTKTGKCGIAIQPSYPT 354


>gi|58531896|gb|AAW78660.1| cysteine protease [Nicotiana tabacum]
          Length = 361

 Score =  326 bits (836), Expect = 9e-87,   Method: Compositional matrix adjust.
 Identities = 163/314 (51%), Positives = 208/314 (66%), Gaps = 16/314 (5%)

Query: 40  EKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEF 99
           E +E+W + H  +   + EK  R N+FK N+ Y+   NK+ ++ YKL  N+F+D+TN EF
Sbjct: 36  ELYERWRSHHTVSRSLD-EKDKRFNVFKANVHYVHNFNKK-DKPYKLKLNKFADMTNHEF 93

Query: 100 RALYTG----YNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSC 155
           R  Y G    ++R     SR +    TF Y NV DVP S+DWR+KGAVT +KDQG+CGSC
Sbjct: 94  RHHYAGSKIKHHRSFLGASRANG---TFMYANVEDVPPSVDWRKKGAVTPVKDQGKCGSC 150

Query: 156 WAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGLATE 214
           WAFS V AVEGI QI   +L+ LSEQ+LVDC T  N GC+GGLMD AFE+I +  G+ TE
Sbjct: 151 WAFSTVVAVEGINQIKTNELVSLSEQELVDCDTSQNQGCNGGLMDMAFEFIKKKGGINTE 210

Query: 215 ADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHF 274
            +YPY  E G CD QK  +   +I  YED+P  DE +LL+AV+NQPVSV + ASG  F F
Sbjct: 211 ENYPYMAEGGECDIQKRNSPVVSIDGYEDVPPNDEDSLLKAVANQPVSVAIQASGSDFQF 270

Query: 275 YKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILR----D 330
           Y  GV   DCG   DHGVA+VG+GT  +  G KYW+++NSWG  WGE GYIR+ R    +
Sbjct: 271 YSEGVFTGDCGTELDHGVAIVGYGTTLD--GTKYWIVRNSWGPEWGEKGYIRMQREIDAE 328

Query: 331 AGLCGIATAASYPV 344
            GLCGIA   SYP+
Sbjct: 329 EGLCGIAMQPSYPI 342


>gi|125533982|gb|EAY80530.1| hypothetical protein OsI_35710 [Oryza sativa Indica Group]
          Length = 378

 Score =  326 bits (836), Expect = 9e-87,   Method: Compositional matrix adjust.
 Identities = 159/328 (48%), Positives = 212/328 (64%), Gaps = 20/328 (6%)

Query: 35  EPSIVEKHEQWMAQH--------GRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKL 86
           E S+   +E+W +++        G    D+ E   R N+F +N  YI +AN+ G R ++L
Sbjct: 35  EESLRALYERWRSRYTVSRPAASGGVGNDDGEARRRFNVFVENARYIHEANRRGGRPFRL 94

Query: 87  GTNEFSDLTNEEFRALYTG----YNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGA 142
             N+F+D+T +EFR  Y G    ++R +            +   +  ++P ++DWRE+GA
Sbjct: 95  ALNKFADMTTDEFRRTYAGSRARHHRSLRGGRGGEGGSFRYGGDDEDNLPPAVDWRERGA 154

Query: 143 VTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST-DNHGCSGGLMDKA 201
           VT IKDQGQCGSCWAFSAVAAVEG+ +I  G+L+ LSEQ+LVDC T DN GC GGLMD A
Sbjct: 155 VTGIKDQGQCGSCWAFSAVAAVEGVNKIKTGRLVTLSEQELVDCDTGDNQGCDGGLMDYA 214

Query: 202 FEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPV 261
           F++I  N G+ TE++YPYR E+G C+  K  +   TI  YED+P  DE AL +AV+NQPV
Sbjct: 215 FQFIKRNGGITTESNYPYRAEQGRCNKAKASSHDVTIDGYEDVPANDESALQKAVANQPV 274

Query: 262 SVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGE 321
           +V V+ASG+ F FY  GV   +CG + DHGVA VG+G   +  G KYW++KNSWGE WGE
Sbjct: 275 AVAVEASGQDFQFYSEGVFTGECGTDLDHGVAAVGYGITRD--GTKYWIVKNSWGEDWGE 332

Query: 322 SGYIRILRDA-----GLCGIATAASYPV 344
            GYIR+ R       GLCGIA  ASYPV
Sbjct: 333 RGYIRMQRGVSSDSNGLCGIAMEASYPV 360


>gi|242086591|ref|XP_002439128.1| hypothetical protein SORBIDRAFT_09g000960 [Sorghum bicolor]
 gi|241944413|gb|EES17558.1| hypothetical protein SORBIDRAFT_09g000960 [Sorghum bicolor]
          Length = 371

 Score =  326 bits (836), Expect = 1e-86,   Method: Compositional matrix adjust.
 Identities = 165/320 (51%), Positives = 217/320 (67%), Gaps = 16/320 (5%)

Query: 33  MHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFS 92
           +H   +++  E+W+A++ + Y    EK  R  +FK NL +I++ANK+   TY LG N F+
Sbjct: 57  VHHDRLIKLFEEWVAKYRKAYASFEEKLHRFEVFKDNLHHIDEANKKVT-TYWLGLNAFA 115

Query: 93  DLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTD--VPTSIDWREKGAVTHIKDQG 150
           DLT++EF+A Y G  +P      + +  S F+Y  V D  VP S+DWR+KGAVT +K+QG
Sbjct: 116 DLTHDEFKATYLGLRQP----ETKKTTDSRFRYGGVADDDVPASVDWRKKGAVTDVKNQG 171

Query: 151 QCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENK 209
           QCGSCWAFS VAAVEGI QI  G L  LSEQ+LVDCSTD N+GC+GG+MD AF YI  + 
Sbjct: 172 QCGSCWAFSTVAAVEGINQIVTGNLTSLSEQELVDCSTDGNNGCNGGVMDNAFSYIASSG 231

Query: 210 GLATEADYPYRHEEGTCDNQ-KEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDAS 268
           GL TE  YPY  EEG CD++ ++     TIS YED+P  DEQAL++A+++QP+SV ++AS
Sbjct: 232 GLRTEEAYPYLMEEGDCDDKARDGEQVVTISGYEDVPANDEQALVKALAHQPLSVAIEAS 291

Query: 269 GRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRIL 328
           GR F FY  GV N  CG+  DHGVA VG+G+++   G  Y ++KNSWG  WGE GYIR+ 
Sbjct: 292 GRHFQFYSGGVFNGPCGSELDHGVAAVGYGSSK---GQDYIIVKNSWGSHWGEKGYIRMK 348

Query: 329 RDA----GLCGIATAASYPV 344
           R      GLCGI   ASYP 
Sbjct: 349 RGTGKPEGLCGINKMASYPT 368


>gi|226502454|ref|NP_001140922.1| hypothetical protein [Zea mays]
 gi|223948637|gb|ACN28402.1| unknown [Zea mays]
 gi|413920877|gb|AFW60809.1| hypothetical protein ZEAMMB73_830238 [Zea mays]
          Length = 354

 Score =  326 bits (835), Expect = 1e-86,   Method: Compositional matrix adjust.
 Identities = 172/355 (48%), Positives = 234/355 (65%), Gaps = 30/355 (8%)

Query: 10  IIPMFVIIILVITCASQVVSGRSMH--------EPSIVEKHEQWMAQHGRTYKDELEKAM 61
           +I    + + ++   + +   R +         E ++  +H+QWMA+HGRTY+DE EKA 
Sbjct: 11  VIAFTAVALTILAVKTMMAEARDLSSTSTGGYGEEAMKVRHQQWMAEHGRTYRDEAEKAH 70

Query: 62  RLNIFKQNLEYIEKANKEGN--RTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSR 119
           R  +FK N ++++ +N  G+  ++Y++  NEF+D+TN+EF A+YTG  RPVP+ ++   +
Sbjct: 71  RFQVFKANADFVDASNAAGDDKKSYRMELNEFADMTNDEFMAMYTGL-RPVPAGAK---K 126

Query: 120 PSTFKYQNVT-----DVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGK 174
            + FKY NVT     D   ++DWR+KGAVT IK+QGQCG CWAF+AVAAVEGI QIT G 
Sbjct: 127 MAGFKYGNVTLSDADDNQQTVDWRQKGAVTGIKNQGQCGCCWAFAAVAAVEGIHQITTGN 186

Query: 175 LIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKA 233
           L+ LSEQQ++DC T+ N+GC+GG +D AF+YI  N GLATE  YPY   +  C  Q  + 
Sbjct: 187 LVSLSEQQVLDCDTEGNNGCNGGYIDNAFQYIAGNGGLATEDAYPYTAAQAMC--QSVQP 244

Query: 234 VAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLN-ADCGN--NCDH 290
           VAA IS Y+D+P GDE AL  AV+NQPVSV +DA    F  Y  GV+  A C    N +H
Sbjct: 245 VAA-ISGYQDVPSGDEAALAAAVANQPVSVAIDA--HNFQLYGGGVMTAASCSTPPNLNH 301

Query: 291 GVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRDAGLCGIATAASYPVA 345
            V  VG+GTAE+  G  YWL+KN WG+ WGE GY+R+ R A  CG+A  ASYPVA
Sbjct: 302 AVTAVGYGTAED--GTPYWLLKNQWGQNWGEGGYLRLERGANACGVAQQASYPVA 354


>gi|50355619|dbj|BAD29958.1| cysteine protease [Daucus carota]
          Length = 496

 Score =  326 bits (835), Expect = 1e-86,   Method: Compositional matrix adjust.
 Identities = 161/309 (52%), Positives = 205/309 (66%), Gaps = 13/309 (4%)

Query: 43  EQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRAL 102
           E W+  HG++Y    E+  R  IFK NL YI++ N   +R +KLG N+F+DLTNEE+R+ 
Sbjct: 46  ESWLVTHGKSYNALGEEEKRFQIFKNNLRYIDEQNLVEDRGFKLGLNKFADLTNEEYRSK 105

Query: 103 YTGYNRP--VPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSA 160
           YTG         VS +S R +T   +++   P S+DWRE GAV  +KDQG CGSCWAFS 
Sbjct: 106 YTGIKSKDLRKKVSAKSGRYATLSGESL---PESVDWRESGAVATVKDQGSCGSCWAFST 162

Query: 161 VAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGLATEADYPY 219
           ++AVEGI QI  GKLI LSEQ+LVDC    N GC+GGLMD AFE+II N G+ T+ DYPY
Sbjct: 163 ISAVEGINQIATGKLITLSEQELVDCDRSYNEGCNGGLMDYAFEFIINNGGIDTDVDYPY 222

Query: 220 RHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGV 279
              +G CD  ++ A   TI  YED+P  DE AL +A +NQP+SV ++ASGR F FY SG+
Sbjct: 223 TGRDGKCDQYRKNAKVVTIDSYEDVPAYDELALKKAAANQPISVAIEASGRDFQFYDSGI 282

Query: 280 LNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILR----DAGLCG 335
               CG   DHGV VVG+GT   ENG  YW+++NSWG  WGE+GY+R+ R      G+CG
Sbjct: 283 FTGKCGIALDHGVVVVGYGT---ENGKDYWIVRNSWGADWGENGYLRMERGISSKTGICG 339

Query: 336 IATAASYPV 344
           IA   SYPV
Sbjct: 340 IAIEPSYPV 348


>gi|2511689|emb|CAB17074.1| cysteine proteinase precursor [Phaseolus vulgaris]
          Length = 364

 Score =  325 bits (834), Expect = 2e-86,   Method: Compositional matrix adjust.
 Identities = 161/344 (46%), Positives = 218/344 (63%), Gaps = 12/344 (3%)

Query: 8   SFIIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFK 67
           S +IP  +++    + A+  +S  +  E  +++ +E+W+ +H + Y    EK  R  +FK
Sbjct: 3   SMLIPTLLLLSFTFSHAT-AMSIINYSENEVMDMYEEWLVKHRKVYNGLDEKEKRFQVFK 61

Query: 68  QNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPS-VSRQSSRPSTFKYQ 126
            NL +I+  N + N TY LG N+F+D+TNEE+RA+Y G        V +  +    + Y 
Sbjct: 62  DNLGFIQDHNAQ-NNTYTLGLNKFADITNEEYRAMYLGTRTDAKRRVMKTQNTGHRYAYN 120

Query: 127 NVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDC 186
           +   +P  +DWR KGAV  IKDQG CGSCWAFS VAAVEGI  I  G+ + LSEQ+LVDC
Sbjct: 121 SGDQLPVHVDWRLKGAVGPIKDQGNCGSCWAFSTVAAVEGINNIVTGEFVSLSEQELVDC 180

Query: 187 STD-NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLP 245
             + + GC+GGLMD AF++II+N G+ TE DYPY+  +GTCD  K+K     I  YED+P
Sbjct: 181 DREYDEGCNGGLMDYAFQFIIQNGGIDTEEDYPYQGIDGTCDQTKKKTKVVQIDGYEDVP 240

Query: 246 KGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENG 305
             +E AL +AVS+QPVSV ++ASGRA   Y+SGV    CG   DHGV VVG+GT   ENG
Sbjct: 241 SNNENALKKAVSHQPVSVAIEASGRALQLYQSGVFTGKCGTALDHGVVVVGYGT---ENG 297

Query: 306 AKYWLIKNSWGETWGESGYIRILRDA-----GLCGIATAASYPV 344
             YWL++NSWG  WGE GY ++ R+      G CGIA   SYPV
Sbjct: 298 VDYWLVRNSWGTGWGEDGYFKMERNVRSTSEGKCGIAMDCSYPV 341


>gi|30141019|dbj|BAC75923.1| cysteine protease-1 [Helianthus annuus]
          Length = 461

 Score =  325 bits (834), Expect = 2e-86,   Method: Compositional matrix adjust.
 Identities = 155/308 (50%), Positives = 203/308 (65%), Gaps = 9/308 (2%)

Query: 42  HEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRA 101
           +E W+ +HG+TY    EK  R  IFK NL +I++ N  G+ TYKLG N+F+DLTNEE+R 
Sbjct: 52  YESWLVKHGKTYNALGEKDRRFQIFKDNLRFIDEHN-SGDHTYKLGLNKFADLTNEEYRM 110

Query: 102 LYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAV 161
            YTG             +   + Y++   +P  +DWRE+GAVT +KDQG CGSCWAFS  
Sbjct: 111 TYTGIKTIDDKKKLSKMKSDRYAYRSGDSLPEYVDWREQGAVTDVKDQGSCGSCWAFSTT 170

Query: 162 AAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGLATEADYPYR 220
            +VEG+ +I  G LI +SEQ+LV+C T  N GC+GGLMD AFE+II+N G+ TE DYPY 
Sbjct: 171 GSVEGVNKIVTGDLISVSEQELVNCDTSYNQGCNGGLMDYAFEFIIKNGGIDTEEDYPYT 230

Query: 221 HEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVL 280
            ++G CD  K+ A   TI  YED+P  DE +L +AVSNQPV+V ++A GR F FY SG+ 
Sbjct: 231 GKDGKCDKNKKNAKVVTIDSYEDVPVNDESSLKKAVSNQPVAVAIEAGGRDFQFYTSGIF 290

Query: 281 NADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD----AGLCGI 336
              CG   DHGV   G+GT   E+G  YWL+KNSWG  WGE GY+++ R+    +G CGI
Sbjct: 291 TGSCGTALDHGVLAAGYGT---EDGKDYWLVKNSWGAEWGEGGYLKMERNIADKSGKCGI 347

Query: 337 ATAASYPV 344
           A  ASYP+
Sbjct: 348 AMEASYPI 355


>gi|449524070|ref|XP_004169046.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like, partial
           [Cucumis sativus]
          Length = 314

 Score =  325 bits (834), Expect = 2e-86,   Method: Compositional matrix adjust.
 Identities = 163/313 (52%), Positives = 215/313 (68%), Gaps = 20/313 (6%)

Query: 38  IVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNE 97
           I +++++WM ++GR YK   E   R  I++ N++YI+  N   N ++ L  N F+DLTNE
Sbjct: 15  IQDRYQKWMDKYGRQYKSREEWERRFTIYQANVQYIDNFNSM-NHSHTLAENNFADLTNE 73

Query: 98  EFRALYTGYNRPVPSVSRQSSRPST-FKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCW 156
           EF+A Y GY        +  S P T F+Y N+ ++PT++DWR++GAVT IK+QGQCGSCW
Sbjct: 74  EFKATYLGY--------KTVSIPDTCFRYGNMVNLPTNVDWRQEGAVTPIKNQGQCGSCW 125

Query: 157 AFSAVAAVEGITQITRGKLIELSEQQLVDC--STDNHGCSGGLMDKAFEYIIENKGLATE 214
           AFSAVAAVEGI +I  GKLI LSEQ+LVDC  ++ N GC+GG M KAFE+I +  GL TE
Sbjct: 126 AFSAVAAVEGINKIKAGKLISLSEQELVDCDVTSGNQGCNGGYMYKAFEFI-KRTGLTTE 184

Query: 215 ADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHF 274
            +YPY+  E  C+ QKEK    +IS YE +P  DE++L  AV+NQPVSV +DA G  F F
Sbjct: 185 IEYPYQGAESACNEQKEKYQFVSISGYEKVPVNDEKSLKAAVANQPVSVAIDAEGNNFQF 244

Query: 275 YKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRDA--- 331
           Y  G+ + +CGN  +HGVA+VG+G   E +   YWL+KNSWG  WGESGYIR+ RD+   
Sbjct: 245 YSGGIFSGNCGNQLNHGVAIVGYG---ETSNQAYWLVKNSWGTDWGESGYIRMKRDSTDK 301

Query: 332 -GLCGIATAASYP 343
            G CGIA  ASYP
Sbjct: 302 QGTCGIAMMASYP 314


>gi|194352754|emb|CAQ00105.1| papain-like cysteine proteinase [Hordeum vulgare subsp. vulgare]
 gi|326513690|dbj|BAJ87864.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326514532|dbj|BAJ96253.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 463

 Score =  325 bits (833), Expect = 2e-86,   Method: Compositional matrix adjust.
 Identities = 163/336 (48%), Positives = 214/336 (63%), Gaps = 14/336 (4%)

Query: 17  IILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKA 76
           + L       +VS     E  +   + +WMA+HG TY    E+  R   F+ NL YI++ 
Sbjct: 18  VSLAAAADMSIVSYGERSEEEVRRMYAEWMAEHGSTYNAIGEEERRFEAFRDNLRYIDQH 77

Query: 77  N---KEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPT 133
           N     G  +++LG N F+DLTNEE+R+ Y G  R  P   R+ S  + ++  +  ++P 
Sbjct: 78  NAAADAGVHSFRLGLNRFADLTNEEYRSTYLG-ARTKPDRERKLS--ARYQAADNDELPE 134

Query: 134 SIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHG 192
           S+DWR+KGAV  +KDQG CGSCWAFSA+AAVEGI QI  G +I LSEQ+LVDC T  N G
Sbjct: 135 SVDWRKKGAVGAVKDQGGCGSCWAFSAIAAVEGINQIVTGDMIPLSEQELVDCDTSYNQG 194

Query: 193 CSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQAL 252
           C+GGLMD AFE+II N G+ +E DYPY+  +  CD  K+ A   TI  YED+P   E++L
Sbjct: 195 CNGGLMDYAFEFIINNGGIDSEEDYPYKERDNRCDANKKNAKVVTIDGYEDVPVNSEKSL 254

Query: 253 LQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIK 312
            +AV+NQP+SV ++A GRAF  YKSG+    CG   DHGVA VG+GT   ENG  YWL++
Sbjct: 255 QKAVANQPISVAIEAGGRAFQLYKSGIFTGTCGTALDHGVAAVGYGT---ENGKDYWLVR 311

Query: 313 NSWGETWGESGYIRILRD----AGLCGIATAASYPV 344
           NSWG  WGE GYIR+ R+    +G CGIA   SYP 
Sbjct: 312 NSWGSVWGEDGYIRMERNIKASSGKCGIAVEPSYPT 347


>gi|449460678|ref|XP_004148072.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like [Cucumis
           sativus]
          Length = 317

 Score =  325 bits (833), Expect = 2e-86,   Method: Compositional matrix adjust.
 Identities = 163/314 (51%), Positives = 215/314 (68%), Gaps = 20/314 (6%)

Query: 38  IVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNE 97
           I +++++WM ++GR YK   E   R  I++ N++YI+  N   N ++ L  N F+DLTNE
Sbjct: 15  IQDRYQKWMDKYGRQYKSREEWERRFTIYQANVQYIDNFNSM-NHSHTLAENNFADLTNE 73

Query: 98  EFRALYTGYNRPVPSVSRQSSRPST-FKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCW 156
           EF+A Y GY        +  S P T F+Y N+ ++PT++DWR++GAVT IK+QGQCGSCW
Sbjct: 74  EFKATYLGY--------KTVSIPDTCFRYGNMVNLPTNVDWRQEGAVTPIKNQGQCGSCW 125

Query: 157 AFSAVAAVEGITQITRGKLIELSEQQLVDC--STDNHGCSGGLMDKAFEYIIENKGLATE 214
           AFSAVAAVEGI +I  GKLI LSEQ+LVDC  ++ N GC+GG M KAFE+I +  GL TE
Sbjct: 126 AFSAVAAVEGINKIKAGKLISLSEQELVDCDVTSGNQGCNGGYMYKAFEFI-KRTGLTTE 184

Query: 215 ADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHF 274
            +YPY+  E  C+ QKEK    +IS YE +P  DE++L  AV+NQPVSV +DA G  F F
Sbjct: 185 IEYPYQGAESACNEQKEKYQFVSISGYEKVPVNDEKSLKAAVANQPVSVAIDAEGNNFQF 244

Query: 275 YKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRDA--- 331
           Y  G+ + +CGN  +HGVA+VG+G   E +   YWL+KNSWG  WGESGYIR+ RD+   
Sbjct: 245 YSGGIFSGNCGNQLNHGVAIVGYG---ETSNQAYWLVKNSWGTDWGESGYIRMKRDSTDR 301

Query: 332 -GLCGIATAASYPV 344
            G CGIA  ASYP 
Sbjct: 302 QGTCGIAMMASYPT 315


>gi|358343350|ref|XP_003635767.1| Cysteine proteinase [Medicago truncatula]
 gi|355501702|gb|AES82905.1| Cysteine proteinase [Medicago truncatula]
          Length = 338

 Score =  325 bits (833), Expect = 2e-86,   Method: Compositional matrix adjust.
 Identities = 159/344 (46%), Positives = 226/344 (65%), Gaps = 18/344 (5%)

Query: 8   SFIIPMFVIIILVITCASQVVSGRSMHEPSIVEK-HEQWMAQHGRTYKDELEKAMRLNIF 66
           +  + + ++ + +I  A   +  ++   P++++K +E W+ ++GR Y+D  E  +R +I+
Sbjct: 4   TITLSIVILNLWIIASACPEIHTKNSTNPAVMKKRYETWLKRYGRHYRDREEWEVRFDIY 63

Query: 67  KQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQ 126
           + N++YIE  N + N +YKL  N F+D+TNEEF++ Y GY   +P    Q+     F+Y 
Sbjct: 64  QSNVQYIEFYNSQ-NYSYKLIDNRFADITNEEFKSTYLGY---LPRFRVQTE----FRYH 115

Query: 127 NVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDC 186
              ++P SIDWR+KGAVTH+KDQG+CGSCWAFSAVAAVEGI +I    L+ LSEQQL+DC
Sbjct: 116 KHGELPKSIDWRKKGAVTHVKDQGRCGSCWAFSAVAAVEGINKIKTENLVSLSEQQLIDC 175

Query: 187 S--TDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDL 244
              + N GC GG M  AF YI ++ G+AT  +YPY+  +G C+  K K  A TIS YE +
Sbjct: 176 DIKSGNEGCEGGDMYIAFNYIKKHGGIATAKEYPYKGRDGNCNKSKAKNNAVTISGYESV 235

Query: 245 PKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEEN 304
           P  +E+ L  AV++QPVS+  DA G AF FY  G+ +  CG N +HG+ +VG+G   EEN
Sbjct: 236 PARNEKMLKAAVAHQPVSIATDAGGYAFQFYSKGIFSGSCGKNLNHGMTIVGYG---EEN 292

Query: 305 GAKYWLIKNSWGETWGESGYIRILRDA----GLCGIATAASYPV 344
           G KYW++KNSW   WGESGY+R+ RD     G CGIA  A+YPV
Sbjct: 293 GDKYWIVKNSWANDWGESGYVRMKRDTKDKDGTCGIAMDATYPV 336


>gi|302764466|ref|XP_002965654.1| hypothetical protein SELMODRAFT_230713 [Selaginella moellendorffii]
 gi|300166468|gb|EFJ33074.1| hypothetical protein SELMODRAFT_230713 [Selaginella moellendorffii]
          Length = 345

 Score =  325 bits (833), Expect = 2e-86,   Method: Compositional matrix adjust.
 Identities = 158/313 (50%), Positives = 207/313 (66%), Gaps = 20/313 (6%)

Query: 42  HEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRA 101
           +++W+ +HG+ Y    E   R  IFK+N+ YI   N   N ++ LG N+F+DLTN EFR 
Sbjct: 38  YQKWIQEHGKAYNSAHEYKKRFQIFKENVNYINSHNARRNNSHSLGLNKFADLTNSEFRG 97

Query: 102 LYTGYNRPVPSVSRQSSRPSTFKYQN----VTDVPTSIDWREKGAVTHIKDQGQCGSCWA 157
           LY G          +  RP+ F        V D  TS+DWR+KG VT IKDQG CGSCWA
Sbjct: 98  LYVG----------RLQRPAPFHEVGDIALVADTATSVDWRKKGGVTEIKDQGDCGSCWA 147

Query: 158 FSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGLATEAD 216
           FSAVAAVEG+T ++ G L+ LSEQ+LVDC T  N GC GG+MD AF+Y+I N G+ ++++
Sbjct: 148 FSAVAAVEGLTFLSTGTLVSLSEQELVDCDTTVNQGCDGGIMDYAFQYMIRNGGITSQSN 207

Query: 217 YPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYK 276
           YPYR   G CD  K K  AATI+ ++ +P   E+ LL+AV+NQPVSV ++A G+ F  Y 
Sbjct: 208 YPYRALRGACDKDKVKYHAATINGFQAIPPQSEELLLRAVANQPVSVAIEAGGQDFQLYS 267

Query: 277 SGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD---AGL 333
           SGV   +CG+N DHGVA+VG+GT  +  G +YWL+KNSWG  WGESGY+R+ R    AG+
Sbjct: 268 SGVFTGECGSNLDHGVAIVGYGT--DAGGRQYWLVKNSWGSGWGESGYVRMERQGPGAGV 325

Query: 334 CGIATAASYPVAI 346
           CGI   ASYP  I
Sbjct: 326 CGINLDASYPTKI 338


>gi|115484973|ref|NP_001067630.1| Os11g0255300 [Oryza sativa Japonica Group]
 gi|530335|emb|CAA56844.1| cysteine protease [Oryza sativa Japonica Group]
 gi|5761322|dbj|BAA83472.1| cysteine endopeptidase [Oryza sativa Japonica Group]
 gi|62732672|gb|AAX94791.1| cysteine proteinase (EC 3.4.22.-) - rice [Oryza sativa Japonica
           Group]
 gi|62732673|gb|AAX94792.1| cysteine proteinase (EC 3.4.22.-) - rice [Oryza sativa Japonica
           Group]
 gi|62732674|gb|AAX94793.1| cysteine proteinase (EC 3.4.22.-) - rice [Oryza sativa Japonica
           Group]
 gi|77549615|gb|ABA92412.1| Thiol protease SEN102 precursor, putative, expressed [Oryza sativa
           Japonica Group]
 gi|77549616|gb|ABA92413.1| Thiol protease SEN102 precursor, putative, expressed [Oryza sativa
           Japonica Group]
 gi|77549617|gb|ABA92414.1| Thiol protease SEN102 precursor, putative, expressed [Oryza sativa
           Japonica Group]
 gi|113644852|dbj|BAF27993.1| Os11g0255300 [Oryza sativa Japonica Group]
 gi|125576789|gb|EAZ18011.1| hypothetical protein OsJ_33558 [Oryza sativa Japonica Group]
 gi|215701098|dbj|BAG92522.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 378

 Score =  325 bits (832), Expect = 2e-86,   Method: Compositional matrix adjust.
 Identities = 158/328 (48%), Positives = 211/328 (64%), Gaps = 20/328 (6%)

Query: 35  EPSIVEKHEQWMAQH--------GRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKL 86
           E S+   +E+W +++        G    D+ E   R N+F +N  YI +AN+ G R ++L
Sbjct: 35  EESLRALYERWRSRYTVSRPAASGGVGNDDGEARRRFNVFVENARYIHEANRRGGRPFRL 94

Query: 87  GTNEFSDLTNEEFRALYTG----YNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGA 142
             N+F+D+T +EFR  Y G    ++R +            +   +  ++P ++DWRE+GA
Sbjct: 95  ALNKFADMTTDEFRRTYAGSRARHHRSLSGGRGGEGGSFRYGGDDEDNLPPAVDWRERGA 154

Query: 143 VTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST-DNHGCSGGLMDKA 201
           VT IKDQGQCGSCWAFS VAAVEG+ +I  G+L+ LSEQ+LVDC T DN GC GGLMD A
Sbjct: 155 VTGIKDQGQCGSCWAFSTVAAVEGVNKIKTGRLVTLSEQELVDCDTGDNQGCDGGLMDYA 214

Query: 202 FEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPV 261
           F++I  N G+ TE++YPYR E+G C+  K  +   TI  YED+P  DE AL +AV+NQPV
Sbjct: 215 FQFIKRNGGITTESNYPYRAEQGRCNKAKASSHDVTIDGYEDVPANDESALQKAVANQPV 274

Query: 262 SVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGE 321
           +V V+ASG+ F FY  GV   +CG + DHGVA VG+G   +  G KYW++KNSWGE WGE
Sbjct: 275 AVAVEASGQDFQFYSEGVFTGECGTDLDHGVAAVGYGITRD--GTKYWIVKNSWGEDWGE 332

Query: 322 SGYIRILRDA-----GLCGIATAASYPV 344
            GYIR+ R       GLCGIA  ASYPV
Sbjct: 333 RGYIRMQRGVSSDSNGLCGIAMEASYPV 360


>gi|50355611|dbj|BAD29954.1| cysteine protease [Daucus carota]
          Length = 474

 Score =  325 bits (832), Expect = 2e-86,   Method: Compositional matrix adjust.
 Identities = 154/314 (49%), Positives = 211/314 (67%), Gaps = 10/314 (3%)

Query: 38  IVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNE 97
           ++  +E W+ +H + Y    EK  R  IFK N+ ++++ N   N++YKLG N+F+DLTN+
Sbjct: 56  LLSLYESWLVKHHKNYNALGEKETRFGIFKDNVGFVDRHNSMRNQSYKLGLNKFADLTND 115

Query: 98  EFRALY-TGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCW 156
           E+R+LY +G        +    R   F +++   +P S+DWR++GAV  +KDQGQCGSCW
Sbjct: 116 EYRSLYLSGKMMKRERKNEDGFRSDRFVFEDGDHLPESVDWRDRGAVAPVKDQGQCGSCW 175

Query: 157 AFSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGLATEA 215
           AFS V AVEGI +I  G+LI LSEQ+LVDC    N GC+GGLMD AFE+I++N G+ TE 
Sbjct: 176 AFSTVGAVEGINKIVTGELISLSEQELVDCDNGYNQGCNGGLMDYAFEFIVKNGGIDTED 235

Query: 216 DYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFY 275
           DYPY+  +G CD  ++ A   TI+ YED+P  DE++L +AV++QPVSV ++A GRAF  Y
Sbjct: 236 DYPYKGVDGLCDQNRKNAKVVTINGYEDVPHNDEKSLKKAVAHQPVSVAIEAGGRAFQLY 295

Query: 276 KSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD----- 330
           +SGV    CG   DHGV  VG+G+   ENG  YW+++NSWG  WGESGYIR+ R+     
Sbjct: 296 ESGVFTGQCGTELDHGVVAVGYGS---ENGKDYWIVRNSWGPDWGESGYIRLERNVASTS 352

Query: 331 AGLCGIATAASYPV 344
            G CGIA  ASYP 
Sbjct: 353 TGKCGIAMQASYPT 366


>gi|226496089|ref|NP_001149658.1| cysteine protease 1 precursor [Zea mays]
 gi|195629242|gb|ACG36262.1| cysteine protease 1 precursor [Zea mays]
          Length = 469

 Score =  325 bits (832), Expect = 3e-86,   Method: Compositional matrix adjust.
 Identities = 161/327 (49%), Positives = 209/327 (63%), Gaps = 16/327 (4%)

Query: 27  VVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKAN---KEGNRT 83
           +VS     E      + +WMA HGRTY    E+  R  +F+ NL Y++  N     G  +
Sbjct: 31  IVSYGERSEEEARRMYAEWMAAHGRTYNAVGEEERRFEVFRDNLRYVDAHNAAADAGVHS 90

Query: 84  YKLGTNEFSDLTNEEFRALYTGY-NRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGA 142
           ++LG N F+DLTN+E+RA Y G  +RP     R+      +   +  D+P S+DWR KGA
Sbjct: 91  FRLGLNRFADLTNDEYRATYLGVRSRP----QRERRLGDRYLAGDNEDLPESVDWRAKGA 146

Query: 143 VTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKA 201
           V  IKDQG CGSCWAFS +AAVEGI QI  G +I LSEQ+LVDC T  N GC+GGLMD A
Sbjct: 147 VAEIKDQGSCGSCWAFSTIAAVEGINQIVTGDMISLSEQELVDCDTSYNQGCNGGLMDYA 206

Query: 202 FEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPV 261
           FE+II N G+ TE DYPY+  +G CD  ++ A   TI  YED+P   E++L +AV+NQP+
Sbjct: 207 FEFIINNGGIDTEEDYPYKGTDGRCDVNRKNAKVVTIDSYEDVPANSEKSLQKAVANQPI 266

Query: 262 SVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGE 321
           SV ++A GRAF  Y SG+    CG   DHGV  VG+GT   ENG  YW++KNSWG +WGE
Sbjct: 267 SVAIEAGGRAFQLYNSGIFTGTCGTALDHGVTAVGYGT---ENGKDYWIVKNSWGSSWGE 323

Query: 322 SGYIRILRD----AGLCGIATAASYPV 344
           SGY+R+ R+    +G CGIA   SYP+
Sbjct: 324 SGYVRMERNIKASSGKCGIAVEPSYPL 350


>gi|535473|emb|CAA53377.1| cysteine protease [Vicia sativa]
          Length = 368

 Score =  325 bits (832), Expect = 3e-86,   Method: Compositional matrix adjust.
 Identities = 159/345 (46%), Positives = 219/345 (63%), Gaps = 16/345 (4%)

Query: 10  IIPMFVIIILV---ITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIF 66
           I+P F+   L+   +    Q+ +GRS  E  ++  +E+W+ +H + Y    EK  R  IF
Sbjct: 6   ILPFFLFFSLITFSLALDIQLPTGRSNDE--VMTMYEEWLVKHQKVYNGLREKDQRFQIF 63

Query: 67  KQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPS-VSRQSSRPSTFKY 125
           K NL +I++ N + N TY +G N+F+D+TNEE+R +Y G    +   + +       + Y
Sbjct: 64  KDNLNFIDEHNAQ-NYTYIVGLNKFADMTNEEYRDMYLGTRSDIKRRIMKNKITGHRYAY 122

Query: 126 QNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVD 185
            +   +P  +DWR KGA+THIKDQG CGSCWAFS +A VE I +I  GKL+ LSEQ+LVD
Sbjct: 123 NSGDRLPVHVDWRLKGAITHIKDQGSCGSCWAFSTIATVEAINKIVTGKLVSLSEQELVD 182

Query: 186 CSTD-NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDL 244
           C    N GC+GGLMD AFE+II N G+ T+  YPY+  EG CD  ++KA   +I  YED+
Sbjct: 183 CDRAFNEGCNGGLMDYAFEFIIGNGGIDTDQHYPYKGFEGRCDPTRKKAKIVSIDGYEDV 242

Query: 245 PKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEEN 304
           P  +E AL +AV++QPVSV ++ASGRA   Y+SGV    CG + DH V +VG+G+   EN
Sbjct: 243 PSNNENALKKAVAHQPVSVAIEASGRALQLYQSGVFTGKCGTSLDHAVVIVGYGS---EN 299

Query: 305 GAKYWLIKNSWGETWGESGYIRILRDA-----GLCGIATAASYPV 344
           G  YWL++NSWG  WGE GY ++ R+      G CGIA  ASYPV
Sbjct: 300 GLDYWLVRNSWGTNWGEDGYFKMERNVKGTHTGKCGIAVEASYPV 344


>gi|357160591|ref|XP_003578813.1| PREDICTED: vignain-like [Brachypodium distachyon]
          Length = 339

 Score =  325 bits (832), Expect = 3e-86,   Method: Compositional matrix adjust.
 Identities = 159/341 (46%), Positives = 221/341 (64%), Gaps = 21/341 (6%)

Query: 17  IILVITCASQVVSGRSMHEP----SIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEY 72
           ++ ++ C     SG +  E     S+V +HE WM+Q+GR+YKD  EK  +  +FK N  +
Sbjct: 8   LLAILGCLCFFASGLAARELNDDLSMVARHESWMSQYGRSYKDAAEKDRKFEVFKANAAF 67

Query: 73  IEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT--D 130
           I+  N + N  + LG N+F+D+TNEEF+   T  N+    +S +    + F Y+NV+   
Sbjct: 68  IDSFNAK-NHKFWLGINQFADITNEEFKVTKT--NKGF--ISNKVRASTGFSYENVSIDA 122

Query: 131 VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST-- 188
           +P +IDWR KGAVT +KDQGQCG CWAFSAVAA EGI +++ GKL+ LSEQ+LVDC    
Sbjct: 123 LPATIDWRTKGAVTPVKDQGQCGCCWAFSAVAATEGIVKLSTGKLVSLSEQELVDCDVHG 182

Query: 189 DNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGD 248
           ++ GC GGLMD AF++II N GL  E+ YPY  E+G C +  +   A TI  YED+P  +
Sbjct: 183 EDQGCEGGLMDDAFKFIITNGGLTQESSYPYDAEDGKCKSGSKS--AGTIKSYEDVPANN 240

Query: 249 EQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKY 308
           E AL++AV+NQPVSV VD     F FY  GV+   CG + DHG+A +G+G   +  G KY
Sbjct: 241 EGALMKAVANQPVSVAVDGGDMTFQFYSGGVMTGSCGTDLDHGIAAIGYGVTSD--GTKY 298

Query: 309 WLIKNSWGETWGESGYIRILRD----AGLCGIATAASYPVA 345
           WL+KNSWG +WGE+G++R+ +D     G+CG+A   SYP A
Sbjct: 299 WLMKNSWGTSWGENGFLRMEKDIADKKGMCGLAMEPSYPTA 339


>gi|356521444|ref|XP_003529366.1| PREDICTED: thiol protease SEN102-like [Glycine max]
          Length = 340

 Score =  325 bits (832), Expect = 3e-86,   Method: Compositional matrix adjust.
 Identities = 168/349 (48%), Positives = 222/349 (63%), Gaps = 17/349 (4%)

Query: 1   MVLKFEKSFIIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKA 60
           M L  +K  +   F   +L +TC  +  S R++ E SI  +HE+WMA H R Y D  EK 
Sbjct: 1   MALTLDKKSVGTFF---MLFLTCICRA-SSRTLSESSIATQHEEWMAMHDRVYADSAEKD 56

Query: 61  MRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTG--YNRPVPSVSRQSS 118
            R  IFK+NLE+IEK N EG + Y L  N F+DLTNEEF A +TG  Y  P    S + +
Sbjct: 57  RRQQIFKENLEFIEKHNNEGKKRYNLSLNSFADLTNEEFVASHTGALYKPPTQLGSFKIN 116

Query: 119 RPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIEL 178
               F   +V D+  S+DWR++GAV  IK+QG+CGSCWAFSAVAAVEGI QI  G+L+ L
Sbjct: 117 HSLGFHKMSVGDIEASLDWRKRGAVNDIKNQGRCGSCWAFSAVAAVEGINQIKNGQLVSL 176

Query: 179 SEQQLVDCSTDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATI 238
           SEQ LVDC++ N GC G  ++KAF+Y I + GLA E +YPY    GTC      A+   I
Sbjct: 177 SEQNLVDCAS-NDGCHGQYVEKAFDY-IRDYGLANEEEYPYVETVGTCSGNSNPAI--QI 232

Query: 239 SKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFG 298
             Y+ +   +E+ LL AV++QPVSV ++A G+ F FY  GV + +CG   +H V +VG+G
Sbjct: 233 RGYQSVTPQNEEQLLTAVASQPVSVLLEAKGQGFQFYSGGVFSGECGTELNHAVTIVGYG 292

Query: 299 TAEEENGAKYWLIKNSWGETWGESGYIRILRDA----GLCGIATAASYP 343
              EE   KYWLI+NSWG++WGE GY++++RD     GLCGI   ASYP
Sbjct: 293 ---EEAEGKYWLIRNSWGKSWGEGGYMKLMRDTGNPQGLCGINMQASYP 338


>gi|242038089|ref|XP_002466439.1| hypothetical protein SORBIDRAFT_01g007820 [Sorghum bicolor]
 gi|241920293|gb|EER93437.1| hypothetical protein SORBIDRAFT_01g007820 [Sorghum bicolor]
          Length = 353

 Score =  325 bits (832), Expect = 3e-86,   Method: Compositional matrix adjust.
 Identities = 165/354 (46%), Positives = 230/354 (64%), Gaps = 23/354 (6%)

Query: 11  IPMFVIIILVITCASQVVSGRSM-------------HEPSIVEKHEQWMAQHGRTYKDEL 57
           +  FV+ +LV+       + R++                ++V +HE+WMA+HGRTY DE 
Sbjct: 3   VSRFVLTVLVVASVCTAAAPRALAVRELAGEEESAAVAAAMVSRHEKWMAEHGRTYTDEA 62

Query: 58  EKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQS 117
           EKA RL IF+ N E+I+  N  G  +++L TN F+DLT+EEFRA  TG+       +   
Sbjct: 63  EKARRLEIFRANAEFIDSFNDAGKHSHRLATNRFADLTDEEFRAARTGFRPRPAPAAAAG 122

Query: 118 SRPSTFKYQN--VTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKL 175
           S    F+Y+N  + D   S+DWR  GAVT +KDQG+CG CWAFSAVAAVEG+ +I  G+L
Sbjct: 123 S-GGRFRYENFSLADAAQSVDWRAMGAVTGVKDQGECGCCWAFSAVAAVEGLNKIRTGRL 181

Query: 176 IELSEQQLVDCSTD--NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKA 233
           + LSEQ+LVDC  +  + GC GGLMD AF++I    GLA+E+ YPY+ ++G+C +    A
Sbjct: 182 VSLSEQELVDCDVNGEDQGCEGGLMDDAFQFIERRGGLASESGYPYQGDDGSCRSSAAAA 241

Query: 234 VAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVA 293
            AA+I  +ED+P+ +E AL  AV+NQPVSV ++    AF FY SGVL  +CG + +H + 
Sbjct: 242 RAASIRGHEDVPRNNEAALAAAVANQPVSVAINGEDYAFRFYDSGVLGGECGTDLNHAIT 301

Query: 294 VVGFGTAEEENGAKYWLIKNSWGETWGESGYIRI---LRDAGLCGIATAASYPV 344
            VG+GTA +  G+KYWL+KNSWG +WGE GY+RI   +R  G+CG+A   SYPV
Sbjct: 302 AVGYGTAAD--GSKYWLMKNSWGTSWGEGGYVRIRRGVRGEGVCGLAKLPSYPV 353


>gi|147772785|emb|CAN62838.1| hypothetical protein VITISV_003391 [Vitis vinifera]
          Length = 298

 Score =  324 bits (831), Expect = 3e-86,   Method: Compositional matrix adjust.
 Identities = 173/341 (50%), Positives = 212/341 (62%), Gaps = 58/341 (17%)

Query: 11  IPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNL 70
           + M ++ IL    ASQ  S RS+HE S+ E+HE WMA++GR YKD  EK  R  IFK N+
Sbjct: 10  VSMALLFILA-AWASQATS-RSLHEASMYERHEDWMARYGRMYKDANEKEKRFKIFKDNV 67

Query: 71  EYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTD 130
                                                          ++ +TFKY+NVT 
Sbjct: 68  -----------------------------------------------AQATTFKYENVTA 80

Query: 131 VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST-- 188
           VP++IDWR+KGAVT IKDQ QCGSCWAFSAVAA EGITQIT GKLI LSEQ+LVDC T  
Sbjct: 81  VPSTIDWRKKGAVTPIKDQQQCGSCWAFSAVAATEGITQITTGKLISLSEQELVDCDTGG 140

Query: 189 DNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGD 248
           +N GCSGGL D AF +I  + GLA+EA YPY  ++GTC+++KE   AA I  YED+P  +
Sbjct: 141 ENQGCSGGLXDDAFRFIXIH-GLASEATYPYEGDDGTCNSKKEAHPAAKIKGYEDVPANN 199

Query: 249 EQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKY 308
           E+AL +AV++QPV+V +DA G  F FY SGV    CG   DHGVA VG+G    ++G  Y
Sbjct: 200 EKALQKAVAHQPVAVAIDAGGFEFQFYTSGVFTGQCGTELDHGVAAVGYGIG--DDGMXY 257

Query: 309 WLIKNSWGETWGESGYIRILRDA----GLCGIATAASYPVA 345
           WL+KNSWG  WGE GYIR+ RD     GLCGIA  ASYP A
Sbjct: 258 WLVKNSWGTGWGEEGYIRMQRDVTAKEGLCGIAMQASYPTA 298


>gi|111073715|dbj|BAF02546.1| triticain alpha [Triticum aestivum]
 gi|388890585|gb|AFK80346.1| cysteine endopeptidase EP alpha [Secale cereale x Triticum durum]
          Length = 461

 Score =  324 bits (831), Expect = 3e-86,   Method: Compositional matrix adjust.
 Identities = 161/326 (49%), Positives = 213/326 (65%), Gaps = 14/326 (4%)

Query: 27  VVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKAN---KEGNRT 83
           +VS     E  +   + +WM++H RTY    E+  R  +F+ NL YI++ N     G  +
Sbjct: 26  IVSYGERSEEEVRRMYAEWMSEHRRTYNAIGEEERRFEVFRDNLRYIDQHNAAADAGLHS 85

Query: 84  YKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAV 143
           ++LG N F+DLTNEE+R+ Y G  R  P   R+ S  + ++  +  ++P ++DWR+KGAV
Sbjct: 86  FRLGLNRFADLTNEEYRSTYLG-ARTKPDRERKLS--ARYQADDNEELPETVDWRKKGAV 142

Query: 144 THIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAF 202
             IKDQG CGSCWAFSA+AAVEGI QI  G +I LSEQ+LVDC T  N GC+GGLMD AF
Sbjct: 143 AAIKDQGGCGSCWAFSAIAAVEGINQIVTGDMIPLSEQELVDCDTSYNEGCNGGLMDYAF 202

Query: 203 EYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVS 262
           E+II N G+ +E DYPY+  +  CD  K+ A   TI  YED+P   E++L +AV+NQP+S
Sbjct: 203 EFIINNGGIDSEEDYPYKERDNRCDANKKNAKVVTIDGYEDVPVNSEKSLQKAVANQPIS 262

Query: 263 VCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGES 322
           V ++A GRAF  YKSG+    CG   DHGVA VG+GT   ENG  YWL++NSWG  WGE 
Sbjct: 263 VAIEAGGRAFQLYKSGIFTGTCGTALDHGVAAVGYGT---ENGKDYWLVRNSWGTVWGED 319

Query: 323 GYIRILRD----AGLCGIATAASYPV 344
           GYIR+ R+    +G CGIA   SYP 
Sbjct: 320 GYIRMERNIKASSGKCGIAVEPSYPT 345


>gi|37780043|gb|AAP32194.1| cysteine protease 1 [Trifolium repens]
          Length = 292

 Score =  324 bits (831), Expect = 4e-86,   Method: Compositional matrix adjust.
 Identities = 159/295 (53%), Positives = 201/295 (68%), Gaps = 12/295 (4%)

Query: 58  EKAMRLNIFKQNLEYIEKANKE-GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQ 116
           E+  RL IF +N+ YIE +N    N+ YKL  N+F+DLTNEEF A     N+    +   
Sbjct: 3   EREKRLRIFNKNVNYIEASNSAVNNKLYKLSINKFADLTNEEFIA---SRNKFKGHMCSS 59

Query: 117 SSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLI 176
             R +TFKY+N + +P+++DWR+KGAVT +K+QGQCGSCWAFSAVAA EGI Q++ GKL+
Sbjct: 60  IIRTTTFKYENASAIPSTVDWRKKGAVTPVKNQGQCGSCWAFSAVAATEGIHQLSTGKLV 119

Query: 177 ELSEQQLVDCSTD--NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAV 234
            LSEQ+L+DC T   + GC GGLMD AF++II+N GL+TE  YPY   +GTC+  K    
Sbjct: 120 SLSEQELIDCDTKGVDQGCEGGLMDDAFKFIIQNHGLSTEVQYPYEGVDGTCNANKASIH 179

Query: 235 AATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAV 294
           A TI+ YED+P  +E AL +AV+NQP+SV +DASG  F FY SGV    CG   DHGV  
Sbjct: 180 AVTITGYEDVPANNELALQKAVANQPISVAIDASGSDFQFYNSGVFTGSCGTELDHGVTA 239

Query: 295 VGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRDA----GLCGIATAASYPVA 345
           VG+G   +  G KYWL+KNSWG  WGE GYIR+ R      GLCGIA  ASYP A
Sbjct: 240 VGYGVGND--GTKYWLVKNSWGADWGEEGYIRMQRGIAAAEGLCGIAMQASYPTA 292


>gi|445927|prf||1910332A Cys endopeptidase
          Length = 362

 Score =  324 bits (830), Expect = 4e-86,   Method: Compositional matrix adjust.
 Identities = 160/317 (50%), Positives = 212/317 (66%), Gaps = 12/317 (3%)

Query: 35  EPSIVEKHEQWMAQHGRTYKDEL-EKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSD 93
           E S+ + +E+W + H  T    L EK  R N+FK N+ ++   NK  ++ YKL  N+F+D
Sbjct: 33  EESLWDLYERWRSHH--TVSRSLGEKHKRFNVFKANVMHVHNTNKM-DKPYKLKLNKFAD 89

Query: 94  LTNEEFRALYTGYNRPVPSVSRQSSRPS-TFKYQNVTDVPTSIDWREKGAVTHIKDQGQC 152
           +TN EFR+ Y G       + R S   S TF Y+ V  VP S+DWR+KGAVT +KDQGQC
Sbjct: 90  MTNHEFRSTYAGSKVNHHKMFRGSQHGSGTFMYEKVGSVPASVDWRKKGAVTDVKDQGQC 149

Query: 153 GSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGL 211
           GSCWAFS + AVEGI QI   KL+ LSEQ+LVDC  + N GC+GGLM+ AFE+I +  G+
Sbjct: 150 GSCWAFSTIVAVEGINQIKTNKLVSLSEQELVDCDKEENQGCNGGLMESAFEFIKQKGGI 209

Query: 212 ATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRA 271
            TE++YPY+ +EGTCD  K   +A +I  +E++P  DE ALL+AV+NQPVSV +DA G  
Sbjct: 210 TTESNYPYKAQEGTCDESKVNDLAVSIDGHENVPVNDENALLKAVANQPVSVAIDAGGSD 269

Query: 272 FHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD- 330
           F FY  GV   DC  + +HGVA+VG+GT  +  G  YW+++NSWG  WGE GYIR+ R+ 
Sbjct: 270 FQFYSEGVFTGDCNTDLNHGVAIVGYGTTVD--GTNYWIVRNSWGPEWGEQGYIRMQRNI 327

Query: 331 ---AGLCGIATAASYPV 344
               GLCGIA  ASYP+
Sbjct: 328 SKKEGLCGIAMMASYPI 344


>gi|148927394|gb|ABR19828.1| cysteine proteinase [Elaeis guineensis]
          Length = 469

 Score =  324 bits (830), Expect = 4e-86,   Method: Compositional matrix adjust.
 Identities = 158/312 (50%), Positives = 207/312 (66%), Gaps = 12/312 (3%)

Query: 42  HEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKAN---KEGNRTYKLGTNEFSDLTNEE 98
           ++ W AQH R+Y    E   RL IF+ NL +I++ N     G  +++LG   F+DLTNEE
Sbjct: 47  YQAWKAQHARSYNALDEDEQRLEIFRDNLRFIDQHNAAANAGKYSFRLGLTRFADLTNEE 106

Query: 99  FRALYTGYNRPVPSVSRQSSRPST-FKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWA 157
           +R+ Y G         R S+  S  +++++  D+P SIDWR+KGAV  +KDQG CGSCWA
Sbjct: 107 YRSTYLGVRTAGSRRRRNSTVGSNRYRFRSSDDLPDSIDWRDKGAVVDVKDQGSCGSCWA 166

Query: 158 FSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGLATEAD 216
           FS +AAVEGI  I  G LI LSEQ+LVDC T  N GC+GGLMD AFE+II N G+ T+ D
Sbjct: 167 FSTIAAVEGINHIVTGDLISLSEQELVDCDTYYNQGCNGGLMDYAFEFIISNGGIDTDED 226

Query: 217 YPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYK 276
           YPY   +G+CD  ++ A   TI  YED+P  DE++L +AV+NQPVSV ++A GRAF  Y+
Sbjct: 227 YPYTGRDGSCDQYRKNAHVVTIDSYEDVPINDEKSLQKAVANQPVSVAIEAGGRAFQLYE 286

Query: 277 SGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD----AG 332
           SG+    CG   DHGV  +G+G+   ENG  YW++KNSWG  WGESGYIR+ R+     G
Sbjct: 287 SGIFTGYCGTELDHGVTAIGYGS---ENGKYYWIVKNSWGSDWGESGYIRMERNINSATG 343

Query: 333 LCGIATAASYPV 344
            CGIA  ASYP+
Sbjct: 344 KCGIAMEASYPI 355


>gi|168063167|ref|XP_001783545.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162664932|gb|EDQ51634.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 461

 Score =  324 bits (830), Expect = 4e-86,   Method: Compositional matrix adjust.
 Identities = 161/319 (50%), Positives = 213/319 (66%), Gaps = 17/319 (5%)

Query: 34  HEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSD 93
           HE  ++E+   W  +HG+ Y D  +   R  ++K NL YI  +  E NRTY LG  +F+D
Sbjct: 46  HENLLLEQFAAWAHKHGKAYHDAEQCLHRFAVWKDNLAYIRHS--ETNRTYSLGLTKFAD 103

Query: 94  LTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCG 153
           LTNEEFR +YTG        SR++ R + F+Y + ++ P S+DWR+ GAVT +KDQG CG
Sbjct: 104 LTNEEFRRMYTGTR---IDRSRRAKRRTGFRYAD-SEAPESVDWRKNGAVTSVKDQGSCG 159

Query: 154 SCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGLA 212
           SCWAFSAV +VEGI  I  G+ + LSEQ+LVDC  + N GC+GGLMD AF++II+N G+ 
Sbjct: 160 SCWAFSAVGSVEGINAIRNGEAVSLSEQELVDCDLEYNQGCNGGLMDYAFDFIIQNGGID 219

Query: 213 TEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAF 272
           TE DYPY+  +G CDN K+ A   TI  YED+P+ DE+AL +AV+ QPVSV ++A GR F
Sbjct: 220 TEKDYPYKGFDGRCDNSKKNAHVVTIDGYEDVPENDEEALKKAVAGQPVSVAIEAGGRDF 279

Query: 273 HFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD-- 330
             Y  GV + +CG + DHGV  VG+GT   E+G  YW++KNSWGE WGESGY+R+ R+  
Sbjct: 280 QLYAQGVFSGECGTDLDHGVLAVGYGT---EDGVDYWIVKNSWGEYWGESGYLRMKRNMK 336

Query: 331 -----AGLCGIATAASYPV 344
                 GLCGI    SY V
Sbjct: 337 DSNDGPGLCGINIEPSYAV 355


>gi|1223922|gb|AAA92063.1| cysteinyl endopeptidase [Vigna radiata]
          Length = 362

 Score =  324 bits (830), Expect = 4e-86,   Method: Compositional matrix adjust.
 Identities = 158/316 (50%), Positives = 212/316 (67%), Gaps = 10/316 (3%)

Query: 35  EPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDL 94
           E S+ + +E+W + H    +   EK  R N+FK+N+ ++   NK  ++ YKL  N+F+D+
Sbjct: 33  EESLWDLYERWRSHH-TVSRSLTEKHKRFNVFKENVMHVHNTNKM-DKPYKLKLNKFADM 90

Query: 95  TNEEFRALYTGYNRPVPSVSRQSSRPS-TFKYQNVTDVPTSIDWREKGAVTHIKDQGQCG 153
           TN EFR+ Y G       + R +   + TF Y+ V  VP S+DWR+KGAVT +KDQGQCG
Sbjct: 91  TNHEFRSTYAGSKVNHHKMFRGTQHGNGTFMYEKVGSVPASVDWRKKGAVTDVKDQGQCG 150

Query: 154 SCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGLA 212
           SCWAFS V AVEGI QI   KL+ LSEQ+LVDC  + N GC+GGLM+ AFE+I +  G+ 
Sbjct: 151 SCWAFSTVVAVEGINQIKTDKLVSLSEQELVDCDKEENQGCNGGLMESAFEFIKQKGGIT 210

Query: 213 TEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAF 272
           TE++YPY  +EGTCD  K   +A +I  +E++P  DE ALL+AV+NQPVSV +DA G  F
Sbjct: 211 TESNYPYTAQEGTCDASKVNDLAVSIDGHENVPVNDENALLKAVANQPVSVAIDAGGSDF 270

Query: 273 HFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD-- 330
            FY  GVL  DC  + +HGVA+VG+GT  +  G  YW+++NSWG  WGE GYIR+ R+  
Sbjct: 271 QFYSEGVLTGDCNTDLNHGVAIVGYGTTVD--GTNYWIVRNSWGPEWGEQGYIRMQRNIS 328

Query: 331 --AGLCGIATAASYPV 344
              GLCGIA  ASYP+
Sbjct: 329 KKEGLCGIAMMASYPI 344


>gi|297745594|emb|CBI40759.3| unnamed protein product [Vitis vinifera]
          Length = 300

 Score =  324 bits (830), Expect = 4e-86,   Method: Compositional matrix adjust.
 Identities = 155/304 (50%), Positives = 208/304 (68%), Gaps = 12/304 (3%)

Query: 46  MAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTG 105
           M++HG++Y+   EK  R  +F+ NL++I++ NK+ + +Y LG NEF+DL++EEF+  Y G
Sbjct: 1   MSKHGKSYRSFEEKLHRFEVFQDNLKHIDETNKKVS-SYWLGLNEFADLSHEEFKRKYLG 59

Query: 106 YNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVE 165
               +P   ++   P  F Y++V D+P S+DWR+KGAV H+K+QG CGSCWAFS VAAVE
Sbjct: 60  LKIELP---KRRDSPEEFSYKDVADLPKSVDWRKKGAVAHVKNQGACGSCWAFSTVAAVE 116

Query: 166 GITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEG 224
           GI QI  G L  LSEQ+L+DC    N+GC+GGLMD AF +II N GL  E DYPY  EEG
Sbjct: 117 GINQIVTGNLTALSEQELIDCDKPFNNGCNGGLMDYAFAFIISNGGLRKEEDYPYVMEEG 176

Query: 225 TCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADC 284
           TC  +KE+    TIS Y D+P+ +EQ+ L+A++NQP+SV ++AS R F FY  G+ N  C
Sbjct: 177 TCGEKKEELEVVTISGYHDVPEDNEQSFLKALANQPLSVAIEASSRGFQFYSGGIFNGHC 236

Query: 285 GNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRDA----GLCGIATAA 340
           G   DHGVA VG+GT++   G  Y  +KNSWG  WGE GYIR+ R+     G+CGI   A
Sbjct: 237 GTELDHGVAAVGYGTSK---GVDYITVKNSWGSKWGEKGYIRMKRNVGKPEGICGIYKMA 293

Query: 341 SYPV 344
           SYP 
Sbjct: 294 SYPT 297


>gi|255567869|ref|XP_002524912.1| cysteine protease, putative [Ricinus communis]
 gi|223535747|gb|EEF37409.1| cysteine protease, putative [Ricinus communis]
          Length = 366

 Score =  324 bits (830), Expect = 4e-86,   Method: Compositional matrix adjust.
 Identities = 162/325 (49%), Positives = 215/325 (66%), Gaps = 16/325 (4%)

Query: 29  SGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGT 88
           S RS +E  ++  +  W+A+H +TY    E+  R  IFK NL +I++ N   NRTYK+G 
Sbjct: 37  SWRSDNE--VISMYNWWLAKHSKTYNKLGEREKRFEIFKNNLRFIDEHNNSKNRTYKVGL 94

Query: 89  NEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPS---TFKYQNVTDVPTSIDWREKGAVTH 145
             F+DLTNEE+RA + G          +S  PS    FK  +V  +P SIDWR+ GAV+ 
Sbjct: 95  TRFADLTNEEYRAKFLGTKSDPKRRLMKSKNPSQRYAFKAGDV--LPESIDWRQSGAVSA 152

Query: 146 IKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEY 204
           IKDQG CGSCWAFS +AAVEG+ +I  G+LI LSEQ+LVDC    N GC+GGLMD AF++
Sbjct: 153 IKDQGSCGSCWAFSTIAAVEGVNKIVTGELISLSEQELVDCDRSYNAGCNGGLMDNAFQF 212

Query: 205 IIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVC 264
           II N G+ T+ DYPY+  +G CD  K K  A TI  +ED+   DE AL +AV++QPVSV 
Sbjct: 213 IINNGGIDTDKDYPYQAVDGKCDTTKVKNKAVTIDGFEDVMAFDEMALQKAVAHQPVSVA 272

Query: 265 VDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGY 324
           ++ASG A  FY+SGV   +CG+  DHGV +VG+GT   E+G  YWL++NSWG  WGE+GY
Sbjct: 273 IEASGMALQFYQSGVFTGECGSALDHGVVIVGYGT---EDGIDYWLVRNSWGRDWGENGY 329

Query: 325 IRILRDA-----GLCGIATAASYPV 344
           I++ R+      G CGIA  +SYP+
Sbjct: 330 IKMQRNVVDTFTGKCGIAMESSYPI 354


>gi|413919736|gb|AFW59668.1| cysteine protease 1 [Zea mays]
          Length = 469

 Score =  324 bits (830), Expect = 5e-86,   Method: Compositional matrix adjust.
 Identities = 160/327 (48%), Positives = 209/327 (63%), Gaps = 16/327 (4%)

Query: 27  VVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKAN---KEGNRT 83
           +VS     E      + +WMA HGRTY    E+  R  +F+ NL Y++  N     G  +
Sbjct: 31  IVSYGERSEEEARRMYAEWMAAHGRTYNAVGEEERRFEVFRDNLRYVDAHNAAADAGVHS 90

Query: 84  YKLGTNEFSDLTNEEFRALYTGY-NRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGA 142
           ++LG N F+DLTN+E+RA Y G  +RP     R+      +   +  D+P S+DWR KGA
Sbjct: 91  FRLGLNRFADLTNDEYRATYLGVRSRP----QRERRLGDRYLAGDNEDLPESVDWRAKGA 146

Query: 143 VTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKA 201
           V  +KDQG CGSCWAFS +AAVEGI QI  G +I LSEQ+LVDC T  N GC+GGLMD A
Sbjct: 147 VAEVKDQGSCGSCWAFSTIAAVEGINQIVTGDMISLSEQELVDCDTSYNQGCNGGLMDYA 206

Query: 202 FEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPV 261
           FE+II N G+ TE DYPY+  +G CD  ++ A   TI  YED+P   E++L +AV+NQP+
Sbjct: 207 FEFIINNGGIDTEEDYPYKGTDGRCDVNRKNAKVVTIDSYEDVPANSEKSLQKAVANQPI 266

Query: 262 SVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGE 321
           SV ++A GRAF  Y SG+    CG   DHGV  VG+GT   ENG  YW++KNSWG +WGE
Sbjct: 267 SVAIEAGGRAFQLYNSGIFTGTCGTALDHGVTAVGYGT---ENGKDYWIVKNSWGSSWGE 323

Query: 322 SGYIRILRD----AGLCGIATAASYPV 344
           SGY+R+ R+    +G CGIA   SYP+
Sbjct: 324 SGYVRMERNIKASSGKCGIAVEPSYPL 350


>gi|225438807|ref|XP_002283263.1| PREDICTED: germination-specific cysteine protease 1-like isoform 1
           [Vitis vinifera]
          Length = 374

 Score =  324 bits (830), Expect = 5e-86,   Method: Compositional matrix adjust.
 Identities = 160/318 (50%), Positives = 215/318 (67%), Gaps = 13/318 (4%)

Query: 35  EPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDL 94
           E  ++  ++ WMA+HG+ Y    EK  R  IFK NL++I++ N + NRTYK+G N F+DL
Sbjct: 39  EEEVMGMYQWWMAKHGKAYNGLGEKEKRFEIFKDNLKFIDEHNAQ-NRTYKVGLNRFADL 97

Query: 95  TNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTD--VPTSIDWREKGAVTHIKDQGQC 152
           TNEE+RA+Y G  R  P       + ++ +Y  +    +P S+DWRE GAV  +KDQ  C
Sbjct: 98  TNEEYRAIYLG-TRSDPKRRFAKLKNASPRYAVMPGEVLPESVDWRETGAVNPVKDQRSC 156

Query: 153 GSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGL 211
           GSCWAFS VAAVEGI QI  G+LI LSEQ+LVDC T+ + GC+GGLMD AF++II+N GL
Sbjct: 157 GSCWAFSTVAAVEGINQIVTGELISLSEQELVDCDTEYDMGCNGGLMDYAFDFIIKNGGL 216

Query: 212 ATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRA 271
            TE DYPY   +G C+   + +   +I  YED+P  DE+AL +AV++QPVSV V+A GRA
Sbjct: 217 DTEKDYPYTGFDGECNLSGKSSKVVSIDGYEDVPPFDEKALQKAVAHQPVSVAVEAGGRA 276

Query: 272 FHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD- 330
              Y SG+   +CG   DHG+  VG+GT   ENG  YW+++NSWG +WGE+GYIR+ R+ 
Sbjct: 277 LQLYVSGIFTGECGTALDHGIVAVGYGT---ENGTDYWIVRNSWGSSWGENGYIRMERNM 333

Query: 331 ----AGLCGIATAASYPV 344
               +G CGIA  ASYP+
Sbjct: 334 ADAFSGKCGIAMEASYPI 351


>gi|449438381|ref|XP_004136967.1| PREDICTED: cysteine proteinase RD21a-like [Cucumis sativus]
          Length = 479

 Score =  324 bits (830), Expect = 5e-86,   Method: Compositional matrix adjust.
 Identities = 160/311 (51%), Positives = 211/311 (67%), Gaps = 15/311 (4%)

Query: 42  HEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRA 101
           +E W+  HG+ Y    EK  R  IFK NL +I++ N+E +RTYK+G   F+DLTNEE+RA
Sbjct: 62  YESWLVHHGKAYNAIGEKERRFEIFKDNLRFIDEHNRE-SRTYKVGLTRFADLTNEEYRA 120

Query: 102 LYTG--YNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFS 159
            + G  ++R  P +S  +++   +      D+P  +DWR+KGAV  +KDQGQCGSCWAFS
Sbjct: 121 RFLGGRFSRK-PRLS--AAKSGRYAAALGDDLPDDVDWRKKGAVATVKDQGQCGSCWAFS 177

Query: 160 AVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGLATEADYP 218
           +VAAVEGI QI  G+LI LSEQ+LVDC    N GC+GGLMD AF++II N G+ TE DYP
Sbjct: 178 SVAAVEGINQIVTGELIPLSEQELVDCDKSFNMGCNGGLMDYAFQFIIGNGGIDTEEDYP 237

Query: 219 YRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSG 278
           Y+  +  CD  ++ A   TI  YED+P+ DE +L +AV+NQPVSV ++A GRAF  Y+SG
Sbjct: 238 YKGRDAACDPNRKNAKVVTIDGYEDVPENDESSLKKAVANQPVSVAIEAGGRAFQLYQSG 297

Query: 279 VLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD-----AGL 333
           V    CG + DHGV  VG+GT   +NG  YW+++NSWG+ WGESGYIR+ R+      G 
Sbjct: 298 VFTGRCGTDLDHGVVAVGYGT---DNGTDYWIVRNSWGKDWGESGYIRLERNVANITTGK 354

Query: 334 CGIATAASYPV 344
           CGIA   SYP 
Sbjct: 355 CGIAVQPSYPT 365


>gi|544129|sp|P25803.2|CYSEP_PHAVU RecName: Full=Vignain; AltName: Full=Bean endopeptidase; AltName:
           Full=Cysteine proteinase EP-C1; Flags: Precursor
 gi|20994|emb|CAA44816.1| endopeptidase [Phaseolus vulgaris]
          Length = 362

 Score =  324 bits (830), Expect = 5e-86,   Method: Compositional matrix adjust.
 Identities = 159/317 (50%), Positives = 211/317 (66%), Gaps = 12/317 (3%)

Query: 35  EPSIVEKHEQWMAQHGRTYKDEL-EKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSD 93
           E S+ + +E+W + H  T    L EK  R N+FK NL ++   NK  ++ YKL  N+F+D
Sbjct: 33  EESLWDLYERWRSHH--TVSRSLGEKHKRFNVFKANLMHVHNTNKM-DKPYKLKLNKFAD 89

Query: 94  LTNEEFRALYTGYNRPVPSVSRQSSRPS-TFKYQNVTDVPTSIDWREKGAVTHIKDQGQC 152
           +TN EFR+ Y G     P + R +   +  F Y+ V  VP S+DWR+KGAVT +KDQGQC
Sbjct: 90  MTNHEFRSTYAGSKVNHPRMFRGTPHENGAFMYEKVVSVPPSVDWRKKGAVTDVKDQGQC 149

Query: 153 GSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGL 211
           GSCWAFS V AVEGI QI   KL+ LSEQ+LVDC  + N GC+GGLM+ AFE+I +  G+
Sbjct: 150 GSCWAFSTVVAVEGINQIKTNKLVALSEQELVDCDKEENQGCNGGLMESAFEFIKQKGGI 209

Query: 212 ATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRA 271
            TE++YPY+ +EGTCD  K   +A +I  +E++P  DE ALL+AV+NQPVSV +DA G  
Sbjct: 210 TTESNYPYKAQEGTCDASKVNDLAVSIDGHENVPANDEDALLKAVANQPVSVAIDAGGSD 269

Query: 272 FHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD- 330
           F FY  GV   DC  + +HGVA+VG+GT  +  G  YW+++NSWG  WGE GYIR+ R+ 
Sbjct: 270 FQFYSEGVFTGDCSTDLNHGVAIVGYGTTVD--GTNYWIVRNSWGPEWGEHGYIRMQRNI 327

Query: 331 ---AGLCGIATAASYPV 344
               GLCGIA   SYP+
Sbjct: 328 SKKEGLCGIAMLPSYPI 344


>gi|255646767|gb|ACU23856.1| unknown [Glycine max]
          Length = 350

 Score =  323 bits (829), Expect = 5e-86,   Method: Compositional matrix adjust.
 Identities = 160/312 (51%), Positives = 213/312 (68%), Gaps = 13/312 (4%)

Query: 38  IVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNE 97
           ++E  E WM++HG+ Y++  EK +R  IFK NL++I++ NK  +  Y LG +EF+DL++ 
Sbjct: 44  LIELFESWMSRHGKIYENIEEKLLRFEIFKDNLKHIDERNKVVS-NYWLGLSEFADLSHR 102

Query: 98  EFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWA 157
           EF   Y G        SR+   P  F Y++V ++P S+DWR+KGAV  +K+QG CGSCWA
Sbjct: 103 EFNNKYLGLK---VDYSRRRESPEEFTYKDV-ELPKSVDWRKKGAVAPVKNQGSCGSCWA 158

Query: 158 FSAVAAVEGITQITRGKLIELSEQQLVDCS-TDNHGCSGGLMDKAFEYIIENKGLATEAD 216
           FS VAAVEGI QI  G L  LSEQ+L+DC  T N+GC+GGLMD AF +I+EN GL  E D
Sbjct: 159 FSTVAAVEGINQIVTGNLTSLSEQELIDCDRTYNNGCNGGLMDYAFSFIVENGGLHKEED 218

Query: 217 YPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYK 276
           YPY  EEG C+  KE+    TIS Y D+P+ +EQ+LL+A++NQP+SV ++ASGR F FY 
Sbjct: 219 YPYIMEEGACEMTKEETQVVTISGYHDVPQNNEQSLLKALANQPLSVAIEASGRDFQFYS 278

Query: 277 SGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRDA----G 332
            GV +  CG++ DHGVA VG+GTA+   G  Y  +KNSWG  WGE GYIR+ R+     G
Sbjct: 279 GGVFDGHCGSDLDHGVAAVGYGTAK---GVDYITVKNSWGSKWGEKGYIRMRRNIGKPEG 335

Query: 333 LCGIATAASYPV 344
           +CGI   ASYP 
Sbjct: 336 ICGIYKMASYPT 347


>gi|1256830|gb|AAB68374.1| cysteine endopeptidase 1 [Phaseolus vulgaris]
 gi|2959418|emb|CAA12118.1| cysteine protease [Phaseolus vulgaris]
          Length = 364

 Score =  323 bits (829), Expect = 6e-86,   Method: Compositional matrix adjust.
 Identities = 160/344 (46%), Positives = 218/344 (63%), Gaps = 12/344 (3%)

Query: 8   SFIIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFK 67
           S +IP  +++    + A+  +S  +  E  +++ +E+W+ +H + Y    EK  R  +FK
Sbjct: 3   SMLIPTLLLLSFTFSHAT-AMSIINYSENEVMDMYEEWLVKHRKVYNGLDEKEKRFQVFK 61

Query: 68  QNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPS-VSRQSSRPSTFKYQ 126
            NL +I+  N + N TY LG N+F+D+TN+E+RA+Y G        V +  +    + Y 
Sbjct: 62  DNLGFIQDHNAQ-NNTYTLGLNKFADITNKEYRAMYLGTRTDAKRRVMKTQNTGHRYAYN 120

Query: 127 NVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDC 186
           +   +P  +DWR KGAV  IKDQG CGSCWAFS VAAVEGI  I  G+ + LSEQ+LVDC
Sbjct: 121 SGDQLPVHVDWRLKGAVGPIKDQGNCGSCWAFSTVAAVEGINNIVTGEFVSLSEQELVDC 180

Query: 187 STD-NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLP 245
             + + GC+GGLMD AF++II+N G+ TE DYPY+  +GTCD  K+K     I  YED+P
Sbjct: 181 DREYDEGCNGGLMDYAFQFIIQNGGIDTEEDYPYQGIDGTCDETKKKTKVVQIDGYEDVP 240

Query: 246 KGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENG 305
             +E AL +AVS+QPVSV ++ASGRA   Y+SGV    CG   DHGV VVG+GT   ENG
Sbjct: 241 SNNENALKKAVSHQPVSVAIEASGRALQLYQSGVFTGKCGTALDHGVVVVGYGT---ENG 297

Query: 306 AKYWLIKNSWGETWGESGYIRILRDA-----GLCGIATAASYPV 344
             YWL++NSWG  WGE GY ++ R+      G CGIA   SYPV
Sbjct: 298 VDYWLVRNSWGTGWGEDGYFKMERNVRSTSEGKCGIAMDCSYPV 341


>gi|242092700|ref|XP_002436840.1| hypothetical protein SORBIDRAFT_10g009830 [Sorghum bicolor]
 gi|241915063|gb|EER88207.1| hypothetical protein SORBIDRAFT_10g009830 [Sorghum bicolor]
          Length = 328

 Score =  323 bits (828), Expect = 7e-86,   Method: Compositional matrix adjust.
 Identities = 159/341 (46%), Positives = 216/341 (63%), Gaps = 30/341 (8%)

Query: 13  MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEY 72
           +  I+ L   C + + +     + ++V +HEQWM Q+ R YKD  EKA R  +FK N+++
Sbjct: 8   ILAILGLAFFCGAALAARDLNDDSAMVARHEQWMVQYSRVYKDTTEKARRFEVFKANVKF 67

Query: 73  IEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYN-RPVPSVSRQSSRPSTFKYQNVT-- 129
           IE  N  GNR + LG N+F+DLTN+EFRA  T    +P P        P+ F+Y+NV+  
Sbjct: 68  IESFNAGGNRKFWLGVNQFADLTNDEFRATKTNKGFKPSP-----VKVPTGFRYENVSVD 122

Query: 130 DVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST- 188
            +P +IDWR KGAVT IKDQGQC            EGI +I+ GKLI LSEQ+LVDC   
Sbjct: 123 ALPATIDWRTKGAVTPIKDQGQC------------EGIVKISTGKLISLSEQELVDCDVH 170

Query: 189 -DNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKG 247
            ++ GC GGLMD AF++II+N GL TE+ YPY   +G C +      AAT+  +ED+P  
Sbjct: 171 GEDQGCEGGLMDDAFQFIIKNGGLTTESSYPYTAADGKCKSGSNS--AATVKGFEDVPAN 228

Query: 248 DEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAK 307
           DE AL++AV+NQPVSV VD     F FY  GV+   CG + DHG+A +G+G  +  +G K
Sbjct: 229 DEAALMKAVANQPVSVAVDGGDMTFQFYSGGVMTGSCGTDLDHGIAAIGYG--QTSDGTK 286

Query: 308 YWLIKNSWGETWGESGYIRILRDA----GLCGIATAASYPV 344
           YWL+KNSWG TWGE+GY+R+ +D     G+CG+A   SYP+
Sbjct: 287 YWLLKNSWGTTWGENGYLRMEKDISDKRGMCGLAMEPSYPI 327


>gi|118158|sp|P12412.1|CYSEP_VIGMU RecName: Full=Vignain; AltName: Full=Bean endopeptidase; AltName:
           Full=Cysteine proteinase; AltName:
           Full=Sulfhydryl-endopeptidase; Short=SH-EP; Contains:
           RecName: Full=Vignain-1; Contains: RecName:
           Full=Vignain-2; Flags: Precursor
 gi|22062|emb|CAA33753.1| sulfhydryl-pre-endopeptidase (AA -20 to 342) [Vigna mungo]
 gi|22066|emb|CAA36181.1| sulfhydryl-endopeptidase [Vigna mungo]
          Length = 362

 Score =  323 bits (827), Expect = 9e-86,   Method: Compositional matrix adjust.
 Identities = 160/317 (50%), Positives = 211/317 (66%), Gaps = 12/317 (3%)

Query: 35  EPSIVEKHEQWMAQHGRTYKDEL-EKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSD 93
           E S+ + +E+W + H  T    L EK  R N+FK N+ ++   NK  ++ YKL  N+F+D
Sbjct: 33  EESLWDLYERWRSHH--TVSRSLGEKHKRFNVFKANVMHVHNTNKM-DKPYKLKLNKFAD 89

Query: 94  LTNEEFRALYTGYNRPVPSVSRQSSRPS-TFKYQNVTDVPTSIDWREKGAVTHIKDQGQC 152
           +TN EFR+ Y G       + R S   S TF Y+ V  VP S+DWR+KGAVT +KDQGQC
Sbjct: 90  MTNHEFRSTYAGSKVNHHKMFRGSQHGSGTFMYEKVGSVPASVDWRKKGAVTDVKDQGQC 149

Query: 153 GSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGL 211
           GSCWAFS + AVEGI QI   KL+ LSEQ+LVDC  + N GC+GGLM+ AFE+I +  G+
Sbjct: 150 GSCWAFSTIVAVEGINQIKTNKLVSLSEQELVDCDKEENQGCNGGLMESAFEFIKQKGGI 209

Query: 212 ATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRA 271
            TE++YPY  +EGTCD  K   +A +I  +E++P  DE ALL+AV+NQPVSV +DA G  
Sbjct: 210 TTESNYPYTAQEGTCDESKVNDLAVSIDGHENVPVNDENALLKAVANQPVSVAIDAGGSD 269

Query: 272 FHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD- 330
           F FY  GV   DC  + +HGVA+VG+GT  +  G  YW+++NSWG  WGE GYIR+ R+ 
Sbjct: 270 FQFYSEGVFTGDCNTDLNHGVAIVGYGTTVD--GTNYWIVRNSWGPEWGEQGYIRMQRNI 327

Query: 331 ---AGLCGIATAASYPV 344
               GLCGIA  ASYP+
Sbjct: 328 SKKEGLCGIAMMASYPI 344


>gi|50355617|dbj|BAD29957.1| cysteine protease [Daucus carota]
          Length = 437

 Score =  323 bits (827), Expect = 9e-86,   Method: Compositional matrix adjust.
 Identities = 159/314 (50%), Positives = 208/314 (66%), Gaps = 12/314 (3%)

Query: 38  IVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNE 97
           ++  +  W+ +HG++Y    EK  R  IFK NL YI+  N + +R+Y+LG N F+DLTNE
Sbjct: 45  VMTMYNSWLVKHGKSYNALGEKETRFQIFKDNLRYIDNHNADPDRSYELGLNRFADLTNE 104

Query: 98  EFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT--DVPTSIDWREKGAVTHIKDQGQCGSC 155
           E+RA Y G  +   S  + S  PS  +Y  V   ++P SIDWREKGAV  +KDQG CGSC
Sbjct: 105 EYRAKYLG-TKSRESRPKLSKGPSD-RYAPVEGEELPDSIDWREKGAVAAVKDQGSCGSC 162

Query: 156 WAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGLATE 214
           WAFSA+ AVEGI QIT G+LI LSEQ+LVDC    N GC GGLMD AF +II+N G+ ++
Sbjct: 163 WAFSAIGAVEGINQITTGELITLSEQELVDCDRSYNEGCEGGLMDYAFNFIIKNGGIDSD 222

Query: 215 ADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHF 274
            DYPY   +GTC+  KE A   TI  YED+P  DE+AL +A +NQP+SV ++A G  F  
Sbjct: 223 LDYPYTGRDGTCNQNKENAKVVTIDSYEDVPVYDEKALQKAAANQPISVAIEAGGMDFQL 282

Query: 275 YKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD---- 330
           Y SG+    CG   DHGV VVG+G+   E G  YW+++NSWG  WGE+GY+++ R+    
Sbjct: 283 YVSGIFTGKCGTAVDHGVVVVGYGS---EEGMDYWIVRNSWGAAWGEAGYLKMQRNVGKS 339

Query: 331 AGLCGIATAASYPV 344
           +GLCGI    SYPV
Sbjct: 340 SGLCGITIEPSYPV 353


>gi|242071345|ref|XP_002450949.1| hypothetical protein SORBIDRAFT_05g021550 [Sorghum bicolor]
 gi|241936792|gb|EES09937.1| hypothetical protein SORBIDRAFT_05g021550 [Sorghum bicolor]
          Length = 371

 Score =  323 bits (827), Expect = 9e-86,   Method: Compositional matrix adjust.
 Identities = 161/326 (49%), Positives = 216/326 (66%), Gaps = 18/326 (5%)

Query: 35  EPSIVEKHEQW----MAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNE 90
           E S+   +EQW    M       +++ +KA   N+FK+N+ YI +ANK+G R+++L  N+
Sbjct: 35  EESLRALYEQWRSHYMVSRPAGLQEQDDKARWFNVFKENVRYIHEANKKG-RSFRLALNK 93

Query: 91  FSDLTNEEFRALY-----TGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTH 145
           F+D+T +EFR  Y     T ++R + S  R+    S F Y    ++P ++DWR++GAVT 
Sbjct: 94  FADMTTDEFRRAYAAGSRTRHHRALSSGIRRHGDGS-FMYAQAGNLPLAVDWRQRGAVTG 152

Query: 146 IKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCS-TDNHGCSGGLMDKAFEY 204
           IKDQGQCGSCWAFS +AAVEGI +I  GKL+ LSEQ+LVDC   DN GC+GGLMD AF+Y
Sbjct: 153 IKDQGQCGSCWAFSTIAAVEGINKIRTGKLVSLSEQELVDCDDVDNQGCNGGLMDYAFQY 212

Query: 205 IIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVC 264
           I  N G+ TE++YPY  E+ +C+  KE++   TI  YED+P  +E AL +AV+NQPVS+ 
Sbjct: 213 IKRNGGITTESNYPYLAEQRSCNKAKERSHDVTIDGYEDVPANNEDALQKAVANQPVSIA 272

Query: 265 VDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGY 324
           ++ASG+ F FY  GV    CG   DHGVA VG+G   +  G KYW++KNSWGE WGE GY
Sbjct: 273 IEASGQDFQFYSEGVFTGSCGTELDHGVAAVGYGITRD--GTKYWIVKNSWGEDWGERGY 330

Query: 325 IRILR----DAGLCGIATAASYPVAI 346
           IR+ R      GLCGIA   SYP  I
Sbjct: 331 IRMQRGISDSQGLCGIAMEPSYPTKI 356


>gi|297851332|ref|XP_002893547.1| peptidase C1A papain family protein [Arabidopsis lyrata subsp.
           lyrata]
 gi|297339389|gb|EFH69806.1| peptidase C1A papain family protein [Arabidopsis lyrata subsp.
           lyrata]
          Length = 345

 Score =  323 bits (827), Expect = 1e-85,   Method: Compositional matrix adjust.
 Identities = 165/329 (50%), Positives = 226/329 (68%), Gaps = 12/329 (3%)

Query: 25  SQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTY 84
           S+  S  ++HEP+I   H++WM    R Y DE EK MRL +F +NL++IE  N  G+++Y
Sbjct: 21  SEATSRVALHEPTIFYYHQKWMINFSRVYDDEFEKQMRLEVFTENLKFIENFNNMGSQSY 80

Query: 85  KLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQ-NVTDV-PTSIDWREKGA 142
           KLG N+F+D T EEF A +TG +    +   +    +T  +   V+DV  T+ DWR +GA
Sbjct: 81  KLGVNKFTDWTKEEFLATHTGLSGINVTSPFEVVNETTPAWNWTVSDVLGTTKDWRNEGA 140

Query: 143 VTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKA 201
           VT +K QG+CG CWAFSA+AAVEG+T+I RG LI LSEQQL+DC+ + N+GC GG M +A
Sbjct: 141 VTPVKYQGECGGCWAFSAIAAVEGLTKIARGNLISLSEQQLLDCAREQNNGCKGGTMIEA 200

Query: 202 FEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPV 261
           F YI++N G+++E  YPY+ +EG C +    A+   I  +E++P  +E+ALL+AVS QPV
Sbjct: 201 FNYIVKNGGVSSENAYPYQVKEGPCRSNDIPAIV--IRGFENVPSNNERALLEAVSRQPV 258

Query: 262 SVCVDASGRAFHFYKSGVLNA-DCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWG 320
           +V +DAS   F  Y  GV NA DCG + +H V +VG+GT++E  G KYWL KNSWG+TWG
Sbjct: 259 AVDIDASETGFIHYSGGVYNARDCGTSVNHAVTLVGYGTSQE--GIKYWLAKNSWGKTWG 316

Query: 321 ESGYIRILRDA----GLCGIATAASYPVA 345
           E+GYIRI RD     G+CG+A  ASYPVA
Sbjct: 317 ENGYIRIRRDVEWPQGMCGVAQYASYPVA 345


>gi|224083362|ref|XP_002306996.1| predicted protein [Populus trichocarpa]
 gi|222856445|gb|EEE93992.1| predicted protein [Populus trichocarpa]
          Length = 336

 Score =  323 bits (827), Expect = 1e-85,   Method: Compositional matrix adjust.
 Identities = 161/312 (51%), Positives = 212/312 (67%), Gaps = 12/312 (3%)

Query: 38  IVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNE 97
           I++  E W+++H + Y+   EK  R  IFK NL +I++ NK+    Y LG NEF+DL++E
Sbjct: 29  IIDLFESWISKHQKIYESIEEKWHRFEIFKDNLFHIDETNKK-VVNYWLGLNEFADLSHE 87

Query: 98  EFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWA 157
           EF+  Y G N     +S +      F Y++V+ +P S+DWR+KGAVT +K+QG CGSCWA
Sbjct: 88  EFKNKYLGLN---VDLSNRRECSEEFTYKDVSSIPKSVDWRKKGAVTDVKNQGSCGSCWA 144

Query: 158 FSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGLATEAD 216
           FS VAAVEGI QI  G L  LSEQ+LVDC T  N+GC+GGLMD AF YII N GL  E D
Sbjct: 145 FSTVAAVEGINQIVTGNLTSLSEQELVDCDTTYNNGCNGGLMDYAFAYIISNGGLHKEED 204

Query: 217 YPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYK 276
           YPY  EEGTC+ +K ++   TIS Y D+P+  E++LL+A++NQP+SV +DASGR F FY 
Sbjct: 205 YPYIMEEGTCEMRKAESEVVTISGYHDVPQNSEESLLKALANQPLSVAIDASGRDFQFYS 264

Query: 277 SGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD----AG 332
            GV +  CG   DHGVA VG+G+A+   G  + ++KNSWG  WGE G+IR+ R+    AG
Sbjct: 265 GGVFDGHCGTELDHGVAAVGYGSAK---GLDFIVVKNSWGSKWGEKGFIRMKRNTGKPAG 321

Query: 333 LCGIATAASYPV 344
           LCGI   ASYP 
Sbjct: 322 LCGINKMASYPT 333


>gi|226529105|ref|NP_001150196.1| cysteine protease 1 precursor [Zea mays]
 gi|194701798|gb|ACF84983.1| unknown [Zea mays]
 gi|194704800|gb|ACF86484.1| unknown [Zea mays]
 gi|195637480|gb|ACG38208.1| cysteine protease 1 precursor [Zea mays]
 gi|413919895|gb|AFW59827.1| cysteine protease 1 [Zea mays]
          Length = 470

 Score =  323 bits (827), Expect = 1e-85,   Method: Compositional matrix adjust.
 Identities = 157/327 (48%), Positives = 219/327 (66%), Gaps = 17/327 (5%)

Query: 30  GRSMHEPSIVEKHEQWMAQHGRTY----KDELEKAMRLNIFKQNLEYIEKAN-KEGNRTY 84
           G    EP +   ++ W+A+HGR Y    + E E+  R  +F  NL +++  N + G R +
Sbjct: 45  GLERTEPEVRAMYDLWLAEHGRAYNALGEGEGERDRRFLVFWDNLRFVDAHNERAGARGF 104

Query: 85  KLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTD-VPTSIDWREKGAV 143
           +LG N+F+DLTN+EFRA Y G    VP+  R +     +++    + +P S+DWREKGAV
Sbjct: 105 RLGMNQFADLTNDEFRAAYLGA--MVPAARRGAVVGERYRHDGAAEELPESVDWREKGAV 162

Query: 144 THIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD--NHGCSGGLMDKA 201
             +K+QGQCGSCWAFSAV++VE + QI  G+++ LSEQ+LV+CSTD  N GC+GGLMD A
Sbjct: 163 APVKNQGQCGSCWAFSAVSSVESVNQIVTGEMVTLSEQELVECSTDGGNSGCNGGLMDAA 222

Query: 202 FEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPV 261
           F++II+N G+ TE DYPYR  +G CD  ++ A   +I  +ED+P+ DE++L +AV++QPV
Sbjct: 223 FDFIIKNGGIDTEDDYPYRAVDGKCDMNRKNARVVSIDGFEDVPENDEKSLQKAVAHQPV 282

Query: 262 SVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGE 321
           SV ++A GR F  YKSGV +  C  N DHGV  VG+G    ENG  YW+++NSWG  WGE
Sbjct: 283 SVAIEAGGREFQLYKSGVFSGSCTTNLDHGVVAVGYGA---ENGKDYWIVRNSWGPKWGE 339

Query: 322 SGYIRILRD----AGLCGIATAASYPV 344
           +GYIR+ R+     G CGIA  ASYP 
Sbjct: 340 AGYIRMERNVNASTGKCGIAMMASYPT 366


>gi|255538210|ref|XP_002510170.1| cysteine protease, putative [Ricinus communis]
 gi|223550871|gb|EEF52357.1| cysteine protease, putative [Ricinus communis]
          Length = 469

 Score =  323 bits (827), Expect = 1e-85,   Method: Compositional matrix adjust.
 Identities = 157/315 (49%), Positives = 214/315 (67%), Gaps = 13/315 (4%)

Query: 38  IVEKHEQWMAQHGRTYKDEL---EKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDL 94
           ++  +E+W+ ++G+ + +     EK  R  +FK NL +I++ N E NR+YK+G N F+DL
Sbjct: 47  VMAIYEEWLVKNGKAHSNNNALGEKERRFQVFKDNLRFIDEHNSE-NRSYKVGLNRFADL 105

Query: 95  TNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGS 154
           TNEE+R++Y G  R     +R S   + +  +    +P S+DWR++GAV  +KDQG CGS
Sbjct: 106 TNEEYRSMYLG-ARSGAKRNRLSRSSNRYLPRVGDSLPDSVDWRKEGAVAEVKDQGSCGS 164

Query: 155 CWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGLAT 213
           CWAFS +AAVEGI +I  G LI LSEQ+LVDC    N GC+GGLMD AF++II N G+ +
Sbjct: 165 CWAFSTIAAVEGINKIVTGDLISLSEQELVDCDRSYNEGCNGGLMDYAFQFIINNGGIDS 224

Query: 214 EADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFH 273
           E DYPY   +GTCD  ++ A   TI  YED+P  DE+AL +AV+NQPVSV ++A GR F 
Sbjct: 225 EEDYPYLARDGTCDTYRKNAKVVTIDNYEDVPVNDEKALQKAVANQPVSVAIEAGGREFQ 284

Query: 274 FYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD--- 330
           FY+SG+    CG   DHGVA VG+GT   ENG  YW+++NSWG++WGESGYIR+ R+   
Sbjct: 285 FYQSGIFTGRCGTALDHGVAAVGYGT---ENGKDYWIVRNSWGKSWGESGYIRMERNIAT 341

Query: 331 -AGLCGIATAASYPV 344
             G CGIA   SYP+
Sbjct: 342 ATGKCGIAIEPSYPI 356


>gi|297791625|ref|XP_002863697.1| hypothetical protein ARALYDRAFT_917391 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297309532|gb|EFH39956.1| hypothetical protein ARALYDRAFT_917391 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 463

 Score =  322 bits (826), Expect = 1e-85,   Method: Compositional matrix adjust.
 Identities = 164/328 (50%), Positives = 219/328 (66%), Gaps = 21/328 (6%)

Query: 27  VVSGRSMHEPSIVEK-HEQWMAQHGRTYKDE----LEKAMRLNIFKQNLEYIEKANKEGN 81
            VS RS  E   VE+ +E WM +HG+   ++     EK  R  IFK NL YI++ N + N
Sbjct: 37  TVSSRSDAE---VERIYEAWMVEHGKKKMNQNGLGAEKDQRFEIFKDNLRYIDEHNTK-N 92

Query: 82  RTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKG 141
            +YKLG   F+DLTN+E+R++Y G  +PV  V + S R   ++ +    +P S+DWR++G
Sbjct: 93  LSYKLGLTRFADLTNDEYRSMYLG-AKPVKRVLKTSDR---YEARVGDALPDSVDWRKEG 148

Query: 142 AVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDK 200
           AV  +KDQG CGSCWAFS + AVEGI +I  G LI LSEQ+LVDC T  N GC+GGLMD 
Sbjct: 149 AVADVKDQGSCGSCWAFSTIGAVEGINKIVTGDLISLSEQELVDCDTSYNQGCNGGLMDY 208

Query: 201 AFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQP 260
           AFE+II+N G+ TEADYPY+  +G CD  ++ A   TI  YED+P+  E +L +A+++QP
Sbjct: 209 AFEFIIKNGGIDTEADYPYKAADGRCDQNRKNAKVVTIDSYEDVPENSEASLKKALAHQP 268

Query: 261 VSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWG 320
           +SV ++A GRAF  Y SGV +  CG   DHGV  VG+GT   ENG  YW+++NSWG  WG
Sbjct: 269 ISVAIEAGGRAFQLYSSGVFDGICGTELDHGVVAVGYGT---ENGKDYWIVRNSWGNRWG 325

Query: 321 ESGYIRILRD----AGLCGIATAASYPV 344
           ESGYI++ R+     G CGIA  ASYP+
Sbjct: 326 ESGYIKMARNIAEPTGKCGIAMEASYPI 353


>gi|18401614|ref|NP_564497.1| cysteine proteinase RD21a [Arabidopsis thaliana]
 gi|1172873|sp|P43297.1|RD21A_ARATH RecName: Full=Cysteine proteinase RD21a; Short=RD21; Flags:
           Precursor
 gi|12321010|gb|AAG50628.1|AC083835_13 cysteine protease, putative [Arabidopsis thaliana]
 gi|435619|dbj|BAA02374.1| thiol protease [Arabidopsis thaliana]
 gi|18175926|gb|AAL59952.1| putative cysteine proteinase RD21A [Arabidopsis thaliana]
 gi|22136972|gb|AAM91715.1| putative cysteine proteinase RD21A [Arabidopsis thaliana]
 gi|332194014|gb|AEE32135.1| cysteine proteinase RD21a [Arabidopsis thaliana]
          Length = 462

 Score =  322 bits (826), Expect = 1e-85,   Method: Compositional matrix adjust.
 Identities = 163/361 (45%), Positives = 227/361 (62%), Gaps = 36/361 (9%)

Query: 9   FIIPMFVIIILVITCASQVVS----------------GRSMHEPSIVEKHEQWMAQHGRT 52
           F+ P   I+ L +   S  V                 GRS  E  ++  +E W+ +HG+ 
Sbjct: 3   FLKPTMAILFLAMVAVSSAVDMSIISYDEKHGVSTTGGRS--EAEVMSIYEAWLVKHGKA 60

Query: 53  YKDE--LEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPV 110
                 +EK  R  IFK NL ++++ N E N +Y+LG   F+DLTN+E+R+ Y G     
Sbjct: 61  QSQNSLVEKDRRFEIFKDNLRFVDEHN-EKNLSYRLGLTRFADLTNDEYRSKYLG----- 114

Query: 111 PSVSRQSSRPSTFKYQ-NVTD-VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGIT 168
             + ++  R ++ +Y+  V D +P SIDWR+KGAV  +KDQG CGSCWAFS + AVEGI 
Sbjct: 115 AKMEKKGERRTSLRYEARVGDELPESIDWRKKGAVAEVKDQGGCGSCWAFSTIGAVEGIN 174

Query: 169 QITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCD 227
           QI  G LI LSEQ+LVDC T  N GC+GGLMD AFE+II+N G+ T+ DYPY+  +GTCD
Sbjct: 175 QIVTGDLITLSEQELVDCDTSYNEGCNGGLMDYAFEFIIKNGGIDTDKDYPYKGVDGTCD 234

Query: 228 NQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNN 287
             ++ A   TI  YED+P   E++L +AV++QP+S+ ++A GRAF  Y SG+ +  CG  
Sbjct: 235 QIRKNAKVVTIDSYEDVPTYSEESLKKAVAHQPISIAIEAGGRAFQLYDSGIFDGSCGTQ 294

Query: 288 CDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD----AGLCGIATAASYP 343
            DHGV  VG+GT   ENG  YW+++NSWG++WGESGY+R+ R+    +G CGIA   SYP
Sbjct: 295 LDHGVVAVGYGT---ENGKDYWIVRNSWGKSWGESGYLRMARNIASSSGKCGIAIEPSYP 351

Query: 344 V 344
           +
Sbjct: 352 I 352


>gi|224096714|ref|XP_002310708.1| predicted protein [Populus trichocarpa]
 gi|222853611|gb|EEE91158.1| predicted protein [Populus trichocarpa]
          Length = 356

 Score =  322 bits (826), Expect = 1e-85,   Method: Compositional matrix adjust.
 Identities = 162/317 (51%), Positives = 210/317 (66%), Gaps = 11/317 (3%)

Query: 35  EPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDL 94
           +  ++  ++ W+ +HG+ Y    EKA R  IFK NL +I++ N + NRTYK+G  +F+DL
Sbjct: 21  DDEVMSIYKWWLQKHGKAYNRLGEKAKRFEIFKNNLRFIDEHNSQ-NRTYKVGLTKFADL 79

Query: 95  TNEEFRALYTGYNRPVPSVSRQSSRPS-TFKYQNVTDVPTSIDWREKGAVTHIKDQGQCG 153
           TN+E+RA++ G          +S  PS  + Y+    +P S+DWR KGAV  IKDQG CG
Sbjct: 80  TNQEYRAMFLGTRSDPKRRLMKSKNPSERYAYKAGDKLPESVDWRGKGAVNPIKDQGSCG 139

Query: 154 SCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST-DNHGCSGGLMDKAFEYIIENKGLA 212
           SCWAFS VAAVEGI QI  G+LI LSEQ+LVDC    N GC+GGLMD AF++II N GL 
Sbjct: 140 SCWAFSTVAAVEGINQIVTGELISLSEQELVDCDRFYNAGCNGGLMDYAFQFIINNGGLD 199

Query: 213 TEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAF 272
           TE DYPY   + TCD  K K  A +I  +ED+   DE+AL +AV++QPVSV ++ASG A 
Sbjct: 200 TEKDYPYLGNDDTCDRDKMKTKAVSIDGFEDVLPFDEKALQKAVAHQPVSVAIEASGMAL 259

Query: 273 HFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD-- 330
            FY+SGV   +CG   DHGV VVG+GT   E G  YWL++NSWG  WGE GYI++ R+  
Sbjct: 260 QFYQSGVFTGECGTALDHGVVVVGYGT---EKGLDYWLVRNSWGTEWGEHGYIKMQRNVR 316

Query: 331 ---AGLCGIATAASYPV 344
               G CGIA  +SYPV
Sbjct: 317 DTYTGRCGIAMESSYPV 333


>gi|124484387|dbj|BAF46304.1| cysteine proteinase precursor [Ipomoea nil]
          Length = 474

 Score =  322 bits (826), Expect = 1e-85,   Method: Compositional matrix adjust.
 Identities = 160/323 (49%), Positives = 212/323 (65%), Gaps = 14/323 (4%)

Query: 30  GRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTN 89
           G    E  + E  E W+ +HG++Y    EK  R  IF+ NL+YI++ N   NR+YKLG N
Sbjct: 38  GLVRSEDEVKEMFESWLVKHGKSYNAVDEKDKRFKIFRDNLKYIDEKNSLENRSYKLGLN 97

Query: 90  EFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT--DVPTSIDWREKGAVTHIK 147
            F+D+TNEE+R  Y G  R     SR   +  + +Y  V    +P SIDWREKGAVT +K
Sbjct: 98  RFADITNEEYRTGYLGAKR---DASRNMVKSKSDRYAPVAGDSLPDSIDWREKGAVTGVK 154

Query: 148 DQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYII 206
           DQG CGSCWAFS +AAVEG+ Q+  G LI LSEQ+LVDC    N GC+GG M  AF++II
Sbjct: 155 DQGSCGSCWAFSTIAAVEGVNQLATGNLISLSEQELVDCDRKINQGCNGGDMGYAFQFII 214

Query: 207 ENKGLATEADYPYRHEEGTCDNQKE-KAVAATISKYEDLPKGDEQALLQAVSNQPVSVCV 265
           +N G+ +E DYPY  ++G CD+ ++  A  A+I  YE++P  +E++L +AV+NQPVSV +
Sbjct: 215 KNGGIDSEEDYPYTGKDGKCDSYRQNNAKVASIDGYEEVPVNNEKSLQKAVANQPVSVAI 274

Query: 266 DASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYI 325
           +A G  F  Y SG+    CG + DHGVA VG+GT   ENG  YW++KNSWG+ WGE GY+
Sbjct: 275 EAGGYDFQLYSSGIFTGSCGTDLDHGVAAVGYGT---ENGVDYWIVKNSWGDYWGEKGYV 331

Query: 326 RILRD----AGLCGIATAASYPV 344
           R+ R+     GLCGIA  ASYP 
Sbjct: 332 RMQRNVKAKTGLCGIAMEASYPT 354


>gi|62320725|dbj|BAD95392.1| cysteine proteinase RD21A [Arabidopsis thaliana]
          Length = 433

 Score =  322 bits (826), Expect = 1e-85,   Method: Compositional matrix adjust.
 Identities = 163/361 (45%), Positives = 227/361 (62%), Gaps = 36/361 (9%)

Query: 9   FIIPMFVIIILVITCASQVVS----------------GRSMHEPSIVEKHEQWMAQHGRT 52
           F+ P   I+ L +   S  V                 GRS  E  ++  +E W+ +HG+ 
Sbjct: 3   FLKPTMAILFLAMVAVSSAVDMSIISYDEKHGVSTTGGRS--EAEVMSIYEAWLVKHGKA 60

Query: 53  YKDE--LEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPV 110
                 +EK  R  IFK NL ++++ N E N +Y+LG   F+DLTN+E+R+ Y G     
Sbjct: 61  QSQNSLVEKDRRFEIFKDNLRFVDEHN-EKNLSYRLGLTRFADLTNDEYRSKYLG----- 114

Query: 111 PSVSRQSSRPSTFKYQ-NVTD-VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGIT 168
             + ++  R ++ +Y+  V D +P SIDWR+KGAV  +KDQG CGSCWAFS + AVEGI 
Sbjct: 115 AKMEKKGERRTSLRYEARVGDELPESIDWRKKGAVAEVKDQGGCGSCWAFSTIGAVEGIN 174

Query: 169 QITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCD 227
           QI  G LI LSEQ+LVDC T  N GC+GGLMD AFE+II+N G+ T+ DYPY+  +GTCD
Sbjct: 175 QIVTGDLITLSEQELVDCDTSYNEGCNGGLMDYAFEFIIKNGGIDTDKDYPYKGVDGTCD 234

Query: 228 NQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNN 287
             ++ A   TI  YED+P   E++L +AV++QP+S+ ++A GRAF  Y SG+ +  CG  
Sbjct: 235 QIRKNAKVVTIDSYEDVPTYSEESLKKAVAHQPISIAIEAGGRAFQLYDSGIFDGSCGTQ 294

Query: 288 CDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD----AGLCGIATAASYP 343
            DHGV  VG+GT   ENG  YW+++NSWG++WGESGY+R+ R+    +G CGIA   SYP
Sbjct: 295 LDHGVVAVGYGT---ENGKDYWIVRNSWGKSWGESGYLRMARNIASSSGKCGIAIEPSYP 351

Query: 344 V 344
           +
Sbjct: 352 I 352


>gi|18141285|gb|AAL60580.1|AF454958_1 senescence-associated cysteine protease [Brassica oleracea]
          Length = 485

 Score =  322 bits (826), Expect = 1e-85,   Method: Compositional matrix adjust.
 Identities = 165/359 (45%), Positives = 230/359 (64%), Gaps = 29/359 (8%)

Query: 3   LKFEKSFIIPMFVIIILVITCAS----------QVVSGRSMHEPSIVEKHEQWMAQHGRT 52
           +K   S  + +F+ +I+V +               VS RS  E S +  +E+W+ +HG+ 
Sbjct: 1   MKLLNSATVILFLTMIVVSSAMDMSIISYDKNHHTVSSRSDAEVSRL--YEEWLVKHGKA 58

Query: 53  YKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPS 112
                EK  R  IFK NL +I++ N + N +Y+LG  +F+DLTN+E+R++Y G      S
Sbjct: 59  QNSLTEKDRRFEIFKDNLRFIDEHNGK-NLSYRLGLTKFADLTNDEYRSMYLG------S 111

Query: 113 VSRQSSRPSTFKYQ-NVTD-VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQI 170
             ++ +  S+ +Y+  V D +P S+DWR++GAV  +KDQG CGSCWAFS + AVEGI +I
Sbjct: 112 RLKRKATKSSLRYEVRVGDAIPESVDWRKEGAVAEVKDQGSCGSCWAFSTIGAVEGINKI 171

Query: 171 TRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQ 229
             G LI LSEQ+LVDC T  N GC+GGLMD AFE+II N G+ TE DYPY+  +G CD  
Sbjct: 172 VTGDLITLSEQELVDCDTSYNEGCNGGLMDYAFEFIINNGGIDTEEDYPYKGVDGRCDQT 231

Query: 230 KEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCD 289
           ++ A   TI  YED+P   E++L +A+S+QP+SV ++  GRAF  Y SG+ +  CG + D
Sbjct: 232 RKNAKVVTIDLYEDVPANSEESLKKALSHQPISVAIEGGGRAFQLYDSGIFDGICGTDLD 291

Query: 290 HGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD----AGLCGIATAASYPV 344
           HGV  VG+GT   ENG  YW++KNSWG +WGESGYIR+ R+    AG CGIA   SYP+
Sbjct: 292 HGVVAVGYGT---ENGKDYWIVKNSWGTSWGESGYIRMERNIASSAGKCGIAVEPSYPI 347


>gi|357467173|ref|XP_003603871.1| Cysteine proteinase [Medicago truncatula]
 gi|355492919|gb|AES74122.1| Cysteine proteinase [Medicago truncatula]
 gi|388499154|gb|AFK37643.1| unknown [Medicago truncatula]
          Length = 350

 Score =  322 bits (826), Expect = 1e-85,   Method: Compositional matrix adjust.
 Identities = 157/311 (50%), Positives = 216/311 (69%), Gaps = 12/311 (3%)

Query: 38  IVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNE 97
           ++E  E WM++HG+ Y+   EK +R  +FK NL++I+  NK  +  Y LG NEF+DL+++
Sbjct: 43  LIELFESWMSRHGKIYETIEEKLLRFEVFKDNLKHIDDRNKVVS-NYWLGLNEFADLSHQ 101

Query: 98  EFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWA 157
           EF+  Y G    V    R+ S    F Y++V D+P S+DWR+KGAVT +K+QGQCGSCWA
Sbjct: 102 EFKNKYLGL--KVDLSQRRESSEEEFTYRDV-DLPKSVDWRKKGAVTPVKNQGQCGSCWA 158

Query: 158 FSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGLATEAD 216
           FS VAAVEGI QI  G L  LSEQ+L+DC T  N+GC+GGLMD AF +I++N GL  E D
Sbjct: 159 FSTVAAVEGINQIVTGNLTSLSEQELIDCDTTYNNGCNGGLMDYAFSFIVKNGGLHKEED 218

Query: 217 YPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYK 276
           YPY  EE TC+ +KE +   TI+ Y D+P+ +EQ+LL+A++NQP+SV ++ASGR F FY 
Sbjct: 219 YPYIMEESTCEMKKEVSEVVTINGYHDVPQNNEQSLLKALANQPLSVAIEASGRDFQFYS 278

Query: 277 SGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRDA----G 332
            GV +  CG+  DHGV+ VG+GT++   G  Y ++KNSWG  WGE G+IR+ R+     G
Sbjct: 279 GGVFDGHCGSELDHGVSAVGYGTSK---GLDYIIVKNSWGAKWGEKGFIRMKRNIGKSEG 335

Query: 333 LCGIATAASYP 343
           +CG+   ASYP
Sbjct: 336 ICGLYKMASYP 346


>gi|356559055|ref|XP_003547817.1| PREDICTED: cysteine proteinase RD21a [Glycine max]
          Length = 366

 Score =  322 bits (826), Expect = 1e-85,   Method: Compositional matrix adjust.
 Identities = 153/343 (44%), Positives = 210/343 (61%), Gaps = 9/343 (2%)

Query: 8   SFIIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFK 67
           + +    + +   ++CA    +  +  +  ++  +E+W+ +H + Y    EK  R  +FK
Sbjct: 6   TLVTSTLLFLSFTLSCAIDTSTITNYTDNEVMTMYEEWLVKHQKVYNGLREKDKRFQVFK 65

Query: 68  QNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPS-VSRQSSRPSTFKYQ 126
            NL +I++ N   N TYKLG N+F+D+TNEE+R +Y G        + +  S    + Y 
Sbjct: 66  DNLGFIQEHNNNQNNTYKLGLNQFADMTNEEYRVMYFGTKSDAKRRLMKTKSTGHRYAYS 125

Query: 127 NVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDC 186
               +P  +DWR KGAV  IKDQG CGSCWAFS VA VE I +I  GK + LSEQ+LVDC
Sbjct: 126 AGDRLPVHVDWRVKGAVAPIKDQGSCGSCWAFSTVATVEAINKIVTGKFVSLSEQELVDC 185

Query: 187 STD-NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLP 245
               N GC+GGLMD AFE+II+N G+ T+ DYPYR  +G CD  K+ A    I  +ED+P
Sbjct: 186 DRAYNEGCNGGLMDYAFEFIIQNGGIDTDKDYPYRGFDGICDPTKKNAKVVNIDGFEDVP 245

Query: 246 KGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENG 305
             DE AL +AV++QPVS+ ++ASGR    Y+SGV    CG + DHGV VVG+G+   ENG
Sbjct: 246 PYDENALKKAVAHQPVSIAIEASGRDLQLYQSGVFTGKCGTSLDHGVVVVGYGS---ENG 302

Query: 306 AKYWLIKNSWGETWGESGYIRILRD----AGLCGIATAASYPV 344
             YWL++NSWG  WGE GY ++ R+     G CGI   ASYPV
Sbjct: 303 VDYWLVRNSWGTGWGEDGYFKMQRNVRTPTGKCGITMEASYPV 345


>gi|242070333|ref|XP_002450443.1| hypothetical protein SORBIDRAFT_05g005530 [Sorghum bicolor]
 gi|241936286|gb|EES09431.1| hypothetical protein SORBIDRAFT_05g005530 [Sorghum bicolor]
          Length = 351

 Score =  322 bits (825), Expect = 1e-85,   Method: Compositional matrix adjust.
 Identities = 167/345 (48%), Positives = 227/345 (65%), Gaps = 24/345 (6%)

Query: 14  FVIIILVITCASQVVSGRSMH------EPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFK 67
            + ++ +  C    V+ R +       E ++  +HE+WM +HGRTYKDE EKA R  +FK
Sbjct: 18  LLTVLAIANCIGCAVAARDLSSSTGYGEEAMTARHEKWMVEHGRTYKDEAEKARRFQVFK 77

Query: 68  QNLEYIEKANKE-GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQ 126
            N  +++ +N   G + Y L  N F+D+T++EF A YTG+ +P+P+  +   +   FKY 
Sbjct: 78  ANAAFVDTSNAAAGGKKYHLAINRFADMTHDEFMARYTGF-KPLPATGK---KMPGFKYA 133

Query: 127 NVT---DVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQL 183
           NVT   +   ++DWR+KGAVT +K+Q +CG CWAFSAVAA+EG+ QI  G+L+ LSEQQL
Sbjct: 134 NVTLSSEDQQAVDWRKKGAVTDVKNQQKCGCCWAFSAVAAIEGMHQINTGELVSLSEQQL 193

Query: 184 VDCST--DNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKY 241
           VDCST  +N+GC GG M+ AF+Y+I N G+ATEA YPY   +G C N +    A  +  Y
Sbjct: 194 VDCSTNGNNNGCGGGTMEDAFQYVIGNNGIATEAAYPYTAMQGMCQNVQP---AVAVRSY 250

Query: 242 EDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNAD-CGNNCDHGVAVVGFGTA 300
           + +P+ DE AL  AV+ QPVSV VDA+   F FYK GV+ AD CG N +H V  VG+GTA
Sbjct: 251 QQVPRDDEDALAAAVAGQPVSVAVDANN--FQFYKGGVMTADSCGTNLNHAVTAVGYGTA 308

Query: 301 EEENGAKYWLIKNSWGETWGESGYIRILRDAGLCGIATAASYPVA 345
           E+  G  YWL+KN WG TWGE GY+R+ R  G CG+A  ASYPVA
Sbjct: 309 ED--GTPYWLLKNQWGSTWGEEGYLRLQRGVGACGVAKDASYPVA 351


>gi|222425026|dbj|BAH20463.1| cysteine protease [Spinacia oleracea]
          Length = 473

 Score =  322 bits (825), Expect = 2e-85,   Method: Compositional matrix adjust.
 Identities = 155/330 (46%), Positives = 219/330 (66%), Gaps = 14/330 (4%)

Query: 26  QVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYK 85
            ++   S  +  ++  +E W+ QH + Y    EK  R  IFK NLE+I++ N + ++T+K
Sbjct: 37  NLLPSSSRSDDEVMRIYESWLVQHRKNYNALGEKEKRFAIFKDNLEFIDQHNSDDSQTFK 96

Query: 86  LGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSS-----RPSTFKYQNVTDVPTSIDWREK 140
           +G N+F+DLTNEEFR++Y G  +   S    SS     +   + ++   ++P ++DWR+ 
Sbjct: 97  VGLNKFADLTNEEFRSVYLGRKKSSSSSPLLSSAKSKVKSDRYLFKEGDELPEAVDWRKN 156

Query: 141 GAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMD 199
           GAV  +KDQGQCGSCWAFS +AAVEGI QI  G+L+ LSEQ+LVDC T  N GC GGLMD
Sbjct: 157 GAVAKVKDQGQCGSCWAFSTIAAVEGINQIVTGELLSLSEQELVDCDTSYNSGCDGGLMD 216

Query: 200 KAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQ 259
            A+E+II N G+ T+ADYPY  ++G CD  ++ A   TI  +ED+P+ DE+AL +AV++Q
Sbjct: 217 YAYEFIINNGGIDTDADYPYTAKDGKCDQYRKNAKVVTIDDFEDVPENDEKALQKAVAHQ 276

Query: 260 PVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETW 319
           PVSV ++A G  F FY+SGV    CG + DHGV  VG+G+   ++G  YW+++NSWG  W
Sbjct: 277 PVSVAIEAGGSTFQFYQSGVFTGKCGADLDHGVVAVGYGS---DDGKDYWIVRNSWGADW 333

Query: 320 GESGYIRILRD-----AGLCGIATAASYPV 344
           GESGYIR+ R+      G CGIA   SYP+
Sbjct: 334 GESGYIRMERNLETVKTGKCGIAIEPSYPI 363


>gi|224133764|ref|XP_002321655.1| predicted protein [Populus trichocarpa]
 gi|222868651|gb|EEF05782.1| predicted protein [Populus trichocarpa]
          Length = 360

 Score =  322 bits (825), Expect = 2e-85,   Method: Compositional matrix adjust.
 Identities = 163/347 (46%), Positives = 222/347 (63%), Gaps = 12/347 (3%)

Query: 5   FEKSFIIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDEL-EKAMRL 63
            +K   + +++ ++L  T +          E S+ + +E+W + H  T    L EK  R 
Sbjct: 1   MKKLLFVALYLALVLGFTESFDFHEKDLESEESLWDLYEKWRSHH--TVSTSLDEKRKRF 58

Query: 64  NIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPS-T 122
           N+F+ N+ ++   NK  ++ YKL  N+F+D+TN EFR  Y        ++ R +   + +
Sbjct: 59  NVFRANVLHVHNTNKM-DKPYKLKLNKFADMTNHEFRTAYASSKVKHHTMFRGAPLGNGS 117

Query: 123 FKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQ 182
           F Y N+  VP SIDWR+KGAVT +KDQG+CGSCWAFS + AVEGI  I   KLI LSEQ+
Sbjct: 118 FMYGNIDKVPASIDWRKKGAVTPVKDQGKCGSCWAFSTIVAVEGINFIKTNKLISLSEQE 177

Query: 183 LVDCST-DNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKY 241
           LVDC+T +NHGC+GGLMD AFE+I + KG+ TEA+YPYR ++G CD  K    A +I  +
Sbjct: 178 LVDCNTGENHGCNGGLMDYAFEFITKQKGITTEANYPYRAQDGHCDANKANQPAVSIDGH 237

Query: 242 EDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAE 301
           ED+   +E ALL+AV+NQPVSV +DA G  F FY  GV   +CG   DHGVA+VG+GT  
Sbjct: 238 EDVLHNNENALLKAVANQPVSVAIDAGGSDFQFYSEGVFTGECGKELDHGVAIVGYGTTV 297

Query: 302 EENGAKYWLIKNSWGETWGESGYIRILRDA----GLCGIATAASYPV 344
           +  G KYW+++NSWG  WGE GYIR+ R      GLCGIA  ASYP+
Sbjct: 298 D--GTKYWIVRNSWGPEWGERGYIRMQRGISDRRGLCGIAMEASYPI 342


>gi|14517542|gb|AAK62661.1| F2G19.31/F2G19.31 [Arabidopsis thaliana]
 gi|19548039|gb|AAL87383.1| F2G19.31/F2G19.31 [Arabidopsis thaliana]
          Length = 462

 Score =  322 bits (825), Expect = 2e-85,   Method: Compositional matrix adjust.
 Identities = 163/361 (45%), Positives = 227/361 (62%), Gaps = 36/361 (9%)

Query: 9   FIIPMFVIIILVITCASQVVS----------------GRSMHEPSIVEKHEQWMAQHGRT 52
           F+ P   I+ L +   S  V                 GRS  E  ++  +E W+ +HG+ 
Sbjct: 3   FLKPTMAILFLAMVTVSSAVDMSIISYDEKHGVSTTGGRS--EAEVMSIYEAWLVKHGKA 60

Query: 53  YKDE--LEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPV 110
                 +EK  R  IFK NL ++++ N E N +Y+LG   F+DLTN+E+R+ Y G     
Sbjct: 61  QSQNSLVEKDRRFEIFKDNLRFVDEHN-EKNLSYRLGLTRFADLTNDEYRSKYLG----- 114

Query: 111 PSVSRQSSRPSTFKYQ-NVTD-VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGIT 168
             + ++  R ++ +Y+  V D +P SIDWR+KGAV  +KDQG CGSCWAFS + AVEGI 
Sbjct: 115 AKMEKKGERRTSLRYEARVGDELPESIDWRKKGAVAEVKDQGGCGSCWAFSTIGAVEGIN 174

Query: 169 QITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCD 227
           QI  G LI LSEQ+LVDC T  N GC+GGLMD AFE+II+N G+ T+ DYPY+  +GTCD
Sbjct: 175 QIVTGDLITLSEQELVDCDTSYNEGCNGGLMDYAFEFIIKNGGIDTDKDYPYKGVDGTCD 234

Query: 228 NQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNN 287
             ++ A   TI  YED+P   E++L +AV++QP+S+ ++A GRAF  Y SG+ +  CG  
Sbjct: 235 QIRKNAKVVTIDSYEDVPTYSEESLKKAVAHQPISIAIEAGGRAFQLYDSGIFDGSCGTQ 294

Query: 288 CDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD----AGLCGIATAASYP 343
            DHGV  VG+GT   ENG  YW+++NSWG++WGESGY+R+ R+    +G CGIA   SYP
Sbjct: 295 LDHGVVAVGYGT---ENGKDYWIVRNSWGKSWGESGYLRMARNIASSSGKCGIAIEPSYP 351

Query: 344 V 344
           +
Sbjct: 352 I 352


>gi|37780041|gb|AAP32193.1| cysteine protease 14 [Trifolium repens]
          Length = 351

 Score =  322 bits (825), Expect = 2e-85,   Method: Compositional matrix adjust.
 Identities = 156/312 (50%), Positives = 219/312 (70%), Gaps = 11/312 (3%)

Query: 38  IVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNE 97
           ++E  E WM++HG+ Y+   EK +R  +FK NL++I++ NK  +  Y LG NEF+DL+++
Sbjct: 43  LIELFESWMSRHGKIYETIEEKLLRFEVFKDNLKHIDERNKIVS-NYWLGLNEFADLSHQ 101

Query: 98  EFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWA 157
           EF+  Y G    + S  R+SS    F Y++V D+P S+DWR+KGAVT +K+QGQCGSCWA
Sbjct: 102 EFKNKYLGLKVNL-SQRRESSNEEEFTYRDV-DLPKSVDWRKKGAVTPVKNQGQCGSCWA 159

Query: 158 FSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGLATEAD 216
           FS VAAVEGI QI  G L  LSEQ+L+DC T  N+GC+GGLMD AF +I++N GL  E D
Sbjct: 160 FSTVAAVEGINQIVTGNLTSLSEQELIDCDTTYNNGCNGGLMDYAFSFIVQNGGLHKEDD 219

Query: 217 YPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYK 276
           YPY  EE TC+ +KE+    TI+ Y D+P+ +EQ+LL+A++NQP+SV ++AS R F FY 
Sbjct: 220 YPYIMEESTCEMKKEETQVVTINGYHDVPQNNEQSLLKALANQPLSVAIEASSRDFQFYS 279

Query: 277 SGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRDA----G 332
            GV +  CG++ DHGV+ VG+GT++  +   Y ++KNSWG  WGE G+IR+ R+     G
Sbjct: 280 GGVFDGHCGSDLDHGVSAVGYGTSKNLD---YIIVKNSWGAKWGEKGFIRMKRNIGKPEG 336

Query: 333 LCGIATAASYPV 344
           +CG+   ASYP 
Sbjct: 337 ICGLYKMASYPT 348


>gi|357474523|ref|XP_003607546.1| Cysteine proteinase [Medicago truncatula]
 gi|358347207|ref|XP_003637651.1| Cysteine proteinase [Medicago truncatula]
 gi|355503586|gb|AES84789.1| Cysteine proteinase [Medicago truncatula]
 gi|355508601|gb|AES89743.1| Cysteine proteinase [Medicago truncatula]
          Length = 345

 Score =  322 bits (824), Expect = 2e-85,   Method: Compositional matrix adjust.
 Identities = 164/350 (46%), Positives = 223/350 (63%), Gaps = 26/350 (7%)

Query: 11  IPMFVIIILVITCASQVVSGRSMHEPS----------IVEKHEQWMAQHGRTYKDELEKA 60
           I    I IL++ C + V++  S   P+          + ++ + W+ +HGR YK   E+ 
Sbjct: 5   ILTTTIFILLMLCNTCVIASESECPPTHKQKSSDVEAMKKRFDGWVKRHGRKYKHNDERE 64

Query: 61  MRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRP 120
           +R  I++ N++YI+  N + N +Y L  N+F+DLTNEEF++ Y G +      +R  S  
Sbjct: 65  VRFGIYQANVQYIQCKNAQKN-SYNLTDNKFADLTNEEFQSTYMGLS------TRLRSHN 117

Query: 121 STFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSE 180
           + F+Y    D+P S DWR++GAVT I DQGQCG CWAF+AVAAVEGI +I  GKLI LSE
Sbjct: 118 TGFRYDEHGDLPESKDWRKEGAVTEIMDQGQCGGCWAFAAVAAVEGINKIKSGKLISLSE 177

Query: 181 QQLVDCS--TDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATI 238
           Q+L+DC   + N GC GGLM+ A+ +IIEN GL TE DYPY   +GTC  +K    AA+I
Sbjct: 178 QELIDCDVKSGNQGCQGGLMETAYTFIIENGGLTTEQDYPYEGVDGTCKMEKAAHYAASI 237

Query: 239 SKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFG 298
           S YE++P  +E  L  A ++QPVSV +DA G +F FY  GV +  CG   +HGV VVG+G
Sbjct: 238 SGYEEVPADNEAKLKAAAAHQPVSVAIDAGGYSFQFYSEGVFSGICGKQLNHGVTVVGYG 297

Query: 299 TAEEENGAKYWLIKNSWGETWGESGYIRILRDA----GLCGIATAASYPV 344
              +E   KYW++KNSWG  WGESGYIR+ RD     G+CGIA  ASYP+
Sbjct: 298 ---KETINKYWIVKNSWGADWGESGYIRMKRDTLSKEGMCGIAMQASYPL 344


>gi|413933049|gb|AFW67600.1| cysteine protease 1 [Zea mays]
          Length = 341

 Score =  322 bits (824), Expect = 2e-85,   Method: Compositional matrix adjust.
 Identities = 165/342 (48%), Positives = 224/342 (65%), Gaps = 17/342 (4%)

Query: 14  FVIIILVI----TCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQN 69
           F++ +LV+     C +      +    ++  +HE+WMA+HGR YKDE EKA RL +F+ N
Sbjct: 6   FLLAVLVVGSAVLCTAAAPRALAAAAAAMASRHEKWMAEHGRAYKDEAEKARRLEVFRAN 65

Query: 70  LEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQN-- 127
            E I+  N  G  +++L TN F+DLT EEFRA  TG  RP P+ S  + R   F+Y+N  
Sbjct: 66  AELIDSFNAAGTHSHRLATNRFADLTVEEFRAARTGL-RPRPAPSAGAGR---FRYENFS 121

Query: 128 VTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDC- 186
           + D   S+DWR  GAVT +KDQG CG CWAFSAVAAVEG+ +I  G+L+ LSEQ+LVDC 
Sbjct: 122 LADAAQSVDWRAMGAVTGVKDQGACGCCWAFSAVAAVEGLNKIRTGRLVSLSEQELVDCD 181

Query: 187 -STDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLP 245
            S  + GC GGLMD AF+++    GLA+E+ YPY+  +G C +    A AA+I  +ED+P
Sbjct: 182 VSGVDQGCDGGLMDNAFQFVARRGGLASESGYPYQGRDGPCRSSAAAARAASIRGHEDVP 241

Query: 246 KGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENG 305
           + +E AL  AV+NQPVSV ++    AF FY SGVL   CG + +H +  VG+GTA +  G
Sbjct: 242 RNNEAALAAAVANQPVSVAINGEDMAFRFYDSGVLGGACGTDLNHAITAVGYGTAND--G 299

Query: 306 AKYWLIKNSWGETWGESGYIRI---LRDAGLCGIATAASYPV 344
            +YWL+KNSWG +WGE GY+RI   +R  G+CG+A   SYPV
Sbjct: 300 TRYWLMKNSWGASWGEGGYVRIRRGVRGEGVCGLAKLPSYPV 341


>gi|162463464|ref|NP_001104879.1| cysteine proteinase Mir3 precursor [Zea mays]
 gi|2425066|gb|AAB88263.1| cysteine proteinase Mir3 [Zea mays]
          Length = 480

 Score =  321 bits (823), Expect = 3e-85,   Method: Compositional matrix adjust.
 Identities = 157/311 (50%), Positives = 204/311 (65%), Gaps = 14/311 (4%)

Query: 42  HEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKAN---KEGNRTYKLGTNEFSDLTNEE 98
           + +WMA HGRTY     +  R  +F+ NL YI+  N     G  +++LG N F+DLTN+E
Sbjct: 44  YAEWMAAHGRTYNAVGAEERRYQVFRDNLRYIDAHNAAADAGVHSFRLGLNRFADLTNDE 103

Query: 99  FRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAF 158
           + A Y G  R  P   R+    + +   +  D+P S+DWR KGAV  +KDQG CG+CWAF
Sbjct: 104 YPATYLG-ARTRPQRDRKLG--ARYHAADNEDLPESVDWRAKGAVAEVKDQGSCGTCWAF 160

Query: 159 SAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGLATEADY 217
           S +AAVEGI QI  G LI LSEQ+LVDC T  N GC+GGLMD AFE+II N G+ TE DY
Sbjct: 161 STIAAVEGINQIVTGDLISLSEQELVDCDTSYNQGCNGGLMDYAFEFIINNGGIDTEKDY 220

Query: 218 PYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKS 277
           PY+  +G CD  ++ A   TI  YED+P  DE++L +AV+NQPVSV ++A+G AF  Y S
Sbjct: 221 PYKGTDGRCDVNRKNAKVVTIDSYEDVPANDEKSLQKAVANQPVSVAIEAAGTAFQLYSS 280

Query: 278 GVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD----AGL 333
           G+    CG   DHGV  VG+GT   ENG  YW++KNSWG +WGESGY+R+ R+    +G 
Sbjct: 281 GIFTGSCGTRLDHGVTAVGYGT---ENGKDYWIVKNSWGSSWGESGYVRMERNIKASSGK 337

Query: 334 CGIATAASYPV 344
           CGIA   SYP+
Sbjct: 338 CGIAVEPSYPL 348


>gi|297598407|ref|NP_001045533.2| Os01g0971400 [Oryza sativa Japonica Group]
 gi|15289977|dbj|BAB63672.1| putative cysteine protease CP1 [Oryza sativa Japonica Group]
 gi|125529282|gb|EAY77396.1| hypothetical protein OsI_05384 [Oryza sativa Indica Group]
 gi|125573472|gb|EAZ14987.1| hypothetical protein OsJ_04922 [Oryza sativa Japonica Group]
 gi|215740756|dbj|BAG97412.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|215741010|dbj|BAG97505.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|215765325|dbj|BAG87022.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|215767338|dbj|BAG99566.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|255674119|dbj|BAF07447.2| Os01g0971400 [Oryza sativa Japonica Group]
          Length = 365

 Score =  321 bits (823), Expect = 3e-85,   Method: Compositional matrix adjust.
 Identities = 166/321 (51%), Positives = 212/321 (66%), Gaps = 20/321 (6%)

Query: 38  IVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNE 97
           ++E  E++MA++ + Y    EK  R  +FK NL +I++ NK+    Y LG NEF+DLT++
Sbjct: 48  LMELFEKFMAKYRKAYSSLEEKLRRFEVFKDNLNHIDEENKK-ITGYWLGLNEFADLTHD 106

Query: 98  EFRALYTGYNRPVPSVSRQSSRPSTFKYQNV--TDVPTSIDWREKGAVTHIKDQGQCGSC 155
           EF+A Y G      + +R++S    F+Y+ V    +P  +DWR+KGAVT +K+QGQCGSC
Sbjct: 107 EFKAAYLGL---TLTPARRNSNDQLFRYEEVEAASLPKEVDWRKKGAVTEVKNQGQCGSC 163

Query: 156 WAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGLATE 214
           WAFS VAAVEGI  I  G L  LSEQ+L+DC TD N+GCSGGLMD AF YI  N GL TE
Sbjct: 164 WAFSTVAAVEGINAIVTGNLTRLSEQELIDCDTDGNNGCSGGLMDYAFSYIAANGGLHTE 223

Query: 215 ADYPYRHEEGTC-------DNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDA 267
             YPY  EEGTC       D+  E A A TIS YED+P+ +EQALL+A+++QPVSV ++A
Sbjct: 224 ESYPYLMEEGTCRRGSTEGDDDGEAAAAVTISGYEDVPRNNEQALLKALAHQPVSVAIEA 283

Query: 268 SGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRI 327
           SGR F FY  GV +  CG   DHGV  VG+GTA +  G  Y ++KNSWG  WGE GYIR+
Sbjct: 284 SGRNFQFYSGGVFDGPCGTRLDHGVTAVGYGTASK--GHDYIIVKNSWGSHWGEKGYIRM 341

Query: 328 LRDA----GLCGIATAASYPV 344
            R      GLCGI   ASYP 
Sbjct: 342 RRGTGKHDGLCGINKMASYPT 362


>gi|171702831|dbj|BAG16371.1| cysteine protease [Brassica oleracea var. italica]
          Length = 441

 Score =  321 bits (823), Expect = 3e-85,   Method: Compositional matrix adjust.
 Identities = 160/326 (49%), Positives = 218/326 (66%), Gaps = 19/326 (5%)

Query: 26  QVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYK 85
             VS RS  E S +  +E+W+ +HG+      EK  R  IFK NL +I++ N + N +Y+
Sbjct: 28  HTVSSRSDAEVSRL--YEEWLVKHGKAQNSLTEKDRRFEIFKDNLRFIDEHNGK-NLSYR 84

Query: 86  LGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQ-NVTD-VPTSIDWREKGAV 143
           LG  +F+DLTN+E+R++Y G      S  ++ +  S+ +Y+  V D +P S+DWR++GAV
Sbjct: 85  LGLTKFADLTNDEYRSMYLG------SRLKRKATKSSLRYEVRVGDAIPESVDWRKEGAV 138

Query: 144 THIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAF 202
             +KDQG CGSCWAFS + AVEGI +I  G LI LSEQ+LVDC T  N GC+GGLMD AF
Sbjct: 139 AEVKDQGSCGSCWAFSTIGAVEGINKIVTGDLITLSEQELVDCDTSYNEGCNGGLMDYAF 198

Query: 203 EYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVS 262
           E+II N G+ TE DYPY+  +G CD  ++ A   TI  YED+P   E++L +A+S+QP+S
Sbjct: 199 EFIINNGGIDTEEDYPYKGVDGRCDQTRKNAKVVTIDLYEDVPANSEESLKKALSHQPIS 258

Query: 263 VCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGES 322
           V ++  GRAF  Y SG+ +  CG + DHGV  VG+GT   ENG  YW++KNSWG +WGES
Sbjct: 259 VAIEGGGRAFQLYDSGIFDGICGTDLDHGVVAVGYGT---ENGKDYWIVKNSWGTSWGES 315

Query: 323 GYIRILRD----AGLCGIATAASYPV 344
           GYIR+ R+    AG CGIA   SYP+
Sbjct: 316 GYIRMERNIASSAGKCGIAVEPSYPI 341


>gi|171702843|dbj|BAG16377.1| cysteine protease [Brassica rapa var. perviridis]
          Length = 431

 Score =  321 bits (822), Expect = 4e-85,   Method: Compositional matrix adjust.
 Identities = 159/325 (48%), Positives = 217/325 (66%), Gaps = 17/325 (5%)

Query: 26  QVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYK 85
             VS RS  E S +  +E+W+ +HG+      EK  R  IFK NL +I++ N + N +Y+
Sbjct: 28  HTVSSRSDVEVSRL--YEEWVVKHGKAQNSLTEKDRRFEIFKDNLRFIDEHNGK-NLSYR 84

Query: 86  LGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTD-VPTSIDWREKGAVT 144
           LG  +F+DLTN+E+R++Y G       + R++++ S      V D +P S+DWR++GAV 
Sbjct: 85  LGLTKFADLTNDEYRSMYLG-----SRLKRKATKTSLRYEARVGDAIPESVDWRKEGAVA 139

Query: 145 HIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFE 203
            +KDQG CGSCWAFS + AVEGI +I  G LI LSEQ+LVDC T  N GC+GGLMD AFE
Sbjct: 140 EVKDQGSCGSCWAFSTIGAVEGINKIVTGDLISLSEQELVDCDTSYNEGCNGGLMDYAFE 199

Query: 204 YIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSV 263
           +II+N G+ TE DYPY+  +G CD  ++ A   TI  YED+P   E++L +A+S+QP+SV
Sbjct: 200 FIIKNGGIDTEEDYPYKGVDGRCDQTRKNAKVVTIDSYEDVPANSEESLKKALSHQPISV 259

Query: 264 CVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESG 323
            ++  GRAF  Y SG+ +  CG + DHGV  VG+GT   ENG  YW++KNSWG +WGESG
Sbjct: 260 AIEGGGRAFQLYDSGIFDGICGTDLDHGVVAVGYGT---ENGKDYWIVKNSWGTSWGESG 316

Query: 324 YIRILRD----AGLCGIATAASYPV 344
           YIR+ R+    AG CGIA   SYP+
Sbjct: 317 YIRMERNIASSAGKCGIAVEPSYPI 341


>gi|449522968|ref|XP_004168497.1| PREDICTED: xylem cysteine proteinase 1-like [Cucumis sativus]
          Length = 348

 Score =  321 bits (822), Expect = 4e-85,   Method: Compositional matrix adjust.
 Identities = 156/311 (50%), Positives = 215/311 (69%), Gaps = 12/311 (3%)

Query: 38  IVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNE 97
           ++E  E+W++ HG+ Y+   EK  R  +FK NL++I++ NK+   +Y LG NEF+DLT++
Sbjct: 41  LIELFEEWISNHGKIYETIEEKWHRFEVFKDNLKHIDETNKKVT-SYWLGVNEFADLTHQ 99

Query: 98  EFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWA 157
           EF+ +Y G  +   S +RQS  P  F Y++V D+P S+DWR+KGAVT +K+QG CGSCWA
Sbjct: 100 EFKNMYLGL-KVESSRTRQS--PEEFTYKDVVDLPKSVDWRKKGAVTRVKNQGSCGSCWA 156

Query: 158 FSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGLATEAD 216
           FS VAAVEGI +I  G L  LSEQ+L+DC    N+GC GGLMD AF +I+ + GL  E D
Sbjct: 157 FSTVAAVEGINKIVGGNLTSLSEQELIDCDRPYNNGCHGGLMDYAFSFIVSSGGLHKEED 216

Query: 217 YPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYK 276
           YPY   E TCDN+K +    TIS Y+D+P+ +E +L++A+++QP+SV ++ASGR F FY 
Sbjct: 217 YPYLEVESTCDNKKGELEVVTISGYKDVPENNEASLIKALAHQPLSVAIEASGRDFQFYS 276

Query: 277 SGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD----AG 332
            GV +  CG   DHGV  VG+G+++   G  Y ++KNSWG  WGE GYIR+ R+    AG
Sbjct: 277 GGVFDGPCGTQLDHGVTAVGYGSSK---GVDYIIVKNSWGPKWGEKGYIRMKRNTGKPAG 333

Query: 333 LCGIATAASYP 343
           LCGI   ASYP
Sbjct: 334 LCGINKMASYP 344


>gi|37780039|gb|AAP32192.1| cysteine protease 14 [Trifolium repens]
          Length = 351

 Score =  321 bits (822), Expect = 4e-85,   Method: Compositional matrix adjust.
 Identities = 157/312 (50%), Positives = 217/312 (69%), Gaps = 11/312 (3%)

Query: 38  IVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNE 97
           ++E  E WM++HG+ Y+   EK +R  +FK NL++I+  NK  +  Y LG NEF+DL+++
Sbjct: 43  LIELFESWMSRHGKIYETIEEKLLRFEVFKDNLKHIDDRNKIVS-NYWLGLNEFADLSHQ 101

Query: 98  EFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWA 157
           EF+  Y G    + S  R+SS    F Y++V D+P S+DWR+KGAVT +K+QGQCGSCWA
Sbjct: 102 EFKNKYLGLKVDL-SQRRESSNEEEFTYRDV-DLPKSVDWRKKGAVTPVKNQGQCGSCWA 159

Query: 158 FSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGLATEAD 216
           FS VAAVEGI QI  G L  LSEQ+L+DC T  N+GC+GGLMD AF +I +N GL  E D
Sbjct: 160 FSTVAAVEGINQIVTGNLTSLSEQELIDCDTTYNNGCNGGLMDYAFSFIGQNGGLHKEED 219

Query: 217 YPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYK 276
           YPY  EE TC+ +KE+    TI+ Y D+P+ +EQ+LL+A++NQP+SV ++AS R F FY 
Sbjct: 220 YPYIMEESTCEMKKEETQVVTINGYHDVPQNNEQSLLKALANQPLSVAIEASSRDFQFYS 279

Query: 277 SGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD----AG 332
            GV +  CG++ DHGV+ VG+GT++  +   Y ++KNSWG  WGE G+IR+ RD     G
Sbjct: 280 GGVFDGHCGSDLDHGVSAVGYGTSKNLD---YIIVKNSWGAKWGEKGFIRMKRDIGKPEG 336

Query: 333 LCGIATAASYPV 344
           +CG+   ASYP 
Sbjct: 337 ICGLYKMASYPT 348


>gi|363807062|ref|NP_001242584.1| uncharacterized protein LOC100804015 precursor [Glycine max]
 gi|255640677|gb|ACU20623.1| unknown [Glycine max]
          Length = 366

 Score =  320 bits (821), Expect = 4e-85,   Method: Compositional matrix adjust.
 Identities = 155/340 (45%), Positives = 214/340 (62%), Gaps = 11/340 (3%)

Query: 13  MFVIIILVITCASQVVSGRSMH--EPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNL 70
           ++ ++ L  T +  + +   ++  +  ++  +E+W+ +H + Y +  +K  R  +FK NL
Sbjct: 7   IYTLLFLSFTLSYAIKTSTIINYTDNEVMAMYEEWLVRHQKGYNELGKKDKRFQVFKDNL 66

Query: 71  EYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPS-VSRQSSRPSTFKYQNVT 129
            +I++ N   N TYKLG N+F+D+TNEE+RA+Y G        + +  S    + +    
Sbjct: 67  GFIQEHNNNLNNTYKLGLNKFADMTNEEYRAMYLGTKSNAKRRLMKTKSTGHRYAFSARD 126

Query: 130 DVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD 189
            +P  +DWR KGAV  IKDQG CGSCWAFS VA VE I +I  GK + LSEQ+LVDC   
Sbjct: 127 RLPVHVDWRMKGAVAPIKDQGSCGSCWAFSTVATVEAINKIVTGKFVSLSEQELVDCDRA 186

Query: 190 -NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGD 248
            N GC+GGLMD AFE+II+N G+ T+ DYPYR  +G CD  K+ A    I  YED+P  D
Sbjct: 187 YNEGCNGGLMDYAFEFIIQNGGIDTDKDYPYRGFDGICDPTKKNAKVVNIDGYEDVPPYD 246

Query: 249 EQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKY 308
           E AL +AV++QPVSV ++ASGRA   Y+SGV    CG + DHGV VVG+G+   ENG  Y
Sbjct: 247 ENALKKAVAHQPVSVAIEASGRALQLYQSGVFTGKCGTSLDHGVVVVGYGS---ENGVDY 303

Query: 309 WLIKNSWGETWGESGYIRILRD----AGLCGIATAASYPV 344
           WL++NSWG  WGE GY ++ R+     G CGI   ASYPV
Sbjct: 304 WLVRNSWGTGWGEDGYFKMQRNVRTSTGKCGITMEASYPV 343


>gi|449455625|ref|XP_004145553.1| PREDICTED: xylem cysteine proteinase 1-like [Cucumis sativus]
          Length = 351

 Score =  320 bits (821), Expect = 4e-85,   Method: Compositional matrix adjust.
 Identities = 156/312 (50%), Positives = 215/312 (68%), Gaps = 12/312 (3%)

Query: 38  IVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNE 97
           ++E  E+W++ HG+ Y+   EK  R  +FK NL++I++ NK+   +Y LG NEF+DLT++
Sbjct: 44  LIELFEEWISNHGKIYETIEEKWHRFEVFKDNLKHIDETNKKVT-SYWLGVNEFADLTHQ 102

Query: 98  EFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWA 157
           EF+ +Y G  +   S +RQS  P  F Y++V D+P S+DWR+KGAVT +K+QG CGSCWA
Sbjct: 103 EFKNMYLGL-KVESSRTRQS--PEEFTYKDVVDLPKSVDWRKKGAVTRVKNQGSCGSCWA 159

Query: 158 FSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGLATEAD 216
           FS VAAVEGI +I  G L  LSEQ+L+DC    N+GC GGLMD AF +I+ + GL  E D
Sbjct: 160 FSTVAAVEGINKIVGGNLTSLSEQELIDCDRPYNNGCHGGLMDYAFSFIVSSGGLHKEED 219

Query: 217 YPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYK 276
           YPY   E TCDN+K +    TIS Y+D+P+ +E +L++A+++QP+SV ++ASGR F FY 
Sbjct: 220 YPYLEVESTCDNKKGELEVVTISGYKDVPENNEASLIKALAHQPLSVAIEASGRDFQFYS 279

Query: 277 SGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD----AG 332
            GV +  CG   DHGV  VG+G+++   G  Y ++KNSWG  WGE GYIR+ R+    AG
Sbjct: 280 GGVFDGPCGTQLDHGVTAVGYGSSK---GVDYIIVKNSWGPKWGEKGYIRMKRNTGKPAG 336

Query: 333 LCGIATAASYPV 344
           LCGI   ASYP 
Sbjct: 337 LCGINKMASYPT 348


>gi|388517427|gb|AFK46775.1| unknown [Medicago truncatula]
          Length = 362

 Score =  320 bits (821), Expect = 4e-85,   Method: Compositional matrix adjust.
 Identities = 159/345 (46%), Positives = 220/345 (63%), Gaps = 10/345 (2%)

Query: 6   EKSFIIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNI 65
           +K  +I + + ++LV++ +          + S+ + +E+W + H  + ++  EK  R N+
Sbjct: 4   KKLLLIVLSIALVLVVSESFDFHDKDVSSDESLWDLYERWRSHHTVS-RNLNEKQKRFNV 62

Query: 66  FKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPS-TFK 124
           FK N+ ++   NK  ++ YKL  N+F+D+TN EF+  Y G       + R + R S TF 
Sbjct: 63  FKSNVMHVHNTNKM-DKPYKLKLNKFADMTNHEFKTTYAGTKVNHHRMFRGTPRVSGTFM 121

Query: 125 YQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLV 184
           Y+N T  P S+DWR+KGAVT +KDQGQCGSCWAFS V AVEGI QI   +L+ LSEQ+L+
Sbjct: 122 YENFTKAPASVDWRKKGAVTDVKDQGQCGSCWAFSTVVAVEGINQIKTNRLVPLSEQELI 181

Query: 185 DCST-DNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYED 243
           DC   +N GC+GGLM+ AFEYI +  G+ TE+ YPY   +G+CD  KE     +I  +E 
Sbjct: 182 DCDNQENQGCNGGLMEYAFEYIKQKGGVTTESYYPYTANDGSCDATKENVPTVSIDGHET 241

Query: 244 LPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEE 303
           +P  DE ALL+AV+NQPVSV +DA G  F FY  GV   DCG   +HGVA+VG+GT  + 
Sbjct: 242 VPANDEDALLKAVANQPVSVAIDAGGSDFQFYSEGVFTGDCGKELNHGVAIVGYGTTVD- 300

Query: 304 NGAKYWLIKNSWGETWGESGYIRILRDA----GLCGIATAASYPV 344
            G  YW+++NSWG  WGE G IR+ R+     GLCGIA  ASYPV
Sbjct: 301 -GTNYWIVRNSWGAEWGEQGCIRMKRNVSNKEGLCGIAMEASYPV 344


>gi|217073894|gb|ACJ85307.1| unknown [Medicago truncatula]
 gi|388507498|gb|AFK41815.1| unknown [Medicago truncatula]
          Length = 362

 Score =  320 bits (821), Expect = 4e-85,   Method: Compositional matrix adjust.
 Identities = 159/345 (46%), Positives = 220/345 (63%), Gaps = 10/345 (2%)

Query: 6   EKSFIIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNI 65
           +K  +I + + ++LV++ +          + S+ + +E+W + H  + ++  EK  R N+
Sbjct: 4   KKLLLIVLSIALVLVVSESFDFHDKDVSSDESLWDLYERWRSHHTVS-RNLNEKQKRFNV 62

Query: 66  FKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPS-TFK 124
           FK N+ ++   NK  ++ YKL  N+F+D+TN EF+  Y G       + R + R S TF 
Sbjct: 63  FKSNVMHVHNTNKM-DKPYKLKLNKFADMTNHEFKTTYAGSKVNHHRMFRGTPRVSGTFM 121

Query: 125 YQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLV 184
           Y+N T  P S+DWR+KGAVT +KDQGQCGSCWAFS V AVEGI QI   +L+ LSEQ+L+
Sbjct: 122 YENFTKAPASVDWRKKGAVTDVKDQGQCGSCWAFSTVVAVEGINQIKTNRLVPLSEQELI 181

Query: 185 DCST-DNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYED 243
           DC   +N GC+GGLM+ AFEYI +  G+ TE+ YPY   +G+CD  KE     +I  +E 
Sbjct: 182 DCDNQENQGCNGGLMEYAFEYIKQKGGVTTESYYPYTANDGSCDATKENVPTVSIDGHET 241

Query: 244 LPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEE 303
           +P  DE ALL+AV+NQPVSV +DA G  F FY  GV   DCG   +HGVA+VG+GT  + 
Sbjct: 242 VPANDEDALLKAVANQPVSVAIDAGGSDFQFYSEGVFTGDCGKELNHGVAIVGYGTTVD- 300

Query: 304 NGAKYWLIKNSWGETWGESGYIRILRDA----GLCGIATAASYPV 344
            G  YW+++NSWG  WGE G IR+ R+     GLCGIA  ASYPV
Sbjct: 301 -GTNYWIVRNSWGAEWGEQGCIRMKRNVSNKEGLCGIAMEASYPV 344


>gi|40806500|gb|AAR92155.1| putative cysteine protease 2 [Iris x hollandica]
          Length = 359

 Score =  320 bits (821), Expect = 5e-85,   Method: Compositional matrix adjust.
 Identities = 162/345 (46%), Positives = 223/345 (64%), Gaps = 12/345 (3%)

Query: 7   KSFIIPMFVIIILVITCASQVVSGRSM-HEPSIVEKHEQWMAQHGRTYKDELEKAMRLNI 65
           KS ++ + V +  V    +   + + +  E S+   +E+W + H    +D  EK  R N+
Sbjct: 4   KSMLLALVVALAFVGVARTIPFNEKDLASEESLWGLYERWRSHH-TVSRDLSEKNKRFNV 62

Query: 66  FKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPS-TFK 124
           FK+N ++I + NK+ +  YKLG N+F+D+TN+EFR+ Y G         R + R + +F 
Sbjct: 63  FKENAKFIHEFNKK-DAPYKLGLNKFADMTNQEFRSTYAGSKIHHHRTQRGTPRATGSFM 121

Query: 125 YQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLV 184
           Y+NV  +P S+DWR +GAV  +KDQGQCGSCWAFS +A+VEGI +I   +L+ LS QQLV
Sbjct: 122 YENVHSIPASVDWRTQGAVAPVKDQGQCGSCWAFSTIASVEGINKIKTNQLVPLSGQQLV 181

Query: 185 DCSTD-NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYED 243
           DC TD N GC+GGLMD AFE+I  N G+ +E+ YPY  E+G+C ++    V  TI  YED
Sbjct: 182 DCDTDQNEGCNGGLMDYAFEFIKSNGGITSESAYPYTAEQGSCASESSAPV-VTIDGYED 240

Query: 244 LPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEE 303
           +P  +E AL++AV+NQ VSV ++ASG AF FY  GV    CGN  DHGVAVVG+G   + 
Sbjct: 241 VPANNEAALMKAVANQVVSVAIEASGMAFQFYSEGVFTGSCGNELDHGVAVVGYGATRD- 299

Query: 304 NGAKYWLIKNSWGETWGESGYIRILRDA----GLCGIATAASYPV 344
            G KYW+++NSWG  WGE GYIR+ R      GLCGIA   SYP+
Sbjct: 300 -GTKYWIVRNSWGAEWGEKGYIRMQRGIRARHGLCGIAMEPSYPL 343


>gi|220983358|dbj|BAH11164.1| cysteine protease [Hordeum vulgare]
          Length = 462

 Score =  320 bits (820), Expect = 6e-85,   Method: Compositional matrix adjust.
 Identities = 159/318 (50%), Positives = 209/318 (65%), Gaps = 14/318 (4%)

Query: 35  EPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKAN---KEGNRTYKLGTNEF 91
           E  +   + +WMA+H  TY    E+  R   F+ NL YI++ N     G  +++LG N F
Sbjct: 35  EEEVRRMYAEWMAEHHSTYNPIGEEERRFEAFRNNLRYIDQHNAAADAGVHSFRLGLNRF 94

Query: 92  SDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQ 151
           +DLTNEE+R+ Y G  R  P   R+ S  + ++  +  ++P S+DWR+KGAV  +KDQG 
Sbjct: 95  ADLTNEEYRSTYLG-ARTKPDRERKLS--ARYQAADNDELPESVDWRKKGAVGAVKDQGG 151

Query: 152 CGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKG 210
           CGSCWAFSA+AAVEGI QI  G +I LSEQ+LVDC T  N GC+GGLMD AFE+II N G
Sbjct: 152 CGSCWAFSAIAAVEGINQIVTGDMIPLSEQELVDCDTSYNQGCNGGLMDYAFEFIINNGG 211

Query: 211 LATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGR 270
           + +E DYPY+  +  CD  K+ A   TI  YED+P   E++L +AV+NQP+SV ++A GR
Sbjct: 212 IDSEEDYPYKERDNRCDANKKNAKVVTIDGYEDVPVNSEKSLQKAVANQPISVAIEAGGR 271

Query: 271 AFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD 330
           AF  YKSG+    CG   DHGVA VG+GT   ENG  YWL++NSWG  WGE+GYIR+ R+
Sbjct: 272 AFQLYKSGIFTGTCGTALDHGVAAVGYGT---ENGKDYWLVRNSWGSVWGENGYIRMERN 328

Query: 331 ----AGLCGIATAASYPV 344
               +G CGIA   SYP 
Sbjct: 329 IKASSGKCGIAVEPSYPT 346


>gi|414584879|tpg|DAA35450.1| TPA: cysteine protease 1 [Zea mays]
          Length = 522

 Score =  320 bits (820), Expect = 6e-85,   Method: Compositional matrix adjust.
 Identities = 155/320 (48%), Positives = 212/320 (66%), Gaps = 16/320 (5%)

Query: 35  EPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKAN-KEGNRTYKLGTNEFSD 93
           EP     +E W+A+HGR Y    E+  R  +F  NL +++  N +     ++LG N+F+D
Sbjct: 102 EPEARTLYELWLAEHGRAYNALGERDRRFRVFWDNLRFVDAHNERAAEHGFRLGMNQFAD 161

Query: 94  LTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQN---VTDVPTSIDWREKGAVTHIKDQG 150
           LTN+EFRA Y G   P    SR+       +Y++     ++P S+DWREKGAV  +K+QG
Sbjct: 162 LTNDEFRAAYLGARIPA---SRRRGTAVGERYRHGGGAEELPESVDWREKGAVAPVKNQG 218

Query: 151 QCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD--NHGCSGGLMDKAFEYIIEN 208
           QCGSCWAFSAV++VE + QI  G+++ LSEQ+LV+CSTD  N GC+GGLMD AF++II+N
Sbjct: 219 QCGSCWAFSAVSSVESVNQIVTGEMVTLSEQELVECSTDGGNSGCNGGLMDAAFDFIIKN 278

Query: 209 KGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDAS 268
            G+ TE DYPY+  +G CD  +E A   +I  +ED+P+ DE++L +AV++QPVSV ++A 
Sbjct: 279 GGIDTEGDYPYKAVDGKCDINRENAKVVSIDGFEDVPENDEKSLQKAVAHQPVSVAIEAG 338

Query: 269 GRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRIL 328
           GR F  YK+GV    C  N DHGV  VG+GT   ENG  YW+++NSWG  WGE GYIR+ 
Sbjct: 339 GREFQLYKAGVFTGTCTTNLDHGVVAVGYGT---ENGKDYWIVRNSWGAKWGEDGYIRME 395

Query: 329 RD----AGLCGIATAASYPV 344
           R+     G CGIA  ASYP 
Sbjct: 396 RNVNATTGKCGIAMMASYPT 415


>gi|162459393|ref|NP_001105993.1| cysteine protease component of protease-inhibitor complex precursor
           [Zea mays]
 gi|6682829|dbj|BAA88898.1| cysteine protease component of protease-inhibitor complex [Zea
           mays]
          Length = 465

 Score =  320 bits (820), Expect = 6e-85,   Method: Compositional matrix adjust.
 Identities = 158/312 (50%), Positives = 204/312 (65%), Gaps = 16/312 (5%)

Query: 42  HEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKAN---KEGNRTYKLGTNEFSDLTNEE 98
           + +WMA HGRTY    E+  R  +F+ NL YI+  N     G  +++LG N F+DLTN+E
Sbjct: 44  YAEWMAAHGRTYNAVGEEERRYQVFRDNLRYIDAHNAAADAGVHSFRLGLNRFADLTNDE 103

Query: 99  FRALYTG-YNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWA 157
           +RA Y G   RP     R+    + +   +  D+P S+DWR KGAV  +KDQG  GSCWA
Sbjct: 104 YRATYLGARTRP----QRERKLGARYHAADNEDLPESVDWRAKGAVAEVKDQGSYGSCWA 159

Query: 158 FSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGLATEAD 216
           FS +AAVEGI QI  G LI LSEQ+LVDC T  N GC+GGLMD AFE+II N G+ TE D
Sbjct: 160 FSTIAAVEGINQIVTGDLISLSEQELVDCDTSYNQGCNGGLMDYAFEFIINNGGIDTEKD 219

Query: 217 YPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYK 276
           YPY+  +G CD  ++ A   TI  YED+P  DE++L +AV+NQPVSV ++A+G  F  Y 
Sbjct: 220 YPYKGTDGRCDVNRKNAKVVTIDSYEDVPANDEKSLQKAVANQPVSVAIEAAGTQFQLYS 279

Query: 277 SGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD----AG 332
           SG+    CG   DHGV  VG+GT   ENG  YW++KNSWG +WGESGY+R+ R+    +G
Sbjct: 280 SGIFTGSCGTALDHGVTAVGYGT---ENGKDYWIVKNSWGSSWGESGYVRMERNIKASSG 336

Query: 333 LCGIATAASYPV 344
            CGIA   SYP+
Sbjct: 337 KCGIAVEPSYPL 348


>gi|18422289|ref|NP_568620.1| Granulin repeat cysteine protease family protein [Arabidopsis
           thaliana]
 gi|9757832|dbj|BAB08269.1| cysteine protease component of protease-inhibitor complex
           [Arabidopsis thaliana]
 gi|17065064|gb|AAL32686.1| cysteine protease component of protease-inhibitor complex
           [Arabidopsis thaliana]
 gi|21387153|gb|AAM47980.1| cysteine protease component of protease-inhibitor complex
           [Arabidopsis thaliana]
 gi|332007522|gb|AED94905.1| Granulin repeat cysteine protease family protein [Arabidopsis
           thaliana]
          Length = 463

 Score =  320 bits (820), Expect = 7e-85,   Method: Compositional matrix adjust.
 Identities = 161/330 (48%), Positives = 216/330 (65%), Gaps = 21/330 (6%)

Query: 26  QVVSGRSMHEPSIVEKHEQWMAQHGRTYKDE----LEKAMRLNIFKQNLEYIEKANKEGN 81
            + +  S  +  +   +E WM +HG+   ++     EK  R  IFK NL +I++ N + N
Sbjct: 34  HITTETSRSDSEVERIYEAWMVEHGKKKMNQNGLGAEKDQRFEIFKDNLRFIDEHNTK-N 92

Query: 82  RTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQ-NVTD-VPTSIDWRE 139
            +YKLG   F+DLTNEE+R++Y G  +P   V + S R     YQ  V D +P S+DWR+
Sbjct: 93  LSYKLGLTRFADLTNEEYRSMYLG-AKPTKRVLKTSDR-----YQARVGDALPDSVDWRK 146

Query: 140 KGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLM 198
           +GAV  +KDQG CGSCWAFS + AVEGI +I  G LI LSEQ+LVDC T  N GC+GGLM
Sbjct: 147 EGAVADVKDQGSCGSCWAFSTIGAVEGINKIVTGDLISLSEQELVDCDTSYNQGCNGGLM 206

Query: 199 DKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSN 258
           D AFE+II+N G+ TEADYPY+  +G CD  ++ A   TI  YED+P+  E +L +A+++
Sbjct: 207 DYAFEFIIKNGGIDTEADYPYKAADGRCDQNRKNAKVVTIDSYEDVPENSEASLKKALAH 266

Query: 259 QPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGET 318
           QP+SV ++A GRAF  Y SGV +  CG   DHGV  VG+GT   ENG  YW+++NSWG  
Sbjct: 267 QPISVAIEAGGRAFQLYSSGVFDGLCGTELDHGVVAVGYGT---ENGKDYWIVRNSWGNR 323

Query: 319 WGESGYIRILRD----AGLCGIATAASYPV 344
           WGESGYI++ R+     G CGIA  ASYP+
Sbjct: 324 WGESGYIKMARNIEAPTGKCGIAMEASYPI 353


>gi|357166359|ref|XP_003580684.1| PREDICTED: oryzain alpha chain-like [Brachypodium distachyon]
          Length = 456

 Score =  320 bits (820), Expect = 7e-85,   Method: Compositional matrix adjust.
 Identities = 158/327 (48%), Positives = 211/327 (64%), Gaps = 16/327 (4%)

Query: 27  VVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKAN---KEGNRT 83
           +VS     E  +   + +WMA++GRTY    E+  R  +F+ NL Y+++ N     G  +
Sbjct: 27  IVSYGERSEEEVRRMYVEWMAENGRTYNAIGEEERRFEVFRDNLRYVDQHNAAADAGLHS 86

Query: 84  YKLGTNEFSDLTNEEFRALYTGY-NRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGA 142
           ++LG N F+DLTNEE+R  Y G   +PV    R+      ++  +  ++P S+DWREKGA
Sbjct: 87  FRLGLNRFADLTNEEYRDTYLGVRTKPV----RERRLSGRYQAADNEELPESVDWREKGA 142

Query: 143 VTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKA 201
           V  +KDQG CGSCWAFSA+AAVEGI QI  G +I LSEQ+LVDC T  N GC+GGLMD A
Sbjct: 143 VAKVKDQGGCGSCWAFSAIAAVEGINQIVTGDMIALSEQELVDCDTSYNQGCNGGLMDYA 202

Query: 202 FEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPV 261
           FE+II N G+ +E DYPY+  +  CD  K+ A   TI  YED+P   E +L +AV+NQP+
Sbjct: 203 FEFIINNGGIDSEEDYPYKERDNRCDANKKNAKVVTIDGYEDVPVNSELSLKKAVANQPI 262

Query: 262 SVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGE 321
           SV ++A GRAF  YKSG+    CG   DHGV  VG+G+   ENG  YW++KNSWG  WGE
Sbjct: 263 SVAIEAGGRAFQLYKSGIFTGRCGTALDHGVTAVGYGS---ENGKDYWIVKNSWGTVWGE 319

Query: 322 SGYIRILRD----AGLCGIATAASYPV 344
            GY+R+ R+    +G CGIA   SYP+
Sbjct: 320 DGYVRLERNIKATSGKCGIAIEPSYPL 346


>gi|238006338|gb|ACR34204.1| unknown [Zea mays]
          Length = 465

 Score =  320 bits (820), Expect = 7e-85,   Method: Compositional matrix adjust.
 Identities = 156/325 (48%), Positives = 213/325 (65%), Gaps = 16/325 (4%)

Query: 30  GRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKAN-KEGNRTYKLGT 88
           G    EP     +E W+A+HGR Y    E+  R  +F  NL +++  N +     ++LG 
Sbjct: 40  GLERTEPEARTLYELWLAEHGRAYNALGERDRRFRVFWDNLRFVDAHNERAAEHGFRLGM 99

Query: 89  NEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQN---VTDVPTSIDWREKGAVTH 145
           N+F+DLTN+EFRA Y G   P    SR+       +Y++     ++P S+DWREKGAV  
Sbjct: 100 NQFADLTNDEFRAAYLGARIPA---SRRRGTAVGERYRHGGGAEELPESVDWREKGAVAP 156

Query: 146 IKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD--NHGCSGGLMDKAFE 203
           +K+QGQCGSCWAFSAV++VE + QI  G+++ LSEQ+LV+CSTD  N GC+GGLMD AF+
Sbjct: 157 VKNQGQCGSCWAFSAVSSVESVNQIVTGEMVTLSEQELVECSTDGGNSGCNGGLMDAAFD 216

Query: 204 YIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSV 263
           +II+N G+ TE DYPY+  +G CD  +E A   +I  +ED+P+ DE++L +AV++QPVSV
Sbjct: 217 FIIKNGGIDTEGDYPYKAVDGKCDINRENAKVVSIDGFEDVPENDEKSLQKAVAHQPVSV 276

Query: 264 CVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESG 323
            ++A GR F  YK+GV    C  N DHGV  VG+GT   ENG  YW+++NSWG  WGE G
Sbjct: 277 AIEAGGREFQLYKAGVFTGTCTTNLDHGVVAVGYGT---ENGKDYWIVRNSWGAKWGEDG 333

Query: 324 YIRILRD----AGLCGIATAASYPV 344
           YIR+ R+     G CGIA  ASYP 
Sbjct: 334 YIRMERNVNATTGKCGIAMMASYPT 358


>gi|350538043|ref|NP_001234324.1| cysteine protease TDI-65 precursor [Solanum lycopersicum]
 gi|5726641|gb|AAD48496.1|AF172856_1 cysteine protease TDI-65 [Solanum lycopersicum]
 gi|2828252|emb|CAA05894.1| CYP1 [Solanum lycopersicum]
          Length = 466

 Score =  320 bits (819), Expect = 7e-85,   Method: Compositional matrix adjust.
 Identities = 155/345 (44%), Positives = 212/345 (61%), Gaps = 13/345 (3%)

Query: 10  IIPMFVIIILVITCASQVVSGRSMH-----EPSIVEKHEQWMAQHGRTYKDELEKAMRLN 64
           I+ M +   L       ++S    H     +  +   +E W+ +HG++Y    EK  R  
Sbjct: 12  ILLMLIFSTLSSASDMSIISYDETHIHRRTDDEVSALYESWLIEHGKSYNALGEKDKRFQ 71

Query: 65  IFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFK 124
           IFK NL YI++ N   N++YKLG  +F+DLTNEE+R++Y G            ++   + 
Sbjct: 72  IFKDNLRYIDEQNSVPNQSYKLGLTKFADLTNEEYRSIYLGTKSSGDRKKLSKNKSDRYL 131

Query: 125 YQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLV 184
            +    +P SIDWREKG +  +KDQG CGSCWAFSAVAA+E I  I  G LI LSEQ+LV
Sbjct: 132 PKVGDSLPESIDWREKGVLVGVKDQGSCGSCWAFSAVAAMESINAIVTGNLISLSEQELV 191

Query: 185 DCSTD-NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYED 243
           DC    N GC GGLMD AFE++I+N G+ TE DYPY+   G CD  ++ A    I  YED
Sbjct: 192 DCDRSYNEGCDGGLMDYAFEFVIKNGGIDTEEDYPYKERNGVCDQYRKNAKVVKIDSYED 251

Query: 244 LPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEE 303
           +P  +E+AL +AV++QPVS+ ++A GR F  YKSG+    CG   DHGV + G+GT   E
Sbjct: 252 VPVNNEKALQKAVAHQPVSIALEAGGRDFQHYKSGIFTGKCGTAVDHGVVIAGYGT---E 308

Query: 304 NGAKYWLIKNSWGETWGESGYIRILRD----AGLCGIATAASYPV 344
           NG  YW+++NSWG  WGE+GY+R+ R+    +GLCG+A   SYPV
Sbjct: 309 NGMDYWIVRNSWGANWGENGYLRVQRNVASSSGLCGLAIEPSYPV 353


>gi|1345573|emb|CAA40073.1| endopeptidase (EP-C1) [Phaseolus vulgaris]
          Length = 361

 Score =  320 bits (819), Expect = 7e-85,   Method: Compositional matrix adjust.
 Identities = 158/317 (49%), Positives = 210/317 (66%), Gaps = 12/317 (3%)

Query: 35  EPSIVEKHEQWMAQHGRTYKDEL-EKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSD 93
           E S+ + +E+W + H  T    L EK  R N+FK NL ++   NK  ++ YKL  N+F+D
Sbjct: 32  EESLWDLYERWRSHH--TVSRSLGEKHKRFNVFKANLMHVHNTNKM-DKPYKLKLNKFAD 88

Query: 94  LTNEEFRALYTGYNRPVPSVSRQSSRPS-TFKYQNVTDVPTSIDWREKGAVTHIKDQGQC 152
           +TN EFR+ Y G       + R +   +  F Y+ V  VP S+DWR+KGAVT +KDQGQC
Sbjct: 89  MTNHEFRSTYAGSKVNHHRMFRGTPHENGAFMYEKVVSVPPSVDWRKKGAVTDVKDQGQC 148

Query: 153 GSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGL 211
           GSCWAFS V AVEGI QI   KL+ LSEQ+LVDC  + N GC+GGLM+ AFE+I +  G+
Sbjct: 149 GSCWAFSTVVAVEGINQIKTNKLVALSEQELVDCDKEENQGCNGGLMESAFEFIKQKGGI 208

Query: 212 ATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRA 271
            TE++YPY+ +EGTCD  K   +A +I  +E++P  DE ALL+AV+NQPVSV +DA G  
Sbjct: 209 TTESNYPYKAQEGTCDASKVNDLAVSIDGHENVPANDEDALLKAVANQPVSVAIDAGGSD 268

Query: 272 FHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD- 330
           F FY  GV   DC  + +HGVA+VG+GT  +  G  YW+++NSWG  WGE GYIR+ R+ 
Sbjct: 269 FQFYSEGVFTGDCSTDLNHGVAIVGYGTTVD--GTNYWIVRNSWGPEWGEHGYIRMQRNI 326

Query: 331 ---AGLCGIATAASYPV 344
               GLCGIA   SYP+
Sbjct: 327 SKKEGLCGIAMLPSYPI 343


>gi|334185815|ref|NP_680113.3| putative cysteine proteinase [Arabidopsis thaliana]
 gi|75313879|sp|Q9STL4.1|CEP2_ARATH RecName: Full=KDEL-tailed cysteine endopeptidase CEP2; Flags:
           Precursor
 gi|4678354|emb|CAB41164.1| cysteine endopeptidase-like protein [Arabidopsis thaliana]
 gi|332644882|gb|AEE78403.1| putative cysteine proteinase [Arabidopsis thaliana]
          Length = 361

 Score =  320 bits (819), Expect = 9e-85,   Method: Compositional matrix adjust.
 Identities = 158/350 (45%), Positives = 221/350 (63%), Gaps = 13/350 (3%)

Query: 5   FEKSFIIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLN 64
            +K  +I +F ++IL   C           E  +   +++W + H    +   E+  R N
Sbjct: 1   MKKLLLIFLFSLVILQTACGFDYDDKEIESEEGLSTLYDRWRSHHS-VPRSLNEREKRFN 59

Query: 65  IFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYN---RPVPSVSRQSSRPS 121
           +F+ N+ ++   NK+ NR+YKL  N+F+DLT  EF+  YTG N     +    ++ S+  
Sbjct: 60  VFRHNVMHVHNTNKK-NRSYKLKLNKFADLTINEFKNAYTGSNIKHHRMLQGPKRGSKQF 118

Query: 122 TFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQ 181
            + ++N++ +P+S+DWR+KGAVT IK+QG+CGSCWAFS VAAVEGI +I   KL+ LSEQ
Sbjct: 119 MYDHENLSKLPSSVDWRKKGAVTEIKNQGKCGSCWAFSTVAAVEGINKIKTNKLVSLSEQ 178

Query: 182 QLVDCST-DNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISK 240
           +LVDC T  N GC+GGLM+ AFE+I +N G+ TE  YPY   +G CD  K+  V  TI  
Sbjct: 179 ELVDCDTKQNEGCNGGLMEIAFEFIKKNGGITTEDSYPYEGIDGKCDASKDNGVLVTIDG 238

Query: 241 YEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTA 300
           +ED+P+ DE ALL+AV+NQPVSV +DA    F FY  GV    CG   +HGVA VG+G+ 
Sbjct: 239 HEDVPENDENALLKAVANQPVSVAIDAGSSDFQFYSEGVFTGSCGTELNHGVAAVGYGS- 297

Query: 301 EEENGAKYWLIKNSWGETWGESGYIRILRD----AGLCGIATAASYPVAI 346
             E G KYW+++NSWG  WGE GYI+I R+     G CGIA  ASYP+ +
Sbjct: 298 --ERGKKYWIVRNSWGAEWGEGGYIKIEREIDEPEGRCGIAMEASYPIKL 345


>gi|414588007|tpg|DAA38578.1| TPA: hypothetical protein ZEAMMB73_159244 [Zea mays]
          Length = 307

 Score =  320 bits (819), Expect = 9e-85,   Method: Compositional matrix adjust.
 Identities = 152/315 (48%), Positives = 209/315 (66%), Gaps = 17/315 (5%)

Query: 38  IVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNE 97
           + E+HE+WMA++ R YKD  EKA R  +FK N  ++E  N +    + LG N+F+DLT E
Sbjct: 1   MAERHERWMAEYDRVYKDAAEKARRFEVFKDNFAFVESFNADKKNKFWLGVNQFADLTTE 60

Query: 98  EFRALYTGYNRPVPSVSRQSSRPSTFKYQN--VTDVPTSIDWREKGAVTHIKDQGQCGSC 155
           EF+A     N+    +S +    + FKY+N  V+ +PT++DWR KGAVT IK+QGQCG C
Sbjct: 61  EFKA-----NKGFKPISAEEVPTTGFKYENLSVSALPTAVDWRTKGAVTPIKNQGQCGCC 115

Query: 156 WAFSAVAAVEGITQITRGKLIELSEQQLVDCSTDN--HGCSGGLMDKAFEYIIENKGLAT 213
           WAFSA+AA+EGI +++ G L+ LSEQ+ VDC T N   GC GG MD AFE++I+N GLAT
Sbjct: 116 WAFSAIAAMEGIVKLSTGNLVSLSEQEPVDCDTHNMDEGCEGGWMDNAFEFVIKNGGLAT 175

Query: 214 EADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFH 273
           E+ YPY+  +G C    +   AATI  +ED+P  +E AL++ V++QPVSV VDAS R F 
Sbjct: 176 ESSYPYKVVDGKCKGGSKS--AATIKGHEDVPPNNEAALMKVVASQPVSVAVDASDRTFM 233

Query: 274 FYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRDA-- 331
            Y  GV+   CG   DHG+A +G+G   E +  KYW++KNSWG TWGE G++R+ +D   
Sbjct: 234 LYSGGVMTGSCGTQLDHGIAAIGYGV--ESDDTKYWILKNSWGTTWGEKGFLRMEKDISD 291

Query: 332 --GLCGIATAASYPV 344
             G+C +A   SYP 
Sbjct: 292 KRGMCDLAMKPSYPT 306


>gi|297830592|ref|XP_002883178.1| hypothetical protein ARALYDRAFT_479457 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297329018|gb|EFH59437.1| hypothetical protein ARALYDRAFT_479457 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 452

 Score =  319 bits (818), Expect = 9e-85,   Method: Compositional matrix adjust.
 Identities = 155/343 (45%), Positives = 220/343 (64%), Gaps = 12/343 (3%)

Query: 8   SFIIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFK 67
           +  + +F ++++ ++  S   +  + +E      +EQW+ ++ + Y    EK  R  IF 
Sbjct: 9   TLALLIFSMLLISLSLGSVTAADTTRNEAEARRMYEQWLVENRKNYNGLGEKETRFEIFT 68

Query: 68  QNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQN 127
            NL+YIE+ N   N+T+++G   F+DLTN+EFRA+Y    R     +R   +   + Y+ 
Sbjct: 69  DNLKYIEEHNSVPNQTFEVGLTRFADLTNDEFRAIYL---RSKMERTRVPVKGERYLYKV 125

Query: 128 VTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCS 187
              +P  IDWR KGAV  +KDQG CGSCWAFSA+ AVEGI QI  G+LI LSEQ+LVDC 
Sbjct: 126 GDTLPDQIDWRAKGAVNPVKDQGNCGSCWAFSAIGAVEGINQIKTGELISLSEQELVDCD 185

Query: 188 TD-NHGCSGGLMDKAFEYIIENKGLATEADYPYR-HEEGTCDNQKEKAVAATISKYEDLP 245
           T  N GC GGLMD AF++IIEN G+ TE DYPY   ++  C++ K+ +   TI  YED+P
Sbjct: 186 TSYNGGCGGGLMDYAFKFIIENGGIDTEEDYPYTATDDNICNSDKKNSRVVTIDGYEDVP 245

Query: 246 KGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENG 305
           + DE++L +A++NQP+SV ++A GRAF  YKSGV    CG + DHGV  VG+G+   E G
Sbjct: 246 QNDEKSLKKALANQPISVAIEAGGRAFQLYKSGVFTGTCGTSLDHGVVAVGYGS---EGG 302

Query: 306 AKYWLIKNSWGETWGESGYIRILRD----AGLCGIATAASYPV 344
             YW+++NSWG  WGESGY ++ R+    +G CG+A  ASYP 
Sbjct: 303 QDYWIVRNSWGSNWGESGYFKLERNIKESSGKCGVAMMASYPT 345


>gi|302796898|ref|XP_002980210.1| hypothetical protein SELMODRAFT_153766 [Selaginella moellendorffii]
 gi|300151826|gb|EFJ18470.1| hypothetical protein SELMODRAFT_153766 [Selaginella moellendorffii]
          Length = 479

 Score =  319 bits (818), Expect = 1e-84,   Method: Compositional matrix adjust.
 Identities = 165/319 (51%), Positives = 210/319 (65%), Gaps = 26/319 (8%)

Query: 43  EQWMAQHGRTYKDEL--------EKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDL 94
           + WM QHG++Y D          EKA R  IFK NL +I   N E N+ Y LG N F+DL
Sbjct: 58  DSWMLQHGKSYADNALSGDSQAGEKATRYGIFKDNLRFIHGEN-EKNQGYFLGLNAFADL 116

Query: 95  TNEEFRALYTG--YNRPVPSVSRQSSRPSTFKYQNV--TDVPTSIDWREKGAVTHIKDQG 150
           TNEEFRA   G  ++R     SR+ +    F+Y +V   D+P SIDWREKGAV  +KDQG
Sbjct: 117 TNEEFRAQRHGGRFDR-----SRERTSHEEFRYGSVQLKDLPDSIDWREKGAVVGVKDQG 171

Query: 151 QCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST-DNHGCSGGLMDKAFEYIIENK 209
            CGSCWAFSAVAA+EG+ ++  G+L+ LSEQ+LVDC   ++ GC+GGLMD AF ++I+N 
Sbjct: 172 SCGSCWAFSAVAAIEGVNKLATGELVSLSEQELVDCDKGEDEGCNGGLMDYAFGFVIKNG 231

Query: 210 GLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASG 269
           GL TEADYPY+     CD  K  A   TI  YED+P  DE ALL+AV++QPVSV +DA G
Sbjct: 232 GLDTEADYPYKGYGTRCDRSKMNAKVVTIDGYEDVPVNDETALLKAVAHQPVSVAIDAGG 291

Query: 270 RAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILR 329
            +  FY+SG+    CG + DHGV  VG+G   +E+G  YW+IKNSWG  WGE GY+++ R
Sbjct: 292 SSMQFYRSGIFTGRCGTDLDHGVTNVGYG---KEDGKAYWIIKNSWGSNWGEKGYVKMAR 348

Query: 330 D----AGLCGIATAASYPV 344
           +    AGLCGI   ASYP 
Sbjct: 349 NTGLAAGLCGINMEASYPT 367


>gi|262360187|gb|ACY38051.2| cysteine proteinase C1A [Dactylis glomerata]
          Length = 365

 Score =  319 bits (818), Expect = 1e-84,   Method: Compositional matrix adjust.
 Identities = 162/322 (50%), Positives = 215/322 (66%), Gaps = 18/322 (5%)

Query: 35  EPSIVEKHEQWMAQHG---RTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEF 91
           E S+   +E W + H    R    E E A R N+FK+N+ YI +ANK+ +R ++L  N+F
Sbjct: 33  EESLRGLYETWRSHHTVSRRGLGAEAE-ARRFNVFKENVRYIHEANKK-DRPFRLALNKF 90

Query: 92  SDLTNEEFRALYTG----YNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIK 147
           +D+T +EFR  Y G    ++R +    RQ     +F Y +  ++P ++DWR+KGAVT IK
Sbjct: 91  ADMTTDEFRRTYAGSRVRHHRSLSGGRRQGG--GSFMYADAENLPAAVDWRQKGAVTPIK 148

Query: 148 DQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST-DNHGCSGGLMDKAFEYII 206
           DQGQCGSCWAFS + AVEGI +I  G+L+ LSEQ+L+DC+  +N GC+GGLMD AF++I 
Sbjct: 149 DQGQCGSCWAFSTIVAVEGINKIRTGRLVSLSEQELMDCNIGENDGCNGGLMDVAFQFIQ 208

Query: 207 ENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVD 266
           +N G+ TEA YPY+ E+ +CD  KE +   +I  YED+P  DE AL +AV+NQPVSV +D
Sbjct: 209 QNGGITTEASYPYQGEQNSCDQSKENSHDVSIDGYEDVPANDESALQKAVANQPVSVAID 268

Query: 267 ASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIR 326
           ASG  F FY  GV   D G + DHGVA VG+GT  +  G KYW++KNSWGE WGE GYIR
Sbjct: 269 ASGNDFQFYSEGVFTTDGGTDLDHGVAAVGYGTTRD--GTKYWIVKNSWGEDWGEKGYIR 326

Query: 327 ILRDA----GLCGIATAASYPV 344
           + R      GLCGIA  ASYP 
Sbjct: 327 MQRGVKQAEGLCGIAMEASYPT 348


>gi|297744465|emb|CBI37727.3| unnamed protein product [Vitis vinifera]
          Length = 331

 Score =  319 bits (817), Expect = 1e-84,   Method: Compositional matrix adjust.
 Identities = 157/312 (50%), Positives = 208/312 (66%), Gaps = 33/312 (10%)

Query: 38  IVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNE 97
           ++ + E W+++HG+ YK   EK  R  +F++NL +I++ NKE + +Y LG NEF+DL++E
Sbjct: 45  LIARFESWVSKHGKVYKSMEEKLHRFEVFRENLNHIDERNKEVS-SYWLGLNEFADLSHE 103

Query: 98  EFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWA 157
           EF++                        ++V D+P S+DWR+KGAVTH+K+QG CGSCWA
Sbjct: 104 EFKS------------------------KDVADLPESVDWRKKGAVTHVKNQGACGSCWA 139

Query: 158 FSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGLATEAD 216
           FS VAAVEGI QI  G L  LSEQ+L+DC T  N GC+GGLMD AF +I  N GL  E D
Sbjct: 140 FSTVAAVEGINQIVTGNLTTLSEQELIDCDTTFNSGCNGGLMDYAFAFIASNGGLHKEDD 199

Query: 217 YPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYK 276
           YPY  EEGTC+ QKE     TIS YED+P+ DE++LL+A+++QP+SV ++ASGR F FY 
Sbjct: 200 YPYLMEEGTCEEQKEDVDIVTISGYEDVPEKDEESLLKALAHQPLSVAIEASGRDFQFYS 259

Query: 277 SGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRDA----G 332
            GV N  CG   DHGVA VG+G+++   G  Y ++KNSWG  WGE GYIR+ R+     G
Sbjct: 260 GGVFNGPCGTELDHGVAAVGYGSSK---GLDYIIVKNSWGPKWGEKGYIRMKRNTGKTEG 316

Query: 333 LCGIATAASYPV 344
           LCGI   ASYP 
Sbjct: 317 LCGINKMASYPT 328


>gi|226501480|ref|NP_001150266.1| cysteine protease 1 precursor [Zea mays]
 gi|195637948|gb|ACG38442.1| cysteine protease 1 precursor [Zea mays]
          Length = 462

 Score =  319 bits (817), Expect = 1e-84,   Method: Compositional matrix adjust.
 Identities = 155/329 (47%), Positives = 214/329 (65%), Gaps = 16/329 (4%)

Query: 26  QVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKAN-KEGNRTY 84
               G    EP     +E W+A+HGR Y    E+  R  +F  NL +++  N +     +
Sbjct: 33  HAARGLERTEPEARTLYELWLAEHGRAYNALGERDRRFRVFWDNLRFVDAHNERAAEHGF 92

Query: 85  KLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQN---VTDVPTSIDWREKG 141
           +LG N+F+DLTN+EFRA Y G   P    +R+       +Y++     ++P S+DWREKG
Sbjct: 93  RLGMNQFADLTNDEFRAAYLGARIPA---ARRRGTAVGERYRHGGGAEELPESVDWREKG 149

Query: 142 AVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD--NHGCSGGLMD 199
           AV  +K+QGQCGSCWAFSAV++VE + QI  G+++ LSEQ+LV+CSTD  N GC+GGLMD
Sbjct: 150 AVAPVKNQGQCGSCWAFSAVSSVESVNQIVTGEMVTLSEQELVECSTDGGNSGCNGGLMD 209

Query: 200 KAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQ 259
            AF++II+N G+ TE DYPY+  +G CD  +E A   +I  +ED+P+ DE++L +AV++Q
Sbjct: 210 AAFDFIIKNGGIDTEGDYPYKAVDGKCDINRENAKVVSIDGFEDVPENDEKSLQKAVAHQ 269

Query: 260 PVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETW 319
           PVSV ++A GR F  YK+GV +  C  N DHGV  VG+GT   ENG  YW+++NSWG  W
Sbjct: 270 PVSVAIEAGGREFQLYKAGVFSGTCTTNLDHGVVAVGYGT---ENGKDYWIVRNSWGAKW 326

Query: 320 GESGYIRILRD----AGLCGIATAASYPV 344
           GE GYIR+ R+     G CGIA  ASYP 
Sbjct: 327 GEDGYIRMERNVNATTGKCGIAMMASYPT 355


>gi|388519351|gb|AFK47737.1| unknown [Medicago truncatula]
          Length = 359

 Score =  319 bits (817), Expect = 1e-84,   Method: Compositional matrix adjust.
 Identities = 163/345 (47%), Positives = 212/345 (61%), Gaps = 14/345 (4%)

Query: 8   SFIIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFK 67
           S  I   +   L+    +   S RS  E  ++  +E+W+ +H + Y    EK  R  IFK
Sbjct: 3   SITITSLLFFSLITLSLAMDTSMRSNEE--VMTMYEEWLVKHHKVYNGLGEKDQRFEIFK 60

Query: 68  QNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQ- 126
            NL +I++ N + N TYK+G N+F+D TNEE+R +Y G          +    +  +Y  
Sbjct: 61  DNLGFIDEHNAQ-NYTYKVGLNKFADTTNEEYRNMYLGTKNDAKRNVMKIKITTGHRYAF 119

Query: 127 NVTD-VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVD 185
           N  D +P  +DWR KGAV HIKDQG CGSCWAFS +A VE I +I  GKL+ LSEQ+LVD
Sbjct: 120 NSGDRLPVHVDWRSKGAVAHIKDQGSCGSCWAFSTIATVEAINKIVTGKLVSLSEQELVD 179

Query: 186 CSTD-NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDL 244
           C    N GC+GGLMD AFE+I+EN G+ TE DYPY+  EG CD  ++ A   +I  YED+
Sbjct: 180 CDRAFNEGCNGGLMDYAFEFIVENGGIDTEQDYPYKGFEGRCDPTRKNAKVVSIDGYEDV 239

Query: 245 PKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEEN 304
           P  +E AL +AV +QPVSV ++A GRA   Y+SGV    CG N DHGV VVG+G    EN
Sbjct: 240 PAYNENALKKAVFHQPVSVAIEAGGRALQLYQSGVFTGRCGTNLDHGVVVVGYGF---EN 296

Query: 305 GAKYWLIKNSWGETWGESGYIRILRDA-----GLCGIATAASYPV 344
           G  YWL++NSWG  WGE GY ++ R+      G CGIA  ASYPV
Sbjct: 297 GVDYWLVRNSWGTNWGEDGYFKLERNVKKINTGKCGIAMQASYPV 341


>gi|326502440|dbj|BAJ95283.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 349

 Score =  318 bits (816), Expect = 2e-84,   Method: Compositional matrix adjust.
 Identities = 164/318 (51%), Positives = 217/318 (68%), Gaps = 13/318 (4%)

Query: 35  EPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDL 94
           + ++V +HE+WMA+HGRTY +E EKA RL +F+ N + I+  N   + T++L TN F+DL
Sbjct: 37  DSAMVSRHEKWMAEHGRTYANEEEKARRLEVFRANAKLIDSFNSAEDSTHRLATNRFADL 96

Query: 95  TNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQN--VTDVPTSIDWREKGAVTHIKDQGQC 152
           T+EEFRA  TG  RP  + +   S    F+Y+N  + D   S+DWR  GAVT +KDQG C
Sbjct: 97  TDEEFRAARTGLRRPPAAAAGAGSGAGGFRYENFSLADAAGSMDWRAMGAVTGVKDQGSC 156

Query: 153 GSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST--DNHGCSGGLMDKAFEYIIENKG 210
           G CWAFSAVAAVEG+T+I  G+L+ LSEQQLVDC    D+ GC+GGLMD AFEY+I   G
Sbjct: 157 GCCWAFSAVAAVEGLTKIRTGRLVSLSEQQLVDCDVYGDDEGCAGGLMDNAFEYMINRGG 216

Query: 211 LATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGR 270
           L TE+ YPYR  +G+C   +  A AA+I  YED+P  +E AL+ AV++QPVSV ++    
Sbjct: 217 LTTESSYPYRGTDGSC---RRSASAASIRGYEDVPANNEAALMAAVAHQPVSVAINGGDS 273

Query: 271 AFHFYKSGVLNAD-CGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRI-- 327
            F FY SGVL    CG   +H +  VG+GTA +  G KYW++KNSWG +WGE GY+RI  
Sbjct: 274 VFRFYDSGVLGGSGCGTELNHAITAVGYGTASD--GTKYWIMKNSWGGSWGEGGYVRIRR 331

Query: 328 -LRDAGLCGIATAASYPV 344
            +R  G+CG+A  ASYPV
Sbjct: 332 GVRGEGVCGLAQLASYPV 349


>gi|297816028|ref|XP_002875897.1| hypothetical protein ARALYDRAFT_347926 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297321735|gb|EFH52156.1| hypothetical protein ARALYDRAFT_347926 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 361

 Score =  318 bits (816), Expect = 2e-84,   Method: Compositional matrix adjust.
 Identities = 158/347 (45%), Positives = 223/347 (64%), Gaps = 15/347 (4%)

Query: 9   FIIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQ 68
            +I +F ++IL   C           E  + + +++W + H    +   E+  R N+F+ 
Sbjct: 5   LLIFLFSLVILETACGFDYEDKEIESEEGLSKLYDRWRSHHS-VPRSLHEREKRFNVFRH 63

Query: 69  NLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTG----YNRPVPSVSRQSSRPSTFK 124
           N+ ++  +NK+ NR+YKL  N+F+DLT  EF+  YTG    ++R +    R  S+   + 
Sbjct: 64  NVMHVHNSNKK-NRSYKLKLNKFADLTIHEFKNAYTGSKIKHHRMLQGPKR-GSKQFMYD 121

Query: 125 YQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLV 184
           ++NV+ +P+S+DWR+KGAVT IK+QG+CGSCWAFS VAAVEGI +I   KL+ LSEQ+LV
Sbjct: 122 HENVSKLPSSVDWRKKGAVTEIKNQGKCGSCWAFSTVAAVEGINKIKTNKLVSLSEQELV 181

Query: 185 DCSTD-NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYED 243
           DC T+ N GC+GGLM+ AFE+I +N G+ TE  YPY   +G CD  K+  V  TI  +E+
Sbjct: 182 DCDTNQNEGCNGGLMEIAFEFIKKNGGITTEDSYPYEGIDGKCDASKDNGVLVTIDGHEN 241

Query: 244 LPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEE 303
           +P+ DE ALL+AV+NQPVSV +DA    F FY  GV   DCG   +HGVA VG+G+   +
Sbjct: 242 VPENDENALLKAVANQPVSVAIDAGSSDFQFYSEGVFTGDCGTELNHGVATVGYGS---Q 298

Query: 304 NGAKYWLIKNSWGETWGESGYIRILR----DAGLCGIATAASYPVAI 346
            G KYW+++NSWG  WGE GYI+I R      G CGIA  ASYP+ +
Sbjct: 299 GGKKYWIVRNSWGTEWGEGGYIKIERGIDEPEGRCGIAMEASYPIKL 345


>gi|118120|sp|P25249.1|CYSP1_HORVU RecName: Full=Cysteine proteinase EP-B 1; Flags: Precursor
 gi|1146116|gb|AAA85035.1| cysteine proteinase EPB1 precursor [Hordeum vulgare subsp. vulgare]
          Length = 371

 Score =  318 bits (816), Expect = 2e-84,   Method: Compositional matrix adjust.
 Identities = 164/321 (51%), Positives = 205/321 (63%), Gaps = 17/321 (5%)

Query: 35  EPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDL 94
           E ++ + +E+W + H R  +   EK  R   FK N  +I   NK G+  Y+L  N F D+
Sbjct: 39  EEALWDLYERWQSAH-RVRRHHAEKHRRFGTFKSNAHFIHSHNKRGDHPYRLHLNRFGDM 97

Query: 95  TNEEFRALYTG-YNRPVPSVSRQSSRPSTFKYQ--NVTDVPTSIDWREKGAVTHIKDQGQ 151
              EFRA + G   R  P+  +  S P  F Y   NV+D+P S+DWR+KGAVT +KDQG+
Sbjct: 98  DQAEFRATFVGDLRRDTPA--KPPSVPG-FMYAALNVSDLPPSVDWRQKGAVTGVKDQGK 154

Query: 152 CGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST-DNHGCSGGLMDKAFEYIIENKG 210
           CGSCWAFS V +VEGI  I  G L+ LSEQ+L+DC T DN GC GGLMD AFEYI  N G
Sbjct: 155 CGSCWAFSTVVSVEGINAIRTGSLVSLSEQELIDCDTADNDGCQGGLMDNAFEYIKNNGG 214

Query: 211 LATEADYPYRHEEGTCD---NQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDA 267
           L TEA YPYR   GTC+     +   V   I  ++D+P   E+ L +AV+NQPVSV V+A
Sbjct: 215 LITEAAYPYRAARGTCNVARAAQNSPVVVHIDGHQDVPANSEEDLARAVANQPVSVAVEA 274

Query: 268 SGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRI 327
           SG+AF FY  GV   DCG   DHGVAVVG+G AE+  G  YW +KNSWG +WGE GYIR+
Sbjct: 275 SGKAFMFYSEGVFTGDCGTELDHGVAVVGYGVAED--GKAYWTVKNSWGPSWGEQGYIRV 332

Query: 328 LRDA----GLCGIATAASYPV 344
            +D+    GLCGIA  ASYPV
Sbjct: 333 EKDSGASGGLCGIAMEASYPV 353


>gi|242092702|ref|XP_002436841.1| hypothetical protein SORBIDRAFT_10g009840 [Sorghum bicolor]
 gi|241915064|gb|EER88208.1| hypothetical protein SORBIDRAFT_10g009840 [Sorghum bicolor]
          Length = 328

 Score =  318 bits (816), Expect = 2e-84,   Method: Compositional matrix adjust.
 Identities = 160/342 (46%), Positives = 216/342 (63%), Gaps = 32/342 (9%)

Query: 13  MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEY 72
           +  I+ L   C + + +     + ++V +HEQWM Q+ R YKD  EKA R  +FK N+++
Sbjct: 8   ILAILGLAFFCGAALAARDLNDDSAMVARHEQWMVQYSRVYKDTTEKARRFEVFKANVKF 67

Query: 73  IEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYN-RPVP-SVSRQSSRPSTFKYQNVT- 129
           IE  N  GNR + LG N+F+DLTN+EFRA  T    +P P  VS      + F+Y+NV+ 
Sbjct: 68  IESFNAGGNRKFWLGVNQFADLTNDEFRATKTNKGFKPSPVKVS------TGFRYENVSV 121

Query: 130 -DVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST 188
             +P +IDWR KGAVT IKDQGQC            EGI +I+ GKLI LSEQ+LVDC  
Sbjct: 122 DALPATIDWRTKGAVTPIKDQGQC------------EGIVKISTGKLISLSEQELVDCDV 169

Query: 189 --DNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPK 246
             ++ GC GGLMD AF++II+N GL TE+ YPY   +G C +      AAT+  +ED+P 
Sbjct: 170 HGEDQGCEGGLMDDAFKFIIKNGGLTTESSYPYTAADGKCKSGSNS--AATVKGFEDVPA 227

Query: 247 GDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGA 306
            DE AL++AV+NQPVSV VD     F FY  GV+   CG + DHG+A +G+G  +  +G 
Sbjct: 228 NDEAALMKAVANQPVSVAVDGGDMTFQFYSGGVMTGSCGTDLDHGIAAIGYG--QTSDGT 285

Query: 307 KYWLIKNSWGETWGESGYIRILRDA----GLCGIATAASYPV 344
           KYWL+KNSWG TWGE+GY+R+ +D     G+CG+A   SYP 
Sbjct: 286 KYWLLKNSWGTTWGENGYLRMEKDISDKRGMCGLAMEPSYPT 327


>gi|109119897|dbj|BAE96008.1| cysteine proteinase [Triticum aestivum]
          Length = 377

 Score =  318 bits (816), Expect = 2e-84,   Method: Compositional matrix adjust.
 Identities = 162/322 (50%), Positives = 206/322 (63%), Gaps = 15/322 (4%)

Query: 35  EPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDL 94
           E ++ + +E+W   H R  +   EK  R   FK N+ +I   NK G+R Y+L  N F D+
Sbjct: 39  EEALWDLYERWQTAH-RVPRHHAEKHRRFGTFKSNVHFIHSHNKRGDRPYRLRLNRFGDM 97

Query: 95  TNEEFRALYTGYNRPVPSVSRQSSRPST--FKYQ--NVTDVPTSIDWREKGAVTHIKDQG 150
           +  EFRA + G           ++ PS   F Y   NV+D+P S+DWR+KGAVT +K+QG
Sbjct: 98  SQAEFRATFAGSRVSDRRRDGPATPPSVPGFMYAAVNVSDLPRSVDWRQKGAVTGVKNQG 157

Query: 151 QCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST-DNHGCSGGLMDKAFEYIIENK 209
           +CGSCWAFS V +VEGI  I  GKL+ LSEQ+L+DC T DN GC GGLMD AFEYI +N 
Sbjct: 158 KCGSCWAFSTVVSVEGINAIRTGKLVSLSEQELIDCDTADNDGCEGGLMDNAFEYIKKNG 217

Query: 210 GLATEADYPYRHEEGTCDN---QKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVD 266
           GL TEA YPYR   GTC      K   +   I  ++D+P   E+AL +AV+NQPVSV +D
Sbjct: 218 GLTTEAAYPYRAANGTCKAAKVAKSSPMVVHIDGHQDVPANSEEALAKAVANQPVSVGID 277

Query: 267 ASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIR 326
           ASG+AF FY  GV   +CG   DHGVAVVG+G AE+  G  YW +KNSWG +WGE GYIR
Sbjct: 278 ASGKAFMFYSEGVFTGECGTELDHGVAVVGYGVAED--GKAYWTVKNSWGPSWGEKGYIR 335

Query: 327 ILRDA----GLCGIATAASYPV 344
           + +D+    GLCGIA  ASY V
Sbjct: 336 VEKDSGAEGGLCGIAMEASYAV 357


>gi|363814535|ref|NP_001242660.1| uncharacterized protein LOC100807362 precursor [Glycine max]
 gi|255636658|gb|ACU18666.1| unknown [Glycine max]
          Length = 367

 Score =  318 bits (816), Expect = 2e-84,   Method: Compositional matrix adjust.
 Identities = 164/354 (46%), Positives = 222/354 (62%), Gaps = 22/354 (6%)

Query: 8   SFIIPMFVIIILVITCASQVVSGRSMH--------EPSIVEKHEQWMAQHGRTYKDELEK 59
           + +I +F ++ +       ++S    H        +  ++  +E+W+ +HG+ Y    EK
Sbjct: 10  TILIVLFTVLAVSSALDMSIISYDRSHADKSGWKSDEEVMSIYEEWLVKHGKVYNAVEEK 69

Query: 60  AMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSR 119
             R  IFK NL +IE+ N   NRTYK+G N FSDL+NEE+R+ Y G  +  PS  R  +R
Sbjct: 70  EKRFQIFKDNLNFIEEHNAV-NRTYKVGLNRFSDLSNEEYRSKYLG-TKIDPS--RMMAR 125

Query: 120 PSTFKYQNVTD-VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIEL 178
           PS      V D +P S+DWR++GAV  +K+Q +C  CWAFSA+AAVEGI +I  G L  L
Sbjct: 126 PSRRYSPRVADNLPESVDWRKEGAVVRVKNQSECEGCWAFSAIAAVEGINKIVTGNLTAL 185

Query: 179 SEQQLVDCS-TDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAAT 237
           SEQ+L+DC  T N GCSGGL+D AFE+II N G+ TE DYP++  +G CD  K  A A T
Sbjct: 186 SEQELLDCDRTVNAGCSGGLVDYAFEFIINNGGIDTEEDYPFQGADGICDQYKINARAVT 245

Query: 238 ISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGF 297
           I  YE +P  DE AL +AV+NQPVSV ++A G+ F  Y+SG+    CG + DHGV  VG+
Sbjct: 246 IDGYERVPAYDELALKKAVANQPVSVAIEAYGKEFQLYESGIFTGTCGTSIDHGVTAVGY 305

Query: 298 GTAEEENGAKYWLIKNSWGETWGESGYIRILRD-----AGLCGIATAASYPVAI 346
           GT   ENG  YW++KNSWGE WGE+GY+ + R+     AG CGIA    YP+ I
Sbjct: 306 GT---ENGIDYWIVKNSWGENWGEAGYVGMERNIAEDTAGKCGIAILTLYPIKI 356


>gi|242072390|ref|XP_002446131.1| hypothetical protein SORBIDRAFT_06g002140 [Sorghum bicolor]
 gi|241937314|gb|EES10459.1| hypothetical protein SORBIDRAFT_06g002140 [Sorghum bicolor]
          Length = 328

 Score =  318 bits (815), Expect = 2e-84,   Method: Compositional matrix adjust.
 Identities = 159/341 (46%), Positives = 221/341 (64%), Gaps = 32/341 (9%)

Query: 16  IIILVITCAS---QVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEY 72
            ++ ++ CAS    V++ R + + ++VE+HE WM ++GR YKD  EKA R  +FK N+ +
Sbjct: 7   FLLAILGCASLCSSVLAARELSDAAMVERHENWMVEYGRVYKDAAEKARRFQVFKDNVAF 66

Query: 73  IEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPST-FKYQN--VT 129
           +E  N   N  + LG N+F+DLT EEF+A   G+      V      P+T FKY+N  V+
Sbjct: 67  VESFNTNKNNKFWLGVNQFADLTTEEFKA-NKGFKPTAEKV------PTTGFKYENLSVS 119

Query: 130 DVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD 189
            +PT++DWR KGAVT IK+QGQC         AA+EGI +++ G LI LSEQ+LVDC T 
Sbjct: 120 ALPTAVDWRTKGAVTPIKNQGQC---------AAMEGIVKLSTGNLISLSEQELVDCDTH 170

Query: 190 --NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKG 247
             + GC GG MD AFE++I+N GLATE++YPY+  +G C    +   AATI  +ED+P  
Sbjct: 171 SMDEGCEGGWMDSAFEFVIKNGGLATESNYPYKAVDGKCKGGSKS--AATIKGHEDVPVN 228

Query: 248 DEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAK 307
           +E AL++AV+NQPVSV VDAS R F  Y  GV+   CG   DHG+A +G+G   E +G K
Sbjct: 229 NEAALMKAVANQPVSVAVDASDRTFMLYSGGVMTGSCGTELDHGIAAIGYGM--ESDGTK 286

Query: 308 YWLIKNSWGETWGESGYIRILRDA----GLCGIATAASYPV 344
           YW++KNSWG TWGE G++R+ +D     G+CG+A   SYP 
Sbjct: 287 YWILKNSWGTTWGEKGFLRMEKDITDKRGMCGLAMKPSYPT 327


>gi|356563155|ref|XP_003549830.1| PREDICTED: vignain-like [Glycine max]
          Length = 361

 Score =  318 bits (815), Expect = 2e-84,   Method: Compositional matrix adjust.
 Identities = 162/349 (46%), Positives = 226/349 (64%), Gaps = 13/349 (3%)

Query: 3   LKFEKSFIIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDEL-EKAM 61
           ++ +K F + +   ++L +  + +        E  + + +E+W + H  T    L EK  
Sbjct: 1   MEVKKVFFVALSFALVLRVAESFEFNEKDLESEEGLWDLYERWRSHH--TVSRSLDEKHN 58

Query: 62  RLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPS 121
           R N+FK N+ ++  +NK  ++ YKL  N F+D+TN EFR++Y G       + R + R +
Sbjct: 59  RFNVFKGNVMHVHSSNKM-DKPYKLKLNRFADMTNHEFRSIYAGSKVNHHRMFRGTPRGN 117

Query: 122 -TFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSE 180
            TF YQNV  VP+S+DWR+KGAVT +KDQGQCGSCWAFS + AVEGI QI   KL+ LSE
Sbjct: 118 GTFMYQNVDRVPSSVDWRKKGAVTDVKDQGQCGSCWAFSTIVAVEGINQIKTHKLVPLSE 177

Query: 181 QQLVDC-STDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATIS 239
           Q+LVDC +T N GC+GGLM+ AFE+ I+  G+ T ++YPY  ++GTCD  K    A +I 
Sbjct: 178 QELVDCDTTQNQGCNGGLMESAFEF-IKQYGITTASNYPYEAKDGTCDASKVNEPAVSID 236

Query: 240 KYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGT 299
            +E++P  +E ALL+AV++QPVSV ++A G  F FY  GV   +CG   DHGVA+VG+GT
Sbjct: 237 GHENVPVNNEAALLKAVAHQPVSVAIEAGGIDFQFYSEGVFTGNCGTALDHGVAIVGYGT 296

Query: 300 AEEENGAKYWLIKNSWGETWGESGYIRILRD----AGLCGIATAASYPV 344
            ++  G KYW +KNSWG  WGE GYIR+ R      GLCGIA  ASYP+
Sbjct: 297 TQD--GTKYWTVKNSWGSEWGEKGYIRMKRSISVKKGLCGIAMEASYPI 343


>gi|297845064|ref|XP_002890413.1| hypothetical protein ARALYDRAFT_472321 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297336255|gb|EFH66672.1| hypothetical protein ARALYDRAFT_472321 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 357

 Score =  318 bits (815), Expect = 2e-84,   Method: Compositional matrix adjust.
 Identities = 155/314 (49%), Positives = 217/314 (69%), Gaps = 13/314 (4%)

Query: 38  IVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNE 97
           ++E  E W++   + Y+   EK +R  +FK NL++I++ NK+  ++Y LG NEF+DL++E
Sbjct: 47  LIELFENWISNFEKAYETVEEKLLRFEVFKDNLKHIDETNKK-VKSYWLGLNEFADLSHE 105

Query: 98  EFRALYTGYNRPVPSVSRQSSRP-STFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCW 156
           EF+ +Y G    +  V R   R  + F Y++V  VP S+DWR+KGAV  +K+QG CGSCW
Sbjct: 106 EFKKMYLGLKTDI--VRRDEERSYAEFAYRDVEAVPKSVDWRKKGAVAEVKNQGSCGSCW 163

Query: 157 AFSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGLATEA 215
           AFS VAAVEGI +I  G L  LSEQ+L+DC T  N+GC+GGLMD AFEYI++N GL  E 
Sbjct: 164 AFSTVAAVEGINKIVTGNLTTLSEQELIDCDTTYNNGCNGGLMDYAFEYIVKNGGLRKEE 223

Query: 216 DYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFY 275
           DYPY  EEGTC+ QK+++   TI  ++D+P  DE++LL+A+++QP+SV +DASGR F FY
Sbjct: 224 DYPYSMEEGTCEMQKDESETVTIDGHQDVPTNDEKSLLKALAHQPLSVAIDASGREFQFY 283

Query: 276 KS-GVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRDA--- 331
               V +  CG + DHGVA VG+G+++   G+ Y ++KNSWG  WGE GYIR+ R+    
Sbjct: 284 SGVSVFDGRCGVDLDHGVAAVGYGSSK---GSDYIIVKNSWGPKWGEKGYIRLKRNTGKP 340

Query: 332 -GLCGIATAASYPV 344
            GLCGI   AS+P 
Sbjct: 341 EGLCGINKMASFPT 354


>gi|255568345|ref|XP_002525147.1| cysteine protease, putative [Ricinus communis]
 gi|223535606|gb|EEF37274.1| cysteine protease, putative [Ricinus communis]
          Length = 347

 Score =  318 bits (815), Expect = 2e-84,   Method: Compositional matrix adjust.
 Identities = 154/310 (49%), Positives = 206/310 (66%), Gaps = 15/310 (4%)

Query: 41  KHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFR 100
           ++++W+ Q+GR Y  + E  +R  I+  N+++IE  N + N ++KL  N+F+DLTN+EF 
Sbjct: 45  RYDKWLEQYGRKYDTKDEYLLRFGIYHSNIQFIEYINSQ-NLSFKLTDNKFADLTNDEFN 103

Query: 101 ALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSA 160
           ++Y GY      +     R  +  ++N TD+P ++DWRE GAVT IKDQGQCGSCWAFSA
Sbjct: 104 SIYLGY-----QIRSYKRRNLSHMHENSTDLPDAVDWRENGAVTPIKDQGQCGSCWAFSA 158

Query: 161 VAAVEGITQITRGKLIELSEQQLVDCST--DNHGCSGGLMDKAFEYIIENKGLATEADYP 218
           VAAVEGI +I  G L+ LSEQ+LVDC    DN GC+GG M+KAF +I    GL TE DYP
Sbjct: 159 VAAVEGINKIKTGNLVSLSEQELVDCDVNGDNKGCNGGFMEKAFTFIKSIGGLTTENDYP 218

Query: 219 YRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSG 278
           Y+  +G+C+  K    A  I  YE +P  +E +L  AVS QPVSV +DASG  F  Y  G
Sbjct: 219 YKGTDGSCEKAKTDNHAVIIGGYETVPANNENSLKVAVSKQPVSVAIDASGYEFQLYSEG 278

Query: 279 VLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRDA----GLC 334
           V +  CG   +HGV +VG+G   + NG KYWL+KNSWG+ WGESGYIR+ RD+    G+C
Sbjct: 279 VFSGYCGIQLNHGVTIVGYG---DNNGQKYWLVKNSWGKGWGESGYIRMKRDSSDTKGMC 335

Query: 335 GIATAASYPV 344
           GIA   SYP+
Sbjct: 336 GIAMEPSYPI 345


>gi|297852302|ref|XP_002894032.1| F2G19.31/F2G19.31 [Arabidopsis lyrata subsp. lyrata]
 gi|297339874|gb|EFH70291.1| F2G19.31/F2G19.31 [Arabidopsis lyrata subsp. lyrata]
          Length = 455

 Score =  318 bits (814), Expect = 3e-84,   Method: Compositional matrix adjust.
 Identities = 158/322 (49%), Positives = 214/322 (66%), Gaps = 16/322 (4%)

Query: 30  GRSMHEPSIVEKHEQWMAQHGRTYKDE--LEKAMRLNIFKQNLEYIEKANKEGNRTYKLG 87
           GRS  E  ++  +E W+ +HG+       +EK  R  IFK NL +I+  NK+ N +Y+LG
Sbjct: 33  GRSDAE--VMSIYEAWLVKHGKAQNQNSLVEKDRRFEIFKDNLRFIDDHNKK-NLSYRLG 89

Query: 88  TNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIK 147
              F+DLTN+E+R+ Y G         R S R   ++ +   ++P SIDWR+KGAV  +K
Sbjct: 90  LTRFADLTNDEYRSKYLGAKMEKKGERRTSQR---YEARVGDELPESIDWRKKGAVAEVK 146

Query: 148 DQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYII 206
           DQG CGSCWAFS + AVEGI QI  G LI LSEQ+LVDC T  N GC+GGLMD AFE+II
Sbjct: 147 DQGSCGSCWAFSTIGAVEGINQIVTGDLITLSEQELVDCDTSYNEGCNGGLMDYAFEFII 206

Query: 207 ENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVD 266
           +N G+ T+ DYPY+  +GTCD  ++ A   TI  YED+P   E++L +AV++QPVSV ++
Sbjct: 207 KNGGIDTDKDYPYKGVDGTCDQIRKNAKVVTIDSYEDVPTYSEESLKKAVAHQPVSVAIE 266

Query: 267 ASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIR 326
           A GRAF  Y SG+ +  CG   DHGV  VG+GT   ENG  YW+++NSWG++WGESGY++
Sbjct: 267 AGGRAFQLYDSGIFDGTCGTQLDHGVVAVGYGT---ENGKDYWIVRNSWGKSWGESGYLK 323

Query: 327 ILRD----AGLCGIATAASYPV 344
           + R+    +G CGIA   SYP+
Sbjct: 324 MARNIASSSGKCGIAIEPSYPI 345


>gi|116781957|gb|ABK22314.1| unknown [Picea sitchensis]
          Length = 369

 Score =  318 bits (814), Expect = 3e-84,   Method: Compositional matrix adjust.
 Identities = 158/317 (49%), Positives = 207/317 (65%), Gaps = 12/317 (3%)

Query: 35  EPSIVEKHEQWMAQHGRTYK-DELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSD 93
           E S+   ++ W  QH  +   D  E A R  IFK+N++YI+  NK+ +  YKLG N+F+D
Sbjct: 39  EKSLRSLYDNWALQHRSSRSLDSEEHAERFEIFKENVKYIDSVNKK-DSPYKLGLNKFAD 97

Query: 94  LTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCG 153
           L+NEEF+A+Y G    +     +  +  +F YQN   +P SIDWR+KGAV  +K+QG CG
Sbjct: 98  LSNEEFKAIYMGTKMDLRG--DREVQSGSFMYQNSEPLPASIDWRQKGAVAAVKNQGHCG 155

Query: 154 SCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTDNHGCSGGLMDKAFEYIIENKGLAT 213
           SCWAFS VA+VEGI  IT G L+ LSEQQLVDCST+N GC+GGLMD AF+YII N G+ T
Sbjct: 156 SCWAFSTVASVEGINYITTGNLVSLSEQQLVDCSTENSGCNGGLMDTAFQYIINNGGIVT 215

Query: 214 EADYPYRHEEGTCDNQK--EKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRA 271
           E +YPY  E   C + K   +     I  +ED+P  +EQAL +AV++QPVSV ++ASG+ 
Sbjct: 216 EDNYPYTAEATECSSTKINSQTTRVVIDGFEDVPANNEQALKEAVAHQPVSVAIEASGQD 275

Query: 272 FHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD- 330
           F FY +GV    CG   DHGV  VG+GT+ E  G  YW+++NSWG  WGE GYIR+ +  
Sbjct: 276 FQFYSTGVFTGKCGTALDHGVVAVGYGTSPE--GINYWIVRNSWGPKWGEEGYIRMQQGI 333

Query: 331 ---AGLCGIATAASYPV 344
               G CGIA  ASYP 
Sbjct: 334 EAAEGKCGIAMQASYPT 350


>gi|118124|sp|P25250.1|CYSP2_HORVU RecName: Full=Cysteine proteinase EP-B 2; Flags: Precursor
 gi|1146118|gb|AAA85036.1| cysteine proteinase EPB2 precursor [Hordeum vulgare subsp. vulgare]
          Length = 373

 Score =  318 bits (814), Expect = 3e-84,   Method: Compositional matrix adjust.
 Identities = 164/321 (51%), Positives = 205/321 (63%), Gaps = 17/321 (5%)

Query: 35  EPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDL 94
           E ++ + +E+W + H R  +   EK  R   FK N  +I   NK G+  Y+L  N F D+
Sbjct: 39  EEALWDLYERWQSAH-RVRRHHAEKHRRFGTFKSNAHFIHSHNKRGDHPYRLHLNRFGDM 97

Query: 95  TNEEFRALYTG-YNRPVPSVSRQSSRPSTFKYQ--NVTDVPTSIDWREKGAVTHIKDQGQ 151
              EFRA + G   R  PS  +  S P  F Y   NV+D+P S+DWR+KGAVT +KDQG+
Sbjct: 98  DQAEFRATFVGDLRRDTPS--KPPSVPG-FMYAALNVSDLPPSVDWRQKGAVTGVKDQGK 154

Query: 152 CGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST-DNHGCSGGLMDKAFEYIIENKG 210
           CGSCWAFS V +VEGI  I  G L+ LSEQ+L+DC T DN GC GGLMD AFEYI  N G
Sbjct: 155 CGSCWAFSTVVSVEGINAIRTGSLVSLSEQELIDCDTADNDGCQGGLMDNAFEYIKNNGG 214

Query: 211 LATEADYPYRHEEGTCD---NQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDA 267
           L TEA YPYR   GTC+     +   V   I  ++D+P   E+ L +AV+NQPVSV V+A
Sbjct: 215 LITEAAYPYRAARGTCNVARAAQNSPVVVHIDGHQDVPANSEEDLARAVANQPVSVAVEA 274

Query: 268 SGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRI 327
           SG+AF FY  GV   +CG   DHGVAVVG+G AE+  G  YW +KNSWG +WGE GYIR+
Sbjct: 275 SGKAFMFYSEGVFTGECGTELDHGVAVVGYGVAED--GKAYWTVKNSWGPSWGEQGYIRV 332

Query: 328 LRDA----GLCGIATAASYPV 344
            +D+    GLCGIA  ASYPV
Sbjct: 333 EKDSGASGGLCGIAMEASYPV 353


>gi|225456820|ref|XP_002278323.1| PREDICTED: vignain [Vitis vinifera]
          Length = 360

 Score =  318 bits (814), Expect = 3e-84,   Method: Compositional matrix adjust.
 Identities = 161/317 (50%), Positives = 208/317 (65%), Gaps = 12/317 (3%)

Query: 35  EPSIVEKHEQWMAQHGRTYKDEL-EKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSD 93
           E S+   +E+W + H  T    L EK  R N+FK+N+ ++ + NK+ +  YKL  N+F+D
Sbjct: 31  EESLWNLYERWRSHH--TVSRSLDEKHKRFNVFKENVNFVHEFNKK-DEPYKLKLNKFAD 87

Query: 94  LTNEEFRALYTGYNRPVPSVSRQSSRPS-TFKYQNVTDVPTSIDWREKGAVTHIKDQGQC 152
           +TN EFR+ Y G       + R S   + +F Y+ V  VP S+DWR+KGAVT IKDQGQC
Sbjct: 88  MTNHEFRSTYAGSKVNHHRMFRGSQHAAGSFMYEKVKSVPPSVDWRKKGAVTPIKDQGQC 147

Query: 153 GSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST-DNHGCSGGLMDKAFEYIIENKGL 211
           GSCWAFS V AVEGI  I   KL+ LSEQ+LVDC T +N GC+GGLM  AFE+I E  G+
Sbjct: 148 GSCWAFSTVVAVEGINHIKTNKLVSLSEQELVDCDTSENQGCNGGLMGYAFEFIKEKGGI 207

Query: 212 ATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRA 271
            TE  YPY  E+GTCD  K  +   +I  +E +P  +E ALL+A +NQP+SV +DA G A
Sbjct: 208 TTEQSYPYTAEDGTCDVSKVNSPVVSIDGHETVPPNNEDALLKAAANQPISVAIDAGGSA 267

Query: 272 FHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILR-- 329
           F FY  GV    CG + DHGVA+VG+GT  +  G KYW++KNSWG  WGE+GYIR+ R  
Sbjct: 268 FQFYSEGVFAGRCGTDLDHGVAIVGYGTTLD--GTKYWIVKNSWGTDWGENGYIRMKRGI 325

Query: 330 --DAGLCGIATAASYPV 344
               GLCGIA  ASYP+
Sbjct: 326 SAKEGLCGIAVEASYPI 342


>gi|302759380|ref|XP_002963113.1| hypothetical protein SELMODRAFT_270344 [Selaginella moellendorffii]
 gi|300169974|gb|EFJ36576.1| hypothetical protein SELMODRAFT_270344 [Selaginella moellendorffii]
          Length = 479

 Score =  318 bits (814), Expect = 3e-84,   Method: Compositional matrix adjust.
 Identities = 165/319 (51%), Positives = 210/319 (65%), Gaps = 26/319 (8%)

Query: 43  EQWMAQHGRTYKDEL--------EKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDL 94
           + WM QHG++Y +          EKA R  IFK NL +I   N E N+ Y LG N F+DL
Sbjct: 58  DSWMLQHGKSYAENALSGDSQAGEKATRYGIFKDNLRFIHGEN-EKNQGYFLGLNAFADL 116

Query: 95  TNEEFRALYTG--YNRPVPSVSRQSSRPSTFKYQNV--TDVPTSIDWREKGAVTHIKDQG 150
           TNEEFRA   G  ++R     SR+ +    F+Y +V   D+P SIDWREKGAV  +KDQG
Sbjct: 117 TNEEFRAQRHGGRFDR-----SRERTSYEEFRYGSVQLKDLPDSIDWREKGAVVGVKDQG 171

Query: 151 QCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST-DNHGCSGGLMDKAFEYIIENK 209
            CGSCWAFSAVAA+EG+ ++  G+L+ LSEQ+LVDC   ++ GC+GGLMD AF ++I+N 
Sbjct: 172 SCGSCWAFSAVAAIEGVNKLATGELVSLSEQELVDCDKGEDEGCNGGLMDYAFGFVIKNG 231

Query: 210 GLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASG 269
           GL TEADYPY+     CD  K  A   TI  YED+P  DE ALL+AV++QPVSV +DA G
Sbjct: 232 GLDTEADYPYKGYGTRCDRSKMNAKVVTIDGYEDVPVNDETALLKAVAHQPVSVAIDAGG 291

Query: 270 RAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILR 329
            +  FY+SG+    CG + DHGV  VG+G   +E+G  YW+IKNSWG  WGE GYI++ R
Sbjct: 292 SSMQFYRSGIFTGRCGTDLDHGVTNVGYG---KEDGKAYWIIKNSWGSNWGEKGYIKMAR 348

Query: 330 D----AGLCGIATAASYPV 344
           +    AGLCGI   ASYP 
Sbjct: 349 NTGLAAGLCGINMEASYPT 367


>gi|242077600|ref|XP_002448736.1| hypothetical protein SORBIDRAFT_06g032320 [Sorghum bicolor]
 gi|241939919|gb|EES13064.1| hypothetical protein SORBIDRAFT_06g032320 [Sorghum bicolor]
          Length = 467

 Score =  317 bits (813), Expect = 4e-84,   Method: Compositional matrix adjust.
 Identities = 152/318 (47%), Positives = 212/318 (66%), Gaps = 13/318 (4%)

Query: 35  EPSIVEKHEQWMAQHGRTYKDEL-EKAMRLNIFKQNLEYIEKAN-KEGNRTYKLGTNEFS 92
           E  +   +E W+ +HGR   + L E   R  +F  NL +++  N + G   ++LG N+F+
Sbjct: 49  EAEVRAMYELWLVEHGRRVSNVLGEHDSRFRVFWDNLRFVDAHNERAGEHGFRLGMNQFA 108

Query: 93  DLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQC 152
           DLTN+EFRA Y G    +P+    ++    +++    ++P S+DWREKGAV  +K+QGQC
Sbjct: 109 DLTNDEFRAAYLGAR--IPAARSGNAVGEMYRHDGAEELPESVDWREKGAVAPVKNQGQC 166

Query: 153 GSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD--NHGCSGGLMDKAFEYIIENKG 210
           GSCWAFSAV++VE I QI  G+++ LSEQ+LV+CSTD  N GC+GGLMD AF +II+N G
Sbjct: 167 GSCWAFSAVSSVESINQIVTGEMVTLSEQELVECSTDGGNSGCNGGLMDAAFNFIIKNGG 226

Query: 211 LATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGR 270
           + TE DYPY+  +G CD  +  A   +I  +ED+P+ DE++L +AV++QPVSV ++A GR
Sbjct: 227 IDTEDDYPYKAVDGKCDINRRNAKVVSIDAFEDVPENDEKSLQKAVAHQPVSVAIEAGGR 286

Query: 271 AFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD 330
            F  YKSGV +  C  N DHGV  VG+GT   ENG  YW+++NSWG  WGE+GYIR+ R+
Sbjct: 287 QFQLYKSGVFSGSCTTNLDHGVVAVGYGT---ENGKDYWIVRNSWGPKWGEAGYIRMERN 343

Query: 331 ----AGLCGIATAASYPV 344
                G CGIA  ASYP 
Sbjct: 344 INATTGKCGIAMMASYPT 361


>gi|22093636|dbj|BAC06931.1| putative cysteine proteinase [Oryza sativa Japonica Group]
 gi|50510021|dbj|BAD30633.1| putative cysteine proteinase [Oryza sativa Japonica Group]
          Length = 352

 Score =  317 bits (813), Expect = 4e-84,   Method: Compositional matrix adjust.
 Identities = 161/343 (46%), Positives = 225/343 (65%), Gaps = 15/343 (4%)

Query: 13  MFVIIILVITCASQVVSGRSM--HEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNL 70
           M   ++LV+      ++  +M     ++  +H++WMA+HGRTYKD  EKA R  +FK N+
Sbjct: 11  MAASLLLVVAGGLSTMAKVTMASRAGTMEARHDKWMAEHGRTYKDAAEKARRFRVFKANV 70

Query: 71  EYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTD 130
           + I+++N  GN+ Y+L TN F+DLT+ EF A+YTGYN   P+ +  ++  +T +  +  D
Sbjct: 71  DLIDRSNAAGNKRYRLATNRFTDLTDAEFAAMYTGYN---PANTMYAAANATTRLSSEDD 127

Query: 131 -VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD 189
             P  +DWR++GAVT +K+Q  CG CWAFS VAAVEGI QIT G+L+ LSEQQL+DC+ D
Sbjct: 128 QQPAEVDWRQQGAVTGVKNQRSCGCCWAFSTVAAVEGIHQITTGELVSLSEQQLLDCA-D 186

Query: 190 NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCD---NQKEKAVAATISKYEDLPK 246
           N GC+GG +D AF+Y+  + G+ TEA Y Y+  +G C    +     VAATIS Y+ +  
Sbjct: 187 NGGCTGGSLDNAFQYMANSGGVTTEAAYAYQGAQGACQFDASSSASGVAATISGYQRVNP 246

Query: 247 GDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNAD-CGNNCDHGVAVVGFGT-AEEEN 304
            DE +L  AV++QPVSV ++ SG  F  Y SGV  AD CG   DH VAVVG+G  A+   
Sbjct: 247 NDEGSLAAAVASQPVSVAIEGSGAMFRHYGSGVFTADSCGTKLDHAVAVVGYGAEADGSG 306

Query: 305 GAKYWLIKNSWGETWGESGYIRILRDA---GLCGIATAASYPV 344
           G  YW+IKNSWG TWG+ GY+++ +D    G CG+A A SYPV
Sbjct: 307 GGGYWIIKNSWGTTWGDGGYMKLEKDVGSQGACGVAMAPSYPV 349


>gi|2224810|emb|CAB09698.1| cysteine proteinase [Hordeum vulgare subsp. vulgare]
          Length = 349

 Score =  317 bits (813), Expect = 4e-84,   Method: Compositional matrix adjust.
 Identities = 163/318 (51%), Positives = 216/318 (67%), Gaps = 13/318 (4%)

Query: 35  EPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDL 94
           + ++V +HE+WMA+HGRTY +E EKA RL +F+ N + I+  N   + T++L TN F+DL
Sbjct: 37  DAAMVSRHEKWMAEHGRTYANEEEKARRLEVFRANAKLIDSFNSAEDSTHRLATNRFADL 96

Query: 95  TNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQN--VTDVPTSIDWREKGAVTHIKDQGQC 152
           T+EEFRA  TG  RP  + +   S    F+Y+N  + D   S+DWR  GAVT +KDQG C
Sbjct: 97  TDEEFRAARTGLRRPPAAAAGAGSGAGGFRYENFSLADAAGSMDWRAMGAVTGVKDQGSC 156

Query: 153 GSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST--DNHGCSGGLMDKAFEYIIENKG 210
           G CWAFSAVAAVEG+T+I  G+L+ LSEQQLVDC    D+ GC+GGLMD AFEY+I   G
Sbjct: 157 GCCWAFSAVAAVEGLTKIRTGRLVSLSEQQLVDCDVYGDDEGCAGGLMDNAFEYMINRGG 216

Query: 211 LATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGR 270
           L TE+ YPYR  +G+C   +  A AA+I  YED+P  +E AL+ AV++QPVSV ++    
Sbjct: 217 LTTESSYPYRGTDGSC---RRSASAASIRGYEDVPANNEAALMAAVAHQPVSVAINGGDS 273

Query: 271 AFHFYKSGVLNAD-CGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRI-- 327
            F FY SGVL    CG   +H +   G+GTA +  G KYW++KNSWG +WGE GY+RI  
Sbjct: 274 VFRFYDSGVLGGSGCGTELNHAITAAGYGTASD--GTKYWIMKNSWGGSWGEGGYVRIRR 331

Query: 328 -LRDAGLCGIATAASYPV 344
            +R  G+CG+A  ASYPV
Sbjct: 332 GVRGEGVCGLAQLASYPV 349


>gi|50355615|dbj|BAD29956.1| cysteine protease [Daucus carota]
          Length = 423

 Score =  317 bits (813), Expect = 4e-84,   Method: Compositional matrix adjust.
 Identities = 154/305 (50%), Positives = 208/305 (68%), Gaps = 12/305 (3%)

Query: 46  MAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTG 105
           + +H + Y     K  R  IFK NL +I++ NK  N+++KLG N+F+DL+NEE+++++ G
Sbjct: 11  LVKHHKNYNALGAKEKRFEIFKDNLRFIDEHNKGVNQSFKLGLNKFADLSNEEYKSMFLG 70

Query: 106 YNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVE 165
             R V    R+      FKY    ++P S+DWREKGAV  +KDQGQCGSCWAFS VAAVE
Sbjct: 71  -GRMV--RDRKGFESDRFKYGVGDELPQSVDWREKGAVAPVKDQGQCGSCWAFSTVAAVE 127

Query: 166 GITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEG 224
           GI QI  G LI LSEQ+LVDC    N GC+GG MD AFE+I++N G+ TE DYPY+  +G
Sbjct: 128 GINQIATGDLISLSEQELVDCDKGFNQGCNGGFMDYAFEFIVKNGGIDTEDDYPYKGVDG 187

Query: 225 TCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADC 284
            CD  ++ A   TI+ +ED+P+ DE++L +AV++QPVSV ++A GRAF  Y+SG+ N  C
Sbjct: 188 QCDQNRKNAKVVTINGFEDVPQNDEKSLKKAVAHQPVSVAIEAGGRAFQLYESGIFNGLC 247

Query: 285 GNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILR-----DAGLCGIATA 339
           G + DHGV  VG+GT   E+G  YW+++NSWG  WGE+GYIR+ R     + G CGIA  
Sbjct: 248 GTDLDHGVVAVGYGT---EDGKDYWIVRNSWGPNWGENGYIRLERNVASTNTGKCGIAMQ 304

Query: 340 ASYPV 344
            SYP 
Sbjct: 305 PSYPT 309


>gi|351721126|ref|NP_001237199.1| cysteine proteinase precursor [Glycine max]
 gi|31559530|dbj|BAC77523.1| cysteine proteinase [Glycine max]
 gi|31559532|dbj|BAC77524.1| cysteine proteinase [Glycine max]
          Length = 362

 Score =  317 bits (813), Expect = 4e-84,   Method: Compositional matrix adjust.
 Identities = 160/345 (46%), Positives = 218/345 (63%), Gaps = 20/345 (5%)

Query: 14  FVIIILVITCASQVVSGRSMH------EPSIVEKHEQWMAQH--GRTYKDELEKAMRLNI 65
           F+ ++L ++    V +    H      E S+ + +E+W + H   R+  D   K  R N+
Sbjct: 6   FLWVVLSLSLVLGVANSFDFHDKDLESEESLWDLYERWRSHHTVSRSLGD---KHKRFNV 62

Query: 66  FKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPS-TFK 124
           FK N+ ++   NK  ++ YKL  N+F+D+TN EFR+ Y G       + R   R + TF 
Sbjct: 63  FKANMMHVHNTNKM-DKPYKLKLNKFADMTNHEFRSTYAGSKVNHHRMFRDMPRGNGTFM 121

Query: 125 YQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLV 184
           Y+ V  VP S+DWR+KGAVT +KDQG CGSCWAFS V AVEGI QI   KL+ LSEQ+LV
Sbjct: 122 YEKVGSVPASVDWRKKGAVTDVKDQGHCGSCWAFSTVVAVEGINQIKTNKLVSLSEQELV 181

Query: 185 DCSTD-NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYED 243
           DC T+ N GC+GGLM+ AF++I +  G+ TE+ YPY  ++GTCD  K   +A +I  +E+
Sbjct: 182 DCDTEENAGCNGGLMESAFQFIKQKGGITTESYYPYTAQDGTCDASKANDLAVSIDGHEN 241

Query: 244 LPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEE 303
           +P  DE ALL+AV+NQPVSV +DA G  F FY  GV   DC    +HGVA+VG+G   + 
Sbjct: 242 VPGNDENALLKAVANQPVSVAIDAGGSDFQFYSEGVFTGDCSTELNHGVAIVGYGATVD- 300

Query: 304 NGAKYWLIKNSWGETWGESGYIRILRD----AGLCGIATAASYPV 344
            G  YW+++NSWG  WGE GYIR+ R+     GLCGIA  ASYP+
Sbjct: 301 -GTSYWIVRNSWGPEWGELGYIRMQRNISKKEGLCGIAMLASYPI 344


>gi|218198967|gb|EEC81394.1| hypothetical protein OsI_24614 [Oryza sativa Indica Group]
          Length = 342

 Score =  317 bits (813), Expect = 5e-84,   Method: Compositional matrix adjust.
 Identities = 161/343 (46%), Positives = 225/343 (65%), Gaps = 15/343 (4%)

Query: 13  MFVIIILVITCASQVVSGRSM--HEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNL 70
           M   ++LV+      ++  +M     ++  +H++WMA+HGRTYKD  EKA R  +FK N+
Sbjct: 1   MAASLLLVVAGGLSTMAKVTMASRAGTMEARHDKWMAEHGRTYKDAAEKARRFRVFKANV 60

Query: 71  EYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTD 130
           + I+++N  GN+ Y+L TN F+DLT+ EF A+YTGYN   P+ +  ++  +T +  +  D
Sbjct: 61  DLIDRSNAAGNKRYRLATNRFTDLTDAEFAAMYTGYN---PANTMYAAANATTRLSSEDD 117

Query: 131 -VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD 189
             P  +DWR++GAVT +K+Q  CG CWAFS VAAVEGI QIT G+L+ LSEQQL+DC+ D
Sbjct: 118 QQPAEVDWRQQGAVTGVKNQRSCGCCWAFSTVAAVEGIHQITTGELVSLSEQQLLDCA-D 176

Query: 190 NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCD---NQKEKAVAATISKYEDLPK 246
           N GC+GG +D AF+Y+  + G+ TEA Y Y+  +G C    +     VAATIS Y+ +  
Sbjct: 177 NGGCTGGSLDNAFQYMANSGGVTTEAAYAYQGAQGACQFDASSSASGVAATISGYQRVNP 236

Query: 247 GDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNAD-CGNNCDHGVAVVGFGT-AEEEN 304
            DE +L  AV++QPVSV ++ SG  F  Y SGV  AD CG   DH VAVVG+G  A+   
Sbjct: 237 NDEGSLAAAVASQPVSVAIEGSGAMFRHYGSGVFTADSCGTKLDHAVAVVGYGAEADGSG 296

Query: 305 GAKYWLIKNSWGETWGESGYIRILRDA---GLCGIATAASYPV 344
           G  YW+IKNSWG TWG+ GY+++ +D    G CG+A A SYPV
Sbjct: 297 GGGYWIIKNSWGTTWGDGGYMKLEKDVGSQGACGVAMAPSYPV 339


>gi|374530932|gb|AEP83812.2| cysteine endopeptidase EP8 [Secale cereale x Triticum durum]
          Length = 364

 Score =  317 bits (812), Expect = 5e-84,   Method: Compositional matrix adjust.
 Identities = 154/289 (53%), Positives = 201/289 (69%), Gaps = 10/289 (3%)

Query: 62  RLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYN-RPVPSVSRQSSRP 120
           R N+FK+N  YI + NK+ +R ++L  N+F+D+T +EFR  Y G   R   S+S      
Sbjct: 62  RFNVFKENARYIHEGNKK-DRPFRLALNKFADMTTDEFRRTYAGSRVRHHLSLSGGRRGD 120

Query: 121 STFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSE 180
            +F+Y +  ++P ++DWR+KGAVT IKDQGQCGSCWAFS + AVEGI +I  GKL+ LSE
Sbjct: 121 GSFRYGDADNLPPAVDWRQKGAVTAIKDQGQCGSCWAFSTIVAVEGINKIRTGKLVSLSE 180

Query: 181 QQLVDC-STDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATIS 239
           Q+L+DC + +N GC GGLMD AF++I +N G+ TE++YPY+ E+G+CD  KEKA A TI 
Sbjct: 181 QELMDCDNVNNQGCDGGLMDYAFQFIHKN-GITTESNYPYQGEQGSCDLAKEKAHAVTID 239

Query: 240 KYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGT 299
            YED+P  DE AL +AV+ QPVSV +DASG  F FY  GV   +C  + DHGVA VG+GT
Sbjct: 240 GYEDVPANDESALQKAVAGQPVSVAIDASGNDFQFYSEGVFTGECSTDLDHGVAAVGYGT 299

Query: 300 AEEENGAKYWLIKNSWGETWGESGYIRILRDA----GLCGIATAASYPV 344
             +  G KYW++KNSWGE WGE GYIR+ R      G CGIA  ASYP 
Sbjct: 300 TRD--GTKYWIVKNSWGEDWGEKGYIRMQRGVSQAEGQCGIAMQASYPT 346


>gi|182375363|gb|ACB87490.1| mucunain [Mucuna pruriens]
          Length = 422

 Score =  317 bits (812), Expect = 5e-84,   Method: Compositional matrix adjust.
 Identities = 158/313 (50%), Positives = 204/313 (65%), Gaps = 17/313 (5%)

Query: 42  HEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRA 101
           +EQW+ +HG+ Y    EK  R +IFK NL +I+  N + NRTYKLG N F+DLTNEE+RA
Sbjct: 4   YEQWLVKHGKAYNALGEKDKRFDIFKDNLRFIDDHNAD-NRTYKLGLNRFADLTNEEYRA 62

Query: 102 LYTGY----NRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWA 157
            Y G     NR       QS+R   +  +   ++P S+DWR + AV  +KDQG CGSCWA
Sbjct: 63  RYLGTRIDPNRRFVKTKTQSNR---YAPRVGDNLPESVDWRNESAVLPVKDQGNCGSCWA 119

Query: 158 FSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGLATEAD 216
           FS + AVEGI +I  G LI LSEQ+LVDC T  N GC+GGLMD A+E+II N G+ +E D
Sbjct: 120 FSTIGAVEGINKIVTGDLISLSEQELVDCDTSYNQGCNGGLMDYAYEFIINNGGIDSEED 179

Query: 217 YPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYK 276
           YPYR  +GTCD  ++ A   TI  YED+P  DE AL +AV+NQPVSV ++  GR F  Y 
Sbjct: 180 YPYRAVDGTCDQYRKNAKVVTIDSYEDVPANDELALKKAVANQPVSVAIEGGGREFQLYV 239

Query: 277 SGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD-----A 331
           SGV    CG   DHGV  VG+G+ +   G  YW+++NSWG +WGE GY+R+ R+     +
Sbjct: 240 SGVFTGRCGTALDHGVVAVGYGSVK---GHDYWIVRNSWGASWGEEGYVRLERNLAKSRS 296

Query: 332 GLCGIATAASYPV 344
           G CGIA   SYP+
Sbjct: 297 GKCGIAIEPSYPI 309


>gi|224102377|ref|XP_002312656.1| predicted protein [Populus trichocarpa]
 gi|222852476|gb|EEE90023.1| predicted protein [Populus trichocarpa]
          Length = 358

 Score =  317 bits (812), Expect = 5e-84,   Method: Compositional matrix adjust.
 Identities = 154/345 (44%), Positives = 219/345 (63%), Gaps = 9/345 (2%)

Query: 7   KSFIIPMF-VIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNI 65
           +  I+ +F V+++  +  +          E  + + +E+W + H    +   EK  R N+
Sbjct: 4   RKVILAVFSVVLVFRLADSFDYTEEDLASEERLRDLYERWRSHH-TVSRSLAEKQERFNV 62

Query: 66  FKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKY 125
           FK+NL++I K N + +R YKL  N F+D+TN EF   Y G       V R   + +   +
Sbjct: 63  FKENLKHIHKVNHK-DRPYKLKLNSFADMTNHEFLQHYGGSKVSHYRVLRGQRQGTGSMH 121

Query: 126 QNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVD 185
           ++ + +P+S+DWR+ GAVT IKDQG+CGSCWAFS VAAVEGI +I  G+LI LSEQ+LVD
Sbjct: 122 EDTSKLPSSVDWRKNGAVTGIKDQGKCGSCWAFSTVAAVEGINKIKTGELISLSEQELVD 181

Query: 186 CSTDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLP 245
           C +DNHGC+GGLM+ AF +I +  GL +E  YPYR +E  CD+ K  +    I  YE +P
Sbjct: 182 CDSDNHGCNGGLMEDAFNFIKQIGGLTSENTYPYRAKEEPCDSNKMNSPVVNIDGYEMVP 241

Query: 246 KGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENG 305
           + DE AL++AV+NQPV++ +DA G+   FY   +   DCG   +HGVA+VG+GT ++  G
Sbjct: 242 ENDENALMKAVANQPVAIAMDAGGKDLQFYSEAIFTGDCGTELNHGVALVGYGTTQD--G 299

Query: 306 AKYWLIKNSWGETWGESGYIRILR----DAGLCGIATAASYPVAI 346
            KYW++KNSWG  WGE GYIR+ R    + GLCGI   ASYPV +
Sbjct: 300 TKYWIVKNSWGTDWGEKGYIRMQRGIDAEEGLCGITMEASYPVKL 344


>gi|1169186|sp|P43156.1|CYSP_HEMSP RecName: Full=Thiol protease SEN102; Flags: Precursor
 gi|396568|emb|CAA52425.1| thiol-protease [Hemerocallis hybrid cultivar]
          Length = 360

 Score =  317 bits (812), Expect = 6e-84,   Method: Compositional matrix adjust.
 Identities = 160/349 (45%), Positives = 220/349 (63%), Gaps = 23/349 (6%)

Query: 12  PMFVIIILVITCASQVVSGRSMHEPSIVEK------HEQWMAQHGRTYKDELEKAMRLNI 65
           P F+ + LV      +       E  +  +      +E+W   H    +D  EK  R N+
Sbjct: 4   PKFIALALVALSFLSIAQSIPFTEKDLASEDSLWNLYEKWRTHH-TVARDLDEKNRRFNV 62

Query: 66  FKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTG----YNRPVPSVSRQSSRPS 121
           FK+N+++I + N++ +  YKL  N+F D+TN+EFR+ Y G    ++R    + + +    
Sbjct: 63  FKENVKFIHEFNQKKDAPYKLALNKFGDMTNQEFRSKYAGSKIQHHRSQRGIQKNTG--- 119

Query: 122 TFKYQNVTDVPT-SIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSE 180
           +F Y+NV  +P  SIDWR KGAVT +KDQGQCGSCWAFS +A+VEGI QI  G+L+ LSE
Sbjct: 120 SFMYENVGSLPAASIDWRAKGAVTGVKDQGQCGSCWAFSTIASVEGINQIKTGELVSLSE 179

Query: 181 QQLVDCSTD-NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATIS 239
           Q+LVDC T  N GC+GGLMD AFE+I +N G+ TE  YPY  ++GTC +    +   +I 
Sbjct: 180 QELVDCDTSYNEGCNGGLMDYAFEFIQKN-GITTEDSYPYAEQDGTCASNLLNSPVVSID 238

Query: 240 KYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGT 299
            ++D+P  +E AL+QAV+NQP+SV ++ASG  F FY  GV    CG   DHGVA+VG+G 
Sbjct: 239 GHQDVPANNENALMQAVANQPISVSIEASGYGFQFYSEGVFTGRCGTELDHGVAIVGYGA 298

Query: 300 AEEENGAKYWLIKNSWGETWGESGYIRILR----DAGLCGIATAASYPV 344
             +  G KYW++KNSWGE WGESGYIR+ R      G CGIA  ASYP+
Sbjct: 299 TRD--GTKYWIVKNSWGEEWGESGYIRMQRGISDKRGKCGIAMEASYPI 345


>gi|89274062|dbj|BAE80740.1| cysteine proteinase [Platycodon grandiflorus]
          Length = 462

 Score =  317 bits (812), Expect = 6e-84,   Method: Compositional matrix adjust.
 Identities = 156/313 (49%), Positives = 204/313 (65%), Gaps = 12/313 (3%)

Query: 38  IVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNE 97
           ++  +E W+ +HG++Y    EK  R  IFK NL +I++ N E N +YK+G N F+DLTNE
Sbjct: 46  VMAMYESWLVKHGKSYNALGEKEKRFQIFKDNLRFIDEHNAEENLSYKVGLNRFADLTNE 105

Query: 98  EFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWA 157
           E+R+ Y G  +  P +S+  S    +  +    +P S+DWR KGAV  IKDQG CGSCWA
Sbjct: 106 EYRSTYLG-AKSKPKLSKVKS--DRYAPRVGDSLPESVDWRAKGAVAPIKDQGSCGSCWA 162

Query: 158 FSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGLATEAD 216
           FS V AVEGI QI  G+LI LSEQ+LVDC    N GC GGLMD  FE+II N G+ T+ D
Sbjct: 163 FSTVNAVEGINQIVTGELITLSEQELVDCDKSYNEGCDGGLMDYGFEFIINNGGIDTDKD 222

Query: 217 YPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYK 276
           YPY   +  CD  ++ A   TI  YED+P  +E+AL +AV++QPVSV ++  GRAF FY 
Sbjct: 223 YPYLGRDARCDQYRKNAKVVTIDSYEDVPVNNEEALKKAVASQPVSVGIEGGGRAFQFYD 282

Query: 277 SGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD-----A 331
           SG+    CG   DHGV VVG+GT   E G  YW+++NSWG +WGE+GYIR+ R+      
Sbjct: 283 SGIFTGKCGTALDHGVNVVGYGT---EKGKDYWIVRNSWGSSWGEAGYIRMERNLAGTSV 339

Query: 332 GLCGIATAASYPV 344
           G CGIA   SYP+
Sbjct: 340 GKCGIAMEPSYPL 352


>gi|18402225|ref|NP_566633.1| Granulin repeat cysteine protease family protein [Arabidopsis
           thaliana]
 gi|11994461|dbj|BAB02463.1| cysteine proteinase [Arabidopsis thaliana]
 gi|17065298|gb|AAL32803.1| cysteine proteinase [Arabidopsis thaliana]
 gi|20260004|gb|AAM13349.1| cysteine proteinase [Arabidopsis thaliana]
 gi|332642713|gb|AEE76234.1| Granulin repeat cysteine protease family protein [Arabidopsis
           thaliana]
          Length = 452

 Score =  317 bits (811), Expect = 7e-84,   Method: Compositional matrix adjust.
 Identities = 153/343 (44%), Positives = 219/343 (63%), Gaps = 12/343 (3%)

Query: 8   SFIIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFK 67
           +  + +F ++++ ++  S   +  + +E      +E+W+ ++ + Y    EK  R  IFK
Sbjct: 9   TLALLIFSVLLISLSLGSVTATETTRNEAEARRMYERWLVENRKNYNGLGEKERRFEIFK 68

Query: 68  QNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQN 127
            NL+++E+ +   NRTY++G   F+DLTN+EFRA+Y    R     +R   +   + Y+ 
Sbjct: 69  DNLKFVEEHSSIPNRTYEVGLTRFADLTNDEFRAIYL---RSKMERTRVPVKGEKYLYKV 125

Query: 128 VTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCS 187
              +P +IDWR KGAV  +KDQG CGSCWAFSA+ AVEGI QI  G+LI LSEQ+LVDC 
Sbjct: 126 GDSLPDAIDWRAKGAVNPVKDQGSCGSCWAFSAIGAVEGINQIKTGELISLSEQELVDCD 185

Query: 188 TD-NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEE-GTCDNQKEKAVAATISKYEDLP 245
           T  N GC GGLMD AF++IIEN G+ TE DYPY   +   C++ K+     TI  YED+P
Sbjct: 186 TSYNDGCGGGLMDYAFKFIIENGGIDTEEDYPYIATDVNVCNSDKKNTRVVTIDGYEDVP 245

Query: 246 KGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENG 305
           + DE++L +A++NQP+SV ++A GRAF  Y SGV    CG + DHGV  VG+G+   E G
Sbjct: 246 QNDEKSLKKALANQPISVAIEAGGRAFQLYTSGVFTGTCGTSLDHGVVAVGYGS---EGG 302

Query: 306 AKYWLIKNSWGETWGESGYIRILRD----AGLCGIATAASYPV 344
             YW+++NSWG  WGESGY ++ R+    +G CG+A  ASYP 
Sbjct: 303 QDYWIVRNSWGSNWGESGYFKLERNIKESSGKCGVAMMASYPT 345


>gi|194352752|emb|CAQ00104.1| papain-like cysteine proteinase [Hordeum vulgare subsp. vulgare]
          Length = 351

 Score =  317 bits (811), Expect = 8e-84,   Method: Compositional matrix adjust.
 Identities = 167/350 (47%), Positives = 228/350 (65%), Gaps = 22/350 (6%)

Query: 11  IPMFVIIILVITCASQ----VVSGRSMHEPS----IVEKHEQWMAQHGRTYKDELEKAMR 62
           + + V+++ V  C ++     + G S  + S    +VE  E+W+A+H + Y    EK  R
Sbjct: 5   LSVAVLLLCVGACVARNSDFSIVGYSEEDLSSHDRLVELFEKWLAKHQKAYASFEEKLHR 64

Query: 63  LNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPST 122
             +FK NL+ I++ N+E   +Y LG NEF+DLT++EF+  Y G    +     + S   +
Sbjct: 65  FEVFKDNLKLIDEINRE-VTSYWLGLNEFADLTHDEFKTTYLG----LSPPPARRSSSRS 119

Query: 123 FKYQNVT--DVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSE 180
           F+Y+NV   D+P ++DWR+KGAVT +K+QGQCGSCWAFS VAAVEGI  I  G L  LSE
Sbjct: 120 FRYENVAAHDLPKAVDWRKKGAVTDVKNQGQCGSCWAFSTVAAVEGINAIVTGNLTALSE 179

Query: 181 QQLVDCSTD-NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTC-DNQKEKAVAATI 238
           Q+L+DCS D N GC+GG+MD AF YI  + GL TE  YPY  EEG+C D +K ++ A +I
Sbjct: 180 QELIDCSVDGNSGCNGGMMDYAFSYIASSGGLHTEEAYPYLMEEGSCGDGKKSESEAVSI 239

Query: 239 SKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFG 298
           S YED+P  DEQAL++A+++QPVSV ++ASGR F FY  GV +  CG   DHGVA VG+G
Sbjct: 240 SGYEDVPTKDEQALIKALAHQPVSVAIEASGRHFQFYSGGVFDGPCGAQLDHGVAAVGYG 299

Query: 299 TAEEENGAKYWLIKNSWGETWGESGYIRILRDA----GLCGIATAASYPV 344
           + ++  G  Y ++KNSWG  WGE GYIR+ R      GLCGI   ASYP 
Sbjct: 300 S-DKGKGHDYIIVKNSWGGKWGEKGYIRMKRGTGKSEGLCGINKMASYPT 348


>gi|30141021|dbj|BAC75924.1| cysteine protease-2 [Helianthus annuus]
          Length = 362

 Score =  316 bits (810), Expect = 8e-84,   Method: Compositional matrix adjust.
 Identities = 158/317 (49%), Positives = 209/317 (65%), Gaps = 13/317 (4%)

Query: 35  EPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDL 94
           E ++ + +E+W  +H +   +  EK  R N+FK N+ ++ + NK  ++ YKL  N+F+D+
Sbjct: 33  EDNLWDMYERW--RH-KVATNHGEKLRRFNVFKSNVLHVHETNKM-DKPYKLKLNKFADM 88

Query: 95  TNEEFRALYTGYNRPVPSVSRQSSRP--STFKYQNVTDVPTSIDWREKGAVTHIKDQGQC 152
           TN EFR++Y G        S Q  R    TF Y NV  VPTS+DWR+KGAV  +KDQGQC
Sbjct: 89  TNHEFRSVYAGSKIHHHDRSLQGDRSGSKTFMYANVESVPTSVDWRKKGAVAPVKDQGQC 148

Query: 153 GSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST-DNHGCSGGLMDKAFEYIIENKGL 211
           GSCWAFS VAAVEGI +I   +L+ LSEQ+LVDC T +N GC+GGLMD AF++I +  GL
Sbjct: 149 GSCWAFSTVAAVEGINKIKTNELVSLSEQELVDCDTLENQGCNGGLMDLAFDFIKKTGGL 208

Query: 212 ATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRA 271
             E  YPY  E+G CD+ K  +   +I  +ED+PK DEQ+L++AV+NQPV+V +DA    
Sbjct: 209 TREDAYPYAAEDGKCDSNKMNSPVVSIDGHEDVPKNDEQSLMKAVANQPVAVAIDAGSSD 268

Query: 272 FHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILR-- 329
           F FY  GV    CG   DHGVA VG+GT  +  G KYW+++NSWG  WGE GYIR+ R  
Sbjct: 269 FQFYSEGVFTGKCGTQLDHGVAAVGYGTTLD--GTKYWIVRNSWGSEWGEKGYIRMERGI 326

Query: 330 --DAGLCGIATAASYPV 344
               GLCGIA  ASYP+
Sbjct: 327 SDKRGLCGIAMEASYPI 343


>gi|1173630|gb|AAB37233.1| cysteine proteinase [Phalaenopsis sp. SM9108]
          Length = 359

 Score =  316 bits (810), Expect = 9e-84,   Method: Compositional matrix adjust.
 Identities = 163/350 (46%), Positives = 227/350 (64%), Gaps = 21/350 (6%)

Query: 11  IPMFVIIILVITCASQVVSGRSMH------EPSIVEKHEQWMAQHGRTYKDELEKAMRLN 64
           + +F +I++    AS   +   +       E S+   +E+W + H  + +D  EK  R N
Sbjct: 1   MKLFSLILVASFLASVAATAIDIADKDLETEDSLWNLYERWRSHHTVS-RDLDEKQKRFN 59

Query: 65  IFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTG----YNRPVPSVSRQSSRP 120
           +FK+N  YI   NK  +  YKL  N+F+DLTN EFR+ Y G    ++R +   SR+    
Sbjct: 60  VFKENPRYIHDFNKRKDIPYKLRLNKFADLTNHEFRSTYAGSRINHHRSLRG-SRRGGAT 118

Query: 121 STFKYQNVTD--VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIEL 178
           ++F YQ++    +P SIDWR+KGAVT +KDQGQCGSCWAFS VAAVEGI QI   KL+ L
Sbjct: 119 NSFMYQSLDSRSLPASIDWRQKGAVTAVKDQGQCGSCWAFSTVAAVEGINQIKTKKLLSL 178

Query: 179 SEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAAT 237
           SEQ+L+DC TD N+GC+GGLMD AF++I +N G+++EA+YPY  E+  C  +K+  V  +
Sbjct: 179 SEQELIDCDTDENNGCNGGLMDYAFDFIKKNGGISSEAEYPYAAEDSYCATEKKSHV-VS 237

Query: 238 ISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGF 297
           I  +ED+P  DE +LL+AV+NQPVS+ ++ASG  F FY  GV     G   DHGVA+VG+
Sbjct: 238 IDGHEDVPANDEDSLLKAVANQPVSIAIEASGYDFQFYSEGVFTGRSGTELDHGVAIVGY 297

Query: 298 GTAEEENGAKYWLIKNSWGETWGESGYIRILRDAG---LCGIATAASYPV 344
           G  ++  G KYW+++NSWG  WGE GYIRI   +    LCG+A  ASYP+
Sbjct: 298 GKTQQ--GTKYWIVRNSWGAEWGEKGYIRISAASDSKRLCGLAMEASYPI 345


>gi|296081395|emb|CBI16828.3| unnamed protein product [Vitis vinifera]
          Length = 359

 Score =  316 bits (809), Expect = 1e-83,   Method: Compositional matrix adjust.
 Identities = 157/348 (45%), Positives = 223/348 (64%), Gaps = 10/348 (2%)

Query: 3   LKFEKSFIIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMR 62
           +K EK  ++ + ++++  +  +          E S+ + +E+W + H    +D  EK  R
Sbjct: 1   MKMEKVILVALSLVLVFGLAESFDFDEKDLASEESLWDLYERWRSYH-TVSRDLEEKNKR 59

Query: 63  LNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPST 122
            N+FK+N +++ K N+  ++ YKL  N+F+D+TN EFR+ Y G       + R   R + 
Sbjct: 60  FNVFKENTKHVHKVNQM-DKPYKLKLNKFADMTNHEFRSSYGGSKVKHYRMLRGDRRGTG 118

Query: 123 -FKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQ 181
            F ++  T +P S+DWR+KGAVT IKDQG+CGSCWAFS V  VEGI QI   +L+ LSEQ
Sbjct: 119 GFMHEKTTYLPPSVDWRKKGAVTGIKDQGKCGSCWAFSTVVGVEGINQIKTKELLSLSEQ 178

Query: 182 QLVDCS-TDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISK 240
           QL+DC  +D+HGC+GGLM+ AFE+I +N G+ TE +YPY+ ++  CD  K  A   TI  
Sbjct: 179 QLIDCDRSDDHGCNGGLMESAFEFIKKNGGITTENNYPYKAKDERCDMLKMNAPVVTIDG 238

Query: 241 YEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTA 300
           +E +P  DE+AL++AV++QPVSV +DA G    FY  GV + +CG   DHGVA+VG+GT 
Sbjct: 239 HESVPVNDERALMKAVAHQPVSVAIDAGGSDLQFYSEGVFDGECGTELDHGVAIVGYGTT 298

Query: 301 EEENGAKYWLIKNSWGETWGESGYIRILRD----AGLCGIATAASYPV 344
            +  G KYW++KNSWG  WGE GYIR+ R      G CGIA  ASYPV
Sbjct: 299 LD--GTKYWIVKNSWGAEWGEKGYIRMARGIQAAEGQCGIAMEASYPV 344


>gi|148907299|gb|ABR16787.1| unknown [Picea sitchensis]
          Length = 372

 Score =  316 bits (809), Expect = 1e-83,   Method: Compositional matrix adjust.
 Identities = 156/312 (50%), Positives = 207/312 (66%), Gaps = 12/312 (3%)

Query: 42  HEQWMAQHGRTYK-DELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFR 100
           +++W  QH  T   D  E A R  IFK+N+++I+  NK+ +  YKLG N+F+DL+NEEF+
Sbjct: 45  YDKWALQHRSTRSLDSDEHARRFEIFKENVKHIDSVNKK-DGPYKLGLNKFADLSNEEFK 103

Query: 101 ALY--TGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAF 158
           A++  T   +       +     +F YQN   +P SIDWR+KGAVT +K+QGQCGSCWAF
Sbjct: 104 AMHMTTKMEKHKSLRGDRGVESGSFMYQNSKRLPASIDWRKKGAVTPVKNQGQCGSCWAF 163

Query: 159 SAVAAVEGITQITRGKLIELSEQQLVDCSTDNHGCSGGLMDKAFEYIIENKGLATEADYP 218
           S +A+VEGI  I  GKL+ LSEQQLVDCS +N GC+GGLMD AF+YII+N G+ TE +YP
Sbjct: 164 STIASVEGINYIKTGKLVSLSEQQLVDCSKENAGCNGGLMDNAFQYIIDNGGIVTEDEYP 223

Query: 219 YRHEEGTCDNQK--EKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYK 276
           Y  E G C   K   K++A  I  +ED+P  +E AL +AV++QPVS+ ++ASG  F FY 
Sbjct: 224 YTAEAGECSTTKIESKSIATIIDGFEDVPANNEGALKKAVAHQPVSIAIEASGHDFQFYS 283

Query: 277 SGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILR----DAG 332
           +GV    CG   DHGV VVG+G + E  G  YW+++NSWG  WGE GYIR+ R      G
Sbjct: 284 TGVFTGKCGTELDHGVVVVGYGKSPE--GINYWIVRNSWGPEWGEQGYIRMQRGIEATEG 341

Query: 333 LCGIATAASYPV 344
            CGI+  ASYP 
Sbjct: 342 KCGISMQASYPT 353


>gi|18141283|gb|AAL60579.1|AF454957_1 senescence-associated cysteine protease [Brassica oleracea]
          Length = 460

 Score =  315 bits (808), Expect = 2e-83,   Method: Compositional matrix adjust.
 Identities = 157/321 (48%), Positives = 213/321 (66%), Gaps = 21/321 (6%)

Query: 35  EPSIVEKHEQWMAQHGRTYKDE----LEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNE 90
           +  +   +E WM +HG+  +       EK  R  IFK NL +I++ N + N +YKLG   
Sbjct: 42  DAEVARIYEAWMEKHGKKAQSNGLVGEEKDQRFEIFKDNLRFIDEHNNK-NLSYKLGLTR 100

Query: 91  FSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQ-NVTD-VPTSIDWREKGAVTHIKD 148
           F+DLTNEE+R++Y G      + S++    ++ +YQ  V D +P S+DWR++GAV  +KD
Sbjct: 101 FADLTNEEYRSIYLG------AKSKKRVLKTSDRYQPRVGDAIPDSVDWRKEGAVAAVKD 154

Query: 149 QGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIE 207
           QG CGSCWAFS + AVEGI +I  G LI LSEQ+LVDC T  N GC+GGLMD AFE+II+
Sbjct: 155 QGSCGSCWAFSTIGAVEGINKIVTGDLISLSEQELVDCDTSYNQGCNGGLMDYAFEFIIK 214

Query: 208 NKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDA 267
           N G+ TE DYPY+  +G CD  ++ A   TI  YED+P+ +E AL + ++NQP+SV ++A
Sbjct: 215 NGGIDTEEDYPYKAADGRCDQTRKNAKVVTIDAYEDVPENNEAALKKTLANQPISVAIEA 274

Query: 268 SGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRI 327
            GRAF  Y SGV +  CG   DHGV  VG+GT   ENG  YW+++NSWG +WGESGYI++
Sbjct: 275 GGRAFQLYSSGVFDGICGTELDHGVVAVGYGT---ENGKDYWIVRNSWGGSWGESGYIKM 331

Query: 328 LRD----AGLCGIATAASYPV 344
            R+     G CGIA  ASYP+
Sbjct: 332 ARNIAEPTGKCGIAMEASYPI 352


>gi|297602242|ref|NP_001052232.2| Os04g0203500 [Oryza sativa Japonica Group]
 gi|255675217|dbj|BAF14146.2| Os04g0203500 [Oryza sativa Japonica Group]
          Length = 336

 Score =  315 bits (808), Expect = 2e-83,   Method: Compositional matrix adjust.
 Identities = 158/342 (46%), Positives = 224/342 (65%), Gaps = 22/342 (6%)

Query: 13  MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEY 72
           +F I+  +  C++ + +     + ++  +HE+WMAQ+GR YKD+ EKA R  +FK N+ +
Sbjct: 8   LFAILGCLCLCSAVLAARELSDDAAMAARHERWMAQYGRMYKDDAEKARRFEVFKANVAF 67

Query: 73  IEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT--D 130
           IE  N  GN  + LG N+F+DLTN+EFR+  T     +PS +R    P+ F+ +NV    
Sbjct: 68  IESFN-AGNHKFWLGVNQFADLTNDEFRSTKTNKGF-IPSTTRV---PTGFRNENVNIDA 122

Query: 131 VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTDN 190
           +P ++DWR KG VT IKDQGQCG CWAFSAVAA+EGI +++ GKLI  S  + +  +  +
Sbjct: 123 LPATMDWRTKGVVTPIKDQGQCGCCWAFSAVAAMEGIVKLSTGKLISHSLNKSL-LTVMS 181

Query: 191 HGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVA---ATISKYEDLPKG 247
            GC GGLMD AF++II+N GL TE++YPY   +      K K+V+   A+I  YED+P  
Sbjct: 182 MGCEGGLMDDAFKFIIKNGGLTTESNYPYAAVD-----DKFKSVSNSVASIKGYEDVPAN 236

Query: 248 DEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAK 307
           +E AL++AV+NQPVSV VD     F FYK GV+   CG + DHG+  +G+G A +  G K
Sbjct: 237 NEAALMKAVANQPVSVAVDGGDMTFQFYKGGVMTGSCGTDLDHGIVAIGYGKASD--GTK 294

Query: 308 YWLIKNSWGETWGESGYIRILRDA----GLCGIATAASYPVA 345
           YWL+KNSWG TWGE+G++R+ +D     G+CG+A   SYP A
Sbjct: 295 YWLLKNSWGMTWGENGFLRMEKDISDKRGMCGLAMEPSYPTA 336


>gi|115461667|ref|NP_001054433.1| Os05g0108600 [Oryza sativa Japonica Group]
 gi|14719319|gb|AAK73137.1|AC079022_10 putative cysteine proteinase [Oryza sativa]
 gi|33151125|gb|AAP97431.1| cysteine protease CP1 [Oryza sativa]
 gi|52353572|gb|AAU44138.1| cysteine proteinase CP1 [Oryza sativa Japonica Group]
 gi|113577984|dbj|BAF16347.1| Os05g0108600 [Oryza sativa Japonica Group]
 gi|125550541|gb|EAY96250.1| hypothetical protein OsI_18148 [Oryza sativa Indica Group]
          Length = 358

 Score =  315 bits (808), Expect = 2e-83,   Method: Compositional matrix adjust.
 Identities = 161/314 (51%), Positives = 206/314 (65%), Gaps = 12/314 (3%)

Query: 38  IVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNE 97
           ++E  E+W+A++ + Y    EK  R  +FK NL +I+  NK+   +Y LG NEF+DLT++
Sbjct: 47  LIELFEKWVAKYRKAYASFEEKVRRFEVFKDNLNHIDDINKK-VTSYWLGLNEFADLTHD 105

Query: 98  EFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTD--VPTSIDWREKGAVTHIKDQGQCGSC 155
           EF+A Y G   P    + +      F+Y  +++  VP  +DWR+K AVT +K+QGQCGSC
Sbjct: 106 EFKATYLGLTPPPTRSNSKHYSSEEFRYGKMSNGEVPKEMDWRKKNAVTEVKNQGQCGSC 165

Query: 156 WAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGLATE 214
           WAFS VAAVEGI  I  G L  LSEQ+L+DCSTD N+GC+GGLMD AF YI    GL TE
Sbjct: 166 WAFSTVAAVEGINAIVTGNLTSLSEQELIDCSTDGNNGCNGGLMDYAFSYIASTGGLRTE 225

Query: 215 ADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHF 274
             YPY  EEG CD  K  AV  TIS YED+P  DEQAL++A+++QPVSV ++ASGR F F
Sbjct: 226 EAYPYAMEEGDCDEGKGAAV-VTISGYEDVPANDEQALVKALAHQPVSVAIEASGRHFQF 284

Query: 275 YKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRDA--- 331
           Y  GV +  CG   DHGV  VG+GT++   G  Y ++KNSWG  WGE GYIR+ R     
Sbjct: 285 YSGGVFDGPCGEQLDHGVTAVGYGTSK---GQDYIIVKNSWGPHWGEKGYIRMKRGTGKG 341

Query: 332 -GLCGIATAASYPV 344
            GLCGI   ASYP 
Sbjct: 342 EGLCGINKMASYPT 355


>gi|57282619|emb|CAE54307.1| cysteine proteinase [Gossypium hirsutum]
          Length = 372

 Score =  315 bits (807), Expect = 2e-83,   Method: Compositional matrix adjust.
 Identities = 153/309 (49%), Positives = 210/309 (67%), Gaps = 9/309 (2%)

Query: 42  HEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRA 101
           ++ W+ QHG+ Y    E+  R  IFK NL +I++ N   N TYKLG N+F+DLTN+E+RA
Sbjct: 46  YKSWVIQHGKAYNGIGEEEKRFEIFKDNLRFIDEHNSNNNTTYKLGLNKFADLTNQEYRA 105

Query: 102 LYTGYNRPVPSVSRQSSRPST-FKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSA 160
            + G          +S  PS+ + ++   ++P S++WR+ GAV+ +KDQG CGSCWAFSA
Sbjct: 106 KFLGTRTDPRRRLMKSKIPSSRYAHRAGDNLPDSVNWRDHGAVSRVKDQGSCGSCWAFSA 165

Query: 161 VAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGLATEADYPY 219
           +AAVEGI +I  G+LI LSEQ+LVDC    + GC+GGLMD AF++II+N G+ TE DYPY
Sbjct: 166 IAAVEGINKIVSGELISLSEQELVDCDRSYDAGCNGGLMDYAFQFIIDNGGIDTEKDYPY 225

Query: 220 RHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGV 279
                 CD  K+ A   +I  YED+P  +E AL +AV++QPVS+ ++A GRAF  Y+SGV
Sbjct: 226 LGFNNQCDPTKKNAKVVSIDGYEDVPN-NENALKKAVAHQPVSIAIEAGGRAFQLYESGV 284

Query: 280 LNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILR----DAGLCG 335
            N +CG   DHGV  VG+G+  ++NG  YW+++NSWG  WGE+GYIR+ R    + G CG
Sbjct: 285 FNGECGLALDHGVVAVGYGS--DDNGQDYWIVRNSWGGNWGENGYIRMERNINANTGKCG 342

Query: 336 IATAASYPV 344
           IA  ASYPV
Sbjct: 343 IAMEASYPV 351


>gi|595986|gb|AAA79915.1| cysteine proteinase, partial [Dianthus caryophyllus]
          Length = 427

 Score =  315 bits (807), Expect = 2e-83,   Method: Compositional matrix adjust.
 Identities = 153/311 (49%), Positives = 209/311 (67%), Gaps = 14/311 (4%)

Query: 43  EQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEK----ANKEGNRTYKLGTNEFSDLTNEE 98
           + W+ +H + Y    EK  R  IF+ NLE+I++     N  G   ++LG N+F+DLTN+E
Sbjct: 6   QSWLVKHRKNYNALGEKEKRFAIFRDNLEFIDQHNNNNNGGGGGEFELGLNKFADLTNDE 65

Query: 99  FRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAF 158
           FR +Y G  RP  + S +S R   +  +   ++P S+DWR+KGAV+H+KDQGQCGSCWAF
Sbjct: 66  FRRIYFGVKRPEKAESVKSDR---YAVKEGDELPESVDWRKKGAVSHVKDQGQCGSCWAF 122

Query: 159 SAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGLATEADY 217
           SA+ AVEGI +I  G LI LSEQ+LVDC T  N GC GGLMD AF +II N G+ T+ DY
Sbjct: 123 SAIGAVEGINKIVTGDLITLSEQELVDCDTSYNSGCDGGLMDYAFRFIINNGGIDTDKDY 182

Query: 218 PYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKS 277
           PY+  +G+CD+ ++ A   TI   ED+P  +E+AL +AV++QPV + ++A GR F  YKS
Sbjct: 183 PYKATDGSCDSNRKNAKVVTIDGLEDVPANNEKALQKAVAHQPVRLAIEAGGRDFQLYKS 242

Query: 278 GVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD----AGL 333
           GV    CG + DHGV  VG+GT ++  G  YW+++NSWG+ WGE GYIR+ R+    +G 
Sbjct: 243 GVFTGSCGTSLDHGVVAVGYGTTDD--GKDYWIVRNSWGDDWGEDGYIRMERNTESKSGK 300

Query: 334 CGIATAASYPV 344
           CGIA   SYPV
Sbjct: 301 CGIAIEPSYPV 311


>gi|357437715|ref|XP_003589133.1| Cysteine proteinase [Medicago truncatula]
 gi|87240770|gb|ABD32628.1| Granulin; Peptidase C1A, papain [Medicago truncatula]
 gi|355478181|gb|AES59384.1| Cysteine proteinase [Medicago truncatula]
          Length = 474

 Score =  315 bits (806), Expect = 2e-83,   Method: Compositional matrix adjust.
 Identities = 165/355 (46%), Positives = 222/355 (62%), Gaps = 17/355 (4%)

Query: 1   MVLKFEKSFIIPMFV---IIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDEL 57
           M++    SF + + +   II    T   +  S R+  E  ++  +E+W+ +HG++Y    
Sbjct: 13  MIVLIISSFTVSLALDMSIISYDKTHPDKSTSKRTNKE--VLTMYEEWLVKHGKSYNGLG 70

Query: 58  EKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYN-RPVPSVSRQ 116
           EK  R  IFK NL++I++ N   N TY+LG   F+DLTNEE+R+ + G    P   + + 
Sbjct: 71  EKDKRFEIFKDNLKFIDEHNGL-NSTYRLGLTRFADLTNEEYRSKFLGTKIDPNRRMKKL 129

Query: 117 SSRPSTFKYQNVTD-VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKL 175
               S      V D +P S+DWR++GAV  +KDQ  CGSCWAFSA+AAVEGI +I  G L
Sbjct: 130 GGSKSNRYAPRVGDKLPESVDWRKEGAVVGVKDQASCGSCWAFSAIAAVEGINKIVTGDL 189

Query: 176 IELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAV 234
           I LSEQ+LVDC T  N GC+GGLMD AFE+II N G+ +E DYPY+  +G CD  ++ A 
Sbjct: 190 ISLSEQELVDCDTSYNEGCNGGLMDYAFEFIISNGGIDSEDDYPYKAVDGRCDQNRKNAK 249

Query: 235 AATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAV 294
             TI  YED+P  DE AL +AV+NQP++V V+  GR F  Y+ GV    CG   DHGVA 
Sbjct: 250 VVTIDDYEDVPAYDELALQKAVANQPIAVAVEGGGREFQLYEYGVFTGRCGTALDHGVAA 309

Query: 295 VGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD-----AGLCGIATAASYPV 344
           VG+GT   ENG  YW+++NSWG +WGE GYIR+ R+     AG CGIA   SYP+
Sbjct: 310 VGYGT---ENGKDYWIVRNSWGGSWGEQGYIRLERNLASSRAGKCGIAIEPSYPI 361


>gi|30141025|dbj|BAC75926.1| cysteine protease-4 [Helianthus annuus]
          Length = 352

 Score =  315 bits (806), Expect = 3e-83,   Method: Compositional matrix adjust.
 Identities = 156/345 (45%), Positives = 220/345 (63%), Gaps = 20/345 (5%)

Query: 13  MFVIIILVITCASQV----VSGRSMHEPSIVEK----HEQWMAQHGRTYKDELEKAMRLN 64
           +F++ + V+ C++      + G +  + + + K     E W+A+H + Y+   EK  R  
Sbjct: 12  LFLVFVSVLACSALANEFSILGYAPEDLTSIHKVIHLFESWLAKHSKIYESLDEKLHRFE 71

Query: 65  IFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFK 124
           IF  NL++I+  NK+ +  Y LG NEF+DLT+EEF+  + G    +P   R+      F 
Sbjct: 72  IFMDNLKHIDDTNKKVS-NYWLGLNEFADLTHEEFKNKFLGLKGELPE--RKDESIEEFS 128

Query: 125 YQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLV 184
           Y++  D+P S+DWR+KGAV  +K+QGQCGSCWAFS VAAVEGI QI  G L  LSEQ+L+
Sbjct: 129 YRDFVDLPKSVDWRKKGAVAPVKNQGQCGSCWAFSTVAAVEGINQIVTGNLTMLSEQELI 188

Query: 185 DCSTD-NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYED 243
           DC T  N+GC+GGLMD AF Y++ + GL  E +YPY   EGTCD +K+ +   TIS Y D
Sbjct: 189 DCDTTFNNGCNGGLMDYAFAYVMRS-GLHKEEEYPYIMSEGTCDEKKDVSETVTISGYHD 247

Query: 244 LPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEE 303
           +P+ +E + L+A++NQP+SV ++ASGR F FY  GV +  CG   DHGVA VG+GT +  
Sbjct: 248 VPRNNEDSFLKALANQPISVAIEASGRDFQFYSGGVFDGHCGTELDHGVAAVGYGTTK-- 305

Query: 304 NGAKYWLIKNSWGETWGESGYIRILRDA----GLCGIATAASYPV 344
            G  Y +++NSWG  WGE GYIR+ R      G+CG+   ASYP 
Sbjct: 306 -GLDYVIVRNSWGPKWGEKGYIRMKRKTGKPHGMCGLYMMASYPT 349


>gi|255635584|gb|ACU18142.1| unknown [Glycine max]
          Length = 345

 Score =  315 bits (806), Expect = 3e-83,   Method: Compositional matrix adjust.
 Identities = 157/311 (50%), Positives = 211/311 (67%), Gaps = 12/311 (3%)

Query: 38  IVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNE 97
           ++E  E WM++HG+ Y+   EK +R  IFK NL++I++ NK  +  Y LG NEF+DL+++
Sbjct: 43  LIELFESWMSKHGKIYQSIEEKLLRFEIFKDNLKHIDERNKVVS-NYWLGLNEFADLSHQ 101

Query: 98  EFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWA 157
           EF+  Y G        SR+   P  F Y++V ++P S+DWR+KGAV  +K+QG CGSCWA
Sbjct: 102 EFKNKYLGLK---VDYSRRRESPEEFTYKDV-ELPKSVDWRKKGAVAPVKNQGSCGSCWA 157

Query: 158 FSAVAAVEGITQITRGKLIELSEQQLVDCS-TDNHGCSGGLMDKAFEYIIENKGLATEAD 216
           FS VAAVEGI QI  G L  LSEQ+L+DC  T ++GC+GGLMD AF +I+EN GL  E D
Sbjct: 158 FSTVAAVEGINQIVTGNLTSLSEQELIDCDRTYSNGCNGGLMDYAFSFIVENGGLHKEED 217

Query: 217 YPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYK 276
           YPY  EEGTC+  KE+    TIS Y D+P+ +EQ+LL+A++NQ +SV ++ASGR F FY 
Sbjct: 218 YPYIMEEGTCEMTKEETEVVTISGYHDVPQNNEQSLLKALANQSLSVAIEASGRDFQFYS 277

Query: 277 SGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRI---LRDAGL 333
            GV +  CG++ DHGVA VG+GTA+   G  Y ++KNSWG  WGE GYIR+   L   G 
Sbjct: 278 GGVFDGHCGSDLDHGVAAVGYGTAK---GVDYIIVKNSWGSKWGEKGYIRMRGTLETRGN 334

Query: 334 CGIATAASYPV 344
                 ASYP+
Sbjct: 335 LRYLQMASYPL 345


>gi|351726339|ref|NP_001237379.1| cysteine proteinase precursor [Glycine max]
 gi|31559526|dbj|BAC77521.1| cysteine proteinase [Glycine max]
 gi|31559528|dbj|BAC77522.1| cysteine proteinase [Glycine max]
          Length = 362

 Score =  314 bits (805), Expect = 3e-83,   Method: Compositional matrix adjust.
 Identities = 158/321 (49%), Positives = 207/321 (64%), Gaps = 20/321 (6%)

Query: 35  EPSIVEKHEQWMAQH--GRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFS 92
           E S  + +E+W + H   R+  D   K  R N+FK N+ ++   NK  ++ YKL  N+F+
Sbjct: 33  EESFWDLYERWRSHHTVSRSLGD---KHKRFNVFKANVMHVHNTNKM-DKPYKLKLNKFA 88

Query: 93  DLTNEEFRALYTG----YNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKD 148
           D+TN EFR+ Y G    ++R      R +    TF Y+ V  VP S+DWR+ GAVT +KD
Sbjct: 89  DMTNHEFRSTYAGSKVNHHRMFQGTPRGNG---TFMYEKVGSVPPSVDWRKNGAVTGVKD 145

Query: 149 QGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIE 207
           QGQCGSCWAFS V AVEGI QI   KL+ LSEQ+LVDC T  N GC+GGLM+ AFE+I +
Sbjct: 146 QGQCGSCWAFSTVVAVEGINQIKTNKLVSLSEQELVDCDTKKNAGCNGGLMESAFEFIKQ 205

Query: 208 NKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDA 267
             G+ TE++YPY  ++GTCD  K   +A +I  +E++P  DE ALL+AV+NQPVSV +DA
Sbjct: 206 KGGITTESNYPYTAQDGTCDASKANDLAVSIDGHENVPANDENALLKAVANQPVSVAIDA 265

Query: 268 SGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRI 327
            G  F FY  GV   DC    +HGVA+VG+GT  +  G  YW ++NSWG  WGE GYIR+
Sbjct: 266 GGSDFQFYSEGVFTGDCSTELNHGVAIVGYGTTVD--GTNYWTVRNSWGPEWGEQGYIRM 323

Query: 328 LRD----AGLCGIATAASYPV 344
            R      GLCGIA  ASYP+
Sbjct: 324 QRSISKKEGLCGIAMMASYPI 344


>gi|357437719|ref|XP_003589135.1| Cysteine proteinase [Medicago truncatula]
 gi|355478183|gb|AES59386.1| Cysteine proteinase [Medicago truncatula]
          Length = 457

 Score =  314 bits (805), Expect = 3e-83,   Method: Compositional matrix adjust.
 Identities = 165/355 (46%), Positives = 222/355 (62%), Gaps = 17/355 (4%)

Query: 1   MVLKFEKSFIIPMFV---IIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDEL 57
           M++    SF + + +   II    T   +  S R+  E  ++  +E+W+ +HG++Y    
Sbjct: 13  MIVLIISSFTVSLALDMSIISYDKTHPDKSTSKRTNKE--VLTMYEEWLVKHGKSYNGLG 70

Query: 58  EKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYN-RPVPSVSRQ 116
           EK  R  IFK NL++I++ N   N TY+LG   F+DLTNEE+R+ + G    P   + + 
Sbjct: 71  EKDKRFEIFKDNLKFIDEHNGL-NSTYRLGLTRFADLTNEEYRSKFLGTKIDPNRRMKKL 129

Query: 117 SSRPSTFKYQNVTD-VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKL 175
               S      V D +P S+DWR++GAV  +KDQ  CGSCWAFSA+AAVEGI +I  G L
Sbjct: 130 GGSKSNRYAPRVGDKLPESVDWRKEGAVVGVKDQASCGSCWAFSAIAAVEGINKIVTGDL 189

Query: 176 IELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAV 234
           I LSEQ+LVDC T  N GC+GGLMD AFE+II N G+ +E DYPY+  +G CD  ++ A 
Sbjct: 190 ISLSEQELVDCDTSYNEGCNGGLMDYAFEFIISNGGIDSEDDYPYKAVDGRCDQNRKNAK 249

Query: 235 AATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAV 294
             TI  YED+P  DE AL +AV+NQP++V V+  GR F  Y+ GV    CG   DHGVA 
Sbjct: 250 VVTIDDYEDVPAYDELALQKAVANQPIAVAVEGGGREFQLYEYGVFTGRCGTALDHGVAA 309

Query: 295 VGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD-----AGLCGIATAASYPV 344
           VG+GT   ENG  YW+++NSWG +WGE GYIR+ R+     AG CGIA   SYP+
Sbjct: 310 VGYGT---ENGKDYWIVRNSWGGSWGEQGYIRLERNLASSRAGKCGIAIEPSYPI 361


>gi|356515080|ref|XP_003526229.1| PREDICTED: vignain-like [Glycine max]
          Length = 284

 Score =  314 bits (805), Expect = 3e-83,   Method: Compositional matrix adjust.
 Identities = 155/288 (53%), Positives = 198/288 (68%), Gaps = 17/288 (5%)

Query: 67  KQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEF---RALYTGYNRPVPSVSRQSSRPSTF 123
           K+N+ YIE  N   N+ YKLG N+F+DLT+EEF   R  + G+ R        ++R +TF
Sbjct: 5   KENVNYIEAFNNAANKPYKLGINQFADLTSEEFIVPRNRFNGHMR------FSNTRTTTF 58

Query: 124 KYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQL 183
           KY+NVT +P SIDWR+KGAVT IK+QG CG CWAFSA+AA EGI +I+ GKL+ LSEQ++
Sbjct: 59  KYENVTVLPDSIDWRQKGAVTPIKNQGSCGCCWAFSAIAATEGIHKISTGKLVSLSEQEV 118

Query: 184 VDCSTD--NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKY 241
           VDC T   +HGC GG MD AF++II+N G+ TEA YPY+  +G C+ ++E   A TI+ Y
Sbjct: 119 VDCDTKGTDHGCEGGYMDGAFKFIIQNHGINTEASYPYKGVDGKCNIKEEAVHATTITGY 178

Query: 242 EDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAE 301
           ED+P  +E+AL +AV+NQPVSV +DA G  F FYKSG+    CG   DHGV  VG+G  E
Sbjct: 179 EDVPINNEKALQKAVANQPVSVAIDARGADFQFYKSGIFTGSCGTELDHGVTAVGYG--E 236

Query: 302 EENGAKYWLIKNSWGETWGESGYIRILRDA----GLCGIATAASYPVA 345
              G KYWL+KNSWG  WGE GY  + R      G+CGIA  ASYP A
Sbjct: 237 NNEGTKYWLVKNSWGTEWGEEGYTMMQRGVKAVEGICGIAMLASYPTA 284


>gi|2224808|emb|CAB09697.1| cysteine endopeptidase EP-A [Hordeum vulgare subsp. vulgare]
 gi|326502180|dbj|BAK06781.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 365

 Score =  314 bits (804), Expect = 4e-83,   Method: Compositional matrix adjust.
 Identities = 158/320 (49%), Positives = 213/320 (66%), Gaps = 16/320 (5%)

Query: 35  EPSIVEKHEQWMAQHGRTYK----DELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNE 90
           E S+   +E+W + +  + +    D  E+  R N+FK+N  Y+ + NK  +R ++L  N+
Sbjct: 34  EESLRGLYERWRSHYTVSRRGLGADAEER--RFNVFKENARYVHEGNKR-DRPFRLALNK 90

Query: 91  FSDLTNEEFRALYTGYN-RPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQ 149
           F+D+T +EFR  Y G   R   S+S        F+Y +  ++P ++DWR+KGAVT IKDQ
Sbjct: 91  FADMTTDEFRRTYAGSRVRHHLSLSGGRRGDGGFRYADADNLPPAVDWRQKGAVTAIKDQ 150

Query: 150 GQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDC-STDNHGCSGGLMDKAFEYIIEN 208
           GQCGSCWAFS + AVEGI +I  GKL+ LSEQ+L+DC + +N GC GGLMD AF++I +N
Sbjct: 151 GQCGSCWAFSTIVAVEGINKIRTGKLVSLSEQELMDCDNVNNQGCEGGLMDYAFQFIQKN 210

Query: 209 KGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDAS 268
            G+ TE++YPY+ E+G+CD  KE A A TI  YED+P  DE AL +AV+ QPVSV +DAS
Sbjct: 211 -GITTESNYPYQGEQGSCDQAKENAQAVTIDGYEDVPANDESALQKAVAGQPVSVAIDAS 269

Query: 269 GRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRIL 328
           G+ F FY  GV   +C  + DHGVA VG+G     +G KYW++KNSWGE WGE GYIR+ 
Sbjct: 270 GQDFQFYSEGVFTGECSTDLDHGVAAVGYGAT--RDGTKYWIVKNSWGEDWGEKGYIRMQ 327

Query: 329 RDA----GLCGIATAASYPV 344
           R      GLCGIA  ASYP 
Sbjct: 328 RGVSQTEGLCGIAMQASYPT 347


>gi|226533314|ref|NP_001150119.1| xylem cysteine proteinase 2 [Zea mays]
 gi|195636886|gb|ACG37911.1| xylem cysteine proteinase 2 precursor [Zea mays]
 gi|223946183|gb|ACN27175.1| unknown [Zea mays]
 gi|413951209|gb|AFW83858.1| Xylem cysteine proteinase 2 [Zea mays]
          Length = 385

 Score =  314 bits (804), Expect = 5e-83,   Method: Compositional matrix adjust.
 Identities = 165/338 (48%), Positives = 210/338 (62%), Gaps = 29/338 (8%)

Query: 32  SMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEF 91
           S HE S+ E  E+W+++H R Y    EK  R  +FK NL +I++ N++ + +Y LG NEF
Sbjct: 50  SSHE-SLAELFERWLSRHRRAYASLEEKLRRFQVFKDNLHHIDETNRKVS-SYWLGLNEF 107

Query: 92  SDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNV------TDVPTSIDWREKGAVTH 145
           +DLT++EF+A Y G    V             + +          +P S+DWR KGAVT 
Sbjct: 108 ADLTHDEFKATYLGLRSSVGDGGSGIDDDDEPEEEEGYEGVDGASLPKSVDWRSKGAVTG 167

Query: 146 IKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEY 204
           +K+QGQCGSCWAFS VAAVEGI QI  G L  LSEQ+L+DC TD N+GC+GGLMD AF Y
Sbjct: 168 VKNQGQCGSCWAFSTVAAVEGINQIVTGNLTALSEQELIDCDTDGNNGCNGGLMDYAFSY 227

Query: 205 IIENKGLATEADYPYRHEEGTC--------------DNQKEKAVAATISKYEDLPKGDEQ 250
           I  N GL TE  YPY  EEGTC              ++  + A   TIS YED+P+ +EQ
Sbjct: 228 IAHNGGLHTEEAYPYLMEEGTCQRSSSSEKKWPGSSEDANDDAAVVTISGYEDVPRNNEQ 287

Query: 251 ALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWL 310
           ALL+A++ QPVSV ++ASGR F FY  GV +  CG   DHGVA VG+GTA +  G  Y +
Sbjct: 288 ALLKALAQQPVSVAIEASGRNFQFYSGGVFDGPCGTQLDHGVAAVGYGTAAK--GHDYII 345

Query: 311 IKNSWGETWGESGYIRILRDA----GLCGIATAASYPV 344
           +KNSWG +WGE GYIR+ R      GLCGI   ASYP 
Sbjct: 346 VKNSWGPSWGEKGYIRMRRGTGKRQGLCGINKMASYPT 383


>gi|357465603|ref|XP_003603086.1| Cysteine proteinase [Medicago truncatula]
 gi|355492134|gb|AES73337.1| Cysteine proteinase [Medicago truncatula]
          Length = 474

 Score =  313 bits (803), Expect = 5e-83,   Method: Compositional matrix adjust.
 Identities = 161/357 (45%), Positives = 222/357 (62%), Gaps = 28/357 (7%)

Query: 12  PMFVIIILVITCAS------QVVSGRSMH--------EPSIVEKHEQWMAQHGRTYK--D 55
           PM VI+I+     +       ++S    H        +  +   +E+W  +HG+     D
Sbjct: 9   PMLVILIVFTLFTATFALDMSIISYDKTHSDKSSRRSDKEVKNIYEEWRVKHGKLNNNID 68

Query: 56  ELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYN-RPVPSV- 113
             EK  R  IFK NL++I++ N E NRTYK+G N F+DL+NEE+R+ Y G    P+  + 
Sbjct: 69  GSEKDKRFEIFKDNLKFIDEHNAE-NRTYKVGLNRFADLSNEEYRSRYLGTKIDPIGMMM 127

Query: 114 SRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRG 173
           +R  +R + +       +P S+DWR +GAV  +KDQG CGSCWAFS +AAVEGI +I  G
Sbjct: 128 ARTKTRSNRYAPSVGDKLPKSVDWRSQGAVVQVKDQGSCGSCWAFSTIAAVEGINKIVTG 187

Query: 174 KLIELSEQQLVDCS-TDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEK 232
           +L+ LSEQ+LVDC  T N GC GGLM+ AFE+II N G+ ++ DYPYR  +G CD  K+ 
Sbjct: 188 ELVSLSEQELVDCDRTVNAGCDGGLMEYAFEFIINNGGIDSDEDYPYRGVDGKCDQYKKN 247

Query: 233 AVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGV 292
           A   +I  YE +P  DE AL +AV+NQP+SV ++A GR F  Y SG+    CG   DHGV
Sbjct: 248 ARVVSIDDYEQVPAYDELALKKAVANQPISVAIEAGGREFQLYVSGIFTGKCGTALDHGV 307

Query: 293 AVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD-----AGLCGIATAASYPV 344
             VG+GT   ENG  YW+++NSWG++WGESGY+R+ R+     AG CGI   +SYP+
Sbjct: 308 TAVGYGT---ENGVDYWIVRNSWGKSWGESGYVRMERNLAASVAGKCGIVMQSSYPI 361


>gi|172052260|gb|ACB70409.1| cysteine protease [Nicotiana tabacum]
          Length = 361

 Score =  313 bits (803), Expect = 6e-83,   Method: Compositional matrix adjust.
 Identities = 160/315 (50%), Positives = 205/315 (65%), Gaps = 18/315 (5%)

Query: 40  EKHEQWMAQHGRTYKDEL-EKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEE 98
           E +E+W + H  T    L EK  R N+FK N+ Y+   NK+ ++ YKL  N+F+D+TN E
Sbjct: 36  ELYERWRSHH--TVSRSLDEKDKRFNVFKANVHYVHNFNKK-DKPYKLKLNKFADMTNHE 92

Query: 99  FRALYTG----YNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGS 154
           FR  Y G    ++R     SR +    TF Y +   VP ++DWR+KGAVT +KDQG+CGS
Sbjct: 93  FRHHYAGSKIKHHRTFLGASRANG---TFMYAHEDSVPPTVDWRKKGAVTPVKDQGKCGS 149

Query: 155 CWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGLAT 213
           CWAFS V AVEGI QI   +L+ LSEQ+LVDC T  N GC+GGLMD AFE+I +  G+ T
Sbjct: 150 CWAFSTVVAVEGINQIKTNELVSLSEQELVDCDTSQNQGCNGGLMDMAFEFIKKKGGINT 209

Query: 214 EADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFH 273
           E +YPY  E G CD QK  +   +I  +ED+P  DE +LL+AV+NQPVSV + ASG  F 
Sbjct: 210 EENYPYMAEGGECDIQKRNSPVVSIDGHEDVPPNDEGSLLKAVANQPVSVAIQASGSDFQ 269

Query: 274 FYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILR---- 329
           FY  GV   DCG   DHGVA+VG+GT  +    KYW++KNSWG  WGE GYIR+ R    
Sbjct: 270 FYSEGVFTGDCGTELDHGVAIVGYGTTLDR--TKYWIVKNSWGPEWGEKGYIRMQREIDA 327

Query: 330 DAGLCGIATAASYPV 344
           + GLCGIA   SYP+
Sbjct: 328 EEGLCGIAMQPSYPI 342


>gi|168006315|ref|XP_001755855.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162693174|gb|EDQ79528.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 454

 Score =  313 bits (803), Expect = 6e-83,   Method: Compositional matrix adjust.
 Identities = 157/319 (49%), Positives = 212/319 (66%), Gaps = 16/319 (5%)

Query: 34  HEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSD 93
           +E  + E+   W  +HG+ Y    E A R  ++K NLEYI++ + E NR+Y LG  +F+D
Sbjct: 38  NERLLSEQFGAWAHKHGKVYSSLEEHAHRYMVWKDNLEYIQR-HSEKNRSYWLGLTKFAD 96

Query: 94  LTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCG 153
           +TN+EFR  YTG        S++S R + F+Y + ++ P S+DWR+KGAVT +KDQG CG
Sbjct: 97  ITNDEFRRQYTGTR---IDRSKRSKRKTGFRYAD-SEAPESVDWRKKGAVTTVKDQGSCG 152

Query: 154 SCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGLA 212
           SCWAFSA+ +VEGI  I  G+ + LSEQ+LVDC  + N GC+GGLMD AF++I+EN G+ 
Sbjct: 153 SCWAFSAIGSVEGINAIRTGEAVSLSEQELVDCDLEYNQGCNGGLMDYAFDFILENGGID 212

Query: 213 TEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAF 272
           TE DYPY+  +G CDN K+ A   TI  YED+P+ DE+AL +AV+ QPVSV ++A GR F
Sbjct: 213 TENDYPYKGLDGRCDNNKKNAHVVTIDGYEDVPENDEEALKKAVAGQPVSVAIEAGGRDF 272

Query: 273 HFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRDA- 331
             Y  GV   +CG + DHGV  VG+G+   E    YW++KNSWGE WGESGY+R+ R+  
Sbjct: 273 QLYSGGVFTGECGTDLDHGVLAVGYGS---EGSLDYWIVKNSWGEYWGESGYLRMQRNIK 329

Query: 332 ------GLCGIATAASYPV 344
                 GLCGI    SY V
Sbjct: 330 DSNHQFGLCGINIEPSYAV 348


>gi|5777889|emb|CAB53515.1| cysteine protease [Solanum tuberosum]
          Length = 466

 Score =  313 bits (802), Expect = 7e-83,   Method: Compositional matrix adjust.
 Identities = 152/350 (43%), Positives = 213/350 (60%), Gaps = 16/350 (4%)

Query: 8   SFIIPMFVIIILVITCASQVVSGRSMHEPSIVEK--------HEQWMAQHGRTYKDELEK 59
           +  I + +++I     ++  +S  S  E  I  +        +E W+ +HG++Y    EK
Sbjct: 7   TLTISLLLMLIFSTLSSASDMSIISYDETHIHHRSDDEVSALYESWLIEHGKSYNALGEK 66

Query: 60  AMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSR 119
             R  IFK NL+YI++ N   N++YKLG  +F+DLTNEE+R++Y G            ++
Sbjct: 67  DKRFQIFKDNLKYIDEQNSVPNQSYKLGLTKFADLTNEEYRSIYLGTKSSGDRRKLSKNK 126

Query: 120 PSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELS 179
              +  +    +P S+DWR+KG +  +KDQG CGSCWAFSAVAA+E I  I  G LI LS
Sbjct: 127 SDRYLPKVGDSLPESVDWRDKGVLVGVKDQGSCGSCWAFSAVAAMESINAIVTGNLISLS 186

Query: 180 EQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATI 238
           EQ+LVDC    N GC GGLMD AFE++I N G+ TE DYPY+     CD  ++ A    I
Sbjct: 187 EQELVDCDKSYNEGCDGGLMDYAFEFVINNGGIDTEEDYPYKERNDVCDQYRKNAKVVKI 246

Query: 239 SKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFG 298
             YED+P  +E+AL +AV++QPVS+ ++A GR    YKSG+    CG   DHGV   G+G
Sbjct: 247 DSYEDVPVNNEKALQKAVAHQPVSIAIEAGGRDLQHYKSGIFTGKCGTAVDHGVVAAGYG 306

Query: 299 TAEEENGAKYWLIKNSWGETWGESGYIRILRD----AGLCGIATAASYPV 344
           +   ENG  YW+++NSWG  WGE GY+R+ R+    +GLCG+AT  SYPV
Sbjct: 307 S---ENGMDYWIVRNSWGAKWGEKGYLRVQRNVASSSGLCGLATEPSYPV 353


>gi|21070926|gb|AAM34401.1|AF377947_7 putative cysteine proteinase [Oryza sativa Japonica Group]
 gi|31712050|gb|AAP68356.1| putative cysteine protease [Oryza sativa Japonica Group]
 gi|40538988|gb|AAR87245.1| putative cysteine protease [Oryza sativa Japonica Group]
 gi|108711126|gb|ABF98921.1| Papain family cysteine protease containing protein, expressed
           [Oryza sativa Japonica Group]
 gi|125545747|gb|EAY91886.1| hypothetical protein OsI_13535 [Oryza sativa Indica Group]
          Length = 350

 Score =  313 bits (802), Expect = 7e-83,   Method: Compositional matrix adjust.
 Identities = 164/354 (46%), Positives = 219/354 (61%), Gaps = 26/354 (7%)

Query: 11  IPMFVIIILVITCASQVVSGRSMHEPSIV----------EKHEQWMAQHGRTYKDELEKA 60
           +  FV+ +LV++ A+ +  GR +                 +HE+WMA+HG+TYKDE EKA
Sbjct: 3   LSTFVLAVLVMSGAAAL--GRELAGDGAAAAAAADVAMASRHEKWMAKHGKTYKDEEEKA 60

Query: 61  MRLNIFKQNLEYIEKAN----KEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQ 116
            RL +F+ N + I+  N    K+G   ++L TN F+DLT++EFRA  TGY RP P+    
Sbjct: 61  RRLEVFRANAKLIDSFNAAAEKDGGGGHRLATNRFADLTDDEFRAARTGYQRP-PAAVAG 119

Query: 117 SSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLI 176
           +     ++  ++   P S+DWR  GAVT +KDQG CG CWAFSAVAAVEG+ +I  G+L+
Sbjct: 120 AGGGFLYENFSLAAAPQSMDWRAMGAVTGVKDQGSCGCCWAFSAVAAVEGLAKIRTGQLV 179

Query: 177 ELSEQQLVDCST--DNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAV 234
            LSEQ+LVDC    ++ GC GGLMD AF+YI    GLA E+ YPYR  +           
Sbjct: 180 SLSEQELVDCDVRGEDQGCEGGLMDTAFQYIARRGGLAAESSYPYRGVD-GACRAAAGRA 238

Query: 235 AATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVL-NADCGNNCDHGVA 293
           AA+I  ++D+P  DE AL+ AV+ QPVSV ++ +G  F FY  GVL  A CG   +H V 
Sbjct: 239 AASIRGFQDVPSNDEGALMAAVARQPVSVAINGAGYVFRFYDRGVLGGAGCGTELNHAVT 298

Query: 294 VVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRDA---GLCGIATAASYPV 344
            VG+GTA +  G  YWL+KNSWG +WGE GY+RI R     G CGIA  ASYPV
Sbjct: 299 AVGYGTASDGTG--YWLMKNSWGASWGEGGYVRIRRGVGREGACGIAQMASYPV 350


>gi|355344587|gb|AER60490.1| cysteine proteases [Gossypium hirsutum]
          Length = 371

 Score =  313 bits (802), Expect = 7e-83,   Method: Compositional matrix adjust.
 Identities = 151/309 (48%), Positives = 208/309 (67%), Gaps = 9/309 (2%)

Query: 42  HEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRA 101
           ++ W+ QHG+ Y    E+  R  IFK NL +I++ N   N TYKLG N+F+DLTN+E+RA
Sbjct: 45  YKSWVIQHGKAYNGIGEEEKRFEIFKDNLRFIDEHNSNNNTTYKLGLNKFADLTNQEYRA 104

Query: 102 LYTGYNRPVPSVSRQSSRPST-FKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSA 160
            + G          +S  PS+ + ++   ++P S+DWR+ GAV+ +KDQG CGSCWAFS 
Sbjct: 105 KFLGTRTDPRRRLMKSKIPSSRYAHRAGDNLPDSVDWRDHGAVSPVKDQGSCGSCWAFST 164

Query: 161 VAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGLATEADYPY 219
           +A VEGI +I  G+L+ LSEQ+LVDC    + GC+GGLMD AF++I++N G+ TE DYPY
Sbjct: 165 IATVEGINKIVSGELVSLSEQELVDCDRSYDAGCNGGLMDYAFQFIMDNGGIDTEKDYPY 224

Query: 220 RHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGV 279
                 CD  K+ A   +I  YED+P  +E AL +AV++QPVS+ ++A GRAF  Y+SGV
Sbjct: 225 LGFNNQCDPTKKNAKVVSIDGYEDVPN-NENALKKAVAHQPVSIAIEAGGRAFQLYESGV 283

Query: 280 LNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILR----DAGLCG 335
            N +CG   DHGV  VG+GT  ++NG  YW+++NSWG  WGE+GYIR+ R    + G CG
Sbjct: 284 FNGECGLALDHGVVAVGYGT--DDNGQDYWIVRNSWGSNWGENGYIRMERNINANTGKCG 341

Query: 336 IATAASYPV 344
           IA  ASYPV
Sbjct: 342 IAMEASYPV 350


>gi|413951605|gb|AFW84254.1| hypothetical protein ZEAMMB73_933931 [Zea mays]
          Length = 423

 Score =  313 bits (802), Expect = 7e-83,   Method: Compositional matrix adjust.
 Identities = 156/350 (44%), Positives = 210/350 (60%), Gaps = 23/350 (6%)

Query: 13  MFVIIILVITCASQVVSGRSMHEPSIV------EKHEQWMAQHGRTYKDELEKAMRLNIF 66
           + V ++ V + A ++       E  +       + +E+W   H R ++   EK  R   F
Sbjct: 53  LLVALVFVSSAAVELCRAIDFDERDLASDEALWDLYERWQTHH-RVHRHHGEKGRRFGTF 111

Query: 67  KQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPST---- 122
           K+N+ +I   NK G+R Y+L  N F D+  EEFR+ +   +  +  + RQ S  +     
Sbjct: 112 KENVRFIHAHNKRGDRPYRLRLNRFGDMGREEFRSTFA--DSRINDLRRQDSPAARAGAV 169

Query: 123 --FKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSE 180
             F Y +  D P S+DWR++GAVT +KDQG CGSCWAFS V AVEGI  I  G L  LSE
Sbjct: 170 PGFMYDSAADPPRSVDWRQEGAVTGVKDQGHCGSCWAFSTVVAVEGINAIRTGSLASLSE 229

Query: 181 QQLVDCSTDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEK---AVAAT 237
           Q+L+DC TD +GC GGLM+ AFE+I    G+ TEA YPYR   GTCD  + +    V   
Sbjct: 230 QELIDCDTDENGCQGGLMENAFEFIKSFGGITTEAAYPYRASNGTCDGDRARRGGGVVVV 289

Query: 238 ISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGF 297
           I  ++ +P G E AL +AV++QPVSV VDA G+AF FY  GV   DCG + DHGVA VG+
Sbjct: 290 IDGHQMVPAGSEDALAKAVAHQPVSVAVDAGGQAFQFYSEGVFTGDCGTDLDHGVAAVGY 349

Query: 298 GTAEEENGAKYWLIKNSWGETWGESGYIRILRDA---GLCGIATAASYPV 344
           G  ++  G  YW++KNSWG +WGE GYIR+ R A   GLCGIA  AS+P+
Sbjct: 350 GVGDD--GTPYWIVKNSWGTSWGEGGYIRMQRGAGNGGLCGIAMEASFPI 397


>gi|413951606|gb|AFW84255.1| hypothetical protein ZEAMMB73_933931 [Zea mays]
          Length = 379

 Score =  313 bits (802), Expect = 8e-83,   Method: Compositional matrix adjust.
 Identities = 156/350 (44%), Positives = 210/350 (60%), Gaps = 23/350 (6%)

Query: 13  MFVIIILVITCASQVVSGRSMHEPSIV------EKHEQWMAQHGRTYKDELEKAMRLNIF 66
           + V ++ V + A ++       E  +       + +E+W   H R ++   EK  R   F
Sbjct: 9   LLVALVFVSSAAVELCRAIDFDERDLASDEALWDLYERWQTHH-RVHRHHGEKGRRFGTF 67

Query: 67  KQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPST---- 122
           K+N+ +I   NK G+R Y+L  N F D+  EEFR+ +   +  +  + RQ S  +     
Sbjct: 68  KENVRFIHAHNKRGDRPYRLRLNRFGDMGREEFRSTFA--DSRINDLRRQDSPAARAGAV 125

Query: 123 --FKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSE 180
             F Y +  D P S+DWR++GAVT +KDQG CGSCWAFS V AVEGI  I  G L  LSE
Sbjct: 126 PGFMYDSAADPPRSVDWRQEGAVTGVKDQGHCGSCWAFSTVVAVEGINAIRTGSLASLSE 185

Query: 181 QQLVDCSTDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEK---AVAAT 237
           Q+L+DC TD +GC GGLM+ AFE+I    G+ TEA YPYR   GTCD  + +    V   
Sbjct: 186 QELIDCDTDENGCQGGLMENAFEFIKSFGGITTEAAYPYRASNGTCDGDRARRGGGVVVV 245

Query: 238 ISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGF 297
           I  ++ +P G E AL +AV++QPVSV VDA G+AF FY  GV   DCG + DHGVA VG+
Sbjct: 246 IDGHQMVPAGSEDALAKAVAHQPVSVAVDAGGQAFQFYSEGVFTGDCGTDLDHGVAAVGY 305

Query: 298 GTAEEENGAKYWLIKNSWGETWGESGYIRILRDA---GLCGIATAASYPV 344
           G  ++  G  YW++KNSWG +WGE GYIR+ R A   GLCGIA  AS+P+
Sbjct: 306 GVGDD--GTPYWIVKNSWGTSWGEGGYIRMQRGAGNGGLCGIAMEASFPI 353


>gi|359473128|ref|XP_002285397.2| PREDICTED: vignain-like [Vitis vinifera]
          Length = 357

 Score =  313 bits (802), Expect = 8e-83,   Method: Compositional matrix adjust.
 Identities = 156/346 (45%), Positives = 221/346 (63%), Gaps = 10/346 (2%)

Query: 5   FEKSFIIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLN 64
            EK  ++ + ++++  +  +          E S+ + +E+W + H    +D  EK  R N
Sbjct: 1   MEKVILVALSLVLVFGLAESFDFDEKDLASEESLWDLYERWRSYH-TVSRDLEEKNKRFN 59

Query: 65  IFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPST-F 123
           +FK+N +++ K N+  ++ YKL  N+F+D+TN EFR+ Y G       + R   R +  F
Sbjct: 60  VFKENTKHVHKVNQM-DKPYKLKLNKFADMTNHEFRSSYGGSKVKHYRMLRGDRRGTGGF 118

Query: 124 KYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQL 183
            ++  T +P S+DWR+KGAVT IKDQG+CGSCWAFS V  VEGI QI   +L+ LSEQQL
Sbjct: 119 MHEKTTYLPPSVDWRKKGAVTGIKDQGKCGSCWAFSTVVGVEGINQIKTKELLSLSEQQL 178

Query: 184 VDCS-TDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYE 242
           +DC  +D+HGC+GGLM+ AFE+I +N G+ TE +YPY+ ++  CD  K  A   TI  +E
Sbjct: 179 IDCDRSDDHGCNGGLMESAFEFIKKNGGITTENNYPYKAKDERCDMLKMNAPVVTIDGHE 238

Query: 243 DLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEE 302
            +P  DE+AL++AV++QPVSV +DA G    FY  GV + +CG   DHGVA+VG+GT  +
Sbjct: 239 SVPVNDERALMKAVAHQPVSVAIDAGGSDLQFYSEGVFDGECGTELDHGVAIVGYGTTLD 298

Query: 303 ENGAKYWLIKNSWGETWGESGYIRILRD----AGLCGIATAASYPV 344
             G KYW++KNSWG  WGE GYIR+ R      G CGIA  ASYPV
Sbjct: 299 --GTKYWIVKNSWGAEWGEKGYIRMARGIQAAEGQCGIAMEASYPV 342


>gi|297819566|ref|XP_002877666.1| hypothetical protein ARALYDRAFT_906213 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297323504|gb|EFH53925.1| hypothetical protein ARALYDRAFT_906213 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 304

 Score =  313 bits (801), Expect = 9e-83,   Method: Compositional matrix adjust.
 Identities = 162/321 (50%), Positives = 216/321 (67%), Gaps = 33/321 (10%)

Query: 33  MHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFS 92
           + E S +EKHEQWM++  R Y D+ EK  R  IFK+NL+++E  N   N TYKL  N+FS
Sbjct: 9   LFEASAIEKHEQWMSRFNRVYSDDSEKTSRFEIFKKNLKFVESFNMNTNNTYKLDVNKFS 68

Query: 93  DLTNEEFRALYTGYNRPVP-SVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQ 151
           DLT+EEF+A Y G    VP  ++  S +  +F+Y+NV++   S+DWR +GAVT +KDQGQ
Sbjct: 69  DLTDEEFQARYMGL---VPEGMTGDSQKTVSFRYENVSETGESMDWRLEGAVTPVKDQGQ 125

Query: 152 CGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST--DNHGCSGGLMDKAFEYIIENK 209
           CG CWAF+AVAAVEG+T+I  G+L+ LSEQQLVDCST  +N GC GGL   A++YI EN+
Sbjct: 126 CGCCWAFAAVAAVEGVTKIANGELVSLSEQQLVDCSTANNNMGCDGGLALTAYDYIKENQ 185

Query: 210 GLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASG 269
           G+ +E +YPY+  + TC  +     AATIS YE +PK DE+ALL+AVS            
Sbjct: 186 GITSEENYPYQAVQQTC--KSTDPAAATISGYEAVPKDDEEALLKAVS------------ 231

Query: 270 RAFHFYKSGVLNAD-CGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRIL 328
                 + G+   + CG +  H V +VG+GT+EE  G KYWL+KNSWGE+WGE+GY+RI 
Sbjct: 232 ------QHGIFEDEYCGTDSHHAVTIVGYGTSEE--GIKYWLLKNSWGESWGENGYMRIK 283

Query: 329 RDA----GLCGIATAASYPVA 345
           RD     G+CG+A  A YPVA
Sbjct: 284 RDVDEPQGMCGLAHRAYYPVA 304


>gi|414591545|tpg|DAA42116.1| TPA: hypothetical protein ZEAMMB73_388689 [Zea mays]
          Length = 384

 Score =  313 bits (801), Expect = 1e-82,   Method: Compositional matrix adjust.
 Identities = 153/324 (47%), Positives = 212/324 (65%), Gaps = 16/324 (4%)

Query: 35  EPSIVEKHEQWMAQHGRTY----KDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNE 90
           E S+   +E+W + + R       D+ ++A R N+FK+N  Y+ +AN++  R ++L  N+
Sbjct: 34  EESLRALYERWRSHYHRVSPRDGDDKQQQARRFNVFKENARYVHEANRKDGRPFRLALNK 93

Query: 91  FSDLTNEEFRALYTG----YNRPVPSVSRQSSRPSTFKY-QNVTDVPTSIDWREKGAVTH 145
           F+D+T +EFR  Y G    ++R     +R  +     +     T++P ++DWR +GAVT 
Sbjct: 94  FADMTTDEFRRTYAGSRTRHHRAQLGEARSFAHAQHGRGGSGTTNLPPAVDWRLRGAVTG 153

Query: 146 IKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCS-TDNHGCSGGLMDKAFEY 204
           +KDQGQCGSCWAFSA+AAVEG+ +I  GKL+ LSEQ+LVDC   DN GC GGLMD AF+Y
Sbjct: 154 VKDQGQCGSCWAFSAIAAVEGVNKIMTGKLVSLSEQELVDCDDVDNQGCDGGLMDYAFQY 213

Query: 205 IIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVC 264
           I  N G+ TE++YPY  E+ +C+  KE++   TI  YED+P  +E AL +AV++QPV+V 
Sbjct: 214 IQRNGGVTTESNYPYLAEQRSCNKAKERSHDVTIDGYEDVPANNEDALQKAVASQPVAVA 273

Query: 265 VDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGY 324
           ++ASG+ F FY  GV    CG + DHGVA VG+GT  +  G KYW +KNSWGE WGE GY
Sbjct: 274 IEASGQDFQFYSEGVFTGSCGTDLDHGVAAVGYGTTGD--GTKYWTVKNSWGEDWGERGY 331

Query: 325 IRILRDA----GLCGIATAASYPV 344
           IR+ R      GLCGIA   SYP 
Sbjct: 332 IRMQRGVPDSRGLCGIAMEPSYPT 355


>gi|357446975|ref|XP_003593763.1| Cysteine proteinase [Medicago truncatula]
 gi|355482811|gb|AES64014.1| Cysteine proteinase [Medicago truncatula]
          Length = 350

 Score =  313 bits (801), Expect = 1e-82,   Method: Compositional matrix adjust.
 Identities = 162/331 (48%), Positives = 210/331 (63%), Gaps = 9/331 (2%)

Query: 17  IILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKA 76
           IIL+  CA   +S R++ E S+VE H+QWM ++ RTY +  E   R  IFK+NLEYIE  
Sbjct: 9   IILLWACAYPTMS-RTLTESSVVEAHQQWMMKYERTYTNSSEMEKRKKIFKENLEYIENF 67

Query: 77  NKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSID 136
           N  GN++YKLG N +SDLT+EEF A +TG+ +    +S    R     +    DVPT+ D
Sbjct: 68  NNVGNKSYKLGLNRYSDLTSEEFIASHTGF-KVSDQLSDSKMRSVAIPFNLNDDVPTNFD 126

Query: 137 WREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTDNHGCSGG 196
           WREKG VT +K+Q QCG CWAF+AVAAVEGI +I  G LI LSEQQLVDC   + GC GG
Sbjct: 127 WREKGVVTDVKNQRQCGCCWAFTAVAAVEGIVKIKNGNLISLSEQQLVDCDRQSSGCGGG 186

Query: 197 LMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAV 256
               AF+ II+++G+  E DYPY+  +       +   AA I+ Y  +P  DEQ LL+AV
Sbjct: 187 DFVLAFDSIIKSRGIVKEDDYPYKANDVQTCQLGQIPGAAQINGYFKVPANDEQQLLRAV 246

Query: 257 SNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWG 316
             QPVSV +  S   FH Y  GV    CG   +H V ++G+G +E   G KYWLIKNSWG
Sbjct: 247 LQQPVSVAISTS-YDFHHYMGGVYEGSCGPKLNHAVTIIGYGVSEA--GKKYWLIKNSWG 303

Query: 317 ETWGESGYIRILRDA----GLCGIATAASYP 343
           ETWGE GY+++LR++    G C IA  A+YP
Sbjct: 304 ETWGEKGYMKVLRESSATGGQCSIAVHAAYP 334


>gi|22759715|dbj|BAC10906.1| cysteine proteinase [Zinnia elegans]
          Length = 352

 Score =  312 bits (800), Expect = 1e-82,   Method: Compositional matrix adjust.
 Identities = 158/354 (44%), Positives = 226/354 (63%), Gaps = 17/354 (4%)

Query: 1   MVLKFEKSFIIPMFVIIILVITCASQV-VSGRSMHEPSIVEK----HEQWMAQHGRTYKD 55
            +   +K+ ++ +FV I+     A +  + G +  + + + K     E W+ +H + Y+ 
Sbjct: 3   FIFSSKKTSLLFLFVSILACSALAHEFSILGYAPEDLTSIHKVIHLFESWLVKHSKFYES 62

Query: 56  ELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSR 115
             EK  R  IF  NL++I++ NK+ +  Y LG NEF+DLT+EEF+  + G+   +     
Sbjct: 63  LDEKLHRFEIFMDNLKHIDETNKKVS-NYWLGLNEFADLTHEEFKHKFLGFKGELAERKD 121

Query: 116 QSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKL 175
           +SS+   F Y++  D+P S+DWR+KGAV  +K+QGQCGSCWAFS VAAVEGI QI  G L
Sbjct: 122 ESSKE--FGYRDFVDLPKSVDWRKKGAVAPVKNQGQCGSCWAFSTVAAVEGINQIVTGNL 179

Query: 176 IELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAV 234
             LSEQ+L+DC T  N+GC+GGLMD AF Y++ + GL  E +YPY   EGTCD +K+ + 
Sbjct: 180 TMLSEQELIDCDTTFNNGCNGGLMDYAFAYVMRS-GLHKEEEYPYIMSEGTCDEKKDVSE 238

Query: 235 AATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAV 294
             TIS Y D+P+ DE + L+A++NQP+SV ++ASGR F FY  GV +  CG   DHGVA 
Sbjct: 239 KVTISGYHDVPRNDEASFLKALANQPISVAIEASGRDFQFYSGGVFDGHCGTELDHGVAA 298

Query: 295 VGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRDA----GLCGIATAASYPV 344
           VG+GT +   G  Y +++NSWG  WGE GYIR+ R +    G+CG+   ASYP 
Sbjct: 299 VGYGTTK---GLDYVIVRNSWGPKWGEKGYIRMKRGSGKPHGMCGLYMMASYPT 349


>gi|359359213|gb|AEV41117.1| putative oryzain beta chain precursor [Oryza officinalis]
          Length = 465

 Score =  312 bits (799), Expect = 2e-82,   Method: Compositional matrix adjust.
 Identities = 149/310 (48%), Positives = 209/310 (67%), Gaps = 14/310 (4%)

Query: 42  HEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKAN-KEGNRTYKLGTNEFSDLTNEEFR 100
           ++ W+A++GR+Y    E   R  +F  NL + +  N +  +  ++LG N F+DLTNEEFR
Sbjct: 54  YDLWLAENGRSYNALGEHERRFRVFWDNLRFADAHNARADDHGFRLGMNRFADLTNEEFR 113

Query: 101 ALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSA 160
           A + G       V R  +    +++  V ++P S+DWREKGAV  +K+QGQCGSCWAFSA
Sbjct: 114 ATFLG----AKVVERSRAAGERYRHDGVEELPESVDWREKGAVAPVKNQGQCGSCWAFSA 169

Query: 161 VAAVEGITQITRGKLIELSEQQLVDCSTD--NHGCSGGLMDKAFEYIIENKGLATEADYP 218
           V+ VE I Q+  G++I LSEQ+LV+CST+  N GC+GGLMD AF++II+N G+ TE DYP
Sbjct: 170 VSTVESINQLVTGEMITLSEQELVECSTNGQNSGCNGGLMDDAFDFIIKNGGIDTEDDYP 229

Query: 219 YRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSG 278
           Y+  +G CD  +E A   +I  +ED+P+ DE++L +AV++QPVSV ++A GR F  Y SG
Sbjct: 230 YKAVDGKCDINRENAKVVSIDGFEDVPQNDEKSLQKAVAHQPVSVAIEAGGREFQLYHSG 289

Query: 279 VLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD----AGLC 334
           V +  CG + DHGV  VG+GT   +NG  YW+++NSWG  WGESGY+R+ R+     G C
Sbjct: 290 VFSGRCGTSLDHGVVAVGYGT---DNGKDYWIVRNSWGPKWGESGYVRMERNINVTTGKC 346

Query: 335 GIATAASYPV 344
           GIA  ASYP 
Sbjct: 347 GIAMMASYPT 356


>gi|359359066|gb|AEV40973.1| putative oryzain beta chain precursor [Oryza punctata]
          Length = 461

 Score =  312 bits (799), Expect = 2e-82,   Method: Compositional matrix adjust.
 Identities = 147/311 (47%), Positives = 211/311 (67%), Gaps = 15/311 (4%)

Query: 42  HEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNR--TYKLGTNEFSDLTNEEF 99
           ++ W+A++GR+Y    E+  R  +F  NL++++  N   +    ++LG N F+DLTN+EF
Sbjct: 49  YDLWLAENGRSYNALGERERRFRVFWDNLKFVDAHNARADEHGGFRLGMNRFADLTNDEF 108

Query: 100 RALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFS 159
           R+ + G       V R  +    +++  V ++P S+DWREKGAV  +K+QGQCGSCWAFS
Sbjct: 109 RSTFLG----AKVVERSRAAGERYRHDGVEELPESVDWREKGAVAPVKNQGQCGSCWAFS 164

Query: 160 AVAAVEGITQITRGKLIELSEQQLVDCSTD--NHGCSGGLMDKAFEYIIENKGLATEADY 217
           AV+ VE I Q+  G++I LSEQ+LV+CST+  N GC+GGLMD AF++II+N G+ TE DY
Sbjct: 165 AVSTVESINQLVTGEMITLSEQELVECSTNGQNSGCNGGLMDDAFDFIIKNGGIDTEDDY 224

Query: 218 PYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKS 277
           PY+  +G CD  +E A   +I  +ED+P+ DE++L +AV++QPVSV ++A GR F  Y S
Sbjct: 225 PYKAVDGKCDINRENAKVVSIDGFEDVPQNDEKSLQKAVAHQPVSVAIEAGGREFQLYHS 284

Query: 278 GVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD----AGL 333
           GV +  CG + DHGV  VG+GT   +NG  YW+++NSWG  WGESGY+R+ R+     G 
Sbjct: 285 GVFSGRCGTSLDHGVVAVGYGT---DNGKDYWIVRNSWGPKWGESGYVRMERNINATTGK 341

Query: 334 CGIATAASYPV 344
           CGIA  ASYP 
Sbjct: 342 CGIAMMASYPT 352


>gi|32396020|gb|AAP41847.1| senescence-associated cysteine protease [Anthurium andraeanum]
          Length = 460

 Score =  312 bits (799), Expect = 2e-82,   Method: Compositional matrix adjust.
 Identities = 161/328 (49%), Positives = 215/328 (65%), Gaps = 16/328 (4%)

Query: 27  VVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANK-EGNRTYK 85
           +V+ R+  E  ++  +E W+  +G+ Y    EK  R  IF  NL YI+  N+ E N +Y 
Sbjct: 25  IVAERTEEEVRLL--YEGWLVGNGKAYNLLGEKERRFEIFWDNLRYIDDHNRAENNHSYT 82

Query: 86  LGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSR-PSTFK--YQNVTDVPTSIDWREKGA 142
           LG   F+DLTNEE+R+ Y G  +P     R+++R P   +    N  D+P  +DWREKGA
Sbjct: 83  LGLTRFADLTNEEYRSTYLGV-KPGQVRPRRANRAPGRGRDLSANGDDLPQKVDWREKGA 141

Query: 143 VTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKA 201
           V  IKDQG CGSCWAFS VAAVEGI QI  G LI LSEQ+LVDC T  N GC+GGLMD A
Sbjct: 142 VAPIKDQGGCGSCWAFSTVAAVEGINQIVTGDLIVLSEQELVDCDTAYNEGCNGGLMDYA 201

Query: 202 FEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPV 261
           F++II N G+ TE DYPY+  +G CD  ++ A   +I  YED+ + DE AL  AV++QPV
Sbjct: 202 FQFIISNGGIDTEEDYPYKERDGLCDPNRKNAKVVSIDSYEDVLENDEHALKTAVAHQPV 261

Query: 262 SVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGE 321
           SV ++  GR+F  YKSG+ +  CG + DHGV  VG+GT   E+G  YW+++NSWG++WGE
Sbjct: 262 SVAIEGGGRSFQLYKSGIFDGRCGIDLDHGVVAVGYGT---ESGKDYWIVRNSWGKSWGE 318

Query: 322 SGYIRILRD-----AGLCGIATAASYPV 344
           +GYIR+ R+     +G CGIA   SYP+
Sbjct: 319 AGYIRMERNLPSSSSGKCGIAIEPSYPI 346


>gi|357126406|ref|XP_003564878.1| PREDICTED: cysteine proteinase EP-B 1-like [Brachypodium
           distachyon]
          Length = 377

 Score =  311 bits (798), Expect = 2e-82,   Method: Compositional matrix adjust.
 Identities = 155/325 (47%), Positives = 209/325 (64%), Gaps = 19/325 (5%)

Query: 35  EPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRT--------YKL 86
           E ++ E + +W + H    +   EK  R   FK N+ +I   N   N T        Y+L
Sbjct: 35  EEALWELYTRWQSAHRLPPQHHAEKHRRFGTFKSNVLFIHAHNTRLNDTSTNNNGPSYRL 94

Query: 87  GTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHI 146
             N F D+   EFR+ + G   P+   +R +     F Y  V D+P ++DWR+KGAVT +
Sbjct: 95  RLNRFGDMDQAEFRSTFAG---PLHRHTRPAQSIPGFIYDTVKDIPQAVDWRQKGAVTGV 151

Query: 147 KDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST--DNHGCSGGLMDKAFEY 204
           KDQG+CGSCWAFSAVA+VEG+  I  G L+ LSEQ+L+DC T  D++GC GGLM+ AFE+
Sbjct: 152 KDQGKCGSCWAFSAVASVEGLNAIRTGSLVSLSEQELIDCDTGGDDNGCQGGLMESAFEF 211

Query: 205 IIENKG-LATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSV 263
           I  + G LATEA YPY    GTC+  +  +V+  I  ++ +P G+E+AL +AV++QPVSV
Sbjct: 212 IAHSAGGLATEAAYPYHASNGTCNANRGSSVSVRIDGHQSVPAGNEEALAKAVAHQPVSV 271

Query: 264 CVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESG 323
            +DA G+AF FY  GV   DCG+  DHGVAVVG+G AEE+ G +YW++KNSWG  WGE G
Sbjct: 272 AIDAGGQAFQFYSEGVFTGDCGSELDHGVAVVGYGVAEED-GKEYWIVKNSWGPGWGEHG 330

Query: 324 YIRILRDA----GLCGIATAASYPV 344
           Y+R+ RD+    GLCGIA  ASYPV
Sbjct: 331 YVRMQRDSGVDGGLCGIAMEASYPV 355


>gi|226503129|ref|NP_001149806.1| LOC100283433 precursor [Zea mays]
 gi|195634783|gb|ACG36860.1| xylem cysteine proteinase 2 precursor [Zea mays]
 gi|219884977|gb|ACL52863.1| unknown [Zea mays]
          Length = 377

 Score =  311 bits (798), Expect = 2e-82,   Method: Compositional matrix adjust.
 Identities = 163/317 (51%), Positives = 210/317 (66%), Gaps = 20/317 (6%)

Query: 38  IVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNE 97
           +V   E+W+A++ + Y    EK  R  +FK NL +I++AN++   +Y LG N F+DLT++
Sbjct: 68  LVRLFEEWVAKYRKAYGSFEEKLRRFEVFKDNLHHIDEANRKEVTSYWLGLNAFADLTHD 127

Query: 98  EFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDV----PTSIDWREKGAVTHIKDQGQCG 153
           EF+A Y G      S  R       F+Y  V D     P S+DWR+KGAVT +K+QGQCG
Sbjct: 128 EFKATYLGLLPKRTSGGR-------FRYGGVGDGGDEVPASVDWRKKGAVTEVKNQGQCG 180

Query: 154 SCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGLA 212
           SCWAFS VAAVEGI QI  G L  LSEQQLVDCSTD N+GCSGG+MD AF +I    GL 
Sbjct: 181 SCWAFSTVAAVEGINQIVTGNLTSLSEQQLVDCSTDGNNGCSGGVMDNAFSFIATGAGLR 240

Query: 213 TEADYPYRHEEGTCDNQ-KEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRA 271
           +E  YPY  EEG CD++ ++  V  TIS YED+P  DEQAL++A+++QPVSV ++ASGR 
Sbjct: 241 SEEAYPYLMEEGDCDDRARDGEVLVTISGYEDVPANDEQALVKALAHQPVSVAIEASGRH 300

Query: 272 FHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRDA 331
           F FY  GV +  CG+  DHGVA VG+G+++   G  Y ++KNSWG  WGE GYIR+ R  
Sbjct: 301 FQFYSGGVFDGPCGSELDHGVAAVGYGSSK---GQDYIIVKNSWGTHWGEKGYIRMKRGT 357

Query: 332 ----GLCGIATAASYPV 344
               GLCGI   ASYP 
Sbjct: 358 GKPEGLCGINKMASYPT 374


>gi|2511693|emb|CAB17076.1| cysteine proteinase precursor [Phaseolus vulgaris]
          Length = 455

 Score =  311 bits (798), Expect = 2e-82,   Method: Compositional matrix adjust.
 Identities = 162/347 (46%), Positives = 213/347 (61%), Gaps = 21/347 (6%)

Query: 13  MFVIIILVITCASQVVSGRSMHEPSIVEK--------HEQWMAQHGRTYKDELEKAMRLN 64
           +F +  L       ++S  + H+     +        +E+W+ +HG+ Y    EK  R  
Sbjct: 3   LFALFALSSALDMSIISYDNAHQDKATWRTDEEVNSLYEEWLVKHGKLYNALGEKDKRFQ 62

Query: 65  IFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFK 124
           IFK NL +I++ N E NRTYKLG N F+DLTNEE+RA Y G    +    R    PS   
Sbjct: 63  IFKDNLRFIDQQNAE-NRTYKLGLNRFADLTNEEYRARYLG--TKIDPNRRLGRTPSNRY 119

Query: 125 YQNVTD-VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQL 183
              V + +P S+DWR++GAV  +KDQ  CGSCWAFSA+ AVEGI +I  G LI LSEQ+L
Sbjct: 120 APRVGETLPDSVDWRKEGAVVPVKDQASCGSCWAFSAIGAVEGINKIVTGDLISLSEQEL 179

Query: 184 VDCSTD-NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYE 242
           VDC T  N GC+GGLMD AFE+II+N G+ +E DYPY+  +G CD  ++ A   +I  YE
Sbjct: 180 VDCDTGYNMGCNGGLMDYAFEFIIKNGGIDSEEDYPYKGVDGRCDEYRKNAKVVSIDGYE 239

Query: 243 DLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEE 302
           D+   DE AL +AV+NQPVSV V+  GR F  Y SGV    CG   DHGV  VG+GT   
Sbjct: 240 DVNTYDELALKKAVANQPVSVAVEGGGREFQLYSSGVFTGRCGTALDHGVVAVGYGT--- 296

Query: 303 ENGAKYWLIKNSWGETWGESGYIRILRD-----AGLCGIATAASYPV 344
           +NG  +W+++NSWG  WGE GYIR+ R+     +G CGIA   SYP+
Sbjct: 297 DNGHDFWIVRNSWGADWGEEGYIRLERNLGNSRSGKCGIAIEPSYPI 343


>gi|414589857|tpg|DAA40428.1| TPA: Vignain [Zea mays]
          Length = 377

 Score =  311 bits (798), Expect = 2e-82,   Method: Compositional matrix adjust.
 Identities = 157/328 (47%), Positives = 204/328 (62%), Gaps = 22/328 (6%)

Query: 38  IVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNE 97
           ++E+ EQWM +HGR Y D  EK  RL ++++N+E +E  N  GN  Y+L  N+F+DLTNE
Sbjct: 50  MLERFEQWMGRHGRLYADAGEKQRRLEVYRRNVELVETFNSMGN-GYRLADNKFADLTNE 108

Query: 98  EFRALYTGYNRPVPSV-SRQSSRPST--------FKYQNVTDVPTSIDWREKGAVTHIKD 148
           EFRA   G+ RP     +  S+ PST           Q  +D+P S+DWREKGAV  +K 
Sbjct: 109 EFRAKMLGFGRPRSGGGAGHSTAPSTVACIGSGLMGRQGYSDLPKSVDWREKGAVAPVKS 168

Query: 149 QGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTDNHGCSGGLMDKAFEYIIEN 208
           QG CGSCWAFSAVAA+EGI QI  GKL+ LSEQ+LVDC T   GC+GG M  AFE++++N
Sbjct: 169 QGDCGSCWAFSAVAAIEGINQIKNGKLVSLSEQELVDCDTKAIGCAGGYMSWAFEFVMKN 228

Query: 209 KGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDAS 268
           +GL TE +YPY+   G C   K K  A +IS Y ++    E  LL+A + QPVSV VDA 
Sbjct: 229 RGLTTERNYPYQGLNGACQTPKLKESAVSISGYMNVTPSSEPDLLRAAAAQPVSVAVDAG 288

Query: 269 GRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEEN--------GAKYWLIKNSWGETWG 320
              +  Y  GV    C    +HGV VVG+G  + +         G KYW++KNSWG  WG
Sbjct: 289 SFVWQLYGGGVFTGPCTAELNHGVTVVGYGETQGDTDGDGSGVPGKKYWIVKNSWGPEWG 348

Query: 321 ESGYIRILRDA----GLCGIATAASYPV 344
           ++GYI + R+A    GLCGIA   SYPV
Sbjct: 349 DAGYILMQREASVASGLCGIAMLPSYPV 376


>gi|226507844|ref|NP_001148894.1| LOC100282514 precursor [Zea mays]
 gi|194703250|gb|ACF85709.1| unknown [Zea mays]
 gi|195622994|gb|ACG33327.1| vignain precursor [Zea mays]
          Length = 356

 Score =  311 bits (798), Expect = 2e-82,   Method: Compositional matrix adjust.
 Identities = 157/328 (47%), Positives = 204/328 (62%), Gaps = 22/328 (6%)

Query: 38  IVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNE 97
           ++E+ EQWM +HGR Y D  EK  RL ++++N+E +E  N  GN  Y+L  N+F+DLTNE
Sbjct: 29  MLERFEQWMGRHGRLYADAGEKQRRLEVYRRNVELVETFNSMGN-GYRLADNKFADLTNE 87

Query: 98  EFRALYTGYNRPVPSV-SRQSSRPST--------FKYQNVTDVPTSIDWREKGAVTHIKD 148
           EFRA   G+ RP     +  S+ PST           Q  +D+P S+DWREKGAV  +K 
Sbjct: 88  EFRAKMLGFGRPRSGGGAGHSTAPSTVACIGSGLMGRQGYSDLPKSVDWREKGAVAPVKS 147

Query: 149 QGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTDNHGCSGGLMDKAFEYIIEN 208
           QG CGSCWAFSAVAA+EGI QI  GKL+ LSEQ+LVDC T   GC+GG M  AFE++++N
Sbjct: 148 QGDCGSCWAFSAVAAIEGINQIKNGKLVSLSEQELVDCDTKAIGCAGGYMSWAFEFVMKN 207

Query: 209 KGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDAS 268
           +GL TE +YPY+   G C   K K  A +IS Y ++    E  LL+A + QPVSV VDA 
Sbjct: 208 RGLTTERNYPYQGLNGACQTPKLKESAVSISGYMNVTPSSEPDLLRAAAAQPVSVAVDAG 267

Query: 269 GRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEEN--------GAKYWLIKNSWGETWG 320
              +  Y  GV    C    +HGV VVG+G  + +         G KYW++KNSWG  WG
Sbjct: 268 SFVWQLYGGGVFTGPCTAELNHGVTVVGYGETQGDTDGDGSGVPGKKYWIVKNSWGPEWG 327

Query: 321 ESGYIRILRDA----GLCGIATAASYPV 344
           ++GYI + R+A    GLCGIA   SYPV
Sbjct: 328 DAGYILMQREASVASGLCGIAMLPSYPV 355


>gi|37780049|gb|AAP32197.1| cysteine protease 10 [Trifolium repens]
          Length = 272

 Score =  311 bits (797), Expect = 3e-82,   Method: Compositional matrix adjust.
 Identities = 151/271 (55%), Positives = 191/271 (70%), Gaps = 11/271 (4%)

Query: 81  NRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREK 140
           N+ YKLG N+F+DLTNEEF+A     N+    +     R +TFKY+N + +P+++DWR+K
Sbjct: 7   NKLYKLGINKFADLTNEEFKA---SRNKFKGHMCSSIIRTTTFKYENASAIPSTVDWRKK 63

Query: 141 GAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD--NHGCSGGLM 198
           GAVT +K+QGQCGSCWAFSAVAA EGI Q++ GKL+ LSEQ+L+DC T   + GC GGLM
Sbjct: 64  GAVTPVKNQGQCGSCWAFSAVAATEGIHQLSTGKLVSLSEQELIDCDTKGVDQGCEGGLM 123

Query: 199 DKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSN 258
           D AF++II+N GL+TE  YPY   +GTC+  +    A TI+ YED+P  +E AL +AV+N
Sbjct: 124 DDAFKFIIQNHGLSTEVQYPYEGVDGTCNTNEASIHAVTITGYEDVPANNELALQKAVAN 183

Query: 259 QPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGET 318
           QP+SV +DASG  F FY SGV    CG   DHGV  VG+G   +  G KYWL+KNSWG  
Sbjct: 184 QPISVAIDASGSDFQFYNSGVFTGSCGTELDHGVTAVGYGVGND--GTKYWLVKNSWGAD 241

Query: 319 WGESGYIRILR--DA--GLCGIATAASYPVA 345
           WGE GYIR+ R  DA  GLCGIA  ASYP A
Sbjct: 242 WGEEGYIRMQRGIDAAEGLCGIAMQASYPTA 272


>gi|3980198|emb|CAA46863.1| thiolprotease [Pisum sativum]
          Length = 464

 Score =  311 bits (797), Expect = 3e-82,   Method: Compositional matrix adjust.
 Identities = 164/363 (45%), Positives = 224/363 (61%), Gaps = 30/363 (8%)

Query: 1   MVLKFEKSFIIPMFVIIILVITCASQVVSGRSMH--------EPSIVEKHEQWMAQHGRT 52
           M+ K    FI   F + + +  C   ++S    H           ++  +E+W+ +HG+ 
Sbjct: 1   MLSKLTILFITLTFTLSLALDMC---IISYDKTHPDKSTPRTNDQVLTMYEEWLVKHGKN 57

Query: 53  YKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGY----NR 108
           Y    EK  R  IFK NL +I++ N + N +++LG N F+DLTNEE+R  + G     NR
Sbjct: 58  YNALGEKEKRFEIFKDNLGFIDEHNSK-NLSFRLGLNRFADLTNEEYRTRFLGTRINPNR 116

Query: 109 PVPSVSRQSSRPSTFKYQNVTD-VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGI 167
               V+ Q++R +T     V D +P S+DWR++GAV  +KDQG CGSCWAFSA+AAVEG+
Sbjct: 117 RNRKVNSQTNRYAT----RVGDKLPESVDWRKEGAVVGVKDQGSCGSCWAFSAIAAVEGV 172

Query: 168 TQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTC 226
            ++  G LI LSEQ+LVDC T  N GC+GGLMD AFE+II    L  E DYPYR  +G C
Sbjct: 173 NKLATGDLISLSEQELVDCDTSYNEGCNGGLMDYAFEFIINMVALTPEEDYPYRAIDGRC 232

Query: 227 DNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGN 286
           D  ++ A   +I +YED+P  DE AL +AV+NQ ++V V+  GR F  Y SGV    CG 
Sbjct: 233 DQNRKNAKVVSIDQYEDVPAYDEGALKKAVANQVIAVAVEGGGREFQLYDSGVFTGRCGT 292

Query: 287 NCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD-----AGLCGIATAAS 341
             DHGVA VG+GT   ENG  YW+++NSWG +WGE+GYIR+ R+     +G CGIA   S
Sbjct: 293 ALDHGVAAVGYGT---ENGKDYWIVRNSWGGSWGEAGYIRLERNLATSKSGKCGIAIEPS 349

Query: 342 YPV 344
           YP+
Sbjct: 350 YPI 352


>gi|115441717|ref|NP_001045138.1| Os01g0907600 [Oryza sativa Japonica Group]
 gi|5761329|dbj|BAA83473.1| cysteine endopeptidase [Oryza sativa]
 gi|20804884|dbj|BAB92565.1| cysteine endopeptidase [Oryza sativa Japonica Group]
 gi|56785107|dbj|BAD82745.1| putative cysteine proteinase [Oryza sativa Japonica Group]
 gi|113534669|dbj|BAF07052.1| Os01g0907600 [Oryza sativa Japonica Group]
 gi|119395242|gb|ABL74582.1| cysteine endopeptidase [Oryza sativa Japonica Group]
 gi|125528777|gb|EAY76891.1| hypothetical protein OsI_04850 [Oryza sativa Indica Group]
 gi|125573036|gb|EAZ14551.1| hypothetical protein OsJ_04473 [Oryza sativa Japonica Group]
          Length = 371

 Score =  311 bits (797), Expect = 3e-82,   Method: Compositional matrix adjust.
 Identities = 157/330 (47%), Positives = 208/330 (63%), Gaps = 11/330 (3%)

Query: 23  CASQVVSGRSMH-EPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGN 81
           CA+     R +  + ++ + +E+W   H    +   EK  R   FK N+ YI + NK G 
Sbjct: 26  CAAIPFDERDLESDEALWDLYERWQEHH-HVPRHHGEKHRRFGAFKDNVRYIHEHNKRGG 84

Query: 82  RTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRP-STFKYQNVTDVPTSIDWREK 140
           R Y+L  N F D+  EEFRA + G +         ++ P   F Y+ V D+P ++DWR K
Sbjct: 85  RGYRLRLNRFGDMGREEFRATFAGSHANDLRRDGLAAPPLPGFMYEGVRDLPRAVDWRRK 144

Query: 141 GAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST-DNHGCSGGLMD 199
           GAVT +KDQG+CGSCWAFS V +VEGI  I  G+L+ LSEQ+L+DC T DN GC GGLM+
Sbjct: 145 GAVTGVKDQGKCGSCWAFSTVVSVEGINAIRTGRLVSLSEQELIDCDTADNSGCQGGLME 204

Query: 200 KAFEYIIENKGLATEADYPYRHEEGTCDN-QKEKAVAATISKYEDLPKGDEQALLQAVSN 258
            AFEYI  + G+ TE+ YPYR   GTCD  +  +A    I  ++++P   E AL +AV+N
Sbjct: 205 NAFEYIKHSGGITTESAYPYRAANGTCDAVRARRAPLVVIDGHQNVPANSEAALAKAVAN 264

Query: 259 QPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGET 318
           QPVSV +DA  ++F FY  GV   DCG + DHGVAVVG+G  E  +G +YW++KNSWG  
Sbjct: 265 QPVSVAIDAGDQSFQFYSDGVFAGDCGTDLDHGVAVVGYG--ETNDGTEYWIVKNSWGTA 322

Query: 319 WGESGYIRILRDA----GLCGIATAASYPV 344
           WGE GYIR+ RD+    GLCGIA  ASYPV
Sbjct: 323 WGEGGYIRMQRDSGYDGGLCGIAMEASYPV 352


>gi|413942348|gb|AFW74997.1| Xylem cysteine proteinase 2 [Zea mays]
          Length = 391

 Score =  311 bits (797), Expect = 3e-82,   Method: Compositional matrix adjust.
 Identities = 163/317 (51%), Positives = 210/317 (66%), Gaps = 20/317 (6%)

Query: 38  IVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNE 97
           +V   E+W+A++ + Y    EK  R  +FK NL +I++AN++   +Y LG N F+DLT++
Sbjct: 82  LVRLFEEWVAKYRKAYGSFEEKLRRFEVFKDNLHHIDEANRKEVTSYWLGLNAFADLTHD 141

Query: 98  EFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDV----PTSIDWREKGAVTHIKDQGQCG 153
           EF+A Y G      S  R       F+Y  V D     P S+DWR+KGAVT +K+QGQCG
Sbjct: 142 EFKATYLGLLPKRTSGGR-------FRYGGVGDGGDEVPASVDWRKKGAVTEVKNQGQCG 194

Query: 154 SCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGLA 212
           SCWAFS VAAVEGI QI  G L  LSEQQLVDCSTD N+GCSGG+MD AF +I    GL 
Sbjct: 195 SCWAFSTVAAVEGINQIVTGNLTSLSEQQLVDCSTDGNNGCSGGVMDNAFSFIATGAGLR 254

Query: 213 TEADYPYRHEEGTCDNQ-KEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRA 271
           +E  YPY  EEG CD++ ++  V  TIS YED+P  DEQAL++A+++QPVSV ++ASGR 
Sbjct: 255 SEEAYPYLMEEGDCDDRARDGEVLVTISGYEDVPANDEQALVKALAHQPVSVAIEASGRH 314

Query: 272 FHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRDA 331
           F FY  GV +  CG+  DHGVA VG+G+++   G  Y ++KNSWG  WGE GYIR+ R  
Sbjct: 315 FQFYSGGVFDGPCGSELDHGVAAVGYGSSK---GQDYIIVKNSWGTHWGEKGYIRMKRGT 371

Query: 332 ----GLCGIATAASYPV 344
               GLCGI   ASYP 
Sbjct: 372 GKPEGLCGINKMASYPT 388


>gi|242066206|ref|XP_002454392.1| hypothetical protein SORBIDRAFT_04g029960 [Sorghum bicolor]
 gi|241934223|gb|EES07368.1| hypothetical protein SORBIDRAFT_04g029960 [Sorghum bicolor]
          Length = 356

 Score =  311 bits (796), Expect = 3e-82,   Method: Compositional matrix adjust.
 Identities = 156/348 (44%), Positives = 223/348 (64%), Gaps = 18/348 (5%)

Query: 11  IPMFVIIILVITCASQ-----VVSGRSMHEPSI----VEKHEQWMAQHGRTYKDELEKAM 61
           +P+ V+ +    C++       V G S  + ++    V   + W  +H + Y    EK  
Sbjct: 5   LPVLVLFLAFAACSASHHRDPSVVGYSQEDLALPNRLVNLFKSWSVKHRKIYVSPKEKLK 64

Query: 62  RLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPS 121
           R  IFKQNL +I + N++ N +Y LG N+F+D+T+EEF+A + G  + +  +  Q+  P+
Sbjct: 65  RYGIFKQNLMHIAETNRK-NGSYWLGLNQFADITHEEFKANHLGLKQGLSRMGAQTRTPT 123

Query: 122 TFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQ 181
           TF+Y    ++P S+DWR KGAVT +K+QG+CGSCWAFS+VAAVEGI QI  GKL+ LSEQ
Sbjct: 124 TFRYAAAANLPWSVDWRYKGAVTPVKNQGKCGSCWAFSSVAAVEGINQIVTGKLVSLSEQ 183

Query: 182 QLVDCSTD-NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISK 240
           +L+DC T  +HGC GGLMD AF YI+ ++G+  E DYPY  EEG C  ++  A   TI+ 
Sbjct: 184 ELMDCDTMLDHGCEGGLMDFAFAYIMGSQGIHAEDDYPYLMEEGYCKEKQPYANVVTITG 243

Query: 241 YEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTA 300
           YED+P+  E +LL+A+++QPVSV + A  R F FYK GV +  C +  DH +  VG+G++
Sbjct: 244 YEDVPENSEISLLKALAHQPVSVGIAAGSRDFQFYKGGVFDGSCSDELDHALTAVGYGSS 303

Query: 301 EEENGAKYWLIKNSWGETWGESGYIRIL----RDAGLCGIATAASYPV 344
             +N   Y  +KNSWG+ WGE GY+RI     +  G+CGI T ASYPV
Sbjct: 304 YGQN---YITMKNSWGKNWGEQGYVRIKMGTGKPEGVCGIYTMASYPV 348


>gi|118127|sp|P25251.1|CYSP4_BRANA RecName: Full=Cysteine proteinase COT44; Flags: Precursor
          Length = 328

 Score =  311 bits (796), Expect = 4e-82,   Method: Compositional matrix adjust.
 Identities = 157/315 (49%), Positives = 207/315 (65%), Gaps = 19/315 (6%)

Query: 44  QWMAQHGRTYKDEL----EKAMRLNIFKQNLEYIEKANKEG-NRTYKLGTNEFSDLTNEE 98
           +W  +HG++  +      ++  R NIFK NL +I+  N+   N TYKLG   F++LTN+E
Sbjct: 6   RWSLEHGKSNSNSNGIINQQDERFNIFKDNLRFIDLHNENNKNATYKLGLTIFANLTNDE 65

Query: 99  FRALYTG-YNRPVPSVSRQSSRPSTFKYQ---NVTDVPTSIDWREKGAVTHIKDQGQCGS 154
           +R+LY G    PV  +++  ++    KY    NV +VP ++DWR+KGAV  IKDQG CGS
Sbjct: 66  YRSLYLGARTEPVRRITK--AKNVNMKYSAAVNVDEVPVTVDWRQKGAVNAIKDQGTCGS 123

Query: 155 CWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGLAT 213
           CWAFS  AAVEGI +I  G+L+ LSEQ+LVDC    N GC+GGLMD AF++I++N GL T
Sbjct: 124 CWAFSTAAAVEGINKIVTGELVSLSEQELVDCDKSYNQGCNGGLMDYAFQFIMKNGGLNT 183

Query: 214 EADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFH 273
           E DYPY    G C++  + +   TI  YED+P  DE AL +AVS QPVSV +DA GRAF 
Sbjct: 184 EKDYPYHGTNGKCNSLLKNSRVVTIDGYEDVPSKDETALKRAVSYQPVSVAIDAGGRAFQ 243

Query: 274 FYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD--- 330
            Y+SG+    CG N DH V  VG+G+   ENG  YW+++NSWG  WGE GYIR+ R+   
Sbjct: 244 HYQSGIFTGKCGTNMDHAVVAVGYGS---ENGVDYWIVRNSWGTRWGEDGYIRMERNVAS 300

Query: 331 -AGLCGIATAASYPV 344
            +G CGIA  ASYPV
Sbjct: 301 KSGKCGIAIEASYPV 315


>gi|121308860|dbj|BAF43527.1| cysteine proteinase [Zinnia elegans]
          Length = 352

 Score =  310 bits (795), Expect = 4e-82,   Method: Compositional matrix adjust.
 Identities = 152/318 (47%), Positives = 212/318 (66%), Gaps = 14/318 (4%)

Query: 32  SMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEF 91
           S+H+  ++   E W+ +H + Y+   EK  R  IF  NL++I++ NK+ +  Y LG NEF
Sbjct: 41  SIHK--VIHLFESWLVKHSKFYESLDEKLHRFEIFMDNLKHIDETNKKVS-NYWLGLNEF 97

Query: 92  SDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQ 151
           +DLT+EEF+  + G+   +     +SS+   F Y++  D+P S+DWR+KGAV  +K+QGQ
Sbjct: 98  ADLTHEEFKHKFLGFKGELAERKDESSK--EFGYRDFVDLPKSVDWRKKGAVAPVKNQGQ 155

Query: 152 CGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKG 210
           CG+CWAFS VAAVEGI QI  G L  LSEQ+L+DC T  N+GC+GGLMD AF Y++ + G
Sbjct: 156 CGNCWAFSTVAAVEGINQIVTGNLTMLSEQELIDCDTTFNNGCNGGLMDYAFAYVMRS-G 214

Query: 211 LATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGR 270
           L  E +YPY   EGTCD +K+ +   TIS Y D+P+ DE + L+A++NQP+SV ++ASGR
Sbjct: 215 LHKEEEYPYIMSEGTCDEKKDVSEKVTISGYHDVPRNDEASFLKALANQPISVAIEASGR 274

Query: 271 AFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD 330
            F FY  GV +  CG   DHGVA VG+GT +   G  Y +++NSWG  WGE GYIR+ R 
Sbjct: 275 DFQFYSGGVFDGHCGTELDHGVAAVGYGTTK---GLDYVIVRNSWGPKWGEKGYIRMKRG 331

Query: 331 A----GLCGIATAASYPV 344
           +    G+CG+   ASYP 
Sbjct: 332 SGKPHGMCGLYMMASYPT 349


>gi|242092704|ref|XP_002436842.1| hypothetical protein SORBIDRAFT_10g009850 [Sorghum bicolor]
 gi|241915065|gb|EER88209.1| hypothetical protein SORBIDRAFT_10g009850 [Sorghum bicolor]
          Length = 296

 Score =  310 bits (795), Expect = 5e-82,   Method: Compositional matrix adjust.
 Identities = 152/316 (48%), Positives = 203/316 (64%), Gaps = 30/316 (9%)

Query: 38  IVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNE 97
           +V +HEQWM Q+ R YKD  EKA R  +FK N+++IE  N  GNR + LG N+F+DLTN+
Sbjct: 1   MVARHEQWMVQYSRVYKDATEKAQRFEVFKSNVKFIESFNAGGNRKFWLGVNQFADLTND 60

Query: 98  EFRALYTGYN-RPVPSVSRQSSRPSTFKYQNVT--DVPTSIDWREKGAVTHIKDQGQCGS 154
           EFRA  T    +P P        P+ F+Y+N++   +P +IDWR KGAVT IKDQGQC  
Sbjct: 61  EFRATKTNKGFKPSP-----VKVPTGFRYENISVDALPATIDWRTKGAVTPIKDQGQC-- 113

Query: 155 CWAFSAVAAVEGITQITRGKLIELSEQQLVDCST--DNHGCSGGLMDKAFEYIIENKGLA 212
                     EGI +I+ GKLI LSEQ+LVDC    ++ GC GGLMD AF++II+  GL 
Sbjct: 114 ----------EGIVKISTGKLISLSEQELVDCDVHGEDQGCEGGLMDDAFKFIIKKGGLT 163

Query: 213 TEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAF 272
           TE+ YPY   +G C +       AT+  +ED+P  DE +L++AV+NQPVSV VD     F
Sbjct: 164 TESSYPYTAADGKCKSGSNS--VATVKGFEDVPANDEASLMKAVANQPVSVAVDGGDMTF 221

Query: 273 HFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRDA- 331
            FY  GV+   CG + DHG+A +G+G  +  +G KYWL+KNSWG TWGE+GY+R+ +D  
Sbjct: 222 QFYSGGVMTGSCGTDLDHGIAAIGYG--QTSDGTKYWLLKNSWGTTWGENGYLRMEKDIS 279

Query: 332 ---GLCGIATAASYPV 344
              G+CG+A   SYP 
Sbjct: 280 DKRGMCGLAMEPSYPT 295


>gi|109939734|sp|P25776.2|ORYA_ORYSJ RecName: Full=Oryzain alpha chain; Flags: Precursor
 gi|78192122|gb|ABB30151.1| oryzain alpha [Oryza sativa Japonica Group]
          Length = 458

 Score =  310 bits (795), Expect = 5e-82,   Method: Compositional matrix adjust.
 Identities = 158/327 (48%), Positives = 208/327 (63%), Gaps = 16/327 (4%)

Query: 27  VVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKAN---KEGNRT 83
           +VS     E      + +W A+HG++Y    E+  R   F+ NL YI++ N     G  +
Sbjct: 25  IVSYGERSEEEARRLYAEWKAEHGKSYNAVGEEERRYAAFRDNLRYIDEHNAAADAGVHS 84

Query: 84  YKLGTNEFSDLTNEEFRALYTGY-NRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGA 142
           ++LG N F+DLTNEE+R  Y G  N+P     R+      +   +   +P S+DWR KGA
Sbjct: 85  FRLGLNRFADLTNEEYRDTYLGLRNKP----RRERKVSDRYLAADNEALPESVDWRTKGA 140

Query: 143 VTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKA 201
           V  IKDQG CGSCWAFSA+AAVEGI QI  G LI LSEQ+LVDC T  N GC+GGLMD A
Sbjct: 141 VAEIKDQGGCGSCWAFSAIAAVEGINQIVTGDLISLSEQELVDCDTSYNEGCNGGLMDYA 200

Query: 202 FEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPV 261
           F++II N G+ TE DYPY+ ++  CD  ++ A   TI  YED+    E +L +AV+NQPV
Sbjct: 201 FDFIINNGGIDTEDDYPYKGKDERCDVNRKNAKVVTIDSYEDVTPNSETSLQKAVANQPV 260

Query: 262 SVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGE 321
           SV ++A GRAF  Y SG+    CG   DHGVA VG+GT   ENG  YW+++NSWG++WGE
Sbjct: 261 SVAIEAGGRAFQLYSSGIFTGKCGTALDHGVAAVGYGT---ENGKDYWIVRNSWGKSWGE 317

Query: 322 SGYIRILRD----AGLCGIATAASYPV 344
           SGY+R+ R+    +G CGIA   SYP+
Sbjct: 318 SGYVRMERNIKASSGKCGIAVEPSYPL 344


>gi|242055323|ref|XP_002456807.1| hypothetical protein SORBIDRAFT_03g043220 [Sorghum bicolor]
 gi|241928782|gb|EES01927.1| hypothetical protein SORBIDRAFT_03g043220 [Sorghum bicolor]
          Length = 369

 Score =  310 bits (795), Expect = 6e-82,   Method: Compositional matrix adjust.
 Identities = 154/352 (43%), Positives = 216/352 (61%), Gaps = 14/352 (3%)

Query: 2   VLKFEKSFIIPMFVIIILVITCASQVVSGRSM-HEPSIVEKHEQWMAQHGRTYKDELEKA 60
           + +  K+ ++   V +  V  C +     R +  + ++ + +E+W   H        EK 
Sbjct: 1   MAQLAKTLLLVALVAMSAVELCRAIEFDERDLASDEALWDLYERWQTHHHVHRHHG-EKG 59

Query: 61  MRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRP 120
            R   FK+N+ +I   NK G+R Y+L  N F D+  EEFR+ +   +  +  + R  S  
Sbjct: 60  RRFGTFKENVRFIHAHNKRGDRPYRLSLNRFGDMGREEFRSTFA--DSRINDLRRAESPA 117

Query: 121 ST----FKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLI 176
           +     F Y  VTD+P S+DWR++GAVT +KDQG CGSCWAFS V +VEGI  I  G L+
Sbjct: 118 APAVPGFMYDGVTDLPPSVDWRKEGAVTAVKDQGHCGSCWAFSTVVSVEGINAIRTGSLV 177

Query: 177 ELSEQQLVDCSTDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDN-QKEKAVA 235
            LSEQ+L+DC TD +GC GGLM+ AFE+I    G+ TE+ YPYR   GTCD+ +  +   
Sbjct: 178 SLSEQELIDCDTDENGCQGGLMENAFEFIKSYGGVTTESAYPYRASNGTCDSVRSRRGQI 237

Query: 236 ATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVV 295
            +I  ++ +P G E AL +AV+NQPVSV +DA G+AF FY  GV   DCG + DHGVA V
Sbjct: 238 VSIDGHQMVPTGSEDALAKAVANQPVSVAIDAGGQAFQFYSEGVFTGDCGTDLDHGVAAV 297

Query: 296 GFGTAEEENGAKYWLIKNSWGETWGESGYIRILRDA---GLCGIATAASYPV 344
           G+G +++  G  YW++KNSWG +WGE GYIR+ R A   GLCGIA  AS+P+
Sbjct: 298 GYGVSDD--GTAYWIVKNSWGPSWGEGGYIRMQRGAGNGGLCGIAMEASFPI 347


>gi|414870137|tpg|DAA48694.1| TPA: vignain [Zea mays]
          Length = 484

 Score =  310 bits (794), Expect = 6e-82,   Method: Compositional matrix adjust.
 Identities = 159/313 (50%), Positives = 198/313 (63%), Gaps = 16/313 (5%)

Query: 42  HEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRA 101
           +E+W  +H    +D  +KA R N+FK N+  I + N+  +  YKL  N F D+T +EFR 
Sbjct: 156 YERWRGRHALA-RDLGDKARRFNVFKANVRLIHEFNRR-DEPYKLRLNRFGDMTADEFRR 213

Query: 102 LYTGYNRPVPSVSR-----QSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCW 156
            Y G       + R      S+  S+F Y +  DVP S+DWR+KGAVT +KDQGQCGSCW
Sbjct: 214 HYAGSRVAHHRMFRGDRQGSSASASSFMYADARDVPASVDWRQKGAVTDVKDQGQCGSCW 273

Query: 157 AFSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGLATEA 215
           AFS +AAVEGI  I    L  LSEQQLVDC T  N GC+GGLMD AF+YI ++ G+A E 
Sbjct: 274 AFSTIAAVEGINAIKTKNLTSLSEQQLVDCDTKANAGCNGGLMDYAFQYIAKHGGVAAED 333

Query: 216 DYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFY 275
            YPYR  + +C  +K  A   TI  YED+P  DE AL +AV++QPVSV ++ASG  F FY
Sbjct: 334 AYPYRARQASC--KKSPAPVVTIDGYEDVPANDESALKKAVAHQPVSVAIEASGSHFQFY 391

Query: 276 KSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRDA---- 331
             GV +  CG   DHGVA VG+G     +G KYWL+KNSWG  WGE GYIR+ RD     
Sbjct: 392 SEGVFSGRCGTELDHGVAAVGYGVT--ADGTKYWLVKNSWGPEWGEKGYIRMARDVAAKE 449

Query: 332 GLCGIATAASYPV 344
           G CGIA  ASYPV
Sbjct: 450 GHCGIAMEASYPV 462


>gi|222629675|gb|EEE61807.1| hypothetical protein OsJ_16426 [Oryza sativa Japonica Group]
          Length = 459

 Score =  310 bits (794), Expect = 6e-82,   Method: Compositional matrix adjust.
 Identities = 158/327 (48%), Positives = 208/327 (63%), Gaps = 16/327 (4%)

Query: 27  VVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKAN---KEGNRT 83
           +VS     E      + +W A+HG++Y    E+  R   F+ NL YI++ N     G  +
Sbjct: 26  IVSYGERSEEEARRLYAEWKAEHGKSYNAVGEEERRYAAFRDNLRYIDEHNAAADAGVHS 85

Query: 84  YKLGTNEFSDLTNEEFRALYTGY-NRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGA 142
           ++LG N F+DLTNEE+R  Y G  N+P     R+      +   +   +P S+DWR KGA
Sbjct: 86  FRLGLNRFADLTNEEYRDTYLGLRNKP----RRERKVSDRYLAADNEALPESVDWRTKGA 141

Query: 143 VTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKA 201
           V  IKDQG CGSCWAFSA+AAVEGI QI  G LI LSEQ+LVDC T  N GC+GGLMD A
Sbjct: 142 VAEIKDQGGCGSCWAFSAIAAVEGINQIVTGDLISLSEQELVDCDTSYNEGCNGGLMDYA 201

Query: 202 FEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPV 261
           F++II N G+ TE DYPY+ ++  CD  ++ A   TI  YED+    E +L +AV+NQPV
Sbjct: 202 FDFIINNGGIDTEDDYPYKGKDERCDVNRKNAKVVTIDSYEDVTPNSETSLQKAVANQPV 261

Query: 262 SVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGE 321
           SV ++A GRAF  Y SG+    CG   DHGVA VG+GT   ENG  YW+++NSWG++WGE
Sbjct: 262 SVAIEAGGRAFQLYSSGIFTGKCGTALDHGVAAVGYGT---ENGKDYWIVRNSWGKSWGE 318

Query: 322 SGYIRILRD----AGLCGIATAASYPV 344
           SGY+R+ R+    +G CGIA   SYP+
Sbjct: 319 SGYVRMERNIKASSGKCGIAVEPSYPL 345


>gi|2224812|emb|CAB09699.1| cysteine endopeptidase EP-A [Hordeum vulgare subsp. vulgare]
          Length = 365

 Score =  310 bits (794), Expect = 7e-82,   Method: Compositional matrix adjust.
 Identities = 158/320 (49%), Positives = 212/320 (66%), Gaps = 16/320 (5%)

Query: 35  EPSIVEKHEQWMAQHGRTYK----DELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNE 90
           E S+   +E+W + +  + +    D  E+  R N+FKQN  Y+ + NK  +  ++L  N+
Sbjct: 34  EESLRGLYERWRSHYTVSRRGLGADAEER--RFNVFKQNARYVHEGNKR-DMPFRLALNK 90

Query: 91  FSDLTNEEFRALYTGYN-RPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQ 149
           F+D+T +EFR  Y G   R   S+S        F+Y +  ++P ++DWR+KGAVT IKDQ
Sbjct: 91  FADMTTDEFRRTYAGSRVRHHLSLSGGRRGDGGFRYGDADNLPPAVDWRQKGAVTAIKDQ 150

Query: 150 GQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDC-STDNHGCSGGLMDKAFEYIIEN 208
           GQCGSCWAFS + AVEGI +I  GKL+ LSEQ+L+DC + +N GC GGLMD AF++I +N
Sbjct: 151 GQCGSCWAFSTIVAVEGINKIRTGKLVSLSEQELMDCDNVNNQGCDGGLMDYAFQFIQKN 210

Query: 209 KGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDAS 268
            G+ TE++YPY+ E+G+CD  KE A A TI  YED+P  DE AL +AV+ QPVSV +DAS
Sbjct: 211 -GITTESNYPYQGEQGSCDQAKENAQAVTIDGYEDVPANDESALQKAVAGQPVSVAIDAS 269

Query: 269 GRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRIL 328
           G+ F FY  GV   +C  + DHGVA VG+G   +  G KYW++KNSWGE WGE GYIR+ 
Sbjct: 270 GQDFQFYSEGVFTGECSTDLDHGVAAVGYGATRD--GTKYWIVKNSWGEDWGEKGYIRMQ 327

Query: 329 RDA----GLCGIATAASYPV 344
           R      GLCGIA  ASYP 
Sbjct: 328 RGVSQTEGLCGIAMQASYPT 347


>gi|4100157|gb|AAD10337.1| cysteine proteinase precursor [Hordeum vulgare]
          Length = 365

 Score =  310 bits (794), Expect = 7e-82,   Method: Compositional matrix adjust.
 Identities = 158/320 (49%), Positives = 212/320 (66%), Gaps = 16/320 (5%)

Query: 35  EPSIVEKHEQWMAQHGRTYK----DELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNE 90
           E S+   +E+W + +  + +    D  E+  R N+FKQN  Y+ + NK  +  ++L  N+
Sbjct: 34  EESLRGLYERWRSHYTVSRRGLGADAGER--RFNVFKQNARYVHEGNKR-DMPFRLALNK 90

Query: 91  FSDLTNEEFRALYTGYN-RPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQ 149
           F+D+T +EFR  Y G   R   S+S        F+Y +  ++P ++DWR+KGAVT IKDQ
Sbjct: 91  FADMTTDEFRRTYAGSRVRHHLSLSGGRRGDGGFRYGDADNLPPAVDWRQKGAVTAIKDQ 150

Query: 150 GQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDC-STDNHGCSGGLMDKAFEYIIEN 208
           GQCGSCWAFS + AVEGI +I  GKL+ LSEQ+L+DC + +N GC GGLMD AF++I +N
Sbjct: 151 GQCGSCWAFSTIVAVEGINKIRTGKLVSLSEQELMDCDNVNNQGCDGGLMDYAFQFIQKN 210

Query: 209 KGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDAS 268
            G+ TE++YPY+ E+G+CD  KE A A TI  YED+P  DE AL +AV+ QPVSV +DAS
Sbjct: 211 -GITTESNYPYQGEQGSCDQAKENAQAVTIDGYEDVPANDESALQKAVAGQPVSVAIDAS 269

Query: 269 GRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRIL 328
           G+ F FY  GV   +C  + DHGVA VG+G   +  G KYW++KNSWGE WGE GYIR+ 
Sbjct: 270 GQDFQFYSEGVFTGECSTDLDHGVAAVGYGATRD--GTKYWIVKNSWGEDWGEKGYIRMQ 327

Query: 329 RDA----GLCGIATAASYPV 344
           R      GLCGIA  ASYP 
Sbjct: 328 RGVSQTEGLCGIAMQASYPT 347


>gi|3451077|emb|CAA20473.1| cysteine proteinase-like protein [Arabidopsis thaliana]
 gi|7269200|emb|CAB79307.1| cysteine proteinase-like protein [Arabidopsis thaliana]
          Length = 355

 Score =  310 bits (794), Expect = 7e-82,   Method: Compositional matrix adjust.
 Identities = 154/348 (44%), Positives = 226/348 (64%), Gaps = 27/348 (7%)

Query: 13  MFVIIILVITCASQVVS--------GRSMHEPSIVEKHEQWMAQHGRTYKDEL-EKAMRL 63
           +F++I+ V++  S  +          RS  E   +   + WM++HG+TY + L EK  R 
Sbjct: 12  LFLLIVFVLSAPSSAMDLPATSGGHNRSNEEVEFI--FQMWMSKHGKTYTNALGEKERRF 69

Query: 64  NIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTF 123
             FK NL +I++ N + N +Y+LG   F+DLT +E+R L+ G  +P     +Q +  ++ 
Sbjct: 70  QNFKDNLRFIDQHNAK-NLSYQLGLTRFADLTVQEYRDLFPGSPKP-----KQRNLKTSR 123

Query: 124 KYQNVT--DVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQ 181
           +Y  +    +P S+DWR++GAV+ IKDQG C SCWAFS VAAVEG+ +I  G+LI LSEQ
Sbjct: 124 RYVPLAGDQLPESVDWRQEGAVSEIKDQGTCNSCWAFSTVAAVEGLNKIVTGELISLSEQ 183

Query: 182 QLVDCSTDNHGCSG-GLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISK 240
           +LVDC+  N+GC G GLMD AF+++I N GL +E DYPY+  +G+C+ ++   +  TI  
Sbjct: 184 ELVDCNLVNNGCYGSGLMDTAFQFLINNNGLDSEKDYPYQGTQGSCNRKQVHLLVITIDS 243

Query: 241 YEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTA 300
           YED+P  DE +L +AV++QPVSV VD   + F  Y+S + N  CG N DH + +VG+G+ 
Sbjct: 244 YEDVPANDEISLQKAVAHQPVSVGVDKKSQEFMLYRSCIYNGPCGTNLDHALVIVGYGS- 302

Query: 301 EEENGAKYWLIKNSWGETWGESGYIRILRD----AGLCGIATAASYPV 344
             ENG  YW+++NSWG TWG++GYI+I R+     GLCGIA  ASYP+
Sbjct: 303 --ENGQDYWIVRNSWGTTWGDAGYIKIARNFEDPKGLCGIAMLASYPI 348


>gi|297802228|ref|XP_002868998.1| cysteine proteinase [Arabidopsis lyrata subsp. lyrata]
 gi|297314834|gb|EFH45257.1| cysteine proteinase [Arabidopsis lyrata subsp. lyrata]
          Length = 375

 Score =  310 bits (794), Expect = 7e-82,   Method: Compositional matrix adjust.
 Identities = 160/316 (50%), Positives = 206/316 (65%), Gaps = 20/316 (6%)

Query: 44  QWMAQHGRTYKDEL----EKAMRLNIFKQNLEYIEKAN-KEGNRTYKLGTNEFSDLTNEE 98
           QW A HG+T  +      ++  R NIFK NL +I+  N K  N TYKLG  +F+DLTNEE
Sbjct: 51  QWSADHGKTNNNNNGIINDQDKRFNIFKDNLRFIDLHNEKNKNATYKLGLTKFTDLTNEE 110

Query: 99  FRALYTG-YNRPVPSVSRQSSRPSTFKYQNVTD---VPTSIDWREKGAVTHIKDQGQCGS 154
           +R+LY G    PV  +++  ++    KY    D   VP ++DWR KGAV  IKDQG CGS
Sbjct: 111 YRSLYLGARTEPVRRIAK--AKNVNQKYSAAVDGKEVPETVDWRLKGAVNPIKDQGTCGS 168

Query: 155 CWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGLAT 213
           CWAFS  AAVEGI +I  G+LI LSEQ+LVDC    N GC+GGLMD AF++I++N GL T
Sbjct: 169 CWAFSTAAAVEGINKIVTGELISLSEQELVDCDNSYNQGCNGGLMDYAFQFIMKNGGLKT 228

Query: 214 EADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFH 273
           E DYPYR   G C++  + A   +I  YED+P  DE AL +A+S QPVSV ++A GR F 
Sbjct: 229 EKDYPYRGFGGKCNSFLKNAKVVSIDGYEDVPTKDETALKRAISLQPVSVAIEAGGRIFQ 288

Query: 274 FYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD--- 330
            Y++G+   +CG N DH V  VG+G+   ENG  YW+++NSWG  WGE GYIR+ R+   
Sbjct: 289 HYQTGIFTGNCGTNLDHAVVAVGYGS---ENGVDYWIVRNSWGPRWGEEGYIRMERNLAS 345

Query: 331 --AGLCGIATAASYPV 344
             +G CGIA  ASYPV
Sbjct: 346 SKSGKCGIAVEASYPV 361


>gi|255646088|gb|ACU23531.1| unknown [Glycine max]
          Length = 362

 Score =  310 bits (794), Expect = 7e-82,   Method: Compositional matrix adjust.
 Identities = 158/320 (49%), Positives = 206/320 (64%), Gaps = 18/320 (5%)

Query: 35  EPSIVEKHEQWMAQHGRTYKDEL-EKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSD 93
           E S  + +E+W +   RT    L +K  R N+FK N+ ++   NK  ++ YKL  N+F+D
Sbjct: 33  EESFWDLYERWRSY--RTVSRSLGDKHKRFNVFKANVMHVHNTNKM-DKPYKLKLNKFAD 89

Query: 94  LTNEEFRALYTG----YNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQ 149
           +TN EFR+ Y G    ++R      R +    TF Y+ V  VP S DWR+ GAVT +KDQ
Sbjct: 90  MTNHEFRSTYAGSKVNHHRMFQGTPRGNG---TFMYEKVGSVPPSADWRKNGAVTGVKDQ 146

Query: 150 GQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIEN 208
           GQCGSCWAFS V AVEGI QI   KL+ LSEQ+LVDC T  N GC+GGLM+ AFE+I + 
Sbjct: 147 GQCGSCWAFSTVVAVEGINQIKTNKLVSLSEQELVDCDTKKNAGCNGGLMESAFEFIKQK 206

Query: 209 KGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDAS 268
            G+ TE++YPY  ++GTCD  K   +A +I  +E++P  DE ALL+AV+NQPVSV +DA 
Sbjct: 207 GGITTESNYPYTAQDGTCDASKANDLAVSIDGHENVPANDENALLKAVANQPVSVAIDAG 266

Query: 269 GRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIR-- 326
           G  F FY  GV   DC    +HGVA+VG+GT  +  G  YW ++NSWG  WGE GYIR  
Sbjct: 267 GFDFQFYFEGVFTGDCSTELNHGVAIVGYGTTVD--GTNYWTVRNSWGPEWGEQGYIRMQ 324

Query: 327 --ILRDAGLCGIATAASYPV 344
             I +  GLCGIA  ASYP+
Sbjct: 325 RSIFKKEGLCGIAMMASYPI 344


>gi|226504984|ref|NP_001151293.1| cysteine protease 1 precursor [Zea mays]
 gi|195645596|gb|ACG42266.1| cysteine protease 1 precursor [Zea mays]
          Length = 340

 Score =  310 bits (794), Expect = 7e-82,   Method: Compositional matrix adjust.
 Identities = 162/342 (47%), Positives = 222/342 (64%), Gaps = 18/342 (5%)

Query: 14  FVIIILVI----TCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQN 69
           F++ +LV+     C +      +    ++  +HE+WMA+HGR YKDE EKA RL +F+ N
Sbjct: 6   FLLAVLVVGSAVLCTAAAPRALAAAAAAMASRHEKWMAEHGRAYKDEAEKARRLEVFRAN 65

Query: 70  LEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQN-- 127
            E I+  N  G  +++L TN F+DLT +EFRA  TG  RP P+ S  + R   F+Y+N  
Sbjct: 66  AELIDSFNAAGTHSHRLATNRFADLTVQEFRAARTGL-RPRPAPSAGAGR---FRYENFS 121

Query: 128 VTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDC- 186
           + D   S+DWR  GAVT +KDQG  G CWAFSAVAAVEG+ +I  G+L+ LSEQ+LVDC 
Sbjct: 122 LADAAQSVDWRAMGAVTGVKDQGASGCCWAFSAVAAVEGLNKIRTGRLVSLSEQELVDCD 181

Query: 187 -STDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLP 245
            S  + GC GGLMD AF+++    GLA+E+ YPY+  +G C      A AA+I  +ED+P
Sbjct: 182 VSGVDQGCDGGLMDNAFQFVARRGGLASESGYPYQCRDGPC-RSSAAAAAASIRGHEDVP 240

Query: 246 KGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENG 305
           + +E AL  AV++QPVSV ++    AF FY SGVL   CG + +H +  VG+GTA +  G
Sbjct: 241 RNNEAALAAAVAHQPVSVAINGEDMAFRFYDSGVLGGACGTDLNHAITAVGYGTAAD--G 298

Query: 306 AKYWLIKNSWGETWGESGYIRI---LRDAGLCGIATAASYPV 344
            +YWL+KNSWG +WGE GY+RI   +R  G+CG+A   SYPV
Sbjct: 299 TRYWLMKNSWGASWGEGGYVRIRRGVRGEGVCGLAKLPSYPV 340


>gi|449525012|ref|XP_004169515.1| PREDICTED: cysteine proteinase RD21a-like [Cucumis sativus]
          Length = 459

 Score =  310 bits (793), Expect = 7e-82,   Method: Compositional matrix adjust.
 Identities = 161/348 (46%), Positives = 223/348 (64%), Gaps = 16/348 (4%)

Query: 5   FEKSFIIPMFVIIILVITCAS--QVVSGRSMHEPSIVEKHEQWMAQHGRTYKD-ELEKAM 61
           F+ S I+ +   + + ++ AS   ++  R+  E  ++  ++QW A+HG+ + +   E   
Sbjct: 4   FQSSPIMALLFFLFIALSAASPSSIIPQRTDDE--VMALYDQWRAKHGKLHNNLGAEPEN 61

Query: 62  RLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPS 121
           R +IFK NL++I++ N + N  Y+LG N F+DLTNEE+R+ Y G      S SR++   +
Sbjct: 62  RFHIFKDNLKFIDEINAQ-NLPYRLGLNVFADLTNEEYRSRYLG--GKFASGSRRNRTSN 118

Query: 122 TFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQ 181
            +  +   D+P SIDWR KGAV  +KDQG CGSCWAFS VA+VE I QI  G LI LSEQ
Sbjct: 119 RYLPRLGDDLPDSIDWRAKGAVAPVKDQGSCGSCWAFSTVASVEAINQIVTGDLIALSEQ 178

Query: 182 QLVDCSTD-NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISK 240
           +LVDC    N GC+GGLMD AFE+IIEN GL TE DYPY   + +C   K+ A    I  
Sbjct: 179 ELVDCDRSYNEGCNGGLMDYAFEFIIENGGLDTEEDYPYYGFDSSCIQYKKNAKVVAIDS 238

Query: 241 YEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTA 300
           YED+P  +E+AL +AVS Q VSV ++  GR+F  Y+SG+    CG + DHGV VVG+G+ 
Sbjct: 239 YEDVPVNNEKALQKAVSKQVVSVAIEGGGRSFQLYQSGIFTGRCGTDLDHGVNVVGYGS- 297

Query: 301 EEENGAKYWLIKNSWGETWGESGYIRILRD----AGLCGIATAASYPV 344
             E G  YW+++NSWG +WGESGY+++ R+     GLCGIA   SYP 
Sbjct: 298 --EGGVDYWIVRNSWGGSWGESGYVKMQRNIASPTGLCGIAMEPSYPT 343


>gi|218195711|gb|EEC78138.1| hypothetical protein OsI_17694 [Oryza sativa Indica Group]
          Length = 458

 Score =  310 bits (793), Expect = 8e-82,   Method: Compositional matrix adjust.
 Identities = 158/327 (48%), Positives = 207/327 (63%), Gaps = 16/327 (4%)

Query: 27  VVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKAN---KEGNRT 83
           +VS     E      + +W A+HG+ Y    E+  R   F+ NL YI++ N     G  +
Sbjct: 25  IVSYGERSEEEARRLYAEWKAEHGKNYNAVGEEERRYAAFRDNLRYIDEHNAAADAGVHS 84

Query: 84  YKLGTNEFSDLTNEEFRALYTGY-NRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGA 142
           ++LG N F+DLTNEE+R  Y G  N+P     R+      +   +   +P S+DWR KGA
Sbjct: 85  FRLGLNRFADLTNEEYRDTYLGLRNKP----RRERKVSDRYLAADNEALPESVDWRTKGA 140

Query: 143 VTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKA 201
           V  IKDQG CGSCWAFSA+AAVEGI QI  G LI LSEQ+LVDC T  N GC+GGLMD A
Sbjct: 141 VAEIKDQGGCGSCWAFSAIAAVEGINQIVTGDLISLSEQELVDCDTSYNEGCNGGLMDYA 200

Query: 202 FEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPV 261
           F++II N G+ TE DYPY+ ++  CD  ++ A   TI  YED+    E +L +AV+NQPV
Sbjct: 201 FDFIINNGGIDTEDDYPYKGKDERCDVNRKNAKVVTIDSYEDVTPNSETSLQKAVANQPV 260

Query: 262 SVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGE 321
           SV ++A GRAF  Y SG+    CG   DHGVA VG+GT   ENG  YW+++NSWG++WGE
Sbjct: 261 SVAIEAGGRAFQLYSSGIFTGKCGTALDHGVAAVGYGT---ENGKDYWIVRNSWGKSWGE 317

Query: 322 SGYIRILRD----AGLCGIATAASYPV 344
           SGY+R+ R+    +G CGIA   SYP+
Sbjct: 318 SGYVRMERNIKASSGKCGIAVEPSYPL 344


>gi|242032709|ref|XP_002463749.1| hypothetical protein SORBIDRAFT_01g005350 [Sorghum bicolor]
 gi|241917603|gb|EER90747.1| hypothetical protein SORBIDRAFT_01g005350 [Sorghum bicolor]
          Length = 381

 Score =  310 bits (793), Expect = 9e-82,   Method: Compositional matrix adjust.
 Identities = 152/320 (47%), Positives = 206/320 (64%), Gaps = 12/320 (3%)

Query: 35  EPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNR-TYKLGTNEFSD 93
           + ++ + +E+W   H R ++   EK  R   FK+N+ +I   NK G+R +Y+L  N F D
Sbjct: 39  DEALWDLYERWQTHH-RVHRHHGEKGRRFGTFKENVRFIHAHNKRGDRPSYRLRLNRFGD 97

Query: 94  LTNEEFRALYTGYNRPVPSVSRQSSRPST----FKYQNVTDVPTSIDWREKGAVTHIKDQ 149
           +  EEFR+ +           R+SS  +T    F Y + TDVP S+DWR+ GAVT +K+Q
Sbjct: 98  MGPEEFRSTFADSRINDLRRYRESSPAATAVPGFMYDDATDVPRSVDWRQHGAVTAVKNQ 157

Query: 150 GQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTDNHGCSGGLMDKAFEYIIENK 209
           G+CGSCWAFS V AVEGI  I  G L+ LSEQ+LVDC T  +GC GGLM+ AF++I    
Sbjct: 158 GRCGSCWAFSTVVAVEGINAIRTGSLVSLSEQELVDCDTAENGCQGGLMENAFDFIKSYG 217

Query: 210 GLATEADYPYRHEEGTCDNQKEK--AVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDA 267
           G+ TE+ YPYR   GTCD  + +   V  +I  ++ +P G E AL +AV+ QPVSV +DA
Sbjct: 218 GITTESAYPYRASNGTCDGMRARRGRVHVSIDGHQMVPTGSEDALAKAVARQPVSVAIDA 277

Query: 268 SGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRI 327
            G+AF FY  GV   DCG + DHGVAVVG+G ++ + G  YW++KNSWG +WGE GYIR+
Sbjct: 278 GGQAFQFYSEGVFTGDCGTDLDHGVAVVGYGVSDVD-GTPYWIVKNSWGPSWGEGGYIRM 336

Query: 328 LRDA---GLCGIATAASYPV 344
            R A   GLCGIA  AS+P+
Sbjct: 337 QRGAGNGGLCGIAMEASFPI 356


>gi|226507950|ref|NP_001151278.1| LOC100284911 precursor [Zea mays]
 gi|195645488|gb|ACG42212.1| vignain precursor [Zea mays]
          Length = 376

 Score =  310 bits (793), Expect = 9e-82,   Method: Compositional matrix adjust.
 Identities = 159/319 (49%), Positives = 202/319 (63%), Gaps = 15/319 (4%)

Query: 35  EPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDL 94
           E ++   +E+W  +H    +D  +KA R N+FK N+  I + N+  +  YKL  N F D+
Sbjct: 42  EEALWALYERWRGRHALA-RDLGDKARRFNVFKANVRLIHEFNRR-DEPYKLRLNRFGDM 99

Query: 95  TNEEFRALYTG----YNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQG 150
           T +EFR  Y G    ++R      + SS  ++F Y +  DVP S+DWR+KGAVT +KDQG
Sbjct: 100 TADEFRRHYAGSRVAHHRMFRGDRQGSSASASFMYADARDVPASVDWRQKGAVTDVKDQG 159

Query: 151 QCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENK 209
           QCGSCWAFS +AAVEGI  I    L  LSEQQLVDC T  N GC+GGLMD AF+YI ++ 
Sbjct: 160 QCGSCWAFSTIAAVEGINAIKTKNLTSLSEQQLVDCDTKANAGCNGGLMDYAFQYIAKHG 219

Query: 210 GLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASG 269
           G+A E  YPYR  + +C  +K  A   TI  YED+P  DE AL +AV++QPVSV ++ASG
Sbjct: 220 GVAAEDAYPYRARQASC--KKSPAPVVTIDGYEDVPANDESALKKAVAHQPVSVAIEASG 277

Query: 270 RAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILR 329
             F FY  GV +  CG   DHGV  VG+G     +G KYWL+KNSWG  WGE GYIR+ R
Sbjct: 278 SHFQFYSEGVFSGRCGTELDHGVTAVGYGVT--ADGTKYWLVKNSWGPEWGEKGYIRMAR 335

Query: 330 DA----GLCGIATAASYPV 344
           D     G CGIA  ASYPV
Sbjct: 336 DVAAKEGHCGIAMEASYPV 354


>gi|46395939|sp|Q94B08.2|GCP1_ARATH RecName: Full=Germination-specific cysteine protease 1; Flags:
           Precursor
 gi|4006883|emb|CAB16767.1| cysteine proteinase [Arabidopsis thaliana]
 gi|7270637|emb|CAB80354.1| cysteine proteinase [Arabidopsis thaliana]
          Length = 376

 Score =  309 bits (792), Expect = 1e-81,   Method: Compositional matrix adjust.
 Identities = 158/315 (50%), Positives = 206/315 (65%), Gaps = 18/315 (5%)

Query: 44  QWMAQHGRTYKDEL----EKAMRLNIFKQNLEYIEKANKEG-NRTYKLGTNEFSDLTNEE 98
           QW A+HG+T  +      ++  R NIFK NL +I+  N++  N TYKLG  +F+DLTN+E
Sbjct: 51  QWSAEHGKTNNNNNGIINDQDKRFNIFKDNLRFIDLHNEDNKNATYKLGLTKFTDLTNDE 110

Query: 99  FRALYTGYNRPVPSVSRQSSRPSTFKYQ---NVTDVPTSIDWREKGAVTHIKDQGQCGSC 155
           +R LY G  R  P+     ++    KY    N  +VP ++DWR+KGAV  IKDQG CGSC
Sbjct: 111 YRKLYLGA-RTEPARRIAKAKNVNQKYSAAVNGKEVPETVDWRQKGAVNPIKDQGTCGSC 169

Query: 156 WAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGLATE 214
           WAFS  AAVEGI +I  G+LI LSEQ+LVDC    N GC+GGLMD AF++I++N GL TE
Sbjct: 170 WAFSTTAAVEGINKIVTGELISLSEQELVDCDKSYNQGCNGGLMDYAFQFIMKNGGLNTE 229

Query: 215 ADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHF 274
            DYPYR   G C++  + +   +I  YED+P  DE AL +A+S QPVSV ++A GR F  
Sbjct: 230 KDYPYRGFGGKCNSFLKNSRVVSIDGYEDVPTKDETALKKAISYQPVSVAIEAGGRIFQH 289

Query: 275 YKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD---- 330
           Y+SG+    CG N DH V  VG+G+   ENG  YW+++NSWG  WGE GYIR+ R+    
Sbjct: 290 YQSGIFTGSCGTNLDHAVVAVGYGS---ENGVDYWIVRNSWGPRWGEEGYIRMERNLAAS 346

Query: 331 -AGLCGIATAASYPV 344
            +G CGIA  ASYPV
Sbjct: 347 KSGKCGIAVEASYPV 361


>gi|414879123|tpg|DAA56254.1| TPA: hypothetical protein ZEAMMB73_708930 [Zea mays]
          Length = 368

 Score =  309 bits (792), Expect = 1e-81,   Method: Compositional matrix adjust.
 Identities = 146/315 (46%), Positives = 205/315 (65%), Gaps = 12/315 (3%)

Query: 37  SIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTN 96
           ++ + +E+W   H R ++   EK  R   FK+N  +I   NK G+R Y+L  N F D+  
Sbjct: 37  ALWDLYERWQTHH-RVHRHHGEKGRRFGTFKENARFIHAHNKRGDRPYRLRLNRFGDMGR 95

Query: 97  EEFRALYTGYNRPVPSVSRQ-SSRPST--FKYQNVTDVPTSIDWREKGAVTHIKDQGQCG 153
           EEFR+ +   +  +  + R+ ++ P+   F Y + TD+P S+DWR+KGAVT +K+QG+CG
Sbjct: 96  EEFRSGFA--DSRINDLRREPTAAPAVPGFMYDDATDLPRSVDWRQKGAVTAVKNQGRCG 153

Query: 154 SCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTDNHGCSGGLMDKAFEYIIENKGLAT 213
           SCWAFS V AVEGI  I  G L+ LSEQ+L+DC TD +GC GGLM+ AFE+I  + G+ T
Sbjct: 154 SCWAFSTVVAVEGINAIRTGSLVSLSEQELIDCDTDENGCQGGLMENAFEFIKSHGGITT 213

Query: 214 EADYPYRHEEGTCDNQK-EKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAF 272
           E+ YPY    GTCD  +  +     I  ++ +P G E AL +AV++QPVSV +DA G+A 
Sbjct: 214 ESAYPYHASNGTCDGARARRGRVVAIDGHQAVPAGSEDALAKAVAHQPVSVAIDAGGQAL 273

Query: 273 HFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILR--- 329
            FY  GV   DCG + DHGVA VG+G +++  G  YW++KNSWG +WGE GYIR+ R   
Sbjct: 274 QFYSEGVFTGDCGTDLDHGVAAVGYGVSDD--GTPYWIVKNSWGPSWGEGGYIRMQRGTG 331

Query: 330 DAGLCGIATAASYPV 344
           + GLCGIA  AS+P+
Sbjct: 332 NGGLCGIAMEASFPI 346


>gi|226506492|ref|NP_001140873.1| uncharacterized protein LOC100272949 precursor [Zea mays]
 gi|194701540|gb|ACF84854.1| unknown [Zea mays]
          Length = 379

 Score =  309 bits (792), Expect = 1e-81,   Method: Compositional matrix adjust.
 Identities = 155/350 (44%), Positives = 209/350 (59%), Gaps = 23/350 (6%)

Query: 13  MFVIIILVITCASQVVSGRSMHEPSIV------EKHEQWMAQHGRTYKDELEKAMRLNIF 66
           + V ++ V + A ++       E  +       + +E+W   H R ++   EK  R   F
Sbjct: 9   LLVALVFVSSAAVELCRAIDFDERDLASDEALWDLYERWQTHH-RVHRHHGEKGRRFGTF 67

Query: 67  KQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPST---- 122
           K+N+ +I   NK G+R Y+L  N F D+  EEFR+ +   +  +  + RQ S  +     
Sbjct: 68  KENVRFIHAHNKRGDRPYRLRLNRFGDMGREEFRSTFA--DSRINDLRRQDSPAARAGAV 125

Query: 123 --FKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSE 180
             F Y +  D P S+DWR++GAVT +K QG CGSCWAFS V AVEGI  I  G L  LSE
Sbjct: 126 PGFMYDSAADPPRSVDWRQEGAVTGVKVQGHCGSCWAFSTVVAVEGINAIRTGSLASLSE 185

Query: 181 QQLVDCSTDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEK---AVAAT 237
           Q+L+DC TD +GC GGLM+ AFE+I    G+ TEA YPYR   GTCD  + +    V   
Sbjct: 186 QELIDCDTDENGCQGGLMENAFEFIKSFGGITTEAAYPYRASNGTCDGDRARRGGGVVVV 245

Query: 238 ISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGF 297
           I  ++ +P G E AL +AV++QPVSV VDA G+AF FY  GV   DCG + DHGVA VG+
Sbjct: 246 IDGHQMVPAGSEDALAKAVAHQPVSVAVDAGGQAFQFYSEGVFTGDCGTDLDHGVAAVGY 305

Query: 298 GTAEEENGAKYWLIKNSWGETWGESGYIRILRDA---GLCGIATAASYPV 344
           G  ++  G  YW++KNSWG +WGE GYIR+ R A   GLCGIA  AS+P+
Sbjct: 306 GVGDD--GTPYWIVKNSWGTSWGEGGYIRMQRGAGNGGLCGIAMEASFPI 353


>gi|357156854|ref|XP_003577598.1| PREDICTED: thiol protease SEN102-like [Brachypodium distachyon]
          Length = 368

 Score =  309 bits (791), Expect = 1e-81,   Method: Compositional matrix adjust.
 Identities = 149/293 (50%), Positives = 200/293 (68%), Gaps = 9/293 (3%)

Query: 58  EKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYN-RPVPSVSRQ 116
           + A R N+FK+N++YI +ANK+ +R ++L  N+F+D+T +E R  Y G   R   ++S  
Sbjct: 64  DPARRFNVFKENVKYIHEANKK-DRPFRLALNKFADMTTDELRHSYAGSRVRHHRALSGG 122

Query: 117 SSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLI 176
                 F Y +  ++P ++DWREKGAVT IKDQGQCGSCWAFS +AAVE I +I  GKL+
Sbjct: 123 RRAQGNFTYSDAENLPPAVDWREKGAVTGIKDQGQCGSCWAFSTIAAVESINKIRTGKLV 182

Query: 177 ELSEQQLVDC-STDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVA 235
            LSEQ+L+DC + ++ GC GGLMD AF++I +N G+ +EA+YPY+ ++ TCD  KE    
Sbjct: 183 SLSEQELMDCDNVNDQGCDGGLMDYAFQFIQKNGGVTSEANYPYQGQQNTCDQAKENTHD 242

Query: 236 ATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVV 295
             I  YED+P  DE AL +AV+ QPVSV ++ASG+ F FY  GV    C  + DHGVA V
Sbjct: 243 VAIDGYEDVPANDESALQKAVAYQPVSVAIEASGQDFQFYSEGVFTGQCTTDLDHGVAAV 302

Query: 296 GFGTAEEENGAKYWLIKNSWGETWGESGYIRILRDA----GLCGIATAASYPV 344
           G+GTA +  G KYW++KNSWG  WGE GYIR+ R      GLCGIA  ASYP+
Sbjct: 303 GYGTARD--GTKYWIVKNSWGLDWGEKGYIRMQRGVSQAEGLCGIAMQASYPI 353


>gi|125540888|gb|EAY87283.1| hypothetical protein OsI_08685 [Oryza sativa Indica Group]
          Length = 357

 Score =  309 bits (791), Expect = 1e-81,   Method: Compositional matrix adjust.
 Identities = 156/356 (43%), Positives = 222/356 (62%), Gaps = 33/356 (9%)

Query: 11  IPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQ--------------WMAQHGRTYKDE 56
           + M  +++  + C++      S H+PS+V   ++              W  +H + Y   
Sbjct: 5   LSMLFLLLGFVACSATA----SHHDPSVVGYSQEDLALPNKLVGLFTSWSVKHSKIYASP 60

Query: 57  LEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQ 116
            EK  R  IFK+NL +I + N+  N +Y LG N F+D+ +EEF+A Y G     P ++R+
Sbjct: 61  KEKVKRYEIFKRNLRHIVETNRR-NGSYWLGLNHFADIAHEEFKASYLGLK---PGLARR 116

Query: 117 SSRP---STFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRG 173
            ++P   +TF+Y N  ++P ++DWR+KGAVT +K+QG+CGSCWAFS VAAVEGI QI  G
Sbjct: 117 DAQPHGSTTFRYANAVNLPWAVDWRKKGAVTPVKNQGECGSCWAFSTVAAVEGINQIVTG 176

Query: 174 KLIELSEQQLVDC-STDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEK 232
           KL+ LSEQ+L+DC +T NHGC GGLMD AF YI+ N+G+ TE DYPY  EEG C  ++  
Sbjct: 177 KLVSLSEQELMDCDNTFNHGCRGGLMDFAFAYIMGNQGIYTEEDYPYLMEEGYCREKQPH 236

Query: 233 AVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGV 292
           +   TI+ YED+P+  E +LL+A+++QPVSV + A  R F FYK G+ + +CG   DH +
Sbjct: 237 SKVITITGYEDVPENSETSLLKALAHQPVSVGIAAGSRDFQFYKGGIFDGECGIQPDHAL 296

Query: 293 AVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRDA----GLCGIATAASYPV 344
             VG+G+     G  Y ++KNSWG+ WGE GY RI R      G+C I   ASYP 
Sbjct: 297 TAVGYGSYY---GQDYIIMKNSWGKNWGEQGYFRIRRGTGKPEGVCDIYKIASYPT 349


>gi|146215978|gb|ABQ10191.1| actinidin Act1c [Actinidia eriantha]
          Length = 368

 Score =  309 bits (791), Expect = 2e-81,   Method: Compositional matrix adjust.
 Identities = 154/347 (44%), Positives = 216/347 (62%), Gaps = 14/347 (4%)

Query: 3   LKFEKSFIIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMR 62
           +   KSF+    +    ++  +  + + R+  E  +   +E W+ +HG++Y    E+  R
Sbjct: 1   MGLPKSFVSMSLLFFSTLLILSLALDAKRTNDE--VKAMYESWLIKHGKSYNSLGERERR 58

Query: 63  LNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPST 122
             IFK+ L +I++ N + +R+YK+G N+F+DLTNEEFR+ Y G+ R     S ++   + 
Sbjct: 59  FEIFKETLRFIDEHNADTSRSYKVGLNQFADLTNEEFRSTYLGFTRG----SNKTKVSNR 114

Query: 123 FKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQ 182
           ++ +    +P  +DWR +GAV  IK+QGQCGSCWAFSA+AAVEGI +I  G LI LSEQ+
Sbjct: 115 YEPRVGQVLPDYVDWRSEGAVVDIKNQGQCGSCWAFSAIAAVEGINKIVTGNLISLSEQE 174

Query: 183 LVDC--STDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISK 240
           LVDC  +    GC GG M   FE+II N G+ TE +YPY  +EG CD   +     TI  
Sbjct: 175 LVDCGRTQSTKGCDGGYMTDGFEFIINNGGINTEENYPYTAQEGQCDLNLQNEKYVTIDN 234

Query: 241 YEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTA 300
           YE++P  +E AL  AV+ QPVSV ++++G AF  Y SG+    CG   DH V +VG+GT 
Sbjct: 235 YENVPYYNEWALQTAVAYQPVSVALESAGDAFQHYSSGIFTGPCGTATDHAVTIVGYGT- 293

Query: 301 EEENGAKYWLIKNSWGETWGESGYIRILRD---AGLCGIATAASYPV 344
             E G  YW++KNSW  TWGE GY+RILR+   AG CGIAT  SYPV
Sbjct: 294 --EGGIDYWIVKNSWDTTWGEEGYMRILRNVGGAGTCGIATMPSYPV 338


>gi|186516984|ref|NP_195406.2| cysteine proteinase1 [Arabidopsis thaliana]
 gi|15290508|gb|AAK92229.1| cysteine proteinase [Arabidopsis thaliana]
 gi|332661313|gb|AEE86713.1| cysteine proteinase1 [Arabidopsis thaliana]
          Length = 376

 Score =  309 bits (791), Expect = 2e-81,   Method: Compositional matrix adjust.
 Identities = 158/315 (50%), Positives = 205/315 (65%), Gaps = 18/315 (5%)

Query: 44  QWMAQHGRTYKDEL----EKAMRLNIFKQNLEYIEKANKEG-NRTYKLGTNEFSDLTNEE 98
           QW A+HG+T  +      ++  R NIFK NL +I+  N+   N TYKLG  +F+DLTN+E
Sbjct: 51  QWSAEHGKTNNNNNGIINDQDKRFNIFKDNLRFIDLHNENNKNATYKLGLTKFTDLTNDE 110

Query: 99  FRALYTGYNRPVPSVSRQSSRPSTFKYQ---NVTDVPTSIDWREKGAVTHIKDQGQCGSC 155
           +R LY G  R  P+     ++    KY    N  +VP ++DWR+KGAV  IKDQG CGSC
Sbjct: 111 YRKLYLGA-RTEPARRIAKAKNVNQKYSAAVNGKEVPETVDWRQKGAVNPIKDQGTCGSC 169

Query: 156 WAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGLATE 214
           WAFS  AAVEGI +I  G+LI LSEQ+LVDC    N GC+GGLMD AF++I++N GL TE
Sbjct: 170 WAFSTTAAVEGINKIVTGELISLSEQELVDCDKSYNQGCNGGLMDYAFQFIMKNGGLNTE 229

Query: 215 ADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHF 274
            DYPYR   G C++  + +   +I  YED+P  DE AL +A+S QPVSV ++A GR F  
Sbjct: 230 KDYPYRGFGGKCNSFLKNSRVVSIDGYEDVPTKDETALKKAISYQPVSVAIEAGGRIFQH 289

Query: 275 YKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD---- 330
           Y+SG+    CG N DH V  VG+G+   ENG  YW+++NSWG  WGE GYIR+ R+    
Sbjct: 290 YQSGIFTGSCGTNLDHAVVAVGYGS---ENGVDYWIVRNSWGPRWGEEGYIRMERNLAAS 346

Query: 331 -AGLCGIATAASYPV 344
            +G CGIA  ASYPV
Sbjct: 347 KSGKCGIAVEASYPV 361


>gi|168058022|ref|XP_001781010.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162667567|gb|EDQ54194.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 457

 Score =  308 bits (790), Expect = 2e-81,   Method: Compositional matrix adjust.
 Identities = 165/334 (49%), Positives = 211/334 (63%), Gaps = 22/334 (6%)

Query: 27  VVSGRSMHEPSIVEKHE-------QWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKE 79
           V +G  +  P+ V K +        W  +HG+ Y    E+A R  ++K NLEYI++ + E
Sbjct: 23  VANGDVIRMPTDVGKDQLLAGQFAAWAHKHGKVYSAAEERAHRFLVWKDNLEYIQR-HSE 81

Query: 80  GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPST--FKYQNVTDVPTSIDW 137
            N +Y LG  +F+DLTNEEFR  YTG  R   S   +  R +T  F+Y N ++ P SIDW
Sbjct: 82  KNLSYWLGLTKFADLTNEEFRRQYTG-TRIDRSRRLKKGRNATGSFRYAN-SEAPKSIDW 139

Query: 138 REKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGG 196
           REKGAVT +KDQG CGSCWAFSAV +VEGI  I  G  I LS Q+LVDC    N GC+GG
Sbjct: 140 REKGAVTSVKDQGSCGSCWAFSAVGSVEGINAIRTGDAISLSVQELVDCDKKYNQGCNGG 199

Query: 197 LMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAV 256
           LMD AF+++I+N G+ TE DYPY+  +G CD  K  A   TI  YED+P+ DE+AL +AV
Sbjct: 200 LMDYAFDFVIQNGGIDTEKDYPYQGYDGRCDVNKMNARVVTIDSYEDVPENDEEALKKAV 259

Query: 257 SNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWG 316
           + QPVSV ++A GR F  Y  GV    CG + DHGV  VG+G+   E G  YW++KNSWG
Sbjct: 260 AGQPVSVAIEAGGRDFQLYSGGVFTGRCGTDLDHGVLAVGYGS---EKGLDYWIVKNSWG 316

Query: 317 ETWGESGYIRILRDA------GLCGIATAASYPV 344
           E WGESGY+R+ R+       GLCGI    SY V
Sbjct: 317 EYWGESGYLRMQRNLKDDNGYGLCGINIEPSYAV 350


>gi|115448287|ref|NP_001047923.1| Os02g0715000 [Oryza sativa Japonica Group]
 gi|42408029|dbj|BAD09165.1| putative cysteine proteinase [Oryza sativa Japonica Group]
 gi|113537454|dbj|BAF09837.1| Os02g0715000 [Oryza sativa Japonica Group]
 gi|215737450|dbj|BAG96580.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|215765786|dbj|BAG87483.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|222623551|gb|EEE57683.1| hypothetical protein OsJ_08138 [Oryza sativa Japonica Group]
          Length = 366

 Score =  308 bits (790), Expect = 2e-81,   Method: Compositional matrix adjust.
 Identities = 156/356 (43%), Positives = 221/356 (62%), Gaps = 33/356 (9%)

Query: 11  IPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQ--------------WMAQHGRTYKDE 56
           + M  +++  + C++      S H+PS+V   ++              W  +H + Y   
Sbjct: 14  LSMLFLLLGFVACSATA----SHHDPSVVGYSQEDLALPNKLVGLFTSWSVKHSKIYASP 69

Query: 57  LEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQ 116
            EK  R  IFK+NL +I + N+  N +Y LG N F+D+ +EEF+A Y G     P ++R+
Sbjct: 70  KEKVKRYEIFKRNLRHIVETNRR-NGSYWLGLNHFADIAHEEFKASYLGLK---PGLARR 125

Query: 117 SSRP---STFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRG 173
            ++P   +TF+Y N  ++P ++DWR+KGAVT +K+QG+CGSCWAFS VAAVEGI QI  G
Sbjct: 126 DAQPHGSTTFRYANAVNLPWAVDWRKKGAVTPVKNQGECGSCWAFSTVAAVEGINQIVTG 185

Query: 174 KLIELSEQQLVDC-STDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEK 232
           KL+ LSEQ+L+DC +T NHGC GGLMD AF YI+ N+G+ TE DYPY  EEG C  ++  
Sbjct: 186 KLVSLSEQELMDCDNTFNHGCRGGLMDFAFAYIMGNQGIYTEEDYPYLMEEGYCREKQPH 245

Query: 233 AVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGV 292
           +   TI+ YED+P   E +LL+A+++QPVSV + A  R F FYK G+ + +CG   DH +
Sbjct: 246 SKVITITGYEDVPANSETSLLKALAHQPVSVGIAAGSRDFQFYKGGIFDGECGIQPDHAL 305

Query: 293 AVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRDA----GLCGIATAASYPV 344
             VG+G+     G  Y ++KNSWG+ WGE GY RI R      G+C I   ASYP 
Sbjct: 306 TAVGYGSYY---GQDYIIMKNSWGKNWGEQGYFRIRRGTGKPEGVCDIYKIASYPT 358


>gi|255032|gb|AAB23155.1| COT44=cysteine proteinase homolog [Brassica napus, seedling, rapid
           cycling base population CrGC5, Peptide, 328 aa]
          Length = 328

 Score =  308 bits (790), Expect = 2e-81,   Method: Compositional matrix adjust.
 Identities = 156/315 (49%), Positives = 206/315 (65%), Gaps = 19/315 (6%)

Query: 44  QWMAQHGRTYKDEL----EKAMRLNIFKQNLEYIEKANKEG-NRTYKLGTNEFSDLTNEE 98
           +W  +HG++  +      ++  R NIFK NL +I+  N+   N TYKLG   F++LTN+E
Sbjct: 6   RWSLEHGKSNSNSNGIINQQDERFNIFKDNLRFIDLHNENNKNATYKLGLTIFANLTNDE 65

Query: 99  FRALYTG-YNRPVPSVSRQSSRPSTFKYQ---NVTDVPTSIDWREKGAVTHIKDQGQCGS 154
           +R+LY G    PV  +++  ++    KY    N  +VP ++DWR+KGAV  IKDQG CGS
Sbjct: 66  YRSLYLGARTEPVRRITK--AKNVNMKYSAAVNDVEVPVTVDWRQKGAVNAIKDQGTCGS 123

Query: 155 CWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGLAT 213
           CWAFS  AAVEGI +I  G+L+ LSEQ+LVDC    N GC+GGLMD AF++I++N GL T
Sbjct: 124 CWAFSTAAAVEGINKIVTGELVSLSEQELVDCDKSYNQGCNGGLMDYAFQFIMKNGGLNT 183

Query: 214 EADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFH 273
           E DYPY    G C++  + +   TI  YED+P  DE AL +AVS QPVSV +DA GRAF 
Sbjct: 184 EKDYPYHGTNGKCNSLLKNSRVVTIDGYEDVPSKDETALKRAVSYQPVSVAIDAGGRAFQ 243

Query: 274 FYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD--- 330
            Y+SG+    CG N DH V  VG+G+   ENG  YW+++NSWG  WGE GYIR+ R+   
Sbjct: 244 HYQSGIFTGKCGTNMDHAVVAVGYGS---ENGVDYWIVRNSWGTRWGEDGYIRMERNVAS 300

Query: 331 -AGLCGIATAASYPV 344
            +G CGIA  ASYPV
Sbjct: 301 KSGKCGIAIEASYPV 315


>gi|242074728|ref|XP_002447300.1| hypothetical protein SORBIDRAFT_06g032360 [Sorghum bicolor]
 gi|241938483|gb|EES11628.1| hypothetical protein SORBIDRAFT_06g032360 [Sorghum bicolor]
          Length = 471

 Score =  308 bits (790), Expect = 2e-81,   Method: Compositional matrix adjust.
 Identities = 153/326 (46%), Positives = 208/326 (63%), Gaps = 11/326 (3%)

Query: 27  VVSGRSMHEPSIVEKHEQWMAQHGRTYKDEL-EKAMRLNIFKQNLEYIEKAN-KEGNRTY 84
           V  G +  E  +   +EQWMA+HG+   + L E   R   F  NL +++  N + G R Y
Sbjct: 37  VGGGMARTEAQVRAMYEQWMARHGKAASNALGEHDRRFRAFWDNLRFVDAHNARAGARGY 96

Query: 85  KLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVT 144
           +LG N F+DLTN EFRA Y   +    + +  ++    +++  V  +P  +DWR+KGAV 
Sbjct: 97  RLGINRFADLTNAEFRAAY--LSAGARNGTATAATGERYRHDGVEALPEFVDWRQKGAVA 154

Query: 145 HIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD--NHGCSGGLMDKAF 202
            +K+QGQCGSCWAFSAV AVEGI QI  G+L+ LSEQ+LVDCS +  N GC GG+MD AF
Sbjct: 155 PVKNQGQCGSCWAFSAVGAVEGINQIVTGELVTLSEQELVDCSKNGQNGGCDGGMMDDAF 214

Query: 203 EYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVS 262
            +I+ N G+ T+ DYPY   +G CD  K      +I  +E +P+ DE++L +AV++QPV+
Sbjct: 215 AFIVGNGGIDTDKDYPYTARDGKCDVAKRSRHVVSIDGFEGVPRNDEKSLQKAVAHQPVA 274

Query: 263 VCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGES 322
           V ++A GR F  Y+SGV    CG + DHGV  VG+GT E + G  YWL++NSWG  WGE 
Sbjct: 275 VAIEAGGREFQLYQSGVFTGRCGTSLDHGVVAVGYGT-EADGGRDYWLVRNSWGADWGEG 333

Query: 323 GYIRILRD----AGLCGIATAASYPV 344
           GYIR+ R+    AG CGIA  ASYPV
Sbjct: 334 GYIRMERNVGARAGKCGIAMEASYPV 359


>gi|224081756|ref|XP_002306486.1| predicted protein [Populus trichocarpa]
 gi|222855935|gb|EEE93482.1| predicted protein [Populus trichocarpa]
          Length = 352

 Score =  308 bits (789), Expect = 3e-81,   Method: Compositional matrix adjust.
 Identities = 160/310 (51%), Positives = 208/310 (67%), Gaps = 11/310 (3%)

Query: 42  HEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRA 101
           ++ W+A+HG+ Y    E+A R  IFK NL +I++ N + N TYK+G  +F+DLTNEE+RA
Sbjct: 4   YKWWLAKHGKAYNGLGEEAERFEIFKNNLRFIDEHNSQ-NHTYKVGLTKFADLTNEEYRA 62

Query: 102 LYTGYNRPVPSVSRQSSRPS-TFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSA 160
           ++ G          +S  PS  + ++    +P S+DWR KGAV  IKDQG CGSCWAFS 
Sbjct: 63  MFLGTRSDAKRRLMKSKSPSERYAFKAGDKLPESVDWRAKGAVNPIKDQGSCGSCWAFST 122

Query: 161 VAAVEGITQITRGKLIELSEQQLVDCS-TDNHGCSGGLMDKAFEYIIENKGLATEADYPY 219
           VAAVEGI QI  G+LI LSEQ+LVDC  T N GC+GGLMD AF++II N GL TE DYPY
Sbjct: 123 VAAVEGINQIVTGELISLSEQELVDCDRTYNAGCNGGLMDYAFQFIINNGGLDTEKDYPY 182

Query: 220 RHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGV 279
             ++  CD  K K  A +I  +ED+   DE+AL +AV++QPVSV ++ASG A  FY+SGV
Sbjct: 183 VGDDDKCDKDKMKTKAVSIDGFEDVLPYDEKALQKAVAHQPVSVAIEASGMALQFYQSGV 242

Query: 280 LNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD-----AGLC 334
              +CG   DHGV VVG+ +   ENG  YWL++NSWG  WGE GYI++ R+      G C
Sbjct: 243 FTGECGTALDHGVVVVGYAS---ENGLDYWLVRNSWGTEWGEHGYIKMQRNVGDTYTGRC 299

Query: 335 GIATAASYPV 344
           GIA  +SYPV
Sbjct: 300 GIAMESSYPV 309


>gi|115468686|ref|NP_001057942.1| Os06g0582600 [Oryza sativa Japonica Group]
 gi|55296512|dbj|BAD68726.1| putative cysteine proteinase [Oryza sativa Japonica Group]
 gi|113595982|dbj|BAF19856.1| Os06g0582600 [Oryza sativa Japonica Group]
 gi|215695236|dbj|BAG90427.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 357

 Score =  307 bits (787), Expect = 4e-81,   Method: Compositional matrix adjust.
 Identities = 167/360 (46%), Positives = 225/360 (62%), Gaps = 31/360 (8%)

Query: 8   SFIIPMFVIIILVITCASQVV--------SGRSMHEPSIVEKHEQWMAQHGRTYKDELEK 59
           SF +   ++II++  C + +V        +     + ++ E++E+W A HGRTYKD LEK
Sbjct: 7   SFSLAAILLIIIMYCCPTGLVEAARKGPAAAGGGDDSAMRERYEKWAADHGRTYKDSLEK 66

Query: 60  AMRLNIFKQNLEYIEKANKEGNR-TYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSS 118
           A R  +F+ N  +I+  N  G + + +L TN+F+DLTNEEF A Y G     P +     
Sbjct: 67  ARRFEVFRTNALFIDSFNAAGGKKSPRLTTNKFADLTNEEF-AEYYGRPFSTPVIG---- 121

Query: 119 RPSTFKYQNV--TDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLI 176
             S F Y NV  +DVP +I+WR++GAVT +K+Q  C SCWAFSAVAAVEGI QI    L+
Sbjct: 122 -GSGFMYGNVRTSDVPANINWRDRGAVTQVKNQKDCASCWAFSAVAAVEGIHQIRSHNLV 180

Query: 177 ELSEQQLVDCST--DNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEE-GTCDNQKEKA 233
            LS QQL+DCST  +NHGC+ G MD+AF YI  N G+A E+DYPY     GTC     K 
Sbjct: 181 ALSTQQLLDCSTGRNNHGCNRGDMDEAFRYITSNGGIAAESDYPYEDRALGTC-RASGKP 239

Query: 234 VAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVL----NADCGNNCD 289
           VAA+I  ++ +P  +E ALL AV++QPVSV +D  G+   F+ SGV     N  C  + +
Sbjct: 240 VAASIRGFQYVPPNNETALLLAVAHQPVSVALDGVGKVSQFFSSGVFGAMQNETCTTDLN 299

Query: 290 HGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD----AGLCGIATAASYPVA 345
           H +  VG+GT  +E+G KYWL+KNSWG  WGE GY++I RD     GLCG+A   SYPVA
Sbjct: 300 HAMTAVGYGT--DEHGTKYWLMKNSWGTDWGEGGYMKIARDVASNTGLCGLAMQPSYPVA 357


>gi|168057475|ref|XP_001780740.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162667829|gb|EDQ54449.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 463

 Score =  307 bits (787), Expect = 4e-81,   Method: Compositional matrix adjust.
 Identities = 159/346 (45%), Positives = 218/346 (63%), Gaps = 26/346 (7%)

Query: 16  IIILVITCASQVVSGRS----------MH-EPSIVEKHEQWMAQHGRTYKDELEKAMRLN 64
           +++LVI    Q  +GR+          +H + +I++   QW+  H R Y+   EK  R  
Sbjct: 12  LVLLVIAIGQQADAGRANAIVDYEGNQLHSDDAILDVFHQWLETHSRVYRSLSEKHHRFQ 71

Query: 65  IFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFK 124
           IFK+N  YI   NK+  ++Y LG N+FSDLT++EFRA Y G  +PV   +RQ  + + F 
Sbjct: 72  IFKENFLYIHAHNKQ-QKSYWLGLNKFSDLTHQEFRAQYLG-TKPV---NRQR-KEANFM 125

Query: 125 YQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLV 184
           Y++V   P  +DWR KGAVT +KDQG CGSCWAFSAV +VEG+  I  G+L+ LSEQ+LV
Sbjct: 126 YEDVEAEP-KVDWRLKGAVTDVKDQGACGSCWAFSAVGSVEGVNAIKTGELVSLSEQELV 184

Query: 185 DCS-TDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYED 243
           DC    N GC+GGLMD AFE+II+N G+ TE DYPY+  +G CD  +  +    I  Y+D
Sbjct: 185 DCDRKQNQGCNGGLMDYAFEFIIKNGGIDTEKDYPYKARDGRCDEGRRNSKVVVIDDYQD 244

Query: 244 LPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEE 303
           +P   E AL++A++  PVSV ++A GR F  Y+ GV    CG+  DHGV  VG+GT  ++
Sbjct: 245 VPTQSESALMKALTKNPVSVAIEAGGRDFQHYQGGVFTGPCGSELDHGVLAVGYGT--DD 302

Query: 304 NGAKYWLIKNSWGETWGESGYIRILR-----DAGLCGIATAASYPV 344
           +G  YW++KNSWG  WGE GYIR+ R       G CGI   AS+P+
Sbjct: 303 DGVNYWIVKNSWGPGWGEKGYIRMERFGSDSTDGKCGINIEASFPI 348


>gi|218181|dbj|BAA14402.1| oryzain alpha precursor [Oryza sativa Japonica Group]
          Length = 458

 Score =  307 bits (786), Expect = 5e-81,   Method: Compositional matrix adjust.
 Identities = 157/327 (48%), Positives = 206/327 (62%), Gaps = 16/327 (4%)

Query: 27  VVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKAN---KEGNRT 83
           +VS     E      + +W A+HG++Y    E+  R   F+ NL YI++ N     G  +
Sbjct: 25  IVSYGERSEEEARRLYAEWKAEHGKSYNAVGEEERRYAAFRDNLRYIDEHNAAADAGVHS 84

Query: 84  YKLGTNEFSDLTNEEFRALYTGY-NRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGA 142
           ++LG N F+DLTNEE+R  Y G  N+P     R+      +   +   +P S+DWR KGA
Sbjct: 85  FRLGLNRFADLTNEEYRDTYLGLRNKP----RRERKVSDRYLAADNEALPESVDWRTKGA 140

Query: 143 VTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKA 201
           V  IKDQG CGSCWAFSA+AAVE I QI  G LI LSEQ+LVDC T  N GC+GGLMD A
Sbjct: 141 VAEIKDQGGCGSCWAFSAIAAVEDINQIVTGDLISLSEQELVDCDTSYNEGCNGGLMDYA 200

Query: 202 FEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPV 261
           F++II N G+ TE DYPY+ ++  CD  ++ A   TI  YED+    E +L +AV NQPV
Sbjct: 201 FDFIINNGGIDTEDDYPYKGKDERCDVNRKNAKVVTIDSYEDVTPNSETSLQKAVRNQPV 260

Query: 262 SVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGE 321
           SV ++A GRAF  Y SG+    CG   DHGVA VG+GT   ENG  YW+++NSWG++WGE
Sbjct: 261 SVAIEAGGRAFQLYSSGIFTGKCGTALDHGVAAVGYGT---ENGKDYWIVRNSWGKSWGE 317

Query: 322 SGYIRILRD----AGLCGIATAASYPV 344
           SGY+R+ R+    +G CGIA   SYP+
Sbjct: 318 SGYVRMERNIKASSGKCGIAVEPSYPL 344


>gi|242055753|ref|XP_002457022.1| hypothetical protein SORBIDRAFT_03g047290 [Sorghum bicolor]
 gi|241928997|gb|EES02142.1| hypothetical protein SORBIDRAFT_03g047290 [Sorghum bicolor]
          Length = 378

 Score =  307 bits (786), Expect = 6e-81,   Method: Compositional matrix adjust.
 Identities = 170/340 (50%), Positives = 215/340 (63%), Gaps = 31/340 (9%)

Query: 32  SMHEPSIVEKHEQWMAQHGR-TYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNE 90
           S HE S+ E  E+W+++H +  Y    EK  R  +FK NL +I++ N++ + +Y LG NE
Sbjct: 39  SSHE-SLAELFERWLSRHRKGAYASLEEKLRRFEVFKDNLHHIDETNRKVS-SYWLGLNE 96

Query: 91  FSDLTNEEFRALYTGYNRPVPSVS-----------------RQSSRPSTFKYQNV--TDV 131
           F+DLT++EF+A Y G +                          SS    F+Y+ V    +
Sbjct: 97  FADLTHDEFKATYLGLSPSGGGGDVVHMHHDDDDEEPEEEGSSSSSSFRFRYEGVDAARL 156

Query: 132 PTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD-N 190
           P S+DWR KGAVT +K+QGQCGSCWAFS VAAVEGI QI  G L  LSEQ+LVDC TD N
Sbjct: 157 PKSVDWRSKGAVTGVKNQGQCGSCWAFSTVAAVEGINQIVTGNLTALSEQELVDCDTDGN 216

Query: 191 HGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQ 250
           +GC+GGLMD AF YI  N GL TE  YPY  EEGTC      AV  TIS YED+P+ +EQ
Sbjct: 217 NGCNGGLMDYAFSYIAHNGGLHTEEAYPYLMEEGTCSRGSSAAV-VTISGYEDVPRNNEQ 275

Query: 251 ALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENG---AK 307
           ALL+A+++QPVSV ++ASGR   FY  GV +  CG   DHGVA VG+GTA ++NG   A 
Sbjct: 276 ALLKALAHQPVSVAIEASGRNLQFYSGGVFDGPCGTQLDHGVAAVGYGTAGKDNGHVVAD 335

Query: 308 YWLIKNSWGETWGESGYIRILRDA----GLCGIATAASYP 343
           Y ++KNSWG +WGE GYIR+ R      GLCGI    SYP
Sbjct: 336 YIIVKNSWGPSWGEKGYIRMRRGTGKRQGLCGINKMPSYP 375


>gi|110737959|dbj|BAF00916.1| cysteine proteinase [Arabidopsis thaliana]
          Length = 376

 Score =  307 bits (786), Expect = 6e-81,   Method: Compositional matrix adjust.
 Identities = 157/315 (49%), Positives = 204/315 (64%), Gaps = 18/315 (5%)

Query: 44  QWMAQHGRTYKDEL----EKAMRLNIFKQNLEYIEKANKEG-NRTYKLGTNEFSDLTNEE 98
           QW A+HG+T  +      ++  R NIFK NL +I+  N+   N TYKLG  +F+DLTN+E
Sbjct: 51  QWSAEHGKTNNNNNGIINDQDKRFNIFKDNLRFIDLHNENNKNATYKLGLTKFTDLTNDE 110

Query: 99  FRALYTGYNRPVPSVSRQSSRPSTFKYQ---NVTDVPTSIDWREKGAVTHIKDQGQCGSC 155
           +R LY G  R  P+     ++    KY    N  +VP ++DWR+KGAV  IKDQG CGSC
Sbjct: 111 YRKLYLGA-RTEPARRIAKAKNVNQKYSAAVNGKEVPETVDWRQKGAVNPIKDQGTCGSC 169

Query: 156 WAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGLATE 214
           WAFS  AAVEGI +I  G+LI LSEQ+LVDC    N GC+GGLMD AF++I++N GL TE
Sbjct: 170 WAFSTTAAVEGINKIVTGELISLSEQELVDCDKSYNQGCNGGLMDYAFQFIMKNGGLNTE 229

Query: 215 ADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHF 274
            DYPYR   G C++  + +   +I  YED+P  DE AL +A+S QPV V ++A GR F  
Sbjct: 230 KDYPYRGFGGKCNSFLKNSRVVSIDGYEDVPTKDETALKKAISYQPVRVAIEAGGRIFQH 289

Query: 275 YKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD---- 330
           Y+SG+    CG N DH V  VG+G+   ENG  YW+++NSWG  WGE GYIR+ R+    
Sbjct: 290 YQSGIFTGSCGTNLDHAVVAVGYGS---ENGVDYWIVRNSWGPRWGEEGYIRMERNLAAS 346

Query: 331 -AGLCGIATAASYPV 344
            +G CGIA  ASYPV
Sbjct: 347 KSGKCGIAVEASYPV 361


>gi|255563136|ref|XP_002522572.1| cysteine protease, putative [Ricinus communis]
 gi|223538263|gb|EEF39872.1| cysteine protease, putative [Ricinus communis]
          Length = 340

 Score =  306 bits (785), Expect = 8e-81,   Method: Compositional matrix adjust.
 Identities = 167/342 (48%), Positives = 225/342 (65%), Gaps = 21/342 (6%)

Query: 13  MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEY 72
           + +++++++T  SQ +    + E ++ EKHEQWMA+HGRTY+D+ EK  R +IFK+NL++
Sbjct: 9   LAIVLMILVTWVSQAMPRPLIDEDAVAEKHEQWMARHGRTYQDDEEKERRFHIFKKNLKH 68

Query: 73  IEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRP--VPS--VSRQSSRPSTFKYQNV 128
           IE  N   NRTYKLG N F+DLT+EEF A YTGY  P  +P+  ++ ++++ S   Y+  
Sbjct: 69  IENFNNAFNRTYKLGLNHFADLTDEEFLATYTGYKMPKVLPTANITTKTTQSSDVLYE-- 126

Query: 129 TDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST 188
            +VP SIDWR +G VT +K+QG+CG CWAFSA AAVEGI     G  + LS QQL+DC  
Sbjct: 127 ANVPESIDWRTRGVVTPVKNQGRCGCCWAFSAAAAVEGII----GNGVSLSAQQLLDCVP 182

Query: 189 DNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGD 248
           D++GC+GG MD AF YII+N+GLA+   YPY+     C   +    AA IS Y D+   D
Sbjct: 183 DSNGCNGGFMDNAFRYIIQNQGLASATYYPYQLMREMC---RPSNNAARISGYVDVTPAD 239

Query: 249 EQALLQAVSNQPVSVCVDASGRA-FHFYKSGVLNA-DCGNNCDHGVAVVGFGTAEEENGA 306
           E+ L  AV+ QPVS  VDA+    F +Y  G+    DCG+   H + +VG+GT+ E  G 
Sbjct: 240 EETLKSAVARQPVSAAVDATSELNFKYYGGGIFPPQDCGSTLTHAITIVGYGTSAE--GT 297

Query: 307 KYWLIKNSWGETWGESGYIRILRD----AGLCGIATAASYPV 344
           KYWLIKNSWGE WGE GY+R+ RD     G CGIA  ASYP 
Sbjct: 298 KYWLIKNSWGEGWGEGGYMRLQRDVGSYGGACGIALRASYPT 339


>gi|255538788|ref|XP_002510459.1| cysteine protease, putative [Ricinus communis]
 gi|223551160|gb|EEF52646.1| cysteine protease, putative [Ricinus communis]
          Length = 422

 Score =  306 bits (784), Expect = 1e-80,   Method: Compositional matrix adjust.
 Identities = 152/342 (44%), Positives = 215/342 (62%), Gaps = 14/342 (4%)

Query: 8   SFIIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFK 67
           +F+  +F+I +L       +          I +  E W  +HG+TY  + +K  R  IF+
Sbjct: 2   NFLSALFLITLLFF----NLSISSFSSSSDISKLFESWTKEHGKTYTSKEDKLYRFKIFE 57

Query: 68  QNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQN 127
           +N E+++K N +GN +Y L  N F+DLT+ EF+A   G +    S S + SR +   +  
Sbjct: 58  ENYEFVKKHNSQGNSSYTLSLNAFADLTHHEFKASRLGLS--AFSTSGKLSRRNFPLHDF 115

Query: 128 VTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCS 187
           V DVP SIDWR+KGAV+ +KDQG CG+CW+FSA  A+EGI +I  G L+ LSEQ+LVDC 
Sbjct: 116 VGDVPISIDWRKKGAVSQVKDQGNCGACWSFSATGAIEGINKIVTGSLVSLSEQELVDCD 175

Query: 188 TD-NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPK 246
              N+GC GGLMD A++++IEN G+ TE DYPY+  E TC+ +K K    TI  Y D+P+
Sbjct: 176 RSYNNGCEGGLMDYAYQFVIENNGIDTEEDYPYQAREKTCNKEKLKRHVVTIDGYTDVPQ 235

Query: 247 GDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGA 306
            +E+ LL+AV+ QPVSV +  S RAF  Y  G+    C  + DH V +VG+G+   ENG 
Sbjct: 236 NNEKELLKAVAAQPVSVGICGSERAFQLYSKGIFTGPCSTSLDHAVLIVGYGS---ENGV 292

Query: 307 KYWLIKNSWGETWGESGYIRILRDA----GLCGIATAASYPV 344
            YW++KNSWG  WG +GY+ +LR++    GLCGI   AS+PV
Sbjct: 293 DYWIVKNSWGTHWGINGYMYMLRNSGNSQGLCGINMLASFPV 334


>gi|242081867|ref|XP_002445702.1| hypothetical protein SORBIDRAFT_07g024430 [Sorghum bicolor]
 gi|241942052|gb|EES15197.1| hypothetical protein SORBIDRAFT_07g024430 [Sorghum bicolor]
          Length = 372

 Score =  306 bits (784), Expect = 1e-80,   Method: Compositional matrix adjust.
 Identities = 158/317 (49%), Positives = 199/317 (62%), Gaps = 13/317 (4%)

Query: 35  EPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDL 94
           E ++   +E+W  +H    +D  +KA R N+FK+N+  I   N+  +  YKL  N F D+
Sbjct: 40  EEALWALYERWRGRHA-VARDLGDKARRFNVFKENVRLIHDFNQR-DEPYKLRLNRFGDM 97

Query: 95  TNEEFRALYTGYNRPVPSVSR--QSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQC 152
           T +EFR  Y G       + R  +    S+F Y    D+PTS+DWR+KGAVT +KDQGQC
Sbjct: 98  TADEFRRHYAGSRVAHHRMFRGDRQGSASSFMYAGARDLPTSVDWRQKGAVTDVKDQGQC 157

Query: 153 GSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGL 211
           GSCWAFS +AAVEGI  I    L  LSEQQLVDC T  N GC GGLMD AF+YI ++ G+
Sbjct: 158 GSCWAFSTIAAVEGINAIKTKNLTSLSEQQLVDCDTKGNAGCDGGLMDYAFQYIAKHGGV 217

Query: 212 ATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRA 271
           A E  YPY+  + +C  +K  A A TI  YED+P  DE AL +AV++QPVSV ++ASG  
Sbjct: 218 AAEDAYPYKARQASC--KKSPAPAVTIDGYEDVPANDESALKKAVAHQPVSVAIEASGSH 275

Query: 272 FHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRDA 331
           F FY  GV    CG   DHGV  VG+G A   +G KYW++KNSWG  WGE GYIR+ RD 
Sbjct: 276 FQFYSEGVFAGRCGTELDHGVTAVGYGVA--ADGTKYWVVKNSWGPEWGEKGYIRMARDV 333

Query: 332 ----GLCGIATAASYPV 344
               G CGIA  ASYPV
Sbjct: 334 AAKEGHCGIAMEASYPV 350


>gi|3688528|emb|CAA06243.1| pre-pro-TPE4A protein [Pisum sativum]
          Length = 360

 Score =  306 bits (784), Expect = 1e-80,   Method: Compositional matrix adjust.
 Identities = 159/316 (50%), Positives = 212/316 (67%), Gaps = 11/316 (3%)

Query: 35  EPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDL 94
           E S+ + +E+W + H  T +   EK  R N+FK N+ ++   NK  ++ YKL  N+F+D+
Sbjct: 33  EKSLWDLYERWRSHHTVT-RSLDEKHNRFNVFKANVMHVHNTNKL-DKPYKLKLNKFADM 90

Query: 95  TNEEFRALYTGYNRPVPSVSR-QSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCG 153
           TN EFR +Y         + R  S+   TF Y+NV +VP+SIDWR+KGAVT +KDQGQCG
Sbjct: 91  TNYEFRRIYADSKVSHHRMFRGMSNENGTFMYENVKNVPSSIDWRKKGAVTDVKDQGQCG 150

Query: 154 SCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST-DNHGCSGGLMDKAFEYIIENKGLA 212
           SCWAFS + AVEGI QI   KL+ LSEQ+LVDC T  N GC+GGLM+ AFE+I +N G+ 
Sbjct: 151 SCWAFSTIVAVEGINQIKTQKLVSLSEQELVDCDTGGNEGCNGGLMEYAFEFIKQN-GIT 209

Query: 213 TEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAF 272
           TE++YPY  ++GTCD +KE     +I  YE++P  +E ALL+A + QPVSV +DA G  F
Sbjct: 210 TESNYPYAAKDGTCDLKKEDKAEVSIDGYENVPINNEAALLKAAAKQPVSVAIDAGGYNF 269

Query: 273 HFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILR--- 329
            FY  GV +  CG + +HGVAVVG+G  ++    KYW++KNSWG  WGE GYIR+ R   
Sbjct: 270 QFYSEGVFSGHCGTDLNHGVAVVGYGVTQDR--TKYWIVKNSWGSEWGEQGYIRMQRGIS 327

Query: 330 -DAGLCGIATAASYPV 344
              GLCGIA  ASYP+
Sbjct: 328 HKEGLCGIAMEASYPI 343


>gi|13432122|sp|P80884.2|ANAN_ANACO RecName: Full=Ananain; Flags: Precursor
 gi|2623956|emb|CAA05487.1| Ananain precursor [Ananas comosus]
          Length = 345

 Score =  306 bits (783), Expect = 1e-80,   Method: Compositional matrix adjust.
 Identities = 150/339 (44%), Positives = 224/339 (66%), Gaps = 20/339 (5%)

Query: 13  MFVIIILVITCASQVVSGRSMHEPS--IVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNL 70
           +F+ + L +  AS   S  S  EPS  ++++ E+WMA++GR YKD  EK +R  IFK N+
Sbjct: 8   VFLFLFLCVMWASP--SAASCDEPSDPMMKQFEEWMAEYGRVYKDNDEKMLRFQIFKNNV 65

Query: 71  EYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTD 130
            +IE  N     +Y LG N+F+D+TN EF A YTG + P+ ++ R+     +F   +++ 
Sbjct: 66  NHIETFNNRNGNSYTLGINQFTDMTNNEFVAQYTGLSLPL-NIKREPV--VSFDDVDISS 122

Query: 131 VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTDN 190
           VP SIDWR+ GAVT +K+QG+CGSCWAF+++A VE I +I RG L+ LSEQQ++DC+  +
Sbjct: 123 VPQSIDWRDSGAVTSVKNQGRCGSCWAFASIATVESIYKIKRGNLVSLSEQQVLDCAV-S 181

Query: 191 HGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAV--AATISKYEDLPKGD 248
           +GC GG ++KA+ +II NKG+A+ A YPY+  +GTC   K   V  +A I++Y  + + +
Sbjct: 182 YGCKGGWINKAYSFIISNKGVASAAIYPYKAAKGTC---KTNGVPNSAYITRYTYVQRNN 238

Query: 249 EQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKY 308
           E+ ++ AVSNQP++  +DASG  F  YK GV    CG   +H + ++G+G  ++ +G K+
Sbjct: 239 ERNMMYAVSNQPIAAALDASGN-FQHYKRGVFTGPCGTRLNHAIVIIGYG--QDSSGKKF 295

Query: 309 WLIKNSWGETWGESGYIRILRDA----GLCGIATAASYP 343
           W+++NSWG  WGE GYIR+ RD     GLCGIA    YP
Sbjct: 296 WIVRNSWGAGWGEGGYIRLARDVSSSFGLCGIAMDPLYP 334


>gi|18141281|gb|AAL60578.1|AF454956_1 senescence-associated cysteine protease [Brassica oleracea]
          Length = 445

 Score =  305 bits (782), Expect = 1e-80,   Method: Compositional matrix adjust.
 Identities = 152/324 (46%), Positives = 209/324 (64%), Gaps = 14/324 (4%)

Query: 28  VSGRSMHE-PSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKL 86
           V+ ++ H  P  V+  E+W+ ++ + Y    EK  R  IF  NL+++++ N   N++Y+L
Sbjct: 22  VTAKADHRNPEEVKMFERWLVENHKNYNGLGEKDKRFEIFMDNLKFVQEHNSVPNQSYEL 81

Query: 87  GTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHI 146
           G   F+DLTNEEFRA+Y    R     +R S +   + +     +P  +DWR KGAV  +
Sbjct: 82  GLTRFADLTNEEFRAIYL---RSKMERTRDSVKSERYLHNVGDKLPDEVDWRAKGAVVPV 138

Query: 147 KDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYI 205
           KDQG CGSCWAFSA+ AVEGI QI  G+L+ LSEQ+LVDC T  N+GC GGLMD AF++I
Sbjct: 139 KDQGSCGSCWAFSAIGAVEGINQIKTGELVSLSEQELVDCDTSYNNGCGGGLMDYAFQFI 198

Query: 206 IENKGLATEADYPYR-HEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVC 264
           I N G+ TE DYPY   ++  C+  K+     TI  YED+P+ +E +L +A++NQP+SV 
Sbjct: 199 ISNGGIDTEEDYPYTATDDNICNTDKKNTRVVTIDGYEDVPE-NENSLKKALANQPISVA 257

Query: 265 VDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGY 324
           ++A GR F  YKSGV    CG   DHGV  VG+GT+E   G  YW+I+NSWG  WGESGY
Sbjct: 258 IEAGGRGFQLYKSGVFTGTCGTALDHGVVAVGYGTSE---GQDYWIIRNSWGSNWGESGY 314

Query: 325 IRILRD----AGLCGIATAASYPV 344
           I++ R+    +G CG+A  ASYP 
Sbjct: 315 IKLQRNIKDSSGKCGVAMMASYPT 338


>gi|42567068|ref|NP_567686.2| putative cysteine proteinase [Arabidopsis thaliana]
 gi|332659371|gb|AEE84771.1| putative cysteine proteinase [Arabidopsis thaliana]
          Length = 356

 Score =  305 bits (782), Expect = 2e-80,   Method: Compositional matrix adjust.
 Identities = 154/349 (44%), Positives = 226/349 (64%), Gaps = 28/349 (8%)

Query: 13  MFVIIILVITCASQVVS--------GRSMHEPSIVEKHEQWMAQHGRTYKDEL-EKAMRL 63
           +F++I+ V++  S  +          RS  E   +   + WM++HG+TY + L EK  R 
Sbjct: 12  LFLLIVFVLSAPSSAMDLPATSGGHNRSNEEVEFI--FQMWMSKHGKTYTNALGEKERRF 69

Query: 64  NIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTF 123
             FK NL +I++ N + N +Y+LG   F+DLT +E+R L+ G  +P     +Q +  ++ 
Sbjct: 70  QNFKDNLRFIDQHNAK-NLSYQLGLTRFADLTVQEYRDLFPGSPKP-----KQRNLKTSR 123

Query: 124 KYQNVT--DVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQ 181
           +Y  +    +P S+DWR++GAV+ IKDQG C SCWAFS VAAVEG+ +I  G+LI LSEQ
Sbjct: 124 RYVPLAGDQLPESVDWRQEGAVSEIKDQGTCNSCWAFSTVAAVEGLNKIVTGELISLSEQ 183

Query: 182 QLVDCSTDNHGCSG-GLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKA-VAATIS 239
           +LVDC+  N+GC G GLMD AF+++I N GL +E DYPY+  +G+C+ ++  +    TI 
Sbjct: 184 ELVDCNLVNNGCYGSGLMDTAFQFLINNNGLDSEKDYPYQGTQGSCNRKQSTSNKVITID 243

Query: 240 KYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGT 299
            YED+P  DE +L +AV++QPVSV VD   + F  Y+S + N  CG N DH + +VG+G+
Sbjct: 244 SYEDVPANDEISLQKAVAHQPVSVGVDKKSQEFMLYRSCIYNGPCGTNLDHALVIVGYGS 303

Query: 300 AEEENGAKYWLIKNSWGETWGESGYIRILRD----AGLCGIATAASYPV 344
              ENG  YW+++NSWG TWG++GYI+I R+     GLCGIA  ASYP+
Sbjct: 304 ---ENGQDYWIVRNSWGTTWGDAGYIKIARNFEDPKGLCGIAMLASYPI 349


>gi|222629922|gb|EEE62054.1| hypothetical protein OsJ_16838 [Oryza sativa Japonica Group]
          Length = 336

 Score =  305 bits (780), Expect = 3e-80,   Method: Compositional matrix adjust.
 Identities = 163/323 (50%), Positives = 205/323 (63%), Gaps = 20/323 (6%)

Query: 29  SGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGT 88
           +GR+  E SIV         + + Y    EK  R  +FK NL +I+  NK+   +Y LG 
Sbjct: 24  AGRNGGEFSIV--------GYRKAYASFEEKVRRFEVFKDNLNHIDDINKK-VTSYWLGL 74

Query: 89  NEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTD--VPTSIDWREKGAVTHI 146
           NEF+DLT++EF+A Y G   P    + +      F+Y  +++  VP  +DWR+K AVT +
Sbjct: 75  NEFADLTHDEFKATYLGLTPPPTRSNSKHYSSEEFRYGKMSNGEVPKEMDWRKKNAVTEV 134

Query: 147 KDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYI 205
           K+QGQCGSCWAFS VAAVEGI  I  G L  LSEQ+L+DCSTD N+GC+GGLMD AF YI
Sbjct: 135 KNQGQCGSCWAFSTVAAVEGINAIVTGNLTSLSEQELIDCSTDGNNGCNGGLMDYAFSYI 194

Query: 206 IENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCV 265
               GL TE  YPY  EEG CD  K  AV  TIS YED+P  DEQAL++A+++QPVSV +
Sbjct: 195 ASTGGLRTEEAYPYAMEEGDCDEGKGAAV-VTISGYEDVPANDEQALVKALAHQPVSVAI 253

Query: 266 DASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYI 325
           +ASGR F FY  GV +  CG   DHGV  VG+GT++   G  Y ++KNSWG  WGE GYI
Sbjct: 254 EASGRHFQFYSGGVFDGPCGEQLDHGVTAVGYGTSK---GQDYIIVKNSWGPHWGEKGYI 310

Query: 326 RILRDA----GLCGIATAASYPV 344
           R+ R      GLCGI   ASYP 
Sbjct: 311 RMKRGTGKGEGLCGINKMASYPT 333


>gi|225446523|ref|XP_002275891.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP2 [Vitis vinifera]
          Length = 358

 Score =  305 bits (780), Expect = 3e-80,   Method: Compositional matrix adjust.
 Identities = 151/348 (43%), Positives = 214/348 (61%), Gaps = 20/348 (5%)

Query: 9   FIIPMFVIIILVITCASQVVSGRSMHEP------SIVEKHEQWMAQHGRTYKDELEKAMR 62
           F   ++  ++++ T      +    HEP       + +++E+W+ QHGR YK+  E    
Sbjct: 6   FCRNVYFALLIMWTVGVSWSAFSEEHEPMESEMSDMEKRYERWLVQHGRRYKNRDEWQRH 65

Query: 63  LNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPST 122
             I++ N+ +I   N + N ++ L  N+F+D+TNEE++ALY G      S   QSS    
Sbjct: 66  FGIYQSNVRFINYINAQ-NFSFTLTDNQFADMTNEEYKALYMGLGTSETSRKNQSS---- 120

Query: 123 FKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQ 182
           FK +    +P S+DWR+ GAVT +++QG+CGSCWAFS VAAVEGI +I  GKL+ LSEQ+
Sbjct: 121 FKRERSKVLPISVDWRKMGAVTPVRNQGECGSCWAFSTVAAVEGINKIRTGKLVSLSEQE 180

Query: 183 LVDCSTD--NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISK 240
           L+DC  D  N GC+GG M  AF++I +N G+ T  +YPY  E+G C+  K       IS 
Sbjct: 181 LLDCDIDSGNEGCNGGYMVNAFKFIKQNGGITTARNYPYIGEQGICNKDKAANHVVKISG 240

Query: 241 YEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTA 300
           YE +P  +E+ L  AV+ QPVSV +DA G  F  Y  G+ N  CG   +H V V+G+G  
Sbjct: 241 YETVPPNNEKILQAAVAKQPVSVAIDAGGYEFQLYSKGIFNGFCGKQLNHAVTVIGYG-- 298

Query: 301 EEENGAKYWLIKNSWGETWGESGYIRILRDA----GLCGIATAASYPV 344
            E+NG KYWL+KNSWG  WGE+GY R++RD+    G+CGIA  ASYP+
Sbjct: 299 -EDNGKKYWLVKNSWGTGWGEAGYARMIRDSRDDEGICGIAMEASYPI 345


>gi|302143380|emb|CBI21941.3| unnamed protein product [Vitis vinifera]
          Length = 354

 Score =  304 bits (778), Expect = 5e-80,   Method: Compositional matrix adjust.
 Identities = 151/348 (43%), Positives = 214/348 (61%), Gaps = 20/348 (5%)

Query: 9   FIIPMFVIIILVITCASQVVSGRSMHEP------SIVEKHEQWMAQHGRTYKDELEKAMR 62
           F   ++  ++++ T      +    HEP       + +++E+W+ QHGR YK+  E    
Sbjct: 2   FCRNVYFALLIMWTVGVSWSAFSEEHEPMESEMSDMEKRYERWLVQHGRRYKNRDEWQRH 61

Query: 63  LNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPST 122
             I++ N+ +I   N + N ++ L  N+F+D+TNEE++ALY G      S   QSS    
Sbjct: 62  FGIYQSNVRFINYINAQ-NFSFTLTDNQFADMTNEEYKALYMGLGTSETSRKNQSS---- 116

Query: 123 FKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQ 182
           FK +    +P S+DWR+ GAVT +++QG+CGSCWAFS VAAVEGI +I  GKL+ LSEQ+
Sbjct: 117 FKRERSKVLPISVDWRKMGAVTPVRNQGECGSCWAFSTVAAVEGINKIRTGKLVSLSEQE 176

Query: 183 LVDCSTD--NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISK 240
           L+DC  D  N GC+GG M  AF++I +N G+ T  +YPY  E+G C+  K       IS 
Sbjct: 177 LLDCDIDSGNEGCNGGYMVNAFKFIKQNGGITTARNYPYIGEQGICNKDKAANHVVKISG 236

Query: 241 YEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTA 300
           YE +P  +E+ L  AV+ QPVSV +DA G  F  Y  G+ N  CG   +H V V+G+G  
Sbjct: 237 YETVPPNNEKILQAAVAKQPVSVAIDAGGYEFQLYSKGIFNGFCGKQLNHAVTVIGYG-- 294

Query: 301 EEENGAKYWLIKNSWGETWGESGYIRILRDA----GLCGIATAASYPV 344
            E+NG KYWL+KNSWG  WGE+GY R++RD+    G+CGIA  ASYP+
Sbjct: 295 -EDNGKKYWLVKNSWGTGWGEAGYARMIRDSRDDEGICGIAMEASYPI 341


>gi|146215980|gb|ABQ10192.1| actinidin Act2a [Actinidia deliciosa]
          Length = 378

 Score =  304 bits (778), Expect = 5e-80,   Method: Compositional matrix adjust.
 Identities = 155/339 (45%), Positives = 206/339 (60%), Gaps = 17/339 (5%)

Query: 13  MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEY 72
           +F   +L+++ A  + +        ++  +E W+ +HG++Y    EK MR  IFK+NL  
Sbjct: 13  LFFSTLLILSSAIDIENSVQRTNDQVMAMYESWLVEHGKSYNSLDEKEMRFEIFKENLRI 72

Query: 73  IEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNR-PVPSVSRQSSRPSTFKYQNVTD- 130
           I+  N + NR+Y LG N F+DLT+EE+R+ Y G  R P   VS Q           V D 
Sbjct: 73  IDDHNADANRSYSLGLNRFADLTDEEYRSTYLGLKRGPKTDVSNQ-------YMPKVGDA 125

Query: 131 VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD- 189
           +P  +DWR  GAV  +K+QG C SCWAFSAVAAVEGI +I  G LI LSEQ+LVDC    
Sbjct: 126 LPDYVDWRTVGAVVGVKNQGLCSSCWAFSAVAAVEGINKIVTGNLISLSEQELVDCGRTQ 185

Query: 190 -NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGD 248
              GC+ GLM  AF++II N G+ TE +YPY  ++G C+   +     TI  Y+++P  +
Sbjct: 186 ITKGCNRGLMTDAFKFIINNGGINTENNYPYTAKDGQCNLSLKNQKYVTIDSYKNVPSNN 245

Query: 249 EQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKY 308
           E AL +AV+ QPVSV V++ G  F  Y SG+    CG   DHGV +VG+GT   E G  Y
Sbjct: 246 EMALKKAVAYQPVSVGVESEGGKFKLYTSGIFTGSCGTAVDHGVTIVGYGT---ERGMDY 302

Query: 309 WLIKNSWGETWGESGYIRILRD---AGLCGIATAASYPV 344
           W++KNSWG  WGESGYIRI R+   AG CGIA   SYPV
Sbjct: 303 WIVKNSWGTNWGESGYIRIQRNIGGAGKCGIAKMPSYPV 341


>gi|115477767|ref|NP_001062479.1| Os08g0556900 [Oryza sativa Japonica Group]
 gi|42407937|dbj|BAD09076.1| putative cysteine proteinase [Oryza sativa Japonica Group]
 gi|113624448|dbj|BAF24393.1| Os08g0556900 [Oryza sativa Japonica Group]
 gi|125562525|gb|EAZ07973.1| hypothetical protein OsI_30231 [Oryza sativa Indica Group]
 gi|215701458|dbj|BAG92882.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 385

 Score =  304 bits (778), Expect = 5e-80,   Method: Compositional matrix adjust.
 Identities = 157/317 (49%), Positives = 197/317 (62%), Gaps = 11/317 (3%)

Query: 35  EPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDL 94
           E ++ E +E+W  QH R  +D  EKA R N+FK N+  I + N+  +  YKL  N F D+
Sbjct: 41  EEALWELYERWRGQH-RVARDLGEKARRFNVFKDNVRLIHEFNRR-DEPYKLRLNRFGDM 98

Query: 95  TNEEFRALYTGYNRPVPSVSR-QSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCG 153
           T +EFR  Y         + R +  R S F Y    D+P ++DWREKGAV  +KDQGQCG
Sbjct: 99  TADEFRRAYASSRVSHHRMFRGRGERRSGFMYAGARDLPAAVDWREKGAVGAVKDQGQCG 158

Query: 154 SCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD--NHGCSGGLMDKAFEYIIENKGL 211
           SCWAFS +AAVEGI  I    L  LSEQQLVDC T   N GC GGLMD AF+YI ++ G+
Sbjct: 159 SCWAFSTIAAVEGINAIRTSNLTALSEQQLVDCDTKTGNAGCDGGLMDNAFQYIAKHGGV 218

Query: 212 ATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRA 271
           A  + YPYR  + +C +    + A TI  YED+P   E AL +AV+NQPVSV ++A G  
Sbjct: 219 AASSAYPYRARQSSCKSSAASSPAVTIDGYEDVPANSESALKKAVANQPVSVAIEAGGSH 278

Query: 272 FHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRDA 331
           F FY  GV    CG   DHGVA VG+GT  +  G KYW+++NSWG  WGE GYIR+ RD 
Sbjct: 279 FQFYSEGVFAGKCGTELDHGVAAVGYGTTVD--GTKYWIVRNSWGADWGEKGYIRMKRDV 336

Query: 332 ----GLCGIATAASYPV 344
               GLCGIA  ASYP+
Sbjct: 337 SAKEGLCGIAMEASYPI 353


>gi|38345906|emb|CAE04498.2| OSJNBb0059K02.8 [Oryza sativa Japonica Group]
          Length = 458

 Score =  303 bits (777), Expect = 6e-80,   Method: Compositional matrix adjust.
 Identities = 156/327 (47%), Positives = 206/327 (62%), Gaps = 16/327 (4%)

Query: 27  VVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKAN---KEGNRT 83
           +VS     E      + +W A+HG++Y    E+  R   F+ NL YI++ N     G  +
Sbjct: 25  IVSYGERSEEEARRLYAEWKAEHGKSYNAVGEEERRYAAFRDNLRYIDEHNAAADAGVHS 84

Query: 84  YKLGTNEFSDLTNEEFRALYTGY-NRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGA 142
           ++LG N F+DLTNEE+R  Y G  N+P     R+      +   +   +P S+DWR KGA
Sbjct: 85  FRLGLNRFADLTNEEYRDTYLGLRNKP----RRERKVSDRYLAADNEALPESVDWRTKGA 140

Query: 143 VTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKA 201
           V  IKDQ   GSCWAFSA+AAVEGI QI  G LI LSEQ+LVDC T  N GC+GGLMD A
Sbjct: 141 VAEIKDQEVAGSCWAFSAIAAVEGINQIVTGDLISLSEQELVDCDTSYNEGCNGGLMDYA 200

Query: 202 FEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPV 261
           F++II N G+ TE DYPY+ ++  CD  ++ A   TI  YED+    E +L +AV+NQPV
Sbjct: 201 FDFIINNGGIDTEDDYPYKGKDERCDVNRKNAKVVTIDSYEDVTPNSETSLQKAVANQPV 260

Query: 262 SVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGE 321
           SV ++A GRAF  Y SG+    CG   DHGVA VG+GT   ENG  YW+++NSWG++WGE
Sbjct: 261 SVAIEAGGRAFQLYSSGIFTGKCGTALDHGVAAVGYGT---ENGKDYWIVRNSWGKSWGE 317

Query: 322 SGYIRILRD----AGLCGIATAASYPV 344
           SGY+R+ R+    +G CGIA   SYP+
Sbjct: 318 SGYVRMERNIKASSGKCGIAVEPSYPL 344


>gi|600111|emb|CAA84378.1| cysteine proteinase [Vicia sativa]
          Length = 359

 Score =  303 bits (775), Expect = 9e-80,   Method: Compositional matrix adjust.
 Identities = 162/351 (46%), Positives = 223/351 (63%), Gaps = 18/351 (5%)

Query: 3   LKFEKSFIIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMR 62
           ++ +K   I + + +I  +             E S+   +E+W + H  T ++  EK  R
Sbjct: 1   MEMKKLLFISLSLALIFTVANTFDFNEHDLESEKSLWNLYERWRSHHTVT-RNLDEKHNR 59

Query: 63  LNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYT----GYNRPVPSVSRQSS 118
            N+FK N+ ++   NK  ++ YKL  N+F D+TN EFR +Y      ++R    +S ++ 
Sbjct: 60  FNVFKANVMHVHNTNKL-DKPYKLKLNKFGDMTNYEFRRIYADSKISHHRMFRGMSHENG 118

Query: 119 RPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIEL 178
              TF Y+N  DVP+SIDWR KGAVT +KDQGQCGSCWAFS +AAVEGI QI   KL+ L
Sbjct: 119 ---TFMYENAVDVPSSIDWRNKGAVTGVKDQGQCGSCWAFSTIAAVEGINQIKTQKLVSL 175

Query: 179 SEQQLVDCST-DNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAAT 237
           SEQQLVDC T +N GC+GGLM+ AFE+I +N G+ TE++YPY  ++GTCD +KE   A +
Sbjct: 176 SEQQLVDCDTEENEGCNGGLMEYAFEFIKQN-GITTESNYPYAAKDGTCDVEKEDK-AVS 233

Query: 238 ISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGF 297
           I  +E++P  +E ALL+A + QPVSV +DA G  F FY  GV    C  + +HGVA+VG+
Sbjct: 234 IDGHENVPINNEAALLKAAAKQPVSVAIDAGGYNFQFYSEGVFTGHCDTDLNHGVAIVGY 293

Query: 298 GTAEEENGAKYWLIKNSWGETWGESGYIRILRD----AGLCGIATAASYPV 344
           G  ++    KYW++KNSWG  WGE GYIR+ R      GLCGIA  ASYP+
Sbjct: 294 GVTQDR--TKYWIMKNSWGSEWGEQGYIRMQRGISSREGLCGIAMEASYPI 342


>gi|297603535|ref|NP_001054211.2| Os04g0670200 [Oryza sativa Japonica Group]
 gi|109939735|sp|P25777.2|ORYB_ORYSJ RecName: Full=Oryzain beta chain; Flags: Precursor
 gi|32488398|emb|CAE02823.1| OSJNBa0043A12.28 [Oryza sativa Japonica Group]
 gi|90399163|emb|CAJ86092.1| H0818H01.14 [Oryza sativa Indica Group]
 gi|125550169|gb|EAY95991.1| hypothetical protein OsI_17862 [Oryza sativa Indica Group]
 gi|215766596|dbj|BAG98700.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|255675868|dbj|BAF16125.2| Os04g0670200 [Oryza sativa Japonica Group]
          Length = 466

 Score =  303 bits (775), Expect = 1e-79,   Method: Compositional matrix adjust.
 Identities = 147/313 (46%), Positives = 208/313 (66%), Gaps = 17/313 (5%)

Query: 42  HEQWMAQHGRTYKDEL--EKAMRLNIFKQNLEYIEKANKEGNRT--YKLGTNEFSDLTNE 97
           ++ W+A++G    + L  E   R  +F  NL++++  N   +    ++LG N F+DLTNE
Sbjct: 52  YDLWLAENGGGSPNALGGEHERRFLVFWDNLKFVDAHNARADERGGFRLGMNRFADLTNE 111

Query: 98  EFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWA 157
           EFRA + G         R  +    +++  V ++P S+DWREKGAV  +K+QGQCGSCWA
Sbjct: 112 EFRATFLG----AKVAERSRAAGERYRHDGVEELPESVDWREKGAVAPVKNQGQCGSCWA 167

Query: 158 FSAVAAVEGITQITRGKLIELSEQQLVDCSTD--NHGCSGGLMDKAFEYIIENKGLATEA 215
           FSAV+ VE I Q+  G++I LSEQ+LV+CST+  N GC+GGLMD AF++II+N G+ TE 
Sbjct: 168 FSAVSTVESINQLVTGEMITLSEQELVECSTNGQNSGCNGGLMDDAFDFIIKNGGIDTED 227

Query: 216 DYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFY 275
           DYPY+  +G CD  +E A   +I  +ED+P+ DE++L +AV++QPVSV ++A GR F  Y
Sbjct: 228 DYPYKAVDGKCDINRENAKVVSIDGFEDVPQNDEKSLQKAVAHQPVSVAIEAGGREFQLY 287

Query: 276 KSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD----A 331
            SGV +  CG + DHGV  VG+GT   +NG  YW+++NSWG  WGESGY+R+ R+     
Sbjct: 288 HSGVFSGRCGTSLDHGVVAVGYGT---DNGKDYWIVRNSWGPKWGESGYVRMERNINVTT 344

Query: 332 GLCGIATAASYPV 344
           G CGIA  ASYP 
Sbjct: 345 GKCGIAMMASYPT 357


>gi|2463586|dbj|BAA22545.1| FB22 precursor [Ananas comosus]
          Length = 340

 Score =  303 bits (775), Expect = 1e-79,   Method: Compositional matrix adjust.
 Identities = 146/339 (43%), Positives = 210/339 (61%), Gaps = 20/339 (5%)

Query: 13  MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEY 72
           +F+ + L +  AS   + R      ++++ E+WMA++GR YKD  EK  R  IFK N+ +
Sbjct: 8   VFLFLFLCVMWASPSAASRDEPSDPMMKRFEEWMAEYGRVYKDNDEKMRRFQIFKNNVNH 67

Query: 73  IEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPV-----PSVSRQSSRPSTFKYQN 127
           IE  N     +Y LG N+F+D+TN EF   YTG + P+     P VS        F   N
Sbjct: 68  IETFNNRNGNSYTLGINKFTDMTNNEFVTQYTGVSLPLNFKREPVVS--------FDDVN 119

Query: 128 VTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCS 187
           ++ V  SIDWR+ GAVT +KDQ  CGSCWAFSA+A VEGI +I  G L+ LSEQ+++DC+
Sbjct: 120 ISAVGQSIDWRDYGAVTEVKDQNPCGSCWAFSAIATVEGIYKIVTGYLVSLSEQEVLDCA 179

Query: 188 TDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKG 247
             N GC GG +D A+++II N G+A+EADYPY+  EG C        +A I+ Y  +   
Sbjct: 180 VSN-GCDGGFVDNAYDFIISNNGVASEADYPYQAYEGDC-TANSWPNSAYITGYSYVRSN 237

Query: 248 DEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAK 307
           DE ++  AV NQP++  +DASG  F +Y  GV +  CG + +H + ++G+G  ++ +G +
Sbjct: 238 DESSMKYAVWNQPIAAAIDASGDNFQYYNGGVFSGPCGTSLNHAITIIGYG--QDSSGTQ 295

Query: 308 YWLIKNSWGETWGESGYIRILR---DAGLCGIATAASYP 343
           YW++KNSWG +WGE GY+R+ R    +GLCGIA    YP
Sbjct: 296 YWIVKNSWGSSWGERGYVRMARGVSSSGLCGIAMDPLYP 334


>gi|38345188|emb|CAE03344.2| OSJNBb0005B05.11 [Oryza sativa Japonica Group]
 gi|125589403|gb|EAZ29753.1| hypothetical protein OsJ_13812 [Oryza sativa Japonica Group]
          Length = 323

 Score =  303 bits (775), Expect = 1e-79,   Method: Compositional matrix adjust.
 Identities = 155/344 (45%), Positives = 217/344 (63%), Gaps = 39/344 (11%)

Query: 13  MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEY 72
           +F I+  +  C++ + +     + ++  +HE+WMAQ+GR YKD+ EKA R  +FK N+ +
Sbjct: 8   LFAILGCLCLCSAVLAARELSDDAAMAARHERWMAQYGRMYKDDAEKARRFEVFKANVAF 67

Query: 73  IEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT--D 130
           IE  N  GN  + LG N+F+DLTN+EFR+  T     +PS +R    P+ F+ +NV    
Sbjct: 68  IESFNA-GNHKFWLGVNQFADLTNDEFRSTKTNKGF-IPSTTRV---PTGFRNENVNIDA 122

Query: 131 VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST-- 188
           +P ++DWR KG VT IKDQGQCG CWAFSAVAA+E                +LVDC    
Sbjct: 123 LPATMDWRTKGVVTPIKDQGQCGCCWAFSAVAAME----------------ELVDCDVHG 166

Query: 189 DNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVA---ATISKYEDLP 245
           ++ GC GGLMD AF++II+N GL TE++YPY   +      K K+V+   A+I  YED+P
Sbjct: 167 EDQGCEGGLMDDAFKFIIKNGGLTTESNYPYAAVD-----DKFKSVSNSVASIKGYEDVP 221

Query: 246 KGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENG 305
             +E AL++AV+NQPVSV VD     F FYK GV+   CG + DHG+  +G+G A +  G
Sbjct: 222 ANNEAALMKAVANQPVSVAVDGGDMTFQFYKGGVMTGSCGTDLDHGIVAIGYGKASD--G 279

Query: 306 AKYWLIKNSWGETWGESGYIRILRDA----GLCGIATAASYPVA 345
            KYWL+KNSWG TWGE+G++R+ +D     G+CG+A   SYP A
Sbjct: 280 TKYWLLKNSWGMTWGENGFLRMEKDISDKRGMCGLAMEPSYPTA 323


>gi|312282059|dbj|BAJ33895.1| unnamed protein product [Thellungiella halophila]
          Length = 379

 Score =  301 bits (772), Expect = 2e-79,   Method: Compositional matrix adjust.
 Identities = 143/307 (46%), Positives = 197/307 (64%), Gaps = 9/307 (2%)

Query: 43  EQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRAL 102
           E W+ +HG+ Y    EK  RL IFK NL +I   N E N  Y+LG N F+DL+  E++ +
Sbjct: 65  ESWIVKHGKVYDSVAEKERRLTIFKDNLRFITNRNSE-NLGYRLGLNRFADLSLHEYKEI 123

Query: 103 YTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVA 162
             G +   P      S    +K      +P S+DWR +GAVT +KDQG C SCWAFS V 
Sbjct: 124 CHGADPKPPRNHVFMSSSDRYKTSAGDVLPKSVDWRNEGAVTEVKDQGHCRSCWAFSTVG 183

Query: 163 AVEGITQITRGKLIELSEQQLVDCSTDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHE 222
           AVEG+ +I  G+L+ LSEQ L++C+ +N+GC GG ++ A+E+I+ N GL T+ DYPY+  
Sbjct: 184 AVEGLNKIVTGELVTLSEQDLINCNKENNGCGGGKVETAYEFIVSNGGLGTDNDYPYKAV 243

Query: 223 EGTCDNQ-KEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLN 281
            G CD + KE      I  YE+LP  DE AL++AV++QPV+  +D+S R F  Y+SGV +
Sbjct: 244 NGACDGRLKENIKNVMIDGYENLPANDELALMKAVAHQPVTAVIDSSSREFQLYESGVFD 303

Query: 282 ADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRDA----GLCGIA 337
             CG N +HGV VVG+GT   ENG  YW+++NSWG TWGE+GY+++ R+     GLCGIA
Sbjct: 304 GRCGTNLNHGVVVVGYGT---ENGRNYWIVRNSWGNTWGEAGYMKMARNIANPRGLCGIA 360

Query: 338 TAASYPV 344
              SYP+
Sbjct: 361 MRVSYPL 367


>gi|357143305|ref|XP_003572875.1| PREDICTED: xylem cysteine proteinase 1-like [Brachypodium
           distachyon]
          Length = 473

 Score =  301 bits (772), Expect = 2e-79,   Method: Compositional matrix adjust.
 Identities = 145/312 (46%), Positives = 204/312 (65%), Gaps = 12/312 (3%)

Query: 38  IVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNE 97
           +V+    W  +H + Y    EK  R  +FKQNL++I + N+  N +Y LG N+F+D+ +E
Sbjct: 44  LVDLFSSWSVKHSKIYVSPEEKVKRYEVFKQNLKHIVETNRR-NGSYWLGLNQFADVAHE 102

Query: 98  EFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWA 157
           EF++ Y G    +   +R    P+ F+Y+N  ++P S+DWR+KGAVT +K+QG+CGSCWA
Sbjct: 103 EFKSTYLGLKTGMDGPARA---PTAFRYENSVNLPWSVDWRKKGAVTPVKNQGECGSCWA 159

Query: 158 FSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGLATEAD 216
           FS VAAVEGI QI  GKL  LSEQ+L+DC T  +HGC GG MD AF YI+ N G+ T+ D
Sbjct: 160 FSTVAAVEGINQIATGKLESLSEQELMDCDTTFDHGCGGGFMDFAFAYIMGNLGIHTDDD 219

Query: 217 YPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYK 276
           YPY  EEG C  ++ ++   TIS YED+P+  E +LL+A+++QP+SV + A  + F FYK
Sbjct: 220 YPYLMEEGYCKEKQPQSKVVTISGYEDVPENSEVSLLKALAHQPISVGIAAGSKDFQFYK 279

Query: 277 SGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRDA----G 332
            GV    CG   DH +  VG+G++   +G  Y ++KNSWG++WGE GY RI R      G
Sbjct: 280 RGVFEGSCGTELDHALTAVGYGSS---DGQDYIIMKNSWGKSWGEQGYFRIKRGTGKPEG 336

Query: 333 LCGIATAASYPV 344
           +C I + ASYP 
Sbjct: 337 VCSIYSMASYPT 348


>gi|302763831|ref|XP_002965337.1| hypothetical protein SELMODRAFT_230602 [Selaginella moellendorffii]
 gi|300167570|gb|EFJ34175.1| hypothetical protein SELMODRAFT_230602 [Selaginella moellendorffii]
          Length = 343

 Score =  301 bits (772), Expect = 2e-79,   Method: Compositional matrix adjust.
 Identities = 152/343 (44%), Positives = 217/343 (63%), Gaps = 23/343 (6%)

Query: 15  VIIILVITCASQVV--------SGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIF 66
            +I+LV+  A+            GR++    I    E W A+HG++Y  +LEKA RL IF
Sbjct: 9   TLILLVVVGATPFAIARPAALEDGRALE---IKNMFEDWAAKHGKSYSSDLEKARRLMIF 65

Query: 67  KQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTG-YNRPVPSVSRQSSRPSTFKY 125
              L YIEK N + N T+ LG N+FSDLTN EFRA++ G + RP      Q   P+  + 
Sbjct: 66  SDTLAYIEKHNAQPNTTFTLGLNKFSDLTNAEFRAMHVGKFKRP----RYQDRLPAEDED 121

Query: 126 QNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVD 185
            +V+ +PTS+DWR+KGAVT IKDQG CGSCWAFSA+A++E    +   +L+ LSEQQL+D
Sbjct: 122 VDVSSLPTSLDWRQKGAVTPIKDQGDCGSCWAFSAIASIESAHFLATKELVSLSEQQLMD 181

Query: 186 CSTDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAV--AATISKYED 243
           C T + GC GGLM+ AF+++++N G+ TEA YPY    G+C+  K   +   A I+ ++ 
Sbjct: 182 CDTVDAGCDGGLMETAFKFVVKNGGVTTEASYPYTGSVGSCNANKVAIINKVAEITGFKV 241

Query: 244 LPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEE 303
           + +    AL++AVS  PV+V +  S   F  YKSG+L+  CG++ DHGV ++G+GT   E
Sbjct: 242 VTEDSADALMKAVSKTPVTVSICGSDENFQNYKSGILSGQCGDSLDHGVLLIGYGT---E 298

Query: 304 NGAKYWLIKNSWGETWGESGYIRILRD--AGLCGIATAASYPV 344
            G  YW+IKNSWG +WGE G+++I R    G+CG+   +SYP 
Sbjct: 299 GGMPYWIIKNSWGTSWGEDGFMKIERKDGDGICGMNGDSSYPT 341


>gi|2342494|dbj|BAA21848.1| bromelain [Ananas comosus]
 gi|2463582|dbj|BAA22543.1| FB31 precursor [Ananas comosus]
          Length = 352

 Score =  301 bits (772), Expect = 3e-79,   Method: Compositional matrix adjust.
 Identities = 145/335 (43%), Positives = 212/335 (63%), Gaps = 11/335 (3%)

Query: 13  MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEY 72
           +F+ + L +  AS   + R      ++++ E+WMA++GR YKD  EK  R  IFK N+ +
Sbjct: 8   VFLFLFLCVMWASPSAASRDEPSDPMMKRFEEWMAEYGRVYKDNDEKMRRFQIFKNNVNH 67

Query: 73  IEKANKEGNRTYKLGTNEFSDLTNEEFRALYTG-YNRPVPSVSRQSSRPSTFKYQNVTDV 131
           IE  N     +Y LG N+F+D+TN EF A YTG  +RP+   + +     +F   N++ V
Sbjct: 68  IETFNNRNGNSYTLGINKFTDMTNNEFVAQYTGGISRPL---NIEKEPVVSFDDVNISAV 124

Query: 132 PTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTDNH 191
             SIDWR+ GAVT +KDQ  CGSCWAFSA+A VEGI +I  G L+ LSEQ+++DC+  N 
Sbjct: 125 GQSIDWRDYGAVTEVKDQNPCGSCWAFSAIATVEGIYKIVTGYLVSLSEQEVLDCAVSN- 183

Query: 192 GCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQA 251
           GC GG +D A+++II N G+A+EADYPY+  +G C        +A I+ Y  +   DE +
Sbjct: 184 GCDGGFVDNAYDFIISNNGVASEADYPYQAYQGDCAANSWPN-SAYITGYSYVRSNDESS 242

Query: 252 LLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLI 311
           +  AV NQP++  +DASG  F +Y  GV +  CG + +H + ++G+G  ++ +G +YW++
Sbjct: 243 MKYAVWNQPIAAAIDASGDNFQYYNGGVFSGPCGTSLNHAITIIGYG--QDSSGTQYWIV 300

Query: 312 KNSWGETWGESGYIRILR---DAGLCGIATAASYP 343
           KNSWG +WGE GYIR+ R    +GLCGIA    YP
Sbjct: 301 KNSWGSSWGERGYIRMARGVSSSGLCGIAMDPLYP 335


>gi|30685308|ref|NP_566634.2| putative cysteine proteinase [Arabidopsis thaliana]
 gi|30315949|sp|Q9LT77.1|CPR1_ARATH RecName: Full=Probable cysteine proteinase At3g19400; Flags:
           Precursor
 gi|11994462|dbj|BAB02464.1| cysteine proteinase [Arabidopsis thaliana]
 gi|332642715|gb|AEE76236.1| putative cysteine proteinase [Arabidopsis thaliana]
          Length = 362

 Score =  301 bits (771), Expect = 3e-79,   Method: Compositional matrix adjust.
 Identities = 148/318 (46%), Positives = 208/318 (65%), Gaps = 14/318 (4%)

Query: 34  HEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSD 93
           +E  +   +EQW+ ++ + Y    EK  R  IFK NL+++++ N   +RT+++G   F+D
Sbjct: 36  NETEVRLMYEQWLVENRKNYNGLGEKERRFKIFKDNLKFVDEHNSVPDRTFEVGLTRFAD 95

Query: 94  LTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCG 153
           LTNEEFRA+Y    R     ++ S +   + Y+    +P  +DWR  GAV  +KDQG CG
Sbjct: 96  LTNEEFRAIYL---RKKMERTKDSVKTERYLYKEGDVLPDEVDWRANGAVVSVKDQGNCG 152

Query: 154 SCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD--NHGCSGGLMDKAFEYIIENKGL 211
           SCWAFSAV AVEGI QIT G+LI LSEQ+LVDC     N GC GG+M+ AFE+I++N G+
Sbjct: 153 SCWAFSAVGAVEGINQITTGELISLSEQELVDCDRGFVNAGCDGGIMNYAFEFIMKNGGI 212

Query: 212 ATEADYPYR-HEEGTCDNQKEKAV-AATISKYEDLPKGDEQALLQAVSNQPVSVCVDASG 269
            T+ DYPY  ++ G C+  K       TI  YED+P+ DE++L +AV++QPVSV ++AS 
Sbjct: 213 ETDQDYPYNANDLGLCNADKNNNTRVVTIDGYEDVPRDDEKSLKKAVAHQPVSVAIEASS 272

Query: 270 RAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILR 329
           +AF  YKSGV+   CG + DHGV VVG+G+    +G  YW+I+NSWG  WG+SGY+++ R
Sbjct: 273 QAFQLYKSGVMTGTCGISLDHGVVVVGYGST---SGEDYWIIRNSWGLNWGDSGYVKLQR 329

Query: 330 DA----GLCGIATAASYP 343
           +     G CGIA   SYP
Sbjct: 330 NIDDPFGKCGIAMMPSYP 347


>gi|90399361|emb|CAJ86180.1| H0212B02.7 [Oryza sativa Indica Group]
          Length = 470

 Score =  301 bits (770), Expect = 4e-79,   Method: Compositional matrix adjust.
 Identities = 158/339 (46%), Positives = 207/339 (61%), Gaps = 28/339 (8%)

Query: 27  VVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKAN---KEGNRT 83
           +VS     E      + +W A+HG+ Y    E+  R   F+ NL YI++ N     G  +
Sbjct: 25  IVSYGERSEEEARRLYAEWKAEHGKNYNAVGEEERRYAAFRDNLRYIDEHNAAADAGVHS 84

Query: 84  YKLGTNEFSDLTNEEFRALYTGY-NRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGA 142
           ++LG N F+DLTNEE+R  Y G  N+P     R+      +   +   +P S+DWR KGA
Sbjct: 85  FRLGLNRFADLTNEEYRDTYLGLRNKP----RRERKVSDRYLAADNEALPESVDWRTKGA 140

Query: 143 VTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKA 201
           V  IKDQG CGSCWAFSA+AAVEGI QI  G LI LSEQ+LVDC T  N GC+GGLMD A
Sbjct: 141 VAEIKDQGGCGSCWAFSAIAAVEGINQIVTGDLISLSEQELVDCDTSYNEGCNGGLMDYA 200

Query: 202 FEYIIENKGLATEADYPYRHEEGTCDNQK------------EKAVAATISKYEDLPKGDE 249
           F++II N G+ TE DYPY+ ++  CD  +            + A   TI  YED+    E
Sbjct: 201 FDFIINNGGIDTEDDYPYKGKDERCDVNRVSFVFFAPLVFQKNAKVVTIDSYEDVTPNSE 260

Query: 250 QALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYW 309
            +L +AV+NQPVSV ++A GRAF  Y SG+    CG   DHGVA VG+GT   ENG  YW
Sbjct: 261 TSLQKAVANQPVSVAIEAGGRAFQLYSSGIFTGKCGTALDHGVAAVGYGT---ENGKDYW 317

Query: 310 LIKNSWGETWGESGYIRILRD----AGLCGIATAASYPV 344
           +++NSWG++WGESGY+R+ R+    +G CGIA   SYP+
Sbjct: 318 IVRNSWGKSWGESGYVRMERNIKASSGKCGIAVEPSYPL 356


>gi|2463584|dbj|BAA22544.1| FBSB precursor [Ananas comosus]
          Length = 356

 Score =  301 bits (770), Expect = 4e-79,   Method: Compositional matrix adjust.
 Identities = 154/340 (45%), Positives = 223/340 (65%), Gaps = 21/340 (6%)

Query: 13  MFVIIILVITCASQVVSGRSMHEPS--IVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNL 70
           +F+ + L +  AS   S  S  EPS  ++++ E+WM ++GR YKD  EK  R  IFK N+
Sbjct: 8   VFLFLFLCVMWASP--SAASADEPSDPMMKRFEEWMVEYGRVYKDNDEKMRRFQIFKNNV 65

Query: 71  EYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYT-GYNRPVPSVSRQSSRPSTFKYQNVT 129
            +IE  N     +Y LG N+F+D+TN EF A YT G +RP+ ++ R+     +F   +++
Sbjct: 66  NHIETFNSRNENSYTLGINQFTDMTNNEFIAQYTGGISRPL-NIEREPV--VSFDDVDIS 122

Query: 130 DVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD 189
            VP SIDWR+ GAVT +K+Q  CG+CWAF+A+A VE I +I +G L  LSEQQ++DC+  
Sbjct: 123 AVPQSIDWRDYGAVTSVKNQNPCGACWAFAAIATVESIYKIKKGILEPLSEQQVLDCA-K 181

Query: 190 NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAV--AATISKYEDLPKG 247
            +GC GG   +AFE+II NKG+A+ A YPY+  +GTC   K   V  +A I+ Y  +P+ 
Sbjct: 182 GYGCKGGWEFRAFEFIISNKGVASGAIYPYKAAKGTC---KTNGVPNSAYITGYARVPRN 238

Query: 248 DEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAK 307
           +E +++ AVS QP++V VDA+   F +YKSGV N  CG + +H V  +G+G  ++ NG K
Sbjct: 239 NESSMMYAVSKQPITVAVDANAN-FQYYKSGVFNGPCGTSLNHAVTAIGYG--QDSNGKK 295

Query: 308 YWLIKNSWGETWGESGYIRILRD----AGLCGIATAASYP 343
           YW++KNSWG  WGE+GYIR+ RD    +G+CGIA  + YP
Sbjct: 296 YWIVKNSWGARWGEAGYIRMARDVSSSSGICGIAIDSLYP 335


>gi|26452046|dbj|BAC43113.1| putative cysteine proteinase RD21A precursor [Arabidopsis thaliana]
          Length = 362

 Score =  301 bits (770), Expect = 4e-79,   Method: Compositional matrix adjust.
 Identities = 148/318 (46%), Positives = 208/318 (65%), Gaps = 14/318 (4%)

Query: 34  HEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSD 93
           +E  +   +EQW+ ++ + Y    EK  R  IFK NL+++++ N   +RT+++G   F+D
Sbjct: 36  NETEVRLMYEQWLVENRKNYNGLGEKERRFKIFKDNLKFVDEHNSVPDRTFEVGLTRFAD 95

Query: 94  LTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCG 153
           LTNEEFRA+Y    R     ++ S +   + Y+    +P  +DWR  GAV  +KDQG CG
Sbjct: 96  LTNEEFRAIYL---RKKMERNKDSVKTERYLYKEGDVLPDEVDWRANGAVVSVKDQGNCG 152

Query: 154 SCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD--NHGCSGGLMDKAFEYIIENKGL 211
           SCWAFSAV AVEGI QIT G+LI LSEQ+LVDC     N GC GG+M+ AFE+I++N G+
Sbjct: 153 SCWAFSAVGAVEGINQITTGELISLSEQELVDCDRGFVNAGCDGGIMNYAFEFIMKNGGI 212

Query: 212 ATEADYPYR-HEEGTCDNQKEKAV-AATISKYEDLPKGDEQALLQAVSNQPVSVCVDASG 269
            T+ DYPY  ++ G C+  K       TI  YED+P+ DE++L +AV++QPVSV ++AS 
Sbjct: 213 ETDQDYPYNANDLGLCNADKNNNTRVVTIDGYEDVPRDDEKSLKKAVAHQPVSVAIEASS 272

Query: 270 RAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILR 329
           +AF  YKSGV+   CG + DHGV VVG+G+    +G  YW+I+NSWG  WG+SGY+++ R
Sbjct: 273 QAFQLYKSGVMTGTCGISLDHGVVVVGYGST---SGEDYWIIRNSWGLNWGDSGYVKLQR 329

Query: 330 DA----GLCGIATAASYP 343
           +     G CGIA   SYP
Sbjct: 330 NIDDPFGKCGIAMMPSYP 347


>gi|57118009|gb|AAW34136.1| cysteine protease gp3a [Zingiber officinale]
          Length = 475

 Score =  301 bits (770), Expect = 4e-79,   Method: Compositional matrix adjust.
 Identities = 161/346 (46%), Positives = 220/346 (63%), Gaps = 18/346 (5%)

Query: 10  IIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQN 69
           ++P   I+ L    A    +GRS  E  I+  +++W  +H     D+     RL +FK+N
Sbjct: 23  VVPPLDILTLSKQ-AWAAPAGRSDEEVRII--YQEWRVKHRPAENDQYVGDYRLEVFKEN 79

Query: 70  LEYIEKANKEGNR---TYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQ 126
           L ++++ N   +R    Y+LG N F+DLTNEE+RA +    R +  + R +S   + +Y+
Sbjct: 80  LRFVDEHNAAADRGEHAYRLGMNRFADLTNEEYRARFL---RDLSRLGRSTSGEISNQYR 136

Query: 127 -NVTDV-PTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLV 184
               DV P SIDWREKGAV  +K+QG+CGSCWAF+A+AAVEGI QI  G LI LSEQQLV
Sbjct: 137 LREGDVLPDSIDWREKGAVVAVKNQGRCGSCWAFAAIAAVEGINQIVTGDLISLSEQQLV 196

Query: 185 DCSTDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDL 244
           DCST N+GC GG   +AF+YII N G+ +E  YPY    GTC+  KE A   +I  Y ++
Sbjct: 197 DCSTRNYGCEGGWPYRAFQYIINNGGVNSEEHYPYTGTNGTCNTTKENAHVVSIDSYRNV 256

Query: 245 PKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEEN 304
           P  DE++L +A +NQP+SV +DASGR F  Y SG+    C  + +HGV VVG+GT   EN
Sbjct: 257 PSNDEKSLQKAAANQPISVGIDASGRNFQLYHSGIFTGSCNTSLNHGVTVVGYGT---EN 313

Query: 305 GAKYWLIKNSWGETWGESGYIRILRD----AGLCGIATAASYPVAI 346
           G  YW++KNSWGE WG SGYI + R+    +G CGIA + SYP+ +
Sbjct: 314 GNDYWIVKNSWGENWGNSGYILMERNIAESSGKCGIAISPSYPIKV 359


>gi|302790828|ref|XP_002977181.1| hypothetical protein SELMODRAFT_106402 [Selaginella moellendorffii]
 gi|300155157|gb|EFJ21790.1| hypothetical protein SELMODRAFT_106402 [Selaginella moellendorffii]
          Length = 337

 Score =  301 bits (770), Expect = 4e-79,   Method: Compositional matrix adjust.
 Identities = 151/341 (44%), Positives = 215/341 (63%), Gaps = 21/341 (6%)

Query: 15  VIIILVITCASQVV--------SGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIF 66
            +I+LV+  A+            GR++    I    E W A+HG++Y  + EKA RL IF
Sbjct: 5   TLILLVVVGATPFAIARPAALEDGRALE---IKNMFEDWAAKHGKSYSSDWEKARRLMIF 61

Query: 67  KQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTG-YNRPVPSVSRQSSRPSTFKY 125
              L YIEK N + N T+ LG N+FSDLTN EFRA++ G + RP      Q   P+  + 
Sbjct: 62  SDTLAYIEKHNAQPNTTFTLGLNKFSDLTNAEFRAMHVGKFKRP----RYQDRLPAEDED 117

Query: 126 QNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVD 185
            +V+ +PTS+DWR+KGAVT IKDQG CGSCWAFSA+A++E    +   +L+ LSEQQL+D
Sbjct: 118 VDVSSLPTSLDWRQKGAVTPIKDQGDCGSCWAFSAIASIESAHFLATKELVSLSEQQLMD 177

Query: 186 CSTDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLP 245
           C T + GC GGLM+ AF+++++N G+ TEA YPY    G+C+  K K   A I+ ++ + 
Sbjct: 178 CDTVDAGCDGGLMETAFKFVVKNGGVTTEAAYPYTGSVGSCNANKAKNKVAEITGFKVVT 237

Query: 246 KGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENG 305
           +    AL++AVS  PV+V +  S   F  YKSG+L+  C ++ DHGV ++G+GT   E G
Sbjct: 238 EDSADALMKAVSKTPVTVSICGSDENFQNYKSGILSGKCDDSLDHGVLLIGYGT---EGG 294

Query: 306 AKYWLIKNSWGETWGESGYIRILRD--AGLCGIATAASYPV 344
             YW+IKNSWG +WGE G+++I R    G+CG+   +SYP 
Sbjct: 295 MPYWIIKNSWGTSWGEDGFMKIERKDGDGMCGMNGDSSYPT 335


>gi|168017893|ref|XP_001761481.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162687165|gb|EDQ73549.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 471

 Score =  300 bits (769), Expect = 5e-79,   Method: Compositional matrix adjust.
 Identities = 153/319 (47%), Positives = 204/319 (63%), Gaps = 17/319 (5%)

Query: 35  EPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDL 94
           +  +++   QW+ +H R Y    EK  R  IFK NL YI   NK+  ++Y LG N+FSDL
Sbjct: 45  DDGMLDVFHQWLERHSRVYHSLSEKQRRFQIFKDNLHYIHNHNKQ-EKSYWLGLNKFSDL 103

Query: 95  TNEEFRALYTGYNRPVPSVS--RQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQC 152
           T++EFRALY G  RP       R   R   F Y++V      +DWR+KGAV+ +KDQG C
Sbjct: 104 THDEFRALYLGI-RPAGRAHGLRNGDR---FIYEDVV-AEEMVDWRKKGAVSDVKDQGSC 158

Query: 153 GSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST-DNHGCSGGLMDKAFEYIIENKGL 211
           GSCWAFSA+ +VEG+  I  G+LI LSEQ+LVDC    N GC+GGLMD AF++II+N G+
Sbjct: 159 GSCWAFSAIGSVEGVNAIVTGELISLSEQELVDCDRGQNQGCNGGLMDYAFDFIIKNGGI 218

Query: 212 ATEADYPYRHEEGTCDN-QKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGR 270
            TE DYPY+  +G CD  +KE +    I  Y+D+P   E +LL+AVS  PVSV ++A GR
Sbjct: 219 DTEEDYPYKATDGQCDEARKETSKVVVIDDYQDVPTKSESSLLKAVSKNPVSVAIEAGGR 278

Query: 271 AFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILR- 329
            F  Y+ GV    CG + DHGV  VG+GT  +++G  YW++KNSWG +WGE GYIR+ R 
Sbjct: 279 DFQHYQGGVFTGPCGTDLDHGVLAVGYGT--DDDGVNYWIVKNSWGPSWGEKGYIRMERM 336

Query: 330 ----DAGLCGIATAASYPV 344
                +G CGI    S+P+
Sbjct: 337 GSNSTSGKCGINIEPSFPI 355


>gi|297809385|ref|XP_002872576.1| hypothetical protein ARALYDRAFT_489965 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297318413|gb|EFH48835.1| hypothetical protein ARALYDRAFT_489965 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 371

 Score =  300 bits (768), Expect = 6e-79,   Method: Compositional matrix adjust.
 Identities = 143/307 (46%), Positives = 201/307 (65%), Gaps = 9/307 (2%)

Query: 43  EQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRAL 102
           E WM +HG+ Y+   EK  RL IF+ NL +I   N E N +Y+LG N F+DL+  E+  +
Sbjct: 57  ESWMVKHGKVYESVAEKERRLTIFEDNLRFITNRNAE-NLSYRLGLNRFADLSLHEYAQI 115

Query: 103 YTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVA 162
             G +   P      +  + +K  +   +P S+DWR +GAVT +KDQGQC SCWAFS V 
Sbjct: 116 CHGADPRPPRNHVFMTSSNRYKTSDGDVLPKSVDWRNEGAVTEVKDQGQCRSCWAFSTVG 175

Query: 163 AVEGITQITRGKLIELSEQQLVDCSTDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHE 222
           AVEG+ +I  G+L+ LSEQ L++C+ +N+GC GG ++ A+E+I+ N GL T+ DYPY+  
Sbjct: 176 AVEGLNKIVTGELVTLSEQDLINCNKENNGCGGGKVETAYEFIMNNGGLGTDNDYPYKAL 235

Query: 223 EGTCDNQ-KEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLN 281
            G C+++ KE      I  YE+LP  DE AL++AV++QPV+  VD+S R F  Y SGV +
Sbjct: 236 NGVCNDRLKENNKNVMIDGYENLPANDESALMKAVAHQPVTAVVDSSSREFQLYASGVFD 295

Query: 282 ADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRDA----GLCGIA 337
             CG N +HGV VVG+GT   ENG  YW+++NS G TWGE+GY+++ R+     GLCGIA
Sbjct: 296 GTCGTNLNHGVVVVGYGT---ENGRDYWIVRNSRGNTWGEAGYMKMARNIANPRGLCGIA 352

Query: 338 TAASYPV 344
             ASYP+
Sbjct: 353 MRASYPL 359


>gi|218183|dbj|BAA14403.1| oryzain beta precursor [Oryza sativa Japonica Group]
          Length = 471

 Score =  300 bits (767), Expect = 9e-79,   Method: Compositional matrix adjust.
 Identities = 146/313 (46%), Positives = 207/313 (66%), Gaps = 17/313 (5%)

Query: 42  HEQWMAQHGRTYKDEL--EKAMRLNIFKQNLEYIEKANKEGNRT--YKLGTNEFSDLTNE 97
           ++ W+A++G    + L  E   R  +F  NL++++  N   +    ++LG N F+DLTNE
Sbjct: 51  YDLWLAENGGGSPNALGGEHERRFLVFWDNLKFVDAHNARADEGGGFRLGMNRFADLTNE 110

Query: 98  EFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWA 157
           EFRA + G         R  +    +++  V ++P S+DWREKGAV  +K+QGQCGSCWA
Sbjct: 111 EFRATFLG----AKVAERSRAAGERYRHDGVEELPESVDWREKGAVAPVKNQGQCGSCWA 166

Query: 158 FSAVAAVEGITQITRGKLIELSEQQLVDCSTD--NHGCSGGLMDKAFEYIIENKGLATEA 215
           FSAV+ VE I Q+  G++I LSEQ+LV+CST+  N GC+GGLM  AF++II+N G+ TE 
Sbjct: 167 FSAVSTVESINQLVTGEMITLSEQELVECSTNGQNSGCNGGLMADAFDFIIKNGGIDTED 226

Query: 216 DYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFY 275
           DYPY+  +G CD  +E A   +I  +ED+P+ DE++L +AV++QPVSV ++A GR F  Y
Sbjct: 227 DYPYKAVDGKCDINRENAKVVSIDGFEDVPQNDEKSLQKAVAHQPVSVAIEAGGREFQLY 286

Query: 276 KSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD----A 331
            SGV +  CG + DHGV  VG+GT   +NG  YW+++NSWG  WGESGY+R+ R+     
Sbjct: 287 HSGVFSGRCGTSLDHGVVAVGYGT---DNGKDYWIVRNSWGPKWGESGYVRMERNINVTT 343

Query: 332 GLCGIATAASYPV 344
           G CGIA  ASYP 
Sbjct: 344 GKCGIAMMASYPT 356


>gi|1514953|dbj|BAA11170.1| cysteine proteinase [Oryza sativa (japonica cultivar-group)]
          Length = 368

 Score =  300 bits (767), Expect = 1e-78,   Method: Compositional matrix adjust.
 Identities = 153/329 (46%), Positives = 203/329 (61%), Gaps = 12/329 (3%)

Query: 23  CASQVVSGRSMH-EPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGN 81
           CA+     R +  + ++ + +E+W   H    +   EK  R   FK N+ YI + NK   
Sbjct: 26  CAAIPFDERDLESDEALWDLYERWQEHH-HVPRHHGEKHRRFGAFKDNVRYIHEHNKRAP 84

Query: 82  RTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRP-STFKYQNVTDVPTSIDWREK 140
               L  N F D+  EEFRA + G +         ++ P   F Y+ V D+P ++DWR K
Sbjct: 85  GYAPL--NRFGDMGREEFRATFAGSHANDLRRDGLAAPPLPGFMYEGVRDLPRAVDWRRK 142

Query: 141 GAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST-DNHGCSGGLMD 199
           GAVT +KDQG+CGSCWAFS V +VEGI  I  G+L+ LSEQ+L+DC T DN GC GGLM+
Sbjct: 143 GAVTGVKDQGKCGSCWAFSTVVSVEGINAIRTGRLVSLSEQELIDCDTADNSGCQGGLME 202

Query: 200 KAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQ 259
            AFEYI  + G+ TE+ YPYR   GTCD  + +     I  ++++P   E AL +AV+NQ
Sbjct: 203 NAFEYIKHSGGITTESAYPYRAANGTCDAVRARGGLVVIDGHQNVPANSEAALAKAVANQ 262

Query: 260 PVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETW 319
           PVSV +DA  ++F FY  GV   DCG + DHGVAVVG+G  E  +G +YW++KNSWG  W
Sbjct: 263 PVSVAIDAGDQSFQFYSDGVFAGDCGTDLDHGVAVVGYG--ETNDGTEYWIVKNSWGTAW 320

Query: 320 GESGYIRILRDA----GLCGIATAASYPV 344
           GE GYIR+ RD+    GLCGIA  ASYPV
Sbjct: 321 GEGGYIRMQRDSGYDGGLCGIAMEASYPV 349


>gi|297816030|ref|XP_002875898.1| hypothetical protein ARALYDRAFT_485194 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297321736|gb|EFH52157.1| hypothetical protein ARALYDRAFT_485194 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 363

 Score =  300 bits (767), Expect = 1e-78,   Method: Compositional matrix adjust.
 Identities = 156/346 (45%), Positives = 211/346 (60%), Gaps = 17/346 (4%)

Query: 14  FVIIILVITCASQVVSGRSMHEP------SIVEKHEQWMAQHGRTYKDELEKAMRLNIFK 67
              I+L   C  Q   G    E       ++ + +E+W   H  T +   E   R N+F+
Sbjct: 3   LFFIVLSFLCLLQASKGFDFDEKELETEENVWKLYERWRDHHSVT-RASHEALKRFNVFR 61

Query: 68  QNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPST-FKYQ 126
            N+ ++ + NK+ N+ YKL  N F+D+T+ EFR+ Y G N     + R   R S  F Y+
Sbjct: 62  HNVLHVHRTNKK-NKPYKLKVNRFADITHHEFRSSYAGSNVKHHRMLRGPKRGSGGFMYE 120

Query: 127 NVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDC 186
           NVT VP+S+DWREKGAVT +K+Q  CGSCWAFS VAAVEGI +I   KL+ LSEQ+LVDC
Sbjct: 121 NVTRVPSSVDWREKGAVTEVKNQQDCGSCWAFSTVAAVEGINKIRTNKLVSLSEQELVDC 180

Query: 187 ST-DNHGCSGGLMDKAFEYIIENKGLATEADYPY-RHEEGTCDNQKEKAVAATISKYEDL 244
            T +N GC+GGLM+ AFE+I  N G+ TE  YPY  ++   C  +       TI  +E +
Sbjct: 181 DTEENQGCAGGLMEPAFEFIKNNGGIKTEETYPYDSNDVQFCRAKSIDGETVTIDGHEHV 240

Query: 245 PKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEEN 304
           P+ DE+ALL+AV++QPVSV +DA    F  Y  GV   +CG   +HGV +VG+G  E +N
Sbjct: 241 PENDEEALLKAVAHQPVSVAIDAGSSDFQLYSEGVFIGECGTQLNHGVVIVGYG--ETKN 298

Query: 305 GAKYWLIKNSWGETWGESGYIRILR----DAGLCGIATAASYPVAI 346
           G KYW+++NSWG  WGE GY+RI R    + G CGIA  ASYP  +
Sbjct: 299 GTKYWIVRNSWGPEWGEGGYVRIERGISENEGRCGIAMEASYPTKV 344


>gi|30141023|dbj|BAC75925.1| cysteine protease-3 [Helianthus annuus]
          Length = 348

 Score =  299 bits (766), Expect = 1e-78,   Method: Compositional matrix adjust.
 Identities = 146/348 (41%), Positives = 212/348 (60%), Gaps = 13/348 (3%)

Query: 3   LKFEKSFIIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMR 62
           ++  K F++ + + + + +             + S+ + +E+W +QH  +   + EK  R
Sbjct: 1   MECNKVFVLSISLALFIGVVNCIDFTEKDLATDKSLWDLYERWGSQHMVSRAPD-EKKKR 59

Query: 63  LNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVP--SVSRQSSRP 120
            N+FK N+ +I + N+ G + YKL  NEF+D+TN EF+A   G++  +    + +   R 
Sbjct: 60  FNVFKYNVNHINRVNQLG-KPYKLKLNEFADMTNHEFKA---GFDSKILHFRMLKGKRRQ 115

Query: 121 STFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSE 180
           + F +   TD P SIDWR  GAV  IK+QG+CGSCWAFS +  VEGI +I   +L+ LSE
Sbjct: 116 TPFTHAKTTDPPPSIDWRTNGAVNPIKNQGRCGSCWAFSTIVGVEGINKIKTNQLVSLSE 175

Query: 181 QQLVDCSTDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISK 240
           Q+LVDC TD  GC+GGLM+  +E+I E  G+ TE  YPY    G CD  K  +    I  
Sbjct: 176 QELVDCETDCEGCNGGLMENGYEFIKETGGVTTEQIYPYFARNGRCDISKRNSPVVKIDG 235

Query: 241 YEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTA 300
           +E++P  DE A+L+AV+NQPVS+ +DA G  F FY  GV N  CG   +HGVA+VG+GT 
Sbjct: 236 FENVPANDESAMLRAVANQPVSIAIDAGGLNFQFYSQGVFNGACGTELNHGVAIVGYGTT 295

Query: 301 EEENGAKYWLIKNSWGETWGESGYIRILRDA----GLCGIATAASYPV 344
           ++  G  YW+++NSWG  WGE GY+R+ R      GLCG+A  ASYP+
Sbjct: 296 QD--GTNYWIVRNSWGTGWGEQGYVRMQRGVNVPEGLCGLAMDASYPI 341


>gi|357115272|ref|XP_003559414.1| PREDICTED: thiol protease SEN102-like [Brachypodium distachyon]
          Length = 360

 Score =  299 bits (766), Expect = 1e-78,   Method: Compositional matrix adjust.
 Identities = 158/328 (48%), Positives = 203/328 (61%), Gaps = 25/328 (7%)

Query: 37  SIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNR-------TYKLGTN 89
           ++  +HE WMA+HGRTY D  EKA RL IF+ N E I+  N + +        +++L TN
Sbjct: 38  AMASRHESWMAEHGRTYADAEEKARRLEIFRANAERIDSFNSKADAAAGESVDSHRLATN 97

Query: 90  EFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT---DVPTSIDWREKGAVTHI 146
            F+DLT+EEFRA  TG  RP             F+Y+N +   D   S+DWR  GAVT +
Sbjct: 98  RFADLTDEEFRAARTGLRRPAAVAGAVGG---GFRYENFSLQADAAGSMDWRAMGAVTGV 154

Query: 147 KDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST--DNHGCSGGLMDKAFEY 204
           KDQG CG CWAFSAVAA+EG+T+I  G+L+ LSEQQLVDC    D+ GC GGLMD AF+Y
Sbjct: 155 KDQGSCGCCWAFSAVAAMEGLTKIRTGRLVSLSEQQLVDCDVYGDDQGCEGGLMDNAFQY 214

Query: 205 IIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVC 264
           I    GLA+E+ YPY  E+G          AA+I  +ED+P  +E AL+ AV++QPVSV 
Sbjct: 215 ISRQGGLASESAYPYSGEDGGSCRSGRAQPAASIRGHEDVPANNEGALMAAVAHQPVSVA 274

Query: 265 VDASGRAFHFYKSGVLNADCGNNC-----DHGVAVVGFGTAEEENGAKYWLIKNSWGETW 319
           ++     F FY  GVL A     C     DH +  VG+G A   +G  YWL+KNSWG  W
Sbjct: 275 INGGDYVFRFYDRGVLGAGGNGGCESTELDHAITAVGYGMA--GDGTGYWLMKNSWGSGW 332

Query: 320 GESGYIRILRDA---GLCGIATAASYPV 344
           GESGY+RI R +   G+CG+A  ASYPV
Sbjct: 333 GESGYVRIRRGSRGEGVCGLAKLASYPV 360


>gi|195637152|gb|ACG38044.1| vignain precursor [Zea mays]
          Length = 377

 Score =  299 bits (766), Expect = 1e-78,   Method: Compositional matrix adjust.
 Identities = 152/291 (52%), Positives = 188/291 (64%), Gaps = 14/291 (4%)

Query: 63  LNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTG----YNRPVPSVSRQSS 118
            N+FK N+  I + N+  +  YKL  N F D+T +EFR  Y G    ++R      + SS
Sbjct: 70  FNVFKANVRLIHEFNRR-DEPYKLRLNRFGDMTADEFRRHYAGSRVAHHRMFRGDRQGSS 128

Query: 119 RPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIEL 178
             ++F Y +  DVP S+DWR+KGAVT +KDQGQCGSCWAFS +AAVEGI  I    L  L
Sbjct: 129 ASASFMYADARDVPASVDWRQKGAVTDVKDQGQCGSCWAFSTIAAVEGINAIKTKNLTSL 188

Query: 179 SEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAAT 237
           SEQQLVDC T  N GC+GGLMD AF+YI ++ G+A E  YPYR  + +C  +K  A   T
Sbjct: 189 SEQQLVDCDTKANAGCNGGLMDYAFQYIAKHGGVAAEDAYPYRARQASC--KKSPAPVVT 246

Query: 238 ISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGF 297
           I  YED+P  DE AL +AV++QPVSV ++ASG  F FY  GV +  CG   DHGVA VG+
Sbjct: 247 IDGYEDVPANDESALKKAVAHQPVSVAIEASGSHFQFYSEGVFSGRCGTELDHGVAAVGY 306

Query: 298 GTAEEENGAKYWLIKNSWGETWGESGYIRILRDA----GLCGIATAASYPV 344
           G   +  G KYWL+KNSWG  WGE GYIR+ RD     G CGIA  ASYPV
Sbjct: 307 GVTAD--GTKYWLVKNSWGPEWGEKGYIRMARDVAAKEGHCGIAMEASYPV 355


>gi|18413507|ref|NP_567377.1| putative cysteine proteinase [Arabidopsis thaliana]
 gi|30315953|sp|Q9SUS9.1|CPR4_ARATH RecName: Full=Probable cysteine proteinase At4g11320; Flags:
           Precursor
 gi|5596478|emb|CAB51416.1| drought-inducible cysteine proteinase RD21A precursor-like protein
           [Arabidopsis thaliana]
 gi|7267831|emb|CAB81233.1| drought-inducible cysteine proteinase RD21A precursor-like protein
           [Arabidopsis thaliana]
 gi|14334764|gb|AAK59560.1| putative cysteine proteinase [Arabidopsis thaliana]
 gi|15293257|gb|AAK93739.1| putative cysteine proteinase [Arabidopsis thaliana]
 gi|332657596|gb|AEE82996.1| putative cysteine proteinase [Arabidopsis thaliana]
          Length = 371

 Score =  299 bits (766), Expect = 1e-78,   Method: Compositional matrix adjust.
 Identities = 152/364 (41%), Positives = 222/364 (60%), Gaps = 27/364 (7%)

Query: 3   LKFEKSFIIPMFVIIILVITCAS----QVVSGRSMH-------------EPSIVEKHEQW 45
           + + KS ++ +F++ +++ +CA+     VVS    H             +       E W
Sbjct: 1   MGYAKSAML-IFLLALVIASCATAMDMSVVSSNDNHHVTAGPGRRQGIFDAEATLMFESW 59

Query: 46  MAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTG 105
           M +HG+ Y    EK  RL IF+ NL +I   N E N +Y+LG N F+DL+  E+  +  G
Sbjct: 60  MVKHGKVYDSVAEKERRLTIFEDNLRFITNRNAE-NLSYRLGLNRFADLSLHEYGEICHG 118

Query: 106 YNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVE 165
            +   P      +  + +K  +   +P S+DWR +GAVT +KDQG C SCWAFS V AVE
Sbjct: 119 ADPRPPRNHVFMTSSNRYKTSDGDVLPKSVDWRNEGAVTEVKDQGLCRSCWAFSTVGAVE 178

Query: 166 GITQITRGKLIELSEQQLVDCSTDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGT 225
           G+ +I  G+L+ LSEQ L++C+ +N+GC GG ++ A+E+I+ N GL T+ DYPY+   G 
Sbjct: 179 GLNKIVTGELVTLSEQDLINCNKENNGCGGGKVETAYEFIMNNGGLGTDNDYPYKALNGV 238

Query: 226 CDNQ-KEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADC 284
           C+ + KE      I  YE+LP  DE AL++AV++QPV+  VD+S R F  Y+SGV +  C
Sbjct: 239 CEGRLKEDNKNVMIDGYENLPANDEAALMKAVAHQPVTAVVDSSSREFQLYESGVFDGTC 298

Query: 285 GNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRDA----GLCGIATAA 340
           G N +HGV VVG+GT   ENG  YW++KNS G+TWGE+GY+++ R+     GLCGIA  A
Sbjct: 299 GTNLNHGVVVVGYGT---ENGRDYWIVKNSRGDTWGEAGYMKMARNIANPRGLCGIAMRA 355

Query: 341 SYPV 344
           SYP+
Sbjct: 356 SYPL 359


>gi|146215984|gb|ABQ10194.1| actinidin Act2c [Actinidia arguta]
          Length = 378

 Score =  299 bits (765), Expect = 1e-78,   Method: Compositional matrix adjust.
 Identities = 150/338 (44%), Positives = 206/338 (60%), Gaps = 15/338 (4%)

Query: 13  MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEY 72
           +F   +L+++ A  +V+        + + +E W+ + G++Y    EK MR  IFK NL  
Sbjct: 13  LFFSTLLILSSALDIVNSAQRTNDQVRDMYESWLVEQGKSYNSLDEKEMRFEIFKDNLRI 72

Query: 73  IEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDV- 131
           I+  N + NR++ LG N F+DLT+EE+R+ Y G+       S   ++ S      V DV 
Sbjct: 73  IDDHNADANRSFSLGLNRFADLTDEEYRSTYLGFK------SGPKAKVSNRYVPKVGDVL 126

Query: 132 PTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDC--STD 189
           P  +DWR  GAV  +K+QG C SCWAFSAVAAVEGI +I  G L+ LSEQ+LVDC  +  
Sbjct: 127 PNYVDWRTVGAVVGVKNQGLCSSCWAFSAVAAVEGINKIMTGNLLSLSEQELVDCGRTQS 186

Query: 190 NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDE 249
             GC+ G M  AF++II N G+ TE +YPY  ++G C+   +     TI  YE++P  +E
Sbjct: 187 TRGCNRGYMTDAFQFIINNGGINTEDNYPYTAQDGQCNRYLQNQKYVTIDDYENVPSNNE 246

Query: 250 QALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYW 309
            AL  AV++QPVSV +++ G  F  Y SG+    CG   DHGV +VG+GT   E G  YW
Sbjct: 247 WALQNAVAHQPVSVGLESEGGKFKLYTSGIFTQYCGTAIDHGVTIVGYGT---ERGLDYW 303

Query: 310 LIKNSWGETWGESGYIRILRD---AGLCGIATAASYPV 344
           ++KNSWG  WGE+GYIRI R+   AG CGIA  ASYPV
Sbjct: 304 IVKNSWGTNWGENGYIRIQRNIGGAGKCGIARMASYPV 341


>gi|146215986|gb|ABQ10195.1| actinidin Act2d [Actinidia eriantha]
          Length = 381

 Score =  299 bits (765), Expect = 1e-78,   Method: Compositional matrix adjust.
 Identities = 150/346 (43%), Positives = 204/346 (58%), Gaps = 31/346 (8%)

Query: 13  MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEY 72
           +F   +L+++ A  + +        ++  +E W+ + G++Y    EK MR  IFK+NL  
Sbjct: 15  LFFSTLLILSSALDIKNSVQRTNDQVMAMYESWLVEQGKSYNSLDEKEMRFEIFKENLRI 74

Query: 73  IEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGY---------NRPVPSVSRQSSRPSTF 123
           I+  N + NR+Y LG N F+DLT+EE+R+ Y G+         NR VP V          
Sbjct: 75  IDDHNADANRSYSLGLNRFADLTDEEYRSTYLGFKSGPKAKVSNRYVPKVG--------- 125

Query: 124 KYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQL 183
                  +P  +DWR  GAV  +KDQG C SCWAFSAVAAVEGI +I  G LI LSEQ+L
Sbjct: 126 -----VVLPNYVDWRTVGAVVGVKDQGLCSSCWAFSAVAAVEGINKIVTGNLISLSEQEL 180

Query: 184 VDC--STDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKY 241
           VDC  +    GC+ G M+ AF++II+N G+ TE +YPY  ++G CD  ++     TI  Y
Sbjct: 181 VDCGRTQRTRGCNRGYMNDAFQFIIDNGGINTEDNYPYTAQDGQCDWYRKNQRYVTIDNY 240

Query: 242 EDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAE 301
           E LP  +E  L  AV+ QP++V +++ G  F  Y SG+    CG   DHGV +VG+GT  
Sbjct: 241 EQLPANNEWVLQNAVAYQPITVGLESEGGKFKLYTSGIYTGYCGTAIDHGVTIVGYGT-- 298

Query: 302 EENGAKYWLIKNSWGETWGESGYIRILRD---AGLCGIATAASYPV 344
            E G  YW++KNSWG  WGE+GYIRI R+   AG CGIA   SYPV
Sbjct: 299 -ERGLDYWIVKNSWGTNWGENGYIRIQRNIGGAGKCGIAMVPSYPV 343


>gi|4426617|gb|AAD20453.1| cysteine endopeptidase precursor [Oryza sativa]
          Length = 368

 Score =  299 bits (765), Expect = 2e-78,   Method: Compositional matrix adjust.
 Identities = 153/329 (46%), Positives = 203/329 (61%), Gaps = 12/329 (3%)

Query: 23  CASQVVSGRSMH-EPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGN 81
           CA+     R +  + ++ + +E+W   H    +   EK  R   FK N+ YI + NK   
Sbjct: 26  CAAIPFDERDLESDEALWDLYERWQEHH-HVPRHHGEKHRRFGAFKDNVRYIHEHNKRAP 84

Query: 82  RTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRP-STFKYQNVTDVPTSIDWREK 140
               L  N F D+  EEFRA + G +         ++ P   F Y+ V D+P ++DWR K
Sbjct: 85  GYPPL--NRFGDMGREEFRATFAGSHANDLRRDGLAAPPLPGFMYEGVRDLPRAVDWRRK 142

Query: 141 GAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST-DNHGCSGGLMD 199
           GAVT +KDQG+CGSCWAFS V +VEGI  I  G+L+ LSEQ+L+DC T DN GC GGLM+
Sbjct: 143 GAVTGVKDQGKCGSCWAFSTVVSVEGINAIRTGRLVSLSEQELIDCDTADNSGCQGGLME 202

Query: 200 KAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQ 259
            AFEYI  + G+ TE+ YPYR   GTCD  + +     I  ++++P   E AL +AV+NQ
Sbjct: 203 NAFEYIKHSGGITTESAYPYRAANGTCDAVRARGGLVVIDGHQNVPANSEAALAKAVANQ 262

Query: 260 PVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETW 319
           PVSV +DA  ++F FY  GV   DCG + DHGVAVVG+G  E  +G +YW++KNSWG  W
Sbjct: 263 PVSVAIDAGDQSFQFYSDGVFAGDCGTDLDHGVAVVGYG--ETNDGTEYWIVKNSWGTAW 320

Query: 320 GESGYIRILRDA----GLCGIATAASYPV 344
           GE GYIR+ RD+    GLCGIA  ASYPV
Sbjct: 321 GEGGYIRMQRDSGYDGGLCGIAMEASYPV 349


>gi|413953666|gb|AFW86315.1| hypothetical protein ZEAMMB73_539008 [Zea mays]
          Length = 314

 Score =  299 bits (765), Expect = 2e-78,   Method: Compositional matrix adjust.
 Identities = 152/340 (44%), Positives = 209/340 (61%), Gaps = 42/340 (12%)

Query: 13  MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEY 72
           +  I+     C + + +     + ++V +HEQWMAQ+ R YKD  EKA R          
Sbjct: 8   ILAILGFAFFCGAALAARDLSDDSAMVARHEQWMAQYSRVYKDASEKARRF--------- 58

Query: 73  IEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTD-- 130
                            +F+DLTN EFR++ T  N+   S + +    + F+Y+NV+   
Sbjct: 59  -----------------KFADLTNHEFRSVKT--NKGFKSSNMKI--LTGFRYENVSADA 97

Query: 131 VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST-- 188
           +PT+IDWR KG VT IKDQGQCG C AFSAVAA EGI +I+ GKL+ L++Q+LVDC    
Sbjct: 98  LPTTIDWRTKGVVTPIKDQGQCGCCSAFSAVAATEGIVKISTGKLVSLADQELVDCDVHG 157

Query: 189 DNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGD 248
           ++ GC GGLMD AF++II+N GL TE+ YPY   +G C++      AATI  YED+P  D
Sbjct: 158 EDQGCEGGLMDDAFKFIIKNGGLTTESSYPYTAADGKCNSGSNS--AATIKGYEDVPAND 215

Query: 249 EQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKY 308
           E AL++A++NQPVSV VD     F FY  GV+   CG + DHG+A +G+G  +  +G KY
Sbjct: 216 EAALMKAMANQPVSVAVDGGDMTFRFYSGGVMTGSCGTDLDHGIAAIGYG--KTSDGTKY 273

Query: 309 WLIKNSWGETWGESGYIRILRDA----GLCGIATAASYPV 344
           WL+KNSWG TWGE+GY+R+ +D     G+CG+A   SYP 
Sbjct: 274 WLMKNSWGTTWGENGYLRMEKDISDKRGMCGLAMEPSYPT 313


>gi|317106666|dbj|BAJ53169.1| JHL18I08.3 [Jatropha curcas]
          Length = 441

 Score =  299 bits (765), Expect = 2e-78,   Method: Compositional matrix adjust.
 Identities = 143/311 (45%), Positives = 197/311 (63%), Gaps = 9/311 (2%)

Query: 38  IVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNE 97
           I    E W  QHG+TY  + EK  RL +F+ N +++ + N +GN +Y L  N F+DLT+ 
Sbjct: 26  IAHLFETWCQQHGKTYASQEEKLFRLKVFQDNYDFVTEHNSQGNSSYTLSLNAFADLTHH 85

Query: 98  EFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWA 157
           EF+A   G +    S S    R +      V DVP S+DWR+ GAVT +KDQG CG+CW+
Sbjct: 86  EFKASRLGLS-SAASASLNVDRSNRQIPDFVADVPASVDWRKNGAVTQVKDQGNCGACWS 144

Query: 158 FSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGLATEAD 216
           FSA  A+EGI +I  G L+ LSEQ+LVDC    N+GC GG+MD AF+++I+N G+ TE D
Sbjct: 145 FSATGAIEGINKIVTGSLVSLSEQELVDCDKSYNNGCEGGIMDYAFQFVIDNHGIDTEED 204

Query: 217 YPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYK 276
           YPY+  + +C+ +K K    TI  Y D+P+ +E+ LL+AV+NQPVSV +  S RAF  Y 
Sbjct: 205 YPYQGRDRSCNKEKLKRHVVTIDGYVDVPQNNEKELLKAVANQPVSVGICGSERAFQLYS 264

Query: 277 SGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRDA----G 332
            G+    C  + DH V +VG+G+   ENG  YW++KNSWG  WG  GY+ + R++    G
Sbjct: 265 KGIFTGPCSTSLDHAVLIVGYGS---ENGVDYWIVKNSWGSYWGMDGYMHMQRNSGSSRG 321

Query: 333 LCGIATAASYP 343
           LCGI   ASYP
Sbjct: 322 LCGINMLASYP 332


>gi|357162587|ref|XP_003579458.1| PREDICTED: oryzain beta chain-like [Brachypodium distachyon]
          Length = 470

 Score =  298 bits (764), Expect = 2e-78,   Method: Compositional matrix adjust.
 Identities = 144/310 (46%), Positives = 201/310 (64%), Gaps = 13/310 (4%)

Query: 45  WMAQHGRTYKDEL-EKAMRLNIFKQNLEYIEKANKE---GNRTYKLGTNEFSDLTNEEFR 100
           W A+HG    + L E+  R   F  NL +++  N     G   ++LG N F+DLTN+EFR
Sbjct: 55  WRAEHGSGNSNSLGEEERRFRAFWDNLRFVDAHNARAAAGEEGFRLGMNRFADLTNDEFR 114

Query: 101 ALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSA 160
           A Y G        S ++     +++  V ++P ++DWREKGAV  +K+QGQCGSCWAFSA
Sbjct: 115 AAYLGVKGAGQRRSARAGVGERYRHDGVEELPEAVDWREKGAVAPVKNQGQCGSCWAFSA 174

Query: 161 VAAVEGITQITRGKLIELSEQQLVDCSTD--NHGCSGGLMDKAFEYIIENKGLATEADYP 218
           V+AVE I Q+  G+L+ LSEQ+LV+C  +  ++GC+GGLMD AF++II N G+ TE DYP
Sbjct: 175 VSAVESINQLVTGELVTLSEQELVECDINGQSNGCNGGLMDDAFDFIINNGGIDTEDDYP 234

Query: 219 YRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSG 278
           Y+  +G CD  +  A   +I  +ED+P+ DE++L +AV++QPVSV ++A GR F  Y SG
Sbjct: 235 YKALDGKCDINRRNAKVVSIDGFEDVPENDEKSLQKAVAHQPVSVAIEAGGREFQLYHSG 294

Query: 279 VLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD----AGLC 334
           V    CG   DHGV  VG+GT   ENG  YW+++NSWG  WGE+GY+R+ R+     G C
Sbjct: 295 VFTGRCGTELDHGVVAVGYGT---ENGKDYWIVRNSWGPKWGEAGYLRMERNINATTGKC 351

Query: 335 GIATAASYPV 344
           GIA  +SYP 
Sbjct: 352 GIAMMSSYPT 361


>gi|15984|emb|CAA34486.1| unnamed protein product [Actinidia deliciosa]
          Length = 380

 Score =  298 bits (763), Expect = 3e-78,   Method: Compositional matrix adjust.
 Identities = 153/351 (43%), Positives = 217/351 (61%), Gaps = 18/351 (5%)

Query: 3   LKFEKSFIIP--MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKA 60
           +   KSF+    +F   +L+++ A    +        +   +E W+ ++G++Y    E  
Sbjct: 1   MGLPKSFVSMSLLFFSTLLILSLAFNAKNLTQRTNDEVKAMYESWLIKYGKSYNSLGEWE 60

Query: 61  MRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRP 120
            R  IFK+ L +I++ N + NR+YK+G N+F+DLT+EEFR+ Y G+     S S ++   
Sbjct: 61  RRFEIFKETLRFIDEHNADTNRSYKVGLNQFADLTDEEFRSTYLGFT----SGSNKTKVS 116

Query: 121 STFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSE 180
           + ++ +    +P+ +DWR  GAV  IK QG+CG CWAFSA+A VEGI +I  G LI LSE
Sbjct: 117 NRYEPRFGQVLPSYVDWRSAGAVVDIKSQGECGGCWAFSAIATVEGINKIVTGVLISLSE 176

Query: 181 QQLVDC--STDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTC--DNQKEKAVAA 236
           Q+L+DC  + +  GC+GG +   F++II N G+ TE +YPY  ++G C  D Q EK V  
Sbjct: 177 QELIDCGRTQNTRGCNGGYITDGFQFIINNGGINTEENYPYTAQDGECNLDLQNEKYV-- 234

Query: 237 TISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVG 296
           TI  YE++P  +E AL  AV+ QPVSV +DA+G AF  Y SG+    CG   DH V +VG
Sbjct: 235 TIDTYENVPYNNEWALQTAVTYQPVSVALDAAGDAFKHYSSGIFTGPCGTAIDHAVTIVG 294

Query: 297 FGTAEEENGAKYWLIKNSWGETWGESGYIRILRD---AGLCGIATAASYPV 344
           +GT   E G  YW++KNSW  TWGE GY+RILR+   AG CGIAT  SYPV
Sbjct: 295 YGT---EGGIDYWIVKNSWDTTWGEEGYMRILRNVGGAGTCGIATMPSYPV 342


>gi|146215976|gb|ABQ10190.1| actinidin Act1b [Actinidia arguta]
          Length = 380

 Score =  298 bits (763), Expect = 3e-78,   Method: Compositional matrix adjust.
 Identities = 149/337 (44%), Positives = 208/337 (61%), Gaps = 12/337 (3%)

Query: 13  MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEY 72
           +F   +LV++ A    +        +   +E W+ ++G++Y    E   R  IFK+ L +
Sbjct: 13  LFFSTLLVLSLAFNAKNLTKRTNDELKAMYESWLTKYGKSYNSLGEWERRFEIFKETLRF 72

Query: 73  IEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVP 132
           I++ N + NR+Y++G N+F+D TNEEF++ Y G+     S S +    + ++ +    +P
Sbjct: 73  IDEHNADTNRSYRVGLNQFADQTNEEFQSTYLGFT----SGSNKMKVSNRYEPRVGQVLP 128

Query: 133 TSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDC--STDN 190
             +DWR  GAV  IK QGQCGSCWAFSA+A VEGI +I  G LI LSEQ+LVDC  + + 
Sbjct: 129 DYVDWRSAGAVVDIKSQGQCGSCWAFSAIATVEGINKIVTGDLISLSEQELVDCGRTQNT 188

Query: 191 HGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQ 250
            GC GG +   F++II N G+ TEA+YPY  E+G C+   +    A+I  YE++P  +E 
Sbjct: 189 RGCDGGSITDGFQFIINNGGINTEANYPYTAEDGQCNLDLQNEKYASIDTYENVPYNNEW 248

Query: 251 ALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWL 310
           AL  AV+ QPVSV ++A+G AF  Y SG+    CG   DH V +VG+GT   E G  YW+
Sbjct: 249 ALQTAVAYQPVSVALEAAGDAFQHYSSGIFTGPCGTAVDHAVTIVGYGT---EGGIDYWI 305

Query: 311 IKNSWGETWGESGYIRILRD---AGLCGIATAASYPV 344
           +KNSW  TWGE GYIRILR+   AG CGIAT  SYPV
Sbjct: 306 VKNSWDTTWGEEGYIRILRNVGGAGTCGIATKPSYPV 342


>gi|193806686|sp|A5HII1.1|ACTN_ACTDE RecName: Full=Actinidain; Short=Actinidin; AltName: Full=Allergen
           Act d 1; AltName: Allergen=Act d 1; Flags: Precursor
 gi|146215974|gb|ABQ10189.1| actinidin Act1a [Actinidia deliciosa]
          Length = 380

 Score =  298 bits (763), Expect = 3e-78,   Method: Compositional matrix adjust.
 Identities = 153/351 (43%), Positives = 217/351 (61%), Gaps = 18/351 (5%)

Query: 3   LKFEKSFIIP--MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKA 60
           +   KSF+    +F   +L+++ A    +        +   +E W+ ++G++Y    E  
Sbjct: 1   MGLPKSFVSMSLLFFSTLLILSLAFNAKNLTQRTNDEVKAMYESWLIKYGKSYNSLGEWE 60

Query: 61  MRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRP 120
            R  IFK+ L +I++ N + NR+YK+G N+F+DLT+EEFR+ Y G+     S S ++   
Sbjct: 61  RRFEIFKETLRFIDEHNADTNRSYKVGLNQFADLTDEEFRSTYLGFT----SGSNKTKVS 116

Query: 121 STFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSE 180
           + ++ +    +P+ +DWR  GAV  IK QG+CG CWAFSA+A VEGI +I  G LI LSE
Sbjct: 117 NRYEPRVGQVLPSYVDWRSAGAVVDIKSQGECGGCWAFSAIATVEGINKIVTGVLISLSE 176

Query: 181 QQLVDC--STDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTC--DNQKEKAVAA 236
           Q+L+DC  + +  GC+GG +   F++II N G+ TE +YPY  ++G C  D Q EK V  
Sbjct: 177 QELIDCGRTQNTRGCNGGYITDGFQFIINNGGINTEENYPYTAQDGECNLDLQNEKYV-- 234

Query: 237 TISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVG 296
           TI  YE++P  +E AL  AV+ QPVSV +DA+G AF  Y SG+    CG   DH V +VG
Sbjct: 235 TIDTYENVPYNNEWALQTAVTYQPVSVALDAAGDAFKHYSSGIFTGPCGTAIDHAVTIVG 294

Query: 297 FGTAEEENGAKYWLIKNSWGETWGESGYIRILRD---AGLCGIATAASYPV 344
           +GT   E G  YW++KNSW  TWGE GY+RILR+   AG CGIAT  SYPV
Sbjct: 295 YGT---EGGIDYWIVKNSWDTTWGEEGYMRILRNVGGAGTCGIATMPSYPV 342


>gi|312451836|gb|ADQ85985.1| actinidin [Actinidia chinensis]
          Length = 380

 Score =  298 bits (762), Expect = 3e-78,   Method: Compositional matrix adjust.
 Identities = 153/351 (43%), Positives = 217/351 (61%), Gaps = 18/351 (5%)

Query: 3   LKFEKSFIIP--MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKA 60
           +   KSF+    +F   +L+++ A    +        +   +E W+ ++G++Y    E  
Sbjct: 1   MGLPKSFVSMSLLFFSTLLILSLAFNTKNLTQRTNDEVKAMYESWLIKYGKSYNSLGEWE 60

Query: 61  MRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRP 120
            R  IFK+ L +I++ N + NR+YK+G N+F+DLT+EEFR+ Y G+     S S ++   
Sbjct: 61  RRFEIFKETLRFIDEHNADTNRSYKVGLNQFADLTDEEFRSTYLGFT----SGSNKTKVS 116

Query: 121 STFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSE 180
           + ++ +    +P+ +DWR  GAV  IK QG+CG CWAFSA+A VEGI +I  G LI LSE
Sbjct: 117 NRYEPRVGQVLPSYVDWRSAGAVVDIKSQGECGGCWAFSAIATVEGINKIVTGVLISLSE 176

Query: 181 QQLVDC--STDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTC--DNQKEKAVAA 236
           Q+L+DC  + +  GC+GG +   F++II N G+ TE +YPY  ++G C  D Q EK V  
Sbjct: 177 QELIDCGRTQNTRGCNGGYITDGFQFIINNGGINTEENYPYTAQDGECNVDLQNEKYV-- 234

Query: 237 TISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVG 296
           TI  YE++P  +E AL  AV+ QPVSV +DA+G AF  Y SG+    CG   DH V +VG
Sbjct: 235 TIDTYENVPYNNEWALQTAVTYQPVSVALDAAGDAFKQYSSGIFTGPCGTAIDHAVTIVG 294

Query: 297 FGTAEEENGAKYWLIKNSWGETWGESGYIRILRD---AGLCGIATAASYPV 344
           +GT   E G  YW++KNSW  TWGE GY+RILR+   AG CGIAT  SYPV
Sbjct: 295 YGT---EGGIDYWIVKNSWDTTWGEEGYMRILRNVGGAGTCGIATMPSYPV 342


>gi|413938554|gb|AFW73105.1| hypothetical protein ZEAMMB73_931917 [Zea mays]
          Length = 361

 Score =  298 bits (762), Expect = 3e-78,   Method: Compositional matrix adjust.
 Identities = 153/355 (43%), Positives = 216/355 (60%), Gaps = 26/355 (7%)

Query: 12  PMFVIIILVITCASQVVSGRSMHEPSIVEKHEQ-----------WMAQHGRTYKDELEKA 60
           P   + + V+  A    S     +PS+V   ++           W  +HG+ Y    EK 
Sbjct: 3   PKLAVAVFVLFLAFAACSANHHRDPSVVGYSQEDLALPSSLFRSWSVKHGKLYASPTEKL 62

Query: 61  MRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSR- 119
            R  IFKQNL +I + N++ N +Y LG N+F+D+ +EEF+A Y G  R +P      +R 
Sbjct: 63  ERYEIFKQNLMHIAETNRK-NGSYWLGLNQFADVAHEEFKASYLGLKRALPRAGAPQTRT 121

Query: 120 PSTFKYQNVT--DVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIE 177
           P+ F+Y       +P S+DWR KGAVT +K+QG+CGSCWAFS+VAAVEGI QI  GKL+ 
Sbjct: 122 PTAFRYAAAAAGSLPWSVDWRYKGAVTPVKNQGKCGSCWAFSSVAAVEGINQIVTGKLVS 181

Query: 178 LSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAA 236
           LSEQ+LVDC T  +HGC GG MD AF Y++ ++G+  E DYPY  EEG C  ++   +  
Sbjct: 182 LSEQELVDCDTTLDHGCEGGTMDLAFAYMMGSQGIHAEDDYPYLMEEGYCKEKQPCVLGI 241

Query: 237 T---ISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVA 293
           T   ++ +ED+P+  E +LL+A+++QPVSV + A  R F FY+ GV +  C    DH + 
Sbjct: 242 TEQDLTGFEDVPENSEISLLKALAHQPVSVGIAAGSRDFQFYRGGVFDGACSVELDHALT 301

Query: 294 VVGFGTAEEENGAKYWLIKNSWGETWGESGYIRIL----RDAGLCGIATAASYPV 344
            VG+G++  +N   Y  +KNSWG+ WGE GY+RI     +  G+CGI T ASYPV
Sbjct: 302 AVGYGSSYGQN---YITMKNSWGKNWGEQGYVRIKMGTGKPEGVCGIYTMASYPV 353


>gi|2144501|pir||TAGB actinidain (EC 3.4.22.14) precursor - kiwi fruit
 gi|166317|gb|AAA32629.1| actinidin [Actinidia deliciosa]
          Length = 380

 Score =  298 bits (762), Expect = 3e-78,   Method: Compositional matrix adjust.
 Identities = 148/349 (42%), Positives = 215/349 (61%), Gaps = 14/349 (4%)

Query: 3   LKFEKSFIIP--MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKA 60
           +   KSF+    +F   +L+++ A    +        +   +E W+ ++G++Y    E  
Sbjct: 1   MGLPKSFVSMSLLFFSTLLILSLAFNAKNLTQRTNDEVKAMYESWLIKYGKSYNSLGEWE 60

Query: 61  MRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRP 120
            R  IFK+ L +I++ N + NR+YK+G N+F+DLT+EEFR+ Y G+     S S ++   
Sbjct: 61  RRFEIFKETLRFIDEHNADTNRSYKVGLNQFADLTDEEFRSTYLGFT----SGSNKTKVS 116

Query: 121 STFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSE 180
           + ++ +    +P+ +DWR  GAV  IK QG+CG CWAFSA+A VEGI +I  G LI LSE
Sbjct: 117 NRYEPRVGQVLPSYVDWRSAGAVVDIKSQGECGGCWAFSAIATVEGINKIVTGVLISLSE 176

Query: 181 QQLVDC--STDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATI 238
           Q+L+DC  + +  GC+GG +   F++II N G+ TE +YPY  ++G C+ + +     TI
Sbjct: 177 QELIDCGRTQNTRGCNGGYITDGFQFIINNGGINTEENYPYTAQDGECNVELQNEKYVTI 236

Query: 239 SKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFG 298
             YE++P  +E AL  AV+ QPVSV +DA+G AF  Y SG+    CG   DH V +VG+G
Sbjct: 237 DTYENVPYNNEWALQTAVTYQPVSVALDAAGDAFKQYSSGIFTGPCGTAIDHAVTIVGYG 296

Query: 299 TAEEENGAKYWLIKNSWGETWGESGYIRILRD---AGLCGIATAASYPV 344
           T   E G  YW++KNSW  TWGE GY+RILR+   AG CGIAT  SYPV
Sbjct: 297 T---EGGIDYWIVKNSWDTTWGEEGYMRILRNVGGAGTCGIATMPSYPV 342


>gi|18413505|ref|NP_567376.1| putative cysteine proteinase [Arabidopsis thaliana]
 gi|30315954|sp|Q9SUT0.1|CPR3_ARATH RecName: Full=Probable cysteine proteinase At4g11310; Flags:
           Precursor
 gi|5596477|emb|CAB51415.1| drought-inducible cysteine proteinase RD21A precursor-like protein
           [Arabidopsis thaliana]
 gi|7267830|emb|CAB81232.1| drought-inducible cysteine proteinase RD21A precursor-like protein
           [Arabidopsis thaliana]
 gi|332657595|gb|AEE82995.1| putative cysteine proteinase [Arabidopsis thaliana]
          Length = 364

 Score =  298 bits (762), Expect = 4e-78,   Method: Compositional matrix adjust.
 Identities = 153/351 (43%), Positives = 220/351 (62%), Gaps = 27/351 (7%)

Query: 13  MFVIIILVITCAS----QVVS---GRSMH-----EPSIVEKHEQWMAQHGRTYKDELEKA 60
           + ++ +++ +CA+     VVS      +H     E S++   E WM +HG+ Y    EK 
Sbjct: 10  ILLVAMVIASCATAIDMSVVSYDDNNRLHSVFDAEASLI--FESWMVKHGKVYGSVAEKE 67

Query: 61  MRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRP 120
            RL IF+ NL +I   N E N +Y+LG   F+DL+  E++ +  G +   P         
Sbjct: 68  RRLTIFEDNLRFINNRNAE-NLSYRLGLTGFADLSLHEYKEVCHGADPRPPR--NHVFMT 124

Query: 121 STFKYQNVTD--VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIEL 178
           S+ +Y+   D  +P S+DWR +GAVT +KDQG C SCWAFS V AVEG+ +I  G+L+ L
Sbjct: 125 SSDRYKTSADDVLPKSVDWRNEGAVTEVKDQGHCRSCWAFSTVGAVEGLNKIVTGELVTL 184

Query: 179 SEQQLVDCSTDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQ-KEKAVAAT 237
           SEQ L++C+ +N+GC GG ++ A+E+I++N GL T+ DYPY+   G CD + KE      
Sbjct: 185 SEQDLINCNKENNGCGGGKLETAYEFIMKNGGLGTDNDYPYKAVNGVCDGRLKENNKNVM 244

Query: 238 ISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGF 297
           I  YE+LP  DE AL++AV++QPV+  +D+S R F  Y+SGV +  CG N +HGV VVG+
Sbjct: 245 IDGYENLPANDESALMKAVAHQPVTAVIDSSSREFQLYESGVFDGSCGTNLNHGVVVVGY 304

Query: 298 GTAEEENGAKYWLIKNSWGETWGESGYIRILRDA----GLCGIATAASYPV 344
           GT   ENG  YWL+KNS G TWGE+GY+++ R+     GLCGIA  ASYP+
Sbjct: 305 GT---ENGRDYWLVKNSRGITWGEAGYMKMARNIANPRGLCGIAMRASYPL 352


>gi|357507505|ref|XP_003624041.1| Cysteine proteinase [Medicago truncatula]
 gi|355499056|gb|AES80259.1| Cysteine proteinase [Medicago truncatula]
          Length = 342

 Score =  297 bits (761), Expect = 4e-78,   Method: Compositional matrix adjust.
 Identities = 155/315 (49%), Positives = 206/315 (65%), Gaps = 21/315 (6%)

Query: 37  SIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTN 96
           S+ E+ E W  ++G  YKD  E+     IFK N+ YI+  N  GN+ YKL  N F D   
Sbjct: 37  SLSERFEYWKTKYGVVYKDVAEQKKHFQIFKHNVAYIDYFNAAGNKPYKLAINRFVDKPI 96

Query: 97  EEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCW 156
           E+      G+ R     +  ++  +TFKY+NVTD+P ++DWR++GAVT IK+QG+CGSCW
Sbjct: 97  EDSD---DGFER-----TTTTTPTTTFKYENVTDIPATVDWRKRGAVTPIKNQGKCGSCW 148

Query: 157 AFSAVAAVEGITQITRGKLIELSEQQLVDC--STDNHGCSGGLMDKAFEYIIENKGLATE 214
           AFSAVAA+EGI +IT G L+ LSEQQLVDC  S    GC  G M  AF++I+EN G+ATE
Sbjct: 149 AFSAVAAIEGIQKITSGNLVSLSEQQLVDCDRSGRTKGCDNGNMINAFKFILENGGIATE 208

Query: 215 ADYPY-RHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFH 273
           A+YPY R  +GTC   K+ +    I  YE++P   E +LL+AV+NQPVSV +D  G  F 
Sbjct: 209 ANYPYKRVVKGTC---KKVSHKVQIKSYEEVPSNSEDSLLKAVANQPVSVGIDMRG-MFK 264

Query: 274 FYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD--- 330
           FY SG+   +CG   +H + +VG+GT+++  G KYWL+KNSW + WGE GYIRI RD   
Sbjct: 265 FYSSGIFTGECGTKPNHALTIVGYGTSKD--GIKYWLVKNSWSKRWGEKGYIRIKRDIDA 322

Query: 331 -AGLCGIATAASYPV 344
             GLCGIA   SYP+
Sbjct: 323 KEGLCGIAMKPSYPI 337


>gi|20260334|gb|AAM13065.1| drought-inducible cysteine proteinase RD21A precursor-like protein
           [Arabidopsis thaliana]
 gi|23197782|gb|AAN15418.1| drought-inducible cysteine proteinase RD21A precursor-like protein
           [Arabidopsis thaliana]
          Length = 357

 Score =  297 bits (761), Expect = 4e-78,   Method: Compositional matrix adjust.
 Identities = 154/351 (43%), Positives = 221/351 (62%), Gaps = 27/351 (7%)

Query: 13  MFVIIILVITCAS----QVVS---GRSMH-----EPSIVEKHEQWMAQHGRTYKDELEKA 60
           + ++ +++ +CA+     VVS      +H     E S++   E WM +HG+ Y    EK 
Sbjct: 3   ILLVAMVIASCATAIDMSVVSYDDNNRLHSVFDAEASLI--FESWMVKHGKVYGSVAEKE 60

Query: 61  MRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRP 120
            RL IF+ NL +I   N E N +Y+LG   F+DL+  E++ +  G + P P         
Sbjct: 61  RRLTIFEDNLRFINNRNAE-NLSYRLGLTGFADLSLHEYKEVCHGAD-PRPP-RNHVFMT 117

Query: 121 STFKYQNVTD--VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIEL 178
           S+ +Y+   D  +P S+DWR +GAVT +KDQG C SCWAFS V AVEG+ +I  G+L+ L
Sbjct: 118 SSDRYKTSADDVLPKSVDWRNEGAVTEVKDQGHCRSCWAFSTVGAVEGLNKIVTGELVTL 177

Query: 179 SEQQLVDCSTDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQ-KEKAVAAT 237
           SEQ L++C+ +N+GC GG ++ A+E+I++N GL T+ DYPY+   G CD + KE      
Sbjct: 178 SEQDLINCNKENNGCGGGKLETAYEFIMKNGGLGTDNDYPYKAVNGVCDGRLKENNKNVM 237

Query: 238 ISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGF 297
           I  YE+LP  DE AL++AV++QPV+  +D+S R F  Y+SGV +  CG N +HGV VVG+
Sbjct: 238 IDGYENLPANDESALMKAVAHQPVTAVIDSSSREFQLYESGVFDGSCGTNLNHGVVVVGY 297

Query: 298 GTAEEENGAKYWLIKNSWGETWGESGYIRILRDA----GLCGIATAASYPV 344
           GT   ENG  YWL+KNS G TWGE+GY+++ R+     GLCGIA  ASYP+
Sbjct: 298 GT---ENGRDYWLVKNSRGITWGEAGYMKMARNIANPRGLCGIAMRASYPL 345


>gi|641905|gb|AAC49406.1| cysteine proteinase [Zinnia violacea]
          Length = 342

 Score =  297 bits (761), Expect = 4e-78,   Method: Compositional matrix adjust.
 Identities = 144/302 (47%), Positives = 203/302 (67%), Gaps = 10/302 (3%)

Query: 32  SMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEF 91
           S+H+  ++   E  + +H + Y+   EK  R  IF  NL++I++ NK+ +  Y LG NEF
Sbjct: 41  SIHK--VIHLFESSLVKHSKIYESFDEKLHRFEIFMDNLKHIDETNKKVS-NYWLGLNEF 97

Query: 92  SDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQ 151
           +DLT+EEF+  + G+   +    R+      F+Y++  D+P S+DWR+KGAV+ +K+QGQ
Sbjct: 98  ADLTHEEFKNKFLGFKGEL--AERKDESIEQFRYRDFVDLPKSVDWRKKGAVSPVKNQGQ 155

Query: 152 CGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKG 210
           CGSCWAFS VAAVEGI QI  G L  LSEQ+L+DC T  N+GC+GGLMD AF Y+  N G
Sbjct: 156 CGSCWAFSTVAAVEGINQIVTGNLTVLSEQELIDCDTTFNNGCNGGLMDYAFAYVTRN-G 214

Query: 211 LATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGR 270
           L  E +YPY   EGTCD +++ +   TIS Y D+P+ +E + L+A++NQP+SV ++ASGR
Sbjct: 215 LHKEEEYPYIMSEGTCDEKRDASEKVTISGYHDVPRNNEDSFLKALANQPISVAIEASGR 274

Query: 271 AFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD 330
            F FY  GV +  CG   DHGVA VG+GT++   G  Y +++NSWG  WGE GYIR+ R+
Sbjct: 275 DFQFYSGGVFDGHCGTELDHGVAAVGYGTSK---GLDYVIVRNSWGPKWGEKGYIRMKRN 331

Query: 331 AG 332
            G
Sbjct: 332 TG 333


>gi|57118007|gb|AAW34135.1| cysteine protease gp2b [Zingiber officinale]
          Length = 379

 Score =  297 bits (760), Expect = 5e-78,   Method: Compositional matrix adjust.
 Identities = 156/336 (46%), Positives = 212/336 (63%), Gaps = 18/336 (5%)

Query: 18  ILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKAN 77
           +L ++     V  RS  E  ++  + +W A++    K       RL +FK+NL++++K N
Sbjct: 29  VLTLSKQGGAVPVRSDEEVRML--YLEWRAKNHPAEKYLDLNEYRLEVFKENLQFVDKHN 86

Query: 78  KEGNR---TYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSR--PSTFKYQNVTDVP 132
              +R   T++LG N F+DLTNEE+R   T + R    + R +S    S ++ +   D+P
Sbjct: 87  AAADRGEHTFRLGMNRFADLTNEEYR---TRFLRDFSRLRRSASGKISSRYRLREGDDLP 143

Query: 133 TSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTDNHG 192
            SIDWREKGAV  +K+QG CGSCWAFS VAAVEGI QI  G LI LSEQQLVDC+T NHG
Sbjct: 144 DSIDWREKGAVVPVKNQGGCGSCWAFSTVAAVEGINQIVTGDLISLSEQQLVDCTTANHG 203

Query: 193 CSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQAL 252
           C GG M+ AF++I+ N G+ +E  YPYR + G C N    A   +I  YE++P  +EQ+L
Sbjct: 204 CRGGWMNPAFQFIVNNGGINSEETYPYRGQNGIC-NSTVNAPVVSIDSYENVPSHNEQSL 262

Query: 253 LQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIK 312
            +AV+NQPVSV +DA+GR F  Y+SG+    C  + +H + VVG+GT   EN   Y  +K
Sbjct: 263 QKAVANQPVSVTMDAAGRDFQLYRSGIFTGSCNISANHALTVVGYGT---ENDKDYRTVK 319

Query: 313 NSWGETWGESGYIRILRDA----GLCGIATAASYPV 344
           NSWG+ WGESGYIR+ R+     G CGI   ASYPV
Sbjct: 320 NSWGKNWGESGYIRVERNIGNPNGKCGITRFASYPV 355


>gi|18391078|ref|NP_563855.1| xylem bark cysteine peptidase 3 [Arabidopsis thaliana]
 gi|110741821|dbj|BAE98853.1| papain-like cysteine peptidase XBCP3 [Arabidopsis thaliana]
 gi|111074448|gb|ABH04597.1| At1g09850 [Arabidopsis thaliana]
 gi|332190386|gb|AEE28507.1| xylem bark cysteine peptidase 3 [Arabidopsis thaliana]
          Length = 437

 Score =  297 bits (760), Expect = 6e-78,   Method: Compositional matrix adjust.
 Identities = 145/312 (46%), Positives = 197/312 (63%), Gaps = 11/312 (3%)

Query: 38  IVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNE 97
           I E  + W  +HG+TY  E E+  R+ IFK N +++ + N   N TY L  N F+DLT+ 
Sbjct: 28  ISELFDDWCQKHGKTYGSEEERQQRIQIFKDNHDFVTQHNLITNATYSLSLNAFADLTHH 87

Query: 98  EFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWA 157
           EF+A   G +   PSV   S   S         VP S+DWR+KGAVT++KDQG CG+CW+
Sbjct: 88  EFKASRLGLSVSAPSVIMASKGQS---LGGSVKVPDSVDWRKKGAVTNVKDQGSCGACWS 144

Query: 158 FSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGLATEAD 216
           FSA  A+EGI QI  G LI LSEQ+L+DC    N GC+GGLMD AFE++I+N G+ TE D
Sbjct: 145 FSATGAMEGINQIVTGDLISLSEQELIDCDKSYNAGCNGGLMDYAFEFVIKNHGIDTEKD 204

Query: 217 YPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYK 276
           YPY+  +GTC   K K    TI  Y  +   DE+AL++AV+ QPVSV +  S RAF  Y 
Sbjct: 205 YPYQERDGTCKKDKLKQKVVTIDSYAGVKSNDEKALMEAVAAQPVSVGICGSERAFQLYS 264

Query: 277 SGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRDA----G 332
           SG+ +  C  + DH V +VG+G+   +NG  YW++KNSWG++WG  G++ + R+     G
Sbjct: 265 SGIFSGPCSTSLDHAVLIVGYGS---QNGVDYWIVKNSWGKSWGMDGFMHMQRNTENSDG 321

Query: 333 LCGIATAASYPV 344
           +CGI   ASYP+
Sbjct: 322 VCGINMLASYPI 333


>gi|255546708|ref|XP_002514413.1| cysteine protease, putative [Ricinus communis]
 gi|223546510|gb|EEF48009.1| cysteine protease, putative [Ricinus communis]
          Length = 324

 Score =  296 bits (759), Expect = 6e-78,   Method: Compositional matrix adjust.
 Identities = 152/318 (47%), Positives = 201/318 (63%), Gaps = 40/318 (12%)

Query: 32  SMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEF 91
           SMH+  + E  E WM++HG+TY+   EK  RL +FK NL +I++ N++   TY L  NEF
Sbjct: 39  SMHK--LTELFESWMSKHGKTYESIEEKLHRLEVFKDNLMHIDRRNRDVT-TYWLALNEF 95

Query: 92  SDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQ 151
           +DL++EEF++      R                              EKGAV  +K+QG 
Sbjct: 96  ADLSHEEFKSKLAQIRR-----------------------------LEKGAVAPVKNQGS 126

Query: 152 CGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKG 210
           CGSCWAFS VAAVEGI QI  G L  LSEQ+L+DC T  N GC+GGLMD AF+YI+ N G
Sbjct: 127 CGSCWAFSTVAAVEGINQIVTGNLTSLSEQELIDCDTSFNSGCNGGLMDYAFDYIVNNGG 186

Query: 211 LATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGR 270
           L  E DYPY  EEGTCD ++E+    TIS Y D+P+ +E++LL+A+++QP+S+ ++ASGR
Sbjct: 187 LHKEEDYPYLMEEGTCDEKREEMEVVTISGYHDVPENNEESLLKALAHQPLSIAIEASGR 246

Query: 271 AFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD 330
            F FY  GV N  CG + DHGVA VG+G+++   G  Y ++KNSWG  WGE GYIR+ R+
Sbjct: 247 DFQFYGRGVFNGPCGTDLDHGVAAVGYGSSK---GLDYIIVKNSWGPKWGEKGYIRMKRN 303

Query: 331 A----GLCGIATAASYPV 344
                GLCGI   ASYP 
Sbjct: 304 TGKPEGLCGINKMASYPT 321


>gi|57118011|gb|AAW34137.1| cysteine protease gp3b [Zingiber officinale]
          Length = 466

 Score =  296 bits (759), Expect = 7e-78,   Method: Compositional matrix adjust.
 Identities = 156/325 (48%), Positives = 212/325 (65%), Gaps = 17/325 (5%)

Query: 29  SGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNR---TYK 85
           +GRS  E  I+  +++W A+H     D+     RL +FK+NL ++++ N   +R    Y+
Sbjct: 32  AGRSDEEVRII--YQEWRAKHRPAENDQYVGDYRLEVFKENLRFVDEHNAAADRGEHAYR 89

Query: 86  LGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQ-NVTDV-PTSIDWREKGAV 143
           LG N F+DLTNEE+RA +    R +  + R +S   + +Y+    DV P SIDWREKGAV
Sbjct: 90  LGMNRFADLTNEEYRARFL---RDLSRLGRSTSGEISNQYRLREGDVLPDSIDWREKGAV 146

Query: 144 THIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTDNHGCSGGLMDKAFE 203
             +K QG+CGSCWAF+A+A VEGI QI  G LI LSEQQLVDCST NHGC GG   +AF+
Sbjct: 147 VAVKSQGRCGSCWAFAAIATVEGINQIVTGDLISLSEQQLVDCSTRNHGCEGGWPYRAFQ 206

Query: 204 YIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSV 263
           YII N G+ +E  YPY    GTC+  K  A   +I  Y ++P  DE++L +AV+NQP+SV
Sbjct: 207 YIINNGGVNSEEHYPYTGTNGTCNTTKGNAHVVSIDSYRNVPSNDEKSLQKAVANQPISV 266

Query: 264 CVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESG 323
            ++ASGR F  Y SG+    C  + +HGV VVG+GT    NG  YW++KNSWGE+WG+SG
Sbjct: 267 GINASGRNFQLYHSGIFTGSCNTSLNHGVTVVGYGTV---NGNDYWIVKNSWGESWGDSG 323

Query: 324 YIRILRD----AGLCGIATAASYPV 344
           YI + R+    +G CGIA + SYP+
Sbjct: 324 YILMERNIAESSGKCGIAISPSYPI 348


>gi|3377948|emb|CAA08860.1| cysteine proteinase precursor, AN8 [Ananas comosus]
          Length = 356

 Score =  296 bits (759), Expect = 7e-78,   Method: Compositional matrix adjust.
 Identities = 152/340 (44%), Positives = 222/340 (65%), Gaps = 21/340 (6%)

Query: 13  MFVIIILVITCASQVVSGRSMHEPS--IVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNL 70
           +F+ + L +  AS   S  S  EPS  ++++ E+WM ++GR YKD  EK  R  IFK N+
Sbjct: 8   VFLFLFLCVMWASP--SAASADEPSDPMMKRFEEWMVEYGRVYKDNDEKMRRFQIFKNNV 65

Query: 71  EYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTG-YNRPVPSVSRQSSRPSTFKYQNVT 129
            +IE  N     +Y LG N+F+D+TN EF A YTG  +RP+ ++ R+     +F   +++
Sbjct: 66  NHIETFNSRNKDSYTLGINQFTDMTNNEFVAQYTGGISRPL-NIEREPV--VSFDDVDIS 122

Query: 130 DVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD 189
            VP SIDWR+ GAVT +K+Q  CG+CWAF+A+A VE I +I +G L  LSEQQ++DC+  
Sbjct: 123 AVPQSIDWRDYGAVTSVKNQNPCGACWAFAAIATVESIYKIKKGILEPLSEQQVLDCA-K 181

Query: 190 NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAV--AATISKYEDLPKG 247
            +GC GG   +AFE+II NKG+A+ A YPY+  +GTC   K   V  +A I+ Y  +P+ 
Sbjct: 182 GYGCKGGWEFRAFEFIISNKGVASVAIYPYKAAKGTC---KTNGVPNSAYITGYARVPRN 238

Query: 248 DEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAK 307
           +E +++ AVS QP++V VDA+  +  +Y SGV N  CG + +H V  +G+G  ++ NG K
Sbjct: 239 NESSMMYAVSKQPITVAVDANANS-QYYNSGVFNGPCGTSLNHAVTAIGYG--QDSNGKK 295

Query: 308 YWLIKNSWGETWGESGYIRILRD----AGLCGIATAASYP 343
           YW++KNSWG  WGE+GYIR+ RD    +G+CGIA  + YP
Sbjct: 296 YWIVKNSWGARWGEAGYIRMARDVSSSSGICGIAIDSLYP 335


>gi|359359118|gb|AEV41024.1| putative oryzain beta chain precursor [Oryza minuta]
          Length = 493

 Score =  296 bits (759), Expect = 7e-78,   Method: Compositional matrix adjust.
 Identities = 148/343 (43%), Positives = 211/343 (61%), Gaps = 47/343 (13%)

Query: 42  HEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNR--TYKLGTNEFSDLTNEEF 99
           ++ W+A++GR+Y    E+  R  +F  NL++++  N   +    ++LG N F+DLTN+EF
Sbjct: 49  YDLWLAENGRSYNALGERERRFRVFWDNLKFVDAHNARADEHGGFRLGMNRFADLTNDEF 108

Query: 100 RALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQC------- 152
           RA + G       V R  +    +++  V ++P S+DWREKGAV  +K+QGQC       
Sbjct: 109 RATFLG----AKFVERSRAAGERYRHDGVEELPESVDWREKGAVAPVKNQGQCVDRIIVW 164

Query: 153 -------------------------GSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCS 187
                                    GSCWAFSAV+ VE I Q+  G++I LSEQ+LV+CS
Sbjct: 165 NSMVRIYVVDAGCMLENPLMGLTVQGSCWAFSAVSTVESINQLVTGEMITLSEQELVECS 224

Query: 188 TD--NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLP 245
           T+  N GC+GGLMD AF++II+N G+ TE DYPY+  +G CD  +E A   +I  +ED+P
Sbjct: 225 TNGQNSGCNGGLMDDAFDFIIKNGGIDTEDDYPYKAVDGKCDINRENAKVVSIDGFEDVP 284

Query: 246 KGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENG 305
           + DE++L +AV++QPVSV ++A GR F  Y SGV +  CG + DHGV  VG+GT   +NG
Sbjct: 285 QNDEKSLQKAVAHQPVSVAIEAGGREFQLYHSGVFSGRCGTSLDHGVVAVGYGT---DNG 341

Query: 306 AKYWLIKNSWGETWGESGYIRILRD----AGLCGIATAASYPV 344
             YW+++NSWG  WGESGY+R+ R+     G CGIA  ASYP 
Sbjct: 342 KDYWIVRNSWGPKWGESGYVRMERNINATTGKCGIAMMASYPT 384


>gi|146215990|gb|ABQ10197.1| actinidin Act4a [Actinidia eriantha]
          Length = 385

 Score =  296 bits (759), Expect = 7e-78,   Method: Compositional matrix adjust.
 Identities = 139/312 (44%), Positives = 205/312 (65%), Gaps = 12/312 (3%)

Query: 38  IVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNE 97
           ++   E W+ ++G++Y    EK  R  IFK NL ++++ N + NR+YK+G N+FSDLT+ 
Sbjct: 44  VIAMFESWLVEYGKSYNALGEKERRFEIFKDNLRFVDEHNADVNRSYKVGLNQFSDLTDA 103

Query: 98  EFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWA 157
           E+ ++Y G    +    R ++    ++ +    +P S+DWR+KGAV  +K+QG CGSCW 
Sbjct: 104 EYSSIYLGTKFNI----RMTNVSDRYEPRVGDQLPDSVDWRKKGAVLGVKNQGNCGSCWT 159

Query: 158 FSAVAAVEGITQITRGKLIELSEQQLVDCSTD--NHGCSGGLMDKAFEYIIENKGLATEA 215
           F+++AAVEGI +I  G LI LSEQ++VDC     N+GC+GG +  A+++II N G+ TEA
Sbjct: 160 FASIAAVEGINKIVTGNLISLSEQEIVDCQRKYPNNGCNGGTLSGAYQFIINNGGINTEA 219

Query: 216 DYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFY 275
           +YPY   +G CD  K+     TI +YE++P  +E+AL +AV+ QPVSV + ++  AF  Y
Sbjct: 220 NYPYTGRDGVCDQNKKNKKYVTIDRYENVPSNNEKALQKAVAFQPVSVVIASNSTAFKSY 279

Query: 276 KSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD---AG 332
           KSG+ N  CG   DHGV +VG+GT   E G  YW+++NSWG  WGESGY+R+ R+   +G
Sbjct: 280 KSGIFNGPCGPRIDHGVTIVGYGT---EGGKDYWIVRNSWGPNWGESGYVRMQRNVGGSG 336

Query: 333 LCGIATAASYPV 344
            C IA A  YPV
Sbjct: 337 KCFIARAPVYPV 348


>gi|111073717|dbj|BAF02547.1| triticain beta [Triticum aestivum]
          Length = 472

 Score =  296 bits (759), Expect = 8e-78,   Method: Compositional matrix adjust.
 Identities = 147/321 (45%), Positives = 205/321 (63%), Gaps = 28/321 (8%)

Query: 42  HEQWMAQHGRTYKDEL----EKAMRLNIFKQNLEYIEKANKE---GNRTYKLGTNEFSDL 94
           ++ W+A++G           E+  R   F  NL +++  N     G   Y+LG N F+DL
Sbjct: 53  YDLWLAENGGGSSPNANSIPERERRFRAFWDNLNFVDAHNARAAAGEEGYRLGMNRFADL 112

Query: 95  TNEEFRALYTGYNRPVPSVSRQSSRPST-----FKYQNVTDVPTSIDWREKGAVTHIKDQ 149
           TN+EFRA Y G       V  Q +RP       +++    ++P ++DWREKGAV  +K+Q
Sbjct: 113 TNDEFRAAYLG-------VKAQRARPGRMVGERYRHDGAEELPEAVDWREKGAVAPVKNQ 165

Query: 150 GQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTDNH--GCSGGLMDKAFEYIIE 207
           GQCGSCWAFSAV+ VE I QI  G+++ LSEQ+LV+C T+    GC+GGLMD AFE+II+
Sbjct: 166 GQCGSCWAFSAVSTVESINQIVTGEMVTLSEQELVECDTNGQSSGCNGGLMDDAFEFIIK 225

Query: 208 NKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDA 267
           N G+ TE DYPY+  +G CD  ++ A   +I  +ED+P+ DE++L +AV++QPVSV ++A
Sbjct: 226 NGGIDTEDDYPYKAIDGRCDVLRKNAKVVSIDGFEDVPENDEKSLQKAVAHQPVSVAIEA 285

Query: 268 SGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRI 327
            GR F  Y SGV +  CG   DHGV  VG+GT   ENG  YW+++NSWG  WGESGY+R+
Sbjct: 286 GGREFQLYHSGVFSGRCGTQLDHGVVAVGYGT---ENGKDYWIVRNSWGPNWGESGYLRM 342

Query: 328 LRD----AGLCGIATAASYPV 344
            R+    +G CGIA  +SYP 
Sbjct: 343 ERNINVTSGKCGIAMMSSYPT 363


>gi|45738078|gb|AAS75836.1| fastuosain precursor [Bromelia fastuosa]
          Length = 324

 Score =  296 bits (759), Expect = 8e-78,   Method: Compositional matrix adjust.
 Identities = 143/316 (45%), Positives = 209/316 (66%), Gaps = 14/316 (4%)

Query: 35  EPS--IVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFS 92
           EPS  ++E+ E+WMA++GR Y D  EK  R  IFK N+ +IE  N     +Y LG N+F+
Sbjct: 1   EPSDPMMERFEEWMAEYGRVYNDNAEKMRRFQIFKNNVNHIETFNNRSGNSYTLGVNQFT 60

Query: 93  DLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQC 152
           D+TN EF A YTG + P+ ++ R      +F   +++ VP SIDWR+ GAVT +K+QG C
Sbjct: 61  DMTNNEFLARYTGASLPL-NIERDPV--VSFDDVDISAVPQSIDWRDYGAVTSVKNQGSC 117

Query: 153 GSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTDNHGCSGGLMDKAFEYIIENKGLA 212
           GSCWAFSA+A VEGI +I  G LI LSEQ+++DC+  ++GC GG ++KA+++II N G+ 
Sbjct: 118 GSCWAFSAIATVEGIYKIKAGNLISLSEQEVLDCAL-SYGCDGGWVNKAYDFIISNNGVT 176

Query: 213 TEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAF 272
           + A+ PY+  +G C N  +    A I+ Y  +   +E++++ AV+NQP++  +DA G  F
Sbjct: 177 SFANLPYKGYKGPC-NHNDLPNKAYITGYTYVQSNNERSMMIAVANQPIAALIDAGGD-F 234

Query: 273 HFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRDA- 331
            +YKSGV    CG + +H + V+G+G  +  +G KYW++KNSWG +WGE GYIR+ RD  
Sbjct: 235 QYYKSGVFTGSCGTSLNHAITVIGYG--QTSSGTKYWIVKNSWGTSWGERGYIRMARDVS 292

Query: 332 ---GLCGIATAASYPV 344
              GLCGIA A  +P 
Sbjct: 293 SPYGLCGIAMAPLFPT 308


>gi|224085750|ref|XP_002307688.1| predicted protein [Populus trichocarpa]
 gi|222857137|gb|EEE94684.1| predicted protein [Populus trichocarpa]
          Length = 436

 Score =  296 bits (758), Expect = 9e-78,   Method: Compositional matrix adjust.
 Identities = 141/313 (45%), Positives = 196/313 (62%), Gaps = 24/313 (7%)

Query: 43  EQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRAL 102
           E W  +HG++Y  + E++ RL +F+ N +++ K N +GN +Y L  N F+DLT+ EF+  
Sbjct: 30  ETWCKEHGKSYTSQEERSHRLKVFEDNYDFVTKHNSKGNSSYSLALNAFADLTHHEFKTS 89

Query: 103 YTGYNRPVPSVSRQSSRPSTFKYQN------VTDVPTSIDWREKGAVTHIKDQGQCGSCW 156
             G           S+ P    ++N      V D+P SIDWR KG VT++KDQG CG+CW
Sbjct: 90  RLGL----------SAAPLNLAHRNLEITGVVGDIPASIDWRNKGVVTNVKDQGSCGACW 139

Query: 157 AFSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGLATEA 215
           +FSA  A+EGI +I  G L+ LSEQ+L++C    N GC GGLMD AF+++I N G+ TE 
Sbjct: 140 SFSATGAIEGINKIVTGSLVSLSEQELIECDKSYNDGCGGGLMDYAFQFVINNHGIDTEE 199

Query: 216 DYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFY 275
           DYPYR  +GTC+  + K    TI KY D+P+ +E+ LLQAV+ QPVSV +  S RAF  Y
Sbjct: 200 DYPYRARDGTCNKDRMKRRVVTIDKYVDVPENNEKQLLQAVAAQPVSVGICGSERAFQMY 259

Query: 276 KSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRDA---- 331
             G+    C  + DH V +VG+G+   ENG  YW++KNSWG  WG  GY+ + R++    
Sbjct: 260 SKGIFTGPCSTSLDHAVLIVGYGS---ENGVDYWIVKNSWGTGWGMRGYMHMQRNSGNSQ 316

Query: 332 GLCGIATAASYPV 344
           G+CGI   ASYPV
Sbjct: 317 GVCGINMLASYPV 329


>gi|75277440|sp|O23791.1|BROM1_ANACO RecName: Full=Fruit bromelain; AltName: Allergen=Ana c 2; Flags:
           Precursor
 gi|2342496|dbj|BAA21849.1| bromelain [Ananas comosus]
          Length = 351

 Score =  296 bits (757), Expect = 1e-77,   Method: Compositional matrix adjust.
 Identities = 138/335 (41%), Positives = 218/335 (65%), Gaps = 12/335 (3%)

Query: 13  MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEY 72
           +F+ + L    AS   + R      ++++ E+WMA++GR YKD+ EK  R  IFK N+++
Sbjct: 8   VFLFLFLCAMWASPSAASRDEPNDPMMKRFEEWMAEYGRVYKDDDEKMRRFQIFKNNVKH 67

Query: 73  IEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVP 132
           IE  N     +Y LG N+F+D+T  EF A YTG + P+ ++ R+     +F   N++ VP
Sbjct: 68  IETFNSRNENSYTLGINQFTDMTKSEFVAQYTGVSLPL-NIEREPV--VSFDDVNISAVP 124

Query: 133 TSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTDNHG 192
            SIDWR+ GAV  +K+Q  CGSCW+F+A+A VEGI +I  G L+ LSEQ+++DC+  ++G
Sbjct: 125 QSIDWRDYGAVNEVKNQNPCGSCWSFAAIATVEGIYKIKTGYLVSLSEQEVLDCAV-SYG 183

Query: 193 CSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQAL 252
           C GG ++KA+++II N G+ TE +YPY   +GTC N      +A I+ Y  + + DE+++
Sbjct: 184 CKGGWVNKAYDFIISNNGVTTEENYPYLAYQGTC-NANSFPNSAYITGYSYVRRNDERSM 242

Query: 253 LQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIK 312
           + AVSNQP++  +DAS   F +Y  GV +  CG + +H + ++G+G  ++ +G KYW+++
Sbjct: 243 MYAVSNQPIAALIDAS-ENFQYYNGGVFSGPCGTSLNHAITIIGYG--QDSSGTKYWIVR 299

Query: 313 NSWGETWGESGYIRILR----DAGLCGIATAASYP 343
           NSWG +WGE GY+R+ R     +G+CGIA A  +P
Sbjct: 300 NSWGSSWGEGGYVRMARGVSSSSGVCGIAMAPLFP 334


>gi|302763837|ref|XP_002965340.1| hypothetical protein SELMODRAFT_143126 [Selaginella moellendorffii]
 gi|302790566|ref|XP_002977050.1| hypothetical protein SELMODRAFT_232903 [Selaginella moellendorffii]
 gi|300155026|gb|EFJ21659.1| hypothetical protein SELMODRAFT_232903 [Selaginella moellendorffii]
 gi|300167573|gb|EFJ34178.1| hypothetical protein SELMODRAFT_143126 [Selaginella moellendorffii]
          Length = 300

 Score =  296 bits (757), Expect = 1e-77,   Method: Compositional matrix adjust.
 Identities = 146/304 (48%), Positives = 199/304 (65%), Gaps = 10/304 (3%)

Query: 43  EQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRAL 102
           E W A+HG++Y  + EKA RL IF   L YIEK N   N T+ LG N+FSDLTN EFRA 
Sbjct: 3   EGWAAKHGKSYSSDWEKARRLMIFSDTLAYIEKHNALPNTTFTLGLNKFSDLTNAEFRAN 62

Query: 103 YTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVA 162
           Y G  +P      Q  RP+     +V+ +PTS+DWR++GAVT IKDQGQCGSCWAFSA+A
Sbjct: 63  YVGKFKPP---RYQDRRPAKDVDVDVSSLPTSLDWRQEGAVTPIKDQGQCGSCWAFSAIA 119

Query: 163 AVEGITQITRGKLIELSEQQLVDCSTDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHE 222
           ++E    +   +L+ LSEQQL+DC T + GC GG  + AF++++EN G+ TE  YPY   
Sbjct: 120 SIESAHFLATKELVSLSEQQLIDCDTVDQGCQGGFPEDAFKFVVENGGVTTEEAYPYTGF 179

Query: 223 EGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNA 282
            G+C+  K K V   I+ Y+D+ K    AL++AVS  PV+V +  S + F  Y+SG+L+ 
Sbjct: 180 AGSCNANKNKVVE--ITGYKDVTKDSADALMKAVSKTPVTVGICGSDQNFQNYRSGILSG 237

Query: 283 DCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD--AGLCGIATAA 340
            C N+ DH V V+G+GT   E G  YW+IKNSWG +WGE G++RI ++   G+CG+   +
Sbjct: 238 HCSNSRDHAVLVIGYGT---EGGMPYWIIKNSWGTSWGEDGFMRIKKEDGEGMCGMNGQS 294

Query: 341 SYPV 344
           SYP 
Sbjct: 295 SYPT 298


>gi|359359166|gb|AEV41071.1| putative oryzain beta chain precursor [Oryza minuta]
          Length = 464

 Score =  296 bits (757), Expect = 1e-77,   Method: Compositional matrix adjust.
 Identities = 144/310 (46%), Positives = 203/310 (65%), Gaps = 14/310 (4%)

Query: 42  HEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKAN-KEGNRTYKLGTNEFSDLTNEEFR 100
           ++ W+A++GR+Y    E   R  +F  NL + +  N +  +  ++LG N F+DLTNEEFR
Sbjct: 53  YDLWLAENGRSYNALGEHERRFRVFWDNLRFADAHNARADDHGFRLGMNRFADLTNEEFR 112

Query: 101 ALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSA 160
           A + G       V R  +    +++  V ++P S+DWREKGAV  +K+QGQCGSCWAFSA
Sbjct: 113 ATFLG----AKVVERSRAAGERYRHDGVEELPESVDWREKGAVAPVKNQGQCGSCWAFSA 168

Query: 161 VAAVEGITQITRGKLIELSEQQLVDCSTDNHGCSGG--LMDKAFEYIIENKGLATEADYP 218
           V+ VE I Q+  G++I LSEQ+LV+CST+         LMD AF++II+N G+ TE DYP
Sbjct: 169 VSTVESINQLVTGEMITLSEQELVECSTNGQNGGCNGGLMDDAFDFIIKNGGIDTEDDYP 228

Query: 219 YRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSG 278
           Y+  +G CD  +E A   +I  +ED+P+ DE++L +AV++QPVSV ++A GR F  Y SG
Sbjct: 229 YKAVDGKCDINRENAKVVSIDGFEDVPQNDEKSLQKAVAHQPVSVAIEAGGREFQLYHSG 288

Query: 279 VLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD----AGLC 334
           V +  CG + DHGV  VG+GT   +NG  YW+++NSWG  WGESGY+R+ R+     G C
Sbjct: 289 VFSGRCGTSLDHGVVAVGYGT---DNGKDYWIVRNSWGPKWGESGYVRMERNINVTTGKC 345

Query: 335 GIATAASYPV 344
           GIA  ASYP 
Sbjct: 346 GIAMMASYPT 355


>gi|357154164|ref|XP_003576692.1| PREDICTED: vignain-like [Brachypodium distachyon]
          Length = 427

 Score =  296 bits (757), Expect = 1e-77,   Method: Compositional matrix adjust.
 Identities = 150/312 (48%), Positives = 192/312 (61%), Gaps = 11/312 (3%)

Query: 41  KHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFR 100
           + EQWM +HGR Y +  EK  R  ++K+NL  IE+ N  G   Y L  N+F+DLTNEEFR
Sbjct: 118 RFEQWMGKHGRAYANGGEKQRRFEVYKENLALIEEFNS-GGHGYTLTDNKFADLTNEEFR 176

Query: 101 ALYTGYNRPVPSVSRQSSRPSTF----KYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCW 156
           A   G     P   R++   S         N TD+P  +DWR+KGAV  +K+QG CGSCW
Sbjct: 177 AKMLGGLGADPDRRRRARHASNALELPGNDNSTDLPKDVDWRKKGAVVEVKNQGSCGSCW 236

Query: 157 AFSAVAAVEGITQITRGKLIELSEQQLVDCSTDNHGCSGGLMDKAFEYIIENKGLATEAD 216
           AFSAVAA+EG+ QI  GKL+ LSEQ+LVDC  +  GC+GG M  AFE+++ N GL TEA 
Sbjct: 237 AFSAVAAMEGLNQIKNGKLVSLSEQELVDCDAEAVGCAGGFMSWAFEFVMANHGLTTEAS 296

Query: 217 YPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYK 276
           YPY+   G C   K    + +I+ Y ++    E  LL+  + QPVSV VDA G  F  Y 
Sbjct: 297 YPYKGINGACQTAKLNESSVSITGYVNVTVNSEAELLKVAAVQPVSVAVDAGGFLFQLYA 356

Query: 277 SGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRDA----G 332
            GV +  C    +HGV VVG+G  E +   KYW++KNSWG  WGE+GY+ + RDA    G
Sbjct: 357 GGVFSGPCTAQINHGVTVVGYG--ETDKAEKYWIVKNSWGPEWGEAGYMLMQRDAGVPTG 414

Query: 333 LCGIATAASYPV 344
           LCGIA  ASYPV
Sbjct: 415 LCGIAMLASYPV 426


>gi|18408616|ref|NP_566901.1| putative cysteine proteinase [Arabidopsis thaliana]
 gi|75313880|sp|Q9STL5.1|CEP3_ARATH RecName: Full=KDEL-tailed cysteine endopeptidase CEP3; Flags:
           Precursor
 gi|4678353|emb|CAB41163.1| cysteine endopeptidase precursor-like protein [Arabidopsis
           thaliana]
 gi|26453052|dbj|BAC43602.1| putative cysteine endopeptidase precursor [Arabidopsis thaliana]
 gi|332644885|gb|AEE78406.1| putative cysteine proteinase [Arabidopsis thaliana]
          Length = 364

 Score =  295 bits (756), Expect = 2e-77,   Method: Compositional matrix adjust.
 Identities = 153/349 (43%), Positives = 211/349 (60%), Gaps = 17/349 (4%)

Query: 11  IPMFVIIILVITCASQVVSGRSMHEP------SIVEKHEQWMAQHGRTYKDELEKAMRLN 64
           + +F I+++      Q   G    E       ++ + +E+W   H  + +   E   R N
Sbjct: 1   MKLFFIVLISFLSLLQASKGFDFDEKELETEENVWKLYERWRGHHSVS-RASHEAIKRFN 59

Query: 65  IFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPST-F 123
           +F+ N+ ++ + NK+ N+ YKL  N F+D+T+ EFR+ Y G N     + R   R S  F
Sbjct: 60  VFRHNVLHVHRTNKK-NKPYKLKINRFADITHHEFRSSYAGSNVKHHRMLRGPKRGSGGF 118

Query: 124 KYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQL 183
            Y+NVT VP+S+DWREKGAVT +K+Q  CGSCWAFS VAAVEGI +I   KL+ LSEQ+L
Sbjct: 119 MYENVTRVPSSVDWREKGAVTEVKNQQDCGSCWAFSTVAAVEGINKIRTNKLVSLSEQEL 178

Query: 184 VDCST-DNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEE-GTCDNQKEKAVAATISKY 241
           VDC T +N GC+GGLM+ AFE+I  N G+ TE  YPY   +   C          TI  +
Sbjct: 179 VDCDTEENQGCAGGLMEPAFEFIKNNGGIKTEETYPYDSSDVQFCRANSIGGETVTIDGH 238

Query: 242 EDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAE 301
           E +P+ DE+ LL+AV++QPVSV +DA    F  Y  GV   +CG   +HGV +VG+G  E
Sbjct: 239 EHVPENDEEELLKAVAHQPVSVAIDAGSSDFQLYSEGVFIGECGTQLNHGVVIVGYG--E 296

Query: 302 EENGAKYWLIKNSWGETWGESGYIRILR----DAGLCGIATAASYPVAI 346
            +NG KYW+++NSWG  WGE GY+RI R    + G CGIA  ASYP  +
Sbjct: 297 TKNGTKYWIVRNSWGPEWGEGGYVRIERGISENEGRCGIAMEASYPTKL 345


>gi|14600257|gb|AAK71314.1|AF388175_1 papain-like cysteine peptidase XBCP3 [Arabidopsis thaliana]
          Length = 437

 Score =  295 bits (756), Expect = 2e-77,   Method: Compositional matrix adjust.
 Identities = 144/312 (46%), Positives = 196/312 (62%), Gaps = 11/312 (3%)

Query: 38  IVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNE 97
           I E  + W  +HG+TY  E E+  R+ IFK N +++ + N   N TY L  N F+DLT+ 
Sbjct: 28  ISELFDDWCQKHGKTYGSEEERQQRIQIFKDNHDFVTQHNLITNATYSLSLNAFADLTHH 87

Query: 98  EFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWA 157
           EF+A   G +   PSV   S   S         VP S+DWR+KGAVT++KDQG CG+CW+
Sbjct: 88  EFKASRLGLSVSAPSVIMASKGQS---LGGSVKVPDSVDWRKKGAVTNVKDQGSCGACWS 144

Query: 158 FSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGLATEAD 216
           FSA  A+EGI QI  G LI LSEQ+L+DC    N GC+GGLMD AFE++I+N G+ TE D
Sbjct: 145 FSATGAMEGINQIVTGDLISLSEQELIDCDKSYNAGCNGGLMDYAFEFVIKNHGIDTEKD 204

Query: 217 YPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYK 276
           YPY+  +GTC   K K    TI  Y  +   DE+AL++AV+ QPVSV +  S RAF  Y 
Sbjct: 205 YPYQERDGTCKKDKLKQKVVTIDSYAGVKSNDEKALMEAVAAQPVSVGICGSERAFQLYS 264

Query: 277 SGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRDA----G 332
            G+ +  C  + DH V +VG+G+   +NG  YW++KNSWG++WG  G++ + R+     G
Sbjct: 265 RGIFSGPCSTSLDHAVLIVGYGS---QNGVDYWIVKNSWGKSWGMDGFMHMQRNTENSDG 321

Query: 333 LCGIATAASYPV 344
           +CGI   ASYP+
Sbjct: 322 VCGINMLASYPI 333


>gi|302142276|emb|CBI19479.3| unnamed protein product [Vitis vinifera]
          Length = 388

 Score =  295 bits (755), Expect = 2e-77,   Method: Compositional matrix adjust.
 Identities = 150/309 (48%), Positives = 193/309 (62%), Gaps = 43/309 (13%)

Query: 42  HEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRA 101
           +E W+A+HG++Y    EK  R  IFK NL +I++ N E NRTYK+               
Sbjct: 4   YEAWLAKHGKSYNALGEKERRFQIFKDNLRFIDEHNAE-NRTYKI--------------- 47

Query: 102 LYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAV 161
                          S R   + ++    +P S+DWR+KGAV  +KDQG CGSCWAFS +
Sbjct: 48  ---------------SDR---YAFRVGDSLPESVDWRKKGAVVEVKDQGSCGSCWAFSTI 89

Query: 162 AAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGLATEADYPYR 220
           AAVEGI +I  G LI LSEQ+LVDC T  N GC+GGLMD AFE+II N G+ +E DYPY+
Sbjct: 90  AAVEGINKIVTGGLISLSEQELVDCDTSYNEGCNGGLMDYAFEFIINNGGIDSEEDYPYK 149

Query: 221 HEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVL 280
             +G CD  ++ A   TI  YED+P+ DE++L +AV+NQPVSV ++A GR F  Y+SG+ 
Sbjct: 150 ASDGRCDQYRKNAKVVTIDGYEDVPENDEKSLEKAVANQPVSVAIEAGGREFQLYQSGIF 209

Query: 281 NADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD-----AGLCG 335
              CG   DHGV  VG+GT   ENG  YW++KNSWG +WGE GYIR+ RD      G CG
Sbjct: 210 TGRCGTALDHGVTAVGYGT---ENGVDYWIVKNSWGASWGEEGYIRMERDLATSATGKCG 266

Query: 336 IATAASYPV 344
           IA  ASYP+
Sbjct: 267 IAMEASYPI 275


>gi|302790570|ref|XP_002977052.1| hypothetical protein SELMODRAFT_268054 [Selaginella moellendorffii]
 gi|300155028|gb|EFJ21661.1| hypothetical protein SELMODRAFT_268054 [Selaginella moellendorffii]
          Length = 300

 Score =  295 bits (755), Expect = 2e-77,   Method: Compositional matrix adjust.
 Identities = 146/304 (48%), Positives = 198/304 (65%), Gaps = 10/304 (3%)

Query: 43  EQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRAL 102
           E W A+HG++Y  + EKA RL IF   L YIEK N   N T+ LG N+FSDLTN EFRA 
Sbjct: 3   EGWAAKHGKSYSSDWEKARRLMIFSDTLAYIEKHNALPNTTFTLGLNKFSDLTNAEFRAN 62

Query: 103 YTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVA 162
           Y G  +P      Q  RP+     +V+ +PTS+DWR++GAVT IKDQGQCGSCWAFSA+A
Sbjct: 63  YVGKFKPP---RYQDRRPAKDVDVDVSSLPTSLDWRQEGAVTPIKDQGQCGSCWAFSAIA 119

Query: 163 AVEGITQITRGKLIELSEQQLVDCSTDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHE 222
           ++E    +   +L+ LSEQQL+DC T + GC GG  + AF++++EN G+ TE  YPY   
Sbjct: 120 SIESAHFLATKELVSLSEQQLIDCDTVDQGCQGGFPEDAFKFVVENGGVTTEEAYPYTGF 179

Query: 223 EGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNA 282
            G+C+  K K V   I+ Y+D+ K    AL++AVS  PV+V +  S + F  Y+SG+L+ 
Sbjct: 180 AGSCNANKNKVVE--ITGYKDVTKDSADALMKAVSKTPVTVGICGSDQNFQNYRSGILSG 237

Query: 283 DCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD--AGLCGIATAA 340
            C N+ DH V V+G+GT   E G  YW+IKNSWG +WGE G++RI +    G+CG+   +
Sbjct: 238 HCSNSRDHAVLVIGYGT---EGGMPYWIIKNSWGTSWGEDGFMRIKKKDGEGMCGMNGQS 294

Query: 341 SYPV 344
           SYP 
Sbjct: 295 SYPT 298


>gi|242094000|ref|XP_002437490.1| hypothetical protein SORBIDRAFT_10g028000 [Sorghum bicolor]
 gi|241915713|gb|EER88857.1| hypothetical protein SORBIDRAFT_10g028000 [Sorghum bicolor]
          Length = 372

 Score =  295 bits (755), Expect = 2e-77,   Method: Compositional matrix adjust.
 Identities = 146/310 (47%), Positives = 192/310 (61%), Gaps = 12/310 (3%)

Query: 42  HEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKE---GNRTYKLGTNEFSDLTNEE 98
           +E W ++HG  +  +    +RL +F+ NL YI+  N E   G  T++LG   F+DLT EE
Sbjct: 52  YEAWKSEHGHGHGSD--DRLRLEVFRDNLRYIDAHNAEADAGLHTFRLGLTPFADLTLEE 109

Query: 99  FRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAF 158
           +R    G+       SR  S  S        D+P +IDWRE GAVT +K+Q QCG CWAF
Sbjct: 110 YRGRALGFRARRGGASRVGSGSSYRPRPRGGDLPDAIDWRELGAVTGVKNQEQCGGCWAF 169

Query: 159 SAVAAVEGITQITRGKLIELSEQQLVDCSTDNHGCSGGLMDKAFEYIIENKGLATEADYP 218
           SAVAA+EGI +I  G L+ LSEQ+++DC T + GC+GG M  AF+++I N G+ TEADYP
Sbjct: 170 SAVAAIEGINEIVTGNLVSLSEQEIIDCDTQDGGCNGGEMQNAFQFVINNGGIDTEADYP 229

Query: 219 YRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSG 278
           Y   +  CD  +      TI  +  +   +E AL +AV+NQPVSV +DASGR F  Y SG
Sbjct: 230 YLGTDAACDANRVNERVVTIDGFVSVATENETALQEAVANQPVSVAIDASGRKFQHYTSG 289

Query: 279 VLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD----AGLC 334
           + N  CG   DHGV  VG+G+   ENG  YW++KNSW  +WGE+GYIRI R+     G C
Sbjct: 290 IFNGPCGTQLDHGVTAVGYGS---ENGKDYWIVKNSWSSSWGEAGYIRIRRNVAAATGKC 346

Query: 335 GIATAASYPV 344
           GIA  ASYPV
Sbjct: 347 GIAMDASYPV 356


>gi|334904467|gb|AEH26024.1| cysteine peptidase [Ananas comosus]
          Length = 352

 Score =  295 bits (754), Expect = 3e-77,   Method: Compositional matrix adjust.
 Identities = 139/336 (41%), Positives = 218/336 (64%), Gaps = 13/336 (3%)

Query: 13  MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEY 72
           +F+ + L +  AS   + R      ++++ E+WMA++GR YKD  EK  R  IFK N+ +
Sbjct: 8   VFLFLFLCVMWASPSAASRDEPSDPMMKRFEEWMAEYGRVYKDNDEKMRRFQIFKNNVNH 67

Query: 73  IEKANKEGNRTYKLGTNEFSDLTNEEFRALYTG-YNRPVPSVSRQSSRPSTFKYQNVTDV 131
           IE  N     +Y LG N+F+D+T  EF A YTG  +RP+ ++ R+     +F   N++ V
Sbjct: 68  IETFNSHNGNSYTLGINQFTDMTKSEFVAQYTGGISRPL-NIEREPV--VSFDDVNISAV 124

Query: 132 PTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTDNH 191
           P SIDWR+ GAV  +K+Q  CGSCWAF+A+A VEGI +I  G L+ LSEQ+++DC+  ++
Sbjct: 125 PQSIDWRDYGAVNEVKNQNPCGSCWAFAAIATVEGIYKIKTGYLVSLSEQEVLDCAV-SY 183

Query: 192 GCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQA 251
           GC GG ++KA+++II N G+ TE +YPY+  +GTC N      +A I+ Y  + + DE++
Sbjct: 184 GCKGGWVNKAYDFIISNNGVTTEENYPYQAYQGTC-NANSFPNSAYITGYSYVRRNDERS 242

Query: 252 LLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLI 311
           ++ AVSNQP++  +DAS   F +Y  GV +  CG + +H + ++G+G  ++ +G KYW++
Sbjct: 243 MMYAVSNQPIAALIDAS-ENFQYYNGGVFSGPCGTSLNHAITIIGYG--QDSSGTKYWIV 299

Query: 312 KNSWGETWGESGYIRILR----DAGLCGIATAASYP 343
           +NSWG +WGE GY+R+ R     +G CGIA +  +P
Sbjct: 300 RNSWGSSWGEGGYVRMARGVSSSSGACGIAMSPLFP 335


>gi|302790836|ref|XP_002977185.1| hypothetical protein SELMODRAFT_106228 [Selaginella moellendorffii]
 gi|300155161|gb|EFJ21794.1| hypothetical protein SELMODRAFT_106228 [Selaginella moellendorffii]
          Length = 299

 Score =  295 bits (754), Expect = 3e-77,   Method: Compositional matrix adjust.
 Identities = 145/304 (47%), Positives = 199/304 (65%), Gaps = 10/304 (3%)

Query: 43  EQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRAL 102
           E W A+HG++Y  + EKA RL IF   L YIEK N + N T+ LG N+FSDLTN EFRA 
Sbjct: 3   EDWAAKHGKSYSSDSEKARRLMIFSDTLAYIEKHNAQPNTTFTLGLNKFSDLTNAEFRAN 62

Query: 103 YTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVA 162
           Y G      S   Q  RP+     +V+ +PTS+DWR++GAVT IKDQGQCGSCWAFSA+A
Sbjct: 63  YVG---KFKSPRYQDRRPAKDVDVDVSSLPTSLDWRQEGAVTPIKDQGQCGSCWAFSAIA 119

Query: 163 AVEGITQITRGKLIELSEQQLVDCSTDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHE 222
           ++E    +   +L+ LSEQQL+DC T + GC GG  + AF++++EN G+ TE  YPY   
Sbjct: 120 SIESAHFLATKELVSLSEQQLIDCDTVDQGCQGGFPEDAFKFVVENGGVTTEEAYPYTGF 179

Query: 223 EGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNA 282
            G+C+  K K V   I+ Y+D+ K    AL++AVS  PV+V +  S + F  Y+SG+L+ 
Sbjct: 180 AGSCNANKNKVVE--ITGYKDVTKDSADALMKAVSKTPVTVGICGSDQNFQNYRSGILSG 237

Query: 283 DCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD--AGLCGIATAA 340
            C N+ DH V V+G+GT   E G  YW+IKNSWG +WGE+G+++I +    G+CG+   +
Sbjct: 238 QCSNSRDHAVLVIGYGT---EGGMPYWIIKNSWGTSWGENGFMKIKKKDGEGMCGMNGQS 294

Query: 341 SYPV 344
           SYP 
Sbjct: 295 SYPT 298


>gi|190358935|sp|P00785.4|ACTN_ACTCH RecName: Full=Actinidain; Short=Actinidin; AltName: Allergen=Act c
           1; Flags: Precursor
 gi|12744965|gb|AAK06862.1|AF343446_1 actinidin protease [Actinidia chinensis]
          Length = 380

 Score =  294 bits (753), Expect = 3e-77,   Method: Compositional matrix adjust.
 Identities = 152/351 (43%), Positives = 216/351 (61%), Gaps = 18/351 (5%)

Query: 3   LKFEKSFIIP--MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKA 60
           +   KSF+    +F   +L+++ A    +        +   +E W+ ++G++Y    E  
Sbjct: 1   MGLPKSFVSMSLLFFSTLLILSLAFNAKNLTQRTNDEVKAMYESWLIKYGKSYNSLGEWE 60

Query: 61  MRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRP 120
            R  IFK+ L +I++ N + NR+YK+G N+F+DLT+EEFR+ Y  +     S S ++   
Sbjct: 61  RRFEIFKETLRFIDEHNADTNRSYKVGLNQFADLTDEEFRSTYLRFT----SGSNKTKVS 116

Query: 121 STFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSE 180
           + ++ +    +P+ +DWR  GAV  IK QG+CG CWAFSA+A VEGI +I  G LI LSE
Sbjct: 117 NRYEPRVGQVLPSYVDWRSAGAVVDIKSQGECGGCWAFSAIATVEGINKIVTGVLISLSE 176

Query: 181 QQLVDC--STDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTC--DNQKEKAVAA 236
           Q+L+DC  + +  GC+GG +   F++II N G+ TE +YPY  ++G C  D Q EK V  
Sbjct: 177 QELIDCGRTQNTRGCNGGYITDGFQFIINNGGINTEENYPYTAQDGECNVDLQNEKYV-- 234

Query: 237 TISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVG 296
           TI  YE++P  +E AL  AV+ QPVSV +DA+G AF  Y SG+    CG   DH V +VG
Sbjct: 235 TIDTYENVPYNNEWALQTAVTYQPVSVALDAAGDAFKQYSSGIFTGPCGTAVDHAVTIVG 294

Query: 297 FGTAEEENGAKYWLIKNSWGETWGESGYIRILRD---AGLCGIATAASYPV 344
           +GT   E G  YW++KNSW  TWGE GY+RILR+   AG CGIAT  SYPV
Sbjct: 295 YGT---EGGIDYWIVKNSWDTTWGEEGYMRILRNVGGAGTCGIATMPSYPV 342


>gi|297809383|ref|XP_002872575.1| hypothetical protein ARALYDRAFT_911472 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297318412|gb|EFH48834.1| hypothetical protein ARALYDRAFT_911472 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 371

 Score =  294 bits (753), Expect = 3e-77,   Method: Compositional matrix adjust.
 Identities = 152/357 (42%), Positives = 218/357 (61%), Gaps = 31/357 (8%)

Query: 13  MFVIIILVIT-CAS----QVVSGRSMH---------------EPSIVEKHEQWMAQHGRT 52
           + +++ +VIT CA+     VVS  + H               E S++   + WM +HG+ 
Sbjct: 9   LILLVAMVITSCATAMDMSVVSSNNNHHLTTSPGRLHSGFDAEASLI--FDSWMVKHGKV 66

Query: 53  YKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPS 112
           Y    EK  RL IF+ NL +I   N E N +Y+LG  +F+DL+  E+  +  G +   P 
Sbjct: 67  YGSVAEKERRLTIFEDNLRFISNRNAE-NLSYRLGLTQFADLSLHEYGEVCHGADPRPPR 125

Query: 113 VSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITR 172
                +    +K      +P S+DWR +GAVT +KDQG C SCWAFS V AVEG+ +I  
Sbjct: 126 NHVFMTSSDRYKTSAGDVLPKSVDWRNEGAVTEVKDQGHCRSCWAFSTVGAVEGLNKIVT 185

Query: 173 GKLIELSEQQLVDCSTDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQ-KE 231
           G+L+ LSEQ L++C+ +N+GC GG ++ A+E+I++N GL T+ DYPY+   G CD + KE
Sbjct: 186 GELVTLSEQDLINCNKENNGCGGGKVETAYEFIMKNGGLGTDNDYPYKAVNGVCDGRLKE 245

Query: 232 KAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHG 291
                 I  +E+LP  DE AL++AV++QPV+  +D+S R F  Y+SGV +  CG N +HG
Sbjct: 246 NNKNVMIDGFENLPANDEFALMKAVAHQPVTAVIDSSSREFQLYESGVFDGSCGTNLNHG 305

Query: 292 VAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRDA----GLCGIATAASYPV 344
           V VVG+GT   ENG  YWL+KNS G TWGE+GY+++ R+     GLCGIA  ASYP+
Sbjct: 306 VVVVGYGT---ENGRDYWLVKNSRGNTWGEAGYMKMARNIANPRGLCGIAMRASYPL 359


>gi|160858205|dbj|BAF93840.1| triticain beta 2 [Triticum aestivum]
          Length = 469

 Score =  294 bits (753), Expect = 4e-77,   Method: Compositional matrix adjust.
 Identities = 145/321 (45%), Positives = 207/321 (64%), Gaps = 28/321 (8%)

Query: 42  HEQWMAQHGR-TYKDE---LEKAMRLNIFKQNLEYIEKANKE---GNRTYKLGTNEFSDL 94
           ++ W+A+HG  +Y +     E+  R   F  NL +++  N     G   ++L  N F+DL
Sbjct: 50  YDLWLAEHGGGSYPNANSIPERERRFRAFWDNLRFVDAHNARAAAGEEGFRLAMNRFADL 109

Query: 95  TNEEFRALYTGYNRPVPSVSRQSSRPST-----FKYQNVTDVPTSIDWREKGAVTHIKDQ 149
           TN+EFRA Y G       V  Q +RP       +++    ++P ++DWREKGAV  +K+Q
Sbjct: 110 TNDEFRAAYLG-------VKGQRARPGRVVGERYRHDGAEELPEAVDWREKGAVAPVKNQ 162

Query: 150 GQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTDNH--GCSGGLMDKAFEYIIE 207
           GQCGSCWAFSA++ VE I QI  G+++ LSEQ+LV+C T+    GC+GGLMD AFE+II+
Sbjct: 163 GQCGSCWAFSAISTVESINQIVTGEMVTLSEQELVECDTNGQSSGCNGGLMDDAFEFIIK 222

Query: 208 NKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDA 267
           N G+ TE DYPY+  +G CD  ++ A   +I  +ED+P+ DE++L +AV++QPVSV ++A
Sbjct: 223 NGGIDTEDDYPYKAIDGRCDVLRKNAKVVSIDGFEDVPENDEKSLQKAVAHQPVSVAIEA 282

Query: 268 SGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRI 327
            GR F  Y SGV +  CG   DHGV  VG+GT   ENG  YW+++NSWG  WGE+GY+R+
Sbjct: 283 GGREFQLYHSGVFSGRCGTQLDHGVVAVGYGT---ENGKDYWIVRNSWGPNWGEAGYLRM 339

Query: 328 LRD----AGLCGIATAASYPV 344
            R+    +G CGIA  +SYP 
Sbjct: 340 ERNINVTSGKCGIAMMSSYPT 360


>gi|312451845|gb|ADQ85986.1| actinidin [Actinidia chinensis]
          Length = 380

 Score =  294 bits (752), Expect = 4e-77,   Method: Compositional matrix adjust.
 Identities = 152/351 (43%), Positives = 215/351 (61%), Gaps = 18/351 (5%)

Query: 3   LKFEKSFIIP--MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKA 60
           +   KSF+    +F   +L+++ A    +        +   +E W+ ++G++Y    E  
Sbjct: 1   MGLPKSFVSMSLLFFSTLLILSLAFNAKNLTQRTNDEVKAMYESWLIKYGKSYNSLGEWE 60

Query: 61  MRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRP 120
            R  IFK+ L +I++ N + NR+YK+G N+F+DLT+EEFR+ Y G+     S S ++   
Sbjct: 61  RRFEIFKETLRFIDEHNADTNRSYKVGLNQFADLTDEEFRSTYLGFT----SGSNKTKVS 116

Query: 121 STFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSE 180
           + ++ +    +P+ +DWR  GAV  IK QG+CG CWAFSA+A VEGI +I  G LI LSE
Sbjct: 117 NRYEPRVGQVLPSYVDWRSAGAVVDIKSQGECGGCWAFSAIATVEGINKIVTGVLISLSE 176

Query: 181 QQLVDC--STDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTC--DNQKEKAVAA 236
           Q+L+DC  + +  GC+G  +   F +II N G+ TE +YPY  ++G C  D Q EK V  
Sbjct: 177 QELIDCGRTQNTRGCNGSYITDGFPFIINNGGINTEENYPYTAQDGECNVDLQNEKYV-- 234

Query: 237 TISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVG 296
           TI  YE++P  +E AL  AV+ QPVSV +DA+G AF  Y SG+    CG   DH V +VG
Sbjct: 235 TIDTYENVPYNNEWALQTAVTYQPVSVALDAAGDAFKQYSSGIFTGPCGTAIDHAVTIVG 294

Query: 297 FGTAEEENGAKYWLIKNSWGETWGESGYIRILRD---AGLCGIATAASYPV 344
           +GT   E G  YW++KNSW  TWGE GY+RILR+   AG CGIAT  SYPV
Sbjct: 295 YGT---EGGIDYWIVKNSWDTTWGEEGYMRILRNVGGAGTCGIATMPSYPV 342


>gi|302763109|ref|XP_002964976.1| hypothetical protein SELMODRAFT_83176 [Selaginella moellendorffii]
 gi|302763113|ref|XP_002964978.1| hypothetical protein SELMODRAFT_83554 [Selaginella moellendorffii]
 gi|300167209|gb|EFJ33814.1| hypothetical protein SELMODRAFT_83176 [Selaginella moellendorffii]
 gi|300167211|gb|EFJ33816.1| hypothetical protein SELMODRAFT_83554 [Selaginella moellendorffii]
          Length = 300

 Score =  293 bits (751), Expect = 5e-77,   Method: Compositional matrix adjust.
 Identities = 144/304 (47%), Positives = 198/304 (65%), Gaps = 10/304 (3%)

Query: 43  EQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRAL 102
           E W A+H ++Y  + EKA RL +F   L YIEK N + N T+ LG N+FSDLTN EFRA 
Sbjct: 3   EDWAAKHDKSYSSDWEKARRLMVFSDTLAYIEKHNAQPNTTFTLGLNKFSDLTNAEFRAN 62

Query: 103 YTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVA 162
           Y G  +P      Q  RP+     +V+ +PTS+DWR++GAVT IKDQGQCGSCWAFSA+A
Sbjct: 63  YVGKFKPP---RYQDRRPAKDVDVDVSSLPTSLDWRQEGAVTPIKDQGQCGSCWAFSAIA 119

Query: 163 AVEGITQITRGKLIELSEQQLVDCSTDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHE 222
           ++E    +   +L+ LSEQQL+DC T + GC GG  D AF++++EN G+ TE  YPY   
Sbjct: 120 SIESAHFLATKELVSLSEQQLIDCDTVDQGCQGGFPDDAFKFVVENGGVTTEEAYPYTGF 179

Query: 223 EGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNA 282
            G+C+  K K V   I+ Y+D+ K    AL++AVS  PV+V +  S + F  Y+SG+L+ 
Sbjct: 180 AGSCNTNKNKVVE--ITGYKDVTKDSADALMKAVSKTPVTVGICGSDQNFQNYRSGILSG 237

Query: 283 DCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD--AGLCGIATAA 340
            C N+ DH V V+G+GT   E G  YW+IKNSWG +WGE G+++I +    G+CG+   +
Sbjct: 238 QCCNSRDHAVLVIGYGT---EGGMPYWIIKNSWGTSWGEDGFMKIKKKDGEGMCGMNGQS 294

Query: 341 SYPV 344
           SYP 
Sbjct: 295 SYPT 298


>gi|57118005|gb|AAW34134.1| cysteine protease gp2a [Zingiber officinale]
          Length = 381

 Score =  293 bits (751), Expect = 6e-77,   Method: Compositional matrix adjust.
 Identities = 146/292 (50%), Positives = 194/292 (66%), Gaps = 16/292 (5%)

Query: 62  RLNIFKQNLEYIEKANKEGNR---TYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSS 118
           RL +FK+NL+++++ N   +R   T+ LG N F+DLTNEE+R   T + R    + R +S
Sbjct: 73  RLEVFKENLQFVDEHNAAADRGEHTFLLGMNRFADLTNEEYR---TRFLRDFSRLRRSAS 129

Query: 119 R--PSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLI 176
               S ++ +   D+P SIDWRE GAV  +K+QG CGSCWAFS VAAVEGI QI  G LI
Sbjct: 130 GKISSRYRLREGDDLPDSIDWRENGAVVPVKNQGGCGSCWAFSTVAAVEGINQIVTGDLI 189

Query: 177 ELSEQQLVDCSTDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAA 236
            LSEQQLVDC+T NHGC GG M+ AF++I+ N G+ +E  YPYR + G C N    A   
Sbjct: 190 SLSEQQLVDCTTANHGCRGGWMNPAFQFIVNNGGINSEETYPYRGQNGIC-NSTVNAPVV 248

Query: 237 TISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVG 296
           +I  YE++P  +EQ+L +AV+NQPVSV +DA+GR F  Y+SG+    C  + +H + VVG
Sbjct: 249 SIDSYENVPSHNEQSLQKAVANQPVSVTMDAAGRDFQLYRSGIFTGSCNISANHALTVVG 308

Query: 297 FGTAEEENGAKYWLIKNSWGETWGESGYIRILRDA----GLCGIATAASYPV 344
           +GT   EN   +W++KNSWG+ WGESGYIR  R+     G CGI   ASYPV
Sbjct: 309 YGT---ENDKDFWIVKNSWGKNWGESGYIRAERNIENPNGKCGITRFASYPV 357


>gi|146215982|gb|ABQ10193.1| actinidin Act2b [Actinidia eriantha]
          Length = 378

 Score =  293 bits (751), Expect = 6e-77,   Method: Compositional matrix adjust.
 Identities = 148/338 (43%), Positives = 204/338 (60%), Gaps = 15/338 (4%)

Query: 13  MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEY 72
           +F   +L+++ A  + +        ++  +E W+ + G++Y    EK MR  IFK+NL  
Sbjct: 13  LFFSTLLILSLALDIENSVQRTNDQVMAMYESWLVEQGKSYNSLDEKEMRFEIFKENLRI 72

Query: 73  IEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNR-PVPSVSRQSSRPSTFKYQNVTDV 131
           I+  N + NR+Y LG N F+DLT+EE+R+ Y G    P   VS +      +  +    +
Sbjct: 73  IDDHNADANRSYSLGLNRFADLTDEEYRSTYLGLKMGPKTDVSNE------YMPKVGEAL 126

Query: 132 PTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDC--STD 189
           P  +DWR  GAV  +K+QG C SCWAFSAV AVEGI +I  G LI LSEQ+LVDC  +  
Sbjct: 127 PDYVDWRTVGAVVGVKNQGLCSSCWAFSAVTAVEGINKIVTGNLISLSEQELVDCGRTQR 186

Query: 190 NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDE 249
             GC+ GLM  AF++II N G+ TE +YPY  ++G C+   +     TI  Y+++P  +E
Sbjct: 187 TKGCNRGLMTDAFQFIINNGGINTEDNYPYTAKDGQCNLSLKNQKYVTIDNYKNVPSNNE 246

Query: 250 QALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYW 309
            AL +AV+ QPVSV V++ G  F  Y SG+    CG   DHGV +VG+GT   E G  YW
Sbjct: 247 MALKKAVAYQPVSVGVESEGGKFKLYTSGIFTGFCGTAVDHGVTIVGYGT---ERGMDYW 303

Query: 310 LIKNSWGETWGESGYIRILRD---AGLCGIATAASYPV 344
           ++KNSWG  WGE+GYIRI R+   AG CGIA   SYPV
Sbjct: 304 IVKNSWGTNWGENGYIRIQRNIGGAGKCGIARMPSYPV 341


>gi|242049716|ref|XP_002462602.1| hypothetical protein SORBIDRAFT_02g028840 [Sorghum bicolor]
 gi|241925979|gb|EER99123.1| hypothetical protein SORBIDRAFT_02g028840 [Sorghum bicolor]
          Length = 384

 Score =  293 bits (750), Expect = 7e-77,   Method: Compositional matrix adjust.
 Identities = 153/358 (42%), Positives = 204/358 (56%), Gaps = 53/358 (14%)

Query: 38  IVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNE 97
           ++E+ EQWM +HGR Y D  EK  RL ++++N+  +E  N   N  Y+L  N+F+DLTNE
Sbjct: 28  MLERFEQWMGRHGRLYADAGEKQRRLEVYRRNVALVETFNSMSNGGYRLADNKFADLTNE 87

Query: 98  EFRALYTGYNRPVP--SVSRQSSRPSTF---------KYQNVTDVPTSIDWREKGAVTHI 146
           EFRA   G+ RP P    +  ++ P T          +Y +  ++P S+DWREKGAV  +
Sbjct: 88  EFRAKMLGFGRPPPHGRATGHTTTPGTVACIGSGLGRRYSD--ELPKSVDWREKGAVAPV 145

Query: 147 KDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTDNHGCSGGLMDKAFEYII 206
           K+QG+CGSCWAFSAVAA+EGI QI  GKL+ LSEQ+LVDC T   GC+GG M  AFE+++
Sbjct: 146 KNQGECGSCWAFSAVAAIEGINQIKNGKLVSLSEQELVDCDTKAIGCAGGYMSWAFEFVM 205

Query: 207 ENKGLATEADYPYRHE----------------------------EGTCDNQKEKAVAATI 238
            N GL TE +YPY+                               G C   K K  A +I
Sbjct: 206 NNSGLTTERNYPYQGTYAHGNRKTHALPFDCTKGSSTCDSRAGMNGACQTPKLKESAVSI 265

Query: 239 SKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFG 298
           S Y ++    E  LL+A + QPVSV VDA    +  Y  GV    C  + +HGV VVG+G
Sbjct: 266 SGYVNVTASSEPDLLRAAAAQPVSVAVDAGSFVWQLYGGGVFTGPCTADLNHGVTVVGYG 325

Query: 299 TAEEEN--------GAKYWLIKNSWGETWGESGYIRILRDA----GLCGIATAASYPV 344
             + +         G KYW++KNSWG  WG++GYI + R+A    GLCGIA   SYPV
Sbjct: 326 ETQRDTDGDGTGVPGQKYWIVKNSWGPEWGDAGYILMQREASVASGLCGIALLPSYPV 383


>gi|357166364|ref|XP_003580686.1| PREDICTED: oryzain alpha chain-like [Brachypodium distachyon]
          Length = 360

 Score =  293 bits (750), Expect = 8e-77,   Method: Compositional matrix adjust.
 Identities = 153/325 (47%), Positives = 202/325 (62%), Gaps = 15/325 (4%)

Query: 29  SGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKAN---KEGNRTYK 85
           SG+   E      + +W AQHG    +E E   R   F+ NL YI++ N     G  +++
Sbjct: 30  SGQIRSEEETRRMYAEWTAQHGSPITNEEEG--RYEAFRDNLRYIDEHNAAADAGIHSFR 87

Query: 86  LGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTH 145
           LG N F+ LTNEE+RA Y G      +V       + ++  +   +P S+DWREKGAV  
Sbjct: 88  LGLNRFAGLTNEEYRAAYLGLRLRSGAVGDLRKPSARYEAADGEALPESVDWREKGAVGK 147

Query: 146 IKDQGQ-CGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFE 203
           +KDQG+ CGS WAFSA+AAVE I QI  G+LI LSEQ+L+DC T  N GC GGLMD AFE
Sbjct: 148 VKDQGRSCGSAWAFSAIAAVESINQIVTGELISLSEQELMDCDTSYNAGCDGGLMDDAFE 207

Query: 204 YIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSV 263
           +II N G+ T+ DYPY+    +CD  K    A TI  YEDL + +E++L +AVSNQPVSV
Sbjct: 208 FIISNGGIDTDEDYPYKARNDSCDANKRNRKAVTIDDYEDL-RMNEKSLQKAVSNQPVSV 266

Query: 264 CVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESG 323
            ++A GR F  YKSG+    CG + DH   +VG+G+   ENG  YW++K S+G +WGESG
Sbjct: 267 AIEAGGRDFQLYKSGIFTGTCGTDLDHATTIVGYGS---ENGTDYWIVKESYGTSWGESG 323

Query: 324 YIRILRD----AGLCGIATAASYPV 344
           Y R+ R+    +G CGIA   SYPV
Sbjct: 324 YARMERNIKETSGKCGIAMLPSYPV 348


>gi|384253406|gb|EIE26881.1| hypothetical protein COCSUDRAFT_21961 [Coccomyxa subellipsoidea
           C-169]
          Length = 481

 Score =  293 bits (750), Expect = 8e-77,   Method: Compositional matrix adjust.
 Identities = 146/306 (47%), Positives = 198/306 (64%), Gaps = 11/306 (3%)

Query: 45  WMAQHGRTYKDELEKAMR-LNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALY 103
           W+    + YKD +E+  R  +++  NLE++   N E + T+KLG   F+DLT++E+R   
Sbjct: 51  WVEHLQKAYKDNVEEYERKFSVWLDNLEFVHSHN-EKDSTFKLGLTNFADLTHDEYRQHA 109

Query: 104 TGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAA 163
            GY   +      + + + F+Y +  + P SIDWR+KGAVT +K+Q QCGSCWAFS   +
Sbjct: 110 LGYRPELKGTGLGTGKSTGFQYADY-EAPPSIDWRKKGAVTDVKNQQQCGSCWAFSTTGS 168

Query: 164 VEGITQITRGKLIELSEQQLVDCS-TDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHE 222
           VEG   I  G+L+ LSEQ+LVDC  T +HGC GGLMD AF +II N G+ TE DY Y+ +
Sbjct: 169 VEGANAIYSGELVSLSEQELVDCDVTQDHGCHGGLMDFAFSFIIRNGGIDTEKDYKYKAQ 228

Query: 223 EGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNA 282
           +G C+  KEK    TI  YED+P  DE AL +A +NQP+SV ++A  R F  Y  GV +A
Sbjct: 229 DGVCNIAKEKRHVVTIDSYEDVPPNDESALKKAAANQPISVAIEADQREFQLYAGGVFDA 288

Query: 283 DCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILR----DAGLCGIAT 338
            CG   DHGV VVG+G+   +NG  YW++KNSWG+ WG+SGYIR+ R     AG CGIA 
Sbjct: 289 PCGTALDHGVLVVGYGS---DNGTDYWIVKNSWGDFWGDSGYIRLARGISNSAGQCGIAM 345

Query: 339 AASYPV 344
            ASYP+
Sbjct: 346 QASYPI 351


>gi|242094002|ref|XP_002437491.1| hypothetical protein SORBIDRAFT_10g028010 [Sorghum bicolor]
 gi|241915714|gb|EER88858.1| hypothetical protein SORBIDRAFT_10g028010 [Sorghum bicolor]
          Length = 397

 Score =  293 bits (750), Expect = 8e-77,   Method: Compositional matrix adjust.
 Identities = 150/328 (45%), Positives = 206/328 (62%), Gaps = 28/328 (8%)

Query: 42  HEQWMAQHGRTYKD----ELEKAMRLNIFKQNLEYIEKANKE---GNRTYKLGTNEFSDL 94
           +E W ++HGR   +      E  +RL +F+ NL YI+  N E   G  T++LG   F+DL
Sbjct: 54  YEAWKSKHGRPRGNCDMAGDEDRLRLEVFRDNLRYIDAHNAEADAGLHTFRLGLTPFADL 113

Query: 95  TNEEFRALYTGY---NRPVPSVSRQSSRPSTFKYQN----------VTDVPTSIDWREKG 141
           T EE+R    G+   +R  PS    +SR  +   ++            D+P +IDWR+ G
Sbjct: 114 TLEEYRGRALGFRARHRGGPSARAAASRVGSGGTRSHHRRPRPRPRCGDLPDAIDWRQLG 173

Query: 142 AVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTDNHGCSGGLMDKA 201
           AVT +K+Q QCG CWAFSAVAA+EGI  I  G L+ LSEQ+++DC T + GC+GG M+ A
Sbjct: 174 AVTDVKNQEQCGGCWAFSAVAAIEGINAIVTGNLVSLSEQEIIDCDTQDSGCNGGQMENA 233

Query: 202 FEYIIENKGLATEADYPYRHEEGTCD-NQKEKAVAATISKYEDLPKGDEQALLQAVSNQP 260
           F+++I+N G+ +EADYP+   +GTCD N+      A I  + ++   +E AL +AV+ QP
Sbjct: 234 FQFVIDNGGIDSEADYPFIATDGTCDANKANDEKVAAIDGFVEVASNNETALQEAVAIQP 293

Query: 261 VSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWG 320
           VSV +DA GRAF  Y SG+ N  CG N DHGV VVG+G+   ENG  YW++KNSW ++WG
Sbjct: 294 VSVAIDAGGRAFQHYSSGIFNGPCGTNLDHGVTVVGYGS---ENGKAYWIVKNSWSDSWG 350

Query: 321 ESGYIRILRD----AGLCGIATAASYPV 344
           E+GYIRI R+     G CGIA  ASYPV
Sbjct: 351 EAGYIRIRRNVFLPVGKCGIAMDASYPV 378


>gi|255544115|ref|XP_002513120.1| cysteine protease, putative [Ricinus communis]
 gi|223548131|gb|EEF49623.1| cysteine protease, putative [Ricinus communis]
          Length = 362

 Score =  293 bits (750), Expect = 9e-77,   Method: Compositional matrix adjust.
 Identities = 146/279 (52%), Positives = 193/279 (69%), Gaps = 9/279 (3%)

Query: 3   LKFEKSFIIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMR 62
           + F + +I   F +   +    SQ ++ R++ E S+ E+HEQWMA + R YKD  EK MR
Sbjct: 1   MVFTEPYICITFALFFSIGAWTSQCMA-RTLQEASMYERHEQWMASYARVYKDANEKQMR 59

Query: 63  LNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPST 122
             IFK+N++ I+  N E +++YKL  N+F+DLTNEEF++L  G+   + S     ++   
Sbjct: 60  YKIFKENVQRIDSFNSESDKSYKLAVNQFADLTNEEFKSLRNGFKGHMCS-----AQAGH 114

Query: 123 FKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQ 182
           F+Y+NVT VP SIDWR+KGAVT IK+QGQCGSCWAFSAVAAVEGIT+I  GKLI LSEQ+
Sbjct: 115 FRYENVTAVPASIDWRKKGAVTQIKEQGQCGSCWAFSAVAAVEGITEIKTGKLISLSEQE 174

Query: 183 LVDCST--DNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISK 240
           LVDC T  ++ GC GGLMD AF++ IE  GLA+EA YPY   + TC  ++E   +A I+ 
Sbjct: 175 LVDCDTNSEDQGCQGGLMDDAFKF-IEQHGLASEATYPYDAADSTCKTKEEAKPSAKITG 233

Query: 241 YEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGV 279
           YED+P  DE AL  AV+NQPVSV +DA G  F FY SG+
Sbjct: 234 YEDVPANDEAALKNAVANQPVSVAIDAGGFEFQFYSSGI 272


>gi|218202389|gb|EEC84816.1| hypothetical protein OsI_31898 [Oryza sativa Indica Group]
          Length = 350

 Score =  293 bits (749), Expect = 1e-76,   Method: Compositional matrix adjust.
 Identities = 150/324 (46%), Positives = 197/324 (60%), Gaps = 19/324 (5%)

Query: 38  IVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNE 97
           ++++ EQWM +HGR Y D  EK  R  ++++N+E +E  N   N  YKL  N+F+DLTNE
Sbjct: 28  MLDRFEQWMIRHGRAYTDSGEKQRRFEVYRRNVELVETFNSMSN-GYKLADNKFADLTNE 86

Query: 98  EFRALYTGYNRP---VPSVSRQSSRPSTFKYQNVTDV-PTSIDWREKGAVTHIKDQGQCG 153
           EFRA   G+ RP   +P +S   S       ++  D+ P S+DWR+KGAV  +K+QG CG
Sbjct: 87  EFRAKMLGF-RPHVTIPQISNTCSADIAMPGESSDDILPKSVDWRKKGAVVEVKNQGDCG 145

Query: 154 SCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTDNHGCSGGLMDKAFEYIIENKGLAT 213
           SCWAFSAVAA+EGI QI  G+L+ LSEQ+LVDC  +  GC GG M  AFE+++ N GL T
Sbjct: 146 SCWAFSAVAAIEGINQIKNGELVSLSEQELVDCDDEAVGCGGGYMSWAFEFVVGNHGLTT 205

Query: 214 EADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFH 273
           EA YPY    G C   K    A  I+ Y ++    E  L +A + QPVSV VD     F 
Sbjct: 206 EASYPYHAANGACQAAKLNQSAVAIAGYRNVTPSSEPDLARAAAAQPVSVAVDGGSFMFQ 265

Query: 274 FYKSGVLNADCGNNCDHGVAVVGFGTAEEEN--------GAKYWLIKNSWGETWGESGYI 325
            Y SGV    C  + +HGV VVG+G +E +         G KYW++KNSWG  WG++GYI
Sbjct: 266 LYGSGVYTGPCTADVNHGVTVVGYGESEPKTDGGGAAKGGEKYWIVKNSWGAEWGDAGYI 325

Query: 326 RILRD-----AGLCGIATAASYPV 344
            + RD     +GLCGIA   SYPV
Sbjct: 326 LMQRDVAGLASGLCGIALLPSYPV 349


>gi|296090463|emb|CBI40282.3| unnamed protein product [Vitis vinifera]
          Length = 386

 Score =  293 bits (749), Expect = 1e-76,   Method: Compositional matrix adjust.
 Identities = 147/309 (47%), Positives = 189/309 (61%), Gaps = 43/309 (13%)

Query: 42  HEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRA 101
           +E W+ +HG++Y    E+  R  IFK NL +IE+ N   NRTYK+G              
Sbjct: 4   YEAWLVKHGKSYNALGERERRFEIFKDNLRFIEEHNAV-NRTYKVG-------------- 48

Query: 102 LYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAV 161
                                + ++   D+P S+DWREKGAV  +KDQG CGSCWAFS +
Sbjct: 49  -------------------DRYSFRAGEDLPESVDWREKGAVVPVKDQGNCGSCWAFSTI 89

Query: 162 AAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGLATEADYPYR 220
           AAVEGI QI  G LI LSEQ+LVDC    N GC+GGLMD AFE+II N G+ +E DYPYR
Sbjct: 90  AAVEGINQIATGDLISLSEQELVDCDKSYNQGCNGGLMDYAFEFIINNGGIDSEEDYPYR 149

Query: 221 HEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVL 280
             + TCD  ++ A   +I  YED+P+ DE++L +AV+NQPVSV ++A GRAF  Y+SGV 
Sbjct: 150 AADTTCDPNRKNARVVSIDGYEDVPQNDERSLKKAVANQPVSVAIEAGGRAFQLYQSGVF 209

Query: 281 NADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILR-----DAGLCG 335
              CG   DHGV  VG+GT   EN   YW+++NSWG  WGESGYI++ R     + G CG
Sbjct: 210 TGQCGTQLDHGVVAVGYGT---ENSVDYWIVRNSWGPNWGESGYIKLERNLAGTETGKCG 266

Query: 336 IATAASYPV 344
           IA   SYP+
Sbjct: 267 IAIEPSYPI 275


>gi|356517368|ref|XP_003527359.1| PREDICTED: LOW QUALITY PROTEIN: vignain-like [Glycine max]
          Length = 332

 Score =  293 bits (749), Expect = 1e-76,   Method: Compositional matrix adjust.
 Identities = 161/341 (47%), Positives = 213/341 (62%), Gaps = 29/341 (8%)

Query: 14  FVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYI 73
           F +++ +   A QV   R++ + S+ E+HEQ M ++G+ YKD  ++      FK+N+ YI
Sbjct: 12  FAMLLCMAFLAFQVTC-RTLQDASMXERHEQRMTRYGKVYKDPPKRX-----FKENVNYI 65

Query: 74  EKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPT 133
           E  N   N+ YK G N+F+              NR    +     R +TFK++NVT  P+
Sbjct: 66  EACNNAANKPYKRGINQFAPR------------NRFKGHMCSSIIRITTFKFENVTATPS 113

Query: 134 SIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD--NH 191
           ++D R+KGAVT IKDQGQCG CWAFSAVAA EGI  ++ GKLI LSEQ+LVDC T   + 
Sbjct: 114 TVDCRQKGAVTPIKDQGQCGCCWAFSAVAATEGIHALSAGKLISLSEQELVDCDTKGVDX 173

Query: 192 GCSGGLMDKAFEYIIENKGLATEADYP-YRHEEGTCDNQKEKAVAAT-ISKYEDLPKGDE 249
           GC GGLMD AF++II+N GL   +  P Y   +G C+  +    AAT I+ YED+P  +E
Sbjct: 174 GCEGGLMDDAFKFIIQNHGLKHXSQLPLYMGVDGKCNANEAAKNAATIITGYEDVPANNE 233

Query: 250 QALLQ-AVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKY 308
           +A LQ AV+N PVS  +DASG  F FYKSGV    CG   DHGV  VG+G +++  G +Y
Sbjct: 234 KAHLQKAVANNPVSEAIDASGSDFQFYKSGVFTGSCGTELDHGVTAVGYGVSDD--GTEY 291

Query: 309 WLIKNSWGETWGESGYIRILR----DAGLCGIATAASYPVA 345
           WL+KNSWG  WGE GYIR+ R    +  LCGIA  ASYP A
Sbjct: 292 WLVKNSWGTEWGEEGYIRMQRGVDSEEALCGIAVQASYPSA 332


>gi|115479933|ref|NP_001063560.1| Os09g0497500 [Oryza sativa Japonica Group]
 gi|113631793|dbj|BAF25474.1| Os09g0497500 [Oryza sativa Japonica Group]
 gi|215704298|dbj|BAG93138.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 349

 Score =  292 bits (748), Expect = 1e-76,   Method: Compositional matrix adjust.
 Identities = 150/324 (46%), Positives = 197/324 (60%), Gaps = 19/324 (5%)

Query: 38  IVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNE 97
           ++++ EQWM +HGR Y D  EK  R  ++++N+E +E  N   N  YKL  N+F+DLTNE
Sbjct: 27  MLDRFEQWMIRHGRAYTDAGEKQRRFEVYRRNVELVETFNSMSN-GYKLADNKFADLTNE 85

Query: 98  EFRALYTGYNRP---VPSVSRQSSRPSTFKYQNVTDV-PTSIDWREKGAVTHIKDQGQCG 153
           EFRA   G+ RP   +P +S   S       ++  D+ P S+DWR+KGAV  +K+QG CG
Sbjct: 86  EFRAKMLGF-RPHVTIPQISNTCSADIAMPGESSDDILPKSVDWRKKGAVVEVKNQGDCG 144

Query: 154 SCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTDNHGCSGGLMDKAFEYIIENKGLAT 213
           SCWAFSAVAA+EGI QI  G+L+ LSEQ+LVDC  +  GC GG M  AFE+++ N GL T
Sbjct: 145 SCWAFSAVAAIEGINQIKNGELVSLSEQELVDCDDEAVGCGGGYMSWAFEFVVGNHGLTT 204

Query: 214 EADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFH 273
           EA YPY    G C   K    A  I+ Y ++    E  L +A + QPVSV VD     F 
Sbjct: 205 EASYPYHAANGACQAAKLNQSAVAIAGYRNVTPSSEPDLARAAAAQPVSVAVDGGSFMFQ 264

Query: 274 FYKSGVLNADCGNNCDHGVAVVGFGTAEEEN--------GAKYWLIKNSWGETWGESGYI 325
            Y SGV    C  + +HGV VVG+G +E +         G KYW++KNSWG  WG++GYI
Sbjct: 265 LYGSGVYTGPCTADVNHGVTVVGYGESEPKTDGGGAAKGGEKYWIVKNSWGAEWGDAGYI 324

Query: 326 RILRD-----AGLCGIATAASYPV 344
            + RD     +GLCGIA   SYPV
Sbjct: 325 LMQRDVAGLASGLCGIALLPSYPV 348


>gi|3377950|emb|CAA08861.1| cysteine proteinase precursor, AN11 [Ananas comosus]
          Length = 357

 Score =  292 bits (748), Expect = 1e-76,   Method: Compositional matrix adjust.
 Identities = 143/337 (42%), Positives = 218/337 (64%), Gaps = 14/337 (4%)

Query: 13  MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEY 72
           +F+ + L +  AS   + R      ++++ E+WMA++GR YKD  EK  R  IFK N+ +
Sbjct: 8   VFLFLFLCVMWASPSAASRDEPSDPMMKRFEEWMAEYGRVYKDNDEKMRRFQIFKNNVNH 67

Query: 73  IEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVP 132
           IE  N     +Y LG N+F+D+TN EF A YTG + P+ ++ R+     +F   +++ VP
Sbjct: 68  IETFNSRNGNSYTLGINQFTDMTNNEFVAQYTGVSLPL-NIEREPV--VSFDDVDISAVP 124

Query: 133 TSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTDNHG 192
            SIDWR  GAVT +K+   CGSCWAF+A+A VE I +I RG LI LSEQQ++DC+  ++G
Sbjct: 125 QSIDWRNYGAVTSVKNHIPCGSCWAFAAIATVESIYKIKRGYLISLSEQQVLDCAV-SYG 183

Query: 193 CSGGLMDKAFEYIIENKGLATEADYPYR--HEEGTCDNQKEKAVAATISKYEDLPKGDEQ 250
           C GG ++KA+++II NKG+A+ A YPY+    +GTC        +A I+ Y  +   +E+
Sbjct: 184 CDGGWVNKAYDFIISNKGVASAAIYPYKASQGQGTCRINGVPN-SAYITGYTRVQSNNER 242

Query: 251 ALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWL 310
           +++ AVSNQP++  ++ASG  F  YK GV +  CG + +H + ++G+G  ++ +G K+W+
Sbjct: 243 SMMYAVSNQPIAASIEASGD-FQHYKRGVFSGPCGTSLNHAITIIGYG--QDSSGKKFWI 299

Query: 311 IKNSWGETWGESGYIRILRD----AGLCGIATAASYP 343
           ++NSWG +WGE GYIR+ RD    +GLCGIA    YP
Sbjct: 300 VRNSWGASWGERGYIRMARDVSSSSGLCGIAIRPLYP 336


>gi|297799636|ref|XP_002867702.1| hypothetical protein ARALYDRAFT_329301 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297313538|gb|EFH43961.1| hypothetical protein ARALYDRAFT_329301 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 357

 Score =  292 bits (748), Expect = 1e-76,   Method: Compositional matrix adjust.
 Identities = 147/310 (47%), Positives = 206/310 (66%), Gaps = 16/310 (5%)

Query: 43  EQWMAQHGRTYKDEL-EKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRA 101
           + WM++HG+TY + L EK  R   FK NL +I++ N + N +Y+LG   F+DLT +E+R 
Sbjct: 49  QMWMSKHGKTYTNALGEKERRFQNFKDNLRFIDQHNAK-NLSYQLGLTRFADLTVQEYRD 107

Query: 102 LYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAV 161
           L+ G  +P     R S R   +   +   +P S+DWR +GAV+ IKDQG C SCWAFS V
Sbjct: 108 LFPGSPKPKQRNLRISRR---YVPLDGDQLPESVDWRNEGAVSAIKDQGTCNSCWAFSTV 164

Query: 162 AAVEGITQITRGKLIELSEQQLVDCSTDNHGCSG-GLMDKAFEYIIENKGLATEADYPYR 220
           AAVEGI +I  G+L+ LSEQ+LVDC+  N+GC G G MD AF+++I N GL ++ DYPY+
Sbjct: 165 AAVEGINKIVTGELVSLSEQELVDCNLVNNGCYGSGTMDAAFQFLINNGGLDSDTDYPYQ 224

Query: 221 HEEGTCDNQKEKAVAA--TISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSG 278
             +G C N+KE       TI  YED+P  DE +L +AV++QPVSV VD   + F  Y+SG
Sbjct: 225 GSQGYC-NRKESTSNKIITIDSYEDVPANDEISLQKAVAHQPVSVGVDKKSQEFMLYRSG 283

Query: 279 VLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD----AGLC 334
           + N  CG + DH + +VG+G+   ENG  YW+++NSWG TWG++GY ++ R+    +G+C
Sbjct: 284 IYNGPCGTDLDHALVIVGYGS---ENGQDYWIVRNSWGTTWGDAGYAKMARNFEYPSGVC 340

Query: 335 GIATAASYPV 344
           GIA  ASYPV
Sbjct: 341 GIAMLASYPV 350


>gi|146215992|gb|ABQ10198.1| actinidin Act4b [Actinidia eriantha]
          Length = 379

 Score =  292 bits (748), Expect = 1e-76,   Method: Compositional matrix adjust.
 Identities = 141/312 (45%), Positives = 206/312 (66%), Gaps = 13/312 (4%)

Query: 38  IVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNE 97
           ++   E W+ ++G++Y    EK  R  IFK NL ++++ N + NR+YK+G N+FSDLT E
Sbjct: 44  VMAMFESWLVEYGKSYNALGEKERRFEIFKDNLRFVDEHNADVNRSYKVGLNQFSDLTLE 103

Query: 98  EFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWA 157
           E+ ++Y G    +    R ++    ++ +    +P SIDWR+KGAV  +K+QG CGSCW 
Sbjct: 104 EYSSIYLGTKFDM----RMTNVSDRYEPRVGDQLPNSIDWRKKGAVLGVKNQGNCGSCWT 159

Query: 158 FSAVAAVEGITQITRGKLIELSEQQLVDC--STDNHGCSGGLMDKAFEYIIENKGLATEA 215
           F+ +AAVE I QI  G LI LSEQQ+VDC   + N+GC GG    A+++II+N G+ TEA
Sbjct: 160 FAPIAAVEAINQIVTGNLISLSEQQIVDCQRKSPNNGCKGGSRAGAYQFIIDNGGINTEA 219

Query: 216 DYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFY 275
           +YPY+ ++G CD QK +    TI +YE++P+ +E+AL +AVSNQ VSV + ++   F  Y
Sbjct: 220 NYPYKAQDGECDEQKNQKY-VTIDRYENVPRKNEKALQKAVSNQLVSVGIASNSSEFKAY 278

Query: 276 KSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILR---DAG 332
           KSG+    CG   DH V +VG+GT   E G  YW+++NSWG  WGE+GY+R+ R   +AG
Sbjct: 279 KSGIFTGPCGAKIDHAVTIVGYGT---EGGMDYWIVRNSWGSNWGENGYVRMQRNVGNAG 335

Query: 333 LCGIATAASYPV 344
            C IAT+ +YPV
Sbjct: 336 TCFIATSPNYPV 347


>gi|1174171|gb|AAB41816.1| NTH1 [Pisum sativum]
          Length = 367

 Score =  292 bits (747), Expect = 2e-76,   Method: Compositional matrix adjust.
 Identities = 146/345 (42%), Positives = 215/345 (62%), Gaps = 14/345 (4%)

Query: 8   SFIIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFK 67
           S +  + +  ++ ++ +  + SGRS  E  ++  +E+W+ +H + Y    EK  R  IFK
Sbjct: 3   SILYSLILFGLITLSLSLDMSSGRSNKE--VMTMYEKWLVKHQKVYYGLGEKNQRFQIFK 60

Query: 68  QNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQN 127
            NL +I++ N   N +Y++G NEFSD+TN+E+R  Y          ++ +S    +K  +
Sbjct: 61  DNLIFIDEHNAP-NHSYRVGLNEFSDITNKEYRDTYLSRWSNNNIKNKITSVRYAYKAGH 119

Query: 128 VTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCS 187
              +P S+DWR  GA+T IK+QG CG+CWAFSAVAAVE I +I  G L+ LSEQ+LVDC 
Sbjct: 120 NNKLPVSVDWR--GALTPIKNQGSCGACWAFSAVAAVEAINKIVTGSLVSLSEQELVDCD 177

Query: 188 -TDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPK 246
            T N GC+GG    A+ +I+EN GL ++ DYPY   + TC+  K+     +I+ Y+++ +
Sbjct: 178 RTKNKGCNGGNQVNAYRFIVENGGLDSQIDYPYLGRQSTCNQAKKNTKVVSINGYKNVQR 237

Query: 247 GDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGA 306
             E AL++AV+NQPVSV ++A G+ F  Y+SGV    CG + DH V VVG+G+   ENG 
Sbjct: 238 NSESALMEAVANQPVSVGIEAYGKDFQLYQSGVFTGSCGTSLDHAVVVVGYGS---ENGK 294

Query: 307 KYWLIKNSWGETWGESGYIRILR-----DAGLCGIATAASYPVAI 346
            YWL+KNSWG  WGE GY++I R     + G CGIA  A+YP  +
Sbjct: 295 DYWLVKNSWGTNWGERGYLKIERNLKNTNTGKCGIAMDATYPTKL 339


>gi|297843430|ref|XP_002889596.1| hypothetical protein ARALYDRAFT_887827 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297335438|gb|EFH65855.1| hypothetical protein ARALYDRAFT_887827 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 343

 Score =  291 bits (746), Expect = 2e-76,   Method: Compositional matrix adjust.
 Identities = 154/340 (45%), Positives = 211/340 (62%), Gaps = 18/340 (5%)

Query: 14  FVIIILVITCASQVVS-GRSMHEP--SIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNL 70
            V++I  +  AS++ S   S+++P  ++ ++ E+W+  H + Y    E  +R  I++ N+
Sbjct: 12  LVVLICFVLIASKLCSVNSSVYDPHKTLKQRFEKWLKTHSKLYGGRDEWMLRFGIYQSNV 71

Query: 71  EYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTD 130
           + I+  N   +  +KL  N F+D+TN EF+A + G N     + ++  RP      NV  
Sbjct: 72  QLIDYINSL-HLPFKLTDNRFADMTNSEFKAHFLGLNTSSLRLHKKQ-RPVCDPAGNV-- 127

Query: 131 VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDC--ST 188
            P ++DWR +GAVT I++QG+CG CWAFSAVAA+EGI +I  G L+ LSEQQL+DC   T
Sbjct: 128 -PDAVDWRTQGAVTPIRNQGKCGGCWAFSAVAAIEGINKIKTGNLVSLSEQQLIDCDVGT 186

Query: 189 DNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGD 248
            N GCSGGLM+ AFE+I  N GL TE DYPY   EGTCD +K K    TI  Y+ + + +
Sbjct: 187 YNKGCSGGLMETAFEFIKSNGGLTTETDYPYTGIEGTCDQEKAKNKVVTIQGYQKVAQ-N 245

Query: 249 EQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKY 308
           E +L  A + QPVSV +DA G  F  Y SGV  + CG N +HGV VVG+G    E   KY
Sbjct: 246 EASLQIAAAQQPVSVGIDAGGFIFQLYSSGVFTSYCGTNLNHGVTVVGYGV---EGDQKY 302

Query: 309 WLIKNSWGETWGESGYIRILR----DAGLCGIATAASYPV 344
           W++KNSWG  WGE GYIR+ R    D G CGIA  ASYP+
Sbjct: 303 WIVKNSWGTGWGEEGYIRMERGISEDTGKCGIAMLASYPL 342


>gi|357507617|ref|XP_003624097.1| Cysteine protease [Medicago truncatula]
 gi|355499112|gb|AES80315.1| Cysteine protease [Medicago truncatula]
          Length = 340

 Score =  291 bits (745), Expect = 3e-76,   Method: Compositional matrix adjust.
 Identities = 148/340 (43%), Positives = 215/340 (63%), Gaps = 21/340 (6%)

Query: 10  IIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQN 69
           +I + ++I ++        + +S+   ++ E+++ W  ++   YKD+ E+   + IFK N
Sbjct: 10  LINILIVIWVMFPSNQNQENDQSL---TLSERYKHWKIKYRVIYKDDAEEEKHIQIFKHN 66

Query: 70  LEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
           + YI+  N  GN++YKL  N F+DL  E       G+ +       + +  S FKY+N+T
Sbjct: 67  VAYIDSFNAAGNKSYKLTINRFADLPTEPSD---DGFKKR----KLEPTTSSLFKYKNIT 119

Query: 130 DVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD 189
           D+P ++DWR++GAVT +K+Q +CGSCWAFSAV A+EGI QIT G L+ LSEQ+LVD    
Sbjct: 120 DIPAAVDWRKRGAVTPVKNQRECGSCWAFSAVGALEGIQQITSGNLVSLSEQELVDRVRS 179

Query: 190 N--HGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKG 247
           N  +GC+GG +  AFE+++EN G+ATEA YPYR  +G  +N K+ +    I  YE +P+ 
Sbjct: 180 NWTNGCNGGYLIDAFEFVLENGGIATEASYPYRGVKG--NNSKKVSRQVQIKSYEQVPRN 237

Query: 248 DEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAK 307
            E +LL+ V+NQPVSV +D SG    FY SG+   +CG   +H V +VG+GT+ +  G K
Sbjct: 238 SEDSLLKVVANQPVSVGIDISG-MIRFYSSGIFTGECGTKPNHAVIIVGYGTSND--GTK 294

Query: 308 YWLIKNSWGETWGESGYIRILRDA----GLCGIATAASYP 343
           YWL+KNSWG  WGE  YIR+ RD     GLCGI   ASYP
Sbjct: 295 YWLVKNSWGIRWGEKRYIRMKRDIDAKEGLCGIPMDASYP 334


>gi|2160175|gb|AAB60738.1| Strong similarity to Dianthus cysteine proteinase (gb|U17135)
           [Arabidopsis thaliana]
          Length = 416

 Score =  291 bits (745), Expect = 3e-76,   Method: Compositional matrix adjust.
 Identities = 145/319 (45%), Positives = 197/319 (61%), Gaps = 18/319 (5%)

Query: 38  IVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNE 97
           I E  + W  +HG+TY  E E+  R+ IFK N +++ + N   N TY L  N F+DLT+ 
Sbjct: 26  ISELFDDWCQKHGKTYGSEEERQQRIQIFKDNHDFVTQHNLITNATYSLSLNAFADLTHH 85

Query: 98  EFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWA 157
           EF+A   G +   PSV   S   S         VP S+DWR+KGAVT++KDQG CG+CW+
Sbjct: 86  EFKASRLGLSVSAPSVIMASKGQS---LGGSVKVPDSVDWRKKGAVTNVKDQGSCGACWS 142

Query: 158 FSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGLATEAD 216
           FSA  A+EGI QI  G LI LSEQ+L+DC    N GC+GGLMD AFE++I+N G+ TE D
Sbjct: 143 FSATGAMEGINQIVTGDLISLSEQELIDCDKSYNAGCNGGLMDYAFEFVIKNHGIDTEKD 202

Query: 217 YPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYK 276
           YPY+  +GTC   K K    TI  Y  +   DE+AL++AV+ QPVSV +  S RAF  Y 
Sbjct: 203 YPYQERDGTCKKDKLKQKVVTIDSYAGVKSNDEKALMEAVAAQPVSVGICGSERAFQLYS 262

Query: 277 S-------GVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILR 329
           S       G+ +  C  + DH V +VG+G+   +NG  YW++KNSWG++WG  G++ + R
Sbjct: 263 SKFYLLMQGIFSGPCSTSLDHAVLIVGYGS---QNGVDYWIVKNSWGKSWGMDGFMHMQR 319

Query: 330 DA----GLCGIATAASYPV 344
           +     G+CGI   ASYP+
Sbjct: 320 NTENSDGVCGINMLASYPI 338


>gi|18390634|ref|NP_563764.1| cysteine proteinase-like protein [Arabidopsis thaliana]
 gi|8844131|gb|AAF80223.1|AC025290_12 Contains similarity to a cysteine endopeptidase 1 from Phaseolus
           vulgaris gb|U52970 and is a member of the papain
           cysteine protease family PF|00112 [Arabidopsis thaliana]
 gi|332189848|gb|AEE27969.1| cysteine proteinase-like protein [Arabidopsis thaliana]
          Length = 343

 Score =  291 bits (744), Expect = 4e-76,   Method: Compositional matrix adjust.
 Identities = 153/342 (44%), Positives = 209/342 (61%), Gaps = 17/342 (4%)

Query: 11  IPMFVIIILVITCASQVVSGRSMHEP--SIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQ 68
           + + V+I  V+  +       S+++P  ++ ++ E+W+  H + Y    E  +R  I++ 
Sbjct: 10  LTLAVLICFVLIASKLCSVDSSVYDPHKTLKQRFEKWLKTHSKLYGGRDEWMLRFGIYQS 69

Query: 69  NLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNV 128
           N++ I+  N   +  +KL  N F+D+TN EF+A + G N     + ++  RP      NV
Sbjct: 70  NVQLIDYINSL-HLPFKLTDNRFADMTNSEFKAHFLGLNTSSLRLHKKQ-RPVCDPAGNV 127

Query: 129 TDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDC-- 186
              P ++DWR +GAVT I++QG+CG CWAFSAVAA+EGI +I  G L+ LSEQQL+DC  
Sbjct: 128 ---PDAVDWRTQGAVTPIRNQGKCGGCWAFSAVAAIEGINKIKTGNLVSLSEQQLIDCDV 184

Query: 187 STDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPK 246
            T N GCSGGLM+ AFE+I  N GLATE DYPY   EGTCD +K K    TI  Y+ + +
Sbjct: 185 GTYNKGCSGGLMETAFEFIKTNGGLATETDYPYTGIEGTCDQEKSKNKVVTIQGYQKVAQ 244

Query: 247 GDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGA 306
            +E +L  A + QPVSV +DA G  F  Y SGV    CG N +HGV VVG+G    E   
Sbjct: 245 -NEASLQIAAAQQPVSVGIDAGGFIFQLYSSGVFTNYCGTNLNHGVTVVGYGV---EGDQ 300

Query: 307 KYWLIKNSWGETWGESGYIRILR----DAGLCGIATAASYPV 344
           KYW++KNSWG  WGE GYIR+ R    D G CGIA  ASYP+
Sbjct: 301 KYWIVKNSWGTGWGEEGYIRMERGVSEDTGKCGIAMMASYPL 342


>gi|558563|emb|CAA57538.1| cysteine proteinase [Cicer arietinum]
          Length = 325

 Score =  290 bits (743), Expect = 5e-76,   Method: Compositional matrix adjust.
 Identities = 148/309 (47%), Positives = 190/309 (61%), Gaps = 11/309 (3%)

Query: 42  HEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRA 101
           +E+W+ +H + Y    EK  R  IFK NL +I++ N + N +YK+G N+F+D+ NEE+R 
Sbjct: 4   YEKWLVKHQKMYNGLGEKDTRFQIFKDNLRFIDEHNAQ-NYSYKVGLNKFADINNEEYRD 62

Query: 102 LYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAV 161
           +Y G          ++         N   V   +DWR KGAVTHIKDQG CGSCWAFS +
Sbjct: 63  MYLGTKSDAKRRVMKTKITGHRITYNSVIVTVKVDWRLKGAVTHIKDQGSCGSCWAFSTI 122

Query: 162 AAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGLATEADYPYR 220
           A VE I +I  GK + LSEQ+LVDC    N GC+GGLMD AFE+II N G+ T+ DYPY 
Sbjct: 123 ATVEAINKIVTGKFVSLSEQELVDCDRAFNEGCNGGLMDYAFEFIIRNGGIDTDQDYPYN 182

Query: 221 HEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVL 280
             E  CD  K+ A   +I  YED+P     AL +AV++QPVSV +   GRA   Y+SGV 
Sbjct: 183 GFERKCDPTKKNAKVVSIDGYEDVPS-YMNALKKAVAHQPVSVAIAGLGRALQLYQSGVF 241

Query: 281 NADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRIL-RDAG----LCG 335
              CG + DHGV VVG+G+   ENG  YWL++NSWG  WGE GY +I  R+       CG
Sbjct: 242 TGKCGTDLDHGVVVVGYGS---ENGVDYWLVRNSWGTNWGEDGYFKIASRNVKSLYRKCG 298

Query: 336 IATAASYPV 344
           IA  ASYPV
Sbjct: 299 IAMEASYPV 307


>gi|341850671|gb|AEK97329.1| chromoplast senescence-associated protein 12 [Brassica rapa var.
           parachinensis]
          Length = 260

 Score =  290 bits (743), Expect = 5e-76,   Method: Compositional matrix adjust.
 Identities = 138/260 (53%), Positives = 187/260 (71%), Gaps = 8/260 (3%)

Query: 90  EFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTD--VPTSIDWREKGAVTHIK 147
           +F+++TN+EFR++YTGY       S+  ++ ++F+YQNV+   +P ++DWR+KGAVT IK
Sbjct: 1   QFAEITNDEFRSMYTGYKGDSVLSSQSQTKSTSFRYQNVSSGALPIAVDWRKKGAVTPIK 60

Query: 148 DQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTDNHGCSGGLMDKAFEYIIE 207
           +QG CG CWAFSAVAA+EG TQI +GKLI LSEQQLVDC T++ GCSGGL+D AFE+I+ 
Sbjct: 61  NQGSCGCCWAFSAVAAIEGATQIKKGKLISLSEQQLVDCDTNDFGCSGGLIDTAFEHIMA 120

Query: 208 NKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDA 267
             GL TE++YPY+ E+ TC  +     AA+I+ YED+P  DE AL++AV++QPVSV ++ 
Sbjct: 121 TGGLTTESNYPYKGEDATCKIKSTXPSAASITGYEDVPVNDENALMKAVAHQPVSVGIEG 180

Query: 268 SGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRI 327
            G  F FY SGV   +C    DH V  VG+  ++   G+KYW+IKNSWG  WGE GY+RI
Sbjct: 181 GGFDFQFYSSGVFTGECTTYLDHAVTAVGY--SQSSAGSKYWIIKNSWGTKWGEGGYMRI 238

Query: 328 LRDA----GLCGIATAASYP 343
            +D     GLCG+A  ASYP
Sbjct: 239 KKDIKDKEGLCGLAMKASYP 258


>gi|167521499|ref|XP_001745088.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163776702|gb|EDQ90321.1| predicted protein [Monosiga brevicollis MX1]
          Length = 294

 Score =  290 bits (743), Expect = 6e-76,   Method: Compositional matrix adjust.
 Identities = 152/308 (49%), Positives = 194/308 (62%), Gaps = 27/308 (8%)

Query: 47  AQHGRTYKDELEKAMRLNIFKQNLEYIEKANKE---GNRTYKLGTNEFSDLTNEEFRALY 103
           + + ++Y+ E  +A RL  F+ NLE+I K N E   G  +Y +G NEF+DLT +EF ALY
Sbjct: 3   SDYSKSYESEAVEAKRLAAFEANLEFINKHNAEHAQGLHSYTVGVNEFADLTIDEFMALY 62

Query: 104 --TGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAV 161
             + +NR +P        P+T +         S+DWR KGAVT IK+QGQCGSCW+FS  
Sbjct: 63  VPSKFNRTMPY--NTVYLPATSE--------DSVDWRTKGAVTPIKNQGQCGSCWSFSTT 112

Query: 162 AAVEGITQITRGKLIELSEQQLVDCSTD--NHGCSGGLMDKAFEYIIENKGLATEADYPY 219
            + EG   I  G L+ LSEQQLVDCS    N GC+GGLMD AF+YII NKGL TE DYPY
Sbjct: 113 GSTEGAHAIATGNLVSLSEQQLVDCSGSFGNQGCNGGLMDDAFKYIISNKGLDTEEDYPY 172

Query: 220 RHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGV 279
             ++GTC+ +KE   AATIS Y D+PK +E  L  AV+  PVSV ++A    F  YKSGV
Sbjct: 173 TAQDGTCNKEKEAKHAATISSYSDVPKNNEDQLAAAVAKGPVSVAIEADQSGFQLYKSGV 232

Query: 280 LNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD---AGLCGI 336
            + +CG N DHGV VVG+          YW++KNSWG TWG  GYI + R    +G+CGI
Sbjct: 233 FDGNCGTNLDHGVLVVGYTD-------DYWIVKNSWGTTWGVEGYINMKRGVSASGICGI 285

Query: 337 ATAASYPV 344
           A   SYP+
Sbjct: 286 AMQPSYPI 293


>gi|302781881|ref|XP_002972714.1| hypothetical protein SELMODRAFT_98707 [Selaginella moellendorffii]
 gi|300159315|gb|EFJ25935.1| hypothetical protein SELMODRAFT_98707 [Selaginella moellendorffii]
          Length = 446

 Score =  289 bits (740), Expect = 1e-75,   Method: Compositional matrix adjust.
 Identities = 142/293 (48%), Positives = 189/293 (64%), Gaps = 16/293 (5%)

Query: 62  RLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGY-----NRPVPSVSRQ 116
           R   FK+N  YIE+ N+ G  +Y+LG N+FSDLT+EEFR  + G      + PV  + R 
Sbjct: 34  RFETFKENFRYIEEHNRAGKHSYRLGLNQFSDLTSEEFRQRFLGLRPDLIDSPVLKMPRD 93

Query: 117 SSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLI 176
           S     F  QNV D+P S+DWR+ GAVT  KDQG CG CWAF+   A+EGI QI  G+L+
Sbjct: 94  SDIEEGF--QNV-DLPASVDWRKHGAVTAPKDQGSCGGCWAFATTGAIEGINQIVTGQLM 150

Query: 177 ELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVA 235
            LSEQ+L+DC    + GC GGLM+ A+++I+EN GL TE DYPY   E  C+ +K  +  
Sbjct: 151 SLSEQELIDCDKKADKGCDGGLMENAYQFIVENGGLDTETDYPYHASESHCNMKKLNSRV 210

Query: 236 ATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVV 295
             I  YE +P GDEQALL+AV+ QPVSV ++ + + F  Y SGV    CG   +HGV +V
Sbjct: 211 VAIDGYEAIPDGDEQALLRAVAKQPVSVAIEGASKDFQHYASGVFTGHCGEEINHGVLIV 270

Query: 296 GFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD----AGLCGIATAASYPV 344
           G+GT   E+G  YW++KNSW  TWG+ G++++ R+     GLC I T ASYPV
Sbjct: 271 GYGT---EDGLDYWIVKNSWAATWGDGGFVKMQRNTGKRGGLCSINTLASYPV 320


>gi|386648114|gb|AFJ15104.1| mexicain-like cystein protease, partial [Jacaratia mexicana]
          Length = 323

 Score =  289 bits (739), Expect = 1e-75,   Method: Compositional matrix adjust.
 Identities = 144/311 (46%), Positives = 196/311 (63%), Gaps = 14/311 (4%)

Query: 38  IVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNE 97
           +V   E W  ++ + YK+  EK  R  IFK NL YI++ NK+ N +Y LG NEF+DLT++
Sbjct: 18  LVRLFESWTLENDKIYKNIDEKIYRFEIFKDNLMYIDETNKK-NSSYWLGLNEFADLTHD 76

Query: 98  EFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWA 157
           EF+A Y G      ++  QS     F Y++V D P SIDWR+KGAVT +K+Q  CGSCWA
Sbjct: 77  EFKAKYVGSLGEDSTIIEQSD-DEEFPYKHVVDYPESIDWRQKGAVTPVKNQNPCGSCWA 135

Query: 158 FSAVAAVEGITQITRGKLIELSEQQLVDCSTDNHGCSGGLMDKAFEYIIENKGLATEADY 217
           FS VA VEGI +I  GKLI LSEQ+L+DC   +HGC GG    + +Y+ +N G+ TE +Y
Sbjct: 136 FSTVATVEGINKIVTGKLISLSEQELLDCDRRSHGCKGGYQTTSLQYVADN-GVHTEKEY 194

Query: 218 PYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKS 277
           PY  ++G C  + +K     I+ Y+ +P  +E +L+QA++NQPVSV V++ GRAF FYK 
Sbjct: 195 PYEKKQGKCRAKDKKGSKVKITGYKRVPANNEVSLIQAIANQPVSVVVESKGRAFQFYKG 254

Query: 278 GVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRDA----GL 333
           G+    CG   DH V  VG+       G  Y LIKNSWG  WGE GYIRI R +    G 
Sbjct: 255 GIFEGPCGTKVDHAVTAVGY-------GKNYILIKNSWGPKWGEKGYIRIKRASGKSKGT 307

Query: 334 CGIATAASYPV 344
           CG+ +++ +P 
Sbjct: 308 CGVYSSSYFPT 318


>gi|356543010|ref|XP_003539956.1| PREDICTED: LOW QUALITY PROTEIN: vignain-like [Glycine max]
          Length = 306

 Score =  289 bits (739), Expect = 1e-75,   Method: Compositional matrix adjust.
 Identities = 154/313 (49%), Positives = 195/313 (62%), Gaps = 21/313 (6%)

Query: 41  KHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFR 100
           + E+W+ Q+ R YKD+ E  +R  I++ NLEYIE  N +   +Y L  N+F+DLTNEEF 
Sbjct: 4   RFERWLKQNDRXYKDKEEWEVRFGIYQANLEYIECKNSQ-EXSYNLTDNKFADLTNEEFV 62

Query: 101 ALYTGY-NRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFS 159
           + Y G+  R +P           F Y    D+P S DWR++GAV+ IKDQG CGSCWAFS
Sbjct: 63  SPYLGFGTRFLPHTG--------FMYHEHEDLPESKDWRKEGAVSDIKDQGNCGSCWAFS 114

Query: 160 AVAAVEGITQITRGKLIELSEQQLVDCSTD--NHGCSGGLMDKAFEYIIENKGLATEADY 217
           AVAAVEGI +I  GKL+ LSEQ+  DC  +  N GC GGLMD AF +I +N GL T  DY
Sbjct: 115 AVAAVEGINKIKSGKLVSLSEQEFRDCDVEDGNQGCEGGLMDTAFAFIKKNGGLTTSKDY 174

Query: 218 PYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQAL--LQAVSNQPVSVCVDASGRAFHFY 275
           PY   +GTC+ +K    AA IS +  +P  DE  L    A +NQ  SV +DA G AF  Y
Sbjct: 175 PYEGVDGTCNKEKALHHAANISGHVKVPANDEAMLKAKAAAANQXESVAIDAGGHAFQLY 234

Query: 276 KSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD----A 331
             GV +  CG   +HGV +VG+G    +   KYW++KNSWG  WGESGYIR+ RD    A
Sbjct: 235 LKGVFSGICGKQLNHGVTIVGYGKGTSD---KYWIVKNSWGADWGESGYIRMKRDAFDKA 291

Query: 332 GLCGIATAASYPV 344
           G CGIA  ASYP+
Sbjct: 292 GTCGIAMQASYPL 304


>gi|326430490|gb|EGD76060.1| cysteine proteinase [Salpingoeca sp. ATCC 50818]
          Length = 448

 Score =  289 bits (739), Expect = 2e-75,   Method: Compositional matrix adjust.
 Identities = 154/346 (44%), Positives = 209/346 (60%), Gaps = 33/346 (9%)

Query: 15  VIIILVITCASQVVSGRSMHEPSIVEKH-----EQWMAQHGRTYKDELEKAMRLNIFKQN 69
           +++ LV+ CA   + G +M EP  +  +     + +  +  + Y+   E+A R ++F QN
Sbjct: 1   MMLKLVLVCA---LVGAAMAEPLSLTVNKGRLFDAFKTKFNKVYESAEEEARRFSVFSQN 57

Query: 70  LEYIEKANKEGNR---TYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQ 126
           +++I + N E  R   T+ +  N+F+DLTNEE+R LY    RP P+      R   +   
Sbjct: 58  IDFINRHNAEAARGVHTHTVDVNQFADLTNEEYRQLYL---RPYPTELLGRERQEVW--- 111

Query: 127 NVTDVPT--SIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLV 184
              D P   S+DWR+KGAVT IK+QGQCGSCW+FS   +VEG   I  G L+ LSEQQLV
Sbjct: 112 --LDGPNAGSVDWRQKGAVTPIKNQGQCGSCWSFSTTGSVEGAHAIATGNLVSLSEQQLV 169

Query: 185 DCSTD--NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYE 242
           DCS    N GC+GGLMD AF+YII N GL TE DYPY   +G CD  KE   A +IS Y+
Sbjct: 170 DCSGSFGNQGCNGGLMDNAFKYIISNGGLDTEQDYPYTARDGVCDKSKESKHAVSISGYK 229

Query: 243 DLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEE 302
           D+P+ +E  L  AV   PVSV ++A  ++F  Y SGV +  CG N DHGV VVG+ +   
Sbjct: 230 DVPQNNEDQLAAAVEKGPVSVAIEADQQSFQMYSSGVFSGPCGTNLDHGVLVVGYTS--- 286

Query: 303 ENGAKYWLIKNSWGETWGESGYIRILR---DAGLCGIATAASYPVA 345
                YW++KNSWG +WG+ GYI + R    AG+CGIA   SYP+A
Sbjct: 287 ----DYWIVKNSWGASWGDQGYIMMKRGVSSAGICGIAMQPSYPIA 328


>gi|357129125|ref|XP_003566217.1| PREDICTED: thiol protease SEN102-like [Brachypodium distachyon]
          Length = 380

 Score =  288 bits (738), Expect = 2e-75,   Method: Compositional matrix adjust.
 Identities = 156/325 (48%), Positives = 200/325 (61%), Gaps = 20/325 (6%)

Query: 35  EPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDL 94
           E S+   +E+W A+H    +D  EK+ R N+F++N   + + N   +  YKL  N F+DL
Sbjct: 42  EESLWALYERWRARH-TVSRDLAEKSRRFNVFRENARLVHEFNLRRDAPYKLRLNRFADL 100

Query: 95  TNEEFRALYTGYN---------RPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTH 145
           T++EFR  Y             R   +      + S+F +     +PTS+DWREKGAVT 
Sbjct: 101 TSDEFRRSYASSRVSHHRMFKPRAANNNDDDDDKGSSFTHGGA--LPTSVDWREKGAVTG 158

Query: 146 IKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEY 204
           +KDQGQCGSCWAFS +AAVEGI  I    L  LSEQQLVDC T  N GC GGLMD AF Y
Sbjct: 159 VKDQGQCGSCWAFSTIAAVEGINAIRTNNLTSLSEQQLVDCDTKTNAGCDGGLMDDAFSY 218

Query: 205 IIENKGLATEADYPYR-HEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSV 263
           I ++ G+A E  YPYR  +  +C+++K  A   +I  YED+P+ DE AL +AV+ QPV+V
Sbjct: 219 IAKHGGVAAEKSYPYRARQSSSCNSKKAAAAVVSIDGYEDVPRNDETALKKAVAAQPVAV 278

Query: 264 CVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESG 323
            ++A G  F FY  GV    CG   DHGVA VG+G     +G KYW++KNSWGE WGE G
Sbjct: 279 AIEAGGSHFQFYSEGVFAGKCGTELDHGVAAVGYGVT--VDGTKYWIVKNSWGEEWGEKG 336

Query: 324 YIRILRDA----GLCGIATAASYPV 344
           YIR+ RD     GLCGIA  ASYPV
Sbjct: 337 YIRMKRDVADKEGLCGIAMEASYPV 361


>gi|146215988|gb|ABQ10196.1| actinidin Act3a [Actinidia eriantha]
          Length = 380

 Score =  288 bits (738), Expect = 2e-75,   Method: Compositional matrix adjust.
 Identities = 146/350 (41%), Positives = 210/350 (60%), Gaps = 23/350 (6%)

Query: 1   MVLKFEKSFIIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKA 60
           M L F  +F+I  F I        +++   R+  E  ++  +E W+ ++G++Y    E+ 
Sbjct: 10  MSLLFFSTFLIFSFAI-------DAKISPLRTNDE--VMALYESWLVKYGKSYNSLGERE 60

Query: 61  MRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRP 120
           MR+ IFK+NL +I++ N + NR+Y +G N+F+DLT+EE+R+ Y G+   + S       P
Sbjct: 61  MRIEIFKENLRFIDEHNADPNRSYTVGLNQFADLTDEEYRSTYLGFKSSLKSKVSNRYMP 120

Query: 121 STFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSE 180
              +      +P  +DWR  GAV  +K+QG C SCWAF+ +A VE I QI  G LI LSE
Sbjct: 121 QVGEV-----LPDYVDWRTTGAVVDVKNQGLCSSCWAFATIATVESINQIITGDLISLSE 175

Query: 181 QQLVDCSTD--NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATI 238
           Q+LVDC+    N GC GG MD A+E+II N G+ TE +YPY  ++  CD  K+     TI
Sbjct: 176 QELVDCNRTPINEGCKGGFMDDAYEFIINNGGINTEENYPYIGQDDQCDEPKKNQNYVTI 235

Query: 239 SKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLN-ADCGNNCDHGVAVVGF 297
             YE +P  DE A+ +AV+ QPVSV +DA    F FY+SG+     CG   +H V ++G+
Sbjct: 236 DSYEQVPPNDELAMKRAVAYQPVSVAIDAYCLGFRFYQSGIFTGGSCGTTLNHAVTIIGY 295

Query: 298 GTAEEENGAKYWLIKNSWGETWGESGYIRILRDA---GLCGIATAASYPV 344
           GT   ENG  YW++KNS+G  WGESGY ++ R+    G CGIA+   YPV
Sbjct: 296 GT---ENGIDYWIVKNSYGTQWGESGYGKVQRNVGGEGRCGIASYPFYPV 342


>gi|2463588|dbj|BAA22546.1| FB1035 precursor [Ananas comosus]
          Length = 324

 Score =  288 bits (738), Expect = 2e-75,   Method: Compositional matrix adjust.
 Identities = 135/315 (42%), Positives = 211/315 (66%), Gaps = 14/315 (4%)

Query: 35  EPS--IVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFS 92
           EP+  ++++ E+WMA++GR YKD  EK  R  IFK N+++IE  N     +Y LG N+F+
Sbjct: 1   EPNDPMMKRFEEWMAEYGRIYKDNDEKMRRFQIFKNNVKHIETFNSRNGNSYTLGINQFT 60

Query: 93  DLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQC 152
           D+T  EF A YTG + P+ ++ R+     +F   N++ VP SIDWR+ GAV  +K+Q  C
Sbjct: 61  DMTKSEFVAQYTGVSLPL-NIEREPV--VSFDDVNISAVPQSIDWRDYGAVNEVKNQNPC 117

Query: 153 GSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTDNHGCSGGLMDKAFEYIIENKGLA 212
           GSCWAF+A+A VEGI +I  G L+ LSEQ+++DC+  ++GC GG ++KA+++II N G+ 
Sbjct: 118 GSCWAFAAIATVEGIYKIKTGYLVSLSEQEVLDCAV-SYGCKGGWVNKAYDFIISNNGVT 176

Query: 213 TEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAF 272
           TE +YPY+  +GTC N      +A I+ Y  + + DE++++ AVSNQP++  +DAS   F
Sbjct: 177 TEENYPYQAYQGTC-NANSFPNSAYITGYSYVRRNDERSMMYAVSNQPIAALIDAS-ENF 234

Query: 273 HFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILR--- 329
            +Y  GV +  CG + +H + ++G+G  ++ +G KYW+++NSWG +WGE GY+R+ R   
Sbjct: 235 QYYNGGVFSGPCGTSLNHAITIIGYG--QDSSGTKYWIVRNSWGSSWGEGGYVRMARGVS 292

Query: 330 -DAGLCGIATAASYP 343
             +G CGIA +  +P
Sbjct: 293 SSSGACGIAMSPLFP 307


>gi|356515062|ref|XP_003526220.1| PREDICTED: LOW QUALITY PROTEIN: vignain-like [Glycine max]
          Length = 337

 Score =  288 bits (737), Expect = 3e-75,   Method: Compositional matrix adjust.
 Identities = 153/343 (44%), Positives = 210/343 (61%), Gaps = 18/343 (5%)

Query: 10  IIPMFVIIILVITCA-SQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQ 68
           I+   V  I V  C+ S+     S+        HE+WMAQHG+ YKD  EK   L IF+ 
Sbjct: 6   ILKFLVAFIEVDACSLSESCCSHSL-------SHEKWMAQHGKVYKDAAEKERCLQIFEN 58

Query: 69  NLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNV 128
           N+E+IE  +  G++++ L TN+F+DL +EEF+AL T  ++   S+   ++  + F+Y NV
Sbjct: 59  NMEFIESFDVCGDKSFNLSTNQFADLHDEEFKALLTNGHKKEHSL--WTTTETLFRYDNV 116

Query: 129 TDVPTSIDWREKGAVTHIKDQGQCGSCWAFS-AVAAVEGITQITRGKLIELSEQQLVD-C 186
           T +P S+DWR++G VT IKDQG+C SCWAFS  VA +EG+ QI   +L+ LSEQ+LVD  
Sbjct: 117 TKIPASMDWRKRGVVTPIKDQGKCLSCWAFSLCVATIEGLHQIITSELVPLSEQELVDFV 176

Query: 187 STDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPK 246
             ++ GC G  ++ AF++I +   + +E  YPY+    TC  +KE    A I  Y+ +P 
Sbjct: 177 KGESEGCYGDYVEDAFKFITKKGRIESETHYPYKGVNNTCKVKKETHGVAQIKGYKKVPS 236

Query: 247 GDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGA 306
             E ALL+AV+NQ VSV V+A   AF FY SG+    CG + DH VA+  +G  E  +G 
Sbjct: 237 KSENALLKAVANQLVSVSVEARDSAFQFYSSGIFTGKCGTDTDHRVALASYG--ESGDGT 294

Query: 307 KYWLIKNSWGETWGESGYIRILRD----AGLCGIATAASYPVA 345
           KYWL KNSWG  WGE GYIRI  D     GLCGIA    YP+A
Sbjct: 295 KYWLAKNSWGTEWGEKGYIRIKXDIPAKEGLCGIAKYPYYPIA 337


>gi|449447027|ref|XP_004141271.1| PREDICTED: cysteine proteinase RD21a-like [Cucumis sativus]
          Length = 458

 Score =  288 bits (736), Expect = 3e-75,   Method: Compositional matrix adjust.
 Identities = 158/351 (45%), Positives = 220/351 (62%), Gaps = 23/351 (6%)

Query: 5   FEKSFIIPMFVIIILVITCAS--QVVSGRSMHEPSIVEKHEQWMAQHGRTYKD-ELEKAM 61
           F+ S I+ +   + + ++ AS   ++  R+  E  ++  ++QW A+HG+ + +   E   
Sbjct: 4   FQSSPIMALLFFLFIALSAASPSSIIPQRTDDE--VMALYDQWRAKHGKLHNNLGAEPEN 61

Query: 62  RLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPS 121
           R +IFK NL++I++ N + N  Y+LG N F+DLTNEE+R+ Y G      S SR++   +
Sbjct: 62  RFHIFKDNLKFIDEINAQ-NLPYRLGLNVFADLTNEEYRSRYLG--GKFASGSRRNRTSN 118

Query: 122 TFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQ 181
            +  +   D+P SIDWR KGAV  +KDQG CGSCWAFS VA+VE I QI  G LI LSEQ
Sbjct: 119 RYLPRLGDDLPDSIDWRAKGAVAPVKDQGSCGSCWAFSTVASVEAINQIVTGDLIALSEQ 178

Query: 182 QLVDCSTD-NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISK 240
           +LVDC    N GC+GGLMD AFE+IIEN GL TE DYPY   + +C   K+ A    I  
Sbjct: 179 ELVDCDRSYNEGCNGGLMDYAFEFIIENGGLDTEEDYPYYGFDSSCIQYKKNA----IDG 234

Query: 241 YEDLPKGDEQALLQA---VSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGF 297
           YED+P  +E+AL +A        VSV ++  GR+F  Y+SG+    CG + DHGV VVG+
Sbjct: 235 YEDVPVNNEKALQKAVSKQVVSVVSVAIEGGGRSFQLYQSGIFTGRCGTDLDHGVNVVGY 294

Query: 298 GTAEEENGAKYWLIKNSWGETWGESGYIRILRD----AGLCGIATAASYPV 344
           G+   E G  YW+++NSWG +WGESGY+++ R+     GLCGIA   SYP 
Sbjct: 295 GS---EGGVDYWIVRNSWGGSWGESGYVKMQRNIASPTGLCGIAMEPSYPT 342


>gi|194320502|gb|ACF48469.1| cathepsin L [Triatoma brasiliensis]
          Length = 330

 Score =  288 bits (736), Expect = 3e-75,   Method: Compositional matrix adjust.
 Identities = 154/345 (44%), Positives = 215/345 (62%), Gaps = 30/345 (8%)

Query: 12  PMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWM---AQHGRTYKDELEKAMRLNIFKQ 68
           P+ V + ++           S   PS     E+W    A HG+TYK++ E+  R+ IF  
Sbjct: 3   PLLVAVAIIAL---------SYAHPSFDIYPEEWHVFKAMHGKTYKNQFEEMFRMKIFMD 53

Query: 69  NLEYIEKAN---KEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKY 125
           N + IE  N   ++G  +YK+  N F DL   EF+AL  G+      +S  + R     +
Sbjct: 54  NKKKIEAHNAKYEQGEVSYKMMMNHFGDLMVHEFKALMNGF-----KMSPDTKRNGELYF 108

Query: 126 QNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVD 185
            + +++P ++DWR+KGAVT +KDQGQCGSCW+FSA  ++EG   +  GKL+ LSEQ LVD
Sbjct: 109 PSNSNLPKTVDWRQKGAVTPVKDQGQCGSCWSFSATGSLEGQVFLKTGKLVSLSEQNLVD 168

Query: 186 CSTD--NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYED 243
           CST   N+GC GGLMD+AF+Y+ +NKG+ TEA YPY   E TC  +K K V  T   + D
Sbjct: 169 CSTSYGNNGCEGGLMDQAFQYVSDNKGIDTEASYPYEARENTCRFKKNK-VGGTDKGHVD 227

Query: 244 LPKGDEQALLQAVSN-QPVSVCVDASGRAFHFYKSGVLN-ADCGN-NCDHGVAVVGFGTA 300
           +P GDE+AL  A++   P+SV +DA+  +F FY  GV N  +C + + DHGV  VG+GT 
Sbjct: 228 IPAGDEKALQNALATVGPISVAIDANHGSFQFYSKGVYNEPNCSSYDLDHGVLAVGYGT- 286

Query: 301 EEENGAKYWLIKNSWGETWGESGYIRILRD-AGLCGIATAASYPV 344
             ENG  YWL+KNSWG +WGE+GYI+I R+ +  CGIA+ ASYP+
Sbjct: 287 --ENGQDYWLVKNSWGPSWGENGYIKIARNHSNHCGIASMASYPL 329


>gi|302812789|ref|XP_002988081.1| hypothetical protein SELMODRAFT_183539 [Selaginella moellendorffii]
 gi|300144187|gb|EFJ10873.1| hypothetical protein SELMODRAFT_183539 [Selaginella moellendorffii]
          Length = 425

 Score =  287 bits (735), Expect = 5e-75,   Method: Compositional matrix adjust.
 Identities = 141/293 (48%), Positives = 189/293 (64%), Gaps = 16/293 (5%)

Query: 62  RLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGY-----NRPVPSVSRQ 116
           R   FK+N  YIE+ N+ G  +Y+LG N+FSDLT+EEFR  + G      + PV  + R 
Sbjct: 34  RFETFKENFRYIEEHNRAGKHSYRLGLNQFSDLTSEEFRQRFLGLRPDLIDSPVLKMPRD 93

Query: 117 SSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLI 176
           S     F  QNV D+P S+DWR+ GAVT  KDQG CG CWAF+   A+EGI QI  G+L+
Sbjct: 94  SDIEEGF--QNV-DLPASVDWRQHGAVTAPKDQGSCGGCWAFATTGAIEGINQIVTGQLV 150

Query: 177 ELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVA 235
            LSEQ+L+DC    + GC GGLM+ A+++I+EN GL TE DYPY   E  C+ +K  +  
Sbjct: 151 SLSEQELIDCDKKADKGCDGGLMENAYQFIVENGGLDTETDYPYHASESHCNMKKLNSRV 210

Query: 236 ATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVV 295
             I  Y+ +P+GDEQALL AV+ QPVSV ++ + + F  Y SGV    CG   +HGV +V
Sbjct: 211 VAIDGYKAIPEGDEQALLLAVAKQPVSVAIEGASKDFQHYASGVFTGHCGEEINHGVLIV 270

Query: 296 GFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD----AGLCGIATAASYPV 344
           G+GT   E+G  YW++KNSW  TWG+ G++++ R+     GLC I T ASYPV
Sbjct: 271 GYGT---EDGLDYWIVKNSWAATWGDGGFVKMQRNTGKRGGLCSINTLASYPV 320


>gi|194352756|emb|CAQ00106.1| papain-like cysteine proteinase [Hordeum vulgare subsp. vulgare]
          Length = 476

 Score =  287 bits (734), Expect = 5e-75,   Method: Compositional matrix adjust.
 Identities = 138/316 (43%), Positives = 198/316 (62%), Gaps = 16/316 (5%)

Query: 42  HEQWMAQHGRTYKDEL----EKAMRLNIFKQNLEYIEKANKE---GNRTYKLGTNEFSDL 94
           ++ W+A+HG           ++  R + F  NL +++  N     G   ++L  N F+DL
Sbjct: 52  YDLWLAEHGGGSSPNANSIADRERRFSAFWDNLRFVDAHNARAAAGEEGFRLAMNRFADL 111

Query: 95  TNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGS 154
           TN+EFRA Y G                 +++    ++P ++DWREKGAV  +K+QGQCGS
Sbjct: 112 TNDEFRAAYLGVKGAAERNRAGRVVGDRYRHDGAEELPEAVDWREKGAVAPVKNQGQCGS 171

Query: 155 CWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTDNH--GCSGGLMDKAFEYIIENKGLA 212
           CWAFSAV+ VE I QI  G+++ LSEQ+LV+C  +    GC+GGLMD AFE+II+N G+ 
Sbjct: 172 CWAFSAVSTVESINQIVTGEMVTLSEQELVECDINGQSSGCNGGLMDDAFEFIIKNGGID 231

Query: 213 TEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAF 272
           TE DYPY+  +G CD  ++ A   +I  +ED+P+ DE++L +AV++ PVSV ++A GR F
Sbjct: 232 TEDDYPYKAVDGRCDVLRKNAKVVSIDGFEDVPENDEKSLQKAVAHHPVSVAIEAGGREF 291

Query: 273 HFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD-- 330
             Y SGV +  CG   DHGV  VG+GT   ENG  YW+++NSWG  WGE+GY+R+ R+  
Sbjct: 292 QLYHSGVFSGRCGTQLDHGVVAVGYGT---ENGKDYWIVRNSWGPNWGEAGYLRMERNIN 348

Query: 331 --AGLCGIATAASYPV 344
             +G CGIA  +SYP 
Sbjct: 349 VTSGKCGIAMMSSYPT 364


>gi|326507362|dbj|BAK03074.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 473

 Score =  287 bits (734), Expect = 6e-75,   Method: Compositional matrix adjust.
 Identities = 138/316 (43%), Positives = 198/316 (62%), Gaps = 16/316 (5%)

Query: 42  HEQWMAQHGRTYKDEL----EKAMRLNIFKQNLEYIEKANKE---GNRTYKLGTNEFSDL 94
           ++ W+A+HG           ++  R + F  NL +++  N     G   ++L  N F+DL
Sbjct: 52  YDLWLAEHGGGSSPNANSIADRERRFSAFWDNLRFVDAHNARAAAGEEGFRLAMNRFADL 111

Query: 95  TNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGS 154
           TN+EFRA Y G                 +++    ++P ++DWREKGAV  +K+QGQCGS
Sbjct: 112 TNDEFRAAYLGVKGAAERNRAGRVVGERYRHDGAEELPEAVDWREKGAVAPVKNQGQCGS 171

Query: 155 CWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTDNH--GCSGGLMDKAFEYIIENKGLA 212
           CWAFSAV+ VE I QI  G+++ LSEQ+LV+C  +    GC+GGLMD AFE+II+N G+ 
Sbjct: 172 CWAFSAVSTVESINQIVTGEMVTLSEQELVECDINGQSSGCNGGLMDDAFEFIIKNGGID 231

Query: 213 TEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAF 272
           TE DYPY+  +G CD  ++ A   +I  +ED+P+ DE++L +AV++ PVSV ++A GR F
Sbjct: 232 TEDDYPYKAVDGRCDVLRKNAKVVSIDGFEDVPENDEKSLQKAVAHHPVSVAIEAGGREF 291

Query: 273 HFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD-- 330
             Y SGV +  CG   DHGV  VG+GT   ENG  YW+++NSWG  WGE+GY+R+ R+  
Sbjct: 292 QLYHSGVFSGRCGTQLDHGVVAVGYGT---ENGKDYWIVRNSWGPNWGEAGYLRMERNIN 348

Query: 331 --AGLCGIATAASYPV 344
             +G CGIA  +SYP 
Sbjct: 349 VTSGKCGIAMMSSYPT 364


>gi|260516654|gb|ACX43954.1| cysteine protease 1 [Brachiaria hybrid cultivar]
 gi|260516656|gb|ACX43955.1| cysteine protease 1 [Brachiaria hybrid cultivar]
 gi|260516658|gb|ACX43956.1| cysteine protease 1 [Brachiaria hybrid cultivar]
 gi|260516660|gb|ACX43957.1| cysteine protease 1 [Brachiaria hybrid cultivar]
 gi|260516662|gb|ACX43958.1| cysteine protease 2 [Brachiaria hybrid cultivar]
 gi|260516664|gb|ACX43959.1| cysteine protease 2 [Brachiaria hybrid cultivar]
 gi|260516666|gb|ACX43960.1| cysteine protease 2 [Brachiaria hybrid cultivar]
 gi|260516668|gb|ACX43961.1| cysteine protease 2 [Brachiaria hybrid cultivar]
 gi|260516670|gb|ACX43962.1| cysteine protease 2 [Brachiaria hybrid cultivar]
          Length = 338

 Score =  287 bits (734), Expect = 6e-75,   Method: Compositional matrix adjust.
 Identities = 154/305 (50%), Positives = 192/305 (62%), Gaps = 17/305 (5%)

Query: 45  WMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYT 104
           +M Q+ + Y    E + R N FK N+E I   N   N +Y +G NEF+DL+ EEF+  Y 
Sbjct: 45  FMKQYSKAY-SHAEFSSRFNQFKANVETIRLHNTLANASYTMGLNEFADLSFEEFKGKYF 103

Query: 105 GYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAV 164
           GY      V R+ +R +   +Q V   PTSIDWR   AVT IKDQGQCGSCWAFSA  ++
Sbjct: 104 GYKH----VEREFARSNNL-HQEVEAAPTSIDWRTSNAVTPIKDQGQCGSCWAFSATGSI 158

Query: 165 EGITQITRGK--LIELSEQQLVDCSTD--NHGCSGGLMDKAFEYIIENKGLATEADYPYR 220
           EG   + +GK  L  LSEQQLVDCST   N GC+GGLMD AFEYII NKG+  E+ YPY+
Sbjct: 159 EG-AWVLQGKHTLTSLSEQQLVDCSTSYGNAGCNGGLMDYAFEYIIANKGICAESAYPYK 217

Query: 221 HEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSN-QPVSVCVDASGRAFHFYKSGV 279
              G C     K V  TIS Y+D+  GDE +LL AV    PVSV ++A    F FY SGV
Sbjct: 218 GVGGLCQKSCTKVV--TISGYKDVASGDEASLLNAVGTVGPVSVAIEADQAGFQFYSSGV 275

Query: 280 LNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRDAGLCGIATA 339
            +  CG+N DHGV  VG+GT   ++   YW++KNSWG +WGESGYIR++R+   CGIA  
Sbjct: 276 FSGTCGHNLDHGVLAVGYGTTGSQD---YWIVKNSWGTSWGESGYIRMIRNKNQCGIAIQ 332

Query: 340 ASYPV 344
            SYP 
Sbjct: 333 PSYPT 337


>gi|204307508|gb|ACI00280.1| triticain beta 2 [Hordeum vulgare]
          Length = 473

 Score =  287 bits (734), Expect = 6e-75,   Method: Compositional matrix adjust.
 Identities = 138/316 (43%), Positives = 198/316 (62%), Gaps = 16/316 (5%)

Query: 42  HEQWMAQHGRTYKDEL----EKAMRLNIFKQNLEYIEKANKE---GNRTYKLGTNEFSDL 94
           ++ W+A+HG           ++  R + F  NL +++  N     G   ++L  N F+DL
Sbjct: 52  YDLWLAEHGGGSSPNANSIADRERRFSAFWDNLRFVDAHNARAAAGEEGFRLAMNRFADL 111

Query: 95  TNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGS 154
           TN+EFRA Y G                 +++    ++P ++DWREKGAV  +K+QGQCGS
Sbjct: 112 TNDEFRAAYLGVKGAAERNRAGRVVGERYRHDGAEELPEAVDWREKGAVAPVKNQGQCGS 171

Query: 155 CWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTDNH--GCSGGLMDKAFEYIIENKGLA 212
           CWAFSAV+ VE I QI  G+++ LSEQ+LV+C  +    GC+GGLMD AFE+II+N G+ 
Sbjct: 172 CWAFSAVSTVESINQIVTGEMVTLSEQELVECDINGQSSGCNGGLMDDAFEFIIKNGGID 231

Query: 213 TEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAF 272
           TE DYPY+  +G CD  ++ A   +I  +ED+P+ DE++L +AV++ PVSV ++A GR F
Sbjct: 232 TEDDYPYKAVDGRCDVLRKNAKVVSIDGFEDVPENDEKSLQKAVAHHPVSVAIEAGGREF 291

Query: 273 HFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD-- 330
             Y SGV +  CG   DHGV  VG+GT   ENG  YW+++NSWG  WGE+GY+R+ R+  
Sbjct: 292 QLYHSGVFSGRCGTQLDHGVVAVGYGT---ENGKDYWIVRNSWGPNWGEAGYLRMERNIN 348

Query: 331 --AGLCGIATAASYPV 344
             +G CGIA  +SYP 
Sbjct: 349 VTSGKCGIAMMSSYPT 364


>gi|156142226|gb|ABU51882.1| ervatamin-C precursor [Tabernaemontana divaricata]
          Length = 365

 Score =  285 bits (729), Expect = 2e-74,   Method: Compositional matrix adjust.
 Identities = 141/310 (45%), Positives = 195/310 (62%), Gaps = 14/310 (4%)

Query: 38  IVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNE 97
           + E +E W+A+H + Y   +E   R  IFK NL++I++ N E N TYK+G   ++DLTNE
Sbjct: 41  VKEIYELWLAKHDKVYSGLVEYEKRFEIFKDNLKFIDEHNSE-NHTYKMGLTPYTDLTNE 99

Query: 98  EFRALYTG-YNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCW 156
           EF+A+Y G  +  +  + R  +    + Y+   ++P  IDWR+KGAVT +K+QG+CGSCW
Sbjct: 100 EFQAIYLGTRSDTIHRLKRTINISERYAYEAGDNLPEQIDWRKKGAVTPVKNQGKCGSCW 159

Query: 157 AFSAVAAVEGITQITRGKLIELSEQQLVDCSTDNHGCSGGLMDKAFEYIIENKGLATEAD 216
           AFS V+ VE I QI  G LI LSEQQLVDC+  NHGC GG    A++YII+N G+ TEA+
Sbjct: 160 AFSTVSTVESINQIRTGNLISLSEQQLVDCNKKNHGCKGGAFVYAYQYIIDNGGIDTEAN 219

Query: 217 YPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYK 276
           YPY+  +G C   K+      I  Y+ +P  +E AL +AV++QP  V +DAS + F  YK
Sbjct: 220 YPYKAVQGPCRAAKK---VVRIDGYKGVPHCNENALKKAVASQPSVVAIDASSKQFQHYK 276

Query: 277 SGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILR--DAGLC 334
           SG+ +  CG   +HGV +VG+          YW+++NSWG  WGE GYIR+ R    GLC
Sbjct: 277 SGIFSGPCGTKLNHGVVIVGYWK-------DYWIVRNSWGRYWGEQGYIRMKRVGGCGLC 329

Query: 335 GIATAASYPV 344
           GIA    YP 
Sbjct: 330 GIARLPYYPT 339


>gi|260516678|gb|ACX43965.1| cysteine protease 1 [Brachiaria hybrid cultivar]
          Length = 338

 Score =  285 bits (729), Expect = 2e-74,   Method: Compositional matrix adjust.
 Identities = 153/305 (50%), Positives = 192/305 (62%), Gaps = 17/305 (5%)

Query: 45  WMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYT 104
           +M Q+ + Y    E + R N FK N+E I   N   N +Y +G NEF+DL+ EEF+  Y 
Sbjct: 45  FMKQYSKAYS-HAEFSSRFNQFKANVETIRLHNTLANASYTMGLNEFADLSFEEFKGKYF 103

Query: 105 GYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAV 164
           GY      V R+ +R +   +Q V   PTSIDWR   AVT IKDQGQCGSCWAFSA  ++
Sbjct: 104 GYKH----VEREFARSNNL-HQEVEAAPTSIDWRTSNAVTPIKDQGQCGSCWAFSATGSI 158

Query: 165 EGITQITRGK--LIELSEQQLVDCSTD--NHGCSGGLMDKAFEYIIENKGLATEADYPYR 220
           EG   + +GK  L  LSEQQLVDCST   + GC+GGLMD AFEYII NKG+  E+ YPY+
Sbjct: 159 EG-AWVLQGKHTLTSLSEQQLVDCSTSYGDAGCNGGLMDYAFEYIIANKGICAESAYPYK 217

Query: 221 HEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSN-QPVSVCVDASGRAFHFYKSGV 279
              G C     K V  TIS Y+D+  GDE +LL AV    PVSV ++A    F FY SGV
Sbjct: 218 GVGGLCQKSCTKVV--TISGYKDVASGDEASLLNAVGTVGPVSVAIEADQAGFQFYSSGV 275

Query: 280 LNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRDAGLCGIATA 339
            +  CG+N DHGV  VG+GT   ++   YW++KNSWG +WGESGYIR++R+   CGIA  
Sbjct: 276 FSGTCGHNLDHGVLAVGYGTTGSQD---YWIVKNSWGTSWGESGYIRMIRNKNQCGIAIQ 332

Query: 340 ASYPV 344
            SYP 
Sbjct: 333 PSYPT 337


>gi|449469929|ref|XP_004152671.1| PREDICTED: cysteine proteinase RD21a-like [Cucumis sativus]
 gi|449529596|ref|XP_004171784.1| PREDICTED: cysteine proteinase RD21a-like [Cucumis sativus]
          Length = 431

 Score =  285 bits (729), Expect = 2e-74,   Method: Compositional matrix adjust.
 Identities = 139/315 (44%), Positives = 193/315 (61%), Gaps = 20/315 (6%)

Query: 37  SIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTN 96
           ++ E  E W  +HG++Y    EK  RL +F  N E++   N   N +Y L  N ++DLT+
Sbjct: 24  NVSELFEIWCTEHGKSYSSAEEKLYRLGVFADNYEFVTHHNNLDNSSYTLSLNSYADLTH 83

Query: 97  EEFRALYTGYNRPV----PSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQC 152
            EF+    G++  +    P + ++ S P         DVP S+DWR+KGAVT +KDQG C
Sbjct: 84  HEFKVSRLGFSPALRNFRPVLPQEPSLPR--------DVPDSLDWRKKGAVTAVKDQGSC 135

Query: 153 GSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGL 211
           G+CW+FSA  A+EGI QI  G LI LSEQ+L+DC    N GC GGLMD A++++I N G+
Sbjct: 136 GACWSFSATGAMEGINQIMTGSLISLSEQELIDCDRSYNSGCGGGLMDYAYQFVISNHGI 195

Query: 212 ATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRA 271
            TE DYPY+  +G+C   K +    TI  Y D+P  DE  LLQAV+ QPVSV +  S RA
Sbjct: 196 DTENDYPYQARDGSCRKDKLQRNVVTIDGYADIPSNDEGKLLQAVAAQPVSVGICGSERA 255

Query: 272 FHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRDA 331
           F  Y  G+ +  C  + DH V +VG+G+   ENG  YW++KNSWG++WG  GY+ + R++
Sbjct: 256 FQLYSKGIFSGPCSTSLDHAVLIVGYGS---ENGVDYWIVKNSWGKSWGMDGYMHMQRNS 312

Query: 332 ----GLCGIATAASY 342
               G+CGI   ASY
Sbjct: 313 GNSEGVCGINKLASY 327


>gi|8886940|gb|AAF80626.1|AC069251_19 F2D10.37 [Arabidopsis thaliana]
          Length = 315

 Score =  285 bits (728), Expect = 3e-74,   Method: Compositional matrix adjust.
 Identities = 134/266 (50%), Positives = 191/266 (71%), Gaps = 5/266 (1%)

Query: 38  IVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNE 97
           ++E  E W++   + Y+   EK +R  +FK NL++I++ NK+G ++Y LG NEF+DL++E
Sbjct: 47  LIELFENWISNFEKAYETVEEKFLRFEVFKDNLKHIDETNKKG-KSYWLGLNEFADLSHE 105

Query: 98  EFRALYTGYNRPVPSVSRQSSRP-STFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCW 156
           EF+ +Y G    +  V R   R  + F Y++V  VP S+DWR+KGAV  +K+QG CGSCW
Sbjct: 106 EFKKMYLGLKTDI--VRRDEERSYAEFAYRDVEAVPKSVDWRKKGAVAEVKNQGSCGSCW 163

Query: 157 AFSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGLATEA 215
           AFS VAAVEGI +I  G L  LSEQ+L+DC T  N+GC+GGLMD AFEYI++N GL  E 
Sbjct: 164 AFSTVAAVEGINKIVTGNLTTLSEQELIDCDTTYNNGCNGGLMDYAFEYIVKNGGLRKEE 223

Query: 216 DYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFY 275
           DYPY  EEGTC+ QK+++   TI+ ++D+P  DE++LL+A+++QP+SV +DASGR F FY
Sbjct: 224 DYPYSMEEGTCEMQKDESETVTINGHQDVPTNDEKSLLKALAHQPLSVAIDASGREFQFY 283

Query: 276 KSGVLNADCGNNCDHGVAVVGFGTAE 301
             GV +  CG + DHGVA VG+G+++
Sbjct: 284 SGGVFDGRCGVDLDHGVAAVGYGSSK 309


>gi|343978787|gb|AEM76722.1| cathepsin L-like proteinase [Triatoma brasiliensis]
          Length = 330

 Score =  285 bits (728), Expect = 3e-74,   Method: Compositional matrix adjust.
 Identities = 152/345 (44%), Positives = 218/345 (63%), Gaps = 29/345 (8%)

Query: 11  IPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNL 70
           + +  + ++ ++CA++  +          E+ E +   HG+ YK++ E+  R  IF  N 
Sbjct: 3   VLLVAVAVIAVSCANRFYNINP-------EEWETFKVVHGKNYKNQFEEMFRRKIFMNNK 55

Query: 71  EYIEKAN---KEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSS--RPSTFKY 125
           + IE  N   ++G  +YK+  N F DL + E +AL  G+ +  P+  R+     PS  K 
Sbjct: 56  KRIEAHNAKYEQGEVSYKMKMNHFGDLMSHEIKALMNGF-KMTPNTKREGKIYFPSNDK- 113

Query: 126 QNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVD 185
                +P S+DWR+KGAVT +KDQGQCGSCW+FSA  ++EG   + +GKL+ LSEQ L+D
Sbjct: 114 -----LPKSVDWRQKGAVTPVKDQGQCGSCWSFSATGSLEGQIFLKKGKLVSLSEQNLMD 168

Query: 186 CSTD--NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYED 243
           CS +  N+GC GGLMDKAF+Y+ +NKG+ TE+ YPY   +  C  +K+K V  T   Y D
Sbjct: 169 CSKEYGNNGCEGGLMDKAFQYVSDNKGIDTESSYPYEARDYACRFKKDK-VGGTDKGYVD 227

Query: 244 LPKGDEQALLQAVSN-QPVSVCVDASGRAFHFYKSGVLNAD-CGN-NCDHGVAVVGFGTA 300
           +P+GDE+AL  A++   P+SV +DAS  +FHFY  GV N   C + + DHGV  VG+GT 
Sbjct: 228 IPEGDEKALQNALATVGPISVAIDASHESFHFYSEGVYNEPYCSSYDLDHGVLAVGYGT- 286

Query: 301 EEENGAKYWLIKNSWGETWGESGYIRILRD-AGLCGIATAASYPV 344
             ENG  YWL+KNSWG +WGESGYI+I R+ +  CGIA+ ASYP+
Sbjct: 287 --ENGQDYWLVKNSWGPSWGESGYIKIARNHSNHCGIASMASYPI 329


>gi|115461226|ref|NP_001054213.1| Os04g0670500 [Oryza sativa Japonica Group]
 gi|62510688|sp|Q7XR52.2|CYSP1_ORYSJ RecName: Full=Cysteine protease 1; AltName: Full=OsCP1; Flags:
           Precursor
 gi|38345300|emb|CAE02828.2| OSJNBa0043A12.33 [Oryza sativa Japonica Group]
 gi|113565784|dbj|BAF16127.1| Os04g0670500 [Oryza sativa Japonica Group]
 gi|215741575|dbj|BAG98070.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 490

 Score =  284 bits (727), Expect = 4e-74,   Method: Compositional matrix adjust.
 Identities = 142/296 (47%), Positives = 188/296 (63%), Gaps = 14/296 (4%)

Query: 58  EKAMRLNIFKQNLEYIEKANKEGNRT--YKLGTNEFSDLTNEEFRALYTGYNRPVPSVSR 115
           E   R  +F  NL++++  N   +    ++LG N F+DLTN EFRA Y G         R
Sbjct: 84  EHERRFRVFWDNLKFVDAHNARADERGGFRLGMNRFADLTNGEFRATYLG----TTPAGR 139

Query: 116 QSSRPSTFKYQNVTDVPTSIDWREKGAVTH-IKDQGQCGSCWAFSAVAAVEGITQITRGK 174
                  +++  V  +P S+DWR+KGAV   +K+QGQCGSCWAFSAVAAVEGI +I  G+
Sbjct: 140 GRRVGEAYRHDGVEALPDSVDWRDKGAVVAPVKNQGQCGSCWAFSAVAAVEGINKIVTGE 199

Query: 175 LIELSEQQLVDCSTD--NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEK 232
           L+ LSEQ+LV+C+ +  N GC+GG+MD AF +I  N GL TE DYPY   +G C+  K  
Sbjct: 200 LVSLSEQELVECARNGQNSGCNGGIMDDAFAFIARNGGLDTEEDYPYTAMDGKCNLAKRS 259

Query: 233 AVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGV 292
               +I  +ED+P+ DE +L +AV++QPVSV +DA GR F  Y SGV    CG N DHGV
Sbjct: 260 RKVVSIDGFEDVPENDELSLQKAVAHQPVSVAIDAGGREFQLYDSGVFTGRCGTNLDHGV 319

Query: 293 AVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD----AGLCGIATAASYPV 344
             VG+GT +   GA YW ++NSWG  WGE+GYIR+ R+     G CGIA  ASYP+
Sbjct: 320 VAVGYGT-DAATGAAYWTVRNSWGPDWGENGYIRMERNVTARTGKCGIAMMASYPI 374


>gi|2351107|dbj|BAA21929.1| bromelain [Ananas comosus]
          Length = 312

 Score =  284 bits (727), Expect = 4e-74,   Method: Compositional matrix adjust.
 Identities = 138/302 (45%), Positives = 195/302 (64%), Gaps = 11/302 (3%)

Query: 46  MAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTG 105
           MA++GR YKD  EK  R  IFK N+ +IE  N     +Y LG N+F+D+TN EF A YTG
Sbjct: 1   MAEYGRVYKDNDEKMRRFQIFKNNVNHIETFNNRNGNSYTLGINKFTDMTNNEFVAQYTG 60

Query: 106 -YNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAV 164
             +RP+   + +     +F   N++ V  SIDWR+ GAVT +KDQ  CGSCWAFSA+A V
Sbjct: 61  GISRPL---NIEKEPVVSFDDVNISAVGQSIDWRDYGAVTEVKDQNPCGSCWAFSAIATV 117

Query: 165 EGITQITRGKLIELSEQQLVDCSTDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEG 224
           EGI +I  G L+ LSEQ+++DC+  N GC GG +D A+++II N G+A+EADYPY+  +G
Sbjct: 118 EGIYKIVTGYLVSLSEQEVLDCAVSN-GCDGGFVDNAYDFIISNNGVASEADYPYQAYQG 176

Query: 225 TCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADC 284
            C        +A I+ Y  +   DE ++  AV NQP++  +DASG  F +Y  GV +  C
Sbjct: 177 DCA-ANSWPNSAYITGYSYVRSNDESSMKYAVWNQPIAAAIDASGDNFQYYNGGVFSGPC 235

Query: 285 GNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILR---DAGLCGIATAAS 341
           G + +H + ++G+G  ++ +G +YW++KNSWG +WGE GYIR+ R    +GLCGIA    
Sbjct: 236 GTSLNHAITIIGYG--QDSSGTQYWIVKNSWGSSWGERGYIRMARGVSSSGLCGIAMDPL 293

Query: 342 YP 343
           YP
Sbjct: 294 YP 295


>gi|297843784|ref|XP_002889773.1| hypothetical protein ARALYDRAFT_471096 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297335615|gb|EFH66032.1| hypothetical protein ARALYDRAFT_471096 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 439

 Score =  284 bits (727), Expect = 4e-74,   Method: Compositional matrix adjust.
 Identities = 142/312 (45%), Positives = 194/312 (62%), Gaps = 13/312 (4%)

Query: 40  EKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEF 99
           E  + W  +HG+TY  E E+  R+ IFK N +++ + N   N TY L  N F+DLT+ EF
Sbjct: 30  ELFDDWCQRHGKTYGSEEERQQRIQIFKDNHDFVTQHNLITNATYSLSLNAFADLTHHEF 89

Query: 100 RALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFS 159
           +A   G +    S+   S   S         VP S+DWR+KGAVT++KDQG CG+CW+FS
Sbjct: 90  KASRLGLSVSASSLIMASKGQS---LGGNAKVPDSVDWRKKGAVTNVKDQGSCGACWSFS 146

Query: 160 AVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGLATEADYP 218
           A  A+EGI QI  G LI LSEQ+L+DC    N GC+GGLMD AFE++I+N G+ TE DYP
Sbjct: 147 ATGAMEGINQIVTGDLISLSEQELIDCDKSYNAGCNGGLMDYAFEFVIKNHGIDTEKDYP 206

Query: 219 YRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYK-- 276
           Y+  +GTC   K K    TI  Y  +   DE+AL +AV+ QPVSV +  S RAF  Y   
Sbjct: 207 YQERDGTCKKDKLKQKVVTIDSYAGVKSNDEKALREAVAAQPVSVGICGSERAFQLYSRV 266

Query: 277 SGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRDA----G 332
           SG+ +  C  + DH V +VG+G+   +NG  YW++KNSWG++WG  G++ + R+     G
Sbjct: 267 SGIFSGPCSTSLDHAVLIVGYGS---QNGVDYWIVKNSWGKSWGMDGFMHMQRNTGNSEG 323

Query: 333 LCGIATAASYPV 344
           +CGI   ASYP+
Sbjct: 324 ICGINMLASYPI 335


>gi|386648112|gb|AFJ15103.1| mexicain-like cystein protease, partial [Jacaratia mexicana]
          Length = 348

 Score =  284 bits (726), Expect = 4e-74,   Method: Compositional matrix adjust.
 Identities = 146/350 (41%), Positives = 209/350 (59%), Gaps = 19/350 (5%)

Query: 2   VLKFEKSFIIPMFVIIILVITCASQVVSGRSMHEPSIVEK----HEQWMAQHGRTYKDEL 57
           +    K   +   +I+ + ++ A   + G S  + +  E+     E WM +H R Y +  
Sbjct: 4   ICSISKLIFVATCLIVHVGLSSADFSIVGYSQDDLTSTERLIRLFESWMLKHDRVYNNIE 63

Query: 58  EKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQS 117
           EK  R  IFK NL YI++ NK+ N +Y LG NEF DLT++EF+  Y G +     V+ + 
Sbjct: 64  EKIHRFEIFKDNLMYIDETNKK-NNSYWLGLNEFVDLTHDEFKEKYVG-SIGEDFVTIEQ 121

Query: 118 SRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIE 177
           S    F Y++V D P SIDWR+KGAVT +K    CGSCWAFS VA VEGI +I  GKLI 
Sbjct: 122 SNDEEFPYKHVVDYPESIDWRDKGAVTPVKPN-PCGSCWAFSTVATVEGINKIVTGKLIS 180

Query: 178 LSEQQLVDCSTDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAAT 237
           LSEQ+L+DC   +HGC GG    + +Y+++N G+ TE +YPY  ++G C  +++K     
Sbjct: 181 LSEQELLDCDRRSHGCKGGYQTTSLQYVVDN-GVHTEKEYPYEKKQGKCRAKEKKGTKVQ 239

Query: 238 ISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGF 297
           I+ Y+ +P  DE +L+QA++NQPVSV +++ GRAF  YK G+ N  CG   DH V  +G+
Sbjct: 240 ITGYKRVPANDEISLIQAIANQPVSVLLESKGRAFQLYKGGIFNGPCGTKLDHAVTAIGY 299

Query: 298 GTAEEENGAKYWLIKNSWGETWGESGYIRILRDA----GLCGIATAASYP 343
           G         Y LIKNSWG  WGE GY++I R +    G CG+  ++ +P
Sbjct: 300 GKT-------YILIKNSWGPNWGEKGYLKIKRASGKSEGTCGVYKSSYFP 342


>gi|90265242|emb|CAH67695.1| H0624F09.3 [Oryza sativa Indica Group]
          Length = 494

 Score =  284 bits (726), Expect = 4e-74,   Method: Compositional matrix adjust.
 Identities = 142/296 (47%), Positives = 188/296 (63%), Gaps = 14/296 (4%)

Query: 58  EKAMRLNIFKQNLEYIEKANKEGNRT--YKLGTNEFSDLTNEEFRALYTGYNRPVPSVSR 115
           E   R  +F  NL++++  N   +    ++LG N F+DLTN EFRA Y G         R
Sbjct: 84  EHERRFRVFWDNLKFVDAHNARADERGGFRLGMNRFADLTNGEFRATYLG----TTPAGR 139

Query: 116 QSSRPSTFKYQNVTDVPTSIDWREKGAVTH-IKDQGQCGSCWAFSAVAAVEGITQITRGK 174
                  +++  V  +P S+DWR+KGAV   +K+QGQCGSCWAFSAVAAVEGI +I  G+
Sbjct: 140 GRRVGEAYRHDGVEALPDSVDWRDKGAVVAPVKNQGQCGSCWAFSAVAAVEGINKIVTGE 199

Query: 175 LIELSEQQLVDCSTD--NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEK 232
           L+ LSEQ+LV+C+ +  N GC+GG+MD AF +I  N GL TE DYPY   +G C+  K  
Sbjct: 200 LVSLSEQELVECARNGQNSGCNGGIMDDAFAFIARNGGLDTEEDYPYTAMDGKCNLAKRS 259

Query: 233 AVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGV 292
               +I  +ED+P+ DE +L +AV++QPVSV +DA GR F  Y SGV    CG N DHGV
Sbjct: 260 RKVVSIDGFEDVPENDELSLQKAVAHQPVSVAIDAGGREFQLYDSGVFTGRCGTNLDHGV 319

Query: 293 AVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD----AGLCGIATAASYPV 344
             VG+GT +   GA YW ++NSWG  WGE+GYIR+ R+     G CGIA  ASYP+
Sbjct: 320 VAVGYGT-DAATGAAYWTVRNSWGPDWGENGYIRMERNVTARTGKCGIAMMASYPI 374


>gi|194352762|emb|CAQ00109.1| papain-like cysteine proteinase [Hordeum vulgare subsp. vulgare]
 gi|326517250|dbj|BAJ99991.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 367

 Score =  284 bits (726), Expect = 4e-74,   Method: Compositional matrix adjust.
 Identities = 159/318 (50%), Positives = 199/318 (62%), Gaps = 13/318 (4%)

Query: 35  EPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDL 94
           E S+   +E+W  QH    +D  EKA R N+F++N+  I + N+ G+  YKL  N F D+
Sbjct: 40  EDSLWALYERWREQH-TVARDLGEKARRFNVFRENVRLIHEFNR-GDAPYKLRLNRFGDM 97

Query: 95  TNEEFRALYTGYNRPVPSVSRQSSRPSTFKY---QNVTDVPTSIDWREKGAVTHIKDQGQ 151
           T +EFR  Y         +         F +    +V DVP S+DWR+KGAVT +KDQGQ
Sbjct: 98  TADEFRRAYASSRVSHHRMFSLKEGGGGFMHGSAASVRDVPPSVDWRQKGAVTAVKDQGQ 157

Query: 152 CGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST-DNHGCSGGLMDKAFEYIIENKG 210
           CGSCWAFS +AAVEGI  I    L  LSEQQLVDC T  N GC+GGLMD AF+YI ++ G
Sbjct: 158 CGSCWAFSTIAAVEGINAIRSKNLTSLSEQQLVDCDTKSNAGCNGGLMDYAFQYIAKHGG 217

Query: 211 LATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGR 270
           +A E  YPY+  + +  N+K  AV  TI  YED+P  DE AL +AV+ QPV+V ++ASG 
Sbjct: 218 VAAEDAYPYKARQASSCNKKPSAV-VTIDGYEDVPANDETALKKAVAAQPVAVAIEASGS 276

Query: 271 AFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD 330
            F FY  GV    CG   DHGVA VG+GT  +  G KYW++KNSWG  WGE GYIR+ RD
Sbjct: 277 HFQFYSEGVFAGKCGTELDHGVAAVGYGTTVD--GTKYWIVKNSWGPEWGEKGYIRMKRD 334

Query: 331 A----GLCGIATAASYPV 344
                GLCGIA  ASYPV
Sbjct: 335 VKDKEGLCGIAMEASYPV 352


>gi|162459488|ref|NP_001105571.1| maize insect resistance1 precursor [Zea mays]
 gi|5731354|gb|AAB70820.2| cysteine protease Mir1 [Zea mays]
          Length = 398

 Score =  284 bits (726), Expect = 5e-74,   Method: Compositional matrix adjust.
 Identities = 147/345 (42%), Positives = 205/345 (59%), Gaps = 28/345 (8%)

Query: 22  TCASQVVSGRSMHEPSIVEKHEQWMAQHGR--------------TYKDELEKAMRLNIFK 67
           T  ++V +     +  +   +E W ++HGR                ++E ++ +RL +F+
Sbjct: 34  TTTTRVPAPAERADEEVRRMYEAWKSKHGRGGSSNDDCDMAPGDDEQEEEDRRLRLEVFR 93

Query: 68  QNLEYIEKANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFK 124
            NL YI+  N E   G  T++LG   F+DLT EE+R    G+         +     + +
Sbjct: 94  DNLRYIDAHNAEADAGLHTFRLGLTPFADLTLEEYRGRVLGFRARGRRSGARYGSGYSVR 153

Query: 125 YQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLV 184
                D+P +IDWR+ GAVT +KDQ QCG CWAFSAVAA+EG+  I  G L+ LSEQ+++
Sbjct: 154 G---GDLPDAIDWRQLGAVTEVKDQQQCGGCWAFSAVAAIEGVNAIATGNLVSLSEQEII 210

Query: 185 DCSTDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEK-AVAATISKYED 243
           DC   + GC GG M+ AF ++I N G+ TEADYP+   +GTCD  KEK    ATI    +
Sbjct: 211 DCDAQDSGCDGGQMENAFRFVIGNGGIDTEADYPFIGTDGTCDASKEKNEKVATIDGLVE 270

Query: 244 LPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEE 303
           +   +E AL +AV+ QPVSV +DASGRAF  Y SG+ N  CG + DHGV  VG+G+   E
Sbjct: 271 VASNNETALQEAVAIQPVSVAIDASGRAFQHYSSGIFNGPCGTSLDHGVTAVGYGS---E 327

Query: 304 NGAKYWLIKNSWGETWGESGYIRILRD----AGLCGIATAASYPV 344
           +G  YW++KNSW  +WGE+GYIR+ R+     G CGIA  ASYPV
Sbjct: 328 SGKDYWIVKNSWSASWGEAGYIRMRRNVPRPTGKCGIAMDASYPV 372


>gi|357124027|ref|XP_003563708.1| PREDICTED: germination-specific cysteine protease 1-like
           [Brachypodium distachyon]
          Length = 334

 Score =  284 bits (726), Expect = 5e-74,   Method: Compositional matrix adjust.
 Identities = 153/330 (46%), Positives = 205/330 (62%), Gaps = 27/330 (8%)

Query: 35  EPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKE----GNRTYKLGTNE 90
           + ++ E++E+WMA+ GRTYKD  EKA R  +FK N  +I+  N      G    KL TN+
Sbjct: 13  DKAMRERYEKWMAEQGRTYKDSTEKARRFEVFKSNAHFIDSHNAATGPGGKSRPKLTTNK 72

Query: 91  FSDLTNEEFRALY-TGYN---RPVPSVSRQSSRPSTFKYQNVT--DVPTSIDWREKGAVT 144
           F+DLT +EFR +Y TG+    RP   V+      + FK+  V+  DVP SIDWR +GAVT
Sbjct: 73  FADLTEDEFRNIYVTGHRVNYRPTSLVT-----DTVFKFGAVSLSDVPPSIDWRARGAVT 127

Query: 145 HIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCS-TDNHGCSGGLMDKAFE 203
            +KDQ  C  CWAFS+ AAVEGI QIT G  + LS QQLVDCS   N  C  G +DKA+E
Sbjct: 128 SVKDQHLCACCWAFSSAAAVEGIHQITTGNQVSLSVQQLVDCSNAANEKCKAGEIDKAYE 187

Query: 204 YIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSV 263
           YI  + GL  + DYPY    GTC    ++AV A IS ++ +P  +E ALL AV++QPVSV
Sbjct: 188 YIARSGGLVADQDYPYEGHSGTCRVYGKQAV-ARISGFQYVPARNETALLLAVAHQPVSV 246

Query: 264 CVDASGRAFHFYKSGVLNA---DCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWG 320
            +D   RA     +G+  +    C  N +H + +VG+GT  +E+G +YWL+KNSWG  WG
Sbjct: 247 ALDGLSRALQHIGTGIFGSAGEPCTTNLNHAMTIVGYGT--DEHGTRYWLMKNSWGSDWG 304

Query: 321 ESGYIRILRDA-----GLCGIATAASYPVA 345
           + GY++  RD      G+CG+A  ASYPVA
Sbjct: 305 DKGYVKFARDVASEINGVCGLALEASYPVA 334


>gi|390337642|ref|XP_780653.3| PREDICTED: cathepsin L-like [Strongylocentrotus purpuratus]
          Length = 333

 Score =  284 bits (726), Expect = 5e-74,   Method: Compositional matrix adjust.
 Identities = 154/338 (45%), Positives = 214/338 (63%), Gaps = 15/338 (4%)

Query: 14  FVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYI 73
           ++ ++LV  C   VVS  SM      E   +W  +HG+ Y  + E+A R  I+++NL+ +
Sbjct: 3   YLSVLLVAAC---VVSSLSMSFTDFDEDWNEWKNEHGKRYLSDEEEASRRLIWQKNLDIV 59

Query: 74  EKANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTD 130
            K N +   G+ TY LG N+F+DL NEEF A+ TG+   V   S+ +   +     NV +
Sbjct: 60  IKHNLKYDLGHFTYDLGINQFTDLQNEEFVAMMTGFR--VSGTSKAAKGSTFLPPNNVGE 117

Query: 131 VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTDN 190
           +P ++DWR KG VT +KDQGQCGSCWAFS   +VEG      GKL+ LSEQ LVDCS  +
Sbjct: 118 LPKTVDWRTKGYVTPVKDQGQCGSCWAFSTTGSVEGQHFKATGKLVSLSEQNLVDCSGRD 177

Query: 191 HGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQ 250
            GC GG MD+AF+YII+  G+ TEA YPY+  +G C + K+  V AT++ Y D+  G E+
Sbjct: 178 AGCDGGFMDRAFQYIIDAGGIDTEASYPYKAVDGKC-HFKKANVGATVTGYTDVTSGSEK 236

Query: 251 ALLQAVSN-QPVSVCVDASGRAFHFYKSGVLN-ADCGNNC-DHGVAVVGFGTAEEENGAK 307
           AL +AV++  P+SV +DAS  +F  YKSGV N   C +   DHGV  VG+GT+ +  G  
Sbjct: 237 ALQKAVAHVGPISVAIDASHMSFQHYKSGVYNEPGCDSTVLDHGVLAVGYGTSSD--GTD 294

Query: 308 YWLIKNSWGETWGESGYIRILRDA-GLCGIATAASYPV 344
           YW++KNSW ETWG +GY+ + R+    CGIAT ASYP+
Sbjct: 295 YWIVKNSWAETWGMNGYVWMSRNKDNQCGIATNASYPL 332


>gi|359359120|gb|AEV41026.1| putative cysteine protease [Oryza minuta]
          Length = 464

 Score =  283 bits (725), Expect = 7e-74,   Method: Compositional matrix adjust.
 Identities = 145/316 (45%), Positives = 201/316 (63%), Gaps = 18/316 (5%)

Query: 42  HEQWMAQH---GRTYKDEL-EKAMRLNIFKQNLEYIEKANKEGNR--TYKLGTNEFSDLT 95
           ++ W+A+H   G ++   + E   R  +F  NL++++  N   +    ++LG N F+DLT
Sbjct: 66  YDLWVARHRHGGGSHNGFVGEYERRFRVFWDNLKFVDAHNAHADEHGGFRLGMNRFADLT 125

Query: 96  NEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAV-THIKDQGQCGS 154
           N+EFRA Y G         R       +++  V  +P S+DWR+KGAV + +K+QGQCGS
Sbjct: 126 NDEFRAAYLG----TTPAGRGRHVGEMYRHDGVEALPDSVDWRDKGAVVSPVKNQGQCGS 181

Query: 155 CWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD--NHGCSGGLMDKAFEYIIENKGLA 212
           CWAFSAVAAVEGI +I  G+L+ LSEQ+LV+C+ +  N GC+GG+MD AF +I  N GL 
Sbjct: 182 CWAFSAVAAVEGINKIVTGELVSLSEQELVECARNRGNSGCNGGIMDDAFAFITRNGGLD 241

Query: 213 TEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAF 272
           TE DYPY   +G CD  K+     +I  +ED+P+ DE +L +AV++QPVSV +DA GR F
Sbjct: 242 TEEDYPYTAMDGKCDLAKKSRKVVSIDGFEDVPENDELSLQKAVAHQPVSVAIDAGGREF 301

Query: 273 HFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD-- 330
             Y SGV    CG + DHGV  VG+GT +   G  YW ++NSWG  WGE+GYIR+ R+  
Sbjct: 302 QLYDSGVFTGRCGTSLDHGVVAVGYGT-DAATGTDYWTVRNSWGPDWGENGYIRMERNVT 360

Query: 331 --AGLCGIATAASYPV 344
              G CGIA  ASYP+
Sbjct: 361 ARTGKCGIAMMASYPI 376


>gi|359359168|gb|AEV41073.1| putative cysteine protease [Oryza minuta]
          Length = 499

 Score =  283 bits (724), Expect = 8e-74,   Method: Compositional matrix adjust.
 Identities = 144/316 (45%), Positives = 200/316 (63%), Gaps = 18/316 (5%)

Query: 42  HEQWMAQH---GRTYKDEL-EKAMRLNIFKQNLEYIEKANKEGNR--TYKLGTNEFSDLT 95
           ++ W+A+H   G ++   + E   R  +F  NL++++  N   +    ++LG N F+DLT
Sbjct: 65  YDLWVARHRHGGGSHNGLVGEYERRFRVFWDNLKFVDAHNARADEHGGFRLGMNRFADLT 124

Query: 96  NEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTH-IKDQGQCGS 154
           N+EFRA Y G         R       +++  V  +P S+DWR+KGAV   +K+QGQCGS
Sbjct: 125 NDEFRAAYLG----TTPAGRGRHVGEAYRHDGVEALPDSVDWRDKGAVVAPVKNQGQCGS 180

Query: 155 CWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD--NHGCSGGLMDKAFEYIIENKGLA 212
           CWAFSAVAAVEGI +I  G+L+ LSEQ+LV+C+ +  N GC+GG+MD AF +I  N GL 
Sbjct: 181 CWAFSAVAAVEGINKIVTGELVSLSEQELVECARNGANSGCNGGMMDDAFAFIARNGGLD 240

Query: 213 TEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAF 272
           TE DYPY   +G C+  K+     +I  +ED+P+ DE +L +AV++QPVSV +DA GR F
Sbjct: 241 TEEDYPYTAMDGKCNLAKKSRKVVSIDGFEDVPENDELSLQKAVAHQPVSVAIDAGGREF 300

Query: 273 HFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD-- 330
             Y SGV    CG + DHGV  VG+GT +   G  YW ++NSWG  WGE+GYIR+ R+  
Sbjct: 301 QLYDSGVFTGRCGTSLDHGVVAVGYGT-DAATGTDYWTVRNSWGPDWGENGYIRMERNVT 359

Query: 331 --AGLCGIATAASYPV 344
              G CGIA  ASYP+
Sbjct: 360 ARTGKCGIAMMASYPI 375


>gi|225458143|ref|XP_002280937.1| PREDICTED: cysteine proteinase RD21a [Vitis vinifera]
 gi|302142569|emb|CBI19772.3| unnamed protein product [Vitis vinifera]
          Length = 436

 Score =  283 bits (724), Expect = 9e-74,   Method: Compositional matrix adjust.
 Identities = 144/336 (42%), Positives = 198/336 (58%), Gaps = 18/336 (5%)

Query: 13  MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEY 72
           ++ + IL++   S V    S       +  E W  Q+G+TY  E EKA RL +F++N  +
Sbjct: 5   LWAVSILILAVHSSVSEASST-----ADLFEAWCEQYGKTYSSEEEKASRLKVFEENHAF 59

Query: 73  IEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVP 132
           + + N   N +Y L  N F+DLT+ EF+A   G+     S  R  S  S         VP
Sbjct: 60  VTQHNSMANASYTLALNAFADLTHHEFKASRLGF-----SPGRAQSIRSVGTPVQELHVP 114

Query: 133 TSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NH 191
            ++DWR+ GAVT +KDQG CG CW+FS   A+EGI +I  G L+ LSEQ+LVDC    N 
Sbjct: 115 PAVDWRKSGAVTGVKDQGNCGGCWSFSTTGAIEGINKIVTGSLVSLSEQELVDCDRSYNS 174

Query: 192 GCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQA 251
           GC GGLMD A++++I+N+G+ +EADYPY   +  C+ +K K    TI  Y D+P  DE+ 
Sbjct: 175 GCEGGLMDYAYQFVIKNQGIDSEADYPYVGMDKPCNKEKLKKHIVTIDGYTDIPPNDEKQ 234

Query: 252 LLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLI 311
           LLQ V+ QPVSV +  S + F  Y  GV    C +  DH V +VG+GT   E+G  +W++
Sbjct: 235 LLQVVAKQPVSVGICGSEKTFQLYSKGVYTGPCSSTLDHAVLIVGYGT---EDGVDFWIV 291

Query: 312 KNSWGETWGESGYIRILRD----AGLCGIATAASYP 343
           KNSWGE WG  GYI +LR+     G+CGI   ASYP
Sbjct: 292 KNSWGEHWGMRGYIHMLRNNGTAEGICGINMLASYP 327


>gi|359359215|gb|AEV41119.1| putative cysteine protease [Oryza officinalis]
          Length = 499

 Score =  283 bits (723), Expect = 1e-73,   Method: Compositional matrix adjust.
 Identities = 144/316 (45%), Positives = 200/316 (63%), Gaps = 18/316 (5%)

Query: 42  HEQWMAQH---GRTYKDEL-EKAMRLNIFKQNLEYIEKANKEGNR--TYKLGTNEFSDLT 95
           ++ W+A+H   G ++   + E   R  +F  NL++++  N   +    ++LG N F+DLT
Sbjct: 65  YDLWVARHRHGGDSHNGLVGEYERRFRVFWDNLKFVDAHNARADEHGGFRLGMNRFADLT 124

Query: 96  NEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTH-IKDQGQCGS 154
           N+EFRA Y G         R       +++  V  +P S+DWR+KGAV   +K+QGQCGS
Sbjct: 125 NDEFRAAYLG----TTPAGRGRHVGEAYRHDGVEVLPDSVDWRDKGAVVAPVKNQGQCGS 180

Query: 155 CWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD--NHGCSGGLMDKAFEYIIENKGLA 212
           CWAFSAVAAVEGI +I  G+L+ LSEQ+LV+C+ +  N GC+GG+MD AF +I  N GL 
Sbjct: 181 CWAFSAVAAVEGINKIVTGELVSLSEQELVECARNGANSGCNGGMMDDAFAFIARNGGLD 240

Query: 213 TEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAF 272
           TE DYPY   +G C+  K+     +I  +ED+P+ DE +L +AV++QPVSV +DA GR F
Sbjct: 241 TEEDYPYTAMDGKCNLAKKSRKVVSIDGFEDVPENDELSLQKAVAHQPVSVAIDAGGREF 300

Query: 273 HFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD-- 330
             Y SGV    CG + DHGV  VG+GT +   G  YW ++NSWG  WGE+GYIR+ R+  
Sbjct: 301 QLYDSGVFTGRCGTSLDHGVVAVGYGT-DAATGTDYWTVRNSWGPDWGENGYIRMERNVT 359

Query: 331 --AGLCGIATAASYPV 344
              G CGIA  ASYP+
Sbjct: 360 ARTGKCGIAMMASYPI 375


>gi|297733654|emb|CBI14901.3| unnamed protein product [Vitis vinifera]
          Length = 273

 Score =  282 bits (722), Expect = 1e-73,   Method: Compositional matrix adjust.
 Identities = 139/257 (54%), Positives = 172/257 (66%), Gaps = 8/257 (3%)

Query: 94  LTNEEFRALYTGYNRPVPSVSRQSSRPS-TFKYQNVTDVPTSIDWREKGAVTHIKDQGQC 152
           +TN EFR+ Y G       + R S   + +F Y+ V  VP S+DWR+KGAVT IKDQGQC
Sbjct: 1   MTNHEFRSTYAGSKVNHHRMFRGSQHAAGSFMYEKVKSVPPSVDWRKKGAVTPIKDQGQC 60

Query: 153 GSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST-DNHGCSGGLMDKAFEYIIENKGL 211
           GSCWAFS V AVEGI  I   KL+ LSEQ+LVDC T +N GC+GGLM  AFE+I E  G+
Sbjct: 61  GSCWAFSTVVAVEGINHIKTNKLVSLSEQELVDCDTSENQGCNGGLMGYAFEFIKEKGGI 120

Query: 212 ATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRA 271
            TE  YPY  E+GTCD  K  +   +I  +E +P  +E ALL+A +NQP+SV +DA G A
Sbjct: 121 TTEQSYPYTAEDGTCDVSKVNSPVVSIDGHETVPPNNEDALLKAAANQPISVAIDAGGSA 180

Query: 272 FHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILR-- 329
           F FY  GV    CG + DHGVA+VG+GT  +  G KYW++KNSWG  WGE+GYIR+ R  
Sbjct: 181 FQFYSEGVFAGRCGTDLDHGVAIVGYGTTLD--GTKYWIVKNSWGTDWGENGYIRMKRGI 238

Query: 330 --DAGLCGIATAASYPV 344
               GLCGIA  ASYP+
Sbjct: 239 SAKEGLCGIAVEASYPI 255


>gi|390368662|ref|XP_780781.2| PREDICTED: cathepsin L-like [Strongylocentrotus purpuratus]
          Length = 333

 Score =  282 bits (721), Expect = 2e-73,   Method: Compositional matrix adjust.
 Identities = 154/338 (45%), Positives = 211/338 (62%), Gaps = 15/338 (4%)

Query: 14  FVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYI 73
           ++ ++LV  C   VVS  SM      E   QW  +HG+ Y  + E+A R  I+++NL+ +
Sbjct: 3   YLSVLLVAVC---VVSSLSMSFTDFDEDWNQWKNEHGKRYLSDEEEASRKLIWEKNLDIV 59

Query: 74  EKANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTD 130
            K N +   G+ TY LG N+F+DL NEEF A+ TG+   V   S+ +   +     NV  
Sbjct: 60  IKHNLKYDLGHFTYALGMNQFADLQNEEFVAMMTGFR--VNGTSKAAKGSTFLPSNNVDK 117

Query: 131 VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTDN 190
           +P ++DWR KG VT +KDQGQCGSCWAFSA  ++EG      GKL+ LSEQ LVDCS  N
Sbjct: 118 LPKTVDWRTKGYVTPVKDQGQCGSCWAFSATGSLEGQQFKKTGKLVSLSEQNLVDCSYRN 177

Query: 191 HGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQ 250
           +GC GG MD+AF+YII+  G+ TEA Y YR  +G C  +K   V AT++ Y D+  G E+
Sbjct: 178 YGCHGGFMDRAFQYIIDAGGIDTEATYSYRAVDGNCHFKKAN-VGATVTGYTDVTSGSEK 236

Query: 251 ALLQAVSN-QPVSVCVDASGRAFHFYKSGVLN-ADCGNN-CDHGVAVVGFGTAEEENGAK 307
           AL +AV++  P+SV +DAS + F FYKSGV N   C      H V VVG+GT  +  G  
Sbjct: 237 ALQKAVAHIGPISVAIDASHKFFKFYKSGVYNEPGCSTTRLGHAVLVVGYGTTSD--GTD 294

Query: 308 YWLIKNSWGETWGESGYIRILRDA-GLCGIATAASYPV 344
           YW++KNSW +TWG +GY+ + R+    CGIA+ ASYP+
Sbjct: 295 YWIVKNSWAKTWGMNGYLWMSRNKDNQCGIASEASYPM 332


>gi|357446979|ref|XP_003593765.1| Cysteine proteinase [Medicago truncatula]
 gi|355482813|gb|AES64016.1| Cysteine proteinase [Medicago truncatula]
          Length = 364

 Score =  282 bits (721), Expect = 2e-73,   Method: Compositional matrix adjust.
 Identities = 152/293 (51%), Positives = 194/293 (66%), Gaps = 11/293 (3%)

Query: 56  ELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSR 115
           ELEK  R  IFK NLEYIE  N  GN++YKLG N++SDLT++EF A +TG  +    +S 
Sbjct: 78  ELEKRKR--IFKNNLEYIENFNNAGNKSYKLGLNQYSDLTSDEFLASHTGL-KVSKQLSS 134

Query: 116 QSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKL 175
              R +   +    DVPT+ DWR++GAVT +KDQG CG CWAFS VAAVEG  +I  G+L
Sbjct: 135 SKMRSAAVPFNLNDDVPTNFDWRQQGAVTDVKDQGSCGCCWAFSVVAAVEGAVKINTGEL 194

Query: 176 IELSEQQLVDCSTDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVA 235
           I LSEQQLVDC   N GC GG MD AF+YII+ KG+ +EADYPY+    TC    +    
Sbjct: 195 ISLSEQQLVDCDERNSGCHGGNMDSAFKYIIQ-KGIVSEADYPYQEGSQTCQLNDQMKFE 253

Query: 236 ATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVV 295
           A I+ + D+P  DEQ LLQAV+ QPVSV ++  G  F  Y   V +  CG + +H V  V
Sbjct: 254 AQITNFIDVPANDEQQLLQAVAQQPVSVGIEV-GDEFQHYMGDVYSGTCGQSMNHAVTAV 312

Query: 296 GFGTAEEENGAKYWLIKNSWGETWGESGYIRILRDA----GLCGIATAASYPV 344
           G+G +E+  G KYWLIKNSWG+ WGE GY+++LR++    G CGIA  ASYP+
Sbjct: 313 GYGVSED--GTKYWLIKNSWGKGWGEEGYMKLLRESGEPGGQCGIAAHASYPI 363


>gi|356517398|ref|XP_003527374.1| PREDICTED: LOW QUALITY PROTEIN: vignain-like [Glycine max]
          Length = 333

 Score =  281 bits (720), Expect = 2e-73,   Method: Compositional matrix adjust.
 Identities = 157/343 (45%), Positives = 211/343 (61%), Gaps = 32/343 (9%)

Query: 14  FVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYI 73
           F +++ +   A QV   R++ + S+ E+HEQ M ++ + YKD  E       F  N+ YI
Sbjct: 12  FAMLLCMAFLAFQVTC-RTLQDASMYERHEQRMTRYSKVYKDPPES------FXGNVNYI 64

Query: 74  EKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPT 133
           E  N   ++ YK G N+F        R  + G+      +     R +TFK++NVT  P+
Sbjct: 65  EACNNAADKPYKXGINQFPP------RNRFKGH------MCSSIIRITTFKFENVTATPS 112

Query: 134 SIDWREKGAVTH--IKDQGQCGSCWAFSAVAAVEGITQITRGKLIELS-EQQLVDCSTD- 189
           ++D R+KGAVT   +KDQGQCG  WA SAVAA EGI  +  GKLI LS E +LVDC T  
Sbjct: 113 TVDCRQKGAVTPYTVKDQGQCGCFWALSAVAATEGIHALXAGKLILLSXEPELVDCDTKG 172

Query: 190 -NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCD-NQKEKAVAATISKYEDLPKG 247
            + GC GGL D AF++II+N GL TEA+YPY+  +G C+ N+ +K  A  I+ Y+D+P  
Sbjct: 173 VDQGCEGGLTDDAFKFIIQNHGLNTEANYPYKGVDGKCNANEADKNAATIITGYDDVPAN 232

Query: 248 DEQALLQ-AVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGA 306
           +E+A LQ AV+N PVSV +DASG  F FYKSGV    CG   DHGV  VG+G +++  G 
Sbjct: 233 NEKAHLQKAVANNPVSVAIDASGSDFQFYKSGVFTGSCGTELDHGVTAVGYGVSDD--GT 290

Query: 307 KYWLIKNSWGETWGESGYIRILR----DAGLCGIATAASYPVA 345
           +YWL+KNS G  WGE GYIR+ R    +  LCGIA  ASYP A
Sbjct: 291 EYWLVKNSRGPEWGEEGYIRMQRGVDSEEALCGIAVQASYPSA 333


>gi|944916|gb|AAA74430.1| cysteine proteinase [Mesembryanthemum crystallinum]
          Length = 367

 Score =  281 bits (719), Expect = 4e-73,   Method: Compositional matrix adjust.
 Identities = 150/318 (47%), Positives = 204/318 (64%), Gaps = 20/318 (6%)

Query: 35  EPSIVEKHEQWMAQH--GRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFS 92
           + ++ + +E+W + +   R++    EK  R ++FK+N++YI + NK  ++ YKL  N+F 
Sbjct: 37  DETLWDLYERWRSVYTSARSFG---EKQNRFHVFKENVKYINEVNKM-DKPYKLRLNQFG 92

Query: 93  DLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQC 152
           DLT  EF   Y   N  +   +R  S    F Y+NV +VP SIDWR KGAVT +K+QG+C
Sbjct: 93  DLTPSEFARTYA--NSKIIEGTRNES--GGFMYENV-EVPRSIDWRVKGAVTPVKNQGRC 147

Query: 153 GSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTDNHGCSGGLMDKAFEYIIENKGLA 212
           G CWAFSA AAVEGI QIT G+LI LSEQQL+DC T N GC GG M +AFEYI +  G+ 
Sbjct: 148 GGCWAFSAAAAVEGINQITTGQLISLSEQQLIDCDTQNSGCRGGTMGRAFEYIKQRGGIT 207

Query: 213 TEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDA---SG 269
           +EA+YPY+ + G C N   +    +I  Y ++ +  E A+L+ +++QPVSV VDA   S 
Sbjct: 208 SEANYPYKAQAGMCKNNLIQRPTVSIDGYYNIRR-SEDAVLKILAHQPVSVAVDATTWSS 266

Query: 270 RAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILR 329
             + FY  GV    CG   +HGV  VG+GT  +  G  YW+IKNSWGETWGE GY+R+LR
Sbjct: 267 LDWMFYFQGVFTGPCGTKLNHGVTAVGYGTTND--GYDYWIIKNSWGETWGERGYMRMLR 324

Query: 330 DA---GLCGIATAASYPV 344
                GLCGIA  AS+P+
Sbjct: 325 GVSPYGLCGIAMQASFPI 342


>gi|440793751|gb|ELR14926.1| Cysteine proteinase 5, putative [Acanthamoeba castellanii str.
           Neff]
          Length = 326

 Score =  281 bits (718), Expect = 4e-73,   Method: Compositional matrix adjust.
 Identities = 150/338 (44%), Positives = 200/338 (59%), Gaps = 17/338 (5%)

Query: 13  MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEY 72
           M    +L +  A  V S  ++    +      WM +H ++Y +E E   R N++++N  Y
Sbjct: 1   MRTTTLLALCVALFVASTFAVSHDPLTGVFADWMQEHQKSYANE-EFVYRWNVWRENYLY 59

Query: 73  IEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVP 132
           IE  N + N+++ L  N+F DLTN EF  L+ G +       ++S             +P
Sbjct: 60  IEAHNHQ-NKSFHLAMNKFGDLTNAEFNKLFKGLSITADQAKQESDIAP------APGLP 112

Query: 133 TSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD--N 190
              DWR+KGAVTH+K+QGQCGSCW+FS   + EG   +  G+L  LSEQ LVDCST   N
Sbjct: 113 ADFDWRQKGAVTHVKNQGQCGSCWSFSTTGSTEGANFLKHGRLTSLSEQNLVDCSTSYGN 172

Query: 191 HGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQ 250
           HGC+GGLMD AFEYII NKG+ TE  YPY   +GTC   K+ +    +S Y ++P G+E 
Sbjct: 173 HGCNGGLMDYAFEYIIRNKGIDTEESYPYHASQGTCRYNKQHSGGELVS-YTNVPSGNEG 231

Query: 251 ALLQAVSNQPVSVCVDASGRAFHFYKSGVLN--ADCGNNCDHGVAVVGFGTAEEENGAKY 308
           ALL AV+ QP SV +DAS  +F FYK GV +  A   +  DHGV  VG+G     +G  Y
Sbjct: 232 ALLNAVATQPTSVAIDASHSSFQFYKGGVYDEPACSSSRLDHGVLAVGWGV---RDGKDY 288

Query: 309 WLIKNSWGETWGESGYIRILRDA-GLCGIATAASYPVA 345
           WL+KNSWG  WG SGYI + R+    CGIATAAS+P A
Sbjct: 289 WLVKNSWGADWGLSGYIEMSRNKHNQCGIATAASHPHA 326


>gi|390337645|ref|XP_001199228.2| PREDICTED: cathepsin L-like [Strongylocentrotus purpuratus]
          Length = 333

 Score =  280 bits (717), Expect = 5e-73,   Method: Compositional matrix adjust.
 Identities = 153/338 (45%), Positives = 214/338 (63%), Gaps = 15/338 (4%)

Query: 14  FVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYI 73
           ++ ++LV  C   VVS  SM      E  ++W  +HG+ Y  + E+A R  I+++NL+ +
Sbjct: 3   YLSVLLVAVC---VVSSLSMSFTDFDEDWKEWKNEHGKRYLSDEEEASRRLIWQKNLDIV 59

Query: 74  EKANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTD 130
            + N +   G+ TY LG N+F+DL N+EF A+ TG+   V   S+ +   +     NV  
Sbjct: 60  IRHNLKYDLGHFTYDLGMNQFADLQNKEFVAMMTGFR--VNGTSKAAKGSTFLPPNNVGK 117

Query: 131 VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTDN 190
           +P ++DWR KG VT +KDQGQCGSCWAFSA  ++EG      GKL+ LSEQ LVDCS  N
Sbjct: 118 LPKTVDWRTKGYVTPVKDQGQCGSCWAFSATGSLEGQHFKKTGKLVSLSEQNLVDCSDKN 177

Query: 191 HGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQ 250
           +GC+GGLMD+AF+YII+  G+ TE  YPY   +G C + K   V AT++ Y D+  G E+
Sbjct: 178 YGCNGGLMDRAFQYIIDAGGIDTEESYPYIAMDGNC-HFKTANVGATVTGYTDVTSGSEK 236

Query: 251 ALLQAVSN-QPVSVCVDASGRAFHFYKSGVLN-ADCGNN-CDHGVAVVGFGTAEEENGAK 307
           AL +AV++  P+SV +DAS  +F  Y+SGV N   C +   DHGV  VG+GT  +  G  
Sbjct: 237 ALQKAVAHIGPISVAIDASHFSFQLYQSGVYNEPGCSSTLLDHGVLAVGYGTTID--GTD 294

Query: 308 YWLIKNSWGETWGESGYIRILRDA-GLCGIATAASYPV 344
           YW++KNSW ETWG +GYI + R+    CGIAT ASYP+
Sbjct: 295 YWIVKNSWAETWGMNGYIWMSRNKDNQCGIATQASYPL 332


>gi|413943290|gb|AFW75939.1| maize insect resistance1 [Zea mays]
          Length = 435

 Score =  280 bits (717), Expect = 6e-73,   Method: Compositional matrix adjust.
 Identities = 144/326 (44%), Positives = 197/326 (60%), Gaps = 26/326 (7%)

Query: 42  HEQWMAQHGR-------------TYKDELEKAMRLNIFKQNLEYIEKANKE---GNRTYK 85
           +E W ++HGR               + E ++ +RL +F+ NL YI+K N E   G  T++
Sbjct: 84  YEAWKSKHGRGGSSNDDCDMAPGDDEQEEDRRLRLEVFRDNLRYIDKHNAEADAGLHTFR 143

Query: 86  LGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTD--VPTSIDWREKGAV 143
           LG   F+DLT +E+R    G+         +      ++ +      +P +IDWR+ GAV
Sbjct: 144 LGLTPFADLTLDEYRGRVLGFRARARRSGARYGHGHGYRARPRGGDLLPDAIDWRQLGAV 203

Query: 144 THIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTDNHGCSGGLMDKAFE 203
           T +KDQ QCG CWAFSAVAA+EGI  I  G L+ LSEQ+++DC   + GC GG M+ AF 
Sbjct: 204 TEVKDQQQCGGCWAFSAVAAIEGINAIATGNLVSLSEQEIIDCDAQDSGCDGGQMENAFR 263

Query: 204 YIIENKGLATEADYPYRHEEGTCDNQKE-KAVAATISKYEDLPKGDEQALLQAVSNQPVS 262
           ++I N G+ TEADYP+   +GTCD  KE     ATI    ++   +E AL +AV+ QPVS
Sbjct: 264 FVIGNGGIDTEADYPFIGTDGTCDASKENNEKVATIDGLVEVASNNETALQEAVAIQPVS 323

Query: 263 VCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGES 322
           V +DASGRAF  Y SG+ N  CG + DHGV  VG+G+   E+G  YW++KNSW  +WGE+
Sbjct: 324 VAIDASGRAFQHYSSGIFNGPCGTSLDHGVTAVGYGS---ESGKDYWIVKNSWSASWGEA 380

Query: 323 GYIRILRD----AGLCGIATAASYPV 344
           GYIR+ R+     G CGIA  ASYPV
Sbjct: 381 GYIRMRRNVPRPTGKCGIAMDASYPV 406


>gi|147769019|emb|CAN62459.1| hypothetical protein VITISV_015168 [Vitis vinifera]
          Length = 246

 Score =  280 bits (716), Expect = 8e-73,   Method: Compositional matrix adjust.
 Identities = 144/271 (53%), Positives = 182/271 (67%), Gaps = 32/271 (11%)

Query: 81  NRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREK 140
           +++YKL  NEF+DLTNEEF    T  NR    +   S+  ++FKY+NVT VP++ DWR+K
Sbjct: 2   DKSYKLSINEFADLTNEEFG---TSRNRFKAHIC--STEATSFKYENVTAVPSTXDWRKK 56

Query: 141 GAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST--DNHGCSGGLM 198
           GAVT IKDQGQCGSCWAFSAVAA+EGITQ++ GKLI LSEQ+LVDC T  ++ GC G   
Sbjct: 57  GAVTPIKDQGQCGSCWAFSAVAAMEGITQLSTGKLISLSEQELVDCDTSGEDQGCXG--- 113

Query: 199 DKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSN 258
                           A+YPY   +GTC+ +K    AA I+ YED+P  +E+AL +AV++
Sbjct: 114 ----------------ANYPYAGTDGTCNRKKAAHPAAKINGYEDVPANNEKALQKAVAH 157

Query: 259 QPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGET 318
           QP++V +DA G  F FY SGV    CG   DHGV  VG+GT+++  G KYWL+KNSWG  
Sbjct: 158 QPIAVAIDAGGXEFQFYSSGVFTGQCGTELDHGVXAVGYGTSDD--GMKYWLVKNSWGTG 215

Query: 319 WGESGYIRILRDA----GLCGIATAASYPVA 345
           WGE GYIR+ RD     GLCGIA  ASYP A
Sbjct: 216 WGEEGYIRMQRDVTAKEGLCGIAMQASYPTA 246


>gi|224116884|ref|XP_002317418.1| predicted protein [Populus trichocarpa]
 gi|222860483|gb|EEE98030.1| predicted protein [Populus trichocarpa]
          Length = 503

 Score =  280 bits (715), Expect = 8e-73,   Method: Compositional matrix adjust.
 Identities = 148/335 (44%), Positives = 202/335 (60%), Gaps = 18/335 (5%)

Query: 19  LVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYI-EKAN 77
           +V+   S++VS     E SI+E  +QW  +H + Y+   E   R   FK+NL+YI EKA 
Sbjct: 32  IVVNDFSELVS-----EESIIEIFQQWRDRHQKVYEHAAESEKRYRNFKRNLKYIIEKAG 86

Query: 78  KE-GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSID 136
           K+     + +G N+F+DL+NEEF+ LY    +   ++ R ++R    +     D P+S+D
Sbjct: 87  KKTAALGHSVGLNKFADLSNEEFKELYLSKVKKPINIKRSTARDWRQRNLQTCDAPSSLD 146

Query: 137 WREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTDNHGCSGG 196
           WR+KG VT +KDQG CGSCW+FS   A+EGI  I  G LI LSEQ+LVDC T N+GC GG
Sbjct: 147 WRKKGVVTAVKDQGDCGSCWSFSTTGAIEGINAIVTGDLISLSEQELVDCDTTNYGCEGG 206

Query: 197 LMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAV 256
            MD AFE++I N G+ TEA+YPY   +GTC+  KE+    +I  Y D+ + D  ALL A 
Sbjct: 207 YMDYAFEWVINNGGIDTEANYPYTGVDGTCNTTKEEIKVVSIDGYTDVDETD-SALLCAT 265

Query: 257 SNQPVSVCVDASGRAFHFYKSGVLNADCG---NNCDHGVAVVGFGTAEEENGAKYWLIKN 313
             QP+SV +D S   F  Y  G+ + DC    N+ DH V +VG+G+   ENG  YW++KN
Sbjct: 266 VQQPISVGMDGSALDFQLYTGGIYDGDCSDDPNDIDHAVLIVGYGS---ENGEDYWIVKN 322

Query: 314 SWGETWGESGYIRILRDA----GLCGIATAASYPV 344
           SWG  WG  GY  I R+     G+C I   ASYP 
Sbjct: 323 SWGTEWGMEGYFYIKRNTDLPYGVCAINAEASYPT 357


>gi|72008176|ref|XP_780713.1| PREDICTED: cathepsin L-like [Strongylocentrotus purpuratus]
          Length = 335

 Score =  279 bits (713), Expect = 2e-72,   Method: Compositional matrix adjust.
 Identities = 153/340 (45%), Positives = 212/340 (62%), Gaps = 17/340 (5%)

Query: 14  FVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYI 73
           ++ ++LV  C   VVS  SM      E   QW  +HG+ Y  + E+A R  I+++NL+ +
Sbjct: 3   YLSVLLVAAC---VVSSLSMSFTDFDEDWNQWKNEHGKRYLSDEEEASRKLIWEKNLDIV 59

Query: 74  EKANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTD 130
            K N +   G+ TY LG N+F+DL NEEF A+ TG+   V   S+ +   +     N+ +
Sbjct: 60  IKHNLKYDLGHFTYALGMNQFADLKNEEFVAMMTGFR--VNGTSKAAKGSTFLPSNNIGE 117

Query: 131 VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD- 189
           +P ++DWR KG VT +KDQGQCGSCWAFS   ++EG      GKL+ LSEQ LVDCS   
Sbjct: 118 LPKTVDWRTKGYVTPVKDQGQCGSCWAFSTTGSLEGQHFKATGKLVSLSEQNLVDCSGKE 177

Query: 190 -NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGD 248
            N GC GGLMD+AF+YII+  G+ TE  YPY+  +G C + K+  + AT++ Y D+    
Sbjct: 178 GNEGCDGGLMDQAFQYIIKAGGIDTEESYPYKAVDGEC-HFKKANIGATVTGYTDVTSDS 236

Query: 249 EQALLQAVSN-QPVSVCVDASGRAFHFYKSGVLN-ADCGNN-CDHGVAVVGFGTAEEENG 305
           E AL +AV++  P+SV +DAS  +F  YKSGV N  DC +   DHGV  VG+GT  +  G
Sbjct: 237 ETALQKAVAHIGPISVAIDASHMSFQLYKSGVYNEPDCSSTLLDHGVLAVGYGTTSD--G 294

Query: 306 AKYWLIKNSWGETWGESGYIRILRDA-GLCGIATAASYPV 344
             YW++KNSW ETWG +GY+ + R+    CGIAT ASYP+
Sbjct: 295 TDYWIVKNSWAETWGMNGYLWMSRNKDNQCGIATQASYPL 334


>gi|356542171|ref|XP_003539543.1| PREDICTED: LOW QUALITY PROTEIN: KDEL-tailed cysteine endopeptidase
           CEP2-like [Glycine max]
          Length = 342

 Score =  278 bits (712), Expect = 2e-72,   Method: Compositional matrix adjust.
 Identities = 151/342 (44%), Positives = 204/342 (59%), Gaps = 20/342 (5%)

Query: 10  IIPMFVIIILVITCASQVVSGRSMHEPSIVEK--HEQWMAQHGRTYKDELEKAMRLNIFK 67
           II + V+  L IT ++           S V +  +E W+ ++G+ Y+++ E   R  I++
Sbjct: 10  IINLLVLCNLWITASACPAKHNDNSSDSEVMRMRYESWLKKYGQKYRNKDEWEFRFEIYR 69

Query: 68  QNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQN 127
            N+++IE  N + N +YKL  N+F DLTNEEFR +Y  Y +P      +S   + F YQ 
Sbjct: 70  ANVQFIEVYNSQ-NYSYKLMDNKFVDLTNEEFRRMYLVY-QP------RSHLQTRFMYQK 121

Query: 128 VTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCS 187
             D+P  IDWR +GAVT IKDQG CGSCW+FSAVA VE I +I  GKL+ LSEQQL+DC 
Sbjct: 122 HGDLPKRIDWRTRGAVTXIKDQGHCGSCWSFSAVATVEDINKIKTGKLVSLSEQQLIDCD 181

Query: 188 --TDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLP 245
               N GC+GG M+  F +I +  GL T+ +YPY+  +G  +  K +  A  I  YE+LP
Sbjct: 182 NRNGNEGCNGGHME-TFTFITKRGGLTTDKNYPYQGSDGDXNKAKVRNHAVAICGYENLP 240

Query: 246 KGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENG 305
             +E  L  AV++QP SV  DA G AF  Y  G  +  CG + +H + +VG+G   EENG
Sbjct: 241 AHNENMLKAAVAHQPASVATDAGGYAFQLYSKGTFSGSCGKDLNHRMTIVGYG---EENG 297

Query: 306 AKYWLIKNSWGETWGESGYIRILRDA----GLCGIATAASYP 343
            KYWL+KNSW    G SGYIR+ RD     G CG A  ASYP
Sbjct: 298 EKYWLVKNSWANDXGVSGYIRMKRDPKDKDGTCGTAMEASYP 339


>gi|18396952|ref|NP_564322.1| cysteine proteinase-like protein [Arabidopsis thaliana]
 gi|332192922|gb|AEE31043.1| cysteine proteinase-like protein [Arabidopsis thaliana]
          Length = 334

 Score =  278 bits (712), Expect = 2e-72,   Method: Compositional matrix adjust.
 Identities = 145/329 (44%), Positives = 207/329 (62%), Gaps = 25/329 (7%)

Query: 25  SQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTY 84
           SQ     +++E SIV+ H+QWM Q  R YKDE EK MRL +FK+NL++IE  N  GN++Y
Sbjct: 21  SQARPHVTLNEQSIVDYHQQWMTQFSRVYKDESEKEMRLKVFKKNLKFIENFNNMGNQSY 80

Query: 85  KLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPT---SIDWREKG 141
            LG NEF+D   EEF A +TG    V S+S   ++    +  N++D+     S DWR++G
Sbjct: 81  TLGVNEFTDWKTEEFLATHTGLRVNVTSLSELFNKTKPSRNWNMSDIDMEDESKDWRDEG 140

Query: 142 AVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDK 200
           AVT +K QG C              +T+I+   L+ LSEQQL+DC  + N GC+GG  ++
Sbjct: 141 AVTPVKYQGAC-------------RLTKISGKNLLTLSEQQLIDCDIEKNGGCNGGEFEE 187

Query: 201 AFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQP 260
           AF+YII+N G++ E +YPY+ ++ +C     +A    I  ++ +P  +E+ALL+AV  QP
Sbjct: 188 AFKYIIKNGGVSLETEYPYQVKKESCRANARRAPHTQIRGFQMVPSHNERALLEAVRRQP 247

Query: 261 VSVCVDASGRAFHFYKSGVL-NADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETW 319
           VSV +DA   +F  YK GV    DCG + +H V +VG+GT    +G  YW++KNSWGE+W
Sbjct: 248 VSVLIDARADSFGHYKGGVYAGLDCGTDVNHAVTIVGYGTM---SGLNYWVLKNSWGESW 304

Query: 320 GESGYIRILRDA----GLCGIATAASYPV 344
           GE+GY+RI RD     G+CGIA  A+YPV
Sbjct: 305 GENGYMRIRRDVEWPQGMCGIAQVAAYPV 333


>gi|238007404|gb|ACR34737.1| unknown [Zea mays]
 gi|413943289|gb|AFW75938.1| cysteine proteinase Mir2 [Zea mays]
          Length = 484

 Score =  278 bits (711), Expect = 3e-72,   Method: Compositional matrix adjust.
 Identities = 148/325 (45%), Positives = 200/325 (61%), Gaps = 29/325 (8%)

Query: 42  HEQWMAQH----------GRTYKDELEKAMRLNIFKQNLEYIEKANKE---GNRTYKLGT 88
           +E+W ++H          G     E + A RL +F+ NL YI+  N E   G   ++LG 
Sbjct: 53  YEEWRSEHDAGPRRGATGGSLGPGEDDDARRLEVFRYNLRYIDAHNAEADAGLHGFRLGL 112

Query: 89  NEFSDLTNEEFRA--LYTGYNRPVPSVSRQSSRPSTFKYQNVT--DVPTSIDWREKGAVT 144
             F+DLT EE+RA  L     R   +V    SR    +Y  +    +P ++DWRE+GAV 
Sbjct: 113 TRFADLTLEEYRARLLLGSRGRNGTAVGVVGSR----RYLPLAGEQLPDAVDWRERGAVA 168

Query: 145 HIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST-DNHGCSGGLMDKAFE 203
            +KDQGQCG+CWAFSAVAAVEGI +I  G LI LSEQ+L+DC    + GC GGLMD AF 
Sbjct: 169 EVKDQGQCGACWAFSAVAAVEGINKIVTGSLISLSEQELIDCDKFQDQGCDGGLMDNAFV 228

Query: 204 YIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSV 263
           ++I+N G+ TEADYP+   +GTCD + +     +I  +E +P   E+AL +AV++QPVS 
Sbjct: 229 FMIKNGGIDTEADYPFTGHDGTCDLKLKNTRVVSIDSFERVPINYERALQKAVAHQPVSA 288

Query: 264 CVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESG 323
            ++AS RAF  Y SG+ +  CG   DHGV VVG+G+   E G  YW++KNSWG  WGE+G
Sbjct: 289 SIEASRRAFQLYSSGIFDGRCGTYLDHGVTVVGYGS---EGGKDYWIVKNSWGTQWGEAG 345

Query: 324 YIRILRD----AGLCGIATAASYPV 344
           Y+R+ R+    AG CGIA    YPV
Sbjct: 346 YVRMARNVRVRAGKCGIAMEPLYPV 370


>gi|113120269|gb|ABI30274.1| VS-B, partial [Vasconcellea stipulata]
          Length = 341

 Score =  278 bits (711), Expect = 3e-72,   Method: Compositional matrix adjust.
 Identities = 144/329 (43%), Positives = 197/329 (59%), Gaps = 23/329 (6%)

Query: 17  IILVITCASQ---------VVSGRSMHEPSIVEK----HEQWMAQHGRTYKDELEKAMRL 63
           +I V+TC S           + G S  + + +E      E WM +H + YK   EK  R 
Sbjct: 10  LIFVVTCLSLHLGLSSADFSIVGYSQDDLTSIESSIRLFESWMLKHDKVYKTIDEKIYRF 69

Query: 64  NIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTF 123
             FK NL YI++ NK+ N +Y LG NEF+DLT++EF+  Y G + P  S+  + S    F
Sbjct: 70  ETFKDNLMYIDETNKK-NNSYWLGLNEFADLTHDEFKEKYVG-SIPEDSMIIEQSDDVEF 127

Query: 124 KYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQL 183
             ++V D P SIDWR+KGAVT +K+Q  CGSCWAFS VA VEGI +I  G LI LSEQ+L
Sbjct: 128 PNKHVVDYPESIDWRQKGAVTPVKNQNPCGSCWAFSTVATVEGINKIVTGNLISLSEQEL 187

Query: 184 VDCSTDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYED 243
           +DC   +HGC GG    + +Y+++N G+ TE +YPY  ++G C  + +K +   I+ Y+ 
Sbjct: 188 LDCDRRSHGCKGGYQTTSLKYVVDN-GVHTEKEYPYEKKQGNCRAKNKKGLKVYINGYKR 246

Query: 244 LPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEE 303
           +P  DE +L++ +S QPVSV V++ GR F FYK GV    CG   DH V  VG+      
Sbjct: 247 VPSNDEISLIKTISIQPVSVLVESKGRPFQFYKGGVFGGPCGTKLDHAVTAVGY------ 300

Query: 304 NGAKYWLIKNSWGETWGESGYIRILRDAG 332
            G  Y LIKNSWG  WG+ GYI+I R +G
Sbjct: 301 -GKDYILIKNSWGPKWGDKGYIKIKRASG 328


>gi|357130490|ref|XP_003566881.1| PREDICTED: actinidain-like [Brachypodium distachyon]
          Length = 350

 Score =  278 bits (711), Expect = 3e-72,   Method: Compositional matrix adjust.
 Identities = 144/328 (43%), Positives = 191/328 (58%), Gaps = 11/328 (3%)

Query: 16  IIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEK 75
           +++LV T     V+ +     ++  +HEQWMA+ GR Y D  EKA R  +F  N  Y++ 
Sbjct: 14  LLVLVATAVFHAVAAQGEAGLTVAARHEQWMAKFGRVYTDANEKARRQAVFGANARYVDA 73

Query: 76  ANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSI 135
            N+ GNRTY LG NEFSDLT+ EF   + GY    P  +   S+     Y    ++P S 
Sbjct: 74  VNRAGNRTYTLGLNEFSDLTDNEFAKTHLGYREFRPETA-NISKGVDPGYGLAGNIPKSF 132

Query: 136 DWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTDNHGCSG 195
           DWR KGAVT +K QG CG CWAF+AVAA EG+ +I +G LI +SEQQ++DC+T N+ C G
Sbjct: 133 DWRTKGAVTEVKSQGGCGCCWAFAAVAATEGLVKIAKGTLISMSEQQVLDCTTGNNTCKG 192

Query: 196 GLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLP-KGDEQALLQ 254
           G M+ A  Y+  + GL TE DY Y  E+G C        A ++   E +P  G+E  L +
Sbjct: 193 GYMNDALSYVFASGGLQTEEDYEYNAEKGACRRDVTPNPATSVGHAEYMPLDGNEFLLQK 252

Query: 255 AVSNQPVSVCVDASGRAFHFYKSGVLNA--DCGNNCDHGVAVVGFGTAEEENGAK--YWL 310
            V+ QPV V V+A G  F  Y  GV      CG N DH   VVG+G A+   G K  YWL
Sbjct: 253 LVARQPVVVAVEAYGTDFKNYGGGVFTGSPSCGQNLDHFFTVVGYGFAD---GGKQMYWL 309

Query: 311 IKNSWGETWGESGYIRILRDAGL--CGI 336
           +KN WG +WGESGY+RI R +    CG+
Sbjct: 310 VKNQWGTSWGESGYMRIARGSSARNCGM 337


>gi|53791858|dbj|BAD53944.1| putative cysteine protease [Oryza sativa Japonica Group]
          Length = 335

 Score =  278 bits (710), Expect = 3e-72,   Method: Compositional matrix adjust.
 Identities = 148/343 (43%), Positives = 207/343 (60%), Gaps = 20/343 (5%)

Query: 13  MFVIIILVITC-ASQVVSGRSMH-----EPSIVEKHEQWMAQHGRTYKDELEKAMRLNIF 66
           M  I++LV T  A Q ++  + +     +   ++  E+WMA+ G+TYK   EK  R  IF
Sbjct: 1   MTSIVLLVCTLMALQAMAASAYYNNGSDDGVTMQMFEEWMAKFGKTYKCHGEKEHRFGIF 60

Query: 67  KQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQ 126
           + N+ +I     +      +G N+F+DLTN+EF A YTG   P P   +++ RP    + 
Sbjct: 61  RDNVHFIRGYKPQVTYDSAVGINQFADLTNDEFVATYTGAKPPHP---KEAPRPVDPIW- 116

Query: 127 NVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDC 186
                P  IDWR +GAVT +KDQG CGSCWAF+AVAA+EG+T+I  G+L  LSEQ+LVDC
Sbjct: 117 ----TPCCIDWRFRGAVTGVKDQGACGSCWAFAAVAAIEGLTKIRTGQLTPLSEQELVDC 172

Query: 187 STDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCD-NQKEKAVAATISKYEDLP 245
            T+++GC GG  D+AFE +    G+  E+DY Y   +G C  +      AA+I  Y  +P
Sbjct: 173 DTNSNGCGGGHTDRAFELVASKGGITAESDYRYEGFQGKCRVDDMLFNHAASIGGYRAVP 232

Query: 246 KGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENG 305
             DE+ L  AV+ QPV+V +DASG AF FYKSGV    CG + +H V +VG+   +  +G
Sbjct: 233 PNDERQLATAVARQPVTVYIDASGPAFQFYKSGVFPGPCGASSNHAVTLVGY-CQDGASG 291

Query: 306 AKYWLIKNSWGETWGESGYIRILRDA----GLCGIATAASYPV 344
            KYWL KNSWG+TWG+ GYI + +D     G CG+A +  YP 
Sbjct: 292 KKYWLAKNSWGKTWGQQGYILLEKDIVQPHGTCGLAVSPFYPT 334


>gi|356509992|ref|XP_003523725.1| PREDICTED: oryzain alpha chain-like [Glycine max]
          Length = 439

 Score =  278 bits (710), Expect = 4e-72,   Method: Compositional matrix adjust.
 Identities = 138/315 (43%), Positives = 197/315 (62%), Gaps = 18/315 (5%)

Query: 40  EKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGN-----RTYKLGTNEFSDL 94
           E  E+W  +H +TY  E EK  RL +F+ N  ++ + N+  N      +Y L  N F+DL
Sbjct: 31  ELFEKWCKEHSKTYSSEEEKLYRLKVFEDNYAFVAQHNQNANNNNNNSSYTLSLNAFADL 90

Query: 95  TNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGS 154
           T+ EF+    G    +P    +  RP   + +++  +P+ IDWR+ GAVT +KDQ  CG+
Sbjct: 91  THHEFKTTRLG----LPLTLLRFKRPQNQQSRDLLHIPSQIDWRQSGAVTPVKDQASCGA 146

Query: 155 CWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGLAT 213
           CWAFSA  A+EGI +I  G L+ LSEQ+L+DC T  N GC GGLMD A++++I+NKG+ T
Sbjct: 147 CWAFSATGAIEGINKIVTGSLVSLSEQELIDCDTSYNSGCGGGLMDFAYQFVIDNKGIDT 206

Query: 214 EADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFH 273
           E DYPY+  + +C   K K  A TI  Y D+P  +E+ +L+AV++QPVSV +  S R F 
Sbjct: 207 EDDYPYQARQRSCSKDKLKRRAVTIEDYVDVPPSEEE-ILKAVASQPVSVGICGSEREFQ 265

Query: 274 FYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRDA-- 331
            Y  G+    C    DH V +VG+G+   ENG  YW++KNSWG+ WG +GYI ++R++  
Sbjct: 266 LYSKGIFTGPCSTFLDHAVLIVGYGS---ENGVDYWIVKNSWGKYWGMNGYIHMIRNSGN 322

Query: 332 --GLCGIATAASYPV 344
             G+CGI T ASYPV
Sbjct: 323 SKGICGINTLASYPV 337


>gi|218202077|gb|EEC84504.1| hypothetical protein OsI_31195 [Oryza sativa Indica Group]
          Length = 362

 Score =  277 bits (709), Expect = 5e-72,   Method: Compositional matrix adjust.
 Identities = 139/316 (43%), Positives = 189/316 (59%), Gaps = 14/316 (4%)

Query: 38  IVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNE 97
           ++++   W   H R+Y    E   R +++++N E+I+  N  G+ TY+L  NEF+DLT E
Sbjct: 47  MMDRFRAWQGAHNRSYPSAEEALQRFDVYRRNAEFIDAVNLRGDLTYRLAENEFADLTEE 106

Query: 98  EFRALYTGY---NRPVPS---VSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQ-G 150
           EF A YTGY   + PV      +      ++F Y+   DVP S+DWR +GAV   K Q  
Sbjct: 107 EFLATYTGYYAGDGPVDDSVITTGAGDVDASFSYR--VDVPASVDWRAQGAVVPPKSQTS 164

Query: 151 QCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTDNHGCSGGLMDKAFEYIIENKG 210
            C SCWAF   A +E +  I  GKL+ LSEQQLVDC + + GC+ G   +A+++++EN G
Sbjct: 165 TCSSCWAFVTAATIESLNMIKTGKLVSLSEQQLVDCDSYDGGCNLGSYGRAYKWVVENGG 224

Query: 211 LATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGR 270
           L TEADYPY    G C+  K    AA I+ +  +P  +E AL  AV+ QPV+V ++  G 
Sbjct: 225 LTTEADYPYTARRGPCNRAKSAHHAAKITGFGKVPPRNEAALQAAVARQPVAVAIEV-GS 283

Query: 271 AFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD 330
              FYK GV    CG    H V VVG+GT +  +GAKYW IKNSWG++WGE GYIRILRD
Sbjct: 284 GMQFYKGGVYTGPCGTRLAHAVTVVGYGT-DASSGAKYWTIKNSWGQSWGERGYIRILRD 342

Query: 331 A---GLCGIATAASYP 343
               GLCG+    +YP
Sbjct: 343 VGGPGLCGVTLDIAYP 358


>gi|9502426|gb|AAF88125.1|AC021043_18 Putative cysteine proteinase [Arabidopsis thaliana]
          Length = 365

 Score =  277 bits (709), Expect = 5e-72,   Method: Compositional matrix adjust.
 Identities = 148/347 (42%), Positives = 213/347 (61%), Gaps = 30/347 (8%)

Query: 25  SQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTY 84
           SQ     +++E SIV+ H+QWM Q  R YKDE EK MRL +FK+NL++IE  N  GN++Y
Sbjct: 21  SQARPHVTLNEQSIVDYHQQWMTQFSRVYKDESEKEMRLKVFKKNLKFIENFNNMGNQSY 80

Query: 85  KLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPT---SIDWREKG 141
            LG NEF+D   EEF A +TG    V S+S   ++    +  N++D+     S DWR++G
Sbjct: 81  TLGVNEFTDWKTEEFLATHTGLRVNVTSLSELFNKTKPSRNWNMSDIDMEDESKDWRDEG 140

Query: 142 AVTHIKDQGQCGSCWA------------FSAVAAV------EGITQITRGKLIELSEQQL 183
           AVT +K QG C                 ++ +  V      EG+T+I+   L+ LSEQQL
Sbjct: 141 AVTPVKYQGACPEFPTKQIRRNSLVGKQYTKLLGVLSDWGDEGLTKISGKNLLTLSEQQL 200

Query: 184 VDCSTD-NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYE 242
           +DC  + N GC+GG  ++AF+YII+N G++ E +YPY+ ++ +C     +A    I  ++
Sbjct: 201 IDCDIEKNGGCNGGEFEEAFKYIIKNGGVSLETEYPYQVKKESCRANARRAPHTQIRGFQ 260

Query: 243 DLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVL-NADCGNNCDHGVAVVGFGTAE 301
            +P  +E+ALL+AV  QPVSV +DA   +F  YK GV    DCG + +H V +VG+GT  
Sbjct: 261 MVPSHNERALLEAVRRQPVSVLIDARADSFGHYKGGVYAGLDCGTDVNHAVTIVGYGTM- 319

Query: 302 EENGAKYWLIKNSWGETWGESGYIRILRDA----GLCGIATAASYPV 344
             +G  YW++KNSWGE+WGE+GY+RI RD     G+CGIA  A+YPV
Sbjct: 320 --SGLNYWVLKNSWGESWGENGYMRIRRDVEWPQGMCGIAQVAAYPV 364


>gi|356517384|ref|XP_003527367.1| PREDICTED: LOW QUALITY PROTEIN: vignain-like [Glycine max]
          Length = 332

 Score =  277 bits (708), Expect = 5e-72,   Method: Compositional matrix adjust.
 Identities = 154/341 (45%), Positives = 210/341 (61%), Gaps = 29/341 (8%)

Query: 14  FVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYI 73
           F +++ +   A QV   R++ + S+ E H Q M ++ +  KD  +      +FK+N+ YI
Sbjct: 12  FAMLLSMAFLAFQVTC-RTLQDASMYESHGQRMTRYSKVDKDPPDX-----VFKENVNYI 65

Query: 74  EKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPT 133
           E  N   ++ YK   N+F+       +  + G+      +     R +TFK++NVT  P+
Sbjct: 66  EACNNAADKPYKRDINQFAP------KKRFKGH------MCSSIIRITTFKFENVTATPS 113

Query: 134 SIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIEL-SEQQLVDCSTD--N 190
           ++D R+K AVT IKDQGQCG  WA SAVAA EGI  +  GKLI L SEQ+LVDC T   +
Sbjct: 114 TVDCRQKVAVTPIKDQGQCGCFWALSAVAATEGIHALXAGKLILLSSEQELVDCDTKGVD 173

Query: 191 HGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDN-QKEKAVAATISKYEDLPKGDE 249
             C GGLMD AF++II+N GL TEA+YPY+  +G C+  + +K  A  I+ YED+P  +E
Sbjct: 174 QDCQGGLMDDAFKFIIQNHGLNTEANYPYKGVDGKCNAYEADKNAATIITGYEDVPANNE 233

Query: 250 QALLQ-AVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKY 308
           +A LQ AV+N PVSV +DASG  F FYKSGV    CG   DHGV  VG+G +++  G +Y
Sbjct: 234 KAHLQKAVANNPVSVAIDASGSDFQFYKSGVFTGSCGTELDHGVTAVGYGVSDD--GTEY 291

Query: 309 WLIKNSWGETWGESGYIRILR----DAGLCGIATAASYPVA 345
           WL+KNS G  WGE GYIR+ R    +  LCGIA  ASYP A
Sbjct: 292 WLVKNSRGTEWGEEGYIRMQRGVDSEEALCGIAVQASYPSA 332


>gi|226505708|ref|NP_001141813.1| uncharacterized protein LOC100273952 precursor [Zea mays]
 gi|194706024|gb|ACF87096.1| unknown [Zea mays]
 gi|413945958|gb|AFW78607.1| hypothetical protein ZEAMMB73_489507 [Zea mays]
          Length = 460

 Score =  277 bits (708), Expect = 5e-72,   Method: Compositional matrix adjust.
 Identities = 139/327 (42%), Positives = 191/327 (58%), Gaps = 22/327 (6%)

Query: 36  PSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNR------------- 82
           P+I  + + W A+HG+ Y    E+A RL +F  N  ++   N                  
Sbjct: 30  PAIEAQFDAWCAEHGKAYATPEERAARLAVFADNAAFVAAHNARAGANAAGGGGGGAAPP 89

Query: 83  TYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGA 142
           +Y L  N F+DLT+EEFRA   G   P  ++ R  + P  +       VP ++DWR+ GA
Sbjct: 90  SYTLALNAFADLTHEEFRAARLGRIAPGAAL-RSRAAPVYWGLGGGAAVPDALDWRKSGA 148

Query: 143 VTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKA 201
           VT +KDQG CG+CW+FSA  A+EGI +I  G L+ LSEQ+L+DC    N GC GGLMD A
Sbjct: 149 VTKVKDQGSCGACWSFSATGAMEGINKIKTGSLVSLSEQELIDCDRSYNSGCGGGLMDYA 208

Query: 202 FEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPV 261
           ++++I+N G+ TE DYPYR  +GTC+  K K    TI  Y D+P   E  LLQAV+ QPV
Sbjct: 209 YKFVIKNGGIDTEEDYPYREADGTCNKNKLKKRVVTIDGYTDVPSNKEDLLLQAVAQQPV 268

Query: 262 SVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGE 321
           SV +  S RAF  Y  G+ +  C  + DH V +VG+G+   E G  YW++KNSWGE+WG 
Sbjct: 269 SVGICGSARAFQLYYQGIFDGPCPTSLDHAVLIVGYGS---EGGKDYWIVKNSWGESWGM 325

Query: 322 SGYIRILRDA----GLCGIATAASYPV 344
            GY+ + R+     G+CGI   AS+P 
Sbjct: 326 KGYMHMHRNTGDSKGVCGINMMASFPT 352


>gi|47169030|pdb|1S4V|A Chain A, The 2.0 A Crystal Structure Of The Kdel-Tailed Cysteine
           Endopeptidase Functioning In Programmed Cell Death Of
           Ricinus Communis Endosperm
 gi|47169031|pdb|1S4V|B Chain B, The 2.0 A Crystal Structure Of The Kdel-Tailed Cysteine
           Endopeptidase Functioning In Programmed Cell Death Of
           Ricinus Communis Endosperm
          Length = 229

 Score =  277 bits (708), Expect = 7e-72,   Method: Compositional matrix adjust.
 Identities = 132/219 (60%), Positives = 158/219 (72%), Gaps = 7/219 (3%)

Query: 131 VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD- 189
           VP S+DWR+KGAVT +KDQGQCGSCWAFS + AVEGI QI   KL+ LSEQ+LVDC TD 
Sbjct: 2   VPASVDWRKKGAVTSVKDQGQCGSCWAFSTIVAVEGINQIKTNKLVSLSEQELVDCDTDQ 61

Query: 190 NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDE 249
           N GC+GGLMD AFE+I +  G+ TEA+YPY   +GTCD  KE A A +I  +E++P+ DE
Sbjct: 62  NQGCNGGLMDYAFEFIKQRGGITTEANYPYEAYDGTCDVSKENAPAVSIDGHENVPENDE 121

Query: 250 QALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYW 309
            ALL+AV+NQPVSV +DA G  F FY  GV    CG   DHGVA+VG+GT  +  G KYW
Sbjct: 122 NALLKAVANQPVSVAIDAGGSDFQFYSEGVFTGSCGTELDHGVAIVGYGTTID--GTKYW 179

Query: 310 LIKNSWGETWGESGYIRILR----DAGLCGIATAASYPV 344
            +KNSWG  WGE GYIR+ R      GLCGIA  ASYP+
Sbjct: 180 TVKNSWGPEWGEKGYIRMERGISDKEGLCGIAMEASYPI 218


>gi|115478933|ref|NP_001063060.1| Os09g0381400 [Oryza sativa Japonica Group]
 gi|113631293|dbj|BAF24974.1| Os09g0381400 [Oryza sativa Japonica Group]
 gi|215678649|dbj|BAG92304.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|218202075|gb|EEC84502.1| hypothetical protein OsI_31193 [Oryza sativa Indica Group]
          Length = 362

 Score =  276 bits (707), Expect = 8e-72,   Method: Compositional matrix adjust.
 Identities = 139/316 (43%), Positives = 189/316 (59%), Gaps = 14/316 (4%)

Query: 38  IVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNE 97
           ++++   W   H R+Y    E   R +++++N E+I+  N  G+ TY+L  NEF+DLT E
Sbjct: 47  MMDRFRAWQGAHNRSYPSAEEALQRFDVYRRNAEFIDAVNLRGDLTYQLAENEFADLTEE 106

Query: 98  EFRALYTGY---NRPVPS---VSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQ-G 150
           EF A YTGY   + PV      +      ++F Y+   DVP S+DWR +GAV   K Q  
Sbjct: 107 EFLATYTGYYAGDGPVDDSVITTGAGDVDASFSYR--VDVPASVDWRAQGAVVPPKSQTS 164

Query: 151 QCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTDNHGCSGGLMDKAFEYIIENKG 210
            C SCWAF   A +E +  I  GKL+ LSEQQLVDC + + GC+ G   +A+++++EN G
Sbjct: 165 TCSSCWAFVTAATIESLNMIKTGKLVSLSEQQLVDCDSYDGGCNLGSYGRAYKWVVENGG 224

Query: 211 LATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGR 270
           L TEADYPY    G C+  K    AA I+ +  +P  +E AL  AV+ QPV+V ++  G 
Sbjct: 225 LTTEADYPYTARRGPCNRAKSAHHAAKITGFGKVPPRNEAALQAAVARQPVAVAIEV-GS 283

Query: 271 AFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD 330
              FYK GV    CG    H V VVG+GT +  +GAKYW IKNSWG++WGE GYIRILRD
Sbjct: 284 GMQFYKGGVYTGPCGTRLAHAVTVVGYGT-DASSGAKYWTIKNSWGQSWGERGYIRILRD 342

Query: 331 A---GLCGIATAASYP 343
               GLCG+    +YP
Sbjct: 343 VGGPGLCGVTLDIAYP 358


>gi|395535909|ref|XP_003769963.1| PREDICTED: cathepsin S [Sarcophilus harrisii]
          Length = 347

 Score =  276 bits (707), Expect = 8e-72,   Method: Compositional matrix adjust.
 Identities = 149/344 (43%), Positives = 213/344 (61%), Gaps = 21/344 (6%)

Query: 10  IIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQN 69
           ++ M V+I + + CAS     R  H+P +    E W   +G+ Y+++ ++  R  I+++N
Sbjct: 13  LLRMKVVIWMFLACASTTAYLR--HDPMLDNHWELWKKTYGKQYEEQNQEVTRRLIWEKN 70

Query: 70  LEYIEKANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQ 126
           L+++   N E   G  +Y L  N  SD+T+EE  +L +    P      Q SR +T++  
Sbjct: 71  LKFVTLHNLEHSMGLHSYDLSMNHLSDMTSEEVASLMSSLRIP-----NQWSRNTTYRLN 125

Query: 127 NVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDC 186
           +   +P S+DWR+KG VT +K QG CGSCWAFSAV A+E   ++  GKL+ LS Q LVDC
Sbjct: 126 SNQKLPDSVDWRDKGCVTEVKYQGTCGSCWAFSAVGALEAQLKLKTGKLVSLSAQNLVDC 185

Query: 187 STD----NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYE 242
           ST+    NHGC+GG M +AF+YII+N G+ ++A YPY+ ++G C        AAT S+Y 
Sbjct: 186 STNEKYENHGCNGGCMTEAFQYIIDNNGIDSDASYPYKAKDGKCQYNPANR-AATCSRYT 244

Query: 243 DLPKGDEQALLQAVSNQ-PVSVCVDASGRAFHFYKSGV-LNADCGNNCDHGVAVVGFGTA 300
           +LP G E AL +AV+N+ PVSV +DAS  +F  YKSGV  +  C  N +HGV V G+G  
Sbjct: 245 ELPYGSEDALKEAVANKGPVSVGIDASLPSFFLYKSGVYYDPSCTQNVNHGVLVTGYGNL 304

Query: 301 EEENGAKYWLIKNSWGETWGESGYIRILRDAG-LCGIATAASYP 343
           +   G  YWL+KNSWG ++G+ GYIRI R+ G  CGIA   SYP
Sbjct: 305 D---GKDYWLVKNSWGLSFGDKGYIRIARNRGNHCGIANFPSYP 345


>gi|255522980|gb|ACU12382.1| RE21773p [Drosophila melanogaster]
          Length = 375

 Score =  276 bits (707), Expect = 8e-72,   Method: Compositional matrix adjust.
 Identities = 149/319 (46%), Positives = 201/319 (63%), Gaps = 15/319 (4%)

Query: 38  IVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANK---EGNRTYKLGTNEFSDL 94
           ++E+   +  +H + Y+DE E+  RL IF +N   I K N+   EG  ++KL  N+++DL
Sbjct: 59  VMEEWHTFKLEHRKNYQDETEERFRLKIFNENKHKIAKHNQRFAEGKVSFKLAVNKYADL 118

Query: 95  TNEEFRALYTGYNRPVPSVSR---QSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQ 151
            + EFR L  G+N  +    R   +S +  TF       +P S+DWR KGAVT +KDQG 
Sbjct: 119 LHHEFRQLMNGFNYTLHKQLRAADESFKGVTFISPAHVTLPKSVDWRTKGAVTAVKDQGH 178

Query: 152 CGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD--NHGCSGGLMDKAFEYIIENK 209
           CGSCWAFS+  A+EG      G L+ LSEQ LVDCST   N+GC+GGLMD AF YI +N 
Sbjct: 179 CGSCWAFSSTGALEGQHFRKSGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKDNG 238

Query: 210 GLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSN-QPVSVCVDAS 268
           G+ TE  YPY   + +C   K   V AT   + D+P+GDE+ + +AV+   PVSV +DAS
Sbjct: 239 GIDTEKSYPYEAIDDSCHFNK-GTVGATDRGFTDIPQGDEKKMAEAVATVGPVSVAIDAS 297

Query: 269 GRAFHFYKSGVLN-ADC-GNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIR 326
             +F FY  GV N   C   N DHGV VVGFGT  +E+G  YWL+KNSWG TWG+ G+I+
Sbjct: 298 HESFQFYSEGVYNEPQCDAQNLDHGVLVVGFGT--DESGEDYWLVKNSWGTTWGDKGFIK 355

Query: 327 ILRDA-GLCGIATAASYPV 344
           +LR+    CGIA+A+SYP+
Sbjct: 356 MLRNKENQCGIASASSYPL 374


>gi|42573181|ref|NP_974687.1| Xylem cysteine proteinase 1 [Arabidopsis thaliana]
 gi|332661102|gb|AEE86502.1| Xylem cysteine proteinase 1 [Arabidopsis thaliana]
          Length = 288

 Score =  276 bits (707), Expect = 8e-72,   Method: Compositional matrix adjust.
 Identities = 135/245 (55%), Positives = 176/245 (71%), Gaps = 5/245 (2%)

Query: 38  IVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNE 97
           ++E  E WM++H + YK   EK  R  +F++NL +I++ N E N +Y LG NEF+DLT+E
Sbjct: 47  LLELFESWMSEHSKAYKSVEEKVHRFEVFRENLMHIDQRNNEIN-SYWLGLNEFADLTHE 105

Query: 98  EFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWA 157
           EF+  Y G  +P  S  RQ S  + F+Y+++TD+P S+DWR+KGAV  +KDQGQCGSCWA
Sbjct: 106 EFKGRYLGLAKPQFSRKRQPS--ANFRYRDITDLPKSVDWRKKGAVAPVKDQGQCGSCWA 163

Query: 158 FSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGLATEAD 216
           FS VAAVEGI QIT G L  LSEQ+L+DC T  N GC+GGLMD AF+YII   GL  E D
Sbjct: 164 FSTVAAVEGINQITTGNLSSLSEQELIDCDTTFNSGCNGGLMDYAFQYIISTGGLHKEDD 223

Query: 217 YPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYK 276
           YPY  EEG C  QKE     TIS YED+P+ D+++L++A+++QPVSV ++ASGR F FYK
Sbjct: 224 YPYLMEEGICQEQKEDVERVTISGYEDVPENDDESLVKALAHQPVSVAIEASGRDFQFYK 283

Query: 277 SGVLN 281
            GV N
Sbjct: 284 -GVYN 287


>gi|194352760|emb|CAQ00108.1| papain-like cysteine proteinase [Hordeum vulgare subsp. vulgare]
 gi|326510977|dbj|BAJ91836.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326523875|dbj|BAJ96948.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326528631|dbj|BAJ97337.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 368

 Score =  276 bits (707), Expect = 9e-72,   Method: Compositional matrix adjust.
 Identities = 140/333 (42%), Positives = 200/333 (60%), Gaps = 29/333 (8%)

Query: 38  IVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNR--TYKLGTNEFSDLT 95
           + ++  +W A+H RTY    E+  RL ++ +N+ YIE  N +     TY+LG   ++DLT
Sbjct: 38  MAQRFRRWKAEHSRTYATPEEERHRLRVYARNMRYIEATNGDAGAGLTYELGETAYTDLT 97

Query: 96  NEEFRALYTGYNRPVPSVSRQSSRPSTF------------------KYQNVT-DVPTSID 136
           ++EF A+YT  +R  P        P T                    Y N +   P S+D
Sbjct: 98  SDEFTAMYT--SRAPPLSDDDDDLPMTMITTRAGPVAAAGGGGWLQVYVNESAGAPASVD 155

Query: 137 WREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTDNHGCSGG 196
           WRE+GAVT +K+QGQCGSCWAFS VA +EGI QI  GKL  LSEQ+LVDC   +HGC+GG
Sbjct: 156 WRERGAVTAVKNQGQCGSCWAFSTVAVIEGIHQIKTGKLASLSEQELVDCDKLDHGCNGG 215

Query: 197 LMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAV 256
           +  +A ++I  N G+ ++ DYPY  ++ TCD +K    AA+IS ++ +    E +L  AV
Sbjct: 216 VSYRALQWITSNGGITSQDDYPYTAKDDTCDTKKLSHHAASISGFQRVATRSELSLTNAV 275

Query: 257 SNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWG 316
           + QPV+V ++A G  F  Y++GV N  CG   +HGV VVG+G  +E  G  YW++KNSWG
Sbjct: 276 AMQPVAVSIEAGGANFQHYRNGVYNGPCGTRLNHGVTVVGYG-EDEVTGESYWIVKNSWG 334

Query: 317 ETWGESGYIR-----ILRDAGLCGIATAASYPV 344
           E WG++GY+R     I +  G+CGIA   S+P+
Sbjct: 335 EKWGDNGYLRMKKGIIDKPEGICGIAIRPSFPL 367


>gi|24653514|ref|NP_523735.2| cysteine proteinase-1, isoform C [Drosophila melanogaster]
 gi|118572624|sp|Q95029.2|CATL_DROME RecName: Full=Cathepsin L; AltName: Full=Cysteine proteinase 1;
           Contains: RecName: Full=Cathepsin L heavy chain;
           Contains: RecName: Full=Cathepsin L light chain; Flags:
           Precursor
 gi|21627209|gb|AAM68565.1| cysteine proteinase-1, isoform C [Drosophila melanogaster]
          Length = 371

 Score =  276 bits (706), Expect = 9e-72,   Method: Compositional matrix adjust.
 Identities = 149/319 (46%), Positives = 201/319 (63%), Gaps = 15/319 (4%)

Query: 38  IVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANK---EGNRTYKLGTNEFSDL 94
           ++E+   +  +H + Y+DE E+  RL IF +N   I K N+   EG  ++KL  N+++DL
Sbjct: 55  VMEEWHTFKLEHRKNYQDETEERFRLKIFNENKHKIAKHNQRFAEGKVSFKLAVNKYADL 114

Query: 95  TNEEFRALYTGYNRPVPSVSR---QSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQ 151
            + EFR L  G+N  +    R   +S +  TF       +P S+DWR KGAVT +KDQG 
Sbjct: 115 LHHEFRQLMNGFNYTLHKQLRAADESFKGVTFISPAHVTLPKSVDWRTKGAVTAVKDQGH 174

Query: 152 CGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD--NHGCSGGLMDKAFEYIIENK 209
           CGSCWAFS+  A+EG      G L+ LSEQ LVDCST   N+GC+GGLMD AF YI +N 
Sbjct: 175 CGSCWAFSSTGALEGQHFRKSGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKDNG 234

Query: 210 GLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSN-QPVSVCVDAS 268
           G+ TE  YPY   + +C   K   V AT   + D+P+GDE+ + +AV+   PVSV +DAS
Sbjct: 235 GIDTEKSYPYEAIDDSCHFNK-GTVGATDRGFTDIPQGDEKKMAEAVATVGPVSVAIDAS 293

Query: 269 GRAFHFYKSGVLN-ADC-GNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIR 326
             +F FY  GV N   C   N DHGV VVGFGT  +E+G  YWL+KNSWG TWG+ G+I+
Sbjct: 294 HESFQFYSEGVYNEPQCDAQNLDHGVLVVGFGT--DESGEDYWLVKNSWGTTWGDKGFIK 351

Query: 327 ILRDA-GLCGIATAASYPV 344
           +LR+    CGIA+A+SYP+
Sbjct: 352 MLRNKENQCGIASASSYPL 370


>gi|125525815|gb|EAY73929.1| hypothetical protein OsI_01813 [Oryza sativa Indica Group]
          Length = 336

 Score =  276 bits (706), Expect = 1e-71,   Method: Compositional matrix adjust.
 Identities = 145/342 (42%), Positives = 206/342 (60%), Gaps = 15/342 (4%)

Query: 8   SFIIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFK 67
           +F++ +  ++ L    AS   +  S  +   ++  E+WMA+ G+TYK   EK  R  IF+
Sbjct: 4   AFLLVVCTLMALQAMAASAYYNNGS-DDGVTMQMFEEWMAKFGKTYKCHGEKEHRFGIFR 62

Query: 68  QNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQN 127
            N+ +I     +      +G N+F+DLTN+EF A YTG   P P   +++ RP    +  
Sbjct: 63  DNVHFIRGYKPQVTYDSAVGINQFADLTNDEFVATYTGAKPPHP---KEAPRPVDPIW-- 117

Query: 128 VTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCS 187
               P  IDWR +GAVT +KDQG CGSCWAF+AVAA+EG+T+I  G+L  LSEQ+LVDC 
Sbjct: 118 ---TPCCIDWRFRGAVTGVKDQGACGSCWAFAAVAAIEGLTKIRTGQLTPLSEQELVDCD 174

Query: 188 TDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCD-NQKEKAVAATISKYEDLPK 246
           T+++GC GG  D+AFE +    G+  E+DY Y   +G C  +      AA+I  Y  +P 
Sbjct: 175 TNSNGCGGGHTDRAFELVASKGGITAESDYRYEGFQGKCRVDDMLFNHAASIGGYRAVPP 234

Query: 247 GDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGA 306
            DE+ L  AV+ QPV+V +DASG AF FYKSGV    CG + +H V +VG+   +  +G 
Sbjct: 235 NDERQLATAVARQPVTVYIDASGPAFQFYKSGVFPGPCGASSNHAVTLVGY-CQDGASGK 293

Query: 307 KYWLIKNSWGETWGESGYIRILRDA----GLCGIATAASYPV 344
           KYW+ KNSWG+TWG+ GYI + +D     G CG+A +  YP 
Sbjct: 294 KYWVAKNSWGKTWGQQGYILLEKDVLQPHGTCGLAVSPFYPT 335


>gi|49387634|dbj|BAD25828.1| putative cysteine proteinase [Oryza sativa Japonica Group]
 gi|49388888|dbj|BAD26098.1| putative cysteine proteinase [Oryza sativa Japonica Group]
          Length = 358

 Score =  276 bits (706), Expect = 1e-71,   Method: Compositional matrix adjust.
 Identities = 139/316 (43%), Positives = 189/316 (59%), Gaps = 14/316 (4%)

Query: 38  IVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNE 97
           ++++   W   H R+Y    E   R +++++N E+I+  N  G+ TY+L  NEF+DLT E
Sbjct: 43  MMDRFRAWQGAHNRSYPSAEEALQRFDVYRRNAEFIDAVNLRGDLTYQLAENEFADLTEE 102

Query: 98  EFRALYTGY---NRPVPS---VSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQ-G 150
           EF A YTGY   + PV      +      ++F Y+   DVP S+DWR +GAV   K Q  
Sbjct: 103 EFLATYTGYYAGDGPVDDSVITTGAGDVDASFSYR--VDVPASVDWRAQGAVVPPKSQTS 160

Query: 151 QCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTDNHGCSGGLMDKAFEYIIENKG 210
            C SCWAF   A +E +  I  GKL+ LSEQQLVDC + + GC+ G   +A+++++EN G
Sbjct: 161 TCSSCWAFVTAATIESLNMIKTGKLVSLSEQQLVDCDSYDGGCNLGSYGRAYKWVVENGG 220

Query: 211 LATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGR 270
           L TEADYPY    G C+  K    AA I+ +  +P  +E AL  AV+ QPV+V ++  G 
Sbjct: 221 LTTEADYPYTARRGPCNRAKSAHHAAKITGFGKVPPRNEAALQAAVARQPVAVAIEV-GS 279

Query: 271 AFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD 330
              FYK GV    CG    H V VVG+GT +  +GAKYW IKNSWG++WGE GYIRILRD
Sbjct: 280 GMQFYKGGVYTGPCGTRLAHAVTVVGYGT-DASSGAKYWTIKNSWGQSWGERGYIRILRD 338

Query: 331 A---GLCGIATAASYP 343
               GLCG+    +YP
Sbjct: 339 VGGPGLCGVTLDIAYP 354


>gi|157132324|ref|XP_001655999.1| cathepsin l [Aedes aegypti]
 gi|108881694|gb|EAT45919.1| AAEL002833-PA [Aedes aegypti]
          Length = 339

 Score =  276 bits (706), Expect = 1e-71,   Method: Compositional matrix adjust.
 Identities = 153/343 (44%), Positives = 210/343 (61%), Gaps = 19/343 (5%)

Query: 14  FVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYI 73
            +I+++    A+  VS   +    + E+   +  QH + Y  E E+ +RL I+ QN   I
Sbjct: 3   ILILLMAFVAAANAVSLYEL----VKEEWNAFKLQHRKNYDSETEERIRLKIYVQNKHKI 58

Query: 74  EKANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSR---PSTFKYQN 127
            K N+    G   Y+L  N+++DL +EEF     G+NR     S +  R   P TF    
Sbjct: 59  AKHNQRFDLGQEKYRLRVNKYADLLHEEFVQTVNGFNRTDSKKSLKGVRIEEPVTFIEPA 118

Query: 128 VTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCS 187
             +VPT++DWR+KGAVT +KDQG CGSCW+FSA  A+EG      GKL+ LSEQ LVDCS
Sbjct: 119 NVEVPTTVDWRKKGAVTPVKDQGHCGSCWSFSATGALEGQHFRKTGKLVSLSEQNLVDCS 178

Query: 188 TD--NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLP 245
               N+GC+GG+MD AF+YI +N G+ TE  YPY   + TC +   KAV AT   Y D+P
Sbjct: 179 GKYGNNGCNGGMMDYAFQYIKDNGGIDTEKSYPYEAIDDTC-HFNPKAVGATDKGYVDIP 237

Query: 246 KGDEQALLQAVSN-QPVSVCVDASGRAFHFYKSGV-LNADCGN-NCDHGVAVVGFGTAEE 302
           +GDE+AL +A++   PVS+ +DAS  +F FY  GV     C + N DHGV  VG+GT+EE
Sbjct: 238 QGDEEALKKALATVGPVSIAIDASHESFQFYSEGVYYEPQCDSENLDHGVLAVGYGTSEE 297

Query: 303 ENGAKYWLIKNSWGETWGESGYIRILRDA-GLCGIATAASYPV 344
             G  YWL+KNSWG TWG+ GY+++ R+    CG+AT ASYP+
Sbjct: 298 --GEDYWLVKNSWGTTWGDQGYVKMARNRDNHCGVATCASYPL 338


>gi|125570286|gb|EAZ11801.1| hypothetical protein OsJ_01675 [Oryza sativa Japonica Group]
          Length = 319

 Score =  276 bits (706), Expect = 1e-71,   Method: Compositional matrix adjust.
 Identities = 141/311 (45%), Positives = 193/311 (62%), Gaps = 14/311 (4%)

Query: 39  VEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEE 98
           ++  E+WMA+ G+TYK   EK  R  IF+ N+ +I     +      +G N+F+DLTN+E
Sbjct: 17  MQMFEEWMAKFGKTYKCHGEKEHRFGIFRDNVHFIRGYKPQVTYDSAVGINQFADLTNDE 76

Query: 99  FRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAF 158
           F A YTG   P P   +++ RP    +      P  IDWR +GAVT +KDQG CGSCWAF
Sbjct: 77  FVATYTGAKPPHP---KEAPRPVDPIW-----TPCCIDWRFRGAVTGVKDQGACGSCWAF 128

Query: 159 SAVAAVEGITQITRGKLIELSEQQLVDCSTDNHGCSGGLMDKAFEYIIENKGLATEADYP 218
           +AVAA+EG+T+I  G+L  LSEQ+LVDC T+++GC GG  D+AFE +    G+  E+DY 
Sbjct: 129 AAVAAIEGLTKIRTGQLTPLSEQELVDCDTNSNGCGGGHTDRAFELVASKGGITAESDYR 188

Query: 219 YRHEEGTCD-NQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKS 277
           Y   +G C  +      AA+I  Y  +P  DE+ L  AV+ QPV+V +DASG AF FYKS
Sbjct: 189 YEGFQGKCRVDDMLFNHAASIGGYRAVPPNDERQLATAVARQPVTVYIDASGPAFQFYKS 248

Query: 278 GVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRDA----GL 333
           GV    CG + +H V +VG+   +  +G KYWL KNSWG+TWG+ GYI + +D     G 
Sbjct: 249 GVFPGPCGASSNHAVTLVGY-CQDGASGKKYWLAKNSWGKTWGQQGYILLEKDIVQPHGT 307

Query: 334 CGIATAASYPV 344
           CG+A +  YP 
Sbjct: 308 CGLAVSPFYPT 318


>gi|24653516|ref|NP_725347.1| cysteine proteinase-1, isoform A [Drosophila melanogaster]
 gi|24653518|ref|NP_725348.1| cysteine proteinase-1, isoform B [Drosophila melanogaster]
 gi|1658527|gb|AAB18345.1| cysteine proteinase 1 [Drosophila melanogaster]
 gi|2305221|gb|AAB65749.1| cysteine proteinase-1 [Drosophila melanogaster]
 gi|7303249|gb|AAF58311.1| cysteine proteinase-1, isoform A [Drosophila melanogaster]
 gi|21627210|gb|AAM68566.1| cysteine proteinase-1, isoform B [Drosophila melanogaster]
 gi|54650754|gb|AAV36956.1| LP06554p [Drosophila melanogaster]
 gi|220951982|gb|ACL88534.1| Cp1-PA [synthetic construct]
          Length = 341

 Score =  276 bits (705), Expect = 1e-71,   Method: Compositional matrix adjust.
 Identities = 149/319 (46%), Positives = 201/319 (63%), Gaps = 15/319 (4%)

Query: 38  IVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANK---EGNRTYKLGTNEFSDL 94
           ++E+   +  +H + Y+DE E+  RL IF +N   I K N+   EG  ++KL  N+++DL
Sbjct: 25  VMEEWHTFKLEHRKNYQDETEERFRLKIFNENKHKIAKHNQRFAEGKVSFKLAVNKYADL 84

Query: 95  TNEEFRALYTGYNRPVPSVSR---QSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQ 151
            + EFR L  G+N  +    R   +S +  TF       +P S+DWR KGAVT +KDQG 
Sbjct: 85  LHHEFRQLMNGFNYTLHKQLRAADESFKGVTFISPAHVTLPKSVDWRTKGAVTAVKDQGH 144

Query: 152 CGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD--NHGCSGGLMDKAFEYIIENK 209
           CGSCWAFS+  A+EG      G L+ LSEQ LVDCST   N+GC+GGLMD AF YI +N 
Sbjct: 145 CGSCWAFSSTGALEGQHFRKSGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKDNG 204

Query: 210 GLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSN-QPVSVCVDAS 268
           G+ TE  YPY   + +C   K   V AT   + D+P+GDE+ + +AV+   PVSV +DAS
Sbjct: 205 GIDTEKSYPYEAIDDSCHFNK-GTVGATDRGFTDIPQGDEKKMAEAVATVGPVSVAIDAS 263

Query: 269 GRAFHFYKSGVLN-ADC-GNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIR 326
             +F FY  GV N   C   N DHGV VVGFGT  +E+G  YWL+KNSWG TWG+ G+I+
Sbjct: 264 HESFQFYSEGVYNEPQCDAQNLDHGVLVVGFGT--DESGEDYWLVKNSWGTTWGDKGFIK 321

Query: 327 ILRDA-GLCGIATAASYPV 344
           +LR+    CGIA+A+SYP+
Sbjct: 322 MLRNKENQCGIASASSYPL 340


>gi|15290195|dbj|BAB63884.1| putative cysteine protease [Oryza sativa Japonica Group]
 gi|125525813|gb|EAY73927.1| hypothetical protein OsI_01811 [Oryza sativa Indica Group]
          Length = 342

 Score =  275 bits (704), Expect = 2e-71,   Method: Compositional matrix adjust.
 Identities = 146/345 (42%), Positives = 205/345 (59%), Gaps = 21/345 (6%)

Query: 12  PMFVIIILVITC--ASQVVSGRSMH-----EPSIVEKHEQWMAQHGRTYKDELEKAMRLN 64
           PM   ++LV+    A Q +   + +     +   ++  E+WMA+ G+TYK   EK  R  
Sbjct: 6   PMASAVLLVVCTLMALQAMGADAYYNNGSDDGVTMQMFEEWMAKFGKTYKCHGEKEHRFG 65

Query: 65  IFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFK 124
           IF+ N+ +I     +      +G N+F+DLTN+EF A YTG   P P   +++ RP    
Sbjct: 66  IFRDNVHFIRGYKPQVTYDSAVGINQFADLTNDEFVATYTGAKPPHP---KEAPRPVDPI 122

Query: 125 YQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLV 184
           +      P  IDWR +GAVT +KDQG CGSCWAF+AVAA+EG+T+I  G+L  LSEQ+LV
Sbjct: 123 W-----TPCCIDWRFRGAVTGVKDQGACGSCWAFAAVAAIEGLTKIRTGQLTPLSEQELV 177

Query: 185 DCSTDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCD-NQKEKAVAATISKYED 243
           DC T+++GC GG  D+AFE +    G+  E+DY Y   +G C  +      AA I  Y  
Sbjct: 178 DCDTNSNGCGGGHTDRAFELVASKGGITAESDYRYEGFQGKCRVDDMLFNHAARIGGYRA 237

Query: 244 LPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEE 303
           +P  DE+ L  AV+ QPV+V +DASG AF FYKSGV    CG + +H V +VG+   +  
Sbjct: 238 VPPNDERQLATAVARQPVTVYIDASGPAFQFYKSGVFPGPCGASSNHAVTLVGY-CQDGA 296

Query: 304 NGAKYWLIKNSWGETWGESGYIRILRDA----GLCGIATAASYPV 344
           +G KYW+ KNSWG+TWG+ GYI + +D     G CG+A +  YP 
Sbjct: 297 SGKKYWVAKNSWGKTWGQQGYILLEKDVLQPHGTCGLAVSPFYPT 341


>gi|320169658|gb|EFW46557.1| cathepsin L2 [Capsaspora owczarzaki ATCC 30864]
          Length = 324

 Score =  275 bits (704), Expect = 2e-71,   Method: Compositional matrix adjust.
 Identities = 148/308 (48%), Positives = 197/308 (63%), Gaps = 13/308 (4%)

Query: 43  EQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRAL 102
           + W A HG +Y    E+  R  I++ NL++IEK N EG  +YKL  N+F+DLT  EF A 
Sbjct: 23  DSWKATHGVSYATVGEETARRGIYRANLDFIEKHNSEG-HSYKLAVNKFADLTYPEFAAK 81

Query: 103 YTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVA 162
           Y G      + ++ S   ST+  + V+ +P S+DWR  G VT IKDQGQCGSCW+FS   
Sbjct: 82  YLGLRFDATNATK-SFAASTYLPRMVS-LPDSVDWRTAGIVTPIKDQGQCGSCWSFSTTG 139

Query: 163 AVEGITQITRGKLIELSEQQLVDCST--DNHGCSGGLMDKAFEYIIENKGLATEADYPYR 220
           +VEG      G+L+ LSEQ LVDCS+   N GC+GGLMD+AF+YII N G+ TE+ YPY 
Sbjct: 140 SVEGQHARKTGQLVSLSEQNLVDCSSAQGNAGCNGGLMDQAFQYIISNNGIDTESSYPYT 199

Query: 221 HEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSN-QPVSVCVDASGRAFHFYKSGV 279
            ++GTC       V AT++ Y+D+  G E  L  AV+   P+SV +DAS  +F FY SGV
Sbjct: 200 AQDGTCQFNSAN-VGATVASYQDIASGSESDLQNAVATVGPISVAIDASQPSFQFYSSGV 258

Query: 280 LN--ADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRDA-GLCGI 336
            N  A   +  DHGV  VG+GT+   +   YWL+KNSWG +WG+SGYI + R++   CGI
Sbjct: 259 YNEPACSSSQLDHGVLAVGYGTSGSSD---YWLVKNSWGTSWGQSGYIWMTRNSNNQCGI 315

Query: 337 ATAASYPV 344
           ATAASYP+
Sbjct: 316 ATAASYPL 323


>gi|242093994|ref|XP_002437487.1| hypothetical protein SORBIDRAFT_10g027980 [Sorghum bicolor]
 gi|241915710|gb|EER88854.1| hypothetical protein SORBIDRAFT_10g027980 [Sorghum bicolor]
          Length = 341

 Score =  275 bits (704), Expect = 2e-71,   Method: Compositional matrix adjust.
 Identities = 143/317 (45%), Positives = 197/317 (62%), Gaps = 36/317 (11%)

Query: 38  IVEKHEQWMAQHGRTYKDELEKA--MRLNIFKQNLEYIEKANKE---GNRTYKLGTNEFS 92
           + + ++ W ++HGR  +D +  A  +RL +F+ NL YI+  N E   G  T++LG   F+
Sbjct: 47  VRQLYKTWKSEHGRP-RDGISVADGLRLKVFRDNLRYIDAHNAEADAGLHTFRLGLTPFT 105

Query: 93  DLTNEEFRALYTGY-NRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQ 151
           DLT EEFRA   G+ N  +P V+     P         D+P ++DWR++GAVT +K+Q  
Sbjct: 106 DLTLEEFRAHALGFLNSTLPRVASDRYLPRAGD-----DLPDAVDWRQQGAVTGVKNQLD 160

Query: 152 CGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTDNHGCSGGLMDKAFEYIIENKGL 211
           CG CWAFSAVAA+EGI +I    LI LSEQ+L+DC T+++GC GG M KAF+++I+N G+
Sbjct: 161 CGGCWAFSAVAAMEGINKIVTNNLISLSEQELIDCDTEDYGCQGGEMQKAFQFVIDNGGI 220

Query: 212 ATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRA 271
            TEADYP+    GTCD  +EK    +I  YE++P  DE+AL +AV+NQP           
Sbjct: 221 DTEADYPFIGTNGTCDAIREKRKVVSIDSYENVPTNDEEALQKAVANQP----------- 269

Query: 272 FHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRDA 331
                 G+ N  CG   DHGV  VG+G+   +NG  +W++KNSWG  WGESGYIR+ R+ 
Sbjct: 270 ------GIFNGPCGFILDHGVTAVGYGS---DNGEDFWIVKNSWGAEWGESGYIRMKRNV 320

Query: 332 ----GLCGIATAASYPV 344
               G CGIA  ASYPV
Sbjct: 321 LLPMGKCGIAMYASYPV 337


>gi|195056367|ref|XP_001995082.1| GH22826 [Drosophila grimshawi]
 gi|193899288|gb|EDV98154.1| GH22826 [Drosophila grimshawi]
          Length = 340

 Score =  275 bits (703), Expect = 2e-71,   Method: Compositional matrix adjust.
 Identities = 152/343 (44%), Positives = 207/343 (60%), Gaps = 19/343 (5%)

Query: 14  FVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYI 73
           ++  +L +   +Q VS   +    I E+ + +  +H + Y+DE E+  RL IF +N   I
Sbjct: 4   YIFALLALVAVAQAVSFADV----IKEEWQTFKLEHRKQYQDETEERFRLKIFNENKHKI 59

Query: 74  EKANK---EGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRP---STFKYQN 127
            K N+    G  ++K+G N+++D+ + EF     G+N  +    R S       TF    
Sbjct: 60  AKHNQLYAAGEVSFKMGLNKYADMLHHEFHETMNGFNYTLHKQLRASDATFTGVTFISPE 119

Query: 128 VTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCS 187
              +P S+DWR KGAVT +KDQG CGSCWAFS+  A+EG      G LI LSEQ LVDCS
Sbjct: 120 HVKLPQSVDWRNKGAVTGVKDQGHCGSCWAFSSTGALEGQHFRKTGTLISLSEQNLVDCS 179

Query: 188 TD--NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLP 245
           T   N+GC+GGLMD AF YI +N G+ TE  YPY   + +C   K   + AT   + D+P
Sbjct: 180 TKYGNNGCNGGLMDNAFRYIKDNGGIDTEKSYPYEGIDDSCHFNK-GTIGATDRGFTDIP 238

Query: 246 KGDEQALLQAVSN-QPVSVCVDASGRAFHFYKSGVLN-ADCG-NNCDHGVAVVGFGTAEE 302
           +GDE+ L QAV+   PVSV +DAS  +F FY +GV +   C   N DHGV VVG+GT  +
Sbjct: 239 QGDEKKLAQAVATIGPVSVAIDASHESFQFYSTGVYDEPQCDPQNLDHGVLVVGYGT--D 296

Query: 303 ENGAKYWLIKNSWGETWGESGYIRILR-DAGLCGIATAASYPV 344
           ENG  YWL+KNSWG TWG+ G+I++ R D   CGIATA+SYP+
Sbjct: 297 ENGKDYWLVKNSWGTTWGDKGFIKMARNDDNQCGIATASSYPL 339


>gi|125525812|gb|EAY73926.1| hypothetical protein OsI_01810 [Oryza sativa Indica Group]
          Length = 319

 Score =  275 bits (703), Expect = 2e-71,   Method: Compositional matrix adjust.
 Identities = 140/311 (45%), Positives = 193/311 (62%), Gaps = 14/311 (4%)

Query: 39  VEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEE 98
           ++  E+WMA+ G+TYK   EK  R  IF+ N+ +I     +      +G N+F+DLTN+E
Sbjct: 17  MQMFEEWMAKFGKTYKCHGEKEHRFGIFRDNVHFIRGYKPQVTYDSAVGINQFADLTNDE 76

Query: 99  FRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAF 158
           F A YTG   P P   +++ RP    +      P  IDWR +GAVT +KDQG CGSCWAF
Sbjct: 77  FVATYTGAKPPHP---KEAPRPVDPIW-----TPCCIDWRFRGAVTGVKDQGACGSCWAF 128

Query: 159 SAVAAVEGITQITRGKLIELSEQQLVDCSTDNHGCSGGLMDKAFEYIIENKGLATEADYP 218
           +AVAA+EG+T+I  G+L  LSEQ+LVDC T+++GC GG  D+AFE +    G+  E+DY 
Sbjct: 129 AAVAAIEGLTKIRTGQLTPLSEQELVDCDTNSNGCGGGHTDRAFELVASKGGITAESDYR 188

Query: 219 YRHEEGTCD-NQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKS 277
           Y   +G C  +      AA+I  Y  +P  DE+ L  AV+ QPV+V +DASG AF FYKS
Sbjct: 189 YEGFQGKCRVDDMLFNHAASIGGYRAVPPNDERQLATAVARQPVTVYIDASGPAFQFYKS 248

Query: 278 GVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRDA----GL 333
           GV    CG + +H V +VG+   +  +G KYW+ KNSWG+TWG+ GYI + +D     G 
Sbjct: 249 GVFPGPCGASSNHAVTLVGY-CQDGASGKKYWVAKNSWGKTWGQQGYILLEKDVLQPHGT 307

Query: 334 CGIATAASYPV 344
           CG+A +  YP 
Sbjct: 308 CGLAVSPFYPT 318


>gi|391343119|ref|XP_003745860.1| PREDICTED: cathepsin L-like [Metaseiulus occidentalis]
          Length = 385

 Score =  275 bits (703), Expect = 2e-71,   Method: Compositional matrix adjust.
 Identities = 151/337 (44%), Positives = 211/337 (62%), Gaps = 16/337 (4%)

Query: 14  FVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYI 73
            + ++ VI  AS +        P++ +  E + A+H + Y+   E+ MR  IF++N ++I
Sbjct: 58  LLAVLAVIGLASALSP-----NPNLNQHWENFKAEHNKKYESFPEELMRRLIFEENHQFI 112

Query: 74  EKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPT 133
           E  N +    + LG N F DLTN+E+R  Y GY RP  + S+ S   S  + + + DVP 
Sbjct: 113 EDHNSKKEFDFYLGMNHFGDLTNKEYRERYLGYRRPENTPSKASYIFS--RAEKIEDVPD 170

Query: 134 SIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD--NH 191
            IDWR++G VT +K+QGQCGSCWAFSAV ++EG    + GKL+ LSEQ LVDCST   N 
Sbjct: 171 QIDWRDQGFVTPVKNQGQCGSCWAFSAVGSLEGQHFKSTGKLVSLSEQNLVDCSTPEGNS 230

Query: 192 GCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQA 251
           GC+GG MD+AFEY+ +N G+ TE  YPY   +G+C + K K++ AT+  + D+ +GDE+A
Sbjct: 231 GCNGGWMDQAFEYVKDNHGIDTEDSYPYVGTDGSC-HFKNKSIGATLKGFMDVKEGDEEA 289

Query: 252 LLQAVS-NQPVSVCVDASGRAFHFYKSGVLNAD-CGNN-CDHGVAVVGFGTAEEENGAKY 308
           L QAV    PVSV +DAS   F FY+ GV N   C  +  DHGV VVG+G  ++  G  +
Sbjct: 290 LRQAVGVAGPVSVAIDASSMLFQFYRGGVYNVPWCSTSELDHGVLVVGYG--KQFQGKDF 347

Query: 309 WLIKNSWGETWGESGYIRILRDAG-LCGIATAASYPV 344
           W++KNSWG  WG  GYI + R+ G  CGIA+ AS P 
Sbjct: 348 WMVKNSWGVGWGIYGYIEMSRNKGNQCGIASKASIPT 384


>gi|170041165|ref|XP_001848344.1| cathepsin l [Culex quinquefasciatus]
 gi|167864709|gb|EDS28092.1| cathepsin l [Culex quinquefasciatus]
          Length = 340

 Score =  275 bits (702), Expect = 3e-71,   Method: Compositional matrix adjust.
 Identities = 151/318 (47%), Positives = 203/318 (63%), Gaps = 19/318 (5%)

Query: 43  EQWMA---QHGRTYKDELEKAMRLNIFKQNLEYIEKANK---EGNRTYKLGTNEFSDLTN 96
           E+W A   QH + Y  E E+ +RL I+ QN   I K N+   +G   ++L  N+++DL +
Sbjct: 25  EEWNAYKLQHRKKYDSETEERLRLKIYVQNKHKIAKHNQRFEQGQEKFRLRVNKYTDLLH 84

Query: 97  EEFRALYTGYNR---PVPSVSR-QSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQC 152
           EEF     G+NR     P +   +   P T+      +VP ++DWREKGAVT +KDQG C
Sbjct: 85  EEFVQTLNGFNRTNAKKPMLKGVKIDEPVTYIEPANVEVPKTVDWREKGAVTPVKDQGHC 144

Query: 153 GSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD--NHGCSGGLMDKAFEYIIENKG 210
           GSCW+FSA  A+EG      GKL+ LSEQ LVDCST   N+GC+GG+MD AF+YI +N G
Sbjct: 145 GSCWSFSATGALEGQHFRKTGKLVSLSEQNLVDCSTKYGNNGCNGGMMDFAFQYIKDNGG 204

Query: 211 LATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQ-PVSVCVDASG 269
           + TE  YPY   + TC +   KAV AT   + D+P+GDE+AL++A++   PVSV +DAS 
Sbjct: 205 IDTEKAYPYEAIDDTC-HYNPKAVGATDKGFVDIPQGDEKALMKAIATAGPVSVAIDASH 263

Query: 270 RAFHFYKSGV-LNADCGN-NCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRI 327
            +F FY  GV     C + N DHGV  VG+GT+EE  G  YWL+KNSWG TWG+ GY+++
Sbjct: 264 ESFQFYSEGVYYEPQCDSENLDHGVLAVGYGTSEE--GEDYWLVKNSWGTTWGDQGYVKM 321

Query: 328 LRDA-GLCGIATAASYPV 344
            R+    CGIATAASYP+
Sbjct: 322 ARNRDNHCGIATAASYPL 339


>gi|223946391|gb|ACN27279.1| unknown [Zea mays]
          Length = 279

 Score =  275 bits (702), Expect = 3e-71,   Method: Compositional matrix adjust.
 Identities = 140/261 (53%), Positives = 169/261 (64%), Gaps = 14/261 (5%)

Query: 94  LTNEEFRALYTGYNRPVPSVSR-----QSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKD 148
           +T +EFR  Y G       + R      S+  S+F Y +  DVP S+DWR+KGAVT +KD
Sbjct: 1   MTADEFRRHYAGSRVAHHRMFRGDRQGSSASASSFMYADARDVPASVDWRQKGAVTDVKD 60

Query: 149 QGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIE 207
           QGQCGSCWAFS +AAVEGI  I    L  LSEQQLVDC T  N GC+GGLMD AF+YI +
Sbjct: 61  QGQCGSCWAFSTIAAVEGINAIKTKNLTSLSEQQLVDCDTKANAGCNGGLMDYAFQYIAK 120

Query: 208 NKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDA 267
           + G+A E  YPYR  + +C  +K  A   TI  YED+P  DE AL +AV++QPVSV ++A
Sbjct: 121 HGGVAAEDAYPYRARQASC--KKSPAPVVTIDGYEDVPANDESALKKAVAHQPVSVAIEA 178

Query: 268 SGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRI 327
           SG  F FY  GV +  CG   DHGVA VG+G     +G KYWL+KNSWG  WGE GYIR+
Sbjct: 179 SGSHFQFYSEGVFSGRCGTELDHGVAAVGYGVT--ADGTKYWLVKNSWGPEWGEKGYIRM 236

Query: 328 LRDA----GLCGIATAASYPV 344
            RD     G CGIA  ASYPV
Sbjct: 237 ARDVAAKEGHCGIAMEASYPV 257


>gi|255078398|ref|XP_002502779.1| cysteine endopeptidase [Micromonas sp. RCC299]
 gi|226518045|gb|ACO64037.1| cysteine endopeptidase [Micromonas sp. RCC299]
          Length = 414

 Score =  274 bits (701), Expect = 4e-71,   Method: Compositional matrix adjust.
 Identities = 144/320 (45%), Positives = 196/320 (61%), Gaps = 13/320 (4%)

Query: 37  SIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKE---GNRTYKLGTNEFSD 93
           S+ +   +W  +HG+TY  E EK +RL IF  N E+++K N E   G  T+ +G N  +D
Sbjct: 63  SLSDLFHEWTQKHGKTYDSEEEKELRLKIFADNHEFVQKHNAEYENGEHTHFVGLNHLAD 122

Query: 94  LTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCG 153
           LT +EF+ +  GYN  +   SR     ST++Y +VT  P  IDW   GAVT +K+Q QCG
Sbjct: 123 LTKDEFKKML-GYNAAL-RASRAPVDASTWEYADVTP-PEEIDWVASGAVTPVKNQKQCG 179

Query: 154 SCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGLA 212
           SCWAFS   AVEG+  I  GKLI LSE++L+ CST+ N GC+GGLMD  FE+I+ N+G+ 
Sbjct: 180 SCWAFSTTGAVEGVNAIKTGKLISLSEEELISCSTNGNMGCNGGLMDNGFEWIVNNRGID 239

Query: 213 TEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAF 272
           TE  + Y  +E  C   +    A  I  ++D+P  DE +L++AVS QPVSV ++A  ++F
Sbjct: 240 TEDGWEYVAKEEKCGFFRRHHRAVAIDGFKDVPSNDEDSLMKAVSQQPVSVAIEADHQSF 299

Query: 273 HFYKSGVLNA-DCGNNCDHGVAVVGFGTAEEENGAK-YWLIKNSWGETWGESGYIRILRD 330
             Y  GV +A DCG   DHGV +VG+G   +    K +W IKNSWG  WGE GYIRI + 
Sbjct: 300 QLYAGGVYSAKDCGTELDHGVLLVGYGVDPKSTKHKHFWKIKNSWGPAWGEDGYIRIAKG 359

Query: 331 A----GLCGIATAASYPVAI 346
                G CG+A   SYP  +
Sbjct: 360 GSGVEGQCGVAMQPSYPTKL 379


>gi|195583187|ref|XP_002081405.1| GD10995 [Drosophila simulans]
 gi|194193414|gb|EDX06990.1| GD10995 [Drosophila simulans]
          Length = 341

 Score =  274 bits (700), Expect = 5e-71,   Method: Compositional matrix adjust.
 Identities = 147/319 (46%), Positives = 201/319 (63%), Gaps = 15/319 (4%)

Query: 38  IVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANK---EGNRTYKLGTNEFSDL 94
           ++E+   +  +H + Y+D+ E+  RL IF +N   I K N+   EG  ++KL  N+++DL
Sbjct: 25  VMEEWHTFKLEHRKNYQDDTEERFRLKIFNENKHKIAKHNQRFAEGKVSFKLAVNKYADL 84

Query: 95  TNEEFRALYTGYNRPVPSVSR---QSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQ 151
            + EFR L  G+N  +    R   +S +  TF       +P S+DWR KGAVT +KDQG 
Sbjct: 85  LHHEFRQLMNGFNYTLHKQLRAADESFKGVTFISPAHVTLPKSVDWRTKGAVTAVKDQGH 144

Query: 152 CGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD--NHGCSGGLMDKAFEYIIENK 209
           CGSCWAFS+  A+EG      G L+ LSEQ LVDCST   N+GC+GGLMD AF YI +N 
Sbjct: 145 CGSCWAFSSTGALEGQHFRKSGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKDNG 204

Query: 210 GLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSN-QPVSVCVDAS 268
           G+ TE  YPY   + +C   K   + AT   + D+P+GDE+ + +AV+   PVSV +DAS
Sbjct: 205 GIDTEKSYPYEAIDDSCHFNK-GTIGATDRGFTDIPQGDEKKMAEAVATVGPVSVAIDAS 263

Query: 269 GRAFHFYKSGVLN-ADC-GNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIR 326
             +F FY  GV N   C   N DHGV VVGFGT  +E+G  YWL+KNSWG TWG+ G+I+
Sbjct: 264 HESFQFYSEGVYNEPQCDAQNLDHGVLVVGFGT--DESGDDYWLVKNSWGTTWGDKGFIK 321

Query: 327 ILRDA-GLCGIATAASYPV 344
           +LR+    CGIA+A+SYP+
Sbjct: 322 MLRNKENQCGIASASSYPL 340


>gi|388890776|gb|AFK80364.1| cysteine proteinase 3, partial [Acanthamoeba castellanii]
          Length = 329

 Score =  274 bits (700), Expect = 5e-71,   Method: Compositional matrix adjust.
 Identities = 147/338 (43%), Positives = 202/338 (59%), Gaps = 14/338 (4%)

Query: 13  MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEY 72
           M  I ILV+  A  V S  +     +     +WM  + ++Y +E E   R N++++N + 
Sbjct: 1   MRAITILVLLAAICVASTLATTHDPLTGVFAEWMRDNSKSYSNE-EFVFRWNVWRENQQL 59

Query: 73  IEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVP 132
           IE+ N+  N+T  L  N+F DLTN EF  L+ G        S  +++ +  K      + 
Sbjct: 60  IEEHNRS-NKTSFLAMNKFGDLTNAEFNKLFKGL---AFDYSFHANKAAAEKAVPAPGLS 115

Query: 133 TSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD--N 190
              DWR+KGAVTH+K+QGQCGSCW+FS   + EG   +  G+L  LSEQ L+DCS    N
Sbjct: 116 ADFDWRQKGAVTHVKNQGQCGSCWSFSTTGSTEGANFLKTGRLTSLSEQNLIDCSGSYGN 175

Query: 191 HGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQ 250
           +GC+GGLMD AFEYII NKG+ TEA YPY+  + TC      +   +++ Y D+  GDE 
Sbjct: 176 NGCNGGLMDYAFEYIINNKGIDTEASYPYQTAQYTCQYNPANS-GGSLTSYTDVSSGDEN 234

Query: 251 ALLQAVSNQPVSVCVDASGRAFHFYKSGVL--NADCGNNCDHGVAVVGFGTAEEENGAKY 308
           ALL AV+ +P SV +DAS  +F FY  GV   +A      DHGV  VG+GT   E+G  Y
Sbjct: 235 ALLNAVATEPTSVAIDASHNSFQFYSGGVYYESACSSTQLDHGVLAVGWGT---EDGQDY 291

Query: 309 WLIKNSWGETWGESGYIRILRD-AGLCGIATAASYPVA 345
           WL+KNSWG  WG +GYI++ R+ +  CGIAT+ASYP A
Sbjct: 292 WLVKNSWGADWGLAGYIKMARNRSNNCGIATSASYPTA 329


>gi|326497561|dbj|BAK05870.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 340

 Score =  274 bits (700), Expect = 5e-71,   Method: Compositional matrix adjust.
 Identities = 145/340 (42%), Positives = 205/340 (60%), Gaps = 18/340 (5%)

Query: 10  IIPMFVIIILVITCASQVVSGRSMH--EPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFK 67
           +IP+ V++   +  A    SG  +   +  ++++  QW A H R+Y    E+  R  +++
Sbjct: 11  VIPILVLLTGGLFAAFPAASGGRVDAGDMLMMDRFRQWQATHNRSYLSAEERLRRFEVYR 70

Query: 68  QNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQN 127
            N+EYI+  N+ G  TY+LG N+F+DLT EEF A Y G      +++  +    + +   
Sbjct: 71  TNVEYIDATNRRGGLTYELGENQFADLTGEEFLARYAG-GHTGSAITTAAEADGSLE--- 126

Query: 128 VTDVPTSIDWREKGAVTHIKDQG-QCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDC 186
             D P S+DWR KGAVT +K+QG QC SCWAFSAVA +E +  I  GKL+ LSEQQLVDC
Sbjct: 127 -ADPPASVDWRAKGAVTPVKNQGSQCYSCWAFSAVATMESLYFIKTGKLVALSEQQLVDC 185

Query: 187 STDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPK 246
              + GC+ G   +AF++I+EN G+ T A YPY+   G C   K    A TI+ +  + K
Sbjct: 186 DKYDGGCNKGYYHRAFQWIMENGGITTAAQYPYKAVRGACSAAKP---AVTITGHLAVAK 242

Query: 247 GDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGA 306
            +E AL  AV+ QP+ V ++    +  FYKSGV +A CG    H V  VG+G   + +G 
Sbjct: 243 -NELALQSAVARQPIGVAIEVP-ISMQFYKSGVFSAACGIQMSHAVVTVGYGA--DASGL 298

Query: 307 KYWLIKNSWGETWGESGYIRILRD---AGLCGIATAASYP 343
           KYWL+KNSWG+TWGE+GYIR+ RD    GLCGIA   +YP
Sbjct: 299 KYWLVKNSWGQTWGEAGYIRMRRDVGGGGLCGIALDTAYP 338


>gi|91992508|gb|ABE72970.1| cathepsin L [Aedes aegypti]
          Length = 339

 Score =  274 bits (700), Expect = 6e-71,   Method: Compositional matrix adjust.
 Identities = 151/317 (47%), Positives = 200/317 (63%), Gaps = 18/317 (5%)

Query: 43  EQWMA---QHGRTYKDELEKAMRLNIFKQNLEYIEKANKE---GNRTYKLGTNEFSDLTN 96
           E+W A   QH + Y  E E+ +RL I+ QN   I K N+    G   Y+L  N+++DL +
Sbjct: 25  EEWNAFKLQHRKNYDSETEERIRLKIYVQNKHKIAKHNQRFDLGQEKYRLRVNKYADLLH 84

Query: 97  EEFRALYTGYNRPVPSVSRQSSR---PSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCG 153
           EEF     G+NR     S +  R   P TF      +VPT++DWR+KGAVT +KDQG CG
Sbjct: 85  EEFVQTVNGFNRTDSKKSLKGVRIEEPVTFIEPANVEVPTTVDWRKKGAVTPVKDQGHCG 144

Query: 154 SCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD--NHGCSGGLMDKAFEYIIENKGL 211
           SCW+FSA  A+EG      GKL+ LSEQ LVDCS    N+GC+GG+MD AF+YI +N G+
Sbjct: 145 SCWSFSATGALEGQHFRKTGKLVSLSEQNLVDCSGKYGNNGCNGGMMDYAFQYIKDNGGI 204

Query: 212 ATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSN-QPVSVCVDASGR 270
            TE  YPY   + TC +   KAV AT   Y D+P+GDE+AL +A++   PVS+ +DAS  
Sbjct: 205 DTEKSYPYEAIDDTC-HFNPKAVGATDKGYVDIPQGDEEALKKALATVGPVSIAIDASHE 263

Query: 271 AFHFYKSGV-LNADCGN-NCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRIL 328
           +F FY  GV     C + N DHGV  VG+GT+EE  G  YWL+KNSWG TWG+ GY+++ 
Sbjct: 264 SFQFYSEGVYYEPQCDSENLDHGVLAVGYGTSEE--GEDYWLVKNSWGTTWGDQGYVKMA 321

Query: 329 RDA-GLCGIATAASYPV 344
           R+    CG+AT ASYP+
Sbjct: 322 RNHDNHCGVATCASYPL 338


>gi|195484843|ref|XP_002090843.1| GE12574 [Drosophila yakuba]
 gi|194176944|gb|EDW90555.1| GE12574 [Drosophila yakuba]
          Length = 341

 Score =  273 bits (699), Expect = 6e-71,   Method: Compositional matrix adjust.
 Identities = 147/319 (46%), Positives = 200/319 (62%), Gaps = 15/319 (4%)

Query: 38  IVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANK---EGNRTYKLGTNEFSDL 94
           ++E+   +  +H + Y+D+ E+  RL IF +N   I K N+   EG  ++KL  N+++DL
Sbjct: 25  VMEEWHTFKLEHRKNYQDDTEERFRLKIFNENKHKIAKHNQRFAEGKVSFKLAVNKYADL 84

Query: 95  TNEEFRALYTGYNRPVPSVSR---QSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQ 151
            + EFR L  G+N  +    R    S +  TF       +P S+DWR KGAVT +KDQG 
Sbjct: 85  LHHEFRQLMNGFNYTLHKQLRATDDSFKGVTFISPAHVTLPKSVDWRSKGAVTAVKDQGH 144

Query: 152 CGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD--NHGCSGGLMDKAFEYIIENK 209
           CGSCWAFS+  A+EG      G L+ LSEQ LVDCST   N+GC+GGLMD AF YI +N 
Sbjct: 145 CGSCWAFSSTGALEGQHFRKSGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKDNG 204

Query: 210 GLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSN-QPVSVCVDAS 268
           G+ TE  YPY   + +C   K   + AT   + D+P+GDE+ + +AV+   PVSV +DAS
Sbjct: 205 GIDTEKSYPYEAIDDSCHFNK-GTIGATDRGFTDIPQGDEKKMAEAVATVGPVSVAIDAS 263

Query: 269 GRAFHFYKSGVLN-ADC-GNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIR 326
             +F FY  GV N   C   N DHGV VVGFGT  +E+G  YWL+KNSWG TWG+ G+I+
Sbjct: 264 HESFQFYSEGVYNEPQCDAQNLDHGVLVVGFGT--DESGDDYWLVKNSWGTTWGDKGFIK 321

Query: 327 ILRDA-GLCGIATAASYPV 344
           +LR+    CGIA+A+SYP+
Sbjct: 322 MLRNKDNQCGIASASSYPL 340


>gi|125606204|gb|EAZ45240.1| hypothetical protein OsJ_29883 [Oryza sativa Japonica Group]
          Length = 350

 Score =  273 bits (699), Expect = 6e-71,   Method: Compositional matrix adjust.
 Identities = 147/325 (45%), Positives = 192/325 (59%), Gaps = 20/325 (6%)

Query: 38  IVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNE 97
           ++++ EQWM +HGR Y D  EK  R  ++++N+E +E  N   N  YKL  N+F+DLTNE
Sbjct: 27  MLDRFEQWMIRHGRAYTDAGEKQRRFEVYRRNVELVETFNSMSN-GYKLADNKFADLTNE 85

Query: 98  EFRALYTGYNRP---VPSVSRQSSRPSTFKYQNVTDV-PTSIDWREKGAVTH-IKDQGQC 152
           EFRA   G+ RP   +P +S   S       ++  D+ P S+DWR KGAV +  K     
Sbjct: 86  EFRAKMLGF-RPHVTIPQISNTCSADIAMPGESSDDILPKSVDWRNKGAVINRWKICVDA 144

Query: 153 GSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTDNHGCSGGLMDKAFEYIIENKGLA 212
           GSCWAFSAVAA+EGI QI  G+L+ LSEQ+LVDC  +  GC GG M  AFE+++ N GL 
Sbjct: 145 GSCWAFSAVAAIEGINQIKNGELVSLSEQELVDCDDEAVGCGGGYMSWAFEFVVGNHGLT 204

Query: 213 TEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAF 272
           TEA YPY    G C   K    A  I+ Y ++    E  L +A + QPVSV VD     F
Sbjct: 205 TEASYPYHAANGACQAAKLNQSAVAIAGYRNVTPSSEPDLARAAAAQPVSVAVDGGSFMF 264

Query: 273 HFYKSGVLNADCGNNCDHGVAVVGFGTAEEEN--------GAKYWLIKNSWGETWGESGY 324
             Y SGV    C  + +HGV VVG+G +E +         G KYW++KNSWG  WG++GY
Sbjct: 265 QLYGSGVYTGPCTADVNHGVTVVGYGESEPKTDGGGAAKGGEKYWIVKNSWGAEWGDAGY 324

Query: 325 IRILRD-----AGLCGIATAASYPV 344
           I + RD     +GLCGIA   SYPV
Sbjct: 325 ILMQRDVAGLASGLCGIALLPSYPV 349


>gi|146152090|gb|ABQ08058.1| cathepsin L [Misgurnus mizolepis]
          Length = 337

 Score =  273 bits (699), Expect = 8e-71,   Method: Compositional matrix adjust.
 Identities = 151/342 (44%), Positives = 209/342 (61%), Gaps = 16/342 (4%)

Query: 13  MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEY 72
           M+  + L   C S V +  S+ +  + +  EQW   HG+ Y  E E+  R  I+++NL  
Sbjct: 1   MWTYLALFTLCLSGVFAAPSL-DKQLDDHWEQWKTWHGKNYH-EKEEGWRRMIWEKNLRK 58

Query: 73  IEKANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
           I+  N E   G  TY+LG N F D+ +EEFR +  GY       + +  + S F   N  
Sbjct: 59  IQFHNLEHSMGIHTYRLGMNHFGDMNHEEFRQVMNGYKHK----TERKFKGSLFMEPNFL 114

Query: 130 DVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD 189
           +VP+ +DWREKG VT +KDQG+CGSCWAFS   A+EG     +GKL+ LSEQ LVDCS  
Sbjct: 115 EVPSKLDWREKGYVTPVKDQGECGSCWAFSTTGAMEGQMFRKQGKLVSLSEQNLVDCSRP 174

Query: 190 --NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKG 247
             N GC+GGLMD+AF+YI +N GL +E  YPY   +    +   K  AA  + + D+P G
Sbjct: 175 EGNEGCNGGLMDQAFQYIKDNNGLDSEEAYPYLGTDDQPCHYDPKYNAANDTGFVDIPSG 234

Query: 248 DEQALLQAVSN-QPVSVCVDASGRAFHFYKSGV-LNADCGN-NCDHGVAVVGFG-TAEEE 303
            E AL++AV++  PVSV +DA   +F FY+SG+    +C +   DHGV VVG+G   E+ 
Sbjct: 235 KEHALMKAVASVGPVSVAIDAGHESFQFYQSGIYFEKECSSEELDHGVLVVGYGFEGEDV 294

Query: 304 NGAKYWLIKNSWGETWGESGYIRILRD-AGLCGIATAASYPV 344
           +G KYW++KNSW E+WG+ GYI + +D    CGIATAASYP+
Sbjct: 295 DGKKYWIVKNSWSESWGDKGYIYMAKDRKNHCGIATAASYPL 336


>gi|195334204|ref|XP_002033774.1| GM21500 [Drosophila sechellia]
 gi|194125744|gb|EDW47787.1| GM21500 [Drosophila sechellia]
          Length = 341

 Score =  273 bits (698), Expect = 8e-71,   Method: Compositional matrix adjust.
 Identities = 146/319 (45%), Positives = 201/319 (63%), Gaps = 15/319 (4%)

Query: 38  IVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANK---EGNRTYKLGTNEFSDL 94
           ++E+   +  +H + Y+D+ E+  RL IF +N   I K N+   EG  ++KL  N+++DL
Sbjct: 25  VMEEWHTFKLEHRKNYQDDTEERFRLKIFNENKHKIAKHNQRFAEGKVSFKLAVNKYADL 84

Query: 95  TNEEFRALYTGYNRPVPSVSR---QSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQ 151
            + EFR L  G+N  +    R   +S +  TF       +P S+DWR KGAVT +KDQG 
Sbjct: 85  LHHEFRQLMNGFNYTLHKQLRAADESFKGVTFISPAHVTLPKSVDWRTKGAVTAVKDQGH 144

Query: 152 CGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD--NHGCSGGLMDKAFEYIIENK 209
           CGSCWAFS+  A+EG      G L+ LSEQ LVDCST   N+GC+GGLMD AF YI +N 
Sbjct: 145 CGSCWAFSSTGALEGQHFRKSGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKDNG 204

Query: 210 GLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSN-QPVSVCVDAS 268
           G+ TE  YPY   + +C   K   + AT   + D+P+GDE+ + +AV+   PV+V +DAS
Sbjct: 205 GIDTEKSYPYEAIDDSCHFNK-GTIGATDRGFTDIPQGDEKKMAEAVATVGPVAVAIDAS 263

Query: 269 GRAFHFYKSGVLN-ADC-GNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIR 326
             +F FY  GV N   C   N DHGV VVGFGT  +E+G  YWL+KNSWG TWG+ G+I+
Sbjct: 264 HESFQFYSEGVYNEPQCDAQNLDHGVLVVGFGT--DESGEDYWLVKNSWGTTWGDKGFIK 321

Query: 327 ILRDA-GLCGIATAASYPV 344
           +LR+    CGIA+A+SYP+
Sbjct: 322 MLRNKENQCGIASASSYPL 340


>gi|326490904|dbj|BAJ90119.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 457

 Score =  273 bits (698), Expect = 9e-71,   Method: Compositional matrix adjust.
 Identities = 137/316 (43%), Positives = 190/316 (60%), Gaps = 16/316 (5%)

Query: 41  KHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANK------EGNRTYKLGTNEFSDL 94
           + E W A+HG+ Y    E+A RL  F +N  ++   N        G  +Y L  N F+DL
Sbjct: 38  QFEAWCAEHGKAYATPGERAARLAAFAENAAFVAAHNDAVASSGPGGPSYTLALNAFADL 97

Query: 95  TNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQN-VTDVPTSIDWREKGAVTHIKDQGQCG 153
           T++EFRA   G     P      S PS   ++  V  VP ++DWR+ GAVT +KDQG CG
Sbjct: 98  THDEFRAARLGRLAVGPGPLGAPS-PSDGGFEGRVGAVPDALDWRQSGAVTKVKDQGSCG 156

Query: 154 SCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGLA 212
           +CW+FSA  A+EGI +IT G L+ LSEQ+L+DC    N GC GGLM  A++++I+N G+ 
Sbjct: 157 ACWSFSATGAMEGINKITTGSLLSLSEQELIDCDRSYNTGCGGGLMTYAYKFVIKNGGID 216

Query: 213 TEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAF 272
           TE DYP+R  +GTC+  K K    TI  Y+++P   E  LLQAV+ QP+SV +  S RAF
Sbjct: 217 TEDDYPFREADGTCNKNKLKKHVVTIDGYKEVPSSKEDLLLQAVAQQPISVGICGSARAF 276

Query: 273 HFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD-- 330
             Y  G+ +  C  + DH V +VG+G+   E G  YW++KNSWGE WG  GY+ + R+  
Sbjct: 277 QLYSQGIFDGPCPTSLDHAVLIVGYGS---EGGKDYWIVKNSWGERWGMKGYMHMHRNTG 333

Query: 331 --AGLCGIATAASYPV 344
             +G+CGI   AS+P 
Sbjct: 334 SSSGICGINMMASFPT 349


>gi|116794072|gb|ABK26996.1| unknown [Picea sitchensis]
          Length = 367

 Score =  273 bits (697), Expect = 1e-70,   Method: Compositional matrix adjust.
 Identities = 142/324 (43%), Positives = 190/324 (58%), Gaps = 21/324 (6%)

Query: 38  IVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNE 97
           +V   ++W+ +HG+ Y    EKA RL IF+ NL+YI   NK  N +++LG N+F+DLTNE
Sbjct: 39  LVRLFDRWLGRHGKLYGSHEEKARRLQIFRTNLQYIHAHNKNSNSSFRLGLNKFADLTNE 98

Query: 98  EFRALYTGYNRPVPSVSRQSS------RP----STFKYQNVTDVPTSIDWREKGAVTHIK 147
           EF+  Y G N       R++       RP    +     +   + +S+DWR+KGAVT +K
Sbjct: 99  EFKTRYFGKNSKQWRDRRRTELEGAELRPVLKQTVGSQSSSCSIASSLDWRKKGAVTGVK 158

Query: 148 DQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTDNHGCSGGLMDKAFEYIIE 207
           DQ QCGSCWAFS   A+EG+  I+ GKL+ LSEQ+LV C   N+GC GG MD AF ++I+
Sbjct: 159 DQAQCGSCWAFSTTGAIEGVNFISTGKLVSLSEQELVACDATNYGCEGGDMDYAFTWVIQ 218

Query: 208 NKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDA 267
           N G+ TE DY Y   + TC+  KE     +I  Y D+   D+ ALL A  +QPVSV +D 
Sbjct: 219 NGGIDTEKDYSYTGVDSTCNTNKEAKKIVSIDGYTDVSP-DDSALLCAAGSQPVSVGIDG 277

Query: 268 SGRAFHFYKSGVLNADCGNN---CDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGY 324
           S   F  Y  G+ + DC  N    DH V VVG+     +NG  YW++KNSWG  WG  GY
Sbjct: 278 SAIDFQLYTGGIYDGDCSGNPDDIDHAVLVVGY---SAKNGKDYWIVKNSWGTDWGLEGY 334

Query: 325 IRILRDA----GLCGIATAASYPV 344
             ILR+     G+C I   ASYP 
Sbjct: 335 FYILRNTELPYGVCAINAMASYPT 358


>gi|333069454|gb|AEF13978.1| chymopapain [Carica papaya]
          Length = 352

 Score =  273 bits (697), Expect = 1e-70,   Method: Compositional matrix adjust.
 Identities = 139/345 (40%), Positives = 203/345 (58%), Gaps = 14/345 (4%)

Query: 7   KSFIIPMFVIIILVITCASQVVSGRSMHEPSIVEK----HEQWMAQHGRTYKDELEKAMR 62
           K   +   +II + ++ A     G S  + + +E+     + WM +H + Y+   EK  R
Sbjct: 9   KIIFLATCLIIHMSLSSADFYTVGYSQDDLTSIERLIQLFDSWMLKHNKIYESIDEKIYR 68

Query: 63  LNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPST 122
             IF+ NL YI++ NK+ N +Y LG N F+DL+N+EF+  Y G +        +      
Sbjct: 69  FEIFRDNLMYIDETNKK-NNSYWLGLNGFADLSNDEFKKKYVG-SVAEDFTGLEHFDNED 126

Query: 123 FKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQ 182
           F Y++VT+ P SIDWR KGAVT +K+QG CGSCWAFS +A VEG+ +I  G L+ELSEQ+
Sbjct: 127 FTYKHVTNYPQSIDWRAKGAVTPVKNQGSCGSCWAFSTIATVEGVNKIVTGNLLELSEQE 186

Query: 183 LVDCSTDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYE 242
           LVDC  ++HGC GG    + +Y+ +N G+ T   YPY+ +   C    +      I+ Y+
Sbjct: 187 LVDCDKNSHGCKGGYQTTSLQYVADN-GVHTSKVYPYQAKAMQCRATDKPGPKVKITGYK 245

Query: 243 DLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEE 302
            +P   E + L A++NQP+SV V+A G+ F  YKSGV +  CG   DH V  VG+GT++ 
Sbjct: 246 RVPSNCETSFLGALANQPLSVLVEAGGKPFQLYKSGVFDGPCGTKLDHAVTAVGYGTSD- 304

Query: 303 ENGAKYWLIKNSWGETWGESGYIRILRDA----GLCGIATAASYP 343
             G  Y +IKNSWG  WGE GY+R+ R +    G CG+  ++ YP
Sbjct: 305 --GKNYIIIKNSWGPNWGEKGYMRLKRQSGNSQGTCGVYKSSYYP 347


>gi|307110445|gb|EFN58681.1| hypothetical protein CHLNCDRAFT_56822 [Chlorella variabilis]
          Length = 466

 Score =  273 bits (697), Expect = 1e-70,   Method: Compositional matrix adjust.
 Identities = 148/349 (42%), Positives = 205/349 (58%), Gaps = 23/349 (6%)

Query: 13  MFVIIILVITCASQVVSGRSMHE----------PSIVEKHEQWMAQHGRTYKDELEKAMR 62
           M +  +L++ C+   V+     E           S  E  + W+    R Y    E   R
Sbjct: 1   MRLSCVLLVACSCLAVAAGFPFENHRLFIQQAVESPREAFDFWVQTLKRAYASAEEYERR 60

Query: 63  LNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPST 122
            +++  NL ++ + N  G+ ++ L    ++DL+ +E+R+   GYN  +     +  R + 
Sbjct: 61  FDVWLDNLRFVHEYNA-GHTSHWLSMGVYADLSQDEYRSKALGYNADLHE--ERPLRAAP 117

Query: 123 FKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQ 182
           F Y+  T  P  +DW  KGAVT +K+Q  CGSCWAFS   AVEG + I  GKL  LSEQ 
Sbjct: 118 FLYEG-TVPPKEVDWVAKGAVTPVKNQLLCGSCWAFSTTGAVEGASAIATGKLASLSEQM 176

Query: 183 LVDCSTD-NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKY 241
           LVDC  + ++GC GGLMD AFE+I++N G+ TE DYPY  EEG C + K +    TI  Y
Sbjct: 177 LVDCDRERDNGCHGGLMDFAFEFIMKNGGIDTEDDYPYTAEEGMCQDNKMRRHVVTIDDY 236

Query: 242 EDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAE 301
           +D+P  DE AL++AV+NQPVSV ++A  RAF  Y  GV +A+CG   DHGV VVG+GTA 
Sbjct: 237 QDVPPNDEHALMKAVANQPVSVAIEADQRAFQLYGGGVFDAECGTALDHGVLVVGYGTA- 295

Query: 302 EENGA---KYWLIKNSWGETWGESGYIRILRDA---GLCGIATAASYPV 344
             NG     YWL+KNSWG  WG+ GYIR+LR+    G CG+A  AS+P+
Sbjct: 296 -SNGTHHLPYWLVKNSWGAEWGDKGYIRLLRNLGEEGQCGVAMQASFPI 343


>gi|195124431|ref|XP_002006696.1| GI21205 [Drosophila mojavensis]
 gi|193911764|gb|EDW10631.1| GI21205 [Drosophila mojavensis]
          Length = 339

 Score =  272 bits (696), Expect = 1e-70,   Method: Compositional matrix adjust.
 Identities = 150/340 (44%), Positives = 211/340 (62%), Gaps = 21/340 (6%)

Query: 18  ILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKAN 77
           +L +   +Q VS   +    I E+   +  +H +TY+DE E+  RL IF +N   I K N
Sbjct: 7   LLALVAVAQAVSFADV----IKEEWHTFKLEHRKTYQDETEERFRLKIFNENKHKIAKHN 62

Query: 78  KE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPS----TFKYQNVTD 130
           +    G  T+K+  N+++D+ + EFR    G+N  +    R +S PS    TF       
Sbjct: 63  QRYATGEVTFKMAVNKYADMLHHEFRETMNGFNYTLHKELR-ASDPSFTGITFISPAHVK 121

Query: 131 VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD- 189
           +P S+DWREKGAVT +KDQG CGSCWAFS+  A+EG      G L+ LSEQ LVDCS   
Sbjct: 122 LPKSVDWREKGAVTAVKDQGHCGSCWAFSSTGALEGQHFRKTGTLVSLSEQNLVDCSAKY 181

Query: 190 -NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGD 248
            N+GC+GGLMD AF YI +N G+ TE  YPY   + +C   K+ +V AT   + D+P+G+
Sbjct: 182 GNNGCNGGLMDNAFRYIKDNGGIDTEKSYPYEGIDDSCHFNKD-SVGATDRGFADIPQGN 240

Query: 249 EQALLQAVSN-QPVSVCVDASGRAFHFYKSGVLN-ADCGN-NCDHGVAVVGFGTAEEENG 305
           E+ + +AV+   PVSV +DAS  +F FY  G+ N  +C + N DHGV VVG+GT  +E+G
Sbjct: 241 EKKMAEAVATIGPVSVAIDASHESFQFYSEGIYNEPECNSQNLDHGVLVVGYGT--DESG 298

Query: 306 AKYWLIKNSWGETWGESGYIRILRDA-GLCGIATAASYPV 344
             YWL+KNSWG TWG+ G+I++ R+    CGIA+A+SYP+
Sbjct: 299 KDYWLVKNSWGTTWGDKGFIKMARNEDNQCGIASASSYPL 338


>gi|320164780|gb|EFW41679.1| cathepsin L2 [Capsaspora owczarzaki ATCC 30864]
          Length = 334

 Score =  272 bits (696), Expect = 2e-70,   Method: Compositional matrix adjust.
 Identities = 147/313 (46%), Positives = 193/313 (61%), Gaps = 21/313 (6%)

Query: 43  EQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRAL 102
           E W    G++Y D +E+  R  +++ N   ++  N  G  +Y LG N F+DLT+EEF+  
Sbjct: 31  EAWKRTFGKSYSDAVEEINRRAVWEANKMLVDAHNGAGIHSYTLGMNIFADLTHEEFKRF 90

Query: 103 YTG----YNRPVPSVSRQSSRPSTF-KYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWA 157
           Y G     NRP      +S+  STF    NV  +P S+DWR  G VT +KDQGQCGSCW+
Sbjct: 91  YLGTKVDLNRP------RSNFSSTFIPTANVGALPDSVDWRTAGIVTPVKDQGQCGSCWS 144

Query: 158 FSAVAAVEGITQITRGKLIELSEQQLVDCST--DNHGCSGGLMDKAFEYIIENKGLATEA 215
           FS   +VEG      G+L+ LSEQ LVDCS    N GC+GGLMD AF+YII NKG+ TEA
Sbjct: 145 FSTTGSVEGQHARKTGQLVSLSEQNLVDCSKAQGNQGCNGGLMDDAFQYIITNKGIDTEA 204

Query: 216 DYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSN-QPVSVCVDASGRAFHF 274
            YPY  ++GTC       V AT+S ++D+ +G E  L  AV+   PVSV +DAS  +F  
Sbjct: 205 SYPYTAKDGTCKFNAAN-VGATLSSFQDITRGSESDLQNAVATVGPVSVAIDASKNSFQL 263

Query: 275 YKSGVLNAD--CGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRDA- 331
           Y SGV N       + DHGV   G+GT+   NG  YWL+KNSWG +WG++GYI + R+A 
Sbjct: 264 YTSGVYNEKKCSSTSLDHGVLAAGYGTS---NGTPYWLVKNSWGSSWGQAGYIWMSRNAN 320

Query: 332 GLCGIATAASYPV 344
             CGIAT+ASYP+
Sbjct: 321 NQCGIATSASYPI 333


>gi|194883222|ref|XP_001975702.1| GG20414 [Drosophila erecta]
 gi|190658889|gb|EDV56102.1| GG20414 [Drosophila erecta]
          Length = 341

 Score =  272 bits (696), Expect = 2e-70,   Method: Compositional matrix adjust.
 Identities = 146/319 (45%), Positives = 201/319 (63%), Gaps = 15/319 (4%)

Query: 38  IVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANK---EGNRTYKLGTNEFSDL 94
           ++E+   +  +H + Y+D+ E+  RL IF +N   I K N+   EG  ++KL  N+++DL
Sbjct: 25  VMEEWHTFKLEHRKNYQDDTEERFRLKIFNENKHKIAKHNQRYAEGKVSFKLAVNKYADL 84

Query: 95  TNEEFRALYTGYNRPVPSVSR---QSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQ 151
            + EFR L  G+N  +    R    S +  TF       +P S+DWR KGAVT +KDQG 
Sbjct: 85  LHHEFRQLMNGFNYTLHKQLRSTDDSFKGVTFISPAHVTLPKSVDWRTKGAVTAVKDQGH 144

Query: 152 CGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD--NHGCSGGLMDKAFEYIIENK 209
           CGSCWAFS+  A+EG      G L+ LSEQ LVDCST   N+GC+GGLMD AF YI +N 
Sbjct: 145 CGSCWAFSSTGALEGQHFRKSGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKDNG 204

Query: 210 GLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSN-QPVSVCVDAS 268
           G+ TE  YPY   + +C   K  A+ AT   + D+P+GDE+ + +AV+   PV+V +DAS
Sbjct: 205 GIDTEKSYPYEAIDDSCHFNK-GAIGATDRGFTDIPQGDEKKMAEAVATVGPVAVAIDAS 263

Query: 269 GRAFHFYKSGVLN-ADC-GNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIR 326
             +F FY  GV N   C   N DHGV VVG+GT  +E+G  YWL+KNSWG TWG+ G+I+
Sbjct: 264 HESFQFYSEGVYNEPQCDAQNLDHGVLVVGYGT--DESGDDYWLVKNSWGTTWGDKGFIK 321

Query: 327 ILRDA-GLCGIATAASYPV 344
           +LR+    CGIA+A+SYP+
Sbjct: 322 MLRNKDNQCGIASASSYPL 340


>gi|115464789|ref|NP_001055994.1| Os05g0508300 [Oryza sativa Japonica Group]
 gi|48475189|gb|AAT44258.1| hypothetical protein [Oryza sativa Japonica Group]
 gi|113579545|dbj|BAF17908.1| Os05g0508300 [Oryza sativa Japonica Group]
          Length = 450

 Score =  272 bits (695), Expect = 2e-70,   Method: Compositional matrix adjust.
 Identities = 136/307 (44%), Positives = 186/307 (60%), Gaps = 9/307 (2%)

Query: 41  KHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFR 100
           + E W A+HGR+Y    E+A RL  F  N  ++  A+     +Y L  N F+DLT++EFR
Sbjct: 37  QFEAWCAEHGRSYATPGERAARLAAFADNAAFV-AAHNGAPASYALALNAFADLTHDEFR 95

Query: 101 ALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSA 160
           A   G         R    P       V  VP ++DWR+ GAVT +KDQG CG+CW+FSA
Sbjct: 96  AARLGRLAAAGGPGRDGGAPYLGVDGGVGAVPDAVDWRQSGAVTKVKDQGSCGACWSFSA 155

Query: 161 VAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGLATEADYPY 219
             A+EGI +I  G LI LSEQ+L+DC    N GC GGLMD A++++++N G+ TEADYPY
Sbjct: 156 TGAMEGINKIKTGSLISLSEQELIDCDRSYNSGCGGGLMDYAYKFVVKNGGIDTEADYPY 215

Query: 220 RHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGV 279
           R  +GTC+  K K    TI  Y+D+P  +E  LLQAV+ QPVSV +  S RAF  Y  G+
Sbjct: 216 RETDGTCNKNKLKRRVVTIDGYKDVPANNEDMLLQAVAQQPVSVGICGSARAFQLYSKGI 275

Query: 280 LNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRDA----GLCG 335
            +  C  + DH + +VG+G+   E G  YW++KNSWGE+WG  GY+ + R+     G+CG
Sbjct: 276 FDGPCPTSLDHAILIVGYGS---EGGKDYWIVKNSWGESWGMKGYMYMHRNTGNSNGVCG 332

Query: 336 IATAASY 342
           I    S+
Sbjct: 333 INQMPSF 339


>gi|156124998|gb|ABU50817.1| Ale o 1 allergen [Aleuroglyphus ovatus]
          Length = 337

 Score =  272 bits (695), Expect = 2e-70,   Method: Compositional matrix adjust.
 Identities = 151/329 (45%), Positives = 204/329 (62%), Gaps = 24/329 (7%)

Query: 32  SMHEPSIV-------EKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKE---GN 81
           S H PS++        + EQ+ +  GR Y     +  R +IF+ NL++I + N +   G+
Sbjct: 16  SAHIPSMLLTEGELEAQFEQFKSTFGRVYPSPEIELHRKSIFRANLQFILRHNIDYFNGD 75

Query: 82  RTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKG 141
            T+ +  N F+DL+NEEFRA + GY R    ++  S   S     +V  +P ++DW  KG
Sbjct: 76  STFSVSVNNFTDLSNEEFRATFNGYRR----LAAVSLADSVHADNDVEALPATVDWTTKG 131

Query: 142 AVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST--DNHGCSGGLMD 199
            VT IK+Q QCGSCWAFSAVA++EG   +  GKL+ LSEQ LVDCS    + GCSGG MD
Sbjct: 132 VVTPIKNQQQCGSCWAFSAVASMEGQHALKTGKLVSLSEQNLVDCSAAEGDMGCSGGWMD 191

Query: 200 KAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSN- 258
            AF+Y+I+N+G+ TEA YPY+  + +C+  K  +V ATI  + D+  GDE AL  AV++ 
Sbjct: 192 YAFKYVIQNRGIDTEASYPYKAIDESCE-FKRNSVGATIHSFVDVKTGDESALQNAVASI 250

Query: 259 QPVSVCVDASGRAFHFYKSGVLN-ADCGNN-CDHGVAVVGFGTAEEENGAKYWLIKNSWG 316
            P+SV +DA+  +F FY SGV N  DC     DHGV  VG+GT    NGA YW +KNSWG
Sbjct: 251 GPISVAIDAAQPSFQFYSSGVYNEPDCSTEILDHGVTAVGYGTL---NGAPYWKVKNSWG 307

Query: 317 ETWGESGYIRILRDA-GLCGIATAASYPV 344
            +WG  GYI + R+    CGIAT ASYPV
Sbjct: 308 TSWGRKGYIFMSRNKQNQCGIATKASYPV 336


>gi|242040563|ref|XP_002467676.1| hypothetical protein SORBIDRAFT_01g032090 [Sorghum bicolor]
 gi|241921530|gb|EER94674.1| hypothetical protein SORBIDRAFT_01g032090 [Sorghum bicolor]
          Length = 358

 Score =  272 bits (695), Expect = 2e-70,   Method: Compositional matrix adjust.
 Identities = 138/311 (44%), Positives = 197/311 (63%), Gaps = 10/311 (3%)

Query: 38  IVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNE 97
           ++++  +W A + R+Y    E+  R  ++++N+E+IE  N+ GN TY LG N+F+DLT E
Sbjct: 53  MMDRFLRWQATYNRSYPTAEERQRRFQVYRRNMEHIEATNRAGNLTYTLGENQFADLTEE 112

Query: 98  EFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQG-QCGSCW 156
           EF  LYT   + +P V R + +     + +V D PTS+DWR +GAVT IK+QG  C SCW
Sbjct: 113 EFLDLYT--MKGMPPVRRDAGKKQQANFSSVVDAPTSVDWRSRGAVTPIKNQGPSCSSCW 170

Query: 157 AFSAVAAVEGITQITRGKLIELSEQQLVDCSTDNHGCSGGLMDKAFEYIIENKGLATEAD 216
           AF   A +E ITQI  GKL+ LSEQ+L+DC   + GC+ G     ++++I+N GL TEA+
Sbjct: 171 AFVTAATIESITQIRTGKLVSLSEQELIDCDPYDGGCNLGYFVNGYKWVIQNGGLTTEAN 230

Query: 217 YPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYK 276
           YPY+     C+  K    AA IS Y  LP+G E  L QAV+ QPV+  ++  G +  FY 
Sbjct: 231 YPYQARRYQCNRSKAGQRAARISNYRQLPQG-EAQLQQAVAQQPVAAAIEMGG-SLQFYS 288

Query: 277 SGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRI---LRDAGL 333
            GV +  CG   +H + VVG+G   + +G KYWL+KNSWG+TWGE GY+R+   +R  GL
Sbjct: 289 GGVWSGQCGTRMNHAITVVGYGA--DSSGVKYWLVKNSWGQTWGERGYLRMRKDVRQGGL 346

Query: 334 CGIATAASYPV 344
           CGIA   +YP+
Sbjct: 347 CGIALDLAYPI 357


>gi|312381833|gb|EFR27483.1| hypothetical protein AND_05794 [Anopheles darlingi]
          Length = 344

 Score =  271 bits (694), Expect = 2e-70,   Method: Compositional matrix adjust.
 Identities = 150/322 (46%), Positives = 200/322 (62%), Gaps = 23/322 (7%)

Query: 43  EQWMA---QHGRTYKDELEKAMRLNIFKQNLEYIEKANKE---GNRTYKLGTNEFSDLTN 96
           E+W A   QH + Y  E E+ +R+ I+ QN   I K N+    G   ++L  N+++DL +
Sbjct: 25  EEWNAFKLQHRKKYDSESEERIRMKIYVQNKHKIAKHNQRYDLGQEKFRLRVNKYADLLH 84

Query: 97  EEFRALYTGYNRPVPSVSRQSSR--------PSTFKYQNVTDVPTSIDWREKGAVTHIKD 148
           EEF     G+NR   + S+   R        P T+      DVPT+IDWREKGAVT +KD
Sbjct: 85  EEFVHTLNGFNRSAAAGSKLLGREQLMTIEEPITWIEPANVDVPTTIDWREKGAVTPVKD 144

Query: 149 QGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD--NHGCSGGLMDKAFEYII 206
           QG CGSCW+FSA  A+EG      GKL+ LSEQ LVDCST   N+GC+GGLMD AF+Y+ 
Sbjct: 145 QGHCGSCWSFSATGALEGQHFRKTGKLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFQYVK 204

Query: 207 ENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSN-QPVSVCV 265
           +NKG+ TE  YPY   +  C +   KA+ AT   + D+P+GDE+AL +A++   PVSV +
Sbjct: 205 DNKGIDTEKAYPYEAIDDEC-HYNPKAIGATDKGFVDIPQGDEKALKKALATVGPVSVAI 263

Query: 266 DASGRAFHFYKSGV-LNADCGN-NCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESG 323
           DAS  +F FY  GV     C +   DHGV  VG+GT E+  G  YWL+KNSWG TWG+ G
Sbjct: 264 DASHESFQFYSEGVYYEPQCDSEQLDHGVLAVGYGTTED--GEDYWLVKNSWGTTWGDQG 321

Query: 324 YIRILRD-AGLCGIATAASYPV 344
           Y+++ R+    CGIAT ASYP+
Sbjct: 322 YVKMARNRENHCGIATTASYPL 343


>gi|156124996|gb|ABU50816.1| Ale o 1 allergen [Aleuroglyphus ovatus]
          Length = 337

 Score =  271 bits (694), Expect = 3e-70,   Method: Compositional matrix adjust.
 Identities = 150/329 (45%), Positives = 204/329 (62%), Gaps = 24/329 (7%)

Query: 32  SMHEPSIV-------EKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKE---GN 81
           S H PS++        + EQ+ +  GR Y     +  R +IF+ NL++I + N +   G+
Sbjct: 16  SAHIPSMLLTEGELEAQFEQFKSTFGRVYPSPEIELHRKSIFRANLQFILRHNIDYFNGD 75

Query: 82  RTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKG 141
            T+ +  N F+DL+NEEFRA + GY R    ++  S   S     +V  +P ++DW  KG
Sbjct: 76  STFSVSVNNFTDLSNEEFRATFNGYRR----LAAVSLADSVHADNDVEALPATVDWTTKG 131

Query: 142 AVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST--DNHGCSGGLMD 199
            VT IK+Q QCGSCWAFSAVA++EG   +  GKL+ LSEQ LVDCS    + GCSGG MD
Sbjct: 132 VVTPIKNQQQCGSCWAFSAVASMEGQHALKTGKLVSLSEQNLVDCSAAEGDMGCSGGWMD 191

Query: 200 KAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSN- 258
            AF+Y+I+N+G+ TEA YPY+  + +C+  K  ++ ATI  + D+  GDE AL  AV++ 
Sbjct: 192 YAFKYVIQNRGIDTEASYPYKAIDESCEF-KRNSIGATIHSFVDVKTGDESALQNAVASI 250

Query: 259 QPVSVCVDASGRAFHFYKSGVLN-ADCGNN-CDHGVAVVGFGTAEEENGAKYWLIKNSWG 316
            P+SV +DAS  +F FY SGV N  DC     DHGV  VG+GT    NG  YW +KNSWG
Sbjct: 251 GPISVAIDASQPSFQFYSSGVYNEPDCSTEILDHGVTAVGYGTL---NGVPYWKVKNSWG 307

Query: 317 ETWGESGYIRILRDA-GLCGIATAASYPV 344
            +WG+ GYI + R+    CGIAT ASYPV
Sbjct: 308 TSWGQKGYIFMSRNKQNQCGIATKASYPV 336


>gi|154183745|gb|ABS70713.1| cathepsin L-like cysteine proteinase [Dermacentor variabilis]
          Length = 333

 Score =  271 bits (693), Expect = 3e-70,   Method: Compositional matrix adjust.
 Identities = 149/341 (43%), Positives = 210/341 (61%), Gaps = 21/341 (6%)

Query: 16  IIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEK 75
           ++ + + CA  VV+  +     +  + E + A H ++Y+  +E+ +R  IF +N   + +
Sbjct: 1   MLRISLLCAFVVVTTAASSHEILRTQWEAFKATHKKSYQSNMEELLRFKIFSENSLLVAR 60

Query: 76  ANKEGNR---TYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTF---KYQNVT 129
            N++  R   +YKLG N+F DL   EF  ++ GY       +R + R STF      N +
Sbjct: 61  HNEKYARGLVSYKLGMNQFGDLLPHEFARMFNGYRG-----ARTAGRGSTFLPPANVNYS 115

Query: 130 DVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD 189
            +P S+DWREKGAVT +K+QGQCGSCWAFS   ++EG   +  G L+ LSEQ LVDCS  
Sbjct: 116 SLPQSMDWREKGAVTPVKNQGQCGSCWAFSTTGSLEGQHFLKTGVLVSLSEQNLVDCSET 175

Query: 190 --NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKG 247
             NHGC GGLMD AF+YI  N G+ TE  YPY  E+G C  +K+  V AT + + D+ +G
Sbjct: 176 FGNHGCEGGLMDNAFQYIKANGGIDTEKSYPYEAEDGECRFKKQN-VGATDTGFVDIEQG 234

Query: 248 DEQALLQAVSN-QPVSVCVDASGRAFHFYKSGVLN-ADCGN-NCDHGVAVVGFGTAEEEN 304
            E  L +AV+   PVSV +DAS  +F  Y  GV +  +C +   DHGV VVG+G    E+
Sbjct: 235 SEDDLKKAVATVGPVSVAIDASHSSFQLYSEGVYDETECSSEQLDHGVLVVGYGV---ED 291

Query: 305 GAKYWLIKNSWGETWGESGYIRILRDA-GLCGIATAASYPV 344
           G KYWL+KNSW E+WG++GYI++ RD    CGIA+AASYP+
Sbjct: 292 GKKYWLVKNSWAESWGDNGYIKMSRDKDNQCGIASAASYPL 332


>gi|449450419|ref|XP_004142960.1| PREDICTED: vignain-like [Cucumis sativus]
          Length = 345

 Score =  271 bits (693), Expect = 3e-70,   Method: Compositional matrix adjust.
 Identities = 141/315 (44%), Positives = 203/315 (64%), Gaps = 10/315 (3%)

Query: 35  EPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDL 94
           E S+ + +E+W  +H    ++  EK  R ++FK+N+ ++   N+  ++ YKL  N+F+D+
Sbjct: 34  EESLWQLYERW-GKHHTISRNLKEKHKRFSVFKENVNHVFTVNQM-DKPYKLKLNKFADM 91

Query: 95  TNEEFRALYTGYN-RPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCG 153
           +N EF   Y   N      +  +      F Y+  TD+P+S+DWRE+GAV  +K+QG+CG
Sbjct: 92  SNYEFVNFYARSNISHYRKLHERRRGAGGFMYEQDTDLPSSVDWRERGAVNAVKEQGRCG 151

Query: 154 SCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTDNHGCSGGLMDKAFEYIIENKGLAT 213
           SCWAFS+VAAVEGI +I   +L+ LSEQ+L+DC+  N GC+GG M+ AF++I  N G+AT
Sbjct: 152 SCWAFSSVAAVEGINKIKTNQLLSLSEQELLDCNYRNKGCNGGFMEIAFDFIKRNGGIAT 211

Query: 214 EADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFH 273
           E  YPY    G C + +  +    I  YE +P+ +E AL+QAV+NQPVSV +DA+GR F 
Sbjct: 212 ENSYPYHGSRGLCRSSRISSPIVKIDGYESVPE-NEDALMQAVANQPVSVAIDAAGRDFQ 270

Query: 274 FYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRDA-- 331
           FY  GV +  CG   +HGV  +G+GT E+  G  YWL++NSWG  WGE GY+R+ R    
Sbjct: 271 FYSQGVFDGYCGTELNHGVVAIGYGTTED--GTDYWLVRNSWGVGWGEDGYVRMKRGVEQ 328

Query: 332 --GLCGIATAASYPV 344
             GLCGIA  ASYP+
Sbjct: 329 AEGLCGIAMEASYPI 343


>gi|16304178|gb|AAL16954.1|AF426414_1 cathepsin L-like cysteine protease precursor [Delia radicum]
          Length = 337

 Score =  271 bits (693), Expect = 3e-70,   Method: Compositional matrix adjust.
 Identities = 149/340 (43%), Positives = 207/340 (60%), Gaps = 17/340 (5%)

Query: 15  VIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIE 74
           V+ +L +    Q +S    +   I E+ + +  +H + +  E+E+  R+ IF +N   I 
Sbjct: 4   VLALLALVAFVQAIS----YTDVIKEEWQTFKMEHRKNFLSEVEERFRMKIFNENRHKIA 59

Query: 75  KANK---EGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSR-QSSRPSTFKYQNVTD 130
           K N+   +G  ++KLG N++SD+   EF+    GYN  +  V R Q      +       
Sbjct: 60  KHNQLYAQGKVSFKLGLNKYSDMLYHEFKETMNGYNHTMRKVLRAQGFSGIIYIPPANVQ 119

Query: 131 VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD- 189
           +P S+DWR+ GAVT +KDQG CGSCWAFS+ AA+EG      G L+ LSEQ LVDCST  
Sbjct: 120 IPKSVDWRQHGAVTAVKDQGHCGSCWAFSSTAALEGQHFRKAGVLVSLSEQNLVDCSTKY 179

Query: 190 -NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGD 248
            N+GC+GGLMD AF YI +N G+ TE  YPY   + +C   K   V AT + + D+P+GD
Sbjct: 180 GNNGCNGGLMDNAFRYIKDNGGIDTEKSYPYEGIDDSCHFTKS-GVGATDTGFVDIPQGD 238

Query: 249 EQALLQAVSNQ-PVSVCVDASGRAFHFYKSGVLN-ADC-GNNCDHGVAVVGFGTAEEENG 305
           E+AL++AV+   PVSV +DAS  +F  Y  GV N  +C   N DHGV VVG+GT  ++ G
Sbjct: 239 EEALMKAVATMGPVSVAIDASHESFQLYSEGVYNEPECDAQNLDHGVLVVGYGT--DKTG 296

Query: 306 AKYWLIKNSWGETWGESGYIRILRDA-GLCGIATAASYPV 344
             YWL+KNSWG TWG+ GYI++ R+    CGIATA+SYP 
Sbjct: 297 LDYWLVKNSWGTTWGDQGYIKMARNQDNQCGIATASSYPT 336


>gi|22653679|sp|Q26636.1|CATL_SARPE RecName: Full=Cathepsin L; Contains: RecName: Full=Cathepsin L
           heavy chain; Contains: RecName: Full=Cathepsin L light
           chain; Flags: Precursor
 gi|505140|dbj|BAA03970.1| cathepsin L precursor [Sarcophaga peregrina]
          Length = 339

 Score =  271 bits (693), Expect = 3e-70,   Method: Compositional matrix adjust.
 Identities = 146/318 (45%), Positives = 200/318 (62%), Gaps = 14/318 (4%)

Query: 38  IVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANK---EGNRTYKLGTNEFSDL 94
           I E+   +  QH + Y +E+E+  R+ IF +N   I K N+   +G  +YKLG N+++D+
Sbjct: 24  IKEEWHTYKLQHRKNYANEVEERFRMKIFNENRHKIAKHNQLFAQGKVSYKLGLNKYADM 83

Query: 95  TNEEFRALYTGYNRPVPSVSRQSSR--PSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQC 152
            + EF+    GYN  +  + R+ +    +T+       VP S+DWRE GAVT +KDQG C
Sbjct: 84  LHHEFKETMNGYNHTLRQLMRERTGLVGATYIPPAHVTVPKSVDWREHGAVTGVKDQGHC 143

Query: 153 GSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD--NHGCSGGLMDKAFEYIIENKG 210
           GSCWAFS+  A+EG      G L+ LSEQ LVDCST   N+GC+GGLMD AF YI +N G
Sbjct: 144 GSCWAFSSTGALEGQHFRKAGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKDNGG 203

Query: 211 LATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQ-PVSVCVDASG 269
           + TE  YPY   + +C   K   + AT + + D+P+GDE+ + +AV+   PVSV +DAS 
Sbjct: 204 IDTEKSYPYEGIDDSCHFNK-ATIGATDTGFVDIPEGDEEKMKKAVATMGPVSVAIDASH 262

Query: 270 RAFHFYKSGVLN-ADCG-NNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRI 327
            +F  Y  GV N  +C   N DHGV VVG+GT  +E+G  YWL+KNSWG TWGE GYI++
Sbjct: 263 ESFQLYSEGVYNEPECDEQNLDHGVLVVGYGT--DESGMDYWLVKNSWGTTWGEQGYIKM 320

Query: 328 LRDA-GLCGIATAASYPV 344
            R+    CGIATA+SYP 
Sbjct: 321 ARNQNNQCGIATASSYPT 338


>gi|269784818|ref|NP_001161481.1| cathepsin L1 precursor [Gallus gallus]
          Length = 353

 Score =  271 bits (693), Expect = 3e-70,   Method: Compositional matrix adjust.
 Identities = 151/354 (42%), Positives = 215/354 (60%), Gaps = 15/354 (4%)

Query: 1   MVLKFEKSFIIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKA 60
           + L    S  + M V + ++  C     +   + +P +    + W + H + Y  E E++
Sbjct: 4   LFLARRLSRFVNMNVCLTILSLCLGLAFAAPRV-DPDLDSHWQLWKSWHSKDYH-EREES 61

Query: 61  MRLNIFKQNLEYIEKANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQS 117
            R  ++++NL+ IE  N +   G  +YKLG N+F D+T EEFR L  GY       S + 
Sbjct: 62  WRRVVWEKNLKMIELHNLDHSLGKHSYKLGMNQFGDMTAEEFRQLMNGYKH---KKSERK 118

Query: 118 SRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIE 177
            R S F   +  + P S+DWREKG VT +KDQGQCGSCWAFS   A+EG      GKL+ 
Sbjct: 119 YRGSQFLEPSFLEAPRSVDWREKGYVTPVKDQGQCGSCWAFSTTGALEGQHFRKTGKLVS 178

Query: 178 LSEQQLVDCSTD--NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVA 235
           LSEQ LVDCS    N GC+GGLMD+AF+Y+ +N G+ +E  YPY  ++      K +  A
Sbjct: 179 LSEQNLVDCSRPEGNQGCNGGLMDQAFQYVQDNGGIDSEESYPYTAKDDEDCRYKAEYNA 238

Query: 236 ATISKYEDLPKGDEQALLQAVSN-QPVSVCVDASGRAFHFYKSGV-LNADCGN-NCDHGV 292
           A  + + D+P+G E+AL++AV++  PVSV +DA   +F FY+SG+    DC + + DHGV
Sbjct: 239 ANDTGFVDIPQGHERALMKAVASVGPVSVAIDAGHSSFQFYQSGIYYEPDCSSEDLDHGV 298

Query: 293 AVVGFG-TAEEENGAKYWLIKNSWGETWGESGYIRILRD-AGLCGIATAASYPV 344
            VVG+G   E+ +G KYW++KNSWGE WG+ GYI + +D    CGIATAASYP+
Sbjct: 299 LVVGYGFEGEDVDGKKYWIVKNSWGEKWGDKGYIYMAKDRKNHCGIATAASYPL 352


>gi|413953665|gb|AFW86314.1| hypothetical protein ZEAMMB73_546353 [Zea mays]
          Length = 233

 Score =  271 bits (692), Expect = 4e-70,   Method: Compositional matrix adjust.
 Identities = 128/233 (54%), Positives = 165/233 (70%), Gaps = 12/233 (5%)

Query: 120 PSTFKYQNVTD--VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIE 177
           P+ F+Y+NV+   +PT+IDWR KGAVT IKDQGQCG CWAFSAVAA EGI +I+ GKL+ 
Sbjct: 4   PTGFRYENVSADALPTTIDWRTKGAVTPIKDQGQCGCCWAFSAVAATEGIVKISTGKLVS 63

Query: 178 LSEQQLVDCST--DNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVA 235
           L+EQ+LVDC    ++ GC GGLMD AF++II+N GL TE+ YPY   +G C +      A
Sbjct: 64  LAEQELVDCDVHDEDQGCEGGLMDDAFKFIIKNGGLTTESSYPYTAADGKCKSGSNS--A 121

Query: 236 ATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVV 295
           ATI  YED+P  DE AL++AV+NQPVSV VD     F FY  GV+   CG + DHG+A +
Sbjct: 122 ATIKGYEDVPANDEAALMKAVANQPVSVAVDGGDMTFQFYSGGVMTGSCGTDLDHGIAAI 181

Query: 296 GFGTAEEENGAKYWLIKNSWGETWGESGYIRILRDA----GLCGIATAASYPV 344
           G+G  +  +G KYWL+KNSWG TWGE+GY+R+ +D     G+CG+A   SYP 
Sbjct: 182 GYG--KTSDGTKYWLMKNSWGTTWGENGYLRMEKDISDKRGMCGLAMEPSYPT 232


>gi|326520387|dbj|BAK07452.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 349

 Score =  271 bits (692), Expect = 4e-70,   Method: Compositional matrix adjust.
 Identities = 146/344 (42%), Positives = 203/344 (59%), Gaps = 17/344 (4%)

Query: 10  IIPMFVIIILVITCASQVVSGRSMHEPSIV--EKHEQWMAQHGRTYKDELEKAMRLNIFK 67
           +IP+ V++   +  A    SG  +    ++  ++  QW A H R+Y    E+  R  +++
Sbjct: 11  VIPILVLLTGGLFAAFPAASGGRVDAGDMLMMDRFRQWQATHNRSYLSAEERLRRFEVYR 70

Query: 68  QNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTG--YNRPVPSVSRQSSRPSTFKY 125
            N+EYI+  N+ G  TY+LG N+F+DLT EEF A Y G      + + +      S+   
Sbjct: 71  TNVEYIDATNRRGGLTYELGENQFADLTGEEFLARYAGGHTGSAITTAAEADGLWSSGGS 130

Query: 126 QNV--TDVPTSIDWREKGAVTHIKDQG-QCGSCWAFSAVAAVEGITQITRGKLIELSEQQ 182
                 D P S+DWR KGAVT +K+QG QC SCWAFSAVA +E +  I  GKL+ LSEQQ
Sbjct: 131 DGSLEADPPASVDWRAKGAVTPVKNQGSQCYSCWAFSAVATMESLYFIKTGKLVALSEQQ 190

Query: 183 LVDCSTDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYE 242
           LVDC   + GC+ G   +AF++I+EN G+ T A YPY+   G C   K    A TI+ + 
Sbjct: 191 LVDCDKYDGGCNKGYYHRAFQWIMENGGITTAAQYPYKAVRGACSAAKP---AVTITGHL 247

Query: 243 DLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEE 302
            + K +E AL  AV+ QP+ V ++    +  FYKSGV +A CG    H V  VG+G   +
Sbjct: 248 AVAK-NELALQSAVARQPIGVAIEVP-ISMQFYKSGVFSAACGIQMSHAVVTVGYGA--D 303

Query: 303 ENGAKYWLIKNSWGETWGESGYIRILRD---AGLCGIATAASYP 343
            +G KYWL+KNSWG+TWGE+GYIR+ RD    GLCGIA   +YP
Sbjct: 304 ASGLKYWLVKNSWGQTWGEAGYIRMRRDVGGGGLCGIALDTAYP 347


>gi|66735056|gb|AAY53767.1| cysteine protease [Saprolegnia parasitica]
          Length = 523

 Score =  271 bits (692), Expect = 4e-70,   Method: Compositional matrix adjust.
 Identities = 140/311 (45%), Positives = 192/311 (61%), Gaps = 12/311 (3%)

Query: 41  KHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFR 100
           K   WM +      + LE   R  +F  N + IE  NK+ + ++ +G NE+S LT +EF+
Sbjct: 27  KFLSWMKKFAVKL-NPLEWVHRFEVFILNDQRIEAHNKDASSSFTMGHNEYSHLTFDEFK 85

Query: 101 ALYTGYNRPVPSVSRQSSRPSTFKYQ-NVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFS 159
            L TG  R  PS  +  ++ +      N+TDVP  +DW E+G VT +K+QG CGSCWAFS
Sbjct: 86  KLRTGL-RVSPSYIQSRAKYALMAPAVNMTDVPNEMDWVEQGGVTPVKNQGMCGSCWAFS 144

Query: 160 AVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGLATEADYP 218
              A+EG   ++  +L+ +SEQ+LVDC  + + GC+GGLMD AF+++  +KGL  E DYP
Sbjct: 145 TTGAIEGAAFVSSKQLVSVSEQELVDCDHNGDMGCNGGLMDNAFKWVKTHKGLCKEEDYP 204

Query: 219 YRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSG 278
           Y  +EGTC  +K K V   ++ + D+P  DEQAL  AV+ QPVSV ++A    F FYKSG
Sbjct: 205 YHAKEGTCALKKCKPV-TKVTAFHDVPANDEQALKAAVAKQPVSVAIEADQPEFQFYKSG 263

Query: 279 VLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILR----DAGLC 334
           V +  CG   DHGV VVG+G   EE G KYW +KNSWG  WG+ GYI++ R    + G C
Sbjct: 264 VFDKSCGTKLDHGVLVVGYG---EEGGKKYWKVKNSWGADWGDKGYIKLAREFGPETGQC 320

Query: 335 GIATAASYPVA 345
           G+A   SYP A
Sbjct: 321 GVAMVPSYPTA 331


>gi|299507656|gb|ADJ21807.1| cathepsin L [Oplegnathus fasciatus]
          Length = 336

 Score =  271 bits (692), Expect = 4e-70,   Method: Compositional matrix adjust.
 Identities = 151/342 (44%), Positives = 210/342 (61%), Gaps = 17/342 (4%)

Query: 13  MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEY 72
           M  + +L + C S  +S  S+ +P + E  + W + H + Y  E E+  R  ++++NL+ 
Sbjct: 1   MLPVAVLAV-CLSAALSAPSL-DPQLDEHWDLWKSWHTKKYH-EKEEGWRRMVWEKNLKK 57

Query: 73  IEKANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
           IE  N E   G  TY+LG N F D+T+EEFR +  GY R     S +  + S F   N  
Sbjct: 58  IELHNLEHSMGEHTYRLGMNHFGDMTHEEFRQIMNGYKRK----SERKFKGSLFMEPNFL 113

Query: 130 DVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD 189
           + P S+DWR+ G VT +KDQGQCGSCWAFS   A+EG      GKL+ LSEQ LVDCS  
Sbjct: 114 EAPRSVDWRDNGYVTPVKDQGQCGSCWAFSTTGAMEGQHFRKTGKLVSLSEQNLVDCSRP 173

Query: 190 --NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKG 247
             N GC+GGLMD+AF+YI +N+GL +E  YPY   +    +   K  +A  + + D+P G
Sbjct: 174 EGNEGCNGGLMDQAFQYIKDNQGLDSEDSYPYLGTDDQPCHYDPKYNSANDTGFIDIPSG 233

Query: 248 DEQALLQAVSN-QPVSVCVDASGRAFHFYKSGV-LNADCGN-NCDHGVAVVGFG-TAEEE 303
            E+AL++AV+   PVSV +DA   +F FY+SG+    +C +   DHGV VVG+G   E+ 
Sbjct: 234 KERALMKAVAAVGPVSVAIDAGHESFQFYQSGIYYEKECSSEELDHGVLVVGYGFEGEDV 293

Query: 304 NGAKYWLIKNSWGETWGESGYIRILRD-AGLCGIATAASYPV 344
           +G KYW++KNSW E WG+ GYI + +D    CGIATAASYP+
Sbjct: 294 DGKKYWIVKNSWSEKWGDKGYIYMAKDRKNHCGIATAASYPL 335


>gi|2507252|sp|P14080.2|PAPA2_CARPA RecName: Full=Chymopapain; AltName: Full=Papaya proteinase II;
           Short=PPII; Flags: Precursor
 gi|1332461|emb|CAA66378.1| chymopapain [Carica papaya]
          Length = 352

 Score =  271 bits (692), Expect = 4e-70,   Method: Compositional matrix adjust.
 Identities = 139/345 (40%), Positives = 203/345 (58%), Gaps = 14/345 (4%)

Query: 7   KSFIIPMFVIIILVITCASQVVSGRSMHEPSIVEK----HEQWMAQHGRTYKDELEKAMR 62
           K   +   +II + ++ A     G S  + + +E+     + WM +H + Y+   EK  R
Sbjct: 9   KIIFLATCLIIHMGLSSADFYTVGYSQDDLTSIERLIQLFDSWMLKHNKIYESIDEKIYR 68

Query: 63  LNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPST 122
             IF+ NL YI++ NK+ N +Y LG N F+DL+N+EF+  Y G+         +      
Sbjct: 69  FEIFRDNLMYIDETNKK-NNSYWLGLNGFADLSNDEFKKKYVGF-VAEDFTGLEHFDNED 126

Query: 123 FKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQ 182
           F Y++VT+ P SIDWR KGAVT +K+QG CGSCWAFS +A VEGI +I  G L+ELSEQ+
Sbjct: 127 FTYKHVTNYPQSIDWRAKGAVTPVKNQGACGSCWAFSTIATVEGINKIVTGNLLELSEQE 186

Query: 183 LVDCSTDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYE 242
           LVDC   ++GC GG    + +Y + N G+ T   YPY+ ++  C    +      I+ Y+
Sbjct: 187 LVDCDKHSYGCKGGYQTTSLQY-VANNGVHTSKVYPYQAKQYKCRATDKPGPKVKITGYK 245

Query: 243 DLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEE 302
            +P   E + L A++NQP+SV V+A G+ F  YKSGV +  CG   DH V  VG+GT++ 
Sbjct: 246 RVPSNCETSFLGALANQPLSVLVEAGGKPFQLYKSGVFDGPCGTKLDHAVTAVGYGTSDG 305

Query: 303 ENGAKYWLIKNSWGETWGESGYIRILRDA----GLCGIATAASYP 343
           +N   Y +IKNSWG  WGE GY+R+ R +    G CG+  ++ YP
Sbjct: 306 KN---YIIIKNSWGPNWGEKGYMRLKRQSGNSQGTCGVYKSSYYP 347


>gi|55740406|gb|AAV63979.1| cathepsin L1 precursor [Artemia parthenogenetica]
          Length = 338

 Score =  271 bits (692), Expect = 4e-70,   Method: Compositional matrix adjust.
 Identities = 148/343 (43%), Positives = 208/343 (60%), Gaps = 15/343 (4%)

Query: 11  IPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNL 70
           +    +I L+     Q+ +  S+      E H  + A H + Y  +LE+ +R+ I+ +N 
Sbjct: 1   MKQITLIFLLAAVLVQLSAALSLTNLLADEWH-LFKATHKKEYPSQLEEKLRMKIYLENK 59

Query: 71  EYIEKAN---KEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQN 127
             + K N   ++G ++Y++  N+F DL + EFR++  GY     + SR  S  +  +  N
Sbjct: 60  HKVAKHNILYEKGEKSYQVAMNKFGDLLHHEFRSIMNGYQHKKQNSSRAESTFTFMEPAN 119

Query: 128 VTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCS 187
           V +VP S+DWREKGA+T +KDQGQCGSCWAFS+  A+EG T    GKL+ LSEQ L+DCS
Sbjct: 120 V-EVPESVDWREKGAITPVKDQGQCGSCWAFSSTGALEGQTFRKTGKLVSLSEQNLIDCS 178

Query: 188 TD--NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLP 245
               N GC+GGLMD+AF+YI +NKG+ TE  YPY  E+G C     +   A    + D+P
Sbjct: 179 GKYGNEGCNGGLMDQAFQYIKDNKGIDTENTYPYEAEDGVC-RYNPRNRGAVDRGFVDIP 237

Query: 246 KGDEQALLQAVSN-QPVSVCVDASGRAFHFYKSG-VLNADC-GNNCDHGVAVVGFGTAEE 302
            G+E  L  AV+   PVSV +DAS  +F FY  G      C  ++ DHGV VVG+G+   
Sbjct: 238 SGEEDKLKAAVATVGPVSVAIDASHESFQFYSKGXYYEPSCDSDDLDHGVLVVGYGS--- 294

Query: 303 ENGAKYWLIKNSWGETWGESGYIRILRD-AGLCGIATAASYPV 344
           +NG  YWL+KNSW E WG+ GYI+I R+    CG+ATAASYP+
Sbjct: 295 DNGEDYWLVKNSWSEHWGDEGYIKIARNRKNHCGVATAASYPL 337


>gi|357114837|ref|XP_003559200.1| PREDICTED: fruit bromelain-like [Brachypodium distachyon]
          Length = 371

 Score =  271 bits (692), Expect = 4e-70,   Method: Compositional matrix adjust.
 Identities = 141/325 (43%), Positives = 188/325 (57%), Gaps = 28/325 (8%)

Query: 38  IVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNE 97
           ++++  +W A H RTY D  E+  R  +++ N+EYIE  N+ G  TY+LG N+F+DLT+E
Sbjct: 55  MLDRFVRWQAAHNRTYGDAEERLRRFQVYRANIEYIEATNRRGGLTYELGENQFADLTSE 114

Query: 98  EFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDV---------------PTSIDWREKGA 142
           EF ++Y        S      R         TDV               P S DWR KGA
Sbjct: 115 EFLSMYA-------SSYDAGDRADDEAALITTDVAGDGAWSDGDLEALPPPSWDWRAKGA 167

Query: 143 VTHIKDQGQ-CGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTDNHGCSGGLMDKA 201
           VT  K+QG  C SCWAF  VA +EG+T I  GKLI LSEQQLVDC   + GC+ G   + 
Sbjct: 168 VTPPKNQGPTCSSCWAFVTVATIEGLTFIKTGKLISLSEQQLVDCDMYDGGCNTGSYSRG 227

Query: 202 FEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPV 261
           F +++EN GL TEA+YPY    G C+  K    AA I+    +P  +E  + +AV+ QPV
Sbjct: 228 FRWVLENGGLTTEAEYPYTAARGPCNRAKSAHHAAKITGQGRIPPQNELVMQKAVAGQPV 287

Query: 262 SVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGE 321
            V ++  G    FYK+GV +  CG N  H V VVG+G  +  +GAKYW++KNSWG+ WGE
Sbjct: 288 GVAIEV-GSGMQFYKTGVYSGPCGTNLAHAVTVVGYGV-DPASGAKYWIVKNSWGQAWGE 345

Query: 322 SGYIRILRDA---GLCGIATAASYP 343
            G+IR+ RD    GLCGIA   +YP
Sbjct: 346 RGFIRMRRDVGGPGLCGIALDVAYP 370


>gi|357133074|ref|XP_003568153.1| PREDICTED: cysteine proteinase RD21a-like [Brachypodium distachyon]
          Length = 565

 Score =  271 bits (692), Expect = 5e-70,   Method: Compositional matrix adjust.
 Identities = 139/318 (43%), Positives = 186/318 (58%), Gaps = 23/318 (7%)

Query: 43  EQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNR--------TYKLGTNEFSDL 94
           E W A+HG+ Y    E+A RL  F  N  ++   N  G          +Y L  N F+DL
Sbjct: 43  EAWCAEHGKAYASPGERAARLAAFADNAAFVAAHNAGGGGAGGSNAAPSYTLALNAFADL 102

Query: 95  TNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQN---VTDVPTSIDWREKGAVTHIKDQGQ 151
           T+ EFRA   G      +V    + PS   +     V  VP ++DWR+ GAVT +KDQG 
Sbjct: 103 THAEFRAARLGRL----AVGGARAPPSEGGFAGSVGVGAVPEALDWRQSGAVTKVKDQGS 158

Query: 152 CGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKG 210
           CG+CW+FSA  A+EGI +I  G LI LSEQ+L+DC    N GC GGLMD A+ ++I+N G
Sbjct: 159 CGACWSFSATGAIEGINKIKTGSLISLSEQELIDCDRSYNAGCGGGLMDYAYRFVIKNGG 218

Query: 211 LATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGR 270
           + TE DYPYR  +GTC+  K K    TI  Y D+P   E +LLQAV+ QP+SV +  S R
Sbjct: 219 IDTEDDYPYREADGTCNKNKLKRHVVTIDGYSDVPANKEDSLLQAVAQQPISVGICGSAR 278

Query: 271 AFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD 330
           AF  Y  G+ +  C  + DH V +VG+G+   E G  YW++KNSWGE WG  GY+ + R+
Sbjct: 279 AFQLYSQGIFDGPCPTSLDHAVLIVGYGS---EGGKDYWIVKNSWGERWGMKGYMHMHRN 335

Query: 331 ----AGLCGIATAASYPV 344
               +G+CGI   AS+P 
Sbjct: 336 TGSSSGICGINMMASFPT 353


>gi|194757786|ref|XP_001961143.1| GF13722 [Drosophila ananassae]
 gi|190622441|gb|EDV37965.1| GF13722 [Drosophila ananassae]
          Length = 417

 Score =  271 bits (692), Expect = 5e-70,   Method: Compositional matrix adjust.
 Identities = 153/356 (42%), Positives = 211/356 (59%), Gaps = 19/356 (5%)

Query: 1   MVLKFEKSFIIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKA 60
           +V+ F  +FI+    + IL    A Q +        +        + +H + Y DE E+ 
Sbjct: 68  VVMLFVNAFIL----VFILKKRKAYQNLKATEEQPRTSYAATSTHVLEHRKNYLDETEER 123

Query: 61  MRLNIFKQNLEYIEKANK---EGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSR-- 115
            RL IF +N   I K N+    G  +YKL  N+++D+ + EFR L  G+N  +    R  
Sbjct: 124 FRLKIFNENKHKIAKHNQLWASGKVSYKLAVNKYADMLHHEFRQLMNGFNYTLHKELRAA 183

Query: 116 -QSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGK 174
            +S +  TF       +P S+DWR+KGAVT +KDQG CGSCWAFS+  A+EG      G 
Sbjct: 184 DESFKGVTFISPEHVTLPKSVDWRDKGAVTGVKDQGHCGSCWAFSSTGALEGQHYRKSGV 243

Query: 175 LIELSEQQLVDCSTD--NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEK 232
           L+ LSEQ LVDCST   N+GC+GGLMD AF YI +N G+ TE  YPY   + +C   K  
Sbjct: 244 LVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKDNGGIDTEKSYPYEALDDSCHFNK-G 302

Query: 233 AVAATISKYEDLPKGDEQALLQAVSN-QPVSVCVDASGRAFHFYKSGV-LNADC-GNNCD 289
            + AT   + D+P+G+E+ L +AV+   PVSV +DAS  +F FY  GV +   C   N D
Sbjct: 303 TIGATDRGFVDIPQGNEKKLAEAVATIGPVSVAIDASHESFQFYSEGVYVEPACDAQNLD 362

Query: 290 HGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRDA-GLCGIATAASYPV 344
           HGV VVGFGT  +E+G  YWL+KNSWG TWG+ G+I++LR+    CGIA+A+SYP+
Sbjct: 363 HGVLVVGFGT--DESGQDYWLVKNSWGTTWGDKGFIKMLRNKDNQCGIASASSYPL 416


>gi|112490572|pdb|2FO5|A Chain A, Crystal Structure Of Recombinant Barley Cysteine
           Endoprotease B Isoform 2 (Ep-B2) In Complex With
           Leupeptin
 gi|112490573|pdb|2FO5|B Chain B, Crystal Structure Of Recombinant Barley Cysteine
           Endoprotease B Isoform 2 (Ep-B2) In Complex With
           Leupeptin
 gi|112490574|pdb|2FO5|C Chain C, Crystal Structure Of Recombinant Barley Cysteine
           Endoprotease B Isoform 2 (Ep-B2) In Complex With
           Leupeptin
 gi|112490575|pdb|2FO5|D Chain D, Crystal Structure Of Recombinant Barley Cysteine
           Endoprotease B Isoform 2 (Ep-B2) In Complex With
           Leupeptin
          Length = 262

 Score =  270 bits (691), Expect = 5e-70,   Method: Compositional matrix adjust.
 Identities = 131/225 (58%), Positives = 159/225 (70%), Gaps = 10/225 (4%)

Query: 128 VTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCS 187
           V+D+P S+DWR+KGAVT +KDQG+CGSCWAFS V +VEGI  I  G L+ LSEQ+L+DC 
Sbjct: 1   VSDLPPSVDWRQKGAVTGVKDQGKCGSCWAFSTVVSVEGINAIRTGSLVSLSEQELIDCD 60

Query: 188 T-DNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCD---NQKEKAVAATISKYED 243
           T DN GC GGLMD AFEYI  N GL TEA YPYR   GTC+     +   V   I  ++D
Sbjct: 61  TADNDGCQGGLMDNAFEYIKNNGGLITEAAYPYRAARGTCNVARAAQNSPVVVHIDGHQD 120

Query: 244 LPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEE 303
           +P   E+ L +AV+NQPVSV V+ASG+AF FY  GV   +CG   DHGVAVVG+G AE+ 
Sbjct: 121 VPANSEEDLARAVANQPVSVAVEASGKAFMFYSEGVFTGECGTELDHGVAVVGYGVAED- 179

Query: 304 NGAKYWLIKNSWGETWGESGYIRILRDA----GLCGIATAASYPV 344
            G  YW +KNSWG +WGE GYIR+ +D+    GLCGIA  ASYPV
Sbjct: 180 -GKAYWTVKNSWGPSWGEQGYIRVEKDSGASGGLCGIAMEASYPV 223


>gi|449513868|ref|XP_002191976.2| PREDICTED: cathepsin L1-like [Taeniopygia guttata]
          Length = 443

 Score =  270 bits (691), Expect = 6e-70,   Method: Compositional matrix adjust.
 Identities = 152/353 (43%), Positives = 216/353 (61%), Gaps = 20/353 (5%)

Query: 2   VLKFEKSFIIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAM 61
           + K +++ +IP      +    +++ +  R   +P +    + W + H + Y  E E+  
Sbjct: 100 LRKLQRNQVIP------VTKENSTETLHCRWQVDPELDGHWQLWKSWHRKDYH-EREEGW 152

Query: 62  RLNIFKQNLEYIEKANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSS 118
           R  ++++NL+ IE  N +   G  +YKLG N+F D+T EEFR L  GY   V   S +  
Sbjct: 153 RRVVWEKNLKMIEIHNLDHALGKHSYKLGMNQFGDMTTEEFRQLMNGY---VHKKSERKY 209

Query: 119 RPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIEL 178
           R S F   N  + P S+DWREKG VT +KDQGQCGSCWAFS   A+EG      GKL+ L
Sbjct: 210 RGSQFLEPNFLEAPRSVDWREKGYVTPVKDQGQCGSCWAFSTTGALEGQHFRKTGKLVSL 269

Query: 179 SEQQLVDCSTD--NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAA 236
           SEQ LVDCS    N GC+GGLMD+AF+Y+ +N G+ +E  YPY  ++      K +  AA
Sbjct: 270 SEQNLVDCSRPEGNQGCNGGLMDQAFQYVQDNGGIDSEESYPYTAKDDEDCRYKAEYNAA 329

Query: 237 TISKYEDLPKGDEQALLQAVSN-QPVSVCVDASGRAFHFYKSGV-LNADCGN-NCDHGVA 293
             + + D+P+G E+AL++AV+   PVSV +DA   +F FY+SG+    DC + + DHGV 
Sbjct: 330 NDTGFVDIPQGHERALMKAVAAVGPVSVAIDAGHSSFQFYQSGIYYEPDCSSEDLDHGVL 389

Query: 294 VVGFG-TAEEENGAKYWLIKNSWGETWGESGYIRILRD-AGLCGIATAASYPV 344
           VVG+G   E+ +G KYW++KNSWGE WG+ GYI + +D    CGIATAASYP+
Sbjct: 390 VVGYGFEGEDVDGKKYWIVKNSWGEKWGDKGYIYMAKDRKNHCGIATAASYPL 442


>gi|342305188|dbj|BAK55648.1| cathepsin L [Oplegnathus fasciatus]
          Length = 336

 Score =  270 bits (691), Expect = 6e-70,   Method: Compositional matrix adjust.
 Identities = 151/342 (44%), Positives = 210/342 (61%), Gaps = 17/342 (4%)

Query: 13  MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEY 72
           M  + +L + C S  +S  S+ +P + E  + W + H + Y  E E+  R  ++++NL+ 
Sbjct: 1   MLPVAVLAV-CLSAALSAPSL-DPQLDEHWDLWKSWHTKKYH-EKEEGWRRMVWEKNLKK 57

Query: 73  IEKANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
           IE  N E   G  TY+LG N F D+T+EEFR +  GY R     S +  + S F   N  
Sbjct: 58  IELHNLEHSMGEHTYRLGMNHFGDMTHEEFRQIMYGYKRK----SERKFKGSLFMEPNFL 113

Query: 130 DVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD 189
           + P S+DWR+ G VT +KDQGQCGSCWAFS   A+EG      GKL+ LSEQ LVDCS  
Sbjct: 114 EAPRSVDWRDNGYVTPVKDQGQCGSCWAFSTTGAMEGQHFRKTGKLVSLSEQNLVDCSRP 173

Query: 190 --NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKG 247
             N GC+GGLMD+AF+YI +N+GL +E  YPY   +    +   K  +A  + + D+P G
Sbjct: 174 EGNEGCNGGLMDQAFQYIKDNQGLDSEDSYPYLGTDDQPCHYDPKYNSANDTGFIDIPSG 233

Query: 248 DEQALLQAVSN-QPVSVCVDASGRAFHFYKSGV-LNADCGN-NCDHGVAVVGFG-TAEEE 303
            E+AL++AV+   PVSV +DA   +F FY+SG+    +C +   DHGV VVG+G   E+ 
Sbjct: 234 KERALMKAVAAVGPVSVAIDAGHESFQFYQSGIYYEKECSSEELDHGVLVVGYGFEGEDV 293

Query: 304 NGAKYWLIKNSWGETWGESGYIRILRD-AGLCGIATAASYPV 344
           +G KYW++KNSW E WG+ GYI + +D    CGIATAASYP+
Sbjct: 294 DGKKYWIVKNSWSEKWGDKGYIYMAKDRKNHCGIATAASYPL 335


>gi|359359068|gb|AEV40975.1| putative cysteine protease [Oryza punctata]
          Length = 464

 Score =  270 bits (690), Expect = 7e-70,   Method: Compositional matrix adjust.
 Identities = 146/316 (46%), Positives = 203/316 (64%), Gaps = 18/316 (5%)

Query: 42  HEQWMAQH---GRTYKDEL-EKAMRLNIFKQNLEYIE--KANKEGNRTYKLGTNEFSDLT 95
           ++ W+A+H   G ++   + E   R  +F  NL++++   A+ +G+  ++LG N F+DLT
Sbjct: 66  YDLWVARHRHGGGSHNGFVGEYERRFRVFWDNLKFVDAHNAHADGHGGFRLGMNRFADLT 125

Query: 96  NEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAV-THIKDQGQCGS 154
           N+EFRA Y G         R       +++  V  +P S+DWR+KGAV + +K+QGQCGS
Sbjct: 126 NDEFRAAYLG----TTPAGRGRHVGEMYRHDGVEALPDSVDWRDKGAVVSPVKNQGQCGS 181

Query: 155 CWAFSAVAAVEGITQITRGKLIELSEQQLVDCS--TDNHGCSGGLMDKAFEYIIENKGLA 212
           CWAFSAVAAVEGI +I  G+L+ LSEQ+LV+C+    N GC+GG+MD AF +I  N GL 
Sbjct: 182 CWAFSAVAAVEGINKIVTGELVSLSEQELVECARNGGNSGCNGGIMDDAFAFITRNGGLD 241

Query: 213 TEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAF 272
           TE DYPY   +G CD  K+     +I  +ED+P+ DE +L +AV++QPVSV +DA GR F
Sbjct: 242 TEEDYPYTAMDGKCDLAKKSRKVVSIDGFEDVPENDELSLQKAVAHQPVSVAIDAGGREF 301

Query: 273 HFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD-- 330
             Y SGV    CG + DHGV  VG+GT +   G  YW ++NSWG  WGE+GYIR+ R+  
Sbjct: 302 QLYDSGVFTGRCGTSLDHGVVAVGYGT-DAATGTDYWTVRNSWGPDWGENGYIRMERNVT 360

Query: 331 --AGLCGIATAASYPV 344
              G CGIA  ASYP+
Sbjct: 361 ARTGKCGIAMMASYPI 376


>gi|302845628|ref|XP_002954352.1| hypothetical protein VOLCADRAFT_76255 [Volvox carteri f.
           nagariensis]
 gi|300260282|gb|EFJ44502.1| hypothetical protein VOLCADRAFT_76255 [Volvox carteri f.
           nagariensis]
          Length = 489

 Score =  270 bits (690), Expect = 7e-70,   Method: Compositional matrix adjust.
 Identities = 146/351 (41%), Positives = 211/351 (60%), Gaps = 24/351 (6%)

Query: 14  FVIIILVITCASQVVSG-----RSMHEPSIVEKH-------EQWMAQHGRTYKDEL-EKA 60
           F+I  L++  +  V +      R  HE  +++         +QWM Q+ + Y +++ E  
Sbjct: 5   FLIAALLVAASGGVGAAPELQLREQHEKLLLDAKANPMAAFQQWMMQYTKAYANDIKELE 64

Query: 61  MRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVS-RQSSR 119
            R +++ +NL YI   N     ++ L  N F+DLT +EFR    GY+      S R  S 
Sbjct: 65  TRFSVWLENLNYILAYNAR-TTSHWLHLNAFADLTTDEFRN-RLGYDFKARQASNRLQSS 122

Query: 120 PSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELS 179
           P  +   +   +PT IDWR+KGAVT +K+QGQCGSCWAF+   +VEGI  I  G+L  LS
Sbjct: 123 PFIYDNVDANQLPTEIDWRKKGAVTEVKNQGQCGSCWAFATTGSVEGINAIVTGELASLS 182

Query: 180 EQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATI 238
           EQ+LVDC TD + GCSGGLMD A+++II+N GL TE DYPY  E+G C   K+     TI
Sbjct: 183 EQELVDCDTDEDRGCSGGLMDYAYQWIIKNGGLDTEDDYPYTAEDGVCVAAKKNRRVVTI 242

Query: 239 SKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVL-NADCGNNCDHGVAVVGF 297
             Y D+P+ DE AL +A ++QP++V ++A  ++F  Y  GV  +  CG + +HGV VVG+
Sbjct: 243 DGYVDIPENDEVALKKAAAHQPIAVAIEADAKSFQLYGGGVYDDPTCGTSLNHGVLVVGY 302

Query: 298 GTAEEENGAKYWLIKNSWGETWGESGYIRILRDA----GLCGIATAASYPV 344
           G  ++ +   YW++KNSWG  WG++GYIR+   A    G+CGIA A S+P 
Sbjct: 303 G--KDPHFGNYWIVKNSWGPEWGDNGYIRLRMGAEDVQGMCGIAMAPSFPT 351


>gi|320169652|gb|EFW46551.1| cathepsin L2 [Capsaspora owczarzaki ATCC 30864]
          Length = 325

 Score =  270 bits (690), Expect = 7e-70,   Method: Compositional matrix adjust.
 Identities = 147/307 (47%), Positives = 185/307 (60%), Gaps = 11/307 (3%)

Query: 44  QWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALY 103
           +W A H R Y    E+A+R  I+  NLE I + N  G  +Y LG NEF DL + EF A Y
Sbjct: 23  EWKALHNRQYASAQEEALRQEIYLSNLELINEHNAAGRHSYTLGMNEFGDLAHHEFAAKY 82

Query: 104 TGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAA 163
            G       V+   S  S+     +  +P S+DWR  G VT +K+QGQCGSCW+FS   +
Sbjct: 83  LGVR--FNGVNATKSFASSTYLPRMVSLPDSVDWRTAGIVTPVKNQGQCGSCWSFSTTGS 140

Query: 164 VEGITQITRGKLIELSEQQLVDCSTD--NHGCSGGLMDKAFEYIIENKGLATEADYPYRH 221
           VEG      G L+ LSEQ LVDCS+   N GC+GGLMD AFEYII+N G+ TEA YPY  
Sbjct: 141 VEGQHARKTGTLVSLSEQNLVDCSSQEGNEGCNGGLMDDAFEYIIKNGGIDTEASYPYTA 200

Query: 222 EEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSN-QPVSVCVDASGRAFHFYKSGVL 280
             GTC       + AT++ Y+D+  G E  L  AV+   PVSV +DAS   F FY +GV 
Sbjct: 201 TTGTCKFNAAN-IGATVASYQDIITGSESDLQNAVATVGPVSVAIDASHINFQFYFTGVY 259

Query: 281 N-ADCGNN-CDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRDA-GLCGIA 337
           N   C     DHGV  VG+GT+ E  G  YWL+KNSWG TWG++GYI + R+A   CGIA
Sbjct: 260 NEKKCSTTQLDHGVLAVGYGTSTE--GKDYWLVKNSWGATWGKAGYIWMSRNADNQCGIA 317

Query: 338 TAASYPV 344
           T+ASYP+
Sbjct: 318 TSASYPL 324


>gi|326494040|dbj|BAJ85482.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 355

 Score =  270 bits (690), Expect = 7e-70,   Method: Compositional matrix adjust.
 Identities = 146/318 (45%), Positives = 187/318 (58%), Gaps = 18/318 (5%)

Query: 40  EKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEF 99
            +HE+WMA++GR Y D  EK  R  +F  N  +I+  N+ GNRTY LG N FSDLTNEEF
Sbjct: 39  HRHERWMAKYGRVYADAAEKLRRQEVFAANARHIDAVNRAGNRTYTLGLNHFSDLTNEEF 98

Query: 100 RALYTGY-NRPVPSVSR-QSSRPSTFKYQNVTD-----VPTSIDWREKGAVTHIKDQGQC 152
              + GY ++P P   R + S P+     NVTD      P S+DWR +GAVT +K QG C
Sbjct: 99  AQTHLGYRHQPGPGGLRPEDSSPAAAV--NVTDAQLQSTPDSVDWRARGAVTPVKHQGHC 156

Query: 153 GSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTDNHGCSGGLMDKAFEYIIENKGLA 212
           GSCWAF+AVAA EG+ QI  G LI +SEQQ++DC+     C  G ++ A  YI  + GL 
Sbjct: 157 GSCWAFAAVAATEGLVQIATGNLISMSEQQVLDCTGGTSSCKSGYVNAALTYITASGGLQ 216

Query: 213 TEADYPYRHEEGTCDN--QKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGR 270
           TEA Y Y  E+G C +      + AA       +  GDE AL   V+ QPV+V V+A   
Sbjct: 217 TEAAYAYSAEQGACRSGGASPNSAAAVGVHRSAMLNGDEGALQVLVAGQPVAVAVEAE-P 275

Query: 271 AFHFYKSGVL--NADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRIL 328
            FH YKSGV   +  CG    H V VVG+G   + +G  YW++KN WG  WGE GY+R+ 
Sbjct: 276 DFHHYKSGVYVGSPSCGQKLHHAVTVVGYGA--DGDGQGYWVVKNQWGAGWGEVGYMRLT 333

Query: 329 RDAGL--CGIATAASYPV 344
           R  G   CG+AT A YP 
Sbjct: 334 RGNGGNNCGMATHAYYPT 351


>gi|194352758|emb|CAQ00107.1| papain-like cysteine proteinase [Hordeum vulgare subsp. vulgare]
          Length = 457

 Score =  270 bits (690), Expect = 8e-70,   Method: Compositional matrix adjust.
 Identities = 136/314 (43%), Positives = 189/314 (60%), Gaps = 16/314 (5%)

Query: 41  KHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANK------EGNRTYKLGTNEFSDL 94
           + E W A+HG+ Y    E+A RL  F +N  ++   N        G  +Y L  N F+DL
Sbjct: 38  QFEAWCAEHGKAYATPGERAARLAAFAENAAFVAAHNDAVASSGPGGPSYTLALNAFADL 97

Query: 95  TNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQN-VTDVPTSIDWREKGAVTHIKDQGQCG 153
           T++EFRA   G     P      S PS   ++  V  VP ++DWR+ GAVT +KDQG CG
Sbjct: 98  THDEFRAARLGRLAVGPGPLGAPS-PSDGGFEGRVGAVPDALDWRQSGAVTKVKDQGSCG 156

Query: 154 SCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGLA 212
           +CW+FSA  A+EGI +IT G L+ LSEQ+L+DC    N GC GGLM  A++++I+N G+ 
Sbjct: 157 ACWSFSATGAMEGINKITTGSLLSLSEQELIDCDRSYNTGCGGGLMTYAYKFVIKNGGID 216

Query: 213 TEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAF 272
           TE DYP+R  +GTC+  K K    TI  Y+++P   E  LLQAV+ QP+SV +  S RAF
Sbjct: 217 TEDDYPFREADGTCNKNKLKKHVVTIDGYKEVPSSKEDLLLQAVAQQPISVGICGSARAF 276

Query: 273 HFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD-- 330
             Y  G+ +  C  + DH V +VG+G+   E G  YW++KNSWGE WG  GY+ + R+  
Sbjct: 277 QLYSQGIFDGPCPTSLDHAVLIVGYGS---EGGKDYWIVKNSWGERWGMKGYMHMHRNTG 333

Query: 331 --AGLCGIATAASY 342
             +G+CGI   AS+
Sbjct: 334 SSSGICGINMMASF 347


>gi|432114311|gb|ELK36239.1| Cathepsin S [Myotis davidii]
          Length = 340

 Score =  270 bits (689), Expect = 9e-70,   Method: Compositional matrix adjust.
 Identities = 153/348 (43%), Positives = 216/348 (62%), Gaps = 22/348 (6%)

Query: 6   EKSFIIPMFVIIILVITCASQVVSGRSMHEPSIVEKH-EQWMAQHGRTYKDELEKAMRLN 64
           E+  +  M  ++++++ C+S +     +H+   ++ H + W   +G+ Y +E E+  R  
Sbjct: 3   EQQTVQRMKWLLLVLLGCSSAMAQ---LHKDPTLDHHWDLWKKTYGKQYTEENEEVTRRF 59

Query: 65  IFKQNLEYIEKANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPS 121
           I+++NL+Y+   N E   G  +Y LG N  +D+T+EE   L +     VPS   Q  R  
Sbjct: 60  IWEKNLKYVMLHNLEHSMGMHSYDLGMNHLADMTSEEVMLLMSSLR--VPS---QWQRNV 114

Query: 122 TFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQ 181
           TFK      +P S+DWR+KG VT +K QG CGSCWAFSAV A+E   ++  GKL+ LS Q
Sbjct: 115 TFKSNPNQKLPDSMDWRDKGCVTEVKYQGSCGSCWAFSAVGALEAQLKLKTGKLVSLSVQ 174

Query: 182 QLVDCST---DNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATI 238
            LVDCST    N GC+GG M +AF+YII+N G+ +EA YPY+  +G C     K  AAT 
Sbjct: 175 NLVDCSTGKYSNKGCNGGFMTEAFQYIIDNNGIDSEASYPYKAMDGKCQ-YDVKNRAATC 233

Query: 239 SKYEDLPKGDEQALLQAVSNQ-PVSVCVDASGRAFHFYKSGV-LNADCGNNCDHGVAVVG 296
           SKY +LP G+E+AL +AV+N+ PVSV +DAS  +F  Y+SGV  +  C  N +HGV  VG
Sbjct: 234 SKYVELPFGNEEALKEAVANKGPVSVAIDASHPSFFLYRSGVYYDKACTLNVNHGVLAVG 293

Query: 297 FGTAEEENGAKYWLIKNSWGETWGESGYIRILRDAG-LCGIATAASYP 343
           +G     NG  YWL+KNSWG  +GE GYIR+ R++G  CGIA+  SYP
Sbjct: 294 YGNY---NGKDYWLVKNSWGLHFGEQGYIRMARNSGNHCGIASYPSYP 338


>gi|226499806|ref|NP_001151335.1| cysteine protease 1 [Zea mays]
 gi|195645896|gb|ACG42416.1| cysteine protease 1 precursor [Zea mays]
          Length = 258

 Score =  270 bits (689), Expect = 1e-69,   Method: Compositional matrix adjust.
 Identities = 148/266 (55%), Positives = 184/266 (69%), Gaps = 20/266 (7%)

Query: 89  NEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT-----DVPTSIDWREKGAV 143
           NEF+D+TN+EF A+YTG  RPVP+ ++   + + FKY NVT     D   ++DWR+KGAV
Sbjct: 4   NEFADMTNDEFMAMYTGL-RPVPAGAK---KMAGFKYGNVTLSDADDDQQTVDWRQKGAV 59

Query: 144 THIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAF 202
           T IKDQ QCG CWAF+AVAAVEGI QIT G L+ LSEQQ++DC TD N+GC+GG +D AF
Sbjct: 60  TGIKDQRQCGCCWAFAAVAAVEGIHQITTGNLVSLSEQQVLDCDTDGNNGCNGGYIDNAF 119

Query: 203 EYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVS 262
           +YI+ N GLATE  YPY   +  C  Q  + VAA IS Y+D+P GDE AL  AV+NQPVS
Sbjct: 120 QYIVGNGGLATEDAYPYTAAQAMC--QSVQPVAA-ISGYQDVPSGDEAALAAAVANQPVS 176

Query: 263 VCVDASGRAFHFYKSGVLN-ADCGN--NCDHGVAVVGFGTAEEENGAKYWLIKNSWGETW 319
           V +DA    F  Y  GV+  A C    N +H V  VG+GTAE+  G  YWL+KN WG+ W
Sbjct: 177 VAIDA--HNFQLYGGGVMTAASCSTPPNLNHAVTAVGYGTAED--GTPYWLLKNQWGQNW 232

Query: 320 GESGYIRILRDAGLCGIATAASYPVA 345
           GE GY+R+ R A  CG+A  ASYPVA
Sbjct: 233 GEGGYLRLERGANACGVAQQASYPVA 258


>gi|383849553|ref|XP_003700409.1| PREDICTED: cathepsin L-like [Megachile rotundata]
          Length = 343

 Score =  270 bits (689), Expect = 1e-69,   Method: Compositional matrix adjust.
 Identities = 149/345 (43%), Positives = 212/345 (61%), Gaps = 23/345 (6%)

Query: 15  VIIILVITCASQVVSGRSMHEPSIVEKHEQWM---AQHGRTYKDELEKAMRLNIFKQNLE 71
           +++++VITCA+  V   S  E      +++W+    +H + YK E E+ +R+ I+ +N  
Sbjct: 4   ILLLIVITCAA--VQAISFFELV----NQEWINFKMEHKKCYKHEAEERLRMKIYMKNKL 57

Query: 72  YIEKANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRP--STFKYQ 126
            I + N +      TY+L  N++ D+ N EF+ +  GYNR +    R    P  + F   
Sbjct: 58  QIAQHNCDYELKKVTYRLKINKYGDMLNHEFKNMLNGYNRTINHTLRNERLPVGAAFIEP 117

Query: 127 NVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDC 186
              ++P  +DWR+ GAVT +KDQG CGSCWAFSA  ++EG      G L+ LSEQ L+DC
Sbjct: 118 CNVELPKMVDWRKCGAVTEVKDQGHCGSCWAFSATGSLEGQHFRRTGVLVSLSEQNLIDC 177

Query: 187 STD--NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDL 244
           S    N+GC+GGLMD+AF YI +NKGL TE  YPY  E+  C   K  + A+ +  + D+
Sbjct: 178 SGSYGNNGCNGGLMDQAFSYIKDNKGLDTEKTYPYEGEDDKCRYDKRSSGASDVG-FVDI 236

Query: 245 PKGDEQALLQAVSN-QPVSVCVDASGRAFHFYKSGV-LNADCGN-NCDHGVAVVGFGTAE 301
           P GDEQ L  AV+   PVSV +DAS ++F FY  G+    +C + N DHGV VVG+GT E
Sbjct: 237 PVGDEQKLKAAVATVGPVSVAIDASHQSFQFYSDGIYFEPECSSTNLDHGVLVVGYGTDE 296

Query: 302 EENGAKYWLIKNSWGETWGESGYIRILRDA-GLCGIATAASYPVA 345
           E  G  YW++KNSWGE+WGE GYI++ R+    CGIA++ASYP+ 
Sbjct: 297 E--GRDYWIVKNSWGESWGEKGYIKMARNIDNHCGIASSASYPIV 339


>gi|158300877|ref|XP_001689282.1| AGAP011828-PA [Anopheles gambiae str. PEST]
 gi|157013372|gb|EDO63348.1| AGAP011828-PA [Anopheles gambiae str. PEST]
          Length = 344

 Score =  270 bits (689), Expect = 1e-69,   Method: Compositional matrix adjust.
 Identities = 152/350 (43%), Positives = 209/350 (59%), Gaps = 29/350 (8%)

Query: 14  FVIIILVITCASQVVSGRSMHEPSIVEKHEQWMA---QHGRTYKDELEKAMRLNIFKQNL 70
           F+I+IL    A+  +S   + +       E+W A   QH + Y  E E+ +R+ I+ QN 
Sbjct: 4   FLILILGFVAAANAISIFELVK-------EEWTAFKLQHRKKYDSETEERIRMKIYVQNK 56

Query: 71  EYIEKANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVS-------RQSSRP 120
             I K N+    G   ++L  N+++DL +EEF     G+NR V           +    P
Sbjct: 57  HKIAKHNQRYDLGQEKFRLRVNKYADLLHEEFVHTLNGFNRSVSGKGQLLRGELKPIEEP 116

Query: 121 STFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSE 180
            T+      DVPT++DWR KGAVT +KDQG CGSCW+FSA  A+EG      GKL+ LSE
Sbjct: 117 VTWIEPANVDVPTAMDWRTKGAVTQVKDQGHCGSCWSFSATGALEGQHFRKTGKLVSLSE 176

Query: 181 QQLVDCSTD--NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATI 238
           Q LVDCS    N+GC+GG+MD AF+YI +NKG+ TE  YPY   +  C +   KAV AT 
Sbjct: 177 QNLVDCSQKYGNNGCNGGMMDFAFQYIKDNKGIDTEKSYPYEAIDDEC-HYNPKAVGATD 235

Query: 239 SKYEDLPKGDEQALLQAVSN-QPVSVCVDASGRAFHFYKSGV-LNADCGN-NCDHGVAVV 295
             + D+P+G+E+AL++A++   PVSV +DAS  +F FY  GV     C +   DHGV  V
Sbjct: 236 KGFVDIPQGNEKALMKALATVGPVSVAIDASHESFQFYSEGVYYEPQCDSEQLDHGVLAV 295

Query: 296 GFGTAEEENGAKYWLIKNSWGETWGESGYIRILRDA-GLCGIATAASYPV 344
           G+GT E+  G  YWL+KNSWG TWG+ GY+++ R+    CGIAT ASYP+
Sbjct: 296 GYGTTED--GEDYWLVKNSWGTTWGDQGYVKMARNRDNHCGIATTASYPL 343


>gi|357130486|ref|XP_003566879.1| PREDICTED: fruit bromelain-like [Brachypodium distachyon]
          Length = 354

 Score =  269 bits (688), Expect = 1e-69,   Method: Compositional matrix adjust.
 Identities = 143/321 (44%), Positives = 196/321 (61%), Gaps = 15/321 (4%)

Query: 37  SIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTN 96
           ++  +HE+WMA+ GR YKD  EKA R  +F  N  +++  N+ GNRTY LG N FSDLT+
Sbjct: 33  TVASRHERWMARFGRAYKDADEKARRQEVFGANARHVDAVNRSGNRTYTLGLNHFSDLTD 92

Query: 97  EEFRALYTGY--NRPVPS----VSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQG 150
            EF   + GY  ++P P        Q    +T       DVP S+DWR +GAVT IK+Q 
Sbjct: 93  HEFLQQHLGYRHHQPGPGGLLRPEDQDMSKATALADYGQDVPDSVDWRAQGAVTEIKNQR 152

Query: 151 QCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTDNHGCSGGLMDKAFEYIIENKG 210
            CGSCWAF+AVAA EG+ +I  G LI +SEQQ++DC+   + C GG ++ A  Y+  + G
Sbjct: 153 SCGSCWAFAAVAATEGLVKIATGNLISMSEQQVLDCTGGGNTCDGGDINAALRYVAASGG 212

Query: 211 LATEADYPYRHEEGTCDNQKEKAVAATI--SKYEDLPKGDEQALLQAVSNQPVSVCVDAS 268
           L  EA Y Y  ++G C        AA++  +++  L  GDE AL    + QPV+V ++AS
Sbjct: 213 LQPEAAYAYAAQKGACRGASPANSAASVGGARFARL-GGDEGALRGLAAGQPVAVALEAS 271

Query: 269 GRAFHFYKSGVL--NADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIR 326
              F  YKSGV   +A CG   +HGV VVG+G AE+++G +YW++KN WG  WGE GY+R
Sbjct: 272 EPDFRHYKSGVYAGSASCGRRLNHGVTVVGYG-AEDDSGDEYWVVKNQWGTLWGEKGYMR 330

Query: 327 ILRD--AGL-CGIATAASYPV 344
           + R   AG  CGIA+ A YP 
Sbjct: 331 VARGDVAGANCGIASYAYYPT 351


>gi|222641485|gb|EEE69617.1| hypothetical protein OsJ_29194 [Oryza sativa Japonica Group]
          Length = 360

 Score =  269 bits (688), Expect = 1e-69,   Method: Compositional matrix adjust.
 Identities = 134/302 (44%), Positives = 182/302 (60%), Gaps = 11/302 (3%)

Query: 38  IVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNE 97
           ++++   W   H R+Y    E   R +++++N E+I+  N  G+ TY+L  NEF+DLT E
Sbjct: 47  MMDRFRAWQGAHNRSYPSAEEALQRFDVYRRNAEFIDAVNLRGDLTYQLAENEFADLTEE 106

Query: 98  EFRALYTGY---NRPVPS---VSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQ-G 150
           EF A YTGY   + PV      +      ++F Y+   DVP S+DWR +GAV   K Q  
Sbjct: 107 EFLATYTGYYAGDGPVDDSVITTGAGDVDASFSYR--VDVPASVDWRAQGAVVPPKSQTS 164

Query: 151 QCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTDNHGCSGGLMDKAFEYIIENKG 210
            C SCWAF   A +E +  I  GKL+ LSEQQLVDC + + GC+ G   +A+++++EN G
Sbjct: 165 TCSSCWAFVTAATIESLNMIKTGKLVSLSEQQLVDCDSYDGGCNLGSYGRAYKWVVENGG 224

Query: 211 LATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGR 270
           L TEADYPY    G C+  K    AA I+ +  +P  +E AL  AV+ QPV+V ++  G 
Sbjct: 225 LTTEADYPYTARRGPCNRAKSAHHAAKITGFGKVPPRNEAALQAAVARQPVAVAIEV-GS 283

Query: 271 AFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD 330
              FYK GV    CG    H V VVG+GT +  +GAKYW IKNSWG++WGE GYIRILRD
Sbjct: 284 GMQFYKGGVYTGPCGTRLAHAVTVVGYGT-DASSGAKYWTIKNSWGQSWGERGYIRILRD 342

Query: 331 AG 332
            G
Sbjct: 343 VG 344


>gi|4469153|emb|CAB38314.1| chymopapain isoform II [Carica papaya]
          Length = 352

 Score =  269 bits (688), Expect = 1e-69,   Method: Compositional matrix adjust.
 Identities = 138/345 (40%), Positives = 202/345 (58%), Gaps = 14/345 (4%)

Query: 7   KSFIIPMFVIIILVITCASQVVSGRSMHEPSIVEK----HEQWMAQHGRTYKDELEKAMR 62
           K   +   +II + ++ A     G S  + + +E+     + WM +H + Y+   EK  R
Sbjct: 9   KIIFLATCLIIHMGLSSADFYTVGYSQDDLTSIERLIQLFDSWMLKHNKIYESIDEKIYR 68

Query: 63  LNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPST 122
             IF+ NL YI++ NK+ N +Y LG N F+DL+N+EF+  Y G+         +      
Sbjct: 69  FEIFRDNLMYIDETNKK-NNSYWLGLNGFADLSNDEFKKKYVGF-VAEDFTGLEHFDNED 126

Query: 123 FKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQ 182
           F Y++VT+ P SIDWR KGAVT +K+QG CGSCWAFS +A VEGI +I  G L+ELSEQ+
Sbjct: 127 FTYKHVTNYPQSIDWRAKGAVTPVKNQGACGSCWAFSTIATVEGINKIVTGNLLELSEQE 186

Query: 183 LVDCSTDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYE 242
           LVDC   ++GC GG    + +Y + N G+ T   YPY+ ++  C    +      I+ Y+
Sbjct: 187 LVDCDKHSYGCKGGYQTTSLQY-VANNGVHTSKVYPYQAKQYKCRATDKPGPKVKITGYK 245

Query: 243 DLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEE 302
            +P   E + L A++NQP+S  V+A G+ F  YKSGV +  CG   DH V  VG+GT++ 
Sbjct: 246 RVPSNCETSFLGALANQPLSFLVEAGGKPFQLYKSGVFDGPCGTKLDHAVTAVGYGTSDG 305

Query: 303 ENGAKYWLIKNSWGETWGESGYIRILRDA----GLCGIATAASYP 343
           +N   Y +IKNSWG  WGE GY+R+ R +    G CG+  ++ YP
Sbjct: 306 KN---YIIIKNSWGPNWGEKGYMRLKRQSGNSQGTCGVYKSSYYP 347


>gi|242088413|ref|XP_002440039.1| hypothetical protein SORBIDRAFT_09g024940 [Sorghum bicolor]
 gi|241945324|gb|EES18469.1| hypothetical protein SORBIDRAFT_09g024940 [Sorghum bicolor]
          Length = 463

 Score =  269 bits (688), Expect = 1e-69,   Method: Compositional matrix adjust.
 Identities = 136/315 (43%), Positives = 186/315 (59%), Gaps = 18/315 (5%)

Query: 43  EQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNR--------TYKLGTNEFSDL 94
           + W A+HG+ Y    E+A RL +F  N  ++   N   N         +Y L  N F+DL
Sbjct: 42  DAWCAEHGKAYATPEERAARLAVFADNAAFVAAHNARVNAAGGGGAPPSYTLALNAFADL 101

Query: 95  TNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQN-VTDVPTSIDWREKGAVTHIKDQGQCG 153
           T+EEFRA   G      +  R  + P        +  VP ++DWRE GAVT +KDQG CG
Sbjct: 102 THEEFRAARLGRIAAGAAALRSPAAPVYRGLDGGLGAVPDALDWRENGAVTKVKDQGSCG 161

Query: 154 SCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGLA 212
           +CW+FSA  A+EGI +I  G L+ LSEQ+L+DC    N GC GGLMD A++++++N G+ 
Sbjct: 162 ACWSFSATGAMEGINKIKTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYKFVVKNGGID 221

Query: 213 TEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAF 272
           TE DYPYR  +GTC+  K K    TI  Y D+P   E  LLQAV+ QPVSV +  S RAF
Sbjct: 222 TEEDYPYREADGTCNKNKLKKRIVTIDGYSDVPSNKEDLLLQAVAQQPVSVGICGSARAF 281

Query: 273 HFY-KSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRDA 331
             Y + G+ +  C  + DH V +VG+G+   E G  YW++KNSWGE+WG  GY+ + R+ 
Sbjct: 282 QLYSQQGIFDGPCPTSLDHAVLIVGYGS---EGGKDYWIVKNSWGESWGMKGYMHMHRNT 338

Query: 332 ----GLCGIATAASY 342
               G+CGI   AS+
Sbjct: 339 GDSKGVCGINMMASF 353


>gi|351705687|gb|EHB08606.1| Cathepsin S [Heterocephalus glaber]
          Length = 331

 Score =  269 bits (688), Expect = 1e-69,   Method: Compositional matrix adjust.
 Identities = 147/335 (43%), Positives = 208/335 (62%), Gaps = 19/335 (5%)

Query: 19  LVITCASQVVSGRSMHEPSIVEKHEQ-WMAQHGRTYKDELEKAMRLNIFKQNLEYIEKAN 77
           L   C +  ++G  + +  +++ H   W   +G+ Y+++ E+ +R  I+++NL+++   N
Sbjct: 4   LAWVCVTCSLAGAQLQQDPMLDYHWHLWKKTYGKHYQEKNEEQVRRLIWEKNLKFVMLHN 63

Query: 78  KE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTS 134
            E   G  +Y LG N   D+T+EE R+L +    P     RQ  R  T+K      +P S
Sbjct: 64  LEHSMGMHSYDLGMNHLGDMTSEEVRSLMSSLRVP-----RQWLRNVTYKSDPNQKLPDS 118

Query: 135 IDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD---NH 191
           +DWREKG VT +K QG CGSCWAFSAV A+EG  ++  GKL+ LS Q LVDCST+   N 
Sbjct: 119 VDWREKGCVTEVKYQGACGSCWAFSAVGALEGQLKLKTGKLVSLSAQNLVDCSTEKYRNK 178

Query: 192 GCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQA 251
           GCSGG M +AF+Y+I+N G+ +E  YPY+  +  C +   K  AAT S+Y +LP G E+A
Sbjct: 179 GCSGGFMTEAFQYVIDNNGIDSETSYPYKATDEKC-HYDSKNRAATCSRYTELPYGSEEA 237

Query: 252 LLQAVSNQ-PVSVCVDASGRAFHFYKSGVL-NADCGNNCDHGVAVVGFGTAEEENGAKYW 309
           L +AV+N+ PVSV VDAS  +F  YK+GV  +  C  N  HGV  VG+G     NG  YW
Sbjct: 238 LKEAVANKGPVSVAVDASRPSFFLYKNGVYDDPSCTQNVTHGVLAVGYGNL---NGKDYW 294

Query: 310 LIKNSWGETWGESGYIRILRDAG-LCGIATAASYP 343
           L+KNSWG  +G+ GYIR+ R+ G  CGIA+ +SYP
Sbjct: 295 LVKNSWGLYFGDQGYIRMARNKGNHCGIASYSSYP 329


>gi|55740402|gb|AAV63977.1| cathepsin L precursor [Artemia franciscana]
          Length = 338

 Score =  269 bits (687), Expect = 1e-69,   Method: Compositional matrix adjust.
 Identities = 149/343 (43%), Positives = 207/343 (60%), Gaps = 15/343 (4%)

Query: 11  IPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNL 70
           +    +I L+     Q+ +  S+      E H  + A H + Y  +LE+  R+ I+ +N 
Sbjct: 1   MKQITLIFLLGAVLVQLSAALSLTNLLADEWH-LFKATHKKEYPSQLEEKFRMKIYLENK 59

Query: 71  EYIEKAN---KEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQN 127
             + K N   ++G ++Y++  N+F DL + EFR++  GY     + SR  S  +  +  N
Sbjct: 60  HKVAKHNILYEKGEKSYQVAMNKFGDLLHHEFRSIMNGYQHKKQNSSRAESTFTFMEPAN 119

Query: 128 VTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCS 187
           V +VP S+DWREKGA+T +KDQGQCGSCWAFS+  A+EG T    GKLI LSEQ L+DCS
Sbjct: 120 V-EVPESVDWREKGAITPVKDQGQCGSCWAFSSTGALEGQTFRKTGKLISLSEQNLIDCS 178

Query: 188 TD--NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLP 245
               N GC+GGLMD+AF+YI +NKG+ TE  YPY  E+  C     +   A    + D+P
Sbjct: 179 GKYGNEGCNGGLMDQAFQYIKDNKGIDTENTYPYEAEDDVC-RYNPRNRGAVDRGFVDIP 237

Query: 246 KGDEQALLQAVSN-QPVSVCVDASGRAFHFYKSGV-LNADC-GNNCDHGVAVVGFGTAEE 302
            G+E  L  AV+   PVSV +DAS  +F FY  GV     C  ++ DHGV VVG+G+   
Sbjct: 238 SGEEDKLKAAVATVGPVSVAIDASHESFQFYSKGVYYEPSCDSDDLDHGVLVVGYGS--- 294

Query: 303 ENGAKYWLIKNSWGETWGESGYIRILRD-AGLCGIATAASYPV 344
           +NG  YWL+KNSW E WG+ GYI+I R+    CG+ATAASYP+
Sbjct: 295 DNGKDYWLVKNSWSEHWGDEGYIKIARNRKNHCGVATAASYPL 337


>gi|410923307|ref|XP_003975123.1| PREDICTED: cathepsin L1-like [Takifugu rubripes]
          Length = 336

 Score =  269 bits (687), Expect = 2e-69,   Method: Compositional matrix adjust.
 Identities = 152/343 (44%), Positives = 211/343 (61%), Gaps = 19/343 (5%)

Query: 13  MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEY 72
           MF +++L + C +  +S  S+ +P + E    W   H + Y  E E+  R  ++++NL+ 
Sbjct: 1   MFPVVVLAL-CVTAALSAPSL-DPQLDEHWNLWKDWHSKKYH-EKEEGWRRMVWEKNLKK 57

Query: 73  IEKANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
           IE  N E   G  TY LG N F D+T+EEFR +  GY       S++  R S F   N  
Sbjct: 58  IELHNLEHSMGKHTYSLGMNHFGDMTHEEFRQIMNGYKLK----SQRKLRGSLFMEPNFL 113

Query: 130 DVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD 189
           + P S+DWR+KG VT +KDQGQCGSCWAFS   A+EG      G L+ LSEQ LVDCS  
Sbjct: 114 EAPRSVDWRDKGYVTPVKDQGQCGSCWAFSTTGAMEGQHFRKTGTLVSLSEQNLVDCSRP 173

Query: 190 --NHGCSGGLMDKAFEYIIENKGLATEADYPYR-HEEGTCDNQKEKAVAATISKYEDLPK 246
             N GC+GGLMD+AF+YI +N GL +E  YPY   +EG C +      +A  + + D+P 
Sbjct: 174 EGNEGCNGGLMDQAFQYIKDNGGLDSEESYPYLGTDEGPC-HYDPSYNSANDTGFVDVPS 232

Query: 247 GDEQALLQAVSN-QPVSVCVDASGRAFHFYKSGV-LNADCGN-NCDHGVAVVGFG-TAEE 302
           G E+AL++AV++  PVSV +DA   +F FY SG+  + +C +   DHGV VVG+G   ++
Sbjct: 233 GSERALMKAVASVGPVSVAIDAGHESFQFYHSGIYYDKECSSEELDHGVLVVGYGFEGKD 292

Query: 303 ENGAKYWLIKNSWGETWGESGYIRILRD-AGLCGIATAASYPV 344
            +G KYW++KNSW E WG+ GYI + +D    CGIATAASYP+
Sbjct: 293 VDGKKYWIVKNSWSENWGDKGYIYMAKDKKNHCGIATAASYPL 335


>gi|417409876|gb|JAA51427.1| Putative cathepsin s, partial [Desmodus rotundus]
          Length = 342

 Score =  269 bits (687), Expect = 2e-69,   Method: Compositional matrix adjust.
 Identities = 149/343 (43%), Positives = 216/343 (62%), Gaps = 22/343 (6%)

Query: 11  IPMFVIIILVITCASQVVSGRSMHEPSIVEKH-EQWMAQHGRTYKDELEKAMRLNIFKQN 69
           I M  ++++++ C+S +     +H+   +++H + W   +G+ YK++ E+ +R  I+++N
Sbjct: 10  IIMKWLVLVLLGCSSAMAQ---LHKDPTLDRHWDLWKKTYGKQYKEKNEEGVRRLIWEKN 66

Query: 70  LEYIEKANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQ 126
           L+++   N E   G  +Y LG N   D+T+EE  AL +     VPS   Q  R  T+K  
Sbjct: 67  LKFVMLHNLEHSMGMHSYDLGMNHLGDMTSEEVTALMSSLR--VPS---QWQRNVTYKSN 121

Query: 127 NVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDC 186
               +P S+DWR+KG VT +K QG CGSCWAFSAV A+E   ++  GKL+ LS Q LVDC
Sbjct: 122 PNQKLPDSVDWRDKGCVTDVKYQGSCGSCWAFSAVGALEAQVKLKTGKLVSLSAQNLVDC 181

Query: 187 ST---DNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYED 243
           S     N GC+GG M +AF+YII+N G+ +EA YPY+  +G C     K  AAT S+Y +
Sbjct: 182 SVGKYSNRGCNGGFMTEAFQYIIDNNGIESEASYPYKAMDGKC-QYDSKYRAATCSRYTE 240

Query: 244 LPKGDEQALLQAVSNQ-PVSVCVDASGRAFHFYKSGV-LNADCGNNCDHGVAVVGFGTAE 301
           LP+  E AL +AV+N+ PVSV +DAS  +F  Y+SGV  +  C  + +HGV VVG+G   
Sbjct: 241 LPEDSEDALKEAVANKGPVSVAIDASHPSFFLYRSGVYYDPACTLHVNHGVLVVGYGNL- 299

Query: 302 EENGAKYWLIKNSWGETWGESGYIRILRDAG-LCGIATAASYP 343
             NG  YWL+KNSWG  +G+ GYIR+ R++G  CGIA+ ASYP
Sbjct: 300 --NGKDYWLVKNSWGLHFGDQGYIRMARNSGNHCGIASYASYP 340


>gi|357160095|ref|XP_003578656.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP2-like
           [Brachypodium distachyon]
          Length = 377

 Score =  269 bits (687), Expect = 2e-69,   Method: Compositional matrix adjust.
 Identities = 148/339 (43%), Positives = 200/339 (58%), Gaps = 33/339 (9%)

Query: 35  EPSIVE----KHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKE--GNRTYKLGT 88
           +P+I++    + ++W A+HGR Y    E+  RL ++ +N+ YIE AN +     TY+LG 
Sbjct: 42  DPTILQTMAPRFQRWKAEHGRAYATRDEELRRLRVYARNVRYIEAANGDPAAGLTYQLGE 101

Query: 89  NEFSDLTNEEFRALYTGYNRPVPSVSRQ----------SSRPSTFK------YQNVTDV- 131
             ++DLT +EF A+YT    P P +S            ++R           Y NV+   
Sbjct: 102 TAYTDLTADEFTAMYT---SPSPVLSAHDDEAAGAMMITTRAGAVDAGGQQVYFNVSTAG 158

Query: 132 -PTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTDN 190
            P S+DWR KGAVT +K+QG+CGSCWAFS VA VEGI QI  G LI LSEQ+LVDC T +
Sbjct: 159 APASVDWRAKGAVTEVKNQGRCGSCWAFSTVAVVEGIHQIRTGNLISLSEQELVDCDTLD 218

Query: 191 HGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQ 250
           +GC GG+   A E+I  N G+ATEADYPY  ++G C   K    AA IS +  +    E 
Sbjct: 219 YGCDGGVSYHALEWIASNGGIATEADYPYTGKDGACVANKLPLHAAAISGFARVATRSEP 278

Query: 251 ALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWL 310
           +L  AV+ QPV+V ++A G  F  Y  GV N  CG   +HGV VV     EE +G KYW+
Sbjct: 279 SLANAVAAQPVAVSIEAGGANFQHYVKGVYNGPCGTRLNHGVTVV-GYGEEEGDGEKYWI 337

Query: 311 IKNSWGETWGESGYIRILRDA-----GLCGIATAASYPV 344
           +KNSWG+ WG+ GY R+ +D      GLCGIA   S+P+
Sbjct: 338 VKNSWGKKWGDGGYFRMKKDVAGKPEGLCGIAIRPSFPL 376


>gi|4731372|gb|AAD28476.1|AF133838_1 papain-like cysteine protease [Sandersonia aurantiaca]
          Length = 370

 Score =  268 bits (686), Expect = 2e-69,   Method: Compositional matrix adjust.
 Identities = 123/227 (54%), Positives = 166/227 (73%), Gaps = 8/227 (3%)

Query: 123 FKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQ 182
           ++Y+    +P S+DWREKGAV  IKDQG CGSCWAFS +A+VEGI +I  G LI LSEQ+
Sbjct: 33  YRYRAGDALPDSVDWREKGAVVPIKDQGGCGSCWAFSTIASVEGINKIVTGDLISLSEQE 92

Query: 183 LVDCS-TDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKY 241
           LVDC  T N GC+GGLMD AF++II+N G+ TE DYPY  ++G CD+ ++ A   +I+ Y
Sbjct: 93  LVDCDKTYNDGCNGGLMDYAFQFIIDNGGIDTEKDYPYTEQDGRCDSYRKNAKVVSINSY 152

Query: 242 EDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAE 301
           ED+P  DEQAL +A ++QP++V +D  GR+F  Y SG+    CG + DHGV VVG+G+  
Sbjct: 153 EDVPVNDEQALKKAAASQPIAVAIDGGGRSFQLYNSGIFTGKCGTSLDHGVTVVGYGS-- 210

Query: 302 EENGAKYWLIKNSWGETWGESGYIRILRD----AGLCGIATAASYPV 344
            E+G  YW+++NSWGE+WGE GYIR+ R+    +G+CGIA  ASYP+
Sbjct: 211 -ESGKDYWIVRNSWGESWGEKGYIRMARNIDSPSGICGIAMEASYPI 256


>gi|226509942|ref|NP_001146834.1| cysteine protease precursor [Zea mays]
 gi|159506725|gb|ABW97700.1| cysteine protease [Zea mays]
 gi|414867308|tpg|DAA45865.1| TPA: cysteine protease [Zea mays]
          Length = 352

 Score =  268 bits (686), Expect = 2e-69,   Method: Compositional matrix adjust.
 Identities = 140/311 (45%), Positives = 193/311 (62%), Gaps = 8/311 (2%)

Query: 38  IVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNE 97
           ++++   W A + R+Y    E+  R  ++++N+E+IE  N+ GN TY LG N+F+DLT E
Sbjct: 45  MMDRFLSWQATYNRSYPTAEERQRRFQVYRRNIEHIEATNRAGNLTYTLGENQFADLTEE 104

Query: 98  EFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQG-QCGSCW 156
           EF  LYT    PV   + +  R +        D PTS+DWR KGAVT IK+QG  C SCW
Sbjct: 105 EFLDLYTMKGMPVRRDAGKK-RANVSSSAAAVDAPTSVDWRSKGAVTPIKNQGPSCSSCW 163

Query: 157 AFSAVAAVEGITQITRGKLIELSEQQLVDCSTDNHGCSGGLMDKAFEYIIENKGLATEAD 216
           AF   A +E IT+IT GKL+ LSEQ+L+DC   + GC+ G     + ++I+N GL TEA+
Sbjct: 164 AFVTAATIESITKITTGKLVSLSEQELIDCDPYDGGCNLGYFVNGYRWVIQNGGLTTEAN 223

Query: 217 YPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYK 276
           YPY+     C   +    AATIS Y  LP G+ Q L QAV+ QPV+  ++  G +  FY 
Sbjct: 224 YPYQARRYACSRSRAAQHAATISDYVQLPAGEGQ-LQQAVAQQPVAAAIEMGG-SLQFYS 281

Query: 277 SGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD---AGL 333
            GV +  CG   +H + VVG+G A+  +G KYWL+KNSWG++WGE GY+R+ RD    GL
Sbjct: 282 GGVFSGQCGTRMNHAITVVGYG-ADSSSGLKYWLVKNSWGQSWGERGYLRMRRDVGRGGL 340

Query: 334 CGIATAASYPV 344
           CGIA   +YPV
Sbjct: 341 CGIALDLAYPV 351


>gi|221090861|ref|XP_002167224.1| PREDICTED: cathepsin L-like [Hydra magnipapillata]
          Length = 324

 Score =  268 bits (686), Expect = 2e-69,   Method: Compositional matrix adjust.
 Identities = 153/335 (45%), Positives = 207/335 (61%), Gaps = 24/335 (7%)

Query: 16  IIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEK 75
           +++L +T A       ++  P   E   QW   H + Y  + E+ +R  I+K N   I +
Sbjct: 7   LLLLGVTLA------YTIERPVKDESWIQWKMYHNKVYSHDGEETVRYTIWKDNERRIRE 60

Query: 76  ANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSI 135
            N +G   + L  N+F D+TN EF+A + GY      +S +    STF   N    P ++
Sbjct: 61  HNLKGG-DFLLKMNQFGDMTNSEFKA-FNGY------LSHKHVNGSTFLTPNNFVAPDTV 112

Query: 136 DWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST--DNHGC 193
           DWR +G VT +KDQGQCGSCWAFS   ++EG      GKL+ LSEQ LVDCST   N+GC
Sbjct: 113 DWRNEGYVTPVKDQGQCGSCWAFSTTGSLEGQHFKKTGKLVSLSEQNLVDCSTAYGNNGC 172

Query: 194 SGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALL 253
           +GGLMD AF YI ENKG+ +EA YPY  E+G C  +K  +VAAT + + DLP+G+E  L 
Sbjct: 173 NGGLMDNAFTYIKENKGIDSEASYPYTAEDGKCVFKK-PSVAATDTGFVDLPEGNENKLK 231

Query: 254 QAVSN-QPVSVCVDASGRAFHFYKSGVLN-ADCGNN-CDHGVAVVGFGTAEEENGAKYWL 310
           +AV++  P+SV +DAS  +F FY SGV N   C +   DHGV VVG+GT   E+G  YWL
Sbjct: 232 EAVASVGPISVAIDASHESFQFYSSGVYNEPSCSSTELDHGVLVVGYGT---ESGKDYWL 288

Query: 311 IKNSWGETWGESGYIRILRDA-GLCGIATAASYPV 344
           +KNSW  +WG+ GYI++ R+A   CGIAT ASYP+
Sbjct: 289 VKNSWNTSWGDKGYIKMRRNAKNQCGIATKASYPL 323


>gi|330805273|ref|XP_003290609.1| hypothetical protein DICPUDRAFT_92519 [Dictyostelium purpureum]
 gi|325079248|gb|EGC32857.1| hypothetical protein DICPUDRAFT_92519 [Dictyostelium purpureum]
          Length = 333

 Score =  268 bits (685), Expect = 3e-69,   Method: Compositional matrix adjust.
 Identities = 147/307 (47%), Positives = 191/307 (62%), Gaps = 16/307 (5%)

Query: 45  WMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYT 104
           WM +H R Y  E E   R   FK+N+++I K N + + T  LG  +F+DLTNEE++  Y 
Sbjct: 36  WMRKHDRAYSHE-EFTDRYQAFKENMDFIHKWNSQESDTV-LGLTKFADLTNEEYKKHYL 93

Query: 105 GYNRPVP-SVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAA 163
           G    V  +++        FK+      P SIDWREKGAV+ +KDQGQCGSCW+FS   A
Sbjct: 94  GIKVNVKKNLNAAQKGLKFFKFTG----PDSIDWREKGAVSQVKDQGQCGSCWSFSTTGA 149

Query: 164 VEGITQITRGKLIELSEQQLVDCSTD--NHGCSGGLMDKAFEYIIENKGLATEADYPYRH 221
           VEG  QI  G ++ LSEQ LVDCS    N GC GGLM  AFEYII+N G+ATE+ YPY  
Sbjct: 150 VEGAHQIKSGNMVSLSEQNLVDCSGQYGNQGCEGGLMVNAFEYIIDNGGIATESSYPYTA 209

Query: 222 EEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLN 281
            +G C   K     A I  Y+++P+G+E +L  A++ QPVSV +DAS  +F  Y SGV +
Sbjct: 210 AQGRCKFTKSMN-GANIIGYKEIPQGEEDSLTAALAKQPVSVAIDASHMSFQLYSSGVYD 268

Query: 282 --ADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRDA-GLCGIAT 338
             A      DHGV  VG+GT E   G  Y++IKNSWG TWG+ GYI + R+A   CG+AT
Sbjct: 269 EPACSSEALDHGVLAVGYGTLE---GKDYYIIKNSWGPTWGQDGYIFMSRNAQNQCGVAT 325

Query: 339 AASYPVA 345
            ASYP++
Sbjct: 326 MASYPIS 332


>gi|307192137|gb|EFN75465.1| Cathepsin L [Harpegnathos saltator]
          Length = 339

 Score =  268 bits (685), Expect = 3e-69,   Method: Compositional matrix adjust.
 Identities = 147/341 (43%), Positives = 208/341 (60%), Gaps = 18/341 (5%)

Query: 15  VIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIE 74
           + ++L I  A+Q +S  ++    + E+   +   H + Y  ++E++ R+ IF +N   I 
Sbjct: 5   IFLLLGILAAAQAISFFNL----VTEEWNTFKVTHRKAYDSKIEESFRMKIFMENWHKIA 60

Query: 75  KANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRP--STFKYQNVT 129
             N++      +YKLG N++ D+ + EF     G+N+ V +  R   RP  S F      
Sbjct: 61  LHNQKYELNEVSYKLGMNKYGDMLHHEFINTLNGFNKSVSAQLRAQRRPIGSRFIEPANV 120

Query: 130 DVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCS-- 187
           ++P+S+DWR  GAVT IKDQG CGSCW+FSA  A+EG      GKL+ LSEQ L+DCS  
Sbjct: 121 EIPSSVDWRTHGAVTPIKDQGHCGSCWSFSATGALEGQHYRITGKLVSLSEQNLIDCSGR 180

Query: 188 TDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKG 247
             N+GC+GGLMD+AF+YI +N GL TE  YPY  E   C     +   AT S Y D+P+G
Sbjct: 181 YGNNGCNGGLMDQAFQYIKDNHGLDTEISYPYEAENDKC-RYNPRNNGATDSGYVDIPEG 239

Query: 248 DEQALLQAVSN-QPVSVCVDASGRAFHFYKSGV-LNADCGN-NCDHGVAVVGFGTAEEEN 304
           +E+ L  AV+   PVSV +DAS  +F FY+ GV     C + N DHGV VVG+GT  ++N
Sbjct: 240 NEKKLKAAVATIGPVSVAIDASAESFQFYREGVYYEPRCSSENLDHGVLVVGYGT--DDN 297

Query: 305 GAKYWLIKNSWGETWGESGYIRILRDA-GLCGIATAASYPV 344
              YWL+KNSWG TWG+ GYI++ R+    CGIA++ASYP+
Sbjct: 298 DQDYWLVKNSWGVTWGDEGYIKMARNKDNHCGIASSASYPL 338


>gi|125552927|gb|EAY98636.1| hypothetical protein OsI_20560 [Oryza sativa Indica Group]
          Length = 449

 Score =  268 bits (685), Expect = 3e-69,   Method: Compositional matrix adjust.
 Identities = 136/307 (44%), Positives = 186/307 (60%), Gaps = 10/307 (3%)

Query: 41  KHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFR 100
           + E W A+HGR+Y    E+A RL  F  N  ++  A+     +Y L  N F+DLT++EFR
Sbjct: 37  QFEAWCAEHGRSYATPGERAARLAAFADNAAFV-AAHNGAPASYALALNAFADLTHDEFR 95

Query: 101 ALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSA 160
           A   G         R    P       V  VP ++DWR+ GAVT +KDQG CG+CW+FSA
Sbjct: 96  AARLGRLAAA-GPGRDGGAPYLGVDGGVGAVPDAVDWRQSGAVTKVKDQGSCGACWSFSA 154

Query: 161 VAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGLATEADYPY 219
             A+EGI +I  G LI LSEQ+L+DC    N GC GGLMD A++++++N G+ TEADYPY
Sbjct: 155 TGAMEGINKIKTGSLISLSEQELIDCDRSYNSGCGGGLMDYAYKFVVKNGGIDTEADYPY 214

Query: 220 RHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGV 279
           R  +GTC+  K K    TI  Y+D+P  +E  LLQAV+ QPVSV +  S RAF  Y  G+
Sbjct: 215 RETDGTCNKNKLKRRVVTIDGYKDVPANNEDMLLQAVAQQPVSVGICGSARAFQLYSKGI 274

Query: 280 LNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRDA----GLCG 335
            +  C  + DH + +VG+G+   E G  YW++KNSWGE+WG  GY+ + R+     G+CG
Sbjct: 275 FDGPCPTSLDHAILIVGYGS---EGGKDYWIVKNSWGESWGMKGYMYMHRNTGNSNGVCG 331

Query: 336 IATAASY 342
           I    S+
Sbjct: 332 INQMPSF 338


>gi|344275470|ref|XP_003409535.1| PREDICTED: cathepsin S-like isoform 1 [Loxodonta africana]
          Length = 331

 Score =  268 bits (685), Expect = 3e-69,   Method: Compositional matrix adjust.
 Identities = 150/340 (44%), Positives = 209/340 (61%), Gaps = 20/340 (5%)

Query: 13  MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEY 72
           M  ++ + + C+S +   R   +P++    + W   + + YK++ E+  R  I+++NL++
Sbjct: 1   MKWLLWVALVCSSAMA--RLHKDPTLDNHWDLWKKTYSKQYKEKNEEVARRLIWEKNLKF 58

Query: 73  IEKANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
           +   N E   G  +Y L  N   D+T+EE  +L +     VPS   Q  R  TFK     
Sbjct: 59  VMLHNLEHSMGMHSYDLSMNHLGDMTSEEVMSLMSSLR--VPS---QWQRNVTFKSNPNQ 113

Query: 130 DVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD 189
            +P S+DWREKG VT +K QG CG+CWAFSAV A+E   ++  GKL+ LS Q LVDCS +
Sbjct: 114 KLPDSLDWREKGCVTDVKYQGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSGE 173

Query: 190 ---NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPK 246
              N GC+GG M +AF+YII+N G+ +EA YPY+  +G C     K  AAT SKY +LP 
Sbjct: 174 KYSNKGCNGGFMTRAFQYIIDNNGIDSEASYPYKATDGKCQ-YDPKNRAATCSKYTELPY 232

Query: 247 GDEQALLQAVSNQ-PVSVCVDASGRAFHFYKSGV-LNADCGNNCDHGVAVVGFGTAEEEN 304
           G E AL +AV+N+ PVSV +DAS  +F  YKSGV  +  C +N +HGV VVG+G     N
Sbjct: 233 GSEDALKEAVANKGPVSVGIDASRPSFFLYKSGVYYDPSCTDNVNHGVLVVGYGNL---N 289

Query: 305 GAKYWLIKNSWGETWGESGYIRILRDAG-LCGIATAASYP 343
           G  YWL+KNSWG  +GE GYIR+ R++G  CGIA+  SYP
Sbjct: 290 GKDYWLVKNSWGLNFGEQGYIRMARNSGNHCGIASFPSYP 329


>gi|195429415|ref|XP_002062758.1| GK19626 [Drosophila willistoni]
 gi|194158843|gb|EDW73744.1| GK19626 [Drosophila willistoni]
          Length = 341

 Score =  268 bits (685), Expect = 3e-69,   Method: Compositional matrix adjust.
 Identities = 143/343 (41%), Positives = 209/343 (60%), Gaps = 19/343 (5%)

Query: 14  FVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYI 73
            + +++ +   +Q VS   +    + E+   +  +H + Y D  E+  R+ IF +N  +I
Sbjct: 5   LITLLIALVAMTQAVSYSEL----VREEWNTFKLEHRKNYADSTEETFRMKIFNENKHHI 60

Query: 74  EKANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSR---QSSRPSTFKYQN 127
            K N+    G  +YKL  N+++D+ + EFR    G+N  +    R   +S    TF    
Sbjct: 61  AKHNQRYATGEVSYKLALNKYADMLHHEFRETMNGFNYTLHKQLRSTDESFTGVTFISPE 120

Query: 128 VTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCS 187
              +PT++DWR KGAVT +KDQG CGSCWAFS+  A+EG      G L+ LSEQ LVDCS
Sbjct: 121 HVKLPTAVDWRTKGAVTEVKDQGHCGSCWAFSSTGAIEGQHFRKSGTLVSLSEQNLVDCS 180

Query: 188 TD--NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLP 245
           T   N+GC+GGLMD AF Y+ +N G+ TE  Y Y   + +C   K  ++ AT   + D+P
Sbjct: 181 TKYGNNGCNGGLMDNAFRYVKDNGGIDTEKSYAYEGIDDSCHFDK-NSIGATDRGFADIP 239

Query: 246 KGDEQALLQAVSN-QPVSVCVDASGRAFHFYKSGVLN-ADC-GNNCDHGVAVVGFGTAEE 302
           +G+E+ L QAV+   PVSV +DAS ++F FY  GV +  +C   N DHGV VVG+GT  E
Sbjct: 240 QGNEKKLAQAVATIGPVSVAIDASQQSFQFYSEGVYDEPNCSAENLDHGVLVVGYGT--E 297

Query: 303 ENGAKYWLIKNSWGETWGESGYIRILRDA-GLCGIATAASYPV 344
           ++G+ YWL+KNSWG TWG+ G+I++ R+    CGIA+A+SYP+
Sbjct: 298 KDGSDYWLVKNSWGTTWGDKGFIKMSRNKENQCGIASASSYPL 340


>gi|440799058|gb|ELR20119.1| cysteine proteinase [Acanthamoeba castellanii str. Neff]
          Length = 401

 Score =  268 bits (685), Expect = 3e-69,   Method: Compositional matrix adjust.
 Identities = 145/322 (45%), Positives = 191/322 (59%), Gaps = 20/322 (6%)

Query: 36  PSIVEKHEQ-----WMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKE--GNRTYKLGT 88
           P  VE  EQ     WM  H ++Y  +     R  I+K N  +I   NK+     ++ +  
Sbjct: 84  PRDVELEEQRAFTEWMRTHRKSYHHD-HFLPRFEIWKTNNRWITHWNKKHANASSFTVAI 142

Query: 89  NEFSDLTNEEFRALYTGYNR-PVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIK 147
           N+F DLT++EF  LY G +    P  S +  RP   ++ N   +P S DWR+KG V+ +K
Sbjct: 143 NQFGDLTSDEFNRLYNGLHVFSAPKASEKVERPR--QWANTAGIPESGDWRQKGVVSRVK 200

Query: 148 DQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST---DNHGCSGGLMDKAFEY 204
           DQG CGSCWAFS   + EGI  IT  +L+ LSEQ LVDC+T   DN+GC+GG MD AF Y
Sbjct: 201 DQGMCGSCWAFSTTGSTEGINAITTSRLVPLSEQNLVDCATAAYDNYGCNGGFMDNAFRY 260

Query: 205 IIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVC 264
           II+NKG+ +EA YPY   +G C    +          + LPKGDE+ALL A + QP+SV 
Sbjct: 261 IIDNKGIDSEASYPYVAADGQCRFNPKTVYGGKGGTLKSLPKGDEKALLVAAARQPISVG 320

Query: 265 VDASGRAFHFYKSGVLN-ADCGNN-CDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGES 322
           +DA   +F FY  GV N  +C +   +HGV +VG+G    E G  YWL+KNSWG+TWG  
Sbjct: 321 IDAGRPSFQFYSKGVYNEPECSSTELNHGVLIVGWGV---ERGQAYWLVKNSWGQTWGMD 377

Query: 323 GYIRILRDA-GLCGIATAASYP 343
           GYI++ RD    CGIAT ASYP
Sbjct: 378 GYIKMSRDKNNQCGIATLASYP 399


>gi|344953542|gb|AEN28617.1| cathepsin L-like cysteine protease [Epinephelus coioides]
          Length = 336

 Score =  268 bits (685), Expect = 3e-69,   Method: Compositional matrix adjust.
 Identities = 148/345 (42%), Positives = 210/345 (60%), Gaps = 20/345 (5%)

Query: 10  IIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQN 69
           ++P+ V+ +    C S  +S  S+ +P + +  E W + H + Y  E E+  R  ++++N
Sbjct: 1   MLPLAVVAL----CLSAALSAPSL-DPQLDDHWELWKSWHSKKYH-EKEEGWRRMVWEKN 54

Query: 70  LEYIEKANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQ 126
           L+ IE  N E   G  +Y+LG N F D+T+EEFR L  GY R   +     +R S F   
Sbjct: 55  LKKIELHNLEHSMGTHSYRLGMNHFGDMTHEEFRQLMNGYKRKAET----KARGSLFLEP 110

Query: 127 NVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDC 186
           N  + P S+DWR+ G VT +KDQGQCGSCWAFS   A+EG      GKL+ LSEQ LVDC
Sbjct: 111 NFLEAPKSVDWRDNGYVTPVKDQGQCGSCWAFSTTGALEGQHFRKTGKLVSLSEQNLVDC 170

Query: 187 STD--NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDL 244
           S    N GC+GGLMD+AF+Y+ +N+GL +E  YPY   +    +      +   + + D+
Sbjct: 171 SRPEGNEGCNGGLMDQAFQYVKDNQGLDSEDSYPYLGTDDQPCHYDPTYNSVNDTGFVDI 230

Query: 245 PKGDEQALLQAVSN-QPVSVCVDASGRAFHFYKSGV-LNADCGN-NCDHGVAVVGFG-TA 300
           P G E+AL++AV+   PVSV +DA   +F FY+SG+    +C +   DHGV VVG+G   
Sbjct: 231 PSGKERALMKAVAAVGPVSVAIDAGHESFQFYQSGIYYEKECSSEELDHGVLVVGYGFQG 290

Query: 301 EEENGAKYWLIKNSWGETWGESGYIRILRD-AGLCGIATAASYPV 344
           E+ +G KYW++KNSW E WG+ GYI + +D    CGIATAASYP+
Sbjct: 291 EDVDGKKYWIVKNSWSEKWGDKGYIYMAKDRKNHCGIATAASYPL 335


>gi|355763133|gb|EHH62119.1| hypothetical protein EGM_20318 [Macaca fascicularis]
          Length = 331

 Score =  268 bits (685), Expect = 3e-69,   Method: Compositional matrix adjust.
 Identities = 151/341 (44%), Positives = 210/341 (61%), Gaps = 22/341 (6%)

Query: 13  MFVIIILVITCASQVVSGRSMHEPSIVEKH-EQWMAQHGRTYKDELEKAMRLNIFKQNLE 71
           M  +I +++ C+S V     +H+   ++ H   W   +G+ YK++ E+A+R  I+++NL+
Sbjct: 1   MKQLICVLLVCSSAVAQ---LHKDPTLDHHWHLWKKTYGKQYKEKNEEAVRRLIWEKNLK 57

Query: 72  YIEKANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNV 128
           ++   N E   G  +Y LG N   D+T+EE  +L +     VPS   Q  R  T+K    
Sbjct: 58  FVMLHNLEHSMGMHSYDLGMNHLGDMTSEEVMSLMSSLR--VPS---QWQRNITYKSNAN 112

Query: 129 TDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST 188
             +P S+DWREKG VT +K QG CG+CWAFSAV A+E   ++  GKL+ LS Q LVDCST
Sbjct: 113 QILPDSVDWREKGCVTEVKYQGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCST 172

Query: 189 D---NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLP 245
           +   N GC+GG M +AF+YII+N G+ ++A YPY+  +  C     K  AAT SKY +LP
Sbjct: 173 EKYGNKGCNGGFMTRAFQYIIDNNGIDSDASYPYKATDQKCQ-YDSKYRAATCSKYTELP 231

Query: 246 KGDEQALLQAVSNQ-PVSVCVDASGRAFHFYKSGV-LNADCGNNCDHGVAVVGFGTAEEE 303
            G E  L + V+N+ PVSV VDAS  +F  Y+SGV     C  N +HGV VVG+G     
Sbjct: 232 YGREDVLKEVVANKGPVSVGVDASHPSFFLYRSGVYYEPSCTQNVNHGVLVVGYGVL--- 288

Query: 304 NGAKYWLIKNSWGETWGESGYIRILRDAG-LCGIATAASYP 343
           NG +YWL+KNSWG  +GE GYIR+ R+ G  CGIA+  SYP
Sbjct: 289 NGKEYWLVKNSWGRNFGEEGYIRMARNKGNHCGIASFPSYP 329


>gi|41688064|dbj|BAD08618.1| cathepsin L preproprotein [Cyprinus carpio]
          Length = 337

 Score =  268 bits (684), Expect = 4e-69,   Method: Compositional matrix adjust.
 Identities = 149/342 (43%), Positives = 203/342 (59%), Gaps = 16/342 (4%)

Query: 13  MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEY 72
           M V +     C S V +  ++ +  +    EQW   HG+ Y  E E+  R  ++++NL+ 
Sbjct: 1   MRVFLAAFALCLSAVFAAPTL-DKQLDNHWEQWKNWHGKKYH-EKEEGWRRMVWEKNLQK 58

Query: 73  IEKANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
           IE  N E   G  TY+LG N F D+T+EEFR +  GY         +  R S F   N  
Sbjct: 59  IELHNLEHSMGTHTYRLGMNRFGDMTHEEFRQVMNGYKHK----KERRFRGSLFMEPNFL 114

Query: 130 DVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD 189
           +VP S+DWREKG VT +KDQG+CGSCWAFS   A+EG      GKL+ LSEQ LVDCS  
Sbjct: 115 EVPNSLDWREKGYVTPVKDQGECGSCWAFSTTGAMEGQMFRKTGKLVSLSEQNLVDCSRP 174

Query: 190 --NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKG 247
             N GC+GGLMD+AF+YI +  GL +E  YPY   +    +   K  AA  + + D+P G
Sbjct: 175 EGNEGCNGGLMDQAFQYIKDQNGLDSEESYPYVGTDDQPCHYDPKYSAANDTGFVDIPSG 234

Query: 248 DEQALLQAVSN-QPVSVCVDASGRAFHFYKSGV-LNADCGN-NCDHGVAVVGFG-TAEEE 303
            E AL++A++   PVSV +DA   +F FY+SG+    +C +   DHGV  VG+G   E+ 
Sbjct: 235 KEHALMKAIAAVGPVSVAIDAGHESFQFYQSGIYYEKECSSEELDHGVLAVGYGFEGEDV 294

Query: 304 NGAKYWLIKNSWGETWGESGYIRILRDA-GLCGIATAASYPV 344
           +G KYW++KNSW E WG+ GY+ + +D    CGIATAASYP+
Sbjct: 295 DGKKYWIVKNSWSENWGDKGYVYMAKDRHNHCGIATAASYPL 336


>gi|402856105|ref|XP_003892640.1| PREDICTED: cathepsin S isoform 1 [Papio anubis]
          Length = 331

 Score =  267 bits (683), Expect = 4e-69,   Method: Compositional matrix adjust.
 Identities = 150/341 (43%), Positives = 210/341 (61%), Gaps = 22/341 (6%)

Query: 13  MFVIIILVITCASQVVSGRSMHEPSIVEKH-EQWMAQHGRTYKDELEKAMRLNIFKQNLE 71
           M  ++ +++ C+S V     +H+   ++ H   W   +G+ YK++ E+A+R  I+++NL+
Sbjct: 1   MKRLVCVLLVCSSAVAQ---LHKDPTLDHHWHLWKKTYGKQYKEKNEEAVRRLIWEKNLK 57

Query: 72  YIEKANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNV 128
           ++   N E   G  +Y LG N   D+T+EE  +L +     VPS   Q  R  T+K    
Sbjct: 58  FVMLHNLEHSMGMHSYDLGMNHLGDMTSEEVMSLMSSLR--VPS---QWQRNITYKSNPN 112

Query: 129 TDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST 188
             +P S+DWREKG VT +K QG CG+CWAFSAV A+E   ++  GKL+ LS Q LVDCST
Sbjct: 113 QMLPDSVDWREKGCVTEVKYQGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCST 172

Query: 189 D---NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLP 245
           +   N GC+GG M +AF+YII+N G+ ++A YPY+  +  C     K  AAT SKY +LP
Sbjct: 173 EKYGNKGCNGGFMTRAFQYIIDNNGIDSDASYPYKATDQKCQ-YDSKYRAATCSKYTELP 231

Query: 246 KGDEQALLQAVSNQ-PVSVCVDASGRAFHFYKSGV-LNADCGNNCDHGVAVVGFGTAEEE 303
            G E  L + V+N+ PVSV VDAS  +F  Y+SGV     C  N +HGV VVG+G     
Sbjct: 232 YGREDVLKEVVANKGPVSVGVDASHPSFFLYRSGVYYEPSCTQNVNHGVLVVGYGVL--- 288

Query: 304 NGAKYWLIKNSWGETWGESGYIRILRDAG-LCGIATAASYP 343
           NG +YWL+KNSWG  +GE GYIR+ R+ G  CGIA+  SYP
Sbjct: 289 NGKEYWLVKNSWGRNFGEEGYIRMARNKGNHCGIASFPSYP 329


>gi|355558399|gb|EHH15179.1| hypothetical protein EGK_01236 [Macaca mulatta]
 gi|380809986|gb|AFE76868.1| cathepsin S isoform 1 preproprotein [Macaca mulatta]
 gi|383416071|gb|AFH31249.1| cathepsin S isoform 1 preproprotein [Macaca mulatta]
 gi|383416073|gb|AFH31250.1| cathepsin S isoform 1 preproprotein [Macaca mulatta]
 gi|383416075|gb|AFH31251.1| cathepsin S isoform 1 preproprotein [Macaca mulatta]
 gi|383416077|gb|AFH31252.1| cathepsin S isoform 1 preproprotein [Macaca mulatta]
 gi|383416079|gb|AFH31253.1| cathepsin S isoform 1 preproprotein [Macaca mulatta]
          Length = 331

 Score =  267 bits (683), Expect = 5e-69,   Method: Compositional matrix adjust.
 Identities = 150/341 (43%), Positives = 210/341 (61%), Gaps = 22/341 (6%)

Query: 13  MFVIIILVITCASQVVSGRSMHEPSIVEKH-EQWMAQHGRTYKDELEKAMRLNIFKQNLE 71
           M  ++ +++ C+S V     +H+   ++ H   W   +G+ YK++ E+A+R  I+++NL+
Sbjct: 1   MKQLVCVLLVCSSAVAQ---LHKDPTLDHHWHLWKKTYGKQYKEKNEEAVRRLIWEKNLK 57

Query: 72  YIEKANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNV 128
           ++   N E   G  +Y LG N   D+T+EE  +L +     VPS   Q  R  T+K    
Sbjct: 58  FVMLHNLEHSMGMHSYDLGMNHLGDMTSEEVMSLMSSLR--VPS---QWQRNITYKSNAN 112

Query: 129 TDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST 188
             +P S+DWREKG VT +K QG CG+CWAFSAV A+E   ++  GKL+ LS Q LVDCST
Sbjct: 113 QILPDSVDWREKGCVTEVKYQGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCST 172

Query: 189 D---NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLP 245
           +   N GC+GG M +AF+YII+N G+ ++A YPY+  +  C     K  AAT SKY +LP
Sbjct: 173 EKYGNKGCNGGFMTRAFQYIIDNNGIDSDASYPYKATDQKCQ-YDSKYRAATCSKYTELP 231

Query: 246 KGDEQALLQAVSNQ-PVSVCVDASGRAFHFYKSGV-LNADCGNNCDHGVAVVGFGTAEEE 303
            G E  L + V+N+ PVSV VDAS  +F  Y+SGV     C  N +HGV VVG+G     
Sbjct: 232 YGREDVLKEVVANKGPVSVGVDASHPSFFLYRSGVYYEPSCTQNVNHGVLVVGYGVL--- 288

Query: 304 NGAKYWLIKNSWGETWGESGYIRILRDAG-LCGIATAASYP 343
           NG +YWL+KNSWG  +GE GYIR+ R+ G  CGIA+  SYP
Sbjct: 289 NGKEYWLVKNSWGRNFGEEGYIRMARNKGNHCGIASFPSYP 329


>gi|326672302|ref|XP_003199633.1| PREDICTED: cathepsin L1-like [Danio rerio]
 gi|157423549|gb|AAI53506.1| Im:6910535 [Danio rerio]
          Length = 335

 Score =  267 bits (683), Expect = 5e-69,   Method: Compositional matrix adjust.
 Identities = 142/340 (41%), Positives = 208/340 (61%), Gaps = 15/340 (4%)

Query: 14  FVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYI 73
            +  +LV  C S V +  S+ +  + +    W +QHG++Y +++E   R+ I+++NL  I
Sbjct: 1   MMFALLVTLCISAVFTAPSI-DIQLDDHWNSWKSQHGKSYHEDVEVGRRM-IWEENLRKI 58

Query: 74  EKANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTD 130
           E+ N E   GN T+K+G N+F D+TNEEFR    GY +       ++S+ + F   +   
Sbjct: 59  EQHNFEYSYGNHTFKMGMNQFGDMTNEEFRQAMNGYKQD----PNRTSKGALFMEPSFFA 114

Query: 131 VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD- 189
            P  +DWR++G VT +KDQ QCGSCW+FS+  A+EG      GKLI +SEQ LVDCS   
Sbjct: 115 APQQVDWRQRGYVTPVKDQKQCGSCWSFSSTGALEGQLFRKTGKLISMSEQNLVDCSRPQ 174

Query: 190 -NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGD 248
            N GC+GG+MD+AF+Y+ ENKGL +E  YPY   +        +   A I+ + D+PKG+
Sbjct: 175 GNQGCNGGIMDQAFQYVKENKGLDSEQSYPYLARDDLPCRYDPRFNVAKITGFVDIPKGN 234

Query: 249 EQALLQAVSN-QPVSVCVDASGRAFHFYKSGV-LNADCGNNCDHGVAVVGFG-TAEEENG 305
           E AL+ AV+   PVSV +DAS ++  FY+SG+     C +  DH V VVG+G    +  G
Sbjct: 235 ELALMNAVAAVGPVSVAIDASHQSLQFYQSGIYYERACTSRLDHAVLVVGYGYQGADVAG 294

Query: 306 AKYWLIKNSWGETWGESGYIRILRDA-GLCGIATAASYPV 344
            +YW++KNSW + WG+ GYI + +D    CGIAT ASYP+
Sbjct: 295 NRYWIVKNSWSDKWGDKGYIYMAKDKNNHCGIATMASYPL 334


>gi|61368403|gb|AAX43172.1| cathepsin S [synthetic construct]
          Length = 332

 Score =  267 bits (683), Expect = 5e-69,   Method: Compositional matrix adjust.
 Identities = 151/341 (44%), Positives = 211/341 (61%), Gaps = 22/341 (6%)

Query: 13  MFVIIILVITCASQVVSGRSMHEPSIVEKH-EQWMAQHGRTYKDELEKAMRLNIFKQNLE 71
           M  ++ +++ C+S V     +H+   ++ H   W   +G+ YK++ E+A+R  I+++NL+
Sbjct: 1   MKRLVCVLLVCSSAVAQ---LHKDPTLDHHWHLWKKTYGKQYKEKNEEAVRRLIWEKNLK 57

Query: 72  YIEKANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNV 128
           ++   N E   G  +Y LG N   D+T+EE  +L +     VPS   Q  R  T+K    
Sbjct: 58  FVMLHNLEHSMGMHSYDLGMNHLGDMTSEEVMSLMSSLR--VPS---QWQRNITYKSNPN 112

Query: 129 TDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST 188
             +P S+DWREKG VT +K QG CG+CWAFSAV A+E   ++  GKL+ LS Q LVDCST
Sbjct: 113 RILPDSVDWREKGCVTEVKYQGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCST 172

Query: 189 D---NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLP 245
           +   N GC+GG M  AF+YII+NKG+ ++A YPY+  +  C     K  AAT SKY +LP
Sbjct: 173 EKYGNKGCNGGFMTTAFQYIIDNKGIDSDASYPYKAMDQKCQ-YDSKYRAATCSKYTELP 231

Query: 246 KGDEQALLQAVSNQ-PVSVCVDASGRAFHFYKSGV-LNADCGNNCDHGVAVVGFGTAEEE 303
            G E  L +AV+N+ PVSV VDA   +F  Y+SGV     C  N +HGV VVG+G   + 
Sbjct: 232 YGREDVLKEAVANKGPVSVGVDARHPSFFLYRSGVYYEPSCTQNVNHGVLVVGYG---DL 288

Query: 304 NGAKYWLIKNSWGETWGESGYIRILRDAG-LCGIATAASYP 343
           NG +YWL+KNSWG  +GE GYIR+ R+ G  CGIA+  SYP
Sbjct: 289 NGKEYWLVKNSWGHNFGEEGYIRMARNKGNHCGIASFPSYP 329


>gi|303283194|ref|XP_003060888.1| predicted protein [Micromonas pusilla CCMP1545]
 gi|226457239|gb|EEH54538.1| predicted protein [Micromonas pusilla CCMP1545]
          Length = 422

 Score =  267 bits (683), Expect = 5e-69,   Method: Compositional matrix adjust.
 Identities = 148/323 (45%), Positives = 205/323 (63%), Gaps = 19/323 (5%)

Query: 37  SIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIE---KANKEGNRTYKLGTNEFSD 93
           +I  + ++W+A HG+ Y    E+A RL IF  N E++    +A+  G +++ L  N  +D
Sbjct: 65  TIEARFDRWLATHGKAYACPKERAKRLAIFADNAEFVRVHNEAHAAGKKSHWLRLNHLAD 124

Query: 94  LTNEEFRALYTGYNRPVPSVSRQSSRP----STFKYQNVTDVPTSIDWREKGAVTHIKDQ 149
           LT EEF+ +  GY+     V  +SS P    + ++Y +VT  P ++DW  +GAVT +K+Q
Sbjct: 125 LTREEFKHML-GYDASKKRV--ESSSPPVDAANWEYADVTP-PETMDWVSRGAVTPVKNQ 180

Query: 150 GQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST--DNHGCSGGLMDKAFEYIIE 207
           GQCGSCWAFS V AVEG+  +  G LI LSEQ+LV C+    N+GC GGLMD  FE+I+E
Sbjct: 181 GQCGSCWAFSTVGAVEGVVAVKTGDLISLSEQELVSCAKIGGNNGCKGGLMDNGFEWIVE 240

Query: 208 NKGLATEADYPYRHEEGTCD-NQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVD 266
           N+G+  E D+ Y  ++  C+  +K +A AA+I  ++D+P+ DE AL +AVS QPV+V ++
Sbjct: 241 NRGVDDEEDWGYLAKDRRCNWFKKRRAKAASIDGFKDVPRNDEDALKKAVSQQPVAVAIE 300

Query: 267 ASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAK-YWLIKNSWGETWGESGYI 325
           A  R F  Y  GV + +CG N DHGV VVG+G   E  G K YW +KNSWG  WGE GYI
Sbjct: 301 ADHREFQLYSGGVFDGECGTNLDHGVLVVGYGYDGESAGHKHYWTVKNSWGAKWGEEGYI 360

Query: 326 RILR----DAGLCGIATAASYPV 344
           RI R     AG CG+A  ASYP 
Sbjct: 361 RIARGGMGPAGQCGVAMQASYPT 383


>gi|226499884|ref|NP_001148278.1| thiol protease SEN102 precursor [Zea mays]
 gi|195617112|gb|ACG30386.1| thiol protease SEN102 precursor [Zea mays]
          Length = 374

 Score =  267 bits (683), Expect = 5e-69,   Method: Compositional matrix adjust.
 Identities = 142/348 (40%), Positives = 199/348 (57%), Gaps = 33/348 (9%)

Query: 24  ASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNR- 82
           A   +   S  + S++E+ ++W A + ++Y    E+  R  ++ +N+ YIE  N E    
Sbjct: 32  AGDTMGSMSNDDSSMIERFQRWKAAYNKSYATVAEERRRFRVYARNMAYIEATNAEAEAA 91

Query: 83  --TYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFK---------------- 124
             TY+LG   ++DLTN+EF A+YT      P++++  +  S                   
Sbjct: 92  GLTYELGETAYTDLTNQEFMAMYT-----APALAQLPADESVITTRAGPVDAVGGAPGQL 146

Query: 125 --YQNVT-DVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQ 181
             Y N++   P S+DWR  GAVT +K+QG+CGSCWAFS VA VEGI QI  GKL+ LSEQ
Sbjct: 147 PVYVNLSASAPASVDWRASGAVTPVKNQGRCGSCWAFSTVAVVEGIYQIRTGKLVSLSEQ 206

Query: 182 QLVDCSTDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKY 241
           +LVDC T + GC GG+  +A  +I  N G+ TEADYPY      C+  K    A +I+  
Sbjct: 207 ELVDCDTLDDGCDGGISYRALRWIASNGGITTEADYPYTGTTDACNRAKLSHNAVSIAGL 266

Query: 242 EDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAE 301
             +    E +L  AV+ QPV+V ++A G  F  YK GV N  CG N +HGV VVG+G  E
Sbjct: 267 RRVATRSEASLANAVAGQPVAVSIEAGGDNFQHYKKGVYNGPCGTNLNHGVTVVGYGQ-E 325

Query: 302 EENGAKYWLIKNSWGETWGESGYIRILRDA-----GLCGIATAASYPV 344
              G +YW++KNSWG+ WG+ GYIR+ +D      GLCGIA   SYP+
Sbjct: 326 AAAGDRYWIVKNSWGQGWGDDGYIRMKKDVAGKPEGLCGIAIRPSYPL 373


>gi|301116794|ref|XP_002906125.1| cysteine protease family C01A, putative [Phytophthora infestans
           T30-4]
 gi|262107474|gb|EEY65526.1| cysteine protease family C01A, putative [Phytophthora infestans
           T30-4]
          Length = 535

 Score =  267 bits (682), Expect = 6e-69,   Method: Compositional matrix adjust.
 Identities = 145/307 (47%), Positives = 192/307 (62%), Gaps = 13/307 (4%)

Query: 45  WMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRT-YKLGTNEFSDLTNEEFRALY 103
           WM  H  ++ D LE A RL  +  N  YI + N E   T  KL  NEFS ++ EEF+   
Sbjct: 32  WMKTHSVSFSDALEFAKRLENYIANDMYIMEHNLENAWTGVKLDHNEFSSMSFEEFKFKM 91

Query: 104 TGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAA 163
           TGY  P   + ++ +      + +V  VP S+DW++KG VT +K+QG CGSCWAFS   A
Sbjct: 92  TGYVMPEGYLEQRLASRVDNLWSDVQ-VPDSVDWQDKGGVTPVKNQGMCGSCWAFSTTGA 150

Query: 164 VEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGLATEADYPYRHE 222
           VEG   ++ GKL+ LSEQ+LVDC  + + GC+GGLMD AF +I +N G+ +E DY Y+ +
Sbjct: 151 VEGAAFVSSGKLVSLSEQELVDCDHNGDMGCNGGLMDHAFAWIEDNGGICSEDDYEYKAK 210

Query: 223 EGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNA 282
              C +  EK V   IS ++D+   DE AL  AV+ QPVSV ++A  +AF FYKSGV N 
Sbjct: 211 AQVCRDC-EKVV--KISGFQDVNPQDEHALKVAVAQQPVSVAIEADQKAFQFYKSGVFNL 267

Query: 283 DCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD----AGLCGIAT 338
            CG   DHGV  VG+G+   ENG K+W +KNSWG +WGE GYIR+ R+    AG CGIA+
Sbjct: 268 TCGTRLDHGVLAVGYGS---ENGQKFWKVKNSWGSSWGEKGYIRLAREENGPAGQCGIAS 324

Query: 339 AASYPVA 345
             SYP A
Sbjct: 325 VPSYPFA 331


>gi|66270077|gb|AAY43368.1| cysteine protease [Phytophthora infestans]
          Length = 510

 Score =  267 bits (682), Expect = 6e-69,   Method: Compositional matrix adjust.
 Identities = 145/307 (47%), Positives = 192/307 (62%), Gaps = 13/307 (4%)

Query: 45  WMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRT-YKLGTNEFSDLTNEEFRALY 103
           WM  H  ++ D LE A RL  +  N  YI + N E   T  KL  NEFS ++ EEF+   
Sbjct: 32  WMKTHSVSFSDALEFAKRLENYIANDMYIMEHNLENAWTGVKLDHNEFSSMSFEEFKFKM 91

Query: 104 TGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAA 163
           TGY  P   + ++ +      + +V  VP S+DW++KG VT +K+QG CGSCWAFS   A
Sbjct: 92  TGYVMPEGYLEQRLASRVDNLWSDVQ-VPDSVDWQDKGGVTPVKNQGMCGSCWAFSTTGA 150

Query: 164 VEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGLATEADYPYRHE 222
           VEG   ++ GKL+ LSEQ+LVDC  + + GC+GGLMD AF +I +N G+ +E DY Y+ +
Sbjct: 151 VEGAAFVSSGKLVSLSEQELVDCDHNGDMGCNGGLMDHAFAWIEDNGGICSEDDYEYKAK 210

Query: 223 EGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNA 282
              C +  EK V   IS ++D+   DE AL  AV+ QPVSV ++A  +AF FYKSGV N 
Sbjct: 211 AQVCRDC-EKVV--KISGFQDVNPQDEHALKVAVAQQPVSVAIEADQKAFQFYKSGVFNL 267

Query: 283 DCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD----AGLCGIAT 338
            CG   DHGV  VG+G+   ENG K+W +KNSWG +WGE GYIR+ R+    AG CGIA+
Sbjct: 268 TCGTRLDHGVLAVGYGS---ENGQKFWKVKNSWGSSWGEKGYIRLAREENGPAGQCGIAS 324

Query: 339 AASYPVA 345
             SYP A
Sbjct: 325 VPSYPFA 331


>gi|29165304|gb|AAO65603.1| cathepsin L precursor [Hydra vulgaris]
          Length = 324

 Score =  267 bits (682), Expect = 6e-69,   Method: Compositional matrix adjust.
 Identities = 152/335 (45%), Positives = 206/335 (61%), Gaps = 24/335 (7%)

Query: 16  IIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEK 75
           +++L +T A       ++  P   E   QW   H + Y  + E+ +R  I+K N   I +
Sbjct: 7   LLLLGVTLA------YTIERPVKDESWIQWKMYHNKVYSHDGEETVRYTIWKDNERRIRE 60

Query: 76  ANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSI 135
            N +G   + L  N+F D+TN EF+A + GY      +S +    STF   N    P ++
Sbjct: 61  HNLKGGD-FILKMNQFGDMTNSEFKA-FNGY------LSHKHVNGSTFLTPNNFVAPDTV 112

Query: 136 DWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST--DNHGC 193
           DWR +G VT +KDQGQCGSCWAFS   ++EG      GKL+ LSEQ LVDCST   N+GC
Sbjct: 113 DWRNEGYVTPVKDQGQCGSCWAFSTTGSLEGQHFKKTGKLVSLSEQNLVDCSTAYGNNGC 172

Query: 194 SGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALL 253
            GGLMD AF YI ENKG+ +EA YPY  E+G C  +K  +VAAT + + D+P+G+E  L 
Sbjct: 173 DGGLMDNAFTYIKENKGIDSEASYPYTAEDGKCVFKKS-SVAATDTGFVDIPEGNENKLK 231

Query: 254 QAVSN-QPVSVCVDASGRAFHFYKSGVLN-ADCGNN-CDHGVAVVGFGTAEEENGAKYWL 310
           +AV++  P+SV +DAS  +F FY SGV N   C +   DHGV VVG+GT   E+G  YWL
Sbjct: 232 EAVASVGPISVAIDASHESFQFYSSGVYNEPSCSSTELDHGVLVVGYGT---ESGKDYWL 288

Query: 311 IKNSWGETWGESGYIRILRDA-GLCGIATAASYPV 344
           +KNSW  +WG+ GYI++ R+A   CGIAT ASYP+
Sbjct: 289 VKNSWNTSWGDKGYIKMRRNAKNQCGIATKASYPL 323


>gi|195381187|ref|XP_002049336.1| GJ20806 [Drosophila virilis]
 gi|194144133|gb|EDW60529.1| GJ20806 [Drosophila virilis]
          Length = 339

 Score =  267 bits (682), Expect = 6e-69,   Method: Compositional matrix adjust.
 Identities = 149/343 (43%), Positives = 206/343 (60%), Gaps = 21/343 (6%)

Query: 15  VIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIE 74
           +  +L +   +Q VS    +   I E+ + +  +H + Y DE E+  RL IF +N   I 
Sbjct: 4   LFALLALVAVAQAVS----YADVIKEEWQTFKLEHRKNYVDETEERFRLKIFNENKHKIA 59

Query: 75  KANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPS----TFKYQN 127
           K N+    G  ++K+  N+++D+ + EF     G+N  +    R +S PS    TF    
Sbjct: 60  KHNQRYASGEVSFKMAVNKYADMLHHEFHTTMNGFNYTLHKQLR-ASDPSFVGVTFISPE 118

Query: 128 VTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCS 187
              +P S+DWR KGAVT +KDQG CGSCWAFS+  A+EG      G LI LSEQ LVDCS
Sbjct: 119 HVKIPKSVDWRSKGAVTEVKDQGHCGSCWAFSSTGALEGQHFRKAGTLISLSEQNLVDCS 178

Query: 188 TD--NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLP 245
           T   N+GC+GGLMD AF YI +N G+ TE  YPY   + +C   K   + AT     D+P
Sbjct: 179 TKYGNNGCNGGLMDNAFRYIKDNGGIDTEKSYPYEGIDDSCHFNK-ATIGATDRGSVDIP 237

Query: 246 KGDEQALLQAVSN-QPVSVCVDASGRAFHFYKSGVLN-ADCG-NNCDHGVAVVGFGTAEE 302
           +GDE+ + +AV+   PVSV +DAS  +F FY  G+ N   C   N DHGV VVG+GT  +
Sbjct: 238 QGDEKKMAEAVATIGPVSVAIDASHESFQFYSEGIYNEPQCDPQNLDHGVLVVGYGT--D 295

Query: 303 ENGAKYWLIKNSWGETWGESGYIRILRDA-GLCGIATAASYPV 344
           E+G  YWL+KNSWG TWG+ G+I++ R+A   CGIA+A+SYP+
Sbjct: 296 ESGQDYWLVKNSWGTTWGDKGFIKMARNADNQCGIASASSYPL 338


>gi|114559418|ref|XP_001171268.1| PREDICTED: cathepsin S isoform 3 [Pan troglodytes]
 gi|397492866|ref|XP_003817341.1| PREDICTED: cathepsin S isoform 1 [Pan paniscus]
 gi|410225070|gb|JAA09754.1| cathepsin S [Pan troglodytes]
 gi|410251608|gb|JAA13771.1| cathepsin S [Pan troglodytes]
 gi|410328325|gb|JAA33109.1| cathepsin S [Pan troglodytes]
 gi|410328327|gb|JAA33110.1| cathepsin S [Pan troglodytes]
          Length = 331

 Score =  267 bits (682), Expect = 6e-69,   Method: Compositional matrix adjust.
 Identities = 151/341 (44%), Positives = 211/341 (61%), Gaps = 22/341 (6%)

Query: 13  MFVIIILVITCASQVVSGRSMHEPSIVEKH-EQWMAQHGRTYKDELEKAMRLNIFKQNLE 71
           M  ++ +++ C+S V     +H+   ++ H   W   +G+ YK++ E+A+R  I+++NL+
Sbjct: 1   MKRLVCVLLVCSSAVAQ---LHKDPTLDHHWHLWKKTYGKQYKEKNEEAVRRLIWEKNLK 57

Query: 72  YIEKANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNV 128
           ++   N E   G  +Y LG N   D+T+EE  +L +     VPS   Q  R  T+K    
Sbjct: 58  FVMLHNLEHSMGMHSYDLGMNHLGDMTSEEVMSLMSSLR--VPS---QWQRNITYKSNPN 112

Query: 129 TDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST 188
             +P S+DWREKG VT +K QG CG+CWAFSAV A+E   ++  GKL+ LS Q LVDCST
Sbjct: 113 QILPDSVDWREKGCVTEVKYQGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCST 172

Query: 189 D---NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLP 245
           +   N GC+GG M  AF+YII+NKG+ ++A YPY+  +  C     K  AAT SKY +LP
Sbjct: 173 EKYGNKGCNGGFMTTAFQYIIDNKGIDSDASYPYKATDQKCQ-YDSKYRAATCSKYTELP 231

Query: 246 KGDEQALLQAVSNQ-PVSVCVDASGRAFHFYKSGV-LNADCGNNCDHGVAVVGFGTAEEE 303
            G E  L +AV+N+ PVSV VDA   +F  Y+SGV     C  N +HGV VVG+G   + 
Sbjct: 232 YGREDVLKEAVANKGPVSVGVDALHPSFFLYRSGVYYEPSCTQNVNHGVLVVGYG---DL 288

Query: 304 NGAKYWLIKNSWGETWGESGYIRILRDAG-LCGIATAASYP 343
           NG +YWL+KNSWG  +GE GYIR+ R+ G  CGIA+  SYP
Sbjct: 289 NGKEYWLVKNSWGHNFGEEGYIRMARNKGNHCGIASFPSYP 329


>gi|23110962|ref|NP_004070.3| cathepsin S isoform 1 preproprotein [Homo sapiens]
 gi|88984046|sp|P25774.3|CATS_HUMAN RecName: Full=Cathepsin S; Flags: Precursor
 gi|60816153|gb|AAX36372.1| cathepsin S [synthetic construct]
 gi|61358282|gb|AAX41541.1| cathepsin S [synthetic construct]
 gi|119573903|gb|EAW53518.1| cathepsin S, isoform CRA_b [Homo sapiens]
 gi|119573904|gb|EAW53519.1| cathepsin S, isoform CRA_b [Homo sapiens]
          Length = 331

 Score =  267 bits (682), Expect = 6e-69,   Method: Compositional matrix adjust.
 Identities = 151/341 (44%), Positives = 211/341 (61%), Gaps = 22/341 (6%)

Query: 13  MFVIIILVITCASQVVSGRSMHEPSIVEKH-EQWMAQHGRTYKDELEKAMRLNIFKQNLE 71
           M  ++ +++ C+S V     +H+   ++ H   W   +G+ YK++ E+A+R  I+++NL+
Sbjct: 1   MKRLVCVLLVCSSAVAQ---LHKDPTLDHHWHLWKKTYGKQYKEKNEEAVRRLIWEKNLK 57

Query: 72  YIEKANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNV 128
           ++   N E   G  +Y LG N   D+T+EE  +L +     VPS   Q  R  T+K    
Sbjct: 58  FVMLHNLEHSMGMHSYDLGMNHLGDMTSEEVMSLMSSLR--VPS---QWQRNITYKSNPN 112

Query: 129 TDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST 188
             +P S+DWREKG VT +K QG CG+CWAFSAV A+E   ++  GKL+ LS Q LVDCST
Sbjct: 113 RILPDSVDWREKGCVTEVKYQGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCST 172

Query: 189 D---NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLP 245
           +   N GC+GG M  AF+YII+NKG+ ++A YPY+  +  C     K  AAT SKY +LP
Sbjct: 173 EKYGNKGCNGGFMTTAFQYIIDNKGIDSDASYPYKAMDQKCQ-YDSKYRAATCSKYTELP 231

Query: 246 KGDEQALLQAVSNQ-PVSVCVDASGRAFHFYKSGV-LNADCGNNCDHGVAVVGFGTAEEE 303
            G E  L +AV+N+ PVSV VDA   +F  Y+SGV     C  N +HGV VVG+G   + 
Sbjct: 232 YGREDVLKEAVANKGPVSVGVDARHPSFFLYRSGVYYEPSCTQNVNHGVLVVGYG---DL 288

Query: 304 NGAKYWLIKNSWGETWGESGYIRILRDAG-LCGIATAASYP 343
           NG +YWL+KNSWG  +GE GYIR+ R+ G  CGIA+  SYP
Sbjct: 289 NGKEYWLVKNSWGHNFGEEGYIRMARNKGNHCGIASFPSYP 329


>gi|395819351|ref|XP_003783057.1| PREDICTED: cathepsin L1-like [Otolemur garnettii]
          Length = 333

 Score =  267 bits (682), Expect = 7e-69,   Method: Compositional matrix adjust.
 Identities = 151/342 (44%), Positives = 210/342 (61%), Gaps = 20/342 (5%)

Query: 13  MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEY 72
           M +++IL   C   + S  SM + S+     +W A+H + Y    E+  R  ++++N++ 
Sbjct: 1   MNLLLILAAFCVG-ITSATSMFDGSLNAHWYRWKAKHRKLYGMR-EEGWRRAVWEKNMKM 58

Query: 73  IEKANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
           IE  N+E   G   + +  N F D+TNEEFR +  G+       +++  +   F+  +  
Sbjct: 59  IEVHNQEYSQGKHGFTMAMNAFGDMTNEEFRQVMNGFR------NQKHKKGKVFQEPSFL 112

Query: 130 DVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCS-- 187
           +VP S+DWREKG VT +K+QGQCGSCWAFSA  A+EG      GKLI LSEQ LVDCS  
Sbjct: 113 EVPKSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLISLSEQNLVDCSRP 172

Query: 188 TDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKG 247
             N GC GGLMD AF+YI EN GL +E  YPY   + +C  + E +VA   + + D+PK 
Sbjct: 173 QGNEGCDGGLMDYAFQYIKENGGLDSEESYPYDAMDESCKYRPEYSVAND-TGFVDIPK- 230

Query: 248 DEQALLQAVSN-QPVSVCVDASGRAFHFYKSGV-LNADC-GNNCDHGVAVVGFGTAE-EE 303
           +E+AL++AV+   P+SV +DA   +F FYK GV    +C  +N DHGV VVG+G  E E 
Sbjct: 231 EEKALMKAVATVGPISVAIDAGHESFQFYKEGVYFEPECSSDNVDHGVLVVGYGYEETES 290

Query: 304 NGAKYWLIKNSWGETWGESGYIRILRD-AGLCGIATAASYPV 344
           +  K+WL+KNSWGE WG  GYI++ +D    CGIATAASYP 
Sbjct: 291 DNNKFWLVKNSWGEEWGLGGYIKMTKDQKNHCGIATAASYPT 332


>gi|449275508|gb|EMC84350.1| Cathepsin L1, partial [Columba livia]
          Length = 319

 Score =  266 bits (681), Expect = 7e-69,   Method: Compositional matrix adjust.
 Identities = 146/320 (45%), Positives = 202/320 (63%), Gaps = 14/320 (4%)

Query: 35  EPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKE---GNRTYKLGTNEF 91
           +P +    + W + H + Y  E E++ R  ++++NL+ IE  N +   G  +YKLG N+F
Sbjct: 3   DPELDGHWQLWKSWHNKDYH-EREESWRRVVWEKNLKMIELHNLDHTLGKHSYKLGMNQF 61

Query: 92  SDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQ 151
            D+T EEFR L  GY       S +  R S F   +  + P S+DWREKG VT +KDQGQ
Sbjct: 62  GDMTTEEFRQLMNGY---AHKKSERKYRGSQFLEPSFLEAPRSVDWREKGYVTPVKDQGQ 118

Query: 152 CGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD--NHGCSGGLMDKAFEYIIENK 209
           CGSCWAFS   A+EG      GKL+ LSEQ LVDCS    N GC+GGLMD+AF+Y+ +N 
Sbjct: 119 CGSCWAFSTTGALEGQHFRKTGKLVSLSEQNLVDCSRPEGNQGCNGGLMDQAFQYVQDNG 178

Query: 210 GLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSN-QPVSVCVDAS 268
           G+ +E  YPY  ++      K +  AA  + + D+P+G E+AL++AV+   PVSV +DA 
Sbjct: 179 GIDSEESYPYTAKDDEDCRYKAEYNAANDTGFVDIPQGHERALMKAVAAVGPVSVAIDAG 238

Query: 269 GRAFHFYKSGV-LNADCGN-NCDHGVAVVGFG-TAEEENGAKYWLIKNSWGETWGESGYI 325
             +F FY+SG+    DC + + DHGV VVG+G   E+ +G KYW++KNSWGE WG+ GYI
Sbjct: 239 HSSFQFYQSGIYYEPDCSSEDLDHGVLVVGYGFEGEDVDGKKYWIVKNSWGEKWGDKGYI 298

Query: 326 RILRD-AGLCGIATAASYPV 344
            + +D    CGIATAASYP+
Sbjct: 299 YMAKDRKNHCGIATAASYPL 318


>gi|94448674|emb|CAI91575.1| cathepsin L2 [Lubomirskia baicalensis]
          Length = 324

 Score =  266 bits (681), Expect = 7e-69,   Method: Compositional matrix adjust.
 Identities = 150/312 (48%), Positives = 195/312 (62%), Gaps = 25/312 (8%)

Query: 45  WMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKE-GNRTYKLGTNEFSDLTNEEFRALY 103
           W A+HG++Y++  E+ +R   ++ N +YI++ N+  G   Y L  N+F DL N EF++LY
Sbjct: 25  WKAEHGKSYRNHKEEMLRHVTWQANKKYIDEHNQHAGVFGYTLKMNQFGDLENSEFKSLY 84

Query: 104 TGYNRPVPSVSRQSSRPSTFK----YQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFS 159
            GY        R S+ P   K       V D+P S+DW +KG VT +K+QGQCGSCW+FS
Sbjct: 85  NGY--------RMSNAPRKGKPFVPAARVQDLPASVDWSKKGWVTPVKNQGQCGSCWSFS 136

Query: 160 AVAAVEGITQITRGKLIELSEQQLVDCST--DNHGCSGGLMDKAFEYIIENKGLATEADY 217
           A  ++EG      G L+ LSEQ LVDCS    NHGC+GGLMD AFEY+I+N G+ TEA Y
Sbjct: 137 ATGSMEGQHFNATGTLMSLSEQNLVDCSAAEGNHGCNGGLMDDAFEYVIKNNGIDTEASY 196

Query: 218 PYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSN-QPVSVCVDASGRAFHFYK 276
           PYR  + TC       V ATIS Y D+ K  E  L  AV+   PVSV +DAS  +F FY 
Sbjct: 197 PYRAVDSTCKFNTAD-VGATISGYVDVTKDSESDLQVAVATIGPVSVAIDASHISFQFYS 255

Query: 277 SGVLNAD--CGNNCDHGVAVVGFGTAEEENGAK-YWLIKNSWGETWGESGYIRILRDA-G 332
           SGV +       N DHGV  VG+GT    +G+K YWL+KNSWG +WG SGYI ++R+   
Sbjct: 256 SGVYDPLICSSTNLDHGVLAVGYGT----DGSKDYWLVKNSWGASWGMSGYIEMVRNHNN 311

Query: 333 LCGIATAASYPV 344
            CGIAT+ASYPV
Sbjct: 312 KCGIATSASYPV 323


>gi|195153545|ref|XP_002017686.1| GL17172 [Drosophila persimilis]
 gi|194113482|gb|EDW35525.1| GL17172 [Drosophila persimilis]
          Length = 341

 Score =  266 bits (681), Expect = 7e-69,   Method: Compositional matrix adjust.
 Identities = 144/343 (41%), Positives = 207/343 (60%), Gaps = 19/343 (5%)

Query: 14  FVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYI 73
            ++ +L +   +Q VS    +   I E+   +  +H + Y+DE E+  RL IF +N   I
Sbjct: 5   LILPLLALVAVAQAVS----YAEVIQEEWHTFKLEHRKNYQDETEERFRLKIFNENKHKI 60

Query: 74  EKANK---EGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSR---QSSRPSTFKYQN 127
            K N+    G  ++K+  N+++D+ + EF +   G+N  +    R   +S +  TF    
Sbjct: 61  AKHNQLWATGAVSFKMAVNKYADMLHHEFYSTMNGFNYTLHKQLRNADESFKGVTFISPE 120

Query: 128 VTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCS 187
              +P  +DWR KGAVT +KDQG CGSCWAFS+  A+EG      G L+ LSEQ LVDCS
Sbjct: 121 HVTLPKQVDWRTKGAVTDVKDQGHCGSCWAFSSTGALEGQHYRKSGVLVSLSEQNLVDCS 180

Query: 188 TD--NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLP 245
           T   N+GC+GGLMD AF YI +N G+ TE  YPY   + +C   K  ++ AT   + D+P
Sbjct: 181 TKYGNNGCNGGLMDNAFRYIKDNGGIDTEKSYPYEAIDDSCHFNK-GSIGATDRGFVDIP 239

Query: 246 KGDEQALLQAVSN-QPVSVCVDASGRAFHFYKSGVLNADC--GNNCDHGVAVVGFGTAEE 302
           +G+E+ + +AV+   PV+V +DAS  +F FY  GV N       N DHGV VVGFGT  +
Sbjct: 240 QGNEKKMAEAVATIGPVAVAIDASHESFQFYSEGVYNEPACDAQNLDHGVLVVGFGT--D 297

Query: 303 ENGAKYWLIKNSWGETWGESGYIRILRDA-GLCGIATAASYPV 344
           E+G  YWL+KNSWG TWG+ G+I++LR+    CGIA+A+SYP+
Sbjct: 298 ESGEDYWLVKNSWGTTWGDKGFIKMLRNKENQCGIASASSYPL 340


>gi|179957|gb|AAC37592.1| cathepsin S [Homo sapiens]
          Length = 331

 Score =  266 bits (681), Expect = 8e-69,   Method: Compositional matrix adjust.
 Identities = 151/341 (44%), Positives = 211/341 (61%), Gaps = 22/341 (6%)

Query: 13  MFVIIILVITCASQVVSGRSMHEPSIVEKH-EQWMAQHGRTYKDELEKAMRLNIFKQNLE 71
           M  ++ +++ C+S V     +H+   ++ H   W   +G+ YK++ E+A+R  I+++NL+
Sbjct: 1   MKRLVCVLLVCSSAVAQ---LHKDPTLDHHWHLWKKTYGKQYKEKNEEAVRRLIWEKNLK 57

Query: 72  YIEKANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNV 128
           ++   N E   G  +Y LG N   D+T+EE  +L +     VPS   Q  R  T+K    
Sbjct: 58  FVMLHNLEHSMGMHSYDLGMNHLGDMTSEEVMSLMSSLR--VPS---QWQRNITYKSNPN 112

Query: 129 TDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST 188
             +P S+DWREKG VT +K QG CG+CWAFSAV A+E   ++  GKL+ LS Q LVDCST
Sbjct: 113 RILPDSVDWREKGCVTEVKYQGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCST 172

Query: 189 D---NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLP 245
           +   N GC+GG M  AF+YII+NKG+ ++A YPY+  +  C     K  AAT SKY +LP
Sbjct: 173 EKYGNKGCNGGFMTTAFQYIIDNKGIDSDASYPYKAMDLKCQ-YDSKYRAATCSKYTELP 231

Query: 246 KGDEQALLQAVSNQ-PVSVCVDASGRAFHFYKSGV-LNADCGNNCDHGVAVVGFGTAEEE 303
            G E  L +AV+N+ PVSV VDA   +F  Y+SGV     C  N +HGV VVG+G   + 
Sbjct: 232 YGREDVLKEAVANKGPVSVGVDARHPSFFLYRSGVYYEPSCTQNVNHGVLVVGYG---DL 288

Query: 304 NGAKYWLIKNSWGETWGESGYIRILRDAG-LCGIATAASYP 343
           NG +YWL+KNSWG  +GE GYIR+ R+ G  CGIA+  SYP
Sbjct: 289 NGKEYWLVKNSWGHNFGEEGYIRMARNKGNHCGIASFPSYP 329


>gi|297830594|ref|XP_002883179.1| hypothetical protein ARALYDRAFT_318695 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297329019|gb|EFH59438.1| hypothetical protein ARALYDRAFT_318695 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 308

 Score =  266 bits (681), Expect = 8e-69,   Method: Compositional matrix adjust.
 Identities = 137/310 (44%), Positives = 195/310 (62%), Gaps = 26/310 (8%)

Query: 42  HEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRA 101
           +E+W+ ++ + Y    EK  R  IFK+NL++I++ N   N+T+++G   F+DLTN+E   
Sbjct: 2   YERWLVENRKNYNGLGEKERRCKIFKENLKFIDEHNSLPNQTFEVGLTRFADLTNDE--- 58

Query: 102 LYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAV 161
                    P    ++ R   + Y+    +P  IDWR KGAV  +KDQG CGSCWAFSAV
Sbjct: 59  ---------PKDFMKADR---YLYKEGDILPDEIDWRAKGAVVPVKDQGNCGSCWAFSAV 106

Query: 162 AAVEGITQITRGKLIELSEQQLVDCSTD--NHGCSGGLMDKAFEYIIENKGLATEADYPY 219
            AVEGI QI  G+LI LS+Q+L+DC     N GC GG+M+ AFE+II N G+ ++ DYPY
Sbjct: 107 GAVEGINQIKTGELISLSDQELIDCDRGFVNAGCEGGVMNYAFEFIINNGGIESDQDYPY 166

Query: 220 RHEE-GTCD-NQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKS 277
              + G C+ ++K       I  YE + + DE++L +AV++QPV V ++AS +AF  YKS
Sbjct: 167 TATDLGVCNADKKNNTRVVKIDGYEYVAQNDEKSLKKAVAHQPVGVAIEASSQAFKLYKS 226

Query: 278 GVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRDA----GL 333
           GV    CG   DHGV VVG+GT+   +G  YW+I+NSWG  WGE+GY+++ R+     G 
Sbjct: 227 GVFTGTCGIYLDHGVVVVGYGTS---SGEDYWIIRNSWGLNWGENGYVKLQRNIDDSFGK 283

Query: 334 CGIATAASYP 343
           CG+A   SYP
Sbjct: 284 CGVAMMPSYP 293


>gi|244539471|dbj|BAH82657.1| cysteine protease [Lotus japonicus]
          Length = 286

 Score =  266 bits (681), Expect = 8e-69,   Method: Compositional matrix adjust.
 Identities = 133/249 (53%), Positives = 176/249 (70%), Gaps = 6/249 (2%)

Query: 38  IVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNE 97
           ++E  E WM++HG+ Y+   EK +R  IFK NL++I++ NK  +  Y LG NEF+DL++ 
Sbjct: 4   LIELFESWMSRHGKIYESIEEKLLRFEIFKDNLKHIDETNKVVS-NYWLGLNEFADLSHH 62

Query: 98  EFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWA 157
           EF+  Y G      S  R+SS    F Y++V D+P S+DWR+KGAVT+IK+QG CGSCWA
Sbjct: 63  EFKKQYLGLKVDF-STRRESSEE--FTYRDV-DLPKSVDWRKKGAVTNIKNQGSCGSCWA 118

Query: 158 FSAVAAVEGITQITRGKLIELSEQQLVDCS-TDNHGCSGGLMDKAFEYIIENKGLATEAD 216
           FS VAAVEGI QI  G L  LSEQ+L+DC  T N GC+GGLMD AF +I+EN GL  E D
Sbjct: 119 FSTVAAVEGINQIVTGNLTSLSEQELIDCDRTYNSGCNGGLMDYAFSFIVENGGLHKEDD 178

Query: 217 YPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYK 276
           YPY  EEGTC+  KE++   TIS Y D+P+ +EQ+LL+A++NQP+SV ++ASGR F FY 
Sbjct: 179 YPYIMEEGTCEMSKEESQVVTISGYHDVPQNNEQSLLKALANQPLSVAIEASGRDFQFYS 238

Query: 277 SGVLNADCG 285
            GV +  CG
Sbjct: 239 GGVFDGHCG 247


>gi|224079085|ref|XP_002305743.1| predicted protein [Populus trichocarpa]
 gi|222848707|gb|EEE86254.1| predicted protein [Populus trichocarpa]
          Length = 494

 Score =  266 bits (681), Expect = 8e-69,   Method: Compositional matrix adjust.
 Identities = 140/318 (44%), Positives = 194/318 (61%), Gaps = 12/318 (3%)

Query: 35  EPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYI-EKANKEGNRTYKLGTNEFSD 93
           + SI+E  +QW  +H + YK   E   R   FK+NL+YI EK  KE    +++G N+F+D
Sbjct: 36  DESIIEIFQQWRDRHQKAYKHAEEAEKRFGNFKRNLKYIIEKTGKETTLRHRVGLNKFAD 95

Query: 94  LTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCG 153
           L+NEEF+ LY    +   + +R  +   + +     D P+S+DWR+KG VT +KDQG CG
Sbjct: 96  LSNEEFKQLYLSKVKKPINKTRIDAEDRSRRNLQSCDAPSSLDWRKKGVVTAVKDQGDCG 155

Query: 154 SCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTDNHGCSGGLMDKAFEYIIENKGLAT 213
           SCW+FS   A+EGI  I    LI LSEQ+LVDC T N+GC GG MD AFE++I N G+ T
Sbjct: 156 SCWSFSTTGAIEGINAIVTSDLISLSEQELVDCDTTNYGCEGGYMDYAFEWVINNGGIDT 215

Query: 214 EADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFH 273
           EA+YPY   +GTC+  KE+    +I  Y+D+ + D  ALL A + QP+SV +D S   F 
Sbjct: 216 EANYPYTGVDGTCNTAKEEIKVVSIDGYKDVDETD-SALLCAAAQQPISVGIDGSAIDFQ 274

Query: 274 FYKSGVL---NADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD 330
            Y  G+     +D  ++ DH V +VG+G+   ENG  YW++KNSWG +WG  GY  I R+
Sbjct: 275 LYTGGIYDGDCSDDPDDIDHAVLIVGYGS---ENGEDYWIVKNSWGTSWGIEGYFYIKRN 331

Query: 331 A----GLCGIATAASYPV 344
                G+C I   ASYP 
Sbjct: 332 TDLPYGVCAINAMASYPT 349


>gi|159479072|ref|XP_001697622.1| cysteine endopeptidase [Chlamydomonas reinhardtii]
 gi|158274232|gb|EDP00016.1| cysteine endopeptidase [Chlamydomonas reinhardtii]
          Length = 469

 Score =  266 bits (681), Expect = 8e-69,   Method: Compositional matrix adjust.
 Identities = 137/311 (44%), Positives = 197/311 (63%), Gaps = 12/311 (3%)

Query: 43  EQWMAQHGRTY-KDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRA 101
           ++W   H R+Y  D  E   R  ++ +NLEY+   N     ++ L  N  +DL+  E+++
Sbjct: 14  KEWAQTHSRSYVNDVAEFENRFKVWLENLEYVLAYNAR-TTSHWLTLNHLADLSTPEYKS 72

Query: 102 LYTGYNRPVPSVSRQSSRPSTFKYQNVTD--VPTSIDWREKGAVTHIKDQGQCGSCWAFS 159
              G++     V+R   + + F+Y++V    +P +IDWR+K AV  +K+QGQCGSCWAF+
Sbjct: 73  KLLGFDNQA-RVARNKLK-TGFRYEDVDAEALPPAIDWRKKNAVAEVKNQGQCGSCWAFA 130

Query: 160 AVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGLATEADYP 218
              +VEGI  I  G L+ LSEQ+LVDC T+ + GCSGGLMD A+ +II+NKG+ TE DYP
Sbjct: 131 TTGSVEGINAIVTGSLVSLSEQELVDCDTEQDKGCSGGLMDYAYAWIIKNKGINTEEDYP 190

Query: 219 YRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSG 278
           Y   +G CD  K K    TI  YED+P+ DE AL +A ++QPV+V ++A  ++F  Y  G
Sbjct: 191 YTAMDGQCDVAKMKRRVVTIDSYEDVPENDEVALKKAAAHQPVAVAIEADAKSFQLYGGG 250

Query: 279 VL-NADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRDA----GL 333
           V  +  CG + +HGV VVG+G     +G+ YW++KNSWG  WG++GYIR+   +    GL
Sbjct: 251 VYDDPTCGTSLNHGVLVVGYGKDVTGSGSNYWIVKNSWGAEWGDAGYIRLKMGSTDAEGL 310

Query: 334 CGIATAASYPV 344
           CGIA A SYPV
Sbjct: 311 CGIAMAPSYPV 321


>gi|194701748|gb|ACF84958.1| unknown [Zea mays]
 gi|414589103|tpg|DAA39674.1| TPA: thiol protease SEN102 [Zea mays]
          Length = 374

 Score =  266 bits (681), Expect = 8e-69,   Method: Compositional matrix adjust.
 Identities = 143/338 (42%), Positives = 193/338 (57%), Gaps = 29/338 (8%)

Query: 32  SMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNR---TYKLGT 88
           S  + S++E+ ++W A + ++Y    E+  R  +  +N+ YIE  N E      TY+LG 
Sbjct: 40  STDDSSMIERFQRWKAAYNKSYATVAEERRRFRVCARNMAYIEATNAEAEAAGLTYELGE 99

Query: 89  NEFSDLTNEEFRALYTGYNRPVPS---------------VSRQSSRPSTFK-YQNV-TDV 131
             ++DLTN+EF A+YT    P P+               V      P     Y N+ T  
Sbjct: 100 TAYTDLTNQEFMAMYTA---PAPAQLPADESVITTRAGPVDAVGGAPGQLPVYVNLSTSA 156

Query: 132 PTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTDNH 191
           P S+DWR  GAVT +K+QG+CGSCWAFS VA VEGI QI  GKL+ LSEQ+LVDC T + 
Sbjct: 157 PASVDWRASGAVTPVKNQGRCGSCWAFSTVAVVEGIYQIRTGKLVSLSEQELVDCDTLDD 216

Query: 192 GCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQA 251
           GC GG+  +A  +I  N G+ TE DYPY      C+  K    A +I+    +    E +
Sbjct: 217 GCDGGISYRALRWIASNGGITTETDYPYTGTTDACNRAKLSHNAVSIAGLRRVATRSEAS 276

Query: 252 LLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLI 311
           L  AV+ QPV+V ++A G  F  YK GV N  CG N +HGV VVG+G  E   G +YW++
Sbjct: 277 LANAVAGQPVAVSIEAGGDNFQHYKKGVYNGPCGTNLNHGVTVVGYGQ-EAAGGDRYWIV 335

Query: 312 KNSWGETWGESGYIRILRDA-----GLCGIATAASYPV 344
           KNSWG+ WG+ GYIR+ +D      GLCGIA   SYP+
Sbjct: 336 KNSWGQGWGDDGYIRMKKDVAGKPEGLCGIAIRPSYPL 373


>gi|332220183|ref|XP_003259237.1| PREDICTED: cathepsin S isoform 1 [Nomascus leucogenys]
          Length = 331

 Score =  266 bits (681), Expect = 8e-69,   Method: Compositional matrix adjust.
 Identities = 151/341 (44%), Positives = 211/341 (61%), Gaps = 22/341 (6%)

Query: 13  MFVIIILVITCASQVVSGRSMHEPSIVEKH-EQWMAQHGRTYKDELEKAMRLNIFKQNLE 71
           M  ++ +++ C+S V     +H+   ++ H   W   +G+ YK++ E+A+R  I+++NL+
Sbjct: 1   MKWLVCVLLVCSSAVAQ---LHKDPTLDHHWHLWKKTYGKQYKEKNEEAVRRLIWEKNLK 57

Query: 72  YIEKANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNV 128
           ++   N E   G  +Y LG N   D+T+EE  +L +     VPS   Q  R  T+K    
Sbjct: 58  FVMLHNLEHSMGMHSYDLGMNHLGDMTSEEVMSLMSSLR--VPS---QWQRNITYKSNPN 112

Query: 129 TDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST 188
             +P S+DWREKG VT +K QG CG+CWAFSAV A+E   ++  GKL+ LS Q LVDCST
Sbjct: 113 QILPDSVDWREKGCVTEVKYQGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCST 172

Query: 189 D---NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLP 245
           +   N GC+GG M  AF+YII+NKG+ ++A YPY+  +  C     K  AAT SKY +LP
Sbjct: 173 EKYGNKGCNGGFMTTAFQYIIDNKGIDSDASYPYKAMDQKCQ-YDSKYRAATCSKYTELP 231

Query: 246 KGDEQALLQAVSNQ-PVSVCVDASGRAFHFYKSGV-LNADCGNNCDHGVAVVGFGTAEEE 303
              E  L +AV+N+ PVSV VDAS  +F  Y+SGV     C  N +HGV VVG+G   + 
Sbjct: 232 YSREDVLKEAVANKGPVSVGVDASHPSFFLYRSGVYYEPSCTQNVNHGVLVVGYG---DL 288

Query: 304 NGAKYWLIKNSWGETWGESGYIRILRDAG-LCGIATAASYP 343
           NG +YWL+KNSWG  +GE GYIR+ R+ G  CGIA+  SYP
Sbjct: 289 NGKEYWLVKNSWGRNFGEEGYIRMARNKGNHCGIASFPSYP 329


>gi|12803615|gb|AAH02642.1| Cathepsin S [Homo sapiens]
 gi|49456313|emb|CAG46477.1| CTSS [Homo sapiens]
 gi|60821573|gb|AAX36579.1| cathepsin S [synthetic construct]
 gi|189069420|dbj|BAG37086.1| unnamed protein product [Homo sapiens]
 gi|261858586|dbj|BAI45815.1| cathepsin S [synthetic construct]
          Length = 331

 Score =  266 bits (681), Expect = 9e-69,   Method: Compositional matrix adjust.
 Identities = 151/341 (44%), Positives = 211/341 (61%), Gaps = 22/341 (6%)

Query: 13  MFVIIILVITCASQVVSGRSMHEPSIVEKH-EQWMAQHGRTYKDELEKAMRLNIFKQNLE 71
           M  ++ +++ C+S V     +H+   ++ H   W   +G+ YK++ E+A+R  I+++NL+
Sbjct: 1   MKRLVCVLLVCSSAVAQ---LHKDPTLDHHWHLWKKTYGKQYKEKNEEAVRRLIWEKNLK 57

Query: 72  YIEKANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNV 128
           ++   N E   G  +Y LG N   D+T+EE  +L +     VPS   Q  R  T+K    
Sbjct: 58  FVMLHNLEHSMGMHSYDLGMNHLGDMTSEEVMSLMSSLR--VPS---QWQRNITYKSNPN 112

Query: 129 TDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST 188
             +P S+DWREKG VT +K QG CG+CWAFSAV A+E   ++  GKL+ LS Q LVDCST
Sbjct: 113 WILPDSVDWREKGCVTEVKYQGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCST 172

Query: 189 D---NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLP 245
           +   N GC+GG M  AF+YII+NKG+ ++A YPY+  +  C     K  AAT SKY +LP
Sbjct: 173 EKYGNKGCNGGFMTTAFQYIIDNKGIDSDASYPYKAMDQKCQ-YDSKYRAATCSKYTELP 231

Query: 246 KGDEQALLQAVSNQ-PVSVCVDASGRAFHFYKSGV-LNADCGNNCDHGVAVVGFGTAEEE 303
            G E  L +AV+N+ PVSV VDA   +F  Y+SGV     C  N +HGV VVG+G   + 
Sbjct: 232 YGREDVLKEAVANKGPVSVGVDARHPSFFLYRSGVYYEPSCTQNVNHGVLVVGYG---DL 288

Query: 304 NGAKYWLIKNSWGETWGESGYIRILRDAG-LCGIATAASYP 343
           NG +YWL+KNSWG  +GE GYIR+ R+ G  CGIA+  SYP
Sbjct: 289 NGKEYWLVKNSWGHNFGEEGYIRMARNKGNHCGIASFPSYP 329


>gi|156739275|ref|NP_001096585.1| cathepsin L1-like precursor [Danio rerio]
 gi|156230123|gb|AAI52285.1| MGC174857 protein [Danio rerio]
          Length = 335

 Score =  266 bits (681), Expect = 9e-69,   Method: Compositional matrix adjust.
 Identities = 144/340 (42%), Positives = 206/340 (60%), Gaps = 15/340 (4%)

Query: 14  FVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYI 73
            +  +LV  C S V +  S+ +  + +    W +QHG++Y ++LE   R+ I+++NL  I
Sbjct: 1   MMFALLVTLCISAVFTAPSI-DIQLDDHWNSWKSQHGKSYHEDLEVGRRM-IWEENLRKI 58

Query: 74  EKANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTD 130
           E+ N E   GN T+K+G N+F D+TNEEFR    GY       +R S  P  F   +   
Sbjct: 59  EQHNFEYSYGNHTFKMGMNQFGDMTNEEFRQAMNGYKH---DPNRTSQGP-LFMEPSFFA 114

Query: 131 VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD- 189
            P  +DWR++G VT +KDQ QCGSCW+FS+  A+EG      GKLI +SEQ LVDCS   
Sbjct: 115 APQQVDWRQRGYVTPVKDQKQCGSCWSFSSTGALEGQLFRKTGKLISMSEQNLVDCSRPQ 174

Query: 190 -NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGD 248
            N GC+GG+MD+AF+Y+ ENKGL +E  YPY   +        +   A I+ + D+P+G+
Sbjct: 175 GNQGCNGGIMDQAFQYVKENKGLDSEQSYPYLARDDLPCRYDPRFNVAKITGFVDIPRGN 234

Query: 249 EQALLQAVSN-QPVSVCVDASGRAFHFYKSGV-LNADCGNNCDHGVAVVGFG-TAEEENG 305
           E AL+ AV+   PVSV +DAS ++  FY+SG+     C +  DH V VVG+G    +  G
Sbjct: 235 ELALMNAVAAVGPVSVAIDASHQSLQFYQSGIYYERACTSRLDHAVLVVGYGYQGADVAG 294

Query: 306 AKYWLIKNSWGETWGESGYIRILRDA-GLCGIATAASYPV 344
            +YW++KNSW + WG+ GYI + +D    CGIAT ASYP+
Sbjct: 295 NRYWIVKNSWSDKWGDKGYIYMAKDKNNHCGIATMASYPL 334


>gi|179959|gb|AAA35655.1| cathepsin [Homo sapiens]
 gi|248406|gb|AAB22005.1| cathepsin S [Homo sapiens]
          Length = 331

 Score =  266 bits (681), Expect = 9e-69,   Method: Compositional matrix adjust.
 Identities = 151/341 (44%), Positives = 211/341 (61%), Gaps = 22/341 (6%)

Query: 13  MFVIIILVITCASQVVSGRSMHEPSIVEKH-EQWMAQHGRTYKDELEKAMRLNIFKQNLE 71
           M  ++ +++ C+S V     +H+   ++ H   W   +G+ YK++ E+A+R  I+++NL+
Sbjct: 1   MKRLVCVLLVCSSAVAQ---LHKDPTLDHHWHLWKKTYGKQYKEKNEEAVRRLIWEKNLK 57

Query: 72  YIEKANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNV 128
           ++   N E   G  +Y LG N   D+T+EE  +L +     VPS   Q  R  T+K    
Sbjct: 58  FVMLHNLEHSMGMHSYDLGMNHLGDMTSEEVMSLTSSLR--VPS---QWQRNITYKSNPN 112

Query: 129 TDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST 188
             +P S+DWREKG VT +K QG CG+CWAFSAV A+E   ++  GKL+ LS Q LVDCST
Sbjct: 113 RILPDSVDWREKGCVTEVKYQGSCGACWAFSAVGALEAQLKLKTGKLVTLSAQNLVDCST 172

Query: 189 D---NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLP 245
           +   N GC+GG M  AF+YII+NKG+ ++A YPY+  +  C     K  AAT SKY +LP
Sbjct: 173 EKYGNKGCNGGFMTTAFQYIIDNKGIDSDASYPYKAMDQKCQ-YDSKYRAATCSKYTELP 231

Query: 246 KGDEQALLQAVSNQ-PVSVCVDASGRAFHFYKSGV-LNADCGNNCDHGVAVVGFGTAEEE 303
            G E  L +AV+N+ PVSV VDA   +F  Y+SGV     C  N +HGV VVG+G   + 
Sbjct: 232 YGREDVLKEAVANKGPVSVGVDARHPSFFLYRSGVYYEPSCTQNVNHGVLVVGYG---DL 288

Query: 304 NGAKYWLIKNSWGETWGESGYIRILRDAG-LCGIATAASYP 343
           NG +YWL+KNSWG  +GE GYIR+ R+ G  CGIA+  SYP
Sbjct: 289 NGKEYWLVKNSWGHNFGEEGYIRMARNKGNHCGIASFPSYP 329


>gi|326672297|ref|XP_003199631.1| PREDICTED: cathepsin L1-like [Danio rerio]
          Length = 336

 Score =  266 bits (681), Expect = 9e-69,   Method: Compositional matrix adjust.
 Identities = 144/341 (42%), Positives = 206/341 (60%), Gaps = 16/341 (4%)

Query: 14  FVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYI 73
            +  +LV  C S V +  S+ +  + +    W +QHG++Y +++E   R+ I+++NL  I
Sbjct: 1   MMFALLVTLCISAVFAASSI-DIQLDDHWNSWKSQHGKSYHEDVEVGRRM-IWEENLRKI 58

Query: 74  EKANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTD 130
           E+ N E   GN T+K+G N+F D+TNEEFR    GY         Q+S+   F   +   
Sbjct: 59  EQHNFEYSYGNHTFKMGMNQFGDMTNEEFRHAMNGYKHD----PNQTSQGPLFMEPSFFA 114

Query: 131 VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD- 189
            P  +DWR++G VT +KDQ QCGSCW+FS+  A+EG      GKLI +SEQ LVDCS   
Sbjct: 115 APQQVDWRQRGYVTPVKDQKQCGSCWSFSSTGALEGQLFRKTGKLISMSEQNLVDCSRPH 174

Query: 190 -NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGD 248
            N GC+GGLMD+AF+Y+ ENKGL +E  YPY   +        +   A I+ + D+PKG+
Sbjct: 175 GNQGCNGGLMDQAFQYVKENKGLDSEQSYPYLARDDLPCRYDPRFNVAKITGFVDIPKGN 234

Query: 249 EQALLQAVSN-QPVSVCVDASGRAFHFYKSGVL--NADCGNNCDHGVAVVGFG-TAEEEN 304
           E AL+ AV+   PVSV +DAS ++  FY+SG+    A   +  DH V VVG+G    +  
Sbjct: 235 ELALMNAVAAVGPVSVAIDASHQSLQFYQSGIYYERACSSSRLDHAVLVVGYGYQGADVA 294

Query: 305 GAKYWLIKNSWGETWGESGYIRILRDA-GLCGIATAASYPV 344
           G +YW++KNSW + WG+ GYI + +D    CGIAT ASYP+
Sbjct: 295 GNRYWIVKNSWSDKWGDKGYIYMAKDKNNHCGIATMASYPL 335


>gi|125811033|ref|XP_001361727.1| GA25021 [Drosophila pseudoobscura pseudoobscura]
 gi|54636904|gb|EAL26307.1| GA25021 [Drosophila pseudoobscura pseudoobscura]
          Length = 341

 Score =  266 bits (680), Expect = 9e-69,   Method: Compositional matrix adjust.
 Identities = 144/343 (41%), Positives = 206/343 (60%), Gaps = 19/343 (5%)

Query: 14  FVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYI 73
            ++ +L +   +Q VS    +   I E+   +  +H + Y+DE E+  RL IF +N   I
Sbjct: 5   LILPLLALVAVAQAVS----YAEVIQEEWHTFKLEHRKNYQDETEERFRLKIFNENKHKI 60

Query: 74  EKANK---EGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSR---QSSRPSTFKYQN 127
            K N+    G  ++K+  N+++D+ + EF +   G+N  +    R   +S +  TF    
Sbjct: 61  AKHNQLWATGAVSFKMAVNKYADMLHHEFYSTMNGFNYTLHKQLRNADESFKGVTFISPE 120

Query: 128 VTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCS 187
              +P  +DWR KGAVT +KDQG CGSCWAFS+  A+EG      G L+ LSEQ LVDCS
Sbjct: 121 HVTLPKQVDWRTKGAVTDVKDQGHCGSCWAFSSTGALEGQHYRKSGVLVSLSEQNLVDCS 180

Query: 188 TD--NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLP 245
           T   N+GC+GGLMD AF YI +N G+ TE  YPY   + +C   K   + AT   + D+P
Sbjct: 181 TKYGNNGCNGGLMDNAFRYIKDNGGIDTEKSYPYEAIDDSCHFNK-GTIGATDRGFVDIP 239

Query: 246 KGDEQALLQAVSN-QPVSVCVDASGRAFHFYKSGVLNADC--GNNCDHGVAVVGFGTAEE 302
           +G+E+ + +AV+   PV+V +DAS  +F FY  GV N       N DHGV VVGFGT  +
Sbjct: 240 QGNEKKMAEAVATIGPVAVAIDASHESFQFYSEGVYNEPACDAQNLDHGVLVVGFGT--D 297

Query: 303 ENGAKYWLIKNSWGETWGESGYIRILRDA-GLCGIATAASYPV 344
           E+G  YWL+KNSWG TWG+ G+I++LR+    CGIA+A+SYP+
Sbjct: 298 ESGQDYWLVKNSWGTTWGDKGFIKMLRNKENQCGIASASSYPL 340


>gi|151573014|gb|ABS17682.1| cathepsin L-1 [Artemia salina]
          Length = 334

 Score =  266 bits (680), Expect = 1e-68,   Method: Compositional matrix adjust.
 Identities = 147/339 (43%), Positives = 205/339 (60%), Gaps = 15/339 (4%)

Query: 15  VIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIE 74
            +I L+     Q+ +  S+      E H  + A H + Y  +LE+  R+ I+ +N   + 
Sbjct: 1   TLIFLLGAVLVQLSAALSLTNLLADEWH-LFKATHKKEYPSQLEEKFRMKIYLENKHKVA 59

Query: 75  KAN---KEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDV 131
           K N   ++G ++Y +  N+F DL + EFR++  GY     + SR  S  +  +  NVT V
Sbjct: 60  KHNILYEKGEKSYHVAMNKFGDLLHHEFRSIMNGYQHKKQNSSRAESTFTFMEPANVT-V 118

Query: 132 PTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD-- 189
           P S+DWREKGA+T +KDQGQCGSCWAFS+  A+EG T    GKL+ LSEQ L+DCS    
Sbjct: 119 PESVDWREKGAITPVKDQGQCGSCWAFSSTGALEGQTFRKTGKLVSLSEQNLIDCSGKYG 178

Query: 190 NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDE 249
           N GC+GGLMD+AF+YI +NKG+ TE  YPY  E+  C     +   A    + D+P G+E
Sbjct: 179 NEGCNGGLMDQAFQYIKDNKGIDTENTYPYEAEDDVC-RYNPRNRGAVDRGFVDIPSGEE 237

Query: 250 QALLQAVSN-QPVSVCVDASGRAFHFYKSGV-LNADC-GNNCDHGVAVVGFGTAEEENGA 306
             L  AV+   PVSV +DAS  +F FY  GV     C  ++ DHGV VVG+G+   +NG 
Sbjct: 238 DKLKAAVATVGPVSVAIDASHESFQFYSKGVYYEPSCDSDDLDHGVLVVGYGS---DNGK 294

Query: 307 KYWLIKNSWGETWGESGYIRILRD-AGLCGIATAASYPV 344
            YWL+KNSW E WG+ GYI++ R+    CG+A+AASYP+
Sbjct: 295 DYWLVKNSWSEHWGDEGYIKMARNRKNHCGVASAASYPL 333


>gi|449500383|ref|XP_004161083.1| PREDICTED: vignain-like [Cucumis sativus]
          Length = 345

 Score =  266 bits (680), Expect = 1e-68,   Method: Compositional matrix adjust.
 Identities = 140/315 (44%), Positives = 202/315 (64%), Gaps = 10/315 (3%)

Query: 35  EPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDL 94
           E S+ + +E+W  +H    ++  EK  R ++FK+N+ ++   N+  ++ YKL  N+F+D+
Sbjct: 34  EESLWQLYERW-GKHHTISRNLKEKHKRFSVFKENVNHVFTVNQM-DKPYKLKLNKFADM 91

Query: 95  TNEEFRALYTGYN-RPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCG 153
           +N EF   Y   N      +  +      F Y+  TD+P+S+D RE+GAV  +K+QG+CG
Sbjct: 92  SNYEFVNFYARSNISHYRKLHERRRGAGGFMYEQDTDLPSSVDGRERGAVNAVKEQGRCG 151

Query: 154 SCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTDNHGCSGGLMDKAFEYIIENKGLAT 213
           SCWAFS+VAAVEGI +I   +L+ LSEQ+L+DC+  N GC+GG M+ AF++I  N G+AT
Sbjct: 152 SCWAFSSVAAVEGINKIKTNQLLSLSEQELLDCNYRNKGCNGGFMEIAFDFIKRNGGIAT 211

Query: 214 EADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFH 273
           E  YPY    G C + +  +    I  YE +P+ +E AL+QAV+NQPVSV +DA+GR F 
Sbjct: 212 ENSYPYHGSRGLCRSSRISSPIVKIDGYESVPE-NEDALMQAVANQPVSVAIDAAGRDFQ 270

Query: 274 FYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRDA-- 331
           FY  GV +  CG   +HGV  +G+GT E+  G  YWL++NSWG  WGE GY+R+ R    
Sbjct: 271 FYSQGVFDGYCGTELNHGVVAIGYGTTED--GTDYWLVRNSWGVGWGEDGYVRMKRGVEQ 328

Query: 332 --GLCGIATAASYPV 344
             GLCGIA  ASYP+
Sbjct: 329 AEGLCGIAMEASYPI 343


>gi|350412176|ref|XP_003489564.1| PREDICTED: cathepsin L-like [Bombus impatiens]
          Length = 343

 Score =  266 bits (680), Expect = 1e-68,   Method: Compositional matrix adjust.
 Identities = 148/348 (42%), Positives = 212/348 (60%), Gaps = 24/348 (6%)

Query: 11  IPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNL 70
           + +F+++I+ I   +Q +S   +    + ++   +  +H + YK+++E+  R+ IF  N 
Sbjct: 1   MKLFLLLIVAILATAQAISFFEL----VNQEWTTFKMEHNKVYKNDIEERFRMKIFMDNK 56

Query: 71  EYIEKANKEGNR-----TYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRP---ST 122
             I K N  GN      +YKL  N++ D+ + EF     G+N+ + +  R    P   S 
Sbjct: 57  HKIAKHN--GNYEMKKVSYKLKMNKYGDMLHHEFVNTLNGFNKSINTQLRSERLPIGASF 114

Query: 123 FKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQ 182
            +  NV  +P ++DWRE GAVT +KDQG CGSCW+FSA  A+EG      G LI LSEQ 
Sbjct: 115 IEPANVV-LPKTVDWREHGAVTPVKDQGHCGSCWSFSATGALEGQHFRRTGILIPLSEQN 173

Query: 183 LVDCSTD--NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISK 240
           L+DCS    N+GC+GGLMD+AF+YI +NKGL TE  YPY  E   C      + A  +  
Sbjct: 174 LIDCSGKYGNNGCNGGLMDQAFQYIKDNKGLDTEVTYPYEAENDKCRYNAANSGARDVG- 232

Query: 241 YEDLPKGDEQALLQAVSN-QPVSVCVDASGRAFHFYKSGV-LNADCGN-NCDHGVAVVGF 297
           Y D+P+G+E+ L  AV+   PVSV +DAS ++F FY  GV    +C + N DHGV  VG+
Sbjct: 233 YVDIPQGNEKKLKAAVATIGPVSVAIDASHQSFQFYSEGVYYEPECSSENLDHGVLAVGY 292

Query: 298 GTAEEENGAKYWLIKNSWGETWGESGYIRILRDA-GLCGIATAASYPV 344
           GT  +ENG  YWL+KNSWGETWG++GYI++ R+    CGIA+ ASYP+
Sbjct: 293 GT--DENGQDYWLVKNSWGETWGDNGYIKMARNKLNHCGIASTASYPL 338


>gi|296228726|ref|XP_002759933.1| PREDICTED: cathepsin S isoform 1 [Callithrix jacchus]
          Length = 330

 Score =  266 bits (680), Expect = 1e-68,   Method: Compositional matrix adjust.
 Identities = 149/339 (43%), Positives = 210/339 (61%), Gaps = 19/339 (5%)

Query: 13  MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEY 72
           M  ++ ++  C+S V   + + +P++      W   +G+ YK++ E+A+R  I+++NL++
Sbjct: 1   MKQLVCVLFVCSSAV--AQLLKDPTLDHHWNLWKKTYGKQYKEKNEEAVRRLIWEKNLKF 58

Query: 73  IEKANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
           +   N E   G  +Y LG N   D+T+EE  +L +     VPS   Q  R  T+K     
Sbjct: 59  VMLHNLEHSMGMHSYDLGMNHLGDMTSEEVMSLMSSLR--VPS---QWQRNITYKSNPNQ 113

Query: 130 DVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD 189
            +P S+DWREKG VT +K QG CG+CWAFSAV A+E   ++  GKL+ LS Q LVDCS  
Sbjct: 114 MLPDSVDWREKGCVTEVKYQGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSEK 173

Query: 190 --NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKG 247
             N GC+GG M +AF+YII+NKG+ +EA YPY+  +  C     K  AAT SKY +LP G
Sbjct: 174 YGNKGCNGGFMTEAFQYIIDNKGIDSEASYPYKAMDQKCQ-YDSKYRAATCSKYTELPYG 232

Query: 248 DEQALLQAVSNQ-PVSVCVDASGRAFHFYKSGV-LNADCGNNCDHGVAVVGFGTAEEENG 305
            E  L +AV+N+ PV V VDAS  +F  Y+SGV  +  C  N +HGV V+G+G   + NG
Sbjct: 233 REDVLKEAVANKGPVCVGVDASHSSFFLYRSGVYYDPACTQNVNHGVLVIGYG---DLNG 289

Query: 306 AKYWLIKNSWGETWGESGYIRILRDAG-LCGIATAASYP 343
            +YWL+KNSWG  +GE GYIR+ R+ G  CGIA+  SYP
Sbjct: 290 EEYWLVKNSWGSNFGERGYIRMARNKGNHCGIASYPSYP 328


>gi|449530091|ref|XP_004172030.1| PREDICTED: vignain-like [Cucumis sativus]
          Length = 351

 Score =  266 bits (680), Expect = 1e-68,   Method: Compositional matrix adjust.
 Identities = 135/355 (38%), Positives = 217/355 (61%), Gaps = 20/355 (5%)

Query: 2   VLKFEKSFIIPMFVIIILVITCASQVVSGRSMH-EPSIVEKHEQWMAQHGRTYKDELEKA 60
           V+KF    I+P+ +I  L   C S  +  +    E S+++ +++W + H R  ++  E  
Sbjct: 3   VMKF---LIVPLVLIAFLCNICESFELERKDFESEKSLMQLYKRWSSHH-RISRNANEMH 58

Query: 61  MRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTG---YNRPVPS--VSR 115
            R  +FK N +++ K N  G ++ KL  N+F+D++++EFR +Y+    Y + + +  +  
Sbjct: 59  NRFKVFKNNAKHVFKVNLMG-KSLKLKLNQFADMSDDEFRNMYSSNITYYKDLHAKKIEA 117

Query: 116 QSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKL 175
              R   F Y++  ++P+SIDWR+KGAV  IK+QG+CGSCWAF+AVAAVE I QI   +L
Sbjct: 118 TGGRIGGFMYEHANNIPSSIDWRKKGAVNAIKNQGRCGSCWAFAAVAAVESIHQIKTNEL 177

Query: 176 IELSEQQLVDCSTDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVA 235
           + LSE++++DC   + GC GG  + AFE++++N G+  E +YPY    G C  +  +   
Sbjct: 178 VSLSEEEVLDCDYRDGGCRGGFYNSAFEFMMDNDGVTIEDNYPYYEGNGYCRRRGGRNKR 237

Query: 236 ATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVL--NADCGNNCDHGVA 293
             I  YE++P+ +E AL++AV++QPV+V + + G  F FY  G+   N  CG N DH V 
Sbjct: 238 VRIDGYENVPRNNEYALMKAVAHQPVAVAIASGGSDFKFYGGGMFTENDFCGFNIDHTVV 297

Query: 294 VVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRDA----GLCGIATAASYPV 344
           VVG+GT E+     YW+I+N +G  WG +GY+++ R A    G+CG+A   +YPV
Sbjct: 298 VVGYGTDED---GDYWIIRNQYGHRWGMNGYMKMQRGAHSPQGVCGMAMQPAYPV 349


>gi|125604306|gb|EAZ43631.1| hypothetical protein OsJ_28254 [Oryza sativa Japonica Group]
          Length = 369

 Score =  266 bits (680), Expect = 1e-68,   Method: Compositional matrix adjust.
 Identities = 148/316 (46%), Positives = 184/316 (58%), Gaps = 25/316 (7%)

Query: 35  EPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDL 94
           E ++ E +E+W  QH R  +D  EKA R N+FK N+  I + N+  +  YKL  N F D+
Sbjct: 41  EEALWELYERWRGQH-RVARDLGEKARRFNVFKDNVRLIHEFNRR-DEPYKLRLNRFGDM 98

Query: 95  TNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGS 154
           T +E    Y             SSR S  +             R  GAV  +KDQGQCGS
Sbjct: 99  TADESAGAYA------------SSRVSHHRMFRGRGEKAQ---RLHGAVGAVKDQGQCGS 143

Query: 155 CWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD--NHGCSGGLMDKAFEYIIENKGLA 212
           CWAFS +AAVEGI  I    L  LSEQQLVDC T   N GC GGLMD AF+YI ++ G+A
Sbjct: 144 CWAFSTIAAVEGINAIRTSNLTALSEQQLVDCDTKTGNAGCDGGLMDNAFQYIAKHGGVA 203

Query: 213 TEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAF 272
             + YPYR  + +C +    + A TI  YED+P   E AL +AV+NQPVSV ++A G  F
Sbjct: 204 ASSAYPYRARQSSCKSSAASSPAVTIDGYEDVPANSESALKKAVANQPVSVAIEAGGSHF 263

Query: 273 HFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRDA- 331
            FY  GV    CG   DHGVA VG+GT  +  G KYW+++NSWG  WGE GYIR+ RD  
Sbjct: 264 QFYSEGVFAGKCGTELDHGVAAVGYGTTVD--GTKYWIVRNSWGADWGEKGYIRMKRDVS 321

Query: 332 ---GLCGIATAASYPV 344
              GLCGIA  ASYP+
Sbjct: 322 AKEGLCGIAMEASYPI 337


>gi|156739289|ref|NP_001096592.1| uncharacterized protein LOC569326 precursor [Danio rerio]
 gi|156230119|gb|AAI52283.1| Im:6910535 protein [Danio rerio]
          Length = 335

 Score =  266 bits (679), Expect = 1e-68,   Method: Compositional matrix adjust.
 Identities = 140/340 (41%), Positives = 208/340 (61%), Gaps = 15/340 (4%)

Query: 14  FVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYI 73
            +  +L+  C S V +  S+ +  + +    W +QHG++Y +++E   R+ I+++NL  I
Sbjct: 1   MMFALLITLCISAVFTAPSI-DIQLDDHWNSWKSQHGKSYHEDVEVGRRM-IWEENLRKI 58

Query: 74  EKANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTD 130
           E+ N E   GN T+K+G N+F D+TNEEFR    GY +       ++S+ + F   +   
Sbjct: 59  EQHNFEYSLGNHTFKMGMNQFGDMTNEEFRQAMNGYKQD----PNRTSKGALFMEPSFFA 114

Query: 131 VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD- 189
            P  +DWR++G VT +KDQ QCGSCW+FS+  A+EG      GKLI +SEQ LVDCS   
Sbjct: 115 APQQVDWRQRGYVTPVKDQKQCGSCWSFSSTGALEGQLFRKTGKLISMSEQNLVDCSRPQ 174

Query: 190 -NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGD 248
            N GC+GG+MD+AF+Y+ ENKGL +E  YPY   +        +   A I+ + D+P+G+
Sbjct: 175 GNQGCNGGIMDQAFQYVKENKGLDSEQSYPYLARDDLPCRYDPRFNVAKITGFVDIPRGN 234

Query: 249 EQALLQAVSN-QPVSVCVDASGRAFHFYKSGV-LNADCGNNCDHGVAVVGFG-TAEEENG 305
           E AL+ AV+   PVSV +DAS ++  FY+SG+     C +  DH V VVG+G    +  G
Sbjct: 235 ELALMNAVAAVGPVSVAIDASHQSLQFYQSGIYYERACTSRLDHAVLVVGYGYQGADVAG 294

Query: 306 AKYWLIKNSWGETWGESGYIRILRDA-GLCGIATAASYPV 344
            +YW++KNSW + WG+ GYI + +D    CGIAT ASYP+
Sbjct: 295 NRYWIVKNSWSDKWGDKGYIYMAKDKNNHCGIATMASYPL 334


>gi|225706370|gb|ACO09031.1| Cathepsin L precursor [Osmerus mordax]
          Length = 337

 Score =  266 bits (679), Expect = 1e-68,   Method: Compositional matrix adjust.
 Identities = 144/341 (42%), Positives = 205/341 (60%), Gaps = 17/341 (4%)

Query: 14  FVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYI 73
             +++LV+   + + + R   +    E  + W + H + Y+ E E+  R  ++++NL+ I
Sbjct: 3   LYLVVLVLCTGAALAAPR--FDAQFDEHWDLWKSWHSKNYQHEKEEGWRRMVWEKNLKKI 60

Query: 74  EKANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTD 130
           E  N E   G  +Y LG N F D+TNEEFR +  GY      + ++  + S F   N  +
Sbjct: 61  EMHNLEHSLGKHSYSLGMNHFGDMTNEEFRQVMNGY-----KLQQRKFKGSLFLEPNNME 115

Query: 131 VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD- 189
            P  +DWRE+G VT +KDQGQCGSCWAFS   A+EG       KL+ LSEQ LVDCS   
Sbjct: 116 APKQVDWREEGYVTPVKDQGQCGSCWAFSTTGAMEGQMFRKTQKLVSLSEQNLVDCSRPE 175

Query: 190 -NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGD 248
            N GC+GGLMD+AF+YI +N GL +E  YPY   +    N K +  AA  + + D+P G 
Sbjct: 176 GNEGCNGGLMDQAFQYIQDNSGLDSEEAYPYLGTDDQPCNYKAEFSAANDTGFMDIPSGK 235

Query: 249 EQALLQAVSN-QPVSVCVDASGRAFHFYKSGV-LNADCGN-NCDHGVAVVGFG-TAEEEN 304
           E AL++A+++  PVSV +DA   +F FY+SG+    +C +   DHGV  VG+G   E+ +
Sbjct: 236 EHALMKAIASVGPVSVAIDAGHESFQFYQSGIYYEKECSSEELDHGVLAVGYGFEGEDVD 295

Query: 305 GAKYWLIKNSWGETWGESGYIRILRD-AGLCGIATAASYPV 344
           G KYW++KNSW E WG+ GYI + +D    CGIATAASYP+
Sbjct: 296 GKKYWIVKNSWSEKWGDKGYILMAKDRKNHCGIATAASYPL 336


>gi|442754503|gb|JAA69411.1| Putative cathepsin l-like cysteine proteinase b [Ixodes ricinus]
          Length = 335

 Score =  266 bits (679), Expect = 1e-68,   Method: Compositional matrix adjust.
 Identities = 144/309 (46%), Positives = 197/309 (63%), Gaps = 17/309 (5%)

Query: 47  AQHGRTYKDELEKAMRLNIFKQNLEYIEKANKE---GNRTYKLGTNEFSDLTNEEFRALY 103
           A+HG++Y  E E+  RL I+ +N   I K N++   G   Y +  NEF D+ + EF +  
Sbjct: 32  AKHGKSYVSETEEVFRLKIYMENRHKIAKHNEKYARGEVPYSMAMNEFGDMLHHEFVSTR 91

Query: 104 TGYNRPVPSVSRQSSRPSTFKYQNVTD--VPTSIDWREKGAVTHIKDQGQCGSCWAFSAV 161
            G+ R      R+ S  +  + +N+ D  +P ++DWR KGAVT +K+QGQCGSCWAFSA 
Sbjct: 92  NGFKRNYKDQPREGS--TYLEPENIEDFSLPKTVDWRTKGAVTPVKNQGQCGSCWAFSAT 149

Query: 162 AAVEGITQITRGKLIELSEQQLVDCSTD--NHGCSGGLMDKAFEYIIENKGLATEADYPY 219
            ++EG      G ++ LSEQ LVDCSTD  N+GC GGLMD AF+YI  NKG+ TE  YPY
Sbjct: 150 GSLEGQHFRKSGSMVSLSEQNLVDCSTDFGNNGCEGGLMDNAFKYIRANKGIDTEKSYPY 209

Query: 220 RHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSN-QPVSVCVDASGRAFHFYKSG 278
              +GTC + K+  V AT S + D+ +G E  L +AV+   P+SV +DAS  +F FY  G
Sbjct: 210 NGTDGTC-HFKKSTVGATDSGFVDIKEGSETQLKKAVATVGPISVAIDASHESFQFYSDG 268

Query: 279 VLN-ADCGN-NCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD-AGLCG 335
           V +  +C + + DHGV VVG+GT    NG  YWL+KNSWG TWG+ GYIR+ R+    CG
Sbjct: 269 VYDEPECDSESLDHGVLVVGYGTL---NGTDYWLVKNSWGTTWGDEGYIRMSRNKKNQCG 325

Query: 336 IATAASYPV 344
           IA++ASYP+
Sbjct: 326 IASSASYPL 334


>gi|242048430|ref|XP_002461961.1| hypothetical protein SORBIDRAFT_02g011230 [Sorghum bicolor]
 gi|241925338|gb|EER98482.1| hypothetical protein SORBIDRAFT_02g011230 [Sorghum bicolor]
          Length = 380

 Score =  266 bits (679), Expect = 1e-68,   Method: Compositional matrix adjust.
 Identities = 143/333 (42%), Positives = 197/333 (59%), Gaps = 27/333 (8%)

Query: 38  IVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNR---TYKLGTNEFSDL 94
           ++E+ ++W A + ++Y    E   R  ++ +N+ YIE  N E      TY+LG   ++DL
Sbjct: 48  MIERFQRWKAAYNKSYATVAEDRRRFLVYARNMAYIEATNAEAEAAGLTYELGETAYTDL 107

Query: 95  TNEEFRALYTGYNRP--VPSVSRQ--------SSRPSTFK-------YQNV-TDVPTSID 136
           TN+EF A+YT    P  +P+   +        ++R            Y N+ T  P S+D
Sbjct: 108 TNQEFMAMYTAAPSPAQLPADEDEDDAAEAVITTRAGPVDAVGQLPVYVNLSTAAPASVD 167

Query: 137 WREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTDNHGCSGG 196
           WR  GAVT +K+QG+CGSCWAFS VA VEGI QI  GKL+ LSEQ+LVDC T + GC GG
Sbjct: 168 WRASGAVTPVKNQGRCGSCWAFSTVAVVEGIYQIRTGKLVSLSEQELVDCDTLDAGCDGG 227

Query: 197 LMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAV 256
           +  +A  +I  N GL TE DYPY      C+  K    AA+I+    +    E +L  AV
Sbjct: 228 ISYRALRWITSNGGLTTEEDYPYTGTTDACNRAKLAHNAASIAGLRRVATRSEASLANAV 287

Query: 257 SNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWG 316
           + QPV+V ++A G  F  YK GV N  CG + +HGV VVG+G  EEE+G KYW+IKNSWG
Sbjct: 288 AGQPVAVSIEAGGDNFQHYKRGVYNGPCGTSLNHGVTVVGYGQ-EEEDGDKYWIIKNSWG 346

Query: 317 ETWGESGYIRILRDA-----GLCGIATAASYPV 344
            +WG+ GYI++ +D      GLCGIA   S+P+
Sbjct: 347 ASWGDGGYIKMRKDVAGKPEGLCGIAIRPSFPL 379


>gi|157644745|gb|ABV59078.1| cathepsin L [Lates calcarifer]
          Length = 337

 Score =  265 bits (678), Expect = 2e-68,   Method: Compositional matrix adjust.
 Identities = 146/345 (42%), Positives = 211/345 (61%), Gaps = 19/345 (5%)

Query: 10  IIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQN 69
           ++P+ V+ +    C S  +S  S+ +P + +  + W + H + Y  E E+  R  ++++N
Sbjct: 1   MLPLAVLAV----CLSAALSAPSL-DPQLDDHWDLWKSWHSKKYH-EKEEGWRRMVWEKN 54

Query: 70  LEYIEKANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQ 126
           L+ IE  N E   G   Y+LG N F D+T+EEFR +  GY +     + +  + S F   
Sbjct: 55  LKKIELHNLEHSMGKHPYRLGMNHFGDMTHEEFRQIMNGYKQ---RKTERKFKGSLFMEP 111

Query: 127 NVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDC 186
           N  + P ++DWR+KG VT +KDQGQCGSCWAFS   A+EG      GKL+ LSEQ LVDC
Sbjct: 112 NFLEAPRALDWRDKGYVTPVKDQGQCGSCWAFSTTGALEGQQFRKTGKLVSLSEQNLVDC 171

Query: 187 STD--NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDL 244
           S    N GC+GGLMD+AF+Y+ +N+GL +E  YPY   +    +      +A  + + D+
Sbjct: 172 SRPEGNEGCNGGLMDQAFQYVKDNQGLDSEDSYPYLGTDDQPCHYDPNYNSANDTGFVDV 231

Query: 245 PKGDEQALLQAVSN-QPVSVCVDASGRAFHFYKSGV-LNADCGN-NCDHGVAVVGFG-TA 300
           P G E+AL++AV+   PVSV +DA   +F FY+SG+    DC +   DHGV VVG+G   
Sbjct: 232 PSGKERALMKAVAAVGPVSVAIDAGHESFQFYQSGIYYEKDCSSEELDHGVLVVGYGYEG 291

Query: 301 EEENGAKYWLIKNSWGETWGESGYIRILRD-AGLCGIATAASYPV 344
           E+ +G KYW++KNSW E WG+ GYI + +D    CGIATAASYP+
Sbjct: 292 EDVDGKKYWIVKNSWSEKWGDKGYIYMAKDRKNHCGIATAASYPL 336


>gi|157787177|ref|NP_001099150.1| cathepsin L1-like precursor [Danio rerio]
 gi|157422879|gb|AAI53505.1| MGC174152 protein [Danio rerio]
          Length = 336

 Score =  265 bits (678), Expect = 2e-68,   Method: Compositional matrix adjust.
 Identities = 143/341 (41%), Positives = 206/341 (60%), Gaps = 16/341 (4%)

Query: 14  FVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYI 73
            +  +LV  C S V +  S+ +  + +    W +QHG++Y +++E   R+ I+++NL  I
Sbjct: 1   MMFALLVTLCISAVFAASSI-DIQLDDHWNSWKSQHGKSYHEDVEVGRRM-IWEENLRKI 58

Query: 74  EKANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTD 130
           E+ N E   GN T+K+G N+F D+TNEEFR    GY         Q+S+   F   +   
Sbjct: 59  EQHNFEYSYGNHTFKMGMNQFGDMTNEEFRHAMNGYKHD----PNQTSQGPLFMEPSFFA 114

Query: 131 VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD- 189
            P  +DWR++G VT +KDQ QCGSCW+FS+  A+EG      GKLI +SEQ LVDCS   
Sbjct: 115 APQQVDWRQRGYVTPVKDQKQCGSCWSFSSTGALEGQLFRKTGKLISMSEQNLVDCSRPQ 174

Query: 190 -NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGD 248
            N GC+GGLMD+AF+Y+ ENKGL +E  YPY   +        +   A I+ + D+P+G+
Sbjct: 175 GNQGCNGGLMDQAFQYVKENKGLDSEQSYPYLARDDLPCRYDPRFNVAKITGFVDIPRGN 234

Query: 249 EQALLQAVSN-QPVSVCVDASGRAFHFYKSGVL--NADCGNNCDHGVAVVGFG-TAEEEN 304
           E AL+ AV+   PVSV +DAS ++  FY+SG+    A   +  DH V VVG+G    +  
Sbjct: 235 ELALMNAVAAVGPVSVAIDASHQSLQFYQSGIYYERACSSSRLDHAVLVVGYGYQGADVA 294

Query: 305 GAKYWLIKNSWGETWGESGYIRILRDA-GLCGIATAASYPV 344
           G +YW++KNSW + WG+ GYI + +D    CGIAT ASYP+
Sbjct: 295 GNRYWIVKNSWSDKWGDKGYIYMAKDKNNHCGIATMASYPL 335


>gi|162463334|ref|NP_001104878.1| maize insect resistance2 precursor [Zea mays]
 gi|2425064|gb|AAB88262.1| cysteine proteinase Mir2 [Zea mays]
          Length = 493

 Score =  265 bits (678), Expect = 2e-68,   Method: Compositional matrix adjust.
 Identities = 138/296 (46%), Positives = 185/296 (62%), Gaps = 21/296 (7%)

Query: 62  RLNIFKQNLEYIEKANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNR-----PVPSV 113
           RL +F+ NL YI+  N E   G   ++LG   F+DLT EE+RA     +R      V  V
Sbjct: 92  RLEVFRDNLRYIDAHNAEADAGLHGFRLGLTRFADLTLEEYRARLLLGSRGRNGTAVGVV 151

Query: 114 SRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRG 173
            R+   P   +      +P ++DWRE+GAV  +KDQGQCG CWAFSAVAAVEGI +I  G
Sbjct: 152 GRRRYLPLAGE-----QLPDAVDWRERGAVAEVKDQGQCGGCWAFSAVAAVEGINKIVTG 206

Query: 174 KLIELSEQQLVDCST-DNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEK 232
            LI LSEQ+L+DC    + GC GGLMD AF ++I+N G+ TEADYP+   +GTCD + + 
Sbjct: 207 SLISLSEQELIDCDKFQDQGCDGGLMDNAFVFMIKNGGIDTEADYPFTGHDGTCDLKLKN 266

Query: 233 AVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGV 292
               +I  +E +P   E+AL +AV++QPVS  ++AS RAF  Y SG+ +  CG   DHGV
Sbjct: 267 TRVVSIDSFERVPINYERALQKAVAHQPVSASIEASRRAFQLYSSGIFDGRCGTYLDHGV 326

Query: 293 AVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRDAGL----CGIATAASYPV 344
            VVG+G+   E G  YW++KNSWG  WGE+GY+R+ R+  +     GIA    YPV
Sbjct: 327 TVVGYGS---EGGKDYWIVKNSWGTQWGEAGYVRMARNVRVRPPSAGIAMEPLYPV 379


>gi|380236892|emb|CBK52289.1| cathepsin S protein [Dicentrarchus labrax]
          Length = 337

 Score =  265 bits (678), Expect = 2e-68,   Method: Compositional matrix adjust.
 Identities = 148/340 (43%), Positives = 200/340 (58%), Gaps = 20/340 (5%)

Query: 13  MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEY 72
           M   ++LV  C    V   +M EP +    + W   HG+ Y+ E+E   R  ++++NL  
Sbjct: 9   MLGSLMLVSLC----VGAAAMFEPKLDAHWKLWKMTHGKKYQTEVEDVSRRELWEKNLML 64

Query: 73  IEKANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
           I   N E   G  TY+L  N   DLT EE    +   + P   + R +S    F      
Sbjct: 65  ITMHNLEASMGLHTYELSMNHMGDLTQEEIMQSFATLSPPT-DIQRAAS---PFAGTTGA 120

Query: 130 DVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD 189
           DVP ++DWREKG VT +K QG CGSCWAFSA  A+EG    T GKL++LS Q LVDCST 
Sbjct: 121 DVPDTMDWREKGCVTSVKMQGSCGSCWAFSAAGALEGQLAKTTGKLVDLSPQNLVDCSTK 180

Query: 190 --NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKG 247
             NHGC+GGLM  AF+Y+I+N+G+ ++A YPY    G C     K  AA  S+Y  LP+G
Sbjct: 181 YGNHGCNGGLMHHAFQYVIDNQGIDSDASYPYTGRNGEC-RYNSKFRAANCSQYSFLPEG 239

Query: 248 DEQALLQAVSN-QPVSVCVDASGRAFHFYKSGVLN-ADCGNNCDHGVAVVGFGTAEEENG 305
           +E AL +A++N  P+SV +DA+   F FY+SGV N  +C    +HGV  VG+GT +   G
Sbjct: 240 NEGALKEALANIGPISVAIDATRPTFTFYRSGVYNDPNCSQKVNHGVLAVGYGTLD---G 296

Query: 306 AKYWLIKNSWGETWGESGYIRILRDAG-LCGIATAASYPV 344
             YWL+KNSWG+T+G+ GYIR+ R+    CGIA    YP+
Sbjct: 297 QDYWLVKNSWGKTFGDQGYIRMSRNKNDQCGIALYGCYPI 336


>gi|151176971|gb|ABR88030.1| digestive cysteine protease [Dermestes frischii]
          Length = 339

 Score =  265 bits (678), Expect = 2e-68,   Method: Compositional matrix adjust.
 Identities = 151/347 (43%), Positives = 205/347 (59%), Gaps = 27/347 (7%)

Query: 14  FVIIILVITCASQVVSGRSMHEPSIVEKHEQW---MAQHGRTYKDELEKAMRLNIFKQNL 70
           F ++ LV    +Q VS   + +       EQW     QH + YK + E+  R+ IF +N 
Sbjct: 3   FFVLALVFIVGAQAVSFFDLVQ-------EQWGTFKLQHKKQYKSDTEEKFRMKIFMENS 55

Query: 71  EYIEKANK---EGNRTYKLGTNEFSDLTNEEFRALYTGYNR----PVPSVSRQSSRPSTF 123
             + K NK    G  +YKL  N+++D+ + EF     G+NR    P+   S +  + +TF
Sbjct: 56  HKVAKXNKLYEMGLVSYKLKINKYADMLHHEFVHTVNGFNRTKNTPLLGTS-EDEQGATF 114

Query: 124 KYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQL 183
                   P ++DWRE GAVT +KDQG CGSCW+FSA  A+EG       KL+ LSEQ L
Sbjct: 115 IAPANVKFPENVDWREHGAVTXVKDQGHCGSCWSFSATGALEGQHFRKTNKLVSLSEQNL 174

Query: 184 VDCSTD--NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKY 241
           VDCST   N GC+GGLMD AF+Y+  N G+ TEA YPY  ++  C +   K   AT   +
Sbjct: 175 VDCSTKFGNDGCNGGLMDNAFKYVKYNHGIDTEASYPYHADDEKC-HYNPKTSGATDRGF 233

Query: 242 EDLPKGDEQALLQAVSN-QPVSVCVDASGRAFHFYKSGV-LNADCGN-NCDHGVAVVGFG 298
            D+P GDE+ L+ AV+   PVSV +DAS  +F  Y  GV  + +C +   DHGV VVG+G
Sbjct: 234 VDIPTGDEEKLMAAVATVGPVSVAIDASHESFQLYSEGVYYDPECSSEELDHGVLVVGYG 293

Query: 299 TAEEENGAKYWLIKNSWGETWGESGYIRILRDA-GLCGIATAASYPV 344
           T  +ENG  YW++KNSWGE+WGE GYI++ R+    CGIAT ASYP+
Sbjct: 294 T--DENGQDYWIVKNSWGESWGEQGYIKMARNRDNNCGIATQASYPL 338


>gi|161408097|dbj|BAF94152.1| cathepsin L-like cysteine protease 2 [Plautia stali]
          Length = 334

 Score =  265 bits (677), Expect = 2e-68,   Method: Compositional matrix adjust.
 Identities = 151/343 (44%), Positives = 210/343 (61%), Gaps = 29/343 (8%)

Query: 18  ILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKAN 77
           +LV++C   +    S  + S  ++   +   H + Y +ELE++ R  IF +N + IEK N
Sbjct: 4   LLVLSCLIALGQAVSFFDLS-ADEFTLFKKFHRKEYDNELEESYRKKIFLENKKRIEKHN 62

Query: 78  ---KEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTS 134
              K+G  ++KL  N  +D+   E+  +Y G+N+        SS+ +  K Q+ T +P +
Sbjct: 63  SRYKQGKVSFKLKLNHLADMLIHEYSDVYLGFNK--------SSKANNNKLQSYTFIPPA 114

Query: 135 -------IDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCS 187
                  +DWR KGAVT +K+QG CGSCWAFS   A+EG      GKL+ LSEQ LVDCS
Sbjct: 115 HVTLNKEVDWRTKGAVTPVKNQGHCGSCWAFSTTGALEGQNFRKTGKLVSLSEQNLVDCS 174

Query: 188 TD--NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLP 245
               N+GC GGLMD AF+YI EN G+ TE  YPY  E+ TC  +K  ++ AT S + D+ 
Sbjct: 175 GSYGNNGCEGGLMDNAFQYIKENHGIDTEKSYPYEGEDETCRFRK-TSIGATDSGFVDIT 233

Query: 246 KGDEQALLQAVSN-QPVSVCVDASGRAFHFYKSGV-LNADCGN-NCDHGVAVVGFGTAEE 302
           +GDE+AL+QAV+   P+SV +DAS ++F FY  GV    +C + N DHGV VVG+G    
Sbjct: 234 QGDEEALMQAVATIGPISVAIDASHQSFQFYSEGVYYEPECSSENLDHGVLVVGYGV--- 290

Query: 303 ENGAKYWLIKNSWGETWGESGYIRILRDA-GLCGIATAASYPV 344
           E+  KYWL+KNSWG  WG+ GYI++ RD    CGIAT ASYP+
Sbjct: 291 EDNQKYWLVKNSWGTQWGDGGYIKMARDQDNNCGIATQASYPL 333


>gi|156739281|ref|NP_001096588.1| cathepsin L1-like precursor [Danio rerio]
 gi|166158351|ref|NP_001107526.1| uncharacterized protein LOC100135391 precursor [Xenopus (Silurana)
           tropicalis]
 gi|326672305|ref|XP_003199634.1| PREDICTED: cathepsin L1-like [Danio rerio]
 gi|156230096|gb|AAI52237.1| MGC174155 protein [Danio rerio]
 gi|163916362|gb|AAI57707.1| LOC100135391 protein [Xenopus (Silurana) tropicalis]
          Length = 335

 Score =  265 bits (677), Expect = 2e-68,   Method: Compositional matrix adjust.
 Identities = 141/340 (41%), Positives = 206/340 (60%), Gaps = 15/340 (4%)

Query: 14  FVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYI 73
            +  +LV  C S V +  S+ +  + +    W +QHG++Y +++E   R+ I+++NL  I
Sbjct: 1   MMFALLVTLCISAVFAASSI-DIQLDDHWNSWKSQHGKSYHEDVEVGRRM-IWEENLRKI 58

Query: 74  EKANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTD 130
           E+ N E   GN T+K+G N+F D+TNEEFR    GY         ++S+   F   +   
Sbjct: 59  EQHNFEYSYGNHTFKMGMNQFGDMTNEEFRQAMNGYKHD----PNRTSQGPLFMEPSFFA 114

Query: 131 VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD- 189
            P  +DWR++G VT +KDQ QCGSCW+FS+  A+EG      GKLI +SEQ LVDCS   
Sbjct: 115 APQQVDWRQRGYVTPVKDQKQCGSCWSFSSTGALEGQLFRKTGKLISMSEQNLVDCSRPQ 174

Query: 190 -NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGD 248
            N GC+GG+MD+AF+Y+ ENKGL +E  YPY   +        +   A I+ + D+P+G+
Sbjct: 175 GNQGCNGGIMDQAFQYVKENKGLDSEQSYPYLARDDLPCRYDPRFNVAKITGFVDIPRGN 234

Query: 249 EQALLQAVSN-QPVSVCVDASGRAFHFYKSGV-LNADCGNNCDHGVAVVGFG-TAEEENG 305
           E AL+ AV+   PVSV +DAS ++  FY+SG+     C +  DH V VVG+G    +  G
Sbjct: 235 ELALMNAVAAVGPVSVAIDASHQSLQFYQSGIYYERACTSRLDHAVLVVGYGYQGADVAG 294

Query: 306 AKYWLIKNSWGETWGESGYIRILRDA-GLCGIATAASYPV 344
            +YW++KNSW + WG+ GYI + +D    CGIAT ASYP+
Sbjct: 295 NRYWIVKNSWSDKWGDKGYIYMAKDKNNHCGIATMASYPL 334


>gi|4469155|emb|CAB38315.1| chymopapain isoform III [Carica papaya]
          Length = 361

 Score =  265 bits (677), Expect = 2e-68,   Method: Compositional matrix adjust.
 Identities = 137/345 (39%), Positives = 201/345 (58%), Gaps = 14/345 (4%)

Query: 7   KSFIIPMFVIIILVITCASQVVSGRSMHEPSIVEK----HEQWMAQHGRTYKDELEKAMR 62
           K   +   +II + ++ A     G S  + + +E+     + WM +H + Y+   EK  R
Sbjct: 9   KIIFLATCLIIHMGLSSADFYTVGYSQDDLTSIERLIQLFDSWMLKHNKIYESIDEKIYR 68

Query: 63  LNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPST 122
             IF+ NL YI++ NK+ N +Y LG N F+DL+N+EF+  Y G+         +      
Sbjct: 69  FEIFRDNLMYIDETNKK-NNSYWLGLNGFADLSNDEFKKKYVGF-VAEDFTGLEHFDNED 126

Query: 123 FKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQ 182
           F Y++VT+ P SIDWR KGAVT +K+QG CGSCWAFS +A VEGI +I  G L+ELSEQ+
Sbjct: 127 FTYKHVTNYPQSIDWRAKGAVTPVKNQGACGSCWAFSTIATVEGINKIVTGNLLELSEQE 186

Query: 183 LVDCSTDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYE 242
           LVDC   ++GC GG    + +Y + N G+ T   YP + ++  C    +      I+ Y+
Sbjct: 187 LVDCDKHSYGCKGGYQTTSLQY-VANNGVHTSKVYPCQAKQYKCRATDKPGPKVKITGYK 245

Query: 243 DLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEE 302
            +P   E + L A++NQP+S  V+A G+ F  YKSGV +  CG   DH V  VG+GT++ 
Sbjct: 246 RVPSNCETSFLGALANQPLSFLVEAGGKPFQLYKSGVFDGPCGTKLDHAVTAVGYGTSDG 305

Query: 303 ENGAKYWLIKNSWGETWGESGYIRILRDA----GLCGIATAASYP 343
           +N   Y +IKNSWG  WGE GY+R+ R +    G CG+  ++ YP
Sbjct: 306 KN---YIIIKNSWGPNWGEKGYMRLKRQSGNSQGTCGVYKSSYYP 347


>gi|340371596|ref|XP_003384331.1| PREDICTED: digestive cysteine proteinase 2-like [Amphimedon
           queenslandica]
          Length = 327

 Score =  265 bits (677), Expect = 2e-68,   Method: Compositional matrix adjust.
 Identities = 140/334 (41%), Positives = 201/334 (60%), Gaps = 19/334 (5%)

Query: 14  FVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYI 73
           FV ++L+I   S  V+          E+   W  ++G+TY+   E  MR  I+ QN +Y+
Sbjct: 9   FVAVLLLIGLVSAAVND--------AEEWRLWKGKYGKTYRSIYEDNMRQKIWLQNRDYV 60

Query: 74  EKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPT 133
            + N   + +++L  NEF+DLT EEF ++Y GY +     +R++   +T        +P 
Sbjct: 61  NEHNSM-DSSFQLEVNEFADLTAEEFSSIYNGYGK---GRNRENHENTTIYRYTGGAIPD 116

Query: 134 SIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTDNHGC 193
           S+DWR KG VT +K+Q QCGSCWAFS   ++EG      GKL+ LSEQ LVDC   +HGC
Sbjct: 117 SVDWRTKGLVTPVKNQKQCGSCWAFSTTGSLEGAHAKKTGKLVSLSEQNLVDCDKKDHGC 176

Query: 194 SGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALL 253
            GGLM  AF+YI ENKG+ TE  YPY+ + G C+ +K+  + AT+ ++  +   D +AL 
Sbjct: 177 QGGLMTTAFKYIEENKGIDTEESYPYKAKNGRCEFKKDD-IGATVERHVSILTTDCEALK 235

Query: 254 QAVSN-QPVSVCVDASGRAFHFYKSGVLNAD--CGNNCDHGVAVVGFGTAEEENGAKYWL 310
           +AV+   P+SV +DAS  +F  YKSG+ +         DHGV VVG+G   +E+G +YWL
Sbjct: 236 KAVAEIGPISVAMDASHSSFQLYKSGIYDPKICSSRKLDHGVLVVGYG---KEDGEEYWL 292

Query: 311 IKNSWGETWGESGYIRILRDAGLCGIATAASYPV 344
           +KNSWG+ WG  GY +I     LCGI T+A YPV
Sbjct: 293 VKNSWGKNWGMEGYFKIASKKNLCGICTSACYPV 326


>gi|238816977|gb|ACR56863.1| cathepsin L-like cysteine proteinase [Delia coarctata]
          Length = 338

 Score =  265 bits (677), Expect = 2e-68,   Method: Compositional matrix adjust.
 Identities = 142/341 (41%), Positives = 210/341 (61%), Gaps = 18/341 (5%)

Query: 15  VIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIE 74
           V+ +L +    Q +S   +    I E+ + +  +H + Y  E+E+  R+ IF +N   I 
Sbjct: 4   VLALLALVAFVQAISITDV----IKEEWQTFKMEHRKNYLSEVEERFRMKIFNENRHKIA 59

Query: 75  KANK---EGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDV 131
           K N+   +G  ++KLG N+++D+ + EF+    GYN  +    R     +   Y +  +V
Sbjct: 60  KHNQLYAQGKVSFKLGLNKYADMLHHEFKETMNGYNHTMRKELRAQEGFNGITYISPANV 119

Query: 132 --PTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD 189
             P ++DWR+ GAVT +KDQG CGSCW+FS+  ++EG      G L+ LSEQ LVDCST 
Sbjct: 120 QVPKAVDWRQHGAVTSVKDQGHCGSCWSFSSTGSLEGQHFRKAGVLVSLSEQNLVDCSTK 179

Query: 190 --NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKG 247
             N+GC+GGLMD AF YI +N G+ TE  YPY   + +C   K   V AT + + D+P+G
Sbjct: 180 YGNNGCNGGLMDNAFRYIKDNGGVDTEKSYPYEGIDDSCHFNK-ATVGATDTGFVDIPQG 238

Query: 248 DEQALLQAVSNQ-PVSVCVDASGRAFHFYKSGVLN-ADC-GNNCDHGVAVVGFGTAEEEN 304
           DE+A+++AV+   PV+V +DAS  +F  Y  GV N  +C  +N DHGV VVG+GT  +++
Sbjct: 239 DEEAMMKAVATMGPVAVAIDASNESFQLYSEGVYNDPNCSSDNLDHGVLVVGYGT--DKD 296

Query: 305 GAKYWLIKNSWGETWGESGYIRILRDA-GLCGIATAASYPV 344
           G  YWL+KNSWG TWG+ GYI++ R+    CGIATA+S+P 
Sbjct: 297 GQDYWLVKNSWGTTWGDQGYIKMARNQDNQCGIATASSFPT 337


>gi|413944252|gb|AFW76901.1| hypothetical protein ZEAMMB73_101481 [Zea mays]
          Length = 232

 Score =  265 bits (677), Expect = 2e-68,   Method: Compositional matrix adjust.
 Identities = 126/230 (54%), Positives = 161/230 (70%), Gaps = 12/230 (5%)

Query: 123 FKYQNVT--DVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSE 180
           F+Y+NV+   +P +IDWR  GAVT IKDQGQCG CWAFSAVAA EGI +I+ GKLI LSE
Sbjct: 6   FRYENVSVDAIPATIDWRTNGAVTPIKDQGQCGCCWAFSAVAATEGIVKISTGKLISLSE 65

Query: 181 QQLVDCST--DNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATI 238
           Q+LVDC    ++ GC GGLMD AF++II+N GL TE++YPY   +G C +      AA I
Sbjct: 66  QELVDCDVYGEDQGCEGGLMDDAFKFIIKNGGLTTESNYPYTAADGKCKSGSNS--AANI 123

Query: 239 SKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFG 298
             YED+P  DE AL++AV+NQPVSV VD     F FY  GV+   CG + DHG+A +G+G
Sbjct: 124 KGYEDVPTNDEAALMKAVANQPVSVAVDGGDMTFQFYSGGVMTGSCGTDLDHGIAAIGYG 183

Query: 299 TAEEENGAKYWLIKNSWGETWGESGYIRILRD----AGLCGIATAASYPV 344
             +  +G KYWL+KNSWG TWGE+GY+R+ +D     G+CG+A   SYP 
Sbjct: 184 --KTSDGTKYWLMKNSWGTTWGENGYLRMEKDISDKKGMCGLAIEPSYPT 231


>gi|30023547|gb|AAO48766.2| cathepsin L-like cysteine proteinase [Tenebrio molitor]
          Length = 337

 Score =  265 bits (677), Expect = 3e-68,   Method: Compositional matrix adjust.
 Identities = 152/344 (44%), Positives = 206/344 (59%), Gaps = 23/344 (6%)

Query: 14  FVIIILVITCASQVVSGRSMHEPSIVEKHEQWMA---QHGRTYKDELEKAMRLNIFKQNL 70
           F+I + +    SQ VS   + +       EQW A    H + Y+ E E+  R+ IF +N 
Sbjct: 3   FLIFLAICVAGSQAVSFFDLVQ-------EQWGAFKMTHNKQYQSETEERFRMKIFMENS 55

Query: 71  EYIEKANK---EGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSV-SRQSSRPSTFKYQ 126
             + K NK   +G  ++KLG N+++D+ + EF  +  G+NR    + S +S    TF   
Sbjct: 56  HTVAKHNKLYAQGLVSFKLGINKYADMLHHEFVQVLNGFNRTKSGLRSGESDDSVTFLPP 115

Query: 127 NVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDC 186
               +P  IDWR+KGAVT +KDQGQCGSCW+FSA  ++EG      GKL+ LSEQ LVDC
Sbjct: 116 ANVQLPGQIDWRDKGAVTPVKDQGQCGSCWSFSATGSLEGQHFRQSGKLVSLSEQNLVDC 175

Query: 187 STD--NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDL 244
           S    N+GC+GGLMD AF YI  N G+ TE  YPY+ E+  C + K K   AT   Y D+
Sbjct: 176 SEKFGNNGCNGGLMDNAFRYIKANGGIDTEQAYPYKAEDEKC-HYKPKNKGATDRGYVDI 234

Query: 245 PKGDEQALLQAVSN-QPVSVCVDASGRAFHFYKSGV-LNADC-GNNCDHGVAVVGFGTAE 301
             G+E  L  AV+   PVSV +DAS ++F  Y  GV    DC  +  DHGV VVG+GT  
Sbjct: 235 ESGNEDKLQSAVATVGPVSVAIDASHQSFQLYSGGVYYEPDCSASQLDHGVLVVGYGT-- 292

Query: 302 EENGAKYWLIKNSWGETWGESGYIRILRDA-GLCGIATAASYPV 344
           E++G  YWL+KNSWG++WG+ GYI++ R+    CGIAT ASYP+
Sbjct: 293 EDDGTDYWLVKNSWGKSWGDQGYIKMARNRNNNCGIATEASYPL 336


>gi|340727787|ref|XP_003402217.1| PREDICTED: cathepsin L-like [Bombus terrestris]
          Length = 343

 Score =  265 bits (677), Expect = 3e-68,   Method: Compositional matrix adjust.
 Identities = 145/347 (41%), Positives = 209/347 (60%), Gaps = 22/347 (6%)

Query: 11  IPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNL 70
           + +F+ +I+ +   +Q +S   +    + ++   +  +H + YK+++E+  R+ IF  N 
Sbjct: 1   MKLFLFLIVAVLATAQAISFFEL----VNQEWTTFKMEHNKVYKNDVEERFRMKIFMDNK 56

Query: 71  EYIEKANKEGNR-----TYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKY 125
             I K N  GN      +YKL  N++ D+ + EF     G+N+ + +  R    P    +
Sbjct: 57  HKIAKHN--GNYEMKKVSYKLKMNKYGDMLHHEFVNTLNGFNKSINTQLRSERLPIAASF 114

Query: 126 QNVTDV--PTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQL 183
               +V  P ++DWRE GAVT +KDQG CGSCW+FSA  A+EG      G LI LSEQ L
Sbjct: 115 IEPANVVLPKTVDWREHGAVTPVKDQGHCGSCWSFSATGALEGQHFRRTGILIPLSEQNL 174

Query: 184 VDCSTD--NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKY 241
           +DCS    N+GC+GGLMD+AF+YI +NKGL TE  YPY  E   C      + A  +  Y
Sbjct: 175 IDCSGKYGNNGCNGGLMDQAFQYIKDNKGLDTEVTYPYEAENDKCRYNAANSGARDVG-Y 233

Query: 242 EDLPKGDEQALLQAVSN-QPVSVCVDASGRAFHFYKSGV-LNADCGN-NCDHGVAVVGFG 298
            D+P+G+E+ L  AV+   PVSV +DAS ++F FY  GV    +C + N DHGV  VG+G
Sbjct: 234 VDIPQGNEKKLKAAVATIGPVSVAIDASHQSFQFYSEGVYYEPECSSENLDHGVLAVGYG 293

Query: 299 TAEEENGAKYWLIKNSWGETWGESGYIRILRDA-GLCGIATAASYPV 344
           T  +ENG  YWL+KNSWGETWG++GYI++ R+    CGIA+ ASYP+
Sbjct: 294 T--DENGQDYWLVKNSWGETWGDNGYIKMARNKLNHCGIASTASYPL 338


>gi|125564712|gb|EAZ10092.1| hypothetical protein OsI_32402 [Oryza sativa Indica Group]
          Length = 382

 Score =  265 bits (677), Expect = 3e-68,   Method: Compositional matrix adjust.
 Identities = 146/378 (38%), Positives = 209/378 (55%), Gaps = 45/378 (11%)

Query: 9   FIIPMFVII--ILVITCAS----QVVSGRSMH---EP---SIVEKHEQWMAQHGRTYKDE 56
           F +P  +I+  +  I C+S    +V S  + +   EP   +++E  ++W A++ R+Y   
Sbjct: 7   FSMPCLLILLGVFFIGCSSGTARRVTSDTAANTDGEPAATTMMEMFQRWKAEYNRSYATP 66

Query: 57  LEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSR- 115
            E+  RL ++ +N+ YIE  N      Y+LG   ++DLTN+EF A+YT    P+ S +  
Sbjct: 67  EEERRRLRVYARNVRYIEATNAAAGLAYELGETAYTDLTNDEFMAMYTA--PPLRSAADD 124

Query: 116 ------------------QSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWA 157
                             +  +P  + +      P S+DWR  GAVT +KDQG+CGSCWA
Sbjct: 125 DDDAATTTIITTRAGPVDEHQQPEVY-FNESAGAPASVDWRASGAVTEVKDQGRCGSCWA 183

Query: 158 FSAVAAVEGITQITRGKLIELSEQQLVDCSTDNHGCSGGLMDKAFEYIIENKGLATEADY 217
           FS VA VEGI +I +GKL+ LSEQ+LVDC T + GC GG+  +A E+I  N G+ T  DY
Sbjct: 184 FSTVAVVEGIQKIKKGKLVSLSEQELVDCDTLDSGCDGGVSYRALEWITANGGITTRDDY 243

Query: 218 PYR-HEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYK 276
           PY       CD  K    AATI+    +    E +L  A + QPV+V ++A G  F  Y+
Sbjct: 244 PYTGAAAAACDRAKLGHHAATIAGLRRVATRSEASLQNAAAAQPVAVSIEAGGDNFQHYR 303

Query: 277 SGVLNADCGNNCDHGVAVVGFGTAE-----EENGAKYWLIKNSWGETWGESGYIRILRDA 331
            GV +  CG   +HGV VVG+G  E        G KYW+IKNSWG+ WG+ GYI++ +D 
Sbjct: 304 KGVYDGPCGTRLNHGVTVVGYGQEEAPVDGSAAGDKYWIIKNSWGKNWGDQGYIKMKKDV 363

Query: 332 -----GLCGIATAASYPV 344
                GLCGIA   S+P+
Sbjct: 364 AGKPEGLCGIAIRPSFPL 381


>gi|375340657|emb|CBJ56264.1| cathepsin S protein [Dicentrarchus labrax]
          Length = 337

 Score =  265 bits (677), Expect = 3e-68,   Method: Compositional matrix adjust.
 Identities = 147/340 (43%), Positives = 200/340 (58%), Gaps = 20/340 (5%)

Query: 13  MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEY 72
           M   ++LV  C    V   +M EP +    + W   HG+ Y+ E+E   R  ++++NL  
Sbjct: 9   MLGSLMLVSLC----VGAAAMFEPKLDAHWKLWKMTHGKKYQTEVEDVSRRELWEKNLML 64

Query: 73  IEKANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
           I   N E   G  TY+L  N   DLT EE    +   + P   + R +S    F      
Sbjct: 65  ITMHNLEASMGLHTYELSMNHMGDLTQEEIMQSFATLSPPT-DIQRAAS---PFAGTTGA 120

Query: 130 DVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD 189
           DVP ++DWREKG VT +K QG CGSCWAFSA  A+EG    T GKL++LS Q LVDCST 
Sbjct: 121 DVPDTMDWREKGCVTSVKMQGSCGSCWAFSAAGALEGQLAKTTGKLVDLSPQNLVDCSTK 180

Query: 190 --NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKG 247
             NHGC+GG M +AF+Y+I+N+G+ ++A YPY    G C     K  AA  S+Y  LP+G
Sbjct: 181 YGNHGCNGGFMHQAFQYVIDNQGIDSDASYPYTGRNGEC-RYNSKFRAANCSQYSFLPEG 239

Query: 248 DEQALLQAVSN-QPVSVCVDASGRAFHFYKSGVLN-ADCGNNCDHGVAVVGFGTAEEENG 305
           +E AL +A++N  P+SV +DA+   F FY+SGV N  +C    +HGV  VG+GT +   G
Sbjct: 240 NEGALKEALANIGPISVAIDATRPTFTFYRSGVYNDPNCSQKVNHGVLAVGYGTLD---G 296

Query: 306 AKYWLIKNSWGETWGESGYIRILRDAG-LCGIATAASYPV 344
             YWL+KNSWG+T+G+ GYIR+ R+    CGIA    YP+
Sbjct: 297 QDYWLVKNSWGKTFGDQGYIRMSRNKNDQCGIALYGCYPI 336


>gi|350583407|ref|XP_003481511.1| PREDICTED: cathepsin S [Sus scrofa]
          Length = 331

 Score =  265 bits (676), Expect = 3e-68,   Method: Compositional matrix adjust.
 Identities = 148/341 (43%), Positives = 210/341 (61%), Gaps = 22/341 (6%)

Query: 13  MFVIIILVITCASQVVSGRSMHEPSIVEKH-EQWMAQHGRTYKDELEKAMRLNIFKQNLE 71
           M  ++ +++ C+S +     +H    +++H + W   +G+ YK++ E+  R  I+++NL+
Sbjct: 1   MKCLVWVLLLCSSAMAQ---LHRDPTLDRHWDLWKKTYGKQYKEKNEEVARRLIWEKNLK 57

Query: 72  YIEKANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNV 128
            +   N E   G  +Y LG N   D+T+EE  +L +     VPS   Q  R  T+K    
Sbjct: 58  TVMLHNLEHSMGMHSYDLGMNHLGDMTSEEVISLMSCVR--VPS---QWPRNVTYKSNPN 112

Query: 129 TDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST 188
             +P S+DWREKG VT +K QG CGSCWAFSAV A+E   ++  G+L+ LS Q LVDCST
Sbjct: 113 QKLPDSMDWREKGCVTEVKYQGSCGSCWAFSAVGALEAQVKMKTGRLVSLSAQNLVDCST 172

Query: 189 D---NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLP 245
           +   N GC+GG M +AF+YII+N G+ +EA YPY+  +G C     K  AAT S+Y +LP
Sbjct: 173 EKYRNKGCNGGFMTEAFQYIIDNNGIDSEASYPYKAVDGKC-KYDSKNRAATCSRYTELP 231

Query: 246 KGDEQALLQAVSNQ-PVSVCVDASGRAFHFYKSGV-LNADCGNNCDHGVAVVGFGTAEEE 303
             DE AL +AV+N+ PVSV +DA   +F FY+SGV  +  C  N +HGV VVG+G     
Sbjct: 232 FADEYALKEAVANKGPVSVAIDAKHSSFFFYRSGVYYDPSCTQNVNHGVLVVGYGNL--- 288

Query: 304 NGAKYWLIKNSWGETWGESGYIRILRDA-GLCGIATAASYP 343
           NG  YWL+KNSWG  +G+ GYIR+ R++   CGIA   SYP
Sbjct: 289 NGKDYWLVKNSWGLNFGDGGYIRMARNSENHCGIANYPSYP 329


>gi|346469447|gb|AEO34568.1| hypothetical protein [Amblyomma maculatum]
          Length = 333

 Score =  265 bits (676), Expect = 3e-68,   Method: Compositional matrix adjust.
 Identities = 147/341 (43%), Positives = 209/341 (61%), Gaps = 21/341 (6%)

Query: 16  IIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEK 75
           ++   + CA    +  +  +  +  + E + + H +TYK  +E+ +R  IF +N  +I K
Sbjct: 1   MLRFALLCAIVAAATAATSQEILRTEWEAFKSTHKKTYKSNVEELLRFKIFTENSLFIAK 60

Query: 76  AN---KEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTF---KYQNVT 129
            N    +G  +YKLG N+F+DL   EF  +  GY        R + R ST+      N +
Sbjct: 61  HNVKYAKGLVSYKLGINQFADLLPHEFVKMMNGYQG-----KRLAGRGSTYLPPANLNDS 115

Query: 130 DVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST- 188
            +P ++DWR+KGAVT +KDQGQCGSCWAFS+  ++EG   +  GKL+ LSEQ LVDCS+ 
Sbjct: 116 SLPKTVDWRKKGAVTPVKDQGQCGSCWAFSSTGSLEGQHFLKTGKLVSLSEQNLVDCSSA 175

Query: 189 -DNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKG 247
             N GC+GGLMD +F YI  N G+ TE  YPY  E+G C  +KE  V AT + + D+ +G
Sbjct: 176 YGNQGCNGGLMDNSFNYIKANGGIDTEDSYPYEAEDGDCRYKKED-VGATDTGFVDIKEG 234

Query: 248 DEQALLQAVSN-QPVSVCVDASGRAFHFYKSGVLN-ADCGN-NCDHGVAVVGFGTAEEEN 304
            E+ L +AV+   PVSV +DAS ++F  Y  GV +  +C + + DHGV  VG+G    +N
Sbjct: 235 SEKDLQKAVATVGPVSVAIDASQQSFQLYSEGVYDEPNCSSESLDHGVLAVGYGV---KN 291

Query: 305 GAKYWLIKNSWGETWGESGYIRILRDA-GLCGIATAASYPV 344
           G KYWL+KNSW ETWG+ GYI + RD    CGIA++ASYP+
Sbjct: 292 GKKYWLVKNSWAETWGQDGYILMSRDKNNQCGIASSASYPL 332


>gi|403302730|ref|XP_003942006.1| PREDICTED: cathepsin S isoform 1 [Saimiri boliviensis boliviensis]
          Length = 339

 Score =  265 bits (676), Expect = 4e-68,   Method: Compositional matrix adjust.
 Identities = 148/342 (43%), Positives = 208/342 (60%), Gaps = 21/342 (6%)

Query: 11  IPMFVIIILVITCASQVVSGRSMHEPSIVEKH-EQWMAQHGRTYKDELEKAMRLNIFKQN 69
           I M  ++ ++  C+S V     +H+   ++ H   W   +G+ YK++ E+A+R  I+++N
Sbjct: 8   ITMKQLVCVLFVCSSAVTQ---LHKDPTLDHHWNLWKKTYGKQYKEKNEEAVRRLIWEKN 64

Query: 70  LEYIEKANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQ 126
           L+++   N E   G  +Y LG N   D+T+EE  +L +    P      Q  R  T+K  
Sbjct: 65  LKFVMLHNLEHSMGMHSYDLGMNHLGDMTSEEVMSLMSSLRVP-----NQWQRNITYKSN 119

Query: 127 NVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDC 186
               +P S+DWREKG VT +K QG CG+CWAFSAV A+E   ++  GKL+ LS Q LVDC
Sbjct: 120 PNQMLPDSVDWREKGCVTEVKYQGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDC 179

Query: 187 STD--NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDL 244
           S    N GC+GG M +AF+YII+NKG+ +EA YPY+  +  C     K  AAT SKY +L
Sbjct: 180 SEKYGNKGCNGGFMTEAFQYIIDNKGIDSEASYPYKATDQKCQ-YDSKYRAATCSKYTEL 238

Query: 245 PKGDEQALLQAVSNQ-PVSVCVDASGRAFHFYKSGV-LNADCGNNCDHGVAVVGFGTAEE 302
           P G E  L +AV+N+ PV V VDAS  +F  Y+SGV  +  C    +HGV V+G+G   +
Sbjct: 239 PYGREDVLKEAVANKGPVCVGVDASHPSFFLYRSGVYYDPACTQKVNHGVLVIGYG---D 295

Query: 303 ENGAKYWLIKNSWGETWGESGYIRILRDAG-LCGIATAASYP 343
            NG +YWL+KNSWG  +GE GYIR+ R+ G  CGIA+  SYP
Sbjct: 296 LNGKEYWLVKNSWGSNFGEQGYIRMARNKGNHCGIASYPSYP 337


>gi|27728675|gb|AAO18731.1| cysteine protease [Gossypium hirsutum]
          Length = 389

 Score =  264 bits (675), Expect = 4e-68,   Method: Compositional matrix adjust.
 Identities = 135/322 (41%), Positives = 193/322 (59%), Gaps = 15/322 (4%)

Query: 33  MHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYI--EKANKEGNR-TYKLGTN 89
           + E  ++E  +QW  +H + Y+   E   R   FK NL+YI    A ++ N+  + +G N
Sbjct: 40  LSEERVLEIFQQWKEKHRKVYRHAEEAEKRFENFKGNLKYILERNAKRKANKWEHHVGLN 99

Query: 90  EFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQ 149
           +F+D++NEEFR  Y    +   +     SR    K Q+  D P+S+DWR  G VT +KDQ
Sbjct: 100 KFADMSNEEFRKAYLSKVKKPINKGITLSRNMRRKVQSC-DAPSSLDWRNYGVVTAVKDQ 158

Query: 150 GQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTDNHGCSGGLMDKAFEYIIENK 209
           G CGSCWAFS+  A+EGI  +  G LI LSEQ+LV+C T N+GC GG MD AFE++I N 
Sbjct: 159 GSCGSCWAFSSTGAMEGINALVTGDLISLSEQELVECDTSNYGCEGGYMDYAFEWVINNG 218

Query: 210 GLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASG 269
           G+ +E+DYPY   +GTC+  KE+    +I  Y+D+ + D  ALL AV+ QPVSV +D S 
Sbjct: 219 GIDSESDYPYTGVDGTCNTTKEETKVVSIDGYQDVEQSD-SALLCAVAQQPVSVGIDGSA 277

Query: 270 RAFHFYKSGVLNADCG---NNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIR 326
             F  Y  G+ +  C    ++ DH V +VG+G+ + E   +YW++KNSWG +WG  GY  
Sbjct: 278 IDFQLYTGGIYDGSCSDDPDDIDHAVLIVGYGSEDSE---EYWIVKNSWGTSWGIDGYFY 334

Query: 327 ILRDA----GLCGIATAASYPV 344
           + RD     G+C +   ASYP 
Sbjct: 335 LKRDTDLPYGVCAVNAMASYPT 356


>gi|357167707|ref|XP_003581294.1| PREDICTED: actinidain-like [Brachypodium distachyon]
          Length = 358

 Score =  264 bits (675), Expect = 4e-68,   Method: Compositional matrix adjust.
 Identities = 138/321 (42%), Positives = 189/321 (58%), Gaps = 16/321 (4%)

Query: 37  SIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTN 96
           ++  +HE+WMA+ GR+Y D  EKA R  +F  N  +++  N+ GNRTY LG N+FSDLT+
Sbjct: 37  TMASRHERWMARFGRSYTDAGEKARRQEVFGANARHVDAVNRAGNRTYTLGLNQFSDLTD 96

Query: 97  EEFRALYTGYNRPVPS----VSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQC 152
            EF   + GY R        +  +   P         D+P S+DWR KGAVT IK+Q  C
Sbjct: 97  HEFLQQHLGYGRHHGQRGLLLPEEEVMPKATALGYGQDMPYSVDWRAKGAVTEIKNQRSC 156

Query: 153 GSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTDNHGCSGGLMDKAFEYIIENKGLA 212
           GSCWAF+AVAA EG+ +I  G LI +SEQQ++DC+ D   C  G +  A  Y++ + GL 
Sbjct: 157 GSCWAFAAVAATEGLVKIATGNLISMSEQQVLDCTGDRSSCDSGYISDALRYVVTSGGLQ 216

Query: 213 TEADYPYRHEEGTCDNQ---KEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASG 269
            EA Y Y  ++G C ++   +  + A+    +     GDE AL    + QPV+V V+AS 
Sbjct: 217 REAAYAYTGQKGACGSRRFARPNSAASVGGVHMATLNGDEGALQGLAARQPVAVIVEASE 276

Query: 270 RAFHFYKSGVL--NADCGNNCDHGVAVVGFGTAEEENGA-KYWLIKNSWGETWGESGYIR 326
             F  Y SGV   +A CG   +H + VVG+GT   ENGA +YWL+KN WG  WGE+GY+R
Sbjct: 277 PDFRHYSSGVYAGSASCGRELNHALTVVGYGT---ENGAGEYWLVKNQWGTWWGENGYMR 333

Query: 327 ILRDAGL---CGIATAASYPV 344
           + R  G    CGIA+ A YP 
Sbjct: 334 VARRNGAGANCGIASVAFYPT 354


>gi|32396018|gb|AAP41846.1| cysteine protease [Anthurium andraeanum]
          Length = 502

 Score =  264 bits (675), Expect = 4e-68,   Method: Compositional matrix adjust.
 Identities = 143/319 (44%), Positives = 187/319 (58%), Gaps = 18/319 (5%)

Query: 40  EKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYK----LGTNEFSDLT 95
           E  E+WM +H + Y    EKA R   F  NL ++ K N EG R       +G N F+DL+
Sbjct: 49  ELFERWMEKHRKVYAHPGEKARRYANFLSNLAFVRKRNAEGRRAPSSGQGVGMNVFADLS 108

Query: 96  NEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT--DVPTSIDWREKGAVTHIKDQGQCG 153
           NEEFR +Y+       +   + +R    + + V   D P S+DWR++GAVT +K+QG CG
Sbjct: 109 NEEFREVYSSRVLRKKAAEGRGARRRAGEGRVVAGCDAPASLDWRKRGAVTAVKNQGDCG 168

Query: 154 SCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTDNHGCSGGLMDKAFEYIIENKGLAT 213
           SCWAFS+  A+EGI  IT G+LI LSEQ+LVDC T N GC GG MD AFE++I N G+ +
Sbjct: 169 SCWAFSSTGAMEGINAITTGELISLSEQELVDCDTTNEGCDGGYMDYAFEWVINNGGIDS 228

Query: 214 EADYPYRHE-EGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAF 272
           EA+YPY  + +  C+  KE+    +I  YED+    E ALL A   QPVSV +D S   F
Sbjct: 229 EANYPYTGQADSVCNTTKEEIKVVSIDGYEDVAT-SESALLCAAVQQPVSVGIDGSSLDF 287

Query: 273 HFYKSGVLNADCGNN---CDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILR 329
             Y  G+ + DC  N    DH V VVG+G   ++ G  YW++KNSWG  WG  GYI I R
Sbjct: 288 QLYAGGIYDGDCSGNPDDIDHAVLVVGYG---QQGGTDYWIVKNSWGTDWGMQGYIYIRR 344

Query: 330 DAGL----CGIATAASYPV 344
           + GL    C I   ASYP 
Sbjct: 345 NTGLPYGVCAIDAMASYPT 363


>gi|330803818|ref|XP_003289899.1| hypothetical protein DICPUDRAFT_154350 [Dictyostelium purpureum]
 gi|325080010|gb|EGC33584.1| hypothetical protein DICPUDRAFT_154350 [Dictyostelium purpureum]
          Length = 326

 Score =  264 bits (675), Expect = 4e-68,   Method: Compositional matrix adjust.
 Identities = 146/344 (42%), Positives = 206/344 (59%), Gaps = 28/344 (8%)

Query: 10  IIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQN 69
           ++   +   L+I C S   + R   +       + WM +H ++Y ++ E   R ++F+ N
Sbjct: 3   LVLALIFCFLIINCCS---AARIFSQKQYQTAFQNWMVKHQKSYTND-EFGSRYSVFQDN 58

Query: 70  LEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQN-- 127
           ++ + K N++G+ T  LG N  +DLTNEEF+ LY G    V           T+K +   
Sbjct: 59  MDIVAKWNQKGSNTI-LGLNVMADLTNEEFKKLYLGTKANV-----------TYKKKTLV 106

Query: 128 -VTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDC 186
            V+ +P S+DWR  GAVT +K+QGQCG C+AFS   +VEGI +IT  +L+ LSEQQ++DC
Sbjct: 107 GVSGLPASVDWRANGAVTAVKNQGQCGGCYAFSTTGSVEGIHEITSQQLVPLSEQQILDC 166

Query: 187 STD--NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDL 244
           S    N+GC GGLM  +FEYII   GL TEA YPY  E G C   K K + ATI+ Y+++
Sbjct: 167 SGSEGNNGCDGGLMTNSFEYIIAVGGLDTEASYPYTGEVGKCKFNK-KNIGATITGYKNV 225

Query: 245 PKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGV-LNADCGNN-CDHGVAVVGFGTAEE 302
             G E  L  AV+ QPVSV +DAS  +F  Y SGV    +C +   DHGV  VG+G+   
Sbjct: 226 ESGSESDLQTAVAAQPVSVAIDASQSSFQLYASGVYYEPECSSTQLDHGVLAVGYGS--- 282

Query: 303 ENGAKYWLIKNSWGETWGESGYIRILRDA-GLCGIATAASYPVA 345
           ++G  YW++KNSWG  WGE+G+I + R+    CGIAT AS+P A
Sbjct: 283 QSGQDYWIVKNSWGADWGENGFILMARNKDNNCGIATMASFPTA 326


>gi|119433808|gb|ABL74967.1| cysteine protease [Acanthamoeba castellanii]
          Length = 330

 Score =  264 bits (675), Expect = 4e-68,   Method: Compositional matrix adjust.
 Identities = 142/317 (44%), Positives = 191/317 (60%), Gaps = 14/317 (4%)

Query: 34  HEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSD 93
           H+P +      WM  H ++Y +E E   R N++++N  +I++ N++ N +Y L  N+F D
Sbjct: 23  HDP-LTGVFADWMRTHTKSYSNE-EFVFRWNVWRENYNFIQEENRK-NNSYYLTMNKFGD 79

Query: 94  LTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCG 153
           LTN EF  +Y G        +      +         +P + DWR+KGAVTH+K+QGQCG
Sbjct: 80  LTNAEFNKVYKG--LAFDYSAHILKAKAATPAAPAPGLPANFDWRQKGAVTHVKNQGQCG 137

Query: 154 SCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD--NHGCSGGLMDKAFEYIIENKGL 211
           SCW+FS   + EG   + RG L+ LSEQ L+DCS    N+GC+GGLMD AFEYII NKG+
Sbjct: 138 SCWSFSTTGSTEGANFLKRGTLVSLSEQNLIDCSGSYGNNGCNGGLMDYAFEYIINNKGI 197

Query: 212 ATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRA 271
            TEA YPY   +  C      +   +++ Y D+  GDE ALL AV+ +P SV +DAS  +
Sbjct: 198 DTEASYPYETAQYNCRYNPANS-GGSLTSYTDVSSGDENALLNAVAIEPTSVAIDASHNS 256

Query: 272 FHFYKSGV-LNADCGNN-CDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILR 329
           F FY  GV   + C +   DHGV  VG+GT   ENG  YWL+KNSWG  WG  GYI++ R
Sbjct: 257 FQFYSGGVYYESSCSSTQLDHGVLAVGWGT---ENGQDYWLVKNSWGADWGLQGYIKMAR 313

Query: 330 DA-GLCGIATAASYPVA 345
           +    CGIATAASYP A
Sbjct: 314 NRHNNCGIATAASYPTA 330


>gi|5081735|gb|AAD39513.1|AF147207_1 cathepsin L-like protease precursor [Artemia franciscana]
          Length = 338

 Score =  264 bits (675), Expect = 4e-68,   Method: Compositional matrix adjust.
 Identities = 148/343 (43%), Positives = 205/343 (59%), Gaps = 15/343 (4%)

Query: 11  IPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNL 70
           +    +I L+     Q+ +  S+      E H  + A H + Y  +LE+  R+ I+ +N 
Sbjct: 1   MKQITLIFLLGAVLVQLSAALSLTNLLADEWH-LFKATHKKEYPSQLEEKFRMKIYLENK 59

Query: 71  EYIEKAN---KEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQN 127
             + K N   ++G ++Y++  N+F DL + EFR++  GY     + SR  S  +  +  N
Sbjct: 60  HKVAKHNILYEKGEKSYQVAMNKFGDLLHHEFRSIMNGYQHKKQNSSRAESTFTFMEPAN 119

Query: 128 VTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCS 187
           V +VP S+DWR KGA+T +KDQGQCGSCWAFS+  A+EG T    GKLI LSEQ L+DCS
Sbjct: 120 V-EVPESVDWRVKGAITPVKDQGQCGSCWAFSSTGALEGQTFRKTGKLISLSEQNLIDCS 178

Query: 188 TD--NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLP 245
               N GC+GGLMD+AF+YI +NKG+ TE  YPY  E+  C     +   A    +  +P
Sbjct: 179 GKYGNEGCNGGLMDQAFQYIKDNKGIDTENTYPYEAEDNVC-RYNPRNRGAIDRGFVHIP 237

Query: 246 KGDEQALLQAVSN-QPVSVCVDASGRAFHFYKSGV-LNADC-GNNCDHGVAVVGFGTAEE 302
            G+E  L  AV+   PVSV +DAS  +F FY  GV     C  ++ DHGV VVG+G+   
Sbjct: 238 SGEEDKLKAAVATVGPVSVAIDASHESFQFYSKGVYYEPSCDSDDLDHGVLVVGYGS--- 294

Query: 303 ENGAKYWLIKNSWGETWGESGYIRILRD-AGLCGIATAASYPV 344
           +NG  YWL+KNSW E WG+ GYI+I R+    CGIATAASYP+
Sbjct: 295 DNGKDYWLVKNSWSEHWGDEGYIKIARNRKNHCGIATAASYPL 337


>gi|151573016|gb|ABS17683.1| cathepsin L-1 [Artemia persimilis]
          Length = 334

 Score =  264 bits (675), Expect = 4e-68,   Method: Compositional matrix adjust.
 Identities = 146/339 (43%), Positives = 204/339 (60%), Gaps = 15/339 (4%)

Query: 15  VIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIE 74
            +I L+     Q+ +  S+      E H  + A H + Y  +LE+  R+ I+ +N   + 
Sbjct: 1   TLIFLLGAVFVQLSAALSLTNLLADEWH-LFKATHKKEYPSQLEEKFRMKIYLENKHKVA 59

Query: 75  KAN---KEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDV 131
           K N   ++G ++Y++  N+F DL + EFR++  GY     + SR  S  +  +  NV +V
Sbjct: 60  KHNILFEKGEKSYQVAMNKFGDLLHHEFRSIMNGYQHKKQNSSRAESTFTFMEPANV-EV 118

Query: 132 PTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD-- 189
           P S+DWREKGA+T +KDQGQCG CWAFS+  A+EG T    GKL+ L EQ L+DCS    
Sbjct: 119 PESVDWREKGAITPVKDQGQCGPCWAFSSTGALEGQTFRKTGKLVSLREQNLIDCSGKYG 178

Query: 190 NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDE 249
           N GC+GGLMD+AF+YI +NKG+ TE  YPY  E+  C     +   A    + D+P G+E
Sbjct: 179 NEGCNGGLMDQAFQYIKDNKGIDTENTYPYEAEDDVC-RYNPRNRGAVDRGFVDIPSGEE 237

Query: 250 QALLQAVSN-QPVSVCVDASGRAFHFYKSGV-LNADC-GNNCDHGVAVVGFGTAEEENGA 306
             L  AV+   PVSV +DAS  +F FY  GV     C  ++ DHGV VVG+G+   +NG 
Sbjct: 238 DKLKAAVATVGPVSVAIDASHESFQFYSKGVYYEPSCDSDDLDHGVLVVGYGS---DNGK 294

Query: 307 KYWLIKNSWGETWGESGYIRILRD-AGLCGIATAASYPV 344
            YWL+KNSW E WG+ GYI+I R+    CG+ATAASYP+
Sbjct: 295 DYWLVKNSWSEHWGDQGYIKIARNRKNHCGVATAASYPL 333


>gi|33112583|gb|AAP94047.1| cathepsin-L-like cysteine peptidase 03 [Tenebrio molitor]
          Length = 337

 Score =  264 bits (675), Expect = 4e-68,   Method: Compositional matrix adjust.
 Identities = 151/344 (43%), Positives = 206/344 (59%), Gaps = 23/344 (6%)

Query: 14  FVIIILVITCASQVVSGRSMHEPSIVEKHEQWMA---QHGRTYKDELEKAMRLNIFKQNL 70
           F+I + +    SQ VS   + +       EQW A    H + Y+ + E+  R+ IF +N 
Sbjct: 3   FLIFLAICVAGSQAVSFFDLVQ-------EQWGAFKMTHNKQYQSDTEERFRMKIFMENS 55

Query: 71  EYIEKANK---EGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSV-SRQSSRPSTFKYQ 126
             + K NK   +G  ++KLG N+++D+ + EF  +  G+NR    + S +S    TF   
Sbjct: 56  HTVAKHNKLYAQGLVSFKLGINKYADMLHHEFVQVLNGFNRTKSGLRSGESDDSVTFLPP 115

Query: 127 NVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDC 186
               +P  IDWR+KGAVT +KDQGQCGSCW+FSA  ++EG      GKL+ LSEQ LVDC
Sbjct: 116 ANVQLPGQIDWRDKGAVTPVKDQGQCGSCWSFSATGSLEGQHFRKSGKLVSLSEQNLVDC 175

Query: 187 STD--NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDL 244
           S    N+GC+GGLMD AF YI  N G+ TE  YPY+ E+  C + K K   AT   Y D+
Sbjct: 176 SEKFGNNGCNGGLMDNAFRYIKANGGIDTEQAYPYKAEDEKC-HYKPKNKGATDRGYVDI 234

Query: 245 PKGDEQALLQAVSN-QPVSVCVDASGRAFHFYKSGV-LNADC-GNNCDHGVAVVGFGTAE 301
             G+E  L  AV+   PVSV +DAS ++F  Y  GV    DC  +  DHGV VVG+GT  
Sbjct: 235 ESGNEDKLQSAVATVGPVSVAIDASHQSFQLYSGGVYYEPDCSASQLDHGVLVVGYGT-- 292

Query: 302 EENGAKYWLIKNSWGETWGESGYIRILRDA-GLCGIATAASYPV 344
           E++G  YWL+KNSWG++WG+ GYI++ R+    CGIAT ASYP+
Sbjct: 293 EDDGTDYWLVKNSWGKSWGDQGYIKMARNRDNNCGIATEASYPL 336


>gi|167526493|ref|XP_001747580.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163774026|gb|EDQ87660.1| predicted protein [Monosiga brevicollis MX1]
          Length = 330

 Score =  264 bits (674), Expect = 5e-68,   Method: Compositional matrix adjust.
 Identities = 148/328 (45%), Positives = 194/328 (59%), Gaps = 30/328 (9%)

Query: 25  SQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDEL---EKAMRLNIFKQNLEYIEKANKEGN 81
           SQ +  R++H   +++    +   HG  Y  +L   E A R ++   NL  IE A+  GN
Sbjct: 11  SQFLPRRNLH--LVLKGPTAFRRIHGVFYSSQLGLCEPAFRCHL--ANLRVIE-AHNAGN 65

Query: 82  RTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTS-IDWREK 140
            ++ +G  +F+DLT  EF A        V       +RP    +  +T+ P   +DWR+K
Sbjct: 66  SSFTMGITQFADLTAAEFSAY-------VKRFPMNVTRPRNEVW--ITEAPLQEVDWRQK 116

Query: 141 GAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST--DNHGCSGGLM 198
            AVT IK+QGQCGSCW+FS   +VEG   I  GKL+ LSEQQL+DCST   NHGC+GGLM
Sbjct: 117 NAVTEIKNQGQCGSCWSFSTTGSVEGAHAIATGKLVSLSEQQLMDCSTRYGNHGCNGGLM 176

Query: 199 DKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSN 258
           D AFEY+I N GL TE DYPY  E+G C+ +KEK  AA I  + ++PK  E  L  AVS 
Sbjct: 177 DYAFEYVIANGGLDTEEDYPYTAEDGKCNTEKEKKHAAEIHGFRNVPKEHEDQLAAAVSI 236

Query: 259 QPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGET 318
            PVSV ++A    F  Y SGV +  CG + DHGV VVG+          YW++KNSWG++
Sbjct: 237 GPVSVAIEADQAGFQHYTSGVFDGKCGTSLDHGVLVVGYSD-------DYWIVKNSWGKS 289

Query: 319 WGESGYIRILRDA---GLCGIATAASYP 343
           WGE GYIR+ R     G+CGI   ASYP
Sbjct: 290 WGEEGYIRLKRGVDKKGMCGITMQASYP 317


>gi|224116880|ref|XP_002317417.1| predicted protein [Populus trichocarpa]
 gi|118488173|gb|ABK95906.1| unknown [Populus trichocarpa]
 gi|222860482|gb|EEE98029.1| predicted protein [Populus trichocarpa]
          Length = 498

 Score =  264 bits (674), Expect = 5e-68,   Method: Compositional matrix adjust.
 Identities = 138/325 (42%), Positives = 192/325 (59%), Gaps = 20/325 (6%)

Query: 33  MHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKAN--KEGNRTYKLGTNE 90
           + E  I E  + W  +H + YK   E   R+  FK+NL+YI + N  ++    +K+G N+
Sbjct: 41  LTEEGITEVFKLWKEKHQKVYKHAEEAERRIGNFKRNLKYIIEKNGKRKSGLEHKVGLNK 100

Query: 91  FSDLTNEEFRALY-TGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQ 149
           F+DL+NEEFR +Y +   +P+    ++  R     +    D P+S+DWR KG VT +KDQ
Sbjct: 101 FADLSNEEFREMYLSKVKKPITIEEKRKHR-----HLQTCDAPSSLDWRNKGVVTAVKDQ 155

Query: 150 GQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDC-STDNHGCSGGLMDKAFEYIIEN 208
           G CGSCW+FS   A+E I  I  G LI LSEQ+LVDC +T+N+GC GG MD AF+++I N
Sbjct: 156 GDCGSCWSFSTTGAIEAINAIVTGDLISLSEQELVDCDTTNNYGCEGGDMDSAFQWVIGN 215

Query: 209 KGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDAS 268
            G+ TEADYPY   +GTC+  KE+    +I  Y D+   D  ALL A   QP+SV +D S
Sbjct: 216 GGIDTEADYPYTGVDGTCNTAKEEKKVVSIEGYVDVDPSD-SALLCATVQQPISVGMDGS 274

Query: 269 GRAFHFYKSGVLNADCG---NNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYI 325
              F  Y  G+ + DC    N+ DH + +VG+G+   EN   YW++KNSWG  WG  GY 
Sbjct: 275 ALDFQLYTGGIYDGDCSGDPNDIDHAILIVGYGS---ENDEDYWIVKNSWGTEWGMEGYF 331

Query: 326 RILRDA----GLCGIATAASYPVAI 346
            I R+     G+C I   ASYP  +
Sbjct: 332 YIRRNTSKPYGVCAINADASYPTKV 356


>gi|219687002|dbj|BAH08632.1| daikon cysteine protease RD21 [Raphanus sativus]
          Length = 289

 Score =  264 bits (674), Expect = 5e-68,   Method: Compositional matrix adjust.
 Identities = 123/219 (56%), Positives = 158/219 (72%), Gaps = 8/219 (3%)

Query: 131 VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD- 189
           +P S+DWR++GAV  +KDQG CGSCWAFS + AVEGI +I  G LI LSEQ+LVDC T  
Sbjct: 3   IPESVDWRKEGAVAAVKDQGSCGSCWAFSTIGAVEGINKIVTGDLISLSEQELVDCDTSY 62

Query: 190 NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDE 249
           N GC+GGLMD AFE+II+N G+ TE DYPY+  +G CD  ++ A   TI  YED+P+ +E
Sbjct: 63  NQGCNGGLMDYAFEFIIKNGGIDTEEDYPYKAADGRCDQNRKNAKVVTIDAYEDVPENNE 122

Query: 250 QALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYW 309
            AL +A++NQP+SV ++A GRAF  Y SGV +  CG   DHGV  VG+GT   ENG  YW
Sbjct: 123 AALKKALANQPISVAIEAGGRAFQLYSSGVFDGTCGTELDHGVVAVGYGT---ENGKDYW 179

Query: 310 LIKNSWGETWGESGYIRILRD----AGLCGIATAASYPV 344
           +++NSWG +WGESGYI++ R+     G CGIA  ASYP+
Sbjct: 180 IVRNSWGGSWGESGYIKMARNIAEATGKCGIAMEASYPI 218


>gi|157311713|ref|NP_001098585.1| uncharacterized protein LOC564979 precursor [Danio rerio]
 gi|156230121|gb|AAI52284.1| Wu:fa26c03 protein [Danio rerio]
          Length = 336

 Score =  264 bits (674), Expect = 5e-68,   Method: Compositional matrix adjust.
 Identities = 141/341 (41%), Positives = 207/341 (60%), Gaps = 16/341 (4%)

Query: 14  FVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYI 73
            +  +LV  C S V +  S+ +  + +    W +QHG++Y +++E   R+ I+++NL  I
Sbjct: 1   MMFALLVTLCISAVFAASSI-DIQLDDHWNSWKSQHGKSYHEDVEVGRRM-IWEENLRKI 58

Query: 74  EKANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTD 130
           E+ N E   GN T+K+G N+F D+TNEEFR    GY         ++S+   F   +   
Sbjct: 59  EQHNFEYSYGNHTFKMGMNQFGDMTNEEFRQAMNGYKHD----PNRTSQGPLFMEPSFFA 114

Query: 131 VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD- 189
            P  +DWR++G VT +KDQ QCGSCW+FS+  A+EG      GKLI +SEQ LVDCS   
Sbjct: 115 APQQVDWRQRGFVTPVKDQKQCGSCWSFSSTGALEGQLFRKTGKLISMSEQNLVDCSRPQ 174

Query: 190 -NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGD 248
            N GC+GGLMD+AF+Y+ ENKGL +E  YPY   +        +   A I+ + D+P+G+
Sbjct: 175 GNQGCNGGLMDQAFQYVKENKGLDSEQSYPYLARDDLPCRYDPRFNVAKITGFVDIPRGN 234

Query: 249 EQALLQAVSN-QPVSVCVDASGRAFHFYKSGVL--NADCGNNCDHGVAVVGFG-TAEEEN 304
           E AL+ AV+   PVSV +DAS ++  FY+SG+    A   +  DH V VVG+G    +  
Sbjct: 235 ELALMNAVAAVGPVSVAIDASHQSLQFYQSGIYYERACSSSRLDHAVLVVGYGYQGADVA 294

Query: 305 GAKYWLIKNSWGETWGESGYIRILRDA-GLCGIATAASYPV 344
           G +YW++KNSW + WG+ GYI + +D    CG+AT+ASYP+
Sbjct: 295 GNRYWIVKNSWSDKWGDKGYIYMAKDKNNHCGVATSASYPL 335


>gi|334324655|ref|XP_001370975.2| PREDICTED: cathepsin S-like isoform 1 [Monodelphis domestica]
          Length = 331

 Score =  264 bits (674), Expect = 5e-68,   Method: Compositional matrix adjust.
 Identities = 141/321 (43%), Positives = 198/321 (61%), Gaps = 19/321 (5%)

Query: 33  MHEPSIVEKH-EQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKE---GNRTYKLGT 88
           +H   +++ H + W   HG+ YK + E+  R  I+++NL+Y+   N E   G  +Y L  
Sbjct: 18  LHRDPMLDGHWDLWKKTHGKQYKGQNEEIARRLIWEKNLKYVTLHNLEHSMGLHSYDLSM 77

Query: 89  NEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKD 148
           N   D+T+EE  +L +    P      Q +R +T++  +   +P S+DWREKG VT +K 
Sbjct: 78  NHLGDMTSEEVISLMSSLRIP-----NQWNRNTTYRLSSNQKLPDSVDWREKGCVTEVKY 132

Query: 149 QGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST---DNHGCSGGLMDKAFEYI 205
           QG CGSCWAFSAV A+E   ++  GKL+ LS Q LVDCST   DNHGC+GG M  AF+Y+
Sbjct: 133 QGSCGSCWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTDKYDNHGCNGGFMTSAFQYV 192

Query: 206 IENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQ-PVSVC 264
           I+N G+ ++  YPY+  +G C      + AAT SKY +LP G E+AL +AV+N+ PVSV 
Sbjct: 193 IDNNGIDSDVSYPYKATDGKC-QYNPASRAATCSKYTELPYGSEEALKEAVANKGPVSVG 251

Query: 265 VDASGRAFHFYKSGV-LNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESG 323
           +DA   +F  YKSGV  +  C    +HGV V+G+G  +   G  YWL+KNSWG  +G+ G
Sbjct: 252 IDAKTPSFFLYKSGVYYDPSCTQKVNHGVLVIGYGNLD---GQDYWLVKNSWGLHFGDKG 308

Query: 324 YIRILRDAG-LCGIATAASYP 343
           Y+RI R+ G  CGIA   SYP
Sbjct: 309 YVRIARNRGNHCGIANFPSYP 329


>gi|356545112|ref|XP_003540989.1| PREDICTED: LOW QUALITY PROTEIN: KDEL-tailed cysteine endopeptidase
           CEP1-like [Glycine max]
          Length = 400

 Score =  264 bits (674), Expect = 6e-68,   Method: Compositional matrix adjust.
 Identities = 139/311 (44%), Positives = 201/311 (64%), Gaps = 8/311 (2%)

Query: 18  ILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKAN 77
           ++ +TC  Q  S +S  E    E+HE+WMAQ+G+ Y+D  E   R  IFK N+++IE  N
Sbjct: 92  LVGVTCGRQCRS-KSRLEACTSERHEKWMAQYGKVYEDAAEMEKRFQIFKNNVQFIESFN 150

Query: 78  KEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQN-VTDVPTSID 136
             G++ + +  N+F DL +EEF+AL     R V  V   ++  ++F+Y + VT++P ++D
Sbjct: 151 VAGDKPFNIRINQFPDLHDEEFKALLINGQRKVSGV-ETATEETSFRYGSVVTNIPATMD 209

Query: 137 WREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVD-CSTDNHGCSG 195
            R+KG VT IKDQG  GSCWA SAVAA+EGI QIT  KL+ LS+Q+LVD    ++ GC G
Sbjct: 210 GRKKGVVTPIKDQGIIGSCWALSAVAAIEGIHQITTSKLMFLSKQKLVDSVKGESEGCIG 269

Query: 196 GLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQA 255
           G ++ AFE+I++  G+ +E  YPY+     C  +KE    A I  YE +P  +++ALL+ 
Sbjct: 270 GYVEDAFEFIVKKGGILSETHYPYKGVN-XCKVEKETHSVAHIKGYEKVPSNNKKALLKV 328

Query: 256 VSNQPVSVCVDASGRAFHFYKSGVLNA-DCGNNCDHGVAVVGFGTAEEENGAKYWLIKNS 314
           V+NQPVSV +D    AF +Y S + NA +CG++ +H VAVVG+G A +  GAKYW +KNS
Sbjct: 329 VANQPVSVYIDVGAHAFKYYSSEIFNARNCGSDPNHVVAVVGYGKALD--GAKYWPVKNS 386

Query: 315 WGETWGESGYI 325
           WG  WG   Y+
Sbjct: 387 WGTEWGGKWYM 397


>gi|281352890|gb|EFB28474.1| hypothetical protein PANDA_008012 [Ailuropoda melanoleuca]
          Length = 328

 Score =  264 bits (674), Expect = 6e-68,   Method: Compositional matrix adjust.
 Identities = 144/318 (45%), Positives = 199/318 (62%), Gaps = 18/318 (5%)

Query: 35  EPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKE---GNRTYKLGTNEF 91
           +P++      W   +G+ YK++ E+  R  I+++NL+++   N E   G  +Y LG N  
Sbjct: 18  DPALDHHWNLWKKTYGKQYKEKNEEVARRLIWEKNLKFVTLHNLEHSMGMHSYDLGMNHL 77

Query: 92  SDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQ 151
            D+T+EE  +L +     VPS   Q  R  T+K  +   +P S+DWREKG VT +K QG 
Sbjct: 78  GDMTSEEVISLMSSLR--VPS---QWPRNVTYKSNSNQKLPDSVDWREKGCVTKVKYQGA 132

Query: 152 CGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD---NHGCSGGLMDKAFEYIIEN 208
           CG+CWAFSAV A+E   ++  GKL+ LS Q LVDCST+   N GC+GG M +AF+YII+N
Sbjct: 133 CGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTEKYGNKGCNGGFMTEAFQYIIDN 192

Query: 209 KGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQ-PVSVCVDA 267
            G+ +EA YPY+  +G C     K  AAT SKY +LP G E  L +AV+N+ PVSV +DA
Sbjct: 193 NGIDSEASYPYKATDGKC-RYDSKNRAATCSKYTELPSGSEDDLKEAVANKGPVSVAIDA 251

Query: 268 SGRAFHFYKSGV-LNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIR 326
              +F  Y+SGV  +  C  N +HGV VVG+G     NG  YWL+KNSWG  +G+ GYIR
Sbjct: 252 RHSSFFLYRSGVYYDPSCTQNVNHGVLVVGYGNL---NGKDYWLVKNSWGLNFGDQGYIR 308

Query: 327 ILRDAG-LCGIATAASYP 343
           + R++G  CGIA+  SYP
Sbjct: 309 MARNSGNHCGIASYPSYP 326


>gi|301767946|ref|XP_002919405.1| PREDICTED: cathepsin S-like [Ailuropoda melanoleuca]
          Length = 340

 Score =  264 bits (674), Expect = 6e-68,   Method: Compositional matrix adjust.
 Identities = 144/318 (45%), Positives = 199/318 (62%), Gaps = 18/318 (5%)

Query: 35  EPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKE---GNRTYKLGTNEF 91
           +P++      W   +G+ YK++ E+  R  I+++NL+++   N E   G  +Y LG N  
Sbjct: 30  DPALDHHWNLWKKTYGKQYKEKNEEVARRLIWEKNLKFVTLHNLEHSMGMHSYDLGMNHL 89

Query: 92  SDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQ 151
            D+T+EE  +L +     VPS   Q  R  T+K  +   +P S+DWREKG VT +K QG 
Sbjct: 90  GDMTSEEVISLMSSLR--VPS---QWPRNVTYKSNSNQKLPDSVDWREKGCVTKVKYQGA 144

Query: 152 CGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD---NHGCSGGLMDKAFEYIIEN 208
           CG+CWAFSAV A+E   ++  GKL+ LS Q LVDCST+   N GC+GG M +AF+YII+N
Sbjct: 145 CGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTEKYGNKGCNGGFMTEAFQYIIDN 204

Query: 209 KGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQ-PVSVCVDA 267
            G+ +EA YPY+  +G C     K  AAT SKY +LP G E  L +AV+N+ PVSV +DA
Sbjct: 205 NGIDSEASYPYKATDGKC-RYDSKNRAATCSKYTELPSGSEDDLKEAVANKGPVSVAIDA 263

Query: 268 SGRAFHFYKSGV-LNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIR 326
              +F  Y+SGV  +  C  N +HGV VVG+G     NG  YWL+KNSWG  +G+ GYIR
Sbjct: 264 RHSSFFLYRSGVYYDPSCTQNVNHGVLVVGYGNL---NGKDYWLVKNSWGLNFGDQGYIR 320

Query: 327 ILRDAG-LCGIATAASYP 343
           + R++G  CGIA+  SYP
Sbjct: 321 MARNSGNHCGIASYPSYP 338


>gi|327263389|ref|XP_003216502.1| PREDICTED: cathepsin L1-like [Anolis carolinensis]
          Length = 339

 Score =  263 bits (673), Expect = 7e-68,   Method: Compositional matrix adjust.
 Identities = 151/340 (44%), Positives = 207/340 (60%), Gaps = 19/340 (5%)

Query: 20  VITCASQVVSGRSMHEPSI---VEKHEQ-WMAQHGRTYKDELEKAMRLNIFKQNLEYIEK 75
           V  CA  +        PS+   ++ H Q W   H + Y  + E+  R  I+++NL+ I+ 
Sbjct: 3   VYLCALALFLEACFAAPSLDSALDDHWQAWKTWHSKKYHQQ-EEGWRRMIWEKNLKMIQL 61

Query: 76  ANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVP 132
            N +   G  +Y+LG N F D+TNEEFR +  GY     S + +  R S F   N   VP
Sbjct: 62  HNLDHSLGKHSYRLGMNHFGDMTNEEFRQVMNGYKH---SKTEKKYRGSEFLEPNFLVVP 118

Query: 133 TSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD--N 190
            S+DWREKG VT +KDQGQCGSCWAFS   ++EG      GKL+ LSEQ LVDCS    N
Sbjct: 119 KSVDWREKGYVTPVKDQGQCGSCWAFSTTGSLEGQHFRKTGKLVSLSEQNLVDCSRPEGN 178

Query: 191 HGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQ 250
            GC+GGLMD+AFEYI +N G+ +E  YPY  ++      K +  AA  + + D+P+G E+
Sbjct: 179 QGCNGGLMDQAFEYIADNGGIDSEESYPYIAKDDEDCLYKSEFNAANDTGFVDVPEGHER 238

Query: 251 ALLQAVSN-QPVSVCVDASGRAFHFYKSGV-LNADCGN-NCDHGVAVVGFG--TAEEENG 305
           AL++AV+   PVSV +DAS   F FY+SG+  + DC +   DHGV VVG+G    +++N 
Sbjct: 239 ALMKAVAAVGPVSVAIDASHSTFQFYESGIYYDPDCSSEELDHGVLVVGYGFEGTDDDNK 298

Query: 306 AKYWLIKNSWGETWGESGYIRILRDA-GLCGIATAASYPV 344
            KYW++KNSW + WG+ GYI + +D    CGIATAASYP+
Sbjct: 299 KKYWIVKNSWSDKWGDKGYILMAKDRNNHCGIATAASYPL 338


>gi|225709022|gb|ACO10357.1| Cathepsin L precursor [Caligus rogercresseyi]
          Length = 332

 Score =  263 bits (673), Expect = 7e-68,   Method: Compositional matrix adjust.
 Identities = 146/316 (46%), Positives = 196/316 (62%), Gaps = 16/316 (5%)

Query: 38  IVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKE---GNRTYKLGTNEFSDL 94
           ++   E W   HG++Y+  +E+ +RL I  +N   I + N E   G  +Y +  N + DL
Sbjct: 23  VLSDWESWKLTHGKSYESSIEEKLRLKIHMENSLKISRHNAEAINGKHSYYMKMNHYGDL 82

Query: 95  TNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGS 154
            + EF A+  GY      V++ S   S    +NV  +PT +DWRE GAVT +K+QGQCGS
Sbjct: 83  LHHEFVAMVNGYEY----VNKTSLGGSFIPSKNVK-LPTHVDWREDGAVTPVKNQGQCGS 137

Query: 155 CWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD--NHGCSGGLMDKAFEYIIENKGLA 212
           CWAFS+  ++EG T    GKLI LSEQ LVDCS    N+GC GGLMD AF YI +NKG+ 
Sbjct: 138 CWAFSSTGSLEGQTFRKTGKLIPLSEQNLVDCSRKYGNNGCEGGLMDFAFTYIRDNKGID 197

Query: 213 TEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSN-QPVSVCVDASGRA 271
           TE  YPY    G C     K  ++ I  + D+ KG E+ LL+AV++  PVSV +DAS  +
Sbjct: 198 TEGSYPYEGVGGRCHYDPSKKGSSDIG-FVDVKKGSEEELLKAVASVGPVSVAIDASHMS 256

Query: 272 FHFYKSGV-LNADCG-NNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILR 329
           F FY  GV   + C   N DHGV VVG+GT +E +G  YWL+KNSW E WG+ GYI++ R
Sbjct: 257 FQFYSHGVYFESKCSPENLDHGVLVVGYGT-DENSGEDYWLVKNSWSENWGDQGYIKMAR 315

Query: 330 D-AGLCGIATAASYPV 344
           +   +CGIA++ASYPV
Sbjct: 316 NKKNMCGIASSASYPV 331


>gi|148224022|ref|NP_001087489.1| cathepsin L2 precursor [Xenopus laevis]
 gi|51258284|gb|AAH80004.1| MGC81823 protein [Xenopus laevis]
          Length = 335

 Score =  263 bits (673), Expect = 8e-68,   Method: Compositional matrix adjust.
 Identities = 144/342 (42%), Positives = 213/342 (62%), Gaps = 18/342 (5%)

Query: 13  MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEY 72
           M + ++    C + V +  +  +P++ +    W   H ++Y  + E+  R  ++++NL  
Sbjct: 1   MALYLVAAALCLTTVFAAPTT-DPALDDHWHLWKNWHKKSYLPK-EEGWRRVLWEKNLRT 58

Query: 73  IEKANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
           IE  N +   G  +Y+LG N+F D+TNEEFR L  GY       +++  + STF   N  
Sbjct: 59  IEFHNLDHSLGKHSYRLGMNQFGDMTNEEFRQLMNGYK------NQKMIKGSTFLAPNNF 112

Query: 130 DVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCS-- 187
           + P ++DWREKG VT +KDQGQCGSCWAFS   A+EG      GKLI LSEQ LVDCS  
Sbjct: 113 EAPKTVDWREKGYVTPVKDQGQCGSCWAFSTTGALEGQHYRKAGKLISLSEQNLVDCSRA 172

Query: 188 TDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKG 247
             N GC+GGLMD+AF+Y+ +N G+ +E  YPY  ++    +      +A  + + D+P G
Sbjct: 173 QGNQGCNGGLMDQAFQYVKDNGGIDSEDSYPYTAKDDQECHYDPNYNSANDTGFVDVPSG 232

Query: 248 DEQALLQAVSN-QPVSVCVDASGRAFHFYKSGV-LNADCGN-NCDHGVAVVGFG-TAEEE 303
            E+ L++AV++  PVSV VDA  ++F FY+SG+  + +C + + DHGV VVG+G   E+ 
Sbjct: 233 SEKDLMKAVASVGPVSVAVDAGHKSFQFYQSGIYYDPECSSEDLDHGVLVVGYGFEGEDV 292

Query: 304 NGAKYWLIKNSWGETWGESGYIRILRDA-GLCGIATAASYPV 344
           +G +YW++KNSW E WG +GYI+I +D    CGIATAASYP+
Sbjct: 293 DGKRYWIVKNSWSEKWGNNGYIKIAKDRHNHCGIATAASYPL 334


>gi|156399477|ref|XP_001638528.1| predicted protein [Nematostella vectensis]
 gi|156225649|gb|EDO46465.1| predicted protein [Nematostella vectensis]
          Length = 325

 Score =  263 bits (673), Expect = 8e-68,   Method: Compositional matrix adjust.
 Identities = 151/344 (43%), Positives = 205/344 (59%), Gaps = 27/344 (7%)

Query: 7   KSFIIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIF 66
           K+F+  + V + L+  C S++   R  H          W   HG+TY  E E+ +R  I+
Sbjct: 2   KAFLACLLVAV-LIAQCFSELSQDRQWHA---------WKDFHGKTYTGE-EEDLRRAIW 50

Query: 67  KQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQ 126
             NLE ++K N E N +YKL  N F+DLT  EF+  + GY       +  S+  STF   
Sbjct: 51  NDNLEIVKKHNAE-NHSYKLDMNHFADLTVTEFKQRFMGYR-----AASNSTGGSTFLPL 104

Query: 127 NVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDC 186
           +   +P  +DWR+KG VT +K+QGQCGSCWAFS+  ++EG      GKL+ LSEQ LVDC
Sbjct: 105 SNVQLPAEVDWRDKGFVTAVKNQGQCGSCWAFSSTGSLEGQHFRKTGKLVSLSEQNLVDC 164

Query: 187 STD--NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDL 244
           S    N+GC GGLMD AF+YI  N G+ TE  YPY   +G C + K  +V AT++ Y D+
Sbjct: 165 SKKYGNNGCEGGLMDYAFKYIKNNDGIDTEQSYPYTARDGQC-HFKPGSVGATVTGYTDV 223

Query: 245 PKGDEQALLQAVSN-QPVSVCVDASGRAFHFYKSGVLN-ADCGNN-CDHGVAVVGFGTAE 301
            +G E  L  AV+   P+SV +DA   +F  YK+GV +  DC +   DHGV  VG+G   
Sbjct: 224 QRGSEGDLQSAVATVGPISVAIDAGHSSFQLYKTGVYSEPDCSSTQLDHGVLAVGYGA-- 281

Query: 302 EENGAKYWLIKNSWGETWGESGYIRILRDA-GLCGIATAASYPV 344
            E+G  YWL+KNSWGE WG +GYI++ R+    CGIAT ASYP+
Sbjct: 282 -EDGKDYWLVKNSWGEGWGMNGYIKMSRNKDNQCGIATQASYPL 324


>gi|440906716|gb|ELR56945.1| Cathepsin S, partial [Bos grunniens mutus]
          Length = 342

 Score =  263 bits (673), Expect = 8e-68,   Method: Compositional matrix adjust.
 Identities = 147/342 (42%), Positives = 210/342 (61%), Gaps = 20/342 (5%)

Query: 11  IPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNL 70
           I M  ++  ++ C+S +   +   +P++    + W   +G+ YK++ E+  R  I+++NL
Sbjct: 10  ITMNWLVWALLLCSSAMA--QVHRDPTLDHHWDLWKKTYGKQYKEKNEEVARRLIWEKNL 67

Query: 71  EYIEKANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQN 127
           + +   N E   G  +Y+LG N   D+T+EE  +L +    P      Q  R  T+K   
Sbjct: 68  KTVTLHNLEHSMGMHSYELGMNHLGDMTSEEVISLMSSLRVP-----SQWPRNVTYKSDP 122

Query: 128 VTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCS 187
              +P S+DWREKG VT +K QG CGSCWAFSAV A+E   ++  GKL+ LS Q LVDCS
Sbjct: 123 NQKLPDSMDWREKGCVTEVKYQGACGSCWAFSAVGALEAQVKLKTGKLVSLSAQNLVDCS 182

Query: 188 T---DNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDL 244
           T    N GC+GG M +AF+YII+N G+ +EA YPY+  +G C     K  AAT S+Y +L
Sbjct: 183 TAKYGNKGCNGGFMTEAFQYIIDNNGIDSEASYPYKAMDGKC-QYDVKNRAATCSRYIEL 241

Query: 245 PKGDEQALLQAVSNQ-PVSVCVDASGRAFHFYKSGV-LNADCGNNCDHGVAVVGFGTAEE 302
           P G E+AL +AV+N+ PVSV +DAS  +F  YK+GV  +  C  N +HGV VVG+G  + 
Sbjct: 242 PFGSEEALKEAVANKGPVSVGIDASHSSFFLYKTGVYYDPSCTQNVNHGVLVVGYGNLD- 300

Query: 303 ENGAKYWLIKNSWGETWGESGYIRILRDAG-LCGIATAASYP 343
             G  YWL+KNSWG  +G+ GYIR+ R++G  CGIA+  SYP
Sbjct: 301 --GKDYWLVKNSWGLHFGDQGYIRMARNSGNHCGIASYPSYP 340


>gi|391336140|ref|XP_003742440.1| PREDICTED: cathepsin L-like [Metaseiulus occidentalis]
          Length = 330

 Score =  263 bits (672), Expect = 8e-68,   Method: Compositional matrix adjust.
 Identities = 146/317 (46%), Positives = 194/317 (61%), Gaps = 17/317 (5%)

Query: 36  PSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRT--YKLGTNEFSD 93
           PS       + + H ++Y+D  E+ +R  IF+ NL  IE+ N+       + LG NEF+D
Sbjct: 22  PSAEPHWNAFKSTHLKSYRDGQEELIRRFIFEDNLHTIEEFNRVNASLAGFTLGVNEFAD 81

Query: 94  LTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCG 153
           +TN EF  +  G          + +  S F+  +V D+P  +DW +KG VT +K+QGQCG
Sbjct: 82  MTNTEFSNMLLGLGG-----RNKIAGDSVFESSHVQDLPAEVDWTQKGYVTEVKNQGQCG 136

Query: 154 SCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD--NHGCSGGLMDKAFEYIIENKGL 211
           SCWAFS   ++EG      GKL+ LSEQ LVDCST   N GC+GGLMD+AF YI +N G+
Sbjct: 137 SCWAFSTTGSLEGQVFKKTGKLVSLSEQNLVDCSTSEGNQGCNGGLMDQAFTYIKKNGGI 196

Query: 212 ATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSN-QPVSVCVDASGR 270
            TEA YPY   +GTC   + K V AT+S + D+  GDE AL +AV+   P+SV +DAS  
Sbjct: 197 DTEAAYPYTGSDGTCRFLENK-VGATVSGFVDVKSGDENALKEAVATVGPISVAIDASSI 255

Query: 271 AFHFYKSGVLNA-DCGNN-CDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRIL 328
            F FY+ GV N   C +   DHGV VVG+GT   E G  YWL+KNSWG +WG  GYI+++
Sbjct: 256 FFQFYRGGVYNPWFCSSTELDHGVLVVGYGT---EGGKDYWLVKNSWGSSWGLKGYIKMV 312

Query: 329 RD-AGLCGIATAASYPV 344
           R+    CGIAT ASYP 
Sbjct: 313 RNKKNRCGIATQASYPT 329


>gi|355681664|gb|AER96818.1| cathepsin S [Mustela putorius furo]
          Length = 338

 Score =  263 bits (672), Expect = 8e-68,   Method: Compositional matrix adjust.
 Identities = 147/329 (44%), Positives = 201/329 (61%), Gaps = 18/329 (5%)

Query: 24  ASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKE---G 80
           AS     +  ++P++      W   +GR Y+++ E+  R  I+++NL+ +   N E   G
Sbjct: 18  ASSYAVAQVQNDPTLDHHWNLWKKTYGRQYQEKNEEVARRLIWEKNLKSVMLHNLEYSMG 77

Query: 81  NRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREK 140
             +Y LG N  +D+T+EE  +L +     VPS   Q     T+K  +   +P S+DWREK
Sbjct: 78  MHSYDLGMNHLADMTSEEVSSLMSSLR--VPS---QWQANVTYKSNSNQKLPDSVDWREK 132

Query: 141 GAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD---NHGCSGGL 197
           G VT +K QG CG+CWAFSAV A+E   ++  G L+ LS Q LVDCST+   N GC+GG 
Sbjct: 133 GCVTEVKYQGACGACWAFSAVGALEAQLKLKTGNLVSLSAQNLVDCSTERYGNKGCNGGF 192

Query: 198 MDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVS 257
           M KAF+YII+N G+ +E  YPY+  +G C     K  AAT SKY +LP G E AL +AV+
Sbjct: 193 MTKAFQYIIDNNGIDSEVSYPYKAMDGNC-RYDSKHRAATCSKYTELPFGSEDALKEAVA 251

Query: 258 NQ-PVSVCVDASGRAFHFYKSGV-LNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSW 315
           N+ PVSV +DA   +F  YKSGV  +  C  N +HGV VVG+G     NG  YWL+KNSW
Sbjct: 252 NKGPVSVAIDAKHSSFFLYKSGVYYDPSCTQNVNHGVLVVGYGNL---NGRDYWLVKNSW 308

Query: 316 GETWGESGYIRILRDAG-LCGIATAASYP 343
           G  +GE GYIR+ R++G  CGIA+  SYP
Sbjct: 309 GLNFGEQGYIRMARNSGNHCGIASYPSYP 337


>gi|2961621|gb|AAC05781.1| cathepsin S [Mus musculus]
          Length = 340

 Score =  263 bits (672), Expect = 8e-68,   Method: Compositional matrix adjust.
 Identities = 145/319 (45%), Positives = 199/319 (62%), Gaps = 19/319 (5%)

Query: 35  EPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKE---GNRTYKLGTNEF 91
           +P++    + W   H + YKD+ E+ +R  I+++NL++I   N E   G  TY++G N+ 
Sbjct: 29  DPTLDYHWDLWKKTHEKEYKDKNEEEVRRLIWEKNLKFIMIHNLEYSMGMHTYQVGMNDM 88

Query: 92  SDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQ 151
            D+TNEE              +SRQS +  TF+  +   +P ++DWREKG VT +K QG 
Sbjct: 89  GDMTNEEISC-----RMGALRISRQSPKTVTFRSYSNRTLPDTVDWREKGCVTEVKYQGS 143

Query: 152 CGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD----NHGCSGGLMDKAFEYIIE 207
           CG+CWAFSAV A+EG  ++  GKLI LS Q LVDCS +    N GC GG M +AF+YII+
Sbjct: 144 CGACWAFSAVGALEGQLKLKTGKLISLSAQNLVDCSNEEKYGNKGCGGGYMTEAFQYIID 203

Query: 208 NKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQ-PVSVCVD 266
           N G+  +A YPY+  +  C +   K  AAT S+Y  LP GDE AL +AV+ + PVSV +D
Sbjct: 204 NGGIEADASYPYKATDEKC-HYNSKNRAATCSRYIQLPFGDEDALKEAVATKGPVSVGID 262

Query: 267 ASGRAFHFYKSGVL-NADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYI 325
           AS  +F FYKSGV  +  C  N +HGV VVG+GT +   G  YWL+KNSWG  +G+ GYI
Sbjct: 263 ASHSSFFFYKSGVYDDPSCTGNVNHGVLVVGYGTLD---GKDYWLVKNSWGLNFGDQGYI 319

Query: 326 RILR-DAGLCGIATAASYP 343
           R+ R +   CGIA+  SYP
Sbjct: 320 RMARNNKNHCGIASYCSYP 338


>gi|6650705|gb|AAF21977.1|AF115280_1 thiolproteinase SmTP1 [Sarcocystis muris]
          Length = 394

 Score =  263 bits (672), Expect = 8e-68,   Method: Compositional matrix adjust.
 Identities = 136/309 (44%), Positives = 191/309 (61%), Gaps = 12/309 (3%)

Query: 44  QWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALY 103
           Q+   H + Y  E E+  R  IFK NL YI   N +G  +Y L  N+F DLT EEFR  Y
Sbjct: 91  QFQRDHNKFYATEEERLKRYAIFKNNLTYIHNHNMQG-YSYVLKMNKFGDLTLEEFRQRY 149

Query: 104 TGYNRP-VPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVA 162
            GY +P + +  R+    +T +     D+PT +DWR++G VT +KDQG CGSCWAFSA  
Sbjct: 150 LGYKKPDLRTPPREVD--TTLESVEDNDIPTHVDWRQRGCVTSVKDQGDCGSCWAFSATG 207

Query: 163 AVEGITQITRGKLIELSEQQLVDCS--TDNHGCSGGLMDKAFEYIIENKGLATEADYPYR 220
           A+EG+     GKL+ LS+QQLVDCS    N GC GG M++AFEY++EN G+ +  +YPY 
Sbjct: 208 AMEGVYCAKTGKLVNLSQQQLVDCSRFLGNQGCDGGRMEEAFEYVVENGGICSGENYPYM 267

Query: 221 HEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVS-NQPVSVCVDASGRAFHFYKSGV 279
            ++G C + +  +V ATI+ Y  +P+  E+++  A++   PVSV + A+  AF FY  G+
Sbjct: 268 RKDGVCKSSQCTSV-ATITGYRSVPRRSEKSMKTALALRSPVSVAIQANQAAFQFYYDGI 326

Query: 280 LNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD---AGLCGI 336
            +A CG N DHGV +VG+ +AE      YW++KNSWG  WG+ GY+ +      AG CG+
Sbjct: 327 FDAPCGTNLDHGVLLVGY-SAETAGQGDYWIMKNSWGAAWGKGGYMLMAMHKGPAGQCGV 385

Query: 337 ATAASYPVA 345
               S+PVA
Sbjct: 386 LLDGSFPVA 394


>gi|3850787|emb|CAA05360.1| cathepsin S [Mus musculus]
          Length = 330

 Score =  263 bits (672), Expect = 8e-68,   Method: Compositional matrix adjust.
 Identities = 146/325 (44%), Positives = 200/325 (61%), Gaps = 19/325 (5%)

Query: 29  SGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKE---GNRTYK 85
           +G +   P++    + W   H + YKD+ E+ +R  I+++NL++I   N E   G  TY+
Sbjct: 13  NGATAERPTLDHHWDLWKKTHEKEYKDKNEEEVRRLIWEKNLKFIMIHNLEYSMGMHTYQ 72

Query: 86  LGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTH 145
           +G N+  D+TNEE          P     RQS +  TF+  +   +P ++DWREKG VT 
Sbjct: 73  VGMNDMGDMTNEEILCRMGALRIP-----RQSPKTVTFRSYSNRTLPDTVDWREKGCVTE 127

Query: 146 IKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD----NHGCSGGLMDKA 201
           +K QG CG+CWAFSAV A+EG  ++  GKLI LS Q LVDCS +    N GC GG M +A
Sbjct: 128 VKYQGSCGACWAFSAVGALEGQLKLKTGKLISLSAQNLVDCSNEEKYGNKGCGGGYMTEA 187

Query: 202 FEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQ-P 260
           F+YII+N G+  +A YPY+  +  C +   K  AAT S+Y  LP GDE AL +AV+ + P
Sbjct: 188 FQYIIDNGGIEADASYPYKAMDEKC-HYNSKNRAATCSRYIQLPFGDEDALKEAVATKGP 246

Query: 261 VSVCVDASGRAFHFYKSGVL-NADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETW 319
           VSV +DAS  +F FYKSGV  +  C  N +HGV VVG+GT +   G  YWL+KNSWG  +
Sbjct: 247 VSVGIDASHSSFFFYKSGVYDDPSCTGNVNHGVLVVGYGTLD---GKDYWLVKNSWGLNF 303

Query: 320 GESGYIRILR-DAGLCGIATAASYP 343
           G+ GYIR+ R +   CGIA+  SYP
Sbjct: 304 GDQGYIRMARNNKNHCGIASYCSYP 328


>gi|37786769|gb|AAO64471.1| cathepsin L precursor [Fundulus heteroclitus]
          Length = 337

 Score =  263 bits (672), Expect = 8e-68,   Method: Compositional matrix adjust.
 Identities = 143/342 (41%), Positives = 209/342 (61%), Gaps = 16/342 (4%)

Query: 13  MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEY 72
           M  + +L +  +S V+S  S+ +P + +    W + H + Y    E   RL ++++NL+ 
Sbjct: 1   MLPVAVLTLCLSSAVLSAPSL-DPQLDQHWNLWKSWHSKNYHQREEGWRRL-VWEKNLKK 58

Query: 73  IEKANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
           IE  N E   G  +Y+LG N F D+T+EEF+ +  GY       + +  + S F   N  
Sbjct: 59  IELHNLEHSMGKHSYRLGMNHFGDMTHEEFKQIMNGYKHK----AERKFKGSLFLEPNFL 114

Query: 130 DVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD 189
           + P S+DWREKG VT +KDQG+CGSCWAFS   A+EG      GKL+ LS Q LV+CS  
Sbjct: 115 EAPRSVDWREKGYVTPVKDQGECGSCWAFSTTGALEGQEFTRTGKLVSLSGQNLVECSRP 174

Query: 190 --NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKG 247
             N GC+GGLMD+AF+Y+ +N+GL +E  YPY   +    +   K  AA  + + D+P G
Sbjct: 175 EGNEGCNGGLMDQAFQYVKDNQGLDSEDSYPYLGTDDQPCHYDPKFSAANDTGFVDIPSG 234

Query: 248 DEQALLQAVSN-QPVSVCVDASGRAFHFYKSGV-LNADCGN-NCDHGVAVVGFG-TAEEE 303
           +E+AL++AV++  PVSV +DA   +F FY+SG+    +C +   DHGV  VG+G   E+ 
Sbjct: 235 NERALMKAVASVGPVSVAIDAGHESFQFYQSGIYYEKECSSEELDHGVLAVGYGFQGEDV 294

Query: 304 NGAKYWLIKNSWGETWGESGYIRILRD-AGLCGIATAASYPV 344
           +G K+W++KNSW E WG+ GYI + +D    CGIATAASYP+
Sbjct: 295 DGKKFWIVKNSWSENWGDKGYIYMAKDRKNHCGIATAASYPL 336


>gi|443698586|gb|ELT98517.1| hypothetical protein CAPTEDRAFT_128252 [Capitella teleta]
          Length = 324

 Score =  263 bits (672), Expect = 9e-68,   Method: Compositional matrix adjust.
 Identities = 149/335 (44%), Positives = 204/335 (60%), Gaps = 21/335 (6%)

Query: 19  LVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANK 78
           ++  C +  ++   + + ++ E    +   H +TY  E E  MR  I++++L  I + N 
Sbjct: 1   MLACCIAATLASPLVFDEALDEMWTLFKTTHSKTYATEAED-MRRFIWERHLNMINQHNI 59

Query: 79  E---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSI 135
           E   G  T+ LG NE+ DLT  E+ A+ +GY     SV      P   +      VP ++
Sbjct: 60  EADLGKHTFSLGMNEYGDLTQHEYAAM-SGYKMAKSSVGSSFLEPENLQ------VPKTV 112

Query: 136 DWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD--NHGC 193
           DWREKG VT +K+QGQCGSCWAFS+  ++EG      G+L  +SEQ LVDCS D  N GC
Sbjct: 113 DWREKGYVTPVKNQGQCGSCWAFSSTGSLEGQVFRKTGRLPSISEQNLVDCSRDEGNMGC 172

Query: 194 SGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALL 253
           SGGLMD AF YI +N G+ +E  YPY   +G C  +K  +V  T S + D+P GDE AL 
Sbjct: 173 SGGLMDNAFTYIKKNMGIDSEKSYPYEAVDGECRYKKSDSV-TTDSGFVDIPHGDETALR 231

Query: 254 QAVSN-QPVSVCVDASGRAFHFYKSGVLN-ADCGNN-CDHGVAVVGFGTAEEENGAKYWL 310
            AV++  PVSV +DAS  +F FYK+GV   A+C +   DHGV VVG+G    ENG  YWL
Sbjct: 232 TAVASVGPVSVAIDASHTSFQFYKTGVYTEANCSSTQLDHGVLVVGYGV---ENGQDYWL 288

Query: 311 IKNSWGETWGESGYIRILRDAG-LCGIATAASYPV 344
           +KNSWG +WGE+GYI++ R+ G  CGIA+ ASYP+
Sbjct: 289 VKNSWGASWGEAGYIKLARNHGNQCGIASQASYPL 323


>gi|291383517|ref|XP_002708299.1| PREDICTED: cathepsin L1 [Oryctolagus cuniculus]
          Length = 333

 Score =  263 bits (672), Expect = 9e-68,   Method: Compositional matrix adjust.
 Identities = 149/343 (43%), Positives = 203/343 (59%), Gaps = 23/343 (6%)

Query: 12  PMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLE 71
           P F++  +    AS +       + ++  +  QW A H R Y    E+  R  ++++N+ 
Sbjct: 3   PSFLLAAVCWGIASAIPK----FDQNLDTQWYQWKATHKRLYGLN-EEGWRRAVWEKNMR 57

Query: 72  YIEKANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNV 128
            IE  N E   G   + +G N + D+TNEEFR +  G+        +    P   +Y   
Sbjct: 58  MIELHNGEYSQGKHGFTMGMNAYGDMTNEEFRQVMNGFQNQKHKKGKMFRDPLLLQY--- 114

Query: 129 TDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCS- 187
              P S+DWREKG VT +K+QGQCGSCWAFSA  A+EG      GKLI LSEQ LVDCS 
Sbjct: 115 ---PKSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFQKTGKLISLSEQNLVDCSH 171

Query: 188 -TDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPK 246
              N GC+GGLMD AF+Y+ +N GL +E  YPY   +GTC  + E +VA   + + D+P 
Sbjct: 172 PQGNQGCNGGLMDYAFQYVKDNSGLDSEESYPYEGMDGTCKYKPECSVAND-TGFVDIP- 229

Query: 247 GDEQALLQAVSN-QPVSVCVDASGRAFHFYKSGV-LNADCGN-NCDHGVAVVGFG-TAEE 302
           G E+ALL+AV+   P+S  +DA   +F FYKSG+  + DC + + DHG+ VVG+G     
Sbjct: 230 GHEKALLRAVATVGPISAAIDAGHMSFQFYKSGIYYDPDCSSKDLDHGILVVGYGFEGTN 289

Query: 303 ENGAKYWLIKNSWGETWGESGYIRILRDA-GLCGIATAASYPV 344
            N  KYWL+KNSWG TWG+ GY++I+RD    CGIATAASYP 
Sbjct: 290 SNATKYWLVKNSWGTTWGDEGYVKIIRDKDNHCGIATAASYPT 332


>gi|2746723|gb|AAB94925.1| cathepsin S precursor [Mus musculus]
          Length = 340

 Score =  263 bits (672), Expect = 1e-67,   Method: Compositional matrix adjust.
 Identities = 145/319 (45%), Positives = 199/319 (62%), Gaps = 19/319 (5%)

Query: 35  EPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKE---GNRTYKLGTNEF 91
           +P++    + W   H + YKD+ E+ +R  I+++NL++I   N E   G  TY++G N+ 
Sbjct: 29  DPTLDYHWDLWKKTHEKEYKDKNEEEVRRLIWEKNLKFIMIHNLEYSMGMHTYQVGMNDM 88

Query: 92  SDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQ 151
            D+TNEE              +SRQS +  TF+  +   +P ++DWREKG VT +K QG 
Sbjct: 89  GDMTNEEISC-----RMGALRISRQSPKTVTFRSYSNRTLPDTVDWREKGCVTEVKYQGS 143

Query: 152 CGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD----NHGCSGGLMDKAFEYIIE 207
           CG+CWAFSAV A+EG  ++  GKLI LS Q LVDCS +    N GC GG M +AF+YII+
Sbjct: 144 CGACWAFSAVGALEGQLKLKTGKLISLSAQNLVDCSNEEKYGNKGCGGGYMTEAFQYIID 203

Query: 208 NKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQ-PVSVCVD 266
           N G+  +A YPY+  +  C +   K  AAT S+Y  LP GDE AL +AV+ + PVSV +D
Sbjct: 204 NGGIEADASYPYKAMDEKC-HYNSKNRAATCSRYIQLPFGDEDALKEAVATKGPVSVGID 262

Query: 267 ASGRAFHFYKSGVL-NADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYI 325
           AS  +F FYKSGV  +  C  N +HGV VVG+GT +   G  YWL+KNSWG  +G+ GYI
Sbjct: 263 ASHSSFFFYKSGVYDDPSCTGNVNHGVLVVGYGTLD---GKDYWLVKNSWGLNFGDQGYI 319

Query: 326 RILR-DAGLCGIATAASYP 343
           R+ R +   CGIA+  SYP
Sbjct: 320 RMARNNKNHCGIASYCSYP 338


>gi|62510453|sp|Q8HY82.1|CATS_SAIBB RecName: Full=Cathepsin S; Flags: Precursor
 gi|27497536|gb|AAO13008.1| cathepsin S preproprotein [Saimiri boliviensis]
          Length = 330

 Score =  263 bits (672), Expect = 1e-67,   Method: Compositional matrix adjust.
 Identities = 147/340 (43%), Positives = 207/340 (60%), Gaps = 21/340 (6%)

Query: 13  MFVIIILVITCASQVVSGRSMHEPSIVEKH-EQWMAQHGRTYKDELEKAMRLNIFKQNLE 71
           M  ++ ++  C+S V     +H+   ++ H   W   +G+ YK++ E+A+R  I+++NL+
Sbjct: 1   MKQLVCVLFVCSSAVTQ---LHKDPTLDHHWNLWKKTYGKQYKEKNEEAVRRLIWEKNLK 57

Query: 72  YIEKANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNV 128
           ++   N E   G  +Y LG N   D+T+EE  +L +    P      Q  R  T+K    
Sbjct: 58  FVMLHNLEHSMGMHSYDLGMNHLGDMTSEEVMSLMSSLRVP-----NQWQRNITYKSNPN 112

Query: 129 TDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST 188
             +P S+DWREKG VT +K QG CG+CWAFSAV A+E   ++  GKL+ LS Q LVDCS 
Sbjct: 113 QMLPDSVDWREKGCVTEVKYQGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSE 172

Query: 189 D--NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPK 246
              N GC+GG M +AF+YII+NKG+ +EA YPY+  +  C     K  AAT SKY +LP 
Sbjct: 173 KYGNKGCNGGFMTEAFQYIIDNKGIDSEASYPYKATDQKCQ-YDSKYRAATCSKYTELPY 231

Query: 247 GDEQALLQAVSNQ-PVSVCVDASGRAFHFYKSGV-LNADCGNNCDHGVAVVGFGTAEEEN 304
           G E  L +AV+N+ PV V VDAS  +F  Y+SGV  +  C    +HGV V+G+G   + N
Sbjct: 232 GREDVLKEAVANKGPVCVGVDASHPSFFLYRSGVYYDPACTQKVNHGVLVIGYG---DLN 288

Query: 305 GAKYWLIKNSWGETWGESGYIRILRDAG-LCGIATAASYP 343
           G +YWL+KNSWG  +GE GYIR+ R+ G  CGIA+  SYP
Sbjct: 289 GKEYWLVKNSWGSNFGEQGYIRMARNKGNHCGIASYPSYP 328


>gi|354622947|ref|NP_001002938.2| cathepsin S precursor [Canis lupus familiaris]
          Length = 339

 Score =  263 bits (671), Expect = 1e-67,   Method: Compositional matrix adjust.
 Identities = 151/340 (44%), Positives = 207/340 (60%), Gaps = 20/340 (5%)

Query: 13  MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEY 72
           M  ++ L+  C+  V   +   +P++      W   + + YK+E E+  R  I+++NL++
Sbjct: 9   MKWLVGLLPLCSYAVA--QVHKDPTLDHHWNLWKKTYSKQYKEENEEVARRLIWEKNLKF 66

Query: 73  IEKANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
           +   N E   G  +Y LG N   D+T EE  +L  G  R VPS   Q  R  T++  +  
Sbjct: 67  VMLHNLEHSMGMHSYDLGMNHLGDMTGEEVISL-MGSLR-VPS---QWQRNVTYRSNSNQ 121

Query: 130 DVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD 189
            +P S+DWREKG VT +K QG CG+CWAFSAV A+E   ++  GKL+ LS Q LVDCST+
Sbjct: 122 KLPDSVDWREKGCVTEVKYQGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTE 181

Query: 190 ---NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPK 246
              N GC+GG M  AF+YII+N G+ +EA YPY+   G C    +K  AAT SKY +LP 
Sbjct: 182 KYGNKGCNGGFMTTAFQYIIDNNGIDSEASYPYKAMNGKCRYDSKKR-AATCSKYTELPF 240

Query: 247 GDEQALLQAVSNQ-PVSVCVDASGRAFHFYKSGV-LNADCGNNCDHGVAVVGFGTAEEEN 304
           G E AL +AV+N+ PVSV +DAS  +F  Y+SGV     C  N +HGV VVG+G     N
Sbjct: 241 GSEDALKEAVANKGPVSVAIDASHYSFFLYRSGVYYEPSCTQNVNHGVLVVGYGNL---N 297

Query: 305 GAKYWLIKNSWGETWGESGYIRILRDAG-LCGIATAASYP 343
           G  YWL+KNSWG  +G+ GYIR+ R++G  CGIA+  SYP
Sbjct: 298 GKDYWLVKNSWGLNFGDQGYIRMARNSGNHCGIASYPSYP 337


>gi|395856029|ref|XP_003800445.1| PREDICTED: cathepsin S [Otolemur garnettii]
          Length = 331

 Score =  263 bits (671), Expect = 1e-67,   Method: Compositional matrix adjust.
 Identities = 147/338 (43%), Positives = 205/338 (60%), Gaps = 21/338 (6%)

Query: 15  VIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIE 74
           ++  L++ C++     R   +P++      W   +G+ Y ++ E+  R  I+++NL+++ 
Sbjct: 4   LVWTLLVCCSAMAQLHR---DPALDHHWHLWKKTYGKQYTEKNEETERRLIWEKNLKFVM 60

Query: 75  KANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDV 131
             N E   G  +Y LG N   D+T+EE  +L T    P     RQS R  T+K      +
Sbjct: 61  LHNLEHSMGMHSYDLGMNHLGDMTSEEVVSLMTCLKVP-----RQSQRNVTYKSSPNQKL 115

Query: 132 PTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD-- 189
           P S+DWREKG VT +K QG CGSCWAFSAV A+E   ++T GKL+ LS Q LVDCST+  
Sbjct: 116 PDSLDWREKGCVTEVKYQGSCGSCWAFSAVGALEAQLKLTTGKLVSLSAQNLVDCSTEKY 175

Query: 190 -NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGD 248
            N GC GG M +AF+YII+N G+ +EA YPY+  +  C     K  AAT SKY +LP G 
Sbjct: 176 RNEGCHGGFMTEAFQYIIDNNGIDSEASYPYKAMDEKCQ-YDSKNRAATCSKYTELPFGS 234

Query: 249 EQALLQAVSNQ-PVSVCVDASGRAFHFYKSGV-LNADCGNNCDHGVAVVGFGTAEEENGA 306
           E+AL +AV+++ PVSV +DAS  +F  Y+SGV     C    +HGV VVG+G     NG 
Sbjct: 235 EEALKEAVASKGPVSVAIDASHSSFFLYRSGVYYEPACTQVVNHGVLVVGYGNL---NGN 291

Query: 307 KYWLIKNSWGETWGESGYIRILRD-AGLCGIATAASYP 343
            YWL+KNSWG  +G+ GYIR+ R+    CGIA+ +SYP
Sbjct: 292 DYWLVKNSWGLYFGDKGYIRMARNRENHCGIASYSSYP 329


>gi|189525870|ref|XP_001923796.1| PREDICTED: cathepsin L1 [Danio rerio]
          Length = 335

 Score =  263 bits (671), Expect = 1e-67,   Method: Compositional matrix adjust.
 Identities = 144/340 (42%), Positives = 204/340 (60%), Gaps = 15/340 (4%)

Query: 14  FVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYI 73
            +  +LV    S V +  S+ +  + +    W +QHG++Y +++E   R+ I+++NL  I
Sbjct: 1   MMFALLVTLYISAVFAAPSI-DIQLDDHWNSWKSQHGKSYHEDVEVGRRM-IWEENLRKI 58

Query: 74  EKANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTD 130
           E+ N E   GN T+K+G N+F D+TNEEFR    GY       +R S  P  F       
Sbjct: 59  EQHNFEYSLGNHTFKMGMNQFGDMTNEEFRQAMNGYKH---DPNRTSQGP-LFMEPKFFA 114

Query: 131 VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD- 189
            P  +DWR++G VT +KDQ QCGSCW+FS+  A+EG      GKLI +SEQ LVDCS   
Sbjct: 115 APQQVDWRQRGYVTPVKDQKQCGSCWSFSSTGALEGQLFRKTGKLISMSEQNLVDCSRPH 174

Query: 190 -NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGD 248
            N GC+GGLMD+AF+Y+ ENKGL +E  YPY   +        +   A I+ + D+PKG+
Sbjct: 175 GNQGCNGGLMDQAFQYVKENKGLDSEQSYPYLARDDLPCRYDPRFNVAKITGFVDIPKGN 234

Query: 249 EQALLQAVSN-QPVSVCVDASGRAFHFYKSGV-LNADCGNNCDHGVAVVGFG-TAEEENG 305
           E AL+ AV+   PVSV +DAS ++  FY+SG+     C +  DH V VVG+G    +  G
Sbjct: 235 ELALMNAVAAVGPVSVAIDASHQSLQFYQSGIYYERACTSQLDHAVLVVGYGYQGADVAG 294

Query: 306 AKYWLIKNSWGETWGESGYIRILRDA-GLCGIATAASYPV 344
            +YW++KNSW + WG+ GYI + +D    CGIAT ASYP+
Sbjct: 295 NRYWIVKNSWSDKWGDKGYIYMAKDKNNHCGIATMASYPL 334


>gi|38147395|gb|AAR12010.1| cathepsin L-like proteinase [Triatoma infestans]
          Length = 328

 Score =  263 bits (671), Expect = 1e-67,   Method: Compositional matrix adjust.
 Identities = 143/314 (45%), Positives = 195/314 (62%), Gaps = 22/314 (7%)

Query: 43  EQWMA---QHGRTYKDELEKAMRLNIFKQNLEYIEKANKE---GNRTYKLGTNEFSDLTN 96
           E+W+A   Q G++YK+  E+  R+N++K+N   I++ NK    G  +YKL  N F DL  
Sbjct: 24  EEWLAFKAQFGKSYKNSFEELFRMNVYKENQRKIDEHNKRYENGEVSYKLKMNHFGDLMQ 83

Query: 97  EEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCW 156
            EF+AL       +   ++Q +    F+      +P  +DWR+KGAVT +KD GQCGSCW
Sbjct: 84  HEFKAL-----NKLKRSAKQQNSGEVFRATG-GKLPAKVDWRQKGAVTPVKDPGQCGSCW 137

Query: 157 AFSAVAAVEGITQITRGKLIELSEQQLVDCSTD--NHGCSGGLMDKAFEYIIENKGLATE 214
           AFS+  ++ G   +   KL+ LSEQQLVDCS +  N GC GG+M +AF+YI  N G+ TE
Sbjct: 138 AFSSTGSLGGQLFLKNKKLVSLSEQQLVDCSGNYGNDGCDGGIMVQAFQYIKGNGGIDTE 197

Query: 215 ADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSN-QPVSVCVDASGRAFH 273
             YPY  E+  C   K K+VA T   Y D+ +GDE AL +AV+   P+SV +DA   +F 
Sbjct: 198 GSYPYEAEDDKC-RYKTKSVAGTDKGYVDIAQGDENALKEAVAEIGPISVAIDAGNLSFQ 256

Query: 274 FYKSGVLNAD-CGNN-CDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRDA 331
           FY  G+ +   C N   DHGV VVG+GT   ENG  YWL+KNSWG +WGE+GYI+I R+ 
Sbjct: 257 FYSEGIYDEPFCSNTELDHGVLVVGYGT---ENGQDYWLVKNSWGPSWGENGYIKIARNH 313

Query: 332 -GLCGIATAASYPV 344
              CGIA+ ASYP+
Sbjct: 314 NNHCGIASMASYPI 327


>gi|307175095|gb|EFN65237.1| Cathepsin L [Camponotus floridanus]
          Length = 372

 Score =  263 bits (671), Expect = 1e-67,   Method: Compositional matrix adjust.
 Identities = 147/332 (44%), Positives = 196/332 (59%), Gaps = 23/332 (6%)

Query: 23  CASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNR 82
           C   V +G S H              H + YK  +E+  R+ IF  N   I + N++   
Sbjct: 55  CCGSVFAGSSCHR-----------THHKKVYKSPIEEGYRMKIFLDNKRKIVEHNRKYEM 103

Query: 83  ---TYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWRE 139
               YKLG N++ D+ + E      G+N+ V +VS +    +TF      ++P S+DWR+
Sbjct: 104 KEVNYKLGMNKYGDMLHHELINTLNGFNKSV-TVSEEQLIGATFIEPANVELPKSVDWRK 162

Query: 140 KGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD--NHGCSGGL 197
           KGAVT IKDQGQCGSCWAFS+  A+EG      G L+ LSEQ L+DCS    N+GC+GGL
Sbjct: 163 KGAVTAIKDQGQCGSCWAFSSTGALEGQHFRQSGVLVSLSEQNLIDCSGKYGNNGCNGGL 222

Query: 198 MDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVS 257
           MD AF YI ENKGL TE  YPY  E   C    + + A+ +  + D+P+GDE  L  AV+
Sbjct: 223 MDYAFRYIKENKGLDTEKSYPYEAENDQCRYNPKNSGASDVG-FVDIPEGDEDKLKAAVA 281

Query: 258 N-QPVSVCVDASGRAFHFYKSGV-LNADCGN-NCDHGVAVVGFGTAEEENGAKYWLIKNS 314
              P+SV +DAS  +FHFY  GV    +C   N DHGV +VG+GT +   G  YWL+KNS
Sbjct: 282 TIGPISVAIDASHESFHFYSEGVYYEPECSPANLDHGVLIVGYGT-DSGTGEDYWLVKNS 340

Query: 315 WGETWGESGYIRILRDA-GLCGIATAASYPVA 345
           WGETWGE GYI++ R+    CGIA++ASYP+ 
Sbjct: 341 WGETWGEKGYIKMARNKENHCGIASSASYPLV 372


>gi|380014284|ref|XP_003691169.1| PREDICTED: cathepsin L-like [Apis florea]
          Length = 345

 Score =  263 bits (671), Expect = 1e-67,   Method: Compositional matrix adjust.
 Identities = 148/347 (42%), Positives = 207/347 (59%), Gaps = 24/347 (6%)

Query: 13  MFVIIILVITCASQVVSGRSMHEPSIVEK-HEQWMA---QHGRTYKDELEKAMRLNIFKQ 68
           M + +IL IT  + V      H  S  E  +++WM    +H + YK ++E+  R+ IF  
Sbjct: 1   MKLFLILFITIFATV------HAVSFFELVNQEWMTFKMEHKKAYKSDVEERFRMKIFMD 54

Query: 69  NLEYIEKANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRP--STF 123
           N   I K N        +YKL  N++ D+ + EF  +  G+N+ + +  R    P  ++F
Sbjct: 55  NKHKIAKHNSNYEMKKVSYKLKMNKYGDMLHHEFVNILNGFNKSINTQLRSERMPIGASF 114

Query: 124 KYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQL 183
                  +P  +DWR++GAVT +KDQG CGSCW+FSA  A+EG      G L+ LSEQ L
Sbjct: 115 IEPANVALPKKVDWRKEGAVTPVKDQGHCGSCWSFSATGALEGQHFRRTGVLVSLSEQNL 174

Query: 184 VDCSTD--NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKY 241
           +DCS    N+GC+GGLMD+AF+YI +NKGL TEA YPY  E   C      + A  +  Y
Sbjct: 175 IDCSGKYGNNGCNGGLMDQAFQYIKDNKGLDTEASYPYEAENDKCRYNPANSGAIDVG-Y 233

Query: 242 EDLPKGDEQALLQAVSN-QPVSVCVDASGRAFHFYKSGV-LNADCGN-NCDHGVAVVGFG 298
            D+P G+E+ L  AV+   PVSV +DAS ++F FY  GV    +C +   DHGV V+G+G
Sbjct: 234 IDIPTGNEKLLKAAVATIGPVSVAIDASHQSFQFYSEGVYYEPECSSEELDHGVLVIGYG 293

Query: 299 TAEEENGAKYWLIKNSWGETWGESGYIRILRDA-GLCGIATAASYPV 344
           T   ENG  YWL+KNSWGETWG +GYI++ R+    CGIA++ASYP+
Sbjct: 294 T--NENGEDYWLVKNSWGETWGNNGYIKMARNKLNHCGIASSASYPL 338


>gi|348687948|gb|EGZ27762.1| papain-like cysteine protease C1 [Phytophthora sojae]
          Length = 533

 Score =  262 bits (670), Expect = 1e-67,   Method: Compositional matrix adjust.
 Identities = 138/307 (44%), Positives = 193/307 (62%), Gaps = 13/307 (4%)

Query: 45  WMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRT-YKLGTNEFSDLTNEEFRALY 103
           WM+ HG T+ D LE A RL  +  N  YI + N E   T  KLG N FS ++ +EF+   
Sbjct: 31  WMSAHGVTFSDALEFARRLENYIANDMYILEHNAENAWTGVKLGHNAFSHMSFDEFKFKM 90

Query: 104 TGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAA 163
           TG   P   + ++ +      + +V +VP+++DW +KG VT +K+QG CGSCWAFS   A
Sbjct: 91  TGLVLPEGYLEQRLASRVDGLWSDV-EVPSAVDWVDKGGVTPVKNQGMCGSCWAFSTTGA 149

Query: 164 VEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGLATEADYPYRHE 222
           VEG T ++ GKL+ LSEQ+LVDC  + + GC+GGLMD AF++I ++ G+ +E DY Y+ +
Sbjct: 150 VEGATFVSSGKLLSLSEQELVDCDHNGDMGCNGGLMDHAFQWIEDHGGICSEDDYEYKAK 209

Query: 223 EGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNA 282
              C   ++      ++ ++D+   DE AL  AV+ QPVSV ++A  +AF FYKSGV N 
Sbjct: 210 AQVC---RKCDSVVKVTGFQDVNPQDEHALKVAVAQQPVSVAIEADQKAFQFYKSGVFNL 266

Query: 283 DCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD----AGLCGIAT 338
            CG   DHGV  VG+G    +NG K+W +KNSWG +WGE GYIR+ R+    AG CGIA+
Sbjct: 267 TCGTRLDHGVLAVGYGN---DNGQKFWKVKNSWGASWGEQGYIRLAREENGPAGQCGIAS 323

Query: 339 AASYPVA 345
             SYP A
Sbjct: 324 VPSYPFA 330


>gi|62510452|sp|Q8HY81.1|CATS_CANFA RecName: Full=Cathepsin S; Flags: Precursor
 gi|27497538|gb|AAO13009.1| cathepsin S preproprotein [Canis lupus familiaris]
          Length = 331

 Score =  262 bits (670), Expect = 1e-67,   Method: Compositional matrix adjust.
 Identities = 151/340 (44%), Positives = 207/340 (60%), Gaps = 20/340 (5%)

Query: 13  MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEY 72
           M  ++ L+  C+  V   +   +P++      W   + + YK+E E+  R  I+++NL++
Sbjct: 1   MKWLVGLLPLCSYAVA--QVHKDPTLDHHWNLWKKTYSKQYKEENEEVARRLIWEKNLKF 58

Query: 73  IEKANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
           +   N E   G  +Y LG N   D+T EE  +L  G  R VPS   Q  R  T++  +  
Sbjct: 59  VMLHNLEHSMGMHSYDLGMNHLGDMTGEEVISL-MGSLR-VPS---QWQRNVTYRSNSNQ 113

Query: 130 DVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD 189
            +P S+DWREKG VT +K QG CG+CWAFSAV A+E   ++  GKL+ LS Q LVDCST+
Sbjct: 114 KLPDSVDWREKGCVTEVKYQGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTE 173

Query: 190 ---NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPK 246
              N GC+GG M  AF+YII+N G+ +EA YPY+   G C    +K  AAT SKY +LP 
Sbjct: 174 KYGNKGCNGGFMTTAFQYIIDNNGIDSEASYPYKAMNGKCRYDSKKR-AATCSKYTELPF 232

Query: 247 GDEQALLQAVSNQ-PVSVCVDASGRAFHFYKSGV-LNADCGNNCDHGVAVVGFGTAEEEN 304
           G E AL +AV+N+ PVSV +DAS  +F  Y+SGV     C  N +HGV VVG+G     N
Sbjct: 233 GSEDALKEAVANKGPVSVAIDASHYSFFLYRSGVYYEPSCTQNVNHGVLVVGYGNL---N 289

Query: 305 GAKYWLIKNSWGETWGESGYIRILRDAG-LCGIATAASYP 343
           G  YWL+KNSWG  +G+ GYIR+ R++G  CGIA+  SYP
Sbjct: 290 GKDYWLVKNSWGLNFGDQGYIRMARNSGNHCGIASYPSYP 329


>gi|196002275|ref|XP_002111005.1| expressed hypothetical protein [Trichoplax adhaerens]
 gi|190586956|gb|EDV27009.1| expressed hypothetical protein [Trichoplax adhaerens]
          Length = 325

 Score =  262 bits (670), Expect = 1e-67,   Method: Compositional matrix adjust.
 Identities = 148/338 (43%), Positives = 203/338 (60%), Gaps = 20/338 (5%)

Query: 13  MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEY 72
           M  + + +  C + VVS   + +PS     E W + HG+ Y ++ E   R  +F QN++ 
Sbjct: 1   MKTLSVFLAICLA-VVSAIPLKDPSW----EAWKSFHGKKYHNQGEDDFRHYVFLQNIKT 55

Query: 73  IEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVP 132
           I   N +   T+K+  NEFSDLT +EF   Y GY     S+ + +++PSTF     T++P
Sbjct: 56  IAAHNAKS--TFKMAINEFSDLTRKEFVKTYNGYRL---SMKKSTNKPSTFMAPLNTNMP 110

Query: 133 TSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST--DN 190
           T +DWR++G VT IK+QG+CGSCWAFS   ++EG      GKL+ LSEQ L+DCS    N
Sbjct: 111 TEVDWRKEGYVTPIKNQGRCGSCWAFSTTGSLEGQHFRKTGKLVSLSEQNLIDCSAAEGN 170

Query: 191 HGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQ 250
            GC GG MD AFEYI  N G+ TEA YPY   +  C  +K     A  + Y D+ +  E 
Sbjct: 171 DGCGGGFMDDAFEYIKLNNGIDTEASYPYEGRDDICRYKKTNK-GAIDTGYMDIKQYSED 229

Query: 251 ALLQAVSN-QPVSVCVDASGRAFHFYKSGVLN-ADCGNNC-DHGVAVVGFGTAEEENGAK 307
            L  AV+   P+SV +DAS ++FH Y +GV +  +C     DHGV VVG+GT   ENG  
Sbjct: 230 DLKAAVATVGPISVAIDASHKSFHMYHTGVYHEPECSQTVLDHGVLVVGYGT---ENGED 286

Query: 308 YWLIKNSWGETWGESGYIRILRD-AGLCGIATAASYPV 344
           YWL+KNSWG  WG +GYI++ R+ +  CGIAT ASYP+
Sbjct: 287 YWLVKNSWGTDWGMNGYIKMSRNRSNNCGIATNASYPL 324


>gi|156397875|ref|XP_001637915.1| predicted protein [Nematostella vectensis]
 gi|156225031|gb|EDO45852.1| predicted protein [Nematostella vectensis]
          Length = 331

 Score =  262 bits (670), Expect = 1e-67,   Method: Compositional matrix adjust.
 Identities = 148/338 (43%), Positives = 199/338 (58%), Gaps = 14/338 (4%)

Query: 13  MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEY 72
           M  +I  V+ C S  ++   M EP      + W + HG+ Y ++ E+ MR  I++ NL+ 
Sbjct: 1   MEAVIFAVLLCISSALAMPPM-EPLQDPNWKAWKSFHGKEYPNKNEETMRNFIWQNNLKK 59

Query: 73  IEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVP 132
           I   N EG  ++KL  N   D+T+ E      G    +   +    + +TF       V 
Sbjct: 60  IVTHN-EGKHSFKLAMNHLGDMTSLEISQTLLGLK--LKKHAESQPKGATFLPPANVKVV 116

Query: 133 TSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD--N 190
            SIDWR KG VT +K+QGQCGSCWAFS   A+EG      GKL+ LSEQ LVDCS    N
Sbjct: 117 DSIDWRSKGYVTPVKNQGQCGSCWAFSTTGALEGQHFRKTGKLVSLSEQNLVDCSGKYGN 176

Query: 191 HGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQ 250
           +GC GGLMD AF+YI EN G+ TE  YPY  ++G C   K  A+ A  + + D+P GDE 
Sbjct: 177 NGCEGGLMDNAFQYIKENGGIDTEKSYPYLAKDGVCHYNKS-AIGAKDTGFVDIPTGDEN 235

Query: 251 ALLQAVSN-QPVSVCVDASGRAFHFYKSGVL-NADCGNN-CDHGVAVVGFGTAEEENGAK 307
           AL QA+++  P+S+ +DAS   FHFY  GV  + DC +   DHGV  VG+GT   ++G  
Sbjct: 236 ALQQALASVGPISIAIDASQSTFHFYHQGVYDDPDCSSTRLDHGVLAVGYGT---DDGKD 292

Query: 308 YWLIKNSWGETWGESGYIRILR-DAGLCGIATAASYPV 344
           YWL+KNSWG +WGE GYI+I R D   CG+A+ ASYP+
Sbjct: 293 YWLVKNSWGPSWGEEGYIKIARNDHDKCGVASKASYPL 330


>gi|390608645|ref|NP_001254624.1| cathepsin S isoform 1 preproprotein [Mus musculus]
 gi|74214026|dbj|BAE29430.1| unnamed protein product [Mus musculus]
          Length = 343

 Score =  262 bits (670), Expect = 2e-67,   Method: Compositional matrix adjust.
 Identities = 145/319 (45%), Positives = 198/319 (62%), Gaps = 19/319 (5%)

Query: 35  EPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKE---GNRTYKLGTNEF 91
           +P++    + W   H + YKD+ E+ +R  I+++NL++I   N E   G  TY++G N+ 
Sbjct: 32  DPTLDYHWDLWKKTHEKEYKDKNEEEVRRLIWEKNLKFIMIHNLEYSMGMHTYQVGMNDM 91

Query: 92  SDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQ 151
            D+TNEE          P     RQS +  TF+  +   +P ++DWREKG VT +K QG 
Sbjct: 92  GDMTNEEILCRMGALRIP-----RQSPKTVTFRSYSNRTLPDTVDWREKGCVTEVKYQGS 146

Query: 152 CGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD----NHGCSGGLMDKAFEYIIE 207
           CG+CWAFSAV A+EG  ++  GKLI LS Q LVDCS +    N GC GG M +AF+YII+
Sbjct: 147 CGACWAFSAVGALEGQLKLKTGKLISLSAQNLVDCSNEEKYGNKGCGGGYMTEAFQYIID 206

Query: 208 NKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQ-PVSVCVD 266
           N G+  +A YPY+  +  C +   K  AAT S+Y  LP GDE AL +AV+ + PVSV +D
Sbjct: 207 NGGIEADASYPYKATDEKC-HYNSKNRAATCSRYIQLPFGDEDALKEAVATKGPVSVGID 265

Query: 267 ASGRAFHFYKSGVL-NADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYI 325
           AS  +F FYKSGV  +  C  N +HGV VVG+GT +   G  YWL+KNSWG  +G+ GYI
Sbjct: 266 ASHSSFFFYKSGVYDDPSCTGNVNHGVLVVGYGTLD---GKDYWLVKNSWGLNFGDQGYI 322

Query: 326 RILR-DAGLCGIATAASYP 343
           R+ R +   CGIA+  SYP
Sbjct: 323 RMARNNKNHCGIASYCSYP 341


>gi|22661|emb|CAA49504.1| papaya proteinase omega [Carica papaya]
          Length = 367

 Score =  262 bits (670), Expect = 2e-67,   Method: Compositional matrix adjust.
 Identities = 139/352 (39%), Positives = 201/352 (57%), Gaps = 16/352 (4%)

Query: 1   MVLKFEKSFIIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQ----WMAQHGRTYKDE 56
           M+    K   + + + + + ++     + G S  + +  E+  Q    WM  H + Y++ 
Sbjct: 3   MIPSISKLLFVAICLFVHMSVSFGDFSIVGYSQDDLTSTERLIQLFNSWMLNHNKFYENV 62

Query: 57  LEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQ 116
            EK  R  IFK NL YI++ NK+ N +Y+LG NEF+DL+N+EF   Y G    +   + +
Sbjct: 63  DEKLYRFEIFKDNLNYIDETNKK-NNSYRLGLNEFADLSNDEFNEKYVG---SLIDATIE 118

Query: 117 SSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLI 176
            S    F  +++ ++P ++DWR+KGAVT ++ QG CGSCWAFSAVA VEGI +I  GKL+
Sbjct: 119 QSYDEEFINEDIVNLPENVDWRKKGAVTPVRHQGSCGSCWAFSAVATVEGINKIRTGKLV 178

Query: 177 ELSEQQLVDCSTDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAA 236
           ELSEQ+LVDC   +HGC GG    A EY+ +N G+   + YPY+ ++GTC  ++      
Sbjct: 179 ELSEQELVDCERRSHGCKGGYPPYALEYVAKN-GIHLRSKYPYKAKQGTCRAKQVGGPIV 237

Query: 237 TISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVG 296
             S    +   +E  LL A++ QPVSV V++ GR F  YK G+    CG   DH V  V 
Sbjct: 238 KTSGVGRVQPNNEGNLLNAIAKQPVSVVVESKGRPFQLYKGGIFEGPCGTKVDHAVTAV- 296

Query: 297 FGTAEEENGAKYWLIKNSWGETWGESGYIRILR----DAGLCGIATAASYPV 344
                +  G  Y LIKNSWG  WGE GYIRI R      G+CG+  ++ YP+
Sbjct: 297 --GYGKSGGKGYILIKNSWGTAWGEKGYIRIKRAPGNSPGVCGLYKSSYYPI 346


>gi|74178074|dbj|BAE29827.1| unnamed protein product [Mus musculus]
 gi|74178231|dbj|BAE29900.1| unnamed protein product [Mus musculus]
 gi|74220784|dbj|BAE31361.1| unnamed protein product [Mus musculus]
          Length = 326

 Score =  262 bits (670), Expect = 2e-67,   Method: Compositional matrix adjust.
 Identities = 145/319 (45%), Positives = 198/319 (62%), Gaps = 19/319 (5%)

Query: 35  EPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKE---GNRTYKLGTNEF 91
           +P++    + W   H + YKD+ E+ +R  I+++NL++I   N E   G  TY++G N+ 
Sbjct: 15  DPTLDYHWDLWKKTHEKEYKDKNEEEVRRLIWEKNLKFIMIHNLEYSMGMHTYQVGMNDM 74

Query: 92  SDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQ 151
            D+TNEE          P     RQS +  TF+  +   +P ++DWREKG VT +K QG 
Sbjct: 75  GDMTNEEILCRMGALRIP-----RQSPKTVTFRSYSNRTLPDTVDWREKGCVTEVKYQGS 129

Query: 152 CGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD----NHGCSGGLMDKAFEYIIE 207
           CG+CWAFSAV A+EG  ++  GKLI LS Q LVDCS +    N GC GG M +AF+YII+
Sbjct: 130 CGACWAFSAVGALEGQLKLKTGKLISLSAQNLVDCSNEEKYGNKGCGGGYMTEAFQYIID 189

Query: 208 NKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQ-PVSVCVD 266
           N G+  +A YPY+  +  C +   K  AAT S+Y  LP GDE AL +AV+ + PVSV +D
Sbjct: 190 NGGIEADASYPYKATDEKC-HYNSKNRAATCSRYIQLPFGDEDALKEAVATKGPVSVGID 248

Query: 267 ASGRAFHFYKSGVL-NADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYI 325
           AS  +F FYKSGV  +  C  N +HGV VVG+GT +   G  YWL+KNSWG  +G+ GYI
Sbjct: 249 ASHSSFFFYKSGVYDDPSCTGNVNHGVLVVGYGTLD---GKDYWLVKNSWGLNFGDQGYI 305

Query: 326 RILR-DAGLCGIATAASYP 343
           R+ R +   CGIA+  SYP
Sbjct: 306 RMARNNKNHCGIASYCSYP 324


>gi|318037269|ref|NP_001187182.1| cathepsin L precursor [Ictalurus punctatus]
 gi|196475596|gb|ACG76367.1| cathepsin L [Ictalurus punctatus]
          Length = 336

 Score =  262 bits (670), Expect = 2e-67,   Method: Compositional matrix adjust.
 Identities = 149/342 (43%), Positives = 203/342 (59%), Gaps = 17/342 (4%)

Query: 13  MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEY 72
           M + + ++  C S V +  ++ +  +    +QW   H + Y  E E+  R  ++++NL+ 
Sbjct: 1   MRLCLAVLAVCLSTVSAAPTV-DRELDGHWQQWKEWHNKDYH-EKEEGWRRMVWEKNLKK 58

Query: 73  IEKANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
           IE  N E   G  +Y+L  N F D+ +EEFR +  GY   V  +     R S F   N  
Sbjct: 59  IELHNLEHSLGKHSYRLAMNHFGDMPHEEFRQVMNGYKHKVRKI-----RGSLFMEPNFL 113

Query: 130 DVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD 189
           + P+ +DWREKG VT +KDQGQCGSCWAFS   A+EG      GKL+ LSEQ LVDCS  
Sbjct: 114 EAPSKLDWREKGYVTPVKDQGQCGSCWAFSTTGAMEGQQFRKTGKLVSLSEQNLVDCSRP 173

Query: 190 --NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKG 247
             N GC+GGLMD+AF+YI +N GL TE  YPY   +    +      AA  + + D+P G
Sbjct: 174 EGNEGCNGGLMDQAFQYIKDNGGLDTEKFYPYLGTDDQPCHYDPSYSAANDTGFVDIPSG 233

Query: 248 DEQALLQAVSN-QPVSVCVDASGRAFHFYKSGV-LNADCGN-NCDHGVAVVGFG-TAEEE 303
            E AL++AV+   PVSV +DA   +F FY+SG+   ADC + + DHGV VVG+G   E  
Sbjct: 234 KEHALMKAVTAVGPVSVAIDAGHESFQFYQSGIYYEADCSSEDLDHGVLVVGYGYEGENV 293

Query: 304 NGAKYWLIKNSWGETWGESGYIRILRDA-GLCGIATAASYPV 344
           +G KYW++KNSW E WG  GYI + +D    CGIATAASYP+
Sbjct: 294 DGKKYWIVKNSWSEQWGNKGYIYMAKDRHNHCGIATAASYPL 335


>gi|392306967|ref|NP_067256.3| cathepsin S isoform 2 preproprotein [Mus musculus]
 gi|26390492|dbj|BAC25906.1| unnamed protein product [Mus musculus]
 gi|148706872|gb|EDL38819.1| cathepsin S [Mus musculus]
          Length = 342

 Score =  262 bits (670), Expect = 2e-67,   Method: Compositional matrix adjust.
 Identities = 145/319 (45%), Positives = 198/319 (62%), Gaps = 19/319 (5%)

Query: 35  EPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKE---GNRTYKLGTNEF 91
           +P++    + W   H + YKD+ E+ +R  I+++NL++I   N E   G  TY++G N+ 
Sbjct: 31  DPTLDYHWDLWKKTHEKEYKDKNEEEVRRLIWEKNLKFIMIHNLEYSMGMHTYQVGMNDM 90

Query: 92  SDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQ 151
            D+TNEE          P     RQS +  TF+  +   +P ++DWREKG VT +K QG 
Sbjct: 91  GDMTNEEILCRMGALRIP-----RQSPKTVTFRSYSNRTLPDTVDWREKGCVTEVKYQGS 145

Query: 152 CGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD----NHGCSGGLMDKAFEYIIE 207
           CG+CWAFSAV A+EG  ++  GKLI LS Q LVDCS +    N GC GG M +AF+YII+
Sbjct: 146 CGACWAFSAVGALEGQLKLKTGKLISLSAQNLVDCSNEEKYGNKGCGGGYMTEAFQYIID 205

Query: 208 NKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQ-PVSVCVD 266
           N G+  +A YPY+  +  C +   K  AAT S+Y  LP GDE AL +AV+ + PVSV +D
Sbjct: 206 NGGIEADASYPYKATDEKC-HYNSKNRAATCSRYIQLPFGDEDALKEAVATKGPVSVGID 264

Query: 267 ASGRAFHFYKSGVL-NADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYI 325
           AS  +F FYKSGV  +  C  N +HGV VVG+GT +   G  YWL+KNSWG  +G+ GYI
Sbjct: 265 ASHSSFFFYKSGVYDDPSCTGNVNHGVLVVGYGTLD---GKDYWLVKNSWGLNFGDQGYI 321

Query: 326 RILR-DAGLCGIATAASYP 343
           R+ R +   CGIA+  SYP
Sbjct: 322 RMARNNKNHCGIASYCSYP 340


>gi|189525868|ref|XP_001341714.2| PREDICTED: cathepsin L1-like isoform 1 [Danio rerio]
          Length = 336

 Score =  262 bits (669), Expect = 2e-67,   Method: Compositional matrix adjust.
 Identities = 140/340 (41%), Positives = 204/340 (60%), Gaps = 15/340 (4%)

Query: 15  VIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIE 74
           ++  L++T +   V   S  +  + +    W +QHG++Y +++E   R+ I+++NL  IE
Sbjct: 1   MMFALLVTLSISAVFAASSIDIQLDDHWNSWKSQHGKSYHEDVEVGRRM-IWEENLRKIE 59

Query: 75  KANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDV 131
           + N E   GN T+K+G N+F D+TNEEFR    GY         Q+S+   F   +    
Sbjct: 60  QHNFEYSYGNHTFKMGMNQFGDMTNEEFRQAMNGYKHD----PNQTSQGPLFMEPSFFAA 115

Query: 132 PTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD-- 189
           P  +DWR++G VT +KDQ QCGSCW+FS+  A+EG      GKLI +SEQ LVDCS    
Sbjct: 116 PQQVDWRQRGYVTPVKDQKQCGSCWSFSSTGALEGQLFRKTGKLISMSEQNLVDCSRPQG 175

Query: 190 NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDE 249
           N GC+GGLMD+AF+Y+ ENKGL +E  YPY   +        +   A I+ + D+P G+E
Sbjct: 176 NQGCNGGLMDQAFQYVKENKGLDSEQSYPYLARDDLPCRYDPRFNVAKITGFVDIPSGNE 235

Query: 250 QALLQAVSN-QPVSVCVDASGRAFHFYKSGVL--NADCGNNCDHGVAVVGFG-TAEEENG 305
            AL+ AV+   PVSV +DAS ++  FY+SG+    A   +  DH V VVG+G    +  G
Sbjct: 236 LALMNAVAAVGPVSVAIDASHQSLQFYQSGIYYERACSSSRLDHAVLVVGYGYQGADVAG 295

Query: 306 AKYWLIKNSWGETWGESGYIRILRDA-GLCGIATAASYPV 344
            +YW++KNSW + WG+ GYI + +D    CG+AT ASYP+
Sbjct: 296 NRYWIVKNSWSDKWGDKGYIYMAKDKNNHCGVATKASYPL 335


>gi|325303202|tpg|DAA34687.1| TPA_inf: cathepsin L-like cysteine proteinase B [Amblyomma
           variegatum]
          Length = 337

 Score =  262 bits (669), Expect = 2e-67,   Method: Compositional matrix adjust.
 Identities = 145/344 (42%), Positives = 214/344 (62%), Gaps = 20/344 (5%)

Query: 13  MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEY 72
           M   ++L   CA+   +  + H+  +  +   + A HG+ Y+ E E+  RL I+ +N   
Sbjct: 1   MRGFVVLCFLCAAMTAAAIT-HQELVGAEWSAFKALHGKEYQSETEEYYRLKIYMENRMM 59

Query: 73  IEKANKE--GNR-TYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSS---RPSTFKYQ 126
           I + N++   N+ +YKL  NE+ D+ + EF +   G+ R   S  RQ S    P   + +
Sbjct: 60  IARHNEKYANNKVSYKLAMNEYGDMLHHEFVSTRNGFRRDYRSKPRQGSFYIEPEGIEDK 119

Query: 127 NVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDC 186
           ++   P ++DWR+KGAVT +K+QGQCGSCWAFS   ++EG      G ++ LSEQ LVDC
Sbjct: 120 HL---PKTVDWRKKGAVTPVKNQGQCGSCWAFSTTGSLEGQHFRKSGDMVSLSEQNLVDC 176

Query: 187 ST--DNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDL 244
           ST   N+GC GGLMD AF+YI  N G+ TE  YPY   +GTC + K+  V AT + + D+
Sbjct: 177 STAFGNNGCEGGLMDNAFKYIKANGGIDTEKSYPYNGTDGTC-HFKKSDVGATDTGFVDI 235

Query: 245 PKGDEQALLQAVSN-QPVSVCVDASGRAFHFYKSGVLN-ADCGN-NCDHGVAVVGFGTAE 301
           P+G+E  L +AV+   P+SV +DAS ++F FY  GV +  +C + N DHGV VVG+GT +
Sbjct: 236 PEGNEHLLKKAVATVGPISVAIDASHQSFQFYSQGVYDEPECSSENLDHGVLVVGYGTKD 295

Query: 302 EENGAKYWLIKNSWGETWGESGYIRILRDA-GLCGIATAASYPV 344
           +++   YWL+KNSWG TWG+ GYI + R+    CGIA++ASYP+
Sbjct: 296 DQD---YWLVKNSWGTTWGDGGYIYMTRNKDNQCGIASSASYPL 336


>gi|33112581|gb|AAP94046.1| cathepsin-L-like cysteine peptidase 02 [Tenebrio molitor]
          Length = 337

 Score =  262 bits (669), Expect = 2e-67,   Method: Compositional matrix adjust.
 Identities = 150/344 (43%), Positives = 206/344 (59%), Gaps = 23/344 (6%)

Query: 14  FVIIILVITCASQVVSGRSMHEPSIVEKHEQWMA---QHGRTYKDELEKAMRLNIFKQNL 70
           F+I + +    SQ VS   + +       EQW A    H + Y+ + E+  R+ IF +N 
Sbjct: 3   FLIFLAICVAGSQAVSFFDLVQ-------EQWGAFKMTHNKQYQSDTEERFRMKIFMENS 55

Query: 71  EYIEKANK---EGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSV-SRQSSRPSTFKYQ 126
             + K NK   +G  ++KLG N+++D+ + EF  +  G+NR    + S +S    TF   
Sbjct: 56  HTVAKHNKLYAQGLVSFKLGINKYADMLHHEFVQVLNGFNRTKSGLRSGESDDSVTFLPP 115

Query: 127 NVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDC 186
               +P  IDWR+KGAVT +KDQGQCGSCW+FSA  ++EG      GKL+ LSEQ LVDC
Sbjct: 116 ANVQLPGQIDWRDKGAVTPVKDQGQCGSCWSFSATGSLEGQHFRKSGKLVSLSEQNLVDC 175

Query: 187 STD--NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDL 244
           S    N+GC+GGLMD AF YI  N G+ TE  YPY+ E+  C + K K   AT   Y D+
Sbjct: 176 SEKFGNNGCNGGLMDNAFRYIKANGGIDTEQAYPYKAEDEKC-HYKPKNKGATDRGYVDI 234

Query: 245 PKGDEQALLQAVSN-QPVSVCVDASGRAFHFYKSGV-LNADCG-NNCDHGVAVVGFGTAE 301
             G+E  L  AV+   PVSV +DAS ++F  Y  GV    +C  +  DHGV VVG+GT  
Sbjct: 235 ESGNEDKLQSAVATVGPVSVAIDASHQSFQLYSGGVYYEPECSPSQLDHGVLVVGYGT-- 292

Query: 302 EENGAKYWLIKNSWGETWGESGYIRILRDA-GLCGIATAASYPV 344
           E++G  YWL+KNSWG++WG+ GYI++ R+    CGIAT ASYP+
Sbjct: 293 EDDGTDYWLVKNSWGKSWGDQGYIKMARNRDNNCGIATEASYPL 336


>gi|341940310|sp|O70370.2|CATS_MOUSE RecName: Full=Cathepsin S; Flags: Precursor
          Length = 340

 Score =  262 bits (669), Expect = 2e-67,   Method: Compositional matrix adjust.
 Identities = 145/319 (45%), Positives = 198/319 (62%), Gaps = 19/319 (5%)

Query: 35  EPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKE---GNRTYKLGTNEF 91
           +P++    + W   H + YKD+ E+ +R  I+++NL++I   N E   G  TY++G N+ 
Sbjct: 29  DPTLDYHWDLWKKTHEKEYKDKNEEEVRRLIWEKNLKFIMIHNLEYSMGMHTYQVGMNDM 88

Query: 92  SDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQ 151
            D+TNEE          P     RQS +  TF+  +   +P ++DWREKG VT +K QG 
Sbjct: 89  GDMTNEEILCRMGALRIP-----RQSPKTVTFRSYSNRTLPDTVDWREKGCVTEVKYQGS 143

Query: 152 CGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD----NHGCSGGLMDKAFEYIIE 207
           CG+CWAFSAV A+EG  ++  GKLI LS Q LVDCS +    N GC GG M +AF+YII+
Sbjct: 144 CGACWAFSAVGALEGQLKLKTGKLISLSAQNLVDCSNEEKYGNKGCGGGYMTEAFQYIID 203

Query: 208 NKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQ-PVSVCVD 266
           N G+  +A YPY+  +  C +   K  AAT S+Y  LP GDE AL +AV+ + PVSV +D
Sbjct: 204 NGGIEADASYPYKATDEKC-HYNSKNRAATCSRYIQLPFGDEDALKEAVATKGPVSVGID 262

Query: 267 ASGRAFHFYKSGVL-NADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYI 325
           AS  +F FYKSGV  +  C  N +HGV VVG+GT +   G  YWL+KNSWG  +G+ GYI
Sbjct: 263 ASHSSFFFYKSGVYDDPSCTGNVNHGVLVVGYGTLD---GKDYWLVKNSWGLNFGDQGYI 319

Query: 326 RILR-DAGLCGIATAASYP 343
           R+ R +   CGIA+  SYP
Sbjct: 320 RMARNNKNHCGIASYCSYP 338


>gi|387015022|gb|AFJ49630.1| Cathepsin L1-like [Crotalus adamanteus]
          Length = 338

 Score =  262 bits (669), Expect = 2e-67,   Method: Compositional matrix adjust.
 Identities = 145/343 (42%), Positives = 212/343 (61%), Gaps = 17/343 (4%)

Query: 13  MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEY 72
           M  + IL ++  +   +     +P++ +    W + H + Y  E E+  R  I+++NL+ 
Sbjct: 1   MIYLCILALSFGASFAA--PGLDPALNDHWLSWKSWHSKKYH-EKEEGWRRMIWEKNLKM 57

Query: 73  IEKANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
           IE  N +   G  +Y+LG N F D+TNEEFR +  G+ +   S S++  + S F   N  
Sbjct: 58  IELHNLDHSLGKHSYRLGMNHFGDMTNEEFRQVMNGFKQ---SRSQRKYKGSQFLEPNFL 114

Query: 130 DVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCS-- 187
             P S+DWREKG VT +KDQGQCGSCWAFSA  A+EG      GKL+ LSEQ L+DCS  
Sbjct: 115 QAPKSVDWREKGYVTPVKDQGQCGSCWAFSATGALEGQHFRKTGKLVSLSEQNLIDCSGP 174

Query: 188 TDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKG 247
             N GC+GGLMD+AF+YI +N G+ +E  YPY  ++      K +  +A  + + D+P+G
Sbjct: 175 EGNQGCNGGLMDQAFQYIKDNNGIDSEESYPYIGKDDEDCLYKPEYNSANDTGFVDIPEG 234

Query: 248 DEQALLQAVSN-QPVSVCVDASGRAFHFYKSGVLNADCGNN--CDHGVAVVGFGT--AEE 302
            E+AL++AV+   P+SV +DAS  +F FY+SGV      N+   DHGV VVG+G    ++
Sbjct: 235 RERALMKAVAAVGPISVAIDASHTSFQFYESGVYYEPQCNSEELDHGVLVVGYGYEGTDD 294

Query: 303 ENGAKYWLIKNSWGETWGESGYIRILRD-AGLCGIATAASYPV 344
           +N  +YW++KNSW E WG+ GYI + +D +  CGIA+AASYP+
Sbjct: 295 DNKKRYWIVKNSWSEKWGDQGYIHMAKDRSNNCGIASAASYPM 337


>gi|116563690|gb|ABJ99858.1| cathepsin L [Hippoglossus hippoglossus]
          Length = 336

 Score =  262 bits (669), Expect = 2e-67,   Method: Compositional matrix adjust.
 Identities = 145/341 (42%), Positives = 209/341 (61%), Gaps = 17/341 (4%)

Query: 15  VIIILVIT-CASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYI 73
           ++ +LV+T C S V+S   + +  + E  + W + H + Y  E E+  R  ++++NL+ I
Sbjct: 1   MLPLLVLTACLSSVLSAPVL-DAQLNEHWDLWKSWHSKKYH-EKEEGWRRMVWEKNLQKI 58

Query: 74  EKANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTD 130
           E  N E   G  +++LG N F D+T+EEFR +  GY       +++    S F   N   
Sbjct: 59  ELHNLEHSMGTHSFRLGMNHFGDMTHEEFRQIMNGYKLK----TQRKFTGSLFMEPNFMT 114

Query: 131 VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD- 189
            P+++DWREKG VT +KDQGQCGSCWAFS   A+EG      GKL+ LSEQ LVDCS   
Sbjct: 115 APSAVDWREKGYVTPVKDQGQCGSCWAFSTTGALEGQQFRKTGKLVSLSEQNLVDCSRPE 174

Query: 190 -NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGD 248
            N GC GGLMD+AF+Y+ +N+GL +E  YPY   +    +      +A  + + D+P G 
Sbjct: 175 GNEGCGGGLMDQAFQYVTDNQGLDSEDSYPYTGTDDQPCHYDPLYNSANDTGFVDVPSGK 234

Query: 249 EQALLQAVSN-QPVSVCVDASGRAFHFYKSGV-LNADCGN-NCDHGVAVVGFG-TAEEEN 304
           E AL++AV++  PVSV +DA   +F FY+SG+    +C +   DHGV  VG+G   E++ 
Sbjct: 235 EHALMKAVASVGPVSVAIDAGHESFQFYQSGIYYEKECSSEELDHGVLAVGYGFEGEDKM 294

Query: 305 GAKYWLIKNSWGETWGESGYIRILRD-AGLCGIATAASYPV 344
           G K+W++KNSWGE WG+ GYI + +D    CGIATAASYP+
Sbjct: 295 GKKFWIVKNSWGEKWGDKGYIYMAKDRKNHCGIATAASYPL 335


>gi|12805315|gb|AAH02125.1| Ctss protein [Mus musculus]
          Length = 340

 Score =  262 bits (669), Expect = 2e-67,   Method: Compositional matrix adjust.
 Identities = 145/319 (45%), Positives = 198/319 (62%), Gaps = 19/319 (5%)

Query: 35  EPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKE---GNRTYKLGTNEF 91
           +P++    + W   H + YKD+ E+ +R  I+++NL++I   N E   G  TY++G N+ 
Sbjct: 29  DPTLDYHWDLWKKTHEKEYKDKNEEEVRRLIWEKNLKFIMIHNLEYSMGMHTYQVGMNDM 88

Query: 92  SDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQ 151
            D+TNEE          P     RQS +  TF+  +   +P ++DWREKG VT +K QG 
Sbjct: 89  GDMTNEEILCRMGALRIP-----RQSPKTVTFRSYSNRTLPDTVDWREKGCVTEVKYQGS 143

Query: 152 CGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD----NHGCSGGLMDKAFEYIIE 207
           CG+CWAFSAV A+EG  ++  GKLI LS Q LVDCS +    N GC GG M +AF+YII+
Sbjct: 144 CGACWAFSAVGALEGQLKLKTGKLISLSAQNLVDCSNEEKYGNKGCGGGYMTEAFQYIID 203

Query: 208 NKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQ-PVSVCVD 266
           N G+  +A YPY+  +  C +   K  AAT S+Y  LP GDE AL +AV+ + PVSV +D
Sbjct: 204 NGGIEADASYPYKAMDEKC-HYNSKNRAATCSRYIQLPFGDEDALKEAVATKGPVSVGID 262

Query: 267 ASGRAFHFYKSGVL-NADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYI 325
           AS  +F FYKSGV  +  C  N +HGV VVG+GT +   G  YWL+KNSWG  +G+ GYI
Sbjct: 263 ASHSSFFFYKSGVYDDPSCTGNVNHGVLVVGYGTLD---GKDYWLVKNSWGLNFGDQGYI 319

Query: 326 RILR-DAGLCGIATAASYP 343
           R+ R +   CGIA+  SYP
Sbjct: 320 RMARNNKNHCGIASDCSYP 338


>gi|387914010|gb|AFK10614.1| cathepsin L [Callorhinchus milii]
 gi|392873762|gb|AFM85713.1| cathepsin L [Callorhinchus milii]
 gi|392877488|gb|AFM87576.1| cathepsin L [Callorhinchus milii]
          Length = 338

 Score =  262 bits (669), Expect = 2e-67,   Method: Compositional matrix adjust.
 Identities = 142/342 (41%), Positives = 212/342 (61%), Gaps = 15/342 (4%)

Query: 13  MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEY 72
           M +  +++  C +  ++  S+ +P +    EQW + HG++Y ++ E+  R  +++++L  
Sbjct: 1   MRLPFVVLSLCLAGGLAAPSL-DPGLDTHWEQWKSWHGKSY-EQKEETWRRMVWEKHLRV 58

Query: 73  IEKANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
           IE  N E   G  +++LG N F D+ NEEFR L  GY         Q S    F   N  
Sbjct: 59  IEIHNLEHSLGKHSFRLGMNHFGDMPNEEFRQLMNGYKYKQTHKKLQGSH---FLEPNFL 115

Query: 130 DVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD 189
           +VP  +DWR++G VT +KDQGQCGSCWAFS   A+EG      G+L+ LSEQ LV+CS  
Sbjct: 116 EVPKHVDWRDEGYVTPVKDQGQCGSCWAFSTTGALEGQHFRRTGQLVSLSEQNLVECSKP 175

Query: 190 --NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKG 247
             N GC+GGLMD+AF+Y+ +N G+ +E  YPY   + T  +   +  AA  + + D+P G
Sbjct: 176 EGNEGCNGGLMDQAFQYVKDNGGIDSEDSYPYVGTDDTPCHYNPQYNAANDTGFVDIPSG 235

Query: 248 DEQALLQAVSN-QPVSVCVDASGRAFHFYKSGV-LNADCGN-NCDHGVAVVGFGTAEEE- 303
            E+AL++A++   PVSV +DA   +F FY+SG+   A+C + + DHGV VVG+G  + + 
Sbjct: 236 KERALMKAIAAVGPVSVAIDAGHTSFQFYQSGIYFEAECSSTDLDHGVLVVGYGVEKRDT 295

Query: 304 NGAKYWLIKNSWGETWGESGYIRILRDA-GLCGIATAASYPV 344
           +G KYW++KNSW E WG++GYI + +D    CGIATAASYP+
Sbjct: 296 DGKKYWIVKNSWSEKWGQNGYILMAKDKDNHCGIATAASYPL 337


>gi|392884266|gb|AFM90965.1| cathepsin L [Callorhinchus milii]
          Length = 338

 Score =  262 bits (669), Expect = 2e-67,   Method: Compositional matrix adjust.
 Identities = 142/342 (41%), Positives = 212/342 (61%), Gaps = 15/342 (4%)

Query: 13  MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEY 72
           M +  +++  C +  ++  S+ +P +    EQW + HG++Y+ + E   R+ +++++L  
Sbjct: 1   MRLPFVVLSLCLAGGLAAPSL-DPGLDTHWEQWKSWHGKSYEQKEETWRRM-VWEEHLRV 58

Query: 73  IEKANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
           IE  N E   G  +++LG N F D+ NEEFR L  GY         Q S    F   N  
Sbjct: 59  IEIHNLEHSLGKHSFRLGMNHFGDMPNEEFRQLMNGYKYKQTHKKLQGSH---FLEPNFL 115

Query: 130 DVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD 189
           +VP  +DWR++G VT +KDQGQCGSCWAFS   A+EG      G+L+ LSEQ LV+CS  
Sbjct: 116 EVPKHVDWRDEGYVTPVKDQGQCGSCWAFSTTGALEGQHFRRTGQLVSLSEQNLVECSKP 175

Query: 190 --NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKG 247
             N GC+GGLMD+AF+Y+ +N G+ +E  YPY   + T  +   +  AA  + + D+P G
Sbjct: 176 EGNEGCNGGLMDQAFQYVKDNGGIDSEDSYPYVGTDDTPCHYNPQYNAANDTGFVDIPSG 235

Query: 248 DEQALLQAVSN-QPVSVCVDASGRAFHFYKSGV-LNADCGN-NCDHGVAVVGFGTAEEE- 303
            E+AL++A++   PVSV +DA   +F FY+SG+   A+C + + DHGV VVG+G  + + 
Sbjct: 236 KERALMKAIAAVGPVSVAIDAGHTSFQFYQSGIYFEAECSSTDLDHGVLVVGYGVEKRDT 295

Query: 304 NGAKYWLIKNSWGETWGESGYIRILRDA-GLCGIATAASYPV 344
           +G KYW++KNSW E WG++GYI + +D    CGIATAASYP+
Sbjct: 296 DGKKYWIVKNSWSEKWGQNGYILMAKDKDNHCGIATAASYPL 337


>gi|431896621|gb|ELK06033.1| Cathepsin S [Pteropus alecto]
          Length = 331

 Score =  261 bits (668), Expect = 2e-67,   Method: Compositional matrix adjust.
 Identities = 145/340 (42%), Positives = 210/340 (61%), Gaps = 20/340 (5%)

Query: 13  MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEY 72
           M  +  +++ C++ V   +   +P++    + W   + + Y++++E+  R  I+++NL++
Sbjct: 1   MKWLACVLLGCSAAVA--QLQRDPTLDRHWDLWKKTYSKHYREKIEEVARRLIWEKNLKF 58

Query: 73  IEKANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
           +   N E   G  +Y LG N   D+T+EE  +L       VPS   Q  R  T+K     
Sbjct: 59  VMLHNLEHSMGMHSYDLGMNHLGDMTSEEVISLMGSLT--VPS---QWQRNVTYKSNPNQ 113

Query: 130 DVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD 189
            +P S+DWR+KG VT +K QG CGSCWAFSAV A+E   ++  GKL+ LS Q LVDCST+
Sbjct: 114 KLPDSLDWRDKGCVTEVKYQGSCGSCWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTE 173

Query: 190 ---NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPK 246
              N GC+GG M  AF+YII+N G+ +EA YPY+ ++G C     K  AAT SKY +LP 
Sbjct: 174 KYSNKGCNGGFMTSAFQYIIDNNGIDSEASYPYKAQDGKC-QYDSKFRAATCSKYTELPF 232

Query: 247 GDEQALLQAVSNQ-PVSVCVDASGRAFHFYKSGV-LNADCGNNCDHGVAVVGFGTAEEEN 304
           G E+AL +AV+N+ PVSV +DAS  +F  Y+SGV  +  C    +HGV VVG+G  +   
Sbjct: 233 GSEEALKEAVANKGPVSVAIDASHPSFFLYRSGVYYDQSCTLKVNHGVLVVGYGNLD--- 289

Query: 305 GAKYWLIKNSWGETWGESGYIRILRDAG-LCGIATAASYP 343
           G  YWL+KNSWG  +G+ GYIR+ R++G  CGIA+  SYP
Sbjct: 290 GKDYWLVKNSWGLNFGDKGYIRMARNSGNHCGIASYPSYP 329


>gi|392881548|gb|AFM89606.1| cathepsin L [Callorhinchus milii]
          Length = 338

 Score =  261 bits (668), Expect = 2e-67,   Method: Compositional matrix adjust.
 Identities = 142/342 (41%), Positives = 212/342 (61%), Gaps = 15/342 (4%)

Query: 13  MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEY 72
           M +  +++  C +  ++  S+ +P +    EQW + HG++Y ++ E+  R  +++++L  
Sbjct: 1   MRLPFVVLSLCLAGGLAAPSL-DPGLDTHWEQWKSWHGKSY-EQKEETWRRMVWEKHLRV 58

Query: 73  IEKANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
           IE  N E   G  +++LG N F D+ NEEFR L  GY         Q S    F   N  
Sbjct: 59  IEIHNLEHSLGKHSFRLGMNHFGDMPNEEFRQLMNGYKYKQTHKKLQGSH---FLEPNFQ 115

Query: 130 DVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD 189
           +VP  +DWR++G VT +KDQGQCGSCWAFS   A+EG      G+L+ LSEQ LV+CS  
Sbjct: 116 EVPKHVDWRDEGYVTPVKDQGQCGSCWAFSTTGALEGQHFRRTGQLVSLSEQNLVECSKP 175

Query: 190 --NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKG 247
             N GC+GGLMD+AF+Y+ +N G+ +E  YPY   + T  +   +  AA  + + D+P G
Sbjct: 176 EGNEGCNGGLMDQAFQYVKDNGGIDSEDSYPYVGTDDTPCHYNPQYNAANDTGFVDIPSG 235

Query: 248 DEQALLQAVSN-QPVSVCVDASGRAFHFYKSGV-LNADCGN-NCDHGVAVVGFGTAEEE- 303
            E+AL++A++   PVSV +DA   +F FY+SG+   A+C + + DHGV VVG+G  + + 
Sbjct: 236 KERALMKAIAAVGPVSVAIDAGHTSFQFYQSGIYFEAECSSTDLDHGVLVVGYGVEKRDT 295

Query: 304 NGAKYWLIKNSWGETWGESGYIRILRDA-GLCGIATAASYPV 344
           +G KYW++KNSW E WG++GYI + +D    CGIATAASYP+
Sbjct: 296 DGKKYWIVKNSWSEKWGQNGYILMAKDKDNHCGIATAASYPL 337


>gi|330803820|ref|XP_003289900.1| hypothetical protein DICPUDRAFT_80649 [Dictyostelium purpureum]
 gi|325080011|gb|EGC33585.1| hypothetical protein DICPUDRAFT_80649 [Dictyostelium purpureum]
          Length = 328

 Score =  261 bits (668), Expect = 3e-67,   Method: Compositional matrix adjust.
 Identities = 143/342 (41%), Positives = 200/342 (58%), Gaps = 22/342 (6%)

Query: 10  IIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQN 69
           II   V   L++ C S   + R   +       + WM +H ++Y ++ E   R  IF+ N
Sbjct: 3   IILALVFCFLIVNCIS---AARVFSQKQYQTAFQNWMVKHQKSYTND-EFGSRYTIFQDN 58

Query: 70  LEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPS-TFKYQNV 128
           ++++ K N++G+ T  LG N  +DLTN+E++ +Y G    V        +P+      +V
Sbjct: 59  MDFVTKWNQKGSDTI-LGLNSMADLTNQEYQRIYLGTKTTVK-------KPNLIIGVTDV 110

Query: 129 TDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST 188
           +  P S+DWR  GAVT +K+QGQCG C++FS   +VEGI +IT  +L+ LSEQQ++DCS 
Sbjct: 111 SKAPASVDWRANGAVTAVKNQGQCGGCYSFSTTGSVEGIHEITSKQLVSLSEQQILDCSG 170

Query: 189 D--NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPK 246
              N+GC GGLM  +FEYII   GL TEA YPY    G C   K   + ATI+ Y+++  
Sbjct: 171 SEGNNGCDGGLMTNSFEYIIAVGGLDTEASYPYEGVVGKCKFNKAN-IGATITGYKNVKS 229

Query: 247 GDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVL--NADCGNNCDHGVAVVGFGTAEEEN 304
           G E  L  AV+ QPVSV +DAS  +F  Y SGV    A      DHGV  VG+G+   ++
Sbjct: 230 GSESDLQTAVAAQPVSVAIDASQNSFQLYSSGVYYEPACSSTQLDHGVLAVGYGS---QS 286

Query: 305 GAKYWLIKNSWGETWGESGYIRILRDA-GLCGIATAASYPVA 345
           G  YW++KNSWG  WGE G+I + R+    CGIAT ASYP A
Sbjct: 287 GQDYWIVKNSWGADWGEKGFILMARNKHNNCGIATMASYPTA 328


>gi|291224870|ref|XP_002732425.1| PREDICTED: cathepsin L2-like [Saccoglossus kowalevskii]
          Length = 326

 Score =  261 bits (668), Expect = 3e-67,   Method: Compositional matrix adjust.
 Identities = 154/343 (44%), Positives = 205/343 (59%), Gaps = 27/343 (7%)

Query: 11  IPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNL 70
           + MF+ + LV   A+           S+  + E W   +G+ Y  + E+A+R  I+  NL
Sbjct: 1   MKMFISLALVAMAAA----------TSVNTEWESWKRTYGKEYTQK-EEALRHMIWNVNL 49

Query: 71  EYIEKANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQN 127
           + I+  N++   G  TY    N+F DLTNEE+R L  GY +   +V    S+PSTF   +
Sbjct: 50  KMIQMHNEKYMSGKSTYTQNMNQFGDLTNEEYRELMCGYKKSNKTVI---SKPSTFLLPS 106

Query: 128 VTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCS 187
               P SIDWR +G VT +KDQG CGSCWAFS+  ++EG T    GKL+ LSEQQLVDCS
Sbjct: 107 NYRAPASIDWRTQGYVTDVKDQGACGSCWAFSSTGSLEGQTFKKTGKLVPLSEQQLVDCS 166

Query: 188 TD--NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLP 245
            D  N GC GG MD+AF Y I++KG  +E  YPY   + TC     K V AT + Y D+P
Sbjct: 167 GDYGNMGCGGGWMDQAFSY-IKDKGEESEDGYPYTGTDDTCVYDASK-VVATDTGYTDIP 224

Query: 246 KGDEQALLQAVSN-QPVSVCVDASGRAFHFYKSGVLN-ADCG-NNCDHGVAVVGFGTAEE 302
           + DE AL QAV+   P+SV +DA+  +F FY+SGV +  +C   N DH V  VG+GT+EE
Sbjct: 225 EMDENALQQAVATVGPISVAIDATHSSFQFYESGVYDEPECSQTNLDHAVLAVGYGTSEE 284

Query: 303 ENGAKYWLIKNSWGETWGESGYIRILRDA-GLCGIATAASYPV 344
             G  YW++KNSW   WG  GYI + R+    CGIA+ ASYPV
Sbjct: 285 --GLDYWIVKNSWSTGWGMQGYIEMSRNKDNQCGIASKASYPV 325


>gi|110349473|gb|ABG73217.1| cathepsin L 1 precursor [Diaprepes abbreviatus]
          Length = 322

 Score =  261 bits (668), Expect = 3e-67,   Method: Compositional matrix adjust.
 Identities = 148/334 (44%), Positives = 200/334 (59%), Gaps = 22/334 (6%)

Query: 18  ILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKAN 77
           + +  C    VS   + E  +  K + +  +HG+TYK+++E+  R NIFK NL  IE+ N
Sbjct: 3   VFIAACLLVAVSATVLEETGV--KFQAFKLKHGKTYKNQVEETARFNIFKDNLRAIEQHN 60

Query: 78  ---KEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTS 134
              ++G  +YK G N F+D+T EEFRA  T       S S++    +T        VP S
Sbjct: 61  VLYEQGLVSYKKGINRFTDMTQEEFRAFLT------LSSSKKPHFNTTEHVLTGLAVPDS 114

Query: 135 IDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGC 193
           IDWR KG VT +KDQG CGSCWAFS   + E       GKL+ LSEQQLVDCSTD N GC
Sbjct: 115 IDWRTKGQVTGVKDQGNCGSCWAFSVTGSTEAAYYRKAGKLVSLSEQQLVDCSTDINAGC 174

Query: 194 SGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALL 253
           +GG +D+ F Y +++KGL  E+ YPY+  +G+C     K V   +S ++ L   DE ALL
Sbjct: 175 NGGYLDETFTY-VKSKGLEAESTYPYKGTDGSCKYSASKVVTK-VSGHKSLKSEDENALL 232

Query: 254 QAVSN-QPVSVCVDASGRAFHFYKSGVLNAD-CG-NNCDHGVAVVGFGTAEEENGAKYWL 310
            AV N  PVSV +DA+      Y+SG+   D C  +  +HGV VVG+GT+   NG KYW+
Sbjct: 233 DAVGNVGPVSVAIDAT--YLSSYESGIYEDDWCSPSELNHGVLVVGYGTS---NGKKYWI 287

Query: 311 IKNSWGETWGESGYIRILRDAGLCGIATAASYPV 344
           +KNSWG ++GESGY R+LR    CG+A    YP+
Sbjct: 288 VKNSWGGSFGESGYFRLLRGKNECGVAEDTVYPI 321


>gi|225719058|gb|ACO15375.1| Cathepsin L1 precursor [Caligus clemensi]
          Length = 326

 Score =  261 bits (667), Expect = 3e-67,   Method: Compositional matrix adjust.
 Identities = 143/310 (46%), Positives = 191/310 (61%), Gaps = 18/310 (5%)

Query: 44  QWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKE---GNRTYKLGTNEFSDLTNEEFR 100
           +W A HG+ Y    E+++R  IF++N   I + N+E   G  TY LG N F DL + EF 
Sbjct: 25  KWKATHGKVYNSADEESLRFKIFQENSLMITQHNEEYRQGFHTYILGMNHFGDLLHSEFL 84

Query: 101 ALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSA 160
               G+   V       S    F +     VP+  +W  KGAVT +KDQG+CGSCWAFSA
Sbjct: 85  ERSNGFQGGV-------SGGDVFTFDTNAPVPSYANWTAKGAVTPVKDQGKCGSCWAFSA 137

Query: 161 VAAVEGITQITRGKLIELSEQQLVDCSTD--NHGCSGGLMDKAFEYIIENKGLATEADYP 218
             +VEG   + + KL+ LSEQQLVDCS D  N GC GGLMD AF+Y I NKG+A E  YP
Sbjct: 138 TGSVEGQIFLKKKKLMSLSEQQLVDCSGDEGNLGCGGGLMDNAFKYFIANKGIANEKSYP 197

Query: 219 YRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSN-QPVSVCVDASGRAFHFYKS 277
           Y  ++  C  +K  +V ATIS ++D+   DE  L  AV+N  PVSV +DAS   F FY+S
Sbjct: 198 YTAKDNDCKYKKSMSV-ATISSFKDVKHKDEDQLKMAVANVGPVSVAIDASSSKFQFYES 256

Query: 278 GV-LNADCGNNC-DHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRDA-GLC 334
           GV  + +C +   DHGV  VG+GT ++++G  +WL+KNSW  +WG +GYI++ R+    C
Sbjct: 257 GVYYDENCSSEVLDHGVLAVGYGT-DKKSGMDFWLVKNSWAASWGLNGYIKMARNKDNNC 315

Query: 335 GIATAASYPV 344
           GIAT ASYP+
Sbjct: 316 GIATMASYPI 325


>gi|33242886|gb|AAQ01147.1| cathepsin [Haplochromis chilotes]
          Length = 334

 Score =  261 bits (667), Expect = 3e-67,   Method: Compositional matrix adjust.
 Identities = 142/321 (44%), Positives = 190/321 (59%), Gaps = 16/321 (4%)

Query: 32  SMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKE---GNRTYKLGT 88
           +M E ++    E W   HG++YK+++E A R  ++  NL+ I   N E   G  TY+LG 
Sbjct: 21  AMFESTLDAHWELWKKTHGKSYKNDVENAHRRELWGNNLKMITVHNLEASMGLHTYELGM 80

Query: 89  NEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKD 148
           N   DLT EE    +     P   + R    PS F   + + +P ++DWREKG VT +K 
Sbjct: 81  NHMGDLTEEEIMQFFASLTPPT-DIQRA---PSPFAGASGSGIPDTMDWREKGCVTKVKM 136

Query: 149 QGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD--NHGCSGGLMDKAFEYII 206
           QG CGSCWAFSA  A+EG    + GKL++LS Q LVDCS    NHGC+GG M +AF+Y+I
Sbjct: 137 QGACGSCWAFSAAGALEGQLAKSTGKLVDLSPQNLVDCSGKYGNHGCNGGFMTRAFQYVI 196

Query: 207 ENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSN-QPVSVCV 265
           +N G+ ++A YPY   +  C +      AA  S Y+ LP+GDE AL Q ++   P+SV +
Sbjct: 197 DNHGIDSDASYPYIGRDDQC-HYNPATRAANCSSYQFLPEGDENALKQGLATVGPISVAI 255

Query: 266 DASGRAFHFYKSGVLN-ADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGY 324
           DA    F FY+SGV N   C    +HGV  VG+GT    NG  YWL+KNSWG T+G+ GY
Sbjct: 256 DARRPRFSFYRSGVYNDPSCTQKVNHGVLAVGYGTL---NGQDYWLVKNSWGTTFGDQGY 312

Query: 325 IRILRDAG-LCGIATAASYPV 344
           IR+ R+ G  CGIA    YPV
Sbjct: 313 IRMARNTGNQCGIALYPCYPV 333


>gi|302779822|ref|XP_002971686.1| hypothetical protein SELMODRAFT_16221 [Selaginella moellendorffii]
 gi|300160818|gb|EFJ27435.1| hypothetical protein SELMODRAFT_16221 [Selaginella moellendorffii]
          Length = 214

 Score =  261 bits (667), Expect = 4e-67,   Method: Compositional matrix adjust.
 Identities = 121/215 (56%), Positives = 159/215 (73%), Gaps = 6/215 (2%)

Query: 134 SIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHG 192
           S+DWR+KG VT IKDQG CG+CWAFSA+AAVEG+T ++ G L+ LSEQ+LVDC T  N G
Sbjct: 1   SVDWRKKGGVTEIKDQGDCGNCWAFSAIAAVEGLTFLSTGTLVSLSEQELVDCDTTVNQG 60

Query: 193 CSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQAL 252
           C GG+MD AF+Y+I N G+ ++++YPYR + G CD  K K  AATI+ ++ +P   E+ L
Sbjct: 61  CDGGMMDYAFQYMIRNGGITSQSNYPYRAQRGACDKDKVKYHAATINGFQAIPPQSEELL 120

Query: 253 LQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIK 312
           L+AV+NQPVSV ++A G+ F  Y SGV   +CG+N DHGVA+VG+GT  +  G +YWL+K
Sbjct: 121 LRAVANQPVSVAIEAGGQDFQLYSSGVFTGECGSNLDHGVAIVGYGT--DAGGRQYWLVK 178

Query: 313 NSWGETWGESGYIRILRD---AGLCGIATAASYPV 344
           NSWG  WGESGY+R+ R    AG+CGI   ASYP 
Sbjct: 179 NSWGSGWGESGYVRMERQGPGAGVCGINLDASYPT 213


>gi|115715524|ref|XP_780580.2| PREDICTED: cathepsin L-like [Strongylocentrotus purpuratus]
          Length = 334

 Score =  261 bits (666), Expect = 5e-67,   Method: Compositional matrix adjust.
 Identities = 152/341 (44%), Positives = 206/341 (60%), Gaps = 20/341 (5%)

Query: 14  FVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYI 73
           ++ ++LV  C   VVS  SM      E   QW  +HG+ Y  + E+A R  I+++NL+ +
Sbjct: 3   YLSVLLVAAC---VVSSLSMSFIDFDEDWNQWKNEHGKRYLSDEEEASRRLIWQKNLDIV 59

Query: 74  EKANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTF-KYQNVT 129
            K N +   G+ TY LG N+F+DL NEEF +L  G+       S +++R STF    NV 
Sbjct: 60  IKHNLKYDLGHFTYDLGMNQFADLKNEEFVSLMNGFRGN----SSKATRGSTFLPPSNVF 115

Query: 130 DVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD 189
           D+PT +DWR KG VT +K+Q QCGSCWAFSA  ++EG      GKL+ LSEQ LVDCS  
Sbjct: 116 DMPTMVDWRTKGYVTPVKNQLQCGSCWAFSATGSLEGQHFKKTGKLVSLSEQNLVDCSGK 175

Query: 190 --NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKG 247
             N GC GGLMD+AF+YI++  G+ TE  YPY   +G C   K   + AT + Y D+  G
Sbjct: 176 EGNMGCEGGLMDQAFQYILDVGGIDTEMSYPYTAMDGQCHFNKAN-IGATDTGYTDVTTG 234

Query: 248 DEQALLQAVSN-QPVSVCVDASGRAFHFYKSGVLN--ADCGNNCDHGVAVVGFGTAEEEN 304
            E AL  AV++  P+SV +DAS ++F  YKSGV N  A      DHGV  VG+GT+ +  
Sbjct: 235 SESALQMAVASVGPISVAIDASHQSFQLYKSGVYNEPACSSTLLDHGVLAVGYGTSSD-- 292

Query: 305 GAKYWLIKNSWGETWGESGYIRILRDA-GLCGIATAASYPV 344
           G  Y+   +SWG  WG +GY+ + R+    CGIAT ASYP+
Sbjct: 293 GTDYFFFFHSWGAAWGMNGYLWMSRNKDNQCGIATKASYPL 333


>gi|75812934|ref|NP_001028787.1| cathepsin S precursor [Bos taurus]
 gi|115503669|sp|P25326.2|CATS_BOVIN RecName: Full=Cathepsin S; Flags: Precursor
 gi|74353837|gb|AAI02246.1| Cathepsin S [Bos taurus]
 gi|296489535|tpg|DAA31648.1| TPA: cathepsin S precursor [Bos taurus]
          Length = 331

 Score =  261 bits (666), Expect = 5e-67,   Method: Compositional matrix adjust.
 Identities = 146/340 (42%), Positives = 207/340 (60%), Gaps = 20/340 (5%)

Query: 13  MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEY 72
           M  ++  ++ C+S +       +P++    + W   +G+ YK++ E+  R  I+++NL+ 
Sbjct: 1   MNWLVWALLLCSSAMA--HVHRDPTLDHHWDLWKKTYGKQYKEKNEEVARRLIWEKNLKT 58

Query: 73  IEKANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
           +   N E   G  +Y+LG N   D+T+EE  +L +    P      Q  R  T+K     
Sbjct: 59  VTLHNLEHSMGMHSYELGMNHLGDMTSEEVISLMSSLRVP-----SQWPRNVTYKSDPNQ 113

Query: 130 DVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST- 188
            +P S+DWREKG VT +K QG CGSCWAFSAV A+E   ++  GKL+ LS Q LVDCST 
Sbjct: 114 KLPDSMDWREKGCVTEVKYQGACGSCWAFSAVGALEAQVKLKTGKLVSLSAQNLVDCSTA 173

Query: 189 --DNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPK 246
              N GC+GG M +AF+YII+N G+ +EA YPY+  +G C     K  AAT S+Y +LP 
Sbjct: 174 KYGNKGCNGGFMTEAFQYIIDNNGIDSEASYPYKAMDGKC-QYDVKNRAATCSRYIELPF 232

Query: 247 GDEQALLQAVSNQ-PVSVCVDASGRAFHFYKSGV-LNADCGNNCDHGVAVVGFGTAEEEN 304
           G E+AL +AV+N+ PVSV +DAS  +F  YK+GV  +  C  N +HGV VVG+G  +   
Sbjct: 233 GSEEALKEAVANKGPVSVGIDASHSSFFLYKTGVYYDPSCTQNVNHGVLVVGYGNLD--- 289

Query: 305 GAKYWLIKNSWGETWGESGYIRILRDAG-LCGIATAASYP 343
           G  YWL+KNSWG  +G+ GYIR+ R++G  CGIA   SYP
Sbjct: 290 GKDYWLVKNSWGLHFGDQGYIRMARNSGNHCGIANYPSYP 329


>gi|61661067|gb|AAX51229.1| cathepsin S cysteine protease [Paralichthys olivaceus]
          Length = 337

 Score =  261 bits (666), Expect = 5e-67,   Method: Compositional matrix adjust.
 Identities = 149/340 (43%), Positives = 198/340 (58%), Gaps = 21/340 (6%)

Query: 13  MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEY 72
           M   ++LV  C    V   +M +  +    E W   HG+TY +E+E   R  ++++NL  
Sbjct: 10  MLASLLLVSLC----VEAAAMLDVRLDVHWELWKKSHGKTYPNEVEDVRRRELWERNLML 65

Query: 73  IEKANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
           I K N E   G +TY L  N   DLT EE    Y     P   + R    P+ F   +  
Sbjct: 66  ITKHNLEASMGLQTYDLSMNHMGDLTTEEIMQSYATLTPPA-DIQRA---PAPF-VGSGA 120

Query: 130 DVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD 189
           DVP S+DWR +G VT +K QG CGSCWAFSA  A+EG    T GKL++LS Q LVDCS  
Sbjct: 121 DVPVSVDWRLQGCVTSVKMQGSCGSCWAFSAAGALEGQLAKTTGKLVDLSPQNLVDCSLK 180

Query: 190 --NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKG 247
             N GC+GG MD+AF+Y+I+NKG+ +EA YPYR +   C +      AA  S+Y  LP+G
Sbjct: 181 YGNKGCNGGFMDRAFQYVIDNKGIDSEASYPYRGQLQQC-SYNPSYRAANCSRYSFLPEG 239

Query: 248 DEQALLQAVSN-QPVSVCVDASGRAFHFYKSGVLN-ADCGNNCDHGVAVVGFGTAEEENG 305
           DE AL  A++   P+SV +DA+   F FY+SGV N   C    +HGV  VG+GT   E+G
Sbjct: 240 DEGALKNALATIGPISVAIDATRPTFAFYRSGVYNDPTCTQRVNHGVLAVGYGT---ESG 296

Query: 306 AKYWLIKNSWGETWGESGYIRILRDAG-LCGIATAASYPV 344
             YWL+KNSWG ++G+ GYIR+ R+    CGIA   SYP+
Sbjct: 297 QDYWLVKNSWGTSFGDKGYIRMSRNKNDQCGIALYCSYPI 336


>gi|330805277|ref|XP_003290611.1| hypothetical protein DICPUDRAFT_81345 [Dictyostelium purpureum]
 gi|325079250|gb|EGC32859.1| hypothetical protein DICPUDRAFT_81345 [Dictyostelium purpureum]
          Length = 330

 Score =  261 bits (666), Expect = 5e-67,   Method: Compositional matrix adjust.
 Identities = 139/306 (45%), Positives = 188/306 (61%), Gaps = 17/306 (5%)

Query: 45  WMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYT 104
           WM +H R+Y    E   +   FK N+++I   N   N    LG  +F+DLTNEE+R +Y 
Sbjct: 36  WMKKHDRSYHHH-EFNNKYQAFKDNMDFIHNWNTNKNSKTVLGLTQFADLTNEEYRKIYL 94

Query: 105 GYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAV 164
           G    V      +     F   + T  P SIDWR KGAV+H+KDQGQCGSCW+FS   +V
Sbjct: 95  GTKVNV------APEKHNFNMIHFTG-PDSIDWRTKGAVSHVKDQGQCGSCWSFSTTGSV 147

Query: 165 EGITQITRGKLIELSEQQLVDCSTD--NHGCSGGLMDKAFEYIIENKGLATEADYPYRHE 222
           EG  QI  G ++ LSEQ LVDCS    N+GC GGLM  AF++I+   G+ATE  YPY   
Sbjct: 148 EGAHQIKTGNMVTLSEQNLVDCSGKFGNNGCDGGLMVNAFKFIMSQGGVATEDSYPYNAV 207

Query: 223 EGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLN- 281
           +G C   K   V A IS Y+++ +G E  L  A++ QPVS+ +DAS ++F  YKSGV + 
Sbjct: 208 QGKCKFTKS-MVGANISGYKEITQGSELELQAALTKQPVSIAIDASQQSFQLYKSGVYDE 266

Query: 282 ADCGN-NCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRDA-GLCGIATA 339
            +C +   DHGV  VG+GT   ENG  Y+++KNSW ++WG+ GYI + R+A   CG+AT 
Sbjct: 267 PECSSYQLDHGVLAVGYGT---ENGKDYYIVKNSWADSWGQDGYIFMSRNAKNQCGVATM 323

Query: 340 ASYPVA 345
           ASYP++
Sbjct: 324 ASYPIS 329


>gi|342675481|gb|AEL31666.1| cathepsin L [Cynoglossus semilaevis]
          Length = 336

 Score =  260 bits (665), Expect = 5e-67,   Method: Compositional matrix adjust.
 Identities = 146/345 (42%), Positives = 208/345 (60%), Gaps = 20/345 (5%)

Query: 10  IIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQN 69
           ++P+ ++ + V    S V+S  S+ +  + +  E W   H + Y  E E+  R  I+++N
Sbjct: 1   MLPLALLALGV----SAVLSAPSL-DARLSDHWELWKNWHSKKYH-EKEEGWRRMIWEKN 54

Query: 70  LEYIEKANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQ 126
           L  IE  N E   G  +Y+LG N F D+T+EEFR +  GY R     + + +  S F   
Sbjct: 55  LNKIELHNLEHSMGKHSYRLGMNHFGDMTHEEFRQIMNGYQRK----TERKAIGSLFMEP 110

Query: 127 NVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDC 186
           N    P+++DWREKG VT +KDQGQCGSCWAFS   A+ZG      GKL+ LSEQ LVDC
Sbjct: 111 NFMVAPSAVDWREKGYVTPVKDQGQCGSCWAFSTTGALZGQNFRKMGKLVSLSEQNLVDC 170

Query: 187 STD--NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDL 244
           S    N GC GGLMD+AF+Y+ +N+GL +E  YPY   +    +   K  +   + + D+
Sbjct: 171 SRPEGNEGCGGGLMDQAFQYVKDNQGLDSEDSYPYLGTDDQPCHYDPKYNSVNDTGFVDI 230

Query: 245 PKGDEQALLQAVSN-QPVSVCVDASGRAFHFYKSGV-LNADCGN-NCDHGVAVVGFG-TA 300
           P G E AL++AV++  PVSV +DA   +F FY+SG+    +C +   DHGV  VG+G   
Sbjct: 231 PSGKEHALMKAVASVGPVSVAIDAGHESFQFYQSGIYYEKECSSEELDHGVLAVGYGFEG 290

Query: 301 EEENGAKYWLIKNSWGETWGESGYIRILRD-AGLCGIATAASYPV 344
           E+ +G KYW++KNSW E WG+ GYI + +D    CGIATAASYP+
Sbjct: 291 EDVDGKKYWIVKNSWSEKWGDKGYIYMAKDRKNHCGIATAASYPL 335


>gi|149751225|ref|XP_001490531.1| PREDICTED: cathepsin S-like [Equus caballus]
          Length = 332

 Score =  260 bits (665), Expect = 5e-67,   Method: Compositional matrix adjust.
 Identities = 144/318 (45%), Positives = 197/318 (61%), Gaps = 18/318 (5%)

Query: 35  EPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKE---GNRTYKLGTNEF 91
           +P++    + W   +G+ YK++ E+  R  I+++NL+++   N E   G  +Y LG N  
Sbjct: 22  DPTLDNHWDLWKKTYGKQYKEKNEEVARRLIWERNLKFVMLHNLEHSMGMHSYDLGMNHL 81

Query: 92  SDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQ 151
            D+T+EE  +L +     VPS   Q  R  T+K      +P S+DWREKG VT +K QG 
Sbjct: 82  GDMTSEEVTSLMSSLR--VPS---QWQRNVTYKSNPNEKLPDSLDWREKGCVTEVKYQGS 136

Query: 152 CGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD---NHGCSGGLMDKAFEYIIEN 208
           CG+CWAFSAV A+E   ++  G L+ LS Q LVDCST+   N GC+GG M  AF+YII+N
Sbjct: 137 CGACWAFSAVGALEAQLKLKTGNLVSLSAQNLVDCSTEKYSNKGCNGGFMTAAFQYIIDN 196

Query: 209 KGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQ-PVSVCVDA 267
            G+ ++A YPY+  +G C     K  AAT SKY +LP G E  L +AV+N+ PVSV +DA
Sbjct: 197 NGIDSDASYPYKAMDGKC-RYDSKNRAATCSKYTELPFGSEDDLKEAVANKGPVSVAIDA 255

Query: 268 SGRAFHFYKSGV-LNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIR 326
           S  +F  YKSGV  +  C  N +HGV VVG+G     NG  YWL+KNSWG  +G+ GYIR
Sbjct: 256 SHPSFFLYKSGVYYDPSCTQNVNHGVLVVGYGNL---NGKDYWLVKNSWGINFGDKGYIR 312

Query: 327 ILRDAG-LCGIATAASYP 343
           + R++G  CGIA   SYP
Sbjct: 313 MARNSGNHCGIANYCSYP 330


>gi|81542|pir||S02728 actinidain (EC 3.4.22.14) precursor (clone pAC.1) - kiwi fruit
           (fragment)
 gi|15957|emb|CAA31435.1| actinidin precursor [Actinidia chinensis]
 gi|166319|gb|AAA32630.1| actinidin precursor [Actinidia deliciosa]
 gi|226542|prf||1601514A actinidin
          Length = 302

 Score =  260 bits (665), Expect = 5e-67,   Method: Compositional matrix adjust.
 Identities = 133/282 (47%), Positives = 181/282 (64%), Gaps = 16/282 (5%)

Query: 70  LEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
           L +I++ N + NR+YK+G N+F+DLT EEFR+ Y G+       S ++   + ++ +   
Sbjct: 1   LRFIDEHNADTNRSYKVGLNQFADLTGEEFRSTYLGFT----GGSNKTKVSNRYEPRVSQ 56

Query: 130 DVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDC--S 187
            +P+ +DWR  GAV  IK QG+CG CWAFSA+A VEGI +I  G LI LSEQ+L+ C  +
Sbjct: 57  VLPSYVDWRSAGAVVDIKSQGECGGCWAFSAIATVEGINKIVTGVLISLSEQELIGCGGT 116

Query: 188 TDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTC--DNQKEKAVAATISKYEDLP 245
            +  GC+GG +   F++II N G+ T  +YPY  ++G C  D Q EK V  TI  Y ++P
Sbjct: 117 QNTRGCNGGYITDGFQFIINNGGINTGENYPYTAQDGECNLDLQNEKYV--TIDTYGNVP 174

Query: 246 KGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENG 305
             +E AL  AV+ QPVSV +DA+G AF  Y SG+    CG   DH V +VG+GT   E G
Sbjct: 175 YNNEWALQTAVTYQPVSVALDAAGDAFKHYSSGIFTGPCGTAIDHAVTIVGYGT---EGG 231

Query: 306 AKYWLIKNSWGETWGESGYIRILRD---AGLCGIATAASYPV 344
             YW+++NSW  TWGE GY+RILR+   AG CGIAT  SYPV
Sbjct: 232 IDYWIVENSWDTTWGEEGYMRILRNVGGAGTCGIATMPSYPV 273


>gi|225718114|gb|ACO14903.1| Cathepsin L precursor [Caligus clemensi]
          Length = 336

 Score =  260 bits (665), Expect = 6e-67,   Method: Compositional matrix adjust.
 Identities = 146/340 (42%), Positives = 202/340 (59%), Gaps = 19/340 (5%)

Query: 14  FVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYI 73
            ++ +LVI   +  VS   +    ++   E W   HG+TY   +E+ +RL I+ +N   I
Sbjct: 6   LLLSVLVIASTANAVSFFDV----VLSDWESWKLMHGKTYSSSIEEKLRLKIYMENSLKI 61

Query: 74  EKANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTD 130
            + N E   G   Y +  N + DL + EF A+  GY       ++ +S   T+       
Sbjct: 62  SRHNSEALNGIHPYYMKMNHYGDLLHHEFVAMVNGYQY----ANKTASLGGTYIPNKNIQ 117

Query: 131 VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD- 189
           +PT +DWRE+GAVT +K+QGQCGSCW+FSA  A+EG      GKLI LSEQ LVDCS   
Sbjct: 118 LPTHVDWREEGAVTPVKNQGQCGSCWSFSATGALEGQDFRKTGKLISLSEQNLVDCSRKF 177

Query: 190 -NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGD 248
            N+GC GGLMD AF YI +NKG+ TEA YPY   +G C    +    + I  + D+ KG 
Sbjct: 178 GNNGCEGGLMDFAFTYIRDNKGIDTEASYPYEGIDGHCHYNPKNKGGSDIG-FVDIKKGS 236

Query: 249 EQALLQAVSN-QPVSVCVDASGRAFHFYKSGV-LNADCGN-NCDHGVAVVGFGTAEEENG 305
           E+ L +AV+   P+SV +DAS  +F FY  GV + + C +   DHGV VVGFGT +  +G
Sbjct: 237 EKDLKKAVAGVGPISVAIDASHMSFQFYSHGVYVESKCSSEELDHGVLVVGFGT-DSVSG 295

Query: 306 AKYWLIKNSWGETWGESGYIRILRDA-GLCGIATAASYPV 344
             YWL+KNSW E WG+ GYI++ R+   +CGIA++ASYPV
Sbjct: 296 EDYWLVKNSWSEKWGDQGYIKMARNKENMCGIASSASYPV 335


>gi|354473025|ref|XP_003498737.1| PREDICTED: cathepsin S-like [Cricetulus griseus]
          Length = 341

 Score =  260 bits (665), Expect = 6e-67,   Method: Compositional matrix adjust.
 Identities = 147/343 (42%), Positives = 208/343 (60%), Gaps = 22/343 (6%)

Query: 11  IPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNL 70
           I  ++  + ++ C   +   +   +P++    + W   HG+ YK++ E+  R  I+++NL
Sbjct: 9   ITRWLFWVPMVCC---LAGDQLQRDPTLDHHWDLWKKFHGKQYKEKNEEEARRLIWEKNL 65

Query: 71  EYIEKANKEGN---RTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQN 127
           + +   N E +    +Y LG N   D+T+EE      G  RP+  V  Q  R ST+K   
Sbjct: 66  KLVMLHNLEYSLEMHSYSLGMNHMGDMTSEEV----LGQMRPL-RVPSQRHRNSTYKSNP 120

Query: 128 VTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCS 187
              +P S+DWREKG VT +K QG CGSCWAFSAV A+E   ++  GKL+ LS Q LVDCS
Sbjct: 121 NQKLPDSMDWREKGCVTEVKYQGSCGSCWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCS 180

Query: 188 TD----NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYED 243
           T+    N GC GG M +AF+YII+N G+ ++A YPY+     C +   K+ AAT S+Y +
Sbjct: 181 TEEKYGNKGCDGGFMTRAFQYIIDNGGIDSDASYPYKAVAEKC-HYDSKSRAATCSRYME 239

Query: 244 LPKGDEQALLQAVSNQ-PVSVCVDASGRAFHFYKSGVLN-ADCGNNCDHGVAVVGFGTAE 301
           LP GDE+AL +AV+N+ PVSV +DAS  +F  YKSGV +   C  N +HGV VVG+G  +
Sbjct: 240 LPSGDEEALKEAVANKGPVSVGIDASHPSFFLYKSGVYDEPSCTENVNHGVLVVGYGNLD 299

Query: 302 EENGAKYWLIKNSWGETWGESGYIRILR-DAGLCGIATAASYP 343
              G  YWL+KNSWG  +G+ GYIR+ R +   CGIA+  SYP
Sbjct: 300 ---GKDYWLVKNSWGLHFGDQGYIRMARNNKNQCGIASYGSYP 339


>gi|328776427|ref|XP_625135.3| PREDICTED: cathepsin L-like [Apis mellifera]
          Length = 351

 Score =  260 bits (665), Expect = 6e-67,   Method: Compositional matrix adjust.
 Identities = 142/318 (44%), Positives = 196/318 (61%), Gaps = 19/318 (5%)

Query: 42  HEQWMA---QHGRTYKDELEKAMRLNIFKQNLEYIEKANKE---GNRTYKLGTNEFSDLT 95
           +++WM    +H + YK ++E+  R+ IF  N   I K N        +YKL  N++ D+ 
Sbjct: 31  NQEWMTFKMEHKKVYKSDVEERFRMKIFMDNKHKIAKHNSNYEMKKVSYKLKMNKYGDML 90

Query: 96  NEEFRALYTGYNRPVPSVSRQSSRP---STFKYQNVTDVPTSIDWREKGAVTHIKDQGQC 152
           + EF  +  G+N+ + +  R    P   S  +  NV  +P  +DWR++GAVT +KDQG C
Sbjct: 91  HHEFVNILNGFNKSINTQLRSERLPVGASFIEPANVV-LPKKVDWRKEGAVTPVKDQGHC 149

Query: 153 GSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD--NHGCSGGLMDKAFEYIIENKG 210
           GSCW+FSA  A+EG      G L+ LSEQ L+DCS    N+GC+GGLMD+AF+YI +NKG
Sbjct: 150 GSCWSFSATGALEGQHFRRTGVLVSLSEQNLIDCSGKYGNNGCNGGLMDQAFQYIKDNKG 209

Query: 211 LATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSN-QPVSVCVDASG 269
           L TEA YPY  E   C      + A  +  Y D+P GDE+ L  AV+   PVSV +DAS 
Sbjct: 210 LDTEASYPYEAENDKCRYNPANSGAIDVG-YIDIPTGDEKLLKAAVATIGPVSVAIDASH 268

Query: 270 RAFHFYKSGV-LNADCGN-NCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRI 327
           ++F FY  GV    +C +   DHGV V+G+GT   ENG  YWL+KNSWGETWG +GYI++
Sbjct: 269 QSFQFYSEGVYYEPECSSEELDHGVLVIGYGT--NENGQDYWLVKNSWGETWGNNGYIKM 326

Query: 328 LRDA-GLCGIATAASYPV 344
            R+    CGIA++ASYP+
Sbjct: 327 ARNKLNHCGIASSASYPL 344


>gi|410898132|ref|XP_003962552.1| PREDICTED: cathepsin L-like [Takifugu rubripes]
          Length = 335

 Score =  260 bits (665), Expect = 6e-67,   Method: Compositional matrix adjust.
 Identities = 146/314 (46%), Positives = 198/314 (63%), Gaps = 23/314 (7%)

Query: 45  WMAQHGRTYKDELEKAMRLNIFKQNLEYIEKAN---KEGNRTYKLGTNEFSDLTNEEFRA 101
           W  + GR+Y+   E+  R+ I+  N + +   N    +G ++Y+LG  +F+D+ NEE+++
Sbjct: 30  WKLKFGRSYRTPSEEVQRMQIWLNNRKLVLVHNILADQGIKSYRLGMTQFADMDNEEYKS 89

Query: 102 LYT-----GYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCW 156
           L +      +N   P   R+ S  + F+    T +PT++DWR+KG VT +KDQ QCGSCW
Sbjct: 90  LISLGCLRAFNTSAP---RRGS--AFFRLAEGTHLPTTVDWRDKGYVTGVKDQKQCGSCW 144

Query: 157 AFSAVAAVEGITQITRGKLIELSEQQLVDCSTD--NHGCSGGLMDKAFEYIIENKGLATE 214
           AFSA  ++EG      GKL+ LSEQQLVDCS D  N GC+GGLMD AF+YI EN G+ TE
Sbjct: 145 AFSATGSLEGQNFRKTGKLVSLSEQQLVDCSGDYGNMGCNGGLMDYAFKYIQENGGIDTE 204

Query: 215 ADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSN-QPVSVCVDASGRAFH 273
             YPY  E+G C  + E  V A  + Y D+  GDE AL +AV+   PVSV +DAS  +F 
Sbjct: 205 KSYPYEAEDGQCRFKPEN-VGAKCTGYVDVTVGDEDALKEAVATIGPVSVGIDASHSSFQ 263

Query: 274 FYKSGVLN-ADCGN-NCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRDA 331
            Y SGV +  DC + + DHGV  VG+GT   +NG  YWL+KNSWG  WG+ GYI + R+ 
Sbjct: 264 LYDSGVYDEQDCSSQDLDHGVLAVGYGT---DNGQDYWLVKNSWGLGWGQEGYIMMSRNK 320

Query: 332 -GLCGIATAASYPV 344
              CGIATAASYP+
Sbjct: 321 DNQCGIATAASYPL 334


>gi|334332720|ref|XP_001367595.2| PREDICTED: cathepsin L1-like [Monodelphis domestica]
          Length = 333

 Score =  260 bits (664), Expect = 8e-67,   Method: Compositional matrix adjust.
 Identities = 145/340 (42%), Positives = 210/340 (61%), Gaps = 18/340 (5%)

Query: 13  MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEY 72
           M   + L   C   +V+     + ++  +  QW AQH RTY    E   R   +++NL+ 
Sbjct: 1   MNFYLCLASLCLG-LVAATPEFDQTLDSQWHQWKAQHRRTYAAN-EDGWRRATWEKNLKM 58

Query: 73  IEKANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
           IE  N E   G  +++LG N+F D+T EEF+ +  GYN    + S++ ++ S ++   + 
Sbjct: 59  IEMHNLEYSAGKHSFQLGMNKFGDMTTEEFKQVMNGYNS---NGSQKRTKGSLYREPLLA 115

Query: 130 DVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD 189
            +P S+DWREKG VT +K+QGQCGSCWAFSA  ++EG       KL+ LSEQ LVDCST 
Sbjct: 116 QLPKSVDWREKGYVTPVKNQGQCGSCWAFSATGSLEGQWFHKTKKLVSLSEQNLVDCSTS 175

Query: 190 --NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKG 247
             N+GCSGGLMD AFEY+  N G+ TE  YPY  ++  C  + E    A ++ + D+P  
Sbjct: 176 EGNNGCSGGLMDNAFEYVKNNGGIDTEQAYPYLGQDNECKYRAE-CSGANVTGFVDIPSM 234

Query: 248 DEQALLQAVSN-QPVSVCVDASGRAFHFYKSGV-LNADCGNN-CDHGVAVVGFGTAEEEN 304
           +E+AL++AV+N  P+SV +DA   +F FY+SGV     C ++  DHGV VVG+G+  ++ 
Sbjct: 235 NERALMKAVANVGPISVAIDAGNPSFQFYESGVYYEPQCSSSQLDHGVLVVGYGSIGKD- 293

Query: 305 GAKYWLIKNSWGETWGESGYIRILR-DAGLCGIATAASYP 343
             +YW++KNSWGE WG+ GY+ + +     CGIATAASYP
Sbjct: 294 --EYWIVKNSWGEEWGKKGYVLMAKFRNNHCGIATAASYP 331


>gi|186701255|gb|ACC91281.1| putative cysteine proteinase [Capsella rubella]
          Length = 324

 Score =  260 bits (664), Expect = 8e-67,   Method: Compositional matrix adjust.
 Identities = 142/349 (40%), Positives = 210/349 (60%), Gaps = 52/349 (14%)

Query: 10  IIPMFVIIILVITCAS----QVVSG--RSMHEPSIVEKHEQWMAQHGRTYKDEL-EKAMR 62
           +I + ++II ++  +S     V SG  RS  E   +   + WM++HG+TY + L +K  R
Sbjct: 9   MITLSLLIIFLLPPSSAMDLSVTSGGLRSNEEVGFI--FQTWMSKHGKTYTNALGDKEQR 66

Query: 63  LNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPST 122
              FK NL +I++ N + N +Y+LG  +F+DLT +E++ L++G  RP+    +Q +   T
Sbjct: 67  FQNFKDNLRFIDQHNAK-NLSYRLGLTQFADLTVQEYQDLFSG--RPI---QKQKALRVT 120

Query: 123 FKYQNVTD--VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSE 180
            +Y  + +  +P S+DWR+KGAV+ IKDQG+C           VE I +I  G+LI LSE
Sbjct: 121 HRYVPLAEDQLPQSVDWRQKGAVSEIKDQGRC----------TVESINKIVTGELISLSE 170

Query: 181 QQLVDCSTDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCD-NQKEKAVAATIS 239
           Q+LVDCS DNHGC+GGLMD AF+++I N GL  ++DYPY+  +G C+ NQ        I 
Sbjct: 171 QELVDCSIDNHGCNGGLMDSAFQFLINNNGLEYQSDYPYQAVQGYCNHNQNTSKKVIKID 230

Query: 240 KYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGT 299
            YED+P  +E +L +AV++QP                 G+    CG + DH V +VG+GT
Sbjct: 231 GYEDVPANNENSLQKAVAHQP-----------------GIYTGPCGTDLDHAVVIVGYGT 273

Query: 300 AEEENGAKYWLIKNSWGETWGESGYIRILRD----AGLCGIATAASYPV 344
              ENG  YW+++NSWG  WGE+GY +I R+     G+CGIA  ASYP+
Sbjct: 274 ---ENGQDYWIVRNSWGTVWGEAGYAKIARNFENPTGVCGIAMVASYPI 319


>gi|75067394|sp|Q9GKL8.1|CATL1_CERAE RecName: Full=Cathepsin L1; AltName: Full=Major excreted protein;
           Short=MEP; Contains: RecName: Full=Cathepsin L1 heavy
           chain; Contains: RecName: Full=Cathepsin L1 light chain;
           Flags: Precursor
 gi|11493685|gb|AAG35605.1|AF201700_1 cysteine protease [Chlorocebus aethiops]
          Length = 333

 Score =  260 bits (664), Expect = 8e-67,   Method: Compositional matrix adjust.
 Identities = 145/343 (42%), Positives = 207/343 (60%), Gaps = 23/343 (6%)

Query: 12  PMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLE 71
           P F++  L +  AS  ++       S+  +  +W A H R Y    E+  R  ++++N++
Sbjct: 3   PTFILAALCLGIASATLT----FNHSLEAQWTKWKAMHNRLYGMN-EEGWRRAVWEKNMK 57

Query: 72  YIEKANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNV 128
            IE  N+E   G  ++ +  N F D+T+EEFR +  G+       +R+  +   F+    
Sbjct: 58  MIELHNQEYSQGKHSFTMAMNTFGDMTSEEFRQVMNGFQ------NRKPRKGKVFQEPLF 111

Query: 129 TDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCS- 187
            + P S+DWREKG VT +K+QGQCGSCWAFSA  A+EG      GKL+ LSEQ LVDCS 
Sbjct: 112 YEAPRSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSG 171

Query: 188 -TDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPK 246
              N GC+GGLMD AF+Y+ +N GL +E  YPY   E +C    E +V A  + + D+PK
Sbjct: 172 PQGNEGCNGGLMDYAFQYVADNGGLDSEESYPYEATEESCKYNPEYSV-ANDTGFVDIPK 230

Query: 247 GDEQALLQAVSN-QPVSVCVDASGRAFHFYKSGV-LNADCGN-NCDHGVAVVGFG-TAEE 302
             E+AL++AV+   P+SV +DA   +F FYK G+    DC + + DHGV VVG+G  + E
Sbjct: 231 -QEKALMKAVATVGPISVAIDAGHESFMFYKEGIYFEPDCSSEDMDHGVLVVGYGFESTE 289

Query: 303 ENGAKYWLIKNSWGETWGESGYIRILRD-AGLCGIATAASYPV 344
            + +KYWL+KNSWGE WG  GYI++ +D    CGIA+AASYP 
Sbjct: 290 SDNSKYWLVKNSWGEEWGMGGYIKMAKDRRNHCGIASAASYPT 332


>gi|157093563|gb|ABV22436.1| cysteine proteinase [Oxyrrhis marina]
          Length = 329

 Score =  260 bits (664), Expect = 8e-67,   Method: Compositional matrix adjust.
 Identities = 134/316 (42%), Positives = 191/316 (60%), Gaps = 20/316 (6%)

Query: 38  IVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNE 97
           I  + E++ A+ G +Y  E E+A R  +F QN++ I + N +G+ TY LG N+F+DLT E
Sbjct: 15  IDAQWEEFKAKFGESYNGEEEEAERKGVFAQNVQLINEENSKGH-TYTLGVNQFADLTVE 73

Query: 98  EFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTD---VPTSIDWREKGAVTHIKDQGQCGS 154
           EF   Y G+ +P      Q    + +  ++V +   +PTS+DW  +GAVT +K+QGQCGS
Sbjct: 74  EFSKTYMGFKKPA-----QKYGDAAYLGRHVYNGEALPTSVDWSSQGAVTPVKNQGQCGS 128

Query: 155 CWAFSAVAAVEGITQITRGKLIELSEQQLVDCST--DNHGCSGGLMDKAFEYIIENKGLA 212
           CW+FS   ++EG  +I+ GKL+ LSEQQ VDC+    N GC+GGLMD AF+Y  E   L 
Sbjct: 129 CWSFSTTGSLEGANEISTGKLVSLSEQQFVDCAGTYGNQGCNGGLMDSAFKY-AEANALC 187

Query: 213 TEADYPYRHEEGTCDNQ--KEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGR 270
           TE  YPY+  +G+C            ++S Y+D+    EQ ++ AV+ QPVS+ ++A   
Sbjct: 188 TEQSYPYKGTDGSCQASSCSTGLAKGSVSGYKDVSSDSEQDMMSAVAQQPVSIAIEADKS 247

Query: 271 AFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILR- 329
            F  Y  GVL   CG + DHGV  VG+GT    +G  YW +KNSWG TWG SGY+ + R 
Sbjct: 248 VFQLYSGGVLTGACGASLDHGVLAVGYGTL---SGTDYWKVKNSWGSTWGMSGYVLLQRG 304

Query: 330 --DAGLCGIATAASYP 343
              +G CG+ +  SYP
Sbjct: 305 KGGSGECGLLSEPSYP 320


>gi|157278115|ref|NP_001098156.1| cathepsin L precursor [Oryzias latipes]
 gi|50251128|dbj|BAD27581.1| cathepsin L [Oryzias latipes]
          Length = 336

 Score =  260 bits (664), Expect = 8e-67,   Method: Compositional matrix adjust.
 Identities = 139/320 (43%), Positives = 199/320 (62%), Gaps = 15/320 (4%)

Query: 35  EPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKE---GNRTYKLGTNEF 91
           +P + +  + W   H + Y ++ E   RL ++++NL  IE  N E   G  +Y+LG N F
Sbjct: 21  DPQLDQHWQLWKGWHSKNYHEKEEGWRRL-VWEKNLRKIELHNLEHSMGKHSYRLGMNHF 79

Query: 92  SDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQ 151
            D+T+EEFR +  GY R      ++    S F   N  + P ++DWR+KG VT +KDQGQ
Sbjct: 80  GDMTHEEFRQIMNGYKR----REQRKYSGSLFMEPNFLEAPRAVDWRDKGYVTPVKDQGQ 135

Query: 152 CGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD--NHGCSGGLMDKAFEYIIENK 209
           CGSCWAFS   A+EG      GKL+ LSEQ LVDCS    N GC+GGLMD+AF+Y+ +N+
Sbjct: 136 CGSCWAFSTTGALEGQQFRKTGKLVSLSEQNLVDCSRPEGNEGCNGGLMDQAFQYVKDNQ 195

Query: 210 GLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSN-QPVSVCVDAS 268
           GL +E  YPY+  +        +  A   + + D+P G E+AL++AV++  PVSV +DA 
Sbjct: 196 GLDSEDFYPYKGTDDQPCQYNAQYSAVNDTGFVDIPSGKERALMKAVASVGPVSVAIDAG 255

Query: 269 GRAFHFYKSGV-LNADCGNN-CDHGVAVVGFG-TAEEENGAKYWLIKNSWGETWGESGYI 325
             +F FY+SG+    +C ++  DHGV VVG+G   E+ +G KYW++KNSW E WG+ G+I
Sbjct: 256 HESFQFYQSGIYFEKECSSDELDHGVLVVGYGFEGEDVDGKKYWIVKNSWSEKWGDKGFI 315

Query: 326 RILRDA-GLCGIATAASYPV 344
            + +D    CGIATAASYP+
Sbjct: 316 YMAKDRHNHCGIATAASYPL 335


>gi|380790141|gb|AFE66946.1| cathepsin L1 preproprotein [Macaca mulatta]
 gi|384939708|gb|AFI33459.1| cathepsin L1 preproprotein [Macaca mulatta]
          Length = 333

 Score =  260 bits (664), Expect = 8e-67,   Method: Compositional matrix adjust.
 Identities = 145/343 (42%), Positives = 206/343 (60%), Gaps = 23/343 (6%)

Query: 12  PMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLE 71
           P F++    +  AS  ++       S+  +  +W A H R Y    E+  R  ++++N++
Sbjct: 3   PTFILAAFCLGIASATLT----FNHSLEAQWTKWKAMHNRLYGMN-EEGWRRAVWEKNMK 57

Query: 72  YIEKANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNV 128
            IE  N+E   G  ++ +  N F D+T+EEFR L  G+       +R+  +   F+    
Sbjct: 58  MIELHNQEYSQGKHSFTMAMNTFGDMTSEEFRQLMNGFQ------NRKPRKGKVFQEPLF 111

Query: 129 TDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCS- 187
            + P S+DWREKG VT +K+QGQCGSCWAFSA  A+EG      GKL+ LSEQ LVDCS 
Sbjct: 112 YEAPRSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSG 171

Query: 188 -TDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPK 246
              N GC+GGLMD AF+Y+ +N GL +E  YPY   E +C    E +V A  + + D+PK
Sbjct: 172 PQGNEGCNGGLMDYAFQYVADNGGLDSEESYPYEATEESCKYNPEYSV-ANDTGFVDIPK 230

Query: 247 GDEQALLQAVSN-QPVSVCVDASGRAFHFYKSGV-LNADCGN-NCDHGVAVVGFG-TAEE 302
             E+AL++AV+   P+SV +DA   +F FYK G+    DC + + DHGV VVG+G  + E
Sbjct: 231 -QEKALMKAVATVGPISVAIDAGHESFMFYKEGIYFEPDCSSEDMDHGVLVVGYGFESTE 289

Query: 303 ENGAKYWLIKNSWGETWGESGYIRILRD-AGLCGIATAASYPV 344
            + +KYWL+KNSWGE WG  GYI++ +D    CGIA+AASYP 
Sbjct: 290 SDNSKYWLVKNSWGEEWGMGGYIKMAKDRRNHCGIASAASYPT 332


>gi|332375975|gb|AEE63128.1| unknown [Dendroctonus ponderosae]
          Length = 338

 Score =  260 bits (664), Expect = 9e-67,   Method: Compositional matrix adjust.
 Identities = 146/340 (42%), Positives = 201/340 (59%), Gaps = 16/340 (4%)

Query: 15  VIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIE 74
           +++++ I  A Q VS   +    + E+   +  QH + Y+ E E+  R+ IF  N   + 
Sbjct: 4   LVLLVTIAVACQAVSFSEL----VQEQWNSFKVQHKKQYESETEERFRMKIFMDNSHKVA 59

Query: 75  KANK---EGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPS-TFKYQNVTD 130
           K NK   +G   YKL  N++ DL + EF  L  G+NR    + R   + S TF      D
Sbjct: 60  KHNKLFEQGLYPYKLAMNKYGDLLHHEFVGLLNGFNRTKTYLKRGELQDSITFIEPAHVD 119

Query: 131 VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST-- 188
           +P ++DWR++GAVT +KDQG CGSCW+FSA  A+EG       KL+ LSEQ LVDCS+  
Sbjct: 120 IPDTVDWRQEGAVTPVKDQGHCGSCWSFSATGALEGQHFRQTKKLVSLSEQNLVDCSSRF 179

Query: 189 DNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGD 248
            N+GC+GGLMD AF YI  N G+ TEA YPY  E+        K   AT   + D+P GD
Sbjct: 180 GNNGCNGGLMDNAFRYIKNNGGIDTEAAYPYMGEDEKF-RYSAKNRGATDKGFVDIPSGD 238

Query: 249 EQALLQAVSN-QPVSVCVDASGRAFHFYKSGVL-NADCGNN-CDHGVAVVGFGTAEEENG 305
           E  L  AV+   P+S+ +DAS  +F  Y +GV  +  C +   DHGV VVG+GT +E+ G
Sbjct: 239 EDKLKAAVATVGPISIAIDASHESFQLYSNGVYSDPTCSSTELDHGVLVVGYGT-DEKTG 297

Query: 306 AKYWLIKNSWGETWGESGYIRILRDA-GLCGIATAASYPV 344
             YWL+KNSWG+TWG  GYI++ R+    CG+AT ASYP+
Sbjct: 298 MDYWLVKNSWGDTWGLDGYIKMARNQDNQCGVATQASYPL 337


>gi|325185016|emb|CCA19507.1| cysteine protease family C01A putative [Albugo laibachii Nc14]
          Length = 492

 Score =  259 bits (663), Expect = 9e-67,   Method: Compositional matrix adjust.
 Identities = 143/339 (42%), Positives = 189/339 (55%), Gaps = 40/339 (11%)

Query: 15  VIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIE 74
            +I L +  A     G++  E         W+  H  T+ D  E A RL  +  N  YI 
Sbjct: 8   TLIALSLLFAQNRADGKTFKEYE--SDFVSWLKTHHLTFSDAFEYAKRLETYIANDIYIL 65

Query: 75  KANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSR---QSSRPSTFKYQNVTDV 131
             N +   ++KLG N FS LTNEEFR  + G+      +++   QS+  S+  +Q + D+
Sbjct: 66  THNLQ-ESSFKLGHNAFSHLTNEEFRQRFNGFKASDDYLTKRLAQSNVASSTNFQYI-DL 123

Query: 132 PTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD-N 190
           P S+DW EKGAVT +K+QG CGSCWAFS   A+EG T I+ GKL+ LSEQ+LVDC  + +
Sbjct: 124 PESVDWVEKGAVTGVKNQGMCGSCWAFSTTGAIEGATFISSGKLVSLSEQELVDCDHNGD 183

Query: 191 HGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQ 250
           HGC+GGLMD AF +I E+ G+ +E DY Y H +  C + K                    
Sbjct: 184 HGCNGGLMDHAFSWISEHDGICSEEDYAYIHSQSLCRSCKPVV----------------- 226

Query: 251 ALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWL 310
                    PV+V +DA  R+F FY+SGV N  CG   DHGV  VG+G    E+G KYW 
Sbjct: 227 --------SPVAVAIDAGDRSFQFYQSGVYNKTCGTQLDHGVLTVGYGV---EDGQKYWK 275

Query: 311 IKNSWGETWGESGYIRILRD----AGLCGIATAASYPVA 345
           +KNSWG +WGE GYIR+ RD    +G CGIA   SYP A
Sbjct: 276 VKNSWGNSWGEKGYIRLSRDQNGRSGQCGIAMVPSYPTA 314


>gi|326531188|dbj|BAK04945.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 360

 Score =  259 bits (663), Expect = 9e-67,   Method: Compositional matrix adjust.
 Identities = 141/350 (40%), Positives = 200/350 (57%), Gaps = 20/350 (5%)

Query: 12  PMFVIIILVITCASQVVSGRSMH--EPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQN 69
           P+     +++  A+   SGR +   +  ++++   W A H ++Y+   E+  R  +++ N
Sbjct: 10  PVITASTILLAWAAAAASGRGVDVGDMLMMDRFLMWQATHNQSYRSAEERLRRFQVYRDN 69

Query: 70  LEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKY---- 125
           +EYIE  N+ G+ TY+LG N+F+DLT EEF A +T YN          S  +T       
Sbjct: 70  VEYIETTNRRGDLTYQLGENQFADLTREEFIARFTSYNGDDDRTGDDDSVITTAAVGGGD 129

Query: 126 --------QNVTDVPTSIDWREKGAVTHIKDQGQCGSC-WAFSAVAAVEGITQITRGKLI 176
                    +V+  P S+DWR KGAV   K Q    S  WAF AVA +E +  I  GKL+
Sbjct: 130 PDLWSSGGDDVSLDPPSVDWRAKGAVVPPKSQSSSCSSSWAFVAVATIESLHAIKTGKLV 189

Query: 177 ELSEQQLVDCSTDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAA 236
            LSEQQLVDC   + GC+ G   +AF ++I+N GL TEA+YPY   +GTC++ K     A
Sbjct: 190 ALSEQQLVDCDQYDGGCNRGTFRRAFHWVIQNGGLTTEAEYPYTAAQGTCNSAKSDHHVA 249

Query: 237 TISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVG 296
            IS +  +P  +E A+  AV+ QPV+  ++  G    FYKSGV +  CG   +H V VVG
Sbjct: 250 AISGHASVPGSNELAMKHAVATQPVAAAIEL-GSDMQFYKSGVYSGPCGARLEHAVTVVG 308

Query: 297 FGTAEEENGAKYWLIKNSWGETWGESGYIRILRD---AGLCGIATAASYP 343
           +G A+E  G KYW++KNSWG+TWGE GYIR+ R     GLCGI    +YP
Sbjct: 309 YG-ADESTGDKYWIVKNSWGQTWGERGYIRMQRKILGPGLCGIMLDVAYP 357


>gi|242093944|ref|XP_002437462.1| hypothetical protein SORBIDRAFT_10g027570 [Sorghum bicolor]
 gi|241915685|gb|EER88829.1| hypothetical protein SORBIDRAFT_10g027570 [Sorghum bicolor]
          Length = 366

 Score =  259 bits (663), Expect = 1e-66,   Method: Compositional matrix adjust.
 Identities = 141/331 (42%), Positives = 184/331 (55%), Gaps = 28/331 (8%)

Query: 35  EPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDL 94
           E S+   +E+W A H    +D  EK  R ++FK+N   I + N +GN TY LG N FSD+
Sbjct: 41  EESLWALYERWCA-HYNMARDHGEKTRRFDLFKENARRIYEHNHQGNATYTLGLNRFSDM 99

Query: 95  TNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTD----------------VPTSIDWR 138
           T+EEF     G     P +S          +    D                 P ++DWR
Sbjct: 100 TDEEFNRSPYGGCLTAPRMSDDEIEELHHHHHQQEDDGSFNLTHGSGGGKLGAPPAVDWR 159

Query: 139 EKGAVTHIKDQGQ-CGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTDNHGCSGGL 197
            + AVT +KDQG  CGSCWAFSA+AAVEGI  I    L+ LSEQQLVDC   NHGC+GGL
Sbjct: 160 GR-AVTRVKDQGPTCGSCWAFSAIAAVEGINAIRTRNLVPLSEQQLVDCDKLNHGCNGGL 218

Query: 198 MDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVS 257
           M  AF +++ N+G+  E  YPY   EG C +     V  TI  Y+ +P+ D  AL+ AV+
Sbjct: 219 MTTAFSFVVRNRGVVPEGAYPYMGREGRCKHVMAPPV--TIYGYQRVPRFDANALMNAVA 276

Query: 258 NQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGE 317
            QPVSV ++AS   F  Y+ GV N +CG    H    VG+G    + G  +W++KNSWG 
Sbjct: 277 AQPVSVAIEASSFEFRHYQGGVFNGNCGGRLGHAATAVGYGA---DAGGPFWIVKNSWGP 333

Query: 318 TWGESGYIRILRDA----GLCGIATAASYPV 344
            WGE GY+RI R+     G+CGI T  SYPV
Sbjct: 334 GWGEGGYVRISRNTPVRQGVCGILTENSYPV 364


>gi|449683741|ref|XP_002155462.2| PREDICTED: cathepsin L-like [Hydra magnipapillata]
          Length = 324

 Score =  259 bits (662), Expect = 1e-66,   Method: Compositional matrix adjust.
 Identities = 147/340 (43%), Positives = 204/340 (60%), Gaps = 23/340 (6%)

Query: 11  IPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNL 70
           + +F  ++L+    + ++       P+  +   +W   H + Y  + E+ +R  I+K N 
Sbjct: 1   MKVFCALLLLGVTLAYII-----ERPTEDDSWIRWKMAHNKAYSHDGEETVRYTIWKDNE 55

Query: 71  EYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTD 130
             I + N +G   + L  N+F D+TN EF+  + GY      +S +    STF   N   
Sbjct: 56  RRIREHNLQGG-DFLLEMNQFGDMTNNEFKD-FNGY------LSHKHVSGSTFLTPNSFV 107

Query: 131 VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST-- 188
            P S+DWR +G VT +KDQGQCGSCWAFS   ++EG      GKL+ LSEQ LVDCST  
Sbjct: 108 APDSVDWRNEGYVTPVKDQGQCGSCWAFSTTGSLEGQNFKKTGKLVSLSEQNLVDCSTAY 167

Query: 189 DNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGD 248
            N+GC+GGLMD AF YI EN G+ +EA YPY  ++G C   K   VAAT + + D+P GD
Sbjct: 168 GNNGCNGGLMDNAFTYIKENNGIDSEASYPYTAKDGKCAFTKPN-VAATDTGFVDIPSGD 226

Query: 249 EQALLQAVSN-QPVSVCVDASGRAFHFYKSGVLNA-DCGNN-CDHGVAVVGFGTAEEENG 305
           E  L +AV++  P+SV +DAS  +F FY+ GV N   C +   DHGV VVG+GT   E+G
Sbjct: 227 ENKLKEAVASVGPISVAIDASHFSFQFYRKGVYNERKCSSTELDHGVLVVGYGT---ESG 283

Query: 306 AKYWLIKNSWGETWGESGYIRILRDA-GLCGIATAASYPV 344
             YWL+KNSW  +WG+ GYI++ R+A   CGIAT ASYP+
Sbjct: 284 KDYWLVKNSWNTSWGDKGYIKMSRNAKNQCGIATNASYPL 323


>gi|18858809|ref|NP_571273.1| cathepsin L, 1 b precursor [Danio rerio]
 gi|1752664|emb|CAA69623.1| cathepsin L [Danio rerio]
          Length = 336

 Score =  259 bits (662), Expect = 1e-66,   Method: Compositional matrix adjust.
 Identities = 141/341 (41%), Positives = 204/341 (59%), Gaps = 16/341 (4%)

Query: 14  FVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYI 73
            +  +LV    S V +  S+ +  + +    W +QHG++Y +++E   R+ I+++NL  I
Sbjct: 1   MMFALLVTLYISAVFAAPSI-DIQLDDHWNSWKSQHGKSYHEDVEVGRRM-IWEENLRKI 58

Query: 74  EKANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTD 130
           E+ N E   GN T+K+G N+F D+TNEEFR    GY         Q+S+   F   +   
Sbjct: 59  EQHNFEYSYGNHTFKMGMNQFGDMTNEEFRQAMNGYTHD----PNQTSQGPLFMEPSFFA 114

Query: 131 VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD- 189
            P  +DWR++G VT +KDQ QCGSCW+FS+  A+EG      GKLI +SEQ LVDCS   
Sbjct: 115 APQQVDWRQRGYVTPVKDQKQCGSCWSFSSTGALEGQLFRKTGKLISMSEQNLVDCSRPQ 174

Query: 190 -NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGD 248
            N GC+GGLMD+AF+Y+ ENKGL +E  YPY   +        +   A I+ + D+P G+
Sbjct: 175 GNQGCNGGLMDQAFQYVKENKGLDSEQSYPYLARDDLPCRYDPRFNVAKITGFVDIPSGN 234

Query: 249 EQALLQAVSN-QPVSVCVDASGRAFHFYKSGVL--NADCGNNCDHGVAVVGFG-TAEEEN 304
           E AL+ AV+   PVSV +DAS ++  FY+SG+    A   +  DH V VVG+G    +  
Sbjct: 235 ELALMNAVAAVGPVSVAIDASHQSLQFYQSGIYYERACSSSRLDHAVLVVGYGYQGADVA 294

Query: 305 GAKYWLIKNSWGETWGESGYIRILRDA-GLCGIATAASYPV 344
           G +YW++KNSW + WG+ GYI + +D    CG+AT ASYP+
Sbjct: 295 GNRYWIVKNSWSDKWGDKGYIYMAKDKNNHCGVATKASYPL 335


>gi|354549232|gb|AER27707.1| putative cysteine protease [Phytophthora sp. SH-2011]
          Length = 533

 Score =  259 bits (662), Expect = 1e-66,   Method: Compositional matrix adjust.
 Identities = 138/307 (44%), Positives = 190/307 (61%), Gaps = 13/307 (4%)

Query: 45  WMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRT-YKLGTNEFSDLTNEEFRALY 103
           WM  HG T+ D LE A RL  +  N  YI + N E   T   LG N FS ++ +EF+   
Sbjct: 31  WMGAHGVTFSDALEFARRLENYIVNDMYIMEHNAENAWTGVTLGHNAFSHMSFDEFKFKM 90

Query: 104 TGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAA 163
           TG   P   + ++ +      + +V +VP+++DW +KG VT +K+QG CGSCWAFS   A
Sbjct: 91  TGLVLPEGYLEQRLASRVDGLWSDV-EVPSAVDWVDKGGVTPVKNQGMCGSCWAFSTTGA 149

Query: 164 VEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGLATEADYPYRHE 222
           VEG T ++ GKL  LSEQ+LVDC  + + GC+GGLMD AF++I ++ G+ +E DY Y+ +
Sbjct: 150 VEGATFVSSGKLPSLSEQELVDCDHNGDMGCNGGLMDHAFQWIEDHGGICSEDDYEYKAK 209

Query: 223 EGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNA 282
              C   +E      ++ ++D+   DE AL  AV+ QPVSV ++A  +AF FYKSGV N 
Sbjct: 210 AQVC---RECDSVVKVTGFQDVNPQDEHALKVAVAQQPVSVAIEADQKAFQFYKSGVFNL 266

Query: 283 DCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD----AGLCGIAT 338
            CG   DHGV  VG+G    +NG K+W +KNSWG +WGE GYIR+ R+    AG CGIA+
Sbjct: 267 TCGTRLDHGVLAVGYGN---DNGHKFWKVKNSWGASWGEQGYIRLAREENGPAGQCGIAS 323

Query: 339 AASYPVA 345
             SYP A
Sbjct: 324 VPSYPFA 330


>gi|291398027|ref|XP_002715626.1| PREDICTED: cathepsin S [Oryctolagus cuniculus]
          Length = 331

 Score =  259 bits (662), Expect = 1e-66,   Method: Compositional matrix adjust.
 Identities = 144/341 (42%), Positives = 205/341 (60%), Gaps = 22/341 (6%)

Query: 13  MFVIIILVITCASQVVSGRSMHEPSIVEKHEQ-WMAQHGRTYKDELEKAMRLNIFKQNLE 71
           M  ++  ++ C+S V     +H    ++ H   W   +G+ YK++ E+A R  I+++NL+
Sbjct: 1   MKWLVWALLVCSSTVAQ---LHRDPTLDHHWHLWKKAYGKQYKEKNEEAARRLIWEKNLK 57

Query: 72  YIEKANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNV 128
           ++   N E   G  +Y +G N  +D+T+EE  +L +    P      Q  R  T+K    
Sbjct: 58  FVTLHNLEHSMGMHSYDVGMNHLADMTSEEVVSLMSSLRIP-----HQWPRNVTYKLNPN 112

Query: 129 TDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST 188
             +P S+DWRE+G VT +K QG CG+CWAFSAV A+E   ++  G L+ LS Q LVDCST
Sbjct: 113 QKLPDSVDWRERGCVTEVKYQGSCGACWAFSAVGALEAQLKLKTGNLVSLSAQNLVDCST 172

Query: 189 D---NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLP 245
               N GC+GG M +AF+YII+N G+ +EA YPY+  +  C +   K  AAT SKY +LP
Sbjct: 173 TKYGNKGCNGGFMTEAFQYIIDNNGIDSEASYPYKAMDQKC-HYDSKHRAATCSKYTELP 231

Query: 246 KGDEQALLQAVSNQ-PVSVCVDASGRAFHFYKSGV-LNADCGNNCDHGVAVVGFGTAEEE 303
            G E+AL +AV+N+ PVSV +DAS  +F  Y+SGV     C  N +HGV  VG+G  +  
Sbjct: 232 FGSEEALKEAVANKGPVSVAIDASHSSFFLYRSGVYYEPSCTQNVNHGVLAVGYGNLK-- 289

Query: 304 NGAKYWLIKNSWGETWGESGYIRILRDA-GLCGIATAASYP 343
            G  YWL+KNSWG  +GE GYIR+ R++   CGIA   SYP
Sbjct: 290 -GKDYWLVKNSWGIHFGEQGYIRMARNSKNHCGIANYPSYP 329


>gi|422001787|dbj|BAM66994.1| germination-specific cysteine protease 1, partial [Raphanus
           sativus]
          Length = 235

 Score =  259 bits (662), Expect = 1e-66,   Method: Compositional matrix adjust.
 Identities = 122/220 (55%), Positives = 154/220 (70%), Gaps = 9/220 (4%)

Query: 131 VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD- 189
           +P ++DWR+KGAV  IK+QG CGSCWAFS  A VEGI +I  G+LI LSEQ+LVDC    
Sbjct: 4   LPETVDWRQKGAVNAIKNQGTCGSCWAFSTAAVVEGINKIVTGELISLSEQELVDCDKSY 63

Query: 190 NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDE 249
           N GC+GGLMD AF++I++N GL TE DYPYR  +G C++  + +   TI  YED+P  DE
Sbjct: 64  NQGCNGGLMDYAFQFIMKNGGLNTEQDYPYRGSDGKCNSLLKNSKVVTIDGYEDVPTNDE 123

Query: 250 QALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYW 309
            AL +AVS QPVSV +DA GR F  Y+SG+   +CG   DH V  VG+G+   ENG  YW
Sbjct: 124 TALKRAVSYQPVSVAIDAGGRVFQHYQSGIFTGECGTKMDHAVVAVGYGS---ENGVDYW 180

Query: 310 LIKNSWGETWGESGYIRILRD-----AGLCGIATAASYPV 344
           +++NSWG+ WGE GYIRI R+     +G CGIA  ASYPV
Sbjct: 181 IVRNSWGQKWGEDGYIRIERNLASSKSGKCGIAIEASYPV 220


>gi|402770499|gb|AFQ98384.1| cathepsin L, partial [Hyalomma anatolicum anatolicum]
          Length = 312

 Score =  259 bits (662), Expect = 1e-66,   Method: Compositional matrix adjust.
 Identities = 146/316 (46%), Positives = 196/316 (62%), Gaps = 25/316 (7%)

Query: 44  QWMA---QHGRTYKDELEKAMRLNIFKQNLEYIEKANKE---GNRTYKLGTNEFSDLTNE 97
           QW A    H ++Y+ ++E+ +R  IF +N   I K N +   G  +YKLG N+F DL   
Sbjct: 6   QWEAFKTTHKKSYQSKMEELLRYKIFTENSLLIAKHNAKYAKGLVSYKLGMNQFGDLLPH 65

Query: 98  EFRALYTGYNRPVPSVSRQSSRPSTF-KYQNVTD--VPTSIDWREKGAVTHIKDQGQCGS 154
           EF  ++ GY+        +  R STF    NV D  +P ++DWR+KGAVT +KDQGQCGS
Sbjct: 66  EFAKMFNGYH------GERKGRGSTFLPPANVNDSSLPKTVDWRKKGAVTPVKDQGQCGS 119

Query: 155 CWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD--NHGCSGGLMDKAFEYIIENKGLA 212
           CWAFSA  ++EG   +  GKL+ LSEQ L+DCS    N GC GGLMD AF+YI  N G+ 
Sbjct: 120 CWAFSATGSLEGQHFLKSGKLVSLSEQNLIDCSGSFGNEGCGGGLMDNAFKYIKANDGID 179

Query: 213 TEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSN-QPVSVCVDASGRA 271
           TE  YPY   +G C  +KE  V AT + + D+ +G E  L +AV+   P+SV +DAS  +
Sbjct: 180 TEESYPYEAMDGDCRFKKED-VGATDTGFVDIQQGSEDDLQKAVATVGPISVAIDASHSS 238

Query: 272 FHFYKSGVLN-ADCGN-NCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILR 329
           F  Y  GV +  +C +   DHGV  VG+G    +NG KYWL+KNSW ETWG++GYI + R
Sbjct: 239 FQLYSEGVYDEPNCSSEELDHGVLAVGYGV---KNGKKYWLVKNSWAETWGDNGYILMSR 295

Query: 330 DA-GLCGIATAASYPV 344
           D    CGIA++ASYP+
Sbjct: 296 DKDNQCGIASSASYPL 311


>gi|47086859|ref|NP_997749.1| cathepsin L, 1 a precursor [Danio rerio]
 gi|42542930|gb|AAH66490.1| Cathepsin L1, a [Danio rerio]
          Length = 337

 Score =  259 bits (662), Expect = 1e-66,   Method: Compositional matrix adjust.
 Identities = 145/342 (42%), Positives = 202/342 (59%), Gaps = 16/342 (4%)

Query: 13  MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEY 72
           M V +     C S V +  ++ +  + +  +QW   H + Y    E+  R  I+++NL+ 
Sbjct: 1   MRVFLAAFTLCLSAVFAAPTL-DQQLNDHWDQWKKWHSKKYH-ATEEGWRRVIWEKNLKK 58

Query: 73  IEKANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
           IE  N E   G  TY+LG N F D+T+EEFR +  G+         +  R S F   N  
Sbjct: 59  IEMHNLEHSMGIHTYRLGMNHFGDMTHEEFRQVMNGFKHK----KDRRFRGSLFMEPNFI 114

Query: 130 DVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD 189
           +VP  +DWREKG VT +KDQG+CGSCWAFS   A+EG      GKL+ LSEQ LVDCS  
Sbjct: 115 EVPNKLDWREKGYVTPVKDQGECGSCWAFSTTGALEGQMFRKTGKLVSLSEQNLVDCSRP 174

Query: 190 --NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKG 247
             N GC+GGLMD+AF+Y+ +  GL +E  YPY   +    +   K  AA  + + D+P G
Sbjct: 175 EGNEGCNGGLMDQAFQYVKDQNGLDSEESYPYLGTDDQPCHFDPKNSAANDTGFVDIPSG 234

Query: 248 DEQALLQAVSN-QPVSVCVDASGRAFHFYKSGV-LNADCGN-NCDHGVAVVGFG-TAEEE 303
            E+AL++A++   PVSV +DA   +F FY+SG+    +C +   DHGV  VG+G   E+ 
Sbjct: 235 KERALMKAIAAVGPVSVAIDAGHESFQFYQSGIYYEKECSSEELDHGVLAVGYGFEGEDV 294

Query: 304 NGAKYWLIKNSWGETWGESGYIRILRDA-GLCGIATAASYPV 344
           +G KYW++KNSW E WG+ GYI + +D    CGIATAASYP+
Sbjct: 295 DGKKYWIVKNSWSENWGDKGYIYMAKDRHNHCGIATAASYPL 336


>gi|2765358|emb|CAA74241.1| cathepsin L [Litopenaeus vannamei]
          Length = 325

 Score =  259 bits (662), Expect = 1e-66,   Method: Compositional matrix adjust.
 Identities = 140/318 (44%), Positives = 195/318 (61%), Gaps = 18/318 (5%)

Query: 36  PSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKAN---KEGNRTYKLGTNEFS 92
           PS+ ++ + + A+HGR Y    E+  RL++F+QN ++I+  N   + G  T+ L  N+F 
Sbjct: 16  PSLRQQWQNFKAEHGRRYASVQEERYRLSVFEQNQQFIDDHNARFENGEVTFTLQMNQFG 75

Query: 93  DLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQC 152
           D+T+EE  A   G+      +   + RP+     +   +P  +DWR KGAVT +KDQ QC
Sbjct: 76  DMTSEEIVATMNGF------LGAPTRRPAAVLKADDETLPEKVDWRTKGAVTPVKDQKQC 129

Query: 153 GSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD--NHGCSGGLMDKAFEYIIENKG 210
           GSCWAFS   ++EG   +  GKL+ LSEQ LVDCS    N GC GGLMD+AF YI  NKG
Sbjct: 130 GSCWAFSTTGSLEGQHFLKDGKLVSLSEQNLVDCSDKFRNMGCMGGLMDQAFRYIKANKG 189

Query: 211 LATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSN-QPVSVCVDASG 269
           + TE  YPY  ++G C       V AT + Y D+  G E AL +AV+   P+SV +DAS 
Sbjct: 190 IDTEDSYPYEAQDGKCRFDASN-VGATDTGYVDVEHGSESALKKAVATIGPISVGIDASQ 248

Query: 270 RAFHFYKSGVLNAD-CGNN-CDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRI 327
             FHFY +GV + D C +   DHGV  VG+G+  +ENG  +WL+KNSW  +WG+ GYI++
Sbjct: 249 STFHFYHTGVYHDDHCSSTMLDHGVLAVGYGS--DENGGDFWLVKNSWNTSWGDKGYIKM 306

Query: 328 LRDA-GLCGIATAASYPV 344
            R+    CGIA+ ASYP+
Sbjct: 307 SRNRNNNCGIASQASYPL 324


>gi|428186189|gb|EKX55040.1| hypothetical protein GUITHDRAFT_63227 [Guillardia theta CCMP2712]
          Length = 344

 Score =  259 bits (661), Expect = 2e-66,   Method: Compositional matrix adjust.
 Identities = 143/300 (47%), Positives = 185/300 (61%), Gaps = 16/300 (5%)

Query: 58  EKAMRLNIFKQNLEYIEKANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVS 114
           E      +F++NL+ I K N+E   G ++Y++G N F+ LT EEF A Y GY        
Sbjct: 47  ESTRAFEVFQKNLDMIMKHNEEYNQGLQSYEMGLNGFAHLTFEEFSAQYLGYG-GAEVEQ 105

Query: 115 RQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGK 174
            ++ R    + ++ +++P S+DWREKGAV  +K+QG CGSCWAFSAVAA+EG   +  G+
Sbjct: 106 PKTRRAGKHERKSRSEIPASVDWREKGAVAEVKNQGACGSCWAFSAVAALEGAHFLNSGE 165

Query: 175 LIELSEQQLVDCSTD--NHGCSGGLMDKAFEYIIENKGLA--TEADYPYRHEEGTCDNQK 230
           LI LSEQQLVDCS    NHGC+GG MD AFEY + N G    +E DYPY+  +G C    
Sbjct: 166 LISLSEQQLVDCSKKFGNHGCAGGYMDNAFEYWMNNTGHGDDSEKDYPYKGMDGKCKFSA 225

Query: 231 EKAVAATISKYEDLPKGDEQALLQAVSN-QPVSVCVDASGRAFHFYKSGVLN---ADCGN 286
           +  V ATIS Y D+ +G+E  LL AV+N  PVSV + A G A  FY  GV N     C  
Sbjct: 226 D-GVRATISGYNDVKQGNETDLLDAVANVGPVSVAIHA-GAALQFYLRGVFNGVAGTCFG 283

Query: 287 NCDHGVAVVGFGTAEEENGAK--YWLIKNSWGETWGESGYIRILRDAGLCGIATAASYPV 344
             +HGV  VG+GTA    G K  YW+IKNSWG  WGE G++R  R   LCG+A  ASYP+
Sbjct: 284 PLNHGVTAVGYGTASLRFGRKMDYWIIKNSWGMGWGEKGFVRFARGKNLCGVANGASYPL 343


>gi|359483753|ref|XP_002266308.2| PREDICTED: oryzain alpha chain-like [Vitis vinifera]
          Length = 501

 Score =  259 bits (661), Expect = 2e-66,   Method: Compositional matrix adjust.
 Identities = 140/319 (43%), Positives = 191/319 (59%), Gaps = 14/319 (4%)

Query: 35  EPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDL 94
           E  + E    W  +H R YK   E A R  IFK+NL+Y+ + N +G+R + LG N+F+D+
Sbjct: 39  EERVRELFHLWKERHKRVYKHAEETAKRFEIFKENLKYVIERNSKGHR-HTLGMNKFADM 97

Query: 95  TNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT--DVPTSIDWREKGAVTHIKDQGQC 152
           +NEEF+  Y    +   +      R S  + +     + P+S+DWR+KG VT IKDQG C
Sbjct: 98  SNEEFKEKYLSKIKKPINKKNNYLRRSMQQKKGTASCEAPSSLDWRKKGVVTGIKDQGDC 157

Query: 153 GSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTDNHGCSGGLMDKAFEYIIENKGLA 212
           GSCWAFS+  A+EGI  I  G LI LSEQ+LVDC T N+GC GG MD AFE++I N G+ 
Sbjct: 158 GSCWAFSSTGAMEGINAIVTGDLISLSEQELVDCDTTNYGCEGGYMDYAFEWVISNGGID 217

Query: 213 TEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAF 272
           +E+DYPY   +GTC+  KE     +I  Y+D+ + D  ALL A  NQP+SV +D S   F
Sbjct: 218 SESDYPYTGTDGTCNTTKEDTKVVSIDGYKDVDESD-SALLCAAVNQPISVGMDGSALDF 276

Query: 273 HFYKSGVL---NADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILR 329
             Y SG+     +D  ++ DH V +VG+G+ + E+   YW+ KNSWG +WG  GY  I R
Sbjct: 277 QLYTSGIYAGDCSDDPDDIDHAVLIVGYGSEDSED---YWICKNSWGTSWGMEGYFYIKR 333

Query: 330 DAGL----CGIATAASYPV 344
           +  L    C I   ASYP 
Sbjct: 334 NTDLPYGECAINAMASYPT 352


>gi|226443040|ref|NP_001140018.1| Cathepsin L1 precursor [Salmo salar]
 gi|221221188|gb|ACM09255.1| Cathepsin L1 precursor [Salmo salar]
          Length = 338

 Score =  259 bits (661), Expect = 2e-66,   Method: Compositional matrix adjust.
 Identities = 146/341 (42%), Positives = 203/341 (59%), Gaps = 18/341 (5%)

Query: 15  VIIILVITCASQVVSGRSMHEPSIVEKHEQ-WMAQHGRTYKDELEKAMRLNIFKQNLEYI 73
           + + +++ C S V +       S +E H   W   H ++Y  E E+  R  ++++NL+ I
Sbjct: 4   LYLAVLVLCVSAVCAAPRF--DSQLEDHWHLWKNWHSKSYH-ESEEGWRRMVWEKNLKKI 60

Query: 74  EKANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTD 130
           E  N E   G  +Y+LG N F D+TNEEFR    GY +     + +  + S F   N   
Sbjct: 61  EMHNLEHTMGKHSYRLGMNHFGDMTNEEFRQTMNGYKQ----TTERKFKGSLFMEPNYLQ 116

Query: 131 VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD- 189
            P ++DWREKG VT +KDQG CGSCWAFS   A+EG      GKL+ LSEQ LVDCS   
Sbjct: 117 APKAVDWREKGYVTPVKDQGSCGSCWAFSTTGAMEGQQFRKTGKLVSLSEQNLVDCSRPE 176

Query: 190 -NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGD 248
            N GC+GGLMD+AF+YI +N GL TE  YPY   +    + K +   A  + + D+P G 
Sbjct: 177 GNEGCNGGLMDQAFQYIQDNAGLDTEESYPYVGTDEDPCHYKPEFSGANETGFVDIPSGK 236

Query: 249 EQALLQAVSN-QPVSVCVDASGRAFHFYKSGV-LNADCGN-NCDHGVAVVGFG-TAEEEN 304
           E A+++AV+   PVSV +DA   +F FY+SG+    +C +   DHGV VVG+G   E+ +
Sbjct: 237 EHAMMKAVAAVGPVSVAIDAGHESFQFYESGIYYEKECSSEELDHGVLVVGYGFEGEDVD 296

Query: 305 GAKYWLIKNSWGETWGESGYIRILRD-AGLCGIATAASYPV 344
           G KYW++KNSW E WG+ GYI + +D    CGIATA+SYP+
Sbjct: 297 GKKYWIVKNSWSEKWGDKGYIYMAKDRKNHCGIATASSYPL 337


>gi|185135439|ref|NP_001117777.1| procathepsin L precursor [Oncorhynchus mykiss]
 gi|14582899|gb|AAK69706.1|AF358668_1 procathepsin L [Oncorhynchus mykiss]
          Length = 338

 Score =  259 bits (661), Expect = 2e-66,   Method: Compositional matrix adjust.
 Identities = 147/341 (43%), Positives = 203/341 (59%), Gaps = 18/341 (5%)

Query: 15  VIIILVITCASQVVSGRSMHEPSIVEKHEQ-WMAQHGRTYKDELEKAMRLNIFKQNLEYI 73
           + + +++ C S V +       S +E H   W   H + Y  E E+  R  ++++NL+ I
Sbjct: 4   LYLAVLVLCVSAVCAAPRF--DSQLEDHWHLWKNWHSKHYH-ESEEGWRRMVWEKNLKKI 60

Query: 74  EKANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTD 130
           E  N E   G  +Y+LG N F D+TNEEFR    GY +     + +  + S F   N   
Sbjct: 61  EIHNLEHTMGKHSYRLGMNHFGDMTNEEFRQTMNGYKQ----TTERKFKGSLFMEPNYLQ 116

Query: 131 VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD- 189
            P ++DWREKG VT +KDQG CGSCWAFS   A+EG      GKL+ LSEQ LVDCS   
Sbjct: 117 APKAVDWREKGYVTPVKDQGSCGSCWAFSTTGAMEGQQFRKTGKLVSLSEQNLVDCSRPE 176

Query: 190 -NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGD 248
            N GC+GGLMD+AF+YI +N GL TE  YPY   +    + K +  AA  + + D+P G 
Sbjct: 177 GNEGCNGGLMDQAFQYIQDNAGLDTEESYPYVGTDEDPCHYKPEFSAANETGFVDIPSGK 236

Query: 249 EQALLQAVSN-QPVSVCVDASGRAFHFYKSGV-LNADCGN-NCDHGVAVVGFG-TAEEEN 304
           E A+++AV+   PVSV +DA   +F FY+SG+    +C +   DHGV VVG+G   E+ +
Sbjct: 237 EHAMMKAVAAVGPVSVAIDAGHESFQFYESGIYYEKECSSEELDHGVLVVGYGFEGEDVD 296

Query: 305 GAKYWLIKNSWGETWGESGYIRILRD-AGLCGIATAASYPV 344
           G KYW++KNSW E WG+ GYI + +D    CGIATA+SYP+
Sbjct: 297 GKKYWIVKNSWSEKWGDKGYIYMAKDRKNHCGIATASSYPL 337


>gi|444514070|gb|ELV10520.1| Cathepsin L1 [Tupaia chinensis]
          Length = 450

 Score =  259 bits (661), Expect = 2e-66,   Method: Compositional matrix adjust.
 Identities = 145/337 (43%), Positives = 196/337 (58%), Gaps = 25/337 (7%)

Query: 17  IILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKA 76
           + L   C   + S     + ++      W + H R Y    E+  R  ++++N++ IE  
Sbjct: 129 LFLAALCLG-IASATPNSDQNLDTSWHHWKSTHRRLYGKN-EEGWRRAVWEKNMKMIEMH 186

Query: 77  NKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPT 133
           N E   G   + +G N F D+TNEEFR +  G+       +++      F    +   P 
Sbjct: 187 NHEYSNGKHGFTMGMNAFGDMTNEEFRQVMNGFR------NQKQKSGKVFHAPLLLQAPK 240

Query: 134 SIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCS--TDNH 191
           S+DWREKG VT +K+QGQCGSCWAFSA  A+EG      GKLI LSEQ LVDCS    N 
Sbjct: 241 SVDWREKGFVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLISLSEQNLVDCSRRQGNL 300

Query: 192 GCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQA 251
           GC GGLMD AF+YI +N GL +E  YPY+  +GTC  + E AVA           G E+A
Sbjct: 301 GCQGGLMDNAFQYIKDNGGLDSEESYPYKGMDGTCQYKAEWAVANDT--------GFEKA 352

Query: 252 LLQAVSN-QPVSVCVDASGRAFHFYKSGV-LNADCGN-NCDHGVAVVGFGTAEEENGAKY 308
           L++AV++  P+SV +DA   +F FYK G+    DC + N DHGV VVG+G  +  +  KY
Sbjct: 353 LMKAVASVGPISVAIDAGHASFQFYKDGIYYEPDCSSENLDHGVLVVGYGVEKRNSNDKY 412

Query: 309 WLIKNSWGETWGESGYIRILRDA-GLCGIATAASYPV 344
           WLIKNSWGE WG +GY++I +D    CG+A+AASYPV
Sbjct: 413 WLIKNSWGEQWGANGYVKIAKDRNNHCGVASAASYPV 449


>gi|728637|emb|CAA59441.1| cathepsin l [Litopenaeus vannamei]
          Length = 326

 Score =  259 bits (661), Expect = 2e-66,   Method: Compositional matrix adjust.
 Identities = 140/318 (44%), Positives = 195/318 (61%), Gaps = 18/318 (5%)

Query: 36  PSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKAN---KEGNRTYKLGTNEFS 92
           PS+ ++ + + A+HGR Y    E+  RL++F+QN ++I+  N   + G  T+ L  N+F 
Sbjct: 17  PSLRQQWQNFKAEHGRRYASVQEERYRLSVFEQNQQFIDDHNARFENGEVTFTLQMNQFG 76

Query: 93  DLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQC 152
           D+T+EE  A   G+      +   + RP+     +   +P  +DWR KGAVT +KDQ QC
Sbjct: 77  DMTSEEIVATMNGF------LGAPTRRPAAVLKADDETLPEKVDWRTKGAVTPVKDQKQC 130

Query: 153 GSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD--NHGCSGGLMDKAFEYIIENKG 210
           GSCWAFS   ++EG   +  GKL+ LSEQ LVDCS    N GC GGLMD+AF YI  NKG
Sbjct: 131 GSCWAFSTTGSLEGQHFLKDGKLVSLSEQNLVDCSDKFGNMGCMGGLMDQAFRYIKANKG 190

Query: 211 LATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSN-QPVSVCVDASG 269
           + TE  YPY  ++G C       V AT + Y D+  G E AL +AV+   P+SV +DAS 
Sbjct: 191 IDTEDSYPYEAQDGKCRFDASN-VGATDTGYVDVEHGSESALKKAVATIGPISVGIDASQ 249

Query: 270 RAFHFYKSGVLNAD-CGNN-CDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRI 327
             FHFY +GV + D C +   DHGV  VG+G+  +ENG  +WL+KNSW  +WG+ GYI++
Sbjct: 250 STFHFYHTGVYHDDHCSSTMLDHGVLAVGYGS--DENGGDFWLVKNSWNTSWGDKGYIKM 307

Query: 328 LRDA-GLCGIATAASYPV 344
            R+    CGIA+ ASYP+
Sbjct: 308 SRNRNNNCGIASQASYPL 325


>gi|413919735|gb|AFW59667.1| hypothetical protein ZEAMMB73_680472 [Zea mays]
          Length = 344

 Score =  259 bits (661), Expect = 2e-66,   Method: Compositional matrix adjust.
 Identities = 127/266 (47%), Positives = 168/266 (63%), Gaps = 9/266 (3%)

Query: 27  VVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKAN---KEGNRT 83
           +VS     E      + +WMA HGRTY    E+  R  +F+ NL Y++  N     G  +
Sbjct: 31  IVSYGERSEEEARRMYAEWMAAHGRTYNAVGEEERRFEVFRDNLRYVDAHNAAADAGVHS 90

Query: 84  YKLGTNEFSDLTNEEFRALYTGY-NRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGA 142
           ++LG N F+DLTN+E+RA Y G  +RP     R+      +   +  D+P S+DWR KGA
Sbjct: 91  FRLGLNRFADLTNDEYRATYLGVRSRP----QRERRLGDRYLAGDNEDLPESVDWRAKGA 146

Query: 143 VTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKA 201
           V  +KDQG CGSCWAFS +AAVEGI QI  G +I LSEQ+LVDC T  N GC+GGLMD A
Sbjct: 147 VAEVKDQGSCGSCWAFSTIAAVEGINQIVTGDMISLSEQELVDCDTSYNQGCNGGLMDYA 206

Query: 202 FEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPV 261
           FE+II N G+ TE DYPY+  +G CD  ++ A   TI  YED+P   E++L +AV+NQP+
Sbjct: 207 FEFIINNGGIDTEEDYPYKGTDGRCDVNRKNAKVVTIDSYEDVPANSEKSLQKAVANQPI 266

Query: 262 SVCVDASGRAFHFYKSGVLNADCGNN 287
           SV ++A GRAF  Y SG+    CGN+
Sbjct: 267 SVAIEAGGRAFQLYNSGIFTGTCGNS 292


>gi|260516672|gb|ACX43963.1| cysteine protease 3, partial [Brachiaria hybrid cultivar]
          Length = 319

 Score =  259 bits (661), Expect = 2e-66,   Method: Compositional matrix adjust.
 Identities = 142/287 (49%), Positives = 179/287 (62%), Gaps = 17/287 (5%)

Query: 45  WMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYT 104
           +M Q+ + Y    E + R N FK ++E I   N   N +Y +G NEF+DL+ EEF+  Y 
Sbjct: 45  FMKQYSKAYS-HAEFSSRFNQFKASVETIRLHNTLANASYTMGLNEFADLSFEEFKGKYF 103

Query: 105 GYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAV 164
           G       V R+ +R +   +Q V   PTSIDWR   AVT IKDQGQCGSCWAFSA  ++
Sbjct: 104 G----CKHVEREFARSNNL-HQEVEAAPTSIDWRTSNAVTPIKDQGQCGSCWAFSATGSI 158

Query: 165 EGITQITRGK--LIELSEQQLVDCSTD--NHGCSGGLMDKAFEYIIENKGLATEADYPYR 220
           EG   + +GK  L  LSEQQLVDCST   N GC+GGLMD AFEYII NKG+  E+ YPY+
Sbjct: 159 EG-AWVLQGKHTLTSLSEQQLVDCSTSYGNAGCNGGLMDYAFEYIIANKGICAESAYPYK 217

Query: 221 HEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSN-QPVSVCVDASGRAFHFYKSGV 279
              G C     K V  TIS ++D+  GDE + L AV    PVSV ++A    F FY SGV
Sbjct: 218 GVGGLCQKSCTKVV--TISGHKDVASGDEASSLNAVGTVGPVSVAIEADQAGFQFYSSGV 275

Query: 280 LNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIR 326
            +  CG+N DHGV  VG+GT   ++   YW++KNSWG +WGESGYIR
Sbjct: 276 FSGTCGHNLDHGVLAVGYGTTGSQD---YWIVKNSWGTSWGESGYIR 319


>gi|34559455|gb|AAQ75437.1| cathepsin L-like protease [Helicoverpa armigera]
 gi|338855117|gb|AEJ31938.1| cathepsin L-like protease [Helicoverpa assulta]
          Length = 341

 Score =  259 bits (661), Expect = 2e-66,   Method: Compositional matrix adjust.
 Identities = 140/319 (43%), Positives = 199/319 (62%), Gaps = 20/319 (6%)

Query: 43  EQWMA---QHGRTYKDELEKAMRLNIFKQNLEYIEKANK---EGNRTYKLGTNEFSDLTN 96
           E+W A   +H + Y  E+E   R+ I+ +N   I K N+   +G  +YKL  N+++D+ +
Sbjct: 25  EEWSAFKLEHSKRYDSEVEDKFRMKIYLENKHRIAKHNQRFEQGAVSYKLRPNKYADMLS 84

Query: 97  EEFRALYTGYNRPV--PSVSR---QSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQ 151
            EF  +  G+N+ +  P       + SRP+TF        P  +DWR+KGAVT +KDQG+
Sbjct: 85  HEFVHVMNGFNKTLKHPKAVHGKGRESRPATFIAPAHVTYPDHVDWRKKGAVTEVKDQGK 144

Query: 152 CGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST--DNHGCSGGLMDKAFEYIIENK 209
           CGSCWAFS   A+EG      G L+ LSEQ L+DCS    N+GC+GGLMD AF+YI +N 
Sbjct: 145 CGSCWAFSTTGALEGQHFRKTGYLVSLSEQNLIDCSAAYGNNGCNGGLMDNAFKYIKDNG 204

Query: 210 GLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSN-QPVSVCVDAS 268
           G+ TE  YPY   +  C    + + A  +  + D+P+GDE+ L+QAV+   PVSV +DAS
Sbjct: 205 GIDTEKAYPYEGVDDKCRYNAKNSGADDVG-FVDIPQGDEEKLMQAVATVGPVSVAIDAS 263

Query: 269 GRAFHFYKSGVL-NADCGN-NCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIR 326
             +F FY  GV  + +C + + DHGV VVG+GT  +E G  YWL+KNSWG TWG+ GYI+
Sbjct: 264 QESFQFYSDGVYYDENCSSTDLDHGVMVVGYGT--DEQGGDYWLVKNSWGRTWGDLGYIK 321

Query: 327 ILRDA-GLCGIATAASYPV 344
           + R+    CGIA++ASYP+
Sbjct: 322 MARNKNNHCGIASSASYPL 340


>gi|52345644|ref|NP_001004869.1| cathepsin L2 precursor [Xenopus (Silurana) tropicalis]
 gi|49522051|gb|AAH74718.1| MGC69486 protein [Xenopus (Silurana) tropicalis]
          Length = 335

 Score =  259 bits (661), Expect = 2e-66,   Method: Compositional matrix adjust.
 Identities = 146/342 (42%), Positives = 207/342 (60%), Gaps = 18/342 (5%)

Query: 13  MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEY 72
           M + + +   C + V +  +  +P++      W   H ++Y  + E+  R  ++++NL  
Sbjct: 1   MALYLGIAAICLTTVFAAPTT-DPALDNHWNLWKNWHKKSYAPK-EEGWRRVLWEKNLRM 58

Query: 73  IEKANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
           IE  N E   G  ++ LG N+F D+TNEEFR L  GY       +++  R STF   N  
Sbjct: 59  IEFHNLEHSLGKHSHSLGMNQFGDMTNEEFRQLMNGYK------NQKKIRGSTFLAPNNF 112

Query: 130 DVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCS-- 187
           + P S+DWR+KG VT +KDQGQCGSCWAFS   A+EG      GK+I LSEQ LVDCS  
Sbjct: 113 ESPKSVDWRKKGYVTPVKDQGQCGSCWAFSTTGALEGQHYRNTGKMISLSEQNLVDCSRA 172

Query: 188 TDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKG 247
             N GC+GGLMD+AF+Y+ +N G+ +E  YPY  ++    +      +A  + + D+  G
Sbjct: 173 QGNQGCNGGLMDQAFQYVKDNGGIDSEDSYPYTAKDDQECHYDPNYNSANDTGFVDVTSG 232

Query: 248 DEQALLQAVSN-QPVSVCVDASGRAFHFYKSGV-LNADCGN-NCDHGVAVVGFG-TAEEE 303
            E+ L+ AV++  PVSV VDA  ++F FYKSG+    +C + + DHGV VVG+G   E+E
Sbjct: 233 SEKDLMNAVASVGPVSVAVDAGHQSFQFYKSGIYYEPECSSEDLDHGVLVVGYGFEGEDE 292

Query: 304 NGAKYWLIKNSWGETWGESGYIRILRDA-GLCGIATAASYPV 344
           +G KYW++KNSW E WG  GYI I +D    CGIATAASYP+
Sbjct: 293 DGKKYWIVKNSWSEKWGNDGYIYIAKDRHNHCGIATAASYPL 334


>gi|444519959|gb|ELV12909.1| Cathepsin L1 [Tupaia chinensis]
          Length = 333

 Score =  258 bits (660), Expect = 2e-66,   Method: Compositional matrix adjust.
 Identities = 143/341 (41%), Positives = 207/341 (60%), Gaps = 26/341 (7%)

Query: 17  IILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKA 76
           + L I C   + S    H+ S+ E+  QW A+HG+ Y    E+++R  ++++NL+ IE+ 
Sbjct: 5   LFLTILCLG-IASAAPTHDQSLDEQWNQWTAEHGKVYSTG-EESLRRAVWEKNLKMIEQH 62

Query: 77  NKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPT 133
           N E   G  T+ +G N F D+TNE+FR + TG+       +++ ++   F+     +VP 
Sbjct: 63  NLEYSQGKHTFTMGMNAFGDMTNEDFRQMMTGFQ------NQKYNKGEVFQPPQPLEVPE 116

Query: 134 SIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTDNH-- 191
           S+DWREKG VT +K+Q +CGSCWAFSA  A+EG      GKL+ LSEQ LVDCS   H  
Sbjct: 117 SVDWREKGYVTPVKNQHRCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSQPQHNS 176

Query: 192 GCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQA 251
           GC GGL+ KAF+Y+ +N GL +E  YPY   E TC      + AAT++ ++ +P  +E+A
Sbjct: 177 GCKGGLVIKAFQYVKDNGGLDSEESYPYEEMESTCRYSPGNS-AATVTGFKHIP-AEEKA 234

Query: 252 LLQAVSN-QPVSVCVDASGRAFHFYKSGVLNADCGNNC-----DHGVAVVGFGTAEE-EN 304
           L +AV++  P+SV +DA   +F FY  G+L+     NC     +H V VVG+G  +E  N
Sbjct: 235 LEKAVASVGPISVAIDAHHHSFQFYTGGILHEP---NCSPKWLNHAVLVVGYGVMQEGSN 291

Query: 305 GAKYWLIKNSWGETWGESGYIRILRDA-GLCGIATAASYPV 344
              YWL+KNSWGE WG  GYI + +D    CGIA+ A YP+
Sbjct: 292 NNTYWLVKNSWGERWGVGGYIMMAKDKNNHCGIASDALYPI 332


>gi|307111936|gb|EFN60170.1| hypothetical protein CHLNCDRAFT_59551 [Chlorella variabilis]
          Length = 364

 Score =  258 bits (660), Expect = 2e-66,   Method: Compositional matrix adjust.
 Identities = 131/291 (45%), Positives = 179/291 (61%), Gaps = 12/291 (4%)

Query: 62  RLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPS 121
           R NI+  NL +  + N   + ++ L    ++DL+ +E+R+   GYN  +    ++  R +
Sbjct: 71  RFNIWLDNLRFAHEYNAR-HTSHWLSMGVYADLSQDEYRSKALGYNAHLHK--KRPLRAA 127

Query: 122 TFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQ 181
            F Y+  T  P  +DW   GAVT +KDQ  CGSCWAFS   AVEG   I  GKL+ LSEQ
Sbjct: 128 PFLYKG-TVPPEEVDWVAGGAVTPVKDQLLCGSCWAFSTTGAVEGANAIATGKLVSLSEQ 186

Query: 182 QLVDCSTD-NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISK 240
            LVDC  + + GC GG MD AF++I+ N G+ TE DYPYR E+G C + + +    TI  
Sbjct: 187 MLVDCDREYDTGCRGGFMDSAFDFIVNNGGIDTEDDYPYRAEDGICQDNRTRRHVVTIDG 246

Query: 241 YEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTA 300
           Y+D+P  DE AL++AV++QPVSV ++A   AF  Y  GV +A+CG   DH V VVG+GTA
Sbjct: 247 YQDVPPNDENALMKAVAHQPVSVAIEADQLAFQLYGGGVFDAECGTALDHAVLVVGYGTA 306

Query: 301 EE-ENGAKYWLIKNSWGETWGESGYIRILRDA------GLCGIATAASYPV 344
               +   YWL+KNSWG  WGE GYIR+LR+       G CG+A  AS+P+
Sbjct: 307 SNGTHNLPYWLVKNSWGAEWGEKGYIRLLRNLGKDAPEGQCGLAMYASFPI 357


>gi|330805275|ref|XP_003290610.1| hypothetical protein DICPUDRAFT_98747 [Dictyostelium purpureum]
 gi|325079249|gb|EGC32858.1| hypothetical protein DICPUDRAFT_98747 [Dictyostelium purpureum]
          Length = 334

 Score =  258 bits (660), Expect = 2e-66,   Method: Compositional matrix adjust.
 Identities = 147/341 (43%), Positives = 199/341 (58%), Gaps = 21/341 (6%)

Query: 11  IPMFVIIILVI----TCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIF 66
           + +F+I+ LVI     CA+  +     ++ S +     WM +H + Y    E   +   F
Sbjct: 3   LAVFLIVSLVILSINVCAATNLFSAQTYQTSFL----GWMKKHNKAYHHH-EFNDKYQTF 57

Query: 67  KQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQ 126
           K N+++I   N + + T  LG N F+DLTNEE++  Y G +  V   + Q    +   ++
Sbjct: 58  KDNMDFIHNWNSKESDTV-LGLNRFADLTNEEYKKTYLGMSINVNLRANQVPM-NGLNFE 115

Query: 127 NVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDC 186
             T  P+SIDWR+ GAV ++KDQG CGSCWAF+   AVEG  QI  G ++  SEQ LVDC
Sbjct: 116 RFTG-PSSIDWRQNGAVAYVKDQGHCGSCWAFATTGAVEGAHQIKTGNMVTFSEQHLVDC 174

Query: 187 S--TDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDL 244
           S    N+GC GGLM  AF+YII+N G+ATE  YPY   +  C       +   IS Y+D+
Sbjct: 175 SGRYGNNGCDGGLMTSAFKYIIDNDGIATEEAYPYTATQNRCV-YNTTMLGTAISGYKDV 233

Query: 245 PKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLN-ADCGN-NCDHGVAVVGFGTAEE 302
           P+G E AL  A+S QPV+V +DAS   F  YKSGV   A C +   +HGV  VG+GT E 
Sbjct: 234 PRGSESALTAAISKQPVAVAIDASPITFQLYKSGVYQEATCSSYRLNHGVLAVGYGTLE- 292

Query: 303 ENGAKYWLIKNSWGETWGESGYIRILRDA-GLCGIATAASY 342
             G  Y+++KNSW ETWG  GYI + R+A   CGIAT ASY
Sbjct: 293 --GKDYYIVKNSWAETWGNQGYILMARNANNHCGIATMASY 331


>gi|162138968|ref|NP_001104662.1| uncharacterized protein LOC567623 precursor [Danio rerio]
 gi|158254065|gb|AAI54241.1| Zgc:174153 protein [Danio rerio]
          Length = 336

 Score =  258 bits (660), Expect = 2e-66,   Method: Compositional matrix adjust.
 Identities = 144/342 (42%), Positives = 205/342 (59%), Gaps = 18/342 (5%)

Query: 13  MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEY 72
           MF +II +  C S V +  S+ +  + +    W +QHG++Y +++E   R+ I+++NL  
Sbjct: 2   MFALIITL--CISAVFTAPSI-DIQLDDHWNSWKSQHGKSYHEDVEVGRRM-IWEENLRK 57

Query: 73  IEKANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
           IE+ N E   GN T+K+G N+F D+TNEEFR    GY       +R S  P  F   +  
Sbjct: 58  IEQHNFEYSYGNHTFKMGMNQFGDMTNEEFRQAMNGYKH---DPNRTSQGP-LFMEPSFF 113

Query: 130 DVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD 189
             P  +DWR++G VT +KDQ QCGSCW+FS+  A+EG      GKLI +SEQ LVDCS  
Sbjct: 114 AAPQQVDWRQRGYVTPVKDQKQCGSCWSFSSTGALEGQLFRKTGKLISMSEQNLVDCSRP 173

Query: 190 --NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKG 247
             N GC+GGLMD AF+Y+ ENKGL +E  YPY   +        +   A  + + D+P G
Sbjct: 174 QGNQGCNGGLMDLAFQYVKENKGLDSEQSYPYLARDDLPCRYDPRFNVAKSTGFVDIPSG 233

Query: 248 DEQALLQAVSN-QPVSVCVDASGRAFHFYKSGVL--NADCGNNCDHGVAVVGFG-TAEEE 303
           +E AL+ AV+   PVSV +DAS ++  FY+SG+    A   +  DH V VVG+G    + 
Sbjct: 234 NEPALMNAVAAVGPVSVAIDASHQSLQFYQSGIYYERACSSSRLDHAVLVVGYGYQGADV 293

Query: 304 NGAKYWLIKNSWGETWGESGYIRILRDA-GLCGIATAASYPV 344
            G +YW++KNSW + WG+ GYI + +D    CG+AT ASYP+
Sbjct: 294 AGNRYWIVKNSWSDKWGDKGYIYMAKDKNNHCGVATKASYPL 335


>gi|109112057|ref|XP_001086247.1| PREDICTED: cathepsin L1-like isoform 5 [Macaca mulatta]
 gi|402897797|ref|XP_003911929.1| PREDICTED: cathepsin L1 [Papio anubis]
          Length = 333

 Score =  258 bits (660), Expect = 2e-66,   Method: Compositional matrix adjust.
 Identities = 144/343 (41%), Positives = 206/343 (60%), Gaps = 23/343 (6%)

Query: 12  PMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLE 71
           P F++    +  AS  ++       S+  +  +W A H R Y    E+  R  ++++N++
Sbjct: 3   PTFILAAFCLGIASATLT----FNHSLEAQWTKWKAMHNRLYGMN-EEGWRRAVWEKNMK 57

Query: 72  YIEKANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNV 128
            IE  N+E   G  ++ +  N F D+T+EEFR +  G+       +R+  +   F+    
Sbjct: 58  MIELHNQEYSQGKHSFTMAMNTFGDMTSEEFRQVMNGFQ------NRKPRKGKVFQEPLF 111

Query: 129 TDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCS- 187
            + P S+DWREKG VT +K+QGQCGSCWAFSA  A+EG      GKL+ LSEQ LVDCS 
Sbjct: 112 YEAPRSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSG 171

Query: 188 -TDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPK 246
              N GC+GGLMD AF+Y+ +N GL +E  YPY   E +C    E +V A  + + D+PK
Sbjct: 172 PQGNEGCNGGLMDYAFQYVADNGGLDSEESYPYEATEESCKYNPEYSV-ANDTGFVDIPK 230

Query: 247 GDEQALLQAVSN-QPVSVCVDASGRAFHFYKSGV-LNADCGN-NCDHGVAVVGFG-TAEE 302
             E+AL++AV+   P+SV +DA   +F FYK G+    DC + + DHGV VVG+G  + E
Sbjct: 231 -QEKALMKAVATVGPISVAIDAGHESFMFYKEGIYFEPDCSSEDMDHGVLVVGYGFESTE 289

Query: 303 ENGAKYWLIKNSWGETWGESGYIRILRD-AGLCGIATAASYPV 344
            + +KYWL+KNSWGE WG  GYI++ +D    CGIA+AASYP 
Sbjct: 290 SDNSKYWLVKNSWGEEWGMGGYIKMAKDRRNHCGIASAASYPT 332


>gi|157093355|gb|ABV22332.1| cysteine protease 1 [Noctiluca scintillans]
          Length = 338

 Score =  258 bits (660), Expect = 2e-66,   Method: Compositional matrix adjust.
 Identities = 140/330 (42%), Positives = 190/330 (57%), Gaps = 14/330 (4%)

Query: 21  ITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEG 80
           I  A  V +G  +  P  +     +  ++G+ Y    E A+R  IFK N++ I   N   
Sbjct: 6   IAAAVLVAAGHEVPPPDYMMMFNNFKTKYGKVYNGINEDAVRFGIFKANVDIIYATNAR- 64

Query: 81  NRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREK 140
           N T+ LG NEF+DLT EE  A YTG  +P  S+     R ST +Y N   + +S+DW  +
Sbjct: 65  NLTFALGVNEFTDLTQEELAASYTGL-KPA-SLWSGLPRLSTHEY-NGAPLASSVDWTTQ 121

Query: 141 GAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTDNHGCSGGLMDK 200
           G VT +K+QGQCGSCW+FS   A+EG   ++ G L+ LSEQQ VDC T + GC+GG MD 
Sbjct: 122 GVVTPVKNQGQCGSCWSFSTTGALEGAWALSTGNLVSLSEQQFVDCDTTDSGCNGGWMDN 181

Query: 201 AFEYIIENKGLATEADYPYRHEEGTCD--NQKEKAVAATISKYEDLPKGDEQALLQAVSN 258
           AF +  +N  + TE  YPY   +GTC+    +       +  Y D+    EQA++ AV+ 
Sbjct: 182 AFSFAKKNS-ICTEGSYPYTATDGTCNLSGCQVGIPQGGVVGYTDVSTDSEQAMMSAVAQ 240

Query: 259 QPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGET 318
           QPVS+ ++A   +F  Y SGVL A CG   DHGV  VG+G+   E G  YW +KNSWG +
Sbjct: 241 QPVSIAIEADQYSFQLYSSGVLTASCGTRLDHGVLAVGYGS---EAGTDYWKVKNSWGSS 297

Query: 319 WGESGYIRILR---DAGLCG-IATAASYPV 344
           WGE GY+R+ R    AG CG +A   SYPV
Sbjct: 298 WGEQGYVRLQRGKGGAGECGLLAGPPSYPV 327


>gi|297818854|ref|XP_002877310.1| hypothetical protein ARALYDRAFT_484828 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297323148|gb|EFH53569.1| hypothetical protein ARALYDRAFT_484828 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 376

 Score =  258 bits (660), Expect = 2e-66,   Method: Compositional matrix adjust.
 Identities = 145/330 (43%), Positives = 200/330 (60%), Gaps = 19/330 (5%)

Query: 27  VVSGRSMH--EPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTY 84
           VV+    H  E  +   +E+W+ +HG+ Y    EK  R  IFK NL++IE+ N + NR+Y
Sbjct: 24  VVTATESHRNEAEVRTIYERWLVEHGKNYNGLGEKERRFKIFKDNLKHIEEHNSDPNRSY 83

Query: 85  KLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVT 144
             G N+FSDLT +EF+A Y G      S+S  + R   ++Y+    +P  +DWRE+GAV 
Sbjct: 84  DRGLNQFSDLTVDEFQASYLGGKIEKKSLSDVAER---YQYKEGDILPDEVDWRERGAVV 140

Query: 145 -HIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCS--TDNHGCSGGLMDKA 201
             +K QG CGSCWAF+A  AVEGI QIT G+L+ LSEQ+L+DC    DN GC+GG    A
Sbjct: 141 PRVKRQGDCGSCWAFAATGAVEGINQITTGELLSLSEQELIDCDRGKDNFGCAGGGAVWA 200

Query: 202 FEYIIENKGLATEADYPYRHEE-GTCDNQKEKAV-AATISKYEDLPKGDEQALLQAVSNQ 259
           FE+I EN G+ T+ DY Y  ++   C   + K     TI+ +E +P  DE +L +AVS Q
Sbjct: 201 FEFIKENGGIVTDEDYGYTGDDTAACKAIEMKTTRVVTINGHEVVPVNDEMSLKKAVSYQ 260

Query: 260 PVSVCVDASGRAFHFYKSGVLNADCGNNC-DHGVAVVGFGTAEEENGAKYWLIKNSWGET 318
           P+SV + A+  +   YKSGV    C N   DH V +VG+GT+ +E    YWLI+NSWG  
Sbjct: 261 PISVMISAANMS--DYKSGVYKGPCSNLWGDHNVLIVGYGTSSDE--GDYWLIRNSWGPG 316

Query: 319 WGESGYIRILRD----AGLCGIATAASYPV 344
           WGE GY+R+ R+     G C +A A  YP+
Sbjct: 317 WGEGGYLRLQRNFNEPTGKCAVAVAPVYPI 346


>gi|119389039|pdb|2C0Y|A Chain A, The Crystal Structure Of A Cys25ala Mutant Of Human
           Procathepsin S
          Length = 315

 Score =  258 bits (660), Expect = 2e-66,   Method: Compositional matrix adjust.
 Identities = 145/318 (45%), Positives = 198/318 (62%), Gaps = 18/318 (5%)

Query: 35  EPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKE---GNRTYKLGTNEF 91
           +P++      W   +G+ YK++ E+A+R  I+++NL+++   N E   G  +Y LG N  
Sbjct: 5   DPTLDHHWHLWKKTYGKQYKEKNEEAVRRLIWEKNLKFVMLHNLEHSMGMHSYDLGMNHL 64

Query: 92  SDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQ 151
            D+T+EE  +L +     VPS   Q  R  T+K      +P S+DWREKG VT +K QG 
Sbjct: 65  GDMTSEEVMSLMSSLR--VPS---QWQRNITYKSNPNRILPDSVDWREKGCVTEVKYQGS 119

Query: 152 CGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD---NHGCSGGLMDKAFEYIIEN 208
           CG+ WAFSAV A+E   ++  GKL+ LS Q LVDCST+   N GC+GG M  AF+YII+N
Sbjct: 120 CGAAWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTEKYGNKGCNGGFMTTAFQYIIDN 179

Query: 209 KGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQ-PVSVCVDA 267
           KG+ ++A YPY+  +  C     K  AAT SKY +LP G E  L +AV+N+ PVSV VDA
Sbjct: 180 KGIDSDASYPYKAMDQKCQ-YDSKYRAATCSKYTELPYGREDVLKEAVANKGPVSVGVDA 238

Query: 268 SGRAFHFYKSGV-LNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIR 326
              +F  Y+SGV     C  N +HGV VVG+G   + NG +YWL+KNSWG  +GE GYIR
Sbjct: 239 RHPSFFLYRSGVYYEPSCTQNVNHGVLVVGYG---DLNGKEYWLVKNSWGHNFGEEGYIR 295

Query: 327 ILRDAG-LCGIATAASYP 343
           + R+ G  CGIA+  SYP
Sbjct: 296 MARNKGNHCGIASFPSYP 313


>gi|156398078|ref|XP_001638016.1| predicted protein [Nematostella vectensis]
 gi|156225133|gb|EDO45953.1| predicted protein [Nematostella vectensis]
          Length = 326

 Score =  258 bits (660), Expect = 2e-66,   Method: Compositional matrix adjust.
 Identities = 146/335 (43%), Positives = 202/335 (60%), Gaps = 20/335 (5%)

Query: 16  IIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEK 75
           +++ V+  +S+  S R   +   V     W + HG++Y D  E+  R+ I++QNLE I++
Sbjct: 5   LVLCVLVASSRGWSVRFGQDSEWVA----WKSYHGKSYSDVHEERTRMAIWQQNLEKIKR 60

Query: 76  ANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSI 135
            N E + +YK+  N   DLT +EFR  Y G      S  R     +T+   +   +P+S+
Sbjct: 61  HNAE-DHSYKMAMNHLGDLTEDEFRYFYLGVRAHHNSTKRG---WATYMPPSNVKIPSSV 116

Query: 136 DWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD--NHGC 193
           DW +KG VT +K+QGQCGSCWAFS   +VEG      G L+ LSEQ L+DCS    N+GC
Sbjct: 117 DWSQKGYVTGVKNQGQCGSCWAFSTTGSVEGQHFRKTGSLVSLSEQNLIDCSGSYGNNGC 176

Query: 194 SGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALL 253
            GGLMD AF YI  N G+ TE+ YPY  ++G+C +     V A ++ Y+D+P+G EQAL 
Sbjct: 177 QGGLMDNAFRYIESNGGIDTESSYPYLGQQGSC-HFSSSHVGARVTGYQDIPQGSEQALQ 235

Query: 254 QAVSN-QPVSVCVDASGRAFHFYKSGVL-NADCGNN-CDHGVAVVGFGTAEEENGAKYWL 310
            AV+   PVSV VDAS   + FY SGV  N  C +   DHGV V+G+G     NG  YWL
Sbjct: 236 SAVATVGPVSVAVDAS--QWQFYSSGVYDNPYCSSTQLDHGVLVIGYGNY---NGQDYWL 290

Query: 311 IKNSWGETWGESGYIRILRDA-GLCGIATAASYPV 344
           +KNSWG +WG  GYI + R+    CGIA++ASYP+
Sbjct: 291 VKNSWGYSWGVEGYIMMSRNKNNQCGIASSASYPL 325


>gi|126681066|gb|ABO26562.1| cathepsin L-like cysteine protease [Ixodes ricinus]
          Length = 335

 Score =  258 bits (660), Expect = 2e-66,   Method: Compositional matrix adjust.
 Identities = 141/309 (45%), Positives = 194/309 (62%), Gaps = 17/309 (5%)

Query: 47  AQHGRTYKDELEKAMRLNIFKQNLEYIEKANKE---GNRTYKLGTNEFSDLTNEEFRALY 103
           A+HG++Y  E E+  RL I+ +N   I K N++   G   Y +  NEF D+ + EF +  
Sbjct: 32  AKHGKSYVSETEEVFRLKIYMENRHKIAKHNEKYARGEVPYSMAMNEFGDMLHHEFVSTR 91

Query: 104 TGYNRPVPSVSRQSSRPSTFKYQNVTD--VPTSIDWREKGAVTHIKDQGQCGSCWAFSAV 161
            G+ R      R+ S  +  + +N+ D  +P ++DWR KGAVT +K+QGQCGSCWAFSA 
Sbjct: 92  NGFKRNYKDQPREGS--TYLEPENIEDFSLPKTVDWRTKGAVTPVKNQGQCGSCWAFSAT 149

Query: 162 AAVEGITQITRGKLIELSEQQLVDCSTD--NHGCSGGLMDKAFEYIIENKGLATEADYPY 219
            ++EG      G ++ LSEQ LV CSTD  N+GC GGLMD AF+YI  NKG+ TE  YPY
Sbjct: 150 GSLEGQHFRKSGSMVSLSEQNLVGCSTDFGNNGCEGGLMDDAFKYIRANKGIDTEKSYPY 209

Query: 220 RHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSN-QPVSVCVDASGRAFHFYKSG 278
              +GTC + K+  V AT S + D+ +G E  L +AV+   P+SV +DAS  +F FY  G
Sbjct: 210 NGTDGTC-HFKKSTVGATDSGFVDIKEGSETQLKKAVATVGPISVAIDASHESFQFYSDG 268

Query: 279 VLN-ADCGN-NCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD-AGLCG 335
           V +  +C + + DHGV VVG+GT    NG  YW +KNSWG TWG+ GYIR+ R+    CG
Sbjct: 269 VYDEPECDSESLDHGVLVVGYGTL---NGTDYWFVKNSWGTTWGDEGYIRMSRNKKNQCG 325

Query: 336 IATAASYPV 344
           IA++AS P+
Sbjct: 326 IASSASIPL 334


>gi|390347681|ref|XP_801784.2| PREDICTED: cathepsin L1-like isoform 2 [Strongylocentrotus
           purpuratus]
          Length = 336

 Score =  258 bits (659), Expect = 3e-66,   Method: Compositional matrix adjust.
 Identities = 140/340 (41%), Positives = 213/340 (62%), Gaps = 19/340 (5%)

Query: 14  FVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYI 73
           F++ I ++ CA+      +   P +  +  +W   H ++Y +++ +  R  ++++N++ I
Sbjct: 6   FLVAIGLVACATAAFVKPT--NPDLDSRWLEWKIAHTKSYTNDMHELERRLVWEENVKMI 63

Query: 74  EKANKEGN---RTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTD 130
              N + +   + ++LG NE+ D+   E R+   GY     S +    + STF   +   
Sbjct: 64  NMHNLDHSLHKKGFRLGMNEYGDMRLHEVRSTMNGYK----SSNVTKVQGSTFLTPSNIQ 119

Query: 131 VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD- 189
           VP ++DWR KG VT +K+QGQCGSCWAFS   ++EG T     KL+ LSEQ LVDCS   
Sbjct: 120 VPDTVDWRTKGYVTPVKNQGQCGSCWAFSTTGSLEGQTFKKTSKLVSLSEQNLVDCSRTE 179

Query: 190 -NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGD 248
            N GC GGLMD+ F+Y+I+N G+ +E  YPY  E+ TC + K    +A ++ + D+  GD
Sbjct: 180 GNMGCEGGLMDQGFQYVIDNHGIDSEDCYPYDAEDETC-HYKASCDSAEVTGFTDVTSGD 238

Query: 249 EQALLQAVSN-QPVSVCVDASGRAFHFYKSGVLN-ADCGNN-CDHGVAVVGFGTAEEENG 305
           EQAL++AV++  PVSV +DAS ++F  Y+SGV +  +C ++  DHGV VVG+GT   + G
Sbjct: 239 EQALMEAVASVGPVSVAIDASHQSFQLYESGVYDEPECSSSELDHGVLVVGYGT---DGG 295

Query: 306 AKYWLIKNSWGETWGESGYIRILRD-AGLCGIATAASYPV 344
             YWL+KNSWGETWG SGYI++ R+ +  CGIAT+ASYP+
Sbjct: 296 KDYWLVKNSWGETWGLSGYIKMSRNKSNQCGIATSASYPL 335


>gi|355753449|gb|EHH57495.1| Cathepsin L1 [Macaca fascicularis]
          Length = 333

 Score =  258 bits (659), Expect = 3e-66,   Method: Compositional matrix adjust.
 Identities = 144/343 (41%), Positives = 206/343 (60%), Gaps = 23/343 (6%)

Query: 12  PMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLE 71
           P F++    +  AS  ++       S+  +  +W A H R Y    E+  R  ++++N++
Sbjct: 3   PTFILAAFCLGIASATLT----FNHSLEAQWTKWKAMHNRLYGMN-EEGWRRAVWEKNMK 57

Query: 72  YIEKANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNV 128
            IE  N+E   G  ++ +  N F D+T+EEFR +  G+       +R+  +   F+    
Sbjct: 58  MIELHNQEYSQGKHSFTMAMNTFGDMTSEEFRQVMNGFQ------NRKPRKGKVFQELLF 111

Query: 129 TDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCS- 187
            + P S+DWREKG VT +K+QGQCGSCWAFSA  A+EG      GKL+ LSEQ LVDCS 
Sbjct: 112 YEAPRSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSW 171

Query: 188 -TDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPK 246
              N GC+GGLMD AF+Y+ +N GL +E  YPY   E +C    E +V A  + + D+PK
Sbjct: 172 PQGNEGCNGGLMDYAFQYVADNGGLDSEESYPYEATEESCKYNPEYSV-ANDTGFVDIPK 230

Query: 247 GDEQALLQAVSN-QPVSVCVDASGRAFHFYKSGV-LNADCGN-NCDHGVAVVGFG-TAEE 302
             E+AL++AV+   P+SV +DA   +F FYK G+    DC + + DHGV VVG+G  + E
Sbjct: 231 -QEKALMKAVATVGPISVAIDAGHESFMFYKEGIYFEPDCSSEDMDHGVLVVGYGFESTE 289

Query: 303 ENGAKYWLIKNSWGETWGESGYIRILRD-AGLCGIATAASYPV 344
            + +KYWL+KNSWGE WG  GYI++ +D    CGIA+AASYP 
Sbjct: 290 SDNSKYWLVKNSWGEEWGMGGYIKMAKDRRNHCGIASAASYPT 332


>gi|308810026|ref|XP_003082322.1| cysteine protease-1 (ISS) [Ostreococcus tauri]
 gi|116060790|emb|CAL57268.1| cysteine protease-1 (ISS) [Ostreococcus tauri]
          Length = 430

 Score =  258 bits (659), Expect = 3e-66,   Method: Compositional matrix adjust.
 Identities = 141/337 (41%), Positives = 198/337 (58%), Gaps = 31/337 (9%)

Query: 37  SIVEKHEQWMAQHG--RTYKDELEKAMRLNIFKQNLEYIEKANKE---GNRTYKLGTNEF 91
           ++    E+W ++HG  R  +D  E A RL  F +N  Y+ + N     G  ++ +G N  
Sbjct: 93  ALARHFERWCSEHGLERYLRDTEEYAKRLATFAENAAYVVEHNALYAIGEVSHWVGLNSL 152

Query: 92  SDLTNEEFRALYTGYNRPV-----------PSVSRQSSRPSTFKYQNVTDVPTSIDWREK 140
           +  T EE+RAL  GY   +            S  +     ++++Y +V D P +IDW E 
Sbjct: 153 AATTREEYRALL-GYKPELRSSGDAEMLEATSTDKVEQYKASWEYASV-DPPEAIDWVEL 210

Query: 141 GAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTDNHGCSGGLMDK 200
           GAVT  K+QGQCGSCWAFS   AVEGIT+I  G+L+ LSEQ++V CS  N GC+GGLMD 
Sbjct: 211 GAVTPPKNQGQCGSCWAFSTTGAVEGITKIRTGRLVSLSEQEMVSCSKQNMGCNGGLMDY 270

Query: 201 AFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQP 260
           AF +I++N G+ +E  YPY  E   C+  K +   ATI  ++D+P GDE+ L +AVS QP
Sbjct: 271 AFRWIVKNGGIDSEFQYPYSAEALACNRWKLQLHVATIDGFKDVPPGDEKELEKAVSQQP 330

Query: 261 VSVCVDASGRAFHFYKSGVLNA-DCGNNCDHGVAVVGFG--------TAEEENGAKYWLI 311
           VS+ ++A  ++F  Y  GV ++ +CG+  DHGV VVG+G        T   +    +W +
Sbjct: 331 VSIAIEADTKSFQLYDGGVYDSKECGSQVDHGVLVVGYGFDDTHHNATKHHKRHRHFWKV 390

Query: 312 KNSWGETWGESGYIRILR----DAGLCGIATAASYPV 344
           KNSWG TWGE G+IR+ R    + G CGI TA SYP 
Sbjct: 391 KNSWGGTWGEGGFIRMARRISDETGQCGITTAPSYPT 427


>gi|308321226|gb|ADO27765.1| cathepsin S [Ictalurus furcatus]
          Length = 329

 Score =  258 bits (659), Expect = 3e-66,   Method: Compositional matrix adjust.
 Identities = 144/340 (42%), Positives = 200/340 (58%), Gaps = 20/340 (5%)

Query: 13  MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEY 72
           M   ++  + C + V    ++ +PS+      W   H +TY  ELE+  R  I+++NL  
Sbjct: 1   MLRSLLFTVICGAVV----ALQDPSLDMHWLMWKKNHSKTYTSELEELGRREIWERNLRL 56

Query: 73  IEKANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
           I   N E   G  TY LG N   D+T EE   ++ G  R  P+++R   R S F      
Sbjct: 57  ITVHNLEASLGMHTYDLGMNHMGDMTREEILQMFAG-TRVRPNLTR---RSSPFVASAGI 112

Query: 130 DVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD 189
            VP S+DWREKG VT +K+QG CGSCWAFSA  A+EG  + T G++  LS Q LVDCS+ 
Sbjct: 113 SVPDSVDWREKGYVTEVKNQGSCGSCWAFSAAGALEGQLKRTTGQVKSLSPQNLVDCSSK 172

Query: 190 --NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKG 247
             N GC+GG M +AF+Y+I++ G+ ++  YPY   +G C   + +  AA  S Y  + +G
Sbjct: 173 YGNKGCNGGFMTQAFQYVIDDGGIDSDEAYPYTAMDGQCRYDQSQR-AANCSSYNYVSEG 231

Query: 248 DEQALLQAVSN-QPVSVCVDASGRAFHFYKSGVL-NADCGNNCDHGVAVVGFGTAEEENG 305
           DE+AL QAV+   P+SV +DA+   F  Y SGV  +  C  N +HGV VVG+G+    NG
Sbjct: 232 DEEALKQAVATIGPISVAIDATRPMFILYHSGVYSDPTCTQNVNHGVLVVGYGSL---NG 288

Query: 306 AKYWLIKNSWGETWGESGYIRILRDAG-LCGIATAASYPV 344
             YWL+KNSWG  +G+ GYIRI R+ G +CGIA  A YP+
Sbjct: 289 EDYWLVKNSWGTRFGDGGYIRIARNKGNMCGIANYACYPL 328


>gi|1709574|sp|P10056.2|PAPA3_CARPA RecName: Full=Caricain; AltName: Full=Papaya peptidase A; AltName:
           Full=Papaya proteinase III; Short=PPIII; AltName:
           Full=Papaya proteinase omega; Flags: Precursor
 gi|18098|emb|CAA46862.1| proteinase omega [Carica papaya]
          Length = 348

 Score =  258 bits (659), Expect = 3e-66,   Method: Compositional matrix adjust.
 Identities = 139/351 (39%), Positives = 198/351 (56%), Gaps = 16/351 (4%)

Query: 1   MVLKFEKSFIIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQ----WMAQHGRTYKDE 56
           M+    K   + + + + + ++     + G S  + +  E+  Q    WM  H + Y++ 
Sbjct: 3   MIPSISKLLFVAICLFVHMSVSFGDFSIVGYSQDDLTSTERLIQLFNSWMLNHNKFYENV 62

Query: 57  LEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQ 116
            EK  R  IFK NL YI++ NK+ N +Y LG NEF+DL+N+EF   Y G    +   + +
Sbjct: 63  DEKLYRFEIFKDNLNYIDETNKK-NNSYWLGLNEFADLSNDEFNEKYVG---SLIDATIE 118

Query: 117 SSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLI 176
            S    F  ++  ++P ++DWR+KGAVT ++ QG CGSCWAFSAVA VEGI +I  GKL+
Sbjct: 119 QSYDEEFINEDTVNLPENVDWRKKGAVTPVRHQGSCGSCWAFSAVATVEGINKIRTGKLV 178

Query: 177 ELSEQQLVDCSTDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAA 236
           ELSEQ+LVDC   +HGC GG    A EY+ +N G+   + YPY+ ++GTC  ++      
Sbjct: 179 ELSEQELVDCERRSHGCKGGYPPYALEYVAKN-GIHLRSKYPYKAKQGTCRAKQVGGPIV 237

Query: 237 TISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVG 296
             S    +   +E  LL A++ QPVSV V++ GR F  YK G+    CG   DH V  V 
Sbjct: 238 KTSGVGRVQPNNEGNLLNAIAKQPVSVVVESKGRPFQLYKGGIFEGPCGTKVDHAVTAV- 296

Query: 297 FGTAEEENGAKYWLIKNSWGETWGESGYIRILR----DAGLCGIATAASYP 343
                +  G  Y LIKNSWG  WGE GYIRI R      G+CG+  ++ YP
Sbjct: 297 --GYGKSGGKGYILIKNSWGTAWGEKGYIRIKRAPGNSPGVCGLYKSSYYP 345


>gi|357130488|ref|XP_003566880.1| PREDICTED: fruit bromelain-like [Brachypodium distachyon]
          Length = 356

 Score =  258 bits (659), Expect = 3e-66,   Method: Compositional matrix adjust.
 Identities = 138/320 (43%), Positives = 188/320 (58%), Gaps = 20/320 (6%)

Query: 41  KHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFR 100
           +HE+WMA+ GR Y D  EKA R  +F  N  Y++  N+ GNRTY LG N+FSDLT++EF 
Sbjct: 38  RHEEWMAKFGRVYTDAQEKARRQEVFGANARYVDAVNRAGNRTYTLGLNKFSDLTDDEFV 97

Query: 101 ALYTGYNRPVPSVSRQS----SRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCW 156
             + GY        R      S+ +   Y    D+P S+DWR +GAVT +K+QG CG CW
Sbjct: 98  QTHLGYRGHQQGGLRPEEENVSKVAALGYGQA-DMPESVDWRAQGAVTGVKNQGSCGCCW 156

Query: 157 AFSAVAAVEGITQITRGKLIELSEQQLVDCSTDNHG------CSGGLMDKAFEYIIENKG 210
           AF+AVAA EG+ +I  G LI +SEQQ++DC+  + G      C GG +D A  Y+  ++G
Sbjct: 157 AFAAVAATEGLVKIATGNLISMSEQQVLDCTGQSPGMGNTNTCDGGHIDDALRYVAASRG 216

Query: 211 LATEADYPYRHEEGTCDNQKEKAVAATISKYEDLP-KGDEQALLQAVSNQPVSVCVDASG 269
           L  EA Y Y   +G C +      AA+  + + +  +GDE  L   V+ QP++V V+AS 
Sbjct: 217 LQPEAAYAYTGLQGACQSGFTPNSAASFGEPQTVTLQGDEGRLQGLVAGQPIAVSVEAS- 275

Query: 270 RAFHFYKSGVLNA---DCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIR 326
             F  Y SGV  A    CG   +H V VVG+G+A  + G +YWL+KN WG +WGE GY+R
Sbjct: 276 DDFRHYMSGVFTAGTSSCGQRLNHAVTVVGYGSA--DGGQEYWLVKNQWGTSWGEGGYMR 333

Query: 327 ILRDAGL--CGIATAASYPV 344
           I R  G   CGI+  A YP 
Sbjct: 334 IARGNGAPNCGISAYAYYPT 353


>gi|81294188|gb|AAI08032.1| Cathepsin L, 1 b [Danio rerio]
          Length = 336

 Score =  258 bits (659), Expect = 3e-66,   Method: Compositional matrix adjust.
 Identities = 141/341 (41%), Positives = 203/341 (59%), Gaps = 16/341 (4%)

Query: 14  FVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYI 73
            +  +LV    S V +  S+ +  + +    W +QHG++Y +++E   R+ I+++NL  I
Sbjct: 1   MMFALLVTLYISAVFAAPSI-DIQLDDHWNSWKSQHGKSYHEDVEVGRRM-IWEENLRKI 58

Query: 74  EKANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTD 130
           E+ N E   GN T+K+G N+F D+TNEEFR    GY         Q+S+   F   +   
Sbjct: 59  EQHNFEYSYGNHTFKMGMNQFGDMTNEEFRQAMNGYTHD----PNQTSQGPLFMEPSFFA 114

Query: 131 VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD- 189
            P  +DWR++G VT +KDQ QCGSCW+FS+  A+EG      GKLI +SEQ LVDCS   
Sbjct: 115 APQQVDWRQRGYVTPVKDQKQCGSCWSFSSTGALEGQLFRKTGKLISMSEQNLVDCSRPQ 174

Query: 190 -NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGD 248
            N GC+GGLMD AF+Y+ ENKGL +E  YPY   +        +   A I+ + D+P G+
Sbjct: 175 GNQGCNGGLMDLAFQYVKENKGLDSEQSYPYLARDDLPCRYDPRFNVAKITGFVDIPSGN 234

Query: 249 EQALLQAVSN-QPVSVCVDASGRAFHFYKSGVL--NADCGNNCDHGVAVVGFG-TAEEEN 304
           E AL+ AV+   PVSV +DAS ++  FY+SG+    A   +  DH V VVG+G    +  
Sbjct: 235 ELALMNAVAAVGPVSVAIDASHQSLQFYQSGIYYERACSSSRLDHAVLVVGYGYQGADVA 294

Query: 305 GAKYWLIKNSWGETWGESGYIRILRDA-GLCGIATAASYPV 344
           G +YW++KNSW + WG+ GYI + +D    CG+AT ASYP+
Sbjct: 295 GNRYWIVKNSWSDKWGDKGYIYMAKDKNNHCGVATKASYPL 335


>gi|238481789|gb|ACR43934.1| cathepsin L-like cysteine proteinase [Haliotis diversicolor
           supertexta]
          Length = 347

 Score =  258 bits (659), Expect = 3e-66,   Method: Compositional matrix adjust.
 Identities = 149/307 (48%), Positives = 195/307 (63%), Gaps = 18/307 (5%)

Query: 48  QHGRTYKDELEKAMRLNIFKQNLEYIEKANKE---GNRTYKLGTNEFSDLTNEEFRALYT 104
           QHGR Y+   E+  R  IFKQNL+YIE+ NK+   G ++Y LG N+F+D+ NEEFR +Y 
Sbjct: 48  QHGRLYEKHEEEEERFEIFKQNLQYIEEHNKKFSLGQKSYYLGINQFADMKNEEFR-MYN 106

Query: 105 GYNRPVP-SVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAA 163
           G  R    S   Q S   T +Y      P  +DWR+KG VT +K+QGQCGSCW+FS   +
Sbjct: 107 GLRRDYNYSREVQCSNHLTPEY---LVAPDEVDWRKKGYVTAVKNQGQCGSCWSFSTTGS 163

Query: 164 VEGITQITRGKLIELSEQQLVDCSTD--NHGCSGGLMDKAFEYIIENKGLATEADYPYRH 221
           +EG      GKL+ LSEQQLVDCS    N GC+GGLMD+AFEYII N G+ TE +YPY  
Sbjct: 164 LEGQHFHKSGKLVSLSEQQLVDCSGKFGNEGCNGGLMDQAFEYIITNGGIETEEEYPYDA 223

Query: 222 EEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSN-QPVSVCVDASGRAFHFYKSGVL 280
            +  C  +K + VAAT S   D+  GDE  L  +V+   PVS+ +DAS ++F  Y  GV 
Sbjct: 224 RQERCHFKKSE-VAATASGCVDVKSGDETDLKNSVAEVGPVSIAIDASHQSFQLYSGGVY 282

Query: 281 N-ADCGNN-CDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRDA-GLCGIA 337
           +   C +   DHGV VVG+GT   ++G  YWL+KNSWG TWG  GY+++ R+    CG+A
Sbjct: 283 DEPKCSSTELDHGVLVVGYGT---DDGQDYWLVKNSWGTTWGLEGYVKMSRNQDNQCGVA 339

Query: 338 TAASYPV 344
           T ASYP+
Sbjct: 340 TQASYPL 346


>gi|405966498|gb|EKC31776.1| Cathepsin L [Crassostrea gigas]
          Length = 330

 Score =  258 bits (658), Expect = 3e-66,   Method: Compositional matrix adjust.
 Identities = 147/310 (47%), Positives = 192/310 (61%), Gaps = 20/310 (6%)

Query: 45  WMAQHGRTYKDELEKAMRLNIFKQNLEYIEKAN---KEGNRTYKLGTNEFSDLTNEEFRA 101
           ++  HG+ Y  E E+A R  I++ NL+YIEK N     G+ ++ LG NE+ D+TNEEFR+
Sbjct: 30  YLKAHGKQYGAE-EEARRRVIWEGNLDYIEKHNLAADRGDYSFWLGMNEYGDMTNEEFRS 88

Query: 102 LYTGYNRPVPSVSRQSSRPSTF-KYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSA 160
              GY      +   +SR S +    N+ D+P ++DWR KG VT IK+QGQCGSCW+FSA
Sbjct: 89  TMNGYK-----MRNGTSRGSLYLPPSNIGDLPDTVDWRPKGYVTPIKNQGQCGSCWSFSA 143

Query: 161 VAAVEGITQITRGKLIELSEQQLVDCSTD--NHGCSGGLMDKAFEYIIENKGLATEADYP 218
             ++EG T    GKL  LSEQ LVDCS    NHGC GGLMD AF+YI +N G+ TE+ YP
Sbjct: 144 TGSLEGQTFKKTGKLPSLSEQNLVDCSQKQGNHGCQGGLMDDAFQYIKDNSGIDTESSYP 203

Query: 219 YRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSN-QPVSVCVDASGRAFHFYKS 277
           Y  + G C       V AT S + D+    E  L  AV+   P+SV +DAS  +F  Y+S
Sbjct: 204 YEAKNGKCRFNAAN-VGATDSGFTDIKSKSESDLQSAVATVGPISVAIDASHMSFQLYRS 262

Query: 278 GVLNA-DCG-NNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRDA-GLC 334
           GV +   C     DHGV  VG+GT   E+G  YWL+KNSWGE+WG+ GYI + R+    C
Sbjct: 263 GVYHEFFCSETRLDHGVLAVGYGT---ESGKDYWLVKNSWGESWGQKGYIMMSRNKRNNC 319

Query: 335 GIATAASYPV 344
           GIAT+ASYP 
Sbjct: 320 GIATSASYPT 329


>gi|21483184|gb|AAF86584.1| cathepsin L cysteine protease [Haemonchus contortus]
          Length = 355

 Score =  258 bits (658), Expect = 3e-66,   Method: Compositional matrix adjust.
 Identities = 144/304 (47%), Positives = 189/304 (62%), Gaps = 14/304 (4%)

Query: 50  GRTYKDELEKAMRLNIFKQNLEYIEKANKE---GNRTYKLGTNEFSDLTNEEFRALYTGY 106
           G++Y+ E E    +  F +N+ +IE+ NKE   G +T+++G NE +DL   ++R L  GY
Sbjct: 56  GKSYEPEEENDY-MEAFVKNVIHIEEHNKEHRLGRKTFEMGLNEIADLPFSQYRKL-NGY 113

Query: 107 NRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEG 166
                      S  + F       +P S+DWRE+G VT +K+QG CGSCWAFS+  A+EG
Sbjct: 114 RMRRQFGDSMQSNGTKFLVPFNVQIPESVDWREEGLVTPVKNQGMCGSCWAFSSTGALEG 173

Query: 167 ITQITRGKLIELSEQQLVDCSTD--NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEG 224
                 GKL+ LSEQ LVDCST   NHGC+GGLMD AFEYI EN G+ TE  YPY   E 
Sbjct: 174 QHARATGKLVSLSEQNLVDCSTKYGNHGCNGGLMDLAFEYIKENHGVDTEDSYPYVGRET 233

Query: 225 TCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQ-PVSVCVDASGRAFHFYKSGV-LNA 282
            C + K   V A    + DLP+GDE+AL +AV+ Q P+S+ +DA  R+F  YK GV  + 
Sbjct: 234 KC-HFKRNTVGADDKGFVDLPEGDEEALKKAVATQGPISIAIDAGHRSFQLYKKGVYFDE 292

Query: 283 DCGN-NCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRDA-GLCGIATAA 340
           +C +   DHGV +VG+GT  E     YWL+KNSWG TWGE GYIRI R+    CG+AT A
Sbjct: 293 ECSSEELDHGVLLVGYGTDPE--AGDYWLVKNSWGPTWGEKGYIRIARNRNNHCGVATKA 350

Query: 341 SYPV 344
           SYP+
Sbjct: 351 SYPL 354


>gi|358345461|ref|XP_003636796.1| Cysteine proteinase [Medicago truncatula]
 gi|355502731|gb|AES83934.1| Cysteine proteinase [Medicago truncatula]
          Length = 475

 Score =  258 bits (658), Expect = 3e-66,   Method: Compositional matrix adjust.
 Identities = 136/319 (42%), Positives = 183/319 (57%), Gaps = 31/319 (9%)

Query: 35  EPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRT--YKLGTNEFS 92
           E  +VE  +QW  +H + Y    E A+RL  FK+NL+YI + N   N    + LG N F+
Sbjct: 44  EEQVVELFQQWKKEHQKFYIHPEEAALRLENFKRNLKYIVERNAMRNSPVGHHLGLNRFA 103

Query: 93  DLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQC 152
           D++NEEF+  +                    K ++  D P S+DWR+KG VT +KDQG C
Sbjct: 104 DMSNEEFKNKFIS------------------KVESCDDAPYSLDWRKKGVVTGVKDQGNC 145

Query: 153 GSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTDNHGCSGGLMDKAFEYIIENKGLA 212
           GSCW+FS+  A+EG+  I  G LI LSEQ+LVDC T N GC GG MD AFE++I N G+ 
Sbjct: 146 GSCWSFSSTGAIEGVNAIVTGDLISLSEQELVDCDTTNDGCEGGYMDYAFEWVINNGGID 205

Query: 213 TEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAF 272
           TEADYPY    GTC+  KE+    TI  Y D+ + D  AL  A   QP+SV +D S   F
Sbjct: 206 TEADYPYIGVGGTCNVTKEETKVVTIDGYTDVTQSD-SALFCATVKQPISVGIDGSTLDF 264

Query: 273 HFYKSGVLNADCGNN---CDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILR 329
             Y  G+ + DC +N    DH V +VG+G+   +    YW++KNSWG +WG  G+I I R
Sbjct: 265 QLYTGGIYDGDCSSNPDDIDHAVLIVGYGS---DGNQDYWIVKNSWGTSWGIEGFIYIRR 321

Query: 330 DA----GLCGIATAASYPV 344
           +     G+C I   AS+P 
Sbjct: 322 NTNLKYGVCAINYMASFPT 340


>gi|405958751|gb|EKC24845.1| Cathepsin L [Crassostrea gigas]
          Length = 330

 Score =  258 bits (658), Expect = 3e-66,   Method: Compositional matrix adjust.
 Identities = 147/310 (47%), Positives = 192/310 (61%), Gaps = 20/310 (6%)

Query: 45  WMAQHGRTYKDELEKAMRLNIFKQNLEYIEKAN---KEGNRTYKLGTNEFSDLTNEEFRA 101
           ++  HG+ Y  E E+A R  I++ NL+YIEK N     G+ ++ LG NE+ D+TNEEFR+
Sbjct: 30  YLKAHGKQYGAE-EEARRRVIWEGNLDYIEKHNLAADRGDYSFWLGMNEYGDMTNEEFRS 88

Query: 102 LYTGYNRPVPSVSRQSSRPSTF-KYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSA 160
              GY      +   +SR S +    N+ D+P ++DWR KG VT IK+QGQCGSCW+FSA
Sbjct: 89  TMNGY-----KMRNGTSRGSLYLPPSNIGDLPDTVDWRPKGYVTPIKNQGQCGSCWSFSA 143

Query: 161 VAAVEGITQITRGKLIELSEQQLVDCSTD--NHGCSGGLMDKAFEYIIENKGLATEADYP 218
             ++EG T    GKL  LSEQ LVDCS    NHGC GGLMD AF+YI +N G+ TE+ YP
Sbjct: 144 TGSLEGQTFKKTGKLPSLSEQNLVDCSQKQGNHGCQGGLMDDAFQYIKDNNGIDTESSYP 203

Query: 219 YRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSN-QPVSVCVDASGRAFHFYKS 277
           Y  + G C       V AT S + D+    E  L  AV+   P++V +DAS  +F  YKS
Sbjct: 204 YEAKNGKCRFNAAN-VGATDSGFTDIKSKSESDLQSAVATVGPIAVAIDASHMSFQLYKS 262

Query: 278 GVLNA-DCG-NNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRDA-GLC 334
           GV +   C     DHGV  VG+GT   E+G  YWL+KNSWGE+WG+ GYI + R+    C
Sbjct: 263 GVYHEFFCSETRLDHGVLAVGYGT---ESGKDYWLVKNSWGESWGQKGYIMMSRNKRNNC 319

Query: 335 GIATAASYPV 344
           GIAT+ASYP 
Sbjct: 320 GIATSASYPT 329


>gi|355567871|gb|EHH24212.1| Cathepsin L1 [Macaca mulatta]
          Length = 333

 Score =  258 bits (658), Expect = 4e-66,   Method: Compositional matrix adjust.
 Identities = 144/343 (41%), Positives = 206/343 (60%), Gaps = 23/343 (6%)

Query: 12  PMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLE 71
           P F++    +  AS  ++       S+  +  +W A H R Y    E+  R  ++++N++
Sbjct: 3   PTFILAAFCLGIASATLT----FNHSLEAQWTKWKAMHNRLYGMN-EEGWRRAVWEKNMK 57

Query: 72  YIEKANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNV 128
            IE  N+E   G  ++ +  N F D+T+EEFR +  G+       +R+  +   F+    
Sbjct: 58  MIELHNQEYSQGKHSFTMAMNTFGDMTSEEFRQVMNGFQ------NRKPRKGKVFQEPLF 111

Query: 129 TDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCS- 187
            + P S+DWREKG VT +K+QGQCGSCWAFSA  A+EG      GKL+ LSEQ LVDCS 
Sbjct: 112 YEAPRSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSG 171

Query: 188 -TDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPK 246
              N GC+GGLMD AF+Y+ +N GL +E  YPY   E +C    E +V A  + + D+PK
Sbjct: 172 PQGNEGCNGGLMDYAFQYVADNGGLDSEEAYPYEATEESCKYNPEYSV-ANDTGFVDIPK 230

Query: 247 GDEQALLQAVSN-QPVSVCVDASGRAFHFYKSGV-LNADCGN-NCDHGVAVVGFG-TAEE 302
             E+AL++AV+   P+SV +DA   +F FYK G+    DC + + DHGV VVG+G  + E
Sbjct: 231 -QEKALMKAVATVGPISVAIDAGHESFMFYKEGIYFEPDCSSEDMDHGVLVVGYGFESTE 289

Query: 303 ENGAKYWLIKNSWGETWGESGYIRILRD-AGLCGIATAASYPV 344
            + +KYWL+KNSWGE WG  GYI++ +D    CGIA+AASYP 
Sbjct: 290 SDNSKYWLVKNSWGEEWGMGGYIKMAKDRRNHCGIASAASYPT 332


>gi|330434686|gb|AEC22811.1| cathepsin L [Macrobrachium nipponense]
          Length = 342

 Score =  258 bits (658), Expect = 4e-66,   Method: Compositional matrix adjust.
 Identities = 140/322 (43%), Positives = 199/322 (61%), Gaps = 20/322 (6%)

Query: 38  IVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANK---EGNRTYKLGTNEFSDL 94
           ++E+ E +  +H + Y+ + E+  R+ IF +N + I   NK    G++TYKLG N++ D+
Sbjct: 25  VMEEWESFKFEHSKKYESDTEETFRMKIFAENKQKIAAHNKLYHTGSKTYKLGMNKYGDM 84

Query: 95  TNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTD------VPTSIDWREKGAVTHIKD 148
            + EF  +  G+         +++R   F+  +  +      +P S+DWREKGAVT +KD
Sbjct: 85  LHHEFVNMMNGFRANTSGAGYKANR--GFQGAHFVEPPEDVVMPKSVDWREKGAVTEVKD 142

Query: 149 QGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD--NHGCSGGLMDKAFEYII 206
           QG CGSCWAFSA  A+EG      G L+ LSEQ LVDCS+   N+GC+GGLMD AF+YI 
Sbjct: 143 QGSCGSCWAFSATGALEGQHYRQTGDLVSLSEQNLVDCSSKFGNNGCNGGLMDNAFQYIK 202

Query: 207 ENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSN-QPVSVCV 265
            N G+ TE  YPY  E+  C      A  A    + D+ +G+E AL +A++   PVSV +
Sbjct: 203 VNGGIDTEKSYPYEAEDEPCRYNPANA-GADDRGFVDVREGNENALKKAIATIGPVSVAI 261

Query: 266 DASGRAFHFYKSGVL-NADC-GNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESG 323
           DAS  +F FY+ GV  + DC   N DHGV  VG+GT E+  G  YWL+KNSW ++WG+ G
Sbjct: 262 DASQDSFQFYQHGVYSDPDCSAENLDHGVLAVGYGTTED--GQDYWLVKNSWSKSWGDQG 319

Query: 324 YIRILRDA-GLCGIATAASYPV 344
           YI+I R+   +CGIA+AASYP+
Sbjct: 320 YIKIARNQNNMCGIASAASYPL 341


>gi|84660246|emb|CAI43320.1| cathepsin L [Lubomirskia baicalensis]
 gi|85677150|emb|CAI46307.1| cathepsin L [Lubomirskia baicalensis]
          Length = 327

 Score =  258 bits (658), Expect = 4e-66,   Method: Compositional matrix adjust.
 Identities = 142/313 (45%), Positives = 194/313 (61%), Gaps = 16/313 (5%)

Query: 40  EKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNR-TYKLGTNEFSDLTNEE 98
           E+ E W  +HG+ Y  + E+  R  I++ N +Y+++ N    +  + +G N+F+DL + E
Sbjct: 20  EEWESWKKEHGKVYNSDREELTRHIIWQANRKYVDEHNAHAEKFGFTVGMNQFADLESSE 79

Query: 99  FRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAF 158
           F  LY GYN   PS+ +  S+  + K   V D+PTS+DWR KG VT IK+QGQCGSCWAF
Sbjct: 80  FGRLYNGYNNK-PSMKKAQSKVFSTK---VGDLPTSVDWRTKGFVTAIKNQGQCGSCWAF 135

Query: 159 SAVAAVEGITQITRGKLIELSEQQLVDCST--DNHGCSGGLMDKAFEYIIENKGLATEAD 216
           SAVA +EG      G L+ LSEQ LVDCST   N GC+GGLMD AF+Y+I+N G+ TEA 
Sbjct: 136 SAVAGLEGQHFNATGTLVSLSEQNLVDCSTAEGNQGCNGGLMDNAFQYVIKNGGIDTEAS 195

Query: 217 YPYRHEEGTCDNQKEKAVAATISKYED-LP-KGDEQALLQAVSNQPVSVCVDASGRAFHF 274
           YPY+  +  C       V +T S + D LP K +    +      P+SV +DAS  +F  
Sbjct: 196 YPYKAVDQKCKFNAAN-VGSTCSGFSDILPHKSEAALQVAVAVVGPISVAIDASHTSFQL 254

Query: 275 YKSGVL--NADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRDA- 331
           YKSGV   +A    + DHGV  VG+   +  +G  YW++KNSWG TWG++GYI + R+  
Sbjct: 255 YKSGVYSESACSQTSLDHGVTAVGY---DSSSGVAYWIVKNSWGTTWGQAGYIWMSRNKN 311

Query: 332 GLCGIATAASYPV 344
             CGIATAASYP+
Sbjct: 312 NQCGIATAASYPI 324


>gi|357122137|ref|XP_003562772.1| PREDICTED: fruit bromelain-like [Brachypodium distachyon]
          Length = 358

 Score =  258 bits (658), Expect = 4e-66,   Method: Compositional matrix adjust.
 Identities = 144/332 (43%), Positives = 199/332 (59%), Gaps = 35/332 (10%)

Query: 38  IVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNE 97
           ++++   + A + RTY    E+  R  ++++N++YIE  N+ G+ TY+LG N+F+DLT +
Sbjct: 36  MMDRFRAFQATYNRTYASPEERLRRFEVYRRNVDYIEAMNRRGDLTYELGENQFADLTVQ 95

Query: 98  EFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDV---------------------PTSID 136
           EFRA+YT     +P+  R  SRP  ++ + +                        PTS+D
Sbjct: 96  EFRAMYT-----MPA--RVDSRPDAWRRRQMITTLAGPVTEDGGSYYSDAWEEAGPTSVD 148

Query: 137 WREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTDNHGCSGG 196
           WR KGAVT +KDQG CG CWAF+ VA +EG+ +I  G+L+ LSEQ+LVDC   + GC GG
Sbjct: 149 WRSKGAVTPVKDQGGCGCCWAFATVATIEGLHKIKTGQLVSLSEQELVDCDDADDGCGGG 208

Query: 197 LMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAV 256
           L + A E++  N GL TEA+YPY  + G CD  K    AA I+  + +    E  L +AV
Sbjct: 209 LPEIAMEWVAHNGGLTTEANYPYTGKAGKCDRGKASNHAAKIAAAQMVRANSEAELERAV 268

Query: 257 SNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWG 316
           + QPV+V ++A   +  FYKSGV +  C    DH V VVG+G   +  G KYW+IKNSW 
Sbjct: 269 ARQPVAVAINAPD-SLMFYKSGVYSGPCTAEFDHAVTVVGYGA--DNKGHKYWIIKNSWA 325

Query: 317 ETWGESGYIRILRDA----GLCGIATAASYPV 344
           ETWGE GY R+ R      GLCGIAT ASYPV
Sbjct: 326 ETWGEKGYGRMQRGVAAKEGLCGIATHASYPV 357


>gi|288548566|gb|ADC52431.1| cathepsin L2 cysteine protease [Pinctada fucata]
          Length = 330

 Score =  258 bits (658), Expect = 4e-66,   Method: Compositional matrix adjust.
 Identities = 142/306 (46%), Positives = 191/306 (62%), Gaps = 18/306 (5%)

Query: 48  QHGRTYKDELEKAMRLNIFKQNLEYIEKAN---KEGNRTYKLGTNEFSDLTNEEFRALYT 104
           Q+ + Y++E E   RL +++ NL++I   N     G  T+ +G NE+ D+TNEEF     
Sbjct: 33  QYNKLYQNEEEARRRL-VWESNLDFITLHNLAADRGEHTFWVGMNEYGDMTNEEFTKTMN 91

Query: 105 GYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAV 164
           GY       ++ S+ P      N+ D+P ++DWR KG VT IK+QGQCGSCW+FSA  ++
Sbjct: 92  GYRMR----NKTSNAPVFMPPNNMGDLPDTVDWRPKGYVTPIKNQGQCGSCWSFSATGSL 147

Query: 165 EGITQITRGKLIELSEQQLVDCSTD--NHGCSGGLMDKAFEYIIENKGLATEADYPYRHE 222
           EG T    GKL+ LSEQ LVDCS    NHGC GGLMD AF YI  N G+ TEA YPY+  
Sbjct: 148 EGQTFKKTGKLVSLSEQNLVDCSKKQGNHGCEGGLMDDAFTYIKANNGIDTEASYPYKAR 207

Query: 223 EGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSN-QPVSVCVDASGRAFHFYKSGVL- 280
           +G C+  K   V AT + + D+   DE+AL QAV+   P+SV +DAS  +F  Y++GV  
Sbjct: 208 DGKCE-FKSADVGATDTGFVDIKTKDEEALKQAVATVGPISVAIDASHMSFQLYRTGVYH 266

Query: 281 NADCGNN-CDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD-AGLCGIAT 338
           +  C     DHGV  VG+GT   E+   YWL+KNSWGE+WG+ GYI++ R+    CGIAT
Sbjct: 267 DWFCSQTKLDHGVLAVGYGT---EDSKDYWLVKNSWGESWGQKGYIQMSRNRRNNCGIAT 323

Query: 339 AASYPV 344
           +ASYP 
Sbjct: 324 SASYPT 329


>gi|297663703|ref|XP_002810310.1| PREDICTED: LOW QUALITY PROTEIN: cathepsin S [Pongo abelii]
          Length = 330

 Score =  258 bits (658), Expect = 4e-66,   Method: Compositional matrix adjust.
 Identities = 148/341 (43%), Positives = 209/341 (61%), Gaps = 23/341 (6%)

Query: 13  MFVIIILVITCASQVVSGRSMHEPSIVEKH-EQWMAQHGRTYKDELEKAMRLNIFKQNLE 71
           M  ++ +++ C+S V     +H+   ++ H   W   +G+ YK++ E+A+R  I+++NL+
Sbjct: 1   MKQLVCVLLVCSSAVAQ---LHKDPTLDHHWHLWKKTYGKQYKEKNEEAVRRLIWEKNLK 57

Query: 72  YIEKANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNV 128
           ++   N E   G  +Y LG N   D+T+EE  +L +     VPS   Q  R  T+K    
Sbjct: 58  FVMIHNLEHSMGMHSYDLGMNHLGDMTSEEVMSLMSSLR--VPS---QWQRNITYKSNPN 112

Query: 129 TDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST 188
             +P S+DWREKG VT +K QG CG+CWAFSAV A+E   ++  GKL+ LS Q LVDCST
Sbjct: 113 RILPDSVDWREKGCVTEVKYQGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCST 172

Query: 189 D---NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLP 245
           +   N GC+GG M  AF+YII+NKG+ ++A YPY+       + K +  AAT SKY D  
Sbjct: 173 EKYGNKGCNGGFMTTAFQYIIDNKGIDSDASYPYKAMVKCQYDSKYR--AATCSKYTDFX 230

Query: 246 KGDEQALLQAVSNQ-PVSVCVDASGRAFHFYKSGV-LNADCGNNCDHGVAVVGFGTAEEE 303
            G E  L +AV+N+ PVSV VDA   +F  Y+SGV     C  N +HGV VVG+G   + 
Sbjct: 231 YGREDVLKEAVANKGPVSVGVDARHPSFFLYRSGVYYEPSCTQNVNHGVLVVGYG---DL 287

Query: 304 NGAKYWLIKNSWGETWGESGYIRILRDAG-LCGIATAASYP 343
           NG +YWL+KNSWG  +GE GYIR+ R+ G  CGIA+  S+P
Sbjct: 288 NGKEYWLVKNSWGRNFGEEGYIRMARNKGNHCGIASFPSFP 328


>gi|217072410|gb|ACJ84565.1| unknown [Medicago truncatula]
          Length = 328

 Score =  258 bits (658), Expect = 4e-66,   Method: Compositional matrix adjust.
 Identities = 124/220 (56%), Positives = 155/220 (70%), Gaps = 9/220 (4%)

Query: 131 VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD- 189
           +P S+DWR++GAV  +KDQ  CGSCWAFSA+AAVEGI +I  G LI LSEQ+LVDC T  
Sbjct: 24  LPESVDWRKEGAVVGVKDQASCGSCWAFSAIAAVEGINKIVTGDLISLSEQELVDCDTSY 83

Query: 190 NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDE 249
           N GC+GGLMD AFE+II N G+ +E DYPY+  +G CD  ++ A   TI  YED+P  DE
Sbjct: 84  NEGCNGGLMDYAFEFIISNGGIDSEDDYPYKAVDGRCDQNRKNAKVVTIDDYEDVPAYDE 143

Query: 250 QALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYW 309
            AL +AV+NQP++V V+  GR F  Y+ GVL   CG   DHGVA VG+GT   ENG  YW
Sbjct: 144 LALQKAVANQPIAVAVEGGGREFQLYEYGVLTGRCGTALDHGVAAVGYGT---ENGKDYW 200

Query: 310 LIKNSWGETWGESGYIRILRD-----AGLCGIATAASYPV 344
           +++NSWG +WGE GYIR+ R+     AG CGIA   SYP+
Sbjct: 201 IVRNSWGGSWGEQGYIRLERNLASSRAGKCGIAIEPSYPI 240


>gi|71897043|ref|NP_001026516.1| cathepsin S precursor [Gallus gallus]
 gi|53126701|emb|CAG30977.1| hypothetical protein RCJMB04_1f23 [Gallus gallus]
          Length = 328

 Score =  258 bits (658), Expect = 4e-66,   Method: Compositional matrix adjust.
 Identities = 147/339 (43%), Positives = 201/339 (59%), Gaps = 27/339 (7%)

Query: 13  MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEY 72
           M V++ LV       V G    +P++ +  + W   HG+ Y+ + E+  R   +++NL  
Sbjct: 7   MAVLVTLV------AVMGHP--DPTLDQHWQLWKKAHGKEYRHQAEEGQRRATWEKNLRL 58

Query: 73  IEKANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
           +   N E   G  +Y+LG N   D+T+E+  AL TG    VP    Q+S      Y+   
Sbjct: 59  VMLHNLEHSLGLHSYQLGMNHMGDMTSEDVAALLTGLR--VPYGHNQTS-----TYRRRG 111

Query: 130 DVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST- 188
             P ++DWREKG VT +K+QG CG+CWAFSAV A+E   ++  GKL+ LS Q LVDCS  
Sbjct: 112 GAPDAMDWREKGCVTEVKNQGACGACWAFSAVGALEAQVKLKTGKLVSLSAQNLVDCSMM 171

Query: 189 -DNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKG 247
             N GC GG M +AF+YII+N G+ +E  YPY  + GTC        AAT SKY +LP  
Sbjct: 172 YGNKGCGGGFMTRAFQYIIDNNGIDSEESYPYMAQNGTC-QYNVSTRAATCSKYVELPYA 230

Query: 248 DEQALLQAVSN-QPVSVCVDASGRAFHFYKSGVL-NADCGNNCDHGVAVVGFGTAEEENG 305
           DE AL  AV+N  PVSV +DA+   F  Y+SGV  +  C    +HGV VVG+GT  E++ 
Sbjct: 231 DEAALKDAVANVGPVSVAIDATQPTFFLYRSGVYDDPRCTQEVNHGVLVVGYGTLNEKD- 289

Query: 306 AKYWLIKNSWGETWGESGYIRILRD-AGLCGIATAASYP 343
             +WL+KNSWGE +G+ GYIR+ R+ A  CGIA+ ASYP
Sbjct: 290 --FWLVKNSWGERFGDGGYIRMSRNHANHCGIASYASYP 326


>gi|313118768|gb|ADR32296.1| C14 cysteine protease [Solanum demissum]
 gi|313118770|gb|ADR32297.1| C14 cysteine protease [Solanum demissum]
          Length = 217

 Score =  257 bits (657), Expect = 5e-66,   Method: Compositional matrix adjust.
 Identities = 119/218 (54%), Positives = 150/218 (68%), Gaps = 8/218 (3%)

Query: 132 PTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD-N 190
           P S+DWR+KG +  +KDQG CGSCWAFSAVAA+E I  I  G LI LSEQ+LVDC    N
Sbjct: 2   PVSVDWRDKGVLVGVKDQGSCGSCWAFSAVAAMESINAIVTGNLISLSEQELVDCDKSYN 61

Query: 191 HGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQ 250
            GC GGLMD AFE++I N G+ TE DYPY+   G CD  ++ A   TI  YED+P  +E+
Sbjct: 62  EGCDGGLMDYAFEFVINNGGIDTEEDYPYKERNGVCDQYRKNAKVVTIDSYEDVPVNNEK 121

Query: 251 ALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWL 310
           AL +AV++QPVS+ ++A GR F  YKSG+    CG   DHGV V G+GT   ENG  YW+
Sbjct: 122 ALQKAVAHQPVSIALEAGGRDFQHYKSGIFTGKCGTAVDHGVVVAGYGT---ENGMDYWI 178

Query: 311 IKNSWGETWGESGYIRILRD----AGLCGIATAASYPV 344
           ++NSWG  WGE GY+R+ R+    +GLCG+A   SYPV
Sbjct: 179 VRNSWGAKWGEKGYLRVQRNVASSSGLCGLAIEPSYPV 216


>gi|17062058|gb|AAL34984.1|AF320565_1 cathepsine L-like cysteine protease [Rhodnius prolixus]
          Length = 316

 Score =  257 bits (657), Expect = 5e-66,   Method: Compositional matrix adjust.
 Identities = 142/323 (43%), Positives = 194/323 (60%), Gaps = 25/323 (7%)

Query: 36  PSIVEKHEQWMA---QHGRTYKDELEKAMRLNIFKQNLEYIEKANKE---GNRTYKLGTN 89
           PS     ++W+A    HG+ Y+++ E+  R+ +F  N + I++ N +   G  +YK+  N
Sbjct: 4   PSFDIDPQEWLAFKAMHGKNYRNQFEEIFRMKVFIDNKKKIDEHNAKYELGEASYKMKMN 63

Query: 90  EFSDLTNEEFRALYTGYNRPVPSVSRQSS--RPSTFKYQNVTDVPTSIDWREKGAVTHIK 147
              DL   EF+AL  G+ +  P+  R      PS        ++P S+DWR++GAVT +K
Sbjct: 64  HLGDLMVHEFKALMNGFKK-TPNAERNGKIYVPSN------ENLPKSVDWRQRGAVTPVK 116

Query: 148 DQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD--NHGCSGGLMDKAFEYI 205
           DQG CGSCW+FSA  ++EG   +  G+L+ LSEQ LVDCS    N GC GGLM++AF+Y+
Sbjct: 117 DQGHCGSCWSFSATGSLEGQLFLKTGRLVSLSEQNLVDCSKTYGNSGCEGGLMNQAFQYV 176

Query: 206 IENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSN-QPVSVC 264
            +NKG+ TEA YPY   E  C   KE  V  T   Y D+ +  E+ L  AV+   P+SV 
Sbjct: 177 RDNKGIDTEASYPYEARENNC-RFKEDKVGGTDKGYVDILEASEKDLQSAVATVGPISVR 235

Query: 265 VDASGRAFHFYKSGVLNAD-CG-NNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGES 322
           +DAS  +F FY  GV     C  +  DHGV  VG+GT   ENG  YWL+KNSWG +WGES
Sbjct: 236 IDASHESFQFYSEGVYKEQYCSPSQLDHGVLTVGYGT---ENGQDYWLVKNSWGPSWGES 292

Query: 323 GYIRILRD-AGLCGIATAASYPV 344
           GYI+I R+    CGIA+ ASYPV
Sbjct: 293 GYIKIARNHKNHCGIASMASYPV 315


>gi|332260024|ref|XP_003279085.1| PREDICTED: cathepsin L1 isoform 3 [Nomascus leucogenys]
 gi|441593306|ref|XP_004087072.1| PREDICTED: cathepsin L1 [Nomascus leucogenys]
 gi|441593309|ref|XP_004087073.1| PREDICTED: cathepsin L1 [Nomascus leucogenys]
          Length = 333

 Score =  257 bits (657), Expect = 5e-66,   Method: Compositional matrix adjust.
 Identities = 145/342 (42%), Positives = 204/342 (59%), Gaps = 20/342 (5%)

Query: 13  MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEY 72
           M   +IL   C   + S     + S+  +  +W A H R Y    E+  R  ++++N++ 
Sbjct: 1   MNPTLILAAFCLG-IASATLTFDHSLEAQWTKWKAMHNRLYGMN-EEGWRRAVWEKNMKM 58

Query: 73  IEKAN---KEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
           IE+ N   +EG  ++ +  N F D+T+EEFR +  G+    P   +    P  +      
Sbjct: 59  IEQHNQEYREGKHSFTMAMNAFGDMTSEEFRQVMNGFQNRKPRKGKVFQEPLFY------ 112

Query: 130 DVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCS-- 187
           + P S+DWREKG VT +K+QGQCGSCWAFSA  A+EG      GKL+ LSEQ LVDCS  
Sbjct: 113 EAPRSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSGP 172

Query: 188 TDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKG 247
             N GC+GGLMD AF+Y+ +N GL +E  YPY   E +C    + +V A  + + D+PK 
Sbjct: 173 QGNEGCNGGLMDYAFQYVQDNGGLDSEESYPYEATEESCKYNPKYSV-ANDTGFVDIPK- 230

Query: 248 DEQALLQAVSN-QPVSVCVDASGRAFHFYKSGV-LNADCGN-NCDHGVAVVGFG-TAEEE 303
            E+AL++AV+   P+SV VDA  ++F FYK G+    DC + + DHGV VVG+G  + E 
Sbjct: 231 QEKALMKAVATVGPISVAVDAGHQSFQFYKEGIYFEPDCSSEDMDHGVLVVGYGFESTES 290

Query: 304 NGAKYWLIKNSWGETWGESGYIRILRD-AGLCGIATAASYPV 344
           +  KYWL+KNSWGE WG  GYI++ +D    CGIA+AASYP 
Sbjct: 291 DNNKYWLVKNSWGEEWGMGGYIKMAKDRRNHCGIASAASYPT 332


>gi|21489677|gb|AAM55195.1|AF412313_1 cathepsin L cysteine protease [Haemonchus contortus]
 gi|21483192|gb|AAL14224.1| cathepsin L [Haemonchus contortus]
          Length = 354

 Score =  257 bits (656), Expect = 6e-66,   Method: Compositional matrix adjust.
 Identities = 145/313 (46%), Positives = 193/313 (61%), Gaps = 14/313 (4%)

Query: 41  KHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKE---GNRTYKLGTNEFSDLTNE 97
           K + +    G++Y+ + E    +  F +N+ +IE+ NKE   G +T+++G NE +DL   
Sbjct: 46  KWDDYKETFGKSYEPDEENDY-MEAFVKNVIHIEEHNKEHRLGRKTFEMGLNEIADLPFS 104

Query: 98  EFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWA 157
           ++R L  GY           S  + F       +P S+DWRE+G VT +K+QG CGSCWA
Sbjct: 105 QYRKL-NGYRMRRQFGDSLQSNGTKFLVPFNVQIPESVDWREEGLVTPVKNQGMCGSCWA 163

Query: 158 FSAVAAVEGITQITRGKLIELSEQQLVDCSTD--NHGCSGGLMDKAFEYIIENKGLATEA 215
           FS+  A+EG      GKL+ LSEQ LVDCST   NHGC+GGLMD AFEYI EN G+ TE 
Sbjct: 164 FSSTGALEGQHARATGKLVSLSEQNLVDCSTKYGNHGCNGGLMDLAFEYIKENHGVDTED 223

Query: 216 DYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQ-PVSVCVDASGRAFHF 274
            YPY   E  C + K  AV A    + DLP+GDE+AL +AV+ Q P+S+ +DA  R+F  
Sbjct: 224 SYPYVGRETKC-HFKRNAVGADDKGFVDLPEGDEEALKKAVATQGPISIAIDAGHRSFQL 282

Query: 275 YKSGV-LNADCGN-NCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRDA- 331
           YK GV  + +C +   DHGV +VG+GT  E     YWL+KNSWG TWGE GYIRI R+  
Sbjct: 283 YKKGVYFDEECSSEELDHGVLLVGYGTDPE--AGDYWLVKNSWGPTWGEKGYIRIARNRN 340

Query: 332 GLCGIATAASYPV 344
             CG+AT ASYP+
Sbjct: 341 NHCGVATKASYPL 353


>gi|121543825|gb|ABM55577.1| putative cathepsin L-like protease [Maconellicoccus hirsutus]
          Length = 341

 Score =  257 bits (656), Expect = 6e-66,   Method: Compositional matrix adjust.
 Identities = 147/345 (42%), Positives = 197/345 (57%), Gaps = 16/345 (4%)

Query: 11  IPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNL 70
           +  F  +  V+   S  V+  S ++  I E+ E +  Q  + Y  E+E+  R+ +F  N 
Sbjct: 1   MKAFAFLCCVLIYHSNSVTAVSFNDL-IAEEWELFKTQFSKAYNTEIEEKFRMKVFMDNK 59

Query: 71  EYIEKANK---EGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQN 127
             I + NK    G  +Y+L  N F DL + EF     GY   +  V+       TF    
Sbjct: 60  HKIARHNKLFQNGEVSYELEMNHFGDLLHHEFVKTVNGYRHSLRRVTGDEIDSVTFIPAY 119

Query: 128 VTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCS 187
              VP S+DWR +GAVT +K+QGQCGSCWAFS   ++EG       +L  LSEQ L+DCS
Sbjct: 120 NVTVPDSVDWRTEGAVTEVKNQGQCGSCWAFSTTGSLEGQHFRNTKQLTSLSEQNLIDCS 179

Query: 188 TD--NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLP 245
               N+GCSGGLMD AF YI  NKG+ TE  YPY   +  C   K +   AT   + D+P
Sbjct: 180 GKYGNNGCSGGLMDNAFAYIKSNKGIDTEQSYPYEGIDDKC-RYKPQESGATDKGFVDIP 238

Query: 246 KGDEQALLQAVSN-QPVSVCVDASGRAFHFYKSGV-LNADCGN---NCDHGVAVVGFGTA 300
           +GDE+ L  AV+   P+SV +DAS ++F FYK GV  +  CGN   + DHGV  VG+GT 
Sbjct: 239 QGDEEKLKLAVATVGPISVAIDASHQSFQFYKKGVYYDKGCGNGEEDLDHGVLAVGYGT- 297

Query: 301 EEENGAKYWLIKNSWGETWGESGYIRILRDA-GLCGIATAASYPV 344
             ENG  YWL+KNSWG+ WG  GYI++ R+    CGIAT+ASYP+
Sbjct: 298 --ENGKDYWLVKNSWGKRWGLDGYIKMARNKHNHCGIATSASYPL 340


>gi|348565223|ref|XP_003468403.1| PREDICTED: cathepsin L1-like [Cavia porcellus]
          Length = 333

 Score =  257 bits (656), Expect = 7e-66,   Method: Compositional matrix adjust.
 Identities = 142/344 (41%), Positives = 201/344 (58%), Gaps = 21/344 (6%)

Query: 10  IIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQN 69
           + P FV+  L +     +VS     + ++  + +QW A HGR Y    E+  R  ++++N
Sbjct: 1   MTPSFVLAALCLG----IVSALPKLDQTLDAQWDQWKAAHGRLYGLN-EEGWRRAVWEKN 55

Query: 70  LEYIEKANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQ 126
           L  IE  N E   G  ++ LG N F D+TNEEFR +  G+        +    P   +  
Sbjct: 56  LRMIELHNGEYSQGRHSFTLGMNHFGDMTNEEFRQVMNGFQHQKHKTGKMYQEPLLLQ-- 113

Query: 127 NVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDC 186
               +P S+DWREKG VT +K+QGQCGSCWAFSA  ++EG      G L+ LSEQ LVDC
Sbjct: 114 ----LPKSVDWREKGYVTEVKNQGQCGSCWAFSATGSLEGQMFHKTGNLVSLSEQNLVDC 169

Query: 187 S--TDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDL 244
           S    N GC+GGLMD AF+Y+ +NKGL  E  YPY  ++G C  + E + AA  + + D+
Sbjct: 170 SRPQGNQGCNGGLMDFAFQYVKDNKGLEAEKSYPYVGKDGECKYKPELS-AANDTGFVDV 228

Query: 245 PKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGV-LNADCGN-NCDHGVAVVGFGTAEE 302
           P+ ++       +  P+SV +DA  ++F FYK G+  +  C + + +HGV +VG+GT   
Sbjct: 229 PQREKVVQKALATVGPLSVAIDAGLQSFQFYKEGIYYDPGCSSRDLNHGVLLVGYGTDAS 288

Query: 303 ENG-AKYWLIKNSWGETWGESGYIRILRDA-GLCGIATAASYPV 344
           E G   YWLIKNSWG TWG  GY++I R+    CG+ATAASYP+
Sbjct: 289 ETGKGDYWLIKNSWGTTWGADGYVKIARNRNNHCGVATAASYPL 332


>gi|432936690|ref|XP_004082231.1| PREDICTED: cathepsin L-like [Oryzias latipes]
          Length = 334

 Score =  257 bits (656), Expect = 7e-66,   Method: Compositional matrix adjust.
 Identities = 145/315 (46%), Positives = 194/315 (61%), Gaps = 25/315 (7%)

Query: 45  WMAQHGRTYKDELEKAMRLNIFKQNLEYIEKAN---KEGNRTYKLGTNEFSDLTNEEFRA 101
           W  + GRTY    E+A R   +  N + +   N    +G ++Y+LG   F+D+ NEE++ 
Sbjct: 29  WRLKFGRTYSSPTEEAQRRQTWLNNRKLVLVHNILADQGIKSYRLGMTYFADMENEEYKR 88

Query: 102 LYT-----GYNRPVPSVSRQSSRPSTF-KYQNVTDVPTSIDWREKGAVTHIKDQGQCGSC 155
           L +      +N  +P       R STF +     D+P ++DWR+KG VT +KDQ QCGSC
Sbjct: 89  LISQGCLGSFNASLPR------RGSTFFRLPENKDLPAAVDWRDKGYVTDVKDQKQCGSC 142

Query: 156 WAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD--NHGCSGGLMDKAFEYIIENKGLAT 213
           WAFSA  ++EG T    GKL+ LSEQQLVDCS D  N GC GGLMD AF YI    G+ T
Sbjct: 143 WAFSATGSLEGQTFRKTGKLVSLSEQQLVDCSGDYGNMGCGGGLMDDAFRYIQATGGIDT 202

Query: 214 EADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSN-QPVSVCVDASGRAF 272
           E  YPY  E+G C   K  AV AT + Y D+  GDE AL +AV+   P+SV +DAS  +F
Sbjct: 203 EESYPYEAEDGEC-RYKPDAVGATCTGYVDVSSGDEDALQEAVATIGPISVGIDASHISF 261

Query: 273 HFYKSGVLN-ADCGNN-CDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD 330
             Y+SG+ +   C ++  DHGV  VG+G+   ENG  YWL+KNSWG TWG+ GYI++ ++
Sbjct: 262 QLYESGLYDEPQCSSSELDHGVLAVGYGS---ENGQDYWLVKNSWGLTWGDQGYIKMSKN 318

Query: 331 -AGLCGIATAASYPV 344
            +  CGIATAASYP+
Sbjct: 319 KSNQCGIATAASYPL 333


>gi|149755226|ref|XP_001494409.1| PREDICTED: cathepsin L1-like [Equus caballus]
          Length = 334

 Score =  257 bits (656), Expect = 7e-66,   Method: Compositional matrix adjust.
 Identities = 143/338 (42%), Positives = 202/338 (59%), Gaps = 19/338 (5%)

Query: 17  IILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKA 76
           + L   C   + S     +PS+  +  QW A H R Y    E+  R  ++++N+  IE  
Sbjct: 5   LFLAALCLG-IASAAPKLDPSLDAQWYQWKATHRRLYGVN-EEGWRRAVWEKNMRMIELH 62

Query: 77  NKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPT 133
           N+E   G   + +  N F D+TNEEFR +  G+       +++  +   F      +VP 
Sbjct: 63  NQEYSQGKHGFTMAMNAFGDMTNEEFRQVMNGFQ------NQKHKKGRVFLEPLFLEVPK 116

Query: 134 SIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST--DNH 191
           ++DWREKG VT +K+QG CGSCWAFSA  A+EG      GKL+ LSEQ LVDCS    N 
Sbjct: 117 TVDWREKGYVTPVKNQGPCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRAEGNQ 176

Query: 192 GCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQA 251
           GC+GGLMD AF+Y+ +N GL +E  YPY  +EG   N K +  AA  + Y D+P+  E+A
Sbjct: 177 GCNGGLMDNAFQYVKDNGGLDSEESYPYLAKEGNNCNYKPEYSAANDTGYVDIPQ-KEKA 235

Query: 252 LLQAVSN-QPVSVCVDASGRAFHFYKSGV-LNADCGN-NCDHGVAVVGFG-TAEEENGAK 307
           L++AV+   P+SV +DA   +F FYKSG+  + DC + + DHGV VVG+G    + N  K
Sbjct: 236 LMKAVATVGPISVAIDAGHESFQFYKSGIYYDPDCSSKDLDHGVLVVGYGFEGRDSNNNK 295

Query: 308 YWLIKNSWGETWGESGYIRILRDA-GLCGIATAASYPV 344
           +W++KNSWG  WG +GY+++ +D    CGIATAASYP 
Sbjct: 296 FWIVKNSWGPEWGWNGYVKMAKDQNNHCGIATAASYPT 333


>gi|348514005|ref|XP_003444531.1| PREDICTED: cathepsin L1-like [Oreochromis niloticus]
          Length = 338

 Score =  257 bits (656), Expect = 7e-66,   Method: Compositional matrix adjust.
 Identities = 146/334 (43%), Positives = 206/334 (61%), Gaps = 24/334 (7%)

Query: 25  SQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKE---GN 81
           S V+S   + +P + E    W + H + Y  E E+  R  ++++NL+ IE  N +   G 
Sbjct: 14  SSVLSAPHL-DPQLDEHWNLWKSWHTKKYH-EKEEGWRRMVWEKNLKKIELHNLDHSMGK 71

Query: 82  RTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKG 141
            TY+LG N F D+TNEEFR L  GY       + +  + S F   N  + P S+DWR+KG
Sbjct: 72  HTYRLGMNHFGDMTNEEFRQLMNGYKHK----AERKVKGSLFLEPNFLEAPRSLDWRDKG 127

Query: 142 AVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD--NHGCSGGLMD 199
            VT +KDQGQCGSCWAFSA  A+EG      GK+++LSEQ LV+CS    N GC+GGLMD
Sbjct: 128 YVTPVKDQGQCGSCWAFSATGALEGQQFRKTGKMVQLSEQNLVECSRPEGNEGCNGGLMD 187

Query: 200 KAFEYIIENKGLATEADYPYRHEEGTCDNQK----EKAVAATISKYEDLPKGDEQALLQA 255
           +AF+Y+ +N+GL +E  YPY    GT D+QK     +  A   + + D+  G E AL++A
Sbjct: 188 QAFQYVKDNQGLDSEESYPYL---GT-DDQKCHYDPRYNAVNDTGFVDIKSGSEHALMKA 243

Query: 256 VSN-QPVSVCVDASGRAFHFYKSGV-LNADCGN-NCDHGVAVVGFG-TAEEENGAKYWLI 311
           V+   P+SV +DA   +F FY+SG+    +C +   DHGV +VG+G   E+ +G KYW++
Sbjct: 244 VTAVGPISVAIDAGHESFQFYQSGIYYEPECSSEELDHGVLLVGYGFEGEDVDGKKYWIV 303

Query: 312 KNSWGETWGESGYIRILRD-AGLCGIATAASYPV 344
           KNSW E WG+ GY+ + +D    CGIATAASYP+
Sbjct: 304 KNSWSEKWGDKGYVYMAKDRQNHCGIATAASYPL 337


>gi|309380130|gb|ADO65978.1| cathepsin L [Eriocheir sinensis]
 gi|309380134|gb|ADO65980.1| cathepsin L [Eriocheir sinensis]
          Length = 325

 Score =  257 bits (656), Expect = 7e-66,   Method: Compositional matrix adjust.
 Identities = 139/312 (44%), Positives = 194/312 (62%), Gaps = 20/312 (6%)

Query: 43  EQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKE---GNRTYKLGTNEFSDLTNEEF 99
           +Q+ A++G+ Y+   E + R ++++QN E+I   N++   G  ++ L  N+F D+T EE 
Sbjct: 23  QQFKARYGKQYRSTKEDSYRQSVYEQNQEFINSHNEQYENGLVSFTLAMNQFGDMTTEEI 82

Query: 100 RALYTGYNRPVPSVSRQSSRPSTFKYQNVTD-VPTSIDWREKGAVTHIKDQGQCGSCWAF 158
            A   G+      +S     P    YQ + D +P ++DWR+KGAVT +KDQ  CGSCWAF
Sbjct: 83  NAAMNGF------LSAGKKVPRGTMYQPLVDELPDTVDWRDKGAVTPVKDQKACGSCWAF 136

Query: 159 SAVAAVEGITQITRGKLIELSEQQLVDCSTD--NHGCSGGLMDKAFEYIIENKGLATEAD 216
           SA  ++EG   ++ GKL+ LSEQ LVDCS    N GC GGLMD AF YI +N G+ TE  
Sbjct: 137 SATGSLEGQHFLSTGKLVSLSEQNLVDCSDKYGNFGCGGGLMDNAFRYIKDNNGIDTEES 196

Query: 217 YPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQ-PVSVCVDASGRAFHFY 275
           YPY  + G C    +  V AT+S Y D+  G E  L +AV+ + PVSV +DAS   FHFY
Sbjct: 197 YPYEAKNGPCRFNSDN-VGATLSSYVDIQHGSEDDLQKAVAEKGPVSVAIDASTSTFHFY 255

Query: 276 KSGV-LNADCGNN-CDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRDA-G 332
             G+  +  C ++  DHGV  VG+GT   ++ + YWL+KNSW ETWG+SGYI++ R+   
Sbjct: 256 SRGIYYDEKCSSSFLDHGVLAVGYGT---DDSSDYWLVKNSWNETWGDSGYIKMSRNRNN 312

Query: 333 LCGIATAASYPV 344
            CGIA+ ASYPV
Sbjct: 313 NCGIASQASYPV 324


>gi|392873948|gb|AFM85806.1| cathepsin L [Callorhinchus milii]
          Length = 338

 Score =  256 bits (655), Expect = 7e-66,   Method: Compositional matrix adjust.
 Identities = 141/342 (41%), Positives = 211/342 (61%), Gaps = 15/342 (4%)

Query: 13  MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEY 72
           M +  +++  C +  ++  S+ +P +    EQW + HG++Y ++ E+  R  +++++L  
Sbjct: 1   MRLPFVVLSLCLAGGLAAPSL-DPGLDTHWEQWKSWHGKSY-EQKEETWRRMVWEKHLRV 58

Query: 73  IEKANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
           IE  N E   G  +++LG N F D+ NEEFR L  GY         Q S    F   N  
Sbjct: 59  IEIHNLEHSLGKHSFRLGMNHFGDMPNEEFRQLMNGYKYKQTHKKLQGSH---FLEPNFL 115

Query: 130 DVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD 189
           +VP  +DWR++G VT +KDQGQCGSCWAFS   A+EG      G+L+ LSEQ LV+CS  
Sbjct: 116 EVPKHVDWRDEGYVTPVKDQGQCGSCWAFSTTGALEGQHFRRTGQLVSLSEQNLVECSKP 175

Query: 190 --NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKG 247
             N GC+GGLMD+AF+Y+ +N G+ +E  YPY   + T  +   +  AA  + + D+P G
Sbjct: 176 EGNEGCNGGLMDQAFQYVKDNGGIDSEDSYPYVGTDDTPCHYNPQYNAANDTGFVDIPSG 235

Query: 248 DEQALLQAVSN-QPVSVCVDASGRAFHFYKSGV-LNADCGN-NCDHGVAVVGFGTAEEE- 303
            E+AL++A++   PVSV +DA   +F FY+SG+   A+C + + DHGV VVG+G  + + 
Sbjct: 236 KERALMKAIAAVGPVSVAIDAGHTSFQFYQSGIYFEAECSSTDLDHGVLVVGYGVEKRDT 295

Query: 304 NGAKYWLIKNSWGETWGESGYIRILRDA-GLCGIATAASYPV 344
           +G KYW++KNSW E  G++GYI + +D    CGIATAASYP+
Sbjct: 296 DGKKYWIVKNSWSEKLGQNGYILMAKDKDNHCGIATAASYPL 337


>gi|296082368|emb|CBI21373.3| unnamed protein product [Vitis vinifera]
          Length = 245

 Score =  256 bits (655), Expect = 8e-66,   Method: Compositional matrix adjust.
 Identities = 122/220 (55%), Positives = 157/220 (71%), Gaps = 9/220 (4%)

Query: 131 VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD- 189
           +P S+DWRE GAV  +KDQ  CGSCWAFS VAAVEGI QI  G+LI LSEQ+LVDC T+ 
Sbjct: 6   LPESVDWRETGAVNPVKDQRSCGSCWAFSTVAAVEGINQIVTGELISLSEQELVDCDTEY 65

Query: 190 NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDE 249
           + GC+GGLMD AF++II+N GL TE DYPY   +G C+   + +   +I  YED+P  DE
Sbjct: 66  DMGCNGGLMDYAFDFIIKNGGLDTEKDYPYTGFDGECNLSGKSSKVVSIDGYEDVPPFDE 125

Query: 250 QALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYW 309
           +AL +AV++QPVSV V+A GRA   Y SG+   +CG   DHG+  VG+GT   ENG  YW
Sbjct: 126 KALQKAVAHQPVSVAVEAGGRALQLYVSGIFTGECGTALDHGIVAVGYGT---ENGTDYW 182

Query: 310 LIKNSWGETWGESGYIRILRD-----AGLCGIATAASYPV 344
           +++NSWG +WGE+GYIR+ R+     +G CGIA  ASYP+
Sbjct: 183 IVRNSWGSSWGENGYIRMERNMADAFSGKCGIAMEASYPI 222


>gi|223646726|gb|ACN10121.1| Cathepsin L1 precursor [Salmo salar]
 gi|223672581|gb|ACN12472.1| Cathepsin L1 precursor [Salmo salar]
          Length = 338

 Score =  256 bits (655), Expect = 8e-66,   Method: Compositional matrix adjust.
 Identities = 145/341 (42%), Positives = 202/341 (59%), Gaps = 18/341 (5%)

Query: 15  VIIILVITCASQVVSGRSMHEPSIVEKHEQ-WMAQHGRTYKDELEKAMRLNIFKQNLEYI 73
           + + +++ C S V +       S +E H   W   H ++Y  E E+  R  ++++NL+ I
Sbjct: 4   LYLAVLVLCVSAVCAAPRF--DSQLEDHWHLWKNWHSKSYH-ESEEGWRRMVWEKNLKKI 60

Query: 74  EKANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTD 130
           E  N E   G  +Y+LG N F D+TNEEFR    GY +     + +  + S F   N   
Sbjct: 61  EMHNLEHTMGKHSYRLGMNHFGDMTNEEFRQTMNGYKQ----TTERKFKGSLFMEPNYLQ 116

Query: 131 VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD- 189
            P ++DWREKG VT +KDQG CGSCWAFS   A+EG      GKL+ LSEQ LVDCS   
Sbjct: 117 APKAVDWREKGYVTPVKDQGSCGSCWAFSTTGAMEGQQFRKTGKLVSLSEQNLVDCSRPE 176

Query: 190 -NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGD 248
            N GC+GGLMD+AF+YI +N GL TE  YPY   +    + K +   A  + + D+P G 
Sbjct: 177 GNEGCNGGLMDQAFQYIQDNAGLDTEESYPYVGTDEDPCHYKPEFSGANETGFVDIPSGK 236

Query: 249 EQALLQAVSN-QPVSVCVDASGRAFHFYKSGV-LNADCGN-NCDHGVAVVGFG-TAEEEN 304
           E A+++AV+   PVSV +DA   +F FY+ G+    +C +   DHGV VVG+G   E+ +
Sbjct: 237 EHAMMKAVAAVGPVSVAIDAGHESFQFYEFGIYYEKECSSEELDHGVLVVGYGFEGEDVD 296

Query: 305 GAKYWLIKNSWGETWGESGYIRILRD-AGLCGIATAASYPV 344
           G KYW++KNSW E WG+ GYI + +D    CGIATA+SYP+
Sbjct: 297 GKKYWIVKNSWSEKWGDKGYIYMAKDRKNHCGIATASSYPL 337


>gi|426216524|ref|XP_004002512.1| PREDICTED: cathepsin S isoform 1 [Ovis aries]
          Length = 331

 Score =  256 bits (655), Expect = 8e-66,   Method: Compositional matrix adjust.
 Identities = 145/333 (43%), Positives = 206/333 (61%), Gaps = 20/333 (6%)

Query: 20  VITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKE 79
           ++ C+S +   +   +P++    + W   +G+ Y+++ E+  R  I+++NL+ +   N E
Sbjct: 8   LLLCSSAMA--QVHRDPTLDHHWDLWKKTYGKQYEEKNEEVARRLIWEKNLKTVMLHNLE 65

Query: 80  ---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSID 136
              G  +Y+LG N   D+T+EE  +  +     VPS   Q  R  T+K      +P S+D
Sbjct: 66  HSMGMHSYELGMNHLGDMTSEEVISSMSSLR--VPS---QWPRNVTYKSSPNQKLPDSLD 120

Query: 137 WREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST---DNHGC 193
           WREKG VT +K QG CGSCWAFSAV A+E   ++  GKL+ LS Q LVDCST    N GC
Sbjct: 121 WREKGCVTEVKYQGACGSCWAFSAVGALEAQVKLKTGKLVSLSAQNLVDCSTVKYGNKGC 180

Query: 194 SGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALL 253
           +GG M +AF+YII+N G+ +EA YPY+  +G C     K  AAT S+Y +LP G E+AL 
Sbjct: 181 NGGFMTEAFQYIIDNNGIDSEASYPYKAMDGRCQ-YDVKNRAATCSRYIELPFGSEEALK 239

Query: 254 QAVSNQ-PVSVCVDASGRAFHFYKSGV-LNADCGNNCDHGVAVVGFGTAEEENGAKYWLI 311
           +AV+N+ PVSV +DA   +F  YK+GV  +  C  N +HGV VVG+G+    NG  YWL+
Sbjct: 240 EAVANKGPVSVGIDAKQTSFFLYKTGVYYDPSCTQNVNHGVLVVGYGSL---NGKDYWLV 296

Query: 312 KNSWGETWGESGYIRILRDAG-LCGIATAASYP 343
           KNSWG  +G+ GYIR+ R++G  CGIA   SYP
Sbjct: 297 KNSWGLNFGDQGYIRMARNSGNHCGIANFPSYP 329


>gi|402770503|gb|AFQ98386.1| cathepsin L [Rhipicephalus microplus]
          Length = 332

 Score =  256 bits (655), Expect = 8e-66,   Method: Compositional matrix adjust.
 Identities = 144/340 (42%), Positives = 205/340 (60%), Gaps = 20/340 (5%)

Query: 16  IIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEK 75
           ++ L + CA   V+  +  +  +  + E +   H +TY+  +E+ +R  IF +N   I K
Sbjct: 1   MLRLSVLCAIVAVTVAASSQEILRTQWEAFKTTHKKTYQSHMEELLRFKIFTENSLIIAK 60

Query: 76  ANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTD-- 130
            N +   G  +YKLG N+F DL   EF  ++ G++      +R++   +     NV D  
Sbjct: 61  HNAKYAKGLVSYKLGMNQFGDLLAHEFARIFNGHHG-----TRKTGGSTFLPPANVNDSS 115

Query: 131 VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD- 189
           +P  +DWR+KGAVT +KDQGQCGSCWAFSA  ++EG   +  G+L+ LSEQ LVDCS   
Sbjct: 116 LPKVVDWRKKGAVTPVKDQGQCGSCWAFSATGSLEGRHFLKNGELVSLSEQNLVDCSQSF 175

Query: 190 -NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGD 248
            N+GC GGLM+ AF+YI EN G+ TE  YPY   +G C  +KE  V AT + Y ++  G 
Sbjct: 176 GNNGCEGGLMEDAFKYIKENDGIDTEKSYPYEAVDGECRFKKED-VGATDTGYVEIKAGS 234

Query: 249 EQALLQAVSN-QPVSVCVDASGRAFHFYKSGVLN-ADCGN-NCDHGVAVVGFGTAEEENG 305
           E  L +AV+   P+SV +DAS  +F  Y  GV +  +C + + DHGV VVG+G    + G
Sbjct: 235 EDDLKKAVATVGPISVAIDASHSSFQLYSEGVYDEPECSSEDLDHGVLVVGYGV---KGG 291

Query: 306 AKYWLIKNSWGETWGESGYIRILRD-AGLCGIATAASYPV 344
            KYWL+KNSW E+WG+ GYI + RD    CGIA+ ASYP+
Sbjct: 292 KKYWLVKNSWAESWGDQGYILMSRDNNNQCGIASQASYPL 331


>gi|157093357|gb|ABV22333.1| cysteine protease 1 [Noctiluca scintillans]
          Length = 338

 Score =  256 bits (655), Expect = 9e-66,   Method: Compositional matrix adjust.
 Identities = 138/324 (42%), Positives = 188/324 (58%), Gaps = 14/324 (4%)

Query: 27  VVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKL 86
           V +G  +  P  +     +  ++G+ Y    E A+R  IFK N++ I   N   N T+ L
Sbjct: 12  VAAGHEVPPPDYMMMFNNFKTKYGKVYNGINEDAVRFGIFKANVDIIYATNAR-NLTFAL 70

Query: 87  GTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHI 146
           G NEF+DLT EEF A YTG  +P  S+     R ST +Y N   + +S+DW  +G VT +
Sbjct: 71  GVNEFTDLTQEEFAASYTGL-KPA-SLWSGLPRLSTHEY-NGAPLASSVDWTTQGVVTPV 127

Query: 147 KDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTDNHGCSGGLMDKAFEYII 206
           K+QGQCGSCW+FS   A+EG   ++ G L+ LSEQQ  DC T + GC+GG MD AF +  
Sbjct: 128 KNQGQCGSCWSFSTTGALEGAWALSTGNLVSLSEQQFEDCDTTDSGCNGGWMDNAFSFAK 187

Query: 207 ENKGLATEADYPYRHEEGTCD--NQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVC 264
           +N  + TE  YPY   +GTC+    +       +  Y D+    EQA++ AV+ QPVS+ 
Sbjct: 188 KNS-ICTEGSYPYTATDGTCNLSGCQVGIPQGGVVGYTDVSTDSEQAMMSAVAQQPVSIA 246

Query: 265 VDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGY 324
           ++A   +F  Y SGVL A CG   DHGV  VG+G+   E G  YW +KNSWG +WGE GY
Sbjct: 247 IEADQYSFQLYSSGVLTASCGTRLDHGVLAVGYGS---EAGTDYWKVKNSWGSSWGEQGY 303

Query: 325 IRILR---DAGLCG-IATAASYPV 344
           +R+ R    AG CG +A   SYPV
Sbjct: 304 VRLQRGKGGAGECGLLAGPPSYPV 327


>gi|356515116|ref|XP_003526247.1| PREDICTED: LOW QUALITY PROTEIN: vignain-like [Glycine max]
          Length = 333

 Score =  256 bits (654), Expect = 1e-65,   Method: Compositional matrix adjust.
 Identities = 145/340 (42%), Positives = 188/340 (55%), Gaps = 48/340 (14%)

Query: 41  KHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFR 100
           + ++W+  +G  Y+D+ E  +R  I++ N+EYI    K    +Y L  N+F+DLTNEEF 
Sbjct: 4   RFDRWLKXNGXNYEDKEEWEIRFVIYQANVEYI-GCKKSQKNSYNLTDNKFADLTNEEFV 62

Query: 101 ALYTGY-NRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCG------ 153
           + Y G+  R +P         + FKY    ++P S DWR++GAVT IKDQG CG      
Sbjct: 63  STYLGFATRLIPH--------TRFKYHEHGNLPXSKDWRKEGAVTDIKDQGNCGKHSTWF 114

Query: 154 -----------------------SCWAFSAVAAVEGITQITRGKLIELSEQQLV--DCST 188
                                  S WAFS VAAVE I +I  GKL+ LSEQ+LV  D + 
Sbjct: 115 SPEISHNLRNILTNYNTINFRDISFWAFSVVAAVERINKIKSGKLVSLSEQELVDYDVAN 174

Query: 189 DNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGD 248
            N GC GGLMD  F +I +N GL T  DYPY   +G+C+ +K    A  IS YE  P  D
Sbjct: 175 KNQGCEGGLMDTTFAFIKKNGGLTTSKDYPYEGVDGSCNKEKALHHAVNISGYERAPSKD 234

Query: 249 EQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKY 308
           E  L  A +NQP+SV +DA G AF  Y  GV +  CG   +HGV +VG+     +   KY
Sbjct: 235 EAMLKVAAANQPISVAIDAGGYAFQLYSQGVFSGVCGKKLNHGVTIVGYDKGTFD---KY 291

Query: 309 WLIKNSWGETWGESGYIRILRD----AGLCGIATAASYPV 344
             +KNS G  WGESGYIR+ RD    AG CGIA  ASYP+
Sbjct: 292 RTVKNSXGADWGESGYIRMKRDAFDKAGTCGIAMKASYPL 331


>gi|426219875|ref|XP_004004143.1| PREDICTED: cathepsin L1 [Ovis aries]
          Length = 333

 Score =  256 bits (654), Expect = 1e-65,   Method: Compositional matrix adjust.
 Identities = 146/338 (43%), Positives = 201/338 (59%), Gaps = 20/338 (5%)

Query: 17  IILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKA 76
           ++L + C   + S     + S+  + E W A H + Y D  E+  R  ++K+N++ IE  
Sbjct: 5   LLLTVLCLG-IASAAPKFDHSLNTQWELWKAVHRKPY-DLNEEGWRKAVWKKNMKMIELH 62

Query: 77  NKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPT 133
           N+E   G  ++ +  N F DLT+EEFR +  G+ R      +++ +   F       +P 
Sbjct: 63  NQEYSQGKHSFSMAMNAFGDLTSEEFRQMMNGFQR------QENKKGKVFHETIFASIPP 116

Query: 134 SIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD--NH 191
           S+DWREKG VT +K+QG+CGSCWAFS   A+EG      GKL+ LSEQ LVDCS    N 
Sbjct: 117 SVDWREKGYVTPVKNQGKCGSCWAFSTTGALEGQMFRKTGKLVSLSEQNLVDCSQPEGNR 176

Query: 192 GCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQA 251
           GC GGLMD AF+Y+++  GL +E  YPY    GTC N   K  AA  + + DLPK  E A
Sbjct: 177 GCHGGLMDNAFQYVLDVGGLDSEESYPYTGLVGTC-NYNPKNSAANETGFVDLPK-QENA 234

Query: 252 LLQAVSN-QPVSVCVDASGRAFHFYKSGV-LNADC-GNNCDHGVAVVGFG-TAEEENGAK 307
           L++AV+   P+SV VDAS  +F FYKSG+     C   + DHGV VVG+G    + +  K
Sbjct: 235 LMKAVATLGPISVAVDASNPSFQFYKSGIYYEPKCKSESVDHGVLVVGYGFEGADSDDNK 294

Query: 308 YWLIKNSWGETWGESGYIRILRDA-GLCGIATAASYPV 344
           YWL+KNSWG+ WG +GYI++ +D    CGIAT ASYP 
Sbjct: 295 YWLVKNSWGKHWGINGYIKMAKDQNNHCGIATMASYPT 332


>gi|146216002|gb|ABQ10203.1| cysteine protease Cp5 [Actinidia deliciosa]
          Length = 509

 Score =  256 bits (654), Expect = 1e-65,   Method: Compositional matrix adjust.
 Identities = 141/329 (42%), Positives = 193/329 (58%), Gaps = 18/329 (5%)

Query: 30  GRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRT--YKLG 87
           G S+ E  +VE  ++W  +HG+ YK   E   +   F+ NL Y+ + N E   +  + +G
Sbjct: 39  GESIAEERVVELFKKWTEKHGKVYKHGQEVEKKFQNFRDNLRYVMEKNGERGASGGHLVG 98

Query: 88  TNEFSDLTNEEFRALYTGYNRPVPSVS-----RQSSRPSTFKYQNVTDVPTSIDWREKGA 142
            N+F+D++NEEFR +Y    +   S       R+  + +  K     D PTS+DWR+ G 
Sbjct: 99  LNKFADMSNEEFREVYVSKVKKPTSKRMAIERRRQGKAAAAKAVAACDGPTSLDWRKYGI 158

Query: 143 VTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTDNHGCSGGLMDKAF 202
           VT +KDQG CGSCWAFS+  A+EGI  +  G LI LSEQ+LVDC + N GC GG MD AF
Sbjct: 159 VTGVKDQGDCGSCWAFSSTGAIEGINALANGDLISLSEQELVDCDSTNDGCEGGYMDYAF 218

Query: 203 EYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVS 262
           E+++ N G+ TE DYPY  E+GTC+  KE+  A +I  YED+ + +E AL  AV  QP+S
Sbjct: 219 EWVMSNGGIDTETDYPYTGEDGTCNTTKEETKAVSIDGYEDVAE-EESALFCAVLKQPIS 277

Query: 263 VCVDASGRAFHFYKSGVL---NADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETW 319
           V +D     F  Y  G+     +D  ++ DH V VVG+G    E+G +YW+IKNSWG  W
Sbjct: 278 VGIDGGAIDFQLYTGGIYDGDCSDDPDDIDHAVLVVGYGA---ESGEEYWIIKNSWGTDW 334

Query: 320 GESGYIRILR----DAGLCGIATAASYPV 344
           G  GY  I R    D G+C I   ASYP 
Sbjct: 335 GMKGYAYIKRNTSKDYGVCAINAMASYPT 363


>gi|449673497|ref|XP_002169904.2| PREDICTED: cathepsin L-like [Hydra magnipapillata]
          Length = 325

 Score =  256 bits (654), Expect = 1e-65,   Method: Compositional matrix adjust.
 Identities = 146/337 (43%), Positives = 198/337 (58%), Gaps = 22/337 (6%)

Query: 14  FVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYI 73
            + + L+  C   ++  + + E S       W   H + Y  E E+ +R  I+K N+  I
Sbjct: 4   LIFVSLITLCFGYIIE-KPIRESSWY----VWKMAHNKAYSHESEENVRYAIWKDNMNRI 58

Query: 74  EKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPT 133
            + N + ++   L  N F D+TN EFRA   G       +  +    STF   + T  P 
Sbjct: 59  TEYNSK-SKNVILRMNHFGDMTNTEFRAKMNGL------LLHKHQNGSTFLVPSHTAAPD 111

Query: 134 SIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD--NH 191
           ++DWR +G VT +K+QGQCGSCWAFS+  A+EG      G+L+ LSEQ LVDCSTD  N+
Sbjct: 112 AVDWRSEGYVTPVKNQGQCGSCWAFSSTGALEGQHFKKTGRLVSLSEQNLVDCSTDYGNN 171

Query: 192 GCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQA 251
           GC+GGLMD AF YI  N G+ TE  YPY  ++GTC   K  ++ A  + + D+P+GDE A
Sbjct: 172 GCNGGLMDNAFSYIKANGGIDTETGYPYEGQDGTCRYSKS-SIGADDTGFVDIPEGDEDA 230

Query: 252 LLQAVSN-QPVSVCVDASGRAFHFYKSGVLN-ADCGNNC-DHGVAVVGFGTAEEENGAKY 308
           L QAV+   PVSV +DAS  +F FY SGV +   C  +  DHGV VVG+GT   +NG  Y
Sbjct: 231 LKQAVATVGPVSVAIDASHMSFQFYHSGVYDEPQCSPSALDHGVLVVGYGT---DNGKDY 287

Query: 309 WLIKNSWGETWGESGYIRILR-DAGLCGIATAASYPV 344
           WL+KNSWG  WG  GYI + R +   CGIA+ ASYP+
Sbjct: 288 WLVKNSWGTGWGTEGYIYMSRNNQNQCGIASKASYPL 324


>gi|402770507|gb|AFQ98388.1| cathepsin L [Rhipicephalus microplus]
          Length = 332

 Score =  256 bits (654), Expect = 1e-65,   Method: Compositional matrix adjust.
 Identities = 145/340 (42%), Positives = 204/340 (60%), Gaps = 20/340 (5%)

Query: 16  IIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEK 75
           ++ L + CA   V+  +  +  +  + E +   H +TY+  +E+ +R  IF +N   I K
Sbjct: 1   MLRLSVLCAIVAVTVAASSQEILRTQWEAFKTTHKKTYQSHMEELLRFKIFTENSLIIAK 60

Query: 76  ANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTD-- 130
            N +   G  +YKLG N+F DL   EF  ++ GY+      SR+S   +     NV D  
Sbjct: 61  HNAKYAKGLVSYKLGMNQFGDLLAHEFARIFNGYHG-----SRKSGGSTFLPPANVNDSS 115

Query: 131 VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD- 189
           +P ++DWR+KGAVT +KDQGQCGSCWAFS   ++EG   +  G+L+ LSEQ LVDCS   
Sbjct: 116 LPKAVDWRKKGAVTPVKDQGQCGSCWAFSTTGSLEGQHFLKNGELVSLSEQNLVDCSQSF 175

Query: 190 -NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGD 248
            N+GC GGLM+ AF+YI  N G+ TE  YPY   +G C  +KE  V AT + Y ++  G 
Sbjct: 176 GNNGCEGGLMEDAFKYIKANDGIDTEKSYPYEAVDGECRFKKED-VGATDTGYVEIKAGC 234

Query: 249 EQALLQAVSN-QPVSVCVDASGRAFHFYKSGVLN-ADCGN-NCDHGVAVVGFGTAEEENG 305
           E  L +AV+   P+SV +DAS  +F  Y  GV +  +C + + DHGV VVG+G    + G
Sbjct: 235 EDDLKKAVATVGPISVAIDASHSSFQLYSEGVYDEPECSSEDLDHGVLVVGYGV---KGG 291

Query: 306 AKYWLIKNSWGETWGESGYIRILRD-AGLCGIATAASYPV 344
            KYWL+KNSW E+WG+ GYI + RD    CGIA+ ASYP+
Sbjct: 292 KKYWLVKNSWAESWGDQGYILMSRDNNNQCGIASQASYPL 331


>gi|424513619|emb|CCO66241.1| predicted protein [Bathycoccus prasinos]
          Length = 396

 Score =  256 bits (654), Expect = 1e-65,   Method: Compositional matrix adjust.
 Identities = 138/333 (41%), Positives = 196/333 (58%), Gaps = 22/333 (6%)

Query: 31  RSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKE---GNRTYKLG 87
           R + E  I +  + W+ ++ +   +  E+  RL IF +N  ++ + N +   G  ++ + 
Sbjct: 61  RVLRESKIEDAFDAWLVKYDKEIANAEERLKRLKIFGENYLFVLEHNAKYVAGKVSHYVE 120

Query: 88  TNEFSDLTNEEFRALYTGYNRPV---PSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVT 144
            N+F+  T EE+R +  G+ + +         +   S ++Y+ V + P SIDW ++G +T
Sbjct: 121 MNKFAAHTREEYRKML-GFKKSLRRKKDSGEAAKDVSLWEYEGV-EAPESIDWVDEGVIT 178

Query: 145 HIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD--NHGCSGGLMDKAF 202
             K+QG CGSCWAFSA+ AVEGI  I  GKL+ LSEQ+LV C+ +  N GC+GGLMD AF
Sbjct: 179 TPKNQGSCGSCWAFSAIGAVEGINAIRTGKLVSLSEQELVSCAREGGNQGCNGGLMDNAF 238

Query: 203 EYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVS 262
           E+I+EN G+ +E  Y Y+     C  +K     A+I  + D+P  DE AL +AVS QPVS
Sbjct: 239 EWIVENGGVDSEKQYQYKASFDDCKTRKTLLHIASIDGFNDVPSNDETALKKAVSQQPVS 298

Query: 263 VCVDASGRAFHFYKSGVLNA-DCGNNCDHGVAVVGFGTAEEENGA-------KYWLIKNS 314
           V ++A  R+F  Y  GV +A DCG   DHGV VVG+G     +         KYW IKNS
Sbjct: 299 VAIEADQRSFQLYGGGVYHAEDCGTQLDHGVLVVGYGIDHNSSNVIIPGATKKYWKIKNS 358

Query: 315 WGETWGESGYIRILRD----AGLCGIATAASYP 343
           W E WGE GYIRI RD    +G+CG+A  ASYP
Sbjct: 359 WSEQWGEGGYIRIARDVESPSGMCGVAEMASYP 391


>gi|402770517|gb|AFQ98393.1| cathepsin L [Rhipicephalus microplus]
          Length = 332

 Score =  256 bits (654), Expect = 1e-65,   Method: Compositional matrix adjust.
 Identities = 144/340 (42%), Positives = 205/340 (60%), Gaps = 20/340 (5%)

Query: 16  IIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEK 75
           ++ L + CA   V+  +  +  +  + E +   H +TY+  +E+ +R  IF +N   I K
Sbjct: 1   MLRLSVLCAIAAVTVAASSQEILRTQWEAFKTTHKKTYQSHMEELLRFKIFTENSLIIAK 60

Query: 76  ANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTD-- 130
            N +   G  +YKLG N+F DL   EF  ++ G++      +R++   S     NV D  
Sbjct: 61  HNAKYAKGLVSYKLGMNQFGDLLAHEFARIFNGHHG-----TRKTGGSSFLPPANVNDSS 115

Query: 131 VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD- 189
           +P  +DWR+KGAVT +KDQGQCGSCWAFSA  ++EG   +  G+L+ LSEQ LVDCS   
Sbjct: 116 LPKVVDWRKKGAVTPVKDQGQCGSCWAFSATGSLEGQHFLKNGELVSLSEQNLVDCSQSF 175

Query: 190 -NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGD 248
            N+GC GGLM+ AF+YI  N G+ TE  YPY+  +G C  +KE  V AT + Y ++  G 
Sbjct: 176 GNNGCEGGLMEDAFKYIKANDGIDTEKSYPYKAVDGECRFKKED-VGATDTGYVEIKAGS 234

Query: 249 EQALLQAVSN-QPVSVCVDASGRAFHFYKSGVLN-ADCGN-NCDHGVAVVGFGTAEEENG 305
           E  L +AV+   P+SV +DAS  +F  Y  GV +  +C + + DHGV VVG+G    + G
Sbjct: 235 EVDLKKAVATVGPISVAIDASHSSFQLYSEGVYDEPECSSEDLDHGVLVVGYGV---KGG 291

Query: 306 AKYWLIKNSWGETWGESGYIRILRD-AGLCGIATAASYPV 344
            KYWL+KNSW E+WG+ GYI + RD    CGIA+ ASYP+
Sbjct: 292 KKYWLVKNSWAESWGDQGYILMSRDNNNQCGIASQASYPL 331


>gi|310975577|gb|ADP55137.1| cathepsin S [Miichthys miiuy]
          Length = 338

 Score =  256 bits (654), Expect = 1e-65,   Method: Compositional matrix adjust.
 Identities = 145/336 (43%), Positives = 194/336 (57%), Gaps = 19/336 (5%)

Query: 17  IILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKA 76
           ++L   CA       +M +  +    E W   HG+TY++ +E   R  ++++NL  I   
Sbjct: 13  LLLFSLCAGAA----AMFDSKLDGHWELWKKMHGKTYRNYVEDESRRELWEKNLVLITMH 68

Query: 77  NKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPT 133
           N E   G  TYKL  N   DLT EE    +     P   + R    PS F   +   VP 
Sbjct: 69  NLEASMGLHTYKLSMNHMGDLTPEEIMQSFATLTPPT-DIQRA---PSPFAGTSGAAVPD 124

Query: 134 SIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD--NH 191
           ++DWREKG VT +K QG CGSCWAFSA  A+EG    T GKL++LS Q LVDCST   NH
Sbjct: 125 TMDWREKGCVTSVKMQGACGSCWAFSAAGALEGQLAKTTGKLVDLSPQNLVDCSTKYGNH 184

Query: 192 GCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQA 251
           GC+GG M KAF+Y+I+N G+ ++A YPY   +    +   K  AA  S+Y  LP+GDE A
Sbjct: 185 GCNGGFMHKAFQYVIDNHGIDSDAAYPYTGRQSQECHYSPKFRAANCSQYSFLPEGDEGA 244

Query: 252 LLQAVSN-QPVSVCVDASGRAFHFYKSGVL-NADCGNNCDHGVAVVGFGTAEEENGAKYW 309
           L QA++   P+SV +DA    F FY SGV  +  C  + +HGV  VG+GT    NG  YW
Sbjct: 245 LKQALATIGPISVAIDARRPRFAFYSSGVYDDPSCSQDVNHGVLAVGYGTL---NGQDYW 301

Query: 310 LIKNSWGETWGESGYIRILRDAG-LCGIATAASYPV 344
           L+KNSWG+T+G++GYIR+ R+    CGIA    YP+
Sbjct: 302 LVKNSWGQTFGDNGYIRMARNKNDQCGIARYGCYPI 337


>gi|356582227|ref|NP_001239115.1| cathepsin L1 precursor [Canis lupus familiaris]
 gi|62899810|sp|Q9GL24.1|CATL1_CANFA RecName: Full=Cathepsin L1; Contains: RecName: Full=Cathepsin L1
           heavy chain; Contains: RecName: Full=Cathepsin L1 light
           chain; Flags: Precursor
 gi|10185020|emb|CAC08809.1| cathepsin L [Canis lupus familiaris]
          Length = 333

 Score =  256 bits (654), Expect = 1e-65,   Method: Compositional matrix adjust.
 Identities = 140/337 (41%), Positives = 202/337 (59%), Gaps = 18/337 (5%)

Query: 17  IILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKA 76
           + L   C   + S     + S+  +  QW A H R Y    E+  R  ++++N++ IE  
Sbjct: 5   LFLTALCLG-IASAAPKFDQSLNAQWYQWKATHRRLYGMN-EEGWRRAVWEKNMKMIELH 62

Query: 77  NKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPT 133
           N+E   G   + +  N F D+TNEEFR +  G+       +++  +   F+     ++P 
Sbjct: 63  NREYSQGKHGFTMAMNAFGDMTNEEFRQVMNGFQ------NQKHKKGKMFQEPLFAEIPK 116

Query: 134 SIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCS--TDNH 191
           S+DWREKG VT +K+QGQCGSCWAFSA  A+EG      GKL+ LSEQ LVDCS    N 
Sbjct: 117 SVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRAQGNE 176

Query: 192 GCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQA 251
           GC+GGLMD AF Y+ +N GL +E  YPY   +    N K +  AA  + + DLP+  E+A
Sbjct: 177 GCNGGLMDNAFRYVKDNGGLDSEESYPYLGRDTETCNYKPECSAANDTGFVDLPQ-REKA 235

Query: 252 LLQAVSN-QPVSVCVDASGRAFHFYKSGV-LNADCGN-NCDHGVAVVGFGTAEEENGAKY 308
           L++AV+   P+SV +DA  ++F FYKSG+  + DC + + DHGV VVG+G    ++  K+
Sbjct: 236 LMKAVATLGPISVAIDAGHQSFQFYKSGIYFDPDCSSKDLDHGVLVVGYGFEGTDSNNKF 295

Query: 309 WLIKNSWGETWGESGYIRILRDA-GLCGIATAASYPV 344
           W++KNSWG  WG +GY+++ +D    CGIATAASYP 
Sbjct: 296 WIVKNSWGPEWGWNGYVKMAKDQNNHCGIATAASYPT 332


>gi|402770501|gb|AFQ98385.1| cathepsin L [Rhipicephalus microplus]
          Length = 332

 Score =  256 bits (654), Expect = 1e-65,   Method: Compositional matrix adjust.
 Identities = 143/340 (42%), Positives = 205/340 (60%), Gaps = 20/340 (5%)

Query: 16  IIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEK 75
           ++ L + CA   V+  +  +  +  + E +   H +TY+  +E+ +R  IF +N   I K
Sbjct: 1   MLRLSVLCAIAAVTVAASSQEILRTQWEAFKTTHKKTYQSHMEELLRFKIFTENSLIIAK 60

Query: 76  ANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTD-- 130
            N +   G  +YKLG N+F DL   EF  ++ G++      +R++   +     NV D  
Sbjct: 61  HNAKYAKGLVSYKLGMNQFGDLLAHEFARIFNGHHG-----TRKTGGSTFLPPANVNDSS 115

Query: 131 VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD- 189
           +P ++DWR+KGAVT +KDQGQCGSCWAFSA  ++EG   +  G+L+ LSEQ LVDCS   
Sbjct: 116 LPKAVDWRKKGAVTPVKDQGQCGSCWAFSATGSLEGQHFLKNGELVSLSEQNLVDCSQSF 175

Query: 190 -NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGD 248
            N+GC GGLM+ AF+YI  N G+ TE  YPY   +G C  +KE  V AT + Y ++  G 
Sbjct: 176 GNNGCEGGLMEDAFKYIKANDGIDTEKSYPYEAVDGECRFKKED-VGATDTGYVEIKAGS 234

Query: 249 EQALLQAVSN-QPVSVCVDASGRAFHFYKSGVLN-ADCGN-NCDHGVAVVGFGTAEEENG 305
           E  L +AV+   P+SV +DAS  +F  Y  GV +  +C + + DHGV VVG+G    + G
Sbjct: 235 EDDLKKAVATVGPISVAIDASHSSFQLYSEGVYDEPECSSEDLDHGVLVVGYGV---KGG 291

Query: 306 AKYWLIKNSWGETWGESGYIRILRD-AGLCGIATAASYPV 344
            KYWL+KNSW E+WG+ GYI + RD    CGIA+ ASYP+
Sbjct: 292 KKYWLVKNSWAESWGDQGYILMSRDNNNQCGIASQASYPL 331


>gi|383410403|gb|AFH28415.1| cathepsin L1 preproprotein [Macaca mulatta]
          Length = 333

 Score =  256 bits (653), Expect = 1e-65,   Method: Compositional matrix adjust.
 Identities = 144/343 (41%), Positives = 205/343 (59%), Gaps = 23/343 (6%)

Query: 12  PMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLE 71
           P F++    +  AS  ++       S+  +  +W A H R Y    E+  R  ++++N++
Sbjct: 3   PTFILAAFCLGIASATLT----FNHSLEAQWTKWKAMHNRLYGMN-EEGWRRAVWEKNMK 57

Query: 72  YIEKANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNV 128
            IE  N+E   G  ++ +  N F D+T+EEFR +  G+       +R+  +   F+    
Sbjct: 58  MIELHNQEYSQGKHSFTMAMNTFGDMTSEEFRQVMNGFQ------NRKPRKGKVFQEPLF 111

Query: 129 TDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCS- 187
            + P S+DWREKG VT +K+QGQCGSCWAFSA  A+EG      GKL+ LSEQ LVDCS 
Sbjct: 112 YEAPRSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSG 171

Query: 188 -TDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPK 246
              N GC+GGLMD AF+Y+ +N GL +E  YPY   E +C    E +V A  + + D+PK
Sbjct: 172 PQGNEGCNGGLMDYAFQYVADNGGLDSEESYPYEATEESCKYNPEYSV-ANDTGFVDIPK 230

Query: 247 GDEQALLQAVSN-QPVSVCVDASGRAFHFYKSGV-LNADCGN-NCDHGVAVVGFG-TAEE 302
             E+AL++AV+   P+SV +DA   +F FYK G+    DC + + DHGV VVG+G  + E
Sbjct: 231 -QEKALMKAVATVGPISVAIDAGHESFMFYKEGIYFEPDCSSEDMDHGVLVVGYGFESTE 289

Query: 303 ENGAKYWLIKNSWGETWGESGYIRILRD-AGLCGIATAASYPV 344
            + +KYWL KNSWGE WG  GYI++ +D    CGIA+AASYP 
Sbjct: 290 SDNSKYWLGKNSWGEEWGMGGYIKMAKDRRNHCGIASAASYPT 332


>gi|21593501|gb|AAM65468.1| cysteine proteinase [Arabidopsis thaliana]
          Length = 376

 Score =  256 bits (653), Expect = 1e-65,   Method: Compositional matrix adjust.
 Identities = 140/321 (43%), Positives = 198/321 (61%), Gaps = 17/321 (5%)

Query: 34  HEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSD 93
           +E  ++  +EQW+ ++G+ Y    EK  R  IFK NL+ IE+ N + NR+Y+ G N+FSD
Sbjct: 33  NEGGVLTMYEQWLVENGKNYNGLGEKERRFKIFKDNLKRIEEHNSDPNRSYERGLNKFSD 92

Query: 94  LTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVT-HIKDQGQC 152
           LT +EF+A Y G      S+S  + R   ++Y+    +P  +DWRE+GAV   +K QG+C
Sbjct: 93  LTADEFQASYLGGKMEKKSLSDVAER---YQYKEGDVLPDEVDWRERGAVVPRVKRQGEC 149

Query: 153 GSCWAFSAVAAVEGITQITRGKLIELSEQQLVDC--STDNHGCSGGLMDKAFEYIIENKG 210
           GSCWAF+A  AVEGI QIT G+L+ LSEQ+L+DC    DN GC+GG    AFE+I EN G
Sbjct: 150 GSCWAFAATGAVEGINQITTGELVSLSEQELIDCDRGNDNFGCAGGGAVWAFEFIKENGG 209

Query: 211 LATEADYPYRHEE-GTCDNQKEKAV-AATISKYEDLPKGDEQALLQAVSNQPVSVCVDAS 268
           + ++  Y Y  E+   C   + K     TI+ +E +P  DE +L +AV+ QP+SV + A+
Sbjct: 210 IVSDEVYGYTGEDTAACKAIEMKTTRVVTINGHEVVPVNDEMSLKKAVAYQPISVMISAA 269

Query: 269 GRAFHFYKSGVLNADCGNNC-DHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRI 327
             +   YKSGV    C N   DH V +VG+GT+ +E    YWLI+NSWG  WGE GY+R+
Sbjct: 270 NMS--DYKSGVYKGACSNLWGDHNVLIVGYGTSSDE--GDYWLIRNSWGPEWGEGGYLRL 325

Query: 328 LRD----AGLCGIATAASYPV 344
            R+     G C +A A  YP+
Sbjct: 326 QRNFHEPTGKCAVAVAPVYPI 346


>gi|7523482|dbj|BAA94210.1| putative cysteine protease [Oryza sativa Japonica Group]
 gi|10800060|dbj|BAB16480.1| putative cysteine protease [Oryza sativa Japonica Group]
          Length = 349

 Score =  256 bits (653), Expect = 1e-65,   Method: Compositional matrix adjust.
 Identities = 136/322 (42%), Positives = 184/322 (57%), Gaps = 27/322 (8%)

Query: 39  VEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEE 98
            +  E+WMA+ G+ Y    EK  R  +F+ N+ +I            L  N+F+DLTN+E
Sbjct: 38  TQMFEEWMAKFGKKYPCHGEKEYRFGVFRDNVRFIRSYRPPAGYNSALRVNQFADLTNDE 97

Query: 99  FRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAF 158
           F + +TG   P P  + +   P          +P  IDWR KGAVT +KDQG CGSCWAF
Sbjct: 98  FVSTHTGAKPPCPKDAPRGVDP--------IWLPCCIDWRYKGAVTDVKDQGACGSCWAF 149

Query: 159 SAVAAVEGITQITRGKLIELSEQQLVDCSTDNHGCSGGLMDKAFEYIIENKGLATEADYP 218
           +AVAA+EG+TQI  GKL  LSEQ+LVDC T + GC+GG  D+AFE +    G+  E+ Y 
Sbjct: 150 AAVAAIEGLTQIRTGKLTPLSEQELVDCDTGSSGCAGGHTDRAFELVAAKGGITAESGYR 209

Query: 219 YRHEEGTCDNQKEKAV---AATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFY 275
           Y    G C  + + A+   AA I  +  +P GDE+ L  AV+ QPV+  +DASG AF FY
Sbjct: 210 YEGYRGKC--RADDALFNHAARIGGHRAVPPGDERQLATAVARQPVTAYIDASGPAFQFY 267

Query: 276 KSGVLNADCGN---------NCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIR 326
            SGV    CG+           +H V +VG+   +  +G KYW+ KNSWG+TWGE GYI 
Sbjct: 268 GSGVFPGPCGSGSGAAAAAPTTNHAVTLVGY-CQDGASGKKYWVAKNSWGKTWGEKGYIL 326

Query: 327 ILRDA----GLCGIATAASYPV 344
           + +D     G CG+A +  YP 
Sbjct: 327 LEKDVASPHGTCGVAVSPFYPT 348


>gi|326430491|gb|EGD76061.1| cathepsin [Salpingoeca sp. ATCC 50818]
          Length = 381

 Score =  256 bits (653), Expect = 1e-65,   Method: Compositional matrix adjust.
 Identities = 133/276 (48%), Positives = 169/276 (61%), Gaps = 22/276 (7%)

Query: 51  RTYKDELEKAMRLNIFKQNLEYIEKANKEGNR---TYKLGTNEFSDLTNEEFRALYTGYN 107
           + Y+   E+A R  IF  NL +I + N E  R   T+ +G N+F+DLTNEE+R LY    
Sbjct: 29  KQYESPEEEARRFAIFADNLAFIARHNAEAARGLHTHTVGVNQFADLTNEEYRQLYL--- 85

Query: 108 RPVPSVSRQSSRPSTFKYQNVTDVPT--SIDWREKGAVTHIKDQGQCGSCWAFSAVAAVE 165
           RP P+      R   +      D P   S+DWR+KGAVT IK+QGQCGSCW+FS   +VE
Sbjct: 86  RPYPTELLGRERQEVW-----LDGPNAGSVDWRQKGAVTPIKNQGQCGSCWSFSTTGSVE 140

Query: 166 GITQITRGKLIELSEQQLVDCSTD--NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEE 223
           G   I  G L+ LSEQQLVDCS    N GC+GGLMD AF+YII N GL TE DYPY   +
Sbjct: 141 GAHAIATGNLVSLSEQQLVDCSGSFGNQGCNGGLMDNAFKYIISNGGLDTEQDYPYTARD 200

Query: 224 GTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNAD 283
           G CD  KE   A +IS Y+D+P+ +E  L  AV   PVSV ++A  ++F  Y SGV +  
Sbjct: 201 GVCDKSKESKHAVSISGYKDVPQNNEDQLAAAVEKGPVSVAIEADQQSFQMYSSGVFSGP 260

Query: 284 CGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETW 319
           CG N DHGV VVG+ +        YW++KNSWG +W
Sbjct: 261 CGTNLDHGVLVVGYTS-------DYWIVKNSWGASW 289


>gi|426219849|ref|XP_004004130.1| PREDICTED: cathepsin L1 isoform 1 [Ovis aries]
 gi|426219851|ref|XP_004004131.1| PREDICTED: cathepsin L1 isoform 2 [Ovis aries]
          Length = 334

 Score =  256 bits (653), Expect = 1e-65,   Method: Compositional matrix adjust.
 Identities = 143/343 (41%), Positives = 205/343 (59%), Gaps = 22/343 (6%)

Query: 12  PMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLE 71
           P F + +L +     V S     +P++     QW A H R Y    E+  R  ++++N +
Sbjct: 3   PSFFLTVLCLG----VASAAPKLDPNLDAHWHQWKATHRRLYGMN-EEGWRRAVWEKNKK 57

Query: 72  YIEKANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNV 128
            I+  N+E   G   + +  N F D+TNEEFR +  G+       +++  +   F+   +
Sbjct: 58  IIDLHNQEYSQGKHGFSMAMNAFGDMTNEEFRQVMNGFQ------NQKRKKGKLFREPLL 111

Query: 129 TDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST 188
            DVP S+DW +KG VT +K+QGQCGSCWAFSA  A+EG      GKL+ LSEQ LVDCS 
Sbjct: 112 IDVPKSVDWTKKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSR 171

Query: 189 D--NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPK 246
              N GC+GGLMD AF+YI EN GL +E  YPY   + +  N K +  AA  + + D+P+
Sbjct: 172 PQGNQGCNGGLMDNAFQYIKENGGLDSEESYPYLATDTSSCNYKPECSAANDTGFVDIPQ 231

Query: 247 GDEQALLQAVSN-QPVSVCVDASGRAFHFYKSGV-LNADCGN-NCDHGVAVVGFG-TAEE 302
             E+AL++AV+   P+SV +DA   +F FYKSG+  + DC + + DHGV VVG+G    +
Sbjct: 232 -REKALMKAVATVGPISVAIDAGHASFQFYKSGIYYDPDCSSKDLDHGVLVVGYGFEGTD 290

Query: 303 ENGAKYWLIKNSWGETWGESGYIRILRDA-GLCGIATAASYPV 344
            N  K+W++KNSWG  WG +GY+++ +D    CGIATAASYP 
Sbjct: 291 SNNNKFWIVKNSWGPEWGWNGYVKMAKDQNNHCGIATAASYPT 333


>gi|115438530|ref|NP_001043562.1| Os01g0613500 [Oryza sativa Japonica Group]
 gi|11034572|dbj|BAB17096.1| cysteine proteinase-like [Oryza sativa Japonica Group]
 gi|113533093|dbj|BAF05476.1| Os01g0613500 [Oryza sativa Japonica Group]
 gi|215697766|dbj|BAG91959.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 360

 Score =  256 bits (653), Expect = 1e-65,   Method: Compositional matrix adjust.
 Identities = 144/322 (44%), Positives = 190/322 (59%), Gaps = 16/322 (4%)

Query: 37  SIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEG-NRTYKLGTNEFSDLT 95
           S+  +HE+WMA+ GR Y D  EKA R+ +F  N E ++ AN+ G +RTY LG N+FSDLT
Sbjct: 38  SMAARHERWMARFGRAYADAAEKARRMEVFAANAERVDAANRAGGDRTYTLGLNQFSDLT 97

Query: 96  NEEFRALYTGYN-RPVPSVSRQSSRP---STFKYQNVTDVPTSIDWREKGAVTHIKDQGQ 151
           ++EF   + GY+  P P   R   R    +     + TDVP S+DWR +GAVT +K+Q  
Sbjct: 98  DDEFAQTHLGYSWAPPPPSHRHGHRAENGTAAAAADDTDVPDSVDWRARGAVTEVKNQRS 157

Query: 152 CGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTDNHGCSGGLMDKAFEYIIENKGL 211
           CGSCWAF+AVAA EG+ Q+  G L+ LSEQQ++DC+   + CSGG +  A  YI  + GL
Sbjct: 158 CGSCWAFAAVAATEGLVQLATGNLVSLSEQQVLDCTGGANTCSGGDVSAALRYIAASGGL 217

Query: 212 ATEADYPYRHEEGTCD----NQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDA 267
            TEA Y Y  ++G C          A A   +++  L  GDE AL    + QPV V V+A
Sbjct: 218 QTEAAYAYGGQQGACRAGGFAAPNSAAAVGGARWARL-YGDEGALQALAAGQPVVVVVEA 276

Query: 268 SGRAFHFYKSGVL--NADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYI 325
           S   F  Y+SGV   +A CG   +H V VV    A  + G +YWL+KN WG  WGE GY+
Sbjct: 277 SEPDFRHYRSGVYAGSAACGRRLNHAVTVV-GYGAAADGGGEYWLVKNQWGTWWGEGGYM 335

Query: 326 RILRD---AGLCGIATAASYPV 344
           R+ R     G CGIAT A YP 
Sbjct: 336 RVARGGAAGGNCGIATYAFYPT 357


>gi|18407678|ref|NP_566867.1| putative cysteine proteinase [Arabidopsis thaliana]
 gi|30315950|sp|Q9LXW3.1|CPR2_ARATH RecName: Full=Probable cysteine proteinase At3g43960; Flags:
           Precursor
 gi|7594557|emb|CAB88124.1| cysteine proteinase-like protein [Arabidopsis thaliana]
 gi|26452289|dbj|BAC43231.1| putative cysteine proteinase [Arabidopsis thaliana]
 gi|332644328|gb|AEE77849.1| putative cysteine proteinase [Arabidopsis thaliana]
          Length = 376

 Score =  256 bits (653), Expect = 1e-65,   Method: Compositional matrix adjust.
 Identities = 140/321 (43%), Positives = 198/321 (61%), Gaps = 17/321 (5%)

Query: 34  HEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSD 93
           +E  ++  +EQW+ ++G+ Y    EK  R  IFK NL+ IE+ N + NR+Y+ G N+FSD
Sbjct: 33  NEGEVLTMYEQWLVENGKNYNGLGEKERRFKIFKDNLKRIEEHNSDPNRSYERGLNKFSD 92

Query: 94  LTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVT-HIKDQGQC 152
           LT +EF+A Y G      S+S  + R   ++Y+    +P  +DWRE+GAV   +K QG+C
Sbjct: 93  LTADEFQASYLGGKMEKKSLSDVAER---YQYKEGDVLPDEVDWRERGAVVPRVKRQGEC 149

Query: 153 GSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCS--TDNHGCSGGLMDKAFEYIIENKG 210
           GSCWAF+A  AVEGI QIT G+L+ LSEQ+L+DC    DN GC+GG    AFE+I EN G
Sbjct: 150 GSCWAFAATGAVEGINQITTGELVSLSEQELIDCDRGNDNFGCAGGGAVWAFEFIKENGG 209

Query: 211 LATEADYPYRHEE-GTCDNQKEKAV-AATISKYEDLPKGDEQALLQAVSNQPVSVCVDAS 268
           + ++  Y Y  E+   C   + K     TI+ +E +P  DE +L +AV+ QP+SV + A+
Sbjct: 210 IVSDEVYGYTGEDTAACKAIEMKTTRVVTINGHEVVPVNDEMSLKKAVAYQPISVMISAA 269

Query: 269 GRAFHFYKSGVLNADCGNNC-DHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRI 327
             +   YKSGV    C N   DH V +VG+GT+ +E    YWLI+NSWG  WGE GY+R+
Sbjct: 270 NMS--DYKSGVYKGACSNLWGDHNVLIVGYGTSSDE--GDYWLIRNSWGPEWGEGGYLRL 325

Query: 328 LRD----AGLCGIATAASYPV 344
            R+     G C +A A  YP+
Sbjct: 326 QRNFHEPTGKCAVAVAPVYPI 346


>gi|158268253|gb|ABW25046.1| cathepsin L-like protease [Strongylus vulgaris]
          Length = 354

 Score =  256 bits (653), Expect = 2e-65,   Method: Compositional matrix adjust.
 Identities = 156/359 (43%), Positives = 211/359 (58%), Gaps = 33/359 (9%)

Query: 13  MFVIIILVITCASQVVS--GRSMHEPSIVEKHEQWMAQH---------------GRTY-K 54
           MF ++ LV+ CAS   S    S H+ +I     + + Q                G++Y K
Sbjct: 1   MFRLLSLVLLCASVFASIDSGSRHDHTIRLHRVKSLRQKIDEAFKLWDDYKESFGKSYNK 60

Query: 55  DELEKAMRLNIFKQNLEYIEKANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVP 111
           DE    M    F +N+ +I++ N+E   G +T+++G N  +DL   ++R L    +R   
Sbjct: 61  DEENDYME--AFVKNVIHIDEHNQEHRLGRKTFEMGLNSIADLPFSQYRKLNGYRHRRNF 118

Query: 112 SVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQIT 171
             S QS+        NV ++P S+DWR+KG VT +K+QG CGSCWAFSA  A+EG     
Sbjct: 119 GDSMQSNGTKWLAPFNV-EIPDSVDWRDKGLVTDVKNQGMCGSCWAFSATGALEGQHARA 177

Query: 172 RGKLIELSEQQLVDCSTD--NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQ 229
            GK++ LSEQ LVDCST   NHGC+GGLMD AFEYI +N G+ TE  YPY   E  C + 
Sbjct: 178 SGKMVSLSEQNLVDCSTKYGNHGCNGGLMDLAFEYIKDNHGIDTEESYPYVGRETKC-HF 236

Query: 230 KEKAVAATISKYEDLPKGDEQALLQAVSNQ-PVSVCVDASGRAFHFYKSGV-LNADCGN- 286
           K+K + A    + DLP+GDE+AL  AV+ Q P+S+ +DA  R F  YK GV  + +C + 
Sbjct: 237 KKKDIGAEDKGFVDLPEGDEEALKVAVATQGPISIAIDAGHRTFQLYKKGVYYDEECSSE 296

Query: 287 NCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD-AGLCGIATAASYPV 344
             DHGV +VG+GT  E     YWLIKNSWG  WGE GYIRI R+ +  CG+AT ASYP+
Sbjct: 297 ELDHGVLLVGYGTDPE--AGDYWLIKNSWGPGWGEKGYIRIARNRSNHCGVATKASYPL 353


>gi|125526835|gb|EAY74949.1| hypothetical protein OsI_02845 [Oryza sativa Indica Group]
          Length = 360

 Score =  256 bits (653), Expect = 2e-65,   Method: Compositional matrix adjust.
 Identities = 144/322 (44%), Positives = 190/322 (59%), Gaps = 16/322 (4%)

Query: 37  SIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEG-NRTYKLGTNEFSDLT 95
           S+  +HE+WMA+ GR Y D  EKA R+ +F  N E ++ AN+ G +RTY LG N+FSDLT
Sbjct: 38  SMAARHERWMARFGRAYADAAEKARRMEVFAANAERVDAANRAGGDRTYTLGLNQFSDLT 97

Query: 96  NEEFRALYTGYN-RPVPSVSRQSSRP---STFKYQNVTDVPTSIDWREKGAVTHIKDQGQ 151
           ++EF   + GY+  P P   R   R    +     + TDVP S+DWR +GAVT +K+Q  
Sbjct: 98  DDEFARTHLGYSWAPPPPSHRHGHRAENGTAAAAADDTDVPDSVDWRARGAVTEVKNQRS 157

Query: 152 CGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTDNHGCSGGLMDKAFEYIIENKGL 211
           CGSCWAF+AVAA EG+ Q+  G L+ LSEQQ++DC+   + CSGG +  A  YI  + GL
Sbjct: 158 CGSCWAFAAVAATEGLVQLATGNLVSLSEQQVLDCTGGANTCSGGDVSAALRYIAASGGL 217

Query: 212 ATEADYPYRHEEGTCD----NQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDA 267
            TEA Y Y  ++G C          A A   +++  L  GDE AL    + QPV V V+A
Sbjct: 218 QTEAAYAYGGQQGACRAGGFAAPNSAAAVGGARWARL-YGDEGALQALAAGQPVVVVVEA 276

Query: 268 SGRAFHFYKSGVL--NADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYI 325
           S   F  Y+SGV   +A CG   +H V VV    A  + G +YWL+KN WG  WGE GY+
Sbjct: 277 SEPDFRHYRSGVYAGSAACGRRLNHAVTVV-GYGAAADGGGEYWLVKNQWGTWWGEGGYM 335

Query: 326 RILRD---AGLCGIATAASYPV 344
           R+ R     G CGIAT A YP 
Sbjct: 336 RVARGGAAGGNCGIATYAFYPT 357


>gi|242020003|ref|XP_002430447.1| Cathepsin L precursor, putative [Pediculus humanus corporis]
 gi|212515585|gb|EEB17709.1| Cathepsin L precursor, putative [Pediculus humanus corporis]
          Length = 345

 Score =  256 bits (653), Expect = 2e-65,   Method: Compositional matrix adjust.
 Identities = 144/348 (41%), Positives = 211/348 (60%), Gaps = 22/348 (6%)

Query: 13  MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEY 72
           M ++  + +T  S  ++  S ++  ++E+ + + A+H + Y +++E+  R+ IF  N + 
Sbjct: 1   MKILFFIALTVLS--INAVSFYD-LVMEEWQLFKAEHKKNYNNDVEEKFRMKIFMDNKQK 57

Query: 73  IEKAN---KEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPV-PSVSRQSS-----RPSTF 123
           I K N   + G   YKLG N++SD+ + EF   + G+N+ + P   R ++     + S F
Sbjct: 58  ITKHNTKYQRGEVGYKLGLNKYSDMLHHEFINTFNGFNKSIIPPHLRSNNGKTHLKGSFF 117

Query: 124 KYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQL 183
                  +P  +DW + GAVT +KDQG CGSCWAFSA  A+EG+       L+ LSEQ L
Sbjct: 118 IPPANVKLPKHVDWVKLGAVTPVKDQGHCGSCWAFSATGALEGLHFRKTKVLVSLSEQNL 177

Query: 184 VDCSTD--NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKY 241
           +DCST+  N+GC+GGLMD+AF+Y+  N G+ TE  YPY      C  + E +  A  + Y
Sbjct: 178 IDCSTEEGNNGCNGGLMDQAFQYVRINGGIDTERSYPYEGNNDVCRYEPENS-GAIDTGY 236

Query: 242 EDLPKGDEQALLQAVSN-QPVSVCVDASGRAFHFYKSGV-LNADCGN---NCDHGVAVVG 296
            D+P GDE AL  AV+   PVSV +DAS  +F  Y SGV    +C N   + DHGV VVG
Sbjct: 237 TDVPLGDEDALKSAVATVGPVSVAIDASQESFQLYSSGVYFEPNCKNEPESLDHGVLVVG 296

Query: 297 FGTAEEENGAKYWLIKNSWGETWGESGYIRILRDA-GLCGIATAASYP 343
           +GT +EE    YWL+KNSWG++WGE+GYI++ R+A   CGIAT  S+P
Sbjct: 297 YGT-DEETQQDYWLVKNSWGDSWGENGYIKMARNADNQCGIATQPSFP 343


>gi|225719768|gb|ACO15730.1| Cathepsin L1 precursor [Caligus clemensi]
          Length = 338

 Score =  256 bits (653), Expect = 2e-65,   Method: Compositional matrix adjust.
 Identities = 145/341 (42%), Positives = 202/341 (59%), Gaps = 18/341 (5%)

Query: 15  VIIILVITCASQVVSGRSMHEPSIVEKHEQ-WMAQHGRTYKDELEKAMRLNIFKQNLEYI 73
           + + +++ C S V +       S +E H   W   H + Y    E+  R  ++++NL+ I
Sbjct: 4   LYLAVLVLCVSAVCAAPRF--DSQLEDHWHLWKNWHSKNYHAS-EEGWRRMVWEKNLKKI 60

Query: 74  EKANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTD 130
           E  N E   G  +++LG N F D+TNEEFR    GY +     + +  + S F   N   
Sbjct: 61  EIHNLEHTMGKHSHRLGMNHFGDMTNEEFRQTMNGYKQ----TTERKFKGSLFMEPNYLQ 116

Query: 131 VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD- 189
            P ++DWREKG VT +KDQG CGSCWAFS   A+EG      GKL+ LSEQ LVDCS   
Sbjct: 117 APKAVDWREKGYVTPVKDQGSCGSCWAFSTTGAMEGQPFRKTGKLVSLSEQNLVDCSRPE 176

Query: 190 -NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGD 248
            N GC+GGLMD+AF+YI +N GL TE  YPY   +    + K +  AA  + + D+P G 
Sbjct: 177 GNEGCNGGLMDQAFQYIQDNAGLDTEESYPYVGTDEDPCHYKPEFSAANETGFVDIPSGK 236

Query: 249 EQALLQAVSN-QPVSVCVDASGRAFHFYKSGV-LNADCGN-NCDHGVAVVGFG-TAEEEN 304
           E A+++AV+   PVSV +DA   +F FY+SG+    +C +   DHGV VVG+G   E+ +
Sbjct: 237 EHAMMKAVAAVGPVSVAIDAGHESFQFYESGIYYEKECSSEELDHGVLVVGYGFEGEDVD 296

Query: 305 GAKYWLIKNSWGETWGESGYIRILRD-AGLCGIATAASYPV 344
           G KYW++KNSW E WG+ GYI + +D    CGIATA+SYP+
Sbjct: 297 GKKYWIVKNSWSEKWGDKGYIYMAKDRKNHCGIATASSYPL 337


>gi|346466067|gb|AEO32878.1| hypothetical protein [Amblyomma maculatum]
          Length = 358

 Score =  256 bits (653), Expect = 2e-65,   Method: Compositional matrix adjust.
 Identities = 144/343 (41%), Positives = 207/343 (60%), Gaps = 24/343 (6%)

Query: 14  FVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYI 73
           FV++  +   A+ +      H+  +  +   + A HG+ Y  E E+  RL I+ +N   I
Sbjct: 27  FVVLGCLFVTAAAIT-----HQELVGAEWSAFKALHGKEYHSETEEYYRLKIYMENRLKI 81

Query: 74  EKANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSS---RPSTFKYQN 127
            + N++      +YKL  NEF DL + EF +   G+ R   S  R+ S    P   + ++
Sbjct: 82  ARHNEKYANNKASYKLAMNEFGDLLHHEFVSTRNGFKRNYRSTPREGSFYIEPEGIEDKH 141

Query: 128 VTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCS 187
           +   P ++DWR+KGAVT +K+QGQCGSCWAFS   ++EG      G+++ LSEQ LVDCS
Sbjct: 142 L---PKTVDWRKKGAVTPVKNQGQCGSCWAFSTTGSLEGQHFRKTGRMVSLSEQNLVDCS 198

Query: 188 TD--NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLP 245
               N+GC GGLMD AF+YI  N G+ TE  YPY   +G C  +K   V AT + + D+P
Sbjct: 199 GKFGNNGCEGGLMDNAFKYIKANGGIDTELSYPYNGTDGICHFEKSD-VGATDTGFVDIP 257

Query: 246 KGDEQALLQAVSN-QPVSVCVDASGRAFHFYKSGVLN-ADCGN-NCDHGVAVVGFGTAEE 302
           +G+EQ L +AV+   PVSV +DAS  +F FY  GV +  +C + + DHGV VVG+GT   
Sbjct: 258 EGNEQLLKKAVATVGPVSVAIDASHESFQFYSQGVYDEPECSSESLDHGVLVVGYGT--- 314

Query: 303 ENGAKYWLIKNSWGETWGESGYIRILRDA-GLCGIATAASYPV 344
           ++G  YWL+KNSWG TWG+ GYI + R+    CGIA++ASYP+
Sbjct: 315 KDGQDYWLVKNSWGTTWGDDGYIYMTRNKENQCGIASSASYPL 357


>gi|91092014|ref|XP_970644.1| PREDICTED: similar to cathepsin-L-like cysteine peptidase 02
           [Tribolium castaneum]
 gi|270001249|gb|EEZ97696.1| cathepsin L precursor [Tribolium castaneum]
          Length = 337

 Score =  255 bits (652), Expect = 2e-65,   Method: Compositional matrix adjust.
 Identities = 147/344 (42%), Positives = 202/344 (58%), Gaps = 23/344 (6%)

Query: 14  FVIIILVITCASQVVSGRSMHEPSIVEKHEQWMA---QHGRTYKDELEKAMRLNIFKQNL 70
           F++ + +    SQ VS   + +       EQW A    H + Y+ E E+  R+ IF +N 
Sbjct: 3   FLVFVALCVVGSQAVSFFDLVQ-------EQWGAFKVTHKKQYESETEERFRMKIFMENA 55

Query: 71  EYIEKANK---EGNRTYKLGTNEFSDLTNEEFRALYTGYNRP-VPSVSRQSSRPSTFKYQ 126
             + K NK   +G  ++KLG N++SD+ N EF     GYNR   P  S +     TF   
Sbjct: 56  HKVAKHNKLYAQGLVSFKLGVNKYSDMLNHEFVHTLNGYNRSKTPLRSGELDESITFIPP 115

Query: 127 NVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDC 186
              ++P  IDWR+ GAVT +KDQGQCGSCW+FS   ++EG       KL+ LSEQ L+DC
Sbjct: 116 ANVELPKQIDWRKLGAVTPVKDQGQCGSCWSFSTTGSLEGQHFRKSKKLVSLSEQNLIDC 175

Query: 187 STD--NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDL 244
           S    N+GC+GGLMD AF YI +N G+ TE  YPY+ E+  C + K +   AT   + D+
Sbjct: 176 SEKYGNNGCNGGLMDNAFRYIKDNGGIDTEQSYPYKAEDEKC-HYKPRNKGATDRGFVDI 234

Query: 245 PKGDEQALLQAVSN-QPVSVCVDASGRAFHFYKSGV-LNADCGN-NCDHGVAVVGFGTAE 301
             GDE+ L  AV+   P+SV +DAS   F  Y  GV    +C +   DHGV VVG+GT  
Sbjct: 235 ESGDEEKLKAAVATVGPISVAIDASHPTFQQYSEGVYYEPECSSEQLDHGVLVVGYGT-- 292

Query: 302 EENGAKYWLIKNSWGETWGESGYIRILRDA-GLCGIATAASYPV 344
           +E+G  YWL+KNSWG++WG+ GYI++ R+    CGIAT ASYP+
Sbjct: 293 DEDGNDYWLVKNSWGDSWGDQGYIKMARNRDNNCGIATQASYPL 336


>gi|89272015|emb|CAJ83143.1| cathepsin L2 [Xenopus (Silurana) tropicalis]
          Length = 335

 Score =  255 bits (652), Expect = 2e-65,   Method: Compositional matrix adjust.
 Identities = 145/342 (42%), Positives = 206/342 (60%), Gaps = 18/342 (5%)

Query: 13  MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEY 72
           M + + +   C + V +  +  +P++      W   H ++Y  + E+  R  ++++NL  
Sbjct: 1   MALYLGIAAICLTTVFAAPTT-DPALDNHWNLWKNWHKKSYAPK-EEGWRRVLWEKNLRM 58

Query: 73  IEKANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
           IE  N E   G  ++ LG N+F D+TNEEFR L  GY       +++  R STF   N  
Sbjct: 59  IEFHNLEHSLGKHSHSLGMNQFGDMTNEEFRQLMNGYK------NQKKIRGSTFLAPNNF 112

Query: 130 DVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCS-- 187
           + P S+DWR+KG VT +KDQGQCGSCWAFS   A+EG      GK+I LSEQ LVDCS  
Sbjct: 113 ESPKSVDWRKKGYVTPVKDQGQCGSCWAFSTTGALEGQHYRNTGKMISLSEQNLVDCSRA 172

Query: 188 TDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKG 247
             N GC+GGLMD+AF+Y+ +N G+ +E  YPY  ++    +      +A  + + D+   
Sbjct: 173 QGNQGCNGGLMDQAFQYVKDNGGIDSEDSYPYTAKDDQECHYDPNYNSANDTGFVDVTSE 232

Query: 248 DEQALLQAVSN-QPVSVCVDASGRAFHFYKSGV-LNADCGN-NCDHGVAVVGFG-TAEEE 303
            E+ L+ AV++  PVSV VDA  ++F FYKSG+    +C + + DHGV VVG+G   E+E
Sbjct: 233 SEKDLMNAVASVGPVSVAVDAGHQSFQFYKSGIYYEPECSSEDLDHGVLVVGYGFEGEDE 292

Query: 304 NGAKYWLIKNSWGETWGESGYIRILRD-AGLCGIATAASYPV 344
           +G KYW++KNSW E WG  GYI I +D    CGIATAASYP+
Sbjct: 293 DGKKYWIVKNSWSEKWGNDGYIYIAKDRHNHCGIATAASYPL 334


>gi|256082975|ref|XP_002577726.1| subfamily C1A unassigned peptidase (C01 family) [Schistosoma
           mansoni]
          Length = 1471

 Score =  255 bits (652), Expect = 2e-65,   Method: Compositional matrix adjust.
 Identities = 141/312 (45%), Positives = 195/312 (62%), Gaps = 21/312 (6%)

Query: 48  QHGRTYKDELEKAMRLNIFKQNLEYIEKAN---KEGNRTYKLGTNEFSDLTNEEFRALYT 104
           Q  R Y    E+  R  IF  N   + + N   +EG  TYK+G NEF+D T+ E + L  
Sbjct: 66  QFKRAYNGIHEETRRFFIFSANFVKMMEHNHAFQEGKVTYKMGVNEFTDKTDYELKKL-R 124

Query: 105 GYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAV 164
           GY     ++  + S   TF     T +P+ +DWR +GAVT +K+QGQCGSCWAFS   A+
Sbjct: 125 GYKVTSGAIRHKGS---TFIRSEHTKLPSKVDWRREGAVTDVKNQGQCGSCWAFSTTGAI 181

Query: 165 EGITQITRGKLIELSEQQLVDCSTD--NHGCSGGLMDKAFEYIIENKGLATEADYPYRHE 222
           EG       +L+ LSEQQLVDCS    N+GCSGGLM+ AFEY+ +N+G+ +E  YPY   
Sbjct: 182 EGQHYRKTNRLVNLSEQQLVDCSKSYGNNGCSGGLMNSAFEYVRDNEGIDSEISYPYVSG 241

Query: 223 EGTCDNQ---KEKAVAATISKYEDLPKGDEQALLQAVSNQ-PVSVCVDASGRAFHFYKSG 278
           +GT +N+       + A ++ Y ++ +GDE+AL+ AV+ + PVSV ++A   +F  YKSG
Sbjct: 242 DGTENNRCLFNASNILAQVTGYVNIHEGDERALMDAVATKGPVSVAINAGLPSFSMYKSG 301

Query: 279 VL-NADCG---NNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRDA-GL 333
           +  + DC    +  DHGV VVG+G   EENG  YWLIKNSWGE WGE GYI+I + +  +
Sbjct: 302 IYSDTDCEGTLDALDHGVLVVGYG---EENGRSYWLIKNSWGEEWGEKGYIKISKGSHNM 358

Query: 334 CGIATAASYPVA 345
           CG+A+AASYP+ 
Sbjct: 359 CGVASAASYPLV 370


>gi|356545079|ref|XP_003540973.1| PREDICTED: LOW QUALITY PROTEIN: vignain-like [Glycine max]
          Length = 330

 Score =  255 bits (652), Expect = 2e-65,   Method: Compositional matrix adjust.
 Identities = 135/301 (44%), Positives = 186/301 (61%), Gaps = 20/301 (6%)

Query: 24  ASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRT 83
           ASQV   R++ + S+ E+HE+WM+++G+ YKD  E+  R  IFK+N+ YIE +N    + 
Sbjct: 5   ASQVTC-RTLQDASMYERHEEWMSRYGKVYKDPREREKRFRIFKENMNYIETSNNVAIKP 63

Query: 84  YKLGTNEFSDLTNEEF---RALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREK 140
            KL  N+F+DL NEEF   R ++ G       + R  SR  TF +      P      +K
Sbjct: 64  XKLVINQFADLNNEEFIAPRNIFKGM-----ILCRFLSRKHTFPF------PYVFLGHKK 112

Query: 141 GAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD--NHGCSGGLM 198
           GAVT +KDQG CG CWAF  VA+ EGI  +T GKLI LSEQ+LVDC T   + GC  GLM
Sbjct: 113 GAVTPVKDQGHCGFCWAFYDVASTEGILALTAGKLISLSEQELVDCDTKGVDQGCECGLM 172

Query: 199 DKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSN 258
           D AF++II+N G+  +A+YPY+  +G C+  +E   AATI+  ED+P  +E+AL + V+N
Sbjct: 173 DDAFKFIIQNHGV-XDANYPYKGVDGKCNANEEANPAATITGXEDVPANNEKALQKVVAN 231

Query: 259 QPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGET 318
           QPV V +DA    F FYKSGV    C    +HGV  +G+G + +  G +YWL+KNS    
Sbjct: 232 QPVFVAIDACDSDFQFYKSGVFTGSCETELNHGVTTMGYGVSHD--GTQYWLVKNSXETE 289

Query: 319 W 319
           W
Sbjct: 290 W 290


>gi|218187750|gb|EEC70177.1| hypothetical protein OsI_00904 [Oryza sativa Indica Group]
 gi|222617983|gb|EEE54115.1| hypothetical protein OsJ_00884 [Oryza sativa Japonica Group]
          Length = 327

 Score =  255 bits (652), Expect = 2e-65,   Method: Compositional matrix adjust.
 Identities = 136/322 (42%), Positives = 184/322 (57%), Gaps = 27/322 (8%)

Query: 39  VEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEE 98
            +  E+WMA+ G+ Y    EK  R  +F+ N+ +I            L  N+F+DLTN+E
Sbjct: 16  TQMFEEWMAKFGKKYPCHGEKEYRFGVFRDNVRFIRSYRPPAGYNSALRVNQFADLTNDE 75

Query: 99  FRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAF 158
           F + +TG   P P  + +   P          +P  IDWR KGAVT +KDQG CGSCWAF
Sbjct: 76  FVSTHTGAKPPCPKDAPRGVDPIW--------LPCCIDWRYKGAVTDVKDQGACGSCWAF 127

Query: 159 SAVAAVEGITQITRGKLIELSEQQLVDCSTDNHGCSGGLMDKAFEYIIENKGLATEADYP 218
           +AVAA+EG+TQI  GKL  LSEQ+LVDC T + GC+GG  D+AFE +    G+  E+ Y 
Sbjct: 128 AAVAAIEGLTQIRTGKLTPLSEQELVDCDTGSSGCAGGHTDRAFELVAAKGGITAESGYR 187

Query: 219 YRHEEGTCDNQKEKAV---AATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFY 275
           Y    G C  + + A+   AA I  +  +P GDE+ L  AV+ QPV+  +DASG AF FY
Sbjct: 188 YEGYRGKC--RADDALFNHAARIGGHRAVPPGDERQLATAVARQPVTAYIDASGPAFQFY 245

Query: 276 KSGVLNADCGN---------NCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIR 326
            SGV    CG+           +H V +VG+   +  +G KYW+ KNSWG+TWGE GYI 
Sbjct: 246 GSGVFPGPCGSGSGAAAAAPTTNHAVTLVGY-CQDGASGKKYWVAKNSWGKTWGEKGYIL 304

Query: 327 ILRDA----GLCGIATAASYPV 344
           + +D     G CG+A +  YP 
Sbjct: 305 LEKDVASPHGTCGVAVSPFYPT 326


>gi|149617838|ref|XP_001521715.1| PREDICTED: cathepsin L1-like [Ornithorhynchus anatinus]
          Length = 338

 Score =  255 bits (652), Expect = 2e-65,   Method: Compositional matrix adjust.
 Identities = 145/342 (42%), Positives = 209/342 (61%), Gaps = 15/342 (4%)

Query: 13  MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEY 72
           M +++ LV  C    VS   + +  +    + W   H ++Y  E E+  R  ++++NL+ 
Sbjct: 1   MNLLVCLVSLCWGLAVSA-PLGDSELDRHWKLWKNWHQKSYH-EAEEGWRRTVWEENLKA 58

Query: 73  IEKANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
           I+  N E   G  TY+LG N+F DLTNEEF+ + TG  R     +R +   S F   N  
Sbjct: 59  IQLHNLEQSLGLHTYRLGMNQFGDLTNEEFQEILTG-ERHFSKGNRING--SAFLEANFV 115

Query: 130 DVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCS-- 187
            VPTS+DWR+ G VT +K+QG CGSCWAFS   A+EG      G+LI LSEQ LVDCS  
Sbjct: 116 QVPTSVDWRDHGYVTPVKNQGHCGSCWAFSTTGALEGQLFRKSGRLISLSEQNLVDCSWQ 175

Query: 188 TDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKG 247
             N GC GG++D AF+YI++N+G+ +E  YPY  ++      K +   A ++ + D+P  
Sbjct: 176 QGNQGCHGGIVDLAFQYILQNQGIDSEDCYPYTAKDTAQCTFKPECATAPVTGFVDIPPH 235

Query: 248 DEQALLQAVSN-QPVSVCVDASGRAFHFYKSGVL-NADCGN-NCDHGVAVVGFG-TAEEE 303
            E+AL++AV+   PVSV +DAS  +F FY+SG+  +  C + + DH V VVG+G   E+E
Sbjct: 236 SEEALMKAVATVGPVSVGIDASSTSFRFYQSGIFYDPKCSSESLDHAVLVVGYGYEREDE 295

Query: 304 NGAKYWLIKNSWGETWGESGYIRILRDAG-LCGIATAASYPV 344
            G KYW++KNSWG+ WG+ GY+ + +D G  CGIAT ASYP+
Sbjct: 296 AGKKYWIVKNSWGKHWGDRGYVYMSKDRGNHCGIATVASYPL 337


>gi|281203744|gb|EFA77940.1| hypothetical protein PPL_08585 [Polysphondylium pallidum PN500]
          Length = 505

 Score =  255 bits (652), Expect = 2e-65,   Method: Compositional matrix adjust.
 Identities = 147/375 (39%), Positives = 207/375 (55%), Gaps = 38/375 (10%)

Query: 5   FEKSF---------IIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKD 55
           FE SF         I+  ++ I+L+I     + +     E     + E W+ +  + Y D
Sbjct: 135 FESSFRCFSIIFLKIMNRYINILLLIFGLIAISNALLFSEEQYKNEFENWIDRFEKKY-D 193

Query: 56  ELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSR 115
             E   R +IFK N++++   N + ++T  LG N  +DLTN E+R  Y G ++   +V  
Sbjct: 194 VSEFKKRFSIFKSNMDFVHSWNSKNSQTV-LGLNHLADLTNLEYRQFYLGTHKK--AVLG 250

Query: 116 QSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKL 175
                     Q+V     ++DWR+KGAV+ IKDQGQCGSCW+FS   +VEG  QI  G +
Sbjct: 251 TPGNHEVSNLQSVFGDSATVDWRQKGAVSPIKDQGQCGSCWSFSTTGSVEGAHQIKSGNM 310

Query: 176 IELSEQQLVDCSTD--NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKA 233
           +ELSEQ LVDCST   N GC+GGLMD AFEYII N G+ TE+ YPY    GT     +  
Sbjct: 311 VELSEQNLVDCSTSEGNMGCNGGLMDYAFEYIITNNGIDTESSYPYTASSGTTCKYNKAN 370

Query: 234 VAATISKYEDLPKGDEQALLQAVSNQ-PVSVCVDASGRAFHFYKSGV-LNADCGN-NCDH 290
             ATIS Y+++  G E  L  AV N  PVSV +DAS  +F  Y  G+  +A C + N DH
Sbjct: 371 SGATISSYKNITAGSESDLADAVKNAGPVSVAIDASHNSFQLYSHGIYYDASCSSVNLDH 430

Query: 291 GVAVVGFGTAEEENGAK-------------------YWLIKNSWGETWGESGYIRILRDA 331
           GV VVG+G+   ++ ++                   YW++KNSWG +WG+ G+I + +D 
Sbjct: 431 GVLVVGYGSGTPDSDSRVHKGSQVRVKVPKTDDTKNYWIVKNSWGTSWGDKGFIYMSKDR 490

Query: 332 -GLCGIATAASYPVA 345
              CGIA+ ASYP+ 
Sbjct: 491 DNNCGIASCASYPIV 505


>gi|66823245|ref|XP_644977.1| cysteine proteinase 5 precursor [Dictyostelium discoideum AX4]
 gi|166201986|sp|P54640.2|CYSP5_DICDI RecName: Full=Cysteine proteinase 5; Flags: Precursor
 gi|60473097|gb|EAL71045.1| cysteine proteinase 5 precursor [Dictyostelium discoideum AX4]
          Length = 344

 Score =  255 bits (651), Expect = 2e-65,   Method: Compositional matrix adjust.
 Identities = 142/352 (40%), Positives = 200/352 (56%), Gaps = 29/352 (8%)

Query: 13  MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEY 72
           M V+  L +   S   + +   E         WM  H ++Y  E E   R NIFK N++Y
Sbjct: 1   MKVLSFLCVLLVSVATAKQQFSELQYRNAFTDWMITHQKSYTSE-EFGARYNIFKANMDY 59

Query: 73  IEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPS-VSRQSSRPSTFKYQNVTDV 131
           +++ N +G+ T  LG N F+D+TNEE+R  Y G      S +  Q  +  T      T  
Sbjct: 60  VQQWNSKGSETV-LGLNNFADITNEEYRNTYLGTKFDASSLIGTQEEKVFT------TSS 112

Query: 132 PTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTDNH 191
             S DWR +GAVT +K+QGQCG CW+FS   + EG    ++G+L+ LSEQ L+DCST+N 
Sbjct: 113 AASKDWRSEGAVTPVKNQGQCGGCWSFSTTGSTEGAHFQSKGELVSLSEQNLIDCSTENS 172

Query: 192 GCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQA 251
           GC GGLM  AFEYII N G+ TE+ YPY+ E G C+ + E +  AT+S Y+ +  G E +
Sbjct: 173 GCDGGLMTYAFEYIINNNGIDTESSYPYKAENGKCEYKSENS-GATLSSYKTVTAGSESS 231

Query: 252 LLQAVSNQPVSVCVDASGRAFHFYKSGV-LNADCGN-NCDHGVAVVGFGTAEEENGA--- 306
           L  AV+  PVSV +DAS ++F  Y SG+    +C + N DHGV  VG+G+    +     
Sbjct: 232 LESAVNVNPVSVAIDASHQSFQLYTSGIYYEPECSSENLDHGVLAVGYGSGSGSSSGQSS 291

Query: 307 -------------KYWLIKNSWGETWGESGYIRILRDA-GLCGIATAASYPV 344
                        +YW++KNSWG +WG  GYI + R+    CGIA++AS+PV
Sbjct: 292 GQSSGNLSASSSNEYWIVKNSWGTSWGIEGYILMSRNRDNNCGIASSASFPV 343


>gi|311265493|ref|XP_003130681.1| PREDICTED: cathepsin L1-like [Sus scrofa]
          Length = 332

 Score =  255 bits (651), Expect = 2e-65,   Method: Compositional matrix adjust.
 Identities = 143/337 (42%), Positives = 200/337 (59%), Gaps = 19/337 (5%)

Query: 17  IILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKA 76
           ++L   C   + S    H+ S+     +W A H + Y    E+  R  I+++N++ IE+ 
Sbjct: 5   LLLAAFCLG-IASAAPRHDHSLDADWYKWKATHRKLYGLN-EEGRRRAIWEKNMKMIERH 62

Query: 77  N---KEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPT 133
           N   ++G  ++ +  N F D+TNEEFR    G+       +++  +   F        P 
Sbjct: 63  NWEHRQGKHSFTMAMNAFGDMTNEEFRKTMNGFQ------NQKHKKGKVFLDAGSALTPH 116

Query: 134 SIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCS--TDNH 191
           S+DWREKG VT +K+QG CGSCWAFSA  A+EG       KLI LSEQ LVDCS    N 
Sbjct: 117 SVDWREKGYVTAVKNQGHCGSCWAFSATGALEGQMFRKTSKLISLSEQNLVDCSWPEGNE 176

Query: 192 GCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQA 251
           GC+GGLMD AF+YI +N GL +E  YPY  ++G+C   K ++ AA  + Y D+PK  E+A
Sbjct: 177 GCNGGLMDNAFQYIKDNGGLDSEESYPYFGKDGSC-KYKPQSSAANDTGYVDIPK-QEKA 234

Query: 252 LLQAVSN-QPVSVCVDASGRAFHFYKSGV-LNADCGN-NCDHGVAVVGFGTAEEENGAKY 308
           L++AV+   P+SV +DAS  +F FY +G+     C + + DHGV VVG+G     +  KY
Sbjct: 235 LMKAVATVGPISVGIDASHESFQFYSTGIYFEPQCSSEDLDHGVLVVGYGVEGAHSNNKY 294

Query: 309 WLIKNSWGETWGESGYIRILRDA-GLCGIATAASYPV 344
           WL+KNSWG TWG  GYI++ +D    CGIAT ASYPV
Sbjct: 295 WLVKNSWGNTWGMDGYIKMTKDQNNHCGIATMASYPV 331


>gi|410519429|gb|AFV73398.1| cathepsin L [Haliotis discus hannai]
          Length = 326

 Score =  255 bits (651), Expect = 2e-65,   Method: Compositional matrix adjust.
 Identities = 142/309 (45%), Positives = 185/309 (59%), Gaps = 23/309 (7%)

Query: 48  QHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNR---TYKLGTNEFSDLTNEEFRALYT 104
           +H + YKD  E+A R  +F + +EYI++ N E +R   ++++G NE++D+ NEEF  +  
Sbjct: 28  RHNKQYKDNQEEAYRKGVFMKAVEYIQQHNLEADRGVHSFRVGINEYADMPNEEFVRVMN 87

Query: 105 GYNRPVPSVSRQSSRPSTFKYQ---NVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAV 161
           GY         Q  RP    Y    NV D+P ++DWR KG VT +K+QGQCGSCWAFS+ 
Sbjct: 88  GY-------KMQEQRPKAPTYMPPSNVGDLPATVDWRTKGYVTEVKNQGQCGSCWAFSST 140

Query: 162 AAVEGITQITRGKLIELSEQQLVDCSTD--NHGCSGGLMDKAFEYIIENKGLATEADYPY 219
            ++EG T     KLI LSEQ LVDCST+  N GC GGLMD+AF YI  N G+ TE  YPY
Sbjct: 141 GSLEGQTFKKYNKLISLSEQNLVDCSTEQGNMGCGGGLMDQAFTYIKVNDGIDTETSYPY 200

Query: 220 RHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSN-QPVSVCVDASGRAFHFYKSG 278
               G C   K   V A  + Y D+    E  L  AV+   P++V +DAS  +F  YKSG
Sbjct: 201 EAASGKCRFNKAN-VGANDTGYTDIKSKSESDLQSAVATVGPIAVAIDASHMSFQLYKSG 259

Query: 279 VLN-ADCGNN-CDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRDA-GLCG 335
           V +   C     DHGV  VG+GT   ++G  YWL+KNSWG TWG+ GYI + R+    CG
Sbjct: 260 VYHYIFCSQTRLDHGVLAVGYGT---DSGKDYWLVKNSWGATWGQQGYIMMSRNRDNNCG 316

Query: 336 IATAASYPV 344
           IAT ASYP 
Sbjct: 317 IATQASYPT 325


>gi|402770515|gb|AFQ98392.1| cathepsin L [Rhipicephalus microplus]
          Length = 332

 Score =  255 bits (651), Expect = 2e-65,   Method: Compositional matrix adjust.
 Identities = 144/340 (42%), Positives = 204/340 (60%), Gaps = 20/340 (5%)

Query: 16  IIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEK 75
           ++ L + CA   V+  +  +  +  + E +   H +TY+  +E+ +R  IF +N   I K
Sbjct: 1   MLRLSVLCAIAAVTVAASSQEILRTQWEAFKTTHKKTYQSHMEELLRFKIFTENSLIIAK 60

Query: 76  ANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTD-- 130
            N +   G  +YKLG N+F DL   EF  ++ G++      +R++   S     NV D  
Sbjct: 61  HNAKYAKGLVSYKLGMNQFGDLLAHEFARIFNGHHG-----TRKTGGSSFLPPANVNDSS 115

Query: 131 VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD- 189
           +P  +DWR+KGAVT +KDQGQCGSCWAFSA  ++EG   +  G+L+ LSEQ LVDCS   
Sbjct: 116 LPKVVDWRKKGAVTPVKDQGQCGSCWAFSATGSLEGQHFLKNGELVSLSEQNLVDCSQSF 175

Query: 190 -NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGD 248
            N+GC GGLM+ AF+YI  N G+ TE  YPY   +G C  +KE  V AT + Y ++  G 
Sbjct: 176 GNNGCEGGLMEDAFKYIKANDGIDTEKSYPYEAVDGECRFKKED-VGATDTGYVEIKAGS 234

Query: 249 EQALLQAVSN-QPVSVCVDASGRAFHFYKSGVLN-ADCGN-NCDHGVAVVGFGTAEEENG 305
           E  L +AV+   P+SV +DAS  +F  Y  GV +  +C + + DHGV VVG+G    + G
Sbjct: 235 EVDLKKAVATVGPISVAIDASHSSFQLYSEGVYDEPECSSEDLDHGVLVVGYGV---KGG 291

Query: 306 AKYWLIKNSWGETWGESGYIRILRD-AGLCGIATAASYPV 344
            KYWL+KNSW E+WG+ GYI + RD    CGIA+ ASYP+
Sbjct: 292 KKYWLVKNSWAESWGDQGYILMSRDNNNQCGIASQASYPL 331


>gi|254746340|emb|CAX16635.1| putative C1A cysteine protease precursor [Manduca sexta]
          Length = 342

 Score =  255 bits (651), Expect = 2e-65,   Method: Compositional matrix adjust.
 Identities = 141/319 (44%), Positives = 193/319 (60%), Gaps = 20/319 (6%)

Query: 43  EQWMA---QHGRTYKDELEKAMRLNIFKQNLEYIEKANK---EGNRTYKLGTNEFSDLTN 96
           E+W+A   QH + Y  E+E   R+ I+ +N   I K N+   +G  +YKLG N+++D+ +
Sbjct: 26  EEWVAFKMQHDKKYDSEVEDRFRMKIYAENKHKIAKHNQLYEQGLVSYKLGPNKYTDMLH 85

Query: 97  EEFRALYTGYNRPVPSVS-----RQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQ 151
            EF     GYNR           +   R +TF        P  +DW +KGAVT +KDQG+
Sbjct: 86  HEFIQAMNGYNRTAKHNKGLYGKKHDVRGATFIPPAHVKYPDHVDWTKKGAVTEVKDQGK 145

Query: 152 CGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD--NHGCSGGLMDKAFEYIIENK 209
           CGSCWAFS   A+EG      G L+ LSEQ L+DCS+   N+GC+GGLMD AF+YI +N 
Sbjct: 146 CGSCWAFSTTGALEGQHFRKSGYLVSLSEQNLIDCSSTYGNNGCNGGLMDNAFKYIKDNG 205

Query: 210 GLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSN-QPVSVCVDAS 268
           G+ TE  YPY   +  C    + + A  +  + D+P GDE+ L+QAV+   PVSV +DAS
Sbjct: 206 GIDTEKTYPYEGVDDKCRYNPKNSGAEDVG-FVDIPSGDEEKLMQAVATVGPVSVAIDAS 264

Query: 269 GRAFHFYKSGVL-NADCGN-NCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIR 326
             +F FY  GV  + +C + + DHGV VVG+GT  +E G  YWL+KNSW  TWGE GYI+
Sbjct: 265 QNSFQFYSGGVYYDTECSSTDLDHGVLVVGYGT--DEAGGDYWLVKNSWSRTWGELGYIK 322

Query: 327 ILRDA-GLCGIATAASYPV 344
           + R+    CGIAT ASYP+
Sbjct: 323 MARNRDNHCGIATDASYPL 341


>gi|47522698|ref|NP_999057.1| cathepsin L1 precursor [Sus scrofa]
 gi|2499874|sp|Q28944.1|CATL1_PIG RecName: Full=Cathepsin L1; Contains: RecName: Full=Cathepsin L1
           heavy chain; Contains: RecName: Full=Cathepsin L1 light
           chain; Flags: Precursor
 gi|1468964|dbj|BAA07140.1| porcine cathepsin L [Sus scrofa]
 gi|15027272|emb|CAC44793.1| cathepsin L [Sus scrofa]
          Length = 334

 Score =  255 bits (651), Expect = 2e-65,   Method: Compositional matrix adjust.
 Identities = 138/311 (44%), Positives = 194/311 (62%), Gaps = 18/311 (5%)

Query: 44  QWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKE---GNRTYKLGTNEFSDLTNEEFR 100
           +W A HGR Y    E+  R  ++++N++ IE  N+E   G   + +  N F D+TNEEFR
Sbjct: 31  KWKATHGRLYGMN-EEGWRRAVWEKNMKMIELHNQEYSQGKHGFSMAMNAFGDMTNEEFR 89

Query: 101 ALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSA 160
            +  G+       +++  +   F    V +VP S+DWREKG VT +K+QGQCGSCWAFSA
Sbjct: 90  QVMNGFQ------NQKHKKGKVFHESLVLEVPKSVDWREKGYVTAVKNQGQCGSCWAFSA 143

Query: 161 VAAVEGITQITRGKLIELSEQQLVDCS--TDNHGCSGGLMDKAFEYIIENKGLATEADYP 218
             A+EG      GKL+ LSEQ LVDCS    N GC+GGLMD AF+Y+ +N GL TE  YP
Sbjct: 144 TGALEGQMFRKTGKLVSLSEQNLVDCSRPQGNQGCNGGLMDNAFQYVKDNGGLDTEESYP 203

Query: 219 YRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSN-QPVSVCVDASGRAFHFYKS 277
           Y   E      K +  AA  + + D+P+  E+AL++AV+   P+SV +DA   +F FYKS
Sbjct: 204 YLGRETNSCTYKPECSAANDTGFVDIPQ-REKALMKAVATVGPISVAIDAGHSSFQFYKS 262

Query: 278 GV-LNADCGN-NCDHGVAVVGFG-TAEEENGAKYWLIKNSWGETWGESGYIRILRDA-GL 333
           G+  + DC + + DHGV VVG+G    + N +K+W++KNSWG  WG +GY+++ +D    
Sbjct: 263 GIYYDPDCSSKDLDHGVLVVGYGFEGTDSNSSKFWIVKNSWGPEWGWNGYVKMAKDQNNH 322

Query: 334 CGIATAASYPV 344
           CGI+TAASYP 
Sbjct: 323 CGISTAASYPT 333


>gi|158268255|gb|ABW25047.1| cathepsin L-like protease [Strongylus vulgaris]
          Length = 354

 Score =  255 bits (651), Expect = 3e-65,   Method: Compositional matrix adjust.
 Identities = 157/359 (43%), Positives = 209/359 (58%), Gaps = 33/359 (9%)

Query: 13  MFVIIILVITCASQVVS----GRSMH----------EPSIVEKHEQW---MAQHGRTY-K 54
           MF ++ LV+ CAS   S     R  H             I E  + W       G++Y K
Sbjct: 1   MFRLLSLVLLCASVFASIDSGSRRDHTIRLHRVKSLRQKIDEAFKLWDDYKEAFGKSYNK 60

Query: 55  DELEKAMRLNIFKQNLEYIEKANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVP 111
           DE    M    F +N+ +I++ N+E   G +T+++G N  +DL   ++R L    +R   
Sbjct: 61  DEENDYME--AFVKNVIHIDEHNQEHRLGRKTFEMGLNSIADLPFSQYRKLNGYRHRRNF 118

Query: 112 SVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQIT 171
             S QS+        NV ++P S+DWR+KG VT +K+QG CGSCWAFSA  A+EG     
Sbjct: 119 GDSMQSNGTKWLAPFNV-EIPDSVDWRDKGLVTDVKNQGMCGSCWAFSATGALEGQHARA 177

Query: 172 RGKLIELSEQQLVDCSTD--NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQ 229
            GK++ LSEQ LVDCST   NHGC+GGLMD AFEYI +N G+ TE  YPY   E  C + 
Sbjct: 178 SGKMVSLSEQNLVDCSTKYGNHGCNGGLMDLAFEYIKDNHGIDTEESYPYVGRETKC-HF 236

Query: 230 KEKAVAATISKYEDLPKGDEQALLQAVSNQ-PVSVCVDASGRAFHFYKSGV-LNADCGN- 286
           K+K + A    + DLP+GDE+AL  AV+ Q P+S+ +DA  R F  YK GV  + +C + 
Sbjct: 237 KKKDIGAEDKGFVDLPEGDEEALKVAVATQGPISIAIDAGHRTFQLYKKGVYYDEECSSE 296

Query: 287 NCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD-AGLCGIATAASYPV 344
             DHGV +VG+GT  E     YWLIKNSWG  WGE GYIRI R+ +  CG+AT ASYP+
Sbjct: 297 ELDHGVLLVGYGTDPE--AGDYWLIKNSWGPGWGEKGYIRIARNRSNHCGVATKASYPL 353


>gi|226821421|gb|ACO82386.1| cathepsin L-like protein [Lutjanus argentimaculatus]
          Length = 301

 Score =  255 bits (651), Expect = 3e-65,   Method: Compositional matrix adjust.
 Identities = 138/299 (46%), Positives = 188/299 (62%), Gaps = 14/299 (4%)

Query: 56  ELEKAMRLNIFKQNLEYIEKANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPS 112
           E E+  R  ++++NL+ IE  N E   G  +Y+LG N F D+T+EEFR +  GY R    
Sbjct: 6   EKEEGWRRMVWEKNLKKIEMHNLEHSMGTHSYRLGMNHFGDMTHEEFRQIMNGYKRK--- 62

Query: 113 VSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITR 172
             ++    S F   N  + P ++DWR+ G VT +KDQGQCGSCWAFS   A+EG      
Sbjct: 63  -PQRKFTGSLFMEPNFLEAPRAVDWRDNGYVTPVKDQGQCGSCWAFSTTGALEGQHFRKT 121

Query: 173 GKLIELSEQQLVDCSTD--NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQK 230
           GKL+ LSEQ LVDCS    N GC+GGLMD+AF+YI +N+GL +E  YPY   +    +  
Sbjct: 122 GKLVSLSEQNLVDCSRPEGNEGCNGGLMDQAFQYIKDNQGLDSEDSYPYLGTDDQPCHYD 181

Query: 231 EKAVAATISKYEDLPKGDEQALLQAVSN-QPVSVCVDASGRAFHFYKSGV-LNADCGN-N 287
            K  +A  + + D+P G E+AL++AV+   PVSV +DA   +F FY+SG+    DC +  
Sbjct: 182 PKYNSANDTGFVDIPSGKERALMKAVAAVGPVSVAIDAGHESFQFYQSGIYYEKDCSSEE 241

Query: 288 CDHGVAVVGFG-TAEEENGAKYWLIKNSWGETWGESGYIRILRD-AGLCGIATAASYPV 344
            DHGV VVG+G   E+ +G KYW++KNSW E WG+ GYI + +D    CGIATAASYP+
Sbjct: 242 LDHGVLVVGYGFEGEDVDGKKYWIVKNSWSEKWGDKGYIYMAKDRKNHCGIATAASYPL 300


>gi|33348834|gb|AAQ16117.1| cathepsin L-like cysteine proteinase A [Rhipicephalus
           haemaphysaloides haemaphysaloides]
          Length = 332

 Score =  255 bits (651), Expect = 3e-65,   Method: Compositional matrix adjust.
 Identities = 145/341 (42%), Positives = 205/341 (60%), Gaps = 22/341 (6%)

Query: 16  IIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEK 75
           ++ L + CA   V+  +     +  + E +   H ++Y+  +E+ +R  IF +N   I K
Sbjct: 1   MLRLSLLCAIVAVTVAANSHEILRTQWEAFKTTHKKSYESHMEELLRFKIFTENSLIIAK 60

Query: 76  ANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTF-KYQNVTD- 130
            N +   G  +YKLG N+F DL   EF  ++ GY        +++SR STF    NV D 
Sbjct: 61  HNAKYAKGLVSYKLGMNQFGDLLAHEFAKIFNGYR------GQRTSRGSTFMPPANVNDS 114

Query: 131 -VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD 189
            +P+++DWR+KGAVT +KDQGQCGSCWAFSA  ++EG   +  G+L+ LSEQ LVDCS  
Sbjct: 115 SLPSTVDWRKKGAVTPVKDQGQCGSCWAFSATGSLEGQHFLKDGELVSLSEQNLVDCSQS 174

Query: 190 --NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKG 247
             N+GC GGLMD AF+YI  N G+  E  YPY   +  C  +KE  V AT + + D+  G
Sbjct: 175 FGNNGCEGGLMDNAFKYIKANDGIDAEESYPYEAMDDKCRFKKED-VGATDTGFVDIEGG 233

Query: 248 DEQALLQAVSN-QPVSVCVDASGRAFHFYKSGVLN-ADCGN-NCDHGVAVVGFGTAEEEN 304
            E  L +AV+   P+SV +DA   +F  Y  GV +  +C +   DHGV  VG+G    ++
Sbjct: 234 SEDDLKKAVATVGPISVAIDAGHSSFQLYSEGVYDEPECSSEELDHGVLAVGYGV---KD 290

Query: 305 GAKYWLIKNSWGETWGESGYIRILRDA-GLCGIATAASYPV 344
           G KYWL+KNSWG +WG++GYI + RD    CGIA+AASYP+
Sbjct: 291 GKKYWLVKNSWGGSWGDNGYILMSRDKNNQCGIASAASYPL 331


>gi|357216861|gb|AET71138.1| cysteine peptidase isoform b [Sphenophorus levis]
          Length = 324

 Score =  254 bits (650), Expect = 3e-65,   Method: Compositional matrix adjust.
 Identities = 143/345 (41%), Positives = 199/345 (57%), Gaps = 30/345 (8%)

Query: 7   KSFIIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIF 66
           KSFI+      +LV+  ++ ++     H        + +  +HG+TYK++ E+  R  IF
Sbjct: 2   KSFILAS----LLVVAVSATLLKEDGAH-------FQSFKLKHGKTYKNQAEETKRFAIF 50

Query: 67  KQNLEYIEKAN---KEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTF 123
           ++NL  IE  N   K+G  +Y  G N+F+D+T  EF+A+     +  PS+        TF
Sbjct: 51  RENLRKIEAHNAEYKQGIHSYTQGINKFADMTRAEFKAMLATQVKTKPSIVATK----TF 106

Query: 124 KYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQL 183
           +  +   VP SIDWR +  VT IKDQ QCGSCWAF+ V + EG   ++ GKL   SEQQL
Sbjct: 107 QLADGVSVPESIDWRSRNVVTPIKDQAQCGSCWAFAVVGSTEGAYALSTGKLTRFSEQQL 166

Query: 184 VDCSTD-NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYE 242
           VDC+TD N+GC GG +D  F YI  N GL  E+DYPY   +G C  +  K V   +S Y 
Sbjct: 167 VDCTTDLNYGCDGGYLDDTFPYIQTN-GLELESDYPYTGYDGYCSYESSK-VVTKVSSYV 224

Query: 243 DLPKGDEQALLQAVSNQ-PVSVCVDASGRAFHFYKSGVLNADCGNN--CDHGVAVVGFGT 299
            +P  +EQALL+AV    PV++ ++A    F+F  SG+++    +    DHGV  VG+  
Sbjct: 225 SVP-ANEQALLEAVGTAGPVAIAINADDLQFYF--SGIIDDKYCDPEYLDHGVLAVGY-- 279

Query: 300 AEEENGAKYWLIKNSWGETWGESGYIRILRDAGLCGIATAASYPV 344
            + ENG  YWLIKNSWG  WGESGY R LR   +CG+   A YP+
Sbjct: 280 -DSENGRDYWLIKNSWGADWGESGYFRFLRGQNICGVKEDAVYPL 323


>gi|401397136|ref|XP_003879989.1| cathepsin L, related [Neospora caninum Liverpool]
 gi|325114397|emb|CBZ49954.1| cathepsin L, related [Neospora caninum Liverpool]
          Length = 415

 Score =  254 bits (650), Expect = 3e-65,   Method: Compositional matrix adjust.
 Identities = 138/353 (39%), Positives = 198/353 (56%), Gaps = 29/353 (8%)

Query: 5   FEKSFIIPMFVIIILVITCASQVV-----SGRSMHEPSIVEKHE------QWMAQH---- 49
           F+    I +    + ++  AS ++       R +  PS VE H+      +W  +H    
Sbjct: 59  FKTRAWIALVAAAVSLLVFASFLIQWQGDDDRGVFPPSPVEDHKTPVNIWEWKEEHFQNA 118

Query: 50  --------GRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRA 101
                   G++Y  E E   R  IFK NL YI   N++G  +Y L  N F DL+ EEFR 
Sbjct: 119 FGSFRATYGKSYATEEETQKRYAIFKNNLAYIHTHNQQG-YSYSLKMNHFGDLSREEFRR 177

Query: 102 LYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAV 161
            Y GYN+     S      +     + +DVP+++DWREKG VT +KDQ  CGSCWAFSA 
Sbjct: 178 KYLGYNKSRNLKSNNLGVATELLKVSPSDVPSAVDWREKGCVTPVKDQRDCGSCWAFSAT 237

Query: 162 AAVEGITQITRGKLIELSEQQLVDCST--DNHGCSGGLMDKAFEYIIENKGLATEADYPY 219
            A+EG      G+L+ LSEQ+LVDCS    N GCSGG M+ AF+Y++++ GL +E  YPY
Sbjct: 238 GALEGAHCAKTGELLSLSEQELVDCSLAEGNQGCSGGEMNDAFQYVVDSGGLCSEEGYPY 297

Query: 220 RHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGV 279
              +G C    +K V  TIS ++D+P+  E A+  A+++ PVS+ ++A    F FY  GV
Sbjct: 298 LARDGECKRACKKVV--TISGFKDVPRKSETAMKAALAHSPVSIAIEADQLPFQFYHEGV 355

Query: 280 LNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRDAG 332
            +A CG + DHGV +VG+GT ++E    +W++KNSWG  WG  GY+ +    G
Sbjct: 356 FDASCGTDLDHGVLLVGYGT-DKETKKDFWIMKNSWGSGWGRDGYMYMAMHKG 407


>gi|255635645|gb|ACU18172.1| unknown [Glycine max]
          Length = 355

 Score =  254 bits (650), Expect = 3e-65,   Method: Compositional matrix adjust.
 Identities = 133/345 (38%), Positives = 198/345 (57%), Gaps = 16/345 (4%)

Query: 10  IIPMFVIIILVITCASQVVSGRSMH--------EPSIVEKHEQWMAQHGRTYKDELEKAM 61
           I+ +F++  +       ++S  + H        +  ++   E+W+ +H + Y    EK  
Sbjct: 5   IVLLFMVFAVSSALDMSIISHDNAHADRATRRTDDEVMSMFEEWLVKHDKVYNALGEKEK 64

Query: 62  RLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPS 121
           R  IFK NL +I++ N   NRTYKLG N F+DLTN E+RA+Y       P +   +   +
Sbjct: 65  RFQIFKNNLRFIDERNSL-NRTYKLGLNVFADLTNAEYRAMYLRTWDDGPRLDLDTPPRN 123

Query: 122 TFKYQNVTDVPTSIDWREKGAVTHIKDQG-QCGSCWAFSAVAAVEGITQITRGKLIELSE 180
            +  +    +P S+DWR++GAVT +K+QG  C SCWAF+AV AVE + +I  G LI LSE
Sbjct: 124 RYVPRVGDTIPKSVDWRKEGAVTPVKNQGATCNSCWAFTAVGAVESLVKIKTGDLISLSE 183

Query: 181 QQLVDCST-DNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATIS 239
           Q++VDC+T  + GC GG +   + YI +N G++ E DYPYR +EG CD+ K+ A+  TI 
Sbjct: 184 QEVVDCTTSSSRGCGGGDIQHGYIYIRKN-GISLEKDYPYRGDEGKCDSNKKNAI-VTID 241

Query: 240 KYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGT 299
            +  +P   E+AL Q ++NQPV+V + A    F +Y SGV    CG   +H + +VG+G 
Sbjct: 242 GHGWVPTQLEEALKQGIANQPVAVPIPADDYEFQYYTSGVFKGKCGTELNHALLLVGYGA 301

Query: 300 AEEENGAKYWLIKNSWGETWGESGYIRILRDAGLCGIATAASYPV 344
              E    YW+ KNS+ + WGE+GYIRI R    C       YP+
Sbjct: 302 ---EKDGDYWIAKNSYSDKWGENGYIRIQRKLSTCKFGNGGYYPI 343


>gi|402770511|gb|AFQ98390.1| cathepsin L [Rhipicephalus microplus]
 gi|402770513|gb|AFQ98391.1| cathepsin L [Rhipicephalus microplus]
          Length = 332

 Score =  254 bits (650), Expect = 3e-65,   Method: Compositional matrix adjust.
 Identities = 144/340 (42%), Positives = 204/340 (60%), Gaps = 20/340 (5%)

Query: 16  IIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEK 75
           ++ L + CA   V+  +  +  +  + E +   H +TY+  +E+ +R  IF +N   I K
Sbjct: 1   MLRLSVLCAIVAVTVAASSQEILRTQWEAFKTTHKKTYQSHMEELLRFKIFTENSLIIAK 60

Query: 76  ANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTD-- 130
            N +   G  +YKLG N+F DL   EF  ++ G++      +R++   S     NV D  
Sbjct: 61  HNAKYAKGLVSYKLGMNQFGDLLAHEFARIFNGHHG-----TRKTGGSSFLPPANVNDSS 115

Query: 131 VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD- 189
           +P  +DWR+KGAVT +KDQGQCGSCWAFSA  ++EG   +  G+L+ LSEQ LVDCS   
Sbjct: 116 LPKVVDWRKKGAVTPVKDQGQCGSCWAFSATGSLEGQHFLKNGELVSLSEQNLVDCSQSF 175

Query: 190 -NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGD 248
            N+GC GGLM+ AF+YI  N G+ TE  YPY   +G C  +KE  V AT + Y ++  G 
Sbjct: 176 GNNGCEGGLMEDAFKYIKANDGIDTEKSYPYEAVDGECRFKKED-VGATDTGYVEIKAGS 234

Query: 249 EQALLQAVSN-QPVSVCVDASGRAFHFYKSGVLN-ADCGN-NCDHGVAVVGFGTAEEENG 305
           E  L +AV+   P+SV +DAS  +F  Y  GV +  +C + + DHGV VVG+G    + G
Sbjct: 235 EVDLKKAVATVGPISVAIDASHSSFQLYSEGVYDEPECSSEDLDHGVLVVGYGV---KGG 291

Query: 306 AKYWLIKNSWGETWGESGYIRILRD-AGLCGIATAASYPV 344
            KYWL+KNSW E+WG+ GYI + RD    CGIA+ ASYP+
Sbjct: 292 KKYWLVKNSWAESWGDQGYILMSRDNNNQCGIASQASYPL 331


>gi|261289779|ref|XP_002611751.1| hypothetical protein BRAFLDRAFT_284345 [Branchiostoma floridae]
 gi|229297123|gb|EEN67761.1| hypothetical protein BRAFLDRAFT_284345 [Branchiostoma floridae]
          Length = 330

 Score =  254 bits (650), Expect = 3e-65,   Method: Compositional matrix adjust.
 Identities = 147/342 (42%), Positives = 205/342 (59%), Gaps = 29/342 (8%)

Query: 17  IILVITCASQVVSGRSMHEPSIVEKHEQWMA---QHGRTYKDELEKAMRLNIFKQNLEYI 73
           +++++ C +   +     E        QW A   +H + Y ++ E A RL IF+ NL+ I
Sbjct: 3   LLVLLACVAMATAASLSFES-------QWEAFKIKHDKVYSEKEEYARRL-IFQDNLKTI 54

Query: 74  EKANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTD 130
           E  N+E   G  +Y LG N+F+D+T+ E+     G      ++++  SR +T++Y     
Sbjct: 55  ESHNQEADTGKHSYWLGVNQFADMTHAEYLNQVIGGCLITSNLTKTGSR-ATYRYMPNMQ 113

Query: 131 VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD- 189
           V  ++DWR+KG VT IKDQGQCGSCWAFS   ++EG      G L+ LSEQ LVDCS   
Sbjct: 114 VNDTVDWRDKGLVTDIKDQGQCGSCWAFSTTGSLEGQHAKATGTLVSLSEQNLVDCSRQE 173

Query: 190 -NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTC--DNQKEKAVAATISKYEDLPK 246
            N GC GG MD+ F+YII+NKG+ TE  YPY+ +   C  DN     + AT+S + D+  
Sbjct: 174 GNKGCEGGDMDQGFQYIIQNKGIDTEQCYPYKAKNHRCKFDN---SCIGATMSSFTDVTS 230

Query: 247 GDEQALLQAVSN-QPVSVCVDASGRAFHFYKSGVLNA-DCGNN-CDHGVAVVGFGTAEEE 303
           GDE AL QA +N  P+SV +DAS ++F FY SGV N  +C +   DHGV VVG+GT   +
Sbjct: 231 GDEDALKQACANIGPISVGIDASHQSFQFYSSGVYNEFECSSTKLDHGVLVVGYGTYGSK 290

Query: 304 NGAKYWLIKNSWGETWGESGYIRILRDA-GLCGIATAASYPV 344
           +   YWL+KNSWG  WG  GYI + R+    CG+AT AS+PV
Sbjct: 291 D---YWLVKNSWGTVWGNEGYIMMSRNKDNQCGVATDASFPV 329


>gi|402770505|gb|AFQ98387.1| cathepsin L [Rhipicephalus microplus]
          Length = 332

 Score =  254 bits (650), Expect = 4e-65,   Method: Compositional matrix adjust.
 Identities = 143/340 (42%), Positives = 204/340 (60%), Gaps = 20/340 (5%)

Query: 16  IIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEK 75
           ++ L + CA   V+  +  +  +  + E +   H +TY+  +E+ +R  IF +N   I K
Sbjct: 1   MLRLSVLCAIVAVTVAASSQEILRTQWEAFKTTHKKTYQSHMEELLRFKIFTENSLIIAK 60

Query: 76  ANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTD-- 130
            N +   G  +YKLG N+F DL   EF  ++ G+       +R++   +     NV D  
Sbjct: 61  HNAKYAKGLVSYKLGMNQFGDLLAHEFARIFNGHRG-----TRKTGGSTFLPPANVNDSS 115

Query: 131 VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD- 189
           +P ++DWR+KGAVT +KDQGQCGSCWAFSA  ++EG   +  G+L+ LSEQ LVDCS   
Sbjct: 116 LPKAVDWRKKGAVTPVKDQGQCGSCWAFSATGSLEGQHFLKNGELVSLSEQNLVDCSQSF 175

Query: 190 -NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGD 248
            N+GC GGLM+ AF+YI  N G+ TE  YPY   +G C  +KE  V AT + Y ++  G 
Sbjct: 176 GNNGCEGGLMEDAFKYIKANDGIDTEKSYPYEAVDGECRFKKED-VGATDTGYVEIKAGS 234

Query: 249 EQALLQAVSN-QPVSVCVDASGRAFHFYKSGVLN-ADCGN-NCDHGVAVVGFGTAEEENG 305
           E  L +AV+   P+SV +DAS  +F  Y  GV +  +C + + DHGV VVG+G    + G
Sbjct: 235 EVDLKKAVATVGPISVAIDASHSSFQLYSEGVYDEPECSSEDLDHGVLVVGYGV---KGG 291

Query: 306 AKYWLIKNSWGETWGESGYIRILRD-AGLCGIATAASYPV 344
            KYWL+KNSW E+WG+ GYI + RD    CGIA+ ASYP+
Sbjct: 292 KKYWLVKNSWAESWGDQGYILMSRDNNNQCGIASQASYPL 331


>gi|225706086|gb|ACO08889.1| Cathepsin S precursor [Osmerus mordax]
          Length = 333

 Score =  254 bits (649), Expect = 4e-65,   Method: Compositional matrix adjust.
 Identities = 136/310 (43%), Positives = 191/310 (61%), Gaps = 16/310 (5%)

Query: 43  EQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKE---GNRTYKLGTNEFSDLTNEEF 99
           + W  QHG+ YK E+E+  R  ++++NL+ I   N E   G  TY LG N   D+T EE 
Sbjct: 31  QMWKKQHGKNYKTEVEELGRREVWERNLQLISLHNLEASMGMHTYDLGMNHMGDMTEEEI 90

Query: 100 RALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFS 159
              +     P   + R+   PS F   + T VP ++DWR+KG VT +K+QG CGSCWAFS
Sbjct: 91  LQSFASLKVPA-DLKRE---PSAFVASSGTPVPDTVDWRQKGYVTQVKNQGSCGSCWAFS 146

Query: 160 AVAAVEGITQITRGKLIELSEQQLVDCSTD--NHGCSGGLMDKAFEYIIENKGLATEADY 217
           +V A+EG    T GKL++LS Q LVDCS+   N GC+GG M +AF+Y+I+NKG+ ++  Y
Sbjct: 147 SVGALEGQLMRTTGKLLDLSPQNLVDCSSKYGNKGCNGGFMSEAFQYVIDNKGIDSDTSY 206

Query: 218 PYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSN-QPVSVCVDASGRAFHFYK 276
           PY+  +GTC +      +A  ++Y  LP+GDE  L QAV+   P+SV +DA+  +F  ++
Sbjct: 207 PYQGVQGTC-HYNPSYRSANCTRYSFLPEGDETTLKQAVAMIGPISVAIDATRPSFILWR 265

Query: 277 SGVLN-ADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRDA-GLC 334
           SGV N   C    +H V VVG+GT +   G  YWL+KNSWG  +GE+GYIR+ R+    C
Sbjct: 266 SGVYNDLTCTQKINHAVLVVGYGTLD---GQDYWLVKNSWGTRFGENGYIRMSRNRNNQC 322

Query: 335 GIATAASYPV 344
           GIA    YP+
Sbjct: 323 GIALYGCYPI 332


>gi|18202415|sp|P82474.1|CPGP2_ZINOF RecName: Full=Zingipain-2; AltName: Full=Cysteine proteinase GP-II
 gi|6137410|pdb|1CQD|A Chain A, The 2.1 Angstrom Structure Of A Cysteine Protease With
           Proline Specificity From Ginger Rhizome, Zingiber
           Officinale
 gi|6137411|pdb|1CQD|B Chain B, The 2.1 Angstrom Structure Of A Cysteine Protease With
           Proline Specificity From Ginger Rhizome, Zingiber
           Officinale
 gi|6137412|pdb|1CQD|C Chain C, The 2.1 Angstrom Structure Of A Cysteine Protease With
           Proline Specificity From Ginger Rhizome, Zingiber
           Officinale
 gi|6137413|pdb|1CQD|D Chain D, The 2.1 Angstrom Structure Of A Cysteine Protease With
           Proline Specificity From Ginger Rhizome, Zingiber
           Officinale
          Length = 221

 Score =  254 bits (649), Expect = 4e-65,   Method: Compositional matrix adjust.
 Identities = 121/219 (55%), Positives = 153/219 (69%), Gaps = 8/219 (3%)

Query: 130 DVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD 189
           D+P SIDWRE GAV  +K+QG CGSCWAFS VAAVEGI QI  G LI LSEQQLVDC+T 
Sbjct: 2   DLPDSIDWRENGAVVPVKNQGGCGSCWAFSTVAAVEGINQIVTGDLISLSEQQLVDCTTA 61

Query: 190 NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDE 249
           NHGC GG M+ AF++I+ N G+ +E  YPYR ++G C N    A   +I  YE++P  +E
Sbjct: 62  NHGCRGGWMNPAFQFIVNNGGINSEETYPYRGQDGIC-NSTVNAPVVSIDSYENVPSHNE 120

Query: 250 QALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYW 309
           Q+L +AV+NQPVSV +DA+GR F  Y+SG+    C  + +H + VVG+GT   EN   +W
Sbjct: 121 QSLQKAVANQPVSVTMDAAGRDFQLYRSGIFTGSCNISANHALTVVGYGT---ENDKDFW 177

Query: 310 LIKNSWGETWGESGYIRILRDA----GLCGIATAASYPV 344
           ++KNSWG+ WGESGYIR  R+     G CGI   ASYPV
Sbjct: 178 IVKNSWGKNWGESGYIRAERNIENPDGKCGITRFASYPV 216


>gi|226821425|gb|ACO82388.1| cathepsin S [Lutjanus argentimaculatus]
          Length = 337

 Score =  254 bits (649), Expect = 4e-65,   Method: Compositional matrix adjust.
 Identities = 140/321 (43%), Positives = 190/321 (59%), Gaps = 16/321 (4%)

Query: 32  SMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKE---GNRTYKLGT 88
           +M E  +    + W   H + Y++E+E+  R  ++++NL  I   N E   G  TY+LG 
Sbjct: 24  AMFESRLDAHWDLWKKTHEKKYQNEVEEFSRRRLWEKNLMLITMHNLEASMGLHTYELGM 83

Query: 89  NEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKD 148
           N   D+T EE   ++  +    P    Q + PS F   +  D+P ++DWREKG VT +K 
Sbjct: 84  NHMGDMTPEE---IWQSFATLTPPTDIQRA-PSPFAGSSGADIPDTMDWREKGCVTSVKT 139

Query: 149 QGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD--NHGCSGGLMDKAFEYII 206
           QG CGSCWAFSAV A+EG      GKL++LS Q LVDCST   NHGC+GG MD AF+Y+I
Sbjct: 140 QGSCGSCWAFSAVGALEGQLAKKTGKLVDLSPQNLVDCSTKYGNHGCNGGFMDHAFQYVI 199

Query: 207 ENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSN-QPVSVCV 265
           +N+G+ ++A YPY      C +      AA  S Y  LP+GDE AL QA++   P+SV +
Sbjct: 200 DNQGIDSDASYPYTGRSDQC-HYNPSYRAANCSSYNFLPEGDEGALKQALATIGPISVAI 258

Query: 266 DASGRAFHFYKSGVLN-ADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGY 324
           DA+   F FY+SGV N   C    +HGV  VG+GT    NG  YWL+KNSWG  +G+ GY
Sbjct: 259 DATRPRFIFYRSGVYNDPSCSQEVNHGVLAVGYGTL---NGQDYWLVKNSWGTKFGDQGY 315

Query: 325 IRILRDAG-LCGIATAASYPV 344
           IR+ R+    CGIA    YP+
Sbjct: 316 IRMARNQNDQCGIAMYGCYPI 336


>gi|348525618|ref|XP_003450319.1| PREDICTED: cathepsin S-like [Oreochromis niloticus]
          Length = 330

 Score =  254 bits (649), Expect = 4e-65,   Method: Compositional matrix adjust.
 Identities = 139/344 (40%), Positives = 202/344 (58%), Gaps = 24/344 (6%)

Query: 10  IIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQN 69
           ++P+  +++L  +   +        E S+  + E+W + H R Y    E+ +R  I+++N
Sbjct: 1   MLPLVCVLLLATSALGR------FDESSLDAQWEEWKSTHRREYNGLGEEGIRRAIWEKN 54

Query: 70  LEYIEKANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQ 126
           +  IE  N+E   G  ++++G N   D+T+EE     TG   P+        R  T    
Sbjct: 55  MRMIEAHNEEAALGIHSFEMGMNHLGDMTSEEVVEKMTGLQIPM-----NQERSFTLAMD 109

Query: 127 NV-TDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVD 185
           ++ + +P S+D+R+KG VT +K+QG CGSCWAFSA  A+EG    + GKL++LS Q LVD
Sbjct: 110 DMPSKIPKSVDYRKKGMVTSVKNQGACGSCWAFSAAGALEGQLAKSTGKLVDLSPQNLVD 169

Query: 186 CSTD--NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYED 243
           CS    NHGC+GG M +AF+Y+I+N G+ ++A YPY   +  C        AA  S Y+ 
Sbjct: 170 CSGKYGNHGCNGGFMTRAFQYVIDNHGIDSDASYPYTGRDEQC-RYNPATRAANCSSYQF 228

Query: 244 LPKGDEQALLQAVSN-QPVSVCVDASGRAFHFYKSGVLN-ADCGNNCDHGVAVVGFGTAE 301
           LP+GDE AL QA++   P+SV +DA    F FY+SGV N   C    +HGV  VG+G+  
Sbjct: 229 LPEGDENALKQALATIGPISVAIDARRPRFSFYRSGVYNDPSCTQEVNHGVLAVGYGSL- 287

Query: 302 EENGAKYWLIKNSWGETWGESGYIRILRDAG-LCGIATAASYPV 344
             NG  YWL+KNSWG T+G+ GYIR+ R+ G  CGIA  A YPV
Sbjct: 288 --NGQDYWLVKNSWGSTFGDQGYIRMARNTGNQCGIALYACYPV 329


>gi|1483570|emb|CAA68066.1| cathepsin l [Litopenaeus vannamei]
          Length = 328

 Score =  254 bits (649), Expect = 4e-65,   Method: Compositional matrix adjust.
 Identities = 144/320 (45%), Positives = 192/320 (60%), Gaps = 21/320 (6%)

Query: 36  PSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKAN---KEGNRTYKLGTNEFS 92
           PS+ ++   + A+HGR Y    E+  RL++F+QN ++I+  N   + G  T+ L  N+F 
Sbjct: 18  PSLRQQWRDFKAEHGRRYASVQEERYRLSVFEQNQQFIDDHNARFENGEVTFTLQMNQFG 77

Query: 93  DLTNEEFRALYTGY-NRPVPSVSRQSSRPSTFKYQNVTD-VPTSIDWREKGAVTHIKDQG 150
           D+T+EEF A   G+ N P       S RP+     +  + +P  +DWR KGAVT +KDQ 
Sbjct: 78  DMTSEEFTATMNGFLNVP-------SRRPTAILRADPDETLPKEVDWRTKGAVTPVKDQK 130

Query: 151 QCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD--NHGCSGGLMDKAFEYIIEN 208
           QCGSCWAFS   ++EG   +  GKL+ LSEQ LVDCS    N GC GGLMD+AF YI  N
Sbjct: 131 QCGSCWAFSTTGSLEGQHFLKDGKLVSLSEQNLVDCSDKFGNMGCMGGLMDQAFRYIKAN 190

Query: 209 KGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSN-QPVSVCVDA 267
           KG+ TE  YPY  ++G C       V AT + Y D+  G E AL +AV+   P+SV +DA
Sbjct: 191 KGIDTEDSYPYEAQDGKCRFDASN-VGATDTGYVDVEHGSESALKKAVATIGPISVAIDA 249

Query: 268 SGRAFHFYKSGVLNAD-CGNN-CDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYI 325
           S  +F FY  GV   + C +   DHGV  VG+G  E E G  YWL+KNSW  +WG  GYI
Sbjct: 250 SQPSFQFYHDGVYYEEGCSSTMLDHGVLAVGYG--ETEKGEAYWLVKNSWNTSWGNKGYI 307

Query: 326 RILRD-AGLCGIATAASYPV 344
           ++ RD    CGIA+ ASYP+
Sbjct: 308 QMSRDKKNNCGIASQASYPL 327


>gi|217323618|gb|ACK38176.1| midgut cysteine peptidase, partial [Sphenophorus levis]
          Length = 324

 Score =  254 bits (649), Expect = 4e-65,   Method: Compositional matrix adjust.
 Identities = 142/345 (41%), Positives = 200/345 (57%), Gaps = 30/345 (8%)

Query: 7   KSFIIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIF 66
           KSFI+      +LV+  ++ ++    +H        + +  +HG+TYK++ E+  R  IF
Sbjct: 2   KSFILAS----LLVVAVSATLLKEDGVH-------FQSFKLKHGKTYKNQAEETKRFAIF 50

Query: 67  KQNLEYIEKAN---KEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTF 123
           ++NL  IE  N   K+G  +Y  G N+F+D+T  EF+A+     +  PS+        TF
Sbjct: 51  RENLRKIEAHNAEYKQGIHSYTQGINKFADMTRAEFKAMLATQVKTKPSIVATK----TF 106

Query: 124 KYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQL 183
           +  +   VP SIDWR +  VT IKDQ QCGSCW+F+ V + EG   ++ GKL   SEQQL
Sbjct: 107 QLADGVSVPESIDWRSRNVVTPIKDQAQCGSCWSFAVVGSTEGAYALSTGKLTRFSEQQL 166

Query: 184 VDCSTD-NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYE 242
           VDC+TD N+GC GG +D  F YI  N GL  E+DYPY   +G+C     K V   +S Y 
Sbjct: 167 VDCTTDLNYGCDGGYLDDTFPYIQTN-GLELESDYPYTGYDGSCSYDSSK-VVTKVSSYV 224

Query: 243 DLPKGDEQALLQAVSNQ-PVSVCVDASGRAFHFYKSGVLNADCGNN--CDHGVAVVGFGT 299
            +P  +EQALL+AV    PV++ ++A    F+F  SG+++    +    DHGV  VG+ +
Sbjct: 225 SVP-ANEQALLEAVGTAGPVAIAINADDLQFYF--SGIIDDKYCDPEWLDHGVLAVGYNS 281

Query: 300 AEEENGAKYWLIKNSWGETWGESGYIRILRDAGLCGIATAASYPV 344
              ENG  YWLIKNSWG  WGESGY R LR   +CG+   A YP+
Sbjct: 282 ---ENGLDYWLIKNSWGADWGESGYFRFLRGQNICGVKEDAVYPL 323


>gi|146217394|gb|ABQ10739.1| cathepsin L [Penaeus monodon]
          Length = 341

 Score =  254 bits (649), Expect = 5e-65,   Method: Compositional matrix adjust.
 Identities = 139/319 (43%), Positives = 200/319 (62%), Gaps = 15/319 (4%)

Query: 38  IVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANK---EGNRTYKLGTNEFSDL 94
           ++E+ E +  +H + Y  E+E++ R+ IF +N   I   NK   +G+ TYKL  N++ D+
Sbjct: 25  VLEEWEAFKLEHSKKYDSEVEESFRMKIFTENKHKIANHNKGFAQGHHTYKLSMNKYGDM 84

Query: 95  TNEEFRALYTGY--NRPVPSVSRQSSRPSTF-KYQNVTDVPTSIDWREKGAVTHIKDQGQ 151
            + EF +   G+  N      + ++   +TF +  +   +P ++DWR KGAVT IKDQGQ
Sbjct: 85  LHHEFVSTMNGFRGNHTGGYKNNRAYTGATFIEPDDDVQLPKNVDWRTKGAVTPIKDQGQ 144

Query: 152 CGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD--NHGCSGGLMDKAFEYIIENK 209
           CGSCWAFSA  A+EG T    G+L+ LSEQ LVDCS    N+GC+GGLMD AFEY+ EN 
Sbjct: 145 CGSCWAFSATGALEGQTFRKTGQLVSLSEQNLVDCSRKFGNNGCNGGLMDNAFEYVKENG 204

Query: 210 GLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSN-QPVSVCVDAS 268
           G+ TE  YPY  E+  C +   +A  A    + D+ +G E AL +AV+   PVSV +DAS
Sbjct: 205 GIDTEESYPYDAEDEKC-HYNPRAAGAEDKGFVDVREGSEHALKKAVATVGPVSVAIDAS 263

Query: 269 GRAFHFYKSGV-LNADCGNN-CDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIR 326
             +F FY  GV +  +C     DHGV VVG+G   +++G  YWL+KNSWG TWG+ GY++
Sbjct: 264 HESFQFYSHGVYIEPECSPEMLDHGVLVVGYGI--DDDGTDYWLVKNSWGTTWGDQGYVK 321

Query: 327 ILRDA-GLCGIATAASYPV 344
           + R+    CGIA++AS+P+
Sbjct: 322 MARNRDNQCGIASSASFPL 340


>gi|223673161|gb|ACN12762.1| Cathepsin S precursor [Salmo salar]
          Length = 330

 Score =  254 bits (648), Expect = 5e-65,   Method: Compositional matrix adjust.
 Identities = 139/340 (40%), Positives = 198/340 (58%), Gaps = 20/340 (5%)

Query: 13  MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEY 72
           M   ++L + C + V    ++ +P + +  + W   HG+ Y+ E+E+  R  ++++NL+ 
Sbjct: 2   MLWSLLLAVLCGTAV----ALFDPMLEQHWQMWKKTHGKNYQTEVEELGRREVWERNLQL 57

Query: 73  IEKANKEGN---RTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
           I   N E +    TY LG N   D+T EE    +     P   + R+   PS F   +  
Sbjct: 58  ISLHNLEASMDMHTYDLGMNHMGDMTQEEIAQSFASLLVPA-DLKRE---PSAFAGSSGA 113

Query: 130 DVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD 189
            +P + DWREKG VT +K QG CGSCWAFS+V A+EG    T GKLI+LS Q LVDCS+ 
Sbjct: 114 PIPDTFDWREKGYVTGVKMQGSCGSCWAFSSVGALEGQLMKTTGKLIDLSPQNLVDCSSK 173

Query: 190 --NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKG 247
             N GC GG M KAF+Y+I+N+G+A++  YPY+  +  C     +  AA  S+Y  LP+G
Sbjct: 174 YGNKGCHGGFMTKAFQYVIDNQGIASDQSYPYKGVQQQCIYNPAQR-AANCSRYSFLPEG 232

Query: 248 DEQALLQAVSN-QPVSVCVDASGRAFHFYKSGVLN-ADCGNNCDHGVAVVGFGTAEEENG 305
           DE  L +A++   P+SV +DA+  +F FY+SGV N   C    +H V  VG+GT     G
Sbjct: 233 DEGVLKEALATIGPISVGIDATRPSFAFYRSGVYNDPTCTKKTNHAVLAVGYGTL---GG 289

Query: 306 AKYWLIKNSWGETWGESGYIRILRDA-GLCGIATAASYPV 344
             YWL+KNSWG +WG+ GYIR+ R+    CGIA    YPV
Sbjct: 290 QDYWLVKNSWGLSWGDQGYIRMSRNKDNQCGIALYGCYPV 329


>gi|27806673|ref|NP_776457.1| cathepsin L2 precursor [Bos taurus]
 gi|1542853|emb|CAA62870.1| cathepsin L [Bos taurus]
          Length = 334

 Score =  254 bits (648), Expect = 5e-65,   Method: Compositional matrix adjust.
 Identities = 142/343 (41%), Positives = 203/343 (59%), Gaps = 22/343 (6%)

Query: 12  PMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLE 71
           P F + +L +     V S     +P++     QW A H R Y    E+  R  ++++N +
Sbjct: 3   PSFFLTVLCLG----VASAAPKLDPNLDAHWHQWKATHRRLYGMN-EEEWRRAVWEKNKK 57

Query: 72  YIEKANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNV 128
            I+  N+E   G   +++  N F D+TNEEFR +  G+       +++  +   F    +
Sbjct: 58  IIDLHNQEYSEGKHAFRMAMNAFGDMTNEEFRQVMNGFQ------NQKHKKGKLFHEPLL 111

Query: 129 TDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCS- 187
            DVP S+DW +KG VT +K+QGQCGSCWAFSA  A+EG      GKL+ LSEQ LVDCS 
Sbjct: 112 VDVPKSVDWTKKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSR 171

Query: 188 -TDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPK 246
              N GC+GGLMD AF+YI +N GL +E  YPY   +    N K +  AA  + + D+P+
Sbjct: 172 AQGNQGCNGGLMDNAFQYIKDNGGLDSEESYPYLATDTNSCNYKPECSAANDTGFVDIPQ 231

Query: 247 GDEQALLQAVSN-QPVSVCVDASGRAFHFYKSGV-LNADCG-NNCDHGVAVVGFG-TAEE 302
             E+AL++AV+   P+SV +DA   +F FYKSG+  + DC   + DHGV VVG+G    +
Sbjct: 232 -REKALMKAVATVGPISVAIDAGHTSFQFYKSGIYYDPDCSCKDLDHGVLVVGYGFEGTD 290

Query: 303 ENGAKYWLIKNSWGETWGESGYIRILRDA-GLCGIATAASYPV 344
            N  K+W++KNSWG  WG +GY+++ +D    CGIATAASYP 
Sbjct: 291 SNNNKFWIVKNSWGPEWGWNGYVKMAKDQNNHCGIATAASYPT 333


>gi|357477225|ref|XP_003608898.1| Cysteine proteinase, partial [Medicago truncatula]
 gi|355509953|gb|AES91095.1| Cysteine proteinase, partial [Medicago truncatula]
          Length = 260

 Score =  254 bits (648), Expect = 5e-65,   Method: Compositional matrix adjust.
 Identities = 130/265 (49%), Positives = 169/265 (63%), Gaps = 33/265 (12%)

Query: 89  NEFSDLTNEEFRALY----TGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVT 144
           N+F+D+TN EFR++Y      ++R    +S  +     F Y+NV  VP+SIDWR+ GAVT
Sbjct: 3   NKFADMTNYEFRSIYADSKVNHHRMFRGMSHDNG---PFMYENVEGVPSSIDWRKIGAVT 59

Query: 145 HIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFE 203
            +KDQGQCGSCWAFS + AVEGI QI   KL+ LSEQ+LVDC T+ N GC+GGLM+ AFE
Sbjct: 60  GVKDQGQCGSCWAFSTIVAVEGINQIKTQKLVSLSEQELVDCDTEVNQGCNGGLMEYAFE 119

Query: 204 YIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSV 263
           +I +N G+ TE +YPY  ++GTC+ QKE   A +I  +E++P  +E+ALL+A +NQP+SV
Sbjct: 120 FIKQN-GITTETNYPYAAKDGTCNIQKENKPAVSIDGHENVPANNEKALLKAAANQPISV 178

Query: 264 CVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESG 323
            +DA G  F FY  GV    CG   +HGV                    NSWG  WGE G
Sbjct: 179 AIDAGGSDFQFYSEGVFTGHCGTELNHGV--------------------NSWGSEWGEQG 218

Query: 324 YIRILR----DAGLCGIATAASYPV 344
           YIR+ R      GLCGIA  ASYP+
Sbjct: 219 YIRMQRAISHKQGLCGIAMEASYPI 243


>gi|37994576|gb|AAH60335.1| Unknown (protein for MGC:68554) [Xenopus laevis]
          Length = 335

 Score =  254 bits (648), Expect = 5e-65,   Method: Compositional matrix adjust.
 Identities = 143/342 (41%), Positives = 211/342 (61%), Gaps = 18/342 (5%)

Query: 13  MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEY 72
           M   +I+   C + + +  +  +P++      W   H +TY  + E+  R  ++++NL+ 
Sbjct: 1   MTPYLIIGAICLTTLYAAPAT-DPALDNHWYSWKDWHKKTYAPK-EEGWRRVLWEKNLKM 58

Query: 73  IEKANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
           IE  N +   G  +Y+LG N+F D+TNEEF+ L  GY       +++  R STF   N  
Sbjct: 59  IEFHNLDHSLGKHSYRLGMNQFGDMTNEEFKQLMNGYK------NQKMIRGSTFLAPNNF 112

Query: 130 DVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCS-- 187
           + P S+DWR+KG VT +KDQGQCGSCWAFS   A+EG       KLI LSEQ LVDCS  
Sbjct: 113 EAPKSVDWRKKGYVTPVKDQGQCGSCWAFSTTGALEGQHYRKTSKLISLSEQNLVDCSRA 172

Query: 188 TDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKG 247
             N GC+GGLMD+AF+Y+ +N G+ +E  YPY  ++    +      +A  + + D+  G
Sbjct: 173 QGNEGCNGGLMDQAFQYVKDNGGIDSEDSYPYTAKDDQECHYDPNNNSANDTGFVDVQSG 232

Query: 248 DEQALLQAVSN-QPVSVCVDASGRAFHFYKSGV-LNADCGN-NCDHGVAVVGFG-TAEEE 303
            E+ L++AV++  PVSV +DA  ++F FY+SG+    +C + + DHGV VVG+G  +E+ 
Sbjct: 233 CEKDLMKAVASVGPVSVAIDAGHQSFQFYQSGIYYEPECSSEDLDHGVLVVGYGFESEDV 292

Query: 304 NGAKYWLIKNSWGETWGESGYIRILRDA-GLCGIATAASYPV 344
           +G KYW++KNSW E WG++GYI I +D    CGIATAASYP+
Sbjct: 293 DGKKYWIVKNSWSEKWGDNGYINIAKDRHNHCGIATAASYPL 334


>gi|12847813|dbj|BAB27719.1| unnamed protein product [Mus musculus]
          Length = 334

 Score =  254 bits (648), Expect = 5e-65,   Method: Compositional matrix adjust.
 Identities = 144/311 (46%), Positives = 189/311 (60%), Gaps = 19/311 (6%)

Query: 44  QWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKE---GNRTYKLGTNEFSDLTNEEFR 100
           QW + H R Y    E+  R  I+++N+  I+  N E   G   + +  N F D+TNEEFR
Sbjct: 31  QWKSTHRRLYGTN-EEEWRRAIWEKNMRMIQLHNGEYSNGQHGFSMEMNAFGDMTNEEFR 89

Query: 101 ALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSA 160
            +  GY        R    P   K      +P S+DWREKG VT +K+QGQCGSCWAFSA
Sbjct: 90  QVVNGYRHQKHKKGRLFQEPLMLK------IPKSVDWREKGCVTPVKNQGQCGSCWAFSA 143

Query: 161 VAAVEGITQITRGKLIELSEQQLVDCS--TDNHGCSGGLMDKAFEYIIENKGLATEADYP 218
              +EG   +  GKLI LSEQ LVDCS    N GC+GGLMD AF+YI EN GL +E  YP
Sbjct: 144 SGCLEGQMFLKTGKLISLSEQNLVDCSHAQGNQGCNGGLMDYAFQYIKENGGLDSEESYP 203

Query: 219 YRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSN-QPVSVCVDASGRAFHFYKS 277
           Y  ++G+C  + E AV A  + + D+P+  E+AL++AV+   P+SV +DAS  +  FY S
Sbjct: 204 YEAKDGSCKYRAEFAV-ANDTGFVDIPQ-QEKALMKAVATVGPISVAMDASHPSLQFYSS 261

Query: 278 GV-LNADCGN-NCDHGVAVVGFG-TAEEENGAKYWLIKNSWGETWGESGYIRILRDA-GL 333
           G+    +C + N DHGV +VG+G    + N  KYWL+KNSWG  WG  GYI+I +D    
Sbjct: 262 GIYYEPNCSSKNLDHGVLLVGYGYEGTDSNKNKYWLVKNSWGSEWGMEGYIKIAKDRDNH 321

Query: 334 CGIATAASYPV 344
           CG+ATAASYPV
Sbjct: 322 CGLATAASYPV 332


>gi|293342579|ref|XP_001065885.2| PREDICTED: cathepsin L1 [Rattus norvegicus]
 gi|293354415|ref|XP_225137.5| PREDICTED: cathepsin L1 [Rattus norvegicus]
 gi|149039747|gb|EDL93863.1| rCG24278 [Rattus norvegicus]
          Length = 330

 Score =  254 bits (648), Expect = 5e-65,   Method: Compositional matrix adjust.
 Identities = 140/336 (41%), Positives = 194/336 (57%), Gaps = 18/336 (5%)

Query: 17  IILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKA 76
           I L+ T    ++S    H+PS     E+W  +HG+TY    E+  +  +++ N++ I   
Sbjct: 4   IFLLATLCLGMISAAPTHDPSFDTVWEEWKTKHGKTYNTN-EEGQKRAVWENNMKMINLH 62

Query: 77  NKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPT 133
           N++   G   + L  N F DLTN EFR L TG+    P         + F+   + D+P 
Sbjct: 63  NEDYLKGKHGFSLEMNAFGDLTNTEFRELMTGFQSMGPK------ETTIFREPFLGDIPK 116

Query: 134 SIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD--NH 191
           S+DWRE G VT +K+QGQCGSCWAFSAV ++EG      GKL+ LSEQ LVDCS    N 
Sbjct: 117 SLDWREHGYVTPVKNQGQCGSCWAFSAVGSLEGQIFKKTGKLVSLSEQNLVDCSWSYGNL 176

Query: 192 GCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQA 251
           GC+GGLM+ AF+Y+ EN+GL T   Y Y  ++G C     K  AA ++ +  +P  ++  
Sbjct: 177 GCNGGLMEFAFQYVKENRGLDTGESYAYEAQDGLC-RYNPKYSAANVTGFVKVPLSEDDL 235

Query: 252 LLQAVSNQPVSVCVDASGRAFHFYKSGV-LNADCGNN-CDHGVAVVGFGTAEEENGAKYW 309
           +    S  PVSV +D+  ++F FY  G+    DC +   DH V VVG+G  EE +G KYW
Sbjct: 236 MSAVASVGPVSVGIDSHHQSFRFYSGGMYYEPDCSSTEMDHAVLVVGYG--EESDGGKYW 293

Query: 310 LIKNSWGETWGESGYIRILRDA-GLCGIATAASYPV 344
           L+KNSWGE WG  GYI++ +D    CGIAT A YP 
Sbjct: 294 LVKNSWGEDWGMDGYIKMAKDQNNNCGIATYAIYPT 329


>gi|109940313|sp|P25975.3|CATL1_BOVIN RecName: Full=Cathepsin L1; Contains: RecName: Full=Cathepsin L1
           heavy chain; Contains: RecName: Full=Cathepsin L1 light
           chain; Flags: Precursor
 gi|74354943|gb|AAI02313.1| CTSL2 protein [Bos taurus]
 gi|154425700|gb|AAI51426.1| Cathepsin L2 [Bos taurus]
 gi|296484466|tpg|DAA26581.1| TPA: cathepsin L2 precursor [Bos taurus]
 gi|440898893|gb|ELR50299.1| Cathepsin L1 [Bos grunniens mutus]
          Length = 334

 Score =  254 bits (648), Expect = 6e-65,   Method: Compositional matrix adjust.
 Identities = 142/343 (41%), Positives = 204/343 (59%), Gaps = 22/343 (6%)

Query: 12  PMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLE 71
           P F + +L +     V S     +P++     QW A H R Y    E+  R  ++++N +
Sbjct: 3   PSFFLTVLCLG----VASAAPKLDPNLDAHWHQWKATHRRLYGMN-EEEWRRAVWEKNKK 57

Query: 72  YIEKANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNV 128
            I+  N+E   G   +++  N F D+TNEEFR +  G+       +++  +   F    +
Sbjct: 58  IIDLHNQEYSEGKHGFRMAMNAFGDMTNEEFRQVMNGFQ------NQKHKKGKLFHEPLL 111

Query: 129 TDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCS- 187
            DVP S+DW +KG VT +K+QGQCGSCWAFSA  A+EG      GKL+ LSEQ LVDCS 
Sbjct: 112 VDVPKSVDWTKKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSR 171

Query: 188 -TDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPK 246
              N GC+GGLMD AF+YI +N GL +E  YPY   +    N K +  AA  + + D+P+
Sbjct: 172 AQGNQGCNGGLMDNAFQYIKDNGGLDSEESYPYLATDTNSCNYKPECSAANDTGFVDIPQ 231

Query: 247 GDEQALLQAVSN-QPVSVCVDASGRAFHFYKSGV-LNADCGN-NCDHGVAVVGFG-TAEE 302
             E+AL++AV+   P+SV +DA   +F FYKSG+  + DC + + DHGV VVG+G    +
Sbjct: 232 -REKALMKAVATVGPISVAIDAGHTSFQFYKSGIYYDPDCSSKDLDHGVLVVGYGFEGTD 290

Query: 303 ENGAKYWLIKNSWGETWGESGYIRILRDA-GLCGIATAASYPV 344
            N  K+W++KNSWG  WG +GY+++ +D    CGIATAASYP 
Sbjct: 291 SNNNKFWIVKNSWGPEWGWNGYVKMAKDQNNHCGIATAASYPT 333


>gi|74151179|dbj|BAE27712.1| unnamed protein product [Mus musculus]
          Length = 334

 Score =  254 bits (648), Expect = 6e-65,   Method: Compositional matrix adjust.
 Identities = 144/311 (46%), Positives = 189/311 (60%), Gaps = 19/311 (6%)

Query: 44  QWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKE---GNRTYKLGTNEFSDLTNEEFR 100
           QW + H R Y    E+  R  I+++N+  I+  N E   G   + +  N F D+TNEEFR
Sbjct: 31  QWKSTHRRLYGTN-EEEWRRAIWEKNMRMIQLHNGEYSNGQHGFSMEMNAFGDMTNEEFR 89

Query: 101 ALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSA 160
            +  GY        R    P   K      +P S+DWREKG VT +K+QGQCGSCWAFSA
Sbjct: 90  QVVNGYRHQKHKKGRLFQEPLMLK------IPKSVDWREKGCVTPVKNQGQCGSCWAFSA 143

Query: 161 VAAVEGITQITRGKLIELSEQQLVDCS--TDNHGCSGGLMDKAFEYIIENKGLATEADYP 218
              +EG   +  GKLI LSEQ LVDCS    N GC+GGLMD AF+YI EN GL +E  YP
Sbjct: 144 SGCLEGQMFLKTGKLISLSEQNLVDCSHAQGNQGCNGGLMDFAFQYIKENGGLDSEESYP 203

Query: 219 YRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSN-QPVSVCVDASGRAFHFYKS 277
           Y  ++G+C  + E AV A  + + D+P+  E+AL++AV+   P+SV +DAS  +  FY S
Sbjct: 204 YEAKDGSCKYRAEFAV-ANDTGFVDIPQ-QEEALMKAVATVGPISVAMDASHPSLQFYSS 261

Query: 278 GV-LNADCGN-NCDHGVAVVGFG-TAEEENGAKYWLIKNSWGETWGESGYIRILRDA-GL 333
           G+    +C + N DHGV +VG+G    + N  KYWL+KNSWG  WG  GYI+I +D    
Sbjct: 262 GIYYEPNCSSKNLDHGVLLVGYGYEGTDSNKNKYWLVKNSWGSEWGMEGYIKIAKDRDNH 321

Query: 334 CGIATAASYPV 344
           CG+ATAASYPV
Sbjct: 322 CGLATAASYPV 332


>gi|33348836|gb|AAQ16118.1| cathepsin L-like cysteine proteinase B [Rhipicephalus
           haemaphysaloides haemaphysaloides]
          Length = 335

 Score =  254 bits (648), Expect = 6e-65,   Method: Compositional matrix adjust.
 Identities = 143/337 (42%), Positives = 200/337 (59%), Gaps = 19/337 (5%)

Query: 20  VITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKE 79
           ++ C   V +    H+  +  +   + A HG+ Y  + E+  RL I+ +N   I + N++
Sbjct: 5   IVLCCLFVTAAAITHQELVGAEWSAFKALHGKDYASDTEEYYRLKIYMENRLKIARHNEK 64

Query: 80  GNRT---YKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSS---RPSTFKYQNVTDVPT 133
             ++   YKL  NEF DL + EF +   G+ R      R+ S    P  F+      +P 
Sbjct: 65  YAKSQVSYKLAMNEFGDLLHHEFVSTRNGFKRNYRDSPREGSFFVEPEGFE---DLQLPK 121

Query: 134 SIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD--NH 191
           ++DWR+KGAVT +K+QGQCGSCWAFS   ++EG       KL+ LSEQ LVDCS    N+
Sbjct: 122 TVDWRKKGAVTPVKNQGQCGSCWAFSTTGSLEGPHFRKTRKLVSLSEQNLVDCSRSFGNN 181

Query: 192 GCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQA 251
           GC GGLMD AF+YI  NKG+ TE  YPY   +G C   +   V AT + + D+P+GDE  
Sbjct: 182 GCEGGLMDNAFKYIKSNKGIDTEWSYPYNATDGVCHFNRSD-VGATDTGFVDIPEGDENK 240

Query: 252 LLQAVSN-QPVSVCVDASGRAFHFYKSGVLN-ADCGN-NCDHGVAVVGFGTAEEENGAKY 308
           L +AV+   PVSV +DAS  +F FY  GV +  +C +   DHGV VVG+GT   ++G  Y
Sbjct: 241 LKKAVAAVGPVSVAIDASHESFQFYSEGVYDEPECSSEQLDHGVLVVGYGT---KDGQDY 297

Query: 309 WLIKNSWGETWGESGYIRILRDA-GLCGIATAASYPV 344
           WL+KNSWG TWG+ GYI + R+    CGIA++ASYP+
Sbjct: 298 WLVKNSWGTTWGDEGYIYMTRNKDNQCGIASSASYPL 334


>gi|60827856|gb|AAX36816.1| cathepsin L [synthetic construct]
          Length = 334

 Score =  254 bits (648), Expect = 6e-65,   Method: Compositional matrix adjust.
 Identities = 143/344 (41%), Positives = 205/344 (59%), Gaps = 20/344 (5%)

Query: 13  MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEY 72
           M   +IL   C   + S     + S+  +  +W A H R Y    E+  R  ++++N++ 
Sbjct: 1   MNPTLILAAFCLG-IASATLTFDHSLEAQWTKWKAMHNRLYGMN-EEGWRRAVWEKNMKM 58

Query: 73  IEKAN---KEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
           IE  N   +EG  ++ +  N F D+T+EEFR +  G+       +R+  +   F+     
Sbjct: 59  IELHNQEYREGKHSFTMAMNAFGDMTSEEFRQVMNGFQ------NRKPRKGKVFQEPLFY 112

Query: 130 DVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCS-- 187
           + P S+DWREKG VT +K+QGQCGSCWAFSA  A+EG      G+LI LSEQ LVDCS  
Sbjct: 113 EAPRSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGRLISLSEQNLVDCSGP 172

Query: 188 TDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKG 247
             N GC+GGLMD AF+Y+ +N GL +E  YPY   E +C    + +V A  + + D+PK 
Sbjct: 173 QGNEGCNGGLMDYAFQYVQDNGGLDSEESYPYEATEESCKYNPKYSV-ANDTGFVDIPK- 230

Query: 248 DEQALLQAVSN-QPVSVCVDASGRAFHFYKSGV-LNADCGN-NCDHGVAVVGFG-TAEEE 303
            E+AL++AV+   P+SV +DA   +F FYK G+    DC + + DHGV VVG+G  + E 
Sbjct: 231 QEKALMKAVATVGPISVAIDAGHESFLFYKEGIYFEPDCSSEDMDHGVLVVGYGFESTES 290

Query: 304 NGAKYWLIKNSWGETWGESGYIRILRD-AGLCGIATAASYPVAI 346
           +  KYWL+KNSWGE WG  GY+++ +D    CGIA+AASYP  +
Sbjct: 291 DNNKYWLVKNSWGEEWGMGGYVKMAKDRRNHCGIASAASYPTVL 334


>gi|6753558|ref|NP_034114.1| cathepsin L1 preproprotein [Mus musculus]
 gi|115742|sp|P06797.2|CATL1_MOUSE RecName: Full=Cathepsin L1; AltName: Full=Major excreted protein;
           Short=MEP; AltName: Full=p39 cysteine proteinase;
           Contains: RecName: Full=Cathepsin L1 heavy chain;
           Contains: RecName: Full=Cathepsin L1 light chain; Flags:
           Precursor
 gi|53047|emb|CAA29470.1| unnamed protein product [Mus musculus]
 gi|309186|gb|AAA37445.1| preprocysteine proteinase [Mus musculus]
 gi|12832050|dbj|BAB21945.1| unnamed protein product [Mus musculus]
 gi|26340196|dbj|BAC33761.1| unnamed protein product [Mus musculus]
 gi|45768760|gb|AAH68163.1| Cathepsin L [Mus musculus]
 gi|74139700|dbj|BAE31701.1| unnamed protein product [Mus musculus]
 gi|74146632|dbj|BAE41323.1| unnamed protein product [Mus musculus]
 gi|74151584|dbj|BAE41141.1| unnamed protein product [Mus musculus]
 gi|74185397|dbj|BAE30172.1| unnamed protein product [Mus musculus]
 gi|74197196|dbj|BAE35143.1| unnamed protein product [Mus musculus]
 gi|74203006|dbj|BAE26206.1| unnamed protein product [Mus musculus]
 gi|74219606|dbj|BAE29572.1| unnamed protein product [Mus musculus]
 gi|148684295|gb|EDL16242.1| cathepsin L [Mus musculus]
          Length = 334

 Score =  254 bits (648), Expect = 6e-65,   Method: Compositional matrix adjust.
 Identities = 144/311 (46%), Positives = 189/311 (60%), Gaps = 19/311 (6%)

Query: 44  QWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKE---GNRTYKLGTNEFSDLTNEEFR 100
           QW + H R Y    E+  R  I+++N+  I+  N E   G   + +  N F D+TNEEFR
Sbjct: 31  QWKSTHRRLYGTN-EEEWRRAIWEKNMRMIQLHNGEYSNGQHGFSMEMNAFGDMTNEEFR 89

Query: 101 ALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSA 160
            +  GY        R    P   K      +P S+DWREKG VT +K+QGQCGSCWAFSA
Sbjct: 90  QVVNGYRHQKHKKGRLFQEPLMLK------IPKSVDWREKGCVTPVKNQGQCGSCWAFSA 143

Query: 161 VAAVEGITQITRGKLIELSEQQLVDCS--TDNHGCSGGLMDKAFEYIIENKGLATEADYP 218
              +EG   +  GKLI LSEQ LVDCS    N GC+GGLMD AF+YI EN GL +E  YP
Sbjct: 144 SGCLEGQMFLKTGKLISLSEQNLVDCSHAQGNQGCNGGLMDFAFQYIKENGGLDSEESYP 203

Query: 219 YRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSN-QPVSVCVDASGRAFHFYKS 277
           Y  ++G+C  + E AV A  + + D+P+  E+AL++AV+   P+SV +DAS  +  FY S
Sbjct: 204 YEAKDGSCKYRAEFAV-ANDTGFVDIPQ-QEKALMKAVATVGPISVAMDASHPSLQFYSS 261

Query: 278 GV-LNADCGN-NCDHGVAVVGFG-TAEEENGAKYWLIKNSWGETWGESGYIRILRDA-GL 333
           G+    +C + N DHGV +VG+G    + N  KYWL+KNSWG  WG  GYI+I +D    
Sbjct: 262 GIYYEPNCSSKNLDHGVLLVGYGYEGTDSNKNKYWLVKNSWGSEWGMEGYIKIAKDRDNH 321

Query: 334 CGIATAASYPV 344
           CG+ATAASYPV
Sbjct: 322 CGLATAASYPV 332


>gi|391338876|ref|XP_003743781.1| PREDICTED: cathepsin L-like isoform 4 [Metaseiulus occidentalis]
          Length = 336

 Score =  253 bits (647), Expect = 6e-65,   Method: Compositional matrix adjust.
 Identities = 141/340 (41%), Positives = 203/340 (59%), Gaps = 21/340 (6%)

Query: 14  FVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYI 73
           F+I+ +++  AS  ++   + +     + + +   H + Y+    +A R  IF QN   I
Sbjct: 8   FLILAVLVGAASAALTLEQLFDA----EWQNFKVHHNKKYEGSTVEAFRKKIFLQNTHLI 63

Query: 74  EKAN---KEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTD 130
            + N    +G  TYKL  N+F D+ + EF +   G  R     S ++   ST+       
Sbjct: 64  ARHNIKHAKGETTYKLKMNQFGDMLHHEFVSTMNGLLR-----SNRTYFGSTWIEPESVS 118

Query: 131 VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD- 189
           +P S+DWREKGAVT +K+QG CGSCW+FS   A+EG      G+L+ LSEQ L+DCST  
Sbjct: 119 LPKSVDWREKGAVTPVKNQGHCGSCWSFSTTGALEGQLFRKTGELVSLSEQNLIDCSTSY 178

Query: 190 -NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGD 248
            N+GC GGLMD AF YI EN G+ TE  YPY  ++G C   KE + A   + + D+P G+
Sbjct: 179 GNNGCGGGLMDNAFTYIKENHGIDTEESYPYEGKQGKCRYHKEDS-AGRDTGFVDIPSGN 237

Query: 249 EQALLQAVSN-QPVSVCVDASGRAFHFYKSGVLN-ADC-GNNCDHGVAVVGFGTAEEENG 305
           E+AL +A++   PVSV +DAS  +F FY  GV N  DC  ++ DHGV  VG+GT ++  G
Sbjct: 238 ERALAKALATIGPVSVAIDASHESFQFYHEGVYNPPDCDSHSLDHGVLAVGYGTTDD--G 295

Query: 306 AKYWLIKNSWGETWGESGYIRILRDA-GLCGIATAASYPV 344
             Y++IKNSWGE WG+ GY+ + R++   CG+AT ASYP+
Sbjct: 296 QDYYIIKNSWGERWGQEGYVLMARNSKNECGVATQASYPL 335


>gi|327289213|ref|XP_003229319.1| PREDICTED: cathepsin S-like [Anolis carolinensis]
          Length = 333

 Score =  253 bits (647), Expect = 6e-65,   Method: Compositional matrix adjust.
 Identities = 139/318 (43%), Positives = 202/318 (63%), Gaps = 18/318 (5%)

Query: 37  SIVEKH-EQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKE---GNRTYKLGTNEFS 92
           S+++ H E W  ++ + Y+++ E+ +R  I+++NL ++   N E   G  +Y+LG N   
Sbjct: 23  SMLDGHWELWKKKYNKEYQNKEEEGVRRVIWEKNLRFVMLHNLEQSLGLHSYELGMNHLG 82

Query: 93  DLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQC 152
           D+T+EE  AL TG   PV     QS   + +  +     P ++DWREKG VT++K+QG C
Sbjct: 83  DMTSEEVTALMTGLKIPVS----QSRNSTLYWARQGASAPDTVDWREKGCVTNVKNQGSC 138

Query: 153 GSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST--DNHGCSGGLMDKAFEYIIENKG 210
           GSCWAFSAV A+E   ++  G L+ LS Q LVDCS+   NHGC+GG +  AF+Y+I N G
Sbjct: 139 GSCWAFSAVGALECQLKLKTGNLVSLSPQNLVDCSSAFGNHGCNGGYISAAFQYVIYNNG 198

Query: 211 LATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSN-QPVSVCVDASG 269
           + +EA YPY  + GTC     +  AAT S+Y DLP G+E AL  AV+N  PVSV +DAS 
Sbjct: 199 IDSEASYPYTGQSGTC-RYNLQGRAATCSRYVDLPSGNEAALKDAVANFGPVSVAIDASR 257

Query: 270 RAFHFYKSGVL-NADCGN-NCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRI 327
            +F  ++ GV  +  C + + +HGV VVG+GT   E+G  YWL+KNSWG ++G+ GYI+I
Sbjct: 258 PSFFLFRKGVYDDPSCTSAHINHGVLVVGYGT---EDGIDYWLVKNSWGVSFGDQGYIKI 314

Query: 328 LRDA-GLCGIATAASYPV 344
            R+    CGIA+  +YP+
Sbjct: 315 ARNHDNRCGIASQCTYPL 332


>gi|391338870|ref|XP_003743778.1| PREDICTED: cathepsin L-like isoform 1 [Metaseiulus occidentalis]
 gi|391338872|ref|XP_003743779.1| PREDICTED: cathepsin L-like isoform 2 [Metaseiulus occidentalis]
 gi|391338874|ref|XP_003743780.1| PREDICTED: cathepsin L-like isoform 3 [Metaseiulus occidentalis]
          Length = 331

 Score =  253 bits (647), Expect = 7e-65,   Method: Compositional matrix adjust.
 Identities = 141/340 (41%), Positives = 203/340 (59%), Gaps = 21/340 (6%)

Query: 14  FVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYI 73
           F+I+ +++  AS  ++   + +     + + +   H + Y+    +A R  IF QN   I
Sbjct: 3   FLILAVLVGAASAALTLEQLFDA----EWQNFKVHHNKKYEGSTVEAFRKKIFLQNTHLI 58

Query: 74  EKAN---KEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTD 130
            + N    +G  TYKL  N+F D+ + EF +   G  R     S ++   ST+       
Sbjct: 59  ARHNIKHAKGETTYKLKMNQFGDMLHHEFVSTMNGLLR-----SNRTYFGSTWIEPESVS 113

Query: 131 VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD- 189
           +P S+DWREKGAVT +K+QG CGSCW+FS   A+EG      G+L+ LSEQ L+DCST  
Sbjct: 114 LPKSVDWREKGAVTPVKNQGHCGSCWSFSTTGALEGQLFRKTGELVSLSEQNLIDCSTSY 173

Query: 190 -NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGD 248
            N+GC GGLMD AF YI EN G+ TE  YPY  ++G C   KE + A   + + D+P G+
Sbjct: 174 GNNGCGGGLMDNAFTYIKENHGIDTEESYPYEGKQGKCRYHKEDS-AGRDTGFVDIPSGN 232

Query: 249 EQALLQAVSN-QPVSVCVDASGRAFHFYKSGVLN-ADC-GNNCDHGVAVVGFGTAEEENG 305
           E+AL +A++   PVSV +DAS  +F FY  GV N  DC  ++ DHGV  VG+GT ++  G
Sbjct: 233 ERALAKALATIGPVSVAIDASHESFQFYHEGVYNPPDCDSHSLDHGVLAVGYGTTDD--G 290

Query: 306 AKYWLIKNSWGETWGESGYIRILRDA-GLCGIATAASYPV 344
             Y++IKNSWGE WG+ GY+ + R++   CG+AT ASYP+
Sbjct: 291 QDYYIIKNSWGERWGQEGYVLMARNSKNECGVATQASYPL 330


>gi|6630972|gb|AAF19630.1|AF194426_1 cysteine proteinase precursor [Myxine glutinosa]
          Length = 324

 Score =  253 bits (647), Expect = 7e-65,   Method: Compositional matrix adjust.
 Identities = 139/311 (44%), Positives = 195/311 (62%), Gaps = 16/311 (5%)

Query: 43  EQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKAN---KEGNRTYKLGTNEFSDLTNEEF 99
           E W  ++G++Y    E+ +R  +++ NL+ +++ N    +G   Y+LG N ++DL NEEF
Sbjct: 20  ESWKGKYGKSYLGRGEEVLRKRVWESNLQIVQQHNVLADQGQANYRLGMNTYADLYNEEF 79

Query: 100 RALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFS 159
            AL    +  +     QSS   TFK      +P+S+DWR +G VT +KDQGQCGSCW+FS
Sbjct: 80  MALKG--SSGILQAKDQSS-TQTFKPLVGVTLPSSVDWRNQGYVTPVKDQGQCGSCWSFS 136

Query: 160 AVAAVEGITQITRGKLIELSEQQLVDCSTD--NHGCSGGLMDKAFEYIIENKGLATEADY 217
           A  ++EG      G L+ LSEQQLVDCS    N+GCSGGLM+ A++YI +  G+  E+ Y
Sbjct: 137 ATGSLEGQHFAKTGTLVSLSEQQLVDCSWSYGNYGCSGGLMESAYDYIRDAGGVQLESAY 196

Query: 218 PYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSN-QPVSVCVDASGRAFHFYK 276
           PY  + G C   + KAV AT + +  +P GDEQ+L+QAV    PV+V +DASG  F  Y+
Sbjct: 197 PYTAQNGRCHFDQSKAV-ATCTGHVAIPSGDEQSLMQAVGTVGPVAVAIDASGYDFQLYE 255

Query: 277 SGVLN-ADC-GNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD-AGL 333
           SGV + + C  ++ DHGV   G+GT   E G  YWL+KNSWG  WG  GYI++ R+ +  
Sbjct: 256 SGVYDRSRCSSSSLDHGVLAAGYGT---EGGNDYWLVKNSWGPGWGAQGYIKMSRNKSNQ 312

Query: 334 CGIATAASYPV 344
           CGIAT A YP+
Sbjct: 313 CGIATMACYPL 323


>gi|262410743|gb|ACY66807.1| cathepsin L [Aphis gossypii]
          Length = 341

 Score =  253 bits (647), Expect = 7e-65,   Method: Compositional matrix adjust.
 Identities = 144/342 (42%), Positives = 201/342 (58%), Gaps = 17/342 (4%)

Query: 15  VIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIE 74
           VI++ ++  A   VS  +++E  I E+   + AQ  + Y+D  E+A R  ++  N   I 
Sbjct: 4   VIVLGLVVFAISSVSSINLNE-VIEEEWSLFKAQFKKIYEDVKEEAFRKKVYLDNKLKIA 62

Query: 75  KANK---EGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPST---FKYQNV 128
           + NK    G  TY L  N F DL   E++ +  G+   +    +  +        K +NV
Sbjct: 63  RHNKLYETGEETYALEMNHFGDLMQHEYKKMMNGFKPSLAGGDKNFTDDDAVTFLKSENV 122

Query: 129 TDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST 188
             VP +IDWR+KG VT +K+QGQCGSCW+FSA  ++EG      G L+ LSEQ L+DCS 
Sbjct: 123 V-VPKAIDWRKKGYVTPVKNQGQCGSCWSFSATGSLEGQHFRKTGVLVSLSEQNLIDCSR 181

Query: 189 D--NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPK 246
              N+GC GGLMD AF+YI  NKGL TE  YPY  E+  C    E +  AT   + D+P+
Sbjct: 182 KYGNNGCEGGLMDLAFKYIKSNKGLDTEKSYPYEAEDDKCRYNPENS-GATDKGFVDIPE 240

Query: 247 GDEQALLQAVSN-QPVSVCVDASGRAFHFYKSGVL-NADCGNN-CDHGVAVVGFGTAEEE 303
           GDE AL+ A++   PVS+ +DAS   F FYK GV  N  C +   DHGV  VG+GT  + 
Sbjct: 241 GDEDALMHALATVGPVSIAIDASSEKFQFYKKGVFYNPRCSSTELDHGVLAVGYGT--DH 298

Query: 304 NGAKYWLIKNSWGETWGESGYIRILRD-AGLCGIATAASYPV 344
            G  YW++KNSWG+TWG+ GYI + R+    CG+A++ASYP+
Sbjct: 299 KGGDYWIVKNSWGKTWGDQGYIMMARNKKNNCGVASSASYPL 340


>gi|261289785|ref|XP_002611754.1| hypothetical protein BRAFLDRAFT_284341 [Branchiostoma floridae]
 gi|229297126|gb|EEN67764.1| hypothetical protein BRAFLDRAFT_284341 [Branchiostoma floridae]
          Length = 327

 Score =  253 bits (647), Expect = 7e-65,   Method: Compositional matrix adjust.
 Identities = 148/346 (42%), Positives = 200/346 (57%), Gaps = 37/346 (10%)

Query: 14  FVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYI 73
           F+I++L +T A+           ++  + E +   HG+ YK   E+ +R  IF+ N + I
Sbjct: 3   FLILVLSVTMAT-----------AMDVEWEAFKLTHGKQYKSPDEENVRRAIFRDNNQMI 51

Query: 74  EKANKE---GNRTYKLGTNEFSDLTNEEFRALYTG-----YNRPVPSVSRQSSRPSTFKY 125
           ++ N+E   G R+Y +G N+F DL + E+  L  G      N   PS +   S P     
Sbjct: 52  KEHNQEAAMGRRSYFMGMNQFGDLAHSEYLELVVGPGLLPLNLSTPSENVFESTPGL--- 108

Query: 126 QNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVD 185
                V  ++DWR+KGAVT IKDQG CGSCWAFS   ++EG   +  GKL+ LSEQ L+D
Sbjct: 109 ----QVDDTVDWRQKGAVTPIKDQGHCGSCWAFSTTGSLEGQHFMKTGKLVSLSEQNLLD 164

Query: 186 CST--DNHGCSGGLMDKAFEYIIENKGLATEADYPYR-HEEGTCDNQKEKAVAATISKYE 242
           CS    N GC GGLMD+AF YI  N G+ TE  YPY   +E  CD  K     AT+S Y 
Sbjct: 165 CSRRFGNKGCEGGLMDQAFRYIKSNGGIDTEECYPYMAKDEKVCD-YKTSCSGATLSSYT 223

Query: 243 DLPKGDEQALLQAVSN-QPVSVCVDASGRAFHFYKSGVLN-ADCG-NNCDHGVAVVGFGT 299
           D+   DE AL+QAV    PVSV +DAS ++  FYKSG+ +  +C     DHGV  VG+G+
Sbjct: 224 DIKAMDEMALMQAVGTVGPVSVAIDASHKSLRFYKSGIYDEPECSRTKLDHGVLAVGYGS 283

Query: 300 AEEENGAKYWLIKNSWGETWGESGYIRILRDA-GLCGIATAASYPV 344
            +   G  YWL+KNSWG  WG+ GY+++ R+    CGIAT ASYPV
Sbjct: 284 MD---GMDYWLVKNSWGSAWGDMGYVKMTRNKNNQCGIATKASYPV 326


>gi|74149661|dbj|BAE36450.1| unnamed protein product [Mus musculus]
          Length = 334

 Score =  253 bits (647), Expect = 7e-65,   Method: Compositional matrix adjust.
 Identities = 144/311 (46%), Positives = 189/311 (60%), Gaps = 19/311 (6%)

Query: 44  QWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKE---GNRTYKLGTNEFSDLTNEEFR 100
           QW + H R Y    E+  R  I+++N+  I+  N E   G   + +  N F D+TNEEFR
Sbjct: 31  QWKSTHRRLYGTN-EEEWRRAIWEKNMRMIQLHNGEYSNGQHGFSMEMNAFGDMTNEEFR 89

Query: 101 ALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSA 160
            +  GY        R    P   K      +P S+DWREKG VT +K+QGQCGSCWAFSA
Sbjct: 90  QVVNGYRHQKHKKGRLFQEPLMLK------IPKSVDWREKGCVTPVKNQGQCGSCWAFSA 143

Query: 161 VAAVEGITQITRGKLIELSEQQLVDCS--TDNHGCSGGLMDKAFEYIIENKGLATEADYP 218
              +EG   +  GKLI LSEQ LVDCS    N GC+GGLMD AF+YI EN GL +E  YP
Sbjct: 144 SGCLEGQMFLKTGKLISLSEQNLVDCSHAQGNQGCNGGLMDFAFQYIKENGGLDSEESYP 203

Query: 219 YRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSN-QPVSVCVDASGRAFHFYKS 277
           Y  ++G+C  + E AVA   + + D+P+  E+AL++AV+   P+SV +DAS  +  FY S
Sbjct: 204 YEAKDGSCKYRAEFAVANG-TGFVDIPQ-QEKALMKAVATVGPISVAMDASHPSLQFYSS 261

Query: 278 GV-LNADCGN-NCDHGVAVVGFG-TAEEENGAKYWLIKNSWGETWGESGYIRILRDA-GL 333
           G+    +C + N DHGV +VG+G    + N  KYWL+KNSWG  WG  GYI+I +D    
Sbjct: 262 GIYYEPNCSSKNLDHGVLLVGYGYEGTDSNKNKYWLVKNSWGSEWGMEGYIKIAKDRDNH 321

Query: 334 CGIATAASYPV 344
           CG+ATAASYPV
Sbjct: 322 CGLATAASYPV 332


>gi|254674508|dbj|BAH86062.1| cysteine protease [Haemaphysalis longicornis]
          Length = 333

 Score =  253 bits (647), Expect = 8e-65,   Method: Compositional matrix adjust.
 Identities = 141/340 (41%), Positives = 199/340 (58%), Gaps = 19/340 (5%)

Query: 16  IIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEK 75
           ++ L   C     +  +  +  +  + E + +QH + Y   +E+ +R  IF +N   + K
Sbjct: 1   MLRLAFLCGCVAAAIAASSQEILRTEWEAFKSQHNKAYSSHVEELLRFKIFTENTLLVAK 60

Query: 76  ANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTD-- 130
            N +   G  +YKL  N+F DL   EF  +  GY         +  RP+     N+ D  
Sbjct: 61  HNAKYAKGLVSYKLAMNKFGDLLPHEFAKMVNGYR----GKQNKEQRPTFIPPANLNDSS 116

Query: 131 VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD- 189
           +PT++DWR+KGAVT +K+QGQCGSCWAFS   ++EG      GKL+ LSEQ LVDCS D 
Sbjct: 117 LPTTVDWRKKGAVTPVKNQGQCGSCWAFSTTGSLEGQHFRKTGKLVSLSEQNLVDCSDDF 176

Query: 190 -NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGD 248
            N GC+GGLMD  F+YI  N G+ TE  +PY  ++G C  +K   V AT + + D+ +G 
Sbjct: 177 GNQGCNGGLMDNGFQYIKANGGIDTEESHPYTAQDGDCKFKKAD-VGATDAGFVDIQQGS 235

Query: 249 EQALLQAVSN-QPVSVCVDASGRAFHFYKSGVLN-ADCGNN-CDHGVAVVGFGTAEEENG 305
           E  L +AV+   PVSV +DAS  +F  Y  GV +  DC ++  DHGV  VG+G    +NG
Sbjct: 236 EDDLKKAVATVGPVSVAIDASHGSFQLYSQGVYDEPDCSSSQLDHGVLTVGYGV---KNG 292

Query: 306 AKYWLIKNSWGETWGESGYIRILRDA-GLCGIATAASYPV 344
            KYWL+KNSWG  WG++GYI + RD    CGIA++ASYP+
Sbjct: 293 KKYWLVKNSWGGDWGDNGYILMSRDKDNQCGIASSASYPL 332


>gi|4886998|gb|AAD32136.1|AF121837_1 cathepsin L [Mus musculus]
 gi|4887000|gb|AAD32137.1|AF121838_1 cathepsin L [Mus musculus]
 gi|4887002|gb|AAD32138.1|AF121839_1 cathepsin L [Mus musculus]
 gi|200501|gb|AAA39984.1| preprocathepsin L precursor [Mus musculus]
          Length = 334

 Score =  253 bits (647), Expect = 8e-65,   Method: Compositional matrix adjust.
 Identities = 144/311 (46%), Positives = 189/311 (60%), Gaps = 19/311 (6%)

Query: 44  QWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKE---GNRTYKLGTNEFSDLTNEEFR 100
           QW + H R Y    E+  R  I+++N+  I+  N E   G   + +  N F D+TNEEFR
Sbjct: 31  QWKSTHRRLYGTN-EEEWRRAIWEKNMRIIQLHNGEYSNGQHGFSMEMNAFGDMTNEEFR 89

Query: 101 ALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSA 160
            +  GY        R    P   K      +P S+DWREKG VT +K+QGQCGSCWAFSA
Sbjct: 90  QVVNGYRHQKHKKGRLFQEPLMLK------IPKSVDWREKGCVTPVKNQGQCGSCWAFSA 143

Query: 161 VAAVEGITQITRGKLIELSEQQLVDCS--TDNHGCSGGLMDKAFEYIIENKGLATEADYP 218
              +EG   +  GKLI LSEQ LVDCS    N GC+GGLMD AF+YI EN GL +E  YP
Sbjct: 144 SGCLEGQMFLKTGKLISLSEQNLVDCSHAQGNQGCNGGLMDFAFQYIKENGGLDSEESYP 203

Query: 219 YRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSN-QPVSVCVDASGRAFHFYKS 277
           Y  ++G+C  + E AV A  + + D+P+  E+AL++AV+   P+SV +DAS  +  FY S
Sbjct: 204 YEAKDGSCKYRAEFAV-ANDTGFVDIPQ-QEKALMKAVATVGPISVAMDASHPSLQFYSS 261

Query: 278 GV-LNADCGN-NCDHGVAVVGFG-TAEEENGAKYWLIKNSWGETWGESGYIRILRDA-GL 333
           G+    +C + N DHGV +VG+G    + N  KYWL+KNSWG  WG  GYI+I +D    
Sbjct: 262 GIYYEPNCSSKNLDHGVLLVGYGYEGTDSNKNKYWLVKNSWGSEWGMEGYIKIAKDRDNH 321

Query: 334 CGIATAASYPV 344
           CG+ATAASYPV
Sbjct: 322 CGLATAASYPV 332


>gi|156371477|ref|XP_001628790.1| predicted protein [Nematostella vectensis]
 gi|156215775|gb|EDO36727.1| predicted protein [Nematostella vectensis]
          Length = 330

 Score =  253 bits (646), Expect = 9e-65,   Method: Compositional matrix adjust.
 Identities = 147/343 (42%), Positives = 197/343 (57%), Gaps = 23/343 (6%)

Query: 11  IPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQ---HGRTYKDELEKAMRLNIFK 67
           + + V   L+   AS  V           E  +QW A    H + Y    E+  R  I++
Sbjct: 1   MKLLVAACLLFAVASGFV-------VKFDEDEQQWQAWKLFHTKKYTTVTEEGARKAIWR 53

Query: 68  QNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQN 127
            NL+ I+K N EG+ ++ L  N   DLT +EFR  YTG      + +++  + S F   +
Sbjct: 54  DNLKKIQKHNAEGH-SFTLAMNHLGDLTQDEFRYFYTGMRSHYSNYTKK--QGSAFLAPS 110

Query: 128 VTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCS 187
              VP ++DWR++G VT +K+QGQCGSCWAFS   ++EG      GKL+ LSEQ LVDCS
Sbjct: 111 HVQVPDTVDWRKEGYVTPVKNQGQCGSCWAFSTTGSLEGQNFKKTGKLVSLSEQNLVDCS 170

Query: 188 T--DNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLP 245
           T   N+GC GGLMD AF+YI EN G+ TE  YPY      C  QK   + A  + + D+ 
Sbjct: 171 TAYGNNGCQGGLMDYAFKYIKENGGIDTEESYPYEARNDRCRFQKSN-IGAVDTGFVDVT 229

Query: 246 KGDEQALLQAVSN-QPVSVCVDASGRAFHFYKSGVL-NADCGN-NCDHGVAVVGFGTAEE 302
            GDE+AL  A     P+SV +DA   +F FY SGV  NA C + + DHGV VVG+GT + 
Sbjct: 230 HGDEEALKTAAGTVGPISVAIDAGHMSFQFYHSGVYNNAGCSSTSLDHGVLVVGYGTYQ- 288

Query: 303 ENGAKYWLIKNSWGETWGESGYIRILRDA-GLCGIATAASYPV 344
             G+ YWL+KNSWGE WG  GYI + R+    CG+AT ASYP+
Sbjct: 289 --GSDYWLVKNSWGERWGMEGYIMMSRNKNNQCGVATQASYPL 329


>gi|74200292|dbj|BAE22939.1| unnamed protein product [Mus musculus]
          Length = 308

 Score =  253 bits (646), Expect = 9e-65,   Method: Compositional matrix adjust.
 Identities = 144/311 (46%), Positives = 189/311 (60%), Gaps = 19/311 (6%)

Query: 44  QWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKE---GNRTYKLGTNEFSDLTNEEFR 100
           QW + H R Y    E+  R  I+++N+  I+  N E   G   + +  N F D+TNEEFR
Sbjct: 5   QWKSTHRRLYGTN-EEEWRRAIWEKNMRMIQLHNGEYSNGQHGFSMEMNAFGDMTNEEFR 63

Query: 101 ALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSA 160
            +  GY        R    P   K      +P S+DWREKG VT +K+QGQCGSCWAFSA
Sbjct: 64  QVVNGYRHQKHKKGRLFQEPLMLK------IPKSVDWREKGCVTPVKNQGQCGSCWAFSA 117

Query: 161 VAAVEGITQITRGKLIELSEQQLVDCS--TDNHGCSGGLMDKAFEYIIENKGLATEADYP 218
              +EG   +  GKLI LSEQ LVDCS    N GC+GGLMD AF+YI EN GL +E  YP
Sbjct: 118 SGCLEGQMFLKTGKLISLSEQNLVDCSHAQGNQGCNGGLMDFAFQYIKENGGLDSEESYP 177

Query: 219 YRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSN-QPVSVCVDASGRAFHFYKS 277
           Y  ++G+C  + E AV A  + + D+P+  E+AL++AV+   P+SV +DAS  +  FY S
Sbjct: 178 YEAKDGSCKYRAEFAV-ANDTGFVDIPQ-QEKALMKAVATVGPISVAMDASHPSLQFYSS 235

Query: 278 GV-LNADCGN-NCDHGVAVVGFG-TAEEENGAKYWLIKNSWGETWGESGYIRILRDA-GL 333
           G+    +C + N DHGV +VG+G    + N  KYWL+KNSWG  WG  GYI+I +D    
Sbjct: 236 GIYYEPNCSSKNLDHGVLLVGYGYEGTDSNKNKYWLVKNSWGSEWGMEGYIKIAKDRDNH 295

Query: 334 CGIATAASYPV 344
           CG+ATAASYPV
Sbjct: 296 CGLATAASYPV 306


>gi|384941728|gb|AFI34469.1| cathepsin L2 preproprotein [Macaca mulatta]
          Length = 334

 Score =  253 bits (646), Expect = 9e-65,   Method: Compositional matrix adjust.
 Identities = 143/342 (41%), Positives = 200/342 (58%), Gaps = 19/342 (5%)

Query: 13  MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEY 72
           M + ++L   C   + S     + ++  K  QW A H R Y    E+  R  ++++N++ 
Sbjct: 1   MNLSLVLAAFCLG-IASAVPKFDQNLDTKWYQWKATHRRLYGAS-EEGWRRAVWEKNMKM 58

Query: 73  IEKANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
           IE  N E   G   + +  N F D+TNEEFR +   +       +++  +   F+     
Sbjct: 59  IELHNGEYSQGKHGFAMAMNAFGDMTNEEFRQVMGCFR------NQKLRKGKLFREPLFL 112

Query: 130 DVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCS-- 187
           D+P S+DWR+KG VT +K+Q QCGSCWAFSA  A+EG      GKL+ LSEQ LVDCS  
Sbjct: 113 DLPKSVDWRKKGYVTPVKNQKQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRP 172

Query: 188 TDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKG 247
             N GC+GG M+ AF Y+ EN GL +E  YPY   +G C  + E +V A  + +E +P G
Sbjct: 173 QGNQGCNGGFMNSAFRYVKENGGLDSEESYPYVAMDGICKYRSENSV-ANDTGFEVVPAG 231

Query: 248 DEQALLQAVSN-QPVSVCVDASGRAFHFYKSGV-LNADCGN-NCDHGVAVVGFG-TAEEE 303
            E+AL++AV+   P+SV +DA   +F FYKSG+    DC + N DHGV VVG+G      
Sbjct: 232 KEKALMKAVATVGPISVAMDAGHSSFQFYKSGIYFEPDCSSKNLDHGVLVVGYGFEGANS 291

Query: 304 NGAKYWLIKNSWGETWGESGYIRILRDA-GLCGIATAASYPV 344
           +  KYWL+KNSWG  WG +GY++I +D    CGIATAASYP 
Sbjct: 292 DNNKYWLVKNSWGPEWGSNGYVKIAKDKDNHCGIATAASYPT 333


>gi|402770509|gb|AFQ98389.1| cathepsin L [Rhipicephalus microplus]
          Length = 332

 Score =  253 bits (646), Expect = 9e-65,   Method: Compositional matrix adjust.
 Identities = 141/340 (41%), Positives = 205/340 (60%), Gaps = 20/340 (5%)

Query: 16  IIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEK 75
           ++ L + CA   V+  +  +  +  + E +   H +TY+  +E+ +R  IF ++   I +
Sbjct: 1   MLRLSVLCAIAAVTVAASSQEILRTQWEAFKTTHKKTYQSHMEELLRFKIFTESSLIIAR 60

Query: 76  ANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTD-- 130
            N +   G  +YKLG N+F DL   EF  ++ G++      +R++   +     NV D  
Sbjct: 61  HNAKYAKGLVSYKLGMNQFGDLLAHEFARIFNGHHG-----TRKTGGSTFLPPANVNDSS 115

Query: 131 VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD- 189
           +P ++DWR+KGAVT +KDQGQCGSCWAFSA  ++EG   +  G+L+ LSEQ LVDCS   
Sbjct: 116 LPKAVDWRKKGAVTPVKDQGQCGSCWAFSATGSLEGQHFLKNGELVSLSEQNLVDCSQSF 175

Query: 190 -NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGD 248
            N+GC GGLM+ AF+YI  N G+ TE  YPY   +G C  +KE  V AT + Y ++  G 
Sbjct: 176 GNNGCEGGLMEDAFKYIKANDGIDTEKSYPYEAVDGECRFKKED-VGATDTGYVEIKAGS 234

Query: 249 EQALLQAVSN-QPVSVCVDASGRAFHFYKSGVLN-ADCGN-NCDHGVAVVGFGTAEEENG 305
           E  L +AV+   P+SV +DAS  +F  Y  GV +  +C + + DHGV VVG+G    + G
Sbjct: 235 EDDLKKAVATVGPISVAIDASHSSFQLYSEGVYDEPECSSEDLDHGVLVVGYGV---KGG 291

Query: 306 AKYWLIKNSWGETWGESGYIRILRD-AGLCGIATAASYPV 344
            KYWL+KNSW E+WG+ GYI + RD    CGIA+ ASYP+
Sbjct: 292 KKYWLVKNSWAESWGDQGYILMSRDNNNQCGIASQASYPL 331


>gi|164420679|ref|NP_001037464.2| fibroinase precursor [Bombyx mori]
 gi|40556818|gb|AAR87763.1| fibroinase precursor [Bombyx mori]
          Length = 341

 Score =  253 bits (646), Expect = 9e-65,   Method: Compositional matrix adjust.
 Identities = 142/344 (41%), Positives = 204/344 (59%), Gaps = 24/344 (6%)

Query: 18  ILVITCASQVVSGRSMHEPSIVEKHEQWMA---QHGRTYKDELEKAMRLNIFKQNLEYIE 74
           ++++ CA   VS     +  +V+  E+W A   QH   Y+ E+E   R+ I+ ++   I 
Sbjct: 4   LVLLLCAVAAVSAVQFFD--LVK--EEWSAFKLQHRLNYESEVEDNFRMKIYAEHKHIIA 59

Query: 75  KANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPS-----VSRQSSRPSTFKYQ 126
           K N++   G  +YKLG N++ D+ + EF     G+N+         +   S R + F   
Sbjct: 60  KHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISP 119

Query: 127 NVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDC 186
               +P  +DWR+ GAVT IKDQG+CGSCW+FS   A+EG      G L+ LSEQ L+DC
Sbjct: 120 ANVKLPEQVDWRKHGAVTDIKDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDC 179

Query: 187 STD--NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDL 244
           S    N+GC+GGLMD AF+YI +N G+ TE  YPY   +  C    +   A  +  + D+
Sbjct: 180 SEQYGNNGCNGGLMDNAFKYIKDNGGIDTEQTYPYEGVDDKCRYNPKNTGAEDVG-FVDI 238

Query: 245 PKGDEQALLQAVSN-QPVSVCVDASGRAFHFYKSGVLNAD--CGNNCDHGVAVVGFGTAE 301
           P+GDEQ L++AV+   PVSV +DAS  +F  Y SGV N +     + DHGV VVG+GT  
Sbjct: 239 PEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEEECSSTDLDHGVLVVGYGT-- 296

Query: 302 EENGAKYWLIKNSWGETWGESGYIRILRDA-GLCGIATAASYPV 344
           +E G  YWL+KNSWG +WGE GYI+++R+    CGIA++ASYP+
Sbjct: 297 DEQGVDYWLVKNSWGRSWGELGYIKMIRNKNNRCGIASSASYPL 340


>gi|348546019|ref|XP_003460476.1| PREDICTED: cathepsin L-like [Oreochromis niloticus]
 gi|348546143|ref|XP_003460538.1| PREDICTED: cathepsin L-like [Oreochromis niloticus]
          Length = 334

 Score =  253 bits (646), Expect = 1e-64,   Method: Compositional matrix adjust.
 Identities = 143/315 (45%), Positives = 195/315 (61%), Gaps = 25/315 (7%)

Query: 45  WMAQHGRTYKDELEKAMRLNIFKQNLEYIEKAN---KEGNRTYKLGTNEFSDLTNEEFRA 101
           W  +  R+Y    E+A R  I+  N +++   N    +G ++Y+LG   F+D+ NEE++ 
Sbjct: 29  WKLKFERSYHSPSEEAHRRQIWLNNRKFVLVHNILADQGLKSYRLGMTYFADMENEEYKR 88

Query: 102 LYT-----GYNRPVPSVSRQSSRPSTF-KYQNVTDVPTSIDWREKGAVTHIKDQGQCGSC 155
           + +      +N  +P       R STF +    TD+P ++DWR+KG VT +KDQ QCGSC
Sbjct: 89  VISQGCLHSFNASLPR------RGSTFFRLPEGTDLPDAVDWRDKGYVTDVKDQKQCGSC 142

Query: 156 WAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD--NHGCSGGLMDKAFEYIIENKGLAT 213
           WAFSA  ++EG      G L+ LSEQQLVDCS D  N GC GGLMD AF+YI  N G+ T
Sbjct: 143 WAFSATGSLEGQHFRKTGTLVSLSEQQLVDCSGDYGNMGCMGGLMDYAFQYIQANGGIDT 202

Query: 214 EADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSN-QPVSVCVDASGRAF 272
           E  YPY  E G C    +  + AT + Y ++ +GDE AL +AV+   P+SV +DAS  +F
Sbjct: 203 EESYPYEAENGKCRYNPDN-IGATSTGYTEVSQGDEDALKEAVATIGPISVGIDASQMSF 261

Query: 273 HFYKSGVLN-ADCGN-NCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD 330
            FY+SGV N  DC +   DHGV  VG+GT   E+G  YWL+KNSWG  WG+ GYI++ R+
Sbjct: 262 QFYESGVYNEPDCSSLELDHGVLAVGYGT---EDGNDYWLVKNSWGLEWGDKGYIKMSRN 318

Query: 331 -AGLCGIATAASYPV 344
            +  CGIATAASYP+
Sbjct: 319 KSNQCGIATAASYPL 333


>gi|297845822|ref|XP_002890792.1| hypothetical protein ARALYDRAFT_473117 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297336634|gb|EFH67051.1| hypothetical protein ARALYDRAFT_473117 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 322

 Score =  253 bits (646), Expect = 1e-64,   Method: Compositional matrix adjust.
 Identities = 136/329 (41%), Positives = 196/329 (59%), Gaps = 37/329 (11%)

Query: 25  SQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTY 84
           SQ     +++E SIV+ H+QWM Q  R Y+DE EK MRL +FK+NL++IE  N  GN++Y
Sbjct: 21  SQARPHVTLNEQSIVDYHQQWMTQFSRVYQDESEKEMRLQVFKKNLKFIENFNNMGNQSY 80

Query: 85  KLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPT---SIDWREKG 141
            +G NEF+D T EEF A +TG    V ++S   +     +  N++D+     S DWR++G
Sbjct: 81  TVGVNEFTDWTIEEFLATHTGLRVNVTTLSELFNETMPSRNWNISDIDIDDESKDWRDEG 140

Query: 142 AVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDK 200
           AV  +K QG C             G+T+I+   L+ LSEQQL+DC T+ N GC GG +++
Sbjct: 141 AVIPVKVQGAC-------------GLTKISGKNLLTLSEQQLIDCDTEKNTGCDGGGIEE 187

Query: 201 AFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQP 260
           AF+YII+N G++ E +YPY+ ++G+C      A    I  +E +P  +E+ALL+AV  QP
Sbjct: 188 AFKYIIKNGGVSLETEYPYQVKKGSCRANARSATQTQIRGFEMVPSHNERALLEAVRRQP 247

Query: 261 VSVCVDASGRAFHFYKSGVL-NADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETW 319
           VSV +DA   +F  YK GV    DCG + +H V  VG+GT                 ++W
Sbjct: 248 VSVLIDARADSFKTYKGGVYAGLDCGTDVNHAVTFVGYGTMI---------------QSW 292

Query: 320 GESGYIRILRDA----GLCGIATAASYPV 344
           GE+GY+RI RD     G+CGIA  A+YP+
Sbjct: 293 GENGYMRIRRDVEWPQGMCGIAQVAAYPI 321


>gi|402898110|ref|XP_003912074.1| PREDICTED: cathepsin L2 [Papio anubis]
          Length = 334

 Score =  253 bits (646), Expect = 1e-64,   Method: Compositional matrix adjust.
 Identities = 143/342 (41%), Positives = 200/342 (58%), Gaps = 19/342 (5%)

Query: 13  MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEY 72
           M + ++L   C   + S     + ++  K  QW A H R Y    E+  R  ++++N++ 
Sbjct: 1   MNLSLVLAAFCLG-IASAVPKFDQNLDTKWYQWKATHRRLYGAS-EEGWRRAVWEKNMKM 58

Query: 73  IEKANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
           IE  N E   G   + +  N F D+TNEEFR +   +       +++  +   F+     
Sbjct: 59  IELHNGEYSQGKHGFTMAMNAFGDMTNEEFRQVMGCFR------NQKLRKGKLFREPLFL 112

Query: 130 DVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCS-- 187
           D+P S+DWR+KG VT +K+Q QCGSCWAFSA  A+EG      GKL+ LSEQ LVDCS  
Sbjct: 113 DLPKSVDWRKKGYVTPVKNQKQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRP 172

Query: 188 TDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKG 247
             N GC+GG M+ AF Y+ EN GL +E  YPY   +G C  + E +V A  + +E +P G
Sbjct: 173 QGNQGCNGGFMNSAFRYVKENGGLDSEESYPYVAMDGICKYRPENSV-ANDTGFEVVPAG 231

Query: 248 DEQALLQAVSN-QPVSVCVDASGRAFHFYKSGV-LNADCGN-NCDHGVAVVGFG-TAEEE 303
            E+AL++AV+   P+SV +DA   +F FYKSG+    DC + N DHGV VVG+G      
Sbjct: 232 KEKALMKAVATVGPISVAMDAGHSSFQFYKSGIYFEPDCSSKNLDHGVLVVGYGFEGANS 291

Query: 304 NGAKYWLIKNSWGETWGESGYIRILRDA-GLCGIATAASYPV 344
           +  KYWL+KNSWG  WG +GY++I +D    CGIATAASYP 
Sbjct: 292 DNNKYWLVKNSWGPEWGSNGYVKIAKDKDNHCGIATAASYPT 333


>gi|313118772|gb|ADR32298.1| C14 cysteine protease [Solanum demissum]
          Length = 217

 Score =  253 bits (646), Expect = 1e-64,   Method: Compositional matrix adjust.
 Identities = 117/218 (53%), Positives = 148/218 (67%), Gaps = 8/218 (3%)

Query: 132 PTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD-N 190
           P S+DWR+KG +  +KDQG CGSCWAFSAVAA+E I  I  G LI LSEQ+LVDC    N
Sbjct: 2   PVSVDWRDKGVLVGVKDQGSCGSCWAFSAVAAMESINAIVTGDLISLSEQELVDCDKSYN 61

Query: 191 HGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQ 250
            GC GGLMD AFE++I N G+ TE DYPY+     CD  ++ A    I  YED+P  +E+
Sbjct: 62  QGCDGGLMDYAFEFVINNGGIDTEEDYPYKERNDVCDQYRKNAKVVKIDSYEDVPVNNEK 121

Query: 251 ALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWL 310
           AL +AV++QPVS+ ++A GR F  YKSG+    CG   DHGV   G+GT   ENG  YW+
Sbjct: 122 ALQKAVAHQPVSIALEAGGRDFQHYKSGIFTGKCGTAVDHGVVAAGYGT---ENGMDYWI 178

Query: 311 IKNSWGETWGESGYIRILRD----AGLCGIATAASYPV 344
           ++NSWG  WGE GY+R+ R+    +GLCG+AT  SYPV
Sbjct: 179 VRNSWGAKWGEKGYLRVQRNIASSSGLCGLATEPSYPV 216


>gi|118429523|gb|ABK91809.1| cathepsin L-like proteinase precursor [Clonorchis sinensis]
          Length = 373

 Score =  253 bits (646), Expect = 1e-64,   Method: Compositional matrix adjust.
 Identities = 151/344 (43%), Positives = 200/344 (58%), Gaps = 33/344 (9%)

Query: 24  ASQVVSGRSMHEPSIV---------EKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIE 74
           AS + S  SMH   ++            + +M  + R Y D  E   R  IF  N   I 
Sbjct: 39  ASPLTSLDSMHMQDVIGVDWNFTLSSIWKHFMTTYKRNYIDPSEHERRFKIFANNFVRIS 98

Query: 75  KANK---EGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDV 131
           K N    +G  +Y +G NEFSD T+EE + L     R   + SR  S+  T         
Sbjct: 99  KHNVRFIQGQVSYTMGINEFSDKTDEELKRLRCF--RGSLNASRDGSKYITI----AAPP 152

Query: 132 PTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD-- 189
           P+ IDWR KGAVT +K+QG CGSCWAFSA  A+EG   +  G L+ LSEQQLVDCS++  
Sbjct: 153 PSEIDWRNKGAVTPVKNQGNCGSCWAFSATGAIEGQNFLATGNLVSLSEQQLVDCSSEYG 212

Query: 190 NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEG-----TCDNQKEKAVAATISKYEDL 244
           N+ C+GGLMD AF+Y+ ++ G+ TEA YPY   E      TC    ++AV   ++ Y DL
Sbjct: 213 NNACNGGLMDNAFKYVKDSNGIDTEASYPYVSGETGDANPTCRFNLKEAV-VRVTGYIDL 271

Query: 245 PKGDEQALLQAVSNQ-PVSVCVDASGRAFHFYKSGVLNAD--CGNNCDHGVAVVGFGTAE 301
           P+G    L QAV +  P+SV ++A   +F  YKSGV + D    ++ DHGV +VG+G   
Sbjct: 272 PRGQVSELKQAVGHYGPISVAINAGLPSFMSYKSGVYSDDQCSSDDLDHGVLLVGYG--- 328

Query: 302 EENGAKYWLIKNSWGETWGESGYIRILRD-AGLCGIATAASYPV 344
           EENG  YWLIKNSWG  WGE+GY++ILRD   LCG+A+ ASYP+
Sbjct: 329 EENGIPYWLIKNSWGPHWGENGYVKILRDHNNLCGVASMASYPL 372


>gi|23306947|dbj|BAC16538.1| cathepsin L [Engraulis japonicus]
          Length = 336

 Score =  253 bits (645), Expect = 1e-64,   Method: Compositional matrix adjust.
 Identities = 146/342 (42%), Positives = 202/342 (59%), Gaps = 17/342 (4%)

Query: 13  MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEY 72
           M+V  +  + C S V++  S  +  + +    W   H + Y  E E+  R  ++++NL  
Sbjct: 1   MYVAAVFTL-CLSAVLAAPSF-DRELDDHWNHWKNFHTKKYH-EKEEGWRRVVWEKNLRK 57

Query: 73  IEKANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
           IE  N E   G  +Y+LG N F D+T+EEFR +  GY       + +  + S F   N  
Sbjct: 58  IEMHNLEHSMGAHSYRLGMNHFGDMTHEEFRQVMNGYKHK----AERRVKGSLFMEPNFI 113

Query: 130 DVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST- 188
           + P  ID+R+ G  T +KDQGQCGSCWAFS   A+EG      GKL+ LSEQ LVDCS  
Sbjct: 114 EAPKKIDYRDLGYATPVKDQGQCGSCWAFSTTGAMEGQLFREGGKLVSLSEQNLVDCSRP 173

Query: 189 -DNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKG 247
             N GC+GGLMD+AF+YI +N GL TE  YPY   +    +   K  AA  + + D+P+G
Sbjct: 174 EGNEGCNGGLMDQAFQYIKDNGGLDTEDAYPYLGTDDQDCHYDPKYSAANDTGFVDIPEG 233

Query: 248 DEQALLQAVSN-QPVSVCVDASGRAFHFYKSGV-LNADCGNN-CDHGVAVVGFG-TAEEE 303
            E+AL++AV+   PVSV +DA   +F FY SG+    +C +   DHGV VVG+G   E+ 
Sbjct: 234 KERALMKAVAAVGPVSVAIDAGHESFQFYHSGIYFEKECSSTELDHGVLVVGYGFEGEDV 293

Query: 304 NGAKYWLIKNSWGETWGESGYIRILRD-AGLCGIATAASYPV 344
           +G KYW++KNSW E WG+ GYI + +D    CGIATAASYP+
Sbjct: 294 DGKKYWIVKNSWSEKWGDEGYIYMAKDRKNHCGIATAASYPL 335


>gi|34850847|dbj|BAC87861.1| cathepsin L [Engraulis japonicus]
          Length = 336

 Score =  253 bits (645), Expect = 1e-64,   Method: Compositional matrix adjust.
 Identities = 146/342 (42%), Positives = 202/342 (59%), Gaps = 17/342 (4%)

Query: 13  MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEY 72
           M+V  +  + C S V++  S  +  + +    W + H + Y  E E+  R  ++++NL  
Sbjct: 1   MYVAAVFTL-CLSAVLAAPSF-DRELDDHWNHWKSFHTKKYH-EKEEGWRRVVWEKNLRK 57

Query: 73  IEKANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
           IE  N E   G  +Y+LG N F D+T+EEFR +  GY       + +  + S F   N  
Sbjct: 58  IEMHNLEHSMGAHSYRLGMNHFGDMTHEEFRQVMNGYKHK----AERRVKGSLFMEPNFI 113

Query: 130 DVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD 189
           + P  ID+R+ G  T +KDQGQCGSCWAFS   A+EG      GKL+ LSEQ LVDCS  
Sbjct: 114 EAPKKIDYRDLGYATPVKDQGQCGSCWAFSTTGAMEGQLFREGGKLVSLSEQNLVDCSRP 173

Query: 190 --NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKG 247
             N GC+GGLMD+AF+YI +N GL TE  YPY   +    +   K  AA  + + D+P+G
Sbjct: 174 EGNEGCNGGLMDQAFQYIKDNGGLDTEDAYPYLGTDDQDCHYDPKYSAANDTGFVDIPEG 233

Query: 248 DEQALLQAVSN-QPVSVCVDASGRAFHFYKSGV-LNADCGNN-CDHGVAVVGFG-TAEEE 303
            E+AL++AV+   PVSV +DA    F FY SG+    +C +   DHGV VVG+G   E+ 
Sbjct: 234 KERALMKAVAAVGPVSVAIDAGHECFQFYHSGIYFEKECSSTELDHGVLVVGYGFEGEDV 293

Query: 304 NGAKYWLIKNSWGETWGESGYIRILRD-AGLCGIATAASYPV 344
           +G KYW++KNSW E WG+ GYI + +D    CGIATAASYP+
Sbjct: 294 DGKKYWIVKNSWSEKWGDEGYIYMAKDRKNHCGIATAASYPL 335


>gi|355567966|gb|EHH24307.1| Cathepsin L2 [Macaca mulatta]
 gi|355753494|gb|EHH57540.1| Cathepsin L2 [Macaca fascicularis]
 gi|380790509|gb|AFE67130.1| cathepsin L2 preproprotein [Macaca mulatta]
          Length = 334

 Score =  253 bits (645), Expect = 1e-64,   Method: Compositional matrix adjust.
 Identities = 143/342 (41%), Positives = 200/342 (58%), Gaps = 19/342 (5%)

Query: 13  MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEY 72
           M + ++L   C   + S     + ++  K  QW A H R Y    E+  R  ++++N++ 
Sbjct: 1   MNLSLVLAAFCLG-IASAVPKFDQNLDTKWYQWKATHRRLYGAS-EEGWRRAVWEKNMKM 58

Query: 73  IEKANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
           IE  N E   G   + +  N F D+TNEEFR +   +       +++  +   F+     
Sbjct: 59  IELHNGEYSQGKHGFAMAMNAFGDMTNEEFRQVMGCFR------NQKLRKGKLFREPLFL 112

Query: 130 DVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCS-- 187
           D+P S+DWR+KG VT +K+Q QCGSCWAFSA  A+EG      GKL+ LSEQ LVDCS  
Sbjct: 113 DLPKSVDWRKKGYVTPVKNQKQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSHP 172

Query: 188 TDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKG 247
             N GC+GG M+ AF Y+ EN GL +E  YPY   +G C  + E +V A  + +E +P G
Sbjct: 173 QGNQGCNGGFMNSAFRYVKENGGLDSEESYPYVAMDGICKYRPENSV-ANDTGFEVVPAG 231

Query: 248 DEQALLQAVSN-QPVSVCVDASGRAFHFYKSGV-LNADCGN-NCDHGVAVVGFG-TAEEE 303
            E+AL++AV+   P+SV +DA   +F FYKSG+    DC + N DHGV VVG+G      
Sbjct: 232 KEKALMKAVATVGPISVAMDAGHSSFQFYKSGIYFEPDCSSKNLDHGVLVVGYGFEGANS 291

Query: 304 NGAKYWLIKNSWGETWGESGYIRILRDA-GLCGIATAASYPV 344
           +  KYWL+KNSWG  WG +GY++I +D    CGIATAASYP 
Sbjct: 292 DNNKYWLVKNSWGPEWGSNGYVKIAKDKDNHCGIATAASYPT 333


>gi|4503155|ref|NP_001903.1| cathepsin L1 isoform 1 preproprotein [Homo sapiens]
 gi|22202619|ref|NP_666023.1| cathepsin L1 isoform 1 preproprotein [Homo sapiens]
 gi|384081592|ref|NP_001244900.1| cathepsin L1 isoform 1 preproprotein [Homo sapiens]
 gi|384081594|ref|NP_001244901.1| cathepsin L1 isoform 1 preproprotein [Homo sapiens]
 gi|332832229|ref|XP_003312197.1| PREDICTED: cathepsin L1 isoform 2 [Pan troglodytes]
 gi|332832233|ref|XP_001137800.2| PREDICTED: cathepsin L1 isoform 1 [Pan troglodytes]
 gi|397470218|ref|XP_003806728.1| PREDICTED: cathepsin L1 isoform 1 [Pan paniscus]
 gi|397470220|ref|XP_003806729.1| PREDICTED: cathepsin L1 isoform 2 [Pan paniscus]
 gi|397470222|ref|XP_003806730.1| PREDICTED: cathepsin L1 isoform 3 [Pan paniscus]
 gi|410042824|ref|XP_003951515.1| PREDICTED: cathepsin L1 [Pan troglodytes]
 gi|115741|sp|P07711.2|CATL1_HUMAN RecName: Full=Cathepsin L1; AltName: Full=Major excreted protein;
           Short=MEP; Contains: RecName: Full=Cathepsin L1 heavy
           chain; Contains: RecName: Full=Cathepsin L1 light chain;
           Flags: Precursor
 gi|29715|emb|CAA30981.1| pro-(cathepsin L) [Homo sapiens]
 gi|190418|gb|AAA66974.1| preprocathepsin L precursor [Homo sapiens]
 gi|31873292|emb|CAD97637.1| hypothetical protein [Homo sapiens]
 gi|48146223|emb|CAG33334.1| CTSL [Homo sapiens]
 gi|119583135|gb|EAW62731.1| cathepsin L, isoform CRA_a [Homo sapiens]
 gi|119583136|gb|EAW62732.1| cathepsin L, isoform CRA_a [Homo sapiens]
 gi|119583137|gb|EAW62733.1| cathepsin L, isoform CRA_a [Homo sapiens]
 gi|119583138|gb|EAW62734.1| cathepsin L, isoform CRA_a [Homo sapiens]
 gi|119583140|gb|EAW62736.1| cathepsin L, isoform CRA_a [Homo sapiens]
 gi|208965934|dbj|BAG72981.1| cathepsin L1 [synthetic construct]
 gi|410303006|gb|JAA30103.1| cathepsin L1 [Pan troglodytes]
 gi|410303008|gb|JAA30104.1| cathepsin L1 [Pan troglodytes]
 gi|410303010|gb|JAA30105.1| cathepsin L1 [Pan troglodytes]
          Length = 333

 Score =  253 bits (645), Expect = 1e-64,   Method: Compositional matrix adjust.
 Identities = 143/342 (41%), Positives = 204/342 (59%), Gaps = 20/342 (5%)

Query: 13  MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEY 72
           M   +IL   C   + S     + S+  +  +W A H R Y    E+  R  ++++N++ 
Sbjct: 1   MNPTLILAAFCLG-IASATLTFDHSLEAQWTKWKAMHNRLYGMN-EEGWRRAVWEKNMKM 58

Query: 73  IEKAN---KEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
           IE  N   +EG  ++ +  N F D+T+EEFR +  G+       +R+  +   F+     
Sbjct: 59  IELHNQEYREGKHSFTMAMNAFGDMTSEEFRQVMNGFQ------NRKPRKGKVFQEPLFY 112

Query: 130 DVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCS-- 187
           + P S+DWREKG VT +K+QGQCGSCWAFSA  A+EG      G+LI LSEQ LVDCS  
Sbjct: 113 EAPRSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGRLISLSEQNLVDCSGP 172

Query: 188 TDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKG 247
             N GC+GGLMD AF+Y+ +N GL +E  YPY   E +C    + +V A  + + D+PK 
Sbjct: 173 QGNEGCNGGLMDYAFQYVQDNGGLDSEESYPYEATEESCKYNPKYSV-ANDTGFVDIPK- 230

Query: 248 DEQALLQAVSN-QPVSVCVDASGRAFHFYKSGV-LNADCGN-NCDHGVAVVGFG-TAEEE 303
            E+AL++AV+   P+SV +DA   +F FYK G+    DC + + DHGV VVG+G  + E 
Sbjct: 231 QEKALMKAVATVGPISVAIDAGHESFLFYKEGIYFEPDCSSEDMDHGVLVVGYGFESTES 290

Query: 304 NGAKYWLIKNSWGETWGESGYIRILRD-AGLCGIATAASYPV 344
           +  KYWL+KNSWGE WG  GY+++ +D    CGIA+AASYP 
Sbjct: 291 DNNKYWLVKNSWGEEWGMGGYVKMAKDRRNHCGIASAASYPT 332


>gi|290462225|gb|ADD24160.1| Cathepsin L [Lepeophtheirus salmonis]
          Length = 334

 Score =  253 bits (645), Expect = 1e-64,   Method: Compositional matrix adjust.
 Identities = 142/339 (41%), Positives = 199/339 (58%), Gaps = 20/339 (5%)

Query: 15  VIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIE 74
           ++++ VI   +  VS   +    ++   E W   H + Y   +E+ +RL IF +N   I 
Sbjct: 6   ILLLSVIISTASAVSFFDV----VLSDWESWKLTHQKGYDSSVEEKLRLKIFMENSLRIS 61

Query: 75  KANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDV 131
           + N E   G  TY +  N + DL + EF A+  GY       + +++   TF      ++
Sbjct: 62  RHNAEAIQGRHTYFMKMNHYGDLLHHEFVAMVNGY-----IYNNKTTLGGTFIPSKNINL 116

Query: 132 PTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD-- 189
           P  +DWRE+GAVT +K+QGQCGSCW+FSA  ++EG      GKLI LSEQ LVDCS    
Sbjct: 117 PEHVDWREEGAVTPVKNQGQCGSCWSFSATGSLEGQDFRKTGKLISLSEQNLVDCSRKYG 176

Query: 190 NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDE 249
           N+GC GGLMD AF+YI +N G+ TEA YPY   +G C    +    + I  + D+ KG E
Sbjct: 177 NNGCEGGLMDYAFKYIQDNNGIDTEASYPYEGIDGHCHYDPKNKGGSDIG-FVDIKKGSE 235

Query: 250 QALLQAVSN-QPVSVCVDASGRAFHFYKSGVLN-ADCG-NNCDHGVAVVGFGTAEEENGA 306
           + L +A++   P+SV +DAS  +F FY  GV +   C   N DHGV  VG+GT +E  G 
Sbjct: 236 KDLQKALATVGPISVAIDASHMSFQFYSHGVYSEKKCSPENLDHGVLAVGYGT-DEVTGE 294

Query: 307 KYWLIKNSWGETWGESGYIRILRDA-GLCGIATAASYPV 344
            YWL+KNSW E WGE GYI++ R+   +CGIA++ASYPV
Sbjct: 295 DYWLVKNSWSEKWGEDGYIKMARNKDNMCGIASSASYPV 333


>gi|157779038|gb|ABV71063.1| cathepsin L3 precursor [Schistosoma mansoni]
 gi|360044915|emb|CCD82463.1| subfamily C1A unassigned peptidase (C01 family) [Schistosoma
           mansoni]
          Length = 370

 Score =  253 bits (645), Expect = 1e-64,   Method: Compositional matrix adjust.
 Identities = 141/311 (45%), Positives = 195/311 (62%), Gaps = 21/311 (6%)

Query: 48  QHGRTYKDELEKAMRLNIFKQNLEYIEKAN---KEGNRTYKLGTNEFSDLTNEEFRALYT 104
           Q  R Y    E+  R  IF  N   + + N   +EG  TYK+G NEF+D T+ E + L  
Sbjct: 66  QFKRAYNGIHEETRRFFIFSANFVKMMEHNHAFQEGKVTYKMGVNEFTDKTDYELKKL-R 124

Query: 105 GYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAV 164
           GY     ++  + S   TF     T +P+ +DWR +GAVT +K+QGQCGSCWAFS   A+
Sbjct: 125 GYKVTSGAIRHKGS---TFIRSEHTKLPSKVDWRREGAVTDVKNQGQCGSCWAFSTTGAI 181

Query: 165 EGITQITRGKLIELSEQQLVDCSTD--NHGCSGGLMDKAFEYIIENKGLATEADYPYRHE 222
           EG       +L+ LSEQQLVDCS    N+GCSGGLM+ AFEY+ +N+G+ +E  YPY   
Sbjct: 182 EGQHYRKTNRLVNLSEQQLVDCSKSYGNNGCSGGLMNSAFEYVRDNEGIDSEISYPYVSG 241

Query: 223 EGTCDNQ---KEKAVAATISKYEDLPKGDEQALLQAVSNQ-PVSVCVDASGRAFHFYKSG 278
           +GT +N+       + A ++ Y ++ +GDE+AL+ AV+ + PVSV ++A   +F  YKSG
Sbjct: 242 DGTENNRCLFNASNILAQVTGYVNIHEGDERALMDAVATKGPVSVAINAGLPSFSMYKSG 301

Query: 279 VL-NADCG---NNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRDA-GL 333
           +  + DC    +  DHGV VVG+G   EENG  YWLIKNSWGE WGE GYI+I + +  +
Sbjct: 302 IYSDTDCEGTLDALDHGVLVVGYG---EENGRSYWLIKNSWGEEWGEKGYIKISKGSHNM 358

Query: 334 CGIATAASYPV 344
           CG+A+AASYP+
Sbjct: 359 CGVASAASYPL 369


>gi|395740610|ref|XP_002819972.2| PREDICTED: cathepsin L1 [Pongo abelii]
          Length = 333

 Score =  253 bits (645), Expect = 1e-64,   Method: Compositional matrix adjust.
 Identities = 143/342 (41%), Positives = 203/342 (59%), Gaps = 20/342 (5%)

Query: 13  MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEY 72
           M   + L   C   + S     + S+  +  +W A H R Y    E+  R  ++++N++ 
Sbjct: 1   MNPTLFLAAFCLG-IASATLTFDHSLEARWTKWKAMHNRLYGMN-EEGWRRAVWEKNMKM 58

Query: 73  IEKAN---KEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
           IE  N   +EG  ++ +  N F D+T+EEFR +  G+       +R+  +   F+     
Sbjct: 59  IELHNQEYREGKHSFTMAMNAFGDMTSEEFRQVMNGFQ------NRKPRKGKVFQEPLFY 112

Query: 130 DVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCS-- 187
           + P S+DWREKG VT +K+QGQCGSCWAFSA  A+EG      GKLI LSEQ LVDCS  
Sbjct: 113 EAPRSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLISLSEQNLVDCSGP 172

Query: 188 TDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKG 247
             N GC+GGLMD AF+Y+ +N GL +E  YPY   E +C    + +V A  + + D+PK 
Sbjct: 173 QGNEGCNGGLMDYAFQYVQDNGGLDSEESYPYEATEESCKYNPKYSV-ANDTGFVDIPK- 230

Query: 248 DEQALLQAVSN-QPVSVCVDASGRAFHFYKSGV-LNADCGN-NCDHGVAVVGFG-TAEEE 303
            E+AL++AV+   P+SV +DA   +F FYK G+    DC + + DHGV VVG+G  + E 
Sbjct: 231 QEKALMKAVATVGPISVAIDAGHESFLFYKEGIYFEPDCSSEDMDHGVLVVGYGFESTES 290

Query: 304 NGAKYWLIKNSWGETWGESGYIRILRD-AGLCGIATAASYPV 344
           +  KYWL+KNSWGE WG  GY+++ +D    CGIA+AASYP 
Sbjct: 291 DNNKYWLVKNSWGEEWGMGGYVKMAKDRRNHCGIASAASYPT 332


>gi|530734|emb|CAA56914.1| cathepsin l [Nephrops norvegicus]
 gi|1582620|prf||2119193A cathepsin L-related Cys protease
          Length = 324

 Score =  253 bits (645), Expect = 1e-64,   Method: Compositional matrix adjust.
 Identities = 139/317 (43%), Positives = 192/317 (60%), Gaps = 29/317 (9%)

Query: 43  EQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKE---GNRTYKLGTNEFSDLTNEEF 99
           E++  + GR Y D  E+  RLN+F  NL+YIE+ NK+   G  TY L  N+FSDLTN+EF
Sbjct: 21  EEFKGKFGRKYVDLEEERYRLNVFLDNLQYIEEFNKKYESGEVTYNLAINQFSDLTNDEF 80

Query: 100 RALYTGYN-----RPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGS 154
            ++  GY      +PV   +   + P T          T +DWR KG VTH+KDQGQCGS
Sbjct: 81  NSMMKGYKTSLRPKPVAVFTSTDAAPET----------TEVDWRTKGCVTHVKDQGQCGS 130

Query: 155 CWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD---NHGCSGGLMDKAFEYIIENKGL 211
           CWAFSA  ++EG   +  G+L+ L+EQQLVDC+     N GC+GG +++AF+YI  N G+
Sbjct: 131 CWAFSATGSLEGQHFLKYGELVSLAEQQLVDCAGGIYYNQGCNGGWVNQAFKYIKANGGI 190

Query: 212 ATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQ-PVSVCVDASGR 270
            TE+ YPY   + TC      +VAAT S +  + +G E   ++  +N  P+SV +DA+ R
Sbjct: 191 DTESSYPYEARDNTC-RFNSNSVAATCSGFVSIAQGSESPEVRRTTNTGPISVAIDAAHR 249

Query: 271 AFHFYKSGV-LNADCGNN-CDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRIL 328
           +F  Y SGV     C ++  DH V  VG+G+   E G  +WL+KNSWG +WG +GYI + 
Sbjct: 250 SFQSYSSGVYYEPSCSSSQLDHAVLAVGYGS---EGGQDFWLVKNSWGTSWGSAGYINMA 306

Query: 329 RDA-GLCGIATAASYPV 344
           R+    CGIAT ASYP 
Sbjct: 307 RNRNNNCGIATDASYPT 323


>gi|115743|sp|P07154.2|CATL1_RAT RecName: Full=Cathepsin L1; AltName: Full=Cyclic protein 2;
           Short=CP-2; AltName: Full=Major excreted protein;
           Short=MEP; Contains: RecName: Full=Procathepsin L;
           Contains: RecName: Full=Cathepsin L1 heavy chain;
           Contains: RecName: Full=Cathepsin L1 light chain; Flags:
           Precursor
 gi|38648869|gb|AAH63175.1| Cathepsin L1 [Rattus norvegicus]
 gi|149029152|gb|EDL84437.1| cathepsin L, isoform CRA_a [Rattus norvegicus]
 gi|386267881|dbj|BAM14518.1| cathepsin L [Rattus norvegicus]
          Length = 334

 Score =  253 bits (645), Expect = 1e-64,   Method: Compositional matrix adjust.
 Identities = 141/311 (45%), Positives = 191/311 (61%), Gaps = 19/311 (6%)

Query: 44  QWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKE---GNRTYKLGTNEFSDLTNEEFR 100
           QW + H R Y    E+  R  ++++N+  I+  N E   G   + +  N F D+TNEEFR
Sbjct: 31  QWKSTHRRLYGTN-EEEWRRAVWEKNMRMIQLHNGEYSNGKHGFTMEMNAFGDMTNEEFR 89

Query: 101 ALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSA 160
            +  GY        R    P   +      +P ++DWREKG VT +K+QGQCGSCWAFSA
Sbjct: 90  QIVNGYRHQKHKKGRLFQEPLMLQ------IPKTVDWREKGCVTPVKNQGQCGSCWAFSA 143

Query: 161 VAAVEGITQITRGKLIELSEQQLVDCSTD--NHGCSGGLMDKAFEYIIENKGLATEADYP 218
              +EG   +  GKLI LSEQ LVDCS D  N GC+GGLMD AF+YI EN GL +E  YP
Sbjct: 144 SGCLEGQMFLKTGKLISLSEQNLVDCSHDQGNQGCNGGLMDFAFQYIKENGGLDSEESYP 203

Query: 219 YRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSN-QPVSVCVDASGRAFHFYKS 277
           Y  ++G+C  + E AVA   + + D+P+  E+AL++AV+   P+SV +DAS  +  FY S
Sbjct: 204 YEAKDGSCKYRAEYAVAND-TGFVDIPQ-QEKALMKAVATVGPISVAMDASHPSLQFYSS 261

Query: 278 GV-LNADCGN-NCDHGVAVVGFG-TAEEENGAKYWLIKNSWGETWGESGYIRILRDA-GL 333
           G+    +C + + DHGV VVG+G    + N  KYWL+KNSWG+ WG  GYI+I +D    
Sbjct: 262 GIYYEPNCSSKDLDHGVLVVGYGYEGTDSNKDKYWLVKNSWGKEWGMDGYIKIAKDRNNH 321

Query: 334 CGIATAASYPV 344
           CG+ATAASYP+
Sbjct: 322 CGLATAASYPI 332


>gi|302763127|ref|XP_002964985.1| hypothetical protein SELMODRAFT_406652 [Selaginella moellendorffii]
 gi|300167218|gb|EFJ33823.1| hypothetical protein SELMODRAFT_406652 [Selaginella moellendorffii]
          Length = 320

 Score =  253 bits (645), Expect = 1e-64,   Method: Compositional matrix adjust.
 Identities = 136/329 (41%), Positives = 191/329 (58%), Gaps = 31/329 (9%)

Query: 8   SFIIPMFVIIILVITCASQVVSGRSMHEPS----IVEKHEQWMAQHGRTYKDELEKAMRL 63
           S +I + +I+++V+  A   ++  +  E      I    E W A+HG++Y  + EKA R+
Sbjct: 3   SNMIALILILLVVVGAAPFAIARPAALEDDRALEIKNMFEDWAAKHGKSYSSDWEKARRM 62

Query: 64  NIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTF 123
            IF   L YIEK N   N T+ LG N+FSDLTN EFRA Y G  +P      Q  RP+  
Sbjct: 63  TIFSDTLAYIEKHNALPNTTFTLGLNKFSDLTNAEFRANYVGKFKPP---RYQDRRPAKD 119

Query: 124 KYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQL 183
              +V+ +PTS+DWR++GAVT IKDQGQCGSCWAFSA+A++E    +   +L+ LSEQQL
Sbjct: 120 VDVDVSSLPTSLDWRQEGAVTPIKDQGQCGSCWAFSAIASIESAHFLATNQLVSLSEQQL 179

Query: 184 VDCSTDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYED 243
           +DC T + GC                    E  YPY    G+C+  K K   A I+ +  
Sbjct: 180 IDCDTVDEGCQ-------------------EEAYPYTGLAGSCNANKNK--VAEITGFNV 218

Query: 244 LPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEE 303
           + K    AL++AVS  PV+V +  S + F  Y+SG+L+  C N+ DH V V+G+GT   E
Sbjct: 219 VTKDKADALMKAVSKTPVTVGICGSDQNFQNYRSGILSGQCCNSRDHVVLVIGYGT---E 275

Query: 304 NGAKYWLIKNSWGETWGESGYIRILRDAG 332
            G  YW+IKNSWG +WGE G+++I +  G
Sbjct: 276 GGMPYWIIKNSWGTSWGEDGFMKIEKKDG 304


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.316    0.132    0.398 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 5,522,330,294
Number of Sequences: 23463169
Number of extensions: 233223774
Number of successful extensions: 619409
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 6734
Number of HSP's successfully gapped in prelim test: 905
Number of HSP's that attempted gapping in prelim test: 586823
Number of HSP's gapped (non-prelim): 10033
length of query: 346
length of database: 8,064,228,071
effective HSP length: 143
effective length of query: 203
effective length of database: 9,003,962,200
effective search space: 1827804326600
effective search space used: 1827804326600
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 77 (34.3 bits)